BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= psy3212
(177 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|326433118|gb|EGD78688.1| cathepsin D [Salpingoeca sp. ATCC 50818]
Length = 385
Score = 130 bits (327), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 59/114 (51%), Positives = 77/114 (67%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSNCQAIADT 120
R +GGE+ GG D G I + P++R GYWQF ++S+ + + YCSNCQAIADT
Sbjct: 212 RVQGAPSGGELTLGGYDPKHMSGPIQWVPLTRDGYWQFAMDSLSVNGDSYCSNCQAIADT 271
Query: 121 GTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
GTSL+ GP+ I +LNK IGA+P+A G VDC + MPNVDI+L G+ FTLT
Sbjct: 272 GTSLLAGPTDAIKKLNKQIGAIPIAQGEYMVDCKKIPTMPNVDIVLNGQKFTLT 325
Score = 70.1 bits (170), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 38/83 (45%), Positives = 49/83 (59%), Gaps = 4/83 (4%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASD- 65
AQYYG I++GTPPQ FKV+FDTGSSNLW+PS+ C L D ++F N +
Sbjct: 67 AQYYGEITIGTPPQRFKVVFDTGSSNLWVPSKQCKSLACDLHHKYDSSQSSTYFPNGTKF 126
Query: 66 --ENGGEIMFGGVDKDKF-VGDI 85
E G + G + DK VGD+
Sbjct: 127 AIEYGSGSLTGFLSGDKTCVGDL 149
>gi|307203870|gb|EFN82801.1| Lysosomal aspartic protease [Harpegnathos saltator]
Length = 374
Score = 129 bits (325), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 60/115 (52%), Positives = 83/115 (72%), Gaps = 1/115 (0%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYC-SNCQAIAD 119
R+ + GGE++ GG D D + G+ TY PVSRKGYWQF ++SI++ + C S CQAIAD
Sbjct: 200 RDPDAKVGGEMLLGGSDSDHYEGEFTYVPVSRKGYWQFAMDSIQVHGHTLCASGCQAIAD 259
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
TGTSLI GP + +A +N LIGA + G A VDCD ++K+P +D+I+GGK F+L+
Sbjct: 260 TGTSLIAGPVEEVAVINSLIGATTIIAGEAIVDCDLIEKLPGIDVIIGGKMFSLS 314
Score = 79.7 bits (195), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 33/41 (80%), Positives = 38/41 (92%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQYYG I++GTPPQEF+VIFDTGSSNLW+PS+ CSI NIAC
Sbjct: 53 AQYYGVITIGTPPQEFRVIFDTGSSNLWVPSKKCSITNIAC 93
>gi|170063951|ref|XP_001867326.1| lysosomal aspartic protease [Culex quinquefasciatus]
gi|167881401|gb|EDS44784.1| lysosomal aspartic protease [Culex quinquefasciatus]
Length = 387
Score = 126 bits (316), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 58/114 (50%), Positives = 76/114 (66%), Gaps = 1/114 (0%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN-CQAIAD 119
R+ S GGEI+FGG D K+ GD TY PV RK YWQF ++S+K+ +C+N C+AIAD
Sbjct: 213 RDPSAAEGGEIIFGGSDSAKYTGDFTYLPVDRKAYWQFKMDSVKVGDTEFCNNGCEAIAD 272
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
TGTSLI GP+ + +NK IG P+ NG VDC + K+P + +LGGK F L
Sbjct: 273 TGTSLIAGPTSEVTAINKAIGGTPIINGEYMVDCSLIPKLPKIKFVLGGKEFEL 326
Score = 75.5 bits (184), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 30/41 (73%), Positives = 36/41 (87%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQY+G I++GTPPQ FKV+FDTGSSNLW+PS+ CS NIAC
Sbjct: 66 AQYFGAITIGTPPQSFKVVFDTGSSNLWVPSKECSFTNIAC 106
>gi|391329068|ref|XP_003738999.1| PREDICTED: lysosomal aspartic protease-like [Metaseiulus
occidentalis]
Length = 384
Score = 125 bits (314), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 63/118 (53%), Positives = 78/118 (66%), Gaps = 4/118 (3%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI---EKNVYCSN-CQA 116
R+ G E++ GG+D G+ITY PVSRKGYWQF ++S+KI K C+N CQA
Sbjct: 209 RDNDHPTGSELVIGGIDPKHHKGEITYIPVSRKGYWQFKMDSVKIGDVSKTTLCANGCQA 268
Query: 117 IADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
IADTGTSLI GP+ + LNK IGA P NG V+C+NL MPN+ LGGK+F LT
Sbjct: 269 IADTGTSLIAGPTSEVKALNKAIGAAPFLNGEYLVNCNNLPTMPNITFTLGGKDFELT 326
Score = 72.0 bits (175), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 29/41 (70%), Positives = 35/41 (85%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQYYG IS+G PPQ F+V+FDTGSSNLW+PS +C I N+AC
Sbjct: 62 AQYYGPISIGNPPQPFQVVFDTGSSNLWVPSANCPITNVAC 102
>gi|427789779|gb|JAA60341.1| Putative cathepsin d isoform 1 protein [Rhipicephalus pulchellus]
Length = 391
Score = 125 bits (313), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 59/115 (51%), Positives = 79/115 (68%), Gaps = 2/115 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKN-VYCS-NCQAIA 118
RNASD GGE++FGG+DK + G+ITY PV+RKGYWQF ++ + + N +C+ C+AIA
Sbjct: 216 RNASDPQGGEVLFGGIDKAHYTGNITYVPVTRKGYWQFHMDGVTVGTNTTFCNGGCEAIA 275
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
DTGTSLI GPS+ I +LN IGA P G V C ++ K+PN+ L G +F L
Sbjct: 276 DTGTSLIAGPSEEIQKLNLAIGAAPFTAGEYLVSCKSIPKLPNITFTLNGHDFQL 330
Score = 76.3 bits (186), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 38/76 (50%), Positives = 48/76 (63%), Gaps = 5/76 (6%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHF 60
+++ AQYYG I+LGTPPQ F+V+FDTGSSNLW+PS CS NIAC + HS
Sbjct: 63 LKNYLDAQYYGDITLGTPPQVFRVVFDTGSSNLWVPSSKCSFTNIACWLHHKY---HSSR 119
Query: 61 RNASDENGG--EIMFG 74
+NG EI +G
Sbjct: 120 STTYQKNGTAFEIRYG 135
>gi|241275826|ref|XP_002406708.1| aspartic protease, putative [Ixodes scapularis]
gi|215496940|gb|EEC06580.1| aspartic protease, putative [Ixodes scapularis]
Length = 345
Score = 124 bits (312), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 58/115 (50%), Positives = 81/115 (70%), Gaps = 2/115 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKN-VYCSN-CQAIA 118
RNASD GGE++FGG+D+ ++G+I+Y PVS++GYWQ ++ ++ N +CS C+AI
Sbjct: 213 RNASDPAGGEVLFGGIDESHYIGNISYVPVSKRGYWQVHMDGTRVGNNGSFCSGGCEAIL 272
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
DTGTSLI GPS I +LN LIGA P A+G V C ++DK+P + L GK+F L
Sbjct: 273 DTGTSLIAGPSDEIEKLNLLIGAAPFASGEYIVSCKSIDKLPKITFTLAGKDFVL 327
Score = 75.5 bits (184), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 30/54 (55%), Positives = 40/54 (74%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKF 54
+++ AQYYG I+LGTPPQ F+V+FDTGSSNLW+PS C NIAC + + +
Sbjct: 60 LKNYLDAQYYGEITLGTPPQSFRVVFDTGSSNLWVPSAKCPFTNIACLLHRKYY 113
>gi|227018334|gb|ACP18833.1| aspartic proteinase 1 [Chrysomela tremula]
Length = 386
Score = 123 bits (309), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 59/115 (51%), Positives = 75/115 (65%), Gaps = 1/115 (0%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN-CQAIAD 119
R+ + E GGEI+FGG D + GD TY PV+RKGYWQ ++ ++ N C CQAI D
Sbjct: 211 RDPNAEVGGEIVFGGSDPKYYTGDFTYLPVTRKGYWQIKMDKAVVDSNTLCDGGCQAIVD 270
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
TGTSLI GPS I ++ K +GA + G VDC+ L MPN+D +LGGK FTLT
Sbjct: 271 TGTSLITGPSDEIEKIVKAVGATAITAGEYTVDCNKLSSMPNIDFVLGGKTFTLT 325
Score = 72.4 bits (176), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 30/41 (73%), Positives = 35/41 (85%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQYYG I++GTP Q+F VIFDTGSSNLWIPS C +LN+AC
Sbjct: 64 AQYYGEITIGTPGQKFNVIFDTGSSNLWIPSHKCKLLNVAC 104
>gi|315274244|gb|ADU03674.1| cathepsin D2 [Ixodes ricinus]
Length = 387
Score = 123 bits (308), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 58/115 (50%), Positives = 80/115 (69%), Gaps = 2/115 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKN-VYCSN-CQAIA 118
RNASD GGE++FGG+D+ + G+I+Y PVS++GYWQ ++ ++ N +CS C+AI
Sbjct: 212 RNASDPAGGEVLFGGIDESHYTGNISYVPVSKRGYWQVHMDGTRVGNNGSFCSGGCEAIL 271
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
DTGTSLI GPS I +LN LIGA P A+G V C ++DK+P + L GK+F L
Sbjct: 272 DTGTSLIAGPSDEIEKLNLLIGAAPFASGEYIVSCKSIDKLPKITFTLAGKDFVL 326
Score = 75.5 bits (184), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 30/54 (55%), Positives = 40/54 (74%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKF 54
+++ AQYYG I+LGTPPQ F+V+FDTGSSNLW+PS C NIAC + + +
Sbjct: 59 LKNYLDAQYYGEITLGTPPQSFRVVFDTGSSNLWVPSAKCPFTNIACLLHRKYY 112
>gi|347451476|gb|AEO94539.1| aspartate protease cathepsin D [Triatoma infestans]
Length = 393
Score = 123 bits (308), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 54/114 (47%), Positives = 78/114 (68%), Gaps = 1/114 (0%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN-CQAIAD 119
R+ + GGEI+FGG D +K+VGD TY PV ++GYWQF ++ + + +C CQAIAD
Sbjct: 218 RDPAAAVGGEIIFGGSDPEKYVGDFTYVPVDKQGYWQFNMDKVIVNGKTFCKGGCQAIAD 277
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
TGTSLI GP++ + LNKL+G P+A G + CD + K+P +D ++GG F+L
Sbjct: 278 TGTSLIAGPTEDVIALNKLLGGTPIAGGEYMISCDLIPKLPKIDFVIGGNKFSL 331
Score = 75.9 bits (185), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 31/45 (68%), Positives = 38/45 (84%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDK 51
AQYYG I+LG+PPQ F+V+FDTGSSNLW+PS+ CS NIAC V +
Sbjct: 71 AQYYGAITLGSPPQSFRVVFDTGSSNLWVPSKKCSRFNIACWVHR 115
>gi|293230|gb|AAA29350.1| aspartic protease [Aedes aegypti]
Length = 387
Score = 122 bits (307), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 57/114 (50%), Positives = 76/114 (66%), Gaps = 1/114 (0%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN-CQAIAD 119
R+ S GGEI+FGG D +K+ GD TY V RK YWQF ++S+K+ +C+N C+AIAD
Sbjct: 213 RDPSAAEGGEIIFGGSDSNKYTGDFTYLSVDRKAYWQFKMDSVKVGDTEFCNNGCEAIAD 272
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
TGTSLI GP + +NK IG P+ NG VDC + K+P + +LGGK+F L
Sbjct: 273 TGTSLIAGPVSEVTAINKAIGGTPIMNGEYMVDCSLIPKLPKISFVLGGKSFDL 326
Score = 76.6 bits (187), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 31/41 (75%), Positives = 36/41 (87%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQYYG I++GTPPQ FKV+FDTGSSNLW+PS+ CS NIAC
Sbjct: 66 AQYYGAITIGTPPQSFKVVFDTGSSNLWVPSKECSFTNIAC 106
>gi|157112486|ref|XP_001657556.1| cathepsin d [Aedes aegypti]
gi|205831550|sp|Q03168.2|ASPP_AEDAE RecName: Full=Lysosomal aspartic protease; Flags: Precursor
gi|108878060|gb|EAT42285.1| AAEL006169-PA [Aedes aegypti]
Length = 387
Score = 122 bits (307), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 57/114 (50%), Positives = 76/114 (66%), Gaps = 1/114 (0%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN-CQAIAD 119
R+ S GGEI+FGG D +K+ GD TY V RK YWQF ++S+K+ +C+N C+AIAD
Sbjct: 213 RDPSAAEGGEIIFGGSDSNKYTGDFTYLSVDRKAYWQFKMDSVKVGDTEFCNNGCEAIAD 272
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
TGTSLI GP + +NK IG P+ NG VDC + K+P + +LGGK+F L
Sbjct: 273 TGTSLIAGPVSEVTAINKAIGGTPIMNGEYMVDCSLIPKLPKISFVLGGKSFDL 326
Score = 76.6 bits (187), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 31/41 (75%), Positives = 36/41 (87%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQYYG I++GTPPQ FKV+FDTGSSNLW+PS+ CS NIAC
Sbjct: 66 AQYYGAITIGTPPQSFKVVFDTGSSNLWVPSKECSFTNIAC 106
>gi|227336874|gb|ACP21315.1| aspartic proteinase precursor [Rhipicephalus microplus]
Length = 391
Score = 122 bits (307), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 57/115 (49%), Positives = 77/115 (66%), Gaps = 2/115 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCS-NCQAIA 118
RN SD GGE++FGG+DK + G+ITY PV+RKGYWQF ++ + + N +C+ C+AIA
Sbjct: 216 RNTSDPQGGEVLFGGIDKAHYTGNITYVPVTRKGYWQFHMDGVTVGTNATFCNGGCEAIA 275
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
DTGTSLI GP+ I +LN IGA P G V C ++ K+PN+ L G+ F L
Sbjct: 276 DTGTSLIAGPTAEIQKLNMAIGAAPFLAGEYMVSCKSIPKLPNITFTLNGQEFQL 330
Score = 75.9 bits (185), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 38/76 (50%), Positives = 48/76 (63%), Gaps = 5/76 (6%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHF 60
+++ AQYYG I+LGTPPQ F+V+FDTGSSNLW+PS CS NIAC + HS
Sbjct: 63 LKNYLDAQYYGDITLGTPPQVFRVVFDTGSSNLWVPSSKCSFTNIACWLHHKY---HSSK 119
Query: 61 RNASDENGG--EIMFG 74
+NG EI +G
Sbjct: 120 STTYQKNGTAFEIRYG 135
>gi|147742993|sp|P85136.1|CARDE_CYNCA RecName: Full=Cardosin-E; Contains: RecName: Full=Cardosin-E heavy
chain; Contains: RecName: Full=Cardosin-E light chain
Length = 224
Score = 122 bits (307), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 62/141 (43%), Positives = 89/141 (63%), Gaps = 9/141 (6%)
Query: 9 YYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHF-----RNA 63
Y+G I +GTPPQ++ VI+DTGSS LW+PS I + + D H F RN
Sbjct: 17 YFGEIGIGTPPQKYTVIYDTGSSVLWVPSSKEQDF-IEATDETDNVFLHRRFSFWLNRNV 75
Query: 64 SDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCS-NCQAIADT 120
+E GGE++FGG+D + F GD TY PV+ + YWQFG+ + I + +C+ CQA AD+
Sbjct: 76 DEEEGGELVFGGLDPNHFRGDHTYVPVTYQYYWQFGIGDVLIGDKSTGFCAPGCQAFADS 135
Query: 121 GTSLIIGPSKVIAELNKLIGA 141
GTSL+ GP+ ++ ++N IGA
Sbjct: 136 GTSLLSGPTAIVTQINHAIGA 156
>gi|449666857|ref|XP_002161366.2| PREDICTED: lysosomal aspartic protease-like [Hydra magnipapillata]
Length = 387
Score = 120 bits (302), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 55/115 (47%), Positives = 81/115 (70%), Gaps = 2/115 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-KNVYC-SNCQAIA 118
RN +D GGE++ GGVD KFVGDITY+PV+ +GYWQF ++ + + + ++C S C AIA
Sbjct: 212 RNVNDSTGGELLLGGVDSSKFVGDITYTPVTVEGYWQFKMDKVVVNGEPMFCASGCNAIA 271
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
DTGTSLI GP++ + +LN++IGA P+ G +DC + +P ++ +GGK F L
Sbjct: 272 DTGTSLIAGPTEEVNKLNQMIGATPIVGGEYIIDCAKVPSLPALEFWIGGKQFVL 326
Score = 77.8 bits (190), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 33/47 (70%), Positives = 38/47 (80%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+R+ AQYYG ISLGTPPQ FKV+FDTGSSNLW+PS HC +IAC
Sbjct: 59 LRNYMDAQYYGDISLGTPPQPFKVVFDTGSSNLWVPSSHCGWTDIAC 105
>gi|66560290|ref|XP_392857.2| PREDICTED: lysosomal aspartic protease [Apis mellifera]
Length = 385
Score = 120 bits (302), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 57/115 (49%), Positives = 80/115 (69%), Gaps = 2/115 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCSNCQAIA 118
RN D+ GGE++ GG D + + G TY PV +KGYWQF ++SI+I + V C+AIA
Sbjct: 210 RNPDDKYGGELILGGSDPNHYEGSFTYVPVDKKGYWQFRMDSIQIGSDLKVCQQGCEAIA 269
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
DTGTSLI GP K I +NK IGA P+A G A +DC+++ +P ++ +LGGK+F+L
Sbjct: 270 DTGTSLIAGPVKEIEAINKAIGATPIAAGEAMIDCNSIPNLPTINFVLGGKSFSL 324
Score = 77.8 bits (190), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 34/61 (55%), Positives = 46/61 (75%), Gaps = 2/61 (3%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDK--DKFVGHSHFRNAS 64
AQYYG IS+GTPPQ+F+VIFDTGSSNLW+PS+ C + NIAC + + D ++ +N +
Sbjct: 63 AQYYGVISIGTPPQDFRVIFDTGSSNLWVPSKKCHLTNIACKLHRKYDNTKSSTYKKNGT 122
Query: 65 D 65
D
Sbjct: 123 D 123
>gi|225717994|gb|ACO14843.1| Lysosomal aspartic protease precursor [Caligus clemensi]
Length = 386
Score = 120 bits (301), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 54/115 (46%), Positives = 79/115 (68%), Gaps = 1/115 (0%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN-CQAIAD 119
R+ + + GGEI+FGG D D + G+ITY PV++KGYWQF ++ +K+ +C N CQAIAD
Sbjct: 212 RDPNAQVGGEIIFGGSDPDHYKGNITYIPVTKKGYWQFKMDGMKVSSKTFCQNGCQAIAD 271
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
TGTSLI GPS + LN+L+G +P+ NG +C ++ +P + +GG +F LT
Sbjct: 272 TGTSLIAGPSVEVNALNQLLGGMPIVNGEYMFNCADVPTLPAITFTIGGTDFVLT 326
Score = 73.6 bits (179), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 31/41 (75%), Positives = 35/41 (85%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQYYG I++GTPPQ F VIFDTGSSNLW+PS+ C I NIAC
Sbjct: 65 AQYYGPITIGTPPQSFNVIFDTGSSNLWVPSKSCHITNIAC 105
>gi|291416142|ref|XP_002724306.1| PREDICTED: cathepsin D [Oryctolagus cuniculus]
Length = 377
Score = 120 bits (301), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 75/236 (31%), Positives = 114/236 (48%), Gaps = 62/236 (26%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNL--------------WIPSQHCS----- 41
+R+ AQYYG I +GTPPQ F V+FDTGSSNL WI ++ S
Sbjct: 80 LRNYMDAQYYGEIGIGTPPQCFTVVFDTGSSNLWVPSVHCKLLDIACWIHHKYNSKKSST 139
Query: 42 ------------------------------------ILNIACSVDKDKFVGHSHF----- 60
+L + ++ + K V + F
Sbjct: 140 YVKNGTTFDIHYGSGSLSGYLSQDTVSXXXXXXXXNVLPVFDNLMQQKLVEKNVFSFYLN 199
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAIA 118
R+ + + GGE+M GGVD + G ++Y V+RK YWQ ++ + + + C C+AI
Sbjct: 200 RDPAAQPGGELMLGGVDPKYYQGSLSYLNVTRKAYWQVHMDQLNVGSGLTLCEGGCEAIV 259
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSL++GP + EL + IGAVPL G + C+ + +P V + LGG+++TL+
Sbjct: 260 DTGTSLLVGPVDEVRELQRAIGAVPLIQGEYIIPCEKVSSLPPVTLKLGGRDYTLS 315
>gi|260810438|ref|XP_002599971.1| hypothetical protein BRAFLDRAFT_74093 [Branchiostoma floridae]
gi|229285255|gb|EEN55983.1| hypothetical protein BRAFLDRAFT_74093 [Branchiostoma floridae]
Length = 388
Score = 120 bits (301), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 57/115 (49%), Positives = 76/115 (66%), Gaps = 1/115 (0%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN-CQAIAD 119
R+ S GGE++ GG D + + GD T+ V+RKGYWQF ++ I + + +C CQAIAD
Sbjct: 214 RDPSATAGGELILGGADPNYYTGDFTFLDVTRKGYWQFKMDGINVGGSTFCQEGCQAIAD 273
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
TGTSLI GP + + +L+K IGA PLA G KVDC + +P + ILGGK F LT
Sbjct: 274 TGTSLIAGPIEEVNKLHKQIGATPLAGGEYKVDCSKVTSLPTISFILGGKEFELT 328
Score = 71.2 bits (173), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 29/41 (70%), Positives = 35/41 (85%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQY+G IS+GTPPQ F+V+FDTGSSNLW+PS+ C NIAC
Sbjct: 67 AQYFGPISIGTPPQSFQVVFDTGSSNLWVPSKKCHYSNIAC 107
>gi|380018765|ref|XP_003693293.1| PREDICTED: lysosomal aspartic protease-like [Apis florea]
Length = 385
Score = 120 bits (301), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 56/115 (48%), Positives = 80/115 (69%), Gaps = 2/115 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCSNCQAIA 118
RN D+ GGE++ GG D + + G TY PV +KGYWQF ++SI+I + V C+AIA
Sbjct: 210 RNPDDKYGGELILGGSDPNHYEGSFTYVPVDKKGYWQFKMDSIQIGSDLKVCQQGCEAIA 269
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
DTGTSLI GP K + +NK IGA P+A G A +DC+++ +P ++ +LGGK+F+L
Sbjct: 270 DTGTSLIAGPVKEVGAINKAIGATPIAAGEAMIDCNSIPNLPTINFVLGGKSFSL 324
Score = 77.8 bits (190), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 34/61 (55%), Positives = 46/61 (75%), Gaps = 2/61 (3%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDK--DKFVGHSHFRNAS 64
AQYYG IS+GTPPQ+F+VIFDTGSSNLW+PS+ C + NIAC + + D ++ +N +
Sbjct: 63 AQYYGVISIGTPPQDFRVIFDTGSSNLWVPSKKCHLTNIACKLHRKYDNTKSSTYKKNGT 122
Query: 65 D 65
D
Sbjct: 123 D 123
>gi|307167890|gb|EFN61279.1| Lysosomal aspartic protease [Camponotus floridanus]
Length = 354
Score = 120 bits (300), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 58/115 (50%), Positives = 81/115 (70%), Gaps = 2/115 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN-CQAIAD 119
R+ S E GGE++ GG D D + D+TY PV+RKGYWQF ++ I++ +C+N CQAIAD
Sbjct: 182 RDPSAEVGGELILGGSDPDHYEADLTYVPVTRKGYWQFSMDGIEVGNRTFCNNGCQAIAD 241
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
TGTSLI GP +A +NKLIGA +A G A VDC+ + ++P ++ LG K F+L+
Sbjct: 242 TGTSLIAGPVADVAAINKLIGASAIA-GQAIVDCNKIPQLPEINFNLGNKKFSLS 295
Score = 79.0 bits (193), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 33/43 (76%), Positives = 38/43 (88%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSV 49
AQYYG IS+GTPPQEFKVIFDTGSSNLW+PS++C NIAC +
Sbjct: 35 AQYYGVISIGTPPQEFKVIFDTGSSNLWVPSKNCHFTNIACQL 77
>gi|332024025|gb|EGI64243.1| Lysosomal aspartic protease [Acromyrmex echinatior]
Length = 381
Score = 120 bits (300), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 54/114 (47%), Positives = 80/114 (70%), Gaps = 2/114 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN-CQAIAD 119
R++S GGE++ GG D D + G+ TY PV+RKGYWQF ++ ++++ + +C CQAIAD
Sbjct: 211 RDSSAAEGGEMILGGSDPDHYEGEFTYIPVTRKGYWQFKMDGVQVKDHAFCKEGCQAIAD 270
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
TGTSLI GP+ I ++N++IGA + G A V+C+ + MP++ LG KNFTL
Sbjct: 271 TGTSLIAGPTSEIKDINEMIGATSIG-GEAMVNCNQISSMPSISFTLGNKNFTL 323
Score = 78.2 bits (191), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 33/41 (80%), Positives = 36/41 (87%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQYYG IS+GTPPQ FKVIFDTGSSNLW+PS+ C I NIAC
Sbjct: 64 AQYYGVISIGTPPQNFKVIFDTGSSNLWVPSKKCHITNIAC 104
>gi|46309251|dbj|BAD15111.1| cathepsin D [Todarodes pacificus]
Length = 392
Score = 119 bits (299), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 57/118 (48%), Positives = 78/118 (66%), Gaps = 1/118 (0%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN-CQAIAD 119
R+ S + GGE++ GG D + G+ TY PV+RKGYWQ ++ + +N +CS CQAIAD
Sbjct: 218 RDPSGKEGGELILGGSDPKHYTGNFTYLPVTRKGYWQIKMDKVVSGENTFCSGGCQAIAD 277
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLTRTN 177
TGTSLI GP I +LN+ IG L G VDC ++ K+PNVD +LGGK F+L ++
Sbjct: 278 TGTSLIAGPVDEIKKLNEAIGGRALPGGEYMVDCASIPKLPNVDFVLGGKTFSLKTSD 335
Score = 70.5 bits (171), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 29/41 (70%), Positives = 35/41 (85%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQYYG IS+GTP Q FKV+FDTGSSNLW+PS+ C + +IAC
Sbjct: 71 AQYYGVISIGTPAQNFKVVFDTGSSNLWVPSKKCKLSDIAC 111
>gi|348502026|ref|XP_003438570.1| PREDICTED: LOW QUALITY PROTEIN: chymosin-like [Oreochromis
niloticus]
Length = 318
Score = 119 bits (299), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 74/203 (36%), Positives = 103/203 (50%), Gaps = 47/203 (23%)
Query: 9 YYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKF--VGHSHFRN---- 62
Y G IS+GTPPQ FKVIFDTGSSNLW+PS +C N A + KF S +R
Sbjct: 72 YXGVISIGTPPQSFKVIFDTGSSNLWVPSIYC---NSAACNNHAKFDPRRSSTYRQNGRP 128
Query: 63 --------------------------------ASDENGGEIMFGGVDKDKFVGDITYSPV 90
++++ G + FGGVD + + G IT+ P+
Sbjct: 129 LSITYGTGSMTGILGYDTVTKYSISPLVKLFCSNNQRGSVVTFGGVDTNHYSGSITWIPL 188
Query: 91 SRKGYWQFGVESIKIEKNVYCSNCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAK 150
S + YWQ V+S+ CQA+ DTGTS+I+GP I +N +GA NG
Sbjct: 189 SNELYWQITVDSVSGN-----GGCQAVVDTGTSMIVGPQSDIDNINSWLGA-SRQNGEYI 242
Query: 151 VDCDNLDKMPNVDIILGGKNFTL 173
V+C+N+ +MP+V + G+ FTL
Sbjct: 243 VNCNNIGQMPDVTFNINGQQFTL 265
>gi|121543617|gb|ABM55520.1| putative cathepsin D [Maconellicoccus hirsutus]
Length = 391
Score = 119 bits (298), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 63/125 (50%), Positives = 84/125 (67%), Gaps = 3/125 (2%)
Query: 51 KDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-KNV 109
KD RN S +GGEI+FGGVD KF G+ TY PVS KGYWQFG+E I + K++
Sbjct: 207 KDSVFSFYLNRNTSAADGGEIIFGGVDPSKFRGNFTYVPVSVKGYWQFGMEKISLGGKDI 266
Query: 110 YCSNCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGK 169
S QAIADTGTSLI GPS+ IA +NK IGAV + G V C+++D++P++ + G
Sbjct: 267 QTS--QAIADTGTSLIAGPSEDIAAINKAIGAVEILGGQYTVSCESIDQLPDITFTINGV 324
Query: 170 NFTLT 174
++TL+
Sbjct: 325 DYTLS 329
Score = 80.1 bits (196), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 33/49 (67%), Positives = 41/49 (83%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSV 49
+R+ AQYYG I++GTPPQ F V+FDTGSSNLW+PS+ CSILNIAC +
Sbjct: 64 LRNYLDAQYYGPITIGTPPQPFNVVFDTGSSNLWVPSKQCSILNIACLI 112
>gi|167524529|ref|XP_001746600.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163774870|gb|EDQ88496.1| predicted protein [Monosiga brevicollis MX1]
Length = 381
Score = 119 bits (297), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 53/114 (46%), Positives = 77/114 (67%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSNCQAIADT 120
RN + +GGE+ GG D+ F G I ++ V+ GYWQF + + IE YC+NC+AIADT
Sbjct: 208 RNPNGVSGGELTLGGYDESHFAGPIHWTDVTVDGYWQFTMTGLSIENTPYCTNCKAIADT 267
Query: 121 GTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
GTSL+ GP+ V+ ++NK IGA +A G A VDC+ + MPNV I++ G ++L+
Sbjct: 268 GTSLLAGPTDVVKQINKAIGATTIAAGEAIVDCNKIPHMPNVTIVINGIQYSLS 321
Score = 71.6 bits (174), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 33/48 (68%), Positives = 39/48 (81%), Gaps = 1/48 (2%)
Query: 3 SSYQ-AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSV 49
S+Y+ AQY+G IS+GTP Q+FKVIFDTGSSNLW+PS C NIAC V
Sbjct: 56 SNYEDAQYFGEISIGTPAQKFKVIFDTGSSNLWVPSSQCPKTNIACDV 103
>gi|195443988|ref|XP_002069667.1| GK11645 [Drosophila willistoni]
gi|194165752|gb|EDW80653.1| GK11645 [Drosophila willistoni]
Length = 388
Score = 119 bits (297), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 50/113 (44%), Positives = 77/113 (68%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSNCQAIADT 120
RN S+ +GGE++FGG D KF G +TY P+S GYWQF ++ I+I+ + QAIADT
Sbjct: 219 RNGSERSGGELLFGGTDSTKFTGQLTYVPLSNPGYWQFEMQGIEIDGQRIAEHRQAIADT 278
Query: 121 GTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
GTSL++ P + +N+LIG P +G V+C+++D++P++ I+GG+ F L
Sbjct: 279 GTSLLVAPPREHLIINRLIGGFPTTSGEYVVNCNHIDRLPDISFIIGGQRFAL 331
Score = 60.1 bits (144), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 29/57 (50%), Positives = 36/57 (63%), Gaps = 8/57 (14%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNAS 64
+Y G IS+GTP Q F ++FDTGS+NLW+PS CS+ N AC H H NAS
Sbjct: 73 EYAGPISIGTPGQPFNMLFDTGSANLWVPSADCSLKNRACQ--------HHHRYNAS 121
>gi|296471706|tpg|DAA13821.1| TPA: pregnancy-associated glycoprotein 10-like [Bos taurus]
Length = 287
Score = 119 bits (297), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 70/210 (33%), Positives = 106/210 (50%), Gaps = 39/210 (18%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHC-------------------- 40
+R+ Y GTI++GTPPQEF+VIFDTGSS+LW+PS HC
Sbjct: 63 LRNYKDLVYIGTIAIGTPPQEFRVIFDTGSSDLWVPSIHCHSPSCLTHSLFNPRTSTTFK 122
Query: 41 ---SILNIACSVDKDKFV--------------GHSHFRNASDENGGEIMFGGVDKDKFVG 83
+I+++ + + K V G + ++ ENG +MFGGVD G
Sbjct: 123 LLNNIIDLIYASGRIKGVLGQDIIQIKLLGTLGAAQGFSSRRENGSVLMFGGVDHSYHTG 182
Query: 84 DITYSPVSRKGYWQFGVESIKIE-KNVYCSN-CQAIADTGTSLIIGPSKVIAELNKLIGA 141
+ + PVSR YWQ + I + K + C CQAI DTGT+ ++GPS+ IA++ +LI
Sbjct: 183 KLNWVPVSRTHYWQITIGRISMNGKLIACKRGCQAIMDTGTTFLLGPSRHIAKIQRLIRI 242
Query: 142 VPLANGPAKVDCDNLDKMPNVDIILGGKNF 171
P + V C+ +P + + G ++
Sbjct: 243 RPFGSLQYTVPCNITSTLPPLIFTIKGIDY 272
>gi|83319201|dbj|BAE53722.1| aspartic protease [Haemaphysalis longicornis]
Length = 391
Score = 118 bits (296), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 57/115 (49%), Positives = 77/115 (66%), Gaps = 2/115 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCS-NCQAIA 118
RNASD NGGE++FGG+D+ + G+ITY PV+RKGYWQF + +K+ N +C+ C AIA
Sbjct: 216 RNASDPNGGEVLFGGIDEAHYTGNITYVPVTRKGYWQFHMNGVKVGDNATFCNGGCAAIA 275
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
DTGTSLI GP++ I +LN IGA P G V C ++ +P ++ L G F L
Sbjct: 276 DTGTSLIAGPTEEIHKLNVAIGAAPFMAGEYIVSCKSIPTLPKINFNLNGNEFVL 330
Score = 73.2 bits (178), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 29/47 (61%), Positives = 37/47 (78%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+++ AQYYG ++LGTPPQ F+V+FDTGSSNLW+PS C NIAC
Sbjct: 63 LKNYLDAQYYGDVTLGTPPQVFRVVFDTGSSNLWVPSSKCPFTNIAC 109
>gi|157779726|gb|ABV71391.1| aspartic protease [Haemaphysalis longicornis]
Length = 391
Score = 118 bits (296), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 57/115 (49%), Positives = 77/115 (66%), Gaps = 2/115 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCS-NCQAIA 118
RNASD NGGE++FGG+D+ + G+ITY PV+RKGYWQF + +K+ N +C+ C AIA
Sbjct: 216 RNASDPNGGEVLFGGIDEAHYTGNITYVPVTRKGYWQFHMNGVKVGDNATFCNGGCAAIA 275
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
DTGTSLI GP++ I +LN IGA P G V C ++ +P ++ L G F L
Sbjct: 276 DTGTSLIAGPTEEIHKLNVAIGAAPFMAGEYIVSCKSIPTLPKINFNLNGNEFVL 330
Score = 73.2 bits (178), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 29/47 (61%), Positives = 37/47 (78%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+++ AQYYG ++LGTPPQ F+V+FDTGSSNLW+PS C NIAC
Sbjct: 63 LKNYLDAQYYGDVTLGTPPQVFRVVFDTGSSNLWVPSSKCPFTNIAC 109
>gi|290561455|gb|ADD38128.1| Lysosomal aspartic protease [Lepeophtheirus salmonis]
Length = 384
Score = 118 bits (296), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 53/115 (46%), Positives = 78/115 (67%), Gaps = 1/115 (0%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN-CQAIAD 119
RN + GGEI+FGG D D + G+ITY PV++KGYWQF ++ +++ +C N CQAIAD
Sbjct: 210 RNPDAKVGGEIIFGGSDPDHYKGNITYIPVTKKGYWQFKMDKMEVNSKSFCQNGCQAIAD 269
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
TGTSLI GPS + LN+L+G P+ NG +C+++ +P + +GG+ F L+
Sbjct: 270 TGTSLIAGPSIEVNALNQLLGGTPIINGEYMFNCEDIPNLPPITFTIGGEEFVLS 324
Score = 74.7 bits (182), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 32/41 (78%), Positives = 36/41 (87%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQYYG I++G+PPQ FKVIFDTGSSNLWIPS+ C I NIAC
Sbjct: 63 AQYYGPITIGSPPQSFKVIFDTGSSNLWIPSKSCHITNIAC 103
>gi|358419658|ref|XP_869896.4| PREDICTED: pregnancy-associated glycoprotein 2 [Bos taurus]
gi|359081122|ref|XP_002699305.2| PREDICTED: pregnancy-associated glycoprotein 2 [Bos taurus]
Length = 334
Score = 118 bits (296), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 70/210 (33%), Positives = 106/210 (50%), Gaps = 39/210 (18%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHC-------------------- 40
+R+ Y GTI++GTPPQEF+VIFDTGSS+LW+PS HC
Sbjct: 63 LRNYKDLVYIGTIAIGTPPQEFRVIFDTGSSDLWVPSIHCHSPSCLTHSLFNPRTSTTFK 122
Query: 41 ---SILNIACSVDKDKFV--------------GHSHFRNASDENGGEIMFGGVDKDKFVG 83
+I+++ + + K V G + ++ ENG +MFGGVD G
Sbjct: 123 LLNNIIDLIYASGRIKGVLGQDIIQIKLLGTLGAAQGFSSRRENGSVLMFGGVDHSYHTG 182
Query: 84 DITYSPVSRKGYWQFGVESIKIE-KNVYCSN-CQAIADTGTSLIIGPSKVIAELNKLIGA 141
+ + PVSR YWQ + I + K + C CQAI DTGT+ ++GPS+ IA++ +LI
Sbjct: 183 KLNWVPVSRTHYWQITIGRISMNGKLIACKRGCQAIMDTGTTFLLGPSRHIAKIQRLIRI 242
Query: 142 VPLANGPAKVDCDNLDKMPNVDIILGGKNF 171
P + V C+ +P + + G ++
Sbjct: 243 RPFGSLQYTVPCNITSTLPPLIFTIKGIDY 272
>gi|225713714|gb|ACO12703.1| Lysosomal aspartic protease precursor [Lepeophtheirus salmonis]
gi|290462953|gb|ADD24524.1| Lysosomal aspartic protease [Lepeophtheirus salmonis]
Length = 384
Score = 118 bits (296), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 53/115 (46%), Positives = 78/115 (67%), Gaps = 1/115 (0%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN-CQAIAD 119
RN + GGEI+FGG D D + G+ITY PV++KGYWQF ++ +++ +C N CQAIAD
Sbjct: 210 RNPDAKVGGEIIFGGSDPDHYKGNITYIPVTKKGYWQFKMDKMEVNSKSFCQNGCQAIAD 269
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
TGTSLI GPS + LN+L+G P+ NG +C+++ +P + +GG+ F L+
Sbjct: 270 TGTSLIAGPSIEVNALNQLLGGTPIINGEYMFNCEDIPNLPPITFTIGGEEFVLS 324
Score = 74.7 bits (182), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 32/41 (78%), Positives = 36/41 (87%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQYYG I++G+PPQ FKVIFDTGSSNLWIPS+ C I NIAC
Sbjct: 63 AQYYGPITIGSPPQSFKVIFDTGSSNLWIPSKSCHITNIAC 103
>gi|156553448|ref|XP_001600543.1| PREDICTED: lysosomal aspartic protease-like [Nasonia vitripennis]
Length = 384
Score = 118 bits (296), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 56/114 (49%), Positives = 75/114 (65%), Gaps = 1/114 (0%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN-CQAIAD 119
R+ + + GGE++ GG D + GD TY PVSRK YWQF ++ I I ++C N C+AIAD
Sbjct: 210 RDPNAKIGGEMILGGSDSAHYTGDFTYVPVSRKAYWQFKMDKITIGDKLFCENGCEAIAD 269
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
TGTSLI GP I +NK IGA P+ G A V CD + +P +D ++GGK F+L
Sbjct: 270 TGTSLIAGPVGEIEGINKKIGATPIVAGEAMVSCDAVPNLPTIDFVVGGKKFSL 323
Score = 68.9 bits (167), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 29/41 (70%), Positives = 34/41 (82%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQYYG I +G+P Q+F VIFDTGSSNLW+PS+ C I NIAC
Sbjct: 63 AQYYGEIGIGSPMQKFTVIFDTGSSNLWVPSKKCHITNIAC 103
>gi|346469557|gb|AEO34623.1| hypothetical protein [Amblyomma maculatum]
Length = 391
Score = 118 bits (295), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 55/115 (47%), Positives = 78/115 (67%), Gaps = 2/115 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCS-NCQAIA 118
RNA+D NGGE++FGG+D+ + G+I+Y PV+RKGYWQF ++ +K+ N +C+ C+AIA
Sbjct: 216 RNATDPNGGEVLFGGIDEAHYTGNISYVPVTRKGYWQFHMDGLKVGDNATFCNGGCEAIA 275
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
DTGTSLI GP++ I +LN IGA P G V C ++ +P + L G F L
Sbjct: 276 DTGTSLIAGPTEEIQKLNLAIGAAPFTAGEYLVSCKSIPTLPKITFNLNGHEFVL 330
Score = 73.9 bits (180), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 31/58 (53%), Positives = 40/58 (68%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHS 58
+++ AQYYG I+LGTPPQ F+V+FDTGSSNLW+PS C NIAC + + S
Sbjct: 63 LKNYLDAQYYGDITLGTPPQVFRVVFDTGSSNLWVPSSKCPFTNIACMLHHKYYAKKS 120
>gi|238816835|gb|ACR56788.1| aspartic protease 4 [Strongyloides ratti]
Length = 428
Score = 117 bits (294), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 59/116 (50%), Positives = 76/116 (65%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVES-IKIEKNVYCSN-CQAIA 118
R A D +GGEI GG+D + GDITY PVSRKGYWQF ++ + + + C N CQAIA
Sbjct: 214 RIADDSDGGEITLGGMDPKHYKGDITYVPVSRKGYWQFKMDGFVGDNEKIACKNGCQAIA 273
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSLI GP + + K IGA PLA G V CD + +P V+I++GG+ F L+
Sbjct: 274 DTGTSLIAGPKAQVEAIQKFIGAEPLARGEYMVPCDKVSSLPIVNIVIGGQAFALS 329
Score = 74.3 bits (181), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 33/47 (70%), Positives = 37/47 (78%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+R+ AQYYG IS+GTP Q F VIFDTGSSNLWIPS+ C I NIAC
Sbjct: 61 LRNYMDAQYYGEISIGTPGQNFSVIFDTGSSNLWIPSKKCPIYNIAC 107
>gi|315274255|gb|ADU03675.1| putative cathepsin D3 [Ixodes ricinus]
Length = 398
Score = 117 bits (294), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 55/117 (47%), Positives = 79/117 (67%), Gaps = 3/117 (2%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVY---CSNCQAI 117
RN + GGEI FGG D + GDI+Y PVS KGYWQF V++I ++ + + C+AI
Sbjct: 219 RNTTSPTGGEITFGGTDDRFYSGDISYVPVSTKGYWQFTVDNIVVKNSSFKLCAEGCEAI 278
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
ADTGTSL+ GPS I +L KLIGA+P ++G V C+++ K+P++ +GG+ + LT
Sbjct: 279 ADTGTSLMAGPSLEIMKLQKLIGALPFSHGQYTVRCEDIHKLPDIKFHIGGQEYVLT 335
Score = 75.1 bits (183), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 34/63 (53%), Positives = 43/63 (68%), Gaps = 4/63 (6%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASDE 66
AQYYG IS+G+PPQ F+V+FDTGSSNLW+PS+ C NIAC + K + H R+ S
Sbjct: 72 AQYYGPISIGSPPQPFRVVFDTGSSNLWVPSKQCKWTNIACLLHKK----YDHTRSRSYR 127
Query: 67 NGG 69
G
Sbjct: 128 KNG 130
>gi|31197673|ref|XP_307784.1| AGAP003277-PA [Anopheles gambiae str. PEST]
gi|347969584|ref|XP_003436430.1| AGAP003277-PB [Anopheles gambiae str. PEST]
gi|347969586|ref|XP_003436431.1| AGAP003277-PC [Anopheles gambiae str. PEST]
gi|347969588|ref|XP_003436432.1| AGAP003277-PD [Anopheles gambiae str. PEST]
gi|30179074|gb|EAA03535.2| AGAP003277-PA [Anopheles gambiae str. PEST]
gi|333466215|gb|EGK96172.1| AGAP003277-PB [Anopheles gambiae str. PEST]
gi|333466216|gb|EGK96173.1| AGAP003277-PC [Anopheles gambiae str. PEST]
gi|333466217|gb|EGK96174.1| AGAP003277-PD [Anopheles gambiae str. PEST]
Length = 389
Score = 117 bits (293), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 56/114 (49%), Positives = 71/114 (62%), Gaps = 1/114 (0%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN-CQAIAD 119
R+ S GGEI+FGG D + GD TY V RK YWQF ++S+ + YC+N C+AIAD
Sbjct: 215 RDPSAAEGGEIIFGGSDSKHYTGDFTYLSVDRKAYWQFKMDSVTVGDAQYCNNGCEAIAD 274
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
TGTSLI GP + +NK IG P+ NG VDC + +P + LGGK FTL
Sbjct: 275 TGTSLIAGPVAEVTAINKAIGGTPVLNGEYMVDCSLIPSLPKITFTLGGKQFTL 328
Score = 76.6 bits (187), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 31/41 (75%), Positives = 36/41 (87%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQY+G IS+GTPPQ FKV+FDTGSSNLW+PS+ CS NIAC
Sbjct: 68 AQYFGAISIGTPPQSFKVVFDTGSSNLWVPSKQCSFTNIAC 108
>gi|268581165|ref|XP_002645565.1| C. briggsae CBR-ASP-4 protein [Caenorhabditis briggsae]
Length = 446
Score = 116 bits (291), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 56/114 (49%), Positives = 73/114 (64%), Gaps = 1/114 (0%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN-CQAIAD 119
RN E GGEI FGG+D ++V ITY+PV+RKGYWQF ++ + + CSN CQAIAD
Sbjct: 240 RNPDSELGGEITFGGIDARRYVEPITYTPVTRKGYWQFKMDKVVGSGVLGCSNGCQAIAD 299
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
TGTSLI GP I + IGA PL G + CD + +P V ++GG+ F+L
Sbjct: 300 TGTSLIAGPKAQIEAIQNFIGAEPLIKGEYMISCDKVPTLPPVSFVIGGQEFSL 353
Score = 70.5 bits (171), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 30/49 (61%), Positives = 38/49 (77%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSV 49
+R+ AQY+GTIS+GTP Q F VIFDTGSSNLW+PS+ C +IAC +
Sbjct: 87 LRNYMDAQYFGTISIGTPGQNFTVIFDTGSSNLWVPSKKCPFYDIACML 135
>gi|329754204|gb|AEC03508.1| cathepsin-D [Polyrhachis vicina]
Length = 384
Score = 116 bits (290), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 55/115 (47%), Positives = 78/115 (67%), Gaps = 2/115 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAIA 118
R+ + GGE++ GG D + + GD TY PV RK YWQF ++S++I ++ C C+AIA
Sbjct: 209 RDPDAKEGGELILGGSDPNHYKGDFTYVPVDRKAYWQFKMDSVQIGSDLKLCKQGCEAIA 268
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
DTGTSLI GP K I +NK IGA P+ G VDC+++ +P ++ +LGGK+FTL
Sbjct: 269 DTGTSLIAGPVKEIEAINKAIGATPIVGGEYMVDCNSIPNLPTINFVLGGKSFTL 323
Score = 75.5 bits (184), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 32/41 (78%), Positives = 35/41 (85%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQYYG IS+GTPPQ FKVIFDTGSSNLW+PS+ C NIAC
Sbjct: 62 AQYYGAISIGTPPQNFKVIFDTGSSNLWVPSKKCHFTNIAC 102
>gi|324507249|gb|ADY43078.1| Cathepsin D [Ascaris suum]
Length = 437
Score = 116 bits (290), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 55/119 (46%), Positives = 78/119 (65%), Gaps = 2/119 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVY-CSN-CQAIA 118
RN D+ GGEI FGG D ++V ITY+PV+R+GYWQF ++ + E+ V C+N CQAIA
Sbjct: 233 RNPDDQIGGEITFGGTDTKRYVEPITYTPVTRRGYWQFKMDKVVGEEAVLACANGCQAIA 292
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLTRTN 177
DTGTSLI GP + + + K IGA PL G + CD + +P+V ++ K ++L T+
Sbjct: 293 DTGTSLIAGPKQQVDTIQKFIGAEPLFRGEYMIPCDKVPSLPDVSFVIASKTYSLKPTD 351
Score = 71.2 bits (173), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 29/47 (61%), Positives = 37/47 (78%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+++ AQYYG IS+GTPPQ F VIFDTGS+NLW+PS+ C +IAC
Sbjct: 80 LKNYMDAQYYGDISIGTPPQNFTVIFDTGSANLWVPSRKCPFTDIAC 126
>gi|56118817|ref|NP_001008172.1| MGC89016 protein precursor [Xenopus (Silurana) tropicalis]
gi|51950197|gb|AAH82490.1| MGC89016 protein [Xenopus (Silurana) tropicalis]
Length = 421
Score = 116 bits (290), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 55/126 (43%), Positives = 80/126 (63%), Gaps = 12/126 (9%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN------- 113
R+AS + GGEI+ GG D D +VG+ TY V+RK YWQ ++S+ + C N
Sbjct: 215 RDASAKEGGEIILGGSDPDHYVGNFTYLDVTRKAYWQIKMDSVTVSSESECMNAMMVGGE 274
Query: 114 -----CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGG 168
CQAIADTGTSLI+GPS + +LN IGA+P+ +G ++C + +P ++ +LGG
Sbjct: 275 YCKGGCQAIADTGTSLIVGPSSDVEKLNAEIGALPIISGEYWINCSKIASLPTINFVLGG 334
Query: 169 KNFTLT 174
K+F+LT
Sbjct: 335 KSFSLT 340
Score = 70.9 bits (172), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 28/44 (63%), Positives = 36/44 (81%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDK 51
QYYG IS+GTPPQ F+V+FDTGSSNLW+PS+ C +IAC + +
Sbjct: 69 QYYGVISIGTPPQSFRVVFDTGSSNLWVPSKKCKWTDIACWLHR 112
>gi|91093044|ref|XP_966517.1| PREDICTED: similar to cathepsin D isoform 1 [Tribolium castaneum]
gi|270002651|gb|EEZ99098.1| hypothetical protein TcasGA2_TC004989 [Tribolium castaneum]
Length = 384
Score = 116 bits (290), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 58/115 (50%), Positives = 75/115 (65%), Gaps = 2/115 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-KNVYCSN-CQAIA 118
R+ S GGEI+ GG D + GD TY V R+ YWQF ++SI + KN +C+N C+AIA
Sbjct: 209 RDPSAAQGGEIILGGSDPAHYKGDFTYLSVDRQAYWQFKMDSISVGGKNTFCANGCEAIA 268
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
DTGTSLI GP + +NK IGA P+ G VDC+ + +P +D LGGKNFTL
Sbjct: 269 DTGTSLIAGPVSEVQGINKAIGATPIVGGEYMVDCNLIPNLPLIDFTLGGKNFTL 323
Score = 72.0 bits (175), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 30/41 (73%), Positives = 34/41 (82%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQYYG IS+G PPQ FKV+FDTGSSNLW+PS+ C NIAC
Sbjct: 62 AQYYGPISIGNPPQNFKVVFDTGSSNLWVPSKKCHYTNIAC 102
>gi|9581805|emb|CAC00543.1| necepsin II [Necator americanus]
Length = 446
Score = 116 bits (290), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 56/115 (48%), Positives = 75/115 (65%), Gaps = 2/115 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI-EKNVYCSN-CQAIA 118
RN E GGEI FGGVD ++V IT++PV+R+GYWQF ++ ++ ++ C N CQAIA
Sbjct: 240 RNPESEIGGEITFGGVDTRRYVEPITWTPVTRRGYWQFKMDMVQGGSSSIACPNGCQAIA 299
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
DTGTSLI GP + + K IGA PL G + CD + +P+V I+ GK FTL
Sbjct: 300 DTGTSLIAGPKAQVEAIQKYIGAEPLMKGEYMIPCDKVPSLPDVSFIIDGKTFTL 354
Score = 69.7 bits (169), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 43/113 (38%), Positives = 55/113 (48%), Gaps = 28/113 (24%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHF 60
+R+ AQYYG I +GTP Q F VIFDTGSSNLW+PS+ C +IAC + H +
Sbjct: 87 LRNYMDAQYYGVIQIGTPAQNFTVIFDTGSSNLWVPSRKCPFYDIACMLH------HRYD 140
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIK--IEKNVYC 111
AS TY RK Q+G S+K I K++ C
Sbjct: 141 SGASS--------------------TYKEDGRKMAIQYGTGSMKGFISKDIVC 173
>gi|262232673|gb|ACY38599.1| cathepsin D-like aspartic protease [Anisakis simplex]
Length = 453
Score = 115 bits (289), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 56/116 (48%), Positives = 76/116 (65%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEK-NVYCSN-CQAIA 118
R DE GGEI FGG D FV I Y+PV+R+ YWQF ++ I + + CS+ CQAIA
Sbjct: 253 RTPDDEIGGEITFGGTDPKHFVEPIVYAPVTRRAYWQFKMDKISGQDGTLACSDGCQAIA 312
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSLI GP + + + K IGA PL +G + CD + +P+V +++GGK F+LT
Sbjct: 313 DTGTSLIAGPKQQVQLIQKYIGAEPLFSGEYMIPCDKVPSLPDVSLVIGGKTFSLT 368
Score = 73.6 bits (179), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 31/47 (65%), Positives = 37/47 (78%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+R+ AQYYG IS+GTPPQ F VIFDTGSSNLW+PS+ C +IAC
Sbjct: 100 LRNYMDAQYYGVISIGTPPQNFTVIFDTGSSNLWVPSRKCKWTDIAC 146
>gi|241813645|ref|XP_002416518.1| aspartic protease, putative [Ixodes scapularis]
gi|215510982|gb|EEC20435.1| aspartic protease, putative [Ixodes scapularis]
Length = 392
Score = 115 bits (288), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 54/117 (46%), Positives = 78/117 (66%), Gaps = 3/117 (2%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVY---CSNCQAI 117
RN + GGEI FGG D + GDI+Y PVS KGYWQF V++I ++ + + C+AI
Sbjct: 213 RNTTSPTGGEITFGGTDDRFYSGDISYVPVSTKGYWQFTVDNIVVKNSSFKLCAEGCEAI 272
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
ADTGTSL+ GPS I +L KLIGA+P ++G V C ++ ++P++ +GG+ + LT
Sbjct: 273 ADTGTSLMAGPSLEIMKLQKLIGALPFSHGQYTVRCQDIHQLPDIKFHIGGQEYVLT 329
Score = 74.7 bits (182), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 34/63 (53%), Positives = 43/63 (68%), Gaps = 4/63 (6%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASDE 66
AQYYG IS+G+PPQ F+V+FDTGSSNLW+PS+ C NIAC + K + H R+ S
Sbjct: 66 AQYYGPISIGSPPQPFRVVFDTGSSNLWVPSKQCKWTNIACLLHKK----YDHTRSRSYR 121
Query: 67 NGG 69
G
Sbjct: 122 KNG 124
>gi|194376172|dbj|BAG62845.1| unnamed protein product [Homo sapiens]
Length = 282
Score = 115 bits (288), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 68/208 (32%), Positives = 105/208 (50%), Gaps = 58/208 (27%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACS------------------- 48
+Y+GTI +GTP Q+F V+FDTGSSNLW+PS +CS ++AC+
Sbjct: 75 EYFGTIGIGTPAQDFTVLFDTGSSNLWVPSVYCS--SLACTNHNRFNPEDSSTYQSTSET 132
Query: 49 --------------------------------VDKDKFVGHSHFRNASDENGGEIMFGGV 76
V +D F S + +A D++G ++FGG+
Sbjct: 133 VSITYGISSLPDIVFTINGVQYPVPPSAYILLVSQDLF---SVYLSADDQSGSVVIFGGI 189
Query: 77 DKDKFVGDITYSPVSRKGYWQFGVESIKIE-KNVYCSN-CQAIADTGTSLIIGPSKVIAE 134
D + G + + PV+ +GYWQ V+SI + + + C+ CQAI DTGTSL+ GP+ IA
Sbjct: 190 DSSYYTGSLNWVPVTVEGYWQITVDSITMNGEAIACAEGCQAIVDTGTSLLTGPTSPIAN 249
Query: 135 LNKLIGAVPLANGPAKVDCDNLDKMPNV 162
+ IGA ++G V C + +P++
Sbjct: 250 IQSDIGASENSDGDMVVSCSAISSLPDI 277
>gi|308483047|ref|XP_003103726.1| CRE-ASP-4 protein [Caenorhabditis remanei]
gi|308259744|gb|EFP03697.1| CRE-ASP-4 protein [Caenorhabditis remanei]
Length = 462
Score = 115 bits (288), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 55/114 (48%), Positives = 73/114 (64%), Gaps = 1/114 (0%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN-CQAIAD 119
RN + GGEI FGG+D ++V ITY+PV+RKGYWQF ++ + + CSN CQAIAD
Sbjct: 256 RNPDSDLGGEITFGGIDPRRYVEPITYTPVTRKGYWQFKMDKVVGSGVLGCSNGCQAIAD 315
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
TGTSLI GP I + IGA PL G + CD + +P V ++GG+ F+L
Sbjct: 316 TGTSLIAGPKAQIEAIQNFIGAEPLIKGEYMISCDKIPTLPPVSFVIGGQEFSL 369
Score = 71.6 bits (174), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 30/45 (66%), Positives = 37/45 (82%)
Query: 5 YQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSV 49
+QAQY+GTIS+GTP Q F VIFDTGSSNLW+PS+ C +IAC +
Sbjct: 107 FQAQYFGTISIGTPGQNFTVIFDTGSSNLWVPSKKCPFYDIACML 151
>gi|17549909|ref|NP_510191.1| Protein ASP-4 [Caenorhabditis elegans]
gi|3879202|emb|CAA90633.1| Protein ASP-4 [Caenorhabditis elegans]
Length = 444
Score = 115 bits (288), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 56/114 (49%), Positives = 72/114 (63%), Gaps = 1/114 (0%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN-CQAIAD 119
RN E GGEI FGG+D ++V ITY PV+RKGYWQF ++ + + CSN CQAIAD
Sbjct: 239 RNPDSEIGGEITFGGIDSRRYVEPITYVPVTRKGYWQFKMDKVVGSGVLGCSNGCQAIAD 298
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
TGTSLI GP I + IGA PL G + CD + +P V ++GG+ F+L
Sbjct: 299 TGTSLIAGPKAQIEAIQNFIGAEPLIKGEYMISCDKVPTLPPVSFVIGGQEFSL 352
Score = 72.0 bits (175), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 31/47 (65%), Positives = 37/47 (78%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+R+ AQY+GTIS+GTP Q F VIFDTGSSNLWIPS+ C +IAC
Sbjct: 86 LRNYMDAQYFGTISIGTPAQNFTVIFDTGSSNLWIPSKKCPFYDIAC 132
>gi|195159706|ref|XP_002020719.1| GL15694 [Drosophila persimilis]
gi|194117669|gb|EDW39712.1| GL15694 [Drosophila persimilis]
Length = 401
Score = 115 bits (287), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 58/125 (46%), Positives = 76/125 (60%), Gaps = 2/125 (1%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKN 108
+D+ F G RN S E GG+++ GGVD+ GD+TY PVS++GYWQF V +I
Sbjct: 222 IDEPTF-GFYLARNGSSEEGGQLLLGGVDETLMAGDLTYVPVSQEGYWQFSVNNISWNGT 280
Query: 109 VYCSNCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGG 168
V C CQAIADTGTSL+ P V ++N+LIGAV L G + C LD +P + +GG
Sbjct: 281 VLCDGCQAIADTGTSLLACPQAVYTQINQLIGAV-LIEGSNYIPCATLDSLPVLSFNIGG 339
Query: 169 KNFTL 173
F L
Sbjct: 340 TTFDL 344
Score = 65.1 bits (157), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 27/39 (69%), Positives = 31/39 (79%)
Query: 9 YYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
YYG IS+GTPPQ F V+FDTGSSNLWIPS C ++AC
Sbjct: 88 YYGQISIGTPPQYFNVVFDTGSSNLWIPSAQCLSTDVAC 126
>gi|341884635|gb|EGT40570.1| CBN-ASP-4 protein [Caenorhabditis brenneri]
Length = 447
Score = 115 bits (287), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 55/114 (48%), Positives = 73/114 (64%), Gaps = 1/114 (0%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN-CQAIAD 119
RN + GGEI FGG+D ++V ITY+PV+RKGYWQF ++ + + CSN CQAIAD
Sbjct: 240 RNPDSDLGGEITFGGIDSRRYVEPITYAPVTRKGYWQFKMDKVVGSGVLGCSNGCQAIAD 299
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
TGTSLI GP I + IGA PL G + CD + +P V ++GG+ F+L
Sbjct: 300 TGTSLIAGPKAQIEAIQNFIGAEPLIKGEYMISCDKVPTLPPVSFVIGGQEFSL 353
Score = 71.2 bits (173), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 31/49 (63%), Positives = 38/49 (77%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSV 49
+R+ AQY+GTIS+GTP Q F VIFDTGSSNLWIPS+ C +IAC +
Sbjct: 87 LRNYMDAQYFGTISIGTPGQNFTVIFDTGSSNLWIPSKKCPFYDIACML 135
>gi|198475392|ref|XP_001357030.2| GA17303 [Drosophila pseudoobscura pseudoobscura]
gi|198138802|gb|EAL34096.2| GA17303 [Drosophila pseudoobscura pseudoobscura]
Length = 401
Score = 115 bits (287), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 58/125 (46%), Positives = 76/125 (60%), Gaps = 2/125 (1%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKN 108
+D+ F G RN S E GG+++ GGVD+ GD+TY PVS++GYWQF V +I
Sbjct: 222 IDEPTF-GFYLARNGSSEEGGQLLLGGVDETLMAGDLTYVPVSQEGYWQFSVNNISWNGT 280
Query: 109 VYCSNCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGG 168
V C CQAIADTGTSL+ P V ++N+LIGAV L G + C LD +P + +GG
Sbjct: 281 VLCDGCQAIADTGTSLLACPQAVYTQINQLIGAV-LIEGSNYIPCATLDSLPVLSFNIGG 339
Query: 169 KNFTL 173
F L
Sbjct: 340 TTFDL 344
Score = 64.7 bits (156), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 27/39 (69%), Positives = 31/39 (79%)
Query: 9 YYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
YYG IS+GTPPQ F V+FDTGSSNLWIPS C ++AC
Sbjct: 88 YYGQISIGTPPQYFNVVFDTGSSNLWIPSAQCLSTDVAC 126
>gi|383859202|ref|XP_003705085.1| PREDICTED: lysosomal aspartic protease-like [Megachile rotundata]
Length = 384
Score = 115 bits (287), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 54/115 (46%), Positives = 76/115 (66%), Gaps = 2/115 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYC--SNCQAIA 118
R+ + E GGE++ GG D + +VG TY PV +KGYWQF ++ +++ +V C+AIA
Sbjct: 209 RDPNAEFGGEMILGGSDPNHYVGPFTYVPVDKKGYWQFAMDRVEVGSDVKVCEKGCEAIA 268
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
DTGTSLI GP K I LNK IGA P+A G A V+CD + +P + + GG++F L
Sbjct: 269 DTGTSLIAGPVKEIELLNKKIGATPIAAGEAMVECDKIPDLPTITFVFGGRSFPL 323
Score = 75.5 bits (184), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 30/43 (69%), Positives = 38/43 (88%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSV 49
AQYYG I++GTPPQ+F+VIFDTGSSNLW+PS+ C + NIAC +
Sbjct: 62 AQYYGVINIGTPPQKFRVIFDTGSSNLWVPSKKCHLTNIACKL 104
>gi|195997283|ref|XP_002108510.1| expressed hypothetical protein [Trichoplax adhaerens]
gi|190589286|gb|EDV29308.1| expressed hypothetical protein [Trichoplax adhaerens]
Length = 389
Score = 114 bits (286), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 58/130 (44%), Positives = 79/130 (60%), Gaps = 7/130 (5%)
Query: 52 DKFVGHSHF-----RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE 106
K VG S F R + GGE++ GG D + G+ TY PV++KGYWQF ++ I ++
Sbjct: 199 QKQVGKSVFSFYLNRKEGAKPGGELILGGSDSKHYSGNFTYLPVTKKGYWQFKMDGISVK 258
Query: 107 -KNVYCSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDI 164
K +C CQAIADTGTSL+ GP+ + ++ LIGA PL NG +DC + +P +
Sbjct: 259 GKGSFCKGGCQAIADTGTSLLAGPTAEVNKIQTLIGATPLLNGEYTIDCSKISSLPPITF 318
Query: 165 ILGGKNFTLT 174
LGGK FTLT
Sbjct: 319 TLGGKKFTLT 328
Score = 79.3 bits (194), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 33/41 (80%), Positives = 37/41 (90%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQYYG IS+GTPPQ+FKV+FDTGSSNLW+PS CS LNIAC
Sbjct: 66 AQYYGEISIGTPPQKFKVLFDTGSSNLWVPSSKCSFLNIAC 106
>gi|350411706|ref|XP_003489428.1| PREDICTED: lysosomal aspartic protease-like [Bombus impatiens]
Length = 386
Score = 114 bits (285), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 55/117 (47%), Positives = 77/117 (65%), Gaps = 3/117 (2%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCSN-CQAI 117
RN D+ GGE++ GG D + + G TY PV RKGYWQF ++ IK+ + C C+AI
Sbjct: 210 RNPDDKAGGELILGGSDPNHYEGPFTYVPVDRKGYWQFRMDGIKVGSQHLAICEKGCEAI 269
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
ADTGTSLI GP K + +N IGA +A G A VDC+++ +P ++ +LGG++F LT
Sbjct: 270 ADTGTSLIAGPVKEVEAINSAIGATNIAAGEAMVDCNSIPNLPTINFVLGGRSFPLT 326
Score = 75.9 bits (185), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 32/43 (74%), Positives = 37/43 (86%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSV 49
AQYYG IS+GTP Q+FKVIFDTGSSNLW+PSQ C + NIAC +
Sbjct: 63 AQYYGVISIGTPSQDFKVIFDTGSSNLWVPSQKCHLTNIACKL 105
>gi|224548868|dbj|BAH24176.1| aspartic proteinase [Sitophilus zeamais]
Length = 389
Score = 114 bits (285), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 54/114 (47%), Positives = 76/114 (66%), Gaps = 1/114 (0%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN-CQAIAD 119
R+ + + GGEI+ GG D + + GD TY PV R+ YWQF ++S+++ C C+AIAD
Sbjct: 213 RDPNAQTGGEIILGGSDPNYYEGDFTYLPVDRQAYWQFKMDSVQVADQSLCKGGCEAIAD 272
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
TGTSLI GP++ IA LNK IGA + G VDC+++ +P ++I LGG FTL
Sbjct: 273 TGTSLIAGPTEEIAALNKAIGASAIVGGEYIVDCNSISSLPKINITLGGNLFTL 326
Score = 78.2 bits (191), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 50/123 (40%), Positives = 66/123 (53%), Gaps = 23/123 (18%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSV-DKDKFVGHSHFRNASD 65
AQYYG IS+GTPPQ F VIFDTGSSNLW+PS+ C +LNIAC + +K S ++
Sbjct: 66 AQYYGPISIGTPPQNFNVIFDTGSSNLWVPSKKCELLNIACLLHNKYDATKSSTYK---- 121
Query: 66 ENGGEIMFGGVDKDKFVGDITYSPVSRKGYW---QFGVESIKIEKNVYCSNCQAIADTGT 122
ENG E ITY S GY V S++++ + +AI + G
Sbjct: 122 ENGTEFA------------ITYGSGSLSGYLSTDSLSVGSVQVKDQTF---GEAIKEPGL 166
Query: 123 SLI 125
+ I
Sbjct: 167 TFI 169
>gi|24583545|ref|NP_609457.1| CG6508 [Drosophila melanogaster]
gi|7297765|gb|AAF53015.1| CG6508 [Drosophila melanogaster]
Length = 423
Score = 114 bits (285), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 52/114 (45%), Positives = 77/114 (67%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSNCQAIADT 120
R+ + ++GGE+++GG+D+ + G I Y PVS YWQF S+KIE + C+ CQAIADT
Sbjct: 218 RDGTSQSGGEVIWGGIDRSIYRGCINYVPVSMPAYWQFTANSVKIEGILLCNGCQAIADT 277
Query: 121 GTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
GTSLI P + +NK++ A +G A VDC +L ++PNV++ +GG +TLT
Sbjct: 278 GTSLIAVPLRAYKAINKVLNATDAGDGEAFVDCSSLCRLPNVNLNIGGTTYTLT 331
Score = 53.5 bits (127), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 27/74 (36%), Positives = 41/74 (55%), Gaps = 1/74 (1%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHF 60
+ + + +Y + +GTPPQ F + FDTGSS+LW+PS CS N AC +K+ +
Sbjct: 66 LANGFNLEYTIRLCIGTPPQCFNLQFDTGSSDLWVPSVKCSSTNEACQ-KHNKYNSSASS 124
Query: 61 RNASDENGGEIMFG 74
+ D G I +G
Sbjct: 125 SHVEDGKGFSIQYG 138
>gi|321472775|gb|EFX83744.1| hypothetical protein DAPPUDRAFT_92408 [Daphnia pulex]
Length = 379
Score = 114 bits (284), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 51/117 (43%), Positives = 76/117 (64%), Gaps = 4/117 (3%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV---YCSN-CQA 116
RN GGEI FGG D +++ G+I+++PV+RK YWQF V+ +++ +C CQ
Sbjct: 201 RNPDAAQGGEITFGGADPERYTGEISWAPVTRKAYWQFKVDGVQVSNEADGAFCQGGCQM 260
Query: 117 IADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
IADTGTSLI GP I +LN LIG +P+ G ++C +D++P + +GGK+F+L
Sbjct: 261 IADTGTSLIAGPVDEIKKLNTLIGGIPIMAGEYFINCSRIDELPTISFSIGGKSFSL 317
Score = 75.1 bits (183), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 29/47 (61%), Positives = 38/47 (80%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+++ AQYYG I+LGTPPQ F V+FDTGS+NLW+PS HC + N+AC
Sbjct: 48 LKNYMDAQYYGQITLGTPPQTFNVVFDTGSANLWVPSTHCHLTNLAC 94
>gi|195339959|ref|XP_002036584.1| GM18741 [Drosophila sechellia]
gi|194130464|gb|EDW52507.1| GM18741 [Drosophila sechellia]
Length = 378
Score = 113 bits (283), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 52/114 (45%), Positives = 74/114 (64%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSNCQAIADT 120
R+ + + GGE+++GG+D+ + G I Y PVS YWQF S+K+ + C+ CQAIADT
Sbjct: 173 RDGTSQLGGEVIWGGIDRSIYRGCINYVPVSMPAYWQFTANSVKVRDILLCNGCQAIADT 232
Query: 121 GTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
GTSL P + +NK++ A NG A VDC NL K+PNV++ +GG +TLT
Sbjct: 233 GTSLFAVPLRAYRAINKVLNATDAGNGEAFVDCSNLCKLPNVNLNIGGTTYTLT 286
Score = 37.7 bits (86), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 19/49 (38%), Positives = 26/49 (53%), Gaps = 1/49 (2%)
Query: 26 FDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASDENGGEIMFG 74
FDTGSS+LW+PS C N AC +K+ + + D G I +G
Sbjct: 46 FDTGSSDLWVPSAKCPSTNEACQ-KHNKYNSSASSSHVEDGRGFSIQYG 93
>gi|195471990|ref|XP_002088285.1| GE18490 [Drosophila yakuba]
gi|194174386|gb|EDW87997.1| GE18490 [Drosophila yakuba]
Length = 403
Score = 113 bits (282), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 52/114 (45%), Positives = 75/114 (65%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSNCQAIADT 120
R+ S ++GGE+++GG+D+ + G I Y PVS YWQF V S+K+ + C+ CQAIADT
Sbjct: 199 RDGSSQSGGEVIWGGIDRSIYRGCINYVPVSLPAYWQFTVNSVKVRDILLCNGCQAIADT 258
Query: 121 GTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
GTSLI+ P + +N+ + A G A VDC NL K+PN ++ +GG +TLT
Sbjct: 259 GTSLIVVPLRAYNAINRALNATDNGEGEAFVDCSNLCKLPNFNLNIGGSTYTLT 312
Score = 53.1 bits (126), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 25/74 (33%), Positives = 41/74 (55%), Gaps = 1/74 (1%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHF 60
+ + + +Y + +GTPPQ F + FDTGSS+LW+PS C N+AC + +K+ +
Sbjct: 47 LTNGFNLEYTIRVCIGTPPQCFNLQFDTGSSDLWVPSVKCPSTNVACQ-EHNKYNSSASS 105
Query: 61 RNASDENGGEIMFG 74
D G + +G
Sbjct: 106 TYVEDGGGFSLEYG 119
>gi|195349117|ref|XP_002041093.1| GM15229 [Drosophila sechellia]
gi|194122698|gb|EDW44741.1| GM15229 [Drosophila sechellia]
Length = 395
Score = 113 bits (282), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 49/113 (43%), Positives = 73/113 (64%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSNCQAIADT 120
RN SD GGE++FGGVDK KF G +TY P++ GYWQF +++I++ + QAIADT
Sbjct: 226 RNGSDRKGGELLFGGVDKTKFSGSLTYVPLTHAGYWQFPLDAIEVAGTRISQHRQAIADT 285
Query: 121 GTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
GTSL+ P + +N L+G +P +N ++C +D +P + I+GG+ F L
Sbjct: 286 GTSLLAAPPREYLIINSLLGGLPTSNNEYLLNCSEIDSLPEIVFIIGGQRFGL 338
Score = 57.4 bits (137), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 27/57 (47%), Positives = 36/57 (63%), Gaps = 8/57 (14%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNAS 64
+Y G IS+G+P Q F ++FDTGS+NLW+PS CS ++AC H H NAS
Sbjct: 80 EYAGPISIGSPGQPFNMLFDTGSANLWVPSAECSPKSVACH--------HHHRYNAS 128
>gi|336454164|gb|AEI58896.1| cathepsin D [Pinctada maxima]
Length = 390
Score = 113 bits (282), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 55/115 (47%), Positives = 75/115 (65%), Gaps = 2/115 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAIA 118
R+ S GGE++ GG D + G+ TY PV+R+GYWQF ++ + + + +CS C AIA
Sbjct: 214 RDPSASEGGELILGGSDTKHYKGNFTYLPVTRQGYWQFEMDGVSVGGSAKFCSGGCNAIA 273
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
DTGTSLI GP+ I++LNK IGA PL G VDC+ + K+P + LGGK F L
Sbjct: 274 DTGTSLIAGPTSEISKLNKAIGAKPLVAGEYTVDCNAIPKLPKITFTLGGKQFDL 328
Score = 69.7 bits (169), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 28/41 (68%), Positives = 34/41 (82%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQYYG I +GTP Q FKV+FDTGSSNLW+PS+ C + +IAC
Sbjct: 67 AQYYGVIGIGTPAQNFKVVFDTGSSNLWVPSKKCKVTDIAC 107
>gi|195578343|ref|XP_002079025.1| GD23735 [Drosophila simulans]
gi|194191034|gb|EDX04610.1| GD23735 [Drosophila simulans]
Length = 404
Score = 112 bits (281), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 52/114 (45%), Positives = 76/114 (66%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSNCQAIADT 120
R+ + + GGE+++GG+D+ + G I Y PVS YWQF S+K+ + C+ CQAIADT
Sbjct: 199 RDGTSQLGGEVIWGGIDRSIYRGCINYVPVSMPTYWQFTANSVKVRDILLCNGCQAIADT 258
Query: 121 GTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
GTSLI P + +NK++ A +G A VDC+NL K+PNV++ +GG +TLT
Sbjct: 259 GTSLIAVPLRAYKAINKVLNATDAGDGEAFVDCNNLCKLPNVNLNIGGTTYTLT 312
Score = 52.8 bits (125), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 26/74 (35%), Positives = 40/74 (54%), Gaps = 1/74 (1%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHF 60
+ + + +Y + +GTPPQ F + FDTGSS+LW+PS C N AC +K+ +
Sbjct: 47 LANGFNLEYTIRLCIGTPPQCFNLQFDTGSSDLWVPSAKCPSTNEACQ-KHNKYNSSASS 105
Query: 61 RNASDENGGEIMFG 74
+ D G I +G
Sbjct: 106 SHVEDGKGFSIQYG 119
>gi|194745302|ref|XP_001955127.1| GF16402 [Drosophila ananassae]
gi|190628164|gb|EDV43688.1| GF16402 [Drosophila ananassae]
Length = 300
Score = 112 bits (281), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 50/113 (44%), Positives = 72/113 (63%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSNCQAIADT 120
RN S+ GGE++FGGVD KF G +TY P++ GYWQF ++ ++I V + QAIADT
Sbjct: 131 RNGSERKGGELLFGGVDSTKFSGTLTYVPLTHAGYWQFQMDGVEIGGTVISRHRQAIADT 190
Query: 121 GTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
GTSL+ P + +N L+G +P N ++C LDK+P + I+GG+ F L
Sbjct: 191 GTSLLAAPPREYLIINSLLGGLPTTNNEYLLNCSGLDKLPEIVFIIGGQRFGL 243
Score = 40.4 bits (93), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 14/24 (58%), Positives = 20/24 (83%)
Query: 24 VIFDTGSSNLWIPSQHCSILNIAC 47
++FDTGS+N+W+PS CS N+AC
Sbjct: 1 MLFDTGSANIWVPSAECSAKNLAC 24
>gi|194246053|gb|ACF35518.1| putative aspartic protease [Dermacentor variabilis]
Length = 246
Score = 112 bits (280), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 50/116 (43%), Positives = 80/116 (68%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI-EKNVYCS-NCQAIA 118
RN +D NGGE++FGG+D+ + GDITY PV++KGYWQF ++ + + +K +C C A+A
Sbjct: 71 RNVADPNGGEVVFGGIDEAHYKGDITYLPVTKKGYWQFNMDGVSVADKATFCEGGCAAVA 130
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTS+++ PS+ + ++NKL+GA + G VDC++L +P + L + F L+
Sbjct: 131 DTGTSMLVAPSEDVKKINKLLGAKEASPGQYLVDCESLPSLPKITFHLNKREFVLS 186
>gi|119567604|gb|ABL84270.1| aspartic protease [Musca domestica]
Length = 379
Score = 112 bits (280), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 56/114 (49%), Positives = 72/114 (63%), Gaps = 1/114 (0%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSNCQAIADT 120
R+ + +GG+++ GGVD + GDITY PVS +GYWQF V S I+ C NCQAIADT
Sbjct: 214 RDGTSTDGGQMILGGVDSSLYTGDITYVPVSSQGYWQFEVTSGAIKGQSICDNCQAIADT 273
Query: 121 GTSLIIGPSKVIAELNKLIGAV-PLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
GTSLI+ PS LN IGA +G VDC +D +P+V ++GG FTL
Sbjct: 274 GTSLIVAPSDAYNTLNAEIGATYNEDDGNYYVDCSAVDSLPDVTFVIGGTTFTL 327
Score = 72.4 bits (176), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 29/47 (61%), Positives = 38/47 (80%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+ +S +YYG I++GTPPQ+F V+FDTGSSNLW+PS HC I +IAC
Sbjct: 61 LENSLNMKYYGDITIGTPPQKFVVLFDTGSSNLWVPSSHCWIWDIAC 107
>gi|291223847|ref|XP_002731917.1| PREDICTED: putative gut cathepsin D-like aspartic protease-like
[Saccoglossus kowalevskii]
Length = 389
Score = 112 bits (279), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 52/116 (44%), Positives = 75/116 (64%), Gaps = 3/116 (2%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCSN-CQAI 117
RN +D GGE+ GG D + G+ TY PV+RKGYWQF ++ I + + +C CQAI
Sbjct: 213 RNVNDSQGGELFLGGSDPKYYTGNFTYVPVTRKGYWQFKMDGITLGGSASQFCKGGCQAI 272
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
ADTGTSLI GP++ + +NK IGA P+ +G V+C+ +D +P++ +L K F L
Sbjct: 273 ADTGTSLIAGPTEEVQAINKAIGATPIVSGEYMVNCNKIDSLPDITFVLNNKPFIL 328
Score = 70.5 bits (171), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 30/51 (58%), Positives = 38/51 (74%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDK 51
+++ AQYYG I LGTP Q+F V+FDTGSSNLW+PS+ C I +IAC K
Sbjct: 60 LKNYLDAQYYGEIGLGTPQQKFNVVFDTGSSNLWVPSKKCPITDIACLFHK 110
>gi|205289916|gb|ACI02330.1| aspartic protease 1 [Uncinaria stenocephala]
Length = 447
Score = 112 bits (279), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 51/115 (44%), Positives = 75/115 (65%), Gaps = 2/115 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI-EKNVYCSN-CQAIA 118
RN E GGEI GG+D ++V +T++PV+R+GYWQF ++ ++ ++ C N CQAIA
Sbjct: 241 RNPDSELGGEITLGGMDPRRYVEPLTWTPVTRRGYWQFKMDMVQGGSSSIACPNGCQAIA 300
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
DTGTSLI GP + + K IGA PL G + CD + +P++ ++GG+ FTL
Sbjct: 301 DTGTSLIAGPKAQVEAIQKFIGAEPLMRGEYMIPCDKVPSLPDLSFVIGGQTFTL 355
Score = 69.7 bits (169), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 29/49 (59%), Positives = 37/49 (75%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSV 49
+R+ AQY+GTI +GTP Q F VIFDTGSSNLW+PS+ C +IAC +
Sbjct: 88 LRNYMDAQYFGTIQIGTPAQNFTVIFDTGSSNLWVPSRKCPFYDIACML 136
>gi|226437842|gb|ACO56332.1| putative gut cathepsin D-like aspartic protease [Callosobruchus
maculatus]
Length = 389
Score = 112 bits (279), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 50/115 (43%), Positives = 75/115 (65%), Gaps = 2/115 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCSNCQAIA 118
R+ + GGE++ GG DK + G+ TY PV R+ YWQF ++ +++ E + C+AIA
Sbjct: 214 RDPAAPQGGELILGGSDKAHYKGEFTYLPVDRQAYWQFKMDKVQVGPETTLCAKGCEAIA 273
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
DTGTSLI GPS+ + +NK IGA P+ G V C+++ K+P ++ +LGGK F L
Sbjct: 274 DTGTSLIAGPSEEVKAINKAIGATPIMGGEYLVSCESIPKLPTINFVLGGKPFAL 328
Score = 74.3 bits (181), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 32/41 (78%), Positives = 35/41 (85%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQYYG IS+GTPPQ FKVIFDTGSSNLW+PS+ C NIAC
Sbjct: 67 AQYYGPISIGTPPQTFKVIFDTGSSNLWVPSKLCHFTNIAC 107
>gi|195570151|ref|XP_002103072.1| GD19155 [Drosophila simulans]
gi|194198999|gb|EDX12575.1| GD19155 [Drosophila simulans]
Length = 395
Score = 111 bits (278), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 48/113 (42%), Positives = 73/113 (64%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSNCQAIADT 120
RN S+ GGE++FGGVDK KF G +TY P++ GYWQF +++I++ + QAIADT
Sbjct: 226 RNGSERKGGELLFGGVDKTKFSGSLTYVPLTHAGYWQFPLDAIEVAGTRITQHRQAIADT 285
Query: 121 GTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
GTSL+ P + +N L+G +P +N ++C +D +P + I+GG+ F L
Sbjct: 286 GTSLLAAPPREYLIINSLLGGLPTSNNEYLLNCSEIDSLPEIVFIIGGQRFGL 338
Score = 55.8 bits (133), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 26/57 (45%), Positives = 36/57 (63%), Gaps = 8/57 (14%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNAS 64
+Y G IS+G+P Q F ++FDTGS+NLW+PS CS ++AC H H N+S
Sbjct: 80 EYAGPISIGSPGQPFNMLFDTGSANLWVPSAECSPKSVACH--------HHHRYNSS 128
>gi|126681053|gb|ABO26561.1| cathepsin D-like aspartic protease [Ixodes ricinus]
Length = 382
Score = 111 bits (277), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 55/120 (45%), Positives = 77/120 (64%), Gaps = 7/120 (5%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQF---GVESIKIEKN---VYCSN- 113
R+ S + GGE++FGG+D D + G ITY PV+ KGYWQF GV+S+ K+ + C +
Sbjct: 205 RDPSSKEGGEVLFGGIDHDHYKGSITYVPVTAKGYWQFHVDGVKSVSASKSAPELLCKDG 264
Query: 114 CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
C+AIADTGTSLI GP + + LN+ +G G +DCD L+ +PNV + GK F+L
Sbjct: 265 CEAIADTGTSLITGPPEEVDSLNQYLGGTKTEGGQYLLDCDKLESLPNVTFTISGKEFSL 324
Score = 62.0 bits (149), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 28/57 (49%), Positives = 38/57 (66%), Gaps = 10/57 (17%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNAS 64
+YYG IS+GTPPQ+F+VIFDTGS+NLW+PS C+ K+ H H ++S
Sbjct: 60 EYYGPISIGTPPQDFQVIFDTGSANLWLPSSKCTT----------KYCLHHHRYDSS 106
>gi|24647683|ref|NP_650623.1| CG5863 [Drosophila melanogaster]
gi|7300255|gb|AAF55418.1| CG5863 [Drosophila melanogaster]
Length = 395
Score = 111 bits (277), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 49/113 (43%), Positives = 72/113 (63%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSNCQAIADT 120
RN S+ GGE++FGGVDK KF G +TY P++ GYWQF ++ I++ N QAIADT
Sbjct: 226 RNGSERKGGELLFGGVDKTKFSGSLTYVPLTHAGYWQFPLDVIEVAGTRINQNRQAIADT 285
Query: 121 GTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
GTSL+ P + +N L+G +P +N ++C +D +P + I+GG+ F L
Sbjct: 286 GTSLLAAPPREYLIINSLLGGLPTSNNEYLLNCSEIDSLPEIVFIIGGQRFGL 338
Score = 57.4 bits (137), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 27/57 (47%), Positives = 36/57 (63%), Gaps = 8/57 (14%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNAS 64
+Y G IS+G+P Q F ++FDTGS+NLW+PS CS ++AC H H NAS
Sbjct: 80 EYAGPISIGSPGQPFNMLFDTGSANLWVPSAECSPKSVACH--------HHHRYNAS 128
>gi|125984612|ref|XP_001356070.1| GA14340 [Drosophila pseudoobscura pseudoobscura]
gi|54644388|gb|EAL33129.1| GA14340 [Drosophila pseudoobscura pseudoobscura]
Length = 387
Score = 110 bits (276), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 53/113 (46%), Positives = 71/113 (62%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSNCQAIADT 120
R + GGE++ GGVD + G++TY PVS GYWQF V S+K + CS CQAIADT
Sbjct: 216 RQGTAVQGGELILGGVDPSLYTGNLTYVPVSVAGYWQFKVNSVKSGGFLLCSGCQAIADT 275
Query: 121 GTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
GTSLI+ P A++N L+GA G A V C ++ +P V++ +GG FTL
Sbjct: 276 GTSLIVVPEAAYAKINSLLGATDNGEGEAFVKCADVSSLPKVNLNIGGTIFTL 328
Score = 63.2 bits (152), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 24/47 (51%), Positives = 35/47 (74%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
++++ +YYG I +GTP Q F+V+FDTGS+NLW+PS C N+AC
Sbjct: 63 LQNTLNMEYYGLIGIGTPEQIFRVLFDTGSANLWVPSAKCPSTNVAC 109
>gi|195161645|ref|XP_002021673.1| GL26637 [Drosophila persimilis]
gi|194103473|gb|EDW25516.1| GL26637 [Drosophila persimilis]
Length = 387
Score = 110 bits (275), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 53/113 (46%), Positives = 71/113 (62%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSNCQAIADT 120
R + GGE++ GGVD + G++TY PVS GYWQF V S+K + CS CQAIADT
Sbjct: 216 RQGTAVQGGELILGGVDPSLYTGNLTYVPVSVAGYWQFKVNSVKSGGILLCSGCQAIADT 275
Query: 121 GTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
GTSLI+ P A++N L+GA G A V C ++ +P V++ +GG FTL
Sbjct: 276 GTSLIVVPEAAYAKINSLLGATDNGEGEAFVKCADVSSLPKVNLNIGGTIFTL 328
Score = 65.9 bits (159), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 39/128 (30%), Positives = 65/128 (50%), Gaps = 15/128 (11%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHF 60
++++ +YYG I +GTP Q F+V+FDTGS+NLW+PS C N+AC G S
Sbjct: 63 LQNTLNMEYYGLIGIGTPEQIFRVLFDTGSANLWVPSAKCPSTNVACQKHNQYHSGQSST 122
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSNCQAIADT 120
A+ E+ I +G F+ + T W G+E I++ + +A+ +
Sbjct: 123 YVANGES-FSIQYGTGSLTGFLSEDT--------VWVAGIE---IQQQTF---AEALNEP 167
Query: 121 GTSLIIGP 128
G++ + P
Sbjct: 168 GSTFVSAP 175
>gi|194900440|ref|XP_001979765.1| GG22202 [Drosophila erecta]
gi|190651468|gb|EDV48723.1| GG22202 [Drosophila erecta]
Length = 395
Score = 110 bits (275), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 48/113 (42%), Positives = 72/113 (63%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSNCQAIADT 120
RN S+ GGE++FGGVDK KF G +TY P++ GYWQF ++ I++ + QAIADT
Sbjct: 226 RNGSERMGGELLFGGVDKTKFSGSLTYVPLTHAGYWQFPLDGIELGGTTISRHRQAIADT 285
Query: 121 GTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
GTSL+ P + +N L+G +P +N ++C +D +P + I+GG+ F L
Sbjct: 286 GTSLLAAPPREYLIINSLLGGLPTSNNEYLLNCSEIDSLPEIVFIIGGRRFGL 338
Score = 59.3 bits (142), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 27/57 (47%), Positives = 37/57 (64%), Gaps = 8/57 (14%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNAS 64
+Y G IS+G+P Q F ++FDTGS+NLW+PS CS+ ++AC H H NAS
Sbjct: 80 EYAGPISIGSPGQPFNMLFDTGSANLWVPSAECSLKSVACH--------HHHRYNAS 128
>gi|241687194|ref|XP_002412838.1| aspartyl protease, putative [Ixodes scapularis]
gi|215506640|gb|EEC16134.1| aspartyl protease, putative [Ixodes scapularis]
Length = 320
Score = 110 bits (275), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 54/115 (46%), Positives = 74/115 (64%), Gaps = 2/115 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVE-SIKIEKNVYCSN-CQAIA 118
R+ S + GGEI+FGG+D D + G ITY PV+ KGYWQF V+ + K + C + C+AIA
Sbjct: 148 RDPSSKEGGEILFGGIDHDHYKGSITYVPVTAKGYWQFHVDGASKSVPELLCKDGCEAIA 207
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
DTGTSLI GP + + LN+ +G G +DCD L+ +PNV + GK F+L
Sbjct: 208 DTGTSLITGPPEEVDSLNQYLGGTKTEGGQYLLDCDKLESLPNVTFTISGKEFSL 262
Score = 60.8 bits (146), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 27/58 (46%), Positives = 38/58 (65%), Gaps = 10/58 (17%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNAS 64
+YYG I++GTPPQ+F+VIFDTGS+NLW+PS C+ K+ H H ++S
Sbjct: 2 VEYYGPITIGTPPQDFQVIFDTGSANLWLPSSKCTT----------KYCLHHHRYDSS 49
>gi|3378673|emb|CAA08878.1| Cathepsin D [Podarcis siculus]
Length = 399
Score = 110 bits (275), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 52/116 (44%), Positives = 75/116 (64%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAIA 118
R+ GGE++FGGVD + GD T+ V+RK YWQ +E + ++ + C + C+AI
Sbjct: 221 RDPKGATGGELLFGGVDPQYYTGDFTWVNVTRKAYWQIHMEKVDVDNGLTVCKDGCEAIV 280
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSLI GP+ I +L K IGA P+ G + CD L +PNV+++LGGK++ LT
Sbjct: 281 DTGTSLITGPTDEIKQLQKAIGAKPIIKGQYMLPCDKLSSLPNVNLVLGGKSYALT 336
Score = 73.2 bits (178), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 33/67 (49%), Positives = 46/67 (68%), Gaps = 2/67 (2%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDK--DKFVGHS 58
+++ AQYYG I +GTPPQ+F V+FDTGSSNLW+PS C +L+IAC + D S
Sbjct: 68 LKNYMDAQYYGEIGIGTPPQKFTVVFDTGSSNLWVPSVKCHLLDIACLLHHKYDSSKSSS 127
Query: 59 HFRNASD 65
+ +N +D
Sbjct: 128 YVKNGTD 134
>gi|147743015|sp|P85139.1|CARDH_CYNCA RecName: Full=Cardosin-H; Contains: RecName: Full=Cardosin-H heavy
chain; Contains: RecName: Full=Cardosin-H light chain
Length = 265
Score = 110 bits (275), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 64/182 (35%), Positives = 90/182 (49%), Gaps = 49/182 (26%)
Query: 9 YYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILN-------------IACSVDKDKFV 55
Y+G I +GTPPQ+F VIFDTGSS LW+PS + I + + D
Sbjct: 17 YFGEIGIGTPPQKFTVIFDTGSSVLWVPSSKAHSMYESSGSSTYKEQDFIEATDETDNVF 76
Query: 56 GHSHF---------------------------------RNASDENGGEIMFGGVDKDKFV 82
H F RN +E GGE++FGG+D + F
Sbjct: 77 LHRLFDGILGLSFQTISVPVWYNMLNQGLVKRFSFWLNRNVDEEEGGELVFGGLDPNHFR 136
Query: 83 GDITYSPVSRKGYWQFGVESIKI--EKNVYCS-NCQAIADTGTSLIIGPSKVIAELNKLI 139
GD TY PV+ + YWQFG+ + I + +C+ CQA AD+GTSL+ GP+ ++ ++N I
Sbjct: 137 GDHTYVPVTYQYYWQFGIGDVLIGDKSTGFCAPGCQAFADSGTSLLSGPTAIVTQINHAI 196
Query: 140 GA 141
GA
Sbjct: 197 GA 198
>gi|17981530|gb|AAL51056.1|AF454831_1 cathepsin D [Apriona germari]
Length = 386
Score = 110 bits (274), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 54/114 (47%), Positives = 72/114 (63%), Gaps = 2/114 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYC-SNCQAIAD 119
R+ GGE+ GG D + G+ TY V R+ YWQF ++ I++ K V+C S CQAIAD
Sbjct: 213 RDPDAAEGGELSLGGSDPKYYKGNFTYLSVDRQAYWQFKMDKIQLGKTVFCKSGCQAIAD 272
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
TGTSL+ GP + +NKLIG P+ G VDC + K+P +D ILGGK +TL
Sbjct: 273 TGTSLVAGPVDEVTSINKLIGGTPIIGGEYVVDC-LIPKLPEIDFILGGKTYTL 325
Score = 71.2 bits (173), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 29/41 (70%), Positives = 35/41 (85%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQY+G IS+G PPQ+FKV+FDTGSSNLW+PS+ C NIAC
Sbjct: 65 AQYFGPISIGNPPQKFKVVFDTGSSNLWVPSKKCHYTNIAC 105
>gi|195386060|ref|XP_002051722.1| GJ17077 [Drosophila virilis]
gi|194148179|gb|EDW63877.1| GJ17077 [Drosophila virilis]
Length = 404
Score = 109 bits (273), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 57/125 (45%), Positives = 77/125 (61%), Gaps = 2/125 (1%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKN 108
+D+ F G R S ++GG+++ GGVD F G++TY PVS++GYWQF V S +
Sbjct: 226 IDEPSF-GFYLARTGSAQDGGQLVLGGVDYQLFSGNLTYVPVSQEGYWQFVVTSAVMNGF 284
Query: 109 VYCSNCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGG 168
V CSNCQAIADTGTSL+ P +LN+LIG L +G VDC +D +P + +GG
Sbjct: 285 VVCSNCQAIADTGTSLLACPGSSYTQLNQLIGGY-LMDGDYYVDCSTVDSLPVLSFNIGG 343
Query: 169 KNFTL 173
F L
Sbjct: 344 TIFNL 348
Score = 60.8 bits (146), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 24/39 (61%), Positives = 30/39 (76%)
Query: 9 YYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
YYG I +GTPPQ F V+FDTGS+NLW+PS C ++AC
Sbjct: 92 YYGVIGIGTPPQYFNVVFDTGSANLWVPSVQCLPTDVAC 130
>gi|86278345|gb|ABC88426.1| cathepsin D-like aspartic proteinase preproprotein [Meloidogyne
incognita]
Length = 454
Score = 109 bits (273), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 51/116 (43%), Positives = 75/116 (64%), Gaps = 3/116 (2%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEK--NVYCSN-CQAI 117
R+ + GGEI GG DK ++V + Y+PV+RK YWQF +E + K + C N C+AI
Sbjct: 235 RDPYSKVGGEITIGGTDKRRYVEPLNYTPVTRKAYWQFKMEGVHNSKGEKIACQNGCEAI 294
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
ADTGTSLI GP I E+ IGAVPL +G V C+ + ++P++ +++GG ++ L
Sbjct: 295 ADTGTSLIAGPKAQIEEIQHYIGAVPLMHGEYMVSCERVPRLPDIALVIGGHSYVL 350
Score = 72.8 bits (177), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 30/47 (63%), Positives = 37/47 (78%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+R+ AQYYG IS+G+PPQ F VIFDTGSSNLW+PS+ C +IAC
Sbjct: 82 LRNYMDAQYYGPISIGSPPQNFSVIFDTGSSNLWVPSKKCPFYDIAC 128
>gi|443723962|gb|ELU12180.1| hypothetical protein CAPTEDRAFT_225009 [Capitella teleta]
Length = 364
Score = 109 bits (273), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 52/117 (44%), Positives = 75/117 (64%), Gaps = 3/117 (2%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYC-SNCQAI 117
R+ S GGE++ GG D + G+ TY PVS+KGYWQF ++ +K+ YC CQAI
Sbjct: 170 RDPSASEGGELILGGSDPKYYEGNFTYLPVSKKGYWQFKMDGMKLAGSSTSYCDGGCQAI 229
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
ADTGTSL+ GPS + +LN+ +G + G +DC+ + ++PN+ +L GK FTLT
Sbjct: 230 ADTGTSLLAGPSAEVQKLNQELGGTAIPGGEYIIDCNKIPQLPNITFMLAGKPFTLT 286
Score = 69.7 bits (169), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 29/42 (69%), Positives = 34/42 (80%)
Query: 6 QAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQYYG I++GTP Q FKV+FDTGSSNLW+PSQ C +IAC
Sbjct: 22 HAQYYGAITIGTPAQTFKVVFDTGSSNLWVPSQKCKWTDIAC 63
>gi|405951067|gb|EKC19012.1| Lysosomal aspartic protease [Crassostrea gigas]
Length = 439
Score = 109 bits (273), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 55/116 (47%), Positives = 73/116 (62%), Gaps = 3/116 (2%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCSN-CQAI 117
RN + GGE++ GG D + G+ TY V+RKGYWQF ++ +K+ + + YCS C AI
Sbjct: 215 RNPTGTPGGEMILGGSDPKYYSGNFTYVNVTRKGYWQFKMDGVKVNGKASKYCSGGCNAI 274
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
ADTGTSL+ GPS + LN +IGA P A G VDC + +P V L GK+FTL
Sbjct: 275 ADTGTSLLAGPSTEVKSLNAMIGAKPFAAGEYTVDCSKIGSLPPVSFTLNGKDFTL 330
Score = 73.9 bits (180), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 31/41 (75%), Positives = 36/41 (87%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQYYG IS+GTPPQ FKVIFDTGSSNLW+PS+ C + +IAC
Sbjct: 68 AQYYGPISIGTPPQNFKVIFDTGSSNLWVPSKKCKLSDIAC 108
>gi|18677168|gb|AAL78216.1| aspartic protease precursor Hgg-33 [Heterodera glycines]
Length = 334
Score = 109 bits (273), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 54/115 (46%), Positives = 76/115 (66%), Gaps = 2/115 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI-EKNVYCSN-CQAIA 118
R+ + E GGEI GG D+ +FV ITY+PV+ K YWQF ++SI + C N CQAIA
Sbjct: 78 RDPTSEFGGEITIGGTDQRRFVAPITYTPVTHKAYWQFKMDSISGGNGKLACQNGCQAIA 137
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
DTGTSL+ GP I ++ K IGA+PL +G + C+ + +P+V I +GGK +T+
Sbjct: 138 DTGTSLLAGPKNDIEKIQKHIGAIPLFHGEYMISCERIPTLPDVTINIGGKAYTV 192
>gi|332376487|gb|AEE63383.1| unknown [Dendroctonus ponderosae]
Length = 388
Score = 109 bits (272), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 51/114 (44%), Positives = 70/114 (61%), Gaps = 1/114 (0%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN-CQAIAD 119
RN + GGE++FGG D + + G+ TY PV R+ YWQF ++ + + + +C C+AIAD
Sbjct: 214 RNPDGKVGGELIFGGSDPNYYSGNFTYLPVDRQAYWQFKMDEVIVGQKTFCKGGCEAIAD 273
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
TGTSLI GP + LN+ IGA PL G VDC + +P + ILGG F L
Sbjct: 274 TGTSLIAGPVDEVKALNEAIGATPLVGGEYAVDCSLIPNLPAIKFILGGNTFVL 327
Score = 73.6 bits (179), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 30/41 (73%), Positives = 36/41 (87%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQY+G IS+GTPPQ+F V+FDTGSSNLW+PS+ CS NIAC
Sbjct: 67 AQYFGAISIGTPPQKFVVVFDTGSSNLWVPSKKCSFTNIAC 107
>gi|195501958|ref|XP_002098019.1| GE10129 [Drosophila yakuba]
gi|194184120|gb|EDW97731.1| GE10129 [Drosophila yakuba]
Length = 396
Score = 109 bits (272), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 47/113 (41%), Positives = 72/113 (63%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSNCQAIADT 120
RN S+ GGE++FGG+DK KF G +TY P++ YWQF +++I++ + QAIADT
Sbjct: 227 RNGSERMGGELLFGGLDKTKFSGTLTYVPLTHAAYWQFPLDAIEVGGTAISHHRQAIADT 286
Query: 121 GTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
GTSL+ P + +N L+G +P AN ++C +D +P + I+GG+ F L
Sbjct: 287 GTSLLAAPPREYLIINSLLGGLPTANNEYLLNCSEIDSLPEIVFIIGGQRFGL 339
Score = 56.6 bits (135), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 28/58 (48%), Positives = 37/58 (63%), Gaps = 10/58 (17%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFR-NAS 64
+Y G IS+G+P Q F ++FDTGS+NLW+PS CS ++AC H H R NAS
Sbjct: 81 EYAGPISIGSPGQPFNMLFDTGSANLWVPSAECSPKSVAC---------HRHHRYNAS 129
>gi|28573989|ref|NP_787961.1| CG33128 [Drosophila melanogaster]
gi|7296076|gb|AAF51371.1| CG33128 [Drosophila melanogaster]
gi|17862850|gb|AAL39902.1| LP12231p [Drosophila melanogaster]
gi|220956466|gb|ACL90776.1| CG33128-PA [synthetic construct]
Length = 405
Score = 109 bits (272), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 53/113 (46%), Positives = 71/113 (62%), Gaps = 1/113 (0%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSNCQAIADT 120
RN S +GG++ GG D++ G++TY+PV+++GYWQF V +I V S CQAIADT
Sbjct: 238 RNGSAVDGGQLTLGGTDQNLIAGEMTYTPVTQQGYWQFAVNNITWNGTVISSGCQAIADT 297
Query: 121 GTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
GTSLI PS +LN LIG VP+ G V C + +P + I +GG NF L
Sbjct: 298 GTSLIAAPSAAYIQLNNLIGGVPI-QGDYYVPCSTVSSLPVLTINIGGTNFYL 349
Score = 60.8 bits (146), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 25/47 (53%), Positives = 33/47 (70%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+ +S YYG I +GTP Q FKV+FDTGS+NLW+PS C ++AC
Sbjct: 85 LGNSMNMYYYGLIGIGTPEQYFKVVFDTGSANLWVPSAQCLATDVAC 131
>gi|260822088|ref|XP_002606435.1| hypothetical protein BRAFLDRAFT_67687 [Branchiostoma floridae]
gi|229291776|gb|EEN62445.1| hypothetical protein BRAFLDRAFT_67687 [Branchiostoma floridae]
Length = 423
Score = 108 bits (271), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 53/117 (45%), Positives = 71/117 (60%), Gaps = 3/117 (2%)
Query: 60 FRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCSN-CQA 116
R+ GGE++ GG D + GD T+ V+ GYWQF ++ I I + + YC C A
Sbjct: 239 LRDPLGTTGGELLLGGTDPKYYSGDFTFVNVTEPGYWQFKMDGIMINGQASAYCKGGCNA 298
Query: 117 IADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
IADTGTSLI GP+ I LNKLIGA P+ G VDC+ + +P + +LGGK+F L
Sbjct: 299 IADTGTSLIAGPTSEIQALNKLIGATPIVGGEYTVDCNKIPSLPTISFVLGGKSFGL 355
Score = 76.3 bits (186), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 30/41 (73%), Positives = 37/41 (90%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQYYGTI++GTPPQ F+V+FDTGSSNLW+PS HC + +IAC
Sbjct: 68 AQYYGTIAIGTPPQSFQVVFDTGSSNLWVPSSHCPLTDIAC 108
>gi|21907889|dbj|BAC05689.1| aspartic protease BmAsp-2 [Brugia malayi]
Length = 452
Score = 108 bits (270), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 54/115 (46%), Positives = 75/115 (65%), Gaps = 2/115 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-KNVYCSN-CQAIA 118
RN SD+ GGEI FGG+D ++FV ITY+PVSR GYWQF ++ + K + C N CQAIA
Sbjct: 244 RNPSDKIGGEITFGGIDANRFVSPITYTPVSRHGYWQFKMDRVLGRGKAIGCGNGCQAIA 303
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
DTGTSLI GP I ++ + IGA + G + C + +P + ++ GK++TL
Sbjct: 304 DTGTSLIAGPKSQIDKIQEYIGAEHVYAGEYIIPCYKVPSLPEITFVIAGKSYTL 358
Score = 73.6 bits (179), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 36/82 (43%), Positives = 50/82 (60%), Gaps = 1/82 (1%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHF 60
+++ AQYYG IS+GTPPQ F V+FDTGSSNLW+PS C L+IAC + +K+ G
Sbjct: 91 LKNYMDAQYYGEISIGTPPQNFSVVFDTGSSNLWVPSVKCPFLDIAC-LFHNKYKGTKST 149
Query: 61 RNASDENGGEIMFGGVDKDKFV 82
D +I +G + F+
Sbjct: 150 TYKPDGRKIQIQYGTGSMEGFI 171
>gi|156406785|ref|XP_001641225.1| predicted protein [Nematostella vectensis]
gi|156228363|gb|EDO49162.1| predicted protein [Nematostella vectensis]
Length = 370
Score = 108 bits (270), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 50/115 (43%), Positives = 75/115 (65%), Gaps = 1/115 (0%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN-CQAIAD 119
R GGE++ GG D + G+ +Y PV+++GYWQF ++ I +++ +CS+ CQAIAD
Sbjct: 196 REPGASPGGELLLGGSDPKYYKGNFSYVPVTQEGYWQFKMDGISVKEGSFCSDGCQAIAD 255
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
TGTSLI GP+ I +LN LIGA + G V+C +D +P++ +GGK + LT
Sbjct: 256 TGTSLIAGPTDEIEKLNNLIGAKIIIGGEYTVNCSAIDSLPDITFTIGGKKYVLT 310
Score = 72.8 bits (177), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 32/48 (66%), Positives = 40/48 (83%), Gaps = 1/48 (2%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKF 54
AQYYG I++GTPPQ+F V+FDTGSSNLW+PS+ CS NIAC + DK+
Sbjct: 49 AQYYGEITIGTPPQKFTVVFDTGSSNLWVPSKKCSWTNIACLL-HDKY 95
>gi|147743007|sp|P85138.1|CARDG_CYNCA RecName: Full=Cardosin-G; Contains: RecName: Full=Cardosin-G heavy
chain; Contains: RecName: Full=Cardosin-G light chain
Length = 266
Score = 108 bits (269), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 63/182 (34%), Positives = 90/182 (49%), Gaps = 49/182 (26%)
Query: 9 YYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILN-------------IACSVDKDKFV 55
Y+G I +GTPPQ+F VIFDTGSS LW+PS + I + + D
Sbjct: 17 YFGEIGIGTPPQKFTVIFDTGSSYLWVPSSKAHSMYESSDSSTYKEQDFIEATEEADNVF 76
Query: 56 GHSHF---------------------------------RNASDENGGEIMFGGVDKDKFV 82
+ F RN +E GGE++FGG+D + F
Sbjct: 77 LNRLFDGILGLSFQTISVPVWYNMVNQGLVKRFSFWLNRNVDEEEGGELVFGGLDPNHFR 136
Query: 83 GDITYSPVSRKGYWQFGVESIKI--EKNVYCS-NCQAIADTGTSLIIGPSKVIAELNKLI 139
GD TY PV+ + YWQFG+ + I + +C+ CQA AD+GTSL+ GP+ ++ ++N I
Sbjct: 137 GDHTYVPVTYQYYWQFGIGDVLIGDKSTGFCAPGCQAFADSGTSLLSGPTAIVTQINHAI 196
Query: 140 GA 141
GA
Sbjct: 197 GA 198
>gi|336454162|gb|AEI58895.1| cathepsin D [Pteria penguin]
Length = 392
Score = 108 bits (269), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 52/116 (44%), Positives = 76/116 (65%), Gaps = 3/116 (2%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCSN-CQAI 117
R+ +GGE++ GG D + G+ TY PV+++GYW+F ++ I + + + YCS C+AI
Sbjct: 214 RDPGASDGGELILGGSDSKYYKGNFTYLPVTKQGYWRFKMDGIMMNGKASKYCSGGCKAI 273
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
ADTGTSL+ GP + LNK IGA PLA G VDC ++ K+P + +LGG+ F L
Sbjct: 274 ADTGTSLLAGPKTEVDALNKQIGATPLAAGEYMVDCSSVSKLPVISFMLGGQQFDL 329
Score = 69.3 bits (168), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 29/41 (70%), Positives = 35/41 (85%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQYYG I++GTP Q FKVIFDTGSSNLW+PS+ C + +IAC
Sbjct: 67 AQYYGDITIGTPGQSFKVIFDTGSSNLWVPSKKCKLSDIAC 107
>gi|322796189|gb|EFZ18765.1| hypothetical protein SINV_10075 [Solenopsis invicta]
Length = 366
Score = 108 bits (269), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 56/118 (47%), Positives = 77/118 (65%), Gaps = 6/118 (5%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI----EKNVYCSN-CQ 115
R+ + GGE++ GG D + +VG +TY V+RKGYWQF ++ I + ++ C N CQ
Sbjct: 192 RDPTAAQGGEMILGGSDPEHYVGSMTYVDVTRKGYWQFTMDRITVGDSSPSHILCKNTCQ 251
Query: 116 AIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
AIADTGTSLI GP+ I E+NK IGA + G A V+C + +P V+ ILGGK F+L
Sbjct: 252 AIADTGTSLIAGPTVEINEINKQIGAT-MIGGQALVNCAMVPHLPKVNFILGGKTFSL 308
Score = 75.1 bits (183), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 31/41 (75%), Positives = 36/41 (87%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQYYG I++GTPPQ+FKVIFDTGSSNLW+PS+ C NIAC
Sbjct: 45 AQYYGEITIGTPPQKFKVIFDTGSSNLWVPSKKCRYTNIAC 85
>gi|27803878|gb|AAO22152.1| cathepsin D-like aspartic protease [Ancylostoma ceylanicum]
Length = 446
Score = 108 bits (269), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 50/115 (43%), Positives = 72/115 (62%), Gaps = 2/115 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI-EKNVYCSN-CQAIA 118
RN E GGEI GG+D ++V IT++PV+R+GYWQF ++ ++ ++ C N CQAIA
Sbjct: 240 RNPDSELGGEITLGGMDPRRYVEPITWTPVTRRGYWQFKMDKVQGGSTSIACPNGCQAIA 299
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
DTGTSLI GP + + K IGA PL G + CD + +P + ++ G+ F L
Sbjct: 300 DTGTSLIAGPKAQVEAIQKFIGAEPLMKGEYMIPCDKVPSLPELSFVIEGRTFIL 354
Score = 69.7 bits (169), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 29/49 (59%), Positives = 37/49 (75%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSV 49
+R+ AQY+GTI +GTP Q F VIFDTGSSNLW+PS+ C +IAC +
Sbjct: 87 LRNYMDAQYFGTIQIGTPAQNFTVIFDTGSSNLWVPSRKCPFYDIACML 135
>gi|224085770|ref|XP_002189383.1| PREDICTED: cathepsin E [Taeniopygia guttata]
Length = 435
Score = 108 bits (269), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 53/115 (46%), Positives = 77/115 (66%), Gaps = 3/115 (2%)
Query: 62 NASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAIAD 119
N GGE++FGG D +F G + + PV+++GYWQ +++I++ V +C+N CQAI D
Sbjct: 261 NPDSPQGGEVLFGGFDTSRFTGTLNWVPVTQQGYWQIQLDNIQLGGTVTFCANGCQAIVD 320
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
TGTSLI GP+K I +L LIGAV + +G VDC NL MP++ I + G +TL+
Sbjct: 321 TGTSLITGPTKEIKKLQNLIGAVSV-DGEYTVDCSNLSSMPDLTITINGLPYTLS 374
Score = 59.7 bits (143), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 25/41 (60%), Positives = 34/41 (82%), Gaps = 2/41 (4%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACS 48
+Y+G IS+GTPPQ F V+FDTGSSNLW+PS +C ++ AC+
Sbjct: 116 EYFGQISIGTPPQNFTVVFDTGSSNLWVPSVYC--VSKACT 154
>gi|195034430|ref|XP_001988894.1| GH11416 [Drosophila grimshawi]
gi|193904894|gb|EDW03761.1| GH11416 [Drosophila grimshawi]
Length = 400
Score = 107 bits (268), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 54/126 (42%), Positives = 80/126 (63%), Gaps = 2/126 (1%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKN 108
+DK F G N S E GGE++ GGVD F G++T PVS+ GYWQF + + ++ N
Sbjct: 222 IDKPTF-GFYLTHNGSAELGGELILGGVDNTLFEGNLTSVPVSQMGYWQFAMAVVAMDNN 280
Query: 109 VYCSNCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGG 168
V CS+CQAIADTGTSL+ P+ + +N +IGA + +G VDC ++ +P ++ ++G
Sbjct: 281 VICSDCQAIADTGTSLLAVPANQLTYINNIIGAYQM-DGDYFVDCSLVNSLPTLNFLIGE 339
Query: 169 KNFTLT 174
F+LT
Sbjct: 340 SVFSLT 345
Score = 64.3 bits (155), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 26/39 (66%), Positives = 33/39 (84%)
Query: 9 YYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
YYG I++GTPPQ FKV+FDTGSSNLW+PS +C ++AC
Sbjct: 88 YYGEIAIGTPPQYFKVVFDTGSSNLWVPSVNCLPTDLAC 126
>gi|354478111|ref|XP_003501259.1| PREDICTED: cathepsin E-like isoform 1 [Cricetulus griseus]
Length = 396
Score = 107 bits (267), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 56/119 (47%), Positives = 78/119 (65%), Gaps = 6/119 (5%)
Query: 63 ASDENGG---EIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAI 117
+SD GG E+ FGG D F G++ + PV+++GYWQ ++ +++ V +CS CQAI
Sbjct: 220 SSDPQGGSGSELTFGGFDPSHFSGNLNWIPVTKQGYWQIALDGVQVGDTVMFCSEGCQAI 279
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLTRT 176
DTGTSLI GPS I +L + IGA P+ +G VDC NL+ MPNV IL G ++TL+ T
Sbjct: 280 VDTGTSLITGPSHKIKQLQEAIGATPM-DGEYAVDCANLNTMPNVAFILNGVSYTLSPT 337
Score = 60.8 bits (146), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 24/34 (70%), Positives = 31/34 (91%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCS 41
+Y+GTIS+G+PPQ F VIFDTGSSNLW+PS +C+
Sbjct: 77 EYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCT 110
>gi|205363469|gb|ACI04164.1| cathepsin D-like aspartic protease precursor [Fasciola hepatica]
Length = 429
Score = 107 bits (267), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 51/114 (44%), Positives = 72/114 (63%), Gaps = 1/114 (0%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN-CQAIAD 119
RNASD GGE++ GG+D + + GDITY PV+ + YWQF V+ I+ C++ CQAIAD
Sbjct: 215 RNASDPVGGELLLGGIDPNYYTGDITYVPVTHEAYWQFKVDKIEFPGVSICADGCQAIAD 274
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
TGTSLI GP K + LN+ IG + G V+CD +D + + ++ G+ L
Sbjct: 275 TGTSLIAGPKKEVDALNEQIGGTWMPGGIYVVNCDKIDNLSAITFVVAGRKMVL 328
Score = 72.8 bits (177), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 31/41 (75%), Positives = 35/41 (85%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQYYG I +GTPPQ FKVIFDTGSSNLW+PS+ CS L+ AC
Sbjct: 68 AQYYGEIGIGTPPQTFKVIFDTGSSNLWVPSKRCSYLSWAC 108
>gi|332514729|gb|AEE69372.1| cathepsin D [Fasciola gigantica]
Length = 429
Score = 107 bits (267), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 51/114 (44%), Positives = 72/114 (63%), Gaps = 1/114 (0%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN-CQAIAD 119
RNASD GGE++ GG+D + + GDITY PV+ + YWQF V+ I+ C++ CQAIAD
Sbjct: 215 RNASDPVGGELLLGGIDPNYYTGDITYVPVTHEAYWQFKVDKIEFPGVSICADGCQAIAD 274
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
TGTSLI GP K + LN+ IG + G V+CD +D + + ++ G+ L
Sbjct: 275 TGTSLIAGPKKEVDALNEQIGGTWMPGGIYVVNCDKIDNLSAITFVVAGRKMVL 328
Score = 72.8 bits (177), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 31/41 (75%), Positives = 35/41 (85%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQYYG I +GTPPQ FKVIFDTGSSNLW+PS+ CS L+ AC
Sbjct: 68 AQYYGEIGIGTPPQTFKVIFDTGSSNLWVPSKRCSYLSWAC 108
>gi|387015018|gb|AFJ49628.1| Cathepsin D [Crotalus adamanteus]
Length = 399
Score = 107 bits (267), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 51/116 (43%), Positives = 73/116 (62%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAIA 118
R+ E GGE++FGG D + GD ++ VSRK YWQ ++ + + + C + C+AI
Sbjct: 221 RDPKGETGGELLFGGTDSQYYSGDFSWVNVSRKAYWQVHMDKVDVANGLTVCKDGCEAIV 280
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSLI GP+K I EL K IGA P+ G + CD L +P V ++LGG+++ LT
Sbjct: 281 DTGTSLITGPTKEIKELQKAIGAKPIIKGQYMLPCDKLSTLPTVSLVLGGQSYALT 336
Score = 75.1 bits (183), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 32/67 (47%), Positives = 46/67 (68%), Gaps = 2/67 (2%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDK--DKFVGHS 58
+++ AQYYG I +GTP Q F V+FDTGSSNLW+PS HC++L+IAC + D +
Sbjct: 68 LKNYMDAQYYGEIGIGTPQQRFTVVFDTGSSNLWVPSSHCTLLDIACLIHHKYDSSKSST 127
Query: 59 HFRNASD 65
+ +N +D
Sbjct: 128 YVKNGTD 134
>gi|344236711|gb|EGV92814.1| Cathepsin E [Cricetulus griseus]
Length = 388
Score = 107 bits (267), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 56/119 (47%), Positives = 78/119 (65%), Gaps = 6/119 (5%)
Query: 63 ASDENGG---EIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAI 117
+SD GG E+ FGG D F G++ + PV+++GYWQ ++ +++ V +CS CQAI
Sbjct: 212 SSDPQGGSGSELTFGGFDPSHFSGNLNWIPVTKQGYWQIALDGVQVGDTVMFCSEGCQAI 271
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLTRT 176
DTGTSLI GPS I +L + IGA P+ +G VDC NL+ MPNV IL G ++TL+ T
Sbjct: 272 VDTGTSLITGPSHKIKQLQEAIGATPM-DGEYAVDCANLNTMPNVAFILNGVSYTLSPT 329
Score = 61.2 bits (147), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 24/34 (70%), Positives = 31/34 (91%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCS 41
+Y+GTIS+G+PPQ F VIFDTGSSNLW+PS +C+
Sbjct: 77 EYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCT 110
>gi|194862077|ref|XP_001969915.1| GG23677 [Drosophila erecta]
gi|190661782|gb|EDV58974.1| GG23677 [Drosophila erecta]
Length = 427
Score = 107 bits (266), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 48/113 (42%), Positives = 72/113 (63%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSNCQAIADT 120
R+ + ++GGE+++GG+D + G I Y PVS YWQF S+K+ + C+ CQAIADT
Sbjct: 218 RDGTSQSGGEVIWGGIDPSIYRGCINYVPVSMPAYWQFTANSVKVRDILLCNGCQAIADT 277
Query: 121 GTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
GTSLI+ P + +N ++ A G A VDC L ++PNV++ +GG +TL
Sbjct: 278 GTSLIVVPLRAYIAINIVLNATDNGEGEAFVDCSRLSRLPNVNLNIGGSTYTL 330
Score = 50.8 bits (120), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 21/43 (48%), Positives = 29/43 (67%)
Query: 5 YQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+ +Y + +GTPPQ F + FDTGSS+LW+PS C+ N AC
Sbjct: 70 FNLEYTIRVCIGTPPQCFNLQFDTGSSDLWVPSSKCASTNEAC 112
>gi|242013446|ref|XP_002427417.1| Lysosomal aspartic protease precursor, putative [Pediculus humanus
corporis]
gi|212511797|gb|EEB14679.1| Lysosomal aspartic protease precursor, putative [Pediculus humanus
corporis]
Length = 383
Score = 107 bits (266), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 47/114 (41%), Positives = 76/114 (66%), Gaps = 1/114 (0%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN-CQAIAD 119
RN +++ GGE++ GG D + + G+ TY PV+RK YWQF ++ + +E C CQAIAD
Sbjct: 209 RNPNEKVGGELLLGGSDPNYYKGNFTYLPVNRKAYWQFQMDKVMMEDITVCRGGCQAIAD 268
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
TGTSLI GP + + ++NK + VP++ G ++C N+ +P ++ +L G++F L
Sbjct: 269 TGTSLIAGPVEDVNKINKKLNGVPVSGGEYMIECRNIPNLPKINFVLKGRSFVL 322
Score = 77.0 bits (188), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 36/61 (59%), Positives = 44/61 (72%), Gaps = 2/61 (3%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDK--DKFVGHSHFRNAS 64
AQYYG IS+GTPPQ+FKVIFDTGSSNLWIPS+ C NIAC + D ++ RN +
Sbjct: 62 AQYYGPISIGTPPQQFKVIFDTGSSNLWIPSKKCLFSNIACLLHNKYDSSRSSTYIRNGT 121
Query: 65 D 65
+
Sbjct: 122 E 122
>gi|218944225|gb|ACL13150.1| cathepsin D [Azumapecten farreri]
Length = 396
Score = 107 bits (266), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 48/116 (41%), Positives = 75/116 (64%), Gaps = 3/116 (2%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCSN-CQAI 117
R+ GGE++ GG D + G+ +Y+P+++KGYWQF + I++ + + YC+ C AI
Sbjct: 219 RDPDASAGGELIIGGSDPKHYSGNFSYAPITKKGYWQFDMAGIQVGGKASAYCNGGCSAI 278
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
ADTGTSL++GP+ + +LNK IGA P A G VDCD + +P + ++ + FTL
Sbjct: 279 ADTGTSLLVGPTAEVQQLNKQIGATPFAGGEYTVDCDKISSLPPISFMIDKQLFTL 334
Score = 71.2 bits (173), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 29/41 (70%), Positives = 35/41 (85%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQYYG I +GTP QEFKV+FDTGSSNLW+PS+ C + +IAC
Sbjct: 72 AQYYGAIQIGTPAQEFKVVFDTGSSNLWVPSKKCKLSDIAC 112
>gi|312097106|ref|XP_003148873.1| aspartic protease BmAsp-2 [Loa loa]
gi|307755962|gb|EFO15196.1| aspartic protease BmAsp-2 [Loa loa]
Length = 417
Score = 107 bits (266), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 57/130 (43%), Positives = 81/130 (62%), Gaps = 7/130 (5%)
Query: 51 KDKFVGHSHF-----RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI 105
K K V S F RN +DE GGEI GG+D ++FV + Y+P+S+ GYWQF ++SI+
Sbjct: 201 KQKTVKESLFAFWLDRNPNDEIGGEITLGGIDVNRFVAPLVYTPISKHGYWQFQMDSIQG 260
Query: 106 E-KNVYCSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVD 163
+ K + C+N CQAIADTGTSLI GP I ++ K IGA L + C + +P +
Sbjct: 261 DGKAISCANGCQAIADTGTSLIAGPKSQIDKIQKYIGAEHLYADEYIIPCYKVPSLPEIT 320
Query: 164 IILGGKNFTL 173
++ GK++TL
Sbjct: 321 FVIAGKSYTL 330
Score = 67.8 bits (164), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 35/82 (42%), Positives = 48/82 (58%), Gaps = 1/82 (1%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHF 60
+++ AQYYG IS+GTP Q F V+FDTGSSNLWIPS C +IAC + +K+ G
Sbjct: 63 LKNYMDAQYYGQISIGTPAQNFSVVFDTGSSNLWIPSVKCPFSDIAC-LFHNKYKGAQST 121
Query: 61 RNASDENGGEIMFGGVDKDKFV 82
D +I +G + F+
Sbjct: 122 TYKPDGRKIKIQYGRGSMEGFI 143
>gi|205364148|gb|ACI04532.1| aspartic protease 1 precursor [Ancylostoma duodenale]
Length = 446
Score = 106 bits (265), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 50/115 (43%), Positives = 72/115 (62%), Gaps = 2/115 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI-EKNVYCSN-CQAIA 118
RN E GGEI GG+D ++V IT++PV+R+GYWQF ++ ++ ++ C N CQAIA
Sbjct: 240 RNPDSELGGEITLGGMDTRRYVEPITWTPVTRRGYWQFKMDKVQGGSTSIACPNGCQAIA 299
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
DTGTSLI GP + + K IGA PL G + CD + +P + ++ G+ TL
Sbjct: 300 DTGTSLIAGPKAQVEAIQKFIGAEPLMKGEYMIPCDKVPSLPELSFVIEGRTSTL 354
Score = 70.9 bits (172), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 37/84 (44%), Positives = 49/84 (58%), Gaps = 5/84 (5%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHF 60
+R+ AQY+GTI +GTP Q F VIFDTGSSNLW+PS+ C +IAC + + G S
Sbjct: 87 LRNYMDAQYFGTIQIGTPAQNFTVIFDTGSSNLWVPSRKCPFYDIACMLHRRYDSGASST 146
Query: 61 -----RNASDENGGEIMFGGVDKD 79
R + + G M G + KD
Sbjct: 147 YKEDGRKMAIQYGTGSMKGFISKD 170
>gi|195575783|ref|XP_002077756.1| GD23099 [Drosophila simulans]
gi|194189765|gb|EDX03341.1| GD23099 [Drosophila simulans]
Length = 405
Score = 106 bits (265), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 53/113 (46%), Positives = 69/113 (61%), Gaps = 1/113 (0%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSNCQAIADT 120
RN S GG++ GG D++ G++TY+PV+++GYWQF V +I V S CQAIADT
Sbjct: 238 RNGSAVEGGQLTLGGTDQNLIAGEMTYTPVTQQGYWQFSVNNITWNGTVISSGCQAIADT 297
Query: 121 GTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
GTSLI PS +LN LIG V L G V C + +P + I +GG NF L
Sbjct: 298 GTSLIAAPSAAYIQLNNLIGGV-LIQGEYYVPCSTVSSLPVLTINIGGTNFYL 349
Score = 60.5 bits (145), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 24/39 (61%), Positives = 30/39 (76%)
Query: 9 YYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
YYG I +GTP Q FKV+FDTGS+NLW+PS C ++AC
Sbjct: 93 YYGLIGIGTPEQYFKVVFDTGSANLWVPSSQCLSTDVAC 131
>gi|195350353|ref|XP_002041705.1| GM16820 [Drosophila sechellia]
gi|194123478|gb|EDW45521.1| GM16820 [Drosophila sechellia]
Length = 405
Score = 106 bits (265), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 53/113 (46%), Positives = 69/113 (61%), Gaps = 1/113 (0%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSNCQAIADT 120
RN S GG++ GG D++ G++TY+PV+++GYWQF V +I V S CQAIADT
Sbjct: 238 RNGSAVEGGQLTLGGTDQNLIAGEMTYTPVTQQGYWQFSVNNITWNGTVISSGCQAIADT 297
Query: 121 GTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
GTSLI PS +LN LIG V L G V C + +P + I +GG NF L
Sbjct: 298 GTSLIAAPSAAYIQLNNLIGGV-LIQGEYYVPCSTVSSLPVLTINIGGTNFYL 349
Score = 60.8 bits (146), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 24/39 (61%), Positives = 30/39 (76%)
Query: 9 YYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
YYG I +GTP Q FKV+FDTGS+NLW+PS C ++AC
Sbjct: 93 YYGLIGIGTPEQYFKVVFDTGSANLWVPSSQCLATDVAC 131
>gi|195433873|ref|XP_002064931.1| GK19045 [Drosophila willistoni]
gi|194161016|gb|EDW75917.1| GK19045 [Drosophila willistoni]
Length = 411
Score = 106 bits (265), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 51/113 (45%), Positives = 71/113 (62%), Gaps = 1/113 (0%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSNCQAIADT 120
RN + +GGE++ GG D+ +VG+++Y PVS++GYWQF V +I V C NCQAIADT
Sbjct: 244 RNGTATDGGELILGGSDESLYVGNLSYVPVSQQGYWQFAVNNITWNGTVVCDNCQAIADT 303
Query: 121 GTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
GTSLI P ++LN+LIGA+ G V C +D +P + +G F L
Sbjct: 304 GTSLIACPFSAYSQLNQLIGAL-YVEGSYYVSCSTVDSLPVLSFSIGNGIFEL 355
Score = 63.5 bits (153), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 26/47 (55%), Positives = 35/47 (74%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+ +S Y+GTI +GTP Q F V+FDTGS+NLWIPS HC+ ++AC
Sbjct: 91 LGNSMNMYYFGTIGIGTPEQYFNVVFDTGSANLWIPSVHCASTDVAC 137
>gi|195120065|ref|XP_002004549.1| GI19550 [Drosophila mojavensis]
gi|193909617|gb|EDW08484.1| GI19550 [Drosophila mojavensis]
Length = 387
Score = 106 bits (264), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 51/114 (44%), Positives = 67/114 (58%), Gaps = 1/114 (0%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN-CQAIAD 119
R+ GGEI+FGG D + + GD TY PV+RKGYWQ ++S +I C CQ IAD
Sbjct: 213 RDPKAPEGGEIIFGGSDPNHYTGDFTYLPVTRKGYWQIKMDSAQINNVELCKGGCQVIAD 272
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
TGTSLI P+ +N+ IG P+ G V CD + +P + +LGGK F L
Sbjct: 273 TGTSLIAAPAAEATSINQAIGGTPIVGGQYVVSCDMIPNLPVIKFVLGGKTFEL 326
Score = 75.1 bits (183), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 31/41 (75%), Positives = 36/41 (87%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQYYG IS+GTPPQ FKV+FDTGSSNLW+PS+ C + NIAC
Sbjct: 66 AQYYGPISIGTPPQNFKVVFDTGSSNLWVPSKKCHLTNIAC 106
>gi|47210711|emb|CAF90003.1| unnamed protein product [Tetraodon nigroviridis]
Length = 361
Score = 106 bits (264), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 51/116 (43%), Positives = 75/116 (64%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEK-NVYC-SNCQAIA 118
R + + GE++ GG+D+ F G I + PV+ KGYWQ V+S+ ++ + +C S C+AI
Sbjct: 184 RKSGSKLQGELLLGGIDEALFTGPINWLPVTAKGYWQIKVDSVAVQGVDSFCPSGCEAIV 243
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSLI GP++ I L +LIGA P G +DC L +P+V +LGG+ +TLT
Sbjct: 244 DTGTSLITGPTRDILRLQQLIGATPTNIGEFLIDCIRLSSLPHVTFVLGGEEYTLT 299
Score = 60.5 bits (145), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 25/35 (71%), Positives = 29/35 (82%)
Query: 6 QAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHC 40
QAQYYG I LGTP Q F V+FDTGSS+LW+PS +C
Sbjct: 2 QAQYYGEIRLGTPGQNFSVVFDTGSSDLWVPSVYC 36
>gi|345322126|ref|XP_001511988.2| PREDICTED: cathepsin E-like [Ornithorhynchus anatinus]
Length = 295
Score = 105 bits (263), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 62/170 (36%), Positives = 89/170 (52%), Gaps = 41/170 (24%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASDE 66
AQYYG +S+GTPPQ F V+FDTGSSN W+PS +C ++ AC + H F+
Sbjct: 79 AQYYGAVSIGTPPQRFTVVFDTGSSNFWVPSAYC--ISEACRM-------HQKFK----- 124
Query: 67 NGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAIADTGTSL 124
S +SR + + +KI+ V +C++ C+AI D+GTSL
Sbjct: 125 ---------------------SFLSRS----YRHDGVKIQGRVAFCAHGCEAIVDSGTSL 159
Query: 125 IIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
I GP I L + IGA P +G VDC + MP+V +G + +TL+
Sbjct: 160 ITGPPAQIRRLQEHIGATPSPSGEYAVDCRRVSSMPHVTFTIGHREYTLS 209
>gi|307166067|gb|EFN60339.1| Lysosomal aspartic protease [Camponotus floridanus]
Length = 370
Score = 105 bits (263), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 53/115 (46%), Positives = 76/115 (66%), Gaps = 3/115 (2%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN-CQAIAD 119
RN S E GGE++ GG D + +VG+ TY PV+ +GYWQF ++S+ + C+ CQAIAD
Sbjct: 203 RNPSAEEGGELILGGCDPNHYVGEFTYVPVTVEGYWQFTMDSVIAGNYILCAQGCQAIAD 262
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
TGTSLI+GPS+ I +N I + +G VDCD ++++P ++ IL GK LT
Sbjct: 263 TGTSLIVGPSEDIDVINGYIQNISDNDG--NVDCDKINELPTINFILSGKPHNLT 315
Score = 60.8 bits (146), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 25/31 (80%), Positives = 28/31 (90%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPS 37
AQYYG IS+GTPPQ F V+FDTGSSNLW+PS
Sbjct: 52 AQYYGVISIGTPPQNFTVLFDTGSSNLWVPS 82
>gi|307168170|gb|EFN61449.1| Lysosomal aspartic protease [Camponotus floridanus]
Length = 382
Score = 105 bits (263), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 55/116 (47%), Positives = 71/116 (61%), Gaps = 3/116 (2%)
Query: 61 RNASDE-NGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYC-SNCQAIA 118
R+ SD NGGE++ GG D + GD TY PVSRKGYWQF ++ I C NCQA+A
Sbjct: 211 RDFSDALNGGELILGGSDPTHYEGDFTYIPVSRKGYWQFTLDKIIASYINLCDENCQAVA 270
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
D I+GP + I +N LIG V + NG +V+C +D +P + ILGGK F LT
Sbjct: 271 DVSADAIVGPKQHIVFINDLIGTVNI-NGEERVNCHRIDLLPTISFILGGKAFNLT 325
Score = 49.3 bits (116), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 20/33 (60%), Positives = 25/33 (75%)
Query: 9 YYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCS 41
Y G I++GTPPQEF VIFD + LWIPS+ C+
Sbjct: 68 YSGIITIGTPPQEFTVIFDIRAPYLWIPSKKCN 100
>gi|198451348|ref|XP_001358330.2| GA19187 [Drosophila pseudoobscura pseudoobscura]
gi|198131448|gb|EAL27468.2| GA19187 [Drosophila pseudoobscura pseudoobscura]
Length = 393
Score = 105 bits (262), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 43/113 (38%), Positives = 70/113 (61%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSNCQAIADT 120
RN S + GGE++FGG+D +F +TY P++ GYWQF ++S+++ QAI DT
Sbjct: 224 RNGSAQQGGELLFGGIDASRFTAPLTYVPLTHAGYWQFQMQSVEVVGKTISQGRQAIVDT 283
Query: 121 GTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
GTSL+ P + +N L+G +P A+G + C ++ ++P V ++GG+ F L
Sbjct: 284 GTSLLAAPPREYLIINSLLGGLPTASGEYLLRCSDIGRLPEVFFVIGGQRFGL 336
Score = 62.4 bits (150), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 29/57 (50%), Positives = 36/57 (63%), Gaps = 8/57 (14%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNAS 64
+Y G IS+GTP Q F ++FDTGS+NLW+PS CS N+AC H H NAS
Sbjct: 78 EYAGPISIGTPRQPFNMLFDTGSANLWVPSAECSARNVACQ--------HHHRYNAS 126
>gi|195144214|ref|XP_002013091.1| GL23572 [Drosophila persimilis]
gi|194102034|gb|EDW24077.1| GL23572 [Drosophila persimilis]
Length = 393
Score = 105 bits (262), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 43/113 (38%), Positives = 70/113 (61%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSNCQAIADT 120
RN S + GGE++FGG+D +F +TY P++ GYWQF ++S+++ QAI DT
Sbjct: 224 RNGSAQQGGELLFGGIDASRFTAPLTYVPLTHAGYWQFQMQSVEVVGKTISQGRQAIVDT 283
Query: 121 GTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
GTSL+ P + +N L+G +P A+G + C ++ ++P V ++GG+ F L
Sbjct: 284 GTSLLAAPPREYLIINSLLGGLPTASGEYLLRCSDIGRLPEVFFVIGGQRFGL 336
Score = 62.4 bits (150), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 29/57 (50%), Positives = 36/57 (63%), Gaps = 8/57 (14%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNAS 64
+Y G IS+GTP Q F ++FDTGS+NLW+PS CS N+AC H H NAS
Sbjct: 78 EYAGPISIGTPRQPFNMLFDTGSANLWVPSAECSARNVACQ--------HHHRYNAS 126
>gi|395544889|ref|XP_003774338.1| PREDICTED: uncharacterized protein LOC100917285 [Sarcophilus
harrisii]
Length = 370
Score = 105 bits (261), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 52/111 (46%), Positives = 70/111 (63%), Gaps = 2/111 (1%)
Query: 66 ENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAIADTGTS 123
+ GGE+ FGG D + GD TY V+RK YWQ ++ +K+E + C + C+AI DTGTS
Sbjct: 200 QPGGELTFGGTDSKYYKGDFTYLNVTRKAYWQIHMDMVKVESALTLCKDGCEAIVDTGTS 259
Query: 124 LIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
LI GP + I LNK +GAV L G + C+ + +P V +LGGKNF LT
Sbjct: 260 LITGPVQEIKALNKALGAVALVGGEYMIPCEKVTSLPPVSFVLGGKNFKLT 310
>gi|431892878|gb|ELK03306.1| Cathepsin E [Pteropus alecto]
Length = 396
Score = 105 bits (261), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 56/119 (47%), Positives = 77/119 (64%), Gaps = 6/119 (5%)
Query: 63 ASDENGG---EIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAI 117
+SD GG E++FGG D F G + + PV+++GYWQ +++I++ V +CS CQAI
Sbjct: 220 SSDLEGGAGSELIFGGYDHSHFSGSLNWVPVTKQGYWQIALDTIQVGGAVIFCSEGCQAI 279
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLTRT 176
DTGTSLI GPS+ I +L K IGA P NG V+CDNL+ MP+V + G +TL T
Sbjct: 280 VDTGTSLITGPSEEIKQLQKAIGAEP-TNGEYAVECDNLNVMPDVTFTINGVPYTLQPT 337
Score = 60.8 bits (146), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 24/34 (70%), Positives = 31/34 (91%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCS 41
+Y+GTIS+G+PPQ F VIFDTGSSNLW+PS +C+
Sbjct: 77 EYFGTISVGSPPQNFTVIFDTGSSNLWVPSVYCT 110
>gi|357627475|gb|EHJ77155.1| cathepsin D [Danaus plexippus]
Length = 358
Score = 104 bits (260), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 50/118 (42%), Positives = 73/118 (61%), Gaps = 1/118 (0%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN-CQAIAD 119
R+AS GGE++ GG D + G +TY P+S+ YWQF ++ + + + +C CQAIAD
Sbjct: 184 RDASAAQGGELVLGGSDPAHYRGPLTYVPLSKDTYWQFQMDGVLVNGSSFCKRGCQAIAD 243
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLTRTN 177
TGTSLI GP + +A LN IGA P+A G +DC + ++P V + + FTL T+
Sbjct: 244 TGTSLIGGPVEEVAALNAKIGATPMAFGQFALDCSLIPRLPPVTFTIANQKFTLEGTD 301
Score = 72.0 bits (175), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 30/41 (73%), Positives = 34/41 (82%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQYYG IS+G PPQ FKV+FDTGSSNLW+PS+ C NIAC
Sbjct: 38 AQYYGPISIGNPPQTFKVVFDTGSSNLWVPSKKCHYTNIAC 78
>gi|195471992|ref|XP_002088286.1| GE18491 [Drosophila yakuba]
gi|194174387|gb|EDW87998.1| GE18491 [Drosophila yakuba]
Length = 392
Score = 104 bits (260), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 49/113 (43%), Positives = 68/113 (60%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSNCQAIADT 120
R + GGE++ GG+D + G +TY PVS YWQF V +IK V C+ CQAIADT
Sbjct: 221 RQGTAVRGGELILGGIDSSLYKGSLTYVPVSVPAYWQFAVNTIKTNGIVLCNGCQAIADT 280
Query: 121 GTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
GTSLI+ P ++N+ +GA +G A V C + +P V++ +GG FTL
Sbjct: 281 GTSLIVAPLAAYRKINRQLGATDNGDGEAFVSCSRVSTLPKVNLNIGGTIFTL 333
Score = 64.3 bits (155), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 25/47 (53%), Positives = 34/47 (72%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+ +S +YYG I++GTP Q F ++FDTGS+NLW+PS C NIAC
Sbjct: 68 LHNSMNNEYYGVIAIGTPKQRFNILFDTGSANLWVPSSSCPASNIAC 114
>gi|195134378|ref|XP_002011614.1| GI11124 [Drosophila mojavensis]
gi|193906737|gb|EDW05604.1| GI11124 [Drosophila mojavensis]
Length = 373
Score = 104 bits (260), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 49/113 (43%), Positives = 70/113 (61%), Gaps = 4/113 (3%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSNCQAIADT 120
R+ + GGE++FGG D + G++TY P+S++GYWQF V+SI I+ C NCQAIADT
Sbjct: 213 RDGTSNEGGELIFGGSDSSLYTGELTYVPISQQGYWQFAVDSISIDGQTLCDNCQAIADT 272
Query: 121 GTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
GTSL++ LN L+ + VDC +D MP +++ +GG FTL
Sbjct: 273 GTSLLVVSEDAYDILNNLLN----VDEDGLVDCSAVDSMPVLNLNIGGTKFTL 321
Score = 70.5 bits (171), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 32/70 (45%), Positives = 49/70 (70%), Gaps = 4/70 (5%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHF 60
+ +S YYGTIS+GTPPQ FKV+FD+GSSNLW+PS C ++AC ++ +++ H
Sbjct: 60 LSNSINMAYYGTISIGTPPQSFKVLFDSGSSNLWVPSSTCWFFDVAC-MNHNQY---DHD 115
Query: 61 RNASDENGGE 70
++++ E GE
Sbjct: 116 KSSTYEANGE 125
>gi|190576608|gb|ACE79095.1| cathepsin D precursor (predicted) [Sorex araneus]
Length = 405
Score = 104 bits (260), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 52/128 (40%), Positives = 78/128 (60%), Gaps = 3/128 (2%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKN 108
VDK+ F + + R+ + + GGE+M GG+D F G +TY V+R+ YWQ ++ I +
Sbjct: 217 VDKNIFSFYLN-RDPTAQPGGELMLGGIDSKYFKGSMTYHNVTRQAYWQVHMDQIDVGNG 275
Query: 109 VYC--SNCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
+ C+AI DTGTSL++GP + EL K IGAVPL G + C+ L +P V + L
Sbjct: 276 LTLCKGGCEAIVDTGTSLLVGPVDEVKELQKAIGAVPLIQGEYIIPCEKLPDLPTVSLTL 335
Query: 167 GGKNFTLT 174
GGK ++L+
Sbjct: 336 GGKEYSLS 343
Score = 71.6 bits (174), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 30/47 (63%), Positives = 37/47 (78%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+R+ AQYYG I +GTPPQ F V+FDTGSSNLW+PS C +L+IAC
Sbjct: 72 LRNYMDAQYYGEIGIGTPPQCFTVVFDTGSSNLWVPSVKCQLLDIAC 118
>gi|348521340|ref|XP_003448184.1| PREDICTED: cathepsin E-A-like [Oreochromis niloticus]
Length = 406
Score = 104 bits (260), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 51/128 (39%), Positives = 78/128 (60%), Gaps = 2/128 (1%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEK- 107
VD+ F + +++ + GE++ GG ++ ++G I + PV+ KGYWQ ++S+ ++
Sbjct: 214 VDQPVFSFYLSRKSSRTKPAGELLLGGTNEALYIGPINWHPVTAKGYWQIKMDSVAVQGV 273
Query: 108 NVYC-SNCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
N++C S CQAI DTGTSLI GP+ I L +LIGA P G +DC L +P V +L
Sbjct: 274 NLFCPSGCQAIVDTGTSLIAGPTNDILRLQQLIGATPSNLGEFVIDCARLSSLPQVTFVL 333
Query: 167 GGKNFTLT 174
G +TLT
Sbjct: 334 NGTEYTLT 341
Score = 61.6 bits (148), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 30/64 (46%), Positives = 43/64 (67%), Gaps = 6/64 (9%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASDE 66
AQ+YG ISLGTP Q F VIFDTGS++LW+PS +C ++ AC++ + F++AS
Sbjct: 80 AQFYGEISLGTPEQNFSVIFDTGSADLWVPSTYC--ISEACALHRR----FKAFKSASFR 133
Query: 67 NGGE 70
+ G
Sbjct: 134 HDGR 137
>gi|71727523|gb|AAZ39883.1| cathepsin D-like aspartic protease [Opisthorchis viverrini]
Length = 425
Score = 104 bits (260), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 50/114 (43%), Positives = 72/114 (63%), Gaps = 1/114 (0%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN-CQAIAD 119
RNASD GGE++ GG D + G+I ++P++ + YWQF V+S+ + C N CQAIAD
Sbjct: 214 RNASDPVGGELLLGGTDPKYYKGEILWAPLTHEAYWQFKVDSMSVGGMKLCENGCQAIAD 273
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
TGTSLI GPS+ + +LN +GA+ L G ++CD + +P V + GK L
Sbjct: 274 TGTSLIAGPSEEVGKLNDALGAIKLPGGTYYINCDRVSTLPLVQFNINGKLMEL 327
Score = 78.6 bits (192), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 32/41 (78%), Positives = 36/41 (87%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQYYG I +GTPPQ F+V+FDTGSSNLW+PS HCSI NIAC
Sbjct: 67 AQYYGEIGIGTPPQSFQVVFDTGSSNLWVPSTHCSIFNIAC 107
>gi|358255149|dbj|GAA56870.1| cathepsin D [Clonorchis sinensis]
Length = 425
Score = 104 bits (260), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 49/114 (42%), Positives = 71/114 (62%), Gaps = 1/114 (0%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN-CQAIAD 119
RNASD GGE++ GG D + G+I ++P++ + YWQF V+S+ + C N CQAIAD
Sbjct: 214 RNASDPVGGELLLGGTDPKYYKGEILWAPLTHEAYWQFKVDSMNVGSMKLCENGCQAIAD 273
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
TGTSLI GPS+ + +LN +GA+ + G +DC + +P V + GK L
Sbjct: 274 TGTSLIAGPSEEVGKLNDALGAIKIPGGTYYIDCSRVSTLPPVQFSISGKLMQL 327
Score = 77.4 bits (189), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 31/41 (75%), Positives = 37/41 (90%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQYYG I +GTPPQ F+V+FDTGSSNLW+PS+HCSI +IAC
Sbjct: 67 AQYYGEIGIGTPPQSFEVVFDTGSSNLWVPSKHCSIFSIAC 107
>gi|118429511|gb|ABK91803.1| aspartic protease precursor [Clonorchis sinensis]
Length = 425
Score = 104 bits (260), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 49/114 (42%), Positives = 71/114 (62%), Gaps = 1/114 (0%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN-CQAIAD 119
RNASD GGE++ GG D + G+I ++P++ + YWQF V+S+ + C N CQAIAD
Sbjct: 214 RNASDPVGGELLLGGTDPKYYKGEILWAPLTHEAYWQFKVDSMNVGSMKLCENGCQAIAD 273
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
TGTSLI GPS+ + +LN +GA+ + G +DC + +P V + GK L
Sbjct: 274 TGTSLIAGPSEEVGKLNDALGAIKIPGGTYYIDCSRVSTLPPVQFSISGKLMQL 327
Score = 77.4 bits (189), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 31/41 (75%), Positives = 37/41 (90%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQYYG I +GTPPQ F+V+FDTGSSNLW+PS+HCSI +IAC
Sbjct: 67 AQYYGEIGIGTPPQSFEVVFDTGSSNLWVPSKHCSIFSIAC 107
>gi|315440803|gb|ADU20407.1| aspartic protease 1 [Clonorchis sinensis]
Length = 425
Score = 104 bits (259), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 49/114 (42%), Positives = 71/114 (62%), Gaps = 1/114 (0%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN-CQAIAD 119
RNASD GGE++ GG D + G+I ++P++ + YWQF V+S+ + C N CQAIAD
Sbjct: 214 RNASDPVGGELLLGGTDPKYYKGEILWAPLTHEAYWQFKVDSMNVGSMKLCENGCQAIAD 273
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
TGTSLI GPS+ + +LN +GA+ + G +DC + +P V + GK L
Sbjct: 274 TGTSLIAGPSEEVGKLNDALGAIKIPGGTYYIDCSRVSTLPPVQFSISGKLMQL 327
Score = 77.0 bits (188), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 31/41 (75%), Positives = 37/41 (90%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQYYG I +GTPPQ F+V+FDTGSSNLW+PS+HCSI +IAC
Sbjct: 67 AQYYGEIGIGTPPQSFEVVFDTGSSNLWVPSKHCSIFSIAC 107
>gi|407728652|gb|AFU24355.1| cathepsin D [Ctenopharyngodon idella]
Length = 398
Score = 104 bits (259), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 49/116 (42%), Positives = 68/116 (58%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCSNCQAIA 118
RN + GGE++ GG D + GD Y +SR+ YWQ ++ + I E + C+AI
Sbjct: 221 RNPDTQPGGELLLGGTDPKYYTGDFNYVDISRQAYWQIHMDGMSIGSELTLCKGGCEAIV 280
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSLI GP+ I L K IGA+PL G VDC + +P + +LGGK ++LT
Sbjct: 281 DTGTSLITGPATEIKALQKAIGAIPLIQGEYMVDCKKVPTLPTISFVLGGKTYSLT 336
Score = 73.2 bits (178), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 30/47 (63%), Positives = 38/47 (80%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+++ AQYYG I LGTP Q F V+FDTGSSNLW+PS HCS+++IAC
Sbjct: 68 LKNYLDAQYYGEIGLGTPVQSFTVVFDTGSSNLWVPSVHCSLMDIAC 114
>gi|56417363|gb|AAV90625.1| cathepsin D protein [Sus scrofa]
Length = 395
Score = 104 bits (259), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 51/131 (38%), Positives = 79/131 (60%), Gaps = 3/131 (2%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKN 108
VDK+ F + + R+ + GGE+M GG+D + G + Y V+RK YWQ ++ + + +
Sbjct: 207 VDKNIFSFYLN-RDPGAQPGGELMLGGIDSKYYKGSLDYHNVTRKAYWQIHMDQVAVGSS 265
Query: 109 VYC--SNCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
+ C+AI DTGTSLI+GP + + EL K IGAVPL G + C+ + +P+V + L
Sbjct: 266 LTLCKGGCEAIVDTGTSLIVGPVEEVRELQKAIGAVPLIQGEYMIPCEKVPSLPDVTVTL 325
Query: 167 GGKNFTLTRTN 177
GGK + L+ N
Sbjct: 326 GGKKYKLSSEN 336
Score = 75.9 bits (185), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 44/122 (36%), Positives = 61/122 (50%), Gaps = 27/122 (22%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHF 60
+++ AQYYG I +GTPPQ F V+FDTGSSNLW+PS HC +L+IAC + H
Sbjct: 56 LKNYMDAQYYGEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIACWI---------HH 106
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQ------------FGVESIKIEKN 108
+ S ++ + K+ I Y S GYW GV IK+E+
Sbjct: 107 KYNSGKSSTYV------KNGTTFAIHYGSGSLSGYWSQDTVSVPCNSALLGVGGIKVERQ 160
Query: 109 VY 110
+
Sbjct: 161 TF 162
>gi|83523775|ref|NP_001032810.1| cathepsin D precursor [Sus scrofa]
gi|65330113|gb|AAY42144.1| cathepsin D [Sus scrofa]
Length = 410
Score = 104 bits (259), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 51/131 (38%), Positives = 79/131 (60%), Gaps = 3/131 (2%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKN 108
VDK+ F + + R+ + GGE+M GG+D + G + Y V+RK YWQ ++ + + +
Sbjct: 222 VDKNIFSFYLN-RDPGAQPGGELMLGGIDSKYYKGSLDYHNVTRKAYWQIHMDQVAVGSS 280
Query: 109 VYC--SNCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
+ C+AI DTGTSLI+GP + + EL K IGAVPL G + C+ + +P+V + L
Sbjct: 281 LTLCKGGCEAIVDTGTSLIVGPVEEVRELQKAIGAVPLIQGEYMIPCEKVPSLPDVTVTL 340
Query: 167 GGKNFTLTRTN 177
GGK + L+ N
Sbjct: 341 GGKKYKLSSEN 351
Score = 72.0 bits (175), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 43/122 (35%), Positives = 60/122 (49%), Gaps = 27/122 (22%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHF 60
+++ AQ YG I +GTPPQ F V+FDTGSSNLW+PS HC +L+IAC + H
Sbjct: 71 LKNYMDAQNYGEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIACWI---------HH 121
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQ------------FGVESIKIEKN 108
+ S ++ + K+ I Y S GYW GV IK+E+
Sbjct: 122 KYNSGKSSTYV------KNGTTFAIHYGSGSLSGYWSQDTVSVPCNSALLGVGGIKVERQ 175
Query: 109 VY 110
+
Sbjct: 176 TF 177
>gi|332024606|gb|EGI64804.1| Lysosomal aspartic protease [Acromyrmex echinatior]
Length = 785
Score = 104 bits (259), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 49/120 (40%), Positives = 77/120 (64%), Gaps = 2/120 (1%)
Query: 59 HFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN-CQAI 117
H+ N+ E GE++ GG D + ++G++TY V+ KGYWQ ++ I+I N CSN C+AI
Sbjct: 177 HYENSVSEFAGELILGGSDPNHYLGELTYVNVTHKGYWQITMDKIQIGHNSLCSNSCEAI 236
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLTRTN 177
DTGT+++IG I +N+LIGA + N VDC+ + +PN+ +GGK+F +T +
Sbjct: 237 VDTGTTMLIGSRLDIEIINELIGATYI-NDEIIVDCNRIHYLPNISFFIGGKSFEVTHAD 295
Score = 50.4 bits (119), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 29/76 (38%), Positives = 42/76 (55%), Gaps = 4/76 (5%)
Query: 103 IKIEKNVYCSN-CQAIADTGTSLIIGPSKVIAELNKLIGA---VPLANGPAKVDCDNLDK 158
IK+ V+C + C+AIA TG S + GP+ I +N + V +++G VDC + K
Sbjct: 436 IKMRHMVFCDDGCEAIAHTGFSGLSGPASEIEFINNEVDTLRRVGISHGDTLVDCHEISK 495
Query: 159 MPNVDIILGGKNFTLT 174
+PNV L K F LT
Sbjct: 496 LPNVTFFLNNKPFVLT 511
Score = 43.5 bits (101), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 27/77 (35%), Positives = 39/77 (50%), Gaps = 5/77 (6%)
Query: 103 IKIEKNVYCSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLAN----GPAKVDCDNLD 157
IK+ ++C + C+AIA TG S + GP+ I +N I ++ G VDC +
Sbjct: 576 IKMRHMIFCEDGCEAIAHTGFSGLSGPASEIEFINNEIDSLRRVGISHAGDLFVDCRQIS 635
Query: 158 KMPNVDIILGGKNFTLT 174
K+PNV L K LT
Sbjct: 636 KLPNVIFFLNNKPLVLT 652
>gi|195399279|ref|XP_002058248.1| GJ15983 [Drosophila virilis]
gi|194150672|gb|EDW66356.1| GJ15983 [Drosophila virilis]
Length = 372
Score = 103 bits (258), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 47/113 (41%), Positives = 67/113 (59%), Gaps = 4/113 (3%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSNCQAIADT 120
R+ + GGE++FGG D + GD+TY P+S +GYWQF + I+ C NCQAIADT
Sbjct: 212 RDGTSSQGGELIFGGSDSSLYSGDLTYVPISEQGYWQFTMAGASIDGQSLCDNCQAIADT 271
Query: 121 GTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
GTSLI+ P+ +LN ++ + VDC ++ MP + +GG NF L
Sbjct: 272 GTSLIVAPANAYMQLNDILN----VDDQGLVDCSSVSSMPVITFNIGGTNFDL 320
Score = 64.7 bits (156), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 28/49 (57%), Positives = 37/49 (75%), Gaps = 2/49 (4%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSV 49
+ +S YYG I++GTPPQ FKV+FD+GSSNLW+PS+ CS + AC V
Sbjct: 61 LSNSMNMAYYGAITIGTPPQSFKVLFDSGSSNLWVPSKTCS--SYACEV 107
>gi|260826476|ref|XP_002608191.1| hypothetical protein BRAFLDRAFT_115250 [Branchiostoma floridae]
gi|229293542|gb|EEN64201.1| hypothetical protein BRAFLDRAFT_115250 [Branchiostoma floridae]
Length = 225
Score = 103 bits (258), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 55/128 (42%), Positives = 75/128 (58%), Gaps = 4/128 (3%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKN 108
VDK F + + R+AS GGE++ GG D + GD T+ V++ GYWQF ++ I I
Sbjct: 24 VDKPVFSFYLN-RDASGATGGELLLGGTDPKYYSGDFTFLDVTKPGYWQFKMDGIMINGK 82
Query: 109 V--YC-SNCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDII 165
+C C AIADTGTSLI GP+ + LN+ IGA P+ G VDC + +P + +
Sbjct: 83 ASDFCKGGCAAIADTGTSLIAGPTSEVKALNEQIGATPIPGGEYMVDCSQIKSLPPISFM 142
Query: 166 LGGKNFTL 173
LGGK F L
Sbjct: 143 LGGKAFQL 150
>gi|1246039|gb|AAB35843.1| pepsinogen 2 [tuna, Peptide, 360 aa]
Length = 360
Score = 103 bits (258), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 50/118 (42%), Positives = 73/118 (61%), Gaps = 2/118 (1%)
Query: 58 SHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-KNVYCSN-CQ 115
S + +++ + G E++FGG+D + G IT+ P++ YWQ ++S+ I + V CS CQ
Sbjct: 190 SVYLSSNSQQGSEVVFGGIDSSHYTGQITWIPLTSATYWQIQMDSVTINGQTVACSGGCQ 249
Query: 116 AIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
AI DTGTSLI+GPS+ I +N +GA NG A V+C N+ MP V L G FT+
Sbjct: 250 AIIDTGTSLIVGPSRDIYNMNAWVGASTTQNGDATVNCQNIQSMPEVTFTLNGHAFTI 307
Score = 63.9 bits (154), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 26/33 (78%), Positives = 30/33 (90%)
Query: 9 YYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCS 41
YYG +S+GTPPQ FKVIFDTGSSNLW+PS +CS
Sbjct: 53 YYGVVSIGTPPQSFKVIFDTGSSNLWVPSVYCS 85
>gi|24571212|gb|AAN62917.1| cathepsin D [Ctenopharyngodon idella]
Length = 203
Score = 103 bits (258), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 49/116 (42%), Positives = 68/116 (58%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCSNCQAIA 118
RN + GGE++ GG D + GD Y +SR+ YWQ ++ + I E + C+AI
Sbjct: 26 RNPDTQPGGELLLGGTDPKYYTGDFNYVDISRQAYWQIHMDGMSIGSELTLCKGGCEAIV 85
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSLI GP+ I L K IGA+PL G VDC + +P + +LGGK ++LT
Sbjct: 86 DTGTSLITGPATEIKALQKAIGAIPLIQGEYMVDCKKVPTLPTISFVLGGKTYSLT 141
>gi|321461134|gb|EFX72169.1| hypothetical protein DAPPUDRAFT_189045 [Daphnia pulex]
Length = 391
Score = 103 bits (258), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 52/118 (44%), Positives = 78/118 (66%), Gaps = 5/118 (4%)
Query: 61 RNASDE-NGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE---KNVYC-SNCQ 115
R+ASD NGGEI+FGG + D +VG+I Y PV+RK YWQF + + IE + +C C+
Sbjct: 211 RDASDPVNGGEIVFGGTNPDHYVGEINYIPVTRKAYWQFRADGLMIEGIPEYPFCDGGCE 270
Query: 116 AIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
I+DTGTS+I GP++ + LN+L+GA+ + NG A + C + +P + I + G +TL
Sbjct: 271 MISDTGTSVIAGPAEEVNLLNRLLGAINIINGEAVISCLRIPYLPPITITISGLPYTL 328
Score = 73.9 bits (180), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 38/84 (45%), Positives = 51/84 (60%), Gaps = 5/84 (5%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHF 60
+R+ +QY+G I+LGTPPQ+F VIFDTGS+NLW+PS CS N+AC V +S
Sbjct: 58 LRNFQDSQYFGPITLGTPPQDFTVIFDTGSANLWVPSSQCSEENLACKVHNQY---NSSL 114
Query: 61 RNASDENGGE--IMFGGVDKDKFV 82
+ NG E I +G D F+
Sbjct: 115 SDTYKPNGTEFSIQYGTGAMDGFL 138
>gi|194862073|ref|XP_001969914.1| GG23678 [Drosophila erecta]
gi|190661781|gb|EDV58973.1| GG23678 [Drosophila erecta]
Length = 392
Score = 103 bits (258), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 49/113 (43%), Positives = 66/113 (58%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSNCQAIADT 120
R + GGE++ GG+D + G +TY PVS YWQF V +IK + CS CQAIADT
Sbjct: 221 RQGTAVLGGELILGGIDSSLYKGSLTYVPVSVPAYWQFTVNTIKTNGVLLCSGCQAIADT 280
Query: 121 GTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
GTSLI+ P +N+ +GA G A V C + +P V++ +GG FTL
Sbjct: 281 GTSLIVAPLAAYKRINRQLGATDNGGGEAFVSCSRVSALPKVNLNIGGTAFTL 333
Score = 62.8 bits (151), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 24/47 (51%), Positives = 34/47 (72%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+++S +YYG I++GTP Q F ++FDTGS+NLW+PS C N AC
Sbjct: 68 LQNSMNNEYYGVITIGTPQQRFNILFDTGSANLWVPSASCPASNTAC 114
>gi|334562337|gb|AEG79714.1| cathepsin D [Apostichopus japonicus]
Length = 372
Score = 103 bits (258), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 52/117 (44%), Positives = 74/117 (63%), Gaps = 4/117 (3%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCS--NCQA 116
++ S GGE++ GG D + G+ TY VS+KGYWQF ++ + I + YC+ NC A
Sbjct: 214 KDPSAAVGGELLLGGTDSKYYTGNFTYVKVSKKGYWQFAMDKVSIGGKDAGYCTGKNCSA 273
Query: 117 IADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
I DTGTSLI GP+ I +LNK IGA+PL G A + C+ + +P++ L G +FTL
Sbjct: 274 ICDTGTSLIAGPTADINDLNKKIGAIPLIKGEAIILCNTIPSLPDISFQLNGHDFTL 330
Score = 74.3 bits (181), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 35/64 (54%), Positives = 44/64 (68%), Gaps = 2/64 (3%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDK--DKFVGHS 58
++ AQYYG I+LGTPPQ+F V+FDTGSSNLW+PS CS +IACS K D V +
Sbjct: 61 LKDYLDAQYYGPITLGTPPQDFVVVFDTGSSNLWVPSSTCSWKDIACSFTKKYDHSVSST 120
Query: 59 HFRN 62
+ N
Sbjct: 121 YVAN 124
>gi|26354406|dbj|BAC40831.1| unnamed protein product [Mus musculus]
Length = 445
Score = 103 bits (258), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 51/127 (40%), Positives = 79/127 (62%), Gaps = 3/127 (2%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--E 106
VDK+ F + + R+ + GGE+M GG D + G+++Y V+RK YWQ ++ +++ E
Sbjct: 222 VDKNIFSFYLN-RDPEGQPGGELMLGGTDSKYYHGELSYLNVTRKAYWQVHMDQLEVGNE 280
Query: 107 KNVYCSNCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
+ C+AI DTGTSL++GP + + EL K IGAVPL G + C+ + +P V + L
Sbjct: 281 LTLCKGGCEAIVDTGTSLLVGPVEEVKELQKAIGAVPLIQGEYMIPCEKVSSLPTVYLKL 340
Query: 167 GGKNFTL 173
GGKN+ L
Sbjct: 341 GGKNYEL 347
Score = 76.3 bits (186), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 47/122 (38%), Positives = 62/122 (50%), Gaps = 27/122 (22%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHF 60
+++ AQYYG I +GTPPQ F V+FDTGSSNLW+PS HC IL+IAC V H
Sbjct: 71 LKNYLDAQYYGDIGIGTPPQCFTVVFDTGSSNLWVPSIHCKILDIACWV---------HH 121
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYW------------QFGVESIKIEKN 108
+ SD++ + K+ DI Y S GY Q IK+EK
Sbjct: 122 KYNSDKSSTYV------KNGTSFDIHYGSGSLSGYLSQDTVSVPCKSDQSKARGIKVEKQ 175
Query: 109 VY 110
++
Sbjct: 176 IF 177
>gi|195046637|ref|XP_001992191.1| GH24623 [Drosophila grimshawi]
gi|193893032|gb|EDV91898.1| GH24623 [Drosophila grimshawi]
Length = 374
Score = 103 bits (258), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 50/126 (39%), Positives = 74/126 (58%), Gaps = 4/126 (3%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKN 108
VD+ F + S +GGE++FGG D + GD+TY PVS++GYWQF ++S + N
Sbjct: 202 VDQSVFSFYLARAGTSTTDGGELIFGGSDSSLYSGDLTYVPVSQEGYWQFTMDSATADGN 261
Query: 109 VYCSNCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGG 168
C +CQAIADTGTSLI+ P+ LN+++ + VDC + +P + +GG
Sbjct: 262 SLCDDCQAIADTGTSLIVAPANAYELLNEILN----VDDEGLVDCSTISSLPVITFNIGG 317
Query: 169 KNFTLT 174
NF L+
Sbjct: 318 TNFDLS 323
Score = 62.0 bits (149), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 26/49 (53%), Positives = 36/49 (73%), Gaps = 2/49 (4%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSV 49
+ +S YYG I++GTPPQ F+V+FD+GSSNLW+PS C+ + AC V
Sbjct: 62 LSNSMNMAYYGAITIGTPPQSFEVLFDSGSSNLWVPSNTCT--STACEV 108
>gi|118344566|ref|NP_001072055.1| nothepsin precursor [Takifugu rubripes]
gi|55771088|dbj|BAD69804.1| nothepsin [Takifugu rubripes]
Length = 414
Score = 103 bits (258), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 52/116 (44%), Positives = 73/116 (62%), Gaps = 3/116 (2%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEK-NVYC-SNCQAIA 118
R + + GE++ GG+D+D F G I + PV+ KGYWQ V+S+ ++ + +C CQAI
Sbjct: 234 RFSGSKLQGELLLGGMDQDLFTGPINWLPVTTKGYWQIKVDSVAVQGVDTFCPEGCQAIV 293
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSLI GP++ I L +LIGA P G DC L +P V +LGG+ +TLT
Sbjct: 294 DTGTSLIAGPTRDILRLQQLIGATPTNIG-VVTDCVRLSSLPRVTFVLGGEEYTLT 348
Score = 56.2 bits (134), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 37/68 (54%), Gaps = 2/68 (2%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASDE 66
Q+YG I LGTP Q F V+FDTGSS+LW+PS +C ++ C +F D
Sbjct: 85 VQFYGEIELGTPGQNFSVVFDTGSSDLWVPSVYC--VSQTCGTVHRRFKAFESTSYRHDG 142
Query: 67 NGGEIMFG 74
EI +G
Sbjct: 143 RVFEIHYG 150
>gi|195470499|ref|XP_002087544.1| GE17593 [Drosophila yakuba]
gi|194173645|gb|EDW87256.1| GE17593 [Drosophila yakuba]
Length = 404
Score = 103 bits (258), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 51/113 (45%), Positives = 68/113 (60%), Gaps = 1/113 (0%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSNCQAIADT 120
RN S GG++ GG D++ G++TY+PV+ +GYWQF V +I V S CQAIADT
Sbjct: 237 RNGSAVEGGQLTLGGTDQELIAGEMTYTPVTEQGYWQFAVNNITWNGTVISSGCQAIADT 296
Query: 121 GTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
GTSLI PS +LN LIG + L G V C + +P + I +GG +F L
Sbjct: 297 GTSLIAAPSAAYIQLNNLIGGI-LIQGDYYVPCSTVSSLPVLTINIGGTDFYL 348
Score = 60.8 bits (146), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 25/47 (53%), Positives = 33/47 (70%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+ +S YYG I +GTP Q FKV+FDTGS+NLW+PS C ++AC
Sbjct: 84 LGNSMNMYYYGLIGIGTPEQYFKVVFDTGSANLWVPSAQCLATDVAC 130
>gi|449280945|gb|EMC88160.1| Cathepsin E, partial [Columba livia]
Length = 374
Score = 103 bits (257), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 50/115 (43%), Positives = 77/115 (66%), Gaps = 3/115 (2%)
Query: 62 NASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAIAD 119
N GGE++FGG D +F+G + + PV+++GYWQ +++I++ V +C+N CQAI D
Sbjct: 200 NPESSLGGELLFGGFDPSRFMGTLNWVPVTQQGYWQIQLDNIQLAGTVAFCTNGCQAIVD 259
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
TGTSLI GP+K + L K IGA P+ +G V+C+NL+ MP+V + G + L+
Sbjct: 260 TGTSLITGPTKDVKVLQKYIGATPV-DGEYAVECNNLNVMPDVTFTINGLPYLLS 313
Score = 59.7 bits (143), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 25/41 (60%), Positives = 34/41 (82%), Gaps = 2/41 (4%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACS 48
+Y+G IS+GTPPQ F V+FDTGSSNLW+PS +C ++ AC+
Sbjct: 55 EYFGQISIGTPPQNFTVVFDTGSSNLWVPSVYC--VSKACA 93
>gi|190014572|dbj|BAG48264.1| pepsinogen 2 [Thunnus orientalis]
Length = 376
Score = 103 bits (257), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 50/118 (42%), Positives = 73/118 (61%), Gaps = 2/118 (1%)
Query: 58 SHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-KNVYCSN-CQ 115
S + +++ + G E++FGG+D + G IT+ P++ YWQ ++S+ I + V CS CQ
Sbjct: 206 SVYLSSNSQQGNEVVFGGIDSSHYTGKITWIPLTSATYWQIQMDSVTINGQTVACSGGCQ 265
Query: 116 AIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
AI DTGTSLI+GPS+ I +N +GA NG A V+C N+ MP V L G FT+
Sbjct: 266 AIIDTGTSLIVGPSRDIYNMNAWVGASTTQNGDATVNCQNIQSMPEVTFTLNGHAFTI 323
Score = 64.3 bits (155), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 26/33 (78%), Positives = 30/33 (90%)
Query: 9 YYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCS 41
YYG +S+GTPPQ FKVIFDTGSSNLW+PS +CS
Sbjct: 69 YYGVVSIGTPPQSFKVIFDTGSSNLWVPSVYCS 101
>gi|307175237|gb|EFN65289.1| Lysosomal aspartic protease [Camponotus floridanus]
Length = 464
Score = 103 bits (257), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 51/114 (44%), Positives = 75/114 (65%), Gaps = 2/114 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYC-SNCQAIAD 119
R++S E GGE + GG D + G+ TY PV+RKG+WQF +++I I ++ C +CQAIAD
Sbjct: 108 RDSSAELGGEFILGGSDPAHYDGNFTYIPVTRKGFWQFTMDNIIINDHILCVESCQAIAD 167
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
TG S I GP I ++NKLIG + + +G +V+C + ++P + ILG K F L
Sbjct: 168 TGASHICGPKSDITKINKLIGTINV-DGDERVNCSRISELPTIRFILGDKVFNL 220
Score = 91.7 bits (226), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 47/122 (38%), Positives = 70/122 (57%), Gaps = 2/122 (1%)
Query: 54 FVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN 113
FV ++++S+E GGE + GG D D + GD TY + K WQF ++++ I ++ C
Sbjct: 286 FVTSKPYKDSSNETGGEFILGGSDPDHYEGDFTYLSILHKEQWQFNMDTVIINDHILCEE 345
Query: 114 -CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFT 172
C AIADTG+S I GP I +NK IG + NG +V+C + ++P + IL F
Sbjct: 346 GCLAIADTGSSDISGPISDITYINKFIGTFNV-NGQERVNCSRISELPTISFILDNVAFD 404
Query: 173 LT 174
LT
Sbjct: 405 LT 406
>gi|195380081|ref|XP_002048799.1| GJ21122 [Drosophila virilis]
gi|194143596|gb|EDW59992.1| GJ21122 [Drosophila virilis]
Length = 391
Score = 103 bits (257), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 50/114 (43%), Positives = 66/114 (57%), Gaps = 1/114 (0%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN-CQAIAD 119
R+ GGEI+FGG D + + GD TY PV+RKGYWQ ++S ++ C CQ IAD
Sbjct: 217 RDPKAPEGGEIIFGGSDPNHYTGDFTYLPVTRKGYWQIKMDSAQLNNLELCKGGCQIIAD 276
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
TGTSLI P +N+ IG P+ G V CD + +P + +LGGK F L
Sbjct: 277 TGTSLIAAPVAEATSINQAIGGTPIVGGQYIVSCDMIPNLPVIKFVLGGKTFEL 330
Score = 73.9 bits (180), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 30/41 (73%), Positives = 36/41 (87%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQYYG IS+G+PPQ FKV+FDTGSSNLW+PS+ C + NIAC
Sbjct: 70 AQYYGPISIGSPPQNFKVVFDTGSSNLWVPSKKCHLTNIAC 110
>gi|74191361|dbj|BAE30263.1| unnamed protein product [Mus musculus]
Length = 410
Score = 103 bits (257), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 51/127 (40%), Positives = 79/127 (62%), Gaps = 3/127 (2%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--E 106
VDK+ F + + R+ + GGE+M GG D + G+++Y V+RK YWQ ++ +++ E
Sbjct: 222 VDKNTFSFYLN-RDPEGQPGGELMLGGTDSKYYHGELSYLNVTRKAYWQVHMDQLEVGNE 280
Query: 107 KNVYCSNCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
+ C+AI DTGTSL++GP + + EL K IGAVPL G + C+ + +P V + L
Sbjct: 281 LTLCKGGCEAIVDTGTSLLVGPVEEVKELQKAIGAVPLIQGEYMIPCEKVSSLPTVYLKL 340
Query: 167 GGKNFTL 173
GGKN+ L
Sbjct: 341 GGKNYEL 347
Score = 76.3 bits (186), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 47/122 (38%), Positives = 62/122 (50%), Gaps = 27/122 (22%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHF 60
+++ AQYYG I +GTPPQ F V+FDTGSSNLW+PS HC IL+IAC V H
Sbjct: 71 LKNYLDAQYYGDIGIGTPPQCFTVVFDTGSSNLWVPSIHCKILDIACWV---------HH 121
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYW------------QFGVESIKIEKN 108
+ SD++ + K+ DI Y S GY Q IK+EK
Sbjct: 122 KYNSDKSSTYV------KNGTSFDIHYGSGSLSGYLSQDTVSVPCKSDQSKARGIKVEKQ 175
Query: 109 VY 110
++
Sbjct: 176 IF 177
>gi|2851407|sp|P16228.3|CATE_RAT RecName: Full=Cathepsin E; Flags: Precursor
gi|1113086|dbj|BAA08128.1| cathepsin E precursor [Rattus rattus]
gi|149058663|gb|EDM09820.1| cathepsin E, isoform CRA_a [Rattus norvegicus]
Length = 398
Score = 103 bits (257), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 54/119 (45%), Positives = 76/119 (63%), Gaps = 6/119 (5%)
Query: 63 ASDENGG---EIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAI 117
+SD GG E+ FGG D F G + + PV+++GYWQ ++ I++ V +CS CQAI
Sbjct: 222 SSDPQGGSGSELTFGGYDPSHFSGSLNWIPVTKQGYWQIALDGIQVGDTVMFCSEGCQAI 281
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLTRT 176
DTGTSLI GP K I +L + IGA P+ +G VDC L+ MPNV ++ G ++TL+ T
Sbjct: 282 VDTGTSLITGPPKKIKQLQEAIGATPM-DGEYAVDCATLNMMPNVTFLINGVSYTLSPT 339
Score = 57.0 bits (136), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 22/34 (64%), Positives = 30/34 (88%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCS 41
+Y+GT+S+G+P Q F VIFDTGSSNLW+PS +C+
Sbjct: 79 EYFGTVSIGSPSQNFTVIFDTGSSNLWVPSVYCT 112
>gi|146217392|gb|ABQ10738.1| cathepsin D [Penaeus monodon]
Length = 386
Score = 103 bits (257), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 50/116 (43%), Positives = 71/116 (61%), Gaps = 3/116 (2%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV--YC-SNCQAI 117
R+ + GGE++ GG D + GD TY PV R+GYWQF ++ +++ +C C+AI
Sbjct: 210 RDPAAAEGGELILGGSDPAYYTGDFTYVPVDRQGYWQFKMDGLQMNGTTVPFCDGGCEAI 269
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
ADTGTSLI PS+ +NK IGA P+ G VDC+ + +P + +L GK FTL
Sbjct: 270 ADTGTSLIAAPSEEARLINKKIGAKPIMGGEWSVDCNLIPHLPTISFVLAGKPFTL 325
Score = 73.6 bits (179), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 29/43 (67%), Positives = 36/43 (83%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSV 49
AQYYG I++GTPPQ F+V+FDTGSSNLW+PS+ C NIAC +
Sbjct: 63 AQYYGPITIGTPPQSFRVVFDTGSSNLWVPSKQCHFTNIACLI 105
>gi|38303893|gb|AAH62002.1| Ctse protein [Rattus norvegicus]
Length = 398
Score = 103 bits (257), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 54/119 (45%), Positives = 76/119 (63%), Gaps = 6/119 (5%)
Query: 63 ASDENGG---EIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAI 117
+SD GG E+ FGG D F G + + PV+++GYWQ ++ I++ V +CS CQAI
Sbjct: 222 SSDPQGGSGSELTFGGYDPSHFSGSLNWIPVTKQGYWQIALDGIQVGDTVMFCSEGCQAI 281
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLTRT 176
DTGTSLI GP K I +L + IGA P+ +G VDC L+ MPNV ++ G ++TL+ T
Sbjct: 282 VDTGTSLITGPPKKIKQLQEAIGATPM-DGEYAVDCATLNMMPNVTFLINGVSYTLSPT 339
Score = 57.0 bits (136), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 22/34 (64%), Positives = 30/34 (88%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCS 41
+Y+GT+S+G+P Q F VIFDTGSSNLW+PS +C+
Sbjct: 79 EYFGTVSIGSPSQNFTVIFDTGSSNLWVPSVYCT 112
>gi|312861579|gb|ADR10277.1| cathepsin D [Branchiostoma belcheri]
Length = 395
Score = 103 bits (257), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 55/128 (42%), Positives = 73/128 (57%), Gaps = 4/128 (3%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKN 108
VDK F + + R+ S GE++ GG D + GD T+ V++ GYWQF ++ I I
Sbjct: 203 VDKPVFSFYLN-RDPSSTTRGELLLGGTDPKYYTGDFTFLDVTKPGYWQFKMDGIMINGK 261
Query: 109 V--YCSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDII 165
YC C AIADTGTSLI GP+ + LNK IGA P+ G VDC + +P + +
Sbjct: 262 ATDYCKGGCAAIADTGTSLIAGPTTEVQALNKQIGATPIPGGEYMVDCSQVSSLPPISFM 321
Query: 166 LGGKNFTL 173
LGGK F L
Sbjct: 322 LGGKAFEL 329
Score = 70.5 bits (171), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 31/61 (50%), Positives = 43/61 (70%), Gaps = 2/61 (3%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDK--DKFVGHSHFRNAS 64
AQYYG IS+GTP Q F+V+FDTGSSNLW+PS+ C + +IAC + D ++ +N +
Sbjct: 67 AQYYGPISIGTPAQNFQVVFDTGSSNLWVPSKKCKLSDIACLLHNKYDSTQSSTYMKNGT 126
Query: 65 D 65
D
Sbjct: 127 D 127
>gi|315440805|gb|ADU20408.1| aspartic protease 2 [Clonorchis sinensis]
Length = 385
Score = 103 bits (257), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 45/118 (38%), Positives = 78/118 (66%), Gaps = 1/118 (0%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN-CQAIAD 119
RN + GGEIMFGG++K+ ++GD+ + P+S + YWQ ++ I++ C+ CQAI D
Sbjct: 213 RNDVQDIGGEIMFGGINKEHYMGDMYFVPLSSETYWQIDLDGIQVTSLTLCAQGCQAIVD 272
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLTRTN 177
TGT+LI+GP+ + +LN+ +GAV + G + ++C + +P ++ + G+N TL T+
Sbjct: 273 TGTTLIVGPTADVNQLNEALGAVSIEGGLSVLECSQIYTLPPIEFSINGENLTLQPTD 330
Score = 70.1 bits (170), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 29/43 (67%), Positives = 35/43 (81%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSV 49
+QYYG I++GTPPQ FKV+FDTGSSNLW+PS CS N AC +
Sbjct: 66 SQYYGEIAIGTPPQPFKVVFDTGSSNLWVPSNRCSPWNEACRL 108
>gi|327278613|ref|XP_003224055.1| PREDICTED: cathepsin E-like [Anolis carolinensis]
Length = 396
Score = 103 bits (257), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 53/116 (45%), Positives = 75/116 (64%), Gaps = 3/116 (2%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAIA 118
RN GGE++FGG D F G++ + PVS+KGYWQ +++I++ + +C+ CQAI
Sbjct: 217 RNPDSSIGGELIFGGYDPSLFSGNLNWIPVSKKGYWQIQLDNIQVGGTIAFCAEGCQAIV 276
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSLI GPS I ++ LIGA P+ +G V+C NL MP+V L G ++LT
Sbjct: 277 DTGTSLITGPSDDIKQMQNLIGAQPV-DGEYAVECSNLSMMPSVTFTLNGIPYSLT 331
Score = 62.0 bits (149), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 30/67 (44%), Positives = 40/67 (59%), Gaps = 9/67 (13%)
Query: 5 YQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNAS 64
+ +Y+G IS+GTPPQ F V+FDTGSSNLW+PS +C+ + AC V HS F
Sbjct: 70 FDVEYFGEISIGTPPQNFTVLFDTGSSNLWVPSVYCA--SKAC-------VEHSRFHPTE 120
Query: 65 DENGGEI 71
E+
Sbjct: 121 SSTYNEV 127
>gi|327279867|ref|XP_003224677.1| PREDICTED: cathepsin E-A-like [Anolis carolinensis]
Length = 406
Score = 103 bits (256), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 52/119 (43%), Positives = 70/119 (58%), Gaps = 2/119 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVY--CSNCQAIA 118
R + +NGG ++ GG+D F G I + PV++KGYWQ + S+KI+ V S C+AI
Sbjct: 224 REHNVDNGGVLILGGIDHSLFTGPIHWFPVTKKGYWQIHMNSVKIQGQVTSCISGCEAIV 283
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLTRTN 177
D+GTSLI GP I L + IGA P A G VDC + +P V +G + FTLT N
Sbjct: 284 DSGTSLITGPLSQIVRLQQSIGAFPTATGEFLVDCRRVSSLPPVTFSIGEREFTLTAEN 342
Score = 57.8 bits (138), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 36/89 (40%), Positives = 54/89 (60%), Gaps = 16/89 (17%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASDEN 67
+YYG +S+GTP Q+F VIFDTGS++ W+PS +C ++ AC + + KF F + S +
Sbjct: 80 EYYGEVSIGTPAQKFTVIFDTGSADFWVPSAYC--ISDACELHQ-KF---KAFSSESYAH 133
Query: 68 GGE---------IMFGGVDKDKF-VGDIT 86
GG+ + G V KDK +G+IT
Sbjct: 134 GGQKFTLQYGTGRLMGIVAKDKVQIGNIT 162
>gi|237874218|ref|NP_001153867.1| cathepsin D [Acyrthosiphon pisum]
Length = 393
Score = 103 bits (256), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 48/114 (42%), Positives = 70/114 (61%), Gaps = 1/114 (0%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI-EKNVYCSNCQAIAD 119
R+ S GGEI+FGG D +K+ G TY PV+R GYWQFG++ + + ++ QAIAD
Sbjct: 219 RDPSSTPGGEIIFGGSDPEKYTGPFTYVPVTRHGYWQFGLDEVIVGNTSIVSGALQAIAD 278
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
TGTSLI GP I ++N+L+G + G + CD +D +P + ++G F L
Sbjct: 279 TGTSLIAGPVDNIKQINELLGGTAIPGGEYIIACDQIDNLPVLSFVIGSTTFKL 332
Score = 77.8 bits (190), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 46/119 (38%), Positives = 61/119 (51%), Gaps = 15/119 (12%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASDE 66
AQYYG I++GTPPQ F V+FDTGSSNLW+PS+ CS+LNIAC + + S +
Sbjct: 72 AQYYGPITIGTPPQPFNVVFDTGSSNLWVPSKQCSVLNIACMLHNKYNMAKS---TTYXK 128
Query: 67 NGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSNCQAIADTGTSLI 125
NG E I Y S GY V S+ V + +AI + G + +
Sbjct: 129 NGTEF------------SIHYGSGSLSGYLSTDVMSMDGTSIVNQTFAEAIQEPGLAFV 175
>gi|41053329|ref|NP_956325.1| uncharacterized protein LOC336746 precursor [Danio rerio]
gi|34783813|gb|AAH56836.1| Zgc:63831 [Danio rerio]
Length = 412
Score = 103 bits (256), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 50/116 (43%), Positives = 70/116 (60%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYC--SNCQAIA 118
R+ + + GGE+M GG D+ F GD+ Y V+RK YWQ ++ +++ + S CQAI
Sbjct: 237 RDPAGDVGGELMLGGFDQQYFNGDLHYVNVTRKAYWQIKMDEVQVGSTLTLCKSGCQAIV 296
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTS+I GP + + L K IGA+PL G +DC + +P V LGGK F LT
Sbjct: 297 DTGTSMITGPVQEVRALQKAIGAIPLLMGEYWIDCKKIPTLPVVSFSLGGKMFNLT 352
Score = 77.4 bits (189), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 31/45 (68%), Positives = 39/45 (86%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDK 51
AQYYG IS+GTPPQ+F V+FDTGSSNLW+PS HC+ L+IAC + +
Sbjct: 90 AQYYGMISIGTPPQDFSVLFDTGSSNLWVPSIHCAFLDIACWLHR 134
>gi|301618285|ref|XP_002938556.1| PREDICTED: cathepsin E-A-like [Xenopus (Silurana) tropicalis]
Length = 402
Score = 103 bits (256), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 49/128 (38%), Positives = 81/128 (63%), Gaps = 3/128 (2%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-K 107
V++ F H + R+ E GGE++FGG+D + G I + P++ KGYWQ ++++K++ +
Sbjct: 213 VEQQLFSFHLN-RDYDSEYGGELIFGGIDHSLYKGQIHWIPLTEKGYWQIRLDNVKVDGE 271
Query: 108 NVYC-SNCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
++C S+CQ I D+GTSLI GP I +L +L+GA P G +DC + +P V +
Sbjct: 272 AMFCQSSCQVIVDSGTSLITGPKAEIKKLQELLGATPTLFGEYILDCSRVSSLPRVTFTI 331
Query: 167 GGKNFTLT 174
G +++TLT
Sbjct: 332 GQRDYTLT 339
Score = 68.9 bits (167), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 35/64 (54%), Positives = 44/64 (68%), Gaps = 6/64 (9%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASDE 66
AQYYG IS+GTPPQ F V+FDTGSSN W+PS +C L+ AC V ++F F + S E
Sbjct: 79 AQYYGEISVGTPPQNFSVVFDTGSSNFWVPSSYC--LSEACQV-HERF---KSFESTSYE 132
Query: 67 NGGE 70
+GG
Sbjct: 133 HGGR 136
>gi|198422402|ref|XP_002130569.1| PREDICTED: similar to predicted protein [Ciona intestinalis]
Length = 389
Score = 103 bits (256), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 51/130 (39%), Positives = 76/130 (58%), Gaps = 7/130 (5%)
Query: 51 KDKFVGHSHF-----RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI 105
K + + H+ F R+A+ +GGE+ GGVD KF G +Y PV+ KGYWQ ++S+ +
Sbjct: 198 KQEALAHNQFSFYLNRDANASSGGELYLGGVDTKKFTGSFSYHPVTVKGYWQISMDSVSV 257
Query: 106 --EKNVYCSNCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVD 163
+ S C+AI D+GTSL+ GP+ I ++NKLIGA NG V C+ + MP++
Sbjct: 258 GSSTSACVSGCKAIVDSGTSLLAGPTDEIEKINKLIGATKFLNGEYIVQCNKMATMPDIT 317
Query: 164 IILGGKNFTL 173
L G + L
Sbjct: 318 FSLSGVKYIL 327
Score = 73.9 bits (180), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 30/43 (69%), Positives = 36/43 (83%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSV 49
AQYYG I +GTPPQ F V+FDTGSSNLW+PS HC+I +IAC +
Sbjct: 66 AQYYGKIYIGTPPQPFTVVFDTGSSNLWVPSVHCAITDIACLI 108
>gi|158254091|gb|AAI54325.1| Zgc:63831 [Danio rerio]
Length = 412
Score = 103 bits (256), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 50/116 (43%), Positives = 70/116 (60%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYC--SNCQAIA 118
R+ + + GGE+M GG D+ F GD+ Y V+RK YWQ ++ +++ + S CQAI
Sbjct: 237 RDPAGDVGGELMLGGFDQQYFNGDLHYVNVTRKAYWQIKMDEVQVGSTLTLCKSGCQAIV 296
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTS+I GP + + L K IGA+PL G +DC + +P V LGGK F LT
Sbjct: 297 DTGTSMITGPVQEVRALQKAIGAIPLLMGEYWIDCKKIPTLPVVSFSLGGKMFNLT 352
Score = 77.4 bits (189), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 31/45 (68%), Positives = 39/45 (86%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDK 51
AQYYG IS+GTPPQ+F V+FDTGSSNLW+PS HC+ L+IAC + +
Sbjct: 90 AQYYGMISIGTPPQDFSVLFDTGSSNLWVPSIHCAFLDIACWLHR 134
>gi|118082412|ref|XP_416090.2| PREDICTED: cathepsin E-A-like [Gallus gallus]
Length = 404
Score = 103 bits (256), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 50/116 (43%), Positives = 72/116 (62%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-KNVYCSN-CQAIA 118
R ENGGE++ GG+D + G I + PV+ K YWQ + +IKI+ + V+CS+ C+AI
Sbjct: 224 RGDDTENGGELILGGIDHSLYKGSIHWVPVTEKSYWQIHLNNIKIQGRVVFCSHGCEAIV 283
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
D+GTSLI GPS I L + IGA P +G VDC L +P++ +G ++ LT
Sbjct: 284 DSGTSLITGPSSQIRRLQEYIGASPSRSGEFLVDCRRLSSLPHISFTIGHHDYKLT 339
Score = 67.8 bits (164), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 35/64 (54%), Positives = 45/64 (70%), Gaps = 6/64 (9%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASDE 66
AQYYG IS+GTPPQ F V+FDTGSSN W+PS +C ++ AC V + +F F + S E
Sbjct: 79 AQYYGVISVGTPPQSFTVVFDTGSSNFWVPSVYC--ISEACRVHQ-RF---KSFLSDSYE 132
Query: 67 NGGE 70
+GGE
Sbjct: 133 HGGE 136
>gi|25452827|sp|Q9DEX3.1|CATD_CLUHA RecName: Full=Cathepsin D; Flags: Precursor
gi|11037777|gb|AAG27733.1|AF312364_1 muscular cathepsin D [Clupea harengus]
Length = 396
Score = 103 bits (256), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 48/116 (41%), Positives = 68/116 (58%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAIA 118
RN E GGE++ GG D + GD Y PV+R+ YWQ ++ + I + C + C+AI
Sbjct: 221 RNPDTEPGGELLLGGTDPKYYTGDFNYVPVTRQAYWQIHMDGMSIGSQLTLCKDGCEAIV 280
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSLI GP + L K IGA+PL G +DC + +P + +GGK ++LT
Sbjct: 281 DTGTSLITGPPAEVRALQKAIGAIPLIQGEYMIDCKKVPTLPTISFNVGGKTYSLT 336
Score = 69.7 bits (169), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 30/47 (63%), Positives = 36/47 (76%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+++ AQYYG I LGTP Q F V+FDTGSSNLW+PS HCS +IAC
Sbjct: 68 LKNYMDAQYYGEIGLGTPVQMFTVVFDTGSSNLWLPSIHCSFTDIAC 114
>gi|358333762|dbj|GAA52230.1| cathepsin D [Clonorchis sinensis]
Length = 408
Score = 103 bits (256), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 45/118 (38%), Positives = 78/118 (66%), Gaps = 1/118 (0%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN-CQAIAD 119
RN + GGEIMFGG++K+ ++GD+ + P+S + YWQ ++ I++ C+ CQAI D
Sbjct: 236 RNDVQDIGGEIMFGGINKEHYMGDMYFVPLSSETYWQIDLDGIQVTSLTLCAQGCQAIVD 295
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLTRTN 177
TGT+LI+GP+ + +LN+ +GAV + G + ++C + +P ++ + G+N TL T+
Sbjct: 296 TGTTLIVGPTADVNQLNEALGAVSIEGGLSVLECSQIYTLPPIEFSINGENLTLQPTD 353
Score = 70.1 bits (170), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 29/43 (67%), Positives = 35/43 (81%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSV 49
+QYYG I++GTPPQ FKV+FDTGSSNLW+PS CS N AC +
Sbjct: 89 SQYYGEIAIGTPPQPFKVVFDTGSSNLWVPSNRCSPWNEACRL 131
>gi|318977821|ref|NP_001187407.1| cathepsin D precursor [Ictalurus punctatus]
gi|308322929|gb|ADO28602.1| cathepsin D [Ictalurus punctatus]
Length = 398
Score = 102 bits (255), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 53/116 (45%), Positives = 71/116 (61%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAIA 118
R+ E GGE+M GG DK F GD+ Y V+RK YWQ ++++++ + C + CQAI
Sbjct: 223 RDPKAEVGGELMLGGCDKQYFDGDLHYLNVTRKAYWQIKMDTVEVGSTLTLCKDGCQAIV 282
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
D+GTS+I GP + I LNK IGAVPL G + C + +P V LGGK F LT
Sbjct: 283 DSGTSMITGPVEEIRALNKAIGAVPLIMGEYWISCSKIPSLPVVSFHLGGKVFNLT 338
Score = 76.3 bits (186), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 30/41 (73%), Positives = 36/41 (87%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQYYG IS+GTPPQEF V+FDTGSSNLW+PS HC+ ++AC
Sbjct: 76 AQYYGVISIGTPPQEFTVLFDTGSSNLWVPSIHCAFFDLAC 116
>gi|74198040|dbj|BAE35200.1| unnamed protein product [Mus musculus]
Length = 410
Score = 102 bits (255), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 51/127 (40%), Positives = 79/127 (62%), Gaps = 3/127 (2%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--E 106
VDK+ F + + R+ + GGE+M GG D + G+++Y V+RK YWQ ++ +++ E
Sbjct: 222 VDKNIFSFYLN-RDPEGQPGGELMLGGTDSKYYHGELSYLNVTRKAYWQVHMDQLEVGNE 280
Query: 107 KNVYCSNCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
+ C+AI DTGTSL++GP + + EL K IGAVPL G + C+ + +P V + L
Sbjct: 281 LTLCKGGCEAIVDTGTSLLVGPVEEVKELQKAIGAVPLIQGEYMIPCEKVSSLPTVYLKL 340
Query: 167 GGKNFTL 173
GGKN+ L
Sbjct: 341 GGKNYEL 347
Score = 75.9 bits (185), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 47/122 (38%), Positives = 62/122 (50%), Gaps = 27/122 (22%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHF 60
+++ AQYYG I +GTPPQ F V+FDTGSSNLW+PS HC IL+IAC V H
Sbjct: 71 LKNYLDAQYYGDIGIGTPPQCFTVVFDTGSSNLWVPSIHCKILDIACWV---------HH 121
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYW------------QFGVESIKIEKN 108
+ SD++ + K+ DI Y S GY Q IK+EK
Sbjct: 122 KYNSDKSSTYV------KNGTSFDIHYGSGSLSGYLSQDTVSVPCKSDQSKARGIKVEKQ 175
Query: 109 VY 110
++
Sbjct: 176 IF 177
>gi|74204520|dbj|BAE35336.1| unnamed protein product [Mus musculus]
Length = 410
Score = 102 bits (255), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 51/127 (40%), Positives = 79/127 (62%), Gaps = 3/127 (2%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--E 106
VDK+ F + + R+ + GGE+M GG D + G+++Y V+RK YWQ ++ +++ E
Sbjct: 222 VDKNIFSFYLN-RDPEGQPGGELMLGGTDSKYYHGELSYLNVTRKAYWQVHMDQLEVGNE 280
Query: 107 KNVYCSNCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
+ C+AI DTGTSL++GP + + EL K IGAVPL G + C+ + +P V + L
Sbjct: 281 LTLCKGGCEAIVDTGTSLLVGPVEEVKELQKAIGAVPLIQGEYMIPCEKVSSLPTVYLKL 340
Query: 167 GGKNFTL 173
GGKN+ L
Sbjct: 341 GGKNYEL 347
Score = 77.0 bits (188), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 47/122 (38%), Positives = 62/122 (50%), Gaps = 27/122 (22%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHF 60
+++ AQYYG I +GTPPQ F V+FDTGSSNLW+PS HC IL+IAC V H
Sbjct: 71 LKNYLDAQYYGDIGIGTPPQCFTVVFDTGSSNLWVPSIHCKILDIACWV---------HH 121
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYW------------QFGVESIKIEKN 108
+ SD++ + K+ DI Y S GY Q IK+EK
Sbjct: 122 KYNSDKSSTHV------KNGTSFDIHYGSGSLSGYLSQDTVSVPCKSDQSKARGIKVEKQ 175
Query: 109 VY 110
++
Sbjct: 176 IF 177
>gi|74192771|dbj|BAE34900.1| unnamed protein product [Mus musculus]
Length = 410
Score = 102 bits (255), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 51/127 (40%), Positives = 79/127 (62%), Gaps = 3/127 (2%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--E 106
VDK+ F + + R+ + GGE+M GG D + G+++Y V+RK YWQ ++ +++ E
Sbjct: 222 VDKNIFSFYLN-RDPEGQPGGELMLGGTDSKYYHGELSYLNVTRKAYWQVHMDQLEVGSE 280
Query: 107 KNVYCSNCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
+ C+AI DTGTSL++GP + + EL K IGAVPL G + C+ + +P V + L
Sbjct: 281 LTLCKGGCEAIVDTGTSLLVGPVEEVKELQKAIGAVPLIQGEYMIPCEKVSSLPTVYLKL 340
Query: 167 GGKNFTL 173
GGKN+ L
Sbjct: 341 GGKNYEL 347
Score = 75.9 bits (185), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 47/122 (38%), Positives = 62/122 (50%), Gaps = 27/122 (22%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHF 60
+++ AQYYG I +GTPPQ F V+FDTGSSNLW+PS HC IL+IAC V H
Sbjct: 71 LKNYLDAQYYGDIGIGTPPQCFTVVFDTGSSNLWVPSIHCKILDIACWV---------HH 121
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYW------------QFGVESIKIEKN 108
+ SD++ + K+ DI Y S GY Q IK+EK
Sbjct: 122 KYNSDKSSTYV------KNGTSFDIHYGSGSLSGYLSQDTVSVPCKSDQSKARGIKVEKQ 175
Query: 109 VY 110
++
Sbjct: 176 IF 177
>gi|74142218|dbj|BAE31874.1| unnamed protein product [Mus musculus]
Length = 410
Score = 102 bits (255), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 51/127 (40%), Positives = 79/127 (62%), Gaps = 3/127 (2%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--E 106
VDK+ F + + R+ + GGE+M GG D + G+++Y V+RK YWQ ++ +++ E
Sbjct: 222 VDKNIFSFYLN-RDPEGQPGGELMLGGTDSKYYHGELSYLNVTRKAYWQVHMDQLEVGNE 280
Query: 107 KNVYCSNCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
+ C+AI DTGTSL++GP + + EL K IGAVPL G + C+ + +P V + L
Sbjct: 281 LTLCKGGCEAIVDTGTSLLVGPVEEVKELQKAIGAVPLIQGEYMIPCEKVSSLPTVYLKL 340
Query: 167 GGKNFTL 173
GGKN+ L
Sbjct: 341 GGKNYEL 347
Score = 75.9 bits (185), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 47/122 (38%), Positives = 62/122 (50%), Gaps = 27/122 (22%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHF 60
+++ AQYYG I +GTPPQ F V+FDTGSSNLW+PS HC IL+IAC V H
Sbjct: 71 LKNYLDAQYYGDIGIGTPPQCFTVVFDTGSSNLWVPSIHCKILDIACWV---------HH 121
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYW------------QFGVESIKIEKN 108
+ SD++ + K+ DI Y S GY Q IK+EK
Sbjct: 122 KYNSDKSSTYV------KNGTSFDIHYGSGSLSGYLSQDTVSVPCKSDQSKARGIKVEKQ 175
Query: 109 VY 110
++
Sbjct: 176 IF 177
>gi|6753556|ref|NP_034113.1| cathepsin D precursor [Mus musculus]
gi|115718|sp|P18242.1|CATD_MOUSE RecName: Full=Cathepsin D; Flags: Precursor
gi|50299|emb|CAA37067.1| cathepsin D [Mus musculus]
gi|50301|emb|CAA37423.1| unnamed protein product [Mus musculus]
gi|817945|emb|CAA48453.1| cathepsin d [Mus musculus]
gi|32452040|gb|AAH54758.1| Cathepsin D [Mus musculus]
gi|34785578|gb|AAH57931.1| Cathepsin D [Mus musculus]
gi|74139562|dbj|BAE40918.1| unnamed protein product [Mus musculus]
gi|74139905|dbj|BAE31791.1| unnamed protein product [Mus musculus]
gi|74151769|dbj|BAE29674.1| unnamed protein product [Mus musculus]
gi|74177956|dbj|BAE29773.1| unnamed protein product [Mus musculus]
gi|74178091|dbj|BAE29834.1| unnamed protein product [Mus musculus]
gi|74181413|dbj|BAE29980.1| unnamed protein product [Mus musculus]
gi|74184920|dbj|BAE39078.1| unnamed protein product [Mus musculus]
gi|74185047|dbj|BAE39131.1| unnamed protein product [Mus musculus]
gi|74185557|dbj|BAE30245.1| unnamed protein product [Mus musculus]
gi|74186716|dbj|BAE34813.1| unnamed protein product [Mus musculus]
gi|74189047|dbj|BAE39288.1| unnamed protein product [Mus musculus]
gi|74191359|dbj|BAE30262.1| unnamed protein product [Mus musculus]
gi|74191542|dbj|BAE30346.1| unnamed protein product [Mus musculus]
gi|74197068|dbj|BAE35086.1| unnamed protein product [Mus musculus]
gi|74197198|dbj|BAE35144.1| unnamed protein product [Mus musculus]
gi|74199016|dbj|BAE30724.1| unnamed protein product [Mus musculus]
gi|74204247|dbj|BAE39883.1| unnamed protein product [Mus musculus]
gi|74207294|dbj|BAE30833.1| unnamed protein product [Mus musculus]
gi|74207430|dbj|BAE30895.1| unnamed protein product [Mus musculus]
gi|74212520|dbj|BAE31001.1| unnamed protein product [Mus musculus]
gi|74212556|dbj|BAE31018.1| unnamed protein product [Mus musculus]
gi|74212558|dbj|BAE31019.1| unnamed protein product [Mus musculus]
gi|74213416|dbj|BAE35523.1| unnamed protein product [Mus musculus]
gi|74214708|dbj|BAE31193.1| unnamed protein product [Mus musculus]
gi|74217133|dbj|BAE31236.1| unnamed protein product [Mus musculus]
gi|74219445|dbj|BAE29499.1| unnamed protein product [Mus musculus]
gi|74220283|dbj|BAE31319.1| unnamed protein product [Mus musculus]
gi|74220373|dbj|BAE31412.1| unnamed protein product [Mus musculus]
gi|74220638|dbj|BAE31529.1| unnamed protein product [Mus musculus]
gi|74220740|dbj|BAE31342.1| unnamed protein product [Mus musculus]
gi|74222921|dbj|BAE42305.1| unnamed protein product [Mus musculus]
gi|74225262|dbj|BAE31566.1| unnamed protein product [Mus musculus]
gi|74225282|dbj|BAE31575.1| unnamed protein product [Mus musculus]
gi|148686195|gb|EDL18142.1| cathepsin D, isoform CRA_a [Mus musculus]
Length = 410
Score = 102 bits (255), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 51/127 (40%), Positives = 79/127 (62%), Gaps = 3/127 (2%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--E 106
VDK+ F + + R+ + GGE+M GG D + G+++Y V+RK YWQ ++ +++ E
Sbjct: 222 VDKNIFSFYLN-RDPEGQPGGELMLGGTDSKYYHGELSYLNVTRKAYWQVHMDQLEVGNE 280
Query: 107 KNVYCSNCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
+ C+AI DTGTSL++GP + + EL K IGAVPL G + C+ + +P V + L
Sbjct: 281 LTLCKGGCEAIVDTGTSLLVGPVEEVKELQKAIGAVPLIQGEYMIPCEKVSSLPTVYLKL 340
Query: 167 GGKNFTL 173
GGKN+ L
Sbjct: 341 GGKNYEL 347
Score = 75.9 bits (185), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 47/122 (38%), Positives = 62/122 (50%), Gaps = 27/122 (22%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHF 60
+++ AQYYG I +GTPPQ F V+FDTGSSNLW+PS HC IL+IAC V H
Sbjct: 71 LKNYLDAQYYGDIGIGTPPQCFTVVFDTGSSNLWVPSIHCKILDIACWV---------HH 121
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYW------------QFGVESIKIEKN 108
+ SD++ + K+ DI Y S GY Q IK+EK
Sbjct: 122 KYNSDKSSTYV------KNGTSFDIHYGSGSLSGYLSQDTVSVPCKSDQSKARGIKVEKQ 175
Query: 109 VY 110
++
Sbjct: 176 IF 177
>gi|74220304|dbj|BAE31329.1| unnamed protein product [Mus musculus]
Length = 410
Score = 102 bits (255), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 51/127 (40%), Positives = 79/127 (62%), Gaps = 3/127 (2%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--E 106
VDK+ F + + R+ + GGE+M GG D + G+++Y V+RK YWQ ++ +++ E
Sbjct: 222 VDKNIFSFYLN-RDPEGQPGGELMLGGTDSKYYHGELSYLNVTRKAYWQVHMDQLEVGNE 280
Query: 107 KNVYCSNCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
+ C+AI DTGTSL++GP + + EL K IGAVPL G + C+ + +P V + L
Sbjct: 281 LTLCKGGCEAIVDTGTSLLVGPVEEVKELQKAIGAVPLIQGEYMIPCEKVSSLPTVYLKL 340
Query: 167 GGKNFTL 173
GGKN+ L
Sbjct: 341 GGKNYEL 347
Score = 75.9 bits (185), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 47/122 (38%), Positives = 62/122 (50%), Gaps = 27/122 (22%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHF 60
+++ AQYYG I +GTPPQ F V+FDTGSSNLW+PS HC IL+IAC V H
Sbjct: 71 LKNYLDAQYYGDIGIGTPPQCFTVVFDTGSSNLWVPSIHCKILDIACWV---------HH 121
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYW------------QFGVESIKIEKN 108
+ SD++ + K+ DI Y S GY Q IK+EK
Sbjct: 122 KYNSDKSSTYV------KNGTSFDIHYGSGSLSGYLSQDTVSVPCKSDQSKARGIKVEKQ 175
Query: 109 VY 110
++
Sbjct: 176 IF 177
>gi|404498469|dbj|BAM44630.1| cathepsin, partial [Acipenser schrenckii]
Length = 249
Score = 102 bits (255), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 49/115 (42%), Positives = 75/115 (65%), Gaps = 2/115 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAIA 118
RN +GGE++FGG D F G++ + PV+++GYWQ V++IK+ V +CS CQAI
Sbjct: 75 RNPDYSSGGELIFGGFDPSHFSGELYWVPVTKQGYWQILVDNIKVNGEVMFCSGGCQAIV 134
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
DTGTS++ GP+ I L +++GA PL G V+C +++ +P+V + G FTL
Sbjct: 135 DTGTSMLTGPTADITALQQVLGASPLDEGTYAVECGSINILPDVTFTINGAEFTL 189
>gi|74207446|dbj|BAE30902.1| unnamed protein product [Mus musculus]
Length = 410
Score = 102 bits (255), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 51/127 (40%), Positives = 79/127 (62%), Gaps = 3/127 (2%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--E 106
VDK+ F + + R+ + GGE+M GG D + G+++Y V+RK YWQ ++ +++ E
Sbjct: 222 VDKNIFSFYLN-RDPEGQPGGELMLGGTDSKYYHGELSYLNVTRKAYWQVHMDQLEVGNE 280
Query: 107 KNVYCSNCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
+ C+AI DTGTSL++GP + + EL K IGAVPL G + C+ + +P V + L
Sbjct: 281 LTLCKGGCEAIVDTGTSLLVGPVEEVKELQKAIGAVPLIQGEYMIPCEKVSSLPTVYMKL 340
Query: 167 GGKNFTL 173
GGKN+ L
Sbjct: 341 GGKNYEL 347
Score = 76.3 bits (186), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 47/122 (38%), Positives = 62/122 (50%), Gaps = 27/122 (22%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHF 60
+++ AQYYG I +GTPPQ F V+FDTGSSNLW+PS HC IL+IAC V H
Sbjct: 71 LKNYLDAQYYGDIGIGTPPQCFTVVFDTGSSNLWVPSIHCKILDIACWV---------HH 121
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYW------------QFGVESIKIEKN 108
+ SD++ + K+ DI Y S GY Q IK+EK
Sbjct: 122 KYNSDKSSTYV------KNGTSFDIHYGSGSLSGYLSQDTVSVPCKSDQSKARGIKVEKQ 175
Query: 109 VY 110
++
Sbjct: 176 IF 177
>gi|297468717|ref|XP_001788171.2| PREDICTED: pregnancy-associated glycoprotein 2-like [Bos taurus]
Length = 330
Score = 102 bits (255), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 66/216 (30%), Positives = 102/216 (47%), Gaps = 44/216 (20%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSIL-----NIACSVDKDKFV 55
+++ Y G I++GTPPQEF+V+FDTGS++LW+PS L + S ++ F
Sbjct: 60 LKNYLDMAYVGNITIGTPPQEFRVVFDTGSADLWVPSSAIGNLVSLKQSFGLSQEEYGFD 119
Query: 56 G------------------------------------HSHFRNASDENGGEIMFGGVDKD 79
G + + N G +MFGGVD
Sbjct: 120 GAPFDGVLGLAYPSISIKGIIPIFDNLWSQGAFSEPVFAFYLNTCKPEGSVVMFGGVDHR 179
Query: 80 KFVGDITYSPVSRKGYWQFGVESIKIEKNVY-CSN-CQAIADTGTSLIIGPSKVIAELNK 137
+ G++ + PVS+ YWQ + I + NV CS CQA+ DTGTS+I GP+++I ++K
Sbjct: 180 YYKGELNWIPVSQTRYWQISMNRISMNGNVTACSRGCQALLDTGTSMIHGPTRLITNIHK 239
Query: 138 LIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
L+ A V CD + +P V + G ++ L
Sbjct: 240 LMNARH-QGSEYVVSCDAVKTLPPVIFNINGIDYPL 274
>gi|187472089|gb|ACD11354.1| aspartic protease 2 [Wuchereria bancrofti]
Length = 166
Score = 102 bits (255), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 53/109 (48%), Positives = 71/109 (65%), Gaps = 2/109 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIK-IEKNVYCSN-CQAIA 118
RN SD+ GGEI FGG+D ++FV ITY+PVSR GYWQF ++S++ K + CSN CQAIA
Sbjct: 57 RNPSDKIGGEITFGGIDANRFVSPITYTPVSRHGYWQFKMDSVRGGGKAIGCSNGCQAIA 116
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILG 167
DTGTSLI GP I ++ + IGA + G + C +P + + G
Sbjct: 117 DTGTSLIAGPKSQIDKIQEYIGAEHVYAGEYIIPCYKEPSLPEISFVRG 165
>gi|417400425|gb|JAA47158.1| Putative cathepsin d [Desmodus rotundus]
Length = 409
Score = 102 bits (254), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 50/116 (43%), Positives = 72/116 (62%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAIA 118
R+ + + GGE+M GGVD + G ITY V+RK YWQ ++ + + + C C+AI
Sbjct: 232 RDPNAQPGGELMLGGVDSKYYKGPITYLNVTRKAYWQVHMDEVAVGSGLTLCKEGCEAIV 291
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSL++GP + + EL K IGAVPL G V C+ + +P V + LGGK + L+
Sbjct: 292 DTGTSLLVGPVEEVRELQKAIGAVPLIQGEYMVPCEKVSSLPEVTLKLGGKAYRLS 347
Score = 72.0 bits (175), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 29/47 (61%), Positives = 37/47 (78%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+++ AQYYG I +GTPPQ F V+FDTGSSNLW+PS HC +L+ AC
Sbjct: 71 LKNYMDAQYYGEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDFAC 117
>gi|340729556|ref|XP_003403066.1| PREDICTED: lysosomal aspartic protease-like [Bombus terrestris]
Length = 385
Score = 102 bits (254), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 56/117 (47%), Positives = 76/117 (64%), Gaps = 3/117 (2%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCSN-CQAI 117
RN D+ GGE++ GG D + + G TY PV RKGYWQF ++ IK+ + C C+AI
Sbjct: 209 RNPDDKAGGELILGGSDPNHYEGPFTYVPVDRKGYWQFRMDGIKVGSQHLAICQKGCEAI 268
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
ADTGTSLI GP K + +N IGA +A G A VDC ++ +P ++ +LGGK+F LT
Sbjct: 269 ADTGTSLIAGPVKEVEAINSAIGATAIAAGEAMVDCSSIPNLPTINFVLGGKSFPLT 325
Score = 74.3 bits (181), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 31/43 (72%), Positives = 37/43 (86%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSV 49
AQYYG IS+GTP Q+FKVIFDTGSSNLW+PS+ C + NIAC +
Sbjct: 62 AQYYGVISIGTPSQDFKVIFDTGSSNLWVPSKKCHLTNIACKL 104
>gi|301769501|ref|XP_002920177.1| PREDICTED: cathepsin D-like [Ailuropoda melanoleuca]
Length = 371
Score = 102 bits (254), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 48/116 (41%), Positives = 70/116 (60%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYC--SNCQAIA 118
RN + GGE+M GG D + G ++Y V+RK YWQ +E + + ++ C+AI
Sbjct: 233 RNPGAQPGGELMLGGTDSKYYKGPLSYLNVTRKAYWQVHMEQVDVGSSLTLCKGGCEAIL 292
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSLI+GP + EL K IGAVPL G + C+ + +P V + LGG+ +TL+
Sbjct: 293 DTGTSLIVGPVDEVRELQKAIGAVPLIQGEYMIPCEKVSTLPEVTLKLGGRAYTLS 348
Score = 73.6 bits (179), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 30/47 (63%), Positives = 38/47 (80%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+++ AQYYG I +GTPPQ F V+FDTGSSNLW+PS HC +L+IAC
Sbjct: 71 LKNYMDAQYYGEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIAC 117
>gi|449282010|gb|EMC88940.1| Cathepsin E-B, partial [Columba livia]
Length = 387
Score = 102 bits (254), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 50/116 (43%), Positives = 70/116 (60%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAIA 118
R ENGGE++ GG+D + G I + PV+ K YWQ + +IKI+ V +CS+ C+AI
Sbjct: 206 REDDTENGGELILGGIDHSLYKGSIHWVPVTEKSYWQIHLNNIKIQGRVAFCSHGCEAIV 265
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
D+GTSLI GPS I L + IGA P +G VDC L +P++ +G + LT
Sbjct: 266 DSGTSLITGPSSQIRRLQEYIGASPSHSGEFLVDCRRLSSLPHISFTIGHHEYKLT 321
Score = 68.6 bits (166), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 35/64 (54%), Positives = 45/64 (70%), Gaps = 6/64 (9%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASDE 66
AQYYG +S+GTPPQ F V+FDTGSSN W+PS +C ++ AC V + KF F + S E
Sbjct: 61 AQYYGVVSVGTPPQRFTVVFDTGSSNFWVPSAYC--ISEACRVHQ-KF---KSFLSDSYE 114
Query: 67 NGGE 70
+GGE
Sbjct: 115 HGGE 118
>gi|348511299|ref|XP_003443182.1| PREDICTED: cathepsin D-like [Oreochromis niloticus]
Length = 397
Score = 102 bits (254), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 47/116 (40%), Positives = 69/116 (59%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYC--SNCQAIA 118
R+ + GGE+M GG D + GD+ Y V+RK +WQ G+ + + + + CQAI
Sbjct: 222 RDPTAAVGGELMLGGTDPQYYTGDLHYVNVTRKAFWQIGMNRVDVGNQLTLCKAGCQAIV 281
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSLI+GP + + L K IGA+PL G A ++C + +P + +GGK F LT
Sbjct: 282 DTGTSLIVGPKEEVKALQKAIGAIPLLMGEALIECTKIPTLPVISFDIGGKTFNLT 337
Score = 76.3 bits (186), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 31/41 (75%), Positives = 37/41 (90%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+QYYG IS+GTPPQ F V+FDTGSSNLW+PS HCS+L+IAC
Sbjct: 75 SQYYGIISIGTPPQNFTVLFDTGSSNLWVPSIHCSLLDIAC 115
>gi|339237491|ref|XP_003380300.1| lysosomal aspartic protease [Trichinella spiralis]
gi|316976887|gb|EFV60084.1| lysosomal aspartic protease [Trichinella spiralis]
Length = 405
Score = 102 bits (254), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 51/115 (44%), Positives = 74/115 (64%), Gaps = 2/115 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI-EKNVYC-SNCQAIA 118
RN D+ GG I FG VD+ ++G+IT+ P+ + YW+F +E+IK+ +++V C C IA
Sbjct: 226 RNPEDDLGGLISFGTVDEKYYIGNITWVPLVNQRYWEFNMETIKVGDEHVGCIDGCTTIA 285
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
DTGTSLI GP + L + IGA PL G V C+ +D +PNV + +GG+ F L
Sbjct: 286 DTGTSLIAGPKDEVERLQEAIGAKPLIMGQYYVSCNEVDSLPNVQMKIGGRMFDL 340
Score = 73.2 bits (178), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 31/41 (75%), Positives = 34/41 (82%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQYYG IS+GTPPQ F VIFDTGSSNLW+PS CS +IAC
Sbjct: 79 AQYYGEISIGTPPQNFTVIFDTGSSNLWVPSSKCSFFDIAC 119
>gi|18043133|gb|AAH19682.1| Ctsd protein, partial [Mus musculus]
Length = 217
Score = 102 bits (254), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 51/127 (40%), Positives = 79/127 (62%), Gaps = 3/127 (2%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--E 106
VDK+ F + + R+ + GGE+M GG D + G+++Y V+RK YWQ ++ +++ E
Sbjct: 29 VDKNIFSFYLN-RDPEGQPGGELMLGGTDSKYYHGELSYLNVTRKAYWQVHMDQLEVGNE 87
Query: 107 KNVYCSNCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
+ C+AI DTGTSL++GP + + EL K IGAVPL G + C+ + +P V + L
Sbjct: 88 LTLCKGGCEAIVDTGTSLLVGPVEEVKELQKAIGAVPLIQGEYMIPCEKVSSLPTVYLKL 147
Query: 167 GGKNFTL 173
GGKN+ L
Sbjct: 148 GGKNYEL 154
>gi|195134380|ref|XP_002011615.1| GI11125 [Drosophila mojavensis]
gi|193906738|gb|EDW05605.1| GI11125 [Drosophila mojavensis]
Length = 371
Score = 102 bits (254), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 46/114 (40%), Positives = 67/114 (58%), Gaps = 4/114 (3%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSNCQAIADT 120
R+ + GGE++FGG D + GD TY P+S++GYWQF + +E C NCQAIADT
Sbjct: 211 RDGTSSQGGELIFGGSDSSLYSGDFTYVPISQEGYWQFTMAGASVEGYSLCDNCQAIADT 270
Query: 121 GTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
GTSL++ P+ LN+++ N VDC + +P + +GG NF L+
Sbjct: 271 GTSLLVAPANAYELLNEILN----VNDEGLVDCSTVSSLPVITFNIGGTNFDLS 320
Score = 63.5 bits (153), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 27/49 (55%), Positives = 37/49 (75%), Gaps = 2/49 (4%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSV 49
+ +S YYG I++GTPPQ FKV+FD+GSSNLW+PS++C + AC V
Sbjct: 60 LSNSLNMAYYGAITIGTPPQNFKVLFDSGSSNLWVPSKNCP--SYACEV 106
>gi|116282368|gb|ABJ97285.1| cathepsin D-like aspartic protease [Fasciola hepatica]
Length = 429
Score = 102 bits (254), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 49/114 (42%), Positives = 70/114 (61%), Gaps = 1/114 (0%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN-CQAIAD 119
RNASD GGE++ GG+D + + GDITY PV+ + YWQF V+ I+ C++ CQAIAD
Sbjct: 215 RNASDPVGGELLLGGIDPNYYTGDITYVPVTHEAYWQFKVDKIEFPGVSICADGCQAIAD 274
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
TGTSLI GP K + LN+ IG + G V+ D +D + + ++ G+
Sbjct: 275 TGTSLIAGPKKEVDALNEQIGGTWMPGGIYVVNWDKIDNLSAITFVVAGRKMVF 328
Score = 72.8 bits (177), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 31/41 (75%), Positives = 35/41 (85%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQYYG I +GTPPQ FKVIFDTGSSNLW+PS+ CS L+ AC
Sbjct: 68 AQYYGEIGIGTPPQTFKVIFDTGSSNLWVPSKRCSYLSWAC 108
>gi|125807245|ref|XP_001360320.1| GA13759 [Drosophila pseudoobscura pseudoobscura]
gi|195149648|ref|XP_002015768.1| GL11239 [Drosophila persimilis]
gi|54635492|gb|EAL24895.1| GA13759 [Drosophila pseudoobscura pseudoobscura]
gi|194109615|gb|EDW31658.1| GL11239 [Drosophila persimilis]
Length = 388
Score = 102 bits (254), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 50/114 (43%), Positives = 65/114 (57%), Gaps = 1/114 (0%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN-CQAIAD 119
R+ + GGEI+FGG D + GD TY PV+RK YWQ ++S + C CQ IAD
Sbjct: 214 RDPASPEGGEIIFGGSDPKHYTGDFTYLPVTRKAYWQIKMDSAALGDLELCKGGCQVIAD 273
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
TGTSLI P +N+ IG P+ G V CD + K+P + +LGGK F L
Sbjct: 274 TGTSLIAAPMTEATSINQKIGGTPIIGGQYIVSCDLIPKLPVIKFVLGGKTFEL 327
Score = 73.6 bits (179), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 29/41 (70%), Positives = 36/41 (87%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQYYG IS+G+PPQ F+V+FDTGSSNLW+PS+ C + NIAC
Sbjct: 67 AQYYGPISIGSPPQSFRVVFDTGSSNLWVPSKKCHLTNIAC 107
>gi|281344446|gb|EFB20030.1| hypothetical protein PANDA_008874 [Ailuropoda melanoleuca]
Length = 345
Score = 102 bits (253), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 48/116 (41%), Positives = 70/116 (60%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYC--SNCQAIA 118
RN + GGE+M GG D + G ++Y V+RK YWQ +E + + ++ C+AI
Sbjct: 218 RNPGAQPGGELMLGGTDSKYYKGPLSYLNVTRKAYWQVHMEQVDVGSSLTLCKGGCEAIL 277
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSLI+GP + EL K IGAVPL G + C+ + +P V + LGG+ +TL+
Sbjct: 278 DTGTSLIVGPVDEVRELQKAIGAVPLIQGEYMIPCEKVSTLPEVTLKLGGRAYTLS 333
Score = 73.6 bits (179), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 30/47 (63%), Positives = 38/47 (80%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+++ AQYYG I +GTPPQ F V+FDTGSSNLW+PS HC +L+IAC
Sbjct: 56 LKNYMDAQYYGEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIAC 102
>gi|405117936|gb|AFR92711.1| endopeptidase [Cryptococcus neoformans var. grubii H99]
Length = 438
Score = 102 bits (253), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 52/117 (44%), Positives = 72/117 (61%), Gaps = 2/117 (1%)
Query: 60 FR-NASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI-EKNVYCSNCQAI 117
FR +SDE+GGE +FGG+D + G + Y PV RKGYW+ +ESI ++ + N A
Sbjct: 266 FRLGSSDEDGGEAIFGGIDDSAYSGKLAYVPVRRKGYWEVELESISFGDEELELENTGAA 325
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSLI+ P+ V LNK IGA NG VDC+ + +P + GGK++TL+
Sbjct: 326 IDTGTSLIVMPTDVAELLNKEIGAEKSWNGQYTVDCNTVSSLPELAFTFGGKDYTLS 382
Score = 59.7 bits (143), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 36/84 (42%), Positives = 48/84 (57%), Gaps = 7/84 (8%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDK--DKFVGHSHFRNAS 64
AQY+ T+ LGTP Q FKV+ DTGSSNLW+PS C+ +IAC + D ++ N S
Sbjct: 125 AQYFATVELGTPFQTFKVVLDTGSSNLWVPSVKCT--SIACFLHNKYDSSQSSTYKANGS 182
Query: 65 DENGGEIMFGGVDKDKFVGDITYS 88
D EI +G + F+ T S
Sbjct: 183 D---FEIHYGSGSLEGFISQDTLS 203
>gi|326911558|ref|XP_003202125.1| PREDICTED: cathepsin E-A-like [Meleagris gallopavo]
Length = 404
Score = 102 bits (253), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 50/116 (43%), Positives = 70/116 (60%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAIA 118
R ENGGE++ GG+D + G I + PV+ K YWQ + +IKI+ V +CS+ C+AI
Sbjct: 224 RGDDTENGGELILGGIDHSLYKGSIHWVPVTEKSYWQIHLNNIKIQGRVAFCSHGCEAIV 283
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
D+GTSLI GPS I L + IGA P +G VDC L +P++ +G + LT
Sbjct: 284 DSGTSLITGPSSQIRRLQEYIGASPSRSGEFLVDCRRLSSLPHISFTIGHHEYKLT 339
Score = 67.8 bits (164), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 35/64 (54%), Positives = 45/64 (70%), Gaps = 6/64 (9%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASDE 66
AQYYG IS+GTPPQ F V+FDTGSSN W+PS +C ++ AC V + +F F + S E
Sbjct: 79 AQYYGVISVGTPPQSFTVVFDTGSSNFWVPSVYC--ISEACRVHQ-RF---KSFLSDSYE 132
Query: 67 NGGE 70
+GGE
Sbjct: 133 HGGE 136
>gi|122938524|gb|ABM69086.1| aspartic proteinase AspMD03 [Musca domestica]
Length = 390
Score = 102 bits (253), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 50/115 (43%), Positives = 69/115 (60%), Gaps = 3/115 (2%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN--CQAIA 118
R+ GGEI+FGG D D + GD TY PV+RK YWQ ++S + ++ C+ CQ IA
Sbjct: 216 RDPKAPEGGEIIFGGSDPDHYTGDFTYLPVTRKAYWQIKMDSASM-GDLKCAKGGCQVIA 274
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
DTGTSLI P +N+ IG P+ G V C+++ K+P + +LGGK F L
Sbjct: 275 DTGTSLIALPPSEATSINQAIGGTPIMGGQYMVACEDIPKLPVIKFVLGGKTFEL 329
Score = 76.3 bits (186), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 31/41 (75%), Positives = 37/41 (90%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQYYG IS+GTPPQ+FKV+FDTGSSNLW+PS+ C + NIAC
Sbjct: 69 AQYYGPISIGTPPQDFKVVFDTGSSNLWVPSKKCHLTNIAC 109
>gi|307167892|gb|EFN61281.1| Lysosomal aspartic protease [Camponotus floridanus]
Length = 243
Score = 102 bits (253), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 51/113 (45%), Positives = 74/113 (65%), Gaps = 4/113 (3%)
Query: 62 NASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSNCQAIADTG 121
N+S E GGE++ GG D + G+ TY VS+KGYWQ+ ++SI + + + QAIADTG
Sbjct: 80 NSSAEEGGELILGGSDPKYYEGNFTYVNVSKKGYWQYPLQSITVGGKLVAEDIQAIADTG 139
Query: 122 TSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
TSLI+GPS++I +NK I A+ VDCD +DK+P +D + G ++ LT
Sbjct: 140 TSLIVGPSEIIDPINKDIH----ADSDGSVDCDKIDKLPVIDFNINGVSYNLT 188
>gi|122114359|gb|AAY42145.2| cathepsin D [Sus scrofa]
Length = 410
Score = 102 bits (253), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 50/131 (38%), Positives = 78/131 (59%), Gaps = 3/131 (2%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKN 108
VDK+ F + + R+ + G E+M GG+D + G + Y V+RK YWQ ++ + + +
Sbjct: 222 VDKNIFSFYLN-RDPGAQPGSELMLGGIDSKYYKGSLDYHNVTRKAYWQIHMDQVAVGSS 280
Query: 109 VYC--SNCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
+ C+AI DTGTSLI+GP + + EL K IGAVPL G + C+ + +P+V + L
Sbjct: 281 LTLCKGGCEAIVDTGTSLIVGPVEEVRELQKAIGAVPLIQGEYMIPCEKVPSLPDVTVTL 340
Query: 167 GGKNFTLTRTN 177
GGK + L+ N
Sbjct: 341 GGKKYKLSSEN 351
Score = 73.6 bits (179), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 30/47 (63%), Positives = 38/47 (80%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+++ AQYYG I +GTPPQ F V+FDTGSSNLW+PS HC +L+IAC
Sbjct: 71 LKNYMDAQYYGEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIAC 117
>gi|345797646|ref|XP_545694.3| PREDICTED: cathepsin E [Canis lupus familiaris]
Length = 396
Score = 102 bits (253), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 54/119 (45%), Positives = 78/119 (65%), Gaps = 6/119 (5%)
Query: 63 ASDENGG---EIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAI 117
+SD GG E++FGG D F G++ + PV+++GYWQ +++I++ V +CS CQAI
Sbjct: 220 SSDPEGGTGSELIFGGYDHSHFSGNLNWVPVTKQGYWQIALDAIQVGGTVMFCSEGCQAI 279
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLTRT 176
DTGTSLI GPS I +L IGA P+ +G V+C NL+ MP+V I+ G ++TL T
Sbjct: 280 VDTGTSLITGPSDEIKQLQNAIGAEPM-DGEYGVECANLNVMPDVTFIINGVSYTLQPT 337
Score = 61.2 bits (147), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 24/34 (70%), Positives = 31/34 (91%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCS 41
+Y+GTIS+G+PPQ F VIFDTGSSNLW+PS +C+
Sbjct: 77 EYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCT 110
>gi|74198620|dbj|BAE39786.1| unnamed protein product [Mus musculus]
Length = 410
Score = 102 bits (253), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 51/127 (40%), Positives = 78/127 (61%), Gaps = 3/127 (2%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--E 106
VDK+ F + + R+ + GGE+M GG D + G+++Y V+RK YWQ ++ +++ E
Sbjct: 222 VDKNIFSFYLN-RDPEGQPGGELMLGGTDSKYYHGELSYLNVTRKAYWQVHMDQLEVGNE 280
Query: 107 KNVYCSNCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
+ C+AI DTGTSL++GP + EL K IGAVPL G + C+ + +P V + L
Sbjct: 281 LTLCKGGCEAIVDTGTSLLVGPVGEVKELQKAIGAVPLIQGEYMIPCEKVSSLPTVYLKL 340
Query: 167 GGKNFTL 173
GGKN+ L
Sbjct: 341 GGKNYEL 347
Score = 76.6 bits (187), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 46/115 (40%), Positives = 65/115 (56%), Gaps = 13/115 (11%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHF 60
+++ AQYYG I +GTPPQ F V+FDTGSSNLW+PS HC IL+IAC +V H +
Sbjct: 71 LKNYMDAQYYGDIGIGTPPQCFTVVFDTGSSNLWVPSIHCKILDIAC------WVHHKYN 124
Query: 61 RNASD---ENGG--EIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVY 110
+ S +NG +I +G ++ T S + G Q IK+EK ++
Sbjct: 125 SDKSSTYVKNGTSFDIHYGSGSLSGYLSQDTVSVPCKSG--QSKARGIKVEKQIF 177
>gi|395851770|ref|XP_003798425.1| PREDICTED: cathepsin D [Otolemur garnettii]
Length = 405
Score = 101 bits (252), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 46/116 (39%), Positives = 74/116 (63%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAIA 118
R+ + + GGE+M GGVD + G ++Y V+RK YW+ +E +++ + C C+AI
Sbjct: 228 RDPNAQPGGELMLGGVDSKYYTGSLSYLNVTRKAYWEVHMEQVEVASGLTLCKGGCEAIV 287
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSL++GP + EL K IGA+PL G + C+ + +P+V + L GK++TL+
Sbjct: 288 DTGTSLMVGPVDEVRELQKAIGAIPLIQGEYMIPCEKVSSLPSVTLKLAGKDYTLS 343
Score = 72.0 bits (175), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 29/47 (61%), Positives = 37/47 (78%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+R+ AQYYG I +GTPPQ F V+FDTGS+NLW+PS C +L+IAC
Sbjct: 69 LRNYMDAQYYGEIGIGTPPQCFTVVFDTGSANLWVPSSKCKMLDIAC 115
>gi|194863696|ref|XP_001970568.1| GG10707 [Drosophila erecta]
gi|190662435|gb|EDV59627.1| GG10707 [Drosophila erecta]
Length = 390
Score = 101 bits (252), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 49/114 (42%), Positives = 67/114 (58%), Gaps = 1/114 (0%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN-CQAIAD 119
R+ + GGEI+FGG D + + G+ TY PV+RK YWQ +++ I C CQ IAD
Sbjct: 216 RDPASPEGGEIIFGGSDPNHYTGEFTYLPVTRKAYWQIKMDAASIGDLQLCKGGCQVIAD 275
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
TGTSLI P + +N+ IG P+ G V CD + K+P + +LGGK F L
Sbjct: 276 TGTSLIAAPLEEATSINQKIGGTPIIGGQYVVSCDLIPKLPVIKFVLGGKTFEL 329
Score = 72.4 bits (176), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 28/41 (68%), Positives = 36/41 (87%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQYYG I++G+PPQ F+V+FDTGSSNLW+PS+ C + NIAC
Sbjct: 69 AQYYGPIAIGSPPQNFRVVFDTGSSNLWVPSKKCHLTNIAC 109
>gi|410986349|ref|XP_003999473.1| PREDICTED: cathepsin E [Felis catus]
Length = 396
Score = 101 bits (252), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 52/111 (46%), Positives = 73/111 (65%), Gaps = 3/111 (2%)
Query: 68 GGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAIADTGTSLI 125
G E++FGG D F G + + PV+++GYWQ ++ I++ V +CS CQAI DTGTSLI
Sbjct: 228 GSELIFGGYDHSHFSGTLNWVPVTKQGYWQIALDVIQVGGTVMFCSEGCQAIVDTGTSLI 287
Query: 126 IGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLTRT 176
GPS I +L K IGA P+ +G V+C NL+ MP+V I+ G ++TL T
Sbjct: 288 TGPSDKIKQLQKAIGAEPM-DGEYAVECANLNVMPDVTFIINGVSYTLQPT 337
Score = 59.7 bits (143), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 23/35 (65%), Positives = 31/35 (88%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCS 41
+Y+G+IS+G+PPQ F VIFDTGSSNLW+PS +C+
Sbjct: 76 TEYFGSISIGSPPQNFTVIFDTGSSNLWVPSVYCT 110
>gi|432870116|ref|XP_004071815.1| PREDICTED: cathepsin D-like [Oryzias latipes]
Length = 397
Score = 101 bits (252), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 47/116 (40%), Positives = 72/116 (62%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYC--SNCQAIA 118
R+ + E GGE++ GG+D F GD+ Y V+RK YWQ ++ + + + + CQ+I
Sbjct: 222 RDTAAEVGGELILGGIDPQYFSGDLHYVNVTRKAYWQIQMDRVDVGNQLTLCKAGCQSIV 281
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSL++GP++ I L+K IGA+PL G +DC + +P + +GGK F LT
Sbjct: 282 DTGTSLMVGPAEEIRALHKAIGALPLLMGEYFIDCKKIPSLPVISFNIGGKTFNLT 337
Score = 78.2 bits (191), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 32/45 (71%), Positives = 38/45 (84%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDK 51
AQYYG IS+GTPPQ F V+FDTGSSNLW+PS HCS L++AC V +
Sbjct: 75 AQYYGLISIGTPPQNFSVLFDTGSSNLWVPSIHCSFLDVACWVHR 119
>gi|195474504|ref|XP_002089531.1| GE23596 [Drosophila yakuba]
gi|194175632|gb|EDW89243.1| GE23596 [Drosophila yakuba]
Length = 392
Score = 101 bits (252), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 49/114 (42%), Positives = 67/114 (58%), Gaps = 1/114 (0%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN-CQAIAD 119
R+ + GGEI+FGG D + + G+ TY PV+RK YWQ +++ I C CQ IAD
Sbjct: 218 RDPASPEGGEIIFGGSDPNHYTGEFTYLPVTRKAYWQIKMDAASIGDLQLCKGGCQVIAD 277
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
TGTSLI P + +N+ IG P+ G V CD + K+P + +LGGK F L
Sbjct: 278 TGTSLIAAPLEEATSINQKIGGTPIIGGQYVVSCDLIPKLPVIKFVLGGKTFEL 331
Score = 72.4 bits (176), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 28/41 (68%), Positives = 36/41 (87%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQYYG I++G+PPQ F+V+FDTGSSNLW+PS+ C + NIAC
Sbjct: 71 AQYYGPIAIGSPPQNFRVVFDTGSSNLWVPSKKCHLTNIAC 111
>gi|390344805|ref|XP_780533.3| PREDICTED: lysosomal aspartic protease-like [Strongylocentrotus
purpuratus]
Length = 313
Score = 101 bits (252), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 72/194 (37%), Positives = 92/194 (47%), Gaps = 47/194 (24%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDK--DKFVGHS 58
M AQYYG ISLGTPPQEF V+FDTGS++LW+PS C L+IAC + D +
Sbjct: 66 MSDYLDAQYYGPISLGTPPQEFTVVFDTGSADLWVPSSQCGFLDIACKFHRKYDSSKSST 125
Query: 59 HFRN----------------------------ASDENGGE------IMFGGVDKDKFVG- 83
+ +N A +++ GE + F D +G
Sbjct: 126 YKKNGTKWAIQYGSGSCSGFVSNDVIELGALKAKNQSFGEATAEPGLTFVAAKFDGILGL 185
Query: 84 ------DITYSPVSRKGYWQFGVE---SIKIEKN-VYCSNCQAIADTGTSLIIGPSKVIA 133
I+ PV K Q V + + N YC+NC AIADTGTSLI GPS IA
Sbjct: 186 GYPTITRISKVPVFEKIVQQGLVTDAXXLSVGNNGTYCTNCAAIADTGTSLIAGPSAEIA 245
Query: 134 ELNKLIGAVPLANG 147
+L IGA P A+G
Sbjct: 246 KLQLQIGAAPFASG 259
>gi|16119024|gb|AAL14708.1|AF420068_1 aspartic protease [Clonorchis sinensis]
Length = 419
Score = 101 bits (252), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 48/114 (42%), Positives = 70/114 (61%), Gaps = 1/114 (0%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN-CQAIAD 119
RN SD GGE++ GG D + G+I ++P++ + YWQF V+S+ + C N CQAIAD
Sbjct: 208 RNRSDPVGGELLLGGTDPKYYKGEILWAPLTHEAYWQFKVDSMNVGSMKLCENGCQAIAD 267
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
TGTSLI GPS+ + +LN +GA+ + G +DC + +P V + GK L
Sbjct: 268 TGTSLIAGPSEEVGKLNDALGAINIPGGTYYIDCSRVSTLPPVQFSISGKLMQL 321
Score = 77.0 bits (188), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 31/41 (75%), Positives = 37/41 (90%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQYYG I +GTPPQ F+V+FDTGSSNLW+PS+HCSI +IAC
Sbjct: 67 AQYYGEIGIGTPPQSFEVVFDTGSSNLWVPSKHCSIFSIAC 107
>gi|148227998|ref|NP_001079043.1| cathepsin E-A precursor [Xenopus laevis]
gi|46395761|sp|Q805F3.1|CATEA_XENLA RecName: Full=Cathepsin E-A; Flags: Precursor
gi|28460653|dbj|BAC57453.1| cathepsin E1 [Xenopus laevis]
gi|213625998|gb|AAI69692.1| Cathepsin E1 [Xenopus laevis]
gi|213627772|gb|AAI69694.1| Cathepsin E1 [Xenopus laevis]
Length = 397
Score = 101 bits (252), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 52/116 (44%), Positives = 74/116 (63%), Gaps = 3/116 (2%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCS-NCQAIA 118
RN + GGE++FGG D +F G + + PV+ +GYWQ +++++I V +CS CQAI
Sbjct: 217 RNPNSAVGGELVFGGFDASRFSGQLNWVPVTNQGYWQIQLDNVQINGEVLFCSGGCQAIV 276
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSLI GPS I +L +IGA ANG +VDC L++MP V + G + +T
Sbjct: 277 DTGTSLITGPSSDIVQLQNIIGA-SAANGDYEVDCSVLNEMPTVTFTINGIGYQMT 331
Score = 62.0 bits (149), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 31/68 (45%), Positives = 43/68 (63%), Gaps = 3/68 (4%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASDE 66
+Y+G IS+GTPPQ F VIFDTGSSNLW+PS +C ++ AC+ D+F S+
Sbjct: 72 VEYFGEISVGTPPQNFTVIFDTGSSNLWVPSVYC--ISQACA-QHDRFQPQLSSTYESNG 128
Query: 67 NGGEIMFG 74
N + +G
Sbjct: 129 NNFSLQYG 136
>gi|115719|sp|P00795.2|CATD_PIG RecName: Full=Cathepsin D; Contains: RecName: Full=Cathepsin D
light chain; Contains: RecName: Full=Cathepsin D heavy
chain; Flags: Precursor
Length = 345
Score = 101 bits (252), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 51/131 (38%), Positives = 77/131 (58%), Gaps = 3/131 (2%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKN 108
VDKD F + + R+ + GGE+M GG+D + G + Y V+RK YWQ + + + +
Sbjct: 159 VDKDIFSFYLN-RDPGAQPGGELMLGGIDSKYYKGSLDYHNVTRKAYWQIHMNQVAVGSS 217
Query: 109 VYC--SNCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
+ C+AI DTGTSLI+G + + EL K IGAVPL G + C+ + +P+V + L
Sbjct: 218 LTLCKGGCEAIVDTGTSLIVGQPEEVRELGKAIGAVPLIQGEYMIPCEKVPSLPDVTVTL 277
Query: 167 GGKNFTLTRTN 177
GGK + L+ N
Sbjct: 278 GGKKYKLSSEN 288
Score = 73.6 bits (179), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 30/49 (61%), Positives = 39/49 (79%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSV 49
+++ AQYYG I +GTPPQ F V+FDTGSSNLW+PS HC +L+IAC +
Sbjct: 7 LKNYMDAQYYGEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIACWI 55
>gi|307187508|gb|EFN72561.1| Lysosomal aspartic protease [Camponotus floridanus]
Length = 275
Score = 101 bits (252), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 52/112 (46%), Positives = 75/112 (66%), Gaps = 6/112 (5%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEK-NVYCSNCQAIAD 119
R+ S+ENGG ++ GG D + G++TY PV+RKGYWQF ++ IKI+ N+ +CQAIAD
Sbjct: 62 RDVSEENGGVLILGGSDPAFYEGNLTYIPVTRKGYWQFTIDEIKIDYINLCIESCQAIAD 121
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLAN--GPAKVDCDNLDKMPNVDIILGGK 169
TGT IIGP I+E+N++ + + N G VDCD + ++P + IL GK
Sbjct: 122 TGTPWIIGP---ISEINRINNFIEVFNDDGYETVDCDRISELPTISFILDGK 170
>gi|289740593|gb|ADD19044.1| aspartyl protease [Glossina morsitans morsitans]
Length = 394
Score = 101 bits (252), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 50/114 (43%), Positives = 66/114 (57%), Gaps = 1/114 (0%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN-CQAIAD 119
R+ GGEI+FGG D + + G+ TY PV+RK YWQ ++S + C CQ IAD
Sbjct: 220 RDPKAPEGGEIIFGGSDPNHYKGEFTYLPVTRKAYWQIKMDSASMGNLNLCQGGCQVIAD 279
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
TGTSLI P +NK IG P+ G V C+N+ K+P + +LGGK F L
Sbjct: 280 TGTSLIALPPSEATSINKAIGGTPIMGGQYMVACENIPKLPVIRFVLGGKTFEL 333
Score = 72.0 bits (175), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 30/41 (73%), Positives = 35/41 (85%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQYYG IS+GTP Q+FKV+FDTGSSNLW+PS+ C NIAC
Sbjct: 73 AQYYGPISIGTPSQDFKVVFDTGSSNLWVPSKQCYFTNIAC 113
>gi|291223845|ref|XP_002731921.1| PREDICTED: expressed hypothetical protein-like [Saccoglossus
kowalevskii]
Length = 959
Score = 101 bits (252), Expect = 1e-19, Method: Composition-based stats.
Identities = 50/112 (44%), Positives = 71/112 (63%), Gaps = 3/112 (2%)
Query: 65 DENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYC-SNCQAIADTG 121
D N GE++ GG D+ + G+ TY PV+ KGYWQF ++SI + + + YC CQA+ DTG
Sbjct: 787 DGNKGELILGGSDQHHYSGEFTYLPVTIKGYWQFTMDSIHVYDKPSQYCLDGCQAVVDTG 846
Query: 122 TSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
TS+I GP + I LN IGAV N ++C +D +P++ +LGGK F L
Sbjct: 847 TSVIAGPMEDIETLNTEIGAVQYENNQFVINCHLVDSLPDISFVLGGKLFAL 898
Score = 60.5 bits (145), Expect = 3e-07, Method: Composition-based stats.
Identities = 25/42 (59%), Positives = 30/42 (71%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACS 48
A YYG I +GTPP F V+FDTGSS LW+PS C N+AC+
Sbjct: 639 ASYYGEIGIGTPPATFLVLFDTGSSYLWVPSAMCPESNMACA 680
>gi|257228998|gb|ACV53024.1| cathepsin D2 [Homarus americanus]
Length = 385
Score = 101 bits (252), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 50/116 (43%), Positives = 72/116 (62%), Gaps = 3/116 (2%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCSN-CQAI 117
R+ S GGE++ GG D + + G+ TY PV RKGYWQ ++ I++ + +C C+AI
Sbjct: 209 RDPSSPEGGELILGGSDPNYYSGNFTYIPVDRKGYWQIKMDGIQMNGARVPFCEGGCEAI 268
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
ADTGTSLI P + +NK IGA P+A+G VDC + +P + +L G+ FTL
Sbjct: 269 ADTGTSLIAAPVEEARSINKKIGAKPIASGEWSVDCSLIPHLPKISFVLNGQPFTL 324
Score = 74.7 bits (182), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 30/43 (69%), Positives = 36/43 (83%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSV 49
AQYYG IS+GTPPQ F+V+FDTGSSNLW+PS+ C NIAC +
Sbjct: 62 AQYYGPISIGTPPQSFRVVFDTGSSNLWVPSKQCHYTNIACMI 104
>gi|355681644|gb|AER96811.1| cathepsin E [Mustela putorius furo]
Length = 375
Score = 101 bits (252), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 54/119 (45%), Positives = 78/119 (65%), Gaps = 6/119 (5%)
Query: 63 ASDENGG---EIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAI 117
+SD GG E++FGG D F G++ + PV+++GYWQ +++I++ V +CS CQAI
Sbjct: 199 SSDPEGGAGSELIFGGYDHSHFSGNLNWVPVTKQGYWQIALDAIQVGGAVMFCSEGCQAI 258
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLTRT 176
DTGTSLI GPS I +L K IGA P+ +G V+C NL+ MP+V + G ++TL T
Sbjct: 259 VDTGTSLITGPSDKIKQLQKAIGAEPM-DGEYGVECANLNVMPDVTFTINGVSYTLQPT 316
Score = 60.8 bits (146), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 24/34 (70%), Positives = 31/34 (91%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCS 41
+Y+GTIS+G+PPQ F VIFDTGSSNLW+PS +C+
Sbjct: 56 EYFGTISVGSPPQNFTVIFDTGSSNLWVPSVYCT 89
>gi|125858582|gb|AAI29608.1| Ce1-A protein [Xenopus laevis]
Length = 394
Score = 101 bits (252), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 52/116 (44%), Positives = 74/116 (63%), Gaps = 3/116 (2%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCS-NCQAIA 118
RN + GGE++FGG D +F G + + PV+ +GYWQ +++++I V +CS CQAI
Sbjct: 214 RNPNSAVGGELVFGGFDASRFSGQLNWVPVTNQGYWQIQLDNVQINGEVLFCSGGCQAIV 273
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSLI GPS I +L +IGA ANG +VDC L++MP V + G + +T
Sbjct: 274 DTGTSLITGPSSDIVQLQNIIGA-SAANGDYEVDCSVLNEMPTVTFTINGIGYQMT 328
Score = 61.6 bits (148), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 31/68 (45%), Positives = 43/68 (63%), Gaps = 3/68 (4%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASDE 66
+Y+G IS+GTPPQ F VIFDTGSSNLW+PS +C ++ AC+ D+F S+
Sbjct: 69 VEYFGEISVGTPPQNFTVIFDTGSSNLWVPSVYC--ISQACA-QHDRFQPQLSSTYESNG 125
Query: 67 NGGEIMFG 74
N + +G
Sbjct: 126 NNFSLQYG 133
>gi|351712803|gb|EHB15722.1| Cathepsin D, partial [Heterocephalus glaber]
Length = 390
Score = 101 bits (251), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 51/128 (39%), Positives = 78/128 (60%), Gaps = 3/128 (2%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEK- 107
VDK+ F + + R+ S + GGE+M GG+D + G TY V+RK YWQ ++ +++
Sbjct: 202 VDKNVFSFYLN-RDPSAQPGGELMLGGIDSKYYKGSFTYLNVTRKAYWQVHMDQLEVGSG 260
Query: 108 -NVYCSNCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
N+ C+AI DTGTSL++GP + EL K IGA+PL G V C+ + +P+V + L
Sbjct: 261 LNLCKGGCEAIVDTGTSLLVGPVDEVKELQKAIGAIPLIQGEYMVPCEKVSSLPSVTLKL 320
Query: 167 GGKNFTLT 174
GG + L+
Sbjct: 321 GGSAYPLS 328
Score = 71.2 bits (173), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 29/47 (61%), Positives = 38/47 (80%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+++ AQYYG I +GTPPQ F V+FDTGSSNLW+PS C++L+IAC
Sbjct: 51 LKNYMDAQYYGEIGIGTPPQCFTVVFDTGSSNLWVPSSRCNMLDIAC 97
>gi|195438439|ref|XP_002067144.1| GK24836 [Drosophila willistoni]
gi|194163229|gb|EDW78130.1| GK24836 [Drosophila willistoni]
Length = 372
Score = 101 bits (251), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 46/113 (40%), Positives = 69/113 (61%), Gaps = 4/113 (3%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSNCQAIADT 120
R+ S +GGE++FGG D + GD+TY P+S +GYWQF + S + C +CQAIADT
Sbjct: 212 RDGSATDGGELIFGGSDASLYTGDLTYVPISEQGYWQFEMTSASFDGYTLCDDCQAIADT 271
Query: 121 GTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
GTSLI+ P + L++L+ + VDC + +P++ +GG +FTL
Sbjct: 272 GTSLIVAPYEAYEILSELLN----VDDDGLVDCSTVSSLPDLTFNIGGTDFTL 320
Score = 60.1 bits (144), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 25/40 (62%), Positives = 30/40 (75%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHC 40
+ +S YYG IS+GTP Q FKV+FD+GSSNLWIPS C
Sbjct: 61 LSNSLNMAYYGAISIGTPAQSFKVLFDSGSSNLWIPSNTC 100
>gi|47213062|emb|CAF91576.1| unnamed protein product [Tetraodon nigroviridis]
Length = 395
Score = 101 bits (251), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 50/128 (39%), Positives = 76/128 (59%), Gaps = 3/128 (2%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKN 108
V+K+ F + + RN + GGE++ GG D + GD +Y V+R+ YWQ V+ + +
Sbjct: 210 VEKNVFSFYLN-RNPQTQPGGELLLGGTDPQYYTGDFSYVNVTRQAYWQIHVDELSVGSQ 268
Query: 109 VYC--SNCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
+ S C+AI DTGTSL+ GPS+ + L K IGA+PL G V CD + +P + +
Sbjct: 269 LTLCKSGCEAIVDTGTSLLTGPSEEVRSLQKAIGALPLIQGEYMVSCDKIPTLPVITFNI 328
Query: 167 GGKNFTLT 174
GGK ++LT
Sbjct: 329 GGKPYSLT 336
Score = 77.4 bits (189), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 32/45 (71%), Positives = 38/45 (84%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDK 51
AQYYG I LGTPPQ F V+FDTGSSNLW+PS HCS+L+IAC + +
Sbjct: 63 AQYYGEIGLGTPPQPFTVVFDTGSSNLWVPSVHCSLLDIACLLHR 107
>gi|198477124|ref|XP_002136749.1| GA22213 [Drosophila pseudoobscura pseudoobscura]
gi|198145061|gb|EDY71766.1| GA22213 [Drosophila pseudoobscura pseudoobscura]
Length = 373
Score = 101 bits (251), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 47/114 (41%), Positives = 67/114 (58%), Gaps = 4/114 (3%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSNCQAIADT 120
R S +GGE++FGG D + G +TY P+S +GYWQF + S + N C++CQAIADT
Sbjct: 213 RAGSSSDGGELIFGGSDSSLYTGALTYVPISEQGYWQFTMASASSDGNSLCADCQAIADT 272
Query: 121 GTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
GTSLI+ P L ++ + VDC + +P+V +GG +FTLT
Sbjct: 273 GTSLIVAPYNAYITLTDILN----VDDDGYVDCSTVSSLPDVTFNIGGTDFTLT 322
Score = 58.5 bits (140), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 23/40 (57%), Positives = 30/40 (75%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHC 40
+ +S YYG I++GTP Q FKV+FD+GSSNLW+PS C
Sbjct: 62 LSNSMNMAYYGAITIGTPAQSFKVLFDSGSSNLWVPSNTC 101
>gi|321461133|gb|EFX72168.1| hypothetical protein DAPPUDRAFT_227643 [Daphnia pulex]
Length = 394
Score = 101 bits (251), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 53/118 (44%), Positives = 76/118 (64%), Gaps = 5/118 (4%)
Query: 61 RNASD-ENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI---EKNVYCS-NCQ 115
RN ++ ENGGEI+FGG + + G+I+Y PVSRK YWQF V+ + + ++ +C+ C+
Sbjct: 212 RNLNNPENGGEILFGGTNPTHYEGEISYVPVSRKAYWQFSVDGVNLAGYDEYPFCNGGCE 271
Query: 116 AIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
I+DTGTSLI GPS+ I +KLIGA G VDC+ + +P + +GGK F L
Sbjct: 272 MISDTGTSLITGPSEEITLFHKLIGAQVNIVGEGIVDCNEIPNLPAMTFTIGGKPFVL 329
Score = 70.9 bits (172), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 29/46 (63%), Positives = 39/46 (84%), Gaps = 1/46 (2%)
Query: 3 SSYQ-AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
++YQ AQY+G ++LGTPPQEF +IFDTGS+NLW+PS C+ N+AC
Sbjct: 60 TNYQDAQYFGPLTLGTPPQEFDIIFDTGSANLWVPSSECAPTNLAC 105
>gi|432116085|gb|ELK37212.1| Cathepsin E [Myotis davidii]
Length = 396
Score = 101 bits (251), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 54/119 (45%), Positives = 76/119 (63%), Gaps = 6/119 (5%)
Query: 63 ASDENGG---EIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAI 117
+SD GG E++FGG D F G + + PV+++GYWQ +++I++ V +CS CQAI
Sbjct: 220 SSDPEGGAGSELIFGGYDHSHFSGSLNWVPVTKQGYWQIALDTIQVGGAVMFCSEGCQAI 279
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLTRT 176
DTGTSLI GP I +L K IGA P+ +G V+CDNL+ MP+V + G +TL T
Sbjct: 280 VDTGTSLITGPPAEIKQLQKAIGAEPV-DGEYAVECDNLNVMPDVTFTINGVPYTLQPT 337
Score = 60.8 bits (146), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 24/34 (70%), Positives = 31/34 (91%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCS 41
+Y+GTIS+G+PPQ F VIFDTGSSNLW+PS +C+
Sbjct: 77 EYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCT 110
>gi|344277046|ref|XP_003410316.1| PREDICTED: cathepsin E [Loxodonta africana]
Length = 396
Score = 101 bits (251), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 54/119 (45%), Positives = 77/119 (64%), Gaps = 6/119 (5%)
Query: 63 ASDENGG---EIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAI 117
+SD GG E++FGG D F G + + PV+++GYWQ +++I++ V +CS CQAI
Sbjct: 220 SSDPAGGMGSELIFGGYDHSHFSGSLNWVPVTKQGYWQIALDNIQVGGTVMFCSEGCQAI 279
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLTRT 176
DTGTSLI GPS I +L + IGA P NG V+C NL+ MP+V + G ++TL+ T
Sbjct: 280 VDTGTSLITGPSNNIKQLQRAIGAEP-ENGEYAVECVNLNVMPDVTFTINGVSYTLSPT 337
Score = 56.6 bits (135), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 22/37 (59%), Positives = 30/37 (81%)
Query: 5 YQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCS 41
+ +Y+G IS+G+P Q F VIFDTGSSNLW+PS +C+
Sbjct: 74 FDTEYFGAISIGSPSQNFTVIFDTGSSNLWVPSVYCT 110
>gi|334322038|ref|XP_001372237.2| PREDICTED: cathepsin E-like [Monodelphis domestica]
Length = 352
Score = 101 bits (251), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 48/115 (41%), Positives = 74/115 (64%), Gaps = 3/115 (2%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAIA 118
RN G E++FGG D F G + + PV+++GYWQ +++I++ + +C+ CQAI
Sbjct: 177 RNPESTTGSELIFGGYDHSHFSGSLNWVPVTKQGYWQIALDNIQVGGTIMFCAEGCQAIV 236
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
DTGTSLI GPS+ I +L IGAVP +G ++C+NL+ MP++ + G +TL
Sbjct: 237 DTGTSLITGPSEKIKQLQNAIGAVP-TDGEYAMECNNLNVMPDITFTINGIPYTL 290
>gi|74219443|dbj|BAE29498.1| unnamed protein product [Mus musculus]
Length = 410
Score = 100 bits (250), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 50/127 (39%), Positives = 78/127 (61%), Gaps = 3/127 (2%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--E 106
VDK+ F + + R+ + GGE+M GG D + G+++Y V+RK YWQ ++ +++ E
Sbjct: 222 VDKNIFSFYLN-RDPEGQPGGELMLGGTDSKYYHGELSYLNVTRKAYWQVHMDQLEVGNE 280
Query: 107 KNVYCSNCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
+ C+AI DTG SL++GP + + EL K IGAVPL G + C+ + +P V + L
Sbjct: 281 LTLCKGGCEAIVDTGASLLVGPVEEVKELQKAIGAVPLIQGEYMIPCEKVSSLPTVYLKL 340
Query: 167 GGKNFTL 173
GGKN+ L
Sbjct: 341 GGKNYEL 347
Score = 75.9 bits (185), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 47/122 (38%), Positives = 62/122 (50%), Gaps = 27/122 (22%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHF 60
+++ AQYYG I +GTPPQ F V+FDTGSSNLW+PS HC IL+IAC V H
Sbjct: 71 LKNYLDAQYYGDIGIGTPPQCFTVVFDTGSSNLWVPSIHCKILDIACWV---------HH 121
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYW------------QFGVESIKIEKN 108
+ SD++ + K+ DI Y S GY Q IK+EK
Sbjct: 122 KYNSDKSSTYV------KNGTSFDIHYGSGSLSGYLSQDTVSVPCKSDQSKARGIKVEKQ 175
Query: 109 VY 110
++
Sbjct: 176 IF 177
>gi|194854120|ref|XP_001968292.1| GG24793 [Drosophila erecta]
gi|190660159|gb|EDV57351.1| GG24793 [Drosophila erecta]
Length = 404
Score = 100 bits (250), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 50/113 (44%), Positives = 68/113 (60%), Gaps = 1/113 (0%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSNCQAIADT 120
RN S GG++ GG D++ G++TY+PV+++GYWQF V +I + S CQAIADT
Sbjct: 237 RNGSAVEGGQLTLGGTDQNLIAGEMTYTPVTQQGYWQFAVNNITWNGTLISSGCQAIADT 296
Query: 121 GTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
GTSLI PS +LN LIG V L G V C +P + + +GG +F L
Sbjct: 297 GTSLIAVPSAAYIQLNNLIGGV-LVAGDYYVPCSTASSLPVLTLNIGGTDFYL 348
Score = 60.8 bits (146), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 24/39 (61%), Positives = 30/39 (76%)
Query: 9 YYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
YYG I +GTP Q FKV+FDTGS+NLW+PS C ++AC
Sbjct: 92 YYGLIGIGTPEQLFKVVFDTGSANLWVPSAQCLATDVAC 130
>gi|74198157|dbj|BAE35255.1| unnamed protein product [Mus musculus]
Length = 335
Score = 100 bits (250), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 50/127 (39%), Positives = 79/127 (62%), Gaps = 3/127 (2%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--E 106
VDK+ F + + R+ + GGE+M GG D + G+++Y V+RK YWQ ++ +++ E
Sbjct: 147 VDKNIFSFYLN-RDPEGQPGGELMLGGTDSKYYHGELSYLNVTRKAYWQVHMDQLEVGNE 205
Query: 107 KNVYCSNCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
+ C+AI DTGTSL++GP + + EL K IGA+PL G + C+ + +P V + L
Sbjct: 206 LTLCKGGCEAIVDTGTSLLVGPVEEVKELQKAIGAMPLIQGEYMIPCEKVSSLPTVYLKL 265
Query: 167 GGKNFTL 173
GGKN+ L
Sbjct: 266 GGKNYEL 272
Score = 75.1 bits (183), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 47/116 (40%), Positives = 59/116 (50%), Gaps = 27/116 (23%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASDE 66
AQYYG I +GTPPQ F V+FDTGSSNLW+PS HC IL+IAC V H + SD+
Sbjct: 2 AQYYGDIGIGTPPQCFTVVFDTGSSNLWVPSIHCKILDIACWV---------HHKYNSDK 52
Query: 67 NGGEIMFGGVDKDKFVGDITYSPVSRKGYW------------QFGVESIKIEKNVY 110
+ + K+ DI Y S GY Q IK+EK ++
Sbjct: 53 SSTYV------KNGTSFDIHYGSGSLSGYLSQDTVSVPCKSDQSKARGIKVEKQIF 102
>gi|355329699|dbj|BAL14143.1| pepsinogen 2 [Pagrus major]
Length = 377
Score = 100 bits (250), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 50/118 (42%), Positives = 71/118 (60%), Gaps = 2/118 (1%)
Query: 58 SHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-KNVYCSN-CQ 115
S + + + E G E++FGG D + + G IT+ P+S YWQ ++S+ I + V CS CQ
Sbjct: 207 SVYLSGNSEQGSEVVFGGTDSNHYTGQITWIPLSSATYWQISMDSVTINGQTVACSGGCQ 266
Query: 116 AIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
AI DTGTSLI+GP+ I +N +GA G A V+C N+ MP+V L G FT+
Sbjct: 267 AIIDTGTSLIVGPTNDINNMNSWVGASTNQYGEATVNCQNIQSMPDVTFTLNGHAFTV 324
Score = 61.6 bits (148), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 25/32 (78%), Positives = 28/32 (87%)
Query: 9 YYGTISLGTPPQEFKVIFDTGSSNLWIPSQHC 40
YYG +S+GTPPQ F VIFDTGSSNLWIPS +C
Sbjct: 70 YYGVVSIGTPPQSFTVIFDTGSSNLWIPSVYC 101
>gi|307171409|gb|EFN63273.1| Lysosomal aspartic protease [Camponotus floridanus]
Length = 234
Score = 100 bits (250), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 50/115 (43%), Positives = 73/115 (63%), Gaps = 5/115 (4%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYC-SNCQAIAD 119
R+ S + GGE+ GG D + GD TY PV+RKGYWQ ++ I+++ + C +CQAI D
Sbjct: 24 RDTSADLGGELTLGGSDPAYYEGDFTYIPVTRKGYWQITIDRIEMKFDYLCEESCQAIVD 83
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
TG+SLI+GP IA++N IG + VDC+ + ++P + I+GGK F LT
Sbjct: 84 TGSSLILGPKLDIAKINTFIG----VDDSRIVDCNKIFQLPTIRFIMGGKAFDLT 134
>gi|74191270|dbj|BAE39462.1| unnamed protein product [Mus musculus]
gi|74204799|dbj|BAE35462.1| unnamed protein product [Mus musculus]
Length = 410
Score = 100 bits (250), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 50/127 (39%), Positives = 78/127 (61%), Gaps = 3/127 (2%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--E 106
VDK+ F + + R+ + GGE+M GG D + G+++Y V+RK YWQ ++ +++ E
Sbjct: 222 VDKNIFSFYLN-RDPEGQPGGELMLGGTDSKYYHGELSYLNVTRKAYWQVHMDQLEVGNE 280
Query: 107 KNVYCSNCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
+ C+AI DTGTSL++GP + + EL K GAVPL G + C+ + +P V + L
Sbjct: 281 LTLCKGGCEAIVDTGTSLLVGPVEEVKELQKATGAVPLIQGEYMIPCEKVSSLPTVYLKL 340
Query: 167 GGKNFTL 173
GGKN+ L
Sbjct: 341 GGKNYEL 347
Score = 76.3 bits (186), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 47/122 (38%), Positives = 62/122 (50%), Gaps = 27/122 (22%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHF 60
+++ AQYYG I +GTPPQ F V+FDTGSSNLW+PS HC IL+IAC V H
Sbjct: 71 LKNYLDAQYYGDIGIGTPPQCFTVVFDTGSSNLWVPSIHCKILDIACWV---------HH 121
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYW------------QFGVESIKIEKN 108
+ SD++ + K+ DI Y S GY Q IK+EK
Sbjct: 122 KYNSDKSSTYV------KNGTSFDIHYGSGSLSGYLSQDTVSVPCKSDQSKARGIKVEKQ 175
Query: 109 VY 110
++
Sbjct: 176 IF 177
>gi|225907444|gb|ACO36148.1| aspartic protease [Perca flavescens]
Length = 283
Score = 100 bits (250), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 52/129 (40%), Positives = 79/129 (61%), Gaps = 2/129 (1%)
Query: 48 SVDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEK 107
+VD+ F + + +S GE++ GG+DK + G I + PV+ KGYWQ ++S+ ++
Sbjct: 83 TVDEPVFSFYLSRKTSSSNPEGELLLGGIDKLLYSGPINWLPVTAKGYWQIKMDSVAVQG 142
Query: 108 -NVYCSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDII 165
+ +C + CQAI DTGTSLI GP+ I +L +LIGA P +DC L +P+V +
Sbjct: 143 VSSFCPHGCQAIVDTGTSLIAGPTTDILKLQQLIGATPSNINEYLIDCARLSSLPHVTFV 202
Query: 166 LGGKNFTLT 174
LGG +TLT
Sbjct: 203 LGGIEYTLT 211
>gi|195156259|ref|XP_002019018.1| GL25647 [Drosophila persimilis]
gi|198476098|ref|XP_002132261.1| GA25369 [Drosophila pseudoobscura pseudoobscura]
gi|194115171|gb|EDW37214.1| GL25647 [Drosophila persimilis]
gi|198137543|gb|EDY69663.1| GA25369 [Drosophila pseudoobscura pseudoobscura]
Length = 373
Score = 100 bits (250), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 47/114 (41%), Positives = 67/114 (58%), Gaps = 4/114 (3%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSNCQAIADT 120
R S +GGE++FGG D + G +TY P+S +GYWQF + S + N C++CQAIADT
Sbjct: 213 RAGSSSDGGELIFGGSDSSLYTGALTYVPISEQGYWQFTMASASSDGNSLCADCQAIADT 272
Query: 121 GTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
GTSLI+ P L ++ + VDC + +P+V +GG +FTLT
Sbjct: 273 GTSLIVAPYNAYITLTDILN----VDDNGYVDCSTVSSLPDVTFNIGGTDFTLT 322
Score = 58.5 bits (140), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 23/40 (57%), Positives = 30/40 (75%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHC 40
+ +S YYG I++GTP Q FKV+FD+GSSNLW+PS C
Sbjct: 62 LSNSMNMAYYGAITIGTPAQSFKVLFDSGSSNLWVPSNTC 101
>gi|380746491|gb|AFE48185.1| cathepsin D [Pinctada margaritifera]
Length = 390
Score = 100 bits (250), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 47/115 (40%), Positives = 70/115 (60%), Gaps = 2/115 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI-EKNVYCSN-CQAIA 118
R+ S GGE++ GG D + G+ TY PV+R+GYW+F ++ + + E + +C+ C IA
Sbjct: 214 RDPSASEGGELILGGSDTKHYKGNFTYLPVTRQGYWEFKMDGVSVGENHKFCTGGCNTIA 273
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
DTGTSLI GPS + +LN IGA + G +DC + +P + LGG+ F L
Sbjct: 274 DTGTSLIAGPSSEVKKLNAAIGATAIPGGEYMIDCTKIPDLPKITFSLGGQQFDL 328
Score = 68.2 bits (165), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 28/41 (68%), Positives = 33/41 (80%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQYYG I +GTP Q FKV+FDTGSSNLW+PS+ C +IAC
Sbjct: 67 AQYYGVIGIGTPAQNFKVVFDTGSSNLWVPSKKCKFSDIAC 107
>gi|253762215|gb|ACT35559.1| pepsinogen A2 precursor [Siniperca scherzeri]
Length = 376
Score = 100 bits (250), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 50/118 (42%), Positives = 72/118 (61%), Gaps = 2/118 (1%)
Query: 58 SHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-KNVYCSN-CQ 115
S + +++ E G E++FGG+D + G IT+ P+S YWQ ++S+ I + V CS CQ
Sbjct: 206 SVYLSSNSEQGSEVVFGGIDSSHYTGQITWIPLSSATYWQIKMDSVTINGQTVACSGGCQ 265
Query: 116 AIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
AI DTGTSLI+GP+ I +N +GA G A V C N+ MP+V L G+ FT+
Sbjct: 266 AIIDTGTSLIVGPTSDINNMNAWVGASTNQYGEAVVSCQNIQSMPDVTFTLNGQAFTI 323
Score = 62.0 bits (149), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 26/33 (78%), Positives = 29/33 (87%)
Query: 9 YYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCS 41
YYG IS+G+PPQ F VIFDTGSSNLWIPS +CS
Sbjct: 69 YYGVISIGSPPQSFSVIFDTGSSNLWIPSVYCS 101
>gi|195997417|ref|XP_002108577.1| hypothetical protein TRIADDRAFT_19349 [Trichoplax adhaerens]
gi|190589353|gb|EDV29375.1| hypothetical protein TRIADDRAFT_19349, partial [Trichoplax
adhaerens]
Length = 370
Score = 100 bits (250), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 45/115 (39%), Positives = 78/115 (67%), Gaps = 4/115 (3%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCS-NCQAIAD 119
R + GGE++FGG D + + G +Y P+S++GYWQF ++ IE +C CQA+ D
Sbjct: 197 RQGTQNVGGELVFGGSDPNYYTGAFSYVPLSKEGYWQFELDGGTIENEFFCEGGCQAVID 256
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
TGTSLI+GP++ +A++N LIGA + + V+C+++ ++P + + +GGK ++L+
Sbjct: 257 TGTSLIVGPNEEVAKINHLIGADSIQ---SLVNCNSMPELPVITLTIGGKEYSLS 308
Score = 57.8 bits (138), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 21/35 (60%), Positives = 30/35 (85%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCS 41
A+Y+G I++GTPPQ+F V+FDTGSS+ W+PS C+
Sbjct: 52 AEYFGPITIGTPPQDFLVLFDTGSSDFWVPSSECT 86
>gi|71159493|gb|AAZ29603.1| pepsinogen, partial [Pagrus pagrus]
Length = 205
Score = 100 bits (250), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 50/118 (42%), Positives = 71/118 (60%), Gaps = 2/118 (1%)
Query: 58 SHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-KNVYCSN-CQ 115
S + + + E G E++FGG D + + G IT+ P+S YWQ ++S+ I + V CS CQ
Sbjct: 87 SVYLSGNSEQGSEVVFGGTDSNHYTGQITWIPLSSATYWQISMDSVTINGQTVACSGGCQ 146
Query: 116 AIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
AI DTGTSLI+GP+ I +N +GA G A V+C N+ MP+V L G FT+
Sbjct: 147 AIIDTGTSLIVGPTNDINNMNSWVGASTNQYGEATVNCQNVQSMPDVTFTLNGHAFTV 204
>gi|354496335|ref|XP_003510282.1| PREDICTED: cathepsin D [Cricetulus griseus]
gi|344248735|gb|EGW04839.1| Cathepsin D [Cricetulus griseus]
Length = 408
Score = 100 bits (250), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 45/116 (38%), Positives = 73/116 (62%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYC--SNCQAIA 118
R+ + + GGE+M GG+D + G+++Y V+RK YWQ ++ + + + C+AI
Sbjct: 231 RDPTGQPGGELMLGGIDSKYYEGELSYLNVTRKAYWQVHMDQLDVANGLTLCKGGCEAIV 290
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSL++GP + EL K IGAVPL G + C+ + +P+V + LGGK++ L+
Sbjct: 291 DTGTSLLVGPVDEVKELQKAIGAVPLIQGEYMIPCEKVSSLPSVTLKLGGKDYELS 346
Score = 73.9 bits (180), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 30/47 (63%), Positives = 38/47 (80%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+++ AQYYG I +GTPPQ F V+FDTGSSNLW+PS HC +L+IAC
Sbjct: 71 LKNYLDAQYYGEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIAC 117
>gi|195332251|ref|XP_002032812.1| GM20753 [Drosophila sechellia]
gi|194124782|gb|EDW46825.1| GM20753 [Drosophila sechellia]
Length = 392
Score = 100 bits (250), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 48/114 (42%), Positives = 67/114 (58%), Gaps = 1/114 (0%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN-CQAIAD 119
R+ + GGEI+FGG D + + G+ TY PV+RK YWQ +++ I C CQ IAD
Sbjct: 218 RDPASPEGGEIIFGGSDPNHYTGEFTYLPVTRKAYWQIKMDAASIGDLQLCKGGCQVIAD 277
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
TGTSLI P + +N+ IG P+ G V CD + ++P + +LGGK F L
Sbjct: 278 TGTSLIAAPLEEATSINQKIGGTPIIGGQYVVSCDLIPQLPVIKFVLGGKTFEL 331
Score = 72.4 bits (176), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 28/41 (68%), Positives = 36/41 (87%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQYYG I++G+PPQ F+V+FDTGSSNLW+PS+ C + NIAC
Sbjct: 71 AQYYGPIAIGSPPQNFRVVFDTGSSNLWVPSKKCHLTNIAC 111
>gi|147906891|ref|NP_001082550.1| cathepsin D precursor [Xenopus laevis]
gi|28436104|dbj|BAC57431.1| cathepsin D [Xenopus laevis]
Length = 409
Score = 100 bits (250), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 47/119 (39%), Positives = 71/119 (59%), Gaps = 2/119 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI-EKNVYCSN-CQAIA 118
RN + GGE++ GG D + GD Y V+RK YWQ ++ + + ++ C C+AI
Sbjct: 224 RNPDTQPGGELLLGGTDPKYYTGDFHYLNVTRKAYWQIHMDQLGVGDQLTLCKGGCEAIV 283
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLTRTN 177
DTGTSLI GP + +A L + IGA+PL G + CDN+ +P + GG+ ++LT T+
Sbjct: 284 DTGTSLITGPVEEVAALQRAIGAIPLIRGEYMILCDNIPSLPVISFTFGGRVYSLTGTD 342
Score = 73.2 bits (178), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 38/87 (43%), Positives = 56/87 (64%), Gaps = 6/87 (6%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDK--DKFVGHSHFRNAS 64
AQYYG IS+GTPPQ F V+FDTGSSNLW+ S HCS+ +IAC + + D ++ +N +
Sbjct: 77 AQYYGEISIGTPPQPFTVVFDTGSSNLWVASVHCSMFDIACWMHRKYDSSKSSTYVKNGT 136
Query: 65 D---ENGGEIMFGGVDKDKF-VGDITY 87
+ + G + G + KD +G++ Y
Sbjct: 137 EFAIQYGTGSISGYLSKDTVTIGNLGY 163
>gi|148229393|ref|NP_001085403.1| MGC82347 protein precursor [Xenopus laevis]
gi|48734644|gb|AAH72252.1| MGC82347 protein [Xenopus laevis]
Length = 401
Score = 100 bits (250), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 49/116 (42%), Positives = 69/116 (59%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI-EKNVYCSN-CQAIA 118
RN + GGE++ GG D + GD Y V+RK YWQ ++ + + ++ C C+AI
Sbjct: 226 RNPDTQPGGELLLGGTDPKYYTGDFHYLNVTRKAYWQIHMDQLGVGDQLTLCKGGCEAIV 285
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSLI GP + + L K IGAVPL G V CD + +P + + LGG+ +TLT
Sbjct: 286 DTGTSLITGPLEEVTALQKAIGAVPLIQGQYMVQCDKIPTLPVISLTLGGQVYTLT 341
Score = 74.3 bits (181), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 30/41 (73%), Positives = 35/41 (85%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQYYG I +GTPPQ F V+FDTGSSNLW+PS HCS+ +IAC
Sbjct: 79 AQYYGEIGIGTPPQTFTVVFDTGSSNLWVPSVHCSMFDIAC 119
>gi|197631813|gb|ACH70630.1| cathepsin D [Salmo salar]
gi|223648160|gb|ACN10838.1| Cathepsin D precursor [Salmo salar]
Length = 398
Score = 100 bits (250), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 49/116 (42%), Positives = 69/116 (59%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAIA 118
RN E GGE++ GG D + GD Y VSR+ YWQ ++ + + + C C+AI
Sbjct: 223 RNPESEPGGELLLGGTDPKYYSGDFQYLNVSRQAYWQVHMDGMGVGSQLSLCKGGCEAIV 282
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSLI GP+ + L K IGA PL G V+CD + MP++ LGG++++LT
Sbjct: 283 DTGTSLITGPTAEVKALQKAIGATPLIQGEYMVNCDKIPTMPDITFNLGGQSYSLT 338
Score = 71.2 bits (173), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 30/47 (63%), Positives = 36/47 (76%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+++ AQYYG I LGTP Q F V+FDTGSSNLW+PS HCS +IAC
Sbjct: 70 LKNFMDAQYYGEIGLGTPAQTFTVVFDTGSSNLWVPSVHCSFTDIAC 116
>gi|193499293|gb|ACF18589.1| pepsinogen A2 precursor [Siniperca scherzeri]
Length = 376
Score = 100 bits (250), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 50/118 (42%), Positives = 72/118 (61%), Gaps = 2/118 (1%)
Query: 58 SHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-KNVYCSN-CQ 115
S + +++ E G E++FGG+D + G IT+ P+S YWQ ++S+ I + V CS CQ
Sbjct: 206 SVYLSSNSEQGSEVVFGGIDSSHYTGQITWIPLSSATYWQIKMDSVTINGQTVACSGGCQ 265
Query: 116 AIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
AI DTGTSLI+GP+ I +N +GA G A V C N+ MP+V L G+ FT+
Sbjct: 266 AIIDTGTSLIVGPTSDINNMNAWVGASTNQYGEAVVSCQNIQSMPDVTFTLNGQAFTI 323
Score = 62.0 bits (149), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 26/33 (78%), Positives = 29/33 (87%)
Query: 9 YYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCS 41
YYG IS+G+PPQ F VIFDTGSSNLWIPS +CS
Sbjct: 69 YYGVISIGSPPQSFSVIFDTGSSNLWIPSVYCS 101
>gi|253762219|gb|ACT35561.1| pepsinogen A2 precursor [Siniperca chuatsi]
Length = 376
Score = 100 bits (250), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 50/118 (42%), Positives = 72/118 (61%), Gaps = 2/118 (1%)
Query: 58 SHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-KNVYCSN-CQ 115
S + +++ E G E++FGG+D + G IT+ P+S YWQ ++S+ I + V CS CQ
Sbjct: 206 SVYLSSNSEQGSEVVFGGIDSSHYTGQITWIPLSSATYWQIKMDSVTINGQTVACSGGCQ 265
Query: 116 AIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
AI DTGTSLI+GP+ I +N +GA G A V C N+ MP+V L G+ FT+
Sbjct: 266 AIIDTGTSLIVGPTSDINNMNAWVGASTNQYGEAVVSCQNIQSMPDVTFTLNGQAFTI 323
Score = 62.0 bits (149), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 26/33 (78%), Positives = 29/33 (87%)
Query: 9 YYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCS 41
YYG IS+G+PPQ F VIFDTGSSNLWIPS +CS
Sbjct: 69 YYGVISIGSPPQSFSVIFDTGSSNLWIPSVYCS 101
>gi|194762104|ref|XP_001963198.1| GF19727 [Drosophila ananassae]
gi|190616895|gb|EDV32419.1| GF19727 [Drosophila ananassae]
Length = 449
Score = 100 bits (250), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 53/125 (42%), Positives = 73/125 (58%), Gaps = 1/125 (0%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKN 108
+DK +F + RN S GGEI++GG D + G ITY PVS YWQF V++ I
Sbjct: 200 LDKPEFSVYLR-RNGSSLIGGEIIWGGTDPSIYHGSITYVPVSVPQYWQFTVDTGTINGQ 258
Query: 109 VYCSNCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGG 168
+ C CQAIADTGTSLII P + +NK + A +G A + C + K+P + + +GG
Sbjct: 259 ILCRGCQAIADTGTSLIIVPKRAFTAINKQLNATDNGDGTASIPCWEICKLPTLYLNIGG 318
Query: 169 KNFTL 173
F+L
Sbjct: 319 TRFSL 323
Score = 62.0 bits (149), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 25/39 (64%), Positives = 30/39 (76%)
Query: 9 YYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
YYG IS+GTPPQ F V+FDTGSSN W+PS C ++AC
Sbjct: 67 YYGEISIGTPPQNFSVLFDTGSSNTWVPSSLCPASDVAC 105
>gi|21552717|gb|AAM62283.1|AF396662_1 cathepsin D preproprotein [Silurus asotus]
Length = 395
Score = 100 bits (250), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 47/116 (40%), Positives = 67/116 (57%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN--CQAIA 118
RN + GGE++ GG D + GD Y ++R+ YWQ ++ + I + N C+AI
Sbjct: 220 RNPDTQPGGELLLGGTDPKFYTGDFHYVNITRQAYWQIHMDGMSIGSQLSLCNGGCEAIV 279
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSLI GP+ + L K IGA+PL G VDC + +P + LGG+ +TLT
Sbjct: 280 DTGTSLITGPAAEVKALQKAIGAIPLIQGEYMVDCKKVPSLPTISFNLGGQTYTLT 335
Score = 71.6 bits (174), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 30/47 (63%), Positives = 37/47 (78%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+++ AQYYG I LGTP Q F V+FDTGSSNLW+PS HCS+ +IAC
Sbjct: 68 LKNYLDAQYYGEIGLGTPVQTFTVVFDTGSSNLWVPSVHCSLTDIAC 114
>gi|47197223|emb|CAF89153.1| unnamed protein product [Tetraodon nigroviridis]
Length = 268
Score = 100 bits (249), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 51/128 (39%), Positives = 77/128 (60%), Gaps = 3/128 (2%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKN 108
V+K+ F + + RN + GGE++ GG D + GD +Y V+R+ YWQ V+ + +
Sbjct: 25 VEKNVFSFYLN-RNPQTQPGGELLLGGTDPQYYTGDFSYVNVTRQAYWQIHVDELSVGSQ 83
Query: 109 V-YC-SNCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
+ C S C+AI DTGTSL+ GPS+ + L K IGA+PL G V CD + +P + +
Sbjct: 84 LTLCKSGCEAIVDTGTSLLTGPSEEVRSLQKAIGALPLIQGEYMVSCDKIPTLPVITFNI 143
Query: 167 GGKNFTLT 174
GGK ++LT
Sbjct: 144 GGKPYSLT 151
>gi|301786118|ref|XP_002928474.1| PREDICTED: cathepsin E-like [Ailuropoda melanoleuca]
Length = 396
Score = 100 bits (249), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 53/119 (44%), Positives = 78/119 (65%), Gaps = 6/119 (5%)
Query: 63 ASDENGG---EIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAI 117
+SD GG E++FGG D F G++ + PV+++GYWQ +++I++ V +CS CQAI
Sbjct: 220 SSDPEGGAGSELIFGGYDHSHFSGNLHWVPVTKQGYWQIALDAIQVGGAVMFCSEGCQAI 279
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLTRT 176
DTGTSLI GPS + +L K IGA P+ +G V+C NL+ MP+V + G ++TL T
Sbjct: 280 VDTGTSLITGPSDKVKQLQKAIGAEPM-DGEYGVECANLNVMPDVTFTINGISYTLQPT 337
Score = 61.2 bits (147), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 29/53 (54%), Positives = 37/53 (69%), Gaps = 9/53 (16%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHF 60
+Y+GTIS+G+PPQ F VIFDTGSSNLW+PS +C+ + AC HS F
Sbjct: 77 EYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCT--SAACKT-------HSRF 120
>gi|189502972|gb|ACE06867.1| unknown [Schistosoma japonicum]
Length = 429
Score = 100 bits (249), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 46/118 (38%), Positives = 76/118 (64%), Gaps = 1/118 (0%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN-CQAIAD 119
RN ++ GGE+M GG+D + G+I Y ++ K YW F ++++ I C++ CQAIAD
Sbjct: 211 RNITNVLGGELMIGGIDDKYYTGEINYVNLTEKSYWLFKMDNLTISDLSICTDGCQAIAD 270
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLTRTN 177
TGTS+I GP+ + ++N+ +GA L G V CD ++ +P++D ++ GK+ TL T+
Sbjct: 271 TGTSMIAGPTDEVKQINQKLGATHLPGGIYTVSCDVINNLPSIDFVINGKHMTLEPTD 328
Score = 76.6 bits (187), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 30/51 (58%), Positives = 41/51 (80%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDK 51
+++ AQYYG I++GTPPQ F V+FDTGSSNLW+PS+HCS +IAC + +
Sbjct: 58 LKNYLDAQYYGDITIGTPPQTFSVVFDTGSSNLWVPSKHCSYFDIACLLHR 108
>gi|226476906|emb|CAX72305.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
Length = 429
Score = 100 bits (249), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 46/118 (38%), Positives = 76/118 (64%), Gaps = 1/118 (0%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN-CQAIAD 119
RN ++ GGE+M GG+D + G+I Y ++ K YW F ++++ I C++ CQAIAD
Sbjct: 211 RNITNVLGGELMIGGIDDKYYTGEINYVNLTEKSYWLFKMDNLTISDLSICTDGCQAIAD 270
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLTRTN 177
TGTS+I GP+ + ++N+ +GA L G V CD ++ +P++D ++ GK+ TL T+
Sbjct: 271 TGTSMIAGPTDEVKQINQKLGATHLPGGIYTVSCDVINNLPSIDFVINGKHMTLEPTD 328
Score = 76.3 bits (186), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 30/51 (58%), Positives = 41/51 (80%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDK 51
+++ AQYYG I++GTPPQ F V+FDTGSSNLW+PS+HCS +IAC + +
Sbjct: 58 LKNYLDAQYYGDITVGTPPQTFSVVFDTGSSNLWVPSKHCSYFDIACLLHR 108
>gi|226476848|emb|CAX72340.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
Length = 429
Score = 100 bits (249), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 46/118 (38%), Positives = 76/118 (64%), Gaps = 1/118 (0%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN-CQAIAD 119
RN ++ GGE+M GG+D + G+I Y ++ K YW F ++++ I C++ CQAIAD
Sbjct: 211 RNITNVLGGELMIGGIDDKYYTGEINYVNLTEKSYWLFKMDNLTISDLSICTDGCQAIAD 270
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLTRTN 177
TGTS+I GP+ + ++N+ +GA L G V CD ++ +P++D ++ GK+ TL T+
Sbjct: 271 TGTSMIAGPTDEVKQINQKLGATHLPGGIYTVSCDVINNLPSIDFVINGKHMTLEPTD 328
Score = 76.6 bits (187), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 30/51 (58%), Positives = 41/51 (80%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDK 51
+++ AQYYG I++GTPPQ F V+FDTGSSNLW+PS+HCS +IAC + +
Sbjct: 58 LKNYLDAQYYGDITIGTPPQTFSVVFDTGSSNLWVPSKHCSYFDIACLLHR 108
>gi|2347147|gb|AAC37302.1| aspartic proteinase precursor [Schistosoma japonicum]
gi|226476814|emb|CAX72323.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476816|emb|CAX72324.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476820|emb|CAX72326.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476822|emb|CAX72327.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476824|emb|CAX72328.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476826|emb|CAX72329.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476834|emb|CAX72333.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476836|emb|CAX72334.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476840|emb|CAX72336.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476842|emb|CAX72337.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476844|emb|CAX72338.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476846|emb|CAX72339.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476852|emb|CAX72342.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476880|emb|CAX72318.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476882|emb|CAX72317.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476886|emb|CAX72315.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476890|emb|CAX72313.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476892|emb|CAX72312.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476894|emb|CAX72311.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476896|emb|CAX72310.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476898|emb|CAX72309.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476900|emb|CAX72308.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226482870|emb|CAX79402.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
Length = 429
Score = 100 bits (249), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 46/118 (38%), Positives = 76/118 (64%), Gaps = 1/118 (0%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN-CQAIAD 119
RN ++ GGE+M GG+D + G+I Y ++ K YW F ++++ I C++ CQAIAD
Sbjct: 211 RNITNVLGGELMIGGIDDKYYTGEINYVNLTEKSYWLFKMDNLTISDLSICTDGCQAIAD 270
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLTRTN 177
TGTS+I GP+ + ++N+ +GA L G V CD ++ +P++D ++ GK+ TL T+
Sbjct: 271 TGTSMIAGPTDEVKQINQKLGATHLPGGIYTVSCDVINNLPSIDFVINGKHMTLEPTD 328
Score = 76.6 bits (187), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 30/51 (58%), Positives = 41/51 (80%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDK 51
+++ AQYYG I++GTPPQ F V+FDTGSSNLW+PS+HCS +IAC + +
Sbjct: 58 LKNYLDAQYYGDITIGTPPQTFSVVFDTGSSNLWVPSKHCSYFDIACLLHR 108
>gi|226476888|emb|CAX72314.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
gi|226476904|emb|CAX72306.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
Length = 429
Score = 100 bits (249), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 46/118 (38%), Positives = 76/118 (64%), Gaps = 1/118 (0%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN-CQAIAD 119
RN ++ GGE+M GG+D + G+I Y ++ K YW F ++++ I C++ CQAIAD
Sbjct: 211 RNITNVLGGELMIGGIDDKYYTGEINYVNLTEKSYWLFKMDNLTISDLSICTDGCQAIAD 270
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLTRTN 177
TGTS+I GP+ + ++N+ +GA L G V CD ++ +P++D ++ GK+ TL T+
Sbjct: 271 TGTSMIAGPTDEVKQINQKLGATHLPGGIYTVSCDVINNLPSIDFVINGKHMTLEPTD 328
Score = 76.6 bits (187), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 30/51 (58%), Positives = 41/51 (80%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDK 51
+++ AQYYG I++GTPPQ F V+FDTGSSNLW+PS+HCS +IAC + +
Sbjct: 58 LKNYLDAQYYGDITIGTPPQTFSVVFDTGSSNLWVPSKHCSYFDIACLLHR 108
>gi|449481456|ref|XP_002189698.2| PREDICTED: cathepsin E-A-like [Taeniopygia guttata]
Length = 405
Score = 100 bits (249), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 49/116 (42%), Positives = 68/116 (58%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAIA 118
R ENGGE++ GG+D + G I + PV+ K YWQ + +IKI+ V +CS+ C+AI
Sbjct: 224 RGEDTENGGELILGGIDHSLYKGSIHWVPVTEKSYWQIHMNNIKIQGRVTFCSHGCEAIV 283
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
D+GTSLI GPS I L IGA P G VDC L +P++ +G + + L
Sbjct: 284 DSGTSLITGPSSQIRRLQAYIGASPSNTGEFLVDCRRLSSLPHISFTIGHREYKLA 339
Score = 70.1 bits (170), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 35/64 (54%), Positives = 46/64 (71%), Gaps = 6/64 (9%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASDE 66
AQYYG +S+GTPPQ F V+FDTGSSN W+PS +C ++ AC V + KF F++ S E
Sbjct: 79 AQYYGVVSVGTPPQSFTVVFDTGSSNFWVPSAYC--ISEACRVHQ-KF---KSFKSDSYE 132
Query: 67 NGGE 70
+GGE
Sbjct: 133 HGGE 136
>gi|226476876|emb|CAX72320.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
Length = 429
Score = 100 bits (249), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 46/118 (38%), Positives = 76/118 (64%), Gaps = 1/118 (0%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN-CQAIAD 119
RN ++ GGE+M GG+D + G+I Y ++ K YW F ++++ I C++ CQAIAD
Sbjct: 211 RNITNVLGGELMIGGIDDKYYTGEINYVNLTEKSYWLFKMDNLTISDLSICTDGCQAIAD 270
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLTRTN 177
TGTS+I GP+ + ++N+ +GA L G V CD ++ +P++D ++ GK+ TL T+
Sbjct: 271 TGTSMIAGPTDEVKQINQKLGATHLPGGIYTVSCDVINNLPSIDFVINGKHMTLEPTD 328
Score = 76.6 bits (187), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 30/51 (58%), Positives = 41/51 (80%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDK 51
+++ AQYYG I++GTPPQ F V+FDTGSSNLW+PS+HCS +IAC + +
Sbjct: 58 LKNYLDAQYYGDITIGTPPQTFSVVFDTGSSNLWVPSKHCSYFDIACLLHR 108
>gi|226476830|emb|CAX72331.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
Length = 429
Score = 100 bits (249), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 46/118 (38%), Positives = 76/118 (64%), Gaps = 1/118 (0%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN-CQAIAD 119
RN ++ GGE+M GG+D + G+I Y ++ K YW F ++++ I C++ CQAIAD
Sbjct: 211 RNITNVLGGELMIGGIDDKYYTGEINYVNLTEKSYWLFKMDNLTISDLSICTDGCQAIAD 270
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLTRTN 177
TGTS+I GP+ + ++N+ +GA L G V CD ++ +P++D ++ GK+ TL T+
Sbjct: 271 TGTSMIAGPTDEVKQINQKLGATHLPGGIYTVSCDVINNLPSIDFVINGKHMTLEPTD 328
Score = 76.6 bits (187), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 30/51 (58%), Positives = 41/51 (80%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDK 51
+++ AQYYG I++GTPPQ F V+FDTGSSNLW+PS+HCS +IAC + +
Sbjct: 58 LKNYLDAQYYGDITIGTPPQTFSVVFDTGSSNLWVPSKHCSYFDIACLLHR 108
>gi|195429864|ref|XP_002062977.1| GK21682 [Drosophila willistoni]
gi|194159062|gb|EDW73963.1| GK21682 [Drosophila willistoni]
Length = 389
Score = 100 bits (249), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 49/114 (42%), Positives = 65/114 (57%), Gaps = 1/114 (0%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN-CQAIAD 119
R+ S GGEI+FGG D + + GD TY PV+RK YWQ ++S + C CQ IAD
Sbjct: 215 RDPSAPEGGEIIFGGSDPNHYTGDFTYLPVTRKAYWQIKMDSASVGDLQVCQGGCQVIAD 274
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
TGTSLI P +N+ IG P+ G V CD + +P + +LGG+ F L
Sbjct: 275 TGTSLIAAPLSEATSINQKIGGTPIIGGQYVVSCDLIPNLPVIKFVLGGRTFEL 328
Score = 71.6 bits (174), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 30/41 (73%), Positives = 34/41 (82%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQYYG IS+GTP Q FKV+FDTGSSNLW+PS+ C NIAC
Sbjct: 68 AQYYGPISIGTPAQSFKVVFDTGSSNLWVPSKKCHFTNIAC 108
>gi|226476812|emb|CAX72322.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
Length = 429
Score = 100 bits (249), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 46/118 (38%), Positives = 76/118 (64%), Gaps = 1/118 (0%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN-CQAIAD 119
RN ++ GGE+M GG+D + G+I Y ++ K YW F ++++ I C++ CQAIAD
Sbjct: 211 RNITNVLGGELMIGGIDDKYYTGEINYVNLTEKSYWLFKMDNLTISDLSICTDGCQAIAD 270
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLTRTN 177
TGTS+I GP+ + ++N+ +GA L G V CD ++ +P++D ++ GK+ TL T+
Sbjct: 271 TGTSMIAGPTDEVKQINQKLGATHLPGGIYTVSCDVINNLPSIDFVINGKHMTLEPTD 328
Score = 75.9 bits (185), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 30/51 (58%), Positives = 40/51 (78%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDK 51
+++ AQYYG I++GTPPQ F V+FDTGSSNLW+PS HCS +IAC + +
Sbjct: 58 LKNYLDAQYYGDITIGTPPQTFSVVFDTGSSNLWVPSTHCSYFDIACLLHR 108
>gi|327271203|ref|XP_003220377.1| PREDICTED: gastricsin-like [Anolis carolinensis]
Length = 389
Score = 100 bits (249), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 46/111 (41%), Positives = 68/111 (61%), Gaps = 3/111 (2%)
Query: 67 NGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV--YCSN-CQAIADTGTS 123
GGE+ GGVD + + G I ++PV+++ YWQ G++S I + +CS CQAI DTGTS
Sbjct: 221 QGGELALGGVDTNLYSGQIVWTPVTQEAYWQIGIQSFYIGNQITDWCSQGCQAIVDTGTS 280
Query: 124 LIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
L+ P +V EL + +GA NG V C N+ MP + ++GG +F L+
Sbjct: 281 LLTAPQQVFGELMQYVGAEQDGNGQYVVSCSNIQSMPTITFVIGGSSFPLS 331
Score = 56.2 bits (134), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 23/32 (71%), Positives = 27/32 (84%)
Query: 9 YYGTISLGTPPQEFKVIFDTGSSNLWIPSQHC 40
YYG IS+GTPPQ F V+FDTGSSNLW+ S +C
Sbjct: 73 YYGEISIGTPPQNFLVLFDTGSSNLWVASVYC 104
>gi|226476902|emb|CAX72307.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
Length = 429
Score = 100 bits (249), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 46/118 (38%), Positives = 76/118 (64%), Gaps = 1/118 (0%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN-CQAIAD 119
RN ++ GGE+M GG+D + G+I Y ++ K YW F ++++ I C++ CQAIAD
Sbjct: 211 RNITNVLGGELMIGGIDDKYYTGEINYVNLTEKSYWLFKMDNLTISDLSICTDGCQAIAD 270
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLTRTN 177
TGTS+I GP+ + ++N+ +GA L G V CD ++ +P++D ++ GK+ TL T+
Sbjct: 271 TGTSMIAGPTDEVKQINQKLGATHLPGGIYTVSCDVINNLPSIDFVINGKHMTLEPTD 328
Score = 76.6 bits (187), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 30/51 (58%), Positives = 41/51 (80%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDK 51
+++ AQYYG I++GTPPQ F V+FDTGSSNLW+PS+HCS +IAC + +
Sbjct: 58 LKNYLDAQYYGDITIGTPPQTFSVVFDTGSSNLWVPSKHCSYFDIACLLHR 108
>gi|226476854|emb|CAX72343.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
Length = 435
Score = 100 bits (249), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 46/118 (38%), Positives = 76/118 (64%), Gaps = 1/118 (0%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN-CQAIAD 119
RN ++ GGE+M GG+D + G+I Y ++ K YW F ++++ I C++ CQAIAD
Sbjct: 217 RNITNVLGGELMIGGIDDKYYTGEINYVNLTEKSYWLFKMDNLTISDLSICTDGCQAIAD 276
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLTRTN 177
TGTS+I GP+ + ++N+ +GA L G V CD ++ +P++D ++ GK+ TL T+
Sbjct: 277 TGTSMIAGPTDEVKQINQKLGATHLPGGIYTVSCDVINNLPSIDFVINGKHMTLEPTD 334
Score = 76.6 bits (187), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 30/51 (58%), Positives = 41/51 (80%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDK 51
+++ AQYYG I++GTPPQ F V+FDTGSSNLW+PS+HCS +IAC + +
Sbjct: 64 LKNYLDAQYYGDITIGTPPQTFSVVFDTGSSNLWVPSKHCSYFDIACLLHR 114
>gi|226476838|emb|CAX72335.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
Length = 429
Score = 100 bits (249), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 46/118 (38%), Positives = 76/118 (64%), Gaps = 1/118 (0%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN-CQAIAD 119
RN ++ GGE+M GG+D + G+I Y ++ K YW F ++++ I C++ CQAIAD
Sbjct: 211 RNITNVLGGELMIGGIDDKYYTGEINYVNLTEKSYWLFKMDNLTISDLSICTDGCQAIAD 270
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLTRTN 177
TGTS+I GP+ + ++N+ +GA L G V CD ++ +P++D ++ GK+ TL T+
Sbjct: 271 TGTSMIAGPTDEVKQINQKLGATHLPGGIYTVSCDVINNLPSIDFVINGKHMTLEPTD 328
Score = 76.6 bits (187), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 30/51 (58%), Positives = 41/51 (80%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDK 51
+++ AQYYG I++GTPPQ F V+FDTGSSNLW+PS+HCS +IAC + +
Sbjct: 58 LKNYLDAQYYGDITIGTPPQTFSVVFDTGSSNLWVPSKHCSYFDIACLLHR 108
>gi|226476818|emb|CAX72325.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
Length = 429
Score = 100 bits (249), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 46/118 (38%), Positives = 76/118 (64%), Gaps = 1/118 (0%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN-CQAIAD 119
RN ++ GGE+M GG+D + G+I Y ++ K YW F ++++ I C++ CQAIAD
Sbjct: 211 RNITNVLGGELMIGGIDDKYYTGEINYVNLTEKSYWLFKMDNLTISDLSICTDGCQAIAD 270
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLTRTN 177
TGTS+I GP+ + ++N+ +GA L G V CD ++ +P++D ++ GK+ TL T+
Sbjct: 271 TGTSMIAGPTDEVKQINQKLGATHLPGGIYTVSCDVINNLPSIDFVINGKHMTLEPTD 328
Score = 76.6 bits (187), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 30/51 (58%), Positives = 41/51 (80%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDK 51
+++ AQYYG I++GTPPQ F V+FDTGSSNLW+PS+HCS +IAC + +
Sbjct: 58 LKNYLDAQYYGDITIGTPPQTFSVVFDTGSSNLWVPSKHCSYFDIACLLHR 108
>gi|146286061|sp|O93428.2|CATD_CHIHA RecName: Full=Cathepsin D; Flags: Precursor
Length = 396
Score = 100 bits (249), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 48/116 (41%), Positives = 69/116 (59%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCSNCQAIA 118
RN E GGE++ GG D + GD Y V+R+ YWQ V+S+ + + ++ C+AI
Sbjct: 221 RNPDTEPGGELLLGGTDPKYYTGDFNYVNVTRQAYWQIRVDSMAVGDQLSLCTGGCEAIV 280
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
D+GTSLI GPS + L K IGA PL G V+CD + +P + +GG+ +TLT
Sbjct: 281 DSGTSLITGPSVEVKALQKAIGAFPLIQGEYMVNCDTVPSLPVISFTVGGQVYTLT 336
Score = 77.8 bits (190), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 32/47 (68%), Positives = 39/47 (82%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+++ AQYYG I LGTPPQ F V+FDTGSSNLW+PS HCS+L+IAC
Sbjct: 68 LKNYLDAQYYGEIGLGTPPQPFTVVFDTGSSNLWVPSIHCSLLDIAC 114
>gi|2102722|gb|AAB63357.1| aspartic protease precursor, partial [Schistosoma japonicum]
Length = 428
Score = 100 bits (249), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 46/118 (38%), Positives = 76/118 (64%), Gaps = 1/118 (0%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN-CQAIAD 119
RN ++ GGE+M GG+D + G+I Y ++ K YW F ++++ I C++ CQAIAD
Sbjct: 210 RNITNVLGGELMIGGIDDKYYTGEINYVNLTEKSYWLFKMDNLTISDLSICTDGCQAIAD 269
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLTRTN 177
TGTS+I GP+ + ++N+ +GA L G V CD ++ +P++D ++ GK+ TL T+
Sbjct: 270 TGTSMIAGPTDEVKQINQKLGATHLPGGIYTVSCDVINNLPSIDFVINGKHMTLEPTD 327
Score = 76.6 bits (187), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 30/51 (58%), Positives = 41/51 (80%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDK 51
+++ AQYYG I++GTPPQ F V+FDTGSSNLW+PS+HCS +IAC + +
Sbjct: 57 LKNYLDAQYYGDITIGTPPQTFSVVFDTGSSNLWVPSKHCSYFDIACLLHR 107
>gi|148231809|ref|NP_001085308.1| cathepsin D precursor [Xenopus laevis]
gi|62739292|gb|AAH94178.1| LOC443721 protein [Xenopus laevis]
Length = 399
Score = 100 bits (249), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 46/116 (39%), Positives = 70/116 (60%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI-EKNVYCSN-CQAIA 118
RN + GGE++ GG D + GD +Y V+RK YWQ ++ + + ++ C C+AI
Sbjct: 224 RNPDTQPGGELLLGGTDPTYYTGDFSYMNVTRKAYWQIRMDQLSVGDQLTLCKGGCEAIV 283
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSLI GP + +A L + IGA+PL G + CDN+ +P + GG+ ++LT
Sbjct: 284 DTGTSLITGPVEEVAALQRAIGAIPLIRGEYMILCDNIPSLPVISFTFGGRVYSLT 339
Score = 74.7 bits (182), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 31/41 (75%), Positives = 35/41 (85%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQYYG IS+GTPPQ F V+FDTGSSNLW+PS HCS +IAC
Sbjct: 77 AQYYGEISIGTPPQPFTVVFDTGSSNLWVPSVHCSFWDIAC 117
>gi|58258949|ref|XP_566887.1| endopeptidase [Cryptococcus neoformans var. neoformans JEC21]
gi|134107071|ref|XP_777848.1| hypothetical protein CNBA5450 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|50260546|gb|EAL23201.1| hypothetical protein CNBA5450 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|57223024|gb|AAW41068.1| endopeptidase, putative [Cryptococcus neoformans var. neoformans
JEC21]
Length = 438
Score = 100 bits (249), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 52/117 (44%), Positives = 71/117 (60%), Gaps = 2/117 (1%)
Query: 60 FR-NASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI-EKNVYCSNCQAI 117
FR +SDE+GGE +FGG+D + G++ Y PV RKGYW+ +ESI ++ + N A
Sbjct: 266 FRLGSSDEDGGEAIFGGIDDSAYSGELQYVPVRRKGYWEVELESISFGDEELELENTGAA 325
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSLI+ P+ V LNK IGA NG VDC + +P + GGK++ LT
Sbjct: 326 IDTGTSLIVMPTDVAELLNKEIGAEKSWNGQYTVDCSTVSSLPVLAFTFGGKDYKLT 382
Score = 58.5 bits (140), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 36/84 (42%), Positives = 48/84 (57%), Gaps = 7/84 (8%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDK--DKFVGHSHFRNAS 64
AQY+ T+ +GTP Q FKVI DTGSSNLW+PS C+ +IAC + D ++ N S
Sbjct: 125 AQYFATMEIGTPFQTFKVILDTGSSNLWVPSVKCT--SIACFLHSKYDSSQSSTYKANGS 182
Query: 65 DENGGEIMFGGVDKDKFVGDITYS 88
D EI +G + F+ T S
Sbjct: 183 D---FEIHYGSGSLEGFISQDTVS 203
>gi|226476856|emb|CAX72344.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
Length = 429
Score = 100 bits (249), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 46/118 (38%), Positives = 76/118 (64%), Gaps = 1/118 (0%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN-CQAIAD 119
RN ++ GGE+M GG+D + G+I Y ++ K YW F ++++ I C++ CQAIAD
Sbjct: 211 RNITNVLGGELMIGGIDDKYYTGEINYVNLTEKSYWLFKMDNLTISDLSICTDGCQAIAD 270
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLTRTN 177
TGTS+I GP+ + ++N+ +GA L G V CD ++ +P++D ++ GK+ TL T+
Sbjct: 271 TGTSMIAGPTDEVKQINQKLGATHLPGGIYTVSCDVINNLPSIDFVINGKHMTLEPTD 328
Score = 76.6 bits (187), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 30/51 (58%), Positives = 41/51 (80%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDK 51
+++ AQYYG I++GTPPQ F V+FDTGSSNLW+PS+HCS +IAC + +
Sbjct: 58 LKNYLDAQYYGDITIGTPPQTFSVVFDTGSSNLWVPSKHCSYFDIACLLHR 108
>gi|226476832|emb|CAX72332.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
Length = 429
Score = 100 bits (249), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 46/118 (38%), Positives = 76/118 (64%), Gaps = 1/118 (0%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN-CQAIAD 119
RN ++ GGE+M GG+D + G+I Y ++ K YW F ++++ I C++ CQAIAD
Sbjct: 211 RNITNVLGGELMIGGIDDKYYTGEINYVNLTEKSYWLFKMDNLTISDLSICTDGCQAIAD 270
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLTRTN 177
TGTS+I GP+ + ++N+ +GA L G V CD ++ +P++D ++ GK+ TL T+
Sbjct: 271 TGTSMIAGPTDEVKQINQKLGATHLPGGIYTVSCDVINNLPSIDFVINGKHMTLEPTD 328
Score = 76.6 bits (187), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 30/51 (58%), Positives = 41/51 (80%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDK 51
+++ AQYYG I++GTPPQ F V+FDTGSSNLW+PS+HCS +IAC + +
Sbjct: 58 LKNYLDAQYYGDITIGTPPQTFSVVFDTGSSNLWVPSKHCSYFDIACLLHR 108
>gi|195027894|ref|XP_001986817.1| GH21578 [Drosophila grimshawi]
gi|193902817|gb|EDW01684.1| GH21578 [Drosophila grimshawi]
Length = 388
Score = 100 bits (249), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 48/114 (42%), Positives = 64/114 (56%), Gaps = 1/114 (0%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN-CQAIAD 119
R+ GGEI+FGG D + + GD TY PV+RKGYWQ ++S ++ C CQ IAD
Sbjct: 214 RDPKAPEGGEIIFGGSDPNHYTGDFTYLPVTRKGYWQIKMDSAQLNDIELCKGGCQVIAD 273
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
TGTSLI P +N+ IG P+ G V CD + +P + + GK F L
Sbjct: 274 TGTSLIAAPQDEATSINQAIGGTPILGGQYVVSCDAIPNLPVIKFVFNGKTFEL 327
Score = 73.9 bits (180), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 30/41 (73%), Positives = 36/41 (87%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQYYG IS+G+PPQ FKV+FDTGSSNLW+PS+ C + NIAC
Sbjct: 67 AQYYGPISIGSPPQNFKVVFDTGSSNLWVPSKKCHLTNIAC 107
>gi|195114666|ref|XP_002001888.1| GI14567 [Drosophila mojavensis]
gi|193912463|gb|EDW11330.1| GI14567 [Drosophila mojavensis]
Length = 402
Score = 100 bits (249), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 52/125 (41%), Positives = 74/125 (59%), Gaps = 2/125 (1%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKN 108
+D+ F G R+ S + GG+++ GG+D F G++TY V ++GYWQF V S ++
Sbjct: 224 IDEPTF-GFYLARDGSAQAGGQLVLGGIDSQLFSGNLTYVSVVQQGYWQFVVNSAEMGGY 282
Query: 109 VYCSNCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGG 168
V C NCQAIADTGTSL+ P LN+LIG L +G VDC + +P + +GG
Sbjct: 283 VVCYNCQAIADTGTSLLACPGSAYTMLNQLIGGY-LMDGDYYVDCSTVSSLPALKFNIGG 341
Query: 169 KNFTL 173
F+L
Sbjct: 342 TIFSL 346
Score = 62.4 bits (150), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 25/39 (64%), Positives = 30/39 (76%)
Query: 9 YYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
YYG I +GTPPQ F V+FDTGSSNLW+PS C ++AC
Sbjct: 90 YYGVIGIGTPPQYFNVVFDTGSSNLWVPSVQCLSTDVAC 128
>gi|195581342|ref|XP_002080493.1| GD10217 [Drosophila simulans]
gi|194192502|gb|EDX06078.1| GD10217 [Drosophila simulans]
Length = 324
Score = 100 bits (249), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 48/114 (42%), Positives = 67/114 (58%), Gaps = 1/114 (0%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN-CQAIAD 119
R+ + GGEI+FGG D + + G+ TY PV+RK YWQ +++ I C CQ IAD
Sbjct: 150 RDPASPEGGEIIFGGSDPNHYTGEFTYLPVTRKAYWQIKMDAASIGDLQLCKGGCQVIAD 209
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
TGTSLI P + +N+ IG P+ G V CD + ++P + +LGGK F L
Sbjct: 210 TGTSLIAAPLEEATSINQKIGGTPIIGGQYLVSCDLIPQLPVIKFVLGGKTFEL 263
Score = 72.0 bits (175), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 28/41 (68%), Positives = 36/41 (87%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQYYG I++G+PPQ F+V+FDTGSSNLW+PS+ C + NIAC
Sbjct: 3 AQYYGPIAIGSPPQNFRVVFDTGSSNLWVPSKKCHLTNIAC 43
>gi|281339451|gb|EFB15035.1| hypothetical protein PANDA_018433 [Ailuropoda melanoleuca]
Length = 388
Score = 100 bits (249), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 53/119 (44%), Positives = 78/119 (65%), Gaps = 6/119 (5%)
Query: 63 ASDENGG---EIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAI 117
+SD GG E++FGG D F G++ + PV+++GYWQ +++I++ V +CS CQAI
Sbjct: 212 SSDPEGGAGSELIFGGYDHSHFSGNLHWVPVTKQGYWQIALDAIQVGGAVMFCSEGCQAI 271
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLTRT 176
DTGTSLI GPS + +L K IGA P+ +G V+C NL+ MP+V + G ++TL T
Sbjct: 272 VDTGTSLITGPSDKVKQLQKAIGAEPM-DGEYGVECANLNVMPDVTFTINGISYTLQPT 329
Score = 61.2 bits (147), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 24/34 (70%), Positives = 31/34 (91%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCS 41
+Y+GTIS+G+PPQ F VIFDTGSSNLW+PS +C+
Sbjct: 59 EYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCT 92
>gi|3378161|emb|CAA07719.1| cathepsin D precursor [Chionodraco hamatus]
Length = 396
Score = 100 bits (249), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 48/116 (41%), Positives = 69/116 (59%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCSNCQAIA 118
RN E GGE++ GG D + GD Y V+R+ YWQ V+S+ + + ++ C+AI
Sbjct: 221 RNPDTEPGGELLLGGTDPKYYTGDFNYVNVTRQAYWQIRVDSMAVGDQLSLCTGGCEAIV 280
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
D+GTSLI GPS + L K IGA PL G V+CD + +P + +GG+ +TLT
Sbjct: 281 DSGTSLITGPSVEVKALQKAIGAFPLIQGEYMVNCDTVPSLPVISFTVGGQVYTLT 336
Score = 77.4 bits (189), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 32/47 (68%), Positives = 39/47 (82%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+++ AQYYG I LGTPPQ F V+FDTGSSNLW+PS HCS+L+IAC
Sbjct: 68 LKNYLDAQYYGEIGLGTPPQPFTVVFDTGSSNLWVPSIHCSLLDIAC 114
>gi|21355083|ref|NP_652013.1| cathD [Drosophila melanogaster]
gi|6685167|gb|AAF23824.1|AF220040_1 cathepsin D precursor [Drosophila melanogaster]
gi|7304149|gb|AAF59186.1| cathD [Drosophila melanogaster]
gi|15292549|gb|AAK93543.1| SD07085p [Drosophila melanogaster]
gi|220946566|gb|ACL85826.1| cathD-PA [synthetic construct]
Length = 392
Score = 100 bits (249), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 48/114 (42%), Positives = 67/114 (58%), Gaps = 1/114 (0%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN-CQAIAD 119
R+ + GGEI+FGG D + + G+ TY PV+RK YWQ +++ I C CQ IAD
Sbjct: 218 RDPASPEGGEIIFGGSDPNHYTGEFTYLPVTRKAYWQIKMDAASIGDLQLCKGGCQVIAD 277
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
TGTSLI P + +N+ IG P+ G V CD + ++P + +LGGK F L
Sbjct: 278 TGTSLIAAPLEEATSINQKIGGTPIIGGQYVVSCDLIPQLPVIKFVLGGKTFEL 331
Score = 72.0 bits (175), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 28/41 (68%), Positives = 36/41 (87%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQYYG I++G+PPQ F+V+FDTGSSNLW+PS+ C + NIAC
Sbjct: 71 AQYYGPIAIGSPPQNFRVVFDTGSSNLWVPSKKCHLTNIAC 111
>gi|226476810|emb|CAX72321.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
Length = 429
Score = 100 bits (249), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 46/118 (38%), Positives = 76/118 (64%), Gaps = 1/118 (0%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN-CQAIAD 119
RN ++ GGE+M GG+D + G+I Y ++ K YW F ++++ I C++ CQAIAD
Sbjct: 211 RNITNVLGGELMIGGIDDKYYTGEINYVNLTEKSYWLFKMDNLTISDLSICTDGCQAIAD 270
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLTRTN 177
TGTS+I GP+ + ++N+ +GA L G V CD ++ +P++D ++ GK+ TL T+
Sbjct: 271 TGTSMIAGPTDEVKQINQKLGATHLPGGIYTVSCDVINNLPSIDFVINGKHMTLEPTD 328
Score = 76.6 bits (187), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 30/51 (58%), Positives = 41/51 (80%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDK 51
+++ AQYYG I++GTPPQ F V+FDTGSSNLW+PS+HCS +IAC + +
Sbjct: 58 LKNYLDAQYYGDITIGTPPQTFSVVFDTGSSNLWVPSKHCSYFDIACLLHR 108
>gi|195433875|ref|XP_002064932.1| GK15196 [Drosophila willistoni]
gi|194161017|gb|EDW75918.1| GK15196 [Drosophila willistoni]
Length = 415
Score = 100 bits (249), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 50/120 (41%), Positives = 75/120 (62%), Gaps = 3/120 (2%)
Query: 61 RNASDE-NGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSNCQAIAD 119
RN ++ NGGE++ GG+D + G +TY PVS++GYWQF + S + +C NCQAI D
Sbjct: 246 RNGTNATNGGELILGGIDPALYSGCLTYVPVSQQGYWQFEMTSATLNDQEFCDNCQAILD 305
Query: 120 TGTSLIIGPSKVIAELNKLIGAV-PLA-NGPAKVDCDNLDKMPNVDIILGGKNFTLTRTN 177
GTSLI+ P+ I E+N+++G P A +G VDC + K+P++ + K F L T+
Sbjct: 306 VGTSLIVVPNSEIKEINQILGVTNPNATSGAFLVDCATISKLPDIIFTIARKEFALKSTD 365
Score = 60.8 bits (146), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 23/51 (45%), Positives = 36/51 (70%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDK 51
+ ++Y +YY T+ +GTPPQEF+++ DTGS+NLW+PS C AC+ +
Sbjct: 88 LGNAYNTEYYITVHIGTPPQEFRLLIDTGSANLWVPSSKCPSTVKACAAHQ 138
>gi|1585311|prf||2124395A Asp protease
Length = 380
Score = 100 bits (249), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 46/118 (38%), Positives = 76/118 (64%), Gaps = 1/118 (0%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN-CQAIAD 119
RN ++ GGE+M GG+D + G+I Y ++ K YW F ++++ I C++ CQAIAD
Sbjct: 211 RNITNVLGGELMIGGIDDKYYTGEINYVNLTEKSYWLFKMDNLTISDLSICTDGCQAIAD 270
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLTRTN 177
TGTS+I GP+ + ++N+ +GA L G V CD ++ +P++D ++ GK+ TL T+
Sbjct: 271 TGTSMIAGPTDEVKQINQKLGATHLPGGIYTVSCDVINNLPSIDFVINGKHMTLEPTD 328
Score = 73.2 bits (178), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 28/51 (54%), Positives = 40/51 (78%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDK 51
+++ AQY+G I++GTPPQ F +FDTGSSNLW+PS+HCS +IAC + +
Sbjct: 58 LKNYLDAQYHGDITIGTPPQTFSAVFDTGSSNLWVPSKHCSYFDIACLLHR 108
>gi|307170097|gb|EFN62519.1| Lysosomal aspartic protease [Camponotus floridanus]
Length = 285
Score = 100 bits (248), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 48/115 (41%), Positives = 69/115 (60%), Gaps = 1/115 (0%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCS-NCQAIAD 119
R+ S + GG ++ GG D + GD TY PV+RKGYWQ ++ IK+ C +CQ + D
Sbjct: 113 RDTSADLGGTLILGGSDPTYYEGDFTYIPVTRKGYWQITIDRIKMTSEDLCEESCQVVVD 172
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
TG+SLI GP IA L L+ + + KVDC+ + ++P + I+GGK F LT
Sbjct: 173 TGSSLITGPELDIARLIHLLELMNIKQNYLKVDCNRIFQLPTIRFIMGGKAFDLT 227
>gi|49522906|gb|AAH75134.1| LOC443721 protein, partial [Xenopus laevis]
Length = 398
Score = 100 bits (248), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 46/116 (39%), Positives = 70/116 (60%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI-EKNVYCSN-CQAIA 118
RN + GGE++ GG D + GD +Y V+RK YWQ ++ + + ++ C C+AI
Sbjct: 223 RNPDTQPGGELLLGGTDPTYYTGDFSYMNVTRKAYWQIRMDQLSVGDQLTLCKGGCEAIV 282
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSLI GP + +A L + IGA+PL G + CDN+ +P + GG+ ++LT
Sbjct: 283 DTGTSLITGPVEEVAALQRAIGAIPLIRGEYMILCDNIPSLPVISFTFGGRVYSLT 338
Score = 74.7 bits (182), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 31/41 (75%), Positives = 35/41 (85%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQYYG IS+GTPPQ F V+FDTGSSNLW+PS HCS +IAC
Sbjct: 76 AQYYGEISIGTPPQPFTVVFDTGSSNLWVPSVHCSFWDIAC 116
>gi|313219527|emb|CBY30450.1| unnamed protein product [Oikopleura dioica]
Length = 396
Score = 100 bits (248), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 49/120 (40%), Positives = 74/120 (61%), Gaps = 6/120 (5%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-KNVYC-----SNC 114
R A+ E+GGE+ GGVD +F GD +++ V+R+ YWQ +++ +++ K V + C
Sbjct: 217 RTANAEDGGELTIGGVDNSRFTGDFSWNEVTRQAYWQIKMDNFEVQGKGVSACGGNENGC 276
Query: 115 QAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
Q I D+GTSL+ P + E+N IGA ANG V C ++D MP++D L GK +TLT
Sbjct: 277 QVIVDSGTSLLAVPKNLAEEINHAIGAFQFANGEWIVPCRHMDTMPDIDFTLNGKVYTLT 336
Score = 75.5 bits (184), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 31/45 (68%), Positives = 36/45 (80%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDK 51
AQYYGTI +GTPPQEF VIFDTGSSNLW+PS C N+AC + +
Sbjct: 70 AQYYGTIHIGTPPQEFSVIFDTGSSNLWVPSTKCKFTNVACFLHR 114
>gi|195447370|ref|XP_002071184.1| GK25659 [Drosophila willistoni]
gi|194167269|gb|EDW82170.1| GK25659 [Drosophila willistoni]
Length = 406
Score = 100 bits (248), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 45/108 (41%), Positives = 72/108 (66%), Gaps = 3/108 (2%)
Query: 67 NGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSNCQAIADTGTSLII 126
NGGE++ GG D + GD+TY P+S++GYWQF ++ + + N+ CS+CQAIADTGTSL++
Sbjct: 237 NGGELILGGSDSTLYSGDLTYVPISQEGYWQFQMDGVNLGDNLLCSSCQAIADTGTSLLV 296
Query: 127 GPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
P+ V + +++ + N +VDC + +P++ +GG FTLT
Sbjct: 297 VPTDVAQSILQIL---QIMNDAYEVDCSLVPFLPSLYFQIGGTTFTLT 341
Score = 57.0 bits (136), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 23/33 (69%), Positives = 27/33 (81%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHC 40
+YYGTIS+GTP Q F V+FDTGSSNLW+P C
Sbjct: 81 EYYGTISIGTPAQTFVVLFDTGSSNLWVPGSGC 113
>gi|431910128|gb|ELK13201.1| Cathepsin D [Pteropus alecto]
Length = 375
Score = 100 bits (248), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 49/127 (38%), Positives = 77/127 (60%), Gaps = 3/127 (2%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKN 108
VDK+ F + + R+ + + GGE+M GG D + G ++Y V+RK YWQ +E + + +
Sbjct: 187 VDKNIFSFYLN-RDPNAQPGGELMLGGTDSKYYTGSLSYLNVTRKAYWQVHMEQVDVGNS 245
Query: 109 VYC--SNCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
+ + C+AI DTGTSL++GP + + L K IGAVPL G + C+ + +P V + L
Sbjct: 246 LTLCKAGCEAIVDTGTSLVVGPVEEVRALQKAIGAVPLIQGEYMIPCEKVSSLPEVTLKL 305
Query: 167 GGKNFTL 173
GGK + L
Sbjct: 306 GGKGYKL 312
Score = 74.7 bits (182), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 34/66 (51%), Positives = 45/66 (68%), Gaps = 2/66 (3%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHS-- 58
+++ AQYYG I +GTPPQ F V+FDTGSSNLW+PS HC +L+IAC + G S
Sbjct: 36 LKNYMDAQYYGEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIACWIHHKYNSGKSST 95
Query: 59 HFRNAS 64
+ RN +
Sbjct: 96 YVRNGT 101
>gi|1619323|emb|CAA69878.1| aspartic protease [Trematomus bernacchii]
Length = 406
Score = 100 bits (248), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 50/128 (39%), Positives = 79/128 (61%), Gaps = 2/128 (1%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEK- 107
+D+ F + + + GE++ GG+D+ + G I + PV+ KGYWQ ++S+ ++
Sbjct: 214 LDQPVFSFYLSRKQLVEPQRGELLLGGIDEALYNGPINWLPVTAKGYWQIKMDSLVVQGV 273
Query: 108 NVYCSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
+ +CS+ CQAI DTGTSLI GP++ I + +LIGA P G +DC L +P + +L
Sbjct: 274 SPFCSHGCQAIIDTGTSLIAGPTEDILNIQQLIGASPTNIGEFIIDCARLISLPRITFVL 333
Query: 167 GGKNFTLT 174
GGK +TLT
Sbjct: 334 GGKEYTLT 341
Score = 59.3 bits (142), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 32/68 (47%), Positives = 43/68 (63%), Gaps = 7/68 (10%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASDE 66
AQY+G I+LGTP Q F V+FDTGSS+LW+PS +C ++ AC+ K F + S
Sbjct: 81 AQYFGDITLGTPGQNFSVVFDTGSSDLWVPSTYC--VSQACASHKR----FRAFESTSFR 134
Query: 67 NGGEIMFG 74
+ G MFG
Sbjct: 135 HDGR-MFG 141
>gi|328869722|gb|EGG18099.1| cathepsin D [Dictyostelium fasciculatum]
Length = 476
Score = 100 bits (248), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 48/121 (39%), Positives = 75/121 (61%), Gaps = 4/121 (3%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV--YC--SNCQA 116
+ AS NGGE+ FG +D KF G ITY P++ + YW+F ++ ++ + N YC + C+A
Sbjct: 297 KTASPTNGGELSFGSIDNSKFTGAITYVPLTNRTYWEFSMDDVQYDGNSLGYCGKTGCRA 356
Query: 117 IADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLTRT 176
IAD+GTSL+ GP++ I +N +GAV + C+ + +P+V I+L G F LT T
Sbjct: 357 IADSGTSLLAGPTEQIEAINTKLGAVSVNGEAIFPSCNVISSLPDVQIVLAGTTFVLTPT 416
Query: 177 N 177
+
Sbjct: 417 D 417
Score = 71.2 bits (173), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 39/78 (50%), Positives = 48/78 (61%), Gaps = 5/78 (6%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDK--DKFVGHSHFRNAS 64
AQYYG I++GTP Q FKV+FDTGSSNLWIPS+ C I IAC + D S +N +
Sbjct: 150 AQYYGAITIGTPGQPFKVVFDTGSSNLWIPSKKCPITVIACDLHNKYDSTKSSSFVQNGT 209
Query: 65 D---ENGGEIMFGGVDKD 79
D + G M G V +D
Sbjct: 210 DFSIQYGSGAMSGFVSED 227
>gi|327259983|ref|XP_003214815.1| PREDICTED: cathepsin D-like [Anolis carolinensis]
Length = 399
Score = 100 bits (248), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 54/133 (40%), Positives = 80/133 (60%), Gaps = 7/133 (5%)
Query: 49 VDKDKFVGHSHF--RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE 106
+DK+ F S F R+ S GGEI+FGGVD + GD + V+RK YWQ ++ +++
Sbjct: 210 LDKNMF---SFFLNRDPSSSPGGEIIFGGVDPKYYSGDFNWVNVTRKAYWQVHMDRVEVP 266
Query: 107 KNV-YCSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDI 164
+ C N C+AI DTGTSLI GP+ + L K IGA P+ G + C+ L +P V
Sbjct: 267 SGLTVCKNGCEAIVDTGTSLITGPTDEVKALQKAIGAKPIIKGQYILPCEKLATLPIVSF 326
Query: 165 ILGGKNFTLTRTN 177
+LGG++++L+ N
Sbjct: 327 VLGGRSYSLSAEN 339
Score = 75.9 bits (185), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 30/47 (63%), Positives = 39/47 (82%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+++ AQYYG I +GTPPQ+F V+FDTGSSNLW+PS HC +L+IAC
Sbjct: 68 LKNYMDAQYYGEIGIGTPPQKFTVVFDTGSSNLWVPSVHCRLLDIAC 114
>gi|313226363|emb|CBY21507.1| unnamed protein product [Oikopleura dioica]
Length = 396
Score = 100 bits (248), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 49/120 (40%), Positives = 74/120 (61%), Gaps = 6/120 (5%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-KNVYC-----SNC 114
R A+ E+GGE+ GGVD +F GD +++ V+R+ YWQ +++ +++ K V + C
Sbjct: 217 RTANAEDGGELTIGGVDNSRFTGDFSWNEVTRQAYWQIKMDNFEVQGKGVSACGGNENGC 276
Query: 115 QAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
Q I D+GTSL+ P + E+N IGA ANG V C ++D MP++D L GK +TLT
Sbjct: 277 QVIVDSGTSLLAVPKNLAEEINHAIGAFQFANGEWIVPCRHMDTMPDIDFTLNGKVYTLT 336
Score = 75.1 bits (183), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 31/45 (68%), Positives = 36/45 (80%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDK 51
AQYYGTI +GTPPQEF VIFDTGSSNLW+PS C N+AC + +
Sbjct: 70 AQYYGTIHIGTPPQEFSVIFDTGSSNLWVPSTKCKFTNVACLLHR 114
>gi|253762217|gb|ACT35560.1| pepsinogen A2 precursor [Siniperca chuatsi]
Length = 376
Score = 100 bits (248), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 50/118 (42%), Positives = 71/118 (60%), Gaps = 2/118 (1%)
Query: 58 SHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-KNVYCSN-CQ 115
S + +++ E G E++FGG+D + G IT+ P+S YWQ ++S+ I + V CS CQ
Sbjct: 206 SVYLSSNSEQGSEVVFGGIDSSHYTGQITWIPLSSATYWQIKMDSVTINGQTVACSGGCQ 265
Query: 116 AIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
AI DTGTSLI+GP+ I +N +GA G A V C N+ MP V L G+ FT+
Sbjct: 266 AIIDTGTSLIVGPTSDINNMNAWVGASTNQYGEAVVSCQNIQSMPAVTFTLNGQAFTI 323
Score = 62.4 bits (150), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 26/33 (78%), Positives = 29/33 (87%)
Query: 9 YYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCS 41
YYG IS+G+PPQ F VIFDTGSSNLWIPS +CS
Sbjct: 69 YYGVISIGSPPQSFSVIFDTGSSNLWIPSVYCS 101
>gi|74151850|dbj|BAE29712.1| unnamed protein product [Mus musculus]
gi|74151877|dbj|BAE29725.1| unnamed protein product [Mus musculus]
Length = 410
Score = 100 bits (248), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 50/127 (39%), Positives = 78/127 (61%), Gaps = 3/127 (2%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--E 106
VDK+ F + + R+ + GGE+M G D + G+++Y V+RK YWQ ++ +++ E
Sbjct: 222 VDKNIFSFYLN-RDPEGQPGGELMLGDTDSKYYHGELSYLNVTRKAYWQVHMDQLEVGNE 280
Query: 107 KNVYCSNCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
+ C+AI DTGTSL++GP + + EL K IGAVPL G + C+ + +P V + L
Sbjct: 281 LTLCKGGCEAIVDTGTSLLVGPVEEVKELQKAIGAVPLIQGEYMIPCEKVSSLPTVYLKL 340
Query: 167 GGKNFTL 173
GGKN+ L
Sbjct: 341 GGKNYEL 347
Score = 75.9 bits (185), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 47/122 (38%), Positives = 62/122 (50%), Gaps = 27/122 (22%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHF 60
+++ AQYYG I +GTPPQ F V+FDTGSSNLW+PS HC IL+IAC V H
Sbjct: 71 LKNYLDAQYYGDIGIGTPPQCFTVVFDTGSSNLWVPSIHCKILDIACWV---------HH 121
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYW------------QFGVESIKIEKN 108
+ SD++ + K+ DI Y S GY Q IK+EK
Sbjct: 122 KYNSDKSSTYV------KNGTSFDIHYGSGSLSGYLSQDTVSVPCKSDQSKARGIKVEKQ 175
Query: 109 VY 110
++
Sbjct: 176 IF 177
>gi|162423778|gb|ABX89619.1| pepsinogen [Diplodus sargus]
Length = 376
Score = 100 bits (248), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 50/118 (42%), Positives = 70/118 (59%), Gaps = 2/118 (1%)
Query: 58 SHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-KNVYCSN-CQ 115
S + + + E G E++FGG D + G IT+ P+S YWQ ++S+ I + V CS CQ
Sbjct: 206 SVYLSGNSEQGSEVVFGGTDSSHYTGQITWIPLSSATYWQISMDSVTINGQTVACSGGCQ 265
Query: 116 AIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
AI DTGTSLI+GP+ I +N +GA G A V+C N+ MP+V L G FT+
Sbjct: 266 AIIDTGTSLIVGPTSDINNMNSWVGASTNEYGEATVNCQNIQGMPDVTFTLNGHAFTV 323
Score = 63.2 bits (152), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 26/33 (78%), Positives = 29/33 (87%)
Query: 9 YYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCS 41
YYG +S+GTPPQ F VIFDTGSSNLWIPS +CS
Sbjct: 69 YYGVVSIGTPPQSFSVIFDTGSSNLWIPSVYCS 101
>gi|402857430|ref|XP_003893258.1| PREDICTED: cathepsin E [Papio anubis]
Length = 396
Score = 100 bits (248), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 51/117 (43%), Positives = 74/117 (63%), Gaps = 3/117 (2%)
Query: 62 NASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAIAD 119
N G E++FGG D F G +++ PV+++GYWQ +++I++ V +CS CQAI D
Sbjct: 222 NPEGGAGSELIFGGYDHSHFSGSLSWVPVTKQGYWQIALDNIQVGGTVMFCSEGCQAIVD 281
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLTRT 176
TGTSLI GPS I +L IGA P+ +G V+C NL+ MP+V + G +TL+ T
Sbjct: 282 TGTSLITGPSDKIKQLQNAIGAAPV-DGEYAVECANLNVMPDVTFTINGVPYTLSPT 337
Score = 60.8 bits (146), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 24/34 (70%), Positives = 31/34 (91%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCS 41
+Y+GTIS+G+PPQ F VIFDTGSSNLW+PS +C+
Sbjct: 77 EYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCT 110
>gi|374431137|gb|AEZ51819.1| pepsin, partial [Oreochromis niloticus]
Length = 339
Score = 100 bits (248), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 51/118 (43%), Positives = 71/118 (60%), Gaps = 2/118 (1%)
Query: 58 SHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-KNVYCSN-CQ 115
S + + + E G E++FGG D + G IT+ P+S YWQ ++S+ I + V CS CQ
Sbjct: 169 SVYLSGNSEQGSEVVFGGTDSTHYTGTITWIPLSSATYWQINMDSVTINGQTVACSGGCQ 228
Query: 116 AIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
AI DTGTSLI+GP+ I LN +GA +G A V+C N+ MP+V L G FT+
Sbjct: 229 AIIDTGTSLIVGPTTDINNLNSWVGASTDQSGDAIVNCQNIPSMPDVTFTLNGNAFTV 286
Score = 63.9 bits (154), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 26/32 (81%), Positives = 29/32 (90%)
Query: 9 YYGTISLGTPPQEFKVIFDTGSSNLWIPSQHC 40
YYGTIS+GTPPQ F VIFDTGSSNLW+PS +C
Sbjct: 32 YYGTISIGTPPQSFSVIFDTGSSNLWVPSVYC 63
>gi|60600181|gb|AAX26634.1| unknown [Schistosoma japonicum]
gi|226476850|emb|CAX72341.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
Length = 262
Score = 100 bits (248), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 46/118 (38%), Positives = 76/118 (64%), Gaps = 1/118 (0%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN-CQAIAD 119
RN ++ GGE+M GG+D + G+I Y ++ K YW F ++++ I C++ CQAIAD
Sbjct: 44 RNITNVLGGELMIGGIDDKYYTGEINYVNLTEKSYWLFKMDNLTISDLSICTDGCQAIAD 103
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLTRTN 177
TGTS+I GP+ + ++N+ +GA L G V CD ++ +P++D ++ GK+ TL T+
Sbjct: 104 TGTSMIAGPTDEVKQINQKLGATHLPGGIYTVSCDVINNLPSIDFVINGKHMTLEPTD 161
>gi|18203300|sp|Q9MZS8.1|CATD_SHEEP RecName: Full=Cathepsin D; Flags: Precursor
gi|8886526|gb|AAF80494.1|AF164143_1 cathepsin D [Ovis aries]
Length = 365
Score = 100 bits (248), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 51/130 (39%), Positives = 78/130 (60%), Gaps = 7/130 (5%)
Query: 49 VDKDKFVGHSHF--RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE 106
VDK+ F S F R+ + G E+M GG D + G +TY V+R+ YWQ ++ + +
Sbjct: 197 VDKNVF---SFFLNRDPKAQPGEELMLGGTDSKYYRGSLTYHNVTRQAYWQIHMDQLDVG 253
Query: 107 KNVYC--SNCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDI 164
++ C+AI DTGTSL++GP + EL+K IGAVPL G + C+ + +P V +
Sbjct: 254 SSLTVCKGGCEAIVDTGTSLMVGPVDEVRELHKAIGAVPLIQGEYMIPCEKVSSLPQVTL 313
Query: 165 ILGGKNFTLT 174
LGGK++TL+
Sbjct: 314 KLGGKDYTLS 323
Score = 72.8 bits (177), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 40/89 (44%), Positives = 51/89 (57%), Gaps = 15/89 (16%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASDE 66
AQYYG I +GTPPQ F V+FDTGS+NLW+PS HC +L+IAC V H + SD+
Sbjct: 52 AQYYGEIGIGTPPQCFTVVFDTGSANLWVPSIHCKLLDIACWV---------HHKYNSDK 102
Query: 67 NGGEIMFGGVDKDKFVGDITYSPVSRKGY 95
+ + K+ DI Y S GY
Sbjct: 103 SSTYV------KNGTTFDIHYGSGSLSGY 125
>gi|301619112|ref|XP_002938948.1| PREDICTED: cathepsin D-like [Xenopus (Silurana) tropicalis]
Length = 355
Score = 99.8 bits (247), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 48/116 (41%), Positives = 68/116 (58%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI-EKNVYCSN-CQAIA 118
RN + GGE++ GG D + GD Y V+RK YWQ ++ + + ++ C C+ I
Sbjct: 180 RNPDTQPGGELLLGGTDPKYYTGDFHYLSVTRKAYWQIHMDQLGVGDQLTLCKGGCEVIV 239
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSLI GP + + L K IGAVPL G V CD + +P + + LGG+ +TLT
Sbjct: 240 DTGTSLITGPLEEVTALQKAIGAVPLIQGQYMVQCDKVPTLPVISLTLGGQVYTLT 295
Score = 77.8 bits (190), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 32/42 (76%), Positives = 37/42 (88%)
Query: 6 QAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
QAQYYG I LG+PPQ F V+FDTGSSNLW+PS HCS+L+IAC
Sbjct: 32 QAQYYGEIGLGSPPQNFTVVFDTGSSNLWVPSVHCSMLDIAC 73
>gi|226476878|emb|CAX72319.1| cathepsin D (lysosomal aspartyl protease) [Schistosoma japonicum]
Length = 262
Score = 99.8 bits (247), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 46/118 (38%), Positives = 76/118 (64%), Gaps = 1/118 (0%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN-CQAIAD 119
RN ++ GGE+M GG+D + G+I Y ++ K YW F ++++ I C++ CQAIAD
Sbjct: 44 RNITNVLGGELMIGGIDDKYYTGEINYVNLTEKSYWLFKMDNLTISDLSICTDGCQAIAD 103
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLTRTN 177
TGTS+I GP+ + ++N+ +GA L G V CD ++ +P++D ++ GK+ TL T+
Sbjct: 104 TGTSMIAGPTDEVKQINQKLGATHLPGGIYTVSCDVINNLPSIDFVINGKHMTLEPTD 161
>gi|4927648|gb|AAD33219.1| cathepsin D [Hynobius leechii]
Length = 397
Score = 99.8 bits (247), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 49/116 (42%), Positives = 69/116 (59%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI-EKNVYCSN-CQAIA 118
RN GGE++ GG D + + GD TY V+ K YWQ ++ + + ++ C C+AI
Sbjct: 221 RNPDTRPGGELLLGGTDPNYYTGDFTYLNVTPKAYWQIHMDQLGVGDQLTLCKGGCEAIV 280
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSLIIGPS + L K IGA+PL G + CD + +P + LGGK FT++
Sbjct: 281 DTGTSLIIGPSAEVTALQKAIGAIPLIQGEYMIPCDKVPSLPVITFNLGGKAFTVS 336
Score = 76.3 bits (186), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 32/49 (65%), Positives = 40/49 (81%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSV 49
+++ AQYYG I +GTPPQ F V+FDTGSSNLW+PS HCS+L+IAC V
Sbjct: 68 LKNYLDAQYYGEICIGTPPQCFTVVFDTGSSNLWVPSVHCSLLDIACLV 116
>gi|27503926|gb|AAH42316.1| Ctsd protein [Danio rerio]
gi|38571742|gb|AAH62824.1| Ctsd protein [Danio rerio]
gi|197247273|gb|AAI64814.1| Ctsd protein [Danio rerio]
Length = 398
Score = 99.8 bits (247), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 48/116 (41%), Positives = 67/116 (57%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAIA 118
RN + GGE++ GG D + GD Y +SR+ YWQ ++ + I + C C+AI
Sbjct: 221 RNPDTQPGGELLLGGTDPKYYTGDFNYVDISRQAYWQIHMDGMSIGSGLSLCKGGCEAIV 280
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSLI GP+ + L K IGA+PL G VDC + +P + LGGK ++LT
Sbjct: 281 DTGTSLITGPAAEVKALQKAIGAIPLMQGEYMVDCKKVPTLPTISFSLGGKVYSLT 336
Score = 72.0 bits (175), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 30/47 (63%), Positives = 37/47 (78%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+++ AQYYG I LGTP Q F V+FDTGSSNLW+PS HCS+ +IAC
Sbjct: 68 LKNYLDAQYYGEIGLGTPVQTFTVVFDTGSSNLWVPSVHCSLTDIAC 114
>gi|45384002|ref|NP_990508.1| cathepsin D precursor [Gallus gallus]
gi|461696|sp|Q05744.1|CATD_CHICK RecName: Full=Cathepsin D; Contains: RecName: Full=Cathepsin D
light chain; Contains: RecName: Full=Cathepsin D heavy
chain; Flags: Precursor
gi|259835|gb|AAB24157.1| prepro-cathepsin D [Gallus gallus]
Length = 398
Score = 99.8 bits (247), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 47/116 (40%), Positives = 70/116 (60%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYC--SNCQAIA 118
R+ + + GGE++ GG D + GD ++ V+RK YWQ ++S+ + + C+AI
Sbjct: 223 RDPTAQPGGELLLGGTDPKYYSGDFSWVNVTRKAYWQVHMDSVDVANGLTLCKGGCEAIV 282
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSLI GP+K + EL IGA PL G + CD + +P V ++LGGK + LT
Sbjct: 283 DTGTSLITGPTKEVKELQTAIGAKPLIKGQYVISCDKISSLPVVTLMLGGKPYQLT 338
Score = 75.9 bits (185), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 30/47 (63%), Positives = 39/47 (82%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+++ AQYYG I +GTPPQ+F V+FDTGSSNLW+PS HC +L+IAC
Sbjct: 70 LKNYMDAQYYGEIGIGTPPQKFTVVFDTGSSNLWVPSVHCHLLDIAC 116
>gi|31559113|gb|AAP50847.1| cathepsin D [Bombyx mori]
gi|90992734|gb|ABE03014.1| aspartic protease [Bombyx mori]
Length = 385
Score = 99.8 bits (247), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 46/114 (40%), Positives = 67/114 (58%), Gaps = 1/114 (0%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN-CQAIAD 119
R+ GGE++ GG D + GD+ P+ R YW+F ++S+ + + +C+ C AIAD
Sbjct: 210 RDPGATTGGELLLGGSDPAHYRGDLVRVPLLRDTYWEFHMDSVNVNASRFCAQGCSAIAD 269
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
TGTSLI GPSK + LN +GA +A G VDC + +P V + G +FTL
Sbjct: 270 TGTSLIAGPSKEVEALNAAVGATAIAFGQYAVDCSLIPHLPRVTFTIAGNDFTL 323
Score = 75.1 bits (183), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 31/41 (75%), Positives = 35/41 (85%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQYYG IS+GTPPQ FKV+FDTGSSNLW+PS+ C NIAC
Sbjct: 64 AQYYGVISIGTPPQSFKVVFDTGSSNLWVPSKKCHYTNIAC 104
>gi|22651403|gb|AAL61540.1| cathepsin D precursor [Danio rerio]
Length = 398
Score = 99.8 bits (247), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 48/116 (41%), Positives = 67/116 (57%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAIA 118
RN + GGE++ GG D + GD Y +SR+ YWQ ++ + I + C C+AI
Sbjct: 221 RNPDTQPGGELLLGGTDPKYYTGDFNYVDISRQAYWQIHMDGMSIGSGLSLCKGGCEAIV 280
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSLI GP+ + L K IGA+PL G VDC + +P + LGGK ++LT
Sbjct: 281 DTGTSLITGPAAEVKALQKAIGAIPLMQGEYMVDCKKVPTLPTISFSLGGKVYSLT 336
Score = 72.0 bits (175), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 30/47 (63%), Positives = 37/47 (78%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+++ AQYYG I LGTP Q F V+FDTGSSNLW+PS HCS+ +IAC
Sbjct: 68 LKNYLDAQYYGEIGLGTPVQTFTVVFDTGSSNLWVPSVHCSLTDIAC 114
>gi|94732449|emb|CAK11131.1| cathepsin D [Danio rerio]
gi|94733132|emb|CAK05390.1| cathepsin D [Danio rerio]
gi|158253911|gb|AAI54316.1| Ctsd protein [Danio rerio]
Length = 398
Score = 99.8 bits (247), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 48/116 (41%), Positives = 67/116 (57%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAIA 118
RN + GGE++ GG D + GD Y +SR+ YWQ ++ + I + C C+AI
Sbjct: 221 RNPDTQPGGELLLGGTDPKYYTGDFNYVDISRQAYWQIHMDGMSIGSGLSLCKGGCEAIV 280
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSLI GP+ + L K IGA+PL G VDC + +P + LGGK ++LT
Sbjct: 281 DTGTSLITGPAAEVKALQKAIGAIPLMQGEYMVDCKKVPTLPTISFSLGGKVYSLT 336
Score = 71.6 bits (174), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 30/47 (63%), Positives = 37/47 (78%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+++ AQYYG I LGTP Q F V+FDTGSSNLW+PS HCS+ +IAC
Sbjct: 68 LKNYLDAQYYGEIGLGTPVQTFTVVFDTGSSNLWVPSVHCSLTDIAC 114
>gi|57233525|gb|AAW48311.1| pepsinogen [Oreochromis mossambicus]
Length = 260
Score = 99.8 bits (247), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 51/118 (43%), Positives = 71/118 (60%), Gaps = 2/118 (1%)
Query: 58 SHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-KNVYCSN-CQ 115
S + + + E G E++FGG D + G IT+ P+S YWQ ++S+ I + V CS CQ
Sbjct: 114 SVYLSGNSEQGSEVVFGGTDSTHYTGTITWIPLSSATYWQINMDSVTINGQTVACSGGCQ 173
Query: 116 AIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
AI DTGTSLI+GP+ I LN +GA +G A V+C N+ MP+V L G FT+
Sbjct: 174 AIIDTGTSLIVGPTTDINNLNSWVGASTDQSGDAIVNCQNIPSMPDVTFTLNGNAFTV 231
>gi|126723599|ref|NP_001075713.1| cathepsin E precursor [Oryctolagus cuniculus]
gi|1168791|sp|P43159.1|CATE_RABIT RecName: Full=Cathepsin E; Flags: Precursor
gi|402729|gb|AAC37308.1| procathepsin E [Oryctolagus cuniculus]
Length = 396
Score = 99.8 bits (247), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 49/117 (41%), Positives = 74/117 (63%), Gaps = 3/117 (2%)
Query: 62 NASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKN-VYC-SNCQAIAD 119
N +G E+ FGG D F G + + PV+++GYWQ ++ I++ + ++C CQAI D
Sbjct: 222 NPEGGSGSELTFGGYDSSHFSGSLNWVPVTKQGYWQIALDEIQVGGSPMFCPEGCQAIVD 281
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLTRT 176
TGTSLI GPS I +L IGA P+ +G V+C+NL+ MP+V ++ G +TL+ T
Sbjct: 282 TGTSLITGPSDKIIQLQAAIGATPM-DGEYAVECENLNIMPDVTFVINGVPYTLSAT 337
Score = 60.1 bits (144), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 29/64 (45%), Positives = 40/64 (62%), Gaps = 9/64 (14%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASDEN 67
+Y+GTIS+G+PPQ F VIFDT SSNLW+PS +C+ + AC + H FR +
Sbjct: 77 EYFGTISIGSPPQNFTVIFDTVSSNLWVPSVYCT--SPACQM-------HPQFRPSQSNT 127
Query: 68 GGEI 71
E+
Sbjct: 128 YSEV 131
>gi|348514690|ref|XP_003444873.1| PREDICTED: pepsin A-like [Oreochromis niloticus]
Length = 377
Score = 99.8 bits (247), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 51/118 (43%), Positives = 71/118 (60%), Gaps = 2/118 (1%)
Query: 58 SHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-KNVYCSN-CQ 115
S + + + E G E++FGG D + G IT+ P+S YWQ ++S+ I + V CS CQ
Sbjct: 207 SVYLSGNSEQGSEVVFGGTDSTHYTGTITWIPLSSATYWQINMDSVTINGQTVACSGGCQ 266
Query: 116 AIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
AI DTGTSLI+GP+ I LN +GA +G A V+C N+ MP+V L G FT+
Sbjct: 267 AIIDTGTSLIVGPTTDINNLNSWVGASTDQSGDAIVNCQNIPSMPDVTFTLNGNAFTV 324
Score = 63.9 bits (154), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 26/32 (81%), Positives = 29/32 (90%)
Query: 9 YYGTISLGTPPQEFKVIFDTGSSNLWIPSQHC 40
YYGTIS+GTPPQ F VIFDTGSSNLW+PS +C
Sbjct: 70 YYGTISIGTPPQSFSVIFDTGSSNLWVPSVYC 101
>gi|195578345|ref|XP_002079026.1| GD23736 [Drosophila simulans]
gi|194191035|gb|EDX04611.1| GD23736 [Drosophila simulans]
Length = 564
Score = 99.8 bits (247), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 48/113 (42%), Positives = 66/113 (58%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSNCQAIADT 120
R + GGE++ GG+D + G +TY PVS YWQF V +IK + C+ CQAIADT
Sbjct: 393 RQGTAVRGGELILGGIDSSLYRGSLTYVPVSVPAYWQFTVNTIKTNGILLCNGCQAIADT 452
Query: 121 GTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
GTSLI P ++N+ +GA G A V C + +P V++ +GG FTL
Sbjct: 453 GTSLIAVPLAAYRKINRQLGATDNGGGEAFVRCGRISALPKVNLNIGGTVFTL 505
Score = 62.0 bits (149), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 24/47 (51%), Positives = 33/47 (70%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+ +S +YYG I++GTP Q F ++FDTGS+NLW+PS C N AC
Sbjct: 68 LHNSMNNEYYGVIAIGTPKQRFNILFDTGSANLWVPSASCPASNTAC 114
Score = 44.3 bits (103), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 17/38 (44%), Positives = 23/38 (60%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQF 98
R + GGE++ GG+D + G +TY PVS YWQF
Sbjct: 221 RQGTAVRGGELILGGIDSSLYRGSLTYVPVSVPAYWQF 258
Score = 41.6 bits (96), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 15/27 (55%), Positives = 19/27 (70%)
Query: 21 EFKVIFDTGSSNLWIPSQHCSILNIAC 47
F ++FDTGS+NLW+PS C N AC
Sbjct: 260 RFNILFDTGSANLWVPSASCPASNTAC 286
>gi|218847782|ref|NP_001136375.1| cathepsin D-like precursor [Xenopus (Silurana) tropicalis]
gi|159155417|gb|AAI54878.1| LOC613063 protein [Xenopus (Silurana) tropicalis]
Length = 399
Score = 99.8 bits (247), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 48/116 (41%), Positives = 68/116 (58%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI-EKNVYCSN-CQAIA 118
RN + GGE++ GG D + GD Y V+RK YWQ ++ + + ++ C C+ I
Sbjct: 224 RNPDTQPGGELLLGGTDPKYYTGDFHYLSVTRKAYWQIHMDQLGVGDQLTLCKGGCEVIV 283
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSLI GP + + L K IGAVPL G V CD + +P + + LGG+ +TLT
Sbjct: 284 DTGTSLITGPLEEVTALQKAIGAVPLIQGQYMVQCDKVPTLPVISLTLGGQVYTLT 339
Score = 76.3 bits (186), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 31/47 (65%), Positives = 39/47 (82%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+++ AQYYG I LG+PPQ F V+FDTGSSNLW+PS HCS+L+IAC
Sbjct: 71 LKNYLDAQYYGEIGLGSPPQNFTVVFDTGSSNLWVPSVHCSMLDIAC 117
>gi|115279794|gb|ABI85390.1| cathepsin D [Hippoglossus hippoglossus]
Length = 399
Score = 99.8 bits (247), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 47/116 (40%), Positives = 70/116 (60%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYC--SNCQAIA 118
R+AS GGE++ GG D + GD+ Y V+RK YWQ ++ +++ + + CQAI
Sbjct: 224 RDASATVGGELILGGTDPQYYTGDLHYVNVTRKAYWQIKMDGVEVGTQLTLCKAGCQAIV 283
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSLI+GP + + L++ IGA+PL G +DC + +P V +GGK LT
Sbjct: 284 DTGTSLIVGPREEVRALHRAIGALPLIMGEYLIDCKKIPSLPVVSFNIGGKMLNLT 339
Score = 74.7 bits (182), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 30/41 (73%), Positives = 35/41 (85%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQYYG I +GTPPQ F V+FDTGSSNLWIPS HC++ N+AC
Sbjct: 77 AQYYGEIGIGTPPQPFTVLFDTGSSNLWIPSIHCNLFNVAC 117
>gi|344307517|ref|XP_003422427.1| PREDICTED: LOW QUALITY PROTEIN: cathepsin D-like [Loxodonta
africana]
Length = 419
Score = 99.8 bits (247), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 45/128 (35%), Positives = 80/128 (62%), Gaps = 3/128 (2%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKN 108
V+K+ F + + R+ + + GGE+M GG+D + G + ++ V+R+ YWQ ++ + +
Sbjct: 231 VEKNMFSFYLN-RDPTAQPGGELMLGGIDSKYYTGTLNFNKVTREAYWQIHMDRVDVGNG 289
Query: 109 VYC--SNCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
+ C+AI DTGTSL++GP + I EL K +GA+PL G + C+ + +P V + L
Sbjct: 290 LTLCKGGCEAIVDTGTSLMVGPVEEITELQKALGAIPLIQGEYMIPCEKVSSLPPVSLQL 349
Query: 167 GGKNFTLT 174
GG+++TL+
Sbjct: 350 GGRSYTLS 357
Score = 75.5 bits (184), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 31/47 (65%), Positives = 38/47 (80%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+R+ AQYYG I +GTPPQ F V+FDTGSSNLW+PS HC +L+IAC
Sbjct: 79 LRNYMDAQYYGEIGIGTPPQCFTVVFDTGSSNLWVPSVHCKLLDIAC 125
>gi|4099023|gb|AAD00524.1| aspartic protease [Onchocerca volvulus]
Length = 422
Score = 99.8 bits (247), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 50/115 (43%), Positives = 74/115 (64%), Gaps = 2/115 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIK-IEKNVYCSN-CQAIA 118
RN SDE GGEI GG+D ++FV ITY+PVSR GYWQF ++SI+ ++ + C+N CQAIA
Sbjct: 245 RNPSDEVGGEITLGGIDTNRFVSPITYTPVSRHGYWQFKMDSIQGKDEAIGCANGCQAIA 304
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
DTGTSLI G + + + ++ G + C + +P + ++ GK++TL
Sbjct: 305 DTGTSLIAGQKVKLIKFSNILVLNMCMPGEYIIPCYKVSSLPEITFVIAGKSYTL 359
Score = 77.0 bits (188), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 41/82 (50%), Positives = 50/82 (60%), Gaps = 1/82 (1%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHF 60
+++ AQYYG IS+GTPPQ F VIFDTGSSNLWIPS C L+IAC + +K+ G
Sbjct: 92 LKNYMDAQYYGEISIGTPPQNFSVIFDTGSSNLWIPSIKCPFLDIACLL-HNKYKGTESK 150
Query: 61 RNASDENGGEIMFGGVDKDKFV 82
SD EI +G FV
Sbjct: 151 TYKSDGRKIEIQYGRGSMKGFV 172
>gi|157423181|gb|AAI53793.1| Cathepsin E2 [Xenopus laevis]
Length = 397
Score = 99.8 bits (247), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 51/116 (43%), Positives = 75/116 (64%), Gaps = 3/116 (2%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-KNVYCS-NCQAIA 118
R+ + GGE++FGG D +F G + + PV+ +GYWQ +++I+I + V+CS CQAI
Sbjct: 217 RDPNSPVGGELVFGGFDASRFSGQLNWVPVTNQGYWQIQLDNIQINGEVVFCSGGCQAIV 276
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTS+I GPS I +L +IGA ANG +VDC L+KMP + + G + +T
Sbjct: 277 DTGTSMITGPSSDIVQLQSIIGA-SAANGDYEVDCTVLNKMPTMTFTINGIGYQMT 331
Score = 61.6 bits (148), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 31/68 (45%), Positives = 43/68 (63%), Gaps = 3/68 (4%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASDE 66
QY+G IS+GTPPQ F VIFDTGSSNLW+PS +C ++ AC+ ++F S+
Sbjct: 72 VQYFGEISIGTPPQNFTVIFDTGSSNLWVPSVYC--ISPACA-QHNRFQPQLSSTYESNG 128
Query: 67 NGGEIMFG 74
N + +G
Sbjct: 129 NNFSLQYG 136
>gi|184185542|gb|ACC68942.1| cathepsin D (predicted) [Rhinolophus ferrumequinum]
Length = 410
Score = 99.4 bits (246), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 50/127 (39%), Positives = 79/127 (62%), Gaps = 3/127 (2%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKN 108
VDK+ F + + R+ + + GGE+M GG D + G ++Y V+RK YWQ ++ + + +
Sbjct: 222 VDKNIFSFYLN-RDPNAQPGGELMLGGTDSRYYKGALSYLNVTRKAYWQVHMDQVDVGNS 280
Query: 109 VYC--SNCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
+ + C+AI DTGTSLI+GP + + EL K IGAVPL G + C+ + +P V + L
Sbjct: 281 LTLCKAGCEAIVDTGTSLIVGPVEEVRELQKAIGAVPLIQGEYMIPCEKVSSLPEVILKL 340
Query: 167 GGKNFTL 173
GGK++ L
Sbjct: 341 GGKDYKL 347
Score = 74.3 bits (181), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 41/112 (36%), Positives = 62/112 (55%), Gaps = 7/112 (6%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHS-- 58
+++ AQYYG I +GTPPQ F V+FDTGSSNLW+PS HC +L+IAC + G S
Sbjct: 71 LKNYMDAQYYGEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIACWIHHKYNSGKSST 130
Query: 59 HFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVY 110
+ +N + +I +G ++ T S G+ +K+E+ V+
Sbjct: 131 YVKNGTSF---DIHYGSGSLSGYLSQDTVSVPCNSAL--LGLGGVKVERQVF 177
>gi|61742104|gb|AAX54898.1| pepsinogen A2 precursor [Hippoglossus hippoglossus]
Length = 127
Score = 99.4 bits (246), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 49/110 (44%), Positives = 68/110 (61%), Gaps = 2/110 (1%)
Query: 66 ENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-KNVYCSN-CQAIADTGTS 123
+ G E++FGG D + F G I++ P++ YWQ ++S+ I + V CS CQAI DTGTS
Sbjct: 5 QQGSEVVFGGYDANHFTGQISWIPLTSATYWQIKMDSVTINGQTVACSGGCQAIIDTGTS 64
Query: 124 LIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
LI GPS I+ +N +GA G A V+C N+ MP+V L GK FT+
Sbjct: 65 LIAGPSSDISNMNSWVGASTNQYGEATVNCQNIQSMPDVTFTLDGKAFTI 114
>gi|66911216|gb|AAH96630.1| LOC613063 protein, partial [Xenopus (Silurana) tropicalis]
Length = 395
Score = 99.4 bits (246), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 48/116 (41%), Positives = 68/116 (58%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI-EKNVYCSN-CQAIA 118
RN + GGE++ GG D + GD Y V+RK YWQ ++ + + ++ C C+ I
Sbjct: 220 RNPDTQPGGELLLGGTDPKYYTGDFHYLSVTRKAYWQIHMDQLGVGDQLTLCKGGCEVIV 279
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSLI GP + + L K IGAVPL G V CD + +P + + LGG+ +TLT
Sbjct: 280 DTGTSLITGPLEEVTALQKAIGAVPLIQGQYMVQCDKVPTLPVISLTLGGQVYTLT 335
Score = 75.9 bits (185), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 31/47 (65%), Positives = 39/47 (82%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+++ AQYYG I LG+PPQ F V+FDTGSSNLW+PS HCS+L+IAC
Sbjct: 67 LKNYLDAQYYGEIGLGSPPQNFTVVFDTGSSNLWVPSVHCSMLDIAC 113
>gi|146386352|gb|ABQ23964.1| cathepsin D [Oryctolagus cuniculus]
Length = 292
Score = 99.4 bits (246), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 46/116 (39%), Positives = 73/116 (62%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAIA 118
R+ + + GGE+M GGVD + G ++Y V+RK YWQ ++ + + + C C+AI
Sbjct: 132 RDPAAQPGGELMLGGVDPKYYQGSLSYLNVTRKAYWQVHMDQLNVGSGLTLCEGGCEAIV 191
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSL++GP + EL + IGAVPL G + C+ + +P V + LGG+++TL+
Sbjct: 192 DTGTSLLVGPVDEVRELQRAIGAVPLIQGEYIIPCEKVSSLPPVTLKLGGRDYTLS 247
Score = 40.4 bits (93), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 15/22 (68%), Positives = 19/22 (86%)
Query: 28 TGSSNLWIPSQHCSILNIACSV 49
TGSSNLW+PS HC +L+IAC +
Sbjct: 1 TGSSNLWVPSVHCKLLDIACWI 22
>gi|116284100|gb|AAI23963.1| LOC613063 protein [Xenopus (Silurana) tropicalis]
Length = 396
Score = 99.4 bits (246), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 48/116 (41%), Positives = 68/116 (58%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI-EKNVYCSN-CQAIA 118
RN + GGE++ GG D + GD Y V+RK YWQ ++ + + ++ C C+ I
Sbjct: 221 RNPDTQPGGELLLGGTDPKYYTGDFHYLSVTRKAYWQIHMDQLGVGDQLTLCKGGCEVIV 280
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSLI GP + + L K IGAVPL G V CD + +P + + LGG+ +TLT
Sbjct: 281 DTGTSLITGPLEEVTALQKAIGAVPLIQGQYMVQCDKVPTLPVISLTLGGQVYTLT 336
Score = 75.9 bits (185), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 31/47 (65%), Positives = 39/47 (82%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+++ AQYYG I LG+PPQ F V+FDTGSSNLW+PS HCS+L+IAC
Sbjct: 68 LKNYLDAQYYGEIGLGSPPQNFTVVFDTGSSNLWVPSVHCSMLDIAC 114
>gi|348565205|ref|XP_003468394.1| PREDICTED: cathepsin D-like [Cavia porcellus]
Length = 407
Score = 99.4 bits (246), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 45/115 (39%), Positives = 72/115 (62%), Gaps = 2/115 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCSNCQAIA 118
R+ + + GGE++ GG+D + G TY V+RK YWQ ++ +++ E + C+AI
Sbjct: 230 RDPTAQPGGELVLGGIDSKYYKGSFTYLNVTRKAYWQVHMDQLQVGSELTLCKGGCEAIV 289
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
DTGTSL++GP + EL K IGA+PL G + C+ + +P+V + LGG ++TL
Sbjct: 290 DTGTSLLVGPVDEVKELQKAIGALPLIQGEYMIPCEKVSSLPSVTLKLGGTDYTL 344
Score = 70.1 bits (170), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 29/41 (70%), Positives = 34/41 (82%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQYYG I +GTPPQ F V+FDTGSSNLW+PS C +L+IAC
Sbjct: 77 AQYYGEIGIGTPPQCFTVVFDTGSSNLWVPSAKCKMLDIAC 117
>gi|260837471|ref|XP_002613727.1| hypothetical protein BRAFLDRAFT_114822 [Branchiostoma floridae]
gi|229299116|gb|EEN69736.1| hypothetical protein BRAFLDRAFT_114822 [Branchiostoma floridae]
Length = 392
Score = 99.4 bits (246), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 54/127 (42%), Positives = 73/127 (57%), Gaps = 2/127 (1%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKN 108
VDK+ F + R+ S GGE++ GG D + G+ TY PVS +GYWQ ++ + +
Sbjct: 206 VDKNVFSFYLD-RDPSRAPGGELLLGGTDPTYYTGNFTYIPVSYQGYWQLNMDGVHVGDQ 264
Query: 109 VYCSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILG 167
C+ CQAI DTGTSLI GPS+ I +L IG+ ++ G VDC LD +P V G
Sbjct: 265 KLCAGGCQAIVDTGTSLIAGPSEEIHKLQAAIGSQQISPGQYLVDCGRLDDLPVVSFQFG 324
Query: 168 GKNFTLT 174
K F LT
Sbjct: 325 DKLFNLT 331
Score = 63.9 bits (154), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 27/40 (67%), Positives = 32/40 (80%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHC 40
+++ QYYG ISLGTPPQ+F VIFDTGSSNLW+PS C
Sbjct: 66 LKNFMDVQYYGVISLGTPPQDFNVIFDTGSSNLWVPSVKC 105
>gi|148236737|ref|NP_001079044.1| cathepsin E-B precursor [Xenopus laevis]
gi|46395760|sp|Q805F2.1|CATEB_XENLA RecName: Full=Cathepsin E-B; Flags: Precursor
gi|28460655|dbj|BAC57454.1| cathepsin E2 [Xenopus laevis]
Length = 397
Score = 99.4 bits (246), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 51/116 (43%), Positives = 74/116 (63%), Gaps = 3/116 (2%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKN-VYCS-NCQAIA 118
R+ + GGE++FGG D +F G + + PV+ +GYWQ +++I+I V+CS CQAI
Sbjct: 217 RDPNSPVGGELVFGGFDASRFSGQLNWVPVTNQGYWQIQLDNIQINGEVVFCSGGCQAIV 276
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTS+I GPS I +L +IGA ANG +VDC L+KMP + + G + +T
Sbjct: 277 DTGTSMITGPSSDIVQLQSIIGA-SAANGDYEVDCTVLNKMPTMTFTINGIGYQMT 331
Score = 61.6 bits (148), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 27/42 (64%), Positives = 34/42 (80%), Gaps = 2/42 (4%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACS 48
QY+G IS+GTPPQ F VIFDTGSSNLW+PS +C ++ AC+
Sbjct: 72 VQYFGEISIGTPPQNFTVIFDTGSSNLWVPSVYC--ISPACA 111
>gi|281182624|ref|NP_001162374.1| cathepsin D precursor [Papio anubis]
gi|160904227|gb|ABX52210.1| cathepsin D (predicted) [Papio anubis]
Length = 412
Score = 99.4 bits (246), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 45/116 (38%), Positives = 71/116 (61%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAIA 118
R+ + + GGE+M GG D + G ++Y V+RK YWQ ++ +++ + C C+AI
Sbjct: 235 RDPTAQPGGELMLGGTDSKYYRGSLSYLNVTRKAYWQVHLDQVEVASGLTLCKEGCEAIV 294
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSL++GP + EL K IGAVPL G + C+ + +P + + LGGK + L+
Sbjct: 295 DTGTSLMVGPVDEVRELQKAIGAVPLIQGEYMIPCEKVSTLPTITLKLGGKGYKLS 350
Score = 75.1 bits (183), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 42/125 (33%), Positives = 65/125 (52%), Gaps = 4/125 (3%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHF 60
+++ AQYYG I +GTPPQ F V+FDTGSSNLW+PS HC +L+IAC + + K+
Sbjct: 71 LKNYMDAQYYGEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIACWLHR-KYNSDKSS 129
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSNCQAIADT 120
+ I +G ++ T S + + +K+E+ V+ +AI
Sbjct: 130 TYVKNGTSFAIHYGSGSLSGYLSQDTVSVPCKSASSTAALGGVKVERQVF---GEAIKQP 186
Query: 121 GTSLI 125
G + I
Sbjct: 187 GITFI 191
>gi|262073106|ref|NP_001159993.1| cathepsin D precursor [Bos taurus]
gi|296471411|tpg|DAA13526.1| TPA: cathepsin D [Bos taurus]
Length = 410
Score = 99.4 bits (246), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 50/130 (38%), Positives = 77/130 (59%), Gaps = 7/130 (5%)
Query: 49 VDKDKFVGHSHF--RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE 106
VDK+ F S F R+ + GGE+M GG D + G + + V+R+ YWQ ++ + +
Sbjct: 222 VDKNVF---SFFLNRDPKAQPGGELMLGGTDSKYYRGSLMFHNVTRQAYWQIHMDQLDVG 278
Query: 107 KNVYC--SNCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDI 164
++ C+AI DTGTSLI+GP + + EL K IGAVPL G + C+ + +P V +
Sbjct: 279 SSLTVCKGGCEAIVDTGTSLIVGPVEEVRELQKAIGAVPLIQGEYMIPCEKVSSLPQVTV 338
Query: 165 ILGGKNFTLT 174
LGGK++ L+
Sbjct: 339 KLGGKDYALS 348
Score = 68.2 bits (165), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 27/39 (69%), Positives = 33/39 (84%)
Query: 9 YYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
YYG I +GTPPQ F V+FDTGS+NLW+PS HC +L+IAC
Sbjct: 79 YYGEIGIGTPPQCFTVVFDTGSANLWVPSIHCKLLDIAC 117
>gi|386869594|ref|NP_001247483.1| cathepsin D precursor [Macaca mulatta]
gi|67971186|dbj|BAE01935.1| unnamed protein product [Macaca fascicularis]
gi|384939322|gb|AFI33266.1| cathepsin D preproprotein [Macaca mulatta]
Length = 412
Score = 99.4 bits (246), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 45/116 (38%), Positives = 71/116 (61%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAIA 118
R+ + + GGE+M GG D + G ++Y V+RK YWQ ++ +++ + C C+AI
Sbjct: 235 RDPTAQPGGELMLGGTDSKYYRGSLSYLNVTRKAYWQVRLDQVEVASGLTLCKEGCEAIV 294
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSL++GP + EL K IGAVPL G + C+ + +P + + LGGK + L+
Sbjct: 295 DTGTSLMVGPVDEVRELQKAIGAVPLIQGEYMIPCEKVSTLPTITLKLGGKGYKLS 350
Score = 74.3 bits (181), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 45/130 (34%), Positives = 69/130 (53%), Gaps = 14/130 (10%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHF 60
+++ AQYYG I +GTPPQ F V+FDTGSSNLW+PS HC +L+IAC ++ H +
Sbjct: 71 LKNYMDAQYYGEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIAC------WLHHKYN 124
Query: 61 RNASD---ENGGE--IMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSNCQ 115
+ S +NG I +G ++ T S + + +K+E+ V+ +
Sbjct: 125 SDKSSTYVKNGTSFAIHYGSGSLSGYLSQDTVSVPCKSASSTAALGGVKVERQVF---GE 181
Query: 116 AIADTGTSLI 125
AI G + I
Sbjct: 182 AIKQPGITFI 191
>gi|395531206|ref|XP_003767673.1| PREDICTED: cathepsin E [Sarcophilus harrisii]
Length = 395
Score = 99.4 bits (246), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 49/115 (42%), Positives = 73/115 (63%), Gaps = 3/115 (2%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAIA 118
RN G E++FGG D F G + + PV+++GYWQ +++I++ + +C+ CQAI
Sbjct: 220 RNPDSPTGSELVFGGYDHAHFTGSLNWVPVTKQGYWQIALDNIQVGGTIMFCAEGCQAIV 279
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
DTGTSLI GPS I +L IGAV L +G ++C+NL+ MP+V + G +TL
Sbjct: 280 DTGTSLITGPSDKIKQLQNAIGAV-LTDGEYAMECNNLNVMPDVTFTINGIPYTL 333
Score = 60.1 bits (144), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 30/67 (44%), Positives = 42/67 (62%), Gaps = 3/67 (4%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASDEN 67
+YYG IS+G+PPQ F VIFDTGSSNLW+PS +C ++ AC + ++F + N
Sbjct: 76 EYYGVISIGSPPQNFTVIFDTGSSNLWVPSVYC--VSPACK-NHNRFYPSQSSTYVENGN 132
Query: 68 GGEIMFG 74
I +G
Sbjct: 133 SFSIQYG 139
>gi|195339961|ref|XP_002036585.1| GM18746 [Drosophila sechellia]
gi|194130465|gb|EDW52508.1| GM18746 [Drosophila sechellia]
Length = 392
Score = 99.4 bits (246), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 48/113 (42%), Positives = 66/113 (58%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSNCQAIADT 120
R + GGE++ GG+D + G +TY PVS YWQF V +IK + C+ CQAIADT
Sbjct: 221 RQGTAVRGGELILGGIDSSLYRGSLTYVPVSVPAYWQFTVNTIKTNGILLCNGCQAIADT 280
Query: 121 GTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
GTSLI P ++N+ +GA G A V C + +P V++ +GG FTL
Sbjct: 281 GTSLIAVPLAAYRKINRQLGATDNGGGEAFVRCGRVSSLPKVNLNIGGTVFTL 333
Score = 62.8 bits (151), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 24/47 (51%), Positives = 34/47 (72%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+++S +YYG I++GTP Q F ++FDTGS+NLW+PS C N AC
Sbjct: 68 LQNSMNNEYYGVIAIGTPKQRFNILFDTGSANLWVPSASCPASNTAC 114
>gi|71043798|ref|NP_001020792.1| cathepsin D precursor [Canis lupus familiaris]
gi|85540968|sp|Q4LAL9.1|CATD_CANFA RecName: Full=Cathepsin D; Flags: Precursor
gi|70561318|emb|CAJ14973.1| cathepsin D [Canis lupus familiaris]
Length = 410
Score = 99.4 bits (246), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 47/116 (40%), Positives = 71/116 (61%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYC--SNCQAIA 118
R+ + + GGE+M GG D + G ++Y V+RK YWQ +E + + ++ C+AI
Sbjct: 233 RDPNAQPGGELMLGGTDSKYYKGPLSYLNVTRKAYWQVHMEQVDVGSSLTLCKGGCEAIV 292
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSLI+GP + EL K IGAVPL G + C+ + +P+V + LGGK + L+
Sbjct: 293 DTGTSLIVGPVDEVRELQKAIGAVPLIQGEYMIPCEKVSTLPDVTLKLGGKLYKLS 348
Score = 75.1 bits (183), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 42/112 (37%), Positives = 62/112 (55%), Gaps = 7/112 (6%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHS-- 58
+R+ AQYYG I +GTPPQ F V+FDTGSSNLW+PS HC +L+IAC + G S
Sbjct: 71 LRNYMDAQYYGEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIACWIHHKYNSGKSST 130
Query: 59 HFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVY 110
+ +N + +I +G ++ T S + G+ IK+E+ +
Sbjct: 131 YVKNGTSF---DIHYGSGSLSGYLSQDTVSVPCKSAL--SGLAGIKVERQTF 177
>gi|387915174|gb|AFK11196.1| cathepsin D1 [Callorhinchus milii]
Length = 394
Score = 99.4 bits (246), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 50/116 (43%), Positives = 68/116 (58%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI-EKNVYCSN-CQAIA 118
RN GGE++ GG D + GD Y V+RK YWQ ++ + I E+ C C AI
Sbjct: 219 RNPDSLPGGELILGGTDPKYYTGDFHYLNVTRKAYWQVKMDEVSIGEQLTLCKGGCAAIV 278
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSLI GP++ I L K IGA+PL G +DC + +P ++ LGG+ +TLT
Sbjct: 279 DTGTSLITGPAQEIKALQKAIGAIPLIQGEYLIDCKKVASLPAINFKLGGQVYTLT 334
Score = 74.7 bits (182), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 34/68 (50%), Positives = 47/68 (69%), Gaps = 6/68 (8%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSV----DKDKFVG 56
+++ AQYYG + +GTPPQ F V+FDTGSSNLW+PS HCS+ +IAC + + DK
Sbjct: 66 LKNYLDAQYYGEVGIGTPPQPFTVVFDTGSSNLWVPSVHCSMFDIACLLHHKYNSDK--S 123
Query: 57 HSHFRNAS 64
S+ RN +
Sbjct: 124 SSYVRNGT 131
>gi|326920173|ref|XP_003206349.1| PREDICTED: cathepsin D-like [Meleagris gallopavo]
Length = 397
Score = 99.4 bits (246), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 47/116 (40%), Positives = 70/116 (60%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYC--SNCQAIA 118
R+ + + GGE++ GG D + GD ++ V+RK YWQ ++S+ + + C+AI
Sbjct: 222 RDPTAQPGGELLLGGTDPKYYRGDFSWVNVTRKAYWQVHMDSVNVANGLTLCKGGCEAIV 281
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSLI GP+K + EL IGA PL G + CD + +P V ++LGGK + LT
Sbjct: 282 DTGTSLITGPTKEVKELQTAIGAKPLIKGQYIIPCDKISSLPVVTLMLGGKPYKLT 337
Score = 75.5 bits (184), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 30/47 (63%), Positives = 39/47 (82%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+++ AQYYG I +GTPPQ+F V+FDTGSSNLW+PS HC +L+IAC
Sbjct: 69 LKNYMDAQYYGEIGIGTPPQKFTVVFDTGSSNLWVPSVHCHLLDIAC 115
>gi|109018632|ref|XP_001090284.1| PREDICTED: cathepsin E isoform 4 [Macaca mulatta]
Length = 396
Score = 99.4 bits (246), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 50/111 (45%), Positives = 72/111 (64%), Gaps = 3/111 (2%)
Query: 68 GGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAIADTGTSLI 125
G E++FGG D F G + + PV+++GYWQ +++I++ V +CS CQAI DTGTSLI
Sbjct: 228 GSELIFGGYDHSHFSGSLNWVPVTKQGYWQIALDNIQVGGTVMFCSEGCQAIVDTGTSLI 287
Query: 126 IGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLTRT 176
GPS I +L IGA P+ +G V+C NL+ MP+V + G +TL+ T
Sbjct: 288 TGPSDKIKQLQNAIGAAPV-DGEYAVECANLNVMPDVTFTINGVPYTLSPT 337
Score = 60.8 bits (146), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 24/34 (70%), Positives = 31/34 (91%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCS 41
+Y+GTIS+G+PPQ F VIFDTGSSNLW+PS +C+
Sbjct: 77 EYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCT 110
>gi|440899428|gb|ELR50729.1| Cathepsin D, partial [Bos grunniens mutus]
Length = 394
Score = 99.4 bits (246), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 50/130 (38%), Positives = 77/130 (59%), Gaps = 7/130 (5%)
Query: 49 VDKDKFVGHSHF--RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE 106
VDK+ F S F R+ + GGE+M GG D + G + + V+R+ YWQ ++ + +
Sbjct: 206 VDKNVF---SFFLNRDPKAQPGGELMLGGTDSKYYRGSLMFHNVTRQAYWQIHMDQLDVG 262
Query: 107 KNVYC--SNCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDI 164
++ C+AI DTGTSLI+GP + + EL K IGAVPL G + C+ + +P V +
Sbjct: 263 SSLTVCKGGCEAIVDTGTSLIVGPVEEVRELQKAIGAVPLIQGEYMIPCEKVSSLPQVTV 322
Query: 165 ILGGKNFTLT 174
LGGK++ L+
Sbjct: 323 KLGGKDYALS 332
Score = 72.4 bits (176), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 29/47 (61%), Positives = 38/47 (80%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+++ AQYYG I +GTPPQ F V+FDTGS+NLW+PS HC +L+IAC
Sbjct: 55 LKNYMDAQYYGEIGIGTPPQCFTVVFDTGSANLWVPSIHCKLLDIAC 101
>gi|355558837|gb|EHH15617.1| hypothetical protein EGK_01732 [Macaca mulatta]
Length = 401
Score = 99.4 bits (246), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 50/111 (45%), Positives = 72/111 (64%), Gaps = 3/111 (2%)
Query: 68 GGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAIADTGTSLI 125
G E++FGG D F G + + PV+++GYWQ +++I++ V +CS CQAI DTGTSLI
Sbjct: 233 GSELIFGGYDHSHFSGSLNWVPVTKQGYWQIALDNIQVGGTVMFCSEGCQAIVDTGTSLI 292
Query: 126 IGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLTRT 176
GPS I +L IGA P+ +G V+C NL+ MP+V + G +TL+ T
Sbjct: 293 TGPSDKIKQLQNAIGAAPV-DGEYAVECANLNVMPDVTFTINGVPYTLSPT 342
Score = 60.8 bits (146), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 24/34 (70%), Positives = 31/34 (91%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCS 41
+Y+GTIS+G+PPQ F VIFDTGSSNLW+PS +C+
Sbjct: 77 EYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCT 110
>gi|13637914|sp|P80209.2|CATD_BOVIN RecName: Full=Cathepsin D; Flags: Precursor
Length = 390
Score = 99.4 bits (246), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 50/130 (38%), Positives = 77/130 (59%), Gaps = 7/130 (5%)
Query: 49 VDKDKFVGHSHF--RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE 106
VDK+ F S F R+ + GGE+M GG D + G + + V+R+ YWQ ++ + +
Sbjct: 202 VDKNVF---SFFLNRDPKAQPGGELMLGGTDSKYYRGSLMFHNVTRQAYWQIHMDQLDVG 258
Query: 107 KNVYC--SNCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDI 164
++ C+AI DTGTSLI+GP + + EL K IGAVPL G + C+ + +P V +
Sbjct: 259 SSLTVCKGGCEAIVDTGTSLIVGPVEEVRELQKAIGAVPLIQGEYMIPCEKVSSLPEVTV 318
Query: 165 ILGGKNFTLT 174
LGGK++ L+
Sbjct: 319 KLGGKDYALS 328
Score = 72.8 bits (177), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 29/47 (61%), Positives = 38/47 (80%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+++ AQYYG I +GTPPQ F V+FDTGS+NLW+PS HC +L+IAC
Sbjct: 51 LKNYMDAQYYGEIGIGTPPQCFTVVFDTGSANLWVPSIHCKLLDIAC 97
>gi|90076280|dbj|BAE87820.1| unnamed protein product [Macaca fascicularis]
Length = 412
Score = 99.4 bits (246), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 45/116 (38%), Positives = 71/116 (61%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAIA 118
R+ + + GGE+M GG D + G ++Y V+RK YWQ ++ +++ + C C+AI
Sbjct: 235 RDPTAQPGGELMLGGTDSKYYRGSLSYLNVTRKAYWQVRLDQVEVASGLTLCKEGCEAIV 294
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSL++GP + EL K IGAVPL G + C+ + +P + + LGGK + L+
Sbjct: 295 DTGTSLMVGPVDEVRELQKAIGAVPLIQGEYMIPCEKVSTLPTITLKLGGKGYKLS 350
Score = 73.9 bits (180), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 45/130 (34%), Positives = 69/130 (53%), Gaps = 14/130 (10%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHF 60
+++ AQYYG I +GTPPQ F V+FDTGSSNLW+PS HC +L+IAC ++ H +
Sbjct: 71 LKNYMDAQYYGEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIAC------WLHHKYN 124
Query: 61 RNASD---ENGGE--IMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSNCQ 115
+ S +NG I +G ++ T S + + +K+E+ V+ +
Sbjct: 125 SDKSSTYVKNGTSFAIHYGSGSLSGYLSQDTVSVPCKSAPSTAALGGVKVERQVF---GE 181
Query: 116 AIADTGTSLI 125
AI G + I
Sbjct: 182 AIKQPGITFI 191
>gi|112983576|ref|NP_001037351.1| cathepsin D precursor [Bombyx mori]
gi|66269351|gb|AAY43135.1| CathD [Bombyx mori]
Length = 384
Score = 99.4 bits (246), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 46/114 (40%), Positives = 67/114 (58%), Gaps = 1/114 (0%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN-CQAIAD 119
R+ GGE++ GG D + GD+ P+ R YW+F ++S+ + + +C+ C AIAD
Sbjct: 210 RDPGATTGGELLLGGSDPAHYRGDLVRVPLLRDTYWEFHMDSVNVNASRFCAQGCSAIAD 269
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
TGTSLI GPSK + LN +GA +A G VDC + +P V + G +FTL
Sbjct: 270 TGTSLIAGPSKEVEALNAAVGATAIAFGQYVVDCSLIPHLPRVTFTIAGNDFTL 323
Score = 75.1 bits (183), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 31/41 (75%), Positives = 35/41 (85%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQYYG IS+GTPPQ FKV+FDTGSSNLW+PS+ C NIAC
Sbjct: 64 AQYYGVISIGTPPQSFKVVFDTGSSNLWVPSKKCHYTNIAC 104
>gi|342305186|dbj|BAK55647.1| cathepsin D [Oplegnathus fasciatus]
Length = 396
Score = 99.0 bits (245), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 46/116 (39%), Positives = 68/116 (58%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCSNCQAIA 118
RN E GGE++ GG D + GD Y ++R+ YWQ ++ + + + N+ S C+AI
Sbjct: 221 RNPDTEPGGELLLGGTDPKYYSGDFHYVNITRQAYWQIHMDGMAVGGQLNLCTSGCEAIV 280
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSLI GPS + L K IGA+P G V CD + +P + +GG+++ LT
Sbjct: 281 DTGTSLITGPSAEVRSLQKAIGAIPFIQGEYMVSCDKIPSLPVITFNVGGQSYVLT 336
Score = 79.0 bits (193), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 33/47 (70%), Positives = 39/47 (82%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+++ AQYYG I LGTPPQ F V+FDTGSSNLW+PS HCSIL+IAC
Sbjct: 68 LKNYLDAQYYGEIGLGTPPQPFTVVFDTGSSNLWVPSVHCSILDIAC 114
>gi|42476045|ref|NP_599161.2| cathepsin D precursor [Rattus norvegicus]
gi|38303993|gb|AAH62032.1| Cathepsin D [Rattus norvegicus]
gi|149061703|gb|EDM12126.1| cathepsin D, isoform CRA_c [Rattus norvegicus]
Length = 407
Score = 99.0 bits (245), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 50/139 (35%), Positives = 81/139 (58%), Gaps = 7/139 (5%)
Query: 42 ILNIACSVDKDKFVGHSHF-----RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYW 96
+L + ++ K K V + F R+ + + GGE+M GG D + G+++Y V+RK YW
Sbjct: 206 VLPVFDNLMKQKLVEKNIFSFYLNRDPTGQPGGELMLGGTDSRYYHGELSYLNVTRKAYW 265
Query: 97 QFGVESIKI--EKNVYCSNCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCD 154
Q ++ +++ E + C+AI DTGTSL++GP + EL K IGAVPL G + C+
Sbjct: 266 QVHMDQLEVGSELTLCKGGCEAIVDTGTSLLVGPVDEVKELQKAIGAVPLIQGEYMIPCE 325
Query: 155 NLDKMPNVDIILGGKNFTL 173
+ +P + LGG+N+ L
Sbjct: 326 KVSSLPIITFKLGGQNYEL 344
Score = 74.7 bits (182), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 46/119 (38%), Positives = 62/119 (52%), Gaps = 24/119 (20%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHF 60
+++ AQYYG I +GTPPQ F V+FDTGSSNLW+PS HC +L+IAC V H
Sbjct: 71 LKNYLDAQYYGEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIACWV---------HH 121
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVES---------IKIEKNVY 110
+ SD++ + K+ DI Y S GY S IK+EK ++
Sbjct: 122 KYNSDKSSTYV------KNGTSFDIHYGSGSLSGYLSQDTVSVPCKSDLGGIKVEKQIF 174
>gi|115720|sp|P24268.1|CATD_RAT RecName: Full=Cathepsin D; Contains: RecName: Full=Cathepsin D 12
kDa light chain; Contains: RecName: Full=Cathepsin D 9
kDa light chain; Contains: RecName: Full=Cathepsin D 34
kDa heavy chain; Contains: RecName: Full=Cathepsin D 30
kDa heavy chain; Flags: Precursor
gi|55882|emb|CAA38349.1| preprocathepsin D [Rattus norvegicus]
Length = 407
Score = 99.0 bits (245), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 50/139 (35%), Positives = 81/139 (58%), Gaps = 7/139 (5%)
Query: 42 ILNIACSVDKDKFVGHSHF-----RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYW 96
+L + ++ K K V + F R+ + + GGE+M GG D + G+++Y V+RK YW
Sbjct: 206 VLPVFDNLMKQKLVEKNIFSFYLNRDPTGQPGGELMLGGTDSRYYHGELSYLNVTRKAYW 265
Query: 97 QFGVESIKI--EKNVYCSNCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCD 154
Q ++ +++ E + C+AI DTGTSL++GP + EL K IGAVPL G + C+
Sbjct: 266 QVHMDQLEVGSELTLCKGGCEAIVDTGTSLLVGPVDEVKELQKAIGAVPLIQGEYMIPCE 325
Query: 155 NLDKMPNVDIILGGKNFTL 173
+ +P + LGG+N+ L
Sbjct: 326 KVSSLPIITFKLGGQNYEL 344
Score = 74.7 bits (182), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 46/119 (38%), Positives = 62/119 (52%), Gaps = 24/119 (20%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHF 60
+++ AQYYG I +GTPPQ F V+FDTGSSNLW+PS HC +L+IAC V H
Sbjct: 71 LKNYLDAQYYGEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIACWV---------HH 121
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVES---------IKIEKNVY 110
+ SD++ + K+ DI Y S GY S IK+EK ++
Sbjct: 122 KYNSDKSSTYV------KNGTSFDIHYGSGSLSGYLSQDTVSVPCKSDLGGIKVEKQIF 174
>gi|432850603|ref|XP_004066829.1| PREDICTED: cathepsin D-like isoform 3 [Oryzias latipes]
Length = 416
Score = 99.0 bits (245), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 46/116 (39%), Positives = 69/116 (59%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAIA 118
RN E GGE++ GG D + GD Y +SR+ YWQ ++ + + + C C+AI
Sbjct: 241 RNPDTEPGGELLLGGTDPKYYSGDFHYVNISRQAYWQIHMDGMAVGSQLSLCKGGCEAIV 300
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSL+ GPS + L K IGA+PL G ++CD + +P + +GG+++TLT
Sbjct: 301 DTGTSLLTGPSAEVKALQKAIGAIPLIQGEYMINCDKIPSLPAITFNIGGQSYTLT 356
Score = 78.6 bits (192), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 32/47 (68%), Positives = 40/47 (85%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+++ AQYYG I+LGTPPQ F V+FDTGSSNLW+PS HCS+L+IAC
Sbjct: 68 LKNYLDAQYYGEIALGTPPQPFTVVFDTGSSNLWVPSVHCSLLDIAC 114
>gi|355566182|gb|EHH22561.1| Cathepsin D [Macaca mulatta]
Length = 450
Score = 99.0 bits (245), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 44/116 (37%), Positives = 70/116 (60%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYC--SNCQAIA 118
R+ + + GGE+M GG D + G ++Y V+RK YWQ ++ +++ + C+AI
Sbjct: 273 RDPTAQPGGELMLGGTDSKYYRGSLSYLNVTRKAYWQVRLDQVEVASGLTLCKEGCEAIV 332
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSL++GP + EL K IGAVPL G + C+ + +P + + LGGK + L+
Sbjct: 333 DTGTSLMVGPVDEVRELQKAIGAVPLIQGEYMIPCEKVSTLPTITLKLGGKGYKLS 388
Score = 73.9 bits (180), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 45/130 (34%), Positives = 69/130 (53%), Gaps = 14/130 (10%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHF 60
+++ AQYYG I +GTPPQ F V+FDTGSSNLW+PS HC +L+IAC ++ H +
Sbjct: 109 LKNYMDAQYYGEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIAC------WLHHKYN 162
Query: 61 RNASD---ENGGE--IMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSNCQ 115
+ S +NG I +G ++ T S + + +K+E+ V+ +
Sbjct: 163 SDKSSTYVKNGTSFAIHYGSGSLSGYLSQDTVSVPCKSASSTAALGGVKVERQVF---GE 219
Query: 116 AIADTGTSLI 125
AI G + I
Sbjct: 220 AIKQPGITFI 229
>gi|307188693|gb|EFN73361.1| Lysosomal aspartic protease [Camponotus floridanus]
Length = 173
Score = 99.0 bits (245), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 50/115 (43%), Positives = 70/115 (60%), Gaps = 5/115 (4%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCS-NCQAIAD 119
RN S + GG++ FGG D +VG TY PV++KGYWQF +ESI + ++C +CQAI D
Sbjct: 1 RNTSVKFGGKLTFGGSDPAYYVGGFTYVPVTKKGYWQFTLESIDMNDFIWCEKSCQAIVD 60
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
+ I+GP I +N LIG N +VDC+ + ++P + LGGK F LT
Sbjct: 61 SSAWQIMGPQTDILYINHLIG----TNTQGRVDCNRILELPTIRFKLGGKTFDLT 111
>gi|355745980|gb|EHH50605.1| hypothetical protein EGM_01462 [Macaca fascicularis]
Length = 401
Score = 99.0 bits (245), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 51/117 (43%), Positives = 73/117 (62%), Gaps = 3/117 (2%)
Query: 62 NASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAIAD 119
N G E++FGG D F G + + PV+++GYWQ +++I++ V +CS CQAI D
Sbjct: 227 NPEGGAGSELIFGGYDHSHFSGSLDWVPVTKQGYWQIALDNIQVGGTVMFCSEGCQAIVD 286
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLTRT 176
TGTSLI GPS I +L IGA P+ +G V+C NL+ MP+V + G +TL+ T
Sbjct: 287 TGTSLITGPSDKIKQLQNAIGAAPV-DGEYAVECANLNVMPDVTFTINGVPYTLSPT 342
Score = 60.8 bits (146), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 24/34 (70%), Positives = 31/34 (91%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCS 41
+Y+GTIS+G+PPQ F VIFDTGSSNLW+PS +C+
Sbjct: 77 EYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCT 110
>gi|194757447|ref|XP_001960976.1| GF11236 [Drosophila ananassae]
gi|190622274|gb|EDV37798.1| GF11236 [Drosophila ananassae]
Length = 388
Score = 99.0 bits (245), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 49/114 (42%), Positives = 64/114 (56%), Gaps = 1/114 (0%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN-CQAIAD 119
R+ + GGEI+FGG D + GD TY PV+RK YWQ +++ I C CQ IAD
Sbjct: 214 RDPAGPEGGEIIFGGSDPKHYSGDFTYLPVTRKAYWQIKMDAASIGDLELCKGGCQVIAD 273
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
TGTSLI P +N+ IG P+ G V CD + +P + +LGGK F L
Sbjct: 274 TGTSLIAAPMSEATSINQKIGGTPIIGGQYVVSCDLIPNLPVIKFVLGGKTFEL 327
Score = 73.2 bits (178), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 29/41 (70%), Positives = 36/41 (87%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQYYG IS+G+PPQ F+V+FDTGSSNLW+PS+ C + NIAC
Sbjct: 67 AQYYGPISIGSPPQNFRVVFDTGSSNLWVPSKKCHLTNIAC 107
>gi|194759256|ref|XP_001961865.1| GF15184 [Drosophila ananassae]
gi|190615562|gb|EDV31086.1| GF15184 [Drosophila ananassae]
Length = 403
Score = 99.0 bits (245), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 48/113 (42%), Positives = 67/113 (59%), Gaps = 1/113 (0%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSNCQAIADT 120
RN + + GG++ GG D + GD+TY PV+++GYWQF V I V QAIADT
Sbjct: 236 RNGTSDEGGQLTLGGTDYNLIDGDLTYVPVTKQGYWQFAVNQITWNGTVVSGPVQAIADT 295
Query: 121 GTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
GTSLI+ P+ ++N+LIGA+ + G V C +D +P + GG NF L
Sbjct: 296 GTSLIVVPADAYTKINELIGAIYI-QGEWYVPCSTVDSLPVITFNFGGTNFDL 347
Score = 63.5 bits (153), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 27/47 (57%), Positives = 34/47 (72%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+ +S YYG IS+GTPPQ F V+FDTGS+NLWIPS C ++AC
Sbjct: 83 LGNSMNMYYYGEISIGTPPQYFNVVFDTGSANLWIPSVQCLSTDVAC 129
>gi|380036056|ref|NP_001244039.1| cathepsin D1 precursor [Ictalurus punctatus]
gi|330689904|gb|AEC33270.1| cathepsin D1 [Ictalurus punctatus]
Length = 396
Score = 99.0 bits (245), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 46/116 (39%), Positives = 66/116 (56%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYC--SNCQAIA 118
RN + GGE++ GG D + GD Y ++R+ YWQ ++ + I + C+AI
Sbjct: 221 RNPDTQPGGELLLGGTDPKFYTGDFHYVNITRQAYWQIHMDGMTIGSQLTLCKGGCEAIV 280
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSLI GP+ + L K IGA+PL G VDC + +P + LGG+ +TLT
Sbjct: 281 DTGTSLITGPAAEVKALQKAIGAIPLIQGEYMVDCKKVPSLPTISFNLGGQTYTLT 336
Score = 70.5 bits (171), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 29/47 (61%), Positives = 37/47 (78%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+++ AQYYG I LG+P Q F V+FDTGSSNLW+PS HCS+ +IAC
Sbjct: 68 LKNYLDAQYYGEIGLGSPVQTFTVVFDTGSSNLWVPSVHCSLTDIAC 114
>gi|432850599|ref|XP_004066827.1| PREDICTED: cathepsin D-like isoform 1 [Oryzias latipes]
Length = 396
Score = 99.0 bits (245), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 46/116 (39%), Positives = 69/116 (59%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAIA 118
RN E GGE++ GG D + GD Y +SR+ YWQ ++ + + + C C+AI
Sbjct: 221 RNPDTEPGGELLLGGTDPKYYSGDFHYVNISRQAYWQIHMDGMAVGSQLSLCKGGCEAIV 280
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSL+ GPS + L K IGA+PL G ++CD + +P + +GG+++TLT
Sbjct: 281 DTGTSLLTGPSAEVKALQKAIGAIPLIQGEYMINCDKIPSLPAITFNIGGQSYTLT 336
Score = 78.6 bits (192), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 32/47 (68%), Positives = 40/47 (85%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+++ AQYYG I+LGTPPQ F V+FDTGSSNLW+PS HCS+L+IAC
Sbjct: 68 LKNYLDAQYYGEIALGTPPQPFTVVFDTGSSNLWVPSVHCSLLDIAC 114
>gi|34740274|dbj|BAC87742.1| pepsinogen [Paralichthys olivaceus]
Length = 377
Score = 99.0 bits (245), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 47/118 (39%), Positives = 74/118 (62%), Gaps = 2/118 (1%)
Query: 58 SHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-KNVYCSN-CQ 115
S + +++++ G E++FGG+D + + G +++ P++ YWQ ++S+ I + V CS CQ
Sbjct: 207 SVYLSSNNQQGSEVVFGGIDGNHYTGQVSWIPLTSATYWQIKMDSVTINGQTVACSGGCQ 266
Query: 116 AIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
AI DTGTSLI+GP+ I +N +GA G A V+C N+ MP V L GK FT+
Sbjct: 267 AIIDTGTSLIVGPTNDINNMNSWVGASTNQYGEATVNCQNIQSMPEVTFTLNGKAFTV 324
Score = 63.5 bits (153), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 27/33 (81%), Positives = 29/33 (87%)
Query: 9 YYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCS 41
YYG IS+GTPPQ F VIFDTGSSNLWIPS +CS
Sbjct: 70 YYGVISIGTPPQSFSVIFDTGSSNLWIPSVYCS 102
>gi|60678795|gb|AAX33732.1| Blo t allergen isoform 2 [Blomia tropicalis]
Length = 402
Score = 99.0 bits (245), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 54/140 (38%), Positives = 81/140 (57%), Gaps = 13/140 (9%)
Query: 48 SVDKDKFVGHSHF-----RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVES 102
S+ K V H F R+ + + GGEI+FGG D + GD TY+P+++ GYWQF +
Sbjct: 197 SMVKQGLVQHPVFSFYLNRDTNGKVGGEIIFGGSDPAYYKGDFTYAPLTKIGYWQFQMHG 256
Query: 103 IKIEK--------NVYCSNCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCD 154
I +E +V S C+AIADTGTSLI GPS + LN+ +GA+ NG ++C
Sbjct: 257 ILLENKSNNKTVGHVCESGCEAIADTGTSLIAGPSDQVEHLNRALGAIGPLNGIFVLNCS 316
Query: 155 NLDKMPNVDIILGGKNFTLT 174
+++ +PN+ + G F L+
Sbjct: 317 HINALPNIIFQINGVKFPLS 336
Score = 67.8 bits (164), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 27/41 (65%), Positives = 32/41 (78%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQYYG I +G+PPQ F VIFDTGSSNLW+PS+ C N+ C
Sbjct: 68 AQYYGEIQIGSPPQPFNVIFDTGSSNLWVPSKKCKFTNLVC 108
>gi|432850601|ref|XP_004066828.1| PREDICTED: cathepsin D-like isoform 2 [Oryzias latipes]
Length = 398
Score = 99.0 bits (245), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 46/116 (39%), Positives = 69/116 (59%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAIA 118
RN E GGE++ GG D + GD Y +SR+ YWQ ++ + + + C C+AI
Sbjct: 223 RNPDTEPGGELLLGGTDPKYYSGDFHYVNISRQAYWQIHMDGMAVGSQLSLCKGGCEAIV 282
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSL+ GPS + L K IGA+PL G ++CD + +P + +GG+++TLT
Sbjct: 283 DTGTSLLTGPSAEVKALQKAIGAIPLIQGEYMINCDKIPSLPAITFNIGGQSYTLT 338
Score = 78.6 bits (192), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 32/47 (68%), Positives = 40/47 (85%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+++ AQYYG I+LGTPPQ F V+FDTGSSNLW+PS HCS+L+IAC
Sbjct: 68 LKNYLDAQYYGEIALGTPPQPFTVVFDTGSSNLWVPSVHCSLLDIAC 114
>gi|326933745|ref|XP_003212960.1| PREDICTED: cathepsin E-like [Meleagris gallopavo]
Length = 403
Score = 99.0 bits (245), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 48/115 (41%), Positives = 73/115 (63%), Gaps = 3/115 (2%)
Query: 62 NASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKN-VYCSN-CQAIAD 119
N GGE++FGG D +F+G + + PV+ +GYWQ ++++++ V+C+N CQAI D
Sbjct: 229 NPDSSLGGEVLFGGFDPSRFLGTLHWVPVTVQGYWQIQLDNVQVGGTVVFCANGCQAIVD 288
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
TGTSL+ GP+K I E+ + IGA P+ +G VDC L MP V + G + L+
Sbjct: 289 TGTSLLTGPTKDIKEMQRYIGATPM-DGEYVVDCSLLSSMPIVTFTINGMPYLLS 342
Score = 60.1 bits (144), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 24/34 (70%), Positives = 30/34 (88%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCS 41
+Y+G IS+GTPPQ F VIFDTGSSNLW+PS +C+
Sbjct: 84 EYFGQISIGTPPQNFTVIFDTGSSNLWVPSIYCT 117
>gi|5921657|gb|AAD56287.1| pepsinogen A form IIb precursor [Pseudopleuronectes americanus]
Length = 151
Score = 99.0 bits (245), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 48/118 (40%), Positives = 72/118 (61%), Gaps = 2/118 (1%)
Query: 58 SHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-KNVYCSN-CQ 115
S + +++ + G E++FGG D + G I++ P++ YWQ ++S+ I + V CS CQ
Sbjct: 3 SVYLSSNAQQGSEVIFGGTDSSHYTGQISWIPLTSATYWQIKMDSVTINGQTVACSGGCQ 62
Query: 116 AIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
AI DTGTSLI+GP+ I+ +N +GA G A V C N+ MP+V L GK FT+
Sbjct: 63 AIIDTGTSLIVGPNSDISNMNSWVGASTNQYGEATVSCQNIQSMPDVTFTLNGKAFTV 120
>gi|213625094|gb|AAI69806.1| LOC443721 protein [Xenopus laevis]
Length = 399
Score = 99.0 bits (245), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 45/116 (38%), Positives = 69/116 (59%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI-EKNVYCSN-CQAIA 118
RN + GGE++ GG D + GD +Y V+RK YWQ ++ + + ++ C C+AI
Sbjct: 224 RNPDTQPGGELLLGGTDPTYYTGDFSYMNVTRKAYWQIRMDQLSVGDQLTLCKGGCEAIV 283
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSLI GP + + L + IGA+PL G + CDN+ +P + GG+ ++LT
Sbjct: 284 DTGTSLITGPVEEVTALQRAIGAIPLIRGEYMILCDNIPSLPVISFTFGGRVYSLT 339
Score = 74.7 bits (182), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 31/41 (75%), Positives = 35/41 (85%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQYYG IS+GTPPQ F V+FDTGSSNLW+PS HCS +IAC
Sbjct: 77 AQYYGEISIGTPPQPFTVVFDTGSSNLWVPSVHCSFWDIAC 117
>gi|355681641|gb|AER96810.1| cathepsin D [Mustela putorius furo]
Length = 410
Score = 98.6 bits (244), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 47/115 (40%), Positives = 68/115 (59%), Gaps = 2/115 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYC--SNCQAIA 118
R+ + GGE+M GG D + G ++Y V+RK YWQ +E + + ++ C+AI
Sbjct: 233 RDPGAQPGGELMLGGTDSKYYKGPLSYLNVTRKAYWQVHMEXVDVGSSLTLCKGGCEAIV 292
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
DTGTSLI+GP + EL K IGAVPL G + C+ + +P V + LGGK + L
Sbjct: 293 DTGTSLIVGPVDEVRELQKAIGAVPLIQGEYMIPCEKVSTLPEVTLKLGGKPYKL 347
Score = 75.1 bits (183), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 31/47 (65%), Positives = 38/47 (80%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+R+ AQYYG I +GTPPQ F V+FDTGSSNLW+PS HC +L+IAC
Sbjct: 71 LRNYMDAQYYGEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIAC 117
>gi|209154266|gb|ACI33365.1| Cathepsin D precursor [Salmo salar]
Length = 402
Score = 98.6 bits (244), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 49/116 (42%), Positives = 70/116 (60%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYC--SNCQAIA 118
R+ GGE+M GG D + GD+ Y V+RK YWQ + ++++ + + CQAI
Sbjct: 227 RDPLAAVGGELMLGGTDPLYYTGDLHYVNVTRKAYWQIEMSNVEVGNQLTLCKAGCQAIV 286
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSLIIGP++ + L+K IGA+PL G +DC + +P + LGGK F LT
Sbjct: 287 DTGTSLIIGPAEEVRVLHKAIGALPLLMGEYWIDCKKVPSLPVIAFNLGGKMFNLT 342
Score = 77.0 bits (188), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 31/41 (75%), Positives = 37/41 (90%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQYYG IS+GTPPQ+F V+FDTGSSNLW+PS HCS L++AC
Sbjct: 80 AQYYGVISIGTPPQDFTVLFDTGSSNLWVPSIHCSFLDVAC 120
>gi|299522|gb|AAB26186.1| cathepsin D {EC 3.4.23.5} [cattle, Peptide Partial, 346 aa]
Length = 346
Score = 98.6 bits (244), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 50/130 (38%), Positives = 77/130 (59%), Gaps = 7/130 (5%)
Query: 49 VDKDKFVGHSHF--RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE 106
VDK+ F S F R+ + GGE+M GG D + G + + V+R+ YWQ ++ + +
Sbjct: 158 VDKNVF---SFFLNRDPKAQPGGELMLGGTDSKYYRGSLMFHNVTRQAYWQIHMDQLDVG 214
Query: 107 KNVYC--SNCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDI 164
++ C+AI DTGTSLI+GP + + EL K IGAVPL G + C+ + +P V +
Sbjct: 215 SSLTVCKGGCEAIVDTGTSLIVGPVEEVRELQKAIGAVPLIQGEYMIPCEKVSSLPEVTV 274
Query: 165 ILGGKNFTLT 174
LGGK++ L+
Sbjct: 275 KLGGKDYALS 284
Score = 72.4 bits (176), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 29/47 (61%), Positives = 38/47 (80%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+++ AQYYG I +GTPPQ F V+FDTGS+NLW+PS HC +L+IAC
Sbjct: 7 LKNYMDAQYYGEIGIGTPPQCFTVVFDTGSANLWVPSIHCKLLDIAC 53
>gi|185132376|ref|NP_001118183.1| cathepsin D precursor [Oncorhynchus mykiss]
gi|1858020|gb|AAC60301.1| cathepsin D [Oncorhynchus mykiss]
Length = 398
Score = 98.6 bits (244), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 47/116 (40%), Positives = 67/116 (57%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYC--SNCQAIA 118
RN E GGE++ GG D + GD Y VSR+ YWQ ++ + + + C+AI
Sbjct: 223 RNPDSEPGGELLLGGTDPKYYSGDFQYLDVSRQAYWQIHMDGMGVGSQLSLCKGGCEAIV 282
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSLI GP+ + L + IGA PL G V+CD + MP + LGG++++LT
Sbjct: 283 DTGTSLITGPAAEVKALQRAIGATPLIQGEYMVNCDKIPTMPVITFNLGGQSYSLT 338
Score = 70.9 bits (172), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 30/47 (63%), Positives = 36/47 (76%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+++ AQYYG I LGTP Q F V+FDTGSSNLW+PS HCS +IAC
Sbjct: 70 LKNFMDAQYYGEIGLGTPVQTFTVVFDTGSSNLWVPSVHCSFTDIAC 116
>gi|311324976|gb|ADP89523.1| cathepsin D [Miichthys miiuy]
Length = 396
Score = 98.6 bits (244), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 46/116 (39%), Positives = 68/116 (58%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYC--SNCQAIA 118
RN + GGE++ GG D + GD Y ++R+ YWQ V+ + + + S C+AI
Sbjct: 221 RNPDTQPGGELLLGGTDPKYYSGDFHYVNITRQAYWQIHVDGMAVGSQLTLCKSGCEAIV 280
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSLI GPS + L K IGA+PL G V CD + +P + +GG++++LT
Sbjct: 281 DTGTSLITGPSAEVRSLQKAIGAIPLIQGEYMVSCDKIPSLPVITFNVGGQSYSLT 336
Score = 76.6 bits (187), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 32/47 (68%), Positives = 38/47 (80%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+++ AQYYG I LGTPPQ F V+FDTGSSNLW+PS HC IL+IAC
Sbjct: 68 LKNYLDAQYYGEIGLGTPPQLFTVVFDTGSSNLWVPSVHCQILDIAC 114
>gi|403305561|ref|XP_003943328.1| PREDICTED: cathepsin D [Saimiri boliviensis boliviensis]
Length = 522
Score = 98.6 bits (244), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 48/128 (37%), Positives = 78/128 (60%), Gaps = 3/128 (2%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKN 108
VD++ F + + R+ + GGE+M GG D + G ++Y V+RK YWQ ++ +++
Sbjct: 189 VDQNIFSFYLN-RDPDAQPGGELMLGGTDSKYYKGSLSYLNVTRKAYWQVHMDQVEVASG 247
Query: 109 V-YCSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
+ C C+AI DTGTSL++GP + EL K IGAVPL G + C+ + +P + + L
Sbjct: 248 LTLCKGGCEAIVDTGTSLMVGPVDEVRELQKAIGAVPLIQGEYMIPCEKVSTLPTITLKL 307
Query: 167 GGKNFTLT 174
GGK++ L+
Sbjct: 308 GGKDYKLS 315
Score = 73.6 bits (179), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 41/116 (35%), Positives = 62/116 (53%), Gaps = 13/116 (11%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHF 60
+++ AQYYG I +GTPPQ F V+FDTGSSNLW+PS HC +L+IAC + H +
Sbjct: 36 LKNYMDAQYYGEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIACWI-------HHKY 88
Query: 61 RNASD----ENGG--EIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVY 110
+A +NG +I +G ++ T R + +K+E+ V+
Sbjct: 89 NSAKSSTYVKNGTSFDIHYGSGSLSGYLSQDTVLVPCRPSSSASALGGVKVERQVF 144
>gi|5921651|gb|AAD56284.1|AF156788_1 pepsinogen A form IIb precursor [Pseudopleuronectes americanus]
Length = 376
Score = 98.6 bits (244), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 48/118 (40%), Positives = 72/118 (61%), Gaps = 2/118 (1%)
Query: 58 SHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-KNVYCSN-CQ 115
S + +++ + G E++FGG D + G I++ P++ YWQ ++S+ I + V CS CQ
Sbjct: 207 SVYLSSNAQQGSEVIFGGTDSSHYTGQISWIPLTSATYWQIKMDSVTINGQTVACSGGCQ 266
Query: 116 AIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
AI DTGTSLI+GP+ I+ +N +GA G A V C N+ MP+V L GK FT+
Sbjct: 267 AIIDTGTSLIVGPNSDISNMNSWVGASTNQYGEATVSCQNIQSMPDVTFTLNGKAFTV 324
Score = 63.9 bits (154), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 27/33 (81%), Positives = 29/33 (87%)
Query: 9 YYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCS 41
YYG IS+GTPPQ F VIFDTGSSNLWIPS +CS
Sbjct: 70 YYGVISIGTPPQSFSVIFDTGSSNLWIPSVYCS 102
>gi|61742102|gb|AAX54897.1| pepsinogen A1 precursor [Hippoglossus hippoglossus]
Length = 131
Score = 98.6 bits (244), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 47/110 (42%), Positives = 67/110 (60%), Gaps = 2/110 (1%)
Query: 66 ENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-KNVYCSN-CQAIADTGTS 123
+ G E++FGG D + G I++ P++ YWQ ++S+ I + V CS CQAI DTGTS
Sbjct: 9 QQGSEVIFGGTDSSHYTGQISWIPLTSATYWQIKMDSVTINGQTVACSGGCQAIIDTGTS 68
Query: 124 LIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
LI+GP+ I+ +N +GA G A V C N+ MP+V L GK FT+
Sbjct: 69 LIVGPNSDISNMNSWVGASTNQYGEATVSCQNIQSMPDVTFTLNGKAFTV 118
>gi|254596794|gb|ACT75642.1| pepsinogen A [Channa argus]
Length = 361
Score = 98.2 bits (243), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 49/121 (40%), Positives = 72/121 (59%), Gaps = 2/121 (1%)
Query: 58 SHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-KNVYCSN-CQ 115
S + +++ G E++FGGVD + + G I + P++ YWQ ++S+ I + V CS CQ
Sbjct: 191 SVYLSSNSAQGSEVVFGGVDSNHYTGQIAWIPLTSATYWQIKMDSVSINGQTVACSGGCQ 250
Query: 116 AIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLTR 175
AI DTGTSLI+GP+ I+ +N +GA G A V+C N+ MP V L G FT+
Sbjct: 251 AIIDTGTSLIVGPTSDISNINSWVGASTDQYGDATVNCQNIQSMPEVTFTLNGNAFTIPA 310
Query: 176 T 176
T
Sbjct: 311 T 311
Score = 61.6 bits (148), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 25/33 (75%), Positives = 29/33 (87%)
Query: 9 YYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCS 41
YYG IS+GTPPQ F VIFD+GSSNLW+PS +CS
Sbjct: 53 YYGVISIGTPPQSFSVIFDSGSSNLWVPSVYCS 85
>gi|395838962|ref|XP_003792373.1| PREDICTED: cathepsin E [Otolemur garnettii]
Length = 394
Score = 98.2 bits (243), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 55/130 (42%), Positives = 75/130 (57%), Gaps = 2/130 (1%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKN 108
VD+ F + N G E++FGG D F G + + PVS++ YWQ V+ I++ ++
Sbjct: 211 VDQPIFSIYMSSTNQKGGKGSELIFGGYDHSHFTGRLNWVPVSKQEYWQIKVDKIRVGRS 270
Query: 109 V-YCSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
V CS CQAI DTGTS I GPS I +L K I AV NG V C+ L MP+V I++
Sbjct: 271 VMLCSKGCQAIVDTGTSSITGPSDDIRQLQKAIRAVRRKNGDYTVRCNKLKVMPDVTIVI 330
Query: 167 GGKNFTLTRT 176
G ++TL T
Sbjct: 331 KGVSYTLKPT 340
Score = 51.2 bits (121), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 19/34 (55%), Positives = 25/34 (73%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHC 40
+Y+G IS+G+PPQ F V FDTGS + W+PS C
Sbjct: 76 VEYFGNISIGSPPQNFTVCFDTGSPDFWVPSAFC 109
>gi|2288908|emb|CAA71859.1| cathepsin E [Mus musculus]
Length = 397
Score = 98.2 bits (243), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 52/120 (43%), Positives = 73/120 (60%), Gaps = 6/120 (5%)
Query: 63 ASDENGG---EIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAI 117
+SD GG E+ FGG D F G + + PV+++ YWQ ++ I++ V +CS CQAI
Sbjct: 221 SSDPQGGSGSELTFGGYDPSHFSGSLNWIPVTKQAYWQIALDGIQVGDTVMFCSEGCQAI 280
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLTRTN 177
DTGTSLI GP I +L + IGA P+ +G VDC LD MPNV ++ ++TL T+
Sbjct: 281 VDTGTSLITGPPDKIKQLQEAIGATPI-DGEYAVDCATLDTMPNVTFLINEVSYTLNPTD 339
Score = 62.0 bits (149), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 25/34 (73%), Positives = 31/34 (91%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCS 41
+Y+GTIS+GTPPQ F VIFDTGSSNLW+PS +C+
Sbjct: 78 EYFGTISIGTPPQNFTVIFDTGSSNLWVPSVYCT 111
>gi|166235886|ref|NP_031825.2| cathepsin E preproprotein [Mus musculus]
gi|341940308|sp|P70269.2|CATE_MOUSE RecName: Full=Cathepsin E; Flags: Precursor
gi|5748654|emb|CAA08880.2| cathepsin E protein [Mus musculus]
gi|74146932|dbj|BAE25449.1| unnamed protein product [Mus musculus]
gi|74192082|dbj|BAE34257.1| unnamed protein product [Mus musculus]
gi|74219155|dbj|BAE26716.1| unnamed protein product [Mus musculus]
gi|74222421|dbj|BAE38113.1| unnamed protein product [Mus musculus]
gi|148707758|gb|EDL39705.1| cathepsin E [Mus musculus]
Length = 397
Score = 98.2 bits (243), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 52/120 (43%), Positives = 73/120 (60%), Gaps = 6/120 (5%)
Query: 63 ASDENGG---EIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAI 117
+SD GG E+ FGG D F G + + PV+++ YWQ ++ I++ V +CS CQAI
Sbjct: 221 SSDPQGGSGSELTFGGYDPSHFSGSLNWIPVTKQAYWQIALDGIQVGDTVMFCSEGCQAI 280
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLTRTN 177
DTGTSLI GP I +L + IGA P+ +G VDC LD MPNV ++ ++TL T+
Sbjct: 281 VDTGTSLITGPPDKIKQLQEAIGATPI-DGEYAVDCATLDTMPNVTFLINEVSYTLNPTD 339
Score = 62.0 bits (149), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 25/34 (73%), Positives = 31/34 (91%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCS 41
+Y+GTIS+GTPPQ F VIFDTGSSNLW+PS +C+
Sbjct: 78 EYFGTISIGTPPQNFTVIFDTGSSNLWVPSVYCT 111
>gi|197247086|gb|AAI65335.1| Nots protein [Danio rerio]
Length = 416
Score = 98.2 bits (243), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 49/128 (38%), Positives = 78/128 (60%), Gaps = 5/128 (3%)
Query: 52 DKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-Y 110
D+ V + N GGE++FGG+D+ +F+ I + PV++KGYWQ ++++K++ + +
Sbjct: 219 DQPVFSFYLTNNGSGFGGELVFGGMDESRFLPPINWIPVTQKGYWQIKLDAVKVQGALSF 278
Query: 111 C----SNCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
C CQAI DTGTSLI GP++ I L + IGA P ANG VDC + +P V ++
Sbjct: 279 CYRSVQGCQAIVDTGTSLIGGPARDILILQQFIGATPTANGEFVVDCVRVSSLPVVSFLI 338
Query: 167 GGKNFTLT 174
++L+
Sbjct: 339 NSVEYSLS 346
Score = 56.6 bits (135), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 26/48 (54%), Positives = 37/48 (77%), Gaps = 3/48 (6%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKF 54
AQ++G ISLG P Q F V+FDTGSS+LW+PS +C ++ AC++ +KF
Sbjct: 84 AQFFGQISLGRPEQNFTVVFDTGSSDLWVPSSYC--VSQACAL-HNKF 128
>gi|56269596|gb|AAH86835.1| Nots protein [Danio rerio]
Length = 443
Score = 98.2 bits (243), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 49/128 (38%), Positives = 78/128 (60%), Gaps = 5/128 (3%)
Query: 52 DKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-Y 110
D+ V + N GGE++FGG+D+ +F+ I + PV++KGYWQ ++++K++ + +
Sbjct: 246 DQPVFSFYLTNNGSGFGGELVFGGMDESRFLPPINWIPVTQKGYWQIKLDAVKVQGALSF 305
Query: 111 C----SNCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
C CQAI DTGTSLI GP++ I L + IGA P ANG VDC + +P V ++
Sbjct: 306 CYRSVQGCQAIVDTGTSLIGGPARDILILQQFIGATPTANGEFVVDCVRVSSLPVVSFLI 365
Query: 167 GGKNFTLT 174
++L+
Sbjct: 366 NSVEYSLS 373
Score = 56.6 bits (135), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 26/48 (54%), Positives = 37/48 (77%), Gaps = 3/48 (6%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKF 54
AQ++G ISLG P Q F V+FDTGSS+LW+PS +C ++ AC++ +KF
Sbjct: 111 AQFFGQISLGRPEQNFTVVFDTGSSDLWVPSSYC--VSQACAL-HNKF 155
>gi|410044668|ref|XP_001154405.3| PREDICTED: cathepsin D [Pan troglodytes]
Length = 322
Score = 98.2 bits (243), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 45/123 (36%), Positives = 71/123 (57%), Gaps = 2/123 (1%)
Query: 54 FVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCS 112
F+ + + GGE+M GG D + G ++Y V+RK YWQ ++ +++ + C
Sbjct: 138 FLSQQPLLDPDAQPGGELMLGGTDSKYYKGSLSYLNVTRKAYWQVHLDQVEVASGLTLCK 197
Query: 113 N-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNF 171
C+AI DTGTSL++GP + EL K IGAVPL G + C+ + +P + + LGGK +
Sbjct: 198 EGCEAIVDTGTSLMVGPVDEVRELQKAIGAVPLIQGEYMIPCEKVSTLPAITLKLGGKGY 257
Query: 172 TLT 174
L+
Sbjct: 258 KLS 260
>gi|123993743|gb|ABM84473.1| cathepsin D (lysosomal aspartyl peptidase) [synthetic construct]
Length = 412
Score = 97.8 bits (242), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 44/116 (37%), Positives = 69/116 (59%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYC--SNCQAIA 118
R+ + GGE+M GG D + G ++Y V+RK YWQ ++ +++ + C+AI
Sbjct: 235 RDPDAQPGGELMLGGTDSKYYKGSLSYLNVTRKAYWQVHLDQVEVASGLTLCKEGCEAIV 294
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSL++GP + EL K IGAVPL G + C+ + +P + + LGGK + L+
Sbjct: 295 DTGTSLMVGPVDEVRELQKAIGAVPLIQGEYMIPCEKVSTLPAITLKLGGKGYKLS 350
Score = 73.6 bits (179), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 30/47 (63%), Positives = 38/47 (80%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+++ AQYYG I +GTPPQ F V+FDTGSSNLW+PS HC +L+IAC
Sbjct: 71 LKNYMDAQYYGEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIAC 117
>gi|30584113|gb|AAP36305.1| Homo sapiens cathepsin D (lysosomal aspartyl protease) [synthetic
construct]
gi|60653917|gb|AAX29651.1| cathepsin D [synthetic construct]
Length = 413
Score = 97.8 bits (242), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 44/116 (37%), Positives = 69/116 (59%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYC--SNCQAIA 118
R+ + GGE+M GG D + G ++Y V+RK YWQ ++ +++ + C+AI
Sbjct: 235 RDPDAQPGGELMLGGTDSKYYKGSLSYLNVTRKAYWQVHLDQVEVASGLTLCKEGCEAIV 294
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSL++GP + EL K IGAVPL G + C+ + +P + + LGGK + L+
Sbjct: 295 DTGTSLMVGPVDEVRELQKAIGAVPLIQGEYMIPCEKVSTLPAITLKLGGKGYKLS 350
Score = 73.6 bits (179), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 30/47 (63%), Positives = 38/47 (80%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+++ AQYYG I +GTPPQ F V+FDTGSSNLW+PS HC +L+IAC
Sbjct: 71 LKNYMDAQYYGEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIAC 117
>gi|197099366|ref|NP_001125492.1| cathepsin D precursor [Pongo abelii]
gi|55728229|emb|CAH90861.1| hypothetical protein [Pongo abelii]
Length = 412
Score = 97.8 bits (242), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 45/116 (38%), Positives = 70/116 (60%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAIA 118
R+ + GGE+M GG D + G ++Y V+RK YWQ ++ +++ + C C+AI
Sbjct: 235 RDPDAQPGGELMLGGTDSKYYKGSLSYLNVTRKAYWQVHLDQVEVASGLTLCKEGCEAIV 294
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSL++GP + EL K IGAVPL G + C+ + +P + + LGGK + L+
Sbjct: 295 DTGTSLMVGPVDEVRELQKAIGAVPLIQGEYMIPCEKVSTLPAITLKLGGKGYKLS 350
Score = 74.3 bits (181), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 38/110 (34%), Positives = 60/110 (54%), Gaps = 1/110 (0%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHF 60
+++ AQYYG I +GTPPQ F V+FDTGSSNLW+PS HC +L+IAC + + K+
Sbjct: 71 LKNYMDAQYYGEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIACWIHR-KYNSDKSS 129
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVY 110
+ +I +G ++ T S + + +K+E+ V+
Sbjct: 130 TYVKNGTSFDIHYGSGSLSGYLSQDTVSVPCQSASSASALGGVKVERQVF 179
>gi|270124457|dbj|BAI52799.1| pepsinogen [Anguilla japonica]
Length = 375
Score = 97.8 bits (242), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 53/127 (41%), Positives = 76/127 (59%), Gaps = 5/127 (3%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-K 107
V +D F S + +++ + G E++FGGVD + G I + P+S + YWQ ++S+ I +
Sbjct: 200 VSQDLF---SVYLSSNSQAGSEVVFGGVDPSHYTGSINWIPLSSETYWQITMQSVTINGQ 256
Query: 108 NVYC-SNCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
V C S CQAI DTGTSLI+GPS I +N +GA G A V C N+ MP+V +
Sbjct: 257 TVACTSGCQAIVDTGTSLIVGPSSDIGNINYYVGASTNQYGDATVQCGNVGSMPDVTFNI 316
Query: 167 GGKNFTL 173
G +FTL
Sbjct: 317 NGVSFTL 323
Score = 64.3 bits (155), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 30/46 (65%), Positives = 37/46 (80%), Gaps = 3/46 (6%)
Query: 9 YYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKF 54
YYG IS+GTPPQ FKVIFD+GS+NLW+PS +CS + AC + DKF
Sbjct: 69 YYGVISIGTPPQSFKVIFDSGSANLWVPSVYCS--SQACQ-NHDKF 111
>gi|4503143|ref|NP_001900.1| cathepsin D preproprotein [Homo sapiens]
gi|115717|sp|P07339.1|CATD_HUMAN RecName: Full=Cathepsin D; Contains: RecName: Full=Cathepsin D
light chain; Contains: RecName: Full=Cathepsin D heavy
chain; Flags: Precursor
gi|29678|emb|CAA28955.1| cathepsin D [Homo sapiens]
gi|179948|gb|AAA51922.1| cathepsin D [Homo sapiens]
gi|181180|gb|AAB59529.1| preprocathepsin D [Homo sapiens]
gi|16740920|gb|AAH16320.1| Cathepsin D [Homo sapiens]
gi|30582659|gb|AAP35556.1| cathepsin D (lysosomal aspartyl protease) [Homo sapiens]
gi|48146011|emb|CAG33228.1| CTSD [Homo sapiens]
gi|54697170|gb|AAV38957.1| cathepsin D (lysosomal aspartyl protease) [Homo sapiens]
gi|61356567|gb|AAX41260.1| cathepsin D [synthetic construct]
gi|61362282|gb|AAX42193.1| cathepsin D [synthetic construct]
gi|119622866|gb|EAX02461.1| cathepsin D (lysosomal aspartyl peptidase), isoform CRA_a [Homo
sapiens]
gi|119622867|gb|EAX02462.1| cathepsin D (lysosomal aspartyl peptidase), isoform CRA_a [Homo
sapiens]
gi|119622868|gb|EAX02463.1| cathepsin D (lysosomal aspartyl peptidase), isoform CRA_a [Homo
sapiens]
gi|123994405|gb|ABM84804.1| cathepsin D (lysosomal aspartyl peptidase) [synthetic construct]
gi|261860344|dbj|BAI46694.1| cathepsin D [synthetic construct]
Length = 412
Score = 97.8 bits (242), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 44/116 (37%), Positives = 69/116 (59%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYC--SNCQAIA 118
R+ + GGE+M GG D + G ++Y V+RK YWQ ++ +++ + C+AI
Sbjct: 235 RDPDAQPGGELMLGGTDSKYYKGSLSYLNVTRKAYWQVHLDQVEVASGLTLCKEGCEAIV 294
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSL++GP + EL K IGAVPL G + C+ + +P + + LGGK + L+
Sbjct: 295 DTGTSLMVGPVDEVRELQKAIGAVPLIQGEYMIPCEKVSTLPAITLKLGGKGYKLS 350
Score = 73.6 bits (179), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 30/47 (63%), Positives = 38/47 (80%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+++ AQYYG I +GTPPQ F V+FDTGSSNLW+PS HC +L+IAC
Sbjct: 71 LKNYMDAQYYGEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIAC 117
>gi|187608619|ref|NP_001120469.1| cathepsin E precursor [Xenopus (Silurana) tropicalis]
gi|170284872|gb|AAI61297.1| LOC100145572 protein [Xenopus (Silurana) tropicalis]
Length = 397
Score = 97.8 bits (242), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 51/116 (43%), Positives = 73/116 (62%), Gaps = 3/116 (2%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKN-VYCS-NCQAIA 118
RN + GGE++FGG D +F G + + V+ +GYWQ +++I+I V+C+ CQAI
Sbjct: 217 RNPNSPVGGELVFGGFDASRFSGQLNWVSVTNQGYWQIQLDNIQINGEVVFCTGGCQAIV 276
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSLI GPS I +L +IGA ANG +VDC L++MP V + G + +T
Sbjct: 277 DTGTSLITGPSSDIVQLQSIIGA-SAANGDYEVDCSVLNEMPTVTFTINGIGYQMT 331
Score = 61.2 bits (147), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 30/62 (48%), Positives = 41/62 (66%), Gaps = 5/62 (8%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASDE 66
+Y+G IS+GTPPQ F VIFDTGSSNLW+PS +C ++ AC+ ++F F +
Sbjct: 72 VEYFGEISIGTPPQNFTVIFDTGSSNLWVPSVYC--ISPACA-QHNRF--QPQFSSTYQS 126
Query: 67 NG 68
NG
Sbjct: 127 NG 128
>gi|60820131|gb|AAX36524.1| cathepsin D [synthetic construct]
gi|61363243|gb|AAX42359.1| cathepsin D [synthetic construct]
Length = 412
Score = 97.8 bits (242), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 44/116 (37%), Positives = 69/116 (59%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYC--SNCQAIA 118
R+ + GGE+M GG D + G ++Y V+RK YWQ ++ +++ + C+AI
Sbjct: 235 RDPDAQPGGELMLGGTDSKYYKGSLSYLNVTRKAYWQVHLDQVEVASGLTLCKEGCEAIV 294
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSL++GP + EL K IGAVPL G + C+ + +P + + LGGK + L+
Sbjct: 295 DTGTSLMVGPVDEVRELQKAIGAVPLIEGEYMIPCEKVSTLPAITLKLGGKGYKLS 350
Score = 73.6 bits (179), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 30/47 (63%), Positives = 38/47 (80%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+++ AQYYG I +GTPPQ F V+FDTGSSNLW+PS HC +L+IAC
Sbjct: 71 LKNYMDAQYYGEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIAC 117
>gi|426366854|ref|XP_004050458.1| PREDICTED: cathepsin D [Gorilla gorilla gorilla]
Length = 412
Score = 97.8 bits (242), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 44/116 (37%), Positives = 69/116 (59%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYC--SNCQAIA 118
R+ + GGE+M GG D + G ++Y V+RK YWQ ++ +++ + C+AI
Sbjct: 235 RDPDAQPGGELMLGGTDSKYYKGSLSYLNVTRKAYWQVHLDQVEVASGLTLCKEGCEAIV 294
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSL++GP + EL K IGAVPL G + C+ + +P + + LGGK + L+
Sbjct: 295 DTGTSLMVGPVDEVRELQKAIGAVPLIQGEYMIPCEKVSTLPAITLKLGGKGYKLS 350
Score = 73.6 bits (179), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 30/47 (63%), Positives = 38/47 (80%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+++ AQYYG I +GTPPQ F V+FDTGSSNLW+PS HC +L+IAC
Sbjct: 71 LKNYMDAQYYGEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIAC 117
>gi|60654209|gb|AAX29797.1| cathepsin D [synthetic construct]
Length = 413
Score = 97.8 bits (242), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 44/116 (37%), Positives = 69/116 (59%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYC--SNCQAIA 118
R+ + GGE+M GG D + G ++Y V+RK YWQ ++ +++ + C+AI
Sbjct: 235 RDPDAQPGGELMLGGTDSKYYKGSLSYLNVTRKAYWQVHLDQVEVASGLTLCKEGCEAIV 294
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSL++GP + EL K IGAVPL G + C+ + +P + + LGGK + L+
Sbjct: 295 DTGTSLMVGPVDEVRELQKAIGAVPLIEGEYMIPCEKVSTLPAITLKLGGKGYKLS 350
Score = 73.6 bits (179), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 30/47 (63%), Positives = 38/47 (80%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+++ AQYYG I +GTPPQ F V+FDTGSSNLW+PS HC +L+IAC
Sbjct: 71 LKNYMDAQYYGEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIAC 117
>gi|332264729|ref|XP_003281384.1| PREDICTED: cathepsin D [Nomascus leucogenys]
Length = 412
Score = 97.8 bits (242), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 45/116 (38%), Positives = 70/116 (60%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAIA 118
R+ + GGE+M GG D + G ++Y V+RK YWQ ++ +++ + C C+AI
Sbjct: 235 RDPDAQPGGELMLGGTDSKYYKGSLSYLNVTRKAYWQVHLDQVEVASGLTLCKEGCEAIV 294
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSL++GP + EL K IGAVPL G + C+ + +P + + LGGK + L+
Sbjct: 295 DTGTSLMVGPVDEVRELQKAIGAVPLIQGEYMIPCEKVSTLPAITLKLGGKGYKLS 350
Score = 75.1 bits (183), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 42/115 (36%), Positives = 65/115 (56%), Gaps = 11/115 (9%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHF 60
+++ AQYYG I +GTPPQ F V+FDTGSSNLW+PS HC +L+IAC ++ H +
Sbjct: 71 LKNYMDAQYYGEIGIGTPPQCFTVVFDTGSSNLWVPSVHCKLLDIAC------WIHHKYN 124
Query: 61 RNASD---ENGG--EIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVY 110
+ S +NG +I +G ++ T S + + S+K+E+ V+
Sbjct: 125 SDKSSTYVKNGTSFDIHYGSGSLSGYLSQDTVSVPCQSASSASALGSVKVERQVF 179
>gi|158523297|gb|ABW70789.1| cathepsin D [Scophthalmus maximus]
Length = 396
Score = 97.8 bits (242), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 46/116 (39%), Positives = 65/116 (56%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN--CQAIA 118
RN GGE++ GG D + GD Y ++RK YWQ ++ + + + N C+ I
Sbjct: 221 RNPDTAPGGELLLGGTDPKYYTGDFNYINITRKAYWQIHMDGLAVGSQLTLCNGGCEVIV 280
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSLI GP+ + L K IGAVPL G V CD + +P + LGG+ ++LT
Sbjct: 281 DTGTSLITGPAAEVKALQKAIGAVPLIQGEYMVSCDKIPSLPVITFNLGGRGYSLT 336
Score = 77.4 bits (189), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 31/47 (65%), Positives = 40/47 (85%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+++ AQYYG I+LG+PPQ F V+FDTGSSNLW+PS HCS+L+IAC
Sbjct: 68 LKNFLDAQYYGDIALGSPPQTFSVVFDTGSSNLWVPSVHCSLLDIAC 114
>gi|170649686|gb|ACB21270.1| cathepsin D preproprotein (predicted) [Callicebus moloch]
Length = 412
Score = 97.8 bits (242), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 44/116 (37%), Positives = 70/116 (60%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYC--SNCQAIA 118
R+ + GGE+M GG D + G ++Y V+RK YWQ ++ +++ + C+AI
Sbjct: 235 RDPDAQPGGELMLGGTDSKYYKGSLSYLNVTRKAYWQVHMDQVEVASGLTLCKGGCEAIV 294
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSL++GP + EL K IGAVPL G + C+ + +P + + LGGK++ L+
Sbjct: 295 DTGTSLMVGPVDEVRELQKAIGAVPLIQGEYMIPCEKVSTLPAITLKLGGKDYRLS 350
Score = 73.6 bits (179), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 30/49 (61%), Positives = 39/49 (79%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSV 49
+++ AQYYG I +GTPPQ F V+FDTGSSNLW+PS HC +L+IAC +
Sbjct: 71 LKNYMDAQYYGEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIACWI 119
>gi|149707989|ref|XP_001491088.1| PREDICTED: cathepsin E [Equus caballus]
Length = 396
Score = 97.8 bits (242), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 52/119 (43%), Positives = 75/119 (63%), Gaps = 6/119 (5%)
Query: 63 ASDENGG---EIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAI 117
+SD GG E++FGG D F G + + PV+++GYWQ +++I++ V +CS CQAI
Sbjct: 220 SSDPEGGAGSELIFGGYDHSHFSGSLNWVPVTKQGYWQIALDAIQVGGTVMFCSQGCQAI 279
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLTRT 176
DTGTSLI GP I +L + IGA P+ +G V+C NL+ MP+V + G +TL T
Sbjct: 280 VDTGTSLITGPPDKIKQLQEAIGAQPM-DGEYAVECVNLNVMPDVTFTINGVPYTLQPT 337
Score = 61.2 bits (147), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 24/34 (70%), Positives = 31/34 (91%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCS 41
+Y+GTIS+G+PPQ F VIFDTGSSNLW+PS +C+
Sbjct: 77 EYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCT 110
>gi|449280808|gb|EMC88033.1| Cathepsin D, partial [Columba livia]
Length = 387
Score = 97.8 bits (242), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 46/116 (39%), Positives = 69/116 (59%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYC--SNCQAIA 118
R+ S + GGE++ GG D + GD ++ V+RK YWQ ++++ + + C+AI
Sbjct: 212 RDPSAQPGGELLLGGTDPKYYSGDFSWVNVTRKAYWQVHMDAVDVANGLTLCKGGCEAIV 271
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSLI GP+K + EL IGA PL G + CD + +P + + LGGK + LT
Sbjct: 272 DTGTSLITGPTKEVKELQTAIGAKPLIKGQYVIPCDKVSSLPVITLTLGGKPYQLT 327
Score = 75.5 bits (184), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 33/67 (49%), Positives = 46/67 (68%), Gaps = 2/67 (2%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDK--DKFVGHS 58
+++ AQYYG I +GTPPQ+F V+FDTGSSNLW+PS HC +L+IAC + D +
Sbjct: 59 LKNYMDAQYYGEIGIGTPPQKFTVVFDTGSSNLWVPSVHCHLLDIACLLHHKYDSSKSST 118
Query: 59 HFRNASD 65
+ N +D
Sbjct: 119 YVENGTD 125
>gi|397490270|ref|XP_003816129.1| PREDICTED: cathepsin D [Pan paniscus]
Length = 603
Score = 97.8 bits (242), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 45/116 (38%), Positives = 70/116 (60%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAIA 118
R+ + GGE+M GG D + G ++Y V+RK YWQ ++ +++ + C C+AI
Sbjct: 235 RDPDAQPGGELMLGGTDSKYYKGSLSYLNVTRKAYWQVHLDQVEVASGLTLCKEGCEAIV 294
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSL++GP + EL K IGAVPL G + C+ + +P + + LGGK + L+
Sbjct: 295 DTGTSLMVGPVDEVRELQKAIGAVPLIQGEYMIPCEKVSTLPAITLKLGGKGYKLS 350
Score = 73.6 bits (179), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 30/47 (63%), Positives = 38/47 (80%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+++ AQYYG I +GTPPQ F V+FDTGSSNLW+PS HC +L+IAC
Sbjct: 71 LKNYMDAQYYGEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIAC 117
>gi|194760707|ref|XP_001962579.1| GF14369 [Drosophila ananassae]
gi|190616276|gb|EDV31800.1| GF14369 [Drosophila ananassae]
Length = 371
Score = 97.8 bits (242), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 45/113 (39%), Positives = 66/113 (58%), Gaps = 5/113 (4%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSNCQAIADT 120
R+ + GGE++FGG D + GD+TY P+S +GYWQF ++ + C +CQAIADT
Sbjct: 212 RDGTSSQGGELIFGGSDSSLYSGDLTYVPISEQGYWQFTMDGSSFDGYTLCEDCQAIADT 271
Query: 121 GTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
GTSLI+ P + L++L+ VDC + +P+V +GG NF L
Sbjct: 272 GTSLIVAPYEAYITLSELLNV-----DDGYVDCSIVSSLPDVIFNIGGSNFAL 319
Score = 59.3 bits (142), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 24/41 (58%), Positives = 31/41 (75%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCS 41
+ +S YYG IS+GTP Q FKV+FD+GSSNLW+PS C+
Sbjct: 61 LANSLNMAYYGAISIGTPAQSFKVLFDSGSSNLWVPSNTCT 101
>gi|118344558|ref|NP_001072052.1| cathepsin D1 precursor [Takifugu rubripes]
gi|55771082|dbj|BAD69801.1| cathepsin D1 [Takifugu rubripes]
Length = 396
Score = 97.8 bits (242), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 47/116 (40%), Positives = 69/116 (59%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YC-SNCQAIA 118
RN + GGE++ GG D + GD Y V+R+ YWQ ++ + + + C S C+AI
Sbjct: 221 RNPDTQPGGELLLGGTDPKYYTGDFDYVNVTRQAYWQIHMDGMSVGSQLSLCKSGCEAIV 280
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSL+ GPS+ + L K IGA+PL G V CD + +P + +GGK F+L+
Sbjct: 281 DTGTSLLTGPSEEVKALQKAIGAMPLIQGEYMVSCDKIPSLPVITFNIGGKPFSLS 336
Score = 78.2 bits (191), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 32/47 (68%), Positives = 39/47 (82%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+++ AQYYG I LGTPPQ F V+FDTGSSNLW+PS HCS+L+IAC
Sbjct: 68 LKNYLDAQYYGEIGLGTPPQPFTVVFDTGSSNLWVPSVHCSLLDIAC 114
>gi|162423776|gb|ABX89618.1| pepsinogen [Sparus aurata]
Length = 376
Score = 97.4 bits (241), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 49/118 (41%), Positives = 71/118 (60%), Gaps = 2/118 (1%)
Query: 58 SHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-KNVYCSN-CQ 115
S + + + E G E++FGG D + + G IT+ P+S Y+Q ++S+ I + V CS CQ
Sbjct: 206 SVYLSGNSEQGSEVVFGGTDSNHYTGQITWIPLSSATYYQISMDSVTINGQTVACSGGCQ 265
Query: 116 AIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
AI DTGTSLI+GP+ I +N +GA G A V+C N+ MP+V L G FT+
Sbjct: 266 AIIDTGTSLIVGPTSDINNMNSWVGASTNEYGEATVNCQNIQSMPDVTFTLNGHAFTV 323
Score = 63.2 bits (152), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 26/33 (78%), Positives = 29/33 (87%)
Query: 9 YYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCS 41
YYG IS+GTPPQ F VIFDTGSSNLW+PS +CS
Sbjct: 69 YYGVISIGTPPQSFTVIFDTGSSNLWVPSVYCS 101
>gi|1657354|emb|CAA66056.1| procathepsin E [Mus musculus]
gi|13529380|gb|AAH05432.1| Cathepsin E [Mus musculus]
gi|71059833|emb|CAJ18460.1| Ctse [Mus musculus]
Length = 397
Score = 97.4 bits (241), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 52/120 (43%), Positives = 72/120 (60%), Gaps = 6/120 (5%)
Query: 63 ASDENGG---EIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAI 117
+SD GG E+ FGG D F G + + PV+++ YWQ ++ I++ V +CS CQAI
Sbjct: 221 SSDPQGGSGSELTFGGYDPSHFSGSLNWIPVTKQAYWQIALDGIQVGDTVMFCSEGCQAI 280
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLTRTN 177
DTGTSLI GP I L + IGA P+ +G VDC LD MPNV ++ ++TL T+
Sbjct: 281 VDTGTSLITGPPDKIKHLQEAIGATPI-DGEYAVDCATLDTMPNVTFLINEVSYTLNPTD 339
Score = 62.0 bits (149), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 25/34 (73%), Positives = 31/34 (91%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCS 41
+Y+GTIS+GTPPQ F VIFDTGSSNLW+PS +C+
Sbjct: 78 EYFGTISIGTPPQNFTVIFDTGSSNLWVPSVYCT 111
>gi|195399277|ref|XP_002058247.1| GJ15982 [Drosophila virilis]
gi|194150671|gb|EDW66355.1| GJ15982 [Drosophila virilis]
Length = 374
Score = 97.4 bits (241), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 46/113 (40%), Positives = 67/113 (59%), Gaps = 4/113 (3%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSNCQAIADT 120
R+ + GGE++FGG D + GD+TY P+S +GYWQF + I+ C NCQAIADT
Sbjct: 214 RDGTSSQGGELIFGGSDSSLYSGDLTYVPISEQGYWQFTMAGASIDGQSLCDNCQAIADT 273
Query: 121 GTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
GTSL++ LN ++ + VDC +DK+P +++ +GG FTL
Sbjct: 274 GTSLLVVSEAAYDILNNVLN----VDENGLVDCSTVDKLPVLNLNIGGGKFTL 322
Score = 67.0 bits (162), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 29/57 (50%), Positives = 38/57 (66%), Gaps = 4/57 (7%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC----SVDKDK 53
+ +S YYG I++GTPPQ FKV+FD+GSSNLW+PS C ++AC D DK
Sbjct: 61 LSNSINMAYYGAITIGTPPQSFKVLFDSGSSNLWVPSSTCWFFDVACMNHNQYDHDK 117
>gi|147905812|ref|NP_001079036.1| gastricsin precursor [Xenopus laevis]
gi|12082174|dbj|BAB20797.1| pepsinogen C [Xenopus laevis]
gi|213625030|gb|AAI69665.1| Pepsinogen C [Xenopus laevis]
gi|213626584|gb|AAI69663.1| Pepsinogen C [Xenopus laevis]
Length = 383
Score = 97.4 bits (241), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 46/118 (38%), Positives = 68/118 (57%), Gaps = 3/118 (2%)
Query: 59 HFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCSN-CQ 115
+ S +NGGE+ FGGVD++ + G I ++PV+ + YWQ G++ I + +CS CQ
Sbjct: 207 YLSGQSSQNGGEVAFGGVDQNYYTGQIYWTPVTSETYWQIGIQGFSINGQATGWCSQGCQ 266
Query: 116 AIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
AI DTGTSL+ P V + L + IGA NG V C N+ +P + + G +F L
Sbjct: 267 AIVDTGTSLLTAPQSVFSSLIQSIGAQQDQNGQYVVSCSNIQNLPTISFTISGVSFPL 324
Score = 56.6 bits (135), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 23/32 (71%), Positives = 27/32 (84%)
Query: 9 YYGTISLGTPPQEFKVIFDTGSSNLWIPSQHC 40
YYG IS+GTPPQ F V+FDTGSSNLW+ S +C
Sbjct: 67 YYGEISIGTPPQNFLVLFDTGSSNLWVASTYC 98
>gi|13928928|ref|NP_113858.1| napsin A aspartic peptidase precursor [Rattus norvegicus]
gi|6689137|emb|CAB65392.1| napsin [Rattus norvegicus]
gi|51260062|gb|AAH78790.1| Napsin A aspartic peptidase [Rattus norvegicus]
gi|149056039|gb|EDM07470.1| napsin A aspartic peptidase, isoform CRA_a [Rattus norvegicus]
Length = 420
Score = 97.4 bits (241), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 48/116 (41%), Positives = 68/116 (58%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEK--NVYCSNCQAIA 118
R++ +GGE++ GG D D +V +T+ PV+ YWQ ++S+K+ N+ C AI
Sbjct: 219 RDSEGSDGGELVLGGSDPDHYVPPLTFIPVTIPAYWQVHMQSVKVGTGLNLCAQGCGAIL 278
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSLI GPS+ I LNK +G PL G + C + ++P V LGG F LT
Sbjct: 279 DTGTSLITGPSEEIRALNKAVGGFPLLTGQYLIQCSKIPELPTVSFSLGGVWFNLT 334
Score = 66.2 bits (160), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 26/40 (65%), Positives = 31/40 (77%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
QY+G I LGTPPQ F V+FDTGSSNLW+PS C ++AC
Sbjct: 73 QYFGDIGLGTPPQNFTVVFDTGSSNLWVPSTRCHFFSLAC 112
>gi|395534115|ref|XP_003769093.1| PREDICTED: gastricsin-like [Sarcophilus harrisii]
Length = 392
Score = 97.4 bits (241), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 47/115 (40%), Positives = 66/115 (57%), Gaps = 3/115 (2%)
Query: 62 NASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCSN-CQAIA 118
N + NGGE+ FGGVD F GDI ++PV+++ YWQ + I + +CS CQA+
Sbjct: 219 NENSNNGGEVTFGGVDTSMFTGDIYWAPVTQEAYWQIAINGFSIGGQATGWCSEGCQAVV 278
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
DTGTSL+ P ++ +EL + IGA NG V C N+ M + + G NF L
Sbjct: 279 DTGTSLLTAPQQIFSELMQYIGAQQNENGAYLVSCSNVQNMSTITFNINGVNFPL 333
Score = 55.8 bits (133), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 23/32 (71%), Positives = 27/32 (84%)
Query: 9 YYGTISLGTPPQEFKVIFDTGSSNLWIPSQHC 40
YYG IS+GTPPQ F V+FDTGSSNLW+ S +C
Sbjct: 75 YYGEISIGTPPQNFLVLFDTGSSNLWVSSIYC 106
>gi|19921120|ref|NP_609458.1| CG17134 [Drosophila melanogaster]
gi|7297766|gb|AAF53016.1| CG17134 [Drosophila melanogaster]
gi|17944939|gb|AAL48533.1| RE02351p [Drosophila melanogaster]
gi|220947772|gb|ACL86429.1| CG17134-PA [synthetic construct]
gi|220957078|gb|ACL91082.1| CG17134-PA [synthetic construct]
Length = 391
Score = 97.4 bits (241), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 48/113 (42%), Positives = 67/113 (59%), Gaps = 1/113 (0%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSNCQAIADT 120
R + GGE++ GG+D + G +TY PVS YWQF V +IK + C+ CQAIADT
Sbjct: 221 RQGTAVRGGELILGGIDSSLYRGSLTYVPVSVPAYWQFKVNTIKTNGTLLCNGCQAIADT 280
Query: 121 GTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
GTSLI P ++N+ +GA +G A V C + +P V++ +GG FTL
Sbjct: 281 GTSLIAVPLAAYRKINRQLGATD-NDGEAFVRCGRVSSLPKVNLNIGGTVFTL 332
Score = 62.4 bits (150), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 24/47 (51%), Positives = 33/47 (70%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+ +S +YYG I++GTP Q F ++FDTGS+NLW+PS C N AC
Sbjct: 68 LHNSMNNEYYGVIAIGTPEQRFNILFDTGSANLWVPSASCPASNTAC 114
>gi|46397366|sp|P14091.2|CATE_HUMAN RecName: Full=Cathepsin E; Contains: RecName: Full=Cathepsin E form
I; Contains: RecName: Full=Cathepsin E form II; Flags:
Precursor
Length = 401
Score = 97.4 bits (241), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 49/111 (44%), Positives = 71/111 (63%), Gaps = 3/111 (2%)
Query: 68 GGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAIADTGTSLI 125
G E++FGG D F G + + PV+++ YWQ +++I++ V +CS CQAI DTGTSLI
Sbjct: 233 GSELIFGGYDHSHFSGSLNWVPVTKQAYWQIALDNIQVGGTVMFCSEGCQAIVDTGTSLI 292
Query: 126 IGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLTRT 176
GPS I +L IGA P+ +G V+C NL+ MP+V + G +TL+ T
Sbjct: 293 TGPSDKIKQLQNAIGAAPV-DGEYAVECANLNVMPDVTFTINGVPYTLSPT 342
Score = 61.2 bits (147), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 29/54 (53%), Positives = 38/54 (70%), Gaps = 9/54 (16%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFR 61
+Y+GTIS+G+PPQ F VIFDTGSSNLW+PS +C+ + AC HS F+
Sbjct: 77 EYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCT--SPACKT-------HSRFQ 121
>gi|332247693|ref|XP_003272996.1| PREDICTED: cathepsin E [Nomascus leucogenys]
Length = 396
Score = 97.4 bits (241), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 50/117 (42%), Positives = 72/117 (61%), Gaps = 3/117 (2%)
Query: 62 NASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAIAD 119
N G E++FGG D F G + + PV+++ YWQ +++I++ V +CS CQAI D
Sbjct: 222 NPEGGAGSELIFGGYDHSHFSGSLNWVPVTKQAYWQIALDNIQVGGTVMFCSEGCQAIVD 281
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLTRT 176
TGTSLI GPS I +L IGA P+ +G V+C NL+ MP+V + G +TL+ T
Sbjct: 282 TGTSLITGPSDKIKQLQNTIGAAPV-DGEYAVECANLNVMPDVTFTINGVPYTLSPT 337
Score = 60.8 bits (146), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 24/34 (70%), Positives = 31/34 (91%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCS 41
+Y+GTIS+G+PPQ F VIFDTGSSNLW+PS +C+
Sbjct: 77 EYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCT 110
>gi|60816208|gb|AAX36374.1| cathepsin E [synthetic construct]
Length = 396
Score = 97.4 bits (241), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 50/117 (42%), Positives = 72/117 (61%), Gaps = 3/117 (2%)
Query: 62 NASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAIAD 119
N G E++FGG D F G + + PV+++ YWQ +++I++ V +CS CQAI D
Sbjct: 222 NPEGGAGSELIFGGYDHSHFSGSLNWVPVTKQAYWQIALDNIQVGGTVMFCSEGCQAIVD 281
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLTRT 176
TGTSLI GPS I +L IGA P+ +G V+C NL+ MP+V + G +TL+ T
Sbjct: 282 TGTSLITGPSDKIKQLQNAIGAAPV-DGEYAVECANLNVMPDVTFTINGVPYTLSPT 337
Score = 61.2 bits (147), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 29/54 (53%), Positives = 38/54 (70%), Gaps = 9/54 (16%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFR 61
+Y+GTIS+G+PPQ F VIFDTGSSNLW+PS +C+ + AC HS F+
Sbjct: 77 EYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCT--SPACKT-------HSRFQ 121
>gi|73535294|pdb|1TZS|A Chain A, Crystal Structure Of An Activation Intermediate Of
Cathepsin E
Length = 351
Score = 97.4 bits (241), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 50/117 (42%), Positives = 72/117 (61%), Gaps = 3/117 (2%)
Query: 62 NASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAIAD 119
N G E++FGG D F G + + PV+++ YWQ +++I++ V +CS CQAI D
Sbjct: 169 NPEGGAGSELIFGGYDHSHFSGSLNWVPVTKQAYWQIALDNIQVGGTVMFCSEGCQAIVD 228
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLTRT 176
TGTSLI GPS I +L IGA P+ +G V+C NL+ MP+V + G +TL+ T
Sbjct: 229 TGTSLITGPSDKIKQLQNAIGAAPV-DGEYAVECANLNVMPDVTFTINGVPYTLSPT 284
Score = 60.8 bits (146), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 29/54 (53%), Positives = 38/54 (70%), Gaps = 9/54 (16%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFR 61
+Y+GTIS+G+PPQ F VIFDTGSSNLW+PS +C+ + AC HS F+
Sbjct: 24 EYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCT--SPACKT-------HSRFQ 68
>gi|410974821|ref|XP_003993838.1| PREDICTED: cathepsin D [Felis catus]
Length = 418
Score = 97.4 bits (241), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 50/140 (35%), Positives = 80/140 (57%), Gaps = 7/140 (5%)
Query: 42 ILNIACSVDKDKFVGHSHF-----RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYW 96
+L + ++ K K V + F R+ + + GGE+M GG D + G ++Y V+RK YW
Sbjct: 217 VLPVFDNLMKQKLVEKNIFSFYLNRDPNAQPGGELMLGGTDSKYYKGPLSYLNVTRKAYW 276
Query: 97 QFGVESIKIEKNVYC--SNCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCD 154
Q ++ + + ++ C+AI DTGTSL++GP + EL K IGAVPL G + C+
Sbjct: 277 QVHMDQVDVGTSLTLCKGGCEAILDTGTSLMVGPVDEVRELQKAIGAVPLIQGEYMIPCE 336
Query: 155 NLDKMPNVDIILGGKNFTLT 174
+ +P V + LGGK + L+
Sbjct: 337 KVSTLPEVTVKLGGKGYKLS 356
Score = 72.8 bits (177), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 29/47 (61%), Positives = 38/47 (80%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+++ AQYYG I +GTPPQ F V+FDTGS+NLW+PS HC +L+IAC
Sbjct: 81 LKNYLDAQYYGEIGIGTPPQCFTVVFDTGSANLWVPSIHCKLLDIAC 127
>gi|4503145|ref|NP_001901.1| cathepsin E isoform a preproprotein [Homo sapiens]
gi|114572172|ref|XP_001163151.1| PREDICTED: cathepsin E isoform 2 [Pan troglodytes]
gi|181194|gb|AAA52130.1| cathepsin E precursor [Homo sapiens]
gi|181205|gb|AAA52300.1| cathepsin E [Homo sapiens]
gi|7339520|emb|CAB82850.1| procathepsin E [Homo sapiens]
gi|27502799|gb|AAH42537.1| Cathepsin E [Homo sapiens]
gi|61358295|gb|AAX41543.1| cathepsin E [synthetic construct]
gi|119611998|gb|EAW91592.1| cathepsin E, isoform CRA_a [Homo sapiens]
gi|158257546|dbj|BAF84746.1| unnamed protein product [Homo sapiens]
gi|325463731|gb|ADZ15636.1| cathepsin E [synthetic construct]
Length = 396
Score = 97.4 bits (241), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 50/117 (42%), Positives = 72/117 (61%), Gaps = 3/117 (2%)
Query: 62 NASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAIAD 119
N G E++FGG D F G + + PV+++ YWQ +++I++ V +CS CQAI D
Sbjct: 222 NPEGGAGSELIFGGYDHSHFSGSLNWVPVTKQAYWQIALDNIQVGGTVMFCSEGCQAIVD 281
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLTRT 176
TGTSLI GPS I +L IGA P+ +G V+C NL+ MP+V + G +TL+ T
Sbjct: 282 TGTSLITGPSDKIKQLQNAIGAAPV-DGEYAVECANLNVMPDVTFTINGVPYTLSPT 337
Score = 61.2 bits (147), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 29/54 (53%), Positives = 38/54 (70%), Gaps = 9/54 (16%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFR 61
+Y+GTIS+G+PPQ F VIFDTGSSNLW+PS +C+ + AC HS F+
Sbjct: 77 EYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCT--SPACKT-------HSRFQ 121
>gi|397504824|ref|XP_003822980.1| PREDICTED: cathepsin E [Pan paniscus]
Length = 396
Score = 97.1 bits (240), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 50/117 (42%), Positives = 72/117 (61%), Gaps = 3/117 (2%)
Query: 62 NASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAIAD 119
N G E++FGG D F G + + PV+++ YWQ +++I++ V +CS CQAI D
Sbjct: 222 NPEGGAGSELIFGGYDHSHFSGSLNWVPVTKQAYWQIALDNIQVGGTVMFCSEGCQAIVD 281
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLTRT 176
TGTSLI GPS I +L IGA P+ +G V+C NL+ MP+V + G +TL+ T
Sbjct: 282 TGTSLITGPSDKIKQLQNAIGAAPV-DGEYAVECANLNVMPDVTFTINGVPYTLSPT 337
Score = 61.2 bits (147), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 29/54 (53%), Positives = 38/54 (70%), Gaps = 9/54 (16%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFR 61
+Y+GTIS+G+PPQ F VIFDTGSSNLW+PS +C+ + AC HS F+
Sbjct: 77 EYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCT--SPACKT-------HSRFQ 121
>gi|12697815|dbj|BAB21620.1| cathepsin D [Bos taurus]
Length = 386
Score = 97.1 bits (240), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 49/128 (38%), Positives = 75/128 (58%), Gaps = 7/128 (5%)
Query: 49 VDKDKFVGHSHF--RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE 106
VDK+ F S F R+ + GGE+M GG D + G + + V+R+ YWQ ++ + +
Sbjct: 198 VDKNVF---SFFLNRDPKAQPGGELMLGGTDSKYYRGSLMFHNVTRQAYWQIHMDQLDVG 254
Query: 107 KNVYC--SNCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDI 164
++ C+AI DTGTSLI+GP + + EL K IGAVPL G + C+ + +P V +
Sbjct: 255 SSLTVCKGGCEAIVDTGTSLIVGPVEEVRELQKAIGAVPLIQGEYMIPCEKVSSLPQVTV 314
Query: 165 ILGGKNFT 172
LGGK++
Sbjct: 315 KLGGKDYA 322
Score = 72.4 bits (176), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 29/47 (61%), Positives = 38/47 (80%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+++ AQYYG I +GTPPQ F V+FDTGS+NLW+PS HC +L+IAC
Sbjct: 47 LKNYMDAQYYGEIGIGTPPQCFTVVFDTGSANLWVPSIHCKLLDIAC 93
>gi|126309851|ref|XP_001370482.1| PREDICTED: gastricsin-like [Monodelphis domestica]
Length = 390
Score = 97.1 bits (240), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 47/115 (40%), Positives = 66/115 (57%), Gaps = 3/115 (2%)
Query: 62 NASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCSN-CQAIA 118
N + NGGE++FGGVD + GDI ++PV+ + YWQ + I + +CS CQAI
Sbjct: 217 NENSNNGGEVVFGGVDTSMYTGDIYWAPVTEEAYWQIAINGFSIGGQATGWCSGGCQAIV 276
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
DTGTSL+ P ++ +EL + IGA NG V C N MP + + G +F L
Sbjct: 277 DTGTSLLTAPQQIFSELMQYIGAQQDENGSYLVSCSNTQSMPTITFNINGVDFPL 331
Score = 58.9 bits (141), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 24/32 (75%), Positives = 28/32 (87%)
Query: 9 YYGTISLGTPPQEFKVIFDTGSSNLWIPSQHC 40
YYG IS+GTPPQ F V+FDTGSSNLW+PS +C
Sbjct: 73 YYGEISIGTPPQNFLVLFDTGSSNLWVPSIYC 104
>gi|444731560|gb|ELW71913.1| Cathepsin D [Tupaia chinensis]
Length = 684
Score = 97.1 bits (240), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 49/127 (38%), Positives = 77/127 (60%), Gaps = 3/127 (2%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKN 108
V+K+ F + + R+ S + GGE+M GGVD + G + Y V+RK YWQ ++ +++
Sbjct: 256 VEKNIFAFYLN-RDPSGQPGGELMLGGVDTKYYTGSLDYYNVTRKAYWQIHMDKLEVGDG 314
Query: 109 V-YCSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
+ C C+ I DTGTSLI+GP + EL+K +GAVPL G + C+ + +P + I L
Sbjct: 315 LTLCQEGCEVIVDTGTSLIVGPVDEVRELHKAMGAVPLIQGEYMIPCEKVASLPQITIRL 374
Query: 167 GGKNFTL 173
G K++ L
Sbjct: 375 GNKDYHL 381
Score = 70.9 bits (172), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 29/47 (61%), Positives = 37/47 (78%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+++ AQYYG I +GTPPQ F VIFDTGS+NLW+PS HC +L+ AC
Sbjct: 73 LKNYMDAQYYGEIGIGTPPQCFTVIFDTGSANLWVPSIHCGMLDFAC 119
>gi|126309845|ref|XP_001370435.1| PREDICTED: gastricsin-like [Monodelphis domestica]
Length = 390
Score = 97.1 bits (240), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 47/115 (40%), Positives = 66/115 (57%), Gaps = 3/115 (2%)
Query: 62 NASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCSN-CQAIA 118
N + NGGE++FGGVD + GDI ++PV+ + YWQ + I + +CS CQAI
Sbjct: 217 NENSNNGGEVVFGGVDTSMYTGDIYWAPVTEEAYWQIAINGFSIGGQATGWCSGGCQAIV 276
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
DTGTSL+ P ++ +EL + IGA NG V C N MP + + G +F L
Sbjct: 277 DTGTSLLTAPQQIFSELMQYIGAQQDENGSYLVSCSNTQSMPTITFNINGVDFPL 331
Score = 55.8 bits (133), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 23/32 (71%), Positives = 27/32 (84%)
Query: 9 YYGTISLGTPPQEFKVIFDTGSSNLWIPSQHC 40
YYG IS+GTPPQ F V+FDTGSSNLW+ S +C
Sbjct: 73 YYGEISIGTPPQNFLVLFDTGSSNLWVASIYC 104
>gi|407280288|pdb|1LYA|B Chain B, Crystal Structures Of Native And Inhibited Forms Of Human
Cathepsin D: Implications For Lysosomal Targeting And
Drug Design
gi|407280290|pdb|1LYA|D Chain D, Crystal Structures Of Native And Inhibited Forms Of Human
Cathepsin D: Implications For Lysosomal Targeting And
Drug Design
gi|425684912|pdb|1LYW|B Chain B, Cathepsin D At Ph 7.5
gi|425684913|pdb|1LYW|D Chain D, Cathepsin D At Ph 7.5
gi|425684914|pdb|1LYW|F Chain F, Cathepsin D At Ph 7.5
gi|425684915|pdb|1LYW|H Chain H, Cathepsin D At Ph 7.5
gi|449112963|pdb|1LYB|B Chain B, Crystal Structures Of Native And Inhibited Forms Of Human
Cathepsin D: Implications For Lysosomal Targeting And
Drug Design
gi|449112965|pdb|1LYB|D Chain D, Crystal Structures Of Native And Inhibited Forms Of Human
Cathepsin D: Implications For Lysosomal Targeting And
Drug Design
Length = 241
Score = 97.1 bits (240), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 48/141 (34%), Positives = 80/141 (56%), Gaps = 7/141 (4%)
Query: 41 SILNIACSVDKDKFVGHSHF-----RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGY 95
++L + ++ + K V + F R+ + GGE+M GG D + G ++Y V+RK Y
Sbjct: 41 NVLPVFDNLMQQKLVDQNIFSFYLSRDPDAQPGGELMLGGTDSKYYKGSLSYLNVTRKAY 100
Query: 96 WQFGVESIKIEKNVYC--SNCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDC 153
WQ ++ +++ + C+AI DTGTSL++GP + EL K IGAVPL G + C
Sbjct: 101 WQVHLDQVEVASGLTLCKEGCEAIVDTGTSLMVGPVDEVRELQKAIGAVPLIQGEYMIPC 160
Query: 154 DNLDKMPNVDIILGGKNFTLT 174
+ + +P + + LGGK + L+
Sbjct: 161 EKVSTLPAITLKLGGKGYKLS 181
>gi|126309849|ref|XP_001370462.1| PREDICTED: gastricsin-like [Monodelphis domestica]
Length = 390
Score = 97.1 bits (240), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 47/115 (40%), Positives = 66/115 (57%), Gaps = 3/115 (2%)
Query: 62 NASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCSN-CQAIA 118
N + NGGE++FGGVD + GDI ++PV+ + YWQ + I + +CS CQAI
Sbjct: 217 NENSNNGGEVVFGGVDTSMYTGDIYWAPVTEEAYWQIAINGFSIGGQATGWCSGGCQAIV 276
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
DTGTSL+ P ++ +EL + IGA NG V C N MP + + G +F L
Sbjct: 277 DTGTSLLTAPQQIFSELMQYIGAQQDENGSYLVSCSNTQSMPTITFNINGVDFPL 331
Score = 58.9 bits (141), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 24/32 (75%), Positives = 28/32 (87%)
Query: 9 YYGTISLGTPPQEFKVIFDTGSSNLWIPSQHC 40
YYG IS+GTPPQ F V+FDTGSSNLW+PS +C
Sbjct: 73 YYGEISIGTPPQNFLVLFDTGSSNLWVPSIYC 104
>gi|403294878|ref|XP_003938389.1| PREDICTED: cathepsin E [Saimiri boliviensis boliviensis]
Length = 396
Score = 97.1 bits (240), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 49/111 (44%), Positives = 71/111 (63%), Gaps = 3/111 (2%)
Query: 68 GGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAIADTGTSLI 125
G E++FGG D F G + + PV+++ YWQ +++I++ V +CS CQAI DTGTSLI
Sbjct: 228 GSELIFGGYDHSHFSGSLNWVPVTKQAYWQIALDNIQVGGTVMFCSEGCQAIVDTGTSLI 287
Query: 126 IGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLTRT 176
GPS I +L IGA P+ +G V+C NL+ MP+V + G +TL+ T
Sbjct: 288 TGPSDKIKQLQNAIGAAPV-DGEYAVECANLNVMPDVTFTINGVPYTLSPT 337
Score = 60.8 bits (146), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 24/34 (70%), Positives = 31/34 (91%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCS 41
+Y+GTIS+G+PPQ F VIFDTGSSNLW+PS +C+
Sbjct: 77 EYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCT 110
>gi|256072901|ref|XP_002572772.1| cathepsin D (A01 family) [Schistosoma mansoni]
gi|360043052|emb|CCD78464.1| cathepsin D (A01 family) [Schistosoma mansoni]
Length = 428
Score = 97.1 bits (240), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 47/118 (39%), Positives = 70/118 (59%), Gaps = 1/118 (0%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN-CQAIAD 119
RN S GGE+M GG+DK + G+I Y ++ + YW F ++ + I C + C AIAD
Sbjct: 211 RNISAVLGGELMIGGIDKKYYSGEINYVDLTEQSYWLFKMDKLTISDMTACPDGCLAIAD 270
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLTRTN 177
TGTS+I GP+ I ++N +GA L G V C N++ +P +D ++ GK TL T+
Sbjct: 271 TGTSMIAGPTDEIQKINAKLGATRLPGGIYTVSCGNINNLPTIDFVINGKAMTLEPTD 328
Score = 74.3 bits (181), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 29/51 (56%), Positives = 41/51 (80%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDK 51
+++ AQYYG I++GTPPQ F V+FDTGSSNLW+PS++CS +IAC + +
Sbjct: 58 LKNYLDAQYYGDITIGTPPQTFSVVFDTGSSNLWVPSKYCSYFDIACLLHR 108
>gi|1778026|gb|AAB63442.1| aspartic proteinase [Schistosoma mansoni]
Length = 427
Score = 97.1 bits (240), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 47/118 (39%), Positives = 70/118 (59%), Gaps = 1/118 (0%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN-CQAIAD 119
RN S GGE+M GG+DK + G+I Y ++ + YW F ++ + I C + C AIAD
Sbjct: 210 RNISAVLGGELMIGGIDKKYYSGEINYVDLTEQSYWLFKMDKLTISDMTACPDGCLAIAD 269
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLTRTN 177
TGTS+I GP+ I ++N +GA L G V C N++ +P +D ++ GK TL T+
Sbjct: 270 TGTSMIAGPTDEIQKINAKLGATRLPGGIYTVSCGNINNLPTIDFVINGKAMTLEPTD 327
Score = 74.3 bits (181), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 29/51 (56%), Positives = 41/51 (80%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDK 51
+++ AQYYG I++GTPPQ F V+FDTGSSNLW+PS++CS +IAC + +
Sbjct: 57 LKNYLDAQYYGDITIGTPPQTFSVVFDTGSSNLWVPSKYCSYFDIACLLHR 107
>gi|256072903|ref|XP_002572773.1| cathepsin D (A01 family) [Schistosoma mansoni]
gi|360043053|emb|CCD78465.1| cathepsin D (A01 family) [Schistosoma mansoni]
Length = 430
Score = 96.7 bits (239), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 47/118 (39%), Positives = 70/118 (59%), Gaps = 1/118 (0%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN-CQAIAD 119
RN S GGE+M GG+DK + G+I Y ++ + YW F ++ + I C + C AIAD
Sbjct: 213 RNISAVLGGELMIGGIDKKYYSGEINYVDLTEQSYWLFKMDKLTISDMTACPDGCLAIAD 272
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLTRTN 177
TGTS+I GP+ I ++N +GA L G V C N++ +P +D ++ GK TL T+
Sbjct: 273 TGTSMIAGPTDEIQKINAKLGATRLPGGIYTVSCGNINNLPTIDFVINGKAMTLEPTD 330
Score = 74.3 bits (181), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 29/51 (56%), Positives = 41/51 (80%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDK 51
+++ AQYYG I++GTPPQ F V+FDTGSSNLW+PS++CS +IAC + +
Sbjct: 60 LKNYLDAQYYGDITIGTPPQTFSVVFDTGSSNLWVPSKYCSYFDIACLLHR 110
>gi|342675479|gb|AEL31665.1| cathepsin D [Cynoglossus semilaevis]
Length = 396
Score = 96.7 bits (239), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 44/116 (37%), Positives = 67/116 (57%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYC--SNCQAIA 118
RN GGE++ GG D + G+ Y V+R+ YWQ ++ + + + CQAI
Sbjct: 221 RNPDTAPGGELLLGGTDPTYYTGEFNYVNVTRQAYWQVSMDELAVGSQLTLCKGGCQAIV 280
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSL+ GPS + L K IGA+PL G V+CD + +P + +GG++++LT
Sbjct: 281 DTGTSLLTGPSAEVKALQKAIGAIPLIQGEYMVNCDKIPSLPVITFKMGGQSYSLT 336
Score = 79.3 bits (194), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 33/51 (64%), Positives = 42/51 (82%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDK 51
+++ AQYYG I+LGTPPQ F V+FDTGSSNLW+PS HCS+L+IAC + K
Sbjct: 68 LKNYLDAQYYGDITLGTPPQTFSVVFDTGSSNLWVPSIHCSLLDIACLLHK 118
>gi|331215715|ref|XP_003320537.1| saccharopepsin [Puccinia graminis f. sp. tritici CRL 75-36-700-3]
gi|309299527|gb|EFP76118.1| saccharopepsin [Puccinia graminis f. sp. tritici CRL 75-36-700-3]
Length = 430
Score = 96.7 bits (239), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 48/114 (42%), Positives = 70/114 (61%), Gaps = 1/114 (0%)
Query: 62 NASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEK-NVYCSNCQAIADT 120
N +D NGGE++FGGVD+ + G+I Y+PV R+GYW+ ++S+K K + N A DT
Sbjct: 261 NKADPNGGEVVFGGVDEAHYEGEIFYAPVRRRGYWEVELKSVKFGKEEMKLHNVGAAIDT 320
Query: 121 GTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
GTSLI P+ +N IGA +G VDC + ++P++ GGK FT+T
Sbjct: 321 GTSLIALPTDTAEIINAEIGATKSWSGQYTVDCSRIPELPDLTFNFGGKEFTIT 374
Score = 64.3 bits (155), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 28/41 (68%), Positives = 33/41 (80%), Gaps = 2/41 (4%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQY+ ISLGTPPQ FKV+ DTGSSNLW+PS C+ +IAC
Sbjct: 112 AQYFSEISLGTPPQSFKVVLDTGSSNLWVPSTRCT--SIAC 150
>gi|49019802|emb|CAD80095.2| pepsin A1 [Trematomus bernacchii]
Length = 375
Score = 96.7 bits (239), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 49/128 (38%), Positives = 75/128 (58%), Gaps = 5/128 (3%)
Query: 51 KDKFVGH---SHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE- 106
K++ V S + +++ G E++FGG+D + G+I + P+S YWQ ++S+ I
Sbjct: 196 KERLVSQPLFSVYLSSNSAQGSEVVFGGIDSSHYTGEIAWIPLSSATYWQINMDSVTING 255
Query: 107 KNVYCSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDII 165
+ V CS CQAI DTGTSLI+GP+ I+ +N +GA G A V+C ++ MP V
Sbjct: 256 QTVACSGGCQAIIDTGTSLIVGPNSDISSMNSRVGASTNQYGEATVNCRDIQSMPEVTFT 315
Query: 166 LGGKNFTL 173
L G FT+
Sbjct: 316 LNGHAFTI 323
Score = 63.5 bits (153), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 27/33 (81%), Positives = 29/33 (87%)
Query: 9 YYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCS 41
YYG IS+GTPPQ F VIFDTGSSNLWIPS +CS
Sbjct: 69 YYGVISIGTPPQSFSVIFDTGSSNLWIPSVYCS 101
>gi|166361871|gb|ABY87034.1| pepsinogen A1 [Epinephelus coioides]
gi|166361875|gb|ABY87036.1| pepsinogen A1 [Epinephelus coioides]
Length = 376
Score = 96.7 bits (239), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 46/118 (38%), Positives = 71/118 (60%), Gaps = 2/118 (1%)
Query: 58 SHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-KNVYCSN-CQ 115
S + ++ E G E++FGG+D + G +T+ P++ YWQ ++ +KI + V C+ CQ
Sbjct: 206 SVYLSSHGEQGSEVVFGGIDSSHYTGQVTWVPLTSATYWQIKMDGVKINGQTVACAGGCQ 265
Query: 116 AIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
AI DTGTSLI+GP+ I +N +GA G + V+C N+ MP V L G +FT+
Sbjct: 266 AIIDTGTSLIVGPTNDINNMNSWVGASTNQYGESTVNCQNVGSMPEVTFTLNGHDFTI 323
Score = 62.8 bits (151), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 26/33 (78%), Positives = 29/33 (87%)
Query: 9 YYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCS 41
YYG IS+GTPPQ F VIFDTGSSNLW+PS +CS
Sbjct: 69 YYGVISIGTPPQSFSVIFDTGSSNLWVPSVYCS 101
>gi|112950081|gb|ABI26643.1| aspartic proteinase [Cucumis sativus]
Length = 399
Score = 96.7 bits (239), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 45/113 (39%), Positives = 68/113 (60%), Gaps = 1/113 (0%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSNCQAIADT 120
R+ + N GE++FGG D + G I Y PV+++GYWQF ++ + + S+ QAIADT
Sbjct: 214 RSKAANNNGEVIFGGSDSTVYQGPINYVPVTQQGYWQFTMDGVYVNGQQVISSAQAIADT 273
Query: 121 GTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
GTSL+ P+ LN+ IGA G VDC ++ +PN+ +GG N++L
Sbjct: 274 GTSLLAAPTSAFYTLNEAIGAT-YQEGDYFVDCSSVSSLPNIQFSIGGINYSL 325
Score = 63.9 bits (154), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 26/41 (63%), Positives = 33/41 (80%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
++YYGTI +GTP QEF V+FD+GSSNLW+PS CS + AC
Sbjct: 70 SEYYGTIGIGTPAQEFTVVFDSGSSNLWVPSAKCSSSDQAC 110
>gi|194854124|ref|XP_001968293.1| GG24794 [Drosophila erecta]
gi|190660160|gb|EDV57352.1| GG24794 [Drosophila erecta]
Length = 412
Score = 96.7 bits (239), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 47/117 (40%), Positives = 73/117 (62%), Gaps = 5/117 (4%)
Query: 61 RNASDE-NGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSNCQAIAD 119
RN +D +GGE++ GG D + G +TY PVS GYWQF + S + +C NC+AI D
Sbjct: 243 RNGTDATHGGELILGGTDSGLYSGCLTYVPVSSAGYWQFTMTSATVNGFQFCENCEAILD 302
Query: 120 TGTSLIIGPSKVIAELNKLIG---AVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
GTSLI+ P V+A +N+++G A P +NG +V+C + ++P+V + + + F L
Sbjct: 303 VGTSLIVVPEPVLAAINQILGVLDATP-SNGVYQVNCSTVSELPDVVLTIARRKFPL 358
Score = 49.7 bits (117), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 18/40 (45%), Positives = 26/40 (65%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHC 40
+ + Y +YY T+ G PPQ +V+ DTGS+NLW+ S C
Sbjct: 84 LENRYNTEYYTTLGFGNPPQAMQVLIDTGSANLWVLSSKC 123
>gi|194762106|ref|XP_001963199.1| GF19728 [Drosophila ananassae]
gi|190616896|gb|EDV32420.1| GF19728 [Drosophila ananassae]
Length = 390
Score = 96.3 bits (238), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 46/108 (42%), Positives = 65/108 (60%)
Query: 67 NGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSNCQAIADTGTSLII 126
GGE++ GG+D + G +T+ PV++ YWQF + +IK + N QAIADTGTSLI+
Sbjct: 226 QGGEMILGGIDSSLYKGSLTWVPVTKAAYWQFKLTAIKTKGVFISRNTQAIADTGTSLIV 285
Query: 127 GPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
P +N LIGA G A V C + +PNV++ +G + FTLT
Sbjct: 286 LPKAAYTRINNLIGAEDNGEGEAFVRCGRVSALPNVNLHIGDRFFTLT 333
Score = 63.5 bits (153), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 29/67 (43%), Positives = 39/67 (58%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHF 60
+ S +YYG +S+GTP Q F ++FDTGS+NLW+PS CS N AC G S
Sbjct: 67 LHDSADREYYGLLSIGTPKQNFNILFDTGSANLWVPSAKCSASNKACQKHNKYHSGESST 126
Query: 61 RNASDEN 67
A+ E+
Sbjct: 127 YVANGES 133
>gi|326933881|ref|XP_003213026.1| PREDICTED: gastricsin-like [Meleagris gallopavo]
Length = 389
Score = 96.3 bits (238), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 44/110 (40%), Positives = 71/110 (64%), Gaps = 3/110 (2%)
Query: 67 NGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCSN-CQAIADTGTS 123
GGE++FGGV+ + + G I+++PV++ YWQ G+E + + + +CS CQAI DTGTS
Sbjct: 221 QGGELIFGGVNPNLYTGQISWTPVTQTTYWQIGIEDFTVGGQSSGWCSQGCQAIVDTGTS 280
Query: 124 LIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
L+ P++V +EL + IGA +NG C N++ MP + ++ G +F L
Sbjct: 281 LLTVPNQVFSELMQYIGAQADSNGQYVASCSNIEYMPTLTFVISGTSFPL 330
Score = 58.5 bits (140), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 25/40 (62%), Positives = 30/40 (75%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHC 40
+ +S YYG IS+GTPPQ F V+FDTGSSNLW+PS C
Sbjct: 65 LANSMDMSYYGEISIGTPPQNFLVLFDTGSSNLWVPSTLC 104
>gi|355329701|dbj|BAL14144.1| pepsinogen 1 [Pagrus major]
Length = 369
Score = 96.3 bits (238), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 50/127 (39%), Positives = 78/127 (61%), Gaps = 6/127 (4%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKN 108
V++D F S + +++ + G + FGGVD + + G IT+ P+S + YWQ V+S+ +
Sbjct: 194 VNQDLF---SVYLSSNSQQGSVVTFGGVDPNYYTGSITWIPLSNELYWQITVDSVTVNGQ 250
Query: 109 VYCSN--CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
V N CQAI DTGTSLI+GP I+ +N +GA NG V+C+++ +MP+V +
Sbjct: 251 VVACNGGCQAIVDTGTSLIVGPQSSISNINSYVGASS-QNGDYTVNCNSIAQMPDVTFYI 309
Query: 167 GGKNFTL 173
G+ FTL
Sbjct: 310 HGQQFTL 316
Score = 67.4 bits (163), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 33/46 (71%), Positives = 38/46 (82%), Gaps = 3/46 (6%)
Query: 9 YYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKF 54
YYG IS+GTPPQ FKVIFDTGSSNLWIPS +CS + AC+ + DKF
Sbjct: 63 YYGIISIGTPPQSFKVIFDTGSSNLWIPSIYCS--SAACN-NHDKF 105
>gi|195134382|ref|XP_002011616.1| GI11126 [Drosophila mojavensis]
gi|193906739|gb|EDW05606.1| GI11126 [Drosophila mojavensis]
Length = 421
Score = 96.3 bits (238), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 42/111 (37%), Positives = 67/111 (60%), Gaps = 4/111 (3%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSNCQAIADT 120
RN + + GGE++FGG D + G +TY+ ++++ YWQF ++S + V C+NC AIADT
Sbjct: 225 RNGTSQQGGELIFGGSDPSLYKGSMTYADITQQNYWQFNMDSATLNGQVLCTNCAAIADT 284
Query: 121 GTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNF 171
GTSL++ P+ + ++ ++G N +DC N MP +GGK F
Sbjct: 285 GTSLLVAPTDIYNKIKVVLG----VNTDDTIDCSNTSNMPTFLFTIGGKVF 331
Score = 60.5 bits (145), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 23/34 (67%), Positives = 29/34 (85%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCS 41
+YYG I++GTPPQ F V+FDTGSSNLW+PS C+
Sbjct: 81 EYYGIITIGTPPQTFNVLFDTGSSNLWVPSIQCA 114
>gi|54020914|ref|NP_001005701.1| napsin A aspartic peptidase precursor [Xenopus (Silurana)
tropicalis]
gi|49522956|gb|AAH75272.1| cathepsin D (lysosomal aspartyl protease) [Xenopus (Silurana)
tropicalis]
Length = 402
Score = 96.3 bits (238), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 49/116 (42%), Positives = 66/116 (56%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKN-VYCSN-CQAIA 118
R+ GGE++ GG D + + GD Y V+R YWQ + +++ V C CQAI
Sbjct: 212 RDPMAMVGGELVLGGTDPNYYTGDFHYLNVTRMAYWQIKADEVRVANQLVLCKGGCQAIV 271
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSLI GP + I L+K IGA PL +G V+C + +P V ILGG + LT
Sbjct: 272 DTGTSLITGPREEIRALHKAIGAFPLFSGEYFVNCKRIQSLPTVSFILGGVAYNLT 327
Score = 70.9 bits (172), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 30/45 (66%), Positives = 35/45 (77%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDK 51
AQYYG I +GTPPQ+F VIFDTGSSNLW+PS CS + AC + K
Sbjct: 65 AQYYGEIFIGTPPQKFAVIFDTGSSNLWVPSIKCSFFDFACWLHK 109
>gi|60678793|gb|AAX33731.1| Blo t allergen [Blomia tropicalis]
Length = 402
Score = 96.3 bits (238), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 48/122 (39%), Positives = 75/122 (61%), Gaps = 8/122 (6%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEK--------NVYCS 112
R+ + + GGEI+FGG D + GD TY+P+++ GYWQF + I +E +V S
Sbjct: 215 RDTNGKVGGEIIFGGSDPAYYKGDFTYAPLTKIGYWQFQMHGILLENKSNNKTVGHVCES 274
Query: 113 NCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFT 172
C+AIADTGTSLI GPS + LN+ +GA+ NG ++C +++ +P++ + G F
Sbjct: 275 GCEAIADTGTSLIAGPSDQVEHLNRALGAIGPLNGIFVLNCSHINTLPSIIFQINGVKFP 334
Query: 173 LT 174
L+
Sbjct: 335 LS 336
Score = 69.3 bits (168), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 28/41 (68%), Positives = 33/41 (80%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQYYG I +G+PPQ F VIFDTGSSNLW+PS+ C N+AC
Sbjct: 68 AQYYGEIQIGSPPQPFNVIFDTGSSNLWVPSKKCKFTNLAC 108
>gi|297711897|ref|XP_002832550.1| PREDICTED: cathepsin E-like, partial [Pongo abelii]
Length = 175
Score = 96.3 bits (238), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 50/117 (42%), Positives = 72/117 (61%), Gaps = 3/117 (2%)
Query: 62 NASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAIAD 119
N G E++FGG D F G + + PV+++ YWQ +++I++ V +CS CQAI D
Sbjct: 1 NPEGGAGSELIFGGYDHSHFSGSLNWVPVTKQAYWQIALDNIQVGGTVMFCSEGCQAIVD 60
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLTRT 176
TGTSLI GPS I +L IGA P+ +G V+C NL+ MP+V + G +TL+ T
Sbjct: 61 TGTSLITGPSDKIKQLQNAIGAAPV-DGEYAVECANLNVMPDVTFTINGVPYTLSPT 116
>gi|426333516|ref|XP_004028322.1| PREDICTED: cathepsin E isoform 1 [Gorilla gorilla gorilla]
Length = 396
Score = 95.9 bits (237), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 49/117 (41%), Positives = 72/117 (61%), Gaps = 3/117 (2%)
Query: 62 NASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAIAD 119
N G E++FGG D F G + + PV+++ YWQ +++I++ V +CS CQAI D
Sbjct: 222 NPEGGAGSELIFGGYDHSHFSGSLNWVPVTKQAYWQIALDNIQVGGTVMFCSEGCQAIVD 281
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLTRT 176
TGTSLI GPS I +L IG+ P+ +G V+C NL+ MP+V + G +TL+ T
Sbjct: 282 TGTSLITGPSDKIKQLQNAIGSAPV-DGEYAVECANLNVMPDVTFTINGVPYTLSPT 337
Score = 61.2 bits (147), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 29/54 (53%), Positives = 38/54 (70%), Gaps = 9/54 (16%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFR 61
+Y+GTIS+G+PPQ F VIFDTGSSNLW+PS +C+ + AC HS F+
Sbjct: 77 EYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCT--SPACKT-------HSRFQ 121
>gi|74220823|dbj|BAE31380.1| unnamed protein product [Mus musculus]
Length = 404
Score = 95.9 bits (237), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 50/125 (40%), Positives = 74/125 (59%), Gaps = 5/125 (4%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKN 108
VDK+ F + + R+ + GGE+M GG D + G+++Y V+RK YW + E
Sbjct: 222 VDKNIFSFYLN-RDPEGQPGGELMLGGTDSKYYHGELSYLNVTRKAYWL----EVGNELT 276
Query: 109 VYCSNCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGG 168
+ C+AI DTGTSL++GP + + EL K IGAVPL G + C+ + +P V + LGG
Sbjct: 277 LCKGGCEAIVDTGTSLLVGPVEEVKELQKAIGAVPLIQGEYMIPCEKVSSLPTVYLKLGG 336
Query: 169 KNFTL 173
KN+ L
Sbjct: 337 KNYEL 341
Score = 75.9 bits (185), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 45/115 (39%), Positives = 65/115 (56%), Gaps = 13/115 (11%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHF 60
+++ AQYYG I +GTPPQ F V+FDTGSSNLW+PS HC IL+IAC +V H +
Sbjct: 71 LKNYLDAQYYGDIGIGTPPQCFTVVFDTGSSNLWVPSIHCKILDIAC------WVHHKYN 124
Query: 61 RNASD---ENGG--EIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVY 110
+ S +NG +I +G +++ T S + Q IK+EK ++
Sbjct: 125 SDKSSTYVKNGTSFDIHYGSGSLSRYLSQDTVSVPCKSD--QSKARGIKVEKQIF 177
>gi|46395759|sp|Q800A0.1|CATE_RANCA RecName: Full=Cathepsin E; Flags: Precursor
gi|29647357|dbj|BAC75398.1| cathepsin E [Rana catesbeiana]
Length = 397
Score = 95.9 bits (237), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 50/116 (43%), Positives = 72/116 (62%), Gaps = 3/116 (2%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAIA 118
R+ + +GGE++ GG D +F G + + P++ +GYWQ V+SI++ V +CS+ CQAI
Sbjct: 217 RDPNSADGGELVLGGFDTSRFSGQLNWVPITVQGYWQIQVDSIQVAGQVIFCSDGCQAIV 276
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSLI GPS I +L IG V NG V C L MP+V + G +++LT
Sbjct: 277 DTGTSLITGPSGDIEQLQNYIG-VTNTNGEYGVSCSTLSLMPSVTFTINGLDYSLT 331
Score = 61.2 bits (147), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 24/35 (68%), Positives = 31/35 (88%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCS 41
+Y+G IS+GTPPQ+F VIFDTGSSNLW+PS +C+
Sbjct: 72 VEYFGQISIGTPPQQFTVIFDTGSSNLWVPSIYCT 106
>gi|195433871|ref|XP_002064930.1| GK15195 [Drosophila willistoni]
gi|194161015|gb|EDW75916.1| GK15195 [Drosophila willistoni]
Length = 431
Score = 95.9 bits (237), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 47/115 (40%), Positives = 70/115 (60%), Gaps = 4/115 (3%)
Query: 63 ASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKN--VYCSNCQAIADT 120
+S NGG+++ GG D + G +TY P+S+ GYWQF V SI + N CS C+AI D
Sbjct: 249 SSASNGGQLILGGTDPTLYSGCLTYVPLSQVGYWQFTVTSIVLGSNGKSLCSQCEAILDV 308
Query: 121 GTSLIIGPSKVIAELNKLIG--AVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
GTSLI+ PS +A +N+ +G A L NG +DC+ +P++ + + K+F L
Sbjct: 309 GTSLIVAPSAALATINQQLGITAANLLNGIYTIDCNKTSTLPDLILTIARKDFVL 363
Score = 45.4 bits (106), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 17/38 (44%), Positives = 27/38 (71%), Gaps = 2/38 (5%)
Query: 4 SYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCS 41
+Y +Y G I++G+ Q FK++ DT S+NLW+PS C+
Sbjct: 93 AYNTEYSGNITIGS--QSFKILLDTASANLWVPSSQCT 128
>gi|317419142|emb|CBN81179.1| Pepsinogen 1 [Dicentrarchus labrax]
Length = 378
Score = 95.9 bits (237), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 50/127 (39%), Positives = 78/127 (61%), Gaps = 6/127 (4%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKN 108
V++D F S + +++ + G + FGGVD + + G IT+ P+S + YWQ V+S+ +
Sbjct: 203 VNQDLF---SVYLSSNSQQGSVVTFGGVDSNHYHGSITWIPLSNELYWQITVDSVTVNGQ 259
Query: 109 VYCSN--CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
V N CQAI DTGTSL++GP I+ +N ++GA NG V+C+N+ +MP V +
Sbjct: 260 VVACNGGCQAIVDTGTSLVVGPQSSISNINSMVGASS-QNGDYVVNCNNIAQMPAVTFHI 318
Query: 167 GGKNFTL 173
G+ FTL
Sbjct: 319 HGQEFTL 325
Score = 64.3 bits (155), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 30/46 (65%), Positives = 38/46 (82%), Gaps = 3/46 (6%)
Query: 9 YYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKF 54
YYG IS+GTPPQ FKV+FDTGSSNLW+PS +C+ + AC+ + DKF
Sbjct: 72 YYGIISIGTPPQSFKVVFDTGSSNLWVPSIYCN--SPACN-NHDKF 114
>gi|326523981|dbj|BAJ97001.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 428
Score = 95.9 bits (237), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 51/131 (38%), Positives = 69/131 (52%), Gaps = 2/131 (1%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--E 106
+DK F N ENGGE++FG +DKD + GDI Y+PV RKGYW+ + I E
Sbjct: 235 IDKPLFTFWLGDTNKDAENGGELVFGEIDKDHYEGDIVYAPVVRKGYWEVKFNELLINDE 294
Query: 107 KNVYCSNCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
+ N A DTGTSLI P++ +N ++GA G +DC LD +P +
Sbjct: 295 PADFLGNATAAIDTGTSLIACPTEAAETINTMLGATKNFLGQWTLDCATLDSLPTLTFTF 354
Query: 167 GGKNFTLTRTN 177
GG F L T+
Sbjct: 355 GGHKFPLAPTD 365
Score = 61.6 bits (148), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 27/45 (60%), Positives = 33/45 (73%), Gaps = 2/45 (4%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDK 51
AQYY I +GTPPQ F V+ DTGSSNLW+PS CS +IAC + +
Sbjct: 101 AQYYAEIGIGTPPQPFGVVMDTGSSNLWVPSTRCS--SIACWLHR 143
>gi|390477486|ref|XP_003735302.1| PREDICTED: cathepsin E isoform 2 [Callithrix jacchus]
Length = 401
Score = 95.9 bits (237), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 48/111 (43%), Positives = 70/111 (63%), Gaps = 3/111 (2%)
Query: 68 GGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKN-VYCSN-CQAIADTGTSLI 125
G E++FGG D F G + + PV+++ YWQ ++ I++ ++CS CQAI DTGTSLI
Sbjct: 233 GSELIFGGYDHSHFSGSLNWVPVTKQAYWQIALDDIQVGGTAMFCSEGCQAIVDTGTSLI 292
Query: 126 IGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLTRT 176
GPS I +L IGA P+ +G V+C NL+ MP+V + G +TL+ T
Sbjct: 293 TGPSDKIKQLQNAIGAAPV-DGEYAVECANLNVMPDVTFTINGVPYTLSPT 342
Score = 61.2 bits (147), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 24/34 (70%), Positives = 31/34 (91%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCS 41
+Y+GTIS+G+PPQ F VIFDTGSSNLW+PS +C+
Sbjct: 77 EYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCT 110
>gi|340373429|ref|XP_003385244.1| PREDICTED: cathepsin D-like [Amphimedon queenslandica]
Length = 382
Score = 95.9 bits (237), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 51/119 (42%), Positives = 70/119 (58%), Gaps = 5/119 (4%)
Query: 60 FRNASDENG---GEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN-CQ 115
F DENG GE+ GG D + I Y P+S K YWQF ++ IK+ CSN CQ
Sbjct: 207 FYLDRDENGTLGGELALGGTDPSHYKAPINYVPLSDKTYWQFKLDKIKVGGTTLCSNGCQ 266
Query: 116 AIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
AIADTGTSL++GPS + ++ K IGA +G +DC N+ +P V ++GG + L+
Sbjct: 267 AIADTGTSLLVGPSVDVQKIMKEIGAKN-TDGVYMIDCGNMSNLPTVSFVIGGAQYLLS 324
Score = 64.3 bits (155), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 30/52 (57%), Positives = 34/52 (65%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHS 58
A+YYG I++GTPPQ F VIFDTGSSNLWIPS C + AC HS
Sbjct: 64 AEYYGNITIGTPPQNFLVIFDTGSSNLWIPSSKCDPKDKACQTHHQYNHDHS 115
>gi|335955136|gb|AEH76574.1| pepsinogen [Epinephelus bruneus]
Length = 375
Score = 95.9 bits (237), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 46/118 (38%), Positives = 71/118 (60%), Gaps = 2/118 (1%)
Query: 58 SHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-KNVYCSN-CQ 115
S + ++ + G E++FGG+D + G +T+ P++ YWQ ++ +KI + V C+ CQ
Sbjct: 206 SVYLSSHGDQGSEVVFGGIDNSHYTGQVTWVPLTSATYWQIKMDGVKINGQTVACAGGCQ 265
Query: 116 AIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
AI DTGTSLI+GP+ I +N +GA G + V+C N+ MP V L G +FTL
Sbjct: 266 AIIDTGTSLIVGPTNDINNMNSWVGASTNQYGESTVNCQNVGSMPEVTFTLNGHDFTL 323
Score = 61.2 bits (147), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 25/32 (78%), Positives = 28/32 (87%)
Query: 9 YYGTISLGTPPQEFKVIFDTGSSNLWIPSQHC 40
YYG IS+GTPPQ F VIFDTGSSNLW+PS +C
Sbjct: 69 YYGVISIGTPPQSFTVIFDTGSSNLWVPSVYC 100
>gi|407726061|dbj|BAM46128.1| pepsinogen C [Cynops pyrrhogaster]
Length = 383
Score = 95.9 bits (237), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 44/118 (37%), Positives = 69/118 (58%), Gaps = 3/118 (2%)
Query: 59 HFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCSN-CQ 115
+ ++GGE++FGGVD + + G IT++PV+++ YWQ G+ + + +C CQ
Sbjct: 208 YLGQQGSQSGGELVFGGVDSNYYTGQITWTPVTQQMYWQIGISGFGVNGQPTGWCGQGCQ 267
Query: 116 AIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
I DTGTSL+ P + IA L + IGA +NG V C N+D +P + +GG + L
Sbjct: 268 GIVDTGTSLLTAPGQYIAALMQEIGATQDSNGEYVVSCSNIDSLPTLSFTIGGTSLPL 325
Score = 58.2 bits (139), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 24/33 (72%), Positives = 28/33 (84%)
Query: 9 YYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCS 41
YYG IS+GTPPQ F V+FDTGSSNLW+ S +CS
Sbjct: 68 YYGEISIGTPPQNFLVLFDTGSSNLWVASTYCS 100
>gi|338712318|ref|XP_001501960.2| PREDICTED: pepsin II-1-like [Equus caballus]
Length = 397
Score = 95.9 bits (237), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 49/128 (38%), Positives = 77/128 (60%), Gaps = 5/128 (3%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-K 107
V +D F S + +++DE+G +MFGG+D + G + + PVS +GYWQ V+SI + +
Sbjct: 217 VSQDLF---SVYLSSNDESGSVVMFGGIDSSYYTGSLHWVPVSHEGYWQITVDSITVNGE 273
Query: 108 NVYCSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
++ CS CQA+ DTGTSL+ GP+ I + IGA G A + C ++D +P++ +
Sbjct: 274 SIACSGGCQAVVDTGTSLLTGPTSAIDNIQSYIGARKDLLGEAVISCSSIDSLPDIVFTI 333
Query: 167 GGKNFTLT 174
G F LT
Sbjct: 334 NGVEFPLT 341
Score = 68.6 bits (166), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 31/47 (65%), Positives = 39/47 (82%), Gaps = 3/47 (6%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKF 54
+Y+GTIS+GTPPQEF VIFDTGSSNLW+PS +CS ++AC D +F
Sbjct: 84 EYFGTISIGTPPQEFTVIFDTGSSNLWVPSTYCS--SLAC-YDHKRF 127
>gi|392575952|gb|EIW69084.1| hypothetical protein TREMEDRAFT_39371 [Tremella mesenterica DSM
1558]
Length = 446
Score = 95.5 bits (236), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 62/152 (40%), Positives = 86/152 (56%), Gaps = 8/152 (5%)
Query: 24 VIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASDEN-GGEIMFGGVDKDKFV 82
+ +DT S N +P + ++LN +D+ F FR SDEN GGE +FGGVD +
Sbjct: 244 LAYDTISVNHIVPPFY-TMLNRGL-LDEPVF----SFRLGSDENDGGECIFGGVDDSAYT 297
Query: 83 GDITYSPVSRKGYWQFGVESIKI-EKNVYCSNCQAIADTGTSLIIGPSKVIAELNKLIGA 141
G I Y P+ RKGYW+ +E I E+ + N A DTGTSLI+ PS V LNK IGA
Sbjct: 298 GKIQYVPIRRKGYWEVELEKIGFGEEELELENTGAAIDTGTSLIVMPSDVAEMLNKEIGA 357
Query: 142 VPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
NG VDC+ + +P + + +GG ++ L
Sbjct: 358 TKSWNGQYTVDCNTVPSLPELSLTMGGIDWVL 389
Score = 65.5 bits (158), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 28/41 (68%), Positives = 34/41 (82%), Gaps = 2/41 (4%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQYY I++GTPPQEFKV+ DTGSSNLW+PS C+ +IAC
Sbjct: 133 AQYYAPITIGTPPQEFKVVLDTGSSNLWVPSSSCT--SIAC 171
>gi|348530268|ref|XP_003452633.1| PREDICTED: cathepsin D-like [Oreochromis niloticus]
Length = 396
Score = 95.5 bits (236), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 46/116 (39%), Positives = 69/116 (59%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAIA 118
RN E GGE++ GG D + GD Y+ +SR+ YWQ ++ + + + C C+AI
Sbjct: 221 RNPDTEPGGELLLGGTDPKYYDGDFHYANISRQAYWQVHMDGMTVGSQLSLCKGGCEAIV 280
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSLI GP+ + L K IGA+PL G V+C + +P + +GG+++TLT
Sbjct: 281 DTGTSLITGPAAEVKALQKAIGAIPLIQGEYLVNCSKIPSLPVITFNVGGQSYTLT 336
Score = 77.0 bits (188), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 31/47 (65%), Positives = 39/47 (82%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+++ AQYYG I+LGTPPQ+F V+FDTGSSNLW+PS HCS +IAC
Sbjct: 68 LKNYLDAQYYGEITLGTPPQKFTVVFDTGSSNLWVPSVHCSFFDIAC 114
>gi|296230582|ref|XP_002760770.1| PREDICTED: cathepsin E isoform 1 [Callithrix jacchus]
Length = 396
Score = 95.5 bits (236), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 48/111 (43%), Positives = 70/111 (63%), Gaps = 3/111 (2%)
Query: 68 GGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKN-VYCSN-CQAIADTGTSLI 125
G E++FGG D F G + + PV+++ YWQ ++ I++ ++CS CQAI DTGTSLI
Sbjct: 228 GSELIFGGYDHSHFSGSLNWVPVTKQAYWQIALDDIQVGGTAMFCSEGCQAIVDTGTSLI 287
Query: 126 IGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLTRT 176
GPS I +L IGA P+ +G V+C NL+ MP+V + G +TL+ T
Sbjct: 288 TGPSDKIKQLQNAIGAAPV-DGEYAVECANLNVMPDVTFTINGVPYTLSPT 337
Score = 61.2 bits (147), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 24/34 (70%), Positives = 31/34 (91%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCS 41
+Y+GTIS+G+PPQ F VIFDTGSSNLW+PS +C+
Sbjct: 77 EYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCT 110
>gi|68051036|emb|CAI46901.1| nothepsin [Podarcis siculus]
Length = 414
Score = 95.5 bits (236), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 45/116 (38%), Positives = 71/116 (61%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVY-CSN-CQAIA 118
R + ENGGE++FGG+D + G I + PV+ + YW+ ++++KI+ ++ C + C AI
Sbjct: 224 RGGNTENGGELIFGGIDHSLYKGSIHWVPVTEQKYWKIHMDNVKIQGHIAACKDGCAAIV 283
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
D+GTSLI GP I L + IGA P +G VDC L +P + +G + +T+T
Sbjct: 284 DSGTSLITGPPSQIIRLQQKIGAHPAPHGEFIVDCRRLSSLPPITFTIGQREYTIT 339
Score = 63.5 bits (153), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 29/55 (52%), Positives = 40/55 (72%), Gaps = 4/55 (7%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDK--DKFVGHSH 59
AQYYG +S+GTPPQ F V+FDTGSS+ W+PS C + ACS+ K + F+ +S+
Sbjct: 79 AQYYGEVSVGTPPQRFTVVFDTGSSDFWVPSARC--YSKACSMHKRFESFMSYSY 131
>gi|118102416|ref|XP_001235024.1| PREDICTED: cathepsin E [Gallus gallus]
Length = 397
Score = 95.5 bits (236), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 46/115 (40%), Positives = 73/115 (63%), Gaps = 3/115 (2%)
Query: 62 NASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAIAD 119
N GGE++FGG D +F+G + + PV+++GYWQ ++++++ V +C++ CQAI D
Sbjct: 223 NPDSSLGGEVLFGGFDPSRFLGTLHWVPVTQQGYWQIQLDNVQVGGTVAFCADGCQAIVD 282
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
TGTSL+ GP+K I E+ + IGA + +G VDC L MP V + G + L+
Sbjct: 283 TGTSLLTGPTKDIKEMQRYIGATAM-DGEYIVDCGRLSSMPIVTFTINGIPYVLS 336
Score = 59.3 bits (142), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 23/34 (67%), Positives = 30/34 (88%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCS 41
+Y+G IS+GTPPQ F V+FDTGSSNLW+PS +C+
Sbjct: 78 EYFGQISIGTPPQNFTVVFDTGSSNLWVPSIYCT 111
>gi|148232796|ref|NP_001083566.1| napsin A aspartic peptidase precursor [Xenopus laevis]
gi|38197533|gb|AAH61685.1| MGC68767 protein [Xenopus laevis]
Length = 392
Score = 95.5 bits (236), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 48/116 (41%), Positives = 65/116 (56%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKN-VYCSN-CQAIA 118
R+ GGE++ GG D + + GD Y V+R YWQ + +++ V C CQAI
Sbjct: 212 RDPMATVGGELILGGTDPNYYTGDFHYLNVTRMAYWQIKADEVRVNNQLVLCKGGCQAIV 271
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSLI GP + I L+K IGA PL G ++C + +P V ILGG + LT
Sbjct: 272 DTGTSLITGPKEEIRALHKAIGAFPLFAGEYFINCKRIQSLPTVSFILGGVAYNLT 327
Score = 72.0 bits (175), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 31/45 (68%), Positives = 35/45 (77%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDK 51
AQYYG I +GTPPQ+F VIFDTGSSNLW+PS CS + AC V K
Sbjct: 65 AQYYGEIFIGTPPQKFAVIFDTGSSNLWVPSVKCSFFDFACWVHK 109
>gi|342186743|emb|CBI83255.1| cathepsin D [Oreochromis mossambicus]
Length = 239
Score = 95.5 bits (236), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 49/128 (38%), Positives = 76/128 (59%), Gaps = 3/128 (2%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKN 108
V+K+ F + + RN E GGE++ GG D + GD Y+ +SR+ YWQ ++ + +
Sbjct: 79 VEKNVFSFYLN-RNPDTEPGGELLLGGTDPKYYDGDFHYANISRQAYWQVHMDGMTVGSQ 137
Query: 109 V-YC-SNCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
+ C C+AI DTGTSLI GP+ + L K IGA+PL G V+C + +P + +
Sbjct: 138 LSLCKGGCEAIVDTGTSLITGPAAEVKALQKAIGAIPLIQGEYLVNCSKIPSLPVITFNV 197
Query: 167 GGKNFTLT 174
GG+++TLT
Sbjct: 198 GGQSYTLT 205
>gi|226822856|gb|ACO83090.1| cathepsin D preproprotein (predicted) [Dasypus novemcinctus]
Length = 410
Score = 95.1 bits (235), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 45/128 (35%), Positives = 78/128 (60%), Gaps = 3/128 (2%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKN 108
VDK+ F + + R+ + + GGE++ GG+D + G + Y ++RK YWQ ++ +++
Sbjct: 222 VDKNVFSFYLN-RDPTAQPGGELVLGGMDPRHYTGSVDYLNITRKAYWQVHMDRLEVGDG 280
Query: 109 V-YCSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
+ C C+AI DTGTSL++GP + EL K IGAVPL G + C+ + +P + ++L
Sbjct: 281 LTLCKQGCEAIVDTGTSLMVGPVAEVRELQKAIGAVPLIQGEYMISCEKVASLPPITLML 340
Query: 167 GGKNFTLT 174
G + + L+
Sbjct: 341 GNRGYRLS 348
Score = 72.4 bits (176), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 33/66 (50%), Positives = 45/66 (68%), Gaps = 2/66 (3%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHS-- 58
+R+ AQYYG I +GTP Q F+V+FDTGSSNLW+PS HC +L+ AC + + G S
Sbjct: 71 LRNYMDAQYYGEIGIGTPAQCFRVVFDTGSSNLWVPSIHCRLLDFACWLHRKYNSGRSST 130
Query: 59 HFRNAS 64
+ +N S
Sbjct: 131 YVKNGS 136
>gi|157644743|gb|ABV59077.1| cathepsin D [Lates calcarifer]
gi|396084116|gb|AFN84539.1| cathepsin D [Lates calcarifer]
Length = 396
Score = 95.1 bits (235), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 45/116 (38%), Positives = 68/116 (58%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAIA 118
RN GGE++ GG D + GD Y ++R+ YWQ ++ + + + C C+AI
Sbjct: 221 RNPDTAPGGELLLGGTDPKYYTGDFNYVNITRQAYWQIHMDELVVGTQLSLCKGGCEAIV 280
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSLI GPS + L K IGA+PL G V+CD + +P + +GG++++LT
Sbjct: 281 DTGTSLITGPSAEVKALQKAIGAIPLIQGEYMVNCDKVPSLPVITFNVGGQSYSLT 336
Score = 79.7 bits (195), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 33/47 (70%), Positives = 40/47 (85%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+++ AQYYG ISLGTPPQ F V+FDTGSSNLW+PS HCS+L+IAC
Sbjct: 68 LKNYLDAQYYGDISLGTPPQTFSVVFDTGSSNLWVPSVHCSLLDIAC 114
>gi|149725292|ref|XP_001501875.1| PREDICTED: pepsin A-like [Equus caballus]
Length = 387
Score = 95.1 bits (235), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 48/128 (37%), Positives = 78/128 (60%), Gaps = 5/128 (3%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-K 107
V +D F S + ++ DE+G +MFGG+D + G + + PVS +GYWQ ++SI + +
Sbjct: 207 VSQDLF---SVYLSSDDESGSVVMFGGIDSSYYSGSLNWVPVSNEGYWQITMDSITMNGE 263
Query: 108 NVYCSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
++ CS CQAI DTGTSL+ GP+ I + IGA ++G + + C ++D +P++ L
Sbjct: 264 SIACSGGCQAIVDTGTSLLAGPTSAIDNIQSYIGASEDSSGESVISCSSIDSLPDIVFTL 323
Query: 167 GGKNFTLT 174
G F L+
Sbjct: 324 NGVEFPLS 331
Score = 69.3 bits (168), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 30/41 (73%), Positives = 37/41 (90%), Gaps = 2/41 (4%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACS 48
+Y+GTIS+GTPPQEF VIFDTGSSNLW+PS +CS ++ACS
Sbjct: 74 EYFGTISIGTPPQEFTVIFDTGSSNLWVPSTYCS--SLACS 112
>gi|193499289|gb|ACF18587.1| pepsinogen A1 precursor [Siniperca chuatsi]
gi|193499299|gb|ACF18592.1| pepsinogen A1 precursor [Siniperca chuatsi]
gi|210076783|gb|ACJ06745.1| pepsinogen [Siniperca chuatsi]
gi|312369796|gb|ADQ74489.1| pepsinogen [Siniperca chuatsi]
Length = 378
Score = 95.1 bits (235), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 53/127 (41%), Positives = 79/127 (62%), Gaps = 6/127 (4%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKN 108
V++D F S + +A+ + G + FGGVD + + G IT+ P+S + YWQ V+S+ +
Sbjct: 203 VNQDLF---SVYLSANSQQGSVVTFGGVDPNHYYGSITWIPLSSELYWQITVDSVTVNGQ 259
Query: 109 VY-CS-NCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
V CS CQAI DTGTSLI+GP I+ +N +GA NG V+C+N+ +MP+V +
Sbjct: 260 VVACSGGCQAIVDTGTSLIVGPQNSISNINSGVGASG-QNGDYVVNCNNIAQMPDVTFHI 318
Query: 167 GGKNFTL 173
G+ FTL
Sbjct: 319 HGQEFTL 325
Score = 65.9 bits (159), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 31/46 (67%), Positives = 35/46 (76%), Gaps = 3/46 (6%)
Query: 9 YYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKF 54
YYG IS+GTPPQ FKVIFDTGSSNLW+PS +C N A + DKF
Sbjct: 72 YYGIISIGTPPQSFKVIFDTGSSNLWVPSIYC---NSAACNNHDKF 114
>gi|73621385|sp|Q9GMY7.1|PEPA_RHIFE RecName: Full=Pepsin A; Flags: Precursor
gi|9798658|dbj|BAB11751.1| pepsinogen A [Rhinolophus ferrumequinum]
Length = 386
Score = 95.1 bits (235), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 50/127 (39%), Positives = 75/127 (59%), Gaps = 5/127 (3%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKN 108
V +D F S + +++D+ G +MFGG+D F G++ + P+S + YWQ V+SI +
Sbjct: 206 VSQDLF---SVYLSSNDQGGSVVMFGGIDSSYFTGNLNWVPLSSETYWQITVDSITMNGQ 262
Query: 109 VY-CS-NCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
V CS +CQAI DTGTSL+ GP+ IA + IGA ANG V C ++ +PN+ +
Sbjct: 263 VIACSGSCQAIVDTGTSLLSGPTNAIASIQGYIGASQNANGEMVVSCSAINTLPNIVFTI 322
Query: 167 GGKNFTL 173
G + L
Sbjct: 323 NGVQYPL 329
Score = 64.7 bits (156), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 30/47 (63%), Positives = 39/47 (82%), Gaps = 3/47 (6%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKF 54
+Y+GTI +GTPPQEF VIFDTGSSNLW+PS +CS + ACS + ++F
Sbjct: 73 EYFGTIGIGTPPQEFTVIFDTGSSNLWVPSVYCS--SPACS-NHNRF 116
>gi|432943847|ref|XP_004083297.1| PREDICTED: cathepsin E-A-like [Oryzias latipes]
Length = 412
Score = 94.7 bits (234), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 45/108 (41%), Positives = 67/108 (62%), Gaps = 2/108 (1%)
Query: 69 GEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE--KNVYCSNCQAIADTGTSLII 126
G+++ GG D+ + G I + PV+ KGYWQ ++S+ ++ ++ C+AI DTGTSLI
Sbjct: 235 GQLLLGGTDESLYSGPINWVPVTIKGYWQIRMDSVSVQGVSSLCRRGCEAIVDTGTSLIA 294
Query: 127 GPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
GP + I L++LIGA P G VDC L +P+V +LG +TLT
Sbjct: 295 GPPREILRLHQLIGATPTHFGDFVVDCARLSSLPHVTFVLGEVEYTLT 342
Score = 62.4 bits (150), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 30/55 (54%), Positives = 37/55 (67%), Gaps = 9/55 (16%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFR 61
AQYYG I LGTP Q F VIFDTGSS+LW+PS +C ++ AC+ H HF+
Sbjct: 81 AQYYGEIRLGTPEQNFSVIFDTGSSDLWVPSSYC--VSQACAF-------HRHFK 126
>gi|18858489|ref|NP_571785.1| cathepsin D [Danio rerio]
gi|12053845|emb|CAC20111.1| cathepsin D enzyme [Danio rerio]
Length = 399
Score = 94.7 bits (234), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 48/118 (40%), Positives = 67/118 (56%), Gaps = 4/118 (3%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAIA 118
RN + GGE++ GG D + GD Y +SR+ YWQ ++ + I + C C+AI
Sbjct: 220 RNPDTQPGGELLLGGTDPKYYTGDFNYVDISRQAYWQIHMDGMSIGSGLSLCKGGCEAIV 279
Query: 119 DTGTS--LIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTS LI GP+ + L K IGA+PL G VDC + +P + LGGK ++LT
Sbjct: 280 DTGTSTSLITGPAAEVKALQKAIGAIPLMQGEYMVDCKKVPTLPTISFSLGGKVYSLT 337
Score = 71.6 bits (174), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 30/47 (63%), Positives = 37/47 (78%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+++ AQYYG I LGTP Q F V+FDTGSSNLW+PS HCS+ +IAC
Sbjct: 67 LKNYLDAQYYGEIGLGTPVQTFTVVFDTGSSNLWVPSVHCSLTDIAC 113
>gi|195470497|ref|XP_002087543.1| GE17582 [Drosophila yakuba]
gi|194173644|gb|EDW87255.1| GE17582 [Drosophila yakuba]
Length = 421
Score = 94.7 bits (234), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 50/117 (42%), Positives = 70/117 (59%), Gaps = 3/117 (2%)
Query: 61 RNASD-ENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSNCQAIAD 119
RNASD NGG ++ GG D + G +TY P+S+ G+WQ V + I CSNCQAI D
Sbjct: 238 RNASDASNGGVLLLGGADPTLYSGCLTYVPLSKVGFWQITVGQVAIGSKKLCSNCQAIFD 297
Query: 120 TGTSLIIGPSKVIAELNKLIG--AVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
GTSLII P + +N+ +G +G +DC + ++PNV + +G K+FTLT
Sbjct: 298 MGTSLIIVPCPALKIINRKLGIKEADKKDGVYIIDCSKVSRLPNVVLNIGWKDFTLT 354
>gi|353234557|emb|CCA66581.1| probable PEP4-aspartyl protease [Piriformospora indica DSM 11827]
Length = 411
Score = 94.7 bits (234), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 48/120 (40%), Positives = 69/120 (57%), Gaps = 2/120 (1%)
Query: 60 FR-NASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI-EKNVYCSNCQAI 117
FR AS+ +GGE +FGG+D + G ITY PV RKGYW+ +ES+ + + N A
Sbjct: 239 FRVGASEADGGEAVFGGIDSSHYTGKITYVPVRRKGYWEVELESVAFGDDELELENTGAA 298
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLTRTN 177
DTGTSLI+ P+ + LN IGA NG + CD + +P+ + GGK + + T+
Sbjct: 299 IDTGTSLIVMPTTIAEMLNSEIGATRSWNGQYTLPCDKVPGLPDFTFVFGGKPYPIASTD 358
Score = 63.2 bits (152), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 27/41 (65%), Positives = 33/41 (80%), Gaps = 2/41 (4%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQYY I++GTPPQ FKV+ DTGSSNLW+PS C+ +IAC
Sbjct: 98 AQYYADITIGTPPQTFKVVLDTGSSNLWVPSTSCT--SIAC 136
>gi|308321648|gb|ADO27975.1| pepsin A [Ictalurus furcatus]
Length = 241
Score = 94.7 bits (234), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 48/128 (37%), Positives = 77/128 (60%), Gaps = 5/128 (3%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-K 107
V +D F S + +++ + G E++FG +D + G I + P+S + Y+Q ++S+ I +
Sbjct: 66 VSQDVF---SVYLSSNGQQGSEVLFGEIDTSYYTGSIYWIPLSSESYYQIAMDSVSINGQ 122
Query: 108 NVYCSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
V CS CQAI DTGTS I+GPS I+ +N +GA G A V+C+N+ MP++ L
Sbjct: 123 TVACSGGCQAIIDTGTSYIVGPSSDISNINSWVGATVDQYGDASVNCNNIQSMPSITFTL 182
Query: 167 GGKNFTLT 174
G FT++
Sbjct: 183 NGYTFTIS 190
>gi|4589842|dbj|BAA76892.1| pepsinogen C [Gallus gallus]
Length = 389
Score = 94.7 bits (234), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 44/110 (40%), Positives = 68/110 (61%), Gaps = 3/110 (2%)
Query: 67 NGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCSN-CQAIADTGTS 123
GGE++FGGVD + + G IT++PV++ YWQ G+E + + + +CS CQ I DTGTS
Sbjct: 221 QGGELVFGGVDPNLYTGQITWTPVTQTTYWQIGIEDFAVGGQSSGWCSQGCQGIVDTGTS 280
Query: 124 LIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
L+ P++V EL + IGA +G C N++ MP + ++ G +F L
Sbjct: 281 LLTVPNQVFTELMQYIGAQADDSGQYVASCSNIEYMPTITFVISGTSFPL 330
Score = 57.8 bits (138), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 24/32 (75%), Positives = 27/32 (84%)
Query: 9 YYGTISLGTPPQEFKVIFDTGSSNLWIPSQHC 40
YYG IS+GTPPQ F V+FDTGSSNLW+PS C
Sbjct: 73 YYGEISIGTPPQNFLVLFDTGSSNLWVPSTLC 104
>gi|45382395|ref|NP_990208.1| gastricsin precursor [Gallus gallus]
gi|4589840|dbj|BAA76893.1| pepsinogen C [Gallus gallus]
Length = 389
Score = 94.7 bits (234), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 44/110 (40%), Positives = 68/110 (61%), Gaps = 3/110 (2%)
Query: 67 NGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCSN-CQAIADTGTS 123
GGE++FGGVD + + G IT++PV++ YWQ G+E + + + +CS CQ I DTGTS
Sbjct: 221 QGGELVFGGVDPNLYTGQITWTPVTQTTYWQIGIEDFAVGGQSSGWCSQGCQGIVDTGTS 280
Query: 124 LIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
L+ P++V EL + IGA +G C N++ MP + ++ G +F L
Sbjct: 281 LLTVPNQVFTELMQYIGAQADDSGQYVASCSNIEYMPTITFVISGTSFPL 330
Score = 57.8 bits (138), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 24/32 (75%), Positives = 27/32 (84%)
Query: 9 YYGTISLGTPPQEFKVIFDTGSSNLWIPSQHC 40
YYG IS+GTPPQ F V+FDTGSSNLW+PS C
Sbjct: 73 YYGEISIGTPPQNFLVLFDTGSSNLWVPSTLC 104
>gi|302696543|ref|XP_003037950.1| hypothetical protein SCHCODRAFT_71897 [Schizophyllum commune H4-8]
gi|300111647|gb|EFJ03048.1| hypothetical protein SCHCODRAFT_71897 [Schizophyllum commune H4-8]
Length = 406
Score = 94.7 bits (234), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 58/154 (37%), Positives = 86/154 (55%), Gaps = 8/154 (5%)
Query: 26 FDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFR-NASDENGGEIMFGGVDKDKFVGD 84
+DT S N +P H +++N +D+ F FR S+E+GGE +FGGVDK + GD
Sbjct: 206 YDTISVNHIVPP-HYNMINKGL-LDEPVF----SFRLGKSEEDGGEAIFGGVDKSAYKGD 259
Query: 85 ITYSPVSRKGYWQFGVESIKI-EKNVYCSNCQAIADTGTSLIIGPSKVIAELNKLIGAVP 143
+TY PV RK YW+ +E I + + + A DTGTSLI P+ + +N IGA
Sbjct: 260 LTYVPVRRKAYWEVELEKISFGSEELELESTGAAIDTGTSLIALPTDMAEMINAEIGAKK 319
Query: 144 LANGPAKVDCDNLDKMPNVDIILGGKNFTLTRTN 177
NG +V+C + +P + + GGK +TL T+
Sbjct: 320 SWNGQYQVECSKVPDLPELSLYFGGKPYTLKGTD 353
Score = 64.7 bits (156), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 35/84 (41%), Positives = 49/84 (58%), Gaps = 4/84 (4%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASD- 65
AQY+ I+LGTPPQ FKVI DTGSSNLW+PS C+ + D ++ +N ++
Sbjct: 93 AQYFTEITLGTPPQNFKVILDTGSSNLWVPSSKCTSIACFLHAKYDSSASSTYKQNGTEF 152
Query: 66 --ENGGEIMFGGVDKDKF-VGDIT 86
+ G M G V +D +GD+T
Sbjct: 153 SIQYGSGSMEGFVSQDVLTIGDLT 176
>gi|321250483|ref|XP_003191823.1| endopeptidase [Cryptococcus gattii WM276]
gi|317458290|gb|ADV20036.1| Endopeptidase, putative [Cryptococcus gattii WM276]
Length = 432
Score = 94.7 bits (234), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 51/117 (43%), Positives = 67/117 (57%), Gaps = 2/117 (1%)
Query: 60 FRNASDEN-GGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI-EKNVYCSNCQAI 117
FR S EN GGE +FGG+DK + G + Y PV RKGYW+ +ESI + + N A
Sbjct: 260 FRLGSSENDGGEAIFGGIDKSAYSGSLHYVPVRRKGYWEVELESISFGDDELELENTGAA 319
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSLI+ P+ V LNK IGA NG VDC+ + +P + GK + L+
Sbjct: 320 IDTGTSLIVMPTDVAEMLNKEIGAEKSWNGQYTVDCNTVPSLPELAFTFDGKAYKLS 376
Score = 58.5 bits (140), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 27/41 (65%), Positives = 31/41 (75%), Gaps = 2/41 (4%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQY+ I LGTP Q FKVI DTGSSNLW+PS C+ +IAC
Sbjct: 119 AQYFAQIELGTPAQTFKVILDTGSSNLWVPSVGCT--SIAC 157
>gi|18859121|ref|NP_571879.1| nothepsin [Danio rerio]
gi|12053847|emb|CAC20112.1| nothepsin [Danio rerio]
Length = 416
Score = 94.4 bits (233), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 45/112 (40%), Positives = 71/112 (63%), Gaps = 5/112 (4%)
Query: 68 GGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN-----CQAIADTGT 122
GGE++FG D+ +F+ I + PV++KGYWQ ++++K++ + S+ CQAI DTGT
Sbjct: 235 GGELVFGANDESRFLPPINWIPVTQKGYWQIKLDAVKVQGALSFSDRSVQGCQAIVDTGT 294
Query: 123 SLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
SLI GP++ I L + IGA P ANG VDC + +P V ++ ++L+
Sbjct: 295 SLIGGPARDILILQQFIGATPTANGEFVVDCVRVSSLPVVSFLINSVEYSLS 346
Score = 57.0 bits (136), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 26/48 (54%), Positives = 36/48 (75%), Gaps = 3/48 (6%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKF 54
AQ++G ISLG P Q F V+FDTGSS+LW+PS +C + AC++ +KF
Sbjct: 84 AQFFGQISLGRPEQNFTVVFDTGSSDLWVPSSYC--VTQACAL-HNKF 128
>gi|328860092|gb|EGG09199.1| hypothetical protein MELLADRAFT_42703 [Melampsora larici-populina
98AG31]
Length = 429
Score = 94.4 bits (233), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 47/122 (38%), Positives = 71/122 (58%), Gaps = 2/122 (1%)
Query: 54 FVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEK-NVYCS 112
++G SH + D GGE +FGG+D+ + GDI Y+PV R+GYW+ +E ++ K +
Sbjct: 253 YLGTSH-ESGVDNQGGEAIFGGIDEAHYEGDIHYAPVRRRGYWEVALEGVRFGKEEMKLV 311
Query: 113 NCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFT 172
N A DTGTSLI P+ +N +GA +G VDCD + +P++ GK+FT
Sbjct: 312 NVGAAIDTGTSLIALPTDTAEIINASLGAKKSWSGQYTVDCDKIPTLPDLTFTFAGKDFT 371
Query: 173 LT 174
+T
Sbjct: 372 IT 373
Score = 62.8 bits (151), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 27/41 (65%), Positives = 33/41 (80%), Gaps = 2/41 (4%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQY+ I++GTPPQ FKVI DTGSSNLW+PS C+ +IAC
Sbjct: 111 AQYFSEITIGTPPQSFKVILDTGSSNLWVPSTRCT--SIAC 149
>gi|443286988|dbj|BAM76488.1| pepsinogen, partial [Gadus macrocephalus]
Length = 323
Score = 94.4 bits (233), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 46/118 (38%), Positives = 70/118 (59%), Gaps = 2/118 (1%)
Query: 58 SHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN--CQ 115
S + ++ + G ++FGG+++ + G IT+ P++ YWQ ++S+ I N N CQ
Sbjct: 154 SVYLSSHSQEGSVVVFGGIEESYYTGQITWIPLTSATYWQIKMDSVTINGNPVACNGGCQ 213
Query: 116 AIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
AI DTGTSLI+GPS I+ +N +GA G A V+C N+ MP V L G +FT+
Sbjct: 214 AIIDTGTSLIVGPSSDISNMNSWVGASTDQYGDATVNCQNIGSMPEVVFTLSGHSFTV 271
Score = 63.2 bits (152), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 30/47 (63%), Positives = 37/47 (78%), Gaps = 3/47 (6%)
Query: 9 YYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFV 55
YYG IS+GTPPQ F VIFDTGSSNLW+PS +CS + AC+ + KF+
Sbjct: 17 YYGVISIGTPPQSFSVIFDTGSSNLWVPSVYCS--SQACN-NHRKFI 60
>gi|193499291|gb|ACF18588.1| pepsinogen A1 precursor [Siniperca scherzeri]
gi|193499301|gb|ACF18593.1| pepsinogen A1 precursor [Siniperca scherzeri]
Length = 378
Score = 94.4 bits (233), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 52/127 (40%), Positives = 79/127 (62%), Gaps = 6/127 (4%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKN 108
V++D F S + +A+ + G + FGG+D + + G IT+ P+S + YWQ V+S+ +
Sbjct: 203 VNQDLF---SVYLSANSQQGSVVTFGGIDPNHYYGSITWIPLSSELYWQITVDSVTVNGQ 259
Query: 109 VY-CS-NCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
V CS CQAI DTGTSLI+GP I+ +N +GA NG V+C+N+ +MP+V +
Sbjct: 260 VVACSGGCQAIVDTGTSLIVGPQSSISNINSGVGASG-QNGDYVVNCNNVAQMPDVTFHI 318
Query: 167 GGKNFTL 173
G+ FTL
Sbjct: 319 HGQEFTL 325
Score = 66.2 bits (160), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 31/46 (67%), Positives = 35/46 (76%), Gaps = 3/46 (6%)
Query: 9 YYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKF 54
YYG IS+GTPPQ FKVIFDTGSSNLW+PS +C N A + DKF
Sbjct: 72 YYGIISIGTPPQSFKVIFDTGSSNLWVPSVYC---NSAACNNHDKF 114
>gi|307175238|gb|EFN65290.1| Lysosomal aspartic protease [Camponotus floridanus]
Length = 357
Score = 94.4 bits (233), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 58/174 (33%), Positives = 90/174 (51%), Gaps = 20/174 (11%)
Query: 20 QEFKVIFDTGSSNLWIPSQHCSILNIACSVDKD------------KFVGHSHF------R 61
Q F + + + L++P+ IL +A S D + + SH R
Sbjct: 126 QTFAELTNMSNEELFLPAPFDGILGLAYSYISDNNIIPVFDNMVNQNLVSSHIFSFYLNR 185
Query: 62 NASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYC-SNCQAIADT 120
+ S E GE + GG D + G+ TY PV+ KG+WQF ++ I++ C S+CQAIADT
Sbjct: 186 DPSAELDGEFILGGSDPAHYDGNFTYVPVTHKGFWQFTMDKIEVNNISLCQSSCQAIADT 245
Query: 121 GTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
G GP+ + +N+LIG + +G +V+C + ++P + ILGGK F LT
Sbjct: 246 GMGETYGPTSDVKTINELIGTTNI-DGMERVNCSRIPELPTIRFILGGKAFNLT 298
Score = 67.0 bits (162), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 31/50 (62%), Positives = 40/50 (80%), Gaps = 3/50 (6%)
Query: 1 MRSSYQ-AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSV 49
+ S+YQ YYG I++GTPPQEFKVIFDTGS+NLWIPS+ C++ AC +
Sbjct: 32 ILSNYQNINYYGVITIGTPPQEFKVIFDTGSANLWIPSKKCNL--TACLI 79
>gi|194218273|ref|XP_001501915.2| PREDICTED: pepsin A-like [Equus caballus]
Length = 387
Score = 94.4 bits (233), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 48/128 (37%), Positives = 75/128 (58%), Gaps = 5/128 (3%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-K 107
V +D F S + ++ DE+G +MFGG+D + G + + PVS + YWQ V+SI + +
Sbjct: 207 VSQDLF---SVYLSSDDESGSVVMFGGIDSSYYSGSLNWVPVSEEAYWQITVDSITMNGE 263
Query: 108 NVYCSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
++ CS CQAI DTGTSL+ GP I + IGA ++G + C ++D +P++ +
Sbjct: 264 SIACSGGCQAIVDTGTSLLAGPPSAIDNIQSYIGASEDSSGEGAISCSSIDSLPDIVFTI 323
Query: 167 GGKNFTLT 174
G F LT
Sbjct: 324 NGVEFPLT 331
Score = 67.0 bits (162), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 31/47 (65%), Positives = 40/47 (85%), Gaps = 3/47 (6%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKF 54
+Y+GTIS+GTP QEF VIFDTGSSNLW+PS +CS ++ACS D ++F
Sbjct: 74 EYFGTISIGTPAQEFTVIFDTGSSNLWVPSVYCS--SLACS-DHNRF 117
>gi|166361873|gb|ABY87035.1| pepsinogen A2 [Epinephelus coioides]
gi|166361877|gb|ABY87037.1| pepsinogen A2 [Epinephelus coioides]
Length = 377
Score = 94.4 bits (233), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 49/127 (38%), Positives = 77/127 (60%), Gaps = 6/127 (4%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKN 108
V +D F S + +++ G + FGG+D + + G I++ P+S + YWQ V+S+ +
Sbjct: 203 VSQDLF---SVYLSSNSNRGSVVTFGGIDPNHYSGSISWIPLSSELYWQITVDSVTVNGQ 259
Query: 109 VYCSN--CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
V N CQAI DTGTSLI+GP I+ +N+++GA NG V C+N+ +MP+V +
Sbjct: 260 VVACNGGCQAIVDTGTSLIVGPQSSISNINQVVGAYS-QNGNDMVSCNNIGQMPDVTFHI 318
Query: 167 GGKNFTL 173
G+ FTL
Sbjct: 319 QGQEFTL 325
Score = 64.7 bits (156), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 38/89 (42%), Positives = 54/89 (60%), Gaps = 10/89 (11%)
Query: 9 YYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASDENG 68
Y+G IS+GTPPQ FKV+FDTGSSNLW+PS +CS + AC+ + DKF + + +NG
Sbjct: 72 YFGIISIGTPPQSFKVVFDTGSSNLWVPSVYCS--SPACN-NHDKF--NPSLSSTYRQNG 126
Query: 69 G--EIMFGGVDKDKFVGDITYSPVSRKGY 95
I +G +G + Y V+ G+
Sbjct: 127 ASLRIQYG---TGSMIGFLGYDTVTVGGF 152
>gi|301786583|ref|XP_002928700.1| PREDICTED: pepsin A-like isoform 2 [Ailuropoda melanoleuca]
Length = 393
Score = 94.4 bits (233), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 47/127 (37%), Positives = 76/127 (59%), Gaps = 5/127 (3%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-K 107
V +D F S + ++ D++G +MFGG+D F G++ + PVS +GYWQ ++S+ I +
Sbjct: 206 VSQDLF---SVYLSSDDQSGSVVMFGGIDSSYFTGNLNWVPVSVEGYWQITMDSVTINGQ 262
Query: 108 NVYCSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
+ CS CQAI DTGTSL+ GP+ IA + IGA +NG + C ++ +P++ +
Sbjct: 263 AIACSQGCQAIVDTGTSLLAGPTNSIANIQSYIGASEDSNGEMTISCSAINDLPDIVFTI 322
Query: 167 GGKNFTL 173
G + L
Sbjct: 323 NGIQYPL 329
Score = 65.1 bits (157), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 29/41 (70%), Positives = 35/41 (85%), Gaps = 2/41 (4%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACS 48
+Y+GTI +GTPPQEF VIFDTGSSNLW+PS +CS + ACS
Sbjct: 73 EYFGTIGIGTPPQEFTVIFDTGSSNLWVPSVYCS--SPACS 111
>gi|73620984|sp|P81497.2|PEPA_SUNMU RecName: Full=Pepsin A; Flags: Precursor
gi|9798654|dbj|BAB11749.1| pepsinogen A [Suncus murinus]
Length = 387
Score = 94.0 bits (232), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 48/127 (37%), Positives = 79/127 (62%), Gaps = 5/127 (3%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-K 107
V +D F S + +++D++G ++FGG+D + G++ + P+S +GYWQ V+SI + +
Sbjct: 207 VSQDLF---SVYLSSNDQSGSVVIFGGIDSSYYTGNLNWVPLSSEGYWQITVDSITMNGQ 263
Query: 108 NVYCS-NCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
+ CS +CQAI DTGTSL+ GP+ IA + K IGA ANG V C ++ +P++ +
Sbjct: 264 AIACSGSCQAIVDTGTSLLSGPNNAIANIQKSIGASQNANGQMVVSCSSIQSLPDIVFTI 323
Query: 167 GGKNFTL 173
G + L
Sbjct: 324 NGIQYPL 330
Score = 64.7 bits (156), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 29/41 (70%), Positives = 35/41 (85%), Gaps = 2/41 (4%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACS 48
+Y+GTI +GTPPQEF VIFDTGSSNLW+PS +CS + ACS
Sbjct: 74 EYFGTIGIGTPPQEFTVIFDTGSSNLWVPSVYCS--SPACS 112
>gi|296219067|ref|XP_002755720.1| PREDICTED: cathepsin D [Callithrix jacchus]
Length = 392
Score = 94.0 bits (232), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 47/128 (36%), Positives = 77/128 (60%), Gaps = 3/128 (2%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKN 108
VD++ F + + R+ + GGE+M GG D + G + Y V+RK YWQ ++ +++
Sbjct: 204 VDQNIFSFYLN-RDPDAQPGGELMLGGTDSKYYKGSLFYLNVTRKAYWQVHMDQVEVASG 262
Query: 109 V-YCSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
+ C C+AI DTGTSL++GP + EL K IGA+PL G + C+ + +P + + L
Sbjct: 263 LTLCKGGCEAIVDTGTSLMVGPVDEVRELQKAIGAMPLIQGEYMIPCEKVSTLPVIMLKL 322
Query: 167 GGKNFTLT 174
GGK++ L+
Sbjct: 323 GGKDYELS 330
Score = 74.7 bits (182), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 31/48 (64%), Positives = 39/48 (81%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACS 48
+++ AQYYG I +GTPPQ F V+FDTGSSNLW+PS HC +L+IACS
Sbjct: 71 LKNYMDAQYYGEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIACS 118
>gi|11990128|emb|CAC19555.1| pepsin A [Camelus dromedarius]
Length = 390
Score = 94.0 bits (232), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 47/128 (36%), Positives = 79/128 (61%), Gaps = 5/128 (3%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-K 107
+ +D F S + +++DE+G ++FGG+D + G + + PVS +GYWQ V+SI +E +
Sbjct: 209 ISEDLF---SVYLSSNDESGSVVIFGGIDSSYYTGSLNWVPVSVEGYWQITVDSITMEGE 265
Query: 108 NVYCSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
++ CS+ CQAI DTGTSL+ GP+ I+ + IGA + G V C ++ +PN+ +
Sbjct: 266 SIACSSGCQAIVDTGTSLLAGPTDAISNIQSYIGASEDSYGDMVVSCSSISSLPNIVFTI 325
Query: 167 GGKNFTLT 174
G + L+
Sbjct: 326 NGVQYPLS 333
Score = 60.5 bits (145), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 25/34 (73%), Positives = 30/34 (88%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCS 41
+Y+GTIS+GTP Q F VIFDTGSSNLW+PS +CS
Sbjct: 76 EYFGTISIGTPAQNFTVIFDTGSSNLWVPSIYCS 109
>gi|387915422|gb|AFK11320.1| cathepsin E-A-like protein [Callorhinchus milii]
Length = 401
Score = 94.0 bits (232), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 45/116 (38%), Positives = 72/116 (62%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAIA 118
R + GGE++ GG++ + + G I + PV+ +GYWQ ++++KI+ + C N C AI
Sbjct: 225 REMDSDYGGELLLGGINHECYTGSINWVPVTERGYWQIRMDNVKIDGMLTLCINGCAAIV 284
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSLI GP K I +L+K +GA+ + +G VDC + M +V +G F+L+
Sbjct: 285 DTGTSLITGPEKEIRKLHKQLGAMSVGDGEYVVDCKRISSMASVTFTIGEVEFSLS 340
Score = 60.5 bits (145), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 28/52 (53%), Positives = 37/52 (71%), Gaps = 2/52 (3%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHS 58
AQYYG I +GTP Q+F V+FDTGSSNLW+PS +C ++ AC + + HS
Sbjct: 80 AQYYGEIGIGTPLQKFTVVFDTGSSNLWVPSAYC--ISEACKMHEQFKSFHS 129
>gi|301786581|ref|XP_002928699.1| PREDICTED: pepsin A-like isoform 1 [Ailuropoda melanoleuca]
gi|281347483|gb|EFB23067.1| hypothetical protein PANDA_018738 [Ailuropoda melanoleuca]
Length = 385
Score = 94.0 bits (232), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 47/127 (37%), Positives = 76/127 (59%), Gaps = 5/127 (3%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-K 107
V +D F S + ++ D++G +MFGG+D F G++ + PVS +GYWQ ++S+ I +
Sbjct: 206 VSQDLF---SVYLSSDDQSGSVVMFGGIDSSYFTGNLNWVPVSVEGYWQITMDSVTINGQ 262
Query: 108 NVYCSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
+ CS CQAI DTGTSL+ GP+ IA + IGA +NG + C ++ +P++ +
Sbjct: 263 AIACSQGCQAIVDTGTSLLAGPTNSIANIQSYIGASEDSNGEMTISCSAINDLPDIVFTI 322
Query: 167 GGKNFTL 173
G + L
Sbjct: 323 NGIQYPL 329
Score = 64.7 bits (156), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 29/41 (70%), Positives = 35/41 (85%), Gaps = 2/41 (4%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACS 48
+Y+GTI +GTPPQEF VIFDTGSSNLW+PS +CS + ACS
Sbjct: 73 EYFGTIGIGTPPQEFTVIFDTGSSNLWVPSVYCS--SPACS 111
>gi|440898030|gb|ELR49612.1| Napsin-A, partial [Bos grunniens mutus]
Length = 406
Score = 94.0 bits (232), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 46/116 (39%), Positives = 68/116 (58%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAIA 118
RN +GGE++ GG D ++ +T+ PV+ +WQ +E +++ + C+ C AI
Sbjct: 223 RNPEAADGGELVLGGSDPAHYIPPLTFVPVTIPAFWQIHMERVQVGTGLTLCARGCAAIL 282
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSLI GP++ I L K IGAVPL G ++C + +P V +LGG F LT
Sbjct: 283 DTGTSLITGPTEEIRALQKAIGAVPLLMGKYYIECSKIPTLPPVSFLLGGVWFNLT 338
Score = 65.5 bits (158), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 26/40 (65%), Positives = 30/40 (75%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
QYYG I LGTPPQ F V+FDTGSSNLW+PS C ++ C
Sbjct: 77 QYYGEIGLGTPPQNFSVVFDTGSSNLWVPSVRCHFFSLPC 116
>gi|66815097|ref|XP_641645.1| cathepsin D [Dictyostelium discoideum AX4]
gi|74960832|sp|O76856.1|CATD_DICDI RecName: Full=Cathepsin D; AltName: Full=Ddp44; Flags: Precursor
gi|3288145|emb|CAA76563.1| preprocathepsin D [Dictyostelium discoideum]
gi|6010025|emb|CAB57223.1| cathepsin D [Dictyostelium discoideum]
gi|60469656|gb|EAL67644.1| cathepsin D [Dictyostelium discoideum AX4]
Length = 383
Score = 94.0 bits (232), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 47/111 (42%), Positives = 67/111 (60%), Gaps = 3/111 (2%)
Query: 67 NGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV--YC-SNCQAIADTGTS 123
NGGE+ FG +D K+ GDITY P++ + YW+F ++ I+ +C + C AI D+GTS
Sbjct: 213 NGGELSFGSIDNTKYTGDITYVPLTNETYWEFVMDDFAIDGQSAGFCGTTCHAICDSGTS 272
Query: 124 LIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
LI GP I LN+ +GAV L DC ++ +PNV I + G+ F LT
Sbjct: 273 LIAGPMADITALNEKLGAVILNGEGVFSDCSVINTLPNVTITVAGREFVLT 323
Score = 71.2 bits (173), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 41/86 (47%), Positives = 52/86 (60%), Gaps = 6/86 (6%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHS--HFRNAS 64
AQYYG I++GTP Q FKV+FDTGSSNLWIPS+ C I +AC + G S + N +
Sbjct: 61 AQYYGAITIGTPGQAFKVVFDTGSSNLWIPSKKCPITVVACDLHNKYNSGASSTYVANGT 120
Query: 65 D---ENGGEIMFGGVDKDKF-VGDIT 86
D + G M G V +D VG +T
Sbjct: 121 DFTIQYGSGAMSGFVSQDSVTVGSLT 146
>gi|297462061|ref|XP_001790669.2| PREDICTED: napsin-A [Bos taurus]
gi|297485858|ref|XP_002695173.1| PREDICTED: napsin-A [Bos taurus]
gi|296477597|tpg|DAA19712.1| TPA: napsin A aspartic peptidase [Bos taurus]
Length = 408
Score = 94.0 bits (232), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 46/116 (39%), Positives = 68/116 (58%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAIA 118
RN +GGE++ GG D ++ +T+ PV+ +WQ +E +++ + C+ C AI
Sbjct: 223 RNPEAADGGELVLGGSDPAHYIPPLTFVPVTIPAFWQIHMERVQVGTGLTLCARGCAAIL 282
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSLI GP++ I L K IGAVPL G ++C + +P V +LGG F LT
Sbjct: 283 DTGTSLITGPTEEIRALQKAIGAVPLLMGKYYIECSKIPTLPPVSFLLGGVWFNLT 338
Score = 65.5 bits (158), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 26/40 (65%), Positives = 30/40 (75%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
QYYG I LGTPPQ F V+FDTGSSNLW+PS C ++ C
Sbjct: 77 QYYGEIGLGTPPQNFSVVFDTGSSNLWVPSVRCHFFSLPC 116
>gi|307188769|gb|EFN73376.1| Lysosomal aspartic protease [Camponotus floridanus]
Length = 175
Score = 94.0 bits (232), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 48/115 (41%), Positives = 68/115 (59%), Gaps = 5/115 (4%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN-CQAIAD 119
RNAS + GG+++FGG D + G+ TY PV+ KGYWQF + SI+I ++C CQAI D
Sbjct: 5 RNASADLGGKLIFGGSDPAYYEGNFTYVPVTDKGYWQFTINSIQINDFIWCEKGCQAIVD 64
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
T IIGP + +N+ N +VDC+ + ++P + LGGK F LT
Sbjct: 65 TNAWKIIGPGMDVLLINRFFE----TNLQGRVDCNRISQLPIIRFSLGGKTFDLT 115
>gi|302761354|ref|XP_002964099.1| hypothetical protein SELMODRAFT_142401 [Selaginella moellendorffii]
gi|300167828|gb|EFJ34432.1| hypothetical protein SELMODRAFT_142401 [Selaginella moellendorffii]
Length = 497
Score = 93.6 bits (231), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 45/95 (47%), Positives = 70/95 (73%), Gaps = 4/95 (4%)
Query: 61 RNASDE-NGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCSN-CQA 116
R+ASDE +GGEI+FGGV+KD+F G Y+PV+R+GYWQF + + + + +C+ C A
Sbjct: 224 RDASDEEDGGEIVFGGVNKDRFKGKHVYTPVTREGYWQFNMGDVVVDGQSTGFCAKGCAA 283
Query: 117 IADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKV 151
IAD+GTSL++GP+ ++A++N+ IGA L + K+
Sbjct: 284 IADSGTSLLVGPTGIVAQINQAIGATGLVSEECKM 318
Score = 65.5 bits (158), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 29/51 (56%), Positives = 37/51 (72%), Gaps = 1/51 (1%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDK 51
+R+ AQYYG I +G+PPQ F VIFDTGSSNLW+PS C I + AC + +
Sbjct: 72 LRNYLDAQYYGEIGIGSPPQVFTVIFDTGSSNLWVPSSRC-IFSPACWLHR 121
Score = 42.7 bits (99), Expect = 0.053, Method: Compositional matrix adjust.
Identities = 24/75 (32%), Positives = 38/75 (50%), Gaps = 1/75 (1%)
Query: 101 ESIKIEKNVYCSNCQ-AIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKM 159
E+ K+ + CS C+ A+ L P+K +LN+L +P NG + VDC + +
Sbjct: 363 ENQKVGDHGLCSVCEMAVVSVQNQLRQNPTKQQIDLNQLCERLPSPNGQSFVDCAKISSL 422
Query: 160 PNVDIILGGKNFTLT 174
PNV + + F LT
Sbjct: 423 PNVSFTIANQMFELT 437
>gi|57619120|ref|NP_001009868.1| pregnancy-associated glycoprotein precursor [Felis catus]
gi|2689727|gb|AAB91422.1| pregnancy-associated glycoprotein [Felis catus]
Length = 388
Score = 93.6 bits (231), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 45/116 (38%), Positives = 70/116 (60%), Gaps = 2/116 (1%)
Query: 58 SHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN--CQ 115
+ + + DE G +MFGGVD + GD+ + PVS++ YWQ ++SI + V + CQ
Sbjct: 213 AFYLSKKDEEGSVVMFGGVDHSYYSGDLNWVPVSKRLYWQLSMDSISMNGEVIACDGGCQ 272
Query: 116 AIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNF 171
AI DTGTSL+IGPS V+ + +IGA +G VDCD + +P++ + G ++
Sbjct: 273 AIIDTGTSLLIGPSHVVFNIQMIIGANQSYSGEYVVDCDAANTLPDIVFTINGIDY 328
Score = 67.0 bits (162), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 29/41 (70%), Positives = 34/41 (82%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCS 41
MR+ Y GTIS+GTPPQEFKVIFDTGSS+LW+PS +CS
Sbjct: 67 MRNYLDLAYVGTISIGTPPQEFKVIFDTGSSDLWVPSIYCS 107
>gi|198421979|ref|XP_002130758.1| PREDICTED: similar to Ctsd protein [Ciona intestinalis]
Length = 385
Score = 93.6 bits (231), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 49/131 (37%), Positives = 75/131 (57%), Gaps = 5/131 (3%)
Query: 48 SVDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE- 106
+VD++ F + + R+ +GGEI GGV+ ++VGD Y V+R+GYWQ ++ + I
Sbjct: 196 AVDQNLFAFYLN-RDPEAADGGEITLGGVNPARYVGDFNYHDVTRQGYWQIKMDGLSIAD 254
Query: 107 --KNVYCSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVD 163
K C+ CQ I D+GTSLI GPS +N+ IGA+ G V C + +MP++
Sbjct: 255 TAKTTACNGGCQVIVDSGTSLITGPSADTDAINQAIGAIKFVQGEYLVICRRIPEMPDIT 314
Query: 164 IILGGKNFTLT 174
+L G + LT
Sbjct: 315 FVLDGIEYVLT 325
Score = 64.7 bits (156), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 28/41 (68%), Positives = 31/41 (75%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQY+G IS+GTP Q F VIFDTGSSNLW+PS C N AC
Sbjct: 61 AQYFGEISIGTPEQTFTVIFDTGSSNLWVPSASCPSTNYAC 101
>gi|431892909|gb|ELK03337.1| Renin [Pteropus alecto]
Length = 353
Score = 93.6 bits (231), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 66/224 (29%), Positives = 99/224 (44%), Gaps = 53/224 (23%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDK--DKFVGHSHFRNAS 64
QYYG I +GTPPQ FKVIFDTGS+NLW+PS CS L AC + D S+ N +
Sbjct: 73 TQYYGEIGIGTPPQTFKVIFDTGSANLWVPSTKCSPLYTACEIHSLYDSLESSSYMENGT 132
Query: 65 D-------------ENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQF------------- 98
+ + + GG+ + G++T P+ +F
Sbjct: 133 EFTIHYGSGKVKGFLSQDTVTVGGITVTQTFGEVTELPLIPFMLAKFDGVLGMGFPAQAV 192
Query: 99 -GVESI-------KIEK----NVYCSN-------------CQAIADTGTSLIIGPSKVIA 133
GV + K+ K +VY S C + DTG S I GP+ +
Sbjct: 193 GGVTPVFDHILSQKVLKEDVFSVYYSRVSVRSATLLCEEGCMVVVDTGASYISGPTSSLR 252
Query: 134 ELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLTRTN 177
L + +GA L+ V+C+ + +P++ LGG+ +TLT +
Sbjct: 253 LLMETLGAKELSTDEYVVNCNQVPTLPDISFHLGGRAYTLTSAD 296
>gi|18959216|ref|NP_579818.1| gastricsin precursor [Rattus norvegicus]
gi|129798|sp|P04073.1|PEPC_RAT RecName: Full=Gastricsin; AltName: Full=Pepsinogen C; Flags:
Precursor
gi|56881|emb|CAA28305.1| unnamed protein product [Rattus norvegicus]
gi|206083|gb|AAA41827.1| pepsinogen [Rattus norvegicus]
gi|149069457|gb|EDM18898.1| progastricsin (pepsinogen C) [Rattus norvegicus]
Length = 392
Score = 93.6 bits (231), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 45/115 (39%), Positives = 75/115 (65%), Gaps = 4/115 (3%)
Query: 67 NGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCSN--CQAIADTGT 122
NGG+I+FGGVDK+ + G+IT+ PV+++ YWQ ++ I + + +CS+ CQ I DTGT
Sbjct: 224 NGGQIVFGGVDKNLYTGEITWVPVTQELYWQITIDDFLIGDQASGWCSSQGCQGIVDTGT 283
Query: 123 SLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLTRTN 177
SL++ P++ ++EL + IGA G V CD++ +P + +L G F L+ ++
Sbjct: 284 SLLVMPAQYLSELLQTIGAQEGEYGEYFVSCDSVSSLPTLSFVLNGVQFPLSPSS 338
Score = 55.8 bits (133), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 23/34 (67%), Positives = 28/34 (82%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHC 40
A Y+G IS+GTPPQ F V+FDTGSSNLW+ S +C
Sbjct: 74 ASYFGEISIGTPPQNFLVLFDTGSSNLWVSSVYC 107
>gi|256075652|ref|XP_002574131.1| subfamily A1A unassigned peptidase (A01 family) [Schistosoma
mansoni]
gi|360043433|emb|CCD78846.1| subfamily A1A unassigned peptidase (A01 family) [Schistosoma
mansoni]
Length = 401
Score = 93.6 bits (231), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 47/126 (37%), Positives = 75/126 (59%), Gaps = 2/126 (1%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKN 108
VDK F + + N GEIMFGG+D + G++TYS V + YW ++ I I
Sbjct: 212 VDKPLFAVYLNL-NEDKTTSGEIMFGGIDDRYYTGNLTYSDVVSEEYWMINIDGISINGE 270
Query: 109 VYC-SNCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILG 167
++C S A+ DTGT+LI GP++ I +NK +G++ ++N VDC+ + K+P ++I +
Sbjct: 271 IFCPSGSTALIDTGTALISGPTEKINNINKYLGSIQMSNNEYIVDCNKIQKLPIIEIKIN 330
Query: 168 GKNFTL 173
GK+ L
Sbjct: 331 GKSIQL 336
Score = 62.4 bits (150), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 27/42 (64%), Positives = 32/42 (76%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSV 49
+YYG IS+GTPPQ F VIFDTGS LWIPS+ C N+AC +
Sbjct: 76 EYYGEISIGTPPQIFHVIFDTGSPYLWIPSKKCDPSNLACQL 117
>gi|6680552|ref|NP_032463.1| napsin-A precursor [Mus musculus]
gi|6016430|sp|O09043.1|NAPSA_MOUSE RecName: Full=Napsin-A; AltName: Full=KDAP-1; AltName:
Full=Kidney-derived aspartic protease-like protein;
Short=KAP; Flags: Precursor
gi|1906810|dbj|BAA19004.1| kidney-derived aspartic protease-like protein [Mus musculus]
gi|7340352|emb|CAB82907.1| Napsin [Mus musculus]
gi|15928694|gb|AAH14813.1| Napsin A aspartic peptidase [Mus musculus]
gi|74220342|dbj|BAE31398.1| unnamed protein product [Mus musculus]
Length = 419
Score = 93.6 bits (231), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 49/116 (42%), Positives = 66/116 (56%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAIA 118
R++ +GGE++ GG D +V +T+ PV+ YWQ +ES+K+ + C+ C AI
Sbjct: 218 RDSEGSDGGELVLGGSDPAHYVPPLTFIPVTIPAYWQVHMESVKVGTGLSLCAQGCSAIL 277
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSLI GPS+ I LNK IG P NG + C +P V LGG F LT
Sbjct: 278 DTGTSLITGPSEEIRALNKAIGGYPFLNGQYFIQCSKTPTLPPVSFHLGGVWFNLT 333
Score = 68.2 bits (165), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 27/40 (67%), Positives = 32/40 (80%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
QY+GTI LGTPPQ F V+FDTGSSNLW+PS C ++AC
Sbjct: 72 QYFGTIGLGTPPQNFTVVFDTGSSNLWVPSTRCHFFSLAC 111
>gi|12832561|dbj|BAB22158.1| unnamed protein product [Mus musculus]
Length = 419
Score = 93.6 bits (231), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 49/116 (42%), Positives = 66/116 (56%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAIA 118
R++ +GGE++ GG D +V +T+ PV+ YWQ +ES+K+ + C+ C AI
Sbjct: 218 RDSEGSDGGELVLGGSDPAHYVPPLTFIPVTIPAYWQVHMESVKVGTGLSLCAQGCSAIL 277
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSLI GPS+ I LNK IG P NG + C +P V LGG F LT
Sbjct: 278 DTGTSLITGPSEEIRALNKAIGGYPFLNGQYFIQCSKTPTLPPVSFHLGGVWFNLT 333
Score = 68.2 bits (165), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 27/40 (67%), Positives = 32/40 (80%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
QY+GTI LGTPPQ F V+FDTGSSNLW+PS C ++AC
Sbjct: 72 QYFGTIGLGTPPQNFTVVFDTGSSNLWVPSTRCHFFSLAC 111
>gi|367000932|ref|XP_003685201.1| hypothetical protein TPHA_0D01260 [Tetrapisispora phaffii CBS 4417]
gi|357523499|emb|CCE62767.1| hypothetical protein TPHA_0D01260 [Tetrapisispora phaffii CBS 4417]
Length = 419
Score = 93.6 bits (231), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 56/152 (36%), Positives = 83/152 (54%), Gaps = 3/152 (1%)
Query: 24 VIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVG 83
+ +DT + N +P + +I +D++KF + N +++GGE FGG DK KF G
Sbjct: 214 LAYDTIAVNKVVPPFYNAIKQ--GILDENKFAFYLGDTNKDNKSGGEATFGGYDKSKFTG 271
Query: 84 DITYSPVSRKGYWQFGVESIKIEKNVYCSNCQAIA-DTGTSLIIGPSKVIAELNKLIGAV 142
DIT+ PV RK YW+ +SI + V + A DTGTSLI PS + +N IGA
Sbjct: 272 DITWLPVRRKAYWEVKFDSIALGDEVASLDGYGAAIDTGTSLITLPSGLAEVINTQIGAK 331
Query: 143 PLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
+G +DCD D +P++ G NFT++
Sbjct: 332 KSWSGQYTIDCDTRDALPDMTFNFNGYNFTVS 363
Score = 61.6 bits (148), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 28/41 (68%), Positives = 33/41 (80%), Gaps = 2/41 (4%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQYY ISLGTP Q FKVI DTGSSNLW+PS+ C+ ++AC
Sbjct: 102 AQYYTDISLGTPKQNFKVILDTGSSNLWVPSKDCT--SLAC 140
>gi|148690790|gb|EDL22737.1| napsin A aspartic peptidase, isoform CRA_a [Mus musculus]
Length = 393
Score = 93.6 bits (231), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 49/116 (42%), Positives = 66/116 (56%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAIA 118
R++ +GGE++ GG D +V +T+ PV+ YWQ +ES+K+ + C+ C AI
Sbjct: 193 RDSEGSDGGELVLGGSDPAHYVPPLTFIPVTIPAYWQVHMESVKVGTGLSLCAQGCSAIL 252
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSLI GPS+ I LNK IG P NG + C +P V LGG F LT
Sbjct: 253 DTGTSLITGPSEEIRALNKAIGGYPFLNGQYFIQCSKTPTLPPVSFHLGGVWFNLT 308
Score = 68.2 bits (165), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 27/40 (67%), Positives = 32/40 (80%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
QY+GTI LGTPPQ F V+FDTGSSNLW+PS C ++AC
Sbjct: 47 QYFGTIGLGTPPQNFTVVFDTGSSNLWVPSTRCHFFSLAC 86
>gi|74199699|dbj|BAE41511.1| unnamed protein product [Mus musculus]
Length = 419
Score = 93.6 bits (231), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 49/116 (42%), Positives = 66/116 (56%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAIA 118
R++ +GGE++ GG D +V +T+ PV+ YWQ +ES+K+ + C+ C AI
Sbjct: 218 RDSEGSDGGELVLGGSDPAHYVPPLTFIPVTIPAYWQVHMESVKVGTGLSLCAQGCSAIL 277
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSLI GPS+ I LNK IG P NG + C +P V LGG F LT
Sbjct: 278 DTGTSLITGPSEEIRALNKAIGGYPFLNGQYFIQCSKTPTLPPVSFHLGGVWFNLT 333
Score = 68.2 bits (165), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 27/40 (67%), Positives = 32/40 (80%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
QY+GTI LGTPPQ F V+FDTGSSNLW+PS C ++AC
Sbjct: 72 QYFGTIGLGTPPQNFTVVFDTGSSNLWVPSTRCHFFSLAC 111
>gi|320163747|gb|EFW40646.1| cathepsin D [Capsaspora owczarzaki ATCC 30864]
Length = 382
Score = 93.6 bits (231), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 45/115 (39%), Positives = 69/115 (60%), Gaps = 1/115 (0%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYC-SNCQAIAD 119
R+ + +GGEI FG +D + G I Y+PV+R+GYWQF + ++ + YC S CQAIAD
Sbjct: 207 RDPTAADGGEIDFGAIDDSHYTGPILYTPVTRQGYWQFALGAVTVSGKNYCASGCQAIAD 266
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
+GTSL++GP+ + + GA G +DC + +PN+ + G+ F LT
Sbjct: 267 SGTSLLVGPTDAVTAIAAAAGATKNIAGEYTLDCSKIASLPNLVFTISGQQFALT 321
Score = 67.4 bits (163), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 28/49 (57%), Positives = 40/49 (81%), Gaps = 1/49 (2%)
Query: 2 RSSYQ-AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSV 49
+ +YQ AQYYG I++GTP Q+F V+FDTGS+NLW+PS+ C + +IAC +
Sbjct: 54 QHNYQDAQYYGDITIGTPGQKFTVVFDTGSANLWVPSKKCPVTDIACQL 102
>gi|302761356|ref|XP_002964100.1| hypothetical protein SELMODRAFT_438819 [Selaginella moellendorffii]
gi|300167829|gb|EFJ34433.1| hypothetical protein SELMODRAFT_438819 [Selaginella moellendorffii]
Length = 503
Score = 93.6 bits (231), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 45/95 (47%), Positives = 70/95 (73%), Gaps = 4/95 (4%)
Query: 61 RNASDE-NGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCSN-CQA 116
R+ASDE +GGEI+FGGV+KD+F G Y+PV+R+GYWQF + + + + +C+ C A
Sbjct: 230 RDASDEEDGGEIVFGGVNKDRFKGKHVYTPVTREGYWQFNMGDVVVDGQSTGFCAKGCAA 289
Query: 117 IADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKV 151
IAD+GTSL++GP+ ++A++N+ IGA L + K+
Sbjct: 290 IADSGTSLLVGPTGIVAQINQAIGATGLVSEECKM 324
Score = 65.5 bits (158), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 29/51 (56%), Positives = 37/51 (72%), Gaps = 1/51 (1%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDK 51
+R+ AQYYG I +G+PPQ F VIFDTGSSNLW+PS C I + AC + +
Sbjct: 78 LRNYLDAQYYGEIGIGSPPQVFTVIFDTGSSNLWVPSSRC-IFSPACWLHR 127
Score = 44.3 bits (103), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 24/75 (32%), Positives = 39/75 (52%), Gaps = 1/75 (1%)
Query: 101 ESIKIEKNVYCSNCQ-AIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKM 159
E+ K+ +V CS C+ A+ L P+K +LN+L +P NG + V+C + +
Sbjct: 369 ENQKVGDDVLCSVCEMAVVSVQNQLRQNPTKQQIDLNQLCERLPSPNGQSLVECAKISSL 428
Query: 160 PNVDIILGGKNFTLT 174
PNV + + F LT
Sbjct: 429 PNVSFTIANQMFELT 443
>gi|67003878|gb|AAY60835.1| pepsinogen A [Melanogrammus aeglefinus]
Length = 269
Score = 93.6 bits (231), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 48/118 (40%), Positives = 68/118 (57%), Gaps = 2/118 (1%)
Query: 58 SHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN--CQ 115
S + + + G ++FGGVD+ G IT+ P++ YWQ ++S+ I N N CQ
Sbjct: 121 SVYLSRGGQEGSVVVFGGVDESLMSGQITWIPLTSATYWQIKMDSVTINGNPVACNGGCQ 180
Query: 116 AIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
AI DTGTSLI+GPS I+ +N +GA G A V+C N+ MP V L G +FT+
Sbjct: 181 AIIDTGTSLIVGPSSDISNMNSWVGASTNQYGDATVNCQNIGSMPEVVFTLSGHSFTV 238
>gi|327270926|ref|XP_003220239.1| PREDICTED: embryonic pepsinogen-like [Anolis carolinensis]
Length = 382
Score = 93.2 bits (230), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 48/111 (43%), Positives = 67/111 (60%), Gaps = 2/111 (1%)
Query: 68 GGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-KNVYCSN-CQAIADTGTSLI 125
G I FGG+D+ + G I + PV+ +GYWQ ++SI + + + CS+ CQAI DTGTSL+
Sbjct: 222 GSIITFGGIDESYYTGSINWIPVTEQGYWQIELDSILVNGEAIACSDGCQAIVDTGTSLV 281
Query: 126 IGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLTRT 176
GP I+ L IGA P G ++C NL MP+V ++ G F LT T
Sbjct: 282 AGPPSDISNLQNAIGATPGQYGQYDINCGNLGNMPDVVFVINGIQFPLTPT 332
Score = 63.9 bits (154), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 25/35 (71%), Positives = 31/35 (88%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCS 41
+YYGTI++GTPPQ F V+FDTGSSNLW+PS +CS
Sbjct: 73 TEYYGTINIGTPPQAFTVVFDTGSSNLWVPSTYCS 107
>gi|82469881|gb|ABB77194.1| cathepsin D2-like protein [Schistosoma mansoni]
Length = 401
Score = 93.2 bits (230), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 47/126 (37%), Positives = 75/126 (59%), Gaps = 2/126 (1%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKN 108
VDK F + + N GEIMFGG+D + G++TYS V + YW ++ I I
Sbjct: 212 VDKPLFAVYLNL-NEDKTTSGEIMFGGIDDRYYTGNLTYSDVVSEEYWMINIDGISINGE 270
Query: 109 VYC-SNCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILG 167
++C S A+ DTGT+LI GP++ I +NK +G++ ++N VDC+ + K+P ++I +
Sbjct: 271 IFCPSGSTALIDTGTALISGPTEKINNINKYLGSIQISNNEYIVDCNKIQKLPIIEIKIN 330
Query: 168 GKNFTL 173
GK+ L
Sbjct: 331 GKSIQL 336
Score = 62.4 bits (150), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 27/42 (64%), Positives = 32/42 (76%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSV 49
+YYG IS+GTPPQ F VIFDTGS LWIPS+ C N+AC +
Sbjct: 76 EYYGEISIGTPPQIFHVIFDTGSPYLWIPSKKCDPSNLACQL 117
>gi|224460527|gb|ACN43675.1| cathepsin D [Paralichthys olivaceus]
Length = 396
Score = 93.2 bits (230), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 48/116 (41%), Positives = 66/116 (56%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAIA 118
RN GGE++ GG D + GD Y V+R+ YWQ + + + C + C+AI
Sbjct: 221 RNPDMAPGGELLLGGTDPKYYSGDFNYVNVTRQAYWQIHMGGMGAGSQLTLCKDGCEAIV 280
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSLI GPS + L K IGAVPL G V CD + +P + LGG++++LT
Sbjct: 281 DTGTSLITGPSAEVKALQKAIGAVPLIQGEYMVSCDKIPSLPVITFNLGGQSYSLT 336
Score = 79.3 bits (194), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 33/47 (70%), Positives = 40/47 (85%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+++ AQYYG I+LGTPPQ F V+FDTGSSNLW+PS HCSIL+IAC
Sbjct: 68 LKNYLDAQYYGDIALGTPPQTFSVVFDTGSSNLWVPSVHCSILDIAC 114
>gi|403299330|ref|XP_003940442.1| PREDICTED: napsin-A-like [Saimiri boliviensis boliviensis]
Length = 425
Score = 93.2 bits (230), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 44/116 (37%), Positives = 67/116 (57%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCSNCQAIA 118
R+ +GGE++ GG D ++ +T+ PV+ YWQ +E +K+ E + C A+
Sbjct: 228 RDPEVADGGELVLGGSDPAHYIPPLTFVPVTVPAYWQIHMERVKVGSELTLCARGCAAVL 287
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGT +IIGP++ I L+K IG +PL G + C + K+P V + LGG F LT
Sbjct: 288 DTGTPVIIGPAEEIRALHKAIGGLPLLAGEYIIRCSEIPKLPTVSLFLGGVWFNLT 343
Score = 63.5 bits (153), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 25/40 (62%), Positives = 31/40 (77%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
QY+G I LGTPPQ F V FDTGSSNLW+PS+ C + ++ C
Sbjct: 82 QYFGEIGLGTPPQNFTVAFDTGSSNLWVPSKRCHLSSVPC 121
>gi|149757990|ref|XP_001490885.1| PREDICTED: napsin-A [Equus caballus]
Length = 401
Score = 93.2 bits (230), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 44/116 (37%), Positives = 69/116 (59%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAIA 118
R+ +GGE++ GG D ++ +T+ PV+ YWQ ++ +K+ + C+ C AI
Sbjct: 218 RDPEAADGGELVLGGSDPSHYIPPLTFVPVTIPAYWQIHMKRVKVGTGLTLCAQGCAAIL 277
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSLI GP++ I L+ IG +PL G + C + ++P V ++LGG FTLT
Sbjct: 278 DTGTSLITGPTEEIRALHAAIGGIPLLAGEYLLQCSTIPRLPPVSLLLGGTWFTLT 333
Score = 67.0 bits (162), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 27/41 (65%), Positives = 31/41 (75%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQYYG I LGTPPQ F V+FDTGSSNLW+PS C ++ C
Sbjct: 71 AQYYGEIGLGTPPQNFSVLFDTGSSNLWVPSVRCHFFSLPC 111
>gi|426244096|ref|XP_004015868.1| PREDICTED: napsin-A [Ovis aries]
Length = 443
Score = 93.2 bits (230), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 46/116 (39%), Positives = 67/116 (57%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAIA 118
RN +GGE++ GG D ++ +T+ PV+ +WQ +E +++ + C+ C AI
Sbjct: 223 RNPEAADGGELVLGGSDPAHYIPPLTFVPVTIPAFWQIHMERVQVGTGLTLCARGCAAIL 282
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSLI GP++ I L K IGAVPL G + C + +P V +LGG F LT
Sbjct: 283 DTGTSLITGPTEEIRALQKAIGAVPLLMGEYYIKCSKIPTLPPVSFLLGGVWFNLT 338
Score = 67.4 bits (163), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 27/41 (65%), Positives = 31/41 (75%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQYYG I LGTPPQ F V+FDTGSSNLW+PS C ++ C
Sbjct: 76 AQYYGEIGLGTPPQNFSVVFDTGSSNLWVPSVRCRFFSLPC 116
>gi|213688|gb|AAA49530.1| pepsinogen [Rana catesbeiana]
Length = 384
Score = 93.2 bits (230), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 43/112 (38%), Positives = 65/112 (58%), Gaps = 3/112 (2%)
Query: 65 DENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCSN-CQAIADTG 121
+NGGE+ FGGVD++ + G I ++PV+ + YWQ G++ + + +CS CQ I DTG
Sbjct: 214 SQNGGEVAFGGVDQNYYSGQIYWTPVTSETYWQIGIQGFSVNGQATGWCSQGCQGIVDTG 273
Query: 122 TSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
TSL+ P V + L + IGA NG V C N+ +P + + G +F L
Sbjct: 274 TSLLTAPQSVFSSLMQSIGAQQDQNGQYAVSCSNIQSLPTISFTISGVSFPL 325
Score = 59.7 bits (143), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 24/32 (75%), Positives = 28/32 (87%)
Query: 9 YYGTISLGTPPQEFKVIFDTGSSNLWIPSQHC 40
YYG IS+GTPPQ F V+FDTGSSNLW+PS +C
Sbjct: 67 YYGEISIGTPPQNFLVLFDTGSSNLWVPSTYC 98
>gi|307167891|gb|EFN61280.1| Lysosomal aspartic protease [Camponotus floridanus]
Length = 431
Score = 93.2 bits (230), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 49/132 (37%), Positives = 74/132 (56%), Gaps = 11/132 (8%)
Query: 57 HSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVY------ 110
++ +R+ S + GGE++ GG D + G Y V++KGYWQF ++ ++I + +
Sbjct: 248 YNVYRDPSAKVGGELILGGSDPAYYTGHFKYVDVTKKGYWQFLMDRVRITRTKFNKGRTL 307
Query: 111 -CSNCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAK----VDCDNLDKMPNVDII 165
CQAIADTG SLI+GP+ I +NK IGA + V+C+ + K+P + I
Sbjct: 308 CMGGCQAIADTGMSLIVGPTSEIDIINKYIGANKTTDSSGNIINVVNCNTIHKLPIIRFI 367
Query: 166 LGGKNFTLTRTN 177
LGGK F L N
Sbjct: 368 LGGKRFPLNSNN 379
Score = 72.8 bits (177), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 29/43 (67%), Positives = 38/43 (88%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSV 49
++YYG I++GTPPQ+FKV+FDTGSSNLWIPS CS N+AC++
Sbjct: 62 SEYYGNITIGTPPQQFKVLFDTGSSNLWIPSILCSTANVACAL 104
>gi|196123668|gb|ACG70181.1| cathepsin D-like protein [Homarus americanus]
Length = 386
Score = 92.8 bits (229), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 46/118 (38%), Positives = 69/118 (58%), Gaps = 5/118 (4%)
Query: 61 RNASDEN---GGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV--YCSNCQ 115
+ SD N GGE++ GG D + + G+ Y PVS+ GYWQ E+IK+ NV +C+ C+
Sbjct: 208 HDVSDMNETLGGELVLGGSDPNHYEGEFHYVPVSKVGYWQVTAEAIKVGDNVTGFCNPCE 267
Query: 116 AIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
AI DTGTSLI GP+ + E+ ++G G + C + +MP L GK+F++
Sbjct: 268 AIVDTGTSLIAGPNAEVKEIVHMLGGYGFIAGEYLISCHKVPEMPEFTFTLNGKDFSI 325
Score = 67.8 bits (164), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 30/40 (75%), Positives = 34/40 (85%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIA 46
AQYYG I++GTP Q F VIFDTGSSNLWIPS+ C ILN+A
Sbjct: 61 AQYYGPITIGTPGQGFDVIFDTGSSNLWIPSEKCFILNLA 100
>gi|20129385|ref|NP_609235.1| beta-site APP-cleaving enzyme [Drosophila melanogaster]
gi|7297427|gb|AAF52686.1| beta-site APP-cleaving enzyme [Drosophila melanogaster]
gi|54650878|gb|AAV37018.1| GH11417p [Drosophila melanogaster]
gi|220951556|gb|ACL88321.1| CG13095-PA [synthetic construct]
gi|220959834|gb|ACL92460.1| CG13095-PA [synthetic construct]
Length = 372
Score = 92.8 bits (229), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 43/113 (38%), Positives = 65/113 (57%), Gaps = 4/113 (3%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSNCQAIADT 120
R+ + GGE++FGG D + G +TY P+S +GYWQF + I+ C +CQAIADT
Sbjct: 212 RDGTSTMGGELIFGGSDASLYSGALTYVPISEQGYWQFTMAGSSIDGYSLCDDCQAIADT 271
Query: 121 GTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
GTSLI+ P L++++ +DC ++ +P+V +GG NF L
Sbjct: 272 GTSLIVAPYNAYITLSEILN----VGEDGYLDCSSVSSLPDVTFNIGGTNFVL 320
Score = 58.9 bits (141), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 24/40 (60%), Positives = 30/40 (75%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHC 40
+ +S YYG IS+GTP Q FKV+FD+GSSNLW+PS C
Sbjct: 61 LSNSMNMAYYGAISIGTPAQSFKVLFDSGSSNLWVPSNTC 100
>gi|195046656|ref|XP_001992194.1| GH24344 [Drosophila grimshawi]
gi|193893035|gb|EDV91901.1| GH24344 [Drosophila grimshawi]
Length = 373
Score = 92.8 bits (229), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 49/125 (39%), Positives = 69/125 (55%), Gaps = 4/125 (3%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKN 108
VD F + S +GGE++FGG D K+ GD++Y P+S +GYWQF V+S I+
Sbjct: 201 VDDSVFSFYLARDGTSTTDGGELIFGGSDPAKYTGDLSYVPISEQGYWQFAVDSATIDGQ 260
Query: 109 VYCSNCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGG 168
+ QAIADTGTSL++ S LN L+ + VDC +D MP + +GG
Sbjct: 261 TLGESFQAIADTGTSLLVVSSDAYDILNNLLN----VDEDGLVDCSTVDSMPVLTFTIGG 316
Query: 169 KNFTL 173
K + L
Sbjct: 317 KQYPL 321
Score = 70.1 bits (170), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 28/47 (59%), Positives = 36/47 (76%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+ +S YYG I++GTPPQ FKV+FD+GSSNLW+PS C L+IAC
Sbjct: 59 LSNSINMAYYGAITIGTPPQSFKVLFDSGSSNLWVPSSRCFFLDIAC 105
>gi|397485038|ref|XP_003813670.1| PREDICTED: napsin-A-like [Pan paniscus]
Length = 420
Score = 92.8 bits (229), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 45/116 (38%), Positives = 68/116 (58%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAIA 118
R+ + +GGE++ GG D ++ +T+ PV+ YWQ +E +K+ + C+ C AI
Sbjct: 223 RDPEEPDGGELVLGGSDPAHYIPPLTFVPVTVPAYWQIHMERVKVGPGLTLCAQGCAAIL 282
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSLI GP++ I L+ IG +PL G + C + K+P V +LGG F LT
Sbjct: 283 DTGTSLITGPTEEIRALHAAIGGIPLLAGEYIILCSEIPKLPAVSFLLGGVWFNLT 338
Score = 64.3 bits (155), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 27/46 (58%), Positives = 34/46 (73%), Gaps = 1/46 (2%)
Query: 3 SSYQ-AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
S+Y+ QY+G I LGTPPQ F V FDTGSSNLW+PS+ C ++ C
Sbjct: 71 SNYRDVQYFGEIGLGTPPQNFTVAFDTGSSNLWVPSRRCHFFSVPC 116
>gi|9858101|gb|AAG00993.1|AF286865_1 heme-binding aspartic proteinase, partial [Rhipicephalus microplus]
Length = 354
Score = 92.8 bits (229), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 42/114 (36%), Positives = 68/114 (59%), Gaps = 1/114 (0%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYC-SNCQAIAD 119
+ S++NGGEI FGG++ ++ G I Y PVS+ +WQ +++I ++ C C + D
Sbjct: 181 KQPSEQNGGEIYFGGINAQRYTGAIHYVPVSQAAHWQVVMDNINVQGTTLCVGGCPTVVD 240
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
+GTS + GPS + LN++IGA A G +V+C + +P + L GK+F L
Sbjct: 241 SGTSFLSGPSADVETLNRVIGATKTAAGYFEVNCATISSLPPITFNLNGKSFPL 294
Score = 56.6 bits (135), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 25/47 (53%), Positives = 35/47 (74%), Gaps = 2/47 (4%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKF 54
Q+YG I++GTPPQ FK++ DTGSSN W+PS +C ++AC D K+
Sbjct: 36 QFYGIITIGTPPQSFKLLMDTGSSNFWVPSINCD-QSMACR-DHAKY 80
>gi|302761358|ref|XP_002964101.1| hypothetical protein SELMODRAFT_166719 [Selaginella moellendorffii]
gi|300167830|gb|EFJ34434.1| hypothetical protein SELMODRAFT_166719 [Selaginella moellendorffii]
Length = 505
Score = 92.8 bits (229), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 45/95 (47%), Positives = 69/95 (72%), Gaps = 4/95 (4%)
Query: 61 RNASDE-NGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCSN-CQA 116
R+ASDE +GGEI+FGGV+KD+F G Y+PV+R+GYWQF + + + + +C+ C A
Sbjct: 230 RDASDEEDGGEIVFGGVNKDRFKGKHVYTPVTREGYWQFNMGDVAVDGQSTGFCAKGCAA 289
Query: 117 IADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKV 151
IAD+GTSL+ GP+ ++A++N+ IGA L + K+
Sbjct: 290 IADSGTSLLAGPTGIVAQINQAIGATGLVSEECKM 324
Score = 65.1 bits (157), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 29/47 (61%), Positives = 35/47 (74%), Gaps = 1/47 (2%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+R+ AQYYG I +G+PPQ F VIFDTGSSNLW+PS C I + AC
Sbjct: 78 LRNYLDAQYYGEIGIGSPPQVFTVIFDTGSSNLWVPSSRC-IFSPAC 123
Score = 37.0 bits (84), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 24/77 (31%), Positives = 38/77 (49%), Gaps = 3/77 (3%)
Query: 101 ESIKIEKNVYCSNCQ-AIADTGTSLIIGPSKVIAE--LNKLIGAVPLANGPAKVDCDNLD 157
E+ K+ +V CS C+ A+ L +K E LN+L +P NG + V+C +
Sbjct: 369 ENQKVGDDVLCSVCEMAVVWVQNQLRQNRTKQQIEDYLNQLCERLPSPNGQSVVECAKIS 428
Query: 158 KMPNVDIILGGKNFTLT 174
+PNV + + F LT
Sbjct: 429 SLPNVSFTIANQTFELT 445
>gi|195350351|ref|XP_002041704.1| GM16818 [Drosophila sechellia]
gi|194123477|gb|EDW45520.1| GM16818 [Drosophila sechellia]
Length = 419
Score = 92.4 bits (228), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 49/116 (42%), Positives = 69/116 (59%), Gaps = 3/116 (2%)
Query: 61 RNASD-ENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSNCQAIAD 119
RNASD NGG ++ GG D + G +TY PVS+ G+WQ V ++I CSNCQAI D
Sbjct: 238 RNASDANNGGVLLLGGSDPTLYRGCLTYVPVSKVGFWQITVGQVEIGSKKLCSNCQAIFD 297
Query: 120 TGTSLIIGPSKVIAELNKLIG--AVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
GTSLII P + +NK +G +G +DC+ + ++P + +G K+FTL
Sbjct: 298 VGTSLIIVPCPALKIINKKLGIKETDRKDGAYIIDCNKVSRLPKIVFNIGWKDFTL 353
>gi|114678580|ref|XP_524345.2| PREDICTED: napsin-A isoform 4 [Pan troglodytes]
Length = 420
Score = 92.4 bits (228), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 45/116 (38%), Positives = 68/116 (58%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAIA 118
R+ + +GGE++ GG D ++ +T+ PV+ YWQ +E +K+ + C+ C AI
Sbjct: 223 RDPEEPDGGELVLGGSDPAHYIPPLTFVPVTVPAYWQIHMERVKVGPGLTLCAQGCAAIL 282
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSLI GP++ I L+ IG +PL G + C + K+P V +LGG F LT
Sbjct: 283 DTGTSLITGPTEEIRALHAAIGGIPLLAGEYIILCSEIPKLPAVSFLLGGVWFNLT 338
Score = 64.3 bits (155), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 27/46 (58%), Positives = 34/46 (73%), Gaps = 1/46 (2%)
Query: 3 SSYQ-AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
S+Y+ QY+G I LGTPPQ F V FDTGSSNLW+PS+ C ++ C
Sbjct: 71 SNYRDVQYFGEIGLGTPPQNFTVAFDTGSSNLWVPSRRCHFFSVPC 116
>gi|33347411|gb|AAQ15288.1| aspartic protease [Pyrus pyrifolia]
Length = 199
Score = 92.4 bits (228), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 44/84 (52%), Positives = 61/84 (72%), Gaps = 3/84 (3%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV--YCSN-CQAI 117
RNA +E GGEI+FGGVD + + G TY PV++KGYWQF + + I+ +C++ C AI
Sbjct: 98 RNADEEEGGEIVFGGVDPNHYKGKHTYVPVTQKGYWQFDMGDVMIDGQTTGFCADGCSAI 157
Query: 118 ADTGTSLIIGPSKVIAELNKLIGA 141
AD+GTSL++GP+ +I ELN IGA
Sbjct: 158 ADSGTSLLVGPTTIITELNHAIGA 181
>gi|1326165|gb|AAB03108.1| aspartic protease [Brassica napus]
Length = 506
Score = 92.4 bits (228), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 48/98 (48%), Positives = 63/98 (64%), Gaps = 3/98 (3%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYC-SNCQAI 117
RNA DE GGE++FGGVD + F G+ TY PV++KGYWQF + + I YC S C AI
Sbjct: 226 RNAEDEEGGELVFGGVDPNHFKGEHTYVPVTQKGYWQFDMGDVLIGGAPTGYCESGCSAI 285
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDN 155
AD+GTSL+ GP+ VI +N IGA + + K+ D
Sbjct: 286 ADSGTSLLAGPTTVITMINHAIGAAGVVSQQCKIVVDQ 323
Score = 67.0 bits (162), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 28/47 (59%), Positives = 37/47 (78%), Gaps = 1/47 (2%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+++ AQYYG I++GTPPQ+F V+FDTGSSNLW+PS C +IAC
Sbjct: 74 LKNYLDAQYYGEIAIGTPPQKFTVVFDTGSSNLWVPSSKC-YFSIAC 119
Score = 36.2 bits (82), Expect = 5.9, Method: Compositional matrix adjust.
Identities = 16/45 (35%), Positives = 24/45 (53%)
Query: 130 KVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
+++ +N L +P G + VDC L MP V + +GGK F L
Sbjct: 402 RILDYINDLCERLPSPMGESAVDCAQLSTMPTVSLTIGGKVFDLA 446
>gi|12231172|dbj|BAB20969.1| aspartic proteinase 1 [Nepenthes alata]
Length = 514
Score = 92.4 bits (228), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 43/84 (51%), Positives = 62/84 (73%), Gaps = 3/84 (3%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCS-NCQAI 117
RNA++E GGEI+FGGVD + + G+ T+ PV+ KGYWQF ++ + + E YCS C AI
Sbjct: 234 RNATEEEGGEIVFGGVDPNHYKGEHTFVPVTHKGYWQFDMDDVLVGGETTGYCSGGCSAI 293
Query: 118 ADTGTSLIIGPSKVIAELNKLIGA 141
AD+GTSL+ GP+ ++A++N IGA
Sbjct: 294 ADSGTSLLAGPTTIVAQINHAIGA 317
Score = 63.9 bits (154), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 27/47 (57%), Positives = 36/47 (76%), Gaps = 1/47 (2%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+++ AQY+G I +GTPPQ+F +IFDTGSSNLW+PS C +IAC
Sbjct: 82 LKNYMNAQYFGEIGIGTPPQKFTLIFDTGSSNLWVPSAKC-YFSIAC 127
Score = 36.6 bits (83), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 16/44 (36%), Positives = 25/44 (56%)
Query: 130 KVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
+++ +N+L +P G + VDC +L MP V +GGK F L
Sbjct: 410 RILNYVNELCNRLPSPMGESAVDCSSLSSMPGVSFTVGGKVFDL 453
>gi|33347413|gb|AAQ15289.1| aspartic protease [Pyrus pyrifolia]
Length = 199
Score = 92.4 bits (228), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 44/84 (52%), Positives = 61/84 (72%), Gaps = 3/84 (3%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV--YCSN-CQAI 117
RNA +E GGEI+FGGVD + + G TY PV++KGYWQF + + I+ +C++ C AI
Sbjct: 98 RNADEEEGGEIVFGGVDPNHYKGKHTYVPVTQKGYWQFDMGDVMIDGQTTGFCADGCSAI 157
Query: 118 ADTGTSLIIGPSKVIAELNKLIGA 141
AD+GTSL++GP+ +I ELN IGA
Sbjct: 158 ADSGTSLLVGPTTIITELNHAIGA 181
>gi|195339269|ref|XP_002036242.1| GM12869 [Drosophila sechellia]
gi|194130122|gb|EDW52165.1| GM12869 [Drosophila sechellia]
Length = 372
Score = 92.4 bits (228), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 43/113 (38%), Positives = 65/113 (57%), Gaps = 4/113 (3%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSNCQAIADT 120
R+ + GGE++FGG D + G +TY P+S +GYWQF + I+ C +CQAIADT
Sbjct: 212 RDGTSTMGGELIFGGSDSSLYSGALTYVPISEQGYWQFTMAGSSIDGYSLCDDCQAIADT 271
Query: 121 GTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
GTSLI+ P L++++ +DC ++ +P+V +GG NF L
Sbjct: 272 GTSLIVAPYYAYITLSEILN----VGEDGYLDCSSVSSLPDVTFNIGGTNFVL 320
Score = 58.9 bits (141), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 24/40 (60%), Positives = 30/40 (75%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHC 40
+ +S YYG IS+GTP Q FKV+FD+GSSNLW+PS C
Sbjct: 61 LSNSMNMAYYGAISIGTPAQSFKVLFDSGSSNLWVPSNTC 100
>gi|5921649|gb|AAD56283.1|AF156787_1 pepsinogen A form IIa [Pseudopleuronectes americanus]
Length = 378
Score = 92.4 bits (228), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 51/127 (40%), Positives = 78/127 (61%), Gaps = 6/127 (4%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKN 108
VD+D F S + + + + G + FGGVD + + G IT+ P+S + YWQ V+S+ +
Sbjct: 203 VDQDLF---SVYLSPNAQQGSVVTFGGVDPNHYNGAITWIPLSSELYWQITVDSVTVNGQ 259
Query: 109 VY-CS-NCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
V CS CQAI DTGTSLI+GP I+ +N +GA NG V+C+++ +MP+V +
Sbjct: 260 VVACSGGCQAIVDTGTSLIVGPQSSISNINNYVGATS-QNGDYVVNCNSISQMPDVIFHI 318
Query: 167 GGKNFTL 173
G+ FT+
Sbjct: 319 HGQQFTI 325
Score = 66.6 bits (161), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 34/63 (53%), Positives = 45/63 (71%), Gaps = 5/63 (7%)
Query: 9 YYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASDENG 68
YYG IS+GTPPQ FKV+FDTGSSNLWIPS +CS + AC+ + DK+ + + +NG
Sbjct: 72 YYGIISIGTPPQSFKVVFDTGSSNLWIPSIYCS--SAACN-NHDKY--NPGLSSTYKKNG 126
Query: 69 GEI 71
G +
Sbjct: 127 GSL 129
>gi|4758754|ref|NP_004842.1| napsin-A preproprotein [Homo sapiens]
gi|6225749|sp|O96009.1|NAPSA_HUMAN RecName: Full=Napsin-A; AltName: Full=Aspartyl protease 4;
Short=ASP4; Short=Asp 4; AltName: Full=Napsin-1;
AltName: Full=TA01/TA02; Flags: Precursor
gi|4154287|gb|AAD04917.1| napsin A [Homo sapiens]
gi|4235425|gb|AAD13215.1| napsin 1 precursor [Homo sapiens]
gi|6561818|gb|AAF17081.1| aspartyl protease 4 [Homo sapiens]
gi|119592253|gb|EAW71847.1| napsin A aspartic peptidase, isoform CRA_a [Homo sapiens]
Length = 420
Score = 92.4 bits (228), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 45/116 (38%), Positives = 68/116 (58%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAIA 118
R+ + +GGE++ GG D ++ +T+ PV+ YWQ +E +K+ + C+ C AI
Sbjct: 223 RDPEEPDGGELVLGGSDPAHYIPPLTFVPVTVPAYWQIHMERVKVGPGLTLCAKGCAAIL 282
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSLI GP++ I L+ IG +PL G + C + K+P V +LGG F LT
Sbjct: 283 DTGTSLITGPTEEIRALHAAIGGIPLLAGEYIILCSEIPKLPAVSFLLGGVWFNLT 338
Score = 64.3 bits (155), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 27/46 (58%), Positives = 34/46 (73%), Gaps = 1/46 (2%)
Query: 3 SSYQ-AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
S+Y+ QY+G I LGTPPQ F V FDTGSSNLW+PS+ C ++ C
Sbjct: 71 SNYRDVQYFGEIGLGTPPQNFTVAFDTGSSNLWVPSRRCHFFSVPC 116
>gi|195159708|ref|XP_002020720.1| GL15705 [Drosophila persimilis]
gi|194117670|gb|EDW39713.1| GL15705 [Drosophila persimilis]
Length = 408
Score = 92.4 bits (228), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 44/114 (38%), Positives = 70/114 (61%), Gaps = 4/114 (3%)
Query: 62 NASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSNCQAIADTG 121
NA+D GGE++ GG+D F G +TY PVS++GYWQF + S + +C++CQAI D G
Sbjct: 243 NATD--GGELVLGGIDATLFSGCLTYVPVSQQGYWQFVMTSAVLGGKTFCTHCQAILDVG 300
Query: 122 TSLIIGPSKVIAELNKLIGAV--PLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
TSL++ P+ I ++N+L+ + A+G V+C + +P + + K F L
Sbjct: 301 TSLLVAPTAAIKKINQLLAVLNPKDASGVFLVNCSTIASLPTMVFTIARKEFPL 354
Score = 55.5 bits (132), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 24/67 (35%), Positives = 40/67 (59%), Gaps = 7/67 (10%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHF 60
+ +++ +YY +++GTPPQEF ++ DTGSSNLW+PS C +C V H+ +
Sbjct: 81 LGNAFNTEYYLPVTIGTPPQEFILLIDTGSSNLWVPSSKCPATVKSC-------VSHNQY 133
Query: 61 RNASDEN 67
+ S +
Sbjct: 134 DSKSSSS 140
>gi|119592254|gb|EAW71848.1| napsin A aspartic peptidase, isoform CRA_b [Homo sapiens]
Length = 357
Score = 92.4 bits (228), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 45/116 (38%), Positives = 68/116 (58%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAIA 118
R+ + +GGE++ GG D ++ +T+ PV+ YWQ +E +K+ + C+ C AI
Sbjct: 160 RDPEEPDGGELVLGGSDPAHYIPPLTFVPVTVPAYWQIHMERVKVGPGLTLCAKGCAAIL 219
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSLI GP++ I L+ IG +PL G + C + K+P V +LGG F LT
Sbjct: 220 DTGTSLITGPTEEIRALHAAIGGIPLLAGEYIILCSEIPKLPAVSFLLGGVWFNLT 275
Score = 63.9 bits (154), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 27/46 (58%), Positives = 34/46 (73%), Gaps = 1/46 (2%)
Query: 3 SSYQ-AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
S+Y+ QY+G I LGTPPQ F V FDTGSSNLW+PS+ C ++ C
Sbjct: 8 SNYRDVQYFGEIGLGTPPQNFTVAFDTGSSNLWVPSRRCHFFSVPC 53
>gi|6978973|dbj|BAA90785.1| aspartic proteinase family member similar to renin [Mus musculus]
Length = 419
Score = 92.4 bits (228), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 49/116 (42%), Positives = 66/116 (56%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAIA 118
R++ +GGE++ GG D +V +T+ PV+ YWQ +ES+K+ + C+ C AI
Sbjct: 218 RDSEGSDGGELVLGGSDPAHYVPPLTFIPVTIPAYWQVHMESVKVGTGLSLCAQGCSAIL 277
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSLI GPS+ I LNK IG P NG + C +P V LGG F LT
Sbjct: 278 DTGTSLITGPSEEIRALNKAIGGYPFLNGQYFIQCSKTPTLPPVSSHLGGVWFNLT 333
Score = 68.2 bits (165), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 27/40 (67%), Positives = 32/40 (80%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
QY+GTI LGTPPQ F V+FDTGSSNLW+PS C ++AC
Sbjct: 72 QYFGTIGLGTPPQNFTVVFDTGSSNLWVPSTRCHFFSLAC 111
>gi|426389739|ref|XP_004061277.1| PREDICTED: napsin-A-like [Gorilla gorilla gorilla]
Length = 420
Score = 92.4 bits (228), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 45/116 (38%), Positives = 68/116 (58%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAIA 118
R+ + +GGE++ GG D ++ +T+ PV+ YWQ +E +K+ + C+ C AI
Sbjct: 223 RDPEEPDGGELVLGGSDPAHYIPPLTFVPVTVPAYWQIHMERVKVGPGLTLCAQGCAAIL 282
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSLI GP++ I L+ IG +PL G + C + K+P V +LGG F LT
Sbjct: 283 DTGTSLITGPTEEIRALHAAIGGIPLLAGEYIILCSEIPKLPAVSFLLGGVWFNLT 338
Score = 65.1 bits (157), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 27/47 (57%), Positives = 34/47 (72%), Gaps = 1/47 (2%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKF 54
QY+G I LGTPPQ F V FDTGSSNLW+PS+ C ++ C + D+F
Sbjct: 77 QYFGEIGLGTPPQNFTVAFDTGSSNLWVPSRRCHFFSVPCWL-HDRF 122
>gi|17389633|gb|AAH17842.1| Napsin A aspartic peptidase [Homo sapiens]
gi|123982255|gb|ABM82919.1| napsin A aspartic peptidase [synthetic construct]
gi|123997015|gb|ABM86109.1| napsin A aspartic peptidase [synthetic construct]
Length = 420
Score = 92.4 bits (228), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 45/116 (38%), Positives = 68/116 (58%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAIA 118
R+ + +GGE++ GG D ++ +T+ PV+ YWQ +E +K+ + C+ C AI
Sbjct: 223 RDPEEPDGGELVLGGSDPAHYIPPLTFVPVTVPAYWQIHMERVKVGPGLTLCAKGCAAIL 282
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSLI GP++ I L+ IG +PL G + C + K+P V +LGG F LT
Sbjct: 283 DTGTSLITGPTEEIRALHAAIGGIPLLAGEYIILCSEIPKLPAVSFLLGGVWFNLT 338
Score = 64.3 bits (155), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 27/46 (58%), Positives = 34/46 (73%), Gaps = 1/46 (2%)
Query: 3 SSYQ-AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
S+Y+ QY+G I LGTPPQ F V FDTGSSNLW+PS+ C ++ C
Sbjct: 71 SNYRDVQYFGEIGLGTPPQNFTVAFDTGSSNLWVPSRRCHFFSVPC 116
>gi|363743175|ref|XP_003642787.1| PREDICTED: renin-like [Gallus gallus]
Length = 451
Score = 92.4 bits (228), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 44/116 (37%), Positives = 67/116 (57%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI-EKNVYCSN-CQAIA 118
RN+ + GGEI+ GG D + GD Y +SR GYWQ ++ + + + ++C C
Sbjct: 276 RNSPLKPGGEIILGGTDPAYYTGDFHYLSISRSGYWQISMKGVSVGAEMLFCKEGCSVAI 335
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTG S I GP+ ++ L K IGA + G VDC+ + ++PN+ LGGK +TL+
Sbjct: 336 DTGASYITGPAGPVSVLMKAIGAAEMTEGEYVVDCEKVPQLPNISFHLGGKAYTLS 391
Score = 67.8 bits (164), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 29/40 (72%), Positives = 33/40 (82%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
QYYG IS+GTPPQ FKV+FDTGS+NLW+PS CS L AC
Sbjct: 131 QYYGEISIGTPPQTFKVVFDTGSANLWVPSCKCSPLYSAC 170
>gi|301344561|gb|ADK74002.1| pepsinogen A1 precursor [Lateolabrax japonicus]
Length = 370
Score = 92.4 bits (228), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 51/127 (40%), Positives = 79/127 (62%), Gaps = 6/127 (4%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKN 108
V++D F S + +++ + G + FGG+D + + G IT+ P+S + YWQ V+S+ +
Sbjct: 203 VNQDLF---SVYLSSNSQQGSVVTFGGIDPNHYYGPITWIPLSNELYWQITVDSVTVNGQ 259
Query: 109 VY-CSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
V CS+ CQAI DTGTSLI+GP I+ +N +GA NG V+C+N+ +MP V +
Sbjct: 260 VVACSDGCQAIVDTGTSLIVGPQGSISSINSGVGASS-QNGDYVVNCNNIAQMPAVTFHI 318
Query: 167 GGKNFTL 173
G+ FTL
Sbjct: 319 HGQEFTL 325
Score = 65.1 bits (157), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 34/56 (60%), Positives = 42/56 (75%), Gaps = 5/56 (8%)
Query: 9 YYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFV--GHSHFRN 62
YYG IS+GTPPQ FKV+FDTGSSNLW+PS +C+ + AC+ + DKF S FRN
Sbjct: 72 YYGVISIGTPPQSFKVVFDTGSSNLWVPSIYCN--SPACN-NHDKFNPGTSSTFRN 124
>gi|403299328|ref|XP_003940441.1| PREDICTED: napsin-A-like [Saimiri boliviensis boliviensis]
Length = 421
Score = 92.4 bits (228), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 50/126 (39%), Positives = 71/126 (56%), Gaps = 3/126 (2%)
Query: 52 DKFVGHSHF-RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV- 109
DK V +F R+ +GGE++ GG D ++ +T+ PV+ YWQ +E +K+ +
Sbjct: 214 DKPVFSFYFNRDPEKPDGGELVLGGSDPAHYIPPLTFVPVTVPAYWQIHMERVKVGSGLT 273
Query: 110 YCSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGG 168
C+ C AI DTGTSLI GP++ I LN IG PL G + C + K+P V +LGG
Sbjct: 274 LCARGCAAILDTGTSLITGPTEEIQALNAAIGGFPLLAGEYIILCSEIPKLPAVSFLLGG 333
Query: 169 KNFTLT 174
F LT
Sbjct: 334 VWFNLT 339
Score = 63.9 bits (154), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 26/46 (56%), Positives = 34/46 (73%), Gaps = 1/46 (2%)
Query: 3 SSYQ-AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
S+Y+ QY+G I LG PPQ F V+FDTGSSNLW+PS+ C ++ C
Sbjct: 72 SNYRDVQYFGEIGLGMPPQNFTVVFDTGSSNLWVPSRRCHFFSVPC 117
>gi|49019530|emb|CAD80097.1| pepsin A3 [Trematomus bernacchii]
Length = 378
Score = 92.4 bits (228), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 49/127 (38%), Positives = 78/127 (61%), Gaps = 6/127 (4%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKN 108
V+KD F S + ++ + G + FGGVD + + G I++ P+SR+ YWQ V+S+ +
Sbjct: 203 VNKDMF---SVYLSSDAQQGSVVTFGGVDPNHYSGSISWIPLSRELYWQITVDSVTVNGE 259
Query: 109 VYCSN--CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
V N CQAI DTGTSLI+GP I+ +N +GA NG V+C+++ ++P+V +
Sbjct: 260 VVACNGGCQAIVDTGTSLIVGPQSSISNINSKVGATD-HNGDYLVNCNSIAQLPDVIFHI 318
Query: 167 GGKNFTL 173
G+ FT+
Sbjct: 319 HGETFTI 325
Score = 62.8 bits (151), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 34/56 (60%), Positives = 41/56 (73%), Gaps = 5/56 (8%)
Query: 9 YYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKF--VGHSHFRN 62
YYG IS+GTP Q FKV+FDTGSSNLWIPS +C+ + AC+ + DKF S FRN
Sbjct: 72 YYGIISIGTPAQSFKVVFDTGSSNLWIPSIYCN--SPACN-NHDKFNPSTSSTFRN 124
>gi|307103455|gb|EFN51715.1| hypothetical protein CHLNCDRAFT_59800 [Chlorella variabilis]
Length = 523
Score = 92.4 bits (228), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 43/86 (50%), Positives = 60/86 (69%), Gaps = 2/86 (2%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-KNVYCS-NCQAIA 118
R E GGE++ GGVD D FVG+ T+ PV+R+G+WQF ++ +++E +C CQAIA
Sbjct: 216 RKVEGEEGGELVLGGVDPDHFVGEHTWVPVTRRGFWQFKMDGMEVEGGGEFCKGGCQAIA 275
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPL 144
DTGTSL++GP VI +N IGA P+
Sbjct: 276 DTGTSLLVGPPDVIDAINAAIGAEPV 301
Score = 77.0 bits (188), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 40/84 (47%), Positives = 50/84 (59%), Gaps = 5/84 (5%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASDE 66
AQYYG I LG+PPQ F+VIFDTGSSNLW+PS CS L++AC + + SH E
Sbjct: 69 AQYYGEIGLGSPPQSFQVIFDTGSSNLWVPSSKCSYLSVACYLHSKYYAERSH---TYKE 125
Query: 67 NGGE--IMFGGVDKDKFVGDITYS 88
+G E I +G F+ T S
Sbjct: 126 DGREFAIQYGSGQLSGFLSQDTLS 149
>gi|194854115|ref|XP_001968291.1| GG24792 [Drosophila erecta]
gi|190660158|gb|EDV57350.1| GG24792 [Drosophila erecta]
Length = 419
Score = 92.4 bits (228), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 51/117 (43%), Positives = 68/117 (58%), Gaps = 3/117 (2%)
Query: 61 RNASD-ENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSNCQAIAD 119
RNASD NGG ++ GG D + G +TY P+S+ +WQ V + I CSNCQAI D
Sbjct: 237 RNASDASNGGFLLLGGSDPTLYQGCLTYVPLSKVAFWQITVGRVGIGSKNLCSNCQAIFD 296
Query: 120 TGTSLIIGPSKVIAELNKLIG--AVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
GTSLII P + +NK +G NG +DC + +PNV + +G K+FTLT
Sbjct: 297 VGTSLIIVPCPALKIINKKLGLKEADKKNGVYIIDCSKVSSLPNVVLNIGWKDFTLT 353
>gi|291409611|ref|XP_002721072.1| PREDICTED: pepsin-3-like isoform 2 [Oryctolagus cuniculus]
Length = 387
Score = 92.4 bits (228), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 46/125 (36%), Positives = 76/125 (60%), Gaps = 5/125 (4%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-K 107
V +D F S + ++ D++G +MFGGVD + G + + PVS +GYWQ V+SI ++ +
Sbjct: 207 VSEDLF---SVYLSSDDDSGSVVMFGGVDSSYYTGSLNWVPVSYEGYWQITVDSITMDGE 263
Query: 108 NVYCSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
+ C++ CQAI DTGTSL+ GP+ I+ + IGA ++G V C ++ +PN+ +
Sbjct: 264 TIACADGCQAIVDTGTSLLAGPTSAISNIQSYIGASENSDGEMIVSCSSMYSLPNIVFTI 323
Query: 167 GGKNF 171
G +
Sbjct: 324 NGVQY 328
Score = 67.8 bits (164), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 30/42 (71%), Positives = 37/42 (88%), Gaps = 2/42 (4%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSV 49
+Y+GTIS+GTPPQ+F VIFDTGSSNLW+PS +CS + ACSV
Sbjct: 74 EYFGTISIGTPPQDFTVIFDTGSSNLWVPSVYCS--SAACSV 113
>gi|556819|emb|CAA57510.1| cyprosin [Cynara cardunculus]
Length = 509
Score = 92.0 bits (227), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 45/84 (53%), Positives = 59/84 (70%), Gaps = 3/84 (3%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV--YCSN-CQAI 117
RNA +E GGE++FGGVD + F G TY PV+ KGYWQF + + IE +CS+ C AI
Sbjct: 229 RNADEEEGGELVFGGVDPNHFKGKHTYVPVTEKGYWQFDMGDVLIEDKTTGFCSDGCAAI 288
Query: 118 ADTGTSLIIGPSKVIAELNKLIGA 141
AD+GTSL+ GP+ +I E+N IGA
Sbjct: 289 ADSGTSLLAGPTAIITEINHAIGA 312
Score = 64.7 bits (156), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 27/47 (57%), Positives = 36/47 (76%), Gaps = 1/47 (2%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+++ AQYYG I +G+PPQ+F VIFDTGSSNLW+PS C ++AC
Sbjct: 77 LKNYMDAQYYGEIGIGSPPQKFTVIFDTGSSNLWVPSAKC-YFSVAC 122
Score = 38.5 bits (88), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 17/43 (39%), Positives = 26/43 (60%)
Query: 131 VIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
+I +N+L +P G + VDC++L MPN+ +GGK F L
Sbjct: 406 IINYVNELCDRLPSPMGESAVDCNSLSSMPNIAFTIGGKVFEL 448
>gi|195575781|ref|XP_002077755.1| GD23098 [Drosophila simulans]
gi|194189764|gb|EDX03340.1| GD23098 [Drosophila simulans]
Length = 419
Score = 92.0 bits (227), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 49/116 (42%), Positives = 69/116 (59%), Gaps = 3/116 (2%)
Query: 61 RNASD-ENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSNCQAIAD 119
RNASD NGG ++ GG D + G +TY PVS+ G+WQ V ++I CSNCQAI D
Sbjct: 238 RNASDASNGGVLLLGGSDPTLYRGCLTYVPVSKVGFWQITVGQVEIGSKKLCSNCQAIFD 297
Query: 120 TGTSLIIGPSKVIAELNKLIG--AVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
GTSLII P + +NK +G +G +DC+ + ++P + +G K+FTL
Sbjct: 298 VGTSLIIVPCPALKIINKKLGIKETDRKDGAYIIDCNKVPRLPKIVFNIGWKDFTL 353
>gi|283806594|ref|NP_001164550.1| pepsin-3 precursor [Oryctolagus cuniculus]
gi|129783|sp|P27822.1|PEPA3_RABIT RecName: Full=Pepsin-3; AltName: Full=Pepsin A; AltName:
Full=Pepsin III; Flags: Precursor
gi|165598|gb|AAA85370.1| pepsinogen [Oryctolagus cuniculus]
Length = 387
Score = 92.0 bits (227), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 45/125 (36%), Positives = 76/125 (60%), Gaps = 5/125 (4%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-K 107
V +D F S + ++ DE+G +MFGG+D + G + + PVS +GYWQ ++SI ++ +
Sbjct: 207 VSEDLF---SVYLSSDDESGSVVMFGGIDSSYYTGSLNWVPVSYEGYWQITLDSITMDGE 263
Query: 108 NVYCSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
+ C++ CQAI DTGTSL+ GP+ I+ + IGA ++G V C ++ +PN+ +
Sbjct: 264 TIACADSCQAIVDTGTSLLAGPTSAISNIQSYIGASENSDGEMIVSCSSMYSLPNIVFTI 323
Query: 167 GGKNF 171
G +
Sbjct: 324 NGVQY 328
Score = 63.9 bits (154), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 39/85 (45%), Positives = 49/85 (57%), Gaps = 10/85 (11%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSV-------DKDKFVGHSHF 60
+Y+GTI +GTP Q+F VIFDTGSSNLW+PS +CS + ACSV D F S
Sbjct: 74 EYFGTIGIGTPAQDFTVIFDTGSSNLWVPSVYCS--SAACSVHNQFNPEDSSTFQATSES 131
Query: 61 RNASDENGGEIMFGGVDKDKFVGDI 85
+ + G F G D K VG+I
Sbjct: 132 LSITYGTGSMTGFLGYDTVK-VGNI 155
>gi|344312912|emb|CCC33063.1| cathepsin D-1 [Dermanyssus gallinae]
Length = 383
Score = 92.0 bits (227), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 44/117 (37%), Positives = 67/117 (57%), Gaps = 3/117 (2%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKN---VYCSNCQAI 117
R+ G E++ GG+D + GDITY+P++R+ YWQF V+ + + V C+ I
Sbjct: 209 RDPQHPIGSELVLGGIDPKHYKGDITYAPLTRESYWQFRVDKVTLNGKAAPVCQKGCEGI 268
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
ADTGTSL +GP+ +A L + A A G VDC+ +PN++ + G+ F LT
Sbjct: 269 ADTGTSLFVGPTADVAALASQLDAQETAPGLYLVDCEKAGDLPNIEFTIAGRPFELT 325
Score = 69.7 bits (169), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 29/42 (69%), Positives = 35/42 (83%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACS 48
AQYYG I++GTPPQ F+VIFDTGSS+LW+PS C NIAC+
Sbjct: 63 AQYYGPITIGTPPQTFQVIFDTGSSDLWVPSSKCPSSNIACA 104
>gi|330800100|ref|XP_003288077.1| preprocathepsin D [Dictyostelium purpureum]
gi|325081901|gb|EGC35401.1| preprocathepsin D [Dictyostelium purpureum]
Length = 386
Score = 92.0 bits (227), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 46/113 (40%), Positives = 67/113 (59%), Gaps = 4/113 (3%)
Query: 66 ENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCSN--CQAIADTG 121
NGGE+ FG +D K+ G ITY P++ YW+F ++ I + +C + C AIAD+G
Sbjct: 212 SNGGELSFGSIDSSKYTGPITYVPLTNTTYWEFKMDDFAIGGQSAGFCGSQGCPAIADSG 271
Query: 122 TSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
TSLI GP I LN+ +GAV ++ DC ++ +PNV + L G+ F LT
Sbjct: 272 TSLIAGPIDFITALNQKLGAVVISGEAIFPDCSVINTLPNVTVTLAGRQFNLT 324
Score = 68.2 bits (165), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 30/45 (66%), Positives = 36/45 (80%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDK 51
AQYYG I++GTP Q FKV+FDTGSSNLWIPS+ CSI AC + +
Sbjct: 61 AQYYGAITIGTPGQPFKVVFDTGSSNLWIPSKKCSITVPACDLHE 105
>gi|332024604|gb|EGI64802.1| Lysosomal aspartic protease [Acromyrmex echinatior]
Length = 361
Score = 92.0 bits (227), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 51/116 (43%), Positives = 67/116 (57%), Gaps = 3/116 (2%)
Query: 61 RNASDENGGEIM-FGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN-CQAIA 118
RN D + G ++ GG D + G++TY V+ KGYWQF ++ I++E C N CQAIA
Sbjct: 186 RNLLDSSAGSVLILGGSDPALYDGELTYVNVTHKGYWQFTMDKIQMENETLCVNGCQAIA 245
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTG S + GP IA + I A+ NG VDCD + +PNV L GK F LT
Sbjct: 246 DTGFSRLAGPPTDIAIITSRI-AIDDFNGVVYVDCDQISNLPNVTFFLSGKPFVLT 300
Score = 70.1 bits (170), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 28/41 (68%), Positives = 35/41 (85%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQYYG IS+GTP Q FKV+FDTGS+NLW+PS HC++ +I C
Sbjct: 39 AQYYGVISIGTPRQRFKVLFDTGSANLWVPSVHCNLEDITC 79
>gi|49019527|emb|CAD80096.1| pepsin A2 [Trematomus bernacchii]
Length = 373
Score = 92.0 bits (227), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 45/118 (38%), Positives = 68/118 (57%), Gaps = 2/118 (1%)
Query: 58 SHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-KNVYCSN-CQ 115
S + ++ E++FGG+D + G I + P+S YWQ ++S+ I + V C+ CQ
Sbjct: 204 SVYLSSKSAQDSEVVFGGIDSSHYTGQIIWIPLSSDTYWQIKMDSVTINGQTVACAGGCQ 263
Query: 116 AIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
AI DTGTS I+GP+ I+ +N +GA G A+V+C N+ MP V L G FT+
Sbjct: 264 AIIDTGTSQIVGPTSDISNMNSWVGASTNQYGEARVNCQNIQSMPEVTFTLNGNAFTI 321
Score = 63.2 bits (152), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 26/33 (78%), Positives = 29/33 (87%)
Query: 9 YYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCS 41
YYG IS+GTPPQ F VIFDTGSSNLW+PS +CS
Sbjct: 67 YYGVISIGTPPQSFSVIFDTGSSNLWVPSVYCS 99
>gi|284925237|gb|ADC27638.1| MIP16750p [Drosophila melanogaster]
Length = 416
Score = 92.0 bits (227), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 46/116 (39%), Positives = 71/116 (61%), Gaps = 3/116 (2%)
Query: 61 RNASDE-NGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSNCQAIAD 119
RN ++ NGGE++ GG D + G +TY PVS GYWQF + S + +C NC+AI D
Sbjct: 247 RNGTNAINGGELILGGSDSGLYSGCLTYVPVSSAGYWQFTMTSANLNGFQFCENCEAILD 306
Query: 120 TGTSLIIGPSKVIAELNKLIGAV-PLA-NGPAKVDCDNLDKMPNVDIILGGKNFTL 173
GTSLI+ P +V+ +N+++G + P A NG VDC ++ +P++ + + F L
Sbjct: 307 VGTSLIVVPEQVLDTINQILGVLNPTASNGVFLVDCSSIGDLPDIVFTVARRKFPL 362
Score = 51.6 bits (122), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 19/40 (47%), Positives = 27/40 (67%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHC 40
+ + Y +YY T+ G PPQ+ KV+ DTGS+NLW+ S C
Sbjct: 88 LENLYNTEYYTTLGFGNPPQDLKVLIDTGSANLWVLSSKC 127
>gi|22218078|dbj|BAC07516.1| pepsinogen III [Oryctolagus cuniculus]
Length = 387
Score = 92.0 bits (227), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 45/125 (36%), Positives = 76/125 (60%), Gaps = 5/125 (4%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-K 107
V +D F S + ++ DE+G +MFGG+D + G + + PVS +GYWQ ++SI ++ +
Sbjct: 207 VSEDLF---SVYLSSDDESGSVVMFGGIDSSYYTGSLNWVPVSYEGYWQITLDSITMDGE 263
Query: 108 NVYCSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
+ C++ CQAI DTGTSL+ GP+ I+ + IGA ++G V C ++ +PN+ +
Sbjct: 264 TIACADSCQAIVDTGTSLLAGPTSAISNIQSYIGASENSDGEMIVSCSSMYSLPNIVFTI 323
Query: 167 GGKNF 171
G +
Sbjct: 324 NGVQY 328
Score = 63.9 bits (154), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 39/85 (45%), Positives = 49/85 (57%), Gaps = 10/85 (11%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSV-------DKDKFVGHSHF 60
+Y+GTI +GTP Q+F VIFDTGSSNLW+PS +CS + ACSV D F S
Sbjct: 74 EYFGTIGIGTPAQDFTVIFDTGSSNLWVPSVYCS--SAACSVHNKFNPEDSSTFQATSES 131
Query: 61 RNASDENGGEIMFGGVDKDKFVGDI 85
+ + G F G D K VG+I
Sbjct: 132 LSITYGTGSMTGFLGYDTVK-VGNI 155
>gi|291409609|ref|XP_002721071.1| PREDICTED: pepsin-3-like isoform 1 [Oryctolagus cuniculus]
Length = 387
Score = 92.0 bits (227), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 46/125 (36%), Positives = 76/125 (60%), Gaps = 5/125 (4%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-K 107
V +D F S + ++ D++G +MFGGVD + G + + PVS +GYWQ V+SI ++ +
Sbjct: 207 VSEDLF---SVYLSSDDDSGSVVMFGGVDSSYYTGSLNWVPVSYEGYWQITVDSITMDGE 263
Query: 108 NVYCSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
+ C++ CQAI DTGTSL+ GP+ I+ + IGA ++G V C ++ +PN+ +
Sbjct: 264 TIACADGCQAIVDTGTSLLAGPTSAISNIQSYIGASENSDGEMIVSCSSMYSLPNIVFTI 323
Query: 167 GGKNF 171
G +
Sbjct: 324 NGVQY 328
Score = 67.8 bits (164), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 30/42 (71%), Positives = 37/42 (88%), Gaps = 2/42 (4%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSV 49
+Y+GTIS+GTPPQ+F VIFDTGSSNLW+PS +CS + ACSV
Sbjct: 74 EYFGTISIGTPPQDFTVIFDTGSSNLWVPSVYCS--SAACSV 113
>gi|18152941|gb|AAB68519.2| proteinase A [Ogataea angusta]
gi|320580237|gb|EFW94460.1| proteinase A [Ogataea parapolymorpha DL-1]
Length = 413
Score = 92.0 bits (227), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 55/152 (36%), Positives = 84/152 (55%), Gaps = 3/152 (1%)
Query: 24 VIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVG 83
+ +DT S N +P + +I N+ +D +F + + S+++GGE FGG D K+ G
Sbjct: 208 LAYDTISVNRIVPPIYNAI-NLGL-LDTPQFGFYLGDTSKSEQDGGEATFGGYDVSKYTG 265
Query: 84 DITYSPVSRKGYWQFGVESIKI-EKNVYCSNCQAIADTGTSLIIGPSKVIAELNKLIGAV 142
DIT+ PV RK YW+ I + ++ N A DTGTSLI PS++ LN IGA
Sbjct: 266 DITWLPVRRKAYWEVKFSGIALGDEYAPLENTGAAIDTGTSLIALPSQLAEILNSQIGAE 325
Query: 143 PLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
+G ++DCD D +P++ G NFT++
Sbjct: 326 KSWSGQYQIDCDKRDSLPDLTFNFDGYNFTIS 357
Score = 58.9 bits (141), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 34/83 (40%), Positives = 45/83 (54%), Gaps = 4/83 (4%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASD- 65
AQY+ I LGTP Q FKVI DTGSSNLW+PS C+ L D ++ +N S
Sbjct: 97 AQYFTEIQLGTPGQSFKVILDTGSSNLWVPSSDCTSLACYLHTKYDHDESSTYQKNGSSF 156
Query: 66 --ENGGEIMFGGVDKDKF-VGDI 85
+ G + G V +D +GD+
Sbjct: 157 AIQYGSGSLEGYVSQDTLTIGDL 179
>gi|114678578|ref|XP_530061.2| PREDICTED: napsin-A-like [Pan troglodytes]
Length = 420
Score = 92.0 bits (227), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 43/116 (37%), Positives = 69/116 (59%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI-EKNVYCSN-CQAIA 118
R+ +GGE++ GG D ++ +T+ PV+ YWQ +E +K+ + C+ C AI
Sbjct: 223 RDPEVADGGELVLGGSDPAHYIPPLTFVPVTVPAYWQIHMERVKVGSRLTLCAQGCAAIL 282
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGT +I+GP++ I L+ IG +PL G + C + K+P V +++GG FTLT
Sbjct: 283 DTGTPVIVGPTEEIRALHAAIGGIPLLAGEYIIRCSEIPKLPAVSLLIGGVWFTLT 338
Score = 65.9 bits (159), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 26/41 (63%), Positives = 31/41 (75%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQY+G I LGTPPQ F V FDTGSSNLW+PS+ C ++ C
Sbjct: 76 AQYFGEIGLGTPPQNFTVAFDTGSSNLWVPSRRCHFFSVPC 116
>gi|388517285|gb|AFK46704.1| unknown [Medicago truncatula]
Length = 510
Score = 92.0 bits (227), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 45/84 (53%), Positives = 60/84 (71%), Gaps = 3/84 (3%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV--YCSN-CQAI 117
RN ++E GGE++FGGVD F G+ TY PV+RKGYWQF + + I+ YC+N C AI
Sbjct: 230 RNPNEEQGGELVFGGVDPAHFKGEHTYVPVTRKGYWQFAMGDVLIDGKPTGYCANDCSAI 289
Query: 118 ADTGTSLIIGPSKVIAELNKLIGA 141
AD+GTSL+ GP+ VI +N+ IGA
Sbjct: 290 ADSGTSLLAGPTTVITMINQAIGA 313
Score = 63.9 bits (154), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 29/49 (59%), Positives = 36/49 (73%), Gaps = 1/49 (2%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSV 49
+++ AQYYG IS+GTPPQ F VIFDTGSSN W+PS C ++AC V
Sbjct: 78 LKNYLDAQYYGEISIGTPPQTFTVIFDTGSSNTWVPSVKC-YFSLACLV 125
>gi|125986531|ref|XP_001357029.1| GA16571 [Drosophila pseudoobscura pseudoobscura]
gi|54645355|gb|EAL34095.1| GA16571 [Drosophila pseudoobscura pseudoobscura]
Length = 415
Score = 92.0 bits (227), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 48/117 (41%), Positives = 71/117 (60%), Gaps = 4/117 (3%)
Query: 61 RNASD-ENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSNCQAIA 118
RNAS+ NGG+++ GG D + G +TY P+S+ GYWQ V SI ++ CSNC+AI
Sbjct: 241 RNASEPSNGGQLLLGGSDPTLYSGCLTYVPLSQVGYWQITVGSISLDTGSDLCSNCEAII 300
Query: 119 DTGTSLIIGPSKVIAELNKLIG--AVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
D GTSLI+ PS +A +N+ G A +G + CD + +P + +G ++FTL
Sbjct: 301 DAGTSLIVVPSATLAAINQRFGITAADKRDGVYTISCDKVSSLPALTFNIGRRDFTL 357
Score = 51.2 bits (121), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 19/41 (46%), Positives = 32/41 (78%), Gaps = 2/41 (4%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCS 41
+ +SY +YYG +++G+ Q+F+V+FDT S+NLW+PS C+
Sbjct: 88 LTNSYNTEYYGQVTIGS--QQFQVLFDTASANLWVPSVECT 126
>gi|24580868|ref|NP_722706.1| CG31926, isoform A [Drosophila melanogaster]
gi|442625183|ref|NP_001259870.1| CG31926, isoform B [Drosophila melanogaster]
gi|7296075|gb|AAF51370.1| CG31926, isoform A [Drosophila melanogaster]
gi|440213128|gb|AGB92407.1| CG31926, isoform B [Drosophila melanogaster]
Length = 410
Score = 92.0 bits (227), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 46/116 (39%), Positives = 71/116 (61%), Gaps = 3/116 (2%)
Query: 61 RNASDE-NGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSNCQAIAD 119
RN ++ NGGE++ GG D + G +TY PVS GYWQF + S + +C NC+AI D
Sbjct: 241 RNGTNAINGGELILGGSDSGLYSGCLTYVPVSSAGYWQFTMTSANLNGFQFCENCEAILD 300
Query: 120 TGTSLIIGPSKVIAELNKLIGAV-PLA-NGPAKVDCDNLDKMPNVDIILGGKNFTL 173
GTSLI+ P +V+ +N+++G + P A NG VDC ++ +P++ + + F L
Sbjct: 301 VGTSLIVVPEQVLDTINQILGVLNPTASNGVFLVDCSSIGDLPDIVFTVARRKFPL 356
Score = 51.6 bits (122), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 19/40 (47%), Positives = 27/40 (67%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHC 40
+ + Y +YY T+ G PPQ+ KV+ DTGS+NLW+ S C
Sbjct: 82 LENLYNTEYYTTLGFGNPPQDLKVLIDTGSANLWVLSSKC 121
>gi|302820804|ref|XP_002992068.1| hypothetical protein SELMODRAFT_186535 [Selaginella moellendorffii]
gi|300140190|gb|EFJ06917.1| hypothetical protein SELMODRAFT_186535 [Selaginella moellendorffii]
Length = 499
Score = 92.0 bits (227), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 45/95 (47%), Positives = 69/95 (72%), Gaps = 4/95 (4%)
Query: 61 RNASDE-NGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCSN-CQA 116
R+ASDE +GGEI+FGGV+KD+F G Y+PV+R+GYWQF + + + + +C+ C A
Sbjct: 224 RDASDEEDGGEIVFGGVNKDRFKGKHVYTPVTREGYWQFNMGDVVVDGQSTGFCAKGCAA 283
Query: 117 IADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKV 151
IAD+GTSL+ GP+ ++A++N+ IGA L + K+
Sbjct: 284 IADSGTSLLAGPTGIVAQINQAIGATGLVSEECKM 318
Score = 65.5 bits (158), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 29/51 (56%), Positives = 37/51 (72%), Gaps = 1/51 (1%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDK 51
+R+ AQYYG I +G+PPQ F VIFDTGSSNLW+PS C I + AC + +
Sbjct: 72 LRNYLDAQYYGEIGIGSPPQVFTVIFDTGSSNLWVPSSRC-IFSPACWLHR 121
Score = 37.0 bits (84), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 24/77 (31%), Positives = 38/77 (49%), Gaps = 3/77 (3%)
Query: 101 ESIKIEKNVYCSNCQ-AIADTGTSLIIGPSKVIAE--LNKLIGAVPLANGPAKVDCDNLD 157
E+ K+ +V CS C+ A+ L +K E LN+L +P NG + V+C +
Sbjct: 363 ENQKVGDDVLCSVCEMAVVWVQNQLRQNRTKQQIEDYLNQLCERLPSPNGQSVVECAKIS 422
Query: 158 KMPNVDIILGGKNFTLT 174
+PNV + + F LT
Sbjct: 423 SLPNVSFTIANQTFELT 439
>gi|291409620|ref|XP_002721076.1| PREDICTED: pepsinogen III-like [Oryctolagus cuniculus]
Length = 387
Score = 92.0 bits (227), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 45/125 (36%), Positives = 76/125 (60%), Gaps = 5/125 (4%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-K 107
V +D F S + ++ DE+G +MFGG+D + G + + PVS +GYWQ ++SI ++ +
Sbjct: 207 VSEDLF---SVYLSSDDESGSVVMFGGIDSSYYTGSLNWVPVSYEGYWQITLDSITMDGE 263
Query: 108 NVYCSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
+ C++ CQAI DTGTSL+ GP+ I+ + IGA ++G V C ++ +PN+ +
Sbjct: 264 TIACADGCQAIVDTGTSLLAGPTSAISNIQSYIGASENSDGEMIVSCSSMYSLPNIVFTI 323
Query: 167 GGKNF 171
G +
Sbjct: 324 NGVQY 328
Score = 63.5 bits (153), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 39/85 (45%), Positives = 49/85 (57%), Gaps = 10/85 (11%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSV-------DKDKFVGHSHF 60
+Y+GTI +GTP Q+F VIFDTGSSNLW+PS +CS + ACSV D F S
Sbjct: 74 EYFGTIGIGTPAQDFTVIFDTGSSNLWVPSVYCS--SAACSVHNKFNPEDSSTFQATSES 131
Query: 61 RNASDENGGEIMFGGVDKDKFVGDI 85
+ + G F G D K VG+I
Sbjct: 132 LSITYGTGSMTGFLGYDTVK-VGNI 155
>gi|126310959|ref|XP_001372683.1| PREDICTED: chymosin-like [Monodelphis domestica]
Length = 383
Score = 92.0 bits (227), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 43/113 (38%), Positives = 64/113 (56%), Gaps = 2/113 (1%)
Query: 63 ASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN--CQAIADT 120
+ D G ++ G +D + G + + PV+ +GYWQF V+SI + V CQAI DT
Sbjct: 218 SRDSQGSMLILGAIDPSYYTGSLHWVPVTEQGYWQFSVDSITVNGQVVACEGGCQAILDT 277
Query: 121 GTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
GTSL++GPS IA + +IGA G ++C NL MP V + + G+ + L
Sbjct: 278 GTSLLVGPSYDIANIQSIIGATQGQYGEYDINCSNLSSMPTVVVHINGRQYPL 330
Score = 61.6 bits (148), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 29/58 (50%), Positives = 36/58 (62%), Gaps = 9/58 (15%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNAS 64
+QY+G I +GTPPQEF V+FDTGSSNLW+PS +C + D H F AS
Sbjct: 74 SQYFGKIYIGTPPQEFTVVFDTGSSNLWVPSVYC---------NSDACQNHHRFNPAS 122
>gi|384498765|gb|EIE89256.1| endopeptidase [Rhizopus delemar RA 99-880]
Length = 401
Score = 92.0 bits (227), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 48/113 (42%), Positives = 68/113 (60%), Gaps = 1/113 (0%)
Query: 62 NASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI-EKNVYCSNCQAIADT 120
N +NGGE++FGGVD+D F GDI +S V RKGYW+ +E+IK + V A DT
Sbjct: 232 NKDQDNGGELIFGGVDEDHFEGDIHWSDVRRKGYWEITMENIKFGDDYVDIDPVGAAIDT 291
Query: 121 GTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
G+SL++ P+ V A +NK +GA G VDC+ + +P + GK+F L
Sbjct: 292 GSSLLVAPTTVAALINKELGAEKNWAGQYVVDCNKVPSLPEFCFVFNGKDFCL 344
Score = 70.1 bits (170), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 41/88 (46%), Positives = 52/88 (59%), Gaps = 15/88 (17%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNAS-- 64
AQYYG I +GTPPQ F V+FDTGSSNLW+PS HC+ +IAC + H + +AS
Sbjct: 85 AQYYGEIEIGTPPQPFTVVFDTGSSNLWVPSTHCT--SIACFL-------HKRYDSASSR 135
Query: 65 --DENGGE--IMFGGVDKDKFVGDITYS 88
ENG E I +G + F+ T S
Sbjct: 136 TYSENGTEFAIQYGTGSLEGFISQDTLS 163
>gi|395858453|ref|XP_003801583.1| PREDICTED: napsin-A [Otolemur garnettii]
Length = 419
Score = 92.0 bits (227), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 44/116 (37%), Positives = 68/116 (58%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAIA 118
R+ +GGE++ GG D ++ +T+ PV+ YWQ +E +K+ + C+ C AI
Sbjct: 222 RDPDVADGGELVLGGSDPAHYIPPLTFVPVTIPAYWQIHMERVKVGTGLTLCAQGCAAIL 281
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSLI GP++ I L+ IG +PL G ++C + ++P V +LGG F LT
Sbjct: 282 DTGTSLITGPTEEIRALHAAIGGIPLPPGEHLIECSEIPRLPPVSFLLGGVWFNLT 337
Score = 65.9 bits (159), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 25/40 (62%), Positives = 31/40 (77%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
QY+G I LGTPPQ F V+FDTGSSNLW+PS+ C ++ C
Sbjct: 76 QYFGEIGLGTPPQNFSVVFDTGSSNLWVPSRRCHFFSVPC 115
>gi|426251840|ref|XP_004019629.1| PREDICTED: pepsin A-like [Ovis aries]
Length = 386
Score = 92.0 bits (227), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 45/125 (36%), Positives = 78/125 (62%), Gaps = 5/125 (4%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-K 107
V +D F S + ++++E+G +MFGG+D + G + + PVS +GYWQ V+SI + +
Sbjct: 206 VSQDLF---SVYLSSNEESGSVVMFGGIDSSYYSGSLNWVPVSVEGYWQITVDSITMNGE 262
Query: 108 NVYCSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
++ CS+ CQAI DTGTSL+ GP+ I+ + IGA ++G + C ++D +P++ +
Sbjct: 263 SIACSDGCQAIVDTGTSLLAGPTTAISNIQSYIGASEDSSGEEVISCSSIDSLPDIVFTI 322
Query: 167 GGKNF 171
G +
Sbjct: 323 NGVQY 327
Score = 59.7 bits (143), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 24/41 (58%), Positives = 33/41 (80%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCS 41
+++ +Y+GTI +GTP Q+F VIFDTGSSNLW+PS +CS
Sbjct: 66 LQNYLDTEYFGTIGIGTPAQDFTVIFDTGSSNLWVPSIYCS 106
>gi|12231174|dbj|BAB20970.1| aspartic proteinase 2 [Nepenthes alata]
Length = 514
Score = 91.7 bits (226), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 45/94 (47%), Positives = 63/94 (67%), Gaps = 3/94 (3%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCS-NCQAI 117
R + +E GGEI+FGGVD + + G+ TY PV+RKGYWQF ++ + + E YCS C AI
Sbjct: 234 RKSEEEEGGEIVFGGVDPNHYKGEHTYVPVTRKGYWQFDMDDVLVGGETTGYCSGGCSAI 293
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKV 151
AD+GTSL+ GP+ +I ++N IGA L + K
Sbjct: 294 ADSGTSLLAGPTTIIVQINHAIGASGLVSQECKA 327
Score = 63.5 bits (153), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 27/47 (57%), Positives = 36/47 (76%), Gaps = 1/47 (2%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+++ AQY+G I +G+PPQ+F VIFDTGSSNLW+PS C +IAC
Sbjct: 82 LKNYMDAQYFGEIGIGSPPQKFTVIFDTGSSNLWVPSAKC-YFSIAC 127
Score = 39.7 bits (91), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 17/45 (37%), Positives = 28/45 (62%)
Query: 130 KVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
+++ +N+L +P G + VDC +L MPNV + +GGK F L+
Sbjct: 410 QILNYVNELCNRLPSPMGESSVDCGSLSSMPNVSLTIGGKVFDLS 454
>gi|449433980|ref|XP_004134774.1| PREDICTED: aspartic proteinase-like [Cucumis sativus]
gi|449526063|ref|XP_004170034.1| PREDICTED: aspartic proteinase-like [Cucumis sativus]
Length = 516
Score = 91.7 bits (226), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 46/94 (48%), Positives = 61/94 (64%), Gaps = 3/94 (3%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCS-NCQAI 117
RNA ++ GGE++FGGVD F G TY PV+ KGYWQF + I I E YC+ C AI
Sbjct: 237 RNAEEKEGGELVFGGVDPKHFKGQHTYVPVTDKGYWQFDIGDILIGGETTKYCAGGCSAI 296
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKV 151
AD+GTSL+ GPS ++ +N+ IGA +A+ K
Sbjct: 297 ADSGTSLLAGPSNIVVSINRAIGAAAVAHPECKA 330
Score = 67.8 bits (164), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 29/47 (61%), Positives = 37/47 (78%), Gaps = 1/47 (2%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+++ AQYYG I +GTPPQ+F VIFDTGSSNLW+PS C I ++AC
Sbjct: 85 LKNYLDAQYYGEIGIGTPPQKFTVIFDTGSSNLWVPSAKC-IFSLAC 130
>gi|335287195|ref|XP_003355296.1| PREDICTED: gastricsin-like [Sus scrofa]
Length = 391
Score = 91.7 bits (226), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 44/117 (37%), Positives = 62/117 (52%), Gaps = 4/117 (3%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV----YCSNCQA 116
R + E GGE++ GGVD + G I ++PV+R+ YWQ ++ I + CQA
Sbjct: 217 RQPTYEYGGELILGGVDTQLYSGQIVWTPVTRELYWQIAIQEFAIGDQATGWCFSQGCQA 276
Query: 117 IADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
I DTGT L+ P + +A + GA NG VDCD + MP + I+GG F L
Sbjct: 277 IVDTGTFLLAVPQQYLASFLQATGAQEAQNGDFVVDCDLVQSMPTITFIIGGSQFPL 333
Score = 59.7 bits (143), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 28/46 (60%), Positives = 34/46 (73%), Gaps = 3/46 (6%)
Query: 9 YYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKF 54
Y+G IS+GTPPQ F V+FDTGSSNLW+PS +C ACS D +F
Sbjct: 74 YFGEISIGTPPQNFLVLFDTGSSNLWVPSTYCQ--TQACS-DHRRF 116
>gi|66865247|gb|AAY57527.1| pepsinogen A [Gadus morhua]
Length = 277
Score = 91.7 bits (226), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 45/118 (38%), Positives = 68/118 (57%), Gaps = 2/118 (1%)
Query: 58 SHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN--CQ 115
S + ++ + G ++FGG++ + G I + P++ YWQ ++S+ I N N CQ
Sbjct: 125 SVYLSSHSQEGSVVVFGGIEASYYTGQIAWIPLTSATYWQIKMDSVTINGNPVACNGGCQ 184
Query: 116 AIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
AI DTGTSLI+GPS I+ +N +GA G A V+C N+ MP V L G +FT+
Sbjct: 185 AIIDTGTSLIVGPSSDISNMNSWVGASTDQYGDATVNCQNIGSMPEVVFTLSGHSFTV 242
>gi|453232811|ref|NP_509142.2| Protein ASP-3 [Caenorhabditis elegans]
gi|412984028|emb|CCD72415.2| Protein ASP-3 [Caenorhabditis elegans]
Length = 398
Score = 91.7 bits (226), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 46/115 (40%), Positives = 70/115 (60%), Gaps = 2/115 (1%)
Query: 61 RNASD-ENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN-CQAIA 118
R+A+D NGGEI D + +VG+I + P+ + YW+ + S+ I+ Y S +I
Sbjct: 219 RDANDITNGGEITLCDTDPNHYVGNIAWEPLVSEDYWRIKLASVVIDGTTYTSGPIDSIV 278
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
DTGTSL+ GP+ VI ++ IG +PL NG +V+C + +PN+ LGG+NF L
Sbjct: 279 DTGTSLLTGPTDVIKKIQHKIGGIPLFNGEYEVECSKIPSLPNITFNLGGQNFDL 333
Score = 66.2 bits (160), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 26/43 (60%), Positives = 35/43 (81%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSV 49
AQYYG +++GTPPQ F+V+FDTGSSNLW+P +C +IAC +
Sbjct: 67 AQYYGPVTIGTPPQNFQVLFDTGSSNLWVPCANCPFGDIACRM 109
>gi|395852554|ref|XP_003798803.1| PREDICTED: pepsin A-like [Otolemur garnettii]
Length = 387
Score = 91.7 bits (226), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 46/125 (36%), Positives = 75/125 (60%), Gaps = 5/125 (4%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-K 107
V +D F S F +++D++G +MFGG+D + G++ + P+S +GYWQ V+SI + +
Sbjct: 207 VSQDLF---SVFLSSNDQSGSVVMFGGIDSSYYTGELNWIPLSSEGYWQITVDSITMNGE 263
Query: 108 NVYCSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
+ CS CQAI DTGTSL+ GP+ IA + IGA + G + C ++ +PN+ +
Sbjct: 264 PIACSQGCQAIVDTGTSLLSGPTSPIANIQSYIGASEDSYGQMVISCSAINSLPNIVFTI 323
Query: 167 GGKNF 171
G +
Sbjct: 324 NGVQY 328
Score = 61.6 bits (148), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 29/47 (61%), Positives = 38/47 (80%), Gaps = 3/47 (6%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKF 54
+Y+GTI +GTP QEF VIFDTGSSNLW+PS +CS + ACS + ++F
Sbjct: 74 EYFGTIGIGTPAQEFTVIFDTGSSNLWVPSVYCS--SPACS-NHNRF 117
>gi|302899226|ref|XP_003048007.1| predicted protein [Nectria haematococca mpVI 77-13-4]
gi|256728939|gb|EEU42294.1| predicted protein [Nectria haematococca mpVI 77-13-4]
Length = 396
Score = 91.7 bits (226), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 49/119 (41%), Positives = 67/119 (56%), Gaps = 1/119 (0%)
Query: 60 FRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVY-CSNCQAIA 118
F A E E++FGGVDK + GDI Y P+ RK YW+ +++I + V N AI
Sbjct: 225 FYLADQEGESEVVFGGVDKSHYEGDIEYIPLRRKAYWEVDLDAIALGDEVAEQENTGAIL 284
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLTRTN 177
DTGTSL + PS + LNK IGA NG V+CD +P++ L G N++L T+
Sbjct: 285 DTGTSLNVLPSALAELLNKEIGAKKGYNGQYTVECDKRQTLPDITFTLAGSNYSLPATD 343
Score = 62.0 bits (149), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 26/41 (63%), Positives = 32/41 (78%), Gaps = 2/41 (4%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQY+ I++G PPQ FKV+ DTGSSNLW+PSQ C +IAC
Sbjct: 84 AQYFSEITIGNPPQSFKVVLDTGSSNLWVPSQECG--SIAC 122
>gi|21542388|sp|P55956.2|ASP3_CAEEL RecName: Full=Aspartic protease 3; Flags: Precursor
Length = 398
Score = 91.7 bits (226), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 46/115 (40%), Positives = 70/115 (60%), Gaps = 2/115 (1%)
Query: 61 RNASD-ENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN-CQAIA 118
R+A+D NGGEI D + +VG+I + P+ + YW+ + S+ I+ Y S +I
Sbjct: 219 RDANDITNGGEITLCETDPNHYVGNIAWEPLVSEDYWRIKLASVVIDGTTYTSGPIDSIV 278
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
DTGTSL+ GP+ VI ++ IG +PL NG +V+C + +PN+ LGG+NF L
Sbjct: 279 DTGTSLLTGPTDVIKKIQHKIGGIPLFNGEYEVECSKIPSLPNITFNLGGQNFDL 333
Score = 66.6 bits (161), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 26/43 (60%), Positives = 35/43 (81%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSV 49
AQYYG +++GTPPQ F+V+FDTGSSNLW+P +C +IAC +
Sbjct: 67 AQYYGPVTIGTPPQNFQVLFDTGSSNLWVPCANCPFGDIACRM 109
>gi|56182674|gb|AAV84086.1| aspartic proteinase 12 [Fagopyrum esculentum]
Length = 387
Score = 91.7 bits (226), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 44/93 (47%), Positives = 63/93 (67%), Gaps = 3/93 (3%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE--KNVYCS-NCQAI 117
RNA +E GGE++FGG+D D F G TY PV++KGYWQF ++ + I+ +C+ C AI
Sbjct: 127 RNAEEEEGGELVFGGIDPDHFRGQHTYVPVTQKGYWQFDMDDVLIDGMSTGFCAGGCAAI 186
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAK 150
AD+GTSL+ GP V+A++N IGA + + K
Sbjct: 187 ADSGTSLLAGPMAVVAQINHAIGATGIVSQECK 219
Score = 35.4 bits (80), Expect = 8.7, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 22/39 (56%)
Query: 136 NKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
N+L +P G + VDC +L +P+V +GGK F L
Sbjct: 309 NQLCERLPSPMGESAVDCSSLSTLPDVSFTIGGKTFDLA 347
>gi|195470501|ref|XP_002087545.1| GE17604 [Drosophila yakuba]
gi|194173646|gb|EDW87257.1| GE17604 [Drosophila yakuba]
Length = 410
Score = 91.3 bits (225), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 46/120 (38%), Positives = 72/120 (60%), Gaps = 3/120 (2%)
Query: 61 RNASDE-NGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSNCQAIAD 119
RN +D +GGE++ GG D + G +TY PVS GYWQF + S + +C+NC+AI D
Sbjct: 241 RNGTDAIHGGELILGGTDSGLYSGCLTYVPVSVAGYWQFTMTSASVSGFQFCANCEAILD 300
Query: 120 TGTSLIIGPSKVIAELNKLIGAV-PLA-NGPAKVDCDNLDKMPNVDIILGGKNFTLTRTN 177
GTSLI+ P V+ +N+++G P A NG VDC + ++P++ + + F L ++
Sbjct: 301 VGTSLIVVPKPVLRIINQILGVRNPTASNGVFLVDCSTISELPDIVFTIARRQFPLKSSD 360
Score = 50.8 bits (120), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 19/40 (47%), Positives = 26/40 (65%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHC 40
+ + Y +YY T+ G PPQ KV+ DTGS+NLW+ S C
Sbjct: 82 LSNMYNTEYYTTLEFGNPPQAMKVLIDTGSANLWVLSSKC 121
>gi|325087547|gb|EGC40857.1| aspartic endopeptidase Pep2 [Ajellomyces capsulatus H88]
Length = 398
Score = 91.3 bits (225), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 47/110 (42%), Positives = 66/110 (60%), Gaps = 1/110 (0%)
Query: 65 DENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKN-VYCSNCQAIADTGTS 123
D++ E++FGG++KD+F G++T P+ RK YW+ ++SI K +N I DTGTS
Sbjct: 232 DDDQSEVVFGGMNKDRFTGELTKIPLRRKAYWEVDLDSITFGKQTAMMTNTGVILDTGTS 291
Query: 124 LIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
LI PS + LNK IGA NG V+C D +PN+ L G NFT+
Sbjct: 292 LIALPSTIAELLNKEIGAKKSFNGQYTVECAKRDSLPNLTFGLSGHNFTI 341
Score = 63.2 bits (152), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 30/61 (49%), Positives = 39/61 (63%), Gaps = 4/61 (6%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDK--DKFVGHSHFRNAS 64
AQY+ I +GTPPQ FKV+ DTGSSNLW+PS C +IAC + D +H +N S
Sbjct: 83 AQYFSEIGIGTPPQTFKVVLDTGSSNLWVPSSECG--SIACYLHNKYDSSASSTHKKNGS 140
Query: 65 D 65
+
Sbjct: 141 E 141
>gi|291409613|ref|XP_002721073.1| PREDICTED: pepsinogen III-like [Oryctolagus cuniculus]
Length = 387
Score = 91.3 bits (225), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 45/125 (36%), Positives = 74/125 (59%), Gaps = 5/125 (4%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-K 107
V +D F S + ++ DE+G +MFGG+D + G + + PVS +GYWQ ++SI ++ +
Sbjct: 207 VSQDLF---SVYLSSDDESGSLVMFGGIDSSYYTGSLNWVPVSYEGYWQITLDSITMDGE 263
Query: 108 NVYCSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
+ C++ CQAI DTGTSL+ GP+ I+ + IGA G V C ++ +PN+ +
Sbjct: 264 TIACADGCQAIVDTGTSLLAGPTSAISNIQSYIGASENYEGEMIVSCSSMYSLPNIVFTI 323
Query: 167 GGKNF 171
G +
Sbjct: 324 NGVQY 328
Score = 67.4 bits (163), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 29/42 (69%), Positives = 37/42 (88%), Gaps = 2/42 (4%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSV 49
+Y+GTIS+GTPPQ+F VIFDTGSSNLW+PS +CS + AC+V
Sbjct: 74 EYFGTISIGTPPQDFTVIFDTGSSNLWVPSTYCS--SAACTV 113
>gi|332024603|gb|EGI64801.1| Lysosomal aspartic protease [Acromyrmex echinatior]
Length = 380
Score = 91.3 bits (225), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 47/112 (41%), Positives = 66/112 (58%), Gaps = 2/112 (1%)
Query: 64 SDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN-CQAIADTGT 122
+ E GE++ G +D +VG +T V+RKGYWQF + +++ N C N CQAI D+
Sbjct: 210 TSEVVGELILGDIDSSLYVGKLTNVNVTRKGYWQFNMNKVQLGNNTLCENDCQAIIDSSN 269
Query: 123 SLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
I GP IA +NK I + L N PA V C + K+P++ I+GGK F LT
Sbjct: 270 VRISGPPSAIAVINKYIRNISL-NDPAVVHCKQIYKLPDIYFIIGGKVFELT 320
Score = 53.1 bits (126), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 21/48 (43%), Positives = 34/48 (70%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACS 48
+R ++ ++G +S+G+ PQ FKV+FDT SSN WI S++C +AC+
Sbjct: 50 LRDDHENSFFGYLSIGSYPQYFKVLFDTSSSNFWILSKNCHSNTMACA 97
>gi|395534129|ref|XP_003769100.1| PREDICTED: LOW QUALITY PROTEIN: gastricsin-like [Sarcophilus
harrisii]
Length = 391
Score = 91.3 bits (225), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 43/114 (37%), Positives = 68/114 (59%), Gaps = 3/114 (2%)
Query: 64 SDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCSN-CQAIADT 120
S NGGE++FGGVD + + G I ++PV+++ YWQ G++ I + +CS CQAI DT
Sbjct: 218 SSXNGGEVIFGGVDNNLYTGQIYWAPVTQELYWQIGIQEFSIGGQATGWCSQGCQAIVDT 277
Query: 121 GTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
GTSL+ P + ++ + GA G VDC+N+ +P + ++ G F L+
Sbjct: 278 GTSLLTVPQQYMSAFLQATGAQQDQYGQYVVDCNNIQSLPTISFLINGVQFPLS 331
Score = 60.1 bits (144), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 31/60 (51%), Positives = 38/60 (63%), Gaps = 9/60 (15%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHF 60
+ S A YYG IS+GTPPQ F V+FDTG SNLW+PS +C + ACS GH+ F
Sbjct: 64 LPSYLDAAYYGEISIGTPPQNFLVLFDTGFSNLWVPSIYCQ--SQACS-------GHAQF 114
>gi|1665867|emb|CAA70340.1| aspartic proteinase [Centaurea calcitrapa]
Length = 509
Score = 91.3 bits (225), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 46/101 (45%), Positives = 65/101 (64%), Gaps = 3/101 (2%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV--YCSN-CQAI 117
RNA +E GGE++FGGVD + F G TY PV++KGYWQF + + IE +C++ C AI
Sbjct: 229 RNADEEEGGELVFGGVDPNHFKGKHTYVPVTQKGYWQFNMGDVLIEDKTTGFCADGCAAI 288
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDK 158
AD+GTSL+ GP+ +I ++N IGA + + K D K
Sbjct: 289 ADSGTSLLAGPTAIITQINHAIGAKGVMSQQCKTLVDQYGK 329
Score = 61.6 bits (148), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 26/47 (55%), Positives = 35/47 (74%), Gaps = 1/47 (2%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+++ AQYYG I +G+P Q+F VIFDTGSSNLW+PS C ++AC
Sbjct: 77 LKNYMDAQYYGEIGIGSPAQKFTVIFDTGSSNLWVPSAKC-YFSVAC 122
Score = 38.5 bits (88), Expect = 0.99, Method: Compositional matrix adjust.
Identities = 17/43 (39%), Positives = 26/43 (60%)
Query: 131 VIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
+I +N+L +P G + VDC++L MPN+ +GGK F L
Sbjct: 406 IINYVNELCDRLPSPMGESAVDCNDLSSMPNIAFTIGGKVFEL 448
>gi|444316168|ref|XP_004178741.1| hypothetical protein TBLA_0B03830 [Tetrapisispora blattae CBS 6284]
gi|387511781|emb|CCH59222.1| hypothetical protein TBLA_0B03830 [Tetrapisispora blattae CBS 6284]
Length = 413
Score = 91.3 bits (225), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 54/152 (35%), Positives = 83/152 (54%), Gaps = 3/152 (1%)
Query: 24 VIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVG 83
+ +DT + N +P + +I +D+ KF + + S +NGGE +FGG+D+ KF G
Sbjct: 208 LAYDTIAVNRVVPPVYNAINQ--GLLDEPKFAFYLGDASKSKDNGGEAVFGGIDETKFEG 265
Query: 84 DITYSPVSRKGYWQFGVESIKI-EKNVYCSNCQAIADTGTSLIIGPSKVIAELNKLIGAV 142
DIT+ PV RK YW+ +E + + E+ N A DTGTSLI PS + +N IGA
Sbjct: 266 DITWLPVRRKAYWEVKLEGLGLGEEYTELENHGAAIDTGTSLITLPSGLAEIINSEIGAK 325
Query: 143 PLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
G ++CD +P++ G NFT++
Sbjct: 326 KGWTGQYTIECDKRASLPDMTFTFDGYNFTIS 357
Score = 60.8 bits (146), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 25/41 (60%), Positives = 32/41 (78%), Gaps = 2/41 (4%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQY+ I +GTPPQ FKV+ DTGSSNLW+PS+ C ++AC
Sbjct: 97 AQYFADIKIGTPPQSFKVVLDTGSSNLWVPSKECG--SLAC 135
>gi|170091822|ref|XP_001877133.1| aspartic peptidase A1 [Laccaria bicolor S238N-H82]
gi|164648626|gb|EDR12869.1| aspartic peptidase A1 [Laccaria bicolor S238N-H82]
Length = 408
Score = 91.3 bits (225), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 57/154 (37%), Positives = 84/154 (54%), Gaps = 8/154 (5%)
Query: 26 FDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFR-NASDENGGEIMFGGVDKDKFVGD 84
+DT S N IP + S++N +D F FR +S+E+GGE +FGG+D+ + G
Sbjct: 208 YDTISVNHIIPPFY-SMINQGL-IDSPVF----SFRLGSSEEDGGEAVFGGIDESAYKGK 261
Query: 85 ITYSPVSRKGYWQFGVESIKI-EKNVYCSNCQAIADTGTSLIIGPSKVIAELNKLIGAVP 143
ITY PV RK YW+ +E + ++ + A DTGTSLI+ P+ + LN IGA
Sbjct: 262 ITYVPVRRKAYWEVELEKVSFGNDDLELESTGAAIDTGTSLIVLPTDIAEMLNTQIGAKK 321
Query: 144 LANGPAKVDCDNLDKMPNVDIILGGKNFTLTRTN 177
NG +VDC + +P + GGK + L T+
Sbjct: 322 SWNGQYQVDCAKVPSLPELSFYFGGKPYPLKGTD 355
Score = 62.4 bits (150), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 41/93 (44%), Positives = 52/93 (55%), Gaps = 22/93 (23%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASDE 66
AQY+ IS+G PPQ FKVI DTGSSNLW+PS C+ +IAC + H+ + +AS
Sbjct: 95 AQYFTEISIGNPPQSFKVILDTGSSNLWVPSVKCT--SIACFL-------HTKYDSASSS 145
Query: 67 ----NGGE--IMFGGVDKDKFV-------GDIT 86
NG E I +G + FV GDIT
Sbjct: 146 TFKANGSEFSIHYGSGSMEGFVSNDLLSIGDIT 178
>gi|407726059|dbj|BAM46127.1| pepsinogen C [Cynops pyrrhogaster]
Length = 385
Score = 91.3 bits (225), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 41/118 (34%), Positives = 71/118 (60%), Gaps = 3/118 (2%)
Query: 59 HFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCSN-CQ 115
+ + GGE++FGGVD + + G+IT++PV+++ YWQ G++ + ++ +CS CQ
Sbjct: 208 YMSEEGTQPGGELIFGGVDSNYYTGEITWTPVTQQMYWQIGIQGFAVNGQETGWCSQGCQ 267
Query: 116 AIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
I DTGTSL+ P + +A L + IGA ++G V C ++ +P + +GG + L
Sbjct: 268 GIVDTGTSLLTAPGQYMAALMQDIGATLDSSGQYVVTCSSVTSLPTLSFTIGGTSLPL 325
Score = 58.2 bits (139), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 24/33 (72%), Positives = 28/33 (84%)
Query: 9 YYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCS 41
YYG IS+GTPPQ F V+FDTGSSNLW+ S +CS
Sbjct: 68 YYGEISIGTPPQNFLVLFDTGSSNLWVASTYCS 100
>gi|401623301|gb|EJS41405.1| pep4p [Saccharomyces arboricola H-6]
Length = 405
Score = 90.9 bits (224), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 51/126 (40%), Positives = 70/126 (55%), Gaps = 1/126 (0%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI-EK 107
+D+ KF + + ENGGE FGG+D+ KF GDIT+ PV RK YW+ E I + ++
Sbjct: 223 LDEKKFAFYLGDTSKDSENGGEATFGGIDESKFKGDITWLPVRRKAYWEVKFEGIGLGDE 282
Query: 108 NVYCSNCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILG 167
N A DTGTSLI PS + +N IGA G +DC+ D +P++ L
Sbjct: 283 FAELENHGAAIDTGTSLITLPSGLAEMINAEIGAKKGWTGQYTLDCNTRDGLPDLTFNLN 342
Query: 168 GKNFTL 173
G NFT+
Sbjct: 343 GYNFTI 348
Score = 63.5 bits (153), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 36/84 (42%), Positives = 47/84 (55%), Gaps = 4/84 (4%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASD- 65
AQYY I+LGTPPQ FKVI DTGSSNLW+PS C L D S+ N ++
Sbjct: 89 AQYYTDITLGTPPQNFKVILDTGSSNLWVPSNECGSLACFLHSKYDHEASSSYKANGTEF 148
Query: 66 --ENGGEIMFGGVDKDKF-VGDIT 86
+ G + G + +D +GD+T
Sbjct: 149 AIQYGTGSLEGYISQDTLSIGDLT 172
>gi|117662285|gb|ABK55693.1| aspartic proteinase [Cucumis sativus]
Length = 196
Score = 90.9 bits (224), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 44/84 (52%), Positives = 60/84 (71%), Gaps = 3/84 (3%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV--YCS-NCQAI 117
RNA +E GGEI+FGGVD D + G+ TY PV++KGYWQF + + I + +CS C AI
Sbjct: 109 RNADEEQGGEIVFGGVDPDHYKGEHTYVPVTKKGYWQFDMGDVLINGSTTGFCSGGCSAI 168
Query: 118 ADTGTSLIIGPSKVIAELNKLIGA 141
AD+GTSL+ GP+ +I ++N IGA
Sbjct: 169 ADSGTSLLAGPTTIITQVNHAIGA 192
>gi|125986537|ref|XP_001357032.1| GA16570 [Drosophila pseudoobscura pseudoobscura]
gi|54645358|gb|EAL34098.1| GA16570 [Drosophila pseudoobscura pseudoobscura]
Length = 408
Score = 90.9 bits (224), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 44/114 (38%), Positives = 71/114 (62%), Gaps = 4/114 (3%)
Query: 62 NASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSNCQAIADTG 121
NA+D GGE++ GG+D F G +TY PVS++GYWQF + S + +C++CQAI D G
Sbjct: 243 NATD--GGELVLGGIDATLFSGCLTYVPVSQQGYWQFVMTSAVLGGKTFCTHCQAILDVG 300
Query: 122 TSLIIGPSKVIAELNKLIGAV-PL-ANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
TSL++ P+ I ++N+L+ + P ++G V+C + +P + + K F L
Sbjct: 301 TSLLVAPTAAIKKINQLLAVLNPQDSSGVFLVNCSTIASLPTMVFTIARKEFPL 354
Score = 55.1 bits (131), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 24/64 (37%), Positives = 38/64 (59%), Gaps = 7/64 (10%)
Query: 4 SYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNA 63
++ +YY +++GTPPQEF ++ DTGSSNLW+PS C +C V H+ + +
Sbjct: 84 AFNTEYYLPVTIGTPPQEFILLIDTGSSNLWVPSSKCPATVKSC-------VSHNQYDSK 136
Query: 64 SDEN 67
S +
Sbjct: 137 SSSS 140
>gi|281207795|gb|EFA81975.1| cathepsin D [Polysphondylium pallidum PN500]
Length = 390
Score = 90.9 bits (224), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 45/115 (39%), Positives = 66/115 (57%), Gaps = 4/115 (3%)
Query: 67 NGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV--YCS--NCQAIADTGT 122
NGGE+ FG +D ++ G ITY P++ + YW+F ++ + N YC C AI D+GT
Sbjct: 217 NGGELTFGSIDTTRYTGPITYVPLTNETYWEFKMDDFALNGNSLGYCGADGCHAICDSGT 276
Query: 123 SLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLTRTN 177
SLI GPS I LN +GAV + C + +PN++I + G+ F LT T+
Sbjct: 277 SLIAGPSAQINALNTKLGAVVMNGEGIFTSCSVISTLPNIEITVAGRQFLLTPTD 331
Score = 68.9 bits (167), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 28/43 (65%), Positives = 35/43 (81%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSV 49
AQYYG I++GTP Q FKV+FDTGSSNLWIPS+ C + +AC +
Sbjct: 65 AQYYGAITIGTPAQSFKVVFDTGSSNLWIPSKKCPVTVVACDL 107
>gi|346322842|gb|EGX92440.1| vacuolar protease A precursor [Cordyceps militaris CM01]
Length = 395
Score = 90.9 bits (224), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 50/119 (42%), Positives = 65/119 (54%), Gaps = 1/119 (0%)
Query: 60 FRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVY-CSNCQAIA 118
F S+E G E +FGGVDK+ + G I Y P+ RK YW+ ++I K V N I
Sbjct: 224 FYLGSEEEGSEAVFGGVDKNHYEGKIEYLPLRRKAYWEVDFDAIAFGKEVAELENTGVIL 283
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLTRTN 177
DTGTSL PS + LNK IGA G +DC DK+P++ L G N+TL T+
Sbjct: 284 DTGTSLNTLPSDLAELLNKEIGAKKGFGGQYTIDCAARDKLPDITFTLAGSNYTLPATD 342
Score = 65.5 bits (158), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 28/41 (68%), Positives = 34/41 (82%), Gaps = 2/41 (4%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQY+ I++GTPPQ FKV+ DTGSSNLW+PSQ CS +IAC
Sbjct: 83 AQYFSEITIGTPPQTFKVVLDTGSSNLWVPSQSCS--SIAC 121
>gi|332241362|ref|XP_003269849.1| PREDICTED: napsin-A-like [Nomascus leucogenys]
Length = 421
Score = 90.9 bits (224), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 45/116 (38%), Positives = 67/116 (57%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAIA 118
R+ + +GGE++ GG D ++ +T+ PV+ YWQ +E +K+ + C+ C AI
Sbjct: 224 RDPEEPDGGELVLGGSDPAHYIPPLTFVPVTVPAYWQIHMERVKVGPGLTLCARGCAAIL 283
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSLI GP++ I L+ IG PL G + C + K+P V +LGG F LT
Sbjct: 284 DTGTSLITGPTEEIRALHAAIGGYPLLAGEYIILCSEIPKLPAVSFLLGGVWFNLT 339
Score = 65.9 bits (159), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 27/46 (58%), Positives = 35/46 (76%), Gaps = 1/46 (2%)
Query: 3 SSYQ-AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
S+Y+ QY+G I LGTPPQ F V+FDTGSSNLW+PS+ C ++ C
Sbjct: 72 SNYRDVQYFGEIGLGTPPQNFTVVFDTGSSNLWVPSRRCHFFSVPC 117
>gi|166796432|gb|AAI59305.1| LOC496913 protein [Xenopus (Silurana) tropicalis]
Length = 381
Score = 90.9 bits (224), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 42/118 (35%), Positives = 68/118 (57%), Gaps = 3/118 (2%)
Query: 59 HFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCSN-CQ 115
+ + ++GGE+ FGGVD++ + G I ++PV+ + YWQ G++ I + + +CS CQ
Sbjct: 205 YLSGENTQSGGEVAFGGVDQNYYTGQIYWTPVTSETYWQIGIQGFSINGQASGWCSQGCQ 264
Query: 116 AIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
I DTGTSL+ P + A L + IGA NG V C ++ +P + + G +F L
Sbjct: 265 GIVDTGTSLLTAPQSIFASLMQDIGAQQDQNGEYVVSCSSIQNLPTISFTISGVSFPL 322
Score = 52.8 bits (125), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 22/32 (68%), Positives = 26/32 (81%)
Query: 9 YYGTISLGTPPQEFKVIFDTGSSNLWIPSQHC 40
YY IS+GTPPQ F V+FDTGSSNLW+ S +C
Sbjct: 65 YYEEISIGTPPQNFLVLFDTGSSNLWVASTNC 96
>gi|301606846|ref|XP_002933025.1| PREDICTED: gastricsin isoform 1 [Xenopus (Silurana) tropicalis]
Length = 383
Score = 90.9 bits (224), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 42/118 (35%), Positives = 68/118 (57%), Gaps = 3/118 (2%)
Query: 59 HFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCSN-CQ 115
+ + ++GGE+ FGGVD++ + G I ++PV+ + YWQ G++ I + + +CS CQ
Sbjct: 207 YLSGENTQSGGEVAFGGVDQNYYTGQIYWTPVTSETYWQIGIQGFSINGQASGWCSQGCQ 266
Query: 116 AIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
I DTGTSL+ P + A L + IGA NG V C ++ +P + + G +F L
Sbjct: 267 GIVDTGTSLLTAPQSIFASLMQDIGAQQDQNGEYVVSCSSIQNLPTISFTISGVSFPL 324
Score = 55.8 bits (133), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 23/32 (71%), Positives = 27/32 (84%)
Query: 9 YYGTISLGTPPQEFKVIFDTGSSNLWIPSQHC 40
YYG IS+GTPPQ F V+FDTGSSNLW+ S +C
Sbjct: 67 YYGEISIGTPPQNFLVLFDTGSSNLWVASTNC 98
>gi|183180254|gb|ACC44394.1| ASP-3 [Caenorhabditis remanei]
gi|183180258|gb|ACC44396.1| ASP-3 [Caenorhabditis remanei]
gi|183180266|gb|ACC44400.1| ASP-3 [Caenorhabditis remanei]
gi|183180268|gb|ACC44401.1| ASP-3 [Caenorhabditis remanei]
Length = 225
Score = 90.9 bits (224), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 45/115 (39%), Positives = 69/115 (60%), Gaps = 2/115 (1%)
Query: 61 RNASD-ENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN-CQAIA 118
R+A+D NGGEI D + + G+I + P+ YW+ + ++ I+ Y + +I
Sbjct: 58 RDANDIANGGEITLCDTDSNHYTGNIAWEPLVSMDYWRIKLGAVSIDGTTYTNGPIDSIV 117
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
DTGTSL+ GPS VI ++ IG +PL NG +V+C + K+PN+ LGG+NF L
Sbjct: 118 DTGTSLLTGPSDVIKKIQHKIGGIPLFNGEYEVECSKIPKLPNITFTLGGQNFDL 172
>gi|449503193|ref|XP_004161880.1| PREDICTED: aspartic proteinase-like [Cucumis sativus]
Length = 516
Score = 90.9 bits (224), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 44/84 (52%), Positives = 60/84 (71%), Gaps = 3/84 (3%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV--YCS-NCQAI 117
RNA +E GGEI+FGGVD D + G+ TY PV++KGYWQF + + I + +CS C AI
Sbjct: 236 RNADEEQGGEIVFGGVDPDHYKGEHTYVPVTKKGYWQFDMGDVLINGSTTGFCSGGCSAI 295
Query: 118 ADTGTSLIIGPSKVIAELNKLIGA 141
AD+GTSL+ GP+ +I ++N IGA
Sbjct: 296 ADSGTSLLAGPTTIITQVNHAIGA 319
Score = 65.1 bits (157), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 27/47 (57%), Positives = 36/47 (76%), Gaps = 2/47 (4%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+++ AQY+G I +GTPPQ+F VIFDTGSSNLW+PS C ++AC
Sbjct: 82 LKNYLNAQYFGEIGIGTPPQKFAVIFDTGSSNLWVPSSKC--FSVAC 126
>gi|444706401|gb|ELW47743.1| Cathepsin E [Tupaia chinensis]
Length = 396
Score = 90.9 bits (224), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 46/104 (44%), Positives = 66/104 (63%), Gaps = 3/104 (2%)
Query: 67 NGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI-EKNVYCSN-CQAIADTGTSL 124
G E++FGG D F G + + PV+++G+WQ ++ +++ + ++CS CQAI DTGTS
Sbjct: 225 TGSELIFGGYDCSHFSGSLNWIPVTKQGFWQIALDGVQVGDTMMFCSKGCQAIVDTGTSR 284
Query: 125 IIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGG 168
IIGP I L++ IGA L NG V+C NL MPNV I+ G
Sbjct: 285 IIGPLNKIERLHRAIGAT-LVNGIYFVECVNLTVMPNVTFIISG 327
Score = 54.7 bits (130), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 25/59 (42%), Positives = 38/59 (64%), Gaps = 2/59 (3%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSH 59
+ +S+ QYYGT+S+G+P Q F V+FDTGSS+ W+ S +C ++ AC F S+
Sbjct: 68 LTNSFNMQYYGTVSIGSPLQNFSVLFDTGSSDFWVTSVYC--ISPACEKHTKFFSSRSN 124
>gi|194858591|ref|XP_001969211.1| GG24067 [Drosophila erecta]
gi|190661078|gb|EDV58270.1| GG24067 [Drosophila erecta]
Length = 372
Score = 90.9 bits (224), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 41/114 (35%), Positives = 65/114 (57%), Gaps = 4/114 (3%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSNCQAIADT 120
R+ + GGE++FGG D + G +TY P+S +GYWQF + S I+ C +CQAIADT
Sbjct: 212 RDGTSTKGGELIFGGSDSSLYSGSLTYVPISEQGYWQFNMASSSIDGFSLCDDCQAIADT 271
Query: 121 GTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
GTSLI+ P + +++ +DC + +P++ +GG +F L+
Sbjct: 272 GTSLIVAPYNAYITIFEILD----VGEDGYLDCSTVSSLPDITFNIGGTDFVLS 321
Score = 58.5 bits (140), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 24/40 (60%), Positives = 30/40 (75%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHC 40
+ +S YYG IS+GTP Q FKV+FD+GSSNLW+PS C
Sbjct: 61 LSNSMNMAYYGAISIGTPAQSFKVLFDSGSSNLWVPSNTC 100
>gi|110162110|emb|CAL07969.1| aspartic proteinase [Cynara cardunculus]
Length = 506
Score = 90.9 bits (224), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 45/93 (48%), Positives = 64/93 (68%), Gaps = 3/93 (3%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCS-NCQAI 117
RN +E GGE++FGGVD + F G+ TY PV+RKGYWQF + + I + + +C+ C AI
Sbjct: 229 RNVDEEEGGELVFGGVDPNHFRGNHTYVPVTRKGYWQFEMGDVLIGDKSSGFCAGGCAAI 288
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAK 150
AD+GTSLI GP+ +I ++N+ IGA + N K
Sbjct: 289 ADSGTSLIAGPTAIITQINQAIGAKGVLNQQCK 321
Score = 58.9 bits (141), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 25/41 (60%), Positives = 31/41 (75%), Gaps = 1/41 (2%)
Query: 9 YYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSV 49
YYG I +GTPPQ F VIFDTGSS+LW+PS C ++AC +
Sbjct: 85 YYGEIGIGTPPQNFAVIFDTGSSDLWVPSSKC-YTSLACVI 124
>gi|449466825|ref|XP_004151126.1| PREDICTED: aspartic proteinase-like [Cucumis sativus]
Length = 513
Score = 90.9 bits (224), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 44/84 (52%), Positives = 60/84 (71%), Gaps = 3/84 (3%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV--YCS-NCQAI 117
RNA +E GGEI+FGGVD D + G+ TY PV++KGYWQF + + I + +CS C AI
Sbjct: 233 RNADEEQGGEIVFGGVDPDHYKGEHTYVPVTKKGYWQFDMGDVLINGSTTGFCSGGCSAI 292
Query: 118 ADTGTSLIIGPSKVIAELNKLIGA 141
AD+GTSL+ GP+ +I ++N IGA
Sbjct: 293 ADSGTSLLAGPTTIITQVNHAIGA 316
Score = 65.1 bits (157), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 27/47 (57%), Positives = 36/47 (76%), Gaps = 2/47 (4%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+++ AQY+G I +GTPPQ+F VIFDTGSSNLW+PS C ++AC
Sbjct: 82 LKNYLNAQYFGEIGIGTPPQKFAVIFDTGSSNLWVPSSKC--FSVAC 126
>gi|56971213|gb|AAH88063.1| LOC496913 protein, partial [Xenopus (Silurana) tropicalis]
Length = 380
Score = 90.9 bits (224), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 42/118 (35%), Positives = 68/118 (57%), Gaps = 3/118 (2%)
Query: 59 HFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCSN-CQ 115
+ + ++GGE+ FGGVD++ + G I ++PV+ + YWQ G++ I + + +CS CQ
Sbjct: 204 YLSGENTQSGGEVAFGGVDQNYYTGQIYWTPVTSETYWQIGIQGFSINGQASGWCSQGCQ 263
Query: 116 AIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
I DTGTSL+ P + A L + IGA NG V C ++ +P + + G +F L
Sbjct: 264 GIVDTGTSLLTAPQSIFASLMQDIGAQQDQNGEYVVSCSSIQNLPTISFTISGVSFPL 321
Score = 55.8 bits (133), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 23/32 (71%), Positives = 27/32 (84%)
Query: 9 YYGTISLGTPPQEFKVIFDTGSSNLWIPSQHC 40
YYG IS+GTPPQ F V+FDTGSSNLW+ S +C
Sbjct: 64 YYGEISIGTPPQNFLVLFDTGSSNLWVASTNC 95
>gi|301606850|ref|XP_002933027.1| PREDICTED: gastricsin isoform 3 [Xenopus (Silurana) tropicalis]
Length = 380
Score = 90.5 bits (223), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 43/117 (36%), Positives = 68/117 (58%), Gaps = 3/117 (2%)
Query: 60 FRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCSN-CQA 116
F + ++GGE+ FGGVD++ + G I ++PV+ + YWQ G++ I + + +CS CQ
Sbjct: 206 FYLKNTQSGGEVAFGGVDQNYYTGQIYWTPVTSETYWQIGIQGFSINGQASGWCSQGCQG 265
Query: 117 IADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
I DTGTSL+ P + A L + IGA NG V C ++ +P + + G +F L
Sbjct: 266 IVDTGTSLLTAPQSIFASLMQDIGAQQDQNGEYVVSCSSIQNLPTISFTISGVSFPL 322
Score = 55.8 bits (133), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 23/32 (71%), Positives = 27/32 (84%)
Query: 9 YYGTISLGTPPQEFKVIFDTGSSNLWIPSQHC 40
YYG IS+GTPPQ F V+FDTGSSNLW+ S +C
Sbjct: 67 YYGEISIGTPPQNFLVLFDTGSSNLWVASTNC 98
>gi|301606848|ref|XP_002933026.1| PREDICTED: gastricsin isoform 2 [Xenopus (Silurana) tropicalis]
Length = 382
Score = 90.5 bits (223), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 42/118 (35%), Positives = 68/118 (57%), Gaps = 3/118 (2%)
Query: 59 HFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCSN-CQ 115
+ + ++GGE+ FGGVD++ + G I ++PV+ + YWQ G++ I + + +CS CQ
Sbjct: 207 YLSGENTQSGGEVAFGGVDQNYYTGQIYWTPVTSETYWQIGIQGFSINGQASGWCSQGCQ 266
Query: 116 AIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
I DTGTSL+ P + A L + IGA NG V C ++ +P + + G +F L
Sbjct: 267 GIVDTGTSLLTAPQSIFASLMQDIGAQQDQNGEYVVSCSSIQNLPTISFTISGVSFPL 324
Score = 55.8 bits (133), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 23/32 (71%), Positives = 27/32 (84%)
Query: 9 YYGTISLGTPPQEFKVIFDTGSSNLWIPSQHC 40
YYG IS+GTPPQ F V+FDTGSSNLW+ S +C
Sbjct: 67 YYGEISIGTPPQNFLVLFDTGSSNLWVASTNC 98
>gi|366991455|ref|XP_003675493.1| hypothetical protein NCAS_0C01360 [Naumovozyma castellii CBS 4309]
gi|342301358|emb|CCC69126.1| hypothetical protein NCAS_0C01360 [Naumovozyma castellii CBS 4309]
Length = 406
Score = 90.5 bits (223), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 50/127 (39%), Positives = 70/127 (55%), Gaps = 1/127 (0%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI-EK 107
+D+ KF + ++NGGEI GG+D+ KF GDI + PV RK YW+ E I + ++
Sbjct: 224 LDEKKFAFYLGDTKKDEKNGGEITIGGIDESKFKGDIEWLPVRRKAYWEVKFEGIALGDQ 283
Query: 108 NVYCSNCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILG 167
N A DTGTSLI PS + +N IGA G +DCD D +P++
Sbjct: 284 YAALENHGAAIDTGTSLITLPSGLAEIINTEIGAKKGWTGQYTLDCDTRDGLPDLTFNFN 343
Query: 168 GKNFTLT 174
GKNFT++
Sbjct: 344 GKNFTIS 350
Score = 62.4 bits (150), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 27/41 (65%), Positives = 33/41 (80%), Gaps = 2/41 (4%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQY+ IS+GTPPQ FKVI DTGSSNLW+PS C+ ++AC
Sbjct: 90 AQYFADISVGTPPQNFKVILDTGSSNLWVPSSECN--SLAC 128
>gi|308512535|ref|XP_003118450.1| CRE-ASP-3 protein [Caenorhabditis remanei]
gi|308239096|gb|EFO83048.1| CRE-ASP-3 protein [Caenorhabditis remanei]
Length = 397
Score = 90.5 bits (223), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 44/115 (38%), Positives = 69/115 (60%), Gaps = 2/115 (1%)
Query: 61 RNASD-ENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN-CQAIA 118
R+A+D NGGEI D + + G+I + P+ YW+ + ++ I+ Y + +I
Sbjct: 219 RDANDIANGGEITLCDTDSNHYTGNIAWEPLVSMDYWRIKLGAVSIDGTTYTNGPIDSIV 278
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
DTGTSL+ GP+ VI ++ IG +PL NG +V+C + K+PN+ LGG+NF L
Sbjct: 279 DTGTSLLTGPTDVIKKIQHKIGGIPLFNGEYEVECSKIPKLPNITFTLGGQNFDL 333
Score = 67.0 bits (162), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 27/43 (62%), Positives = 35/43 (81%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSV 49
AQYYG I++GTPPQ F+V+FDTGSSNLW+P +C +IAC +
Sbjct: 67 AQYYGPITIGTPPQNFQVLFDTGSSNLWVPCANCPFGDIACRM 109
>gi|224050910|ref|XP_002199093.1| PREDICTED: cathepsin D [Taeniopygia guttata]
Length = 396
Score = 90.5 bits (223), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 50/134 (37%), Positives = 76/134 (56%), Gaps = 7/134 (5%)
Query: 48 SVDKDKFVGHSHF-----RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVES 102
+V K K V + F R+ S GGE++ GG D + G+ ++ V+RK YWQ ++S
Sbjct: 203 NVMKQKLVEKNMFSFYLNRDPSGVPGGEMVLGGTDPKYYKGEFSWFNVTRKAYWQIHMDS 262
Query: 103 IKIEK--NVYCSNCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMP 160
+ + V C+AI DTGTSLI GP+K + ++ + IGA PL G + C+ + +P
Sbjct: 263 VDVGNGPTVCEGGCEAIVDTGTSLITGPTKEVKKIQEAIGAKPLIKGEYMIPCEKVPTLP 322
Query: 161 NVDIILGGKNFTLT 174
V + +GGK F LT
Sbjct: 323 VVSMNIGGKTFGLT 336
Score = 77.4 bits (189), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 32/49 (65%), Positives = 40/49 (81%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSV 49
+++ AQY+G I +GTPPQ F VIFDTGSSNLW+PS HCS+L+IAC V
Sbjct: 68 LKNYMDAQYFGVIGIGTPPQNFTVIFDTGSSNLWVPSVHCSLLDIACMV 116
>gi|224056377|ref|XP_002298827.1| predicted protein [Populus trichocarpa]
gi|222846085|gb|EEE83632.1| predicted protein [Populus trichocarpa]
Length = 494
Score = 90.5 bits (223), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 45/93 (48%), Positives = 64/93 (68%), Gaps = 3/93 (3%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYC-SNCQAI 117
RNA ++ GGEI+FGGVD D + G+ TY PV++KGYWQF + + I + + +C S C AI
Sbjct: 214 RNADEKEGGEIVFGGVDPDHYKGEHTYVPVTQKGYWQFDMGDVLIGGQTSGFCASGCAAI 273
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAK 150
AD+GTSL+ GP+ +I E+N IGA + + K
Sbjct: 274 ADSGTSLLAGPTTIITEVNHAIGATGVVSQECK 306
Score = 65.9 bits (159), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 35/76 (46%), Positives = 45/76 (59%), Gaps = 6/76 (7%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHF 60
+++ AQY+G I +GTPPQ+F VIFDTGSSNLW+PS C ++AC HS
Sbjct: 62 LKNYMDAQYFGEIGIGTPPQKFTVIFDTGSSNLWVPSSKC-YFSVACYFHSKYKSSHSR- 119
Query: 61 RNASDENG--GEIMFG 74
ENG EI +G
Sbjct: 120 --TYKENGKSAEIHYG 133
Score = 40.8 bits (94), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 17/45 (37%), Positives = 27/45 (60%)
Query: 130 KVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
+++ +N+L +P G + VDCD L MPNV +GG+ F L+
Sbjct: 390 RILDYVNELCERLPSPMGESAVDCDGLSSMPNVSFTIGGRVFELS 434
>gi|149725197|ref|XP_001502028.1| PREDICTED: pepsin A-like [Equus caballus]
Length = 387
Score = 90.5 bits (223), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 48/127 (37%), Positives = 73/127 (57%), Gaps = 5/127 (3%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-K 107
V +D F S + ++ DE+G +MFGG+D + G + + PV+ +GYWQ V+SI I +
Sbjct: 207 VSQDLF---SVYLSSDDESGSVVMFGGIDSSYYTGSLHWVPVTTEGYWQIAVDSITINGE 263
Query: 108 NVYCS-NCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
++ CS CQAI DTGTSL+ GP+ I + IGA G + C +D +P++ +
Sbjct: 264 SIACSGGCQAIVDTGTSLLAGPTSGIDNIQSYIGARKDLLGEGVISCSAIDSLPDIVFTM 323
Query: 167 GGKNFTL 173
G F L
Sbjct: 324 NGVEFPL 330
Score = 68.2 bits (165), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 31/47 (65%), Positives = 39/47 (82%), Gaps = 3/47 (6%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKF 54
+Y+GTIS+GTPPQEF VIFDTGSSNLW+PS +CS ++AC D +F
Sbjct: 74 EYFGTISIGTPPQEFTVIFDTGSSNLWVPSTYCS--SLAC-YDHKRF 117
>gi|119592251|gb|EAW71845.1| hCG1733572, isoform CRA_a [Homo sapiens]
Length = 449
Score = 90.5 bits (223), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 46/126 (36%), Positives = 73/126 (57%), Gaps = 3/126 (2%)
Query: 52 DKFVGHSHF-RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI-EKNV 109
DK V +F R+ +GGE++ GG D ++ +T+ PV+ YWQ +E +K+ +
Sbjct: 213 DKPVFSFYFNRDPEVADGGELVLGGSDPAHYIPPLTFVPVTVPAYWQIHMERVKVGSRLT 272
Query: 110 YCSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGG 168
C+ C AI DTGT +I+GP++ I L+ IG +PL G + C + K+P V +++GG
Sbjct: 273 LCAQGCAAILDTGTPVIVGPTEEIRALHAAIGGIPLLAGEYIIRCSEIPKLPAVSLLIGG 332
Query: 169 KNFTLT 174
F LT
Sbjct: 333 VWFNLT 338
Score = 65.9 bits (159), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 26/41 (63%), Positives = 31/41 (75%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQY+G I LGTPPQ F V FDTGSSNLW+PS+ C ++ C
Sbjct: 76 AQYFGEIGLGTPPQNFTVAFDTGSSNLWVPSRRCHFFSVPC 116
>gi|297809619|ref|XP_002872693.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297318530|gb|EFH48952.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 507
Score = 90.5 bits (223), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 48/101 (47%), Positives = 62/101 (61%), Gaps = 3/101 (2%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCSN-CQAI 117
RN D+ GGEI+FGGVD F G+ TY PV+ KGYWQF + ++I + YC+ C AI
Sbjct: 230 RNPQDQEGGEIVFGGVDPKHFKGEHTYVPVTHKGYWQFDMGDLQIAGKPTGYCAKGCSAI 289
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDK 158
AD+GTSL+ GPS VI +N IGA + + K D K
Sbjct: 290 ADSGTSLLTGPSTVITMINHAIGAQGIVSRECKAVVDQYGK 330
Score = 67.8 bits (164), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 30/47 (63%), Positives = 38/47 (80%), Gaps = 1/47 (2%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+++ AQYYG I++GTPPQ+F VIFDTGSSNLWIPS C L++AC
Sbjct: 78 LKNYLDAQYYGDITIGTPPQKFTVIFDTGSSNLWIPSTKC-YLSVAC 123
>gi|291409616|ref|XP_002721074.1| PREDICTED: pepsin II-4-like [Oryctolagus cuniculus]
Length = 387
Score = 90.5 bits (223), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 47/127 (37%), Positives = 78/127 (61%), Gaps = 5/127 (3%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-K 107
V +D F S + ++ D+ G +MFGG+D+ + G + + PVS +GYWQF ++S+ I K
Sbjct: 207 VSEDLF---SVYLSSDDKKGSLVMFGGIDESYYKGSLHWVPVSYEGYWQFTMDSVTINGK 263
Query: 108 NVYCSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
+ C++ CQAI DTGTSL+ GP+ I+++ + I A + G A V C ++ +P+V +
Sbjct: 264 TIACADSCQAIIDTGTSLLAGPTNAISKIQRHIRAYDNSEGEAIVKCSDVKSLPDVVFTI 323
Query: 167 GGKNFTL 173
G + L
Sbjct: 324 HGVKYPL 330
Score = 70.1 bits (170), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 31/51 (60%), Positives = 41/51 (80%), Gaps = 2/51 (3%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDK 51
M + A+Y+GTIS+GTPPQ+F VIFDTGSSNLW+PS +CS ++AC+ K
Sbjct: 67 MENYMDAEYFGTISIGTPPQDFTVIFDTGSSNLWVPSIYCS--SLACAFHK 115
>gi|109125662|ref|XP_001116026.1| PREDICTED: napsin-A-like [Macaca mulatta]
Length = 421
Score = 90.5 bits (223), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 44/116 (37%), Positives = 65/116 (56%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVY--CSNCQAIA 118
R+ + +GGE++ GG D ++ +T+ PV+ YWQ +E +K+ + C AI
Sbjct: 224 RDPEEPDGGELVLGGSDPAHYIPPLTFVPVTVPAYWQIHMERVKVGPGLTLCVRGCAAIL 283
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSLI GP++ I L+ IG PL G + C + K+P V +LGG F LT
Sbjct: 284 DTGTSLITGPTEEIRALHAAIGGYPLLAGEYIILCSEIPKLPAVSFLLGGVWFNLT 339
Score = 65.9 bits (159), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 27/46 (58%), Positives = 35/46 (76%), Gaps = 1/46 (2%)
Query: 3 SSYQ-AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
S+Y+ QY+G I LGTPPQ F V+FDTGSSNLW+PS+ C ++ C
Sbjct: 72 SNYRDVQYFGKIGLGTPPQNFTVVFDTGSSNLWVPSRRCHFFSVPC 117
>gi|190014570|dbj|BAG48263.1| pepsinogen 1 [Thunnus orientalis]
Length = 378
Score = 90.5 bits (223), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 50/127 (39%), Positives = 76/127 (59%), Gaps = 6/127 (4%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKN 108
V++D F S + ++ G + FGG D + + G IT+ P+S + YWQ V+S+ +
Sbjct: 203 VNQDIF---SVYLSSHSAQGSVVTFGGTDPNHYTGPITWIPLSNELYWQITVDSVTVNGQ 259
Query: 109 VY-CSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
V CS CQAI DTGTSLI+GP I+ +N +GA NG V+C+N+ +MP+V +
Sbjct: 260 VVACSGGCQAIVDTGTSLIVGPQSSISNINSFVGASS-QNGDYIVNCNNIGQMPHVVFHI 318
Query: 167 GGKNFTL 173
G+ FT+
Sbjct: 319 QGQEFTI 325
Score = 64.7 bits (156), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 30/46 (65%), Positives = 35/46 (76%), Gaps = 3/46 (6%)
Query: 9 YYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKF 54
YYG IS+GTPPQ FKVIFD+GSSNLW+PS +C N A + DKF
Sbjct: 72 YYGIISIGTPPQSFKVIFDSGSSNLWVPSVYC---NSAACNNHDKF 114
>gi|6561816|gb|AAF17080.1| aspartyl protease 3 [Homo sapiens]
Length = 450
Score = 90.5 bits (223), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 46/126 (36%), Positives = 73/126 (57%), Gaps = 3/126 (2%)
Query: 52 DKFVGHSHF-RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI-EKNV 109
DK V +F R+ +GGE++ GG D ++ +T+ PV+ YWQ +E +K+ +
Sbjct: 213 DKPVFSFYFNRDPEVADGGELVLGGSDPAHYIPPLTFVPVTVPAYWQIHMERVKVGSRLT 272
Query: 110 YCSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGG 168
C+ C AI DTGT +I+GP++ I L+ IG +PL G + C + K+P V +++GG
Sbjct: 273 LCAQGCAAILDTGTPVIVGPTEEIRALHAAIGGIPLLAGEYIIRCSEIPKLPAVSLLIGG 332
Query: 169 KNFTLT 174
F LT
Sbjct: 333 VWFNLT 338
Score = 65.9 bits (159), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 26/41 (63%), Positives = 31/41 (75%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQY+G I LGTPPQ F V FDTGSSNLW+PS+ C ++ C
Sbjct: 76 AQYFGEIGLGTPPQNFTVAFDTGSSNLWVPSRRCHFFSVPC 116
>gi|51534968|dbj|BAD36917.1| pepsinogen C [Mus caroli]
Length = 377
Score = 90.5 bits (223), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 43/115 (37%), Positives = 74/115 (64%), Gaps = 4/115 (3%)
Query: 67 NGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCSN--CQAIADTGT 122
NGG+I+FGGVD++ + G++T+ PV+++ YWQ ++ I + + +CS+ CQ I DTGT
Sbjct: 209 NGGQIVFGGVDENLYTGELTWIPVTQELYWQITIDDFLIGNQASGWCSSSGCQGIVDTGT 268
Query: 123 SLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLTRTN 177
SL++ P++ ++EL + IGA G V CD + +P + +L G F L+ ++
Sbjct: 269 SLLVMPAQYLSELLQTIGAQEGEYGQYFVSCDTVSSLPTLTFVLNGVQFPLSPSS 323
Score = 57.4 bits (137), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 24/34 (70%), Positives = 28/34 (82%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHC 40
A YYG IS+GTPPQ F V+FDTGSSNLW+ S +C
Sbjct: 59 ASYYGEISIGTPPQNFLVLFDTGSSNLWVSSVYC 92
>gi|391852923|ref|NP_001254696.1| pepsin A precursor [Callithrix jacchus]
gi|18203305|sp|Q9N2D4.1|PEPA_CALJA RecName: Full=Pepsin A; Flags: Precursor
gi|7008021|dbj|BAA90871.1| pepsinogen A [Callithrix jacchus]
Length = 387
Score = 90.5 bits (223), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 46/125 (36%), Positives = 74/125 (59%), Gaps = 5/125 (4%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-K 107
V +D F S + +++D++G +MFGG+D + G + + PVS +GYWQ V+SI + +
Sbjct: 207 VSQDLF---SVYLSSNDQSGSVVMFGGIDSSYYTGSLNWVPVSAEGYWQITVDSITMNGE 263
Query: 108 NVYCSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
+ C+ CQAI DTGTSL+ GP+ IA + IGA +NG V C + +P++ +
Sbjct: 264 AIACAEGCQAIVDTGTSLLSGPTSPIANIQSYIGASENSNGEMVVSCSAISSLPDIVFTI 323
Query: 167 GGKNF 171
G +
Sbjct: 324 NGIQY 328
Score = 60.8 bits (146), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 25/34 (73%), Positives = 30/34 (88%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCS 41
+Y+GTI +GTP QEF VIFDTGSSNLW+PS +CS
Sbjct: 74 EYFGTIGIGTPAQEFTVIFDTGSSNLWVPSIYCS 107
>gi|109675118|gb|ABG37021.1| aspartic protease [Nicotiana tabacum]
Length = 508
Score = 90.5 bits (223), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 46/99 (46%), Positives = 65/99 (65%), Gaps = 7/99 (7%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCS-NCQAI 117
RNA +E GGE++FGGVD + F G TY PV+ KGYWQF + + + E +CS C AI
Sbjct: 229 RNAQEEEGGELVFGGVDPNHFKGKHTYVPVTHKGYWQFDMGDVLVGGETTGFCSGGCSAI 288
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNL 156
AD+GTSL+ GP+ +I ++N +IG A+G +C +L
Sbjct: 289 ADSGTSLLAGPTTIITQINHVIG----ASGVVSQECKSL 323
Score = 63.2 bits (152), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 28/58 (48%), Positives = 39/58 (67%), Gaps = 1/58 (1%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHS 58
+++ AQY+G I +G+PPQ+F VIFDTGSSNLW+PS C ++AC + HS
Sbjct: 77 LKNYLDAQYFGEICIGSPPQKFTVIFDTGSSNLWVPSARC-YFSLACYLHPKYKSSHS 133
Score = 36.2 bits (82), Expect = 5.8, Method: Compositional matrix adjust.
Identities = 16/40 (40%), Positives = 23/40 (57%)
Query: 135 LNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
+N+L +P G + VDC +L MPNV +G + F LT
Sbjct: 409 VNQLCDRLPSPMGESAVDCSSLASMPNVSFTVGNQTFGLT 448
>gi|162944764|gb|ABY20451.1| IP19142p [Drosophila melanogaster]
Length = 418
Score = 90.5 bits (223), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 49/116 (42%), Positives = 67/116 (57%), Gaps = 3/116 (2%)
Query: 61 RNASD-ENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSNCQAIAD 119
RNASD NGG ++ GG D + G +TY PVS+ G+WQ V ++I CSNCQAI D
Sbjct: 237 RNASDASNGGVLLLGGSDPTLYSGCLTYVPVSKVGFWQITVGQVEIGSKKLCSNCQAIFD 296
Query: 120 TGTSLIIGPSKVIAELNKLIG--AVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
GTSLII P + +NK +G +G +DC + +P + +G K+FTL
Sbjct: 297 MGTSLIIVPCPALKIINKKLGIKETDRKDGVYIIDCKKVSHLPKIVFNIGWKDFTL 352
>gi|24580865|ref|NP_722705.1| CG31928, isoform A [Drosophila melanogaster]
gi|442625179|ref|NP_001259869.1| CG31928, isoform B [Drosophila melanogaster]
gi|7296077|gb|AAF51372.1| CG31928, isoform A [Drosophila melanogaster]
gi|162944774|gb|ABY20456.1| IP19242p [Drosophila melanogaster]
gi|162944778|gb|ABY20458.1| IP19348p [Drosophila melanogaster]
gi|162951741|gb|ABY21732.1| IP19042p [Drosophila melanogaster]
gi|440213127|gb|AGB92406.1| CG31928, isoform B [Drosophila melanogaster]
Length = 418
Score = 90.1 bits (222), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 49/116 (42%), Positives = 67/116 (57%), Gaps = 3/116 (2%)
Query: 61 RNASD-ENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSNCQAIAD 119
RNASD NGG ++ GG D + G +TY PVS+ G+WQ V ++I CSNCQAI D
Sbjct: 237 RNASDASNGGVLLLGGSDPTLYSGCLTYVPVSKVGFWQITVGQVEIGSKKLCSNCQAIFD 296
Query: 120 TGTSLIIGPSKVIAELNKLIG--AVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
GTSLII P + +NK +G +G +DC + +P + +G K+FTL
Sbjct: 297 MGTSLIIVPCPALKIINKKLGIKETDRKDGVYIIDCKKVSHLPKIVFNIGWKDFTL 352
>gi|73621386|sp|Q9GMY8.1|PEPA_SORUN RecName: Full=Pepsin A; Flags: Precursor
gi|9798656|dbj|BAB11750.1| pepsinogen A [Sorex unguiculatus]
Length = 387
Score = 90.1 bits (222), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 45/127 (35%), Positives = 76/127 (59%), Gaps = 5/127 (3%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-K 107
V +D F S + +++D++G +MFGG+D + G + + P+S +GYWQ V+SI + +
Sbjct: 207 VSQDLF---SVYLSSNDQSGSVVMFGGIDSSYYTGSLNWVPLSSEGYWQITVDSITMNGQ 263
Query: 108 NVYCSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
++ C+ CQAI DTGTSL+ GP+ IA + IGA + G V C ++ +P++ +
Sbjct: 264 SIACNGGCQAIVDTGTSLLSGPTNAIANIQSKIGASQNSQGQMAVSCSSIKNLPDIVFTI 323
Query: 167 GGKNFTL 173
G + L
Sbjct: 324 NGIQYPL 330
Score = 66.2 bits (160), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 30/41 (73%), Positives = 36/41 (87%), Gaps = 2/41 (4%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACS 48
+Y+GTIS+GTPPQEF VIFDTGSSNLW+PS +CS + ACS
Sbjct: 74 EYFGTISIGTPPQEFTVIFDTGSSNLWVPSIYCS--SPACS 112
>gi|163502|gb|AAA30691.1| pepsin, partial [Bos taurus]
Length = 120
Score = 90.1 bits (222), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 41/109 (37%), Positives = 68/109 (62%), Gaps = 2/109 (1%)
Query: 65 DENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-KNVYCSN-CQAIADTGT 122
DE+G +MFGG+D + G + + PVS +GYWQ ++SI ++ + + C++ CQAI DTGT
Sbjct: 2 DESGSVVMFGGIDSSYYTGSLNWVPVSYEGYWQITLDSITMDGETIACADSCQAIVDTGT 61
Query: 123 SLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNF 171
SL+ GP+ I+ + IGA ++G V C ++ +PN+ + G +
Sbjct: 62 SLLAGPTSAISNIQSYIGASENSDGEMIVSCSSMYSLPNIVFTINGVQY 110
>gi|149725191|ref|XP_001501954.1| PREDICTED: pepsin A-like [Equus caballus]
Length = 387
Score = 90.1 bits (222), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 47/128 (36%), Positives = 76/128 (59%), Gaps = 5/128 (3%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-K 107
V +D F S + ++ DE+G +MFGG+D + G + + PVS + YWQ V+SI + +
Sbjct: 207 VSQDLF---SVYLSSDDESGSVVMFGGIDSSYYSGSLNWVPVSEEAYWQITVDSITMNGE 263
Query: 108 NVYCSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
++ CS CQAI DTGTSL+ GP+ I + IGA ++G A + C ++ +P++ +
Sbjct: 264 SIACSGGCQAIVDTGTSLLAGPTSGIDNIQSYIGASEDSSGEAVISCSSIYSLPDIVFTI 323
Query: 167 GGKNFTLT 174
G F L+
Sbjct: 324 NGVEFPLS 331
Score = 67.8 bits (164), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 31/47 (65%), Positives = 40/47 (85%), Gaps = 3/47 (6%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKF 54
+Y+GTIS+GTP QEF VIFDTGSSNLW+PS +CS ++ACS D ++F
Sbjct: 74 EYFGTISIGTPAQEFTVIFDTGSSNLWVPSTYCS--SLACS-DHNRF 117
>gi|240273649|gb|EER37169.1| aspartic endopeptidase Pep2 [Ajellomyces capsulatus H143]
Length = 226
Score = 90.1 bits (222), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 47/110 (42%), Positives = 66/110 (60%), Gaps = 1/110 (0%)
Query: 65 DENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSNCQAIADTGTS 123
D++ E++FGG++KD+F G++T P+ RK YW+ ++SI K +N I DTGTS
Sbjct: 60 DDDQSEVVFGGMNKDRFTGELTKIPLRRKAYWEVDLDSITFGKQTAMMTNTGVILDTGTS 119
Query: 124 LIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
LI PS + LNK IGA NG V+C D +PN+ L G NFT+
Sbjct: 120 LIALPSTIAELLNKEIGAKKSFNGQYTVECAKRDSLPNLTFGLSGHNFTI 169
>gi|195473093|ref|XP_002088830.1| GE10927 [Drosophila yakuba]
gi|194174931|gb|EDW88542.1| GE10927 [Drosophila yakuba]
Length = 372
Score = 90.1 bits (222), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 40/113 (35%), Positives = 64/113 (56%), Gaps = 4/113 (3%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSNCQAIADT 120
R+ + GGE++ GG D + G +TY P+S +GYWQF ++ ++ C +CQAIADT
Sbjct: 212 RDGTSTFGGELILGGSDSSLYSGSLTYVPISEQGYWQFTMDGSSVDGESLCDDCQAIADT 271
Query: 121 GTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
GTSLI+ P L++++ +DC + +P+V +GG +F L
Sbjct: 272 GTSLIVAPYNAYITLSEILN----VGNDGYLDCSTVSSLPDVTFNIGGTDFVL 320
Score = 57.8 bits (138), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 23/40 (57%), Positives = 30/40 (75%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHC 40
+ +S YYG IS+GTP Q FKV+FD+GS+NLW+PS C
Sbjct: 61 LSNSMNMAYYGAISIGTPAQSFKVLFDSGSANLWVPSNTC 100
>gi|119592252|gb|EAW71846.1| hCG1733572, isoform CRA_b [Homo sapiens]
Length = 512
Score = 90.1 bits (222), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 46/126 (36%), Positives = 73/126 (57%), Gaps = 3/126 (2%)
Query: 52 DKFVGHSHF-RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI-EKNV 109
DK V +F R+ +GGE++ GG D ++ +T+ PV+ YWQ +E +K+ +
Sbjct: 276 DKPVFSFYFNRDPEVADGGELVLGGSDPAHYIPPLTFVPVTVPAYWQIHMERVKVGSRLT 335
Query: 110 YCSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGG 168
C+ C AI DTGT +I+GP++ I L+ IG +PL G + C + K+P V +++GG
Sbjct: 336 LCAQGCAAILDTGTPVIVGPTEEIRALHAAIGGIPLLAGEYIIRCSEIPKLPAVSLLIGG 395
Query: 169 KNFTLT 174
F LT
Sbjct: 396 VWFNLT 401
Score = 66.2 bits (160), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 26/41 (63%), Positives = 31/41 (75%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQY+G I LGTPPQ F V FDTGSSNLW+PS+ C ++ C
Sbjct: 76 AQYFGEIGLGTPPQNFTVAFDTGSSNLWVPSRRCHFFSVPC 116
>gi|194218276|ref|XP_001501986.2| PREDICTED: pepsin A-like [Equus caballus]
Length = 387
Score = 90.1 bits (222), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 48/127 (37%), Positives = 73/127 (57%), Gaps = 5/127 (3%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-K 107
V +D F S + ++ DE+G +MFGG+D + G + + PV+ +GYWQ V+SI I +
Sbjct: 207 VSQDLF---SVYLSSDDESGSVVMFGGIDSSYYTGSLHWVPVTTEGYWQIAVDSITINGE 263
Query: 108 NVYCS-NCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
++ CS CQAI DTGTSL+ GP+ I + IGA G + C +D +P++ +
Sbjct: 264 SIACSGGCQAIVDTGTSLLAGPTSGIDNIQSYIGARKDLLGEEVISCSAIDSLPDIVFTM 323
Query: 167 GGKNFTL 173
G F L
Sbjct: 324 NGVEFPL 330
Score = 68.2 bits (165), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 31/47 (65%), Positives = 39/47 (82%), Gaps = 3/47 (6%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKF 54
+Y+GTIS+GTPPQEF VIFDTGSSNLW+PS +CS ++AC D +F
Sbjct: 74 EYFGTISIGTPPQEFTVIFDTGSSNLWVPSTYCS--SLAC-YDHKRF 117
>gi|281347482|gb|EFB23066.1| hypothetical protein PANDA_018737 [Ailuropoda melanoleuca]
Length = 270
Score = 90.1 bits (222), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 43/116 (37%), Positives = 70/116 (60%), Gaps = 2/116 (1%)
Query: 58 SHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYC--SNCQ 115
+ + + +E G +MFGGVD + G++ + PVSR YWQ ++SI + V S C+
Sbjct: 147 AFYLSKKEEEGSVVMFGGVDHSYYSGELNWVPVSRPFYWQLSMDSISMNGVVIACDSGCE 206
Query: 116 AIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNF 171
AI DTGTSL+IGPS ++ + K+I A +G +DCD ++ +P++ + G N+
Sbjct: 207 AIIDTGTSLLIGPSDIVFNIQKIINANQSYSGEYIIDCDAVNTLPDIVFTINGINY 262
Score = 63.9 bits (154), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 27/36 (75%), Positives = 32/36 (88%)
Query: 6 QAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCS 41
Q Y GTIS+GTPPQEFKVIFDTGS++LW+PS +CS
Sbjct: 6 QLAYVGTISIGTPPQEFKVIFDTGSTDLWVPSIYCS 41
>gi|444513055|gb|ELV10247.1| Pepsin A [Tupaia chinensis]
Length = 396
Score = 90.1 bits (222), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 42/114 (36%), Positives = 71/114 (62%), Gaps = 2/114 (1%)
Query: 62 NASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-KNVYCS-NCQAIAD 119
+++DE+G ++FGG+D + G + + P+S +GYWQ V+SI + + + CS +CQAI D
Sbjct: 226 HSNDESGSVVIFGGIDSSYYTGSLNWVPLSAEGYWQITVDSITMNGQPIACSGSCQAIVD 285
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
TGTSL+ GP+ IA + IGA +NG + C ++ +P++ + G + L
Sbjct: 286 TGTSLLSGPTNAIANIQSYIGASQNSNGEMVISCSAINNLPDIVFTINGVQYPL 339
Score = 61.6 bits (148), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 29/47 (61%), Positives = 38/47 (80%), Gaps = 3/47 (6%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKF 54
+Y+GTI +GTP QEF VIFDTGSSNLW+PS +CS + ACS + ++F
Sbjct: 68 EYFGTIGIGTPAQEFTVIFDTGSSNLWVPSVYCS--SPACS-NHNRF 111
>gi|494476|pdb|1PSA|A Chain A, Structure Of A Pepsin(Slash)renin Inhibitor Complex
Reveals A Novel Crystal Packing Induced By Minor
Chemical Alterations In The Inhibitor
gi|494478|pdb|1PSA|B Chain B, Structure Of A Pepsin(Slash)renin Inhibitor Complex
Reveals A Novel Crystal Packing Induced By Minor
Chemical Alterations In The Inhibitor
gi|67463919|pdb|1YX9|A Chain A, Effect Of Dimethyl Sulphoxide On The Crystal Structure Of
Porcine Pepsin
Length = 326
Score = 90.1 bits (222), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 44/128 (34%), Positives = 78/128 (60%), Gaps = 5/128 (3%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-K 107
V +D F S + +++D++G ++ GG+D + G + + PVS +GYWQ ++SI ++ +
Sbjct: 146 VSQDLF---SVYLSSNDDSGSVVLLGGIDSSYYTGSLNWVPVSVEGYWQITLDSITMDGE 202
Query: 108 NVYCSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
+ CS CQAI DTGTSL+ GP+ IA + IGA ++G + C ++D +P++ +
Sbjct: 203 TIACSGGCQAIVDTGTSLLTGPTSAIANIQSDIGASENSDGEMVISCSSIDSLPDIVFTI 262
Query: 167 GGKNFTLT 174
G + L+
Sbjct: 263 NGVQYPLS 270
Score = 63.5 bits (153), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 29/54 (53%), Positives = 41/54 (75%), Gaps = 3/54 (5%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKF 54
+ + +Y+GTI +GTP Q+F VIFDTGSSNLW+PS +CS ++ACS D ++F
Sbjct: 6 LENYLDTEYFGTIGIGTPAQDFTVIFDTGSSNLWVPSVYCS--SLACS-DHNQF 56
>gi|410974069|ref|XP_003993470.1| PREDICTED: pepsin A-like [Felis catus]
Length = 387
Score = 90.1 bits (222), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 46/127 (36%), Positives = 76/127 (59%), Gaps = 5/127 (3%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-K 107
V +D F S + + +D++G +MFGG+D + G++ + PVS +GYWQ V+SI + +
Sbjct: 207 VSQDLF---SVYLSGNDQSGSVVMFGGIDSSYYTGNLNWIPVSVEGYWQISVDSITMNGQ 263
Query: 108 NVYCSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
++ C+ CQAI DTGTSL+ GPS IA + IGA + G + C ++ +P++ +
Sbjct: 264 SIACNGGCQAIVDTGTSLLTGPSNAIANIQSDIGASQNSYGQMGISCSAINNLPDIVFTI 323
Query: 167 GGKNFTL 173
G + L
Sbjct: 324 NGNEYPL 330
Score = 61.2 bits (147), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 24/33 (72%), Positives = 30/33 (90%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHC 40
+Y+GTI +GTPPQ+F VIFDTGSSNLW+PS +C
Sbjct: 74 EYFGTIGIGTPPQQFTVIFDTGSSNLWVPSVYC 106
>gi|301786579|ref|XP_002928698.1| PREDICTED: pepsin F-like [Ailuropoda melanoleuca]
Length = 388
Score = 90.1 bits (222), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 43/116 (37%), Positives = 70/116 (60%), Gaps = 2/116 (1%)
Query: 58 SHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYC--SNCQ 115
+ + + +E G +MFGGVD + G++ + PVSR YWQ ++SI + V S C+
Sbjct: 213 AFYLSKKEEEGSVVMFGGVDHSYYSGELNWVPVSRPFYWQLSMDSISMNGVVIACDSGCE 272
Query: 116 AIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNF 171
AI DTGTSL+IGPS ++ + K+I A +G +DCD ++ +P++ + G N+
Sbjct: 273 AIIDTGTSLLIGPSDIVFNIQKIINANQSYSGEYIIDCDAVNTLPDIVFTINGINY 328
Score = 65.5 bits (158), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 28/41 (68%), Positives = 34/41 (82%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCS 41
MR+ Y GTIS+GTPPQEFKVIFDTGS++LW+PS +CS
Sbjct: 67 MRNYLDLAYVGTISIGTPPQEFKVIFDTGSTDLWVPSIYCS 107
>gi|148910494|gb|ABR18322.1| unknown [Picea sitchensis]
Length = 471
Score = 90.1 bits (222), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 46/93 (49%), Positives = 61/93 (65%), Gaps = 3/93 (3%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCS-NCQAI 117
R DE GGEI+FGGVD + F G TY PV+R+GYWQF + I + +CS C AI
Sbjct: 230 RKVGDEEGGEIVFGGVDPNHFKGKHTYVPVTREGYWQFNMGDFLIGGQSTGFCSGGCAAI 289
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAK 150
D+GTSL+ GPS ++A++N+ IGA LA+ K
Sbjct: 290 VDSGTSLLAGPSGIVAQINEAIGASGLASQECK 322
Score = 68.9 bits (167), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 31/47 (65%), Positives = 37/47 (78%), Gaps = 1/47 (2%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+++ AQYYG I LGTPPQ+F VIFDTGSSNLW+PS C L+IAC
Sbjct: 78 LKNYLDAQYYGEIGLGTPPQKFTVIFDTGSSNLWVPSTKC-YLSIAC 123
>gi|13096225|pdb|1F34|A Chain A, Crystal Structure Of Ascaris Pepsin Inhibitor-3 Bound To
Porcine Pepsin
Length = 326
Score = 90.1 bits (222), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 44/128 (34%), Positives = 78/128 (60%), Gaps = 5/128 (3%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-K 107
V +D F S + +++D++G ++ GG+D + G + + PVS +GYWQ ++SI ++ +
Sbjct: 146 VSQDLF---SVYLSSNDDSGSVVLLGGIDSSYYTGSLNWVPVSVEGYWQITLDSITMDGE 202
Query: 108 NVYCSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
+ CS CQAI DTGTSL+ GP+ IA + IGA ++G + C ++D +P++ +
Sbjct: 203 TIACSGGCQAIVDTGTSLLTGPTSAIANIQSDIGASENSDGEMVISCSSIDSLPDIVFTI 262
Query: 167 GGKNFTLT 174
G + L+
Sbjct: 263 NGVQYPLS 270
Score = 63.5 bits (153), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 29/54 (53%), Positives = 41/54 (75%), Gaps = 3/54 (5%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKF 54
+ + +Y+GTI +GTP Q+F VIFDTGSSNLW+PS +CS ++ACS D ++F
Sbjct: 6 LENYLDTEYFGTIGIGTPAQDFTVIFDTGSSNLWVPSVYCS--SLACS-DHNQF 56
>gi|385301236|gb|EIF45441.1| proteinase a [Dekkera bruxellensis AWRI1499]
Length = 429
Score = 90.1 bits (222), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 47/110 (42%), Positives = 65/110 (59%), Gaps = 1/110 (0%)
Query: 66 ENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI-EKNVYCSNCQAIADTGTSL 124
E+GG FGG+D KF G IT+ PV RK YW+ E I + ++ + A DTGTSL
Sbjct: 264 EDGGVCTFGGIDDSKFTGKITWLPVRRKAYWEVKFEGIGLGDEYAELQSHGAAIDTGTSL 323
Query: 125 IIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
I+ PS++ LN IGA +G VDC+ D +P++ + GG NFTL+
Sbjct: 324 IVLPSQLAEILNSEIGAEKSWSGQYTVDCNKRDSLPDLTLTFGGYNFTLS 373
Score = 59.7 bits (143), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 32/76 (42%), Positives = 43/76 (56%), Gaps = 3/76 (3%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASD- 65
AQY+ I LGTP Q+FKVI DTGSSNLW+PS C+ L D ++ +N S+
Sbjct: 114 AQYFSEIELGTPGQKFKVILDTGSSNLWVPSSDCASLACYLHTKYDHEQSSTYKKNGSEF 173
Query: 66 --ENGGEIMFGGVDKD 79
+ G M G + +D
Sbjct: 174 SIQYGSGSMKGYISQD 189
>gi|50978822|ref|NP_001003117.1| pepsin A preproprotein [Canis lupus familiaris]
gi|73621384|sp|Q9GMY6.1|PEPA_CANFA RecName: Full=Pepsin A; Flags: Precursor
gi|9798660|dbj|BAB11752.1| pepsinogen A [Canis lupus familiaris]
Length = 386
Score = 90.1 bits (222), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 45/127 (35%), Positives = 76/127 (59%), Gaps = 5/127 (3%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-K 107
V +D F S + ++ D++G +MFGG+D + G++ + PVS +GYWQ V+S+ + +
Sbjct: 206 VSQDLF---SVYLSSDDQSGSVVMFGGIDSSYYSGNLNWVPVSVEGYWQITVDSVTMNGQ 262
Query: 108 NVYCSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
+ CS+ CQAI DTGTSL+ GP+ IA + IGA + G + C ++ +P++ +
Sbjct: 263 AIACSDGCQAIVDTGTSLLAGPTNAIANIQSYIGASQNSYGQMVISCSAINSLPDIVFTI 322
Query: 167 GGKNFTL 173
G + L
Sbjct: 323 NGIQYPL 329
Score = 65.1 bits (157), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 30/54 (55%), Positives = 42/54 (77%), Gaps = 3/54 (5%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKF 54
+++ +Y+GTI +GTPPQEF VIFDTGSSNLW+PS +CS + ACS + ++F
Sbjct: 66 LKNYMDMEYFGTIGIGTPPQEFTVIFDTGSSNLWVPSVYCS--SPACS-NHNRF 116
>gi|129786|sp|P27678.1|PEPA4_MACFU RecName: Full=Pepsin A-4; AltName: Full=Pepsin I/II; Flags:
Precursor
gi|38071|emb|CAA42425.1| prepropepsin A [Macaca fuscata]
Length = 388
Score = 90.1 bits (222), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 47/127 (37%), Positives = 74/127 (58%), Gaps = 5/127 (3%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-K 107
V +D F S + +A D++G ++FGG+D + G + + PVS +GYWQ V+SI + K
Sbjct: 208 VSQDLF---SVYLSADDQSGSVVIFGGIDSSYYTGSLNWVPVSVEGYWQISVDSITMNGK 264
Query: 108 NVYCSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
+ C+ CQAI DTGTSL+ GP+ IA + IGA ++G V C + +P++ +
Sbjct: 265 TIACAKGCQAIVDTGTSLLTGPTSPIANIQSDIGASENSDGEMVVSCSAISSLPDIVFTI 324
Query: 167 GGKNFTL 173
G + L
Sbjct: 325 NGVQYPL 331
Score = 57.8 bits (138), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 26/47 (55%), Positives = 36/47 (76%), Gaps = 3/47 (6%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKF 54
+Y+GTI +GTP Q F V+FDTGSSNLW+PS +C ++AC +D + F
Sbjct: 75 EYFGTIGIGTPAQNFTVVFDTGSSNLWVPSVYC--YSLAC-MDHNLF 118
>gi|351702766|gb|EHB05685.1| Napsin-A [Heterocephalus glaber]
Length = 417
Score = 90.1 bits (222), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 44/116 (37%), Positives = 67/116 (57%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAIA 118
R+ + +GGE++ GG D ++ +T+ PV+ YWQ +E +K+ + C+ C AI
Sbjct: 220 RDFAGADGGELVLGGADPAHYIPPLTFVPVTVPAYWQIHMERVKVGTGLTLCAQGCAAIV 279
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSLI GPS+ I L++ IG +P G + C + +P V +LGG F LT
Sbjct: 280 DTGTSLITGPSEEIRALHRAIGGLPWLAGEHFILCSKIPTLPPVSFLLGGVWFNLT 335
Score = 65.9 bits (159), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 25/40 (62%), Positives = 31/40 (77%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
QY+G I LGTPPQ F V+FDTGSSNLW+PS+ C ++ C
Sbjct: 74 QYFGEIGLGTPPQNFSVVFDTGSSNLWVPSKRCHFFSVPC 113
>gi|291409618|ref|XP_002721075.1| PREDICTED: pepsin II-4-like [Oryctolagus cuniculus]
Length = 387
Score = 90.1 bits (222), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 46/127 (36%), Positives = 73/127 (57%), Gaps = 5/127 (3%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-K 107
V +D F S + ++ DE G +MFGG+D + G + + PVS +GYWQ ++SI I +
Sbjct: 207 VSQDLF---SVYLSSDDEKGSLVMFGGIDSSYYTGSLNWVPVSYEGYWQITMDSISINGE 263
Query: 108 NVYCSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
+ C++ CQAI DTGTSL+ GP+ I+ + IGA G + C +D +P++ +
Sbjct: 264 TIACADSCQAIVDTGTSLLAGPTSAISNIQSYIGASKNLLGENVISCSAIDSLPDIVFTI 323
Query: 167 GGKNFTL 173
G + L
Sbjct: 324 NGIQYPL 330
Score = 70.5 bits (171), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 30/45 (66%), Positives = 40/45 (88%), Gaps = 2/45 (4%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDK 51
A+Y+GTIS+GTPPQ+F VIFDTGSSNLW+PS +CS ++AC++ K
Sbjct: 73 AEYFGTISIGTPPQDFTVIFDTGSSNLWVPSTYCS--SLACALHK 115
>gi|255713834|ref|XP_002553199.1| KLTH0D11264p [Lachancea thermotolerans]
gi|238934579|emb|CAR22761.1| KLTH0D11264p [Lachancea thermotolerans CBS 6340]
Length = 417
Score = 90.1 bits (222), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 47/127 (37%), Positives = 75/127 (59%), Gaps = 1/127 (0%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI-EK 107
+D+ +F + + + S+E+ GE+ FGG+D K+ G+IT+ PV RK YW+ + I + ++
Sbjct: 235 LDEPRFAFYLNNADDSEESTGEVTFGGIDSSKYKGNITWLPVRRKAYWEVKFDGIGLGDE 294
Query: 108 NVYCSNCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILG 167
A DTGTSLI PS + LN IGA +G VDC++ D++P++
Sbjct: 295 YAELEGTGAAIDTGTSLIALPSGLAEVLNAEIGAKKGWSGQYTVDCESRDQLPDLTFTFN 354
Query: 168 GKNFTLT 174
GKNFT++
Sbjct: 355 GKNFTIS 361
Score = 62.0 bits (149), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 27/41 (65%), Positives = 32/41 (78%), Gaps = 2/41 (4%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQY+ I+LGTPPQ FKVI DTGSSNLW+PS C ++AC
Sbjct: 101 AQYFTEITLGTPPQSFKVILDTGSSNLWVPSDECG--SLAC 139
>gi|229368735|gb|ACQ63017.1| progastricsin (predicted) [Dasypus novemcinctus]
Length = 276
Score = 90.1 bits (222), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 42/112 (37%), Positives = 70/112 (62%), Gaps = 3/112 (2%)
Query: 65 DENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCSN-CQAIADTG 121
++GGE++FGGVD++ + G+I ++PV+++ YWQ G+E I + + +CS CQAI DTG
Sbjct: 107 SQDGGEVVFGGVDQNLYSGEIYWAPVTQELYWQIGIEEFLIGGQASGWCSQGCQAIVDTG 166
Query: 122 TSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
TSL+ P + ++ L + GA G VDC ++ +P + ++ G F L
Sbjct: 167 TSLLTVPQQYMSALMQATGAQENEYGEYAVDCSSVQSLPTLTFVINGVQFPL 218
>gi|148906206|gb|ABR16259.1| unknown [Picea sitchensis]
Length = 509
Score = 90.1 bits (222), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 46/93 (49%), Positives = 61/93 (65%), Gaps = 3/93 (3%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCS-NCQAI 117
R DE GGEI+FGGVD + F G TY PV+R+GYWQF + I + +CS C AI
Sbjct: 230 RKVGDEEGGEIVFGGVDPNHFKGKHTYVPVTREGYWQFNMGDFLIGGQSTGFCSGGCAAI 289
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAK 150
D+GTSL+ GPS ++A++N+ IGA LA+ K
Sbjct: 290 VDSGTSLLAGPSGIVAQINEAIGASGLASQECK 322
Score = 68.9 bits (167), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 31/47 (65%), Positives = 37/47 (78%), Gaps = 1/47 (2%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+++ AQYYG I LGTPPQ+F VIFDTGSSNLW+PS C L+IAC
Sbjct: 78 LKNYLDAQYYGEIGLGTPPQKFTVIFDTGSSNLWVPSTKC-YLSIAC 123
Score = 43.9 bits (102), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 26/79 (32%), Positives = 40/79 (50%), Gaps = 3/79 (3%)
Query: 99 GVESIKIEKNVYCSNCQ-AIADTGTSLIIGPSK--VIAELNKLIGAVPLANGPAKVDCDN 155
G E+ + C+ C+ A+ + SK ++ LN+L +P NG A VDC+N
Sbjct: 371 GNEATSTSSSGMCAACEMAVVWAKNQIARNASKDQIMTYLNQLCDRLPNPNGQAAVDCNN 430
Query: 156 LDKMPNVDIILGGKNFTLT 174
L MP V +G ++F LT
Sbjct: 431 LSSMPTVSFTIGDRSFDLT 449
>gi|195159704|ref|XP_002020718.1| GL15683 [Drosophila persimilis]
gi|194117668|gb|EDW39711.1| GL15683 [Drosophila persimilis]
Length = 413
Score = 90.1 bits (222), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 48/117 (41%), Positives = 70/117 (59%), Gaps = 4/117 (3%)
Query: 61 RNASD-ENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSNCQAIA 118
RNAS+ NGG+++ GG D + G +TY P+S+ GYWQ V SI ++ CSNC+AI
Sbjct: 239 RNASEPSNGGQLLLGGSDPTLYSGCLTYVPLSQVGYWQITVGSISLDTGSDLCSNCEAII 298
Query: 119 DTGTSLIIGPSKVIAELNKLIG--AVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
D GTSLI+ PS +A +N G A +G + CD + +P + +G ++FTL
Sbjct: 299 DAGTSLIVVPSATLAAINLRFGITAADKRDGVYTISCDKVSSLPVLTFNIGRRDFTL 355
Score = 50.4 bits (119), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 19/40 (47%), Positives = 31/40 (77%), Gaps = 2/40 (5%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHC 40
+ +SY +YYG +++G+ Q+F+V+FDT S+NLW+PS C
Sbjct: 86 LTNSYNTEYYGQVTIGS--QQFQVLFDTASANLWVPSVKC 123
>gi|348560226|ref|XP_003465915.1| PREDICTED: pepsin F-like [Cavia porcellus]
Length = 574
Score = 90.1 bits (222), Expect = 3e-16, Method: Composition-based stats.
Identities = 44/114 (38%), Positives = 68/114 (59%), Gaps = 2/114 (1%)
Query: 60 FRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-KNVYC-SNCQAI 117
+ ++ E G +M GGVD + GD+ + PVS+ YWQ ++SI + K++ C CQAI
Sbjct: 401 YLSSKAEAGSMLMLGGVDPAYYSGDLHWVPVSKPRYWQLAMDSISMGGKDIACHGGCQAI 460
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNF 171
DTGTSL+ GP + IA + K++GA NG V+C + +P V +GG ++
Sbjct: 461 LDTGTSLVNGPRQAIAAIQKILGAKGSRNGELVVNCSTISALPEVIFTIGGLSY 514
Score = 56.2 bits (134), Expect = 5e-06, Method: Composition-based stats.
Identities = 23/51 (45%), Positives = 35/51 (68%), Gaps = 2/51 (3%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDK 51
+R+ A Y I +GTPPQEF+V+ DTGS++LW+PS C+ + AC+ +
Sbjct: 253 IRNFLDAAYVAVIGVGTPPQEFQVVLDTGSASLWVPSIRCA--SPACATHR 301
>gi|359483345|ref|XP_003632941.1| PREDICTED: aspartic proteinase isoform 2 [Vitis vinifera]
gi|359483347|ref|XP_002262915.2| PREDICTED: aspartic proteinase isoform 1 [Vitis vinifera]
Length = 514
Score = 89.7 bits (221), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 45/93 (48%), Positives = 61/93 (65%), Gaps = 3/93 (3%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCS-NCQAI 117
R D+ GGE++FGGVD D F G+ TY PV++KGYWQF + + I E YC+ C AI
Sbjct: 234 RKTDDDEGGELVFGGVDPDHFKGEHTYVPVTQKGYWQFDMGEVLIDGETTGYCAGGCAAI 293
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAK 150
AD+GTSL+ GP+ V+A +N IGA + + K
Sbjct: 294 ADSGTSLLAGPTAVVAMINHAIGATGVVSQECK 326
Score = 62.4 bits (150), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 25/40 (62%), Positives = 31/40 (77%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHC 40
+++ AQY+G I +GTPPQ F VIFDTGSSNLW+PS C
Sbjct: 82 LKNYMDAQYFGEIGIGTPPQTFTVIFDTGSSNLWVPSSKC 121
Score = 41.6 bits (96), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 29/89 (32%), Positives = 44/89 (49%), Gaps = 12/89 (13%)
Query: 98 FGVESIKIEKN---------VYCSNCQ-AIADTGTSLIIGPSK--VIAELNKLIGAVPLA 145
G+ES+ EKN CS C+ A+ + L +K ++ +N+L +P
Sbjct: 366 MGIESVVDEKNGDKSSGVHDAGCSACEMAVVWMQSQLRQNQTKERILEYVNELCDRLPSP 425
Query: 146 NGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
G + VDC L MPNV + +GGK F L+
Sbjct: 426 MGESAVDCLQLSSMPNVSLTIGGKVFDLS 454
>gi|183180242|gb|ACC44388.1| ASP-3 [Caenorhabditis remanei]
gi|183180244|gb|ACC44389.1| ASP-3 [Caenorhabditis remanei]
gi|183180246|gb|ACC44390.1| ASP-3 [Caenorhabditis remanei]
gi|183180248|gb|ACC44391.1| ASP-3 [Caenorhabditis remanei]
gi|183180250|gb|ACC44392.1| ASP-3 [Caenorhabditis remanei]
gi|183180252|gb|ACC44393.1| ASP-3 [Caenorhabditis remanei]
gi|183180256|gb|ACC44395.1| ASP-3 [Caenorhabditis remanei]
gi|183180260|gb|ACC44397.1| ASP-3 [Caenorhabditis remanei]
gi|183180262|gb|ACC44398.1| ASP-3 [Caenorhabditis remanei]
gi|183180264|gb|ACC44399.1| ASP-3 [Caenorhabditis remanei]
gi|183180270|gb|ACC44402.1| ASP-3 [Caenorhabditis remanei]
gi|183180272|gb|ACC44403.1| ASP-3 [Caenorhabditis remanei]
Length = 225
Score = 89.7 bits (221), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 44/115 (38%), Positives = 69/115 (60%), Gaps = 2/115 (1%)
Query: 61 RNASD-ENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN-CQAIA 118
R+A+D NGGEI D + + G+I + P+ YW+ + ++ I+ Y + +I
Sbjct: 58 RDANDIANGGEITLCDTDSNHYTGNIAWEPLVSMDYWRIKLGAVSIDGTTYTNGPIDSIV 117
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
DTGTSL+ GP+ VI ++ IG +PL NG +V+C + K+PN+ LGG+NF L
Sbjct: 118 DTGTSLLTGPTDVIKKIQHKIGGIPLFNGEYEVECSKIPKLPNITFTLGGQNFDL 172
>gi|283806612|ref|NP_001164557.1| pepsin II-2/3 precursor [Oryctolagus cuniculus]
gi|129781|sp|P27821.1|PEPA2_RABIT RecName: Full=Pepsin II-2/3; AltName: Full=Pepsin A; Flags:
Precursor
gi|165600|gb|AAA85369.1| pepsinogen [Oryctolagus cuniculus]
Length = 387
Score = 89.7 bits (221), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 45/127 (35%), Positives = 73/127 (57%), Gaps = 5/127 (3%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-K 107
V +D F S + ++ DE G +MFGG+D + G + + PVS +GYWQ ++S+ I +
Sbjct: 207 VSQDLF---SVYLSSDDEKGSLVMFGGIDSSYYTGSLNWVPVSYEGYWQITMDSVSINGE 263
Query: 108 NVYCSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
+ C++ CQAI DTGTSL+ GP+ I+ + IGA G + C +D +P++ +
Sbjct: 264 TIACADSCQAIVDTGTSLLTGPTSAISNIQSYIGASKNLLGENVISCSAIDSLPDIVFTI 323
Query: 167 GGKNFTL 173
G + L
Sbjct: 324 NGIQYPL 330
Score = 72.0 bits (175), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 31/51 (60%), Positives = 42/51 (82%), Gaps = 2/51 (3%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDK 51
M + A+Y+GTIS+GTPPQ+F VIFDTGSSNLW+PS +CS ++AC++ K
Sbjct: 67 MENYLDAEYFGTISIGTPPQDFTVIFDTGSSNLWVPSTYCS--SLACALHK 115
>gi|425892460|gb|AFB73927.2| preprocirsin [Cirsium vulgare]
Length = 509
Score = 89.7 bits (221), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 44/84 (52%), Positives = 59/84 (70%), Gaps = 3/84 (3%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV--YCSN-CQAI 117
RNA++E GGE++FGGVD + F G TY PV+ KGYWQF + + IE +CS+ C AI
Sbjct: 229 RNANEEEGGELVFGGVDPNHFKGKHTYVPVTEKGYWQFNMGDVLIEDKTTGFCSDGCAAI 288
Query: 118 ADTGTSLIIGPSKVIAELNKLIGA 141
AD+GTSL+ GP+ +I E+N GA
Sbjct: 289 ADSGTSLLAGPTAIITEINHASGA 312
Score = 64.3 bits (155), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 27/47 (57%), Positives = 35/47 (74%), Gaps = 1/47 (2%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+++ AQYYG I +G PPQ+F VIFDTGSSNLW+PS C ++AC
Sbjct: 77 LKNYMDAQYYGEIGIGAPPQKFTVIFDTGSSNLWVPSAKC-YFSVAC 122
Score = 38.5 bits (88), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 17/43 (39%), Positives = 26/43 (60%)
Query: 131 VIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
+I +N+L +P G + VDC++L MPN+ +GGK F L
Sbjct: 406 IINYVNELCDRLPSPMGESAVDCNSLSSMPNIAFTIGGKVFEL 448
>gi|253723333|pdb|4PEP|A Chain A, The Molecular And Crystal Structures Of Monoclinic Porcine
Pepsin Refined At 1.8 Angstroms Resolution
Length = 326
Score = 89.7 bits (221), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 44/128 (34%), Positives = 78/128 (60%), Gaps = 5/128 (3%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-K 107
V +D F S + +++D++G ++ GG+D + G + + PVS +GYWQ ++SI ++ +
Sbjct: 146 VSQDLF---SVYLSSNDDSGSVVLLGGIDSSYYTGSLNWVPVSVEGYWQITLDSITMDGE 202
Query: 108 NVYCSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
+ CS CQAI DTGTSL+ GP+ IA + IGA ++G + C ++D +P++ +
Sbjct: 203 TIACSGGCQAIVDTGTSLLTGPTSAIANIQSDIGASENSDGEMVISCSSIDSLPDIVFTI 262
Query: 167 GGKNFTLT 174
G + L+
Sbjct: 263 DGVQYPLS 270
Score = 63.5 bits (153), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 29/54 (53%), Positives = 41/54 (75%), Gaps = 3/54 (5%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKF 54
+ + +Y+GTI +GTP Q+F VIFDTGSSNLW+PS +CS ++ACS D ++F
Sbjct: 6 LENYLDTEYFGTIGIGTPAQDFTVIFDTGSSNLWVPSVYCS--SLACS-DHNQF 56
>gi|40641523|emb|CAE52913.1| putative vacuaolar aspartic proteinase [Physcomitrella patens]
Length = 504
Score = 89.7 bits (221), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 43/94 (45%), Positives = 63/94 (67%), Gaps = 3/94 (3%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV--YCSN-CQAI 117
RN + +GGE++ GGVD F G+ Y+PV+RKGYWQF + + I +C+N C AI
Sbjct: 224 RNPDESSGGELVLGGVDPKHFKGEHVYTPVTRKGYWQFDLGDVTINGRTTGFCANGCTAI 283
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKV 151
AD+GTSL+ GPS ++AE+N+ IGA + + K+
Sbjct: 284 ADSGTSLLAGPSGIVAEINQAIGATGVVSQQCKM 317
Score = 62.8 bits (151), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 27/41 (65%), Positives = 33/41 (80%), Gaps = 1/41 (2%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQY+G I +G+PPQ F VIFDTGSSNLW+PS C L++AC
Sbjct: 78 AQYFGEIGIGSPPQPFAVIFDTGSSNLWVPSAKC-YLSLAC 117
Score = 39.3 bits (90), Expect = 0.56, Method: Compositional matrix adjust.
Identities = 18/44 (40%), Positives = 26/44 (59%)
Query: 130 KVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
++ A LN+L +P NG + VDC++L MPNV + K F L
Sbjct: 395 QIDAYLNQLCERLPSPNGESAVDCNSLSSMPNVAFTISNKTFEL 438
>gi|157836865|pdb|3PEP|A Chain A, Revised 2.3 Angstroms Structure Of Porcine Pepsin.
Evidence For A Flexible Subdomain
Length = 326
Score = 89.7 bits (221), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 44/128 (34%), Positives = 78/128 (60%), Gaps = 5/128 (3%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-K 107
V +D F S + +++D++G ++ GG+D + G + + PVS +GYWQ ++SI ++ +
Sbjct: 146 VSQDLF---SVYLSSNDDSGSVVLLGGIDSSYYTGSLNWVPVSVEGYWQITLDSITMDGE 202
Query: 108 NVYCSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
+ CS CQAI DTGTSL+ GP+ IA + IGA ++G + C ++D +P++ +
Sbjct: 203 TIACSGGCQAIVDTGTSLLTGPTSAIANIQSDIGASENSDGEMVISCSSIDSLPDIVFTI 262
Query: 167 GGKNFTLT 174
G + L+
Sbjct: 263 DGVQYPLS 270
Score = 63.5 bits (153), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 29/54 (53%), Positives = 41/54 (75%), Gaps = 3/54 (5%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKF 54
+ + +Y+GTI +GTP Q+F VIFDTGSSNLW+PS +CS ++ACS D ++F
Sbjct: 6 LENYLDTEYFGTIGIGTPAQDFTVIFDTGSSNLWVPSVYCS--SLACS-DHNQF 56
>gi|224118038|ref|XP_002331542.1| predicted protein [Populus trichocarpa]
gi|222873766|gb|EEF10897.1| predicted protein [Populus trichocarpa]
Length = 512
Score = 89.7 bits (221), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 44/84 (52%), Positives = 59/84 (70%), Gaps = 3/84 (3%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYC-SNCQAI 117
RNA DE GGEI+FGG+D + + G T+ PV+RKGYWQF + + I + YC S C AI
Sbjct: 232 RNAEDEEGGEIVFGGMDPNHYKGKHTFVPVTRKGYWQFNMGDVHIGDKPTGYCASGCAAI 291
Query: 118 ADTGTSLIIGPSKVIAELNKLIGA 141
AD+GTSL+ GP+ +I +N+ IGA
Sbjct: 292 ADSGTSLLAGPTTIITMINQAIGA 315
Score = 64.7 bits (156), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 27/47 (57%), Positives = 37/47 (78%), Gaps = 1/47 (2%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+++ +QYYG I +G+PPQ+F VIFDTGSSNLW+PS C L++AC
Sbjct: 80 LKNYLDSQYYGEIGVGSPPQKFTVIFDTGSSNLWVPSSKC-YLSVAC 125
Score = 35.8 bits (81), Expect = 6.3, Method: Compositional matrix adjust.
Identities = 15/45 (33%), Positives = 26/45 (57%)
Query: 130 KVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
+++ +N+L +P G + VDC+++ MP V +GGK F L
Sbjct: 408 RILDYVNQLCERMPNPMGESAVDCESVPSMPTVAFTIGGKEFELA 452
>gi|12843350|dbj|BAB25952.1| unnamed protein product [Mus musculus]
Length = 396
Score = 89.7 bits (221), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 43/123 (34%), Positives = 76/123 (61%), Gaps = 4/123 (3%)
Query: 59 HFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCSN--C 114
+ + NGG+I+FGGVD++ + G++T+ PV+++ YWQ ++ I + + +CS+ C
Sbjct: 216 YLGSQQGSNGGQIVFGGVDENLYTGELTWIPVTQELYWQITIDDFLIGNQASGWCSSSGC 275
Query: 115 QAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
Q I DTGTSL++ P++ + EL + IGA G V CD++ +P + +L G F L+
Sbjct: 276 QGIVDTGTSLLVMPAQYLNELLQTIGAQEGEYGQYFVSCDSVSSLPTLTFVLNGVQFPLS 335
Query: 175 RTN 177
++
Sbjct: 336 PSS 338
Score = 57.4 bits (137), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 24/34 (70%), Positives = 28/34 (82%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHC 40
A YYG IS+GTPPQ F V+FDTGSSNLW+ S +C
Sbjct: 74 ASYYGEISIGTPPQNFLVLFDTGSSNLWVSSVYC 107
>gi|283806610|ref|NP_001164556.1| pepsin II-4 precursor [Oryctolagus cuniculus]
gi|129787|sp|P28713.1|PEPA4_RABIT RecName: Full=Pepsin II-4; AltName: Full=Pepsin A; Flags: Precursor
gi|22218076|dbj|BAC07515.1| pepsinogen II-4 [Oryctolagus cuniculus]
Length = 387
Score = 89.7 bits (221), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 45/127 (35%), Positives = 73/127 (57%), Gaps = 5/127 (3%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-K 107
V +D F S + ++ DE G +MFGG+D + G + + PVS +GYWQ ++S+ I +
Sbjct: 207 VSQDLF---SVYLSSDDEKGSLVMFGGIDSSYYTGSLNWVPVSYEGYWQITMDSVSINGE 263
Query: 108 NVYCSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
+ C++ CQAI DTGTSL+ GP+ I+ + IGA G + C +D +P++ +
Sbjct: 264 TIACADSCQAIVDTGTSLLTGPTSAISNIQSYIGASKNLLGENVISCSAIDSLPDIVFTI 323
Query: 167 GGKNFTL 173
G + L
Sbjct: 324 NGIQYPL 330
Score = 70.9 bits (172), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 30/45 (66%), Positives = 40/45 (88%), Gaps = 2/45 (4%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDK 51
A+Y+GTIS+GTPPQ+F VIFDTGSSNLW+PS +CS ++AC++ K
Sbjct: 73 AEYFGTISIGTPPQDFTVIFDTGSSNLWVPSTYCS--SLACALHK 115
>gi|397485042|ref|XP_003813672.1| PREDICTED: napsin-A-like [Pan paniscus]
Length = 420
Score = 89.7 bits (221), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 42/116 (36%), Positives = 68/116 (58%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI-EKNVYCSN-CQAIA 118
R+ +GGE++ GG D ++ +T+ PV+ YWQ +E +K+ + C+ C AI
Sbjct: 223 RDPEVADGGELVLGGSDPAHYIPPLTFVPVTVPAYWQIHMERVKVGSRLTLCAQGCAAIL 282
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGT +I+GP++ I L+ IG +PL G + C + K+P V +++GG F LT
Sbjct: 283 DTGTPVIVGPTEEIRALHAAIGGIPLLAGEYIIRCSEIPKLPAVSLLIGGVWFNLT 338
Score = 65.9 bits (159), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 26/41 (63%), Positives = 31/41 (75%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQY+G I LGTPPQ F V FDTGSSNLW+PS+ C ++ C
Sbjct: 76 AQYFGEIGLGTPPQNFTVAFDTGSSNLWVPSRRCHFFSVPC 116
>gi|1507725|gb|AAB06575.1| aspartic protease, partial [Ancylostoma caninum]
Length = 442
Score = 89.7 bits (221), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 48/119 (40%), Positives = 70/119 (58%), Gaps = 7/119 (5%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIK-IEKNVYCSN----CQ 115
RN E GGEI GG+D ++V IT++PV+R+GYWQF ++ ++ ++ C N CQ
Sbjct: 233 RNPDSELGGEITLGGMDTRRYVEPITWTPVTRRGYWQFKMDKVQGGSTSIACPNEFSGCQ 292
Query: 116 AIADTGTSLIIGP-SKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
AIADTGTSLI GP ++ A N L+ P G + CD + P + ++ + FTL
Sbjct: 293 AIADTGTSLIAGPKAQSRASRNSLV-LEPTYEGEYMIPCDKVPFPPRLSFVIEARTFTL 350
Score = 65.9 bits (159), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 28/47 (59%), Positives = 35/47 (74%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+R+ AQY+GTI +GTP Q F VIFDTGSSNLW+PS+ +IAC
Sbjct: 80 LRNYMDAQYFGTIQIGTPAQNFTVIFDTGSSNLWVPSEKMPFHDIAC 126
>gi|302144105|emb|CBI23210.3| unnamed protein product [Vitis vinifera]
Length = 429
Score = 89.7 bits (221), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 45/93 (48%), Positives = 61/93 (65%), Gaps = 3/93 (3%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCS-NCQAI 117
R D+ GGE++FGGVD D F G+ TY PV++KGYWQF + + I E YC+ C AI
Sbjct: 149 RKTDDDEGGELVFGGVDPDHFKGEHTYVPVTQKGYWQFDMGEVLIDGETTGYCAGGCAAI 208
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAK 150
AD+GTSL+ GP+ V+A +N IGA + + K
Sbjct: 209 ADSGTSLLAGPTAVVAMINHAIGATGVVSQECK 241
Score = 61.2 bits (147), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 25/34 (73%), Positives = 28/34 (82%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHC 40
AQY+G I +GTPPQ F VIFDTGSSNLW+PS C
Sbjct: 3 AQYFGEIGIGTPPQTFTVIFDTGSSNLWVPSSKC 36
Score = 41.6 bits (96), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 29/89 (32%), Positives = 44/89 (49%), Gaps = 12/89 (13%)
Query: 98 FGVESIKIEKN---------VYCSNCQ-AIADTGTSLIIGPSK--VIAELNKLIGAVPLA 145
G+ES+ EKN CS C+ A+ + L +K ++ +N+L +P
Sbjct: 281 MGIESVVDEKNGDKSSGVHDAGCSACEMAVVWMQSQLRQNQTKERILEYVNELCDRLPSP 340
Query: 146 NGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
G + VDC L MPNV + +GGK F L+
Sbjct: 341 MGESAVDCLQLSSMPNVSLTIGGKVFDLS 369
>gi|168029783|ref|XP_001767404.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162681300|gb|EDQ67728.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 499
Score = 89.7 bits (221), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 42/84 (50%), Positives = 59/84 (70%), Gaps = 3/84 (3%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV--YCSN-CQAI 117
RN + +GGE++ GGVD F G+ Y+PV+RKGYWQF + + I +C+N C AI
Sbjct: 224 RNPDESSGGELVLGGVDPKHFKGEHVYTPVTRKGYWQFDLGDVTINGRTTGFCANGCTAI 283
Query: 118 ADTGTSLIIGPSKVIAELNKLIGA 141
AD+GTSL+ GPS ++AE+N+ IGA
Sbjct: 284 ADSGTSLLAGPSGIVAEINQAIGA 307
Score = 62.8 bits (151), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 27/41 (65%), Positives = 33/41 (80%), Gaps = 1/41 (2%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQY+G I +G+PPQ F VIFDTGSSNLW+PS C L++AC
Sbjct: 78 AQYFGEIGIGSPPQPFAVIFDTGSSNLWVPSAKC-YLSLAC 117
Score = 39.7 bits (91), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 18/44 (40%), Positives = 26/44 (59%)
Query: 130 KVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
++ A LN+L +P NG + VDC++L MPNV + K F L
Sbjct: 395 QIDAYLNQLCERLPSPNGESAVDCNSLSSMPNVAFTISNKTFEL 438
>gi|402906426|ref|XP_003916003.1| PREDICTED: napsin-A-like [Papio anubis]
Length = 423
Score = 89.7 bits (221), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 42/116 (36%), Positives = 69/116 (59%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAIA 118
R++ +GGE++ GG D ++ +T+ PV+ YWQ +E + + + C+ C AI
Sbjct: 224 RDSEVADGGELVLGGSDPAHYIPPLTFVPVTVPAYWQIHMERVTVGSGLTLCARGCAAIL 283
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGT +IIGP++ I L++ IG +PL G + C + K+P V +++GG F LT
Sbjct: 284 DTGTPVIIGPTEEIRALHEAIGGIPLLAGEYIIRCSEIPKLPTVSLLIGGVWFNLT 339
Score = 67.4 bits (163), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 26/41 (63%), Positives = 32/41 (78%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQY+G I LGTPPQ F V+FDTGSSNLW+PS+ C ++ C
Sbjct: 77 AQYFGEIGLGTPPQNFTVVFDTGSSNLWVPSRRCHFFSVPC 117
>gi|355756059|gb|EHH59806.1| hypothetical protein EGM_10003 [Macaca fascicularis]
Length = 423
Score = 89.7 bits (221), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 42/116 (36%), Positives = 69/116 (59%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAIA 118
R++ +GGE++ GG D ++ +T+ PV+ YWQ +E + + + C+ C AI
Sbjct: 224 RDSEVADGGELVLGGSDPAHYIPPLTFVPVTVPAYWQIHMERVTVGSGLTLCARGCAAIL 283
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGT +IIGP++ I L++ IG +PL G + C + K+P V +++GG F LT
Sbjct: 284 DTGTPVIIGPTEEIRALHEAIGGIPLLAGEYIIRCSEIPKLPTVSLLIGGVWFNLT 339
Score = 67.4 bits (163), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 26/41 (63%), Positives = 32/41 (78%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQY+G I LGTPPQ F V+FDTGSSNLW+PS+ C ++ C
Sbjct: 77 AQYFGEIGLGTPPQNFTVVFDTGSSNLWVPSRRCHFFSVPC 117
>gi|292658825|ref|NP_999038.2| pepsin A preproprotein [Sus scrofa]
gi|121073319|gb|ABM47074.1| pepsinogen A [Sus scrofa]
Length = 385
Score = 89.7 bits (221), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 44/128 (34%), Positives = 78/128 (60%), Gaps = 5/128 (3%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-K 107
V +D F S + +++D++G ++ GG+D + G + + PVS +GYWQ ++SI ++ +
Sbjct: 205 VSQDLF---SVYLSSNDDSGSVVLLGGIDSSYYTGSLNWVPVSVEGYWQITLDSITMDGE 261
Query: 108 NVYCSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
+ CS CQAI DTGTSL+ GP+ IA + IGA ++G + C ++D +P++ +
Sbjct: 262 TIACSGGCQAIVDTGTSLLTGPTSAIANIQSDIGASENSDGEMVISCSSIDSLPDIVFTI 321
Query: 167 GGKNFTLT 174
G + L+
Sbjct: 322 NGVQYPLS 329
Score = 61.6 bits (148), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 29/54 (53%), Positives = 40/54 (74%), Gaps = 3/54 (5%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKF 54
+ + +Y GTI +GTP Q+F VIFDTGSSNLW+PS +CS ++ACS D ++F
Sbjct: 65 LENYLDTEYLGTIGIGTPAQDFTVIFDTGSSNLWVPSVYCS--SLACS-DHNQF 115
>gi|296471707|tpg|DAA13822.1| TPA: pregnancy-associated glycoprotein 11-like [Bos taurus]
Length = 332
Score = 89.7 bits (221), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 64/222 (28%), Positives = 103/222 (46%), Gaps = 59/222 (26%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHF 60
+R+ Y G I++GTP QEF+VIFDTG S+LW+PS C + ACS K V H+
Sbjct: 63 LRNIMDMLYEGNITIGTPLQEFQVIFDTGLSDLWMPSLLCR--SSACS----KQVRFRHY 116
Query: 61 RNAS----------------------------------------------DENGGEIMFG 74
++++ G +MFG
Sbjct: 117 KSSTFRPTQETFSIASRSGSMKGFLAYETLSDSAVVTVFQISPPLFSLICRAKGSVVMFG 176
Query: 75 GVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSNC-QAIADTGTSLIIGPSKVI 132
GVDK G + + + + G W ++ I +++ + CS+C +A+ DT TSLI+GP +++
Sbjct: 177 GVDKSYCQGALNWISLIKVGDWSVYMDRISMKRKIIVCSSCCKALVDTRTSLILGPRRLV 236
Query: 133 AELNKLIGAVPLANGPAK---VDCDNLDKMPNVDIILGGKNF 171
+ KL GA+P GP V C + +P++ + G N+
Sbjct: 237 NNIQKLSGAMP--QGPRXGHYVSCFVDNTLPSIIFTINGINY 276
>gi|118572685|sp|P00791.3|PEPA_PIG RecName: Full=Pepsin A; Flags: Precursor
Length = 385
Score = 89.7 bits (221), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 44/128 (34%), Positives = 78/128 (60%), Gaps = 5/128 (3%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-K 107
V +D F S + +++D++G ++ GG+D + G + + PVS +GYWQ ++SI ++ +
Sbjct: 205 VSQDLF---SVYLSSNDDSGSVVLLGGIDSSYYTGSLNWVPVSVEGYWQITLDSITMDGE 261
Query: 108 NVYCSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
+ CS CQAI DTGTSL+ GP+ IA + IGA ++G + C ++D +P++ +
Sbjct: 262 TIACSGGCQAIVDTGTSLLTGPTSAIANIQSDIGASENSDGEMVISCSSIDSLPDIVFTI 321
Query: 167 GGKNFTLT 174
G + L+
Sbjct: 322 NGVQYPLS 329
Score = 63.2 bits (152), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 29/54 (53%), Positives = 41/54 (75%), Gaps = 3/54 (5%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKF 54
+ + +Y+GTI +GTP Q+F VIFDTGSSNLW+PS +CS ++ACS D ++F
Sbjct: 65 LENYLDTEYFGTIGIGTPAQDFTVIFDTGSSNLWVPSVYCS--SLACS-DHNQF 115
>gi|403255011|ref|XP_003920243.1| PREDICTED: pepsin A-like [Saimiri boliviensis boliviensis]
Length = 405
Score = 89.4 bits (220), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 44/125 (35%), Positives = 74/125 (59%), Gaps = 5/125 (4%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-K 107
V +D F + + +A+D++G +MFGG+D + G + + P+S +GYWQ V+SI + +
Sbjct: 225 VSQDLF---AVYLSANDQSGSVVMFGGIDSSYYTGSLNWVPLSAEGYWQITVDSITMNGE 281
Query: 108 NVYCSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
+ C+ CQAI DTGTSL+ GP+ IA + IGA +NG + C + +P++ +
Sbjct: 282 AIACAEGCQAIVDTGTSLLSGPTSPIANIQSYIGASENSNGEMVISCSAISSLPDIVFTI 341
Query: 167 GGKNF 171
G +
Sbjct: 342 NGIQY 346
Score = 60.8 bits (146), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 25/34 (73%), Positives = 30/34 (88%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCS 41
+Y+GTI +GTP QEF VIFDTGSSNLW+PS +CS
Sbjct: 92 EYFGTIGIGTPAQEFTVIFDTGSSNLWVPSVYCS 125
>gi|2832610|emb|CAA11580.1| cathepsin [Chionodraco hamatus]
Length = 402
Score = 89.4 bits (220), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 50/116 (43%), Positives = 70/116 (60%), Gaps = 5/116 (4%)
Query: 62 NASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEK-NVYCSN-CQAIAD 119
N S E GE++ GG D+ + I + PV+ K YWQ ++S+ ++ N +C + CQAI D
Sbjct: 229 NGSPE--GELLLGGTDERLYDLPINWLPVTAKAYWQIKIDSVVVQGVNPFCPHGCQAIVD 286
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMP-NVDIILGGKNFTLT 174
TGTSLI GP+ I ++ +LIGA P G VDC L P + +LGGK +TLT
Sbjct: 287 TGTSLITGPTDDILDIQQLIGATPTNFGEFIVDCARLSNFPQHQHFVLGGKEYTLT 342
Score = 61.2 bits (147), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 27/45 (60%), Positives = 35/45 (77%), Gaps = 2/45 (4%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDK 51
AQYYG I+LGTP Q F V+FDTGSS+LW+PS +C + AC++ K
Sbjct: 81 AQYYGDIALGTPEQNFSVVFDTGSSDLWVPSAYC--VTEACALPK 123
>gi|163504|gb|AAA30692.1| pepsin A (AA at 12), partial [Bos taurus]
Length = 120
Score = 89.4 bits (220), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 41/109 (37%), Positives = 68/109 (62%), Gaps = 2/109 (1%)
Query: 65 DENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-KNVYCSN-CQAIADTGT 122
D++G +MFGG+D + G + + PVS +GYWQ V+SI ++ + + C++ CQAI DTGT
Sbjct: 2 DDSGSVVMFGGIDSSYYTGSLNWVPVSYEGYWQITVDSITMDGETIACADGCQAIVDTGT 61
Query: 123 SLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNF 171
SL+ GP+ I+ + IGA ++G V C ++ +PN+ + G +
Sbjct: 62 SLLAGPTSAISNIQSYIGASENSDGEMIVSCSSMYSLPNIVFTINGVQY 110
>gi|222425184|dbj|BAH20541.1| pepsinogen A-14 [Pongo abelii]
Length = 388
Score = 89.4 bits (220), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 47/127 (37%), Positives = 74/127 (58%), Gaps = 5/127 (3%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-K 107
V +D F S + +A D++G ++FGG+D + G + + PV+ +GYWQ V+SI + K
Sbjct: 208 VSQDLF---SVYLSADDKSGSVVIFGGIDSSYYTGSLNWVPVTVEGYWQITVDSITMNGK 264
Query: 108 NVYCSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
+ C+ CQAI DTGTSL+ GP+ IA + IGA +NG V C + +P++ +
Sbjct: 265 TIACAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASENSNGDMVVSCSAISSLPDIVFTI 324
Query: 167 GGKNFTL 173
G + L
Sbjct: 325 NGVQYPL 331
Score = 58.5 bits (140), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 26/48 (54%), Positives = 37/48 (77%), Gaps = 3/48 (6%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKF 54
+Y+GTI +GTP Q+F V+FDTGSSNLW+PS +C ++AC +D + F
Sbjct: 74 VEYFGTIGIGTPAQDFTVVFDTGSSNLWVPSVYC--YSLAC-MDHNLF 118
>gi|355703800|gb|EHH30291.1| hypothetical protein EGK_10923 [Macaca mulatta]
Length = 423
Score = 89.4 bits (220), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 42/116 (36%), Positives = 69/116 (59%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAIA 118
R++ +GGE++ GG D ++ +T+ PV+ YWQ +E + + + C+ C AI
Sbjct: 224 RDSEVADGGELVLGGSDPAHYIPPLTFVPVTVPAYWQIHMERVMVGSGLTLCARGCAAIL 283
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGT +IIGP++ I L++ IG +PL G + C + K+P V +++GG F LT
Sbjct: 284 DTGTPVIIGPTEEIRALHEAIGGIPLLAGEYIIRCSEIPKLPTVSLLIGGVWFNLT 339
Score = 67.4 bits (163), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 26/41 (63%), Positives = 32/41 (78%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQY+G I LGTPPQ F V+FDTGSSNLW+PS+ C ++ C
Sbjct: 77 AQYFGEIGLGTPPQNFTVVFDTGSSNLWVPSRRCHFFSVPC 117
>gi|336373584|gb|EGO01922.1| hypothetical protein SERLA73DRAFT_177556 [Serpula lacrymans var.
lacrymans S7.3]
gi|336386403|gb|EGO27549.1| hypothetical protein SERLADRAFT_461213 [Serpula lacrymans var.
lacrymans S7.9]
Length = 413
Score = 89.4 bits (220), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 48/120 (40%), Positives = 67/120 (55%), Gaps = 2/120 (1%)
Query: 60 FR-NASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKN-VYCSNCQAI 117
FR +S+++GGE +FGG+D + G ITY PV RK YW+ +E + + + N A
Sbjct: 241 FRLGSSEDDGGEAVFGGIDSSAYTGSITYVPVRRKAYWEVELEKVSFGGDELDLENTGAA 300
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLTRTN 177
DTGTSLI P+ V LN IGA NG +VDC + +P + GGK + L T+
Sbjct: 301 IDTGTSLIALPTDVAEMLNTQIGATRSWNGQYQVDCAKVPSLPELSFYFGGKPYPLKGTD 360
Score = 59.7 bits (143), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 27/41 (65%), Positives = 32/41 (78%), Gaps = 2/41 (4%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQYY I+LG+P Q FKVI DTGSSNLW+PS C+ +IAC
Sbjct: 100 AQYYTEITLGSPAQTFKVILDTGSSNLWVPSSKCT--SIAC 138
>gi|358419664|ref|XP_002705480.2| PREDICTED: LOW QUALITY PROTEIN: pregnancy-associated glycoprotein 1
[Bos taurus]
Length = 327
Score = 89.4 bits (220), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 62/216 (28%), Positives = 101/216 (46%), Gaps = 52/216 (24%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHF 60
+R+ Y G I++GTP QEF+VIFDTG S+LW+PS C + ACS K V H+
Sbjct: 63 LRNIMDMLYEGNITIGTPLQEFQVIFDTGLSDLWMPSLLCR--SSACS----KQVRFRHY 116
Query: 61 RNAS-------------------------------------------DENGGEIMFGGVD 77
++++ G +MFGGVD
Sbjct: 117 KSSTFRPTQETFSIASRSGSMKGFLAYETVQETLKRRSPPLFSPHMXRAKGSVVMFGGVD 176
Query: 78 KDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSNC-QAIADTGTSLIIGPSKVIAEL 135
K G + + + + G W ++ I +++ + CS+C +A+ DT TSLI+GP +++ +
Sbjct: 177 KSYCQGALNWISLIKVGDWSVYMDRISMKRKIIVCSSCCKALVDTRTSLILGPRRLVNNI 236
Query: 136 NKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNF 171
KL GA+P V C + +P++ + G N+
Sbjct: 237 QKLSGAMP-QGSEHYVSCFVDNTLPSIIFTINGINY 271
>gi|342882947|gb|EGU83511.1| hypothetical protein FOXB_05921 [Fusarium oxysporum Fo5176]
Length = 396
Score = 89.4 bits (220), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 48/113 (42%), Positives = 63/113 (55%), Gaps = 1/113 (0%)
Query: 66 ENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVY-CSNCQAIADTGTSL 124
E E FGG+DK KF GDI Y P+ RK YW+ +E+I V N AI DTGTSL
Sbjct: 231 EGESEATFGGIDKSKFTGDIEYIPLRRKAYWEVDLEAIAFGDEVAEQENTGAILDTGTSL 290
Query: 125 IIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLTRTN 177
+ PS + LNK IGA NG ++CD +P++ L G N++L T+
Sbjct: 291 NVLPSALAELLNKEIGAKKGYNGQYTIECDKRASLPDITFNLAGSNYSLPATD 343
Score = 65.1 bits (157), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 33/69 (47%), Positives = 44/69 (63%), Gaps = 13/69 (18%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASD- 65
AQY+ I++GTPPQ FKV+ DTGSSNLW+PSQ C +IAC + HS + +++
Sbjct: 84 AQYFSEITIGTPPQSFKVVLDTGSSNLWVPSQQCG--SIACYL-------HSKYDSSASS 134
Query: 66 ---ENGGEI 71
ENG E
Sbjct: 135 TYKENGTEF 143
>gi|29244579|ref|NP_080249.2| gastricsin precursor [Mus musculus]
gi|73921722|sp|Q9D7R7.1|PEPC_MOUSE RecName: Full=Gastricsin; AltName: Full=Pepsinogen C; Flags:
Precursor
gi|12843461|dbj|BAB25990.1| unnamed protein product [Mus musculus]
gi|68534888|gb|AAH99409.1| Progastricsin (pepsinogen C) [Mus musculus]
Length = 392
Score = 89.4 bits (220), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 43/115 (37%), Positives = 74/115 (64%), Gaps = 4/115 (3%)
Query: 67 NGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCSN--CQAIADTGT 122
NGG+I+FGGVD++ + G++T+ PV+++ YWQ ++ I + + +CS+ CQ I DTGT
Sbjct: 224 NGGQIVFGGVDENLYTGELTWIPVTQELYWQITIDDFLIGNQASGWCSSSGCQGIVDTGT 283
Query: 123 SLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLTRTN 177
SL++ P++ + EL + IGA G V CD++ +P + +L G F L+ ++
Sbjct: 284 SLLVMPAQYLNELLQTIGAQEGEYGQYFVSCDSVSSLPTLTFVLNGVQFPLSPSS 338
Score = 57.4 bits (137), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 24/34 (70%), Positives = 28/34 (82%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHC 40
A YYG IS+GTPPQ F V+FDTGSSNLW+ S +C
Sbjct: 74 ASYYGEISIGTPPQNFLVLFDTGSSNLWVSSVYC 107
>gi|194756946|ref|XP_001960731.1| GF13504 [Drosophila ananassae]
gi|190622029|gb|EDV37553.1| GF13504 [Drosophila ananassae]
Length = 402
Score = 89.4 bits (220), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 43/110 (39%), Positives = 60/110 (54%), Gaps = 1/110 (0%)
Query: 65 DENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN-CQAIADTGTS 123
E GG + FGG + + + G+ TY PVSR+ YWQ +++ I K C N C+ I DTGTS
Sbjct: 231 QEEGGVLFFGGSNPEYYRGNFTYVPVSRRAYWQVKMDAATIRKLELCQNGCEVIIDTGTS 290
Query: 124 LIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
+ P +NK IG P A G V CD + +P + +GG+ F L
Sbjct: 291 FLALPYDQAILINKSIGGRPSAYGQFSVPCDQVSDLPRITFTMGGRAFFL 340
Score = 69.7 bits (169), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 30/53 (56%), Positives = 37/53 (69%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSH 59
AQY+G I++GTPPQ F+VIFDTGSSNLW+PS CS +AC V + S
Sbjct: 79 AQYFGPITIGTPPQTFQVIFDTGSSNLWVPSATCSSTMVACRVHSRYYARRSR 131
>gi|157836875|pdb|3PSG|A Chain A, The High Resolution Crystal Structure Of Porcine
Pepsinogen
Length = 370
Score = 89.4 bits (220), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 44/128 (34%), Positives = 78/128 (60%), Gaps = 5/128 (3%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-K 107
V +D F S + +++D++G ++ GG+D + G + + PVS +GYWQ ++SI ++ +
Sbjct: 190 VSQDLF---SVYLSSNDDSGSVVLLGGIDSSYYTGSLNWVPVSVEGYWQITLDSITMDGE 246
Query: 108 NVYCSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
+ CS CQAI DTGTSL+ GP+ IA + IGA ++G + C ++D +P++ +
Sbjct: 247 TIACSGGCQAIVDTGTSLLTGPTSAIANIQSDIGASENSDGEMVISCSSIDSLPDIVFTI 306
Query: 167 GGKNFTLT 174
G + L+
Sbjct: 307 DGVQYPLS 314
Score = 63.5 bits (153), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 29/54 (53%), Positives = 41/54 (75%), Gaps = 3/54 (5%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKF 54
+ + +Y+GTI +GTP Q+F VIFDTGSSNLW+PS +CS ++ACS D ++F
Sbjct: 50 LENYLDTEYFGTIGIGTPAQDFTVIFDTGSSNLWVPSVYCS--SLACS-DHNQF 100
>gi|154284392|ref|XP_001542991.1| vacuolar protease A precursor [Ajellomyces capsulatus NAm1]
gi|150406632|gb|EDN02173.1| vacuolar protease A precursor [Ajellomyces capsulatus NAm1]
Length = 398
Score = 89.4 bits (220), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 47/110 (42%), Positives = 64/110 (58%), Gaps = 1/110 (0%)
Query: 65 DENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKN-VYCSNCQAIADTGTS 123
D + E++FGG+++D+F G +T P+ RK YW+ ++SI K SN I DTGTS
Sbjct: 232 DSDDSEVVFGGMNEDRFTGKLTKIPLRRKAYWEVDLDSITFGKQTALMSNTGVILDTGTS 291
Query: 124 LIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
LI PS + LNK IGA NG V+C D +PN+ L G NFT+
Sbjct: 292 LIALPSTIAELLNKEIGAKKSFNGQYTVECAKRDSLPNLTFGLSGHNFTI 341
Score = 63.2 bits (152), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 30/61 (49%), Positives = 39/61 (63%), Gaps = 4/61 (6%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDK--DKFVGHSHFRNAS 64
AQY+ I +GTPPQ FKV+ DTGSSNLW+PS C +IAC + D +H +N S
Sbjct: 83 AQYFSEIGIGTPPQTFKVVLDTGSSNLWVPSSECG--SIACYLHNKYDSSASSTHKKNGS 140
Query: 65 D 65
+
Sbjct: 141 E 141
>gi|225460913|ref|XP_002279049.1| PREDICTED: aspartic proteinase [Vitis vinifera]
gi|297737462|emb|CBI26663.3| unnamed protein product [Vitis vinifera]
Length = 514
Score = 89.4 bits (220), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 43/84 (51%), Positives = 60/84 (71%), Gaps = 3/84 (3%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYC-SNCQAI 117
RN+++E GGEI+FGG+D D + G TY PV++KGYWQF + + I + +C S C AI
Sbjct: 234 RNSNEEVGGEIVFGGIDSDHYKGKHTYVPVTKKGYWQFDLGDVMIGGKTTGFCASGCSAI 293
Query: 118 ADTGTSLIIGPSKVIAELNKLIGA 141
AD+GTSL+ GP+ +I E+N IGA
Sbjct: 294 ADSGTSLLAGPTTIITEVNHAIGA 317
Score = 65.5 bits (158), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 28/41 (68%), Positives = 34/41 (82%), Gaps = 1/41 (2%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQY+G IS+GTPPQ+F VIFDTGSSNLW+PS C ++AC
Sbjct: 88 AQYFGEISIGTPPQKFTVIFDTGSSNLWVPSSKC-YFSVAC 127
Score = 40.4 bits (93), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 17/45 (37%), Positives = 28/45 (62%)
Query: 130 KVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
+++ +N+L +P G + VDC +L MPNV + +GGK F L+
Sbjct: 410 RILKYVNELCDRLPSPMGESAVDCGSLSSMPNVSLTIGGKVFDLS 454
>gi|322708430|gb|EFZ00008.1| vacuolar protease A [Metarhizium anisopliae ARSEF 23]
Length = 395
Score = 89.4 bits (220), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 48/119 (40%), Positives = 65/119 (54%), Gaps = 1/119 (0%)
Query: 60 FRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVY-CSNCQAIA 118
F S E G E +FGG+DKD + G I Y P+ RK YW+ + SI +V AI
Sbjct: 224 FYLGSSEEGSEAVFGGIDKDHYTGKIEYIPLRRKAYWEVDIHSIAFGDDVAELDRTGAIL 283
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLTRTN 177
DTGTSL + PS + LNK IGA NG VDC + +P++ L G N++L ++
Sbjct: 284 DTGTSLNVLPSTLAELLNKEIGAKKSWNGQYTVDCAQIKSLPDIVFNLAGSNYSLPASD 342
Score = 65.1 bits (157), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 28/41 (68%), Positives = 34/41 (82%), Gaps = 2/41 (4%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQY+ I++GTPPQ FKV+ DTGSSNLW+PSQ CS +IAC
Sbjct: 83 AQYFSEITVGTPPQTFKVVLDTGSSNLWVPSQSCS--SIAC 121
>gi|253723303|pdb|2PSG|A Chain A, Refined Structure Of Porcine Pepsinogen At 1.8 Angstroms
Resolution
Length = 370
Score = 89.4 bits (220), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 44/128 (34%), Positives = 78/128 (60%), Gaps = 5/128 (3%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-K 107
V +D F S + +++D++G ++ GG+D + G + + PVS +GYWQ ++SI ++ +
Sbjct: 190 VSQDLF---SVYLSSNDDSGSVVLLGGIDSSYYTGSLNWVPVSVEGYWQITLDSITMDGE 246
Query: 108 NVYCSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
+ CS CQAI DTGTSL+ GP+ IA + IGA ++G + C ++D +P++ +
Sbjct: 247 TIACSGGCQAIVDTGTSLLTGPTSAIANIQSDIGASENSDGEMVISCSSIDSLPDIVFTI 306
Query: 167 GGKNFTLT 174
G + L+
Sbjct: 307 DGVQYPLS 314
Score = 63.5 bits (153), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 29/54 (53%), Positives = 41/54 (75%), Gaps = 3/54 (5%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKF 54
+ + +Y+GTI +GTP Q+F VIFDTGSSNLW+PS +CS ++ACS D ++F
Sbjct: 50 LENYLDTEYFGTIGIGTPAQDFTVIFDTGSSNLWVPSVYCS--SLACS-DHNQF 100
>gi|46138535|ref|XP_390958.1| hypothetical protein FG10782.1 [Gibberella zeae PH-1]
gi|408391598|gb|EKJ70970.1| hypothetical protein FPSE_08829 [Fusarium pseudograminearum CS3096]
Length = 396
Score = 89.4 bits (220), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 46/113 (40%), Positives = 64/113 (56%), Gaps = 1/113 (0%)
Query: 66 ENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVY-CSNCQAIADTGTSL 124
E E FGGVDK K+ GD+ Y P+ RK YW+ +++I V N AI DTGTSL
Sbjct: 231 EGQSEATFGGVDKSKYTGDLEYIPLRRKAYWEVDLDAIAFGDEVAEQENTGAILDTGTSL 290
Query: 125 IIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLTRTN 177
+ PS + LNK IGA NG ++CD + +P++ L G N++L T+
Sbjct: 291 NVLPSALAELLNKEIGAKKGYNGQYTIECDKVSSLPDITFTLAGSNYSLPSTD 343
Score = 64.7 bits (156), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 35/78 (44%), Positives = 47/78 (60%), Gaps = 7/78 (8%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDK--DKFVGHSHFRNAS 64
AQY+ I++GTPPQ FKV+ DTGSSNLW+PSQ C +IAC + D ++ +N S
Sbjct: 84 AQYFSEITIGTPPQSFKVVLDTGSSNLWVPSQECG--SIACYLHSKYDSSASSTYKKNGS 141
Query: 65 DENGGEIMFGGVDKDKFV 82
+ EI +G FV
Sbjct: 142 EF---EIHYGSGSLSGFV 156
>gi|50306705|ref|XP_453326.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
gi|49642460|emb|CAH00422.1| KLLA0D05929p [Kluyveromyces lactis]
Length = 409
Score = 89.4 bits (220), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 47/110 (42%), Positives = 66/110 (60%), Gaps = 1/110 (0%)
Query: 66 ENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI-EKNVYCSNCQAIADTGTSL 124
E+GGE FGG+D++K+ G+IT+ PV RK YW+ E I + E+ A DTGTSL
Sbjct: 244 EDGGEASFGGIDEEKYTGEITWLPVRRKAYWEVKFEGIGLGEEYATLEGHGAAIDTGTSL 303
Query: 125 IIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
I PS + LN IGA +G VDC++ D +P++ + G NFT+T
Sbjct: 304 IALPSGLAEILNAEIGAKKGWSGQYSVDCESRDSLPDLTLNFNGYNFTIT 353
Score = 61.2 bits (147), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 34/83 (40%), Positives = 46/83 (55%), Gaps = 4/83 (4%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASD- 65
AQY+ I+LG+PPQ FKVI DTGSSNLW+PS C L D ++ N S+
Sbjct: 94 AQYFTEITLGSPPQSFKVILDTGSSNLWVPSAECGSLACFLHTKYDHEASSTYKANGSEF 153
Query: 66 --ENGGEIMFGGVDKDKF-VGDI 85
+ G + G V +D +GD+
Sbjct: 154 AIQYGSGSLEGYVSRDLLTIGDL 176
>gi|193499295|gb|ACF18590.1| pepsinogen C precursor [Siniperca scherzeri]
Length = 387
Score = 89.4 bits (220), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 40/119 (33%), Positives = 73/119 (61%), Gaps = 3/119 (2%)
Query: 58 SHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCSN-C 114
+ + ++ ++ G E+ FGGVD + G I ++PV+ + YWQ GV+ +I +++ +CS C
Sbjct: 207 AFYLSSGEQQGSELSFGGVDSSMYQGQIYWTPVTSETYWQIGVQGFQINGQESGWCSQGC 266
Query: 115 QAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
Q+I DTGTS++ PS+++ + + IGA G VDC ++ +P + ++ G +F L
Sbjct: 267 QSIVDTGTSMLTAPSQLLGYIMQAIGAQQSQYGMYMVDCSQVNNLPTLTFVISGVSFPL 325
Score = 56.6 bits (135), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 22/32 (68%), Positives = 28/32 (87%)
Query: 9 YYGTISLGTPPQEFKVIFDTGSSNLWIPSQHC 40
YYG IS+GTPPQ F+V+FDTGS+NLW+ S +C
Sbjct: 69 YYGAISIGTPPQSFQVLFDTGSANLWVDSVYC 100
>gi|73621391|sp|Q9GMY4.1|PEPC_SORUN RecName: Full=Gastricsin; AltName: Full=Pepsinogen C; Flags:
Precursor
gi|9798664|dbj|BAB11754.1| pepsinogen C [Sorex unguiculatus]
Length = 389
Score = 89.0 bits (219), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 43/113 (38%), Positives = 68/113 (60%), Gaps = 3/113 (2%)
Query: 64 SDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCSN-CQAIADT 120
S ++GG ++FGGVD + G I ++PV+++ YWQ GVE I + +CS CQAI DT
Sbjct: 219 SSKDGGAVVFGGVDNSLYTGQIFWTPVTQELYWQIGVEQFLIGGQATGWCSQGCQAIVDT 278
Query: 121 GTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
GTSL+ P + ++ L + GA +G V+C+N+ +P + ++ G F L
Sbjct: 279 GTSLLTVPQQYLSALQQATGAQLDQDGQMVVNCNNIQNLPTLTFVINGVQFPL 331
Score = 60.5 bits (145), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 29/54 (53%), Positives = 37/54 (68%), Gaps = 9/54 (16%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHF 60
A Y+G IS+GTPPQ F V+FDTGSSNLW+PS +C + AC+ GH+ F
Sbjct: 71 AAYFGEISIGTPPQNFLVLFDTGSSNLWVPSVYCQ--SQACT-------GHARF 115
>gi|327278828|ref|XP_003224162.1| PREDICTED: pepsin A-like isoform 2 [Anolis carolinensis]
Length = 386
Score = 89.0 bits (219), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 46/127 (36%), Positives = 75/127 (59%), Gaps = 5/127 (3%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-K 107
V +D F S + ++ D++G +MFGGVD + G + + P+S + YWQ ++SI + +
Sbjct: 206 VSQDLF---SVYLSSDDQSGSFVMFGGVDTSYYSGSLNWVPLSSESYWQITLDSITLNGQ 262
Query: 108 NVYCSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
++ CS CQAI DTGTSL+ GP IA + IGA +NG + C+ ++ +P++ +
Sbjct: 263 SIACSGGCQAIVDTGTSLLAGPPNGIANIQYYIGASQSSNGGYMISCNAMNSLPDIIFTI 322
Query: 167 GGKNFTL 173
G F L
Sbjct: 323 NGIEFPL 329
Score = 58.5 bits (140), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 28/47 (59%), Positives = 38/47 (80%), Gaps = 3/47 (6%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKF 54
+Y GTIS+GTP Q+F V+FDTGSSNLW+PS +CS + ACS + ++F
Sbjct: 73 EYIGTISIGTPAQQFVVLFDTGSSNLWVPSVYCS--SSACS-NHNRF 116
>gi|307177550|gb|EFN66654.1| Lysosomal aspartic protease [Camponotus floridanus]
Length = 328
Score = 89.0 bits (219), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 50/120 (41%), Positives = 71/120 (59%), Gaps = 10/120 (8%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPV----SRKGYWQFGVESIKIEKN-VYC-SNC 114
RN S + GG+++FGG D + GD TY PV + KGYWQF ++SI+I +N C ++C
Sbjct: 187 RNTSADLGGKLIFGGSDPACYEGDFTYIPVLHIFTDKGYWQFIIDSIQINENFTLCEASC 246
Query: 115 QAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
A DT IIGP K ++ +N+ I N +VDCD + ++P + LGGK F LT
Sbjct: 247 YATVDTSAWKIIGPEKDVSSINRFIE----TNSQGRVDCDRIFQLPTIRFNLGGKAFNLT 302
Score = 52.4 bits (124), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 24/45 (53%), Positives = 30/45 (66%), Gaps = 2/45 (4%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDK 51
YYG I +GTP QEFKVIFD+ S+N W+ S+ C+ N CS K
Sbjct: 36 VNYYGCIKIGTPQQEFKVIFDSRSANFWVFSEKCT--NPICSRHK 78
>gi|327278826|ref|XP_003224161.1| PREDICTED: pepsin A-like isoform 1 [Anolis carolinensis]
Length = 387
Score = 89.0 bits (219), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 46/127 (36%), Positives = 75/127 (59%), Gaps = 5/127 (3%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-K 107
V +D F S + ++ D++G +MFGGVD + G + + P+S + YWQ ++SI + +
Sbjct: 206 VSQDLF---SVYLSSDDQSGSFVMFGGVDTSYYSGSLNWVPLSSESYWQITLDSITLNGQ 262
Query: 108 NVYCSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
++ CS CQAI DTGTSL+ GP IA + IGA +NG + C+ ++ +P++ +
Sbjct: 263 SIACSGGCQAIVDTGTSLLAGPPNGIANIQYYIGASQSSNGGYMISCNAMNSLPDIIFTI 322
Query: 167 GGKNFTL 173
G F L
Sbjct: 323 NGIEFPL 329
Score = 58.5 bits (140), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 28/47 (59%), Positives = 38/47 (80%), Gaps = 3/47 (6%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKF 54
+Y GTIS+GTP Q+F V+FDTGSSNLW+PS +CS + ACS + ++F
Sbjct: 73 EYIGTISIGTPAQQFVVLFDTGSSNLWVPSVYCS--SSACS-NHNRF 116
>gi|56752671|gb|AAW24549.1| SJCHGC02875 protein [Schistosoma japonicum]
Length = 387
Score = 89.0 bits (219), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 39/113 (34%), Positives = 68/113 (60%), Gaps = 1/113 (0%)
Query: 62 NASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCS-NCQAIADT 120
N GE+M GG+DK + G+ITYS V + +W ++ I I+ ++C A+ DT
Sbjct: 210 NEDKTTSGELMLGGIDKRYYTGEITYSSVVNQEFWMINIDGISIKDKIFCPPGSTALIDT 269
Query: 121 GTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
GT+L++GP++ I +N+ +G+V + VDC+ + ++P +DI + GK+ L
Sbjct: 270 GTALLLGPTEAIDNINRYLGSVRQSTMEYNVDCNKIHELPPIDIKINGKSIQL 322
Score = 62.8 bits (151), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 28/46 (60%), Positives = 36/46 (78%), Gaps = 1/46 (2%)
Query: 3 SSYQ-AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
++YQ +YYG IS+GTPPQ F+V+FDTGS+ LWIPS C N+AC
Sbjct: 71 TNYQNIEYYGEISIGTPPQIFRVMFDTGSAYLWIPSIKCDPTNLAC 116
>gi|5921659|gb|AAD56288.1|AF156792_1 pepsinogen A form IIa [Pseudopleuronectes americanus]
Length = 151
Score = 89.0 bits (219), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 47/118 (39%), Positives = 73/118 (61%), Gaps = 3/118 (2%)
Query: 58 SHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVY-CS-NCQ 115
S + + + + G + FGGVD + + G IT+ P+S + YWQ V+S+ + V CS CQ
Sbjct: 3 SVYLSPNAQQGSVVTFGGVDPNHYNGAITWIPLSSELYWQITVDSVTVNGQVVACSGGCQ 62
Query: 116 AIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
AI DTGTSLI+GP I+ +N +GA NG V+C+++ +MP+V + G+ FT+
Sbjct: 63 AIVDTGTSLIVGPQSSISNINNYVGATS-QNGDYVVNCNSISQMPDVIFHIHGQQFTI 119
>gi|238231529|ref|NP_001153947.1| pepsinogen precursor [Oncorhynchus mykiss]
gi|195970349|gb|ACG60662.1| pepsinogen [Oncorhynchus mykiss]
Length = 377
Score = 89.0 bits (219), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 44/118 (37%), Positives = 66/118 (55%), Gaps = 2/118 (1%)
Query: 58 SHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN--CQ 115
S + + + G + FG ++ + + G IT+ P+S + YWQ ++S+ I N N CQ
Sbjct: 207 SVYLSGNSAQGSVVSFGDIESNYYTGQITWIPLSSETYWQINMDSVTINGNTVACNGGCQ 266
Query: 116 AIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
AI DTGTS I+GP+ I +N +GA G A V+C+N+ MP V L G FT+
Sbjct: 267 AIIDTGTSQIVGPTTDINNMNSWVGATTDQYGDATVNCNNIPNMPEVTFTLNGNAFTI 324
Score = 63.5 bits (153), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 26/33 (78%), Positives = 29/33 (87%)
Query: 9 YYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCS 41
YYG IS+GTPPQ F VIFDTGSSNLW+PS +CS
Sbjct: 70 YYGVISIGTPPQSFNVIFDTGSSNLWVPSVYCS 102
>gi|45643446|gb|AAS72876.1| aspartyl protease [Triatoma infestans]
Length = 387
Score = 89.0 bits (219), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 48/118 (40%), Positives = 71/118 (60%), Gaps = 6/118 (5%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGD-ITYSPVSRKGYWQFGVESIKIEKN---VYCSN-CQ 115
RN +E GGEI+FGGVD+ +F + +T P++ YW F ++ + +C N C+
Sbjct: 214 RNPDEEVGGEIIFGGVDEKRFNKESLTTVPLTNPTYWMFKMDEVSTSGTNGKSWCQNGCR 273
Query: 116 AIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
A ADTGTS I+GP+K +AE+ + + A L G V CD L K+P++ L GK +TL
Sbjct: 274 ATADTGTSFIVGPTKEVAEIMEFLDAQVL-QGVGYVPCDELHKLPDITFHLNGKGYTL 330
Score = 63.5 bits (153), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 26/49 (53%), Positives = 36/49 (73%), Gaps = 2/49 (4%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSV 49
+R+S QYYG ++LGTPPQE V+FDTGS+NLW+P +C + AC +
Sbjct: 63 LRNSLNTQYYGNVTLGTPPQELTVVFDTGSANLWVPLANCP--SFACII 109
>gi|22330379|ref|NP_176419.2| phytepsin [Arabidopsis thaliana]
gi|79320483|ref|NP_001031219.1| phytepsin [Arabidopsis thaliana]
gi|75331143|sp|Q8VYL3.1|APA2_ARATH RecName: Full=Aspartic proteinase A2; AltName: Full=Aspartic
protease 57; Short=AtASP57; Flags: Precursor
gi|17979428|gb|AAL49856.1| putative aspartic protease [Arabidopsis thaliana]
gi|23297031|gb|AAN13225.1| putative aspartic protease [Arabidopsis thaliana]
gi|222424000|dbj|BAH19961.1| AT1G62290 [Arabidopsis thaliana]
gi|332195825|gb|AEE33946.1| phytepsin [Arabidopsis thaliana]
gi|332195826|gb|AEE33947.1| phytepsin [Arabidopsis thaliana]
Length = 513
Score = 89.0 bits (219), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 48/109 (44%), Positives = 68/109 (62%), Gaps = 4/109 (3%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYC-SNCQAI 117
R+ E GGEI+FGGVD F G+ T+ PV+++GYWQF + + I E YC S C AI
Sbjct: 233 RDPKSEEGGEIVFGGVDPKHFRGEHTFVPVTQRGYWQFDMGEVLIAGESTGYCGSGCSAI 292
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
AD+GTSL+ GP+ V+A +NK IGA + + K D + +D++L
Sbjct: 293 ADSGTSLLAGPTAVVAMINKAIGASGVVSQQCKTVVDQYGQTI-LDLLL 340
Score = 65.1 bits (157), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 27/47 (57%), Positives = 37/47 (78%), Gaps = 1/47 (2%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+++ AQYYG I++GTPPQ+F VIFDTGSSNLW+PS C +++C
Sbjct: 81 LKNYLDAQYYGEIAIGTPPQKFTVIFDTGSSNLWVPSGKC-FFSLSC 126
Score = 41.2 bits (95), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 17/45 (37%), Positives = 26/45 (57%)
Query: 130 KVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
+++ +N++ +P NG + VDC L KMP V +GGK F L
Sbjct: 409 RIVNYINEICERMPSPNGESAVDCSQLSKMPTVSFTIGGKVFDLA 453
>gi|327271205|ref|XP_003220378.1| PREDICTED: gastricsin-like [Anolis carolinensis]
Length = 388
Score = 89.0 bits (219), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 45/116 (38%), Positives = 66/116 (56%), Gaps = 4/116 (3%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCSN-CQAI 117
R + + GGE++ GGVD F GD++++PV+R+ YWQ GVE I E +CS CQAI
Sbjct: 216 RQPTVQYGGELILGGVDTQLFSGDVSWAPVTREVYWQIGVEEFAIGNEATGWCSEGCQAI 275
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
DTGT + P + + +GA NG V+C+N+ MP + ++ G F L
Sbjct: 276 VDTGTCQLTIPRQYFDTFLQAVGAEEY-NGELLVNCNNVQNMPTITFVINGAQFPL 330
Score = 54.7 bits (130), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 21/32 (65%), Positives = 27/32 (84%)
Query: 9 YYGTISLGTPPQEFKVIFDTGSSNLWIPSQHC 40
Y+G IS+GTPPQ F V+ D+GSSNLW+PS +C
Sbjct: 73 YFGEISIGTPPQNFLVVMDSGSSNLWVPSVYC 104
>gi|74136511|ref|NP_001028152.1| gastricsin precursor [Monodelphis domestica]
gi|73621388|sp|Q689Z7.1|PEPC_MONDO RecName: Full=Gastricsin; AltName: Full=Pepsinogen C; Flags:
Precursor
gi|51534970|dbj|BAD36918.1| pepsinogen C [Monodelphis domestica]
Length = 391
Score = 89.0 bits (219), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 42/116 (36%), Positives = 66/116 (56%), Gaps = 3/116 (2%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCSN-CQAI 117
R + + GGE++ GGVD + G IT++PV+++ YWQ G+E I + +CS CQAI
Sbjct: 218 RQPTHQYGGELILGGVDPQLYSGQITWTPVTQEVYWQIGIEEFAIGNQATGWCSQGCQAI 277
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
DTGT L+ P + ++ + GA NG V+C+ + MP + ++ G F L
Sbjct: 278 VDTGTFLLAVPQQYMSAFLQATGAQQAQNGDFMVNCNYIQDMPTITFVINGSQFPL 333
Score = 60.5 bits (145), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 31/62 (50%), Positives = 45/62 (72%), Gaps = 6/62 (9%)
Query: 9 YYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASDENG 68
Y+G IS+GTPPQ F V+FDTGSSNLW+PS +C + ACS + ++F S ++++ NG
Sbjct: 75 YFGEISIGTPPQNFLVLFDTGSSNLWVPSTYCQ--SQACS-NHNRF---SPSQSSTFTNG 128
Query: 69 GE 70
G+
Sbjct: 129 GQ 130
>gi|122890420|emb|CAM12780.1| aspartic proteinase [Fagopyrum esculentum]
Length = 506
Score = 89.0 bits (219), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 43/84 (51%), Positives = 60/84 (71%), Gaps = 3/84 (3%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCS-NCQAI 117
RNA +E GGEI+FGG+D G+ TY PV++KGYWQF ++ + + E +CS C AI
Sbjct: 226 RNADEEVGGEIVFGGIDPAHHKGEHTYLPVTQKGYWQFDLDDVLVGGESTGFCSGGCSAI 285
Query: 118 ADTGTSLIIGPSKVIAELNKLIGA 141
AD+GTSL+ GP+ V+A++N IGA
Sbjct: 286 ADSGTSLLAGPTPVVAQINHAIGA 309
Score = 62.4 bits (150), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 28/47 (59%), Positives = 36/47 (76%), Gaps = 1/47 (2%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+++ AQY+G I++GTP Q F VIFDTGSSNLW+PS C L+IAC
Sbjct: 74 LKNYMDAQYFGEIAIGTPSQTFTVIFDTGSSNLWVPSGKC-YLSIAC 119
Score = 36.6 bits (83), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 16/45 (35%), Positives = 24/45 (53%)
Query: 130 KVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
+++ N+L +P G + VDC +L +P V LGGK F L
Sbjct: 402 RILDYANQLCERLPSPMGESAVDCGSLSTLPTVSFTLGGKTFALA 446
>gi|268579917|ref|XP_002644941.1| C. briggsae CBR-ASP-3 protein [Caenorhabditis briggsae]
Length = 397
Score = 88.6 bits (218), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 43/115 (37%), Positives = 68/115 (59%), Gaps = 2/115 (1%)
Query: 61 RNASD-ENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN-CQAIA 118
R+A+D NGGEI D + + G+I + P+ YW+ + ++ I+ + + +I
Sbjct: 219 RDANDIANGGEITLCDTDSNHYTGNIAWEPLVSMDYWRIKLGAVSIQGTTFTNGPMDSIV 278
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
DTGTSL+ GPS +I ++ IGA+PL NG +V C + +PN+ LGG+NF L
Sbjct: 279 DTGTSLLTGPSDIIKKIQHKIGAIPLLNGEYEVACSKIPSLPNITFTLGGQNFDL 333
Score = 65.9 bits (159), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 27/43 (62%), Positives = 34/43 (79%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSV 49
AQYYG I +GTPPQ F+V+FDTGSSNLW+P +C +IAC +
Sbjct: 67 AQYYGPIVIGTPPQNFQVLFDTGSSNLWVPCANCPFGDIACRM 109
>gi|432099182|gb|ELK28547.1| Cathepsin D [Myotis davidii]
Length = 351
Score = 88.6 bits (218), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 45/117 (38%), Positives = 65/117 (55%), Gaps = 3/117 (2%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYC--SNCQAIA 118
R+ S + GGE+M GG D + G I Y V+RK YWQ ++ + + + C+AI
Sbjct: 191 RDPSAQPGGELMLGGTDSKYYKGPIAYLNVTRKAYWQVHMDQVDVGNGLTLCKEGCEAIV 250
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPN-VDIILGGKNFTLT 174
DTGTSL++GP + EL K IGAVPL G + C+ + +P + GGK L+
Sbjct: 251 DTGTSLMVGPVDEVRELQKAIGAVPLIQGEYMIPCEKVSSLPEPSQVSQGGKTICLS 307
Score = 76.6 bits (187), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 44/107 (41%), Positives = 60/107 (56%), Gaps = 7/107 (6%)
Query: 6 QAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASD 65
+AQYYG I +GTPPQ F V+FDTGSSNLW+PS HC +L+IAC + G S +
Sbjct: 34 EAQYYGEIGIGTPPQCFTVVFDTGSSNLWVPSIHCKLLDIACWIHHKYNSGKS---STYV 90
Query: 66 ENGG--EIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVY 110
ENG +I +G ++ T S G G +K+E+ V+
Sbjct: 91 ENGTTFDIHYGSGSLSGYLSQDTVSVPCNSGLASLG--GVKVERQVF 135
>gi|190576563|gb|ACE79054.1| gastricsin precursor (predicted) [Sorex araneus]
Length = 389
Score = 88.6 bits (218), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 43/113 (38%), Positives = 67/113 (59%), Gaps = 3/113 (2%)
Query: 64 SDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCSN-CQAIADT 120
S ++GG ++FGGVD + G I ++PV+++ YWQ GVE I + +CS CQAI DT
Sbjct: 219 SSQDGGAVVFGGVDNSLYTGQIFWTPVTQELYWQIGVEQFLIGGQATGWCSQGCQAIVDT 278
Query: 121 GTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
GTSL+ P + ++ L + GA G V+C+N+ +P + ++ G F L
Sbjct: 279 GTSLLTVPQQYMSALQQATGAQLDQYGQMVVNCNNIQNLPTLTFVINGVQFPL 331
Score = 60.5 bits (145), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 29/54 (53%), Positives = 37/54 (68%), Gaps = 9/54 (16%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHF 60
A Y+G IS+GTPPQ F V+FDTGSSNLW+PS +C + AC+ GH+ F
Sbjct: 71 AAYFGEISIGTPPQNFLVLFDTGSSNLWVPSVYCQ--SQACT-------GHARF 115
>gi|426250269|ref|XP_004018860.1| PREDICTED: gastricsin [Ovis aries]
Length = 431
Score = 88.6 bits (218), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 42/111 (37%), Positives = 66/111 (59%), Gaps = 3/111 (2%)
Query: 66 ENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCS-NCQAIADTGT 122
++GG ++FGGVD + G I ++PV+++ YWQ G+E I + +CS CQAI DTGT
Sbjct: 222 QDGGAVIFGGVDSRLYTGQIYWAPVTQELYWQIGIEEFLIGDQATGWCSAGCQAIVDTGT 281
Query: 123 SLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
SL+ P + ++ L + GA G VDC+N+ +P + ++ G F L
Sbjct: 282 SLLTVPQQFLSALLQATGAQKDQYGQFPVDCNNIQNLPTLTFVINGMQFPL 332
Score = 59.3 bits (142), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 24/34 (70%), Positives = 29/34 (85%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHC 40
A Y+G IS+GTPPQ F V+FDTGSSNLW+PS +C
Sbjct: 72 AAYFGEISIGTPPQNFLVLFDTGSSNLWVPSLYC 105
>gi|195997419|ref|XP_002108578.1| expressed hypothetical protein [Trichoplax adhaerens]
gi|190589354|gb|EDV29376.1| expressed hypothetical protein [Trichoplax adhaerens]
Length = 383
Score = 88.6 bits (218), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 44/116 (37%), Positives = 64/116 (55%), Gaps = 3/116 (2%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN--CQAIA 118
R GGE++ GG D + + G Y P+SR+ YWQF ++ K+ N CQAIA
Sbjct: 211 RKQGGAVGGELILGGSDPNYYSGQFNYVPLSRESYWQFAMDGGKVATGTTVCNGGCQAIA 270
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGT+LI+GP + + + + IGA A G VDC + +P + + G N+ LT
Sbjct: 271 DTGTTLIVGPPEDVQRIQQAIGAQN-AGGQYTVDCSTISSLPTITFTINGVNYPLT 325
Score = 63.9 bits (154), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 29/48 (60%), Positives = 37/48 (77%), Gaps = 2/48 (4%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKF 54
AQYYG I++GTPPQ FK++FDTGSS+LW+PS C+ N AC DK+
Sbjct: 65 AQYYGPITIGTPPQNFKILFDTGSSDLWVPSTKCN-GNAACE-SHDKY 110
>gi|195046646|ref|XP_001992192.1| GH24624 [Drosophila grimshawi]
gi|193893033|gb|EDV91899.1| GH24624 [Drosophila grimshawi]
Length = 446
Score = 88.6 bits (218), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 41/110 (37%), Positives = 67/110 (60%), Gaps = 4/110 (3%)
Query: 62 NASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSNCQAIADTG 121
+ S+ GGE++FGG D F+G++ Y+ V+R+GYWQF +++ + N+ C+NC AI DTG
Sbjct: 227 DGSNSYGGELIFGGSDPMWFIGNMVYTNVTREGYWQFMMDNATLNGNILCTNCPAIVDTG 286
Query: 122 TSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNF 171
TSLI+ P+ + +++G N VDC + +MP + + G F
Sbjct: 287 TSLIMAPAASYQIIMEVVG----VNVFGIVDCSTISQMPVLTFAINGVMF 332
Score = 61.6 bits (148), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 23/33 (69%), Positives = 30/33 (90%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHC 40
QYYG I++GTPPQ+F+V+FDTGS+NLW+PS C
Sbjct: 83 QYYGLITIGTPPQDFQVVFDTGSANLWVPSIKC 115
>gi|77808107|gb|AAV84085.2| aspartic proteinase 9 [Fagopyrum esculentum]
Length = 506
Score = 88.6 bits (218), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 43/84 (51%), Positives = 60/84 (71%), Gaps = 3/84 (3%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCS-NCQAI 117
RNA +E GGEI+FGG+D G+ TY PV++KGYWQF ++ + + E +CS C AI
Sbjct: 226 RNADEEIGGEIVFGGIDPAHHKGEHTYLPVTQKGYWQFDLDDVLVGGESTGFCSGGCSAI 285
Query: 118 ADTGTSLIIGPSKVIAELNKLIGA 141
AD+GTSL+ GP+ V+A++N IGA
Sbjct: 286 ADSGTSLLAGPTPVVAQINHAIGA 309
Score = 62.4 bits (150), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 28/47 (59%), Positives = 36/47 (76%), Gaps = 1/47 (2%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+++ AQY+G I++GTP Q F VIFDTGSSNLW+PS C L+IAC
Sbjct: 74 LKNYMDAQYFGEIAIGTPSQTFTVIFDTGSSNLWVPSGKC-YLSIAC 119
Score = 36.6 bits (83), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 16/45 (35%), Positives = 24/45 (53%)
Query: 130 KVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
+++ N+L +P G + VDC +L +P V LGGK F L
Sbjct: 402 RILDYANQLCERLPSPMGESAVDCGSLSTLPTVSFTLGGKTFALA 446
>gi|351724625|ref|NP_001237064.1| aspartic proteinase 1 precursor [Glycine max]
gi|15186732|dbj|BAB62890.1| aspartic proteinase 1 [Glycine max]
Length = 514
Score = 88.6 bits (218), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 45/84 (53%), Positives = 57/84 (67%), Gaps = 3/84 (3%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCSN-CQAI 117
RN +E GGEI+FGGVD + G TY PV+RKGYWQF + + I + YC+N C AI
Sbjct: 234 RNPEEEEGGEIVFGGVDPAHYKGKHTYVPVTRKGYWQFDMGDVLIGGKPTGYCANGCSAI 293
Query: 118 ADTGTSLIIGPSKVIAELNKLIGA 141
AD+GTSL+ GP+ VI +N IGA
Sbjct: 294 ADSGTSLLAGPTTVITMINHAIGA 317
Score = 64.3 bits (155), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 27/47 (57%), Positives = 37/47 (78%), Gaps = 1/47 (2%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+++ AQYYG I++GT PQ+F VIFDTGSSNLW+PS C+ ++AC
Sbjct: 82 LKNYLDAQYYGEIAIGTSPQKFAVIFDTGSSNLWVPSSKCT-FSVAC 127
Score = 36.6 bits (83), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 14/45 (31%), Positives = 27/45 (60%)
Query: 130 KVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
++++ +N+L +P G + VDC N+ +P V +GG+ F L+
Sbjct: 410 QILSYINQLCDKMPSPMGESAVDCGNISSLPVVSFTIGGRTFDLS 454
>gi|189011689|ref|NP_001098064.1| pepsin A precursor [Macaca mulatta]
gi|129793|sp|P11489.1|PEPA_MACMU RecName: Full=Pepsin A; Flags: Precursor
gi|342275|gb|AAA36902.1| pepsinogen A precursor (EC 3.4.23.1) [Macaca mulatta]
Length = 388
Score = 88.6 bits (218), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 46/127 (36%), Positives = 74/127 (58%), Gaps = 5/127 (3%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-K 107
V +D F S + +A D++G ++FGG+D + G + + PVS +GYWQ V+SI + +
Sbjct: 208 VSQDLF---SVYLSADDQSGSVVIFGGIDSSYYTGSLNWVPVSVEGYWQISVDSITMNGE 264
Query: 108 NVYCSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
+ C+ CQAI DTGTSL+ GP+ IA + IGA ++G V C + +P++ +
Sbjct: 265 AIACAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASENSDGEMVVSCSAISSLPDIVFTI 324
Query: 167 GGKNFTL 173
G + L
Sbjct: 325 NGVQYPL 331
Score = 60.8 bits (146), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 26/41 (63%), Positives = 35/41 (85%), Gaps = 2/41 (4%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACS 48
+Y+GTI +GTP Q+F VIFDTGSSNLW+PS +CS ++AC+
Sbjct: 75 EYFGTIGIGTPAQDFTVIFDTGSSNLWVPSVYCS--SLACT 113
>gi|403217759|emb|CCK72252.1| hypothetical protein KNAG_0J01710 [Kazachstania naganishii CBS
8797]
Length = 415
Score = 88.6 bits (218), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 47/126 (37%), Positives = 72/126 (57%), Gaps = 1/126 (0%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI-EK 107
+D+ KF + N E+GGE +FGGVDK K+ GD+T+ PV RK YW+ +E + + ++
Sbjct: 233 LDEAKFAFYLGDTNKDAEDGGEAIFGGVDKSKYTGDVTWLPVRRKAYWEVKLEGLGLGDE 292
Query: 108 NVYCSNCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILG 167
+ A DTGTSLI PS + +N IGA G ++C+ D++P++
Sbjct: 293 YAELESHGAAIDTGTSLITLPSGLAEIINSEIGAKKGWTGQYTLECNTRDQLPDLTFNFN 352
Query: 168 GKNFTL 173
G NFT+
Sbjct: 353 GYNFTI 358
Score = 67.0 bits (162), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 37/84 (44%), Positives = 48/84 (57%), Gaps = 4/84 (4%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASD- 65
AQYY I+LGTPPQ+FKVI DTGSSNLW+PS C L D S+ N +D
Sbjct: 99 AQYYTDITLGTPPQQFKVILDTGSSNLWVPSSECGSLACFLHEKYDHSASSSYKANGTDF 158
Query: 66 --ENGGEIMFGGVDKDKF-VGDIT 86
+ G + G + +D +GD+T
Sbjct: 159 SIQYGSGSLEGYISQDTLSIGDLT 182
>gi|392568782|gb|EIW61956.1| aspartic peptidase A1 [Trametes versicolor FP-101664 SS1]
Length = 415
Score = 88.6 bits (218), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 46/115 (40%), Positives = 65/115 (56%), Gaps = 1/115 (0%)
Query: 64 SDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI-EKNVYCSNCQAIADTGT 122
S+E+GGE +FGG+D + G I Y PV RK YW+ +E I++ ++ + N A DTGT
Sbjct: 247 SEEDGGEAIFGGIDDSAYSGKIEYVPVRRKAYWEVELEKIRLGDEELELENTGAAIDTGT 306
Query: 123 SLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLTRTN 177
SLI PS + LN IGA NG VDC + +P++ GK + L T+
Sbjct: 307 SLIALPSDLAEMLNAQIGAKKSWNGQYTVDCAKVPDLPDLTFFFNGKPYVLKGTD 361
Score = 66.2 bits (160), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 36/84 (42%), Positives = 48/84 (57%), Gaps = 4/84 (4%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASD- 65
AQY+ I+LGTPPQ FKVI DTGSSNLW+PS C+ + D ++ N S+
Sbjct: 101 AQYFAEITLGTPPQSFKVILDTGSSNLWVPSTKCTSIACFLHAKYDSSASSTYKANGSEF 160
Query: 66 --ENGGEIMFGGVDKDKF-VGDIT 86
+ G M G V +D +GD+T
Sbjct: 161 SIQYGSGSMEGFVSRDVLTIGDLT 184
>gi|193499297|gb|ACF18591.1| pepsinogen C precursor [Siniperca chuatsi]
gi|253762213|gb|ACT35558.1| pepsinogen C precursor [Siniperca scherzeri]
Length = 387
Score = 88.6 bits (218), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 40/119 (33%), Positives = 73/119 (61%), Gaps = 3/119 (2%)
Query: 58 SHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCSN-C 114
+ + ++ ++ G E+ FGGVD + G I ++PV+ + YWQ GV+ +I +++ +CS C
Sbjct: 207 AFYLSSGEQQGSELSFGGVDSSMYQGQIYWTPVTSETYWQIGVQGFQINGQESGWCSQGC 266
Query: 115 QAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
Q+I DTGTS++ PS+++ + + IGA G VDC ++ +P + ++ G +F L
Sbjct: 267 QSIVDTGTSMLTAPSQLLGYIMQAIGAQQNQYGMYMVDCSQVNNLPTLTFVISGVSFPL 325
Score = 56.6 bits (135), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 22/32 (68%), Positives = 28/32 (87%)
Query: 9 YYGTISLGTPPQEFKVIFDTGSSNLWIPSQHC 40
YYG IS+GTPPQ F+V+FDTGS+NLW+ S +C
Sbjct: 69 YYGAISIGTPPQSFQVLFDTGSANLWVDSVYC 100
>gi|354497676|ref|XP_003510945.1| PREDICTED: napsin-A [Cricetulus griseus]
Length = 569
Score = 88.6 bits (218), Expect = 1e-15, Method: Composition-based stats.
Identities = 46/116 (39%), Positives = 65/116 (56%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAIA 118
R+A +GGE++ GG D ++ +T+ PV+ YWQ +ES+ + + C+ C I
Sbjct: 218 RDAEGSDGGELVLGGSDPAHYIPPLTFIPVTIPAYWQVHMESVNVGTGLSLCAQGCGVIL 277
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSLI GPS+ I LNK IG +P G + C ++P V LGG F LT
Sbjct: 278 DTGTSLITGPSEEIHALNKAIGGLPFLAGQYFIQCSKTPELPTVSFRLGGVWFNLT 333
Score = 65.9 bits (159), Expect = 7e-09, Method: Composition-based stats.
Identities = 25/40 (62%), Positives = 30/40 (75%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
QY+G I LGTPPQ F V+FDTGSSNLW+PS C ++ C
Sbjct: 72 QYFGDIGLGTPPQNFTVVFDTGSSNLWVPSVRCHFFSLPC 111
>gi|365986877|ref|XP_003670270.1| hypothetical protein NDAI_0E02105 [Naumovozyma dairenensis CBS 421]
gi|343769040|emb|CCD25027.1| hypothetical protein NDAI_0E02105 [Naumovozyma dairenensis CBS 421]
Length = 408
Score = 88.6 bits (218), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 56/151 (37%), Positives = 79/151 (52%), Gaps = 3/151 (1%)
Query: 24 VIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVG 83
+ +D+ S N +P + +I +D+ KF + N E+GGEI GG+DK KF G
Sbjct: 203 LAYDSISVNKVVPPFYNAIEQEL--LDEKKFAFYLGDANKKSEDGGEITIGGIDKTKFKG 260
Query: 84 DITYSPVSRKGYWQFGVESIKI-EKNVYCSNCQAIADTGTSLIIGPSKVIAELNKLIGAV 142
DI + PV RK YW+ E I + ++ N A DTGTSLI PS + +N IGA
Sbjct: 261 DIDWLPVRRKAYWEVKFEGIGLGDQFAELENHGAAIDTGTSLIALPSGLAEIINTEIGAK 320
Query: 143 PLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
G V+CD +P++ GKNFT+
Sbjct: 321 KGWTGQYTVECDARPNLPDLTFNFNGKNFTI 351
Score = 61.6 bits (148), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 30/51 (58%), Positives = 36/51 (70%), Gaps = 6/51 (11%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSV----DKDK 53
AQY+ I+LGTPPQ FKVI DTGSSNLW+PS C ++AC + D DK
Sbjct: 92 AQYFADITLGTPPQSFKVILDTGSSNLWVPSVECG--SLACYLHSKYDHDK 140
>gi|410982348|ref|XP_003997519.1| PREDICTED: napsin-A [Felis catus]
Length = 422
Score = 88.6 bits (218), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 44/116 (37%), Positives = 66/116 (56%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAIA 118
R+ +GGE++ GG D ++ +T+ PV+ YWQ +E +K+ + C+ C AI
Sbjct: 221 RDPEAADGGELVLGGSDPAHYIPPLTFVPVTIPAYWQIHMERMKVGTGLTLCAQGCAAIL 280
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSLI GP++ I LN IG + L G + C+ + +P V +LGG F LT
Sbjct: 281 DTGTSLITGPTEEIRALNTAIGGISLLVGEYLIQCETIPTLPPVSFLLGGVWFNLT 336
Score = 65.1 bits (157), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 26/40 (65%), Positives = 30/40 (75%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
QYYG I LGTPPQ F V+FDTGSSNLW+PS C ++ C
Sbjct: 75 QYYGEIGLGTPPQNFSVVFDTGSSNLWVPSIRCHFFSLPC 114
>gi|164604|gb|AAA31096.1| pepsinogen A precursor [Sus scrofa]
Length = 385
Score = 88.6 bits (218), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 44/128 (34%), Positives = 77/128 (60%), Gaps = 5/128 (3%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-K 107
V +D F S + +++D++G ++ GG+D + G + + PVS +GYWQ ++SI ++ +
Sbjct: 205 VSQDLF---SVYLSSNDDSGSVVLLGGIDSSYYTGSLNWVPVSVEGYWQITLDSITMDGE 261
Query: 108 NVYCSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
+ CS CQAI DTGTSL+ GP+ IA + IGA + G + C ++D +P++ +
Sbjct: 262 TIACSGGCQAIVDTGTSLLTGPTSAIANIQSDIGASENSYGEMVISCSSIDSLPDIVFTI 321
Query: 167 GGKNFTLT 174
G + L+
Sbjct: 322 NGVQYPLS 329
Score = 63.2 bits (152), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 29/54 (53%), Positives = 41/54 (75%), Gaps = 3/54 (5%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKF 54
+ + +Y+GTI +GTP Q+F VIFDTGSSNLW+PS +CS ++ACS D ++F
Sbjct: 65 LENYLDTEYFGTIGIGTPAQDFTVIFDTGSSNLWVPSVYCS--SLACS-DHNQF 115
>gi|195382956|ref|XP_002050194.1| GJ22010 [Drosophila virilis]
gi|194144991|gb|EDW61387.1| GJ22010 [Drosophila virilis]
Length = 394
Score = 88.2 bits (217), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 43/114 (37%), Positives = 60/114 (52%), Gaps = 1/114 (0%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN-CQAIAD 119
R AS GG + FGG + G+ TY PV+ + YWQ +E+ +I C N CQ I D
Sbjct: 219 RIASSRQGGYLFFGGSSPRYYRGNFTYVPVTHRAYWQVKLEAARIGPLQLCLNGCQVIID 278
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
TGTS + P + +N+ IG P A G V C+ + +P + LGG+ F L
Sbjct: 279 TGTSFLAVPYEQAILINESIGGTPAAYGQFSVPCEQVPHLPTLSFTLGGRRFEL 332
Score = 65.1 bits (157), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 33/68 (48%), Positives = 43/68 (63%), Gaps = 2/68 (2%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASDE 66
AQY+G IS+GTPPQ+F VIFDTGS+NLW+PS+ C +AC + HS SD
Sbjct: 73 AQYFGPISIGTPPQKFNVIFDTGSANLWVPSESCH-QKLACQIHSRYNSRHSR-SYKSDG 130
Query: 67 NGGEIMFG 74
+I +G
Sbjct: 131 KQFDIQYG 138
>gi|406861956|gb|EKD15008.1| aspartic endopeptidase Pep2 [Marssonina brunnea f. sp.
'multigermtubi' MB_m1]
Length = 401
Score = 88.2 bits (217), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 55/150 (36%), Positives = 79/150 (52%), Gaps = 3/150 (2%)
Query: 26 FDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDI 85
+DT S N IP +++N +D+ F + + +E+ E +FGGV+KD F G I
Sbjct: 197 YDTISVN-KIPPPFYNMVNQEL-LDEPVFAFYLGSTDKGEEDQSEAIFGGVNKDHFTGKI 254
Query: 86 TYSPVSRKGYWQFGVESIKI-EKNVYCSNCQAIADTGTSLIIGPSKVIAELNKLIGAVPL 144
T P+ RK YW+ +++I + N I DTGTSLI PS + LNK +GA
Sbjct: 255 TEIPLRRKAYWEVDLDAITFGDATAELENTGVILDTGTSLIALPSTLAELLNKEMGAKKG 314
Query: 145 ANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
NG VDC D +P++ L G FT+T
Sbjct: 315 YNGQYTVDCAKRDSLPDMSFTLSGHEFTIT 344
Score = 62.0 bits (149), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 26/41 (63%), Positives = 32/41 (78%), Gaps = 2/41 (4%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQY+ I++GTPPQ FKV+ DTGSSNLW+PS C +IAC
Sbjct: 84 AQYFSEITIGTPPQTFKVVLDTGSSNLWVPSSQCG--SIAC 122
>gi|260940805|ref|XP_002615242.1| hypothetical protein CLUG_04124 [Clavispora lusitaniae ATCC 42720]
gi|238850532|gb|EEQ39996.1| hypothetical protein CLUG_04124 [Clavispora lusitaniae ATCC 42720]
Length = 406
Score = 88.2 bits (217), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 53/152 (34%), Positives = 81/152 (53%), Gaps = 3/152 (1%)
Query: 24 VIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVG 83
+ +DT S N +P + +I +D +F + N ++ENGG FGG D+ F G
Sbjct: 201 LAYDTISVNKIVPPVYNAIAQ--GLLDAPQFGFYLGDTNKNEENGGVATFGGYDEALFKG 258
Query: 84 DITYSPVSRKGYWQFGVESIKI-EKNVYCSNCQAIADTGTSLIIGPSKVIAELNKLIGAV 142
D+T+ PV RK YW+ + I + ++ + A DTGTSLI PS + +N IGA
Sbjct: 259 DLTWLPVRRKAYWEVSFDGIGLGDEYAELTATGAAIDTGTSLITLPSSLAEIINAKIGAT 318
Query: 143 PLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
+G +VDC D +P++ + G NFTL+
Sbjct: 319 KSWSGQYQVDCATRDNLPDLTLTFAGYNFTLS 350
Score = 60.5 bits (145), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 35/83 (42%), Positives = 46/83 (55%), Gaps = 4/83 (4%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASD- 65
AQY+ I LGTP Q FKVI DTGSSNLW+PS+ CS L D ++ N S+
Sbjct: 90 AQYFTEIQLGTPGQTFKVILDTGSSNLWVPSRDCSSLACFLHTKYDHDESSTYKANGSEF 149
Query: 66 --ENGGEIMFGGVDKDKF-VGDI 85
+ G M G + +D +GD+
Sbjct: 150 SIQYGSGAMEGYISQDVLAIGDL 172
>gi|444725492|gb|ELW66056.1| Gastricsin [Tupaia chinensis]
Length = 389
Score = 88.2 bits (217), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 44/112 (39%), Positives = 66/112 (58%), Gaps = 3/112 (2%)
Query: 65 DENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCSN-CQAIADTG 121
E+GG ++FGGVD + G I ++PV+++ YWQ G+E I + + +CS CQAI DTG
Sbjct: 220 SEDGGAVIFGGVDNSLYSGQIYWAPVTQELYWQIGIEEFLIGGQASGWCSQGCQAIVDTG 279
Query: 122 TSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
TSL+ P + ++ L + GA G V+CDN+ +P I+ G F L
Sbjct: 280 TSLLTVPQQYMSTLLQATGAQEDEYGQFLVNCDNIQSLPTFTFIINGVQFPL 331
Score = 59.3 bits (142), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 24/34 (70%), Positives = 29/34 (85%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHC 40
A Y+G IS+GTPPQ F V+FDTGSSNLW+PS +C
Sbjct: 71 AAYFGEISIGTPPQNFLVLFDTGSSNLWVPSVYC 104
>gi|195110663|ref|XP_001999899.1| GI24781 [Drosophila mojavensis]
gi|193916493|gb|EDW15360.1| GI24781 [Drosophila mojavensis]
Length = 354
Score = 88.2 bits (217), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 49/138 (35%), Positives = 73/138 (52%), Gaps = 10/138 (7%)
Query: 42 ILNIACSVDKDKFVGHSHF-----RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYW 96
L + C + V H F R + GGEI FGG+D+ +++GD+ Y P+SR GYW
Sbjct: 163 FLELLC---QQHLVEHCLFSVYLRRMPGELYGGEITFGGIDESRYLGDLHYVPLSRVGYW 219
Query: 97 QFGVESIKIEKNVYCSNCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNL 156
QF + S+ + + I DTGTS I+ P + EL + IGA+ + NG V C
Sbjct: 220 QFEMSSVAVGNRQVAEKIEVILDTGTSFILLPIDLFEELQRAIGAI-IVNGSNVVSCQA- 277
Query: 157 DKMPNVDIILGGKNFTLT 174
+ + +V +G K FTL+
Sbjct: 278 ENLQDVYFHVGEKKFTLS 295
Score = 66.2 bits (160), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 27/42 (64%), Positives = 34/42 (80%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSV 49
+YYG IS+GTPPQ F+VIFDTGS+N W+PS +C NIAC +
Sbjct: 38 EYYGRISMGTPPQLFRVIFDTGSANTWLPSSNCPESNIACQL 79
>gi|130484814|ref|NP_001076103.1| gastricsin precursor [Oryctolagus cuniculus]
gi|73621389|sp|Q9GMY2.1|PEPC_RABIT RecName: Full=Gastricsin; AltName: Full=Pepsinogen C; Flags:
Precursor
gi|9798668|dbj|BAB11756.1| pepsinogen C [Oryctolagus cuniculus]
Length = 388
Score = 88.2 bits (217), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 43/111 (38%), Positives = 67/111 (60%), Gaps = 3/111 (2%)
Query: 67 NGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCSN-CQAIADTGTS 123
+GG ++ GGVD + GDI ++PV+R+ YWQ G++ I E + +CS CQAI DTGTS
Sbjct: 221 DGGALVLGGVDSSLYTGDIYWAPVTRELYWQIGIDEFLISSEASGWCSQGCQAIVDTGTS 280
Query: 124 LIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
L+ P + +++L + GA G VDCD+ + +P ++ G F L+
Sbjct: 281 LLTVPQEYMSDLLEATGAQENEYGEFLVDCDSTESLPTFTFVINGVEFPLS 331
Score = 56.2 bits (134), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 23/34 (67%), Positives = 28/34 (82%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHC 40
A Y+G IS+GTP Q F V+FDTGSSNLW+PS +C
Sbjct: 71 AAYFGEISIGTPSQNFLVLFDTGSSNLWVPSVYC 104
>gi|440803835|gb|ELR24718.1| aspartic proteinase, partial [Acanthamoeba castellanii str. Neff]
Length = 489
Score = 88.2 bits (217), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 44/100 (44%), Positives = 65/100 (65%), Gaps = 8/100 (8%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV--YC--SNCQA 116
R+ S +GGE++ GGVD+ + GD TY+P++++GYWQF I +C C+A
Sbjct: 237 RDPSGISGGELVLGGVDESHYTGDFTYTPITKEGYWQFLAHDFLINGKSMGFCPAGGCKA 296
Query: 117 IADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNL 156
IADTGTSL+ GPSK++A++NK+I A G + +CD L
Sbjct: 297 IADTGTSLLAGPSKIVAQINKMIN----ATGILESECDML 332
Score = 63.9 bits (154), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 27/43 (62%), Positives = 31/43 (72%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSV 49
AQYYG IS+G PPQ F V+ DTGSSNLW+PS C IAC +
Sbjct: 90 AQYYGEISIGNPPQYFNVVLDTGSSNLWVPSIQCPWYEIACDL 132
>gi|390601248|gb|EIN10642.1| endopeptidase [Punctularia strigosozonata HHB-11173 SS5]
Length = 412
Score = 88.2 bits (217), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 57/154 (37%), Positives = 81/154 (52%), Gaps = 8/154 (5%)
Query: 26 FDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFR-NASDENGGEIMFGGVDKDKFVGD 84
+DT S N P + S++N A +D F FR +S+E+GGE +FGG+D + G
Sbjct: 212 YDTISVNHITPPFY-SMINAAL-LDDPVF----SFRLGSSEEDGGEAVFGGIDSSAYEGK 265
Query: 85 ITYSPVSRKGYWQFGVESIKI-EKNVYCSNCQAIADTGTSLIIGPSKVIAELNKLIGAVP 143
ITY PV RK YW+ +E IK + + N A DTGTSLI P+ + LN IGA
Sbjct: 266 ITYVPVRRKAYWEVELEKIKFGDDELELENTGAAIDTGTSLIALPTDLAEMLNAQIGATK 325
Query: 144 LANGPAKVDCDNLDKMPNVDIILGGKNFTLTRTN 177
NG V+C + +P + G+ + L T+
Sbjct: 326 SWNGQYTVECSKVPDLPELSFYFDGQAYPLKGTD 359
Score = 66.6 bits (161), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 40/86 (46%), Positives = 53/86 (61%), Gaps = 8/86 (9%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDK--DKFVGHSHFRNAS 64
AQY+ I+LGTPPQ FKVI DTGSSNLW+PS C+ +IAC + + D S+ N S
Sbjct: 99 AQYFSEITLGTPPQSFKVILDTGSSNLWVPSVKCT--SIACFLHQKYDSSQSSSYKANGS 156
Query: 65 D---ENGGEIMFGGVDKDKF-VGDIT 86
+ + G M G V +D +GD+T
Sbjct: 157 EFSIQYGSGSMEGFVSRDTLTIGDLT 182
>gi|431910410|gb|ELK13483.1| Pepsin F [Pteropus alecto]
Length = 487
Score = 88.2 bits (217), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 41/103 (39%), Positives = 68/103 (66%), Gaps = 2/103 (1%)
Query: 71 IMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-KNVYC-SNCQAIADTGTSLIIGP 128
++FGGVD + GD+ + PV+++G+WQ ++S+ + K + C S CQAI DTGTSL+IGP
Sbjct: 327 VLFGGVDPSYYKGDLKWVPVTQRGFWQISMDSVSMNGKIIACKSGCQAIVDTGTSLVIGP 386
Query: 129 SKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNF 171
+ I E+ K I A P+++G V+C ++D +P+ + G +
Sbjct: 387 NDPILEILKTINAQPISSGEYIVNCSSVDTLPDTIFTINGIKY 429
Score = 61.2 bits (147), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 25/33 (75%), Positives = 30/33 (90%)
Query: 9 YYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCS 41
Y G IS+GTPPQEFKVIFDTGS++LW+PS +CS
Sbjct: 176 YVGNISIGTPPQEFKVIFDTGSADLWVPSIYCS 208
>gi|168033581|ref|XP_001769293.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162679399|gb|EDQ65847.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 485
Score = 88.2 bits (217), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 42/91 (46%), Positives = 61/91 (67%), Gaps = 10/91 (10%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSNCQAIADT 120
R++ + +GGE++FGGVD D F G+ TY+PV+RKGYWQF + C AIAD+
Sbjct: 223 RDSDESDGGELVFGGVDPDHFKGEHTYTPVTRKGYWQFDLG----------DGCSAIADS 272
Query: 121 GTSLIIGPSKVIAELNKLIGAVPLANGPAKV 151
GTSL+ GPS ++AE+N+ IGA + + K+
Sbjct: 273 GTSLLAGPSGIVAEINQAIGATGIVSQQCKM 303
Score = 65.1 bits (157), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 38/100 (38%), Positives = 54/100 (54%), Gaps = 6/100 (6%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASDE 66
AQY+G I +G+PPQ F VIFDTGSSNLW+PS C L++AC + ++ D
Sbjct: 77 AQYFGEIGIGSPPQPFSVIFDTGSSNLWVPSAKC-YLSLACYFHR-RYKSSKSSTYKEDG 134
Query: 67 NGGEIMFGGVDKDKFVG--DITYSPVSRKGYWQFGVESIK 104
I +G + F+ D+T ++ K WQ E+ K
Sbjct: 135 TSFAIQYGTGSMEGFLSQDDVTLGDLTVK--WQVFAEATK 172
Score = 43.1 bits (100), Expect = 0.048, Method: Compositional matrix adjust.
Identities = 20/45 (44%), Positives = 28/45 (62%)
Query: 130 KVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
++ A LN+L +P NG + VDC+ L MPNV + GK+F LT
Sbjct: 381 EIDAYLNQLCERLPSPNGESAVDCNALSYMPNVGFTIAGKSFELT 425
>gi|217038345|gb|ACJ76637.1| pepsinogen C (predicted) [Oryctolagus cuniculus]
Length = 391
Score = 88.2 bits (217), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 43/111 (38%), Positives = 67/111 (60%), Gaps = 3/111 (2%)
Query: 67 NGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCSN-CQAIADTGTS 123
+GG ++ GGVD + GDI ++PV+R+ YWQ G++ I E + +CS CQAI DTGTS
Sbjct: 224 DGGALVLGGVDSSLYTGDIYWAPVTRELYWQIGIDEFLISSEASGWCSQGCQAIVDTGTS 283
Query: 124 LIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
L+ P + +++L + GA G VDCD+ + +P ++ G F L+
Sbjct: 284 LLTVPQEYMSDLLEATGAQENEYGEFLVDCDSTESLPTFTFVINGVEFPLS 334
Score = 55.8 bits (133), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 23/34 (67%), Positives = 28/34 (82%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHC 40
A Y+G IS+GTP Q F V+FDTGSSNLW+PS +C
Sbjct: 74 AAYFGEISIGTPSQNFLVLFDTGSSNLWVPSVYC 107
>gi|148222689|ref|NP_001079037.1| pepsinogen 4, group I (pepsinogen A) precursor [Xenopus laevis]
gi|12082176|dbj|BAB20798.1| pepsinogen A [Xenopus laevis]
gi|213623162|gb|AAI69372.1| Pepsinogen A [Xenopus laevis]
gi|213626396|gb|AAI69374.1| Pepsinogen A [Xenopus laevis]
Length = 384
Score = 88.2 bits (217), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 43/119 (36%), Positives = 70/119 (58%), Gaps = 2/119 (1%)
Query: 58 SHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVY-CS-NCQ 115
S + ++ + G ++FGGVD + G + + P++ + YWQ ++S+ I V CS +CQ
Sbjct: 210 SVYLSSDGQTGSYVLFGGVDNSYYSGSLNWVPLTAETYWQITLDSVSINGQVIACSQSCQ 269
Query: 116 AIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
AI DTGTSL+ GPS IA + IGA +NG ++C+N+ MP + + G + L+
Sbjct: 270 AIVDTGTSLMTGPSTPIANIQNYIGASQDSNGQYVINCNNISNMPTIVFTINGVQYPLS 328
Score = 67.4 bits (163), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 35/76 (46%), Positives = 47/76 (61%), Gaps = 3/76 (3%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASDEN 67
+YYGTIS+GTPPQEF VIFDTGS+NLW+PS +CS + ACS + ++F +
Sbjct: 71 EYYGTISIGTPPQEFTVIFDTGSANLWVPSVYCS--SQACS-NHNRFNPQQSSTFQATNT 127
Query: 68 GGEIMFGGVDKDKFVG 83
I +G F+G
Sbjct: 128 PVSIQYGTGSMSGFLG 143
>gi|301030231|gb|ADK47877.1| cathepsin D [Triatoma infestans]
Length = 390
Score = 88.2 bits (217), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 47/118 (39%), Positives = 73/118 (61%), Gaps = 6/118 (5%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGD-ITYSPVSRKGYWQFGVESIKI---EKNVYCSN-CQ 115
RN +E GGEI+FGGVD++ + + +T P++ YW F ++ I + +C N C
Sbjct: 214 RNPDEEVGGEIIFGGVDEELYNKESMTTVPLTSTSYWMFQMDGISTSAEDGTSWCQNGCP 273
Query: 116 AIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
IADTGTS I+GPS + E+ +L+GA + G V CD+LDK+P++ + GK +T+
Sbjct: 274 GIADTGTSFIVGPSSDVDEIMELVGA-EVYQGIGFVSCDDLDKLPDITFHINGKGYTI 330
Score = 77.4 bits (189), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 38/72 (52%), Positives = 50/72 (69%), Gaps = 6/72 (8%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHF 60
+R+S+ QYYG I+LGTPPQEF VIFDTGSSNLWIPS CS ++AC V + H
Sbjct: 63 LRNSFNTQYYGNITLGTPPQEFTVIFDTGSSNLWIPSAVCS--SVACRVHNT----YDHD 116
Query: 61 RNASDENGGEIM 72
R+++ + G I+
Sbjct: 117 RSSTYQPDGRIL 128
>gi|256274192|gb|EEU09100.1| Pep4p [Saccharomyces cerevisiae JAY291]
Length = 405
Score = 88.2 bits (217), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 49/126 (38%), Positives = 70/126 (55%), Gaps = 1/126 (0%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI-EK 107
+D+ +F + + ENGGE FGG+D+ KF GDIT+ PV RK YW+ E I + ++
Sbjct: 223 LDEKRFAFYLGDTSKDTENGGEATFGGIDESKFKGDITWLPVRRKAYWEVKFEGIGLGDE 282
Query: 108 NVYCSNCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILG 167
+ A DTGTSLI PS + +N IGA G +DC+ D +P++ L
Sbjct: 283 YAELESHGAAIDTGTSLITLPSGLAEMINAEIGAKKGWTGQYTLDCNTRDNLPDLIFNLN 342
Query: 168 GKNFTL 173
G NFT+
Sbjct: 343 GYNFTI 348
Score = 63.9 bits (154), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 36/84 (42%), Positives = 47/84 (55%), Gaps = 4/84 (4%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASD- 65
AQYY I+LGTPPQ FKVI DTGSSNLW+PS C L D S+ N ++
Sbjct: 89 AQYYTDITLGTPPQNFKVILDTGSSNLWVPSNECGSLACFLHSKYDHEASSSYKANGTEF 148
Query: 66 --ENGGEIMFGGVDKDKF-VGDIT 86
+ G + G + +D +GD+T
Sbjct: 149 AIQYGTGSLEGYISQDTLSIGDLT 172
>gi|459426|emb|CAA54478.1| aspartic protease [Brassica oleracea]
Length = 292
Score = 88.2 bits (217), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 47/98 (47%), Positives = 60/98 (61%), Gaps = 3/98 (3%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYC-SNCQAI 117
RNA DE GGE++FGGVD F G Y PV++KGYWQF + + I YC S C AI
Sbjct: 12 RNADDEEGGELVFGGVDPKHFKGQHIYVPVTQKGYWQFDMGDVLIGGAPTGYCESGCSAI 71
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDN 155
AD+GTSL+ GP+ +I +N IGA +A+ K D
Sbjct: 72 ADSGTSLLAGPTTIITMINHAIGASGVASQQCKTVVDQ 109
Score = 38.9 bits (89), Expect = 0.85, Method: Compositional matrix adjust.
Identities = 42/184 (22%), Positives = 73/184 (39%), Gaps = 23/184 (12%)
Query: 11 GTISLGTPPQEF-----KVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASD 65
G + +G P + I D+G+S L P+ +++N A V + D
Sbjct: 52 GDVLIGGAPTGYCESGCSAIADSGTSLLAGPTTIITMINHAIGASG---VASQQCKTVVD 108
Query: 66 ENGG---EIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV---------YCSN 113
+ G +++ K I K G+ES+ ++N CS
Sbjct: 109 QYGQTILDLLLSETQPKKICSQIGLCTFDGKRGVSMGIESVVDKENAKLSNGVGDAACSA 168
Query: 114 CQ-AIADTGTSLI--IGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKN 170
C+ A+ + L + +++ +N+L +P G + VDC L MP V + +GGK
Sbjct: 169 CEMAVVWIQSQLRQNMTQERILDYVNELCRRIPSPMGESAVDCAQLSTMPTVSLTIGGKV 228
Query: 171 FTLT 174
F L
Sbjct: 229 FDLA 232
>gi|378731872|gb|EHY58331.1| vacuolar protease A [Exophiala dermatitidis NIH/UT8656]
Length = 398
Score = 88.2 bits (217), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 47/117 (40%), Positives = 63/117 (53%), Gaps = 1/117 (0%)
Query: 59 HFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVY-CSNCQAI 117
+ N +D E FGG+DKD + G + P+ RK YW+ +++I K N I
Sbjct: 226 YLGNTNDGTESEATFGGIDKDHYTGKMVKIPLRRKAYWEVNLDAITFGKETADLDNTGVI 285
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSLI PS + LNK IGA NG V+CD D +P+V L G NF++T
Sbjct: 286 LDTGTSLIALPSTLAELLNKEIGAKKGFNGQYTVECDKRDSLPDVSFTLSGYNFSIT 342
Score = 67.4 bits (163), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 32/61 (52%), Positives = 43/61 (70%), Gaps = 4/61 (6%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDK--DKFVGHSHFRNAS 64
AQY+ TI+LGTPPQEFKV+ DTGSSNLW+PS C +IAC + + D ++ +N S
Sbjct: 84 AQYFSTIALGTPPQEFKVVLDTGSSNLWVPSSECG--SIACYLHQKYDSSASSTYKKNGS 141
Query: 65 D 65
+
Sbjct: 142 E 142
>gi|356547093|ref|XP_003541952.1| PREDICTED: LOW QUALITY PROTEIN: cyprosin-like, partial [Glycine
max]
Length = 470
Score = 87.8 bits (216), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 43/93 (46%), Positives = 61/93 (65%), Gaps = 3/93 (3%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV--YC-SNCQAI 117
RN ++ GG+I+FGGVD D + G+ TY PV+ KGYWQ + + I++ +C S C AI
Sbjct: 191 RNTDEKQGGQIVFGGVDSDHYXGEHTYVPVTHKGYWQTEIGDVLIDRKTTEFCASKCSAI 250
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAK 150
D+GTSL+ GP+ IA++N IGAV + N K
Sbjct: 251 DDSGTSLLAGPTGAIAQINHAIGAVGVVNQECK 283
Score = 57.8 bits (138), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 33/80 (41%), Positives = 50/80 (62%), Gaps = 15/80 (18%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHF 60
+++ AQY+G I +GTP Q+F VIFDTGSSNLW+PS C ++AC + HS +
Sbjct: 52 LKNYMNAQYFGEIGIGTP-QKFTVIFDTGSSNLWVPSSKC-YFSVACYL-------HSRY 102
Query: 61 RNA----SDENG--GEIMFG 74
+++ ++NG EI +G
Sbjct: 103 KSSQSSTQNKNGSSAEIRYG 122
Score = 39.7 bits (91), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 27/89 (30%), Positives = 44/89 (49%), Gaps = 12/89 (13%)
Query: 98 FGVESI---KIEKNVY------CSNCQ-AIADTGTSLIIGPS--KVIAELNKLIGAVPLA 145
G++S+ IEK Y C+ C+ A+ L + + +++ N L +P
Sbjct: 322 MGIQSVVDKTIEKTSYSWNDAGCTACEMAVVWIKNPLRLNETEDQILDYANALCDMLPSP 381
Query: 146 NGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
NG + V+C L +MPNV +GGK F L+
Sbjct: 382 NGESVVECSTLSEMPNVSFTIGGKVFELS 410
>gi|449549767|gb|EMD40732.1| hypothetical protein CERSUDRAFT_44393 [Ceriporiopsis subvermispora
B]
Length = 413
Score = 87.8 bits (216), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 47/120 (39%), Positives = 69/120 (57%), Gaps = 2/120 (1%)
Query: 60 FR-NASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI-EKNVYCSNCQAI 117
FR +S+E+GGE +FGGVD+ + G I Y PV RK YW+ ++ I + + + N A
Sbjct: 240 FRLGSSEEDGGEAIFGGVDESAYTGKIDYVPVRRKAYWEVELQKISLGDDELELENTGAA 299
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLTRTN 177
DTGTSLI PS + LN IGA NG V+C+ + +P++ GK++ L T+
Sbjct: 300 IDTGTSLIALPSDMAEMLNTQIGAKRSWNGQYTVECEKVPDLPDLTFTFDGKDYPLKGTD 359
Score = 63.2 bits (152), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 27/41 (65%), Positives = 33/41 (80%), Gaps = 2/41 (4%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQY+ I+LGTPPQ FKV+ DTGSSNLW+PS C+ +IAC
Sbjct: 99 AQYFAEITLGTPPQSFKVVLDTGSSNLWVPSTKCT--SIAC 137
>gi|334186351|ref|NP_001190671.1| phytepsin [Arabidopsis thaliana]
gi|332656991|gb|AEE82391.1| phytepsin [Arabidopsis thaliana]
Length = 504
Score = 87.8 bits (216), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 47/101 (46%), Positives = 61/101 (60%), Gaps = 3/101 (2%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCSN-CQAI 117
RN D GGEI+FGGVD F G+ T+ PV+ KGYWQF + ++I + YC+ C AI
Sbjct: 231 RNPKDPEGGEIVFGGVDPKHFKGEHTFVPVTHKGYWQFDMGDLQIAGKPTGYCAKGCSAI 290
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDK 158
AD+GTSL+ GPS VI +N IGA + + K D K
Sbjct: 291 ADSGTSLLTGPSTVITMINHAIGAQGIVSRECKAVVDQYGK 331
Score = 67.8 bits (164), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 30/47 (63%), Positives = 38/47 (80%), Gaps = 1/47 (2%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+++ AQYYG I++GTPPQ+F VIFDTGSSNLWIPS C L++AC
Sbjct: 79 LKNYLDAQYYGDITIGTPPQKFTVIFDTGSSNLWIPSTKC-YLSVAC 124
>gi|222425186|dbj|BAH20542.1| pepsinogen A-35 [Pongo abelii]
Length = 388
Score = 87.8 bits (216), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 46/127 (36%), Positives = 74/127 (58%), Gaps = 5/127 (3%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-K 107
V +D F S + +A D++G ++FGG+D + G + + PV+ +GYWQ V+SI + K
Sbjct: 208 VSQDLF---SVYLSADDKSGSVVIFGGIDSSYYTGSLNWVPVTVEGYWQITVDSITMNGK 264
Query: 108 NVYCSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
+ C+ CQAI DTGTSL+ GP+ IA + IGA ++G V C + +P++ +
Sbjct: 265 TIACAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASENSDGDMVVSCSAISSLPDIVFTI 324
Query: 167 GGKNFTL 173
G + L
Sbjct: 325 NGVQYPL 331
Score = 56.2 bits (134), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 21/34 (61%), Positives = 29/34 (85%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHC 40
+Y+G+I +GTP Q+F V+FDTGSSNLW+PS +C
Sbjct: 74 VEYFGSIGIGTPAQDFTVVFDTGSSNLWVPSVYC 107
>gi|15233518|ref|NP_192355.1| phytepsin [Arabidopsis thaliana]
gi|75338508|sp|Q9XEC4.1|APA3_ARATH RecName: Full=Aspartic proteinase A3; Flags: Precursor
gi|4773885|gb|AAD29758.1|AF076243_5 putative aspartic protease [Arabidopsis thaliana]
gi|13937238|gb|AAK50111.1|AF372974_1 AT4g04460/T26N6_7 [Arabidopsis thaliana]
gi|7267203|emb|CAB77914.1| putative aspartic protease [Arabidopsis thaliana]
gi|332656990|gb|AEE82390.1| phytepsin [Arabidopsis thaliana]
Length = 508
Score = 87.8 bits (216), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 47/101 (46%), Positives = 61/101 (60%), Gaps = 3/101 (2%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCSN-CQAI 117
RN D GGEI+FGGVD F G+ T+ PV+ KGYWQF + ++I + YC+ C AI
Sbjct: 231 RNPKDPEGGEIVFGGVDPKHFKGEHTFVPVTHKGYWQFDMGDLQIAGKPTGYCAKGCSAI 290
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDK 158
AD+GTSL+ GPS VI +N IGA + + K D K
Sbjct: 291 ADSGTSLLTGPSTVITMINHAIGAQGIVSRECKAVVDQYGK 331
Score = 67.8 bits (164), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 30/47 (63%), Positives = 38/47 (80%), Gaps = 1/47 (2%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+++ AQYYG I++GTPPQ+F VIFDTGSSNLWIPS C L++AC
Sbjct: 79 LKNYLDAQYYGDITIGTPPQKFTVIFDTGSSNLWIPSTKC-YLSVAC 124
>gi|440893605|gb|ELR46308.1| Pepsin A, partial [Bos grunniens mutus]
Length = 388
Score = 87.8 bits (216), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 43/125 (34%), Positives = 77/125 (61%), Gaps = 5/125 (4%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-K 107
V +D F S + ++++E+G ++FG +D + G + + PVS +GYWQ V+SI + +
Sbjct: 208 VSQDLF---SVYLSSNEESGSVVIFGDIDSSYYSGSLNWVPVSVEGYWQITVDSITMNGE 264
Query: 108 NVYCSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
++ CS+ CQAI DTGTSL+ GP+ I+ + IGA ++G + C ++D +P++ +
Sbjct: 265 SIACSDGCQAIVDTGTSLLAGPTTAISNIQSYIGASEDSSGEVVISCSSIDSLPDIVFTI 324
Query: 167 GGKNF 171
G +
Sbjct: 325 NGVQY 329
Score = 59.7 bits (143), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 24/41 (58%), Positives = 33/41 (80%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCS 41
+++ +Y+GTI +GTP Q+F VIFDTGSSNLW+PS +CS
Sbjct: 68 LQNYLDTEYFGTIGIGTPAQDFTVIFDTGSSNLWVPSIYCS 108
>gi|73620983|sp|P00792.2|PEPA_BOVIN RecName: Full=Pepsin A; Flags: Precursor
gi|24415088|emb|CAD55693.1| pepsinogen A [synthetic construct]
gi|37622272|gb|AAQ95219.1| pepsinogen A [Bos taurus]
Length = 372
Score = 87.8 bits (216), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 43/125 (34%), Positives = 77/125 (61%), Gaps = 5/125 (4%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-K 107
V +D F S + ++++E+G ++FG +D + G + + PVS +GYWQ V+SI + +
Sbjct: 192 VSQDLF---SVYLSSNEESGSVVIFGDIDSSYYSGSLNWVPVSVEGYWQITVDSITMNGE 248
Query: 108 NVYCSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
++ CS+ CQAI DTGTSL+ GP+ I+ + IGA ++G + C ++D +P++ +
Sbjct: 249 SIACSDGCQAIVDTGTSLLAGPTTAISNIQSYIGASEDSSGEVVISCSSIDSLPDIVFTI 308
Query: 167 GGKNF 171
G +
Sbjct: 309 NGVQY 313
Score = 59.7 bits (143), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 24/41 (58%), Positives = 33/41 (80%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCS 41
+++ +Y+GTI +GTP Q+F VIFDTGSSNLW+PS +CS
Sbjct: 52 LQNYLDTEYFGTIGIGTPAQDFTVIFDTGSSNLWVPSIYCS 92
>gi|448517372|ref|XP_003867779.1| vacuolar aspartic proteinase [Candida orthopsilosis Co 90-125]
gi|380352118|emb|CCG22342.1| vacuolar aspartic proteinase [Candida orthopsilosis]
Length = 429
Score = 87.8 bits (216), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 54/152 (35%), Positives = 80/152 (52%), Gaps = 3/152 (1%)
Query: 24 VIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVG 83
+ +DT S N +P + +I +D +F + N +E+GG FGG D+ F G
Sbjct: 223 LAYDTISVNRIVPPIYNAINQEL--LDSPQFGFYLGDTNKDEEDGGVATFGGYDESLFQG 280
Query: 84 DITYSPVSRKGYWQFGVESIKI-EKNVYCSNCQAIADTGTSLIIGPSKVIAELNKLIGAV 142
IT+ PV RK YW+ E I + ++ + A DTGTSLI PS + +N IGA
Sbjct: 281 KITWLPVRRKAYWEVSFEGIGLGDEYAELTKTGAAIDTGTSLITLPSSLAEIINAKIGAT 340
Query: 143 PLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
+G +VDC D++P++ + G NFTLT
Sbjct: 341 KSWSGQYQVDCAKRDELPDLTLTFSGYNFTLT 372
Score = 59.3 bits (142), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 34/83 (40%), Positives = 46/83 (55%), Gaps = 4/83 (4%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASD- 65
AQY+ I +GTP Q FKVI DTGSSNLW+PSQ C+ L D ++ N S+
Sbjct: 112 AQYFTEIQIGTPGQTFKVILDTGSSNLWVPSQDCTSLACFLHSKYDHDASSTYKANGSEF 171
Query: 66 --ENGGEIMFGGVDKDKF-VGDI 85
+ G M G + +D +GD+
Sbjct: 172 SIQYGSGSMEGYISQDTVSIGDL 194
>gi|56971217|gb|AAH88066.1| pga5-prov protein, partial [Xenopus (Silurana) tropicalis]
Length = 382
Score = 87.8 bits (216), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 44/121 (36%), Positives = 71/121 (58%), Gaps = 2/121 (1%)
Query: 58 SHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVY-CS-NCQ 115
S + ++ ++G ++FGGVD + G + + P++ + YWQ ++SI I V CS +CQ
Sbjct: 208 SVYLSSDGQSGSYVLFGGVDTSYYSGSLNWVPLTAETYWQITLDSISINGQVIACSQSCQ 267
Query: 116 AIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLTR 175
AI DTGTSL+ GP+ IA + IGA +NG ++C+N+ MP + + G + L
Sbjct: 268 AIVDTGTSLMTGPTTPIANIQYYIGASQDSNGQYVINCNNISNMPTIVFTINGVQYPLPP 327
Query: 176 T 176
T
Sbjct: 328 T 328
Score = 66.6 bits (161), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 34/76 (44%), Positives = 47/76 (61%), Gaps = 3/76 (3%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASDEN 67
+YYGTIS+GTPPQEF VIFDTGS+NLW+PS +CS + AC+ + ++F +
Sbjct: 69 EYYGTISIGTPPQEFTVIFDTGSANLWVPSVYCS--SSACT-NHNRFNPQQSTTFQATNT 125
Query: 68 GGEIMFGGVDKDKFVG 83
I +G F+G
Sbjct: 126 PVSIQYGTGSMSGFLG 141
>gi|151553998|gb|AAI49645.1| PGA5 protein [Bos taurus]
Length = 381
Score = 87.8 bits (216), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 43/125 (34%), Positives = 77/125 (61%), Gaps = 5/125 (4%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-K 107
V +D F S + ++++E+G ++FG +D + G + + PVS +GYWQ V+SI + +
Sbjct: 201 VSQDLF---SVYLSSNEESGSVVIFGDIDSSYYSGSLNWVPVSVEGYWQITVDSITMNGE 257
Query: 108 NVYCSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
++ CS+ CQAI DTGTSL+ GP+ I+ + IGA ++G + C ++D +P++ +
Sbjct: 258 SIACSDGCQAIVDTGTSLLAGPTTAISNIQSYIGASEDSSGEVVISCSSIDSLPDIVFTI 317
Query: 167 GGKNF 171
G +
Sbjct: 318 NGVQY 322
Score = 59.7 bits (143), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 24/41 (58%), Positives = 33/41 (80%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCS 41
+++ +Y+GTI +GTP Q+F VIFDTGSSNLW+PS +CS
Sbjct: 61 LQNYLDTEYFGTIGIGTPAQDFTVIFDTGSSNLWVPSIYCS 101
>gi|354493821|ref|XP_003509038.1| PREDICTED: gastricsin-like [Cricetulus griseus]
gi|344238302|gb|EGV94405.1| Gastricsin [Cricetulus griseus]
Length = 391
Score = 87.8 bits (216), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 44/111 (39%), Positives = 69/111 (62%), Gaps = 3/111 (2%)
Query: 67 NGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV--YCSN-CQAIADTGTS 123
NGG+I+FGGVD++ + G+IT+ PV+++ YWQ ++ I V +CS C I DTGTS
Sbjct: 224 NGGQIVFGGVDENLYTGEITWIPVTQELYWQITIDDFLIGDQVSGWCSQGCAGIVDTGTS 283
Query: 124 LIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
L+ PS+ +++L + IGA G V CD++ +P + +L G F L+
Sbjct: 284 LLTMPSQYLSDLLQTIGAQEGEYGEYFVSCDSVSSLPTFNFVLNGVEFPLS 334
Score = 58.9 bits (141), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 24/34 (70%), Positives = 29/34 (85%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHC 40
A Y+G IS+GTPPQ F V+FDTGSSNLW+PS +C
Sbjct: 74 AAYFGEISIGTPPQNFLVLFDTGSSNLWVPSIYC 107
>gi|292658855|ref|NP_001001600.2| pepsin A preproprotein [Bos taurus]
Length = 386
Score = 87.8 bits (216), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 43/125 (34%), Positives = 77/125 (61%), Gaps = 5/125 (4%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-K 107
V +D F S + ++++E+G ++FG +D + G + + PVS +GYWQ V+SI + +
Sbjct: 206 VSQDLF---SVYLSSNEESGSVVIFGDIDSSYYSGSLNWVPVSVEGYWQITVDSITMNGE 262
Query: 108 NVYCSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
++ CS+ CQAI DTGTSL+ GP+ I+ + IGA ++G + C ++D +P++ +
Sbjct: 263 SIACSDGCQAIVDTGTSLLAGPTTAISNIQSYIGASEDSSGEVVISCSSIDSLPDIVFTI 322
Query: 167 GGKNF 171
G +
Sbjct: 323 NGVQY 327
Score = 59.7 bits (143), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 24/41 (58%), Positives = 33/41 (80%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCS 41
+++ +Y+GTI +GTP Q+F VIFDTGSSNLW+PS +CS
Sbjct: 66 LQNYLDTEYFGTIGIGTPAQDFTVIFDTGSSNLWVPSIYCS 106
>gi|14278413|pdb|1G0V|A Chain A, The Structure Of Proteinase A Complexed With A Ia3 Mutant,
Mvv
Length = 329
Score = 87.8 bits (216), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 48/126 (38%), Positives = 70/126 (55%), Gaps = 1/126 (0%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI-EK 107
+D+ +F + + ENGGE FGG+D+ KF GDIT+ PV RK YW+ E I + ++
Sbjct: 147 LDEKRFAFYLGDTSKDTENGGEATFGGIDESKFKGDITWLPVRRKAYWEVKFEGIGLGDE 206
Query: 108 NVYCSNCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILG 167
+ A DTGTSLI PS + +N IGA + G +DC+ D +P++
Sbjct: 207 YAELESHGAAIDTGTSLITLPSGLAEMINAEIGAKKGSTGQYTLDCNTRDNLPDLIFNFN 266
Query: 168 GKNFTL 173
G NFT+
Sbjct: 267 GYNFTI 272
Score = 63.9 bits (154), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 36/84 (42%), Positives = 47/84 (55%), Gaps = 4/84 (4%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASD- 65
AQYY I+LGTPPQ FKVI DTGSSNLW+PS C L D S+ N ++
Sbjct: 13 AQYYTDITLGTPPQNFKVILDTGSSNLWVPSNECGSLACFLHSKYDHEASSSYKANGTEF 72
Query: 66 --ENGGEIMFGGVDKDKF-VGDIT 86
+ G + G + +D +GD+T
Sbjct: 73 AIQYGTGSLEGYISQDTLSIGDLT 96
>gi|224458278|ref|NP_001138942.1| pepsinogen A precursor [Pongo abelii]
gi|222425178|dbj|BAH20538.1| pepsinogen A-75 [Pongo abelii]
Length = 388
Score = 87.8 bits (216), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 46/127 (36%), Positives = 74/127 (58%), Gaps = 5/127 (3%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-K 107
V +D F S + +A D++G ++FGG+D + G + + PV+ +GYWQ V+SI + K
Sbjct: 208 VSQDLF---SVYLSADDKSGSVVIFGGIDSSYYTGSLNWVPVTVEGYWQITVDSITMNGK 264
Query: 108 NVYCSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
+ C+ CQAI DTGTSL+ GP+ IA + IGA ++G V C + +P++ +
Sbjct: 265 TIACAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASENSDGDMVVSCSAISSLPDIVFTI 324
Query: 167 GGKNFTL 173
G + L
Sbjct: 325 NGVQYPL 331
Score = 58.5 bits (140), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 26/48 (54%), Positives = 37/48 (77%), Gaps = 3/48 (6%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKF 54
+Y+GTI +GTP Q+F V+FDTGSSNLW+PS +C ++AC +D + F
Sbjct: 74 VEYFGTIGIGTPAQDFTVVFDTGSSNLWVPSVYC--YSLAC-MDHNLF 118
>gi|296471634|tpg|DAA13749.1| TPA: pepsin A precursor [Bos taurus]
Length = 367
Score = 87.8 bits (216), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 43/125 (34%), Positives = 77/125 (61%), Gaps = 5/125 (4%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-K 107
V +D F S + ++++E+G ++FG +D + G + + PVS +GYWQ V+SI + +
Sbjct: 192 VSQDLF---SVYLSSNEESGSVVIFGDIDSSYYSGSLNWVPVSVEGYWQITVDSITMNGE 248
Query: 108 NVYCSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
++ CS+ CQAI DTGTSL+ GP+ I+ + IGA ++G + C ++D +P++ +
Sbjct: 249 SIACSDGCQAIVDTGTSLLAGPTTAISNIQSYIGASEDSSGEVVISCSSIDSLPDIVFTI 308
Query: 167 GGKNF 171
G +
Sbjct: 309 NGVQY 313
Score = 59.7 bits (143), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 24/41 (58%), Positives = 33/41 (80%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCS 41
+++ +Y+GTI +GTP Q+F VIFDTGSSNLW+PS +CS
Sbjct: 52 LQNYLDTEYFGTIGIGTPAQDFTVIFDTGSSNLWVPSIYCS 92
>gi|222425180|dbj|BAH20539.1| pepsinogen A-43 [Pongo abelii]
Length = 388
Score = 87.8 bits (216), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 46/127 (36%), Positives = 74/127 (58%), Gaps = 5/127 (3%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-K 107
V +D F S + +A D++G ++FGG+D + G + + PV+ +GYWQ V+SI + K
Sbjct: 208 VSQDLF---SVYLSADDKSGSVVIFGGIDSSYYTGSLNWVPVTVEGYWQITVDSITMNGK 264
Query: 108 NVYCSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
+ C+ CQAI DTGTSL+ GP+ IA + IGA ++G V C + +P++ +
Sbjct: 265 TIACAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASENSDGDMVVSCSAISSLPDIVFTI 324
Query: 167 GGKNFTL 173
G + L
Sbjct: 325 NGVQYPL 331
Score = 57.4 bits (137), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 25/47 (53%), Positives = 37/47 (78%), Gaps = 3/47 (6%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKF 54
+Y+G+I +GTP Q+F V+FDTGSSNLW+PS +C ++AC +D + F
Sbjct: 75 EYFGSIGIGTPAQDFTVVFDTGSSNLWVPSVYC--YSLAC-MDHNLF 118
>gi|47198554|emb|CAF89152.1| unnamed protein product [Tetraodon nigroviridis]
Length = 301
Score = 87.8 bits (216), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 51/157 (32%), Positives = 77/157 (49%), Gaps = 32/157 (20%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKN 108
V+K+ F + + RN + GGE++ GG D + GD +Y V+R+ YWQ V+ + +
Sbjct: 137 VEKNVFSFYLN-RNPQTQPGGELLLGGTDPQYYTGDFSYVNVTRQAYWQIHVDELSVGSQ 195
Query: 109 V-YC-SNCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAK---------------- 150
+ C S C+AI DTGTSL+ GPS+ + L K IGA+PL G
Sbjct: 196 LTLCKSGCEAIVDTGTSLLTGPSEEVRSLQKAIGALPLIQGEVSCGRVGPPVFGPSVVCG 255
Query: 151 -------------VDCDNLDKMPNVDIILGGKNFTLT 174
V CD + +P + +GGK ++LT
Sbjct: 256 RVKSPVVFGFQYMVSCDKIPTLPVITFNIGGKPYSLT 292
Score = 77.0 bits (188), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 32/45 (71%), Positives = 38/45 (84%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDK 51
AQYYG I LGTPPQ F V+FDTGSSNLW+PS HCS+L+IAC + +
Sbjct: 1 AQYYGEIGLGTPPQPFTVVFDTGSSNLWVPSVHCSLLDIACLLHR 45
>gi|344234771|gb|EGV66639.1| Asp-domain-containing protein [Candida tenuis ATCC 10573]
Length = 425
Score = 87.8 bits (216), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 54/152 (35%), Positives = 81/152 (53%), Gaps = 3/152 (1%)
Query: 24 VIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVG 83
+ +DT S N +P + +I +D+ F + N +E+GG FGG D+ KF G
Sbjct: 220 LAYDTISVNKIVPPVYNAINQ--GLLDEKSFAFYLGDTNKDEEDGGVATFGGYDESKFTG 277
Query: 84 DITYSPVSRKGYWQFGVESIKI-EKNVYCSNCQAIADTGTSLIIGPSKVIAELNKLIGAV 142
IT+ PV RK YW+ +E + + ++ + A DTGTSLI PS + +N IGAV
Sbjct: 278 KITWLPVRRKAYWEVSLEGLGLGDEFAELKSTGAAIDTGTSLITLPSSLAEIINAKIGAV 337
Query: 143 PLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
+G V+CD +P++ L G NFTL+
Sbjct: 338 KSWSGQYTVECDARANLPDLTFNLNGYNFTLS 369
Score = 62.0 bits (149), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 37/83 (44%), Positives = 45/83 (54%), Gaps = 4/83 (4%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASD- 65
AQY+ I +GTP Q FKVI DTGSSNLWIPSQ CS L D ++ N S+
Sbjct: 109 AQYFTEIEVGTPGQPFKVILDTGSSNLWIPSQDCSSLACYLHSKYDHDASSTYKANGSEF 168
Query: 66 --ENGGEIMFGGVDKDKF-VGDI 85
+ G M G V D +GD+
Sbjct: 169 AIQYGSGAMEGYVSTDALRIGDL 191
>gi|148691635|gb|EDL23582.1| progastricsin (pepsinogen C) [Mus musculus]
Length = 392
Score = 87.8 bits (216), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 42/115 (36%), Positives = 74/115 (64%), Gaps = 4/115 (3%)
Query: 67 NGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCSN--CQAIADTGT 122
+GG+I+FGGVD++ + G++T+ PV+++ YWQ ++ I + + +CS+ CQ I DTGT
Sbjct: 224 DGGQIVFGGVDENLYTGELTWIPVTQELYWQITIDDFLIGNQASGWCSSSGCQGIVDTGT 283
Query: 123 SLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLTRTN 177
SL++ P++ + EL + IGA G V CD++ +P + +L G F L+ ++
Sbjct: 284 SLLVMPAQYLNELLQTIGAQEGEYGQYFVSCDSVSSLPTLTFVLNGVQFPLSPSS 338
Score = 57.4 bits (137), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 24/34 (70%), Positives = 28/34 (82%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHC 40
A YYG IS+GTPPQ F V+FDTGSSNLW+ S +C
Sbjct: 74 ASYYGEISIGTPPQNFLVLFDTGSSNLWVSSVYC 107
>gi|75338567|sp|Q9XFX4.1|CARDB_CYNCA RecName: Full=Procardosin-B; Contains: RecName: Full=Cardosin-B
heavy chain; AltName: Full=Cardosin-B 34 kDa subunit;
Contains: RecName: Full=Cardosin-B light chain; AltName:
Full=Cardosin-B 14 kDa subunit; Flags: Precursor
gi|4582534|emb|CAB40349.1| preprocardosin B [Cynara cardunculus]
Length = 506
Score = 87.8 bits (216), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 43/93 (46%), Positives = 62/93 (66%), Gaps = 3/93 (3%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCS-NCQAI 117
RN +E GGE++FGGVD + F G+ TY PV+RKGYWQF + + I + + +C+ C AI
Sbjct: 229 RNVDEEEGGELVFGGVDPNHFRGNHTYVPVTRKGYWQFEMGDVLIGDKSSGFCAGGCAAI 288
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAK 150
AD+GTS GP+ +I ++N+ IGA + N K
Sbjct: 289 ADSGTSFFAGPTAIITQINQAIGAKGVLNQQCK 321
Score = 58.9 bits (141), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 25/41 (60%), Positives = 31/41 (75%), Gaps = 1/41 (2%)
Query: 9 YYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSV 49
YYG I +GTPPQ F VIFDTGSS+LW+PS C ++AC +
Sbjct: 85 YYGEIGIGTPPQNFAVIFDTGSSDLWVPSTKCDT-SLACVI 124
>gi|297837199|ref|XP_002886481.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297332322|gb|EFH62740.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 513
Score = 87.8 bits (216), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 46/98 (46%), Positives = 62/98 (63%), Gaps = 3/98 (3%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYC-SNCQAI 117
R+ E GGEI+FGGVD F G+ T+ PV+++GYWQF + + I + YC S C AI
Sbjct: 233 RDPKSEEGGEIVFGGVDPKHFKGEHTFVPVTQRGYWQFDMGEVLIAGDSTGYCGSGCSAI 292
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDN 155
AD+GTSL+ GP+ VIA +NK IGA + + K D
Sbjct: 293 ADSGTSLLAGPTAVIAMINKAIGASGVVSQQCKTVVDQ 330
Score = 64.7 bits (156), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 27/47 (57%), Positives = 37/47 (78%), Gaps = 1/47 (2%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+++ AQYYG I++GTPPQ+F VIFDTGSSNLW+PS C +++C
Sbjct: 81 LKNYLDAQYYGEIAIGTPPQKFTVIFDTGSSNLWVPSGKC-FFSLSC 126
Score = 40.8 bits (94), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 17/45 (37%), Positives = 26/45 (57%)
Query: 130 KVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
+++ +N++ +P NG + VDC L KMP V +GGK F L
Sbjct: 409 RIVNYINEICERMPSPNGESAVDCSQLSKMPTVSFTIGGKVFDLA 453
>gi|254572447|ref|XP_002493333.1| Vacuolar aspartyl protease (proteinase A) [Komagataella pastoris
GS115]
gi|238033131|emb|CAY71154.1| Vacuolar aspartyl protease (proteinase A) [Komagataella pastoris
GS115]
gi|328352648|emb|CCA39046.1| vacuolar aspartic proteinase precursor similar to S. cerevisiae
PEP4 (YPL154C) [Komagataella pastoris CBS 7435]
Length = 410
Score = 87.8 bits (216), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 53/153 (34%), Positives = 82/153 (53%), Gaps = 5/153 (3%)
Query: 24 VIFDTGSSNLWIPSQHCSI-LNIACSVDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFV 82
+ +D+ S N +P + ++ L++ +D+ KF + + + +GG FGGVDK K+
Sbjct: 205 LAYDSISVNKIVPPIYKALELDL---LDEPKFAFYLGDTDKDESDGGLATFGGVDKSKYE 261
Query: 83 GDITYSPVSRKGYWQFGVESIKI-EKNVYCSNCQAIADTGTSLIIGPSKVIAELNKLIGA 141
G IT+ PV RK YW+ + + + + A DTGTSLI PS + LN IGA
Sbjct: 262 GKITWLPVRRKAYWEVSFDGVGLGSEYAELQKTGAAIDTGTSLIALPSGLAEILNAEIGA 321
Query: 142 VPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
+G VDCD D +P++ + G NFT+T
Sbjct: 322 TKGWSGQYAVDCDTRDSLPDLTLTFAGYNFTIT 354
Score = 64.7 bits (156), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 37/84 (44%), Positives = 47/84 (55%), Gaps = 4/84 (4%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASD- 65
AQY+ +SLGTPPQ FKVI DTGSSNLW+PS+ C L D ++ +N S
Sbjct: 94 AQYFTEVSLGTPPQSFKVILDTGSSNLWVPSKDCGSLACFLHAKYDHDESSTYKKNGSSF 153
Query: 66 --ENGGEIMFGGVDKDKF-VGDIT 86
G M G V +D +GD+T
Sbjct: 154 EIRYGSGSMEGYVSQDVLQIGDLT 177
>gi|397516556|ref|XP_003846117.1| PREDICTED: LOW QUALITY PROTEIN: pepsin A-5, partial [Pan paniscus]
Length = 274
Score = 87.8 bits (216), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 45/125 (36%), Positives = 73/125 (58%), Gaps = 5/125 (4%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-K 107
V +D F S + +A D++G ++FGG+D + G + + PV+ +GYWQ V+SI + K
Sbjct: 135 VSQDLF---SVYLSADDKSGSVVIFGGIDSSYYTGSLNWVPVTVEGYWQITVDSITMNGK 191
Query: 108 NVYCSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
+ C+ CQAI DTGTSL+ GP+ IA + IGA ++G V C + +P++ +
Sbjct: 192 TIACAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASENSDGDMVVSCSAISSLPDIVFTI 251
Query: 167 GGKNF 171
G +
Sbjct: 252 NGVQY 256
Score = 58.9 bits (141), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 24/41 (58%), Positives = 34/41 (82%), Gaps = 2/41 (4%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACS 48
+Y+GTI +GTP Q+F V+FDTGSSNLW+PS +C ++AC+
Sbjct: 2 EYFGTIGIGTPAQDFTVVFDTGSSNLWVPSVYC--YSLACT 40
>gi|73620985|sp|P81498.2|PEPC_SUNMU RecName: Full=Gastricsin; AltName: Full=Pepsinogen C-1; Flags:
Precursor
gi|9798662|dbj|BAB11753.1| pepsinogen C [Suncus murinus]
Length = 389
Score = 87.8 bits (216), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 42/112 (37%), Positives = 65/112 (58%), Gaps = 3/112 (2%)
Query: 65 DENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCSN-CQAIADTG 121
+NGG ++FGGVD + G I ++PV+++ YWQ GVE I + +C CQAI DTG
Sbjct: 220 SQNGGAVIFGGVDNSLYTGQIFWAPVTQELYWQIGVEEFLIGGQATGWCQQGCQAIVDTG 279
Query: 122 TSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
TSL+ P + ++ L + GA G V+C+++ +P + I+ G F L
Sbjct: 280 TSLLTVPQQFMSALQQATGAQQDQYGQLAVNCNSIQSLPTLTFIINGVQFPL 331
Score = 60.5 bits (145), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 29/54 (53%), Positives = 37/54 (68%), Gaps = 9/54 (16%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHF 60
A Y+G IS+GTPPQ F V+FDTGSSNLW+PS +C + AC+ GH+ F
Sbjct: 71 ASYFGEISIGTPPQNFLVLFDTGSSNLWVPSVYCQ--SQACT-------GHARF 115
>gi|393246119|gb|EJD53628.1| aspartic peptidase A1 [Auricularia delicata TFB-10046 SS5]
Length = 415
Score = 87.4 bits (215), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 50/117 (42%), Positives = 64/117 (54%), Gaps = 2/117 (1%)
Query: 60 FR-NASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI-EKNVYCSNCQAI 117
FR AS+ +GG +FGGVD + G ITY PV RK YW+ +E IK+ + V N A
Sbjct: 243 FRLGASESDGGSAVFGGVDSSHYKGQITYVPVRRKAYWEVELEGIKLGDDEVDFENTGAA 302
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSLI+ P + +N IGA NG V C+ +PN GGK + LT
Sbjct: 303 IDTGTSLIVLPVDIGEMINAQIGAKKSWNGQYTVPCEKRSSLPNFTFNFGGKPYVLT 359
Score = 60.1 bits (144), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 33/84 (39%), Positives = 46/84 (54%), Gaps = 4/84 (4%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASD- 65
AQY+ ISLG+P Q FKV+ DTGSSNLW+PS C+ + D ++ +N S
Sbjct: 102 AQYFAEISLGSPAQNFKVVLDTGSSNLWVPSSGCTSIACFLHAKYDSSASSTYKKNGSSF 161
Query: 66 --ENGGEIMFGGVDKDKF-VGDIT 86
G M G + +D +GDI+
Sbjct: 162 EIHYGSGSMEGFISQDTLKIGDIS 185
>gi|51534964|dbj|BAD36915.1| pepsinogen C [Myocastor coypus]
Length = 393
Score = 87.4 bits (215), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 44/117 (37%), Positives = 74/117 (63%), Gaps = 4/117 (3%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCSN-CQAI 117
++A++E GG ++ GGVD+ + G I+++PV+++ YWQ G+E + E + +CS CQAI
Sbjct: 221 QDATNE-GGALILGGVDESLYSGAISWTPVTQELYWQIGIEDFLLDGEASGWCSEGCQAI 279
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSL+ P + ++ L + IGA G VDC+N+ +P + ++ G F L+
Sbjct: 280 VDTGTSLLTVPQQYLSTLIEAIGAEEDEYGEYLVDCNNVQNLPTLTFVISGVQFPLS 336
Score = 60.8 bits (146), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 29/54 (53%), Positives = 37/54 (68%), Gaps = 9/54 (16%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHF 60
A Y+G ISLGTPPQ F+V+FDTGSSNLW+ S +C ++AC+ HS F
Sbjct: 74 ASYFGEISLGTPPQSFQVLFDTGSSNLWVASIYCK--SLACTT-------HSRF 118
>gi|225556537|gb|EEH04825.1| aspartic endopeptidase Pep2 [Ajellomyces capsulatus G186AR]
Length = 398
Score = 87.4 bits (215), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 45/110 (40%), Positives = 66/110 (60%), Gaps = 1/110 (0%)
Query: 65 DENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKN-VYCSNCQAIADTGTS 123
D + E++FGG++K++F+G++T P+ RK YW+ ++SI K +N I DTGTS
Sbjct: 232 DGDESEVVFGGMNKNRFMGELTKIPLRRKAYWEVDLDSITFGKQTAMMANTGVILDTGTS 291
Query: 124 LIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
LI PS + LNK IGA NG ++C D +PN+ L G NFT+
Sbjct: 292 LIALPSTIAELLNKEIGAKKSFNGQYTIECAKRDSLPNLTFGLSGHNFTI 341
Score = 63.2 bits (152), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 30/61 (49%), Positives = 39/61 (63%), Gaps = 4/61 (6%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDK--DKFVGHSHFRNAS 64
AQY+ I +GTPPQ FKV+ DTGSSNLW+PS C +IAC + D +H +N S
Sbjct: 83 AQYFSEIGIGTPPQTFKVVLDTGSSNLWVPSSECG--SIACYLHNKYDSSASSTHKKNGS 140
Query: 65 D 65
+
Sbjct: 141 E 141
>gi|402906424|ref|XP_003916002.1| PREDICTED: napsin-A-like [Papio anubis]
Length = 421
Score = 87.4 bits (215), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 43/116 (37%), Positives = 64/116 (55%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVY--CSNCQAIA 118
R+ + +GGE++ GG D ++ +T+ PV+ YWQ +E +K+ + C AI
Sbjct: 224 RDPEEPDGGELVLGGSDPAHYIPPLTFVPVTVPAYWQIHMERVKVGPGLTLCVPGCAAIL 283
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSLI GP++ I L+ IG PL G + C + K+P V +LG F LT
Sbjct: 284 DTGTSLITGPTEEIRALHAAIGGYPLLAGEYIILCSEIPKLPAVSFLLGEVWFNLT 339
Score = 65.9 bits (159), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 27/46 (58%), Positives = 35/46 (76%), Gaps = 1/46 (2%)
Query: 3 SSYQ-AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
S+Y+ QY+G I LGTPPQ F V+FDTGSSNLW+PS+ C ++ C
Sbjct: 72 SNYRDVQYFGKIGLGTPPQNFTVVFDTGSSNLWVPSRRCHFFSVPC 117
>gi|345328796|ref|XP_003431303.1| PREDICTED: LOW QUALITY PROTEIN: chymosin-like [Ornithorhynchus
anatinus]
Length = 383
Score = 87.4 bits (215), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 46/116 (39%), Positives = 65/116 (56%), Gaps = 2/116 (1%)
Query: 63 ASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYC--SNCQAIADT 120
+ +N I FG +D+ + G I + PV++ YWQ V+ I ++ V CQAI DT
Sbjct: 219 SRSDNESLITFGAIDEFHYTGPIHWVPVTQAMYWQIAVDKITVDGEVVACGGGCQAILDT 278
Query: 121 GTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLTRT 176
GTSL+ GPS IA + LIGA NG +VDC L ++P+V + G + LT T
Sbjct: 279 GTSLMAGPSPDIAGIQYLIGATEGRNGLYQVDCRKLQRLPDVVFHISGVKYPLTAT 334
Score = 60.5 bits (145), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 25/46 (54%), Positives = 35/46 (76%), Gaps = 2/46 (4%)
Query: 6 QAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDK 51
+ +Y+GTIS+GTPPQEF V+FDTGSS+ W+PS C+ + AC +
Sbjct: 71 ETEYFGTISIGTPPQEFTVVFDTGSSDFWVPSVFCN--SHACKTHR 114
>gi|261264943|gb|ACX55830.1| aspartic proteinase 2 [Castanea mollissima]
Length = 513
Score = 87.4 bits (215), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 43/93 (46%), Positives = 61/93 (65%), Gaps = 3/93 (3%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV--YC-SNCQAI 117
RN +E GGEI+FGGVD + + G TY PV++KGYWQF + + I+ +C + C AI
Sbjct: 234 RNTDEEEGGEIVFGGVDPNHYKGKHTYVPVTQKGYWQFDMGDVLIDGQTTGFCVTTCSAI 293
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAK 150
AD+GTSL+ GP+ +I E+N IGA + + K
Sbjct: 294 ADSGTSLLAGPTTIITEVNHAIGATGVVSQECK 326
Score = 64.3 bits (155), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 27/47 (57%), Positives = 36/47 (76%), Gaps = 1/47 (2%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+++ AQY+G I +GTPPQ+F VIFDTGSSNLW+PS C ++AC
Sbjct: 82 LKNYMDAQYFGEIGVGTPPQKFTVIFDTGSSNLWVPSSKC-YFSVAC 127
Score = 38.1 bits (87), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 15/45 (33%), Positives = 28/45 (62%)
Query: 130 KVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
+++ +N+L +P G + VDC +L +PNV + +GG+ F L+
Sbjct: 409 RILTYVNELCDRLPSPMGESAVDCGSLSSLPNVSLTIGGRVFDLS 453
>gi|222425182|dbj|BAH20540.1| pepsinogen A-15 [Pongo abelii]
Length = 388
Score = 87.4 bits (215), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 46/127 (36%), Positives = 74/127 (58%), Gaps = 5/127 (3%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-K 107
V +D F S + +A D++G ++FGG+D + G + + PV+ +GYWQ V+SI + +
Sbjct: 208 VSQDLF---SVYLSADDKSGSVVIFGGIDSSYYTGSLNWVPVTVEGYWQITVDSITMNGE 264
Query: 108 NVYCSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
+ C+ CQAI DTGTSL+ GP+ IA + IGA +NG V C + +P++ +
Sbjct: 265 AIACAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASENSNGDMVVSCSAISSLPDIVFTI 324
Query: 167 GGKNFTL 173
G + L
Sbjct: 325 NGVQYPL 331
Score = 58.5 bits (140), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 26/47 (55%), Positives = 37/47 (78%), Gaps = 3/47 (6%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKF 54
+Y+GTI +GTP Q+F V+FDTGSSNLW+PS +C ++AC +D + F
Sbjct: 75 EYFGTIGIGTPAQDFTVVFDTGSSNLWVPSVYC--YSLAC-MDHNLF 118
>gi|349581664|dbj|GAA26821.1| K7_Pep4p [Saccharomyces cerevisiae Kyokai no. 7]
Length = 405
Score = 87.4 bits (215), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 49/126 (38%), Positives = 69/126 (54%), Gaps = 1/126 (0%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI-EK 107
+D+ KF + + ENGGE FGG+D+ KF GDIT+ PV RK YW+ E I + ++
Sbjct: 223 LDEKKFAFYLGDTSKDTENGGEATFGGIDESKFKGDITWLPVRRKAYWEVKFEGIGLGDE 282
Query: 108 NVYCSNCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILG 167
+ A DTGTSLI PS + +N IGA G +DC+ D +P++
Sbjct: 283 YAELESHGAAIDTGTSLITLPSGLAEMINAEIGAKKGWTGQYTLDCNTRDNLPDLIFNFN 342
Query: 168 GKNFTL 173
G NFT+
Sbjct: 343 GYNFTI 348
Score = 63.9 bits (154), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 36/84 (42%), Positives = 47/84 (55%), Gaps = 4/84 (4%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASD- 65
AQYY I+LGTPPQ FKVI DTGSSNLW+PS C L D S+ N ++
Sbjct: 89 AQYYTDITLGTPPQNFKVILDTGSSNLWVPSNECGSLACFLHSKYDHEASSSYKANGTEF 148
Query: 66 --ENGGEIMFGGVDKDKF-VGDIT 86
+ G + G + +D +GD+T
Sbjct: 149 AIQYGTGSLEGYISQDTLSIGDLT 172
>gi|311258028|ref|XP_003127411.1| PREDICTED: napsin-A [Sus scrofa]
Length = 416
Score = 87.4 bits (215), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 44/116 (37%), Positives = 63/116 (54%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAIA 118
R+ +GGE++ GG D ++ +T+ PV+ YWQ VE + + + C+ C AI
Sbjct: 219 RDPEAADGGELVLGGSDPAHYIPPLTFVPVTVPAYWQVHVERVHVGTGLTLCAQGCAAIL 278
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSLI GP++ I L IG +PL G + C + +P V LGG F LT
Sbjct: 279 DTGTSLITGPTEEIQALQAAIGGIPLLMGEYLIQCSKIPTLPPVSFHLGGVWFNLT 334
Score = 67.8 bits (164), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 28/40 (70%), Positives = 31/40 (77%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
QYYG I LGTPPQ F VIFDTGSSNLW+PS C L++ C
Sbjct: 73 QYYGEIGLGTPPQNFSVIFDTGSSNLWVPSGRCHFLSLPC 112
>gi|327271207|ref|XP_003220379.1| PREDICTED: gastricsin-like [Anolis carolinensis]
Length = 388
Score = 87.4 bits (215), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 41/116 (35%), Positives = 68/116 (58%), Gaps = 3/116 (2%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCSN-CQAI 117
R + + GGE++FGGVD + G+I ++PV+++ YWQ G++ I + +CS CQAI
Sbjct: 215 RQPNSQYGGEVVFGGVDTRLYSGEIYWAPVTQELYWQIGIQEFSIGGQATGWCSQGCQAI 274
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
DTGTSL+ P + ++ +GA G V+C+N+ +P + + G +F L
Sbjct: 275 VDTGTSLLTVPQQYMSNFLSAVGAQQNQYGQYAVNCNNVQNLPTISFTINGVSFPL 330
Score = 55.8 bits (133), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 23/35 (65%), Positives = 28/35 (80%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCS 41
A YYG I +GTP Q F V+FDTGSSNLW+PS +C+
Sbjct: 70 ASYYGQIGIGTPAQNFLVLFDTGSSNLWVPSIYCN 104
>gi|194218271|ref|XP_001501895.2| PREDICTED: pepsin A-like [Equus caballus]
Length = 387
Score = 87.4 bits (215), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 44/127 (34%), Positives = 74/127 (58%), Gaps = 5/127 (3%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-K 107
V +D F S + ++ DE+G +MFGG+D + G + + PVS +GYWQ ++S+ + +
Sbjct: 207 VSQDLF---SVYLSSDDESGSVVMFGGIDPSYYTGSLHWVPVSNEGYWQITMDSVTVNGE 263
Query: 108 NVYCS-NCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
++ CS CQAI DTGTSL+ GP+ I + +G ++G + C ++ +P++ L
Sbjct: 264 SIACSGGCQAIVDTGTSLLAGPTSAIDNIQSYLGFSEDSSGEGVISCSSIYSLPDIVFTL 323
Query: 167 GGKNFTL 173
G F L
Sbjct: 324 NGVEFPL 330
Score = 65.9 bits (159), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 31/46 (67%), Positives = 39/46 (84%), Gaps = 3/46 (6%)
Query: 9 YYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKF 54
Y+GTIS+GTP QEF VIFDTGSSNLW+PS +CS ++ACS D ++F
Sbjct: 75 YFGTISIGTPAQEFTVIFDTGSSNLWVPSIYCS--SLACS-DHNRF 117
>gi|12248414|dbj|BAB20092.1| pepsinogen A [Rana catesbeiana]
Length = 385
Score = 87.4 bits (215), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 47/127 (37%), Positives = 74/127 (58%), Gaps = 5/127 (3%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKN 108
+ +D F S + ++ ++G ++FGGVD + G++ + P++ + YWQ V+SI I
Sbjct: 205 IPQDLF---SVYLSSQGQSGSFVLFGGVDTSYYTGNLNWVPLTAETYWQITVDSISIGGQ 261
Query: 109 VY-CS-NCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
V CS +C AI DTGTSL+ GPS IA + IGA +NG ++C+N+ MP V +
Sbjct: 262 VIACSGSCSAIVDTGTSLLAGPSTPIANIQYYIGANQDSNGQYVINCNNISNMPTVVFTI 321
Query: 167 GGKNFTL 173
G + L
Sbjct: 322 NGVQYPL 328
Score = 63.5 bits (153), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 26/34 (76%), Positives = 31/34 (91%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCS 41
+Y+GTIS+GTPPQ F VIFDTGSSNLW+PS +CS
Sbjct: 72 EYFGTISIGTPPQSFTVIFDTGSSNLWVPSVYCS 105
>gi|261264941|gb|ACX55829.1| aspartic proteinase 1 [Castanea mollissima]
Length = 513
Score = 87.4 bits (215), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 43/93 (46%), Positives = 61/93 (65%), Gaps = 3/93 (3%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV--YCSN-CQAI 117
RN +E GGEI+FGGVD + + G TY PV++KGYWQF + + I+ +C+ C AI
Sbjct: 234 RNTDEEEGGEIVFGGVDPNHYKGKHTYVPVTQKGYWQFDMGDVLIDGQTTGFCARGCSAI 293
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAK 150
AD+GTSL+ GP+ +I E+N IGA + + K
Sbjct: 294 ADSGTSLLAGPTTIITEVNHAIGATGVVSQECK 326
Score = 64.3 bits (155), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 27/47 (57%), Positives = 36/47 (76%), Gaps = 1/47 (2%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+++ AQY+G I +GTPPQ+F VIFDTGSSNLW+PS C ++AC
Sbjct: 82 LKNYMDAQYFGEIGVGTPPQKFTVIFDTGSSNLWVPSSKC-YFSVAC 127
Score = 38.1 bits (87), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 15/45 (33%), Positives = 28/45 (62%)
Query: 130 KVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
+++ +N+L +P G + VDC +L +PNV + +GG+ F L+
Sbjct: 409 RILTYVNELCDRLPSPMGESAVDCGSLSSLPNVSLTIGGRVFDLS 453
>gi|161015872|gb|ABX55935.1| preprochymosin b [Bubalus bubalis]
Length = 381
Score = 87.4 bits (215), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 50/125 (40%), Positives = 64/125 (51%), Gaps = 4/125 (3%)
Query: 54 FVGHSHFRNASDENGGEIMF--GGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYC 111
V F D NG E M G +D + G + + PV+ + YWQF V+SI I V
Sbjct: 205 LVAQDLFSVYMDRNGQESMLTLGAIDPSYYTGSLHWVPVTVQQYWQFTVDSITISGVVVA 264
Query: 112 SN--CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGK 169
CQAI DTGTS ++GPS I + + IGA G +DCDNL MP V + GK
Sbjct: 265 CEGGCQAILDTGTSKLVGPSSDILNIQQAIGATQNQYGEFDIDCDNLSYMPTVVFEINGK 324
Query: 170 NFTLT 174
+ LT
Sbjct: 325 MYPLT 329
Score = 59.3 bits (142), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 23/34 (67%), Positives = 29/34 (85%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHC 40
+QY+G I LGTPPQEF V+FDTGSS+ W+PS +C
Sbjct: 72 SQYFGKIYLGTPPQEFTVLFDTGSSDFWVPSIYC 105
>gi|297705581|ref|XP_002829653.1| PREDICTED: napsin-A, partial [Pongo abelii]
Length = 392
Score = 87.4 bits (215), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 41/116 (35%), Positives = 67/116 (57%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAIA 118
R+ +GGE++ GG D ++ +T+ PV+ YWQ +E +K+ + C+ C AI
Sbjct: 195 RDPKVADGGELVLGGSDPAHYIPPLTFVPVTVPAYWQIHMERVKVGSGLTLCARGCAAIL 254
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGT +I+GP++ I L+ IG +PL G + C + K+P V +++ G F LT
Sbjct: 255 DTGTPVIVGPTEEIRALHAAIGGIPLLAGEYIIRCSEIPKLPAVSLLIAGVWFNLT 310
Score = 65.1 bits (157), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 25/43 (58%), Positives = 31/43 (72%)
Query: 5 YQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+ QY+G I LGTPPQ F V FDTGSSNLW+PS+ C ++ C
Sbjct: 46 WDVQYFGEIGLGTPPQNFTVAFDTGSSNLWVPSRRCHFFSVPC 88
>gi|351707910|gb|EHB10829.1| Gastricsin [Heterocephalus glaber]
Length = 391
Score = 87.4 bits (215), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 41/110 (37%), Positives = 68/110 (61%), Gaps = 3/110 (2%)
Query: 67 NGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV--YCSN-CQAIADTGTS 123
+GG ++ GGVD+ + G I+++PV+++ YWQ G+E ++++ +CS CQ I DTGTS
Sbjct: 224 DGGALILGGVDESLYNGQISWTPVTQELYWQIGIEDVQLDNQALGWCSQGCQGIVDTGTS 283
Query: 124 LIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
L+ P + + L + IGA G VDC+++ +P + +IL G F L
Sbjct: 284 LLTLPQQYLTTLIQAIGAQENEFGEYVVDCNSIQSLPTLTVILSGVKFPL 333
Score = 63.5 bits (153), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 27/42 (64%), Positives = 35/42 (83%), Gaps = 2/42 (4%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACS 48
A Y+G ISLGTPPQ F+V+FDTGSSNLW+PS +C ++AC+
Sbjct: 73 AAYFGEISLGTPPQSFQVLFDTGSSNLWVPSVYCQ--SLACT 112
>gi|384040313|gb|AFH58568.1| aspartic acid protease [Ananas comosus]
Length = 514
Score = 87.4 bits (215), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 42/84 (50%), Positives = 61/84 (72%), Gaps = 3/84 (3%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCS-NCQAI 117
RNA+D GGEI+FGG D + + G+ TY PV++KGYWQF + + + + +C+ C AI
Sbjct: 234 RNANDGEGGEIVFGGADPNHYKGNHTYVPVTQKGYWQFEMGDVLVGGQSTGFCNGGCAAI 293
Query: 118 ADTGTSLIIGPSKVIAELNKLIGA 141
AD+GTSL+ GP+ +IAE+N+ IGA
Sbjct: 294 ADSGTSLLAGPTTIIAEINQKIGA 317
Score = 65.1 bits (157), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 28/47 (59%), Positives = 36/47 (76%), Gaps = 1/47 (2%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+++ AQY+G I +GTPPQ+F VIFDTGSSNLW+PS C +IAC
Sbjct: 82 LKNYMNAQYFGEIGIGTPPQKFTVIFDTGSSNLWVPSSKC-YFSIAC 127
Score = 37.0 bits (84), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 15/39 (38%), Positives = 24/39 (61%)
Query: 135 LNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
LN+L +P G + VDC ++ MP++ +GGK F+L
Sbjct: 415 LNQLCEKLPSPMGESSVDCSSVASMPDISFTIGGKKFSL 453
>gi|157837066|pdb|5PEP|A Chain A, X-Ray Analyses Of Aspartic Proteases. Ii.
Three-Dimensional Structure Of The Hexagonal Crystal
Form Of Porcine Pepsin At 2.3 Angstroms Resolution
Length = 326
Score = 87.4 bits (215), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 43/128 (33%), Positives = 77/128 (60%), Gaps = 5/128 (3%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-K 107
V +D F S + +++D++G ++ GG+D + G + + PVS +GYWQ ++SI ++ +
Sbjct: 146 VSQDLF---SVYLSSNDDSGSVVLLGGIDSSYYTGSLNWVPVSVEGYWQITLDSITMDGE 202
Query: 108 NVYCSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
+ CS CQAI DTGTSL+ GP+ IA + IGA ++G + C ++ +P++ +
Sbjct: 203 TIACSGGCQAIVDTGTSLLTGPTSAIANIQSDIGASENSDGEMVISCSSIASLPDIVFTI 262
Query: 167 GGKNFTLT 174
G + L+
Sbjct: 263 NGVQYPLS 270
Score = 63.5 bits (153), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 29/54 (53%), Positives = 41/54 (75%), Gaps = 3/54 (5%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKF 54
+ + +Y+GTI +GTP Q+F VIFDTGSSNLW+PS +CS ++ACS D ++F
Sbjct: 6 LENYLDTEYFGTIGIGTPAQDFTVIFDTGSSNLWVPSVYCS--SLACS-DHNQF 56
>gi|129776|sp|P03954.2|PEPA1_MACFU RecName: Full=Pepsin A-1; AltName: Full=Pepsin III-3; Flags:
Precursor
gi|38075|emb|CAA42424.1| prepropepsin a [Macaca fuscata]
Length = 388
Score = 87.4 bits (215), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 45/125 (36%), Positives = 73/125 (58%), Gaps = 5/125 (4%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-K 107
V +D F S + +A D++G ++FGG+D + G + + PVS +GYWQ V+SI + +
Sbjct: 208 VSQDLF---SVYLSADDQSGSVVIFGGIDSSYYTGSLNWVPVSVEGYWQISVDSITMNGE 264
Query: 108 NVYCSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
+ C+ CQAI DTGTSL+ GP+ IA + IGA ++G V C + +P++ +
Sbjct: 265 AIACAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASENSDGEMVVSCSAISSLPDIVFTI 324
Query: 167 GGKNF 171
G +
Sbjct: 325 NGIQY 329
Score = 60.8 bits (146), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 26/41 (63%), Positives = 35/41 (85%), Gaps = 2/41 (4%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACS 48
+Y+GTI +GTP Q+F VIFDTGSSNLW+PS +CS ++AC+
Sbjct: 75 EYFGTIGIGTPAQDFTVIFDTGSSNLWVPSVYCS--SLACT 113
>gi|322700747|gb|EFY92500.1| vacuolar protease A [Metarhizium acridum CQMa 102]
Length = 395
Score = 87.4 bits (215), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 45/119 (37%), Positives = 67/119 (56%), Gaps = 1/119 (0%)
Query: 60 FRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVY-CSNCQAIA 118
F S E G E +FGG+DK+ + G++ Y P+ RK YW+ + SI + + + AI
Sbjct: 224 FYLGSKEEGSEAVFGGIDKNHYTGELEYLPLRRKAYWEVDINSIALGDEIAELDHTGAIL 283
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLTRTN 177
DTGTSL + PS + LNK IGA NG VDCD + +P++ L N++L ++
Sbjct: 284 DTGTSLNVLPSTLAELLNKEIGAKKSWNGQYTVDCDKIKSLPDIVFNLSNSNYSLPASD 342
Score = 62.4 bits (150), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 26/41 (63%), Positives = 34/41 (82%), Gaps = 2/41 (4%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQY+ I++G+PPQ FKV+ DTGSSNLW+PSQ C+ +IAC
Sbjct: 83 AQYFSEITIGSPPQSFKVVLDTGSSNLWVPSQSCN--SIAC 121
>gi|255578112|ref|XP_002529926.1| Aspartic proteinase precursor, putative [Ricinus communis]
gi|223530603|gb|EEF32480.1| Aspartic proteinase precursor, putative [Ricinus communis]
Length = 514
Score = 87.4 bits (215), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 42/93 (45%), Positives = 62/93 (66%), Gaps = 3/93 (3%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE---KNVYCSNCQAI 117
RNA ++ GGEI+FGG+D + + G+ TY PV++KGYWQF + + I+ + S C AI
Sbjct: 234 RNADEDEGGEIVFGGMDPNHYKGEHTYVPVTQKGYWQFDMGDVLIDGKTTGICSSGCAAI 293
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAK 150
AD+GTSL+ GP+ +I E+N IGA + + K
Sbjct: 294 ADSGTSLLAGPTTIITEVNHAIGATGVVSQECK 326
Score = 64.3 bits (155), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 27/47 (57%), Positives = 36/47 (76%), Gaps = 1/47 (2%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+++ AQY+G I +GTPPQ+F VIFDTGSSNLW+PS C ++AC
Sbjct: 82 LKNYMDAQYFGEIGIGTPPQKFTVIFDTGSSNLWVPSSKC-YFSVAC 127
Score = 37.4 bits (85), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 26/89 (29%), Positives = 42/89 (47%), Gaps = 12/89 (13%)
Query: 98 FGVESIKIEK---------NVYCSNCQ-AIADTGTSLIIGPSK--VIAELNKLIGAVPLA 145
G+ES+ EK + CS C+ A+ L ++ ++ +N+L +P
Sbjct: 366 MGIESVVNEKIQEVAGGLHDAMCSTCEMAVVWMQNQLKQNQTQEHILNYVNELCERLPSP 425
Query: 146 NGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
G + VDC +L MPNV +GG+ F L
Sbjct: 426 MGESAVDCGSLSTMPNVSFTIGGRVFDLA 454
>gi|311260416|ref|XP_003128442.1| PREDICTED: gastricsin-like [Sus scrofa]
Length = 394
Score = 87.0 bits (214), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 44/113 (38%), Positives = 66/113 (58%), Gaps = 3/113 (2%)
Query: 64 SDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCSN-CQAIADT 120
S ++GGE++ GGVD + G I ++PV+++ YWQ G+E I E + +CS CQAI DT
Sbjct: 224 SSQDGGELVLGGVDSSLYTGQIYWAPVTQELYWQIGIEEFLIGDEASGWCSEGCQAIVDT 283
Query: 121 GTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
GTSL+ P +++L + GA G VDC ++ +P I+ G F L
Sbjct: 284 GTSLLTVPQDYLSDLVQATGAEENEYGEFLVDCKDIQSLPTFTFIINGVEFPL 336
Score = 63.5 bits (153), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 36/101 (35%), Positives = 54/101 (53%), Gaps = 19/101 (18%)
Query: 6 QAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASD 65
+A Y+G IS+GTPPQ F V+FDTGSSNLW+PS +C ++AC+ H+ F +
Sbjct: 75 EAAYFGEISIGTPPQNFLVLFDTGSSNLWVPSVYCK--SLACTT-------HARFNPSKS 125
Query: 66 ENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE 106
D+ + Y S G+ FG +++KI+
Sbjct: 126 ST--------YSTDRQTFSLQYGSGSLTGF--FGYDTLKIQ 156
>gi|351710945|gb|EHB13864.1| Cathepsin E, partial [Heterocephalus glaber]
Length = 391
Score = 87.0 bits (214), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 45/112 (40%), Positives = 68/112 (60%), Gaps = 3/112 (2%)
Query: 67 NGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAIADTGTSL 124
+GGE+ FGG D F G + + PV+++ YWQ ++ I + +V +CS CQAI DTGTSL
Sbjct: 222 SGGELTFGGYDPSHFSGSLNWVPVTKQAYWQIALDGILVGDSVMFCSEGCQAIVDTGTSL 281
Query: 125 IIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLTRT 176
I GP I +L + +GA + V+C NL+ M +V ++ G +TL+ T
Sbjct: 282 ITGPPPKIKQLQEALGAT-YVDEEYAVECANLNMMQDVTFVINGVLYTLSPT 332
Score = 62.8 bits (151), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 32/61 (52%), Positives = 40/61 (65%), Gaps = 5/61 (8%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASDEN 67
+Y+GTIS+G+PPQ F VIFDTGSSNLW+PS +C+ + AC K V H N E
Sbjct: 73 EYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCT--SPAC---KTHPVFHPSLSNTYSEV 127
Query: 68 G 68
G
Sbjct: 128 G 128
>gi|118102563|ref|XP_425832.2| PREDICTED: gastricsin-like [Gallus gallus]
Length = 388
Score = 87.0 bits (214), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 42/117 (35%), Positives = 71/117 (60%), Gaps = 4/117 (3%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV--YCSN-CQAI 117
RN + GGE++ GGVD F GD+ ++PV+++ YWQ ++ I ++V +CS CQAI
Sbjct: 216 RNPTYNYGGELVLGGVDSRLFTGDVVWAPVTQELYWQVAIDEFAIGQSVMGWCSQGCQAI 275
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGT L+ P + ++ + +GA + G A VDC++++ +P + I+ G L+
Sbjct: 276 VDTGTFLLTVPQQYLSRFLQAVGAQETSYGYA-VDCNDINSLPTITFIISGARLPLS 331
Score = 60.1 bits (144), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 25/42 (59%), Positives = 32/42 (76%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSI 42
M S + Y+G IS+GTPPQ F V+FDTGSSNLW+PS C++
Sbjct: 65 MASHLDSSYFGEISIGTPPQNFLVLFDTGSSNLWVPSTFCNM 106
>gi|357134751|ref|XP_003568979.1| PREDICTED: aspartic proteinase-like [Brachypodium distachyon]
Length = 498
Score = 87.0 bits (214), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 45/102 (44%), Positives = 65/102 (63%), Gaps = 8/102 (7%)
Query: 48 SVDKDKFVGHSHF-----RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVES 102
S+ K K + F RNA +GGE++FGGVD + + G+ TY PVSRKGYWQF +
Sbjct: 205 SMQKQKLLADDVFSFWLNRNADATSGGELVFGGVDSNHYKGNHTYVPVSRKGYWQFNMGD 264
Query: 103 IKI--EKNVYCSN-CQAIADTGTSLIIGPSKVIAELNKLIGA 141
+ I + +C+ C AI D+GTSL+ GP+ ++A++N IGA
Sbjct: 265 LLIDGQSTGFCAKGCAAIVDSGTSLLAGPTAIVAQVNHAIGA 306
Score = 62.0 bits (149), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 27/40 (67%), Positives = 31/40 (77%), Gaps = 1/40 (2%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
QY+G I +GTPPQ F VIFDTGSSNLW+PS C +IAC
Sbjct: 78 QYFGVIGVGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIAC 116
Score = 40.8 bits (94), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 26/83 (31%), Positives = 42/83 (50%), Gaps = 7/83 (8%)
Query: 99 GVESIKIEKNV----YCSNCQ-AIADTGTSLIIGPSK--VIAELNKLIGAVPLANGPAKV 151
G+ES+ ++NV C+ C+ A+ L +K ++ N+L +P NG + V
Sbjct: 356 GIESVVGKENVGSDVMCTACEMAVVWIENQLRENKTKELILQYANQLCERLPSPNGESTV 415
Query: 152 DCDNLDKMPNVDIILGGKNFTLT 174
C + KMPN+ + GK F LT
Sbjct: 416 SCHEISKMPNLAFTIAGKTFVLT 438
>gi|195443984|ref|XP_002069665.1| GK11643 [Drosophila willistoni]
gi|194165750|gb|EDW80651.1| GK11643 [Drosophila willistoni]
Length = 473
Score = 87.0 bits (214), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 44/112 (39%), Positives = 62/112 (55%), Gaps = 4/112 (3%)
Query: 64 SDENGGEIMFGGVDKDKFVGD--ITYSPVSRKGYWQFGVESIKIEKNVYCSNCQAIADTG 121
S GGE++FG D + G TY+PV+ +GYWQF ++S+ I + QAI DTG
Sbjct: 306 SSTRGGELIFGSTDTSTYTGSNSYTYTPVTVQGYWQFKLQSVSIGSDETAGESQAICDTG 365
Query: 122 TSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
TSLI P K A +NK IG +G + C K+P++ +GG FT+
Sbjct: 366 TSLIAAPKKAFAAINKKIGCHTTTSGECWMKCS--KKIPDITFKIGGTKFTM 415
Score = 43.9 bits (102), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 16/33 (48%), Positives = 25/33 (75%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHC 40
+Y +++GTP Q+F V+ DTGSSN+W+P +C
Sbjct: 159 EYTCKMTIGTPAQKFTVLPDTGSSNIWVPGLNC 191
>gi|344257339|gb|EGW13443.1| Napsin-A [Cricetulus griseus]
Length = 532
Score = 87.0 bits (214), Expect = 2e-15, Method: Composition-based stats.
Identities = 46/116 (39%), Positives = 65/116 (56%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAIA 118
R+A +GGE++ GG D ++ +T+ PV+ YWQ +ES+ + + C+ C I
Sbjct: 152 RDAEGSDGGELVLGGSDPAHYIPPLTFIPVTIPAYWQVHMESVNVGTGLSLCAQGCGVIL 211
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSLI GPS+ I LNK IG +P G + C ++P V LGG F LT
Sbjct: 212 DTGTSLITGPSEEIHALNKAIGGLPFLAGQYFIQCSKTPELPTVSFRLGGVWFNLT 267
Score = 48.9 bits (115), Expect = 9e-04, Method: Composition-based stats.
Identities = 20/26 (76%), Positives = 22/26 (84%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNL 33
QY+G I LGTPPQ F V+FDTGSSNL
Sbjct: 4 QYFGDIGLGTPPQNFTVVFDTGSSNL 29
>gi|320588396|gb|EFX00865.1| aspartic endopeptidase pep2 [Grosmannia clavigera kw1407]
Length = 401
Score = 87.0 bits (214), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 63/173 (36%), Positives = 84/173 (48%), Gaps = 15/173 (8%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNA-SD 65
A++ G + LG FDT S N +P + N+ D V + NA SD
Sbjct: 188 ARFDGILGLG---------FDTISVNHIVPPFY----NLINQKLIDSGVFAFYLGNADSD 234
Query: 66 ENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI-EKNVYCSNCQAIADTGTSL 124
+ E +FGGVDK + G IT P+ RK YW+ ++SI + E N I DTGTSL
Sbjct: 235 GDDSEAVFGGVDKAHYTGKITTIPLRRKAYWEVDLDSISLGEDTAELENTGVILDTGTSL 294
Query: 125 IIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLTRTN 177
I PS + LN IGA NG VDC +P+V L G NF+L ++
Sbjct: 295 IALPSSLAEMLNAQIGAKKGYNGQYSVDCSRKSSLPDVTFTLSGYNFSLPASD 347
Score = 63.5 bits (153), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 27/41 (65%), Positives = 32/41 (78%), Gaps = 2/41 (4%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQY+ IS+GTPPQ FKV+ DTGSSNLW+PS C +IAC
Sbjct: 85 AQYFAEISIGTPPQSFKVVLDTGSSNLWVPSSQCG--SIAC 123
>gi|291416270|ref|XP_002724368.1| PREDICTED: pepsin II-4-like [Oryctolagus cuniculus]
Length = 387
Score = 87.0 bits (214), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 44/127 (34%), Positives = 72/127 (56%), Gaps = 5/127 (3%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-K 107
V +D F S + ++ DE G +MFGG+D + G + + PVS +GYWQ ++S+ I +
Sbjct: 207 VSQDLF---SVYLSSDDEKGSLVMFGGIDSSYYTGSLNWVPVSYEGYWQITMDSVSINGE 263
Query: 108 NVYCSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
+ C++ CQAI DTGTSL+ GP+ I+ + IGA G + C + +P++ +
Sbjct: 264 TIACADSCQAIVDTGTSLLAGPTSAISNIQSYIGASKNLLGENVISCSAISSLPDIVFTI 323
Query: 167 GGKNFTL 173
G + L
Sbjct: 324 NGIQYPL 330
Score = 70.9 bits (172), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 30/45 (66%), Positives = 40/45 (88%), Gaps = 2/45 (4%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDK 51
A+Y+GTIS+GTPPQ+F VIFDTGSSNLW+PS +CS ++AC++ K
Sbjct: 73 AEYFGTISIGTPPQDFTVIFDTGSSNLWVPSTYCS--SLACALHK 115
>gi|431910409|gb|ELK13482.1| Pepsin A [Pteropus alecto]
Length = 386
Score = 87.0 bits (214), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 44/127 (34%), Positives = 75/127 (59%), Gaps = 5/127 (3%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-K 107
V +D F S + ++ D++G ++FGG+D + G + + P+S + YWQ V+S+ ++ +
Sbjct: 206 VSQDLF---SVYLSSDDDSGSVVIFGGIDSSYYSGSLNWVPLSSETYWQITVDSVILDGE 262
Query: 108 NVYCS-NCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
+ CS CQAI DTGTSL+ GP+ I+ + K IGA ++G V C ++PN+ +
Sbjct: 263 AIACSATCQAIVDTGTSLLAGPTTAISSIQKYIGASENSDGDMVVSCSAASELPNIIFTI 322
Query: 167 GGKNFTL 173
G + L
Sbjct: 323 NGVQYPL 329
Score = 62.8 bits (151), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 27/40 (67%), Positives = 34/40 (85%), Gaps = 2/40 (5%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+Y+GTI +GTP QEF VIFDTGSSNLW+PS +CS ++AC
Sbjct: 73 EYFGTIGIGTPAQEFTVIFDTGSSNLWVPSVYCS--SLAC 110
>gi|326933879|ref|XP_003213025.1| PREDICTED: gastricsin-like [Meleagris gallopavo]
Length = 390
Score = 87.0 bits (214), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 45/119 (37%), Positives = 70/119 (58%), Gaps = 5/119 (4%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV--YCSN-CQAI 117
RN + GGE++ GGVD F GDI ++PV+++ YWQ ++ I ++V +CS CQAI
Sbjct: 215 RNPTYNYGGELVLGGVDSRLFTGDIVWAPVTQELYWQVAIDEFAIGQSVMGWCSQGCQAI 274
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAK--VDCDNLDKMPNVDIILGGKNFTLT 174
DTGT L+ P + ++ L K +GA + G + VDC+ + +P + I+ G LT
Sbjct: 275 VDTGTFLLTVPQQYLSRLLKAVGAQETSYGVREYAVDCNVVHSLPTISFIINGVQLPLT 333
Score = 59.7 bits (143), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 25/42 (59%), Positives = 32/42 (76%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSI 42
M S + Y+G IS+GTPPQ F V+FDTGSSNLW+PS C++
Sbjct: 65 MASHLDSSYFGEISIGTPPQNFLVLFDTGSSNLWVPSTLCNM 106
>gi|360431|prf||1403354A pepsinogen
Length = 383
Score = 87.0 bits (214), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 43/106 (40%), Positives = 66/106 (62%), Gaps = 2/106 (1%)
Query: 68 GGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKN-VYCSN-CQAIADTGTSLI 125
G ++FGG+D+ F G I + PVS +GYWQ ++SI + K + CS+ CQAI DTGTSL+
Sbjct: 223 GSMVVFGGIDESYFTGSINWIPVSYQGYWQISMDSIIVNKQEIACSSGCQAIIDTGTSLV 282
Query: 126 IGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNF 171
GP+ I ++ +GA G V+C ++ MP+V ++GG +
Sbjct: 283 AGPASDINDIQSAVGANQNTYGEYSVNCSHILAMPDVVFVIGGIQY 328
Score = 63.2 bits (152), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 25/34 (73%), Positives = 31/34 (91%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCS 41
+YYGTIS+GTPPQ+F V+FDTGSSNLW+PS C+
Sbjct: 75 EYYGTISIGTPPQDFTVVFDTGSSNLWVPSVSCT 108
>gi|45384244|ref|NP_990385.1| embryonic pepsinogen precursor [Gallus gallus]
gi|129801|sp|P16476.1|PEPE_CHICK RecName: Full=Embryonic pepsinogen; Flags: Precursor
gi|222853|dbj|BAA00153.1| pepsinogen [Gallus gallus]
Length = 383
Score = 87.0 bits (214), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 43/106 (40%), Positives = 66/106 (62%), Gaps = 2/106 (1%)
Query: 68 GGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKN-VYCSN-CQAIADTGTSLI 125
G ++FGG+D+ F G I + PVS +GYWQ ++SI + K + CS+ CQAI DTGTSL+
Sbjct: 223 GSMVVFGGIDESYFTGSINWIPVSYQGYWQISMDSIIVNKQEIACSSGCQAIIDTGTSLV 282
Query: 126 IGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNF 171
GP+ I ++ +GA G V+C ++ MP+V ++GG +
Sbjct: 283 AGPASDINDIQSAVGANQNTYGEYSVNCSHILAMPDVVFVIGGIQY 328
Score = 63.2 bits (152), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 25/34 (73%), Positives = 31/34 (91%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCS 41
+YYGTIS+GTPPQ+F V+FDTGSSNLW+PS C+
Sbjct: 75 EYYGTISIGTPPQDFTVVFDTGSSNLWVPSVSCT 108
>gi|212526768|ref|XP_002143541.1| aspartic endopeptidase Pep2 [Talaromyces marneffei ATCC 18224]
gi|210072939|gb|EEA27026.1| aspartic endopeptidase Pep2 [Talaromyces marneffei ATCC 18224]
Length = 395
Score = 87.0 bits (214), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 56/150 (37%), Positives = 81/150 (54%), Gaps = 4/150 (2%)
Query: 26 FDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDI 85
FDT S N +P + ++LN S+D+ F + N + + E FGG+DK + G++
Sbjct: 193 FDTISVNKIVPPFY-NMLN-QKSLDEPVFAFYLGDSN-KEGDASEATFGGIDKSHYTGEL 249
Query: 86 TYSPVSRKGYWQFGVESIKIEKNVY-CSNCQAIADTGTSLIIGPSKVIAELNKLIGAVPL 144
P+ RK YW+ ++I +NV N I DTGTSLI PS + LNK IGA
Sbjct: 250 VKIPLRRKAYWEVDFDAIAFGENVAELENTGVILDTGTSLIALPSTLAELLNKEIGASKS 309
Query: 145 ANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
NG VDC D +P++ + L G NF++T
Sbjct: 310 WNGQYTVDCAKRDSLPDLTVTLSGHNFSIT 339
Score = 65.5 bits (158), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 38/86 (44%), Positives = 53/86 (61%), Gaps = 8/86 (9%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDK--DKFVGHSHFRNAS 64
AQY+ I++GTPPQ FKV+ DTGSSNLW+PS C+ +IAC + D ++ +N S
Sbjct: 80 AQYFSEITIGTPPQNFKVVLDTGSSNLWVPSSSCN--SIACYLHSKYDSSSSSTYKKNGS 137
Query: 65 D---ENGGEIMFGGVDKDKF-VGDIT 86
D + G + G V +D +GDIT
Sbjct: 138 DFAIQYGSGSLEGFVSRDTVTIGDIT 163
>gi|312282703|dbj|BAJ34217.1| unnamed protein product [Thellungiella halophila]
Length = 506
Score = 87.0 bits (214), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 45/101 (44%), Positives = 62/101 (61%), Gaps = 3/101 (2%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYC-SNCQAI 117
RNA D+ GGE++FGGVD + F G TY PV++KGYWQF + + I +C S C AI
Sbjct: 226 RNAEDDEGGELVFGGVDPNHFKGKHTYVPVTQKGYWQFDMGDVLIGNAPTGFCESGCSAI 285
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDK 158
AD+GTSL+ GP+ +I +N IGA + + K D +
Sbjct: 286 ADSGTSLLAGPTTIITMINHAIGAAGVVSQQCKTVVDQYGR 326
Score = 67.0 bits (162), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 28/47 (59%), Positives = 37/47 (78%), Gaps = 1/47 (2%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+++ AQYYG I++GTPPQ+F V+FDTGSSNLW+PS C +IAC
Sbjct: 74 LKNYLDAQYYGEIAIGTPPQKFTVVFDTGSSNLWVPSSKC-YFSIAC 119
Score = 38.9 bits (89), Expect = 0.71, Method: Compositional matrix adjust.
Identities = 44/184 (23%), Positives = 73/184 (39%), Gaps = 23/184 (12%)
Query: 11 GTISLGTPPQEF-----KVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASD 65
G + +G P F I D+G+S L P+ +++N A V + D
Sbjct: 266 GDVLIGNAPTGFCESGCSAIADSGTSLLAGPTTIITMINHAIGAAG---VVSQQCKTVVD 322
Query: 66 ENGG---EIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV---------YCSN 113
+ G E++ K I + K G+ES+ ++N CS
Sbjct: 323 QYGRTILELLLSETQPKKICSQIGLCTFNGKRGVSMGIESVVDKENAKLSNGVGDAACSA 382
Query: 114 CQ-AIADTGTSLI--IGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKN 170
C+ A+ + L + +++ N+L +P G + VDC L MP V + +GGK
Sbjct: 383 CEMAVVWIQSQLRQNMTQERILDYANELCERLPSPMGESAVDCAQLSTMPTVSLTIGGKV 442
Query: 171 FTLT 174
F L
Sbjct: 443 FDLA 446
>gi|129780|sp|P27677.1|PEPA2_MACFU RecName: Full=Pepsin A-2/A-3; AltName: Full=Pepsin III-2/III-1;
Flags: Precursor
gi|38069|emb|CAA42427.1| prepropepsin a [Macaca fuscata]
Length = 388
Score = 87.0 bits (214), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 45/125 (36%), Positives = 73/125 (58%), Gaps = 5/125 (4%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-K 107
V +D F S + +A D++G ++FGG+D + G + + PVS +GYWQ V+SI + +
Sbjct: 208 VSQDLF---SVYLSADDQSGSVVIFGGIDSSYYTGSLNWVPVSVEGYWQISVDSITMNGE 264
Query: 108 NVYCSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
+ C+ CQAI DTGTSL+ GP+ IA + IGA ++G V C + +P++ +
Sbjct: 265 AIACAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASENSDGEMVVSCSAISSLPDIVFTI 324
Query: 167 GGKNF 171
G +
Sbjct: 325 NGIQY 329
Score = 60.8 bits (146), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 26/41 (63%), Positives = 35/41 (85%), Gaps = 2/41 (4%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACS 48
+Y+GTI +GTP Q+F VIFDTGSSNLW+PS +CS ++AC+
Sbjct: 75 EYFGTIGIGTPAQDFTVIFDTGSSNLWVPSVYCS--SLACT 113
>gi|253762221|gb|ACT35562.1| pepsinogen C precursor [Siniperca chuatsi]
Length = 387
Score = 87.0 bits (214), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 39/119 (32%), Positives = 73/119 (61%), Gaps = 3/119 (2%)
Query: 58 SHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCSN-C 114
+ + ++ ++ G E+ FGGVD + G I ++PV+ + +WQ GV+ +I +++ +CS C
Sbjct: 207 AFYLSSGEQQGSELSFGGVDSSMYQGQIYWTPVTSETHWQIGVQGFQINGQESGWCSQGC 266
Query: 115 QAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
Q+I DTGTS++ PS+++ + + IGA G VDC ++ +P + ++ G +F L
Sbjct: 267 QSIVDTGTSMLTAPSQLLGYIMQAIGAQQSQYGMYMVDCSQVNNLPTLTFVISGVSFPL 325
Score = 56.6 bits (135), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 22/32 (68%), Positives = 28/32 (87%)
Query: 9 YYGTISLGTPPQEFKVIFDTGSSNLWIPSQHC 40
YYG IS+GTPPQ F+V+FDTGS+NLW+ S +C
Sbjct: 69 YYGAISIGTPPQSFQVLFDTGSANLWVDSVYC 100
>gi|402893203|ref|XP_003909790.1| PREDICTED: pepsin A-2/A-3-like [Papio anubis]
Length = 388
Score = 87.0 bits (214), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 45/125 (36%), Positives = 73/125 (58%), Gaps = 5/125 (4%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-K 107
V +D F S + +A D++G ++FGG+D + G + + PVS +GYWQ V+SI + +
Sbjct: 208 VSQDLF---SVYLSADDQSGSVVIFGGIDSSYYTGSLNWVPVSVEGYWQISVDSITMNGE 264
Query: 108 NVYCSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
+ C+ CQAI DTGTSL+ GP+ IA + IGA ++G V C + +P++ +
Sbjct: 265 AIACAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASENSDGEMVVSCSAISSLPDIVFTI 324
Query: 167 GGKNF 171
G +
Sbjct: 325 NGIQY 329
Score = 61.2 bits (147), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 27/44 (61%), Positives = 36/44 (81%), Gaps = 2/44 (4%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDK 51
+Y+GTI +GTP Q+F VIFDTGSSNLW+PS +CS ++AC+ K
Sbjct: 75 EYFGTIGIGTPAQDFTVIFDTGSSNLWVPSVYCS--SLACTNHK 116
>gi|301625941|ref|XP_002942158.1| PREDICTED: pepsin A [Xenopus (Silurana) tropicalis]
Length = 384
Score = 87.0 bits (214), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 44/121 (36%), Positives = 71/121 (58%), Gaps = 2/121 (1%)
Query: 58 SHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVY-CS-NCQ 115
S + ++ ++G ++FGGVD + G + + P++ + YWQ ++SI I V CS +CQ
Sbjct: 210 SVYLSSDGQSGSYVLFGGVDTSYYSGSLNWVPLTAETYWQIILDSISINGQVIACSQSCQ 269
Query: 116 AIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLTR 175
AI DTGTSL+ GP+ IA + IGA +NG ++C+N+ MP + + G + L
Sbjct: 270 AIVDTGTSLMTGPTTPIANIQYYIGASQDSNGQYVINCNNISNMPTIVFTINGVQYPLPP 329
Query: 176 T 176
T
Sbjct: 330 T 330
Score = 66.6 bits (161), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 34/76 (44%), Positives = 47/76 (61%), Gaps = 3/76 (3%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASDEN 67
+YYGTIS+GTPPQEF VIFDTGS+NLW+PS +CS + AC+ + ++F +
Sbjct: 71 EYYGTISIGTPPQEFTVIFDTGSANLWVPSVYCS--SSACT-NHNRFNPQQSTTFQATNT 127
Query: 68 GGEIMFGGVDKDKFVG 83
I +G F+G
Sbjct: 128 PVSIQYGTGSMSGFLG 143
>gi|297849560|ref|XP_002892661.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297338503|gb|EFH68920.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 506
Score = 87.0 bits (214), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 45/98 (45%), Positives = 61/98 (62%), Gaps = 3/98 (3%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYC-SNCQAI 117
RNA +E GGE++FGGVD + F G TY PV++KGYWQF + + I +C S C AI
Sbjct: 226 RNADEEEGGELVFGGVDPNHFKGKHTYVPVTQKGYWQFDMGDVLIGGAPTGFCESGCSAI 285
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDN 155
AD+GTSL+ GP+ +I +N IGA + + K D
Sbjct: 286 ADSGTSLLAGPTTIITMINHAIGAAGVVSQQCKTVVDQ 323
Score = 66.2 bits (160), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 27/47 (57%), Positives = 37/47 (78%), Gaps = 1/47 (2%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+++ AQYYG I++GTPPQ+F V+FDTGSSNLW+PS C ++AC
Sbjct: 74 LKNYLDAQYYGEIAIGTPPQKFTVVFDTGSSNLWVPSSKC-YFSLAC 119
Score = 36.6 bits (83), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 16/45 (35%), Positives = 25/45 (55%)
Query: 130 KVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
+++ +N+L +P G + VDC L MP V + +GGK F L
Sbjct: 402 RILNYVNELCERLPSPMGESAVDCAQLSTMPTVSLTIGGKVFDLA 446
>gi|229576947|ref|NP_001153272.1| pepsinogen A precursor [Pongo abelii]
gi|222425188|dbj|BAH20543.1| pepsinogen A-19 [Pongo abelii]
gi|222425190|dbj|BAH20544.1| pepsinogen A-13 [Pongo abelii]
gi|222425204|dbj|BAH20551.1| pepsinogen A-41 [Pongo abelii]
Length = 388
Score = 87.0 bits (214), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 45/127 (35%), Positives = 74/127 (58%), Gaps = 5/127 (3%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-K 107
V +D F S + +A D++G ++FGG+D + G + + PV+ +GYWQ V+SI + +
Sbjct: 208 VSQDLF---SVYLSADDQSGSVVIFGGIDSSYYTGSLNWVPVTVEGYWQITVDSITMNGE 264
Query: 108 NVYCSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
+ C+ CQAI DTGTSL+ GP+ IA + IGA ++G V C + +P++ +
Sbjct: 265 AIACAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASENSDGDMVVSCSAISSLPDIVFTI 324
Query: 167 GGKNFTL 173
G + L
Sbjct: 325 NGVQYPL 331
Score = 60.8 bits (146), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 26/41 (63%), Positives = 35/41 (85%), Gaps = 2/41 (4%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACS 48
+Y+GTI +GTP Q+F VIFDTGSSNLW+PS +CS ++AC+
Sbjct: 75 EYFGTIGIGTPAQDFTVIFDTGSSNLWVPSVYCS--SLACT 113
>gi|323335315|gb|EGA76604.1| Pep4p [Saccharomyces cerevisiae Vin13]
Length = 368
Score = 87.0 bits (214), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 48/126 (38%), Positives = 69/126 (54%), Gaps = 1/126 (0%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI-EK 107
+D+ +F + + ENGGE FGG+D+ KF GDIT+ PV RK YW+ E I + ++
Sbjct: 186 LDEKRFAFYLGDTSKDTENGGEATFGGIDESKFKGDITWLPVRRKAYWEVKFEGIGLGDE 245
Query: 108 NVYCSNCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILG 167
+ A DTGTSLI PS + +N IGA G +DC+ D +P++
Sbjct: 246 YAELESHGAAIDTGTSLITLPSGLAEMINAEIGAKKGWTGQYTLDCNTRDNLPDLIFNFN 305
Query: 168 GKNFTL 173
G NFT+
Sbjct: 306 GYNFTI 311
Score = 63.5 bits (153), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 36/84 (42%), Positives = 47/84 (55%), Gaps = 4/84 (4%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASD- 65
AQYY I+LGTPPQ FKVI DTGSSNLW+PS C L D S+ N ++
Sbjct: 52 AQYYTDITLGTPPQNFKVILDTGSSNLWVPSNECGSLACFLHSKYDHEASSSYKANGTEF 111
Query: 66 --ENGGEIMFGGVDKDKF-VGDIT 86
+ G + G + +D +GD+T
Sbjct: 112 AIQYGTGSLEGYISQDTLSIGDLT 135
>gi|149725185|ref|XP_001501907.1| PREDICTED: pepsin A-like [Equus caballus]
Length = 387
Score = 87.0 bits (214), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 45/128 (35%), Positives = 73/128 (57%), Gaps = 5/128 (3%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-K 107
V +D F S + ++ DE+G ++F G+D + G + + PVS + YWQ V+SI + +
Sbjct: 207 VSQDLF---SVYLSSDDESGSMVIFSGIDSSYYSGSLCWVPVSEEAYWQITVDSITMNGE 263
Query: 108 NVYCSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
++ CS CQAI DTGTSL+ GP I + IGA + A + C ++D +P++ +
Sbjct: 264 SIACSGGCQAIVDTGTSLLAGPPSAIDNIQSYIGASEDYSSEAVISCSSIDSLPDIVFTI 323
Query: 167 GGKNFTLT 174
G F L+
Sbjct: 324 NGVEFPLS 331
Score = 70.9 bits (172), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 32/47 (68%), Positives = 41/47 (87%), Gaps = 3/47 (6%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKF 54
+Y+GTIS+GTPPQEF VIFDTGSSNLW+PS +CS ++ACS D ++F
Sbjct: 74 EYFGTISIGTPPQEFTVIFDTGSSNLWVPSTYCS--SLACS-DHNRF 117
>gi|121705756|ref|XP_001271141.1| aspartic endopeptidase Pep2 [Aspergillus clavatus NRRL 1]
gi|119399287|gb|EAW09715.1| aspartic endopeptidase Pep2 [Aspergillus clavatus NRRL 1]
Length = 398
Score = 87.0 bits (214), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 45/102 (44%), Positives = 60/102 (58%), Gaps = 1/102 (0%)
Query: 73 FGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVY-CSNCQAIADTGTSLIIGPSKV 131
FGG+DKD F G++T P+ RK YW+ +++I + NV N I DTGTSLI PS +
Sbjct: 240 FGGIDKDHFTGELTKIPLRRKAYWEVDLDAIALGDNVAELDNTGVILDTGTSLIALPSTL 299
Query: 132 IAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
LNK IGA G V+CD D +P++ L G NFT+
Sbjct: 300 ADLLNKEIGAKKGFTGQYSVECDKRDSLPDLTFTLSGHNFTI 341
Score = 63.2 bits (152), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 27/41 (65%), Positives = 32/41 (78%), Gaps = 2/41 (4%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQY+ I LGTPPQ+FKV+ DTGSSNLW+PS C +IAC
Sbjct: 83 AQYFSEIELGTPPQKFKVVLDTGSSNLWVPSSECG--SIAC 121
>gi|207340638|gb|EDZ68928.1| YPL154Cp-like protein [Saccharomyces cerevisiae AWRI1631]
Length = 385
Score = 86.7 bits (213), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 48/126 (38%), Positives = 69/126 (54%), Gaps = 1/126 (0%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI-EK 107
+D+ +F + + ENGGE FGG+D+ KF GDIT+ PV RK YW+ E I + ++
Sbjct: 223 LDEKRFAFYLGDTSKDTENGGEATFGGIDESKFKGDITWLPVRRKAYWEVKFEGIGLGDE 282
Query: 108 NVYCSNCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILG 167
+ A DTGTSLI PS + +N IGA G +DC+ D +P++
Sbjct: 283 YAELESHGAAIDTGTSLITLPSGLAEMINAEIGAKKGWTGQYTLDCNTRDNLPDLIFNFN 342
Query: 168 GKNFTL 173
G NFT+
Sbjct: 343 GYNFTI 348
Score = 63.5 bits (153), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 36/84 (42%), Positives = 47/84 (55%), Gaps = 4/84 (4%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASD- 65
AQYY I+LGTPPQ FKVI DTGSSNLW+PS C L D S+ N ++
Sbjct: 89 AQYYTDITLGTPPQNFKVILDTGSSNLWVPSNECGSLACFLHSKYDHEASSSYKANGTEF 148
Query: 66 --ENGGEIMFGGVDKDKF-VGDIT 86
+ G + G + +D +GD+T
Sbjct: 149 AIQYGTGSLEGYISQDTLSIGDLT 172
>gi|19851896|gb|AAL99909.1|AF421164_1 chymosin precursor [Bos taurus]
Length = 297
Score = 86.7 bits (213), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 49/125 (39%), Positives = 64/125 (51%), Gaps = 4/125 (3%)
Query: 54 FVGHSHFRNASDENGGEIMF--GGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYC 111
V F D NG E M G +D + G + + PV+ + YWQF V+S+ I V
Sbjct: 121 LVAQDLFSVYMDRNGQESMLTLGAIDPSYYTGSLHWVPVTVQQYWQFTVDSVTISGVVVA 180
Query: 112 SN--CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGK 169
CQAI DTGTS ++GPS I + + IGA G +DCDNL MP V + GK
Sbjct: 181 CEGGCQAILDTGTSKLVGPSSDILNIQQAIGATQNQYGEFDIDCDNLSYMPTVVFEINGK 240
Query: 170 NFTLT 174
+ LT
Sbjct: 241 MYPLT 245
>gi|12043774|gb|AAG47643.1|AF275939_1 progastricsin [Salvelinus fontinalis]
Length = 387
Score = 86.7 bits (213), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 52/168 (30%), Positives = 85/168 (50%), Gaps = 7/168 (4%)
Query: 13 ISLGTPPQEFKVIFDTGSSNLWIPS----QHCSILNIACSVDKDKFVGHSHFRNASDENG 68
+S P Q F V G L PS Q +++ S + + + + + G
Sbjct: 158 LSTDEPGQTFVVAQFDGILGLSYPSISAGQETPVMDNIMSQNLLQANLFAFYLTRDGQQG 217
Query: 69 GEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCSN-CQAIADTGTSLI 125
E+ FG VD K+ G I ++PV+ + YWQ GV+ +I ++ +C CQAI DTGTS++
Sbjct: 218 SELSFGEVDNTKYQGQIYWTPVTSQTYWQIGVQGFQINGQETGWCGQGCQAIVDTGTSML 277
Query: 126 IGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
PS+++ L + IGA G V+C+ ++ +P + + G NF L
Sbjct: 278 TAPSQIMGTLMQSIGAQQDQYGQYTVNCNQMNSLPTLTFTINGINFPL 325
Score = 53.5 bits (127), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 21/33 (63%), Positives = 28/33 (84%)
Query: 9 YYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCS 41
YYG I++GTPPQ F+V+FDTGS+NLW+ S C+
Sbjct: 69 YYGAITIGTPPQSFQVLFDTGSANLWVDSVLCN 101
>gi|221048011|gb|ACL98113.1| pepsinogen [Epinephelus coioides]
Length = 311
Score = 86.7 bits (213), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 41/105 (39%), Positives = 64/105 (60%), Gaps = 2/105 (1%)
Query: 58 SHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-KNVYCSN-CQ 115
S + ++ E G E++FGG+D + G +T+ P++ YWQ ++ +KI + V C+ CQ
Sbjct: 206 SVYLSSHGEQGSEVVFGGIDSSHYTGQVTWVPLTSATYWQIKMDGVKINGQTVACAGGCQ 265
Query: 116 AIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMP 160
AI DTGTSLI+GP+ I +N +GA G + V+C N+ MP
Sbjct: 266 AIIDTGTSLIVGPTNDINNMNSWVGASTNQYGESTVNCQNVGSMP 310
Score = 62.8 bits (151), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 26/33 (78%), Positives = 29/33 (87%)
Query: 9 YYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCS 41
YYG IS+GTPPQ F VIFDTGSSNLW+PS +CS
Sbjct: 69 YYGVISIGTPPQSFSVIFDTGSSNLWVPSVYCS 101
>gi|289742705|gb|ADD20100.1| aspartyl protease [Glossina morsitans morsitans]
Length = 381
Score = 86.7 bits (213), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 47/125 (37%), Positives = 65/125 (52%), Gaps = 1/125 (0%)
Query: 51 KDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVY 110
K+ + RN S GGE+ GGV+ D F G + Y +SRK YWQF ++S + K
Sbjct: 198 KENVFSFYYLRNGSTRLGGELTLGGVNHDHFEGQLHYVDISRKFYWQFEMKSAHVNKRQV 257
Query: 111 CSNCQAIADTGTSLIIGPSKVIAELNKLIGA-VPLANGPAKVDCDNLDKMPNVDIILGGK 169
C CQ IADTG + I P +N+ IGA + VDC +D++P V +G
Sbjct: 258 CRKCQVIADTGCNGIGLPRDDYQRVNRAIGATIDPETFQYVVDCGKIDQLPEVTFKIGDG 317
Query: 170 NFTLT 174
FTL+
Sbjct: 318 VFTLS 322
Score = 50.8 bits (120), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 20/39 (51%), Positives = 28/39 (71%)
Query: 9 YYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+YG I++G PPQ+F V+FDTGS LW+P + C+ N C
Sbjct: 63 FYGKIAIGKPPQDFLVLFDTGSHLLWVPLKPCTQDNETC 101
>gi|194759254|ref|XP_001961864.1| GF15183 [Drosophila ananassae]
gi|190615561|gb|EDV31085.1| GF15183 [Drosophila ananassae]
Length = 423
Score = 86.7 bits (213), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 49/124 (39%), Positives = 75/124 (60%), Gaps = 4/124 (3%)
Query: 54 FVGHSHFRNASD-ENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKN-VYC 111
+ S RNASD NGG+++ GG D + G +TY P+S GYWQ V S + N V C
Sbjct: 225 IISLSLTRNASDPSNGGKLLLGGSDPTLYSGCLTYVPLSVVGYWQITVGSFSLGCNKVIC 284
Query: 112 SNCQAIADTGTSLIIGPSKVIAELNKLIG--AVPLANGPAKVDCDNLDKMPNVDIILGGK 169
S +AI D GTSLI+ PS + ++NK++G + +G ++C+ + +P++ I +G K
Sbjct: 285 SKFEAIIDAGTSLIVVPSCALKKINKILGIKSSDKRDGVYTINCNKVSSLPDITINIGRK 344
Query: 170 NFTL 173
+FTL
Sbjct: 345 DFTL 348
Score = 47.8 bits (112), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 19/37 (51%), Positives = 26/37 (70%), Gaps = 2/37 (5%)
Query: 4 SYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHC 40
SY ++YG I +G Q FK++FDT S+NLW+PS C
Sbjct: 81 SYNTEFYGDIQIGC--QSFKILFDTASANLWVPSSQC 115
>gi|440909656|gb|ELR59541.1| Chymosin [Bos grunniens mutus]
Length = 381
Score = 86.7 bits (213), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 49/125 (39%), Positives = 64/125 (51%), Gaps = 4/125 (3%)
Query: 54 FVGHSHFRNASDENGGEIMF--GGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYC 111
V F D NG E M G +D + G + + PV+ + YWQF V+S+ I V
Sbjct: 205 LVAQDLFSVYMDRNGQESMLTLGAIDPSYYTGSLHWVPVTVQQYWQFTVDSVTISGVVVA 264
Query: 112 SN--CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGK 169
CQAI DTGTS ++GPS I + + IGA G +DCDNL MP V + GK
Sbjct: 265 CEGGCQAILDTGTSKLVGPSSDILNIQQAIGATQNQYGEFDIDCDNLSYMPTVVFEINGK 324
Query: 170 NFTLT 174
+ LT
Sbjct: 325 MYPLT 329
Score = 59.3 bits (142), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 23/34 (67%), Positives = 29/34 (85%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHC 40
+QY+G I LGTPPQEF V+FDTGSS+ W+PS +C
Sbjct: 72 SQYFGKIYLGTPPQEFTVLFDTGSSDFWVPSIYC 105
>gi|195399281|ref|XP_002058249.1| GJ15984 [Drosophila virilis]
gi|194150673|gb|EDW66357.1| GJ15984 [Drosophila virilis]
Length = 423
Score = 86.7 bits (213), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 45/111 (40%), Positives = 64/111 (57%), Gaps = 6/111 (5%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSNCQAIADT 120
RNAS + GGE++FGG D F G++ Y VS + YWQF V++I + V CS CQA+ADT
Sbjct: 228 RNASSDFGGELIFGGSDHSLFAGNMVYVDVSTQDYWQFEVDNITMNGQVLCSQCQAVADT 287
Query: 121 GTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNF 171
GTSLI+ P+ +LI + + +DC P + + +GG F
Sbjct: 288 GTSLILAPTAAF----ELIESQLDIDADGLIDCTR--TYPTLKLAIGGVIF 332
Score = 62.4 bits (150), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 26/47 (55%), Positives = 36/47 (76%), Gaps = 2/47 (4%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+ ++ +YYG I++GTPPQ FKV FDTGSSNLW+PS C+ ++AC
Sbjct: 77 LSNTMNMEYYGAITIGTPPQVFKVQFDTGSSNLWVPSNQCT--SLAC 121
>gi|157836785|pdb|3CMS|A Chain A, Engineering Enzyme Sub-Site Specificity: Preparation,
Kinetic Characterization And X-Ray Analysis At
2.0-Angstroms Resolution Of Val111phe Site-Mutated Calf
Chymosin
Length = 323
Score = 86.7 bits (213), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 49/126 (38%), Positives = 64/126 (50%), Gaps = 4/126 (3%)
Query: 53 KFVGHSHFRNASDENGGEIMF--GGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVY 110
V F D NG E M G +D + G + + PV+ + YWQF V+S+ I V
Sbjct: 146 HLVAQDLFSVYMDRNGQESMLTLGAIDPSYYTGSLHWVPVTVQQYWQFTVDSVTISGVVV 205
Query: 111 CSN--CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGG 168
CQAI DTGTS ++GPS I + + IGA G +DCDNL MP V + G
Sbjct: 206 ACEGGCQAILDTGTSKLVGPSSDILNIQQAIGATQNQYGEFDIDCDNLSYMPTVVFEING 265
Query: 169 KNFTLT 174
K + LT
Sbjct: 266 KMYPLT 271
Score = 58.9 bits (141), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 23/34 (67%), Positives = 29/34 (85%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHC 40
+QY+G I LGTPPQEF V+FDTGSS+ W+PS +C
Sbjct: 14 SQYFGKIYLGTPPQEFTVLFDTGSSDFWVPSIYC 47
>gi|60391920|sp|P00794.3|CHYM_BOVIN RecName: Full=Chymosin; AltName: Full=Preprorennin; Flags:
Precursor
gi|155967150|gb|ABU41411.1| preprochymosin [Bos taurus]
gi|225196434|gb|ACN82375.1| chymosin precursor [Bos taurus]
gi|296489363|tpg|DAA31476.1| TPA: chymosin precursor [Bos taurus]
Length = 381
Score = 86.7 bits (213), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 49/125 (39%), Positives = 64/125 (51%), Gaps = 4/125 (3%)
Query: 54 FVGHSHFRNASDENGGEIMF--GGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYC 111
V F D NG E M G +D + G + + PV+ + YWQF V+S+ I V
Sbjct: 205 LVAQDLFSVYMDRNGQESMLTLGAIDPSYYTGSLHWVPVTVQQYWQFTVDSVTISGVVVA 264
Query: 112 SN--CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGK 169
CQAI DTGTS ++GPS I + + IGA G +DCDNL MP V + GK
Sbjct: 265 CEGGCQAILDTGTSKLVGPSSDILNIQQAIGATQNQYGEFDIDCDNLSYMPTVVFEINGK 324
Query: 170 NFTLT 174
+ LT
Sbjct: 325 MYPLT 329
Score = 59.3 bits (142), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 23/34 (67%), Positives = 29/34 (85%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHC 40
+QY+G I LGTPPQEF V+FDTGSS+ W+PS +C
Sbjct: 72 SQYFGKIYLGTPPQEFTVLFDTGSSDFWVPSIYC 105
>gi|222425198|dbj|BAH20548.1| pepsinogen A-36 [Pongo abelii]
Length = 388
Score = 86.7 bits (213), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 45/127 (35%), Positives = 74/127 (58%), Gaps = 5/127 (3%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-K 107
V +D F S + +A D++G ++FGG+D + G + + PV+ +GYWQ V+SI + +
Sbjct: 208 VSQDLF---SVYLSADDQSGSVVIFGGIDSSYYTGSLNWVPVTVEGYWQITVDSITMNGE 264
Query: 108 NVYCSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
+ C+ CQAI DTGTSL+ GP+ IA + IGA ++G V C + +P++ +
Sbjct: 265 AIACAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASENSDGDMVVSCSAISSLPDIVFTI 324
Query: 167 GGKNFTL 173
G + L
Sbjct: 325 NGVQYPL 331
Score = 59.7 bits (143), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 25/41 (60%), Positives = 35/41 (85%), Gaps = 2/41 (4%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACS 48
+Y+G+I +GTP Q+F VIFDTGSSNLW+PS +CS ++AC+
Sbjct: 75 EYFGSIGIGTPAQDFTVIFDTGSSNLWVPSVYCS--SLACT 113
>gi|6325103|ref|NP_015171.1| Pep4p [Saccharomyces cerevisiae S288c]
gi|115643|sp|P07267.1|CARP_YEAST RecName: Full=Saccharopepsin; AltName: Full=Aspartate protease;
Short=PrA; Short=Proteinase A; AltName:
Full=Carboxypeptidase Y-deficient protein 4; AltName:
Full=Proteinase YSCA; Flags: Precursor
gi|172122|gb|AAB63975.1| vacuolar proteinase A precursor [Saccharomyces cerevisiae]
gi|1370328|emb|CAA97859.1| PEP4 [Saccharomyces cerevisiae]
gi|1403555|emb|CAA65567.1| P2585 protein [Saccharomyces cerevisiae]
gi|151942645|gb|EDN60991.1| vacuolar proteinase A [Saccharomyces cerevisiae YJM789]
gi|190407806|gb|EDV11071.1| vacuolar proteinase A [Saccharomyces cerevisiae RM11-1a]
gi|259150002|emb|CAY86805.1| Pep4p [Saccharomyces cerevisiae EC1118]
gi|285815388|tpg|DAA11280.1| TPA: Pep4p [Saccharomyces cerevisiae S288c]
gi|323302701|gb|EGA56507.1| Pep4p [Saccharomyces cerevisiae FostersB]
gi|323331178|gb|EGA72596.1| Pep4p [Saccharomyces cerevisiae AWRI796]
gi|323346153|gb|EGA80443.1| Pep4p [Saccharomyces cerevisiae Lalvin QA23]
gi|323351977|gb|EGA84516.1| Pep4p [Saccharomyces cerevisiae VL3]
gi|365762755|gb|EHN04288.1| Pep4p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
gi|392295854|gb|EIW06957.1| Pep4p [Saccharomyces cerevisiae CEN.PK113-7D]
Length = 405
Score = 86.7 bits (213), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 48/126 (38%), Positives = 69/126 (54%), Gaps = 1/126 (0%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI-EK 107
+D+ +F + + ENGGE FGG+D+ KF GDIT+ PV RK YW+ E I + ++
Sbjct: 223 LDEKRFAFYLGDTSKDTENGGEATFGGIDESKFKGDITWLPVRRKAYWEVKFEGIGLGDE 282
Query: 108 NVYCSNCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILG 167
+ A DTGTSLI PS + +N IGA G +DC+ D +P++
Sbjct: 283 YAELESHGAAIDTGTSLITLPSGLAEMINAEIGAKKGWTGQYTLDCNTRDNLPDLIFNFN 342
Query: 168 GKNFTL 173
G NFT+
Sbjct: 343 GYNFTI 348
Score = 63.9 bits (154), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 36/84 (42%), Positives = 47/84 (55%), Gaps = 4/84 (4%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASD- 65
AQYY I+LGTPPQ FKVI DTGSSNLW+PS C L D S+ N ++
Sbjct: 89 AQYYTDITLGTPPQNFKVILDTGSSNLWVPSNECGSLACFLHSKYDHEASSSYKANGTEF 148
Query: 66 --ENGGEIMFGGVDKDKF-VGDIT 86
+ G + G + +D +GD+T
Sbjct: 149 AIQYGTGSLEGYISQDTLSIGDLT 172
>gi|2098518|pdb|1CZI|E Chain E, Chymosin Complex With The Inhibitor Cp-113972
gi|157830643|pdb|1CMS|A Chain A, The Three-Dimensional Structure Of Recombinant Bovine
Chymosin At 2.3 Angstroms Resolution
gi|157836930|pdb|4CMS|A Chain A, X-Ray Analyses Of Aspartic Proteinases Iv. Structure And
Refinement At 2.2 Angstroms Resolution Of Bovine
Chymosin
gi|429544289|pdb|4AA8|A Chain A, Bovine Chymosin At 1.8a Resolution
Length = 323
Score = 86.7 bits (213), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 49/126 (38%), Positives = 64/126 (50%), Gaps = 4/126 (3%)
Query: 53 KFVGHSHFRNASDENGGEIMF--GGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVY 110
V F D NG E M G +D + G + + PV+ + YWQF V+S+ I V
Sbjct: 146 HLVAQDLFSVYMDRNGQESMLTLGAIDPSYYTGSLHWVPVTVQQYWQFTVDSVTISGVVV 205
Query: 111 CSN--CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGG 168
CQAI DTGTS ++GPS I + + IGA G +DCDNL MP V + G
Sbjct: 206 ACEGGCQAILDTGTSKLVGPSSDILNIQQAIGATQNQYGEFDIDCDNLSYMPTVVFEING 265
Query: 169 KNFTLT 174
K + LT
Sbjct: 266 KMYPLT 271
Score = 58.9 bits (141), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 23/34 (67%), Positives = 29/34 (85%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHC 40
+QY+G I LGTPPQEF V+FDTGSS+ W+PS +C
Sbjct: 14 SQYFGKIYLGTPPQEFTVLFDTGSSDFWVPSIYC 47
>gi|222425194|dbj|BAH20546.1| pepsinogen A-28 [Pongo abelii]
gi|222425196|dbj|BAH20547.1| pepsinogen A-17 [Pongo abelii]
gi|222425202|dbj|BAH20550.1| pepsinogen A-71 [Pongo abelii]
Length = 388
Score = 86.7 bits (213), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 45/127 (35%), Positives = 74/127 (58%), Gaps = 5/127 (3%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-K 107
V +D F S + +A D++G ++FGG+D + G + + PV+ +GYWQ V+SI + +
Sbjct: 208 VSQDLF---SVYLSADDQSGSVVIFGGIDSSYYTGSLNWVPVTVEGYWQITVDSITMNGE 264
Query: 108 NVYCSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
+ C+ CQAI DTGTSL+ GP+ IA + IGA ++G V C + +P++ +
Sbjct: 265 AIACAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASENSDGDMVVSCSAISSLPDIVFTI 324
Query: 167 GGKNFTL 173
G + L
Sbjct: 325 NGVQYPL 331
Score = 59.7 bits (143), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 25/41 (60%), Positives = 35/41 (85%), Gaps = 2/41 (4%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACS 48
+Y+G+I +GTP Q+F VIFDTGSSNLW+PS +CS ++AC+
Sbjct: 75 EYFGSIGIGTPAQDFTVIFDTGSSNLWVPSVYCS--SLACT 113
>gi|440633873|gb|ELR03792.1| vacuolar protease A [Geomyces destructans 20631-21]
Length = 395
Score = 86.7 bits (213), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 54/150 (36%), Positives = 76/150 (50%), Gaps = 7/150 (4%)
Query: 26 FDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDI 85
FDT S N +P + I +D+ F F A D++ E +FGG+DK + GD+
Sbjct: 196 FDTISVNKVVPPFYSMIDQ--GLLDEKVF----SFYLADDKSQSEAVFGGIDKSHYTGDL 249
Query: 86 TYSPVSRKGYWQFGVESIKI-EKNVYCSNCQAIADTGTSLIIGPSKVIAELNKLIGAVPL 144
TY P+ RK YW+ ++I + N I DTGTSL PS + LNK IGA
Sbjct: 250 TYIPLRRKAYWEVDFDAISFGDVKADLDNTGVILDTGTSLNTLPSSLAELLNKEIGAKKG 309
Query: 145 ANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
NG +DC D +P++ L G +F L+
Sbjct: 310 YNGQYTIDCKKRDDLPDITFTLAGHDFALS 339
Score = 61.6 bits (148), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 26/41 (63%), Positives = 32/41 (78%), Gaps = 2/41 (4%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQY+ I++G PPQ FKV+ DTGSSNLW+PSQ C +IAC
Sbjct: 83 AQYFSEITIGNPPQTFKVVLDTGSSNLWVPSQSCG--SIAC 121
>gi|15221141|ref|NP_172655.1| aspartic proteinase A1 [Arabidopsis thaliana]
gi|75318541|sp|O65390.1|APA1_ARATH RecName: Full=Aspartic proteinase A1; Flags: Precursor
gi|3157937|gb|AAC17620.1| Identical to aspartic proteinase cDNA gb|U51036 from A. thaliana.
ESTs gb|N96313, gb|T21893, gb|R30158, gb|T21482,
gb|T43650, gb|R64749, gb|R65157, gb|T88269, gb|T44552,
gb|T22542, gb|T76533, gb|T44350, gb|Z34591, gb|AA728734,
gb|T46003, gb|R65157, gb|N38290, gb|AA395468, gb|T20815
and gb|Z34173 come from this gene [Arabidopsis thaliana]
gi|15912219|gb|AAL08243.1| At1g11910/F12F1_24 [Arabidopsis thaliana]
gi|15912251|gb|AAL08259.1| At1g11910/F12F1_24 [Arabidopsis thaliana]
gi|17381036|gb|AAL36330.1| putative aspartic proteinase [Arabidopsis thaliana]
gi|21617929|gb|AAM66979.1| putative aspartic proteinase [Arabidopsis thaliana]
gi|25055040|gb|AAN71979.1| putative aspartic proteinase [Arabidopsis thaliana]
gi|332190692|gb|AEE28813.1| aspartic proteinase A1 [Arabidopsis thaliana]
Length = 506
Score = 86.7 bits (213), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 45/98 (45%), Positives = 61/98 (62%), Gaps = 3/98 (3%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYC-SNCQAI 117
RNA +E GGE++FGGVD + F G TY PV++KGYWQF + + I +C S C AI
Sbjct: 226 RNADEEEGGELVFGGVDPNHFKGKHTYVPVTQKGYWQFDMGDVLIGGAPTGFCESGCSAI 285
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDN 155
AD+GTSL+ GP+ +I +N IGA + + K D
Sbjct: 286 ADSGTSLLAGPTTIITMINHAIGAAGVVSQQCKTVVDQ 323
Score = 66.2 bits (160), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 27/47 (57%), Positives = 37/47 (78%), Gaps = 1/47 (2%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+++ AQYYG I++GTPPQ+F V+FDTGSSNLW+PS C ++AC
Sbjct: 74 LKNYLDAQYYGEIAIGTPPQKFTVVFDTGSSNLWVPSSKC-YFSLAC 119
Score = 36.6 bits (83), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 16/45 (35%), Positives = 25/45 (55%)
Query: 130 KVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
+++ +N+L +P G + VDC L MP V + +GGK F L
Sbjct: 402 RILNYVNELCERLPSPMGESAVDCAQLSTMPTVSLTIGGKVFDLA 446
>gi|356555682|ref|XP_003546159.1| PREDICTED: aspartic proteinase-like [Glycine max]
Length = 507
Score = 86.7 bits (213), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 43/84 (51%), Positives = 57/84 (67%), Gaps = 3/84 (3%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCSN-CQAI 117
R +ENGGE++FGG D + G TY PV+RKGYWQF + + I + YC+N C AI
Sbjct: 227 RKPEEENGGELVFGGADPAHYKGKHTYVPVTRKGYWQFDMGDVLISGKPTGYCTNDCSAI 286
Query: 118 ADTGTSLIIGPSKVIAELNKLIGA 141
AD+GTSL+ GP+ VI +N+ IGA
Sbjct: 287 ADSGTSLLAGPTTVITMINQAIGA 310
Score = 67.8 bits (164), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 33/72 (45%), Positives = 47/72 (65%), Gaps = 12/72 (16%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHF 60
+++ AQYYG I++GTPPQ+F VIFDTGSSNLW+PS C ++AC + H+ +
Sbjct: 75 LKNYLDAQYYGEIAIGTPPQKFTVIFDTGSSNLWVPSSKC-YFSVACFM-------HARY 126
Query: 61 RNASD----ENG 68
R++ ENG
Sbjct: 127 RSSQSSTYRENG 138
Score = 38.1 bits (87), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 18/45 (40%), Positives = 25/45 (55%)
Query: 130 KVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
++I N+L +P GP+ VDC L MP V +GGK F L+
Sbjct: 403 RIIDYANELCEKLPNPMGPSSVDCGKLSSMPIVSFTIGGKVFDLS 447
>gi|290543422|ref|NP_001166408.1| cathepsin E precursor [Cavia porcellus]
gi|115721|sp|P25796.1|CATE_CAVPO RecName: Full=Cathepsin E; Flags: Precursor
gi|191295|gb|AAA37052.1| procathepsin E [Cavia porcellus]
gi|1246041|gb|AAB35844.1| procathepsin E [Cavia]
Length = 391
Score = 86.7 bits (213), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 46/112 (41%), Positives = 67/112 (59%), Gaps = 3/112 (2%)
Query: 67 NGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAIADTGTSL 124
+G E+ FGG D F G + + PV+++ YWQ ++ I++ +V +CS CQAI DTGTSL
Sbjct: 222 SGSELTFGGYDPSHFSGSLNWVPVTKQAYWQIALDGIQVGDSVMFCSEGCQAIVDTGTSL 281
Query: 125 IIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLTRT 176
I GP I +L + +GA + G V C NL+ M +V I+ G +TL T
Sbjct: 282 ITGPPGKIKQLQEALGATYVDEG-YSVQCANLNMMLDVTFIINGVPYTLNPT 332
Score = 61.2 bits (147), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 24/34 (70%), Positives = 31/34 (91%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCS 41
+Y+GTIS+G+PPQ F VIFDTGSSNLW+PS +C+
Sbjct: 73 EYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCT 106
>gi|341887183|gb|EGT43118.1| hypothetical protein CAEBREN_03875 [Caenorhabditis brenneri]
Length = 397
Score = 86.7 bits (213), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 42/115 (36%), Positives = 68/115 (59%), Gaps = 2/115 (1%)
Query: 61 RNASD-ENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN-CQAIA 118
R+A+D NGGEI D + + G+I + P+ + YW+ + ++ I+ Y + +I
Sbjct: 219 RDANDIANGGEITLCDTDSNHYSGNIAWEPLVSEDYWRIKLGAVSIQGTTYTNGPMDSIV 278
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
DTGTSL+ GPS +I ++ IG +PL NG +V C + +PN+ LGG++F L
Sbjct: 279 DTGTSLLTGPSDIIKKIQHKIGGIPLLNGEYEVVCSKIPSLPNITFTLGGQDFNL 333
Score = 67.0 bits (162), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 27/43 (62%), Positives = 35/43 (81%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSV 49
AQYYG I++GTPPQ F+V+FDTGSSNLW+P +C +IAC +
Sbjct: 67 AQYYGPITIGTPPQNFQVLFDTGSSNLWVPCANCPFGDIACRM 109
>gi|222424506|dbj|BAH20208.1| AT1G11910 [Arabidopsis thaliana]
Length = 389
Score = 86.7 bits (213), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 45/98 (45%), Positives = 61/98 (62%), Gaps = 3/98 (3%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYC-SNCQAI 117
RNA +E GGE++FGGVD + F G TY PV++KGYWQF + + I +C S C AI
Sbjct: 109 RNADEEEGGELVFGGVDPNHFKGKHTYVPVTQKGYWQFDMGDVLIGGAPTGFCESGCSAI 168
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDN 155
AD+GTSL+ GP+ +I +N IGA + + K D
Sbjct: 169 ADSGTSLLAGPTTIITMINHAIGAAGVVSQQCKTVVDQ 206
Score = 36.2 bits (82), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 16/45 (35%), Positives = 25/45 (55%)
Query: 130 KVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
+++ +N+L +P G + VDC L MP V + +GGK F L
Sbjct: 285 RILNYVNELCERLPSPMGESAVDCAQLSTMPTVSLTIGGKVFDLA 329
>gi|4589716|dbj|BAA76870.1| aspartic proteinase [Helianthus annuus]
Length = 509
Score = 86.7 bits (213), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 44/101 (43%), Positives = 63/101 (62%), Gaps = 3/101 (2%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCS-NCQAI 117
R+ +E GGE++FGGVD + F G TY PV++KGYWQF + + I + +CS C AI
Sbjct: 229 RHTGEEEGGELVFGGVDPNHFKGKHTYVPVTQKGYWQFDMGDVLIGDKTTGFCSGGCAAI 288
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDK 158
AD+GTSL+ GP+ +I ++N IGA + + K D K
Sbjct: 289 ADSGTSLLAGPTTIITQINHAIGAAGVMSQQCKTLVDQYGK 329
Score = 65.1 bits (157), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 26/47 (55%), Positives = 37/47 (78%), Gaps = 1/47 (2%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+++ AQY+G I +GTPPQ+F V+FDTGS+NLW+PS C L++AC
Sbjct: 77 LKNYMDAQYFGEIGIGTPPQKFTVVFDTGSANLWVPSSKC-FLSVAC 122
Score = 41.2 bits (95), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 18/44 (40%), Positives = 25/44 (56%)
Query: 131 VIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
+I +N+L +P G + VDC L MPN+ +GGK F LT
Sbjct: 406 IINYVNELCDRIPSPMGESAVDCQTLSNMPNIAFTIGGKTFDLT 449
>gi|307167627|gb|EFN61155.1| Lysosomal aspartic protease [Camponotus floridanus]
Length = 208
Score = 86.7 bits (213), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 45/115 (39%), Positives = 66/115 (57%), Gaps = 5/115 (4%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCS-NCQAIAD 119
RN S + GG+ +FGG D + G ITY PV+ +G WQ ++SI++ +C +CQA D
Sbjct: 38 RNVSADFGGKFIFGGSDPAFYEGKITYVPVTHRGLWQITIDSIRLNNFTWCEKSCQANVD 97
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
T T IIGP K ++ +N+LI N ++C + +P + LGGK F LT
Sbjct: 98 TSTWKIIGPEKDVSLINRLIE----INSHGSINCSRIFHLPTITFNLGGKAFDLT 148
>gi|355711035|gb|AES03877.1| pepsinogen 5, group I [Mustela putorius furo]
Length = 165
Score = 86.7 bits (213), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 43/108 (39%), Positives = 64/108 (59%), Gaps = 2/108 (1%)
Query: 68 GGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-KNVYCSN-CQAIADTGTSLI 125
G +MFGG+D F G++ + PVS +GYWQ V+SI + + + CS CQAI DTGTSL+
Sbjct: 1 GSVVMFGGIDSSYFTGNLNWVPVSVEGYWQISVDSITMNGQPIACSQGCQAIVDTGTSLL 60
Query: 126 IGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
GPS IA + IGA + G + C ++ +P++ + G + L
Sbjct: 61 TGPSNAIANIQSTIGASQDSYGQMVISCSAINNLPDIVFTINGIQYPL 108
>gi|195350355|ref|XP_002041706.1| GM16821 [Drosophila sechellia]
gi|194123479|gb|EDW45522.1| GM16821 [Drosophila sechellia]
Length = 410
Score = 86.7 bits (213), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 44/116 (37%), Positives = 69/116 (59%), Gaps = 3/116 (2%)
Query: 61 RNASDE-NGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSNCQAIAD 119
RN ++ NGGE++ G D + G +TY PVS GYWQF + S + +C NC+AI D
Sbjct: 241 RNGTNAINGGELILEGSDSGLYSGCLTYVPVSNAGYWQFTMTSANLNGFQFCENCEAILD 300
Query: 120 TGTSLIIGPSKVIAELNKLIGAV-PLA-NGPAKVDCDNLDKMPNVDIILGGKNFTL 173
GTSLI+ P +V+ +N+++ + P A NG VDC ++ +P++ + + F L
Sbjct: 301 VGTSLIVVPEQVLDTINQILRVLNPTASNGVFLVDCSSIGDLPDIVFTIARRKFPL 356
Score = 52.8 bits (125), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 19/40 (47%), Positives = 28/40 (70%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHC 40
+++ Y +YY T+ G PPQ+ KV+ DTGS+NLW+ S C
Sbjct: 82 LKNLYNTEYYTTLGFGNPPQDLKVLIDTGSANLWVLSSKC 121
>gi|7766834|pdb|1DP5|A Chain A, The Structure Of Proteinase A Complexed With A Ia3 Mutant
Inhibitor
gi|7766836|pdb|1DPJ|A Chain A, The Structure Of Proteinase A Complexed With Ia3 Peptide
Inhibitor
gi|22218637|pdb|1FMU|A Chain A, Structure Of Native Proteinase A In P3221 Space Group.
gi|22218638|pdb|1FMX|A Chain A, Structure Of Native Proteinase A In The Space Group P21
gi|22218639|pdb|1FMX|B Chain B, Structure Of Native Proteinase A In The Space Group P21
gi|225346|prf||1301217A proteinase A,Asp
Length = 329
Score = 86.7 bits (213), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 48/126 (38%), Positives = 69/126 (54%), Gaps = 1/126 (0%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI-EK 107
+D+ +F + + ENGGE FGG+D+ KF GDIT+ PV RK YW+ E I + ++
Sbjct: 147 LDEKRFAFYLGDTSKDTENGGEATFGGIDESKFKGDITWLPVRRKAYWEVKFEGIGLGDE 206
Query: 108 NVYCSNCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILG 167
+ A DTGTSLI PS + +N IGA G +DC+ D +P++
Sbjct: 207 YAELESHGAAIDTGTSLITLPSGLAEMINAEIGAKKGWTGQYTLDCNTRDNLPDLIFNFN 266
Query: 168 GKNFTL 173
G NFT+
Sbjct: 267 GYNFTI 272
Score = 63.9 bits (154), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 36/84 (42%), Positives = 47/84 (55%), Gaps = 4/84 (4%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASD- 65
AQYY I+LGTPPQ FKVI DTGSSNLW+PS C L D S+ N ++
Sbjct: 13 AQYYTDITLGTPPQNFKVILDTGSSNLWVPSNECGSLACFLHSKYDHEASSSYKANGTEF 72
Query: 66 --ENGGEIMFGGVDKDKF-VGDIT 86
+ G + G + +D +GD+T
Sbjct: 73 AIQYGTGSLEGYISQDTLSIGDLT 96
>gi|410045159|ref|XP_001145764.3| PREDICTED: pepsin A-5 isoform 1 [Pan troglodytes]
Length = 434
Score = 86.7 bits (213), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 45/125 (36%), Positives = 73/125 (58%), Gaps = 5/125 (4%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-K 107
V +D F S + +A D++G ++FGG+D + G + + PV+ +GYWQ V+SI + K
Sbjct: 208 VSQDLF---SVYLSADDKSGSVVIFGGIDSSYYTGSLNWVPVTVEGYWQITVDSITMNGK 264
Query: 108 NVYCSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
+ C+ CQAI DTGTSL+ GP+ IA + IGA ++G V C + +P++ +
Sbjct: 265 TIACAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASENSDGDMVVSCSAISSLPDIVFTI 324
Query: 167 GGKNF 171
G +
Sbjct: 325 NGVQY 329
Score = 58.2 bits (139), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 26/48 (54%), Positives = 37/48 (77%), Gaps = 3/48 (6%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKF 54
+Y+GTI +GTP Q+F V+FDTGSSNLW+PS +C ++AC +D + F
Sbjct: 74 VEYFGTIGIGTPAQDFTVVFDTGSSNLWVPSVYC--YSLAC-MDHNLF 118
>gi|162856|gb|AAA30446.1| chymosin [Bos taurus]
Length = 381
Score = 86.7 bits (213), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 49/125 (39%), Positives = 64/125 (51%), Gaps = 4/125 (3%)
Query: 54 FVGHSHFRNASDENGGEIMF--GGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYC 111
V F D NG E M G +D + G + + PV+ + YWQF V+S+ I V
Sbjct: 205 LVAQDLFSVYMDRNGQESMLTLGAIDPSYYTGSLHWVPVTVQQYWQFTVDSVTISGVVVA 264
Query: 112 SN--CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGK 169
CQAI DTGTS ++GPS I + + IGA G +DCDNL MP V + GK
Sbjct: 265 CEGGCQAILDTGTSKLVGPSSDILNIQQAIGATQNQYGEFDIDCDNLSYMPTVVFEINGK 324
Query: 170 NFTLT 174
+ LT
Sbjct: 325 MYPLT 329
Score = 58.9 bits (141), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 23/34 (67%), Positives = 29/34 (85%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHC 40
+QY+G I LGTPPQEF V+FDTGSS+ W+PS +C
Sbjct: 72 SQYFGKIYLGTPPQEFTVLFDTGSSDFWVPSIYC 105
>gi|1354272|gb|AAC49730.1| aspartic proteinase [Arabidopsis thaliana]
Length = 486
Score = 86.7 bits (213), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 45/98 (45%), Positives = 61/98 (62%), Gaps = 3/98 (3%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYC-SNCQAI 117
RNA +E GGE++FGGVD + F G TY PV++KGYWQF + + I +C S C AI
Sbjct: 206 RNADEEEGGELVFGGVDPNHFKGKHTYVPVTQKGYWQFDMGDVLIGGAPTGFCESGCSAI 265
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDN 155
AD+GTSL+ GP+ +I +N IGA + + K D
Sbjct: 266 ADSGTSLLAGPTTIITMINHAIGAAGVVSQQCKTVVDQ 303
Score = 66.2 bits (160), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 27/47 (57%), Positives = 37/47 (78%), Gaps = 1/47 (2%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+++ AQYYG I++GTPPQ+F V+FDTGSSNLW+PS C ++AC
Sbjct: 54 LKNYLDAQYYGEIAIGTPPQKFTVVFDTGSSNLWVPSSKC-YFSLAC 99
Score = 36.6 bits (83), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 16/45 (35%), Positives = 25/45 (55%)
Query: 130 KVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
+++ +N+L +P G + VDC L MP V + +GGK F L
Sbjct: 382 RILNYVNELCERLPSPMGESAVDCAQLSTMPTVSLTIGGKVFDLA 426
>gi|326933645|ref|XP_003212911.1| PREDICTED: embryonic pepsinogen-like [Meleagris gallopavo]
Length = 383
Score = 86.7 bits (213), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 43/106 (40%), Positives = 66/106 (62%), Gaps = 2/106 (1%)
Query: 68 GGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKN-VYCSN-CQAIADTGTSLI 125
G ++FGG+D F G I + PVS +GYWQ ++SI + K + CS+ CQAI DTGTSL+
Sbjct: 223 GSMVIFGGIDDSYFTGSINWIPVSYQGYWQISMDSIIVNKQEIACSSGCQAIIDTGTSLV 282
Query: 126 IGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNF 171
GP+ I ++ +GA + G V+C ++ MP+V ++GG +
Sbjct: 283 AGPASDINDIQSAVGANQNSYGEYNVNCSHVLAMPDVVFVIGGIQY 328
Score = 63.5 bits (153), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 25/41 (60%), Positives = 34/41 (82%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCS 41
+ ++ +YYGTIS+GTPPQ+F V+FDTGSSNLW+PS C+
Sbjct: 68 LLNTLDMEYYGTISIGTPPQDFTVVFDTGSSNLWVPSVSCT 108
>gi|348502024|ref|XP_003438569.1| PREDICTED: pepsin A-like [Oreochromis niloticus]
Length = 355
Score = 86.7 bits (213), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 46/116 (39%), Positives = 69/116 (59%), Gaps = 3/116 (2%)
Query: 60 FRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN--CQAI 117
+ +++++ + FGGVD + + G IT+ P+S + YWQ V+S+ V N CQAI
Sbjct: 211 YLSSNNQRSSVVTFGGVDTNHYSGSITWIPLSNELYWQITVDSVSGNGQVVACNGGCQAI 270
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
DTGTSLI+GP I +N +GA NG V C+N+ +MP+V + G+ FTL
Sbjct: 271 VDTGTSLIVGPQGDIDNINYWLGASS-QNGEYMVSCNNIGQMPDVIFNINGQQFTL 325
Score = 61.2 bits (147), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 25/33 (75%), Positives = 29/33 (87%)
Query: 9 YYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCS 41
YYG IS+G PPQ FKVIFDTGSSNLW+PS +C+
Sbjct: 72 YYGVISIGNPPQSFKVIFDTGSSNLWVPSIYCN 104
>gi|407924694|gb|EKG17726.1| Peptidase A1 [Macrophomina phaseolina MS6]
Length = 378
Score = 86.7 bits (213), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 47/118 (39%), Positives = 65/118 (55%), Gaps = 2/118 (1%)
Query: 59 HFRNASDENGGEI-MFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI-EKNVYCSNCQA 116
+ + +DE + FGG+D+ + G +T P+ RK YW+ ++SI + N A
Sbjct: 205 YLSDTNDEGSESVATFGGIDESHYTGKLTKIPLRRKAYWEVDLDSITFGDATAELDNTGA 264
Query: 117 IADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
I DTGTSLI PS + LNK IGA NG VDCD D +P++ L G NFT+T
Sbjct: 265 ILDTGTSLIALPSTLAELLNKEIGAKKSFNGQYTVDCDKRDGLPDLTFTLTGHNFTIT 322
Score = 65.5 bits (158), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 36/84 (42%), Positives = 46/84 (54%), Gaps = 4/84 (4%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASD- 65
AQY+ +SLGTPPQ FKVI DTGSSNLW+PS C + D ++ +N S
Sbjct: 63 AQYFSEVSLGTPPQTFKVILDTGSSNLWVPSSECGSIACYLHTKYDSSASSTYSKNGSTF 122
Query: 66 --ENGGEIMFGGVDKDKF-VGDIT 86
G + G V D F +GD+T
Sbjct: 123 EIRYGSGSLSGFVSNDVFTIGDLT 146
>gi|451992127|gb|EMD84649.1| hypothetical protein COCHEDRAFT_1189444 [Cochliobolus
heterostrophus C5]
gi|452004574|gb|EMD97030.1| hypothetical protein COCHEDRAFT_1189956 [Cochliobolus
heterostrophus C5]
Length = 399
Score = 86.7 bits (213), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 46/124 (37%), Positives = 67/124 (54%), Gaps = 1/124 (0%)
Query: 52 DKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVY- 110
D+ V + + +D+ G E FGG+D+ + G + P+ RK YW+ +++I K
Sbjct: 220 DEPVFAFYLGDVADKQGSEATFGGIDESHYTGKLIKLPLRRKAYWEVDLDAITFGKETAE 279
Query: 111 CSNCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKN 170
N I DTGTSLI PS + LNK IGA NG V+CD D +P++ L G N
Sbjct: 280 TENVGVILDTGTSLIALPSAMAELLNKEIGAKKGFNGQYSVECDKRDSLPDLTFTLTGHN 339
Query: 171 FTLT 174
FT++
Sbjct: 340 FTIS 343
Score = 67.4 bits (163), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 40/80 (50%), Positives = 50/80 (62%), Gaps = 7/80 (8%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSV-DK-DKFVGHSHFRNAS 64
AQY+ ISLGTPPQ FKVI DTGSSNLW+PS C+ +IAC + DK D ++ +N S
Sbjct: 85 AQYFSEISLGTPPQSFKVILDTGSSNLWVPSTQCT--SIACFLHDKYDSSSSSTYQKNGS 142
Query: 65 DENGGEIMFGGVDKDKFVGD 84
D EI +G FV +
Sbjct: 143 D---FEIRYGSGSMKGFVSN 159
>gi|242781757|ref|XP_002479865.1| aspartic endopeptidase Pep2 [Talaromyces stipitatus ATCC 10500]
gi|218720012|gb|EED19431.1| aspartic endopeptidase Pep2 [Talaromyces stipitatus ATCC 10500]
Length = 395
Score = 86.3 bits (212), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 55/150 (36%), Positives = 80/150 (53%), Gaps = 4/150 (2%)
Query: 26 FDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDI 85
FDT S N +P + ++LN ++D+ F + N +N E FGG+DK + G++
Sbjct: 193 FDTISVNKIVPPFY-NMLN-QKTLDEPVFAFYLGDSNKEGDNS-EATFGGIDKSHYTGEL 249
Query: 86 TYSPVSRKGYWQFGVESIKIEKNVY-CSNCQAIADTGTSLIIGPSKVIAELNKLIGAVPL 144
P+ RK YW+ +++ NV N I DTGTSLI PS + LNK IGA
Sbjct: 250 VKIPLRRKAYWEVDFDAVAFGDNVAELENTGVILDTGTSLIALPSTLAELLNKEIGASKS 309
Query: 145 ANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
NG VDC D +P++ + L G NF++T
Sbjct: 310 WNGQYTVDCTKRDSLPDLTVTLSGHNFSIT 339
Score = 62.8 bits (151), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 37/86 (43%), Positives = 53/86 (61%), Gaps = 8/86 (9%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDK--DKFVGHSHFRNAS 64
AQY+ I++GTPPQ FKV+ DTGSSNLW+PS C+ +IAC + D ++ +N S
Sbjct: 80 AQYFSEITIGTPPQNFKVVLDTGSSNLWVPSASCN--SIACYLHNKYDSSSSSTYKKNGS 137
Query: 65 D---ENGGEIMFGGVDKDKF-VGDIT 86
+ + G + G V +D +GDIT
Sbjct: 138 EFAIQYGSGSLEGFVSRDVVTIGDIT 163
>gi|453084572|gb|EMF12616.1| aspartyl proteinase [Mycosphaerella populorum SO2202]
Length = 396
Score = 86.3 bits (212), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 50/126 (39%), Positives = 70/126 (55%), Gaps = 3/126 (2%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKN 108
+D+ F + + DE+ E +FGGV+KD + GD+T P+ RK YW+ +++I K
Sbjct: 216 LDEKVFAFYLSSTDKGDES--EAIFGGVNKDHYTGDMTKIPLRRKAYWEVDLDAITFGKQ 273
Query: 109 VYCSNCQ-AIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILG 167
+ AI DTGTSLI PS + LNK IGA NG VDC D +P++ L
Sbjct: 274 TAEIDATGAILDTGTSLIALPSTLAELLNKEIGAKKSYNGQYTVDCSARDSLPDLTFTLT 333
Query: 168 GKNFTL 173
G NFT+
Sbjct: 334 GHNFTI 339
Score = 65.5 bits (158), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 33/65 (50%), Positives = 41/65 (63%), Gaps = 5/65 (7%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASDE 66
AQY+ I++GTPPQEFKV+ DTGSSNLW+PS C +IAC + G S N +
Sbjct: 82 AQYFSEIAVGTPPQEFKVVLDTGSSNLWVPSSECG--SIACYLHSKYNHGDS---NTYKQ 136
Query: 67 NGGEI 71
NG E
Sbjct: 137 NGSEF 141
>gi|402226359|gb|EJU06419.1| endopeptidase [Dacryopinax sp. DJM-731 SS1]
Length = 413
Score = 86.3 bits (212), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 44/112 (39%), Positives = 63/112 (56%), Gaps = 1/112 (0%)
Query: 64 SDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI-EKNVYCSNCQAIADTGT 122
SD +GGE FGG+D+ + G I Y PV RKGYW+ ++ I + +++ + A DTGT
Sbjct: 246 SDADGGEATFGGIDESAYTGKIDYVPVRRKGYWEIELDKISLGGEDLELESTGAAIDTGT 305
Query: 123 SLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
SLI PS + LNK IGA N V+C +D +P + GK + L+
Sbjct: 306 SLIALPSDIAEMLNKEIGATKSWNNQYTVECSTVDSLPELTFYFNGKPYPLS 357
Score = 64.7 bits (156), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 34/78 (43%), Positives = 48/78 (61%), Gaps = 3/78 (3%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASDE 66
AQY+ I+LGTPPQ FKV+ DTGSSNLW+PS C+ +IAC + + K+ + S+
Sbjct: 100 AQYFAEITLGTPPQTFKVVLDTGSSNLWVPSIKCT--SIACFLHQ-KYDSAASSTYKSNG 156
Query: 67 NGGEIMFGGVDKDKFVGD 84
EI +G + FV +
Sbjct: 157 TAFEIHYGSGSMEGFVSN 174
>gi|168031065|ref|XP_001768042.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162680680|gb|EDQ67114.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 455
Score = 86.3 bits (212), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 42/98 (42%), Positives = 63/98 (64%), Gaps = 3/98 (3%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCS-NCQAI 117
R+ E GGE++ GGVD F G+ TY+PV+R GYWQF + + + + +C+ C AI
Sbjct: 176 RDVEGEKGGELVLGGVDPHHFKGNHTYTPVTRLGYWQFDMGDVLLDGQSTGFCAGGCAAI 235
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDN 155
AD+GTSL+ GP+ ++AE+N IGA + +G K+ D
Sbjct: 236 ADSGTSLLAGPTGIVAEINYAIGATGIISGECKLVVDQ 273
Score = 64.3 bits (155), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 28/41 (68%), Positives = 33/41 (80%), Gaps = 1/41 (2%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQYYG I +GTP QEF V+FDTGSSNLW+PS C L++AC
Sbjct: 30 AQYYGVIEIGTPKQEFTVVFDTGSSNLWVPSAKC-YLSLAC 69
Score = 38.9 bits (89), Expect = 0.73, Method: Compositional matrix adjust.
Identities = 17/46 (36%), Positives = 27/46 (58%)
Query: 129 SKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
+++ LN+L +P NG + VDC++L MP+V + G F LT
Sbjct: 350 AQIKEHLNQLCERLPNPNGESMVDCNSLSSMPDVSFTISGTTFKLT 395
>gi|356542078|ref|XP_003539498.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase oryzasin-1-like
[Glycine max]
Length = 449
Score = 86.3 bits (212), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 41/93 (44%), Positives = 61/93 (65%), Gaps = 3/93 (3%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE---KNVYCSNCQAI 117
RN ++E GG+I+FGG+D D + G+ TY PV++KGYWQ + + I + + C AI
Sbjct: 169 RNTNEEQGGQIVFGGIDSDHYKGEHTYVPVTQKGYWQIEIGDVLINGKTTGLCAAKCLAI 228
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAK 150
D+GTSL+ GP+ VIA++N IGAV + + K
Sbjct: 229 VDSGTSLLAGPTGVIAQINHAIGAVGIVSQECK 261
Score = 61.2 bits (147), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 33/80 (41%), Positives = 50/80 (62%), Gaps = 14/80 (17%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHF 60
+++ AQY+G I +GT PQ+F VIFDTGSSNLW+PS C ++AC + HS +
Sbjct: 33 LKNYMNAQYFGEIGIGTLPQKFTVIFDTGSSNLWVPSSKC-YFSVACYL-------HSRY 84
Query: 61 RNAS----DENG--GEIMFG 74
+++ ++NG EI +G
Sbjct: 85 KSSQSSTCNKNGSSAEIHYG 104
Score = 37.7 bits (86), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 16/45 (35%), Positives = 26/45 (57%)
Query: 130 KVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
+++ N L VP G + V+C+ L +MPNV +GG+ F L+
Sbjct: 345 QILDHANALCDLVPSPKGESVVECNTLSEMPNVSFTIGGEVFELS 389
>gi|254583898|ref|XP_002497517.1| ZYRO0F07392p [Zygosaccharomyces rouxii]
gi|238940410|emb|CAR28584.1| ZYRO0F07392p [Zygosaccharomyces rouxii]
Length = 418
Score = 86.3 bits (212), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 49/127 (38%), Positives = 69/127 (54%), Gaps = 2/127 (1%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI-EK 107
+D+ KF + R+ ++GG FGGVD K+ G+IT+ P+ RK YW+ + I + E+
Sbjct: 237 LDEPKFAFYLG-RDGESQDGGVATFGGVDDSKYEGEITWLPIRRKAYWEVKFDGIGLGEE 295
Query: 108 NVYCSNCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILG 167
N A DTGTSLI PS + +N IGA G V+C+ +PN+ LG
Sbjct: 296 YAELENHGAAIDTGTSLIALPSGLAEIINAEIGAKKSWTGQYTVECEARSSLPNMTFTLG 355
Query: 168 GKNFTLT 174
G NF LT
Sbjct: 356 GHNFELT 362
Score = 64.7 bits (156), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 29/41 (70%), Positives = 33/41 (80%), Gaps = 2/41 (4%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQYY +SLGTPPQ FKVI DTGSSNLW+PS CS ++AC
Sbjct: 103 AQYYTEVSLGTPPQNFKVILDTGSSNLWVPSTECS--SLAC 141
>gi|401838744|gb|EJT42213.1| PEP4-like protein [Saccharomyces kudriavzevii IFO 1802]
Length = 405
Score = 86.3 bits (212), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 48/126 (38%), Positives = 68/126 (53%), Gaps = 1/126 (0%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI-EK 107
+D+ KF + + ENGGE FGG+D+ KF GDIT+ PV RK YW+ E I + ++
Sbjct: 223 LDEKKFAFYLGDTSKDSENGGEATFGGIDESKFKGDITWLPVRRKAYWEVKFEGIGLGDE 282
Query: 108 NVYCSNCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILG 167
A DTGTSLI PS + +N +GA G +DC+ D +P++
Sbjct: 283 YAELEGHGAAIDTGTSLITLPSGLAEMINAELGAKKGWTGQYTLDCNTRDSLPDLIFNFN 342
Query: 168 GKNFTL 173
G NFT+
Sbjct: 343 GYNFTI 348
Score = 63.5 bits (153), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 36/84 (42%), Positives = 47/84 (55%), Gaps = 4/84 (4%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASD- 65
AQYY I+LGTPPQ FKVI DTGSSNLW+PS C L D S+ N ++
Sbjct: 89 AQYYTDITLGTPPQNFKVILDTGSSNLWVPSNECGSLACFLHSKYDHEASSSYKANGTEF 148
Query: 66 --ENGGEIMFGGVDKDKF-VGDIT 86
+ G + G + +D +GD+T
Sbjct: 149 AIQYGTGSLEGYISQDTLSIGDLT 172
>gi|332241360|ref|XP_003269848.1| PREDICTED: napsin-A-like [Nomascus leucogenys]
Length = 421
Score = 86.3 bits (212), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 42/116 (36%), Positives = 67/116 (57%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAIA 118
R+ +GGE++ GG D ++ +T+ PV+ YWQ +E +K+ + C+ C AI
Sbjct: 224 RDPEVADGGELVLGGSDPAHYIPPLTFVPVTVPAYWQIHMERVKVGSGLTLCARGCAAIL 283
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGT +IIGP++ I L+ IG + L G + C + K+P V +++GG F LT
Sbjct: 284 DTGTPVIIGPTEEIRALHAAIGGISLLAGEYLIRCSEIPKLPAVSLLIGGVWFNLT 339
Score = 66.2 bits (160), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 26/41 (63%), Positives = 31/41 (75%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQY+G I LGTPPQ F V FDTGSSNLW+PS+ C ++ C
Sbjct: 77 AQYFGEIGLGTPPQNFTVTFDTGSSNLWVPSRRCHFFSVPC 117
>gi|218188712|gb|EEC71139.1| hypothetical protein OsI_02961 [Oryza sativa Indica Group]
Length = 540
Score = 86.3 bits (212), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 57/147 (38%), Positives = 76/147 (51%), Gaps = 15/147 (10%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASDE 66
A++ G + LG FK I G+ +W I S+ DK RNA+D
Sbjct: 256 AKFDGILGLG-----FKEISVEGADPVWYNM-------IQQSLVTDKVFSFWLNRNANDI 303
Query: 67 NGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI---EKNVYCSNCQAIADTGTS 123
NGGEI+FGG D+ + GD TY+ V+RK YWQF + I + C IAD+GTS
Sbjct: 304 NGGEIVFGGADESHYKGDHTYTRVTRKAYWQFEMGDFLIGGRSTGICVDGCAVIADSGTS 363
Query: 124 LIIGPSKVIAELNKLIGAVPLANGPAK 150
LI GP IA+++ IGA +AN K
Sbjct: 364 LIAGPIAAIAQIHAHIGATGVANEECK 390
Score = 62.0 bits (149), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 25/47 (53%), Positives = 35/47 (74%), Gaps = 1/47 (2%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+++ AQY+G I +G PPQ F V+FDTGSSNLW+PS C + ++AC
Sbjct: 146 LKNFLNAQYFGEIGVGCPPQNFTVVFDTGSSNLWVPSAKC-VFSLAC 191
>gi|162287225|ref|NP_001104613.1| embryonic pepsinogen precursor [Mus musculus]
gi|148669949|gb|EDL01896.1| mCG21729 [Mus musculus]
gi|219520887|gb|AAI71994.1| Gene model 131, (NCBI) [Mus musculus]
gi|223462011|gb|AAI47478.1| Gm131 protein [Mus musculus]
Length = 379
Score = 86.3 bits (212), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 42/113 (37%), Positives = 63/113 (55%), Gaps = 2/113 (1%)
Query: 63 ASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN--CQAIADT 120
+ +E G + G +D+ F+G + + PV+ +GYWQF V+ I I V C A+ DT
Sbjct: 216 SRNEQGSMLTLGAIDQSYFIGSLHWVPVTVQGYWQFTVDRITINGEVVACQGGCPAVLDT 275
Query: 121 GTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
GT+L+ GP + I + ++IGAV N +DC LD MP V + G+ F L
Sbjct: 276 GTALLTGPGRDILNIQQVIGAVQGHNDQFDIDCWRLDIMPTVVFEIHGREFPL 328
Score = 60.5 bits (145), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 23/35 (65%), Positives = 31/35 (88%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCS 41
++Y+GTI +GTPPQEF V+FDTGSS LW+PS +C+
Sbjct: 72 SEYFGTIYIGTPPQEFTVVFDTGSSELWVPSVYCN 106
>gi|2624629|pdb|2JXR|A Chain A, Structure Of Yeast Proteinase A
gi|10835733|pdb|1FQ4|A Chain A, Crystal Structure Of A Complex Between Hydroxyethylene
Inhibitor Cp- 108,420 And Yeast Aspartic Proteinase A
gi|10835734|pdb|1FQ5|A Chain A, X-Ray Struture Of A Cyclic Statine Inhibitor Pd-129,541
Bound To Yeast Proteinase A
gi|10835735|pdb|1FQ6|A Chain A, X-Ray Structure Of Glycol Inhibitor Pd-133,450 Bound To
Saccharopepsin
gi|10835736|pdb|1FQ7|A Chain A, X-Ray Structure Of Inhibitor Cp-72,647 Bound To
Saccharopepsin
gi|10835737|pdb|1FQ8|A Chain A, X-Ray Structure Of Difluorostatine Inhibitor Cp81,198
Bound To Saccharopepsin
Length = 329
Score = 86.3 bits (212), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 48/126 (38%), Positives = 69/126 (54%), Gaps = 1/126 (0%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI-EK 107
+D+ +F + + ENGGE FGG+D+ KF GDIT+ PV RK YW+ E I + ++
Sbjct: 147 LDEKRFAFYLGDTSKDTENGGEATFGGIDESKFKGDITWLPVRRKAYWEVKFEGIGLGDE 206
Query: 108 NVYCSNCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILG 167
+ A DTGTSLI PS + +N IGA G +DC+ D +P++
Sbjct: 207 YAELESHGAAIDTGTSLITLPSGLAEMINAEIGAKKGWTGQYTLDCNTRDNLPDLIFNFN 266
Query: 168 GKNFTL 173
G NFT+
Sbjct: 267 GYNFTI 272
Score = 63.5 bits (153), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 36/84 (42%), Positives = 47/84 (55%), Gaps = 4/84 (4%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASD- 65
AQYY I+LGTPPQ FKVI DTGSSNLW+PS C L D S+ N ++
Sbjct: 13 AQYYTDITLGTPPQNFKVILDTGSSNLWVPSNECGSLACFLHSKYDHEASSSYKANGTEF 72
Query: 66 --ENGGEIMFGGVDKDKF-VGDIT 86
+ G + G + +D +GD+T
Sbjct: 73 AIQYGTGSLEGYISQDTLSIGDLT 96
>gi|289743097|gb|ADD20296.1| cathepsin D [Glossina morsitans morsitans]
Length = 258
Score = 86.3 bits (212), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 47/125 (37%), Positives = 65/125 (52%), Gaps = 1/125 (0%)
Query: 51 KDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVY 110
K+ + RN S GGE+ GGV+ D F G + Y +SRK YWQF ++S + K
Sbjct: 75 KENVFSFYYLRNGSTRLGGELTLGGVNPDHFEGQLHYVDISRKFYWQFEMKSAHVNKRQV 134
Query: 111 CSNCQAIADTGTSLIIGPSKVIAELNKLIGA-VPLANGPAKVDCDNLDKMPNVDIILGGK 169
C CQ IADTG + I P +N+ IGA + VDC +D++P V +G
Sbjct: 135 CRKCQVIADTGCNGIGLPRDDYQRVNRAIGATIDPETFQYVVDCGKIDQLPEVTFKIGDG 194
Query: 170 NFTLT 174
FTL+
Sbjct: 195 VFTLS 199
>gi|115438741|ref|NP_001043650.1| Os01g0631900 [Oryza sativa Japonica Group]
gi|55297073|dbj|BAD68642.1| putative aspartic proteinase [Oryza sativa Japonica Group]
gi|113533181|dbj|BAF05564.1| Os01g0631900 [Oryza sativa Japonica Group]
Length = 522
Score = 86.3 bits (212), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 57/147 (38%), Positives = 76/147 (51%), Gaps = 15/147 (10%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASDE 66
A++ G + LG FK I G+ +W I S+ DK RNA+D
Sbjct: 241 AKFDGILGLG-----FKEISVEGADPVWYNM-------IQQSLVTDKVFSFWLNRNANDI 288
Query: 67 NGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI---EKNVYCSNCQAIADTGTS 123
NGGEI+FGG D+ + GD TY+ V+RK YWQF + I + C IAD+GTS
Sbjct: 289 NGGEIVFGGADESHYKGDHTYTRVTRKAYWQFEMGDFLIGGRSTGICVDGCAVIADSGTS 348
Query: 124 LIIGPSKVIAELNKLIGAVPLANGPAK 150
LI GP IA+++ IGA +AN K
Sbjct: 349 LIAGPIAAIAQIHAHIGATGVANEECK 375
Score = 61.6 bits (148), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 25/51 (49%), Positives = 36/51 (70%), Gaps = 1/51 (1%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDK 51
+++ AQY+G I +G PPQ F V+FDTGSSNLW+PS C + ++AC +
Sbjct: 131 LKNFLNAQYFGEIGVGCPPQNFTVVFDTGSSNLWVPSAKC-VFSLACYFHR 180
>gi|365758066|gb|EHM99929.1| Pep4p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
Length = 405
Score = 86.3 bits (212), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 48/126 (38%), Positives = 68/126 (53%), Gaps = 1/126 (0%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI-EK 107
+D+ KF + + ENGGE FGG+D+ KF GDIT+ PV RK YW+ E I + ++
Sbjct: 223 LDEKKFAFYLGDTSKDSENGGEATFGGIDESKFKGDITWLPVRRKAYWEVKFEGIGLGDE 282
Query: 108 NVYCSNCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILG 167
A DTGTSLI PS + +N +GA G +DC+ D +P++
Sbjct: 283 YAELEGHGAAIDTGTSLITLPSGLAEMINAELGAKKGWTGQYTLDCNTRDSLPDLIFNFN 342
Query: 168 GKNFTL 173
G NFT+
Sbjct: 343 GYNFTI 348
Score = 63.5 bits (153), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 36/84 (42%), Positives = 47/84 (55%), Gaps = 4/84 (4%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASD- 65
AQYY I+LGTPPQ FKVI DTGSSNLW+PS C L D S+ N ++
Sbjct: 89 AQYYTDITLGTPPQNFKVILDTGSSNLWVPSNECGSLACFLHSKYDHEASSSYKANGTEF 148
Query: 66 --ENGGEIMFGGVDKDKF-VGDIT 86
+ G + G + +D +GD+T
Sbjct: 149 AIQYGTGSLEGYISQDTLTIGDLT 172
>gi|169731523|gb|ACA64894.1| progastricsin (predicted) [Callicebus moloch]
Length = 388
Score = 86.3 bits (212), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 42/118 (35%), Positives = 67/118 (56%), Gaps = 3/118 (2%)
Query: 59 HFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCSN-CQ 115
+ N +GG ++FGGVD + G I ++PV+++ YWQ G+E I + + +CS CQ
Sbjct: 213 YLSNQQGSSGGAVVFGGVDSSLYTGQIYWAPVTQELYWQIGIEEFLIGGQASGWCSEGCQ 272
Query: 116 AIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
AI DTGTSL+ P + ++ + GA G V+CD++ +P + I+ G F L
Sbjct: 273 AIVDTGTSLLTVPQQYLSAFLEATGAEEDEYGQFLVNCDSIQSLPTLTFIINGVEFPL 330
Score = 60.8 bits (146), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 35/89 (39%), Positives = 49/89 (55%), Gaps = 9/89 (10%)
Query: 1 MRSSYQ------AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKF 54
+R SY+ A Y+G IS+GTPPQ F V+FDTGSSNLW+PS +C + AC+ +F
Sbjct: 59 LRVSYEPMDYMDAAYFGEISIGTPPQNFLVLFDTGSSNLWVPSVYCQ--SQACT-SHSRF 115
Query: 55 VGHSHFRNASDENGGEIMFGGVDKDKFVG 83
+S+E + +G F G
Sbjct: 116 NPSKSSTYSSNEQTFSLQYGSGSLTGFFG 144
>gi|195997429|ref|XP_002108583.1| hypothetical protein TRIADDRAFT_19041 [Trichoplax adhaerens]
gi|190589359|gb|EDV29381.1| hypothetical protein TRIADDRAFT_19041 [Trichoplax adhaerens]
Length = 354
Score = 86.3 bits (212), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 47/119 (39%), Positives = 72/119 (60%), Gaps = 4/119 (3%)
Query: 59 HFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-KNVYCSN--CQ 115
+FR S GGE++ GG+DK G ITY PV ++ YWQ ++ + E NV N C
Sbjct: 170 YFRRGSTYPGGELILGGIDKSYIEGPITYIPVIKQAYWQIKLDLDRKEGDNVRLCNRDCS 229
Query: 116 AIADTGTSLIIGPSKVIAELNKLIGAVPLAN-GPAKVDCDNLDKMPNVDIILGGKNFTL 173
AI DTGTS+IIGP++ + +++ I AV L +++DC+ +D +P ++ GK+F L
Sbjct: 230 AIIDTGTSMIIGPTEQVDRIHQEIHAVNLDELMDSRIDCNVVDSLPVINFRFNGKDFPL 288
Score = 57.0 bits (136), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 25/44 (56%), Positives = 32/44 (72%), Gaps = 2/44 (4%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDK 51
QY+G IS+GTP Q F+VIFDTGS++ WIPS C N C++ K
Sbjct: 25 QYFGEISIGTPSQTFQVIFDTGSADFWIPSSKCH--NSYCTLHK 66
>gi|75267434|sp|Q9XFX3.1|CARDA_CYNCA RecName: Full=Procardosin-A; Contains: RecName: Full=Cardosin-A
intermediate form 35 kDa subunit; Contains: RecName:
Full=Cardosin-A heavy chain; AltName: Full=Cardosin-A 31
kDa subunit; Contains: RecName: Full=Cardosin-A
intermediate form 30 kDa subunit; Contains: RecName:
Full=Cardosin-A light chain; AltName: Full=Cardosin-A 15
kDa subunit; Flags: Precursor
gi|4581209|emb|CAB40134.1| preprocardosin A [Cynara cardunculus]
Length = 504
Score = 86.3 bits (212), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 45/108 (41%), Positives = 65/108 (60%), Gaps = 7/108 (6%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCS-NCQAI 117
RN +E GGE++FGG+D + F GD TY PV+ + YWQFG+ + I + +C+ CQA
Sbjct: 225 RNVDEEEGGELVFGGLDPNHFRGDHTYVPVTYQYYWQFGIGDVLIGDKSTGFCAPGCQAF 284
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDII 165
AD+GTSL+ GP+ ++ ++N IG ANG C + DII
Sbjct: 285 ADSGTSLLSGPTAIVTQINHAIG----ANGVMNQQCKTVVSRYGRDII 328
Score = 56.2 bits (134), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 33/72 (45%), Positives = 43/72 (59%), Gaps = 14/72 (19%)
Query: 9 YYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASD--- 65
Y+G I +GTPPQ+F VIFDTGSS LW+PS C I + AC HS + ++
Sbjct: 85 YFGEIGIGTPPQKFTVIFDTGSSVLWVPSSKC-INSKACR-------AHSMYESSDSSTY 136
Query: 66 -ENG--GEIMFG 74
ENG G I++G
Sbjct: 137 KENGTFGAIIYG 148
Score = 37.0 bits (84), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 17/44 (38%), Positives = 25/44 (56%)
Query: 131 VIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
+I N+L + ++ +VDC+ L MPNV +GGK F LT
Sbjct: 402 IINYANELCEHLSTSSEELQVDCNTLSSMPNVSFTIGGKKFGLT 445
>gi|332024568|gb|EGI64766.1| Lysosomal aspartic protease [Acromyrmex echinatior]
Length = 403
Score = 86.3 bits (212), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 47/105 (44%), Positives = 62/105 (59%), Gaps = 2/105 (1%)
Query: 71 IMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN-CQAIADTGTSLIIGPS 129
++ GG DK + G +TY V R+ W+F ++ I+IE V C N CQAI DT T I GPS
Sbjct: 241 LILGGSDKRFYQGKLTYVDVIRETVWKFTIDQIEIEDTVLCMNGCQAIFDTLTLRITGPS 300
Query: 130 KVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
+A +N+LIGA VDCDN+ +P V LGG+ F LT
Sbjct: 301 SDVANINRLIGAT-YTKEEMIVDCDNISDLPYVYFYLGGELFELT 344
Score = 49.7 bits (117), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 19/32 (59%), Positives = 25/32 (78%)
Query: 9 YYGTISLGTPPQEFKVIFDTGSSNLWIPSQHC 40
YYGTI++GTPPQ+F V+ D +LWIPS+ C
Sbjct: 82 YYGTITIGTPPQKFNVLIDITFPDLWIPSKLC 113
>gi|126309841|ref|XP_001370380.1| PREDICTED: gastricsin-like isoform 1 [Monodelphis domestica]
Length = 388
Score = 86.3 bits (212), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 40/114 (35%), Positives = 68/114 (59%), Gaps = 3/114 (2%)
Query: 64 SDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCSN-CQAIADT 120
S ++GGE++FGGVD + + G I ++PV+++ YWQ G++ I + +CS CQAI DT
Sbjct: 218 SSQSGGEVIFGGVDNNLYSGQIYWAPVTQELYWQIGIQEFSIGGQATGWCSQGCQAIVDT 277
Query: 121 GTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
GTSL+ P + ++ + G G VDC+++ +P + ++ G F L+
Sbjct: 278 GTSLLTVPQQYMSAFLQATGGQQDQYGQYVVDCNSIQNLPTISFLINGVQFPLS 331
Score = 61.2 bits (147), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 30/54 (55%), Positives = 37/54 (68%), Gaps = 9/54 (16%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHF 60
+ YYG IS+GTPPQ F V+FDTGSSNLW+PS +C + ACS GH+ F
Sbjct: 70 SSYYGEISIGTPPQNFLVLFDTGSSNLWVPSIYCQ--SQACS-------GHARF 114
>gi|73947914|ref|XP_533610.2| PREDICTED: napsin-A [Canis lupus familiaris]
Length = 422
Score = 86.3 bits (212), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 43/116 (37%), Positives = 64/116 (55%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKN-VYCSN-CQAIA 118
R+ +GGE++ GG D ++ +T+ PV+ YWQ +E +K+ + C+ C AI
Sbjct: 221 RDPEAVDGGELVLGGSDPAHYIPPLTFLPVTVPAYWQIHMERVKVGTGLILCAQGCAAIL 280
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSLI GP++ I LN IG L G + C + +P + +LGG F LT
Sbjct: 281 DTGTSLITGPTEEIQALNAAIGGFSLLLGEYLIQCSEIPTLPPISFLLGGVWFNLT 336
Score = 65.5 bits (158), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 27/40 (67%), Positives = 30/40 (75%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
QYYG I LGTPPQ F VIFDTGSSNLW+PS C ++ C
Sbjct: 75 QYYGEIGLGTPPQNFSVIFDTGSSNLWVPSIRCHFFSLPC 114
>gi|21616051|emb|CAC86003.1| aspartic proteinase [Theobroma cacao]
Length = 514
Score = 86.3 bits (212), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 42/84 (50%), Positives = 58/84 (69%), Gaps = 3/84 (3%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCS-NCQAI 117
R+ D+ GGE++FGG+D F GD TY P++RKGYWQF + + I + C+ C AI
Sbjct: 234 RDPEDDIGGEVVFGGMDPKHFKGDHTYVPITRKGYWQFDMGDVLIGNQTTGLCAGGCSAI 293
Query: 118 ADTGTSLIIGPSKVIAELNKLIGA 141
AD+GTSLI GP+ +IA++N IGA
Sbjct: 294 ADSGTSLITGPTAIIAQVNHAIGA 317
Score = 64.7 bits (156), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 30/63 (47%), Positives = 42/63 (66%), Gaps = 8/63 (12%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHF 60
+++ AQY+G I +GTPPQ F VIFDTGSSNLW+PS C +IAC + HS +
Sbjct: 82 LKNYLDAQYFGEIGIGTPPQNFTVIFDTGSSNLWVPSSKC-YFSIACYL-------HSRY 133
Query: 61 RNA 63
+++
Sbjct: 134 KSS 136
Score = 40.0 bits (92), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 17/45 (37%), Positives = 27/45 (60%)
Query: 130 KVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
+++ +N+L +P G + VDC +L MPNV +GGK F L+
Sbjct: 410 RILEYINELCDRLPSPMGESAVDCSSLSTMPNVSFTIGGKIFELS 454
>gi|297848226|ref|XP_002891994.1| hypothetical protein ARALYDRAFT_314946 [Arabidopsis lyrata subsp.
lyrata]
gi|297337836|gb|EFH68253.1| hypothetical protein ARALYDRAFT_314946 [Arabidopsis lyrata subsp.
lyrata]
Length = 504
Score = 86.3 bits (212), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 47/103 (45%), Positives = 61/103 (59%), Gaps = 3/103 (2%)
Query: 51 KDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV- 109
+DK R+ E GGEI+FGGVD F G TY PV+RKGYWQF + I + N
Sbjct: 225 RDKVFSFWLNRDTEAEVGGEIVFGGVDPAHFKGKHTYVPVTRKGYWQFNMGDIFVGSNST 284
Query: 110 -YCSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAK 150
+C C AI D+GTSL+ GP+ VIA++N IGA + + K
Sbjct: 285 GFCEQGCDAIMDSGTSLLAGPTTVIAQINHAIGAEGIVSAECK 327
Score = 66.2 bits (160), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 29/47 (61%), Positives = 37/47 (78%), Gaps = 1/47 (2%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+++ AQYYG I +GTP QEF+VIFDTGSSNLW+PS C L++AC
Sbjct: 83 LKNYLDAQYYGVIGIGTPSQEFEVIFDTGSSNLWVPSSKC-YLSLAC 128
Score = 40.8 bits (94), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 23/74 (31%), Positives = 40/74 (54%), Gaps = 3/74 (4%)
Query: 104 KIEKNVYCSNCQ-AIADTGTSLIIGPSK--VIAELNKLIGAVPLANGPAKVDCDNLDKMP 160
K +V C C+ A+ T L + +K V +N+L ++P G + +DC+N+ MP
Sbjct: 371 KERSSVLCEVCEMAVVWVQTKLKVNETKEKVFEYVNQLCESLPSPAGESIIDCNNIKNMP 430
Query: 161 NVDIILGGKNFTLT 174
+V +GG F+L+
Sbjct: 431 SVTFTIGGNPFSLS 444
>gi|126309843|ref|XP_001370404.1| PREDICTED: gastricsin-like isoform 2 [Monodelphis domestica]
Length = 389
Score = 86.3 bits (212), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 40/114 (35%), Positives = 68/114 (59%), Gaps = 3/114 (2%)
Query: 64 SDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCSN-CQAIADT 120
S ++GGE++FGGVD + + G I ++PV+++ YWQ G++ I + +CS CQAI DT
Sbjct: 218 SSQSGGEVIFGGVDNNLYSGQIYWAPVTQELYWQIGIQEFSIGGQATGWCSQGCQAIVDT 277
Query: 121 GTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
GTSL+ P + ++ + G G VDC+++ +P + ++ G F L+
Sbjct: 278 GTSLLTVPQQYMSAFLQATGGQQDQYGQYVVDCNSIQNLPTISFLINGVQFPLS 331
Score = 61.2 bits (147), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 30/52 (57%), Positives = 36/52 (69%), Gaps = 9/52 (17%)
Query: 9 YYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHF 60
YYG IS+GTPPQ F V+FDTGSSNLW+PS +C + ACS GH+ F
Sbjct: 72 YYGEISIGTPPQNFLVLFDTGSSNLWVPSIYCQ--SQACS-------GHARF 114
>gi|426368715|ref|XP_004051348.1| PREDICTED: pepsin A-5-like isoform 1 [Gorilla gorilla gorilla]
Length = 388
Score = 86.3 bits (212), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 44/125 (35%), Positives = 73/125 (58%), Gaps = 5/125 (4%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-K 107
V +D F S + +A D++G ++FGG+D + G + + PV+ +GYWQ V+SI + +
Sbjct: 208 VSQDLF---SVYLSADDQSGSVVIFGGIDSSYYTGSLNWVPVTVEGYWQITVDSITMNGE 264
Query: 108 NVYCSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
+ C+ CQAI DTGTSL+ GP+ IA + IGA ++G V C + +P++ +
Sbjct: 265 TIACAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASENSDGDMVVSCSAISSLPDIVFTI 324
Query: 167 GGKNF 171
G +
Sbjct: 325 NGVQY 329
Score = 61.2 bits (147), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 26/41 (63%), Positives = 35/41 (85%), Gaps = 2/41 (4%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACS 48
+Y+GTI +GTP Q+F VIFDTGSSNLW+PS +CS ++AC+
Sbjct: 75 EYFGTIGIGTPAQDFTVIFDTGSSNLWVPSVYCS--SLACT 113
>gi|6739580|gb|AAF27315.1| prochymosin [Bubalus bubalis]
Length = 366
Score = 86.3 bits (212), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 50/125 (40%), Positives = 64/125 (51%), Gaps = 4/125 (3%)
Query: 54 FVGHSHFRNASDENGGEIMF--GGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYC 111
V F D NG E M G +D + G + + PV+ + YWQF V+SI I V
Sbjct: 190 LVAQDLFSVYMDRNGQESMLTLGAIDPSYYTGSLHWVPVTVQQYWQFTVDSITISGVVVA 249
Query: 112 SN--CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGK 169
CQAI DTGTS ++GPS I + + IGA G +DCDNL MP V + GK
Sbjct: 250 CEGGCQAILDTGTSKLVGPSSDILNIQQAIGATQNQYGEFDIDCDNLSYMPTVVSEINGK 309
Query: 170 NFTLT 174
+ LT
Sbjct: 310 MYPLT 314
Score = 58.9 bits (141), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 23/34 (67%), Positives = 29/34 (85%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHC 40
+QY+G I LGTPPQEF V+FDTGSS+ W+PS +C
Sbjct: 57 SQYFGKIYLGTPPQEFTVLFDTGSSDFWVPSIYC 90
>gi|343425806|emb|CBQ69339.1| probable PEP4-aspartyl protease [Sporisorium reilianum SRZ2]
Length = 419
Score = 86.3 bits (212), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 43/117 (36%), Positives = 67/117 (57%), Gaps = 1/117 (0%)
Query: 58 SHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI-EKNVYCSNCQA 116
S + +S+E+GGE +FGG+D+ + G I ++PV RKGYW+ ++ + + ++ + N A
Sbjct: 246 SFYLGSSEEDGGEAVFGGIDESHYSGKIHWAPVKRKGYWEVALDKLALGDEELELENGSA 305
Query: 117 IADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
DTGTSLI + LN IGA NG VDCD + +P + + G+ F L
Sbjct: 306 AIDTGTSLIAMATDTAEILNAEIGATKSWNGQYSVDCDKVKDLPPLTFYIDGQPFKL 362
Score = 68.9 bits (167), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 42/86 (48%), Positives = 54/86 (62%), Gaps = 8/86 (9%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDK--DKFVGHSHFRNAS 64
AQY+ ISLGTP QEFKVI DTGSSNLW+PS CS +IAC + K D S+ +N +
Sbjct: 106 AQYFCDISLGTPAQEFKVILDTGSSNLWVPSTKCS--SIACFLHKKYDSSASSSYKKNGT 163
Query: 65 D---ENGGEIMFGGVDKDKF-VGDIT 86
+ + G M G V +D +GD+T
Sbjct: 164 EFKIQYGSGSMEGIVSQDTLKIGDLT 189
>gi|307190924|gb|EFN74737.1| Lysosomal aspartic protease [Camponotus floridanus]
Length = 177
Score = 86.3 bits (212), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 46/115 (40%), Positives = 68/115 (59%), Gaps = 2/115 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYC-SNCQAIAD 119
R+ S E GG++ GG D + G +TY P+S+KGYWQF ++ I I C +C+ I +
Sbjct: 3 RDTSVELGGKLTLGGSDSAHYDGHLTYIPISQKGYWQFNMDKIVINNINLCKKSCETIIE 62
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
T ++IIGP K I +N+LIG L +G V+C + ++P + I G K F LT
Sbjct: 63 TDAAIIIGPEKDIRFINELIGTNNL-HGEEIVNCSQISELPTIQFISGDKAFNLT 116
>gi|426368717|ref|XP_004051349.1| PREDICTED: pepsin A-5-like isoform 2 [Gorilla gorilla gorilla]
Length = 388
Score = 86.3 bits (212), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 44/125 (35%), Positives = 73/125 (58%), Gaps = 5/125 (4%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-K 107
V +D F S + +A D++G ++FGG+D + G + + PV+ +GYWQ V+SI + +
Sbjct: 208 VSQDLF---SVYLSADDQSGSVVIFGGIDSSYYTGSLNWVPVTVEGYWQITVDSITMNGE 264
Query: 108 NVYCSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
+ C+ CQAI DTGTSL+ GP+ IA + IGA ++G V C + +P++ +
Sbjct: 265 TIACAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASENSDGDMVVSCSAISSLPDIVFTI 324
Query: 167 GGKNF 171
G +
Sbjct: 325 NGVQY 329
Score = 61.2 bits (147), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 26/41 (63%), Positives = 35/41 (85%), Gaps = 2/41 (4%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACS 48
+Y+GTI +GTP Q+F VIFDTGSSNLW+PS +CS ++AC+
Sbjct: 75 EYFGTIGIGTPAQDFTVIFDTGSSNLWVPSVYCS--SLACT 113
>gi|296198131|ref|XP_002746573.1| PREDICTED: gastricsin [Callithrix jacchus]
gi|18203304|sp|Q9N2D3.1|PEPC_CALJA RecName: Full=Gastricsin; AltName: Full=Pepsinogen C; Flags:
Precursor
gi|7008023|dbj|BAA90872.1| pepsinogen C [Callithrix jacchus]
Length = 388
Score = 86.3 bits (212), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 42/118 (35%), Positives = 67/118 (56%), Gaps = 3/118 (2%)
Query: 59 HFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCSN-CQ 115
+ N +GG ++FGGVD + G I ++PV+++ YWQ G+E I + + +CS CQ
Sbjct: 213 YLSNQQGSSGGAVIFGGVDSSLYTGQIYWAPVTQELYWQIGIEEFLIGGQASGWCSEGCQ 272
Query: 116 AIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
AI DTGTSL+ P + ++ + GA G V+CD++ +P + I+ G F L
Sbjct: 273 AIVDTGTSLLTVPQQYMSAFLEATGAQEDEYGQFLVNCDSIQNLPTLTFIINGVEFPL 330
Score = 58.9 bits (141), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 24/34 (70%), Positives = 29/34 (85%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHC 40
A Y+G IS+GTPPQ F V+FDTGSSNLW+PS +C
Sbjct: 71 AAYFGEISIGTPPQNFLVLFDTGSSNLWVPSVYC 104
>gi|115439013|ref|NP_001043786.1| Os01g0663400 [Oryza sativa Japonica Group]
gi|113533317|dbj|BAF05700.1| Os01g0663400 [Oryza sativa Japonica Group]
gi|215701483|dbj|BAG92907.1| unnamed protein product [Oryza sativa Japonica Group]
gi|218188796|gb|EEC71223.1| hypothetical protein OsI_03158 [Oryza sativa Indica Group]
gi|222618996|gb|EEE55128.1| hypothetical protein OsJ_02912 [Oryza sativa Japonica Group]
gi|385717674|gb|AFI71272.1| unnamed protein [Oryza sativa Japonica Group]
Length = 522
Score = 86.3 bits (212), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 41/84 (48%), Positives = 60/84 (71%), Gaps = 3/84 (3%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV--YC-SNCQAI 117
R+A + GGEI+FGG+D + + G+ TY PV+RKGYWQF + + I N +C + C AI
Sbjct: 242 RHADEGQGGEIVFGGIDPNHYKGNHTYVPVTRKGYWQFNMGDVLIGGNSTGFCAAGCAAI 301
Query: 118 ADTGTSLIIGPSKVIAELNKLIGA 141
AD+GTSL+ GP+ +I ++N+ IGA
Sbjct: 302 ADSGTSLLTGPTAIITQINEKIGA 325
Score = 67.8 bits (164), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 32/58 (55%), Positives = 39/58 (67%), Gaps = 1/58 (1%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHS 58
+++ AQYYG I++GTPPQ F VIFDTGSSNLW+PS C L+IAC G S
Sbjct: 90 LKNYLNAQYYGEIAIGTPPQMFTVIFDTGSSNLWVPSSKCH-LSIACYFHSRYKAGQS 146
>gi|999902|pdb|1HTR|B Chain B, Crystal And Molecular Structures Of Human Progastricsin At
1.62 Angstroms Resolution
gi|2982065|pdb|1AVF|A Chain A, Activation Intermediate 2 Of Human Gastricsin From Human
Stomach
gi|2982067|pdb|1AVF|J Chain J, Activation Intermediate 2 Of Human Gastricsin From Human
Stomach
Length = 329
Score = 86.3 bits (212), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 42/118 (35%), Positives = 69/118 (58%), Gaps = 3/118 (2%)
Query: 59 HFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCSN-CQ 115
+ N +GG ++FGGVD + G I ++PV+++ YWQ G+E I + + +CS CQ
Sbjct: 154 YLSNQQGSSGGAVVFGGVDSSLYTGQIYWAPVTQELYWQIGIEEFLIGGQASGWCSEGCQ 213
Query: 116 AIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
AI DTGTSL+ P + ++ L + GA G V+C+++ +P++ I+ G F L
Sbjct: 214 AIVDTGTSLLTVPQQYMSALLQATGAQEDEYGQFLVNCNSIQNLPSLTFIINGVEFPL 271
Score = 58.9 bits (141), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 24/34 (70%), Positives = 29/34 (85%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHC 40
A Y+G IS+GTPPQ F V+FDTGSSNLW+PS +C
Sbjct: 12 AAYFGEISIGTPPQNFLVLFDTGSSNLWVPSVYC 45
>gi|329665035|ref|NP_001192720.1| gastricsin precursor [Bos taurus]
Length = 391
Score = 86.3 bits (212), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 42/111 (37%), Positives = 66/111 (59%), Gaps = 3/111 (2%)
Query: 66 ENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCSN-CQAIADTGT 122
++GG ++FGGVD + G I ++PV+++ YWQ G E I + +CS CQAI DTGT
Sbjct: 222 QDGGAVIFGGVDNCLYTGQIYWAPVTQELYWQIGFEEFLIGDQATGWCSTGCQAIVDTGT 281
Query: 123 SLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
SL+ P + ++ L + GA G VDC+N+ +P + +++ G F L
Sbjct: 282 SLLTVPQQFLSALLQATGAQEDQYGQFPVDCNNIQNLPTLTLVINGVQFPL 332
Score = 59.3 bits (142), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 24/34 (70%), Positives = 29/34 (85%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHC 40
A Y+G IS+GTPPQ F V+FDTGSSNLW+PS +C
Sbjct: 72 AAYFGEISIGTPPQNFLVLFDTGSSNLWVPSVYC 105
>gi|169861123|ref|XP_001837196.1| endopeptidase [Coprinopsis cinerea okayama7#130]
gi|116501918|gb|EAU84813.1| endopeptidase [Coprinopsis cinerea okayama7#130]
Length = 411
Score = 86.3 bits (212), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 45/116 (38%), Positives = 66/116 (56%), Gaps = 2/116 (1%)
Query: 60 FR-NASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI-EKNVYCSNCQAI 117
FR +S+E+GGE FGG+D + + G + Y PV RK YW+ +E I + + + A
Sbjct: 238 FRIGSSEEDGGEATFGGIDHEAYTGKLHYVPVRRKAYWEVELEKISFGDDELELEHTGAA 297
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
DTGTSLI P+ + LN IGA NG +VDC+ + +P++ GGK + L
Sbjct: 298 IDTGTSLIALPTDMAEMLNTQIGARKSWNGQYQVDCNKVPSLPDLTFQFGGKPYPL 353
Score = 63.9 bits (154), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 36/84 (42%), Positives = 48/84 (57%), Gaps = 4/84 (4%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASD- 65
AQYY I+LGTPPQ FKVI DTGSSNLW+PS C+ + D ++ N ++
Sbjct: 97 AQYYTEITLGTPPQTFKVILDTGSSNLWVPSIKCTSIACFLHTKYDSSQSTTYKANGTEF 156
Query: 66 --ENGGEIMFGGVDKDKF-VGDIT 86
+ G M G V +D +GD+T
Sbjct: 157 SIQYGSGSMEGFVSQDTLGIGDLT 180
>gi|345783299|ref|XP_540923.3| PREDICTED: pepsin F-like [Canis lupus familiaris]
Length = 388
Score = 85.9 bits (211), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 42/116 (36%), Positives = 68/116 (58%), Gaps = 2/116 (1%)
Query: 58 SHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVY-CSN-CQ 115
+ + + +E G +MFG VD + GD+ + PVS+ YWQ ++SI + V CS C+
Sbjct: 213 AFYLSKKEEEGSVVMFGDVDHSYYRGDLNWVPVSKPFYWQLSMDSISVNGAVIACSGGCE 272
Query: 116 AIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNF 171
AI DTGTSL+IGP V+ + K+I A +G +DC ++ +P++ + G N+
Sbjct: 273 AIIDTGTSLLIGPPSVVLNIQKIINAKRSYSGEFMIDCSTINTLPDIVFAINGVNY 328
Score = 65.1 bits (157), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 28/41 (68%), Positives = 33/41 (80%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCS 41
MR+ Y GTI +GTPPQEFKVIFDTGSS+LW+PS +CS
Sbjct: 67 MRNYLDLAYVGTIRIGTPPQEFKVIFDTGSSDLWVPSIYCS 107
>gi|1585064|prf||2124254A pepsin:ISOTYPE=3a
gi|1585065|prf||2124254B pepsin:ISOTYPE=3b
Length = 326
Score = 85.9 bits (211), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 44/125 (35%), Positives = 73/125 (58%), Gaps = 5/125 (4%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-K 107
V +D F S + +A D++G ++FGG+D + G + + PV+ +GYWQ V+SI + +
Sbjct: 146 VSQDLF---SVYLSADDQSGSVVIFGGIDSSYYTGSLNWVPVTVEGYWQITVDSITMNGE 202
Query: 108 NVYCSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
+ C+ CQAI DTGTSL+ GP+ IA + IGA ++G V C + +P++ +
Sbjct: 203 AIACAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASENSDGDMVVSCSAISSLPDIVFTI 262
Query: 167 GGKNF 171
G +
Sbjct: 263 NGVQY 267
Score = 60.5 bits (145), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 25/41 (60%), Positives = 35/41 (85%), Gaps = 2/41 (4%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACS 48
+Y+GTI +GTP Q+F V+FDTGSSNLW+PS +CS ++AC+
Sbjct: 13 EYFGTIGIGTPAQDFTVVFDTGSSNLWVPSVYCS--SLACT 51
>gi|431920733|gb|ELK18506.1| Napsin-A [Pteropus alecto]
Length = 760
Score = 85.9 bits (211), Expect = 5e-15, Method: Composition-based stats.
Identities = 44/116 (37%), Positives = 68/116 (58%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAIA 118
R+ + +GGE++ GG D ++ +TY PV+ YWQ +E +++ + C++ C AI
Sbjct: 209 RDPEEADGGELVLGGSDPTHYIPPLTYVPVTVPAYWQIHMERVQVGTGLTLCAHGCAAIL 268
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSLI GPS+ I L++ IG + L G + C + ++P V LGG F LT
Sbjct: 269 DTGTSLITGPSEEIRALHRAIGGISLLVGEYLIQCSLITELPPVSFNLGGVWFNLT 324
Score = 70.5 bits (171), Expect = 2e-10, Method: Composition-based stats.
Identities = 27/41 (65%), Positives = 32/41 (78%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQYYG I LGTPPQ F V+FDTGSSNLW+PS+ C ++ C
Sbjct: 62 AQYYGEIGLGTPPQNFSVVFDTGSSNLWVPSKRCYFFSLPC 102
>gi|4505757|ref|NP_002621.1| gastricsin isoform 1 preproprotein [Homo sapiens]
gi|129796|sp|P20142.1|PEPC_HUMAN RecName: Full=Gastricsin; AltName: Full=Pepsinogen C; Flags:
Precursor
gi|387015|gb|AAA60063.1| pepsinogen C [Homo sapiens]
gi|551176|gb|AAA60074.1| pepsinogen [Homo sapiens]
gi|1658286|gb|AAB18273.1| gastricsin [Homo sapiens]
gi|49522219|gb|AAH73740.1| Progastricsin (pepsinogen C) [Homo sapiens]
gi|119624464|gb|EAX04059.1| progastricsin (pepsinogen C) [Homo sapiens]
Length = 388
Score = 85.9 bits (211), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 42/118 (35%), Positives = 69/118 (58%), Gaps = 3/118 (2%)
Query: 59 HFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCSN-CQ 115
+ N +GG ++FGGVD + G I ++PV+++ YWQ G+E I + + +CS CQ
Sbjct: 213 YLSNQQGSSGGAVVFGGVDSSLYTGQIYWAPVTQELYWQIGIEEFLIGGQASGWCSEGCQ 272
Query: 116 AIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
AI DTGTSL+ P + ++ L + GA G V+C+++ +P++ I+ G F L
Sbjct: 273 AIVDTGTSLLTVPQQYMSALLQATGAQEDEYGQFLVNCNSIQNLPSLTFIINGVEFPL 330
Score = 59.3 bits (142), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 24/34 (70%), Positives = 29/34 (85%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHC 40
A Y+G IS+GTPPQ F V+FDTGSSNLW+PS +C
Sbjct: 71 AAYFGEISIGTPPQNFLVLFDTGSSNLWVPSVYC 104
>gi|16974928|pdb|1FLH|A Chain A, Crystal Structure Of Human Uropepsin At 2.45 A Resolution
Length = 326
Score = 85.9 bits (211), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 44/125 (35%), Positives = 73/125 (58%), Gaps = 5/125 (4%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-K 107
V +D F S + +A D++G ++FGG+D + G + + PV+ +GYWQ V+SI + +
Sbjct: 146 VSQDLF---SVYLSADDQSGSVVIFGGIDSSYYTGSLNWVPVTVEGYWQITVDSITMNGE 202
Query: 108 NVYCSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
+ C+ CQAI DTGTSL+ GP+ IA + IGA ++G V C + +P++ +
Sbjct: 203 AIACAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASENSDGDMVVSCSAISSLPDIVFTI 262
Query: 167 GGKNF 171
G +
Sbjct: 263 NGVQY 267
Score = 60.5 bits (145), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 25/41 (60%), Positives = 35/41 (85%), Gaps = 2/41 (4%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACS 48
+Y+GTI +GTP Q+F V+FDTGSSNLW+PS +CS ++AC+
Sbjct: 13 EYFGTIGIGTPAQDFTVVFDTGSSNLWVPSVYCS--SLACT 51
>gi|1065259|pdb|1PSO|E Chain E, The Crystal Structure Of Human Pepsin And Its Complex With
Pepstatin
gi|5542461|pdb|1QRP|E Chain E, Human Pepsin 3a In Complex With A Phosphonate Inhibitor
Iva-Val-Val- Leu(P)-(O)phe-Ala-Ala-Ome
gi|157833570|pdb|1PSN|A Chain A, The Crystal Structure Of Human Pepsin And Its Complex With
Pepstatin
gi|361132440|pdb|3UTL|A Chain A, Human Pepsin 3b
Length = 326
Score = 85.9 bits (211), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 44/125 (35%), Positives = 73/125 (58%), Gaps = 5/125 (4%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-K 107
V +D F S + +A D++G ++FGG+D + G + + PV+ +GYWQ V+SI + +
Sbjct: 146 VSQDLF---SVYLSADDQSGSVVIFGGIDSSYYTGSLNWVPVTVEGYWQITVDSITMNGE 202
Query: 108 NVYCSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
+ C+ CQAI DTGTSL+ GP+ IA + IGA ++G V C + +P++ +
Sbjct: 203 AIACAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASENSDGDMVVSCSAISSLPDIVFTI 262
Query: 167 GGKNF 171
G +
Sbjct: 263 NGVQY 267
Score = 60.5 bits (145), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 25/41 (60%), Positives = 35/41 (85%), Gaps = 2/41 (4%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACS 48
+Y+GTI +GTP Q+F V+FDTGSSNLW+PS +CS ++AC+
Sbjct: 13 EYFGTIGIGTPAQDFTVVFDTGSSNLWVPSVYCS--SLACT 51
>gi|387014|gb|AAA60062.1| pepsinogen [Homo sapiens]
Length = 385
Score = 85.9 bits (211), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 42/118 (35%), Positives = 69/118 (58%), Gaps = 3/118 (2%)
Query: 59 HFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCSN-CQ 115
+ N +GG ++FGGVD + G I ++PV+++ YWQ G+E I + + +CS CQ
Sbjct: 210 YLSNQQGSSGGAVVFGGVDSSLYTGQIYWAPVTQELYWQIGIEEFLIGGQASGWCSEGCQ 269
Query: 116 AIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
AI DTGTSL+ P + ++ L + GA G V+C+++ +P++ I+ G F L
Sbjct: 270 AIVDTGTSLLTVPQQYMSALLQATGAQEDEYGQFLVNCNSIQNLPSLTFIINGVEFPL 327
Score = 59.3 bits (142), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 24/34 (70%), Positives = 29/34 (85%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHC 40
A Y+G IS+GTPPQ F V+FDTGSSNLW+PS +C
Sbjct: 68 AAYFGEISIGTPPQNFLVLFDTGSSNLWVPSVYC 101
>gi|354543755|emb|CCE40477.1| hypothetical protein CPAR2_105130 [Candida parapsilosis]
Length = 427
Score = 85.9 bits (211), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 54/152 (35%), Positives = 79/152 (51%), Gaps = 3/152 (1%)
Query: 24 VIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVG 83
+ +DT S N +P + +I +D +F + N +E+GG FGG D+ F G
Sbjct: 221 LAYDTISVNKIVPPIYNAINQDL--LDAPQFGFYLGDTNKDEEDGGVATFGGYDESLFQG 278
Query: 84 DITYSPVSRKGYWQFGVESIKI-EKNVYCSNCQAIADTGTSLIIGPSKVIAELNKLIGAV 142
IT+ PV RK YW+ E I + ++ A DTGTSLI PS + +N IGA
Sbjct: 279 KITWLPVRRKAYWEVAFEGIGLGDEYAELFKTGAAIDTGTSLITLPSTLAEIINSKIGAT 338
Query: 143 PLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
+G +VDC D++P++ + G NFTLT
Sbjct: 339 KSWSGQYQVDCAKRDELPDLTLTFSGYNFTLT 370
Score = 59.3 bits (142), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 34/83 (40%), Positives = 46/83 (55%), Gaps = 4/83 (4%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASD- 65
AQY+ I +GTP Q FKVI DTGSSNLW+PSQ C+ L D ++ N S+
Sbjct: 110 AQYFTEIQIGTPGQTFKVILDTGSSNLWVPSQDCTSLACFLHSKYDHDASSTYKANGSEF 169
Query: 66 --ENGGEIMFGGVDKDKF-VGDI 85
+ G M G + +D +GD+
Sbjct: 170 SIQYGSGSMEGYISQDTVSIGDL 192
>gi|224115794|ref|XP_002317126.1| predicted protein [Populus trichocarpa]
gi|222860191|gb|EEE97738.1| predicted protein [Populus trichocarpa]
Length = 512
Score = 85.9 bits (211), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 42/84 (50%), Positives = 58/84 (69%), Gaps = 3/84 (3%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCS-NCQAI 117
RNA DE GGEI+FGG+D + + G TY PV++KGYWQF + + + + YC+ C AI
Sbjct: 232 RNADDEEGGEIVFGGMDPNHYKGKHTYVPVTQKGYWQFDMGDVIVGDKSTGYCAGGCAAI 291
Query: 118 ADTGTSLIIGPSKVIAELNKLIGA 141
AD+GTSL+ GP+ +I +N IGA
Sbjct: 292 ADSGTSLLAGPTAIITMINHAIGA 315
Score = 68.9 bits (167), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 29/47 (61%), Positives = 38/47 (80%), Gaps = 1/47 (2%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+++ + AQYYG I +GTPPQ+F VIFDTGSSNLW+PS C L++AC
Sbjct: 80 LKNYFDAQYYGEIGVGTPPQKFTVIFDTGSSNLWVPSSKC-YLSVAC 125
>gi|21616053|emb|CAC86004.1| aspartic proteinase [Theobroma cacao]
Length = 514
Score = 85.9 bits (211), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 45/93 (48%), Positives = 61/93 (65%), Gaps = 3/93 (3%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCS-NCQAI 117
RN +E GGEI+FGGVD + + G TY PV++KGYWQF + + I + YC+ +C AI
Sbjct: 234 RNVDEEAGGEIVFGGVDPNHYKGKHTYVPVTQKGYWQFDMGDVLIADKPTGYCAGSCAAI 293
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAK 150
AD+GTSL+ GPS VI +N IGA + + K
Sbjct: 294 ADSGTSLLAGPSTVITMINHAIGATGVVSQECK 326
Score = 59.3 bits (142), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 26/47 (55%), Positives = 34/47 (72%), Gaps = 1/47 (2%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+++ AQYYG I +GTP Q+F VIFDTGSSNLW+ S C ++AC
Sbjct: 82 LKNYMDAQYYGEIGIGTPTQKFTVIFDTGSSNLWVSSTKC-YFSVAC 127
Score = 39.3 bits (90), Expect = 0.64, Method: Compositional matrix adjust.
Identities = 17/45 (37%), Positives = 27/45 (60%)
Query: 130 KVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
++++ +N+L VP G + VDC +L MP + +GGK F LT
Sbjct: 410 RILSYVNELCDRVPNPMGESAVDCGSLSSMPTISFTIGGKVFDLT 454
>gi|388579370|gb|EIM19694.1| aspartyl proteinase [Wallemia sebi CBS 633.66]
Length = 411
Score = 85.9 bits (211), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 44/115 (38%), Positives = 65/115 (56%), Gaps = 4/115 (3%)
Query: 64 SDENGG---EIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI-EKNVYCSNCQAIAD 119
+DE G + +FGG+D D + G + Y P+ RKGYW+ +E + + V N A D
Sbjct: 241 TDEQSGKESQAVFGGIDHDHYKGQLHYVPLRRKGYWEVELEKLTFGDDEVELENTGAAID 300
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
TGTSLI P+ + LNK+IGA +G VDC+ +D +P + GGK + L+
Sbjct: 301 TGTSLIAIPTDMAEMLNKMIGAKKSWSGQYTVDCNKVDDLPELSFTFGGKKYPLS 355
Score = 58.9 bits (141), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 25/45 (55%), Positives = 33/45 (73%), Gaps = 2/45 (4%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDK 51
AQYY I LG+P Q+F V+ DTGSSNLW+PS C ++IAC + +
Sbjct: 97 AQYYAEIGLGSPEQKFNVVLDTGSSNLWVPSNKC--MSIACFLHR 139
>gi|509163|emb|CAA48939.1| cyprosin [Cynara cardunculus]
Length = 474
Score = 85.9 bits (211), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 43/101 (42%), Positives = 65/101 (64%), Gaps = 3/101 (2%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYC-SNCQAI 117
RNA ++ GGE++FGGVD + F G+ TY PV++KGYWQF + + I + +C S C AI
Sbjct: 195 RNADEQEGGELVFGGVDPNHFKGEHTYVPVTQKGYWQFEMGDVLIGDKTTGFCASGCAAI 254
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDK 158
AD+GTSL+ G + ++ ++N+ IGA + + K D K
Sbjct: 255 ADSGTSLLAGTTTIVTQINQAIGAAGVMSQQCKSLVDQYGK 295
Score = 65.1 bits (157), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 30/62 (48%), Positives = 42/62 (67%), Gaps = 8/62 (12%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHF 60
+++ AQY+G I +GTPPQ+F VIFDTGSSNLW+PS C ++AC + HS +
Sbjct: 43 LKNYMDAQYFGEIGIGTPPQKFTVIFDTGSSNLWVPSSKC-YFSVAC-------LFHSKY 94
Query: 61 RN 62
R+
Sbjct: 95 RS 96
Score = 39.3 bits (90), Expect = 0.64, Method: Compositional matrix adjust.
Identities = 17/44 (38%), Positives = 26/44 (59%)
Query: 131 VIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
+I ++KL +P G + VDC +L MPN+ +GGK F L+
Sbjct: 371 IINYVDKLCERLPSPMGESAVDCSSLSSMPNIAFTVGGKTFNLS 414
>gi|441648777|ref|XP_003266334.2| PREDICTED: LOW QUALITY PROTEIN: gastricsin [Nomascus leucogenys]
Length = 388
Score = 85.9 bits (211), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 42/118 (35%), Positives = 68/118 (57%), Gaps = 3/118 (2%)
Query: 59 HFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCSN-CQ 115
+ N +GG ++FGGVD + G I ++PV+++ YWQ G+E I + + +CS CQ
Sbjct: 213 YLSNQEGSSGGAVVFGGVDSSLYTGQIYWAPVTQELYWQIGIEEFLIGGQASGWCSEGCQ 272
Query: 116 AIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
AI DTGTSL+ P + ++ L + GA G V+C+++ +P + I+ G F L
Sbjct: 273 AIVDTGTSLLTVPQQYMSALLQATGAQEDEYGQFLVNCNSIQNLPTLTFIINGVEFPL 330
Score = 58.5 bits (140), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 23/34 (67%), Positives = 29/34 (85%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHC 40
A Y+G +S+GTPPQ F V+FDTGSSNLW+PS +C
Sbjct: 71 AAYFGEVSIGTPPQNFLVLFDTGSSNLWVPSVYC 104
>gi|1585066|prf||2124254C pepsin:ISOTYPE=3c
Length = 326
Score = 85.9 bits (211), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 44/125 (35%), Positives = 74/125 (59%), Gaps = 5/125 (4%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-K 107
V +D F S + +A D++G ++FGG+D + G + + PV+ +GYWQ V+SI + +
Sbjct: 146 VSQDLF---SVYLSADDKSGSVVIFGGIDSSYYTGSLNWVPVTVEGYWQITVDSITMNGE 202
Query: 108 NVYCSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
+ C+ CQAI DTGTSL+ GP+ IA++ IGA ++G V C + +P++ +
Sbjct: 203 AIACAEGCQAIVDTGTSLLTGPTSPIAKIQSDIGASENSDGDMVVSCSAISSLPDIVFTI 262
Query: 167 GGKNF 171
G +
Sbjct: 263 NGVQY 267
Score = 60.5 bits (145), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 25/41 (60%), Positives = 35/41 (85%), Gaps = 2/41 (4%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACS 48
+Y+GTI +GTP Q+F V+FDTGSSNLW+PS +CS ++AC+
Sbjct: 13 EYFGTIGIGTPAQDFTVVFDTGSSNLWVPSVYCS--SLACT 51
>gi|291409605|ref|XP_002721070.1| PREDICTED: pepsin II-1-like [Oryctolagus cuniculus]
Length = 387
Score = 85.9 bits (211), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 42/127 (33%), Positives = 73/127 (57%), Gaps = 5/127 (3%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-K 107
V +D F S + ++ ++ G +MFGG+D + G + + PVS +GYWQ V+SI ++ +
Sbjct: 207 VSQDLF---SVYLSSDEQKGSLVMFGGIDSSYYTGSLNWVPVSHEGYWQITVDSITMDGE 263
Query: 108 NVYCSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
+ C++ CQA+ DTGTSL+ GP+ I+ + IGA G + C +D +P++ +
Sbjct: 264 TIACADSCQAVVDTGTSLLAGPTSAISNIQSYIGASKNLLGENIISCSAIDSLPDIVFTI 323
Query: 167 GGKNFTL 173
+ L
Sbjct: 324 NNVQYPL 330
Score = 69.3 bits (168), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 30/45 (66%), Positives = 38/45 (84%), Gaps = 2/45 (4%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDK 51
+Y+GTIS+GTPPQEF VIFDTGSSNLW+PS +CS ++AC + K
Sbjct: 73 TEYFGTISIGTPPQEFTVIFDTGSSNLWVPSTYCS--SLACILHK 115
>gi|222425200|dbj|BAH20549.1| pepsinogen A-50 [Pongo abelii]
Length = 388
Score = 85.9 bits (211), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 44/125 (35%), Positives = 73/125 (58%), Gaps = 5/125 (4%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-K 107
V +D F S + +A D++G ++FGG+D + G + + PV+ +GYWQ V+SI + +
Sbjct: 208 VSQDLF---SVYLSADDQSGSVVIFGGIDSSYYTGSLNWVPVTVEGYWQITVDSITMNGE 264
Query: 108 NVYCSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
+ C+ CQAI DTGTSL+ GP+ IA + IGA ++G V C + +P++ +
Sbjct: 265 AIACAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASENSDGDMVVSCSAISSLPDIVFTI 324
Query: 167 GGKNF 171
G +
Sbjct: 325 NGVQY 329
Score = 60.8 bits (146), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 26/41 (63%), Positives = 35/41 (85%), Gaps = 2/41 (4%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACS 48
+Y+GTI +GTP Q+F VIFDTGSSNLW+PS +CS ++AC+
Sbjct: 75 EYFGTIGIGTPAQDFTVIFDTGSSNLWVPSVYCS--SLACT 113
>gi|452840489|gb|EME42427.1| hypothetical protein DOTSEDRAFT_73302 [Dothistroma septosporum
NZE10]
Length = 398
Score = 85.9 bits (211), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 48/126 (38%), Positives = 71/126 (56%), Gaps = 3/126 (2%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI-EK 107
+D+ F + N DE+ E +FGGV+KD + G++T P+ RK YW+ +++I ++
Sbjct: 218 LDEKVFAFYLSDTNKGDES--EAIFGGVNKDHYTGEMTKIPLRRKAYWEVDLDAITFGDQ 275
Query: 108 NVYCSNCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILG 167
+ AI DTGTSL+ PS + LNK IGA NG VDC D +P++ L
Sbjct: 276 TAEIDSTGAILDTGTSLLALPSTLAELLNKEIGAKKGYNGQYTVDCSKRDSLPDLTFTLT 335
Query: 168 GKNFTL 173
G NFT+
Sbjct: 336 GHNFTI 341
Score = 67.0 bits (162), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 39/85 (45%), Positives = 52/85 (61%), Gaps = 8/85 (9%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDK--DKFVGHSHFRNAS 64
AQY+ I++GTPPQEFKV+ DTGSSNLW+PSQ C +IAC + D ++ +N S
Sbjct: 84 AQYFSEIAIGTPPQEFKVVLDTGSSNLWVPSQDCG--SIACYLHSKYDHSESSTYKKNGS 141
Query: 65 D---ENGGEIMFGGVDKDKF-VGDI 85
D G + G V KD +GD+
Sbjct: 142 DFAIRYGSGSLEGYVSKDTVQIGDL 166
>gi|195121164|ref|XP_002005091.1| GI20282 [Drosophila mojavensis]
gi|193910159|gb|EDW09026.1| GI20282 [Drosophila mojavensis]
Length = 392
Score = 85.9 bits (211), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 42/114 (36%), Positives = 58/114 (50%), Gaps = 1/114 (0%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN-CQAIAD 119
R GG + FGG + G+ TY PV+ + YWQ +E+ +I K C N CQ I D
Sbjct: 217 RELGSNQGGYLFFGGSSSRYYRGNFTYVPVTHRAYWQVKLETARIGKLQLCLNGCQVIID 276
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
TGTS + P + +N+ IG P A G V CD + +P + LG + F L
Sbjct: 277 TGTSFLAVPYEQAILINESIGGTPAAYGQFSVPCDQVAHLPTLTFTLGNRRFQL 330
Score = 60.5 bits (145), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 29/54 (53%), Positives = 36/54 (66%), Gaps = 8/54 (14%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHF 60
AQY+G IS+GTP Q F VIFDTGS+NLW+PS+ C +AC + HS F
Sbjct: 71 AQYFGPISIGTPQQTFNVIFDTGSANLWVPSESCQ-KKLACQI-------HSRF 116
>gi|224458280|ref|NP_001138943.1| gastricsin precursor [Pongo abelii]
gi|222425206|dbj|BAH20552.1| pepsinogen C [Pongo abelii]
Length = 388
Score = 85.9 bits (211), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 42/118 (35%), Positives = 68/118 (57%), Gaps = 3/118 (2%)
Query: 59 HFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCSN-CQ 115
+ N +GG ++FGGVD + G I ++PV+++ YWQ G+E I + + +CS CQ
Sbjct: 213 YLSNQQGSSGGAVVFGGVDSSLYTGQIYWAPVTQELYWQIGIEEFLIGGQASGWCSEGCQ 272
Query: 116 AIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
AI DTGTSL+ P + ++ L + GA G V+C+++ +P + I+ G F L
Sbjct: 273 AIVDTGTSLLTVPQQYMSALLQATGAQEDEYGQFLVNCNSIQNLPTLTFIINGVEFPL 330
Score = 58.9 bits (141), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 24/34 (70%), Positives = 29/34 (85%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHC 40
A Y+G IS+GTPPQ F V+FDTGSSNLW+PS +C
Sbjct: 71 AAYFGEISIGTPPQNFLVLFDTGSSNLWVPSVYC 104
>gi|150866171|ref|XP_001385673.2| aspartic proteinase precursor [Scheffersomyces stipitis CBS 6054]
gi|149387427|gb|ABN67644.2| aspartic proteinase precursor [Scheffersomyces stipitis CBS 6054]
Length = 417
Score = 85.9 bits (211), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 46/127 (36%), Positives = 68/127 (53%), Gaps = 1/127 (0%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI-EK 107
+D+ +F + + +GG FGG D+ F G IT+ PV RK YW+ E I + ++
Sbjct: 235 LDEPQFAFYLGDTKKDENDGGLATFGGYDESAFTGKITWLPVRRKAYWEVSFEGIGLGDE 294
Query: 108 NVYCSNCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILG 167
N A DTGTSLI PS + +N IGA +G ++DC+ D +P++ +
Sbjct: 295 YAELDNTGAAIDTGTSLITLPSSLAEIINAKIGATKSWSGQYQIDCEKQDTLPDLTLNFA 354
Query: 168 GKNFTLT 174
G NFTLT
Sbjct: 355 GYNFTLT 361
Score = 67.4 bits (163), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 38/83 (45%), Positives = 48/83 (57%), Gaps = 4/83 (4%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASD- 65
AQY+ ISLGTP Q+FKVI DTGSSNLW+PSQ CS L D ++ N S+
Sbjct: 101 AQYFTEISLGTPAQQFKVILDTGSSNLWVPSQECSSLACFLHTKYDHDSSSTYKANGSEF 160
Query: 66 --ENGGEIMFGGVDKDKF-VGDI 85
+ G M G V +D +GD+
Sbjct: 161 SIQYGSGAMEGYVSQDTLAIGDL 183
>gi|297688536|ref|XP_002821738.1| PREDICTED: pepsin A-4 [Pongo abelii]
Length = 388
Score = 85.9 bits (211), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 44/125 (35%), Positives = 73/125 (58%), Gaps = 5/125 (4%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-K 107
V +D F S + +A D++G ++FGG+D + G + + PV+ +GYWQ V+SI + +
Sbjct: 208 VSQDLF---SVYLSADDQSGSVVIFGGIDSSYYTGSLNWVPVTVEGYWQITVDSITMNGE 264
Query: 108 NVYCSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
+ C+ CQAI DTGTSL+ GP+ IA + IGA ++G V C + +P++ +
Sbjct: 265 AIACAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASENSDGDMVVSCSAISSLPDIVFTI 324
Query: 167 GGKNF 171
G +
Sbjct: 325 NGVQY 329
Score = 60.8 bits (146), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 26/41 (63%), Positives = 35/41 (85%), Gaps = 2/41 (4%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACS 48
+Y+GTI +GTP Q+F VIFDTGSSNLW+PS +CS ++AC+
Sbjct: 75 EYFGTIGIGTPAQDFTVIFDTGSSNLWVPSVYCS--SLACT 113
>gi|24417300|gb|AAN60260.1| unknown [Arabidopsis thaliana]
Length = 168
Score = 85.9 bits (211), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 43/84 (51%), Positives = 57/84 (67%), Gaps = 3/84 (3%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYC-SNCQAI 117
RNA +E GGE++FGGVD + F G TY PV++KGYWQF + + I +C S C AI
Sbjct: 73 RNADEEEGGELVFGGVDPNHFKGKHTYVPVTQKGYWQFDMGDVLIGGAPTGFCESGCSAI 132
Query: 118 ADTGTSLIIGPSKVIAELNKLIGA 141
AD+GTSL+ GP+ +I +N IGA
Sbjct: 133 ADSGTSLLAGPTTIITMINHAIGA 156
>gi|146414892|ref|XP_001483416.1| hypothetical protein PGUG_04145 [Meyerozyma guilliermondii ATCC
6260]
gi|146391889|gb|EDK40047.1| hypothetical protein PGUG_04145 [Meyerozyma guilliermondii ATCC
6260]
Length = 408
Score = 85.9 bits (211), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 46/115 (40%), Positives = 66/115 (57%), Gaps = 2/115 (1%)
Query: 62 NASDEN-GGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI-EKNVYCSNCQAIAD 119
+ DEN GG FGG DK +F G IT+ PV RK YW+ E I + ++ ++ A D
Sbjct: 238 TSKDENDGGVATFGGYDKSQFTGKITWLPVRRKAYWEVSFEGIGLGDEYAELTSTGAAID 297
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
TGTSLI PS + +N IGA +G ++DC+ D +P++ + G NFTL+
Sbjct: 298 TGTSLITLPSSLAEIMNTKIGATKSWSGQYQIDCEKRDSLPDLTLNFSGYNFTLS 352
Score = 60.5 bits (145), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 36/83 (43%), Positives = 45/83 (54%), Gaps = 4/83 (4%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASD- 65
AQY+ I LGTP Q FKVI DTGSSNLW+PS CS L D ++ N S+
Sbjct: 92 AQYFTEIQLGTPGQTFKVILDTGSSNLWVPSADCSSLACFLHTKYDHDSSSTYKANGSEF 151
Query: 66 --ENGGEIMFGGVDKDKF-VGDI 85
+ G M G V +D +GD+
Sbjct: 152 SIQYGSGAMEGYVSRDTLALGDL 174
>gi|400598686|gb|EJP66395.1| vacuolar protease A [Beauveria bassiana ARSEF 2860]
Length = 395
Score = 85.5 bits (210), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 54/153 (35%), Positives = 79/153 (51%), Gaps = 7/153 (4%)
Query: 26 FDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDI 85
+DT S N +P + ++N +D+ F F S+++G E +FGGVDKD + G I
Sbjct: 196 YDTISVNKMVPPFY-QMINQKL-IDEPVFA----FYLGSEDSGSEAIFGGVDKDHYEGKI 249
Query: 86 TYSPVSRKGYWQFGVESIKIEKNVY-CSNCQAIADTGTSLIIGPSKVIAELNKLIGAVPL 144
Y P+ RK YW+ ++I V N I DTGTSL P+ + LNK IGA
Sbjct: 250 EYIPLRRKAYWEVDFDAIAFGDEVAELENTGVILDTGTSLNTLPTDLAELLNKEIGAKKG 309
Query: 145 ANGPAKVDCDNLDKMPNVDIILGGKNFTLTRTN 177
G +DC D +P++ L G N+TL ++
Sbjct: 310 FGGQYSIDCKARDSLPDITFTLAGSNYTLPASD 342
Score = 65.1 bits (157), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 28/41 (68%), Positives = 34/41 (82%), Gaps = 2/41 (4%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQY+ I++GTPPQ FKV+ DTGSSNLW+PSQ CS +IAC
Sbjct: 83 AQYFSEITIGTPPQTFKVVLDTGSSNLWVPSQSCS--SIAC 121
>gi|224068986|ref|XP_002302872.1| predicted protein [Populus trichocarpa]
gi|222844598|gb|EEE82145.1| predicted protein [Populus trichocarpa]
Length = 505
Score = 85.5 bits (210), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 43/84 (51%), Positives = 57/84 (67%), Gaps = 3/84 (3%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCS-NCQAI 117
RN E GGEI+FGGVD + + G+ TY PV+ KGYWQF + + I E C+ C+AI
Sbjct: 225 RNVEGEEGGEIVFGGVDPNHYKGEHTYVPVTHKGYWQFDMGDLLIGTETTGLCAGGCKAI 284
Query: 118 ADTGTSLIIGPSKVIAELNKLIGA 141
AD+GTSL+ GP+ VI ++N IGA
Sbjct: 285 ADSGTSLLAGPTTVITQINNAIGA 308
Score = 65.9 bits (159), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 28/47 (59%), Positives = 36/47 (76%), Gaps = 1/47 (2%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+++ AQYYG I++GTPPQ F VIFDTGSSNLW+PS C ++AC
Sbjct: 73 LKNYLDAQYYGEITIGTPPQTFTVIFDTGSSNLWVPSSKC-YFSLAC 118
>gi|219521691|gb|AAI71808.1| Pepsinogen 4, group I (pepsinogen A) [Homo sapiens]
Length = 388
Score = 85.5 bits (210), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 44/125 (35%), Positives = 73/125 (58%), Gaps = 5/125 (4%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-K 107
V +D F S + +A D++G ++FGG+D + G + + PV+ +GYWQ V+SI + +
Sbjct: 208 VSQDLF---SVYLSADDQSGSVVIFGGIDSSYYTGSLNWVPVTVEGYWQITVDSITMNGE 264
Query: 108 NVYCSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
+ C+ CQAI DTGTSL+ GP+ IA + IGA ++G V C + +P++ +
Sbjct: 265 AIACAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASENSDGDMVVSCSAISSLPDIVFTI 324
Query: 167 GGKNF 171
G +
Sbjct: 325 NGVQY 329
Score = 60.1 bits (144), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 25/41 (60%), Positives = 35/41 (85%), Gaps = 2/41 (4%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACS 48
+Y+GTI +GTP Q+F V+FDTGSSNLW+PS +CS ++AC+
Sbjct: 75 EYFGTIGIGTPAQDFTVLFDTGSSNLWVPSVYCS--SLACT 113
>gi|440905526|gb|ELR55898.1| Gastricsin [Bos grunniens mutus]
Length = 391
Score = 85.5 bits (210), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 42/111 (37%), Positives = 65/111 (58%), Gaps = 3/111 (2%)
Query: 66 ENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCSN-CQAIADTGT 122
++GG ++FGGVD + G I ++PV+++ YWQ G E I + +CS CQAI DTGT
Sbjct: 222 QDGGAVIFGGVDSCLYTGQIYWAPVTQELYWQIGFEEFLIGDQATGWCSTGCQAIVDTGT 281
Query: 123 SLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
SL+ P + ++ L + GA G VDC+N+ +P + ++ G F L
Sbjct: 282 SLLTVPQQFLSALLQATGAQEDQYGQFPVDCNNIQNLPTLTFVINGVQFPL 332
Score = 59.3 bits (142), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 24/34 (70%), Positives = 29/34 (85%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHC 40
A Y+G IS+GTPPQ F V+FDTGSSNLW+PS +C
Sbjct: 72 AAYFGEISIGTPPQNFLVLFDTGSSNLWVPSVYC 105
>gi|189066533|dbj|BAG35783.1| unnamed protein product [Homo sapiens]
gi|193785072|dbj|BAG54225.1| unnamed protein product [Homo sapiens]
gi|219521010|gb|AAI71815.1| Pepsinogen 3, group I (pepsinogen A) [Homo sapiens]
Length = 388
Score = 85.5 bits (210), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 44/125 (35%), Positives = 73/125 (58%), Gaps = 5/125 (4%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-K 107
V +D F S + +A D++G ++FGG+D + G + + PV+ +GYWQ V+SI + +
Sbjct: 208 VSQDLF---SVYLSADDQSGSVVIFGGIDSSYYTGSLNWVPVTVEGYWQITVDSITMNGE 264
Query: 108 NVYCSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
+ C+ CQAI DTGTSL+ GP+ IA + IGA ++G V C + +P++ +
Sbjct: 265 AIACAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASENSDGDMVVSCSAISSLPDIVFTI 324
Query: 167 GGKNF 171
G +
Sbjct: 325 NGVQY 329
Score = 60.1 bits (144), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 25/41 (60%), Positives = 35/41 (85%), Gaps = 2/41 (4%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACS 48
+Y+GTI +GTP Q+F V+FDTGSSNLW+PS +CS ++AC+
Sbjct: 75 EYFGTIGIGTPAQDFTVLFDTGSSNLWVPSVYCS--SLACT 113
>gi|847749|gb|AAA73476.1| prochymosin [Bos primigenius]
Length = 345
Score = 85.5 bits (210), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 47/114 (41%), Positives = 62/114 (54%), Gaps = 4/114 (3%)
Query: 65 DENGGEIMF--GGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYC--SNCQAIADT 120
D NG E M G +D + G + + PV+ + YWQF V+S+ I V CQAI DT
Sbjct: 180 DRNGQESMLTLGAIDPSYYTGSLHWVPVTVQQYWQFTVDSVTISGVVVACEGGCQAILDT 239
Query: 121 GTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
GTS ++GPS I + + IGA G +DCDNL MP V + GK + LT
Sbjct: 240 GTSKLVGPSSDILNIQQAIGATQNQYGEFDIDCDNLSYMPTVVFEINGKMYPLT 293
Score = 45.8 bits (107), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 19/26 (73%), Positives = 23/26 (88%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSN 32
+QY+G I LGTPPQEF V+FDTGSS+
Sbjct: 56 SQYFGKIYLGTPPQEFTVLFDTGSSD 81
>gi|219520803|gb|AAI71814.1| Pepsinogen 4, group I (pepsinogen A) [Homo sapiens]
Length = 388
Score = 85.5 bits (210), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 44/125 (35%), Positives = 73/125 (58%), Gaps = 5/125 (4%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-K 107
V +D F S + +A D++G ++FGG+D + G + + PV+ +GYWQ V+SI + +
Sbjct: 208 VSQDLF---SVYLSADDQSGSVVIFGGIDSSYYTGSLNWVPVTVEGYWQITVDSITMNGE 264
Query: 108 NVYCSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
+ C+ CQAI DTGTSL+ GP+ IA + IGA ++G V C + +P++ +
Sbjct: 265 AIACAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASENSDGDMVVSCSAISSLPDIVFTI 324
Query: 167 GGKNF 171
G +
Sbjct: 325 NGVQY 329
Score = 60.1 bits (144), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 25/41 (60%), Positives = 35/41 (85%), Gaps = 2/41 (4%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACS 48
+Y+GTI +GTP Q+F V+FDTGSSNLW+PS +CS ++AC+
Sbjct: 75 EYFGTIGIGTPAQDFTVLFDTGSSNLWVPSVYCS--SLACT 113
>gi|119372298|ref|NP_001073275.1| pepsin A preproprotein [Homo sapiens]
gi|378521956|sp|P0DJD8.1|PEPA3_HUMAN RecName: Full=Pepsin A-3; AltName: Full=Pepsinogen-3; Flags:
Precursor
gi|182887917|gb|AAI60184.1| Pepsinogen 3, group I (pepsinogen A) [synthetic construct]
Length = 388
Score = 85.5 bits (210), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 44/125 (35%), Positives = 73/125 (58%), Gaps = 5/125 (4%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-K 107
V +D F S + +A D++G ++FGG+D + G + + PV+ +GYWQ V+SI + +
Sbjct: 208 VSQDLF---SVYLSADDQSGSVVIFGGIDSSYYTGSLNWVPVTVEGYWQITVDSITMNGE 264
Query: 108 NVYCSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
+ C+ CQAI DTGTSL+ GP+ IA + IGA ++G V C + +P++ +
Sbjct: 265 AIACAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASENSDGDMVVSCSAISSLPDIVFTI 324
Query: 167 GGKNF 171
G +
Sbjct: 325 NGVQY 329
Score = 60.5 bits (145), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 25/41 (60%), Positives = 35/41 (85%), Gaps = 2/41 (4%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACS 48
+Y+GTI +GTP Q+F V+FDTGSSNLW+PS +CS ++AC+
Sbjct: 75 EYFGTIGIGTPAQDFTVVFDTGSSNLWVPSVYCS--SLACT 113
>gi|300835014|gb|ADK37836.1| putative aspartic endopeptidase [Entomophthora planchoniana]
Length = 282
Score = 85.5 bits (210), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 41/113 (36%), Positives = 65/113 (57%), Gaps = 1/113 (0%)
Query: 62 NASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI-EKNVYCSNCQAIADT 120
N ++GGE++ GG+DK F G IT+ PVSRKGYW+ ++ + ++ V A DT
Sbjct: 113 NGGSDHGGEMVLGGIDKKHFKGKITWVPVSRKGYWEVSLDKVAFGDEEVELPKTGAAIDT 172
Query: 121 GTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
G+SL++ PS + +N+ IGA G +DC + +P++ + G FTL
Sbjct: 173 GSSLLVVPSDLADMINRFIGAKKGFGGQYTIDCAQVPSLPDLTLTFAGHPFTL 225
>gi|387013|gb|AAA60061.1| pepsinogen A [Homo sapiens]
Length = 388
Score = 85.5 bits (210), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 44/125 (35%), Positives = 73/125 (58%), Gaps = 5/125 (4%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-K 107
V +D F S + +A D++G ++FGG+D + G + + PV+ +GYWQ V+SI + +
Sbjct: 208 VSQDLF---SVYLSADDQSGSVVIFGGIDSSYYTGSLNWVPVTVEGYWQITVDSITMNGE 264
Query: 108 NVYCSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
+ C+ CQAI DTGTSL+ GP+ IA + IGA ++G V C + +P++ +
Sbjct: 265 AIACAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASENSDGDMVVSCSAISSLPDIVFTI 324
Query: 167 GGKNF 171
G +
Sbjct: 325 NGVQY 329
Score = 60.1 bits (144), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 25/41 (60%), Positives = 35/41 (85%), Gaps = 2/41 (4%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACS 48
+Y+GTI +GTP Q+F V+FDTGSSNLW+PS +CS ++AC+
Sbjct: 75 EYFGTIGIGTPAQDFTVLFDTGSSNLWVPSVYCS--SLACT 113
>gi|195029909|ref|XP_001987814.1| GH19747 [Drosophila grimshawi]
gi|193903814|gb|EDW02681.1| GH19747 [Drosophila grimshawi]
Length = 390
Score = 85.5 bits (210), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 40/114 (35%), Positives = 60/114 (52%), Gaps = 1/114 (0%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN-CQAIAD 119
R+++ GG + FGG + + G+ TY PV+ + YWQ +E+ I K C C I D
Sbjct: 215 RDSTSLQGGYLFFGGSSRRYYRGNFTYVPVTHRAYWQVKLEAAYIGKLQMCQKGCHVIID 274
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
TGTS I P + +N+ IG P A G V C+ + +P + LGG+ F +
Sbjct: 275 TGTSFIAVPYEQAILINESIGGTPAAYGQFSVPCEQVPHLPTLSFALGGRRFQM 328
Score = 63.5 bits (153), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 27/43 (62%), Positives = 34/43 (79%), Gaps = 1/43 (2%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSV 49
AQY+G IS+GTPPQ F VIFDTGS+NLW+PS+ C +AC +
Sbjct: 69 AQYFGPISIGTPPQTFNVIFDTGSANLWVPSETCH-RKLACQI 110
>gi|114607413|ref|XP_518465.2| PREDICTED: gastricsin isoform 2 [Pan troglodytes]
Length = 388
Score = 85.5 bits (210), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 42/118 (35%), Positives = 68/118 (57%), Gaps = 3/118 (2%)
Query: 59 HFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCSN-CQ 115
+ N +GG ++FGGVD + G I ++PV+++ YWQ G+E I + + +CS CQ
Sbjct: 213 YLSNQQGSSGGAVVFGGVDSSLYTGQIYWAPVTQELYWQIGIEEFLIGGQASGWCSEGCQ 272
Query: 116 AIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
AI DTGTSL+ P + ++ L + GA G V+C+++ +P + I+ G F L
Sbjct: 273 AIVDTGTSLLTVPQQYMSALLEATGAQEDEYGQFLVNCNSIQNLPTLTFIINGVEFPL 330
Score = 59.3 bits (142), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 29/54 (53%), Positives = 36/54 (66%), Gaps = 9/54 (16%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHF 60
A Y+G IS+GTPPQ F V+FDTGSSNLW+PS +C + AC+ HS F
Sbjct: 71 AAYFGEISIGTPPQNFLVLFDTGSSNLWVPSVYCQ--SQACT-------SHSRF 115
>gi|23943854|ref|NP_055039.1| pepsin A-5 preproprotein [Homo sapiens]
gi|378522017|sp|P0DJD9.1|PEPA5_HUMAN RecName: Full=Pepsin A-5; AltName: Full=Pepsinogen-5; Flags:
Precursor
gi|20810074|gb|AAH29055.1| Pepsinogen 5, group I (pepsinogen A) [Homo sapiens]
gi|119594334|gb|EAW73928.1| pepsinogen 5, group I (pepsinogen A) [Homo sapiens]
gi|219520836|gb|AAI71889.1| Pepsinogen 5, group I (pepsinogen A) [Homo sapiens]
gi|223461673|gb|AAI47000.1| Pepsinogen 5, group I (pepsinogen A) [Homo sapiens]
Length = 388
Score = 85.5 bits (210), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 44/125 (35%), Positives = 73/125 (58%), Gaps = 5/125 (4%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-K 107
V +D F S + +A D++G ++FGG+D + G + + PV+ +GYWQ V+SI + +
Sbjct: 208 VSQDLF---SVYLSADDKSGSVVIFGGIDSSYYTGSLNWVPVTVEGYWQITVDSITMNGE 264
Query: 108 NVYCSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
+ C+ CQAI DTGTSL+ GP+ IA + IGA ++G V C + +P++ +
Sbjct: 265 TIACAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASENSDGDMVVSCSAISSLPDIVFTI 324
Query: 167 GGKNF 171
G +
Sbjct: 325 NGVQY 329
Score = 60.5 bits (145), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 25/41 (60%), Positives = 35/41 (85%), Gaps = 2/41 (4%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACS 48
+Y+GTI +GTP Q+F V+FDTGSSNLW+PS +CS ++AC+
Sbjct: 75 EYFGTIGIGTPAQDFTVVFDTGSSNLWVPSVYCS--SLACT 113
>gi|426353119|ref|XP_004044046.1| PREDICTED: gastricsin [Gorilla gorilla gorilla]
Length = 388
Score = 85.5 bits (210), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 42/118 (35%), Positives = 68/118 (57%), Gaps = 3/118 (2%)
Query: 59 HFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCSN-CQ 115
+ N +GG ++FGGVD + G I ++PV+++ YWQ G+E I + + +CS CQ
Sbjct: 213 YLSNQQGSSGGAVVFGGVDNSLYTGQIYWAPVTQELYWQIGIEEFLIGGQASGWCSEGCQ 272
Query: 116 AIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
AI DTGTSL+ P + ++ L + GA G V+C+++ +P + I+ G F L
Sbjct: 273 AIVDTGTSLLTVPQQYMSALLQATGAQEDEYGQFLVNCNSIQNLPTLTFIINGVEFPL 330
Score = 58.9 bits (141), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 24/34 (70%), Positives = 29/34 (85%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHC 40
A Y+G IS+GTPPQ F V+FDTGSSNLW+PS +C
Sbjct: 71 AAYFGEISIGTPPQNFLVLFDTGSSNLWVPSVYC 104
>gi|119372302|ref|NP_001073276.1| pepsin A preproprotein [Homo sapiens]
gi|378521995|sp|P0DJD7.1|PEPA4_HUMAN RecName: Full=Pepsin A-4; AltName: Full=Pepsinogen-4; Flags:
Precursor
gi|387012|gb|AAA98529.1| pepsinogen [Homo sapiens]
gi|157170280|gb|AAI52845.1| Pepsinogen 4, group I (pepsinogen A) [synthetic construct]
gi|219520853|gb|AAI71920.1| Pepsinogen 4, group I (pepsinogen A) [Homo sapiens]
gi|219521176|gb|AAI71910.1| Pepsinogen 4, group I (pepsinogen A) [Homo sapiens]
gi|223462201|gb|AAI50660.1| Pepsinogen 4, group I (pepsinogen A) [Homo sapiens]
gi|261860840|dbj|BAI46942.1| pepsinogen 4, group I [synthetic construct]
Length = 388
Score = 85.5 bits (210), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 44/125 (35%), Positives = 73/125 (58%), Gaps = 5/125 (4%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-K 107
V +D F S + +A D++G ++FGG+D + G + + PV+ +GYWQ V+SI + +
Sbjct: 208 VSQDLF---SVYLSADDQSGSVVIFGGIDSSYYTGSLNWVPVTVEGYWQITVDSITMNGE 264
Query: 108 NVYCSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
+ C+ CQAI DTGTSL+ GP+ IA + IGA ++G V C + +P++ +
Sbjct: 265 AIACAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASENSDGDMVVSCSAISSLPDIVFTI 324
Query: 167 GGKNF 171
G +
Sbjct: 325 NGVQY 329
Score = 60.5 bits (145), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 25/41 (60%), Positives = 35/41 (85%), Gaps = 2/41 (4%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACS 48
+Y+GTI +GTP Q+F V+FDTGSSNLW+PS +CS ++AC+
Sbjct: 75 EYFGTIGIGTPAQDFTVVFDTGSSNLWVPSVYCS--SLACT 113
>gi|1169175|sp|P40782.2|CYPR1_CYNCA RecName: Full=Cyprosin; Flags: Precursor
gi|1585067|prf||2124255A cyprosin
Length = 473
Score = 85.5 bits (210), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 43/101 (42%), Positives = 65/101 (64%), Gaps = 3/101 (2%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYC-SNCQAI 117
RNA ++ GGE++FGGVD + F G+ TY PV++KGYWQF + + I + +C S C AI
Sbjct: 195 RNADEQEGGELVFGGVDPNHFKGEHTYVPVTQKGYWQFEMGDVLIGDKTTGFCASGCAAI 254
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDK 158
AD+GTSL+ G + ++ ++N+ IGA + + K D K
Sbjct: 255 ADSGTSLLAGTTTIVTQINQAIGAAGVMSQQCKSLVDQYGK 295
Score = 65.1 bits (157), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 30/62 (48%), Positives = 42/62 (67%), Gaps = 8/62 (12%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHF 60
+++ AQY+G I +GTPPQ+F VIFDTGSSNLW+PS C ++AC + HS +
Sbjct: 43 LKNYMDAQYFGEIGIGTPPQKFTVIFDTGSSNLWVPSSKC-YFSVAC-------LFHSKY 94
Query: 61 RN 62
R+
Sbjct: 95 RS 96
Score = 39.3 bits (90), Expect = 0.62, Method: Compositional matrix adjust.
Identities = 17/44 (38%), Positives = 26/44 (59%)
Query: 131 VIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
+I ++KL +P G + VDC +L MPN+ +GGK F L+
Sbjct: 370 IINYVDKLCERLPSPMGESAVDCSSLSSMPNIAFTVGGKTFNLS 413
>gi|348559312|ref|XP_003465460.1| PREDICTED: napsin-A-like [Cavia porcellus]
Length = 523
Score = 85.5 bits (210), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 41/116 (35%), Positives = 65/116 (56%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAIA 118
R+ +GGE++ GG D ++ +T+ PV+ YWQ ++ + + + C+ C AI
Sbjct: 220 RDFEGTHGGELVLGGSDPAHYIPPLTFVPVTIPAYWQIHMDRVMVGTGLTLCAQGCAAIV 279
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSLI GPS+ I L++ IG +P G + C + +P + +LGG F LT
Sbjct: 280 DTGTSLITGPSEEIRALHRAIGGLPWLAGEHFIQCSKIPTLPPISFLLGGVWFNLT 335
Score = 64.7 bits (156), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 25/40 (62%), Positives = 31/40 (77%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
QY+G I LGTPPQ F V+FDTGSSNLW+PS+ C ++ C
Sbjct: 74 QYFGEIGLGTPPQNFSVVFDTGSSNLWVPSKSCRFFSLPC 113
>gi|283806592|ref|NP_001164549.1| pepsin II-1 precursor [Oryctolagus cuniculus]
gi|129777|sp|P28712.1|PEPA1_RABIT RecName: Full=Pepsin II-1; AltName: Full=Pepsin A; Flags: Precursor
gi|22218074|dbj|BAC07514.1| pepsinogen II-1 [Oryctolagus cuniculus]
Length = 387
Score = 85.5 bits (210), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 43/127 (33%), Positives = 73/127 (57%), Gaps = 5/127 (3%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-K 107
V +D F S + +++ E G +MFGG+D + G + + PVS +GYWQ ++SI I +
Sbjct: 207 VSEDLF---SVYLSSNGEKGSMVMFGGIDSSYYTGSLNWVPVSHEGYWQITMDSITINGE 263
Query: 108 NVYCSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
+ C++ CQA+ DTGTSL+ GP+ I+++ IGA G + C +D +P++ +
Sbjct: 264 TIACADSCQAVVDTGTSLLAGPTSAISKIQSYIGASKNLLGENIISCSAIDSLPDIVFTI 323
Query: 167 GGKNFTL 173
+ L
Sbjct: 324 NNVQYPL 330
Score = 70.9 bits (172), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 31/51 (60%), Positives = 41/51 (80%), Gaps = 2/51 (3%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDK 51
+ + A+Y+GTIS+GTPPQEF VIFDTGSSNLW+PS +CS ++AC + K
Sbjct: 67 LENYLDAEYFGTISIGTPPQEFTVIFDTGSSNLWVPSTYCS--SLACFLHK 115
>gi|222425192|dbj|BAH20545.1| pepsinogen A-59 [Pongo abelii]
Length = 388
Score = 85.5 bits (210), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 44/125 (35%), Positives = 73/125 (58%), Gaps = 5/125 (4%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-K 107
V +D F S + +A D++G ++FGG+D + G + + PV+ +GYWQ V+SI + +
Sbjct: 208 VSQDLF---SVYLSADDQSGSVVIFGGIDSSYYTGSLNWVPVTVEGYWQITVDSITMNGE 264
Query: 108 NVYCSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
+ C+ CQAI DTGTSL+ GP+ IA + IGA ++G V C + +P++ +
Sbjct: 265 AIACAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASENSDGDMVVSCSAISSLPDIVFTI 324
Query: 167 GGKNF 171
G +
Sbjct: 325 NGVQY 329
Score = 59.7 bits (143), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 25/41 (60%), Positives = 35/41 (85%), Gaps = 2/41 (4%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACS 48
+Y+G+I +GTP Q+F VIFDTGSSNLW+PS +CS ++AC+
Sbjct: 75 EYFGSIGIGTPAQDFTVIFDTGSSNLWVPSVYCS--SLACT 113
>gi|219521036|gb|AAI71897.1| Pepsinogen 5, group I (pepsinogen A) [Homo sapiens]
Length = 388
Score = 85.5 bits (210), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 44/125 (35%), Positives = 73/125 (58%), Gaps = 5/125 (4%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-K 107
V +D F S + +A D++G ++FGG+D + G + + PV+ +GYWQ V+SI + +
Sbjct: 208 VSQDLF---SVYLSADDKSGSVVIFGGIDSSYYTGSLNWVPVTVEGYWQITVDSITMNGE 264
Query: 108 NVYCSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
+ C+ CQAI DTGTSL+ GP+ IA + IGA ++G V C + +P++ +
Sbjct: 265 TIACAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASENSDGDMVVSCSAISSLPDIVFTI 324
Query: 167 GGKNF 171
G +
Sbjct: 325 NGVQY 329
Score = 60.5 bits (145), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 25/41 (60%), Positives = 35/41 (85%), Gaps = 2/41 (4%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACS 48
+Y+GTI +GTP Q+F V+FDTGSSNLW+PS +CS ++AC+
Sbjct: 75 EYFGTIGIGTPAQDFTVVFDTGSSNLWVPSVYCS--SLACT 113
>gi|449454758|ref|XP_004145121.1| PREDICTED: aspartic proteinase-like [Cucumis sativus]
gi|449472326|ref|XP_004153558.1| PREDICTED: aspartic proteinase-like [Cucumis sativus]
Length = 514
Score = 85.5 bits (210), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 42/84 (50%), Positives = 56/84 (66%), Gaps = 3/84 (3%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV--YCS-NCQAI 117
RNA +E GGEI+FGGVD + G TY PV++KGYWQF + + I+ YC C AI
Sbjct: 234 RNAEEEEGGEIVFGGVDPKHYKGKHTYVPVTQKGYWQFDMGDVLIDGKPTGYCEGGCSAI 293
Query: 118 ADTGTSLIIGPSKVIAELNKLIGA 141
AD+GTSL+ GP+ ++ +N IGA
Sbjct: 294 ADSGTSLLAGPTTIVTMINHAIGA 317
Score = 68.2 bits (165), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 32/67 (47%), Positives = 43/67 (64%), Gaps = 10/67 (14%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHF 60
+++ AQYYG I++GTPPQ+F VIFDTGSSNLW+PS C + ++AC H H
Sbjct: 82 LKNYLDAQYYGEIAIGTPPQKFTVIFDTGSSNLWVPSAKC-LFSVAC---------HFHA 131
Query: 61 RNASDEN 67
R S +
Sbjct: 132 RYKSSRS 138
>gi|397526910|ref|XP_003833357.1| PREDICTED: gastricsin [Pan paniscus]
Length = 388
Score = 85.5 bits (210), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 42/118 (35%), Positives = 68/118 (57%), Gaps = 3/118 (2%)
Query: 59 HFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCSN-CQ 115
+ N +GG ++FGGVD + G I ++PV+++ YWQ G+E I + + +CS CQ
Sbjct: 213 YLSNQQGSSGGAVVFGGVDSSLYTGQIYWAPVTQELYWQIGIEEFLIGGQASGWCSEGCQ 272
Query: 116 AIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
AI DTGTSL+ P + ++ L + GA G V+C+++ +P + I+ G F L
Sbjct: 273 AIVDTGTSLLTVPQQYMSALLEATGAQEDEYGQFLVNCNSIQNLPTLTFIINGVEFPL 330
Score = 58.9 bits (141), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 29/54 (53%), Positives = 36/54 (66%), Gaps = 9/54 (16%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHF 60
A Y+G IS+GTPPQ F V+FDTGSSNLW+PS +C + AC+ HS F
Sbjct: 71 AAYFGEISIGTPPQNFLVLFDTGSSNLWVPSVYCQ--SQACT-------SHSRF 115
>gi|147743000|sp|P85137.1|CARDF_CYNCA RecName: Full=Cardosin-F; Contains: RecName: Full=Cardosin-F heavy
chain; Contains: RecName: Full=Cardosin-F light chain
Length = 281
Score = 85.5 bits (210), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 39/84 (46%), Positives = 58/84 (69%), Gaps = 3/84 (3%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCS-NCQAI 117
RN +E GGE++FGG+D + F GD TY PV+ + YWQFG+ + I + +C+ CQA
Sbjct: 129 RNVDEEEGGELVFGGLDPNHFRGDHTYVPVTYQYYWQFGIGDVLIGDKSTGFCAPGCQAF 188
Query: 118 ADTGTSLIIGPSKVIAELNKLIGA 141
AD+GTSL+ GP+ ++ ++N IGA
Sbjct: 189 ADSGTSLLSGPTAIVTQINHAIGA 212
Score = 52.8 bits (125), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 22/29 (75%), Positives = 25/29 (86%)
Query: 9 YYGTISLGTPPQEFKVIFDTGSSNLWIPS 37
YYG I +GTPPQ+F VIFDTGSS LW+PS
Sbjct: 18 YYGEIGIGTPPQKFTVIFDTGSSVLWVPS 46
>gi|255554815|ref|XP_002518445.1| Aspartic proteinase precursor, putative [Ricinus communis]
gi|223542290|gb|EEF43832.1| Aspartic proteinase precursor, putative [Ricinus communis]
Length = 511
Score = 85.5 bits (210), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 44/93 (47%), Positives = 61/93 (65%), Gaps = 3/93 (3%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCS-NCQAI 117
RN E GGEI+FGGVD + + G TY PV++KGYWQF + + I + YC+ C AI
Sbjct: 231 RNTQGEEGGEIVFGGVDLNHYKGKHTYVPVTQKGYWQFEMGDVLIGHKPTEYCAGGCSAI 290
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAK 150
AD+GTSL+ GP+ V+ +N+ IGA +A+ K
Sbjct: 291 ADSGTSLLAGPTTVVTLINEAIGATGVASQECK 323
Score = 67.8 bits (164), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 28/47 (59%), Positives = 37/47 (78%), Gaps = 1/47 (2%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+++ AQYYG I +GTPPQ+F V+FDTGSSNLW+PS C I ++AC
Sbjct: 79 LKNYLDAQYYGEIGIGTPPQKFTVVFDTGSSNLWVPSSKC-IFSVAC 124
>gi|122938522|gb|ABM69085.1| aspartic proteinase AspMD02 [Musca domestica]
Length = 379
Score = 85.5 bits (210), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 46/114 (40%), Positives = 59/114 (51%), Gaps = 1/114 (0%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSNCQAIADT 120
R + + GG ++ GG D + GD Y PVS +GYWQF + + C CQAIADT
Sbjct: 213 RRGTSDEGGVMVVGGNDDRYYEGDFHYVPVSEQGYWQFEMAEAHVNGVRICDRCQAIADT 272
Query: 121 GTSLIIGPSKVIAELNKLIGAV-PLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
GTSLI P+ E+ K IGA +DC +D +P V LG FTL
Sbjct: 273 GTSLIAVPTDKYEEIQKEIGATFSYDTYEYMLDCSKIDDLPVVTFRLGDGTFTL 326
Score = 60.8 bits (146), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 26/40 (65%), Positives = 30/40 (75%)
Query: 9 YYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACS 48
YYG I++GTP QEF V+FDTGSSNLW+P CS N AC
Sbjct: 68 YYGKITIGTPGQEFLVLFDTGSSNLWVPVAPCSADNAACE 107
>gi|332267172|ref|XP_003282561.1| PREDICTED: pepsin A-5 [Nomascus leucogenys]
Length = 372
Score = 85.5 bits (210), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 44/125 (35%), Positives = 73/125 (58%), Gaps = 5/125 (4%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-K 107
V +D F S + +A D++G ++FGG+D + G + + PV+ +GYWQ V+SI + +
Sbjct: 192 VSQDLF---SVYLSADDQSGSVVIFGGIDSSYYSGSLNWVPVTVEGYWQITVDSITMNGE 248
Query: 108 NVYCSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
+ C+ CQAI DTGTSL+ GP+ IA + IGA ++G V C + +P++ +
Sbjct: 249 TIACAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASENSDGDMVVSCSAISSLPDIVFTI 308
Query: 167 GGKNF 171
G +
Sbjct: 309 NGVQY 313
Score = 61.2 bits (147), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 26/41 (63%), Positives = 35/41 (85%), Gaps = 2/41 (4%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACS 48
+Y+GTI +GTP Q+F VIFDTGSSNLW+PS +CS ++AC+
Sbjct: 59 EYFGTIGIGTPAQDFTVIFDTGSSNLWVPSVYCS--SLACT 97
>gi|317770606|ref|NP_001187944.1| pepsin A precursor [Ictalurus punctatus]
gi|308324409|gb|ADO29339.1| pepsin A [Ictalurus punctatus]
Length = 374
Score = 85.1 bits (209), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 46/127 (36%), Positives = 74/127 (58%), Gaps = 5/127 (3%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-K 107
V +D F S + +++ + G E++FG +D + G I + P+S + Y+Q ++S+ I +
Sbjct: 199 VSQDVF---SVYLSSNGQQGSEVLFGEIDTSYYTGSIYWIPLSSESYYQITMDSVFINGQ 255
Query: 108 NVYC-SNCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
V C CQAI DTGTS I+GPS I+ +N +GA G V+C+N+ MP++ L
Sbjct: 256 TVACFGGCQAIIDTGTSNIVGPSSDISNINSWVGAPVDQYGDGSVNCNNIQSMPSITFTL 315
Query: 167 GGKNFTL 173
G FT+
Sbjct: 316 KGYTFTI 322
Score = 58.9 bits (141), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 23/33 (69%), Positives = 29/33 (87%)
Query: 9 YYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCS 41
YYG IS+G PPQ F+VIFD+GS+NLW+PS +CS
Sbjct: 68 YYGVISIGNPPQSFQVIFDSGSANLWVPSIYCS 100
>gi|50557048|ref|XP_505932.1| YALI0F27071p [Yarrowia lipolytica]
gi|49651802|emb|CAG78744.1| YALI0F27071p [Yarrowia lipolytica CLIB122]
Length = 396
Score = 85.1 bits (209), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 56/153 (36%), Positives = 84/153 (54%), Gaps = 8/153 (5%)
Query: 24 VIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASDE--NGGEIMFGGVDKDKF 81
+ +DT S N +P + ++ N +DK++F S F +++ +GG FGGVD+D F
Sbjct: 192 LAYDTISVNKIVPPVYNAV-NRGL-LDKNQF---SFFLGDTNKGTDGGVATFGGVDEDYF 246
Query: 82 VGDITYSPVSRKGYWQFGVESIKI-EKNVYCSNCQAIADTGTSLIIGPSKVIAELNKLIG 140
G IT+ PV RK YW+ SI + ++ N A DTGTSL+ PS + LN IG
Sbjct: 247 EGKITWLPVRRKAYWEVEFNSITLGDQTAELVNTGAAIDTGTSLLALPSGLAEVLNSEIG 306
Query: 141 AVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
A +G V+CD +D +P++ G NFT+
Sbjct: 307 ATKGWSGQYTVECDKVDSLPDLTFNFAGYNFTI 339
Score = 62.4 bits (150), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 34/80 (42%), Positives = 47/80 (58%), Gaps = 3/80 (3%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASDE 66
AQYY I +GTPPQ+F VI DTGSSNLW+PS C+ +IAC + + K+ + ++
Sbjct: 81 AQYYTEIEIGTPPQKFNVILDTGSSNLWVPSVQCN--SIACYLHQ-KYDSAASSSYKANG 137
Query: 67 NGGEIMFGGVDKDKFVGDIT 86
EI +G + FV T
Sbjct: 138 TAFEIQYGSGSMEGFVSQDT 157
>gi|414887123|tpg|DAA63137.1| TPA: hypothetical protein ZEAMMB73_794362 [Zea mays]
Length = 608
Score = 85.1 bits (209), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 39/85 (45%), Positives = 59/85 (69%), Gaps = 3/85 (3%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV--YCS-NCQAI 117
R+A + GGEI+FGG+D + GD T+ PV+RKGYWQF + + ++ +C+ C A+
Sbjct: 327 RHADEGEGGEIVFGGMDSSHYKGDHTFVPVTRKGYWQFNMGDVLVDGKSTGFCAGGCAAV 386
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAV 142
AD+GTSL+ GP+ +I E+N+ IGA
Sbjct: 387 ADSGTSLLAGPTAIITEINEKIGAA 411
Score = 35.4 bits (80), Expect = 8.0, Method: Compositional matrix adjust.
Identities = 15/43 (34%), Positives = 25/43 (58%)
Query: 131 VIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
++ +N+L +P G + VDC +L MP++ +GGK F L
Sbjct: 505 ILNYINQLCERLPSPMGESAVDCGSLASMPDIAFTIGGKKFKL 547
>gi|194759276|ref|XP_001961875.1| GF15194 [Drosophila ananassae]
gi|190615572|gb|EDV31096.1| GF15194 [Drosophila ananassae]
Length = 411
Score = 85.1 bits (209), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 44/131 (33%), Positives = 72/131 (54%), Gaps = 2/131 (1%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKN 108
+ K F + + S GGE++ GG+D F G +TY PVS +GYWQF + S I
Sbjct: 231 IKKPVFSIYLNRNGTSAITGGELILGGIDSGLFRGCLTYVPVSTQGYWQFTMGSADINGT 290
Query: 109 VYCSNCQAIADTGTSLIIGPSKVIAELNKLIGAV--PLANGPAKVDCDNLDKMPNVDIIL 166
+C+ C+AI D GTSL++ P + ++N+++ + NG VDC + P++ +
Sbjct: 291 NFCNKCEAILDVGTSLMVIPQNALTKVNQILEVLNPKETNGVFLVDCAKIPNFPDIIFNI 350
Query: 167 GGKNFTLTRTN 177
G K+F L ++
Sbjct: 351 GRKDFPLKSSD 361
Score = 49.7 bits (117), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 19/48 (39%), Positives = 29/48 (60%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACS 48
+ + Y Y + +G PPQ+F ++ DTGSSNLW+PS C + C+
Sbjct: 84 LSNYYNTAYSVAVEIGNPPQKFTLLIDTGSSNLWVPSIKCPATDRTCA 131
>gi|395821502|ref|XP_003784077.1| PREDICTED: gastricsin-like [Otolemur garnettii]
Length = 390
Score = 85.1 bits (209), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 41/116 (35%), Positives = 62/116 (53%), Gaps = 3/116 (2%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCSN-CQAI 117
R + + GGE++ GGVD + G+I ++PV+++ YWQ ++ I + CS CQ I
Sbjct: 217 RQPTAQYGGELILGGVDSQLYSGEIVWTPVTQEMYWQIAIQEFSIGNQATGLCSQGCQGI 276
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
DTGTSL+ P + I+ + GA NG V C N+ MP + +GG L
Sbjct: 277 VDTGTSLLTVPQQYISSFVEATGAQQAENGDFVVSCSNVQNMPTIAFTIGGAQLPL 332
Score = 59.3 bits (142), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 26/40 (65%), Positives = 32/40 (80%), Gaps = 2/40 (5%)
Query: 9 YYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACS 48
Y+G IS+GTPPQ F V+FDTGSSNLW+PS +C + ACS
Sbjct: 74 YFGEISIGTPPQNFLVLFDTGSSNLWVPSTYCQ--SQACS 111
>gi|71021685|ref|XP_761073.1| hypothetical protein UM04926.1 [Ustilago maydis 521]
gi|46100637|gb|EAK85870.1| hypothetical protein UM04926.1 [Ustilago maydis 521]
Length = 418
Score = 85.1 bits (209), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 43/117 (36%), Positives = 66/117 (56%), Gaps = 1/117 (0%)
Query: 58 SHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI-EKNVYCSNCQA 116
S + +S+E+GGE +FGG+D + G I +SPV RKGYW+ ++ + + ++ + N A
Sbjct: 245 SFYLGSSEEDGGEAVFGGIDDSHYTGKIHWSPVKRKGYWEVALDKLALGDEELELDNGSA 304
Query: 117 IADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
DTGTSLI + LN IGA NG VDC+ + +P + + G+ F L
Sbjct: 305 AIDTGTSLIAMATDTAEILNAEIGATKSWNGQYSVDCEKVKDLPPLTFYIDGQPFKL 361
Score = 65.9 bits (159), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 41/86 (47%), Positives = 53/86 (61%), Gaps = 8/86 (9%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDK--DKFVGHSHFRNAS 64
AQY+ ISLGTP Q+FKVI DTGSSNLW+PS CS +IAC + K D S+ +N +
Sbjct: 105 AQYFCDISLGTPAQDFKVILDTGSSNLWVPSTKCS--SIACFLHKKYDSSASSSYKKNGT 162
Query: 65 D---ENGGEIMFGGVDKDKF-VGDIT 86
+ + G M G V D +GD+T
Sbjct: 163 EFKIQYGSGSMEGIVSNDVLKIGDLT 188
>gi|307187033|gb|EFN72332.1| Lysosomal aspartic protease [Camponotus floridanus]
Length = 206
Score = 85.1 bits (209), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 44/83 (53%), Positives = 58/83 (69%), Gaps = 2/83 (2%)
Query: 61 RNASDE-NGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEK-NVYCSNCQAIA 118
R+ SDE NGG ++ GG D + G++TY PV+RKGYWQF ++ IKI+ N+ +CQAIA
Sbjct: 18 RDVSDEINGGVLILGGSDPAFYEGNLTYIPVTRKGYWQFTIDKIKIDYINLCIESCQAIA 77
Query: 119 DTGTSLIIGPSKVIAELNKLIGA 141
DTGT IIGP I+ +N LIG
Sbjct: 78 DTGTPWIIGPISDISRINDLIGV 100
>gi|345318884|ref|XP_001520972.2| PREDICTED: renin-like [Ornithorhynchus anatinus]
Length = 388
Score = 85.1 bits (209), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 52/179 (29%), Positives = 92/179 (51%), Gaps = 23/179 (12%)
Query: 7 AQYYGTISLGTPPQEF---KVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNA 63
A++ G + +G P Q +FD I +QH V K+ + RN+
Sbjct: 168 AKFDGVLGMGYPAQAIGGITPVFDH------ILTQH---------VLKEDVFSVYYSRNS 212
Query: 64 SDEN---GGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAIA 118
+++ GGEI+ GG D + GD Y VS+KG+WQ ++ + +++ + +C C A+
Sbjct: 213 KNDHMVPGGEIILGGRDPTYYQGDFYYLDVSKKGFWQVNMKGVSVDRTLQFCQEGCAAMV 272
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLTRTN 177
DTG +LI GP K + + ++GA + VDC + ++P++ LGG+ F L+ ++
Sbjct: 273 DTGATLITGPVKDVKHMMDILGAQKIGGNMYAVDCKEVAQLPDISFHLGGRVFPLSSSD 331
Score = 61.2 bits (147), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 25/41 (60%), Positives = 33/41 (80%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQY+G I +G+P Q FKVIFDTGS+NLW+PS +C ++ AC
Sbjct: 64 AQYFGEIGIGSPAQTFKVIFDTGSANLWVPSINCKPIHSAC 104
>gi|413942271|gb|AFW74920.1| hypothetical protein ZEAMMB73_522985 [Zea mays]
Length = 468
Score = 85.1 bits (209), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 41/93 (44%), Positives = 62/93 (66%), Gaps = 3/93 (3%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE--KNVYCS-NCQAI 117
R+A + GGEI+FGG+D + GD T+ PV+RKGYWQF + + ++ +C+ C AI
Sbjct: 187 RHADEGEGGEIVFGGMDSSHYKGDHTFVPVTRKGYWQFNMGDVLVDGKSTGFCAGGCAAI 246
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAK 150
AD+GTSL+ GP+ +I E+N+ IGA + + K
Sbjct: 247 ADSGTSLLAGPTAIITEINEKIGAAGVVSQECK 279
>gi|109287596|emb|CAJ55260.1| renin-like aspartic protease [Echis ocellatus]
Length = 395
Score = 85.1 bits (209), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 44/119 (36%), Positives = 68/119 (57%), Gaps = 3/119 (2%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-KNVYCSN-CQAIA 118
R++ GGEI+ GG D + GD Y SR+GYW ++ + IE K V C + C A
Sbjct: 223 RHSESNTGGEIILGGSDPSHYTGDFHYVSTSREGYWHVDLKGVSIENKIVLCHDGCTATI 282
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLTRTN 177
DTGTS I GP+ I+ L + IGA L++G +DC ++ +P++ LG ++L+ +
Sbjct: 283 DTGTSFISGPASSISVLMETIGAT-LSDGDYVIDCKKINLLPDITFHLGDMTYSLSSST 340
Score = 67.0 bits (162), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 29/41 (70%), Positives = 33/41 (80%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
QYYG IS+GTP Q FKV+FDTGSSNLW+PS+ CS L AC
Sbjct: 77 TQYYGEISIGTPAQIFKVVFDTGSSNLWVPSRQCSPLYSAC 117
>gi|395328846|gb|EJF61236.1| endopeptidase [Dichomitus squalens LYAD-421 SS1]
Length = 412
Score = 85.1 bits (209), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 45/120 (37%), Positives = 66/120 (55%), Gaps = 2/120 (1%)
Query: 60 FR-NASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVY-CSNCQAI 117
FR +S+++GGE +FGGVD + G I Y PV RK YW+ +E + + +V + A
Sbjct: 240 FRLGSSEDDGGEAIFGGVDDSAYTGKIQYVPVRRKAYWEVELEKVSLGDDVLELESTGAA 299
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLTRTN 177
DTGTSLI P+ + +N IGA NG VDC + +P++ GG + L T+
Sbjct: 300 IDTGTSLIALPTDIAEMINTQIGATKSWNGQYTVDCAKVPSLPDLTFTFGGNPYVLKGTD 359
Score = 67.0 bits (162), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 37/84 (44%), Positives = 49/84 (58%), Gaps = 4/84 (4%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASD- 65
AQY+ ISLGTPPQ FKVI DTGSSNLW+PS C+ + D ++ N ++
Sbjct: 99 AQYFAEISLGTPPQTFKVILDTGSSNLWVPSVKCTSIACFLHTKYDSSSSSTYKANGTEF 158
Query: 66 --ENGGEIMFGGVDKDKF-VGDIT 86
+ G M G V +D F +GD+T
Sbjct: 159 SIQYGSGSMEGFVSQDTFRIGDLT 182
>gi|223946977|gb|ACN27572.1| unknown [Zea mays]
gi|238014788|gb|ACR38429.1| unknown [Zea mays]
gi|413946556|gb|AFW79205.1| aspartic proteinase oryzasin-1 isoform 1 [Zea mays]
gi|413946557|gb|AFW79206.1| aspartic proteinase oryzasin-1 isoform 2 [Zea mays]
Length = 510
Score = 85.1 bits (209), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 40/85 (47%), Positives = 59/85 (69%), Gaps = 3/85 (3%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV--YCS-NCQAI 117
R+A + GGEI+FGG+D + GD T+ PV+RKGYWQF + + ++ +C+ C AI
Sbjct: 229 RHADEGEGGEIVFGGMDSSHYKGDHTFVPVTRKGYWQFNMGDVLVDGKSTGFCAGGCAAI 288
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAV 142
AD+GTSL+ GP+ +I E+N+ IGA
Sbjct: 289 ADSGTSLLAGPTAIITEINEKIGAA 313
Score = 60.5 bits (145), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 26/47 (55%), Positives = 35/47 (74%), Gaps = 1/47 (2%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+++ AQY+G I +G+P Q+F VIFDTGSSNLW+PS C +IAC
Sbjct: 77 LKNYMNAQYFGEIGVGSPQQKFTVIFDTGSSNLWVPSSKC-YFSIAC 122
Score = 35.8 bits (81), Expect = 6.5, Method: Compositional matrix adjust.
Identities = 15/43 (34%), Positives = 25/43 (58%)
Query: 131 VIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
++ +N+L +P G + VDC +L MP++ +GGK F L
Sbjct: 407 ILNYINQLCERLPSPMGESAVDCGSLASMPDIAFTIGGKKFKL 449
>gi|51534966|dbj|BAD36916.1| pepsinogen C [Octodon degus]
Length = 378
Score = 85.1 bits (209), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 40/111 (36%), Positives = 66/111 (59%), Gaps = 3/111 (2%)
Query: 67 NGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV--YCSN-CQAIADTGTS 123
GG ++ GGVD+ + G+I+++PV+ YWQ G+E ++ V +CS CQ I DTGTS
Sbjct: 211 EGGALILGGVDESLYSGEISWTPVTEALYWQIGIEDFLLDGEVSGWCSQGCQGIVDTGTS 270
Query: 124 LIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
L+ P + ++ L + IGA G VDC+++ +P + ++ G F L+
Sbjct: 271 LLTVPQEYLSTLMQAIGAQENEFGEYMVDCNSIQNLPTLTFVISGVQFPLS 321
Score = 63.9 bits (154), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 27/43 (62%), Positives = 35/43 (81%), Gaps = 2/43 (4%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSV 49
A Y+G ISLGTPPQ F+V+FDTGSSNLW+PS +C ++AC+
Sbjct: 59 ASYFGEISLGTPPQSFQVLFDTGSSNLWVPSVYCK--SLACTT 99
>gi|395535589|ref|XP_003769805.1| PREDICTED: chymosin-like [Sarcophilus harrisii]
Length = 382
Score = 85.1 bits (209), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 40/108 (37%), Positives = 61/108 (56%), Gaps = 2/108 (1%)
Query: 68 GGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN--CQAIADTGTSLI 125
G + G +D + G + + P++ +GYWQF V+ I + V + CQAI DTGTSL+
Sbjct: 222 GSMLTLGAIDSSYYTGSLHWVPITVQGYWQFSVDRITVNGQVVACDGGCQAILDTGTSLL 281
Query: 126 IGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
+GPS I+ + +IGA G +DC +L MP V I + G+ + L
Sbjct: 282 VGPSYDISNIQSVIGATQGQYGEFDIDCSSLSSMPTVVIHINGRQYPL 329
Score = 61.6 bits (148), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 26/41 (63%), Positives = 34/41 (82%), Gaps = 2/41 (4%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+QY+G I +GTPPQEF V+FDTGSSNLW+PS +C+ + AC
Sbjct: 73 SQYFGKIYIGTPPQEFTVVFDTGSSNLWVPSVYCN--STAC 111
>gi|384490965|gb|EIE82161.1| hypothetical protein RO3G_06866 [Rhizopus delemar RA 99-880]
Length = 403
Score = 85.1 bits (209), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 43/107 (40%), Positives = 65/107 (60%), Gaps = 1/107 (0%)
Query: 68 GGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI-EKNVYCSNCQAIADTGTSLII 126
GGE+ FGG+D+ +F G+IT+SPV+RKGYW+ +++ K ++ + + A DTGTSL+I
Sbjct: 240 GGELAFGGIDEARFSGNITWSPVTRKGYWEIELQNTKFNDQPMNMGSIGAAIDTGTSLLI 299
Query: 127 GPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
P+ V +N IG A G VDC ++ +P GK+F L
Sbjct: 300 APTAVAEFVNNQIGGQADAYGQYTVDCSSVGNLPEFCFQFSGKDFCL 346
Score = 64.7 bits (156), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 28/45 (62%), Positives = 34/45 (75%), Gaps = 2/45 (4%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDK 51
AQYYG I +GTP Q F VIFDTGSSNLW+PS HC ++ AC + +
Sbjct: 86 AQYYGEIQIGTPAQTFTVIFDTGSSNLWVPSTHC--MSFACLMHR 128
>gi|33352213|emb|CAE18153.1| aspartic proteinase [Chlamydomonas reinhardtii]
Length = 578
Score = 85.1 bits (209), Expect = 1e-14, Method: Composition-based stats.
Identities = 40/84 (47%), Positives = 55/84 (65%), Gaps = 3/84 (3%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI---EKNVYCSNCQAI 117
R+ + NGGE++ GG+D F G+ T+ PV+R+GYWQF +E + + + + C AI
Sbjct: 220 RDPNAPNGGELVLGGIDPTHFTGEHTWVPVTRQGYWQFTMEGLDLGPGSQKMCAKGCAAI 279
Query: 118 ADTGTSLIIGPSKVIAELNKLIGA 141
ADTGTSLI GPS +A LN IGA
Sbjct: 280 ADTGTSLIAGPSDEVAALNHAIGA 303
Score = 75.5 bits (184), Expect = 8e-12, Method: Composition-based stats.
Identities = 30/51 (58%), Positives = 39/51 (76%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDK 51
+++ AQYYG I LGTPPQ F VIFDTGS+NLW+PS C++ NIAC + +
Sbjct: 67 LKNFMDAQYYGEIGLGTPPQLFNVIFDTGSANLWVPSSKCALFNIACRLHR 117
>gi|4389326|pdb|1B5F|A Chain A, Native Cardosin A From Cynara Cardunculus L.
gi|6729875|pdb|1B5F|C Chain C, Native Cardosin A From Cynara Cardunculus L
Length = 239
Score = 85.1 bits (209), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 39/84 (46%), Positives = 58/84 (69%), Gaps = 3/84 (3%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCS-NCQAI 117
RN +E GGE++FGG+D + F GD TY PV+ + YWQFG+ + I + +C+ CQA
Sbjct: 155 RNVDEEEGGELVFGGLDPNHFRGDHTYVPVTYQYYWQFGIGDVLIGDKSTGFCAPGCQAF 214
Query: 118 ADTGTSLIIGPSKVIAELNKLIGA 141
AD+GTSL+ GP+ ++ ++N IGA
Sbjct: 215 ADSGTSLLSGPTAIVTQINHAIGA 238
Score = 55.5 bits (132), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 33/72 (45%), Positives = 43/72 (59%), Gaps = 14/72 (19%)
Query: 9 YYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASD--- 65
Y+G I +GTPPQ+F VIFDTGSS LW+PS C I + AC HS + ++
Sbjct: 15 YFGEIGIGTPPQKFTVIFDTGSSVLWVPSSKC-INSKACRA-------HSMYESSDSSTY 66
Query: 66 -ENG--GEIMFG 74
ENG G I++G
Sbjct: 67 KENGTFGAIIYG 78
>gi|307187094|gb|EFN72348.1| Cathepsin D [Camponotus floridanus]
Length = 308
Score = 85.1 bits (209), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 46/115 (40%), Positives = 66/115 (57%), Gaps = 5/115 (4%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYC-SNCQAIAD 119
R+ S + GGE+ GG + + GD TY PV+ KGYWQF ++SI+++ C +CQAI D
Sbjct: 12 RDTSADFGGELTLGGSNPTYYEGDFTYVPVTNKGYWQFIIDSIQVKHITLCKESCQAIVD 71
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
TG I+GP I + LI L +V+C + ++P + LGGK F LT
Sbjct: 72 TGCWQIVGPKMDIQFIYLLIETDSL----GRVNCKKILQLPTIRFNLGGKAFDLT 122
Score = 62.0 bits (149), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 31/84 (36%), Positives = 46/84 (54%), Gaps = 1/84 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYC-SNCQAIAD 119
R+ S + GG+++FG D + GD TY PV+ K YWQF ++SI++ +C C+AI D
Sbjct: 206 RDTSADLGGKLIFGSSDPACYEGDFTYIPVTSKQYWQFTIDSIQMNDITWCLGGCEAIVD 265
Query: 120 TGTSLIIGPSKVIAELNKLIGAVP 143
T I GP I+ + P
Sbjct: 266 TNAWQITGPQSDISNIYDFTETNP 289
>gi|226503984|ref|NP_001148782.1| aspartic proteinase oryzasin-1 precursor [Zea mays]
gi|195622118|gb|ACG32889.1| aspartic proteinase oryzasin-1 precursor [Zea mays]
Length = 510
Score = 85.1 bits (209), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 41/93 (44%), Positives = 62/93 (66%), Gaps = 3/93 (3%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE--KNVYCS-NCQAI 117
R+A + GGEI+FGG+D + GD T+ PV+RKGYWQF + + ++ +C+ C AI
Sbjct: 229 RHADEGEGGEIVFGGMDSSHYKGDHTFVPVTRKGYWQFNMGDVLVDGKSTGFCAGGCAAI 288
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAK 150
AD+GTSL+ GP+ +I E+N+ IGA + + K
Sbjct: 289 ADSGTSLLAGPTAIITEINEKIGAAGVVSQECK 321
Score = 60.5 bits (145), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 26/47 (55%), Positives = 35/47 (74%), Gaps = 1/47 (2%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+++ AQY+G I +G+P Q+F VIFDTGSSNLW+PS C +IAC
Sbjct: 77 LKNYMNAQYFGEIGVGSPQQKFTVIFDTGSSNLWVPSSKC-YFSIAC 122
Score = 35.8 bits (81), Expect = 6.5, Method: Compositional matrix adjust.
Identities = 15/43 (34%), Positives = 25/43 (58%)
Query: 131 VIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
++ +N+L +P G + VDC +L MP++ +GGK F L
Sbjct: 407 ILNYINQLCERLPSPMGESAVDCGSLASMPDIAFTIGGKKFKL 449
>gi|443927046|gb|ELU45582.1| endopeptidase [Rhizoctonia solani AG-1 IA]
Length = 934
Score = 85.1 bits (209), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 44/116 (37%), Positives = 66/116 (56%), Gaps = 2/116 (1%)
Query: 60 FR-NASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI-EKNVYCSNCQAI 117
FR +S+++GGE +FGG+D+ + G I Y PV R+ YW+ + S+ + E + N A
Sbjct: 270 FRVGSSEQDGGEAVFGGIDESHYKGKIHYVPVRRQAYWEVELSSVSLGEDTLELENTGAA 329
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
DTGTSLI P+ + +N IGA NG V CD + +P++ GGK + L
Sbjct: 330 IDTGTSLIALPTDIAEMINAQIGASRSWNGQYTVPCDKVPSLPDLTFQFGGKPYAL 385
Score = 62.0 bits (149), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 26/41 (63%), Positives = 33/41 (80%), Gaps = 2/41 (4%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQYY I+LG+PPQ FKV+ DTGSSNLW+P + C+ +IAC
Sbjct: 129 AQYYADITLGSPPQSFKVVLDTGSSNLWVPGKSCT--SIAC 167
>gi|156846613|ref|XP_001646193.1| hypothetical protein Kpol_1013p6 [Vanderwaltozyma polyspora DSM
70294]
gi|156116867|gb|EDO18335.1| hypothetical protein Kpol_1013p6 [Vanderwaltozyma polyspora DSM
70294]
Length = 402
Score = 85.1 bits (209), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 46/110 (41%), Positives = 61/110 (55%), Gaps = 1/110 (0%)
Query: 66 ENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVY-CSNCQAIADTGTSL 124
E+GG++ FGG D F GDIT+ PV RK YW+ ++I + V N A DTGTSL
Sbjct: 237 EDGGQVTFGGYDSSLFTGDITWLPVRRKAYWEVKFDAIALGNEVADLVNHGAAIDTGTSL 296
Query: 125 IIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
I PS + +N IGA +G VDC D +P++ G NFT+T
Sbjct: 297 ITLPSGLAEVINSQIGAKKSWSGQWIVDCKTRDTLPDMTFTFDGYNFTIT 346
Score = 58.5 bits (140), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 27/41 (65%), Positives = 32/41 (78%), Gaps = 2/41 (4%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQYY I+LGTP Q FKVI DTGSSNLW+PS C+ ++AC
Sbjct: 87 AQYYTDITLGTPAQSFKVILDTGSSNLWVPSVDCN--SLAC 125
>gi|164602|gb|AAA31095.1| pepsinogen precursor [Sus scrofa]
Length = 386
Score = 84.7 bits (208), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 44/129 (34%), Positives = 78/129 (60%), Gaps = 6/129 (4%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-K 107
V +D F S + +++D++G ++ GG+D + G + + PVS +GYWQ ++SI ++ +
Sbjct: 205 VSQDLF---SVYLSSNDDSGSVVLLGGIDSSYYTGSLNWVPVSVEGYWQITLDSITMDGE 261
Query: 108 NVYCS-NCQAIADTGTSLIIGPSKVIA-ELNKLIGAVPLANGPAKVDCDNLDKMPNVDII 165
+ CS CQAI DTGTSL+ GP+ IA + IGA ++G + C ++D +P++
Sbjct: 262 TIACSGGCQAIVDTGTSLLTGPTSAIAINIQSDIGASENSDGEMVISCSSIDSLPDIVFT 321
Query: 166 LGGKNFTLT 174
+ G + L+
Sbjct: 322 INGVQYPLS 330
Score = 63.2 bits (152), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 29/54 (53%), Positives = 41/54 (75%), Gaps = 3/54 (5%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKF 54
+ + +Y+GTI +GTP Q+F VIFDTGSSNLW+PS +CS ++ACS D ++F
Sbjct: 65 LENYLDTEYFGTIGIGTPAQDFTVIFDTGSSNLWVPSVYCS--SLACS-DHNQF 115
>gi|255567717|ref|XP_002524837.1| Aspartic proteinase precursor, putative [Ricinus communis]
gi|223535897|gb|EEF37557.1| Aspartic proteinase precursor, putative [Ricinus communis]
Length = 456
Score = 84.7 bits (208), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 43/102 (42%), Positives = 63/102 (61%), Gaps = 3/102 (2%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV--YCSN-CQAI 117
RN E GGEI+FGG+D + + G TY PV++KGYWQF + + I + C++ C+AI
Sbjct: 228 RNVQAEEGGEIVFGGMDPNHYKGQHTYVPVTQKGYWQFDMGEVLIGNEITGLCADGCKAI 287
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKM 159
AD+GTSL+ GP+ VI ++N IGA + + K + K
Sbjct: 288 ADSGTSLLAGPTTVITQINHAIGASGIVSQECKTVVEQYGKF 329
Score = 67.0 bits (162), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 29/47 (61%), Positives = 36/47 (76%), Gaps = 1/47 (2%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+++ AQYYG I++GTPPQ F VIFDTGSSNLWIPS C ++AC
Sbjct: 76 LKNYLDAQYYGEIAIGTPPQTFTVIFDTGSSNLWIPSSKC-YFSVAC 121
Score = 42.0 bits (97), Expect = 0.094, Method: Compositional matrix adjust.
Identities = 19/45 (42%), Positives = 27/45 (60%)
Query: 130 KVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
+++ +NKL +P NG + VDC +L MP V +GGK F LT
Sbjct: 405 QILNYVNKLCDRLPSPNGESAVDCSSLSSMPIVSFTIGGKAFKLT 449
>gi|451853159|gb|EMD66453.1| hypothetical protein COCSADRAFT_34972 [Cochliobolus sativus ND90Pr]
Length = 399
Score = 84.7 bits (208), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 45/116 (38%), Positives = 62/116 (53%), Gaps = 1/116 (0%)
Query: 59 HFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVY-CSNCQAI 117
+ + D E FGG+D+ + G +T P+ RK YW+ +++I K N AI
Sbjct: 227 YLGDTKDGKESEATFGGIDESHYTGKLTKLPLRRKAYWEVDLDAITFGKETAEMENIGAI 286
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
DTGTSLI PS + LNK IGA NG V+C+ D +PN+ L G NFT+
Sbjct: 287 LDTGTSLIALPSAIAELLNKEIGAKKGFNGQYSVECNKRDSLPNLTFTLTGHNFTI 342
Score = 67.0 bits (162), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 37/83 (44%), Positives = 47/83 (56%), Gaps = 4/83 (4%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASD- 65
AQY+ ISLGTPPQ FKVI DTGSSNLW+PS CS + D ++ +N S+
Sbjct: 85 AQYFSDISLGTPPQSFKVILDTGSSNLWVPSTECSSIACYLHTKYDSSASSTYKKNGSEF 144
Query: 66 --ENGGEIMFGGVDKDKF-VGDI 85
G + G V D F +GD+
Sbjct: 145 EIRYGSGSLSGFVSNDVFQIGDL 167
>gi|426251808|ref|XP_004019613.1| PREDICTED: pregnancy-associated glycoprotein 1-like [Ovis aries]
Length = 381
Score = 84.7 bits (208), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 44/110 (40%), Positives = 71/110 (64%), Gaps = 5/110 (4%)
Query: 67 NGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-KNVYCSN-CQAIADTGTSL 124
NG +M GGVDK + G + + P+SR G W+ ++ I ++ K + CS C+A+ DTGTSL
Sbjct: 216 NGSVLMLGGVDKAYYKGQLNWIPLSRVGDWRINLDHISMKGKLIGCSGGCEALVDTGTSL 275
Query: 125 IIGPSKVIAELNKLIGAVPLANGPAK-VDCDNLDKMPNVDIILGGKNFTL 173
I GP++++ + +LIGA+PL GP V C ++ +P++ + G N+TL
Sbjct: 276 INGPTRLVTNIQRLIGAMPL--GPKHYVSCSTINILPSIIFTINGINYTL 323
Score = 60.1 bits (144), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 23/41 (56%), Positives = 32/41 (78%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCS 41
MR+ Y G I++GTPPQ+F+V+FDTGSS+LW+PS C+
Sbjct: 63 MRNFLDTAYVGNIAIGTPPQQFRVVFDTGSSDLWVPSNFCT 103
>gi|393215979|gb|EJD01470.1| aspartic peptidase A1 [Fomitiporia mediterranea MF3/22]
Length = 412
Score = 84.7 bits (208), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 46/116 (39%), Positives = 64/116 (55%), Gaps = 2/116 (1%)
Query: 60 FR-NASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVY-CSNCQAI 117
FR +S+E+GGE +FGG+D+ + G I Y+PV RK YW+ + + +V N A
Sbjct: 240 FRLGSSEEDGGEAVFGGIDESAYSGKINYAPVRRKAYWEVELPKVAFGDDVLELENTGAA 299
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
DTGTSLI PS V LN IGA NG VDC + +P+ + G+ + L
Sbjct: 300 IDTGTSLIALPSDVAEMLNAQIGATKSWNGQYTVDCKKVPDLPDFTLWFNGQAYPL 355
Score = 69.7 bits (169), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 38/83 (45%), Positives = 48/83 (57%), Gaps = 4/83 (4%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASD- 65
AQY+ TI+LGTPPQEFKVI DTGSSNLW+PS C+ + D +H +N +
Sbjct: 99 AQYFTTITLGTPPQEFKVILDTGSSNLWVPSTKCTSIACFLHAKYDSSASSTHKKNGTSF 158
Query: 66 --ENGGEIMFGGVDKDKF-VGDI 85
E G M G V D +GD+
Sbjct: 159 KIEYGSGSMEGFVSNDVLSIGDL 181
>gi|114637856|ref|XP_001145457.1| PREDICTED: pepsin A-5 isoform 6 [Pan troglodytes]
Length = 388
Score = 84.7 bits (208), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 44/125 (35%), Positives = 73/125 (58%), Gaps = 5/125 (4%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-K 107
V +D F S + +A D++G ++FGG+D + G + + PV+ +GYWQ V+SI + +
Sbjct: 208 VSQDLF---SVYLSADDKSGSVVIFGGIDSSYYTGSLNWVPVTVEGYWQITVDSITMNGE 264
Query: 108 NVYCSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
+ C+ CQAI DTGTSL+ GP+ IA + IGA ++G V C + +P++ +
Sbjct: 265 AIACAEGCQAIVDTGTSLLTGPTSPIANIQSDIGASENSDGDMVVSCSAISSLPDIVFTI 324
Query: 167 GGKNF 171
G +
Sbjct: 325 NGVQY 329
Score = 60.8 bits (146), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 26/41 (63%), Positives = 35/41 (85%), Gaps = 2/41 (4%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACS 48
+Y+GTI +GTP Q+F VIFDTGSSNLW+PS +CS ++AC+
Sbjct: 75 EYFGTIGIGTPAQDFTVIFDTGSSNLWVPSVYCS--SLACT 113
>gi|30794284|ref|NP_851337.1| chymosin preproprotein [Bos taurus]
gi|162860|gb|AAA30448.1| preprochymosin b [Bos taurus]
Length = 381
Score = 84.7 bits (208), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 48/125 (38%), Positives = 64/125 (51%), Gaps = 4/125 (3%)
Query: 54 FVGHSHFRNASDENGGEIMF--GGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYC 111
V F D NG E M G ++ + G + + PV+ + YWQF V+S+ I V
Sbjct: 205 LVAQDLFSVYMDRNGQESMLTLGAINPSYYTGSLHWVPVTVQQYWQFTVDSVTISGVVVA 264
Query: 112 SN--CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGK 169
CQAI DTGTS ++GPS I + + IGA G +DCDNL MP V + GK
Sbjct: 265 CEGGCQAILDTGTSKLVGPSSDILNIQQAIGATQNQYGEFDIDCDNLSYMPTVVFEINGK 324
Query: 170 NFTLT 174
+ LT
Sbjct: 325 MYPLT 329
Score = 58.9 bits (141), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 23/34 (67%), Positives = 29/34 (85%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHC 40
+QY+G I LGTPPQEF V+FDTGSS+ W+PS +C
Sbjct: 72 SQYFGKIYLGTPPQEFTVLFDTGSSDFWVPSIYC 105
>gi|413946558|gb|AFW79207.1| hypothetical protein ZEAMMB73_486493 [Zea mays]
Length = 382
Score = 84.7 bits (208), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 40/84 (47%), Positives = 59/84 (70%), Gaps = 3/84 (3%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV--YCS-NCQAI 117
R+A + GGEI+FGG+D + GD T+ PV+RKGYWQF + + ++ +C+ C AI
Sbjct: 101 RHADEGEGGEIVFGGMDSSHYKGDHTFVPVTRKGYWQFNMGDVLVDGKSTGFCAGGCAAI 160
Query: 118 ADTGTSLIIGPSKVIAELNKLIGA 141
AD+GTSL+ GP+ +I E+N+ IGA
Sbjct: 161 ADSGTSLLAGPTAIITEINEKIGA 184
Score = 35.8 bits (81), Expect = 7.5, Method: Compositional matrix adjust.
Identities = 15/43 (34%), Positives = 25/43 (58%)
Query: 131 VIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
++ +N+L +P G + VDC +L MP++ +GGK F L
Sbjct: 279 ILNYINQLCERLPSPMGESAVDCGSLASMPDIAFTIGGKKFKL 321
>gi|190014574|dbj|BAG48265.1| pepsinogen 3 [Thunnus orientalis]
Length = 383
Score = 84.7 bits (208), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 42/112 (37%), Positives = 66/112 (58%), Gaps = 3/112 (2%)
Query: 66 ENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYC-SNCQAIADTGT 122
+ G + FG VD + GDI ++PV+ + YWQ GVE +I ++ +C S CQAI DTGT
Sbjct: 213 KEGSVLSFGTVDTSMYRGDIYWTPVTSESYWQIGVEGFQINGQETGWCRSGCQAIVDTGT 272
Query: 123 SLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
S++ PS+ ++ + + IGA NG VDC+ ++ +P +L G L+
Sbjct: 273 SMLTAPSQFMSSIMQAIGAQESDNGMFMVDCNQVNNLPTFYFVLSGVPLPLS 324
Score = 58.2 bits (139), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 26/43 (60%), Positives = 33/43 (76%), Gaps = 2/43 (4%)
Query: 9 YYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDK 51
YYG IS+GTPPQ F+V+FDTGSSNLW+ S +C+ AC+ K
Sbjct: 67 YYGAISIGTPPQSFQVLFDTGSSNLWVDSVYCNTQ--ACNTHK 107
>gi|344295434|ref|XP_003419417.1| PREDICTED: pepsin A-2/A-3-like [Loxodonta africana]
Length = 384
Score = 84.7 bits (208), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 43/115 (37%), Positives = 71/115 (61%), Gaps = 5/115 (4%)
Query: 63 ASDENGGEI-MFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-KNVYCSN-CQAIAD 119
+SDE GG + +FGG+D + G + + PVS +GYWQ ++S+ I+ ++V CS+ CQAI D
Sbjct: 216 SSDEEGGSVVIFGGIDSSYYTGSLNWVPVSYEGYWQITLDSVSIDGESVACSDTCQAIID 275
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
TGTSL+ GP+ IA + + +G + +V C D +PN+ + G + ++
Sbjct: 276 TGTSLLAGPTTAIANIQEYLGLGD--SSEEEVSCSTADSLPNIVFTINGVQYPVS 328
Score = 63.5 bits (153), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 27/42 (64%), Positives = 35/42 (83%), Gaps = 2/42 (4%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACS 48
+Y+GTI +GTP QEF VIFDTGSSNLW+PS +CS ++AC+
Sbjct: 72 VEYFGTIGIGTPAQEFTVIFDTGSSNLWVPSTYCS--SLACT 111
>gi|301622166|ref|XP_002940408.1| PREDICTED: renin-like [Xenopus (Silurana) tropicalis]
Length = 371
Score = 84.7 bits (208), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 40/116 (34%), Positives = 65/116 (56%), Gaps = 2/116 (1%)
Query: 60 FRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAI 117
+R++ + GGEI+ GG D + G Y + ++GYW ++ + I + +C + C
Sbjct: 195 YRDSHLKPGGEIILGGSDPSYYTGSFQYLNLEKEGYWHIRMKGVSIGAEILFCKDGCSVA 254
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
DTG + I GP+ ++ L K IGA LA G VDCD + ++P+V +GG +TL
Sbjct: 255 IDTGAAYITGPASSVSVLMKAIGATELAEGEYTVDCDKISQLPDVSFHMGGNEYTL 310
Score = 68.9 bits (167), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 28/40 (70%), Positives = 34/40 (85%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
QY+G IS+G+PPQ FKV+FDTGS+NLW+PSQ CS L AC
Sbjct: 48 QYFGEISIGSPPQTFKVVFDTGSANLWVPSQRCSPLYSAC 87
>gi|195430468|ref|XP_002063276.1| GK21477 [Drosophila willistoni]
gi|194159361|gb|EDW74262.1| GK21477 [Drosophila willistoni]
Length = 402
Score = 84.7 bits (208), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 40/114 (35%), Positives = 61/114 (53%), Gaps = 1/114 (0%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN-CQAIAD 119
R+ + + GG + FGG + + G+ TY PVSR+ YWQ ++S I+ C C+ I D
Sbjct: 227 RDVAAKEGGALFFGGSNPQYYTGNFTYVPVSRRSYWQITMDSAHIKDLNLCEQGCEVIID 286
Query: 120 TGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
TGTS + P +NK IG P + G + C+ + +P + LGG+ F L
Sbjct: 287 TGTSFLAMPYDQAMLINKSIGGTPSSYGMFSIPCEQVPHLPTMTFQLGGRKFHL 340
Score = 68.9 bits (167), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 31/54 (57%), Positives = 39/54 (72%), Gaps = 7/54 (12%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHF 60
AQY+G I++GTPPQ FKVIFDTGS+NLW+PS CS ++AC + HS F
Sbjct: 80 AQYFGPITIGTPPQIFKVIFDTGSANLWVPSTSCSPASVACMI-------HSRF 126
>gi|194376184|dbj|BAG62851.1| unnamed protein product [Homo sapiens]
Length = 255
Score = 84.7 bits (208), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 55/164 (33%), Positives = 79/164 (48%), Gaps = 42/164 (25%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASDEN 67
+Y+GTI +GTP Q+F V+FDTGSSNLW+PS +CS ++AC+ H+ F N D +
Sbjct: 75 EYFGTIGIGTPAQDFTVLFDTGSSNLWVPSVYCS--SLACT-------NHNRF-NPEDSS 124
Query: 68 GGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSNCQAIADTGTSLIIG 127
TY S +G CQAI DTGTSL+ G
Sbjct: 125 ------------------TYQSTSETVSITYG--------------CQAIVDTGTSLLTG 152
Query: 128 PSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNF 171
P+ IA + IGA ++G V C + +P++ + G +
Sbjct: 153 PTSPIANIQSDIGASENSDGDMVVSCSAISSLPDIVFTINGVQY 196
>gi|148709428|gb|EDL41374.1| pepsinogen 5, group I [Mus musculus]
Length = 387
Score = 84.7 bits (208), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 40/116 (34%), Positives = 64/116 (55%), Gaps = 2/116 (1%)
Query: 58 SHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN--CQ 115
+ + ++ DE G +M GGVD + G++ + PVS+ YWQ V+SI + V + CQ
Sbjct: 212 AFYLSSKDEKGSMLMLGGVDPSYYHGELHWVPVSKPSYWQLAVDSISMNGEVIACDGGCQ 271
Query: 116 AIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNF 171
I DTGTSL+ GP I + LIGA +G + CD ++ +P++ +G +
Sbjct: 272 GIMDTGTSLLTGPRSSIVNIQNLIGAKASGDGEYFLKCDTINTLPDIVFTIGSVTY 327
Score = 59.3 bits (142), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 28/51 (54%), Positives = 36/51 (70%), Gaps = 2/51 (3%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDK 51
MR+ Y G IS+GTPPQEF+V+ DTGSS LW+PS +CS + AC+ K
Sbjct: 66 MRNYLDLVYIGIISIGTPPQEFRVVLDTGSSVLWVPSIYCS--SPACAHHK 114
>gi|31981154|ref|NP_067428.2| pepsinogen 5, group I precursor [Mus musculus]
gi|12835164|dbj|BAB23174.1| unnamed protein product [Mus musculus]
gi|26343895|dbj|BAC35604.1| unnamed protein product [Mus musculus]
gi|74223201|dbj|BAE40737.1| unnamed protein product [Mus musculus]
gi|111601447|gb|AAI19524.1| Pepsinogen 5, group I [Mus musculus]
Length = 387
Score = 84.7 bits (208), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 40/116 (34%), Positives = 64/116 (55%), Gaps = 2/116 (1%)
Query: 58 SHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN--CQ 115
+ + ++ DE G +M GGVD + G++ + PVS+ YWQ V+SI + V + CQ
Sbjct: 212 AFYLSSKDEKGSMLMLGGVDPSYYHGELHWVPVSKPSYWQLAVDSISMNGEVIACDGGCQ 271
Query: 116 AIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNF 171
I DTGTSL+ GP I + LIGA +G + CD ++ +P++ +G +
Sbjct: 272 GIMDTGTSLLTGPRSSIVNIQNLIGAKASGDGEYFLKCDTINTLPDIVFTIGSVTY 327
Score = 59.3 bits (142), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 28/51 (54%), Positives = 36/51 (70%), Gaps = 2/51 (3%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDK 51
MR+ Y G IS+GTPPQEF+V+ DTGSS LW+PS +CS + AC+ K
Sbjct: 66 MRNYLDLVYIGIISIGTPPQEFRVVLDTGSSVLWVPSIYCS--SPACAHHK 114
>gi|354508489|gb|AER26913.1| prochymosin [synthetic construct]
Length = 365
Score = 84.7 bits (208), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 48/125 (38%), Positives = 64/125 (51%), Gaps = 4/125 (3%)
Query: 54 FVGHSHFRNASDENGGEIMF--GGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYC 111
V F D NG E M G ++ + G + + PV+ + YWQF V+S+ I V
Sbjct: 189 LVAQDLFSVYMDRNGQESMLTLGAINPSYYTGSLHWVPVTVQQYWQFTVDSVTISGVVVA 248
Query: 112 SN--CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGK 169
CQAI DTGTS ++GPS I + + IGA G +DCDNL MP V + GK
Sbjct: 249 CEGGCQAILDTGTSKLVGPSSDILNIQQAIGATQNQYGEFDIDCDNLSYMPTVVFEINGK 308
Query: 170 NFTLT 174
+ LT
Sbjct: 309 MYPLT 313
Score = 58.9 bits (141), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 23/34 (67%), Positives = 29/34 (85%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHC 40
+QY+G I LGTPPQEF V+FDTGSS+ W+PS +C
Sbjct: 56 SQYFGKIYLGTPPQEFTVLFDTGSSDFWVPSIYC 89
>gi|226506070|ref|NP_001150729.1| aspartic proteinase oryzasin-1 precursor [Zea mays]
gi|195641348|gb|ACG40142.1| aspartic proteinase oryzasin-1 precursor [Zea mays]
Length = 518
Score = 84.7 bits (208), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 40/85 (47%), Positives = 59/85 (69%), Gaps = 3/85 (3%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV--YCS-NCQAI 117
R+A + GGEI+FGG+D + GD TY PV++KGYWQF + + ++ +C+ C AI
Sbjct: 237 RHAGEGEGGEIVFGGMDSSHYKGDHTYVPVTQKGYWQFNMGDVLVDGKSTGFCAGGCAAI 296
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAV 142
AD+GTSL+ GP+ +I E+N+ IGA
Sbjct: 297 ADSGTSLLAGPTAIITEINEKIGAA 321
Score = 64.3 bits (155), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 28/41 (68%), Positives = 33/41 (80%), Gaps = 1/41 (2%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQY+G I +GTPPQ+F VIFDTGSSNLW+PS C +IAC
Sbjct: 91 AQYFGEIGVGTPPQKFTVIFDTGSSNLWVPSSKC-YFSIAC 130
Score = 36.6 bits (83), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 15/43 (34%), Positives = 25/43 (58%)
Query: 131 VIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
++ +N+L +P G + VDC +L MP++ +GGK F L
Sbjct: 415 ILTYINQLCERLPSPMGESAVDCASLGSMPDIAFTIGGKKFKL 457
>gi|311621988|gb|ADP95550.1| ASP-3 [Caenorhabditis sp. 5 AC-2008]
Length = 180
Score = 84.7 bits (208), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 42/115 (36%), Positives = 66/115 (57%), Gaps = 2/115 (1%)
Query: 61 RNASD-ENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN-CQAIA 118
R+ +D NGGEI D + + G+I + P+ + YW+ + ++ I Y + +I
Sbjct: 42 RDVNDIANGGEITLCDTDSNHYSGNIAWEPLVSEDYWRIKLGAVSISGTTYTNGPMDSIV 101
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
DTGTSL+ GPS +I ++ IG +PL NG +V C + +PN+ LGG+NF L
Sbjct: 102 DTGTSLLTGPSDIIKKIQHKIGGIPLINGEYEVVCSKIPSLPNITFTLGGQNFDL 156
>gi|392586802|gb|EIW76137.1| Asp-domain-containing protein [Coniophora puteana RWD-64-598 SS2]
Length = 409
Score = 84.7 bits (208), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 45/116 (38%), Positives = 63/116 (54%), Gaps = 2/116 (1%)
Query: 60 FR-NASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKN-VYCSNCQAI 117
FR +S+ +GGE +FGG+D + GDI Y PV RK YW+ +E + + + + A
Sbjct: 237 FRLGSSESDGGEAVFGGIDHSAYTGDIEYVPVRRKAYWEVELEKVSFGGDELELESTGAA 296
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
DTGTSLI P+ V LN IGA NG +DC + +P+ GGK + L
Sbjct: 297 IDTGTSLIALPTDVAEMLNTQIGAKRSWNGQYTIDCSKVPSLPDFTFYFGGKPYPL 352
Score = 58.5 bits (140), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 34/86 (39%), Positives = 47/86 (54%), Gaps = 4/86 (4%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASD- 65
AQY+ I LG+P Q FKVI DTGSSNLW+PS C+ + D S+ N ++
Sbjct: 96 AQYFTEIELGSPAQTFKVILDTGSSNLWVPSAQCTSIACFLHAKYDSSSSASYKANGTEF 155
Query: 66 --ENGGEIMFGGVDKDKF-VGDITYS 88
+ G M G V +D +GD++ S
Sbjct: 156 SIQYGTGSMEGFVSQDTLKIGDVSIS 181
>gi|11493777|gb|AAG35646.1|AF203473_1 progastricsin [Salvelinus fontinalis]
Length = 383
Score = 84.7 bits (208), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 50/168 (29%), Positives = 84/168 (50%), Gaps = 7/168 (4%)
Query: 13 ISLGTPPQEFKVIFDTGSSNLWIPS----QHCSILNIACSVDKDKFVGHSHFRNASDENG 68
+S P Q F V G L PS Q +++ S + + + + + G
Sbjct: 158 LSTDEPGQNFVVAQFDGILGLSYPSISAGQETPVMDNMMSQNLLQANIFAFYMTRDGQQG 217
Query: 69 GEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCSN-CQAIADTGTSLI 125
E+ FG VD K+ G I ++PV+ + YWQ G++ +I ++ +C CQAI DTGTS++
Sbjct: 218 SELSFGEVDNTKYQGQIYWTPVTSQTYWQIGIQGFQINGQETGWCGQGCQAIVDTGTSML 277
Query: 126 IGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
P +++ L + IGA G V+C+ ++ +P + + G NF L
Sbjct: 278 TAPRQIMGTLMQSIGAQQDQYGQYTVNCNQINSLPTLTFTINGINFPL 325
Score = 53.5 bits (127), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 21/33 (63%), Positives = 28/33 (84%)
Query: 9 YYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCS 41
YYG I++GTPPQ F+V+FDTGS+NLW+ S C+
Sbjct: 69 YYGAITIGTPPQSFQVLFDTGSANLWVDSVLCN 101
>gi|307170557|gb|EFN62764.1| Lysosomal aspartic protease [Camponotus floridanus]
Length = 349
Score = 84.7 bits (208), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 40/82 (48%), Positives = 55/82 (67%), Gaps = 1/82 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYC-SNCQAIAD 119
R+ S + GGE++ GG D + GD TY PV+ KGYWQ ++ IKI+ + C +CQAI D
Sbjct: 184 RDTSADLGGELILGGSDPAYYEGDFTYIPVTHKGYWQISIDRIKIKSDNLCEESCQAIVD 243
Query: 120 TGTSLIIGPSKVIAELNKLIGA 141
TG+SLI+GP IA++N IG
Sbjct: 244 TGSSLILGPKLDIAKINTFIGV 265
Score = 43.5 bits (101), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 21/55 (38%), Positives = 31/55 (56%), Gaps = 9/55 (16%)
Query: 17 TPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASDENGGEI 71
TP QEFKV+FDTGS++LW+ S+ C+ + GH+ + +A E I
Sbjct: 46 TPQQEFKVMFDTGSADLWVLSKKCTF---------PIYSGHNQYDSARSETYNNI 91
>gi|7341306|gb|AAF61241.1|AF240776_1 pepsinogen F [Mus musculus]
Length = 387
Score = 84.7 bits (208), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 40/116 (34%), Positives = 64/116 (55%), Gaps = 2/116 (1%)
Query: 58 SHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN--CQ 115
+ + ++ DE G +M GGVD + G++ + PVS+ YWQ V+SI + V + CQ
Sbjct: 212 AFYLSSKDEKGSMLMLGGVDPSYYHGELHWVPVSKPSYWQLAVDSISMNGEVIACDGGCQ 271
Query: 116 AIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNF 171
I DTGTSL+ GP I + LIGA +G + CD ++ +P++ +G +
Sbjct: 272 GIMDTGTSLLTGPRSSIVNIQNLIGAKASGDGEYFLKCDTINTLPDIVFTIGSVTY 327
Score = 59.3 bits (142), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 28/51 (54%), Positives = 36/51 (70%), Gaps = 2/51 (3%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDK 51
MR+ Y G IS+GTPPQEF+V+ DTGSS LW+PS +CS + AC+ K
Sbjct: 66 MRNYLDLVYIGIISIGTPPQEFRVVLDTGSSVLWVPSIYCS--SPACAHHK 114
>gi|109287598|emb|CAJ55261.1| renin-like aspartic protease [Echis ocellatus]
Length = 395
Score = 84.7 bits (208), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 42/119 (35%), Positives = 67/119 (56%), Gaps = 3/119 (2%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAIA 118
R++ GGEI+ GG D + GD Y SR+GYW ++ + IE + C + C A
Sbjct: 223 RHSESNTGGEIILGGSDPSHYTGDFHYVSTSREGYWHVDLKGVSIENKIALCHDGCTATI 282
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLTRTN 177
DTGTS I GP+ I+ L + IGA L+ G +DC+ ++ +P++ LG ++L+ +
Sbjct: 283 DTGTSFISGPASSISVLMETIGAT-LSRGDYVIDCNQINLLPDISFHLGDMTYSLSSST 340
Score = 66.2 bits (160), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 29/41 (70%), Positives = 32/41 (78%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
QYYG IS+GTP Q FKV+FDTGSSNLW+PS CS L AC
Sbjct: 77 TQYYGEISIGTPAQIFKVVFDTGSSNLWVPSHQCSPLYSAC 117
>gi|58396912|gb|AAW72828.1| prochymosin [synthetic construct]
Length = 366
Score = 84.7 bits (208), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 48/126 (38%), Positives = 64/126 (50%), Gaps = 4/126 (3%)
Query: 53 KFVGHSHFRNASDENGGEIMF--GGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVY 110
+ V F D NG M G +D + G + + PV+ + YWQF V+S+ I V
Sbjct: 189 RLVAQDLFSVYMDRNGQGSMLTLGAIDPSYYTGSLHWVPVTLQKYWQFTVDSVTISGAVV 248
Query: 111 C--SNCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGG 168
CQAI DTGTS ++GPS I + + IGA G VDCD+L MP V + G
Sbjct: 249 ACEGGCQAILDTGTSKLVGPSSDILNIQQAIGATQNQYGEFDVDCDSLSSMPTVVFEING 308
Query: 169 KNFTLT 174
K + LT
Sbjct: 309 KMYPLT 314
Score = 59.3 bits (142), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 23/34 (67%), Positives = 29/34 (85%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHC 40
+QY+G I LGTPPQEF V+FDTGSS+ W+PS +C
Sbjct: 57 SQYFGKIYLGTPPQEFTVLFDTGSSDFWVPSIYC 90
>gi|311621962|gb|ADP95537.1| ASP-3 [Caenorhabditis sp. 5 AC-2008]
gi|311621968|gb|ADP95540.1| ASP-3 [Caenorhabditis sp. 5 AC-2008]
gi|311621976|gb|ADP95544.1| ASP-3 [Caenorhabditis sp. 5 AC-2008]
gi|311621986|gb|ADP95549.1| ASP-3 [Caenorhabditis sp. 5 AC-2008]
gi|375581440|gb|AFA55976.1| aspartyl protease, partial [Caenorhabditis sp. 5 AC-2008]
gi|375581442|gb|AFA55977.1| aspartyl protease, partial [Caenorhabditis sp. 5 AC-2008]
gi|375581468|gb|AFA55990.1| aspartyl protease, partial [Caenorhabditis sp. 5 AC-2008]
gi|375581470|gb|AFA55991.1| aspartyl protease, partial [Caenorhabditis sp. 5 AC-2008]
Length = 224
Score = 84.3 bits (207), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 42/115 (36%), Positives = 66/115 (57%), Gaps = 2/115 (1%)
Query: 61 RNASD-ENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN-CQAIA 118
R+ +D NGGEI D + + G+I + P+ + YW+ + ++ I Y + +I
Sbjct: 58 RDVNDIANGGEITLCDTDSNHYSGNIAWEPLVSEDYWRIKLGAVSISGTTYTNGPMDSIV 117
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
DTGTSL+ GPS +I ++ IG +PL NG +V C + +PN+ LGG+NF L
Sbjct: 118 DTGTSLLTGPSDIIKKIQHKIGGIPLINGEYEVVCSKIPSLPNITFTLGGQNFDL 172
>gi|119491657|ref|XP_001263323.1| aspartic endopeptidase Pep2 [Neosartorya fischeri NRRL 181]
gi|119411483|gb|EAW21426.1| aspartic endopeptidase Pep2 [Neosartorya fischeri NRRL 181]
Length = 398
Score = 84.3 bits (207), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 44/105 (41%), Positives = 60/105 (57%), Gaps = 1/105 (0%)
Query: 70 EIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVY-CSNCQAIADTGTSLIIGP 128
E FGGVDK+ + G++T P+ RK YW+ ++I + NV N I DTGTSLI P
Sbjct: 237 EASFGGVDKNHYTGELTKIPLRRKAYWEVDFDAIALGDNVAELENTGVILDTGTSLIALP 296
Query: 129 SKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
S + LNK IGA G ++CD D +P++ L G NFT+
Sbjct: 297 STLADLLNKEIGAKKGFTGQYSIECDKRDSLPDLTFTLAGHNFTI 341
Score = 63.9 bits (154), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 28/41 (68%), Positives = 33/41 (80%), Gaps = 2/41 (4%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQY+ ISLGTPPQ+FKV+ DTGSSNLW+P CS +IAC
Sbjct: 83 AQYFSEISLGTPPQKFKVVLDTGSSNLWVPGSDCS--SIAC 121
>gi|357480353|ref|XP_003610462.1| Aspartic proteinase [Medicago truncatula]
gi|355511517|gb|AES92659.1| Aspartic proteinase [Medicago truncatula]
Length = 519
Score = 84.3 bits (207), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 43/84 (51%), Positives = 55/84 (65%), Gaps = 3/84 (3%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE--KNVYCSN-CQAI 117
R +E GGEI+FGGVD + G+ TY PV RKGYWQF + + I+ YC + C AI
Sbjct: 240 RKPEEEEGGEIVFGGVDPAHYKGNHTYVPVKRKGYWQFDMGDVTIDGKSTGYCVDGCSAI 299
Query: 118 ADTGTSLIIGPSKVIAELNKLIGA 141
AD+GTSL+ GP+ VI +N IGA
Sbjct: 300 ADSGTSLLAGPTTVITMINHAIGA 323
Score = 65.5 bits (158), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 28/47 (59%), Positives = 37/47 (78%), Gaps = 1/47 (2%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+++ AQYYG IS+GT PQ+F VIFDTGSSNLW+PS C+ ++AC
Sbjct: 88 LKNYLDAQYYGEISIGTSPQKFTVIFDTGSSNLWVPSSKCT-FSVAC 133
Score = 35.8 bits (81), Expect = 6.4, Method: Compositional matrix adjust.
Identities = 13/44 (29%), Positives = 25/44 (56%)
Query: 130 KVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
+++ +N L +P G + VDC+N+ +P + +GG+ F L
Sbjct: 416 QILTYINNLCDKMPSPMGESSVDCENISSLPVISFTIGGRTFDL 459
>gi|311621984|gb|ADP95548.1| ASP-3 [Caenorhabditis sp. 5 AC-2008]
Length = 208
Score = 84.3 bits (207), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 42/115 (36%), Positives = 66/115 (57%), Gaps = 2/115 (1%)
Query: 61 RNASD-ENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN-CQAIA 118
R+ +D NGGEI D + + G+I + P+ + YW+ + ++ I Y + +I
Sbjct: 42 RDVNDIANGGEITLCDTDSNHYSGNIAWEPLVSEDYWRIKLGAVSISGTTYTNGPMDSIV 101
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
DTGTSL+ GPS +I ++ IG +PL NG +V C + +PN+ LGG+NF L
Sbjct: 102 DTGTSLLTGPSDIIKKIQHKIGGIPLINGEYEVVCSKIPSLPNITFTLGGQNFDL 156
>gi|354504625|ref|XP_003514374.1| PREDICTED: pepsin F-like, partial [Cricetulus griseus]
Length = 316
Score = 84.3 bits (207), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 39/107 (36%), Positives = 62/107 (57%), Gaps = 2/107 (1%)
Query: 58 SHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYC--SNCQ 115
+ + ++ DE G +M GGVD + GD+ + PVS+ YWQ ++SI + V CQ
Sbjct: 141 AFYLSSKDEKGSMLMLGGVDPSYYSGDLHWVPVSKPSYWQLALDSISVNGEVIACEGGCQ 200
Query: 116 AIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNV 162
I DTGTSL+ GP I ++ LIGA ++G + CD ++ +P++
Sbjct: 201 GIMDTGTSLLTGPRNSILNIHNLIGAKASSDGEYVLRCDTINTLPDI 247
Score = 56.6 bits (135), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 26/43 (60%), Positives = 33/43 (76%), Gaps = 2/43 (4%)
Query: 9 YYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDK 51
Y G IS+GTPPQEFKV+ DTGS+ LW+PS +CS + AC+ K
Sbjct: 3 YIGIISIGTPPQEFKVVLDTGSAYLWVPSIYCS--SPACAHHK 43
>gi|344263771|ref|XP_003403969.1| PREDICTED: gastricsin-like [Loxodonta africana]
Length = 389
Score = 84.3 bits (207), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 41/111 (36%), Positives = 66/111 (59%), Gaps = 3/111 (2%)
Query: 66 ENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCSN-CQAIADTGT 122
++GG ++ GGVD + G I ++PV+R+ YWQ G+E I + + +CS CQAI DTGT
Sbjct: 221 QDGGALILGGVDDSLYSGQIYWAPVTRELYWQIGIEEFLIGGQASGWCSQGCQAIVDTGT 280
Query: 123 SLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
SL+ P + +++L + GAV G V+C+++ +P I+ G L
Sbjct: 281 SLLTVPQQYLSDLLQATGAVEDEYGQFLVNCEDIQDLPTFTFIINGAQLPL 331
Score = 59.7 bits (143), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 27/42 (64%), Positives = 33/42 (78%), Gaps = 2/42 (4%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACS 48
A Y+G IS+GTPPQ F V+FDTGSSNLW+PS +C + ACS
Sbjct: 71 AAYFGEISIGTPPQNFLVLFDTGSSNLWVPSVYCQ--SEACS 110
>gi|388856266|emb|CCF50075.1| probable PEP4-aspartyl protease [Ustilago hordei]
Length = 418
Score = 84.3 bits (207), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 42/117 (35%), Positives = 67/117 (57%), Gaps = 1/117 (0%)
Query: 58 SHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI-EKNVYCSNCQA 116
S + +S+++GGE +FGG+D+ + G I ++PV RKGYW+ ++ + + ++ + N A
Sbjct: 245 SFYLGSSEQDGGEAVFGGIDESHYTGKIHWAPVKRKGYWEVALDKLALGDEALELDNGSA 304
Query: 117 IADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
DTGTSLI + LN IGA NG VDC+ + +P + + GK F L
Sbjct: 305 AIDTGTSLIAMATDTAEILNAEIGATKSWNGQYSVDCEKVKDLPPLTFYIDGKPFKL 361
Score = 67.8 bits (164), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 42/86 (48%), Positives = 53/86 (61%), Gaps = 8/86 (9%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDK--DKFVGHSHFRNAS 64
AQY+ ISLGTP QEFKVI DTGSSNLW+PS CS +IAC + K D S+ +N +
Sbjct: 105 AQYFCDISLGTPAQEFKVILDTGSSNLWVPSNKCS--SIACFLHKKYDSSASSSYKKNGT 162
Query: 65 D---ENGGEIMFGGVDKDKF-VGDIT 86
+ + G M G V D +GD+T
Sbjct: 163 EFKIQYGSGSMEGIVSNDVLKIGDLT 188
>gi|356522015|ref|XP_003529645.1| PREDICTED: aspartic proteinase-like [Glycine max]
Length = 514
Score = 84.3 bits (207), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 43/84 (51%), Positives = 56/84 (66%), Gaps = 3/84 (3%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCSN-CQAI 117
R +E GGEI+FGGVD + G TY PV+RKGYWQF + + I + YC++ C AI
Sbjct: 234 RKPEEEEGGEIVFGGVDPAHYKGKHTYVPVTRKGYWQFDMGDVLIGGKPTGYCADGCSAI 293
Query: 118 ADTGTSLIIGPSKVIAELNKLIGA 141
AD+GTSL+ GP+ VI +N IGA
Sbjct: 294 ADSGTSLLAGPTTVITMINHAIGA 317
Score = 63.9 bits (154), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 27/47 (57%), Positives = 37/47 (78%), Gaps = 1/47 (2%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+++ AQYYG I++GT PQ+F VIFDTGSSNLW+PS C+ ++AC
Sbjct: 82 LKNYLDAQYYGEIAIGTSPQKFTVIFDTGSSNLWVPSSKCT-FSVAC 127
Score = 37.0 bits (84), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 14/45 (31%), Positives = 27/45 (60%)
Query: 130 KVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
++++ +N+L +P G + VDC N+ +P V +GG+ F L+
Sbjct: 410 QILSYINQLCDKMPSPMGESAVDCGNISSLPVVSFTIGGRTFELS 454
>gi|130497820|ref|NP_001076165.1| pepsin F precursor [Oryctolagus cuniculus]
gi|129802|sp|P27823.1|PEPAF_RABIT RecName: Full=Pepsin F; Flags: Precursor
gi|165602|gb|AAA31440.1| pepsinogen [Oryctolagus cuniculus]
Length = 388
Score = 84.3 bits (207), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 41/116 (35%), Positives = 66/116 (56%), Gaps = 2/116 (1%)
Query: 58 SHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-KNVYC-SNCQ 115
+ + ++ +E G +M GGVD + GD+ + PVSR YWQ V+ I + + + C S CQ
Sbjct: 212 AFYLSSKEERGSMLMLGGVDPSYYSGDLHWVPVSRPLYWQLAVDRISMNGEAIGCDSGCQ 271
Query: 116 AIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNF 171
I DTGTSL+IGP + + K+I A G +DCD + +P++ + G ++
Sbjct: 272 GIVDTGTSLLIGPRDPVLNIQKIINAQHSHGGEYIIDCDTISTLPDIIFTIDGVDY 327
Score = 59.3 bits (142), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 24/41 (58%), Positives = 32/41 (78%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCS 41
+R+ Y G IS+GTPPQEFKV+ DTGS++LW+PS +CS
Sbjct: 66 LRNYLDLAYIGIISIGTPPQEFKVVLDTGSADLWVPSIYCS 106
>gi|116793748|gb|ABK26865.1| unknown [Picea sitchensis]
Length = 284
Score = 84.3 bits (207), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 42/93 (45%), Positives = 61/93 (65%), Gaps = 3/93 (3%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV--YCS-NCQAI 117
RN+ +E+GGEI+FGGVD + F G+ Y+ V+RKGYWQF + I+ +C+ C AI
Sbjct: 3 RNSDEEDGGEIVFGGVDPNHFKGEHEYASVTRKGYWQFDMGDFLIDNQSTGFCAGGCAAI 62
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAK 150
D+GTSL+ GPS +I ++N IGA + + K
Sbjct: 63 VDSGTSLLAGPSGIITQINNAIGASGIVSQECK 95
Score = 42.4 bits (98), Expect = 0.080, Method: Compositional matrix adjust.
Identities = 18/45 (40%), Positives = 27/45 (60%)
Query: 130 KVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
+++ LN+L +P NG + VDCD + MP V +G K F+LT
Sbjct: 180 QIMTYLNQLCDRLPSPNGESVVDCDQVSSMPTVSFSIGNKTFSLT 224
>gi|356532081|ref|XP_003534602.1| PREDICTED: aspartic proteinase [Glycine max]
Length = 507
Score = 84.3 bits (207), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 41/84 (48%), Positives = 57/84 (67%), Gaps = 3/84 (3%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCS-NCQAI 117
R +ENGGE++FGG D + G TY PV+RKGYWQF + + I + YC+ +C AI
Sbjct: 227 RKPEEENGGELVFGGADPAHYKGKHTYVPVTRKGYWQFDMGDVLIAGKPTGYCADDCSAI 286
Query: 118 ADTGTSLIIGPSKVIAELNKLIGA 141
AD+GTSL+ GP+ V+ +N+ IGA
Sbjct: 287 ADSGTSLLAGPTTVVTMINQAIGA 310
Score = 68.2 bits (165), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 34/72 (47%), Positives = 47/72 (65%), Gaps = 12/72 (16%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHF 60
+++ AQYYG I++GTPPQ+F VIFDTGSSNLW+PS C +IAC + H+ +
Sbjct: 75 LKNYLDAQYYGEIAIGTPPQKFTVIFDTGSSNLWVPSSKC-YFSIACFM-------HARY 126
Query: 61 RNASD----ENG 68
R++ ENG
Sbjct: 127 RSSQSSTYRENG 138
Score = 36.2 bits (82), Expect = 5.3, Method: Compositional matrix adjust.
Identities = 17/45 (37%), Positives = 25/45 (55%)
Query: 130 KVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
++I N+L +P G + VDC+ L MP V +GGK F L+
Sbjct: 403 RIIDYANELCDKLPNPMGQSSVDCEKLSSMPIVSFTIGGKVFDLS 447
>gi|395821525|ref|XP_003784088.1| PREDICTED: chymosin-like [Otolemur garnettii]
Length = 380
Score = 84.3 bits (207), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 41/108 (37%), Positives = 65/108 (60%), Gaps = 2/108 (1%)
Query: 68 GGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI-EKNVYCSN-CQAIADTGTSLI 125
G + FG +D+ + G + + PV+ + YWQF V+ + I + V C+ CQAI DTGTSL+
Sbjct: 221 GSMLTFGAIDQSYYTGSLHWVPVTVQRYWQFTVDRVTINDVVVACAGGCQAILDTGTSLL 280
Query: 126 IGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
+GP++ I + K IGA P G ++C +L +MP + + GK + L
Sbjct: 281 LGPTRDIYYIQKAIGATPGRFGLFNINCKSLSRMPTIVFEIHGKKYPL 328
Score = 60.1 bits (144), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 22/34 (64%), Positives = 30/34 (88%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHC 40
QY+GT+S+GTPPQ+F V+FDTGSS+ W+PS +C
Sbjct: 72 VQYFGTVSIGTPPQKFTVVFDTGSSDFWVPSVYC 105
>gi|164657049|ref|XP_001729651.1| hypothetical protein MGL_3195 [Malassezia globosa CBS 7966]
gi|159103544|gb|EDP42437.1| hypothetical protein MGL_3195 [Malassezia globosa CBS 7966]
Length = 419
Score = 84.3 bits (207), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 47/126 (37%), Positives = 67/126 (53%), Gaps = 4/126 (3%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI-EK 107
+D+++F + +S+ GGE FGGVD +F G I Y+PV R+GYW+ + I +
Sbjct: 238 LDQNQF---GFYLGSSESEGGEATFGGVDPSRFEGPIVYAPVRRRGYWEVALNKIGFGNE 294
Query: 108 NVYCSNCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILG 167
+ + A DTGTSLI P+ V LNK IGA G VDC + +P + L
Sbjct: 295 ELVLTRTGAAIDTGTSLIAMPTDVAEILNKEIGAKRSWTGQYSVDCSKVPSLPALTFYLD 354
Query: 168 GKNFTL 173
K +TL
Sbjct: 355 NKPYTL 360
Score = 64.7 bits (156), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 37/85 (43%), Positives = 52/85 (61%), Gaps = 8/85 (9%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDK--DKFVGHSHFRNAS 64
AQY+ I LG+PPQ FKVI DTGS+NLW+PS+ C+ +IAC + K D + ++ N S
Sbjct: 104 AQYFADIELGSPPQSFKVILDTGSANLWVPSESCT--SIACLLHKKYDNSLSKTYQANGS 161
Query: 65 D---ENGGEIMFGGVDKDKF-VGDI 85
+ G M G V +D +GD+
Sbjct: 162 EFQIHYGSGSMEGFVSRDTLRIGDL 186
>gi|281339613|gb|EFB15197.1| hypothetical protein PANDA_017311 [Ailuropoda melanoleuca]
Length = 318
Score = 84.3 bits (207), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 39/109 (35%), Positives = 60/109 (55%), Gaps = 3/109 (2%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV--YCSN-CQAI 117
R + E GGE++ GGVD + G+I ++PV+R+ YWQ ++ + CS CQAI
Sbjct: 148 RQPTYEYGGELILGGVDSQFYSGEIVWTPVTREMYWQIAIDEFLVSNQATGLCSQGCQAI 207
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
DTGT ++ P + I + GA NG VDCD++ MP + ++
Sbjct: 208 VDTGTYMLAVPQQFIGSFLQTTGAQEAQNGDFVVDCDSIQSMPTITFVI 256
Score = 60.1 bits (144), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 24/35 (68%), Positives = 30/35 (85%)
Query: 6 QAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHC 40
Q+ Y+G IS+GTPPQ F V+FDTGSSNLW+PS +C
Sbjct: 2 QSYYFGEISIGTPPQNFLVLFDTGSSNLWVPSTYC 36
>gi|311621980|gb|ADP95546.1| ASP-3 [Caenorhabditis sp. 5 AC-2008]
Length = 194
Score = 84.3 bits (207), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 42/115 (36%), Positives = 65/115 (56%), Gaps = 2/115 (1%)
Query: 61 RNASD-ENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN-CQAIA 118
R+ +D NGGEI D + + G I + P+ + YW+ + ++ I Y + +I
Sbjct: 28 RDVNDIANGGEITLCDTDSNHYSGSIAWEPLVSEDYWRIKLGAVSISGTTYTNGPMDSIV 87
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
DTGTSL+ GPS +I ++ IG +PL NG +V C + +PN+ LGG+NF L
Sbjct: 88 DTGTSLLTGPSDIIKKIQHKIGGIPLINGEYEVVCSKIPSLPNITFTLGGQNFDL 142
>gi|357132502|ref|XP_003567869.1| PREDICTED: phytepsin-like [Brachypodium distachyon]
Length = 505
Score = 84.3 bits (207), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 39/84 (46%), Positives = 60/84 (71%), Gaps = 3/84 (3%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCS-NCQAI 117
R+A + GGEI+FGG+D ++G+ TY PV++KGYWQF + + + + +C+ C AI
Sbjct: 225 RHAGEGEGGEIVFGGMDPKHYIGEHTYVPVTQKGYWQFDMGDVLVGGKSTGFCAGGCAAI 284
Query: 118 ADTGTSLIIGPSKVIAELNKLIGA 141
AD+GTSL+ GP+ +I E+N+ IGA
Sbjct: 285 ADSGTSLLAGPTAIITEINEKIGA 308
Score = 64.3 bits (155), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 30/58 (51%), Positives = 38/58 (65%), Gaps = 1/58 (1%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHS 58
+++ AQY+G I +GTPPQ+F VIFDTGSSNLW+PS C +IAC G S
Sbjct: 73 LQNYMNAQYFGEIGVGTPPQKFTVIFDTGSSNLWVPSAKC-YFSIACYFHSRYKAGQS 129
Score = 37.4 bits (85), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 15/43 (34%), Positives = 25/43 (58%)
Query: 131 VIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
++ +N+L +P G + VDC +L MP ++ +GGK F L
Sbjct: 402 ILNYINQLCDRLPSPMGESSVDCGSLASMPEIEFTIGGKKFAL 444
>gi|311621972|gb|ADP95542.1| ASP-3 [Caenorhabditis sp. 5 AC-2008]
Length = 222
Score = 84.3 bits (207), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 42/115 (36%), Positives = 65/115 (56%), Gaps = 2/115 (1%)
Query: 61 RNASD-ENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN-CQAIA 118
R+ +D NGGEI D + + G I + P+ + YW+ + ++ I Y + +I
Sbjct: 58 RDVNDIANGGEITLCDTDSNHYSGSIAWEPLVSEDYWRIKLGAVSISGTTYTNGPMDSIV 117
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
DTGTSL+ GPS +I ++ IG +PL NG +V C + +PN+ LGG+NF L
Sbjct: 118 DTGTSLLTGPSDIIKKIQHKIGGIPLINGEYEVVCSKIPSLPNITFTLGGQNFDL 172
>gi|311621954|gb|ADP95533.1| ASP-3 [Caenorhabditis sp. 5 AC-2008]
gi|311621956|gb|ADP95534.1| ASP-3 [Caenorhabditis sp. 5 AC-2008]
gi|311621958|gb|ADP95535.1| ASP-3 [Caenorhabditis sp. 5 AC-2008]
gi|311621960|gb|ADP95536.1| ASP-3 [Caenorhabditis sp. 5 AC-2008]
gi|311621964|gb|ADP95538.1| ASP-3 [Caenorhabditis sp. 5 AC-2008]
gi|311621966|gb|ADP95539.1| ASP-3 [Caenorhabditis sp. 5 AC-2008]
gi|311621970|gb|ADP95541.1| ASP-3 [Caenorhabditis sp. 5 AC-2008]
gi|311621978|gb|ADP95545.1| ASP-3 [Caenorhabditis sp. 5 AC-2008]
gi|311621990|gb|ADP95551.1| ASP-3 [Caenorhabditis sp. 5 AC-2008]
gi|311621994|gb|ADP95553.1| ASP-3 [Caenorhabditis sp. 5 AC-2008]
gi|311621996|gb|ADP95554.1| ASP-3 [Caenorhabditis sp. 5 AC-2008]
gi|375581434|gb|AFA55973.1| aspartyl protease, partial [Caenorhabditis sp. 5 AC-2008]
gi|375581436|gb|AFA55974.1| aspartyl protease, partial [Caenorhabditis sp. 5 AC-2008]
gi|375581444|gb|AFA55978.1| aspartyl protease, partial [Caenorhabditis sp. 5 AC-2008]
gi|375581446|gb|AFA55979.1| aspartyl protease, partial [Caenorhabditis sp. 5 AC-2008]
gi|375581448|gb|AFA55980.1| aspartyl protease, partial [Caenorhabditis sp. 5 AC-2008]
gi|375581450|gb|AFA55981.1| aspartyl protease, partial [Caenorhabditis sp. 5 AC-2008]
gi|375581452|gb|AFA55982.1| aspartyl protease, partial [Caenorhabditis sp. 5 AC-2008]
gi|375581454|gb|AFA55983.1| aspartyl protease, partial [Caenorhabditis sp. 5 AC-2008]
gi|375581460|gb|AFA55986.1| aspartyl protease, partial [Caenorhabditis sp. 5 AC-2008]
gi|375581462|gb|AFA55987.1| aspartyl protease, partial [Caenorhabditis sp. 5 AC-2008]
gi|375581464|gb|AFA55988.1| aspartyl protease, partial [Caenorhabditis sp. 5 AC-2008]
gi|375581466|gb|AFA55989.1| aspartyl protease, partial [Caenorhabditis sp. 5 AC-2008]
Length = 224
Score = 84.0 bits (206), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 42/115 (36%), Positives = 65/115 (56%), Gaps = 2/115 (1%)
Query: 61 RNASD-ENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN-CQAIA 118
R+ +D NGGEI D + + G I + P+ + YW+ + ++ I Y + +I
Sbjct: 58 RDVNDIANGGEITLCDTDSNHYSGSIAWEPLVSEDYWRIKLGAVSISGTTYTNGPMDSIV 117
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
DTGTSL+ GPS +I ++ IG +PL NG +V C + +PN+ LGG+NF L
Sbjct: 118 DTGTSLLTGPSDIIKKIQHKIGGIPLINGEYEVVCSKIPSLPNITFTLGGQNFDL 172
>gi|375581456|gb|AFA55984.1| aspartyl protease, partial [Caenorhabditis sp. 5 AC-2008]
Length = 223
Score = 84.0 bits (206), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 42/115 (36%), Positives = 65/115 (56%), Gaps = 2/115 (1%)
Query: 61 RNASD-ENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN-CQAIA 118
R+ +D NGGEI D + + G I + P+ + YW+ + ++ I Y + +I
Sbjct: 57 RDVNDIANGGEITLCDTDSNHYSGSIAWEPLVSEDYWRIKLGAVSISGTTYTNGPMDSIV 116
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
DTGTSL+ GPS +I ++ IG +PL NG +V C + +PN+ LGG+NF L
Sbjct: 117 DTGTSLLTGPSDIIKKIQHKIGGIPLINGEYEVVCSKIPSLPNITFTLGGQNFDL 171
>gi|281183192|ref|NP_001162218.1| gastricsin precursor [Papio anubis]
gi|157939796|gb|ABW05535.1| progastricsin (predicted) [Papio anubis]
Length = 388
Score = 84.0 bits (206), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 41/118 (34%), Positives = 68/118 (57%), Gaps = 3/118 (2%)
Query: 59 HFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCSN-CQ 115
+ + +GG ++FGGVD + G I ++PV+++ YWQ G+E I + + +CS CQ
Sbjct: 213 YLSDQQGSSGGAVVFGGVDSSLYTGQIYWAPVTQELYWQIGIEEFLIGGQASGWCSEGCQ 272
Query: 116 AIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
AI DTGTSL+ P + ++ L + GA G V+C+++ +P + I+ G F L
Sbjct: 273 AIVDTGTSLLTVPQQYLSALLQATGAQEDEYGQFLVNCNSIQNLPTLTFIINGVEFPL 330
Score = 58.9 bits (141), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 24/34 (70%), Positives = 29/34 (85%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHC 40
A Y+G IS+GTPPQ F V+FDTGSSNLW+PS +C
Sbjct: 71 AAYFGEISIGTPPQNFLVLFDTGSSNLWVPSVYC 104
>gi|255926586|gb|ACU40898.1| preprochymosin [Capra hircus]
Length = 381
Score = 84.0 bits (206), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 46/126 (36%), Positives = 64/126 (50%), Gaps = 4/126 (3%)
Query: 53 KFVGHSHFRNASDENG--GEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVY 110
+ V F D NG + G +D + G + + PV+ + YWQF V+S+ I V
Sbjct: 204 RLVAQDLFSVYMDRNGQGSMLTLGAIDPSYYTGSLHWVPVTLQKYWQFTVDSVTISGAVV 263
Query: 111 CSN--CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGG 168
CQAI DTGTS ++GPS I + + IGA G +DCD+L MP V + G
Sbjct: 264 ACEGGCQAILDTGTSKLVGPSSDILNIQQAIGATQNQYGEFDIDCDSLSSMPTVVFEING 323
Query: 169 KNFTLT 174
K + LT
Sbjct: 324 KMYPLT 329
Score = 59.3 bits (142), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 23/34 (67%), Positives = 29/34 (85%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHC 40
+QY+G I LGTPPQEF V+FDTGSS+ W+PS +C
Sbjct: 72 SQYFGKIYLGTPPQEFTVLFDTGSSDFWVPSIYC 105
>gi|38640718|gb|AAR25994.1| prochymosin [Capra hircus]
Length = 381
Score = 84.0 bits (206), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 46/126 (36%), Positives = 64/126 (50%), Gaps = 4/126 (3%)
Query: 53 KFVGHSHFRNASDENG--GEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVY 110
+ V F D NG + G +D + G + + PV+ + YWQF V+S+ I V
Sbjct: 204 RLVAQDLFSVYMDRNGQGSMLTLGAIDPSYYTGSLHWVPVTLQKYWQFTVDSVTISGAVV 263
Query: 111 CSN--CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGG 168
CQAI DTGTS ++GPS I + + IGA G +DCD+L MP V + G
Sbjct: 264 ACEGGCQAILDTGTSKLVGPSSDILNIQQAIGATQNQYGEFDIDCDSLSSMPTVVFEING 323
Query: 169 KNFTLT 174
K + LT
Sbjct: 324 KMYPLT 329
Score = 59.3 bits (142), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 23/34 (67%), Positives = 29/34 (85%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHC 40
+QY+G I LGTPPQEF V+FDTGSS+ W+PS +C
Sbjct: 72 SQYFGKIYLGTPPQEFTVLFDTGSSDFWVPSIYC 105
>gi|345568347|gb|EGX51242.1| hypothetical protein AOL_s00054g478 [Arthrobotrys oligospora ATCC
24927]
Length = 392
Score = 84.0 bits (206), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 45/115 (39%), Positives = 63/115 (54%), Gaps = 1/115 (0%)
Query: 60 FRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI-EKNVYCSNCQAIA 118
F +E+ E +FGG+DK + GDIT+ V RK YW+ +SI ++ + A+
Sbjct: 221 FYLGREEDESEAVFGGIDKSHYTGDITWVDVRRKAYWEVPFDSISFGDQTAELDSWGAVL 280
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
DTGTSLI PS LN IGA NG V C+ + +P++ LGG NFT+
Sbjct: 281 DTGTSLITLPSDYAEMLNSAIGATKGWNGQYSVPCEKVPDLPSLTFNLGGTNFTI 335
Score = 66.6 bits (161), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 35/84 (41%), Positives = 49/84 (58%), Gaps = 4/84 (4%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASD- 65
AQYY I+LGTPPQ FKV+ DTGSSNLW+PS+ CS + D ++ N ++
Sbjct: 80 AQYYSEITLGTPPQTFKVVLDTGSSNLWVPSKSCSSIACFLHTKYDSSESSTYKANGTEF 139
Query: 66 --ENGGEIMFGGVDKDKF-VGDIT 86
+ G M G + +D +GD+T
Sbjct: 140 SIQYGSGSMEGFISQDTLTIGDLT 163
>gi|311621982|gb|ADP95547.1| ASP-3 [Caenorhabditis sp. 5 AC-2008]
Length = 189
Score = 84.0 bits (206), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 42/115 (36%), Positives = 65/115 (56%), Gaps = 2/115 (1%)
Query: 61 RNASD-ENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN-CQAIA 118
R+ +D NGGEI D + + G I + P+ + YW+ + ++ I Y + +I
Sbjct: 23 RDVNDIANGGEITLCDTDSNHYSGSIAWEPLVSEDYWRIKLGAVSISGTTYTNGPMDSIV 82
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
DTGTSL+ GPS +I ++ IG +PL NG +V C + +PN+ LGG+NF L
Sbjct: 83 DTGTSLLTGPSDIIKKIQHKIGGIPLINGEYEVVCSKIPSLPNITFTLGGQNFDL 137
>gi|129797|sp|P03955.2|PEPC_MACFU RecName: Full=Gastricsin; AltName: Full=Pepsinogen C; Flags:
Precursor
gi|38073|emb|CAA42426.1| pepsinogen C [Macaca fuscata]
Length = 377
Score = 84.0 bits (206), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 41/118 (34%), Positives = 68/118 (57%), Gaps = 3/118 (2%)
Query: 59 HFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCSN-CQ 115
+ + +GG ++FGGVD + G I ++PV+++ YWQ G+E I + + +CS CQ
Sbjct: 202 YLSDQQGSSGGAVVFGGVDSSLYTGQIYWAPVTQELYWQIGIEEFLIGGQASGWCSEGCQ 261
Query: 116 AIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
AI DTGTSL+ P + ++ L + GA G V+C+++ +P + I+ G F L
Sbjct: 262 AIVDTGTSLLTVPQQYMSALLQATGAQEDEYGQFLVNCNSIQNLPTLTFIINGVEFPL 319
Score = 58.9 bits (141), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 24/34 (70%), Positives = 29/34 (85%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHC 40
A Y+G IS+GTPPQ F V+FDTGSSNLW+PS +C
Sbjct: 60 AAYFGEISIGTPPQNFLVLFDTGSSNLWVPSVYC 93
>gi|375581438|gb|AFA55975.1| aspartyl protease, partial [Caenorhabditis sp. 5 AC-2008]
Length = 216
Score = 84.0 bits (206), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 42/115 (36%), Positives = 65/115 (56%), Gaps = 2/115 (1%)
Query: 61 RNASD-ENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN-CQAIA 118
R+ +D NGGEI D + + G I + P+ + YW+ + ++ I Y + +I
Sbjct: 50 RDVNDIANGGEITLCDTDSNHYSGSIAWEPLVSEDYWRIKLGAVSISGTTYTNGPMDSIV 109
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
DTGTSL+ GPS +I ++ IG +PL NG +V C + +PN+ LGG+NF L
Sbjct: 110 DTGTSLLTGPSDIIKKIQHKIGGIPLINGEYEVVCSKIPSLPNITFTLGGQNFDL 164
>gi|301764903|ref|XP_002917936.1| PREDICTED: napsin-A-like [Ailuropoda melanoleuca]
Length = 406
Score = 84.0 bits (206), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 43/116 (37%), Positives = 63/116 (54%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAIA 118
R+ +GGE++ GG D +V +T+ PV+ YWQ +E + + + C+ C AI
Sbjct: 221 RDPEAADGGELVLGGSDPAHYVPPLTFLPVTIPAYWQIHMERVNVGTGLTLCAQGCAAIL 280
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSLI GP++ I L+ IG V L G + C + +P + LGG F LT
Sbjct: 281 DTGTSLITGPTEEIQALHAAIGGVSLLVGEYLIQCSKIPTLPPISFFLGGVWFNLT 336
Score = 68.2 bits (165), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 28/41 (68%), Positives = 32/41 (78%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQYYG I LGTPPQ F V+FDTGSSNLW+PS C L++ C
Sbjct: 74 AQYYGEIGLGTPPQNFSVVFDTGSSNLWVPSIRCHFLSLPC 114
>gi|70999520|ref|XP_754479.1| aspartic endopeptidase Pep2 [Aspergillus fumigatus Af293]
gi|74675969|sp|O42630.1|CARP_ASPFU RecName: Full=Vacuolar protease A; AltName: Full=Aspartic
endopeptidase pep2; AltName: Full=Aspartic protease
pep2; Flags: Precursor
gi|2664292|emb|CAA75754.1| cellular aspartic protease [Aspergillus fumigatus]
gi|4200293|emb|CAA10674.1| aspartic protease [Aspergillus fumigatus]
gi|66852116|gb|EAL92441.1| aspartic endopeptidase Pep2 [Aspergillus fumigatus Af293]
gi|159127496|gb|EDP52611.1| aspartic endopeptidase Pep2 [Aspergillus fumigatus A1163]
Length = 398
Score = 84.0 bits (206), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 44/105 (41%), Positives = 60/105 (57%), Gaps = 1/105 (0%)
Query: 70 EIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVY-CSNCQAIADTGTSLIIGP 128
E FGGVDK+ + G++T P+ RK YW+ ++I + NV N I DTGTSLI P
Sbjct: 237 EASFGGVDKNHYTGELTKIPLRRKAYWEVDFDAIALGDNVAELENTGIILDTGTSLIALP 296
Query: 129 SKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
S + LNK IGA G ++CD D +P++ L G NFT+
Sbjct: 297 STLADLLNKEIGAKKGFTGQYSIECDKRDSLPDLTFTLAGHNFTI 341
Score = 64.3 bits (155), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 28/41 (68%), Positives = 33/41 (80%), Gaps = 2/41 (4%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQY+ ISLGTPPQ+FKV+ DTGSSNLW+P CS +IAC
Sbjct: 83 AQYFSEISLGTPPQKFKVVLDTGSSNLWVPGSDCS--SIAC 121
>gi|403261257|ref|XP_003923041.1| PREDICTED: gastricsin [Saimiri boliviensis boliviensis]
Length = 388
Score = 84.0 bits (206), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 41/118 (34%), Positives = 67/118 (56%), Gaps = 3/118 (2%)
Query: 59 HFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCSN-CQ 115
+ N +GG ++FGGVD + G I ++PV+++ YWQ G+E I + + +CS CQ
Sbjct: 213 YLSNQQGSSGGAVVFGGVDSSLYTGQIYWAPVTQELYWQIGIEEFLIGGQASGWCSEGCQ 272
Query: 116 AIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
AI DTGTSL+ P + ++ + GA G V+C+++ +P + I+ G F L
Sbjct: 273 AIVDTGTSLLTVPQQYMSAFLEATGAQEDEYGQFLVNCNSIQNLPTLTFIINGVEFPL 330
Score = 59.3 bits (142), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 24/34 (70%), Positives = 29/34 (85%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHC 40
A Y+G IS+GTPPQ F V+FDTGSSNLW+PS +C
Sbjct: 71 AAYFGEISIGTPPQNFLVLFDTGSSNLWVPSVYC 104
>gi|301784222|ref|XP_002927531.1| PREDICTED: pepsin B-like [Ailuropoda melanoleuca]
Length = 390
Score = 84.0 bits (206), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 39/109 (35%), Positives = 60/109 (55%), Gaps = 3/109 (2%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV--YCSN-CQAI 117
R + E GGE++ GGVD + G+I ++PV+R+ YWQ ++ + CS CQAI
Sbjct: 217 RQPTYEYGGELILGGVDSQFYSGEIVWTPVTREMYWQIAIDEFLVSNQATGLCSQGCQAI 276
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
DTGT ++ P + I + GA NG VDCD++ MP + ++
Sbjct: 277 VDTGTYMLAVPQQFIGSFLQTTGAQEAQNGDFVVDCDSIQSMPTITFVI 325
Score = 58.5 bits (140), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 23/32 (71%), Positives = 28/32 (87%)
Query: 9 YYGTISLGTPPQEFKVIFDTGSSNLWIPSQHC 40
Y+G IS+GTPPQ F V+FDTGSSNLW+PS +C
Sbjct: 74 YFGEISIGTPPQNFLVLFDTGSSNLWVPSTYC 105
>gi|158257160|dbj|BAF84553.1| unnamed protein product [Homo sapiens]
Length = 388
Score = 84.0 bits (206), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 43/125 (34%), Positives = 72/125 (57%), Gaps = 5/125 (4%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-K 107
V +D F S + +A D++G ++FGG+D + G + + PV+ +GYWQ V+SI + +
Sbjct: 208 VSQDLF---SVYLSADDQSGSVVIFGGIDSSYYTGSLNWVPVTVEGYWQITVDSITMNGE 264
Query: 108 NVYCSN-CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
+ C+ CQAI DTGTSL+ GP+ I + IGA ++G V C + +P++ +
Sbjct: 265 AIACAEGCQAIVDTGTSLLTGPTSPITNIQSDIGASENSDGDMVVSCSAISSLPDIVFTI 324
Query: 167 GGKNF 171
G +
Sbjct: 325 NGVQY 329
Score = 60.5 bits (145), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 25/41 (60%), Positives = 35/41 (85%), Gaps = 2/41 (4%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACS 48
+Y+GTI +GTP Q+F V+FDTGSSNLW+PS +CS ++AC+
Sbjct: 75 EYFGTIGIGTPAQDFTVVFDTGSSNLWVPSVYCS--SLACT 113
>gi|311621992|gb|ADP95552.1| ASP-3 [Caenorhabditis sp. 5 AC-2008]
Length = 207
Score = 84.0 bits (206), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 42/115 (36%), Positives = 65/115 (56%), Gaps = 2/115 (1%)
Query: 61 RNASD-ENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN-CQAIA 118
R+ +D NGGEI D + + G I + P+ + YW+ + ++ I Y + +I
Sbjct: 52 RDVNDIANGGEITLCDTDSNHYSGSIAWEPLVSEDYWRIKLGAVSISGTTYTNGPMDSIV 111
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
DTGTSL+ GPS +I ++ IG +PL NG +V C + +PN+ LGG+NF L
Sbjct: 112 DTGTSLLTGPSDIIKKIQHKIGGIPLINGEYEVVCSKIPSLPNITFTLGGQNFDL 166
>gi|311621974|gb|ADP95543.1| ASP-3 [Caenorhabditis sp. 5 AC-2008]
Length = 217
Score = 84.0 bits (206), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 42/115 (36%), Positives = 65/115 (56%), Gaps = 2/115 (1%)
Query: 61 RNASD-ENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN-CQAIA 118
R+ +D NGGEI D + + G I + P+ + YW+ + ++ I Y + +I
Sbjct: 51 RDVNDIANGGEITLCDTDSNHYSGSIAWEPLVSEDYWRIKLGAVSISGTTYTNGPMDSIV 110
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
DTGTSL+ GPS +I ++ IG +PL NG +V C + +PN+ LGG+NF L
Sbjct: 111 DTGTSLLTGPSDIIKKIQHKIGGIPLINGEYEVVCSKIPSLPNITFTLGGQNFDL 165
>gi|413934460|gb|AFW69011.1| hypothetical protein ZEAMMB73_821214 [Zea mays]
Length = 324
Score = 84.0 bits (206), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 39/85 (45%), Positives = 59/85 (69%), Gaps = 3/85 (3%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE--KNVYCS-NCQAI 117
R+A + GGEI+FGG+D + GD T+ PV+RKGYWQF + + ++ +C+ C A+
Sbjct: 70 RHADEGEGGEIVFGGMDSSHYKGDHTFVPVTRKGYWQFNMGDVLVDGKSTGFCAGGCAAM 129
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAV 142
AD+GTSL+ GP+ +I E+N+ IGA
Sbjct: 130 ADSGTSLLAGPTAIITEINEKIGAA 154
>gi|375581458|gb|AFA55985.1| aspartyl protease, partial [Caenorhabditis sp. 5 AC-2008]
Length = 208
Score = 84.0 bits (206), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 42/115 (36%), Positives = 65/115 (56%), Gaps = 2/115 (1%)
Query: 61 RNASD-ENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN-CQAIA 118
R+ +D NGGEI D + + G I + P+ + YW+ + ++ I Y + +I
Sbjct: 52 RDVNDIANGGEITLCDTDSNHYSGSIAWEPLVSEDYWRIKLGAVSISGTTYTNGPMDSIV 111
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
DTGTSL+ GPS +I ++ IG +PL NG +V C + +PN+ LGG+NF L
Sbjct: 112 DTGTSLLTGPSDIIKKIQHKIGGIPLINGEYEVVCSKIPSLPNITFTLGGQNFDL 166
>gi|290491167|ref|NP_001166470.1| gastricsin preproprotein [Cavia porcellus]
gi|3024367|sp|Q64411.1|PEPC_CAVPO RecName: Full=Gastricsin; AltName: Full=Pepsinogen C; Flags:
Precursor
gi|191297|gb|AAA37053.1| progastricsin [Cavia porcellus]
Length = 394
Score = 84.0 bits (206), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 43/117 (36%), Positives = 69/117 (58%), Gaps = 5/117 (4%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV--YCSN-CQAI 117
+ SDE G+++ GGVD+ + GDI ++PV+++ YWQ G+E I+ + +CS CQ I
Sbjct: 223 QQGSDE--GQLILGGVDESLYTGDIYWTPVTQELYWQIGIEGFLIDGSASGWCSRGCQGI 280
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSL+ PS ++ L + IGA G V C ++ +P + ++ G F L+
Sbjct: 281 VDTGTSLLTVPSDYLSTLVQAIGAEENEYGEYFVSCSSIQDLPTLTFVISGVEFPLS 337
Score = 65.5 bits (158), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 28/42 (66%), Positives = 36/42 (85%), Gaps = 2/42 (4%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACS 48
A Y+G ISLGTPPQ F+V+FDTGSSNLW+PS +CS ++AC+
Sbjct: 77 AAYFGQISLGTPPQSFQVLFDTGSSNLWVPSVYCS--SLACT 116
>gi|355561685|gb|EHH18317.1| hypothetical protein EGK_14890 [Macaca mulatta]
gi|355748551|gb|EHH53034.1| hypothetical protein EGM_13592 [Macaca fascicularis]
Length = 388
Score = 84.0 bits (206), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 41/118 (34%), Positives = 68/118 (57%), Gaps = 3/118 (2%)
Query: 59 HFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCSN-CQ 115
+ + +GG ++FGGVD + G I ++PV+++ YWQ G+E I + + +CS CQ
Sbjct: 213 YLSDQQGSSGGAVVFGGVDSSLYTGQIYWAPVTQELYWQIGIEEFLIGGQASGWCSEGCQ 272
Query: 116 AIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
AI DTGTSL+ P + ++ L + GA G V+C+++ +P + I+ G F L
Sbjct: 273 AIVDTGTSLLTVPQQYMSALLQATGAQEDEYGQFLVNCNSIQNLPTLTFIINGVEFPL 330
Score = 58.9 bits (141), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 24/34 (70%), Positives = 29/34 (85%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHC 40
A Y+G IS+GTPPQ F V+FDTGSSNLW+PS +C
Sbjct: 71 AAYFGEISIGTPPQNFLVLFDTGSSNLWVPSVYC 104
>gi|350733|prf||0803215A rennin,pro
Length = 375
Score = 84.0 bits (206), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 48/125 (38%), Positives = 63/125 (50%), Gaps = 4/125 (3%)
Query: 54 FVGHSHFRNASDENGGEIMF--GGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYC 111
V F D NG E M G +D + G + + PV+ + YWQF V+S+ I V
Sbjct: 199 LVAQDLFSVYMDRNGQESMLTLGAIDPSYYTGSLHWVPVTVQQYWQFTVDSVTISGVVVA 258
Query: 112 SN--CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGK 169
CQAI DTGTS ++GPS I + + IGA +DCDNL MP V + GK
Sbjct: 259 CEGGCQAILDTGTSKLVGPSSDILNIQQAIGATQNQYDEFDIDCDNLSYMPTVVFEINGK 318
Query: 170 NFTLT 174
+ LT
Sbjct: 319 MYPLT 323
Score = 56.6 bits (135), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 22/34 (64%), Positives = 28/34 (82%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHC 40
+QY+G I LG PPQEF V+FDTGSS+ W+PS +C
Sbjct: 66 SQYFGKIYLGYPPQEFTVLFDTGSSDFWVPSIYC 99
>gi|73621390|sp|Q9GMY3.1|PEPC_RHIFE RecName: Full=Gastricsin; AltName: Full=Pepsinogen C; Flags:
Precursor
gi|9798666|dbj|BAB11755.1| pepsinogen C [Rhinolophus ferrumequinum]
Length = 389
Score = 84.0 bits (206), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 42/112 (37%), Positives = 65/112 (58%), Gaps = 3/112 (2%)
Query: 65 DENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCSN-CQAIADTG 121
+NGG ++FGGVD + G I ++PV+++ YWQ G+E I + + +CS CQAI DTG
Sbjct: 220 SQNGGAVIFGGVDNSLYQGQIYWAPVTQELYWQIGIEEFLIGGQASGWCSQGCQAIVDTG 279
Query: 122 TSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
TSL+ P + ++ L + GA G V+C+ + +P I+ G F L
Sbjct: 280 TSLLTVPQQYMSALLQATGAQEDQYGQFFVNCNYIQNLPTFTFIINGVQFPL 331
Score = 60.1 bits (144), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 29/54 (53%), Positives = 36/54 (66%), Gaps = 9/54 (16%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHF 60
A Y+G IS+GTPPQ F V+FDTGSSNLW+PS +C AC+ GH+ F
Sbjct: 71 AAYFGEISIGTPPQNFLVLFDTGSSNLWVPSVYCQ--TQACT-------GHTRF 115
>gi|198471181|ref|XP_002133680.1| GA22674 [Drosophila pseudoobscura pseudoobscura]
gi|198145803|gb|EDY72307.1| GA22674 [Drosophila pseudoobscura pseudoobscura]
Length = 423
Score = 84.0 bits (206), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 36/106 (33%), Positives = 60/106 (56%), Gaps = 8/106 (7%)
Query: 68 GGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSNCQAIADTGTSLIIG 127
GGE++ GG D + G + Y+PVS + YWQF ++ + ++ C+ CQA+ADTGTSL+I
Sbjct: 243 GGELILGGSDASLYAGKLVYAPVSNQNYWQFQMDGVTLDGTTLCTYCQAVADTGTSLLIA 302
Query: 128 PSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
P ++ L+ ++DC +L +P + I + G F +
Sbjct: 303 PYPTYVKIVSLVN--------QEIDCASLSYLPTLTITISGVPFQI 340
Score = 59.7 bits (143), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 26/37 (70%), Positives = 30/37 (81%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILN 44
QYYG IS+GTPPQ F V FDTGS+NLWIPS +C L+
Sbjct: 90 QYYGEISIGTPPQNFMVQFDTGSANLWIPSVNCYSLD 126
>gi|281348334|gb|EFB23918.1| hypothetical protein PANDA_006240 [Ailuropoda melanoleuca]
Length = 379
Score = 84.0 bits (206), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 43/116 (37%), Positives = 63/116 (54%), Gaps = 2/116 (1%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAIA 118
R+ +GGE++ GG D +V +T+ PV+ YWQ +E + + + C+ C AI
Sbjct: 196 RDPEAADGGELVLGGSDPAHYVPPLTFLPVTIPAYWQIHMERVNVGTGLTLCAQGCAAIL 255
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSLI GP++ I L+ IG V L G + C + +P + LGG F LT
Sbjct: 256 DTGTSLITGPTEEIQALHAAIGGVSLLVGEYLIQCSKIPTLPPISFFLGGVWFNLT 311
Score = 68.2 bits (165), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 28/41 (68%), Positives = 32/41 (78%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQYYG I LGTPPQ F V+FDTGSSNLW+PS C L++ C
Sbjct: 49 AQYYGEIGLGTPPQNFSVVFDTGSSNLWVPSIRCHFLSLPC 89
>gi|380865655|gb|AFF19538.1| pepsin F, partial [Camelus dromedarius]
Length = 354
Score = 83.6 bits (205), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 38/116 (32%), Positives = 68/116 (58%), Gaps = 2/116 (1%)
Query: 58 SHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVY-CSN-CQ 115
+ + + ENG +MFGG+D + G++ + PVS++ YWQ ++SI + + C CQ
Sbjct: 206 AFYLSTKKENGSVVMFGGLDHSYYKGELKWVPVSQRLYWQISMDSITMNGKILGCKGGCQ 265
Query: 116 AIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNF 171
AI DTGT++++GP+ V+ + K I A PL + CD ++ + ++ + G N+
Sbjct: 266 AIVDTGTAVLVGPTNVVTNIQKAINARPLTGYEYFIGCDAINTLSDIIFTINGVNY 321
Score = 57.0 bits (136), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 23/40 (57%), Positives = 30/40 (75%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHC 40
+R+ Y IS+GTPPQ FKV+FDTGS+NLW+PS +C
Sbjct: 60 LRNYLDLVYIADISIGTPPQNFKVVFDTGSANLWVPSIYC 99
>gi|20800441|gb|AAB03843.2| aspartic proteinase [Vigna unguiculata]
gi|33339734|gb|AAQ14346.1| aspartic proteinase [Vigna unguiculata]
Length = 513
Score = 83.6 bits (205), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 41/84 (48%), Positives = 56/84 (66%), Gaps = 3/84 (3%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCS-NCQAI 117
R +E GGE++FGGVD + G+ TY PV+RKGYWQF + + I + YC+ C AI
Sbjct: 233 RKTEEEEGGELVFGGVDPAHYKGEHTYVPVTRKGYWQFDMGDVLIGGKPTGYCAGGCAAI 292
Query: 118 ADTGTSLIIGPSKVIAELNKLIGA 141
AD+GTSL+ GP+ +I +N IGA
Sbjct: 293 ADSGTSLLAGPTAIITMINHAIGA 316
Score = 65.1 bits (157), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 28/47 (59%), Positives = 37/47 (78%), Gaps = 1/47 (2%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+++ AQYYG IS+GT PQ+F VIFDTGSSNLW+PS C+ ++AC
Sbjct: 81 LKNYLDAQYYGEISIGTSPQKFTVIFDTGSSNLWVPSSRCT-FSLAC 126
>gi|296417651|ref|XP_002838466.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295634405|emb|CAZ82657.1| unnamed protein product [Tuber melanosporum]
Length = 396
Score = 83.6 bits (205), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 46/116 (39%), Positives = 63/116 (54%), Gaps = 1/116 (0%)
Query: 60 FRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKN-VYCSNCQAIA 118
F ++ E +FGG+DK + G + PV RK YW+ E+I K+ N AI
Sbjct: 225 FYLGDKDDQSEAIFGGIDKAHYQGKLIKLPVRRKAYWEVEFEAITFGKSTAQFENTGAIV 284
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSLI PS + LNK IGA NG V+C+ D +P++ L G +FT+T
Sbjct: 285 DTGTSLIALPSTLAELLNKEIGAKKGFNGQYSVECEKRDSLPDLTFTLTGHDFTIT 340
Score = 61.6 bits (148), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 26/41 (63%), Positives = 32/41 (78%), Gaps = 2/41 (4%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQY+ I++GTPPQ FKV+ DTGSSNLW+PS C +IAC
Sbjct: 84 AQYFSEITIGTPPQTFKVVLDTGSSNLWVPSSQCG--SIAC 122
>gi|426198518|gb|EKV48444.1| hypothetical protein AGABI2DRAFT_192052 [Agaricus bisporus var.
bisporus H97]
Length = 413
Score = 83.6 bits (205), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 44/116 (37%), Positives = 65/116 (56%), Gaps = 2/116 (1%)
Query: 60 FR-NASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI-EKNVYCSNCQAI 117
FR +SDE+GGE +FGG+D+ + G + Y P+ +K YW+ ++ I + + + N A
Sbjct: 241 FRLGSSDEDGGEAVFGGIDESAYKGKMHYVPIRQKAYWEVQLDKISLGGEELELENTGAA 300
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
DTGTSLI PS + LN IGA NG +DC + +P + GG+ F L
Sbjct: 301 IDTGTSLIALPSDMAEMLNTQIGAKKSWNGQYTIDCAKVASLPELTFHFGGRAFPL 356
Score = 62.8 bits (151), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 37/86 (43%), Positives = 50/86 (58%), Gaps = 8/86 (9%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASD- 65
AQY+ I +G+PPQ FKVI DTGSSNLW+PS C+ +IAC + G S A+
Sbjct: 100 AQYFTEIQIGSPPQTFKVILDTGSSNLWVPSVKCT--SIACFLHTKYDSGQSSTYKANGS 157
Query: 66 ----ENGGEIMFGGVDKDKF-VGDIT 86
+ G M G V +D+ +GD+T
Sbjct: 158 TFEIQYGSGAMEGFVSQDQLQIGDLT 183
>gi|2687645|gb|AAB88862.1| cathepsin D [Sparus aurata]
Length = 399
Score = 83.6 bits (205), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 44/117 (37%), Positives = 68/117 (58%), Gaps = 3/117 (2%)
Query: 61 RNASDENGGEIMFGGVDKDKF-VGDITYSPVSRKGYWQFGVESIKI--EKNVYCSNCQAI 117
R+ GGE+ GG D +GD+ Y V+RK YW G++ +++ + ++ + C+AI
Sbjct: 223 RDPKAAVGGELTLGGTDPHVLTLGDLHYVNVTRKAYWHIGMDGLQVGNQLSLCKAGCEAI 282
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSLI+GP + + L+K IGA+PL +G +DC + LGG+ F LT
Sbjct: 283 VDTGTSLIVGPVEEVRALHKAIGALPLIDGEYGLDCSGSHRCLLSLSTLGGRMFNLT 339
Score = 68.2 bits (165), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 30/42 (71%), Positives = 36/42 (85%), Gaps = 1/42 (2%)
Query: 7 AQYYGTISLGTP-PQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQYYG IS+GTP ++F V+FDTGSSNLW+PS HCS L+IAC
Sbjct: 75 AQYYGVISIGTPVHRDFTVLFDTGSSNLWVPSIHCSFLDIAC 116
>gi|409079719|gb|EKM80080.1| hypothetical protein AGABI1DRAFT_113304 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 413
Score = 83.6 bits (205), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 44/116 (37%), Positives = 65/116 (56%), Gaps = 2/116 (1%)
Query: 60 FR-NASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI-EKNVYCSNCQAI 117
FR +SDE+GGE +FGG+D+ + G + Y P+ +K YW+ ++ I + + + N A
Sbjct: 241 FRLGSSDEDGGEAVFGGIDESAYKGKMHYVPIRQKAYWEVQLDKISLGGEELELENTGAA 300
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
DTGTSLI PS + LN IGA NG +DC + +P + GG+ F L
Sbjct: 301 IDTGTSLIALPSDMAEMLNTQIGAKKSWNGQYTIDCAKVASLPELTFHFGGRAFPL 356
Score = 62.4 bits (150), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 37/86 (43%), Positives = 50/86 (58%), Gaps = 8/86 (9%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASD- 65
AQY+ I +G+PPQ FKVI DTGSSNLW+PS C+ +IAC + G S A+
Sbjct: 100 AQYFTEIQIGSPPQTFKVILDTGSSNLWVPSVKCT--SIACFLHTKYDSGQSSTYKANGS 157
Query: 66 ----ENGGEIMFGGVDKDKF-VGDIT 86
+ G M G V +D+ +GD+T
Sbjct: 158 TFEIQYGSGAMEGFVSQDQLQIGDLT 183
>gi|361128953|gb|EHL00878.1| putative Vacuolar protease A [Glarea lozoyensis 74030]
Length = 399
Score = 83.6 bits (205), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 54/150 (36%), Positives = 81/150 (54%), Gaps = 5/150 (3%)
Query: 26 FDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDI 85
FDT S N +P + S++N +D+ F + N +E+ E FGGV++D + G +
Sbjct: 197 FDTISVNKIVPPFY-SMINQGL-LDEPVFAFYLGDTNNGEES--EATFGGVNEDHYTGKM 252
Query: 86 TYSPVSRKGYWQFGVESIKI-EKNVYCSNCQAIADTGTSLIIGPSKVIAELNKLIGAVPL 144
T P+ RK YW+ +++I + N I DTGTSLI PS + LNK +GA
Sbjct: 253 TTIPLRRKAYWEVDLDAITFGDATAELENTGVILDTGTSLIALPSTLAELLNKEMGAKKG 312
Query: 145 ANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
NG V+C+ D +P++ L G NFT+T
Sbjct: 313 YNGQYTVECEKRDSLPDMSFNLSGYNFTIT 342
Score = 63.2 bits (152), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 33/82 (40%), Positives = 44/82 (53%), Gaps = 3/82 (3%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASDE 66
AQY+ I++GTPPQ FKV+ DTGSSNLW+PS C + D ++ +N SD
Sbjct: 84 AQYFSEITIGTPPQSFKVVLDTGSSNLWVPSSECGSIACYLHTKYDSSSSSTYKKNGSDF 143
Query: 67 NGGEIMFGGVDKDKFVGDITYS 88
EI +G FV T +
Sbjct: 144 ---EIRYGSGSLSGFVSQDTMT 162
>gi|457097|gb|AAA30449.1| chymosin C [Bos taurus]
Length = 242
Score = 83.6 bits (205), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 48/126 (38%), Positives = 63/126 (50%), Gaps = 4/126 (3%)
Query: 53 KFVGHSHFRNASDENGGEIMF--GGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVY 110
V F D NG E M G +D + G + + PV+ + YWQF V+S+ I V
Sbjct: 65 HLVAQDLFSVYMDRNGQESMLTLGAIDPSYYTGSLHWVPVTVQQYWQFTVDSVTISGVVV 124
Query: 111 CSN--CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGG 168
CQAI DTGTS ++GPS I + + IGA +DCDNL MP V + G
Sbjct: 125 ACEGGCQAILDTGTSKLVGPSSDILNIQQAIGATQNQYDEFDIDCDNLSYMPTVVFEING 184
Query: 169 KNFTLT 174
K + LT
Sbjct: 185 KIYPLT 190
>gi|448113357|ref|XP_004202330.1| Piso0_001822 [Millerozyma farinosa CBS 7064]
gi|359465319|emb|CCE89024.1| Piso0_001822 [Millerozyma farinosa CBS 7064]
Length = 414
Score = 83.6 bits (205), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 52/152 (34%), Positives = 81/152 (53%), Gaps = 3/152 (1%)
Query: 24 VIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVG 83
+ +DT S N +P + +I N +D+ KF + ++ + +GG FGGVD + G
Sbjct: 209 LAYDTISVNNIVPPIYNAI-NQGL-LDEPKFAFYLGDKDKDENDGGVATFGGVDTKHYKG 266
Query: 84 DITYSPVSRKGYWQFGVESIKI-EKNVYCSNCQAIADTGTSLIIGPSKVIAELNKLIGAV 142
DI P+ RK YW+ + I + ++ ++ A DTGTSLI PS + +N IGA
Sbjct: 267 DIVELPIRRKAYWEVSFDGIGLGDEYAELTSTGAAIDTGTSLITLPSSLAEIINAKIGAK 326
Query: 143 PLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
+G VDCD+ D +P + + G NFTL+
Sbjct: 327 KSWSGQYSVDCDSRDSLPELTMTFHGHNFTLS 358
Score = 63.9 bits (154), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 31/59 (52%), Positives = 37/59 (62%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASD 65
AQYY TI LG+P QEFKV+ DTGSSNLW+PS CS L D S+ +N S+
Sbjct: 98 AQYYTTIGLGSPVQEFKVVLDTGSSNLWVPSTDCSSLACFLHTKYDHSESSSYKQNGSE 156
>gi|11120702|ref|NP_068521.1| pepsinogen 5, group I precursor [Rattus norvegicus]
gi|392344905|ref|XP_003749106.1| PREDICTED: pepsin F-like [Rattus norvegicus]
gi|7105998|emb|CAB75982.1| pepsinogen F protein [Rattus norvegicus]
gi|149062409|gb|EDM12832.1| pepsinogen 5, group I (pepsinogen A) [Rattus norvegicus]
Length = 387
Score = 83.6 bits (205), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 52/166 (31%), Positives = 81/166 (48%), Gaps = 22/166 (13%)
Query: 2 RSSYQAQYYGTISLGTP---PQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHS 58
R A + G + LG P Q +FD NLWI + ++ F +
Sbjct: 170 RFMEHAVFDGILGLGYPNLGLQGVTPVFD----NLWIQGL----------IPQNLF---A 212
Query: 59 HFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN--CQA 116
+ ++ DE G +M GGVD + G++ + PVS+ YWQ V+SI + + + CQ
Sbjct: 213 FYLSSKDEKGSVLMLGGVDPSYYHGELHWVPVSKPSYWQLAVDSISMNGEIIACDGGCQG 272
Query: 117 IADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNV 162
I DTGTSL+ GP I + LIGA +G + CD ++ +P++
Sbjct: 273 IMDTGTSLVTGPRSSILNIQNLIGAKASGDGEYFLKCDTINTLPDI 318
Score = 63.5 bits (153), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 30/51 (58%), Positives = 38/51 (74%), Gaps = 2/51 (3%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDK 51
MR+ Y GTIS+GTPPQEFKV+ DTGSS+LW+PS +CS + AC+ K
Sbjct: 66 MRNYLDLVYIGTISIGTPPQEFKVVLDTGSSDLWVPSIYCS--SPACAHHK 114
>gi|414871124|tpg|DAA49681.1| TPA: hypothetical protein ZEAMMB73_239621 [Zea mays]
Length = 299
Score = 83.2 bits (204), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 41/93 (44%), Positives = 61/93 (65%), Gaps = 3/93 (3%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE--KNVYCS-NCQAI 117
R+A + GGEI+FGG+D + GD T+ PV+RKGYWQF + + ++ +C+ C AI
Sbjct: 18 RHADEGEGGEIVFGGMDSSHYKGDHTFVPVTRKGYWQFNMGDVLVDGKSTGFCAGGCAAI 77
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAK 150
AD+GTSL+ GP +I E+N+ IGA + + K
Sbjct: 78 ADSGTSLLAGPIAIITEINEKIGAAGVVSQECK 110
Score = 35.4 bits (80), Expect = 8.3, Method: Compositional matrix adjust.
Identities = 15/43 (34%), Positives = 25/43 (58%)
Query: 131 VIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
++ +N+L +P G + VDC +L MP++ +GGK F L
Sbjct: 196 ILNYINQLCERLPSPMGESAVDCGSLASMPDIAFTIGGKKFKL 238
>gi|261194088|ref|XP_002623449.1| aspartyl proteinase [Ajellomyces dermatitidis SLH14081]
gi|239588463|gb|EEQ71106.1| aspartyl proteinase [Ajellomyces dermatitidis SLH14081]
gi|239606974|gb|EEQ83961.1| aspartyl proteinase [Ajellomyces dermatitidis ER-3]
gi|327354563|gb|EGE83420.1| aspartyl proteinase [Ajellomyces dermatitidis ATCC 18188]
Length = 398
Score = 83.2 bits (204), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 42/110 (38%), Positives = 62/110 (56%), Gaps = 1/110 (0%)
Query: 65 DENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKN-VYCSNCQAIADTGTS 123
+++ E +FGG++KD + G++ P+ RK YW+ +++I K N I DTGTS
Sbjct: 232 EDDDSEAVFGGINKDHYTGELVMIPLRRKAYWEVDLDAITFGKETAQLENTGVILDTGTS 291
Query: 124 LIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
LI PS + LNK IGA NG +DC D +P++ L G NFT+
Sbjct: 292 LIALPSTLAELLNKEIGAKKSFNGQYTIDCTKRDGLPDLTFTLTGHNFTI 341
Score = 64.3 bits (155), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 37/86 (43%), Positives = 51/86 (59%), Gaps = 8/86 (9%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDK--DKFVGHSHFRNAS 64
AQYY I++GTPPQ FKV+ DTGSSNLW+PS C +IAC + D ++ +N S
Sbjct: 83 AQYYSEITIGTPPQTFKVVLDTGSSNLWVPSSECG--SIACYLHNKYDSSTSSTYQKNGS 140
Query: 65 D---ENGGEIMFGGVDKDKF-VGDIT 86
+ G + G V +D +GD+T
Sbjct: 141 EFAIRYGSGSLSGFVSQDTVRIGDLT 166
>gi|443286990|dbj|BAM76489.1| pepsinogen, partial [Gadus macrocephalus]
Length = 326
Score = 83.2 bits (204), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 48/127 (37%), Positives = 70/127 (55%), Gaps = 6/127 (4%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-K 107
V +D F S + + + G + FGGVD + G IT+ P+S + YWQ V S+ + +
Sbjct: 151 VTQDMF---SVYLSKNSAQGSVVTFGGVDPHHYSGSITWIPLSSELYWQITVHSVSVNGR 207
Query: 108 NVYC-SNCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIIL 166
V C CQAI DTGTSLI+GP I+ +N +GA N V+C+ + +MP V +
Sbjct: 208 PVACVGGCQAIVDTGTSLIVGPQSDISSINSAVGATSSGN-DFVVNCNGIQQMPAVVFHI 266
Query: 167 GGKNFTL 173
G F+L
Sbjct: 267 NGHKFSL 273
Score = 63.9 bits (154), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 29/46 (63%), Positives = 35/46 (76%), Gaps = 3/46 (6%)
Query: 9 YYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKF 54
Y+G IS+GTPPQ FKVIFD+GSSNLW+PS +C N A + DKF
Sbjct: 20 YFGVISIGTPPQSFKVIFDSGSSNLWVPSVYC---NSAACNNHDKF 62
>gi|49019533|emb|CAD80098.1| gastricsin [Trematomus bernacchii]
Length = 386
Score = 83.2 bits (204), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 42/119 (35%), Positives = 65/119 (54%), Gaps = 3/119 (2%)
Query: 58 SHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCSN-C 114
S + + + G + FGGVD + G I ++PV+ + YWQ GV+ +I + +CS C
Sbjct: 206 SFYLSRGGQQGSVLSFGGVDTSLYRGQIYWTPVTSETYWQIGVQGFEINGRETGWCSQGC 265
Query: 115 QAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
Q+I DTGTS + PS I ++ + IGA NG VDC L+ +P + ++ G L
Sbjct: 266 QSIVDTGTSTLTAPSHFIGDIMQAIGAQRSQNGMYMVDCSELNNLPTLSFVISGVALPL 324
Score = 57.4 bits (137), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 23/33 (69%), Positives = 29/33 (87%)
Query: 9 YYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCS 41
YYG IS+GTPPQ F+V+FDTGS+NLW+ S +CS
Sbjct: 68 YYGAISIGTPPQSFQVLFDTGSANLWVDSIYCS 100
>gi|452981069|gb|EME80829.1| hypothetical protein MYCFIDRAFT_89289 [Pseudocercospora fijiensis
CIRAD86]
Length = 396
Score = 83.2 bits (204), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 47/126 (37%), Positives = 69/126 (54%), Gaps = 3/126 (2%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI-EK 107
+D+ KF + + DE+ E +FGGV++D + G + P+ RK YW+ +++I ++
Sbjct: 216 LDEKKFAFYLSSTDKGDES--EAIFGGVNEDHYTGKMINIPLRRKAYWEVDLDAITFGDQ 273
Query: 108 NVYCSNCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILG 167
AI DTGTSLI PS + LNK IGA NG VDC D +P++ L
Sbjct: 274 TAEIDATGAILDTGTSLIALPSTLAELLNKEIGAKKSYNGQYTVDCSKRDSLPDLTFTLT 333
Query: 168 GKNFTL 173
G NFT+
Sbjct: 334 GHNFTI 339
Score = 66.2 bits (160), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 33/61 (54%), Positives = 42/61 (68%), Gaps = 4/61 (6%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHS--HFRNAS 64
AQY+ I++GTPPQEFKV+ DTGSSNLW+PSQ C +IAC + G S + +N S
Sbjct: 82 AQYFSQIAIGTPPQEFKVVLDTGSSNLWVPSQDCG--SIACYLHSKYDHGESTTYKQNGS 139
Query: 65 D 65
D
Sbjct: 140 D 140
>gi|403414885|emb|CCM01585.1| predicted protein [Fibroporia radiculosa]
Length = 414
Score = 83.2 bits (204), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 43/120 (35%), Positives = 67/120 (55%), Gaps = 2/120 (1%)
Query: 60 FR-NASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI-EKNVYCSNCQAI 117
FR +S+E+GGE +FGG+D+ + G I Y PV RK YW+ ++ + + + + + A
Sbjct: 241 FRLGSSEEDGGEAVFGGIDRTAYTGSIDYVPVRRKAYWEVELQKVALGDDELDLEHTGAA 300
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLTRTN 177
DTGTSLI P+ + +N IGA NG VDC + +P + + GK + L T+
Sbjct: 301 IDTGTSLIALPTDIAEMINTQIGAQKQWNGQYTVDCSKVPSLPELVLTFNGKPYPLKGTD 360
Score = 60.5 bits (145), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 36/82 (43%), Positives = 47/82 (57%), Gaps = 15/82 (18%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASD- 65
AQY+ I LGTP Q FKVI DTGSSNLW+PS C+ +IAC + H+ + ++S
Sbjct: 100 AQYFAEIQLGTPAQSFKVILDTGSSNLWVPSSKCT--SIACFL-------HAKYDSSSST 150
Query: 66 ---ENGGE--IMFGGVDKDKFV 82
NG E I +G + FV
Sbjct: 151 TYKANGSEFSIQYGSGSMEGFV 172
>gi|124514108|gb|ABN13683.1| preprochymosin [Capra hircus]
Length = 381
Score = 83.2 bits (204), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 47/125 (37%), Positives = 63/125 (50%), Gaps = 4/125 (3%)
Query: 54 FVGHSHFRNASDENG--GEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYC 111
V F D NG + G +D + G + + PV+ + YWQF V+S+ I V
Sbjct: 205 LVAQDLFSVYMDRNGQGSMLTLGAIDPSYYTGSLHWVPVTLQKYWQFTVDSVTISGAVVA 264
Query: 112 SN--CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGK 169
CQAI DTGTS ++GPS I + + IGA G VDCD+L MP V + GK
Sbjct: 265 CEGGCQAILDTGTSKLVGPSSDILNIQQAIGATQNQYGEFDVDCDSLSSMPTVVFEINGK 324
Query: 170 NFTLT 174
+ LT
Sbjct: 325 MYPLT 329
Score = 59.3 bits (142), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 23/34 (67%), Positives = 29/34 (85%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHC 40
+QY+G I LGTPPQEF V+FDTGSS+ W+PS +C
Sbjct: 72 SQYFGKIYLGTPPQEFTVLFDTGSSDFWVPSIYC 105
>gi|1030715|dbj|BAA06876.1| aspartic protease [Oryza sativa]
gi|1711289|dbj|BAA06875.1| aspartic protease [Oryza sativa]
Length = 509
Score = 83.2 bits (204), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 41/84 (48%), Positives = 59/84 (70%), Gaps = 3/84 (3%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYC-SNCQAI 117
R++ + GGEI+FGG+D + G+ TY PVS+KGYWQF + + I + +C S C AI
Sbjct: 229 RHSDEGEGGEIVFGGMDPSHYKGNHTYVPVSQKGYWQFEMGDVLIGGKTTGFCASGCSAI 288
Query: 118 ADTGTSLIIGPSKVIAELNKLIGA 141
AD+GTSL+ GP+ +I E+N+ IGA
Sbjct: 289 ADSGTSLLAGPTAIITEINEKIGA 312
Score = 64.7 bits (156), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 28/47 (59%), Positives = 36/47 (76%), Gaps = 1/47 (2%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+++ AQY+G I +GTPPQ+F VIFDTGSSNLW+PS C +IAC
Sbjct: 77 LKNYMNAQYFGEIGVGTPPQKFTVIFDTGSSNLWVPSAKC-YFSIAC 122
>gi|443894057|dbj|GAC71407.1| aspartyl protease [Pseudozyma antarctica T-34]
Length = 418
Score = 82.8 bits (203), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 42/117 (35%), Positives = 65/117 (55%), Gaps = 1/117 (0%)
Query: 58 SHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI-EKNVYCSNCQA 116
S + +S+ +GGE +FGG+D + G I ++PV RKGYW+ ++ + + ++ + N A
Sbjct: 245 SFYLGSSEADGGEAVFGGIDDSHYTGKIHWAPVKRKGYWEVALDKLALGDEELELDNGSA 304
Query: 117 IADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
DTGTSLI + LN IGA NG VDC+ + +P + + GK F L
Sbjct: 305 AIDTGTSLIAMATDTAEILNAEIGATKSWNGQYSVDCEKVKDLPPLTFYIDGKPFKL 361
Score = 67.4 bits (163), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 42/86 (48%), Positives = 53/86 (61%), Gaps = 8/86 (9%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDK--DKFVGHSHFRNAS 64
AQY+ ISLGTP QEFKVI DTGSSNLW+PS CS +IAC + K D S+ +N +
Sbjct: 105 AQYFCDISLGTPAQEFKVILDTGSSNLWVPSTKCS--SIACFLHKKYDSSASSSYKKNGT 162
Query: 65 D---ENGGEIMFGGVDKDKF-VGDIT 86
+ + G M G V D +GD+T
Sbjct: 163 EFKIQYGSGSMEGIVSNDVLKIGDLT 188
>gi|115465497|ref|NP_001056348.1| Os05g0567100 [Oryza sativa Japonica Group]
gi|78099759|sp|Q42456.2|ASPR1_ORYSJ RecName: Full=Aspartic proteinase oryzasin-1; Flags: Precursor
gi|51854282|gb|AAU10663.1| aspartic proteinase oryzasin 1 precursor [Oryza sativa Japonica
Group]
gi|113579899|dbj|BAF18262.1| Os05g0567100 [Oryza sativa Japonica Group]
gi|125553350|gb|EAY99059.1| hypothetical protein OsI_21016 [Oryza sativa Indica Group]
gi|169244443|gb|ACA50495.1| aspartic proteinase oryzasin 1 [Oryza sativa Japonica Group]
gi|215695381|dbj|BAG90572.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215737145|dbj|BAG96074.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215740829|dbj|BAG96985.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222632587|gb|EEE64719.1| hypothetical protein OsJ_19575 [Oryza sativa Japonica Group]
Length = 509
Score = 82.8 bits (203), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 41/84 (48%), Positives = 59/84 (70%), Gaps = 3/84 (3%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYC-SNCQAI 117
R++ + GGEI+FGG+D + G+ TY PVS+KGYWQF + + I + +C S C AI
Sbjct: 229 RHSDEGEGGEIVFGGMDPSHYKGNHTYVPVSQKGYWQFEMGDVLIGGKTTGFCASGCSAI 288
Query: 118 ADTGTSLIIGPSKVIAELNKLIGA 141
AD+GTSL+ GP+ +I E+N+ IGA
Sbjct: 289 ADSGTSLLAGPTAIITEINEKIGA 312
Score = 64.7 bits (156), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 28/47 (59%), Positives = 36/47 (76%), Gaps = 1/47 (2%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
+++ AQY+G I +GTPPQ+F VIFDTGSSNLW+PS C +IAC
Sbjct: 77 LKNYMNAQYFGEIGVGTPPQKFTVIFDTGSSNLWVPSAKC-YFSIAC 122
Score = 37.0 bits (84), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 15/43 (34%), Positives = 24/43 (55%)
Query: 131 VIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
++ +N+L +P G + VDC +L MP + +GGK F L
Sbjct: 406 ILNYINQLCDKLPSPMGESSVDCGSLASMPEISFTIGGKKFAL 448
>gi|425767355|gb|EKV05929.1| Vacuolar protease A [Penicillium digitatum PHI26]
gi|425779798|gb|EKV17829.1| Vacuolar protease A [Penicillium digitatum Pd1]
Length = 399
Score = 82.8 bits (203), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 43/102 (42%), Positives = 58/102 (56%), Gaps = 1/102 (0%)
Query: 73 FGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVY-CSNCQAIADTGTSLIIGPSKV 131
FGG+D+ + G++ PV RK YW+ + SI + NV + I DTGTSLI PS +
Sbjct: 241 FGGIDESHYTGELIKIPVRRKAYWEVELNSIALGNNVAELDDTGVILDTGTSLIALPSTM 300
Query: 132 IAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
LNK IGA G V+CD D +P++ LGG NFT+
Sbjct: 301 AELLNKEIGATKGFTGQYSVECDKRDSLPDLTFTLGGHNFTI 342
Score = 63.5 bits (153), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 36/85 (42%), Positives = 51/85 (60%), Gaps = 8/85 (9%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDK--DKFVGHSHFRNAS 64
AQY+ I++GTPPQ FKV+ DTGSSNLW+PS CS +IAC + D ++ +N +
Sbjct: 84 AQYFSEITIGTPPQTFKVVLDTGSSNLWVPSSQCS--SIACFLHSKYDSSSSSTYQKNGT 141
Query: 65 D---ENGGEIMFGGVDKDKF-VGDI 85
D G + G V +D +GD+
Sbjct: 142 DFEIRYGSGSLSGFVSRDTLQIGDL 166
>gi|398396710|ref|XP_003851813.1| hypothetical protein MYCGRDRAFT_104895 [Zymoseptoria tritici
IPO323]
gi|339471693|gb|EGP86789.1| hypothetical protein MYCGRDRAFT_104895 [Zymoseptoria tritici
IPO323]
Length = 398
Score = 82.8 bits (203), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 47/127 (37%), Positives = 71/127 (55%), Gaps = 3/127 (2%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI-EK 107
+D+ F + + DE+ E +FGGV+KD + G +T P+ RK YW+ ++I + ++
Sbjct: 218 LDEKVFAFYLSSTDKGDES--EAIFGGVNKDHYTGKMTNIPLRRKAYWEVDFDAITLGDQ 275
Query: 108 NVYCSNCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILG 167
+ AI DTGTSLI PS + LNK IGA NG V+C D +P++ L
Sbjct: 276 TAELDSTGAILDTGTSLIALPSTMAELLNKEIGAKKGYNGQYSVECSARDSLPDLTFTLT 335
Query: 168 GKNFTLT 174
G NFT++
Sbjct: 336 GHNFTIS 342
Score = 64.3 bits (155), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 27/41 (65%), Positives = 34/41 (82%), Gaps = 2/41 (4%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQY+ I++GTPPQEFKV+ DTGSSNLW+PS+ C +IAC
Sbjct: 84 AQYFSEIAIGTPPQEFKVVLDTGSSNLWVPSKDCG--SIAC 122
>gi|432102593|gb|ELK30160.1| Napsin-A [Myotis davidii]
Length = 357
Score = 82.8 bits (203), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 44/116 (37%), Positives = 64/116 (55%), Gaps = 3/116 (2%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV-YCSN-CQAIA 118
R+ GGE++ GG D ++ +TY PV+ YWQ +E + + + C+ C AI
Sbjct: 224 RDPEAAEGGELVLGGSDPAHYIPPLTYVPVTVPAYWQVHMERVTVGPGLTLCAQGCPAIL 283
Query: 119 DTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
DTGTSLI GP++ I L++ IG PL G ++C + +P V LGG F LT
Sbjct: 284 DTGTSLITGPTEEIRALHRAIGGFPLL-GKYIIECSVIPALPPVSFSLGGVWFNLT 338
Score = 70.1 bits (170), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 28/41 (68%), Positives = 33/41 (80%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQYYG I LGTPPQ F V+FDTGSSNLW+PS+ CS ++ C
Sbjct: 77 AQYYGKIGLGTPPQNFSVVFDTGSSNLWVPSRRCSFFSLPC 117
>gi|384485237|gb|EIE77417.1| hypothetical protein RO3G_02121 [Rhizopus delemar RA 99-880]
Length = 399
Score = 82.8 bits (203), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 40/113 (35%), Positives = 66/113 (58%), Gaps = 1/113 (0%)
Query: 62 NASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKN-VYCSNCQAIADT 120
A ++ GGEI FG +D+ ++ GD+ +SPV RKGYW+ +++ ++ + V S+ A DT
Sbjct: 227 QAENDIGGEIAFGEIDQTRYTGDLIWSPVQRKGYWEIAIDNFRVGADPVNPSSLTAAIDT 286
Query: 121 GTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
GTSLI+ P+ V E++ +GA NG C + +P + + G +F L
Sbjct: 287 GTSLILVPTSVSIEIHARLGAQLSGNGLYIFSCATVSSLPEICVTFSGVDFCL 339
Score = 63.5 bits (153), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 31/61 (50%), Positives = 38/61 (62%), Gaps = 4/61 (6%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHS--HFRNAS 64
AQYYG ISLGTPPQ F V+FDTGSSN W+PS C ++AC + S + RN +
Sbjct: 78 AQYYGEISLGTPPQIFSVVFDTGSSNTWVPSTRC--FSLACLTHRRYSASRSSTYVRNGT 135
Query: 65 D 65
Sbjct: 136 Q 136
>gi|448115983|ref|XP_004202951.1| Piso0_001822 [Millerozyma farinosa CBS 7064]
gi|359383819|emb|CCE79735.1| Piso0_001822 [Millerozyma farinosa CBS 7064]
Length = 414
Score = 82.8 bits (203), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 46/127 (36%), Positives = 68/127 (53%), Gaps = 1/127 (0%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI-EK 107
+D+ KF + ++ + GG FGGVD + GDI PV RK YW+ + I + ++
Sbjct: 232 LDEPKFAFYLGDKDKDENEGGVATFGGVDTKHYKGDIIELPVRRKAYWEVSFDGIGLGDE 291
Query: 108 NVYCSNCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILG 167
++ A DTGTSLI PS + +N IGA +G VDCD+ D +P + +
Sbjct: 292 YAELTSTGAAIDTGTSLITLPSSLAEIINAKIGAKKSWSGQYSVDCDSRDSLPELTMTFH 351
Query: 168 GKNFTLT 174
G NFTL+
Sbjct: 352 GHNFTLS 358
Score = 63.9 bits (154), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 33/61 (54%), Positives = 41/61 (67%), Gaps = 4/61 (6%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKF--VGHSHFRNAS 64
AQYY TI LG+P QEFKVI DTGSSNLW+PS CS ++AC + + S+ +N S
Sbjct: 98 AQYYTTIGLGSPAQEFKVILDTGSSNLWVPSTDCS--SLACFLHSKYYHDESSSYKQNGS 155
Query: 65 D 65
D
Sbjct: 156 D 156
>gi|12231176|dbj|BAB20971.1| aspartic proteinase 3 [Nepenthes alata]
Length = 507
Score = 82.8 bits (203), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 41/84 (48%), Positives = 54/84 (64%), Gaps = 3/84 (3%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV--YCS-NCQAI 117
R +E GGEI+FGGVD + F G +Y PV+ KGYWQF + + I+ YC C AI
Sbjct: 226 RKVGEEEGGEIVFGGVDPNHFKGTHSYVPVTHKGYWQFDMGDVLIDGKATEYCEGGCSAI 285
Query: 118 ADTGTSLIIGPSKVIAELNKLIGA 141
AD+GTSL+ GP+ V+ +N IGA
Sbjct: 286 ADSGTSLLAGPTSVVTMINHAIGA 309
Score = 62.8 bits (151), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 25/40 (62%), Positives = 32/40 (80%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHC 40
+++ AQY+G I +GTPPQ+F VIFDTGSSNLW+PS C
Sbjct: 74 LKNYMDAQYFGEIGIGTPPQKFTVIFDTGSSNLWVPSSKC 113
>gi|73912435|dbj|BAE20414.1| aspartic proteinase [Triticum aestivum]
Length = 498
Score = 82.8 bits (203), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 38/84 (45%), Positives = 60/84 (71%), Gaps = 3/84 (3%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV--YCSN-CQAI 117
R++ +GGE++FGG+D D + G+ TY PVSR+GYWQF + + I+ + +C+ C AI
Sbjct: 223 RDSDALSGGELVFGGMDPDHYKGNHTYVPVSRRGYWQFNMGDLLIDGHSTGFCAKGCAAI 282
Query: 118 ADTGTSLIIGPSKVIAELNKLIGA 141
D+GTSL+ GP+ ++A++N IGA
Sbjct: 283 VDSGTSLLAGPTAIVAQVNHAIGA 306
Score = 64.3 bits (155), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 29/40 (72%), Positives = 31/40 (77%), Gaps = 1/40 (2%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
QYYG I LGTPPQ F VIFDTGSSNLW+PS C +IAC
Sbjct: 78 QYYGVIGLGTPPQNFTVIFDTGSSNLWVPSAKC-YFSIAC 116
Score = 38.5 bits (88), Expect = 0.93, Method: Compositional matrix adjust.
Identities = 25/83 (30%), Positives = 41/83 (49%), Gaps = 7/83 (8%)
Query: 99 GVESIKIEKNV----YCSNCQ-AIADTGTSLIIGPSK--VIAELNKLIGAVPLANGPAKV 151
G+ES+ ++NV C+ C+ A+ L +K ++ N+L +P NG + V
Sbjct: 356 GIESVVGKENVGSDVMCTACEMAVVWIENQLRENKTKELILQYANQLCERLPSPNGESTV 415
Query: 152 DCDNLDKMPNVDIILGGKNFTLT 174
C + KMPN+ + K F LT
Sbjct: 416 SCHEMSKMPNLAFTIASKTFVLT 438
>gi|355329703|dbj|BAL14145.1| pepsinogen 3 [Pagrus major]
Length = 386
Score = 82.8 bits (203), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 41/129 (31%), Positives = 68/129 (52%), Gaps = 6/129 (4%)
Query: 49 VDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--- 105
+D D F + + + + G + FG VD + G I ++PV+ + YWQ GV+ +I
Sbjct: 200 LDADIF---AFYLSRGGQQGSVLSFGDVDSSLYEGQIYWTPVTAQTYWQIGVQGFQISGR 256
Query: 106 EKNVYCSNCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDII 165
E + CQAI DTGTS++ PS+++ + + IGA G VDC ++ +P +
Sbjct: 257 ETGLCSQGCQAIVDTGTSMLTAPSQILGNIMQAIGAQQSQYGMYMVDCSQVNNLPTFSFV 316
Query: 166 LGGKNFTLT 174
+ G F L+
Sbjct: 317 ISGTAFPLS 325
Score = 54.7 bits (130), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 25/43 (58%), Positives = 32/43 (74%), Gaps = 2/43 (4%)
Query: 9 YYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDK 51
YYG IS+GTPPQ +V+FDTGSSNLW+ S +C+ AC+ K
Sbjct: 68 YYGAISIGTPPQSSEVLFDTGSSNLWVDSVYCNTQ--ACNTHK 108
>gi|149244964|ref|XP_001527016.1| vacuolar aspartic protease precursor [Lodderomyces elongisporus
NRRL YB-4239]
gi|146449410|gb|EDK43666.1| vacuolar aspartic protease precursor [Lodderomyces elongisporus
NRRL YB-4239]
Length = 429
Score = 82.8 bits (203), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 44/114 (38%), Positives = 62/114 (54%), Gaps = 1/114 (0%)
Query: 62 NASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI-EKNVYCSNCQAIADT 120
N + +GG FGG D+ F G IT+ PV RK YW+ E + + ++ A DT
Sbjct: 259 NKDENDGGVATFGGYDESLFQGKITWLPVRRKAYWEVAFEGLGLGDEYAELIQTGAAIDT 318
Query: 121 GTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
GTSLI PS + +N IGA +G +VDCD D +P++ + G NFTL+
Sbjct: 319 GTSLITLPSTLAEIINAKIGATKSWSGQYQVDCDKRDSLPDLTLTFSGYNFTLS 372
Score = 64.3 bits (155), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 37/83 (44%), Positives = 47/83 (56%), Gaps = 4/83 (4%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASD- 65
AQY+ I LGTPPQ FKVI DTGSSNLW+PS+ CS L D S+ N S+
Sbjct: 112 AQYFTEIQLGTPPQTFKVILDTGSSNLWVPSKDCSSLACFLHSKYDHDASSSYKANGSEF 171
Query: 66 --ENGGEIMFGGVDKDKF-VGDI 85
+ G M G + +D +GD+
Sbjct: 172 SIQYGSGSMEGYISQDILSIGDL 194
>gi|367047895|ref|XP_003654327.1| hypothetical protein THITE_2117251 [Thielavia terrestris NRRL 8126]
gi|347001590|gb|AEO67991.1| hypothetical protein THITE_2117251 [Thielavia terrestris NRRL 8126]
Length = 396
Score = 82.8 bits (203), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 47/112 (41%), Positives = 63/112 (56%), Gaps = 2/112 (1%)
Query: 64 SDENG-GEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI-EKNVYCSNCQAIADTG 121
+D NG E++FGGVDKD++ G IT P+ RK YW+ ES+ + N AI DTG
Sbjct: 228 ADTNGESEVVFGGVDKDRYKGKITTIPLRRKAYWEVDFESLSYGDDTADFENTGAILDTG 287
Query: 122 TSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
TSLI PS++ LN +GA G +DC D + ++ L G NFTL
Sbjct: 288 TSLITLPSQLAEMLNAQLGAKKNFAGQYVLDCSKRDSLEDITFNLAGYNFTL 339
Score = 63.9 bits (154), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 37/86 (43%), Positives = 50/86 (58%), Gaps = 8/86 (9%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDK--DKFVGHSHFRNAS 64
AQY+ I+LGTPPQ FKV+ DTGSSNLW+PS C +IAC + D ++ +N +
Sbjct: 84 AQYFSEITLGTPPQSFKVVLDTGSSNLWVPSVECG--SIACYLHSKYDSSASSTYKKNGT 141
Query: 65 D---ENGGEIMFGGVDKDKF-VGDIT 86
G + G V +D +GDIT
Sbjct: 142 SFDIRYGSGSLSGFVSQDTLSIGDIT 167
>gi|222630120|gb|EEE62252.1| hypothetical protein OsJ_17039 [Oryza sativa Japonica Group]
Length = 501
Score = 82.8 bits (203), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 39/84 (46%), Positives = 58/84 (69%), Gaps = 3/84 (3%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV--YCSN-CQAI 117
R+ +GGE++FGG+D + GD TY PVSRKGYWQF + + I+ + +C+ C AI
Sbjct: 221 RDPDASSGGELVFGGMDPKHYKGDHTYVPVSRKGYWQFNMGDLLIDGHSTGFCAKGCAAI 280
Query: 118 ADTGTSLIIGPSKVIAELNKLIGA 141
D+GTSL+ GP+ ++A++N IGA
Sbjct: 281 VDSGTSLLAGPTAIVAQVNHAIGA 304
Score = 63.2 bits (152), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 28/40 (70%), Positives = 31/40 (77%), Gaps = 1/40 (2%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
QYYG I LG+PPQ F VIFDTGSSNLW+PS C +IAC
Sbjct: 76 QYYGVIGLGSPPQNFTVIFDTGSSNLWVPSAKC-YFSIAC 114
Score = 37.0 bits (84), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 25/83 (30%), Positives = 41/83 (49%), Gaps = 7/83 (8%)
Query: 99 GVESIKIEKNV----YCSNCQ-AIADTGTSLIIGPSK--VIAELNKLIGAVPLANGPAKV 151
G+ES+ ++N+ CS C+ A+ L +K ++ N+L +P NG + V
Sbjct: 354 GIESVVDKENLGSDAMCSVCEMAVVWIENQLRENKTKELILNYANQLCERLPSPNGESTV 413
Query: 152 DCDNLDKMPNVDIILGGKNFTLT 174
C + KMPN+ + K F LT
Sbjct: 414 SCHQISKMPNLAFTIANKTFILT 436
>gi|162858|gb|AAA30447.1| preprochymosin a [Bos taurus]
Length = 381
Score = 82.8 bits (203), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 48/125 (38%), Positives = 63/125 (50%), Gaps = 4/125 (3%)
Query: 54 FVGHSHFRNASDENGGEIMF--GGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYC 111
V F D NG E M G +D + G + + PV+ + YWQF V+S+ I V
Sbjct: 205 LVAQDLFSVYMDRNGQESMLTLGAIDPCYYTGSLHWVPVTVQQYWQFTVDSVTISGVVVA 264
Query: 112 SN--CQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGK 169
CQAI DTGTS ++GPS I + + IGA +DCDNL MP V + GK
Sbjct: 265 CEGGCQAILDTGTSKLVGPSSDILNIQQAIGATQNQYDEFDIDCDNLSYMPTVVFEINGK 324
Query: 170 NFTLT 174
+ LT
Sbjct: 325 MYPLT 329
Score = 58.9 bits (141), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 23/34 (67%), Positives = 29/34 (85%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHC 40
+QY+G I LGTPPQEF V+FDTGSS+ W+PS +C
Sbjct: 72 SQYFGKIYLGTPPQEFTVLFDTGSSDFWVPSIYC 105
>gi|255724976|ref|XP_002547417.1| vacuolar aspartic protease precursor [Candida tropicalis MYA-3404]
gi|240135308|gb|EER34862.1| vacuolar aspartic protease precursor [Candida tropicalis MYA-3404]
Length = 421
Score = 82.8 bits (203), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 56/153 (36%), Positives = 78/153 (50%), Gaps = 5/153 (3%)
Query: 24 VIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASDEN-GGEIMFGGVDKDKFV 82
+ +DT S N +P + +I N +DK +F G DEN GG FGG D F
Sbjct: 215 LAYDTISVNHIVPPIYNAI-NQGL-LDKPQF-GFYLGNTEKDENDGGLATFGGYDASLFQ 271
Query: 83 GDITYSPVSRKGYWQFGVESIKI-EKNVYCSNCQAIADTGTSLIIGPSKVIAELNKLIGA 141
G +T+ PV RK YW+ E I + ++ A DTGTSLI PS + +N IGA
Sbjct: 272 GKVTWLPVRRKAYWEVAFEGIGLGDEYAELQKTGAAIDTGTSLITLPSSLAEIINAKIGA 331
Query: 142 VPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
+G ++DC D +P++ + G NFTLT
Sbjct: 332 TKSWSGQYQIDCAKRDSLPDLTLTFSGYNFTLT 364
Score = 61.2 bits (147), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 36/83 (43%), Positives = 46/83 (55%), Gaps = 4/83 (4%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASD- 65
AQY+ I LGTP Q FKVI DTGSSNLW+PSQ C+ L D S+ N S+
Sbjct: 104 AQYFTEIELGTPGQPFKVILDTGSSNLWVPSQDCTSLACFLHSKYDHDASSSYKANGSEF 163
Query: 66 --ENGGEIMFGGVDKDKF-VGDI 85
+ G M G + +D +GD+
Sbjct: 164 SIQYGSGSMEGYISQDILTIGDL 186
>gi|2510|emb|CAA31962.1| pre-aspartyl proteinase [Candida albicans]
Length = 380
Score = 82.8 bits (203), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 53/152 (34%), Positives = 79/152 (51%), Gaps = 3/152 (1%)
Query: 24 VIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVG 83
+ +DT S N +P + +I N A ++K +F + + + +GG FGG D F G
Sbjct: 175 LAYDTISVNHIVPPIYNAI-NQAL-LEKPQFGFYLGSTDKDENDGGLATFGGYDASLFQG 232
Query: 84 DITYSPVSRKGYWQFGVESIKI-EKNVYCSNCQAIADTGTSLIIGPSKVIAELNKLIGAV 142
IT+ P+ RK YW+ E I + ++ A DTGTSLI PS + +N IGA
Sbjct: 233 KITWLPIRRKDYWEVSFEGIGLGDEYAELHKTGAAIDTGTSLITLPSSLAEIINAKIGAT 292
Query: 143 PLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
+G +VDC D +P++ + G NFTLT
Sbjct: 293 KSWSGQYQVDCAKRDSLPDLTLTFAGYNFTLT 324
Score = 58.2 bits (139), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 28/59 (47%), Positives = 35/59 (59%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASD 65
AQY+ I +GTP Q FKVI DTGSSNLW+PSQ C+ L D ++ N S+
Sbjct: 64 AQYFTEIEIGTPGQPFKVILDTGSSNLWVPSQDCTSLACFLHAKYDHDASSTYKANGSE 122
>gi|395742691|ref|XP_002821741.2| PREDICTED: pepsin A-5-like, partial [Pongo abelii]
Length = 201
Score = 82.4 bits (202), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 40/111 (36%), Positives = 66/111 (59%), Gaps = 2/111 (1%)
Query: 65 DENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIE-KNVYCSN-CQAIADTGT 122
D++G ++FGG+D + G + + PV+ +GYWQ V+SI + + + C+ CQAI DTGT
Sbjct: 34 DKSGSVVIFGGIDSSYYTGSLNWVPVTVEGYWQITVDSITMNGEAIACAEGCQAIVDTGT 93
Query: 123 SLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
SL+ GP+ IA + IGA ++G V C + +P++ + G + L
Sbjct: 94 SLLTGPTSPIANIQSDIGASENSDGDMVVSCSAISSLPDIVFTINGVQYPL 144
>gi|389747274|gb|EIM88453.1| Asp-domain-containing protein [Stereum hirsutum FP-91666 SS1]
Length = 416
Score = 82.4 bits (202), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 45/120 (37%), Positives = 66/120 (55%), Gaps = 2/120 (1%)
Query: 60 FR-NASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI-EKNVYCSNCQAI 117
FR +S+++GGE +FGG+D + G I Y+PV RK YW+ +E + + ++ N A
Sbjct: 244 FRLGSSEDDGGEAIFGGIDPSAYSGKIDYAPVRRKAYWEVELEKVSFGDDDLELENTGAA 303
Query: 118 ADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLTRTN 177
DTGTSLI P+ V LN IGA NG VDC + +P++ K + L T+
Sbjct: 304 IDTGTSLIALPTDVAEMLNTQIGAKKSWNGQYTVDCAKVPDLPDLTFYFNEKPYPLKGTD 363
Score = 65.5 bits (158), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 38/88 (43%), Positives = 48/88 (54%), Gaps = 4/88 (4%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASD- 65
AQYY I +GTPPQ FKVI DTGSSNLW+PS C+ + D S+ N ++
Sbjct: 103 AQYYTEIDIGTPPQTFKVILDTGSSNLWVPSSQCTSIACFLHTKYDSSASSSYKANGTEF 162
Query: 66 --ENGGEIMFGGVDKDKFV-GDITYSPV 90
+ G M G V D V GD++ S V
Sbjct: 163 SIQYGSGSMEGFVSNDDIVFGDMSLSSV 190
>gi|2921120|gb|AAC04681.1| pregnancy-associated glycoprotein 8 [Bos taurus]
Length = 387
Score = 82.4 bits (202), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 38/113 (33%), Positives = 67/113 (59%), Gaps = 2/113 (1%)
Query: 58 SHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVY-CSN-CQ 115
+ + ++ EN +MFGGVD G + + PV++ +WQ + S+ + NV CS CQ
Sbjct: 209 AFYLSSRPENISTVMFGGVDHTYHKGKLQWIPVTQARFWQVAMSSMTMNGNVVGCSQGCQ 268
Query: 116 AIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGG 168
A+ DTGTSL++GP+ ++ ++ KLI P+ N + CD ++ +P + + + G
Sbjct: 269 AVVDTGTSLLVGPTHLVTDILKLINPNPILNDEQMLSCDAINSLPTLLLTING 321
Score = 58.5 bits (140), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 27/51 (52%), Positives = 38/51 (74%), Gaps = 2/51 (3%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDK 51
+++ A Y+GTI++GTPPQEF+V FDTGSS+LW+PS C + +CS K
Sbjct: 63 LKNFQNAVYFGTITIGTPPQEFQVNFDTGSSDLWVPSVDCQ--SPSCSKHK 111
>gi|115461973|ref|NP_001054586.1| Os05g0137400 [Oryza sativa Japonica Group]
gi|78099760|sp|P42211.2|ASPRX_ORYSJ RecName: Full=Aspartic proteinase; Flags: Precursor
gi|46485798|gb|AAS98423.1| aspartic proteinase [Oryza sativa Japonica Group]
gi|113578137|dbj|BAF16500.1| Os05g0137400 [Oryza sativa Japonica Group]
gi|215694423|dbj|BAG89416.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 496
Score = 82.4 bits (202), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 39/84 (46%), Positives = 58/84 (69%), Gaps = 3/84 (3%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV--YCSN-CQAI 117
R+ +GGE++FGG+D + GD TY PVSRKGYWQF + + I+ + +C+ C AI
Sbjct: 221 RDPDASSGGELVFGGMDPKHYKGDHTYVPVSRKGYWQFNMGDLLIDGHSTGFCAKGCAAI 280
Query: 118 ADTGTSLIIGPSKVIAELNKLIGA 141
D+GTSL+ GP+ ++A++N IGA
Sbjct: 281 VDSGTSLLAGPTAIVAQVNHAIGA 304
Score = 63.2 bits (152), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 28/40 (70%), Positives = 31/40 (77%), Gaps = 1/40 (2%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
QYYG I LG+PPQ F VIFDTGSSNLW+PS C +IAC
Sbjct: 76 QYYGVIGLGSPPQNFTVIFDTGSSNLWVPSAKC-YFSIAC 114
Score = 37.0 bits (84), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 25/83 (30%), Positives = 41/83 (49%), Gaps = 7/83 (8%)
Query: 99 GVESIKIEKNV----YCSNCQ-AIADTGTSLIIGPSK--VIAELNKLIGAVPLANGPAKV 151
G+ES+ ++N+ CS C+ A+ L +K ++ N+L +P NG + V
Sbjct: 354 GIESVVDKENLGSDAMCSVCEMAVVWIENQLRENKTKELILNYANQLCERLPSPNGESTV 413
Query: 152 DCDNLDKMPNVDIILGGKNFTLT 174
C + KMPN+ + K F LT
Sbjct: 414 SCHQISKMPNLAFTIANKTFILT 436
>gi|125991761|ref|NP_788792.2| pregnancy-associated glycoprotein 8 precursor [Bos taurus]
gi|124829028|gb|AAI33477.1| Pregnancy-associated glycoprotein 8 [Bos taurus]
Length = 387
Score = 82.4 bits (202), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 38/113 (33%), Positives = 67/113 (59%), Gaps = 2/113 (1%)
Query: 58 SHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVY-CSN-CQ 115
+ + ++ EN +MFGGVD G + + PV++ +WQ + S+ + NV CS CQ
Sbjct: 209 AFYLSSRPENISTVMFGGVDHTYHKGKLQWIPVTQARFWQVAMSSMTMNGNVVGCSQGCQ 268
Query: 116 AIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGG 168
A+ DTGTSL++GP+ ++ ++ KLI P+ N + CD ++ +P + + + G
Sbjct: 269 AVVDTGTSLLVGPTHLVTDILKLINPNPILNDEQMLSCDAINSLPTLLLTING 321
Score = 58.5 bits (140), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 27/51 (52%), Positives = 38/51 (74%), Gaps = 2/51 (3%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDK 51
+++ A Y+GTI++GTPPQEF+V FDTGSS+LW+PS C + +CS K
Sbjct: 63 LKNFQNAVYFGTITIGTPPQEFQVNFDTGSSDLWVPSVDCQ--SPSCSKHK 111
>gi|218143|dbj|BAA02242.1| aspartic proteinase [Oryza sativa Japonica Group]
Length = 496
Score = 82.4 bits (202), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 39/84 (46%), Positives = 58/84 (69%), Gaps = 3/84 (3%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV--YCSN-CQAI 117
R+ +GGE++FGG+D + GD TY PVSRKGYWQF + + I+ + +C+ C AI
Sbjct: 221 RDPDASSGGELVFGGMDPKHYKGDHTYVPVSRKGYWQFNMGDLLIDGHSTGFCAKGCAAI 280
Query: 118 ADTGTSLIIGPSKVIAELNKLIGA 141
D+GTSL+ GP+ ++A++N IGA
Sbjct: 281 VDSGTSLLAGPTAIVAQVNHAIGA 304
Score = 63.2 bits (152), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 28/40 (70%), Positives = 31/40 (77%), Gaps = 1/40 (2%)
Query: 8 QYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
QYYG I LG+PPQ F VIFDTGSSNLW+PS C +IAC
Sbjct: 76 QYYGVIGLGSPPQNFTVIFDTGSSNLWVPSAKC-YFSIAC 114
Score = 37.0 bits (84), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 25/83 (30%), Positives = 41/83 (49%), Gaps = 7/83 (8%)
Query: 99 GVESIKIEKNV----YCSNCQ-AIADTGTSLIIGPSK--VIAELNKLIGAVPLANGPAKV 151
G+ES+ ++N+ CS C+ A+ L +K ++ N+L +P NG + V
Sbjct: 354 GIESVVDKENLGSDAMCSVCEMAVVWIENQLRENKTKELILNYANQLCERLPSPNGESTV 413
Query: 152 DCDNLDKMPNVDIILGGKNFTLT 174
C + KMPN+ + K F LT
Sbjct: 414 SCHQISKMPNLAFTIANKTFILT 436
>gi|367031892|ref|XP_003665229.1| aspartic protease [Myceliophthora thermophila ATCC 42464]
gi|347012500|gb|AEO59984.1| aspartic protease [Myceliophthora thermophila ATCC 42464]
Length = 397
Score = 82.4 bits (202), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 47/112 (41%), Positives = 62/112 (55%), Gaps = 2/112 (1%)
Query: 64 SDENG-GEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI-EKNVYCSNCQAIADTG 121
+D NG E++FGGVD DK+ G IT P+ RK YW+ ++I + N I DTG
Sbjct: 229 ADTNGQSEVVFGGVDHDKYKGKITTIPLRRKAYWEVDFDAISYGDDTAELENTGIILDTG 288
Query: 122 TSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
TSLI PS++ LN IGA G +DC+ D + +V L G NFTL
Sbjct: 289 TSLIALPSQLAEMLNAQIGAKKSYTGQYTIDCNKRDSLKDVTFNLAGYNFTL 340
Score = 61.6 bits (148), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 36/86 (41%), Positives = 50/86 (58%), Gaps = 8/86 (9%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDK--DKFVGHSHFRNAS 64
AQY+ I++GTPPQ FKV+ DTGSSNLW+PS C +IAC + D ++ +N +
Sbjct: 85 AQYFSEITIGTPPQSFKVVLDTGSSNLWVPSVECG--SIACYLHSKYDSSASSTYKKNGT 142
Query: 65 D---ENGGEIMFGGVDKDKF-VGDIT 86
G + G V +D +GDIT
Sbjct: 143 SFEIRYGSGSLSGFVSQDTVSIGDIT 168
>gi|241951846|ref|XP_002418645.1| Vacuolar aspartic proteinase, putative [Candida dubliniensis CD36]
gi|223641984|emb|CAX43948.1| Vacuolar aspartic proteinase, putative [Candida dubliniensis CD36]
Length = 419
Score = 82.4 bits (202), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 53/152 (34%), Positives = 78/152 (51%), Gaps = 3/152 (1%)
Query: 24 VIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVG 83
+ +DT S N +P + +I N ++K +F + + + +GG FGG D F G
Sbjct: 213 LAYDTISVNHIVPPIYNAI-NQGL-LEKPQFGFYLGSTDKDENDGGLATFGGYDASLFQG 270
Query: 84 DITYSPVSRKGYWQFGVESIKI-EKNVYCSNCQAIADTGTSLIIGPSKVIAELNKLIGAV 142
IT+ PV RK YW+ E I + ++ A DTGTSLI PS + +N IGA
Sbjct: 271 KITWLPVRRKAYWEVSFEGIGLGDEYAELHKTGAAIDTGTSLITLPSSLAEIINAKIGAT 330
Query: 143 PLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
+G +VDC D +P++ + G NFTLT
Sbjct: 331 KSWSGQYQVDCAKRDSLPDLTLTFAGYNFTLT 362
Score = 59.7 bits (143), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 34/83 (40%), Positives = 46/83 (55%), Gaps = 4/83 (4%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASD- 65
AQY+ I +GTP Q FKVI DTGSSNLW+PSQ C+ L D ++ N S+
Sbjct: 102 AQYFTEIQIGTPGQPFKVILDTGSSNLWVPSQDCTSLACFLHAKYDHDASSTYKANGSEF 161
Query: 66 --ENGGEIMFGGVDKDKF-VGDI 85
+ G M G + +D +GD+
Sbjct: 162 SIQYGSGSMEGYISQDVLTIGDL 184
>gi|8272388|dbj|BAA96446.1| aspartic endopeptidase [Pyrus pyrifolia]
Length = 273
Score = 82.4 bits (202), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 40/76 (52%), Positives = 55/76 (72%), Gaps = 3/76 (3%)
Query: 69 GEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKI--EKNVYCSN-CQAIADTGTSLI 125
GEI+FGGVD F G+ TY PV++KGYWQF + + I E + +C+N C AIAD+GTSL+
Sbjct: 1 GEIVFGGVDSSHFKGEHTYVPVTQKGYWQFDMGDVLIDGESSGFCANGCSAIADSGTSLL 60
Query: 126 IGPSKVIAELNKLIGA 141
GP+ V+ ++N IGA
Sbjct: 61 AGPTTVVTQINHAIGA 76
Score = 35.8 bits (81), Expect = 6.6, Method: Compositional matrix adjust.
Identities = 15/45 (33%), Positives = 27/45 (60%)
Query: 130 KVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
+++ +N+L +P +G + V CD+L +P+V +GGK F L
Sbjct: 169 QILDYVNQLCERLPSPSGESVVQCDSLSSLPSVSFTIGGKVFDLA 213
>gi|406608071|emb|CCH40505.1| Saccharopepsin [Wickerhamomyces ciferrii]
Length = 401
Score = 82.4 bits (202), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 55/151 (36%), Positives = 81/151 (53%), Gaps = 4/151 (2%)
Query: 24 VIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASDENGGEIMFGGVDKDKFVG 83
+ +DT S N +P + + LN +D+ KF + + + E+GG FGG+D+ K+ G
Sbjct: 195 LAYDTISVNKIVPPVYKA-LNSGL-LDEPKFAFYLGDADKT-EDGGVATFGGIDESKYTG 251
Query: 84 DITYSPVSRKGYWQFGVESIKI-EKNVYCSNCQAIADTGTSLIIGPSKVIAELNKLIGAV 142
IT+ PV RK YW+ I + ++ N A DTGTSLI PS + LN IGA
Sbjct: 252 KITWLPVRRKAYWEVKFNGIGLGDEFAELENTGAAIDTGTSLIALPSGLAEILNSEIGAK 311
Query: 143 PLANGPAKVDCDNLDKMPNVDIILGGKNFTL 173
+G VDC+ D +P++ L G NFT+
Sbjct: 312 KGWSGQYSVDCETRDSLPDLTFNLDGYNFTI 342
Score = 59.7 bits (143), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 30/59 (50%), Positives = 34/59 (57%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASD 65
AQYY I +GTP Q FKVI DTGSSNLW+PS CS L D ++ N SD
Sbjct: 84 AQYYTEIQIGTPGQPFKVILDTGSSNLWVPSTDCSSLACYLHSKYDHEASSTYKANGSD 142
>gi|189211129|ref|XP_001941895.1| vacuolar protease A precursor [Pyrenophora tritici-repentis
Pt-1C-BFP]
gi|187977988|gb|EDU44614.1| vacuolar protease A precursor [Pyrenophora tritici-repentis
Pt-1C-BFP]
Length = 399
Score = 82.4 bits (202), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 44/124 (35%), Positives = 66/124 (53%), Gaps = 1/124 (0%)
Query: 52 DKFVGHSHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVY- 110
D+ V + + + + E FGG+D+ K+ G + P+ RK YW+ ++++ K
Sbjct: 220 DEPVFAFYLGDTNQQQESEATFGGIDESKYTGKMIKLPLRRKAYWEVELDALTFGKETAE 279
Query: 111 CSNCQAIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKN 170
N I DTGTSLI PS + LNK IGA NG V+CD D +P++ L G N
Sbjct: 280 MDNTGIILDTGTSLIALPSTIAELLNKEIGAKKSFNGQYTVECDKRDSLPDLTFTLTGHN 339
Query: 171 FTLT 174
FT++
Sbjct: 340 FTIS 343
Score = 64.3 bits (155), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 29/41 (70%), Positives = 33/41 (80%), Gaps = 2/41 (4%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIAC 47
AQY+ ISLGTPPQ FKVI DTGSSNLW+PS C+ +IAC
Sbjct: 85 AQYFSEISLGTPPQTFKVILDTGSSNLWVPSSSCN--SIAC 123
>gi|57526769|ref|NP_001009804.1| chymosin precursor [Ovis aries]
gi|116405|sp|P18276.1|CHYM_SHEEP RecName: Full=Chymosin; AltName: Full=Preprorennin; Flags:
Precursor
gi|1374|emb|CAA37209.1| preprochymosin [Ovis aries]
gi|229045|prf||1817165A prepro-chymosin
Length = 381
Score = 82.4 bits (202), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 42/110 (38%), Positives = 59/110 (53%), Gaps = 2/110 (1%)
Query: 67 NGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCSN--CQAIADTGTSL 124
G + G +D + G + + PV+ + YWQF V+S+ I V CQAI DTGTS
Sbjct: 220 QGSMLTLGAIDPSYYTGSLHWVPVTLQKYWQFTVDSVTISGAVVACEGGCQAILDTGTSK 279
Query: 125 IIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGGKNFTLT 174
++GPS I + + IGA G +DCD+L MP V + GK + LT
Sbjct: 280 LVGPSSDILNIQQAIGATQNQYGEFDIDCDSLSSMPTVVFEINGKMYPLT 329
Score = 59.3 bits (142), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 23/34 (67%), Positives = 29/34 (85%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHC 40
+QY+G I LGTPPQEF V+FDTGSS+ W+PS +C
Sbjct: 72 SQYFGKIYLGTPPQEFTVLFDTGSSDFWVPSIYC 105
>gi|296471627|tpg|DAA13742.1| TPA: pregnancy-associated glycoprotein 8 [Bos taurus]
Length = 387
Score = 82.4 bits (202), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 38/113 (33%), Positives = 67/113 (59%), Gaps = 2/113 (1%)
Query: 58 SHFRNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVY-CSN-CQ 115
+ + ++ EN +MFGGVD G + + PV++ +WQ + S+ + NV CS CQ
Sbjct: 209 AFYLSSRPENISMVMFGGVDHTYHKGKLQWIPVTQARFWQVAMSSMTMNGNVVGCSQGCQ 268
Query: 116 AIADTGTSLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNLDKMPNVDIILGG 168
A+ DTGTSL++GP+ ++ ++ KLI P+ N + CD ++ +P + + + G
Sbjct: 269 AVVDTGTSLLVGPTHLVTDILKLINPNPILNDEQMLSCDAINSLPTLLLTING 321
Score = 58.5 bits (140), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 27/51 (52%), Positives = 38/51 (74%), Gaps = 2/51 (3%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDK 51
+++ A Y+GTI++GTPPQEF+V FDTGSS+LW+PS C + +CS K
Sbjct: 63 LKNFQNAVYFGTITIGTPPQEFQVNFDTGSSDLWVPSVDCQ--SPSCSKHK 111
>gi|218196057|gb|EEC78484.1| hypothetical protein OsI_18377 [Oryza sativa Indica Group]
Length = 389
Score = 82.4 bits (202), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 39/84 (46%), Positives = 58/84 (69%), Gaps = 3/84 (3%)
Query: 61 RNASDENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV--YCSN-CQAI 117
R+ +GGE++FGG+D + GD TY PVSRKGYWQF + + I+ + +C+ C AI
Sbjct: 114 RDPDASSGGELVFGGMDPKHYKGDHTYVPVSRKGYWQFNMGDLLIDGHSTGFCAKGCAAI 173
Query: 118 ADTGTSLIIGPSKVIAELNKLIGA 141
D+GTSL+ GP+ ++A++N IGA
Sbjct: 174 VDSGTSLLAGPTAIVAQVNHAIGA 197
Score = 36.6 bits (83), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 25/83 (30%), Positives = 41/83 (49%), Gaps = 7/83 (8%)
Query: 99 GVESIKIEKNV----YCSNCQ-AIADTGTSLIIGPSK--VIAELNKLIGAVPLANGPAKV 151
G+ES+ ++N+ CS C+ A+ L +K ++ N+L +P NG + V
Sbjct: 247 GIESVVDKENLGSDAMCSVCEMAVVWIENQLRENKTKELILNYANQLCERLPSPNGESTV 306
Query: 152 DCDNLDKMPNVDIILGGKNFTLT 174
C + KMPN+ + K F LT
Sbjct: 307 SCHQISKMPNLAFTIANKTFILT 329
>gi|357450315|ref|XP_003595434.1| Aspartic proteinase [Medicago truncatula]
gi|355484482|gb|AES65685.1| Aspartic proteinase [Medicago truncatula]
Length = 507
Score = 82.0 bits (201), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 45/94 (47%), Positives = 57/94 (60%), Gaps = 7/94 (7%)
Query: 66 ENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNV--YCS-NCQAIADTGT 122
E GGEI+FGGVDK F GD TY P+S+KGYWQ V I + N C C AI D+GT
Sbjct: 229 EVGGEIVFGGVDKRHFRGDHTYFPISQKGYWQIEVGDILLANNTTGLCEGGCAAIVDSGT 288
Query: 123 SLIIGPSKVIAELNKLIGAVPLANGPAKVDCDNL 156
SLI GP+ V+ ++N +IG G +C N+
Sbjct: 289 SLIAGPTGVVTQINHVIG----TEGYVSYECKNI 318
Score = 66.6 bits (161), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 32/71 (45%), Positives = 43/71 (60%), Gaps = 8/71 (11%)
Query: 1 MRSSYQAQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHF 60
+++ + QYYG I +G+PPQ F V+FDTGSSNLW+PS C I +IAC HS +
Sbjct: 72 LKNYFDVQYYGEIGIGSPPQYFNVVFDTGSSNLWVPSSRC-IFSIACYF-------HSKY 123
Query: 61 RNASDENGGEI 71
R+ EI
Sbjct: 124 RSGISSTYNEI 134
>gi|145352062|ref|XP_001420378.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144580612|gb|ABO98671.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 454
Score = 82.0 bits (201), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 37/81 (45%), Positives = 56/81 (69%), Gaps = 5/81 (6%)
Query: 66 ENGGEIMFGGVDKDKFVGDITYSPVSRKGYWQFGVESIKIEKNVYCS-----NCQAIADT 120
E GGE++FGG D D F+G+ TY+PV+R+GYWQ ++ K++ + CQ IADT
Sbjct: 181 EVGGELIFGGSDPDHFIGEHTYAPVTREGYWQIKMDDFKVDGRSLGACDGDDGCQVIADT 240
Query: 121 GTSLIIGPSKVIAELNKLIGA 141
GTSL+ GP++++ ++N IGA
Sbjct: 241 GTSLLAGPTEIVNKINDYIGA 261
Score = 66.2 bits (160), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 29/61 (47%), Positives = 38/61 (62%), Gaps = 7/61 (11%)
Query: 7 AQYYGTISLGTPPQEFKVIFDTGSSNLWIPSQHCSILNIACSVDKDKFVGHSHFRNASDE 66
AQYYG I +G P Q+F+V+FDTGSSNLW+PS C L I C + H+ F + + E
Sbjct: 24 AQYYGEIEIGNPRQKFQVVFDTGSSNLWVPSSKCGFLQIPCDL-------HAKFDSRASE 76
Query: 67 N 67
Sbjct: 77 T 77
Score = 50.4 bits (119), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 23/80 (28%), Positives = 41/80 (51%), Gaps = 3/80 (3%)
Query: 98 FGVESIKIEKNVYCSNCQAIADTGTSLI---IGPSKVIAELNKLIGAVPLANGPAKVDCD 154
G S E + C+ C+ + + ++ + ++ E+ ++ VP G A VDCD
Sbjct: 315 LGKSSSSFEGQIACTACKTVVNYAQDMLAQNVTEKIIVNEVKRVCDMVPSVGGTASVDCD 374
Query: 155 NLDKMPNVDIILGGKNFTLT 174
N+ MP+V+ ++GG F LT
Sbjct: 375 NIPNMPDVEFVIGGVPFKLT 394
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.317 0.136 0.414
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 3,030,282,679
Number of Sequences: 23463169
Number of extensions: 126477724
Number of successful extensions: 253847
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 3569
Number of HSP's successfully gapped in prelim test: 831
Number of HSP's that attempted gapping in prelim test: 243852
Number of HSP's gapped (non-prelim): 8390
length of query: 177
length of database: 8,064,228,071
effective HSP length: 133
effective length of query: 44
effective length of database: 9,238,593,890
effective search space: 406498131160
effective search space used: 406498131160
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 71 (32.0 bits)