BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 017398
(372 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|P49043|VPE_CITSI Vacuolar-processing enzyme OS=Citrus sinensis PE=2 SV=1
Length = 494
Score = 751 bits (1938), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 362/365 (99%), Positives = 363/365 (99%)
Query: 1 MTRLASGVLITLLVALAGIADGSRDIAGDILKLPSEAYRFFHNGGGGAKVNDDDDSVGTR 60
MTRLASGVLITLLVALAGIADGSRDIAGDILKLPSEAYRFFHNGGGGAKVNDDDDSVGTR
Sbjct: 1 MTRLASGVLITLLVALAGIADGSRDIAGDILKLPSEAYRFFHNGGGGAKVNDDDDSVGTR 60
Query: 61 WAVLLAGSNGFWNYRHQADICHAYQLLRKGGLKDENIIVFMYDDIAFNEENPRPGVIINH 120
WAVLLAGSNGFWNYRHQADICHAYQLLRKGGLKDENIIVFMYDDIAFNEENPRPGVIINH
Sbjct: 61 WAVLLAGSNGFWNYRHQADICHAYQLLRKGGLKDENIIVFMYDDIAFNEENPRPGVIINH 120
Query: 121 PHGDDVYKGVPKDYTGEDVTVENFFAVILGNKTALTGGSGKVVDSGPNDHIFIFYSDHGG 180
PHGDDVYKGVPKDYTGEDVTVE FFAV+LGNKTALTGGSGKVVDSGPNDHIFIFYSDHGG
Sbjct: 121 PHGDDVYKGVPKDYTGEDVTVEKFFAVVLGNKTALTGGSGKVVDSGPNDHIFIFYSDHGG 180
Query: 181 PGVLGMPTSRYIYADELIDVLKKKHASGNYKSLVFYLEACESGSIFEGLLPEGLNIYATT 240
PGVLGMPTSRYIYADELIDVLKKKHASGNYKSLVFYLEACESGSIFEGLL EGLNIYATT
Sbjct: 181 PGVLGMPTSRYIYADELIDVLKKKHASGNYKSLVFYLEACESGSIFEGLLLEGLNIYATT 240
Query: 241 ASNAEESSWGTYCPGEIPGPPPEYSTCLGDLYSIAWMEDSDIHNLRTETLHQQYELVKTR 300
ASNAEESSWGTYCPGEIPGPPPEYSTCLGDLYSIAWMEDSDIHNLRTETLHQQYELVKTR
Sbjct: 241 ASNAEESSWGTYCPGEIPGPPPEYSTCLGDLYSIAWMEDSDIHNLRTETLHQQYELVKTR 300
Query: 301 TASYNSYGSHVMQYGDIGLSKNNLFTYLGTNPANDNYTFVDENSLRPASKAVNQRDADLL 360
TASYNSYGSHVMQYGDIGLSKNNLFTYLGTNPANDNYTFVDENSLRPASKAVNQRDADLL
Sbjct: 301 TASYNSYGSHVMQYGDIGLSKNNLFTYLGTNPANDNYTFVDENSLRPASKAVNQRDADLL 360
Query: 361 HFWDK 365
HFWDK
Sbjct: 361 HFWDK 365
>sp|O24325|VPE1_PHAVU Vacuolar-processing enzyme OS=Phaseolus vulgaris PE=1 SV=1
Length = 484
Score = 546 bits (1408), Expect = e-155, Method: Compositional matrix adjust.
Identities = 266/343 (77%), Positives = 295/343 (86%), Gaps = 14/343 (4%)
Query: 24 RDIAGDILKLPSEAYRFFHNGGGGAKVNDDDDSVGTRWAVLLAGSNGFWNYRHQADICHA 83
RD+ GD L+LPS++ G G V+ GTRWA+L AGS+G+WNYRHQADICHA
Sbjct: 26 RDLVGDFLRLPSDS-------GNGDNVH------GTRWAILFAGSSGYWNYRHQADICHA 72
Query: 84 YQLLRKGGLKDENIIVFMYDDIAFNEENPRPGVIINHPHGDDVYKGVPKDYTGEDVTVEN 143
YQLLRKGGLKDENIIVFMYDDIAFN ENPR GVIIN P+GD+VYKGVPKDYTGEDVT N
Sbjct: 73 YQLLRKGGLKDENIIVFMYDDIAFNSENPRRGVIINSPNGDEVYKGVPKDYTGEDVTAHN 132
Query: 144 FFAVILGNKTALTGGSGKVVDSGPNDHIFIFYSDHGGPGVLGMPTSRYIYADELIDVLKK 203
F+A +LG+K+ LTGGSGKVV+SGPNDHIFIFYSDHGGPGVLG P YIYA +L +VLKK
Sbjct: 133 FYAALLGDKSKLTGGSGKVVNSGPNDHIFIFYSDHGGPGVLGSPAGPYIYASDLNEVLKK 192
Query: 204 KHASGNYKSLVFYLEACESGSIFEGLLPEGLNIYATTASNAEESSWGTYCPGEIPGPPPE 263
KHASG YK+LVFYLEACESGSIFEGLLPE +N+YATTASNA+ESSWGTYCPGE P PPPE
Sbjct: 193 KHASGTYKNLVFYLEACESGSIFEGLLPEDINVYATTASNADESSWGTYCPGEDPSPPPE 252
Query: 264 YSTCLGDLYSIAWMEDSDIHNLRTETLHQQYELVKTRTASYN-SYGSHVMQYGDIGLSKN 322
YSTCLGDLYS+AWMEDSD HNLRTETLHQQY+LVK RT S YGSHVMQYGD+GLSK+
Sbjct: 253 YSTCLGDLYSVAWMEDSDRHNLRTETLHQQYKLVKERTISGGLYYGSHVMQYGDVGLSKD 312
Query: 323 NLFTYLGTNPANDNYTFVDENSLRPASKAVNQRDADLLHFWDK 365
LF YLGT+PAN+N TFVDENSL +SKAVNQRDADL+HFWDK
Sbjct: 313 ILFHYLGTDPANENLTFVDENSLWSSSKAVNQRDADLVHFWDK 355
>sp|P49047|VPEA_ARATH Vacuolar-processing enzyme alpha-isozyme OS=Arabidopsis thaliana
GN=At2g25940 PE=2 SV=2
Length = 478
Score = 546 bits (1408), Expect = e-155, Method: Compositional matrix adjust.
Identities = 263/366 (71%), Positives = 308/366 (84%), Gaps = 17/366 (4%)
Query: 1 MTRLASGVLITLLVALAGIADGSRDIAGDILKLPSEAYRFFHNGGGGAKVNDDDDSVGTR 60
MT + S + + L + A ++GD++KLPS A +FF +DDDS T+
Sbjct: 1 MTTVVSFLALFLFLVAA--------VSGDVIKLPSLASKFFR------PTENDDDS--TK 44
Query: 61 WAVLLAGSNGFWNYRHQADICHAYQLLRKGGLKDENIIVFMYDDIAFNEENPRPGVIINH 120
WAVL+AGS+G+WNYRHQAD+CHAYQLL+KGG+K+ENI+VFMYDDIA NEENPRPGVIIN
Sbjct: 45 WAVLVAGSSGYWNYRHQADVCHAYQLLKKGGVKEENIVVFMYDDIAKNEENPRPGVIINS 104
Query: 121 PHGDDVYKGVPKDYTGEDVTVENFFAVILGNKTALTGGSGKVVDSGPNDHIFIFYSDHGG 180
P+G+DVY GVPKDYTG++V V+N AVILGNKTAL GGSGKVVDSGPNDHIFI+YSDHGG
Sbjct: 105 PNGEDVYNGVPKDYTGDEVNVDNLLAVILGNKTALKGGSGKVVDSGPNDHIFIYYSDHGG 164
Query: 181 PGVLGMPTSRYIYADELIDVLKKKHASGNYKSLVFYLEACESGSIFEGLLPEGLNIYATT 240
PGVLGMPTS +YA++L DVLKKK+ASG YKSLVFYLEACESGSIFEGLLPEGLNIYATT
Sbjct: 165 PGVLGMPTSPNLYANDLNDVLKKKYASGTYKSLVFYLEACESGSIFEGLLPEGLNIYATT 224
Query: 241 ASNAEESSWGTYCPGEIPGPPPEYSTCLGDLYSIAWMEDSDIHNLRTETLHQQYELVKTR 300
ASNAEESSWGTYCPGE P PP EY TCLGDLYS+AW+EDS+ HNL+TETLH+QYELVK R
Sbjct: 225 ASNAEESSWGTYCPGEDPSPPSEYETCLGDLYSVAWIEDSEKHNLQTETLHEQYELVKKR 284
Query: 301 TA-SYNSYGSHVMQYGDIGLSKNNLFTYLGTNPANDNYTFVDENSLRPASKAVNQRDADL 359
TA S SYGSHVM++GDIGLSK L ++GTNPA++N+TFV+ENS+RP S+ NQRDADL
Sbjct: 285 TAGSGKSYGSHVMEFGDIGLSKEKLVLFMGTNPADENFTFVNENSIRPPSRVTNQRDADL 344
Query: 360 LHFWDK 365
+HFW K
Sbjct: 345 VHFWHK 350
>sp|P49044|VPE_VICSA Vacuolar-processing enzyme OS=Vicia sativa PE=1 SV=1
Length = 493
Score = 542 bits (1397), Expect = e-153, Method: Compositional matrix adjust.
Identities = 265/368 (72%), Positives = 306/368 (83%), Gaps = 11/368 (2%)
Query: 2 TRLASGVLITLLVALAGIADGSRDIAGDILKLPSEAYRFFHNGGGGAKVNDDDDSVGTRW 61
++L++ + T++V + RD+ GD L+LPSE RFF + +DDD GTRW
Sbjct: 4 SQLSTLLFFTIVVTFLTVVSSGRDLPGDYLRLPSETSRFFR------EPKNDDDFEGTRW 57
Query: 62 AVLLAGSNGFWNYRHQADICHAYQLLRKGGLKDENIIVFMYDDIAFNEENPRPGVIINHP 121
A+LLAGSNG+WNYRHQ+D+CHAYQLLRKGG K+ENIIVFMYDDIA NEENPRPGVIIN P
Sbjct: 58 AILLAGSNGYWNYRHQSDVCHAYQLLRKGGSKEENIIVFMYDDIASNEENPRPGVIINKP 117
Query: 122 HGDDVYKGVPKDYTGEDVTVENFFAVILGNKTALTGGSGKVVDSGPNDHIFIFYSDHGGP 181
GDDVY GVPKDYTG +V +NF+A +LGNK+ALTGGSGKVVDSGPNDHIF++Y+DHGGP
Sbjct: 118 DGDDVYAGVPKDYTGAEVHADNFYAALLGNKSALTGGSGKVVDSGPNDHIFVYYTDHGGP 177
Query: 182 GVLGMPTSRYIYADELIDVLKKKHASGNYKSLVFYLEACESGSIFEGLLPEGLNIYATTA 241
GVLGMP Y+YA +L +VLKKKHASG YKSLVFYLEACESGSIFEGLLP+ LNIYATTA
Sbjct: 178 GVLGMPVGPYLYASDLNEVLKKKHASGTYKSLVFYLEACESGSIFEGLLPDDLNIYATTA 237
Query: 242 SNAEESSWGTYCPGEIPGPPPEYSTCLGDLYSIAWMEDSDIHNLRTETLHQQYELVKTRT 301
SNAEESSWG YCPG+ P PPPEYSTCLGDLYSIAWMEDS++HNL+TE+L QQY+LVK RT
Sbjct: 238 SNAEESSWGYYCPGDKPPPPPEYSTCLGDLYSIAWMEDSEVHNLQTESLQQQYKLVKNRT 297
Query: 302 ASYNSYGSHVMQYGDIGLSKNNLFTYLGTNPANDNYTFVDEN----SLRPASKAVNQRDA 357
S YGSHVM+YGDIGLSKN+L+ YLGTNPANDN +FVDE LR S AVNQRDA
Sbjct: 298 IS-EPYGSHVMEYGDIGLSKNDLYQYLGTNPANDNNSFVDETENSLKLRTPSAAVNQRDA 356
Query: 358 DLLHFWDK 365
DL+HFW+K
Sbjct: 357 DLIHFWEK 364
>sp|Q39119|VPEG_ARATH Vacuolar-processing enzyme gamma-isozyme OS=Arabidopsis thaliana
GN=At4g32940 PE=2 SV=2
Length = 494
Score = 542 bits (1396), Expect = e-153, Method: Compositional matrix adjust.
Identities = 269/338 (79%), Positives = 298/338 (88%), Gaps = 6/338 (1%)
Query: 29 DILKLPSEAYRFFHNGGGGAKVNDDDDSVGTRWAVLLAGSNGFWNYRHQADICHAYQLLR 88
D++KLPS+A RFF NDDD + GTRWAVL+AGS+G+WNYRHQADICHAYQLLR
Sbjct: 33 DVIKLPSQASRFFR-----PAENDDDSNSGTRWAVLVAGSSGYWNYRHQADICHAYQLLR 87
Query: 89 KGGLKDENIIVFMYDDIAFNEENPRPGVIINHPHGDDVYKGVPKDYTGEDVTVENFFAVI 148
KGGLK+ENI+VFMYDDIA N ENPRPG IIN PHG DVY+GVPKDYTG+DV V+N FAVI
Sbjct: 88 KGGLKEENIVVFMYDDIANNYENPRPGTIINSPHGKDVYQGVPKDYTGDDVNVDNLFAVI 147
Query: 149 LGNKTALTGGSGKVVDSGPNDHIFIFYSDHGGPGVLGMPTSRYIYADELIDVLKKKHASG 208
LG+KTA+ GGSGKVVDSGPNDHIFIFYSDHGGPGVLGMPTS Y+YA++L DVLKKKHA G
Sbjct: 148 LGDKTAVKGGSGKVVDSGPNDHIFIFYSDHGGPGVLGMPTSPYLYANDLNDVLKKKHALG 207
Query: 209 NYKSLVFYLEACESGSIFEGLLPEGLNIYATTASNAEESSWGTYCPGEIPGPPPEYSTCL 268
YKSLVFYLEACESGSIFEGLLPEGLNIYATTASNAEESSWGTYCPGE P PPPEY TCL
Sbjct: 208 TYKSLVFYLEACESGSIFEGLLPEGLNIYATTASNAEESSWGTYCPGEEPSPPPEYETCL 267
Query: 269 GDLYSIAWMEDSDIHNLRTETLHQQYELVKTRTASYN-SYGSHVMQYGDIGLSKNNLFTY 327
GDLYS+AWMEDS +HNL+TETLHQQYELVK RTA SYGSHVMQYGD+G+SK+NL Y
Sbjct: 268 GDLYSVAWMEDSGMHNLQTETLHQQYELVKRRTAPVGYSYGSHVMQYGDVGISKDNLDLY 327
Query: 328 LGTNPANDNYTFVDENSLRPASKAVNQRDADLLHFWDK 365
+GTNPANDN+TF D NSL+P S+ NQRDADL+HFW+K
Sbjct: 328 MGTNPANDNFTFADANSLKPPSRVTNQRDADLVHFWEK 365
>sp|P49042|VPE_RICCO Vacuolar-processing enzyme OS=Ricinus communis PE=1 SV=1
Length = 497
Score = 455 bits (1171), Expect = e-127, Method: Compositional matrix adjust.
Identities = 212/315 (67%), Positives = 257/315 (81%), Gaps = 3/315 (0%)
Query: 52 DDDDSVGTRWAVLLAGSNGFWNYRHQADICHAYQLLRKGGLKDENIIVFMYDDIAFNEEN 111
DDDD +GTRWAVL+AGS GF NYRHQAD+CHAYQLLRKGGLK+ENIIVFMYDDIA NE N
Sbjct: 54 DDDDQLGTRWAVLVAGSMGFGNYRHQADVCHAYQLLRKGGLKEENIIVFMYDDIAKNELN 113
Query: 112 PRPGVIINHPHGDDVYKGVPKDYTGEDVTVENFFAVILGNKTALTGGSGKVVDSGPNDHI 171
PRPGVIINHP G+DVY GVPKDYTGE VT +N +AV+LG+K+A+ GGSGKVVDS PND I
Sbjct: 114 PRPGVIINHPQGEDVYAGVPKDYTGEHVTAKNLYAVLLGDKSAVQGGSGKVVDSKPNDRI 173
Query: 172 FIFYSDHGGPGVLGMPTSRYIYADELIDVLKKKHASGNYKSLVFYLEACESGSIFEGLLP 231
F++YSDHGGPGVLGMP Y+YA + I+VLKKKHA+G YK +V Y+EACESGSIFEG++P
Sbjct: 174 FLYYSDHGGPGVLGMPNLPYLYAMDFIEVLKKKHAAGGYKKMVIYVEACESGSIFEGIMP 233
Query: 232 EGLNIYATTASNAEESSWGTYCPGEIPGPPPEYSTCLGDLYSIAWMEDSDIHNLRTETLH 291
+ ++IY TTASNA+ESSWGTYCPG P PPPE++TCLGDLYS+AWMEDS+ HNL+ ET+
Sbjct: 234 KDVDIYVTTASNAQESSWGTYCPGMEPSPPPEFTTCLGDLYSVAWMEDSESHNLKKETVK 293
Query: 292 QQYELVKTRTASYNSY--GSHVMQYGDIGLSKNNLFTYLGTNPANDNYTFVDENSLRPAS 349
QQY VK RT++YN+Y GSHVMQYG+ + + L+ + G +PA+ N+ + L
Sbjct: 294 QQYSSVKARTSNYNTYAAGSHVMQYGNQSIKADKLYLFQGFDPASVNFP-PNNAHLNAPM 352
Query: 350 KAVNQRDADLLHFWD 364
+ VNQRDA+L W
Sbjct: 353 EVVNQRDAELHFMWQ 367
>sp|P49045|VPE_SOYBN Vacuolar-processing enzyme OS=Glycine max PE=2 SV=1
Length = 495
Score = 452 bits (1163), Expect = e-126, Method: Compositional matrix adjust.
Identities = 214/337 (63%), Positives = 261/337 (77%), Gaps = 15/337 (4%)
Query: 30 ILKLPSEAYRFFHNGGGGAKVNDDDDSVGTRWAVLLAGSNGFWNYRHQADICHAYQLLRK 89
++KLP+E V+ D D VGTRWAVL+AGSNG+ NYRHQAD+CHAYQLL K
Sbjct: 42 VIKLPTEP------------VDADSDEVGTRWAVLVAGSNGYGNYRHQADVCHAYQLLIK 89
Query: 90 GGLKDENIIVFMYDDIAFNEENPRPGVIINHPHGDDVYKGVPKDYTGEDVTVENFFAVIL 149
GGLK+ENI+VFMYDDIA NE NPR GVIINHP G+D+Y GVPKDYTG++VT EN FAVIL
Sbjct: 90 GGLKEENIVVFMYDDIATNELNPRHGVIINHPEGEDLYAGVPKDYTGDNVTTENLFAVIL 149
Query: 150 GNKTALTGGSGKVVDSGPNDHIFIFYSDHGGPGVLGMPTSRYIYADELIDVLKKKHASGN 209
G+K+ L GGSGKV++S P D IFI+YSDHGGPG+LGMP Y+YA + IDVLKKKHASG+
Sbjct: 150 GDKSKLKGGSGKVINSKPEDRIFIYYSDHGGPGILGMPNMPYLYAMDFIDVLKKKHASGS 209
Query: 210 YKSLVFYLEACESGSIFEGLLPEGLNIYATTASNAEESSWGTYCPGEIPGPPPEYSTCLG 269
YK +V Y+EACESGS+FEG++P+ LNIY TTASNA+E+SWGTYCPG P PPPEY TCLG
Sbjct: 210 YKEMVIYVEACESGSVFEGIMPKDLNIYVTTASNAQENSWGTYCPGMDPSPPPEYITCLG 269
Query: 270 DLYSIAWMEDSDIHNLRTETLHQQYELVKTRTASYNSY--GSHVMQYGDIGLSKNNLFTY 327
DLYS+AWMEDS+ HNL+ E++ QQY+ VK RT+++N+Y GSHVMQYGD ++ L+ Y
Sbjct: 270 DLYSVAWMEDSEAHNLKRESVKQQYKSVKQRTSNFNNYAMGSHVMQYGDTNITAEKLYLY 329
Query: 328 LGTNPANDNYTFVDENSLRPASKAVNQRDADLLHFWD 364
G +PA N+ L + VNQRDA+L W
Sbjct: 330 QGFDPATVNFP-PQNGRLETKMEVVNQRDAELFLLWQ 365
>sp|P49046|LEGU_CANEN Legumain OS=Canavalia ensiformis PE=1 SV=1
Length = 475
Score = 450 bits (1158), Expect = e-126, Method: Compositional matrix adjust.
Identities = 219/360 (60%), Positives = 272/360 (75%), Gaps = 19/360 (5%)
Query: 8 VLITLLVALAGIADG-SRDIAGDILKLPSEAYRFFHNGGGGAKVNDDDDSVGTRWAVLLA 66
V++ ++++L G A +R +++LP+E DD VGTRWAVL+A
Sbjct: 2 VMMLVMLSLHGTAARLNRREWDSVIQLPTEPV---------------DDEVGTRWAVLVA 46
Query: 67 GSNGFWNYRHQADICHAYQLLRKGGLKDENIIVFMYDDIAFNEENPRPGVIINHPHGDDV 126
GSNG+ NYRHQAD+CHAYQLL KGG+K+ENI+VFMYDDIA+N NPRPGVIINHP G DV
Sbjct: 47 GSNGYGNYRHQADVCHAYQLLIKGGVKEENIVVFMYDDIAYNAMNPRPGVIINHPQGPDV 106
Query: 127 YKGVPKDYTGEDVTVENFFAVILGNKTALTGGSGKVVDSGPNDHIFIFYSDHGGPGVLGM 186
Y GVPKDYTGEDVT EN +AVILG+K+ + GGSGKV++S P D IFIFYSDHGGPGVLGM
Sbjct: 107 YAGVPKDYTGEDVTPENLYAVILGDKSKVKGGSGKVINSNPEDRIFIFYSDHGGPGVLGM 166
Query: 187 PTSRYIYADELIDVLKKKHASGNYKSLVFYLEACESGSIFEGLLPEGLNIYATTASNAEE 246
P + ++YA + IDVLKKKHASG YK +V Y+EACESGSIFEG++P+ LNIY TTASNA+E
Sbjct: 167 PNAPFVYAMDFIDVLKKKHASGGYKEMVIYIEACESGSIFEGIMPKDLNIYVTTASNAQE 226
Query: 247 SSWGTYCPGEIPGPPPEYSTCLGDLYSIAWMEDSDIHNLRTETLHQQYELVKTRTASYNS 306
+S+GTYCPG P PP EY TCLGDLYS++WMEDS+ HNL+ ET+ QQY+ V+ RT++ NS
Sbjct: 227 NSFGTYCPGMNPPPPEEYVTCLGDLYSVSWMEDSETHNLKRETVQQQYQSVRKRTSNSNS 286
Query: 307 Y--GSHVMQYGDIGLSKNNLFTYLGTNPANDNYTFVDENSLRPASKAVNQRDADLLHFWD 364
Y GSHVMQYGD ++ L+ Y G +PA N+ + N L + VNQRDA+LL W
Sbjct: 287 YRFGSHVMQYGDTNITAEKLYLYHGFDPATVNFPPHNGN-LEAKMEVVNQRDAELLFMWQ 345
>sp|Q39044|VPEB_ARATH Vacuolar-processing enzyme beta-isozyme OS=Arabidopsis thaliana
GN=At1g62710 PE=2 SV=3
Length = 486
Score = 440 bits (1132), Expect = e-123, Method: Compositional matrix adjust.
Identities = 204/316 (64%), Positives = 251/316 (79%), Gaps = 3/316 (0%)
Query: 50 VNDDDDSVGTRWAVLLAGSNGFWNYRHQADICHAYQLLRKGGLKDENIIVFMYDDIAFNE 109
+ D+D VGTRWAVL+AGS+G+ NYRHQAD+CHAYQ+LRKGGLK+ENI+V MYDDIA +
Sbjct: 41 ADQDEDGVGTRWAVLVAGSSGYGNYRHQADVCHAYQILRKGGLKEENIVVLMYDDIANHP 100
Query: 110 ENPRPGVIINHPHGDDVYKGVPKDYTGEDVTVENFFAVILGNKTALTGGSGKVVDSGPND 169
NPRPG +INHP GDDVY GVPKDYTG VT NF+AV+LG++ A+ GGSGKV+ S PND
Sbjct: 101 LNPRPGTLINHPDGDDVYAGVPKDYTGSSVTAANFYAVLLGDQKAVKGGSGKVIASKPND 160
Query: 170 HIFIFYSDHGGPGVLGMPTSRYIYADELIDVLKKKHASGNYKSLVFYLEACESGSIFEGL 229
HIF++Y+DHGGPGVLGMP + +IYA + I+ LKKKHASG YK +V Y+EACESGSIFEG+
Sbjct: 161 HIFVYYADHGGPGVLGMPNTPHIYAADFIETLKKKHASGTYKEMVIYVEACESGSIFEGI 220
Query: 230 LPEGLNIYATTASNAEESSWGTYCPGEIPGPPPEYSTCLGDLYSIAWMEDSDIHNLRTET 289
+P+ LNIY TTASNA+ESS+GTYCPG P PP EY TCLGDLYS+AWMEDS+ HNL+ ET
Sbjct: 221 MPKDLNIYVTTASNAQESSYGTYCPGMNPSPPSEYITCLGDLYSVAWMEDSETHNLKKET 280
Query: 290 LHQQYELVKTRTASYNSY--GSHVMQYGDIGLSKNNLFTYLGTNPANDNYTFVDENSLRP 347
+ QQY VK RT++YN+Y GSHVM+YG+ + L+ Y G +PA N ++E ++
Sbjct: 281 IKQQYHTVKMRTSNYNTYSGGSHVMEYGNNSIKSEKLYLYQGFDPATVNLP-LNELPVKS 339
Query: 348 ASKAVNQRDADLLHFW 363
VNQRDADLL W
Sbjct: 340 KIGVVNQRDADLLFLW 355
>sp|O24326|VPE2_PHAVU Vacuolar-processing enzyme OS=Phaseolus vulgaris PE=2 SV=1
Length = 493
Score = 435 bits (1118), Expect = e-121, Method: Compositional matrix adjust.
Identities = 213/358 (59%), Positives = 270/358 (75%), Gaps = 18/358 (5%)
Query: 9 LITLLVALAGIADGSRDIAGDILKLPSEAYRFFHNGGGGAKVNDDDDSVGTRWAVLLAGS 68
++ +++ + G +D ++KLP++ +V+ + D VGTRWAVL+AGS
Sbjct: 22 MLVMVMRIQGTNGKEQD---SVIKLPTQ------------EVDAESDEVGTRWAVLVAGS 66
Query: 69 NGFWNYRHQADICHAYQLLRKGGLKDENIIVFMYDDIAFNEENPRPGVIINHPHGDDVYK 128
NG+ NYRHQAD+CHAYQLL KGG+K+ENI+VFMYDDIA +E NPRPGVIIN+P G DVY
Sbjct: 67 NGYGNYRHQADVCHAYQLLIKGGVKEENIVVFMYDDIATHELNPRPGVIINNPQGPDVYA 126
Query: 129 GVPKDYTGEDVTVENFFAVILGNKTALTGGSGKVVDSGPNDHIFIFYSDHGGPGVLGMPT 188
GVPKDYTGE VT NFFAV+LG+K+ + GGSGKV++S P D IF++YSDHGGPGVLGMP
Sbjct: 127 GVPKDYTGESVTSHNFFAVLLGDKSKVKGGSGKVINSKPEDRIFVYYSDHGGPGVLGMPN 186
Query: 189 SRYIYADELIDVLKKKHASGNYKSLVFYLEACESGSIFEGLLPEGLNIYATTASNAEESS 248
Y+YA + IDVLKKKHASG YK +V Y+EACESGSIFEG++P+ LNIY TTASNA+E+S
Sbjct: 187 MPYLYAMDFIDVLKKKHASGGYKEMVIYVEACESGSIFEGIMPKDLNIYVTTASNAQENS 246
Query: 249 WGTYCPGEIPGPPPEYSTCLGDLYSIAWMEDSDIHNLRTETLHQQYELVKTRTASYNSY- 307
WGTYCPG P PPPEY TCLGDLYS+AWMEDS+ HNL+ E++ QQY+ VK RT+++ +Y
Sbjct: 247 WGTYCPGMYPPPPPEYITCLGDLYSVAWMEDSESHNLKKESVEQQYQSVKQRTSNFEAYA 306
Query: 308 -GSHVMQYGDIGLSKNNLFTYLGTNPANDNYTFVDENSLRPASKAVNQRDADLLHFWD 364
GSHVMQYGD ++ L+ Y G +PA N+ L+ + VNQRDA+LL W
Sbjct: 307 MGSHVMQYGDANMTAEKLYLYHGFDPATVNFP-PHNGRLKSKMEVVNQRDAELLFMWQ 363
>sp|Q9R0J8|LGMN_RAT Legumain OS=Rattus norvegicus GN=Lgmn PE=2 SV=1
Length = 435
Score = 292 bits (747), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 147/288 (51%), Positives = 196/288 (68%), Gaps = 16/288 (5%)
Query: 44 GGGGAKVN-DDDDSVGTRWAVLLAGSNGFWNYRHQADICHAYQLLRKGGLKDENIIVFMY 102
G G + DD + G W V++AGSNG++NYRHQAD CHAYQ++ + G+ DE IIV MY
Sbjct: 14 GAGAVHIGVDDPEDGGKHWVVIVAGSNGWYNYRHQADACHAYQIIHRNGIPDEQIIVMMY 73
Query: 103 DDIAFNEENPRPGVIINHPHGDDVYKGVPKDYTGEDVTVENFFAVILGNKTALTG-GSGK 161
DDIA NEENP PGV+IN P+G DVYKGVPKDYTGEDVT ENF AV+ G++ A+ G GSGK
Sbjct: 74 DDIANNEENPTPGVVINRPNGTDVYKGVPKDYTGEDVTPENFLAVLRGDEEAVKGKGSGK 133
Query: 162 VVDSGPNDHIFIFYSDHGGPGVLGMPTSRYIYADELIDVLKKKHASGNYKSLVFYLEACE 221
V+ SGP DH+F++++DHG G+L P ++ +L ++ + Y+ +VFY+EACE
Sbjct: 134 VLKSGPRDHVFVYFTDHGATGILVFPNED-LHVKDLNKTIRYMYEHKMYQKMVFYIEACE 192
Query: 222 SGSIFEGLLPEGLNIYATTASNAEESSWGTYCPGEIPGPPPEYSTCLGDLYSIAWMEDSD 281
SGS+ LP+ +++YATTA+N ESS+ Y E ST LGD YS+ WMEDSD
Sbjct: 193 SGSMMNH-LPDDIDVYATTAANPNESSYACYY-------DEERSTYLGDWYSVNWMEDSD 244
Query: 282 IHNLRTETLHQQYELVKTRTASYNSYGSHVMQYGDIGLSKNNLFTYLG 329
+ +L ETLH+QY LVK+ T + SHVMQYG+ +S + + G
Sbjct: 245 VEDLTKETLHKQYHLVKSHTNT-----SHVMQYGNKSISTMKVMQFQG 287
>sp|Q99538|LGMN_HUMAN Legumain OS=Homo sapiens GN=LGMN PE=1 SV=1
Length = 433
Score = 289 bits (739), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 143/286 (50%), Positives = 196/286 (68%), Gaps = 15/286 (5%)
Query: 45 GGGAKVNDDDDSVGTRWAVLLAGSNGFWNYRHQADICHAYQLLRKGGLKDENIIVFMYDD 104
G GA DD + G W V++AGSNG++NYRHQAD CHAYQ++ + G+ DE I+V MYDD
Sbjct: 14 GIGAVPIDDPEDGGKHWVVIVAGSNGWYNYRHQADACHAYQIIHRNGIPDEQIVVMMYDD 73
Query: 105 IAFNEENPRPGVIINHPHGDDVYKGVPKDYTGEDVTVENFFAVILGNKTALTG-GSGKVV 163
IA++E+NP PG++IN P+G DVY+GVPKDYTGEDVT +NF AV+ G+ A+ G GSGKV+
Sbjct: 74 IAYSEDNPTPGIVINRPNGTDVYQGVPKDYTGEDVTPQNFLAVLRGDAEAVKGIGSGKVL 133
Query: 164 DSGPNDHIFIFYSDHGGPGVLGMPTSRYIYADELIDVLKKKHASGNYKSLVFYLEACESG 223
SGP DH+FI+++DHG G+L P ++ +L + + + Y+ +VFY+EACESG
Sbjct: 134 KSGPQDHVFIYFTDHGSTGILVFPNED-LHVKDLNETIHYMYKHKMYRKMVFYIEACESG 192
Query: 224 SIFEGLLPEGLNIYATTASNAEESSWGTYCPGEIPGPPPEYSTCLGDLYSIAWMEDSDIH 283
S+ LP+ +N+YATTA+N ESS+ Y + ST LGD YS+ WMEDSD+
Sbjct: 193 SMMNH-LPDNINVYATTAANPRESSYACYYDE-------KRSTYLGDWYSVNWMEDSDVE 244
Query: 284 NLRTETLHQQYELVKTRTASYNSYGSHVMQYGDIGLSKNNLFTYLG 329
+L ETLH+QY LVK+ T + SHVMQYG+ +S + + G
Sbjct: 245 DLTKETLHKQYHLVKSHTNT-----SHVMQYGNKTISTMKVMQFQG 285
>sp|Q4R4T8|LGMN_MACFA Legumain OS=Macaca fascicularis GN=LGMN PE=2 SV=1
Length = 433
Score = 288 bits (736), Expect = 6e-77, Method: Compositional matrix adjust.
Identities = 142/286 (49%), Positives = 196/286 (68%), Gaps = 15/286 (5%)
Query: 45 GGGAKVNDDDDSVGTRWAVLLAGSNGFWNYRHQADICHAYQLLRKGGLKDENIIVFMYDD 104
G GA DD + G W V++AGSNG++NYRHQAD CHAYQ++ + G+ DE I+V MYDD
Sbjct: 14 GIGAVPIDDPEDGGKHWVVIVAGSNGWYNYRHQADACHAYQIIHRNGIPDEQIVVMMYDD 73
Query: 105 IAFNEENPRPGVIINHPHGDDVYKGVPKDYTGEDVTVENFFAVILGNKTALTG-GSGKVV 163
IA++E+NP PG++IN P+G DVY+GVPKDYTGEDVT +NF AV+ G+ A+ G GSGKV+
Sbjct: 74 IAYSEDNPTPGIVINRPNGTDVYQGVPKDYTGEDVTPQNFLAVLRGDAEAVKGIGSGKVL 133
Query: 164 DSGPNDHIFIFYSDHGGPGVLGMPTSRYIYADELIDVLKKKHASGNYKSLVFYLEACESG 223
SGP DH+F++++DHG G+L P ++ +L + + + Y+ +VFY+EACESG
Sbjct: 134 KSGPQDHVFVYFTDHGSTGILVFPNED-LHVKDLNETIYYMYKHKMYRKMVFYIEACESG 192
Query: 224 SIFEGLLPEGLNIYATTASNAEESSWGTYCPGEIPGPPPEYSTCLGDLYSIAWMEDSDIH 283
S+ LP+ +N+YATTA+N ESS+ Y + ST LGD YS+ WMEDSD+
Sbjct: 193 SMMNH-LPDNINVYATTAANPRESSYACYYDE-------KRSTYLGDWYSVNWMEDSDVE 244
Query: 284 NLRTETLHQQYELVKTRTASYNSYGSHVMQYGDIGLSKNNLFTYLG 329
+L ETLH+QY LVK+ T + SHVMQYG+ +S + + G
Sbjct: 245 DLTKETLHKQYHLVKSHTNT-----SHVMQYGNKTISTMKVMQFQG 285
>sp|O89017|LGMN_MOUSE Legumain OS=Mus musculus GN=Lgmn PE=1 SV=1
Length = 435
Score = 286 bits (733), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 147/288 (51%), Positives = 194/288 (67%), Gaps = 16/288 (5%)
Query: 44 GGGGAKVN-DDDDSVGTRWAVLLAGSNGFWNYRHQADICHAYQLLRKGGLKDENIIVFMY 102
G G V DD + G W V++AGSNG++NYRHQAD CHAYQ++ + G+ DE IIV MY
Sbjct: 14 GAGAVPVGVDDPEDGGKHWVVIVAGSNGWYNYRHQADACHAYQIIHRNGIPDEQIIVMMY 73
Query: 103 DDIAFNEENPRPGVIINHPHGDDVYKGVPKDYTGEDVTVENFFAVILGNKTALTG-GSGK 161
DDIA +EENP PGV+IN P+G DVYKGV KDYTGEDVT ENF AV+ G+ A+ G GSGK
Sbjct: 74 DDIANSEENPTPGVVINRPNGTDVYKGVLKDYTGEDVTPENFLAVLRGDAEAVKGKGSGK 133
Query: 162 VVDSGPNDHIFIFYSDHGGPGVLGMPTSRYIYADELIDVLKKKHASGNYKSLVFYLEACE 221
V+ SGP DH+FI+++DHG G+L P ++ +L ++ + Y+ +VFY+EACE
Sbjct: 134 VLKSGPRDHVFIYFTDHGATGILVFPNDD-LHVKDLNKTIRYMYEHKMYQKMVFYIEACE 192
Query: 222 SGSIFEGLLPEGLNIYATTASNAEESSWGTYCPGEIPGPPPEYSTCLGDLYSIAWMEDSD 281
SGS+ LP+ +N+YATTA+N +ESS+ Y E T LGD YS+ WMEDSD
Sbjct: 193 SGSMMNH-LPDDINVYATTAANPKESSYACYY-------DEERGTYLGDWYSVNWMEDSD 244
Query: 282 IHNLRTETLHQQYELVKTRTASYNSYGSHVMQYGDIGLSKNNLFTYLG 329
+ +L ETLH+QY LVK+ T + SHVMQYG+ +S + + G
Sbjct: 245 VEDLTKETLHKQYHLVKSHTNT-----SHVMQYGNKSISTMKVMQFQG 287
>sp|Q5R5D9|LGMN_PONAB Legumain OS=Pongo abelii GN=LGMN PE=2 SV=1
Length = 433
Score = 286 bits (732), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 141/284 (49%), Positives = 195/284 (68%), Gaps = 15/284 (5%)
Query: 47 GAKVNDDDDSVGTRWAVLLAGSNGFWNYRHQADICHAYQLLRKGGLKDENIIVFMYDDIA 106
GA DD + G W V++AGSNG++NYRHQAD CHAYQ++ + G+ DE I+V MYDDIA
Sbjct: 16 GAVPIDDPEDGGKHWVVIVAGSNGWYNYRHQADACHAYQIIHRNGIPDEQIVVMMYDDIA 75
Query: 107 FNEENPRPGVIINHPHGDDVYKGVPKDYTGEDVTVENFFAVILGNKTALTG-GSGKVVDS 165
++E+NP PG++IN P+G DVY+GVPKDYTGEDVT +NF AV+ G+ A+ G GSGKV+ S
Sbjct: 76 YSEDNPTPGIVINRPNGTDVYQGVPKDYTGEDVTPQNFLAVLRGDAEAVKGIGSGKVLKS 135
Query: 166 GPNDHIFIFYSDHGGPGVLGMPTSRYIYADELIDVLKKKHASGNYKSLVFYLEACESGSI 225
GP DH+F++ +DHG G+L P ++ ++L + + + Y+ +VFY+EACESGS+
Sbjct: 136 GPQDHVFVYSTDHGSTGILVFPNED-LHVEDLNETIHYMYKHKMYRKMVFYIEACESGSM 194
Query: 226 FEGLLPEGLNIYATTASNAEESSWGTYCPGEIPGPPPEYSTCLGDLYSIAWMEDSDIHNL 285
LP+ +N+YATTA+N ESS+ Y + ST LGD YS+ WMEDSD+ +L
Sbjct: 195 MNH-LPDNINVYATTAANPRESSYACYYDE-------KRSTYLGDWYSVNWMEDSDVEDL 246
Query: 286 RTETLHQQYELVKTRTASYNSYGSHVMQYGDIGLSKNNLFTYLG 329
ETLH+QY LVK+ T + SHVMQYG+ +S + + G
Sbjct: 247 TKETLHKQYHLVKSHTNT-----SHVMQYGNKTISTMKVMQFQG 285
>sp|Q95M12|LGMN_BOVIN Legumain OS=Bos taurus GN=LGMN PE=1 SV=1
Length = 433
Score = 285 bits (728), Expect = 5e-76, Method: Compositional matrix adjust.
Identities = 142/287 (49%), Positives = 196/287 (68%), Gaps = 16/287 (5%)
Query: 44 GGGGAKVNDDDDSVGTRWAVLLAGSNGFWNYRHQADICHAYQLLRKGGLKDENIIVFMYD 103
G G + D +D G W V++AGSNG++NYRHQAD CHAYQ++ + G+ DE IIV MYD
Sbjct: 14 GTGAVPLEDPEDG-GKHWVVIVAGSNGWYNYRHQADACHAYQIVHRNGIPDEQIIVMMYD 72
Query: 104 DIAFNEENPRPGVIINHPHGDDVYKGVPKDYTGEDVTVENFFAVILGNKTALTG-GSGKV 162
DIA +E+NP PG++IN P+G DVY+GV KDYTGEDVT +NF AV+ G+ A+ G GSGKV
Sbjct: 73 DIANSEDNPTPGIVINRPNGSDVYQGVLKDYTGEDVTPKNFLAVLRGDAEAVKGVGSGKV 132
Query: 163 VDSGPNDHIFIFYSDHGGPGVLGMPTSRYIYADELIDVLKKKHASGNYKSLVFYLEACES 222
+ SGP DH+F++++DHG G+L P ++ +L + ++ + Y+ +VFY+EACES
Sbjct: 133 LKSGPRDHVFVYFTDHGATGILVFPNED-LHVKDLNETIRYMYEHKMYQKMVFYIEACES 191
Query: 223 GSIFEGLLPEGLNIYATTASNAEESSWGTYCPGEIPGPPPEYSTCLGDLYSIAWMEDSDI 282
GS+ L P+ +N+YATTA+N ESS+ Y + ST LGD YS+ WMEDSD+
Sbjct: 192 GSMMNHLPPD-INVYATTAANPRESSYACYY-------DEQRSTFLGDWYSVNWMEDSDV 243
Query: 283 HNLRTETLHQQYELVKTRTASYNSYGSHVMQYGDIGLSKNNLFTYLG 329
+L ETLH+QY+LVK+ T + SHVMQYG+ +S L + G
Sbjct: 244 EDLTKETLHKQYQLVKSHTNT-----SHVMQYGNKSISAMKLMQFQG 285
>sp|P42665|HGLB_SCHJA Hemoglobinase OS=Schistosoma japonicum GN=HAEM PE=2 SV=1
Length = 423
Score = 255 bits (652), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 141/300 (47%), Positives = 180/300 (60%), Gaps = 19/300 (6%)
Query: 60 RWAVLLAGSNGFWNYRHQADICHAYQLLRKGGLKDENIIVFMYDDIAFNEENPRPGVIIN 119
+WAVL+AGSNGF NYRHQAD+CHAY +L G+K E+II FMYDDIA N+ENP PG I N
Sbjct: 31 KWAVLVAGSNGFENYRHQADVCHAYHVLLSKGVKPEHIITFMYDDIAHNKENPFPGKIFN 90
Query: 120 HPHGDDVYKGVPKDYTGEDVTVENFFAVILGNKTALTGGSGKVVDSGPNDHIFIFYSDHG 179
D YKGV DY G+ V + F V+ G+K A GKV+ SG ND +FI+++DHG
Sbjct: 91 DYRHKDYYKGVVIDYKGKKVNPKTFLQVLKGDKRA----GGKVLKSGKNDDVFIYFTDHG 146
Query: 180 GPGVLGMPTSRYIYADELIDVLKKKHASGNYKSLVFYLEACESGSIFEGLLPEGLNIYAT 239
PG+L P ++A I+ LK Y LV Y+EACESGS+F GLLP +NIYAT
Sbjct: 147 APGILAFPDDD-LHAKPFINTLKYLRQHRRYSKLVIYVEACESGSMFAGLLPTDINIYAT 205
Query: 240 TASNAEESSWGTYCPGEIPGPPPEYSTCLGDLYSIAWMEDSDIHNLRTETLHQQYELVKT 299
TA+ +ESS+ T+C P S+CL DLYS W+ DS+ H L TL QQY+ VK
Sbjct: 206 TAARPDESSYATFCDD------PRISSCLADLYSYDWIVDSEKHQLTQRTLDQQYKEVKF 259
Query: 300 RTASYNSYGSHVMQYGDIGLSKNNLFTYLGTNPANDNYTFVDENSLRPASKAVNQRDADL 359
T SHV +YGD + K L + G+ T DE ++P ++ RD L
Sbjct: 260 ETNL-----SHVQRYGDKKMGKLYLSEFQGSRKKAS--TEHDEPPMKPKD-SIPSRDIPL 311
>sp|P09841|HGLB_SCHMA Hemoglobinase OS=Schistosoma mansoni PE=1 SV=3
Length = 429
Score = 251 bits (640), Expect = 7e-66, Method: Compositional matrix adjust.
Identities = 136/311 (43%), Positives = 190/311 (61%), Gaps = 19/311 (6%)
Query: 49 KVNDDDDSVGTRWAVLLAGSNGFWNYRHQADICHAYQLLRKGGLKDENIIVFMYDDIAFN 108
+V+D+ S +WAVL+AGSNG+ NYRHQAD+CHAY +LR G+K E+II MYDDIA+N
Sbjct: 26 EVSDETVSDNNKWAVLVAGSNGYPNYRHQADVCHAYHVLRSKGIKPEHIITMMYDDIAYN 85
Query: 109 EENPRPGVIINHPHGDDVYKGVPKDYTGEDVTVENFFAVILGNKTALTGGSGKVVDSGPN 168
NP PG + N + D Y+GV DY G++V + F V+ G+K+A GKV+ SG N
Sbjct: 86 LMNPFPGKLFNDYNHKDWYEGVVIDYRGKNVNSKTFLKVLKGDKSA----GGKVLKSGKN 141
Query: 169 DHIFIFYSDHGGPGVLGMPTSRYIYADELIDVLKKKHASGNYKSLVFYLEACESGSIFEG 228
D +FI+++DHG PG++ P +YA E + LK H+ Y LV Y+EA ESGS+F+
Sbjct: 142 DDVFIYFTDHGAPGLIAFPDDE-LYAKEFMSTLKYLHSHKRYSKLVIYIEANESGSMFQQ 200
Query: 229 LLPEGLNIYATTASNAEESSWGTYCPGEIPGPPPEYSTCLGDLYSIAWMEDSDIHNLRTE 288
+LP L+IYATTA+N+ E S+ T+C P +TCL DLYS W+ DS H+L
Sbjct: 201 ILPSNLSIYATTAANSTECSYSTFCGD------PTITTCLADLYSYNWIVDSQTHHLTQR 254
Query: 289 TLHQQYELVKTRTASYNSYGSHVMQYGDIGLSKNNLFTYLGTNPANDNYTFVDENSLRPA 348
TL QQY+ VK T SHV +YGD + K + + G+ + + DE ++P
Sbjct: 255 TLDQQYKEVKRETDL-----SHVQRYGDTRMGKLYVSEFQGSRDKSSSEN--DEPPMKP- 306
Query: 349 SKAVNQRDADL 359
++ RD L
Sbjct: 307 RHSIASRDIPL 317
>sp|Q8T4E1|GPI8_DROME Putative GPI-anchor transamidase OS=Drosophila melanogaster
GN=CG4406 PE=2 SV=1
Length = 355
Score = 110 bits (275), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 67/195 (34%), Positives = 104/195 (53%), Gaps = 15/195 (7%)
Query: 61 WAVLLAGSNGFWNYRHQADICHAYQLLRKGGLKDENIIVFMYDDIAFNEENPRPGVIINH 120
WAVL+ S ++NYRH A++ Y+ +++ G+ D II+ + DD+A N NPRPG + N+
Sbjct: 47 WAVLVDASRFWFNYRHVANVLSIYRSVKRLGIPDSQIILMIADDMACNARNPRPGQVYNN 106
Query: 121 P------HGDDVYKGVPKDYTGEDVTVENFFAVILGNKTALTGGSGKVV-DSGPNDHIFI 173
+GDD V DY G +VTVENF ++ G T S K++ D+G N + I
Sbjct: 107 ANQHINVYGDD----VEVDYRGYEVTVENFVRLLTGRTQNGTARSKKLLSDAGSN--VLI 160
Query: 174 FYSDHGGPGVLGMPTSRYIYADELIDVLKKKHASGNYKSLVFYLEACESGSIFEGLLPEG 233
+ + HGG G L S I + EL D +++ Y L F ++ C++ S++E
Sbjct: 161 YLTGHGGDGFLKFQDSEEITSQELADGIQQMWEKKRYNELFFMVDTCQAASLYEKFTSP- 219
Query: 234 LNIYATTASNAEESS 248
N+ A +S E S
Sbjct: 220 -NVLAVASSLVGEDS 233
>sp|P49018|GPI8_YEAST GPI-anchor transamidase OS=Saccharomyces cerevisiae (strain ATCC
204508 / S288c) GN=GPI8 PE=1 SV=1
Length = 411
Score = 109 bits (272), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 60/199 (30%), Positives = 107/199 (53%), Gaps = 5/199 (2%)
Query: 61 WAVLLAGSNGFWNYRHQADICHAYQLLRKGGLKDENIIVFMYDDIAFNEENPRPGVIINH 120
WAVL++ S ++NYRH A++ Y+ +++ G+ D II+ + DD+A N N PG + N+
Sbjct: 39 WAVLVSTSRFWFNYRHMANVLSMYRTVKRLGIPDSQIILMLSDDVACNSRNLFPGSVFNN 98
Query: 121 P-HGDDVY-KGVPKDYTGEDVTVENFFAVILGNKTALTGGSGKVVDSGPNDHIFIFYSDH 178
H D+Y V DY G +VTVENF ++ T S +++ + N +IFI+ + H
Sbjct: 99 KDHAIDLYGDSVEVDYRGYEVTVENFIRLLTDRWTEDHPKSKRLL-TDENSNIFIYMTGH 157
Query: 179 GGPGVLGMPTSRYIYADELIDVLKKKHASGNYKSLVFYLEACESGSIFEGLLPEGLNIYA 238
GG L + I ++++ D ++ + Y + F ++ C++ +++ NI A
Sbjct: 158 GGDDFLKFQDAEEIASEDIADAFQQMYEKKRYNEIFFMIDTCQANTMYSKFYSP--NILA 215
Query: 239 TTASNAEESSWGTYCPGEI 257
+S +ESS+ + EI
Sbjct: 216 VGSSEMDESSYSHHSDVEI 234
>sp|Q5R6L8|GPI8_PONAB GPI-anchor transamidase OS=Pongo abelii GN=PIGK PE=2 SV=1
Length = 395
Score = 108 bits (269), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 79/284 (27%), Positives = 126/284 (44%), Gaps = 41/284 (14%)
Query: 1 MTRLASGVLITLLVALAGIADGSRDIAGDILKLPSEAYRFFHNGGGGAKVNDDDDSVGTR 60
++R AS + LL++ +A + +A +FF +G
Sbjct: 7 LSRAASTLAAVLLLSFGSVAAS---------HIEDQAEQFFRSGHTN------------N 45
Query: 61 WAVLLAGSNGFWNYRHQADICHAYQLLRKGGLKDENIIVFMYDDIAFNEENPRPGVIINH 120
WAVL+ S ++NYRH A+ Y+ +++ G+ D +I++ + DD+A N NP+P + +H
Sbjct: 46 WAVLVCTSRFWFNYRHVANTLSVYRSVKRLGIPDSHIVLMLADDMACNPRNPKPATVFSH 105
Query: 121 P------HGDDVYKGVPKDYTGEDVTVENFFAVILGNKTALTGGSGKVVDSGPNDHIFIF 174
+GDD V DY +VTVENF V+ G T S +++ S +I I+
Sbjct: 106 KNMELNVYGDD----VEVDYRSYEVTVENFLRVLTGRIPPSTPRSKRLL-SDDRSNILIY 160
Query: 175 YSDHGGPGVLGMPTSRYIYADELIDVLKKKHASGNYKSLVFYLEACESGSIFEGLLPEGL 234
+ HGG G L S I EL D ++ Y L+F ++ C+ S++E
Sbjct: 161 MTGHGGNGFLKFQDSEEITNIELADAFEQMWQKRRYNELLFIIDTCQGASMYERFYSP-- 218
Query: 235 NIYATTASNAEESSWGTYCPGEIPGPPPEYSTCLGDLYSIAWME 278
NI A +S E S P P L D Y+ +E
Sbjct: 219 NIMALASSQVGEDSLSH-------QPDPAIGVHLMDRYTFYVLE 255
>sp|Q9CXY9|GPI8_MOUSE GPI-anchor transamidase OS=Mus musculus GN=Pigk PE=2 SV=2
Length = 395
Score = 107 bits (267), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 78/281 (27%), Positives = 123/281 (43%), Gaps = 32/281 (11%)
Query: 4 LASGVLITLLVALAGIADGSRDIAGDILKLPSEAYRFFHNGGGGAKVNDDDDSVGTRWAV 63
+A+ +TL VA + + +A +FF +G WAV
Sbjct: 1 MAAPCFLTLRVATLAALALLSLGSSAAGHIEDQAEQFFRSGHTN------------NWAV 48
Query: 64 LLAGSNGFWNYRHQADICHAYQLLRKGGLKDENIIVFMYDDIAFNEENPRPGVIINHP-- 121
L+ S ++NYRH A+ Y+ +++ G+ D +I++ + DD+A N NP+P + +H
Sbjct: 49 LVCTSRFWFNYRHVANTLSVYRSVKRLGIPDSHIVLMLADDMACNARNPKPATVFSHKNM 108
Query: 122 ----HGDDVYKGVPKDYTGEDVTVENFFAVILGNKTALTGGSGKVVDSGPNDHIFIFYSD 177
+GDD V DY +VTVENF V+ G T S +++ S +I I+ +
Sbjct: 109 ELNVYGDD----VEVDYRSYEVTVENFLRVLTGRVPPSTPRSKRLL-SDDRSNILIYMTG 163
Query: 178 HGGPGVLGMPTSRYIYADELIDVLKKKHASGNYKSLVFYLEACESGSIFEGLLPEGLNIY 237
HGG G L S I EL D ++ Y L+F ++ C+ S++E NI
Sbjct: 164 HGGNGFLKFQDSEEITNIELADAFEQMWQKRRYNELLFIIDTCQGASMYERFY--SPNIM 221
Query: 238 ATTASNAEESSWGTYCPGEIPGPPPEYSTCLGDLYSIAWME 278
A +S E S P P L D Y+ +E
Sbjct: 222 ALASSQVGEDSLSH-------QPDPAIGVHLMDRYTFYVLE 255
>sp|Q92643|GPI8_HUMAN GPI-anchor transamidase OS=Homo sapiens GN=PIGK PE=1 SV=2
Length = 395
Score = 107 bits (266), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 78/284 (27%), Positives = 126/284 (44%), Gaps = 41/284 (14%)
Query: 1 MTRLASGVLITLLVALAGIADGSRDIAGDILKLPSEAYRFFHNGGGGAKVNDDDDSVGTR 60
++R A+ + LL++ +A + +A +FF +G
Sbjct: 7 LSRAATVLATVLLLSFGSVAAS---------HIEDQAEQFFRSGHTN------------N 45
Query: 61 WAVLLAGSNGFWNYRHQADICHAYQLLRKGGLKDENIIVFMYDDIAFNEENPRPGVIINH 120
WAVL+ S ++NYRH A+ Y+ +++ G+ D +I++ + DD+A N NP+P + +H
Sbjct: 46 WAVLVCTSRFWFNYRHVANTLSVYRSVKRLGIPDSHIVLMLADDMACNPRNPKPATVFSH 105
Query: 121 P------HGDDVYKGVPKDYTGEDVTVENFFAVILGNKTALTGGSGKVVDSGPNDHIFIF 174
+GDD V DY +VTVENF V+ G T S +++ S +I I+
Sbjct: 106 KNMELNVYGDD----VEVDYRSYEVTVENFLRVLTGRIPPSTPRSKRLL-SDDRSNILIY 160
Query: 175 YSDHGGPGVLGMPTSRYIYADELIDVLKKKHASGNYKSLVFYLEACESGSIFEGLLPEGL 234
+ HGG G L S I EL D ++ Y L+F ++ C+ S++E
Sbjct: 161 MTGHGGNGFLKFQDSEEITNIELADAFEQMWQKRRYNELLFIIDTCQGASMYERFYSP-- 218
Query: 235 NIYATTASNAEESSWGTYCPGEIPGPPPEYSTCLGDLYSIAWME 278
NI A +S E S P P L D Y+ +E
Sbjct: 219 NIMALASSQVGEDSLSH-------QPDPAIGVHLMDRYTFYVLE 255
>sp|Q3MHZ7|GPI8_BOVIN GPI-anchor transamidase OS=Bos taurus GN=PIGK PE=2 SV=1
Length = 395
Score = 105 bits (262), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 69/224 (30%), Positives = 107/224 (47%), Gaps = 20/224 (8%)
Query: 61 WAVLLAGSNGFWNYRHQADICHAYQLLRKGGLKDENIIVFMYDDIAFNEENPRPGVIINH 120
WAVL+ S ++NYRH A+ Y+ +++ G+ D +I++ + DD+A N NP+P + +H
Sbjct: 46 WAVLVCTSRFWFNYRHVANTLSVYRSVKRLGIPDSHIVLMLADDMACNPRNPKPATVYSH 105
Query: 121 P------HGDDVYKGVPKDYTGEDVTVENFFAVILGNKTALTGGSGKVVDSGPNDHIFIF 174
+GDD V DY +VTVENF V+ G + T S +++ S +I I+
Sbjct: 106 KNMELNVYGDD----VEVDYRSYEVTVENFLRVLTGRIPSSTPRSKRLL-SDDRSNILIY 160
Query: 175 YSDHGGPGVLGMPTSRYIYADELIDVLKKKHASGNYKSLVFYLEACESGSIFEGLLPEGL 234
+ HGG G L S I EL D ++ Y L+F ++ C+ S++E
Sbjct: 161 MTGHGGNGFLKFQDSEEITNIELADAFEQMWQKRRYNELLFIIDTCQGASMYERFY--SP 218
Query: 235 NIYATTASNAEESSWGTYCPGEIPGPPPEYSTCLGDLYSIAWME 278
NI A +S E S P P L D Y+ +E
Sbjct: 219 NIMALASSQVGEDSLSH-------QPDPAVGVHLMDRYTFYVLE 255
>sp|Q4KRV1|GPI8_PIG GPI-anchor transamidase OS=Sus scrofa GN=PIGK PE=2 SV=1
Length = 395
Score = 102 bits (254), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 69/224 (30%), Positives = 105/224 (46%), Gaps = 20/224 (8%)
Query: 61 WAVLLAGSNGFWNYRHQADICHAYQLLRKGGLKDENIIVFMYDDIAFNEENPRPGVIINH 120
WAVL+ S ++NYRH A+ Y+ +++ G+ D +I++ + DD+A N NP+P + +H
Sbjct: 46 WAVLVCTSRFWFNYRHVANTLSVYRSVKRLGIPDSHIVLMLADDMACNPRNPKPATVYSH 105
Query: 121 P------HGDDVYKGVPKDYTGEDVTVENFFAVILGNKTALTGGSGKVVDSGPNDHIFIF 174
+GDD V DY VTVENF V+ G T S +++ S +I I+
Sbjct: 106 KNMELNVYGDD----VEVDYRSYVVTVENFLRVLTGRIPPSTPRSKRLL-SDDRSNILIY 160
Query: 175 YSDHGGPGVLGMPTSRYIYADELIDVLKKKHASGNYKSLVFYLEACESGSIFEGLLPEGL 234
+ HGG G L S I EL D ++ Y L+F ++ C+ S++E
Sbjct: 161 MTGHGGNGFLKFQDSEEITNIELADAFEQMWQKRRYNELLFIIDTCQGASMYERFYSP-- 218
Query: 235 NIYATTASNAEESSWGTYCPGEIPGPPPEYSTCLGDLYSIAWME 278
NI A +S E S P P L D Y+ +E
Sbjct: 219 NIMALASSQVGEDSLSH-------QPDPAIGVHLMDRYTFYVLE 255
>sp|P49048|GPI8_CAEEL Putative GPI-anchor transamidase OS=Caenorhabditis elegans
GN=T05E11.6 PE=1 SV=3
Length = 319
Score = 99.4 bits (246), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 58/211 (27%), Positives = 108/211 (51%), Gaps = 6/211 (2%)
Query: 43 NGGGGAKVNDDDDSVG--TRWAVLLAGSNGFWNYRHQADICHAYQLLRKGGLKDENIIVF 100
N G K+++ D+ G WAVL+ S ++NYRH +++ Y +++ G+ D NII+
Sbjct: 19 NTGLQLKIDELFDTPGHTNNWAVLVCTSKFWFNYRHVSNVLALYHSIKRLGIPDSNIIMM 78
Query: 101 MYDDIAFNEENPRPGVIINHPHGDDVY-KGVPKDYTGEDVTVENFFAVILGNKTALTGGS 159
+ +D+ N NPRPG + G ++Y V DY GE+VTVE+F V+ G T S
Sbjct: 79 LAEDVPCNSRNPRPGTVYAARAGTNLYGSDVEVDYRGEEVTVESFIRVLTGRHHPATPRS 138
Query: 160 GKVVDSGPNDHIFIFYSDHGGPGVLGMPTSRYIYADELIDVLKKKHASGNYKSLVFYLEA 219
+++ + ++ I+ + HGG + S + +L ++ Y ++ ++
Sbjct: 139 KRLL-TDHQSNVLIYLTGHGGDSFMKFQDSEELTNVDLAYAIQTMFEDNRYHEMLVIADS 197
Query: 220 CESGSIFEGLLPEGLNIYATTASNAEESSWG 250
C S S++E + + N+ + ++S E S+
Sbjct: 198 CRSASMYEWI--DSPNVLSLSSSLTHEESYS 226
>sp|Q9USP5|GPI8_SCHPO GPI-anchor transamidase OS=Schizosaccharomyces pombe (strain 972 /
ATCC 24843) GN=gpi8 PE=2 SV=1
Length = 380
Score = 92.4 bits (228), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 54/199 (27%), Positives = 104/199 (52%), Gaps = 5/199 (2%)
Query: 61 WAVLLAGSNGFWNYRHQADICHAYQLLRKGGLKDENIIVFMYDDIAFNEENPRPGVII-N 119
WAVL++ S ++NYRH A++ Y+ +++ G+ D II+ + DD A N N PG + N
Sbjct: 27 WAVLISTSRFWFNYRHTANVLGIYRSVKRLGIPDSQIILMIADDYACNSRNLFPGTVFDN 86
Query: 120 HPHGDDVY-KGVPKDYTGEDVTVENFFAVILGNKTALTGGSGKVVDSGPNDHIFIFYSDH 178
D+Y + + DY G +VTVE F ++ T S +++ + +I I+ + H
Sbjct: 87 ADRALDLYGEEIEIDYKGYEVTVEAFIRLLTERVPENTPASKRLL-TNERSNILIYMTGH 145
Query: 179 GGPGVLGMPTSRYIYADELIDVLKKKHASGNYKSLVFYLEACESGSIFEGLLPEGLNIYA 238
GG G + + + +++L D +++ H Y ++F ++ C++ S++ + N+ A
Sbjct: 146 GGDGFIKFQDAEELSSEDLADAIEQIHQHKRYNEILFMVDTCQANSLYTKIYSP--NVLA 203
Query: 239 TTASNAEESSWGTYCPGEI 257
+S SS+ + +I
Sbjct: 204 IGSSEVGTSSYSHHADIDI 222
>sp|P80530|HGLB2_FASHE Hemoglobinase-like protein 2 (Fragment) OS=Fasciola hepatica PE=1
SV=1
Length = 22
Score = 34.3 bits (77), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 14/20 (70%), Positives = 17/20 (85%)
Query: 61 WAVLLAGSNGFWNYRHQADI 80
WAVL+AGS+G NYRH AD+
Sbjct: 3 WAVLVAGSDGLPNYRHHADV 22
>sp|Q75B43|MCA1_ASHGO Metacaspase-1 OS=Ashbya gossypii (strain ATCC 10895 / CBS 109.51 /
FGSC 9923 / NRRL Y-1056) GN=MCA1 PE=3 SV=1
Length = 452
Score = 33.9 bits (76), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 31/104 (29%), Positives = 45/104 (43%), Gaps = 23/104 (22%)
Query: 162 VVDSGPNDHIFIFYSDHGGPG----------------VLGMPTSRYIYADELIDVLKKKH 205
V + PND +F+ YS HGG + T+ +I DE+ D+L K
Sbjct: 222 VQGAQPNDSLFLHYSGHGGETEDLDGDEQDGKDSTLYPVDFETNGHIVDDEIHDILVKPL 281
Query: 206 ASGNYKSLVFYLEACESGSIFEGLLPEGLNIYATTASNAEESSW 249
A G L ++AC SGS + LP +Y+T E + W
Sbjct: 282 APG--VRLTALIDACHSGSALD--LPY---MYSTKGIIKEPNVW 318
>sp|P80527|HGLB1_FASHE Hemoglobinase-like protein 1 (Fragment) OS=Fasciola hepatica PE=1
SV=1
Length = 20
Score = 33.5 bits (75), Expect = 2.5, Method: Composition-based stats.
Identities = 14/18 (77%), Positives = 16/18 (88%)
Query: 61 WAVLLAGSNGFWNYRHQA 78
WAVL+AGSNG+ NYRH A
Sbjct: 3 WAVLVAGSNGWPNYRHHA 20
>sp|Q9Z3D6|PMP12_CHLPN Probable outer membrane protein pmp12 OS=Chlamydia pneumoniae
GN=pmp12 PE=2 SV=1
Length = 514
Score = 32.7 bits (73), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 22/71 (30%), Positives = 37/71 (52%), Gaps = 7/71 (9%)
Query: 99 VFMYDDIAFNEENPRPGVIINHPHGDDVYKGVPKDYTG-----EDVTVENFFAVILGNKT 153
V+ YD I+ +E + ++IN P G + Y+G ++G +V EN + IL + T
Sbjct: 287 VYFYDPISHSESHKITDLVINAPEGKETYEGT-ISFSGLCLDDHEVCAENLTSTILQDVT 345
Query: 154 ALTGGSGKVVD 164
L GG+ + D
Sbjct: 346 -LAGGTLSLSD 355
>sp|Q06H07|RPOC2_DRIGR DNA-directed RNA polymerase subunit beta'' OS=Drimys granadensis
GN=rpoC2 PE=3 SV=1
Length = 1381
Score = 32.3 bits (72), Expect = 6.6, Method: Compositional matrix adjust.
Identities = 20/54 (37%), Positives = 25/54 (46%), Gaps = 1/54 (1%)
Query: 68 SNGFWNYRHQADICHAYQLLRKGGLKDENIIVFMYDDIAFNEENPRPGVIINHP 121
SNG WN+ + A I H L ++ II F YD E PR G+ I P
Sbjct: 553 SNGHWNFIYPA-ILHENSYLLAKRRRNRFIIPFQYDQEREKELMPRSGISIEIP 605
>sp|Q08601|MCA1_YEAST Metacaspase-1 OS=Saccharomyces cerevisiae (strain ATCC 204508 /
S288c) GN=MCA1 PE=1 SV=2
Length = 432
Score = 32.0 bits (71), Expect = 7.7, Method: Compositional matrix adjust.
Identities = 28/104 (26%), Positives = 44/104 (42%), Gaps = 23/104 (22%)
Query: 162 VVDSGPNDHIFIFYSDHGGPG----------------VLGMPTSRYIYADELIDVLKKKH 205
V D+ PND +F+ YS HGG + T I DE+ D++ K
Sbjct: 204 VKDAQPNDSLFLHYSGHGGQTEDLDGDEEDGMDDVIYPVDFETQGPIIDDEMHDIMVKPL 263
Query: 206 ASGNYKSLVFYLEACESGSIFEGLLPEGLNIYATTASNAEESSW 249
G + +F ++C SG++ + LP Y+T E + W
Sbjct: 264 QQGVRLTALF--DSCHSGTVLD--LPY---TYSTKGIIKEPNIW 300
>sp|A6ZP43|MCA1_YEAS7 Metacaspase-1 OS=Saccharomyces cerevisiae (strain YJM789) GN=MCA1
PE=3 SV=2
Length = 432
Score = 32.0 bits (71), Expect = 7.7, Method: Compositional matrix adjust.
Identities = 28/104 (26%), Positives = 44/104 (42%), Gaps = 23/104 (22%)
Query: 162 VVDSGPNDHIFIFYSDHGGPG----------------VLGMPTSRYIYADELIDVLKKKH 205
V D+ PND +F+ YS HGG + T I DE+ D++ K
Sbjct: 204 VKDAQPNDSLFLHYSGHGGQTEDLDGDEEDGMDDVIYPVDFETQGPIIDDEMHDIMVKPL 263
Query: 206 ASGNYKSLVFYLEACESGSIFEGLLPEGLNIYATTASNAEESSW 249
G + +F ++C SG++ + LP Y+T E + W
Sbjct: 264 QQGVRLTALF--DSCHSGTVLD--LPY---TYSTKGIIKEPNIW 300
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.317 0.138 0.423
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 163,586,811
Number of Sequences: 539616
Number of extensions: 7998231
Number of successful extensions: 15648
Number of sequences better than 100.0: 35
Number of HSP's better than 100.0 without gapping: 29
Number of HSP's successfully gapped in prelim test: 6
Number of HSP's that attempted gapping in prelim test: 15545
Number of HSP's gapped (non-prelim): 35
length of query: 372
length of database: 191,569,459
effective HSP length: 119
effective length of query: 253
effective length of database: 127,355,155
effective search space: 32220854215
effective search space used: 32220854215
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 62 (28.5 bits)