BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 037212
(551 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|Q9FZP1|HPSE3_ARATH Heparanase-like protein 3 OS=Arabidopsis thaliana GN=At5g34940 PE=2
SV=2
Length = 536
Score = 649 bits (1673), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 305/522 (58%), Positives = 389/522 (74%), Gaps = 9/522 (1%)
Query: 30 TASGNVIE-GTVFINGTVAIAKTDDNFICATLDWWPPDKCDYGTCSWGRASLLNLDLGNP 88
T S V E GTVF+ G A+ D++FICATLDWWPP+KCDYG+CSW AS+LNLDL N
Sbjct: 22 TVSSAVEENGTVFVYGRAAVGTIDEDFICATLDWWPPEKCDYGSCSWDHASILNLDLNNV 81
Query: 89 ILLKAVKAFSPLKIRMGGTLQDKAVYESKGIDEPCTPFVKNNSELFGFSEGCLTMRRWDE 148
IL A+KAF+PLKIR+GGTLQD +YE+ +PC PF KN+S LFG+++GCL MRRWDE
Sbjct: 82 ILQNAIKAFAPLKIRIGGTLQDIVIYETPDSKQPCLPFTKNSSILFGYTQGCLPMRRWDE 141
Query: 149 LNIFFRRAGAVVTFGLNALRGRIIQSDGSARGAWDSSNAESLMRYTVNKGYKIYGWELGN 208
LN FFR+ G V FGLNAL GR I+S+G A GAW+ +NAES +R+T Y I GWELGN
Sbjct: 142 LNAFFRKTGTKVIFGLNALSGRSIKSNGEAIGAWNYTNAESFIRFTAENNYTIDGWELGN 201
Query: 209 ELSGNGVGARIGADQYASDVDRLQSIVQNIYKGFKIKPLIIAPGGFFDANWFKEFIDKTP 268
EL G+GVGAR+GA+QYA D L++IV +YK PL+I PGGFF+ +WF E+++K
Sbjct: 202 ELCGSGVGARVGANQYAIDTINLRNIVNRVYKNVSPMPLVIGPGGFFEVDWFTEYLNKAE 261
Query: 269 NSLQVVTHHIYNLGPGVDDRLIDKILDPSFLNSGSEPFSSLQSILRSSATSAVAWVGEAG 328
NSL T HIY+LGPGVD+ LI+KIL+PS+L+ ++ F SL++I+++S+T AVAWVGE+G
Sbjct: 262 NSLNATTRHIYDLGPGVDEHLIEKILNPSYLDQEAKSFRSLKNIIKNSSTKAVAWVGESG 321
Query: 329 GAYNSGHNLVTNAFVFSFWYLDQLGMASIYGTKTYCRQTLIGGNYGLLNTNTFVPNPGYY 388
GAYNSG NLV+NAFV+SFWYLDQLGMAS+Y TKTYCRQ+LIGGNYGLLNT F PNP YY
Sbjct: 322 GAYNSGRNLVSNAFVYSFWYLDQLGMASLYDTKTYCRQSLIGGNYGLLNTTNFTPNPDYY 381
Query: 389 SALLWHRLMGSTVLSTSLSETTKIRAYAHCSKQSQGTTLLLINLDGNTTARVRVTISREN 448
SAL+W +LMG L T+ S T KIR+Y HC++QS+G T+LL+NLD TT +V +
Sbjct: 382 SALIWRQLMGRKALFTTFSGTKKIRSYTHCARQSKGITVLLMNLDNTTTVVAKVEL---- 437
Query: 449 LAGNGALVVQQEQQNQRKRFAKISQDTKTNGKM-REEYHLTARDGDLHSQTMLLNGEILA 507
N + ++ + + + A NG + REEYHLTA+DG+LHSQTMLLNG L
Sbjct: 438 ---NNSFSLRHTKHMKSYKRASSQLFGGPNGVIQREEYHLTAKDGNLHSQTMLLNGNALQ 494
Query: 508 VNSSEDFPPLEPIYVSETAPIIVAPFSIVFAEIPSSVLPACA 549
VNS D PP+EPI+++ T PI +AP+SIVF + + V+PACA
Sbjct: 495 VNSMGDLPPIEPIHINSTEPITIAPYSIVFVHMRNVVVPACA 536
>sp|Q8L608|HPSE2_ARATH Heparanase-like protein 2 OS=Arabidopsis thaliana GN=At5g61250 PE=2
SV=1
Length = 539
Score = 543 bits (1399), Expect = e-153, Method: Compositional matrix adjust.
Identities = 263/532 (49%), Positives = 359/532 (67%), Gaps = 17/532 (3%)
Query: 27 PSRTASGNVIEGTVFINGTVAIAKTDDNFICATLDWWPPDKCDYGTCSWGRASLLNLDLG 86
P T N+ T+ I+G+ IA+TD+NFICATLDWWPP+KC+Y C WG ASL+NL+L
Sbjct: 16 PPVTFGSNMERTTLVIDGSRRIAETDENFICATLDWWPPEKCNYDQCPWGYASLINLNLA 75
Query: 87 NPILLKAVKAFSPLKIRMGGTLQDKAVYESKGIDEPCTPFVKNNSELFGFSEGCLTMRRW 146
+P+L KA++AF L+IR+GG+LQD+ +Y+ + PCT F K + LFGFSEGCL M+RW
Sbjct: 76 SPLLAKAIQAFRTLRIRIGGSLQDQVIYDVGDLKTPCTQFKKTDDGLFGFSEGCLYMKRW 135
Query: 147 DELNIFFRRAGAVVTFGLNALRGRIIQSDGSARGAWDSSNAESLMRYTVNKGYKIYGWEL 206
DE+N FF GA+VTFGLNAL GR + + G WD +N + M YTV+KGY I WE
Sbjct: 136 DEVNHFFNATGAIVTFGLNALHGRNKLNGTAWGGDWDHTNTQDFMNYTVSKGYAIDSWEF 195
Query: 207 GNELSGNGVGARIGADQYASDVDRLQSIVQNIYKGFKIKPLIIAPGGFFDANWFKEFIDK 266
GNELSG+G+ A + + Y D+ L+++++N+YK + KPL++APGGFF+ W+ E +
Sbjct: 196 GNELSGSGIWASVSVELYGKDLIVLKNVIKNVYKNSRTKPLVVAPGGFFEEQWYSELLRL 255
Query: 267 T-PNSLQVVTHHIYNLGPGVDDRLIDKILDPSFLNSGSEPFSSLQSILRSSATSAVAWVG 325
+ P L V+THHIYNLGPG D +L++KILDP++L+ SE F+++ ++ A AWVG
Sbjct: 256 SGPGVLDVLTHHIYNLGPGNDPKLVNKILDPNYLSGISELFANVNQTIQEHGPWAAAWVG 315
Query: 326 EAGGAYNSGHNLVTNAFVFSFWYLDQLGMASIYGTKTYCRQTLIGGNYGLLNTNTFVPNP 385
EAGGA+NSG V+ F+ SFWYLDQLG++S + TK YCRQ L+GG YGLL TFVPNP
Sbjct: 316 EAGGAFNSGGRQVSETFINSFWYLDQLGISSKHNTKVYCRQALVGGFYGLLEKETFVPNP 375
Query: 386 GYYSALLWHRLMGSTVLSTSLSETTKIRAYAHCSKQSQGTTLLLINLDGNTTARVRVTIS 445
YYSALLWHRLMG +L + + +RAY HCSK+ G T+LLINL +TT V V+
Sbjct: 376 DYYSALLWHRLMGKGILGVQTTASEYLRAYVHCSKRRAGITILLINLSKHTTFTVAVS-- 433
Query: 446 RENLAGNGALVVQQEQQNQRKRF-----AKIS--QDTKTNGKM-REEYHLTARDGDLHSQ 497
NG VV Q + +RK F +K+S + ++G + REEYHL+ +DGDL S+
Sbjct: 434 ------NGVKVVLQAESMKRKSFLETIKSKVSWVGNKASDGYLNREEYHLSPKDGDLRSK 487
Query: 498 TMLLNGEILAVNSSEDFPPLEPIYVSETAPIIVAPFSIVFAEIPSSVLPACA 549
MLLNG+ L ++ D P LEP+ +P+ + P SI F +P+ PAC+
Sbjct: 488 IMLLNGKPLVPTATGDIPKLEPVRHGVKSPVYINPLSISFIVLPTFDAPACS 539
>sp|Q9FF10|HPSE1_ARATH Heparanase-like protein 1 OS=Arabidopsis thaliana GN=At5g07830 PE=2
SV=1
Length = 543
Score = 515 bits (1326), Expect = e-145, Method: Compositional matrix adjust.
Identities = 253/533 (47%), Positives = 341/533 (63%), Gaps = 18/533 (3%)
Query: 27 PSRTASGNVIEGTVFINGTVAIAKTDDNFICATLDWWPPDKCDYGTCSWGRASLLNLDLG 86
P +T + + ++ I G + +TD+NF+CATLDWWP DKC+Y C WG +S++N+DL
Sbjct: 19 PEKTMAQEMKRASIVIQGARRVCETDENFVCATLDWWPHDKCNYDQCPWGYSSVINMDLT 78
Query: 87 NPILLKAVKAFSPLKIRMGGTLQDKAVYESKGIDEPCTPFVKNNSELFGFSEGCLTMRRW 146
P+L KA+KAF PL+IR+GG+LQD+ +Y+ + PC PF K NS LFGFS+GCL M+RW
Sbjct: 79 RPLLTKAIKAFKPLRIRIGGSLQDQVIYDVGNLKTPCRPFQKMNSGLFGFSKGCLHMKRW 138
Query: 147 DELNIFFRRAGAVVTFGLNALRGRIIQSDGSARGAWDSSNAESLMRYTVNKGYKIYGWEL 206
DELN F GAVVTFGLNALRGR + GAWD N + + YTV+KGY I WE
Sbjct: 139 DELNSFLTATGAVVTFGLNALRGRHKLRGKAWGGAWDHINTQDFLNYTVSKGYVIDSWEF 198
Query: 207 GNELSGNGVGARIGADQYASDVDRLQSIVQNIYKGFKI-KPLIIAPGGFFDANWFKEFID 265
GNELSG+GVGA + A+ Y D+ L+ ++ +YK + KP+++APGGF++ W+ + ++
Sbjct: 199 GNELSGSGVGASVSAELYGKDLIVLKDVINKVYKNSWLHKPILVAPGGFYEQQWYTKLLE 258
Query: 266 KT-PNSLQVVTHHIYNLGPGVDDRLIDKILDPSFLNSGSEPFSSLQSILRSSATSAVAWV 324
+ P+ + VVTHHIYNLG G D L+ KI+DPS+L+ S+ F + ++ A WV
Sbjct: 259 ISGPSVVDVVTHHIYNLGSGNDPALVKKIMDPSYLSQVSKTFKDVNQTIQEHGPWASPWV 318
Query: 325 GEAGGAYNSGHNLVTNAFVFSFWYLDQLGMASIYGTKTYCRQTLIGGNYGLLNTNTFVPN 384
GE+GGAYNSG V++ F+ SFWYLDQLGM++ + TK YCRQTL+GG YGLL TFVPN
Sbjct: 319 GESGGAYNSGGRHVSDTFIDSFWYLDQLGMSARHNTKVYCRQTLVGGFYGLLEKGTFVPN 378
Query: 385 PGYYSALLWHRLMGSTVLSTSLSETTKIRAYAHCSKQSQGTTLLLINLDGNTTARVRVTI 444
P YYSALLWHRLMG VL+ ++R YAHCSK G TLLLINL + V V+
Sbjct: 379 PDYYSALLWHRLMGKGVLAVQTDGPPQLRVYAHCSKGRAGVTLLLINLSNQSDFTVSVS- 437
Query: 445 SRENLAGNGALVVQQEQQNQRKR--------FAKISQDTKTNGKMREEYHLTARDGDLHS 496
NG VV + ++K F+ I REEYHLT +G L S
Sbjct: 438 -------NGINVVLNAESRKKKSLLDTLKRPFSWIGSKASDGYLNREEYHLTPENGVLRS 490
Query: 497 QTMLLNGEILAVNSSEDFPPLEPIYVSETAPIIVAPFSIVFAEIPSSVLPACA 549
+TM+LNG+ L ++ D P LEP+ S +P+ V P S+ F +P+ AC+
Sbjct: 491 KTMVLNGKSLKPTATGDIPSLEPVLRSVNSPLNVLPLSMSFIVLPNFDASACS 543
>sp|Q9LRC8|BAGLU_SCUBA Baicalin-beta-D-glucuronidase OS=Scutellaria baicalensis GN=SGUS
PE=1 SV=1
Length = 527
Score = 469 bits (1206), Expect = e-131, Method: Compositional matrix adjust.
Identities = 235/519 (45%), Positives = 321/519 (61%), Gaps = 49/519 (9%)
Query: 48 IAKTDDNFICATLDWWPPDKCDYGTCSWGRASLLNLDLGNPILLKAVKAFSPLKIRMGGT 107
+A+TD+N++CATLD WPP KC+YG C WG++S LNLDL N I+ AVK F+PLK+R GGT
Sbjct: 38 VAQTDENYVCATLDLWPPTKCNYGNCPWGKSSFLNLDLNNNIIRNAVKEFAPLKLRFGGT 97
Query: 108 LQDKAVYESKGIDEPC-TPFVKNNSELFGFSEGCLTMRRWDELNIFFRRAGAVVTFGLNA 166
LQD+ VY++ DEPC + F N + + FS CL++ RWDE+N F G+ FGLNA
Sbjct: 98 LQDRLVYQTSR-DEPCDSTFYNNTNLILDFSHACLSLDRWDEINQFILETGSEAVFGLNA 156
Query: 167 LRGRIIQSDG------------SARGAWDSSNAESLMRYTVNKGYK-IYGWELGNELSGN 213
LRG+ ++ G +A G WD SN++ L+ Y++ KGYK I GW LGNEL G+
Sbjct: 157 LRGKTVEIKGIIKDGQYLGETTTAVGEWDYSNSKFLIEYSLKKGYKHIRGWTLGNELGGH 216
Query: 214 GVGARIGADQYASDVDRLQSIVQNIYKGFKIKPLIIAPGGFFDANWFKEFIDKTPNSLQV 273
+ + + YA+D +L +V+ IY+ PLIIAPG FD W+ EFID+TP L V
Sbjct: 217 TLFIGVSPEDYANDAKKLHELVKEIYQDQGTMPLIIAPGAIFDLEWYTEFIDRTP-ELHV 275
Query: 274 VTHHIYNLGPGVDDRLIDKILDPSFLNSGSEP-FSSLQSILRSSATSAVAWVGEAGGAYN 332
THH+YNLG G DD L D +L SF + ++ + LQ I+ T AVAW+GEAGGA+N
Sbjct: 276 ATHHMYNLGSGGDDALKDVLLTASFFDEATKSMYEGLQKIVNRPGTKAVAWIGEAGGAFN 335
Query: 333 SGHNLVTNAFVFSFWYLDQLGMASIYGTKTYCRQTLIGGNYGLLNTNTFVPNPGYYSALL 392
SG + ++N F+ FWYL+ LG +++ TKT+CRQTL GGNYGLL T T++PNP YYSALL
Sbjct: 336 SGQDGISNTFINGFWYLNMLGYSALLDTKTFCRQTLTGGNYGLLQTGTYIPNPDYYSALL 395
Query: 393 WHRLMGSTVLSTSLSETTKIRAYAHCSKQSQGTTLLLINLDGNTTARVRVTISRENLAGN 452
WHRLMGS VL T + T + YAHC+K+S G T+L++N DG ++
Sbjct: 396 WHRLMGSKVLKTEIVGTKNVYIYAHCAKKSNGITMLVLNHDGESS--------------- 440
Query: 453 GALVVQQEQQNQRKRFAKISQDTKTNGKMREEYHLTARDGDLHSQTMLLNGEILAVNSSE 512
KIS D G REEYHLT + +L S+ + LNGE+L ++ S
Sbjct: 441 ----------------VKISLDPSKYGSKREEYHLTPVNNNLQSRLVKLNGELLHLDPSG 484
Query: 513 DFPPLEPIYVSETAPIIVAPFSIVFAEIPS-SVLPACAQ 550
P L P+ + + VAP+S +F +P ++ AC +
Sbjct: 485 VIPALNPVEKDNSKQLEVAPYSFMFVHLPGPTMFSACEK 523
>sp|Q9Y251|HPSE_HUMAN Heparanase OS=Homo sapiens GN=HPSE PE=1 SV=2
Length = 543
Score = 115 bits (287), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 130/511 (25%), Positives = 211/511 (41%), Gaps = 90/511 (17%)
Query: 85 LGNPILLKAVKAFSPLKIRMGGT--------------LQDKAVYESKGIDEPCT-----P 125
LG+P L + SP +R GGT ++++ ++S+ + C P
Sbjct: 75 LGSPKLRTLARGLSPAYLRFGGTKTDFLIFDPKKESTFEERSYWQSQVNQDICKYGSIPP 134
Query: 126 FVKNNSEL-FGFSEGCLTMRRW--------------DELNIFFRRAGAVVTFGLNALRGR 170
V+ L + + E L + D L F +G + FGLNAL
Sbjct: 135 DVEEKLRLEWPYQEQLLLREHYQKKFKNSTYSRSSVDVLYTFANCSGLDLIFGLNALLR- 193
Query: 171 IIQSDGSARGAWDSSNAESLMRYTVNKGYKIYGWELGNELSG--NGVGARIGADQYASDV 228
+A W+SSNA+ L+ Y +KGY I WELGNE + I Q D
Sbjct: 194 ------TADLQWNSSNAQLLLDYCSSKGYNI-SWELGNEPNSFLKKADIFINGSQLGEDF 246
Query: 229 DRLQSIV-QNIYKGFKIKPLIIAPGGFFDANWFKEFIDKTPNSLQVVTHHIYNLGPGVDD 287
+L ++ ++ +K K+ + A K F+ + VT H Y L
Sbjct: 247 IQLHKLLRKSTFKNAKLYGPDVGQPRRKTAKMLKSFLKAGGEVIDSVTWHHYYLNGRTAT 306
Query: 288 RLIDKILDPSFLNSGSEPFSSLQSILRSSATSAVAWVGEAGGAYNSGHNLVTNAFVFSFW 347
+ + L+P L+ + ++ S+ W+GE AY G L+++ F F
Sbjct: 307 K--EDFLNPDVLDIFISSVQKVFQVVESTRPGKKVWLGETSSAYGGGAPLLSDTFAAGFM 364
Query: 348 YLDQLGMASIYGTKTYCRQTLIG-GNYGLLNTNTFVPNPGYYSALLWHRLMGSTVLSTSL 406
+LD+LG+++ G + RQ G GNY L++ N F P P Y+ +LL+ +L+G+ VL S+
Sbjct: 365 WLDKLGLSARMGIEVVMRQVFFGAGNYHLVDEN-FDPLPDYWLSLLFKKLVGTKVLMASV 423
Query: 407 --SETTKIRAYAHCS-----KQSQGT-TLLLINLDGNTTARVRVTISRENLAGNGALVVQ 458
S+ K+R Y HC+ + +G TL INL N T +R+ N
Sbjct: 424 QGSKRRKLRVYLHCTNTDNPRYKEGDLTLYAINLH-NVTKYLRLPYPFSN---------- 472
Query: 459 QEQQNQRKRFAKISQDTKTNGKMREEYHLTARDGD-LHSQTMLLNGEILAVNSSEDFPPL 517
K ++Y L L S+++ LNG L + + PPL
Sbjct: 473 ---------------------KQVDKYLLRPLGPHGLLSKSVQLNGLTLKMVDDQTLPPL 511
Query: 518 EPIYVSETAPIIVAPFSIVFAEIPSSVLPAC 548
+ + + + FS F I ++ + AC
Sbjct: 512 MEKPLRPGSSLGLPAFSYSFFVIRNAKVAAC 542
>sp|Q90YK5|HPSE_CHICK Heparanase OS=Gallus gallus GN=HPSE PE=1 SV=1
Length = 523
Score = 114 bits (286), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 111/416 (26%), Positives = 181/416 (43%), Gaps = 57/416 (13%)
Query: 147 DELNIFFRRAGAVVTFGLNALRGRIIQSDGSARGAWDSSNAESLMRYTVNKGYKIYGWEL 206
D L+ F +G + FGLNAL R A WDSSNA+ L+ Y + Y I WEL
Sbjct: 150 DILHTFASSSGFRLVFGLNALLRR-------AGLQWDSSNAKQLLGYCAQRSYNI-SWEL 201
Query: 207 GNELSG--NGVGARIGADQYASDVDRLQSIVQN--IYKGFKIKPLIIAPGGFFDANWFKE 262
GNE + G I Q D L+ ++ +Y+ ++ L + + +
Sbjct: 202 GNEPNSFRKKSGICIDGFQLGRDFVHLRQLLSQHPLYRHAELYGLDVGQPRKHTQHLLRS 261
Query: 263 FIDKTPNSLQVVTHHIYNLGPGVDDRLIDKILDPSFLNSGSEPFSSLQSILRSSATSAVA 322
F+ ++ VT H Y + R + L P L+S + + I+ ++
Sbjct: 262 FMKSGGKAIDSVTWHHYYVNGRSATR--EDFLSPEVLDSFATAIHDVLGIVEATVPGKKV 319
Query: 323 WVGEAGGAYNSGHNLVTNAFVFSFWYLDQLGMASIYGTKTYCRQTLIG-GNYGLLNTNTF 381
W+GE G AY G ++N +V F +LD+LG+A+ G RQ G G+Y L++ F
Sbjct: 320 WLGETGSAYGGGAPQLSNTYVAGFMWLDKLGLAARRGIDVVMRQVSFGAGSYHLVDAG-F 378
Query: 382 VPNPGYYSALLWHRLMGSTVLSTSL--SETTKIRAYAHCS-----KQSQG-TTLLLINLD 433
P P Y+ +LL+ RL+G+ VL S+ ++ + R Y HC+ K +G TL +NL
Sbjct: 379 KPLPDYWLSLLYKRLVGTRVLQASVEQADARRPRVYLHCTNPRHPKYREGDVTLFALNLS 438
Query: 434 GNTTARVRVTISRENLAGNGALVVQQEQQNQRKRFAKISQDTKTNGKMREEYHLTARDGD 493
V Q Q ++ ++K ++Y L D
Sbjct: 439 N----------------------VTQSLQLPKQLWSKSV----------DQYLLLPHGKD 466
Query: 494 -LHSQTMLLNGEILAVNSSEDFPPLEPIYVSETAPIIVAPFSIVFAEIPSSVLPAC 548
+ S+ + LNG +L + E P L + ++ + + + FS F I ++ AC
Sbjct: 467 SILSREVQLNGRLLQMVDDETLPALHEMALAPGSTLGLPAFSYGFYVIRNAKAIAC 522
>sp|Q6YGZ1|HPSE_MOUSE Heparanase OS=Mus musculus GN=Hpse PE=1 SV=3
Length = 535
Score = 112 bits (280), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 116/411 (28%), Positives = 179/411 (43%), Gaps = 72/411 (17%)
Query: 85 LGNPILLKAVKAFSPLKIRMGGTLQDKAVYESKG-------------------IDEPCTP 125
LG+P L + SP +R GGT D +++ EP +
Sbjct: 67 LGSPRLRALARGLSPAYLRFGGTKTDFLIFDPDKEPTSEERSYWKSQVNHDICRSEPVSA 126
Query: 126 FVKNNSEL-FGFSEGCLTMRRW--------------DELNIFFRRAGAVVTFGLNALRGR 170
V ++ + F E L ++ D L F + +G + FGLNAL
Sbjct: 127 AVLRKLQVEWPFQELLLLREQYQKEFKNSTYSRSSVDMLYSFAKCSGLDLIFGLNALLR- 185
Query: 171 IIQSDGSARGAWDSSNAESLMRYTVNKGYKIYGWELGNELSGNGVGARIGAD--QYASDV 228
+ W+SSNA+ L+ Y +KGY I WELGNE + A I D Q D
Sbjct: 186 ------TPDLRWNSSNAQLLLDYCSSKGYNI-SWELGNEPNSFWKKAHILIDGLQLGEDF 238
Query: 229 DRLQSIVQ-NIYKGFKIK-PLIIAPGGFFDANWFKEFIDKTPNSLQVVTHHIYNLGPGV- 285
L ++Q + ++ K+ P I P G + F+ + +T H Y L +
Sbjct: 239 VELHKLLQRSAFQNAKLYGPDIGQPRGK-TVKLLRSFLKAGGEVIDSLTWHHYYLNGRIA 297
Query: 286 --DDRLIDKILDPSFLNSGSEPFSSLQSILRSS---ATSAVAWVGEAGGAYNSGHNLVTN 340
+D L +LD L S+Q IL+ + W+GE AY G L++N
Sbjct: 298 TKEDFLSSDVLDTFIL--------SVQKILKVTKEITPGKKVWLGETSSAYGGGAPLLSN 349
Query: 341 AFVFSFWYLDQLGMASIYGTKTYCRQTLIG-GNYGLLNTNTFVPNPGYYSALLWHRLMGS 399
F F +LD+LG+++ G + RQ G GNY L++ N F P P Y+ +LL+ +L+G
Sbjct: 350 TFAAGFMWLDKLGLSAQMGIEVVMRQVFFGAGNYHLVDEN-FEPLPDYWLSLLFKKLVGP 408
Query: 400 TVLSTSLS--ETTKIRAYAHCSK------QSQGTTLLLINLDGNTTARVRV 442
VL + + + +K+R Y HC+ Q TL ++NL N T ++V
Sbjct: 409 RVLLSRVKGPDRSKLRVYLHCTNVYHPRYQEGDLTLYVLNLH-NVTKHLKV 458
>sp|Q9MYY0|HPSE_BOVIN Heparanase OS=Bos taurus GN=HPSE PE=2 SV=2
Length = 545
Score = 109 bits (272), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 112/417 (26%), Positives = 179/417 (42%), Gaps = 60/417 (14%)
Query: 147 DELNIFFRRAGAVVTFGLNALRGRIIQSDGSARGAWDSSNAESLMRYTVNKGYKIYGWEL 206
D L F +G + FG+NAL + +D WDSSNA+ L+ Y +K Y I WEL
Sbjct: 173 DMLYTFASCSGLNLIFGVNAL---LRTTDMH----WDSSNAQLLLDYCSSKNYNI-SWEL 224
Query: 207 GNELSG--NGVGARIGADQYASDVDRLQSIV-QNIYKGFKIKPLIIAPGGFFDANWFKEF 263
GNE + G I Q D + ++ ++ +K K+ I K F
Sbjct: 225 GNEPNSFQRKAGIFINGRQLGEDFIEFRKLLGKSAFKNAKLYGPDIGQPRRNTVKMLKSF 284
Query: 264 IDKTPNSLQVVT-HHIYNLGPGVDDRLIDK--ILDPSFLNSGSEPFSSLQSILRSSATSA 320
+ + VT HH Y V+ R+ K L+P L++ I+
Sbjct: 285 LKAGGEVIDSVTWHHYY-----VNGRIATKEDFLNPDILDTFISSVQKTLRIVEKIRPLK 339
Query: 321 VAWVGEAGGAYNSGHNLVTNAFVFSFWYLDQLGMASIYGTKTYCRQTLIG-GNYGLLNTN 379
W+GE A+ G ++N F F +LD+LG+++ G + RQ L G GNY L++ N
Sbjct: 340 KVWLGETSSAFGGGAPFLSNTFAAGFMWLDKLGLSARMGIEVVMRQVLFGAGNYHLVDGN 399
Query: 380 TFVPNPGYYSALLWHRLMGSTVLSTSLS--ETTKIRAYAHCSKQSQ------GTTLLLIN 431
F P P Y+ +LL+ +L+G+ VL S+ + +K R Y HC+ TL +N
Sbjct: 400 -FEPLPDYWLSLLFKKLVGNKVLMASVKGPDRSKFRVYLHCTNTKHPRYKEGDLTLYALN 458
Query: 432 LDGNTTARVRVTISRENLAGNGALVVQQEQQNQRKRFAKISQDTKTNGKMREEYHLTARD 491
L N T + + N + L+ + + T+G
Sbjct: 459 LH-NVTKHLELPHHLFNKQVDKYLI----------------KPSGTDG------------ 489
Query: 492 GDLHSQTMLLNGEILAVNSSEDFPPLEPIYVSETAPIIVAPFSIVFAEIPSSVLPAC 548
L S+++ LNG+IL + + P L + + + + PFS F I ++ + AC
Sbjct: 490 --LLSKSVQLNGQILKMVDEQTLPALTEKPLHPGSSLGMPPFSYGFFVIRNAKVAAC 544
>sp|Q71RP1|HPSE_RAT Heparanase OS=Rattus norvegicus GN=Hpse PE=2 SV=1
Length = 536
Score = 107 bits (267), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 114/413 (27%), Positives = 181/413 (43%), Gaps = 76/413 (18%)
Query: 85 LGNPILLKAVKAFSPLKIRMGGTLQDKAVYESKGIDEPCTPF-----VKNNSELFG---- 135
LG+P L + SP +R GGT D +++ EP + ++N+++ G
Sbjct: 68 LGSPRLRALARGLSPAYLRFGGTKTDFLIFDPN--KEPTSEERSYWQSQDNNDICGSERV 125
Query: 136 -------------FSEGCLTMRRW--------------DELNIFFRRAGAVVTFGLNALR 168
F E L ++ D L F + + + FGLNAL
Sbjct: 126 SADVLRKLQMEWPFQELLLLREQYQREFKNSTYSRSSVDMLYSFAKCSRLDLIFGLNALL 185
Query: 169 GRIIQSDGSARGAWDSSNAESLMRYTVNKGYKIYGWELGNELSGNGVGARIGAD--QYAS 226
+ W+SSNA+ L+ Y +KGY I WELGNE + A I D Q
Sbjct: 186 R-------TPDLRWNSSNAQLLLNYCSSKGYNI-SWELGNEPNSFWKKAHISIDGLQLGE 237
Query: 227 DVDRLQSIVQ-NIYKGFKIK-PLIIAPGGFFDANWFKEFIDKTPNSLQVVTHHIYNLGPG 284
D L ++Q + ++ K+ P I P G + F+ + +T H Y L
Sbjct: 238 DFVELHKLLQKSAFQNAKLYGPDIGQPRGK-TVKLLRSFLKAGGEVIDSLTWHHYYLNGR 296
Query: 285 V---DDRLIDKILDPSFLNSGSEPFSSLQSILRSS---ATSAVAWVGEAGGAYNSGHNLV 338
V +D L +LD L S+Q IL+ + W+GE AY G L+
Sbjct: 297 VATKEDFLSSDVLDTFIL--------SVQKILKVTKEMTPGKKVWLGETSSAYGGGAPLL 348
Query: 339 TNAFVFSFWYLDQLGMASIYGTKTYCRQTLIG-GNYGLLNTNTFVPNPGYYSALLWHRLM 397
++ F F +LD+LG+++ G + RQ G GNY L++ N F P P Y+ +LL+ +L+
Sbjct: 349 SDTFAAGFMWLDKLGLSAQLGIEVVMRQVFFGAGNYHLVDEN-FEPLPDYWLSLLFKKLV 407
Query: 398 GSTVLSTSLS--ETTKIRAYAHCSK------QSQGTTLLLINLDGNTTARVRV 442
G VL + + + +K+R Y HC+ + TL ++NL N T +++
Sbjct: 408 GPKVLMSRVKGPDRSKLRVYLHCTNVYHPRYREGDLTLYVLNLH-NVTKHLKL 459
>sp|Q8WWQ2|HPSE2_HUMAN Inactive heparanase-2 OS=Homo sapiens GN=HPSE2 PE=1 SV=3
Length = 592
Score = 81.6 bits (200), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 102/433 (23%), Positives = 185/433 (42%), Gaps = 60/433 (13%)
Query: 136 FSEGCLTMRRWDELNIFFRRAGAVVTFGLNALRGRIIQSDGSARGAWDSSNAESLMRYTV 195
+S LT R D+L F +G + F LNALR + +W+SS+A SL++Y+
Sbjct: 197 YSNLILTARSLDKLYNFADCSGLHLIFALNALRR-------NPNNSWNSSSALSLLKYSA 249
Query: 196 NKGYKIYGWELGNELSGNGV--GARIGADQYASDVDRLQSIVQNIY---KGFKIKPLIIA 250
+K Y I WELGNE + G + Q D +L+S++Q I + P I
Sbjct: 250 SKKYNI-SWELGNEPNNYRTMHGRAVNGSQLGKDYIQLKSLLQPIRIYSRASLYGPNIGR 308
Query: 251 PGGFFDANWFKEFIDKTPNSLQVVT-HHIYNLGPGVDDRLIDKILD---PSFLNSGSEPF 306
P A F+ +++ VT H Y +D R++ K++D L++ S+
Sbjct: 309 PRKNVIA-LLDGFMKVAGSTVDAVTWQHCY-----IDGRVV-KVMDFLKTRLLDTLSDQI 361
Query: 307 SSLQSILRSSATSAVAWVGEAGGAYNSGHNLVTNAFVFSFWYLDQLGMASIYGTKTYCRQ 366
+Q ++ + W+ G N +++++ F +L+ LGM + G R
Sbjct: 362 RKIQKVVNTYTPGKKIWLEGVVTTSAGGTNNLSDSYAAGFLWLNTLGMLANQGIDVVIRH 421
Query: 367 TLIGGNYGLLNTNTFVPNPGYYSALLWHRLMGSTVLSTSLSETT-----------KIRAY 415
+ Y L F P P Y+ +LL+ RL+G VL+ ++ K+R Y
Sbjct: 422 SFFDHGYNHLVDQNFNPLPDYWLSLLYKRLIGPKVLAVHVAGLQRKPRPGRVIRDKLRIY 481
Query: 416 AHCSKQSQGTTLLLINLDGNTTARVRVTISRENLAGNGALVVQQEQQNQRKRFAKISQDT 475
AHC+ N + R +T+ ++ + ++ + A +D
Sbjct: 482 AHCT-----------NHHNHNYVRGSITL----------FIINLHRSRKKIKLAGTLRDK 520
Query: 476 KTNGKMREEYHLTARDGDLHSQTMLLNGEILAVNSSEDFPPLEPIYVSETAPIIVAPFSI 535
+ + + Y ++G L S+++ LNG+ L + P L+P + +++ P ++
Sbjct: 521 LVHQYLLQPY---GQEG-LKSKSVQLNGQPLVMVDDGTLPELKPRPLRAGRTLVIPPVTM 576
Query: 536 VFAEIPSSVLPAC 548
F + + AC
Sbjct: 577 GFYVVKNVNALAC 589
>sp|B2RY83|HPSE2_MOUSE Inactive heparanase-2 OS=Mus musculus GN=Hpse2 PE=2 SV=1
Length = 592
Score = 81.3 bits (199), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 102/433 (23%), Positives = 186/433 (42%), Gaps = 60/433 (13%)
Query: 136 FSEGCLTMRRWDELNIFFRRAGAVVTFGLNALRGRIIQSDGSARGAWDSSNAESLMRYTV 195
+S LT R D+L F +G + F LNALR + +W+SS+A SL++Y+
Sbjct: 197 YSNLILTARSLDKLYNFADCSGLHLIFALNALRR-------NPNNSWNSSSALSLLKYSA 249
Query: 196 NKGYKIYGWELGNELSG--NGVGARIGADQYASDVDRLQSIVQNIY---KGFKIKPLIIA 250
+K Y I WELGNE + + G + Q D +L+S++Q I + P I
Sbjct: 250 SKKYNI-SWELGNEPNNYRSIHGRAVNGSQLGKDYIQLKSLLQPIRVYSRASLYGPNIGR 308
Query: 251 PGGFFDANWFKEFIDKTPNSLQVVT-HHIYNLGPGVDDRLIDKILD---PSFLNSGSEPF 306
P A F+ +++ VT H Y +D R++ K++D L++ S+
Sbjct: 309 PRKNVIA-LLDGFMKVAGSTVDAVTWQHCY-----IDGRVV-KVMDFLKTRLLDTLSDQI 361
Query: 307 SSLQSILRSSATSAVAWVGEAGGAYNSGHNLVTNAFVFSFWYLDQLGMASIYGTKTYCRQ 366
+Q ++ + W+ G N +++++ F +L+ LGM + G R
Sbjct: 362 RKIQKVVNTYTPGKKIWLEGVVTTSAGGTNNLSDSYAAGFLWLNTLGMLANQGIDVVIRH 421
Query: 367 TLIGGNYGLLNTNTFVPNPGYYSALLWHRLMGSTVLSTSLSETT-----------KIRAY 415
+ Y L F P P Y+ +LL+ RL+G VL+ ++ K+R Y
Sbjct: 422 SFFDHGYNHLVDQNFNPLPDYWLSLLYKRLIGPKVLAVHVAGLQRKPRPGRVIRDKLRIY 481
Query: 416 AHCSKQSQGTTLLLINLDGNTTARVRVTISRENLAGNGALVVQQEQQNQRKRFAKISQDT 475
AHC+ N + R +T+ ++ + ++ + A +D
Sbjct: 482 AHCT-----------NHHNHNYVRGSITL----------FIINLHRSRKKIKLAGTLRDK 520
Query: 476 KTNGKMREEYHLTARDGDLHSQTMLLNGEILAVNSSEDFPPLEPIYVSETAPIIVAPFSI 535
+ + + Y ++G L S+++ LNG+ L + P L+P + +++ P ++
Sbjct: 521 LVHQYLLQPY---GQEG-LKSKSVQLNGQPLVMVDDGTLPELKPRPLRAGRTLVIPPVTM 576
Query: 536 VFAEIPSSVLPAC 548
F + + AC
Sbjct: 577 GFYVVKNVNALAC 589
>sp|P06531|TRPG_EMENI Anthranilate synthase component 2 OS=Emericella nidulans (strain
FGSC A4 / ATCC 38163 / CBS 112.46 / NRRL 194 / M139)
GN=trpC PE=2 SV=3
Length = 768
Score = 36.6 bits (83), Expect = 0.50, Method: Compositional matrix adjust.
Identities = 30/114 (26%), Positives = 49/114 (42%), Gaps = 16/114 (14%)
Query: 134 FGFSEGCLTMRRWDELNIFFRRAGAVVTFGLNALR-GRIIQSDGSAR----GAWDSSNAE 188
FG E + R + L + AG L+ +R +I++SD R G D N
Sbjct: 666 FGLDEFGIARRAYHTLPLLDSGAGGSGEL-LDQMRVKQILKSDDGLRVILAGGLDPLNVT 724
Query: 189 SLMRYTVNKGYKIYGWELGNELSGNGVGARIGADQYASDVDRLQSIVQNIYKGF 242
+++ GYKI G ++ + + NGV D+D+++S VQ F
Sbjct: 725 EIIKQLDESGYKIVGVDVSSGVETNGV----------QDLDKIRSFVQAAKSAF 768
>sp|Q07Z24|G6PI_SHEFN Glucose-6-phosphate isomerase OS=Shewanella frigidimarina (strain
NCIMB 400) GN=pgi PE=3 SV=1
Length = 545
Score = 34.7 bits (78), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 19/46 (41%), Positives = 29/46 (63%), Gaps = 1/46 (2%)
Query: 199 YKIYGWELGNELSGNGVGARIGADQYASDVDRLQSIVQNIYKGFKI 244
+ +G ELG +L GN V RIGAD A+D+D + + N+Y+ K+
Sbjct: 501 FDQWGVELGKQL-GNDVLERIGADHDATDLDGSSNALVNLYRKGKL 545
>sp|A0L003|G6PI_SHESA Glucose-6-phosphate isomerase OS=Shewanella sp. (strain ANA-3)
GN=pgi PE=3 SV=1
Length = 545
Score = 32.7 bits (73), Expect = 6.6, Method: Compositional matrix adjust.
Identities = 20/46 (43%), Positives = 29/46 (63%), Gaps = 1/46 (2%)
Query: 199 YKIYGWELGNELSGNGVGARIGADQYASDVDRLQSIVQNIYKGFKI 244
+ +G ELG L GN V ARIGA+Q A+ +D + + N+Y+ KI
Sbjct: 501 FDQWGVELGKSL-GNDVLARIGAEQDATALDASSNGLINLYRQGKI 545
>sp|Q0HFX8|G6PI_SHESM Glucose-6-phosphate isomerase OS=Shewanella sp. (strain MR-4)
GN=pgi PE=3 SV=1
Length = 545
Score = 32.7 bits (73), Expect = 7.0, Method: Compositional matrix adjust.
Identities = 20/46 (43%), Positives = 29/46 (63%), Gaps = 1/46 (2%)
Query: 199 YKIYGWELGNELSGNGVGARIGADQYASDVDRLQSIVQNIYKGFKI 244
+ +G ELG L GN V ARIGA+Q A+ +D + + N+Y+ KI
Sbjct: 501 FDQWGVELGKSL-GNDVLARIGAEQDATALDASSNGLINLYRQGKI 545
>sp|Q0HS71|G6PI_SHESR Glucose-6-phosphate isomerase OS=Shewanella sp. (strain MR-7)
GN=pgi PE=3 SV=1
Length = 545
Score = 32.7 bits (73), Expect = 7.1, Method: Compositional matrix adjust.
Identities = 20/46 (43%), Positives = 29/46 (63%), Gaps = 1/46 (2%)
Query: 199 YKIYGWELGNELSGNGVGARIGADQYASDVDRLQSIVQNIYKGFKI 244
+ +G ELG L GN V ARIGA+Q A+ +D + + N+Y+ KI
Sbjct: 501 FDQWGVELGKSL-GNDVLARIGAEQDATALDASSNGLINLYRQGKI 545
>sp|Q7TTW9|SYY_SYNPX Tyrosine--tRNA ligase OS=Synechococcus sp. (strain WH8102) GN=tyrS
PE=3 SV=1
Length = 415
Score = 32.7 bits (73), Expect = 7.6, Method: Compositional matrix adjust.
Identities = 27/85 (31%), Positives = 45/85 (52%), Gaps = 18/85 (21%)
Query: 411 KIRAYAHCSKQSQGTTLLLINLDGNTTARV---------RVTISRENLAGNGALVVQQEQ 461
K+RA+ Q G T +LI G+ TAR+ RV +S+E++A N + ++Q
Sbjct: 69 KLRAF-----QDAGHTAVLII--GDFTARIGDPTGKCATRVQLSKEDVAANASTYLRQLG 121
Query: 462 QNQRKRFAKISQDTKTNGKMREEYH 486
Q+Q K A + D +T G++ Y+
Sbjct: 122 QDQPKETALL--DFETPGRLEVRYN 144
>sp|Q8EBH1|G6PI_SHEON Glucose-6-phosphate isomerase OS=Shewanella oneidensis (strain
MR-1) GN=pgi PE=3 SV=1
Length = 545
Score = 32.3 bits (72), Expect = 9.6, Method: Compositional matrix adjust.
Identities = 20/43 (46%), Positives = 27/43 (62%), Gaps = 1/43 (2%)
Query: 202 YGWELGNELSGNGVGARIGADQYASDVDRLQSIVQNIYKGFKI 244
+G ELG L GN V RIGADQ A+ +D + + N+Y+ KI
Sbjct: 504 WGVELGKTL-GNDVLTRIGADQEATVLDASSNGLINLYRRGKI 545
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.320 0.136 0.417
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 206,696,550
Number of Sequences: 539616
Number of extensions: 8821667
Number of successful extensions: 19829
Number of sequences better than 100.0: 26
Number of HSP's better than 100.0 without gapping: 12
Number of HSP's successfully gapped in prelim test: 14
Number of HSP's that attempted gapping in prelim test: 19783
Number of HSP's gapped (non-prelim): 37
length of query: 551
length of database: 191,569,459
effective HSP length: 123
effective length of query: 428
effective length of database: 125,196,691
effective search space: 53584183748
effective search space used: 53584183748
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 64 (29.3 bits)