BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 008762
(554 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|Q9FZP1|HPSE3_ARATH Heparanase-like protein 3 OS=Arabidopsis thaliana GN=At5g34940 PE=2
SV=2
Length = 536
Score = 739 bits (1909), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 339/513 (66%), Positives = 419/513 (81%), Gaps = 9/513 (1%)
Query: 44 GNVFIDRRSVIGRTDDDFVCATLDWWPPEKCDYGTCSWDRASLLNLDLNSNILLNAVKAF 103
G VF+ R+ +G D+DF+CATLDWWPPEKCDYG+CSWD AS+LNLDLN+ IL NA+KAF
Sbjct: 31 GTVFVYGRAAVGTIDEDFICATLDWWPPEKCDYGSCSWDHASILNLDLNNVILQNAIKAF 90
Query: 104 SPLKIRLGGTLQDKVIYDTEDNRQPCKQFVKNSSEMFGFTQGCLPMHRWDELNAFFKKSG 163
+PLKIR+GGTLQD VIY+T D++QPC F KNSS +FG+TQGCLPM RWDELNAFF+K+G
Sbjct: 91 APLKIRIGGTLQDIVIYETPDSKQPCLPFTKNSSILFGYTQGCLPMRRWDELNAFFRKTG 150
Query: 164 CPCIQFRAKIVFGLNALTGRSIQNDGSVKGAWDYTNAESFISYTVKKNYSIHGWELGNEL 223
K++FGLNAL+GRSI+++G GAW+YTNAESFI +T + NY+I GWELGNEL
Sbjct: 151 -------TKVIFGLNALSGRSIKSNGEAIGAWNYTNAESFIRFTAENNYTIDGWELGNEL 203
Query: 224 CGNGVGTRVAAAQYATDTISLRNVVQKIYTGVDSKPLIIAPGGFFDAKWFKEFLDKSGQS 283
CG+GVG RV A QYA DTI+LRN+V ++Y V PL+I PGGFF+ WF E+L+K+ S
Sbjct: 204 CGSGVGARVGANQYAIDTINLRNIVNRVYKNVSPMPLVIGPGGFFEVDWFTEYLNKAENS 263
Query: 284 LDVATHHIYNLGPGVDQHLVEKILDPLYLDREVDTFSQLENTLKSSATSAVAWVGESGGA 343
L+ T HIY+LGPGVD+HL+EKIL+P YLD+E +F L+N +K+S+T AVAWVGESGGA
Sbjct: 264 LNATTRHIYDLGPGVDEHLIEKILNPSYLDQEAKSFRSLKNIIKNSSTKAVAWVGESGGA 323
Query: 344 YNSGHNLVTNAFVFSFWYLDQLGMAAAHDTKTYCRQSLIGGNYGLLNTTTFVPNPDYYSA 403
YNSG NLV+NAFV+SFWYLDQLGMA+ +DTKTYCRQSLIGGNYGLLNTT F PNPDYYSA
Sbjct: 324 YNSGRNLVSNAFVYSFWYLDQLGMASLYDTKTYCRQSLIGGNYGLLNTTNFTPNPDYYSA 383
Query: 404 LLWHRLMGRNALSTSFSGTKKIRSYAHCAKQSKGLVLLLINLDNSTTVHASVAFNGTLTS 463
L+W +LMGR AL T+FSGTKKIRSY HCA+QSKG+ +LL+NLDN+TTV A V N + +
Sbjct: 384 LIWRQLMGRKALFTTFSGTKKIRSYTHCARQSKGITVLLMNLDNTTTVVAKVELNNSFSL 443
Query: 464 RH-KH-KSLKMKIIKLPQASVGGNEREEYHLTAKDGDLHSQTMLLNGNILSVNSIGDIPT 521
RH KH KS K +L G +REEYHLTAKDG+LHSQTMLLNGN L VNS+GD+P
Sbjct: 444 RHTKHMKSYKRASSQLFGGPNGVIQREEYHLTAKDGNLHSQTMLLNGNALQVNSMGDLPP 503
Query: 522 LEPLRVKSTQPVSVGPFSIVFVHMPHVILPACS 554
+EP+ + ST+P+++ P+SIVFVHM +V++PAC+
Sbjct: 504 IEPIHINSTEPITIAPYSIVFVHMRNVVVPACA 536
>sp|Q8L608|HPSE2_ARATH Heparanase-like protein 2 OS=Arabidopsis thaliana GN=At5g61250 PE=2
SV=1
Length = 539
Score = 524 bits (1349), Expect = e-148, Method: Compositional matrix adjust.
Identities = 258/519 (49%), Positives = 338/519 (65%), Gaps = 18/519 (3%)
Query: 46 VFIDRRSVIGRTDDDFVCATLDWWPPEKCDYGTCSWDRASLLNLDLNSNILLNAVKAFSP 105
+ ID I TD++F+CATLDWWPPEKC+Y C W ASL+NL+L S +L A++AF
Sbjct: 29 LVIDGSRRIAETDENFICATLDWWPPEKCNYDQCPWGYASLINLNLASPLLAKAIQAFRT 88
Query: 106 LKIRLGGTLQDKVIYDTEDNRQPCKQFVKNSSEMFGFTQGCLPMHRWDELNAFFKKSGCP 165
L+IR+GG+LQD+VIYD D + PC QF K +FGF++GCL M RWDE+N FF +G
Sbjct: 89 LRIRIGGSLQDQVIYDVGDLKTPCTQFKKTDDGLFGFSEGCLYMKRWDEVNHFFNATG-- 146
Query: 166 CIQFRAKIVFGLNALTGRSIQNDGSVKGAWDYTNAESFISYTVKKNYSIHGWELGNELCG 225
A + FGLNAL GR+ N + G WD+TN + F++YTV K Y+I WE GNEL G
Sbjct: 147 -----AIVTFGLNALHGRNKLNGTAWGGDWDHTNTQDFMNYTVSKGYAIDSWEFGNELSG 201
Query: 226 NGVGTRVAAAQYATDTISLRNVVQKIYTGVDSKPLIIAPGGFFDAKWFKEFLDKSGQS-L 284
+G+ V+ Y D I L+NV++ +Y +KPL++APGGFF+ +W+ E L SG L
Sbjct: 202 SGIWASVSVELYGKDLIVLKNVIKNVYKNSRTKPLVVAPGGFFEEQWYSELLRLSGPGVL 261
Query: 285 DVATHHIYNLGPGVDQHLVEKILDPLYLDREVDTFSQLENTLKSSATSAVAWVGESGGAY 344
DV THHIYNLGPG D LV KILDP YL + F+ + T++ A AWVGE+GGA+
Sbjct: 262 DVLTHHIYNLGPGNDPKLVNKILDPNYLSGISELFANVNQTIQEHGPWAAAWVGEAGGAF 321
Query: 345 NSGHNLVTNAFVFSFWYLDQLGMAAAHDTKTYCRQSLIGGNYGLLNTTTFVPNPDYYSAL 404
NSG V+ F+ SFWYLDQLG+++ H+TK YCRQ+L+GG YGLL TFVPNPDYYSAL
Sbjct: 322 NSGGRQVSETFINSFWYLDQLGISSKHNTKVYCRQALVGGFYGLLEKETFVPNPDYYSAL 381
Query: 405 LWHRLMGRNALSTSFSGTKKIRSYAHCAKQSKGLVLLLINLDNSTTVHASVAFNG---TL 461
LWHRLMG+ L + ++ +R+Y HC+K+ G+ +LLINL TT +V+ NG L
Sbjct: 382 LWHRLMGKGILGVQTTASEYLRAYVHCSKRRAGITILLINLSKHTTFTVAVS-NGVKVVL 440
Query: 462 TSRHKHKSLKMKIIKLP------QASVGGNEREEYHLTAKDGDLHSQTMLLNGNILSVNS 515
+ + ++ IK +AS G REEYHL+ KDGDL S+ MLLNG L +
Sbjct: 441 QAESMKRKSFLETIKSKVSWVGNKASDGYLNREEYHLSPKDGDLRSKIMLLNGKPLVPTA 500
Query: 516 IGDIPTLEPLRVKSTQPVSVGPFSIVFVHMPHVILPACS 554
GDIP LEP+R PV + P SI F+ +P PACS
Sbjct: 501 TGDIPKLEPVRHGVKSPVYINPLSISFIVLPTFDAPACS 539
>sp|Q9FF10|HPSE1_ARATH Heparanase-like protein 1 OS=Arabidopsis thaliana GN=At5g07830 PE=2
SV=1
Length = 543
Score = 510 bits (1314), Expect = e-143, Method: Compositional matrix adjust.
Identities = 253/520 (48%), Positives = 339/520 (65%), Gaps = 17/520 (3%)
Query: 45 NVFIDRRSVIGRTDDDFVCATLDWWPPEKCDYGTCSWDRASLLNLDLNSNILLNAVKAFS 104
++ I + TD++FVCATLDWWP +KC+Y C W +S++N+DL +L A+KAF
Sbjct: 31 SIVIQGARRVCETDENFVCATLDWWPHDKCNYDQCPWGYSSVINMDLTRPLLTKAIKAFK 90
Query: 105 PLKIRLGGTLQDKVIYDTEDNRQPCKQFVKNSSEMFGFTQGCLPMHRWDELNAFFKKSGC 164
PL+IR+GG+LQD+VIYD + + PC+ F K +S +FGF++GCL M RWDELN+F +G
Sbjct: 91 PLRIRIGGSLQDQVIYDVGNLKTPCRPFQKMNSGLFGFSKGCLHMKRWDELNSFLTATG- 149
Query: 165 PCIQFRAKIVFGLNALTGRSIQNDGSVKGAWDYTNAESFISYTVKKNYSIHGWELGNELC 224
A + FGLNAL GR + GAWD+ N + F++YTV K Y I WE GNEL
Sbjct: 150 ------AVVTFGLNALRGRHKLRGKAWGGAWDHINTQDFLNYTVSKGYVIDSWEFGNELS 203
Query: 225 GNGVGTRVAAAQYATDTISLRNVVQKIYTGV-DSKPLIIAPGGFFDAKWFKEFLDKSGQS 283
G+GVG V+A Y D I L++V+ K+Y KP+++APGGF++ +W+ + L+ SG S
Sbjct: 204 GSGVGASVSAELYGKDLIVLKDVINKVYKNSWLHKPILVAPGGFYEQQWYTKLLEISGPS 263
Query: 284 L-DVATHHIYNLGPGVDQHLVEKILDPLYLDREVDTFSQLENTLKSSATSAVAWVGESGG 342
+ DV THHIYNLG G D LV+KI+DP YL + TF + T++ A WVGESGG
Sbjct: 264 VVDVVTHHIYNLGSGNDPALVKKIMDPSYLSQVSKTFKDVNQTIQEHGPWASPWVGESGG 323
Query: 343 AYNSGHNLVTNAFVFSFWYLDQLGMAAAHDTKTYCRQSLIGGNYGLLNTTTFVPNPDYYS 402
AYNSG V++ F+ SFWYLDQLGM+A H+TK YCRQ+L+GG YGLL TFVPNPDYYS
Sbjct: 324 AYNSGGRHVSDTFIDSFWYLDQLGMSARHNTKVYCRQTLVGGFYGLLEKGTFVPNPDYYS 383
Query: 403 ALLWHRLMGRNALSTSFSGTKKIRSYAHCAKQSKGLVLLLINLDNST--TVHASVAFNGT 460
ALLWHRLMG+ L+ G ++R YAHC+K G+ LLLINL N + TV S N
Sbjct: 384 ALLWHRLMGKGVLAVQTDGPPQLRVYAHCSKGRAGVTLLLINLSNQSDFTVSVSNGINVV 443
Query: 461 LTSRHKHKSLKMKIIKLP------QASVGGNEREEYHLTAKDGDLHSQTMLLNGNILSVN 514
L + + K + +K P +AS G REEYHLT ++G L S+TM+LNG L
Sbjct: 444 LNAESRKKKSLLDTLKRPFSWIGSKASDGYLNREEYHLTPENGVLRSKTMVLNGKSLKPT 503
Query: 515 SIGDIPTLEPLRVKSTQPVSVGPFSIVFVHMPHVILPACS 554
+ GDIP+LEP+ P++V P S+ F+ +P+ ACS
Sbjct: 504 ATGDIPSLEPVLRSVNSPLNVLPLSMSFIVLPNFDASACS 543
>sp|Q9LRC8|BAGLU_SCUBA Baicalin-beta-D-glucuronidase OS=Scutellaria baicalensis GN=SGUS
PE=1 SV=1
Length = 527
Score = 451 bits (1161), Expect = e-126, Method: Compositional matrix adjust.
Identities = 235/573 (41%), Positives = 337/573 (58%), Gaps = 70/573 (12%)
Query: 1 MGSQAWLK---VLLFGFCFWLSSRSSSSSSLSILQAEAAGGAGFVGGNVFIDRRSVIGRT 57
MG Q W K VL F F ++ + I + + +T
Sbjct: 1 MGFQVWQKGLCVLCFSLIFICGVIGEETTIVKI-------------------EENPVAQT 41
Query: 58 DDDFVCATLDWWPPEKCDYGTCSWDRASLLNLDLNSNILLNAVKAFSPLKIRLGGTLQDK 117
D+++VCATLD WPP KC+YG C W ++S LNLDLN+NI+ NAVK F+PLK+R GGTLQD+
Sbjct: 42 DENYVCATLDLWPPTKCNYGNCPWGKSSFLNLDLNNNIIRNAVKEFAPLKLRFGGTLQDR 101
Query: 118 VIYDTEDNRQPCKQ-FVKNSSEMFGFTQGCLPMHRWDELNAFFKKSGCPCIQFRAKIVFG 176
++Y T + +PC F N++ + F+ CL + RWDE+N F ++G ++ VFG
Sbjct: 102 LVYQTSRD-EPCDSTFYNNTNLILDFSHACLSLDRWDEINQFILETG-------SEAVFG 153
Query: 177 LNALTGRSIQNDGSVK------------GAWDYTNAESFISYTVKKNYS-IHGWELGNEL 223
LNAL G++++ G +K G WDY+N++ I Y++KK Y I GW LGNEL
Sbjct: 154 LNALRGKTVEIKGIIKDGQYLGETTTAVGEWDYSNSKFLIEYSLKKGYKHIRGWTLGNEL 213
Query: 224 CGNGVGTRVAAAQYATDTISLRNVVQKIYTGVDSKPLIIAPGGFFDAKWFKEFLDKSGQS 283
G+ + V+ YA D L +V++IY + PLIIAPG FD +W+ EF+D++ +
Sbjct: 214 GGHTLFIGVSPEDYANDAKKLHELVKEIYQDQGTMPLIIAPGAIFDLEWYTEFIDRTPE- 272
Query: 284 LDVATHHIYNLGPGVDQHLVEKILDPLYLDREVDT-FSQLENTLKSSATSAVAWVGESGG 342
L VATHH+YNLG G D L + +L + D + + L+ + T AVAW+GE+GG
Sbjct: 273 LHVATHHMYNLGSGGDDALKDVLLTASFFDEATKSMYEGLQKIVNRPGTKAVAWIGEAGG 332
Query: 343 AYNSGHNLVTNAFVFSFWYLDQLGMAAAHDTKTYCRQSLIGGNYGLLNTTTFVPNPDYYS 402
A+NSG + ++N F+ FWYL+ LG +A DTKT+CRQ+L GGNYGLL T T++PNPDYYS
Sbjct: 333 AFNSGQDGISNTFINGFWYLNMLGYSALLDTKTFCRQTLTGGNYGLLQTGTYIPNPDYYS 392
Query: 403 ALLWHRLMGRNALSTSFSGTKKIRSYAHCAKQSKGLVLLLINLDNSTTVHASVAFNGTLT 462
ALLWHRLMG L T GTK + YAHCAK+S G+ +L++N D ++V S+
Sbjct: 393 ALLWHRLMGSKVLKTEIVGTKNVYIYAHCAKKSNGITMLVLNHDGESSVKISL------- 445
Query: 463 SRHKHKSLKMKIIKLPQASVGGNEREEYHLTAKDGDLHSQTMLLNGNILSVNSIGDIPTL 522
S G++REEYHLT + +L S+ + LNG +L ++ G IP L
Sbjct: 446 ----------------DPSKYGSKREEYHLTPVNNNLQSRLVKLNGELLHLDPSGVIPAL 489
Query: 523 EPLRVKSTQPVSVGPFSIVFVHMP-HVILPACS 554
P+ +++ + V P+S +FVH+P + AC
Sbjct: 490 NPVEKDNSKQLEVAPYSFMFVHLPGPTMFSACE 522
>sp|Q9Y251|HPSE_HUMAN Heparanase OS=Homo sapiens GN=HPSE PE=1 SV=2
Length = 543
Score = 115 bits (288), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 138/510 (27%), Positives = 216/510 (42%), Gaps = 89/510 (17%)
Query: 91 LNSNILLNAVKAFSPLKIRLGGTLQDKVIYD--------------TEDNRQPCK--QFVK 134
L S L + SP +R GGT D +I+D ++ N+ CK
Sbjct: 75 LGSPKLRTLARGLSPAYLRFGGTKTDFLIFDPKKESTFEERSYWQSQVNQDICKYGSIPP 134
Query: 135 NSSEMFGFT-----QGCLPMHRWDEL-NAFFKKSGCPCIQFRAK-----IVFGLNALTGR 183
+ E Q L H + N+ + +S + A ++FGLNAL
Sbjct: 135 DVEEKLRLEWPYQEQLLLREHYQKKFKNSTYSRSSVDVLYTFANCSGLDLIFGLNALLR- 193
Query: 184 SIQNDGSVKGAWDYTNAESFISYTVKKNYSIHGWELGNELCG--NGVGTRVAAAQYATDT 241
+ W+ +NA+ + Y K Y+I WELGNE + +Q D
Sbjct: 194 ------TADLQWNSSNAQLLLDYCSSKGYNI-SWELGNEPNSFLKKADIFINGSQLGEDF 246
Query: 242 ISLRNVVQKIYTGVDSK---PLIIAPGGFFDAKWFKEFLDKSGQSLDVAT-HHIYNLGPG 297
I L +++K T ++K P + P AK K FL G+ +D T HH Y G
Sbjct: 247 IQLHKLLRK-STFKNAKLYGPDVGQPRRK-TAKMLKSFLKAGGEVIDSVTWHHYYLNGRT 304
Query: 298 VDQHLVEKILDPLYLDREVDTFSQLENTLKSSATSAVAWVGESGGAYNSGHNLVTNAFVF 357
+ E L+P LD + + ++ ++S+ W+GE+ AY G L+++ F
Sbjct: 305 ATK---EDFLNPDVLDIFISSVQKVFQVVESTRPGKKVWLGETSSAYGGGAPLLSDTFAA 361
Query: 358 SFWYLDQLGMAAAHDTKTYCRQSLIG-GNYGLLNTTTFVPNPDYYSALLWHRLMGRNALS 416
F +LD+LG++A + RQ G GNY L++ F P PDY+ +LL+ +L+G L
Sbjct: 362 GFMWLDKLGLSARMGIEVVMRQVFFGAGNYHLVD-ENFDPLPDYWLSLLFKKLVGTKVLM 420
Query: 417 TSFSGTK--KIRSYAHCAK------QSKGLVLLLINLDNSTTVHASVAFNGTLTSRHKHK 468
S G+K K+R Y HC + L L INL N T
Sbjct: 421 ASVQGSKRRKLRVYLHCTNTDNPRYKEGDLTLYAINLHNVT------------------- 461
Query: 469 SLKMKIIKLPQASVGGNEREEYHLTAKDGD--LHSQTMLLNGNILSVNSIGDIPTLEPLR 526
K ++LP N++ + +L G L S+++ LNG L++ + D TL PL
Sbjct: 462 ----KYLRLPYPF--SNKQVDKYLLRPLGPHGLLSKSVQLNG--LTLKMVDD-QTLPPLM 512
Query: 527 VKSTQP---VSVGPFSIVFVHMPHVILPAC 553
K +P + + FS F + + + AC
Sbjct: 513 EKPLRPGSSLGLPAFSYSFFVIRNAKVAAC 542
>sp|Q9MYY0|HPSE_BOVIN Heparanase OS=Bos taurus GN=HPSE PE=2 SV=2
Length = 545
Score = 114 bits (286), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 131/513 (25%), Positives = 197/513 (38%), Gaps = 95/513 (18%)
Query: 91 LNSNILLNAVKAFSPLKIRLGGTLQDKVIYD--------------TEDNRQPCK------ 130
L S+ L + +P +R GG D +I+D ++ N+ CK
Sbjct: 77 LGSSKLRTLARGLAPAYLRFGGNKGDFLIFDPKKEPAFEERSYWLSQSNQDICKSGSIPS 136
Query: 131 --------------QFVKNSSEMFGFTQGCLPMHRWDELNAFFKKSGCPCIQFRAKIVFG 176
Q + FT D L F SG ++FG
Sbjct: 137 DVEEKLRLEWPFQEQVLLREQYQKKFTNSTYSRSSVDMLYTFASCSGL-------NLIFG 189
Query: 177 LNALTGRSIQNDGSVKGAWDYTNAESFISYTVKKNYSIHGWELGNELCG--NGVGTRVAA 234
+NAL + + WD +NA+ + Y KNY+I WELGNE G +
Sbjct: 190 VNALLRTTDMH-------WDSSNAQLLLDYCSSKNYNI-SWELGNEPNSFQRKAGIFING 241
Query: 235 AQYATDTISLRNVVQK--IYTGVDSKPLIIAPGGFFDAKWFKEFLDKSGQSLDVAT-HHI 291
Q D I R ++ K P I P K K FL G+ +D T HH
Sbjct: 242 RQLGEDFIEFRKLLGKSAFKNAKLYGPDIGQPRRN-TVKMLKSFLKAGGEVIDSVTWHHY 300
Query: 292 YNLGPGVDQHLVEKILDPLYLDREVDTFSQLENTLKSSATSAVAWVGESGGAYNSGHNLV 351
Y G + E L+P LD + + + ++ W+GE+ A+ G +
Sbjct: 301 YVNGRIATK---EDFLNPDILDTFISSVQKTLRIVEKIRPLKKVWLGETSSAFGGGAPFL 357
Query: 352 TNAFVFSFWYLDQLGMAAAHDTKTYCRQSLIG-GNYGLLNTTTFVPNPDYYSALLWHRLM 410
+N F F +LD+LG++A + RQ L G GNY L++ F P PDY+ +LL+ +L+
Sbjct: 358 SNTFAAGFMWLDKLGLSARMGIEVVMRQVLFGAGNYHLVD-GNFEPLPDYWLSLLFKKLV 416
Query: 411 GRNALSTSFSGT--KKIRSYAHCAK------QSKGLVLLLINLDNSTTVHASVAFNGTLT 462
G L S G K R Y HC + L L +NL N T
Sbjct: 417 GNKVLMASVKGPDRSKFRVYLHCTNTKHPRYKEGDLTLYALNLHNVT------------- 463
Query: 463 SRHKHKSLKMKIIKLPQASVGGNEREEYHLTAKDGD--LHSQTMLLNGNILSVNSIGDIP 520
KH L + N++ + +L G L S+++ LNG IL + +P
Sbjct: 464 ---KHLELPHHLF---------NKQVDKYLIKPSGTDGLLSKSVQLNGQILKMVDEQTLP 511
Query: 521 TLEPLRVKSTQPVSVGPFSIVFVHMPHVILPAC 553
L + + + PFS F + + + AC
Sbjct: 512 ALTEKPLHPGSSLGMPPFSYGFFVIRNAKVAAC 544
>sp|Q71RP1|HPSE_RAT Heparanase OS=Rattus norvegicus GN=Hpse PE=2 SV=1
Length = 536
Score = 113 bits (283), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 129/504 (25%), Positives = 201/504 (39%), Gaps = 77/504 (15%)
Query: 91 LNSNILLNAVKAFSPLKIRLGGTLQDKVIYD--------------TEDNRQPC------K 130
L S L + SP +R GGT D +I+D ++DN C
Sbjct: 68 LGSPRLRALARGLSPAYLRFGGTKTDFLIFDPNKEPTSEERSYWQSQDNNDICGSERVSA 127
Query: 131 QFVKNSSEMFGFTQGCLPMHRWDE--LNAFFKKSGCPCIQFRAK-----IVFGLNALTGR 183
++ + F + L ++ N+ + +S + AK ++FGLNAL
Sbjct: 128 DVLRKLQMEWPFQELLLLREQYQREFKNSTYSRSSVDMLYSFAKCSRLDLIFGLNALLR- 186
Query: 184 SIQNDGSVKGAWDYTNAESFISYTVKKNYSIHGWELGNELCG--NGVGTRVAAAQYATDT 241
+ W+ +NA+ ++Y K Y+I WELGNE + Q D
Sbjct: 187 ------TPDLRWNSSNAQLLLNYCSSKGYNI-SWELGNEPNSFWKKAHISIDGLQLGEDF 239
Query: 242 ISLRNVVQK--IYTGVDSKPLIIAPGGFFDAKWFKEFLDKSGQSLDVATHHIYNLGPGVD 299
+ L ++QK P I P G K + FL G+ +D T H Y L V
Sbjct: 240 VELHKLLQKSAFQNAKLYGPDIGQPRGK-TVKLLRSFLKAGGEVIDSLTWHHYYLNGRVA 298
Query: 300 QHLVEKILDPLYLDREVDTFSQLENTLKSSATSAVAWVGESGGAYNSGHNLVTNAFVFSF 359
E L LD + + ++ K W+GE+ AY G L+++ F F
Sbjct: 299 TK--EDFLSSDVLDTFILSVQKILKVTKEMTPGKKVWLGETSSAYGGGAPLLSDTFAAGF 356
Query: 360 WYLDQLGMAAAHDTKTYCRQSLIG-GNYGLLNTTTFVPNPDYYSALLWHRLMGRNALSTS 418
+LD+LG++A + RQ G GNY L++ F P PDY+ +LL+ +L+G L +
Sbjct: 357 MWLDKLGLSAQLGIEVVMRQVFFGAGNYHLVD-ENFEPLPDYWLSLLFKKLVGPKVLMSR 415
Query: 419 FSGT--KKIRSYAHCAK------QSKGLVLLLINLDNSTTVHASVAFNGTLTSRHKHKSL 470
G K+R Y HC + L L ++NL N T KH L
Sbjct: 416 VKGPDRSKLRVYLHCTNVYHPRYREGDLTLYVLNLHNVT----------------KHLKL 459
Query: 471 KMKIIKLPQASVGGNEREEYHLTAKDGD-LHSQTMLLNGNILSVNSIGDIPTLEPLRVKS 529
+ P ++Y L D L S+++ LNG L + +P L + +
Sbjct: 460 PPPMFSRPV--------DKYLLKPFGSDGLLSKSVQLNGQTLKMVDEQTLPALTEKPLPA 511
Query: 530 TQPVSVGPFSIVFVHMPHVILPAC 553
+SV FS F + + + AC
Sbjct: 512 GSSLSVPAFSYGFFVIRNAKIAAC 535
>sp|Q6YGZ1|HPSE_MOUSE Heparanase OS=Mus musculus GN=Hpse PE=1 SV=3
Length = 535
Score = 113 bits (282), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 127/504 (25%), Positives = 201/504 (39%), Gaps = 77/504 (15%)
Query: 91 LNSNILLNAVKAFSPLKIRLGGTLQDKVIYDTED--------------NRQPCKQ----- 131
L S L + SP +R GGT D +I+D + N C+
Sbjct: 67 LGSPRLRALARGLSPAYLRFGGTKTDFLIFDPDKEPTSEERSYWKSQVNHDICRSEPVSA 126
Query: 132 -FVKNSSEMFGFTQGCLPMHRWDE--LNAFFKKSGCPCIQFRAK-----IVFGLNALTGR 183
++ + F + L ++ + N+ + +S + AK ++FGLNAL
Sbjct: 127 AVLRKLQVEWPFQELLLLREQYQKEFKNSTYSRSSVDMLYSFAKCSGLDLIFGLNALLR- 185
Query: 184 SIQNDGSVKGAWDYTNAESFISYTVKKNYSIHGWELGNELCG--NGVGTRVAAAQYATDT 241
+ W+ +NA+ + Y K Y+I WELGNE + Q D
Sbjct: 186 ------TPDLRWNSSNAQLLLDYCSSKGYNI-SWELGNEPNSFWKKAHILIDGLQLGEDF 238
Query: 242 ISLRNVVQK--IYTGVDSKPLIIAPGGFFDAKWFKEFLDKSGQSLDVATHHIYNLGPGVD 299
+ L ++Q+ P I P G K + FL G+ +D T H Y L +
Sbjct: 239 VELHKLLQRSAFQNAKLYGPDIGQPRGK-TVKLLRSFLKAGGEVIDSLTWHHYYLNGRIA 297
Query: 300 QHLVEKILDPLYLDREVDTFSQLENTLKSSATSAVAWVGESGGAYNSGHNLVTNAFVFSF 359
E L LD + + ++ K W+GE+ AY G L++N F F
Sbjct: 298 TK--EDFLSSDVLDTFILSVQKILKVTKEITPGKKVWLGETSSAYGGGAPLLSNTFAAGF 355
Query: 360 WYLDQLGMAAAHDTKTYCRQSLIG-GNYGLLNTTTFVPNPDYYSALLWHRLMGRNALSTS 418
+LD+LG++A + RQ G GNY L++ F P PDY+ +LL+ +L+G L +
Sbjct: 356 MWLDKLGLSAQMGIEVVMRQVFFGAGNYHLVD-ENFEPLPDYWLSLLFKKLVGPRVLLSR 414
Query: 419 FSGT--KKIRSYAHCAK------QSKGLVLLLINLDNSTTVHASVAFNGTLTSRHKHKSL 470
G K+R Y HC Q L L ++NL N T KH +
Sbjct: 415 VKGPDRSKLRVYLHCTNVYHPRYQEGDLTLYVLNLHNVT----------------KHLKV 458
Query: 471 KMKIIKLPQASVGGNEREEYHLTAKDGD-LHSQTMLLNGNILSVNSIGDIPTLEPLRVKS 529
+ + P + Y L D L S+++ LNG IL + +P L + +
Sbjct: 459 PPPLFRKPV--------DTYLLKPSGPDGLLSKSVQLNGQILKMVDEQTLPALTEKPLPA 510
Query: 530 TQPVSVGPFSIVFVHMPHVILPAC 553
+S+ FS F + + + AC
Sbjct: 511 GSALSLPAFSYGFFVIRNAKIAAC 534
>sp|Q90YK5|HPSE_CHICK Heparanase OS=Gallus gallus GN=HPSE PE=1 SV=1
Length = 523
Score = 112 bits (280), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 111/417 (26%), Positives = 181/417 (43%), Gaps = 60/417 (14%)
Query: 153 DELNAFFKKSGCPCIQFRAKIVFGLNALTGRS-IQNDGSVKGAWDYTNAESFISYTVKKN 211
D L+ F SG FR +VFGLNAL R+ +Q WD +NA+ + Y +++
Sbjct: 150 DILHTFASSSG-----FR--LVFGLNALLRRAGLQ--------WDSSNAKQLLGYCAQRS 194
Query: 212 YSIHGWELGNELCG--NGVGTRVAAAQYATDTISLRNVVQK--IYTGVDSKPLIIAPGGF 267
Y+I WELGNE G + Q D + LR ++ + +Y + L +
Sbjct: 195 YNI-SWELGNEPNSFRKKSGICIDGFQLGRDFVHLRQLLSQHPLYRHAELYGLDVGQPRK 253
Query: 268 FDAKWFKEFLDKSGQSLDVAT-HHIYNLGPGVDQHLVEKILDPLYLDREVDTFSQLENTL 326
+ F+ G+++D T HH Y G + E L P LD + +
Sbjct: 254 HTQHLLRSFMKSGGKAIDSVTWHHYYVNGRSATR---EDFLSPEVLDSFATAIHDVLGIV 310
Query: 327 KSSATSAVAWVGESGGAYNSGHNLVTNAFVFSFWYLDQLGMAAAHDTKTYCRQSLIG-GN 385
+++ W+GE+G AY G ++N +V F +LD+LG+AA RQ G G+
Sbjct: 311 EATVPGKKVWLGETGSAYGGGAPQLSNTYVAGFMWLDKLGLAARRGIDVVMRQVSFGAGS 370
Query: 386 YGLLNTTTFVPNPDYYSALLWHRLMGRNALSTSF--SGTKKIRSYAHCA-----KQSKG- 437
Y L++ F P PDY+ +LL+ RL+G L S + ++ R Y HC K +G
Sbjct: 371 YHLVD-AGFKPLPDYWLSLLYKRLVGTRVLQASVEQADARRPRVYLHCTNPRHPKYREGD 429
Query: 438 LVLLLINLDNSTTVHASVAFNGTLTSRHKHKSLKMKIIKLPQASVGGNEREEYHLTAKDG 497
+ L +NL N T + ++LP+ + ++Y L
Sbjct: 430 VTLFALNLSNVT-----------------------QSLQLPK-QLWSKSVDQYLLLPHGK 465
Query: 498 D-LHSQTMLLNGNILSVNSIGDIPTLEPLRVKSTQPVSVGPFSIVFVHMPHVILPAC 553
D + S+ + LNG +L + +P L + + + + FS F + + AC
Sbjct: 466 DSILSREVQLNGRLLQMVDDETLPALHEMALAPGSTLGLPAFSYGFYVIRNAKAIAC 522
>sp|Q8WWQ2|HPSE2_HUMAN Inactive heparanase-2 OS=Homo sapiens GN=HPSE2 PE=1 SV=3
Length = 592
Score = 87.4 bits (215), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 95/375 (25%), Positives = 154/375 (41%), Gaps = 57/375 (15%)
Query: 205 SYTVKKNYSIHGWELGNELCGNGV--GTRVAAAQYATDTISLRNVVQ--KIYTGVD-SKP 259
Y+ K Y+I WELGNE G V +Q D I L++++Q +IY+ P
Sbjct: 246 KYSASKKYNI-SWELGNEPNNYRTMHGRAVNGSQLGKDYIQLKSLLQPIRIYSRASLYGP 304
Query: 260 LIIAPGGFFDAKWFKEFLDKSGQSLDVAT-HHIYNLGPGVDQHLVEKILDPL---YLDRE 315
I P A F+ +G ++D T H Y +D +V K++D L LD
Sbjct: 305 NIGRPRKNVIA-LLDGFMKVAGSTVDAVTWQHCY-----IDGRVV-KVMDFLKTRLLDTL 357
Query: 316 VDTFSQLENTLKSSATSAVAWVGESGGAYNSGHNLVTNAFVFSFWYLDQLGMAAAHDTKT 375
D +++ + + W+ G N +++++ F +L+ LGM A
Sbjct: 358 SDQIRKIQKVVNTYTPGKKIWLEGVVTTSAGGTNNLSDSYAAGFLWLNTLGMLANQGIDV 417
Query: 376 YCRQSLIGGNYGLLNTTTFVPNPDYYSALLWHRLMGRNALSTSFSGTK-----------K 424
R S Y L F P PDY+ +LL+ RL+G L+ +G + K
Sbjct: 418 VIRHSFFDHGYNHLVDQNFNPLPDYWLSLLYKRLIGPKVLAVHVAGLQRKPRPGRVIRDK 477
Query: 425 IRSYAHCAKQSK------GLVLLLINLDNSTTVHASVAFNGTLTSRHKHKSLKMKIIKLP 478
+R YAHC + L +INL S + GTL + H+ L P
Sbjct: 478 LRIYAHCTNHHNHNYVRGSITLFIINLHRS---RKKIKLAGTLRDKLVHQYLLQ-----P 529
Query: 479 QASVGGNEREEYHLTAKDGDLHSQTMLLNGNILSVNSIGDIPTLEPLRVKSTQPVSVGPF 538
G L S+++ LNG L + G +P L+P +++ + + + P
Sbjct: 530 YGQEG---------------LKSKSVQLNGQPLVMVDDGTLPELKPRPLRAGRTLVIPPV 574
Query: 539 SIVFVHMPHVILPAC 553
++ F + +V AC
Sbjct: 575 TMGFYVVKNVNALAC 589
>sp|B2RY83|HPSE2_MOUSE Inactive heparanase-2 OS=Mus musculus GN=Hpse2 PE=2 SV=1
Length = 592
Score = 86.7 bits (213), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 94/375 (25%), Positives = 153/375 (40%), Gaps = 59/375 (15%)
Query: 206 YTVKKNYSIHGWELGNELCGNGV--GTRVAAAQYATDTISLRNVVQKIYTGVDSKPLIIA 263
Y+ K Y+I WELGNE G V +Q D I L++++Q I V S+ +
Sbjct: 247 YSASKKYNI-SWELGNEPNNYRSIHGRAVNGSQLGKDYIQLKSLLQPIR--VYSRASLYG 303
Query: 264 PGGFFDAK----WFKEFLDKSGQSLDVAT-HHIYNLGPGVDQHLVEKILDPL---YLDRE 315
P K F+ +G ++D T H Y +D +V K++D L LD
Sbjct: 304 PNIGRPRKNVIALLDGFMKVAGSTVDAVTWQHCY-----IDGRVV-KVMDFLKTRLLDTL 357
Query: 316 VDTFSQLENTLKSSATSAVAWVGESGGAYNSGHNLVTNAFVFSFWYLDQLGMAAAHDTKT 375
D +++ + + W+ G N +++++ F +L+ LGM A
Sbjct: 358 SDQIRKIQKVVNTYTPGKKIWLEGVVTTSAGGTNNLSDSYAAGFLWLNTLGMLANQGIDV 417
Query: 376 YCRQSLIGGNYGLLNTTTFVPNPDYYSALLWHRLMGRNALSTSFSGTK-----------K 424
R S Y L F P PDY+ +LL+ RL+G L+ +G + K
Sbjct: 418 VIRHSFFDHGYNHLVDQNFNPLPDYWLSLLYKRLIGPKVLAVHVAGLQRKPRPGRVIRDK 477
Query: 425 IRSYAHCAKQSK------GLVLLLINLDNSTTVHASVAFNGTLTSRHKHKSLKMKIIKLP 478
+R YAHC + L +INL S + GTL + H+ L P
Sbjct: 478 LRIYAHCTNHHNHNYVRGSITLFIINLHRS---RKKIKLAGTLRDKLVHQYLLQ-----P 529
Query: 479 QASVGGNEREEYHLTAKDGDLHSQTMLLNGNILSVNSIGDIPTLEPLRVKSTQPVSVGPF 538
G L S+++ LNG L + G +P L+P +++ + + + P
Sbjct: 530 YGQEG---------------LKSKSVQLNGQPLVMVDDGTLPELKPRPLRAGRTLVIPPV 574
Query: 539 SIVFVHMPHVILPAC 553
++ F + +V AC
Sbjct: 575 TMGFYVVKNVNALAC 589
>sp|Q85056|CAPSD_AHV2H Capsid protein OS=Atkinsonella hypoxylon virus (isolate 2H) PE=3
SV=1
Length = 652
Score = 35.0 bits (79), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 40/159 (25%), Positives = 65/159 (40%), Gaps = 26/159 (16%)
Query: 314 REVDTFSQLENTLKSSATSAVAWVGE-SGGAYNSGHNLVTNAFVFSFWY---LDQLGMAA 369
RE + + +++ + AV VG GG Y S H T F + W+ L +L +A
Sbjct: 235 RESTAYEKWLDSIVIHYSRAVIRVGNLVGGLYQSSHGSTTTHFTYRNWFARSLSRLADSA 294
Query: 370 AHDTKTYCRQSLIGG---NYGLLNTTTFVPNPDYYSALLWHRLMGRNAL----------- 415
H +T+ R+ +I N +N T+ P Y LL RN
Sbjct: 295 TH--RTHLRRPMISEFDYNIPSVNNNTYNP----YVHLLMLEPNNRNITLDFIRSLSSFC 348
Query: 416 STSFSGTKKIRSYAHCAKQSKGLVLLLINLDNSTTVHAS 454
ST T+ +R H +++S + +I + T H+S
Sbjct: 349 STELKATRTLRD--HISRRSAAISRCVIKGPEAPTWHSS 385
>sp|Q8TCU4|ALMS1_HUMAN Alstrom syndrome protein 1 OS=Homo sapiens GN=ALMS1 PE=1 SV=3
Length = 4167
Score = 33.9 bits (76), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 23/96 (23%), Positives = 44/96 (45%), Gaps = 11/96 (11%)
Query: 29 SILQAEAAGGAGFVGGNVFIDRRSVIGRTDDDFVCATLDWWPPEKCDYGTCSWDRASLLN 88
S++++E G +G +G + I +VI G+CSWD +
Sbjct: 2385 SVMRSEPEGCSGTIGNKIIIPMMTVIKSDSSSDASD----------GNGSCSWDSNLPES 2434
Query: 89 LDLNSNILLNAVKAFSPLKIRLGGTLQDKVIYDTED 124
L+ S++LLN SP K + + +++ + ++ED
Sbjct: 2435 LESVSDVLLNFFPYVSP-KTSITDSREEEGVSESED 2469
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.320 0.135 0.417
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 206,878,667
Number of Sequences: 539616
Number of extensions: 8699207
Number of successful extensions: 19670
Number of sequences better than 100.0: 18
Number of HSP's better than 100.0 without gapping: 13
Number of HSP's successfully gapped in prelim test: 5
Number of HSP's that attempted gapping in prelim test: 19619
Number of HSP's gapped (non-prelim): 23
length of query: 554
length of database: 191,569,459
effective HSP length: 123
effective length of query: 431
effective length of database: 125,196,691
effective search space: 53959773821
effective search space used: 53959773821
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 64 (29.3 bits)
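
The effective lengths and search space in the statistics block above are consistent with the usual length-adjustment arithmetic: subtract the effective HSP length from the query once and from the database once per sequence, then multiply the two effective lengths. The sketch below only re-derives the figures printed in this report. The function and variable names are illustrative, not part of BLAST, and it is an assumption that the program computed the values exactly this way (for instance, any clamping of lengths to a minimum is left out).

```python
# Values copied from the statistics block of this report.
QUERY_LENGTH = 554             # "length of query"
DB_LETTERS = 191_569_459       # "length of database"
DB_SEQUENCES = 539_616         # "Number of Sequences"
EFFECTIVE_HSP_LENGTH = 123     # "effective HSP length"


def effective_search_space(query_len, db_len, n_seqs, hsp_len):
    """Return (effective query length, effective db length, search space).

    Hypothetical helper: applies the length adjustment once to the query
    and once per database sequence, then multiplies the results.
    """
    eff_query = query_len - hsp_len
    eff_db = db_len - n_seqs * hsp_len
    return eff_query, eff_db, eff_query * eff_db


eff_query, eff_db, space = effective_search_space(
    QUERY_LENGTH, DB_LETTERS, DB_SEQUENCES, EFFECTIVE_HSP_LENGTH
)
print(eff_query, eff_db, space)  # 431 125196691 53959773821
```

The three printed numbers match the "effective length of query", "effective length of database", and "effective search space" lines above.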