BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 008951
(547 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|Q9FZP1|HPSE3_ARATH Heparanase-like protein 3 OS=Arabidopsis thaliana GN=At5g34940 PE=2
SV=2
Length = 536
Score = 748 bits (1930), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 339/506 (66%), Positives = 419/506 (82%), Gaps = 2/506 (0%)
Query: 44 GNVFIDRRSVIGRTDDDFVCATLDWWPPEKCDYGTCSWDRASLLNLDLNSNILLNAVKAF 103
G VF+ R+ +G D+DF+CATLDWWPPEKCDYG+CSWD AS+LNLDLN+ IL NA+KAF
Sbjct: 31 GTVFVYGRAAVGTIDEDFICATLDWWPPEKCDYGSCSWDHASILNLDLNNVILQNAIKAF 90
Query: 104 SPLKIRLGGTLQDKVIYDTEDNRQPCKQFVKNSSEMFGFTQGCLPMHRWDELNAFFKKSG 163
+PLKIR+GGTLQD VIY+T D++QPC F KNSS +FG+TQGCLPM RWDELNAFF+K+G
Sbjct: 91 APLKIRIGGTLQDIVIYETPDSKQPCLPFTKNSSILFGYTQGCLPMRRWDELNAFFRKTG 150
Query: 164 AKIVFGLNALTGRSIQNDGSVKGAWDYTNAESFISYTVKKNYSIHGWELGNELCGNGVGT 223
K++FGLNAL+GRSI+++G GAW+YTNAESFI +T + NY+I GWELGNELCG+GVG
Sbjct: 151 TKVIFGLNALSGRSIKSNGEAIGAWNYTNAESFIRFTAENNYTIDGWELGNELCGSGVGA 210
Query: 224 RVAAAQYATDTISLRNVVQKIYTGVDSKPLIIAPGGFFDAKWFKEFLDKSGQSLDVATHH 283
RV A QYA DTI+LRN+V ++Y V PL+I PGGFF+ WF E+L+K+ SL+ T H
Sbjct: 211 RVGANQYAIDTINLRNIVNRVYKNVSPMPLVIGPGGFFEVDWFTEYLNKAENSLNATTRH 270
Query: 284 IYNLGPGVDQHLVEKILDPLYLDREVDTFSQLENTLKSSATSAVAWVGESGGAYNSGHNL 343
IY+LGPGVD+HL+EKIL+P YLD+E +F L+N +K+S+T AVAWVGESGGAYNSG NL
Sbjct: 271 IYDLGPGVDEHLIEKILNPSYLDQEAKSFRSLKNIIKNSSTKAVAWVGESGGAYNSGRNL 330
Query: 344 VTNAFVFSFWYLDQLGMAAAHDTKTYCRQSLIGGNYGLLNTTTFVPNPDYYSALLWHRLM 403
V+NAFV+SFWYLDQLGMA+ +DTKTYCRQSLIGGNYGLLNTT F PNPDYYSAL+W +LM
Sbjct: 331 VSNAFVYSFWYLDQLGMASLYDTKTYCRQSLIGGNYGLLNTTNFTPNPDYYSALIWRQLM 390
Query: 404 GRNALSTSFSGTKKIRSYAHCAKQSKGLVLLLINLDNSTTVHASVAFNGTLTSRH-KH-K 461
GR AL T+FSGTKKIRSY HCA+QSKG+ +LL+NLDN+TTV A V N + + RH KH K
Sbjct: 391 GRKALFTTFSGTKKIRSYTHCARQSKGITVLLMNLDNTTTVVAKVELNNSFSLRHTKHMK 450
Query: 462 SLKMKIIKLPQASVGGNEREEYHLTAKDGDLHSQTMLLNGNILSVNSIGDIPTLEPLRVK 521
S K +L G +REEYHLTAKDG+LHSQTMLLNGN L VNS+GD+P +EP+ +
Sbjct: 451 SYKRASSQLFGGPNGVIQREEYHLTAKDGNLHSQTMLLNGNALQVNSMGDLPPIEPIHIN 510
Query: 522 STQPVSVGPFSIVFVHMPHVILPACS 547
ST+P+++ P+SIVFVHM +V++PAC+
Sbjct: 511 STEPITIAPYSIVFVHMRNVVVPACA 536
>sp|Q8L608|HPSE2_ARATH Heparanase-like protein 2 OS=Arabidopsis thaliana GN=At5g61250 PE=2
SV=1
Length = 539
Score = 531 bits (1369), Expect = e-150, Method: Compositional matrix adjust.
Identities = 258/512 (50%), Positives = 338/512 (66%), Gaps = 11/512 (2%)
Query: 46 VFIDRRSVIGRTDDDFVCATLDWWPPEKCDYGTCSWDRASLLNLDLNSNILLNAVKAFSP 105
+ ID I TD++F+CATLDWWPPEKC+Y C W ASL+NL+L S +L A++AF
Sbjct: 29 LVIDGSRRIAETDENFICATLDWWPPEKCNYDQCPWGYASLINLNLASPLLAKAIQAFRT 88
Query: 106 LKIRLGGTLQDKVIYDTEDNRQPCKQFVKNSSEMFGFTQGCLPMHRWDELNAFFKKSGAK 165
L+IR+GG+LQD+VIYD D + PC QF K +FGF++GCL M RWDE+N FF +GA
Sbjct: 89 LRIRIGGSLQDQVIYDVGDLKTPCTQFKKTDDGLFGFSEGCLYMKRWDEVNHFFNATGAI 148
Query: 166 IVFGLNALTGRSIQNDGSVKGAWDYTNAESFISYTVKKNYSIHGWELGNELCGNGVGTRV 225
+ FGLNAL GR+ N + G WD+TN + F++YTV K Y+I WE GNEL G+G+ V
Sbjct: 149 VTFGLNALHGRNKLNGTAWGGDWDHTNTQDFMNYTVSKGYAIDSWEFGNELSGSGIWASV 208
Query: 226 AAAQYATDTISLRNVVQKIYTGVDSKPLIIAPGGFFDAKWFKEFLDKSGQS-LDVATHHI 284
+ Y D I L+NV++ +Y +KPL++APGGFF+ +W+ E L SG LDV THHI
Sbjct: 209 SVELYGKDLIVLKNVIKNVYKNSRTKPLVVAPGGFFEEQWYSELLRLSGPGVLDVLTHHI 268
Query: 285 YNLGPGVDQHLVEKILDPLYLDREVDTFSQLENTLKSSATSAVAWVGESGGAYNSGHNLV 344
YNLGPG D LV KILDP YL + F+ + T++ A AWVGE+GGA+NSG V
Sbjct: 269 YNLGPGNDPKLVNKILDPNYLSGISELFANVNQTIQEHGPWAAAWVGEAGGAFNSGGRQV 328
Query: 345 TNAFVFSFWYLDQLGMAAAHDTKTYCRQSLIGGNYGLLNTTTFVPNPDYYSALLWHRLMG 404
+ F+ SFWYLDQLG+++ H+TK YCRQ+L+GG YGLL TFVPNPDYYSALLWHRLMG
Sbjct: 329 SETFINSFWYLDQLGISSKHNTKVYCRQALVGGFYGLLEKETFVPNPDYYSALLWHRLMG 388
Query: 405 RNALSTSFSGTKKIRSYAHCAKQSKGLVLLLINLDNSTTVHASVAFNG---TLTSRHKHK 461
+ L + ++ +R+Y HC+K+ G+ +LLINL TT +V+ NG L + +
Sbjct: 389 KGILGVQTTASEYLRAYVHCSKRRAGITILLINLSKHTTFTVAVS-NGVKVVLQAESMKR 447
Query: 462 SLKMKIIKLP------QASVGGNEREEYHLTAKDGDLHSQTMLLNGNILSVNSIGDIPTL 515
++ IK +AS G REEYHL+ KDGDL S+ MLLNG L + GDIP L
Sbjct: 448 KSFLETIKSKVSWVGNKASDGYLNREEYHLSPKDGDLRSKIMLLNGKPLVPTATGDIPKL 507
Query: 516 EPLRVKSTQPVSVGPFSIVFVHMPHVILPACS 547
EP+R PV + P SI F+ +P PACS
Sbjct: 508 EPVRHGVKSPVYINPLSISFIVLPTFDAPACS 539
>sp|Q9FF10|HPSE1_ARATH Heparanase-like protein 1 OS=Arabidopsis thaliana GN=At5g07830 PE=2
SV=1
Length = 543
Score = 518 bits (1334), Expect = e-146, Method: Compositional matrix adjust.
Identities = 253/513 (49%), Positives = 339/513 (66%), Gaps = 10/513 (1%)
Query: 45 NVFIDRRSVIGRTDDDFVCATLDWWPPEKCDYGTCSWDRASLLNLDLNSNILLNAVKAFS 104
++ I + TD++FVCATLDWWP +KC+Y C W +S++N+DL +L A+KAF
Sbjct: 31 SIVIQGARRVCETDENFVCATLDWWPHDKCNYDQCPWGYSSVINMDLTRPLLTKAIKAFK 90
Query: 105 PLKIRLGGTLQDKVIYDTEDNRQPCKQFVKNSSEMFGFTQGCLPMHRWDELNAFFKKSGA 164
PL+IR+GG+LQD+VIYD + + PC+ F K +S +FGF++GCL M RWDELN+F +GA
Sbjct: 91 PLRIRIGGSLQDQVIYDVGNLKTPCRPFQKMNSGLFGFSKGCLHMKRWDELNSFLTATGA 150
Query: 165 KIVFGLNALTGRSIQNDGSVKGAWDYTNAESFISYTVKKNYSIHGWELGNELCGNGVGTR 224
+ FGLNAL GR + GAWD+ N + F++YTV K Y I WE GNEL G+GVG
Sbjct: 151 VVTFGLNALRGRHKLRGKAWGGAWDHINTQDFLNYTVSKGYVIDSWEFGNELSGSGVGAS 210
Query: 225 VAAAQYATDTISLRNVVQKIYTGV-DSKPLIIAPGGFFDAKWFKEFLDKSGQS-LDVATH 282
V+A Y D I L++V+ K+Y KP+++APGGF++ +W+ + L+ SG S +DV TH
Sbjct: 211 VSAELYGKDLIVLKDVINKVYKNSWLHKPILVAPGGFYEQQWYTKLLEISGPSVVDVVTH 270
Query: 283 HIYNLGPGVDQHLVEKILDPLYLDREVDTFSQLENTLKSSATSAVAWVGESGGAYNSGHN 342
HIYNLG G D LV+KI+DP YL + TF + T++ A WVGESGGAYNSG
Sbjct: 271 HIYNLGSGNDPALVKKIMDPSYLSQVSKTFKDVNQTIQEHGPWASPWVGESGGAYNSGGR 330
Query: 343 LVTNAFVFSFWYLDQLGMAAAHDTKTYCRQSLIGGNYGLLNTTTFVPNPDYYSALLWHRL 402
V++ F+ SFWYLDQLGM+A H+TK YCRQ+L+GG YGLL TFVPNPDYYSALLWHRL
Sbjct: 331 HVSDTFIDSFWYLDQLGMSARHNTKVYCRQTLVGGFYGLLEKGTFVPNPDYYSALLWHRL 390
Query: 403 MGRNALSTSFSGTKKIRSYAHCAKQSKGLVLLLINLDNST--TVHASVAFNGTLTSRHKH 460
MG+ L+ G ++R YAHC+K G+ LLLINL N + TV S N L + +
Sbjct: 391 MGKGVLAVQTDGPPQLRVYAHCSKGRAGVTLLLINLSNQSDFTVSVSNGINVVLNAESRK 450
Query: 461 KSLKMKIIKLP------QASVGGNEREEYHLTAKDGDLHSQTMLLNGNILSVNSIGDIPT 514
K + +K P +AS G REEYHLT ++G L S+TM+LNG L + GDIP+
Sbjct: 451 KKSLLDTLKRPFSWIGSKASDGYLNREEYHLTPENGVLRSKTMVLNGKSLKPTATGDIPS 510
Query: 515 LEPLRVKSTQPVSVGPFSIVFVHMPHVILPACS 547
LEP+ P++V P S+ F+ +P+ ACS
Sbjct: 511 LEPVLRSVNSPLNVLPLSMSFIVLPNFDASACS 543
>sp|Q9LRC8|BAGLU_SCUBA Baicalin-beta-D-glucuronidase OS=Scutellaria baicalensis GN=SGUS
PE=1 SV=1
Length = 527
Score = 459 bits (1180), Expect = e-128, Method: Compositional matrix adjust.
Identities = 235/566 (41%), Positives = 337/566 (59%), Gaps = 63/566 (11%)
Query: 1 MGSQAWLK---VLLFGFCFWLSSRSSSSSSLSILQAEAAGGAGFVGGNVFIDRRSVIGRT 57
MG Q W K VL F F ++ + I + + +T
Sbjct: 1 MGFQVWQKGLCVLCFSLIFICGVIGEETTIVKI-------------------EENPVAQT 41
Query: 58 DDDFVCATLDWWPPEKCDYGTCSWDRASLLNLDLNSNILLNAVKAFSPLKIRLGGTLQDK 117
D+++VCATLD WPP KC+YG C W ++S LNLDLN+NI+ NAVK F+PLK+R GGTLQD+
Sbjct: 42 DENYVCATLDLWPPTKCNYGNCPWGKSSFLNLDLNNNIIRNAVKEFAPLKLRFGGTLQDR 101
Query: 118 VIYDTEDNRQPCKQ-FVKNSSEMFGFTQGCLPMHRWDELNAFFKKSGAKIVFGLNALTGR 176
++Y T + +PC F N++ + F+ CL + RWDE+N F ++G++ VFGLNAL G+
Sbjct: 102 LVYQTSRD-EPCDSTFYNNTNLILDFSHACLSLDRWDEINQFILETGSEAVFGLNALRGK 160
Query: 177 SIQNDGSVK------------GAWDYTNAESFISYTVKKNYS-IHGWELGNELCGNGVGT 223
+++ G +K G WDY+N++ I Y++KK Y I GW LGNEL G+ +
Sbjct: 161 TVEIKGIIKDGQYLGETTTAVGEWDYSNSKFLIEYSLKKGYKHIRGWTLGNELGGHTLFI 220
Query: 224 RVAAAQYATDTISLRNVVQKIYTGVDSKPLIIAPGGFFDAKWFKEFLDKSGQSLDVATHH 283
V+ YA D L +V++IY + PLIIAPG FD +W+ EF+D++ + L VATHH
Sbjct: 221 GVSPEDYANDAKKLHELVKEIYQDQGTMPLIIAPGAIFDLEWYTEFIDRTPE-LHVATHH 279
Query: 284 IYNLGPGVDQHLVEKILDPLYLDREVDT-FSQLENTLKSSATSAVAWVGESGGAYNSGHN 342
+YNLG G D L + +L + D + + L+ + T AVAW+GE+GGA+NSG +
Sbjct: 280 MYNLGSGGDDALKDVLLTASFFDEATKSMYEGLQKIVNRPGTKAVAWIGEAGGAFNSGQD 339
Query: 343 LVTNAFVFSFWYLDQLGMAAAHDTKTYCRQSLIGGNYGLLNTTTFVPNPDYYSALLWHRL 402
++N F+ FWYL+ LG +A DTKT+CRQ+L GGNYGLL T T++PNPDYYSALLWHRL
Sbjct: 340 GISNTFINGFWYLNMLGYSALLDTKTFCRQTLTGGNYGLLQTGTYIPNPDYYSALLWHRL 399
Query: 403 MGRNALSTSFSGTKKIRSYAHCAKQSKGLVLLLINLDNSTTVHASVAFNGTLTSRHKHKS 462
MG L T GTK + YAHCAK+S G+ +L++N D ++V S+
Sbjct: 400 MGSKVLKTEIVGTKNVYIYAHCAKKSNGITMLVLNHDGESSVKISL-------------- 445
Query: 463 LKMKIIKLPQASVGGNEREEYHLTAKDGDLHSQTMLLNGNILSVNSIGDIPTLEPLRVKS 522
S G++REEYHLT + +L S+ + LNG +L ++ G IP L P+ +
Sbjct: 446 ---------DPSKYGSKREEYHLTPVNNNLQSRLVKLNGELLHLDPSGVIPALNPVEKDN 496
Query: 523 TQPVSVGPFSIVFVHMP-HVILPACS 547
++ + V P+S +FVH+P + AC
Sbjct: 497 SKQLEVAPYSFMFVHLPGPTMFSACE 522
>sp|Q9Y251|HPSE_HUMAN Heparanase OS=Homo sapiens GN=HPSE PE=1 SV=2
Length = 543
Score = 121 bits (303), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 138/510 (27%), Positives = 211/510 (41%), Gaps = 96/510 (18%)
Query: 91 LNSNILLNAVKAFSPLKIRLGGTLQDKVIYD--------------TEDNRQPCK------ 130
L S L + SP +R GGT D +I+D ++ N+ CK
Sbjct: 75 LGSPKLRTLARGLSPAYLRFGGTKTDFLIFDPKKESTFEERSYWQSQVNQDICKYGSIPP 134
Query: 131 --------------QFVKNSSEMFGFTQGCLPMHRWDELNAFFKKSGAKIVFGLNALTGR 176
Q + F D L F SG ++FGLNAL
Sbjct: 135 DVEEKLRLEWPYQEQLLLREHYQKKFKNSTYSRSSVDVLYTFANCSGLDLIFGLNALLR- 193
Query: 177 SIQNDGSVKGAWDYTNAESFISYTVKKNYSIHGWELGNELCG--NGVGTRVAAAQYATDT 234
+ W+ +NA+ + Y K Y+I WELGNE + +Q D
Sbjct: 194 ------TADLQWNSSNAQLLLDYCSSKGYNI-SWELGNEPNSFLKKADIFINGSQLGEDF 246
Query: 235 ISLRNVVQKIYTGVDSK---PLIIAPGGFFDAKWFKEFLDKSGQSLDVAT-HHIYNLGPG 290
I L +++K T ++K P + P AK K FL G+ +D T HH Y G
Sbjct: 247 IQLHKLLRK-STFKNAKLYGPDVGQPRRK-TAKMLKSFLKAGGEVIDSVTWHHYYLNGRT 304
Query: 291 VDQHLVEKILDPLYLDREVDTFSQLENTLKSSATSAVAWVGESGGAYNSGHNLVTNAFVF 350
+ E L+P LD + + ++ ++S+ W+GE+ AY G L+++ F
Sbjct: 305 ATK---EDFLNPDVLDIFISSVQKVFQVVESTRPGKKVWLGETSSAYGGGAPLLSDTFAA 361
Query: 351 SFWYLDQLGMAAAHDTKTYCRQSLIG-GNYGLLNTTTFVPNPDYYSALLWHRLMGRNALS 409
F +LD+LG++A + RQ G GNY L++ F P PDY+ +LL+ +L+G L
Sbjct: 362 GFMWLDKLGLSARMGIEVVMRQVFFGAGNYHLVD-ENFDPLPDYWLSLLFKKLVGTKVLM 420
Query: 410 TSFSGTK--KIRSYAHCAK------QSKGLVLLLINLDNSTTVHASVAFNGTLTSRHKHK 461
S G+K K+R Y HC + L L INL N T
Sbjct: 421 ASVQGSKRRKLRVYLHCTNTDNPRYKEGDLTLYAINLHNVT------------------- 461
Query: 462 SLKMKIIKLPQASVGGNEREEYHLTAKDGD--LHSQTMLLNGNILSVNSIGDIPTLEPLR 519
K ++LP N++ + +L G L S+++ LNG L++ + D TL PL
Sbjct: 462 ----KYLRLPYPF--SNKQVDKYLLRPLGPHGLLSKSVQLNG--LTLKMVDD-QTLPPLM 512
Query: 520 VKSTQP---VSVGPFSIVFVHMPHVILPAC 546
K +P + + FS F + + + AC
Sbjct: 513 EKPLRPGSSLGLPAFSYSFFVIRNAKVAAC 542
>sp|Q9MYY0|HPSE_BOVIN Heparanase OS=Bos taurus GN=HPSE PE=2 SV=2
Length = 545
Score = 121 bits (303), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 131/506 (25%), Positives = 197/506 (38%), Gaps = 88/506 (17%)
Query: 91 LNSNILLNAVKAFSPLKIRLGGTLQDKVIYD--------------TEDNRQPCK------ 130
L S+ L + +P +R GG D +I+D ++ N+ CK
Sbjct: 77 LGSSKLRTLARGLAPAYLRFGGNKGDFLIFDPKKEPAFEERSYWLSQSNQDICKSGSIPS 136
Query: 131 --------------QFVKNSSEMFGFTQGCLPMHRWDELNAFFKKSGAKIVFGLNALTGR 176
Q + FT D L F SG ++FG+NAL
Sbjct: 137 DVEEKLRLEWPFQEQVLLREQYQKKFTNSTYSRSSVDMLYTFASCSGLNLIFGVNALLRT 196
Query: 177 SIQNDGSVKGAWDYTNAESFISYTVKKNYSIHGWELGNELCG--NGVGTRVAAAQYATDT 234
+ + WD +NA+ + Y KNY+I WELGNE G + Q D
Sbjct: 197 TDMH-------WDSSNAQLLLDYCSSKNYNI-SWELGNEPNSFQRKAGIFINGRQLGEDF 248
Query: 235 ISLRNVVQK--IYTGVDSKPLIIAPGGFFDAKWFKEFLDKSGQSLDVAT-HHIYNLGPGV 291
I R ++ K P I P K K FL G+ +D T HH Y G
Sbjct: 249 IEFRKLLGKSAFKNAKLYGPDIGQPRRN-TVKMLKSFLKAGGEVIDSVTWHHYYVNGRIA 307
Query: 292 DQHLVEKILDPLYLDREVDTFSQLENTLKSSATSAVAWVGESGGAYNSGHNLVTNAFVFS 351
+ E L+P LD + + + ++ W+GE+ A+ G ++N F
Sbjct: 308 TK---EDFLNPDILDTFISSVQKTLRIVEKIRPLKKVWLGETSSAFGGGAPFLSNTFAAG 364
Query: 352 FWYLDQLGMAAAHDTKTYCRQSLIG-GNYGLLNTTTFVPNPDYYSALLWHRLMGRNALST 410
F +LD+LG++A + RQ L G GNY L++ F P PDY+ +LL+ +L+G L
Sbjct: 365 FMWLDKLGLSARMGIEVVMRQVLFGAGNYHLVD-GNFEPLPDYWLSLLFKKLVGNKVLMA 423
Query: 411 SFSGT--KKIRSYAHCAK------QSKGLVLLLINLDNSTTVHASVAFNGTLTSRHKHKS 462
S G K R Y HC + L L +NL N T KH
Sbjct: 424 SVKGPDRSKFRVYLHCTNTKHPRYKEGDLTLYALNLHNVT----------------KHLE 467
Query: 463 LKMKIIKLPQASVGGNEREEYHLTAKDGD--LHSQTMLLNGNILSVNSIGDIPTLEPLRV 520
L + N++ + +L G L S+++ LNG IL + +P L +
Sbjct: 468 LPHHLF---------NKQVDKYLIKPSGTDGLLSKSVQLNGQILKMVDEQTLPALTEKPL 518
Query: 521 KSTQPVSVGPFSIVFVHMPHVILPAC 546
+ + PFS F + + + AC
Sbjct: 519 HPGSSLGMPPFSYGFFVIRNAKVAAC 544
>sp|Q90YK5|HPSE_CHICK Heparanase OS=Gallus gallus GN=HPSE PE=1 SV=1
Length = 523
Score = 118 bits (296), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 109/410 (26%), Positives = 180/410 (43%), Gaps = 53/410 (12%)
Query: 153 DELNAFFKKSGAKIVFGLNALTGRS-IQNDGSVKGAWDYTNAESFISYTVKKNYSIHGWE 211
D L+ F SG ++VFGLNAL R+ +Q WD +NA+ + Y +++Y+I WE
Sbjct: 150 DILHTFASSSGFRLVFGLNALLRRAGLQ--------WDSSNAKQLLGYCAQRSYNI-SWE 200
Query: 212 LGNELCG--NGVGTRVAAAQYATDTISLRNVVQK--IYTGVDSKPLIIAPGGFFDAKWFK 267
LGNE G + Q D + LR ++ + +Y + L + +
Sbjct: 201 LGNEPNSFRKKSGICIDGFQLGRDFVHLRQLLSQHPLYRHAELYGLDVGQPRKHTQHLLR 260
Query: 268 EFLDKSGQSLDVAT-HHIYNLGPGVDQHLVEKILDPLYLDREVDTFSQLENTLKSSATSA 326
F+ G+++D T HH Y G + E L P LD + ++++
Sbjct: 261 SFMKSGGKAIDSVTWHHYYVNGRSATR---EDFLSPEVLDSFATAIHDVLGIVEATVPGK 317
Query: 327 VAWVGESGGAYNSGHNLVTNAFVFSFWYLDQLGMAAAHDTKTYCRQSLIG-GNYGLLNTT 385
W+GE+G AY G ++N +V F +LD+LG+AA RQ G G+Y L++
Sbjct: 318 KVWLGETGSAYGGGAPQLSNTYVAGFMWLDKLGLAARRGIDVVMRQVSFGAGSYHLVD-A 376
Query: 386 TFVPNPDYYSALLWHRLMGRNALSTSF--SGTKKIRSYAHCA-----KQSKG-LVLLLIN 437
F P PDY+ +LL+ RL+G L S + ++ R Y HC K +G + L +N
Sbjct: 377 GFKPLPDYWLSLLYKRLVGTRVLQASVEQADARRPRVYLHCTNPRHPKYREGDVTLFALN 436
Query: 438 LDNSTTVHASVAFNGTLTSRHKHKSLKMKIIKLPQASVGGNEREEYHLTAKDGD-LHSQT 496
L N T + ++LP+ + ++Y L D + S+
Sbjct: 437 LSNVT-----------------------QSLQLPK-QLWSKSVDQYLLLPHGKDSILSRE 472
Query: 497 MLLNGNILSVNSIGDIPTLEPLRVKSTQPVSVGPFSIVFVHMPHVILPAC 546
+ LNG +L + +P L + + + + FS F + + AC
Sbjct: 473 VQLNGRLLQMVDDETLPALHEMALAPGSTLGLPAFSYGFYVIRNAKAIAC 522
>sp|Q6YGZ1|HPSE_MOUSE Heparanase OS=Mus musculus GN=Hpse PE=1 SV=3
Length = 535
Score = 117 bits (294), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 129/504 (25%), Positives = 199/504 (39%), Gaps = 84/504 (16%)
Query: 91 LNSNILLNAVKAFSPLKIRLGGTLQDKVIYDTED--------------NRQPCKQ----- 131
L S L + SP +R GGT D +I+D + N C+
Sbjct: 67 LGSPRLRALARGLSPAYLRFGGTKTDFLIFDPDKEPTSEERSYWKSQVNHDICRSEPVSA 126
Query: 132 -FVKNSSEMFGFTQGCLPMHRW--------------DELNAFFKKSGAKIVFGLNALTGR 176
++ + F + L ++ D L +F K SG ++FGLNAL
Sbjct: 127 AVLRKLQVEWPFQELLLLREQYQKEFKNSTYSRSSVDMLYSFAKCSGLDLIFGLNALLR- 185
Query: 177 SIQNDGSVKGAWDYTNAESFISYTVKKNYSIHGWELGNELCG--NGVGTRVAAAQYATDT 234
+ W+ +NA+ + Y K Y+I WELGNE + Q D
Sbjct: 186 ------TPDLRWNSSNAQLLLDYCSSKGYNI-SWELGNEPNSFWKKAHILIDGLQLGEDF 238
Query: 235 ISLRNVVQK--IYTGVDSKPLIIAPGGFFDAKWFKEFLDKSGQSLDVATHHIYNLGPGVD 292
+ L ++Q+ P I P G K + FL G+ +D T H Y L +
Sbjct: 239 VELHKLLQRSAFQNAKLYGPDIGQPRGK-TVKLLRSFLKAGGEVIDSLTWHHYYLNGRIA 297
Query: 293 QHLVEKILDPLYLDREVDTFSQLENTLKSSATSAVAWVGESGGAYNSGHNLVTNAFVFSF 352
E L LD + + ++ K W+GE+ AY G L++N F F
Sbjct: 298 TK--EDFLSSDVLDTFILSVQKILKVTKEITPGKKVWLGETSSAYGGGAPLLSNTFAAGF 355
Query: 353 WYLDQLGMAAAHDTKTYCRQSLIG-GNYGLLNTTTFVPNPDYYSALLWHRLMGRNALSTS 411
+LD+LG++A + RQ G GNY L++ F P PDY+ +LL+ +L+G L +
Sbjct: 356 MWLDKLGLSAQMGIEVVMRQVFFGAGNYHLVD-ENFEPLPDYWLSLLFKKLVGPRVLLSR 414
Query: 412 FSGT--KKIRSYAHCAK------QSKGLVLLLINLDNSTTVHASVAFNGTLTSRHKHKSL 463
G K+R Y HC Q L L ++NL N T KH +
Sbjct: 415 VKGPDRSKLRVYLHCTNVYHPRYQEGDLTLYVLNLHNVT----------------KHLKV 458
Query: 464 KMKIIKLPQASVGGNEREEYHLTAKDGD-LHSQTMLLNGNILSVNSIGDIPTLEPLRVKS 522
+ + P + Y L D L S+++ LNG IL + +P L + +
Sbjct: 459 PPPLFRKPV--------DTYLLKPSGPDGLLSKSVQLNGQILKMVDEQTLPALTEKPLPA 510
Query: 523 TQPVSVGPFSIVFVHMPHVILPAC 546
+S+ FS F + + + AC
Sbjct: 511 GSALSLPAFSYGFFVIRNAKIAAC 534
>sp|Q71RP1|HPSE_RAT Heparanase OS=Rattus norvegicus GN=Hpse PE=2 SV=1
Length = 536
Score = 115 bits (288), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 130/504 (25%), Positives = 199/504 (39%), Gaps = 84/504 (16%)
Query: 91 LNSNILLNAVKAFSPLKIRLGGTLQDKVIYD--------------TEDNRQPC------K 130
L S L + SP +R GGT D +I+D ++DN C
Sbjct: 68 LGSPRLRALARGLSPAYLRFGGTKTDFLIFDPNKEPTSEERSYWQSQDNNDICGSERVSA 127
Query: 131 QFVKNSSEMFGFTQGCLPMHRW--------------DELNAFFKKSGAKIVFGLNALTGR 176
++ + F + L ++ D L +F K S ++FGLNAL
Sbjct: 128 DVLRKLQMEWPFQELLLLREQYQREFKNSTYSRSSVDMLYSFAKCSRLDLIFGLNALLR- 186
Query: 177 SIQNDGSVKGAWDYTNAESFISYTVKKNYSIHGWELGNELCG--NGVGTRVAAAQYATDT 234
+ W+ +NA+ ++Y K Y+I WELGNE + Q D
Sbjct: 187 ------TPDLRWNSSNAQLLLNYCSSKGYNI-SWELGNEPNSFWKKAHISIDGLQLGEDF 239
Query: 235 ISLRNVVQK--IYTGVDSKPLIIAPGGFFDAKWFKEFLDKSGQSLDVATHHIYNLGPGVD 292
+ L ++QK P I P G K + FL G+ +D T H Y L V
Sbjct: 240 VELHKLLQKSAFQNAKLYGPDIGQPRGK-TVKLLRSFLKAGGEVIDSLTWHHYYLNGRVA 298
Query: 293 QHLVEKILDPLYLDREVDTFSQLENTLKSSATSAVAWVGESGGAYNSGHNLVTNAFVFSF 352
E L LD + + ++ K W+GE+ AY G L+++ F F
Sbjct: 299 TK--EDFLSSDVLDTFILSVQKILKVTKEMTPGKKVWLGETSSAYGGGAPLLSDTFAAGF 356
Query: 353 WYLDQLGMAAAHDTKTYCRQSLIG-GNYGLLNTTTFVPNPDYYSALLWHRLMGRNALSTS 411
+LD+LG++A + RQ G GNY L++ F P PDY+ +LL+ +L+G L +
Sbjct: 357 MWLDKLGLSAQLGIEVVMRQVFFGAGNYHLVD-ENFEPLPDYWLSLLFKKLVGPKVLMSR 415
Query: 412 FSGT--KKIRSYAHCAK------QSKGLVLLLINLDNSTTVHASVAFNGTLTSRHKHKSL 463
G K+R Y HC + L L ++NL N T KH L
Sbjct: 416 VKGPDRSKLRVYLHCTNVYHPRYREGDLTLYVLNLHNVT----------------KHLKL 459
Query: 464 KMKIIKLPQASVGGNEREEYHLTAKDGD-LHSQTMLLNGNILSVNSIGDIPTLEPLRVKS 522
+ P ++Y L D L S+++ LNG L + +P L + +
Sbjct: 460 PPPMFSRPV--------DKYLLKPFGSDGLLSKSVQLNGQTLKMVDEQTLPALTEKPLPA 511
Query: 523 TQPVSVGPFSIVFVHMPHVILPAC 546
+SV FS F + + + AC
Sbjct: 512 GSSLSVPAFSYGFFVIRNAKIAAC 535
>sp|Q8WWQ2|HPSE2_HUMAN Inactive heparanase-2 OS=Homo sapiens GN=HPSE2 PE=1 SV=3
Length = 592
Score = 90.9 bits (224), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 108/420 (25%), Positives = 174/420 (41%), Gaps = 64/420 (15%)
Query: 153 DELNAFFKKSGAKIVFGLNALTGRSIQNDGSVKGAWDYTNAESFISYTVKKNYSIHGWEL 212
D+L F SG ++F LNAL + S ++A S + Y+ K Y+I WEL
Sbjct: 208 DKLYNFADCSGLHLIFALNALRRNPNNSWNS-------SSALSLLKYSASKKYNI-SWEL 259
Query: 213 GNELCGNGV--GTRVAAAQYATDTISLRNVVQ--KIYTGVD-SKPLIIAPGGFFDAKWFK 267
GNE G V +Q D I L++++Q +IY+ P I P A
Sbjct: 260 GNEPNNYRTMHGRAVNGSQLGKDYIQLKSLLQPIRIYSRASLYGPNIGRPRKNVIA-LLD 318
Query: 268 EFLDKSGQSLDVAT-HHIYNLGPGVDQHLVEKILDPL---YLDREVDTFSQLENTLKSSA 323
F+ +G ++D T H Y +D +V K++D L LD D +++ + +
Sbjct: 319 GFMKVAGSTVDAVTWQHCY-----IDGRVV-KVMDFLKTRLLDTLSDQIRKIQKVVNTYT 372
Query: 324 TSAVAWVGESGGAYNSGHNLVTNAFVFSFWYLDQLGMAAAHDTKTYCRQSLIGGNYGLLN 383
W+ G N +++++ F +L+ LGM A R S Y L
Sbjct: 373 PGKKIWLEGVVTTSAGGTNNLSDSYAAGFLWLNTLGMLANQGIDVVIRHSFFDHGYNHLV 432
Query: 384 TTTFVPNPDYYSALLWHRLMGRNALSTSFSGTK-----------KIRSYAHCAKQSK--- 429
F P PDY+ +LL+ RL+G L+ +G + K+R YAHC
Sbjct: 433 DQNFNPLPDYWLSLLYKRLIGPKVLAVHVAGLQRKPRPGRVIRDKLRIYAHCTNHHNHNY 492
Query: 430 ---GLVLLLINLDNSTTVHASVAFNGTLTSRHKHKSLKMKIIKLPQASVGGNEREEYHLT 486
+ L +INL S + GTL + H+ L P G
Sbjct: 493 VRGSITLFIINLHRS---RKKIKLAGTLRDKLVHQYLLQ-----PYGQEG---------- 534
Query: 487 AKDGDLHSQTMLLNGNILSVNSIGDIPTLEPLRVKSTQPVSVGPFSIVFVHMPHVILPAC 546
L S+++ LNG L + G +P L+P +++ + + + P ++ F + +V AC
Sbjct: 535 -----LKSKSVQLNGQPLVMVDDGTLPELKPRPLRAGRTLVIPPVTMGFYVVKNVNALAC 589
>sp|B2RY83|HPSE2_MOUSE Inactive heparanase-2 OS=Mus musculus GN=Hpse2 PE=2 SV=1
Length = 592
Score = 90.9 bits (224), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 107/421 (25%), Positives = 173/421 (41%), Gaps = 66/421 (15%)
Query: 153 DELNAFFKKSGAKIVFGLNALTGRSIQNDGSVKGAWDYTNAESFISYTVKKNYSIHGWEL 212
D+L F SG ++F LNAL + S ++A S + Y+ K Y+I WEL
Sbjct: 208 DKLYNFADCSGLHLIFALNALRRNPNNSWNS-------SSALSLLKYSASKKYNI-SWEL 259
Query: 213 GNELCGNGV--GTRVAAAQYATDTISLRNVVQKIYTGVDSKPLIIAPGGFFDAK----WF 266
GNE G V +Q D I L++++Q I V S+ + P K
Sbjct: 260 GNEPNNYRSIHGRAVNGSQLGKDYIQLKSLLQPIR--VYSRASLYGPNIGRPRKNVIALL 317
Query: 267 KEFLDKSGQSLDVAT-HHIYNLGPGVDQHLVEKILDPL---YLDREVDTFSQLENTLKSS 322
F+ +G ++D T H Y +D +V K++D L LD D +++ + +
Sbjct: 318 DGFMKVAGSTVDAVTWQHCY-----IDGRVV-KVMDFLKTRLLDTLSDQIRKIQKVVNTY 371
Query: 323 ATSAVAWVGESGGAYNSGHNLVTNAFVFSFWYLDQLGMAAAHDTKTYCRQSLIGGNYGLL 382
W+ G N +++++ F +L+ LGM A R S Y L
Sbjct: 372 TPGKKIWLEGVVTTSAGGTNNLSDSYAAGFLWLNTLGMLANQGIDVVIRHSFFDHGYNHL 431
Query: 383 NTTTFVPNPDYYSALLWHRLMGRNALSTSFSGTK-----------KIRSYAHCAKQSK-- 429
F P PDY+ +LL+ RL+G L+ +G + K+R YAHC
Sbjct: 432 VDQNFNPLPDYWLSLLYKRLIGPKVLAVHVAGLQRKPRPGRVIRDKLRIYAHCTNHHNHN 491
Query: 430 ----GLVLLLINLDNSTTVHASVAFNGTLTSRHKHKSLKMKIIKLPQASVGGNEREEYHL 485
+ L +INL S + GTL + H+ L P G
Sbjct: 492 YVRGSITLFIINLHRS---RKKIKLAGTLRDKLVHQYLLQ-----PYGQEG--------- 534
Query: 486 TAKDGDLHSQTMLLNGNILSVNSIGDIPTLEPLRVKSTQPVSVGPFSIVFVHMPHVILPA 545
L S+++ LNG L + G +P L+P +++ + + + P ++ F + +V A
Sbjct: 535 ------LKSKSVQLNGQPLVMVDDGTLPELKPRPLRAGRTLVIPPVTMGFYVVKNVNALA 588
Query: 546 C 546
C
Sbjct: 589 C 589
>sp|Q85056|CAPSD_AHV2H Capsid protein OS=Atkinsonella hypoxylon virus (isolate 2H) PE=3
SV=1
Length = 652
Score = 35.0 bits (79), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 40/159 (25%), Positives = 65/159 (40%), Gaps = 26/159 (16%)
Query: 307 REVDTFSQLENTLKSSATSAVAWVGE-SGGAYNSGHNLVTNAFVFSFWY---LDQLGMAA 362
RE + + +++ + AV VG GG Y S H T F + W+ L +L +A
Sbjct: 235 RESTAYEKWLDSIVIHYSRAVIRVGNLVGGLYQSSHGSTTTHFTYRNWFARSLSRLADSA 294
Query: 363 AHDTKTYCRQSLIGG---NYGLLNTTTFVPNPDYYSALLWHRLMGRNAL----------- 408
H +T+ R+ +I N +N T+ P Y LL RN
Sbjct: 295 TH--RTHLRRPMISEFDYNIPSVNNNTYNP----YVHLLMLEPNNRNITLDFIRSLSSFC 348
Query: 409 STSFSGTKKIRSYAHCAKQSKGLVLLLINLDNSTTVHAS 447
ST T+ +R H +++S + +I + T H+S
Sbjct: 349 STELKATRTLRD--HISRRSAAISRCVIKGPEAPTWHSS 385
>sp|Q8TCU4|ALMS1_HUMAN Alstrom syndrome protein 1 OS=Homo sapiens GN=ALMS1 PE=1 SV=3
Length = 4167
Score = 33.9 bits (76), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 23/96 (23%), Positives = 44/96 (45%), Gaps = 11/96 (11%)
Query: 29 SILQAEAAGGAGFVGGNVFIDRRSVIGRTDDDFVCATLDWWPPEKCDYGTCSWDRASLLN 88
S++++E G +G +G + I +VI G+CSWD +
Sbjct: 2385 SVMRSEPEGCSGTIGNKIIIPMMTVIKSDSSSDASD----------GNGSCSWDSNLPES 2434
Query: 89 LDLNSNILLNAVKAFSPLKIRLGGTLQDKVIYDTED 124
L+ S++LLN SP K + + +++ + ++ED
Sbjct: 2435 LESVSDVLLNFFPYVSP-KTSITDSREEEGVSESED 2469
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.319 0.135 0.413
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 204,778,656
Number of Sequences: 539616
Number of extensions: 8647077
Number of successful extensions: 19875
Number of sequences better than 100.0: 18
Number of HSP's better than 100.0 without gapping: 13
Number of HSP's successfully gapped in prelim test: 5
Number of HSP's that attempted gapping in prelim test: 19822
Number of HSP's gapped (non-prelim): 24
length of query: 547
length of database: 191,569,459
effective HSP length: 123
effective length of query: 424
effective length of database: 125,196,691
effective search space: 53083396984
effective search space used: 53083396984
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 64 (29.3 bits)