BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= psy5286
(311 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|91082275|ref|XP_973628.1| PREDICTED: similar to heparin sulfate O-sulfotransferase [Tribolium
castaneum]
gi|270008176|gb|EFA04624.1| heparan-sulfate-2-sulfotransferase [Tribolium castaneum]
Length = 318
Score = 389 bits (998), Expect = e-105, Method: Compositional matrix adjust.
Identities = 175/284 (61%), Positives = 225/284 (79%), Gaps = 10/284 (3%)
Query: 27 DTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWR 86
D V++YNRVPKTGSTSF+ +AYD+C+K +F+VLHVN+T NNHVLSL +Q+ FV+NVT W+
Sbjct: 43 DLVVVYNRVPKTGSTSFIGVAYDLCKKNKFHVLHVNITANNHVLSLNNQHEFVHNVTTWK 102
Query: 87 DRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKH 146
+P +YHGHF FIDF +FG +PLFINILRKPL+R +SYYYF+RYGDNYRP+LVR+KH
Sbjct: 103 AMKPGIYHGHFAFIDFTKFGG-PKPLFINILRKPLERFISYYYFVRYGDNYRPYLVRRKH 161
Query: 147 GDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVG 206
G+ +FDEC+ N +C +MWLQ+PFLCGHAA CW PGN WAL +AK+NLV YLLVG
Sbjct: 162 GNTMSFDECVEKNLPDCDPNHMWLQIPFLCGHAANCWKPGNKWALTEAKKNLVNNYLLVG 221
Query: 207 VTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEETVQQIKKSKIWEL 266
VT+E+ DFV++LE LP F+G +H+LTSNKSHLR+T +K PS TV++I++S +W++
Sbjct: 222 VTDEINDFVAVLEQTLPRIFKGAFNHYLTSNKSHLRQTVQKDAPSPTTVKKIQESTVWQM 281
Query: 267 ENELYEYALEQFHFVKKHNLVYNKVLGYEADKGKQFMYEKIYPK 310
ENELYE+AL+QFHF+KKH L DK + MYEKI PK
Sbjct: 282 ENELYEFALDQFHFIKKHTL---------KDKLQNVMYEKIRPK 316
>gi|357624176|gb|EHJ75052.1| hypothetical protein KGM_19142 [Danaus plexippus]
Length = 327
Score = 387 bits (995), Expect = e-105, Method: Compositional matrix adjust.
Identities = 178/299 (59%), Positives = 229/299 (76%), Gaps = 12/299 (4%)
Query: 13 SAKSPSPETDSLSWDTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSL 72
+ + P + D+L V+IYNRVPKTGSTSFV +AYD+C+K F VLH+N+T N HV+SL
Sbjct: 40 TTRDPPRDDDNL----VVIYNRVPKTGSTSFVGVAYDLCKKNHFKVLHINITANMHVMSL 95
Query: 73 ADQYRFVNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLR 132
++QYRF NVTKW++ +PALYHGH F++F++ G+ +PLFIN++RKPLDRLVSYYYFLR
Sbjct: 96 SNQYRFAQNVTKWQEVKPALYHGHMAFLNFERLGTNARPLFINLIRKPLDRLVSYYYFLR 155
Query: 133 YGDNYRPHLVRKKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALE 192
+GDN+RPHLVRKKHGDK TFDEC+ + +C NMWLQVPF CGHAA CW PG+PWAL+
Sbjct: 156 HGDNFRPHLVRKKHGDKMTFDECVEKGQADCDPSNMWLQVPFFCGHAAECWRPGSPWALQ 215
Query: 193 KAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSE 252
+AK NLV YL+VGVTEE+ F+S+LEA LP FRG TDH+ +SN+SHLR+T+ KI+PS+
Sbjct: 216 QAKHNLVHHYLVVGVTEEMLAFISVLEATLPRLFRGATDHYRSSNRSHLRQTSAKIEPSQ 275
Query: 253 ETVQQIKKSKIWELENELYEYALEQFHFVKKHNLVYNKVLGYEADKGKQ-FMYEKIYPK 310
TV +I++S IW++ENELYE+A E H K + EA+ Q F YEKI PK
Sbjct: 276 RTVDRIQQSVIWKMENELYEFASE-------HFKFVKKKVLKEANSAPQVFFYEKIRPK 327
>gi|157135820|ref|XP_001656685.1| heparan sulfate 2-o-sulfotransferase [Aedes aegypti]
gi|108881142|gb|EAT45367.1| AAEL003346-PA [Aedes aegypti]
Length = 362
Score = 382 bits (982), Expect = e-104, Method: Compositional matrix adjust.
Identities = 177/310 (57%), Positives = 225/310 (72%), Gaps = 10/310 (3%)
Query: 3 TQKSHQIHISSAKSPSPETDSLSWDTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVN 62
T H + + + P+ +T + V+IYNRVPKTGSTSFVN+ YD+CRK F+VLH+N
Sbjct: 51 TGARHWQTMINQEDPALKTINFDEQLVVIYNRVPKTGSTSFVNLTYDLCRKNAFHVLHIN 110
Query: 63 VTGNNHVLSLADQYRFVNNVTKWRDRRPALYHGHFGFIDFQQFG-SKEQPLFINILRKPL 121
+T N HV+SL +Q RFV NVT W +PA YHGH +++F + G +PL+IN++RKPL
Sbjct: 111 ITANMHVMSLPNQIRFVRNVTAWDAMKPAFYHGHLAYLNFAKLGVPAARPLYINLIRKPL 170
Query: 122 DRLVSYYYFLRYGDNYRPHLVRKKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAA 181
DRLVSYYYFLRYGD+YRPHLVR + GD TFDEC+ + +C NMWLQ+PF CGHAA
Sbjct: 171 DRLVSYYYFLRYGDDYRPHLVRHRAGDTMTFDECVSRQKPDCDPTNMWLQIPFFCGHAAE 230
Query: 182 CWVPGNPWALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHL 241
CW PG+ WALE+AK NLV +Y LVGVTEE+ +F+ LLE ALP FF+G TDHF S+KSHL
Sbjct: 231 CWKPGSTWALEQAKRNLVNEYFLVGVTEEMDEFIELLEVALPRFFKGATDHFRKSSKSHL 290
Query: 242 RRTNRKIDPSEETVQQIKKSKIWELENELYEYALEQFHFVKKHNLVYNKVLGYEADKG-- 299
RRT K++P ET+ QI+KS IW++ENELYE+ALEQFHF++K L + KG
Sbjct: 291 RRTKSKVEPQSETISQIQKSSIWQMENELYEFALEQFHFMQKK-------LRTPSGKGSM 343
Query: 300 KQFMYEKIYP 309
+ F YEKI P
Sbjct: 344 QDFFYEKIKP 353
>gi|195049910|ref|XP_001992787.1| GH13467 [Drosophila grimshawi]
gi|193899846|gb|EDV98712.1| GH13467 [Drosophila grimshawi]
Length = 351
Score = 378 bits (970), Expect = e-102, Method: Compositional matrix adjust.
Identities = 173/301 (57%), Positives = 229/301 (76%), Gaps = 8/301 (2%)
Query: 12 SSAKSPSPETDSLSWD--TVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHV 69
SS +P D+ +++ V++YNRVPKTGSTSFVN+AYD+C++ R++VLH+NVT N HV
Sbjct: 57 SSLAQIAPTLDNFNYEEQLVVLYNRVPKTGSTSFVNIAYDLCKQNRYHVLHINVTANMHV 116
Query: 70 LSLADQYRFVNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYY 129
LSL +Q FV NVT+W + +PALYHGH F+DF +F +P++IN++RKPLDRLVSYYY
Sbjct: 117 LSLPNQISFVRNVTRWHEMKPALYHGHMAFLDFSKFQIAHKPIYINVVRKPLDRLVSYYY 176
Query: 130 FLRYGDNYRPHLVRKKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPW 189
FLRYGDNYRP+LVRKK G+K TFDEC+ + +C +NMWLQ+PF CGHAA CW PG+ W
Sbjct: 177 FLRYGDNYRPNLVRKKAGNKITFDECVIQKQADCDPKNMWLQIPFFCGHAAECWEPGSDW 236
Query: 190 ALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKID 249
AL++AK NLV +Y LVGVTE++ +FV LLE +LP F+G +H+ SNKSHLR T+ K+
Sbjct: 237 ALKQAKRNLVNEYFLVGVTEQMYEFVDLLERSLPRIFQGFREHYQNSNKSHLRVTSSKLP 296
Query: 250 PSEETVQQIKKSKIWELENELYEYALEQFHFVKKHNLVYNKVLGYEADKGKQFMYEKIYP 309
P+E T Q I++SKIW++ENELYE+AL QF F KK K++ + +QFMYEK+ P
Sbjct: 297 PTESTKQTIQRSKIWQMENELYEFALAQFEFTKK------KLMQPDNKHLQQFMYEKVRP 350
Query: 310 K 310
K
Sbjct: 351 K 351
>gi|442754093|gb|JAA69206.1| Putative sulfotransferase [Ixodes ricinus]
Length = 366
Score = 377 bits (969), Expect = e-102, Method: Compositional matrix adjust.
Identities = 169/284 (59%), Positives = 217/284 (76%), Gaps = 1/284 (0%)
Query: 27 DTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWR 86
D VIIYNRVPKTGSTSF+ +AYD+C +F+VLH+N + N HV+SL DQ RFV NV+ W+
Sbjct: 84 DLVIIYNRVPKTGSTSFMGVAYDLCGVNKFHVLHLNTSKNMHVMSLPDQIRFVYNVSLWK 143
Query: 87 DRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKH 146
+PA+YHGH F+DF ++G K +P++INI+RKPLDRLVSY+YFLR+GD++RP+LVR++
Sbjct: 144 AMKPAIYHGHVAFLDFAKYGIKSKPIYINIIRKPLDRLVSYFYFLRHGDDFRPYLVRRRQ 203
Query: 147 GDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVG 206
G+K TFDEC+ +C+ E +WLQVPF CGHA+ CW+PGNPWALE+AK NL+ Y VG
Sbjct: 204 GNKMTFDECVLKKGVDCAEERLWLQVPFFCGHASECWIPGNPWALEQAKHNLINSYFAVG 263
Query: 207 VTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEETVQQIKKSKIWEL 266
+TEEL DFVSLLE + P F+G T FLT K+HLR+T K DPS+ETV+ KKS+IW++
Sbjct: 264 LTEELQDFVSLLEVSFPRIFKGATAKFLTGKKAHLRKTFNKQDPSDETVESFKKSRIWQM 323
Query: 267 ENELYEYALEQFHFVKKHNLVYNKVLGYEADKGKQFMYEKIYPK 310
ENE YE+A EQF FVKK L G + G+QF YEKI PK
Sbjct: 324 ENEFYEFAAEQFDFVKKRTLATQNG-GEATELGQQFFYEKIRPK 366
>gi|383858144|ref|XP_003704562.1| PREDICTED: heparin sulfate O-sulfotransferase-like [Megachile
rotundata]
Length = 343
Score = 377 bits (968), Expect = e-102, Method: Compositional matrix adjust.
Identities = 170/284 (59%), Positives = 217/284 (76%), Gaps = 5/284 (1%)
Query: 27 DTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWR 86
+ +IIYNRVPKT STSFV + YD+C++ +++VLH+NVT N H L+LA+Q +FVNNVT W
Sbjct: 65 EIIIIYNRVPKTASTSFVGLVYDLCKQNKYHVLHINVTNNMHTLTLANQIQFVNNVTGWN 124
Query: 87 DRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKH 146
++PA YHGH F+DF++FG ++ PL+IN+LRKPLDR VSYYYFLRYGDN+RPHL+RKKH
Sbjct: 125 AKKPAFYHGHIAFLDFEKFGVQQTPLYINLLRKPLDRFVSYYYFLRYGDNFRPHLIRKKH 184
Query: 147 GDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVG 206
GD TFDECI + +C +NMWLQ+PFLCGH ACW GN WALE+AK NL YLLVG
Sbjct: 185 GDTKTFDECIDAGQPDCDPDNMWLQIPFLCGHDPACWEIGNSWALEEAKRNLQKHYLLVG 244
Query: 207 VTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEETVQQIKKSKIWEL 266
VTEELT+FV L+ LP FF+G + FL SNKSHLR+T +KI+P ET+++I+KS +W++
Sbjct: 245 VTEELTEFVETLQIVLPRFFKGAYNSFLHSNKSHLRQTTQKINPRPETIEKIQKSVVWKM 304
Query: 267 ENELYEYALEQFHFVKKHNLVYNKVLGYEADKGKQFMYEKIYPK 310
ENELY +AL FH VKK + D ++FMYEKI PK
Sbjct: 305 ENELYNFALMHFHAVKK-----RLINASPQDINQRFMYEKIRPK 343
>gi|195398113|ref|XP_002057669.1| GJ17976 [Drosophila virilis]
gi|194141323|gb|EDW57742.1| GJ17976 [Drosophila virilis]
Length = 344
Score = 376 bits (966), Expect = e-102, Method: Compositional matrix adjust.
Identities = 174/301 (57%), Positives = 227/301 (75%), Gaps = 8/301 (2%)
Query: 12 SSAKSPSPETDSLSWD--TVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHV 69
SS P D+ ++ V++YNRVPKTGSTSFVN+AYD+C++ R++VLH+NVT N HV
Sbjct: 50 SSLARNIPTLDNFDYEDQLVVVYNRVPKTGSTSFVNIAYDLCKQNRYHVLHINVTANMHV 109
Query: 70 LSLADQYRFVNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYY 129
LSL +Q FV NVT W + +PALYHGH F+DF +F +P++IN++RKPLDRLVSYYY
Sbjct: 110 LSLPNQISFVRNVTTWHEMKPALYHGHMAFLDFSKFQIAHKPIYINLVRKPLDRLVSYYY 169
Query: 130 FLRYGDNYRPHLVRKKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPW 189
FLRYGDNYRP+LVRKK G+K TFDEC+ + +C NMWLQ+PF CGHAA CW PG+ W
Sbjct: 170 FLRYGDNYRPNLVRKKAGNKITFDECVVQKQPDCDPRNMWLQIPFFCGHAAECWEPGSDW 229
Query: 190 ALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKID 249
AL++AK NLV +Y LVGVTE++ +FV LLE +LP F+G +H+ SNKSHLR T+ K+
Sbjct: 230 ALKQAKHNLVNEYFLVGVTEQMYEFVDLLERSLPRIFQGFREHYQNSNKSHLRVTSSKLP 289
Query: 250 PSEETVQQIKKSKIWELENELYEYALEQFHFVKKHNLVYNKVLGYEADKGKQFMYEKIYP 309
PSE T+Q I++SKIW++EN+LYE+ALEQF F +K K++ + +QFMYEKI P
Sbjct: 290 PSESTIQTIQRSKIWQMENDLYEFALEQFEFTRK------KLMQPDNKHLQQFMYEKIRP 343
Query: 310 K 310
K
Sbjct: 344 K 344
>gi|443728063|gb|ELU14538.1| hypothetical protein CAPTEDRAFT_223387 [Capitella teleta]
Length = 309
Score = 375 bits (962), Expect = e-101, Method: Compositional matrix adjust.
Identities = 167/284 (58%), Positives = 221/284 (77%), Gaps = 2/284 (0%)
Query: 27 DTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWR 86
DTV+IYNRVPKTGSTSF +AYD+C + +FNVLH+NV+ NNHVL L+DQ RFV N+T W
Sbjct: 28 DTVLIYNRVPKTGSTSFAGVAYDLCVQNKFNVLHLNVSKNNHVLGLSDQRRFVLNITHWE 87
Query: 87 DRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKH 146
++PALYHGH ++ F +FG QPL+INI+R PLDRLVSYYYFLR+GD++RP+L R +
Sbjct: 88 SKKPALYHGHLAYLPFSRFGISRQPLYINIIRDPLDRLVSYYYFLRHGDDFRPYLKRSRS 147
Query: 147 GDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVG 206
G+K TFDEC+ + +C N+W+Q+PF CGHAA CW+PGNPWALE AK NLV YL+VG
Sbjct: 148 GNKETFDECVASDGEDCDPMNLWMQIPFFCGHAAECWIPGNPWALETAKYNLVHNYLVVG 207
Query: 207 VTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEETVQQIKKSKIWEL 266
+TEEL DFV++LEA LP FF G T+ F + +KSHLR+T+ K+ PS+ET+ +I+++ W++
Sbjct: 208 LTEELGDFVAILEATLPRFFSGATELFNSGHKSHLRKTSSKVPPSQETLDKIQQTVYWKM 267
Query: 267 ENELYEYALEQFHFVKKHNLVYNKVLGYEADKGKQFMYEKIYPK 310
E+E YE+A EQFHF+KK L V G +++ QFMYEKI P+
Sbjct: 268 EHEFYEFAKEQFHFIKKRTL--QSVDGVLSERKPQFMYEKIRPR 309
>gi|241087441|ref|XP_002409196.1| heparan sulfate 2-O-sulfotransferase, putative [Ixodes scapularis]
gi|215492665|gb|EEC02306.1| heparan sulfate 2-O-sulfotransferase, putative [Ixodes scapularis]
Length = 366
Score = 374 bits (961), Expect = e-101, Method: Compositional matrix adjust.
Identities = 167/284 (58%), Positives = 216/284 (76%), Gaps = 1/284 (0%)
Query: 27 DTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWR 86
D VIIYNRVPKTGSTSF+ +AYD+C +F+VLH+N + N HV+SL DQ RFV NV+ W+
Sbjct: 84 DLVIIYNRVPKTGSTSFMGVAYDLCGVNKFHVLHLNTSKNMHVMSLPDQIRFVYNVSLWK 143
Query: 87 DRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKH 146
+PA+YHGH F+DF ++G K +P++INI+RKPLDRLVSY+YFLR+GD++RP+LVR++
Sbjct: 144 AMKPAIYHGHVAFLDFAKYGVKSKPIYINIIRKPLDRLVSYFYFLRHGDDFRPYLVRRRQ 203
Query: 147 GDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVG 206
G+K TFDEC+ +C+ E +WLQVPF CGHA+ CW+PGN WALE+AK NL+ Y VG
Sbjct: 204 GNKMTFDECVLKKGVDCAEERLWLQVPFFCGHASECWIPGNLWALEQAKHNLINSYFAVG 263
Query: 207 VTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEETVQQIKKSKIWEL 266
+TEEL DFVSLLE + P F+G T FLT K+HLR+T K DPS+ETV+ KKS+IW++
Sbjct: 264 LTEELQDFVSLLEVSFPRIFKGATAKFLTGKKAHLRKTFNKQDPSDETVESFKKSRIWQM 323
Query: 267 ENELYEYALEQFHFVKKHNLVYNKVLGYEADKGKQFMYEKIYPK 310
ENE YE+A EQF FV+K L G + G+QF YEKI PK
Sbjct: 324 ENEFYEFAAEQFDFVRKRTLATQNA-GEATELGQQFFYEKIRPK 366
>gi|242014869|ref|XP_002428105.1| Heparin sulfate O-sulfotransferase, putative [Pediculus humanus
corporis]
gi|212512636|gb|EEB15367.1| Heparin sulfate O-sulfotransferase, putative [Pediculus humanus
corporis]
Length = 349
Score = 373 bits (957), Expect = e-101, Method: Compositional matrix adjust.
Identities = 171/287 (59%), Positives = 221/287 (77%), Gaps = 3/287 (1%)
Query: 25 SWDTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTK 84
S D V+IYNRVPKTGSTSFVN+AY++C+K F +LH+NVTGN H+LS+ +Q +FV+NVT+
Sbjct: 65 SEDIVVIYNRVPKTGSTSFVNVAYELCKKNHFKILHINVTGNLHLLSMKNQIKFVHNVTE 124
Query: 85 WRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRK 144
W +PALYHGH FIDF++F + PL+INI+RKPLDRLVSYYYF+RYGDNYRP+LVRK
Sbjct: 125 WDAMKPALYHGHMAFIDFKKFRIDKTPLYINIIRKPLDRLVSYYYFVRYGDNYRPNLVRK 184
Query: 145 KHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLL 204
KHGDK +FDEC+ N +C+ +NMWLQ+PF CGHAA CW G+ WALE+AK NLV Y
Sbjct: 185 KHGDKLSFDECVAKNMPDCNYDNMWLQIPFFCGHAAECWEVGSNWALEEAKRNLVRNYFA 244
Query: 205 VGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEETVQQIKKSKIW 264
VGVTEE+ +F+ L+E LP F+G T H+L SN+SHLR T +K+ PSE TV +I+KSK+W
Sbjct: 245 VGVTEEMEEFIRLMEMILPRMFKGATQHYLNSNRSHLRETTQKVMPSETTVAKIQKSKVW 304
Query: 265 ELENELYEYALEQFHF-VKKHNLVYNKVLGYEADKGKQFMYEKIYPK 310
E+E E Y +AL+QF F ++K N+ + +K + F YEKI PK
Sbjct: 305 EMEQEFYNFALQQFKFSLRKLNI--KRESNNPNEKEQIFFYEKIRPK 349
>gi|332376206|gb|AEE63243.1| unknown [Dendroctonus ponderosae]
Length = 335
Score = 372 bits (956), Expect = e-101, Method: Compositional matrix adjust.
Identities = 168/293 (57%), Positives = 221/293 (75%), Gaps = 10/293 (3%)
Query: 18 SPETDSLSWDTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYR 77
SP +LS + V++YNRVPKTGSTSFV +AYD+C+K +++VLHVNVTGN+HVLSL +Q +
Sbjct: 51 SPAKPNLSDNLVVLYNRVPKTGSTSFVGIAYDLCKKGQYHVLHVNVTGNSHVLSLTNQIK 110
Query: 78 FVNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNY 137
FV NVT+W +PALYHGHF +IDF +FG +PL+IN++RKPLDR +SYYYFLRYGDN+
Sbjct: 111 FVTNVTEWGSMKPALYHGHFAYIDFSKFGV-FKPLYINVIRKPLDRFISYYYFLRYGDNF 169
Query: 138 RPHLVRKKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKEN 197
RP+LVR+K G+ TFDEC++ EC MWLQVPF CGHA+ CW PGN WAL +AK+N
Sbjct: 170 RPYLVRRKAGNTMTFDECVQQKLPECDPNAMWLQVPFFCGHASNCWKPGNKWALTEAKKN 229
Query: 198 LVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEETVQQ 257
LV Y LVGVTEEL DF+++LE +LP F+G +H+ SNKSHLR+T +K PS ETV +
Sbjct: 230 LVNNYFLVGVTEELEDFIAVLETSLPRIFKGALEHYTGSNKSHLRQTVQKNSPSVETVNK 289
Query: 258 IKKSKIWELENELYEYALEQFHFVKKHNLVYNKVLGYEADKGKQFMYEKIYPK 310
K + +W++ENE YE+AL+ FHF+KK L ++ + +YEKI P+
Sbjct: 290 FKSNSVWQMENEFYEFALDNFHFIKKQTL---------KNQQQHVIYEKIRPR 333
>gi|350422926|ref|XP_003493331.1| PREDICTED: heparin sulfate O-sulfotransferase-like [Bombus
impatiens]
Length = 343
Score = 372 bits (955), Expect = e-100, Method: Compositional matrix adjust.
Identities = 167/284 (58%), Positives = 214/284 (75%), Gaps = 5/284 (1%)
Query: 27 DTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWR 86
D +IIYNRVPKT STSFV + YD+C+ +++VLH+NV+ N H L+L +Q +F NN+T W
Sbjct: 65 DLIIIYNRVPKTASTSFVGLVYDLCKHNKYHVLHINVSNNMHTLTLPNQIQFANNITVWN 124
Query: 87 DRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKH 146
++PA YHGH F++F++FG ++ PL+INILRKPLDR VSYYYFLRYGDN+RPHL+RKKH
Sbjct: 125 TKKPAFYHGHVAFLNFEKFGVRQSPLYINILRKPLDRFVSYYYFLRYGDNFRPHLIRKKH 184
Query: 147 GDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVG 206
GD TFDECI + +C +NMWLQ+PFLCGH ACW GN WALE+AK NL YLLVG
Sbjct: 185 GDTKTFDECIEAGQPDCDPDNMWLQIPFLCGHDPACWEIGNSWALEEAKRNLQKHYLLVG 244
Query: 207 VTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEETVQQIKKSKIWEL 266
VTEELT+FV +L+ LP FFRG + FL +NKSHLR+T +KI+P ET+ +I+KS +W++
Sbjct: 245 VTEELTEFVEILQIVLPRFFRGAYNSFLHNNKSHLRQTTQKINPRPETIDKIQKSVVWKM 304
Query: 267 ENELYEYALEQFHFVKKHNLVYNKVLGYEADKGKQFMYEKIYPK 310
ENELY +AL FH VKK + D ++FMYEKI PK
Sbjct: 305 ENELYNFALMHFHAVKK-----RLINASPQDVNQRFMYEKIRPK 343
>gi|170040072|ref|XP_001847836.1| heparin sulfate O-sulfotransferase [Culex quinquefasciatus]
gi|167863648|gb|EDS27031.1| heparin sulfate O-sulfotransferase [Culex quinquefasciatus]
Length = 355
Score = 372 bits (955), Expect = e-100, Method: Compositional matrix adjust.
Identities = 171/284 (60%), Positives = 212/284 (74%), Gaps = 10/284 (3%)
Query: 29 VIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWRDR 88
V+IYNRVPKTGSTSFVN+ YD+CRK F+VLH+N+T N HV SL +Q RFV NVT W
Sbjct: 71 VVIYNRVPKTGSTSFVNLTYDLCRKNAFHVLHINITANMHVFSLPNQIRFVRNVTAWEAM 130
Query: 89 RPALYHGHFGFIDFQQFG-SKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKHG 147
+PA YHGH ++DF + G +PL+IN++RKPLDRLVSYYYFLRYGD+YRPHLVR + G
Sbjct: 131 KPAFYHGHLAYLDFSKMGVPAAKPLYINLVRKPLDRLVSYYYFLRYGDDYRPHLVRHRAG 190
Query: 148 DKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVGV 207
D TFDEC+ + +C NMWLQ+PF CGHAA CW PG+ WAL++AK NLV +Y LVGV
Sbjct: 191 DTMTFDECVAKQKQDCDPNNMWLQIPFFCGHAAECWKPGSAWALQEAKRNLVNEYFLVGV 250
Query: 208 TEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEETVQQIKKSKIWELE 267
TEE+T+FV LLE ALP FRG +DHF SNKSHLR+T K++P ETV +I++S +W++E
Sbjct: 251 TEEMTEFVELLEMALPRLFRGASDHFAKSNKSHLRKTKSKVEPLPETVAKIQQSLVWQME 310
Query: 268 NELYEYALEQFHF--VKKHNLVYNKVLGYEADKGKQFMYEKIYP 309
NELY+YALEQFHF +K H N + + F YEKI P
Sbjct: 311 NELYQYALEQFHFAQMKLHAPGKNAL-------PQDFFYEKIKP 347
>gi|340727531|ref|XP_003402095.1| PREDICTED: heparin sulfate O-sulfotransferase-like [Bombus
terrestris]
Length = 343
Score = 372 bits (955), Expect = e-100, Method: Compositional matrix adjust.
Identities = 167/284 (58%), Positives = 214/284 (75%), Gaps = 5/284 (1%)
Query: 27 DTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWR 86
D +IIYNRVPKT STSFV + YD+C+ +++VLH+NV+ N H L+L +Q +F NN+T W
Sbjct: 65 DLIIIYNRVPKTASTSFVGLVYDLCKHNKYHVLHINVSNNMHTLTLPNQIQFANNITVWN 124
Query: 87 DRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKH 146
++PA YHGH F++F++FG ++ PL+INILRKPLDR VSYYYFLRYGDN+RPHL+RKKH
Sbjct: 125 TKKPAFYHGHVAFLNFEKFGVRQSPLYINILRKPLDRFVSYYYFLRYGDNFRPHLIRKKH 184
Query: 147 GDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVG 206
GD TFDECI + +C +NMWLQ+PFLCGH ACW GN WALE+AK NL YLLVG
Sbjct: 185 GDTKTFDECIEAGQPDCDPDNMWLQIPFLCGHDPACWEIGNSWALEEAKRNLQKHYLLVG 244
Query: 207 VTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEETVQQIKKSKIWEL 266
VTEELT+FV +L+ LP FFRG + FL +NKSHLR+T +KI+P ET+ +I+KS +W++
Sbjct: 245 VTEELTEFVEILQIVLPRFFRGAYNSFLHNNKSHLRQTTQKINPRPETIDKIQKSVVWKM 304
Query: 267 ENELYEYALEQFHFVKKHNLVYNKVLGYEADKGKQFMYEKIYPK 310
ENELY +AL FH VKK + D ++FMYEKI PK
Sbjct: 305 ENELYNFALMHFHAVKK-----RLINASPQDVNQRFMYEKIRPK 343
>gi|340727533|ref|XP_003402096.1| PREDICTED: heparin sulfate O-sulfotransferase-like [Bombus
terrestris]
Length = 309
Score = 372 bits (954), Expect = e-100, Method: Compositional matrix adjust.
Identities = 167/284 (58%), Positives = 214/284 (75%), Gaps = 5/284 (1%)
Query: 27 DTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWR 86
D +IIYNRVPKT STSFV + YD+C+ +++VLH+NV+ N H L+L +Q +F NN+T W
Sbjct: 31 DLIIIYNRVPKTASTSFVGLVYDLCKHNKYHVLHINVSNNMHTLTLPNQIQFANNITVWN 90
Query: 87 DRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKH 146
++PA YHGH F++F++FG ++ PL+INILRKPLDR VSYYYFLRYGDN+RPHL+RKKH
Sbjct: 91 TKKPAFYHGHVAFLNFEKFGVRQSPLYINILRKPLDRFVSYYYFLRYGDNFRPHLIRKKH 150
Query: 147 GDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVG 206
GD TFDECI + +C +NMWLQ+PFLCGH ACW GN WALE+AK NL YLLVG
Sbjct: 151 GDTKTFDECIEAGQPDCDPDNMWLQIPFLCGHDPACWEIGNSWALEEAKRNLQKHYLLVG 210
Query: 207 VTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEETVQQIKKSKIWEL 266
VTEELT+FV +L+ LP FFRG + FL +NKSHLR+T +KI+P ET+ +I+KS +W++
Sbjct: 211 VTEELTEFVEILQIVLPRFFRGAYNSFLHNNKSHLRQTTQKINPRPETIDKIQKSVVWKM 270
Query: 267 ENELYEYALEQFHFVKKHNLVYNKVLGYEADKGKQFMYEKIYPK 310
ENELY +AL FH VKK + D ++FMYEKI PK
Sbjct: 271 ENELYNFALMHFHAVKKR-----LINASPQDVNQRFMYEKIRPK 309
>gi|66562961|ref|XP_623840.1| PREDICTED: heparin sulfate O-sulfotransferase [Apis mellifera]
Length = 343
Score = 372 bits (954), Expect = e-100, Method: Compositional matrix adjust.
Identities = 168/284 (59%), Positives = 215/284 (75%), Gaps = 5/284 (1%)
Query: 27 DTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWR 86
D +IIYNRVPKT STSFV + YD+C+ +++VLH+NV+ N H L+LA+Q +F NN+T W
Sbjct: 65 DLIIIYNRVPKTASTSFVGLVYDLCKHNKYHVLHINVSNNMHTLTLANQIQFANNITVWN 124
Query: 87 DRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKH 146
++PA YHGH F++F++FG K+ PL+IN+LRKPLDR VSYYYFLRYGDN+RPHL+RKKH
Sbjct: 125 TKKPAFYHGHVAFLNFEKFGIKQTPLYINLLRKPLDRFVSYYYFLRYGDNFRPHLIRKKH 184
Query: 147 GDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVG 206
GD TFDECI + +C +NMWLQ+PFLCGH ACW GN WALE+AK NL YLLVG
Sbjct: 185 GDTKTFDECIDAGQPDCDPDNMWLQIPFLCGHDPACWEIGNSWALEEAKRNLQKHYLLVG 244
Query: 207 VTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEETVQQIKKSKIWEL 266
VTEELT+FV +L+ LP FF+G + FL +NKSHLR+T +KI+P ETV +I+KS +W++
Sbjct: 245 VTEELTEFVEILQIVLPRFFKGAYNSFLHNNKSHLRQTTQKINPRPETVDKIQKSVVWKM 304
Query: 267 ENELYEYALEQFHFVKKHNLVYNKVLGYEADKGKQFMYEKIYPK 310
ENELY +AL FH VKK + D ++FMYEKI PK
Sbjct: 305 ENELYNFALMHFHAVKK-----RFINASPQDINQRFMYEKIRPK 343
>gi|380027890|ref|XP_003697648.1| PREDICTED: heparin sulfate O-sulfotransferase-like [Apis florea]
Length = 355
Score = 372 bits (954), Expect = e-100, Method: Compositional matrix adjust.
Identities = 168/284 (59%), Positives = 215/284 (75%), Gaps = 5/284 (1%)
Query: 27 DTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWR 86
D +IIYNRVPKT STSFV + YD+C+ +++VLH+NV+ N H L+LA+Q +F NN+T W
Sbjct: 77 DLIIIYNRVPKTASTSFVGLVYDLCKHNKYHVLHINVSNNMHTLTLANQIQFANNITVWN 136
Query: 87 DRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKH 146
++PA YHGH F++F++FG K+ PL+IN+LRKPLDR VSYYYFLRYGDN+RPHL+RKKH
Sbjct: 137 TKKPAFYHGHVAFLNFEKFGIKQTPLYINLLRKPLDRFVSYYYFLRYGDNFRPHLIRKKH 196
Query: 147 GDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVG 206
GD TFDECI + +C +NMWLQ+PFLCGH ACW GN WALE+AK NL YLLVG
Sbjct: 197 GDTKTFDECIDAGQPDCDPDNMWLQIPFLCGHDPACWEIGNSWALEEAKRNLQKHYLLVG 256
Query: 207 VTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEETVQQIKKSKIWEL 266
VTEELT+FV +L+ LP FF+G + FL +NKSHLR+T +KI+P ETV +I+KS +W++
Sbjct: 257 VTEELTEFVEILQIVLPRFFKGAYNSFLHNNKSHLRQTTQKINPRPETVDKIQKSVVWKM 316
Query: 267 ENELYEYALEQFHFVKKHNLVYNKVLGYEADKGKQFMYEKIYPK 310
ENELY +AL FH VKK + D ++FMYEKI PK
Sbjct: 317 ENELYNFALMHFHAVKK-----RFINASPQDINQRFMYEKIRPK 355
>gi|195436622|ref|XP_002066256.1| GK18194 [Drosophila willistoni]
gi|194162341|gb|EDW77242.1| GK18194 [Drosophila willistoni]
Length = 348
Score = 371 bits (952), Expect = e-100, Method: Compositional matrix adjust.
Identities = 173/305 (56%), Positives = 230/305 (75%), Gaps = 12/305 (3%)
Query: 8 QIHISSAKSPSPETDSLSWD--TVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTG 65
Q+ +S P+P TD ++ V++YNRVPKTGSTSFVN+AYD+C+ +++VLH+NVT
Sbjct: 54 QVQVSG---PAP-TDVYDYEEQLVVLYNRVPKTGSTSFVNIAYDLCKLNKYHVLHINVTA 109
Query: 66 NNHVLSLADQYRFVNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLV 125
N HVLSL +Q FV NVTKW + +PALYHGH F+DF +F +P++IN++RKPLDRLV
Sbjct: 110 NMHVLSLPNQISFVRNVTKWHEMKPALYHGHMAFLDFSKFQIAHKPIYINLVRKPLDRLV 169
Query: 126 SYYYFLRYGDNYRPHLVRKKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVP 185
SYYYFLR+GDNYRP+LVRKK G+K TFDEC+ + +C +NMWLQ+PF CGHAA CW P
Sbjct: 170 SYYYFLRFGDNYRPNLVRKKAGNKITFDECVVQKQADCDPKNMWLQIPFFCGHAAECWEP 229
Query: 186 GNPWALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTN 245
G+ WAL++AK NLV +Y LVGVTE++ DFV LLE +LP F G +H+ SNKSHLR T+
Sbjct: 230 GSVWALDQAKRNLVNEYFLVGVTEQMYDFVDLLERSLPRIFHGFREHYQNSNKSHLRVTS 289
Query: 246 RKIDPSEETVQQIKKSKIWELENELYEYALEQFHFVKKHNLVYNKVLGYEADKGKQFMYE 305
K+ PSE T++ I+K+KIW++EN+LYE+AL QF F K+ K++ + ++FMYE
Sbjct: 290 SKLPPSEATIKAIQKTKIWQMENDLYEFALAQFEFTKR------KLMQPDNKHLQKFMYE 343
Query: 306 KIYPK 310
KI PK
Sbjct: 344 KIRPK 348
>gi|198473353|ref|XP_001356264.2| GA10178 [Drosophila pseudoobscura pseudoobscura]
gi|198139418|gb|EAL33327.2| GA10178 [Drosophila pseudoobscura pseudoobscura]
Length = 348
Score = 370 bits (951), Expect = e-100, Method: Compositional matrix adjust.
Identities = 171/300 (57%), Positives = 227/300 (75%), Gaps = 8/300 (2%)
Query: 13 SAKSPSPETDSLSWD--TVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVL 70
SA + DS ++ V++YNRVPKTGSTSFVN+AYD+C+ +++VLH+NVT N HVL
Sbjct: 55 SAPVATAAVDSFDYEEQLVVLYNRVPKTGSTSFVNIAYDLCKLNKYHVLHINVTANMHVL 114
Query: 71 SLADQYRFVNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYF 130
SL +Q FV NV+KW + +PALYHGH F+DF +F +P++IN++RKPLDRLVSYYYF
Sbjct: 115 SLPNQIAFVRNVSKWHEMKPALYHGHMAFLDFSKFQIAHKPIYINLVRKPLDRLVSYYYF 174
Query: 131 LRYGDNYRPHLVRKKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWA 190
LR+GDNYRP+LVRKK G+K TFDEC+ + +C +NMWLQ+PF CGHAA CW PG+ WA
Sbjct: 175 LRFGDNYRPNLVRKKAGNKITFDECVVQKQPDCDPKNMWLQIPFFCGHAAECWEPGSDWA 234
Query: 191 LEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDP 250
L++AK NLV +Y LVGVTE++ +FV LLE +LP F G +H+ TSNKSHLR T+ K+ P
Sbjct: 235 LDQAKRNLVNEYFLVGVTEQMYEFVDLLERSLPRIFHGFREHYQTSNKSHLRVTSSKLPP 294
Query: 251 SEETVQQIKKSKIWELENELYEYALEQFHFVKKHNLVYNKVLGYEADKGKQFMYEKIYPK 310
SE T++ I+K+KIW++EN+LYE+AL QF F KK K++ + ++FMYEKI PK
Sbjct: 295 SESTIKSIQKTKIWQMENDLYEFALAQFEFNKK------KLMQPDNKHLQKFMYEKIRPK 348
>gi|410924598|ref|XP_003975768.1| PREDICTED: heparan sulfate 2-O-sulfotransferase 1-like, partial
[Takifugu rubripes]
Length = 318
Score = 370 bits (950), Expect = e-100, Method: Compositional matrix adjust.
Identities = 168/292 (57%), Positives = 220/292 (75%), Gaps = 3/292 (1%)
Query: 19 PETDSLSWDTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRF 78
PE+D D VIIYNRVPKT STSF N+AYD+C K R++VLH+N T NN V+S+ DQ RF
Sbjct: 29 PESDGED-DVVIIYNRVPKTASTSFTNIAYDLCGKNRYHVLHINTTKNNPVMSIQDQVRF 87
Query: 79 VNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYR 138
V NVT+WR+ +PA YHGH F+DF +FG K +P++IN++R P++RLVSYYYFLR+GD+YR
Sbjct: 88 VKNVTQWREMKPAFYHGHVSFLDFTKFGLKRKPIYINVIRDPIERLVSYYYFLRFGDDYR 147
Query: 139 PHLVRKKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENL 198
P L R+K GDK TFDEC+ ++C+ E +WLQ+PF CGH + CW G+ WALE+AK NL
Sbjct: 148 PGLRRRKQGDKKTFDECVSAGGSDCAPEKLWLQIPFFCGHYSECWNAGSQWALEQAKYNL 207
Query: 199 VTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEETVQQI 258
V +YLLVGVTEEL DFV +LEAALP FFRG T+ + T KSHLR+T+ K P++E++ ++
Sbjct: 208 VNEYLLVGVTEELEDFVMMLEAALPRFFRGATELYRTGKKSHLRKTSEKKPPTKESIAKL 267
Query: 259 KKSKIWELENELYEYALEQFHFVKKHNLVYNKVLGYEADKGKQFMYEKIYPK 310
++S IW++ENE YE+ALEQF FV+ H + Y + F YEKIYPK
Sbjct: 268 QQSAIWKMENEFYEFALEQFQFVRAHAVREKDGELYLL--AQNFFYEKIYPK 317
>gi|307181339|gb|EFN68973.1| Heparin sulfate O-sulfotransferase [Camponotus floridanus]
Length = 345
Score = 369 bits (947), Expect = e-100, Method: Compositional matrix adjust.
Identities = 165/299 (55%), Positives = 218/299 (72%), Gaps = 5/299 (1%)
Query: 12 SSAKSPSPETDSLSWDTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLS 71
+ A +P S + D ++IYNRVPKT STSFV + YD+C++ +++VLH+NVT N H L+
Sbjct: 52 NDAHVKTPSNGSDTEDIIVIYNRVPKTASTSFVGLVYDLCKQNKYHVLHINVTNNMHTLT 111
Query: 72 LADQYRFVNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFL 131
L++Q +F NN+T W +PA YHGH F++F +FG PL+IN+LRKPLDR VSYYYFL
Sbjct: 112 LSNQVQFANNITNWNSIKPAFYHGHMAFLNFGKFGINHTPLYINLLRKPLDRFVSYYYFL 171
Query: 132 RYGDNYRPHLVRKKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWAL 191
RYGDN+RPHL+RKKHGD TFDECI + + +C NMWLQ+PFLCGH ACW GN WAL
Sbjct: 172 RYGDNFRPHLIRKKHGDTKTFDECINIGQPDCDPNNMWLQIPFLCGHDPACWEVGNSWAL 231
Query: 192 EKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPS 251
++AK NL Y L+GVTEEL DFV +LE LP FF+G + FL +NKSHLR+T +K++P
Sbjct: 232 DEAKRNLQRHYFLIGVTEELNDFVEVLENVLPRFFKGAYNFFLHNNKSHLRQTTQKLNPL 291
Query: 252 EETVQQIKKSKIWELENELYEYALEQFHFVKKHNLVYNKVLGYEADKGKQFMYEKIYPK 310
ETV++I++S +W++ENELY +ALE FH VK+ + D ++FMYEKI PK
Sbjct: 292 PETVEKIQQSVVWKMENELYNFALEHFHAVKRR-----LINASPQDANQRFMYEKIRPK 345
>gi|194762182|ref|XP_001963235.1| GF15842 [Drosophila ananassae]
gi|190616932|gb|EDV32456.1| GF15842 [Drosophila ananassae]
Length = 355
Score = 368 bits (945), Expect = 2e-99, Method: Compositional matrix adjust.
Identities = 168/300 (56%), Positives = 224/300 (74%), Gaps = 6/300 (2%)
Query: 11 ISSAKSPSPETDSLSWDTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVL 70
+ PS + V++YNRVPKTGSTSFVN+AYD+C+ +F+VLH+NVT N HVL
Sbjct: 62 VPDPHGPSADDFDFEEQLVVLYNRVPKTGSTSFVNIAYDLCKPNKFHVLHINVTANMHVL 121
Query: 71 SLADQYRFVNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYF 130
SL +Q +FV NV+KW + +PALYHGH ++DF +F +P++IN++RKPLDRLVSYYYF
Sbjct: 122 SLPNQIQFVRNVSKWHEMKPALYHGHMAYLDFSKFQIAHKPIYINLVRKPLDRLVSYYYF 181
Query: 131 LRYGDNYRPHLVRKKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWA 190
LR+GDNYRP+LVRKK G+K TFDEC+ + +C +NMWLQ+PF CGHAA CW PG+ WA
Sbjct: 182 LRFGDNYRPNLVRKKAGNKITFDECVVQKQPDCDPKNMWLQIPFFCGHAAECWEPGSAWA 241
Query: 191 LEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDP 250
L++AK NLV +Y LVGVTE++ +FV LLE +LP F G +H+ SNKSHLR T+ K+ P
Sbjct: 242 LDQAKRNLVNEYFLVGVTEQMYEFVDLLERSLPRIFHGFREHYQNSNKSHLRVTSSKLPP 301
Query: 251 SEETVQQIKKSKIWELENELYEYALEQFHFVKKHNLVYNKVLGYEADKGKQFMYEKIYPK 310
SE T++ I+K+KIW++EN+LYE+AL QF F KK K++ + ++FMYEKI PK
Sbjct: 302 SESTIKAIQKTKIWQMENDLYEFALAQFEFNKK------KLMQPDNKHLQKFMYEKIRPK 355
>gi|158296984|ref|XP_317295.4| AGAP008166-PA [Anopheles gambiae str. PEST]
gi|157014976|gb|EAA12392.4| AGAP008166-PA [Anopheles gambiae str. PEST]
Length = 360
Score = 368 bits (944), Expect = 2e-99, Method: Compositional matrix adjust.
Identities = 163/283 (57%), Positives = 210/283 (74%), Gaps = 6/283 (2%)
Query: 29 VIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWRDR 88
VIIYNRVPKTGSTSFVN+ YD+C+K F+VLH+N+T N VLSL +Q +FV N+T W
Sbjct: 78 VIIYNRVPKTGSTSFVNLTYDLCKKNAFHVLHINITANMQVLSLPNQLKFVRNITAWDSM 137
Query: 89 RPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKHGD 148
+PA YHGH F+DF +FG +PL+IN++R+PLDRLVSYYYFLRYGD+YRP+LVR + GD
Sbjct: 138 KPAFYHGHMAFLDFSKFGMPSKPLYINLIRQPLDRLVSYYYFLRYGDDYRPYLVRHRAGD 197
Query: 149 KTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVGVT 208
TFDEC+ + +C NMWLQ+PF CGH A CW PG+ WALE+AK NL Y LVG+T
Sbjct: 198 TMTFDECVARQKPDCDPTNMWLQIPFFCGHHAECWNPGSSWALEQAKRNLANDYFLVGLT 257
Query: 209 EELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEETVQQIKKSKIWELEN 268
EE+ +F+ LLE +LP +RG HF SNKSHLR+T K++P+ ETV +IK+S +W++EN
Sbjct: 258 EEMDEFIELLELSLPRLYRGAVTHFQKSNKSHLRKTKSKVEPAAETVAKIKESTVWQMEN 317
Query: 269 ELYEYALEQFHFVKKHNLVYNKVLGYEADKGKQFMYEKIYPKP 311
ELYE+A +QFHFV+K K+ + + FMYEKI P P
Sbjct: 318 ELYEFARDQFHFVQK------KLRTPGRNVMQDFMYEKIKPNP 354
>gi|17137518|ref|NP_477339.1| heparan sulfate 2-O-sulfotransferase [Drosophila melanogaster]
gi|20455495|sp|P25722.2|HS2ST_DROME RecName: Full=Heparin sulfate O-sulfotransferase
gi|7298583|gb|AAF53800.1| heparan sulfate 2-O-sulfotransferase [Drosophila melanogaster]
gi|16183179|gb|AAL13651.1| GH20044p [Drosophila melanogaster]
gi|220945494|gb|ACL85290.1| Hs2st-PA [synthetic construct]
gi|220955308|gb|ACL90197.1| Hs2st-PA [synthetic construct]
Length = 349
Score = 367 bits (942), Expect = 3e-99, Method: Compositional matrix adjust.
Identities = 169/302 (55%), Positives = 228/302 (75%), Gaps = 8/302 (2%)
Query: 11 ISSAKSPSPETDSLSWD--TVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNH 68
+S + S TD ++ V++YNRVPKTGSTSFVN+AYD+C+ +F+VLH+NVT N H
Sbjct: 54 LSPDQHASSTTDDFDFEEHLVVLYNRVPKTGSTSFVNIAYDLCKPNKFHVLHINVTANMH 113
Query: 69 VLSLADQYRFVNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYY 128
VLSL +Q +FV NV++W + +PALYHGH F+DF +F +P++IN++RKPLDRLVSYY
Sbjct: 114 VLSLPNQIQFVRNVSRWHEMKPALYHGHMAFLDFSKFQIAHKPIYINLVRKPLDRLVSYY 173
Query: 129 YFLRYGDNYRPHLVRKKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNP 188
YFLR+GDNYRP+LVRKK G+K TFDEC+ + +C +NMWLQ+PF CGHAA CW PG+
Sbjct: 174 YFLRFGDNYRPNLVRKKAGNKITFDECVVQKQPDCDPKNMWLQIPFFCGHAAECWEPGSS 233
Query: 189 WALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKI 248
WAL++AK NLV +Y LVGVTE++ +FV LLE +LP F G +H+ SNKSHLR T+ K+
Sbjct: 234 WALDQAKRNLVNEYFLVGVTEQMYEFVDLLERSLPRIFHGFREHYHNSNKSHLRVTSSKL 293
Query: 249 DPSEETVQQIKKSKIWELENELYEYALEQFHFVKKHNLVYNKVLGYEADKGKQFMYEKIY 308
PSE T++ I+K+KIW++EN+LY++AL QF F KK K++ + ++FMYEKI
Sbjct: 294 PPSESTIKSIQKTKIWQMENDLYDFALAQFEFNKK------KLMQPDNKHVQKFMYEKIR 347
Query: 309 PK 310
PK
Sbjct: 348 PK 349
>gi|195345097|ref|XP_002039112.1| GM17009 [Drosophila sechellia]
gi|194134242|gb|EDW55758.1| GM17009 [Drosophila sechellia]
Length = 349
Score = 367 bits (942), Expect = 4e-99, Method: Compositional matrix adjust.
Identities = 169/302 (55%), Positives = 228/302 (75%), Gaps = 8/302 (2%)
Query: 11 ISSAKSPSPETDSLSWD--TVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNH 68
+S + S TD ++ V++YNRVPKTGSTSFVN+AYD+C+ +F+VLH+NVT N H
Sbjct: 54 LSPDQHASSTTDDFDFEEHLVVLYNRVPKTGSTSFVNIAYDLCKPNKFHVLHINVTANMH 113
Query: 69 VLSLADQYRFVNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYY 128
VLSL +Q +FV NV++W + +PALYHGH F+DF +F +P++IN++RKPLDRLVSYY
Sbjct: 114 VLSLPNQIQFVRNVSRWHEMKPALYHGHMAFLDFSKFQIAHKPIYINLVRKPLDRLVSYY 173
Query: 129 YFLRYGDNYRPHLVRKKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNP 188
YFLR+GDNYRP+LVRKK G+K TFDEC+ + +C +NMWLQ+PF CGHAA CW PG+
Sbjct: 174 YFLRFGDNYRPNLVRKKAGNKITFDECVVQKQPDCDPKNMWLQIPFFCGHAAECWEPGSS 233
Query: 189 WALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKI 248
WAL++AK NLV +Y LVGVTE++ +FV LLE +LP F G +H+ SNKSHLR T+ K+
Sbjct: 234 WALDQAKRNLVNEYFLVGVTEQMYEFVDLLERSLPRIFHGFREHYHNSNKSHLRVTSSKL 293
Query: 249 DPSEETVQQIKKSKIWELENELYEYALEQFHFVKKHNLVYNKVLGYEADKGKQFMYEKIY 308
PSE T++ I+K+KIW++EN+LY++AL QF F KK K++ + ++FMYEKI
Sbjct: 294 PPSESTIKAIQKTKIWQMENDLYDFALAQFEFNKK------KLMQPDNKHVQKFMYEKIR 347
Query: 309 PK 310
PK
Sbjct: 348 PK 349
>gi|195580165|ref|XP_002079926.1| GD21759 [Drosophila simulans]
gi|194191935|gb|EDX05511.1| GD21759 [Drosophila simulans]
Length = 349
Score = 367 bits (942), Expect = 4e-99, Method: Compositional matrix adjust.
Identities = 169/302 (55%), Positives = 228/302 (75%), Gaps = 8/302 (2%)
Query: 11 ISSAKSPSPETDSLSWD--TVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNH 68
+S + S TD ++ V++YNRVPKTGSTSFVN+AYD+C+ +F+VLH+NVT N H
Sbjct: 54 LSPDRHASSTTDDFDFEEHLVVLYNRVPKTGSTSFVNIAYDLCKPNKFHVLHINVTANMH 113
Query: 69 VLSLADQYRFVNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYY 128
VLSL +Q +FV NV++W + +PALYHGH F+DF +F +P++IN++RKPLDRLVSYY
Sbjct: 114 VLSLPNQIQFVRNVSRWHEMKPALYHGHMAFLDFSKFQIAHKPIYINLVRKPLDRLVSYY 173
Query: 129 YFLRYGDNYRPHLVRKKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNP 188
YFLR+GDNYRP+LVRKK G+K TFDEC+ + +C +NMWLQ+PF CGHAA CW PG+
Sbjct: 174 YFLRFGDNYRPNLVRKKAGNKITFDECVVQKQPDCDPKNMWLQIPFFCGHAAECWEPGSS 233
Query: 189 WALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKI 248
WAL++AK NLV +Y LVGVTE++ +FV LLE +LP F G +H+ SNKSHLR T+ K+
Sbjct: 234 WALDQAKRNLVNEYFLVGVTEQMYEFVDLLERSLPRIFHGFREHYHNSNKSHLRVTSSKL 293
Query: 249 DPSEETVQQIKKSKIWELENELYEYALEQFHFVKKHNLVYNKVLGYEADKGKQFMYEKIY 308
PSE T++ I+K+KIW++EN+LY++AL QF F KK K++ + ++FMYEKI
Sbjct: 294 PPSESTIKAIQKTKIWQMENDLYDFALAQFEFNKK------KLMQPDNKHVQKFMYEKIR 347
Query: 309 PK 310
PK
Sbjct: 348 PK 349
>gi|348513567|ref|XP_003444313.1| PREDICTED: heparan sulfate 2-O-sulfotransferase 1-like [Oreochromis
niloticus]
Length = 357
Score = 367 bits (942), Expect = 4e-99, Method: Compositional matrix adjust.
Identities = 165/284 (58%), Positives = 215/284 (75%), Gaps = 2/284 (0%)
Query: 27 DTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWR 86
DTVIIYNRVPKT STSF N+AYD+C K RF+VLH+N T NN V+SL DQ RFV NVT WR
Sbjct: 74 DTVIIYNRVPKTASTSFTNIAYDLCGKNRFHVLHINTTKNNPVMSLQDQVRFVRNVTSWR 133
Query: 87 DRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKH 146
+ +P YHGH ++DF ++G K +PL+IN++R P++RLVSYYYFLR+GD+YRP L R+K
Sbjct: 134 EMKPGFYHGHVAYLDFSKYGVKGKPLYINVVRDPIERLVSYYYFLRFGDDYRPGLRRRKQ 193
Query: 147 GDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVG 206
GDK TFDEC+ ++C+ E +WLQ+PF CGH + CW G+ WALE+AK NLV +YLLVG
Sbjct: 194 GDKKTFDECVSSGGSDCAPEKLWLQIPFFCGHHSECWNAGSRWALEQAKYNLVNEYLLVG 253
Query: 207 VTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEETVQQIKKSKIWEL 266
VTEEL DF+ +LEAALP FF+G TD + + KSHLR+T K P++ET+ ++++S IW++
Sbjct: 254 VTEELEDFIMILEAALPRFFKGATDLYRSGKKSHLRKTTEKKPPTKETIAKLQQSNIWKI 313
Query: 267 ENELYEYALEQFHFVKKHNLVYNKVLGYEADKGKQFMYEKIYPK 310
ENE YE+ALEQF FV+ H + + G + F YEKIYPK
Sbjct: 314 ENEFYEFALEQFQFVRAHAV--REKDGELFVLAQSFFYEKIYPK 355
>gi|194879561|ref|XP_001974255.1| GG21631 [Drosophila erecta]
gi|190657442|gb|EDV54655.1| GG21631 [Drosophila erecta]
Length = 349
Score = 367 bits (941), Expect = 5e-99, Method: Compositional matrix adjust.
Identities = 167/302 (55%), Positives = 229/302 (75%), Gaps = 8/302 (2%)
Query: 11 ISSAKSPSPETDSLSWD--TVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNH 68
+++ + S TD ++ V++YNRVPKTGSTSFVN+AYD+C+ +F+VLH+NVT N H
Sbjct: 54 LTADQHASSTTDDFDFEEHLVVLYNRVPKTGSTSFVNIAYDLCKPNKFHVLHINVTANMH 113
Query: 69 VLSLADQYRFVNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYY 128
VLSL +Q +FV NV++W + +PALYHGH F+DF +F +P++IN++RKPLDRL+SYY
Sbjct: 114 VLSLPNQIQFVRNVSRWHEMKPALYHGHMAFLDFSKFQIAHKPIYINLVRKPLDRLISYY 173
Query: 129 YFLRYGDNYRPHLVRKKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNP 188
YFLR+GDNYRP+LVRKK G+K TFDEC+ + +C +NMWLQ+PF CGHAA CW PG+
Sbjct: 174 YFLRFGDNYRPNLVRKKAGNKITFDECVVQKQPDCDPKNMWLQIPFFCGHAAECWEPGSS 233
Query: 189 WALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKI 248
WAL++AK NLV +Y LVGVTE++ +FV LLE +LP F G +H+ SNKSHLR T+ K+
Sbjct: 234 WALDQAKRNLVNEYFLVGVTEQMYEFVDLLERSLPRIFHGFREHYHNSNKSHLRVTSSKL 293
Query: 249 DPSEETVQQIKKSKIWELENELYEYALEQFHFVKKHNLVYNKVLGYEADKGKQFMYEKIY 308
PSE T++ I+K+KIW++EN+LYE+AL QF F K+ K++ + ++FMYEKI
Sbjct: 294 PPSESTIKAIQKTKIWQMENDLYEFALAQFEFNKR------KLMQPDNKHVQKFMYEKIR 347
Query: 309 PK 310
PK
Sbjct: 348 PK 349
>gi|427788279|gb|JAA59591.1| Putative heparan sulfate 2-o-sulfotransferase heparan sulfate
2-o-sulfotransferase [Rhipicephalus pulchellus]
Length = 366
Score = 365 bits (938), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 159/284 (55%), Positives = 218/284 (76%), Gaps = 1/284 (0%)
Query: 27 DTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWR 86
+ VI+YNRVPKTGSTSF+ +AYD+C +F+VLH+N + N HV+SL DQ RFV N++ W
Sbjct: 84 NLVILYNRVPKTGSTSFMGVAYDLCATNKFHVLHLNTSKNMHVMSLPDQIRFVYNISLWH 143
Query: 87 DRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKH 146
+PA+YHGH F++F ++G ++P++IN++R+PLDRLVSY+YFLR+GD++RP+LVR++
Sbjct: 144 YMKPAIYHGHIAFLNFAKYGVIQRPVYINLIRRPLDRLVSYFYFLRHGDDFRPYLVRRRQ 203
Query: 147 GDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVG 206
G+K TFDEC+ +C+ E +WLQVPF CGHAA CW+PGNPWALE+AK NLV Y LVG
Sbjct: 204 GNKMTFDECVAKKGPDCAEERLWLQVPFFCGHAARCWIPGNPWALEQAKHNLVNHYFLVG 263
Query: 207 VTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEETVQQIKKSKIWEL 266
+TE+L +FV++LEA+ P F+G TD F+T +SHLR+T K+ PS ET++ K+S IW++
Sbjct: 264 LTEQLPEFVAMLEASFPRIFKGATDKFITGKRSHLRKTFNKVQPSPETIEHFKRSPIWQM 323
Query: 267 ENELYEYALEQFHFVKKHNLVYNKVLGYEADKGKQFMYEKIYPK 310
ENE YE+A EQF F KK LV + G + G+QF YEKI PK
Sbjct: 324 ENEFYEFAAEQFEFAKKRTLVATQD-GQLTELGQQFFYEKIRPK 366
>gi|410920792|ref|XP_003973867.1| PREDICTED: heparan sulfate 2-O-sulfotransferase 1-like [Takifugu
rubripes]
Length = 354
Score = 365 bits (938), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 163/284 (57%), Positives = 214/284 (75%), Gaps = 2/284 (0%)
Query: 27 DTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWR 86
D VIIYNRVPKTGSTSF N+AYD+C K F+VLH+N T NN V+SL DQ RFV N++ WR
Sbjct: 71 DRVIIYNRVPKTGSTSFTNIAYDLCAKNHFHVLHINTTKNNPVMSLQDQMRFVRNISSWR 130
Query: 87 DRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKH 146
+ +P YHGH ++DF ++G+K +P++IN++R P++RLVSYYYFLR+GDNYRP L R+K
Sbjct: 131 EMKPGFYHGHVAYLDFSKYGAKVKPMYINVVRDPIERLVSYYYFLRFGDNYRPGLRRRKQ 190
Query: 147 GDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVG 206
GDK TFDEC+ ++C+ E +WLQ+PF CGH + CW G+ WALE+AK NLV +YLLVG
Sbjct: 191 GDKKTFDECVSSGGSDCAPEKLWLQIPFFCGHHSECWNAGSKWALEQAKYNLVNEYLLVG 250
Query: 207 VTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEETVQQIKKSKIWEL 266
VTEEL DF+ +LEA LP FF+G T+ F T KSHLR+T K P++ET ++++S IW++
Sbjct: 251 VTEELEDFIMILEAVLPRFFKGATELFKTGKKSHLRKTTEKKPPTKETTAKLQQSNIWKM 310
Query: 267 ENELYEYALEQFHFVKKHNLVYNKVLGYEADKGKQFMYEKIYPK 310
ENE YE+ALEQF FV+ H + + G G+ F YEKIYPK
Sbjct: 311 ENEFYEFALEQFQFVRAHAV--REKDGELYVLGQNFFYEKIYPK 352
>gi|45383201|ref|NP_989812.1| heparan sulfate 2-O-sulfotransferase 1 [Gallus gallus]
gi|67461034|sp|Q76KB1.1|HS2ST_CHICK RecName: Full=Heparan sulfate 2-O-sulfotransferase 1; Short=cHS2ST
gi|38141769|dbj|BAD00706.1| heparan sulfate 2-O-sulfotransferase [Gallus gallus]
Length = 356
Score = 365 bits (938), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 162/284 (57%), Positives = 216/284 (76%), Gaps = 2/284 (0%)
Query: 27 DTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWR 86
D VIIYNRVPKT STSF N+AYD+C K R++VLH+N T NN V+SL DQ RFV NVT W+
Sbjct: 73 DVVIIYNRVPKTASTSFTNIAYDLCAKNRYHVLHINTTKNNPVMSLQDQVRFVKNVTSWK 132
Query: 87 DRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKH 146
+ +P YHGH ++DF +FG K++P++IN++R P++RLVSYYYFLR+GD+YRP L R+K
Sbjct: 133 EMKPGFYHGHVSYLDFAKFGVKKKPIYINVIRDPIERLVSYYYFLRFGDDYRPGLRRRKQ 192
Query: 147 GDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVG 206
GDK TFDEC+ ++C+ E +WLQ+PF CGH++ CW G+ WALE+AK NL+ +Y LVG
Sbjct: 193 GDKKTFDECVAAGGSDCAPEKLWLQIPFFCGHSSECWNVGSRWALEQAKYNLINEYFLVG 252
Query: 207 VTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEETVQQIKKSKIWEL 266
VTEEL DF+ LLEAALP FFRG T+ + T KSHLR+T K P++ET+ ++++S+IW++
Sbjct: 253 VTEELEDFIMLLEAALPRFFRGATELYRTGKKSHLRKTTEKKLPTKETIAKLQQSEIWKM 312
Query: 267 ENELYEYALEQFHFVKKHNLVYNKVLGYEADKGKQFMYEKIYPK 310
ENE YE+ALEQF FV+ H + + G + F YEKIYPK
Sbjct: 313 ENEFYEFALEQFQFVRAHAV--REKDGELYILAQNFFYEKIYPK 354
>gi|291190176|ref|NP_001167199.1| Heparan sulfate 2-O-sulfotransferase 1 [Salmo salar]
gi|223648612|gb|ACN11064.1| Heparan sulfate 2-O-sulfotransferase 1 [Salmo salar]
Length = 358
Score = 365 bits (937), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 164/284 (57%), Positives = 217/284 (76%), Gaps = 2/284 (0%)
Query: 27 DTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWR 86
D V+IYNRVPKT STSF N+AYD+C K R++VLH+N T NN V+S+ DQ RFV NVT+WR
Sbjct: 75 DVVVIYNRVPKTASTSFTNIAYDLCGKNRYHVLHINTTKNNPVMSMQDQVRFVKNVTEWR 134
Query: 87 DRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKH 146
+ +PA YHGH F+DF +FG K++P++IN++R P++RLVSYYYFLR+GD+YRP L R+K
Sbjct: 135 EMKPAFYHGHVSFLDFTKFGVKKKPVYINVIRDPIERLVSYYYFLRFGDDYRPGLRRRKQ 194
Query: 147 GDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVG 206
GDK TFDEC+ ++C+ E +WLQ+PF CGH + CW G+ WALE+AK NLV +YLLVG
Sbjct: 195 GDKKTFDECVSAGGSDCAPEKLWLQIPFFCGHYSECWNVGSHWALEQAKYNLVNEYLLVG 254
Query: 207 VTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEETVQQIKKSKIWEL 266
VTEEL DFV +LEAALP FFRG T+ + T KSHLR+T+ K P++E+ ++++S IW++
Sbjct: 255 VTEELEDFVMMLEAALPRFFRGATELYKTGKKSHLRKTSEKKPPTKESSAKLQQSAIWKM 314
Query: 267 ENELYEYALEQFHFVKKHNLVYNKVLGYEADKGKQFMYEKIYPK 310
ENE YE+ALEQF FV+ H + + G + F YEKIYPK
Sbjct: 315 ENEFYEFALEQFQFVRAHAV--REKDGELYLLAQNFFYEKIYPK 356
>gi|326925126|ref|XP_003208772.1| PREDICTED: heparan sulfate 2-O-sulfotransferase 1-like [Meleagris
gallopavo]
Length = 336
Score = 364 bits (935), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 162/284 (57%), Positives = 216/284 (76%), Gaps = 2/284 (0%)
Query: 27 DTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWR 86
D VIIYNRVPKT STSF N+AYD+C K R++VLH+N T NN V+SL DQ RFV NVT W+
Sbjct: 53 DVVIIYNRVPKTASTSFTNIAYDLCAKNRYHVLHINTTKNNPVMSLQDQVRFVKNVTSWK 112
Query: 87 DRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKH 146
+ +P YHGH ++DF +FG K++P++IN++R P++RLVSYYYFLR+GD+YRP L R+K
Sbjct: 113 EMKPGFYHGHVSYLDFAKFGVKKKPIYINVIRDPIERLVSYYYFLRFGDDYRPGLRRRKQ 172
Query: 147 GDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVG 206
GDK TFDEC+ ++C+ E +WLQ+PF CGH++ CW G+ WALE+AK NL+ +Y LVG
Sbjct: 173 GDKKTFDECVAAGGSDCAPEKLWLQIPFFCGHSSECWNVGSRWALEQAKYNLINEYFLVG 232
Query: 207 VTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEETVQQIKKSKIWEL 266
VTEEL DF+ LLEAALP FFRG T+ + T KSHLR+T K P++ET+ ++++S+IW++
Sbjct: 233 VTEELEDFIMLLEAALPRFFRGATELYRTGKKSHLRKTTEKKLPTKETIAKLQQSEIWKM 292
Query: 267 ENELYEYALEQFHFVKKHNLVYNKVLGYEADKGKQFMYEKIYPK 310
ENE YE+ALEQF FV+ H + + G + F YEKIYPK
Sbjct: 293 ENEFYEFALEQFQFVRAHAV--REKDGELYILAQNFFYEKIYPK 334
>gi|195484458|ref|XP_002090703.1| Hs2st [Drosophila yakuba]
gi|194176804|gb|EDW90415.1| Hs2st [Drosophila yakuba]
Length = 349
Score = 364 bits (935), Expect = 3e-98, Method: Compositional matrix adjust.
Identities = 167/302 (55%), Positives = 227/302 (75%), Gaps = 8/302 (2%)
Query: 11 ISSAKSPSPETDSLSWD--TVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNH 68
++ + S TD ++ V++YNRVPKTGSTSFVN+AYD+C+ +F+VLH+NVT N H
Sbjct: 54 LTPDQHASSTTDDFDFEEHLVVLYNRVPKTGSTSFVNIAYDLCKPNKFHVLHINVTANMH 113
Query: 69 VLSLADQYRFVNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYY 128
VLSL +Q +FV NV++W + +PALYHGH F+DF +F +P++IN++RKPLDRLVSYY
Sbjct: 114 VLSLPNQIQFVRNVSRWHEMKPALYHGHMAFLDFSKFQIAHKPIYINLVRKPLDRLVSYY 173
Query: 129 YFLRYGDNYRPHLVRKKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNP 188
YFLR+GDNYRP+LVRKK G+K TFD C+ + +C +NMWLQ+PF CGHAA CW PG+
Sbjct: 174 YFLRFGDNYRPNLVRKKAGNKITFDACVVQKQPDCDPKNMWLQIPFFCGHAAECWEPGSS 233
Query: 189 WALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKI 248
WAL++AK NLV +Y LVGVTE++ +FV LLE +LP F G +H+ SNKSHLR T+ K+
Sbjct: 234 WALDQAKRNLVNEYFLVGVTEQMYEFVDLLERSLPRIFHGFREHYHNSNKSHLRVTSSKL 293
Query: 249 DPSEETVQQIKKSKIWELENELYEYALEQFHFVKKHNLVYNKVLGYEADKGKQFMYEKIY 308
PSE T++ I+K+KIW++EN+LYE+AL QF F K+ K++ + ++FMYEKI
Sbjct: 294 PPSESTIKAIQKTKIWQMENDLYEFALNQFEFNKR------KLMQPDNKHVQKFMYEKIR 347
Query: 309 PK 310
PK
Sbjct: 348 PK 349
>gi|292614942|ref|XP_001922005.2| PREDICTED: heparan sulfate 2-O-sulfotransferase 1-like [Danio
rerio]
Length = 312
Score = 364 bits (934), Expect = 3e-98, Method: Compositional matrix adjust.
Identities = 164/284 (57%), Positives = 216/284 (76%), Gaps = 2/284 (0%)
Query: 27 DTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWR 86
D +IIYNRVPKT STSF N+AYD+C K RF+VLH+N T NN V+SL DQ RFV NVT WR
Sbjct: 29 DLIIIYNRVPKTASTSFTNIAYDLCGKNRFHVLHINTTKNNPVMSLQDQMRFVRNVTSWR 88
Query: 87 DRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKH 146
+ +P YHGH ++DF +FG K +P++INI+R P++RLVSYYYFLR+GD+YRP L R+K
Sbjct: 89 EMKPGFYHGHVSYLDFTKFGVKGKPVYINIVRDPIERLVSYYYFLRFGDDYRPGLRRRKQ 148
Query: 147 GDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVG 206
GDK TFDEC+ ++C+ E +WLQ+PF CGH + CW G+ WALE+AK NLV +YLLVG
Sbjct: 149 GDKKTFDECVSSGGSDCAPEKLWLQIPFFCGHYSECWNVGSKWALEQAKYNLVNEYLLVG 208
Query: 207 VTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEETVQQIKKSKIWEL 266
VTEEL DF+ +LEAALP FFRG T+ + T KSHLR+T+ K P++E++ ++++S IW++
Sbjct: 209 VTEELEDFIMMLEAALPRFFRGATELYRTGKKSHLRKTSEKKPPTKESISKLQQSNIWKM 268
Query: 267 ENELYEYALEQFHFVKKHNLVYNKVLGYEADKGKQFMYEKIYPK 310
ENE YE+ALEQF FV+ H++ + G + F YEKIYPK
Sbjct: 269 ENEFYEFALEQFQFVRAHSV--REKDGELYLLSQNFFYEKIYPK 310
>gi|327270761|ref|XP_003220157.1| PREDICTED: heparan sulfate 2-O-sulfotransferase 1-like [Anolis
carolinensis]
Length = 356
Score = 364 bits (934), Expect = 3e-98, Method: Compositional matrix adjust.
Identities = 164/299 (54%), Positives = 221/299 (73%), Gaps = 5/299 (1%)
Query: 12 SSAKSPSPETDSLSWDTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLS 71
SS P+ D + VIIYNRVPKT STSF N+AYD+C + +++VLH+N T NN V+S
Sbjct: 61 SSRSDVVPDEDE---NVVIIYNRVPKTASTSFTNIAYDLCARNKYHVLHINTTKNNPVMS 117
Query: 72 LADQYRFVNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFL 131
L DQ RFV NVT W++ +P YHGH F+DF +FG K++P++IN++R P++RLVSYYYFL
Sbjct: 118 LQDQVRFVKNVTSWKEMKPGFYHGHISFLDFAKFGVKKKPVYINVIRDPIERLVSYYYFL 177
Query: 132 RYGDNYRPHLVRKKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWAL 191
R+GD+YRP L R+K GDK TFDEC+ ++C+ E +WLQ+PF CGH++ CW G+ WAL
Sbjct: 178 RFGDDYRPGLRRRKQGDKKTFDECVAAGGSDCAPEKLWLQIPFFCGHSSECWNVGSRWAL 237
Query: 192 EKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPS 251
E+AK NL+ +Y LVGVTEEL DF+ LLEAALP FFRG T+ + T KSHLR+T K P+
Sbjct: 238 EQAKYNLINEYFLVGVTEELEDFIMLLEAALPRFFRGATELYRTGKKSHLRKTTEKKLPT 297
Query: 252 EETVQQIKKSKIWELENELYEYALEQFHFVKKHNLVYNKVLGYEADKGKQFMYEKIYPK 310
+ET+ ++++S+IW++ENE YE+ALEQF FV+ H + + G + F YEKIYPK
Sbjct: 298 KETIAKLQQSEIWKMENEFYEFALEQFQFVRAHAV--REKDGELYILAQNFFYEKIYPK 354
>gi|218681912|pdb|3F5F|A Chain A, Crystal Structure Of Heparan Sulfate 2-O-Sulfotransferase
From Gallus Gallus As A Maltose Binding Protein Fusion
Length = 658
Score = 364 bits (934), Expect = 3e-98, Method: Compositional matrix adjust.
Identities = 161/284 (56%), Positives = 214/284 (75%), Gaps = 2/284 (0%)
Query: 27 DTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWR 86
D VIIYNRVPKT STSF N+AYD+C K R++VLH+N T NN V+SL DQ RFV NVT W+
Sbjct: 375 DVVIIYNRVPKTASTSFTNIAYDLCAKNRYHVLHINTTKNNPVMSLQDQVRFVKNVTSWK 434
Query: 87 DRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKH 146
+ +P YHGH ++DF +FG K++P++IN++R P++RLVSYYYFLR+GD+YRP L R+K
Sbjct: 435 EMKPGFYHGHVSYLDFAKFGVKKKPIYINVIRDPIERLVSYYYFLRFGDDYRPGLRRRKQ 494
Query: 147 GDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVG 206
GDK TFDEC+ ++C+ E +WLQ+PF CGH++ CW G+ WALE+AK NL+ +Y LVG
Sbjct: 495 GDKKTFDECVAAGGSDCAPEKLWLQIPFFCGHSSECWNVGSRWALEQAKYNLINEYFLVG 554
Query: 207 VTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEETVQQIKKSKIWEL 266
VTEEL DF+ LLEAALP FFRG T+ + T KSHLR+T K P+ T+ ++++S+IW++
Sbjct: 555 VTEELEDFIMLLEAALPRFFRGATELYRTGKKSHLRKTTEKKLPTAATIAKLQQSEIWKM 614
Query: 267 ENELYEYALEQFHFVKKHNLVYNKVLGYEADKGKQFMYEKIYPK 310
ENE YE+ALEQF FV+ H + + G + F YEKIYPK
Sbjct: 615 ENEFYEFALEQFQFVRAHAV--REKDGELYILAQNFFYEKIYPK 656
>gi|348500550|ref|XP_003437836.1| PREDICTED: heparan sulfate 2-O-sulfotransferase 1-like [Oreochromis
niloticus]
Length = 359
Score = 363 bits (933), Expect = 4e-98, Method: Compositional matrix adjust.
Identities = 162/284 (57%), Positives = 216/284 (76%), Gaps = 2/284 (0%)
Query: 27 DTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWR 86
D VIIYNRVPKT STSF N+AYD+C K ++VLH+N T NN V+SL DQ RFV NVT+WR
Sbjct: 77 DVVIIYNRVPKTASTSFTNIAYDLCGKNHYHVLHINTTKNNPVMSLQDQVRFVKNVTEWR 136
Query: 87 DRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKH 146
+ +PA YHGH F+DF +FG K++P++IN++R P++RLVSYYYFLR+GD+YRP L R+K
Sbjct: 137 EMKPAFYHGHVSFLDFTKFGVKKKPIYINVIRDPIERLVSYYYFLRFGDDYRPGLRRRKQ 196
Query: 147 GDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVG 206
GDK TFDEC+ ++C+ E +WLQ+PF CGH + CW G+ WALE+AK NLV +Y+LVG
Sbjct: 197 GDKKTFDECVSAGGSDCAPEKLWLQIPFFCGHYSECWNVGSQWALEQAKYNLVNEYMLVG 256
Query: 207 VTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEETVQQIKKSKIWEL 266
VTEEL DFV +LEAALP FF+G T+ + T KSHLR+T+ K P++E++ ++++S IW++
Sbjct: 257 VTEELEDFVMMLEAALPRFFKGATELYKTGKKSHLRKTSEKKPPTKESIAKLQQSTIWKM 316
Query: 267 ENELYEYALEQFHFVKKHNLVYNKVLGYEADKGKQFMYEKIYPK 310
EN+ YE+ALEQF FV+ H + Y + F YEKIYPK
Sbjct: 317 ENDFYEFALEQFQFVRAHAVREKDGELYLL--AQNFFYEKIYPK 358
>gi|260789211|ref|XP_002589641.1| hypothetical protein BRAFLDRAFT_236592 [Branchiostoma floridae]
gi|229274821|gb|EEN45652.1| hypothetical protein BRAFLDRAFT_236592 [Branchiostoma floridae]
Length = 304
Score = 363 bits (932), Expect = 5e-98, Method: Compositional matrix adjust.
Identities = 160/284 (56%), Positives = 214/284 (75%), Gaps = 2/284 (0%)
Query: 27 DTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWR 86
D ++IYNRVPKT STSFV +AYD+C++ +NV+H+N T N+ ++S+ DQ RFV NVT W
Sbjct: 23 DQIVIYNRVPKTASTSFVGLAYDLCQRNGYNVIHLNTTRNSPIMSIQDQERFVTNVTNWN 82
Query: 87 DRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKH 146
++P YHGH +++F +FG ++P++INI+RKPLDRLVSYYYF+R+GD++RPHL R KH
Sbjct: 83 AKKPVFYHGHLSYLEFGRFGLSQKPVYINIVRKPLDRLVSYYYFVRWGDDFRPHLRRNKH 142
Query: 147 GDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVG 206
GD TFD+C+ +C+ E +W+QVPF CGHA CW PGN WALE+AK NLV Y LVG
Sbjct: 143 GDSKTFDDCVEQGEPDCAPEKLWMQVPFFCGHAVECWEPGNRWALEEAKRNLVANYFLVG 202
Query: 207 VTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEETVQQIKKSKIWEL 266
VTEEL DFV LLEAALP FFRG T F KSHLR+T+ K +PS+ET+++I++S+IW++
Sbjct: 203 VTEELEDFVMLLEAALPKFFRGATSLFQQGGKSHLRKTSNKQEPSKETIRKIQRSQIWQM 262
Query: 267 ENELYEYALEQFHFVKKHNLVYNKVLGYEADKGKQFMYEKIYPK 310
EN+ Y++AL QFH V + +L +V G G QF YEKI P+
Sbjct: 263 ENDFYQFALNQFHHVVRRSL--RRVNGELTPLGAQFFYEKIRPR 304
>gi|307194445|gb|EFN76743.1| Heparan sulfate 2-O-sulfotransferase 1 [Harpegnathos saltator]
Length = 345
Score = 363 bits (932), Expect = 5e-98, Method: Compositional matrix adjust.
Identities = 165/284 (58%), Positives = 212/284 (74%), Gaps = 5/284 (1%)
Query: 27 DTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWR 86
D V+IYNRVPKT STSFV + YD+C++ +++VLH+NVT N H L+L +Q +F NN++ W
Sbjct: 67 DIVVIYNRVPKTASTSFVGLVYDLCKQNKYHVLHINVTNNMHTLTLNNQVQFANNISNWD 126
Query: 87 DRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKH 146
+PA YHGH F++F++FG K PL+IN+LRKPL+R VSYYYFLRYGDN+RP+LVRKKH
Sbjct: 127 IIKPAFYHGHMAFLNFEKFGIKRTPLYINLLRKPLERFVSYYYFLRYGDNFRPYLVRKKH 186
Query: 147 GDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVG 206
GD TFDECI + + +C NMWLQ+PFLCGH ACW GN WALE+AK NL Y LVG
Sbjct: 187 GDTKTFDECINIGQPDCDPNNMWLQIPFLCGHDPACWEVGNSWALEEAKRNLQKHYFLVG 246
Query: 207 VTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEETVQQIKKSKIWEL 266
VTEEL DFV +LE LP FF+G + FL +NKSHLR+T +K++P ETV++I++S +W +
Sbjct: 247 VTEELNDFVEILENVLPRFFKGAYNFFLHNNKSHLRQTTQKLNPLPETVEKIQQSVVWRM 306
Query: 267 ENELYEYALEQFHFVKKHNLVYNKVLGYEADKGKQFMYEKIYPK 310
ENELY +ALE FH VK+ L D ++FMYEKI PK
Sbjct: 307 ENELYNFALEHFHAVKRRLLN-----ASPQDANQRFMYEKIRPK 345
>gi|148222729|ref|NP_001084584.1| uncharacterized protein LOC414536 [Xenopus laevis]
gi|46250094|gb|AAH68733.1| MGC81201 protein [Xenopus laevis]
Length = 356
Score = 363 bits (932), Expect = 6e-98, Method: Compositional matrix adjust.
Identities = 162/284 (57%), Positives = 213/284 (75%), Gaps = 2/284 (0%)
Query: 27 DTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWR 86
D +IIYNRVPKT STSF N+AYD+C K +++VLH+N T NN V+SL DQ RFV NV+ WR
Sbjct: 73 DFLIIYNRVPKTASTSFTNIAYDLCAKNKYHVLHINTTKNNPVMSLQDQMRFVKNVSSWR 132
Query: 87 DRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKH 146
+ +P YHGH F+DF +FG K++P++IN++R P++RLVSYYYFLR+GD+YRP L R+K
Sbjct: 133 EMKPGFYHGHISFLDFTKFGVKKKPIYINVIRDPIERLVSYYYFLRFGDDYRPGLRRRKQ 192
Query: 147 GDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVG 206
GDK TFDEC+ ++C+ E +WLQ+PF CGH++ CW G+ WALE+AK NLV +Y LVG
Sbjct: 193 GDKKTFDECVAAGGSDCAPEKLWLQIPFFCGHSSECWNVGSRWALEQAKYNLVNEYFLVG 252
Query: 207 VTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEETVQQIKKSKIWEL 266
VTEEL DF+ LLEAALP FFRG T+ + + KSHLR+T K PS+ET ++++S IW++
Sbjct: 253 VTEELEDFIMLLEAALPRFFRGATELYRSGKKSHLRKTTEKKAPSKETTAKLQQSDIWKM 312
Query: 267 ENELYEYALEQFHFVKKHNLVYNKVLGYEADKGKQFMYEKIYPK 310
ENE YE+ALEQF FV+ H + Y + F YEKIYPK
Sbjct: 313 ENEFYEFALEQFQFVRAHAVREKDGELYVL--AQNFFYEKIYPK 354
>gi|224057489|ref|XP_002191658.1| PREDICTED: heparan sulfate 2-O-sulfotransferase 1 [Taeniopygia
guttata]
Length = 394
Score = 363 bits (932), Expect = 6e-98, Method: Compositional matrix adjust.
Identities = 161/284 (56%), Positives = 216/284 (76%), Gaps = 2/284 (0%)
Query: 27 DTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWR 86
D VIIYNRVPKT STSF N+AYD+C K +++VLH+N T NN V+SL DQ RFV NVT W+
Sbjct: 111 DVVIIYNRVPKTASTSFTNIAYDLCAKNKYHVLHINTTKNNPVMSLQDQVRFVKNVTSWK 170
Query: 87 DRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKH 146
+ +P YHGH ++DF +FG K++P++IN++R P++RLVSYYYFLR+GD+YRP L R+K
Sbjct: 171 EMKPGFYHGHVSYLDFAKFGVKKKPIYINVIRDPIERLVSYYYFLRFGDDYRPGLRRRKQ 230
Query: 147 GDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVG 206
GDK TFDEC+ ++C+ E +WLQ+PF CGH++ CW G+ WALE+AK NL+ +Y LVG
Sbjct: 231 GDKKTFDECVAAGGSDCAPEKLWLQIPFFCGHSSECWNVGSRWALEQAKYNLINEYFLVG 290
Query: 207 VTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEETVQQIKKSKIWEL 266
VTEEL DF+ LLEAALP FFRG T+ + T KSHLR+T K P++ET+ ++++S+IW++
Sbjct: 291 VTEELEDFIMLLEAALPRFFRGATELYRTGKKSHLRKTTEKKLPTKETIAKLQQSEIWKM 350
Query: 267 ENELYEYALEQFHFVKKHNLVYNKVLGYEADKGKQFMYEKIYPK 310
ENE YE+ALEQF FV+ H + + G + F YEKIYPK
Sbjct: 351 ENEFYEFALEQFQFVRAHAV--REKDGELYILAQNFFYEKIYPK 392
>gi|187608506|ref|NP_001120463.1| heparan sulfate 2-O-sulfotransferase 1 [Xenopus (Silurana)
tropicalis]
gi|170285016|gb|AAI61283.1| LOC100145562 protein [Xenopus (Silurana) tropicalis]
Length = 356
Score = 362 bits (930), Expect = 8e-98, Method: Compositional matrix adjust.
Identities = 161/284 (56%), Positives = 213/284 (75%), Gaps = 2/284 (0%)
Query: 27 DTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWR 86
D +IIYNRVPKT STSF N+AYD+C K +++VLH+N T NN V+SL DQ RFV NV+ WR
Sbjct: 73 DILIIYNRVPKTASTSFTNIAYDLCAKNKYHVLHINTTKNNPVMSLQDQMRFVKNVSSWR 132
Query: 87 DRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKH 146
+ +P YHGH F+DF +FG K++P++IN++R P++RLVSYYYFLR+GD+YRP L R+K
Sbjct: 133 EMKPGFYHGHISFLDFTKFGVKKKPIYINVIRDPIERLVSYYYFLRFGDDYRPGLRRRKQ 192
Query: 147 GDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVG 206
GDK TFDEC+ ++C+ E +WLQ+PF CGH++ CW G+ WAL++AK NLV +Y LVG
Sbjct: 193 GDKKTFDECVAAGGSDCAPEKLWLQIPFFCGHSSECWNVGSRWALDQAKYNLVNEYFLVG 252
Query: 207 VTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEETVQQIKKSKIWEL 266
VTEEL DF+ LLEAALP FFRG T+ + + KSHLR+T K PS+ET ++++S IW++
Sbjct: 253 VTEELEDFIMLLEAALPRFFRGATELYRSGKKSHLRKTTEKKAPSKETTAKLQQSDIWKM 312
Query: 267 ENELYEYALEQFHFVKKHNLVYNKVLGYEADKGKQFMYEKIYPK 310
ENE YE+ALEQF FV+ H + Y + F YEKIYPK
Sbjct: 313 ENEFYEFALEQFQFVRAHAVREKDGELYVL--AQNFFYEKIYPK 354
>gi|432911720|ref|XP_004078690.1| PREDICTED: heparan sulfate 2-O-sulfotransferase 1-like [Oryzias
latipes]
Length = 357
Score = 362 bits (929), Expect = 1e-97, Method: Compositional matrix adjust.
Identities = 164/298 (55%), Positives = 221/298 (74%), Gaps = 6/298 (2%)
Query: 13 SAKSPSPETDSLSWDTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSL 72
SA +P D D VIIYNRVPKT STSF N+AYD+C K ++VLH+N T NN V+S+
Sbjct: 65 SAAAPPDGED----DVVIIYNRVPKTASTSFTNIAYDLCGKNHYHVLHINTTKNNPVMSM 120
Query: 73 ADQYRFVNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLR 132
DQ RFV NVT+WR+ +PA YHGH F+DF +FG K++P++IN++R P++RLVSYYYFLR
Sbjct: 121 QDQVRFVKNVTEWREMKPAFYHGHVSFLDFTRFGVKKKPIYINVIRDPIERLVSYYYFLR 180
Query: 133 YGDNYRPHLVRKKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALE 192
+GD+YRP L R+K GDK TFDEC+ ++C+ E +WLQ+PF CGH + CW G+ WALE
Sbjct: 181 FGDDYRPGLRRRKQGDKKTFDECVSAGGSDCAPEKLWLQIPFFCGHYSECWNVGSQWALE 240
Query: 193 KAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSE 252
+AK NLV +Y+LVGVTEEL DF+ +LEA+LP FF+G T+ + T KSHLR+T+ K P++
Sbjct: 241 QAKYNLVNEYMLVGVTEELEDFIMMLEASLPRFFKGATELYKTGKKSHLRKTSEKKPPTK 300
Query: 253 ETVQQIKKSKIWELENELYEYALEQFHFVKKHNLVYNKVLGYEADKGKQFMYEKIYPK 310
E+V ++++S IW++EN+ YE+ALEQF FV+ H + Y + F YEKIYPK
Sbjct: 301 ESVTKLQQSNIWKMENDFYEFALEQFQFVRAHAVREKDGELYLL--AQNFFYEKIYPK 356
>gi|19343562|gb|AAH25443.1| Heparan sulfate 2-O-sulfotransferase 1 [Mus musculus]
Length = 356
Score = 362 bits (928), Expect = 2e-97, Method: Compositional matrix adjust.
Identities = 158/289 (54%), Positives = 215/289 (74%), Gaps = 12/289 (4%)
Query: 27 DTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWR 86
D +IIYNRVPKT STSF N+AYD+C K R++VLH+N T NN V+SL DQ RFV N+T W
Sbjct: 73 DIIIIYNRVPKTASTSFTNIAYDLCAKNRYHVLHINTTKNNPVMSLQDQVRFVKNITTWN 132
Query: 87 DRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKH 146
+ +P YHGH ++DF +FG K++P++IN++R P++RLVSYYYFLR+GD+YRP L R+K
Sbjct: 133 EMKPGFYHGHISYLDFAKFGVKKKPIYINVIRDPIERLVSYYYFLRFGDDYRPGLRRRKQ 192
Query: 147 GDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVG 206
GDK TFDEC+ ++C+ E +WLQ+PF CGH++ CW G+ WA+++AK NL+ +Y LVG
Sbjct: 193 GDKKTFDECVAEGGSDCAPEKLWLQIPFFCGHSSECWNVGSRWAMDQAKSNLINEYFLVG 252
Query: 207 VTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEETVQQIKKSKIWEL 266
VTEEL DF+ LLEAALP FFRG TD + T KSHLR+T K P+++T+ ++++S IW++
Sbjct: 253 VTEELEDFIMLLEAALPRFFRGATDLYRTGKKSHLRKTTEKKLPTKQTIAKLQQSDIWKM 312
Query: 267 ENELYEYALEQFHFVKKHNLVYNKVLGYEADK-----GKQFMYEKIYPK 310
ENE YE+ALEQF F++ H + +E D + F YEKIYPK
Sbjct: 313 ENEFYEFALEQFQFIRAHAV-------HEKDGDLYILAQNFFYEKIYPK 354
>gi|147901207|ref|NP_001083748.1| heparan sulfate 2-O-sulfotransferase 1 [Xenopus laevis]
gi|67460984|sp|O93336.1|HS2ST_XENLA RecName: Full=Heparan sulfate 2-O-sulfotransferase 1
gi|3228538|gb|AAC41301.1| heparan sulfate 2-sulfotransferase [Xenopus laevis]
gi|50416538|gb|AAH78097.1| Hs2st protein [Xenopus laevis]
Length = 356
Score = 361 bits (927), Expect = 2e-97, Method: Compositional matrix adjust.
Identities = 161/284 (56%), Positives = 212/284 (74%), Gaps = 2/284 (0%)
Query: 27 DTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWR 86
D +IIYNRVPKT STSF N+AYD+C K +++VLH+N T NN V+SL DQ RFV NV+ WR
Sbjct: 73 DILIIYNRVPKTASTSFTNIAYDLCAKNKYHVLHINTTKNNPVMSLQDQVRFVKNVSSWR 132
Query: 87 DRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKH 146
+ +P YHGH F+DF +FG K++P++IN++R P++RLVSYYYFLR+GD+YRP L R+K
Sbjct: 133 EMKPGFYHGHVSFLDFTKFGVKKKPIYINVIRDPIERLVSYYYFLRFGDDYRPGLRRRKQ 192
Query: 147 GDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVG 206
GDK TFDEC+ ++C+ E +WLQ+PF CGH++ CW G+ WAL++AK NLV +Y LVG
Sbjct: 193 GDKKTFDECVAAGGSDCAPEKLWLQIPFFCGHSSECWNVGSRWALDQAKYNLVNEYFLVG 252
Query: 207 VTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEETVQQIKKSKIWEL 266
VTEEL DF+ LLEAALP FFRG T+ + + KSHLR+T K PS+ET ++++S IW++
Sbjct: 253 VTEELEDFIMLLEAALPRFFRGATELYRSGKKSHLRKTTEKKAPSKETTAKLQQSDIWKM 312
Query: 267 ENELYEYALEQFHFVKKHNLVYNKVLGYEADKGKQFMYEKIYPK 310
ENE YE+ALEQF FV+ H + Y F YEKIYPK
Sbjct: 313 ENEFYEFALEQFQFVRAHAVREKDGELYVL--APNFFYEKIYPK 354
>gi|170172560|ref|NP_035958.3| heparan sulfate 2-O-sulfotransferase 1 [Mus musculus]
gi|67461059|sp|Q8R3H7.2|HS2ST_MOUSE RecName: Full=Heparan sulfate 2-O-sulfotransferase 1;
Short=2-O-sulfotransferase; Short=2-OST; Short=2OST
gi|7329070|gb|AAF59900.1|AF169243_1 heparan sulfate 2-O-sulfotransferase [Mus musculus]
gi|26353762|dbj|BAC40511.1| unnamed protein product [Mus musculus]
gi|37590704|gb|AAH59008.1| Heparan sulfate 2-O-sulfotransferase 1 [Mus musculus]
gi|74138367|dbj|BAE38041.1| unnamed protein product [Mus musculus]
gi|148680083|gb|EDL12030.1| heparan sulfate 2-O-sulfotransferase 1 [Mus musculus]
Length = 356
Score = 361 bits (926), Expect = 3e-97, Method: Compositional matrix adjust.
Identities = 157/284 (55%), Positives = 214/284 (75%), Gaps = 2/284 (0%)
Query: 27 DTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWR 86
D +IIYNRVPKT STSF N+AYD+C K R++VLH+N T NN V+SL DQ RFV N+T W
Sbjct: 73 DIIIIYNRVPKTASTSFTNIAYDLCAKNRYHVLHINTTKNNPVMSLQDQVRFVKNITTWN 132
Query: 87 DRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKH 146
+ +P YHGH ++DF +FG K++P++IN++R P++RLVSYYYFLR+GD+YRP L R+K
Sbjct: 133 EMKPGFYHGHISYLDFAKFGVKKKPIYINVIRDPIERLVSYYYFLRFGDDYRPGLRRRKQ 192
Query: 147 GDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVG 206
GDK TFDEC+ ++C+ E +WLQ+PF CGH++ CW G+ WA+++AK NL+ +Y LVG
Sbjct: 193 GDKKTFDECVAEGGSDCAPEKLWLQIPFFCGHSSECWNVGSRWAMDQAKSNLINEYFLVG 252
Query: 207 VTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEETVQQIKKSKIWEL 266
VTEEL DF+ LLEAALP FFRG TD + T KSHLR+T K P+++T+ ++++S IW++
Sbjct: 253 VTEELEDFIMLLEAALPRFFRGATDLYRTGKKSHLRKTTEKKLPTKQTIAKLQQSDIWKM 312
Query: 267 ENELYEYALEQFHFVKKHNLVYNKVLGYEADKGKQFMYEKIYPK 310
ENE YE+ALEQF F++ H + + G + F YEKIYPK
Sbjct: 313 ENEFYEFALEQFQFIRAHAV--REKDGDLYILAQNFFYEKIYPK 354
>gi|291398571|ref|XP_002715567.1| PREDICTED: heparan sulfate 2-O-sulfotransferase 1, partial
[Oryctolagus cuniculus]
Length = 362
Score = 361 bits (926), Expect = 3e-97, Method: Compositional matrix adjust.
Identities = 158/284 (55%), Positives = 215/284 (75%), Gaps = 2/284 (0%)
Query: 27 DTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWR 86
D VIIYNRVPKT STSF N+AYD+C K R++VLH+N T NN V+SL DQ RFV N+T W+
Sbjct: 79 DMVIIYNRVPKTASTSFTNIAYDLCAKNRYHVLHINTTKNNPVMSLQDQVRFVKNITSWK 138
Query: 87 DRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKH 146
+ +P YHGH ++DF +FG K++P++IN++R P++RLVSYYYFLR+GD+YRP L R+K
Sbjct: 139 EMKPGFYHGHVSYLDFAKFGVKKKPIYINVIRDPIERLVSYYYFLRFGDDYRPGLRRRKQ 198
Query: 147 GDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVG 206
GDK TFDEC+ ++C+ E +WLQ+PF CGH++ CW G+ WA+++AK NLV +Y LVG
Sbjct: 199 GDKKTFDECVAEGGSDCAPEKLWLQIPFFCGHSSECWNVGSRWAMDQAKYNLVNEYFLVG 258
Query: 207 VTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEETVQQIKKSKIWEL 266
VTEEL DF+ LLEAALP FFRG T+ + T KSHLR+T K P+++T+ ++++S IW++
Sbjct: 259 VTEELEDFIMLLEAALPRFFRGATELYRTGKKSHLRKTTEKKLPTKQTIAKLQQSDIWKM 318
Query: 267 ENELYEYALEQFHFVKKHNLVYNKVLGYEADKGKQFMYEKIYPK 310
ENE YE+ALEQF F++ H + + G + F YEKIYPK
Sbjct: 319 ENEFYEFALEQFQFIRAHAV--REKDGDLYILAQNFFYEKIYPK 360
>gi|321477957|gb|EFX88915.1| hypothetical protein DAPPUDRAFT_191314 [Daphnia pulex]
Length = 343
Score = 360 bits (925), Expect = 3e-97, Method: Compositional matrix adjust.
Identities = 166/310 (53%), Positives = 224/310 (72%), Gaps = 15/310 (4%)
Query: 1 INTQKSHQIHISSAKSPSPETDSLSWDTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLH 60
+NTQ + + SS S +P + YNRVPKTGSTSFV +AYD+C + +F VLH
Sbjct: 49 LNTQVNGALLPSSFPSQNP---------TLFYNRVPKTGSTSFVGLAYDLCSRNKFKVLH 99
Query: 61 VNVTGNNHVLSLADQYRFVNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKP 120
VNV+ N H +SL DQ RF N++ W ++P+ YHGH ++DF +FGS P+FIN++R+P
Sbjct: 100 VNVSKNAHTMSLNDQLRFARNLSSWDLKQPSFYHGHIAYLDFTKFGSTA-PIFINLIRQP 158
Query: 121 LDRLVSYYYFLRYGDNYRPHLVRKKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAA 180
LDR+VSYYYFLRYGD++RPHL+R+K G+KT+FD+C++ +TEC N+WLQVPF CGH A
Sbjct: 159 LDRMVSYYYFLRYGDDFRPHLIRRKQGNKTSFDDCVKEGQTECDPNNLWLQVPFFCGHHA 218
Query: 181 ACWVPGNPWALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSH 240
CWVPG+ WA E+AK NL+ YLLVGVTE++ +FV++LEA LP+FF+G + +KSH
Sbjct: 219 DCWVPGSSWAFEQAKNNLIKNYLLVGVTEQMEEFVAVLEATLPNFFKGSLKLYRQGSKSH 278
Query: 241 LRRTNRKIDPSEETVQQIKKSKIWELENELYEYALEQFHFVKKHNLVYNKVLGYEADKGK 300
LR+TN KI P ET+++ + S +W+LENE YE+AL QF +VK L N L + KG+
Sbjct: 279 LRKTNLKIPPKAETIERFRNSTVWQLENEFYEFALRQFEYVKSRTL--NPDL---SGKGQ 333
Query: 301 QFMYEKIYPK 310
QF YEKI PK
Sbjct: 334 QFFYEKIRPK 343
>gi|67460981|sp|O08889.1|HS2ST_CRILO RecName: Full=Heparan sulfate 2-O-sulfotransferase 1;
Short=2-O-sulfotransferase; Short=2OST
gi|2196447|dbj|BAA20422.1| heparan sulfate 2-sulfotransferase [Cricetulus longicaudatus]
Length = 356
Score = 360 bits (925), Expect = 3e-97, Method: Compositional matrix adjust.
Identities = 158/284 (55%), Positives = 214/284 (75%), Gaps = 2/284 (0%)
Query: 27 DTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWR 86
D VIIYNRVPKT STSF N+AYD+C K R++VLH+N T NN V+SL DQ RFV N+T W
Sbjct: 73 DIVIIYNRVPKTASTSFTNIAYDLCAKNRYHVLHINTTKNNPVMSLQDQVRFVKNITTWN 132
Query: 87 DRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKH 146
+ +P YHGH ++DF +FG K++P++IN++R P++RLVSYYYFLR+GD+YRP L R+K
Sbjct: 133 EMKPGFYHGHISYLDFAKFGVKKKPIYINVIRDPIERLVSYYYFLRFGDDYRPGLRRRKQ 192
Query: 147 GDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVG 206
GDK TFDEC+ ++C+ E +WLQ+PF CGH++ CW G+ WA+++AK NL+ +Y LVG
Sbjct: 193 GDKKTFDECVAEGGSDCAPEKLWLQIPFFCGHSSECWNVGSRWAMDQAKYNLINEYFLVG 252
Query: 207 VTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEETVQQIKKSKIWEL 266
VTEEL DF+ LLEAALP FFRG TD + T KSHLR+T K P+++T+ ++++S IW++
Sbjct: 253 VTEELEDFIMLLEAALPRFFRGATDLYRTGKKSHLRKTTEKKLPTKQTIAKLQQSDIWKM 312
Query: 267 ENELYEYALEQFHFVKKHNLVYNKVLGYEADKGKQFMYEKIYPK 310
ENE YE+ALEQF F++ H + + G + F YEKIYPK
Sbjct: 313 ENEFYEFALEQFQFIRAHAV--REKDGDLYILAQNFFYEKIYPK 354
>gi|213688388|ref|NP_001093988.1| heparan sulfate 2-O-sulfotransferase 1 [Rattus norvegicus]
gi|149026134|gb|EDL82377.1| heparan sulfate 2-O-sulfotransferase 1, isoform CRA_a [Rattus
norvegicus]
Length = 356
Score = 360 bits (924), Expect = 4e-97, Method: Compositional matrix adjust.
Identities = 159/284 (55%), Positives = 214/284 (75%), Gaps = 2/284 (0%)
Query: 27 DTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWR 86
D VIIYNRVPKT STSF N+AYD+C K R++VLH+N T NN V+SL DQ RFV N+T W
Sbjct: 73 DIVIIYNRVPKTASTSFTNIAYDLCAKNRYHVLHINTTKNNPVMSLQDQVRFVKNITSWN 132
Query: 87 DRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKH 146
+ +P YHGH ++DF +FG K++P++IN++R P++RLVSYYYFLR+GD+YRP L R+K
Sbjct: 133 EMKPGFYHGHISYLDFAKFGVKKKPIYINVVRDPIERLVSYYYFLRFGDDYRPGLRRRKQ 192
Query: 147 GDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVG 206
GDK TFDEC+ ++C+ E +WLQ+PF CGH++ CW G+ WA+++AK NLV +Y LVG
Sbjct: 193 GDKKTFDECVAEGGSDCAPEKLWLQIPFFCGHSSECWNVGSRWAMDQAKYNLVNEYFLVG 252
Query: 207 VTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEETVQQIKKSKIWEL 266
VTEEL DF+ LLEAALP FFRG TD + T KSHLR+T K P+++T+ ++++S IW++
Sbjct: 253 VTEELEDFIMLLEAALPRFFRGATDLYRTGKKSHLRKTTEKKLPTKQTIAKLQQSDIWKM 312
Query: 267 ENELYEYALEQFHFVKKHNLVYNKVLGYEADKGKQFMYEKIYPK 310
ENE YE+ALEQF F++ H + + G + F YEKIYPK
Sbjct: 313 ENEFYEFALEQFQFMRAHAV--REKDGDLYILAQNFFYEKIYPK 354
>gi|332030058|gb|EGI69883.1| Heparan sulfate 2-O-sulfotransferase 1 [Acromyrmex echinatior]
Length = 344
Score = 360 bits (923), Expect = 7e-97, Method: Compositional matrix adjust.
Identities = 161/284 (56%), Positives = 209/284 (73%), Gaps = 5/284 (1%)
Query: 27 DTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWR 86
D ++IYNRVPKT STSF+ + YD+C++ +++VLH+NVT N H L+ A+Q +F NN++ W
Sbjct: 66 DVIVIYNRVPKTASTSFMGLVYDLCKQNKYHVLHINVTNNMHTLTFANQIQFANNISNWN 125
Query: 87 DRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKH 146
+PA YHGH F++F +FG+K PL+IN+LRKPLDR +SYYYFLRYGDN+RPH++RKKH
Sbjct: 126 SIKPAFYHGHMAFLNFGKFGTKRMPLYINLLRKPLDRFISYYYFLRYGDNFRPHVIRKKH 185
Query: 147 GDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVG 206
GD TFDECI + +C NMWLQ+PFLCGH ACW GN WALE+AK NL Y LVG
Sbjct: 186 GDTKTFDECINSGQPDCDPNNMWLQIPFLCGHDPACWEIGNSWALEEAKRNLQRYYFLVG 245
Query: 207 VTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEETVQQIKKSKIWEL 266
VTEEL +FV +LE LP FFRG + FL +NKSHLR+T +K++P ETV I++S IW++
Sbjct: 246 VTEELNEFVEVLENVLPRFFRGAYNFFLHNNKSHLRQTTQKLNPLPETVDIIQQSVIWKM 305
Query: 267 ENELYEYALEQFHFVKKHNLVYNKVLGYEADKGKQFMYEKIYPK 310
ENE Y +ALE FH VK+ + D ++FMYEKI PK
Sbjct: 306 ENEFYNFALEHFHAVKRR-----LINASPQDVNQRFMYEKIRPK 344
>gi|344278938|ref|XP_003411248.1| PREDICTED: heparan sulfate 2-O-sulfotransferase 1 [Loxodonta
africana]
Length = 356
Score = 359 bits (922), Expect = 7e-97, Method: Compositional matrix adjust.
Identities = 157/284 (55%), Positives = 215/284 (75%), Gaps = 2/284 (0%)
Query: 27 DTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWR 86
D VIIYNRVPKT STSF N+AYD+C K +++VLH+N T NN V+SL DQ RFV N+T W+
Sbjct: 73 DMVIIYNRVPKTASTSFTNIAYDLCAKNKYHVLHINTTKNNPVMSLQDQVRFVKNITSWK 132
Query: 87 DRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKH 146
+ +P YHGH ++DF +FG K++P++IN++R P++RLVSYYYFLR+GD+YRP L R+K
Sbjct: 133 EMKPGFYHGHVSYLDFAKFGVKKKPIYINVIRDPIERLVSYYYFLRFGDDYRPGLRRRKQ 192
Query: 147 GDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVG 206
GDK TFDEC+ ++C+ E +WLQ+PF CGH++ CW G+ WA+E+AK NL+ +Y LVG
Sbjct: 193 GDKKTFDECVAEGGSDCAPEKLWLQIPFFCGHSSECWNVGSRWAMEQAKYNLINEYFLVG 252
Query: 207 VTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEETVQQIKKSKIWEL 266
VTEEL DF+ LLEAALP FFRG T+ + T KSHLR+T K P+++T+ ++++S IW++
Sbjct: 253 VTEELEDFIMLLEAALPRFFRGATELYRTGKKSHLRKTTEKKLPTKQTIAKLQQSDIWKM 312
Query: 267 ENELYEYALEQFHFVKKHNLVYNKVLGYEADKGKQFMYEKIYPK 310
ENE YE+ALEQF F++ H + + G + F YEKIYPK
Sbjct: 313 ENEFYEFALEQFQFIRAHAV--REKDGDLYILTQNFFYEKIYPK 354
>gi|386781197|ref|NP_001247844.1| heparan sulfate 2-O-sulfotransferase 1 [Macaca mulatta]
gi|402855143|ref|XP_003892199.1| PREDICTED: heparan sulfate 2-O-sulfotransferase 1 [Papio anubis]
gi|355558145|gb|EHH14925.1| hypothetical protein EGK_00936 [Macaca mulatta]
gi|355745432|gb|EHH50057.1| hypothetical protein EGM_00821 [Macaca fascicularis]
gi|380818276|gb|AFE81012.1| heparan sulfate 2-O-sulfotransferase 1 isoform 1 [Macaca mulatta]
gi|383423113|gb|AFH34770.1| heparan sulfate 2-O-sulfotransferase 1 isoform 1 [Macaca mulatta]
gi|384950544|gb|AFI38877.1| heparan sulfate 2-O-sulfotransferase 1 isoform 1 [Macaca mulatta]
Length = 356
Score = 359 bits (922), Expect = 8e-97, Method: Compositional matrix adjust.
Identities = 157/284 (55%), Positives = 215/284 (75%), Gaps = 2/284 (0%)
Query: 27 DTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWR 86
D VIIYNRVPKT STSF N+AYD+C K +++VLH+N T NN V+SL DQ RFV N+T W+
Sbjct: 73 DMVIIYNRVPKTASTSFTNIAYDLCAKNKYHVLHINTTKNNPVMSLQDQVRFVKNITSWK 132
Query: 87 DRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKH 146
+ +P YHGH ++DF +FG K++P++IN++R P++RLVSYYYFLR+GD+YRP L R+K
Sbjct: 133 EMKPGFYHGHVSYLDFAKFGVKKKPIYINVIRDPIERLVSYYYFLRFGDDYRPGLRRRKQ 192
Query: 147 GDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVG 206
GDK TFDEC+ ++C+ E +WLQ+PF CGH++ CW G+ WA+++AK NLV +Y LVG
Sbjct: 193 GDKKTFDECVAEGGSDCAPEKLWLQIPFFCGHSSECWNVGSRWAMDQAKYNLVNEYFLVG 252
Query: 207 VTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEETVQQIKKSKIWEL 266
VTEEL DF+ LLEAALP FFRG T+ + T KSHLR+T K P+++T+ ++++S IW++
Sbjct: 253 VTEELEDFIMLLEAALPRFFRGATELYRTGKKSHLRKTTEKKLPTKQTIAKLQQSDIWKM 312
Query: 267 ENELYEYALEQFHFVKKHNLVYNKVLGYEADKGKQFMYEKIYPK 310
ENE YE+ALEQF F++ H + + G + F YEKIYPK
Sbjct: 313 ENEFYEFALEQFQFIRAHAV--REKDGDLYILAQNFFYEKIYPK 354
>gi|432107071|gb|ELK32503.1| Heparan sulfate 2-O-sulfotransferase 1 [Myotis davidii]
Length = 298
Score = 359 bits (922), Expect = 8e-97, Method: Compositional matrix adjust.
Identities = 157/284 (55%), Positives = 215/284 (75%), Gaps = 2/284 (0%)
Query: 27 DTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWR 86
D VIIYNRVPKT STSF N+AYD+C K +++VLH+N T NN V+SL DQ RFV N+T W+
Sbjct: 15 DMVIIYNRVPKTASTSFTNIAYDLCAKNKYHVLHINTTKNNPVMSLQDQVRFVKNITSWK 74
Query: 87 DRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKH 146
+ +P YHGH ++DF +FG K++P++IN++R P++RLVSYYYFLR+GD+YRP L R+K
Sbjct: 75 EMKPGFYHGHISYLDFAKFGVKKKPIYINVIRDPIERLVSYYYFLRFGDDYRPGLRRRKQ 134
Query: 147 GDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVG 206
GDK TFDEC+ ++C+ E +WLQ+PF CGH++ CW G+ WA+++AK NL+ +Y LVG
Sbjct: 135 GDKKTFDECVAEGGSDCAPEKLWLQIPFFCGHSSECWNVGSRWAMDQAKYNLINEYFLVG 194
Query: 207 VTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEETVQQIKKSKIWEL 266
VTEEL DF+ LLEAALP FFRG TD + T KSHLR+T K P+++T+ ++++S IW++
Sbjct: 195 VTEELEDFIMLLEAALPRFFRGATDLYRTGKKSHLRKTTEKKLPTKQTIAKLQQSDIWKM 254
Query: 267 ENELYEYALEQFHFVKKHNLVYNKVLGYEADKGKQFMYEKIYPK 310
ENE YE+ALEQF F++ H + + G + F YEKIYPK
Sbjct: 255 ENEFYEFALEQFQFIRAHAV--REKDGDLYILAQNFFYEKIYPK 296
>gi|126306112|ref|XP_001362263.1| PREDICTED: heparan sulfate 2-O-sulfotransferase 1 [Monodelphis
domestica]
Length = 356
Score = 359 bits (922), Expect = 9e-97, Method: Compositional matrix adjust.
Identities = 160/304 (52%), Positives = 219/304 (72%), Gaps = 3/304 (0%)
Query: 7 HQIHISSAKSPSPETDSLSWDTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGN 66
Q H P D D VIIYNRVPKT STSF N+AYD+C K +++VLH+N T N
Sbjct: 54 EQRHTMDGPRPDAALDE-EEDMVIIYNRVPKTASTSFTNIAYDLCAKNKYHVLHINTTKN 112
Query: 67 NHVLSLADQYRFVNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVS 126
N V+SL DQ RFV N+T W++ +P YHGH ++DF +FG K++P++IN++R P++RLVS
Sbjct: 113 NPVMSLQDQVRFVKNITSWKEMKPGFYHGHVSYLDFAKFGVKKKPIYINVIRDPIERLVS 172
Query: 127 YYYFLRYGDNYRPHLVRKKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPG 186
YYYFLR+GD+YRP L R+K GDK TFDEC+ ++C+ E +WLQ+PF CGH++ CW G
Sbjct: 173 YYYFLRFGDDYRPGLRRRKQGDKKTFDECVAEGGSDCAPEKLWLQIPFFCGHSSECWNVG 232
Query: 187 NPWALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNR 246
+ WA+++AK NL+ +Y LVGVTEEL DF+ LLEAALP FFRG T+ + T KSHLR+T
Sbjct: 233 SRWAMDQAKYNLINEYFLVGVTEELEDFIMLLEAALPRFFRGATELYRTGKKSHLRKTTE 292
Query: 247 KIDPSEETVQQIKKSKIWELENELYEYALEQFHFVKKHNLVYNKVLGYEADKGKQFMYEK 306
K P+++T+ ++++S IW++ENE YE+ALEQF F++ H + + G + F YEK
Sbjct: 293 KKLPTKQTIAKLQQSDIWKMENEFYEFALEQFQFIRAHAV--REKDGDLYILSQNFFYEK 350
Query: 307 IYPK 310
IYPK
Sbjct: 351 IYPK 354
>gi|395530585|ref|XP_003767371.1| PREDICTED: heparan sulfate 2-O-sulfotransferase 1 [Sarcophilus
harrisii]
Length = 356
Score = 359 bits (921), Expect = 9e-97, Method: Compositional matrix adjust.
Identities = 160/304 (52%), Positives = 219/304 (72%), Gaps = 3/304 (0%)
Query: 7 HQIHISSAKSPSPETDSLSWDTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGN 66
Q H P D D VIIYNRVPKT STSF N+AYD+C K +++VLH+N T N
Sbjct: 54 EQRHTMDGPRPDAALDE-EEDMVIIYNRVPKTASTSFTNIAYDLCAKNKYHVLHINTTKN 112
Query: 67 NHVLSLADQYRFVNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVS 126
N V+SL DQ RFV N+T W++ +P YHGH ++DF +FG K++P++IN++R P++RLVS
Sbjct: 113 NPVMSLQDQVRFVKNITSWKEMKPGFYHGHVSYLDFAKFGVKKKPIYINVIRDPIERLVS 172
Query: 127 YYYFLRYGDNYRPHLVRKKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPG 186
YYYFLR+GD+YRP L R+K GDK TFDEC+ ++C+ E +WLQ+PF CGH++ CW G
Sbjct: 173 YYYFLRFGDDYRPGLRRRKQGDKKTFDECVAEGGSDCAPEKLWLQIPFFCGHSSECWNVG 232
Query: 187 NPWALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNR 246
+ WA+++AK NL+ +Y LVGVTEEL DF+ LLEAALP FFRG T+ + T KSHLR+T
Sbjct: 233 SRWAMDQAKYNLINEYFLVGVTEELEDFIMLLEAALPRFFRGATELYRTGKKSHLRKTTE 292
Query: 247 KIDPSEETVQQIKKSKIWELENELYEYALEQFHFVKKHNLVYNKVLGYEADKGKQFMYEK 306
K P+++T+ ++++S IW++ENE YE+ALEQF F++ H + + G + F YEK
Sbjct: 293 KKLPTKQTIAKLQQSDIWKMENEFYEFALEQFQFIRAHAV--REKDGDLYILSQNFFYEK 350
Query: 307 IYPK 310
IYPK
Sbjct: 351 IYPK 354
>gi|193785742|dbj|BAG51177.1| unnamed protein product [Homo sapiens]
Length = 356
Score = 359 bits (921), Expect = 9e-97, Method: Compositional matrix adjust.
Identities = 156/284 (54%), Positives = 215/284 (75%), Gaps = 2/284 (0%)
Query: 27 DTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWR 86
D VIIYNRVPKT STSF N+AYD+C K +++VLH+N T NN V+SL DQ RFV N+T W+
Sbjct: 73 DMVIIYNRVPKTASTSFTNIAYDLCAKNKYHVLHINTTKNNPVMSLQDQVRFVKNITSWK 132
Query: 87 DRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKH 146
+ +P YHGH ++DF +FG K++P++IN++R P++RLVSYYYFLR+GD+YRP L R+K
Sbjct: 133 EMKPGFYHGHVSYLDFAKFGVKKKPIYINVIRDPIERLVSYYYFLRFGDDYRPGLRRRKQ 192
Query: 147 GDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVG 206
GDK TFDEC+ ++C+ E +WLQ+PF CGH++ CW G+ WA+++AK NL+ +Y LVG
Sbjct: 193 GDKKTFDECVAEGGSDCAPEKLWLQIPFFCGHSSECWNVGSRWAMDQAKYNLINEYFLVG 252
Query: 207 VTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEETVQQIKKSKIWEL 266
VTEEL DF+ LLEAALP FFRG T+ + T KSHLR+T K P+++T+ ++++S IW++
Sbjct: 253 VTEELEDFIMLLEAALPRFFRGATELYRTGKKSHLRKTTEKKLPTKQTIAKLQQSDIWKM 312
Query: 267 ENELYEYALEQFHFVKKHNLVYNKVLGYEADKGKQFMYEKIYPK 310
ENE YE+ALEQF F++ H + + G + F YEKIYPK
Sbjct: 313 ENEFYEFALEQFQFIRAHAV--REKDGDLYILAQNFFYEKIYPK 354
>gi|410967647|ref|XP_003990329.1| PREDICTED: heparan sulfate 2-O-sulfotransferase 1 [Felis catus]
Length = 356
Score = 359 bits (921), Expect = 1e-96, Method: Compositional matrix adjust.
Identities = 156/284 (54%), Positives = 215/284 (75%), Gaps = 2/284 (0%)
Query: 27 DTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWR 86
D VIIYNRVPKT STSF N+AYD+C K +++VLH+N T NN V+SL DQ RFV N+T W+
Sbjct: 73 DMVIIYNRVPKTASTSFTNIAYDLCAKNKYHVLHINTTKNNPVMSLQDQVRFVKNITSWK 132
Query: 87 DRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKH 146
+ +P YHGH ++DF +FG K++P++IN++R P++RLVSYYYFLR+GD+YRP L R+K
Sbjct: 133 EMKPGFYHGHVSYLDFAKFGVKKKPIYINVIRDPIERLVSYYYFLRFGDDYRPGLRRRKQ 192
Query: 147 GDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVG 206
GDK TFDEC+ ++C+ E +WLQ+PF CGH++ CW G+ WA+++AK NL+ +Y LVG
Sbjct: 193 GDKKTFDECVAEGGSDCAPEKLWLQIPFFCGHSSECWNVGSRWAMDQAKYNLINEYFLVG 252
Query: 207 VTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEETVQQIKKSKIWEL 266
VTEEL DF+ LLEAALP FFRG T+ + T KSHLR+T K P+++T+ ++++S IW++
Sbjct: 253 VTEELEDFIMLLEAALPRFFRGATELYRTGKKSHLRKTTEKKLPTKQTIAKLQQSDIWKM 312
Query: 267 ENELYEYALEQFHFVKKHNLVYNKVLGYEADKGKQFMYEKIYPK 310
ENE YE+ALEQF F++ H + + G + F YEKIYPK
Sbjct: 313 ENEFYEFALEQFQFIRAHAV--REKDGDLYILAQNFFYEKIYPK 354
>gi|67461002|sp|Q5R621.1|HS2ST_PONAB RecName: Full=Heparan sulfate 2-O-sulfotransferase 1;
Short=2-O-sulfotransferase; Short=2OST
gi|55732182|emb|CAH92795.1| hypothetical protein [Pongo abelii]
Length = 356
Score = 359 bits (921), Expect = 1e-96, Method: Compositional matrix adjust.
Identities = 156/284 (54%), Positives = 215/284 (75%), Gaps = 2/284 (0%)
Query: 27 DTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWR 86
D VIIYNRVPKT STSF N+AYD+C K +++VLH+N T NN V+SL DQ RFV N+T W+
Sbjct: 73 DMVIIYNRVPKTASTSFTNIAYDLCAKNKYHVLHINTTKNNPVMSLQDQMRFVKNITSWK 132
Query: 87 DRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKH 146
+ +P YHGH ++DF +FG K++P++IN++R P++RLVSYYYFLR+GD+YRP L R+K
Sbjct: 133 EMKPGFYHGHVSYLDFAKFGVKKKPIYINVIRDPIERLVSYYYFLRFGDDYRPGLRRRKQ 192
Query: 147 GDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVG 206
GDK TFDEC+ ++C+ E +WLQ+PF CGH++ CW G+ WA+++AK NL+ +Y LVG
Sbjct: 193 GDKKTFDECVAEGGSDCAPEKLWLQIPFFCGHSSECWNVGSRWAMDQAKYNLINEYFLVG 252
Query: 207 VTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEETVQQIKKSKIWEL 266
VTEEL DF+ LLEAALP FFRG T+ + T KSHLR+T K P+++T+ ++++S IW++
Sbjct: 253 VTEELEDFIMLLEAALPRFFRGATELYRTGKKSHLRKTTEKKLPTKQTIAKLQQSDIWKM 312
Query: 267 ENELYEYALEQFHFVKKHNLVYNKVLGYEADKGKQFMYEKIYPK 310
ENE YE+ALEQF F++ H + + G + F YEKIYPK
Sbjct: 313 ENEFYEFALEQFQFIRAHAV--REKDGDLYILAQNFFYEKIYPK 354
>gi|301764583|ref|XP_002917708.1| PREDICTED: heparan sulfate 2-O-sulfotransferase 1-like, partial
[Ailuropoda melanoleuca]
Length = 361
Score = 359 bits (921), Expect = 1e-96, Method: Compositional matrix adjust.
Identities = 156/284 (54%), Positives = 215/284 (75%), Gaps = 2/284 (0%)
Query: 27 DTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWR 86
D VIIYNRVPKT STSF N+AYD+C K +++VLH+N T NN V+SL DQ RFV N+T W+
Sbjct: 78 DMVIIYNRVPKTASTSFTNIAYDLCAKNKYHVLHINTTKNNPVMSLQDQVRFVKNITSWK 137
Query: 87 DRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKH 146
+ +P YHGH ++DF +FG K++P++IN++R P++RLVSYYYFLR+GD+YRP L R+K
Sbjct: 138 EMKPGFYHGHVSYLDFAKFGVKKKPIYINVIRDPIERLVSYYYFLRFGDDYRPGLRRRKQ 197
Query: 147 GDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVG 206
GDK TFDEC+ ++C+ E +WLQ+PF CGH++ CW G+ WA+++AK NL+ +Y LVG
Sbjct: 198 GDKKTFDECVAEGGSDCAPEKLWLQIPFFCGHSSECWNVGSRWAMDQAKYNLINEYFLVG 257
Query: 207 VTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEETVQQIKKSKIWEL 266
VTEEL DF+ LLEAALP FFRG T+ + T KSHLR+T K P+++T+ ++++S IW++
Sbjct: 258 VTEELEDFIMLLEAALPRFFRGATELYRTGKKSHLRKTTEKKLPTKQTIAKLQQSDIWKM 317
Query: 267 ENELYEYALEQFHFVKKHNLVYNKVLGYEADKGKQFMYEKIYPK 310
ENE YE+ALEQF F++ H + + G + F YEKIYPK
Sbjct: 318 ENEFYEFALEQFQFIRAHAV--REKDGDLYILAQNFFYEKIYPK 359
>gi|338725475|ref|XP_001494608.2| PREDICTED: heparan sulfate 2-O-sulfotransferase 1 [Equus caballus]
Length = 356
Score = 359 bits (921), Expect = 1e-96, Method: Compositional matrix adjust.
Identities = 156/284 (54%), Positives = 215/284 (75%), Gaps = 2/284 (0%)
Query: 27 DTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWR 86
D VIIYNRVPKT STSF N+AYD+C K +++VLH+N T NN V+SL DQ RFV N+T W+
Sbjct: 73 DMVIIYNRVPKTASTSFTNIAYDLCAKNKYHVLHINTTKNNPVMSLQDQVRFVKNITSWK 132
Query: 87 DRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKH 146
+ +P YHGH ++DF +FG K++P++IN++R P++RLVSYYYFLR+GD+YRP L R+K
Sbjct: 133 EMKPGFYHGHVSYLDFAKFGVKKKPIYINVIRDPIERLVSYYYFLRFGDDYRPGLRRRKQ 192
Query: 147 GDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVG 206
GDK TFDEC+ ++C+ E +WLQ+PF CGH++ CW G+ WA+++AK NL+ +Y LVG
Sbjct: 193 GDKKTFDECVAEGGSDCAPEKLWLQIPFFCGHSSECWNVGSRWAMDQAKYNLINEYFLVG 252
Query: 207 VTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEETVQQIKKSKIWEL 266
VTEEL DF+ LLEAALP FFRG T+ + T KSHLR+T K P+++T+ ++++S IW++
Sbjct: 253 VTEELEDFIMLLEAALPRFFRGATELYRTGKKSHLRKTTEKKLPTKQTIAKLQQSDIWKM 312
Query: 267 ENELYEYALEQFHFVKKHNLVYNKVLGYEADKGKQFMYEKIYPK 310
ENE YE+ALEQF F++ H + + G + F YEKIYPK
Sbjct: 313 ENEFYEFALEQFQFIRAHAV--REKDGDLYILAQNFFYEKIYPK 354
>gi|6912420|ref|NP_036394.1| heparan sulfate 2-O-sulfotransferase 1 isoform 1 [Homo sapiens]
gi|114557521|ref|XP_524759.2| PREDICTED: heparan sulfate 2-O-sulfotransferase 1 isoform 2 [Pan
troglodytes]
gi|397467288|ref|XP_003805356.1| PREDICTED: heparan sulfate 2-O-sulfotransferase 1 [Pan paniscus]
gi|68052326|sp|Q7LGA3.1|HS2ST_HUMAN RecName: Full=Heparan sulfate 2-O-sulfotransferase 1;
Short=2-O-sulfotransferase; Short=2OST
gi|6683564|dbj|BAA89250.1| heparan sulfate 2-sulfotransferase [Homo sapiens]
gi|119593577|gb|EAW73171.1| heparan sulfate 2-O-sulfotransferase 1, isoform CRA_b [Homo
sapiens]
gi|119593578|gb|EAW73172.1| heparan sulfate 2-O-sulfotransferase 1, isoform CRA_b [Homo
sapiens]
gi|168267372|dbj|BAG09742.1| heparan sulfate 2-O-sulfotransferase 1 [synthetic construct]
gi|410215140|gb|JAA04789.1| heparan sulfate 2-O-sulfotransferase 1 [Pan troglodytes]
gi|410257474|gb|JAA16704.1| heparan sulfate 2-O-sulfotransferase 1 [Pan troglodytes]
gi|410306004|gb|JAA31602.1| heparan sulfate 2-O-sulfotransferase 1 [Pan troglodytes]
gi|410339397|gb|JAA38645.1| heparan sulfate 2-O-sulfotransferase 1 [Pan troglodytes]
Length = 356
Score = 359 bits (921), Expect = 1e-96, Method: Compositional matrix adjust.
Identities = 156/284 (54%), Positives = 215/284 (75%), Gaps = 2/284 (0%)
Query: 27 DTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWR 86
D VIIYNRVPKT STSF N+AYD+C K +++VLH+N T NN V+SL DQ RFV N+T W+
Sbjct: 73 DMVIIYNRVPKTASTSFTNIAYDLCAKNKYHVLHINTTKNNPVMSLQDQVRFVKNITSWK 132
Query: 87 DRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKH 146
+ +P YHGH ++DF +FG K++P++IN++R P++RLVSYYYFLR+GD+YRP L R+K
Sbjct: 133 EMKPGFYHGHVSYLDFAKFGVKKKPIYINVIRDPIERLVSYYYFLRFGDDYRPGLRRRKQ 192
Query: 147 GDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVG 206
GDK TFDEC+ ++C+ E +WLQ+PF CGH++ CW G+ WA+++AK NL+ +Y LVG
Sbjct: 193 GDKKTFDECVAEGGSDCAPEKLWLQIPFFCGHSSECWNVGSRWAMDQAKYNLINEYFLVG 252
Query: 207 VTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEETVQQIKKSKIWEL 266
VTEEL DF+ LLEAALP FFRG T+ + T KSHLR+T K P+++T+ ++++S IW++
Sbjct: 253 VTEELEDFIMLLEAALPRFFRGATELYRTGKKSHLRKTTEKKLPTKQTIAKLQQSDIWKM 312
Query: 267 ENELYEYALEQFHFVKKHNLVYNKVLGYEADKGKQFMYEKIYPK 310
ENE YE+ALEQF F++ H + + G + F YEKIYPK
Sbjct: 313 ENEFYEFALEQFQFIRAHAV--REKDGDLYILAQNFFYEKIYPK 354
>gi|345801724|ref|XP_537087.3| PREDICTED: heparan sulfate 2-O-sulfotransferase 1 [Canis lupus
familiaris]
Length = 356
Score = 359 bits (921), Expect = 1e-96, Method: Compositional matrix adjust.
Identities = 156/284 (54%), Positives = 215/284 (75%), Gaps = 2/284 (0%)
Query: 27 DTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWR 86
D VIIYNRVPKT STSF N+AYD+C K +++VLH+N T NN V+SL DQ RFV N+T W+
Sbjct: 73 DMVIIYNRVPKTASTSFTNIAYDLCAKNKYHVLHINTTKNNPVMSLQDQVRFVKNITSWK 132
Query: 87 DRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKH 146
+ +P YHGH ++DF +FG K++P++IN++R P++RLVSYYYFLR+GD+YRP L R+K
Sbjct: 133 EMKPGFYHGHVSYLDFAKFGVKKKPIYINVIRDPIERLVSYYYFLRFGDDYRPGLRRRKQ 192
Query: 147 GDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVG 206
GDK TFDEC+ ++C+ E +WLQ+PF CGH++ CW G+ WA+++AK NL+ +Y LVG
Sbjct: 193 GDKKTFDECVAEGGSDCAPEKLWLQIPFFCGHSSECWNVGSRWAMDQAKYNLINEYFLVG 252
Query: 207 VTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEETVQQIKKSKIWEL 266
VTEEL DF+ LLEAALP FFRG T+ + T KSHLR+T K P+++T+ ++++S IW++
Sbjct: 253 VTEELEDFIMLLEAALPRFFRGATELYRTGKKSHLRKTTEKKLPTKQTIAKLQQSDIWKM 312
Query: 267 ENELYEYALEQFHFVKKHNLVYNKVLGYEADKGKQFMYEKIYPK 310
ENE YE+ALEQF F++ H + + G + F YEKIYPK
Sbjct: 313 ENEFYEFALEQFQFIRAHAV--REKDGDLYILAQNFFYEKIYPK 354
>gi|123703016|ref|NP_001074139.1| heparan sulfate 2-O-sulfotransferase 1 [Danio rerio]
gi|111609814|gb|ABH11459.1| heparan sulfate 2-O-sulfotransferase 1 [Danio rerio]
gi|120538116|gb|AAI29166.1| Heparan sulfate 2-O-sulfotransferase 1 [Danio rerio]
Length = 354
Score = 359 bits (921), Expect = 1e-96, Method: Compositional matrix adjust.
Identities = 159/284 (55%), Positives = 216/284 (76%), Gaps = 2/284 (0%)
Query: 27 DTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWR 86
DTV+IYNRVPKT STSF N+AYD+C K ++VLH+N + NN V+SL DQ RFV NVT W+
Sbjct: 71 DTVVIYNRVPKTASTSFTNIAYDLCNKNHYHVLHINTSKNNPVMSLQDQVRFVKNVTLWK 130
Query: 87 DRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKH 146
+ +PA YHGH F+DF +FG K++P++IN++R P++RLVSYYYFLR+GD+YRP L R+K
Sbjct: 131 EMKPAFYHGHVSFLDFTKFGVKKKPIYINVIRDPIERLVSYYYFLRFGDDYRPGLRRRKQ 190
Query: 147 GDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVG 206
GDK TFDEC+ ++C+ E +WLQ+PF CGH + CW G+ WALE+AK NLV +Y+LVG
Sbjct: 191 GDKKTFDECVSAGGSDCAPEKLWLQIPFFCGHYSECWNIGSRWALEQAKYNLVNEYMLVG 250
Query: 207 VTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEETVQQIKKSKIWEL 266
VTEEL DFV +LEAALP FF+G T+ + T +SHLR+T+ K P++E++ ++++S IW++
Sbjct: 251 VTEELEDFVMMLEAALPRFFKGATELYKTGKRSHLRKTSEKKPPTKESIARLQQSNIWKM 310
Query: 267 ENELYEYALEQFHFVKKHNLVYNKVLGYEADKGKQFMYEKIYPK 310
ENE YE+ALEQF +V+ H + Y + F YEKIYPK
Sbjct: 311 ENEFYEFALEQFQYVRAHAVREKDGELYLLTQN--FFYEKIYPK 352
>gi|40788265|dbj|BAA32293.2| KIAA0448 protein [Homo sapiens]
Length = 362
Score = 358 bits (920), Expect = 1e-96, Method: Compositional matrix adjust.
Identities = 156/284 (54%), Positives = 215/284 (75%), Gaps = 2/284 (0%)
Query: 27 DTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWR 86
D VIIYNRVPKT STSF N+AYD+C K +++VLH+N T NN V+SL DQ RFV N+T W+
Sbjct: 79 DMVIIYNRVPKTASTSFTNIAYDLCAKNKYHVLHINTTKNNPVMSLQDQVRFVKNITSWK 138
Query: 87 DRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKH 146
+ +P YHGH ++DF +FG K++P++IN++R P++RLVSYYYFLR+GD+YRP L R+K
Sbjct: 139 EMKPGFYHGHVSYLDFAKFGVKKKPIYINVIRDPIERLVSYYYFLRFGDDYRPGLRRRKQ 198
Query: 147 GDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVG 206
GDK TFDEC+ ++C+ E +WLQ+PF CGH++ CW G+ WA+++AK NL+ +Y LVG
Sbjct: 199 GDKKTFDECVAEGGSDCAPEKLWLQIPFFCGHSSECWNVGSRWAMDQAKYNLINEYFLVG 258
Query: 207 VTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEETVQQIKKSKIWEL 266
VTEEL DF+ LLEAALP FFRG T+ + T KSHLR+T K P+++T+ ++++S IW++
Sbjct: 259 VTEELEDFIMLLEAALPRFFRGATELYRTGKKSHLRKTTEKKLPTKQTIAKLQQSDIWKM 318
Query: 267 ENELYEYALEQFHFVKKHNLVYNKVLGYEADKGKQFMYEKIYPK 310
ENE YE+ALEQF F++ H + + G + F YEKIYPK
Sbjct: 319 ENEFYEFALEQFQFIRAHAV--REKDGDLYILAQNFFYEKIYPK 360
>gi|296208423|ref|XP_002751085.1| PREDICTED: heparan sulfate 2-O-sulfotransferase 1 [Callithrix
jacchus]
Length = 356
Score = 358 bits (920), Expect = 1e-96, Method: Compositional matrix adjust.
Identities = 156/284 (54%), Positives = 215/284 (75%), Gaps = 2/284 (0%)
Query: 27 DTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWR 86
D VIIYNRVPKT STSF N+AYD+C K +++VLH+N T NN V+SL DQ RFV N+T W+
Sbjct: 73 DMVIIYNRVPKTASTSFTNIAYDLCAKNKYHVLHINTTKNNPVMSLQDQVRFVKNITSWK 132
Query: 87 DRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKH 146
+ +P YHGH ++DF +FG K++P++IN++R P++RLVSYYYFLR+GD+YRP L R+K
Sbjct: 133 EMKPGFYHGHVSYLDFTKFGVKKKPIYINVIRDPIERLVSYYYFLRFGDDYRPGLRRRKQ 192
Query: 147 GDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVG 206
GDK TFDEC+ ++C+ E +WLQ+PF CGH++ CW G+ WA+++AK NL+ +Y LVG
Sbjct: 193 GDKKTFDECVAEGGSDCAPEKLWLQIPFFCGHSSECWNVGSRWAMDQAKYNLINEYFLVG 252
Query: 207 VTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEETVQQIKKSKIWEL 266
VTEEL DF+ LLEAALP FFRG T+ + T KSHLR+T K P+++T+ ++++S IW++
Sbjct: 253 VTEELEDFIMLLEAALPRFFRGATELYRTGKKSHLRKTTEKKLPTKQTIAKLQQSDIWKM 312
Query: 267 ENELYEYALEQFHFVKKHNLVYNKVLGYEADKGKQFMYEKIYPK 310
ENE YE+ALEQF F++ H + + G + F YEKIYPK
Sbjct: 313 ENEFYEFALEQFQFIRAHAV--REKDGDLYILAQNFFYEKIYPK 354
>gi|417410081|gb|JAA51518.1| Putative heparan sulfate 2-o-sulfotransferase 1, partial [Desmodus
rotundus]
Length = 362
Score = 358 bits (920), Expect = 1e-96, Method: Compositional matrix adjust.
Identities = 156/284 (54%), Positives = 215/284 (75%), Gaps = 2/284 (0%)
Query: 27 DTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWR 86
D VIIYNRVPKT STSF N+AYD+C K +++VLH+N T NN V+SL DQ RFV N+T W+
Sbjct: 79 DMVIIYNRVPKTASTSFTNIAYDLCAKNKYHVLHINTTKNNPVMSLQDQVRFVKNITSWK 138
Query: 87 DRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKH 146
+ +P YHGH ++DF +FG K++P++IN++R P++RLVSYYYFLR+GD+YRP L R+K
Sbjct: 139 EMKPGFYHGHVSYLDFAKFGVKKKPIYINVIRDPIERLVSYYYFLRFGDDYRPGLRRRKQ 198
Query: 147 GDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVG 206
GDK TFDEC+ ++C+ E +WLQ+PF CGH++ CW G+ WA+++AK NL+ +Y LVG
Sbjct: 199 GDKKTFDECVAEGGSDCAPEKLWLQIPFFCGHSSECWNVGSRWAMDQAKYNLINEYFLVG 258
Query: 207 VTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEETVQQIKKSKIWEL 266
VTEEL DF+ LLEAALP FFRG T+ + T KSHLR+T K P+++T+ ++++S IW++
Sbjct: 259 VTEELEDFIMLLEAALPRFFRGATELYRTGKKSHLRKTTEKKLPTKQTIAKLQQSDIWKM 318
Query: 267 ENELYEYALEQFHFVKKHNLVYNKVLGYEADKGKQFMYEKIYPK 310
ENE YE+ALEQF F++ H + + G + F YEKIYPK
Sbjct: 319 ENEFYEFALEQFQFIRAHAV--REKDGDLYILAQNFFYEKIYPK 360
>gi|335287397|ref|XP_001925527.2| PREDICTED: heparan sulfate 2-O-sulfotransferase 1 isoform 1 [Sus
scrofa]
Length = 356
Score = 358 bits (920), Expect = 1e-96, Method: Compositional matrix adjust.
Identities = 157/284 (55%), Positives = 215/284 (75%), Gaps = 2/284 (0%)
Query: 27 DTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWR 86
D VIIYNRVPKT STSF N+AYD+C K +++VLH+N T NN V+SL DQ RFV N+T W+
Sbjct: 73 DMVIIYNRVPKTASTSFTNIAYDLCAKNKYHVLHINTTKNNPVMSLQDQVRFVKNITSWK 132
Query: 87 DRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKH 146
+ +P YHGH ++DF +FG K++P++IN++R P++RLVSYYYFLR+GD+YRP L R+K
Sbjct: 133 EMKPGFYHGHVSYLDFAKFGVKKKPIYINVVRDPIERLVSYYYFLRFGDDYRPGLRRRKQ 192
Query: 147 GDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVG 206
GDK TFDEC+ ++C+ E +WLQ+PF CGH++ CW G+ WA+++AK NL+ +Y LVG
Sbjct: 193 GDKKTFDECVAEGGSDCAPEKLWLQIPFFCGHSSECWNVGSRWAMDQAKYNLINEYFLVG 252
Query: 207 VTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEETVQQIKKSKIWEL 266
VTEEL DFV LLEAALP FFRG T+ + T KSHLR+T K P+++T+ ++++S IW++
Sbjct: 253 VTEELEDFVMLLEAALPRFFRGATELYRTGKKSHLRKTTEKKLPTKQTIAKLQQSHIWKM 312
Query: 267 ENELYEYALEQFHFVKKHNLVYNKVLGYEADKGKQFMYEKIYPK 310
ENE YE+ALEQF F++ H + + G + F YEKIYPK
Sbjct: 313 ENEFYEFALEQFQFIRAHAV--REKDGDLFILAQNFFYEKIYPK 354
>gi|403305531|ref|XP_003943315.1| PREDICTED: heparan sulfate 2-O-sulfotransferase 1 [Saimiri
boliviensis boliviensis]
Length = 356
Score = 358 bits (920), Expect = 1e-96, Method: Compositional matrix adjust.
Identities = 156/284 (54%), Positives = 215/284 (75%), Gaps = 2/284 (0%)
Query: 27 DTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWR 86
D VIIYNRVPKT STSF N+AYD+C K +++VLH+N T NN V+SL DQ RFV N+T W+
Sbjct: 73 DMVIIYNRVPKTASTSFTNIAYDLCAKNKYHVLHINTTKNNPVMSLQDQVRFVKNITSWK 132
Query: 87 DRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKH 146
+ +P YHGH ++DF +FG K++P++IN++R P++RLVSYYYFLR+GD+YRP L R+K
Sbjct: 133 EMKPGFYHGHVSYLDFTKFGVKKKPIYINVIRDPIERLVSYYYFLRFGDDYRPGLRRRKQ 192
Query: 147 GDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVG 206
GDK TFDEC+ ++C+ E +WLQ+PF CGH++ CW G+ WA+++AK NL+ +Y LVG
Sbjct: 193 GDKKTFDECVAEGGSDCAPEKLWLQIPFFCGHSSECWNVGSRWAMDQAKYNLINEYFLVG 252
Query: 207 VTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEETVQQIKKSKIWEL 266
VTEEL DF+ LLEAALP FFRG T+ + T KSHLR+T K P+++T+ ++++S IW++
Sbjct: 253 VTEELEDFIMLLEAALPRFFRGATELYRTGKKSHLRKTTEKKLPTKQTIAKLQQSDIWKM 312
Query: 267 ENELYEYALEQFHFVKKHNLVYNKVLGYEADKGKQFMYEKIYPK 310
ENE YE+ALEQF F++ H + + G + F YEKIYPK
Sbjct: 313 ENEFYEFALEQFQFIRAHAV--REKDGDLYILAQNFFYEKIYPK 354
>gi|395821811|ref|XP_003784225.1| PREDICTED: heparan sulfate 2-O-sulfotransferase 1 [Otolemur
garnettii]
Length = 356
Score = 358 bits (920), Expect = 1e-96, Method: Compositional matrix adjust.
Identities = 156/284 (54%), Positives = 215/284 (75%), Gaps = 2/284 (0%)
Query: 27 DTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWR 86
D VIIYNRVPKT STSF N+AYD+C K +++VLH+N T NN V+SL DQ RFV N+T W+
Sbjct: 73 DMVIIYNRVPKTASTSFTNIAYDLCAKNKYHVLHINTTKNNPVMSLQDQVRFVKNITSWK 132
Query: 87 DRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKH 146
+ +P YHGH ++DF +FG K++P++IN++R P++RLVSYYYFLR+GD+YRP L R+K
Sbjct: 133 EMKPGFYHGHVSYLDFAKFGVKKKPIYINVIRDPIERLVSYYYFLRFGDDYRPGLRRRKQ 192
Query: 147 GDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVG 206
GDK TFDEC+ ++C+ E +WLQ+PF CGH++ CW G+ WA+++AK NL+ +Y LVG
Sbjct: 193 GDKKTFDECVAEGGSDCAPEKLWLQIPFFCGHSSECWNVGSRWAMDQAKYNLINEYFLVG 252
Query: 207 VTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEETVQQIKKSKIWEL 266
VTEEL DF+ LLEAALP FFRG T+ + T KSHLR+T K P+++T+ ++++S IW++
Sbjct: 253 VTEELEDFIMLLEAALPRFFRGATELYRTGKKSHLRKTTEKKLPTKQTIAKLQQSDIWKM 312
Query: 267 ENELYEYALEQFHFVKKHNLVYNKVLGYEADKGKQFMYEKIYPK 310
ENE YE+ALEQF F++ H + + G + F YEKIYPK
Sbjct: 313 ENEFYEFALEQFQFIRAHAV--REKDGDLYILAQNFFYEKIYPK 354
>gi|297473014|ref|XP_002686351.1| PREDICTED: heparan sulfate 2-O-sulfotransferase 1 [Bos taurus]
gi|296489245|tpg|DAA31358.1| TPA: heparan sulfate 2-O-sulfotransferase 1 [Bos taurus]
Length = 370
Score = 358 bits (920), Expect = 1e-96, Method: Compositional matrix adjust.
Identities = 156/284 (54%), Positives = 215/284 (75%), Gaps = 2/284 (0%)
Query: 27 DTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWR 86
D VIIYNRVPKT STSF N+AYD+C K +++VLH+N T NN V+SL DQ RFV N+T W+
Sbjct: 87 DMVIIYNRVPKTASTSFTNIAYDLCAKNKYHVLHINTTKNNPVMSLQDQVRFVKNITSWK 146
Query: 87 DRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKH 146
+ +P YHGH ++DF +FG K++P++IN++R P++RLVSYYYFLR+GD+YRP L R+K
Sbjct: 147 EMKPGFYHGHISYLDFAKFGVKKKPIYINVIRDPIERLVSYYYFLRFGDDYRPGLRRRKQ 206
Query: 147 GDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVG 206
GDK TFDEC+ ++C+ E +WLQ+PF CGH++ CW G+ WA+++AK NL+ +Y LVG
Sbjct: 207 GDKKTFDECVAEGGSDCAPEKLWLQIPFFCGHSSECWNVGSRWAMDQAKYNLINEYFLVG 266
Query: 207 VTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEETVQQIKKSKIWEL 266
VTEEL DF+ LLEAALP FFRG T+ + T KSHLR+T K P+++T+ ++++S IW++
Sbjct: 267 VTEELEDFIMLLEAALPRFFRGATELYRTGKKSHLRKTTEKKLPTKQTIAKLQQSDIWKM 326
Query: 267 ENELYEYALEQFHFVKKHNLVYNKVLGYEADKGKQFMYEKIYPK 310
ENE YE+ALEQF F++ H + + G + F YEKIYPK
Sbjct: 327 ENEFYEFALEQFQFIRAHAV--REKDGDLYILAQNFFYEKIYPK 368
>gi|3228536|gb|AAC40135.1| heparan sulfate 2-sulfotransferase [Mus musculus]
Length = 356
Score = 358 bits (919), Expect = 2e-96, Method: Compositional matrix adjust.
Identities = 156/284 (54%), Positives = 213/284 (75%), Gaps = 2/284 (0%)
Query: 27 DTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWR 86
D +IIYNRVPKT STSF N+AYD+C K R++VLH+N T NN V+SL DQ RFV N+T W
Sbjct: 73 DIIIIYNRVPKTASTSFTNIAYDLCAKNRYHVLHINTTKNNPVMSLQDQVRFVKNITTWN 132
Query: 87 DRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKH 146
+ +P YHGH ++DF +FG K++P++IN++R P++RLVSYYYFLR+GD+YRP L R+K
Sbjct: 133 EMKPGFYHGHISYLDFAKFGVKKKPIYINVIRDPIERLVSYYYFLRFGDDYRPGLRRRKQ 192
Query: 147 GDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVG 206
GDK TFDEC+ ++C+ E +WLQ+PF CGH++ CW G+ WA+++AK NL+ +Y LVG
Sbjct: 193 GDKKTFDECVAEGGSDCAPEKLWLQIPFFCGHSSECWNVGSRWAMDQAKSNLINEYFLVG 252
Query: 207 VTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEETVQQIKKSKIWEL 266
VTEEL DF+ LLEAALP FRG TD + T KSHLR+T K P+++T+ ++++S IW++
Sbjct: 253 VTEELEDFIMLLEAALPRVFRGATDLYRTGKKSHLRKTTEKKLPTKQTIAKLQQSDIWKM 312
Query: 267 ENELYEYALEQFHFVKKHNLVYNKVLGYEADKGKQFMYEKIYPK 310
ENE YE+ALEQF F++ H + + G + F YEKIYPK
Sbjct: 313 ENEFYEFALEQFQFIRAHAV--REKDGDLYILAQNFFYEKIYPK 354
>gi|348586164|ref|XP_003478839.1| PREDICTED: heparan sulfate 2-O-sulfotransferase 1-like [Cavia
porcellus]
Length = 356
Score = 358 bits (918), Expect = 2e-96, Method: Compositional matrix adjust.
Identities = 156/284 (54%), Positives = 214/284 (75%), Gaps = 2/284 (0%)
Query: 27 DTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWR 86
D VIIYNRVPKT STSF N+AYD+C K +++VLH+N T NN V+SL DQ RFV N+T W+
Sbjct: 73 DMVIIYNRVPKTASTSFTNIAYDLCAKNKYHVLHINTTKNNPVMSLQDQVRFVKNITSWK 132
Query: 87 DRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKH 146
+ +P YHGH ++DF +FG K++P++IN++R P++RLVSYYYFLR+GD+YRP L R+K
Sbjct: 133 EMKPGFYHGHISYLDFAKFGVKKKPIYINVIRDPIERLVSYYYFLRFGDDYRPGLRRRKQ 192
Query: 147 GDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVG 206
GDK TFDEC+ ++C+ E +WLQ+PF CGH++ CW G+ WA+++AK NL+ +Y LVG
Sbjct: 193 GDKKTFDECVAEGGSDCAPEKLWLQIPFFCGHSSECWNVGSRWAMDQAKYNLINEYFLVG 252
Query: 207 VTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEETVQQIKKSKIWEL 266
VTEEL DF+ LLEAALP FFRG T+ + T KSHLR+T K P+++T+ ++++S IW++
Sbjct: 253 VTEELEDFIMLLEAALPRFFRGATELYRTGKKSHLRKTTEKKLPTKQTIAKLQQSDIWKM 312
Query: 267 ENELYEYALEQFHFVKKHNLVYNKVLGYEADKGKQFMYEKIYPK 310
ENE YE+ALEQF F++ H + Y + F YEKIYPK
Sbjct: 313 ENEFYEFALEQFQFIRAHAVREKDGDLYILTQN--FFYEKIYPK 354
>gi|440912101|gb|ELR61700.1| Heparan sulfate 2-O-sulfotransferase 1, partial [Bos grunniens
mutus]
Length = 315
Score = 357 bits (917), Expect = 3e-96, Method: Compositional matrix adjust.
Identities = 156/284 (54%), Positives = 215/284 (75%), Gaps = 2/284 (0%)
Query: 27 DTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWR 86
D VIIYNRVPKT STSF N+AYD+C K +++VLH+N T NN V+SL DQ RFV N+T W+
Sbjct: 32 DMVIIYNRVPKTASTSFTNIAYDLCAKNKYHVLHINTTKNNPVMSLQDQVRFVKNITAWK 91
Query: 87 DRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKH 146
+ +P YHGH ++DF +FG K++P++IN++R P++RLVSYYYFLR+GD+YRP L R+K
Sbjct: 92 EMKPGFYHGHISYLDFAKFGVKKKPIYINVIRDPIERLVSYYYFLRFGDDYRPGLRRRKQ 151
Query: 147 GDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVG 206
GDK TFDEC+ ++C+ E +WLQ+PF CGH++ CW G+ WA+++AK NL+ +Y LVG
Sbjct: 152 GDKKTFDECVAEGGSDCAPEKLWLQIPFFCGHSSECWNVGSRWAMDQAKYNLINEYFLVG 211
Query: 207 VTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEETVQQIKKSKIWEL 266
VTEEL DF+ LLEAALP FFRG T+ + T KSHLR+T K P+++T+ ++++S IW++
Sbjct: 212 VTEELEDFIMLLEAALPRFFRGATELYRTGKKSHLRKTTEKKLPTKQTIAKLQQSDIWKM 271
Query: 267 ENELYEYALEQFHFVKKHNLVYNKVLGYEADKGKQFMYEKIYPK 310
ENE YE+ALEQF F++ H + + G + F YEKIYPK
Sbjct: 272 ENEFYEFALEQFQFIRAHAV--REKDGDLYILAQNFFYEKIYPK 313
>gi|358411477|ref|XP_602084.4| PREDICTED: heparan sulfate 2-O-sulfotransferase 1, partial [Bos
taurus]
Length = 314
Score = 357 bits (917), Expect = 3e-96, Method: Compositional matrix adjust.
Identities = 156/284 (54%), Positives = 215/284 (75%), Gaps = 2/284 (0%)
Query: 27 DTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWR 86
D VIIYNRVPKT STSF N+AYD+C K +++VLH+N T NN V+SL DQ RFV N+T W+
Sbjct: 31 DMVIIYNRVPKTASTSFTNIAYDLCAKNKYHVLHINTTKNNPVMSLQDQVRFVKNITSWK 90
Query: 87 DRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKH 146
+ +P YHGH ++DF +FG K++P++IN++R P++RLVSYYYFLR+GD+YRP L R+K
Sbjct: 91 EMKPGFYHGHISYLDFAKFGVKKKPIYINVIRDPIERLVSYYYFLRFGDDYRPGLRRRKQ 150
Query: 147 GDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVG 206
GDK TFDEC+ ++C+ E +WLQ+PF CGH++ CW G+ WA+++AK NL+ +Y LVG
Sbjct: 151 GDKKTFDECVAEGGSDCAPEKLWLQIPFFCGHSSECWNVGSRWAMDQAKYNLINEYFLVG 210
Query: 207 VTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEETVQQIKKSKIWEL 266
VTEEL DF+ LLEAALP FFRG T+ + T KSHLR+T K P+++T+ ++++S IW++
Sbjct: 211 VTEELEDFIMLLEAALPRFFRGATELYRTGKKSHLRKTTEKKLPTKQTIAKLQQSDIWKM 270
Query: 267 ENELYEYALEQFHFVKKHNLVYNKVLGYEADKGKQFMYEKIYPK 310
ENE YE+ALEQF F++ H + + G + F YEKIYPK
Sbjct: 271 ENEFYEFALEQFQFIRAHAV--REKDGDLYILAQNFFYEKIYPK 312
>gi|426330259|ref|XP_004026138.1| PREDICTED: heparan sulfate 2-O-sulfotransferase 1, partial [Gorilla
gorilla gorilla]
Length = 314
Score = 357 bits (917), Expect = 3e-96, Method: Compositional matrix adjust.
Identities = 156/284 (54%), Positives = 215/284 (75%), Gaps = 2/284 (0%)
Query: 27 DTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWR 86
D VIIYNRVPKT STSF N+AYD+C K +++VLH+N T NN V+SL DQ RFV N+T W+
Sbjct: 31 DMVIIYNRVPKTASTSFTNIAYDLCAKNKYHVLHINTTKNNPVMSLQDQVRFVKNITSWK 90
Query: 87 DRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKH 146
+ +P YHGH ++DF +FG K++P++IN++R P++RLVSYYYFLR+GD+YRP L R+K
Sbjct: 91 EMKPGFYHGHVSYLDFAKFGVKKKPIYINVIRDPIERLVSYYYFLRFGDDYRPGLRRRKQ 150
Query: 147 GDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVG 206
GDK TFDEC+ ++C+ E +WLQ+PF CGH++ CW G+ WA+++AK NL+ +Y LVG
Sbjct: 151 GDKKTFDECVAEGGSDCAPEKLWLQIPFFCGHSSECWNVGSRWAMDQAKYNLINEYFLVG 210
Query: 207 VTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEETVQQIKKSKIWEL 266
VTEEL DF+ LLEAALP FFRG T+ + T KSHLR+T K P+++T+ ++++S IW++
Sbjct: 211 VTEELEDFIMLLEAALPRFFRGATELYRTGKKSHLRKTTEKKLPTKQTIAKLQQSDIWKM 270
Query: 267 ENELYEYALEQFHFVKKHNLVYNKVLGYEADKGKQFMYEKIYPK 310
ENE YE+ALEQF F++ H + + G + F YEKIYPK
Sbjct: 271 ENEFYEFALEQFQFIRAHAV--REKDGDLYILAQNFFYEKIYPK 312
>gi|444512971|gb|ELV10229.1| Heparan sulfate 2-O-sulfotransferase 1 [Tupaia chinensis]
Length = 298
Score = 357 bits (917), Expect = 3e-96, Method: Compositional matrix adjust.
Identities = 156/284 (54%), Positives = 215/284 (75%), Gaps = 2/284 (0%)
Query: 27 DTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWR 86
D VIIYNRVPKT STSF N+AYD+C K +++VLH+N T NN V+SL DQ RFV N+T W+
Sbjct: 15 DMVIIYNRVPKTASTSFTNIAYDLCAKNKYHVLHINTTKNNPVMSLQDQVRFVKNITSWK 74
Query: 87 DRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKH 146
+ +P YHGH ++DF +FG K++P++IN++R P++RLVSYYYFLR+GD+YRP L R+K
Sbjct: 75 EMKPGFYHGHISYLDFAKFGVKKKPIYINVIRDPIERLVSYYYFLRFGDDYRPGLRRRKQ 134
Query: 147 GDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVG 206
GDK TFDEC+ ++C+ E +WLQ+PF CGH++ CW G+ WA+++AK NL+ +Y LVG
Sbjct: 135 GDKKTFDECVAEGGSDCAPEKLWLQIPFFCGHSSECWNVGSRWAMDQAKYNLINEYFLVG 194
Query: 207 VTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEETVQQIKKSKIWEL 266
VTEEL DF+ LLEAALP FFRG T+ + T KSHLR+T K P+++T+ ++++S IW++
Sbjct: 195 VTEELEDFIMLLEAALPRFFRGATELYRTGKKSHLRKTTEKKLPTKQTIAKLQQSDIWKM 254
Query: 267 ENELYEYALEQFHFVKKHNLVYNKVLGYEADKGKQFMYEKIYPK 310
ENE YE+ALEQF F++ H + + G + F YEKIYPK
Sbjct: 255 ENEFYEFALEQFQFIRAHAV--REKDGDLYILAQNFFYEKIYPK 296
>gi|281338513|gb|EFB14097.1| hypothetical protein PANDA_006063 [Ailuropoda melanoleuca]
Length = 315
Score = 357 bits (917), Expect = 3e-96, Method: Compositional matrix adjust.
Identities = 156/284 (54%), Positives = 215/284 (75%), Gaps = 2/284 (0%)
Query: 27 DTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWR 86
D VIIYNRVPKT STSF N+AYD+C K +++VLH+N T NN V+SL DQ RFV N+T W+
Sbjct: 32 DMVIIYNRVPKTASTSFTNIAYDLCAKNKYHVLHINTTKNNPVMSLQDQVRFVKNITSWK 91
Query: 87 DRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKH 146
+ +P YHGH ++DF +FG K++P++IN++R P++RLVSYYYFLR+GD+YRP L R+K
Sbjct: 92 EMKPGFYHGHVSYLDFAKFGVKKKPIYINVIRDPIERLVSYYYFLRFGDDYRPGLRRRKQ 151
Query: 147 GDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVG 206
GDK TFDEC+ ++C+ E +WLQ+PF CGH++ CW G+ WA+++AK NL+ +Y LVG
Sbjct: 152 GDKKTFDECVAEGGSDCAPEKLWLQIPFFCGHSSECWNVGSRWAMDQAKYNLINEYFLVG 211
Query: 207 VTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEETVQQIKKSKIWEL 266
VTEEL DF+ LLEAALP FFRG T+ + T KSHLR+T K P+++T+ ++++S IW++
Sbjct: 212 VTEELEDFIMLLEAALPRFFRGATELYRTGKKSHLRKTTEKKLPTKQTIAKLQQSDIWKM 271
Query: 267 ENELYEYALEQFHFVKKHNLVYNKVLGYEADKGKQFMYEKIYPK 310
ENE YE+ALEQF F++ H + + G + F YEKIYPK
Sbjct: 272 ENEFYEFALEQFQFIRAHAV--REKDGDLYILAQNFFYEKIYPK 313
>gi|431897052|gb|ELK06316.1| Heparan sulfate 2-O-sulfotransferase 1 [Pteropus alecto]
Length = 335
Score = 357 bits (917), Expect = 3e-96, Method: Compositional matrix adjust.
Identities = 156/284 (54%), Positives = 215/284 (75%), Gaps = 2/284 (0%)
Query: 27 DTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWR 86
D VIIYNRVPKT STSF N+AYD+C K +++VLH+N T NN V+SL DQ RFV N+T W+
Sbjct: 52 DMVIIYNRVPKTASTSFTNIAYDLCAKNKYHVLHINTTKNNPVMSLQDQVRFVKNITSWK 111
Query: 87 DRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKH 146
+ +P YHGH ++DF +FG K++P++IN++R P++RLVSYYYFLR+GD+YRP L R+K
Sbjct: 112 EMKPGFYHGHVSYLDFAKFGVKKKPIYINVIRDPIERLVSYYYFLRFGDDYRPGLRRRKQ 171
Query: 147 GDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVG 206
GDK TFDEC+ ++C+ E +WLQ+PF CGH++ CW G+ WA+++AK NL+ +Y LVG
Sbjct: 172 GDKKTFDECVAEGGSDCAPEKLWLQIPFFCGHSSECWNVGSRWAMDQAKYNLINEYFLVG 231
Query: 207 VTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEETVQQIKKSKIWEL 266
VTEEL DF+ LLEAALP FFRG T+ + T KSHLR+T K P+++T+ ++++S IW++
Sbjct: 232 VTEELEDFIMLLEAALPRFFRGATELYRTGKKSHLRKTTEKKLPTKQTIAKLQQSDIWKM 291
Query: 267 ENELYEYALEQFHFVKKHNLVYNKVLGYEADKGKQFMYEKIYPK 310
ENE YE+ALEQF F++ H + + G + F YEKIYPK
Sbjct: 292 ENEFYEFALEQFQFIRAHAV--REKDGDLYILAQNFFYEKIYPK 333
>gi|426215914|ref|XP_004002214.1| PREDICTED: heparan sulfate 2-O-sulfotransferase 1 [Ovis aries]
Length = 356
Score = 357 bits (915), Expect = 4e-96, Method: Compositional matrix adjust.
Identities = 155/284 (54%), Positives = 214/284 (75%), Gaps = 2/284 (0%)
Query: 27 DTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWR 86
D VIIYNRVPKT STSF N+AYD+C K +++VLH+N T NN V+SL DQ RFV N+T W+
Sbjct: 73 DMVIIYNRVPKTASTSFTNIAYDLCAKNKYHVLHINTTKNNPVMSLQDQVRFVKNITSWK 132
Query: 87 DRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKH 146
+ +P YHGH ++DF +FG K++P++IN++R P++RLVSYYYFLR+GD+YRP L R+K
Sbjct: 133 EMKPGFYHGHISYLDFAKFGVKKKPIYINVIRDPIERLVSYYYFLRFGDDYRPGLRRRKQ 192
Query: 147 GDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVG 206
GDK TFDEC+ ++C+ E +WLQ+PF CGH++ CW G+ WA+++AK NL+ +Y LVG
Sbjct: 193 GDKKTFDECVAEGGSDCAPEKLWLQIPFFCGHSSECWNVGSRWAMDQAKYNLINEYFLVG 252
Query: 207 VTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEETVQQIKKSKIWEL 266
VTEEL DF+ LLEAALP FFRG + + T KSHLR+T K P+++T+ ++++S IW++
Sbjct: 253 VTEELEDFIMLLEAALPRFFRGAAELYRTGKKSHLRKTTEKKLPTKQTIAKLQQSDIWKM 312
Query: 267 ENELYEYALEQFHFVKKHNLVYNKVLGYEADKGKQFMYEKIYPK 310
ENE YE+ALEQF F++ H + + G + F YEKIYPK
Sbjct: 313 ENEFYEFALEQFQFIRAHAV--REKDGDLYILAQNFFYEKIYPK 354
>gi|335287399|ref|XP_003355346.1| PREDICTED: heparan sulfate 2-O-sulfotransferase 1 isoform 2 [Sus
scrofa]
Length = 370
Score = 355 bits (911), Expect = 2e-95, Method: Compositional matrix adjust.
Identities = 157/284 (55%), Positives = 215/284 (75%), Gaps = 2/284 (0%)
Query: 27 DTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWR 86
D VIIYNRVPKT STSF N+AYD+C K +++VLH+N T NN V+SL DQ RFV N+T W+
Sbjct: 87 DMVIIYNRVPKTASTSFTNIAYDLCAKNKYHVLHINTTKNNPVMSLQDQVRFVKNITSWK 146
Query: 87 DRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKH 146
+ +P YHGH ++DF +FG K++P++IN++R P++RLVSYYYFLR+GD+YRP L R+K
Sbjct: 147 EMKPGFYHGHVSYLDFAKFGVKKKPIYINVVRDPIERLVSYYYFLRFGDDYRPGLRRRKQ 206
Query: 147 GDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVG 206
GDK TFDEC+ ++C+ E +WLQ+PF CGH++ CW G+ WA+++AK NL+ +Y LVG
Sbjct: 207 GDKKTFDECVAEGGSDCAPEKLWLQIPFFCGHSSECWNVGSRWAMDQAKYNLINEYFLVG 266
Query: 207 VTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEETVQQIKKSKIWEL 266
VTEEL DFV LLEAALP FFRG T+ + T KSHLR+T K P+++T+ ++++S IW++
Sbjct: 267 VTEELEDFVMLLEAALPRFFRGATELYRTGKKSHLRKTTEKKLPTKQTIAKLQQSHIWKM 326
Query: 267 ENELYEYALEQFHFVKKHNLVYNKVLGYEADKGKQFMYEKIYPK 310
ENE YE+ALEQF F++ H + + G + F YEKIYPK
Sbjct: 327 ENEFYEFALEQFQFIRAHAV--REKDGDLFILAQNFFYEKIYPK 368
>gi|197099460|ref|NP_001126497.1| heparan sulfate 2-O-sulfotransferase 1 [Pongo abelii]
gi|55731701|emb|CAH92556.1| hypothetical protein [Pongo abelii]
Length = 356
Score = 354 bits (909), Expect = 3e-95, Method: Compositional matrix adjust.
Identities = 155/284 (54%), Positives = 214/284 (75%), Gaps = 2/284 (0%)
Query: 27 DTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWR 86
D VIIYNRVPKT STSF N+AYD+C K +++VLH+N T NN V+SL DQ RFV N+T W+
Sbjct: 73 DMVIIYNRVPKTASTSFTNIAYDLCAKNKYHVLHINTTKNNPVMSLQDQMRFVKNITSWK 132
Query: 87 DRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKH 146
+ +P YHGH ++DF +FG K++P++IN++R P++RLVSYYYFLR+GD+YRP L R+K
Sbjct: 133 EMKPGFYHGHVSYLDFAKFGVKKKPIYINVIRDPIERLVSYYYFLRFGDDYRPGLRRRKQ 192
Query: 147 GDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVG 206
GDK TFDEC+ ++C+ E +WLQ+PF CGH++ W G+ WA+++AK NL+ +Y LVG
Sbjct: 193 GDKKTFDECVAEGGSDCAPEKLWLQIPFFCGHSSERWNVGSRWAMDQAKYNLINEYFLVG 252
Query: 207 VTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEETVQQIKKSKIWEL 266
VTEEL DF+ LLEAALP FFRG T+ + T KSHLR+T K P+++T+ ++++S IW++
Sbjct: 253 VTEELEDFIMLLEAALPRFFRGATELYRTGKKSHLRKTTEKKLPTKQTIAKLQQSDIWKM 312
Query: 267 ENELYEYALEQFHFVKKHNLVYNKVLGYEADKGKQFMYEKIYPK 310
ENE YE+ALEQF F++ H + + G + F YEKIYPK
Sbjct: 313 ENEFYEFALEQFQFIRAHAV--REKDGDLYILAQNFFYEKIYPK 354
>gi|351705455|gb|EHB08374.1| Heparan sulfate 2-O-sulfotransferase 1 [Heterocephalus glaber]
Length = 283
Score = 354 bits (908), Expect = 3e-95, Method: Compositional matrix adjust.
Identities = 155/282 (54%), Positives = 214/282 (75%), Gaps = 2/282 (0%)
Query: 29 VIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWRDR 88
VIIYNRVPKT STSF N+AYD+C K +++VLH+N T NN V+SL DQ RFV N+T W++
Sbjct: 2 VIIYNRVPKTASTSFTNIAYDLCAKNKYHVLHINTTKNNPVMSLQDQVRFVKNITSWKEM 61
Query: 89 RPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKHGD 148
+P YHGH ++DF +FG K++P++IN++R P++RLVSYYYFLR+GD+YRP L R+K GD
Sbjct: 62 KPGFYHGHISYLDFAKFGVKKKPIYINVIRDPIERLVSYYYFLRFGDDYRPGLRRRKQGD 121
Query: 149 KTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVGVT 208
K TFDEC+ ++C+ E +WLQ+PF CGH++ CW G+ WA+++AK NL+ +Y LVGVT
Sbjct: 122 KKTFDECVAEGGSDCAPEKLWLQIPFFCGHSSECWNVGSRWAMDQAKYNLINEYFLVGVT 181
Query: 209 EELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEETVQQIKKSKIWELEN 268
EEL DF+ LLEAALP FFRG T+ + T KSHLR+T K P+++T+ ++++S IW++EN
Sbjct: 182 EELEDFIMLLEAALPRFFRGATELYRTGKKSHLRKTTEKKLPTKQTIAKLQQSDIWKMEN 241
Query: 269 ELYEYALEQFHFVKKHNLVYNKVLGYEADKGKQFMYEKIYPK 310
E YE+ALEQF F++ H + + G + F YEKIYPK
Sbjct: 242 EFYEFALEQFQFIRAHAV--REKDGDLYILTQNFFYEKIYPK 281
>gi|322789060|gb|EFZ14513.1| hypothetical protein SINV_01318 [Solenopsis invicta]
Length = 372
Score = 352 bits (903), Expect = 1e-94, Method: Compositional matrix adjust.
Identities = 150/257 (58%), Positives = 200/257 (77%)
Query: 27 DTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWR 86
D V+IYNRVPKT STSF+ + YD+C++ +++VLH+NVT N H L+ A+Q +F NN++ W
Sbjct: 67 DVVVIYNRVPKTASTSFMGLVYDLCKQNKYHVLHINVTNNMHTLTFANQVQFANNISNWN 126
Query: 87 DRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKH 146
+PA YHGH F++F +FG+K PL+IN+LRKPLDR +SYYYFLRYGDN+RPH++RKKH
Sbjct: 127 SIKPAFYHGHMAFLNFGKFGTKRMPLYINLLRKPLDRFISYYYFLRYGDNFRPHVIRKKH 186
Query: 147 GDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVG 206
GD TFDECI + + +C NMWLQ+PFLCGH ACW GN WALE+AK NL Y LVG
Sbjct: 187 GDTKTFDECINIGQPDCDPNNMWLQIPFLCGHDPACWEIGNSWALEEAKRNLQRYYFLVG 246
Query: 207 VTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEETVQQIKKSKIWEL 266
VTEEL +FV +LE LP FF+G + FL +NKSHLR+T +K++P ETV++I++S +W++
Sbjct: 247 VTEELNEFVEVLENVLPRFFKGAYNFFLHNNKSHLRQTTQKLNPLPETVEKIQQSPVWKM 306
Query: 267 ENELYEYALEQFHFVKK 283
EN+ Y +ALE FH VK+
Sbjct: 307 ENDFYNFALEHFHAVKR 323
>gi|196000168|ref|XP_002109952.1| hypothetical protein TRIADDRAFT_20786 [Trichoplax adhaerens]
gi|190588076|gb|EDV28118.1| hypothetical protein TRIADDRAFT_20786, partial [Trichoplax
adhaerens]
Length = 294
Score = 352 bits (902), Expect = 2e-94, Method: Compositional matrix adjust.
Identities = 158/286 (55%), Positives = 214/286 (74%), Gaps = 4/286 (1%)
Query: 29 VIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWRDR 88
VIIYNRVPKTGSTS + + YD+C+ +F VLH+NV+ N+HV+ +ADQ RF+ N+T W
Sbjct: 1 VIIYNRVPKTGSTSVMALFYDLCKINKFRVLHLNVSKNSHVMHVADQGRFIRNITSWSKM 60
Query: 89 RPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKHGD 148
+PA++HGH ++DF+++G+ +P++IN+LRKPLDRLVS+YYFLRYGD++RPHL R+K GD
Sbjct: 61 QPAIFHGHLAYLDFEKYGAFNRPIYINVLRKPLDRLVSFYYFLRYGDDFRPHLHRRKMGD 120
Query: 149 KTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVGVT 208
K TFDEC+ N +C E +WLQ+PF CGH CW+PGN WAL++AK NL KYLLVGVT
Sbjct: 121 KITFDECVAKNLPDCRPEKLWLQIPFFCGHHTQCWIPGNEWALQQAKHNLFHKYLLVGVT 180
Query: 209 EELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEETVQQIKKSKIWELEN 268
E+LT F+++LEA LP F+G T+ FLT +KSH R+T K+ PSE T+ ++KSKIW++EN
Sbjct: 181 EDLTGFINVLEATLPKLFKGATNRFLTFSKSHARKTKYKLPPSEATINAMQKSKIWKMEN 240
Query: 269 ELYEYALEQFHFVKKHNLVYNKVLGYEADKGK----QFMYEKIYPK 310
E YE+AL F +K+ LV K L + K +F Y+KI PK
Sbjct: 241 EFYEFALAIFQRIKESTLVKGKKLNDKTIGRKVVQQKFFYDKIRPK 286
>gi|328717864|ref|XP_003246326.1| PREDICTED: heparin sulfate O-sulfotransferase-like [Acyrthosiphon
pisum]
Length = 385
Score = 333 bits (855), Expect = 4e-89, Method: Compositional matrix adjust.
Identities = 162/333 (48%), Positives = 222/333 (66%), Gaps = 32/333 (9%)
Query: 8 QIHISSAKSPSPETDSLSWDT------------------VIIYNRVPKTGSTSFVNMAYD 49
QI ++ AK S +++ +D+ ++IYNRVPKTGSTSFVN+AYD
Sbjct: 43 QIQVAVAKLASGDSNYQRFDSSASQYHQNSVMILEEQKPLVIYNRVPKTGSTSFVNVAYD 102
Query: 50 MCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWRDRRPALYHGHFGFIDFQQFGSKE 109
+ F VLHVNVTGN+H+LS+ DQ+RF++N T+W RPA YHGHF +IDF+++G K
Sbjct: 103 LHSYNAFRVLHVNVTGNSHLLSIYDQFRFIDNTTRWM--RPAFYHGHFAYIDFERYGYK- 159
Query: 110 QPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKHGDK-TTFDECIRLNRTECSLENM 168
+P ++ +LRKPLDRLVSYYYFLRYGD+YRPHLVRKKH D TTFDEC+ ++C +
Sbjct: 160 KPYYVQLLRKPLDRLVSYYYFLRYGDDYRPHLVRKKHMDSSTTFDECVERGGSDCQANLL 219
Query: 169 WLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVGVTEELTDFVSLLEAALP-SFFR 227
WLQ+PFLCG +A CW+PG+ WAL +AK N++ KY LVGVTE++ +++ +LE +P FR
Sbjct: 220 WLQIPFLCGQSADCWIPGSEWALRQAKRNVLEKYTLVGVTEQMGEYLQMLELVIPGGMFR 279
Query: 228 GGTDHFLTSNKSHLRRTNRKIDPSEETVQQIKKSKIWELENELYEYALEQFHFV------ 281
++HF S+KSHLR+T +K S +T+Q+ +S +W++ENELY + +F F
Sbjct: 280 NASEHFKHSSKSHLRKTAKKYPVSRKTIQKFHESTVWQMENELYAFVAREFAFAYAKQFP 339
Query: 282 ---KKHNLVYNKVLGYEADKGKQFMYEKIYPKP 311
H+ K L A F YEKIYPKP
Sbjct: 340 NISSGHSGTMKKHLKISAIPPPSFRYEKIYPKP 372
>gi|391346183|ref|XP_003747358.1| PREDICTED: heparan sulfate 2-O-sulfotransferase 1-like [Metaseiulus
occidentalis]
Length = 560
Score = 318 bits (816), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 149/285 (52%), Positives = 200/285 (70%), Gaps = 11/285 (3%)
Query: 27 DTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWR 86
+ VIIYNRVPKT STSF +AYD+C K +F VLH+N + N HV+S+ADQ R+ N+++W
Sbjct: 283 NQVIIYNRVPKTASTSFTGVAYDLCVKNKFFVLHINTSRNMHVMSIADQMRYSVNISQWE 342
Query: 87 DRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKH 146
RP++YHGH F+DF +FG P++INI+R+PL+RLVSYYYFLR GD++RPHL R++
Sbjct: 343 SMRPSMYHGHVAFLDFARFGMPP-PIYINIVREPLERLVSYYYFLRNGDDFRPHLQRRRS 401
Query: 147 GDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVG 206
GDK TFDEC++ +CS + MWLQ+PF CGH CW PG+ WALE+AK NL+ KYLLVG
Sbjct: 402 GDKRTFDECVKQMGQDCSEDKMWLQIPFFCGHTPNCWQPGSRWALEQAKMNLINKYLLVG 461
Query: 207 VTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEETVQQIKKSKIWEL 266
TE+L DFV +LE LP FFRG + F + KSHLR+T+ K + ETV IK+ + L
Sbjct: 462 TTEQLQDFVDILEVVLPRFFRGASQLFHSGKKSHLRKTSNKKPVNAETVALIKQWPTYLL 521
Query: 267 ENELYEYALEQFHFVKKHNLVYNKVLGYEADKG-KQFMYEKIYPK 310
E+E YE+A F+ +K +++G +QF +EKI PK
Sbjct: 522 ESEFYEFAKRHFNSIKDK---------LNSEQGSQQFFFEKIRPK 557
>gi|11013|emb|CAA42779.1| Sd protein [Drosophila melanogaster]
Length = 363
Score = 311 bits (797), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 141/254 (55%), Positives = 190/254 (74%), Gaps = 2/254 (0%)
Query: 11 ISSAKSPSPETDSLSWD--TVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNH 68
+S + S TD ++ V++YNRVPKTGSTSFVN+AYD+C+ +F+VLH+NVT N H
Sbjct: 54 LSPDQHASSTTDDFDFEEHLVVLYNRVPKTGSTSFVNIAYDLCKPNKFHVLHINVTANMH 113
Query: 69 VLSLADQYRFVNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYY 128
VLSL +Q +FV NV++W + +PALYHGH F+DF +F +P++IN++RKPLDRLVSYY
Sbjct: 114 VLSLPNQIQFVRNVSRWHEMKPALYHGHMAFLDFSKFQIAHKPIYINLVRKPLDRLVSYY 173
Query: 129 YFLRYGDNYRPHLVRKKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNP 188
YFLR+G++YRP+LVRKK G+K TFDEC+ + +C +NMWLQ+PF CGHAA CW PG+
Sbjct: 174 YFLRFGEHYRPNLVRKKAGNKITFDECVVQKQPDCDPKNMWLQIPFFCGHAAECWEPGSS 233
Query: 189 WALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKI 248
WAL++AK NLV +Y LVGVTE++ +FV LLE +LP F G +H+ SNKSHLR T+ K+
Sbjct: 234 WALDQAKRNLVNEYFLVGVTEQMYEFVDLLERSLPRIFHGFREHYHNSNKSHLRVTSSKL 293
Query: 249 DPSEETVQQIKKSK 262
P + K+ K
Sbjct: 294 PPRNRQLNPFKRQK 307
>gi|198432961|ref|XP_002128889.1| PREDICTED: similar to heparan sulfate 2-sulfotransferase [Ciona
intestinalis]
Length = 359
Score = 300 bits (769), Expect = 5e-79, Method: Compositional matrix adjust.
Identities = 139/280 (49%), Positives = 194/280 (69%), Gaps = 2/280 (0%)
Query: 29 VIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWRDR 88
+I+YNRVPKTGSTSF N+ YD+ + + LH+N+T N+ + + DQY N+T W +R
Sbjct: 72 IILYNRVPKTGSTSFSNLVYDLTKTNKMYCLHLNITRNSLKIPIGDQYNLALNMTTWVER 131
Query: 89 RPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKHGD 148
RPALYHGHFG+ F QFG + P++INILR+PLDRL+S+YYF+RYGD++R L R K GD
Sbjct: 132 RPALYHGHFGYFSFAQFGFPD-PMYINILREPLDRLLSFYYFIRYGDDFRKGLKRTKQGD 190
Query: 149 KTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVGVT 208
KTTFDEC+ N +C +WLQ+P +CG +A CW G+ WAL++AKENLV +Y LVGVT
Sbjct: 191 KTTFDECVAQNGHDCQPRALWLQIPMMCGQSAECWKVGSQWALQQAKENLVNRYALVGVT 250
Query: 209 EELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEETVQQIKKSKIWELEN 268
E+L DFV +LEA P F G + F T +KSH+R T K PSE T+ +K +K + +E
Sbjct: 251 EQLEDFVVVLEAIQPRIFNGIINKFRTGSKSHIRNTIHKEPPSEATLAAMKNTKTYRMER 310
Query: 269 ELYEYALEQFHFVKKHNLVYNKVLGYEADKGKQFMYEKIY 308
E Y++A+ QF +K+++ ++ + G Q+ YEKI+
Sbjct: 311 EFYDFAVRQFEHIKRYSTFKDETGTLQPLYG-QYHYEKIF 349
>gi|195115178|ref|XP_002002141.1| GI17219 [Drosophila mojavensis]
gi|193912716|gb|EDW11583.1| GI17219 [Drosophila mojavensis]
Length = 291
Score = 300 bits (767), Expect = 7e-79, Method: Compositional matrix adjust.
Identities = 135/237 (56%), Positives = 177/237 (74%), Gaps = 2/237 (0%)
Query: 1 INTQKSHQIHISSAKSPSPETDSLSWD--TVIIYNRVPKTGSTSFVNMAYDMCRKKRFNV 58
+N+ K ++ P D ++ V++YNRVPKTGSTSFVN+AYD+C++ R++V
Sbjct: 48 VNSGKQSVADQAALSRSIPTFDGFDYEEQLVVLYNRVPKTGSTSFVNIAYDLCKQNRYHV 107
Query: 59 LHVNVTGNNHVLSLADQYRFVNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILR 118
LH+NVT N HVLSL +Q FV NVTKW +PALYHGH F+DF +F +P++IN++R
Sbjct: 108 LHINVTANMHVLSLPNQISFVRNVTKWHVMKPALYHGHMAFLDFSKFQIAHKPIYINLVR 167
Query: 119 KPLDRLVSYYYFLRYGDNYRPHLVRKKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGH 178
KPLDRLVSYYYFLRYGDNYRP+LVRKK G+K TFDEC+ + +C +NMWLQ+PF CGH
Sbjct: 168 KPLDRLVSYYYFLRYGDNYRPNLVRKKAGNKITFDECVVQKQPDCDPKNMWLQIPFFCGH 227
Query: 179 AAACWVPGNPWALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLT 235
AA CW PG+ WAL++AK NLV +Y LVGVTE++ +FV LLE +LP F+G +H+ T
Sbjct: 228 AAECWEPGSDWALKQAKHNLVNEYFLVGVTEQMYEFVDLLERSLPRIFQGFREHYQT 284
>gi|195164662|ref|XP_002023165.1| GL21106 [Drosophila persimilis]
gi|194105250|gb|EDW27293.1| GL21106 [Drosophila persimilis]
Length = 314
Score = 293 bits (751), Expect = 6e-77, Method: Compositional matrix adjust.
Identities = 143/300 (47%), Positives = 195/300 (65%), Gaps = 42/300 (14%)
Query: 13 SAKSPSPETDSLSWD--TVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVL 70
SA + DS ++ V++YNRVPKTGSTSFVN+AYD+C+ +++VLH+NVT N HVL
Sbjct: 55 SAPVATAAVDSFDYEEQLVVLYNRVPKTGSTSFVNIAYDLCKLNKYHVLHINVTANMHVL 114
Query: 71 SLADQYRFVNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYF 130
SL +Q FV NV+KW + +PALYHGH F+DF +F +P++IN+
Sbjct: 115 SLPNQIAFVRNVSKWHEMKPALYHGHMAFLDFSKFQIAHKPIYINL-------------- 160
Query: 131 LRYGDNYRPHLVRKKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWA 190
TFDEC+ + +C +NMWLQ+PF CGHAA CW PG+ WA
Sbjct: 161 --------------------TFDECVVQKQPDCDPKNMWLQIPFFCGHAAECWEPGSDWA 200
Query: 191 LEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDP 250
L++AK NLV +Y LVGVTE++ +FV LLE +LP F G +H+ TSNKSHLR T+ K+ P
Sbjct: 201 LDQAKRNLVNEYFLVGVTEQMYEFVDLLERSLPRIFHGFREHYQTSNKSHLRVTSSKLPP 260
Query: 251 SEETVQQIKKSKIWELENELYEYALEQFHFVKKHNLVYNKVLGYEADKGKQFMYEKIYPK 310
SE T++ I+K+KIW++EN+LYE+AL QF F KK K++ + ++FMYEKI PK
Sbjct: 261 SESTIKSIQKTKIWQMENDLYEFALAQFEFNKK------KLMQPDNKHLQKFMYEKIRPK 314
>gi|449268063|gb|EMC78933.1| Heparan sulfate 2-O-sulfotransferase 1, partial [Columba livia]
Length = 236
Score = 293 bits (750), Expect = 7e-77, Method: Compositional matrix adjust.
Identities = 129/236 (54%), Positives = 177/236 (75%), Gaps = 2/236 (0%)
Query: 75 QYRFVNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYG 134
Q RFV NVT W++ +P YHGH ++DF +FG K++P++IN++R P++RLVSYYYFLR+G
Sbjct: 1 QVRFVKNVTSWKEMKPGFYHGHISYLDFAKFGVKKKPIYINVIRDPIERLVSYYYFLRFG 60
Query: 135 DNYRPHLVRKKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKA 194
D+YRP L R+K GDK TFDEC+ ++C+ E +WLQ+PF CGH++ CW G+ WALE+A
Sbjct: 61 DDYRPGLRRRKQGDKKTFDECVAAGGSDCAPEKLWLQIPFFCGHSSECWNVGSRWALEQA 120
Query: 195 KENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEET 254
K NL+ +Y LVGVTEEL DF+ LLEAALP FFRG T+ + T KSHLR+T K P++ET
Sbjct: 121 KYNLINEYFLVGVTEELEDFIMLLEAALPRFFRGATELYRTGKKSHLRKTTEKKLPTKET 180
Query: 255 VQQIKKSKIWELENELYEYALEQFHFVKKHNLVYNKVLGYEADKGKQFMYEKIYPK 310
+ ++++S+IW++ENE YE+ALEQF FV+ H + + G + F YEKIYPK
Sbjct: 181 IAKLQQSEIWKMENEFYEFALEQFQFVRAHAV--REKDGELYILAQNFFYEKIYPK 234
>gi|7023900|dbj|BAA92125.1| unnamed protein product [Homo sapiens]
gi|119593576|gb|EAW73170.1| heparan sulfate 2-O-sulfotransferase 1, isoform CRA_a [Homo
sapiens]
Length = 303
Score = 292 bits (748), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 123/210 (58%), Positives = 166/210 (79%)
Query: 27 DTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWR 86
D VIIYNRVPKT STSF N+AYD+C K +++VLH+N T NN V+SL DQ RFV N+T W+
Sbjct: 47 DMVIIYNRVPKTASTSFTNIAYDLCAKNKYHVLHINTTKNNPVMSLQDQVRFVKNITSWK 106
Query: 87 DRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKH 146
+ +P YHGH ++DF +FG K++P++IN++R P++RLVSYYYFLR+GD+YRP L R+K
Sbjct: 107 EMKPGFYHGHVSYLDFAKFGVKKKPIYINVIRDPIERLVSYYYFLRFGDDYRPGLRRRKQ 166
Query: 147 GDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVG 206
GDK TFDEC+ ++C+ E +WLQ+PF CGH++ CW G+ WA+++AK NL+ +Y LVG
Sbjct: 167 GDKKTFDECVAEGGSDCAPEKLWLQIPFFCGHSSECWNVGSRWAMDQAKYNLINEYFLVG 226
Query: 207 VTEELTDFVSLLEAALPSFFRGGTDHFLTS 236
VTEEL DF+ LLEAALP FFRG T+ + T+
Sbjct: 227 VTEELEDFIMLLEAALPRFFRGATELYRTA 256
>gi|341903528|gb|EGT59463.1| CBN-HST-2 protein [Caenorhabditis brenneri]
Length = 325
Score = 290 bits (742), Expect = 5e-76, Method: Compositional matrix adjust.
Identities = 138/298 (46%), Positives = 200/298 (67%), Gaps = 19/298 (6%)
Query: 24 LSW---DTVIIYNRVPKTGSTSFVN-MAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFV 79
L W ++++IYNR+PKTGST+F N +AYD+ ++ FNVLHVN+T N V+SL DQY F+
Sbjct: 34 LKWPAGNSIVIYNRIPKTGSTTFTNAIAYDLYKENGFNVLHVNMTKNRQVMSLPDQYTFI 93
Query: 80 NNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRP 139
NNVT W DR PA YHGH FIDFQ+FG P++INI+R+PL+RL+S+YYFLRYGDNYR
Sbjct: 94 NNVTTWTDRLPAFYHGHVAFIDFQRFGV-ANPIYINIIREPLERLLSHYYFLRYGDNYRV 152
Query: 140 HLVRKKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLV 199
L R + G+ TFDEC + +C ++ MW+Q+P+ CGH C GNP AL AK+N +
Sbjct: 153 GLKRSRAGNNETFDECYTRSGKDCDMKQMWIQIPYFCGHYHFCGEVGNPEALRMAKQNAM 212
Query: 200 TKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHF--LTSNKSHLRRTNRKIDPSEETVQQ 257
KYLLVG T + D ++LLE +P+FF+G HF L +N++HLR T +KI P+++T+
Sbjct: 213 EKYLLVGTTSRMRDMIALLEVTVPNFFKGALQHFDSLDANRAHLRYTKKKIPPNDQTLSM 272
Query: 258 IKKSKIWELENELYEYALEQFHFVKKHNLVYNKVLGYEADKGK------QFMYEKIYP 309
I++ +++++E E Y++ + F + ++ K G + Q+ +EKI P
Sbjct: 273 IRRDEVYKMEREFYDFIRDLF------DAIFKKATGGSSKAEDLVKMPLQYHFEKIKP 324
>gi|345306386|ref|XP_001506529.2| PREDICTED: heparan sulfate 2-O-sulfotransferase 1-like
[Ornithorhynchus anatinus]
Length = 318
Score = 288 bits (736), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 125/236 (52%), Positives = 176/236 (74%), Gaps = 2/236 (0%)
Query: 75 QYRFVNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYG 134
Q RFV NVT W++ +P YHGH ++DF +FG K++P++IN++R P++RLVSYYYFLR+G
Sbjct: 83 QVRFVKNVTSWKEMKPGFYHGHISYLDFAKFGVKKKPIYINVIRDPIERLVSYYYFLRFG 142
Query: 135 DNYRPHLVRKKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKA 194
D+YRP L R+K GDK TFDEC+ ++C+ E +WLQ+PF CGH++ CW G+ WA+++A
Sbjct: 143 DDYRPGLRRRKQGDKKTFDECVAEGGSDCAPEKLWLQIPFFCGHSSECWNVGSRWAMDQA 202
Query: 195 KENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEET 254
K NL+ +Y LVGVTEEL DF+ LLEAALP FFRG T+ + T KSHLR+T K P+++T
Sbjct: 203 KYNLINEYFLVGVTEELEDFIMLLEAALPRFFRGATELYRTGKKSHLRKTTEKKLPTKQT 262
Query: 255 VQQIKKSKIWELENELYEYALEQFHFVKKHNLVYNKVLGYEADKGKQFMYEKIYPK 310
+ ++++S IW++ENE YE+ALEQF F++ H + + G + F YEKIYPK
Sbjct: 263 IAKLQQSDIWKMENEFYEFALEQFQFIRAHAV--REKDGELYILTQNFFYEKIYPK 316
>gi|308495157|ref|XP_003109767.1| CRE-HST-2 protein [Caenorhabditis remanei]
gi|308245957|gb|EFO89909.1| CRE-HST-2 protein [Caenorhabditis remanei]
Length = 379
Score = 288 bits (736), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 136/288 (47%), Positives = 194/288 (67%), Gaps = 4/288 (1%)
Query: 25 SWDTVIIYNRVPKTGSTSFVN-MAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVT 83
S + ++ YNR+PKTGST+F N +AYD+ ++ FNVLHVN+T N ++SL DQY FVNNVT
Sbjct: 92 SSNKIVFYNRIPKTGSTTFTNAIAYDLYKENGFNVLHVNMTKNRQIMSLPDQYTFVNNVT 151
Query: 84 KWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVR 143
W +R PA YHGH +IDFQ+FG P++INI+R+PL+RL+S+YYFLRYGDNYR L R
Sbjct: 152 TWTERLPAFYHGHVAYIDFQRFGL-ANPIYINIIREPLERLLSHYYFLRYGDNYRIGLKR 210
Query: 144 KKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYL 203
+ G+ TFDEC +C ++ MW+Q+PF CGH C GNP AL+ AK+N + KYL
Sbjct: 211 SRAGNNETFDECYTRGGKDCDMKQMWVQIPFFCGHYHFCSEVGNPEALKMAKQNAMEKYL 270
Query: 204 LVGVTEELTDFVSLLEAALPSFFRGGTDHF--LTSNKSHLRRTNRKIDPSEETVQQIKKS 261
LVG T + D ++LLE +P FF G +HF L +N++HLR T +K P+++T+ I++
Sbjct: 271 LVGTTARMRDMIALLEVTVPDFFNGALNHFDHLDANRAHLRYTKKKFPPNDQTLSMIRRD 330
Query: 262 KIWELENELYEYALEQFHFVKKHNLVYNKVLGYEADKGKQFMYEKIYP 309
+++++E E Y++ + F V K + A+ Q+ YEKI P
Sbjct: 331 EVYKMEREFYDFVSDLFDAVFKKATNGSSKAEDLANMPTQYHYEKIKP 378
>gi|354505510|ref|XP_003514811.1| PREDICTED: heparan sulfate 2-O-sulfotransferase 1-like, partial
[Cricetulus griseus]
Length = 235
Score = 286 bits (732), Expect = 7e-75, Method: Compositional matrix adjust.
Identities = 124/234 (52%), Positives = 174/234 (74%), Gaps = 2/234 (0%)
Query: 77 RFVNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDN 136
RFV N+T W + +P YHGH ++DF +FG K++P++IN++R P++RLVSYYYFLR+GD+
Sbjct: 2 RFVKNITTWNEMKPGFYHGHISYLDFAKFGVKKKPIYINVIRDPIERLVSYYYFLRFGDD 61
Query: 137 YRPHLVRKKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKE 196
YRP L R+K GDK TFDEC+ ++C+ E +WLQ+PF CGH++ CW G+ WA+++AK
Sbjct: 62 YRPGLRRRKQGDKKTFDECVAEGGSDCAPEKLWLQIPFFCGHSSECWNVGSRWAMDQAKY 121
Query: 197 NLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEETVQ 256
NL+ +Y LVGVTEEL DF+ LLEAALP FFRG TD + T KSHLR+T K P+++T+
Sbjct: 122 NLINEYFLVGVTEELEDFIMLLEAALPRFFRGATDLYRTGKKSHLRKTTEKKLPTKQTIA 181
Query: 257 QIKKSKIWELENELYEYALEQFHFVKKHNLVYNKVLGYEADKGKQFMYEKIYPK 310
++++S IW++ENE YE+ALEQF F++ H + + G + F YEKIYPK
Sbjct: 182 KLQQSDIWKMENEFYEFALEQFQFIRAHAV--REKDGDLYILAQNFFYEKIYPK 233
>gi|60688653|gb|AAH91291.1| Hs2st1 protein, partial [Rattus norvegicus]
Length = 234
Score = 286 bits (731), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 125/234 (53%), Positives = 174/234 (74%), Gaps = 2/234 (0%)
Query: 77 RFVNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDN 136
RFV N+T W + +P YHGH ++DF +FG K++P++IN++R P++RLVSYYYFLR+GD+
Sbjct: 1 RFVKNITSWNEMKPGFYHGHISYLDFAKFGVKKKPIYINVVRDPIERLVSYYYFLRFGDD 60
Query: 137 YRPHLVRKKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKE 196
YRP L R+K GDK TFDEC+ ++C+ E +WLQ+PF CGH++ CW G+ WA+++AK
Sbjct: 61 YRPGLRRRKQGDKKTFDECVAEGGSDCAPEKLWLQIPFFCGHSSECWNVGSRWAMDQAKY 120
Query: 197 NLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEETVQ 256
NLV +Y LVGVTEEL DF+ LLEAALP FFRG TD + T KSHLR+T K P+++T+
Sbjct: 121 NLVNEYFLVGVTEELEDFIMLLEAALPRFFRGATDLYRTGKKSHLRKTTEKKLPTKQTIA 180
Query: 257 QIKKSKIWELENELYEYALEQFHFVKKHNLVYNKVLGYEADKGKQFMYEKIYPK 310
++++S IW++ENE YE+ALEQF F++ H + + G + F YEKIYPK
Sbjct: 181 KLQQSDIWKMENEFYEFALEQFQFMRAHAV--REKDGDLYILAQNFFYEKIYPK 232
>gi|432853693|ref|XP_004067834.1| PREDICTED: heparan sulfate 2-O-sulfotransferase 1-like [Oryzias
latipes]
Length = 302
Score = 285 bits (729), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 125/234 (53%), Positives = 172/234 (73%), Gaps = 2/234 (0%)
Query: 77 RFVNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDN 136
RFV NV+ WR+ +PA YHGH ++DF ++G +P++IN++R P++RLVSYYYFLR+GD+
Sbjct: 69 RFVQNVSAWREMKPAFYHGHVAYLDFSKYGVMRKPMYINVVRDPIERLVSYYYFLRFGDD 128
Query: 137 YRPHLVRKKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKE 196
YRP L R+K GDK TFDEC+ ++C+ E +WLQ+PF CGH A CW G+ WALE+AK
Sbjct: 129 YRPGLRRRKQGDKKTFDECVSSGGSDCAPEKLWLQIPFFCGHHAECWNVGSKWALEQAKY 188
Query: 197 NLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEETVQ 256
NL+ ++LLVGVTEEL DF+ +LEAALP FFRG T+ + T KSHLR+T+ K P++E
Sbjct: 189 NLLNEFLLVGVTEELEDFIMILEAALPHFFRGATELYRTGKKSHLRKTSEKKPPTKEATA 248
Query: 257 QIKKSKIWELENELYEYALEQFHFVKKHNLVYNKVLGYEADKGKQFMYEKIYPK 310
++++S IW++EN+ YE+ALEQF FV+ H + G + F YEKIYPK
Sbjct: 249 KLQQSHIWKMENDFYEFALEQFQFVRAHAVREKN--GELYVLAQSFFYEKIYPK 300
>gi|71984722|ref|NP_509871.2| Protein HST-2 [Caenorhabditis elegans]
gi|67460719|sp|O17645.2|HST2_CAEEL RecName: Full=Heparan sulfate 2-O-sulfotransferase hst-2;
Short=Heparan sulfotransferase 2; AltName: Full=HS2ST1
homolog
gi|32440602|emb|CAB03945.2| Protein HST-2 [Caenorhabditis elegans]
gi|46361292|gb|AAS89253.1| heparan sulfate 2O-sulfotransferase [Caenorhabditis elegans]
Length = 324
Score = 284 bits (726), Expect = 5e-74, Method: Compositional matrix adjust.
Identities = 129/258 (50%), Positives = 183/258 (70%), Gaps = 4/258 (1%)
Query: 29 VIIYNRVPKTGSTSFVN-MAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWRD 87
++IYNR+PKTGST+F N +AYD+ ++ F+VLHVN+T N V+SL DQY FVNN+T W +
Sbjct: 41 IVIYNRIPKTGSTTFTNAIAYDLYKENGFSVLHVNMTKNRQVMSLPDQYTFVNNITTWTE 100
Query: 88 RRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKHG 147
R PA YHGH FIDFQ+FG P++INI+R+PL+RL+S+YYFLRYGDNYR L R + G
Sbjct: 101 RLPAFYHGHVAFIDFQRFGI-ANPIYINIIREPLERLLSHYYFLRYGDNYRIGLKRSRAG 159
Query: 148 DKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVGV 207
+ TFDEC +C ++ MW+Q+P+ CGH C GNP AL AK+N++ KYLLVG
Sbjct: 160 NNETFDECYSRGGKDCDMKQMWIQIPYFCGHYHFCTEVGNPEALRVAKQNVLEKYLLVGT 219
Query: 208 TEELTDFVSLLEAALPSFFRGGTDHF--LTSNKSHLRRTNRKIDPSEETVQQIKKSKIWE 265
T + D ++LLE +P FF+G HF L +N++HLR T +KI P+++T+ I++ ++++
Sbjct: 220 TSRMRDMIALLEVTVPDFFKGALGHFDSLDANRAHLRYTKKKIPPNDQTLSMIRRDEVYK 279
Query: 266 LENELYEYALEQFHFVKK 283
+E E Y++ F V K
Sbjct: 280 MEREFYDFINNLFDAVFK 297
>gi|291221603|ref|XP_002730809.1| PREDICTED: heparan sulfate 2-O-sulfotransferase 1-like
[Saccoglossus kowalevskii]
Length = 272
Score = 283 bits (724), Expect = 8e-74, Method: Compositional matrix adjust.
Identities = 125/242 (51%), Positives = 174/242 (71%), Gaps = 2/242 (0%)
Query: 69 VLSLADQYRFVNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYY 128
+L + + RFV NVT W+ ++PA YHGH F+DF +FG ++PL+INI+R+PL RLVSYY
Sbjct: 33 MLFVEESARFVQNVTSWQVKKPAFYHGHIAFLDFARFGVVQKPLYINIIREPLARLVSYY 92
Query: 129 YFLRYGDNYRPHLVRKKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNP 188
YF+RYGD++RPHL R++ GD TFD+C+ + TEC E +WLQVPF CG + CW PG+
Sbjct: 93 YFVRYGDDFRPHLKRRRSGDSQTFDDCVMKDETECQPEKIWLQVPFFCGQSQECWRPGSQ 152
Query: 189 WALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKI 248
WALE+AK NLV Y LVGVTE+L +F+ +LEA+L + FRG ++ F T KSHLR+T+ K
Sbjct: 153 WALEQAKNNLVQHYFLVGVTEQLDEFIGVLEASLSTMFRGASERFQTGGKSHLRKTSNKQ 212
Query: 249 DPSEETVQQIKKSKIWELENELYEYALEQFHFVKKHNLVYNKVLGYEADKGKQFMYEKIY 308
PS ET+ + SKI+++E E Y++A+EQF +K+ + + V G +QFMYEKI
Sbjct: 213 LPSAETLAKFHSSKIYQMEKEFYDFAVEQFEHIKRRTISF--VEGKHVPIPQQFMYEKIR 270
Query: 309 PK 310
P+
Sbjct: 271 PR 272
>gi|156390781|ref|XP_001635448.1| predicted protein [Nematostella vectensis]
gi|156222542|gb|EDO43385.1| predicted protein [Nematostella vectensis]
Length = 276
Score = 282 bits (722), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 139/284 (48%), Positives = 192/284 (67%), Gaps = 12/284 (4%)
Query: 27 DTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWR 86
+TVIIYNRVPKT STSF+ + YD+ + ++ +H+NVT N+HV+S+ DQ RF +N+T+W
Sbjct: 2 NTVIIYNRVPKTASTSFMGVVYDLSEQNNYHTIHLNVTKNSHVMSVTDQLRFAHNITQWS 61
Query: 87 DRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKH 146
+R PA YHGH +I+FQ G + ++IN++RKPLDRLVSYYYFLR+GD +RPH R +
Sbjct: 62 ERLPAFYHGHVQYIEFQSLGVTKPVIYINVIRKPLDRLVSYYYFLRFGDTFRPHKRRSRQ 121
Query: 147 GDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVG 206
G+K + I L LE + Q+ F C PGN WALE+AK +LV KYLLVG
Sbjct: 122 GNKEVWGGLIEL------LEYIIKQLNFFF----PCRSPGNKWALEQAKRHLVEKYLLVG 171
Query: 207 VTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEETVQQIKKSKIWEL 266
+TE+L DF+++LE ALP FF+G T+ F T NKSHLR+T K ++T+ K+KIW++
Sbjct: 172 LTEQLEDFITILETALPRFFKGATNRFQTGNKSHLRKTASKQPLQQQTLDFFYKNKIWKM 231
Query: 267 ENELYEYALEQFHFVKKHNLVYNKVLGYEADKGKQFMYEKIYPK 310
ENE YE+A + F+ VKK L+ N+ +F YEKI P+
Sbjct: 232 ENEFYEFARKVFNGVKKKTLIVNQ--DGSVTSSVEFFYEKIRPR 273
>gi|312074082|ref|XP_003139810.1| heparan sulfate 2-O-sulfotransferase [Loa loa]
Length = 322
Score = 281 bits (718), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 133/294 (45%), Positives = 206/294 (70%), Gaps = 13/294 (4%)
Query: 22 DSLSW-DTVIIYNRVPKTGSTSFVN-MAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFV 79
+ LS+ +++IYNR+PKTGST+ N + Y++C + F+V+H+N+T N +++++ DQ RF+
Sbjct: 35 EKLSYTSSIVIYNRIPKTGSTTLTNAVMYNLCYRNGFSVIHLNLTRNRYLMNIVDQRRFI 94
Query: 80 NNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRP 139
+N+T W++R PA+YHGH FI+F +FG P++IN++R+PL+RL+SYYYFLRYGDNYR
Sbjct: 95 DNITNWKERMPAIYHGHVAFINFNRFGLP-NPVYINLIREPLERLISYYYFLRYGDNYRT 153
Query: 140 HLVRKKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLV 199
L R + G+ TFD+CI +C ++ MWLQ+P+ CG C GN ALE+AK NL+
Sbjct: 154 GLKRSRAGNNETFDQCIVRKGRDCDMKQMWLQIPYFCGTHHFCSEVGNNRALEQAKTNLI 213
Query: 200 TKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSH--LRRTNRKIDPSEETVQQ 257
YLLVG+++++ DF+ LLE LP+FFRG +F + ++ H LR TNRKI PS+ TV+
Sbjct: 214 NYYLLVGLSDKMRDFIELLELLLPTFFRGALKNFDSLDEKHANLRHTNRKIPPSKATVEA 273
Query: 258 IKKSKIWELENELYEYALEQFHFVKKHNLVYNKVLGYEADK--GKQFMYEKIYP 309
I+K +++ +E E Y++A +QF +++ ++L A++ QF YEKI P
Sbjct: 274 IRKERVYIMEREFYDFAQKQFSEMRR------RILDGTANELLPPQFHYEKIKP 321
>gi|393908207|gb|EFO24256.2| heparan sulfate 2-O-sulfotransferase [Loa loa]
gi|393908208|gb|EJD74954.1| heparan sulfate 2-O-sulfotransferase, variant [Loa loa]
Length = 335
Score = 280 bits (717), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 133/294 (45%), Positives = 206/294 (70%), Gaps = 13/294 (4%)
Query: 22 DSLSW-DTVIIYNRVPKTGSTSFVN-MAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFV 79
+ LS+ +++IYNR+PKTGST+ N + Y++C + F+V+H+N+T N +++++ DQ RF+
Sbjct: 48 EKLSYTSSIVIYNRIPKTGSTTLTNAVMYNLCYRNGFSVIHLNLTRNRYLMNIVDQRRFI 107
Query: 80 NNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRP 139
+N+T W++R PA+YHGH FI+F +FG P++IN++R+PL+RL+SYYYFLRYGDNYR
Sbjct: 108 DNITNWKERMPAIYHGHVAFINFNRFGL-PNPVYINLIREPLERLISYYYFLRYGDNYRT 166
Query: 140 HLVRKKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLV 199
L R + G+ TFD+CI +C ++ MWLQ+P+ CG C GN ALE+AK NL+
Sbjct: 167 GLKRSRAGNNETFDQCIVRKGRDCDMKQMWLQIPYFCGTHHFCSEVGNNRALEQAKTNLI 226
Query: 200 TKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSH--LRRTNRKIDPSEETVQQ 257
YLLVG+++++ DF+ LLE LP+FFRG +F + ++ H LR TNRKI PS+ TV+
Sbjct: 227 NYYLLVGLSDKMRDFIELLELLLPTFFRGALKNFDSLDEKHANLRHTNRKIPPSKATVEA 286
Query: 258 IKKSKIWELENELYEYALEQFHFVKKHNLVYNKVLGYEADK--GKQFMYEKIYP 309
I+K +++ +E E Y++A +QF +++ ++L A++ QF YEKI P
Sbjct: 287 IRKERVYIMEREFYDFAQKQFSEMRR------RILDGTANELLPPQFHYEKIKP 334
>gi|170584694|ref|XP_001897129.1| Heparan sulfate 2-O-sulfotransferase protein [Brugia malayi]
gi|158595459|gb|EDP34012.1| Heparan sulfate 2-O-sulfotransferase protein, putative [Brugia
malayi]
Length = 315
Score = 276 bits (705), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 134/294 (45%), Positives = 203/294 (69%), Gaps = 13/294 (4%)
Query: 22 DSLSW-DTVIIYNRVPKTGSTSFVN-MAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFV 79
+ LS+ +++IYNR+PKTGST+ N + Y++C FNV+H+N+T N +++++ DQ RF+
Sbjct: 28 EKLSYASSIVIYNRIPKTGSTTLTNAIMYNLCHHNGFNVIHLNLTRNRYLMNIVDQRRFI 87
Query: 80 NNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRP 139
+N+T W++R PA+YHGH FI+F +FG P++IN++R+PLDRL+SYYYFLRYGDNYR
Sbjct: 88 DNITNWKERIPAIYHGHVAFINFNRFGLP-NPVYINLIREPLDRLISYYYFLRYGDNYRV 146
Query: 140 HLVRKKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLV 199
L R + G+ TFD+CI +C ++ MWLQ+P+ CG C GN ALE+AK NL+
Sbjct: 147 GLKRSRAGNNETFDQCIERKGHDCDMKQMWLQIPYFCGTHHFCSEVGNNRALEQAKVNLI 206
Query: 200 TKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSH--LRRTNRKIDPSEETVQQ 257
YLLVG++E++ F+ LLE LP+FFR +F + ++ H LR TN KI P++ TV+
Sbjct: 207 NYYLLVGLSEQMRHFIELLELLLPTFFRDALKNFDSLDEKHANLRHTNLKIPPNKATVEA 266
Query: 258 IKKSKIWELENELYEYALEQFHFVKKHNLVYNKVLGYEADK--GKQFMYEKIYP 309
I+K+ I+ +E E Y++A +QF+ +++ +VL A++ QF YEKI P
Sbjct: 267 IRKNPIYIMEREFYDFAQKQFNEMRR------RVLDESANELLPPQFHYEKIKP 314
>gi|332222079|ref|XP_003260191.1| PREDICTED: heparan sulfate 2-O-sulfotransferase 1 [Nomascus
leucogenys]
Length = 330
Score = 274 bits (701), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 131/284 (46%), Positives = 183/284 (64%), Gaps = 28/284 (9%)
Query: 27 DTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWR 86
D VIIYNRVPKT STSF N+AYD+C K +++VLH+N T NN V+SL DQ RFV N+T W+
Sbjct: 73 DMVIIYNRVPKTASTSFTNIAYDLCAKNKYHVLHINTTKNNPVMSLQDQVRFVKNITSWK 132
Query: 87 DRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKH 146
+ +P YHGH ++DF + NI + + + +R G
Sbjct: 133 EMKPGFYHGHVSYLDFAKCSVD------NISK------ILLHCLMRTG------------ 168
Query: 147 GDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVG 206
+TFDEC+ ++C+ E +WLQ+PF CGH++ CW G+ WA+++AK NL+ +Y LVG
Sbjct: 169 --LSTFDECVAEGGSDCAPEKLWLQIPFFCGHSSECWNVGSRWAMDQAKYNLINEYFLVG 226
Query: 207 VTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEETVQQIKKSKIWEL 266
VTEEL DF+ LLEAALP FFRG T+ + T KSHLR+T K P+++T+ ++++S IW++
Sbjct: 227 VTEELEDFIMLLEAALPRFFRGATELYRTGKKSHLRKTTEKKLPTKQTIAKLQQSDIWKM 286
Query: 267 ENELYEYALEQFHFVKKHNLVYNKVLGYEADKGKQFMYEKIYPK 310
ENE YE+ALEQF F++ H + G + F YEKIYPK
Sbjct: 287 ENEFYEFALEQFQFIRAHAVREKD--GDLYILAQNFFYEKIYPK 328
>gi|402593917|gb|EJW87844.1| hypothetical protein WUBG_01243 [Wuchereria bancrofti]
Length = 329
Score = 271 bits (692), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 130/294 (44%), Positives = 200/294 (68%), Gaps = 13/294 (4%)
Query: 22 DSLSW-DTVIIYNRVPKTGSTSFVN-MAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFV 79
+ LS+ +++IYNR+PKTGST+ N + Y++C + FNV+H+N+T N +++++ DQ RF+
Sbjct: 42 EKLSYASSIVIYNRIPKTGSTTLTNAIMYNLCHRNGFNVIHLNLTRNRYLMNIVDQRRFI 101
Query: 80 NNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRP 139
+N+T W++R PA+YHGH FI+F +FG P++IN++R+PLDRL+SYYYFLRYGDNYR
Sbjct: 102 DNITNWKERMPAVYHGHVAFINFNRFGLP-NPVYINLIREPLDRLISYYYFLRYGDNYRV 160
Query: 140 HLVRKKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLV 199
L R + G+ TFD+CI +C ++ MWLQ+P+ CG C GN ALE+AK NL+
Sbjct: 161 GLKRSRAGNNETFDQCIARKGHDCDMKQMWLQIPYFCGTHHFCSEVGNNRALEQAKVNLI 220
Query: 200 TKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSH--LRRTNRKIDPSEETVQQ 257
YLLVG++ ++ F+ LLE LP FFR +F + ++ H LR TN KI P++ TV+
Sbjct: 221 NYYLLVGLSNQMRHFIELLELLLPKFFRDALKNFDSLDEKHANLRHTNLKIPPNKATVEA 280
Query: 258 IKKSKIWELENELYEYALEQFHFVKKHNLVYNKVLGYEADK--GKQFMYEKIYP 309
I+ ++ +E E Y++A +QF+ +++ ++L A++ QF YEKI P
Sbjct: 281 IRNDPVYIMEREFYDFAQKQFNEMRR------RMLDESANELLPPQFHYEKIKP 328
>gi|268578055|ref|XP_002644010.1| C. briggsae CBR-HST-2 protein [Caenorhabditis briggsae]
Length = 487
Score = 269 bits (688), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 131/297 (44%), Positives = 193/297 (64%), Gaps = 25/297 (8%)
Query: 11 ISSAKSPSPETDSLSWDTVIIYNRVPKTGSTSFVN-MAYDMCRKKRFNVLHVNVTGNNHV 69
I ++ P P +++ ++IYNR+PKTGST+F N +AYD+ ++ FNVLHVN+T N V
Sbjct: 72 IPNSVRPWPTSNN-----IVIYNRIPKTGSTTFTNAIAYDLYKENGFNVLHVNMTKNRQV 126
Query: 70 LS----------------LADQYRFVNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLF 113
LS L DQY F+NNVT WR+R PA YHGH +IDF +FG P++
Sbjct: 127 LSSLNVKKDGPLFVQVMSLPDQYVFINNVTTWRERLPAFYHGHVAYIDFTRFGV-ANPIY 185
Query: 114 INILRKPLDRLVSYYYFLRYGDNYRPHLVRKKHGDKTTFDECIRLNRTECSLENMWLQVP 173
INI+R+PL+RL+S+YYFLRYGDN+R L R + G+ TFDEC +C ++ MW+Q+P
Sbjct: 186 INIIREPLERLLSHYYFLRYGDNFRVGLKRSRAGNNETFDECYSRGGKDCDMKQMWMQIP 245
Query: 174 FLCGHAAACWVPGNPWALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHF 233
+ CGH C G+ AL+ AK+N + KYLLVG T+ + D ++LLE +P FF+G HF
Sbjct: 246 YFCGHYHFCTEVGSAEALKMAKQNALEKYLLVGTTDRMRDMIALLEVTVPHFFKGALGHF 305
Query: 234 --LTSNKSHLRRTNRKIDPSEETVQQIKKSKIWELENELYEYALEQFHFVKKHNLVY 288
L N++HLR T +KI P+E+T+ I++ +++++E E Y++ + F V L++
Sbjct: 306 DQLDENRAHLRYTKKKIPPNEQTLSMIRRDEVYKMEREFYDFVRDLFDAVFNPKLLW 362
>gi|344257964|gb|EGW14068.1| Heparan sulfate 2-O-sulfotransferase 1 [Cricetulus griseus]
Length = 224
Score = 267 bits (683), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 118/223 (52%), Positives = 166/223 (74%), Gaps = 3/223 (1%)
Query: 89 RPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKHGD 148
+P YHGH ++DF +FG K++P++IN++R P++RLVSYYYFLR+GD+YRP L R+K GD
Sbjct: 2 KPGFYHGHISYLDFAKFGVKKKPIYINVIRDPIERLVSYYYFLRFGDDYRPGLRRRKQGD 61
Query: 149 KTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVGVT 208
K TFDEC+ ++C+ E +WLQ+PF CGH++ CW G+ WA+++AK NL+ +Y LVGVT
Sbjct: 62 KKTFDECVAEGGSDCAPEKLWLQIPFFCGHSSECWNVGSRWAMDQAKYNLINEYFLVGVT 121
Query: 209 EELTDFVSLLEAALPSFFRGGTDHFLT-SNKSHLRRTNRKIDPSEETVQQIKKSKIWELE 267
EEL DF+ LLEAALP FFRG TD + T KSHLR+T K P+++T+ ++++S IW++E
Sbjct: 122 EELEDFIMLLEAALPRFFRGATDLYRTVGKKSHLRKTTEKKLPTKQTIAKLQQSDIWKME 181
Query: 268 NELYEYALEQFHFVKKHNLVYNKVLGYEADKGKQFMYEKIYPK 310
NE YE+ALEQF F++ H + + G + F YEKIYPK
Sbjct: 182 NEFYEFALEQFQFIRAHAV--REKDGDLYILAQNFFYEKIYPK 222
>gi|221114983|ref|XP_002162119.1| PREDICTED: heparin sulfate O-sulfotransferase-like [Hydra
magnipapillata]
Length = 327
Score = 267 bits (682), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 130/283 (45%), Positives = 184/283 (65%), Gaps = 11/283 (3%)
Query: 29 VIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWRDR 88
+I+YNRVPKTGSTSF+++ Y + +F+V VNV+ +H DQYRF NVT W R
Sbjct: 53 IILYNRVPKTGSTSFMSLLYALHSVNKFSVAFVNVSWISHRFLFLDQYRFALNVTSWNIR 112
Query: 89 RPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKHGD 148
+PA+Y GHF F+DF + G QPL+INI+RKPLDR VS+YYF+RYGD + P+ R +HGD
Sbjct: 113 KPAIYSGHFPFLDFTKLG-MHQPLYINIVRKPLDRAVSHYYFIRYGDTFLPNKKRLRHGD 171
Query: 149 KTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVGVT 208
+T+FD+C++LN +ECS +W+Q+P+ CG A CW PG+ AL AK NL YLLVG+
Sbjct: 172 ETSFDDCVKLNSSECSEAKLWMQIPYFCGSDAFCWEPGSEKALMHAKINLEKHYLLVGLM 231
Query: 209 EELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEETVQQIKKSKIWELEN 268
E++ +F+ +LE+ LP FF G FL + LR+T K +TV+ KKSK+W++E
Sbjct: 232 EKMDNFIEILESILPRFFHGSLKLFLKDGQVQLRKTKIKKSLLPQTVEHFKKSKVWQMEE 291
Query: 269 ELYEYALEQFHFVKKH-NLVYNKVLGYEADKGKQFMYEKIYPK 310
Y+ FVKKH + Y + + + QF + K+ P+
Sbjct: 292 SFYQ-------FVKKHFDTTYQEFIHSRTE--TQFSFSKLKPE 325
>gi|349967686|dbj|GAA31625.1| heparan sulfate 2-O-sulfotransferase HS2ST1 [Clonorchis sinensis]
Length = 359
Score = 254 bits (649), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 114/257 (44%), Positives = 172/257 (66%), Gaps = 3/257 (1%)
Query: 29 VIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWRDR 88
+++YNRVPKT TS V++ Y +C++ + VN++ N L+ +Q + V+N++
Sbjct: 66 LVVYNRVPKTAGTSLVHLIYRLCKRNNVGITMVNISWNGVYLNRFNQLQLVHNISNRHYT 125
Query: 89 RPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKK--- 145
RP L HGHF F+DF QFGS QP+++N++R PL+RLVS+YYFLR+GD++RP+++RK+
Sbjct: 126 RPLLLHGHFAFLDFAQFGSTLQPVYLNMIRDPLERLVSHYYFLRFGDDFRPNVIRKRMNH 185
Query: 146 HGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLV 205
G + TFD+C+ +C +++W+QVPF CGHAA C +PGNP ALE AK + YLLV
Sbjct: 186 SGRQQTFDDCVLNGGLDCQPKDLWVQVPFFCGHAAYCRIPGNPEALETAKRRVTEDYLLV 245
Query: 206 GVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEETVQQIKKSKIWE 265
G+TEE +FV+LLE LP FF+G TD S+ HLRRT K+ + T + S++W+
Sbjct: 246 GLTEEFDEFVTLLEKLLPRFFQGSTDLLQGSDGWHLRRTKHKLPINASTRAVFRDSRVWQ 305
Query: 266 LENELYEYALEQFHFVK 282
+E YE+ +F ++
Sbjct: 306 IEQAFYEFVRAEFWTIR 322
>gi|312376821|gb|EFR23803.1| hypothetical protein AND_12214 [Anopheles darlingi]
Length = 205
Score = 253 bits (645), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 112/202 (55%), Positives = 151/202 (74%), Gaps = 6/202 (2%)
Query: 110 QPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKHGDKTTFDECIRLNRTECSLENMW 169
+P++IN++R+PLDRLVSYYYFLRYGD+YRP+LVR + GD TFDEC+ + +C NMW
Sbjct: 4 KPMYINLIRQPLDRLVSYYYFLRYGDDYRPYLVRHRAGDTMTFDECVAKQKPDCDPTNMW 63
Query: 170 LQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGG 229
LQ+PF CGH A CW PG+ WALE+AK NL +Y LVG+TEE+ +F+ LLE +LP +RG
Sbjct: 64 LQIPFFCGHHAECWNPGSSWALEQAKRNLANEYFLVGLTEEMDEFIELLELSLPRLYRGA 123
Query: 230 TDHFLTSNKSHLRRTNRKIDPSEETVQQIKKSKIWELENELYEYALEQFHFVKKHNLVYN 289
HF SNKSHLRRT K++P+ +TV +IK+S IW++ENELYE+A +QFHFV+ +
Sbjct: 124 VSHFQKSNKSHLRRTKSKVEPTPDTVAKIKESTIWQMENELYEFARDQFHFVQ------H 177
Query: 290 KVLGYEADKGKQFMYEKIYPKP 311
K+ + ++F+YEKI P P
Sbjct: 178 KLRTPGRNVMQEFLYEKIKPNP 199
>gi|226479734|emb|CAX73163.1| heparan sulfate 2-O-sulfotransferase HS2ST1 [Schistosoma japonicum]
Length = 371
Score = 252 bits (644), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 120/262 (45%), Positives = 171/262 (65%), Gaps = 11/262 (4%)
Query: 29 VIIYNRVPKTGSTSFVNMAYDMCRK--KRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWR 86
++IYNR+PKTGSTS +N+ Y + + +V+H+N++ N L+ + V+N+T W
Sbjct: 77 IVIYNRIPKTGSTSLINLVYQLLEENYSHTHVIHLNISSNKRYLNRLSELHLVDNLTHWT 136
Query: 87 DRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKH 146
P + HGHF FI+F ++GS P++IN++R PLDRLVSYYYFLRYGDNYRP+L+RK+
Sbjct: 137 RMHPLMIHGHFTFINFVKYGSPLNPIYINMIRNPLDRLVSYYYFLRYGDNYRPYLIRKRM 196
Query: 147 GD----KTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKY 202
D TFDEC+ +N ++CS + +W+QVP+ CG A C +PGN A+E AK +++ Y
Sbjct: 197 FDHMVRNQTFDECVLVNGSDCSPQLLWVQVPYFCGQAMYCRIPGNLAAVETAKRHVIENY 256
Query: 203 LLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSH---LRRTNRKIDPSEETVQQIK 259
L+VG+TEE FV LLE LPSFF G H L S H LRRTN K S+ T++ +
Sbjct: 257 LIVGITEEFDKFVDLLEILLPSFFTGA--HSLRSRSKHKWYLRRTNLKFPISQATIKIYQ 314
Query: 260 KSKIWELENELYEYALEQFHFV 281
+ IW+ E + Y + +FH V
Sbjct: 315 GNPIWQAEQDFYNFVRTEFHAV 336
>gi|313225614|emb|CBY07088.1| unnamed protein product [Oikopleura dioica]
Length = 339
Score = 251 bits (641), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 122/284 (42%), Positives = 178/284 (62%), Gaps = 19/284 (6%)
Query: 29 VIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVT---GNNHVLSLADQYRFVNNVTKW 85
+I+YNRVPKT ST+F ++ YD+ ++ V+HVN T N ++SL D+ N+T W
Sbjct: 63 IIVYNRVPKTASTAFTHLLYDLTQENSIYVIHVNTTVPKQNAAIMSLQDKMLLRQNITTW 122
Query: 86 RDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKK 145
PA YHGHF F D +IN++R P DRLVS YYFLRYGDN+R L+R K
Sbjct: 123 -GLTPAFYHGHFAFFDVPNV------FWINLIRNPFDRLVSNYYFLRYGDNFRKGLLRSK 175
Query: 146 HGDKTTFDECI-RLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLL 204
+GD TTF+EC+ + + +CS++ MW+Q+P+ CG AACW PG+ WAL++AK+N++ Y L
Sbjct: 176 NGDTTTFNECVEKESSKDCSIQKMWVQIPYFCGQVAACWEPGSQWALDRAKQNVLEHYFL 235
Query: 205 VGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEETVQQIKKSKIW 264
VG TE L+ FV +LE +P+ F+G + FL + +R+TN K + +EET ++ ++IW
Sbjct: 236 VGTTENLSQFVEVLENEIPAIFKGSYEKFLRQDP--IRKTNHKDEITEETKAKLSNTRIW 293
Query: 265 ELENELYEYALEQFHFVKKHNLVYNKVLGYEADKGKQFMYEKIY 308
+E + Y + VK NL+Y + + K F YEKIY
Sbjct: 294 RMEFDFYNF------IVKNFNLIYTRSITDGKLTKKNFFYEKIY 331
>gi|197382758|ref|NP_001127964.1| heparan sulfate 2-O-sulfotransferase 1 isoform 2 [Homo sapiens]
gi|19263489|gb|AAH25384.1| HS2ST1 protein [Homo sapiens]
gi|19684079|gb|AAH25990.1| HS2ST1 protein [Homo sapiens]
gi|80477761|gb|AAI08736.1| HS2ST1 protein [Homo sapiens]
gi|325464139|gb|ADZ15840.1| heparan sulfate 2-O-sulfotransferase 1 [synthetic construct]
Length = 229
Score = 230 bits (586), Expect = 8e-58, Method: Compositional matrix adjust.
Identities = 92/157 (58%), Positives = 125/157 (79%)
Query: 27 DTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWR 86
D VIIYNRVPKT STSF N+AYD+C K +++VLH+N T NN V+SL DQ RFV N+T W+
Sbjct: 73 DMVIIYNRVPKTASTSFTNIAYDLCAKNKYHVLHINTTKNNPVMSLQDQVRFVKNITSWK 132
Query: 87 DRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKH 146
+ +P YHGH ++DF +FG K++P++IN++R P++RLVSYYYFLR+GD+YRP L R+K
Sbjct: 133 EMKPGFYHGHVSYLDFAKFGVKKKPIYINVIRDPIERLVSYYYFLRFGDDYRPGLRRRKQ 192
Query: 147 GDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACW 183
GDK TFDEC+ ++C+ E +WLQ+PF CGH++ CW
Sbjct: 193 GDKKTFDECVAEGGSDCAPEKLWLQIPFFCGHSSECW 229
>gi|339237287|ref|XP_003380198.1| heparan sulfate 2-O-sulfotransferase hst-2 [Trichinella spiralis]
gi|316977006|gb|EFV60186.1| heparan sulfate 2-O-sulfotransferase hst-2 [Trichinella spiralis]
Length = 537
Score = 203 bits (517), Expect = 7e-50, Method: Compositional matrix adjust.
Identities = 84/172 (48%), Positives = 121/172 (70%), Gaps = 4/172 (2%)
Query: 15 KSPSPETDSLSWD---TVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLS 71
K P E W+ V +YNR+PKTGSTS + + Y++C+K F+V+H+N++ N+HV++
Sbjct: 359 KPPLAEGSLDEWNLQNAVFLYNRIPKTGSTSLMGIIYELCQKNSFHVIHLNMSRNSHVMT 418
Query: 72 LADQYRFVNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFL 131
DQ F N + W R+PA YHGH +IDF +FG K P+++N++R PL+R++SYYYFL
Sbjct: 419 PWDQVHFAGNFSNWTQRKPAFYHGHVAYIDFTKFGMK-NPIYLNVVRDPLERMISYYYFL 477
Query: 132 RYGDNYRPHLVRKKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACW 183
RYGD++RPHL RK+ G+ TFDEC++ +C N+WLQ+PF CGH A CW
Sbjct: 478 RYGDDFRPHLSRKRKGNNETFDECVKRKGRDCDPANLWLQIPFFCGHHADCW 529
>gi|355695121|gb|AER99901.1| heparan sulfate 2-O-sulfotransferase 1 [Mustela putorius furo]
Length = 211
Score = 194 bits (494), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 80/130 (61%), Positives = 106/130 (81%)
Query: 27 DTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWR 86
D VIIYNRVPKT STSF N+AYD+C K +++VLH+N T NN V+SL DQ RFV N+T W+
Sbjct: 78 DMVIIYNRVPKTASTSFTNIAYDLCAKNKYHVLHINTTKNNPVMSLQDQVRFVKNITSWK 137
Query: 87 DRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKH 146
+ +P YHGH ++DF +FG K++P++IN++R P++RLVSYYYFLR+GD+YRP L R+K
Sbjct: 138 EMKPGFYHGHVSYLDFAKFGVKKKPIYINVIRDPIERLVSYYYFLRFGDDYRPGLRRRKQ 197
Query: 147 GDKTTFDECI 156
GDK TFDEC+
Sbjct: 198 GDKKTFDECV 207
>gi|47215963|emb|CAF96365.1| unnamed protein product [Tetraodon nigroviridis]
Length = 280
Score = 189 bits (479), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 86/160 (53%), Positives = 115/160 (71%), Gaps = 2/160 (1%)
Query: 151 TFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVGVTEE 210
TFDEC+ ++C+ E +WLQ+PF CGH + CW G+ WALE+AK NLV +YLLVGVTEE
Sbjct: 122 TFDECVSAGGSDCAPEKLWLQIPFFCGHYSECWNAGSQWALEQAKYNLVNEYLLVGVTEE 181
Query: 211 LTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEETVQQIKKSKIWELENEL 270
L DFV +LEAALP FFRG T+ + T KSHLR+T+ K P++E++ ++++S IW++ENE
Sbjct: 182 LEDFVMMLEAALPRFFRGATELYRTGKKSHLRKTSEKKPPTKESIAKLQQSAIWKMENEF 241
Query: 271 YEYALEQFHFVKKHNLVYNKVLGYEADKGKQFMYEKIYPK 310
YE+ALEQF FV+ H + Y + F YEKIYPK
Sbjct: 242 YEFALEQFQFVRAHAVREKDGELYLL--AQNFFYEKIYPK 279
Score = 77.4 bits (189), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 36/64 (56%), Positives = 47/64 (73%), Gaps = 3/64 (4%)
Query: 12 SSAKSPSPETDSLSWDTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLS 71
SS++ P + + D VIIYNRVPKT STSF N+AYD+C K R++VLH+N T NN V+S
Sbjct: 61 SSSRVPDSDGED---DVVIIYNRVPKTASTSFTNIAYDLCGKNRYHVLHINTTKNNPVMS 117
Query: 72 LADQ 75
+ DQ
Sbjct: 118 IQDQ 121
>gi|324528990|gb|ADY48975.1| Heparan sulfate 2-O-sulfotransferase hst-2, partial [Ascaris suum]
Length = 161
Score = 180 bits (457), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 81/161 (50%), Positives = 113/161 (70%), Gaps = 1/161 (0%)
Query: 48 YDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWRDRRPALYHGHFGFIDFQQFGS 107
YD+CR+ F+VLH+N+T N + ++L DQ RF+ N+T W + +PA+YHGH FIDF FG
Sbjct: 2 YDLCRRNGFHVLHLNLTRNRYSMNLIDQRRFIENITSWHEMQPAIYHGHAAFIDFTLFGV 61
Query: 108 KEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKHGDKTTFDECIRLNRTECSLEN 167
P++IN+LR+PL+ L+S+YYFLRYGDNYR L R + G+ TFDEC+ +C ++
Sbjct: 62 P-NPIYINLLREPLEHLLSHYYFLRYGDNYRIGLKRSRAGNNETFDECVARRGKDCDMKQ 120
Query: 168 MWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVGVT 208
MWLQ+P+ CG C GN ALE+AK NL+ YL+VG+
Sbjct: 121 MWLQIPYFCGTHHFCSEVGNVRALEQAKRNLLDHYLIVGLN 161
>gi|256085031|ref|XP_002578728.1| heparan sulfate 2-o-sulfotransferase [Schistosoma mansoni]
gi|350645848|emb|CCD59452.1| heparan sulfate 2-o-sulfotransferase, putative [Schistosoma
mansoni]
Length = 206
Score = 179 bits (453), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 88/173 (50%), Positives = 117/173 (67%), Gaps = 9/173 (5%)
Query: 116 ILRKPLDRLVSYYYFLRYGDNYRPHLVRKKHGD----KTTFDECIRLNRTECSLENMWLQ 171
++R PLDRLVSYYYFLRYGD+YRP+L+RK+ D TFDEC+ +N ++CS + +W+Q
Sbjct: 1 MIRNPLDRLVSYYYFLRYGDDYRPYLMRKRMFDLVTRNQTFDECVLVNGSDCSPQLLWVQ 60
Query: 172 VPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTD 231
VPF CG A C +PGNP ALE AK ++ YL+VG+TEE FV+LLE LPSFF G
Sbjct: 61 VPFFCGQAMYCRIPGNPVALETAKRRVIEDYLIVGLTEEFDKFVNLLELLLPSFFTGA-- 118
Query: 232 HFLTS---NKSHLRRTNRKIDPSEETVQQIKKSKIWELENELYEYALEQFHFV 281
H L S +K HLRRTN K+ S+ T + + + IW+ E E Y + +FH +
Sbjct: 119 HNLISRSKDKWHLRRTNYKLPISKATTKIYQDNPIWQAEQEFYNFVRTEFHTI 171
>gi|115725159|ref|XP_781906.2| PREDICTED: heparan sulfate 2-O-sulfotransferase 1-like, partial
[Strongylocentrotus purpuratus]
Length = 196
Score = 166 bits (420), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 86/231 (37%), Positives = 121/231 (52%), Gaps = 45/231 (19%)
Query: 79 VNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYR 138
V N++ W ++P YHGHF +IDF
Sbjct: 4 VRNISNWSAKQPGFYHGHFAYIDF------------------------------------ 27
Query: 139 PHLVRKKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENL 198
P L T DEC+ +C+ E MW+QVPF CGH CW PG+ WALE+AK N+
Sbjct: 28 PSL--------RTLDECVMRQGFDCAPERMWIQVPFFCGHNPECWKPGSRWALEQAKSNV 79
Query: 199 VTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEETVQQI 258
V KY LVGVTE + +FV++LEA+LPSFF+G F+ KSHLR+T +K++P T+ +
Sbjct: 80 VNKYFLVGVTEHMEEFVAMLEASLPSFFKGAHHIFMKGEKSHLRKTVQKLNPLPYTLDTL 139
Query: 259 KKSKIWELENELYEYALEQFHFVKKHNLVYNKVLGYEADKGKQFMYEKIYP 309
K + +W +E++ Y + LE F +L+ K K +QF +EKI P
Sbjct: 140 KNNDVWTVEDKFYRFVLETFRTNAARSLIIEKDHSIRTVK-QQFSFEKIRP 189
>gi|324505268|gb|ADY42266.1| Heparan sulfate 2-O-sulfotransferase hst-2 [Ascaris suum]
Length = 248
Score = 166 bits (419), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 92/206 (44%), Positives = 131/206 (63%), Gaps = 15/206 (7%)
Query: 111 PLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKHGDKTTFDECIRLNRTECSLENMWL 170
P++IN+LR+PL+RL+S+YYFLRYGDNYR L R + G+ TFDEC+ +C ++ MWL
Sbjct: 51 PIYINLLREPLERLLSHYYFLRYGDNYRIGLKRSRAGNNETFDECVARRGKDCDMKQMWL 110
Query: 171 QVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGT 230
Q+P+ CG C GN ALE+AK NL+ YL+VG+ E + DF++LLE LP FF G
Sbjct: 111 QIPYFCGTHHFCSEVGNVRALEQAKRNLLDHYLIVGLNERMRDFIALLEILLPKFFNGAL 170
Query: 231 DHF--LTSNKSHLRRTNRKIDPSEETVQQIKKSKIWELENELYEYALEQFHFVKKHNLVY 288
+HF L ++HLR T +KI P+E T++ IK K++ +E E Y++A+ QF + K
Sbjct: 171 EHFDSLDERRAHLRNTKKKIPPNERTLEVIKSDKVYIMEREFYDFAVMQFENIWKRT--- 227
Query: 289 NKVLGYEADK-----GKQFMYEKIYP 309
+E D +QF +EKI P
Sbjct: 228 -----HEDDSENDFLPQQFHFEKIKP 248
>gi|38048649|gb|AAR10227.1| similar to Drosophila melanogaster Hs2st, partial [Drosophila
yakuba]
Length = 180
Score = 164 bits (414), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 72/127 (56%), Positives = 99/127 (77%), Gaps = 2/127 (1%)
Query: 11 ISSAKSPSPETDSLSWD--TVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNH 68
++ + S TD ++ V++YNRVPKTGSTSFVN+AYD+C+ +F+VLH+NVT N H
Sbjct: 54 LTPDQHASSTTDDFDFEEHLVVLYNRVPKTGSTSFVNIAYDLCKPNKFHVLHINVTANMH 113
Query: 69 VLSLADQYRFVNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYY 128
VLSL +Q +FV NV++W + +PALYHGH F+DF +F +P++IN++RKPLDRLVSYY
Sbjct: 114 VLSLPNQIQFVRNVSRWHEMKPALYHGHMAFLDFSKFQIAHKPIYINLVRKPLDRLVSYY 173
Query: 129 YFLRYGD 135
YFLR+GD
Sbjct: 174 YFLRFGD 180
>gi|156377738|ref|XP_001630803.1| predicted protein [Nematostella vectensis]
gi|156217831|gb|EDO38740.1| predicted protein [Nematostella vectensis]
Length = 275
Score = 160 bits (405), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 103/268 (38%), Positives = 145/268 (54%), Gaps = 18/268 (6%)
Query: 30 IIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWRDRR 89
IIYNRV K GS S + + ++ F +N H L+ Q FV+ V + +
Sbjct: 1 IIYNRVAKCGSRSMILLISELAVANGFKF--INHPKTQHFLTAKQQLAFVDMVEQ--QSQ 56
Query: 90 PALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGD--NYRPHLVRKKHG 147
++ H FIDFQ+FGS P++IN++R PL RLVS YYF R+GD N + G
Sbjct: 57 SFIFTRHMYFIDFQRFGSV-TPIYINLIRDPLSRLVSQYYFRRFGDGRNRTWDFKGSEAG 115
Query: 148 DKTTFDECIRLNRTECSLEN-MWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVG 206
+FDEC+ NRTEC N ++ +PF CG + C V + WA +A NLV +YL+VG
Sbjct: 116 RHRSFDECVITNRTECMDPNALFYVIPFFCGQSRKCRV-SSKWAFRQALYNLVNRYLVVG 174
Query: 207 VTEELTDFVSLLEAALPSFFRGGTDHF------LTSNKSHLRRTNRKIDPSEETVQQIKK 260
+ EE+ DF+++LE LP+FF+G + + + RT K PS E V+ +KK
Sbjct: 175 ILEEVDDFLTVLEKLLPNFFKGALEMWKMPETRRKGRRQDETRTKNKKAPSAEVVRIMKK 234
Query: 261 SKIWELENELYEYALEQFHFVKK-HNLV 287
LE E YE E+FH +KK H L
Sbjct: 235 R--LHLEYEFYEAVKERFHRLKKEHGLA 260
>gi|291220908|ref|XP_002730466.1| PREDICTED: predicted protein-like [Saccoglossus kowalevskii]
Length = 341
Score = 151 bits (382), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 88/258 (34%), Positives = 140/258 (54%), Gaps = 8/258 (3%)
Query: 29 VIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNN--HVLSLADQYRFVNNVTKWR 86
++ YN+V K GS S V + + R F + T N+ L+ DQ V +T
Sbjct: 80 LVFYNKVGKCGSRSLVYLLRRLGRINNFTSAGQSKTPNSKSRYLTPTDQLELVQRITSLP 139
Query: 87 DRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKH 146
RPA ++ H FIDF +FG+ ++P++INI+R+PLDRLVS YYF R+GD+
Sbjct: 140 --RPATFYRHTVFIDFLRFGA-QRPIYINIIRRPLDRLVSEYYFKRFGDDKNSSKGFLGE 196
Query: 147 GDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVG 206
+FD+C+ LN++EC +N++ +P+ CG C WAL+ AK N++ +L+VG
Sbjct: 197 TKYQSFDDCVLLNKSECRGDNIFYIIPYFCGQQQQCR-SATEWALQTAKVNVINHFLVVG 255
Query: 207 VTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEETVQQIKKSKIWEL 266
++EE + + +LE LP FF F T +T +K P E V+ + + +L
Sbjct: 256 LSEEYENTLRVLEKMLPQFFTTAVRAFKTPGVIPSTKTRKKQPPKPEVVKIMTER--LKL 313
Query: 267 ENELYEYALEQFHFVKKH 284
E E YE+ ++F +K+
Sbjct: 314 EIEFYEFIKDRFQRLKRQ 331
>gi|291233613|ref|XP_002736748.1| PREDICTED: uronyl-2-sulfotransferase-like [Saccoglossus
kowalevskii]
Length = 402
Score = 145 bits (367), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 91/274 (33%), Positives = 139/274 (50%), Gaps = 29/274 (10%)
Query: 30 IIYNRVPKTGSTSFVNMAYDMCRKKRF-----NVLHVNVTGNNHVLSLADQYRFVNNVTK 84
++YNR+ K GS F+ M + F V H+ H LA R +N
Sbjct: 143 LVYNRIEKCGSRMFLTMIAFLSFGHGFISIRNQVFHLKYLSEKHQEYLA---RIINT--- 196
Query: 85 WRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDN-YRPHLVR 143
P +Y H F+DF +FG EQP +IN++R PL R +SYYYF R+GD+ + P +
Sbjct: 197 --QEPPFIYERHLHFLDFAKFGF-EQPYYINVIRDPLQRFISYYYFRRFGDSMFNPVFIL 253
Query: 144 KKHGDKTTFDECIRLNRTECSLENM----WLQVPFLCGHAAACWVPGNPWALEKAKENLV 199
+ TFD+C+ +R ECSL N+ + +PF CG C +PG WALEKAK+ +V
Sbjct: 254 NETERYETFDDCVLRHRAECSLNNVQYYTFFIIPFFCGQEPGCRIPGR-WALEKAKQRVV 312
Query: 200 TKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKID-----PSEET 254
Y+ VG+ E+ + + + E LP FF G+ + ++ L +R + PS+
Sbjct: 313 NDYIFVGILEDFENSLRIFEILLPQFF--GSALKVYNSVVGLELFDRNVSSLKHPPSQTA 370
Query: 255 VQQIKKSKIWELENELYEYALEQFHFVKKHNLVY 288
+Q++ K ELE E Y + + ++K L Y
Sbjct: 371 LQEMTKR--LELEYEFYYFVKSRMELIRKELLGY 402
>gi|291242119|ref|XP_002740956.1| PREDICTED: uronyl-2-sulfotransferase-like [Saccoglossus
kowalevskii]
Length = 344
Score = 144 bits (363), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 93/284 (32%), Positives = 153/284 (53%), Gaps = 15/284 (5%)
Query: 2 NTQKSHQIHISSAKSPSPETD--SLSW-DTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNV 58
N+Q S + +S + P +L+W + +I+NR+ +TGS + + +A + +K + V
Sbjct: 59 NSQVSLALREASYEDLKPVVSFSNLTWSERSVIFNRIERTGSRTLLTIAEKLSKKYNYTV 118
Query: 59 LHVNVTGNNHVLSLADQYRFVNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILR 118
+ + + L +Q V+ V + P +Y +I F ++GSK QP +I+++R
Sbjct: 119 QTRDAWAMKY-MGLGEQKALVHTVNAVKT--PFIYDQSVHYIHFPRYGSK-QPFWISLVR 174
Query: 119 KPLDRLVSYYYFLRYGD-NYRPHLVRKKHGDKTTFDECIRLNRTECSLENMWLQVPFLCG 177
PL R++S YY+ RYGD RP+ + G K +FD+C+ N+ ECSL+N++ +P+ CG
Sbjct: 175 DPLRRIISLYYYKRYGDVGTRPNASQPTAG-KPSFDDCVLKNQPECSLKNVFRVIPYFCG 233
Query: 178 HAAACWVPGNPWALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGT---DHFL 234
+A C VP N WALE AK+N+ + VGV EEL +LE LP FF G +
Sbjct: 234 QSAGCRVP-NKWALETAKKNVEENFKFVGVLEELNTTFQVLEVLLPQFFHGAPRVHKSII 292
Query: 235 TSNKSHLRRTNRKIDPSEETVQQIKKSKIWELENELYEYALEQF 278
+S ++ PSE+ + ++K LE E YE+ E+
Sbjct: 293 SSGVVDHFKSFPGQPPSEQALTKMKGR--LALEYEFYEFVKEKM 334
>gi|291241495|ref|XP_002740653.1| PREDICTED: uronyl-2-sulfotransferase-like [Saccoglossus
kowalevskii]
Length = 355
Score = 142 bits (358), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 85/260 (32%), Positives = 134/260 (51%), Gaps = 13/260 (5%)
Query: 30 IIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWRDRR 89
+I+NRV K GS S V+ + RK F + V L+ +Q FV V
Sbjct: 100 LIFNRVGKCGSRSMVHTIDILARKNNFPNMKSQVFTQKR-LNEQEQAEFVAEVDVLDP-- 156
Query: 90 PALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGD-NYRPHLVRKKHGD 148
P +Y+ H ++DF +FGS EQP IN++R PLDR +S+YY+ R+GD Y+
Sbjct: 157 PYIYNRHIDYVDFSRFGS-EQPYGINLIRDPLDRTISFYYYTRFGDATYQREAGATMKDL 215
Query: 149 KTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVGVT 208
TFDEC+ ++ ECS N + +P+ CG C VP WALE A ++V Y+ VG+
Sbjct: 216 NQTFDECVLFDKEECSTNNTFRIIPYFCGQDDMCRVPSR-WALETAIRHVVKDYVFVGIL 274
Query: 209 EELTDFVSLLEAALPSFFRGGTDHFLT----SNKSHLRRTNRKIDPSEETVQQIKKSKIW 264
E+ + + +LE +P FF G ++ + T + + +RK +P+E +K+ +
Sbjct: 275 EDFENTLRILEIIMPQFFGGASEAYSTIVTKGDVQSFKSVSRK-EPAEVATTIMKQRMAY 333
Query: 265 ELENELYEYALEQFHFVKKH 284
E E Y++ + +KK
Sbjct: 334 EY--EFYDFVKRRMELIKKQ 351
>gi|291242115|ref|XP_002740955.1| PREDICTED: uronyl-2-sulfotransferase-like [Saccoglossus
kowalevskii]
Length = 335
Score = 142 bits (357), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 92/280 (32%), Positives = 149/280 (53%), Gaps = 20/280 (7%)
Query: 19 PETDSLSW-DTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYR 77
P + +W ++ +I+NRV K GS S +N+ + +K +F V L+ Q
Sbjct: 65 PLDPTAAWNESRVIFNRVSKVGSRSLLNVIMKLSQKNKFERARSPVFVKVF-LNETGQKI 123
Query: 78 FVNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNY 137
F + P +Y H ++DF++ SK P +IN+ R P+ RLVS+YY+ R+GD++
Sbjct: 124 FTKEIEVITP--PFIYDRHLDYVDFERHASKP-PTWINLTRDPVSRLVSFYYYTRFGDSF 180
Query: 138 R-PHLVRKKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKE 196
P TFD+C+ ++ ECS +N + VP+ CG C + WA+E+AK
Sbjct: 181 SVPEWTGAAEDFNQTFDQCVLQDKFECSTQNTFRVVPYFCGQDDECRT-ASRWAVEQAKR 239
Query: 197 NLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHF---LTSNKSHLRRTNRKIDPSEE 253
N+V K+L VG+TE+ + +LE LP FF G + + L++ + ++ KI+PSEE
Sbjct: 240 NVVEKFLFVGITEDFNSTLVVLERLLPQFFEGALESYKITLSTIRDQF-KSKSKIEPSEE 298
Query: 254 TVQQIKKSKIWELENELYEYALEQFHFVKKHNLVYNKVLG 293
V +I++ ++ LE E YE FVK L+ + LG
Sbjct: 299 -VLKIQRERM-ALEVEFYE-------FVKARMLLIKQQLG 329
>gi|443683687|gb|ELT87850.1| hypothetical protein CAPTEDRAFT_205630 [Capitella teleta]
Length = 335
Score = 141 bits (356), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 81/264 (30%), Positives = 141/264 (53%), Gaps = 19/264 (7%)
Query: 30 IIYNRVPKTGSTSFVNMAYDMCRKKRFNV-----LHVNVTGNNHVLSLADQYRFVNNVTK 84
I YNR+PK GST+ + + + +K R+ L N+T + + D+ +
Sbjct: 28 IFYNRMPKCGSTTMMALLKIVAKKNRWTYQYEVKLVANLTAKSTEPWIGDEQTLTKLIHL 87
Query: 85 WRD-RRPALYHGHFGFIDFQQFGSKEQ-PLFINILRKPLDRLVSYYYFLRYGDNYRPHLV 142
D +P + H + DF +F S ++ P +INI+R P +RL+S+YYF+R+ N++ +
Sbjct: 88 IMDHEKPMMASCHLYYTDFARFWSDQRLPTYINIIRNPQERLISFYYFIRFFPNHQRPMS 147
Query: 143 RKKHGDKTTFDECIRLNRTECS---LENMWLQVPFLCGHAAACWVPGNPWALEKAKENLV 199
K ++DEC+R N EC+ + W +P+ CGH C P ALE+AK N++
Sbjct: 148 DAKRN--MSYDECVRQNDDECTGAHPKGYWTLIPYFCGHDPVCRKPSQA-ALEQAKANVM 204
Query: 200 TKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRR----TNRKIDPSEETV 255
Y +VG+TE++ F +LE +P +F+G + K+ + RK+ P ET+
Sbjct: 205 RSYAVVGITEDIDSFTQVLEQTIPKYFKGLAAEYRRLKKAGRNKAYSNARRKVKPKPETL 264
Query: 256 QQIKKSKIWELENELYEYALEQFH 279
++++ + +LENE YE+ +F+
Sbjct: 265 EKMR--PLLKLENEFYEFVKNRFY 286
>gi|291221371|ref|XP_002730695.1| PREDICTED: uronyl-2-sulfotransferase-like [Saccoglossus
kowalevskii]
Length = 318
Score = 141 bits (356), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 77/212 (36%), Positives = 119/212 (56%), Gaps = 6/212 (2%)
Query: 24 LSWDTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVT 83
LS + ++YNRV K GS + + + + + F + NV HV SL +Q VN V+
Sbjct: 92 LSPQSRVVYNRVGKCGSRTVIAVLNKLSTRNGFTLYSSNVYNRTHV-SLKEQIDIVNTVS 150
Query: 84 KWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVR 143
+ P +Y HF +IDF +FG+ QP++INI+R P++R VS YY+ R+GD ++
Sbjct: 151 SLKP--PYVYQRHFHYIDFPKFGAI-QPVYINIIRDPINRFVSGYYYKRFGDADNQKYMK 207
Query: 144 KKHGDKT-TFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKY 202
K K T DEC+ LN+ ECS + ++ +PF CG C P AL++A NL ++
Sbjct: 208 KSFPSKNMTVDECVLLNKEECSDKKLFYMIPFFCGQHPYCSTPSQ-LALDRALSNLDKRF 266
Query: 203 LLVGVTEELTDFVSLLEAALPSFFRGGTDHFL 234
L+VG E++ + + +LE LP F G + L
Sbjct: 267 LVVGFIEKIDETMQVLEQLLPDIFGGAVEILL 298
>gi|390356309|ref|XP_797651.3| PREDICTED: uronyl 2-sulfotransferase-like [Strongylocentrotus
purpuratus]
Length = 482
Score = 140 bits (353), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 92/262 (35%), Positives = 137/262 (52%), Gaps = 15/262 (5%)
Query: 29 VIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWRDR 88
I++NRV K GS +N+ ++ RF L ++T N L+ + + V + +
Sbjct: 224 AIVFNRVGKCGSRVVINVLQELSXSNRF-YLVCSLTYNRTRLTPSAEEHLVKVTSSLK-- 280
Query: 89 RPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKHGD 148
RP ++ H FID+ + QPL+INI+R PLDR+VS YY+ R+GD ++ K+
Sbjct: 281 RPNIFQRHIHFIDYSRH-QMPQPLYINIIRDPLDRMVSQYYYSRFGDERSTGHIKGKY-Q 338
Query: 149 KTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVGVT 208
TFDEC+ EC + +P+ CG C PWA +KA ENL Y++ G+
Sbjct: 339 YQTFDECVLSGSEECLGPKAFYIIPYFCGQDLNC-TKDRPWAFQKAVENLNKYYIVTGIL 397
Query: 209 EELTDFVSLLEAALPSFFRGGTDHFLT-SNKSHLRR------TNRKIDPSEETVQQIKKS 261
EEL D L E LPSFF+G + + + S +L++ T KI PS E V +I K
Sbjct: 398 EELEDTFRLFERVLPSFFKGALEIYQSLSIGDNLKKNLTTTVTKHKIKPSPE-VSRIMKE 456
Query: 262 KIWELENELYEYALEQFHFVKK 283
+ +LE YE E+FH K+
Sbjct: 457 HM-KLEYSFYELVKEKFHDQKR 477
>gi|291233611|ref|XP_002736747.1| PREDICTED: uronyl-2-sulfotransferase-like [Saccoglossus
kowalevskii]
Length = 427
Score = 140 bits (352), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 93/274 (33%), Positives = 133/274 (48%), Gaps = 34/274 (12%)
Query: 30 IIYNRVPKTGSTSFVN-MAYDMCRKKRFN----VLHVNVTGNNHVLSLADQYRFVNNVTK 84
++YNRV K GS F+ +A+ N V H+ H LA R +NN
Sbjct: 167 LVYNRVDKCGSRLFLAAIAFLSFGHGYINIWSRVYHLKYLSEKHQKYLA---RIINN--- 220
Query: 85 WRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNY--RPHLV 142
P +Y H F+DF +FG EQP +IN++R PL R +SYYYF R+GD+ P
Sbjct: 221 --QEPPFIYERHLHFLDFAKFGF-EQPYYINVIRDPLQRFISYYYFRRFGDSLVSDPFFA 277
Query: 143 RKKHGDKTTFDECIRLNRTECSLEN--MWLQVPFLCGHAAACWVPGNPWALEKAKENLVT 200
+ TFDEC+ + ECS++N M+ +PF CG C VPG WALEKAK+ +V
Sbjct: 278 GNETERYQTFDECVLQHSIECSMKNPYMFFIIPFFCGQEPECRVPG-IWALEKAKQRVVN 336
Query: 201 KYLLVGVTEELTDFVSLLEAALPSFFRGGTD--------HFLTSNKSHLRRTNRKIDPSE 252
Y+ VG+ E+ + + + E LP FF N S L+ T PS
Sbjct: 337 DYIFVGILEDFENSLRIFEILLPQFFGSALKIYNSVIELELFDRNASSLKHT-----PST 391
Query: 253 ETVQQIKKSKIWELENELYEYALEQFHFVKKHNL 286
+Q++ K ELE + Y + + ++K L
Sbjct: 392 AAIQEMTKR--LELEYQFYYFVKSRMELIRKQLL 423
>gi|313233448|emb|CBY24563.1| unnamed protein product [Oikopleura dioica]
Length = 332
Score = 139 bits (349), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 90/299 (30%), Positives = 149/299 (49%), Gaps = 18/299 (6%)
Query: 2 NTQKSHQIHISSAKSPSPETDSLSWDT------VIIYNRVPKTGSTSFVNMAYDMCRKKR 55
N +KSHQ KS + L+ + V++YNRVPK GS + + + RK
Sbjct: 40 NDEKSHQEKDQKTKSKLEARNDLNVTSEAIHPKVVVYNRVPKCGSQTMSMLVNQLSRKNG 99
Query: 56 FNVLHVNVTGNNHVLSLADQYRFVNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFIN 115
F V G + Q F+ + + + +Y H FIDF QF + P++IN
Sbjct: 100 FLSKAVFEAGETPDRTTTQQKAFMGELKGYAKDQKVMYTRHQYFIDFDQFEWAD-PVYIN 158
Query: 116 ILRKPLDRLVSYYYFLRYGDNYRPHLVRKKHGDKTTF-----DECIRLNRTECSLENMWL 170
++R P+DR S+YYF R+G+ R K + D+CI RTEC+ E +W
Sbjct: 159 LIRDPVDRFASFYYFSRFGNKRAQDAGRTKQQVPSNILNENIDDCITRRRTECT-EPIWH 217
Query: 171 QVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGT 230
VP++CG+ +C + + A+++AK+N+ +KY++VG+ EEL + +LE LP FF G
Sbjct: 218 TVPYICGNDKSC-LQRHESAVQQAKQNIDSKYVVVGILEELKGTLGVLEHVLPDFFEGAV 276
Query: 231 DHFL-TSNKSHLRRTNRKIDPSEETVQQIKKSKIWELENELYEYALEQFHFVKKHNLVY 288
+H N ++ + D + E + +LE +LY +A + + K+N ++
Sbjct: 277 NHLTEIRNDTYTVKKKAMTDAAREY---LANETALKLEYDLYNHAKGKHPVILKNNQLF 332
>gi|260833388|ref|XP_002611639.1| hypothetical protein BRAFLDRAFT_63696 [Branchiostoma floridae]
gi|229297010|gb|EEN67649.1| hypothetical protein BRAFLDRAFT_63696 [Branchiostoma floridae]
Length = 319
Score = 136 bits (343), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 88/259 (33%), Positives = 138/259 (53%), Gaps = 15/259 (5%)
Query: 29 VIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWRDR 88
++ YNRVPK GS S + M + F+ L V + + D + +R
Sbjct: 58 LVFYNRVPKCGSNSMKILLRTMAKNNYFSFLEDKVYV---IETFEDNELQIYTELVYRLP 114
Query: 89 RPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVR--KKH 146
P++Y ++DF++FG +QPL+IN++R PL+R VS+YY++R+G + R +
Sbjct: 115 TPSIYEKQIFYVDFRRFGF-QQPLYINLVRDPLERRVSWYYYIRFGRVVHRPIPRNFSQQ 173
Query: 147 GDKTTFDECIRLNRTECSLENM--WLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLL 204
TFDEC+ N EC E +L F CGH C P A+E+AKEN+ Y +
Sbjct: 174 EMAQTFDECVLSNAWECDAEGQESFLMTKFFCGHDPVCRQPSQA-AVERAKENIRRHYAV 232
Query: 205 VGVTEELTDFVSLLEAALPSFFRGGTDHF--LTSNKSHL--RRTNRKIDPSEETVQQIKK 260
VGV EE + F+ +LE +P FFRG D + LT L +RT + PS + Q+I +
Sbjct: 233 VGVLEEFSSFLKVLEVVMPQFFRGAHDTWKELTQQTQLLEEQRTVNRSPPSPRS-QKIMR 291
Query: 261 SKIWELENELYEYALEQFH 279
++ +L+ ++Y + E+FH
Sbjct: 292 ERL-KLDYQVYYFIRERFH 309
>gi|443710632|gb|ELU04794.1| hypothetical protein CAPTEDRAFT_201489 [Capitella teleta]
Length = 366
Score = 135 bits (341), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 84/268 (31%), Positives = 143/268 (53%), Gaps = 26/268 (9%)
Query: 30 IIYNRVPKTGSTSFVNMAYDMCRKKRFN-------VLHVNVTGNNHVLSLADQYRFVNNV 82
I YNR+PK GS + + ++ +K + + + G+ + + + ++++
Sbjct: 94 IFYNRMPKCGSEMTMTLLRNVAKKNHWTYPPENKRIWTITKQGHQYWVDNEKVLKEISHL 153
Query: 83 TKWRDRRPALYHGHFGFIDFQQFGSKEQ-PLFINILRKPLDRLVSYYYFLRYGDNYRPHL 141
T ++P L+ H + DF + SK+ P +INI+R P DRL+S YY+ R+ P
Sbjct: 154 T-MDHKKPMLFSCHLYYTDFSRLWSKKSLPTYINIIRNPQDRLISLYYYFRF----HPVQ 208
Query: 142 VRKKHG---DKTTFDECIRLNRTECSL---ENMWLQVPFLCGHAAACWVPGNPWALEKAK 195
RKK K T+D C+R+ EC+ E W VP+ CGH C P AL++AK
Sbjct: 209 RRKKMPIALRKMTYDTCVRMELPECTAPNPEGFWTMVPYFCGHDPICRTPSRD-ALDRAK 267
Query: 196 ENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNR----KIDPS 251
N+V+ Y ++G+TE++ F +LE +P +FRG TD + K++ N+ K++P
Sbjct: 268 ANIVSNYAVIGLTEDMETFTRVLENVIPKYFRGMTDLYRQMAKNYKASVNKNTYHKVEPL 327
Query: 252 EETVQQIKKSKIWELENELYEYALEQFH 279
+ET+ +I K ++ LE E YE+ +F+
Sbjct: 328 QETL-KIMKPRL-RLEVEFYEFVKSRFY 353
>gi|291241489|ref|XP_002740642.1| PREDICTED: uronyl-2-sulfotransferase-like [Saccoglossus
kowalevskii]
Length = 348
Score = 135 bits (340), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 81/261 (31%), Positives = 137/261 (52%), Gaps = 13/261 (4%)
Query: 28 TVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWRD 87
T +++NRV K GS S ++ + +K + + + +L +Q F V +
Sbjct: 83 TRMVFNRVGKCGSRSLIHTIDLLSKKMSYPHVKSKIFTQKKLLE-HEQAEFAFEVDSYDP 141
Query: 88 RRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKHG 147
P +Y+ H F+DF ++G +QP +IN +R PL+R VS++Y+ R+GD VR G
Sbjct: 142 --PFIYNRHVNFVDFARYGV-DQPYWINQIRDPLNRTVSFFYYTRFGDGMS---VRDTSG 195
Query: 148 DKT--TFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLV 205
TFD+C+ N EC++ N + +P+ CG C+ P +ALE AK+N++ ++ V
Sbjct: 196 RNVDLTFDDCVLKNHPECNITNTFAIIPYFCGQDPGCYEPTR-YALETAKQNVLKHFIFV 254
Query: 206 GVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSE--ETVQQIKKSKI 263
GV EE T + +LE LP FF+G + + + + L + + + E E V+ I K ++
Sbjct: 255 GVLEEFTTSLLILEQILPQFFQGAPEAYQQTQEKGLVESYKSVQRKEPSENVKAIMKERL 314
Query: 264 WELENELYEYALEQFHFVKKH 284
LE E Y + +F +K
Sbjct: 315 -SLEYEFYYFVKRRFDLIKDQ 334
>gi|390350793|ref|XP_794450.3| PREDICTED: uronyl 2-sulfotransferase-like [Strongylocentrotus
purpuratus]
Length = 439
Score = 134 bits (338), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 87/283 (30%), Positives = 144/283 (50%), Gaps = 34/283 (12%)
Query: 19 PETDSLSWDTV-------IIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLS 71
P T+ +W+ +I+NRV K GS S +N+ + + F ++ V N+ +S
Sbjct: 152 PITEGYTWNNTREEVKRHVIFNRVGKCGSRSVLNLLQSLAKNNHFYLISSQVY-NDKRIS 210
Query: 72 LADQYRFVNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFL 131
+Q + V+ ++K + P +Y H FI+F ++G + P +INI+R PL+R VS YYF+
Sbjct: 211 SENQEKLVSILSKVQS--PFIYQRHLHFINFTEYGF-QHPTYINIIRDPLERAVSQYYFI 267
Query: 132 RYGDNYRPHLVRKKHGDKT------TFDECIRLNRTECSLENMWLQVPFLCGHAAACWVP 185
R+GD + R+ + T ++++C+ EC + + VPF CGH C P
Sbjct: 268 RFGDEKKQE--RRFSQNSTDPRKLMSYEQCVIQQVPECIGKRAFYIVPFFCGHDPRCRFP 325
Query: 186 GNPWALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFL------TSNKS 239
WALE+A EN+ Y+ VG+ EEL D + + + LP F G D FL NK+
Sbjct: 326 -TAWALERAIENVKKHYVAVGILEELQDSLQVFQKVLPDMFSGALDTFLRFEQIANRNKT 384
Query: 240 HLRRTNRKIDPSEETVQQIKKSKIWELENELYEYALEQFHFVK 282
+ T K PS +T + +N L +Y + ++++K
Sbjct: 385 SVGVTQYKKKPSAKTALYV--------QNVLMKYEYQFYNWIK 419
>gi|332213584|ref|XP_003255905.1| PREDICTED: uronyl 2-sulfotransferase isoform 1 [Nomascus
leucogenys]
Length = 412
Score = 132 bits (333), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 93/279 (33%), Positives = 149/279 (53%), Gaps = 16/279 (5%)
Query: 14 AKSPSPETDSLSWDTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLA 73
+ P P + L + + ++YNRV K GS + V + + K FN++ ++ N L+
Sbjct: 95 SHCPHPPSKVLPFPSQVVYNRVGKCGSRTVVLLLRILSEKHGFNLVTSDIH-NKTRLTKN 153
Query: 74 DQYRFVNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRY 133
+Q + N++ +P L+ H F++F +FG +QP++INI+R P++R +S Y+F R+
Sbjct: 154 EQMELIKNIST--AEQPYLFTRHVHFLNFSRFGG-DQPVYINIIRDPVNRFLSNYFFRRF 210
Query: 134 GD--NYRPHLVR----KKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGN 187
GD + H++R ++ +ECI N ECS ++ +P+ CG C PG
Sbjct: 211 GDWRGEQNHMIRTPSMRQEERYLDINECILENYPECSNPRLFYIIPYFCGQHPRCREPGE 270
Query: 188 PWALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRR---T 244
WALE+AK N+ +LLVG+ EEL D + LLE LP +F+G + L T
Sbjct: 271 -WALERAKLNVNENFLLVGILEELEDVLLLLERFLPHYFKGVLSIYKDPEHRKLGNMTVT 329
Query: 245 NRKIDPSEETVQQIKKSKIWELENELYEYALEQFHFVKK 283
+K PS E VQ + + +E E Y Y EQFH +K+
Sbjct: 330 VKKTVPSPEAVQILYQRMRYEY--EFYHYVKEQFHLLKR 366
>gi|332213586|ref|XP_003255906.1| PREDICTED: uronyl 2-sulfotransferase isoform 2 [Nomascus
leucogenys]
Length = 409
Score = 132 bits (333), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 93/279 (33%), Positives = 149/279 (53%), Gaps = 16/279 (5%)
Query: 14 AKSPSPETDSLSWDTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLA 73
+ P P + L + + ++YNRV K GS + V + + K FN++ ++ N L+
Sbjct: 92 SHCPHPPSKVLPFPSQVVYNRVGKCGSRTVVLLLRILSEKHGFNLVTSDIH-NKTRLTKN 150
Query: 74 DQYRFVNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRY 133
+Q + N++ +P L+ H F++F +FG +QP++INI+R P++R +S Y+F R+
Sbjct: 151 EQMELIKNIST--AEQPYLFTRHVHFLNFSRFGG-DQPVYINIIRDPVNRFLSNYFFRRF 207
Query: 134 GD--NYRPHLVR----KKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGN 187
GD + H++R ++ +ECI N ECS ++ +P+ CG C PG
Sbjct: 208 GDWRGEQNHMIRTPSMRQEERYLDINECILENYPECSNPRLFYIIPYFCGQHPRCREPGE 267
Query: 188 PWALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRR---T 244
WALE+AK N+ +LLVG+ EEL D + LLE LP +F+G + L T
Sbjct: 268 -WALERAKLNVNENFLLVGILEELEDVLLLLERFLPHYFKGVLSIYKDPEHRKLGNMTVT 326
Query: 245 NRKIDPSEETVQQIKKSKIWELENELYEYALEQFHFVKK 283
+K PS E VQ + + +E E Y Y EQFH +K+
Sbjct: 327 VKKTVPSPEAVQILYQRMRYEY--EFYHYVKEQFHLLKR 363
>gi|148671588|gb|EDL03535.1| uronyl-2-sulfotransferase [Mus musculus]
Length = 355
Score = 132 bits (332), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 95/284 (33%), Positives = 151/284 (53%), Gaps = 16/284 (5%)
Query: 19 PETDSLSWDTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRF 78
P + L + + ++YNRV K GS + V + + K FN++ ++ N L+ +Q
Sbjct: 43 PPSKVLPFPSQVVYNRVGKCGSRTVVLLLRILSEKHGFNLVTSDIH-NKTRLTKNEQMEL 101
Query: 79 VNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGD--N 136
+ N++ +P L+ H F++F +FG +QP++INI+R P+ R +S Y+F R+GD
Sbjct: 102 IKNIST--AEQPYLFTRHVHFLNFSRFGG-DQPVYINIIRDPVSRFLSNYFFRRFGDWRG 158
Query: 137 YRPHLVR----KKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALE 192
+ H++R ++ +ECI N ECS ++ +P+ CG C PG WALE
Sbjct: 159 EQNHMIRTPSMRQEERYLDINECILENYPECSNPRLFYIIPYFCGQHPRCREPGE-WALE 217
Query: 193 KAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRR---TNRKID 249
+AK N+ +LLVG+ EEL D + LLE LP +F+G + L T RK
Sbjct: 218 RAKLNVNENFLLVGILEELEDVLLLLERFLPHYFKGVLSIYKDPEHRKLGNMTVTVRKTV 277
Query: 250 PSEETVQQIKKSKIWELENELYEYALEQFHFVKKHNLVYNKVLG 293
PS E VQ + + +E E Y Y EQFH +K+ + ++V G
Sbjct: 278 PSPEAVQILYQRMRYEY--EFYHYVREQFHLLKRKLGLKSRVSG 319
>gi|110556631|ref|NP_796361.2| uronyl 2-sulfotransferase [Mus musculus]
gi|342187110|sp|Q8BUB6.3|UST_MOUSE RecName: Full=Uronyl 2-sulfotransferase
gi|187951875|gb|AAI38156.1| Uronyl-2-sulfotransferase [Mus musculus]
Length = 407
Score = 132 bits (332), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 95/284 (33%), Positives = 151/284 (53%), Gaps = 16/284 (5%)
Query: 19 PETDSLSWDTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRF 78
P + L + + ++YNRV K GS + V + + K FN++ ++ N L+ +Q
Sbjct: 95 PPSKVLPFPSQVVYNRVGKCGSRTVVLLLRILSEKHGFNLVTSDIH-NKTRLTKNEQMEL 153
Query: 79 VNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGD--N 136
+ N++ +P L+ H F++F +FG +QP++INI+R P+ R +S Y+F R+GD
Sbjct: 154 IKNIST--AEQPYLFTRHVHFLNFSRFGG-DQPVYINIIRDPVSRFLSNYFFRRFGDWRG 210
Query: 137 YRPHLVR----KKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALE 192
+ H++R ++ +ECI N ECS ++ +P+ CG C PG WALE
Sbjct: 211 EQNHMIRTPSMRQEERYLDINECILENYPECSNPRLFYIIPYFCGQHPRCREPGE-WALE 269
Query: 193 KAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRR---TNRKID 249
+AK N+ +LLVG+ EEL D + LLE LP +F+G + L T RK
Sbjct: 270 RAKLNVNENFLLVGILEELEDVLLLLERFLPHYFKGVLSIYKDPEHRKLGNMTVTVRKTV 329
Query: 250 PSEETVQQIKKSKIWELENELYEYALEQFHFVKKHNLVYNKVLG 293
PS E VQ + + +E E Y Y EQFH +K+ + ++V G
Sbjct: 330 PSPEAVQILYQRMRYEY--EFYHYVREQFHLLKRKLGLKSRVSG 371
>gi|74188013|dbj|BAE37128.1| unnamed protein product [Mus musculus]
Length = 406
Score = 132 bits (332), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 95/284 (33%), Positives = 151/284 (53%), Gaps = 16/284 (5%)
Query: 19 PETDSLSWDTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRF 78
P + L + + ++YNRV K GS + V + + K FN++ ++ N L+ +Q
Sbjct: 94 PPSKVLPFPSQVVYNRVGKCGSRTVVLLLRILSEKHGFNLVTSDIH-NKTRLTKNEQMEL 152
Query: 79 VNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGD--N 136
+ N++ +P L+ H F++F +FG +QP++INI+R P+ R +S Y+F R+GD
Sbjct: 153 IKNIST--AEQPYLFTRHVHFLNFSRFGG-DQPVYINIIRDPVSRFLSNYFFRRFGDWRG 209
Query: 137 YRPHLVR----KKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALE 192
+ H++R ++ +ECI N ECS ++ +P+ CG C PG WALE
Sbjct: 210 EQNHMIRTPSMRQEERYLDINECILENYPECSNPRLFYIIPYFCGQHPRCREPGE-WALE 268
Query: 193 KAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRR---TNRKID 249
+AK N+ +LLVG+ EEL D + LLE LP +F+G + L T RK
Sbjct: 269 RAKLNVNENFLLVGILEELEDVLLLLERFLPHYFKGVLSIYKDPEHRKLGNMTVTVRKTV 328
Query: 250 PSEETVQQIKKSKIWELENELYEYALEQFHFVKKHNLVYNKVLG 293
PS E VQ + + +E E Y Y EQFH +K+ + ++V G
Sbjct: 329 PSPEAVQILYQRMRYEY--EFYHYVREQFHLLKRKLGLKSRVSG 370
>gi|198417083|ref|XP_002130122.1| PREDICTED: similar to predicted protein [Ciona intestinalis]
Length = 320
Score = 132 bits (332), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 81/260 (31%), Positives = 134/260 (51%), Gaps = 13/260 (5%)
Query: 30 IIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWRDRR 89
++YNRVPK GS S + Y + F V G + ++ FV + ++
Sbjct: 63 VVYNRVPKCGSMSMTTLCYRLGGANGFKVASPYEDGEKPNKNEEEEANFVEFL---HEQS 119
Query: 90 PA-LYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKHGD 148
P +Y H FIDF +FG K QPL++N++R P+ R S+YYF R+G+ + +
Sbjct: 120 PGYMYIRHQYFIDFTRFGQK-QPLYVNMIRDPVSRFESFYYFSRFGNERGGGGASSRMSE 178
Query: 149 ---KTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLV 205
+ + DECI+ R EC ++ W VP+ CG C + + WA++KAK+N+ Y+ V
Sbjct: 179 IRRQESIDECIQRRRQEC-IKPYWQVVPYFCGQDPGC-MSRSKWAVDKAKQNIKEHYVFV 236
Query: 206 GVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLR---RTNRKIDPSEETVQQIKKSK 262
G+ E+L + + +LEA LPS+F ++ +L ++ T K + SE +K
Sbjct: 237 GILEDLDNSLKVLEAILPSYFNDASNIYLNPENERMKHETHTRNKRETSEAARDFLKSET 296
Query: 263 IWELENELYEYALEQFHFVK 282
+LE +LYE+ + +K
Sbjct: 297 SLKLEYDLYEFVKSELENLK 316
>gi|344263874|ref|XP_003404020.1| PREDICTED: uronyl 2-sulfotransferase [Loxodonta africana]
Length = 418
Score = 132 bits (331), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 93/276 (33%), Positives = 149/276 (53%), Gaps = 20/276 (7%)
Query: 19 PETDSLSWDTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRF 78
P + L + + ++YNRV K GS + V + + K FN++ ++ N L+ +Q
Sbjct: 106 PPSKVLPFPSQVVYNRVGKCGSRTVVLLLRILSEKHGFNLVTSDIH-NKTRLTKNEQMEL 164
Query: 79 VNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGD--N 136
+ N++ +P L+ H F++F +FG +QP++INI+R P++R +S Y+F R+GD
Sbjct: 165 IKNIST--AEQPYLFTRHVHFLNFSRFGG-DQPVYINIIRDPVNRFLSNYFFRRFGDWRG 221
Query: 137 YRPHLVR----KKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALE 192
+ H++R ++ +ECI N ECS ++ +P+ CG C PG WALE
Sbjct: 222 EQNHMIRTPSMRQEERYLDINECILENYPECSNPRLFYIIPYFCGQHPRCREPGE-WALE 280
Query: 193 KAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTN-----RK 247
+AK N+ +LLVG+ EEL D + LLE LP +F+G + N H + N +K
Sbjct: 281 RAKLNVNENFLLVGILEELEDVLLLLERFLPHYFKGVLG--IYKNPEHRKLGNMTVTVKK 338
Query: 248 IDPSEETVQQIKKSKIWELENELYEYALEQFHFVKK 283
PS E VQ + + +E E Y Y EQFH +K+
Sbjct: 339 TVPSPEAVQILYQRMRYEY--EFYHYVREQFHLLKR 372
>gi|358413856|ref|XP_003582675.1| PREDICTED: uronyl 2-sulfotransferase-like [Bos taurus]
Length = 387
Score = 131 bits (329), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 92/274 (33%), Positives = 147/274 (53%), Gaps = 16/274 (5%)
Query: 19 PETDSLSWDTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRF 78
P + L + + ++YNRV K GS + V + + K FN++ ++ N L+ +Q
Sbjct: 43 PPSKVLPFPSQVVYNRVGKCGSRTVVLLLRILSEKHGFNLVTSDIH-NKTRLTKNEQMEL 101
Query: 79 VNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGD--N 136
+ N++ +P L+ H F++F +FG +QP++INI+R P++R +S Y+F R+GD
Sbjct: 102 IKNIST--AEQPYLFTRHVHFLNFSRFGG-DQPVYINIIRDPVNRFLSNYFFRRFGDWRG 158
Query: 137 YRPHLVR----KKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALE 192
+ H++R ++ +ECI N ECS ++ +P+ CG C PG WALE
Sbjct: 159 EQNHMIRTPSMRQEERYLDINECILENYPECSNPRLFYIIPYFCGQHPRCREPGE-WALE 217
Query: 193 KAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRR---TNRKID 249
+AK N+ +LLVG+ EEL D + LLE LP +F+G + L T +K
Sbjct: 218 RAKLNVNENFLLVGILEELEDVLLLLERFLPHYFKGVLSIYKDPEHRKLGNMTVTVKKTA 277
Query: 250 PSEETVQQIKKSKIWELENELYEYALEQFHFVKK 283
PS E VQ + + +E E Y Y EQFH +K+
Sbjct: 278 PSPEAVQILYQRMRYEY--EFYHYVKEQFHLLKR 309
>gi|440906919|gb|ELR57132.1| Uronyl 2-sulfotransferase, partial [Bos grunniens mutus]
Length = 326
Score = 131 bits (329), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 92/274 (33%), Positives = 147/274 (53%), Gaps = 16/274 (5%)
Query: 19 PETDSLSWDTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRF 78
P + L + + ++YNRV K GS + V + + K FN++ ++ N L+ +Q
Sbjct: 14 PPSKVLPFPSQVVYNRVGKCGSRTVVLLLRILSEKHGFNLVTSDIH-NKTRLTKNEQMEL 72
Query: 79 VNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGD--N 136
+ N++ +P L+ H F++F +FG +QP++INI+R P++R +S Y+F R+GD
Sbjct: 73 IKNIST--AEQPYLFTRHVHFLNFSRFGG-DQPVYINIIRDPVNRFLSNYFFRRFGDWRG 129
Query: 137 YRPHLVR----KKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALE 192
+ H++R ++ +ECI N ECS ++ +P+ CG C PG WALE
Sbjct: 130 EQNHMIRTPSMRQEERYLDINECILENYPECSNPRLFYIIPYFCGQHPRCREPGE-WALE 188
Query: 193 KAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRR---TNRKID 249
+AK N+ +LLVG+ EEL D + LLE LP +F+G + L T +K
Sbjct: 189 RAKLNVNENFLLVGILEELEDVLLLLERFLPHYFKGVLSIYKDPEHRKLGNMTVTVKKTA 248
Query: 250 PSEETVQQIKKSKIWELENELYEYALEQFHFVKK 283
PS E VQ + + +E E Y Y EQFH +K+
Sbjct: 249 PSPEAVQILYQRMRYEY--EFYHYVKEQFHLLKR 280
>gi|291242117|ref|XP_002740952.1| PREDICTED: uronyl-2-sulfotransferase-like [Saccoglossus
kowalevskii]
Length = 412
Score = 131 bits (329), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 87/283 (30%), Positives = 146/283 (51%), Gaps = 21/283 (7%)
Query: 15 KSPSPETDSLSWD-------TVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNN 67
+ P+PE L D T +IYN+V K GS +F+ + + + +FN + +
Sbjct: 78 RPPTPEA-GLPMDVSRTCKETKVIYNKVEKCGSRTFLYIVQKLALRNKFNHGSSRLWAHK 136
Query: 68 HVLSLADQYRFVNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSY 127
+ + A Q F+ V P LY ++ F ++G+ ++P +I+++R PL R +S
Sbjct: 137 YTGAEA-QKAFLTEVNSLE--LPLLYDRSIHYVHFPRYGA-QRPSWISLIRDPLQRFMSQ 192
Query: 128 YYFLRYGDN-YRPHLVRKKHGD--KTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWV 184
+Y+ RYGD RP K TFD+C+ N EC+ +N +P+ CG A C +
Sbjct: 193 FYYKRYGDKRTRPDTFTNITDSIAKQTFDKCVLNNHKECNKKNAMQVIPYFCGQGAGCRI 252
Query: 185 PGNPWALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTD-HFLTSNKSHLRR 243
P N WALE A+ N++ + +VG+ E+L + +LE LP FF+G D H ++
Sbjct: 253 P-NRWALETAQRNVIDNFDVVGILEDLNSTLFVLENVLPQFFKGSRDAHLSLIYAGVIQD 311
Query: 244 TNRKID--PSEETVQQIKKSKIWELENELYEYALEQFHFVKKH 284
+ + D PS E ++++KK LE E Y++ ++ F+KK
Sbjct: 312 YSSQTDDTPSTEAMEEMKKR--LALEYEFYDFVKQRLMFMKKQ 352
>gi|426235206|ref|XP_004011580.1| PREDICTED: uronyl 2-sulfotransferase [Ovis aries]
Length = 399
Score = 130 bits (328), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 92/274 (33%), Positives = 147/274 (53%), Gaps = 16/274 (5%)
Query: 19 PETDSLSWDTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRF 78
P + L + + ++YNRV K GS + V + + K FN++ ++ N L+ +Q
Sbjct: 55 PPSKVLPFPSQVVYNRVGKCGSRTVVLLLRILSEKHGFNLVTSDIH-NKTRLTKNEQMEL 113
Query: 79 VNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGD--N 136
+ N++ +P L+ H F++F +FG +QP++INI+R P++R +S Y+F R+GD
Sbjct: 114 IKNIST--AEQPYLFTRHVHFLNFSRFGG-DQPVYINIIRDPVNRFLSNYFFRRFGDWRG 170
Query: 137 YRPHLVR----KKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALE 192
+ H++R ++ +ECI N ECS ++ +P+ CG C PG WALE
Sbjct: 171 EQNHMIRTPSMRQEERYLDINECILENYPECSNPRLFYIIPYFCGQHPRCREPGE-WALE 229
Query: 193 KAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRR---TNRKID 249
+AK N+ +LLVG+ EEL D + LLE LP +F+G + L T +K
Sbjct: 230 RAKLNVNENFLLVGILEELEDVLLLLERFLPHYFKGVLSIYKDPEHRKLGNMTVTVKKTV 289
Query: 250 PSEETVQQIKKSKIWELENELYEYALEQFHFVKK 283
PS E VQ + + +E E Y Y EQFH +K+
Sbjct: 290 PSPEAVQILYQRMRYEY--EFYHYVKEQFHLLKR 321
>gi|410960234|ref|XP_003986699.1| PREDICTED: uronyl 2-sulfotransferase [Felis catus]
Length = 408
Score = 130 bits (328), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 92/274 (33%), Positives = 147/274 (53%), Gaps = 16/274 (5%)
Query: 19 PETDSLSWDTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRF 78
P + L + + ++YNRV K GS + V + + K FN++ ++ N L+ +Q
Sbjct: 96 PPSKVLPFPSQVVYNRVGKCGSRTVVLLLRILSEKHGFNLVTSDIH-NKTRLTKNEQMEL 154
Query: 79 VNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGD--N 136
+ N++ +P L+ H F++F +FG +QP++INI+R P++R +S Y+F R+GD
Sbjct: 155 IKNIST--AEQPYLFTRHVHFLNFSRFGG-DQPVYINIIRDPVNRFLSNYFFRRFGDWRG 211
Query: 137 YRPHLVR----KKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALE 192
+ H++R ++ +ECI N ECS ++ +P+ CG C PG WALE
Sbjct: 212 EQNHMIRTPSMRQEERYLDINECILENYPECSNPRLFYIIPYFCGQHPRCREPGE-WALE 270
Query: 193 KAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRR---TNRKID 249
+AK N+ +LLVG+ EEL D + LLE LP +F+G + L T +K
Sbjct: 271 RAKLNVNENFLLVGILEELEDVLLLLERFLPHYFKGVLSIYKDPEHRKLGNMTVTVKKTV 330
Query: 250 PSEETVQQIKKSKIWELENELYEYALEQFHFVKK 283
PS E VQ + + +E E Y Y EQFH +K+
Sbjct: 331 PSPEAVQILYQRMRYEY--EFYHYVKEQFHLLKR 362
>gi|326433109|gb|EGD78679.1| hypothetical protein PTSG_01658 [Salpingoeca sp. ATCC 50818]
Length = 329
Score = 130 bits (328), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 91/279 (32%), Positives = 143/279 (51%), Gaps = 19/279 (6%)
Query: 12 SSAKSPSPETDSLSWDTVIIYNRVPKTGSTSFVNMAYDMCRKKR-FNVLHVNVTGNNHVL 70
+S + +D + T + YNRVPK ST+ ++ D+ R KR + V +
Sbjct: 25 TSCGAAGIASDQATDATKVFYNRVPKAASTALRSIISDLARHKRNLKFVSSKVYDDRKRW 84
Query: 71 SLADQ---YRFVNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSY 127
S ++ +N V + + +Y H +I+F QFG +QP++IN++R+P R+VS
Sbjct: 85 SREEERANAERINFVFETNFDQHVIYDQHVRYINFTQFG-LQQPVYINMVREPAARIVSS 143
Query: 128 YYFLRYGDNYRPHLVRKKHGDKT--TFDECIRLNRTECSLENMW--LQVPFLCGHAAACW 183
YYF R GD+ + VR+K G++ + DEC+ NM L F CGHA C
Sbjct: 144 YYFARTGDSSKREQVRRKLGEQADWSVDECLAHGDNCTFARNMGGNLMTKFFCGHADVCN 203
Query: 184 VPGNPWALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRR 243
G P ALE+A NL Y +VG+TE D ++LLEA LP FRG + +++
Sbjct: 204 RVG-PAALEQALYNLEHNYAVVGITERFDDTLALLEATLPHIFRGAV---RLRERHGIKK 259
Query: 244 TNRKI----DPSEETVQQIKKSKIWELENELYEYALEQF 278
+NR P+ ET+ I+ S +++ LY+ A++ F
Sbjct: 260 SNRNTRAGPTPNAETIAAIRHSARYDV--ALYDRAVDLF 296
>gi|355561993|gb|EHH18625.1| hypothetical protein EGK_15270, partial [Macaca mulatta]
gi|355748833|gb|EHH53316.1| hypothetical protein EGM_13933, partial [Macaca fascicularis]
Length = 333
Score = 130 bits (328), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 92/274 (33%), Positives = 147/274 (53%), Gaps = 16/274 (5%)
Query: 19 PETDSLSWDTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRF 78
P + L + + ++YNRV K GS + V + + K FN++ ++ N L+ +Q
Sbjct: 21 PPSKVLPFPSQVVYNRVGKCGSRTVVLLLRILSEKHGFNLVTSDIH-NKTRLTKNEQMEL 79
Query: 79 VNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGD--N 136
+ N++ +P L+ H F++F +FG +QP++INI+R P++R +S Y+F R+GD
Sbjct: 80 IKNIST--AEQPYLFTRHVHFLNFSRFGG-DQPVYINIIRDPVNRFLSNYFFRRFGDWRG 136
Query: 137 YRPHLVR----KKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALE 192
+ H++R ++ +ECI N ECS ++ +P+ CG C PG WALE
Sbjct: 137 EQNHMIRTPSMRQEERYLDINECILENYPECSNPRLFYIIPYFCGQHPRCREPGE-WALE 195
Query: 193 KAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRR---TNRKID 249
+AK N+ +LLVG+ EEL D + LLE LP +F+G + L T +K
Sbjct: 196 RAKLNVNENFLLVGILEELEDVLLLLERFLPHYFKGVLSIYKDPEHRKLGNMTVTVKKTV 255
Query: 250 PSEETVQQIKKSKIWELENELYEYALEQFHFVKK 283
PS E VQ + + +E E Y Y EQFH +K+
Sbjct: 256 PSPEAVQILYQRMRYEY--EFYHYVKEQFHLLKR 287
>gi|332205909|ref|NP_001193755.1| uronyl 2-sulfotransferase isoform 1 [Bos taurus]
Length = 410
Score = 130 bits (327), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 92/274 (33%), Positives = 147/274 (53%), Gaps = 16/274 (5%)
Query: 19 PETDSLSWDTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRF 78
P + L + + ++YNRV K GS + V + + K FN++ ++ N L+ +Q
Sbjct: 98 PPSKVLPFPSQVVYNRVGKCGSRTVVLLLRILSEKHGFNLVTSDIH-NKTRLTKNEQMEL 156
Query: 79 VNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGD--N 136
+ N++ +P L+ H F++F +FG +QP++INI+R P++R +S Y+F R+GD
Sbjct: 157 IKNIST--AEQPYLFTRHVHFLNFSRFGG-DQPVYINIIRDPVNRFLSNYFFRRFGDWRG 213
Query: 137 YRPHLVR----KKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALE 192
+ H++R ++ +ECI N ECS ++ +P+ CG C PG WALE
Sbjct: 214 EQNHMIRTPSMRQEERYLDINECILENYPECSNPRLFYIIPYFCGQHPRCREPGE-WALE 272
Query: 193 KAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRR---TNRKID 249
+AK N+ +LLVG+ EEL D + LLE LP +F+G + L T +K
Sbjct: 273 RAKLNVNENFLLVGILEELEDVLLLLERFLPHYFKGVLSIYKDPEHRKLGNMTVTVKKTA 332
Query: 250 PSEETVQQIKKSKIWELENELYEYALEQFHFVKK 283
PS E VQ + + +E E Y Y EQFH +K+
Sbjct: 333 PSPEAVQILYQRMRYEY--EFYHYVKEQFHLLKR 364
>gi|332205911|ref|NP_001193756.1| uronyl 2-sulfotransferase [Bos taurus]
Length = 442
Score = 130 bits (327), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 92/274 (33%), Positives = 147/274 (53%), Gaps = 16/274 (5%)
Query: 19 PETDSLSWDTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRF 78
P + L + + ++YNRV K GS + V + + K FN++ ++ N L+ +Q
Sbjct: 98 PPSKVLPFPSQVVYNRVGKCGSRTVVLLLRILSEKHGFNLVTSDIH-NKTRLTKNEQMEL 156
Query: 79 VNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGD--N 136
+ N++ +P L+ H F++F +FG +QP++INI+R P++R +S Y+F R+GD
Sbjct: 157 IKNIST--AEQPYLFTRHVHFLNFSRFGG-DQPVYINIIRDPVNRFLSNYFFRRFGDWRG 213
Query: 137 YRPHLVR----KKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALE 192
+ H++R ++ +ECI N ECS ++ +P+ CG C PG WALE
Sbjct: 214 EQNHMIRTPSMRQEERYLDINECILENYPECSNPRLFYIIPYFCGQHPRCREPGE-WALE 272
Query: 193 KAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRR---TNRKID 249
+AK N+ +LLVG+ EEL D + LLE LP +F+G + L T +K
Sbjct: 273 RAKLNVNENFLLVGILEELEDVLLLLERFLPHYFKGVLSIYKDPEHRKLGNMTVTVKKTA 332
Query: 250 PSEETVQQIKKSKIWELENELYEYALEQFHFVKK 283
PS E VQ + + +E E Y Y EQFH +K+
Sbjct: 333 PSPEAVQILYQRMRYEY--EFYHYVKEQFHLLKR 364
>gi|410222354|gb|JAA08396.1| uronyl-2-sulfotransferase [Pan troglodytes]
gi|410250726|gb|JAA13330.1| uronyl-2-sulfotransferase [Pan troglodytes]
gi|410250728|gb|JAA13331.1| uronyl-2-sulfotransferase [Pan troglodytes]
gi|410250730|gb|JAA13332.1| uronyl-2-sulfotransferase [Pan troglodytes]
gi|410287142|gb|JAA22171.1| uronyl-2-sulfotransferase [Pan troglodytes]
gi|410287144|gb|JAA22172.1| uronyl-2-sulfotransferase [Pan troglodytes]
gi|410287146|gb|JAA22173.1| uronyl-2-sulfotransferase [Pan troglodytes]
gi|410287148|gb|JAA22174.1| uronyl-2-sulfotransferase [Pan troglodytes]
gi|410331665|gb|JAA34779.1| uronyl-2-sulfotransferase [Pan troglodytes]
Length = 406
Score = 130 bits (327), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 92/274 (33%), Positives = 147/274 (53%), Gaps = 16/274 (5%)
Query: 19 PETDSLSWDTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRF 78
P + L + + ++YNRV K GS + V + + K FN++ ++ N L+ +Q
Sbjct: 94 PPSKVLPFPSQVVYNRVGKCGSRTVVLLLRILSEKHGFNLVTSDIH-NKTRLTKNEQMEL 152
Query: 79 VNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGD--N 136
+ N++ +P L+ H F++F +FG +QP++INI+R P++R +S Y+F R+GD
Sbjct: 153 IKNIST--AEQPYLFTRHVHFLNFSRFGG-DQPVYINIIRDPVNRFLSNYFFRRFGDWRG 209
Query: 137 YRPHLVR----KKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALE 192
+ H++R ++ +ECI N ECS ++ +P+ CG C PG WALE
Sbjct: 210 EQNHMIRTPSMRQEERYLDINECILENYPECSNPRLFYIIPYFCGQHPRCREPGE-WALE 268
Query: 193 KAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRR---TNRKID 249
+AK N+ +LLVG+ EEL D + LLE LP +F+G + L T +K
Sbjct: 269 RAKLNVNENFLLVGILEELEDVLLLLERFLPHYFKGVLSIYKDPEHRKLGNMTVTVKKTV 328
Query: 250 PSEETVQQIKKSKIWELENELYEYALEQFHFVKK 283
PS E VQ + + +E E Y Y EQFH +K+
Sbjct: 329 PSPEAVQILYQRMRYEY--EFYHYVKEQFHLLKR 360
>gi|397480613|ref|XP_003811573.1| PREDICTED: uronyl 2-sulfotransferase [Pan paniscus]
Length = 406
Score = 130 bits (327), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 92/274 (33%), Positives = 147/274 (53%), Gaps = 16/274 (5%)
Query: 19 PETDSLSWDTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRF 78
P + L + + ++YNRV K GS + V + + K FN++ ++ N L+ +Q
Sbjct: 94 PPSKVLPFPSQVVYNRVGKCGSRTVVLLLRILSEKHGFNLVTSDIH-NKTRLTKNEQMEL 152
Query: 79 VNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGD--N 136
+ N++ +P L+ H F++F +FG +QP++INI+R P++R +S Y+F R+GD
Sbjct: 153 IKNIST--AEQPYLFTRHVHFLNFSRFGG-DQPVYINIIRDPVNRFLSNYFFRRFGDWRG 209
Query: 137 YRPHLVR----KKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALE 192
+ H++R ++ +ECI N ECS ++ +P+ CG C PG WALE
Sbjct: 210 EQNHMIRTPSMRQEERYLDINECILENYPECSNPRLFYIIPYFCGQHPRCREPGE-WALE 268
Query: 193 KAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRR---TNRKID 249
+AK N+ +LLVG+ EEL D + LLE LP +F+G + L T +K
Sbjct: 269 RAKLNVNENFLLVGILEELEDVLLLLERFLPHYFKGVLSIYKDPEHRKLGNMTVTVKKTV 328
Query: 250 PSEETVQQIKKSKIWELENELYEYALEQFHFVKK 283
PS E VQ + + +E E Y Y EQFH +K+
Sbjct: 329 PSPEAVQILYQRMRYEY--EFYHYVKEQFHLLKR 360
>gi|403269741|ref|XP_003926872.1| PREDICTED: uronyl 2-sulfotransferase [Saimiri boliviensis
boliviensis]
Length = 406
Score = 130 bits (327), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 92/274 (33%), Positives = 147/274 (53%), Gaps = 16/274 (5%)
Query: 19 PETDSLSWDTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRF 78
P + L + + ++YNRV K GS + V + + K FN++ ++ N L+ +Q
Sbjct: 94 PPSKVLPFPSQVVYNRVGKCGSRTVVLLLRILSEKHGFNLVTSDIH-NKTRLTKNEQMEL 152
Query: 79 VNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGD--N 136
+ N++ +P L+ H F++F +FG +QP++INI+R P++R +S Y+F R+GD
Sbjct: 153 IKNIST--AEQPYLFTRHVHFLNFSRFGG-DQPVYINIIRDPVNRFLSNYFFRRFGDWRG 209
Query: 137 YRPHLVR----KKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALE 192
+ H++R ++ +ECI N ECS ++ +P+ CG C PG WALE
Sbjct: 210 EQNHMIRTPSMRQEERYLDINECILENYPECSNPRLFYIIPYFCGQHPRCREPGE-WALE 268
Query: 193 KAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRR---TNRKID 249
+AK N+ +LLVG+ EEL D + LLE LP +F+G + L T +K
Sbjct: 269 RAKLNVNENFLLVGILEELEDVLLLLERFLPHYFKGVLSIYKDPEHRKLGNMTVTVKKTV 328
Query: 250 PSEETVQQIKKSKIWELENELYEYALEQFHFVKK 283
PS E VQ + + +E E Y Y EQFH +K+
Sbjct: 329 PSPEAVQILYQRMRYEY--EFYHYVKEQFHLLKR 360
>gi|158259305|dbj|BAF85611.1| unnamed protein product [Homo sapiens]
Length = 406
Score = 130 bits (327), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 92/274 (33%), Positives = 147/274 (53%), Gaps = 16/274 (5%)
Query: 19 PETDSLSWDTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRF 78
P + L + + ++YNRV K GS + V + + K FN++ ++ N L+ +Q
Sbjct: 94 PPSKVLPFPSQVVYNRVGKCGSRTVVLLLRILSEKHGFNLVTSDIH-NKTRLTKNEQMEL 152
Query: 79 VNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGD--N 136
+ N++ +P L+ H F++F +FG +QP++INI+R P++R +S Y+F R+GD
Sbjct: 153 IKNIST--AEQPYLFTRHVHFLNFSRFGG-DQPVYINIIRDPVNRFLSNYFFRRFGDWRG 209
Query: 137 YRPHLVR----KKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALE 192
+ H++R ++ +ECI N ECS ++ +P+ CG C PG WALE
Sbjct: 210 EQNHMIRTPSMRQEERYLDINECILENYPECSNPRLFYIIPYFCGQHPRCREPGE-WALE 268
Query: 193 KAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRR---TNRKID 249
+AK N+ +LLVG+ EEL D + LLE LP +F+G + L T +K
Sbjct: 269 RAKLNVNENFLLVGILEELEDVLLLLERFLPHYFKGVLSIYKDPEHRKLGNMTVTVKKTV 328
Query: 250 PSEETVQQIKKSKIWELENELYEYALEQFHFVKK 283
PS E VQ + + +E E Y Y EQFH +K+
Sbjct: 329 PSPEAVQILYQRMRYEY--EFYHYVKEQFHLLKR 360
>gi|57031732|ref|XP_533443.1| PREDICTED: uronyl 2-sulfotransferase [Canis lupus familiaris]
Length = 407
Score = 130 bits (327), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 92/274 (33%), Positives = 147/274 (53%), Gaps = 16/274 (5%)
Query: 19 PETDSLSWDTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRF 78
P + L + + ++YNRV K GS + V + + K FN++ ++ N L+ +Q
Sbjct: 95 PPSKVLPFPSQVVYNRVGKCGSRTVVLLLRILSEKHGFNLVTSDIH-NKTRLTKNEQMEL 153
Query: 79 VNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGD--N 136
+ N++ +P L+ H F++F +FG +QP++INI+R P++R +S Y+F R+GD
Sbjct: 154 IKNIST--AEQPYLFTRHVHFLNFSRFGG-DQPVYINIIRDPVNRFLSNYFFRRFGDWRG 210
Query: 137 YRPHLVR----KKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALE 192
+ H++R ++ +ECI N ECS ++ +P+ CG C PG WALE
Sbjct: 211 EQNHMIRTPSMRQEERYLDINECILENYPECSNPRLFYIIPYFCGQHPRCREPGE-WALE 269
Query: 193 KAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRR---TNRKID 249
+AK N+ +LLVG+ EEL D + LLE LP +F+G + L T +K
Sbjct: 270 RAKLNVNENFLLVGILEELEDVLLLLERFLPHYFKGVLSIYKDPEHRKLGNMTVTVKKTV 329
Query: 250 PSEETVQQIKKSKIWELENELYEYALEQFHFVKK 283
PS E VQ + + +E E Y Y EQFH +K+
Sbjct: 330 PSPEAVQILYQRMRYEY--EFYHYVKEQFHLLKR 361
>gi|5032219|ref|NP_005706.1| uronyl 2-sulfotransferase [Homo sapiens]
gi|68052988|sp|Q9Y2C2.1|UST_HUMAN RecName: Full=Uronyl 2-sulfotransferase
gi|4803735|dbj|BAA77510.1| dermatan/chondroitin sulfate 2-sulfotransferase [Homo sapiens]
gi|62739433|gb|AAH93668.1| Uronyl-2-sulfotransferase [Homo sapiens]
gi|62739710|gb|AAH93694.1| Uronyl-2-sulfotransferase [Homo sapiens]
gi|119568197|gb|EAW47812.1| uronyl-2-sulfotransferase [Homo sapiens]
gi|189067528|dbj|BAG37723.1| unnamed protein product [Homo sapiens]
Length = 406
Score = 130 bits (327), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 92/274 (33%), Positives = 147/274 (53%), Gaps = 16/274 (5%)
Query: 19 PETDSLSWDTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRF 78
P + L + + ++YNRV K GS + V + + K FN++ ++ N L+ +Q
Sbjct: 94 PPSKVLPFPSQVVYNRVGKCGSRTVVLLLRILSEKHGFNLVTSDIH-NKTRLTKNEQMEL 152
Query: 79 VNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGD--N 136
+ N++ +P L+ H F++F +FG +QP++INI+R P++R +S Y+F R+GD
Sbjct: 153 IKNIST--AEQPYLFTRHVHFLNFSRFGG-DQPVYINIIRDPVNRFLSNYFFRRFGDWRG 209
Query: 137 YRPHLVR----KKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALE 192
+ H++R ++ +ECI N ECS ++ +P+ CG C PG WALE
Sbjct: 210 EQNHMIRTPSMRQEERYLDINECILENYPECSNPRLFYIIPYFCGQHPRCREPGE-WALE 268
Query: 193 KAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRR---TNRKID 249
+AK N+ +LLVG+ EEL D + LLE LP +F+G + L T +K
Sbjct: 269 RAKLNVNENFLLVGILEELEDVLLLLERFLPHYFKGVLSIYKDPEHRKLGNMTVTVKKTV 328
Query: 250 PSEETVQQIKKSKIWELENELYEYALEQFHFVKK 283
PS E VQ + + +E E Y Y EQFH +K+
Sbjct: 329 PSPEAVQILYQRMRYEY--EFYHYVKEQFHLLKR 360
>gi|296199420|ref|XP_002747116.1| PREDICTED: uronyl 2-sulfotransferase [Callithrix jacchus]
Length = 406
Score = 130 bits (326), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 92/274 (33%), Positives = 147/274 (53%), Gaps = 16/274 (5%)
Query: 19 PETDSLSWDTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRF 78
P + L + + ++YNRV K GS + V + + K FN++ ++ N L+ +Q
Sbjct: 94 PPSKVLPFPSQVVYNRVGKCGSRTVVLLLRILSEKHGFNLVTSDIH-NKTRLTKNEQMEL 152
Query: 79 VNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGD--N 136
+ N++ +P L+ H F++F +FG +QP++INI+R P++R +S Y+F R+GD
Sbjct: 153 IKNIST--AEQPYLFTRHVHFLNFSRFGG-DQPVYINIIRDPVNRFLSNYFFRRFGDWRG 209
Query: 137 YRPHLVR----KKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALE 192
+ H++R ++ +ECI N ECS ++ +P+ CG C PG WALE
Sbjct: 210 EQNHMIRTPSMRQEERYLDINECILENYPECSNPRLFYIIPYFCGQHPRCREPGE-WALE 268
Query: 193 KAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRR---TNRKID 249
+AK N+ +LLVG+ EEL D + LLE LP +F+G + L T +K
Sbjct: 269 RAKLNVNENFLLVGILEELEDVLLLLERFLPHYFKGVLSIYKDPEHRKLGNMTVTVKKTV 328
Query: 250 PSEETVQQIKKSKIWELENELYEYALEQFHFVKK 283
PS E VQ + + +E E Y Y EQFH +K+
Sbjct: 329 PSPEAVQILYQRMRYEY--EFYHYVKEQFHLLKR 360
>gi|301770287|ref|XP_002920560.1| PREDICTED: uronyl 2-sulfotransferase-like [Ailuropoda melanoleuca]
Length = 406
Score = 130 bits (326), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 92/274 (33%), Positives = 147/274 (53%), Gaps = 16/274 (5%)
Query: 19 PETDSLSWDTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRF 78
P + L + + ++YNRV K GS + V + + K FN++ ++ N L+ +Q
Sbjct: 94 PPSKVLPFPSQVVYNRVGKCGSRTVVLLLRILSEKHGFNLVTSDIH-NKTRLTKNEQMEL 152
Query: 79 VNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGD--N 136
+ N++ +P L+ H F++F +FG +QP++INI+R P++R +S Y+F R+GD
Sbjct: 153 IKNIST--AEQPYLFTRHVHFLNFSRFGG-DQPVYINIIRDPVNRFLSNYFFRRFGDWRG 209
Query: 137 YRPHLVR----KKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALE 192
+ H++R ++ +ECI N ECS ++ +P+ CG C PG WALE
Sbjct: 210 EQNHMIRTPSMRQEERYLDINECILENYPECSNPRLFYIIPYFCGQHPRCREPGE-WALE 268
Query: 193 KAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRR---TNRKID 249
+AK N+ +LLVG+ EEL D + LLE LP +F+G + L T +K
Sbjct: 269 RAKLNVNENFLLVGILEELEDVLLLLERFLPHYFKGVLSIYKDPEHRKLGNMTVTVKKTV 328
Query: 250 PSEETVQQIKKSKIWELENELYEYALEQFHFVKK 283
PS E VQ + + +E E Y Y EQFH +K+
Sbjct: 329 PSPEAVQILYQRMRYEY--EFYHYVKEQFHLLKR 360
>gi|410041346|ref|XP_003950984.1| PREDICTED: LOW QUALITY PROTEIN: uronyl 2-sulfotransferase, partial
[Pan troglodytes]
Length = 399
Score = 130 bits (326), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 92/274 (33%), Positives = 147/274 (53%), Gaps = 16/274 (5%)
Query: 19 PETDSLSWDTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRF 78
P + L + + ++YNRV K GS + V + + K FN++ ++ N L+ +Q
Sbjct: 87 PPSKVLPFPSQVVYNRVGKCGSRTVVLLLRILSEKHGFNLVTSDIH-NKTRLTKNEQMEL 145
Query: 79 VNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGD--N 136
+ N++ +P L+ H F++F +FG +QP++INI+R P++R +S Y+F R+GD
Sbjct: 146 IKNIST--AEQPYLFTRHVHFLNFSRFGG-DQPVYINIIRDPVNRFLSNYFFRRFGDWRG 202
Query: 137 YRPHLVR----KKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALE 192
+ H++R ++ +ECI N ECS ++ +P+ CG C PG WALE
Sbjct: 203 EQNHMIRTPSMRQEERYLDINECILENYPECSNPRLFYIIPYFCGQHPRCREPGE-WALE 261
Query: 193 KAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRR---TNRKID 249
+AK N+ +LLVG+ EEL D + LLE LP +F+G + L T +K
Sbjct: 262 RAKLNVNENFLLVGILEELEDVLLLLERFLPHYFKGVLSIYKDPEHRKLGNMTVTVKKTV 321
Query: 250 PSEETVQQIKKSKIWELENELYEYALEQFHFVKK 283
PS E VQ + + +E E Y Y EQFH +K+
Sbjct: 322 PSPEAVQILYQRMRYEY--EFYHYVKEQFHLLKR 353
>gi|426354852|ref|XP_004044857.1| PREDICTED: uronyl 2-sulfotransferase [Gorilla gorilla gorilla]
Length = 406
Score = 130 bits (326), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 92/274 (33%), Positives = 147/274 (53%), Gaps = 16/274 (5%)
Query: 19 PETDSLSWDTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRF 78
P + L + + ++YNRV K GS + V + + K FN++ ++ N L+ +Q
Sbjct: 94 PPSKVLPFPSQVVYNRVGKCGSRTVVLLLRILSEKHGFNLVTSDIH-NKTRLTKNEQMEL 152
Query: 79 VNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGD--N 136
+ N++ +P L+ H F++F +FG +QP++INI+R P++R +S Y+F R+GD
Sbjct: 153 IKNIST--AEQPYLFTRHVHFLNFSRFGG-DQPVYINIIRDPVNRFLSNYFFRRFGDWRG 209
Query: 137 YRPHLVR----KKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALE 192
+ H++R ++ +ECI N ECS ++ +P+ CG C PG WALE
Sbjct: 210 EQNHMIRTPSMRQEERYLDINECILENYPECSNPRLFYIIPYFCGQHPRCREPGE-WALE 268
Query: 193 KAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRR---TNRKID 249
+AK N+ +LLVG+ EEL D + LLE LP +F+G + L T +K
Sbjct: 269 RAKLNVNENFLLVGILEELEDVLLLLERFLPHYFKGVLSIYKDPEHRKLGNMTVTVKKTV 328
Query: 250 PSEETVQQIKKSKIWELENELYEYALEQFHFVKK 283
PS E VQ + + +E E Y Y EQFH +K+
Sbjct: 329 PSPEAVQILYQRMRYEY--EFYHYVKEQFHLLKR 360
>gi|383873161|ref|NP_001244442.1| uronyl 2-sulfotransferase [Macaca mulatta]
gi|380788365|gb|AFE66058.1| uronyl 2-sulfotransferase [Macaca mulatta]
Length = 406
Score = 130 bits (326), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 92/274 (33%), Positives = 147/274 (53%), Gaps = 16/274 (5%)
Query: 19 PETDSLSWDTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRF 78
P + L + + ++YNRV K GS + V + + K FN++ ++ N L+ +Q
Sbjct: 94 PPSKVLPFPSQVVYNRVGKCGSRTVVLLLRILSEKHGFNLVTSDIH-NKTRLTKNEQMEL 152
Query: 79 VNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGD--N 136
+ N++ +P L+ H F++F +FG +QP++INI+R P++R +S Y+F R+GD
Sbjct: 153 IKNIST--AEQPYLFTRHVHFLNFSRFGG-DQPVYINIIRDPVNRFLSNYFFRRFGDWRG 209
Query: 137 YRPHLVR----KKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALE 192
+ H++R ++ +ECI N ECS ++ +P+ CG C PG WALE
Sbjct: 210 EQNHMIRTPSMRQEERYLDINECILENYPECSNPRLFYIIPYFCGQHPRCREPGE-WALE 268
Query: 193 KAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRR---TNRKID 249
+AK N+ +LLVG+ EEL D + LLE LP +F+G + L T +K
Sbjct: 269 RAKLNVNENFLLVGILEELEDVLLLLERFLPHYFKGVLSIYKDPEHRKLGNMTVTVKKTV 328
Query: 250 PSEETVQQIKKSKIWELENELYEYALEQFHFVKK 283
PS E VQ + + +E E Y Y EQFH +K+
Sbjct: 329 PSPEAVQILYQRMRYEY--EFYHYVKEQFHLLKR 360
>gi|350578075|ref|XP_003480284.1| PREDICTED: LOW QUALITY PROTEIN: uronyl 2-sulfotransferase-like,
partial [Sus scrofa]
Length = 388
Score = 130 bits (326), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 92/274 (33%), Positives = 147/274 (53%), Gaps = 16/274 (5%)
Query: 19 PETDSLSWDTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRF 78
P + L + + ++YNRV K GS + V + + K FN++ ++ N L+ +Q
Sbjct: 76 PPSKVLPFPSQVVYNRVGKCGSRTVVLLLRILSEKHGFNLVTSDIH-NKTRLTKNEQMEL 134
Query: 79 VNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGD--N 136
+ N++ +P L+ H F++F +FG +QP++INI+R P++R +S Y+F R+GD
Sbjct: 135 IKNIST--AEQPYLFTRHVHFLNFSRFGG-DQPVYINIIRDPVNRFLSNYFFRRFGDWRG 191
Query: 137 YRPHLVR----KKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALE 192
+ H++R ++ +ECI N ECS ++ +P+ CG C PG WALE
Sbjct: 192 EQNHMIRTPSMRQEERYLDINECILENYPECSNPRLFYIIPYFCGQHPRCREPGE-WALE 250
Query: 193 KAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRR---TNRKID 249
+AK N+ +LLVG+ EEL D + LLE LP +F+G + L T +K
Sbjct: 251 RAKLNVNENFLLVGILEELEDVLLLLERFLPHYFKGVLSIYKDPEHRKLGNMTVTVKKTV 310
Query: 250 PSEETVQQIKKSKIWELENELYEYALEQFHFVKK 283
PS E VQ + + +E E Y Y EQFH +K+
Sbjct: 311 PSPEAVQILYQRMRYEY--EFYHYVKEQFHLLKR 342
>gi|354473545|ref|XP_003498995.1| PREDICTED: uronyl 2-sulfotransferase [Cricetulus griseus]
gi|344241060|gb|EGV97163.1| Uronyl 2-sulfotransferase [Cricetulus griseus]
Length = 407
Score = 129 bits (325), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 92/274 (33%), Positives = 146/274 (53%), Gaps = 16/274 (5%)
Query: 19 PETDSLSWDTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRF 78
P + L + + ++YNRV K GS + V + + K FN++ ++ N L+ +Q
Sbjct: 95 PPSKVLPFPSQVVYNRVGKCGSRTVVLLLRILSEKHGFNLVTSDIH-NKTRLTKNEQMEL 153
Query: 79 VNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGD--N 136
+ N++ +P L+ H F++F +FG +QP++INI+R P+ R +S Y+F R+GD
Sbjct: 154 IKNIST--AEQPYLFTRHVHFLNFSRFGG-DQPVYINIIRDPVSRFLSNYFFRRFGDWRG 210
Query: 137 YRPHLVR----KKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALE 192
+ H++R ++ +ECI N ECS ++ +P+ CG C PG WALE
Sbjct: 211 EQNHMIRTPSMRQEERYLDINECILENYPECSNPRLFYIIPYFCGQHPRCREPGE-WALE 269
Query: 193 KAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRR---TNRKID 249
+AK N+ +LLVG+ EEL D + LLE LP +F+G + L T +K
Sbjct: 270 RAKLNVNENFLLVGILEELEDVLLLLERFLPHYFKGVLSIYKDPEHRKLGNMTVTVKKTV 329
Query: 250 PSEETVQQIKKSKIWELENELYEYALEQFHFVKK 283
PS E VQ + + +E E Y Y EQFH +K+
Sbjct: 330 PSPEAVQILYQRMRYEY--EFYHYVKEQFHLLKR 361
>gi|260791150|ref|XP_002590603.1| hypothetical protein BRAFLDRAFT_123612 [Branchiostoma floridae]
gi|229275798|gb|EEN46614.1| hypothetical protein BRAFLDRAFT_123612 [Branchiostoma floridae]
Length = 566
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 94/295 (31%), Positives = 140/295 (47%), Gaps = 47/295 (15%)
Query: 30 IIYNRVPKTGSTSFVNMAYDMCRKKRFN------VLHV---------------------- 61
++YNRV K GS S + + + K FN V+H
Sbjct: 75 LVYNRVGKCGSRSLITILSVLSSKNGFNFAKDPSVVHSQTRFPLPDQSKSLLLVWYLAKV 134
Query: 62 -NVTGNNHVLS--------LADQYRFVNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPL 112
NV +LS L Q V +V K P Y+ HF FIDF +FG++ QP+
Sbjct: 135 QNVKPLWELLSEVLILMRTLVTQIALVQHVDKISP--PFFYNRHFHFIDFTRFGAR-QPI 191
Query: 113 FINILRKPLDRLVSYYYFLRYGDNYRPHLVR--KKHGDKTTFDECIRLNRTECSLENMWL 170
+IN++R P DRLVS YYF R+GD R K+ +FD C+ + EC+ +
Sbjct: 192 YINMIRDPFDRLVSSYYFKRFGDGRSDDRGRYLKEEDKLRSFDACVLEEQVECT-RGLHY 250
Query: 171 QVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGT 230
+PF CG C P WALE+AK+N++ K+L+VG+ EE D + + E LP+FF+G
Sbjct: 251 IIPFFCGQRPGCRDPSR-WALERAKDNVLDKFLVVGILEEFNDTLRVFEHLLPNFFKGAM 309
Query: 231 DHFLTSNK--SHLRRTNRKI-DPSEETVQQIKKSKIWELENELYEYALEQFHFVK 282
+ + S L T++ + P + K + +LE E Y + + FH +K
Sbjct: 310 SVWENPPQWVSQLYNTSKTVKKPQPSPFIRDKMRRRMKLEYEFYYFVRDIFHNLK 364
>gi|351707516|gb|EHB10435.1| Uronyl 2-sulfotransferase, partial [Heterocephalus glaber]
Length = 326
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 91/274 (33%), Positives = 146/274 (53%), Gaps = 16/274 (5%)
Query: 19 PETDSLSWDTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRF 78
P + L + + ++YNRV K GS + V + + K FN++ ++ N L+ +Q
Sbjct: 14 PPSKVLPFPSQVVYNRVGKCGSRTVVLLLRILSEKHGFNLVTSDIH-NKTRLTKNEQMEL 72
Query: 79 VNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGD--N 136
+ N++ +P L+ H F++F +FG +QP++INI+R P+ R +S Y+F R+GD
Sbjct: 73 IKNIST--AEQPYLFTRHVHFLNFSRFGG-DQPVYINIIRDPVSRFLSNYFFRRFGDWRG 129
Query: 137 YRPHLVR----KKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALE 192
+ H++R ++ +ECI N ECS ++ +P+ CG C PG WALE
Sbjct: 130 EQNHMIRTPSMRQEERYLDINECILENYPECSNPRLFYIIPYFCGQHPRCREPGE-WALE 188
Query: 193 KAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRR---TNRKID 249
+AK N+ +LLVG+ EEL D + LLE LP +F+G + L T +K
Sbjct: 189 RAKLNVNENFLLVGILEELEDVLLLLERFLPHYFKGVLSIYKDPEHRKLGNMTVTVKKTV 248
Query: 250 PSEETVQQIKKSKIWELENELYEYALEQFHFVKK 283
PS E VQ + + +E E Y Y +QFH +K+
Sbjct: 249 PSPEAVQILYQRMRYEY--EFYHYVKQQFHLLKR 280
>gi|348561187|ref|XP_003466394.1| PREDICTED: uronyl 2-sulfotransferase-like [Cavia porcellus]
Length = 375
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 91/272 (33%), Positives = 145/272 (53%), Gaps = 16/272 (5%)
Query: 21 TDSLSWDTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVN 80
+ L + + ++YNRV K GS + V + + K FN++ ++ N L+ +Q +
Sbjct: 65 SSVLPFPSQVVYNRVGKCGSRTVVLLLRILSEKHGFNLVTSDI-HNKTRLTKNEQMELIK 123
Query: 81 NVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGD--NYR 138
N++ +P L+ H F++F +FG +QP++INI+R P+ R +S Y+F R+GD +
Sbjct: 124 NIST--AEQPYLFTRHVHFLNFSRFGG-DQPVYINIIRDPVSRFLSNYFFRRFGDWRGEQ 180
Query: 139 PHLVR----KKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKA 194
H++R ++ +ECI N ECS ++ +P+ CG C PG WALE+A
Sbjct: 181 NHMIRTPSMRQEERYLDINECILENYPECSNPRLFYIIPYFCGQHPRCREPGE-WALERA 239
Query: 195 KENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRR---TNRKIDPS 251
K N+ +LLVG+ EEL D + LLE LP +F+G + L T +K PS
Sbjct: 240 KLNVNENFLLVGILEELEDVLLLLERFLPHYFKGVLSIYKDPEHRKLGNMTVTVKKTVPS 299
Query: 252 EETVQQIKKSKIWELENELYEYALEQFHFVKK 283
E VQ + + +E E Y Y EQFH +K+
Sbjct: 300 PEAVQILYQRMKYEY--EFYHYVREQFHLLKR 329
>gi|355728173|gb|AES09440.1| uronyl-2-sulfotransferase [Mustela putorius furo]
Length = 337
Score = 129 bits (323), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 91/274 (33%), Positives = 146/274 (53%), Gaps = 16/274 (5%)
Query: 19 PETDSLSWDTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRF 78
P + L + + ++YNRV K GS + V + + K FN++ ++ N L+ +Q
Sbjct: 26 PPSKVLPFPSQVVYNRVGKCGSRTVVLLLRILSEKHGFNLVTSDIH-NKTRLTKNEQMEL 84
Query: 79 VNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGD--N 136
+ N++ +P L+ H F++F +FG +QP++INI+R P+ R +S Y+F R+GD
Sbjct: 85 IKNIST--AEQPYLFTRHVHFLNFSRFGG-DQPVYINIIRDPVSRFLSNYFFRRFGDWRG 141
Query: 137 YRPHLVR----KKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALE 192
+ H++R ++ +ECI N ECS ++ +P+ CG C PG WAL+
Sbjct: 142 EQNHMIRTPSMRQEERYLDINECILENYPECSNPRLFYIIPYFCGQHPRCREPGE-WALQ 200
Query: 193 KAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRR---TNRKID 249
+AK N+ +LLVG+ EEL D + LLE LP +F+G + L T +K
Sbjct: 201 RAKLNVNENFLLVGILEELEDVLLLLERFLPHYFKGVLSIYKDPEHRKLGNMTVTVKKTV 260
Query: 250 PSEETVQQIKKSKIWELENELYEYALEQFHFVKK 283
PS E VQ + + +E E Y Y EQFH +K+
Sbjct: 261 PSPEAVQILYQRMRYEY--EFYHYVKEQFHLLKR 292
>gi|281354213|gb|EFB29797.1| hypothetical protein PANDA_009304 [Ailuropoda melanoleuca]
Length = 311
Score = 129 bits (323), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 91/272 (33%), Positives = 146/272 (53%), Gaps = 16/272 (5%)
Query: 21 TDSLSWDTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVN 80
+ L + + ++YNRV K GS + V + + K FN++ ++ N L+ +Q +
Sbjct: 1 SQVLPFPSQVVYNRVGKCGSRTVVLLLRILSEKHGFNLVTSDI-HNKTRLTKNEQMELIK 59
Query: 81 NVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGD--NYR 138
N++ +P L+ H F++F +FG +QP++INI+R P++R +S Y+F R+GD +
Sbjct: 60 NIST--AEQPYLFTRHVHFLNFSRFGG-DQPVYINIIRDPVNRFLSNYFFRRFGDWRGEQ 116
Query: 139 PHLVR----KKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKA 194
H++R ++ +ECI N ECS ++ +P+ CG C PG WALE+A
Sbjct: 117 NHMIRTPSMRQEERYLDINECILENYPECSNPRLFYIIPYFCGQHPRCREPGE-WALERA 175
Query: 195 KENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRR---TNRKIDPS 251
K N+ +LLVG+ EEL D + LLE LP +F+G + L T +K PS
Sbjct: 176 KLNVNENFLLVGILEELEDVLLLLERFLPHYFKGVLSIYKDPEHRKLGNMTVTVKKTVPS 235
Query: 252 EETVQQIKKSKIWELENELYEYALEQFHFVKK 283
E VQ + + +E E Y Y EQFH +K+
Sbjct: 236 PEAVQILYQRMRYEY--EFYHYVKEQFHLLKR 265
>gi|395834648|ref|XP_003790307.1| PREDICTED: uronyl 2-sulfotransferase [Otolemur garnettii]
Length = 406
Score = 129 bits (323), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 92/274 (33%), Positives = 147/274 (53%), Gaps = 16/274 (5%)
Query: 19 PETDSLSWDTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRF 78
P + L + + ++YNRV K GS + V + + K FN++ ++ N L+ +Q
Sbjct: 94 PPSKVLPFPSQVVYNRVGKCGSRTVVLLLRILSEKHGFNLVTSDIH-NKTRLTKNEQMEL 152
Query: 79 VNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGD--N 136
+ N++ +P L+ H F++F +FG +QP++INI+R P++R +S Y+F R+GD
Sbjct: 153 IKNIST--AEQPYLFTRHVHFLNFSRFGG-DQPVYINIIRDPVNRFLSNYFFRRFGDWRG 209
Query: 137 YRPHLVR----KKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALE 192
+ H++R ++ +ECI N ECS ++ +P+ CG C PG WALE
Sbjct: 210 EQNHMIRTPSMRQEERYLDINECILENYPECSNPRLFYIIPYFCGQHPRCREPGE-WALE 268
Query: 193 KAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRR---TNRKID 249
+AK N+ +LLVG+ EEL D + LLE LP +F+G + L T +K
Sbjct: 269 RAKLNVNENFLLVGILEELEDVLLLLERFLPHYFKGVLSIYKDPEHRKLGNMTVTVKKTV 328
Query: 250 PSEETVQQIKKSKIWELENELYEYALEQFHFVKK 283
PS E VQ + + +E E Y Y EQFH +K+
Sbjct: 329 PSPEAVQILYQRMRYEY--EFYHYIREQFHRLKR 360
>gi|338722961|ref|XP_001915801.2| PREDICTED: uronyl 2-sulfotransferase-like [Equus caballus]
Length = 435
Score = 128 bits (322), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 91/271 (33%), Positives = 146/271 (53%), Gaps = 16/271 (5%)
Query: 22 DSLSWDTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNN 81
+ L + + ++YNRV K GS + V + + K FN++ ++ N L+ +Q + N
Sbjct: 94 EVLPFPSQVVYNRVGKCGSRTVVLLLRILSEKHGFNLVTSDI-HNKTRLTKNEQMELIKN 152
Query: 82 VTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGD--NYRP 139
++ +P L+ H F++F +FG +QP++INI+R P++R +S Y+F R+GD +
Sbjct: 153 IST--AEQPYLFTRHVHFLNFSRFGG-DQPVYINIIRDPVNRFLSNYFFRRFGDWRGEQN 209
Query: 140 HLVR----KKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAK 195
H++R ++ +ECI N ECS ++ +P+ CG C PG WALE+AK
Sbjct: 210 HMIRTPSMRQEERYLDINECILENYPECSNPRLFYIIPYFCGQHPRCREPGE-WALERAK 268
Query: 196 ENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRR---TNRKIDPSE 252
N+ +LLVG+ EEL D + LLE LP +F+G + L T +K PS
Sbjct: 269 LNVNENFLLVGILEELEDVLLLLERFLPHYFKGVLSIYKDPEHRKLGNMTVTVKKTVPSP 328
Query: 253 ETVQQIKKSKIWELENELYEYALEQFHFVKK 283
E VQ + + +E E Y Y EQFH +K+
Sbjct: 329 EAVQILYQRMRYEY--EFYHYVKEQFHLLKR 357
>gi|390358645|ref|XP_798108.3| PREDICTED: uronyl 2-sulfotransferase-like [Strongylocentrotus
purpuratus]
Length = 468
Score = 127 bits (320), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 88/273 (32%), Positives = 135/273 (49%), Gaps = 21/273 (7%)
Query: 19 PETDSLSWDTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRF 78
PET + I YNRV K GS S + + + K +F H+ + + L +Y
Sbjct: 204 PETQT---GAAIFYNRVGKCGSRSVIAVLRLLALKNQF---HLVSSLTYNATRLVPEYEK 257
Query: 79 VNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGD--N 136
+ + ++P L+ H FIDF+++G K QP +INI+R PL R+VS+YYF R+GD +
Sbjct: 258 MMVTVLSQIQKPYLFQRHVYFIDFRRYGVK-QPKYINIIRDPLSRMVSHYYFQRFGDGKS 316
Query: 137 YRPHLVRKKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKE 196
R + + K+ TFD C+ + EC + +P+ CG C P WAL +AK+
Sbjct: 317 SRNYSGKDKY---QTFDSCVLNQKKECFGGRTFYIIPYFCGQDPRCRDPST-WALNEAKQ 372
Query: 197 NLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRR------TNRKIDP 250
N+ Y+ VG+ EEL D + E LP F G D + + + T K+ P
Sbjct: 373 NVQDHYVAVGLLEELEDTFRVFEKVLPDAFDGVLDIYRNIVSGEVGKNLSVMVTKHKVQP 432
Query: 251 SEETVQQIKKSKIWELENELYEYALEQFHFVKK 283
S E + +K LE YE+ +FH +K+
Sbjct: 433 SPEVARIMK--DYMRLEYIFYEFIKTRFHTLKR 463
>gi|291412440|ref|XP_002722488.1| PREDICTED: uronyl-2-sulfotransferase, partial [Oryctolagus
cuniculus]
Length = 309
Score = 127 bits (320), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 91/269 (33%), Positives = 144/269 (53%), Gaps = 16/269 (5%)
Query: 24 LSWDTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVT 83
L + + ++YNRV K GS + V + + K FN++ ++ N L+ +Q + N++
Sbjct: 2 LPFPSQVVYNRVGKCGSRTVVLLLRILSEKHGFNLVTSDI-HNKTRLTKNEQMELIKNIS 60
Query: 84 KWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGD--NYRPHL 141
+P L+ H F++F +FG +QP++INI+R P+ R +S Y+F R+GD + H+
Sbjct: 61 T--AEQPYLFTRHVHFLNFSRFGG-DQPVYINIIRDPVSRFLSNYFFRRFGDWRGEQNHM 117
Query: 142 VR----KKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKEN 197
+R ++ +ECI N ECS ++ +P+ CG C PG WALE+AK N
Sbjct: 118 IRTPSMRQEERYLDINECILENYPECSNPRLFYIIPYFCGQHPRCREPGE-WALERAKLN 176
Query: 198 LVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRR---TNRKIDPSEET 254
+ +LLVG+ EEL D + LLE LP +F+G + L T +K PS E
Sbjct: 177 VNENFLLVGILEELEDVLLLLERFLPHYFKGVLSIYKDPEHRKLGNMTVTVKKTVPSPEA 236
Query: 255 VQQIKKSKIWELENELYEYALEQFHFVKK 283
VQ + + +E E Y Y EQFH +K+
Sbjct: 237 VQILYQRMRYEY--EFYHYIKEQFHLLKR 263
>gi|383414887|gb|AFH30657.1| uronyl 2-sulfotransferase [Macaca mulatta]
gi|383414889|gb|AFH30658.1| uronyl 2-sulfotransferase [Macaca mulatta]
gi|383414891|gb|AFH30659.1| uronyl 2-sulfotransferase [Macaca mulatta]
Length = 406
Score = 127 bits (320), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 91/274 (33%), Positives = 146/274 (53%), Gaps = 16/274 (5%)
Query: 19 PETDSLSWDTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRF 78
P + L + + ++YN V K GS + V + + K FN++ ++ N L+ +Q
Sbjct: 94 PPSKVLPFPSQVVYNSVGKCGSRTVVLLLRILSEKHGFNLVTSDIH-NKTRLTKNEQMEL 152
Query: 79 VNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGD--N 136
+ N++ +P L+ H F++F +FG +QP++INI+R P++R +S Y+F R+GD
Sbjct: 153 IKNIST--AEQPYLFTRHVHFLNFSRFGG-DQPVYINIIRDPVNRFLSNYFFRRFGDWRG 209
Query: 137 YRPHLVR----KKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALE 192
+ H++R ++ +ECI N ECS ++ +P+ CG C PG WALE
Sbjct: 210 EQNHMIRTPSMRQEERYLDINECILENYPECSNPRLFYIIPYFCGQHPRCREPGE-WALE 268
Query: 193 KAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRR---TNRKID 249
+AK N+ +LLVG+ EEL D + LLE LP +F+G + L T +K
Sbjct: 269 RAKLNVNENFLLVGILEELEDVLLLLERFLPHYFKGVLSIYKDPEHRKLGNMTVTVKKTV 328
Query: 250 PSEETVQQIKKSKIWELENELYEYALEQFHFVKK 283
PS E VQ + + +E E Y Y EQFH +K+
Sbjct: 329 PSPEAVQILYQRMRYEY--EFYHYVKEQFHLLKR 360
>gi|157823625|ref|NP_001101928.1| uronyl 2-sulfotransferase [Rattus norvegicus]
gi|149039533|gb|EDL93695.1| uronyl-2-sulfotransferase (predicted) [Rattus norvegicus]
Length = 408
Score = 127 bits (318), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 91/274 (33%), Positives = 145/274 (52%), Gaps = 16/274 (5%)
Query: 19 PETDSLSWDTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRF 78
P + L + + ++YNRV K GS + V + + K F ++ ++ N L+ +Q
Sbjct: 95 PPSKVLPFPSQVVYNRVGKCGSRTVVLLLRILSEKHGFKLVTSDIH-NKTRLTKNEQMEL 153
Query: 79 VNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGD--N 136
+ N++ +P L+ H F++F +FG +QP++INI+R P+ R +S Y+F R+GD
Sbjct: 154 IKNISTVE--QPYLFTRHVHFLNFSRFGG-DQPVYINIIRDPVSRFLSNYFFRRFGDWRG 210
Query: 137 YRPHLVR----KKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALE 192
+ H++R ++ +ECI N ECS ++ +P+ CG C PG WALE
Sbjct: 211 EQNHMIRTPSMRQEERYLDINECILENYPECSNPRLFYIIPYFCGQHPRCREPGE-WALE 269
Query: 193 KAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRR---TNRKID 249
+AK N+ +LLVG+ EEL D + LLE LP +F+G + L T +K
Sbjct: 270 RAKLNVNENFLLVGILEELEDVLLLLERFLPHYFKGVLSIYKDPEHRKLGNMTVTVKKTV 329
Query: 250 PSEETVQQIKKSKIWELENELYEYALEQFHFVKK 283
PS E VQ + + +E E Y Y EQFH +K+
Sbjct: 330 PSPEAVQILYQRMRYEY--EFYHYVKEQFHLLKR 361
>gi|301613624|ref|XP_002936303.1| PREDICTED: uronyl 2-sulfotransferase-like [Xenopus (Silurana)
tropicalis]
Length = 406
Score = 127 bits (318), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 92/270 (34%), Positives = 145/270 (53%), Gaps = 20/270 (7%)
Query: 24 LSWDTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVT 83
L + + ++YNRV K GS + V + + K FN++ ++ N L+ +Q + N++
Sbjct: 97 LPFPSQVVYNRVGKCGSRTIVLLLRILSEKHGFNLVTSDIH-NKTRLTRNEQMELIKNIS 155
Query: 84 KWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGD--NYRPHL 141
+P L+ H F++F +FG+ EQP++INI+R P+ R +S Y+F R+GD + H+
Sbjct: 156 T--ADQPYLFTRHVHFLNFSRFGA-EQPVYINIVRDPVSRFLSNYFFRRFGDWRGEQNHM 212
Query: 142 VR----KKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKEN 197
+R K+ +ECI N ECS ++ +P+ CG C PG WALE+AK+N
Sbjct: 213 IRTPGMKEEERNLDINECILENYAECSNPRLFYIIPYFCGQHPKCRDPGE-WALERAKQN 271
Query: 198 LVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSH-----LRRTNRKIDPSE 252
+ +LLVG+ EEL D + LLE LP +F+ + N H L T +K PS
Sbjct: 272 VNENFLLVGILEELEDVLLLLERFLPHYFKDVMS--IYKNPEHRKLGNLTVTVKKTFPSP 329
Query: 253 ETVQQIKKSKIWELENELYEYALEQFHFVK 282
+Q + + +E E Y Y EQFH +K
Sbjct: 330 AALQVLYQRMKYEY--EFYYYVKEQFHLLK 357
>gi|269785061|ref|NP_001161486.1| uronyl-2-sulfotransferase [Saccoglossus kowalevskii]
gi|268054385|gb|ACY92679.1| uronyl-2-sulfotransfase [Saccoglossus kowalevskii]
Length = 397
Score = 127 bits (318), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 81/260 (31%), Positives = 133/260 (51%), Gaps = 18/260 (6%)
Query: 31 IYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWRDRRP 90
+YNRV K GS S + + K + + + N L++ +Q V+ + + P
Sbjct: 142 VYNRVDKCGSRSLLYTIDQLTVKHGYEKV-TSAEYNKKNLTVEEQKELVDEINGLKP--P 198
Query: 91 ALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKHGD-- 148
+Y H F FQ+FGSK P++ N++R PL R +S YYF R+GD PH +
Sbjct: 199 FIYDRHVYFTPFQRFGSK-NPVWFNLIRDPLRRFISLYYFKRFGD---PHTAAGEFNMPE 254
Query: 149 ---KTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLV 205
+FD+C+ N ECS + + VP+ CGH C +P N WA+E+AK+N+V Y LV
Sbjct: 255 EIVNRSFDDCVLNNVFECSTKASFRVVPYFCGHGEGCRMP-NKWAVEQAKKNVVENYALV 313
Query: 206 GVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLR---RTNRKIDPSEETVQQIKKSK 262
GV EE + +L+ +P +F G + + + + ++ PSEE + ++K+
Sbjct: 314 GVLEEFNTTLKVLDKLVPQYFEGAMAAYESVQSTGIVEKFKSYAGKPPSEEAMAKMKER- 372
Query: 263 IWELENELYEYALEQFHFVK 282
LE + Y + + + H +K
Sbjct: 373 -LGLEYDFYNFVVARQHALK 391
>gi|291241493|ref|XP_002740644.1| PREDICTED: uronyl-2-sulfotransferase-like [Saccoglossus
kowalevskii]
Length = 353
Score = 126 bits (317), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 81/243 (33%), Positives = 132/243 (54%), Gaps = 19/243 (7%)
Query: 28 TVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHV-NVTGNNHVLSLADQYRFVNNVTKWR 86
T +IYNRV K GS S + + R RFN H+ ++ + S Q + V +++
Sbjct: 93 TRLIYNRVGKCGSRSLQYVNDILSRWNRFN--HIKSMDFHQKRFSDEQQMQVVKEISELE 150
Query: 87 DRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKH 146
P +Y H F+DF +FG P +IN++R PL+R++S++Y+ RYGD ++ KK
Sbjct: 151 --APFIYDRHVNFLDFTKFGHSRVP-YINLVRDPLERVMSFFYYKRYGD-----MIDKKE 202
Query: 147 GDKT---TFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYL 203
D + TF++C+ +ECS VPF CGH+ C P WALE A N++ Y+
Sbjct: 203 LDPSLNNTFEDCVLKEMSECSSIGPNRMVPFFCGHSEECMKPTR-WALETALRNIIENYV 261
Query: 204 LVGVTEELTDFVSLLEAALPSFFRGGTDHF-LTSNKSHLR--RTNRKIDPSEETVQQIKK 260
VG E+ + +LE +P FF G +D + L ++ H+ +T K + SEE +++I +
Sbjct: 262 FVGTIEDFPTTLYILEFIMPQFFTGASDSYKLVQSRGHVDAFKTAYKEEASEE-MKEIMR 320
Query: 261 SKI 263
++
Sbjct: 321 ERL 323
>gi|443719469|gb|ELU09623.1| hypothetical protein CAPTEDRAFT_189619, partial [Capitella teleta]
Length = 308
Score = 126 bits (316), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 72/219 (32%), Positives = 117/219 (53%), Gaps = 20/219 (9%)
Query: 30 IIYNRVPKTGSTSFVNMAYDMCRKKRFN-------VLHVNVTGNNHVLSLADQYRFVNNV 82
I YNR+PK GS + + ++ +K + + + G+ + + + ++++
Sbjct: 91 IFYNRMPKCGSEMTMTLLRNIAKKNHWTYPPENKRIWTITKQGHQYWVDNEKVLKEISHL 150
Query: 83 TKWRD-RRPALYHGHFGFIDFQQFGSKEQ-PLFINILRKPLDRLVSYYYFLRYGDNYRPH 140
T D ++P L+ H + DF + SK+ P +INI+R P DRL+S YY+ R+ P
Sbjct: 151 TMTIDHKKPMLFSCHLYYTDFSRLWSKKSLPTYINIIRNPQDRLISLYYYFRF----HPV 206
Query: 141 LVRKKHG---DKTTFDECIRLNRTECSL---ENMWLQVPFLCGHAAACWVPGNPWALEKA 194
RKK K T+D C+R+ EC+ E W VP+ CGH C P ALE+A
Sbjct: 207 QRRKKMPIALRKMTYDTCVRMELPECTAPNPEGFWTMVPYFCGHDPICRTPSRD-ALERA 265
Query: 195 KENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHF 233
K N+V+ Y ++G+TE++ F +LE +P +FRG TD +
Sbjct: 266 KANIVSNYAVIGLTEDMETFTRVLENVIPKYFRGMTDLY 304
>gi|198429201|ref|XP_002124810.1| PREDICTED: similar to uronyl-2-sulfotransferase [Ciona
intestinalis]
Length = 334
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 71/212 (33%), Positives = 109/212 (51%), Gaps = 25/212 (11%)
Query: 30 IIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTG--NNHVLSLADQYRFVNNVTKWRD 87
++YNRV K GS S N+ ++ + FN N++ +++ L + + ++
Sbjct: 72 LVYNRVGKCGSRSMHNIISELSKINHFNFFPSNISNVTRPNIMHLKTEVELIQSL----- 126
Query: 88 RRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKHG 147
+RP LY H +I+FQ+FG + PL+IN++R P++R S YYF RYGD VR HG
Sbjct: 127 KRPMLYSRHIHYINFQKFG-QSPPLYINMIRDPIERFQSQYYFKRYGD------VRSAHG 179
Query: 148 DKT----------TFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKEN 197
K + EC+ N ECS +W VP+ CG C P P +L +AK +
Sbjct: 180 RKVKPWKRGELEMSISECVLSNHYECSTSKLWYIVPYFCGQDLMCKHP-TPESLNRAKRH 238
Query: 198 LVTKYLLVGVTEELTDFVSLLEAALPSFFRGG 229
L+ YL VG+ E+ + + E LP F+G
Sbjct: 239 LIENYLAVGLLEDFESSLLVFEKLLPHHFKGA 270
>gi|410916647|ref|XP_003971798.1| PREDICTED: uronyl 2-sulfotransferase-like [Takifugu rubripes]
Length = 403
Score = 125 bits (315), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 95/274 (34%), Positives = 146/274 (53%), Gaps = 16/274 (5%)
Query: 19 PETDSLSWDTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRF 78
P L + + ++YNRV K GS + V + + K +FN++ ++ N L+ +Q
Sbjct: 88 PVPRVLPFPSQVVYNRVGKCGSRTVVILLRLLAEKHQFNLVSSDIH-NKTRLTKHEQVDL 146
Query: 79 VNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGD--N 136
+ N++ +P LY H F++F +F EQP++INI+R P+ R +S Y+F R+GD
Sbjct: 147 MRNISGIP--QPFLYTRHVHFLNFTRF-RIEQPVYINIIRDPISRFLSNYFFRRFGDWRG 203
Query: 137 YRPHLVR----KKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALE 192
+ HL+R K + CI N ECS ++ +P+ CG C PG WALE
Sbjct: 204 EQNHLIRTPGMKDEERYLDINVCILENYPECSNPRVFYIIPYFCGQHPQCREPGV-WALE 262
Query: 193 KAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSN---KSHLRRTNRKID 249
+AK+N++ YLLVG+ EEL D + LLE LP +F G + + T + +L T RK
Sbjct: 263 RAKQNVLENYLLVGILEELEDVLLLLERLLPHYFSGVLNIYKTPDYKKMGNLTGTVRKHT 322
Query: 250 PSEETVQQIKKSKIWELENELYEYALEQFHFVKK 283
PS E +Q + +E E Y + +QFH KK
Sbjct: 323 PSLEALQVLYHRMRYEY--EFYNFIRDQFHLTKK 354
>gi|291233615|ref|XP_002736749.1| PREDICTED: uronyl-2-sulfotransferase-like [Saccoglossus
kowalevskii]
Length = 471
Score = 125 bits (314), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 80/271 (29%), Positives = 134/271 (49%), Gaps = 30/271 (11%)
Query: 27 DTVIIYNRVPKTGSTSFVNMA--------YDMCRKKRFNVLHVNVTGNNHVLSLADQYRF 78
D ++YNRV K GS + + + Y + +N+ +V T + S+ +
Sbjct: 214 DIRLVYNRVDKCGSQTLLAVIGILSFDNDYRSIWSRVWNIYNVTETRQKWIASIIN---- 269
Query: 79 VNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYR 138
+ + P +++ H +++ +FG ++ P +INI+R PL R +S YYF R+GDN +
Sbjct: 270 -------KQKPPYIFNRHLHYLNLSKFGFEQIP-YINIIRDPLPRFISRYYFKRFGDNLQ 321
Query: 139 PHLVRKKHGDKT-TFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKEN 197
P+ G++T TFDEC+ N+ EC E + +P+ CG C VP WALE AK
Sbjct: 322 PN--EDFQGNRTQTFDECVFENKPECMEEMAFQMIPYFCGQEPECRVPSR-WALEMAKNR 378
Query: 198 LVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTS----NKSHLRRTNRKIDPSEE 253
+V+ Y+ VGV E+ + + E LP FF G + T K ++ K PS
Sbjct: 379 VVSDYVFVGVLEDFEQSLRIFEILLPQFFESGLKVYKTMIFGLKKYEEFQSPAKFTPSTM 438
Query: 254 TVQQIKKSKIWELENELYEYALEQFHFVKKH 284
++ + S+ LE E Y++ + +K+
Sbjct: 439 ALEIM--SQRLALEYEFYDFVRARMELIKEQ 467
>gi|348524857|ref|XP_003449939.1| PREDICTED: uronyl 2-sulfotransferase-like [Oreochromis niloticus]
Length = 403
Score = 125 bits (314), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 93/274 (33%), Positives = 147/274 (53%), Gaps = 16/274 (5%)
Query: 19 PETDSLSWDTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRF 78
P L + + +IYNRV K GS + V + + K +FN++ ++ N L+ +Q
Sbjct: 88 PAPRVLPFPSQVIYNRVGKCGSRTVVILLRLLAEKHQFNLVSSDIH-NKTRLTKHEQVDL 146
Query: 79 VNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGD--N 136
+ N++ +P LY H F++F +F EQP++INI+R P++R +S Y+F R+GD
Sbjct: 147 MKNISSIP--QPFLYTRHVHFLNFTRF-KIEQPVYINIIRDPINRFLSNYFFRRFGDWRG 203
Query: 137 YRPHLVR----KKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALE 192
+ HL+R K + CI N ECS ++ +P+ CG C PG WALE
Sbjct: 204 EQNHLIRTPGMKDDERYLDINVCILENYPECSNPRVFYIIPYFCGQHPQCREPG-VWALE 262
Query: 193 KAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSN---KSHLRRTNRKID 249
+AK+N++ YLLVG+ EEL D + LLE LP +F G + + + + ++ T RK
Sbjct: 263 RAKQNVLENYLLVGILEELEDVLLLLERLLPHYFTGVLNIYKSPDYKKMGNMTGTVRKQT 322
Query: 250 PSEETVQQIKKSKIWELENELYEYALEQFHFVKK 283
P+ E +Q + +E E Y + +QFH KK
Sbjct: 323 PTLEALQVLYHRMRYEY--EFYNFVRDQFHLTKK 354
>gi|443733475|gb|ELU17830.1| hypothetical protein CAPTEDRAFT_106349 [Capitella teleta]
Length = 204
Score = 125 bits (313), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 65/191 (34%), Positives = 112/191 (58%), Gaps = 13/191 (6%)
Query: 96 HFGFIDFQQFGSKEQ-PLFINILRKPLDRLVSYYYFLRYGDNY-RPHLVRKKHGDKTTFD 153
H + DF ++G + P +INI+R P +RL+S+YYF+R+ N+ RP K++ ++D
Sbjct: 5 HLYYTDFARYGCRSTLPTYINIIRNPQERLISFYYFIRFFPNHQRPMSDAKRN---MSYD 61
Query: 154 ECIRLNRTECS---LENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVGVTEE 210
EC+R N EC+ + W +P+ CGH C P ALE+AK N++ Y +VG+TE+
Sbjct: 62 ECVRQNDDECTGAHSKGYWTLIPYFCGHDPVCRKPSQA-ALEQAKANVMRSYAVVGITED 120
Query: 211 LTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTN--RKIDPSEETVQQIKKSKIWELEN 268
+ F +LE +P +F+G + + + +N RK+ P ET+++++ + +LEN
Sbjct: 121 VDSFTQVLEQTIPKYFKGLAAEYQILKRRNKAYSNAKRKVKPKPETLEKMRP--LLKLEN 178
Query: 269 ELYEYALEQFH 279
E YE+ +F+
Sbjct: 179 EFYEFVKNRFY 189
>gi|47224594|emb|CAG03578.1| unnamed protein product [Tetraodon nigroviridis]
Length = 370
Score = 125 bits (313), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 95/277 (34%), Positives = 148/277 (53%), Gaps = 16/277 (5%)
Query: 16 SPSPETDSLSWDTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQ 75
SP +L + + ++YNRV K GS + V + + K +FN++ ++ N L+ +Q
Sbjct: 60 SPVLCLQALPFPSQVVYNRVGKCGSRTVVILLRLLAEKHQFNLVSSDIH-NKTRLTKHEQ 118
Query: 76 YRFVNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGD 135
+ N++ +P LY H F++F +F EQP++INI+R P+ R +S Y+F R+GD
Sbjct: 119 VDLMRNISGIP--QPFLYTRHVHFLNFTRF-HIEQPVYINIIRDPISRFLSNYFFRRFGD 175
Query: 136 --NYRPHLVR----KKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPW 189
+ HL+R K + CI N ECS + +P+ CG C PG W
Sbjct: 176 WRGEQNHLIRTPGMKDDERYLDINVCILENYPECSNPRAFYIIPYFCGQHPQCREPGV-W 234
Query: 190 ALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSN---KSHLRRTNR 246
ALE+AK+N++ YLLVG+ EEL D + LLE LP +F G + + T + +L T R
Sbjct: 235 ALERAKQNVLENYLLVGILEELEDVLLLLERLLPHYFSGVLNIYKTPDYKKMGNLTGTVR 294
Query: 247 KIDPSEETVQQIKKSKIWELENELYEYALEQFHFVKK 283
K PS E +Q + + +E + Y + +QFH KK
Sbjct: 295 KHTPSLEALQVLYRRMRYEY--DFYNFIRDQFHLTKK 329
>gi|443730784|gb|ELU16142.1| hypothetical protein CAPTEDRAFT_140021 [Capitella teleta]
Length = 274
Score = 124 bits (312), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 79/258 (30%), Positives = 136/258 (52%), Gaps = 22/258 (8%)
Query: 30 IIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWRDRR 89
I++NRVPK GS++ +++ M F H L D + + + +
Sbjct: 13 ILFNRVPKCGSSTLMDVMVQMADTHNFTFTRAKEYMRFH---LDDFWMLIRRIKT--EPT 67
Query: 90 PALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKHGDK 149
P +Y H + D +++G E+P+FIN++R PL+R+VS+YYF RY D + R+ ++
Sbjct: 68 PVIYERHIHYFDSEKYGM-ERPVFINVIRDPLERMVSWYYFRRYQDGHE----RQLPIEQ 122
Query: 150 TTFDECIRLNRTEC-------SLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKY 202
TFDEC+ N +EC + E + +PF CG C P AL +A +N+ Y
Sbjct: 123 RTFDECVLGNHSECMEHGGAKAEEGFFKIIPFFCGQEEFCREPTKE-ALVQAIKNVKKHY 181
Query: 203 LLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRR--TNRKIDPSEETVQQIKK 260
L+VGV E+ F+ +LE LP +F+G + + + R T K+ SE+T +QI +
Sbjct: 182 LVVGVLEDFEGFIEVLEFLLPDYFQGAQEVYTKQGRDLKDRHKTTYKLPVSEQT-EQIMR 240
Query: 261 SKIWELENELYEYALEQF 278
+++++ E + Y + ++F
Sbjct: 241 ARLFK-EYQFYMFVQKRF 257
>gi|449497306|ref|XP_002193512.2| PREDICTED: uronyl 2-sulfotransferase [Taeniopygia guttata]
Length = 335
Score = 124 bits (312), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 91/276 (32%), Positives = 146/276 (52%), Gaps = 20/276 (7%)
Query: 19 PETDSLSWDTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRF 78
P L + + ++YNRV K GS + V + + K FN++ ++ N L+ +Q
Sbjct: 23 PPQKVLPFPSQVVYNRVGKCGSRTVVLLLRILSEKHGFNLVTSDIH-NKTRLTKNEQMEL 81
Query: 79 VNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGD--N 136
+ N++ +P L+ H F++F +FG +QP++INI+R P++R +S Y+F R+GD
Sbjct: 82 IKNIST--AEQPYLFTRHVHFLNFSRFGG-DQPVYINIIRDPVNRFLSNYFFRRFGDWRG 138
Query: 137 YRPHLVR----KKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALE 192
+ H++R ++ + CI N ECS ++ +P+ CG C PG WALE
Sbjct: 139 EQNHMIRTPNMRQEERYLDINVCILENYPECSNPRLFYIIPYFCGQHPRCREPGE-WALE 197
Query: 193 KAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSH-----LRRTNRK 247
+AK N+ +LLVG+ EEL D + LLE LP +F+ + N H L T +K
Sbjct: 198 RAKLNVNENFLLVGILEELEDVLLLLERFLPHYFKDVLS--IYKNPEHRKLGNLTVTVKK 255
Query: 248 IDPSEETVQQIKKSKIWELENELYEYALEQFHFVKK 283
PS E +Q + + +E E Y Y EQFH +K+
Sbjct: 256 TVPSPEAIQILYQRMRYEY--EFYYYVKEQFHLLKR 289
>gi|313230080|emb|CBY07784.1| unnamed protein product [Oikopleura dioica]
Length = 340
Score = 124 bits (311), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 87/275 (31%), Positives = 135/275 (49%), Gaps = 21/275 (7%)
Query: 17 PSPETDSLSWDTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQY 76
P PE ++YNRVPK S S ++Y + K F V G ++ +Q
Sbjct: 74 PMPEAQ------FVVYNRVPKCASMSMTTLSYKLGGKNNFKVESPYEPGEKPEKTITEQR 127
Query: 77 RFVNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDN 136
FV+ + P +Y H +IDF + +IN++R P+DR S+YYF R+G N
Sbjct: 128 EFVDFLQNQEP--PYMYIRHQYYIDFSELNEDFSAAYINMIRDPIDRFESFYYFSRFG-N 184
Query: 137 YRPHLVRKKHGDKT---TFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEK 193
+ + D+ + DEC+ R EC+ + +W VP+ CG A C + A++
Sbjct: 185 EKGGGGNARLSDEQRNESVDECVAKRRRECT-QPVWQVVPYFCGMDAECN-NRHIRAVQI 242
Query: 194 AKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRG-----GTDHFLTSNKSHLRRTNRKI 248
AKE++ YL VG EE+ + +LE LPSFF G GTD + N +T K
Sbjct: 243 AKEHIEENYLFVGTLEEMNLSLGILEKLLPSFFGGARELTGTDE--SQNMKTQTKTLNKK 300
Query: 249 DPSEETVQQIKKSKIWELENELYEYALEQFHFVKK 283
S ET + +K++ +LE ELY++ +++ H K
Sbjct: 301 KTSSETREWLKENTSLKLEYELYDFVVKRLHRAAK 335
>gi|126310677|ref|XP_001370807.1| PREDICTED: uronyl 2-sulfotransferase [Monodelphis domestica]
Length = 407
Score = 124 bits (310), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 89/271 (32%), Positives = 146/271 (53%), Gaps = 20/271 (7%)
Query: 24 LSWDTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVT 83
L + + ++YNRV K GS + V + + K FN++ ++ N L+ +Q + N++
Sbjct: 100 LPFPSQVVYNRVGKCGSRTVVLLLRILSEKHGFNLVTSDIH-NKTRLTKNEQMELIKNIS 158
Query: 84 KWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGD--NYRPHL 141
+P L+ H F++F +FG +QP++INI+R P++R +S Y+F R+GD + H+
Sbjct: 159 T--AEQPYLFTRHVHFLNFSRFGG-DQPVYINIIRDPVNRFLSNYFFRRFGDWRGEQNHM 215
Query: 142 VR----KKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKEN 197
+R ++ +ECI N ECS ++ +P+ CG C PG WALE+AK N
Sbjct: 216 IRTPSMRQEERYLDINECILENYPECSNPRLFYIIPYFCGQHPRCREPGE-WALERAKLN 274
Query: 198 LVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTN-----RKIDPSE 252
+ +LLVG+ EEL D + LLE LP +F+ + N H + N +K PS
Sbjct: 275 VNEHFLLVGILEELEDVLLLLERFLPHYFKDVLS--IYKNPEHRKLGNMTVTVKKTVPSP 332
Query: 253 ETVQQIKKSKIWELENELYEYALEQFHFVKK 283
E +Q + + +E E Y + EQFH +K+
Sbjct: 333 EAIQILYQRMRYEY--EFYYFVKEQFHLLKR 361
>gi|363731679|ref|XP_001231938.2| PREDICTED: uronyl 2-sulfotransferase [Gallus gallus]
Length = 450
Score = 124 bits (310), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 94/291 (32%), Positives = 151/291 (51%), Gaps = 26/291 (8%)
Query: 10 HISSAKSP------SPETDSLSWDTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNV 63
H+S+ S P L + + ++YNRV K GS + V + + K FN++ ++
Sbjct: 102 HVSTGNSTYLDDHGPPPQKVLPFPSQVVYNRVGKCGSRTVVLLLRILSEKHGFNLVTSDI 161
Query: 64 TGNNHVLSLADQYRFVNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDR 123
N L+ +Q + N++ +P L+ H F++F +FG +QP++INI+R P++R
Sbjct: 162 H-NKTRLTKNEQMELIKNIST--AEQPYLFTRHVHFLNFSRFGG-DQPVYINIIRDPVNR 217
Query: 124 LVSYYYFLRYGD--NYRPHLVR----KKHGDKTTFDECIRLNRTECSLENMWLQVPFLCG 177
+S Y+F R+GD + H++R ++ + CI N ECS ++ +P+ CG
Sbjct: 218 FLSNYFFRRFGDWRGEQNHMIRTPSMRQEERYLDINVCILENYPECSNPRLFYIIPYFCG 277
Query: 178 HAAACWVPGNPWALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSN 237
C PG WALE+AK N+ +LLVG+ EEL D + LLE LP +F+ + N
Sbjct: 278 QHPRCREPGE-WALERAKLNVNENFLLVGILEELEDVLLLLERFLPHYFKDVLS--IYKN 334
Query: 238 KSH-----LRRTNRKIDPSEETVQQIKKSKIWELENELYEYALEQFHFVKK 283
H L T +K PS E +Q + + +E E Y Y EQFH +K+
Sbjct: 335 PEHRKLGNLTVTVKKTVPSPEAIQILYQRMRYEY--EFYYYVKEQFHLLKR 383
>gi|395535128|ref|XP_003769584.1| PREDICTED: uronyl 2-sulfotransferase [Sarcophilus harrisii]
Length = 408
Score = 124 bits (310), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 89/271 (32%), Positives = 146/271 (53%), Gaps = 20/271 (7%)
Query: 24 LSWDTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVT 83
L + + ++YNRV K GS + V + + K FN++ ++ N L+ +Q + N++
Sbjct: 101 LPFPSQVVYNRVGKCGSRTVVLLLRILSEKHGFNLVTSDIH-NKTRLTKNEQMELIKNIS 159
Query: 84 KWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGD--NYRPHL 141
+P L+ H F++F +FG +QP++INI+R P++R +S Y+F R+GD + H+
Sbjct: 160 T--AEQPYLFTRHVHFLNFSRFGG-DQPVYINIIRDPVNRFLSNYFFRRFGDWRGEQNHM 216
Query: 142 VR----KKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKEN 197
+R ++ +ECI N ECS ++ +P+ CG C PG WALE+AK N
Sbjct: 217 IRTPSMRQEERYLDINECILENYPECSNPRLFYIIPYFCGQHPRCREPGE-WALERAKLN 275
Query: 198 LVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTN-----RKIDPSE 252
+ +LLVG+ EEL D + LLE LP +F+ + N H + N +K PS
Sbjct: 276 VNEHFLLVGILEELEDVLLLLERFLPHYFKDVLS--IYKNPEHRKLGNMTVTVKKTVPSP 333
Query: 253 ETVQQIKKSKIWELENELYEYALEQFHFVKK 283
E +Q + + +E E Y + EQFH +K+
Sbjct: 334 EAIQILYQRMRYEY--EFYYFVKEQFHLLKR 362
>gi|313236912|emb|CBY12161.1| unnamed protein product [Oikopleura dioica]
Length = 359
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 80/264 (30%), Positives = 141/264 (53%), Gaps = 22/264 (8%)
Query: 29 VIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQ-YRFVNNVTKWRD 87
+++YNRVPK GST+ +++ + +K +NV + H + +Q Y V N+T+++
Sbjct: 83 IVVYNRVPKCGSTTTLDIIRFLRKKLHYNVFNDIAPKMKHFMESDEQEYGLVRNITQFK- 141
Query: 88 RRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYG----------DNY 137
RP LY H FIDF ++G ++ PL+INI R P+ +S Y++LR+G N+
Sbjct: 142 -RPILYIRHVYFIDFPKYGYRD-PLYINIARDPVSLFISNYFYLRFGFQTAKNTTNAQNW 199
Query: 138 RPHLVRKKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKEN 197
+ + ++ + D+C++ EC+ L +PF CG ++ C G+ A+ AK+N
Sbjct: 200 KHEMPDERRA--MSIDDCVKTEAQECARPYSNL-IPFFCGSSSICQKRGDE-AVAIAKQN 255
Query: 198 LVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKS-HLR-RTNRKIDPSEETV 255
++ +Y +VG+ E+ + E P FF G + +N+ H R +T K +P TV
Sbjct: 256 VLDRYAIVGIMEDFHSTMKAFEVVSPRFFMGASVLLDKANQVLHERSKTAHKKEPDPATV 315
Query: 256 QQIKKSKIWELENELYEYALEQFH 279
+ ++K + E ELY + E F+
Sbjct: 316 EYLRKG--LKREYELYNFIKEIFY 337
>gi|313245855|emb|CBY34843.1| unnamed protein product [Oikopleura dioica]
Length = 359
Score = 123 bits (308), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 80/264 (30%), Positives = 141/264 (53%), Gaps = 22/264 (8%)
Query: 29 VIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQ-YRFVNNVTKWRD 87
+++YNRVPK GST+ +++ + +K +NV + H + +Q Y V N+T+++
Sbjct: 83 IVVYNRVPKCGSTTTLDIIRFLRKKLHYNVFNDIAPKMKHFMESDEQEYGLVRNITQFK- 141
Query: 88 RRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYG----------DNY 137
RP LY H FIDF ++G ++ PL+INI R P+ +S Y++LR+G N+
Sbjct: 142 -RPILYIRHVYFIDFPKYGYRD-PLYINIARDPVSLFISNYFYLRFGFQTAKNTTNAQNW 199
Query: 138 RPHLVRKKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKEN 197
+ + ++ + D+C++ EC+ L +PF CG ++ C G+ A+ AK+N
Sbjct: 200 KHEMPDERRA--MSIDDCVKTEAQECARPYSNL-IPFFCGSSSICQKRGDE-AVAIAKQN 255
Query: 198 LVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKS-HLR-RTNRKIDPSEETV 255
++ +Y +VG+ E+ + E P FF G + +N+ H R +T K +P TV
Sbjct: 256 VLDRYAIVGIMEDFHSTMKAFEVVSPRFFMGASVLLDKANQVLHERSKTAHKKEPDPATV 315
Query: 256 QQIKKSKIWELENELYEYALEQFH 279
+ ++K + E ELY + E F+
Sbjct: 316 EYLRKG--LKREYELYNFIKEIFY 337
>gi|432945323|ref|XP_004083541.1| PREDICTED: uronyl 2-sulfotransferase-like [Oryzias latipes]
Length = 405
Score = 122 bits (306), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 89/269 (33%), Positives = 145/269 (53%), Gaps = 16/269 (5%)
Query: 24 LSWDTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVT 83
L + + ++YNRV K GS + V + + K +FN+ ++ N L+ +Q + N++
Sbjct: 95 LPFPSQVVYNRVGKCGSRTVVILLRLLAEKHQFNLFSSDIH-NKTRLTKHEQVDLMRNIS 153
Query: 84 KWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGD--NYRPHL 141
K P LY H F++F +F EQP++INI+R P++R +S Y+F R+GD + HL
Sbjct: 154 KIPP--PFLYTRHVHFLNFSRF-RIEQPVYINIIRDPINRFLSNYFFRRFGDWRGEQNHL 210
Query: 142 VR----KKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKEN 197
+R K + CI N ECS ++ +P+ CG C PG WALE+AK+N
Sbjct: 211 IRTPGMKDDERYLDINVCILENYPECSNPRVFYIIPYFCGQHPQCREPGV-WALERAKQN 269
Query: 198 LVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSN---KSHLRRTNRKIDPSEET 254
++ YL+VG+ EEL + + +LE LP +F G + + + ++ T RK PS E
Sbjct: 270 VLENYLIVGILEELEEVLLMLERLLPHYFAGVLNIYKSPEYKKMGNMTGTVRKQTPSLEA 329
Query: 255 VQQIKKSKIWELENELYEYALEQFHFVKK 283
++ + +E E Y + +QFH +KK
Sbjct: 330 LKVLYHRMRYEY--EFYNFIRDQFHLMKK 356
>gi|443709964|gb|ELU04384.1| hypothetical protein CAPTEDRAFT_223508 [Capitella teleta]
Length = 381
Score = 122 bits (306), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 84/277 (30%), Positives = 140/277 (50%), Gaps = 38/277 (13%)
Query: 30 IIYNRVPKTGSTSFVNMAYDMCRKKRFNVL--------HVNVTGNNHV------------ 69
I+YNRVPK ST+ + D+ FN+ VN TG V
Sbjct: 95 ILYNRVPKCSSTTLRYIFGDLATANNFNLEVSNNFAQEAVNDTGEGEVSMQSYCSIVFKA 154
Query: 70 --LSLADQYRFVNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSY 127
+S + + V + P+LY H F+DFQ+ + P++IN++R P+DRLVS
Sbjct: 155 FEISCLAEKVLTDEVKNLK--VPSLYIRHVFFVDFQK-HDRPNPIYINLVRDPIDRLVSD 211
Query: 128 YYFLRYG-DNYRPHLVRKKHGDKTTFDECIRLNRTEC---SLENMWLQVPFLCGHAAACW 183
YYF R+ N P ++ K T+DEC+ EC + + + +P+ CG C+
Sbjct: 212 YYFKRFQLQNVFPMPEERR---KRTYDECVFGMFPECVAPTPKGFFRIIPYFCGQDEKCF 268
Query: 184 VPGNPWALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRR 243
P WAL++AKEN++ YL+VG++E++ F+ +L+ +P+FF+ G + + S S +
Sbjct: 269 TPSQ-WALDRAKENVIKHYLVVGMSEDVDKFLEVLDMMMPTFFKKGREMY-ESKLSFFKE 326
Query: 244 TNR--KIDPSEETVQQIKKSKIWELENELYEYALEQF 278
+ KI P+ T+ +K + E + YE+ ++F
Sbjct: 327 KFKSGKIPPTNRTISIMK--DMMEYDYHFYEFVRQRF 361
>gi|195092462|ref|XP_001997636.1| GH19644 [Drosophila grimshawi]
gi|193906054|gb|EDW04921.1| GH19644 [Drosophila grimshawi]
Length = 154
Score = 122 bits (305), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 53/95 (55%), Positives = 73/95 (76%), Gaps = 2/95 (2%)
Query: 12 SSAKSPSPETDSLSWD--TVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHV 69
SS +P D+ +++ V++YNRVPKTGSTSFVN+AYD+C++ R++VLH+NVT N HV
Sbjct: 60 SSLAQIAPTLDNFNYEEQLVVLYNRVPKTGSTSFVNIAYDLCKQNRYHVLHINVTANMHV 119
Query: 70 LSLADQYRFVNNVTKWRDRRPALYHGHFGFIDFQQ 104
LSL +Q FV NVT+W + +PALYHGH F+DF +
Sbjct: 120 LSLPNQISFVRNVTRWHEMKPALYHGHMAFLDFSK 154
>gi|72080234|ref|XP_792962.1| PREDICTED: uronyl 2-sulfotransferase-like [Strongylocentrotus
purpuratus]
Length = 351
Score = 121 bits (304), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 87/268 (32%), Positives = 145/268 (54%), Gaps = 24/268 (8%)
Query: 30 IIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWRDRR 89
++Y VPK GS +FV +A+ + K + +T ++L+ +++ + + +T+
Sbjct: 88 VVYVSVPKCGSRTFVWVAWILKDKNNISA----ITDLPYLLANSEENKMKSLLTEKLQSV 143
Query: 90 P--ALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDN----YRPHLVR 143
P L+HGH FI F P+++++LR PL R VS+YYF+R+GD + H +
Sbjct: 144 PPGGLFHGHIRFIGFTN--PAMMPVYVSMLRDPLARHVSWYYFMRFGDADMDVNKLHEMI 201
Query: 144 KKHGDKT---TFDECIRLNRTECSLENMW-LQVPFLCGHAAACWVPGNPWALEKAKENLV 199
K G T+DEC+ R C+ E + + CG+ C + +ALE+AK NL
Sbjct: 202 VKEGVSAVNQTYDECVERGRPSCTGEYYTNINIRTFCGYEEKC-ITSPEYALEQAKRNL- 259
Query: 200 TKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNK---SHLRRTNRKIDPSEETVQ 256
+L+VG+ EE F+ ++E LPS F GG H +NK S + +T+ K PS +V+
Sbjct: 260 DNFLVVGIVEEYESFLRVIERLLPSMF-GGAIHDYQANKDEFSAVSKTSFKKMPSARSVE 318
Query: 257 QIKKSKIWELENELYEYALEQFHFVKKH 284
+K+ +L+ E Y+Y E+FH +K+H
Sbjct: 319 IMKER--MKLDYEFYDYVKERFHKLKRH 344
>gi|349802527|gb|AEQ16736.1| putative heparan sulfate 2-o-sulfotransferase 1 [Pipa carvalhoi]
Length = 94
Score = 121 bits (303), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 55/97 (56%), Positives = 71/97 (73%), Gaps = 3/97 (3%)
Query: 145 KHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLL 204
K GDK TFDEC+ ++C+ E +WLQ+PF CGH++ CW G+ WAL++AK NLV +Y L
Sbjct: 1 KQGDKKTFDECVAAGGSDCAPEKLWLQIPFFCGHSSECWNVGSRWALDQAKYNLVNEYFL 60
Query: 205 VGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHL 241
VGVTEEL DF+ LLEAALP F GT+ + + KSHL
Sbjct: 61 VGVTEELEDFIMLLEAALPRF---GTELYRSGKKSHL 94
>gi|74096439|ref|NP_001027901.1| uronyl 2-sulfotransferase [Danio rerio]
gi|73760257|dbj|BAE20053.1| uronyl 2-sulfotransferase [Danio rerio]
gi|213624639|gb|AAI71379.1| Uronyl-2-sulfotransferase [Danio rerio]
gi|213627816|gb|AAI71381.1| Uronyl-2-sulfotransferase [Danio rerio]
Length = 407
Score = 120 bits (301), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 95/278 (34%), Positives = 149/278 (53%), Gaps = 24/278 (8%)
Query: 19 PETDSLSWDTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRF 78
P L + + +IYNRV K GS + V + + K +FN++ ++ N L+ +Q
Sbjct: 87 PTPRELPFPSQVIYNRVGKCGSRTVVLLLRILAEKHQFNLVSSDIH-NKTRLTKHEQVDL 145
Query: 79 VNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGD--N 136
+ N++ +P LY H F++F +F EQP++INI+R P++R +S Y+F R+GD
Sbjct: 146 ITNISNIP--QPFLYTRHVHFLNFTRF-RIEQPVYINIIRDPINRFLSNYFFRRFGDWRG 202
Query: 137 YRPHLVR--KKHGDKTTFD--ECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALE 192
+ HL+R + D+ D CI N ECS ++ VP+ CG C PG WA+E
Sbjct: 203 EQNHLIRTPQMKDDERYLDINVCIMENYPECSNPRLFYIVPYFCGQHPQCREPGM-WAVE 261
Query: 193 KAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKS-------HLRRTN 245
+AK+N++ +LLVG+ EEL D + LLE LP +F LT KS +L T
Sbjct: 262 RAKQNVIENFLLVGILEELEDVLLLLERLLPHYF----SDVLTIYKSPAFWKMGNLTGTV 317
Query: 246 RKIDPSEETVQQIKKSKIWELENELYEYALEQFHFVKK 283
+K P+ E +Q + + +E + Y + +QFH KK
Sbjct: 318 KKHMPTIEALQVLYQRMKYEY--DFYNFIRDQFHLTKK 353
>gi|449277842|gb|EMC85864.1| Uronyl 2-sulfotransferase, partial [Columba livia]
Length = 327
Score = 120 bits (301), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 91/278 (32%), Positives = 146/278 (52%), Gaps = 22/278 (7%)
Query: 19 PETDSLSWDTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQ--Y 76
P L + + ++YNRV K GS + V + + K FN++ ++ N L+ +Q
Sbjct: 14 PPQKVLPFPSQVVYNRVGKCGSRTVVLLLRILSEKHGFNLVTSDIH-NKTRLTKNEQASM 72
Query: 77 RFVNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGD- 135
+ N++ +P L+ H F++F +FG +QP++INI+R P++R +S Y+F R+GD
Sbjct: 73 ELIKNIST--AEQPYLFTRHVHFLNFSRFGG-DQPVYINIIRDPVNRFLSNYFFRRFGDW 129
Query: 136 -NYRPHLVR----KKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWA 190
+ H++R ++ + CI N ECS ++ +P+ CG C PG WA
Sbjct: 130 RGEQNHMIRTPSMRQEERYLDINVCILENYPECSNPRLFYIIPYFCGQHPRCREPGE-WA 188
Query: 191 LEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSH-----LRRTN 245
LE+AK N+ +LLVG+ EEL D + LLE LP +F+ + N H L T
Sbjct: 189 LERAKLNVNENFLLVGILEELEDVLLLLERFLPHYFKDVLS--IYKNPEHRKLGNLTVTV 246
Query: 246 RKIDPSEETVQQIKKSKIWELENELYEYALEQFHFVKK 283
+K PS E +Q + + +E E Y Y EQFH +K+
Sbjct: 247 KKTVPSPEAIQVLYQRMRYEY--EFYYYVKEQFHLLKR 282
>gi|443719467|gb|ELU09621.1| hypothetical protein CAPTEDRAFT_189618 [Capitella teleta]
Length = 334
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 67/218 (30%), Positives = 118/218 (54%), Gaps = 24/218 (11%)
Query: 30 IIYNRVPKTGSTSFVNMAYDMCRKKRFN-------VLHVNVTGNNHVLSLADQYRFVNNV 82
I YNR+PK GS + + ++ +K + + + G+ + + + ++++
Sbjct: 72 IFYNRMPKCGSEMTMTLLRNIAKKNHWTYPPENKRIWTITKQGHQYWVDNEKVLKEISHL 131
Query: 83 TKWRDRRPALYHGHFGFIDFQQFGSKEQ-PLFINILRKPLDRLVSYYYFLRYGDNYRPHL 141
T ++P L+ H + DF + SK+ P +INI+R P DRL+S YY+ R+ H
Sbjct: 132 TM-DHKKPMLFSCHLYYTDFSRLWSKKSLPTYINIIRNPQDRLISLYYYFRF------HP 184
Query: 142 VRKKH----GDKT-TFDECIRLNRTECSL---ENMWLQVPFLCGHAAACWVPGNPWALEK 193
+++++ D+ T+D+C+ + EC+ E W VP+ CGH C P ALE+
Sbjct: 185 IQRRNDMSNNDRAMTYDDCVEQQKEECTAPNPEGFWTMVPYFCGHDPICRTPSRD-ALER 243
Query: 194 AKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTD 231
AK N+V+ Y ++G+TE++ F +LE +P +FRG TD
Sbjct: 244 AKANVVSNYAVIGLTEDMETFTRVLENVIPKYFRGMTD 281
>gi|313239281|emb|CBY14231.1| unnamed protein product [Oikopleura dioica]
Length = 338
Score = 118 bits (295), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 84/282 (29%), Positives = 138/282 (48%), Gaps = 19/282 (6%)
Query: 8 QIHISSAKSPSPETDSLSWDTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNN 67
++H SS + +L +++NRVPK GS S +AYD+ K +F V G
Sbjct: 54 RVHRSSFPENNVNKLALKNKEYVVFNRVPKCGSMSMTQLAYDLGGKNQFKVESPYEPGEK 113
Query: 68 HVLSLADQYRFVNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSY 127
S +Q F V + + P +Y H ++DF KE+ +IN++R P+ R S+
Sbjct: 114 QTKSQEEQDAFRKYV--FDQKPPYMYIRHQNYVDFWDPVEKEKVAYINMIRDPIARFESF 171
Query: 128 YYFLRYGDNY----RPHLVRKKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAA--A 181
YYF R+G+N R L ++ K T D+C+ R EC ++ W VP+LCG
Sbjct: 172 YYFSRFGNNLGGGGRAKLNEERK--KETVDDCVAKKRQEC-VKPWWQIVPYLCGQVTDPR 228
Query: 182 CWVPGNPWALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGG-----TDHFLTS 236
C + WA+++AK N+ Y VG+ +EL +++LE LP F++ D F+
Sbjct: 229 CQ-ERDQWAVDRAKYNIDQNYAFVGLLDELEMSLAVLEQLLPEFYKDARSLVKQDSFVKM 287
Query: 237 NKSHLRRTNRKIDPSEETVQQIKKSKIWELENELYEYALEQF 278
L T K SE++ + + + E ++Y++ LE+
Sbjct: 288 KNGTL--TTFKKPASEKSREYLMTQTSLKYEYQIYDHVLEKL 327
>gi|313245447|emb|CBY40178.1| unnamed protein product [Oikopleura dioica]
Length = 338
Score = 118 bits (295), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 84/282 (29%), Positives = 138/282 (48%), Gaps = 19/282 (6%)
Query: 8 QIHISSAKSPSPETDSLSWDTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNN 67
++H SS + +L +++NRVPK GS S +AYD+ K +F V G
Sbjct: 54 RVHRSSLPENNINRLALKNKEYVVFNRVPKCGSMSMTQLAYDLGGKNQFKVESPYEPGEK 113
Query: 68 HVLSLADQYRFVNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSY 127
S +Q F V + + P +Y H ++DF KE+ +IN++R P+ R S+
Sbjct: 114 QTKSQEEQDAFRKYV--FDQKPPYMYIRHQNYVDFWDPVEKEKVAYINMIRDPIARFESF 171
Query: 128 YYFLRYGDNY----RPHLVRKKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAA--A 181
YYF R+G+N R L ++ K T D+C+ R EC ++ W VP+LCG
Sbjct: 172 YYFSRFGNNLGGGGRAKLNEERK--KETVDDCVAKKRQEC-VKPWWQIVPYLCGQVTDPR 228
Query: 182 CWVPGNPWALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGG-----TDHFLTS 236
C + WA+++AK N+ Y VG+ +EL +++LE LP F++ D F+
Sbjct: 229 CQ-ERDQWAVDRAKYNIDQNYAFVGLLDELEMSLAVLEQLLPEFYKDARSLVKQDSFVKM 287
Query: 237 NKSHLRRTNRKIDPSEETVQQIKKSKIWELENELYEYALEQF 278
L T K SE++ + + + E ++Y++ LE+
Sbjct: 288 KNGTL--TTFKKPASEKSREYLMTQTSLKYEYQIYDHVLEKL 327
>gi|443685601|gb|ELT89155.1| hypothetical protein CAPTEDRAFT_171544 [Capitella teleta]
Length = 274
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 88/270 (32%), Positives = 139/270 (51%), Gaps = 23/270 (8%)
Query: 24 LSWDTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVT 83
LS D V YNRV K GS + + + + + F + + + L L +Q FV+++
Sbjct: 15 LSVDRVF-YNRVAKCGSRTTMRILEKLEKLNNFTIYKSKIY-DKMKLQLNEQSEFVDDIM 72
Query: 84 KWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVR 143
R P L H F+DF+ F + QP++ N++R P+DR VS +YF++ +N
Sbjct: 73 --RVPAPLLIDRHIHFLDFEAFDA-PQPVYFNVIRDPVDRAVSTFYFVQ--NNLTSPEEA 127
Query: 144 KKHGDKTTFDECIRLNRTECSLENMWLQVPF-LCGHAAACWV-PGNPWALEKAKENLVTK 201
KK K+ F+EC+ + C+ ++++++ + CG CW W L KAK+NLV
Sbjct: 128 KKIKIKS-FEECVYKKKEGCTGRHVFMKMLYYFCGQDPRCWFDKSRSWTLAKAKQNLVKY 186
Query: 202 YLLVGVTEELTDFVSLLEAALPSFFRGGTDHF--LTSNKSHLRRTNRKIDPSEETVQQIK 259
Y +VG+ E++ F LE +P FF+G F S+ +T KI PSEE V+ I
Sbjct: 187 YSVVGIVEDMDSFFYALEKRMPRFFKGAFGLFGRYGSSLKEAYKTKGKIYPSEE-VRTIM 245
Query: 260 KSKIWELENELYEYALEQFHFVKK--HNLV 287
K + E A E ++FVK+ HNL+
Sbjct: 246 KKNMPE--------AFELYYFVKQRFHNLL 267
>gi|167518590|ref|XP_001743635.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163777597|gb|EDQ91213.1| predicted protein [Monosiga brevicollis MX1]
Length = 357
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 83/256 (32%), Positives = 126/256 (49%), Gaps = 16/256 (6%)
Query: 30 IIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWRDR- 88
+ YNRVPK STS +A K + + V + + V + D+
Sbjct: 61 VFYNRVPKAASTSLKTLASQRAHKNGYIHISSTVYNDRGFFETDQEEANVQRIQAIFDQY 120
Query: 89 RPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKHGD 148
LY H F++F +FG K QP +IN++R+P+ R VS YYF R G + R +R G+
Sbjct: 121 DKVLYDQHIRFLNFSKFG-KHQPAYINMVREPISRAVSTYYFARVGQHSRRDEIRALLGE 179
Query: 149 KTTFD--ECIRLNRTEC-----SLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTK 201
+ +D CI R EC +E+ L F CGH+ AC V + A E AK NL
Sbjct: 180 QADWDINTCID-RREECRWFTSKIEHFNLMTRFFCGHSEACRVVDDA-AFEVAKYNLEHN 237
Query: 202 YLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRR-TNRKID--PSEETVQQI 258
++ VGVTE + V +LE LP FFRG ++ R+ N+K P++E ++ +
Sbjct: 238 FVFVGVTERFAESVRVLEKILPKFFRGAYRTISSAAPDASRQNVNKKAGAKPNQENLEIL 297
Query: 259 KKSKIWELENELYEYA 274
+ ++L LY++A
Sbjct: 298 QHLARYDL--ALYQFA 311
>gi|443719427|gb|ELU09608.1| hypothetical protein CAPTEDRAFT_229374 [Capitella teleta]
Length = 346
Score = 115 bits (287), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 88/307 (28%), Positives = 148/307 (48%), Gaps = 33/307 (10%)
Query: 4 QKSHQIHISSAKSPSPETDSLSWDTV--------IIYNRVPKTGSTSFVNMAYDMCRKKR 55
++S +H + + +P+ D +++ I YNRVPK GS + + + + K
Sbjct: 54 EESEFVHFTPPPTLTPDLDDTGVNSIEVLTNVSRIFYNRVPKCGSRTILRILEKLSEKNE 113
Query: 56 FNVLHVNVTGNNHVLSLADQYRFVNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFIN 115
+ H + N L+ + RFV + P LY H F+DF++ G E PL+IN
Sbjct: 114 IHNYHSEIY-NVKQLTEEGEVRFVQEFMEHDP--PLLYDRHLYFVDFKKHG-HEPPLYIN 169
Query: 116 ILRKPLDRLVSYYYFLRYGDNYRPHLVRKKHGDKTTFDECIRLNRTEC-SLENMWLQVPF 174
++R P +R++S +Y++ Y P + + TF EC++ N+T C S N+ V +
Sbjct: 170 LIRDPFERILSRHYYILYESRNTPKETKDEFN--MTFTECVKQNKTICMSPSNLIELVRY 227
Query: 175 LCGHAAACWVPG---NPWALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTD 231
CG C G N L KAK +++ + +VGV E+ F LLE LP+FF G +
Sbjct: 228 FCGQDPVCESKGGANNAAILRKAKTHVIKHFPVVGVIEDAESFFFLLEKRLPTFFGGALE 287
Query: 232 ---HFLTSNKSHLRRTNRKIDPSEETVQQIKKSKIWELENELYEYALEQFHFV--KKHNL 286
H ++ K R + +KI P T K + +E Y++ ++FV ++HN+
Sbjct: 288 IYQHIISKMKVRYRNS-QKIMPDNAT-----KDYVISKMSEAYDF----YNFVRQRQHNM 337
Query: 287 VYNKVLG 293
+ LG
Sbjct: 338 MTAFQLG 344
>gi|443726858|gb|ELU13870.1| hypothetical protein CAPTEDRAFT_139975, partial [Capitella teleta]
Length = 79
Score = 114 bits (286), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 49/78 (62%), Positives = 63/78 (80%)
Query: 27 DTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWR 86
DTV+IYNRVPKTGSTSF +AYD+C + +FNVLH+NV+ NNHVL L+DQ RFV N+T W
Sbjct: 2 DTVLIYNRVPKTGSTSFAGVAYDLCVQNKFNVLHLNVSKNNHVLGLSDQRRFVLNITHWE 61
Query: 87 DRRPALYHGHFGFIDFQQ 104
++PALYHGH ++ F +
Sbjct: 62 SKKPALYHGHLAYLPFSR 79
>gi|443718404|gb|ELU09056.1| hypothetical protein CAPTEDRAFT_203735 [Capitella teleta]
Length = 328
Score = 114 bits (286), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 75/253 (29%), Positives = 137/253 (54%), Gaps = 13/253 (5%)
Query: 29 VIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWRDR 88
+I YNRVPK GS S +++ +M RK + + ++ H L+ + +FV++ + +
Sbjct: 52 LIFYNRVPKCGSRSVMSVIDEMGRKNGYKWISSDIFNQAH-LNTVQRAQFVSDFIELKP- 109
Query: 89 RPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKHGD 148
P LY H FI+F ++G P+++N++R P +RL+S++ ++R + P +++H
Sbjct: 110 -PYLYDRHIFFINFTEYGFP-LPIYMNLIRDPFERLLSFHCYVR-DEKPLPMADKRRHQF 166
Query: 149 KTTFDECIRLNRTECSLENMWLQV-PFLCGHAAACWVPGNPW-ALEKAKENLVTKYLLVG 206
TT++EC+ + C + L + + CG + C NP +L +AK N++ Y +VG
Sbjct: 167 NTTYEECVNQGYSICVANKVLLNLLAYFCGQDSVCT--ENPAVSLARAKRNIIKHYSIVG 224
Query: 207 VTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLR--RTNRKIDPSEETVQQIKKSKIW 264
V E+L F LE P FF+G D FL + L + + K P + TV ++K K+
Sbjct: 225 VMEDLEGFFYTLEKKFPGFFKGAQDVFLEHERGLLSKFKNSGKEYPPQYTVDIMRK-KLA 283
Query: 265 ELENELYEYALEQ 277
E + Y++ +++
Sbjct: 284 E-SYDFYQFVMQR 295
>gi|26351977|dbj|BAC39625.1| unnamed protein product [Mus musculus]
Length = 396
Score = 114 bits (285), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 75/216 (34%), Positives = 122/216 (56%), Gaps = 11/216 (5%)
Query: 19 PETDSLSWDTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRF 78
P + L + + ++YNRV K GS + V + + K FN++ ++ N L+ +Q
Sbjct: 95 PPSKVLPFPSQVVYNRVGKCGSRTVVLLLRILSEKHGFNLVTSDIH-NKTRLTKNEQMEL 153
Query: 79 VNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGD--N 136
+ N++ +P L+ H F++F +FG +QP++INI+R P+ R +S Y+F R+GD
Sbjct: 154 IKNIST--AEQPYLFTRHVHFLNFSRFGG-DQPVYINIIRDPVSRFLSNYFFRRFGDWRG 210
Query: 137 YRPHLVR----KKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALE 192
+ H++R ++ +ECI N ECS ++ +P+ CG C PG WALE
Sbjct: 211 EQNHMIRTPSMRQEERYLDINECILENYPECSNPRLFYIIPYFCGQHPRCREPGE-WALE 269
Query: 193 KAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRG 228
+AK N+ +LLVG+ EEL D + LLE LP +F+G
Sbjct: 270 RAKLNVNENFLLVGILEELEDVLLLLERFLPHYFKG 305
>gi|260833386|ref|XP_002611638.1| hypothetical protein BRAFLDRAFT_63697 [Branchiostoma floridae]
gi|229297009|gb|EEN67648.1| hypothetical protein BRAFLDRAFT_63697 [Branchiostoma floridae]
Length = 632
Score = 113 bits (283), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 82/249 (32%), Positives = 115/249 (46%), Gaps = 17/249 (6%)
Query: 32 YNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSL--ADQYRFVNNVTKWRDRR 89
Y+R+PK GS + + + K F N H +L D RF + K
Sbjct: 34 YHRLPKCGSRAMKVLIAQLQEKNHFKFFS---AANLHAKALHGEDLLRFAQMMDKLPT-- 88
Query: 90 PALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRP-----HLVRK 144
P++Y H FIDF +K QPL+IN++R P++R VS YY+LRYG + L R
Sbjct: 89 PSIYEKHTYFIDFPLMAAK-QPLYINLVRDPIERRVSAYYYLRYGRHGNEFADILKLRRT 147
Query: 145 KHGDKTTFDECIRLNRTECSLE--NMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKY 202
+ T D C+ N ECS N ++ + CG + C P A+E AKEN+ Y
Sbjct: 148 EEQRNQTLDYCVANNLRECSASEPNSFILTQYFCGQSTICTKPSQV-AVEVAKENIRRHY 206
Query: 203 LLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEETVQQIKKSK 262
+VGV EE + F+ +LE +P FFRG + S+ I E VQ +
Sbjct: 207 AVVGVLEEFSSFLKVLEVVMPQFFRGAVTEW-EHIGSNYEAKPASIQARVEHVQDQQPLT 265
Query: 263 IWELENELY 271
IW N Y
Sbjct: 266 IWPGSNHSY 274
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 54/170 (31%), Positives = 94/170 (55%), Gaps = 12/170 (7%)
Query: 114 INILRKPLDRLVSYYYFLRYGDNYRPHLVRKKHGDKTTFDECIRLNRTECSLEN--MWLQ 171
+ I+RK DR + ++F Y + V + G TFD+C+ N EC+ + +++
Sbjct: 461 VTIIRKLKDR--NGFHFFE-DKTYTAYTVLR--GRNLTFDDCVFYNTWECNAKGPLVFMM 515
Query: 172 VPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTD 231
F CGH C P A+EKAKEN+ Y +VGV EE + F+ +LE +P FFRG D
Sbjct: 516 TKFFCGHDDICMQPTQA-AVEKAKENIRRHYAVVGVLEEFSSFLKVLEVVMPQFFRGAHD 574
Query: 232 HF--LTSNKSHLRRTNRKIDPSEETVQQIKKSKIWELENELYEYALEQFH 279
+ + S +++T+ K P+ E +++ + ++ L+ ++Y++ E+FH
Sbjct: 575 TWREIGSKLMKMQKTSNK-RPASEKAREVMRERL-HLDYQVYDFIKERFH 622
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 51/147 (34%), Positives = 67/147 (45%), Gaps = 14/147 (9%)
Query: 147 GDKTTFDECIRLNRTECSLENM--WLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLL 204
G TFDEC+ N EC+ +L F CG + C P A+EKAKEN+ Y +
Sbjct: 318 GRNLTFDECVFNNLWECNASGAKRFLMTQFFCGQDSICMEPSQA-AVEKAKENIRRHYAV 376
Query: 205 VGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEETVQQIKKSKIW 264
VGV EE + F+ +LE +P FFRG D + +H ++ VQ K IW
Sbjct: 377 VGVLEEFSSFLKVLEMVMPQFFRGAHDTWRGIASAH-KKYAAATQNWTGHVQDQKTLNIW 435
Query: 265 ELENELYEYALEQFHFVKKHNLVYNKV 291
N Y A LVYN+V
Sbjct: 436 PCFNHNYSMA----------KLVYNRV 452
>gi|198437288|ref|XP_002125232.1| PREDICTED: similar to predicted protein [Ciona intestinalis]
Length = 364
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 80/274 (29%), Positives = 132/274 (48%), Gaps = 20/274 (7%)
Query: 25 SWDTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTK 84
S V++YNRVPK ST+ +N + K +F +++VN + Q R N + +
Sbjct: 100 SSSAVVLYNRVPKCASTTMINTLNYLKTKLKFRLVNVNEPRIKMFMDAKAQARMANGILE 159
Query: 85 WRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRK 144
+ A++ H FI+F +FGSK P++IN++R P++R +SYYY++RYG V K
Sbjct: 160 LKS--DAIFVRHLHFINFSRFGSK-WPVYINVIRDPVERFISYYYYVRYGFQSNKGEVAK 216
Query: 145 K-------HGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKEN 197
K +EC+ + +C N + F CG C + +ALEKAK N
Sbjct: 217 KWKLDISPERQNMPLEECVMKDPDKCVRTNSVTMLAFFCGQENMCREYSD-YALEKAKIN 275
Query: 198 LVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEETVQQ 257
+ +G+ E+L + + A LP +F T L + RT K +PS
Sbjct: 276 AARYFTAIGLVEDLPNSFKIFHALLPRYF---TTTALIDTPAKDTRTFMKEEPSLLVKSL 332
Query: 258 IKKSKIWELENELYEYALEQFH----FVKKHNLV 287
++K+ ++ E Y + +F+ + K+NLV
Sbjct: 333 LRKA--LRVDVEFYNFIKRRFYKQLDALVKNNLV 364
>gi|260817950|ref|XP_002603848.1| hypothetical protein BRAFLDRAFT_240338 [Branchiostoma floridae]
gi|229289171|gb|EEN59859.1| hypothetical protein BRAFLDRAFT_240338 [Branchiostoma floridae]
Length = 265
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 84/265 (31%), Positives = 127/265 (47%), Gaps = 31/265 (11%)
Query: 30 IIYNRVPKTGSTSFVNMAYDMCRKKRF--------NVLHVNVTGNNHVLSLADQYRFVNN 81
I+YNR+PK GSTS + + +K +F + +N TG ++S D +
Sbjct: 6 ILYNRLPKCGSTSLKALTRRLAKKNQFYFKESKIWDQFQLNQTG---LVSKNDLSVVLKI 62
Query: 82 VTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRP-- 139
V H F+ + + P +IN++R PLDRLVS Y+F+RYG P
Sbjct: 63 VM-------CDIFEHATFVLYLSRLGLKSPKYINLVRDPLDRLVSSYHFMRYGRKGGPNH 115
Query: 140 ---HLVRKKHGD---KTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEK 193
HL K H + TFD C+ EC + + L F CG C P + ALEK
Sbjct: 116 IVTHLFNKYHNETDRNQTFDSCVLNKSKECWGQRVNLMTRFFCGQDPVCREP-SIQALEK 174
Query: 194 AKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEE 253
AKEN+ YL+VG+ E+ F+ +L LP F+RG TD + KS K P
Sbjct: 175 AKENIRRHYLVVGLLEDFNSFLKVLSRILPQFYRGVTDLWKDLGKSTW---TVKKQPPSP 231
Query: 254 TVQQIKKSKIWELENELYEYALEQF 278
Q++ + ++ +L+ +LY + ++F
Sbjct: 232 LAQKVMRQRM-DLDYQLYSFIEDRF 255
>gi|241999656|ref|XP_002434471.1| heparan sulfate 2-O-sulfotransferase, putative [Ixodes scapularis]
gi|215497801|gb|EEC07295.1| heparan sulfate 2-O-sulfotransferase, putative [Ixodes scapularis]
Length = 273
Score = 112 bits (279), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 66/202 (32%), Positives = 104/202 (51%), Gaps = 40/202 (19%)
Query: 30 IIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWRDRR 89
++YNRVPK GST+ V++ + + F+ +H + T N +LS Q +FV +++
Sbjct: 61 LLYNRVPKCGSTTLVHLLRRLSKSNGFSHVH-SKTYNQRLLSPEQQAQFVRDMSAA---- 115
Query: 90 PALYH--GHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKHG 147
PA Y H F+DF QFG + P ++N++R P+DRLVS +Y+ R
Sbjct: 116 PAPYSHDRHIYFVDFGQFG-RPSPAYVNVIRDPVDRLVSSFYYRR--------------- 159
Query: 148 DKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVGV 207
+ L +P+ CGH C G+PWAL A E++ +++VGV
Sbjct: 160 -----------------ATHRSLMMPYFCGHDVRCTTVGDPWALRTAMEHVDRHFVVVGV 202
Query: 208 TEELTDFVSLLEAALPSFFRGG 229
E++ ++LLE LP+FFRG
Sbjct: 203 LEDMNATLALLERRLPAFFRGA 224
>gi|313214836|emb|CBY41079.1| unnamed protein product [Oikopleura dioica]
Length = 209
Score = 112 bits (279), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 66/204 (32%), Positives = 111/204 (54%), Gaps = 12/204 (5%)
Query: 91 ALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKHGDKT 150
+Y H FIDF QF + P++IN++R P+DR S+YYF R+G+ R K +
Sbjct: 12 VMYTRHQYFIDFDQFEWAD-PVYINLIRDPVDRFASFYYFSRFGNKRAQDAGRTKQQVPS 70
Query: 151 TF-----DECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLV 205
D+CI RTEC+ E +W VP++CG+ +C + + A+++AK+N+ +KY++V
Sbjct: 71 NILNENIDDCITRRRTECT-EPIWHTVPYICGNDKSC-LQRHESAVQQAKQNIDSKYVVV 128
Query: 206 GVTEELTDFVSLLEAALPSFFRGGTDHFL-TSNKSHLRRTNRKIDPSEETVQQIKKSKIW 264
G+ EEL + +LE LP FF G +H N ++ + D + E +
Sbjct: 129 GILEELNGTLGVLEHVLPDFFEGAVNHLTEIRNDTYTVKKKALTDAAREY---LANETAL 185
Query: 265 ELENELYEYALEQFHFVKKHNLVY 288
+LE +LY +A + + K+N ++
Sbjct: 186 KLEYDLYNHAKGKHPVILKNNQLF 209
>gi|443693772|gb|ELT95052.1| hypothetical protein CAPTEDRAFT_225063 [Capitella teleta]
Length = 323
Score = 111 bits (277), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 79/257 (30%), Positives = 131/257 (50%), Gaps = 21/257 (8%)
Query: 30 IIYNRVPKTGSTSFVNMAYDMCRKKRFNVL----HVNVTGNNHVLSLADQYRFVNNVTKW 85
+ YNRVPK GS + A + + F + + + N+ +SL + N
Sbjct: 70 VFYNRVPKCGSRGLLYNARILSHRNHFTWISSKQYRDERLNHSKISLPRKLSRYNT---- 125
Query: 86 RDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKK 145
+ Y H +I+F + QP +IN++R P+DR++S+YYF+R+ + + +
Sbjct: 126 ---KAVFYDKHVHYINFTRL-HLPQPAYINLIRDPVDRMISWYYFIRFEKGHIRSMTDSE 181
Query: 146 HGDKTTFDECIRLNRTECSLE-NMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLL 204
+FD+C+R + +C N + VP+ CG C PG P +L +AK NL Y +
Sbjct: 182 RN--MSFDDCVRSSHPDCVHPFNYSVLVPYFCGLDEFCRYPG-PKSLAQAKANLKQHYTI 238
Query: 205 VGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLR---RTNRKIDPSEETVQQIKKS 261
VG+ +++ DF LE LP +FRG D + N+S +R +TN K S ET +K++
Sbjct: 239 VGLADQMEDFYWALERLLPDYFRGILDLY-RRNESKMRAEYKTNAKGTCSNETRSLLKRN 297
Query: 262 KIWELENELYEYALEQF 278
+ E E +LY +A + F
Sbjct: 298 ILIE-EYDLYHFARKMF 313
>gi|125841195|ref|XP_001336097.1| PREDICTED: uronyl 2-sulfotransferase-like [Danio rerio]
Length = 370
Score = 110 bits (275), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 86/269 (31%), Positives = 141/269 (52%), Gaps = 16/269 (5%)
Query: 24 LSWDTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVT 83
L + + ++YNRV K GS + V + + + +F ++ ++ N L+ +Q + N++
Sbjct: 87 LPFPSQVVYNRVGKCGSRTIVLLLRMLADRHQFTLVSSDIH-NKTRLTKREQENLIQNIS 145
Query: 84 KWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGD--NYRPHL 141
+P LY H F++F +F + E+P+ INI+R P+ R +S Y+F R+GD H+
Sbjct: 146 T--TPQPFLYTRHVHFLNFSRFKT-EEPVHINIIRDPISRFLSNYFFRRFGDWRGEENHV 202
Query: 142 VR----KKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKEN 197
VR K+ + CI + EC+ ++ +P+ CG C PG WALE AK+N
Sbjct: 203 VRTPGMKEDERYLDINTCILESYPECTNPRLFYIIPYFCGQHPQCREPGL-WALETAKQN 261
Query: 198 LVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNK---SHLRRTNRKIDPSEET 254
++ YLLVGV EEL D + LLE LP FF + + + +L T RK P+ E
Sbjct: 262 VLDHYLLVGVLEELEDVLLLLERLLPHFFSDVLNIYRSPEYRKLGNLTGTVRKQSPNPEA 321
Query: 255 VQQIKKSKIWELENELYEYALEQFHFVKK 283
++ + + E E Y++ QFH K+
Sbjct: 322 LRVLHQR--MEYEYRFYQFIRAQFHQTKR 348
>gi|313213100|emb|CBY36963.1| unnamed protein product [Oikopleura dioica]
Length = 343
Score = 110 bits (275), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 82/267 (30%), Positives = 125/267 (46%), Gaps = 31/267 (11%)
Query: 29 VIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVT---GNNHVLSLADQYRFVNNVTKW 85
+++YNRVPK GS + Y + R N HV G L+ ++ + ++++ +
Sbjct: 83 LLVYNRVPKCGSIWMTRLLYILGAGDR-NEYHVESPYEPGEKPFLTGDEEKKVIDHLKEV 141
Query: 86 RDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKK 145
+P +Y H FIDF + K +PL+IN++R P+++ S+YYF+R G+ +
Sbjct: 142 P--KPGVYIRHQYFIDFAEHKQK-RPLYINVIRDPVEKFRSFYYFIRNGN------LEGD 192
Query: 146 HGDK--------TTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKEN 197
GD ++C+ EC+ E W VP+ CG C N WA+ KAKEN
Sbjct: 193 GGDVPMSESKRLMNINDCVSRREKECT-EPKWQMVPYFCGQDPRCR-QRNSWAVTKAKEN 250
Query: 198 LVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTN-----RKIDPSE 252
+ Y VG+TEEL +SL E +P FF G D + R N K +
Sbjct: 251 IEKYYAAVGLTEELPASLSLFETLMPRFFHGAID---VKKEGEERIKNDTYTLNKAALTP 307
Query: 253 ETVQQIKKSKIWELENELYEYALEQFH 279
ETV K LE +LY + +F
Sbjct: 308 ETVDFFKTKTSIALEYDLYNFVKARFE 334
>gi|291238909|ref|XP_002739368.1| PREDICTED: uronyl-2-sulfotransferase-like [Saccoglossus
kowalevskii]
Length = 359
Score = 109 bits (273), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 81/269 (30%), Positives = 135/269 (50%), Gaps = 27/269 (10%)
Query: 30 IIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQ--YRFVNNVTKWRD 87
+IYNRV K GS + MA + N HV + H ++L +Q F++ V
Sbjct: 96 LIYNRVGKCGSRTL--MAVTERAAEWNNFKHVK-SQEYHGMTLTNQEEMEFISEVMSLNP 152
Query: 88 RRPALYHGHFGFIDFQQFGS--KEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKK 145
P +Y+ H +++F G P +IN++R P+DR +S +Y+ R+GD K+
Sbjct: 153 --PFIYNRHVRYVNFSSHGIDISSSPKYINLIRDPVDRKISTFYYTRFGDVKHEREDTKR 210
Query: 146 HGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLV 205
+ + TF+EC+ N EC+ + + +P+ CG AC + WALE AK N+V ++ V
Sbjct: 211 NAN-LTFEECVLDNHPECAAGKIGI-IPYFCGQVDACH-EDHEWALETAKRNVVENFVFV 267
Query: 206 GVTEELTDFVSLLEAALPSFFRGGTDHF--------LTSNKSHLRRTNRKIDPSEETVQQ 257
G+ E+ + + + LP FFR + LT KS R ++PSE+ V+
Sbjct: 268 GILEQFEISLKIWQYLLPQFFRSAPVAYQKIVRREALTKFKSKSR-----VEPSEK-VKA 321
Query: 258 IKKSKIWELENELYEYALEQFHFVKKHNL 286
I + ++ EL+ E Y + E+ ++K L
Sbjct: 322 IMRERM-ELDYEFYNFITERMRLIEKQIL 349
>gi|431904243|gb|ELK09640.1| Uronyl 2-sulfotransferase [Pteropus alecto]
Length = 273
Score = 109 bits (273), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 75/204 (36%), Positives = 113/204 (55%), Gaps = 13/204 (6%)
Query: 89 RPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGD--NYRPHLVR--- 143
+P L+ H F++F +FG +QP++INI+R P++R +S Y+F R+GD + H++R
Sbjct: 28 QPYLFTRHVHFLNFSRFGG-DQPVYINIIRDPVNRFLSNYFFRRFGDWRGEQNHMIRTPS 86
Query: 144 -KKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKY 202
++ +ECI N ECS ++ +P+ CG C PG WALE+AK N+ +
Sbjct: 87 MRQEERYLDINECILENYPECSNPRLFYIIPYFCGQHPRCREPGE-WALERAKLNVNENF 145
Query: 203 LLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRR---TNRKIDPSEETVQQIK 259
LLVG+ EEL D + LLE LP +F+G + L T RK PS E VQ +
Sbjct: 146 LLVGILEELEDVLLLLERFLPHYFKGVLSIYKDPEHRKLGNMTVTVRKTVPSPEAVQILY 205
Query: 260 KSKIWELENELYEYALEQFHFVKK 283
+ +E + Y Y EQFH +K+
Sbjct: 206 QRMRYEY--DFYHYVKEQFHLLKR 227
>gi|313229616|emb|CBY18431.1| unnamed protein product [Oikopleura dioica]
Length = 343
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 81/267 (30%), Positives = 125/267 (46%), Gaps = 31/267 (11%)
Query: 29 VIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVT---GNNHVLSLADQYRFVNNVTKW 85
+++YNRVPK GS + Y + R N HV G L+ ++ + ++++ +
Sbjct: 83 LLVYNRVPKCGSIWMTRLLYILGAGDR-NEYHVESPYEPGEKPFLTGDEEKKVIDHLKEV 141
Query: 86 RDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKK 145
+P +Y H FIDF + K +PL+IN++R P+++ S+YYF+R G+ +
Sbjct: 142 P--KPGVYIRHQYFIDFAEHKQK-RPLYINVIRDPVEKFRSFYYFIRNGN------LEGD 192
Query: 146 HGDK--------TTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKEN 197
GD ++C+ EC+ E W VP+ CG C N WA+ KAKEN
Sbjct: 193 GGDVPMSESKRLMNINDCVSRREKECT-EPKWQMVPYFCGQDPRCR-QRNSWAVTKAKEN 250
Query: 198 LVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTN-----RKIDPSE 252
+ Y VG+TEEL ++L E +P FF G D + R N K +
Sbjct: 251 IEKYYAAVGLTEELPASLALFETLMPRFFHGAID---VKKEGEERIKNDTYTLNKAALTP 307
Query: 253 ETVQQIKKSKIWELENELYEYALEQFH 279
ETV K LE +LY + +F
Sbjct: 308 ETVDFFKTKTSIALEYDLYNFVKARFE 334
>gi|326675440|ref|XP_002665179.2| PREDICTED: uronyl 2-sulfotransferase-like [Danio rerio]
Length = 288
Score = 108 bits (270), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 86/269 (31%), Positives = 139/269 (51%), Gaps = 16/269 (5%)
Query: 24 LSWDTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVT 83
L + + ++YNRV K GS + V + + + +F ++ ++ N L+ +Q + N++
Sbjct: 5 LPFPSQVVYNRVGKCGSRTIVLLLRMLADRHQFTLVSSDI-HNKTRLTKLEQENLIQNIS 63
Query: 84 KWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGD--NYRPHL 141
+P LY H F++F +F + E P+ INI+R P+ R +S Y+F R+GD H+
Sbjct: 64 T--TPQPFLYTRHVHFLNFSRFKTDE-PVHINIIRDPISRFLSNYFFRRFGDWRGEENHV 120
Query: 142 VR----KKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKEN 197
VR K+ + CI + EC+ ++ +P+ CG C PG WALE AK+N
Sbjct: 121 VRTPGMKEDERYLDINTCILESYPECTNPRLFYIIPYFCGQHPQCREPGL-WALETAKQN 179
Query: 198 LVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNK---SHLRRTNRKIDPSEET 254
++ YLLVGV EEL D + LLE LP FF + + + +L T RK P+ E
Sbjct: 180 VLDHYLLVGVLEELEDVLLLLERLLPHFFSDVLNIYRSPEYRKLGNLTGTVRKQSPNLEA 239
Query: 255 VQQIKKSKIWELENELYEYALEQFHFVKK 283
++ + + E E Y + QFH K+
Sbjct: 240 LRVLHQR--MEYEYRFYHFIRAQFHQTKR 266
>gi|327261977|ref|XP_003215803.1| PREDICTED: uronyl 2-sulfotransferase-like [Anolis carolinensis]
Length = 405
Score = 108 bits (269), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 72/215 (33%), Positives = 120/215 (55%), Gaps = 11/215 (5%)
Query: 19 PETDSLSWDTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRF 78
P L + + ++YNRV K GS + V + + + FN++ ++ N L+ +Q
Sbjct: 154 PPRKVLPFPSQVVYNRVGKCGSRTVVLLLRILSERHGFNLVTSDIH-NKTRLTKNEQMEL 212
Query: 79 VNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGD--N 136
+ N++ +P L+ H F++F +FG +QP++INI+R P++R +S Y+F R+GD
Sbjct: 213 IKNIST--AEQPYLFTRHVHFLNFSRFGG-DQPVYINIIRDPINRFLSNYFFRRFGDWRG 269
Query: 137 YRPHLVR----KKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALE 192
+ H++R ++ + CI N ECS ++ +P+ CG C PG WALE
Sbjct: 270 EQNHMIRTPSMRQEERYLDINVCILENYPECSNPRLFYIIPYFCGQHPRCREPGE-WALE 328
Query: 193 KAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFR 227
+AK N+ +LLVG+ EEL D + LLE LP +F+
Sbjct: 329 RAKANVNENFLLVGILEELEDVLLLLERFLPHYFK 363
>gi|291233801|ref|XP_002736843.1| PREDICTED: uronyl-2-sulfotransferase-like [Saccoglossus
kowalevskii]
Length = 495
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 74/262 (28%), Positives = 127/262 (48%), Gaps = 12/262 (4%)
Query: 30 IIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWRDRR 89
++YN V + GS S + + + + F + V N L+ Q V N+
Sbjct: 237 VVYNCVDQCGSQSVLAVIGILSFEHNFRSIWNRVRRYN--LTTKRQEAAVTNINT--RLP 292
Query: 90 PALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKHGDK 149
P +++ H ++DF FG P +IN++R+PL R +S+YY + + + +
Sbjct: 293 PYVFNRHIYYLDFDLFGI-HNPTYINLIREPLPRFLSWYYTHKINEPKVSGTRMYEPLNT 351
Query: 150 TTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVGVTE 209
+FDEC+ N CS +N + +P+ CGH+ C +P WALE AK+++V KY VGV E
Sbjct: 352 LSFDECVLTNNKVCSEKNAFQVIPYFCGHSPDCQLPSR-WALEMAKKHVVEKYAFVGVLE 410
Query: 210 ELTDFVSLLEAALPSFFRGGTDHFLTS----NKSHLRRTNRKIDPSEETVQQIKKSKIWE 265
+L + + E +P FF + + T K +T PS+ ++ + S+ E
Sbjct: 411 DLDSSLRIFEILMPQFFESASKVYKTMVMGLKKYEFYKTVPTPTPSDAAIRLM--SERLE 468
Query: 266 LENELYEYALEQFHFVKKHNLV 287
LE + Y + + +KK L+
Sbjct: 469 LEYQFYSFVRTRMELLKKQLLL 490
>gi|242018806|ref|XP_002429862.1| Uronyl 2-sulfotransferase, putative [Pediculus humanus corporis]
gi|212514891|gb|EEB17124.1| Uronyl 2-sulfotransferase, putative [Pediculus humanus corporis]
Length = 377
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 68/220 (30%), Positives = 114/220 (51%), Gaps = 21/220 (9%)
Query: 27 DTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNN-HVLSLADQYRFVNNVTKW 85
D V+ +NRVPK+GS V + + F HV + G+ LS+ +Q V +T
Sbjct: 76 DHVLFFNRVPKSGSEMLVLLLQWLQGPNGFR--HVRLPGSEKRSLSVFEQEELVEQITNT 133
Query: 86 R--DRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVR 143
+ P + H FI+F +F K+ P+++N++R P+++ +S +Y+ R + + VR
Sbjct: 134 EKAEAVPISFDRHVYFINFTKF-DKQWPIYVNLIRDPVEKAISRFYYARVTPDIKNPDVR 192
Query: 144 ----------KKHGDKT--TFDECIRLNRTECSL---ENMWLQVPFLCGHAAACWVPGNP 188
K DK F++C+ + EC+ N L +P+ CG C + +
Sbjct: 193 YALSKGLIPSSKSPDKKFENFEDCVMASDPECNFITGNNYDLAIPYFCGQEPKCRILNDE 252
Query: 189 WALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRG 228
WAL KAKEN+ + +VG+ EEL +++LE+ LP FF+G
Sbjct: 253 WALRKAKENVENYFPVVGILEELNMTLAVLESKLPMFFKG 292
>gi|242018996|ref|XP_002429954.1| Heparan sulfate 2-O-sulfotransferase pipe, putative [Pediculus
humanus corporis]
gi|212515002|gb|EEB17216.1| Heparan sulfate 2-O-sulfotransferase pipe, putative [Pediculus
humanus corporis]
Length = 296
Score = 105 bits (261), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 78/274 (28%), Positives = 129/274 (47%), Gaps = 32/274 (11%)
Query: 29 VIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLA--DQYRFVNNVTKWR 86
V+ +NRVPK GS +F+ + + K F H + + LA DQ + + +
Sbjct: 19 VVFFNRVPKVGSQTFMELLRRLSMKNNF-AFHRDHIQRVETIRLAPGDQMDLASMIAAYE 77
Query: 87 DRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKK- 145
P++Y H FI+F QF E P+++N++R P++R++S+YY++R Y + RK+
Sbjct: 78 P--PSVYVKHVCFINFTQFRLPE-PIYVNLVRDPVERVISWYYYVRAPWYY---VERKQA 131
Query: 146 ---------HGDKTTFDECIRLNRTEC---------SLENMWLQVPFLCGHAAACWVPGN 187
H K F+ C+ EC + + Q F CGH+ AC
Sbjct: 132 FPDIPLPDPHWLKKDFETCVLRGDRECKYVEGETHEGIGDHRRQSLFFCGHSDACTPFNT 191
Query: 188 PWALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRK 247
AL++AK + Y +VGV E+L ++LE +P FF+G F R NR
Sbjct: 192 QGALQRAKRAVEQHYAVVGVLEDLNSTFTVLENYIPRFFKGAAQVF-KDEVDRFARINRN 250
Query: 248 I--DPSEETVQQIKKSKIWELENELYEYALEQFH 279
+ P E V++I + + E E Y++ ++ H
Sbjct: 251 LFKPPVSEEVKEIVRRN-FTREIEFYQFCRQRLH 283
>gi|326915694|ref|XP_003204148.1| PREDICTED: uronyl 2-sulfotransferase-like [Meleagris gallopavo]
Length = 278
Score = 104 bits (260), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 74/206 (35%), Positives = 113/206 (54%), Gaps = 17/206 (8%)
Query: 89 RPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGD--NYRPHLVR--- 143
+P L+ H F++F +FG +QP++INI+R P++R +S Y+F R+GD + H++R
Sbjct: 12 QPYLFTRHVHFLNFSRFGG-DQPVYINIIRDPVNRFLSNYFFRRFGDWRGEQNHMIRTPS 70
Query: 144 -KKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKY 202
++ + CI N ECS ++ +P+ CG C PG WALE+AK N+ +
Sbjct: 71 MRQEERYLDINVCILENYPECSNPRLFYIIPYFCGQHPRCREPGE-WALERAKLNVNENF 129
Query: 203 LLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSH-----LRRTNRKIDPSEETVQQ 257
LLVG+ EEL D + LLE LP +F+ + N H L T +K PS E +Q
Sbjct: 130 LLVGILEELEDVLLLLERFLPHYFKDVLS--IYKNPEHRKLGNLTVTVKKTVPSPEAIQI 187
Query: 258 IKKSKIWELENELYEYALEQFHFVKK 283
+ + +E E Y Y EQFH +K+
Sbjct: 188 LYQRMRYEY--EFYYYVKEQFHLLKR 211
>gi|76154151|gb|AAX25649.2| SJCHGC08078 protein [Schistosoma japonicum]
Length = 154
Score = 104 bits (260), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 52/117 (44%), Positives = 71/117 (60%), Gaps = 5/117 (4%)
Query: 168 MWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFR 227
+W+QVP+ CG A C +PGN A+E AK +++ YL+VG+TEE FV LLE LPSFF
Sbjct: 5 LWVQVPYFCGQAMYCRIPGNLAAVETAKRHVIENYLIVGITEEFDKFVDLLEILLPSFFT 64
Query: 228 GGTDHFLTSNKSH---LRRTNRKIDPSEETVQQIKKSKIWELENELYEYALEQFHFV 281
G H L S H LRRTN K S+ T++ + + IW+ E + Y + +FH V
Sbjct: 65 GA--HSLRSRSKHKWYLRRTNLKFPISQATIKIYQGNPIWQAEQDFYNFVRTEFHAV 119
>gi|313238894|emb|CBY13889.1| unnamed protein product [Oikopleura dioica]
Length = 410
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 68/223 (30%), Positives = 108/223 (48%), Gaps = 19/223 (8%)
Query: 30 IIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTG----NNHVLSLADQYRFV-NNVTK 84
I +N++PK GS++ N+ + RK RFN +N + + + FV NNVT
Sbjct: 191 IFHNKLPKAGSSTMNNILIMLGRKNRFNYRKLNPHNLEELGDGLTAEGPLVNFVKNNVTS 250
Query: 85 WRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYG----DNYRPH 140
W P L H +DF+++ + +QP +IN++R P D S+YYF R+G ++ R
Sbjct: 251 W----PFLLLKHHLPMDFEKY-NIQQPTYINVIRDPADWFQSHYYFERFGWTRKEDDRGS 305
Query: 141 LVRKKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNP-----WALEKAK 195
+ + T D+C+ N +C W + F CG+A C A+ KA
Sbjct: 306 FIGSDEDKQRTVDQCVEQNNAQCMEPITWKYIEFFCGNAFPCNSRSGKDEITKQAMMKAM 365
Query: 196 ENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNK 238
N+ + VGV E+ D + L E LP FF G T+ + + +K
Sbjct: 366 HNVEHNFFAVGVLEQFDDTLKLFEKMLPRFFTGATEVYHSDSK 408
>gi|195354144|ref|XP_002043560.1| GM19105 [Drosophila sechellia]
gi|194127728|gb|EDW49771.1| GM19105 [Drosophila sechellia]
Length = 697
Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 78/276 (28%), Positives = 137/276 (49%), Gaps = 33/276 (11%)
Query: 30 IIYNRVPKTGSTSFVNMAYDMCRKKRFN-VLHVNVTGNNHVLSLADQYRFVNNVTKWRDR 88
+ +NRVPK GS S + + M R + N H G+ H + ++ R + + R
Sbjct: 114 VFFNRVPKVGSQSLMEL---MARLGKINGFTHARNKGSAHETIVMNKQRQNDLIADLLTR 170
Query: 89 -RPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKHG 147
+P +Y H +I+F +F +P++IN++R P+DR++S++Y++R YR ++ K G
Sbjct: 171 PKPHIYSQHIAYINFTRF-HLPKPIYINLIRDPIDRIISWHYYIRAPWYYRD--MQAKLG 227
Query: 148 DKTT-----------FDECIRLNRTECSLENMWLQVP---------FLCGHAAACWVPGN 187
DK D C+R + C+ M ++ P F CG +P N
Sbjct: 228 DKAIPMPSEEFMNLDLDTCVRNHDPHCTFTQMQIKNPVGDHRRQTLFFCGMNQKLCMPFN 287
Query: 188 P-WALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGG-TDHFLTSNK-SHLRRT 244
A++KAK + T+Y +VG E+ +S+LEA +P FFR ++L ++ S + R
Sbjct: 288 SEAAMQKAKRTVETEYAVVGTWEDTNITLSVLEAYIPRFFRNAKVAYYLGKDRLSRVNRN 347
Query: 245 NRKIDPSEETVQQIKKSKIWELENELYEYALEQFHF 280
N S+ET ++K+ E+ E YE+ ++ +
Sbjct: 348 NVTRIVSDETRLILRKNLTNEI--EFYEFCKQRLYL 381
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 53/219 (24%), Positives = 109/219 (49%), Gaps = 17/219 (7%)
Query: 29 VIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVL-SLADQYRFVNNVTKWRD 87
+I YNRVPKTGS + + + + +K F + + + Q + + ++
Sbjct: 418 IIFYNRVPKTGSETLIELMIQLGKKNDFQNERSPFSKPTGIYWGVERQKEEATRILELQE 477
Query: 88 RRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYG-DNYRPHLVRKKH 146
+Y H +++ + F QP++IN++R P++R++S++Y+ R ++ + + V K
Sbjct: 478 EPAFVYVEHMNYMNIRPFNLP-QPIYINMIRDPVERVISWFYYKRTPWNSVKMYKVTGKF 536
Query: 147 GDKT----TFDECIRLNRTECSLENMWL----------QVPFLCGHAAACWVPGNPWALE 192
G++T F++C+ + EC + + Q F CGH+ C P A+
Sbjct: 537 GNRTHYTKNFEDCVLTHDPECRYDYGLMFKDDSADHKRQSLFFCGHSPICEPFNTPAAIA 596
Query: 193 KAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTD 231
+AK+N+ + ++G E+ +++LE +P FF+G +
Sbjct: 597 RAKQNIERDFSVIGSWEDTNVTLTVLEHYIPRFFKGSME 635
>gi|195496280|ref|XP_002095626.1| GE22506 [Drosophila yakuba]
gi|194181727|gb|EDW95338.1| GE22506 [Drosophila yakuba]
Length = 968
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 78/276 (28%), Positives = 137/276 (49%), Gaps = 33/276 (11%)
Query: 30 IIYNRVPKTGSTSFVNMAYDMCRKKRFN-VLHVNVTGNNHVLSLADQYRFVNNVTKWRDR 88
+ +NRVPK GS S + + M R + N H G+ H L ++ R + + R
Sbjct: 114 VFFNRVPKVGSQSLMEL---MARLGKINGFTHARNKGSAHETVLMNKQRQNDLIADLLTR 170
Query: 89 -RPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKHG 147
+P +Y H +I+F +F +P++IN++R P+DR++S++Y++R YR ++ K G
Sbjct: 171 PKPHIYSQHIAYINFTRFHLP-KPIYINLIRDPIDRIISWHYYVRAPWYYRD--MQAKLG 227
Query: 148 DKTT-----------FDECIRLNRTECSLENMWLQVP---------FLCGHAAACWVPGN 187
+K D C+R + C+ M ++ P F CG +P N
Sbjct: 228 EKAIPTPSDEFMNLDLDTCVRNHDPHCTFTQMQVKNPVGDHRRQTLFFCGMNQKLCMPFN 287
Query: 188 PW-ALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGG-TDHFLTSNK-SHLRRT 244
A++KAK + T+Y +VG E+ +S+LEA +P FFR ++L ++ S + R
Sbjct: 288 SEEAMQKAKRTVETEYAVVGTWEDTNITLSVLEAYIPRFFRNAKVAYYLGKDRLSRVNRN 347
Query: 245 NRKIDPSEETVQQIKKSKIWELENELYEYALEQFHF 280
N S+ET ++K+ E+ E YE+ ++ +
Sbjct: 348 NVTRTVSDETKLILRKNLTNEI--EFYEFCKQRLYL 381
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 59/232 (25%), Positives = 110/232 (47%), Gaps = 43/232 (18%)
Query: 29 VIIYNRVPKTGSTSFVNMAYDMCRKKRFN----------VLHVNVTGNN----HVLSLAD 74
+I YNRVPKTGS + + + +K F ++ NV H+L L +
Sbjct: 418 IIFYNRVPKTGSETLTELMVRLGKKNDFQNERSPFSKPTGIYWNVERQKEEAMHILDLQE 477
Query: 75 QYRFVNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYG 134
+ FV Y H +++ + F QP++IN++R P++R++S++Y+ R
Sbjct: 478 EPAFV-------------YVEHMNYMNIRPFHLP-QPIYINMIRDPVERVISWFYYRRTP 523
Query: 135 -DNYRPHLVRKKHGDKT----TFDECIRLNRTECSLENMWL----------QVPFLCGHA 179
++ + + V + ++T F+EC+ + EC + + Q F CGH+
Sbjct: 524 WNSVKMYEVTGEFQNRTFYTKNFEECVLTHDVECRYDYGLMFKDEFADHKRQSLFFCGHS 583
Query: 180 AACWVPGNPWALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTD 231
C P A+ +AK+N+ + +VG E+ +++LE +P FF+G +
Sbjct: 584 PICEPFNTPAAIARAKQNVERDFSVVGSWEDTNVTLTVLEHYIPRFFKGSME 635
Score = 71.6 bits (174), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 62/240 (25%), Positives = 115/240 (47%), Gaps = 23/240 (9%)
Query: 30 IIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVL-SLADQYRFVNNVTKWRDR 88
I YNR+ KTGS S + + + F V + + S D+ V + + +
Sbjct: 677 IFYNRLEKTGSQSMTRLIKQLGDRLGFETYRNIVRPSKSITESEEDENDLVEQLFELGEH 736
Query: 89 RPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVR----- 143
A+Y H +++F + S +P++IN++R P+ +++S YY+ R+ + L+R
Sbjct: 737 --AVYVEHANWVNFTKHESP-RPIYINMIRHPIQKVISAYYYQRHPLIFAQSLMRNPNKP 793
Query: 144 ---KKHGDKTTFDECIRLNRTE--CSLE------NMWLQVPF-LCGHAAACWVPGNPWAL 191
KK D TTF++C+R NR C + W + LCG++ C +
Sbjct: 794 MQNKKFFD-TTFNDCVR-NRVRPYCVFDAHNPFNGDWRRFSLHLCGNSEICTHFNSETTT 851
Query: 192 EKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPS 251
+ AK N+ +Y +VG E+ +++LEA +P +F T + + + ++ T+ + PS
Sbjct: 852 QIAKMNVEREYAVVGSWEDTNVTLAVLEAYIPRYFTDATKVYYSMSIPSIQETSANVKPS 911
>gi|350421797|ref|XP_003492960.1| PREDICTED: heparan sulfate 2-O-sulfotransferase pipe-like [Bombus
impatiens]
Length = 398
Score = 101 bits (251), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 76/271 (28%), Positives = 132/271 (48%), Gaps = 24/271 (8%)
Query: 28 TVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHV-LSLADQYRFVNNVTKWR 86
+V+ +NRVPK GS +F+ + + + F+ V + L+ +Q + V+ +
Sbjct: 113 SVLFFNRVPKVGSQTFMELLRRLSMRNGFSFNRDRVQRVETIRLAPIEQLQLARMVSSYS 172
Query: 87 DRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYR------PH 140
+ P++Y H F +F +F QP++INI+R P++R++S+YY++R Y P
Sbjct: 173 E--PSVYIKHVCFTNFTEFNLP-QPIYINIVRDPVERVISWYYYVRAPWYYVERKQIFPD 229
Query: 141 L-VRKKHGDKTTFDECIRLNRTEC---------SLENMWLQVPFLCGHAAACWVPGNPWA 190
L + + K F+ C+ EC + + Q F CGH+ C A
Sbjct: 230 LPLPDPNWLKKDFESCVLKADRECRYLEGEIHEGIGDHRRQTLFFCGHSEKCTPFNTVGA 289
Query: 191 LEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKI-- 248
LE+AK + Y +VGV E++ +++LE +P FFRG TD + + R NR
Sbjct: 290 LERAKMAVEKHYAVVGVLEDVNTTLTVLENYIPRFFRGATDVYYDEVNA-FTRINRNFFK 348
Query: 249 DPSEETVQQIKKSKIWELENELYEYALEQFH 279
P E V+ I +S + E E Y++ ++ +
Sbjct: 349 PPVSEEVKDIVRSN-FTREIEFYQFCKQRLY 378
>gi|167537769|ref|XP_001750552.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163770973|gb|EDQ84648.1| predicted protein [Monosiga brevicollis MX1]
Length = 303
Score = 101 bits (251), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 73/232 (31%), Positives = 117/232 (50%), Gaps = 23/232 (9%)
Query: 16 SPSPETDSLSWDTVIIYNRVPKTGSTSFVNMAYDMCR--KKRFNVLHVNVTGNNHVLSLA 73
SP P T D ++ YNR+PK G TS V +A+ + K+ F+ + + N
Sbjct: 26 SPVPITS----DDILFYNRIPKAGGTSLVRLAHAIADSPKRPFSAVWQIMDDRN----FF 77
Query: 74 DQYRFVNNVTKWRDR------RPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSY 127
D+ + +NV + RP Y H ++F +FG + QP+ INI+R+P +R+ S
Sbjct: 78 DEDKERSNVGSVMAKKHHHADRPMFYEQHVRLLNFSRFGYR-QPIHINIVREPTERVQSS 136
Query: 128 YYFLRYGDNYRPHLVRKKHGDKT--TFDECIRLNRTECS----LENMWLQVPFLCGHAAA 181
YY+ RY + +R G++ + ++CI S ++NM L V F CGH
Sbjct: 137 YYYQRYANIPYTAQLRSWLGEQFDWSINKCIEAEYGCKSWPWMIDNMNLMVGFFCGHHED 196
Query: 182 CWVPGNPWALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHF 233
C + ALE+A+ N++ Y +VGVTE + V +L LP+FF+ D +
Sbjct: 197 CRDRTSAIALERAQANVLHHYAVVGVTERYNESVWMLSQVLPTFFKTLPDFY 248
>gi|340716106|ref|XP_003396543.1| PREDICTED: LOW QUALITY PROTEIN: heparan sulfate
2-O-sulfotransferase pipe-like [Bombus terrestris]
Length = 402
Score = 100 bits (250), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 76/271 (28%), Positives = 134/271 (49%), Gaps = 24/271 (8%)
Query: 28 TVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHV-LSLADQYRFVNNVTKWR 86
+V+ +NRVPK GS +F+ + + + F+ V + L+ +Q + V+ +
Sbjct: 117 SVLFFNRVPKVGSQTFMELLRRLSMRNGFSFNRDRVQRVETIRLAPIEQLQLARMVSSYS 176
Query: 87 DRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLR----YGDNYR--PH 140
+ P++Y H F +F +F QP++INI+R P++R++S+YY++R Y + + P
Sbjct: 177 E--PSVYIKHVCFTNFTEFNLP-QPIYINIVRDPVERVISWYYYVRAPWYYVERKQIFPD 233
Query: 141 L-VRKKHGDKTTFDECIRLNRTEC---------SLENMWLQVPFLCGHAAACWVPGNPWA 190
L + + K F+ C+ EC + + Q F CGH+ C A
Sbjct: 234 LPLPDPNWLKKDFESCVLKADRECRYLEGEIHEGIGDHRRQTLFFCGHSEKCTPFNTVGA 293
Query: 191 LEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKI-- 248
LE+AK + Y +VGV E++ +++LE +P FFRG TD + + R NR
Sbjct: 294 LERAKMAVEKHYAVVGVLEDVNTTLTVLENYIPRFFRGATDVYYDEVNA-FTRINRNFFK 352
Query: 249 DPSEETVQQIKKSKIWELENELYEYALEQFH 279
P E V+ I +S + E E Y++ ++ +
Sbjct: 353 PPVSEEVKDIVRSN-FTREIEFYQFCKQRLY 382
>gi|391328701|ref|XP_003738823.1| PREDICTED: uronyl 2-sulfotransferase-like [Metaseiulus
occidentalis]
Length = 271
Score = 100 bits (249), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 60/200 (30%), Positives = 97/200 (48%), Gaps = 36/200 (18%)
Query: 32 YNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWRDRRPA 91
+NR+PK STS +N+ + + K F +V N +L+ Q +V + P
Sbjct: 57 FNRIPKAASTSMLNILHALSAKNNFTHKSSSVY-NMRILTSEQQNELAASVVE--SEAPV 113
Query: 92 LYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKHGDKTT 151
+ H F++F G + P+FIN++R P++R++S YY+ R V +KH
Sbjct: 114 TFDRHVHFVNFNSLGF-DSPIFINMVRDPVERIISDYYYRRS--------VARKH----- 159
Query: 152 FDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVGVTEEL 211
VP+ CGH C V + WALE+AK+N+ Y +VG+ E L
Sbjct: 160 -------------------MVPYFCGHHQKCMVVNDEWALEQAKKNIERYYDVVGLVEML 200
Query: 212 TDFVSLLEAALPSFFRGGTD 231
+ +++LE LP FF G ++
Sbjct: 201 AETIAVLEKRLPQFFSGASE 220
>gi|312383105|gb|EFR28315.1| hypothetical protein AND_03941 [Anopheles darlingi]
Length = 421
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 74/267 (27%), Positives = 131/267 (49%), Gaps = 40/267 (14%)
Query: 29 VIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLA-----DQYRFVNNVT 83
+I +NRVPK GS +F+ + + + +N H + V+ L+ D V N+
Sbjct: 146 MIFFNRVPKVGSQTFMELLRRLAVRNEYN-FHRDAVQRLEVIRLSLERQQDLAEMVMNLP 204
Query: 84 KWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVR 143
P++Y H + +F +FG P+++N++R P++R++S+YY++R Y V
Sbjct: 205 V-----PSVYVKHVCYTNFTRFGLP-MPIYVNMVRDPVERIISWYYYVRAPWYY----VE 254
Query: 144 KKHG-----------DKTTFDECIRLNRTECS---------LENMWLQVPFLCGHAAACW 183
+K K F+ C+ EC+ + + Q F CGH AC
Sbjct: 255 RKQAFPDLPLPDPRWLKKDFETCVLQGDPECTYTQNVVHEGIGDHRRQTLFFCGHDEACL 314
Query: 184 VPGNPWALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRR 243
+P ALE+AK + ++Y +VGV E+L +S+LE +P FF G + + + + LR+
Sbjct: 315 PFNSPGALERAKYAVESQYAVVGVLEDLNTTLSVLEQYVPRFFAGASSIYF-NEVNVLRK 373
Query: 244 TNR---KIDPSEETVQQIKKSKIWELE 267
N+ K SEE + ++++ E+E
Sbjct: 374 INKNNFKPPVSEEIKELVRRNFTKEIE 400
>gi|380012956|ref|XP_003690538.1| PREDICTED: heparan sulfate 2-O-sulfotransferase pipe-like [Apis
florea]
Length = 400
Score = 99.8 bits (247), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 75/271 (27%), Positives = 132/271 (48%), Gaps = 24/271 (8%)
Query: 28 TVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHV-LSLADQYRFVNNVTKWR 86
+V+ +NRVPK GS +F+ + + + F+ V + L+ +Q + V+ +
Sbjct: 115 SVLFFNRVPKVGSQTFMELLRRLSLRNGFSFNRDRVQRVETIRLAPIEQLQLATMVSSYS 174
Query: 87 DRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYR------PH 140
+ P++Y H F +F +F QP++INI+R P++R++S+YY++R Y P
Sbjct: 175 E--PSVYIKHVCFTNFTEFNLP-QPIYINIVRDPVERVISWYYYVRAPWYYVERKQIFPD 231
Query: 141 L-VRKKHGDKTTFDECIRLNRTEC---------SLENMWLQVPFLCGHAAACWVPGNPWA 190
L + + K F+ C+ EC + + Q F CGH+ C A
Sbjct: 232 LPLPDPNWLKKDFESCVLKADRECRYLEGEIHEGIGDHRRQTLFFCGHSEKCTPFNTMGA 291
Query: 191 LEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKI-- 248
LE+AK + Y +VGV E++ +++LE +P FFRG TD + + R NR
Sbjct: 292 LERAKMAVEKHYAVVGVLEDVNSTLTVLENYIPRFFRGATDVYYDEVNA-FTRINRNFFK 350
Query: 249 DPSEETVQQIKKSKIWELENELYEYALEQFH 279
P E V+ + +S + E E Y++ ++ +
Sbjct: 351 PPVSEEVKDMVRSN-FTREIEFYQFCRQRLY 380
>gi|328777404|ref|XP_395133.3| PREDICTED: heparan sulfate 2-O-sulfotransferase pipe-like [Apis
mellifera]
Length = 400
Score = 99.8 bits (247), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 75/271 (27%), Positives = 132/271 (48%), Gaps = 24/271 (8%)
Query: 28 TVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHV-LSLADQYRFVNNVTKWR 86
+V+ +NRVPK GS +F+ + + + F+ V + L+ +Q + V+ +
Sbjct: 115 SVLFFNRVPKVGSQTFMELLRRLSLRNGFSFNRDRVQRVETIRLAPIEQLQLATMVSSYS 174
Query: 87 DRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYR------PH 140
+ P++Y H F +F +F QP++INI+R P++R++S+YY++R Y P
Sbjct: 175 E--PSVYIKHVCFTNFTEFNLP-QPIYINIVRDPVERVISWYYYVRAPWYYVERKQIFPD 231
Query: 141 L-VRKKHGDKTTFDECIRLNRTEC---------SLENMWLQVPFLCGHAAACWVPGNPWA 190
L + + K F+ C+ EC + + Q F CGH+ C A
Sbjct: 232 LPLPDPNWLKKDFESCVLKADRECRYLEGEIHEGIGDHRRQTLFFCGHSEKCTPFNTMGA 291
Query: 191 LEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKI-- 248
LE+AK + Y +VGV E++ +++LE +P FFRG TD + + R NR
Sbjct: 292 LERAKMAVEKHYAVVGVLEDVNSTLTVLENYIPRFFRGATDVYYDEVNA-FTRINRNFFK 350
Query: 249 DPSEETVQQIKKSKIWELENELYEYALEQFH 279
P E V+ + +S + E E Y++ ++ +
Sbjct: 351 PPVSEEVKDMVRSN-FTREIEFYQFCRQRLY 380
>gi|313212461|emb|CBY36436.1| unnamed protein product [Oikopleura dioica]
Length = 250
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 73/246 (29%), Positives = 118/246 (47%), Gaps = 17/246 (6%)
Query: 43 FVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWRDRRPALYHGHFGFIDF 102
+AYD+ K +F V G S +Q F V + + P +Y H ++DF
Sbjct: 1 MTQLAYDLGGKNQFKVESPYEPGEKQTKSQEEQDAFRKYV--FDQKPPYMYIRHQNYVDF 58
Query: 103 QQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKHGD---KTTFDECIRLN 159
KE+ +IN++R P+ R S+YYF R+G+N R K D K T D+C+
Sbjct: 59 WDPVEKEKVAYINMIRDPIARFESFYYFSRFGNNLGGG-GRAKLNDERKKETVDDCVAKK 117
Query: 160 RTECSLENMWLQVPFLCGHAA--ACWVPGNPWALEKAKENLVTKYLLVGVTEELTDFVSL 217
R EC ++ W VP+LCG C + WA+++AK N+ Y VG+ +EL +++
Sbjct: 118 RQEC-VKPWWQIVPYLCGQVTDPRC-QERDQWAVDRAKYNIDQNYAFVGLLDELEMSLAV 175
Query: 218 LEAALPSFFRGG-----TDHFLTSNKSHLRRTNRKIDPSEETVQQIKKSKIWELENELYE 272
LE LP F++ D F+ L T K SE++ + + + E ++Y+
Sbjct: 176 LEQLLPEFYKDARSLVKQDSFVKMKNGTL--TTFKKPASEKSREYLMTQTSLKYEYQIYD 233
Query: 273 YALEQF 278
+ LE+
Sbjct: 234 HVLEKL 239
>gi|195173240|ref|XP_002027401.1| GL20902 [Drosophila persimilis]
gi|194113253|gb|EDW35296.1| GL20902 [Drosophila persimilis]
Length = 357
Score = 98.6 bits (244), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 74/275 (26%), Positives = 134/275 (48%), Gaps = 31/275 (11%)
Query: 30 IIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWRDR- 88
+ +NRVPK GS S + + + + FN H G H L ++ + ++ R
Sbjct: 59 VFFNRVPKVGSQSLMELMARLGKINGFN--HARNKGGAHETVLMNKQHQSDLLSDLLTRP 116
Query: 89 RPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKHGD 148
+P +Y H +I+F +F +P++IN++R P+DR++S++Y++R YR ++ K GD
Sbjct: 117 KPHIYSQHIAYINFTRF-HLPRPIYINLVRDPIDRIISWHYYIRARWYYRD--MQAKLGD 173
Query: 149 KT-----------TFDECIRLNRTECSLENMWL---------QVPFLCGHAAACWVPGNP 188
K D C+R C+ M + Q F CG +P N
Sbjct: 174 KAPAMPSDEFLDMDLDTCVRNKDRHCTFNQMQIKNEAGDHRRQTLFFCGMNQKLCMPFNS 233
Query: 189 -WALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGG-TDHFLTSNK-SHLRRTN 245
A++KAK + ++Y +VG E+ +++LEA +P +FR ++L ++ S + R N
Sbjct: 234 EMAMQKAKRTVESEYAVVGTWEDTNITLAVLEAYIPRYFRNAKVAYYLGKDRLSRVNRNN 293
Query: 246 RKIDPSEETVQQIKKSKIWELENELYEYALEQFHF 280
S+ET ++K+ E+ E YE+ ++ +
Sbjct: 294 VTRIVSDETRLILRKNLTNEI--EFYEFCKQRLYL 326
>gi|443716227|gb|ELU07852.1| hypothetical protein CAPTEDRAFT_35712, partial [Capitella teleta]
Length = 256
Score = 98.2 bits (243), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 76/257 (29%), Positives = 117/257 (45%), Gaps = 27/257 (10%)
Query: 30 IIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVN--VTGNNHVLSLADQYRFVNNVTKWRD 87
+ YNR PK GS + ++ + +K + V VT + +F ++ +
Sbjct: 13 VFYNRAPKCGSRTVLSTFKLLAKKLNYTVYDKEKPVTNRPFISDDMGLRKFASDFDELAA 72
Query: 88 RRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKHG 147
P LY H F +F F K P FIN++R P++ LVS+Y+F +G P K
Sbjct: 73 --PFLYAQHIHFFNFTTF-HKVAPSFINVIRDPIEGLVSHYFFNAFGSKNAPLSTPKPPY 129
Query: 148 DKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVGV 207
+ + C+R + C N+ +PF CGH AC NP +L +AK + YL+VG+
Sbjct: 130 NMV--NACVRHRLSHCM--NIHKLIPFFCGHDKACRS-SNPSSLARAKRAVKESYLIVGL 184
Query: 208 TEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEETVQQIKKSKIWE-- 265
TE+L F+ LE +P FFR + F + + E + +KK ++ E
Sbjct: 185 TEDLHAFMESLETLMPQFFRNASAVFALQDSALY-----------EHYRTVKKPRVTEAT 233
Query: 266 ---LENEL-YEYALEQF 278
L N L YEY F
Sbjct: 234 KSILRNHLKYEYDFYNF 250
>gi|313224755|emb|CBY20546.1| unnamed protein product [Oikopleura dioica]
Length = 358
Score = 98.2 bits (243), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 69/247 (27%), Positives = 110/247 (44%), Gaps = 26/247 (10%)
Query: 16 SPSPETDSLSWDTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQ 75
+P P T + +N++PK GST+ N+ + RK F + ++G V+ D+
Sbjct: 82 APVPNT------KFVFHNKLPKCGSTTMHNIVGLLSRKNNFTYWKI-MSG---VMKFTDE 131
Query: 76 YRFVNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGD 135
+ + K R R P H ++DF ++ S QP F+N++R P+ S+Y F+R+G
Sbjct: 132 ETLIQAL-KMRYREPFFLLQHHFWMDFNKY-SMHQPTFVNMIRDPISWFQSHYTFMRFGM 189
Query: 136 NYRPHLVRKKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPW------ 189
N K G + D+CI+ C + N W + F CG C +
Sbjct: 190 NKGRGENDPKLG--SDIDDCIKNKEKNC-VSNQWTYIEFFCGSEKLCATMKAAYDSGDLD 246
Query: 190 ----ALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTN 245
LE K +V Y +VG+ E+ D + L E LP F++G D + S+ RR
Sbjct: 247 VKRQVLETVKRRVVNDYFIVGILEQFEDTLQLFETMLPMFYKGAMDAW-KSDYIQFRRNQ 305
Query: 246 RKIDPSE 252
K E
Sbjct: 306 TKTTDKE 312
>gi|443722835|gb|ELU11537.1| hypothetical protein CAPTEDRAFT_215347 [Capitella teleta]
Length = 639
Score = 98.2 bits (243), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 77/267 (28%), Positives = 131/267 (49%), Gaps = 19/267 (7%)
Query: 18 SPETDSLSWDTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYR 77
SP +L I+YNRVPK GS++ ++ + F + + V+S ADQ
Sbjct: 47 SPFNATLGKAKRIVYNRVPKCGSSAVESVLRHLAVLNGFTYYRSKLF-RDPVISEADQLA 105
Query: 78 FVNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLR---YG 134
+ P +Y H F+D ++ + P+++N++R P+++ VS +Y+ R Y
Sbjct: 106 LAKKLASIP--APFIYDRHIHFVDLGKYWNV-PPVYLNLVRDPIEQRVSAFYYRRTMLYN 162
Query: 135 DNYRPHLVRKKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKA 194
N + K TFDEC+ EC+ + + ++CG C G+ AL+ A
Sbjct: 163 QN------KPKGWLNMTFDECVAKGGLECT-GPYAVSLGYICGQEPHCRYIGSD-ALKDA 214
Query: 195 KENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHF--LTSNKSHLRRTNRKIDPSE 252
K ++ Y +VG+TE+L F+ +LE +P FF+G D + L + RT + SE
Sbjct: 215 KRHIENDYAIVGITEDLEAFLFVLEKTIPHFFKGALDIYEPLKAGLLSKYRTKNRGTISE 274
Query: 253 ETVQQIKKSKIWELENELYEYALEQFH 279
E+ + + S I E E YE+A ++F+
Sbjct: 275 ESREIL--SDILRDELEFYEFARQRFY 299
>gi|442633347|ref|NP_788535.2| pipe, isoform N [Drosophila melanogaster]
gi|440216000|gb|AAO41231.2| pipe, isoform N [Drosophila melanogaster]
Length = 513
Score = 98.2 bits (243), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 75/276 (27%), Positives = 136/276 (49%), Gaps = 33/276 (11%)
Query: 30 IIYNRVPKTGSTSFVNMAYDMCRKKRFN-VLHVNVTGNNHVLSLADQYRFVNNVTKWRDR 88
+ +NRVPK GS S + + M R + N H G+ H + ++ R + + R
Sbjct: 216 VFFNRVPKVGSQSLMEL---MARLGKINGFTHARNKGSAHETIVMNKQRQNDLIADLLTR 272
Query: 89 -RPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKHG 147
+P +Y H +I+F +F +P++IN++R P+DR++S++Y++R YR ++ K G
Sbjct: 273 PKPHIYSQHIAYINFTRF-HLPKPIYINLIRDPIDRIISWHYYIRAPWYYRD--MQAKLG 329
Query: 148 DKTT-----------FDECIRLNRTECSLENMWLQVP---------FLCGHAAACWVPGN 187
+ D C+R + C+ M ++ P F CG +P N
Sbjct: 330 ENAIPMPSEEFMNLDLDTCVRNHDPHCTFTQMQIKNPVGDHRRQTLFFCGMNQKLCMPFN 389
Query: 188 P-WALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGG-TDHFLTSNK-SHLRRT 244
A++KAK + T+Y +VG E+ +S+LEA +P +FR ++L ++ S + R
Sbjct: 390 SEAAMQKAKRTVETEYAVVGTWEDTNITLSVLEAYIPRYFRNAKVAYYLGKDRLSRVNRN 449
Query: 245 NRKIDPSEETVQQIKKSKIWELENELYEYALEQFHF 280
N S+ET ++K+ E+ E YE+ ++ +
Sbjct: 450 NVTRIVSDETRLILRKNLTNEI--EFYEFCKQRLYL 483
>gi|28574851|ref|NP_788536.1| pipe, isoform L [Drosophila melanogaster]
gi|67460945|sp|Q86BJ3.1|PIPE_DROME RecName: Full=Heparan sulfate 2-O-sulfotransferase pipe
gi|28380477|gb|AAO41232.1| pipe, isoform L [Drosophila melanogaster]
Length = 514
Score = 98.2 bits (243), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 75/276 (27%), Positives = 136/276 (49%), Gaps = 33/276 (11%)
Query: 30 IIYNRVPKTGSTSFVNMAYDMCRKKRFN-VLHVNVTGNNHVLSLADQYRFVNNVTKWRDR 88
+ +NRVPK GS S + + M R + N H G+ H + ++ R + + R
Sbjct: 217 VFFNRVPKVGSQSLMEL---MARLGKINGFTHARNKGSAHETIVMNKQRQNDLIADLLTR 273
Query: 89 -RPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKHG 147
+P +Y H +I+F +F +P++IN++R P+DR++S++Y++R YR ++ K G
Sbjct: 274 PKPHIYSQHIAYINFTRF-HLPKPIYINLIRDPIDRIISWHYYIRAPWYYRD--MQAKLG 330
Query: 148 DKTT-----------FDECIRLNRTECSLENMWLQVP---------FLCGHAAACWVPGN 187
+ D C+R + C+ M ++ P F CG +P N
Sbjct: 331 ENAIPMPSEEFMNLDLDTCVRNHDPHCTFTQMQIKNPVGDHRRQTLFFCGMNQKLCMPFN 390
Query: 188 P-WALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGG-TDHFLTSNK-SHLRRT 244
A++KAK + T+Y +VG E+ +S+LEA +P +FR ++L ++ S + R
Sbjct: 391 SEAAMQKAKRTVETEYAVVGTWEDTNITLSVLEAYIPRYFRNAKVAYYLGKDRLSRVNRN 450
Query: 245 NRKIDPSEETVQQIKKSKIWELENELYEYALEQFHF 280
N S+ET ++K+ E+ E YE+ ++ +
Sbjct: 451 NVTRIVSDETRLILRKNLTNEI--EFYEFCKQRLYL 484
>gi|313237158|emb|CBY12378.1| unnamed protein product [Oikopleura dioica]
Length = 451
Score = 98.2 bits (243), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 76/276 (27%), Positives = 127/276 (46%), Gaps = 26/276 (9%)
Query: 30 IIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWRDRR 89
I +N++PK GST F + + + +F + V G DQ + N + +
Sbjct: 178 IFHNKMPKCGSTMFQKLLHKLSIVNKFTFMDVYEPGTR------DQDLVLVNKIRGNFKP 231
Query: 90 PALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKK---- 145
P H +++F ++ K P INI+R P+D S YYF R G +P K+
Sbjct: 232 PMAIMKHHFWMNFTKYHLK-TPTVINIVRNPVDWFASEYYFCRNGWERKPDYKGKECQNM 290
Query: 146 --HGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPW-----ALEKAKENL 198
H K T +EC+R R EC N+ + F+CG+ C + ALE AK L
Sbjct: 291 SEHDLKMTLEECVRAKRPECKTPNIEY-IEFICGNHEICKANQQNYQKKRLALEMAKIRL 349
Query: 199 VTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEETVQQI 258
+ +Y +VG E++ + + LLE LP FF G F + + + R ++ + Q +
Sbjct: 350 LKEYYIVGTLEKIEESLRLLEHTLPQFFSGVLGVFREQDVQEVANSTRTLE-KPQLSQHL 408
Query: 259 KKSKIWELENELYEYALEQFHFVKKHNLVYNKVLGY 294
++ EL + Y +E F F++ +++Y K +
Sbjct: 409 RQ----ELAMDSLRYEMELFAFIQ--SVLYKKYTSF 438
>gi|313218405|emb|CBY43005.1| unnamed protein product [Oikopleura dioica]
Length = 247
Score = 97.8 bits (242), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 65/202 (32%), Positives = 97/202 (48%), Gaps = 21/202 (10%)
Query: 89 RPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKHGD 148
+P +Y H FIDF + K +PL+IN++R P+++ S+YYF+R G+ + GD
Sbjct: 47 KPGVYIRHQYFIDFAEHKQK-RPLYINVIRDPVEKFRSFYYFIRNGN------LEGDGGD 99
Query: 149 K--------TTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVT 200
++C+ EC+ E W VP+ CG C N WA+ KAKEN+
Sbjct: 100 VPMSESKRLMNINDCVSRREKECT-EPKWQMVPYFCGQDPRCR-QRNSWAVTKAKENIEK 157
Query: 201 KYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRR---TNRKIDPSEETVQQ 257
Y VG+TEEL ++L E +P FF G D + ++ T K + ETV
Sbjct: 158 YYAAVGLTEELPASLALFETLMPRFFHGAID-MKKEGEERIKNDTYTLNKAALTPETVDF 216
Query: 258 IKKSKIWELENELYEYALEQFH 279
K LE +LY + +F
Sbjct: 217 FKTKTSIALEYDLYNFVKARFE 238
>gi|313225568|emb|CBY07042.1| unnamed protein product [Oikopleura dioica]
Length = 410
Score = 97.8 bits (242), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 71/272 (26%), Positives = 121/272 (44%), Gaps = 24/272 (8%)
Query: 27 DTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNV-------LHVNVTGNNHVLSLADQYRFV 79
+ I +N++PK+GS++ N+ + +K RFN L+ G H+ S FV
Sbjct: 123 EQFIFHNKLPKSGSSTMNNILSMLGQKNRFNYRKLFPHDLNSIQFGAEHLTSEKPLVNFV 182
Query: 80 -NNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYG---- 134
NN+T W P + H +DF+++ + +QP +IN++R P D S+YYF R+G
Sbjct: 183 KNNITSW----PFVLLKHHLPMDFEKY-NIQQPTYINVIRDPADWFQSHYYFERFGWTRK 237
Query: 135 DNYRPHLVRKKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNP-----W 189
++ R + + T D+C+ +C W + F CG+ C
Sbjct: 238 EDDRGSFHGSEEDKQRTVDQCVEQENAQCMEPITWKYIEFFCGNKEDCSSRSGEDEITKQ 297
Query: 190 ALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKID 249
A+ KA N+ + VGV E+ D + L E LP F G T+ + + + R + +
Sbjct: 298 AMMKAMHNVEHNFFAVGVLEQFDDTLKLFEKMLPRIFTGATEVYHSDRIAQKRAATKTVG 357
Query: 250 --PSEETVQQIKKSKIWELENELYEYALEQFH 279
P + S E +LY + + F+
Sbjct: 358 AVPMNNATRAFFASGPLRYEYQLYAFTRQLFN 389
>gi|350597021|ref|XP_003361914.2| PREDICTED: uronyl 2-sulfotransferase-like [Sus scrofa]
Length = 374
Score = 97.4 bits (241), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 74/234 (31%), Positives = 119/234 (50%), Gaps = 38/234 (16%)
Query: 24 LSWDTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNV-------------------- 63
L + + ++YNRV K GS + V + + K FN++ ++
Sbjct: 24 LPFPSQVVYNRVGKCGSRTVVLLLRILSEKHGFNLVTSDIHNKTRLTKNEQACVCCRHCK 83
Query: 64 ---TGNNHVLSLADQYRFVNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKP 120
G +L A + ++ RDR + H F++F +FG +QP++INI+R P
Sbjct: 84 PQAEGRKTILPPAQR---TSSCRSKRDR----FTRHVHFLNFSRFGG-DQPVYINIIRDP 135
Query: 121 LDRLVSYYYFLRYGD--NYRPHLVR----KKHGDKTTFDECIRLNRTECSLENMWLQVPF 174
++R +S Y+F R+GD + H++R ++ +ECI N ECS ++ +P+
Sbjct: 136 VNRFLSNYFFRRFGDWRGEQNHMIRTPSMRQEERYLDINECILENYPECSNPRLFYIIPY 195
Query: 175 LCGHAAACWVPGNPWALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRG 228
CG C PG WALE+AK N+ +LLVG+ EEL D + LLE LP +F+G
Sbjct: 196 FCGQHPRCREPGE-WALERAKLNVNENFLLVGILEELEDVLLLLERFLPHYFKG 248
>gi|313221005|emb|CBY31837.1| unnamed protein product [Oikopleura dioica]
Length = 247
Score = 97.4 bits (241), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 66/204 (32%), Positives = 96/204 (47%), Gaps = 25/204 (12%)
Query: 89 RPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKHGD 148
+P +Y H FIDF + K +PL+IN++R P+++ S+YYF+R G+ + GD
Sbjct: 47 KPGVYIRHQYFIDFAEHKQK-RPLYINVIRDPVEKFRSFYYFIRNGN------LEGDGGD 99
Query: 149 K--------TTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVT 200
++C+ EC+ E W VP+ CG C N WA+ KAKEN+
Sbjct: 100 VPMSESKRLMNINDCVSRREKECT-EPKWQMVPYFCGQDPRCR-QRNSWAVTKAKENIEK 157
Query: 201 KYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTN-----RKIDPSEETV 255
Y VG+TEEL ++L E +P FF G D + R N K + ETV
Sbjct: 158 YYAAVGLTEELPASLALFETLMPRFFHGAID---VKKEGEERIKNDTYTLNKAALTPETV 214
Query: 256 QQIKKSKIWELENELYEYALEQFH 279
K LE +LY + +F
Sbjct: 215 DFFKTKTSIALEYDLYNFVKARFE 238
>gi|390365976|ref|XP_003730937.1| PREDICTED: heparan sulfate 2-O-sulfotransferase 1-like
[Strongylocentrotus purpuratus]
Length = 338
Score = 97.4 bits (241), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 81/288 (28%), Positives = 129/288 (44%), Gaps = 25/288 (8%)
Query: 1 INTQKSHQIHISSAKSPSPETDSLSWDTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLH 60
+NT ++ Q SS P VII N VP+TGS A N
Sbjct: 64 VNT-RTTQARTSSWTGPRFGQTVKGMKHVIIMNDVPRTGSRLLYRAA--------LNAFD 114
Query: 61 VNVTG-NNHVLSLADQYRFVNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRK 119
N T +H++ + + + + V + + PA HGH F + + P++IN +R+
Sbjct: 115 YNWTSVTDHLVRIEYEKKNIEEVQQLKP--PAFVHGHTSFYPYTDIW-ENPPIYINFMRE 171
Query: 120 PLDRLVSYYYFLRYGDNYRPHLVRKKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHA 179
P+ R+ S +Y+ R+GD + + H D T ++C+ C W CG+
Sbjct: 172 PIARMESNFYYERFGDYAQDPSIH--HNDNLTLEQCVNKGLVWCG--PWWNWHILFCGYE 227
Query: 180 AACWVPGNPWALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLT---- 235
+ C + W+LEKAKEN+ Y +G+TEE + ++E +P G +
Sbjct: 228 SRCQT-DHAWSLEKAKENIDRHYTFIGITEEFETSLQIIERLVPDLLGGLLKAYKELNDD 286
Query: 236 -SNKSHLRRTNRKIDPSEETVQQIKKSKIWELENELYEYALEQFHFVK 282
+N + L RT K S V K +I + + ELYEYA E+F +K
Sbjct: 287 GTNWNELFRTRDKKRLSPRLVD--KAREIMKEDVELYEYAYEKFRQLK 332
>gi|339237289|ref|XP_003380199.1| heparan sulfate 2-O-sulfotransferase 1 [Trichinella spiralis]
gi|316977005|gb|EFV60185.1| heparan sulfate 2-O-sulfotransferase 1 [Trichinella spiralis]
Length = 177
Score = 97.1 bits (240), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 49/100 (49%), Positives = 67/100 (67%), Gaps = 2/100 (2%)
Query: 203 LLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEETVQQIKKSK 262
+LVGV+EEL DFV LLE P FF G T + KS+LR+T +KI PSE+T+ QI++S
Sbjct: 1 MLVGVSEELQDFVELLELIFPDFFSGATVIYSQGRKSYLRKTVKKIPPSEQTLAQIRQSP 60
Query: 263 IWELENELYEYALEQFHFVK--KHNLVYNKVLGYEADKGK 300
IW++E + YE+A QFHF+K K L + +GY +K K
Sbjct: 61 IWKMEQDFYEFAKRQFHFLKLIKTRLGGKREIGYHYEKVK 100
>gi|328699973|ref|XP_001950548.2| PREDICTED: heparan sulfate 2-O-sulfotransferase pipe-like
[Acyrthosiphon pisum]
Length = 380
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 71/272 (26%), Positives = 130/272 (47%), Gaps = 29/272 (10%)
Query: 29 VIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHV-LSLADQYRFVNNVTKWRD 87
+I +NRVPK GS +F+ + + + +F +V + L+ +Q R + V+K+
Sbjct: 109 LIFFNRVPKVGSQTFMEILRLLSLRNQFVFYQDHVQRVETIRLAETEQLRVASMVSKYDT 168
Query: 88 RRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKHG 147
P+++ H F++F +F E P++IN++R P++R++S+YY++R Y V +KH
Sbjct: 169 --PSVFIKHISFVNFTKFYLPE-PIYINLVRDPVERVISWYYYIRAPWYY----VERKHA 221
Query: 148 -----------DKTTFDECIRLNRTEC---------SLENMWLQVPFLCGHAAACWVPGN 187
K F+ C+ EC + + Q F CGH C +
Sbjct: 222 FPDTPLPDPNWLKKDFETCVLRGDRECRYTQGEKREGISDHRRQTMFFCGHDEECTPFNS 281
Query: 188 PWALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRK 247
ALE+AK ++ +Y +VGV E+ +++ E +P FF+G + + H N
Sbjct: 282 EGALERAKRSVEQQYAVVGVLEDFNVTLTVFEHYIPRFFKGASKVYYGDMGLHKINHNAF 341
Query: 248 IDPSEETVQQIKKSKIWELENELYEYALEQFH 279
E V+ I + K + E E Y++ ++ +
Sbjct: 342 KPLVSEAVKDIVR-KNFTREIEFYQFCRQRLY 372
>gi|390360667|ref|XP_001200628.2| PREDICTED: heparan sulfate 2-O-sulfotransferase 1-like
[Strongylocentrotus purpuratus]
Length = 265
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 74/260 (28%), Positives = 120/260 (46%), Gaps = 24/260 (9%)
Query: 29 VIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTG-NNHVLSLADQYRFVNNVTKWRD 87
+II N VP+TGS A N N T +H++ + + + + V + +
Sbjct: 18 IIIMNDVPRTGSRLLYRAA--------LNAFDYNWTSVTDHLVRIEYEKKNIEEVQQLKP 69
Query: 88 RRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKHG 147
PA HGH F + + P++IN +R+P+ R+ S +Y+ R+GD + + H
Sbjct: 70 --PAFVHGHTSFYPYTDIW-ENPPIYINFMREPIARMESNFYYERFGDYAQDPSIH--HN 124
Query: 148 DKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVGV 207
D T ++C+ C W CG+ + C + W+LEKAKEN+ Y +G+
Sbjct: 125 DNLTLEQCVNKGLVWCG--PWWNWHILFCGYESLC-QTDHAWSLEKAKENIDQHYTFIGI 181
Query: 208 TEELTDFVSLLEAALPSFFRGGTDHFLT-----SNKSHLRRTNRKIDPSEETVQQIKKSK 262
TEE + ++E +P G + +N + L RT K S V K +
Sbjct: 182 TEEFETSLQIIERLVPDLLGGLLKAYKELNDDGTNWNELFRTRDKKRLSPRLVD--KARE 239
Query: 263 IWELENELYEYALEQFHFVK 282
I + + ELYEYA E+F +K
Sbjct: 240 IMKEDVELYEYAYEKFRQLK 259
>gi|307209570|gb|EFN86485.1| Heparan sulfate 2-O-sulfotransferase pipe [Harpegnathos saltator]
Length = 310
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 74/270 (27%), Positives = 132/270 (48%), Gaps = 24/270 (8%)
Query: 29 VIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHV-LSLADQYRFVNNVTKWRD 87
V+ +NRVPK GS +F+ + + + F+ V + L+ +Q + V+ + +
Sbjct: 26 VLFFNRVPKVGSQTFMELLRRLSIRNLFSFNRDRVQRVETIRLAPIEQLQLARMVSSYSE 85
Query: 88 RRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYR------PHL 141
P++Y H F +F +F E P++INI+R P++R++S+YY++R Y P L
Sbjct: 86 --PSVYVKHVCFTNFTEFHLPE-PIYINIVRDPVERVISWYYYVRAPWYYVERKQIFPDL 142
Query: 142 -VRKKHGDKTTFDECIRLNRTEC---------SLENMWLQVPFLCGHAAACWVPGNPWAL 191
+ + K F+ C+ EC + + Q F CGH+ C AL
Sbjct: 143 PLPDPNWLKKDFEICVLKGDRECRYLQGEIHEGIGDHRRQTLFFCGHSEKCTPFNTVGAL 202
Query: 192 EKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKI--D 249
E+AK + Y +VGV E++ +++LE +P FF+G TD + S + R NR
Sbjct: 203 ERAKLAVEKHYAVVGVLEDMNTTLTVLENYIPRFFQGATDVYYDQVNSFM-RINRNFFKP 261
Query: 250 PSEETVQQIKKSKIWELENELYEYALEQFH 279
P E V+ + ++ + E E Y++ ++ +
Sbjct: 262 PVSEEVKNLVRNN-FTREVEFYQFCKQRLY 290
>gi|194751644|ref|XP_001958135.1| GF10765 [Drosophila ananassae]
gi|190625417|gb|EDV40941.1| GF10765 [Drosophila ananassae]
Length = 342
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 76/276 (27%), Positives = 135/276 (48%), Gaps = 33/276 (11%)
Query: 30 IIYNRVPKTGSTSFVNMAYDMCRKKRFN-VLHVNVTGNNHVLSLADQYRFVNNVTKWRDR 88
+ +NRVPK GS S + + M R + N H G+ H + ++ R + V R
Sbjct: 45 VFFNRVPKVGSQSLMEL---MARLGKINGFTHARNKGSAHETIVMNKQRQNDLVGDLLTR 101
Query: 89 -RPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKHG 147
+P +Y H +I+F +F +P++IN++R P+DR++S++Y++R YR ++ K G
Sbjct: 102 QKPHIYSQHIAYINFTRF-HLPKPIYINLIRDPIDRIISWHYYIRARWYYRD--MKAKLG 158
Query: 148 DKT-----------TFDECIRLNRTECSLENMWL---------QVPFLCGHAAACWVPGN 187
D+ D C+R + C+ + Q F CG +P N
Sbjct: 159 DQAPPMPSDEFLNLDLDTCVRNHDPHCTFNQGEIKNAVGDHRRQTLFFCGMNQKVCMPFN 218
Query: 188 P-WALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGG-TDHFLTSNK-SHLRRT 244
A++KAK + T+Y +VG E+ +S+LEA +P FFR ++L ++ S + R
Sbjct: 219 SKVAMQKAKRTVETEYAVVGTWEDTNITLSVLEAYIPRFFRNAKVAYYLGKDRLSRVNRN 278
Query: 245 NRKIDPSEETVQQIKKSKIWELENELYEYALEQFHF 280
N S+ET ++K+ E+ E Y++ ++ +
Sbjct: 279 NVTRIVSDETRTILRKNLTNEI--EFYDFCKQRLYL 312
>gi|296483925|tpg|DAA26040.1| TPA: uronyl-2-sulfotransferase [Bos taurus]
Length = 409
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 77/278 (27%), Positives = 138/278 (49%), Gaps = 25/278 (8%)
Query: 19 PETDSLSWDTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRF 78
P + L + + ++YNRV K GS + V + + K FN++ ++ N L+ +Q
Sbjct: 98 PPSKVLPFPSQVVYNRVGKCGSRTVVLLLRILSEKHGFNLVTSDIH-NKTRLTKNEQMEL 156
Query: 79 VNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGD--N 136
+ N++ +P L+ H F++F +FG +QP++INI+R P++R +S Y+F R+GD
Sbjct: 157 IKNIST--AEQPYLFTRHVHFLNFSRFGG-DQPVYINIIRDPVNRFLSNYFFRRFGDWRG 213
Query: 137 YRPHLVR----KKHGDKTTFDECIRLNRTECSLENMWLQV--PFLCGHAAACWVPGNPWA 190
+ H++R ++ + CI + EC+ W + F H + PG A
Sbjct: 214 EQNHMIRTPSMRQEERYLVINICIFQSFEECN-SCRWYDMISSFXSQHPSQ--QPGR-HA 269
Query: 191 LEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTN----- 245
L + K N+ T ++L+GV + + L + +P + +G + + H + N
Sbjct: 270 LARRKLNVRTGFVLIGVLSQFQQNLLLRQKFIPHYNQGVLE--IQKEPEHRKLGNMTVTV 327
Query: 246 RKIDPSEETVQQIKKSKIWELENELYEYALEQFHFVKK 283
+K PS E VQ + + +E E Y Y EQFH +K+
Sbjct: 328 KKTAPSPEAVQILYQRMRYEY--EFYHYVKEQFHLLKR 363
>gi|170027997|ref|XP_001841883.1| heparan sulfate 2-o-sulfotransferase [Culex quinquefasciatus]
gi|167868353|gb|EDS31736.1| heparan sulfate 2-o-sulfotransferase [Culex quinquefasciatus]
Length = 377
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 70/264 (26%), Positives = 124/264 (46%), Gaps = 34/264 (12%)
Query: 29 VIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLAD--QYRFVNNVTKWR 86
+I +NRVPK GS +F+ + + + + H +V V+ L+ Q V+
Sbjct: 102 IIFFNRVPKVGSQTFMELLRRLAIRNDYT-FHRDVVQRLEVIRLSPDRQQELAEMVSDLP 160
Query: 87 DRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRP-HLVRKK 145
P++Y H + +F +FG P+++N++R P++R++S+YY++R P + V +K
Sbjct: 161 --VPSVYVKHVCYTNFTRFGLP-MPIYVNMVRDPVERIISWYYYVR-----APWYFVERK 212
Query: 146 HG-----------DKTTFDECIRLNRTECSLENMWL---------QVPFLCGHAAACWVP 185
K F+ C+ EC+ + Q F CGH C
Sbjct: 213 QAFPDLPLPDPRWLKKDFETCVLQGDPECTYAQGAIHEGIGDHRRQTLFFCGHDDQCLPF 272
Query: 186 GNPWALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNK--SHLRR 243
+P ALE+AK + ++Y +VGV E+L +S+LE +P FF G + + +
Sbjct: 273 NSPGALERAKYAVESQYAVVGVLEDLNTTLSVLEKYVPKFFSGAPSVYFNEVNLLQKINK 332
Query: 244 TNRKIDPSEETVQQIKKSKIWELE 267
N K SEE + ++++ E+E
Sbjct: 333 NNFKPPVSEEIKELVRRNFTREIE 356
>gi|390363785|ref|XP_789352.2| PREDICTED: uronyl 2-sulfotransferase-like [Strongylocentrotus
purpuratus]
Length = 358
Score = 95.5 bits (236), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 83/276 (30%), Positives = 122/276 (44%), Gaps = 36/276 (13%)
Query: 24 LSWDTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVT 83
L D ++YNRVPK GS + + + +K + + + LA R V V
Sbjct: 89 LKDDITVVYNRVPKCGSRALLQCVKSLVKKNQIRG-PIPMVPPTFKAELALLTRTVEGV- 146
Query: 84 KWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVR 143
R +R A GH F FQ E+ +IN++R PL+RLVS +YF R+GD L
Sbjct: 147 --RSQRRAFLEGHVRFNHFQD----ERVHYINMIRDPLNRLVSSFYFNRHGDGL---LSP 197
Query: 144 KKHGD----------KTTFDECIRLNRTECS-LENMWLQVPFLCGHAAACWVPGNPWALE 192
++ D TFDEC+ R C+ + + + F CG C W L+
Sbjct: 198 RQLADVRNRTKPEVIDETFDECVSHERLSCTGPQVLSYVIGFFCGFHPRCR-KATQWTLD 256
Query: 193 KAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSN-------KSHLRRTN 245
+AK NL Y VG+ EE + +LE PS F G + + K+H +
Sbjct: 257 EAKRNL-DYYTAVGIVEEYNSSMRVLEFLFPSMFTGLVKRYTRLSRDTDFQVKAH---AH 312
Query: 246 RKIDPSEETVQQIKKSKIWELENELYEYALEQFHFV 281
R + PS + + K +LE E YEYA +F +
Sbjct: 313 RYVPPSPKVKAYMTKK--LKLEYEFYEYAKSRFDML 346
>gi|195173228|ref|XP_002027395.1| GL20909 [Drosophila persimilis]
gi|194113247|gb|EDW35290.1| GL20909 [Drosophila persimilis]
Length = 532
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 77/303 (25%), Positives = 141/303 (46%), Gaps = 34/303 (11%)
Query: 1 INTQKSHQIHISSAKSPSPETDSLSWDTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLH 60
I T++ +SS L+ ++ +NRVPK GS +F+ + + + F H
Sbjct: 215 IRTKRFLHSQMSSLNVRDLNNTRLAQMELVFFNRVPKVGSQTFMELLRRLSERNNFQ-FH 273
Query: 61 VNVTGNNHVLSLAD--QYRFVNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILR 118
+ + LA+ Q +++ + P+++ H F +F +F + +P+++N++R
Sbjct: 274 RDAVQKVETIRLAEDQQQEMAEVISELPE--PSVFIKHVCFTNFTKF-NLPKPIYLNVVR 330
Query: 119 KPLDRLVSYYYFLR-----------YGDNYRPHLVRKKHGDKTTFDECIRLNRTECS--- 164
P++R++S++Y++R + D PH K F+ C+ EC+
Sbjct: 331 DPVERVISWFYYVRAPWYFVERKAAFPDLPLPH----PAWLKKDFETCVLSGDQECTYTQ 386
Query: 165 ------LENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVGVTEELTDFVSLL 218
+ + Q F CGH C ALEKAK + +Y +VGV E+L +S+L
Sbjct: 387 GVTVEGIGDHRRQSLFFCGHDYECTPFNTVGALEKAKFAVEQQYAVVGVLEDLNTTLSVL 446
Query: 219 EAALPSFFRGGTDHFLTSNK--SHLRRTNRKIDPSEETVQQIKKSKIWELENELYEYALE 276
E +P FF G D + TS + + + + N K P E+V+ I + E E Y++ +
Sbjct: 447 EKYVPRFFEGVRDIYATSAEYLTKINKNNFK-PPVSESVKDIVRRNFTN-EIEFYQFCRQ 504
Query: 277 QFH 279
+ H
Sbjct: 505 RLH 507
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 41/136 (30%), Positives = 66/136 (48%), Gaps = 12/136 (8%)
Query: 91 ALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKHGDKT 150
LY H ++DF+ FG K +P++++++R P+DR+V YY R+ R R G
Sbjct: 38 TLYIAHSNWLDFKSFGYK-KPIYLSMVRDPIDRVVHDYY-KRHSRTKRQIYRRMFPGQPL 95
Query: 151 TFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVGVTEE 210
C L V LCG+ C +P A+++AK + +Y +VG EE
Sbjct: 96 GHGLCGGLQEA----------VTILCGNHLNCLPFNSPHAVQEAKSRVEKEYSVVGTWEE 145
Query: 211 LTDFVSLLEAALPSFF 226
+++LE +P FF
Sbjct: 146 KNITLTVLEKYVPRFF 161
>gi|383860488|ref|XP_003705721.1| PREDICTED: heparan sulfate 2-O-sulfotransferase pipe-like
[Megachile rotundata]
Length = 408
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 72/270 (26%), Positives = 131/270 (48%), Gaps = 24/270 (8%)
Query: 29 VIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHV-LSLADQYRFVNNVTKWRD 87
++ +NRVPK GS +F+ + + + F V + L+ +Q + V+ + +
Sbjct: 124 ILFFNRVPKVGSQTFMELLRRLSMRNGFTFNRDRVQRVETIRLAPIEQIQLARMVSGYAE 183
Query: 88 RRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYR------PHL 141
P++Y H F +F +F + QP++INI+R P++R++S+YY++R Y P L
Sbjct: 184 --PSVYIKHVCFTNFTEF-NLPQPIYINIVRDPVERVISWYYYVRAPWYYVERKQIFPDL 240
Query: 142 -VRKKHGDKTTFDECIRLNRTEC---------SLENMWLQVPFLCGHAAACWVPGNPWAL 191
+ + K F+ C+ EC + + Q F CGH+ C AL
Sbjct: 241 PLPDPNWLKKDFETCVLKADRECRYLEGEIHEGIGDHRRQTLFFCGHSEKCTPFNTVGAL 300
Query: 192 EKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKI--D 249
E+AK + Y +VGV E++ +++LE +P FF+G TD + + R NR
Sbjct: 301 ERAKMAVEKHYAVVGVLEDVNATLTVLENYIPRFFQGATDVYYDEVNA-FTRINRNFFKP 359
Query: 250 PSEETVQQIKKSKIWELENELYEYALEQFH 279
P E ++ + +S + E E Y++ ++ +
Sbjct: 360 PVSEEIKDMVRSN-FTREIEFYQFCKQRLY 388
>gi|332024386|gb|EGI64584.1| Heparan sulfate 2-O-sulfotransferase pipe [Acromyrmex echinatior]
Length = 333
Score = 95.1 bits (235), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 74/269 (27%), Positives = 129/269 (47%), Gaps = 24/269 (8%)
Query: 30 IIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHV-LSLADQYRFVNNVTKWRDR 88
+ +NRVPK GS +F+ + + + F+ V + L+ +Q V+ + +
Sbjct: 55 LFFNRVPKVGSQTFMELLRRLSIRNAFSFNRDRVQRVETIRLAPIEQLHLARMVSSYSE- 113
Query: 89 RPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYR------PHL- 141
P++Y H F +F +F E P++INI+R P++R++S+YY++R Y P L
Sbjct: 114 -PSVYVKHVCFTNFTEFHLPE-PIYINIVRDPVERVISWYYYVRAPWYYVERKQIFPDLP 171
Query: 142 VRKKHGDKTTFDECIRLNRTEC---------SLENMWLQVPFLCGHAAACWVPGNPWALE 192
+ + K F+ C+ EC + + Q F CGH+ C ALE
Sbjct: 172 LPDPNWLKKDFESCVLKGDRECRYLQGEIHEGIGDHRRQTLFFCGHSEKCTPFNTVGALE 231
Query: 193 KAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKI--DP 250
+AK + Y +VGV E++ +++LE +P FF+G TD + S R NR P
Sbjct: 232 RAKLAVEKHYAVVGVLEDINTTLTVLENYVPRFFQGATDVYYDQVNS-FTRINRNFFKPP 290
Query: 251 SEETVQQIKKSKIWELENELYEYALEQFH 279
E V+ + +S + E E Y++ ++ +
Sbjct: 291 VSEEVKNLVRSN-FTREVEFYQFCKQRLY 318
>gi|198463771|ref|XP_002135576.1| GA28240 [Drosophila pseudoobscura pseudoobscura]
gi|198151404|gb|EDY74203.1| GA28240 [Drosophila pseudoobscura pseudoobscura]
Length = 308
Score = 94.7 bits (234), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 72/275 (26%), Positives = 131/275 (47%), Gaps = 34/275 (12%)
Query: 29 VIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLAD--QYRFVNNVTKWR 86
++ +NRVPK GS +F+ + + + F H + + LA+ Q +++
Sbjct: 19 LVFFNRVPKVGSQTFMELLRRLSERNNFQ-FHRDAVQKVETIRLAEDQQQEMAEVISELP 77
Query: 87 DRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLR-----------YGD 135
+ P+++ H F +F +F +P+++N++R P++R++S++Y++R + D
Sbjct: 78 E--PSVFIKHVCFTNFTKFNLP-KPIYLNVVRDPVERVISWFYYVRAPWYFVERKAAFPD 134
Query: 136 NYRPHLVRKKHGDKTTFDECIRLNRTECS---------LENMWLQVPFLCGHAAACWVPG 186
PH K F+ C+ EC+ + + Q F CGH C
Sbjct: 135 LPLPH----PAWLKKDFETCVLSGDQECTYTQGVTVEGIGDHRRQSLFFCGHDYECTPFN 190
Query: 187 NPWALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNK--SHLRRT 244
ALEKAK + +Y +VGV E+L +S+LE +P FF G D + TS + + + +
Sbjct: 191 TVGALEKAKFAVEQQYAVVGVLEDLNTTLSVLEKYVPRFFEGVRDIYATSAEYLTKINKN 250
Query: 245 NRKIDPSEETVQQIKKSKIWELENELYEYALEQFH 279
N K P E+V+ I + E E Y++ ++ H
Sbjct: 251 NFK-PPVSESVKDIVRRNFTN-EIEFYQFCRQRLH 283
>gi|307183100|gb|EFN70017.1| Heparan sulfate 2-O-sulfotransferase pipe [Camponotus floridanus]
Length = 360
Score = 94.7 bits (234), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 73/270 (27%), Positives = 130/270 (48%), Gaps = 24/270 (8%)
Query: 29 VIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHV-LSLADQYRFVNNVTKWRD 87
V+ +NRVPK GS +F+ + + + F+ V + L+ +Q V+ + +
Sbjct: 83 VLFFNRVPKVGSQTFMELLRRLSIRNAFSFNRDRVQRVETIRLAPIEQLHLARMVSSYSE 142
Query: 88 RRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYR------PHL 141
P++Y H F +F +F E P++IN++R P++R++S+YY++R Y P L
Sbjct: 143 --PSVYVKHVCFTNFTEFHLPE-PIYINVVRDPVERVISWYYYVRAPWYYVERKQIFPDL 199
Query: 142 -VRKKHGDKTTFDECIRLNRTEC---------SLENMWLQVPFLCGHAAACWVPGNPWAL 191
+ + K F+ C+ EC + + Q F CGH+ C AL
Sbjct: 200 PLPDPNWLKKDFESCVLKGDRECRYLEGEVHEGIGDHRRQTLFFCGHSEKCTPFNTVGAL 259
Query: 192 EKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKI--D 249
E+AK + Y +VGV E++ +++LE +P FF+G T+ + S R NR
Sbjct: 260 ERAKLAVEKHYAVVGVLEDMNTALTVLENYIPRFFQGATNVYYDQVNS-FTRINRNFFKP 318
Query: 250 PSEETVQQIKKSKIWELENELYEYALEQFH 279
P E V+ + +S + E E Y++ ++ +
Sbjct: 319 PVSEEVKNLVRSN-FTREVEFYQFCKQRLY 347
>gi|195435616|ref|XP_002065776.1| GK19555 [Drosophila willistoni]
gi|194161861|gb|EDW76762.1| GK19555 [Drosophila willistoni]
Length = 668
Score = 94.4 bits (233), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 75/276 (27%), Positives = 132/276 (47%), Gaps = 33/276 (11%)
Query: 30 IIYNRVPKTGSTSFVNMAYDMCRKKRFN-VLHVNVTGNNHVLSLADQYRFVNNVTKWRDR 88
+ +NRVPK GS S + + M R + N +H G+ H L ++ N V R
Sbjct: 86 VFFNRVPKVGSQSLMEL---MTRLGKINGFIHARNKGSAHETILTNKIGQKNLVADLLTR 142
Query: 89 -RPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKHG 147
+P +Y H +I+F +F +P++IN++R P++R++S++Y++R Y H ++ K G
Sbjct: 143 PKPHIYSQHIAYINFTRF-HLPRPIYINLVRDPIERIISWHYYIRAPWYY--HDMKAKLG 199
Query: 148 DKT-----------TFDECIRLNRTECSLENMWL---------QVPFLCGHAAACWVPGN 187
D D C+R + C+ M + Q F CG +P N
Sbjct: 200 DSALPMPSDEFLNLDLDTCVRNHDPHCTFTQMQMKNGVGDHRRQTLFFCGMNQKLCMPFN 259
Query: 188 P-WALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGG-TDHFLTSNK-SHLRRT 244
A++KAK + Y +VG E+ +++LEA +P +FR ++L + S + R
Sbjct: 260 SEAAMQKAKRTVEADYAVVGTWEDTNITLAVLEAYIPRYFRNAKVAYYLGKERLSRVNRN 319
Query: 245 NRKIDPSEETVQQIKKSKIWELENELYEYALEQFHF 280
N S+ET Q ++ + E+ E YE+ ++ +
Sbjct: 320 NVTRIVSDETRQILRHNLTNEI--EFYEFCKQRLYL 353
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 62/221 (28%), Positives = 104/221 (47%), Gaps = 21/221 (9%)
Query: 29 VIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHV-LSLADQYRFVNNVTKWRD 87
++ +NRVPKTGS + + + F + L Q + + D
Sbjct: 390 MLFFNRVPKTGSETLNELMRKLGPINGFRFDRAPFKSPIGMRWPLERQRQEAEKFIEMAD 449
Query: 88 RRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYG-------DNYRPH 140
P Y H + DF+QF QP++IN++R P+++++S+YY+ R D Y+
Sbjct: 450 GDPFAYVEHVNYFDFRQF-HLPQPIYINMVRDPVEKVLSWYYYKRTPWHALLMFDGYKKF 508
Query: 141 LVRKKHGDKTTFDECIRLNRTECSLE----------NMWLQVPFLCGHAAACWVPGNPWA 190
R + + +F+ CI EC+ E + Q F CGHA C P A
Sbjct: 509 QSRAFY--RKSFESCIMTGDPECNYEYGHGFDGDAGDHRRQSLFFCGHAPICEPFNTPGA 566
Query: 191 LEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTD 231
+++AK+N+ + +VG E++ +++LE +P FFRG TD
Sbjct: 567 IQRAKQNVERDFAVVGSWEDVNVTLAVLEHYIPRFFRGVTD 607
>gi|195496273|ref|XP_002095623.1| GE22505 [Drosophila yakuba]
gi|194181724|gb|EDW95335.1| GE22505 [Drosophila yakuba]
Length = 351
Score = 94.4 bits (233), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 75/257 (29%), Positives = 122/257 (47%), Gaps = 41/257 (15%)
Query: 10 HISSAK---SPSPETDSLSWDTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGN 66
H+++A+ +P + D+L +NRV K+GS + + + ++ F +V G
Sbjct: 52 HLTTAQLNNTPRAQVDTL------FFNRVTKSGSEKMMELLKILGKRLNFEARR-DVEGF 104
Query: 67 NHVLSLADQY--RFVNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRL 124
V+ + D Y FV + + + Y H F+DF Q + P++IN++R P++RL
Sbjct: 105 YEVVIMHDSYAKNFVRTEV-FNSSKASSYTKHVAFLDFDQL-DEPWPIYINLVRDPVERL 162
Query: 125 VSYYYFLRYGDNYRP-HLVRKKH--GD----------KTTFDECIRLNRTECSLENMWL- 170
VS++Y++R P HL +K GD K F+ CI EC E M L
Sbjct: 163 VSWFYYVR-----APWHLAERKEMFGDAIVLPSIDWLKKDFNRCIEERDPECVYEQMELG 217
Query: 171 -------QVPFLCGHAAACWVPGNPW-ALEKAKENLVTKYLLVGVTEELTDFVSLLEAAL 222
Q +LCG A +P N +++AK+N+ Y +VG E+ +S+LE +
Sbjct: 218 NLGDHRRQSLYLCGQNMAVCMPFNSHETMQRAKKNVEEHYAVVGTWEDTNTTLSVLEGYI 277
Query: 223 PSFFRGGTDHFLTSNKS 239
P FF G D + KS
Sbjct: 278 PRFFTGAKDEYYALKKS 294
>gi|270015724|gb|EFA12172.1| pipe [Tribolium castaneum]
Length = 330
Score = 94.4 bits (233), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 72/270 (26%), Positives = 123/270 (45%), Gaps = 24/270 (8%)
Query: 29 VIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHV-LSLADQYRFVNNVTKWRD 87
++ +NRVPK GS + + + + + F V + LS DQ + V+ +
Sbjct: 54 IVFFNRVPKVGSQTLMELIRRLSIRNNFGFHQDRVQRVETIRLSPEDQAVLSSLVSSYEP 113
Query: 88 RRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVR---- 143
P +Y H F + +FG E P++IN++R P++R++S+YY++R Y +
Sbjct: 114 --PGVYIKHVCFTNISRFGFPE-PIYINLVRDPVERVISWYYYVRAPWYYVERKIAFPDI 170
Query: 144 ---KKHGDKTTFDECIRLNRTEC---------SLENMWLQVPFLCGHAAACWVPGNPWAL 191
K F+ C+ EC + + Q F CGH + C P AL
Sbjct: 171 PLPDPKWLKKDFEHCVLSGDRECKYLTGETREGIGDHRRQSMFFCGHHSECLPFNTPGAL 230
Query: 192 EKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKI--D 249
E+AK + Y +VGV E+L +++LE +P FF G + + + S N+
Sbjct: 231 ERAKRVVEQHYAVVGVLEDLNTTLTVLEKYIPRFFTGAYEIYW-NEISRFNPINKNAFKP 289
Query: 250 PSEETVQQIKKSKIWELENELYEYALEQFH 279
P ETV+ I + + E E Y++ ++ H
Sbjct: 290 PVSETVKNIVRQNFTK-EIEFYQFCKQRLH 318
>gi|91092306|ref|XP_969659.1| PREDICTED: similar to heparan sulfate 2-o-sulfotransferase, partial
[Tribolium castaneum]
Length = 295
Score = 94.0 bits (232), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 72/270 (26%), Positives = 123/270 (45%), Gaps = 24/270 (8%)
Query: 29 VIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHV-LSLADQYRFVNNVTKWRD 87
++ +NRVPK GS + + + + + F V + LS DQ + V+ +
Sbjct: 19 IVFFNRVPKVGSQTLMELIRRLSIRNNFGFHQDRVQRVETIRLSPEDQAVLSSLVSSYEP 78
Query: 88 RRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVR---- 143
P +Y H F + +FG E P++IN++R P++R++S+YY++R Y +
Sbjct: 79 --PGVYIKHVCFTNISRFGFPE-PIYINLVRDPVERVISWYYYVRAPWYYVERKIAFPDI 135
Query: 144 ---KKHGDKTTFDECIRLNRTEC---------SLENMWLQVPFLCGHAAACWVPGNPWAL 191
K F+ C+ EC + + Q F CGH + C P AL
Sbjct: 136 PLPDPKWLKKDFEHCVLSGDRECKYLTGETREGIGDHRRQSMFFCGHHSECLPFNTPGAL 195
Query: 192 EKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKI--D 249
E+AK + Y +VGV E+L +++LE +P FF G + + + S N+
Sbjct: 196 ERAKRVVEQHYAVVGVLEDLNTTLTVLEKYIPRFFTGAYEIYW-NEISRFNPINKNAFKP 254
Query: 250 PSEETVQQIKKSKIWELENELYEYALEQFH 279
P ETV+ I + + E E Y++ ++ H
Sbjct: 255 PVSETVKNIVRQNFTK-EIEFYQFCKQRLH 283
>gi|158295252|ref|XP_316107.4| AGAP006058-PA [Anopheles gambiae str. PEST]
gi|157015946|gb|EAA11659.4| AGAP006058-PA [Anopheles gambiae str. PEST]
Length = 389
Score = 94.0 bits (232), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 67/261 (25%), Positives = 121/261 (46%), Gaps = 28/261 (10%)
Query: 29 VIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWRDR 88
+I +NRVPK GS +F+ + + + + H +V V+ LA + +
Sbjct: 114 IIFFNRVPKVGSQTFMELLRRLAVRNDYT-FHRDVVQRLEVIRLAPERQQELAEMVMDLP 172
Query: 89 RPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKHG- 147
P++Y H + +F +FG P+++N++R P++R++S+YY++R Y V +K
Sbjct: 173 VPSVYVKHVCYTNFTRFGLP-MPIYVNMVRDPVERIISWYYYVRAPWYY----VERKQAF 227
Query: 148 ----------DKTTFDECIRLNRTECSLENMWL---------QVPFLCGHAAACWVPGNP 188
K F+ C+ EC+ + Q F CGH C +
Sbjct: 228 PDLPLPDPRWLKKDFETCVLQGDPECTYSQNAVHEGIGDHRRQTLFFCGHGEECLPFNSA 287
Query: 189 WALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNK--SHLRRTNR 246
ALE+AK + ++Y +VGV E+L +++LE +P FF G + + + + N
Sbjct: 288 GALERAKYAVESQYAVVGVLEDLNTTLTVLEKYVPRFFSGASSVYFNEVNVLQKINKNNF 347
Query: 247 KIDPSEETVQQIKKSKIWELE 267
K SEE ++++ E+E
Sbjct: 348 KPPVSEEIKNLVRRNFTKEIE 368
>gi|194751656|ref|XP_001958141.1| GF10771 [Drosophila ananassae]
gi|190625423|gb|EDV40947.1| GF10771 [Drosophila ananassae]
Length = 319
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 70/275 (25%), Positives = 131/275 (47%), Gaps = 34/275 (12%)
Query: 29 VIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLAD--QYRFVNNVTKWR 86
++ +NRVPK GS +F+ + + + F H + + LA+ Q +++
Sbjct: 29 LVFFNRVPKVGSQTFMELLRRLSERNNFQ-FHRDAVQKVETIRLAEDQQQEMAEVISELP 87
Query: 87 DRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLR-----------YGD 135
+ P+++ H F +F +F +P+++N++R P++R++S++Y++R + D
Sbjct: 88 E--PSVFIKHVCFTNFTKFNLP-KPIYLNVVRDPVERVISWFYYVRAPWYFVERKAAFPD 144
Query: 136 NYRPHLVRKKHGDKTTFDECIRLNRTECS---------LENMWLQVPFLCGHAAACWVPG 186
PH K F+ C+ EC+ + + Q F CGH C
Sbjct: 145 LPLPH----PAWLKKDFETCVLSGDQECTYTQGVTVEGIGDHRRQSLFFCGHDYECTPFN 200
Query: 187 NPWALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNK--SHLRRT 244
ALEKAK + +Y +VGV E+L +S+LE +P FF G D + TS + + + +
Sbjct: 201 TVGALEKAKFAVEQQYAVVGVLEDLNTTLSVLEKYVPRFFEGVRDIYATSAEYLTKINKN 260
Query: 245 NRKIDPSEETVQQIKKSKIWELENELYEYALEQFH 279
N K SE ++++ E+ E Y++ ++ H
Sbjct: 261 NFKPPVSEHVKDIVRRNFTNEI--EFYQFCRQRLH 293
>gi|195128101|ref|XP_002008504.1| GI13539 [Drosophila mojavensis]
gi|193920113|gb|EDW18980.1| GI13539 [Drosophila mojavensis]
Length = 314
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 69/273 (25%), Positives = 132/273 (48%), Gaps = 31/273 (11%)
Query: 29 VIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGN-NHVLSLADQYRFVNNVTKWRD 87
V+ YNRVPKTGS + + + + ++V G +L A++ +++N+ D
Sbjct: 20 VLFYNRVPKTGSIQLIELMRALGKVHDYDVEKDPQNGGIRALLDSAEEADWIDNIVNLED 79
Query: 88 RRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKHG 147
++ H +++F ++ + +P++IN++R P+DR++S+YY++R + P R
Sbjct: 80 --GTVFASHVNYLNFTKY-EQPRPIYINMVRDPVDRVISWYYYIRAPWIFVPGRRRNNRQ 136
Query: 148 D------KTTFDECIRLNRTECS-LENMWL--------QVPFLCGHAAACWVPGNP-WAL 191
T FD+C+ C+ +E L Q F CGH P N AL
Sbjct: 137 MPNPQWVNTEFDQCVLSGEKACTYIEGSLLERVGDHRRQTLFFCGHNEFKCTPFNSRLAL 196
Query: 192 EKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDP- 250
+ AK+N+ +Y +VG E + +++LEA +P +F + + S L T +P
Sbjct: 197 QLAKQNVEREYAVVGTWEHTNETLAVLEAYVPRYFADASKMYY----SGLHATKANDNPM 252
Query: 251 ----SEETVQQIKKSKIWELENELYEYALEQFH 279
SE+ + ++++ E+ E Y++ ++ H
Sbjct: 253 KPHISEDILNMVRRNFTREI--EFYQFCRQRLH 283
>gi|313219581|emb|CBY30503.1| unnamed protein product [Oikopleura dioica]
Length = 333
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 74/279 (26%), Positives = 130/279 (46%), Gaps = 30/279 (10%)
Query: 30 IIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYR--------FVNN 81
I +N++PK GST+ N+ + FN++H + + D + V
Sbjct: 59 IFHNKIPKCGSTTMANILAALETTNNFNLIHYHPCIKSPCDKALDGRKNSDYLTQELVPE 118
Query: 82 VTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHL 141
V + RP + H F +F + EQP IN+ R P+ R VS +YF RYG N +
Sbjct: 119 VEEASADRPLVLVKHHHFANFTAY-DMEQPTMINVARDPVSRFVSSFYFRRYGFNRNEGV 177
Query: 142 VRKKHGDK---TTFDECIRLNRTECSLENMWLQVPFLCGHA---AACWVPGNP----WAL 191
R+ G K +EC+ ECS E +L+ ++CG C NP AL
Sbjct: 178 RREFIGRKKQEMGLEECVMSEAHECS-EAAFLE--YICGSDKFWPECGNISNPKSRERAL 234
Query: 192 EKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHF--LTSNKSHLRRTNRKID 249
E+AK++++ +YL +G+ E++ + ++L E +P+ F G + + + ++ T +K
Sbjct: 235 ERAKKHVLNEYLAIGILEDIENTLTLFERMVPTVFHGAPAVYRAIGTTITNQTSTAKKEP 294
Query: 250 PSEETVQQIKKS------KIWELENELYEYALEQFHFVK 282
S+ ++++K +++L +YE L F K
Sbjct: 295 VSDLVREKLEKGPLRHQVDLYKLIKAVYEQKLRDFGISK 333
>gi|195352355|ref|XP_002042678.1| GM14880 [Drosophila sechellia]
gi|195477368|ref|XP_002086329.1| GE22927 [Drosophila yakuba]
gi|194124562|gb|EDW46605.1| GM14880 [Drosophila sechellia]
gi|194186119|gb|EDW99730.1| GE22927 [Drosophila yakuba]
gi|325995198|gb|ADZ49072.1| RE11403p [Drosophila melanogaster]
gi|332000046|gb|AED98571.1| RE19313p [Drosophila melanogaster]
Length = 308
Score = 91.7 bits (226), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 69/275 (25%), Positives = 131/275 (47%), Gaps = 34/275 (12%)
Query: 29 VIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLAD--QYRFVNNVTKWR 86
++ +NRVPK GS +F+ + + + F H + + LA+ Q +++
Sbjct: 19 LVFFNRVPKVGSQTFMELLRRLSERNNFQ-FHRDAVQKVETIRLAEDQQQEMAEVISELP 77
Query: 87 DRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLR-----------YGD 135
+ P+++ H F +F +F +P+++N++R P++R++S++Y++R + D
Sbjct: 78 E--PSVFIKHVCFTNFTKFNLP-RPIYLNVVRDPVERVISWFYYVRAPWYFVERKAAFPD 134
Query: 136 NYRPHLVRKKHGDKTTFDECIRLNRTECS---------LENMWLQVPFLCGHAAACWVPG 186
PH K F+ C+ EC+ + + Q F CGH C
Sbjct: 135 LPLPH----PAWLKKDFETCVLNGDQECTYTQGVTVEGIGDHRRQSLFFCGHDYECTPFN 190
Query: 187 NPWALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNK--SHLRRT 244
ALE+AK + +Y +VGV E+L +S+LE +P FF G D + TS + + + +
Sbjct: 191 TVGALERAKFAVEQQYAVVGVLEDLNTTLSVLEKYVPRFFEGVRDIYATSAEYLTKINKN 250
Query: 245 NRKIDPSEETVQQIKKSKIWELENELYEYALEQFH 279
N K SE ++++ E+ E Y++ ++ H
Sbjct: 251 NFKPPVSEHVKDIVRRNFTNEI--EFYQFCRQRLH 283
>gi|28574833|ref|NP_524158.2| pipe, isoform A [Drosophila melanogaster]
gi|28380469|gb|AAF49170.2| pipe, isoform A [Drosophila melanogaster]
Length = 413
Score = 91.7 bits (226), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 69/275 (25%), Positives = 132/275 (48%), Gaps = 34/275 (12%)
Query: 29 VIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLAD--QYRFVNNVTKWR 86
++ +NRVPK GS +F+ + + + F H + + LA+ Q +++
Sbjct: 124 LVFFNRVPKVGSQTFMELLRRLSERNNFQ-FHRDAVQKVETIRLAEDQQQEMAEVISELP 182
Query: 87 DRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLR-----------YGD 135
+ P+++ H F +F +F + +P+++N++R P++R++S++Y++R + D
Sbjct: 183 E--PSVFIKHVCFTNFTKF-NLPRPIYLNVVRDPVERVISWFYYVRAPWYFVERKAAFPD 239
Query: 136 NYRPHLVRKKHGDKTTFDECIRLNRTECS---------LENMWLQVPFLCGHAAACWVPG 186
PH K F+ C+ EC+ + + Q F CGH C
Sbjct: 240 LPLPH----PAWLKKDFETCVLNGDQECTYTQGVTVEGIGDHRRQSLFFCGHDYECTPFN 295
Query: 187 NPWALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNK--SHLRRT 244
ALE+AK + +Y +VGV E+L +S+LE +P FF G D + TS + + + +
Sbjct: 296 TVGALERAKFAVEQQYAVVGVLEDLNTTLSVLEKYVPRFFEGVRDIYATSAEYLTKINKN 355
Query: 245 NRKIDPSEETVQQIKKSKIWELENELYEYALEQFH 279
N K SE ++++ E+ E Y++ ++ H
Sbjct: 356 NFKPPVSEHVKDIVRRNFTNEI--EFYQFCRQRLH 388
>gi|195496284|ref|XP_002095628.1| GE22508 [Drosophila yakuba]
gi|194181729|gb|EDW95340.1| GE22508 [Drosophila yakuba]
Length = 1311
Score = 91.7 bits (226), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 69/275 (25%), Positives = 131/275 (47%), Gaps = 34/275 (12%)
Query: 29 VIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLAD--QYRFVNNVTKWR 86
++ +NRVPK GS +F+ + + + F H + + LA+ Q +++
Sbjct: 1022 LVFFNRVPKVGSQTFMELLRRLSERNNFQ-FHRDAVQKVETIRLAEDQQQEMAEVISELP 1080
Query: 87 DRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLR-----------YGD 135
+ P+++ H F +F +F +P+++N++R P++R++S++Y++R + D
Sbjct: 1081 E--PSVFIKHVCFTNFTKFNLP-RPIYLNVVRDPVERVISWFYYVRAPWYFVERKAAFPD 1137
Query: 136 NYRPHLVRKKHGDKTTFDECIRLNRTECS---------LENMWLQVPFLCGHAAACWVPG 186
PH K F+ C+ EC+ + + Q F CGH C
Sbjct: 1138 LPLPH----PAWLKKDFETCVLNGDQECTYTQGVTVEGIGDHRRQSLFFCGHDYECTPFN 1193
Query: 187 NPWALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNK--SHLRRT 244
ALE+AK + +Y +VGV E+L +S+LE +P FF G D + TS + + + +
Sbjct: 1194 TVGALERAKFAVEQQYAVVGVLEDLNTTLSVLEKYVPRFFEGVRDIYATSAEYLTKINKN 1253
Query: 245 NRKIDPSEETVQQIKKSKIWELENELYEYALEQFH 279
N K SE ++++ E+ E Y++ ++ H
Sbjct: 1254 NFKPPVSEHVKDIVRRNFTNEI--EFYQFCRQRLH 1286
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 61/218 (27%), Positives = 106/218 (48%), Gaps = 22/218 (10%)
Query: 29 VIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKW-RD 87
VI +NR K GS + + + M FN L V G ++S + W +
Sbjct: 527 VIFFNRGAKVGSEALMELTQTMAP---FNNLTVVTQGPLAIISRTRSPKEQMIQALWVTE 583
Query: 88 RRPA-LYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPH------ 140
P +Y H ++DFQ++ +P++IN++R P++R++S+YY++R G H
Sbjct: 584 LEPGTIYIEHCNWLDFQRY-QLPRPIYINLVRDPVERMISWYYYVRSGYRNAIHHRRFPN 642
Query: 141 -LVRKKHGDKTTFDECIRLNRTEC---------SLENMWLQVPFLCGHAAACWVPGNPWA 190
++ + K ++++C+R EC S+ N Q F CGH C + A
Sbjct: 643 ATIKSEKWFKKSYNDCVRSGDPECQYVPGSIKESVGNYKRQTLFFCGHNRECLPFDSQRA 702
Query: 191 LEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRG 228
++ AK ++ Y +VG EE +++LEA +P FF+G
Sbjct: 703 IQLAKLHVERDYAVVGTWEETNITLTVLEAYIPRFFKG 740
Score = 79.0 bits (193), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 53/197 (26%), Positives = 95/197 (48%), Gaps = 23/197 (11%)
Query: 91 ALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKHGD-- 148
+Y H ++DF +F +P+++N++R P++R++S++++ R +Y+ + +K +
Sbjct: 49 TMYIEHINWLDFDEFDLP-KPIYMNLVRDPVERVISWFFYAR--SSYKNAIEYRKRPNQK 105
Query: 149 -------KTTFDECIRLNRTEC---------SLENMWLQVPFLCGHAAACWVPGNPWALE 192
K F+EC+R EC ++ N Q F CGH C +P A++
Sbjct: 106 IKPESWYKKNFNECVRSGDPECQYVPHTVKDTIANFKRQSLFYCGHHDDCLPFNSPTAVQ 165
Query: 193 KAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDP-- 250
AKE++ Y +VG E+ +++LE +P FFRG + N R K P
Sbjct: 166 MAKEHVERDYAVVGSWEDTNITLTVLENYIPRFFRGAKLMYEMHNNKITNRNKNKRKPFI 225
Query: 251 SEETVQQIKKSKIWELE 267
E + I+K+ E E
Sbjct: 226 EPEVKEMIRKNFTSEYE 242
Score = 74.7 bits (182), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 58/222 (26%), Positives = 103/222 (46%), Gaps = 36/222 (16%)
Query: 30 IIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNH-------VLSLADQYRFVNNV 82
+ + R K GS S V M + N V+ G N L+ A ++ N+
Sbjct: 284 LFFTRCAKVGSESLVEF---MEHLQDINNFQVDKYGMNKKSKRQLKPLAQAATAGYIYNL 340
Query: 83 TKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLV 142
+ ++Y H +IDF + +P+FIN++R P++R++S+YY++R ++YR +
Sbjct: 341 DEG-----SVYIEHIPWIDFNDYNLP-KPIFINLVRDPVERMISWYYYVR--NSYRNAIF 392
Query: 143 RKKHGD---------KTTFDECIRLNRTECSL---------ENMWLQVPFLCGHAAACWV 184
+ + K ++++C+R EC N Q F CGH C
Sbjct: 393 YRNNPLAPLKPTAWFKKSYNDCVRSGDPECQYVPLAVRDVEGNFKRQSIFFCGHDQDCLP 452
Query: 185 PGNPWALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFF 226
+P A++ AK + T+Y +VG EE +++LE +P +F
Sbjct: 453 FNSPLAVQIAKRRVETEYAVVGTWEETNITLTVLEHYIPRYF 494
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 53/217 (24%), Positives = 106/217 (48%), Gaps = 24/217 (11%)
Query: 29 VIIYNRVPKTGSTSFVNMAYDMCRKKRFNV-LHVNVTGNNHVLSLADQYRFVNNVTKWRD 87
++I+NR + S V + + NV L+ V N + +Q ++W +
Sbjct: 759 IVIFNRPTRVDSEQMVPLFRQLAAMNDINVVLNGPVRTMNRTRTEKEQL----IESEWAN 814
Query: 88 R--RPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYY----FLRYGDNYRPHL 141
R ++Y H ++DF+ FG K +P++I++++ P+DR+++ +Y +++ R +
Sbjct: 815 ELERGSIYMAHSNWLDFESFGFK-KPIYISLVKDPIDRMITDFYKRRSWVKRAIYRRMYP 873
Query: 142 VRKKHGD---KTTFDECIRLNRTEC---------SLENMWLQVPFLCGHAAACWVPGNPW 189
R++ D + +F+EC+R EC +++ Q + CG+ A C +
Sbjct: 874 GRRERPDEWYQQSFNECVRSRSPECLFVQHAVADPIQDFKRQSLYFCGNEADCLPFNSHH 933
Query: 190 ALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFF 226
A + AK + +Y +VG EE +++LE +P FF
Sbjct: 934 ATQIAKRRVEKEYSVVGTWEERNITLTVLEKYVPRFF 970
>gi|357608416|gb|EHJ65994.1| pipe [Danaus plexippus]
Length = 380
Score = 91.7 bits (226), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 74/280 (26%), Positives = 127/280 (45%), Gaps = 42/280 (15%)
Query: 29 VIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHV-LSLADQYRFVNNVTKWRD 87
++ +NRVPK GS +F+ + + K +F +V + L+ A+Q + VT
Sbjct: 87 LLFFNRVPKVGSQTFMELLRRLAIKNQFGFHRDSVQRVETIRLAPANQQVLASVVTSHAP 146
Query: 88 RRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKHG 147
PA Y H + +F +FG P+++N++R P++R++S+YY++R Y V +K
Sbjct: 147 --PASYIKHVCYTNFTRFGYP-SPIYVNVVRDPVERVISWYYYVRAPWYY----VERKQA 199
Query: 148 -----------DKTTFDECIRLNRTEC---------SLENMWLQVPFLCGHAAACWVPGN 187
K F+ C+ EC + + Q F CGH C +
Sbjct: 200 FPDLPLPDPAWLKKDFETCVLSGDRECRYLEGETHEGIGDHRRQTLFFCGHEPQCTPFNS 259
Query: 188 PWALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGG-------TDHFLTSNKSH 240
AL++AK + +Y +VGV E+L + E +P FF G + F N++H
Sbjct: 260 VEALQRAKRVVEQQYAVVGVLEDLNSTLLAFERYIPRFFTGALKMYWEELNTFNRINRNH 319
Query: 241 LRRTNRKIDPSEETVQQIKKSKIWELENELYEYALEQFHF 280
+ P E V+QI ++ + E E YE+ ++ H
Sbjct: 320 FKL------PVSEAVKQIVRAN-FTREIEFYEFCKQRLHL 352
>gi|195591493|ref|XP_002085475.1| GD12287 [Drosophila simulans]
gi|194197484|gb|EDX11060.1| GD12287 [Drosophila simulans]
Length = 516
Score = 91.3 bits (225), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 69/275 (25%), Positives = 131/275 (47%), Gaps = 34/275 (12%)
Query: 29 VIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLAD--QYRFVNNVTKWR 86
++ +NRVPK GS +F+ + + + F H + + LA+ Q +++
Sbjct: 227 LVFFNRVPKVGSQTFMELLRRLSERNNFQ-FHRDAVQKVETIRLAEDQQQEMAEVISELP 285
Query: 87 DRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLR-----------YGD 135
+ P+++ H F +F +F +P+++N++R P++R++S++Y++R + D
Sbjct: 286 E--PSVFIKHVCFTNFTKFNLP-RPIYLNVVRDPVERVISWFYYVRAPWYFVERKAAFPD 342
Query: 136 NYRPHLVRKKHGDKTTFDECIRLNRTECS---------LENMWLQVPFLCGHAAACWVPG 186
PH K F+ C+ EC+ + + Q F CGH C
Sbjct: 343 LPLPH----PAWLKKDFETCVLNGDQECTYTQGVTVEGIGDHRRQSLFFCGHDYECTPFN 398
Query: 187 NPWALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNK--SHLRRT 244
ALE+AK + +Y +VGV E+L +S+LE +P FF G D + TS + + + +
Sbjct: 399 TVGALERAKFAVEQQYAVVGVLEDLNTTLSVLEKYVPRFFEGVRDIYATSAEYLTKINKN 458
Query: 245 NRKIDPSEETVQQIKKSKIWELENELYEYALEQFH 279
N K SE ++++ E+ E Y++ ++ H
Sbjct: 459 NFKPPVSEHVKDIVRRNFTNEI--EFYQFCRQRLH 491
Score = 47.4 bits (111), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 30/102 (29%), Positives = 56/102 (54%), Gaps = 5/102 (4%)
Query: 50 MCRKKRFN-VLHVNVTGNNHVLSLADQYRFVNNVTKWRDR-RPALYHGHFGFIDFQQFGS 107
M R + N H G+ H + ++ R + + R +P +Y H +I+F +F
Sbjct: 4 MARLGKINGFTHARNKGSAHETIVMNKQRQNDLIADLLTRPKPHIYSQHIAYINFTRF-H 62
Query: 108 KEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKHGDK 149
+P++IN++R P+DR++S++Y++R YR ++ K GDK
Sbjct: 63 LPKPIYINLIRDPIDRIISWHYYIRAPWYYRD--MQAKLGDK 102
>gi|194874014|ref|XP_001973324.1| GG13413 [Drosophila erecta]
gi|190655107|gb|EDV52350.1| GG13413 [Drosophila erecta]
Length = 516
Score = 91.3 bits (225), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 69/275 (25%), Positives = 131/275 (47%), Gaps = 34/275 (12%)
Query: 29 VIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLAD--QYRFVNNVTKWR 86
++ +NRVPK GS +F+ + + + F H + + LA+ Q +++
Sbjct: 227 LVFFNRVPKVGSQTFMELLRRLSERNNFQ-FHRDAVQKVETIRLAEDQQQEMAEVISELP 285
Query: 87 DRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLR-----------YGD 135
+ P+++ H F +F +F +P+++N++R P++R++S++Y++R + D
Sbjct: 286 E--PSVFIKHVCFTNFTKFNLP-RPIYLNVVRDPVERVISWFYYVRAPWYFVERKAAFPD 342
Query: 136 NYRPHLVRKKHGDKTTFDECIRLNRTECS---------LENMWLQVPFLCGHAAACWVPG 186
PH K F+ C+ EC+ + + Q F CGH C
Sbjct: 343 LPLPH----PAWLKKDFETCVLNGDQECTYTQGVTVEGIGDHRRQSLFFCGHDYECTPFN 398
Query: 187 NPWALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNK--SHLRRT 244
ALE+AK + +Y +VGV E+L +S+LE +P FF G D + TS + + + +
Sbjct: 399 TVGALERAKFAVEQQYAVVGVLEDLNTTLSVLEKYVPRFFEGVRDIYATSAEYLTKINKN 458
Query: 245 NRKIDPSEETVQQIKKSKIWELENELYEYALEQFH 279
N K SE ++++ E+ E Y++ ++ H
Sbjct: 459 NFKPPVSEHVKDIVRRNFTNEI--EFYQFCRQRLH 491
Score = 48.1 bits (113), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 30/102 (29%), Positives = 57/102 (55%), Gaps = 5/102 (4%)
Query: 50 MCRKKRFN-VLHVNVTGNNHVLSLADQYRFVNNVTKWRDR-RPALYHGHFGFIDFQQFGS 107
M R + N H G+ H + ++ R + + + R +P +Y H +I+F +F
Sbjct: 4 MARLGKINGFTHARNKGSAHETIVMNRQRQNDLIAELLTRPKPHIYSQHIAYINFTRF-H 62
Query: 108 KEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKHGDK 149
+P++IN++R P+DR++S++Y++R YR ++ K GDK
Sbjct: 63 LPKPIYINLIRDPIDRIISWHYYIRAPWYYRD--MKAKLGDK 102
>gi|195354146|ref|XP_002043561.1| GM18995 [Drosophila sechellia]
gi|194127729|gb|EDW49772.1| GM18995 [Drosophila sechellia]
Length = 407
Score = 91.3 bits (225), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 83/287 (28%), Positives = 130/287 (45%), Gaps = 48/287 (16%)
Query: 16 SPSPETDSLSWDTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQ 75
+P + D+L +NR+ KTGS + + + ++ F +V G V+++ D
Sbjct: 118 TPRAQVDTL------FFNRITKTGSEKMMELLKILGKRHNFEARR-DVEGFYEVVNMHDA 170
Query: 76 Y--RFVN----NVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYY 129
Y F+ N TK Y H F+DF + P++IN++R P++RLVS++Y
Sbjct: 171 YAKNFIRTEVINCTKANS-----YTKHVAFLDFDLL-DEPWPIYINMVRDPIERLVSWFY 224
Query: 130 FLRYGDNYRP-HLVRKKH--GD----------KTTFDECIRLNRTECSLENMWL------ 170
++R P H +K GD K F+ CI EC E M +
Sbjct: 225 YVR-----APWHFAERKEMFGDAIVLPSIDWLKKDFNRCIEERDPECVYEQMEMGNLGDH 279
Query: 171 --QVPFLCGHAAACWVPGNPW-ALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFR 227
Q +LCG A +P N A+++AK+N+ Y +VG E+ +S+LE +P FF
Sbjct: 280 RRQSLYLCGQNMAVCMPFNSHEAMQRAKKNVEEHYAVVGTWEDTNITLSVLEGYIPRFFS 339
Query: 228 GGTDHFLTSNKSHLRRTNRKI-DPSEETVQQIKKSKIWELENELYEY 273
G D + KS L NR PS + S+ E ELY++
Sbjct: 340 GAKDEYYAVKKS-LGNVNRNTYRPSLSDKARAVLSQNLTREIELYQF 385
>gi|313225810|emb|CBY07284.1| unnamed protein product [Oikopleura dioica]
Length = 333
Score = 91.3 bits (225), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 73/279 (26%), Positives = 129/279 (46%), Gaps = 30/279 (10%)
Query: 30 IIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYR--------FVNN 81
I +N++PK GST+ N+ + FN++H + + D +
Sbjct: 59 IFHNKIPKCGSTTMANILAALETTNNFNLIHYHPCIKSPCDKALDGRKNSDYLTQELAPE 118
Query: 82 VTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHL 141
V + RP + H F +F + EQP IN+ R P+ R VS +YF RYG N +
Sbjct: 119 VEEASADRPLVLVKHHHFANFTAY-DMEQPTMINVARDPVSRFVSSFYFRRYGFNRNEGV 177
Query: 142 VRKKHGDK---TTFDECIRLNRTECSLENMWLQVPFLCGHA---AACWVPGNP----WAL 191
R+ G K +EC+ ECS E +L+ ++CG C NP AL
Sbjct: 178 RREFIGRKKQEMGLEECVMSEAHECS-EAAFLE--YICGSDKFWPECGNISNPKSRERAL 234
Query: 192 EKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHF--LTSNKSHLRRTNRKID 249
E+AK++++ +YL +G+ E++ + ++L E +P+ F G + + + ++ T +K
Sbjct: 235 ERAKKHVLNEYLAIGILEDIENTLTLFERMVPTVFHGAPAVYRAIGTTITNQTSTAKKEP 294
Query: 250 PSEETVQQIKKS------KIWELENELYEYALEQFHFVK 282
S+ ++++K +++L +YE L F K
Sbjct: 295 VSDLVREKLEKGPLRHQVDLYKLIKAVYEQKLRDFGISK 333
>gi|194751646|ref|XP_001958136.1| GF10766 [Drosophila ananassae]
gi|190625418|gb|EDV40942.1| GF10766 [Drosophila ananassae]
Length = 311
Score = 90.9 bits (224), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 59/219 (26%), Positives = 111/219 (50%), Gaps = 17/219 (7%)
Query: 29 VIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLAD-QYRFVNNVTKWRD 87
++ +NRVPK GS S + + Y + K F V + V A+ Q + + ++
Sbjct: 32 LLFFNRVPKVGSESLIALMYRLGEKNDFQVERAPFSKPVGVFWTAERQKQEAKRIFDLQE 91
Query: 88 RRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYG-DNYRPHLVRKKH 146
+ Y H F++F+ F QP++IN++R P++R++S++Y+ R ++ + + K
Sbjct: 92 QPAFAYVEHMNFMNFRPF-HHPQPIYINLVRDPVERVISWFYYKRTPWNSVKMFEITGKF 150
Query: 147 GDKT----TFDECIRLNRTECSLE----------NMWLQVPFLCGHAAACWVPGNPWALE 192
+++ F++C+ + EC + + Q F CGH+ C P A+
Sbjct: 151 QNRSHYVKNFEQCVLTHDFECRYDYGLHFKDDTADHKRQSLFFCGHSPLCEPFNTPAAIA 210
Query: 193 KAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTD 231
KAK+N+ Y +VG E++ +++LE +P FF+G TD
Sbjct: 211 KAKQNVERDYSVVGSWEDVNVTLTVLEHYIPRFFKGVTD 249
>gi|198463779|ref|XP_002135580.1| GA28236 [Drosophila pseudoobscura pseudoobscura]
gi|198151408|gb|EDY74207.1| GA28236 [Drosophila pseudoobscura pseudoobscura]
Length = 301
Score = 90.5 bits (223), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 66/270 (24%), Positives = 128/270 (47%), Gaps = 21/270 (7%)
Query: 29 VIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVL-SLADQYRFVNNVTKWRD 87
++ +NRVPKTGS + + + + ++ F V S Q + + +
Sbjct: 22 ILFFNRVPKTGSETLIELMLRLGKRNHFQNARSPFAKPTGVYWSFEKQKEEAHRILDLME 81
Query: 88 RRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYG-DNYRPHLVRKKH 146
Y H +++F+QF QP++IN++R P++R++S+YY+ R ++ + + K
Sbjct: 82 EDAFAYAEHANYVNFRQFHLP-QPIYINLVRDPVERVISWYYYKRTPWNSLQMFKITGKF 140
Query: 147 GDKT----TFDECIRLNRTECSLE----------NMWLQVPFLCGHAAACWVPGNPWALE 192
++T F++C+ + EC + + Q F CGHA C P A+
Sbjct: 141 ENRTHYTKNFEDCVLSHDFECRYDYGLNFKDDPADHKRQSLFFCGHAPLCEPFNTPSAVA 200
Query: 193 KAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKS---HLRRTNRKID 249
+AK+N+ + ++G E++ +++LE +P FFRG TD + K R TN
Sbjct: 201 RAKQNVERDFSVIGSWEDVNVTLAVLEHFIPRFFRGSTDLYYEPVKGLAFKKRNTNHWKP 260
Query: 250 PSEETVQQIKKSKIWELENELYEYALEQFH 279
E +++I ++ + E E Y + ++ +
Sbjct: 261 KISERIKRIMRANFTQ-EYEFYHFCKQRLY 289
>gi|157112628|ref|XP_001657596.1| heparan sulfate 2-o-sulfotransferase [Aedes aegypti]
gi|108878000|gb|EAT42225.1| AAEL006219-PA [Aedes aegypti]
Length = 379
Score = 90.5 bits (223), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 67/264 (25%), Positives = 123/264 (46%), Gaps = 34/264 (12%)
Query: 29 VIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLAD--QYRFVNNVTKWR 86
+I +NRVPK GS +F+ + + + + H +V V+ L+ Q V+
Sbjct: 104 IIFFNRVPKVGSQTFMELLRRLAIRNEY-TFHRDVVQRLEVIRLSPDRQQELAEMVSDLP 162
Query: 87 DRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRP-HLVRKK 145
P++Y H + +F +FG P+++N++R P++R++S+YY++R P + V +K
Sbjct: 163 --MPSVYVKHVCYTNFTRFGLP-MPIYVNMVRDPVERIISWYYYVR-----APWYFVERK 214
Query: 146 HG-----------DKTTFDECIRLNRTECSLENMWL---------QVPFLCGHAAACWVP 185
K F+ C+ EC+ + Q F CGH C
Sbjct: 215 QAFPDLPLPDPRWLKKDFETCVLQGDPECTYAQGAIHEGIGDHRRQTLFFCGHDEQCLPF 274
Query: 186 GNPWALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNK--SHLRR 243
+ ALE+AK + ++Y +VGV E+L +++LE +P FF G + + + +
Sbjct: 275 NSQGALERAKYAVESQYAVVGVLEDLNTTLAVLERYVPKFFSGAANVYFNEVNLLQKINK 334
Query: 244 TNRKIDPSEETVQQIKKSKIWELE 267
N K S E + ++++ E+E
Sbjct: 335 NNFKPPVSHEIKELVRRNFTREIE 358
>gi|195377461|ref|XP_002047508.1| GJ11899 [Drosophila virilis]
gi|194154666|gb|EDW69850.1| GJ11899 [Drosophila virilis]
Length = 616
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 68/273 (24%), Positives = 133/273 (48%), Gaps = 31/273 (11%)
Query: 29 VIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGN-NHVLSLADQYRFVNNVTKWRD 87
V+ YNRVPKTGS + + + + + V G +L A++ +++N+ D
Sbjct: 23 VLFYNRVPKTGSMQLIELMRALGKVHDYEVEKDPQNGGIRPLLDAAEEGDWIDNIVNLED 82
Query: 88 RRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKHG 147
++ H +++F + + +P++IN++R P+DR++S+YY++R + P R++H
Sbjct: 83 --GTVFASHVNYLNFSKH-EQPRPIYINMVRDPVDRVISWYYYIRAPWVFVPG--RRRHN 137
Query: 148 DK--------TTFDECIRLNRTECS-LENMWL--------QVPFLCGHAAACWVPGNP-W 189
+ T FD+C+ C+ +E L Q F CGH P N
Sbjct: 138 REMPNPQWVNTEFDQCVLSGEKVCTYIEGSLLERVGDHRRQTLFFCGHNEFKCTPFNSRL 197
Query: 190 ALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTN---R 246
AL+ AK+N+ +Y +VG E + +++LEA +P +F + + + H + N
Sbjct: 198 ALQLAKQNVEREYAVVGTWEHTNETLAVLEAYVPRYFADASKMYYSG--LHAAKANDNPM 255
Query: 247 KIDPSEETVQQIKKSKIWELENELYEYALEQFH 279
K +E + ++++ E+ E Y++ ++ H
Sbjct: 256 KPHIRDEIMNMVRRNFTREI--EFYQFCRQRLH 286
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 67/278 (24%), Positives = 119/278 (42%), Gaps = 38/278 (13%)
Query: 29 VIIYNRVPKTGSTSFVNM-----AYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVT 83
V+ +NR K GS + + + Y+ + LH N + + FV +
Sbjct: 333 VLFFNRAAKVGSEALLELFNALVEYNEGLILERSGLHENTVRQMDKEAQQEAAEFVAGLE 392
Query: 84 KWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVR 143
+ +Y H ++DF F QP++IN++R P++R++S++Y+ R +Y+ +
Sbjct: 393 E-----GTIYIRHINWLDFSNFDLP-QPIYINMVRDPVERVISWFYYAR--SSYKNAIEY 444
Query: 144 KKHGDKT---------TFDECIRLNRTECS---------LENMWLQVPFLCGHAAACWVP 185
+K +K F++C+R EC + N Q F CGH C
Sbjct: 445 RKAPNKKIKPASWYKKNFNDCVRSGDPECQYVPHTVKDYVPNFKRQSLFYCGHHDDCIPF 504
Query: 186 GNPWALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTN 245
+P A++ AKE++ Y +VG E+ +++ E +P FF G + N R
Sbjct: 505 NSPTAIQMAKEHVERDYAVVGSWEDTNITLTVFERYIPRFFTGAKLMYEMHNNKITNRNK 564
Query: 246 RKIDPSEETVQQIKKSKIWELENELYEYALEQFHFVKK 283
K P E ++ E+ + Y E +HF K+
Sbjct: 565 NKRKPYIE-------PEVKEMIRRNFTYEYEFYHFCKQ 595
>gi|195377457|ref|XP_002047506.1| GJ11902 [Drosophila virilis]
gi|194154664|gb|EDW69848.1| GJ11902 [Drosophila virilis]
Length = 306
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 68/275 (24%), Positives = 129/275 (46%), Gaps = 34/275 (12%)
Query: 29 VIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLAD--QYRFVNNVTKWR 86
++ +NRVPK GS +F+ + + + F H + + LA+ Q +++
Sbjct: 19 LVFFNRVPKVGSQTFMELLRRLSERNNFQ-FHRDAVQKVETIRLAEDQQQEMAEVISELP 77
Query: 87 DRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLR-----------YGD 135
+ P+++ H F +F +F P+++N++R P++R++S++Y++R + D
Sbjct: 78 E--PSVFIKHVCFTNFTKFNLP-TPIYLNVVRDPVERVISWFYYVRAPWYFVERKAAFPD 134
Query: 136 NYRPHLVRKKHGDKTTFDECIRLNRTECS---------LENMWLQVPFLCGHAAACWVPG 186
PH K F+ C+ EC+ + + Q F CGH C
Sbjct: 135 LPLPH----PAWLKKDFETCVLSGDQECTYTQGVTVEGIGDHRRQSLFFCGHDYECTPFN 190
Query: 187 NPWALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNK--SHLRRT 244
ALE+AK + +Y +VGV E+ +S+LE +P FF G D + TS + + + +
Sbjct: 191 TVGALERAKFAVEQQYAVVGVLEDFNTTLSVLEKYVPRFFEGVRDIYATSAEYLTKINKN 250
Query: 245 NRKIDPSEETVQQIKKSKIWELENELYEYALEQFH 279
N K SE ++++ E+ E Y++ ++ H
Sbjct: 251 NFKPPVSEHVKDIVRRNFTNEI--EFYQFCRQRLH 283
>gi|195173234|ref|XP_002027398.1| GL20906 [Drosophila persimilis]
gi|194113250|gb|EDW35293.1| GL20906 [Drosophila persimilis]
Length = 471
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 67/269 (24%), Positives = 128/269 (47%), Gaps = 23/269 (8%)
Query: 29 VIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNN-HVLSLADQYRFVNNVTKWRD 87
V+ +NRVPKTGS + + + + + V G +L L +Q + N+ D
Sbjct: 22 VLFFNRVPKTGSMQLIELMRQLGKVHDYEVEKDPQNGGVLPILELPEQSDMIENIVNLED 81
Query: 88 RRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKHG 147
++ H +++F + + +P++IN++R P+DR+VS+YY++R + P R
Sbjct: 82 --GTVFASHVNYLNFTK-NEQPRPIYINMVRDPVDRVVSWYYYIRAPWIFVPGRRRSNRE 138
Query: 148 D------KTTFDECIRLNRTECS-LENMWL--------QVPFLCGHAAACWVPGNP-WAL 191
T +D+C+ C+ +E L Q+ F CGH P N AL
Sbjct: 139 MPNPKWVNTEYDQCVLSGEKVCTYIEGSLLEHVGDHRRQILFFCGHDEFKCTPFNSRLAL 198
Query: 192 EKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTN-RKIDP 250
+ AK N+ +Y +VG E + +++LEA +P +F + + + + + N K
Sbjct: 199 QIAKLNVEREYAVVGTWEHTNETLAVLEAYVPRYFADASKMYYSGLHAEKQNDNPMKPHI 258
Query: 251 SEETVQQIKKSKIWELENELYEYALEQFH 279
S+E + ++++ E+ E Y++ ++ H
Sbjct: 259 SQEILDMVRRNFTREI--EFYQFCRQRLH 285
Score = 51.2 bits (121), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 38/147 (25%), Positives = 75/147 (51%), Gaps = 16/147 (10%)
Query: 29 VIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTG----NNHVLSLADQYRFVNNVTK 84
V+ +NR K GS + + + + + + L ++ TG + L DQ V
Sbjct: 317 VLFFNRAAKVGSEAMLELLQAL--ENYHDDLTLDRTGLSKPTSRQLKKNDQRDMAEYVAD 374
Query: 85 WRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLR----YGDNYR-- 138
+ ++Y H +IDF +F + +P++IN++R P++R++S++++ R YR
Sbjct: 375 LEE--GSIYIEHINWIDFDEF-DQPKPIYINMVRDPVERVISWFFYARGSYKNAIEYRKK 431
Query: 139 PHL-VRKKHGDKTTFDECIRLNRTECS 164
P+L ++K+ K F+EC++ EC
Sbjct: 432 PNLKIKKESWYKKNFNECVKSGDPECQ 458
>gi|195128109|ref|XP_002008508.1| GI13544 [Drosophila mojavensis]
gi|193920117|gb|EDW18984.1| GI13544 [Drosophila mojavensis]
Length = 306
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 68/275 (24%), Positives = 129/275 (46%), Gaps = 34/275 (12%)
Query: 29 VIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLAD--QYRFVNNVTKWR 86
++ +NRVPK GS +F+ + + + F H + + LA+ Q +++
Sbjct: 19 LVFFNRVPKVGSQTFMELLRRLSERNNFQ-FHRDAVQKVETIRLAEDQQQEMAEVISELP 77
Query: 87 DRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLR-----------YGD 135
+ P+++ H F +F +F P+++N++R P++R++S++Y++R + D
Sbjct: 78 E--PSVFIKHVCFTNFTKFNLP-TPIYLNVVRDPVERVISWFYYVRAPWYFVERKAAFPD 134
Query: 136 NYRPHLVRKKHGDKTTFDECIRLNRTECS---------LENMWLQVPFLCGHAAACWVPG 186
PH K F+ C+ EC+ + + Q F CGH C
Sbjct: 135 LPLPH----PAWLKKDFETCVLSGDQECTYTQGVTVEGIGDHRRQSLFFCGHDYECTPFN 190
Query: 187 NPWALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNK--SHLRRT 244
ALE+AK + +Y +VGV E+ +S+LE +P FF G D + TS + + + +
Sbjct: 191 TVGALERAKFAVEQQYAVVGVLEDFNTTLSVLEKYVPRFFEGVRDIYATSAEYLTKINKN 250
Query: 245 NRKIDPSEETVQQIKKSKIWELENELYEYALEQFH 279
N K SE ++++ E+ E Y++ ++ H
Sbjct: 251 NFKPPVSEHVKDIVRRNFTNEI--EFYQFCRQRLH 283
>gi|195591495|ref|XP_002085476.1| GD12286 [Drosophila simulans]
gi|194197485|gb|EDX11061.1| GD12286 [Drosophila simulans]
Length = 397
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 85/296 (28%), Positives = 135/296 (45%), Gaps = 51/296 (17%)
Query: 10 HISSAK---SPSPETDSLSWDTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGN 66
H+++A+ +P + D+L +NR+ KTGS + + + ++ F +V G
Sbjct: 99 HLTTAQLNNTPRAQVDTL------FFNRITKTGSEKMMELLKILGKRHNFEARR-DVEGF 151
Query: 67 NHVLSLADQY--RFVN----NVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKP 120
V+ + D Y F+ N TK Y H F+DF + P++IN++R P
Sbjct: 152 YEVVIMHDAYAKNFIRTEVINCTKANS-----YTKHVAFLDFDLL-DEPWPIYINMVRDP 205
Query: 121 LDRLVSYYYFLRYGDNYRP-HLVRKKH--GD----------KTTFDECIRLNRTECSLEN 167
++RLVS++Y++R P H +K GD K F+ CI EC E
Sbjct: 206 IERLVSWFYYVR-----APWHFAERKEMFGDAIVLPSIDWLKKDFNRCIEERDPECVYEQ 260
Query: 168 MWL--------QVPFLCGHAAACWVPGNPW-ALEKAKENLVTKYLLVGVTEELTDFVSLL 218
M + Q +LCG A +P N A+++AK+N+ Y +VG E+ +S+L
Sbjct: 261 MEMGNLGDHRRQSLYLCGQNMAVCMPFNSHEAMQRAKKNVEEHYAVVGTWEDTNITLSVL 320
Query: 219 EAALPSFFRGGTDHFLTSNKSHLRRTNRKI-DPSEETVQQIKKSKIWELENELYEY 273
E +P FF G D + KS L NR PS + S+ E ELY++
Sbjct: 321 EGYIPRFFSGVKDEYYALKKS-LGNVNRNTYRPSLSDKARAVLSQNLTREIELYQF 375
>gi|198463775|ref|XP_002135578.1| GA28238 [Drosophila pseudoobscura pseudoobscura]
gi|198151406|gb|EDY74205.1| GA28238 [Drosophila pseudoobscura pseudoobscura]
Length = 1157
Score = 89.7 bits (221), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 67/269 (24%), Positives = 128/269 (47%), Gaps = 23/269 (8%)
Query: 29 VIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNN-HVLSLADQYRFVNNVTKWRD 87
V+ +NRVPKTGS + + + + + V G +L L +Q + N+ D
Sbjct: 22 VLFFNRVPKTGSMQLIELMRQLGKVHDYEVEKDPQNGGVLPILELPEQSDMIENIVNLED 81
Query: 88 RRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKHG 147
++ H +++F + + +P++IN++R P+DR+VS+YY++R + P R
Sbjct: 82 --GTVFASHVNYLNFTK-NEQPRPIYINMVRDPVDRVVSWYYYIRAPWIFVPGRRRSNRE 138
Query: 148 D------KTTFDECIRLNRTECS-LENMWL--------QVPFLCGHAAACWVPGNP-WAL 191
T +D+C+ C+ +E L Q+ F CGH P N AL
Sbjct: 139 MPNPKWVNTEYDQCVLSGEKVCTYIEGSLLEHVGDHRRQILFFCGHDEFKCTPFNSRLAL 198
Query: 192 EKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTN-RKIDP 250
+ AK N+ +Y +VG E + +++LEA +P +F + + + + + N K
Sbjct: 199 QIAKLNVEREYAVVGTWEHTNETLAVLEAYVPRYFADASKMYYSGLHAEKQNDNPMKPHI 258
Query: 251 SEETVQQIKKSKIWELENELYEYALEQFH 279
S+E + ++++ E+ E Y++ ++ H
Sbjct: 259 SQEILDMVRRNFTREI--EFYQFCRQRLH 285
Score = 87.4 bits (215), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 65/242 (26%), Positives = 118/242 (48%), Gaps = 25/242 (10%)
Query: 29 VIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTG----NNHVLSLADQYRFVNNVTK 84
V+ +NR K GS + + + + + + L ++ TG + L DQ V
Sbjct: 317 VLFFNRAAKVGSEAMLELLQAL--ENYHDDLTLDRTGLSKPTSRQLKKNDQRDMAEYVAD 374
Query: 85 WRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLR----YGDNYR-- 138
+ ++Y H +IDF +F + +P++IN++R P++R++S++++ R YR
Sbjct: 375 LEE--GSIYIEHINWIDFDEF-DQPKPIYINMVRDPVERVISWFFYARGSYKNAIEYRKK 431
Query: 139 PHL-VRKKHGDKTTFDECIRLNRTEC---------SLENMWLQVPFLCGHAAACWVPGNP 188
P+L ++K+ K F+EC++ EC ++ N Q F CGH C +P
Sbjct: 432 PNLKIKKESWYKKNFNECVKSGDPECQYIPHTVKDAVPNFKRQTLFYCGHHDDCIPFNSP 491
Query: 189 WALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKI 248
A++ AKE++ Y +VG E+ +++LE +P FFRG + +N + R K
Sbjct: 492 TAVQMAKEHVERDYAVVGSWEDTNITLTVLERYIPRFFRGAKLMYEMNNNKIVNRNKNKR 551
Query: 249 DP 250
P
Sbjct: 552 KP 553
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 59/225 (26%), Positives = 109/225 (48%), Gaps = 26/225 (11%)
Query: 29 VIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKW-RD 87
V+ +NR K GS + + + M V+ N + A + R + V W D
Sbjct: 915 VVFFNRGAKVGSEALMQLTETMAPLNNMTVVTKGPIDINARIR-APRERMLQAV--WVAD 971
Query: 88 RRPA-LYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKH 146
P +Y H ++DF ++ +P++IN++R P++R++S++Y++R G YR ++ +
Sbjct: 972 LEPGTIYIEHCNWLDFHRY-QLPKPIYINMVRDPVERMISWFYYIRSG--YRNAIIHNRF 1028
Query: 147 GDKT---------TFDECIRLNRTEC---------SLENMWLQVPFLCGHAAACWVPGNP 188
+ T ++++C+R EC ++ + Q F CG+ C +P
Sbjct: 1029 PNTTLKSEKWFKKSYNQCVRSGDPECQYVPESIKDAVGDYKRQSLFYCGNNRKCLPFDSP 1088
Query: 189 WALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHF 233
A++ AK N+ Y +VG E+ +++LEA +P FFRG F
Sbjct: 1089 HAIQLAKRNVERDYAVVGSWEDTNITLAVLEAYIPRFFRGARQVF 1133
Score = 79.0 bits (193), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 51/177 (28%), Positives = 91/177 (51%), Gaps = 23/177 (12%)
Query: 91 ALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKHGD-- 148
++Y H +IDF + +P+FIN++R P++R++S+YY++R ++YR + +K+
Sbjct: 675 SVYIEHTNWIDFNAYNLP-KPIFINMVRDPVERMISWYYYIR--NSYRNAIFYRKNPLAP 731
Query: 149 -------KTTFDECIRLNRTEC---------SLENMWLQVPFLCGHAAACWVPGNPWALE 192
K +F++C+R EC ++ N Q F CGH C +P A++
Sbjct: 732 IKPTAWFKKSFNDCVRSGDQECQYIPLTVKDAVPNFKRQSIFFCGHEPDCLPFNSPLAVQ 791
Query: 193 KAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHL--RRTNRK 247
AK + T++ +VG EE +++LE +P +F T + S + R NRK
Sbjct: 792 IAKRRVETEFAVVGTWEETNITLAVLEHYIPRYFARATMIYKIYQDSIINRNRNNRK 848
>gi|195022676|ref|XP_001985619.1| GH14409 [Drosophila grimshawi]
gi|193899101|gb|EDV97967.1| GH14409 [Drosophila grimshawi]
Length = 617
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 65/271 (23%), Positives = 133/271 (49%), Gaps = 27/271 (9%)
Query: 29 VIIYNRVPKTGSTSFVNMAYDMCRKKRFNV-LHVNVTGNNHVLSLADQYRFVNNVTKWRD 87
V+ +NRVPKTGS + + + + ++V + G +L ++ ++ N+ D
Sbjct: 323 VLFFNRVPKTGSVHLITLMKSLGKIHDYDVDKDPQIGGIQAILQPDEEADWIENIANLED 382
Query: 88 RRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKHG 147
+++ H +++F ++ S+ +P++IN++R P++R++S++YF+R + P R
Sbjct: 383 --GSVFASHVNYLNFSKY-SQPRPIYINMVRDPVERVISWHYFIRAPWIFVPGRRRNNRE 439
Query: 148 ------DKTTFDECIRLNRTECS-LENMWL--------QVPFLCGHAAACWVPGNP-WAL 191
FD+C+ C+ +E L Q F CGH P N AL
Sbjct: 440 MPNPKWANMEFDQCVESKEKVCTYIEGSLLERAGDHRRQTLFFCGHNEFKCTPFNSRLAL 499
Query: 192 EKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTN---RKI 248
+ AKEN+ +Y +VG E + +++LEA +P +F + + + H +++N K
Sbjct: 500 QLAKENVEREYAVVGTWEHTNETLAVLEAYVPRYFADASKLYYSG--LHAKKSNDNPMKP 557
Query: 249 DPSEETVQQIKKSKIWELENELYEYALEQFH 279
+E + ++++ E+ E Y++ ++ H
Sbjct: 558 HIRKEIIDMVRRNFTREI--EFYQFCRQRLH 586
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 65/290 (22%), Positives = 131/290 (45%), Gaps = 33/290 (11%)
Query: 30 IIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTG-NNHVLSLADQYRFVNNVTKWRDR 88
I++NR+ K GS S + + F+ + + ++ F + + D
Sbjct: 35 ILFNRLEKVGSQSMTRLLGHLSNLNGFHTFRNQIPDVKKPLFDFEEEVSFAEELQEIED- 93
Query: 89 RPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVR----- 143
PA Y H +I+F +P++IN++R P+ +++S YY+ R+ Y L+R
Sbjct: 94 -PAAYVEHTNWINFTA-HDMPRPIYINLVRHPIQKVISAYYYARHPMIYANSLLRNPNKP 151
Query: 144 ---KKHGDKTTFDECIRLNRTE-CSLENM------WLQVPF-LCGHAAACWVPGNPWALE 192
K+ D++ F+EC+R C+ + W + LCG+ C + A +
Sbjct: 152 VDNKEFFDRS-FNECVRKRIAPYCNFDAHLNYNKDWRRFTLHLCGNQKVCLNFNSEEATQ 210
Query: 193 KAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSE 252
AK ++ +Y +VG E +++LEA +P FF T+ + H + + P +
Sbjct: 211 IAKLHVEKEYAVVGSWEHTNITLAVLEAYIPRFFADATNQYYL----HQEKFRINVTPHD 266
Query: 253 ETVQQIKKSKIWELENELYEYALEQFHF----VKKHNLVYNKVLGYEADK 298
+ + + ++ + N+ + Y ++ +HF + K + K+L +A+K
Sbjct: 267 KHLDEDVEAYL----NQQFSYEIDLYHFCMQRLYKQYIAIKKILQLDANK 312
>gi|28574849|ref|NP_788537.1| pipe, isoform K [Drosophila melanogaster]
gi|28380478|gb|AAO41233.1| pipe, isoform K [Drosophila melanogaster]
gi|313661541|gb|ADR71725.1| RE27522p [Drosophila melanogaster]
Length = 406
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 80/296 (27%), Positives = 132/296 (44%), Gaps = 49/296 (16%)
Query: 10 HISSAK---SPSPETDSLSWDTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGN 66
H+++A+ +P + D+L +NR+ KTGS + + + ++ F +V G
Sbjct: 108 HLTTAQLNNTPRAQVDTL------FFNRITKTGSEKMMELLKILGKRHNFEARR-DVEGF 160
Query: 67 NHVLSLADQY--RFVN----NVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKP 120
V+ + D + F+ N TK Y H F+DF + P++IN++R P
Sbjct: 161 YEVVIMHDAFAKNFIRTEVLNCTKANS-----YTKHVAFLDFDLL-DEPWPIYINMVRDP 214
Query: 121 LDRLVSYYYFLRYGDNYRP-HLVRKKH--GD----------KTTFDECIRLNRTECSLEN 167
++RLVS++Y++R P H +K GD + F+ CI EC E
Sbjct: 215 IERLVSWFYYVR-----APWHFAERKEMFGDAIVLPSIDWLRKDFNRCIEERDPECVYEQ 269
Query: 168 MWL--------QVPFLCGHAAACWVPGNPW-ALEKAKENLVTKYLLVGVTEELTDFVSLL 218
M + Q +LCG A +P N A+++AK+N+ Y +VG E+ +S+L
Sbjct: 270 MEMGNLGDHRRQSLYLCGQNMAVCMPFNSHEAMQRAKKNVEEHYAVVGTWEDTNTTLSVL 329
Query: 219 EAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEETVQQIKKSKIWELENELYEYA 274
E +P FF G D + KS PS + S+ E ELY++
Sbjct: 330 EGYIPRFFSGAKDQYYALRKSLGNYNRNTYRPSLSDKARAVLSQNLTREIELYQFV 385
>gi|28574835|ref|NP_788533.1| pipe, isoform D [Drosophila melanogaster]
gi|28380474|gb|AAO41229.1| pipe, isoform D [Drosophila melanogaster]
Length = 418
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 69/282 (24%), Positives = 133/282 (47%), Gaps = 29/282 (10%)
Query: 16 SPSPETDSLSWDTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNN-HVLSLAD 74
+P E D V+ +NRVPKTGS + + + + ++V G +L A+
Sbjct: 117 TPKAEID------VLFFNRVPKTGSMQLIELMRQLGKVHDYDVEKDPQQGGVIPILETAE 170
Query: 75 QYRFVNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYG 134
Q ++N+ D ++ H F++F + + +P++IN++R P++R++S+YY++R
Sbjct: 171 QSDLIDNIVNLDD--GTVFASHVNFLNFTKH-EQPRPIYINMVRDPVERVISWYYYIRAP 227
Query: 135 DNYRPHLVRKKHGD------KTTFDECIRLNRTECS-LENMWL--------QVPFLCGHA 179
+ P R T FD+C+ C+ +EN L Q F CGH
Sbjct: 228 WVFVPGRRRNNREMPNPKWVNTEFDQCVTSGEKVCTYIENSLLEHVGDHRRQTLFFCGHN 287
Query: 180 AACWVPGNP-WALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNK 238
P N L+ AK N+ +Y +VG E + +++LEA +P +F + + +
Sbjct: 288 EFQCTPFNARLPLQLAKMNVEREYSVVGTWEHTNETLAVLEAYVPRYFADASKMYYSGLH 347
Query: 239 SHLRRTN-RKIDPSEETVQQIKKSKIWELENELYEYALEQFH 279
+ + N K S++ + ++++ E+ E Y++ ++ H
Sbjct: 348 ADKQNVNPMKPHISQDILDMVRRNFTREI--EFYQFCRQRLH 387
>gi|195435604|ref|XP_002065770.1| GK19630 [Drosophila willistoni]
gi|194161855|gb|EDW76756.1| GK19630 [Drosophila willistoni]
Length = 301
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 68/275 (24%), Positives = 130/275 (47%), Gaps = 34/275 (12%)
Query: 29 VIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLAD--QYRFVNNVTKWR 86
++ +NRVPK GS +F+ + + + F H + + LA+ Q +++
Sbjct: 19 LVFFNRVPKVGSQTFMELLRRLSERNNFQ-FHRDAVQKVETIRLAEDQQQEMAEVISELP 77
Query: 87 DRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLR-----------YGD 135
+ P+++ H F +F +F P+++N++R P++R++S++Y++R + D
Sbjct: 78 E--PSVFIKHVCFTNFTKFNLP-TPIYLNVVRDPVERVISWFYYVRAPWYFVERKAAFPD 134
Query: 136 NYRPHLVRKKHGDKTTFDECIRLNRTECS---------LENMWLQVPFLCGHAAACWVPG 186
PH K F+ C+ EC+ + + Q F CGH C
Sbjct: 135 LPLPH----PGWLKKDFETCVLSGDQECTYTQGVTVEGIGDHRRQSLFFCGHDYECTPFN 190
Query: 187 NPWALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNK--SHLRRT 244
ALE+AK + +Y +VGV E+L +S+LE +P FF G + + TS + + + +
Sbjct: 191 TVGALERAKFAVEQQYAVVGVLEDLNTTLSVLEKYVPRFFEGVRNIYATSAEYLTKINKN 250
Query: 245 NRKIDPSEETVQQIKKSKIWELENELYEYALEQFH 279
N K SE ++++ E+ E Y++ ++ H
Sbjct: 251 NFKPPVSEHVKDIVRRNFTNEI--EFYQFCRQRLH 283
>gi|195128097|ref|XP_002008502.1| GI13537 [Drosophila mojavensis]
gi|193920111|gb|EDW18978.1| GI13537 [Drosophila mojavensis]
Length = 371
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 71/281 (25%), Positives = 135/281 (48%), Gaps = 43/281 (15%)
Query: 30 IIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNH---VLSLADQYRFVNNVTKWR 86
+ +NRVPK GS S + + + + F +H G+ +L Q + ++
Sbjct: 71 VFFNRVPKVGSQSLMELMRRLGKINGF--VHARNKGSVRETIMLPKEGQKTLIGDLLTRP 128
Query: 87 DRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLR----YGDNYRPHLV 142
+P +Y H +I+F +F +P++IN++R P++R++S++Y++R Y D +
Sbjct: 129 --KPHVYSQHIAYINFTRF-QMPRPIYINLVRDPIERIISWHYYVRARWYYND------M 179
Query: 143 RKKHGDKTT-----------FDECIRLNRTECSLENMWLQVP---------FLCGHAAAC 182
+ K G+ D C+R + C+ E M ++ P F CG
Sbjct: 180 KAKLGENAIKMPSDEFLNLDLDTCVRNHDPHCTFEQMQVKNPAGDHRRQTLFFCGMNRKL 239
Query: 183 WVPGN-PWALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFR-GGTDHFLTSNK-S 239
+P N P A++KAK + ++Y +VG E+ +++LE +P +FR ++L + S
Sbjct: 240 CMPFNSPVAMQKAKHTVESEYAVVGTWEDTNITLTVLEHYIPRYFRHAKVAYYLGEERLS 299
Query: 240 HLRRTNRKIDPSEETVQQIKKSKIWELENELYEYALEQFHF 280
+ R N S+ET Q ++K+ E+ E YE+ ++ +
Sbjct: 300 RINRNNVTRIVSDETRQILRKNLTNEI--EFYEFCKQRLYL 338
>gi|195435612|ref|XP_002065774.1| GK19587 [Drosophila willistoni]
gi|194161859|gb|EDW76760.1| GK19587 [Drosophila willistoni]
Length = 319
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 67/270 (24%), Positives = 131/270 (48%), Gaps = 25/270 (9%)
Query: 29 VIIYNRVPKTGSTSFVNMAYDMCRKKRFNV-LHVNVTGNNHVLSLADQYRFVNNVTKWRD 87
V+ +NRVPKTGS + + + + ++V + G +L ++ + N++ D
Sbjct: 25 VLFFNRVPKTGSMQMIELMRQLGKVHDYDVEVDPQTGGIIPILEAVEESDWTENISNLED 84
Query: 88 RRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKHG 147
+++ H +++F + + +P++IN++R P++R++S+YYF+R + P R
Sbjct: 85 --GSVFVSHVNYLNFSK-NDQPRPIYINMVRDPVERVISWYYFIRAPWIFVPGRRRSNRE 141
Query: 148 D------KTTFDECIRLNRTECS-LENMWL--------QVPFLCGHAAACWVPGNP-WAL 191
T FD+C+ C+ +E L Q F CGH +P N AL
Sbjct: 142 MPNPKWVNTEFDQCVLNGEKVCTYIEGSPLERVGDHRRQTLFFCGHNEHKCIPFNTRLAL 201
Query: 192 EKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDP- 250
+ AK N+ +Y +VG E + +++LEA +P +F T + S S N + P
Sbjct: 202 QVAKMNVEREYAVVGTWEHTNETLAVLEAYVPRYFADATKMYY-SGLSGENINNNPMKPH 260
Query: 251 -SEETVQQIKKSKIWELENELYEYALEQFH 279
S++ + ++++ E+ E Y++ ++ H
Sbjct: 261 ISQDIIDMVRRNFTREI--EFYQFCRQRLH 288
>gi|4106864|gb|AAD04925.1| pipe sulfotransferase ST2 isoform [Drosophila melanogaster]
Length = 369
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 67/263 (25%), Positives = 126/263 (47%), Gaps = 32/263 (12%)
Query: 29 VIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLAD--QYRFVNNVTKWR 86
++ +NRVPK GS +F+ + + + F H + + LA+ Q +++
Sbjct: 114 LVFFNRVPKVGSQTFMELLRRLSERNNFQ-FHRDAVQKVETIRLAEDQQQEMAEVISELP 172
Query: 87 DRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLR-----------YGD 135
+ P+++ H F +F +F + +P+++N++R P++R++S++Y++R + D
Sbjct: 173 E--PSVFIKHVCFTNFTKF-NLPRPIYLNVVRDPVERVISWFYYVRAPWYFVERKAAFPD 229
Query: 136 NYRPHLVRKKHGDKTTFDECIRLNRTECS---------LENMWLQVPFLCGHAAACWVPG 186
PH K F+ C+ EC+ + + Q F CGH C
Sbjct: 230 LPLPHPAWLKKD----FETCVLNGDQECTYTQGVTVEGIGDHRRQSLFFCGHDYECTPFN 285
Query: 187 NPWALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNK--SHLRRT 244
ALE+AK + +Y +VGV E+L +S+LE +P FF G D + TS + + + +
Sbjct: 286 TVGALERAKFAVEQQYAVVGVLEDLNTTLSVLEKYVPRFFEGVRDIYATSAEYLTKINKN 345
Query: 245 NRKIDPSEETVQQIKKSKIWELE 267
N K SE ++++ E+E
Sbjct: 346 NFKPPVSEHVKDIVRRNFTNEIE 368
>gi|195377467|ref|XP_002047511.1| GJ11896 [Drosophila virilis]
gi|194154669|gb|EDW69853.1| GJ11896 [Drosophila virilis]
Length = 392
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 71/281 (25%), Positives = 135/281 (48%), Gaps = 43/281 (15%)
Query: 30 IIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNH---VLSLADQYRFVNNVTKWR 86
+ +NRVPK GS S + + + + F +H G+ +L Q + ++
Sbjct: 92 VFFNRVPKVGSQSLMELMRRLGKINGF--VHSRNPGSAKETIMLPKEGQKDLIGDLLTRP 149
Query: 87 DRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLR----YGDNYRPHLV 142
+P +Y H +I+F ++ +P++IN++R P++R++S++Y++R Y D +
Sbjct: 150 --KPHIYSQHIAYINFTRY-HLPRPIYINLVRDPIERIISWHYYVRARWYYND------M 200
Query: 143 RKKHGDKTT-----------FDECIRLNRTECSLENMWLQVP---------FLCGHAAAC 182
+ K GDK D C+R + C+ E M ++ P F CG
Sbjct: 201 KAKLGDKAIAMPSDEFLDLDLDTCVRNHDPHCTFEQMQIKNPVGDHRRQTLFFCGMNKKL 260
Query: 183 WVPGNP-WALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGG-TDHFLTSNK-S 239
+P N A++KAK + ++Y +VG E+ +++LE +P +FR ++L + S
Sbjct: 261 CMPFNSEMAMQKAKRTVESEYAVVGTWEDTNITLTVLEHYIPRYFRNAKVAYYLGEERLS 320
Query: 240 HLRRTNRKIDPSEETVQQIKKSKIWELENELYEYALEQFHF 280
+ R N S+ET Q ++K+ E+ E YE+ ++ +
Sbjct: 321 RVNRNNVTRIVSDETRQLLRKNLTNEI--EFYEFCKQRLYL 359
>gi|194874019|ref|XP_001973325.1| GG13411 [Drosophila erecta]
gi|190655108|gb|EDV52351.1| GG13411 [Drosophila erecta]
Length = 396
Score = 88.2 bits (217), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 72/261 (27%), Positives = 118/261 (45%), Gaps = 51/261 (19%)
Query: 10 HISSAK---SPSPETDSLSWDTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGN 66
H+++A+ +P + D+L +NRV KTGS + + + ++ F +
Sbjct: 98 HLTTAQLNNTPRAQVDTL------FFNRVAKTGSEKMMELLKILGKRNNFQARR-DAGAV 150
Query: 67 NHVLSLADQYR------FVNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKP 120
V+ + D + V N T+ Y H F+DF G + P++IN++R P
Sbjct: 151 YEVVIMHDAFAKNFIRTEVVNCTQANS-----YTKHVAFLDFDLLG-EPWPIYINLVRDP 204
Query: 121 LDRLVSYYYFLR----YGDNYRPHLVRKKHGD----------KTTFDECIRLNRTECSLE 166
++RLVS++Y++R +G+ R GD K F+ CI EC E
Sbjct: 205 VERLVSWFYYIRSPWHFGER------RNAFGDAIPLPDIDWLKKDFNRCIEERDPECVYE 258
Query: 167 NMWL--------QVPFLCGHAAACWVPGNPW-ALEKAKENLVTKYLLVGVTEELTDFVSL 217
M + Q FLCG A +P N A+++AK+N+ Y +VG E+ +S+
Sbjct: 259 QMEMGNLGDHRRQSLFLCGQNMAVCMPFNSHEAMQRAKKNVEKHYAVVGTWEDTNTTLSV 318
Query: 218 LEAALPSFFRGGTDHFLTSNK 238
LE +P FF G D + K
Sbjct: 319 LEGYIPRFFGGAKDEYYALRK 339
>gi|313228473|emb|CBY23624.1| unnamed protein product [Oikopleura dioica]
Length = 334
Score = 88.2 bits (217), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 62/221 (28%), Positives = 109/221 (49%), Gaps = 24/221 (10%)
Query: 71 SLADQYRFVNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYF 130
S A N+T+ ++ P LY H FIDF ++G K+ PL+INI++ P+ +S YY+
Sbjct: 112 SRARTSEIAQNITRLKE--PMLYIRHIHFIDFPRYGFKD-PLYINIVKDPVKLFISGYYY 168
Query: 131 LRYGDNYRPHLV----RKKHGDKT---TFDECIRLNRTECSLENMWLQ-VPFLCGHAAAC 182
R+G + R K D+ T DEC+R EC+ W + +PF CGH+
Sbjct: 169 RRFGFEGASRISAEKWRIKMTDEVRAMTLDECVRTKAPECA--RPWSKLIPFFCGHSRQ- 225
Query: 183 WVPGNPWALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGG----TDHFLTSNK 238
+ A+ AK+ ++ +Y VG+ E++ + + E P+FF G TD +
Sbjct: 226 ---RDDRAVAIAKQKVIERYAFVGIVEDMDNTMKAFEVVAPNFFAGAFNLLTDEVEVKGR 282
Query: 239 SHLRRTNRKIDPSEETVQQIKKSKIWELENELYEYALEQFH 279
+ ++ + S T++ ++ + E ELY++ + F+
Sbjct: 283 KSSKTAHKDV-TSNSTLEYLRNH--LKREYELYDFIKKTFY 320
>gi|195022681|ref|XP_001985620.1| GH14407 [Drosophila grimshawi]
gi|193899102|gb|EDV97968.1| GH14407 [Drosophila grimshawi]
Length = 421
Score = 87.4 bits (215), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 70/279 (25%), Positives = 127/279 (45%), Gaps = 39/279 (13%)
Query: 30 IIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWRDR- 88
I +NRVPK GS S + + + + F +H G+ + Q + + R
Sbjct: 121 IFFNRVPKVGSQSLMELMRRLGKINGF--VHARNPGSVKESIMMRQEALKDLIADLLTRQ 178
Query: 89 RPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLR----YGDNYRPHLVRK 144
+P +Y H +++F +F +P++IN++R P++R++S++Y++R Y D +
Sbjct: 179 KPHVYSQHIAYVNFTRF-HMPRPIYINLVRDPIERIISWHYYIRARWYYND------MNA 231
Query: 145 KHGDKTT-----------FDECIRLNRTECSLENMWLQVP---------FLCGHAAACWV 184
K G K+ D C+R C+ E M + P F CG +
Sbjct: 232 KLGPKSVKMPPDEFLNLDLDTCVRNKDPYCTFEQMQMNNPVGDHRRQTLFFCGMNKKLCM 291
Query: 185 PGNP-WALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTS--NKSHL 241
P N A++KAK + + Y +VG E+ +++LE +P +FR + N S +
Sbjct: 292 PFNSKMAMQKAKRTVESDYAVVGTWEDTNITLTVLEHYIPRYFRNAKVAYYLGKENLSRI 351
Query: 242 RRTNRKIDPSEETVQQIKKSKIWELENELYEYALEQFHF 280
R N S+ET Q ++ + E+ E YE+ ++ +
Sbjct: 352 NRNNVTRIVSDETRQILRTNLTNEI--EFYEFCKQRLYL 388
>gi|195022653|ref|XP_001985614.1| GH14414 [Drosophila grimshawi]
gi|193899096|gb|EDV97962.1| GH14414 [Drosophila grimshawi]
Length = 306
Score = 87.4 bits (215), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 67/275 (24%), Positives = 128/275 (46%), Gaps = 34/275 (12%)
Query: 29 VIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLAD--QYRFVNNVTKWR 86
++ +NRVPK GS +F+ + + + F H + + LA+ Q +++
Sbjct: 19 LVFFNRVPKVGSQTFMELLRRLSERNNFQ-FHRDAVQKVETIRLAEEQQQEMAEVISELP 77
Query: 87 DRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLR-----------YGD 135
+ P+++ H F +F +F P+++N++R P++R++S++Y++R + D
Sbjct: 78 E--PSVFIKHVCFTNFTKFNLP-TPIYLNVVRDPVERVISWFYYVRAPWYFVERKAAFPD 134
Query: 136 NYRPHLVRKKHGDKTTFDECIRLNRTECS---------LENMWLQVPFLCGHAAACWVPG 186
PH K F+ C+ EC+ + + Q F CGH C
Sbjct: 135 LPLPH----PAWLKKDFETCVLSGDQECTYTQGVTVEGIGDHRRQSLFFCGHDYECTPFN 190
Query: 187 NPWALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNK--SHLRRT 244
ALE+AK + +Y +VGV E+ +S+LE +P FF G D + S + + + +
Sbjct: 191 TVGALERAKFAVEQQYAVVGVLEDFNTTLSVLEKYVPRFFDGVRDIYANSAEYLTKINKN 250
Query: 245 NRKIDPSEETVQQIKKSKIWELENELYEYALEQFH 279
N K SE ++++ E+ E Y++ ++ H
Sbjct: 251 NFKPPVSEHVKDIVRRNFTNEI--EFYQFCRQRLH 283
>gi|24666714|ref|NP_730402.1| pipe, isoform C [Drosophila melanogaster]
gi|23093146|gb|AAN11662.1| pipe, isoform C [Drosophila melanogaster]
gi|317008635|gb|ADU79245.1| LP12067p [Drosophila melanogaster]
Length = 403
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 57/225 (25%), Positives = 110/225 (48%), Gaps = 17/225 (7%)
Query: 29 VIIYNRVPKTGSTSFVNMAYDMCRKKRF-NVLHVNVTGNNHVLSLADQYRFVNNVTKWRD 87
+I YNRVPKTGS + + + + +K F N + Q + + + ++
Sbjct: 124 IIFYNRVPKTGSETLIELMIQLGKKNDFQNERSPFSKPTGMYWDVKRQKQEATRILELQE 183
Query: 88 RRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYG-DNYRPHLVRKKH 146
+Y H +++ + F QP++IN++R P++R++S++Y+ R ++ + + V K
Sbjct: 184 EPAFVYVEHMNYMNIRPF-HLPQPIYINMIRDPVERVISWFYYKRTPWNSVKMYKVTGKF 242
Query: 147 GDKT----TFDECIRLNRTECSLENMWL----------QVPFLCGHAAACWVPGNPWALE 192
++T F+EC+ + EC + L Q F CGH+ C P A+
Sbjct: 243 QNRTHYTKNFEECVLTHDPECRYDYGLLFKDDSADHKRQSLFFCGHSPICEPFNTPAAIA 302
Query: 193 KAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSN 237
+AK+N+ + +VG E+ +++LE +P FF+G + + N
Sbjct: 303 RAKQNVERDFSVVGSWEDTNVTLTVLEHYIPRFFKGTMELYYEPN 347
>gi|14209819|gb|AAK56855.1|AF263993_1 pipe sulfotransferase box 3 isoform [Drosophila melanogaster]
Length = 403
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 57/225 (25%), Positives = 110/225 (48%), Gaps = 17/225 (7%)
Query: 29 VIIYNRVPKTGSTSFVNMAYDMCRKKRF-NVLHVNVTGNNHVLSLADQYRFVNNVTKWRD 87
+I YNRVPKTGS + + + + +K F N + Q + + + ++
Sbjct: 124 IIFYNRVPKTGSETLIELMIQLGKKNDFQNERSPFSKPTGMYWDVKRQKQEATRILELQE 183
Query: 88 RRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYG-DNYRPHLVRKKH 146
+Y H +++ + F QP++IN++R P++R++S++Y+ R ++ + + V K
Sbjct: 184 EPAFVYVEHMNYMNIRPF-HLPQPIYINMIRDPVERVISWFYYKRTPWNSVKMYKVTGKF 242
Query: 147 GDKT----TFDECIRLNRTECSLENMWL----------QVPFLCGHAAACWVPGNPWALE 192
++T F+EC+ + EC + L Q F CGH+ C P A+
Sbjct: 243 QNRTHYTKNFEECVLTHDPECRYDYGLLFKDDSADHKRQSLFFCGHSPICEPFNTPAAIA 302
Query: 193 KAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSN 237
+AK+N+ + +VG E+ +++LE +P FF+G + + N
Sbjct: 303 RAKQNVERDFSVVGSWEDTNVTLTVLEHYIPRFFKGTMELYYEPN 347
>gi|443719674|gb|ELU09723.1| hypothetical protein CAPTEDRAFT_215902, partial [Capitella teleta]
Length = 162
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 48/126 (38%), Positives = 67/126 (53%), Gaps = 4/126 (3%)
Query: 108 KEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKHGDKTTFDECIRLNRTECSLEN 167
K P FIN++R P++ LVS+Y+F +G P L K D C+R C N
Sbjct: 3 KVAPSFINVIRDPIEGLVSHYFFNAFGSKNAP-LSTPKPPYNMPLDACVRHRLGHCM--N 59
Query: 168 MWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFR 227
+ +PF CGH AC NP +L +AK + YL+VG+TE+L F+ LE +P FFR
Sbjct: 60 IHKLIPFFCGHDKACR-SSNPSSLARAKRAVKESYLIVGLTEDLHAFMESLETLMPQFFR 118
Query: 228 GGTDHF 233
+ F
Sbjct: 119 NASAVF 124
>gi|449139045|gb|AGE89853.1| pipe, partial [Ceratitis capitata]
Length = 240
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 58/225 (25%), Positives = 106/225 (47%), Gaps = 30/225 (13%)
Query: 29 VIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLAD--QYRFVNNVTKWR 86
++ +NRVPK GS +F+ + + + F H + + LAD Q ++
Sbjct: 19 LVFFNRVPKVGSQTFMELLRRLSERNNFQ-FHRDAVQKVETIRLADDQQQELAEVISDLP 77
Query: 87 DRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLR-----------YGD 135
+ P+++ H + +F +F P+++N++R P++R++S++Y++R + D
Sbjct: 78 E--PSVFIKHVCYTNFTKFNLP-MPIYVNVVRDPIERVISWFYYVRAPWYFVERKAAFPD 134
Query: 136 NYRPHLVRKKHGDKTTFDECIRLNRTECS---------LENMWLQVPFLCGHAAACWVPG 186
PH K F+ C+ EC+ + + Q F CGHA C
Sbjct: 135 LPLPHPAWLKKD----FETCVLSGDQECTYTQGVTVEGIGDHRRQSLFFCGHAYECTPFN 190
Query: 187 NPWALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTD 231
ALE+AK + ++Y +VGV E++ +S+ E +P FF G D
Sbjct: 191 TVGALERAKFAVESQYAVVGVLEDMNTTLSVFEKYIPRFFEGVRD 235
>gi|195435610|ref|XP_002065773.1| GK19608 [Drosophila willistoni]
gi|194161858|gb|EDW76759.1| GK19608 [Drosophila willistoni]
Length = 616
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 75/294 (25%), Positives = 132/294 (44%), Gaps = 46/294 (15%)
Query: 29 VIIYNRVPKTGSTSFVNMAYDMCRKKRFN--------VLHVNVTGNNHVLSLADQYRFVN 80
++ +NR K GS + + + + ++FN LH N + D FV
Sbjct: 27 ILFFNRAAKVGSEAMLELFIAL---EKFNDDLTLERSGLHQNAVRQLNKSRQRDAANFVA 83
Query: 81 NVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPH 140
+ + +Y H ++DF +F +P++IN++R P++R++S++Y++R +Y+
Sbjct: 84 ELDE-----GTMYIEHINWLDFDEFDLP-KPIYINMVRDPVERVISWFYYVR--SSYKNA 135
Query: 141 LVRKKHGD---------KTTFDECIRLNRTECS---------LENMWLQVPFLCGHAAAC 182
+ +K + K F++C+R EC N Q F CGH C
Sbjct: 136 IEYRKFPNRKIKPATWYKKNFNDCVRNGDPECQYIPHTVKDEFSNFKRQSLFYCGHHDDC 195
Query: 183 WVPGNPWALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLR 242
+P A++ AKE++ Y +VG E+ +++LE +P FFRG + +NK
Sbjct: 196 IPFNSPTAIQMAKEHVERDYAVVGSWEDTNITLTVLEQYIPRFFRGAKLMYEMNNKQITN 255
Query: 243 RTNRKIDP--SEETVQQIKKSKIWELE------NELY-EYALEQFHFVKKHNLV 287
R K P E I+++ E E LY +Y H ++KHNL+
Sbjct: 256 RNKNKRKPFIEPEVKDLIRRNFTHEYEFYHFCKQRLYKQYLALNLHELEKHNLL 309
Score = 77.8 bits (190), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 54/220 (24%), Positives = 104/220 (47%), Gaps = 24/220 (10%)
Query: 30 IIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNV-TGNNHVLSLADQYRFVNNVTKWRDR 88
+ + R K GS + + + + F + H + + L Q + ++ +
Sbjct: 341 MFFTRCAKVGSEALIEFMHHLKDLNSFQIDHTGLRKASQRQLRPMAQAQTAAHI--YNQG 398
Query: 89 RPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKHGD 148
++Y H +IDF Q+ +P++IN++R P++R++S+YY++R ++YR + +K+
Sbjct: 399 EGSVYVEHLPWIDFNQYNLP-KPIYINLVRDPVERMISWYYYVR--NSYRNAIYFRKNPL 455
Query: 149 ---------KTTFDECIRLNRTECSLENMWL---------QVPFLCGHAAACWVPGNPWA 190
K +F++C+R EC M + Q F CGH C +P A
Sbjct: 456 APLKPVAWFKKSFNDCVRSGDLECQYIPMTVHDTEGNFKRQSLFFCGHHQDCLPFNSPLA 515
Query: 191 LEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGT 230
++ AK + +Y +VG EE +++LE +P +F T
Sbjct: 516 VQMAKRRVDEEYAVVGTWEETNITLTVLEHYVPRYFARAT 555
>gi|167533393|ref|XP_001748376.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163773188|gb|EDQ86831.1| predicted protein [Monosiga brevicollis MX1]
Length = 318
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 68/213 (31%), Positives = 103/213 (48%), Gaps = 28/213 (13%)
Query: 90 PALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYG-----DNYRPHL-VR 143
P LY H ++DF++ G + P +IN++R+P+ RLVS YY+ G + YR + R
Sbjct: 93 PVLYDQHTRYLDFERHGVQPAPAYINMVREPVARLVSLYYYKLRGSYPKREQYRAAVNAR 152
Query: 144 KKHGDKT---TFDECIRLNRTEC-----SLENMWLQVPFLCGHAAACWVPGNPWALEKAK 195
G + DEC+ + C S+E+ L F CGHA C G L KA+
Sbjct: 153 GPTGTVAADWSVDECLS-HPQHCDLLGKSVEHNNLMTGFFCGHAPECQDFGE-ATLRKAQ 210
Query: 196 ENLVTKYLLVGVTEELTDFVSLLEAALPSFFRG-----GTDHF----LTSNKSHLRRTNR 246
NL KY+ VG+ EE ++L LP F+ G G D L+ S R T+
Sbjct: 211 SNL-DKYVAVGLAEEFDTSMALFARLLPDFYGGPEAAEGPDKVPADRLSEATSLNRNTHA 269
Query: 247 KIDPSEETVQQIKKSKIWELENELYEYALEQFH 279
PS +Q+ ++++ ELY YA+ +F+
Sbjct: 270 GPPPSAAGYRQLAALALYDI--ELYRYAVARFY 300
>gi|195352371|ref|XP_002042686.1| GM14871 [Drosophila sechellia]
gi|194124570|gb|EDW46613.1| GM14871 [Drosophila sechellia]
Length = 237
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 61/212 (28%), Positives = 108/212 (50%), Gaps = 28/212 (13%)
Query: 92 LYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKHGDKTT 151
+Y H +I+F +F +P++IN++R P+DR++S++Y++R YR ++ K GDK
Sbjct: 1 IYSQHIAYINFTRF-HLPKPIYINLIRDPIDRIISWHYYIRAPWYYRD--MQAKLGDKAI 57
Query: 152 -----------FDECIRLNRTECSLENMWLQVP---------FLCGHAAACWVPGNP-WA 190
D C+R + C+ M ++ P F CG +P N A
Sbjct: 58 PMPSEEFMNLDLDTCVRNHDPHCTFTQMQIKNPVGDHRRQTLFFCGMNQKLCMPFNSEAA 117
Query: 191 LEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGG-TDHFLTSNK-SHLRRTNRKI 248
++KAK + T+Y +VG E+ +S+LEA +P FFR ++L ++ S + R N
Sbjct: 118 MQKAKRTVETEYAVVGTWEDTNITLSVLEAYIPRFFRNAKVAYYLGKDRLSRVNRNNVTR 177
Query: 249 DPSEETVQQIKKSKIWELENELYEYALEQFHF 280
S+ET ++K+ E+ E YE+ ++ +
Sbjct: 178 IVSDETRLILRKNLTNEI--EFYEFCKQRLYL 207
>gi|322799531|gb|EFZ20839.1| hypothetical protein SINV_12448 [Solenopsis invicta]
Length = 326
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 76/297 (25%), Positives = 133/297 (44%), Gaps = 51/297 (17%)
Query: 29 VIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHV-LSLADQYRFVNNVTKWRD 87
V+ +NRVPK GS +F+ + + + F+ V + L+ +Q V+ + +
Sbjct: 22 VLFFNRVPKVGSQTFMELLRRLSIRNAFSFNRDRVQRVETIRLAPIEQLHLARMVSSYSE 81
Query: 88 RRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYR------PHL 141
P++Y H F +F +F E P++INI+R P++R++S+YY++R Y P L
Sbjct: 82 --PSVYVKHVCFTNFTEFHLPE-PIYINIVRDPVERVISWYYYVRAPWYYVERKQIFPDL 138
Query: 142 -VRKKHGDKTTFDECIRLNRTEC---------SLENMWLQVPFLCGHAAAC--------- 182
+ + K F+ C+ EC + + Q F CGH+ C
Sbjct: 139 PLPDPNWLKKDFESCVLKGDRECRYLQGEIHEGIGDHRRQTLFFCGHSEKCTKKLFSNIV 198
Query: 183 --------------WVPGNPW----ALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPS 224
+ P+ ALE+AK + Y +VGV E++ +++LE +P
Sbjct: 199 ITVVEYDVLITLVHFFSERPFNTVGALERAKLAVEKHYAVVGVLEDINTTLTVLENYIPQ 258
Query: 225 FFRGGTDHFLTSNKSHLRRTNRKI--DPSEETVQQIKKSKIWELENELYEYALEQFH 279
FFRG TD + S R NR P E V+ + ++ + E E Y++ ++ +
Sbjct: 259 FFRGATDVYYDQVNS-FTRINRNFFKPPVSEEVKNLVRNN-FTREVEFYQFCKQRLY 313
>gi|195173242|ref|XP_002027402.1| GL20901 [Drosophila persimilis]
gi|194113254|gb|EDW35297.1| GL20901 [Drosophila persimilis]
Length = 477
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 77/293 (26%), Positives = 138/293 (47%), Gaps = 45/293 (15%)
Query: 10 HISSAK---SPSPETDSLSWDTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGN 66
H+++A+ +P + D+ I +NRV KTGS + + + ++ F +V G
Sbjct: 177 HLTAAQLNNTPRAQLDT------IFFNRVTKTGSEKMMELLKILGKRNGFEARR-DVEGF 229
Query: 67 NHVLSLADQY--RFVNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRL 124
V+ + Y F+++ Y H FIDF ++ P++IN++R P++RL
Sbjct: 230 YEVVLMDPTYAKNFLHSDV-LNSTGANTYTKHVAFIDFDLL-NEPWPIYINLVRDPIERL 287
Query: 125 VSYYYFLRYGDNYRPHLV---RKKHGDKTT----------FDECIRLNRTECSLENMWL- 170
VS++Y+ R P + R+ G+K F+ECI + EC E M +
Sbjct: 288 VSWFYYAR-----APWYLAERRETFGEKAVLPSVEWVRKDFNECIEEHDPECVYEQMEMG 342
Query: 171 -------QVPFLCGHAAACWVPGNPW-ALEKAKENLVTKYLLVGVTEELTDFVSLLEAAL 222
Q F CGH + +P N A+++AK N+ Y +VG E+ +++LE +
Sbjct: 343 NLGDHRRQSLFFCGHTTSVCMPFNSHEAMQRAKRNVEEHYAVVGTWEDTNTTLTVLEGYI 402
Query: 223 PSFFRGGTDHF--LTSNKSHLRRTNRKIDPSEETVQQIKKSKIWELENELYEY 273
P +F G + + L S ++ R N + SE+ + ++ E+ ELY++
Sbjct: 403 PKYFAGAKEEYYALRSRLGNVNRNNFRPTLSEKARALLARNLTREI--ELYQF 453
>gi|195377469|ref|XP_002047512.1| GJ11895 [Drosophila virilis]
gi|194154670|gb|EDW69854.1| GJ11895 [Drosophila virilis]
Length = 406
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 74/287 (25%), Positives = 130/287 (45%), Gaps = 43/287 (14%)
Query: 10 HISSAK---SPSPETDSLSWDTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGN 66
H++SA+ +P + D+L +NRVPK GS + + + + +F
Sbjct: 108 HLTSAQLNNTPRAQLDTL------FFNRVPKAGSEKLMELLKLLANRNKFQARR----DP 157
Query: 67 NHVL-SLADQYRFVNNVTKWRDRRPAL---YHGHFGFIDFQQFGSKEQPLFINILRKPLD 122
H+ ++ F N+ K + Y H F++F FG P++IN++R P++
Sbjct: 158 KHLFETILMDTGFARNLLKAEILNCSTANSYTKHVAFLNFAAFG-HPWPIYINLVRDPVE 216
Query: 123 RLVSYYYFLR-----------YGDNYRPHLVRKKHGDKTTFDECIRLNRTECSLENM--- 168
RLVS++Y+ R +G +Y+ + D F+ CI + EC E M
Sbjct: 217 RLVSWFYYARAPWYLADRVNAFGSDYKIPTLEWLQKD---FNRCILEHDPECVYEQMDTG 273
Query: 169 -----WLQVPFLCGHAAACWVPGNPW-ALEKAKENLVTKYLLVGVTEELTDFVSLLEAAL 222
Q F CG A +P N A+++AK N+ +Y +VG E+ +S+LEA +
Sbjct: 274 NLGDHRRQTLFFCGQQAELCMPFNSQKAMQQAKRNVENRYAVVGTWEDTNTTLSVLEAYI 333
Query: 223 PSFFRGGTDHF--LTSNKSHLRRTNRKIDPSEETVQQIKKSKIWELE 267
P +F G D + + + ++ R + SE+ I ++ E+E
Sbjct: 334 PRYFAGAKDVYYAMRRDMENVNRNTFRPTISEQARAVISRNLTQEIE 380
>gi|195022662|ref|XP_001985616.1| GH14412 [Drosophila grimshawi]
gi|193899098|gb|EDV97964.1| GH14412 [Drosophila grimshawi]
Length = 303
Score = 84.7 bits (208), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 62/241 (25%), Positives = 111/241 (46%), Gaps = 24/241 (9%)
Query: 9 IHISSAKSPSPETDSLSWDTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNH 68
+H+SS + + + + R K GS S + D+ F+V H + N+
Sbjct: 2 LHLSSLTPQQLNNTAKAEMDRLFFTRCAKVGSESLIEFMEDIQDINNFDVDHTGLRKINN 61
Query: 69 VLSLADQYRFVNNVTKWRDRRPA-LYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSY 127
+ + D+ P +Y H +IDF F +P+FIN++R P++R++S+
Sbjct: 62 RQRVPKTQ--IERALHIFDQEPGTVYVEHTSWIDFNAFNLP-KPIFINLVRDPVERMISW 118
Query: 128 YYFLRYGDNYRPHLVRKKHGD---------KTTFDECIRLNRTEC---------SLENMW 169
YY++R ++YR + +K+ K +F++C+R EC ++ N
Sbjct: 119 YYYVR--NSYRNAIYYRKNPKAPLKPTAWFKKSFNDCVRSGDPECQYVPFSIKETVGNYK 176
Query: 170 LQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGG 229
Q F CGH C +P A++ AK + +Y +VG EE +++LE +P +F
Sbjct: 177 RQSLFFCGHDRDCLPFDSPLAIQIAKRRVEEEYAVVGTWEETNITLTVLEHYIPRYFALA 236
Query: 230 T 230
T
Sbjct: 237 T 237
>gi|195496282|ref|XP_002095627.1| GE22507 [Drosophila yakuba]
gi|194181728|gb|EDW95339.1| GE22507 [Drosophila yakuba]
Length = 821
Score = 84.3 bits (207), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 67/282 (23%), Positives = 130/282 (46%), Gaps = 29/282 (10%)
Query: 16 SPSPETDSLSWDTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNN-HVLSLAD 74
+P E D V+ +NRVPKTGS + + + + + V G +L A+
Sbjct: 520 TPKAEID------VLFFNRVPKTGSMQLIELMRQLGKVHDYEVEKDPQNGGVIPLLETAE 573
Query: 75 QYRFVNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYG 134
Q ++N+ D ++ H F++F + + +P+++N++R P++R++S+YY++R
Sbjct: 574 QSDLIDNIVNLEDG--TVFASHVNFLNFTK-HEQPRPIYVNMVRDPVERVISWYYYIRAP 630
Query: 135 DNYRPHLVRKKHGD------KTTFDECIRLNRTECS-LENMWL--------QVPFLCGHA 179
+ P R T F++C+ C+ +EN L Q F CGH
Sbjct: 631 WVFVPGRRRNNREMPNPKWVNTEFEQCVTSGEKVCTYIENSLLEHVGDHRRQTLFFCGHN 690
Query: 180 AACWVPGNP-WALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNK 238
P N L+ AK N+ +Y +VG E + +++LEA +P +F + + +
Sbjct: 691 EFQCTPFNARLPLQLAKMNVEREYSVVGTWEHTNETLAVLEAYVPRYFADASKMYYSGLH 750
Query: 239 SHLRRTN-RKIDPSEETVQQIKKSKIWELENELYEYALEQFH 279
S + N K + + ++++ E+ E Y++ ++ H
Sbjct: 751 SDKQNVNPMKPHIPQNILDMVRRNFTREI--EFYQFCRQRLH 790
Score = 71.6 bits (174), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 50/208 (24%), Positives = 98/208 (47%), Gaps = 38/208 (18%)
Query: 68 HVLSLADQYRFVNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSY 127
H+L L ++ FV Y H +++ + F QP++IN++R P++R++S+
Sbjct: 36 HILDLQEEPAFV-------------YVEHMNYMNIRPFHLP-QPIYINMIRDPVERVISW 81
Query: 128 YYFLRYG-DNYRPHLVRKKHGDKT----TFDECIRLNRTECSLENMWL----------QV 172
+Y+ R ++ + + V + ++T F+EC+ + EC + + Q
Sbjct: 82 FYYRRTPWNSVKMYEVTGEFQNRTFYTKNFEECVLTHDVECRYDYGLMFKDEFADHKRQS 141
Query: 173 PFLCGHAAACWVPGNPWALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTD- 231
F CGH+ C P A+ +AK+N+ + +VG E+ +++LE +P FF+G +
Sbjct: 142 LFFCGHSPICEPFNTPAAIARAKQNVERDFSVVGSWEDTNVTLTVLEHYIPRFFKGSMEL 201
Query: 232 --------HFLTSNKSHLRRTNRKIDPS 251
F N +H + +DP+
Sbjct: 202 YYEPDTGLAFQKENINHWKPRLENLDPA 229
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 68/260 (26%), Positives = 122/260 (46%), Gaps = 24/260 (9%)
Query: 30 IIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVL-SLADQYRFVNNVTKWRDR 88
I YNR+ KTGS S + + + F V + + S D+ V + + +
Sbjct: 242 IFYNRLEKTGSQSMTRLIKQLGDRLGFETYRNIVRPSKSITESEEDENDLVEQLFELGEH 301
Query: 89 RPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVR----- 143
A+Y H +++F + S +P++IN++R P+ +++S YY+ R+ + L+R
Sbjct: 302 --AVYVEHANWVNFTKHESP-RPIYINMIRHPIQKVISAYYYQRHPLIFAQSLMRNPNKP 358
Query: 144 ---KKHGDKTTFDECIRLNRTE--CSLE------NMWLQVPF-LCGHAAACWVPGNPWAL 191
KK D TTF++C+R NR C + W + LCG++ C +
Sbjct: 359 MQNKKFFD-TTFNDCVR-NRVRPYCVFDAHNPFNGDWRRFSLHLCGNSEICTHFNSETTT 416
Query: 192 EKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPS 251
+ AK N+ +Y +VG E+ +++LEA +P +F T + T ++ T
Sbjct: 417 QIAKMNVEREYAVVGSWEDTNVTLAVLEAYIPRYFTDATKVYYTKTENFTINTVSHDTHL 476
Query: 252 EETVQQIKKSKIWELENELY 271
++ V++ KS + E ELY
Sbjct: 477 DKDVEEYLKSS-FSFEIELY 495
>gi|198463781|ref|XP_002135581.1| GA28235 [Drosophila pseudoobscura pseudoobscura]
gi|198151409|gb|EDY74208.1| GA28235 [Drosophila pseudoobscura pseudoobscura]
Length = 285
Score = 84.0 bits (206), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 59/215 (27%), Positives = 108/215 (50%), Gaps = 28/215 (13%)
Query: 89 RPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKHGD 148
+P +Y H +I+F +F +P++IN++R P+DR++S++Y++R YR ++ K GD
Sbjct: 45 KPHIYSQHIAYINFTRF-HLPRPIYINLVRDPIDRIISWHYYIRARWYYRD--MQAKLGD 101
Query: 149 KT-----------TFDECIRLNRTECSLENMWL---------QVPFLCGHAAACWVPGNP 188
K D C+R C+ M + Q F CG +P N
Sbjct: 102 KAPAMPSDEFLDMDLDTCVRNKDRHCTFNQMQIKNEAGDHRRQTLFFCGMNQKLCMPFNS 161
Query: 189 -WALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGG-TDHFLTSNK-SHLRRTN 245
A++KAK + ++Y +VG E+ +++LEA +P +FR ++L ++ S + R N
Sbjct: 162 EMAMQKAKRTVESEYAVVGTWEDTNITLAVLEAYIPRYFRNAKVAYYLGKDRLSRVNRNN 221
Query: 246 RKIDPSEETVQQIKKSKIWELENELYEYALEQFHF 280
S+ET ++K+ E+ E YE+ ++ +
Sbjct: 222 VTRIVSDETRLILRKNLTNEI--EFYEFCKQRLYL 254
>gi|195435618|ref|XP_002065777.1| GK19566 [Drosophila willistoni]
gi|194161862|gb|EDW76763.1| GK19566 [Drosophila willistoni]
Length = 410
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 60/226 (26%), Positives = 107/226 (47%), Gaps = 26/226 (11%)
Query: 29 VIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSL----ADQYRFVNNVTK 84
I +NRV KTGS + + + ++ F V + V+ + A + + + +
Sbjct: 126 TIFFNRVTKTGSEKMMELLKILAKRNDF-VARRDAEALYEVVIMDPVYAKNFLYTDVLNS 184
Query: 85 WRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLR----YGDNYR-- 138
RP Y H F+DF + P++IN++R P++RLVS++Y+ R D Y+
Sbjct: 185 ---SRPNSYTKHVAFLDFDAL-DEPWPIYINLVRDPIERLVSWFYYARAAWYIADRYQTF 240
Query: 139 --PHLVRKKHGDKTTFDECIRLNRTECSLENMWL--------QVPFLCGHAAACWVPGNP 188
+ + H + F+ CI + EC E M + Q + CG +P N
Sbjct: 241 GAAYQMPDIHWVRKDFNRCINEHDPECIYEQMEMGNLGDHRRQSLYFCGQENKVCMPFNS 300
Query: 189 W-ALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHF 233
A+++AK+N+ +Y +VG E+ +++LE +P FF+G D +
Sbjct: 301 HEAMQRAKQNVEERYAVVGTWEDTNVTLTVLEGYVPRFFKGAFDEY 346
>gi|195128105|ref|XP_002008506.1| GI13541 [Drosophila mojavensis]
gi|193920115|gb|EDW18982.1| GI13541 [Drosophila mojavensis]
Length = 284
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 63/239 (26%), Positives = 112/239 (46%), Gaps = 26/239 (10%)
Query: 30 IIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNN-HVLSLADQYRFVNNVTKWRDR 88
I + R K GS S + D+ F V ++ + + +L+ Q + ++ +
Sbjct: 4 IFFTRCAKVGSESLLEFMEDLQDVNNFEVDYLGLKRSGPRILTPKQQSKRARHI--FNQA 61
Query: 89 RPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKHGD 148
+Y H +IDF + +P+FIN++R P++R++S+YY++R ++Y + +KH
Sbjct: 62 PGTVYIEHTSWIDFHHYNLP-KPIFINLVRDPVERMISWYYYVR--NSYLNAIFYRKHPT 118
Query: 149 KT---------TFDECIRLNRTECS---------LENMWLQVPFLCGHAAACWVPGNPWA 190
T +F++C+R EC + N Q F CGH C +P A
Sbjct: 119 ATIKPVDWYKKSFNDCVRNGDAECQYVPETVKDYVGNYKRQSLFFCGHDRDCLPFDSPLA 178
Query: 191 LEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKS--HLRRTNRK 247
++ AK + +Y +VG EE +++LE +P +F T + KS + R NRK
Sbjct: 179 IQIAKRRVEEEYAVVGTWEETNITLTVLEHYIPRYFARATKLYPLYQKSLQNRNRNNRK 237
>gi|28574841|ref|NP_788532.1| pipe, isoform G [Drosophila melanogaster]
gi|28380473|gb|AAO41228.1| pipe, isoform G [Drosophila melanogaster]
Length = 407
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 67/266 (25%), Positives = 119/266 (44%), Gaps = 37/266 (13%)
Query: 29 VIIYNRVPKTGSTSFVNMAY-------DMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNN 81
V+ +NR K GS S + + D+ ++R LH + FV +
Sbjct: 124 VLFFNRAAKVGSESMLELFMALEKYNDDLTLERR--GLHTRTVRQMDKKQRRESAEFVAD 181
Query: 82 VTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHL 141
+ + +Y H ++DF +F +P++IN++R P++R++S++++ R +Y+ +
Sbjct: 182 LEEG-----TMYIEHINWLDFDEFDLP-KPIYINLVRDPVERVISWFFYAR--SSYKNAI 233
Query: 142 VRKKHGD---------KTTFDECIRLNRTEC---------SLENMWLQVPFLCGHAAACW 183
+K + K F++C+R EC S+ N Q F CGH C
Sbjct: 234 EYRKRPNQKIKPESWYKKNFNDCVRSGDPECQYVPHTVKDSIANFKRQSLFYCGHHDDCL 293
Query: 184 VPGNPWALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRR 243
+P A++ AKE++ Y +VG E+ +++LE +P FFRG + N R
Sbjct: 294 PFNSPTAVQMAKEHVERDYAVVGSWEDTNITLTVLENYIPRFFRGAKLMYEMHNSKITNR 353
Query: 244 TNRKIDP--SEETVQQIKKSKIWELE 267
K P E + I+K+ E E
Sbjct: 354 NKNKRKPFVEPEVKEMIRKNFTNEYE 379
>gi|195022667|ref|XP_001985617.1| GH14411 [Drosophila grimshawi]
gi|193899099|gb|EDV97965.1| GH14411 [Drosophila grimshawi]
Length = 314
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 77/293 (26%), Positives = 127/293 (43%), Gaps = 44/293 (15%)
Query: 29 VIIYNRVPKTGSTSFVNMAYDMCRKKRFNVL---HVNVTGNNHVLSLADQYRFVNNVTKW 85
V+ +NRV K GS + + + FN L H ++ LS Q + + +
Sbjct: 31 VLFFNRVAKVGSEALLEL---------FNSLVEYHDDLELERSGLSAKTQRQLKKPMQRE 81
Query: 86 RDRRPA------LYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLR----YGD 135
A +Y H ++DF +F QP++IN++R P++R++S++Y+ R
Sbjct: 82 AAEFVADLEEGTVYIRHINWLDFAEFDLP-QPIYINLVRDPVERVISWFYYARGSYLNAI 140
Query: 136 NYRPH---LVRKKHGDKTTFDECIRLNRTECS---------LENMWLQVPFLCGHAAACW 183
YR +R + K F++C+R EC + N Q F CGH C
Sbjct: 141 EYRKQPNKEIRPESWYKKNFNDCVRSGDPECQYVPHTVKDFMPNFKRQSLFYCGHHDDCI 200
Query: 184 VPGNPWALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRR 243
+P A++ AKE++ Y +VG E+ +++ E +P FFRG F N R
Sbjct: 201 PFNSPAAIQMAKEHVERDYAVVGSWEDTNITLAVFEGYIPRFFRGAKLLFEMHNDKITNR 260
Query: 244 TNRKIDP--SEETVQQIKKSKIWELE------NELY-EYALEQFHFVKKHNLV 287
K P E + I+++ E E LY +Y H ++KH L+
Sbjct: 261 NKNKRKPYIEPEVKELIRRNFTNEYEFYYFCKQRLYKQYLALNLHELEKHGLL 313
>gi|194751652|ref|XP_001958139.1| GF10769 [Drosophila ananassae]
gi|190625421|gb|EDV40945.1| GF10769 [Drosophila ananassae]
Length = 241
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 64/225 (28%), Positives = 107/225 (47%), Gaps = 26/225 (11%)
Query: 29 VIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKW-RD 87
V+ +NR K GS + + + M FN + V G + S R W D
Sbjct: 20 VVFFNRGAKVGSEALMQLTQTMAP---FNNMTVVTKGPLEINSRTRAPREQMIQAIWVND 76
Query: 88 RRPA-LYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKH 146
P LY H +++F+++ K P++IN++R P++R+VS+YY++R +YR + +K+
Sbjct: 77 LDPGTLYIEHCNWLNFRRYQLK-MPIYINLVRDPVERMVSWYYYVR--SSYRNAIFFRKN 133
Query: 147 GDKT---------TFDECIRLNRTEC---------SLENMWLQVPFLCGHAAACWVPGNP 188
+ T +++C+R EC + N Q F CGH C +
Sbjct: 134 PNATIKAESWYKKNYNDCVRSGDPECQYLPGSVKETEGNYKRQSLFFCGHNRECLPFDSH 193
Query: 189 WALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHF 233
A++ AK N+ Y +VG EE +++LEA +P FF+G F
Sbjct: 194 RAIQLAKINVERDYAVVGTWEETNITLAVLEAYIPRFFKGARQIF 238
>gi|195377465|ref|XP_002047510.1| GJ11897 [Drosophila virilis]
gi|194154668|gb|EDW69852.1| GJ11897 [Drosophila virilis]
Length = 299
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 68/283 (24%), Positives = 137/283 (48%), Gaps = 22/283 (7%)
Query: 16 SPSPETDSLSWDT-VIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTG--NNHVLSL 72
+P ++LS ++ ++ +NRVP+TG+ + + + + F + H + NH L++
Sbjct: 8 NPKHLNNTLSTNSELLFFNRVPRTGAKTLIELLSRLGELHNFILEHTPFSRPIANH-LTV 66
Query: 73 ADQYRFVNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLR 132
Q V++ +Y G+IDF+ + QP+++N++R P+++++S+YY R
Sbjct: 67 KQQLALGQYVSELGQSSAFVYVEPVGYIDFRTYNFP-QPIYVNMVRDPVEKIISWYYHKR 125
Query: 133 YGDN-YRPHLVRKKHGDK----TTFDECIRLNRTECSLE----------NMWLQVPFLCG 177
N R + + K + +F++C+ EC + + Q F CG
Sbjct: 126 TPWNALRMYKITGKFQKRDFYTKSFEDCVLTGDPECRYDYAMGFQNDSGDHKRQSLFFCG 185
Query: 178 HAAACWVPGNPWALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRG-GTDHFLTS 236
HA C P A+ +AK+N+ + +VG E++ +++LE +P FFRG +F T
Sbjct: 186 HAPICEPFNIPAAIARAKQNVERHFAVVGSWEDVNVTLAVLEHYIPRFFRGVKYLYFATE 245
Query: 237 NKSHLRRTNRKIDPSEETVQQIKKSKIWELENELYEYALEQFH 279
+ L + N E ++ + ++ + LE E Y + ++ +
Sbjct: 246 HSLALPQRNHWKPKIGEHIKSLVRAN-FTLEYEFYYFCKQRLY 287
>gi|194751642|ref|XP_001958134.1| GF10764 [Drosophila ananassae]
gi|190625416|gb|EDV40940.1| GF10764 [Drosophila ananassae]
Length = 407
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 62/226 (27%), Positives = 107/226 (47%), Gaps = 26/226 (11%)
Query: 29 VIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQY--RFVNNVTKWR 86
I +NRV KTGS + + + ++ F V + G V+ + Y F+++
Sbjct: 124 TIFFNRVTKTGSEKMMELLKILGKRNDF-VARRDAEGLYEVVIMDPVYAKNFIHSDI-LN 181
Query: 87 DRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKH 146
+ + Y H FIDF + P++IN++R P++RLVS++Y+ R + R+
Sbjct: 182 ETKANTYTKHVAFIDFTAL-DEPWPIYINMVRDPIERLVSWFYYARAPWYWAER--RETF 238
Query: 147 GD----------KTTFDECIRLNRTECSLENMWL--------QVPFLCGHAAACWVPGNP 188
GD K F++CI + EC E M + Q F CG +P N
Sbjct: 239 GDAIQMPDINWLKKDFNQCIEEHDPECVYEQMEMGNLGDHRRQSLFFCGMRTDICMPFNS 298
Query: 189 W-ALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHF 233
A+++AK+N+ +Y +VG E+ +++LE +P FF G + +
Sbjct: 299 HEAMQRAKKNVEKRYAVVGTWEDTNTTLTVLEGYIPRFFEGAKEEY 344
>gi|194751650|ref|XP_001958138.1| GF10768 [Drosophila ananassae]
gi|190625420|gb|EDV40944.1| GF10768 [Drosophila ananassae]
Length = 851
Score = 81.6 bits (200), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 67/265 (25%), Positives = 122/265 (46%), Gaps = 38/265 (14%)
Query: 30 IIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNN-------HVLSLADQYRFVNNV 82
+++ R K GS SF+ + M + N V+ G + AD ++ N
Sbjct: 571 LVFTRCAKVGSESFMEL---MEHLEIINNYRVDKVGTHKKSKRQLEPQGQADLAGYIYNS 627
Query: 83 TKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLV 142
+ ++Y H +IDF + +P+FIN++R P++R++S+YY++R ++YR +
Sbjct: 628 DEG-----SVYVEHVPWIDFNAYNLP-KPIFINLVRDPVERMISWYYYVR--NSYRNAIY 679
Query: 143 RKKHGD---------KTTFDECIRLNRTEC---------SLENMWLQVPFLCGHAAACWV 184
+++ K +++EC+R EC ++ N Q F CGH C
Sbjct: 680 YRRNPLAPLKPTAWFKKSYNECVRSGDPECQYIPMSVRDAVPNFKRQTIFFCGHDPDCLP 739
Query: 185 PGNPWALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKS--HLR 242
+P AL+ AK + +Y +VG EE +++LE +P +F F KS +
Sbjct: 740 FDSPLALQMAKRRVEKEYAVVGTWEETNITLTVLEHYIPRYFSRAQIIFHMYQKSLTNRN 799
Query: 243 RTNRKIDPSEETVQQIKKSKIWELE 267
R NRK ++ ++++ E E
Sbjct: 800 RNNRKPQVDDDVRAMVRRNLTHEYE 824
Score = 77.8 bits (190), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 56/228 (24%), Positives = 109/228 (47%), Gaps = 22/228 (9%)
Query: 69 VLSLADQYRFVNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYY 128
+L A++ + N+ D + + H F++F + + +P++IN++R P++R++S+Y
Sbjct: 30 ILEPAEESDMIENIASLED--GSAFASHVNFLNFTK-ADQPRPIYINMVRDPVERVISWY 86
Query: 129 YFLRYGDNYRPHLVRKKHGD------KTTFDECIRLNRTECS-LENMWL--------QVP 173
Y++R + P R T FD+C+ C+ +EN L Q
Sbjct: 87 YYIRAPWVFVPGRRRNNREMPNPKWVNTEFDQCVLSGEKVCTYIENSMLEHVGDHRRQTL 146
Query: 174 FLCGHAAACWVPGNP-WALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDH 232
F CGH A P N L+ AK N+ +Y +VG E + +++LEA +P FF +
Sbjct: 147 FFCGHNEAKCTPFNGRLPLQIAKMNVEREYSVVGTWEHTNETLAVLEAYVPRFFADASKM 206
Query: 233 FLTSNKSHLRRTN-RKIDPSEETVQQIKKSKIWELENELYEYALEQFH 279
+ + S ++ +N K + + ++++ E+ E Y++ + H
Sbjct: 207 YYSGLHSDVQNSNPMKPHIPQNIIDMVRRNFTREI--EFYQFCRQHLH 252
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 50/191 (26%), Positives = 87/191 (45%), Gaps = 34/191 (17%)
Query: 91 ALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKHGD-- 148
+Y H ++DF +F +P++IN++R P++R++S++++ R +Y+ + +K+ D
Sbjct: 346 TMYIEHINWLDFDEFDLP-KPIYINLVRDPVERVISWFFYAR--SSYKNAIEYRKNPDQK 402
Query: 149 -------KTTFDECIRLNRTECS---------LENMWLQVPFLCGHAAACWVPGNPWALE 192
K F+EC+R EC + N Q F CGH C
Sbjct: 403 IKPESWYKKNFNECVRSGDPECQYVPHTFRDPVANFKRQSLFYCGHHDDC---------- 452
Query: 193 KAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDP-- 250
KE++ Y +VG E+ +++LE +P FFRG + N R K P
Sbjct: 453 -IKEHVERDYAVVGSWEDTNVTLTVLERYIPRFFRGAKLMYEMHNNKITNRNKNKRKPYI 511
Query: 251 SEETVQQIKKS 261
E + I+K+
Sbjct: 512 EPEVKEMIRKN 522
>gi|229367490|gb|ACQ58725.1| Heparan sulfate 2-O-sulfotransferase 1 [Anoplopoma fimbria]
Length = 137
Score = 81.6 bits (200), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 39/62 (62%), Positives = 45/62 (72%), Gaps = 4/62 (6%)
Query: 16 SPSPETDSLSWDTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQ 75
SP E D D VIIYNRVPKT STSF N+AYD+C + RF+VLH+N T NN V+SL DQ
Sbjct: 66 SPVAEKD----DMVIIYNRVPKTASTSFTNIAYDLCGENRFHVLHINTTKNNPVMSLQDQ 121
Query: 76 YR 77
R
Sbjct: 122 VR 123
>gi|198463783|ref|XP_002135582.1| GA28234 [Drosophila pseudoobscura pseudoobscura]
gi|198151410|gb|EDY74209.1| GA28234 [Drosophila pseudoobscura pseudoobscura]
Length = 369
Score = 81.6 bits (200), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 77/291 (26%), Positives = 130/291 (44%), Gaps = 80/291 (27%)
Query: 10 HISSAK---SPSPETDSLSWDTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGN 66
H+++A+ +P + D+ I +NRV KTGS ++ +VL N TG
Sbjct: 108 HLTAAQLNNTPRAQLDT------IFFNRVTKTGS-------------EKXDVL--NSTGA 146
Query: 67 NHVLSLADQYRFVNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVS 126
N Y H FIDF ++ P++IN++R P++RLVS
Sbjct: 147 N------------------------TYTKHVAFIDFDLL-NEPWPIYINLVRDPIERLVS 181
Query: 127 YYYFLRYGDNYRPHLV---RKKHGDKTT----------FDECIRLNRTECSLENMWL--- 170
++Y+ R P + R+ G+K F+ECI + EC E M +
Sbjct: 182 WFYYAR-----APWYLAERRETFGEKAVLPSVEWVRKDFNECIEEHDPECVYEQMEMGNL 236
Query: 171 -----QVPFLCGHAAACWVPGNPW-ALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPS 224
Q F CGH + +P N A+++AK N+ Y +VG E+ +++LE +P
Sbjct: 237 GDHRRQSLFFCGHTTSVCMPFNSHEAMQRAKRNVEEHYAVVGTWEDTNTTLTVLEGYIPK 296
Query: 225 FFRGGTDHF--LTSNKSHLRRTNRKIDPSEETVQQIKKSKIWELENELYEY 273
+F G + + L S ++ R N + SE+ + ++ E+ ELY++
Sbjct: 297 YFAGAKEEYYALRSRLGNVNRNNFRPTLSEKARALLARNLTREI--ELYQF 345
>gi|198463777|ref|XP_002135579.1| GA28237 [Drosophila pseudoobscura pseudoobscura]
gi|198151407|gb|EDY74206.1| GA28237 [Drosophila pseudoobscura pseudoobscura]
Length = 307
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 64/267 (23%), Positives = 131/267 (49%), Gaps = 22/267 (8%)
Query: 30 IIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLS--LADQYRFVNNVTKWRD 87
+ +NR+ KTGS S + + + NV ++ +Q FV+ +T+ +
Sbjct: 30 VFFNRIEKTGSQSMTRLINQLGLLNGYETFR-NVIQPKRAMTENFEEQREFVHQLTELSE 88
Query: 88 RRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRY----GDNYRPH--- 140
P++Y H +++F +P++IN++R P+++++S YY+LR+ G + R +
Sbjct: 89 --PSVYVEHANWVNFT-VHDMPRPIYINLVRHPIEKVISAYYYLRHPKIVGQSVRRNPNK 145
Query: 141 LVRKKHGDKTTFDECIRLNRT-ECSLE------NMWLQVPF-LCGHAAACWVPGNPWALE 192
+V+ K F++C++ + C + W + LCG+A C + ++
Sbjct: 146 IVQDKTYYDMKFNDCVKQRISPHCVFDAHNRFNGDWRRFALKLCGNAQICEQLNSEATMQ 205
Query: 193 KAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSE 252
AK ++ +Y +VG EE +++LEA +P FF G T + + K T + +
Sbjct: 206 MAKMHVEREYSVVGTWEETNITLAVLEAYIPRFFAGATKVYYSQTKKFTVNTTPHDNSLD 265
Query: 253 ETVQQIKKSKIWELENELYEYALEQFH 279
E V++ K ++ E ELY++ +++ +
Sbjct: 266 EEVERYLKDS-FKFELELYQFIMQRLY 291
>gi|195022671|ref|XP_001985618.1| GH14410 [Drosophila grimshawi]
gi|193899100|gb|EDV97966.1| GH14410 [Drosophila grimshawi]
Length = 312
Score = 80.9 bits (198), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 74/292 (25%), Positives = 129/292 (44%), Gaps = 42/292 (14%)
Query: 29 VIIYNRVPKTGSTSFVNM------AYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNV 82
V+ +NRV K GS + + + +D +R + LH + FV ++
Sbjct: 29 VLFFNRVAKVGSEALLELFNSLVEYHDDLELER-SGLHEKTQRQLKKPMQREAAEFVADL 87
Query: 83 TKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLV 142
+ +Y H ++DF +F QP++IN++R P++R++S++Y+ R +Y+ +
Sbjct: 88 EEG-----TVYIRHINWLDFAEFDLP-QPIYINLVRDPVERVISWFYYAR--GSYKNAIE 139
Query: 143 RKKHGDKT---------TFDECIRLNRTECS---------LENMWLQVPFLCGHAAACWV 184
+K +K F++C+R EC + N Q F CGH C
Sbjct: 140 YRKQPNKEIKPESWYKKNFNDCVRSGDPECQYVPHTVKDFMPNFKRQSLFYCGHHDDCIP 199
Query: 185 PGNPWALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRT 244
+P A++ AKE++ Y +VG E+ +++ E +P FFRG F N R
Sbjct: 200 FNSPAAIQMAKEHVERDYAVVGSWEDTNITLAVFEGYIPRFFRGAKLLFEMHNDKITNRN 259
Query: 245 NRKIDP--SEETVQQIKKSKIWELE------NELY-EYALEQFHFVKKHNLV 287
K P E + I+++ E E LY +Y H ++KH L+
Sbjct: 260 KNKRKPYIEPEVKELIRRNFTNEYEFYYFCKQRLYKQYLALNLHELEKHGLL 311
>gi|47209725|emb|CAF94644.1| unnamed protein product [Tetraodon nigroviridis]
Length = 216
Score = 80.5 bits (197), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 53/149 (35%), Positives = 59/149 (39%), Gaps = 49/149 (32%)
Query: 66 NNHVLSLADQYRFVNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLV 125
NN VLSL DQ RFV NVT WR +PA YHGH + Q S P
Sbjct: 3 NNPVLSLQDQMRFVRNVTSWRQMKPAFYHGHVAYRSTQCVSSGRSP-------------- 48
Query: 126 SYYYFLRYGDNYRPHLVRKKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVP 185
T C E + L + CW
Sbjct: 49 -----------------------------------TACPGEALAADPQNLRPPISECWNT 73
Query: 186 GNPWALEKAKENLVTKYLLVGVTEELTDF 214
G+ WALE+AK NLV YLLVGVTEEL DF
Sbjct: 74 GSRWALEQAKYNLVNDYLLVGVTEELEDF 102
>gi|28574845|ref|NP_788530.1| pipe, isoform I [Drosophila melanogaster]
gi|28380471|gb|AAO41226.1| pipe, isoform I [Drosophila melanogaster]
Length = 358
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 56/223 (25%), Positives = 106/223 (47%), Gaps = 22/223 (9%)
Query: 29 VIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSL--ADQYRFVNNVTKWR 86
VI +NR K GS + + + M +N + V G + S + + + + +
Sbjct: 124 VIFFNRGAKVGSEALMELTQTMAP---YNNMTVVTKGPMDIKSRTRSPKEQMIQAIWVTE 180
Query: 87 DRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPH------ 140
+Y H ++DF+++ +P++IN++R P++R++S+YY++R G H
Sbjct: 181 LEPGTIYIEHCNWLDFRRY-QLPRPIYINLVRDPVERMISWYYYVRSGYRNAIHHRRFPN 239
Query: 141 -LVRKKHGDKTTFDECIRLNRTECSL---------ENMWLQVPFLCGHAAACWVPGNPWA 190
++ + K ++++C+R EC N Q F CGH+ C + A
Sbjct: 240 ATIKSEKWFKKSYNDCVRSGDPECQYVPGSIKDPEGNYKRQTLFFCGHSRECLPFDSQRA 299
Query: 191 LEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHF 233
++ AK N+ Y +VG EE +++ EA +P FF+G + F
Sbjct: 300 IQLAKLNVERDYAVVGTWEETNITLTVFEAYIPRFFKGVRNIF 342
>gi|313230520|emb|CBY18736.1| unnamed protein product [Oikopleura dioica]
Length = 542
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 61/233 (26%), Positives = 104/233 (44%), Gaps = 21/233 (9%)
Query: 30 IIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHV--NVTGNNHVLSLADQYRFVNNVTKWRD 87
I +N++PK+GS++ + ++ +K FN V N N+ +FV K
Sbjct: 270 IFHNKLPKSGSSTMNQLLRNLAKKNNFNFAKVEPNQIPNDRFDLEKPLVKFVQETKK--- 326
Query: 88 RRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYG----DNYRPHLVR 143
P H +F + G + QP F+N++R P+D S YYF R+G R V
Sbjct: 327 -EPYFLLKHHFHFNFTRHGLR-QPTFVNVIRDPVDWYTSQYYFRRFGWVQSTTTRDSFVG 384
Query: 144 KKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEK-----AKENL 198
+ + + D CI+ EC+ + + + +LCG+ C L+ AK N+
Sbjct: 385 SQEDRERSIDGCIQQGLMECT-KPSYKYIQYLCGNHPHCRTVDVSEELKAKASNLAKINV 443
Query: 199 VTKYLLVGVTEELTDFVSLLEAALPSFFRGGTD----HFLTSNKSHLRRTNRK 247
+ + VG+ E+ D + EA LP+++ G D L ++ + NRK
Sbjct: 444 LRNFYAVGILEQFVDTLKTFEAILPNYYSGVLDIWNSQMLQEKRNRTKTLNRK 496
>gi|195128103|ref|XP_002008505.1| GI13540 [Drosophila mojavensis]
gi|193920114|gb|EDW18981.1| GI13540 [Drosophila mojavensis]
Length = 304
Score = 79.7 bits (195), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 63/226 (27%), Positives = 110/226 (48%), Gaps = 34/226 (15%)
Query: 91 ALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKHGDKT 150
Y H ++DF +F P++IN++R P++R++S++Y+ R +Y+ + +K +K
Sbjct: 83 TFYIRHINWLDFSEFDLP-LPIYINMVRDPVERVISWFYYAR--SSYKNAIEYRKSPNKK 139
Query: 151 ---------TFDECIRLNRTECS---------LENMWLQVPFLCGHAAACWVPGNPWALE 192
F+EC+R EC + N Q F CGH C +P A++
Sbjct: 140 IKPESWYKKNFNECVRSGDPECQYVPHTVKDFIPNFKRQSLFYCGHHDDCLPFNSPTAVQ 199
Query: 193 KAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHF-LTSNKSHLRRTNRK---I 248
AKE++ Y +VG E+ +++LE +P FFRG + + +NK R N++ I
Sbjct: 200 MAKEHVERDYAVVGSWEDTNITLTVLERYIPRFFRGAKLMYEMHTNKITNRNKNKRKPYI 259
Query: 249 DPSEETVQQIKKSKIWELE------NELY-EYALEQFHFVKKHNLV 287
+P E + I+++ E E LY +Y H ++KH L+
Sbjct: 260 EP--EVKEMIRRNFTHEYEFYHFCKQRLYKQYLALNLHELQKHGLL 303
>gi|195128095|ref|XP_002008501.1| GI13536 [Drosophila mojavensis]
gi|193920110|gb|EDW18977.1| GI13536 [Drosophila mojavensis]
Length = 406
Score = 79.7 bits (195), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 54/197 (27%), Positives = 96/197 (48%), Gaps = 26/197 (13%)
Query: 93 YHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLR-----------YGDNYRPHL 141
Y H F++F +FG + P++IN++R P++RLVS++Y+ R +G ++
Sbjct: 189 YTKHMAFLNFTEFG-QPWPIYINLVRDPIERLVSWFYYARAPWYLADRVNTFGSKFK--- 244
Query: 142 VRKKHGDKTTFDECIRLNRTECSLENMWL--------QVPFLCGHAAACWVPGNP-WALE 192
V K F+ C+ + EC E + + Q F CG +P N A++
Sbjct: 245 VPSLQWLKKDFNHCLLTHDPECVYEQLDMEHLDDHRRQTLFFCGQQTKFCMPFNSRSAMQ 304
Query: 193 KAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHF--LTSNKSHLRRTNRKIDP 250
+AK N+ Y +VG E+ + +LEA +P +F G TD + + SN ++ R +
Sbjct: 305 QAKRNVEQHYAVVGTWEDTNTTLRVLEAYIPRYFAGATDLYYAMPSNMENVNRNAFRPAL 364
Query: 251 SEETVQQIKKSKIWELE 267
SE+ + ++ E+E
Sbjct: 365 SEQARALLSRNLTQEIE 381
>gi|195352363|ref|XP_002042682.1| GM14876 [Drosophila sechellia]
gi|194124566|gb|EDW46609.1| GM14876 [Drosophila sechellia]
Length = 270
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 49/178 (27%), Positives = 87/178 (48%), Gaps = 21/178 (11%)
Query: 91 ALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKHGD-- 148
+Y H ++DF +F +P++IN++R P++R++S++++ R +Y+ + +K +
Sbjct: 49 TMYIEHINWLDFDEFDLP-KPIYINLVRDPVERVISWFFYAR--SSYKNAIEYRKRPNQK 105
Query: 149 -------KTTFDECIRLNRTEC---------SLENMWLQVPFLCGHAAACWVPGNPWALE 192
K F++C+R EC S+ N Q F CGH C +P A++
Sbjct: 106 IKPESWYKKNFNDCVRSGDPECQYVPHTVKDSIANFKRQSLFYCGHHDDCLPFNSPTAVQ 165
Query: 193 KAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDP 250
AKE++ Y +VG E+ +++LE +P FFRG + N R K P
Sbjct: 166 MAKEHVERDYAVVGSWEDTNITLTVLENYIPRFFRGAKLMYEMHNSKITNRNKNKRKP 223
>gi|195173232|ref|XP_002027397.1| GL20907 [Drosophila persimilis]
gi|194113249|gb|EDW35292.1| GL20907 [Drosophila persimilis]
Length = 322
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 46/158 (29%), Positives = 84/158 (53%), Gaps = 21/158 (13%)
Query: 91 ALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKHGD-- 148
++Y H +IDF + +P+FIN++R P++R++S+YY++R ++YR + +K+
Sbjct: 102 SVYIEHTNWIDFNAYNLP-KPIFINMVRDPVERMISWYYYIR--NSYRNAIFYRKNPLAP 158
Query: 149 -------KTTFDECIRLNRTEC---------SLENMWLQVPFLCGHAAACWVPGNPWALE 192
K +F++C+R EC ++ N Q F CGH C +P A++
Sbjct: 159 IKPTAWFKKSFNDCVRSGDQECQYIPLTVKDAVPNFKRQSIFFCGHEPDCLPFNSPLAVQ 218
Query: 193 KAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGT 230
AK + T++ +VG EE +++LE +P +F T
Sbjct: 219 IAKRRVETEFAVVGTWEETNITLAVLEHYIPRYFARAT 256
>gi|195435614|ref|XP_002065775.1| GK19576 [Drosophila willistoni]
gi|194161860|gb|EDW76761.1| GK19576 [Drosophila willistoni]
Length = 305
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 55/214 (25%), Positives = 101/214 (47%), Gaps = 16/214 (7%)
Query: 30 IIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWRDRR 89
I YNR+ KTGS S + + ++ F + NV + R
Sbjct: 22 IFYNRLEKTGSQSMTRLINALGKRNNFGT-YRNVIVPKTSPIETFEEETEFIEQLMELER 80
Query: 90 PALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKHGDK 149
P+ Y H +++F Q + +P++IN++R P+ +++S YY+ R+ Y L+R K+ K
Sbjct: 81 PSAYVEHANYMNFTQHDTP-KPIYINLVRHPIQKVISAYYYQRHPVIYANTLLRNKNKKK 139
Query: 150 T------TFDECIR-------LNRTECSLENMWLQVPF-LCGHAAACWVPGNPWALEKAK 195
TF++C++ + + N W + LCG+ C + A +KAK
Sbjct: 140 DKIYFDRTFNDCVKQRVAPFCVFDSHNEFNNDWRRFSLHLCGNEEICTYFNSEIATQKAK 199
Query: 196 ENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGG 229
+N+ +Y ++G E+ +++LEA +P FF+
Sbjct: 200 DNVEREYAVIGSWEDTNITLAVLEAYIPRFFKDA 233
>gi|313231383|emb|CBY08498.1| unnamed protein product [Oikopleura dioica]
Length = 445
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 58/230 (25%), Positives = 105/230 (45%), Gaps = 17/230 (7%)
Query: 30 IIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWRDRR 89
+++N++PK GST+ N+ + + + H+ + + + D + + K R
Sbjct: 146 VLHNKLPKCGSTTMHNILTMLSQWNNYE--HIKI---DSAMMKFDDEKDLAAYLKSILRP 200
Query: 90 PALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKHG-- 147
P H F +F ++G E P +IN++R+P+ S Y+F + G ++ K++
Sbjct: 201 PMTVMKHHYFFNFTEYGM-EAPTWINVMREPISWFESRYWFKQNGWIHKTGSRTKENSHD 259
Query: 148 ---DKTTFDECIRLNRTECSLENMWLQVPFLCGHAAAC-----WVPGNPWALEKAKENLV 199
++ D CIR +C+ +W F CG++A C A E AK+ ++
Sbjct: 260 FEDERLDIDTCIRRKMKDCT-TVVWKYTRFFCGNSATCKGESRGFEAKGAAAEIAKKRIL 318
Query: 200 TKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKID 249
Y LVG+ E+ D +SL + LP FF G + + T R +D
Sbjct: 319 RDYFLVGILEQFEDTLSLFQKLLPQFFTGAREAARSEFAKIAMNTTRTLD 368
>gi|313227325|emb|CBY22471.1| unnamed protein product [Oikopleura dioica]
Length = 435
Score = 79.0 bits (193), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 64/229 (27%), Positives = 105/229 (45%), Gaps = 34/229 (14%)
Query: 30 IIYNRVPKTGSTS----FVNMA------YDMCRKKRFNVLHVNVTGNNHVLSLADQYRFV 79
I +NR+PK+G+TS F +A +D + F+ N + + A ++
Sbjct: 198 IFFNRLPKSGATSLRYIFERLANTNKFVFDYQKASMFDCAPGNKNCQDGPSAQAKFGVYI 257
Query: 80 NNVTKW--RDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNY 137
T W +D + H F +F +F QP ++N++R P+ RL S YYF R+G +
Sbjct: 258 RR-THWDTKDENYLMMKQH-HFFNFTEF-KIPQPTYLNMVRDPVSRLASSYYFQRHGWGF 314
Query: 138 RPHLVRKKHGDKT----TFDECIRLNRTECSLENMWLQV--PFLCGHAAAC--------- 182
+ K G + +FD+C++ ECS+ LQV + CG + AC
Sbjct: 315 GSNSSNKFKGSQEDFNRSFDDCVKQGLAECSVP---LQVFSKYFCGTSEACKKQKEDSED 371
Query: 183 -WVPGNPWALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGT 230
+ + + AK N++ Y +G+ E D + LLE LP +FRG
Sbjct: 372 DLLKKIAVSAQIAKRNILNDYFFIGLLEHFDDTLFLLEKILPDYFRGAA 420
>gi|313222243|emb|CBY39211.1| unnamed protein product [Oikopleura dioica]
Length = 265
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 64/229 (27%), Positives = 105/229 (45%), Gaps = 34/229 (14%)
Query: 30 IIYNRVPKTGSTS----FVNMA------YDMCRKKRFNVLHVNVTGNNHVLSLADQYRFV 79
I +NR+PK+G+TS F +A +D + F+ N + + A ++
Sbjct: 28 IFFNRLPKSGATSLRYIFERLANTNKFVFDYQKASMFDCAPGNKNCQDGPSAQAKFGVYI 87
Query: 80 NNVTKW--RDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNY 137
T W +D + H F +F +F QP ++N++R P+ RL S YYF R+G +
Sbjct: 88 RR-THWDTKDENYLMMKQH-HFFNFTEF-KIPQPTYLNMVRDPVSRLASSYYFQRHGWGF 144
Query: 138 RPHLVRKKHGDKT----TFDECIRLNRTECSLENMWLQV--PFLCGHAAAC--------- 182
+ K G + +FD+C++ ECS+ LQV + CG + AC
Sbjct: 145 GSNSSNKFKGSQEDFNRSFDDCVKQGLAECSVP---LQVFSKYFCGTSEACKKQKEDSED 201
Query: 183 -WVPGNPWALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGT 230
+ + + AK N++ Y +G+ E D + LLE LP +FRG
Sbjct: 202 DLLKKIAVSAQIAKRNILNDYFFIGLLEHFDDTLFLLEKILPDYFRGAA 250
>gi|195477376|ref|XP_002086331.1| GE22925 [Drosophila yakuba]
gi|194186121|gb|EDW99732.1| GE22925 [Drosophila yakuba]
Length = 220
Score = 78.6 bits (192), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 46/159 (28%), Positives = 84/159 (52%), Gaps = 17/159 (10%)
Query: 91 ALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPH-------LVR 143
+Y H ++DFQ++ +P++IN++R P++R++S+YY++R G H ++
Sbjct: 47 TIYIEHCNWLDFQRY-QLPRPIYINLVRDPVERMISWYYYVRSGYRNAIHHRRFPNATIK 105
Query: 144 KKHGDKTTFDECIRLNRTEC---------SLENMWLQVPFLCGHAAACWVPGNPWALEKA 194
+ K ++++C+R EC S+ N Q F CGH C + A++ A
Sbjct: 106 SEKWFKKSYNDCVRSGDPECQYVPGSIKESVGNYKRQTLFFCGHNRECLPFDSQRAIQLA 165
Query: 195 KENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHF 233
K ++ Y +VG EE +++LEA +P FF+G + F
Sbjct: 166 KLHVERDYAVVGTWEETNISLTVLEAYIPRFFKGVRNIF 204
>gi|313220353|emb|CBY31209.1| unnamed protein product [Oikopleura dioica]
Length = 359
Score = 78.6 bits (192), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 61/233 (26%), Positives = 103/233 (44%), Gaps = 21/233 (9%)
Query: 30 IIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHV--NVTGNNHVLSLADQYRFVNNVTKWRD 87
I +N++PK+GS++ + ++ K FN V N N+ +FV K
Sbjct: 87 IFHNKLPKSGSSTMNQLLRNLATKNNFNFAKVEPNQIPNDRFDLEKPLVKFVQETKK--- 143
Query: 88 RRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYG----DNYRPHLVR 143
P H +F + G + QP F+N++R P+D S YYF R+G R V
Sbjct: 144 -EPYFLLKHHFHFNFTRNGLR-QPTFVNVIRDPVDWYTSQYYFRRFGWVQSTTTRDSFVG 201
Query: 144 KKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEK-----AKENL 198
+ + + D CI+ EC+ + + + +LCG+ C L+ AK N+
Sbjct: 202 SQEDRERSIDGCIQQGLMECT-KPSYKYIQYLCGNHPHCRTVDVSEELKAKASNLAKINV 260
Query: 199 VTKYLLVGVTEELTDFVSLLEAALPSFFRGGTD----HFLTSNKSHLRRTNRK 247
+ + VG+ E+ D + EA LP+++ G D L ++ + NRK
Sbjct: 261 LRNFFAVGILEQFVDTLKTFEAILPNYYSGVLDIWNSQTLQEKRNRTKTLNRK 313
>gi|195435608|ref|XP_002065772.1| GK19598 [Drosophila willistoni]
gi|194161857|gb|EDW76758.1| GK19598 [Drosophila willistoni]
Length = 219
Score = 78.2 bits (191), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 44/153 (28%), Positives = 80/153 (52%), Gaps = 17/153 (11%)
Query: 91 ALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYG-------DNYRPHLVR 143
+++ H ++DF F +P++IN++R P+DR++S+YY++R G +R H ++
Sbjct: 50 SIFIEHGNWLDFPGF-KLPKPIYINLVRDPVDRVISWYYYIRGGYRNAIFYRRFRDHPIK 108
Query: 144 KKHGDKTTFDECIRLNRTECS---------LENMWLQVPFLCGHAAACWVPGNPWALEKA 194
+ K F++C+R EC N Q F CG+ C +P +++ A
Sbjct: 109 PEAFFKKNFNDCVRTGDPECQYIPNTTNERTGNYMRQTLFFCGNERQCLPFNSPRSVQLA 168
Query: 195 KENLVTKYLLVGVTEELTDFVSLLEAALPSFFR 227
K N+ Y +VG E+ +++LEA +P FF+
Sbjct: 169 KMNVERDYAVVGSWEDTNVTLTVLEAYIPRFFK 201
>gi|313242523|emb|CBY34661.1| unnamed protein product [Oikopleura dioica]
Length = 542
Score = 77.8 bits (190), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 61/233 (26%), Positives = 103/233 (44%), Gaps = 21/233 (9%)
Query: 30 IIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHV--NVTGNNHVLSLADQYRFVNNVTKWRD 87
I +N++PK+GS++ + ++ K FN V N N+ +FV K
Sbjct: 270 IFHNKLPKSGSSTMNQLLRNLATKNNFNFAKVEPNQIPNDRFDLEKPLVKFVQETKK--- 326
Query: 88 RRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYG----DNYRPHLVR 143
P H +F + G + QP F+N++R P+D S YYF R+G R V
Sbjct: 327 -EPYFLLKHHFHFNFTRNGLR-QPTFVNVIRDPVDWYTSQYYFRRFGWVQSTTTRDSFVG 384
Query: 144 KKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEK-----AKENL 198
+ + + D CI+ EC+ + + + +LCG+ C L+ AK N+
Sbjct: 385 SQEDRERSIDGCIQQGLMECT-KPSYKYIQYLCGNHPHCRTVDVSEELKAKASNLAKINV 443
Query: 199 VTKYLLVGVTEELTDFVSLLEAALPSFFRGGTD----HFLTSNKSHLRRTNRK 247
+ + VG+ E+ D + EA LP+++ G D L ++ + NRK
Sbjct: 444 LRNFFAVGILEQFVDTLKTFEAILPNYYSGVLDIWNSQTLQEKRNRTKTLNRK 496
>gi|195435606|ref|XP_002065771.1| GK19619 [Drosophila willistoni]
gi|194161856|gb|EDW76757.1| GK19619 [Drosophila willistoni]
Length = 230
Score = 77.4 bits (189), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 54/196 (27%), Positives = 95/196 (48%), Gaps = 21/196 (10%)
Query: 91 ALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLR-------YGDNYRPHLVR 143
LY H + DF +FG K +P++I+++R P+DR++ YY R Y Y R
Sbjct: 14 TLYMIHAPWSDFTEFGQK-KPVYISMVRDPIDRIIEDYYERRSTIRRAIYRHIYPGRPQR 72
Query: 144 KKHGDKTTFDECIRLNRTECS---------LENMWLQVPFLCGHAAACWVPGNPWALEKA 194
+ + +F+EC+R EC +E+ Q F CG+ C +P+A++ A
Sbjct: 73 TDNWYRQSFNECVRSGDPECQYLPGSIIDYVEDFKRQSLFFCGNHINCLPFNSPFAVQMA 132
Query: 195 KENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLR---RTNRKIDPS 251
K N+ +Y +VG EE +++LE +P +F NK+ ++ R NRK
Sbjct: 133 KRNVEKEYAVVGTWEEKNITLTVLEKYVPKYFNHAR-FIYKLNKTSIKNRNRNNRKPKVD 191
Query: 252 EETVQQIKKSKIWELE 267
+ + ++++ +E E
Sbjct: 192 ADVREMVRRNFTYEYE 207
>gi|260833384|ref|XP_002611637.1| hypothetical protein BRAFLDRAFT_63698 [Branchiostoma floridae]
gi|229297008|gb|EEN67647.1| hypothetical protein BRAFLDRAFT_63698 [Branchiostoma floridae]
Length = 308
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 46/133 (34%), Positives = 74/133 (55%), Gaps = 7/133 (5%)
Query: 151 TFDECIRLNRTECSL--ENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVGVT 208
TFD+C+ N EC + F CG + C P A+E AKEN+ Y +VGV
Sbjct: 169 TFDDCVLNNLWECDEFGPKTFTMTQFFCGQESICMEPSQ-MAVEVAKENIRRHYAVVGVL 227
Query: 209 EELTDFVSLLEAALPSFFRGGTDHF--LTSNKSHLRRTNRKIDPSEETVQQIKKSKIWEL 266
EE + F+ +LE +P FFRG D + + S + ++T KI PS E+ ++I + ++ L
Sbjct: 228 EEFSSFLKVLEVVMPQFFRGAHDTWRKIESKQMEHQKTAIKIPPSNES-REIMRERL-HL 285
Query: 267 ENELYEYALEQFH 279
+ ++Y++ E+FH
Sbjct: 286 DYQVYDFIKERFH 298
>gi|313237137|emb|CBY12357.1| unnamed protein product [Oikopleura dioica]
Length = 430
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 73/301 (24%), Positives = 124/301 (41%), Gaps = 51/301 (16%)
Query: 11 ISSAKSPSPETDSLSWDTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVN-------- 62
+ + SP P + I +N++PK+GST+ M Y + ++ N H++
Sbjct: 118 LPALNSPPPSSQ------FIFHNKLPKSGSTT---MKYIISTLQKANDFHMDYQSPCINK 168
Query: 63 -VTGNNHVLSLADQYRFVNNVTKWRDRRPA--LYHGHFGFIDFQQFGSKEQPLFINILRK 119
+ + + N+V R++ P + H +++F + EQP +IN++R
Sbjct: 169 ATCATDPADGIGAESTLANHVKVEREQHPGKFILLKHQYWLNFTEH-DMEQPTYINVVRD 227
Query: 120 PLDRLVSYYYFLRYGDNYRPHLVRK---KHGDKTT-------FDECIRLNRTECSLENMW 169
P+ R S YYF RYG R+ +H K T D C+ ECS E +
Sbjct: 228 PVTRFASMYYFNRYGFKSMGSAARQGAVRHSWKGTEEDIVRTLDMCMEQQGEECS-EPLQ 286
Query: 170 LQVPFLCGHAAACWVPGNPWALEKAKEN--------------LVTKYLLVGVTEELTDFV 215
+ V + CG A C + A N ++T+Y +G+ E+ + +
Sbjct: 287 VLVRYFCGTAIECNMKSAKMGKFGALNNWDKVAKAAELAKKKIITQYYSIGLMEKFDETL 346
Query: 216 SLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEETVQQIKKSKIWELENEL-YEYA 274
SL E LP FF+G + +S + R+ + T ++ W E L YE
Sbjct: 347 SLFEKMLPGFFKGAPAAY----RSQFVQNQRESSKTAHTDGYSNSTRSWLEEGPLRYEMD 402
Query: 275 L 275
L
Sbjct: 403 L 403
>gi|195352361|ref|XP_002042681.1| GM14877 [Drosophila sechellia]
gi|194124565|gb|EDW46608.1| GM14877 [Drosophila sechellia]
Length = 284
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 51/177 (28%), Positives = 89/177 (50%), Gaps = 23/177 (12%)
Query: 91 ALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKHGD-- 148
++Y H +IDF ++ +P+FIN++R P++R++S+YY++R ++YR + + +
Sbjct: 64 SVYIEHVPWIDFNEYNLP-KPIFINLVRDPVERMISWYYYVR--NSYRNAIFYRNNPLAP 120
Query: 149 -------KTTFDECIRLNRTECSL---------ENMWLQVPFLCGHAAACWVPGNPWALE 192
K ++++C+R EC N Q F CGH C +P A++
Sbjct: 121 LKPTAWFKKSYNDCVRSGDPECQYVPLAVRDVEGNFKRQTIFFCGHDQDCLPFNSPLAVQ 180
Query: 193 KAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKS--HLRRTNRK 247
AK + T+Y +VG EE +++LE +P +F F KS + R NRK
Sbjct: 181 IAKRRVETEYAVVGTWEETNITLTVLEHYIPRYFARAKMIFHLYQKSLQNRNRNNRK 237
>gi|326436559|gb|EGD82129.1| hypothetical protein PTSG_02803 [Salpingoeca sp. ATCC 50818]
Length = 345
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 76/301 (25%), Positives = 120/301 (39%), Gaps = 60/301 (19%)
Query: 29 VIIYNRVPKTGSTSF---VNMAYDMCRKKRFNVLHVNVTGN-----NHVLSLADQYR--- 77
++IYNRVPK GSTS VN+ F +L ++ T N N +L+ ++R
Sbjct: 27 LLIYNRVPKAGSTSVLQRVNLTNVAQGGNVFEMLAIHNTDNFKQSGNRARALSPEWRESM 86
Query: 78 ---FVNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPL-----FINILRKPLDRLVSYYY 129
+++ RP HGHF F DF + K+ + ++NILR P+ +LVS++
Sbjct: 87 VDYIMSHAEHATATRPLFVHGHFLFYDFYRHVQKQGKMPPTTAYMNILRHPVTKLVSHFQ 146
Query: 130 FL----------RYGDNYRPHLVRKKHGDKTTFDECIRLNRTECSLENMWLQVP------ 173
+L R G ++ P + T D C+ + + W +P
Sbjct: 147 YLQSAFRGSKRARSGLSFDPDV---------TIDACVSAIASTPPYKPTWEAIPNTTLPC 197
Query: 174 ------------FLCGHAAACWVPGNPWALEKAKENLVTKYLLVGVTEELTDFVSLLEAA 221
+ CG+A C L AKENL + VG+ EE LE
Sbjct: 198 DGTFFMRAVQWRYFCGYARECLQADIEPGLTMAKENLRKHFAFVGLLEETHLTYRALETL 257
Query: 222 LPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEETVQQIKK-SKIW-ELENEL--YEYALEQ 277
P+FF G+ L + K +T + ++ W EL ++ Y +A
Sbjct: 258 FPTFFARGSGGISAPTDEELEQIANKNPEKYKTTRATRRYIAAWAELSGDMDFYHFAASL 317
Query: 278 F 278
F
Sbjct: 318 F 318
>gi|195173230|ref|XP_002027396.1| GL20908 [Drosophila persimilis]
gi|194113248|gb|EDW35291.1| GL20908 [Drosophila persimilis]
Length = 228
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 44/161 (27%), Positives = 85/161 (52%), Gaps = 21/161 (13%)
Query: 91 ALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKHGDKT 150
+Y H ++DF ++ +P++IN++R P++R++S++Y++R G YR ++ + + T
Sbjct: 47 TIYIEHCNWLDFHRY-QLPKPIYINLVRDPVERMISWFYYIRSG--YRNAIIHNRFPNTT 103
Query: 151 ---------TFDECIRLNRTEC---------SLENMWLQVPFLCGHAAACWVPGNPWALE 192
++++C+R EC ++ + Q F CG+ C +P A++
Sbjct: 104 LKSEKWFKKSYNQCVRSGDPECQYVPDSIKDTVGDYKRQSLFYCGNNRECLPFDSPHAIQ 163
Query: 193 KAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHF 233
AK N+ Y +VG E+ +++LEA +P FFRG F
Sbjct: 164 LAKRNVERDYAVVGSWEDTNITLAVLEAYIPRFFRGARQVF 204
>gi|28574843|ref|NP_788531.1| pipe, isoform H [Drosophila melanogaster]
gi|28380472|gb|AAO41227.1| pipe, isoform H [Drosophila melanogaster]
Length = 405
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 51/177 (28%), Positives = 90/177 (50%), Gaps = 23/177 (12%)
Query: 91 ALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKHGD-- 148
++Y H +IDF ++ + +P+FIN++R P++R++S+YY++R ++YR + + +
Sbjct: 185 SVYIEHVPWIDFNEY-NLPKPIFINLVRDPVERMISWYYYVR--NSYRNAIFYRNNPLAP 241
Query: 149 -------KTTFDECIRLNRTECSL---------ENMWLQVPFLCGHAAACWVPGNPWALE 192
K ++++C+R EC N Q F CGH C +P A++
Sbjct: 242 LKPTAWFKKSYNDCVRSGDPECQYVPLAVRDVEGNFKRQTLFFCGHDQDCLPFNSPLAVQ 301
Query: 193 KAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKS--HLRRTNRK 247
AK + T+Y +VG EE +++LE +P +F F KS + R NRK
Sbjct: 302 IAKRRVETEYAVVGTWEETNITLTVLEHYIPRYFARAKMIFNLYQKSLQNRNRNNRK 358
>gi|189241342|ref|XP_001809848.1| PREDICTED: similar to uronyl-2-sulfotransferase (predicted)
[Tribolium castaneum]
gi|270014085|gb|EFA10533.1| hypothetical protein TcasGA2_TC012787 [Tribolium castaneum]
Length = 304
Score = 75.5 bits (184), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 67/268 (25%), Positives = 117/268 (43%), Gaps = 27/268 (10%)
Query: 10 HISSAKSPSPETDSLSWDTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNN-H 68
H++ + + D ++ V++ N VP G V + M + N HV + G
Sbjct: 43 HVTKSMAQLGRMDEIN-KFVLLLNPVPNCGGEILVFLLQKM--QGLNNYRHVRLKGGGVR 99
Query: 69 VLSLADQYRFVNNVTKW--RDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVS 126
L+ Q FV+ + + + P + F++F + K+ P +INI+R+P D+ +S
Sbjct: 100 RLNGRQQEEFVDKLYRVMREEAVPLSFDRQLLFVNFTTY-DKQSPTYINIVREPADKAIS 158
Query: 127 YYYFLRYGDNYRPHLVRKKHGDKTTFDECIRLNRTECSLENM---WLQVPFLCGHAAACW 183
++ K+ D + C+ + C + L +P+ CGH C
Sbjct: 159 RSFY------------NNKNTD-SDLIACLAKGKGNCEGRKVNPYQLTIPYFCGHDPKCM 205
Query: 184 VPGNPWALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRR 243
+ N WAL+ AK N+ Y +VGV EEL + +LE +P FF+G + K + R
Sbjct: 206 L-DNQWALQTAKNNVEKYYPVVGVLEELNATLEVLENEIPYFFKGVQGVY---RKKMISR 261
Query: 244 TNRKIDPSEETVQQIKKSKIWELENELY 271
NR+ T + + +I E + Y
Sbjct: 262 FNRRKTSQPVTKTRKQLHRILATEYDFY 289
>gi|443704226|gb|ELU01375.1| hypothetical protein CAPTEDRAFT_206336, partial [Capitella teleta]
Length = 141
Score = 75.1 bits (183), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 48/143 (33%), Positives = 75/143 (52%), Gaps = 15/143 (10%)
Query: 151 TFDECIRLNRTECSLENMWLQVPF-LCGHAAACWV-PGNPWALEKAKENLVTKYLLVGVT 208
+F+EC+ + C+ ++++++ + CG CW W L KAK+NLV Y +VG+
Sbjct: 1 SFEECVYKKKEGCTGRHVFMKMLYYFCGQDPRCWFDKSRSWTLAKAKQNLVKYYSVVGIV 60
Query: 209 EELTDFVSLLEAALPSFFRGGTDHF--LTSNKSHLRRTNRKIDPSEETVQQIKKSKIWEL 266
E++ F LE +P FF+G F S+ +T KI PSEE V+ I K + E
Sbjct: 61 EDMDSFFYALEKRMPRFFKGAFGLFGRYGSSLKEAYKTKGKIYPSEE-VRTIMKKNMPE- 118
Query: 267 ENELYEYALEQFHFVKK--HNLV 287
A E ++FVK+ HNL+
Sbjct: 119 -------AFELYYFVKQRFHNLL 134
>gi|345481668|ref|XP_003424424.1| PREDICTED: heparan sulfate 2-O-sulfotransferase 1-like [Nasonia
vitripennis]
Length = 333
Score = 74.7 bits (182), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 68/231 (29%), Positives = 106/231 (45%), Gaps = 45/231 (19%)
Query: 19 PETDSLSWDTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVL-HVNV-TGNNHVLSLADQY 76
PET+ D V++ RVP G+ V + + R + FN H+ + +G+ LS Q
Sbjct: 104 PETN----DHVLMVTRVPGAGAELLVLI---LQRLQGFNAFKHIRLPSGDEGTLSNLQQE 156
Query: 77 RFVNNVTKW--RDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYG 134
V VT ++ P + G F++F FG ++ P +I+++R PLD
Sbjct: 157 LLVEEVTSIIRQEAIPLSFDGDVRFLNFSAFG-RQAPTYISLVRDPLDP----------- 204
Query: 135 DNYRPHLVRKKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKA 194
L R K GD +T + R S CGH + C N WALE+A
Sbjct: 205 ----KTLERFKKGDSST------IYRGSLS---------HFCGHDSRCSERNNEWALEQA 245
Query: 195 KENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTN 245
K N++ Y +VGV + + + ++ LE A P FF G + L +K ++TN
Sbjct: 246 KANVLRWYPIVGVLDLMDETLNSLERAFPYFFEGAS---LIYDKLRPKKTN 293
>gi|313225237|emb|CBY06711.1| unnamed protein product [Oikopleura dioica]
Length = 434
Score = 74.7 bits (182), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 78/319 (24%), Positives = 138/319 (43%), Gaps = 54/319 (16%)
Query: 11 ISSAKSPSPETDSLSWDTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVN-----VTG 65
+ + SP P T + +N++PK+GST+ M Y + ++ N H++ +
Sbjct: 121 LPALNSPPPSTQ------FVFHNKLPKSGSTT---MKYIISTLQKANNFHMDYQAPCINK 171
Query: 66 NNHVLSLADQY----RFVNNVTKWRDRRPA--LYHGHFGFIDFQQFGSKEQPLFINILRK 119
L + D + +V + RD P + H +++F + EQP +IN++R
Sbjct: 172 ATCALDVEDGLGATTKLAGHVKEQRDANPGKFILLKHQYWLNFTEH-EMEQPTYINVVRD 230
Query: 120 PLDRLVSYYYFLRY-----GDNYRPHLVR-----KKHGDKTTFDECIRLNRTECSLENMW 169
P+ R S YYF RY G R VR + T D C+ EC+ E +
Sbjct: 231 PVTRFSSMYYFNRYGFKSMGSEARQGAVRHSWKGTEEDIARTLDMCMEEQGEECT-EPLQ 289
Query: 170 LQVPFLCGHAAACWVPG---------NPW-----ALEKAKENLVTKYLLVGVTEELTDFV 215
+ V + CG + C + N W A E AK+ ++T+Y +G+ E+ + +
Sbjct: 290 VLVRYFCGTSQECNMKSPKRGKFGSLNDWNKVAKAAELAKKKIITQYYSIGLMEKFDETL 349
Query: 216 SLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEETVQQIKKSKIWELENELYEYAL 275
+L E LP FF G + +S +T R+ + T ++ W LE Y +
Sbjct: 350 ALFEKMLPGFFAGAPAAY----RSQFVQTQRESSKTAHTDGFSNSTRTW-LEEGPLRYEM 404
Query: 276 EQFHFVKKHNLVYNKVLGY 294
+ ++ + + ++ K L Y
Sbjct: 405 DLYNLI---SAIFYKRLSY 420
>gi|390354668|ref|XP_793957.3| PREDICTED: heparan sulfate 2-O-sulfotransferase 1-like
[Strongylocentrotus purpuratus]
Length = 165
Score = 74.3 bits (181), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 45/159 (28%), Positives = 81/159 (50%), Gaps = 10/159 (6%)
Query: 124 LVSYYYFLRYGDNYRPHLVRKKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACW 183
+VS +YF R+GD + + + + + +ECI EC+ M+ Q+ CG C
Sbjct: 1 MVSAFYFNRFGDGFLERGMLSEKKNNMSIEECILDGHNECTNAGMYPQI--FCGFNPRCG 58
Query: 184 VPGNPWALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLT-----SNK 238
WALE+AK N+ Y +G+TEE + +LE +P + G T+ +L+ +++
Sbjct: 59 -KNTTWALEQAKSNIDKYYTFIGITEEYEASLRVLEHLMPDIYNGTTELYLSFLNEETSR 117
Query: 239 SHLRRTNRKIDPSEETVQQIKKSKIWELENELYEYALEQ 277
H +T K S+E + +K++ ++ ELY Y ++
Sbjct: 118 VHSTKTKNKQPLSQELNDTV--TKLFAVDYELYNYIYDK 154
>gi|328776514|ref|XP_003249170.1| PREDICTED: heparin sulfate O-sulfotransferase-like [Apis mellifera]
Length = 304
Score = 74.3 bits (181), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 65/251 (25%), Positives = 113/251 (45%), Gaps = 43/251 (17%)
Query: 29 VIIYNRVPKTGSTSFVNMAYDMCRKKRFNVL-HVNVTGNNH-VLSLADQYRFVNNVTKW- 85
+++ RVP G+ FV + + R + +N H+ + +H +LS + V +T
Sbjct: 77 ILMLTRVPDAGAELFVLL---LQRLQGYNAFKHIRLPPGDHGLLSTLQEELLVEEITNII 133
Query: 86 -RDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRK 144
++ P + G F++F +FG +E P FI+++R P + FLRY + R ++V
Sbjct: 134 RQEAIPLSFDGDVRFLNFSKFG-RESPSFISLVRNP----IGVQNFLRYHERRR-NMVEN 187
Query: 145 KHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLL 204
+ +P CG C N WAL++AK N+V Y +
Sbjct: 188 R-------------------------AIPIFCGQDPRCSEINNKWALQRAKANVVEWYPV 222
Query: 205 VGVTEELTDFVSLLEAALPSFFRGGTDHF--LTSNKSHLRRTNRKIDPSEETVQQIKKSK 262
VG+ E + + +LE P FFRG + + S H ++ S+E + SK
Sbjct: 223 VGILEYMEQSIDILEYKFPYFFRGAKHSYKKIQSKNRHFPDPTFMLN-SQEGYNIL--SK 279
Query: 263 IWELENELYEY 273
++E E E Y++
Sbjct: 280 LFEDEIEFYQW 290
>gi|402868004|ref|XP_003919522.1| PREDICTED: LOW QUALITY PROTEIN: uronyl 2-sulfotransferase [Papio
anubis]
Length = 381
Score = 73.9 bits (180), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 54/143 (37%), Positives = 75/143 (52%), Gaps = 6/143 (4%)
Query: 144 KKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYL 203
K H T +ECI N ECS ++ +P+ CG C PG WALE+AK N+ +L
Sbjct: 196 KDHKATTDINECILENYPECSNPRLFYIIPYFCGQHPRCREPGE-WALERAKLNVNENFL 254
Query: 204 LVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNK---SHLRRTNRKIDPSEETVQQIKK 260
LVG+ EEL D + LLE LP +F+G + ++ T +K PS E VQ + +
Sbjct: 255 LVGILEELEDVLLLLERFLPHYFKGVLSIYKDPEHRKLGNMTVTVKKTVPSPEAVQILYQ 314
Query: 261 SKIWELENELYEYALEQFHFVKK 283
+E E Y Y EQFH +K+
Sbjct: 315 RMRYEY--EFYHYVKEQFHLLKR 335
>gi|195173236|ref|XP_002027399.1| GL20904 [Drosophila persimilis]
gi|194113251|gb|EDW35294.1| GL20904 [Drosophila persimilis]
Length = 239
Score = 73.9 bits (180), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 54/225 (24%), Positives = 115/225 (51%), Gaps = 19/225 (8%)
Query: 71 SLADQYRFVNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYF 130
+ +Q FV+ +T+ + P++Y H +++F +P++IN++R P+++++S YY+
Sbjct: 4 NFEEQREFVHQLTELSE--PSVYVEHANWVNFT-VHDMPRPIYINLVRHPIEKVISAYYY 60
Query: 131 LRY----GDNYRPH---LVRKKHGDKTTFDECIRLNRT-ECSLENM------WLQVPF-L 175
LR+ G + R + +V+ K F++C++ + C + W + L
Sbjct: 61 LRHPKIVGQSVRRNPNKIVQDKTYYDMKFNDCVKQRISPHCVFDAHNRFNGDWRRFALKL 120
Query: 176 CGHAAACWVPGNPWALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLT 235
CG+A C + ++ AK ++ +Y +VG E+ +++LEA +P FF G T + +
Sbjct: 121 CGNAQICEQLNSEATMQMAKMHVEREYSVVGTWEQTNITLAVLEAYIPRFFTGATKVYYS 180
Query: 236 SNKSHLRRTNRKIDPSEETVQQIKKSKIWELENELYEYALEQFHF 280
K T + +E V++ K ++ E ELY++ +++ +
Sbjct: 181 QTKKFTVNTTPHDNSLDEEVERYLKDS-FKFELELYQFIMQRLYM 224
>gi|196009143|ref|XP_002114437.1| hypothetical protein TRIADDRAFT_58255 [Trichoplax adhaerens]
gi|190583456|gb|EDV23527.1| hypothetical protein TRIADDRAFT_58255 [Trichoplax adhaerens]
Length = 271
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 53/170 (31%), Positives = 86/170 (50%), Gaps = 15/170 (8%)
Query: 113 FINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKHGDKTTFDE----CIRLNRTECSLENM 168
+IN++R PLDR +S+YY+ RYGD RP K+ + F+E C + C L M
Sbjct: 99 YINMVRDPLDRYLSHYYYQRYGD--RPKEKLKEMRNLGQFNESLQDCFQQQHQGCELNVM 156
Query: 169 WLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRG 228
F CG+ C + GN AL +AK N++T Y ++G+ EE L LP+FF
Sbjct: 157 ---TRFFCGYDKYCAL-GNQRALRQAKRNILTNYAVIGLLEEWDLSSQLFRKILPNFF-T 211
Query: 229 GTDHFLTSNKSHLRRTNRKIDPSEETVQQIKKSKIWELENELYEYALEQF 278
+ ++ K + R N + P ++ IK++ + + +LY + + F
Sbjct: 212 QINEKISRYKVNKNRKNEPLSPG--LIKSIKEAN--QADYKLYRFIKQLF 257
>gi|340723662|ref|XP_003400208.1| PREDICTED: uronyl 2-sulfotransferase-like [Bombus terrestris]
Length = 302
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 61/226 (26%), Positives = 99/226 (43%), Gaps = 51/226 (22%)
Query: 29 VIIYNRVPKTGSTSFVNMAYDMCRKKRFNVL-HVNVTGNNH-VLSLADQYRFVNNVTKW- 85
V++ RVP G+ V M + R + +N H+ + +H +LS + V +T
Sbjct: 77 VLMLTRVPDAGAELLVLM---LQRLQGYNAFKHIRLPPGDHGLLSTLQEELLVEEITNII 133
Query: 86 -RDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRK 144
++ P + G F++F +FG +E P FI+++R P+ + RY R+
Sbjct: 134 RQEAIPLSFDGDVRFLNFSKFG-RESPTFISLVRNPM----CAHNLRRY---------RE 179
Query: 145 KHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLL 204
+ D L+R +P CG C N WALE+AK N+V +Y +
Sbjct: 180 RRNDM--------LHRA----------IPTFCGQDPRCAKINNKWALERAKANIVERYPV 221
Query: 205 VGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDP 250
VG+ + + +LE P FFRG +R+ RKI P
Sbjct: 222 VGILNYMEQSIDVLEYKFPYFFRGA------------KRSYRKIQP 255
>gi|389614798|dbj|BAM20417.1| heparan sulphate 2-o-sulfotransferase pipe, partial [Papilio
polytes]
Length = 300
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 56/208 (26%), Positives = 98/208 (47%), Gaps = 21/208 (10%)
Query: 90 PALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLR----YGDNYR--PHL-V 142
PA Y H + +F FG P+++ ++R P++R++S+YY++R Y + R P L +
Sbjct: 61 PASYIKHVCYTNFTXFGYPS-PIYVXVVRDPVERVISWYYYVRAPWYYVERKRAFPDLPL 119
Query: 143 RKKHGDKTTFDECIRLNRTEC---------SLENMWLQVPFLCGHAAACWVPGNPWALEK 193
K F+ C+ EC + + Q F CGH C + AL++
Sbjct: 120 PDPAWLKKDFETCVLSGDRECRYVEGETHEGIGDHRRQTLFFCGHEPQCTPFNSREALQR 179
Query: 194 AKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKI--DPS 251
AK + +Y +VGV E++ + E +P FF+G + + + R NR P
Sbjct: 180 AKRVVEQQYAVVGVLEDMNATLLAFERYIPRFFQGALNLYWEELNT-FNRINRNAFKPPV 238
Query: 252 EETVQQIKKSKIWELENELYEYALEQFH 279
E V+QI ++ + E E YE+ ++ +
Sbjct: 239 SEAVKQIVRAN-FTREIEFYEFCKQRLY 265
>gi|195352365|ref|XP_002042683.1| GM14875 [Drosophila sechellia]
gi|194124567|gb|EDW46610.1| GM14875 [Drosophila sechellia]
Length = 282
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 54/228 (23%), Positives = 109/228 (47%), Gaps = 23/228 (10%)
Query: 69 VLSLADQYRFVNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYY 128
+L A+Q ++N+ ++ H F++F + + +P++IN++R P++R++S+Y
Sbjct: 30 ILETAEQSDMIDNIVNLDG---TVFASHVNFLNFTKH-EQPRPIYINMVRDPVERVISWY 85
Query: 129 YFLRYGDNYRPHLVRKKHGD------KTTFDECIRLNRTECS-LENMWL--------QVP 173
Y++R + P R T FD+C+ C+ +EN L Q
Sbjct: 86 YYIRAPWVFVPGRRRNNREMPNPKWVNTEFDQCVTSGEKVCTYIENSLLEHVGDHRRQTL 145
Query: 174 FLCGHAAACWVPGNP-WALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDH 232
F CGH P N L+ AK N+ +Y +VG E + +++LEA +P +F +
Sbjct: 146 FFCGHNEFQCTPFNARLPLQLAKMNVEREYSVVGTWEHTNETLAVLEAYVPRYFADASKM 205
Query: 233 FLTSNKSHLRRTN-RKIDPSEETVQQIKKSKIWELENELYEYALEQFH 279
+ + + + N K S++ + ++++ E+ E Y++ ++ H
Sbjct: 206 YYSGLHADKQNVNPMKPHISQDILDMVRRNFTREI--EFYQFCRQRLH 251
>gi|195352369|ref|XP_002042685.1| GM14872 [Drosophila sechellia]
gi|194124569|gb|EDW46612.1| GM14872 [Drosophila sechellia]
Length = 262
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 42/158 (26%), Positives = 85/158 (53%), Gaps = 17/158 (10%)
Query: 90 PA-LYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYG-DNYRPHLVRKKHG 147
PA +Y H +++ + F QP++IN++R P++R++S++Y+ R ++ + + V K G
Sbjct: 44 PAFVYVEHMNYMNIRPFNLP-QPIYINMIRDPVERVISWFYYKRTPWNSVKMYKVTGKFG 102
Query: 148 DKT----TFDECIRLNRTECSLENMWL----------QVPFLCGHAAACWVPGNPWALEK 193
++T F++C+ + EC + + Q F CGH+ C P A+ +
Sbjct: 103 NRTHYTKNFEDCVLTHDPECRYDYGLMFKDDSADHKRQSLFFCGHSPICEPFNTPAAIAR 162
Query: 194 AKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTD 231
AK+N+ + ++G E+ +++LE +P FF+G +
Sbjct: 163 AKQNIERDFSVIGSWEDTNVTLTVLEHYIPRFFKGSME 200
>gi|195377459|ref|XP_002047507.1| GJ11900 [Drosophila virilis]
gi|194154665|gb|EDW69849.1| GJ11900 [Drosophila virilis]
Length = 221
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 47/165 (28%), Positives = 81/165 (49%), Gaps = 21/165 (12%)
Query: 91 ALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKHGDKT 150
+Y H +IDF F +P++IN++R P++R++S+YY++R ++Y + +K+ T
Sbjct: 46 TVYIEHTSWIDFNAFNLP-KPIYINLVRDPVERVISWYYYVR--NSYLNAIFYRKNPMAT 102
Query: 151 ---------TFDECIRLNRTECS---------LENMWLQVPFLCGHAAACWVPGNPWALE 192
F+EC+R EC + N Q F CGH C +P A++
Sbjct: 103 LKPTAWFKKDFNECVRSGDPECQYVPLTVKDYVGNYKRQSLFFCGHDRNCLPFDSPLAIQ 162
Query: 193 KAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSN 237
AK + +Y +VG EE +++LE +P +F T + N
Sbjct: 163 IAKRRVEEEYAVVGSWEETNITLTVLEHYIPRYFARATTLYPCKN 207
>gi|321477423|gb|EFX88382.1| hypothetical protein DAPPUDRAFT_234511 [Daphnia pulex]
Length = 295
Score = 72.0 bits (175), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 65/268 (24%), Positives = 114/268 (42%), Gaps = 56/268 (20%)
Query: 30 IIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWRDRR 89
+IYNRVP+ G + V + ++ + RF H QYR T W
Sbjct: 56 LIYNRVPRCGGLTMVFLMKELAKVNRF-------AHQRH------QYR-----TPW---- 93
Query: 90 PALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYR--PHLVRKKHG 147
++ +P +INI+R P +R S + R D + + R+
Sbjct: 94 -------------NRYTKNLRPTYINIVRDPAEREYSAFRGRRSQDPLQITQEIKRRDAA 140
Query: 148 DKTT--------FDECIRLNRTECSLEN----MWLQVPFLCGHAAACWVPGNPWALEKAK 195
T FD+CI EC+ + +P+ CG C VP + WAL++AK
Sbjct: 141 GAGTGMEWYTKSFDDCILDEDPECAFNSSEYTFSRAIPYFCGQDPRCLVPRSRWALQRAK 200
Query: 196 ENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDP----S 251
+ +Y +VG+ +++ + + +LE +P FF G + + + R NR I S
Sbjct: 201 FIIEHEYSVVGILDKMNETLQVLERYIPRFFAGSSKIYYSRGYGR-RHENRYIKSKPSLS 259
Query: 252 EETVQQIKKSKIWELENELYEYALEQFH 279
E+ + +++ S E ELY++ ++ +
Sbjct: 260 EKVLAKLRDS--LSDEYELYDFCQQRLY 285
>gi|321464452|gb|EFX75460.1| hypothetical protein DAPPUDRAFT_107932 [Daphnia pulex]
Length = 327
Score = 72.0 bits (175), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 63/222 (28%), Positives = 96/222 (43%), Gaps = 26/222 (11%)
Query: 29 VIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHV-LSLADQYRF---VNNVTK 84
++ + RVPKTGS V + + F HV + H L +Q V V
Sbjct: 48 ILFFFRVPKTGSEMTVLLLQWLQGINGFR--HVRLQNTVHRRLDTFEQRNLREEVLGVLS 105
Query: 85 WRDRRPALYHGHFGFIDFQQFGSKEQPL-------FINILRKPLDRLVSYYYFLRYGDNY 137
+ P + H F D ++ S Q + + LR P+DR+VS +Y+ R
Sbjct: 106 VSEGLPVAFDRHVYFTDLERLYSGIQRTSEAIKVNYFSSLRDPIDRIVSQFYYTRATP-- 163
Query: 138 RPHLVRKKHGDK--------TTFDECIRLNRTECSL---ENMWLQVPFLCGHAAACWVPG 186
RP + H T +EC+ ECS ++ LQ+P+ CGH C
Sbjct: 164 RPDIKLPPHISTPPPTSYRFQTMEECLEAAEPECSFVTGQHYDLQIPYFCGHDEHCTQLN 223
Query: 187 NPWALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRG 228
N ALE+AK N+ + +VGV E+L + + E + +FF G
Sbjct: 224 NARALEQAKSNVELHFRVVGVLEQLNCTLRVAEKRIGTFFSG 265
>gi|195352359|ref|XP_002042680.1| GM14878 [Drosophila sechellia]
gi|194124564|gb|EDW46607.1| GM14878 [Drosophila sechellia]
Length = 220
Score = 71.6 bits (174), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 44/161 (27%), Positives = 83/161 (51%), Gaps = 21/161 (13%)
Query: 91 ALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHL--------- 141
+Y H ++DF+++ +P++IN++R P++R++S+YY++R YR +
Sbjct: 47 TIYIEHCNWLDFRRY-RLPRPIYINLVRDPVERMISWYYYVRSA--YRNAIHHRRFPNAP 103
Query: 142 VRKKHGDKTTFDECIRLNRTEC---------SLENMWLQVPFLCGHAAACWVPGNPWALE 192
++ + K ++++C+R EC S N Q F CGH C + A++
Sbjct: 104 IKSEKWFKKSYNDCVRSGDPECQYVPGSIKDSEGNYKRQTLFFCGHDRECLPFDSQRAIQ 163
Query: 193 KAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHF 233
AK ++ Y +VG EE +++ EA +P FF+G + F
Sbjct: 164 LAKLHVERDYAVVGTWEETNITLTVFEAYIPRFFKGVRNIF 204
>gi|390363509|ref|XP_003730388.1| PREDICTED: uncharacterized protein LOC100891657 [Strongylocentrotus
purpuratus]
Length = 335
Score = 71.2 bits (173), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 42/117 (35%), Positives = 63/117 (53%), Gaps = 4/117 (3%)
Query: 20 ETDSLSWDTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFV 79
E + I YNRV K GS S + + + K RF H+ + + L +Y +
Sbjct: 201 EEPEMQTGDAIFYNRVGKCGSRSVIAVLRLLALKNRF---HLVSSLTYNATKLVPEYEKM 257
Query: 80 NNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDN 136
+ ++P L+ H FIDF+++G K QP +INI+R PL R+VS+YYF R+GD
Sbjct: 258 MVTVLSQIQKPYLFQRHVYFIDFRRYGVK-QPKYINIIRDPLSRMVSHYYFQRFGDG 313
>gi|28574839|ref|NP_788534.1| pipe, isoform F [Drosophila melanogaster]
gi|28380475|gb|AAO41230.1| pipe, isoform F [Drosophila melanogaster]
gi|33636597|gb|AAQ23596.1| RE07829p [Drosophila melanogaster]
gi|220951094|gb|ACL88090.1| pip-PF [synthetic construct]
gi|220959632|gb|ACL92359.1| pip-PF [synthetic construct]
Length = 397
Score = 71.2 bits (173), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 71/280 (25%), Positives = 129/280 (46%), Gaps = 40/280 (14%)
Query: 16 SPSPETDSLSWDTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVL-SLAD 74
+P E D I YNR+ KTGS S + + + F+ V + + S D
Sbjct: 117 TPKAERD------FIFYNRLEKTGSQSMTRLIKQLGDRLGFDTYRNIVRPSRSITESEED 170
Query: 75 QYRFVNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYG 134
+ V + + + A+Y H +++F Q S +P++IN++R P+ +++S YY+ R+
Sbjct: 171 EKDLVEQLFELGEH--AVYVEHANWVNFTQHDSP-RPIYINMVRHPIQKVISAYYYQRHP 227
Query: 135 DNYRPHLVR--------KKHGDKTTFDECIRLN-RTECSLE------NMWLQVPF-LCGH 178
+ L+R KK D T F++C+R R C + W + LCG+
Sbjct: 228 LIFAQSLLRNPNKPMQTKKFFD-TNFNDCVRKRVRPHCVFDAHNPFNGDWRRFSLHLCGN 286
Query: 179 AAACWVPGNPWALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGG-------TD 231
+ C + + AK N+ +Y +VG E+ +++LEA +P FF T+
Sbjct: 287 SEICTHFNSETTTQIAKMNVEREYAVVGSWEDTNVTLAVLEAYIPRFFTDATKVYYSNTE 346
Query: 232 HFLTSNKSHLRRTNRKIDPSEETVQQIKKSKIWELENELY 271
+F +N SH ++ ++ + +K S +E+E L+
Sbjct: 347 NFTINNVSHDTHLDKDVE------EYLKSSFSFEIELYLF 380
>gi|355695124|gb|AER99902.1| heparan sulfate 2-O-sulfotransferase 1 [Mustela putorius furo]
Length = 84
Score = 71.2 bits (173), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 37/85 (43%), Positives = 56/85 (65%), Gaps = 3/85 (3%)
Query: 227 RGGTDHFLT-SNKSHLRRTNRKIDPSEETVQQIKKSKIWELENELYEYALEQFHFVKKHN 285
RG T+ + T KSHLR+T K P+++T+ ++++S IW++ENE YE+ALEQF F++ H
Sbjct: 1 RGATELYRTVGKKSHLRKTTEKKLPTKQTIAKLQQSDIWKMENEFYEFALEQFQFIRAHA 60
Query: 286 LVYNKVLGYEADKGKQFMYEKIYPK 310
+ + G + F YEKIYPK
Sbjct: 61 V--REKDGDLYILAQNFFYEKIYPK 83
>gi|313218451|emb|CBY43025.1| unnamed protein product [Oikopleura dioica]
Length = 273
Score = 70.9 bits (172), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 48/167 (28%), Positives = 79/167 (47%), Gaps = 15/167 (8%)
Query: 16 SPSPETDSLSWDTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQ 75
+P P T + +N++PK GST+ N+ + RK F + ++G V+ D+
Sbjct: 44 APVPNTK------FVFHNKLPKCGSTTMHNIVGLLSRKNNFTYWKI-MSG---VMKFTDE 93
Query: 76 YRFVNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGD 135
+ + K R R P H ++DF ++ S QP F+N++R P+ S+Y F+R+G
Sbjct: 94 ETLIQAL-KMRYREPFFLLQHHFWMDFNKY-SMHQPTFVNMIRDPISWFQSHYTFMRFGM 151
Query: 136 NYRPHLVRKKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAAC 182
N K G + D+CI+ C + N W + F CG C
Sbjct: 152 NKGRGENDPKLG--SDIDDCIKNKEKNC-VSNQWTYIEFFCGSEKLC 195
>gi|313233060|emb|CBY24171.1| unnamed protein product [Oikopleura dioica]
Length = 460
Score = 70.5 bits (171), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 51/202 (25%), Positives = 92/202 (45%), Gaps = 10/202 (4%)
Query: 30 IIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWRD-R 88
+ +N++PK GST+ + + R F L + G + + + + +
Sbjct: 189 VYHNKLPKCGSTTMHAILGVLSRWNSFRYLKLE-PGLVKFFDGEKMSKLITTLVENKKVS 247
Query: 89 RPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKHGD 148
+P H + + + E P +IN++R PL S +YF R+G +P R+ D
Sbjct: 248 KPFFIFKHHYYFNASMYDF-ETPTWINVIRDPLSWFESNFYFKRFGWERQPGSRRRDDQD 306
Query: 149 KTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPW----ALEKAKENLVTKYLL 204
T D+C+ +C W FLCG+A C + +P A E AK N+ + L
Sbjct: 307 -LTIDKCVETGHEDCK-RVKWKYNQFLCGNAPVC-IGHSPAEKQRAAEIAKHNIANNFFL 363
Query: 205 VGVTEELTDFVSLLEAALPSFF 226
VG+ E+ D +++ E +P+++
Sbjct: 364 VGILEQFIDTLNVFEKLMPAYY 385
>gi|313241766|emb|CBY33983.1| unnamed protein product [Oikopleura dioica]
Length = 343
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 41/135 (30%), Positives = 72/135 (53%), Gaps = 8/135 (5%)
Query: 30 IIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWRDRR 89
I +N++PK+GST+ ++ + +K FN ++ + + D V+ + K +
Sbjct: 213 IFHNKLPKSGSTTMHDILRKLSQKNLFNYKKMDSSN----MDFDDDASLVDYI-KENQKT 267
Query: 90 PALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKHGDK 149
P Y H F +F FG EQP IN++R+PLD S+Y+F YG + +P ++ G++
Sbjct: 268 PFFYMQHHFFTNFTSFG-LEQPTMINVIREPLDWFSSHYHFKLYGWSRKPG--QRGEGNE 324
Query: 150 TTFDECIRLNRTECS 164
+ +ECI + CS
Sbjct: 325 MSLEECISSDSPTCS 339
>gi|313226121|emb|CBY21264.1| unnamed protein product [Oikopleura dioica]
Length = 345
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 41/135 (30%), Positives = 72/135 (53%), Gaps = 8/135 (5%)
Query: 30 IIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWRDRR 89
I +N++PK+GST+ ++ + +K FN ++ + + D V+ + K +
Sbjct: 215 IFHNKLPKSGSTTMHDILRKLSQKNLFNYKKMDSSN----MDFDDDASLVDYI-KENQKT 269
Query: 90 PALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKHGDK 149
P Y H F +F FG EQP IN++R+PLD S+Y+F YG + +P ++ G++
Sbjct: 270 PFFYMQHHFFTNFTSFG-LEQPTMINVIREPLDWFSSHYHFKLYGWSRKPG--QRGEGNE 326
Query: 150 TTFDECIRLNRTECS 164
+ +ECI + CS
Sbjct: 327 MSLEECISSDSPTCS 341
>gi|195477372|ref|XP_002086330.1| GE22926 [Drosophila yakuba]
gi|194186120|gb|EDW99731.1| GE22926 [Drosophila yakuba]
Length = 290
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 53/217 (24%), Positives = 106/217 (48%), Gaps = 24/217 (11%)
Query: 29 VIIYNRVPKTGSTSFVNMAYDMCRKKRFNV-LHVNVTGNNHVLSLADQYRFVNNVTKWRD 87
++I+NR + S V + + NV L+ V N + +Q ++W +
Sbjct: 13 IVIFNRPTRVDSEQMVPLFRQLAAMNDINVVLNGPVRTMNRTRTEKEQL----IESEWAN 68
Query: 88 R--RPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYY----FLRYGDNYRPHL 141
R ++Y H ++DF+ FG K +P++I++++ P+DR+++ +Y +++ R +
Sbjct: 69 ELERGSIYMAHSNWLDFESFGFK-KPIYISLVKDPIDRMITDFYKRRSWVKRAIYRRMYP 127
Query: 142 VRKKHGD---KTTFDECIRLNRTEC---------SLENMWLQVPFLCGHAAACWVPGNPW 189
R++ D + +F+EC+R EC +++ Q + CG+ A C +
Sbjct: 128 GRRERPDEWYQQSFNECVRSRSPECLFVQHAVADPIQDFKRQSLYFCGNEADCLPFNSHH 187
Query: 190 ALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFF 226
A + AK + +Y +VG EE +++LE +P FF
Sbjct: 188 ATQIAKRRVEKEYSVVGTWEERNITLTVLEKYVPRFF 224
>gi|198463773|ref|XP_002135577.1| GA28239 [Drosophila pseudoobscura pseudoobscura]
gi|198151405|gb|EDY74204.1| GA28239 [Drosophila pseudoobscura pseudoobscura]
Length = 254
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 45/153 (29%), Positives = 77/153 (50%), Gaps = 19/153 (12%)
Query: 91 ALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKHGDK- 149
LY H ++DF+ FG K +P++++++R P+DR+V YY R+ R R G +
Sbjct: 38 TLYIAHSNWLDFKSFGYK-KPIYLSMVRDPIDRVVHDYY-KRHSRTKRQIYRRMFPGQRE 95
Query: 150 -------TTFDECIRLNRTECS---------LENMWLQVPFLCGHAAACWVPGNPWALEK 193
+F++C+R EC +E+ Q F CG+ C +P A+++
Sbjct: 96 RPEEWYLQSFNQCVRNGSPECQFIQHSVTDYVEDFKRQSLFFCGNHLNCLPFNSPHAVQE 155
Query: 194 AKENLVTKYLLVGVTEELTDFVSLLEAALPSFF 226
AK + +Y +VG EE +++LE +P FF
Sbjct: 156 AKSRVEKEYSVVGTWEEKNITLTVLEKYVPRFF 188
>gi|194751654|ref|XP_001958140.1| GF10770 [Drosophila ananassae]
gi|190625422|gb|EDV40946.1| GF10770 [Drosophila ananassae]
Length = 290
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 40/152 (26%), Positives = 77/152 (50%), Gaps = 17/152 (11%)
Query: 91 ALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLR-------YGDNYRPHLVR 143
++Y H ++DF FG K +P++ +++R P+DR+V+ YY R Y Y + +
Sbjct: 74 SIYMAHSNWLDFNGFGYK-KPIYASLVRDPVDRMVADYYKRRSWTKRMIYRKMYPGRIEK 132
Query: 144 KKHGDKTTFDECIRLNRTECS---------LENMWLQVPFLCGHAAACWVPGNPWALEKA 194
+ K +F++C+R EC +++ Q + CG+ C +P A++ A
Sbjct: 133 PEKWYKQSFNQCVRSGDPECRYIQYSIKDYIDDFKRQSLYFCGNNPDCLPFNSPHAIQMA 192
Query: 195 KENLVTKYLLVGVTEELTDFVSLLEAALPSFF 226
K+ + +Y +VG EE +++ E +P +F
Sbjct: 193 KQRVEKEYSVVGTWEERNITLTVFEKYIPKYF 224
>gi|195128099|ref|XP_002008503.1| GI13538 [Drosophila mojavensis]
gi|193920112|gb|EDW18979.1| GI13538 [Drosophila mojavensis]
Length = 316
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 67/294 (22%), Positives = 130/294 (44%), Gaps = 39/294 (13%)
Query: 30 IIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNH-------VLSLADQYRFVNNV 82
I++NR+ K GS S + LH VT N V + ++ F +
Sbjct: 31 ILFNRLEKVGSQSMTKLL------GHLGELHGYVTYRNEIPPAKKIVYNYEEEKAFAEEL 84
Query: 83 TKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLV 142
+ P+ Y H +I+F + +P++IN++R P+ +++S YY+ R+ Y L+
Sbjct: 85 LE--LEEPSAYVEHTNWINFTEHDMP-RPIYINLVRHPIQKVISAYYYQRHPMIYANSLL 141
Query: 143 R-------KKHGDKTTFDECIRLN-RTECSLE------NMWLQVPF-LCGHAAACWVPGN 187
R KK + +F++C+R EC + W + LCG+ C +
Sbjct: 142 RNPNKPTEKKEFFERSFNDCVRQRIAPECVFDPHLPYNGDWRRFTLHLCGNQNVCTHFNS 201
Query: 188 PWALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRK 247
A + AK ++ +Y +VG E+ +++LEA +P FF T+ + + + + +
Sbjct: 202 EMATQIAKLHVEKEYAVVGSWEDTNITLAVLEAYIPRFFADATNQYYSHQEKFMINSTPH 261
Query: 248 IDPSEETVQQIKKSKIWELENELYEYALEQFHFVKKHNLVYNKVLGYEADKGKQ 301
+E V+ K + + Y +E +HF K+ + +A++G++
Sbjct: 262 DSHLDEDVEAYLKQQ--------FAYEIELYHFCKQRLYKQYIAIRKQAEQGQE 307
>gi|383857737|ref|XP_003704360.1| PREDICTED: heparan sulfate 2-O-sulfotransferase 1-like [Megachile
rotundata]
Length = 300
Score = 67.8 bits (164), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 63/252 (25%), Positives = 110/252 (43%), Gaps = 48/252 (19%)
Query: 29 VIIYNRVPKTGSTSFVNMAYDMCRKKRFNVL-HVNVT-GNNHVLSLADQYRFVNNVTKW- 85
+++ RVP G+ S V + + R + +N H+ + G++ +LS Q V VT
Sbjct: 74 ILMLTRVPDAGAESLVLI---LQRLQGYNAFKHIRLPPGDHKLLSTLQQELLVEEVTNII 130
Query: 86 -RDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRK 144
++ P + G F++F +FG + P FI+++R P+ Y R+
Sbjct: 131 RQEAIPLSFDGDVRFLNFSKFG-RPAPTFISLVRNPMGTRTLQRY-------------RE 176
Query: 145 KHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLL 204
KH F+ +P CG C N WAL++AK N+V Y +
Sbjct: 177 KHN---VFE---------------GRAIPTFCGQDPRCTEINNKWALQRAKANIVEWYPV 218
Query: 205 VGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEETVQQIKKS--- 261
+G+ + + + +LE P FFRG + RTN+ T++ ++
Sbjct: 219 IGILDCMEQSIDVLEYKFPYFFRGARQIYKKI------RTNKNFSDYTITLKPRERDILF 272
Query: 262 KIWELENELYEY 273
K++E E +LYE+
Sbjct: 273 KLFEDEIKLYEW 284
>gi|28574847|ref|NP_788529.1| pipe, isoform J [Drosophila melanogaster]
gi|28380470|gb|AAO41225.1| pipe, isoform J [Drosophila melanogaster]
Length = 401
Score = 67.4 bits (163), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 42/152 (27%), Positives = 76/152 (50%), Gaps = 17/152 (11%)
Query: 91 ALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLR-------YGDNYRPHLVR 143
++Y H ++DF FG K +P++I++++ P+DR+++ +Y R Y Y R
Sbjct: 185 SIYMAHSNWLDFASFGFK-KPIYISLVKDPIDRMITDFYKRRSRVKRAIYRRMYPGRRER 243
Query: 144 KKHGDKTTFDECIRLNRTEC---------SLENMWLQVPFLCGHAAACWVPGNPWALEKA 194
+ +F+EC+R EC +++ Q + CG+AA C + A + A
Sbjct: 244 PDEWYQLSFNECVRNRSPECLFVQHAVADYIQDFKRQTLYFCGNAADCLPFNSHHATQVA 303
Query: 195 KENLVTKYLLVGVTEELTDFVSLLEAALPSFF 226
K + +Y +VG EE +++LE +P FF
Sbjct: 304 KRRVEKEYSVVGTWEERNITLTVLEKYVPRFF 335
>gi|195128107|ref|XP_002008507.1| GI13542 [Drosophila mojavensis]
gi|193920116|gb|EDW18983.1| GI13542 [Drosophila mojavensis]
Length = 301
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 53/226 (23%), Positives = 104/226 (46%), Gaps = 28/226 (12%)
Query: 29 VIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWRDR 88
V+ +NR + G+ + + + + N++ ++ A Q R R
Sbjct: 28 VVFFNRPTRVGTELMLPLLSLLSKHNDVNLVLKGPVRKKSLMRTAKQERIETRFVS-RLE 86
Query: 89 RPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRK---- 144
+LY H +IDF ++ ++ +P++I+++R P++R++ YY R + ++ +
Sbjct: 87 NGSLYVAHGNWIDFAEY-NRRKPIYISLVRDPVERMLDNYYQQR---TLKKQIISRNIYP 142
Query: 145 ---KHGD---KTTFDECIRLNRTECS---------LENMWLQVPFLCGHAAACWVPGNPW 189
+H D K +F EC+R EC +E+ Q F CG+ C +
Sbjct: 143 AYPQHPDAWYKQSFSECVRRASPECQYIEYSMRDEVEDFKRQSLFFCGNDIDCLPFNTRY 202
Query: 190 ALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLT 235
++KAK N+ +Y +VG E+ +++LE +P +F +H LT
Sbjct: 203 GVQKAKRNVEKEYSVVGTWEQPNITLTVLEKYVPRYF----NHALT 244
>gi|194751648|ref|XP_001958137.1| GF10767 [Drosophila ananassae]
gi|190625419|gb|EDV40943.1| GF10767 [Drosophila ananassae]
Length = 301
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 55/234 (23%), Positives = 105/234 (44%), Gaps = 26/234 (11%)
Query: 30 IIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQ-YRFVNNVTKWRDR 88
I YNR+ KTGS S + ++ ++ F + + D+ V +++ +
Sbjct: 27 IFYNRLEKTGSQSMTRLINNLGKRNGFETFRNVIRPWRPITDDKDEELDLVEQMSELPE- 85
Query: 89 RPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVR----- 143
P Y H +++F + +P++IN++R P+ +++S YY+ R+ + L+R
Sbjct: 86 -PGAYVEHANYVNFTE-HDMPRPIYINMVRDPIQKVISAYYYQRHPLIFAQSLMRNPKKR 143
Query: 144 --KKHGDKTTFDECIR-------LNRTECSLENMWLQVPF-LCGHAAACWVPGNPWALEK 193
K T+F++C+R + + W + LCG+ C + +
Sbjct: 144 MQSKQFFDTSFNDCVRQRIPPYCVFDSHNPFNGDWRRFSLHLCGNKEICTHFNSETTTQL 203
Query: 194 AKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGG-------TDHFLTSNKSH 240
AK N+ +Y +VG E+ +++LEA +P FF T+ F +N SH
Sbjct: 204 AKMNIEREYSVVGSWEDTNVTLAVLEAYIPRFFTKARQVYYNKTEKFTINNVSH 257
>gi|380022533|ref|XP_003695097.1| PREDICTED: heparan sulfate 2-O-sulfotransferase 1-like [Apis
florea]
Length = 258
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 53/202 (26%), Positives = 87/202 (43%), Gaps = 38/202 (18%)
Query: 29 VIIYNRVPKTGSTSFVNMAYDMCRKKRFNVL-HVNVTGNNH-VLSLADQYRFVNNVTKW- 85
+++ R P G+ FV M + R + +N H+ + +H +LS + V +T
Sbjct: 77 ILMLTRTPDAGAELFVLM---LQRLQGYNAFKHIRLPPGDHGLLSTLQEELLVEEITNII 133
Query: 86 -RDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRK 144
++ P + G F++F +FG +E P FI+++R P V FLRY + R L +
Sbjct: 134 RQEAIPLSFDGDVRFLNFSKFG-RESPSFISLVRNP----VGVQNFLRYHERRRNMLESR 188
Query: 145 KHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLL 204
+P CG C N WAL++AK N+V Y +
Sbjct: 189 A--------------------------IPIFCGQDPRCSEINNKWALQRAKANVVEWYPV 222
Query: 205 VGVTEELTDFVSLLEAALPSFF 226
VG+ E + + +LE P F
Sbjct: 223 VGILEYMEQSIDILEYKFPYSF 244
>gi|332019261|gb|EGI59770.1| Uronyl 2-sulfotransferase [Acromyrmex echinatior]
Length = 308
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 53/217 (24%), Positives = 94/217 (43%), Gaps = 41/217 (18%)
Query: 29 VIIYNRVPKTGSTSFVNMAYDMCRKKRFNVL-HVNVT-GNNHVLSLADQYRFVNNVTKW- 85
V++ R+P G V + + R + +N H+ + G+N +LS + Q + +T
Sbjct: 82 VLMLTRIPGAGGELMVLI---LQRLQGYNAFKHIRLPPGDNGLLSSSQQELLIEEITSII 138
Query: 86 -RDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRK 144
++ P + G F++F +FG ++ P FI+++R PLD LR Y
Sbjct: 139 RQEAIPLTFDGDVRFLNFSEFG-RQGPTFISLVRDPLD--------LRIWQKY------- 182
Query: 145 KHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLL 204
K G++ I +P+ CG C WA+E+AK N++ Y +
Sbjct: 183 KKGEEGMHYYGI---------------IPYFCGQDPRCVKQNKTWAMERAKANVIRWYPV 227
Query: 205 VGVTEELTDFVSLLEAALPSFFRGGT---DHFLTSNK 238
VG+ + + + ++ P FF+ DHF K
Sbjct: 228 VGILDYMEESLNAFAGEFPYFFKDAIRIYDHFRPKEK 264
>gi|350426602|ref|XP_003494487.1| PREDICTED: heparan sulfate 2-O-sulfotransferase 1-like [Bombus
impatiens]
Length = 302
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 59/226 (26%), Positives = 97/226 (42%), Gaps = 51/226 (22%)
Query: 29 VIIYNRVPKTGSTSFVNMAYDMCRKKRFNVL-HVNVTGNNH-VLSLADQYRFVNNVTKW- 85
V++ RVP G+ V + + R + +N H+ + +H +LS + V +T
Sbjct: 77 VLMLTRVPDAGAELLVLI---LQRLQGYNAFKHIRLPPGDHGLLSTLQEELLVEEITSII 133
Query: 86 -RDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRK 144
++ P + G F++F +FG +E P FI+++R P+ N R + R+
Sbjct: 134 RQEAIPLSFDGDVRFLNFSKFG-RESPTFISLVRNPM-----------CAHNLRRY--RE 179
Query: 145 KHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLL 204
+ D L+R +P CG C N WALE+AK N+V Y +
Sbjct: 180 RRSDM--------LHRA----------IPTFCGQDPRCAKINNKWALERAKANIVEWYPV 221
Query: 205 VGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDP 250
V + + + +LE P FFRG +R+ RKI P
Sbjct: 222 VCILNYMEQSIDVLEYKFPYFFRGA------------KRSYRKIQP 255
>gi|195352357|ref|XP_002042679.1| GM14879 [Drosophila sechellia]
gi|194124563|gb|EDW46606.1| GM14879 [Drosophila sechellia]
Length = 264
Score = 64.3 bits (155), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 40/152 (26%), Positives = 75/152 (49%), Gaps = 17/152 (11%)
Query: 91 ALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLR-------YGDNYRPHLVR 143
++Y H ++DF +G K +P++I++++ P+DR+++ +Y R Y Y R
Sbjct: 48 SIYMAHSNWLDFASYGFK-KPIYISLVKDPIDRMITDFYKRRCRVKRAIYRRMYPGRRER 106
Query: 144 KKHGDKTTFDECIRLNRTEC---------SLENMWLQVPFLCGHAAACWVPGNPWALEKA 194
+ +F+EC+R EC +++ Q + CG+ A C + A + A
Sbjct: 107 PDEWYQLSFNECVRSRSPECLFVQHAVADYIQDFKRQTLYFCGNEADCLPFNSHHATQVA 166
Query: 195 KENLVTKYLLVGVTEELTDFVSLLEAALPSFF 226
K + +Y +VG EE +++LE +P FF
Sbjct: 167 KRRVEKEYSVVGTWEERNITLTVLEKYVPRFF 198
>gi|195352367|ref|XP_002042684.1| GM14873 [Drosophila sechellia]
gi|195354142|ref|XP_002043559.1| GM19214 [Drosophila sechellia]
gi|194124568|gb|EDW46611.1| GM14873 [Drosophila sechellia]
gi|194127727|gb|EDW49770.1| GM19214 [Drosophila sechellia]
Length = 260
Score = 63.9 bits (154), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 45/165 (27%), Positives = 85/165 (51%), Gaps = 18/165 (10%)
Query: 91 ALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVR------- 143
A+Y H +++F Q+ S +P++IN++R P+ +++S YY+ R+ + L+R
Sbjct: 48 AVYVEHANWVNFTQYDSP-RPIYINMVRHPIQKVISAYYYQRHPLIFAQSLMRNPNKPMQ 106
Query: 144 -KKHGDKTTFDECIRLNRT-ECSLE------NMWLQVPF-LCGHAAACWVPGNPWALEKA 194
KK D T F++C+R + C + W + LCG++ C + + A
Sbjct: 107 TKKFFD-TNFNDCVRKRVSPHCVFDAHNPFNGDWRRFSLHLCGNSEICTHFNSETTTQIA 165
Query: 195 KENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKS 239
K N+ +Y +VG E+ +++LEA +P FF T + ++ K+
Sbjct: 166 KMNVEREYAVVGSWEDTNVTLAVLEAYIPRFFTDATKVYYSNTKN 210
>gi|390366614|ref|XP_003731078.1| PREDICTED: heparan sulfate 2-O-sulfotransferase 1-like
[Strongylocentrotus purpuratus]
Length = 134
Score = 63.9 bits (154), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 37/85 (43%), Positives = 51/85 (60%), Gaps = 7/85 (8%)
Query: 6 SHQIHISSAKSPSP-ETDSLSWDT----VIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLH 60
SH H+ K P+P D L +IIYNRVPKTGSTSF ++ Y +C +++VLH
Sbjct: 52 SHMRHLQ--KVPTPLVADELYQQKPMYPLIIYNRVPKTGSTSFTSLPYTLCETLKYHVLH 109
Query: 61 VNVTGNNHVLSLADQYRFVNNVTKW 85
V + VLS+ DQ FV++V K+
Sbjct: 110 VCTEYHLQVLSVQDQVSFVDHVNKF 134
>gi|195377463|ref|XP_002047509.1| GJ11898 [Drosophila virilis]
gi|194154667|gb|EDW69851.1| GJ11898 [Drosophila virilis]
Length = 316
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 48/208 (23%), Positives = 100/208 (48%), Gaps = 23/208 (11%)
Query: 90 PALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKHGDK 149
P++Y H +I+F +P++IN++R P+ +++S YY+ R+ Y L+R + K
Sbjct: 102 PSVYVEHTNWINFTA-HDMPRPIYINLVRHPIQKVMSAYYYHRHPVIYANSLLRNPNKPK 160
Query: 150 T-------TFDECIRLN-RTECSLENMWLQVPF----------LCGHAAACWVPGNPWAL 191
+F++C+R +C + +P+ LCG+ C + A+
Sbjct: 161 QNKEFFDRSFNDCVRQRIAPDCVFDP---HIPYNKDWRRFSLHLCGNQNVCVNFNSEMAM 217
Query: 192 EKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPS 251
+ AK ++ +Y +VG E+ +++LEA +P FF T+ + + + + +
Sbjct: 218 QIAKLHVEKEYAVVGSWEDTNITLAVLEAYIPRFFADATNQYYSHREKFMINATPHDNHL 277
Query: 252 EETVQQIKKSKIWELENELYEYALEQFH 279
+E V+ K + + E ELY + ++ +
Sbjct: 278 DEDVEAYLKQQ-FAYEIELYNFCKQRLY 304
>gi|313234610|emb|CBY10565.1| unnamed protein product [Oikopleura dioica]
Length = 164
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 41/138 (29%), Positives = 63/138 (45%), Gaps = 24/138 (17%)
Query: 92 LYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYG----DNYRPHLVRKKHG 147
L+ H +++ G E+ FIN++R P+ R S YYF R+G R +
Sbjct: 19 LFLKHHHWLNMTDLGL-EKATFINVVRDPITRFASRYYFNRFGWGLSSGARRQTWKTDKE 77
Query: 148 DKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACW-------------VPGNPW----- 189
T DEC+ EC +E++ + V +LCG AAC V W
Sbjct: 78 KDQTLDECVENGSEEC-IESLQVMVQYLCGTEAACGTKEGDGIEHDDGEVRRTDWTKTAR 136
Query: 190 ALEKAKENLVTKYLLVGV 207
A EKAK N+++ Y ++G+
Sbjct: 137 ATEKAKHNILSDYYMIGI 154
>gi|313241084|emb|CBY33383.1| unnamed protein product [Oikopleura dioica]
Length = 709
Score = 60.5 bits (145), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 43/160 (26%), Positives = 76/160 (47%), Gaps = 13/160 (8%)
Query: 30 IIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWRDRR 89
I +N++PK+GS++ + + K F H V + + R VN+ K R +
Sbjct: 104 IFHNKLPKSGSSTMKYILKVLSDKNDFFFDHYRVKQ----CDIDNNQRLVNHAAKLRRQH 159
Query: 90 PA---LYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYG---DNYRPHLVR 143
P + H +++F G QP FIN++R P+ + S YYF RYG + + +
Sbjct: 160 PEKKIVLLKHHTWVNFTHRGYP-QPQFINVVRHPVTQFKSRYYFSRYGWGLEKGKRKTFK 218
Query: 144 KKHGD-KTTFDECIRLNRTECSLENMWLQVPFLCGHAAAC 182
H D K + D+C+ ++EC ++ + + + CG C
Sbjct: 219 GSHNDRKRSLDDCVAEGQSEC-MDAVGVFNKYFCGTEPVC 257
>gi|443731210|gb|ELU16443.1| hypothetical protein CAPTEDRAFT_185201, partial [Capitella teleta]
Length = 147
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 43/141 (30%), Positives = 67/141 (47%), Gaps = 13/141 (9%)
Query: 151 TFDECIRLNRTECSLENMWLQV-PFLCGHAAACWVPGNPW-ALEKAKENLVTKYLLVGVT 208
T++EC+ + C + L + + CG + C NP +L +AK N++ Y +VGV
Sbjct: 1 TYEECVNQGYSICVANKVLLNLLAYFCGQDSVC--TENPAVSLARAKRNIIKHYSIVGVM 58
Query: 209 EELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLR--RTNRKIDPSEETVQQIKKSKIWEL 266
E+L F LE P FF+G D FL + L + + K P + TV ++K K+ E
Sbjct: 59 EDLEGFFYTLEKKFPGFFKGAQDVFLEHERGLLSKFKNSGKEYPPQYTVDIMRK-KLAE- 116
Query: 267 ENELYEYALEQFHFVKKHNLV 287
Y QF + NL+
Sbjct: 117 -----SYDFYQFVMQRHQNLM 132
>gi|323136903|ref|ZP_08071983.1| hypothetical protein Met49242DRAFT_1370 [Methylocystis sp. ATCC
49242]
gi|322397664|gb|EFY00186.1| hypothetical protein Met49242DRAFT_1370 [Methylocystis sp. ATCC
49242]
Length = 241
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 52/206 (25%), Positives = 94/206 (45%), Gaps = 34/206 (16%)
Query: 27 DTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWR 86
D++I++N +P+T +S ++ ++D K R N N SL D+ +F +
Sbjct: 3 DSLIVFNHIPRTSGSS-IHASFDEALKSR-NFFVFNG-------SLEDEQKFA---AAFN 50
Query: 87 DRRPALYH--GHFGFIDFQQFGS-KEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVR 143
DR +++ GH GF Q+ G ++ L +I+R P++R++S YY +R +++PH+
Sbjct: 51 DRGAGVFYTGGHIGFQRLQKLGLLNDENLLFSIVRDPVERMLSLYYLMRRSPDWQPHIA- 109
Query: 144 KKHGDKT---TFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVT 200
GD+ +D C + F G+ A C + E A+E +
Sbjct: 110 AHVGDRDFAYYYDFC--------------REKGFHTGN-AQCRAIAGVESFEAARERVSQ 154
Query: 201 KYLLVGVTEELTDFVSLLEAALPSFF 226
Y LVG + + LE + +F
Sbjct: 155 HYSLVGCLSHVAMTYNALEVIVRNFL 180
>gi|389613426|dbj|BAM20062.1| heparan sulphate 2-o-sulfotransferase pipe, partial [Papilio
xuthus]
Length = 261
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 32/106 (30%), Positives = 60/106 (56%), Gaps = 6/106 (5%)
Query: 29 VIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLA--DQYRFVNNVTKWR 86
++ +NRVPK GS +F+ + + + +F H + + LA DQ V+ V+
Sbjct: 137 LLFFNRVPKVGSQTFMELLRRLAIRNQFG-FHRDAVQRVETIRLAPADQQVLVSVVSAHT 195
Query: 87 DRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLR 132
PA Y H + +F +FG P+++N++R P++R++S+YY++R
Sbjct: 196 P--PASYIKHVCYTNFTRFGYP-SPIYVNVVRDPVERVISWYYYVR 238
>gi|307196978|gb|EFN78353.1| Heparan sulfate 2-O-sulfotransferase 1 [Harpegnathos saltator]
Length = 225
Score = 58.5 bits (140), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 49/202 (24%), Positives = 86/202 (42%), Gaps = 40/202 (19%)
Query: 31 IYNRVPKTGSTSFVNMAYDMCRKKRFNVL-HVNVTGNNH-VLSLADQYRFVNNVTKW--R 86
+ R+P G+ V + + R + +N H+ + +H +LS Q V +T +
Sbjct: 1 MLTRIPGAGAELMVLI---LQRLQGYNAFKHIRLPAGDHGLLSTLQQELLVEEMTSIIKQ 57
Query: 87 DRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRK-K 145
+ P + G F++F +FG ++ P FI+++R PLD P + R+
Sbjct: 58 EAIPLSFDGDVRFLNFSEFG-RQGPSFISLVRDPLD----------------PRIWRRYS 100
Query: 146 HGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLV 205
G + F +P CG C N WAL +AK N+V Y +V
Sbjct: 101 KGKERVFYRG---------------AIPHFCGQEPRCTERNNTWALARAKANVVRWYPVV 145
Query: 206 GVTEELTDFVSLLEAALPSFFR 227
G+ + + + ++ L P FF+
Sbjct: 146 GILDYMEESLNALALEFPYFFK 167
>gi|15615933|ref|NP_244237.1| hypothetical protein BH3371 [Bacillus halodurans C-125]
gi|10175994|dbj|BAB07090.1| BH3371 [Bacillus halodurans C-125]
Length = 255
Score = 57.8 bits (138), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 62/253 (24%), Positives = 112/253 (44%), Gaps = 39/253 (15%)
Query: 29 VIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTK--WR 86
++I+ +PKTG + N+ R + H+L L + + N + K
Sbjct: 10 LVIFMHIPKTGGITLRNILDQQYR-------------SEHILRLPQKNKLDNLLQKKGAN 56
Query: 87 DRRPALYHGHFGFIDFQQFGSKEQPL-FINILRKPLDRLVSYYYFLRYGDNYRPHLVRKK 145
++ +GH F + F +QP +I +LR P++R++S YYF+ + R H K
Sbjct: 57 IKKLQCVYGHHRFGVHEYF---QQPFTYITMLRHPVERIISTYYFILQNERNRMHQKVK- 112
Query: 146 HGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLV 205
TF++ ++ + LQVP L H L KA +++ T + +V
Sbjct: 113 ---PLTFEQFVQSTDPD-------LQVP-LTNHQTRYLSGERKPNLNKALQHMDTHFSVV 161
Query: 206 GVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEETVQQIKKSKIWE 265
G+TE + + +++ L G DH K+ ++ R+ D +T++ IK+ +
Sbjct: 162 GITELYNESLFIMKKKL------GWDHISYQKKNVTKKRKRQTDIGLDTIEIIKRKNPLD 215
Query: 266 LENELYEYALEQF 278
L LYE A E+
Sbjct: 216 L--HLYETAKEKL 226
>gi|195022658|ref|XP_001985615.1| GH14413 [Drosophila grimshawi]
gi|193899097|gb|EDV97963.1| GH14413 [Drosophila grimshawi]
Length = 227
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 39/164 (23%), Positives = 77/164 (46%), Gaps = 23/164 (14%)
Query: 83 TKWRDR--RPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYY---------FL 131
T+W + + +Y +++F ++ E+P++I+++R P+DR++ +Y
Sbjct: 4 TQWVSQLEKGTVYIARGNWMNFDEY-QIEKPIYISLVRDPVDRIIHNFYEQRTTKKKAIS 62
Query: 132 RYGDNYRPHLVRKKHGDKTTFDECIRLNRTECS---------LENMWLQVPFLCGHAAAC 182
R D P + K +F++C+R EC + + Q F CG+ C
Sbjct: 63 RSIDANYPQ--QSNEWYKQSFNDCVRSGNPECQYIKYSVVDRVPDFRRQSLFFCGNHVDC 120
Query: 183 WVPGNPWALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFF 226
P A++ AK N+ +Y +VG E+ +++LE +P +F
Sbjct: 121 LPFNTPHAVQVAKRNVEVEYAVVGTWEQANLTLTVLEKYVPRYF 164
>gi|307166685|gb|EFN60682.1| Uronyl 2-sulfotransferase [Camponotus floridanus]
Length = 222
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 48/205 (23%), Positives = 82/205 (40%), Gaps = 40/205 (19%)
Query: 31 IYNRVPKTGSTSFVNMAYDMCRKKRFNVL-HVNV-TGNNHVLSLADQYRFVNNVTKW--R 86
+ R+P TG+ M + R + +N H+ + T + LS Q V V +
Sbjct: 1 MLTRIPGTGAEL---MVLILQRLQGYNAFKHIRLPTSDYGFLSALQQELLVEEVINIIRQ 57
Query: 87 DRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKH 146
+ P + G F++F FG ++ P FI+++R PLD P + + K
Sbjct: 58 EAIPLTFDGDVKFLNFSAFG-RQGPTFISLVRDPLD----------------PRIWQWKG 100
Query: 147 GDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVG 206
+ + I CG C N WALE+AK N++ Y +VG
Sbjct: 101 KESMLYHGAI----------------SHFCGQEPRCMERNNTWALERAKANVIRWYPVVG 144
Query: 207 VTEELTDFVSLLEAALPSFFRGGTD 231
+ + + + ++ P FF G +
Sbjct: 145 ILDYMEESLNAFATEFPYFFNGAIN 169
>gi|444706738|gb|ELW48061.1| Uronyl 2-sulfotransferase [Tupaia chinensis]
Length = 339
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 39/111 (35%), Positives = 59/111 (53%), Gaps = 1/111 (0%)
Query: 149 KTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVGVT 208
+ +ECI N ECS ++ +P+ CG C PG WALE+AK N+ +LLVG+
Sbjct: 55 RADINECILENYPECSNPRLFYIIPYFCGQHPRCREPGE-WALERAKLNVNENFLLVGIL 113
Query: 209 EELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEETVQQIK 259
EEL D + LLE LP +F+G + + N + +P E +++ K
Sbjct: 114 EELEDVLLLLERFLPHYFKGVLSIYKDPACCSSEQMNPRREPGEWALERAK 164
>gi|156389052|ref|XP_001634806.1| predicted protein [Nematostella vectensis]
gi|156221893|gb|EDO42743.1| predicted protein [Nematostella vectensis]
Length = 141
Score = 55.1 bits (131), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 43/140 (30%), Positives = 65/140 (46%), Gaps = 12/140 (8%)
Query: 94 HGHFGFIDFQQFGSKEQPL-----FINILRKPLDRLVSYYYFLRYGDNYRPHLVRK--KH 146
H F FI F S+ + L +IN +R P+ R++S+Y++L + +RK K
Sbjct: 4 HTRFVFIAHFYFRSRLKRLHYSHTYINQVRDPVKRVISHYFYLHRSQERPLNRIRKMKKS 63
Query: 147 G-DKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLV 205
G T +EC+ C L F CG + C G+ AL KAK N+ Y V
Sbjct: 64 GFINETLEECLAKQHPGCESN---LMTRFFCGKHSFCR-SGSNKALSKAKHNISRYYASV 119
Query: 206 GVTEELTDFVSLLEAALPSF 225
G+ E + ++ +L LP F
Sbjct: 120 GLLEHFSLYLRVLNKRLPEF 139
>gi|321472524|gb|EFX83494.1| hypothetical protein DAPPUDRAFT_315763 [Daphnia pulex]
Length = 465
Score = 54.7 bits (130), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 50/185 (27%), Positives = 85/185 (45%), Gaps = 37/185 (20%)
Query: 76 YRFVN--------NVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSY 127
YRFVN ++ + A H H FI+FQ++G K P+ I+I+ +P+ R + +
Sbjct: 223 YRFVNRNESRDIASIVGAVPIKLAYLHNH-NFINFQRYGLK-SPIQISIVCEPIARKLRH 280
Query: 128 YYFLRYGDNYRPHL----VRKKHGDKTTFDECIRLNRTECSLENM--------------W 169
+Y RY NYR + + + + T +ECI + EC+ + + W
Sbjct: 281 FYTERY--NYREWAEDFPISRTNWWRVTLEECITKKQRECTYDGLKNIKAKSVLDIGRTW 338
Query: 170 L------QVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVGVTEELTDFVSLLEAALP 223
L +P C + C G L K+ + +Y +VG E++ + + LLE +P
Sbjct: 339 LPRRVEWTIPQFCEYNR-CEELGVEKDLGYVKDVVNQEYTVVGTLEKMGETLDLLETTVP 397
Query: 224 SFFRG 228
FF+G
Sbjct: 398 QFFKG 402
>gi|390333200|ref|XP_003723659.1| PREDICTED: uncharacterized protein LOC100888698 [Strongylocentrotus
purpuratus]
Length = 286
Score = 54.7 bits (130), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 30/107 (28%), Positives = 62/107 (57%), Gaps = 6/107 (5%)
Query: 30 IIYNRVPKTGSTSFVNMAYDMCRKKRFN-VLHVNVTGNNHVLSLADQYRFVNNVTKWRDR 88
+++ R+PK S +F+ + + + F ++ ++H A + R + + T +
Sbjct: 150 VMFVRMPKCASRTFIWTSVRLRQVHHFGKQVNFEYMLDDHP---ALKIRNLMDTTLQKAE 206
Query: 89 RPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGD 135
+ A+ HGH+ +ID +GS ++P+ +++LR P+DR S++YF+R GD
Sbjct: 207 KGAIIHGHYRYIDM--YGSPKRPILVSMLRDPVDRFESHFYFMRNGD 251
>gi|221122335|ref|XP_002161820.1| PREDICTED: heparan sulfate 2-O-sulfotransferase pipe-like [Hydra
magnipapillata]
Length = 234
Score = 54.3 bits (129), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 48/170 (28%), Positives = 80/170 (47%), Gaps = 15/170 (8%)
Query: 113 FINILRKPLDRLVSYYYFLRYGDNYRPH--LVRKKHGD--KTTFDECIRLNRTECSLENM 168
+IN++R P+DR++S+YY++R R L K+ G+ ++ FD CI+ C M
Sbjct: 61 YINLVRNPVDRVLSHYYYMRNEKLRREFRILELKQSGEFNESLFD-CIQNQHRGCEDNVM 119
Query: 169 WLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRG 228
F CG + C G+ AL+ AK N+ Y +VG E + F+ + LP FF
Sbjct: 120 ---TRFFCGPSHYC-KTGSFKALQTAKYNIEHHYAVVGTLENINLFIQVARLRLPIFF-- 173
Query: 229 GTDHFLTSNKSHLRRTNRKIDPSEETVQQIKKSKIWELENELYEYALEQF 278
+H + ++ S + I+ + ++ LYEYA +F
Sbjct: 174 --NHTYVNEIPKIKENKVTRSSSSALITLIRNRN--KADSLLYEYAKTRF 219
>gi|345313993|ref|XP_003429450.1| PREDICTED: uronyl 2-sulfotransferase-like, partial [Ornithorhynchus
anatinus]
Length = 162
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 34/80 (42%), Positives = 48/80 (60%), Gaps = 1/80 (1%)
Query: 148 DKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVGV 207
++ +ECI N ECS ++ +P+ CG C PG WALE+AK N+ +LLVG+
Sbjct: 21 EEKDINECILENYPECSNPRLFYIIPYFCGQHPRCREPGE-WALERAKLNVNENFLLVGI 79
Query: 208 TEELTDFVSLLEAALPSFFR 227
EEL D + LLE LP +F+
Sbjct: 80 LEELEDVLLLLERFLPHYFK 99
>gi|195173238|ref|XP_002027400.1| GL20903 [Drosophila persimilis]
gi|194113252|gb|EDW35295.1| GL20903 [Drosophila persimilis]
Length = 154
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 28/105 (26%), Positives = 53/105 (50%), Gaps = 2/105 (1%)
Query: 29 VIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVL-SLADQYRFVNNVTKWRD 87
++ +NRVPKTGS + + + + + F V S Q + + +
Sbjct: 22 ILFFNRVPKTGSETLIELMLRLGERNHFQNARSPFAKPTGVYWSFEKQKEEAHRILDLME 81
Query: 88 RRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLR 132
Y H +++F+QF QP++IN++R P++R++S+YY+ R
Sbjct: 82 EDAFAYAEHANYVNFRQFHLP-QPIYINLVRDPVERVISWYYYKR 125
>gi|110637271|ref|YP_677479.1| hypothetical protein CHU_0855 [Cytophaga hutchinsonii ATCC 33406]
gi|110279952|gb|ABG58138.1| conserved hypothetical protein [Cytophaga hutchinsonii ATCC 33406]
Length = 267
Score = 51.6 bits (122), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 53/212 (25%), Positives = 97/212 (45%), Gaps = 19/212 (8%)
Query: 66 NNHVLSLADQYRFVNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLV 125
N+ +L + D Y ++ + + + HGHF F + K +I LR P++RL+
Sbjct: 34 NSDLLGMDDTYLMLSQADEKIINKIKIIHGHFPFGLDRLLPQKST--YITFLRNPIERLI 91
Query: 126 SYYYFLRYGDNYRPHLVRKKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVP 185
S YY+ + D H + +F E + + +++N Q F+ G +
Sbjct: 92 SDYYYCK--DFALAH--NHSYASTMSFKEYLSCSDI-LNIDNG--QTRFVAGGENVPYGD 144
Query: 186 GNPWALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTN 245
+ L +A EN+ ++ VG+TE+ + + + A T +TS+KS+
Sbjct: 145 NSIEMLNRAIENIEKRFSFVGITEKFDESLLIANAVFSWNQYYYTSKNITSSKSY----- 199
Query: 246 RKIDPSEETVQQIKKSKIWELENELYEYALEQ 277
D EET + ++K +L +LYEYAL++
Sbjct: 200 ---DFDEETWELLRKRNFLDL--QLYEYALKK 226
>gi|167515554|ref|XP_001742118.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163778742|gb|EDQ92356.1| predicted protein [Monosiga brevicollis MX1]
Length = 324
Score = 51.2 bits (121), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 68/278 (24%), Positives = 112/278 (40%), Gaps = 46/278 (16%)
Query: 1 INTQKSHQIHISSAKSPSPETDSLSWDTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLH 60
+ T++ Q+ +PSP + +LS + + VPKT TS ++ D + N
Sbjct: 39 VVTRRLDQLKRHRVATPSP-SGALS---TLYFLHVPKTAGTSMLHAFLDAVATSQDNAAD 94
Query: 61 VNVTGNNHVLSLADQYRFVN--NVTKWRDRRPA-----LYHGHFGFIDFQQFGSKEQPLF 113
++ + L L++ + N + T+ RPA L H F + L
Sbjct: 95 RSILRCHRHLDLSNCFIVYNATSATQCTGGRPAACGYVLTHAGLDV----TFAMRRDVLA 150
Query: 114 INILRKPLDRLVSYYYFLRYGDNYRPHLVRKKHGDKTTFDECIRLNRTECSLENMWLQVP 173
+ ILR P RL+SYY +LR RP + + F R + + +L+N +
Sbjct: 151 VTILRDPASRLLSYYNYLR-----RP-------ANPSAFARWYRAS-PQRALDN--IMTG 195
Query: 174 FLCGHA--------AACWVPGNPWALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSF 225
+L G A A W P +L AK +L++ L G E+L F+ L P
Sbjct: 196 YLAGEAMGPAGALVAPVWHPITNASLGSAKRHLISAIDLWGFQEDLNPFLRWLAYLWP-- 253
Query: 226 FRGGTDHFLTSNKSHLRRTNRKIDPSEETVQQIKKSKI 263
H S H T+++ P E + I+ S +
Sbjct: 254 ------HHNASKHIHAMTTSKRYQPHEHGTRGIESSAM 285
>gi|281366423|ref|NP_001163467.1| pipe, isoform M [Drosophila melanogaster]
gi|272455237|gb|ACZ94738.1| pipe, isoform M [Drosophila melanogaster]
Length = 292
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 29/108 (26%), Positives = 54/108 (50%), Gaps = 17/108 (15%)
Query: 91 ALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLR-------YGDNYRPHLVR 143
++Y H ++DF FG K +P++I++++ P+DR+++ +Y R Y Y R
Sbjct: 185 SIYMAHSNWLDFASFGFK-KPIYISLVKDPIDRMITDFYKRRSRVKRAIYRRMYPGRRER 243
Query: 144 KKHGDKTTFDECIRLNRTEC---------SLENMWLQVPFLCGHAAAC 182
+ +F+EC+R EC +++ Q + CG+AA C
Sbjct: 244 PDEWYQLSFNECVRNRSPECLFVQHAVADYIQDFKRQTLYFCGNAADC 291
>gi|410729281|ref|ZP_11367361.1| Sulfotransferase family [Clostridium sp. Maddingley MBC34-26]
gi|410595835|gb|EKQ50524.1| Sulfotransferase family [Clostridium sp. Maddingley MBC34-26]
Length = 577
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 65/259 (25%), Positives = 118/259 (45%), Gaps = 40/259 (15%)
Query: 35 VPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWRDRRPALYH 94
+PK TS + D+ ++ V++ + + + ++Y F+
Sbjct: 10 IPKAAGTSLFKIYNDILGEENVKQF-VSINKGSRQMEVLNRYPFLG-------------- 54
Query: 95 GHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLR--YGDNYRPHLVRKKHGDKTTF 152
GH ++++ ++ S E I LR P++R +S Y++ + G++ +V K D ++
Sbjct: 55 GHTNYLEYLKYFS-EDRYSITFLRNPINRFLSQYFYYKNNVGESRETSVVNAKKLDLKSY 113
Query: 153 DECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVGVTEELT 212
E R + + N L LC + LE AKENL +K VG+ EE
Sbjct: 114 IEHYRHIQRYGDVFNRQL----LCFTGFQKSSLTDNELLEMAKENL-SKINFVGIFEEFN 168
Query: 213 DFVSLL--EAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSE---ETVQQIKKSKIWELE 267
D + LL + LP L +N + TN+K +E +T++ IK+ + +L+
Sbjct: 169 DSIDLLCYDCKLP----------LVNNIPIVNVTNKKPAYAEIDGDTLELIKE--LNDLD 216
Query: 268 NELYEYALEQFHFVKKHNL 286
++LYEYAL+ F+ K+ L
Sbjct: 217 SQLYEYALKLFNDKKRQIL 235
>gi|432112806|gb|ELK35404.1| Uronyl 2-sulfotransferase [Myotis davidii]
Length = 271
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 42/102 (41%), Positives = 55/102 (53%), Gaps = 6/102 (5%)
Query: 185 PGNPWALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRR- 243
PG WALE+AK N+ +LLVG+ EEL D + LLE LP +F+G + L
Sbjct: 127 PGE-WALERAKLNVNENFLLVGILEELEDVLLLLERFLPHYFKGVLSIYKDPEHRKLGNM 185
Query: 244 --TNRKIDPSEETVQQIKKSKIWELENELYEYALEQFHFVKK 283
T RK PS E VQ + + +E E Y Y EQFH +K+
Sbjct: 186 TVTVRKTVPSPEAVQILYQRMRYEY--EFYHYVKEQFHLLKR 225
>gi|428180340|gb|EKX49208.1| hypothetical protein GUITHDRAFT_136354 [Guillardia theta CCMP2712]
Length = 440
Score = 47.8 bits (112), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 38/134 (28%), Positives = 63/134 (47%), Gaps = 10/134 (7%)
Query: 113 FINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKHGDKTTFDECIRLNRTECSLE-NMW-- 169
+I ++R+PL RL + R D + + K F+EC+ ++ S + + W
Sbjct: 267 YITLIREPLARLQEQFEADREHDQQKG---KSKFVTNLNFEECLHVSVCRTSHQFSRWCN 323
Query: 170 LQVPFLCGHAAACWVPGNPWA----LEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSF 225
LQ + CG C N A L+KA N+ T +L VG+ E+ LLE LP++
Sbjct: 324 LQTRYFCGWGKDCLYDKNLNATEAMLKKALHNIDTLFLAVGIFEDFDLSHKLLETLLPTY 383
Query: 226 FRGGTDHFLTSNKS 239
F+G + +S +
Sbjct: 384 FQGLGEELKSSGGA 397
>gi|390357746|ref|XP_797456.3| PREDICTED: uronyl 2-sulfotransferase-like [Strongylocentrotus
purpuratus]
Length = 121
Score = 47.4 bits (111), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 30/106 (28%), Positives = 53/106 (50%), Gaps = 4/106 (3%)
Query: 175 LCGHAAACWVPGNPWALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFL 234
CG C WALE+AK N+ Y +G+TEE + +LE +P + G T+ +L
Sbjct: 6 FCGFNPKCGKDAT-WALEQAKSNIDKYYTFIGITEEYEASLRVLEHLMPDMYNGLTEFYL 64
Query: 235 TS-NKSHLRRTNRKIDPSEETVQQIKKSKI--WELENELYEYALEQ 277
+ NK++ T K + Q++K + + ++ ELY + ++
Sbjct: 65 SRINKTNSLTTLSKTANKKPLSQELKNTATERYAVDYELYNFIYDR 110
>gi|392427588|ref|YP_006468582.1| Sulfotransferase family [Desulfosporosinus acidiphilus SJ4]
gi|391357551|gb|AFM43250.1| Sulfotransferase family [Desulfosporosinus acidiphilus SJ4]
Length = 556
Score = 47.0 bits (110), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 71/260 (27%), Positives = 109/260 (41%), Gaps = 40/260 (15%)
Query: 35 VPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWRD---RRPA 91
+PKT TS N+ G V + D N+ K R R A
Sbjct: 9 IPKTAGTSLFT-------------FFRNILGEEQVYQVRDV-----NIGKQRAEAIRSFA 50
Query: 92 LYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRK--KHGDK 149
+ GH + Q + +E+ + LR+PL+R +S YYF R + + L K K D
Sbjct: 51 MVGGHLTYDQMQTYFEQER-YRLTFLRQPLERFLSMYYFYRQTEEVQRDLSVKMAKSMDL 109
Query: 150 TTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVGVTE 209
T+ + + L N +Q +L G L+ AKENL T VG+TE
Sbjct: 110 ATYINWLLDSEEYKHLRN--VQTWYLTGGLTTKRSTSLAERLDLAKENL-TSLDFVGITE 166
Query: 210 ELT---DFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEETVQQIKKSKIWEL 266
+++ DF+S L+ P R +N + R +ID + VQ+I+ +I L
Sbjct: 167 DMSDSLDFLS-LDCCWPLVER-----IPRNNVTASRPKIEEIDG--QLVQRIQ--EISSL 216
Query: 267 ENELYEYALEQFHFVKKHNL 286
+ ELY Y L+ + K+ L
Sbjct: 217 DMELYSYGLKLYKQKKRQLL 236
>gi|323455580|gb|EGB11448.1| hypothetical protein AURANDRAFT_61900 [Aureococcus anophagefferens]
Length = 1416
Score = 47.0 bits (110), Expect = 0.012, Method: Composition-based stats.
Identities = 38/121 (31%), Positives = 54/121 (44%), Gaps = 13/121 (10%)
Query: 113 FINILRKPLDRLVSYYYFLR-----YGDNYRPHLVRKKHGDKTTFDECIRLNRTECSLEN 167
F+ +LR+PL + S++YFLR G P+ D D + + N
Sbjct: 892 FVAVLREPLAWVASHFYFLRGEPPPVGLAGAPN-ATASLADAVLADPLLAYFSFFADVRN 950
Query: 168 MWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFR 227
Q +LCG A AC P +A E L+ + +VGV E+L + LE ALP F
Sbjct: 951 A--QARYLCGAAPACRQPH-----AQAPEELLDAFAVVGVLEDLRGTLLALERALPRIFA 1003
Query: 228 G 228
G
Sbjct: 1004 G 1004
>gi|167527015|ref|XP_001747840.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163773589|gb|EDQ87227.1| predicted protein [Monosiga brevicollis MX1]
Length = 254
Score = 46.6 bits (109), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 35/127 (27%), Positives = 57/127 (44%), Gaps = 6/127 (4%)
Query: 153 DECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVGVTEELT 212
DEC L R + L F CGH C + AL +AK+ L Y V V E +
Sbjct: 105 DECNYL-RELMDASSFNLMTTFFCGHGPECTMQTPQQALARAKQRLERDYAHVAVLERMP 163
Query: 213 DFVSLLEAALPSFFRGGTDHFLTSNKSHLRRT-NRKIDPSEETVQQIKKSKIWEL---EN 268
+ ++L E +P FFR FL+ ++ + + N+ ++ ++ I L +
Sbjct: 164 ESLALFELLMPQFFRNARS-FLSEHREYQQLNHNQAVEADRRRPTSATRAAIARLARYDM 222
Query: 269 ELYEYAL 275
ELY +A+
Sbjct: 223 ELYGFAV 229
>gi|406973498|gb|EKD96909.1| hypothetical protein ACD_23C01190G0004 [uncultured bacterium]
Length = 292
Score = 46.6 bits (109), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 45/183 (24%), Positives = 84/183 (45%), Gaps = 24/183 (13%)
Query: 30 IIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWRDRR 89
I++ +PKTG + +M ++ R + + N + ++Y+ + K +
Sbjct: 25 IVFIHIPKTGGMTLYSMIREIYRPSELHKI-------NPAVESVEKYKHLPQTRK--NSL 75
Query: 90 PALY-HGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKHGD 148
A+Y H +G + S ++ ++R P++R++S+YY++R N +R+
Sbjct: 76 KAIYGHMDYGLRELLPADSA----YVTLMRHPVERVISHYYYVRRTGN---DPLRELAMR 128
Query: 149 KTTFDECIRLNRTECSLENM-WLQVPFLCGHAAAC-WVPGNPWALEKAKENLVTKYLLVG 206
+ +D R C+LE M Q L G A + + AL +AK NL + LVG
Sbjct: 129 SSLYDWVAR-----CNLEEMDNGQTRRLSGMAQGIKFGECSAEALAQAKTNLARDFALVG 183
Query: 207 VTE 209
+TE
Sbjct: 184 ITE 186
>gi|323456630|gb|EGB12496.1| hypothetical protein AURANDRAFT_60414 [Aureococcus anophagefferens]
Length = 806
Score = 46.2 bits (108), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 37/131 (28%), Positives = 54/131 (41%), Gaps = 21/131 (16%)
Query: 113 FINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKHGDKTTFDECIRLNRTECSLENMWLQV 172
+I ++R+P+DR S +Y+ + + RK+ K E RLN T C
Sbjct: 102 WITLVREPVDRAQSLFYY------FVDPVSRKRQ--KALASERARLNDTACGCGGAEFDA 153
Query: 173 ----------PFLCG---HAAACWVPGNPWALEKAKENLVTKYLLVGVTEELTDFVSLLE 219
P C H A C ++ A L Y+ VG+TEE V++LE
Sbjct: 154 CVLLREANGCPLGCSTQQHEAFCAPGARNCSIADAVRALDDDYVFVGLTEEYDLSVAVLE 213
Query: 220 AALPSFFRGGT 230
LP FF G +
Sbjct: 214 RLLPQFFAGAS 224
>gi|294505756|ref|YP_003569816.1| hypothetical protein BMQ_pBM60053 [Bacillus megaterium QM B1551]
gi|294352162|gb|ADE72485.1| conserved hypothetical protein [Bacillus megaterium QM B1551]
Length = 223
Score = 45.1 bits (105), Expect = 0.044, Method: Compositional matrix adjust.
Identities = 63/251 (25%), Positives = 108/251 (43%), Gaps = 40/251 (15%)
Query: 29 VIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWRDR 88
+II+ +PKTG T+ ++ F L+ +HV A F + +
Sbjct: 5 LIIFIHIPKTGGTTLNDI---------FKKLYAENEIYDHVPVEAMNKHFSQLKEEEKKT 55
Query: 89 RPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLR-YGDNYRPHLVRKKHG 147
A+ HF I SK F ++R P++R++S YYFL+ Y Y+ ++
Sbjct: 56 LKAISGHHFYGI--HDLFSKSYTYF-TMMRNPIERVISLYYFLKTYPGYYQENMRNMSFE 112
Query: 148 DKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVGV 207
D +D + +T+ +CG + +LEKAKENL T + +VG+
Sbjct: 113 DYLDWDPQAKNGQTQQ-----------ICGIHSQI-------SLEKAKENLKT-FEVVGI 153
Query: 208 TEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEETVQQIKKSKIWELE 267
TE + + LL+ L G + K+ + + E +++I+K+ EL+
Sbjct: 154 TEMFNESLLLLKNKL------GWNDIAYKRKNITKSRPLLQEVPTEIIKKIEKNN--ELD 205
Query: 268 NELYEYALEQF 278
EL+EY F
Sbjct: 206 IELFEYIKSNF 216
>gi|221307720|gb|ACM16727.1| IP16422p [Drosophila melanogaster]
Length = 157
Score = 45.1 bits (105), Expect = 0.046, Method: Compositional matrix adjust.
Identities = 30/107 (28%), Positives = 50/107 (46%), Gaps = 21/107 (19%)
Query: 93 YHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKHGD---- 148
Y H F+DF + P++IN++R P++RLVS++Y++R ++ ++ GD
Sbjct: 51 YTKHVAFLDFDLL-DEPWPIYINMVRDPIERLVSWFYYVRAPWHFAER--KEMFGDAIVL 107
Query: 149 ------KTTFDECIRLNRTECSLENMWL--------QVPFLCGHAAA 181
+ F+ CI EC E M + Q +LCG A
Sbjct: 108 PSIDWLRKDFNRCIEERDPECVYEQMEMGNLGDHRRQSLYLCGQNMA 154
>gi|242279110|ref|YP_002991239.1| capsular polysaccharide biosynthesis protein [Desulfovibrio
salexigens DSM 2638]
gi|242122004|gb|ACS79700.1| putative capsular polysaccharide biosynthesis protein
[Desulfovibrio salexigens DSM 2638]
Length = 271
Score = 44.7 bits (104), Expect = 0.062, Method: Compositional matrix adjust.
Identities = 44/147 (29%), Positives = 65/147 (44%), Gaps = 11/147 (7%)
Query: 78 FVNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLF-INILRKPLDRLVSYYYFLR-YGD 135
F+ ++ R L GH +F +F S + LR P+ R+VS Y FLR + +
Sbjct: 42 FLRKISAEEVDRIQLVQGHIFVHNFNEFFSGAFGKYAFTFLRDPVARVVSEYNFLRTWPE 101
Query: 136 NYRPHLVRKKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAK 195
N HL R + +K T + + R E L LCG V LE+AK
Sbjct: 102 N---HLYRYLNEEKVTLIDYVSSQRPELIYRGKNLMARSLCG-----AVEDGRSMLERAK 153
Query: 196 ENLVTKYLLVGVTEELTDFVSLLEAAL 222
+NL YL G+TE + + LL+ +
Sbjct: 154 DNLQRLYLF-GITERFDESLLLLKRMM 179
>gi|313226120|emb|CBY21263.1| unnamed protein product [Oikopleura dioica]
Length = 112
Score = 44.3 bits (103), Expect = 0.071, Method: Compositional matrix adjust.
Identities = 23/100 (23%), Positives = 47/100 (47%), Gaps = 12/100 (12%)
Query: 165 LENMWLQVPFLCGHAAACWVPG--------NPWALEKAKENLVTKYLLVGVTEELTDFVS 216
L ++ + F CG+ C +P +E AK+ ++ Y +VGV E+ D +S
Sbjct: 2 LASIGRYIEFFCGNGPDCQLPQLARDDDVYKSKMVEIAKKRMLDDYFVVGVLEQFEDSLS 61
Query: 217 LLEAALPSFFRGGTDHF----LTSNKSHLRRTNRKIDPSE 252
+ E LP ++RG + + + + ++ + ++ P E
Sbjct: 62 VFEKLLPRYYRGALEVYESKMIQTTRNQTKSIGKRTLPDE 101
>gi|443313799|ref|ZP_21043409.1| Sulfotransferase family [Synechocystis sp. PCC 7509]
gi|442776212|gb|ELR86495.1| Sulfotransferase family [Synechocystis sp. PCC 7509]
Length = 277
Score = 44.3 bits (103), Expect = 0.072, Method: Compositional matrix adjust.
Identities = 66/285 (23%), Positives = 135/285 (47%), Gaps = 52/285 (18%)
Query: 27 DTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHV-NVTGN-NHVLSLADQYRFVNNVTK 84
+T +I+ +PKT ++ N+ +++N ++ N+ GN + +L L + ++ ++++
Sbjct: 6 NTALIFLHLPKTAGSTLNNII-----SRQYNSKNIYNLYGNADQILELTENFK---HLSE 57
Query: 85 WRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPH-LVR 143
+ + + GH + F + + ++ +LR+P+DR +S YY++R +R + L+
Sbjct: 58 KQHQNIKVIKGHICY-GFHELLVRPAT-YVTLLREPVDRAISLYYYIRRHPAHRHYELIT 115
Query: 144 KKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAA------ACWVPGNPWALEKAKEN 197
K+ + D+ I + L+N Q + G A C V LEKAK+N
Sbjct: 116 SKN---MSLDDYI-YSGVATQLDNG--QTRMIAGVDANKVEFGKCSVA----MLEKAKKN 165
Query: 198 LVTKYLLVGVTEELTDFVSLLEA----ALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEE 253
+ T + VG T + + + LL+ +LP + + +T N+ T+ ++ +
Sbjct: 166 INTHFSFVGTTSKFDESLMLLKNYFSWSLPLYQKQN----VTKNRP---ETSDILNSTLN 218
Query: 254 TVQQIKKSKIWELENELYEYA-------LEQFHFVKKHNLVYNKV 291
++ + K L+ ELY+YA +EQ F+ K +N V
Sbjct: 219 VIKDLNK-----LDIELYKYAETKIEQQIEQQDFLAKELKAFNFV 258
>gi|77164263|ref|YP_342788.1| hypothetical protein Noc_0745 [Nitrosococcus oceani ATCC 19707]
gi|254434982|ref|ZP_05048489.1| hypothetical protein NOC27_2045 [Nitrosococcus oceani AFC27]
gi|76882577|gb|ABA57258.1| hypothetical protein Noc_0745 [Nitrosococcus oceani ATCC 19707]
gi|207088093|gb|EDZ65365.1| hypothetical protein NOC27_2045 [Nitrosococcus oceani AFC27]
Length = 435
Score = 43.9 bits (102), Expect = 0.084, Method: Compositional matrix adjust.
Identities = 66/252 (26%), Positives = 99/252 (39%), Gaps = 54/252 (21%)
Query: 55 RFNVLHVNVTGNNHVLS-LADQYRFVNNV--TKWRD------RRPAL--YHGHFGFIDFQ 103
+ + LH+ T V L QY V T WR+ R P L + GHFG Q
Sbjct: 7 KLHFLHIPKTAGTSVTQFLQRQYDLDQIVFETTWRELFEYQPRMPKLKLFRGHFGINLSQ 66
Query: 104 QFGSKEQPLFINILRKPLDRLVSYYYFLR----------YGDNYRPHLVRKKHGDKTTFD 153
G + + I LR+P+ R++S YY +R G+ + K G K +
Sbjct: 67 MIGPEFK--VITFLREPVSRVISQYYHVRKRATEGLNQFAGEMELAEFMHHKQGRKLCRN 124
Query: 154 ECIRLNRTECSLENMWLQVPFLCGHAAACWVP--------GNPWALEKAKENLVTKYLLV 205
R + +W H A +P +P LE+AK+NL + V
Sbjct: 125 LQTRYIGRSLEVGQLW-------AHPAVLNIPPDRFFDDLASPLLLEQAKQNL-NGFFFV 176
Query: 206 GVTEELTDFVSLL---EAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEETVQQIKKSK 262
G+TE + LL AALP L + K R + K P+ E +Q ++
Sbjct: 177 GLTEYYDISMLLLCHKLAALPE---------LDAVKRRERERDAKRVPT-EVLQHLEAEN 226
Query: 263 IWELENELYEYA 274
+L+ ELY +
Sbjct: 227 --QLDMELYRHG 236
>gi|254422767|ref|ZP_05036485.1| hypothetical protein S7335_2919 [Synechococcus sp. PCC 7335]
gi|196190256|gb|EDX85220.1| hypothetical protein S7335_2919 [Synechococcus sp. PCC 7335]
Length = 285
Score = 43.5 bits (101), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 69/271 (25%), Positives = 115/271 (42%), Gaps = 43/271 (15%)
Query: 30 IIYNRVPKTGSTSFVNMAYDMCRKK-----RFNVLHVNVTGNNHVLSLADQYRF----VN 80
I ++ +PK G TS +N A C K NVL++N + + V+
Sbjct: 32 IYFHHIPKCGGTS-LNRALQSCYVKWNLWESSNVLNLNSAASWQSAQVLFGKELPPDVVD 90
Query: 81 NVTKWRDRRPALYH-----------GHFGF--IDFQQFGSKEQPLFINILRKPLDRLVSY 127
+ + R L++ GHF F + Q +GS+ FI +LR P+DR +S
Sbjct: 91 DSAVMQLREELLFYFMSLGHVHYLSGHFPFSTLAHQYYGSRFD--FITVLRDPVDRWISS 148
Query: 128 YYFLRYGDNYRPHLVRKKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGN 187
Y++ +NYR +K D+ I +E + FL G A N
Sbjct: 149 YFY----NNYRQQSSYRKIA--IDLDDYI---NSELGRSQGYEYAKFLGGVAKEGSFM-N 198
Query: 188 PWALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRK 247
+++AKENL K+ L+G +++ F + + R T L S ++ N
Sbjct: 199 ADFVQRAKENL-HKFQLIGFLDDMATFQKMFAQRYGTKLRINT---LNQRPSKEKKLNEI 254
Query: 248 IDPSEETVQQIKKSKIWELENELYEYALEQF 278
I+ E+ + IK +I + E+Y YA++ F
Sbjct: 255 IN--EKMMGTIK--EICSPDLEIYNYAVDNF 281
>gi|198420413|ref|XP_002131085.1| PREDICTED: similar to predicted protein [Ciona intestinalis]
Length = 115
Score = 43.5 bits (101), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 18/59 (30%), Positives = 36/59 (61%)
Query: 190 ALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKI 248
+LE AK +LV YL+VG+ E+ D +++LE LP +F+G + + + +++ + +
Sbjct: 10 SLETAKRHLVEDYLMVGILEQFEDSLNMLELVLPRYFKGAVQVWKSKDVQYIQEVTKTL 68
>gi|90425696|ref|YP_534066.1| hypothetical protein RPC_4223 [Rhodopseudomonas palustris BisB18]
gi|90107710|gb|ABD89747.1| hypothetical protein RPC_4223 [Rhodopseudomonas palustris BisB18]
Length = 417
Score = 43.1 bits (100), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 51/216 (23%), Positives = 88/216 (40%), Gaps = 22/216 (10%)
Query: 61 VNVTGNNHVLSLADQYRFVNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPL-FINILRK 119
+N +G+N V D N ++ R + GH + Q +PL ++ ILR
Sbjct: 41 LNQSGDNFVNLSRDSGGVAANTPRYLQTR--IGGGHLVYGVHHQL---RRPLNYVTILRD 95
Query: 120 PLDRLVSYYYFLRYGDN-YRPHLVRKKHGDKTTFDECIRLNR-TECSLENMWLQVPFLCG 177
PL R +S+++++R G N + + I L E SL+ L V L G
Sbjct: 96 PLQRQISHFHYVRTGKNGVMSEGCSVSSEESLVYRGAITLEEWVENSLQETNLLVKMLSG 155
Query: 178 HAAACWVPGNPWALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSN 237
A N +L +AK N+ + + G+ E++ ++ LL R G +
Sbjct: 156 KAP------NERSLAEAKANIESGRIFAGLAEDMESYLLLLCG------RTGLSRPFHFD 203
Query: 238 KSHLRRTNRKIDPSEETVQQIKKSKIWELENELYEY 273
+ R + PS + K + L+ EL+E+
Sbjct: 204 TNRTRTIEKSDSPSPAAIASFK--HLNRLDYELFEF 237
>gi|224075880|ref|XP_002304810.1| predicted protein [Populus trichocarpa]
gi|222842242|gb|EEE79789.1| predicted protein [Populus trichocarpa]
Length = 337
Score = 42.7 bits (99), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 30/112 (26%), Positives = 51/112 (45%), Gaps = 10/112 (8%)
Query: 27 DTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVL---HVNVTGNNHVLSLADQYRF----- 78
DT +I +PK+G+T +++ + +K+F + H + N H L+ +Y+
Sbjct: 70 DTDVILASIPKSGTTWLKALSFAILNRKKFAISSNDHPLLVSNPHDLAPFFEYKLYADKQ 129
Query: 79 VNNVTKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYF 130
V +++K D P L+ H F Q K I I R P D +S + F
Sbjct: 130 VPDLSKLPD--PRLFATHIPFASLQDSIKKSNCRIIYICRNPFDTFISSWTF 179
>gi|66823749|ref|XP_645229.1| hypothetical protein DDB_G0272174 [Dictyostelium discoideum AX4]
gi|60473293|gb|EAL71239.1| hypothetical protein DDB_G0272174 [Dictyostelium discoideum AX4]
Length = 614
Score = 42.7 bits (99), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 50/193 (25%), Positives = 89/193 (46%), Gaps = 35/193 (18%)
Query: 95 GHFGFIDFQQFGSKEQPLF--INILRKPLDRLVSYYYFLRYGDNYRPHLVRKKHGDKTTF 152
GHFG+ Q + + +LR P+DR++S+Y++ + T F
Sbjct: 428 GHFGYGIHTVLREDAQKTYSYLTMLRDPVDRVISHYFYHK----------------ATKF 471
Query: 153 DECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENL-VTKYLL-----VG 206
DE + + SLE WL++ + + G + E L + Y L VG
Sbjct: 472 DEEYAVAH-DTSLEE-WLEISPRGNNEMVRHLSGTNEEFSPSNETLNMAMYNLRSMKFVG 529
Query: 207 VTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEETVQQIKKSKIWEL 266
+TE + ++LL F G ++ + S+K ++ R + D S+ET+++IK+ K W +
Sbjct: 530 ITERFDETLALLH------FYIGLENPINSDKKNVAR-KKPSDVSQETIEKIKE-KNW-M 580
Query: 267 ENELYEYALEQFH 279
+ LYE +L+ F
Sbjct: 581 DILLYEESLKMFE 593
>gi|328873311|gb|EGG21678.1| putative cell number regulator [Dictyostelium fasciculatum]
Length = 488
Score = 42.4 bits (98), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 68/283 (24%), Positives = 117/283 (41%), Gaps = 54/283 (19%)
Query: 31 IYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWRDRRP 90
I+ VPKTG +S N+ R+ +F H + Y+ V ++
Sbjct: 229 IFVHVPKTGGSSLANIFKRNERRDKF-----------HHFWMRPSYQEVQYISYLN---- 273
Query: 91 ALYHGHFGFIDFQQFGSKEQP--------------LFINILRKPLDRLVSYYYFLRYGDN 136
+ +GH F + EQP ++ +LR+P+DR++S+YY+ R
Sbjct: 274 -IIYGHIRF-GLHHYYEAEQPGRLGVLASEEMNPYSYMTMLREPVDRVISHYYYHRQNRR 331
Query: 137 YRPHLVRKKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKE 196
H + K+ T D+ L+ T + Q LCG A+ NP L K
Sbjct: 332 DPGHALSMKY----TLDDW--LDHTGAATNE---QAHMLCG-IASTDTNDNPEFLSKCSH 381
Query: 197 -NLVTKYLLVGVTEELTDFVSLLEAALPSFFRG-GTDHFLTSNKSHLRRTNRKIDPSEET 254
+L Y VG+TE+ + + LL + + G F N R IDP+
Sbjct: 382 YHLQYVYKYVGLTEKFPESLVLL-----THYTGFQAIRFSKINTGTQRLKVEDIDPN--V 434
Query: 255 VQQIKKSKIWELENELYEYALEQFHFVKKHNLVYNKVLGYEAD 297
+++IK ++ ++ LY A++ F K + V + + Y++D
Sbjct: 435 IEKIK--RLNAIDISLYNMAVDIFE--KSVDAVGREFVTYQSD 473
>gi|444706737|gb|ELW48060.1| Uronyl 2-sulfotransferase [Tupaia chinensis]
Length = 220
Score = 42.0 bits (97), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 18/42 (42%), Positives = 30/42 (71%), Gaps = 3/42 (7%)
Query: 104 QFGSKEQPLFINILRKPLDRLVSYYYFLRYGD--NYRPHLVR 143
+FG +QP++INI+R P+ R +S Y+F R+GD + H++R
Sbjct: 95 RFGG-DQPVYINIIRDPVSRFLSNYFFRRFGDWRGEQNHMIR 135
>gi|126656459|ref|ZP_01727720.1| hypothetical protein CY0110_22187 [Cyanothece sp. CCY0110]
gi|126622145|gb|EAZ92852.1| hypothetical protein CY0110_22187 [Cyanothece sp. CCY0110]
Length = 255
Score = 41.6 bits (96), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 64/262 (24%), Positives = 111/262 (42%), Gaps = 40/262 (15%)
Query: 30 IIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWRDRR 89
II+ VPKTG TS ++ + M K+ + ++ N V ++ + + + K++ R
Sbjct: 21 IIFMHVPKTGGTS-IDKSLRMIYGKKNSYKVDSILTTNAVKAVNQNEKINSGIDKFQLRE 79
Query: 90 PALYH----------GHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRP 139
L + GHF F Q +I +LR P+ R +S Y+F D Y+P
Sbjct: 80 SLLIYEMATGKKYISGHFHFNTDIWEAYHNQYSWITVLRDPVKRYISQYFF----DAYKP 135
Query: 140 HLVRKKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLV 199
+ G F + R + N + G+ + LE AK NL
Sbjct: 136 EDHARVKGSLENFIDSERGKFRGQNYINYF-------GNFSRYDSTNLQTRLEVAKTNL- 187
Query: 200 TKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRR---TNRKIDPSEETVQ 256
+K+ LVG ++L F + L+A LT H+ + + KID E ++
Sbjct: 188 SKFSLVGFLDDLDQFTNDLQAKFN----------LTVKIPHMNKNPVSKPKID--ESIIK 235
Query: 257 QIKKSKIWELENELYEYALEQF 278
+I+ +I + ++ Y YA ++F
Sbjct: 236 KIE--EICKYDSIFYAYARKKF 255
>gi|222053561|ref|YP_002535923.1| parB-like partition protein [Geobacter daltonii FRC-32]
gi|221562850|gb|ACM18822.1| parB-like partition protein [Geobacter daltonii FRC-32]
Length = 278
Score = 41.2 bits (95), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 35/148 (23%), Positives = 63/148 (42%), Gaps = 4/148 (2%)
Query: 69 VLSLADQYRFVNNVTKWRDRRPALYHGHFGFI-DFQQFGSKEQPLFINILRKPLDRLV-- 125
V++ Y + +WR + A H I D + + E L NI R+ L+ +
Sbjct: 67 VVAKGSHYELIAGERRWRAAQKAGLHEVPVVIQDVSEDTALEMALIENIQREDLNAVEEA 126
Query: 126 -SYYYFLRYGDNYRPHLVRKKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWV 184
+Y+ L + + L ++ D++T IRL + ++ ++ GHA A
Sbjct: 127 EAYHSLLERFNLSQEELAKRVGKDRSTVANAIRLLKLPAEIKLDVIEDRLSMGHARALLT 186
Query: 185 PGNPWALEKAKENLVTKYLLVGVTEELT 212
+++A+E +V K L V TE L
Sbjct: 187 LDTAEQMKEARETVVRKKLTVRATESLV 214
>gi|357632549|ref|ZP_09130427.1| hypothetical protein DFW101_0419 [Desulfovibrio sp. FW1012B]
gi|357581103|gb|EHJ46436.1| hypothetical protein DFW101_0419 [Desulfovibrio sp. FW1012B]
Length = 279
Score = 41.2 bits (95), Expect = 0.64, Method: Compositional matrix adjust.
Identities = 49/196 (25%), Positives = 86/196 (43%), Gaps = 31/196 (15%)
Query: 92 LYHGHFGFIDFQQFGSKEQPL-FINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKHGDKT 150
L GH D+ +F ++P+ LR+P+ R+VS Y+FLR + H+ H +K
Sbjct: 58 LIQGHILLGDYDRFTLYDRPVRAFTFLREPVSRVVSEYFFLRTWPDQ--HVYEYLHREKV 115
Query: 151 TFDECI----RLNRTECSLENMWLQVPFLCGHAAACWVPGNPW-ALEKAKENLVTKYLLV 205
T + + RL R + S F+ + P AL +AK NL +++
Sbjct: 116 TLSDYVTSRNRLLRYKGS--------NFMTRVLSGLDPEERPQEALARAKANLRDRFVCF 167
Query: 206 GVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEETVQQIKKSKIWE 265
G+TE + +L AL G L ++ LRR P+ + + +++ + E
Sbjct: 168 GLTERFDASLLVLADAL------GLGDLLYERQNALRR------PAGDRATEAERALVAE 215
Query: 266 ---LENELYEYALEQF 278
L+ +L+ +A F
Sbjct: 216 RNRLDADLHAFAATLF 231
>gi|148266267|ref|YP_001232973.1| parB-like partition protein [Geobacter uraniireducens Rf4]
gi|146399767|gb|ABQ28400.1| chromosome segregation DNA-binding protein [Geobacter
uraniireducens Rf4]
Length = 278
Score = 40.8 bits (94), Expect = 0.82, Method: Compositional matrix adjust.
Identities = 34/148 (22%), Positives = 62/148 (41%), Gaps = 4/148 (2%)
Query: 69 VLSLADQYRFVNNVTKWRDRRPA-LYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLV-- 125
VL +D Y + +WR + A L D + + E L NI R+ L+ +
Sbjct: 67 VLRKSDHYELIAGERRWRAAQKAGLREVPVVIQDVSEDTALEMALIENIQREDLNAVEEA 126
Query: 126 -SYYYFLRYGDNYRPHLVRKKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWV 184
+Y+ L D + L ++ D++T +RL + ++ ++ GHA A
Sbjct: 127 EAYHALLERFDLSQEELAKRVGKDRSTVANSLRLLKLPSEIKLDVIEDRLSMGHARALLT 186
Query: 185 PGNPWALEKAKENLVTKYLLVGVTEELT 212
++ A+E ++ + L V TE L
Sbjct: 187 LDTLEQMKDARETIIKRKLTVRATESLV 214
>gi|394988643|ref|ZP_10381478.1| hypothetical protein SCD_01045 [Sulfuricella denitrificans skB26]
gi|393792022|dbj|GAB71117.1| hypothetical protein SCD_01045 [Sulfuricella denitrificans skB26]
Length = 257
Score = 40.0 bits (92), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 45/193 (23%), Positives = 90/193 (46%), Gaps = 18/193 (9%)
Query: 113 FINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKHGDKTTFDECIRLNRTECSLENM-WLQ 171
++ ++R P++R++S+Y++ Y N + L ++ +++ D+ + T+C+L M Q
Sbjct: 61 YVTLMRNPVERVISHYHY--YRRNAKDPL--RELAMRSSLDDWV----TQCNLNEMDNGQ 112
Query: 172 VPFLCGHAAAC-WVPGNPWALEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGT 230
L G + + + LE+A+ N+ + LVG+TE + L+ F
Sbjct: 113 TRRLSGSMESVRFGECSAEMLERARHNVQRNFALVGITERFDETYGLMS----KLFDWPI 168
Query: 231 DHFLTSNKSHLRRTNRKIDPSEETVQQIKKSKIWELENELYEYALEQFHFVKKHNLVYNK 290
+L N + R + ++I T++ I+K L+ ELYE+A F + N+
Sbjct: 169 KLYLPRNVAQQRSSIKEI--PVRTIRLIEKFNA--LDMELYEHATRLFADRLGQTDIENE 224
Query: 291 VLGYEADKGKQFM 303
V + + FM
Sbjct: 225 VRLLKEKRDNPFM 237
>gi|357515389|ref|XP_003627983.1| Flavonol sulfotransferase-like protein [Medicago truncatula]
gi|355522005|gb|AET02459.1| Flavonol sulfotransferase-like protein [Medicago truncatula]
Length = 348
Score = 40.0 bits (92), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 31/120 (25%), Positives = 49/120 (40%), Gaps = 20/120 (16%)
Query: 27 DTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNN--VTK 84
D+ I+ +PK+G+T +AY + ++ F L NNH L L + + V V
Sbjct: 76 DSDIVVASIPKSGTTWLKGLAYAIVNRQHFTSLE-----NNHPLLLFNPHELVPQFEVNL 130
Query: 85 WRDR-------------RPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFL 131
+ D+ P L+ H F + + I I R P D VSY+ F+
Sbjct: 131 YGDKDGPLPQIDVSNMTEPRLFGTHMPFPSLPKSVKESNCKIIYICRNPFDTFVSYWIFI 190
>gi|172039469|ref|YP_001805970.1| hypothetical protein cce_4556 [Cyanothece sp. ATCC 51142]
gi|171700923|gb|ACB53904.1| unknown [Cyanothece sp. ATCC 51142]
Length = 274
Score = 39.7 bits (91), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 51/181 (28%), Positives = 83/181 (45%), Gaps = 25/181 (13%)
Query: 110 QPL-FINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKHGDKTTFDECIRLNRTECS---- 164
QP +I ILR P++R++S Y ++R H K ++ + L + CS
Sbjct: 86 QPFTYITILRDPIERVLSLYCYIRDEPKNPQH--------KELIEKGMNLEQFLCSGIAK 137
Query: 165 -LENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVGVTEELTDFVSLLEAALP 223
EN Q L G A P + + AKENL + +VG+TE+ + + LL+ L
Sbjct: 138 TAENG--QTRILSGIQAENK-PCSDEMFKLAKENLSKYFSVVGLTEQFDETLILLKRLL- 193
Query: 224 SFFRGGTDHFLTSNKSHLRRTNRKIDPSEETVQQIKKSKIWELENELYEYALEQFHFVKK 283
G + + S K+ +R D S + + I++ ++L +LYEYA + F KK
Sbjct: 194 -----GYNIPIYSTKNKNKRRLSIDDISAKERKMIEQYNSFDL--QLYEYAYQLFEEQKK 246
Query: 284 H 284
Sbjct: 247 Q 247
>gi|424865396|ref|ZP_18289261.1| hypothetical protein NT02SARS_0736 [SAR86 cluster bacterium SAR86B]
gi|400758664|gb|EJP72866.1| hypothetical protein NT02SARS_0736 [SAR86 cluster bacterium SAR86B]
Length = 239
Score = 39.7 bits (91), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 49/211 (23%), Positives = 91/211 (43%), Gaps = 31/211 (14%)
Query: 28 TVIIYNRVPKTGSTSFVNM-AYDMCRKKRFNVLHVNVT---GNNHVLSLADQYRFVNNVT 83
+V I RVPK GSTS M A + K + + N+ N +S +++R + N T
Sbjct: 2 SVYIVIRVPKCGSTSLARMFAKALPDSKEYYISSANLVLAHEENEKISSLEKFRILKNTT 61
Query: 84 K--WRDRR-----------------PALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRL 124
+ W+ R + HGH ID + + ++ L ++++R P DR+
Sbjct: 62 RSIWKKHRCLSFDAVWDKTNRTIKDNDIIHGHLT-IDSIELKNIDKRL-VSVIRDPYDRM 119
Query: 125 VSYYYFLRYGDNYRPHLVRKKHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWV 184
+S Y + R Y+ + + +K+ ++ L E + + P L G + ++
Sbjct: 120 LSDYNYFR-NSFYKKNFISQKYKNRQYIAGNYDL---EGYISYLAENQP-LFGRYISRFI 174
Query: 185 PGNPWALEKAKENLVTKYLLVGVTEELTDFV 215
G ++ E + +KY GV E + F+
Sbjct: 175 IGKE-KVDNPIEYIQSKYFAFGVLERMDLFI 204
>gi|354552264|ref|ZP_08971572.1| hypothetical protein Cy51472DRAFT_0368 [Cyanothece sp. ATCC 51472]
gi|353555586|gb|EHC24974.1| hypothetical protein Cy51472DRAFT_0368 [Cyanothece sp. ATCC 51472]
Length = 271
Score = 39.7 bits (91), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 51/181 (28%), Positives = 83/181 (45%), Gaps = 25/181 (13%)
Query: 110 QPL-FINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKHGDKTTFDECIRLNRTECS---- 164
QP +I ILR P++R++S Y ++R H K ++ + L + CS
Sbjct: 83 QPFTYITILRDPIERVLSLYCYIRDEPKNPQH--------KELIEKGMNLEQFLCSGIAK 134
Query: 165 -LENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVGVTEELTDFVSLLEAALP 223
EN Q L G A P + + AKENL + +VG+TE+ + + LL+ L
Sbjct: 135 TAENG--QTRILSGIQAENK-PCSDEMFKLAKENLSKYFSVVGLTEQFDETLILLKRLL- 190
Query: 224 SFFRGGTDHFLTSNKSHLRRTNRKIDPSEETVQQIKKSKIWELENELYEYALEQFHFVKK 283
G + + S K+ +R D S + + I++ ++L +LYEYA + F KK
Sbjct: 191 -----GYNIPIYSTKNKNKRRLSIDDISAKERKMIEQYNSFDL--QLYEYAYQLFEEQKK 243
Query: 284 H 284
Sbjct: 244 Q 244
>gi|300114935|ref|YP_003761510.1| hypothetical protein Nwat_2365 [Nitrosococcus watsonii C-113]
gi|299540872|gb|ADJ29189.1| conserved hypothetical protein [Nitrosococcus watsonii C-113]
Length = 461
Score = 39.3 bits (90), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 36/118 (30%), Positives = 51/118 (43%), Gaps = 28/118 (23%)
Query: 35 VPKTGSTSFVNM------AYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWRDR 88
+PKT T+ + + A D+C + + L VT L QYRF
Sbjct: 13 IPKTAGTTLIPLLDARFDANDICPAQLWREL---VTLPQESLP---QYRF---------- 56
Query: 89 RPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYF-LRYGDNYRPHLVRKK 145
+ GHFG Q F E PL++ +LR PL +S Y F LR HLV+++
Sbjct: 57 ----FRGHFGAGGLQPF-LPEPPLYLTMLRHPLSLTLSTYRFILRESGTRVHHLVKER 109
>gi|92886084|gb|ABE88094.1| Sulfotransferase domain [Medicago truncatula]
Length = 316
Score = 39.3 bits (90), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 28/114 (24%), Positives = 53/114 (46%), Gaps = 9/114 (7%)
Query: 27 DTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVL--HVNVTGNNHVLSLA-DQYRFVNNVT 83
D+ ++ +PKTG+T + + + + RF+ L H +T N+H L + + +V+ ++
Sbjct: 45 DSDVVVASMPKTGTTWLKALTFAIVNRNRFSSLENHPLLTSNSHELVPSLESNVYVDTIS 104
Query: 84 KWRD------RRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFL 131
++ P L+ H F + + I I R P D VSY+ F+
Sbjct: 105 QFPKFDILNMIEPRLFGTHIPFASLAKSIKESNCKIIYICRNPFDTYVSYWNFM 158
>gi|307110292|gb|EFN58528.1| hypothetical protein CHLNCDRAFT_50309 [Chlorella variabilis]
Length = 415
Score = 39.3 bits (90), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 49/201 (24%), Positives = 78/201 (38%), Gaps = 21/201 (10%)
Query: 30 IIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWRDRR 89
I+ +P GS + D F + H+ LAD V +
Sbjct: 26 ILRRALPADGSGRYCQKRQDYSNHANFPCVERTGLAPQHLQLLADLKANFTTVCSGFN-- 83
Query: 90 PALYHGHFGFIDFQQFG-SKEQPLFINILRKPLDRLVSYYYF-LRYGDNYRPHLVRKKHG 147
H +GF FQ G + L + LR P++R +S+YYF +R + R + +
Sbjct: 84 ---VHQDYGF--FQALGIDHSRTLTMVALRHPVERTLSHYYFQMRIMQDNRKNFLFWPKN 138
Query: 148 DKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWA-------LEKAKENLVT 200
D + + I + N ++ A C PG P LE+AK NL
Sbjct: 139 D-SDYSGLIAFGSQQRQQANYH---TYMLAGAMGCRWPGGPAPPTSDAEILERAKRNL-E 193
Query: 201 KYLLVGVTEELTDFVSLLEAA 221
K+ ++ +TE + D V +L A
Sbjct: 194 KFCVILITEYMDDSVQMLGEA 214
>gi|251798438|ref|YP_003013169.1| hypothetical protein Pjdr2_4462 [Paenibacillus sp. JDR-2]
gi|247546064|gb|ACT03083.1| conserved hypothetical protein [Paenibacillus sp. JDR-2]
Length = 239
Score = 39.3 bits (90), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 44/185 (23%), Positives = 81/185 (43%), Gaps = 33/185 (17%)
Query: 27 DTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWR 86
+ +++Y +PKT +SF D C N +H + + L L++Q + ++
Sbjct: 3 NNLLLYLHIPKTAGSSFTQAITDNCP----NTVHFHTLKDG--LQLSEQLIDADALS--- 53
Query: 87 DRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKH 146
GHF + F +K +I +LR PL+R +S++YF Y+ + +
Sbjct: 54 --------GHFIY-GIHHF-TKRPYRYITMLRHPLERTLSHFYF-----KYKNPAYKVSY 98
Query: 147 GDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWA-LEKAKENLVTKYLLV 205
+ TF+E + + N LQ + G NP+ +KA+ +L + V
Sbjct: 99 DKELTFEEYVLNPYYDAEYCN--LQARMISGEL------NNPYPNFKKARAHLDKHFAFV 150
Query: 206 GVTEE 210
G+TE+
Sbjct: 151 GLTEQ 155
>gi|357515513|ref|XP_003628045.1| Flavonol sulfotransferase-like protein [Medicago truncatula]
gi|355522067|gb|AET02521.1| Flavonol sulfotransferase-like protein [Medicago truncatula]
Length = 329
Score = 38.9 bits (89), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 28/114 (24%), Positives = 53/114 (46%), Gaps = 9/114 (7%)
Query: 27 DTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVL--HVNVTGNNHVLSLA-DQYRFVNNVT 83
D+ ++ +PKTG+T + + + + RF+ L H +T N+H L + + +V+ ++
Sbjct: 58 DSDVVVASMPKTGTTWLKALTFAIVNRNRFSSLENHPLLTSNSHELVPSLESNVYVDTIS 117
Query: 84 KWRD------RRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFL 131
++ P L+ H F + + I I R P D VSY+ F+
Sbjct: 118 QFPKFDILNMIEPRLFGTHIPFASLAKSIKESNCKIIYICRNPFDTYVSYWNFM 171
>gi|345303519|ref|YP_004825421.1| hypothetical protein Rhom172_1667 [Rhodothermus marinus
SG0.5JP17-172]
gi|345112752|gb|AEN73584.1| hypothetical protein Rhom172_1667 [Rhodothermus marinus
SG0.5JP17-172]
Length = 260
Score = 38.9 bits (89), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 26/107 (24%), Positives = 45/107 (42%)
Query: 112 LFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKHGDKTTFDECIRLNRTECSLENMWLQ 171
+++ LR P+DR +S+YYF+R D R ++ R R + + M
Sbjct: 86 IYLTFLRHPVDRAISHYYFIRQCDPRFCRHDRYEYAMSMDLLTFYRQPRFQNEMTMMLAG 145
Query: 172 VPFLCGHAAACWVPGNPWALEKAKENLVTKYLLVGVTEELTDFVSLL 218
+P+ WAL +A NL T++ G+ E + + L
Sbjct: 146 IPWHKLQRCIAHPVWRQWALRRACYNLKTQFACFGLQERFEESLQLF 192
>gi|254299124|ref|ZP_04966574.1| putative capsular polysaccharide biosynthesis protein [Burkholderia
pseudomallei 406e]
gi|157809163|gb|EDO86333.1| putative capsular polysaccharide biosynthesis protein [Burkholderia
pseudomallei 406e]
Length = 440
Score = 38.9 bits (89), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 67/300 (22%), Positives = 119/300 (39%), Gaps = 55/300 (18%)
Query: 30 IIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWRDRR 89
I+++ +PKT +SF + + R +L D+ ++ V RR
Sbjct: 24 IVFHHIPKTAGSSFNQILRTLYRDDEVC-----------DAALDDE---LDEVMADETRR 69
Query: 90 PALYHGHFGFIDF-QQFGSKEQPLFINILRKPLDRLVSYYY----FLRYGDNY------R 138
L+ GHF F + FG + F LR P+ R +S Y+ RY D +
Sbjct: 70 YELFVGHFSFDALHRHFGGATRLTF---LRDPVQRCISQYHNWHDASRYSDAWIGRSDTN 126
Query: 139 PHLVRK-KHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWA------- 190
P +++ K + + E + N S + +L P W
Sbjct: 127 PDVIKALKMTSEMSLCEFVSSNNLVISDSAQNMMTRYLA--------PSVEWKKERGYYD 178
Query: 191 ---LEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRK 247
+EKAK NLV + G+TE+ + LL L +D LT+ +
Sbjct: 179 AELVEKAKRNLVEYFHFFGLTEQFDRSLVLLAHTLGIRPWERSDALLTNRNPKKASFDSV 238
Query: 248 IDPSEETVQQIKKSKIWELENELYEYALEQFHFVKKHNLVYNKVL----GYEADKGKQFM 303
+ + E ++ + ++ ELYE+A+++F+ ++ + Y K++ Y ADK + M
Sbjct: 239 YNTTPEEGGVLRDYNLMDI--ELYEFAVKEFN--RRFDAGYQKLVECAFEYLADKDTRDM 294
>gi|167912364|ref|ZP_02499455.1| putative capsule polysaccharide biosynthesis protein [Burkholderia
pseudomallei 112]
Length = 428
Score = 38.5 bits (88), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 67/300 (22%), Positives = 119/300 (39%), Gaps = 55/300 (18%)
Query: 30 IIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWRDRR 89
I+++ +PKT +SF + + R +L D+ ++ V RR
Sbjct: 12 IVFHHIPKTAGSSFNQILRTLYRDDEVC-----------DAALDDE---LDEVMADETRR 57
Query: 90 PALYHGHFGFIDF-QQFGSKEQPLFINILRKPLDRLVSYYY----FLRYGDNY------R 138
L+ GHF F + FG + F LR P+ R +S Y+ RY D +
Sbjct: 58 YELFVGHFSFDALHRHFGGATRLTF---LRDPVQRCISQYHNWHDASRYSDAWIGRSDTN 114
Query: 139 PHLVRK-KHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWA------- 190
P +++ K + + E + N S + +L P W
Sbjct: 115 PDVIKALKMTSEMSLCEFVSSNNLVISDSAQNMMTRYLA--------PSVEWKKERGYYD 166
Query: 191 ---LEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRK 247
+EKAK NLV + G+TE+ + LL L +D LT+ +
Sbjct: 167 AELVEKAKRNLVEYFHFFGLTEQFDRSLVLLAHTLGIRPWERSDALLTNRNPKKASFDSV 226
Query: 248 IDPSEETVQQIKKSKIWELENELYEYALEQFHFVKKHNLVYNKVL----GYEADKGKQFM 303
+ + E ++ + ++ ELYE+A+++F+ ++ + Y K++ Y ADK + M
Sbjct: 227 YNTTPEEGGVLRDYNLMDI--ELYEFAVKEFN--RRFDAGYQKLVECAFEYLADKDTRDM 282
>gi|225456529|ref|XP_002264512.1| PREDICTED: sulfotransferase 17-like [Vitis vinifera]
Length = 343
Score = 38.5 bits (88), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 30/109 (27%), Positives = 50/109 (45%), Gaps = 10/109 (9%)
Query: 30 IIYNRVPKTGSTSFVNMAYDMCRKKRFNV-LHVNVTGNNHVLS------LADQYRFVNNV 82
I+ +PK+G+T F + + + + F++ H +T + H L L+ + F N
Sbjct: 75 ILLVTLPKSGTTWFKPLMFAVMNRTHFDLSTHPILTTSPHDLVPFLELYLSHKIPFPNPD 134
Query: 83 TKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFL 131
T + P L+ H F Q+ + Q + I R P D VS +YFL
Sbjct: 135 TFYP---PQLFQTHIPFTSLSQYVMESQCRIVYICRNPKDVFVSTFYFL 180
>gi|254259676|ref|ZP_04950730.1| putative capsular polysaccharide biosynthesis protein [Burkholderia
pseudomallei 1710a]
gi|254218365|gb|EET07749.1| putative capsular polysaccharide biosynthesis protein [Burkholderia
pseudomallei 1710a]
Length = 427
Score = 38.5 bits (88), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 67/300 (22%), Positives = 119/300 (39%), Gaps = 55/300 (18%)
Query: 30 IIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWRDRR 89
I+++ +PKT +SF + + R +L D+ ++ V RR
Sbjct: 11 IVFHHIPKTAGSSFNQILRTLYRDDEVC-----------DAALDDE---LDEVMADETRR 56
Query: 90 PALYHGHFGFIDF-QQFGSKEQPLFINILRKPLDRLVSYYY----FLRYGDNY------R 138
L+ GHF F + FG + F LR P+ R +S Y+ RY D +
Sbjct: 57 YELFVGHFSFDALHRHFGGATRLTF---LRDPVQRCISQYHNWHDASRYSDAWIGRSDTN 113
Query: 139 PHLVRK-KHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWA------- 190
P +++ K + + E + N S + +L P W
Sbjct: 114 PDVIKALKMTSEMSLCEFVSSNNLVISDSAQNMMTRYLA--------PSVEWKKERGYYD 165
Query: 191 ---LEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRK 247
+EKAK NLV + G+TE+ + LL L +D LT+ +
Sbjct: 166 AELVEKAKRNLVEYFHFFGLTEQFDRSLVLLAHTLGIRPWERSDALLTNRNPKKASFDSV 225
Query: 248 IDPSEETVQQIKKSKIWELENELYEYALEQFHFVKKHNLVYNKVL----GYEADKGKQFM 303
+ + E ++ + ++ ELYE+A+++F+ ++ + Y K++ Y ADK + M
Sbjct: 226 YNTTPEEGGVLRDYNLMDI--ELYEFAVKEFN--RRFDAGYQKLVECAFEYLADKDTRDM 281
>gi|76808642|ref|YP_334668.1| protein WcbF [Burkholderia pseudomallei 1710b]
gi|76578095|gb|ABA47570.1| WcbF [Burkholderia pseudomallei 1710b]
Length = 498
Score = 38.5 bits (88), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 68/302 (22%), Positives = 122/302 (40%), Gaps = 59/302 (19%)
Query: 30 IIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVL--SLADQYRFVNNVTKWRD 87
I+++ +PKT +SF + + R ++ V +L D+ ++ V
Sbjct: 82 IVFHHIPKTAGSSFNQILRTLYR-------------DDEVCDAALDDE---LDEVMADET 125
Query: 88 RRPALYHGHFGFIDF-QQFGSKEQPLFINILRKPLDRLVSYYY----FLRYGDNY----- 137
RR L+ GHF F + FG + F LR P+ R +S Y+ RY D +
Sbjct: 126 RRYELFVGHFSFDALHRHFGGATRLTF---LRDPVQRCISQYHNWHDASRYSDAWIGRSD 182
Query: 138 -RPHLVRK-KHGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWA----- 190
P +++ K + + E + N S + +L P W
Sbjct: 183 TNPDVIKALKMTSEMSLCEFVSSNNLVISDSAQNMMTRYLA--------PSVEWKKERGY 234
Query: 191 -----LEKAKENLVTKYLLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTN 245
+EKAK NLV + G+TE+ + LL L +D LT+ +
Sbjct: 235 YDAELVEKAKRNLVEYFHFFGLTEQFDRSLVLLAHTLGIRPWERSDALLTNRNPKKASFD 294
Query: 246 RKIDPSEETVQQIKKSKIWELENELYEYALEQFHFVKKHNLVYNKVL----GYEADKGKQ 301
+ + E ++ + ++ ELYE+A+++F+ ++ + Y K++ Y ADK +
Sbjct: 295 SVYNTTPEEGGVLRDYNLMDI--ELYEFAVKEFN--RRFDAGYQKLVECAFEYLADKDTR 350
Query: 302 FM 303
M
Sbjct: 351 DM 352
>gi|291229686|ref|XP_002734806.1| PREDICTED: sulfotransferase family 1B, member 1-like [Saccoglossus
kowalevskii]
Length = 309
Score = 38.1 bits (87), Expect = 5.7, Method: Compositional matrix adjust.
Identities = 30/113 (26%), Positives = 51/113 (45%), Gaps = 10/113 (8%)
Query: 29 VIIYNRVPKTGSTSFVNMA-------YDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNN 81
++ Y R T +T V++ Y+M + V + V +++ + D +R N+
Sbjct: 43 IVSYPRSGTTWTTEMVSLVMNGGDTEYNMSDIQHTRVPQIEVNYKPNIMRIKD-FRSFND 101
Query: 82 VTKWRDRRPA--LYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLR 132
+W P+ L H + F + K + FI + R P D LVSYYYF +
Sbjct: 102 AFEWSKSIPSPRLMRTHLQYNLFAKEPIKRKCKFIYVARNPKDMLVSYYYFYK 154
>gi|389819195|ref|ZP_10209178.1| glycine betaine ABC transporter substrate-binding protein
[Planococcus antarcticus DSM 14505]
gi|388463477|gb|EIM05831.1| glycine betaine ABC transporter substrate-binding protein
[Planococcus antarcticus DSM 14505]
Length = 300
Score = 38.1 bits (87), Expect = 5.8, Method: Compositional matrix adjust.
Identities = 39/142 (27%), Positives = 59/142 (41%), Gaps = 24/142 (16%)
Query: 176 CGHAAACWVP----GNPWALEKAKENLVTKYLLVGVTEELTDFV-----SLLEAAL--PS 224
C AA P G PW E + Y+L E +D+ L E A+ P+
Sbjct: 19 CSLAAEDAEPIVIGGKPWT-----EQYILPYILGEYIEAHSDYTVEYQDGLGEVAILTPA 73
Query: 225 FFRGGTDHFL----TSNKSHLRRTNRKIDPSEETVQQIKKSKIWELENELYEYALEQFHF 280
+G D ++ T K L+R + SEE +QQ+++ E EL LE F
Sbjct: 74 LEQGDIDMYVEYTGTGLKDVLKRESEAGQSSEEVMQQVREG----YEEELGATWLEPLGF 129
Query: 281 VKKHNLVYNKVLGYEADKGKQF 302
+ L Y+K G++A+ Q
Sbjct: 130 ENGYTLAYSKDSGFDAETYSQL 151
>gi|357514755|ref|XP_003627666.1| Steroid sulfotransferase-like protein [Medicago truncatula]
gi|355521688|gb|AET02142.1| Steroid sulfotransferase-like protein [Medicago truncatula]
Length = 391
Score = 38.1 bits (87), Expect = 5.8, Method: Compositional matrix adjust.
Identities = 39/159 (24%), Positives = 60/159 (37%), Gaps = 27/159 (16%)
Query: 27 DTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVN----NV 82
D I+ +PK+G+T +AY + ++ F L NNH L L + + V N+
Sbjct: 118 DNDIVVASMPKSGTTWLKGLAYAIVNRQHFTSLE----NNNHPLLLFNPHELVPLFEVNL 173
Query: 83 TKWRDR-----------RPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFL 131
+D P L+ H F+ + + I I R P D VSY+
Sbjct: 174 YGGKDILLPQIDVSNMIEPRLFGTHIPFLSLPKSVKESSCKIIYICRNPFDTFVSYW--- 230
Query: 132 RYGDNYRPHLVRKKHGDKTTFDECI-RLNRTECSLENMW 169
NY + KK + T +E R + C W
Sbjct: 231 ----NYINKVRSKKSLTELTLEESFERYCKGICLFGPFW 265
>gi|313236890|emb|CBY12140.1| unnamed protein product [Oikopleura dioica]
Length = 237
Score = 37.7 bits (86), Expect = 6.1, Method: Compositional matrix adjust.
Identities = 33/132 (25%), Positives = 55/132 (41%), Gaps = 26/132 (19%)
Query: 16 SPSPETDSLSWDTVIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVT----------- 64
SP P + + +N+VPK G TS + Y + F + +
Sbjct: 92 SPPPNRN------FVFHNKVPKYGGTSLKYILYVLSEDNNFTLDYQPPCIQGKTGCNKRP 145
Query: 65 --GNNHVLSLADQYRFVNNVTKWRDRRPALYH-GHFGFIDFQQFGSKEQPLFINILRKPL 121
G + +SLA K RD+ + H +++F G E P+++N++R P+
Sbjct: 146 EDGTDGEISLAHHL-----AAKRRDKTGKFFLLKHHHWMNFTDIGM-ENPIYMNVVRHPV 199
Query: 122 DRLVSYYYFLRY 133
R S YYF R+
Sbjct: 200 GRFSSAYYFKRF 211
>gi|225456534|ref|XP_002264929.1| PREDICTED: sulfotransferase 17-like [Vitis vinifera]
Length = 343
Score = 37.7 bits (86), Expect = 6.4, Method: Compositional matrix adjust.
Identities = 30/109 (27%), Positives = 50/109 (45%), Gaps = 10/109 (9%)
Query: 30 IIYNRVPKTGSTSFVNMAYDMCRKKRFNV-LHVNVTGNNHVLS------LADQYRFVNNV 82
I+ +PK+G+T F + + + + F++ H +T + H L L+ + F N
Sbjct: 75 ILLVTLPKSGTTWFKPLMFAVMNRTHFDLSTHPLLTTSPHDLVPFLELFLSHKIPFPNPD 134
Query: 83 TKWRDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFL 131
T + P L+ H F Q+ + Q + I R P D VS +YFL
Sbjct: 135 TFYP---PQLFQTHIPFSSLSQYVMESQCRIVYICRNPKDVFVSTFYFL 180
>gi|423619725|ref|ZP_17595557.1| hypothetical protein IIO_05049 [Bacillus cereus VD115]
gi|401251237|gb|EJR57522.1| hypothetical protein IIO_05049 [Bacillus cereus VD115]
Length = 248
Score = 37.7 bits (86), Expect = 7.0, Method: Compositional matrix adjust.
Identities = 47/184 (25%), Positives = 80/184 (43%), Gaps = 34/184 (18%)
Query: 29 VIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLH-VNVTGNNH--VLSLADQYRFVNNVTKW 85
++IY +PKTG T+ ++ +CR+ N+L+ VN G N VL L ++ + +
Sbjct: 13 LLIYVHIPKTGGTTLTDI---ICRQYSENILYDVNQYGLNQQSVLLLKEKLTVADTLC-- 67
Query: 86 RDRRPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKK 145
GH F + S+ +I +LR P+++++S++Y N L +
Sbjct: 68 ---------GHLLF-GVHHYISRP-CTYITMLRNPVEQVLSWFYSAHKNLNQYKELFFEG 116
Query: 146 HGDKTTFDECIRLNRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLLV 205
T D+ I N + Q F+ G A LE AKE + + +V
Sbjct: 117 -----TIDDYI--NNPNFDYYTINFQSRFITGSDVA--------DLEIAKETIANYFSVV 161
Query: 206 GVTE 209
G+TE
Sbjct: 162 GITE 165
>gi|254409361|ref|ZP_05023142.1| hypothetical protein MC7420_6994 [Coleofasciculus chthonoplastes
PCC 7420]
gi|196183358|gb|EDX78341.1| hypothetical protein MC7420_6994 [Coleofasciculus chthonoplastes
PCC 7420]
Length = 277
Score = 37.7 bits (86), Expect = 7.2, Method: Compositional matrix adjust.
Identities = 56/252 (22%), Positives = 110/252 (43%), Gaps = 34/252 (13%)
Query: 29 VIIYNRVPKTGSTSFVNMAYDMCRKKRFNVLHVNVTGNNHVLSLADQYRFVNNVTKWRDR 88
V+I+ +PK+G + + +K++ + + ++++++ ++ R
Sbjct: 15 VVIFIHIPKSGGVTLQRIL-----EKQYTSDTIFTIQSKMFHESIERFKYLPE-SQHRAI 68
Query: 89 RPALYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKHGD 148
R H FG +F + +I ILR P+DR++S+YY++ N +L +
Sbjct: 69 RLLKGHMFFGLHEFLPIPCR----YITILRNPVDRIISHYYYVL--QNPTHYLYQDVVSQ 122
Query: 149 KTTFDECIRLNRTECS-----LENMWLQVPFLCGHAAACWVPGNPWA-LEKAKENLVTKY 202
K + + CS L+N Q L G + V + LEKAK NL + +
Sbjct: 123 KMNLGDYV------CSGLSPELDN--CQTRLLSGVQESIGVGQCSFELLEKAKTNLKSHF 174
Query: 203 LLVGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEETVQQIKKSK 262
+VG+ E + + L + F ++ NK+ + + I S++ + I K+
Sbjct: 175 AVVGLLERFNETMILFKKV----FGWKMPFYIQRNKTKKAFSYQNI--SQKILNSIIKTN 228
Query: 263 IWELENELYEYA 274
+L+ ELY+Y
Sbjct: 229 --QLDIELYKYG 238
>gi|225456527|ref|XP_002262621.1| PREDICTED: sulfotransferase 17-like [Vitis vinifera]
Length = 352
Score = 37.4 bits (85), Expect = 7.7, Method: Compositional matrix adjust.
Identities = 30/105 (28%), Positives = 49/105 (46%), Gaps = 5/105 (4%)
Query: 36 PKTGSTSFVNMAYDMCRKKRFNV-LHVNVTGNNH-VLSLADQYRFVNNVTKWRD--RRPA 91
PK+G+T F + + + + +FN H +T + H ++ + + +N D P
Sbjct: 92 PKSGTTWFKALLFAIMNRTQFNTSTHPLLTTSPHELVPFMEMFLHMNIPFPDPDPLSPPR 151
Query: 92 LYHGHFGFIDFQQFGSKEQPLFINILRKPLDRLVSYYYFLRYGDN 136
L+H H F Q Q + + R P D VS+Y FL+ GDN
Sbjct: 152 LFHTHTPFTSLPQSVIDSQCRIVYVSRNPKDVFVSFYCFLQ-GDN 195
>gi|330790738|ref|XP_003283453.1| hypothetical protein DICPUDRAFT_147129 [Dictyostelium purpureum]
gi|325086718|gb|EGC40104.1| hypothetical protein DICPUDRAFT_147129 [Dictyostelium purpureum]
Length = 602
Score = 37.4 bits (85), Expect = 9.5, Method: Compositional matrix adjust.
Identities = 47/195 (24%), Positives = 85/195 (43%), Gaps = 40/195 (20%)
Query: 95 GHFGFIDFQQFGSKEQPL-----FINILRKPLDRLVSYYYFLRYGDNYRPHLVRKKHGDK 149
GHF ++ ++PL ++ +LR P+DR++S+YY+ R H + K
Sbjct: 391 GHF---EYGIHNDYDEPLKSTHSYLTVLRDPVDRVISHYYYHRNTAYDAEHDI----AAK 443
Query: 150 TTFDECIRL-----NRTECSLENMWLQVPFLCGHAAACWVPGNPWALEKAKENLVTKYLL 204
T +E I L N +L + + P + + + L K
Sbjct: 444 NTLEEWIELSPRANNEQTRALSGIHHEDPLMTNETFNMAL----YHLRTMK--------F 491
Query: 205 VGVTEELTDFVSLLEAALPSFFRGGTDHFLTSNKSHLRRTNRKIDPSEETVQQIKKSKIW 264
VG+TE++ + ++LL+ F G D+ K ++ RK+D + + +IKK K W
Sbjct: 492 VGLTEKMPETLALLK------FFTGLDNPKVQEKQNV---GRKLDVDQSVIDKIKK-KNW 541
Query: 265 ELENELYEYALEQFH 279
++ +YE A + F
Sbjct: 542 -MDIIMYEEACKMFQ 555
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.321 0.136 0.425
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,226,224,598
Number of Sequences: 23463169
Number of extensions: 219855825
Number of successful extensions: 461440
Number of sequences better than 100.0: 472
Number of HSP's better than 100.0 without gapping: 361
Number of HSP's successfully gapped in prelim test: 111
Number of HSP's that attempted gapping in prelim test: 460136
Number of HSP's gapped (non-prelim): 533
length of query: 311
length of database: 8,064,228,071
effective HSP length: 142
effective length of query: 169
effective length of database: 9,027,425,369
effective search space: 1525634887361
effective search space used: 1525634887361
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 76 (33.9 bits)
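
Note on the statistics above: the bit scores and the effective search space in this report can be approximately reproduced from the listed parameters. The sketch below assumes the standard Karlin-Altschul conversion, bits = (lambda * S - ln K) / ln 2. It takes the second Lambda/K pair (0.267, 0.0410) as the gapped values and the first pair (0.321, 0.136) as the ungapped values. All function and variable names are hypothetical and are not part of BLAST. Lambda and K are rounded in the report, so results agree with the printed bit scores only to about one decimal place. The reported E-values are not reproduced here, because compositional matrix adjustment rescales scores per hit.

```python
import math

# Values copied from the statistics footer of this report.
LAMBDA_UNGAPPED, K_UNGAPPED = 0.321, 0.136   # first Lambda/K line (assumed ungapped)
LAMBDA_GAPPED, K_GAPPED = 0.267, 0.0410      # second Lambda/K line (assumed gapped)


def bit_score(raw_score, lam, k):
    """Convert a raw alignment score S to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


# Effective lengths as reported: the query is shortened by the effective HSP length.
query_length = 311
effective_hsp_length = 142
effective_query_length = query_length - effective_hsp_length

# The effective database length is taken as printed, not derived here.
effective_db_length = 9_027_425_369
effective_search_space = effective_query_length * effective_db_length

if __name__ == "__main__":
    # Raw scores that appear in the report: hits (85, 90, 998) and cutoff S2 (76).
    for raw in (76, 85, 90, 998):
        print(raw, round(bit_score(raw, LAMBDA_GAPPED, K_GAPPED), 1))
    # Cutoff S1 (41) is assumed to use the ungapped parameters.
    print(41, round(bit_score(41, LAMBDA_UNGAPPED, K_UNGAPPED), 1))
    print(effective_query_length, effective_search_space)
```

The product of the effective query length and the effective database length matches the "effective search space" line, and a raw score of 90 converts to roughly the 39.3 bits printed for several hits above.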