BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 017178
(376 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|Q8NBL1|PGLT1_HUMAN Protein O-glucosyltransferase 1 OS=Homo sapiens GN=POGLUT1 PE=1
SV=1
Length = 392
Score = 91.7 bits (226), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 70/274 (25%), Positives = 127/274 (46%), Gaps = 26/274 (9%)
Query: 39 GILQLLRLYPGRLPDLELMFDCNDRPVVRARDFGGPNSGPP--PLFRYCSDGSSLDIVFP 96
G+ + GRLPD+E++ + D P V P P P+F + DI++P
Sbjct: 110 GVEHFILEVIGRLPDMEMVINVRDYPQV-------PKWMEPAIPVFSFSKTSEYHDIMYP 162
Query: 97 DWSFWGWAE-------TNIRPWSNVLKDIEEGNKRTKWKERVPYAYWRGNPNVSPIRKEL 149
W+FW T + W +D+ + WK++ AY+RG+ SP R L
Sbjct: 163 AWTFWEGGPAVWPIYPTGLGRWDLFREDLVRSAAQWPWKKKNSTAYFRGS-RTSPERDPL 221
Query: 150 MTCNASDKNDWNARLYVQDWGQESKQNF------KQSNLGDQCSHRYKIYIEGWAWSVSE 203
+ + + +A Y ++ +S ++ K +L D C ++Y G A S
Sbjct: 222 ILLSRKNPKLVDAE-YTKNQAWKSMKDTLGKPAAKDVHLVDHCKYKYLFNFRGVAASFRF 280
Query: 204 KYILACDSMTLIVRPRYYDFFSRGMVPMQHYWPIRDNSKCTSLKFAVDWGNAHTEKAEAI 263
K++ C S+ V + +FF + P HY P++ + ++++ + + A+ + A+ I
Sbjct: 281 KHLFLCGSLVFHVGDEWLEFFYPQLKPWVHYIPVK--TDLSNVQELLQFVKANDDVAQEI 338
Query: 264 GEAASRFIREDLKMGYVYDYMFHLLNEYARLLRF 297
E S+FIR L+M + Y +LL+EY++ L +
Sbjct: 339 AERGSQFIRNHLQMDDITCYWENLLSEYSKFLSY 372
>sp|Q5E9Q1|PGLT1_BOVIN Protein O-glucosyltransferase 1 OS=Bos taurus GN=POGLUT1 PE=2 SV=1
Length = 392
Score = 89.0 bits (219), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 73/274 (26%), Positives = 127/274 (46%), Gaps = 26/274 (9%)
Query: 39 GILQLLRLYPGRLPDLELMFDCNDRPVVRARDFGGPNSGPP--PLFRYCSDGSSLDIVFP 96
G+ + GRLPD+E++ + D P V P P P+F + DI++P
Sbjct: 110 GVEHFILEVIGRLPDMEMVINVRDYPQV-------PKWMEPAIPIFSFSKTLEYHDIMYP 162
Query: 97 DWSFW--GWAETNIRP-----WSNVLKDIEEGNKRTKWKERVPYAYWRGNPNVSPIRKEL 149
W+FW G A I P W +D+ + WK++ AY+RG+ SP R L
Sbjct: 163 AWTFWEGGPAVWPIYPMGLGRWDLFREDLVRSAAQWPWKKKNSTAYFRGS-RTSPERDPL 221
Query: 150 MTCNASDKNDWNARLYVQDWGQESKQNF------KQSNLGDQCSHRYKIYIEGWAWSVSE 203
+ + + +A Y ++ +S ++ K +L D C ++Y G A S
Sbjct: 222 ILLSRKNPKLVDAE-YTKNQAWKSMKDTLGKPAAKDVHLVDHCKYKYLFNFRGVAASFRF 280
Query: 204 KYILACDSMTLIVRPRYYDFFSRGMVPMQHYWPIRDNSKCTSLKFAVDWGNAHTEKAEAI 263
K++ C S+ V + +FF + P HY P++ + ++++ + + A+ + A+ I
Sbjct: 281 KHLFLCGSLVFHVGDEWLEFFYPQLKPWVHYIPVK--TDLSNVQELLQFVKANDDVAQEI 338
Query: 264 GEAASRFIREDLKMGYVYDYMFHLLNEYARLLRF 297
E S+FI LKM + Y +LL EY++ L +
Sbjct: 339 AERGSQFILNHLKMDDITCYWENLLTEYSKFLSY 372
>sp|Q7ZVE6|KDEL1_DANRE KDEL motif-containing protein 1 OS=Danio rerio GN=kdelc1 PE=2 SV=1
Length = 500
Score = 88.6 bits (218), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 79/318 (24%), Positives = 139/318 (43%), Gaps = 21/318 (6%)
Query: 1 MIERARKTAHF-RLVIVNGKAYVEKYKQSIQTRDKFTLWGILQLLRLYPGRLPDLELMFD 59
+I+R K+ I N + Y++ + + + R F +L L R +LPD+E +
Sbjct: 178 IIQRFGKSHSLCHYTIKNNQVYIKTHGEHVGFR-IFMDAFLLSLTR--KVKLPDIEFFVN 234
Query: 60 CNDRPVVRARDFGGPNSGPPPLFRYCSDGSSLDIVFPDWSFWGWAETNIRPWSNVLKDIE 119
D P+ + R + P P+F +C + DIV P + E+ + V D+
Sbjct: 235 LGDWPLEKRR----ASQNPSPVFSWCGSNDTRDIVMPTYDL---TESVLETMGRVSLDMM 287
Query: 120 --EGNKRTKWKERVPYAYWRGNPNVSPIRKELMTCNASDKNDWNARLYVQDWGQESKQNF 177
+G+ W++++ +WRG + R EL+ ++ +A L + + + +
Sbjct: 288 SVQGHTGPVWEKKINKGFWRGRDSRKE-RLELVKLARANTAMLDAALTNFFFFKHDESLY 346
Query: 178 ----KQSNLGDQCSHRYKIYIEGWAWSVSEKYILACDSMTLIVRPRYYDFFSRGMVPMQH 233
K + D ++Y+I ++G + Y+LA DS+ YY+ F + P H
Sbjct: 347 GPLVKHVSFFDFFKYKYQINVDGTVAAYRLPYLLAGDSVVFKHDSIYYEHFYNELQPWVH 406
Query: 234 YWPIRDNSKCTSLKFAVDWGNAHTEKAEAIGEAASRFIREDLKMGYVYDYMFHLLNEYAR 293
Y P R S + L + W H E+A+ I A +F R L V+ Y L +YA
Sbjct: 407 YIPFR--SDLSDLLEKIQWAKDHDEEAKKIALAGQQFARTHLMGDSVFCYYHKLFQKYAE 464
Query: 294 LLRFKPSIPAGALELCSE 311
L KP + G +EL +
Sbjct: 465 LQVTKPKVRDG-MELVEQ 481
>sp|Q6UW63|KDEL1_HUMAN KDEL motif-containing protein 1 OS=Homo sapiens GN=KDELC1 PE=1 SV=1
Length = 502
Score = 88.6 bits (218), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 76/295 (25%), Positives = 129/295 (43%), Gaps = 21/295 (7%)
Query: 17 NGKAYVEKYKQSIQTRDKFTLWGILQLLRLYPGRLPDLELMFDCNDRPVVRARDFGGPNS 76
+ K Y++ + + + R F +L L R ++PD+EL + D P+ + + NS
Sbjct: 196 DNKVYIKTHGEHVGFR-IFMDAILLSLTR--KVKMPDVELFVNLGDWPLEKKKS----NS 248
Query: 77 GPPPLFRYCSDGSSLDIVFPDWSFWGWAETNIRPWSNVLKDIE--EGNKRTKWKERVPYA 134
P+F +C S DIV P + ++ + V D+ + N W+ + A
Sbjct: 249 NIHPIFSWCGSTDSKDIVMPTYDL---TDSVLETMGRVSLDMMSVQANTGPPWESKNSTA 305
Query: 135 YWRGNPNVSPIRKELMTCNASDKNDWNARLYVQDWGQESKQNF-----KQSNLGDQCSHR 189
WRG + R EL+ + +A + + + +N K + D H+
Sbjct: 306 VWRGRDSRKE-RLELVKLSRKHPELIDAA-FTNFFFFKHDENLYGPIVKHISFFDFFKHK 363
Query: 190 YKIYIEGWAWSVSEKYILACDSMTLIVRPRYYDFFSRGMVPMQHYWPIRDNSKCTSLKFA 249
Y+I I+G + Y+L DS+ L YY+ F + P +HY P++ N + L
Sbjct: 364 YQINIDGTVAAYRLPYLLVGDSVVLKQDSIYYEHFYNELQPWKHYIPVKSN--LSDLLEK 421
Query: 250 VDWGNAHTEKAEAIGEAASRFIREDLKMGYVYDYMFHLLNEYARLLRFKPSIPAG 304
+ W H E+A+ I +A F R +L ++ Y F L EYA L +P I G
Sbjct: 422 LKWAKDHDEEAKKIAKAGQEFARNNLMGDDIFCYYFKLFQEYANLQVSEPQIREG 476
>sp|Q7Z4H8|KDEL2_HUMAN KDEL motif-containing protein 2 OS=Homo sapiens GN=KDELC2 PE=1 SV=2
Length = 507
Score = 84.7 bits (208), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 73/297 (24%), Positives = 133/297 (44%), Gaps = 11/297 (3%)
Query: 25 YKQSIQTRDKFTLWGILQLLRLY-PGRLPDLELMFDCNDRPVVRARDFGGPNSGPPPLFR 83
Y++S+ F ++ LL L LPDLE + D P+ + G P+ P P+
Sbjct: 204 YRRSLGKYTDFKMFSDEILLSLTRKVLLPDLEFYVNLGDWPLEHRKVNGTPS--PIPIIS 261
Query: 84 YCSDGSSLDIVFPDWSFWGWAETNIRPWSNVLKDIEEGNKRTKWKERVPYAYWRGNPNVS 143
+C S D+V P + +R +N L I+ GN W + A++RG +
Sbjct: 262 WCGSLDSRDVVLPTYDITHSMLEAMRGVTNDLLSIQ-GNTGPSWINKTERAFFRGRDSRE 320
Query: 144 PIRKELMTCNASDKNDWNARLYVQDWGQESKQNFKQSNLG---DQCSHRYKIYIEGWAWS 200
R +L+ + + +A + + QE ++ ++ L D ++Y++ ++G +
Sbjct: 321 E-RLQLVQLSKENPQLLDAGITGYFFFQEKEKELGKAKLMGFFDFFKYKYQVNVDGTVAA 379
Query: 201 VSEKYILACDSMTLIVRPRYYDFFSRGMVPMQHYWPIRDNSKCTSLKFAVDWGNAHTEKA 260
Y++ DS+ L YY+ F + P +HY PI+ N + L V W + E+A
Sbjct: 380 YRYPYLMLGDSLVLKQDSPYYEHFYMALEPWKHYVPIKRN--LSDLLEKVKWAKENDEEA 437
Query: 261 EAIGEAASRFIREDLKMGYVYDYMFHLLNEYARLLRFKPSIPAGALELCSETMACSA 317
+ I + R+ L+ +Y Y + +L +YA KP + G +EL + +A
Sbjct: 438 KKIAKEGQLMARDLLQPHRLYCYYYQVLQKYAERQSSKPEVRDG-MELVPQPEDSTA 493
>sp|Q9JHP7|KDEL1_MOUSE KDEL motif-containing protein 1 OS=Mus musculus GN=Kdelc1 PE=1 SV=1
Length = 502
Score = 83.6 bits (205), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 73/294 (24%), Positives = 125/294 (42%), Gaps = 19/294 (6%)
Query: 17 NGKAYVEKYKQSIQTRDKFTLWGILQLLRLYPGRLPDLELMFDCNDRPVVRARDFGGPNS 76
+ K Y++ + + + R F +L L R R+PD+E + D P+ + + NS
Sbjct: 196 DNKVYIKTHGEHVGFR-IFMDAILLSLTRKV--RMPDVEFFVNLGDWPLEKKKS----NS 248
Query: 77 GPPPLFRYCSDGSSLDIVFPDWSFWGWAETNIRPWSNVLKDIE--EGNKRTKWKERVPYA 134
P+F +C S DIV P + ++ + V D+ + N W+ + A
Sbjct: 249 NIQPIFSWCGSTESRDIVMPTYDL---TDSVLETMGRVSLDMMSVQANTGPPWESKNSTA 305
Query: 135 YWRGNPNVSPIRKELMTCNASDKNDWNARLYVQDWGQESKQNF----KQSNLGDQCSHRY 190
WRG + R EL+ + +A + + + + K + D H+Y
Sbjct: 306 VWRGRDSRKE-RLELVKLSRKHPELIDAAFTNFFFFKHDESLYGPIVKHISFFDFFKHKY 364
Query: 191 KIYIEGWAWSVSEKYILACDSMTLIVRPRYYDFFSRGMVPMQHYWPIRDNSKCTSLKFAV 250
+I I+G + Y+L DS+ L YY+ F + P +HY P++ N + L +
Sbjct: 365 QINIDGTVAAYRLPYLLVGDSVVLKQDSIYYEHFYNELQPWKHYIPVKSN--LSDLLEKL 422
Query: 251 DWGNAHTEKAEAIGEAASRFIREDLKMGYVYDYMFHLLNEYARLLRFKPSIPAG 304
W H +A+ I +A F R +L ++ Y F L YA L +P I G
Sbjct: 423 KWAKEHDAEAKKIAKAGQEFARNNLMGDDIFCYYFKLFQGYANLQVSEPQIREG 476
>sp|B0X1Q4|RUMI_CULQU O-glucosyltransferase rumi homolog OS=Culex quinquefasciatus
GN=CPIJ013394 PE=3 SV=1
Length = 403
Score = 81.6 bits (200), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 66/275 (24%), Positives = 115/275 (41%), Gaps = 22/275 (8%)
Query: 39 GILQLLRLYPGRLPDLELMFDCNDRPVVRARDFGGPNSGPPPLFRYCSDGSSLDIVFPDW 98
G+ +R +LPD+EL+ +C D P + +R + P P+ + LDI++P W
Sbjct: 125 GVEHFIRPNLPKLPDMELIINCRDWPQI-SRHWNASRE-PLPVLSFSKTNDYLDIMYPTW 182
Query: 99 SFW-GWAETNIRP-----WSNVLKDIEEGNKRTKWKERVPYAYWRGNPN---------VS 143
FW G ++ P W + + K W++++ A++RG+ +S
Sbjct: 183 GFWEGGPAISLYPTGLGRWDQHRVSVRKAAKVWPWEKKLQQAFFRGSRTSDERDPLVLLS 242
Query: 144 PIRKELMTCNASDKNDWNARLYVQDWGQESKQNFKQSNLGDQCSHRYKIYIEGWAWSVSE 203
+R EL+ + W R E Q + L D C ++Y G A S
Sbjct: 243 RMRPELVDAQYTKNQAW--RSPKDTLHAEPAQEVR---LEDHCQYKYLFNFRGVAASFRF 297
Query: 204 KYILACDSMTLIVRPRYYDFFSRGMVPMQHYWPIRDNSKCTSLKFAVDWGNAHTEKAEAI 263
K++ C S+ V + +FF + P HY P+ L+ + + H + A+ I
Sbjct: 298 KHLFLCKSLVFHVGQEWQEFFYDSLKPWVHYVPVPVGINEWELEHLIQFFREHDQLAQEI 357
Query: 264 GEAASRFIREDLKMGYVYDYMFHLLNEYARLLRFK 298
I L+M V Y LL Y +L++++
Sbjct: 358 ANRGYEHIWNHLRMEDVECYWKRLLRRYGKLVKYE 392
>sp|Q8BYB9|PGLT1_MOUSE Protein O-glucosyltransferase 1 OS=Mus musculus GN=Poglut1 PE=2
SV=2
Length = 392
Score = 81.6 bits (200), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 66/274 (24%), Positives = 124/274 (45%), Gaps = 26/274 (9%)
Query: 39 GILQLLRLYPGRLPDLELMFDCNDRPVVRARDFGGPNSGPP--PLFRYCSDGSSLDIVFP 96
G+ + RLPD+E++ + D P V P P P+F + DI++P
Sbjct: 110 GVEHFILEVIHRLPDMEMVINVRDYPQV-------PKWMEPTIPVFSFSKTSEYHDIMYP 162
Query: 97 DWSFWGWAE-------TNIRPWSNVLKDIEEGNKRTKWKERVPYAYWRGNPNVSPIRKEL 149
W+FW T + W +D+ + W+++ AY+RG+ SP R L
Sbjct: 163 AWTFWEGGPAVWPLYPTGLGRWDLFREDLLRSAAQWPWEKKNSTAYFRGS-RTSPERDPL 221
Query: 150 MTCNASDKNDWNARLYVQDWGQESKQNF------KQSNLGDQCSHRYKIYIEGWAWSVSE 203
+ + + +A Y ++ +S ++ K +L D C +RY G A S
Sbjct: 222 ILLSRKNPKLVDAE-YTKNQAWKSMKDTLGKPAAKDVHLIDHCKYRYLFNFRGVAASFRF 280
Query: 204 KYILACDSMTLIVRPRYYDFFSRGMVPMQHYWPIRDNSKCTSLKFAVDWGNAHTEKAEAI 263
K++ C S+ V + +FF + P HY P++ + ++++ + + A+ + A+ I
Sbjct: 281 KHLFLCGSLVFHVGDEWVEFFYPQLKPWVHYIPVK--TDLSNVQELLQFVKANDDIAQEI 338
Query: 264 GEAASRFIREDLKMGYVYDYMFHLLNEYARLLRF 297
+ S+FI L+M + Y +LL +Y++ L +
Sbjct: 339 AKRGSQFIINHLQMDDITCYWENLLTDYSKFLSY 372
>sp|Q566E5|KDEL2_RAT KDEL motif-containing protein 2 OS=Rattus norvegicus GN=Kdelc2 PE=2
SV=1
Length = 508
Score = 81.3 bits (199), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 73/303 (24%), Positives = 132/303 (43%), Gaps = 10/303 (3%)
Query: 25 YKQSIQTRDKFTLWGILQLLRLY-PGRLPDLELMFDCNDRPVVRARDFGGPNSGPPPLFR 83
Y++S+ F ++ LL L LPDLE + D P+ + P GP P+
Sbjct: 205 YRRSLGKYTDFKMFSDEILLSLARKVTLPDLEFYINLGDWPLEHRKVNDTP--GPIPIIS 262
Query: 84 YCSDGSSLDIVFPDWSFWGWAETNIRPWSNVLKDIEEGNKRTKWKERVPYAYWRGNPNVS 143
+C S DI+ P + +R +N L ++ GN W + A++RG +
Sbjct: 263 WCGSLDSRDIILPTYDVTHSTLEAMRGVTNDLLSVQ-GNTGPSWINKTEKAFFRGRDSRE 321
Query: 144 PIRKELMTCNASDKNDWNARLYVQDWGQESKQNFKQSNLG---DQCSHRYKIYIEGWAWS 200
R +L+ + + +A + + QE ++ ++ L D ++Y++ ++G +
Sbjct: 322 E-RLQLVLLSKENPQLLDAGITGYFFFQEKEKELGKAKLMGFFDFFKYKYQVNVDGTVAA 380
Query: 201 VSEKYILACDSMTLIVRPRYYDFFSRGMVPMQHYWPIRDNSKCTSLKFAVDWGNAHTEKA 260
Y++ DS+ L YY+ F + P +HY PI+ N + L V W + E+A
Sbjct: 381 YRYPYLMLGDSLVLKQESPYYEHFYVELRPWKHYVPIKRN--LSDLLEKVKWAKENDEEA 438
Query: 261 EAIGEAASRFIREDLKMGYVYDYMFHLLNEYARLLRFKPSIPAGALELCSETMACSAKGT 320
+ I + R+ L+ +Y Y + +L +YA KP I G + + S +
Sbjct: 439 KRIAKEGQLTARDLLQPPRLYCYYYRVLQKYAERQVSKPMIRDGMERVPQPDDSTSVRQC 498
Query: 321 WRK 323
RK
Sbjct: 499 HRK 501
>sp|Q8T045|RUMI_DROME O-glucosyltransferase rumi OS=Drosophila melanogaster GN=rumi PE=1
SV=1
Length = 411
Score = 79.7 bits (195), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 63/272 (23%), Positives = 117/272 (43%), Gaps = 17/272 (6%)
Query: 39 GILQLLRLYPGRLPDLELMFDCNDRPVVRARDFGGPNSGPPPLFRYCSDGSSLDIVFPDW 98
GI L LPD++L+ + D P + A N+ P+F + DI++P W
Sbjct: 128 GIEHFLLPLVATLPDMDLIINTRDYPQLNA---AWGNAAGGPVFSFSKTKEYRDIMYPAW 184
Query: 99 SFW-GWAETNIRP-----WSNVLKDIEEGNKRTKWKERVPYAYWRGNPNVSPIRKELMTC 152
+FW G T + P W + + +E+ W ++ ++RG+ S R L+
Sbjct: 185 TFWAGGPATKLHPRGIGRWDQMREKLEKRAAAIPWSQKRSLGFFRGS-RTSDERDSLILL 243
Query: 153 NASDKNDWNARLYVQDWGQESKQNF------KQSNLGDQCSHRYKIYIEGWAWSVSEKYI 206
+ + A+ Y ++ G +S ++ + + D C ++Y G A S K++
Sbjct: 244 SRRNPELVEAQ-YTKNQGWKSPKDTLDAPAADEVSFEDHCKYKYLFNFRGVAASFRLKHL 302
Query: 207 LACDSMTLIVRPRYYDFFSRGMVPMQHYWPIRDNSKCTSLKFAVDWGNAHTEKAEAIGEA 266
C S+ V + +FF + P HY P++ + + + + A+ I +
Sbjct: 303 FLCKSLVFHVGDEWQEFFYDQLKPWVHYVPLKSYPSQQEYEHILSFFKKNDALAQEIAQR 362
Query: 267 ASRFIREDLKMGYVYDYMFHLLNEYARLLRFK 298
FI E L+M + Y LL Y +LL+++
Sbjct: 363 GYDFIWEHLRMKDIKCYWRKLLKRYVKLLQYE 394
>sp|A0NDG6|RUMI_ANOGA O-glucosyltransferase rumi homolog OS=Anopheles gambiae
GN=AGAP004267 PE=3 SV=1
Length = 399
Score = 76.3 bits (186), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 67/282 (23%), Positives = 122/282 (43%), Gaps = 20/282 (7%)
Query: 39 GILQLLRLYPGRLPDLELMFDCNDRPVVRARDFGGPNSGPPPLFRYCSDGSSLDIVFPDW 98
G+ +R LPD++L+ +C D P + R + P+ + LDI++P W
Sbjct: 126 GVEHFVRPLLPLLPDMDLIVNCRDWPQIH-RHWSKEKI---PVLSFSKTAEYLDIMYPAW 181
Query: 99 SFW------GWAETNIRPWSNVLKDIEEGNKRTKWKERVPYAYWRGNPNVSPIRKELMTC 152
+FW T + W + I + + W+ + P A++RG+ S R L+
Sbjct: 182 AFWEGGPAIALYPTGLGRWDLHRQTITKAS--ADWEAKEPKAFFRGS-RTSDERDALVLL 238
Query: 153 NASDKNDWNARLYVQDWGQESKQNF------KQSNLGDQCSHRYKIYIEGWAWSVSEKYI 206
+ + + +A+ Y ++ +S Q+ ++ L + C +R+ G A S K++
Sbjct: 239 SRAQPSLVDAQ-YTKNQAWKSPQDTLNAEPAREVTLEEHCRYRFLFNFRGVAASFRFKHL 297
Query: 207 LACDSMTLIVRPRYYDFFSRGMVPMQHYWPIRDNSKCTSLKFAVDWGNAHTEKAEAIGEA 266
C S+ V + +FF + P HY P+ S L+ + + H + A AI E
Sbjct: 298 FLCRSLVFHVGDEWQEFFYPSLKPWVHYVPVPVRSTPEELEALITFFQEHDQLARAIAER 357
Query: 267 ASRFIREDLKMGYVYDYMFHLLNEYARLLRFKPSIPAGALEL 308
I L+M V Y LL Y +L+R+ + +E+
Sbjct: 358 GYEHIWNHLRMADVECYWKKLLKRYGKLIRYTVERDSTLIEV 399
>sp|Q29AU6|RUMI_DROPS O-glucosyltransferase rumi OS=Drosophila pseudoobscura
pseudoobscura GN=rumi PE=3 SV=1
Length = 409
Score = 74.7 bits (182), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 60/272 (22%), Positives = 115/272 (42%), Gaps = 17/272 (6%)
Query: 39 GILQLLRLYPGRLPDLELMFDCNDRPVVRARDFGGPNSGPPPLFRYCSDGSSLDIVFPDW 98
GI L LPD++L+ + D P + + N P+ + DI++P W
Sbjct: 126 GIEHFLLPLVATLPDMDLVINTRDYPQI---NMAWGNGAQGPILSFSKTKDHRDIMYPAW 182
Query: 99 SFW-GWAETNIRP-----WSNVLKDIEEGNKRTKWKERVPYAYWRGNPNVSPIRKELMTC 152
+FW G T + P W + + +E+ W ++ ++RG+ S R L+
Sbjct: 183 TFWAGGPATKLHPRGIGRWDLMREKLEKRAAAIPWSQKRELGFFRGS-RTSDERDSLILL 241
Query: 153 NASDKNDWNARLYVQDWGQESKQNF------KQSNLGDQCSHRYKIYIEGWAWSVSEKYI 206
+ + A+ Y ++ G +S ++ + + D C ++Y G A S K++
Sbjct: 242 SRRNPELVEAQ-YTKNQGWKSPKDTLDAPPAGEVSFEDHCKYKYLFNFRGVAASFRLKHL 300
Query: 207 LACDSMTLIVRPRYYDFFSRGMVPMQHYWPIRDNSKCTSLKFAVDWGNAHTEKAEAIGEA 266
C S+ V + +FF + P HY P+++ + + + + A+ I +
Sbjct: 301 FLCQSLVFHVGDEWQEFFYDQLKPWVHYVPLKNYPSQQEYEELLTFFRKNDALAQEIAQR 360
Query: 267 ASRFIREDLKMGYVYDYMFHLLNEYARLLRFK 298
FI + L+M + Y LL Y +LL ++
Sbjct: 361 GRDFIWQHLRMKDIKCYWRRLLKSYVKLLTYE 392
>sp|Q16QY8|RUMI_AEDAE O-glucosyltransferase rumi homolog OS=Aedes aegypti GN=AAEL011121
PE=3 SV=1
Length = 402
Score = 63.2 bits (152), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 71/310 (22%), Positives = 132/310 (42%), Gaps = 24/310 (7%)
Query: 1 MIERARKTAHFRLVIVNGKAYVEKYKQSIQTRDKFTLWGILQLLRLYPGRLPDLELMFDC 60
M+ERAR + IV+ + Y +K G+ ++ LPD+EL+ +C
Sbjct: 94 MVERARSYG-TKYQIVDHRLYRQK-----DCMFPARCSGVEHFIKPNLPHLPDMELIINC 147
Query: 61 NDRPVVRARDFGGPNSGPPPLFRYCSDGSSLDIVFPDWSFW-GWAETNIRP-----WSNV 114
D P + P+ + LDI++P W FW G ++ P W
Sbjct: 148 RDWPQINRH----WKQEKLPVLSFSKTDDYLDIMYPTWGFWEGGPAISLYPTGLGRWDQH 203
Query: 115 LKDIEEGNKRTKWKERVPYAYWRGNPNVSPIRKELMTCNASDKNDWNARLYVQDWGQESK 174
I++ KW+++ A++RG+ S R L+ + +A+ Y ++ +S
Sbjct: 204 RVSIKKAADSWKWEKKKAKAFFRGS-RTSDERDPLVLLSRRKPELVDAQ-YTKNQAWKSP 261
Query: 175 QNF------KQSNLGDQCSHRYKIYIEGWAWSVSEKYILACDSMTLIVRPRYYDFFSRGM 228
++ ++ L D C ++Y G A S K++ C S+ V + +FF +
Sbjct: 262 KDTLNAKPAQEVRLEDHCQYKYLFNFRGVAASFRFKHLFLCRSLVFHVGSEWQEFFYPSL 321
Query: 229 VPMQHYWPIRDNSKCTSLKFAVDWGNAHTEKAEAIGEAASRFIREDLKMGYVYDYMFHLL 288
P HY P+R + L+ +++ H + A I + + + L+M V Y LL
Sbjct: 322 KPWVHYVPVRVGATQEELEELIEFFAEHDDLAREIADRGFEHVWKHLRMKDVECYWRKLL 381
Query: 289 NEYARLLRFK 298
Y +L++++
Sbjct: 382 RRYGKLVKYE 391
>sp|O55767|VF179_IIV6 Uncharacterized protein 179R OS=Invertebrate iridescent virus 6
GN=IIV6-179R PE=4 SV=1
Length = 1186
Score = 47.0 bits (110), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 78/336 (23%), Positives = 129/336 (38%), Gaps = 90/336 (26%)
Query: 50 RLPDLELMFDCNDRPVVRARD-------FGGPNS-----------GPPPLFRYCSDGSSL 91
+PD+EL + D P++ FG +S G P+ C+
Sbjct: 206 EVPDIELFINRRDFPLLTKNGTEPYYNIFGKDHSLDSKSLKLISEGMCPILSMCTSDMYA 265
Query: 92 DIVFP---DWSFWGWAE---TNIRPWSN-------------VLKDIEEGNKRTKWKERVP 132
DIV P DW+ E T++ N KD + N T W++R+P
Sbjct: 266 DIVIPTHEDWARVASTEGVKTHVGDKENNENSLPKKITFPPQCKDYSKDNFNTPWEKRIP 325
Query: 133 YAYWRG---------NPNVSPIRKELMTCNAS-----DK----------NDWNAR----- 163
A +RG + N+ + L+ S DK WN R
Sbjct: 326 TAVFRGGSTGCGVSSDTNLETFNQRLVAAKISYNSKPDKYNVPLITAGITKWNLRPRKIL 385
Query: 164 --LYVQDWGQESKQNFKQSNLG--DQCSHRYKIYIEGW--AWSVSEKYILACDSMTLIVR 217
Y+Q E K+ K S L +Q ++Y I I+G A+ +S + + C L+V+
Sbjct: 386 NEKYLQTINIE-KEAPKVSPLTPEEQSKYKYIINIDGHVSAFRLSLEMSMGC--CILLVK 442
Query: 218 PRYYD-------FFSRGMVPMQHYWPIRDNSKCTSLKFAVDWGNAHTEKAEAIGEAASRF 270
+ + +FS + P HY P++ S + L + W + EK + I + A +F
Sbjct: 443 SKIPNETFGWKMWFSHLLKPYIHYVPVK--SDLSDLIEKIQWCRDNDEKCKEISQEALKF 500
Query: 271 IREDLKMGYVYDYMFHLLNEYARLLRFKPSIPAGAL 306
+ L + DYM +L +++ K S P +
Sbjct: 501 YQTYLSRESILDYMQNL------MVKLKLSFPTNEI 530
>sp|Q197C5|VF179_IIV3 Uncharacterized protein 035R OS=Invertebrate iridescent virus 3
GN=IIV3-035R PE=4 SV=1
Length = 1098
Score = 42.4 bits (98), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 28/111 (25%), Positives = 53/111 (47%), Gaps = 4/111 (3%)
Query: 185 QCSHRYKIYIEGWAWSVSEKYILACDSMTLIVRPRY--YDFFSRGMVPMQHYWPIRDNSK 242
Q +++ +++EG + L S L+V+ + ++S + P HY P+R +
Sbjct: 383 QSQYKFIVHVEGHVSAFRLSLELGMKSCILLVQSLHGWKMWYSDLLKPWVHYVPVRPD-- 440
Query: 243 CTSLKFAVDWGNAHTEKAEAIGEAASRFIREDLKMGYVYDYMFHLLNEYAR 293
+ L ++W A+ + A+ E A +F R L + D++ H LN AR
Sbjct: 441 LSDLFDRIEWCRANDAQCRAMAENAYQFYRTHLDKESILDHLQHTLNRLAR 491
>sp|P39907|LPSA_DICNO O-glucosyltransferase LpsA OS=Dichelobacter nodosus GN=lpsA PE=3
SV=1
Length = 318
Score = 34.7 bits (78), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 28/102 (27%), Positives = 53/102 (51%), Gaps = 7/102 (6%)
Query: 181 NLGDQCSHRYKIYIEGWAWSVSEKYILACDSMTLIVRPRYYDFFSRGMVPMQ-HYWPIRD 239
++ Q +RY + +EG + + K+I+A +S+ L+ PRY + G + HY +RD
Sbjct: 202 SVKRQLQYRYILSLEGNDVATNLKWIMASNSLCLMPAPRYETWMMEGRLQAGVHYVQLRD 261
Query: 240 NSKCTSLKFAVDWGNAHTEKAEAIGEAA----SRFIREDLKM 277
+ L+ + + + E AEAI + A S+F R + ++
Sbjct: 262 D--FADLEEQILFFERYPEAAEAIIKNAQQWMSQFARPETEL 301
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.320 0.136 0.450
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 151,128,715
Number of Sequences: 539616
Number of extensions: 6525212
Number of successful extensions: 12881
Number of sequences better than 100.0: 17
Number of HSP's better than 100.0 without gapping: 14
Number of HSP's successfully gapped in prelim test: 3
Number of HSP's that attempted gapping in prelim test: 12823
Number of HSP's gapped (non-prelim): 24
length of query: 376
length of database: 191,569,459
effective HSP length: 119
effective length of query: 257
effective length of database: 127,355,155
effective search space: 32730274835
effective search space used: 32730274835
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 62 (28.5 bits)