BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 044043
(428 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|Q7ZVE6|KDEL1_DANRE KDEL motif-containing protein 1 OS=Danio rerio GN=kdelc1 PE=2 SV=1
Length = 500
Score = 101 bits (251), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 88/353 (24%), Positives = 156/353 (44%), Gaps = 39/353 (11%)
Query: 26 ECPEYFRWIHKDLEPWKHTGISR---EMLER-AKAHAQFRLVIINGDAYVEKYKNAYQTR 81
CP F I DL ++ R E+++R K+H+ I N Y++ + R
Sbjct: 151 HCPASFSQIESDLSIFQSVDPDRNAHEIIQRFGKSHSLCHYTIKNNQVYIKTHGEHVGFR 210
Query: 82 DVFTVWGILQLLRLYPGKVPDLELMFSCGDRPVVKKRDYEGANSTSPPPVFHYCGDQESL 141
+F +L L R K+PD+E + GD P+ K+R S +P PVF +CG ++
Sbjct: 211 -IFMDAFLLSLTR--KVKLPDIEFFVNLGDWPLEKRR-----ASQNPSPVFSWCGSNDTR 262
Query: 142 DIVFPDWSFWGWAETNIRPWNSILEDIK--EGNKRTKWINRAPYAYWKGNP-------YV 192
DIV P + E+ + + D+ +G+ W + +W+G V
Sbjct: 263 DIVMPTYDL---TESVLETMGRVSLDMMSVQGHTGPVWEKKINKGFWRGRDSRKERLELV 319
Query: 193 SIARED--LMKCNVTDKYQWKTRLYVQDWHKEKKQG--FEKSKLADQCTHRYKIYVEGQA 248
+AR + ++ +T+ + +K H E G + D ++Y+I V+G
Sbjct: 320 KLARANTAMLDAALTNFFFFK--------HDESLYGPLVKHVSFFDFFKYKYQINVDGTV 371
Query: 249 WSVSEKYILACDSMTLLIEPKYYDFYSRSLVPLQHYWPVRTARKCRDIKFAVEWGNTHTR 308
+ Y+LA DS+ + YY+ + L P HY P R+ D+ ++W H
Sbjct: 372 AAYRLPYLLAGDSVVFKHDSIYYEHFYNELQPWVHYIPFRS--DLSDLLEKIQWAKDHDE 429
Query: 309 QARAIGRAGSRYMQEKLKMKYVYDYMFHLLIEYAKLLKFEPRIPNEGKKVCAQ 361
+A+ I AG ++ + L V+ Y L +YA+L +P++ +G ++ Q
Sbjct: 430 EAKKIALAGQQFARTHLMGDSVFCYYHKLFQKYAELQVTKPKV-RDGMELVEQ 481
>sp|B0X1Q4|RUMI_CULQU O-glucosyltransferase rumi homolog OS=Culex quinquefasciatus
GN=CPIJ013394 PE=3 SV=1
Length = 403
Score = 99.4 bits (246), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 82/340 (24%), Positives = 152/340 (44%), Gaps = 36/340 (10%)
Query: 27 CPEYFRWIHKDLEPWKHTGISREMLERAKAHAQFRLVIINGDAYVEKYKNAYQTRDVF-- 84
C + + DL P++ +GI+++++E A+++ G Y ++ RD
Sbjct: 71 CSCHLDVLKTDLRPFR-SGITQDLIELARSY---------GTKYQIIGHRMFRQRDCMFP 120
Query: 85 -TVWGILQLLRLYPGKVPDLELMFSCGDRPVVKKRDYEGANSTSPPPVFHYCGDQESLDI 143
G+ +R K+PD+EL+ +C D P + + S P PV + + LDI
Sbjct: 121 ARCSGVEHFIRPNLPKLPDMELIINCRDWPQISR---HWNASREPLPVLSFSKTNDYLDI 177
Query: 144 VFPDWSFW-GWAETNIRP-----WNSILEDIKEGNKRTKWINRAPYAYWKG-------NP 190
++P W FW G ++ P W+ +++ K W + A+++G +P
Sbjct: 178 MYPTWGFWEGGPAISLYPTGLGRWDQHRVSVRKAAKVWPWEKKLQQAFFRGSRTSDERDP 237
Query: 191 YVSIAR--EDLMKCNVTDKYQWKTRLYVQDWHKEKKQGFEKSKLADQCTHRYKIYVEGQA 248
V ++R +L+ T W++ H E Q + +L D C ++Y G A
Sbjct: 238 LVLLSRMRPELVDAQYTKNQAWRSP--KDTLHAEPAQ---EVRLEDHCQYKYLFNFRGVA 292
Query: 249 WSVSEKYILACDSMTLLIEPKYYDFYSRSLVPLQHYWPVRTARKCRDIKFAVEWGNTHTR 308
S K++ C S+ + ++ +F+ SL P HY PV +++ +++ H +
Sbjct: 293 ASFRFKHLFLCKSLVFHVGQEWQEFFYDSLKPWVHYVPVPVGINEWELEHLIQFFREHDQ 352
Query: 309 QARAIGRAGSRYMQEKLKMKYVYDYMFHLLIEYAKLLKFE 348
A+ I G ++ L+M+ V Y LL Y KL+K+E
Sbjct: 353 LAQEIANRGYEHIWNHLRMEDVECYWKRLLRRYGKLVKYE 392
>sp|Q6UW63|KDEL1_HUMAN KDEL motif-containing protein 1 OS=Homo sapiens GN=KDELC1 PE=1 SV=1
Length = 502
Score = 97.4 bits (241), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 87/352 (24%), Positives = 156/352 (44%), Gaps = 38/352 (10%)
Query: 27 CPEYFRWIHKDLEPWKHTGISREMLERAKAHAQFRLV----IINGDAYVEKYKNAYQTRD 82
CPE I +DL + + +E K Q + + + + Y++ + R
Sbjct: 153 CPETIAQIQRDLAHFPAVDPEKIAVEIPKRFGQRQSLCHYTLKDNKVYIKTHGEHVGFR- 211
Query: 83 VFTVWGILQLLRLYPGKVPDLELMFSCGDRPVVKKRDYEGANSTSPPPVFHYCGDQESLD 142
+F +L L R K+PD+EL + GD P+ KK+ ++++ P+F +CG +S D
Sbjct: 212 IFMDAILLSLTRKV--KMPDVELFVNLGDWPLEKKK-----SNSNIHPIFSWCGSTDSKD 264
Query: 143 IVFPDWSFWGWAETNIRPWNSILEDIK--EGNKRTKWINRAPYAYWKGNP-------YVS 193
IV P + ++ + + D+ + N W ++ A W+G V
Sbjct: 265 IVMPTYDL---TDSVLETMGRVSLDMMSVQANTGPPWESKNSTAVWRGRDSRKERLELVK 321
Query: 194 IARE--DLMKCNVTDKYQWKTRLYVQDWHKEKKQG--FEKSKLADQCTHRYKIYVEGQAW 249
++R+ +L+ T+ + +K H E G + D H+Y+I ++G
Sbjct: 322 LSRKHPELIDAAFTNFFFFK--------HDENLYGPIVKHISFFDFFKHKYQINIDGTVA 373
Query: 250 SVSEKYILACDSMTLLIEPKYYDFYSRSLVPLQHYWPVRTARKCRDIKFAVEWGNTHTRQ 309
+ Y+L DS+ L + YY+ + L P +HY PV++ D+ ++W H +
Sbjct: 374 AYRLPYLLVGDSVVLKQDSIYYEHFYNELQPWKHYIPVKS--NLSDLLEKLKWAKDHDEE 431
Query: 310 ARAIGRAGSRYMQEKLKMKYVYDYMFHLLIEYAKLLKFEPRIPNEGKKVCAQ 361
A+ I +AG + + L ++ Y F L EYA L EP+I K+V Q
Sbjct: 432 AKKIAKAGQEFARNNLMGDDIFCYYFKLFQEYANLQVSEPQIREGMKRVEPQ 483
>sp|Q8T045|RUMI_DROME O-glucosyltransferase rumi OS=Drosophila melanogaster GN=rumi PE=1
SV=1
Length = 411
Score = 95.5 bits (236), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 81/350 (23%), Positives = 154/350 (44%), Gaps = 40/350 (11%)
Query: 19 SSEESLGECPEYFRWIHKDLEPWKHTGISREMLER-AKAHAQFRLVIINGDAYVEKYKNA 77
SS+ +C + + +DL P+K TG++R+M+E A+ +++ I G
Sbjct: 65 SSDPQDSDCSCHANVLKRDLAPYKSTGVTRQMIESSARYGTKYK---IYGHRLYRDANCM 121
Query: 78 YQTRDVFTVWGILQLLRLYPGKVPDLELMFSCGDRPVVKKRDYEGANST----SPPPVFH 133
+ R GI L +PD++L ++ RDY N+ + PVF
Sbjct: 122 FPAR----CEGIEHFLLPLVATLPDMDL--------IINTRDYPQLNAAWGNAAGGPVFS 169
Query: 134 YCGDQESLDIVFPDWSFW-GWAETNIRP-----WNSILEDIKEGNKRTKWINRAPYAYWK 187
+ +E DI++P W+FW G T + P W+ + E +++ W + +++
Sbjct: 170 FSKTKEYRDIMYPAWTFWAGGPATKLHPRGIGRWDQMREKLEKRAAAIPWSQKRSLGFFR 229
Query: 188 G-------NPYVSIARED--LMKCNVTDKYQWKTRLYVQDWHKEKKQGFEKSKLADQCTH 238
G + + ++R + L++ T WK+ D + FE D C +
Sbjct: 230 GSRTSDERDSLILLSRRNPELVEAQYTKNQGWKSPKDTLDAPAADEVSFE-----DHCKY 284
Query: 239 RYKIYVEGQAWSVSEKYILACDSMTLLIEPKYYDFYSRSLVPLQHYWPVRTARKCRDIKF 298
+Y G A S K++ C S+ + ++ +F+ L P HY P+++ ++ +
Sbjct: 285 KYLFNFRGVAASFRLKHLFLCKSLVFHVGDEWQEFFYDQLKPWVHYVPLKSYPSQQEYEH 344
Query: 299 AVEWGNTHTRQARAIGRAGSRYMQEKLKMKYVYDYMFHLLIEYAKLLKFE 348
+ + + A+ I + G ++ E L+MK + Y LL Y KLL++E
Sbjct: 345 ILSFFKKNDALAQEIAQRGYDFIWEHLRMKDIKCYWRKLLKRYVKLLQYE 394
>sp|Q29AU6|RUMI_DROPS O-glucosyltransferase rumi OS=Drosophila pseudoobscura
pseudoobscura GN=rumi PE=3 SV=1
Length = 409
Score = 92.8 bits (229), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 79/348 (22%), Positives = 148/348 (42%), Gaps = 36/348 (10%)
Query: 19 SSEESLGECPEYFRWIHKDLEPWKHTGISREMLERAKAHAQFRLVIINGDAYVEKYKNAY 78
SS+ + C + I DL P+K TG+SR+M+E + + G Y K Y
Sbjct: 63 SSDANDANCSCHAAVIKSDLAPYKATGVSRQMIESSARY---------GTRYKIYEKRLY 113
Query: 79 QTRDVF---TVWGILQLLRLYPGKVPDLELMFSCGDRPVVKKRDYEGANSTSPPPVFHYC 135
+ + GI L +PD++L+ + D P + GA P+ +
Sbjct: 114 REENCMFPARCQGIEHFLLPLVATLPDMDLVINTRDYPQINMAWGNGAQG----PILSFS 169
Query: 136 GDQESLDIVFPDWSFW-GWAETNIRP-----WNSILEDIKEGNKRTKWINRAPYAYWKG- 188
++ DI++P W+FW G T + P W+ + E +++ W + +++G
Sbjct: 170 KTKDHRDIMYPAWTFWAGGPATKLHPRGIGRWDLMREKLEKRAAAIPWSQKRELGFFRGS 229
Query: 189 ------NPYVSIARED--LMKCNVTDKYQWKTRLYVQDWHKEKKQGFEKSKLADQCTHRY 240
+ + ++R + L++ T WK+ D + FE D C ++Y
Sbjct: 230 RTSDERDSLILLSRRNPELVEAQYTKNQGWKSPKDTLDAPPAGEVSFE-----DHCKYKY 284
Query: 241 KIYVEGQAWSVSEKYILACDSMTLLIEPKYYDFYSRSLVPLQHYWPVRTARKCRDIKFAV 300
G A S K++ C S+ + ++ +F+ L P HY P++ ++ + +
Sbjct: 285 LFNFRGVAASFRLKHLFLCQSLVFHVGDEWQEFFYDQLKPWVHYVPLKNYPSQQEYEELL 344
Query: 301 EWGNTHTRQARAIGRAGSRYMQEKLKMKYVYDYMFHLLIEYAKLLKFE 348
+ + A+ I + G ++ + L+MK + Y LL Y KLL +E
Sbjct: 345 TFFRKNDALAQEIAQRGRDFIWQHLRMKDIKCYWRRLLKSYVKLLTYE 392
>sp|Q5E9Q1|PGLT1_BOVIN Protein O-glucosyltransferase 1 OS=Bos taurus GN=POGLUT1 PE=2 SV=1
Length = 392
Score = 92.4 bits (228), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 88/340 (25%), Positives = 154/340 (45%), Gaps = 40/340 (11%)
Query: 27 CPEYFRWIHKDLEPWKHTGISREMLE---RAKAHAQFRLVIINGDAYVEKYKNAYQTRDV 83
C Y I +DL P++ GISR+M+ R K ++ II Y E + +R
Sbjct: 54 CSCYHGVIEEDLTPFR-GGISRKMMAEVVRRKLGTHYQ--IIKNRLYRES-DCMFPSRCS 109
Query: 84 FTVWGILQLLRLYPGKVPDLELMFSCGDRPVVKKRDYEGANSTSPPPVFHYCGDQESLDI 143
IL+++ G++PD+E++ + D P V K P+F + E DI
Sbjct: 110 GVEHFILEVI----GRLPDMEMVINVRDYPQVPKWMEPAI------PIFSFSKTLEYHDI 159
Query: 144 VFPDWSFW--GWAETNIRP-----WNSILEDIKEGNKRTKWINRAPYAYWKG-------N 189
++P W+FW G A I P W+ ED+ + W + AY++G +
Sbjct: 160 MYPAWTFWEGGPAVWPIYPMGLGRWDLFREDLVRSAAQWPWKKKNSTAYFRGSRTSPERD 219
Query: 190 PYVSIARED--LMKCNVTDKYQWKTRLYVQDWHKEKKQGFEKSKLADQCTHRYKIYVEGQ 247
P + ++R++ L+ T WK+ ++D K + L D C ++Y G
Sbjct: 220 PLILLSRKNPKLVDAEYTKNQAWKS---MKD--TLGKPAAKDVHLVDHCKYKYLFNFRGV 274
Query: 248 AWSVSEKYILACDSMTLLIEPKYYDFYSRSLVPLQHYWPVRTARKCRDIKFAVEWGNTHT 307
A S K++ C S+ + ++ +F+ L P HY PV+T +++ +++ +
Sbjct: 275 AASFRFKHLFLCGSLVFHVGDEWLEFFYPQLKPWVHYIPVKT--DLSNVQELLQFVKAND 332
Query: 308 RQARAIGRAGSRYMQEKLKMKYVYDYMFHLLIEYAKLLKF 347
A+ I GS+++ LKM + Y +LL EY+K L +
Sbjct: 333 DVAQEIAERGSQFILNHLKMDDITCYWENLLTEYSKFLSY 372
>sp|A0NDG6|RUMI_ANOGA O-glucosyltransferase rumi homolog OS=Anopheles gambiae
GN=AGAP004267 PE=3 SV=1
Length = 399
Score = 92.4 bits (228), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 81/329 (24%), Positives = 143/329 (43%), Gaps = 33/329 (10%)
Query: 34 IHKDLEPWKHTGISREMLERAKAHAQFRLVIINGDAYVEKYKNAYQTRDVFTVWGILQLL 93
+ DL+P+K GI++EM+ RAK + VI G + + + R G+ +
Sbjct: 78 LKADLKPFKAHGITKEMINRAKQYGTHYQVI--GHKLYRQRECMFPAR----CSGVEHFV 131
Query: 94 RLYPGKVPDLELMFSCGDRPVVKKRDYEGANSTSPPPVFHYCGDQESLDIVFPDWSFW-- 151
R +PD++L+ +C D P + + S PV + E LDI++P W+FW
Sbjct: 132 RPLLPLLPDMDLIVNCRDWPQIHRH-----WSKEKIPVLSFSKTAEYLDIMYPAWAFWEG 186
Query: 152 ----GWAETNIRPWNSILEDIKEGNKRTKWINRAPYAYWKGN---------PYVSIARED 198
T + W+ + I + + W + P A+++G+ +S A+
Sbjct: 187 GPAIALYPTGLGRWDLHRQTITKAS--ADWEAKEPKAFFRGSRTSDERDALVLLSRAQPS 244
Query: 199 LMKCNVTDKYQWKTRLYVQDWHKEKKQGFEKSKLADQCTHRYKIYVEGQAWSVSEKYILA 258
L+ T WK+ QD + + L + C +R+ G A S K++
Sbjct: 245 LVDAQYTKNQAWKS---PQD--TLNAEPAREVTLEEHCRYRFLFNFRGVAASFRFKHLFL 299
Query: 259 CDSMTLLIEPKYYDFYSRSLVPLQHYWPVRTARKCRDIKFAVEWGNTHTRQARAIGRAGS 318
C S+ + ++ +F+ SL P HY PV +++ + + H + ARAI G
Sbjct: 300 CRSLVFHVGDEWQEFFYPSLKPWVHYVPVPVRSTPEELEALITFFQEHDQLARAIAERGY 359
Query: 319 RYMQEKLKMKYVYDYMFHLLIEYAKLLKF 347
++ L+M V Y LL Y KL+++
Sbjct: 360 EHIWNHLRMADVECYWKKLLKRYGKLIRY 388
>sp|Q9JHP7|KDEL1_MOUSE KDEL motif-containing protein 1 OS=Mus musculus GN=Kdelc1 PE=1 SV=1
Length = 502
Score = 90.9 bits (224), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 85/352 (24%), Positives = 154/352 (43%), Gaps = 38/352 (10%)
Query: 27 CPEYFRWIHKDLEPWKHTG---ISREMLER-AKAHAQFRLVIINGDAYVEKYKNAYQTRD 82
C E I KDL + I+ E+ +R + + + + Y++ + R
Sbjct: 153 CSETISQIQKDLAHFPTVDPEKIAAEIPKRFGQRQSLCHYTLKDNKVYIKTHGEHVGFR- 211
Query: 83 VFTVWGILQLLRLYPGKVPDLELMFSCGDRPVVKKRDYEGANSTSPPPVFHYCGDQESLD 142
+F +L L R ++PD+E + GD P+ KK+ ++++ P+F +CG ES D
Sbjct: 212 IFMDAILLSLTRKV--RMPDVEFFVNLGDWPLEKKK-----SNSNIQPIFSWCGSTESRD 264
Query: 143 IVFPDWSFWGWAETNIRPWNSILEDIK--EGNKRTKWINRAPYAYWKGNP-------YVS 193
IV P + ++ + + D+ + N W ++ A W+G V
Sbjct: 265 IVMPTYDL---TDSVLETMGRVSLDMMSVQANTGPPWESKNSTAVWRGRDSRKERLELVK 321
Query: 194 IARE--DLMKCNVTDKYQWKTRLYVQDWHKEKKQG--FEKSKLADQCTHRYKIYVEGQAW 249
++R+ +L+ T+ + +K H E G + D H+Y+I ++G
Sbjct: 322 LSRKHPELIDAAFTNFFFFK--------HDESLYGPIVKHISFFDFFKHKYQINIDGTVA 373
Query: 250 SVSEKYILACDSMTLLIEPKYYDFYSRSLVPLQHYWPVRTARKCRDIKFAVEWGNTHTRQ 309
+ Y+L DS+ L + YY+ + L P +HY PV++ D+ ++W H +
Sbjct: 374 AYRLPYLLVGDSVVLKQDSIYYEHFYNELQPWKHYIPVKS--NLSDLLEKLKWAKEHDAE 431
Query: 310 ARAIGRAGSRYMQEKLKMKYVYDYMFHLLIEYAKLLKFEPRIPNEGKKVCAQ 361
A+ I +AG + + L ++ Y F L YA L EP+I K+V Q
Sbjct: 432 AKKIAKAGQEFARNNLMGDDIFCYYFKLFQGYANLQVSEPQIREGMKRVEPQ 483
>sp|Q7Z4H8|KDEL2_HUMAN KDEL motif-containing protein 2 OS=Homo sapiens GN=KDELC2 PE=1 SV=2
Length = 507
Score = 90.9 bits (224), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 84/353 (23%), Positives = 157/353 (44%), Gaps = 39/353 (11%)
Query: 27 CPEYFRWIHKDLEPWKHTGISREMLERAKAHAQFR-----LVIINGDAYVEKYKNAYQTR 81
CP I KD + + + + E K R I+N Y + Y
Sbjct: 156 CPTKEPQIAKDFASFPSINLQQMLKEVPKRFGDERGAIVHYTILNNHVY-RRSLGKYTDF 214
Query: 82 DVFTVWGILQLLRLYPGKVPDLELMFSCGDRPVVKKRDYEGANST-SPPPVFHYCGDQES 140
+F+ +L L R +PDLE + GD P+ ++ N T SP P+ +CG +S
Sbjct: 215 KMFSDEILLSLTRKV--LLPDLEFYVNLGDWPL----EHRKVNGTPSPIPIISWCGSLDS 268
Query: 141 LDIVFPDWSFWGWAETNIRPWNSILEDIKEGNKRTKWINRAPYAYWKGNP-------YVS 193
D+V P + +R + L I +GN WIN+ A+++G V
Sbjct: 269 RDVVLPTYDITHSMLEAMRGVTNDLLSI-QGNTGPSWINKTERAFFRGRDSREERLQLVQ 327
Query: 194 IARED--LMKCNVTDKYQWKTRLYVQDWHKEKKQGFEKSKLA---DQCTHRYKIYVEGQA 248
+++E+ L+ +T + + +EK++ K+KL D ++Y++ V+G
Sbjct: 328 LSKENPQLLDAGITGYF----------FFQEKEKELGKAKLMGFFDFFKYKYQVNVDGTV 377
Query: 249 WSVSEKYILACDSMTLLIEPKYYDFYSRSLVPLQHYWPVRTARKCRDIKFAVEWGNTHTR 308
+ Y++ DS+ L + YY+ + +L P +HY P++ R D+ V+W +
Sbjct: 378 AAYRYPYLMLGDSLVLKQDSPYYEHFYMALEPWKHYVPIK--RNLSDLLEKVKWAKENDE 435
Query: 309 QARAIGRAGSRYMQEKLKMKYVYDYMFHLLIEYAKLLKFEPRIPNEGKKVCAQ 361
+A+ I + G ++ L+ +Y Y + +L +YA+ +P + +G ++ Q
Sbjct: 436 EAKKIAKEGQLMARDLLQPHRLYCYYYQVLQKYAERQSSKPEV-RDGMELVPQ 487
>sp|Q8NBL1|PGLT1_HUMAN Protein O-glucosyltransferase 1 OS=Homo sapiens GN=POGLUT1 PE=1
SV=1
Length = 392
Score = 90.9 bits (224), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 83/337 (24%), Positives = 150/337 (44%), Gaps = 34/337 (10%)
Query: 27 CPEYFRWIHKDLEPWKHTGISREMLERAKAHAQFRLVIINGDAYVEKYKNAYQTRDVFTV 86
C Y I +DL P++ GISR+M+ I + + + +R
Sbjct: 54 CSCYHGVIEEDLTPFR-GGISRKMMAEVVRRKLGTHYQITKNRLYRENDCMFPSRCSGVE 112
Query: 87 WGILQLLRLYPGKVPDLELMFSCGDRPVVKKRDYEGANSTSPPPVFHYCGDQESLDIVFP 146
IL+++ G++PD+E++ + D P V K PVF + E DI++P
Sbjct: 113 HFILEVI----GRLPDMEMVINVRDYPQVPKWMEPAI------PVFSFSKTSEYHDIMYP 162
Query: 147 DWSFW--GWAETNIRP-----WNSILEDIKEGNKRTKWINRAPYAYWKG-------NPYV 192
W+FW G A I P W+ ED+ + W + AY++G +P +
Sbjct: 163 AWTFWEGGPAVWPIYPTGLGRWDLFREDLVRSAAQWPWKKKNSTAYFRGSRTSPERDPLI 222
Query: 193 SIARED--LMKCNVTDKYQWKTRLYVQDWHKEKKQGFEKSKLADQCTHRYKIYVEGQAWS 250
++R++ L+ T WK+ ++D K + L D C ++Y G A S
Sbjct: 223 LLSRKNPKLVDAEYTKNQAWKS---MKD--TLGKPAAKDVHLVDHCKYKYLFNFRGVAAS 277
Query: 251 VSEKYILACDSMTLLIEPKYYDFYSRSLVPLQHYWPVRTARKCRDIKFAVEWGNTHTRQA 310
K++ C S+ + ++ +F+ L P HY PV+T +++ +++ + A
Sbjct: 278 FRFKHLFLCGSLVFHVGDEWLEFFYPQLKPWVHYIPVKT--DLSNVQELLQFVKANDDVA 335
Query: 311 RAIGRAGSRYMQEKLKMKYVYDYMFHLLIEYAKLLKF 347
+ I GS++++ L+M + Y +LL EY+K L +
Sbjct: 336 QEIAERGSQFIRNHLQMDDITCYWENLLSEYSKFLSY 372
>sp|Q16QY8|RUMI_AEDAE O-glucosyltransferase rumi homolog OS=Aedes aegypti GN=AAEL011121
PE=3 SV=1
Length = 402
Score = 87.8 bits (216), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 88/339 (25%), Positives = 153/339 (45%), Gaps = 32/339 (9%)
Query: 25 GECPEYFRWIHKDLEPWKHTGISREMLERAKAHAQFRLVIINGDAYVEKYKNAYQTRDVF 84
C + + DL P+K GIS +M+ERA+++ + I++ Y +K + R
Sbjct: 70 ANCSCHADVLKTDLRPFK-GGISEQMVERARSYGT-KYQIVDHRLYRQK-DCMFPAR--- 123
Query: 85 TVWGILQLLRLYPGKVPDLELMFSCGDRPVVKKRDYEGANSTSPPPVFHYCGDQESLDIV 144
G+ ++ +PD+EL+ +C D P + + PV + + LDI+
Sbjct: 124 -CSGVEHFIKPNLPHLPDMELIINCRDWPQINRH-----WKQEKLPVLSFSKTDDYLDIM 177
Query: 145 FPDWSFW-GWAETNIRP-----WNSILEDIKEGNKRTKWINRAPYAYWKG-------NPY 191
+P W FW G ++ P W+ IK+ KW + A+++G +P
Sbjct: 178 YPTWGFWEGGPAISLYPTGLGRWDQHRVSIKKAADSWKWEKKKAKAFFRGSRTSDERDPL 237
Query: 192 VSIARE--DLMKCNVTDKYQWKTRLYVQDWHKEKKQGFEKSKLADQCTHRYKIYVEGQAW 249
V ++R +L+ T WK+ +D K ++ +L D C ++Y G A
Sbjct: 238 VLLSRRKPELVDAQYTKNQAWKS---PKDTLNAKPA--QEVRLEDHCQYKYLFNFRGVAA 292
Query: 250 SVSEKYILACDSMTLLIEPKYYDFYSRSLVPLQHYWPVRTARKCRDIKFAVEWGNTHTRQ 309
S K++ C S+ + ++ +F+ SL P HY PVR +++ +E+ H
Sbjct: 293 SFRFKHLFLCRSLVFHVGSEWQEFFYPSLKPWVHYVPVRVGATQEELEELIEFFAEHDDL 352
Query: 310 ARAIGRAGSRYMQEKLKMKYVYDYMFHLLIEYAKLLKFE 348
AR I G ++ + L+MK V Y LL Y KL+K+E
Sbjct: 353 AREIADRGFEHVWKHLRMKDVECYWRKLLRRYGKLVKYE 391
>sp|Q8BYB9|PGLT1_MOUSE Protein O-glucosyltransferase 1 OS=Mus musculus GN=Poglut1 PE=2
SV=2
Length = 392
Score = 87.0 bits (214), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 82/338 (24%), Positives = 150/338 (44%), Gaps = 36/338 (10%)
Query: 27 CPEYFRWIHKDLEPWKHTGISREML-ERAKAHAQFRLVIINGDAYVEKYKNAYQTRDVFT 85
C Y I +DL P++ GISR+M+ E + II + E + +R
Sbjct: 54 CSCYHGVIEEDLTPFR-GGISRKMMAEVVRRKLGTHYQIIKNRLFRED-DCMFPSRCSGV 111
Query: 86 VWGILQLLRLYPGKVPDLELMFSCGDRPVVKKRDYEGANSTSPPPVFHYCGDQESLDIVF 145
IL+++ ++PD+E++ + D P V K PVF + E DI++
Sbjct: 112 EHFILEVIH----RLPDMEMVINVRDYPQVPKW------MEPTIPVFSFSKTSEYHDIMY 161
Query: 146 PDWSFWGWAE-------TNIRPWNSILEDIKEGNKRTKWINRAPYAYWKG-------NPY 191
P W+FW T + W+ ED+ + W + AY++G +P
Sbjct: 162 PAWTFWEGGPAVWPLYPTGLGRWDLFREDLLRSAAQWPWEKKNSTAYFRGSRTSPERDPL 221
Query: 192 VSIARED--LMKCNVTDKYQWKTRLYVQDWHKEKKQGFEKSKLADQCTHRYKIYVEGQAW 249
+ ++R++ L+ T WK+ ++D K + L D C +RY G A
Sbjct: 222 ILLSRKNPKLVDAEYTKNQAWKS---MKD--TLGKPAAKDVHLIDHCKYRYLFNFRGVAA 276
Query: 250 SVSEKYILACDSMTLLIEPKYYDFYSRSLVPLQHYWPVRTARKCRDIKFAVEWGNTHTRQ 309
S K++ C S+ + ++ +F+ L P HY PV+T +++ +++ +
Sbjct: 277 SFRFKHLFLCGSLVFHVGDEWVEFFYPQLKPWVHYIPVKT--DLSNVQELLQFVKANDDI 334
Query: 310 ARAIGRAGSRYMQEKLKMKYVYDYMFHLLIEYAKLLKF 347
A+ I + GS+++ L+M + Y +LL +Y+K L +
Sbjct: 335 AQEIAKRGSQFIINHLQMDDITCYWENLLTDYSKFLSY 372
>sp|Q566E5|KDEL2_RAT KDEL motif-containing protein 2 OS=Rattus norvegicus GN=Kdelc2 PE=2
SV=1
Length = 508
Score = 85.5 bits (210), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 67/272 (24%), Positives = 129/272 (47%), Gaps = 30/272 (11%)
Query: 100 VPDLELMFSCGDRPVVKKRDYEGANST-SPPPVFHYCGDQESLDIVFPDWSFWGWAETNI 158
+PDLE + GD P+ ++ N T P P+ +CG +S DI+ P + +
Sbjct: 232 LPDLEFYINLGDWPL----EHRKVNDTPGPIPIISWCGSLDSRDIILPTYDVTHSTLEAM 287
Query: 159 RPWNSILEDIKEGNKRTKWINRAPYAYWKGNP-------YVSIARED--LMKCNVTDKYQ 209
R + L + +GN WIN+ A+++G V +++E+ L+ +T +
Sbjct: 288 RGVTNDLLSV-QGNTGPSWINKTEKAFFRGRDSREERLQLVLLSKENPQLLDAGITGYF- 345
Query: 210 WKTRLYVQDWHKEKKQGFEKSKLA---DQCTHRYKIYVEGQAWSVSEKYILACDSMTLLI 266
+ +EK++ K+KL D ++Y++ V+G + Y++ DS+ L
Sbjct: 346 ---------FFQEKEKELGKAKLMGFFDFFKYKYQVNVDGTVAAYRYPYLMLGDSLVLKQ 396
Query: 267 EPKYYDFYSRSLVPLQHYWPVRTARKCRDIKFAVEWGNTHTRQARAIGRAGSRYMQEKLK 326
E YY+ + L P +HY P++ R D+ V+W + +A+ I + G ++ L+
Sbjct: 397 ESPYYEHFYVELRPWKHYVPIK--RNLSDLLEKVKWAKENDEEAKRIAKEGQLTARDLLQ 454
Query: 327 MKYVYDYMFHLLIEYAKLLKFEPRIPNEGKKV 358
+Y Y + +L +YA+ +P I + ++V
Sbjct: 455 PPRLYCYYYRVLQKYAERQVSKPMIRDGMERV 486
>sp|Q197C5|VF179_IIV3 Uncharacterized protein 035R OS=Invertebrate iridescent virus 3
GN=IIV3-035R PE=4 SV=1
Length = 1098
Score = 42.4 bits (98), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 36/161 (22%), Positives = 66/161 (40%), Gaps = 14/161 (8%)
Query: 192 VSIAREDLMKCNVTDKYQWKTRL-------YVQDWHKEKKQGFEKSKLADQCTHRYKIYV 244
+ R DL+ +T+ W R+ Y+Q ++ Q +++ ++V
Sbjct: 336 LGTVRPDLLDAGITN---WNLRVRVSKHSPYLQIPDPGTLTAVDRLSPHQQSQYKFIVHV 392
Query: 245 EGQAWSVSEKYILACDSMTLLIEP--KYYDFYSRSLVPLQHYWPVRTARKCRDIKFAVEW 302
EG + L S LL++ + +YS L P HY PVR D+ +EW
Sbjct: 393 EGHVSAFRLSLELGMKSCILLVQSLHGWKMWYSDLLKPWVHYVPVRP--DLSDLFDRIEW 450
Query: 303 GNTHTRQARAIGRAGSRYMQEKLKMKYVYDYMFHLLIEYAK 343
+ Q RA+ ++ + L + + D++ H L A+
Sbjct: 451 CRANDAQCRAMAENAYQFYRTHLDKESILDHLQHTLNRLAR 491
>sp|O55767|VF179_IIV6 Uncharacterized protein 179R OS=Invertebrate iridescent virus 6
GN=IIV6-179R PE=4 SV=1
Length = 1186
Score = 38.9 bits (89), Expect = 0.081, Method: Compositional matrix adjust.
Identities = 79/375 (21%), Positives = 142/375 (37%), Gaps = 96/375 (25%)
Query: 99 KVPDLELMFSCGDRPVVKKRDYE------GAN-----------STSPPPVFHYCGDQESL 141
+VPD+EL + D P++ K E G + S P+ C
Sbjct: 206 EVPDIELFINRRDFPLLTKNGTEPYYNIFGKDHSLDSKSLKLISEGMCPILSMCTSDMYA 265
Query: 142 DIVFP---DWSFWGWAE---TNI----RPWNSI---------LEDIKEGNKRTKWINRAP 182
DIV P DW+ E T++ NS+ +D + N T W R P
Sbjct: 266 DIVIPTHEDWARVASTEGVKTHVGDKENNENSLPKKITFPPQCKDYSKDNFNTPWEKRIP 325
Query: 183 YAYWKG---------NPYVSIAREDLMKCNVT-----DKY----------QWKTR----- 213
A ++G + + + L+ ++ DKY +W R
Sbjct: 326 TAVFRGGSTGCGVSSDTNLETFNQRLVAAKISYNSKPDKYNVPLITAGITKWNLRPRKIL 385
Query: 214 --LYVQDWHKEKKQGFEKSKLA-----DQCTHRYKIYVEGQ--AWSVSEKYILACDSMTL 264
Y+Q + EK E K++ +Q ++Y I ++G A+ +S + + C L
Sbjct: 386 NEKYLQTINIEK----EAPKVSPLTPEEQSKYKYIINIDGHVSAFRLSLEMSMGC--CIL 439
Query: 265 LIEPKYYD-------FYSRSLVPLQHYWPVRTARKCRDIKFAVEWGNTHTRQARAIGRAG 317
L++ K + ++S L P HY PV++ D+ ++W + + + I +
Sbjct: 440 LVKSKIPNETFGWKMWFSHLLKPYIHYVPVKS--DLSDLIEKIQWCRDNDEKCKEISQEA 497
Query: 318 SRYMQEKLKMKYVYDYMFHLLIEYAKLLKFEPRIPNEGKKVCAQKLASSQNGLGKRFMVE 377
++ Q L + + DYM +L++ K + P + V QN + +++
Sbjct: 498 LKFYQTYLSRESILDYMQNLMV------KLKLSFPT-NEIVYGTDPLFIQNSVQSKYLFN 550
Query: 378 SMVNSSSETLPCTMP 392
M NS + P P
Sbjct: 551 MMNNSINPIFPFLTP 565
>sp|A4WDE9|UNG_ENT38 Uracil-DNA glycosylase OS=Enterobacter sp. (strain 638) GN=ung PE=3
SV=1
Length = 229
Score = 32.3 bits (72), Expect = 6.7, Method: Compositional matrix adjust.
Identities = 23/82 (28%), Positives = 35/82 (42%), Gaps = 4/82 (4%)
Query: 11 STPPFLNSSSEESLGECPEYFRWIHKDLEPWKHTGI----SREMLERAKAHAQFRLVIIN 66
+TPP L + +E G P + R H LE W H G+ + + +AH+ L
Sbjct: 84 ATPPSLLNMYKELEGSLPGFVRPAHGYLESWAHQGVLLLNTVLTVRAGQAHSHASLGWET 143
Query: 67 GDAYVEKYKNAYQTRDVFTVWG 88
V N ++ VF +WG
Sbjct: 144 FTDKVISLINEHREGVVFLLWG 165
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.319 0.135 0.431
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 167,125,382
Number of Sequences: 539616
Number of extensions: 7029264
Number of successful extensions: 14337
Number of sequences better than 100.0: 20
Number of HSP's better than 100.0 without gapping: 14
Number of HSP's successfully gapped in prelim test: 6
Number of HSP's that attempted gapping in prelim test: 14275
Number of HSP's gapped (non-prelim): 28
length of query: 428
length of database: 191,569,459
effective HSP length: 120
effective length of query: 308
effective length of database: 126,815,539
effective search space: 39059186012
effective search space used: 39059186012
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 63 (28.9 bits)