BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 044043
         (428 letters)

Database: swissprot 
           539,616 sequences; 191,569,459 total letters

Searching..................................................done



>sp|Q7ZVE6|KDEL1_DANRE KDEL motif-containing protein 1 OS=Danio rerio GN=kdelc1 PE=2 SV=1
          Length = 500

 Score =  101 bits (251), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 88/353 (24%), Positives = 156/353 (44%), Gaps = 39/353 (11%)

Query: 26  ECPEYFRWIHKDLEPWKHTGISR---EMLER-AKAHAQFRLVIINGDAYVEKYKNAYQTR 81
            CP  F  I  DL  ++     R   E+++R  K+H+     I N   Y++ +      R
Sbjct: 151 HCPASFSQIESDLSIFQSVDPDRNAHEIIQRFGKSHSLCHYTIKNNQVYIKTHGEHVGFR 210

Query: 82  DVFTVWGILQLLRLYPGKVPDLELMFSCGDRPVVKKRDYEGANSTSPPPVFHYCGDQESL 141
            +F    +L L R    K+PD+E   + GD P+ K+R      S +P PVF +CG  ++ 
Sbjct: 211 -IFMDAFLLSLTR--KVKLPDIEFFVNLGDWPLEKRR-----ASQNPSPVFSWCGSNDTR 262

Query: 142 DIVFPDWSFWGWAETNIRPWNSILEDIK--EGNKRTKWINRAPYAYWKGNP-------YV 192
           DIV P +      E+ +     +  D+   +G+    W  +    +W+G          V
Sbjct: 263 DIVMPTYDL---TESVLETMGRVSLDMMSVQGHTGPVWEKKINKGFWRGRDSRKERLELV 319

Query: 193 SIARED--LMKCNVTDKYQWKTRLYVQDWHKEKKQG--FEKSKLADQCTHRYKIYVEGQA 248
            +AR +  ++   +T+ + +K        H E   G   +     D   ++Y+I V+G  
Sbjct: 320 KLARANTAMLDAALTNFFFFK--------HDESLYGPLVKHVSFFDFFKYKYQINVDGTV 371

Query: 249 WSVSEKYILACDSMTLLIEPKYYDFYSRSLVPLQHYWPVRTARKCRDIKFAVEWGNTHTR 308
            +    Y+LA DS+    +  YY+ +   L P  HY P R+     D+   ++W   H  
Sbjct: 372 AAYRLPYLLAGDSVVFKHDSIYYEHFYNELQPWVHYIPFRS--DLSDLLEKIQWAKDHDE 429

Query: 309 QARAIGRAGSRYMQEKLKMKYVYDYMFHLLIEYAKLLKFEPRIPNEGKKVCAQ 361
           +A+ I  AG ++ +  L    V+ Y   L  +YA+L   +P++  +G ++  Q
Sbjct: 430 EAKKIALAGQQFARTHLMGDSVFCYYHKLFQKYAELQVTKPKV-RDGMELVEQ 481


>sp|B0X1Q4|RUMI_CULQU O-glucosyltransferase rumi homolog OS=Culex quinquefasciatus
           GN=CPIJ013394 PE=3 SV=1
          Length = 403

 Score = 99.4 bits (246), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 82/340 (24%), Positives = 152/340 (44%), Gaps = 36/340 (10%)

Query: 27  CPEYFRWIHKDLEPWKHTGISREMLERAKAHAQFRLVIINGDAYVEKYKNAYQTRDVF-- 84
           C  +   +  DL P++ +GI+++++E A+++         G  Y       ++ RD    
Sbjct: 71  CSCHLDVLKTDLRPFR-SGITQDLIELARSY---------GTKYQIIGHRMFRQRDCMFP 120

Query: 85  -TVWGILQLLRLYPGKVPDLELMFSCGDRPVVKKRDYEGANSTSPPPVFHYCGDQESLDI 143
               G+   +R    K+PD+EL+ +C D P + +       S  P PV  +    + LDI
Sbjct: 121 ARCSGVEHFIRPNLPKLPDMELIINCRDWPQISR---HWNASREPLPVLSFSKTNDYLDI 177

Query: 144 VFPDWSFW-GWAETNIRP-----WNSILEDIKEGNKRTKWINRAPYAYWKG-------NP 190
           ++P W FW G    ++ P     W+     +++  K   W  +   A+++G       +P
Sbjct: 178 MYPTWGFWEGGPAISLYPTGLGRWDQHRVSVRKAAKVWPWEKKLQQAFFRGSRTSDERDP 237

Query: 191 YVSIAR--EDLMKCNVTDKYQWKTRLYVQDWHKEKKQGFEKSKLADQCTHRYKIYVEGQA 248
            V ++R   +L+    T    W++       H E  Q   + +L D C ++Y     G A
Sbjct: 238 LVLLSRMRPELVDAQYTKNQAWRSP--KDTLHAEPAQ---EVRLEDHCQYKYLFNFRGVA 292

Query: 249 WSVSEKYILACDSMTLLIEPKYYDFYSRSLVPLQHYWPVRTARKCRDIKFAVEWGNTHTR 308
            S   K++  C S+   +  ++ +F+  SL P  HY PV       +++  +++   H +
Sbjct: 293 ASFRFKHLFLCKSLVFHVGQEWQEFFYDSLKPWVHYVPVPVGINEWELEHLIQFFREHDQ 352

Query: 309 QARAIGRAGSRYMQEKLKMKYVYDYMFHLLIEYAKLLKFE 348
            A+ I   G  ++   L+M+ V  Y   LL  Y KL+K+E
Sbjct: 353 LAQEIANRGYEHIWNHLRMEDVECYWKRLLRRYGKLVKYE 392


>sp|Q6UW63|KDEL1_HUMAN KDEL motif-containing protein 1 OS=Homo sapiens GN=KDELC1 PE=1 SV=1
          Length = 502

 Score = 97.4 bits (241), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 87/352 (24%), Positives = 156/352 (44%), Gaps = 38/352 (10%)

Query: 27  CPEYFRWIHKDLEPWKHTGISREMLERAKAHAQFRLV----IINGDAYVEKYKNAYQTRD 82
           CPE    I +DL  +      +  +E  K   Q + +    + +   Y++ +      R 
Sbjct: 153 CPETIAQIQRDLAHFPAVDPEKIAVEIPKRFGQRQSLCHYTLKDNKVYIKTHGEHVGFR- 211

Query: 83  VFTVWGILQLLRLYPGKVPDLELMFSCGDRPVVKKRDYEGANSTSPPPVFHYCGDQESLD 142
           +F    +L L R    K+PD+EL  + GD P+ KK+     ++++  P+F +CG  +S D
Sbjct: 212 IFMDAILLSLTRKV--KMPDVELFVNLGDWPLEKKK-----SNSNIHPIFSWCGSTDSKD 264

Query: 143 IVFPDWSFWGWAETNIRPWNSILEDIK--EGNKRTKWINRAPYAYWKGNP-------YVS 193
           IV P +      ++ +     +  D+   + N    W ++   A W+G          V 
Sbjct: 265 IVMPTYDL---TDSVLETMGRVSLDMMSVQANTGPPWESKNSTAVWRGRDSRKERLELVK 321

Query: 194 IARE--DLMKCNVTDKYQWKTRLYVQDWHKEKKQG--FEKSKLADQCTHRYKIYVEGQAW 249
           ++R+  +L+    T+ + +K        H E   G   +     D   H+Y+I ++G   
Sbjct: 322 LSRKHPELIDAAFTNFFFFK--------HDENLYGPIVKHISFFDFFKHKYQINIDGTVA 373

Query: 250 SVSEKYILACDSMTLLIEPKYYDFYSRSLVPLQHYWPVRTARKCRDIKFAVEWGNTHTRQ 309
           +    Y+L  DS+ L  +  YY+ +   L P +HY PV++     D+   ++W   H  +
Sbjct: 374 AYRLPYLLVGDSVVLKQDSIYYEHFYNELQPWKHYIPVKS--NLSDLLEKLKWAKDHDEE 431

Query: 310 ARAIGRAGSRYMQEKLKMKYVYDYMFHLLIEYAKLLKFEPRIPNEGKKVCAQ 361
           A+ I +AG  + +  L    ++ Y F L  EYA L   EP+I    K+V  Q
Sbjct: 432 AKKIAKAGQEFARNNLMGDDIFCYYFKLFQEYANLQVSEPQIREGMKRVEPQ 483


>sp|Q8T045|RUMI_DROME O-glucosyltransferase rumi OS=Drosophila melanogaster GN=rumi PE=1
           SV=1
          Length = 411

 Score = 95.5 bits (236), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 81/350 (23%), Positives = 154/350 (44%), Gaps = 40/350 (11%)

Query: 19  SSEESLGECPEYFRWIHKDLEPWKHTGISREMLER-AKAHAQFRLVIINGDAYVEKYKNA 77
           SS+    +C  +   + +DL P+K TG++R+M+E  A+   +++   I G          
Sbjct: 65  SSDPQDSDCSCHANVLKRDLAPYKSTGVTRQMIESSARYGTKYK---IYGHRLYRDANCM 121

Query: 78  YQTRDVFTVWGILQLLRLYPGKVPDLELMFSCGDRPVVKKRDYEGANST----SPPPVFH 133
           +  R      GI   L      +PD++L        ++  RDY   N+     +  PVF 
Sbjct: 122 FPAR----CEGIEHFLLPLVATLPDMDL--------IINTRDYPQLNAAWGNAAGGPVFS 169

Query: 134 YCGDQESLDIVFPDWSFW-GWAETNIRP-----WNSILEDIKEGNKRTKWINRAPYAYWK 187
           +   +E  DI++P W+FW G   T + P     W+ + E +++      W  +    +++
Sbjct: 170 FSKTKEYRDIMYPAWTFWAGGPATKLHPRGIGRWDQMREKLEKRAAAIPWSQKRSLGFFR 229

Query: 188 G-------NPYVSIARED--LMKCNVTDKYQWKTRLYVQDWHKEKKQGFEKSKLADQCTH 238
           G       +  + ++R +  L++   T    WK+     D     +  FE     D C +
Sbjct: 230 GSRTSDERDSLILLSRRNPELVEAQYTKNQGWKSPKDTLDAPAADEVSFE-----DHCKY 284

Query: 239 RYKIYVEGQAWSVSEKYILACDSMTLLIEPKYYDFYSRSLVPLQHYWPVRTARKCRDIKF 298
           +Y     G A S   K++  C S+   +  ++ +F+   L P  HY P+++    ++ + 
Sbjct: 285 KYLFNFRGVAASFRLKHLFLCKSLVFHVGDEWQEFFYDQLKPWVHYVPLKSYPSQQEYEH 344

Query: 299 AVEWGNTHTRQARAIGRAGSRYMQEKLKMKYVYDYMFHLLIEYAKLLKFE 348
            + +   +   A+ I + G  ++ E L+MK +  Y   LL  Y KLL++E
Sbjct: 345 ILSFFKKNDALAQEIAQRGYDFIWEHLRMKDIKCYWRKLLKRYVKLLQYE 394


>sp|Q29AU6|RUMI_DROPS O-glucosyltransferase rumi OS=Drosophila pseudoobscura
           pseudoobscura GN=rumi PE=3 SV=1
          Length = 409

 Score = 92.8 bits (229), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 79/348 (22%), Positives = 148/348 (42%), Gaps = 36/348 (10%)

Query: 19  SSEESLGECPEYFRWIHKDLEPWKHTGISREMLERAKAHAQFRLVIINGDAYVEKYKNAY 78
           SS+ +   C  +   I  DL P+K TG+SR+M+E +  +         G  Y    K  Y
Sbjct: 63  SSDANDANCSCHAAVIKSDLAPYKATGVSRQMIESSARY---------GTRYKIYEKRLY 113

Query: 79  QTRDVF---TVWGILQLLRLYPGKVPDLELMFSCGDRPVVKKRDYEGANSTSPPPVFHYC 135
           +  +        GI   L      +PD++L+ +  D P +      GA      P+  + 
Sbjct: 114 REENCMFPARCQGIEHFLLPLVATLPDMDLVINTRDYPQINMAWGNGAQG----PILSFS 169

Query: 136 GDQESLDIVFPDWSFW-GWAETNIRP-----WNSILEDIKEGNKRTKWINRAPYAYWKG- 188
             ++  DI++P W+FW G   T + P     W+ + E +++      W  +    +++G 
Sbjct: 170 KTKDHRDIMYPAWTFWAGGPATKLHPRGIGRWDLMREKLEKRAAAIPWSQKRELGFFRGS 229

Query: 189 ------NPYVSIARED--LMKCNVTDKYQWKTRLYVQDWHKEKKQGFEKSKLADQCTHRY 240
                 +  + ++R +  L++   T    WK+     D     +  FE     D C ++Y
Sbjct: 230 RTSDERDSLILLSRRNPELVEAQYTKNQGWKSPKDTLDAPPAGEVSFE-----DHCKYKY 284

Query: 241 KIYVEGQAWSVSEKYILACDSMTLLIEPKYYDFYSRSLVPLQHYWPVRTARKCRDIKFAV 300
                G A S   K++  C S+   +  ++ +F+   L P  HY P++     ++ +  +
Sbjct: 285 LFNFRGVAASFRLKHLFLCQSLVFHVGDEWQEFFYDQLKPWVHYVPLKNYPSQQEYEELL 344

Query: 301 EWGNTHTRQARAIGRAGSRYMQEKLKMKYVYDYMFHLLIEYAKLLKFE 348
            +   +   A+ I + G  ++ + L+MK +  Y   LL  Y KLL +E
Sbjct: 345 TFFRKNDALAQEIAQRGRDFIWQHLRMKDIKCYWRRLLKSYVKLLTYE 392


>sp|Q5E9Q1|PGLT1_BOVIN Protein O-glucosyltransferase 1 OS=Bos taurus GN=POGLUT1 PE=2 SV=1
          Length = 392

 Score = 92.4 bits (228), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 88/340 (25%), Positives = 154/340 (45%), Gaps = 40/340 (11%)

Query: 27  CPEYFRWIHKDLEPWKHTGISREMLE---RAKAHAQFRLVIINGDAYVEKYKNAYQTRDV 83
           C  Y   I +DL P++  GISR+M+    R K    ++  II    Y E     + +R  
Sbjct: 54  CSCYHGVIEEDLTPFR-GGISRKMMAEVVRRKLGTHYQ--IIKNRLYRES-DCMFPSRCS 109

Query: 84  FTVWGILQLLRLYPGKVPDLELMFSCGDRPVVKKRDYEGANSTSPPPVFHYCGDQESLDI 143
                IL+++    G++PD+E++ +  D P V K            P+F +    E  DI
Sbjct: 110 GVEHFILEVI----GRLPDMEMVINVRDYPQVPKWMEPAI------PIFSFSKTLEYHDI 159

Query: 144 VFPDWSFW--GWAETNIRP-----WNSILEDIKEGNKRTKWINRAPYAYWKG-------N 189
           ++P W+FW  G A   I P     W+   ED+     +  W  +   AY++G       +
Sbjct: 160 MYPAWTFWEGGPAVWPIYPMGLGRWDLFREDLVRSAAQWPWKKKNSTAYFRGSRTSPERD 219

Query: 190 PYVSIARED--LMKCNVTDKYQWKTRLYVQDWHKEKKQGFEKSKLADQCTHRYKIYVEGQ 247
           P + ++R++  L+    T    WK+   ++D     K   +   L D C ++Y     G 
Sbjct: 220 PLILLSRKNPKLVDAEYTKNQAWKS---MKD--TLGKPAAKDVHLVDHCKYKYLFNFRGV 274

Query: 248 AWSVSEKYILACDSMTLLIEPKYYDFYSRSLVPLQHYWPVRTARKCRDIKFAVEWGNTHT 307
           A S   K++  C S+   +  ++ +F+   L P  HY PV+T     +++  +++   + 
Sbjct: 275 AASFRFKHLFLCGSLVFHVGDEWLEFFYPQLKPWVHYIPVKT--DLSNVQELLQFVKAND 332

Query: 308 RQARAIGRAGSRYMQEKLKMKYVYDYMFHLLIEYAKLLKF 347
             A+ I   GS+++   LKM  +  Y  +LL EY+K L +
Sbjct: 333 DVAQEIAERGSQFILNHLKMDDITCYWENLLTEYSKFLSY 372


>sp|A0NDG6|RUMI_ANOGA O-glucosyltransferase rumi homolog OS=Anopheles gambiae
           GN=AGAP004267 PE=3 SV=1
          Length = 399

 Score = 92.4 bits (228), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 81/329 (24%), Positives = 143/329 (43%), Gaps = 33/329 (10%)

Query: 34  IHKDLEPWKHTGISREMLERAKAHAQFRLVIINGDAYVEKYKNAYQTRDVFTVWGILQLL 93
           +  DL+P+K  GI++EM+ RAK +     VI  G     + +  +  R      G+   +
Sbjct: 78  LKADLKPFKAHGITKEMINRAKQYGTHYQVI--GHKLYRQRECMFPAR----CSGVEHFV 131

Query: 94  RLYPGKVPDLELMFSCGDRPVVKKRDYEGANSTSPPPVFHYCGDQESLDIVFPDWSFW-- 151
           R     +PD++L+ +C D P + +       S    PV  +    E LDI++P W+FW  
Sbjct: 132 RPLLPLLPDMDLIVNCRDWPQIHRH-----WSKEKIPVLSFSKTAEYLDIMYPAWAFWEG 186

Query: 152 ----GWAETNIRPWNSILEDIKEGNKRTKWINRAPYAYWKGN---------PYVSIARED 198
                   T +  W+   + I + +    W  + P A+++G+           +S A+  
Sbjct: 187 GPAIALYPTGLGRWDLHRQTITKAS--ADWEAKEPKAFFRGSRTSDERDALVLLSRAQPS 244

Query: 199 LMKCNVTDKYQWKTRLYVQDWHKEKKQGFEKSKLADQCTHRYKIYVEGQAWSVSEKYILA 258
           L+    T    WK+    QD      +   +  L + C +R+     G A S   K++  
Sbjct: 245 LVDAQYTKNQAWKS---PQD--TLNAEPAREVTLEEHCRYRFLFNFRGVAASFRFKHLFL 299

Query: 259 CDSMTLLIEPKYYDFYSRSLVPLQHYWPVRTARKCRDIKFAVEWGNTHTRQARAIGRAGS 318
           C S+   +  ++ +F+  SL P  HY PV       +++  + +   H + ARAI   G 
Sbjct: 300 CRSLVFHVGDEWQEFFYPSLKPWVHYVPVPVRSTPEELEALITFFQEHDQLARAIAERGY 359

Query: 319 RYMQEKLKMKYVYDYMFHLLIEYAKLLKF 347
            ++   L+M  V  Y   LL  Y KL+++
Sbjct: 360 EHIWNHLRMADVECYWKKLLKRYGKLIRY 388


>sp|Q9JHP7|KDEL1_MOUSE KDEL motif-containing protein 1 OS=Mus musculus GN=Kdelc1 PE=1 SV=1
          Length = 502

 Score = 90.9 bits (224), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 85/352 (24%), Positives = 154/352 (43%), Gaps = 38/352 (10%)

Query: 27  CPEYFRWIHKDLEPWKHTG---ISREMLER-AKAHAQFRLVIINGDAYVEKYKNAYQTRD 82
           C E    I KDL  +       I+ E+ +R  +  +     + +   Y++ +      R 
Sbjct: 153 CSETISQIQKDLAHFPTVDPEKIAAEIPKRFGQRQSLCHYTLKDNKVYIKTHGEHVGFR- 211

Query: 83  VFTVWGILQLLRLYPGKVPDLELMFSCGDRPVVKKRDYEGANSTSPPPVFHYCGDQESLD 142
           +F    +L L R    ++PD+E   + GD P+ KK+     ++++  P+F +CG  ES D
Sbjct: 212 IFMDAILLSLTRKV--RMPDVEFFVNLGDWPLEKKK-----SNSNIQPIFSWCGSTESRD 264

Query: 143 IVFPDWSFWGWAETNIRPWNSILEDIK--EGNKRTKWINRAPYAYWKGNP-------YVS 193
           IV P +      ++ +     +  D+   + N    W ++   A W+G          V 
Sbjct: 265 IVMPTYDL---TDSVLETMGRVSLDMMSVQANTGPPWESKNSTAVWRGRDSRKERLELVK 321

Query: 194 IARE--DLMKCNVTDKYQWKTRLYVQDWHKEKKQG--FEKSKLADQCTHRYKIYVEGQAW 249
           ++R+  +L+    T+ + +K        H E   G   +     D   H+Y+I ++G   
Sbjct: 322 LSRKHPELIDAAFTNFFFFK--------HDESLYGPIVKHISFFDFFKHKYQINIDGTVA 373

Query: 250 SVSEKYILACDSMTLLIEPKYYDFYSRSLVPLQHYWPVRTARKCRDIKFAVEWGNTHTRQ 309
           +    Y+L  DS+ L  +  YY+ +   L P +HY PV++     D+   ++W   H  +
Sbjct: 374 AYRLPYLLVGDSVVLKQDSIYYEHFYNELQPWKHYIPVKS--NLSDLLEKLKWAKEHDAE 431

Query: 310 ARAIGRAGSRYMQEKLKMKYVYDYMFHLLIEYAKLLKFEPRIPNEGKKVCAQ 361
           A+ I +AG  + +  L    ++ Y F L   YA L   EP+I    K+V  Q
Sbjct: 432 AKKIAKAGQEFARNNLMGDDIFCYYFKLFQGYANLQVSEPQIREGMKRVEPQ 483


>sp|Q7Z4H8|KDEL2_HUMAN KDEL motif-containing protein 2 OS=Homo sapiens GN=KDELC2 PE=1 SV=2
          Length = 507

 Score = 90.9 bits (224), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 84/353 (23%), Positives = 157/353 (44%), Gaps = 39/353 (11%)

Query: 27  CPEYFRWIHKDLEPWKHTGISREMLERAKAHAQFR-----LVIINGDAYVEKYKNAYQTR 81
           CP     I KD   +    + + + E  K     R       I+N   Y  +    Y   
Sbjct: 156 CPTKEPQIAKDFASFPSINLQQMLKEVPKRFGDERGAIVHYTILNNHVY-RRSLGKYTDF 214

Query: 82  DVFTVWGILQLLRLYPGKVPDLELMFSCGDRPVVKKRDYEGANST-SPPPVFHYCGDQES 140
            +F+   +L L R     +PDLE   + GD P+    ++   N T SP P+  +CG  +S
Sbjct: 215 KMFSDEILLSLTRKV--LLPDLEFYVNLGDWPL----EHRKVNGTPSPIPIISWCGSLDS 268

Query: 141 LDIVFPDWSFWGWAETNIRPWNSILEDIKEGNKRTKWINRAPYAYWKGNP-------YVS 193
            D+V P +         +R   + L  I +GN    WIN+   A+++G          V 
Sbjct: 269 RDVVLPTYDITHSMLEAMRGVTNDLLSI-QGNTGPSWINKTERAFFRGRDSREERLQLVQ 327

Query: 194 IARED--LMKCNVTDKYQWKTRLYVQDWHKEKKQGFEKSKLA---DQCTHRYKIYVEGQA 248
           +++E+  L+   +T  +          + +EK++   K+KL    D   ++Y++ V+G  
Sbjct: 328 LSKENPQLLDAGITGYF----------FFQEKEKELGKAKLMGFFDFFKYKYQVNVDGTV 377

Query: 249 WSVSEKYILACDSMTLLIEPKYYDFYSRSLVPLQHYWPVRTARKCRDIKFAVEWGNTHTR 308
            +    Y++  DS+ L  +  YY+ +  +L P +HY P++  R   D+   V+W   +  
Sbjct: 378 AAYRYPYLMLGDSLVLKQDSPYYEHFYMALEPWKHYVPIK--RNLSDLLEKVKWAKENDE 435

Query: 309 QARAIGRAGSRYMQEKLKMKYVYDYMFHLLIEYAKLLKFEPRIPNEGKKVCAQ 361
           +A+ I + G    ++ L+   +Y Y + +L +YA+    +P +  +G ++  Q
Sbjct: 436 EAKKIAKEGQLMARDLLQPHRLYCYYYQVLQKYAERQSSKPEV-RDGMELVPQ 487


>sp|Q8NBL1|PGLT1_HUMAN Protein O-glucosyltransferase 1 OS=Homo sapiens GN=POGLUT1 PE=1
           SV=1
          Length = 392

 Score = 90.9 bits (224), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 83/337 (24%), Positives = 150/337 (44%), Gaps = 34/337 (10%)

Query: 27  CPEYFRWIHKDLEPWKHTGISREMLERAKAHAQFRLVIINGDAYVEKYKNAYQTRDVFTV 86
           C  Y   I +DL P++  GISR+M+             I  +    +    + +R     
Sbjct: 54  CSCYHGVIEEDLTPFR-GGISRKMMAEVVRRKLGTHYQITKNRLYRENDCMFPSRCSGVE 112

Query: 87  WGILQLLRLYPGKVPDLELMFSCGDRPVVKKRDYEGANSTSPPPVFHYCGDQESLDIVFP 146
             IL+++    G++PD+E++ +  D P V K            PVF +    E  DI++P
Sbjct: 113 HFILEVI----GRLPDMEMVINVRDYPQVPKWMEPAI------PVFSFSKTSEYHDIMYP 162

Query: 147 DWSFW--GWAETNIRP-----WNSILEDIKEGNKRTKWINRAPYAYWKG-------NPYV 192
            W+FW  G A   I P     W+   ED+     +  W  +   AY++G       +P +
Sbjct: 163 AWTFWEGGPAVWPIYPTGLGRWDLFREDLVRSAAQWPWKKKNSTAYFRGSRTSPERDPLI 222

Query: 193 SIARED--LMKCNVTDKYQWKTRLYVQDWHKEKKQGFEKSKLADQCTHRYKIYVEGQAWS 250
            ++R++  L+    T    WK+   ++D     K   +   L D C ++Y     G A S
Sbjct: 223 LLSRKNPKLVDAEYTKNQAWKS---MKD--TLGKPAAKDVHLVDHCKYKYLFNFRGVAAS 277

Query: 251 VSEKYILACDSMTLLIEPKYYDFYSRSLVPLQHYWPVRTARKCRDIKFAVEWGNTHTRQA 310
              K++  C S+   +  ++ +F+   L P  HY PV+T     +++  +++   +   A
Sbjct: 278 FRFKHLFLCGSLVFHVGDEWLEFFYPQLKPWVHYIPVKT--DLSNVQELLQFVKANDDVA 335

Query: 311 RAIGRAGSRYMQEKLKMKYVYDYMFHLLIEYAKLLKF 347
           + I   GS++++  L+M  +  Y  +LL EY+K L +
Sbjct: 336 QEIAERGSQFIRNHLQMDDITCYWENLLSEYSKFLSY 372


>sp|Q16QY8|RUMI_AEDAE O-glucosyltransferase rumi homolog OS=Aedes aegypti GN=AAEL011121
           PE=3 SV=1
          Length = 402

 Score = 87.8 bits (216), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 88/339 (25%), Positives = 153/339 (45%), Gaps = 32/339 (9%)

Query: 25  GECPEYFRWIHKDLEPWKHTGISREMLERAKAHAQFRLVIINGDAYVEKYKNAYQTRDVF 84
             C  +   +  DL P+K  GIS +M+ERA+++   +  I++   Y +K    +  R   
Sbjct: 70  ANCSCHADVLKTDLRPFK-GGISEQMVERARSYGT-KYQIVDHRLYRQK-DCMFPAR--- 123

Query: 85  TVWGILQLLRLYPGKVPDLELMFSCGDRPVVKKRDYEGANSTSPPPVFHYCGDQESLDIV 144
              G+   ++     +PD+EL+ +C D P + +            PV  +    + LDI+
Sbjct: 124 -CSGVEHFIKPNLPHLPDMELIINCRDWPQINRH-----WKQEKLPVLSFSKTDDYLDIM 177

Query: 145 FPDWSFW-GWAETNIRP-----WNSILEDIKEGNKRTKWINRAPYAYWKG-------NPY 191
           +P W FW G    ++ P     W+     IK+     KW  +   A+++G       +P 
Sbjct: 178 YPTWGFWEGGPAISLYPTGLGRWDQHRVSIKKAADSWKWEKKKAKAFFRGSRTSDERDPL 237

Query: 192 VSIARE--DLMKCNVTDKYQWKTRLYVQDWHKEKKQGFEKSKLADQCTHRYKIYVEGQAW 249
           V ++R   +L+    T    WK+    +D    K    ++ +L D C ++Y     G A 
Sbjct: 238 VLLSRRKPELVDAQYTKNQAWKS---PKDTLNAKPA--QEVRLEDHCQYKYLFNFRGVAA 292

Query: 250 SVSEKYILACDSMTLLIEPKYYDFYSRSLVPLQHYWPVRTARKCRDIKFAVEWGNTHTRQ 309
           S   K++  C S+   +  ++ +F+  SL P  HY PVR      +++  +E+   H   
Sbjct: 293 SFRFKHLFLCRSLVFHVGSEWQEFFYPSLKPWVHYVPVRVGATQEELEELIEFFAEHDDL 352

Query: 310 ARAIGRAGSRYMQEKLKMKYVYDYMFHLLIEYAKLLKFE 348
           AR I   G  ++ + L+MK V  Y   LL  Y KL+K+E
Sbjct: 353 AREIADRGFEHVWKHLRMKDVECYWRKLLRRYGKLVKYE 391


>sp|Q8BYB9|PGLT1_MOUSE Protein O-glucosyltransferase 1 OS=Mus musculus GN=Poglut1 PE=2
           SV=2
          Length = 392

 Score = 87.0 bits (214), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 82/338 (24%), Positives = 150/338 (44%), Gaps = 36/338 (10%)

Query: 27  CPEYFRWIHKDLEPWKHTGISREML-ERAKAHAQFRLVIINGDAYVEKYKNAYQTRDVFT 85
           C  Y   I +DL P++  GISR+M+ E  +        II    + E     + +R    
Sbjct: 54  CSCYHGVIEEDLTPFR-GGISRKMMAEVVRRKLGTHYQIIKNRLFRED-DCMFPSRCSGV 111

Query: 86  VWGILQLLRLYPGKVPDLELMFSCGDRPVVKKRDYEGANSTSPPPVFHYCGDQESLDIVF 145
              IL+++     ++PD+E++ +  D P V K            PVF +    E  DI++
Sbjct: 112 EHFILEVIH----RLPDMEMVINVRDYPQVPKW------MEPTIPVFSFSKTSEYHDIMY 161

Query: 146 PDWSFWGWAE-------TNIRPWNSILEDIKEGNKRTKWINRAPYAYWKG-------NPY 191
           P W+FW           T +  W+   ED+     +  W  +   AY++G       +P 
Sbjct: 162 PAWTFWEGGPAVWPLYPTGLGRWDLFREDLLRSAAQWPWEKKNSTAYFRGSRTSPERDPL 221

Query: 192 VSIARED--LMKCNVTDKYQWKTRLYVQDWHKEKKQGFEKSKLADQCTHRYKIYVEGQAW 249
           + ++R++  L+    T    WK+   ++D     K   +   L D C +RY     G A 
Sbjct: 222 ILLSRKNPKLVDAEYTKNQAWKS---MKD--TLGKPAAKDVHLIDHCKYRYLFNFRGVAA 276

Query: 250 SVSEKYILACDSMTLLIEPKYYDFYSRSLVPLQHYWPVRTARKCRDIKFAVEWGNTHTRQ 309
           S   K++  C S+   +  ++ +F+   L P  HY PV+T     +++  +++   +   
Sbjct: 277 SFRFKHLFLCGSLVFHVGDEWVEFFYPQLKPWVHYIPVKT--DLSNVQELLQFVKANDDI 334

Query: 310 ARAIGRAGSRYMQEKLKMKYVYDYMFHLLIEYAKLLKF 347
           A+ I + GS+++   L+M  +  Y  +LL +Y+K L +
Sbjct: 335 AQEIAKRGSQFIINHLQMDDITCYWENLLTDYSKFLSY 372


>sp|Q566E5|KDEL2_RAT KDEL motif-containing protein 2 OS=Rattus norvegicus GN=Kdelc2 PE=2
           SV=1
          Length = 508

 Score = 85.5 bits (210), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 67/272 (24%), Positives = 129/272 (47%), Gaps = 30/272 (11%)

Query: 100 VPDLELMFSCGDRPVVKKRDYEGANST-SPPPVFHYCGDQESLDIVFPDWSFWGWAETNI 158
           +PDLE   + GD P+    ++   N T  P P+  +CG  +S DI+ P +         +
Sbjct: 232 LPDLEFYINLGDWPL----EHRKVNDTPGPIPIISWCGSLDSRDIILPTYDVTHSTLEAM 287

Query: 159 RPWNSILEDIKEGNKRTKWINRAPYAYWKGNP-------YVSIARED--LMKCNVTDKYQ 209
           R   + L  + +GN    WIN+   A+++G          V +++E+  L+   +T  + 
Sbjct: 288 RGVTNDLLSV-QGNTGPSWINKTEKAFFRGRDSREERLQLVLLSKENPQLLDAGITGYF- 345

Query: 210 WKTRLYVQDWHKEKKQGFEKSKLA---DQCTHRYKIYVEGQAWSVSEKYILACDSMTLLI 266
                    + +EK++   K+KL    D   ++Y++ V+G   +    Y++  DS+ L  
Sbjct: 346 ---------FFQEKEKELGKAKLMGFFDFFKYKYQVNVDGTVAAYRYPYLMLGDSLVLKQ 396

Query: 267 EPKYYDFYSRSLVPLQHYWPVRTARKCRDIKFAVEWGNTHTRQARAIGRAGSRYMQEKLK 326
           E  YY+ +   L P +HY P++  R   D+   V+W   +  +A+ I + G    ++ L+
Sbjct: 397 ESPYYEHFYVELRPWKHYVPIK--RNLSDLLEKVKWAKENDEEAKRIAKEGQLTARDLLQ 454

Query: 327 MKYVYDYMFHLLIEYAKLLKFEPRIPNEGKKV 358
              +Y Y + +L +YA+    +P I +  ++V
Sbjct: 455 PPRLYCYYYRVLQKYAERQVSKPMIRDGMERV 486


>sp|Q197C5|VF179_IIV3 Uncharacterized protein 035R OS=Invertebrate iridescent virus 3
           GN=IIV3-035R PE=4 SV=1
          Length = 1098

 Score = 42.4 bits (98), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 36/161 (22%), Positives = 66/161 (40%), Gaps = 14/161 (8%)

Query: 192 VSIAREDLMKCNVTDKYQWKTRL-------YVQDWHKEKKQGFEKSKLADQCTHRYKIYV 244
           +   R DL+   +T+   W  R+       Y+Q          ++     Q  +++ ++V
Sbjct: 336 LGTVRPDLLDAGITN---WNLRVRVSKHSPYLQIPDPGTLTAVDRLSPHQQSQYKFIVHV 392

Query: 245 EGQAWSVSEKYILACDSMTLLIEP--KYYDFYSRSLVPLQHYWPVRTARKCRDIKFAVEW 302
           EG   +      L   S  LL++    +  +YS  L P  HY PVR      D+   +EW
Sbjct: 393 EGHVSAFRLSLELGMKSCILLVQSLHGWKMWYSDLLKPWVHYVPVRP--DLSDLFDRIEW 450

Query: 303 GNTHTRQARAIGRAGSRYMQEKLKMKYVYDYMFHLLIEYAK 343
              +  Q RA+     ++ +  L  + + D++ H L   A+
Sbjct: 451 CRANDAQCRAMAENAYQFYRTHLDKESILDHLQHTLNRLAR 491


>sp|O55767|VF179_IIV6 Uncharacterized protein 179R OS=Invertebrate iridescent virus 6
           GN=IIV6-179R PE=4 SV=1
          Length = 1186

 Score = 38.9 bits (89), Expect = 0.081,   Method: Compositional matrix adjust.
 Identities = 79/375 (21%), Positives = 142/375 (37%), Gaps = 96/375 (25%)

Query: 99  KVPDLELMFSCGDRPVVKKRDYE------GAN-----------STSPPPVFHYCGDQESL 141
           +VPD+EL  +  D P++ K   E      G +           S    P+   C      
Sbjct: 206 EVPDIELFINRRDFPLLTKNGTEPYYNIFGKDHSLDSKSLKLISEGMCPILSMCTSDMYA 265

Query: 142 DIVFP---DWSFWGWAE---TNI----RPWNSI---------LEDIKEGNKRTKWINRAP 182
           DIV P   DW+     E   T++       NS+          +D  + N  T W  R P
Sbjct: 266 DIVIPTHEDWARVASTEGVKTHVGDKENNENSLPKKITFPPQCKDYSKDNFNTPWEKRIP 325

Query: 183 YAYWKG---------NPYVSIAREDLMKCNVT-----DKY----------QWKTR----- 213
            A ++G         +  +    + L+   ++     DKY          +W  R     
Sbjct: 326 TAVFRGGSTGCGVSSDTNLETFNQRLVAAKISYNSKPDKYNVPLITAGITKWNLRPRKIL 385

Query: 214 --LYVQDWHKEKKQGFEKSKLA-----DQCTHRYKIYVEGQ--AWSVSEKYILACDSMTL 264
              Y+Q  + EK    E  K++     +Q  ++Y I ++G   A+ +S +  + C    L
Sbjct: 386 NEKYLQTINIEK----EAPKVSPLTPEEQSKYKYIINIDGHVSAFRLSLEMSMGC--CIL 439

Query: 265 LIEPKYYD-------FYSRSLVPLQHYWPVRTARKCRDIKFAVEWGNTHTRQARAIGRAG 317
           L++ K  +       ++S  L P  HY PV++     D+   ++W   +  + + I +  
Sbjct: 440 LVKSKIPNETFGWKMWFSHLLKPYIHYVPVKS--DLSDLIEKIQWCRDNDEKCKEISQEA 497

Query: 318 SRYMQEKLKMKYVYDYMFHLLIEYAKLLKFEPRIPNEGKKVCAQKLASSQNGLGKRFMVE 377
            ++ Q  L  + + DYM +L++      K +   P   + V        QN +  +++  
Sbjct: 498 LKFYQTYLSRESILDYMQNLMV------KLKLSFPT-NEIVYGTDPLFIQNSVQSKYLFN 550

Query: 378 SMVNSSSETLPCTMP 392
            M NS +   P   P
Sbjct: 551 MMNNSINPIFPFLTP 565


>sp|A4WDE9|UNG_ENT38 Uracil-DNA glycosylase OS=Enterobacter sp. (strain 638) GN=ung PE=3
           SV=1
          Length = 229

 Score = 32.3 bits (72), Expect = 6.7,   Method: Compositional matrix adjust.
 Identities = 23/82 (28%), Positives = 35/82 (42%), Gaps = 4/82 (4%)

Query: 11  STPPFLNSSSEESLGECPEYFRWIHKDLEPWKHTGI----SREMLERAKAHAQFRLVIIN 66
           +TPP L +  +E  G  P + R  H  LE W H G+    +   +   +AH+   L    
Sbjct: 84  ATPPSLLNMYKELEGSLPGFVRPAHGYLESWAHQGVLLLNTVLTVRAGQAHSHASLGWET 143

Query: 67  GDAYVEKYKNAYQTRDVFTVWG 88
               V    N ++   VF +WG
Sbjct: 144 FTDKVISLINEHREGVVFLLWG 165


  Database: swissprot
    Posted date:  Mar 23, 2013  2:32 AM
  Number of letters in database: 191,569,459
  Number of sequences in database:  539,616
  
Lambda     K      H
   0.319    0.135    0.431 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 167,125,382
Number of Sequences: 539616
Number of extensions: 7029264
Number of successful extensions: 14337
Number of sequences better than 100.0: 20
Number of HSP's better than 100.0 without gapping: 14
Number of HSP's successfully gapped in prelim test: 6
Number of HSP's that attempted gapping in prelim test: 14275
Number of HSP's gapped (non-prelim): 28
length of query: 428
length of database: 191,569,459
effective HSP length: 120
effective length of query: 308
effective length of database: 126,815,539
effective search space: 39059186012
effective search space used: 39059186012
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 63 (28.9 bits)