BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 046815
(463 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|B0X1Q4|RUMI_CULQU O-glucosyltransferase rumi homolog OS=Culex quinquefasciatus
GN=CPIJ013394 PE=3 SV=1
Length = 403
Score = 91.3 bits (225), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 86/380 (22%), Positives = 166/380 (43%), Gaps = 46/380 (12%)
Query: 65 KEESRRSQAYKIVHCTYLTCLSAMNPIPERRRVASPQKVQTCPDFFKSIHKDLEPWAKSR 124
++E+ + YK Y+T I E P + C + DL P+ +S
Sbjct: 36 QQEAPTNNLYKAADNKYITL------IEEALAAYKPCESSNCSCHLDVLKTDLRPF-RSG 88
Query: 125 ITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMF--TIWGFLQLLRRYPGMVPDV 182
IT + ++E R + I+ +++ Q MF G +R +PD+
Sbjct: 89 IT-QDLIELARSYGTKYQIIGHRMFR-------QRDCMFPARCSGVEHFIRPNLPKLPDM 140
Query: 183 DIMFDCMDKPVIDKKEHGSF-PLPLFRYCTNDAHFDIPFPDWSFW-GWSEVNLQP----- 235
+++ +C D P I + + S PLP+ + + + DI +P W FW G ++L P
Sbjct: 141 ELIINCRDWPQISRHWNASREPLPVLSFSKTNDYLDIMYPTWGFWEGGPAISLYPTGLGR 200
Query: 236 WNEEFKDIKHGSQAKSWKEKLPFAYWKGNPD--------VLSPLRVELMKCNDSKLWGAE 287
W++ ++ ++ W++KL A+++G+ +LS +R EL+ +K
Sbjct: 201 WDQHRVSVRKAAKVWPWEKKLQQAFFRGSRTSDERDPLVLLSRMRPELVDAQYTK----- 255
Query: 288 ILRQNWAEEAKDGF-----KKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQ 342
Q W KD ++ +L + C ++Y G A S K++ C S+ + Q
Sbjct: 256 --NQAW-RSPKDTLHAEPAQEVRLEDHCQYKYLFNFRGVAASFRFKHLFLCKSLVFHVGQ 312
Query: 343 QYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM-ESLTMD 401
++++FF L P ++ P+P ++ ++ + + A++I G + + L M+
Sbjct: 313 EWQEFFYDSLKPWVHYVPVPVGINEWELEHLIQFFREHDQLAQEIANRGYEHIWNHLRME 372
Query: 402 RVYDYMLHLITEYSKLLDYK 421
V Y L+ Y KL+ Y+
Sbjct: 373 DVECYWKRLLRRYGKLVKYE 392
>sp|Q8NBL1|PGLT1_HUMAN Protein O-glucosyltransferase 1 OS=Homo sapiens GN=POGLUT1 PE=1
SV=1
Length = 392
Score = 91.3 bits (225), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 87/338 (25%), Positives = 156/338 (46%), Gaps = 30/338 (8%)
Query: 100 PQKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEA-KRFAALRILIVRGKLYVDPYYDCV- 157
P Q C + I +DL P+ + I+ + + E +R I + +LY + DC+
Sbjct: 48 PCSSQNCSCYHGVIEEDLTPF-RGGISRKMMAEVVRRKLGTHYQITKNRLYREN--DCMF 104
Query: 158 QSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFD 217
SR G + G +PD++++ + D P + K + +P+F + + D
Sbjct: 105 PSRCS----GVEHFILEVIGRLPDMEMVINVRDYPQVPKWMEPA--IPVFSFSKTSEYHD 158
Query: 218 IPFPDWSFW-GWSEV------NLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSP 270
I +P W+FW G V L W+ +D+ + WK+K AY++G+ SP
Sbjct: 159 IMYPAWTFWEGGPAVWPIYPTGLGRWDLFREDLVRSAAQWPWKKKNSTAYFRGSR--TSP 216
Query: 271 LR--VELMKCNDSKLWGAEILRQNWAEEAKDGF-----KKSKLSNQCNHRYKIYAEGYAW 323
R + L+ + KL AE + + KD K L + C ++Y G A
Sbjct: 217 ERDPLILLSRKNPKLVDAEYTKNQAWKSMKDTLGKPAAKDVHLVDHCKYKYLFNFRGVAA 276
Query: 324 SVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSE 383
S K++ C S+ + ++ +FF L P ++ P+ + DL +++ ++ + AN
Sbjct: 277 SFRFKHLFLCGSLVFHVGDEWLEFFYPQLKPWVHYIPVKT-DLS-NVQELLQFVKANDDV 334
Query: 384 AEKIGKAGQDFMES-LTMDRVYDYMLHLITEYSKLLDY 420
A++I + G F+ + L MD + Y +L++EYSK L Y
Sbjct: 335 AQEIAERGSQFIRNHLQMDDITCYWENLLSEYSKFLSY 372
>sp|Q5E9Q1|PGLT1_BOVIN Protein O-glucosyltransferase 1 OS=Bos taurus GN=POGLUT1 PE=2 SV=1
Length = 392
Score = 90.9 bits (224), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 84/333 (25%), Positives = 153/333 (45%), Gaps = 30/333 (9%)
Query: 105 TCPDFFKSIHKDLEPWAKSRITMRHIMEA-KRFAALRILIVRGKLYVDPYYDCV-QSRAM 162
C + I +DL P+ + I+ + + E +R I++ +LY + DC+ SR
Sbjct: 53 NCSCYHGVIEEDLTPF-RGGISRKMMAEVVRRKLGTHYQIIKNRLYRES--DCMFPSRCS 109
Query: 163 FTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFPD 222
G + G +PD++++ + D P + K + +P+F + + DI +P
Sbjct: 110 ----GVEHFILEVIGRLPDMEMVINVRDYPQVPKWMEPA--IPIFSFSKTLEYHDIMYPA 163
Query: 223 WSFWGWSE-------VNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLR--V 273
W+FW + L W+ +D+ + WK+K AY++G+ SP R +
Sbjct: 164 WTFWEGGPAVWPIYPMGLGRWDLFREDLVRSAAQWPWKKKNSTAYFRGSRT--SPERDPL 221
Query: 274 ELMKCNDSKLWGAEILRQNWAEEAKDGF-----KKSKLSNQCNHRYKIYAEGYAWSVSLK 328
L+ + KL AE + + KD K L + C ++Y G A S K
Sbjct: 222 ILLSRKNPKLVDAEYTKNQAWKSMKDTLGKPAAKDVHLVDHCKYKYLFNFRGVAASFRFK 281
Query: 329 YILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIG 388
++ C S+ + ++ +FF L P ++ P+ + DL +++ ++ + AN A++I
Sbjct: 282 HLFLCGSLVFHVGDEWLEFFYPQLKPWVHYIPVKT-DLS-NVQELLQFVKANDDVAQEIA 339
Query: 389 KAGQDF-MESLTMDRVYDYMLHLITEYSKLLDY 420
+ G F + L MD + Y +L+TEYSK L Y
Sbjct: 340 ERGSQFILNHLKMDDITCYWENLLTEYSKFLSY 372
>sp|Q8BYB9|PGLT1_MOUSE Protein O-glucosyltransferase 1 OS=Mus musculus GN=Poglut1 PE=2
SV=2
Length = 392
Score = 90.5 bits (223), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 87/338 (25%), Positives = 158/338 (46%), Gaps = 30/338 (8%)
Query: 100 PQKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEA-KRFAALRILIVRGKLYVDPYYDCV- 157
P Q C + I +DL P+ + I+ + + E +R I++ +L+ + DC+
Sbjct: 48 PCSSQNCSCYHGVIEEDLTPF-RGGISRKMMAEVVRRKLGTHYQIIKNRLFRED--DCMF 104
Query: 158 QSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFD 217
SR L+++ R +PD++++ + D P + K + +P+F + + D
Sbjct: 105 PSRCSGVEHFILEVIHR----LPDMEMVINVRDYPQVPKWMEPT--IPVFSFSKTSEYHD 158
Query: 218 IPFPDWSFW-GWSEV------NLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSP 270
I +P W+FW G V L W+ +D+ + W++K AY++G+ SP
Sbjct: 159 IMYPAWTFWEGGPAVWPLYPTGLGRWDLFREDLLRSAAQWPWEKKNSTAYFRGSR--TSP 216
Query: 271 LR--VELMKCNDSKLWGAEILRQNWAEEAKDGF-----KKSKLSNQCNHRYKIYAEGYAW 323
R + L+ + KL AE + + KD K L + C +RY G A
Sbjct: 217 ERDPLILLSRKNPKLVDAEYTKNQAWKSMKDTLGKPAAKDVHLIDHCKYRYLFNFRGVAA 276
Query: 324 SVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSE 383
S K++ C S+ + ++ +FF L P ++ P+ + DL +++ ++ + AN
Sbjct: 277 SFRFKHLFLCGSLVFHVGDEWVEFFYPQLKPWVHYIPVKT-DLS-NVQELLQFVKANDDI 334
Query: 384 AEKIGKAGQDF-MESLTMDRVYDYMLHLITEYSKLLDY 420
A++I K G F + L MD + Y +L+T+YSK L Y
Sbjct: 335 AQEIAKRGSQFIINHLQMDDITCYWENLLTDYSKFLSY 372
>sp|Q8T045|RUMI_DROME O-glucosyltransferase rumi OS=Drosophila melanogaster GN=rumi PE=1
SV=1
Length = 411
Score = 89.7 bits (221), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 73/324 (22%), Positives = 144/324 (44%), Gaps = 18/324 (5%)
Query: 113 IHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFTIWGFLQLL 172
+ +DL P+ + +T + I + R+ + I +LY D +C+ I FL L
Sbjct: 80 LKRDLAPYKSTGVTRQMIESSARYGT-KYKIYGHRLYRDA--NCMFPARCEGIEHFLLPL 136
Query: 173 RRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFPDWSFW-GWSEV 231
+PD+D++ + D P ++ + P+F + + DI +P W+FW G
Sbjct: 137 ---VATLPDMDLIINTRDYPQLNAAWGNAAGGPVFSFSKTKEYRDIMYPAWTFWAGGPAT 193
Query: 232 NLQP-----WNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGA 286
L P W++ + ++ + A W +K +++G+ + L+ + +L A
Sbjct: 194 KLHPRGIGRWDQMREKLEKRAAAIPWSQKRSLGFFRGSRTSDERDSLILLSRRNPELVEA 253
Query: 287 EILRQNWAEEAKDGFK-----KSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIIS 341
+ + + KD + + C ++Y G A S LK++ C S+ +
Sbjct: 254 QYTKNQGWKSPKDTLDAPAADEVSFEDHCKYKYLFNFRGVAASFRLKHLFLCKSLVFHVG 313
Query: 342 QQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM-ESLTM 400
++++FF L P ++ P+ S + + ++ + N + A++I + G DF+ E L M
Sbjct: 314 DEWQEFFYDQLKPWVHYVPLKSYPSQQEYEHILSFFKKNDALAQEIAQRGYDFIWEHLRM 373
Query: 401 DRVYDYMLHLITEYSKLLDYKPAP 424
+ Y L+ Y KLL Y+ P
Sbjct: 374 KDIKCYWRKLLKRYVKLLQYEVKP 397
>sp|Q29AU6|RUMI_DROPS O-glucosyltransferase rumi OS=Drosophila pseudoobscura
pseudoobscura GN=rumi PE=3 SV=1
Length = 409
Score = 85.9 bits (211), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 72/324 (22%), Positives = 141/324 (43%), Gaps = 18/324 (5%)
Query: 113 IHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMFTIWGFLQLL 172
I DL P+ + ++ + I + R+ R I +LY + +C+ I FL L
Sbjct: 78 IKSDLAPYKATGVSRQMIESSARYGT-RYKIYEKRLYREE--NCMFPARCQGIEHFLLPL 134
Query: 173 RRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFPDWSFW-GWSEV 231
+PD+D++ + D P I+ P+ + H DI +P W+FW G
Sbjct: 135 ---VATLPDMDLVINTRDYPQINMAWGNGAQGPILSFSKTKDHRDIMYPAWTFWAGGPAT 191
Query: 232 NLQP-----WNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGA 286
L P W+ + ++ + A W +K +++G+ + L+ + +L A
Sbjct: 192 KLHPRGIGRWDLMREKLEKRAAAIPWSQKRELGFFRGSRTSDERDSLILLSRRNPELVEA 251
Query: 287 EILRQNWAEEAKDGFK-----KSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIIS 341
+ + + KD + + C ++Y G A S LK++ C S+ +
Sbjct: 252 QYTKNQGWKSPKDTLDAPPAGEVSFEDHCKYKYLFNFRGVAASFRLKHLFLCQSLVFHVG 311
Query: 342 QQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM-ESLTM 400
++++FF L P ++ P+ + + + ++ + N + A++I + G+DF+ + L M
Sbjct: 312 DEWQEFFYDQLKPWVHYVPLKNYPSQQEYEELLTFFRKNDALAQEIAQRGRDFIWQHLRM 371
Query: 401 DRVYDYMLHLITEYSKLLDYKPAP 424
+ Y L+ Y KLL Y+ P
Sbjct: 372 KDIKCYWRRLLKSYVKLLTYEVQP 395
>sp|A0NDG6|RUMI_ANOGA O-glucosyltransferase rumi homolog OS=Anopheles gambiae
GN=AGAP004267 PE=3 SV=1
Length = 399
Score = 76.6 bits (187), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 74/330 (22%), Positives = 136/330 (41%), Gaps = 25/330 (7%)
Query: 116 DLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQSRAMF--TIWGFLQLLR 173
DL+P+ IT I AK++ ++ KLY Q MF G +R
Sbjct: 81 DLKPFKAHGITKEMINRAKQYGT-HYQVIGHKLYR-------QRECMFPARCSGVEHFVR 132
Query: 174 RYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPFPDWSFW-GWSEVN 232
++PD+D++ +C D P I + +P+ + + DI +P W+FW G +
Sbjct: 133 PLLPLLPDMDLIVNCRDWPQI-HRHWSKEKIPVLSFSKTAEYLDIMYPAWAFWEGGPAIA 191
Query: 233 LQP-----WNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGAE 287
L P W+ + I S W+ K P A+++G+ + L+ L A+
Sbjct: 192 LYPTGLGRWDLHRQTITKAS--ADWEAKEPKAFFRGSRTSDERDALVLLSRAQPSLVDAQ 249
Query: 288 ILRQNWAEEAKDGF-----KKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQ 342
+ + +D ++ L C +R+ G A S K++ C S+ +
Sbjct: 250 YTKNQAWKSPQDTLNAEPAREVTLEEHCRYRFLFNFRGVAASFRFKHLFLCRSLVFHVGD 309
Query: 343 QYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSEAEKIGKAGQDFM-ESLTMD 401
++++FF L P ++ P+P +++++ + + A I + G + + L M
Sbjct: 310 EWQEFFYPSLKPWVHYVPVPVRSTPEELEALITFFQEHDQLARAIAERGYEHIWNHLRMA 369
Query: 402 RVYDYMLHLITEYSKLLDYKPAPPSSAFEA 431
V Y L+ Y KL+ Y S+ E
Sbjct: 370 DVECYWKKLLKRYGKLIRYTVERDSTLIEV 399
>sp|Q7Z4H8|KDEL2_HUMAN KDEL motif-containing protein 2 OS=Homo sapiens GN=KDELC2 PE=1 SV=2
Length = 507
Score = 72.0 bits (175), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 77/335 (22%), Positives = 144/335 (42%), Gaps = 22/335 (6%)
Query: 101 QKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEA--KRFAALRILIVRGKLYVDPYYDCVQ 158
QK +CP I KD + I ++ +++ KRF R IV + + Y
Sbjct: 151 QKTLSCPTKEPQIAKDFASFPS--INLQQMLKEVPKRFGDERGAIVHYTILNNHVYRRSL 208
Query: 159 SR----AMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGS-FPLPLFRYCTND 213
+ MF+ L L R+ ++PD++ + D P+ +K +G+ P+P+ +C +
Sbjct: 209 GKYTDFKMFSDEILLSLTRKV--LLPDLEFYVNLGDWPLEHRKVNGTPSPIPIISWCGSL 266
Query: 214 AHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRV 273
D+ P + ++ + I+ G+ SW K A+++G L++
Sbjct: 267 DSRDVVLPTYDITHSMLEAMRGVTNDLLSIQ-GNTGPSWINKTERAFFRGRDSREERLQL 325
Query: 274 ELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCN---HRYKIYAEGYAWSVSLKYI 330
+ + +L A I + +E + K+KL + ++Y++ +G + Y+
Sbjct: 326 VQLSKENPQLLDAGITGYFFFQEKEKELGKAKLMGFFDFFKYKYQVNVDGTVAAYRYPYL 385
Query: 331 LSCNSVALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIG 388
+ +S+ L Y + F L P K++ PI +DL +K W N EA+KI
Sbjct: 386 MLGDSLVLKQDSPYYEHFYMALEPWKHYVPIKRNLSDLLEKVK----WAKENDEEAKKIA 441
Query: 389 KAGQDFMESLTM-DRVYDYMLHLITEYSKLLDYKP 422
K GQ L R+Y Y ++ +Y++ KP
Sbjct: 442 KEGQLMARDLLQPHRLYCYYYQVLQKYAERQSSKP 476
>sp|Q6UW63|KDEL1_HUMAN KDEL motif-containing protein 1 OS=Homo sapiens GN=KDELC1 PE=1 SV=1
Length = 502
Score = 70.1 bits (170), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 80/331 (24%), Positives = 139/331 (41%), Gaps = 24/331 (7%)
Query: 105 TCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRIL----IVRGKLYVDPYYDCVQSR 160
CP+ I +DL + + KRF + L + K+Y+ + + V R
Sbjct: 152 NCPETIAQIQRDLAHFPAVDPEKIAVEIPKRFGQRQSLCHYTLKDNKVYIKTHGEHVGFR 211
Query: 161 AMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPF 220
+F L L R+ +PDV++ + D P ++KK+ S P+F +C + DI
Sbjct: 212 -IFMDAILLSLTRKVK--MPDVELFVNLGDWP-LEKKKSNSNIHPIFSWCGSTDSKDIVM 267
Query: 221 PDWSFWGWSEVNLQPWNEEFKDIK--HGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKC 278
P + ++ L+ D+ + W+ K A W+G L + +
Sbjct: 268 PTYDL---TDSVLETMGRVSLDMMSVQANTGPPWESKNSTAVWRGRDSRKERLELVKLSR 324
Query: 279 NDSKLWGAEILRQNWAEEAKDGF----KKSKLSNQCNHRYKIYAEGYAWSVSLKYILSCN 334
+L A + + ++ + K + H+Y+I +G + L Y+L +
Sbjct: 325 KHPELIDAAFTNFFFFKHDENLYGPIVKHISFFDFFKHKYQINIDGTVAAYRLPYLLVGD 384
Query: 335 SVALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIGKAGQ 392
SV L Y + F L P K++ P+ S +DL +K W + EA+KI KAGQ
Sbjct: 385 SVVLKQDSIYYEHFYNELQPWKHYIPVKSNLSDLLEKLK----WAKDHDEEAKKIAKAGQ 440
Query: 393 DFM-ESLTMDRVYDYMLHLITEYSKLLDYKP 422
+F +L D ++ Y L EY+ L +P
Sbjct: 441 EFARNNLMGDDIFCYYFKLFQEYANLQVSEP 471
>sp|Q16QY8|RUMI_AEDAE O-glucosyltransferase rumi homolog OS=Aedes aegypti GN=AAEL011121
PE=3 SV=1
Length = 402
Score = 66.6 bits (161), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 77/349 (22%), Positives = 151/349 (43%), Gaps = 28/349 (8%)
Query: 99 SPQKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLYVDPYYDCVQ 158
P + C + DL P+ K I+ + + A+ + + IV +LY DC+
Sbjct: 65 EPCQQANCSCHADVLKTDLRPF-KGGISEQMVERARSYGT-KYQIVDHRLYRQK--DCMF 120
Query: 159 SRAMFTIWGFLQLLRRYPGM--VPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHF 216
+ F++ P + +PD++++ +C D P I++ LP+ + D +
Sbjct: 121 PARCSGVEHFIK-----PNLPHLPDMELIINCRDWPQINRHWKQE-KLPVLSFSKTDDYL 174
Query: 217 DIPFPDWSFW-GWSEVNLQP-----WNEEFKDIKHGSQAKSWKEKLPFAYWKGN--PDVL 268
DI +P W FW G ++L P W++ IK + + W++K A+++G+ D
Sbjct: 175 DIMYPTWGFWEGGPAISLYPTGLGRWDQHRVSIKKAADSWKWEKKKAKAFFRGSRTSDER 234
Query: 269 SPLRVELMKCNDSKLWGAEILRQNWAEEAKDGF-----KKSKLSNQCNHRYKIYAEGYAW 323
PL L+ +L A+ + + KD ++ +L + C ++Y G A
Sbjct: 235 DPL--VLLSRRKPELVDAQYTKNQAWKSPKDTLNAKPAQEVRLEDHCQYKYLFNFRGVAA 292
Query: 324 SVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSADLCRSIKSVVDWGNANPSE 383
S K++ C S+ + ++++FF L P ++ P+ ++ ++++ +
Sbjct: 293 SFRFKHLFLCRSLVFHVGSEWQEFFYPSLKPWVHYVPVRVGATQEELEELIEFFAEHDDL 352
Query: 384 AEKIGKAG-QDFMESLTMDRVYDYMLHLITEYSKLLDYKPAPPSSAFEA 431
A +I G + + L M V Y L+ Y KL+ Y+ S E
Sbjct: 353 AREIADRGFEHVWKHLRMKDVECYWRKLLRRYGKLVKYEVKRDHSLVEV 401
>sp|Q7ZVE6|KDEL1_DANRE KDEL motif-containing protein 1 OS=Danio rerio GN=kdelc1 PE=2 SV=1
Length = 500
Score = 65.9 bits (159), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 81/337 (24%), Positives = 139/337 (41%), Gaps = 28/337 (8%)
Query: 101 QKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRIL----IVRGKLYVDPYYDC 156
+K CP F I DL + +RF L I ++Y+ + +
Sbjct: 147 EKNMHCPASFSQIESDLSIFQSVDPDRNAHEIIQRFGKSHSLCHYTIKNNQVYIKTHGEH 206
Query: 157 VQSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHF 216
V R +F L L R+ +PD++ + D P ++K+ P P+F +C ++
Sbjct: 207 VGFR-IFMDAFLLSLTRKVK--LPDIEFFVNLGDWP-LEKRRASQNPSPVFSWCGSNDTR 262
Query: 217 DIPFPDWSFWGWSEVNLQPWNEEFKDIK--HGSQAKSWKEKLPFAYWKGNPDVLSPLRVE 274
DI P + +E L+ D+ G W++K+ +W+G R+E
Sbjct: 263 DIVMPTYDL---TESVLETMGRVSLDMMSVQGHTGPVWEKKINKGFWRGRDS--RKERLE 317
Query: 275 LMKCNDSKLWGAEILRQNW----AEEAKDG--FKKSKLSNQCNHRYKIYAEGYAWSVSLK 328
L+K + + N+ +E+ G K + ++Y+I +G + L
Sbjct: 318 LVKLARANTAMLDAALTNFFFFKHDESLYGPLVKHVSFFDFFKYKYQINVDGTVAAYRLP 377
Query: 329 YILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEK 386
Y+L+ +SV Y + F L P ++ P S +DL I+ W + EA+K
Sbjct: 378 YLLAGDSVVFKHDSIYYEHFYNELQPWVHYIPFRSDLSDLLEKIQ----WAKDHDEEAKK 433
Query: 387 IGKAGQDFMESLTM-DRVYDYMLHLITEYSKLLDYKP 422
I AGQ F + M D V+ Y L +Y++L KP
Sbjct: 434 IALAGQQFARTHLMGDSVFCYYHKLFQKYAELQVTKP 470
>sp|Q9JHP7|KDEL1_MOUSE KDEL motif-containing protein 1 OS=Mus musculus GN=Kdelc1 PE=1 SV=1
Length = 502
Score = 65.5 bits (158), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 83/333 (24%), Positives = 139/333 (41%), Gaps = 28/333 (8%)
Query: 105 TCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRIL----IVRGKLYVDPYYDCVQSR 160
C + I KDL + KRF + L + K+Y+ + + V R
Sbjct: 152 NCSETISQIQKDLAHFPTVDPEKIAAEIPKRFGQRQSLCHYTLKDNKVYIKTHGEHVGFR 211
Query: 161 AMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGSFPLPLFRYCTNDAHFDIPF 220
+F L L R+ +PDV+ + D P ++KK+ S P+F +C + DI
Sbjct: 212 -IFMDAILLSLTRKV--RMPDVEFFVNLGDWP-LEKKKSNSNIQPIFSWCGSTESRDIVM 267
Query: 221 PDWSFWGWSEVNLQPWNEEFKDIK--HGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKC 278
P + ++ L+ D+ + W+ K A W+G R+EL+K
Sbjct: 268 PTYDL---TDSVLETMGRVSLDMMSVQANTGPPWESKNSTAVWRGRDS--RKERLELVKL 322
Query: 279 NDSKLWGAEILRQNW----AEEAKDG--FKKSKLSNQCNHRYKIYAEGYAWSVSLKYILS 332
+ + N+ +E+ G K + H+Y+I +G + L Y+L
Sbjct: 323 SRKHPELIDAAFTNFFFFKHDESLYGPIVKHISFFDFFKHKYQINIDGTVAAYRLPYLLV 382
Query: 333 CNSVALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIGKA 390
+SV L Y + F L P K++ P+ S +DL +K W + +EA+KI KA
Sbjct: 383 GDSVVLKQDSIYYEHFYNELQPWKHYIPVKSNLSDLLEKLK----WAKEHDAEAKKIAKA 438
Query: 391 GQDFM-ESLTMDRVYDYMLHLITEYSKLLDYKP 422
GQ+F +L D ++ Y L Y+ L +P
Sbjct: 439 GQEFARNNLMGDDIFCYYFKLFQGYANLQVSEP 471
>sp|Q566E5|KDEL2_RAT KDEL motif-containing protein 2 OS=Rattus norvegicus GN=Kdelc2 PE=2
SV=1
Length = 508
Score = 64.7 bits (156), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 75/335 (22%), Positives = 144/335 (42%), Gaps = 22/335 (6%)
Query: 101 QKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEA--KRFAALRILIVRGKLYVDPYYDCVQ 158
QK +CP I +D + I ++ +++ KRF R IV + + Y
Sbjct: 152 QKTLSCPANEPQIEQDFISFPS--INLQQMLKEVPKRFGDERGAIVHYTILNNHIYRRSL 209
Query: 159 SR----AMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKKEHGS-FPLPLFRYCTND 213
+ MF+ L L R+ +PD++ + D P+ +K + + P+P+ +C +
Sbjct: 210 GKYTDFKMFSDEILLSLARKV--TLPDLEFYINLGDWPLEHRKVNDTPGPIPIISWCGSL 267
Query: 214 AHFDIPFPDWSFWGWSEVNLQPWNEEFKDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRV 273
DI P + + ++ + ++ G+ SW K A+++G L++
Sbjct: 268 DSRDIILPTYDVTHSTLEAMRGVTNDLLSVQ-GNTGPSWINKTEKAFFRGRDSREERLQL 326
Query: 274 ELMKCNDSKLWGAEILRQNWAEEAKDGFKKSKLSNQCN---HRYKIYAEGYAWSVSLKYI 330
L+ + +L A I + +E + K+KL + ++Y++ +G + Y+
Sbjct: 327 VLLSKENPQLLDAGITGYFFFQEKEKELGKAKLMGFFDFFKYKYQVNVDGTVAAYRYPYL 386
Query: 331 LSCNSVALIISQQYKDFFSRGLIPTKNHFPIPS--ADLCRSIKSVVDWGNANPSEAEKIG 388
+ +S+ L Y + F L P K++ PI +DL +K W N EA++I
Sbjct: 387 MLGDSLVLKQESPYYEHFYVELRPWKHYVPIKRNLSDLLEKVK----WAKENDEEAKRIA 442
Query: 389 KAGQDFMESLTM-DRVYDYMLHLITEYSKLLDYKP 422
K GQ L R+Y Y ++ +Y++ KP
Sbjct: 443 KEGQLTARDLLQPPRLYCYYYRVLQKYAERQVSKP 477
>sp|P39907|LPSA_DICNO O-glucosyltransferase LpsA OS=Dichelobacter nodosus GN=lpsA PE=3
SV=1
Length = 318
Score = 36.2 bits (82), Expect = 0.55, Method: Compositional matrix adjust.
Identities = 22/94 (23%), Positives = 44/94 (46%), Gaps = 1/94 (1%)
Query: 306 LSNQCNHRYKIYAEGYAWSVSLKYILSCNSVALIISQQYKDFFSRGLIPTKNHFPIPSAD 365
+ Q +RY + EG + +LK+I++ NS+ L+ + +Y+ + G + H+ + D
Sbjct: 203 VKRQLQYRYILSLEGNDVATNLKWIMASNSLCLMPAPRYETWMMEGRLQAGVHY-VQLRD 261
Query: 366 LCRSIKSVVDWGNANPSEAEKIGKAGQDFMESLT 399
++ + + P AE I K Q +M
Sbjct: 262 DFADLEEQILFFERYPEAAEAIIKNAQQWMSQFA 295
>sp|B2VGR6|SYI_ERWT9 Isoleucine--tRNA ligase OS=Erwinia tasmaniensis (strain DSM 17950 /
Et1/99) GN=ileS PE=3 SV=1
Length = 938
Score = 35.8 bits (81), Expect = 0.75, Method: Compositional matrix adjust.
Identities = 32/123 (26%), Positives = 50/123 (40%), Gaps = 11/123 (8%)
Query: 95 RRVASPQKVQTCPDFFKSIHKDLEPWAKSRITMRHIMEAKRFAALRILIVRGKLY--VDP 152
R+ A+ Q DF + L W + +TM EA AL +I G L+ P
Sbjct: 127 RQYAAEQVEGQKADFIRL--GVLGDWDRPYLTMDFKTEANIIRALGKIIGNGHLHKGAKP 184
Query: 153 YYDCVQSRAMFTIWGFLQLLRRYPGMVPDVDIMFDCMDKPVIDKK---EHGSFPLPLFRY 209
+ C+ R+ + Y P +D+MFD +DK + K H + P+ L +
Sbjct: 185 VHWCLDCRSALA----EAEVEYYDKTSPSIDVMFDAVDKDAVQAKFGAAHVNGPISLVIW 240
Query: 210 CTN 212
T
Sbjct: 241 TTT 243
>sp|Q9W6R5|XLRS1_TAKRU Retinoschisin OS=Takifugu rubripes GN=xlrs1 PE=3 SV=1
Length = 280
Score = 34.3 bits (77), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 14/47 (29%), Positives = 27/47 (57%)
Query: 241 KDIKHGSQAKSWKEKLPFAYWKGNPDVLSPLRVELMKCNDSKLWGAE 287
KD ++G++A+ W +K P W+G V + + V L + S+ W ++
Sbjct: 39 KDFRNGTKAEQWGKKTPTQIWRGCMSVCNAIVVCLFELGVSETWNSK 85
>sp|Q6GZU9|027R_FRG3G Uncharacterized protein 027R OS=Frog virus 3 (isolate Goorha)
GN=FV3-027R PE=4 SV=1
Length = 970
Score = 32.0 bits (71), Expect = 9.3, Method: Compositional matrix adjust.
Identities = 33/138 (23%), Positives = 58/138 (42%), Gaps = 11/138 (7%)
Query: 310 CNHRYKIYAEGYAWSVSLKYILSCNSVALIIS----QQYKDFFSRGLIPTKNHFPIPSAD 365
++Y + G+A + L LS SV L+ S Q+ D + ++ H+ IP
Sbjct: 334 AGYKYVLCLWGHAPAFRLARDLSLGSVVLLPSRPPGQEGLDMWHSSVLKPWTHY-IPVRG 392
Query: 366 LCRSIKSVVDWGNANPSEAEKIGKAGQDFMESL-----TMDRVYDYMLHLITEYS-KLLD 419
++ ++W N +E EKI AG + +L +DR D + + E D
Sbjct: 393 DLSDLEKRIEWCRDNDAECEKIAAAGMEASLNLLGWEGQLDRWMDVLRSVRLECCPGGYD 452
Query: 420 YKPAPPSSAFEACVESLL 437
P+P + CV ++
Sbjct: 453 MPPSPSLVSDSMCVRQMV 470
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.322 0.136 0.441
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 180,320,992
Number of Sequences: 539616
Number of extensions: 7653563
Number of successful extensions: 14535
Number of sequences better than 100.0: 21
Number of HSP's better than 100.0 without gapping: 10
Number of HSP's successfully gapped in prelim test: 11
Number of HSP's that attempted gapping in prelim test: 14499
Number of HSP's gapped (non-prelim): 21
length of query: 463
length of database: 191,569,459
effective HSP length: 121
effective length of query: 342
effective length of database: 126,275,923
effective search space: 43186365666
effective search space used: 43186365666
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 63 (28.9 bits)