BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 041855
(203 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|Q9FZ37|GUX4_ARATH Putative UDP-glucuronate:xylan alpha-glucuronosyltransferase 4
OS=Arabidopsis thaliana GN=GUX4 PE=3 SV=1
Length = 557
Score = 253 bits (645), Expect = 8e-67, Method: Compositional matrix adjust.
Identities = 119/205 (58%), Positives = 149/205 (72%), Gaps = 25/205 (12%)
Query: 24 REAYVTILHSSEAYVCGAIALAESIIQKNSSRDLVLLHDKSISGKSLRSLRAAGWKTKWI 83
R AYVT+LHSSEAYVCGAIALA+SI Q S +D++LLHD +I+ KSL L AAGW + I
Sbjct: 270 RVAYVTVLHSSEAYVCGAIALAQSIRQSGSHKDMILLHDHTITNKSLIGLSAAGWNLRLI 329
Query: 84 SRIRSPFAKKDSYNEWNYSKLR-------------------------FFFYPELSAAGND 118
RIRSPF++KDSYNEWNYSKLR F+YP+LSA+GND
Sbjct: 330 DRIRSPFSQKDSYNEWNYSKLRVWQVTDYDKLVFIDADFIILKKLDHLFYYPQLSASGND 389
Query: 119 KVLFNSGVMVIEPSLCKFEDLMLKSFQVSSYNGGDQGFLNEVFTWWHRLPKRINHLKVFS 178
KVLFNSG+MV+EPS C F+DLM KSF++ SYNGGDQGFLNE+F WWHRL KR+N +K F
Sbjct: 390 KVLFNSGIMVLEPSACMFKDLMEKSFKIESYNGGDQGFLNEIFVWWHRLSKRVNTMKYFD 449
Query: 179 KQDDKEHQVGDGLYAIHYLGLKPWM 203
+++ + H + + + +HYLGLKPW+
Sbjct: 450 EKNHRRHDLPENVEGLHYLGLKPWV 474
>sp|F4HZC3|GUX5_ARATH Putative UDP-glucuronate:xylan alpha-glucuronosyltransferase 5
OS=Arabidopsis thaliana GN=GUX5 PE=2 SV=1
Length = 566
Score = 245 bits (625), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 124/207 (59%), Positives = 144/207 (69%), Gaps = 27/207 (13%)
Query: 23 QREAYVTILHSSEAYVCGAIALAESIIQKNSSRDLVLLHDKSISGKSLRSLRAAGWKTKW 82
QR AYVT+LHSSE YVCGAIALA+SI Q S++D++LLHD SI+ SL L AGWK +
Sbjct: 276 QRVAYVTLLHSSEVYVCGAIALAQSIRQSGSTKDMILLHDDSITNISLIGLSLAGWKLRR 335
Query: 83 ISRIRSPFAKKDSYNEWNYSKLR-------------------------FFFYPELSAAGN 117
+ RIRSPF+KK SYNEWNYSKLR F YP+LSAAGN
Sbjct: 336 VERIRSPFSKKRSYNEWNYSKLRVWQVTDYDKLVFIDADFIIVKNIDYLFSYPQLSAAGN 395
Query: 118 DKVLFNSGVMVIEPSLCKFEDLMLKSFQVSSYNGGDQGFLNEVFTWWHRLPKRINHLKVF 177
+KVLFNSGVMV+EPS C FEDLMLKSF++ SYNGGDQGFLNE F WWHRL KR+N +K F
Sbjct: 396 NKVLFNSGVMVLEPSACLFEDLMLKSFKIGSYNGGDQGFLNEYFVWWHRLSKRLNTMKYF 455
Query: 178 SKQD--DKEHQVGDGLYAIHYLGLKPW 202
+ DK + + L IHYLGLKPW
Sbjct: 456 GDESRHDKARNLPENLEGIHYLGLKPW 482
>sp|Q8GWW4|GUX2_ARATH UDP-glucuronate:xylan alpha-glucuronosyltransferase 2
OS=Arabidopsis thaliana GN=GUX2 PE=2 SV=1
Length = 596
Score = 227 bits (579), Expect = 4e-59, Method: Compositional matrix adjust.
Identities = 112/218 (51%), Positives = 143/218 (65%), Gaps = 33/218 (15%)
Query: 19 TTYHQREAYVTILHSSEAYVCGAIALAESIIQKNSSRDLVLLHDKSISGKSLRSLRAAGW 78
T +REAYVT+LHSSE+YVCGAI LA+S++Q N+ RDL+LLHD SIS LR+L AAGW
Sbjct: 295 TKRPKREAYVTVLHSSESYVCGAITLAQSLLQTNTKRDLILLHDDSISITKLRALAAAGW 354
Query: 79 KTKWISRIRSPFAKKDSYNEWNYSKLR-------------------------FFFYPELS 113
K + I RIR+P A+KDSYNE+NYSK R F +P++S
Sbjct: 355 KLRRIIRIRNPLAEKDSYNEYNYSKFRLWQLTDYDKVIFIDADIIVLRNLDLLFHFPQMS 414
Query: 114 AAGNDKVLFNSGVMVIEPSLCKFEDLMLKSFQVSSYNGGDQGFLNEVFTWWHRLPKRINH 173
A GND ++NSG+MVIEPS C F +M + ++ SYNGGDQG+LNE+F WWHRLP+R+N
Sbjct: 415 ATGNDVWIYNSGIMVIEPSNCTFTTIMSQRSEIVSYNGGDQGYLNEIFVWWHRLPRRVNF 474
Query: 174 LKVFSKQDDKEHQVGDGL--------YAIHYLGLKPWM 203
LK F KE + + L YA+HYLG KPW+
Sbjct: 475 LKNFWSNTTKERNIKNNLFAAEPPQVYAVHYLGWKPWL 512
>sp|Q9LSB1|GUX1_ARATH UDP-glucuronate:xylan alpha-glucuronosyltransferase 1
OS=Arabidopsis thaliana GN=GUX1 PE=2 SV=1
Length = 659
Score = 207 bits (527), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 109/215 (50%), Positives = 135/215 (62%), Gaps = 34/215 (15%)
Query: 23 QREAYVTILHSSEAYVCGAIALAESIIQKNSSRDLVLLHDKSISGKSLRSLRAAGWKTKW 82
+REAY TILHS+ YVCGAIA A+SI Q S+RDLV+L D +ISG L AAGW+ +
Sbjct: 320 KREAYATILHSAHVYVCGAIAAAQSIRQSGSTRDLVILVDDNISGYHRSGLEAAGWQIRT 379
Query: 83 ISRIRSPFAKKDSYNEWNYSKLR-------------------------FFFYPELSAAGN 117
I RIR+P A+KD+YNEWNYSK R F PE+SA GN
Sbjct: 380 IQRIRNPKAEKDAYNEWNYSKFRLWQLTDYDKIIFIDADLLILRNIDFLFSMPEISATGN 439
Query: 118 DKVLFNSGVMVIEPSLCKFEDLMLKSFQVSSYNGGDQGFLNEVFTWWHRLPKRINHLKVF 177
+ LFNSGVMVIEP C F+ LM ++ SYNGGDQG+LNEVFTWWHR+PK +N LK F
Sbjct: 440 NGTLFNSGVMVIEPCNCTFQLLMEHINEIESYNGGDQGYLNEVFTWWHRIPKHMNFLKHF 499
Query: 178 --SKQDDKEHQVGD-------GLYAIHYLGLKPWM 203
+DD + + + LY +HYLG+KPW+
Sbjct: 500 WIGDEDDAKRKKTELFGAEPPVLYVLHYLGMKPWL 534
>sp|Q8W4A7|GUX3_ARATH Putative UDP-glucuronate:xylan alpha-glucuronosyltransferase 3
OS=Arabidopsis thaliana GN=GUX3 PE=2 SV=1
Length = 618
Score = 187 bits (475), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 103/221 (46%), Positives = 133/221 (60%), Gaps = 35/221 (15%)
Query: 18 YTTYHQREAYVTILHSSEAYVCGAIALAESIIQKNSSRDLVLLHDKSISGKSLRSLRAAG 77
Y+ ++EAY TILHS++ YVCGAIA A+SI S+RDLV+L D++IS L AAG
Sbjct: 278 YSAGAKKEAYATILHSAQFYVCGAIAAAQSIRMSGSTRDLVILVDETISEYHKSGLVAAG 337
Query: 78 WKTKWISRIRSPFAKKDSYNEWNYSKLRF-------------------------FFYPEL 112
WK + RIR+P A ++YNEWNYSK R F +PE+
Sbjct: 338 WKIQMFQRIRNPNAVPNAYNEWNYSKFRLWQLTEYSKIIFIDADMLILRNIDFLFEFPEI 397
Query: 113 SAAGNDKVLFNSGVMVIEPSLCKFEDLMLKSFQVSSYNGGDQGFLNEVFTWWHRLPKRIN 172
SA GN+ LFNSG+MV+EPS F+ LM +V SYNGGDQG+LNE+FTWWHR+PK +N
Sbjct: 398 SATGNNATLFNSGLMVVEPSNSTFQLLMDNINEVVSYNGGDQGYLNEIFTWWHRIPKHMN 457
Query: 173 HLKVFSKQDDKEHQVGDG---------LYAIHYLGL-KPWM 203
LK F + D+ E + LY +HYLG KPW+
Sbjct: 458 FLKHFWEGDEPEIKKMKTSLFGADPPILYVLHYLGYNKPWL 498
>sp|Q8GWB7|GUX6_ARATH Putative glucuronosyltransferase PGSIP6 OS=Arabidopsis thaliana
GN=PGSIP6 PE=2 SV=1
Length = 537
Score = 78.2 bits (191), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 55/164 (33%), Positives = 81/164 (49%), Gaps = 28/164 (17%)
Query: 24 REAYVTILHSSEAYVCGAIALAESIIQKNSSRDLVLLHDKSISGKSLRSLRAAGWKTKWI 83
+ AYVT+L+ E ++ G L +SI S++D+V L +S S + L+A GWK + I
Sbjct: 30 KVAYVTLLYGDE-FLLGVRVLGKSIRDTGSTKDMVALVSDGVSDYSKKLLKADGWKVEKI 88
Query: 84 SRIRSP-----------FAKKDSYNEWNYSKLRFFFYPELSAAGNDKVLF---------- 122
S + +P + K +N +Y K+ + + N + LF
Sbjct: 89 SLLANPNQVHPTRFWGVYTKLKIFNMTDYKKV-VYLDADTIVVKNIEDLFKCSKFCANLK 147
Query: 123 -----NSGVMVIEPSLCKFEDLMLKSFQVSSYNGGDQGFLNEVF 161
NSGVMV+EPS F D+M K +SSY GGDQGFLN +
Sbjct: 148 HSERLNSGVMVVEPSEALFNDMMRKVKTLSSYTGGDQGFLNSYY 191
>sp|P46976|GLYG_HUMAN Glycogenin-1 OS=Homo sapiens GN=GYG1 PE=1 SV=4
Length = 350
Score = 55.8 bits (133), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 57/219 (26%), Positives = 94/219 (42%), Gaps = 43/219 (19%)
Query: 25 EAYVTILHSSEAYVCGAIALAESIIQKNSSRDLVLLHDKSISGKSLRSLRAA-------- 76
+A+VT L +++AY GA+ L S+ Q ++R LV+L +S + L
Sbjct: 4 QAFVT-LTTNDAYAKGALVLGSSLKQHRTTRRLVVLATPQVSDSMRKVLETVFDEVIMVD 62
Query: 77 ---GWKTKWISRIRSP-----FAKKDSYNEWNYSKLRF--------------FFYPELSA 114
+ ++ ++ P K ++ YSK F F ELSA
Sbjct: 63 VLDSGDSAHLTLMKRPELGVTLTKLHCWSLTQYSKCVFMDADTLVLANIDDLFDREELSA 122
Query: 115 AGND--KVLFNSGVMVIEPSLCKFEDLMLKSFQVSSYNGGDQGFLNEVFTWWHRLPKR-- 170
A + FNSGV V +PS+ + L+ + + S++GGDQG LN F+ W R
Sbjct: 123 APDPGWPDCFNSGVFVYQPSVETYNQLLHLASEQGSFDGGDQGILNTFFSSWATTDIRKH 182
Query: 171 ------INHLKVFSKQDDKEHQVGDGLYAIHYLG-LKPW 202
++ + ++S G +H+LG +KPW
Sbjct: 183 LPFIYNLSSISIYSYL-PAFKVFGASAKVVHFLGRVKPW 220
>sp|Q9R062|GLYG_MOUSE Glycogenin-1 OS=Mus musculus GN=Gyg1 PE=2 SV=3
Length = 333
Score = 55.1 bits (131), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 47/172 (27%), Positives = 77/172 (44%), Gaps = 33/172 (19%)
Query: 25 EAYVTILHSSEAYVCGAIALAESIIQKNSSRDLVLLHDKSISGKSLRSLRAA-------- 76
+A+VT L +++AY GA+ L S+ Q ++R +V+L +S + L
Sbjct: 4 QAFVT-LTTNDAYAKGALVLGSSLKQHRTTRRMVVLTSPQVSDSMRKVLETVFDDVIMVD 62
Query: 77 ---GWKTKWISRIRSP-----FAKKDSYNEWNYSKLRF--------------FFYPELSA 114
+ ++ ++ P K ++ YSK F F ELSA
Sbjct: 63 VLDSGDSAHLTLMKRPELGITLTKLHCWSLTQYSKCVFMDADTLVLSNIDDLFEREELSA 122
Query: 115 AGND--KVLFNSGVMVIEPSLCKFEDLMLKSFQVSSYNGGDQGFLNEVFTWW 164
A + FNSGV V +PS+ + L+ + + S++GGDQG LN F+ W
Sbjct: 123 APDPGWPDCFNSGVFVYQPSIETYNQLLHLASEQGSFDGGDQGLLNTYFSGW 174
>sp|P13280|GLYG_RABIT Glycogenin-1 OS=Oryctolagus cuniculus GN=GYG1 PE=1 SV=3
Length = 333
Score = 54.7 bits (130), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 57/219 (26%), Positives = 93/219 (42%), Gaps = 43/219 (19%)
Query: 25 EAYVTILHSSEAYVCGAIALAESIIQKNSSRDLVLLHDKSISGKSLRSLRAA-------- 76
+A+VT L +++AY GA+ L S+ Q +SR L +L +S ++L
Sbjct: 4 QAFVT-LTTNDAYAKGALVLGSSLKQHRTSRRLAVLTTPQVSDTMRKALEIVFDEVITVD 62
Query: 77 ---GWKTKWISRIRSP-----FAKKDSYNEWNYSKLRF--------------FFYPELSA 114
+ ++ ++ P K ++ YSK F F ELSA
Sbjct: 63 ILDSGDSAHLTLMKRPELGVTLTKLHCWSLTQYSKCVFMDADTLVLANIDDLFEREELSA 122
Query: 115 AGND--KVLFNSGVMVIEPSLCKFEDLMLKSFQVSSYNGGDQGFLNEVFTWWHRLPKR-- 170
A + FNSGV V +PS+ + L+ + + S++GGDQG LN F W R
Sbjct: 123 APDPGWPDCFNSGVFVYQPSVETYNQLLHVASEQGSFDGGDQGLLNTFFNSWATTDIRKH 182
Query: 171 ------INHLKVFSKQDDKEHQVGDGLYAIHYLG-LKPW 202
++ + ++S + G +H+LG KPW
Sbjct: 183 LPFIYNLSSISIYSYLPAFK-AFGANAKVVHFLGQTKPW 220
>sp|O08730|GLYG_RAT Glycogenin-1 OS=Rattus norvegicus GN=Gyg1 PE=2 SV=4
Length = 333
Score = 54.3 bits (129), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 58/219 (26%), Positives = 94/219 (42%), Gaps = 43/219 (19%)
Query: 25 EAYVTILHSSEAYVCGAIALAESIIQKNSSRDLVLLHDKSISGKSLRSLRAA-------- 76
+A+VT L +++AY GA+ L S+ Q ++R V+L +S + L
Sbjct: 4 QAFVT-LTTNDAYAKGALVLGSSLKQHRTTRRTVVLASPQVSDSMRKVLETVFDEVIMVD 62
Query: 77 ---GWKTKWISRIRSP-----FAKKDSYNEWNYSKLRF--------------FFYPELSA 114
+ ++ ++ P K ++ YSK F F ELSA
Sbjct: 63 VLDSGDSAHLTLMKRPELGITLTKLHCWSLTQYSKCVFMDADTLVLSNIDDLFEREELSA 122
Query: 115 AGND--KVLFNSGVMVIEPSLCKFEDLMLKSFQVSSYNGGDQGFLNEVFTWW------HR 166
A + FNSGV V +PS+ + L+ + + S++GGDQG LN F+ W
Sbjct: 123 APDPGWPDCFNSGVFVYQPSIETYNQLLHLASEQGSFDGGDQGLLNTYFSGWATTDITKH 182
Query: 167 LP--KRINHLKVFSKQDDKEHQVGDGLYAIHYLG-LKPW 202
LP ++ L ++S + G +H+LG KPW
Sbjct: 183 LPFVYNLSSLSIYSYLPAFK-AFGKNAKVVHFLGRTKPW 220
>sp|Q8H1S1|GOLS6_ARATH Galactinol synthase 6 OS=Arabidopsis thaliana GN=GOLS6 PE=2 SV=1
Length = 336
Score = 49.7 bits (117), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 62/265 (23%), Positives = 98/265 (36%), Gaps = 71/265 (26%)
Query: 8 MSQARLAKLNYTTYHQR--EAYVTILHSSEAYVCGAIALAESIIQKNSSRDLVLLHDKSI 65
M+ + K + T H R AYVT L ++ Y G + LA+ + + S+ LV+ +
Sbjct: 6 MTVEKSIKADVTVSHDRVKRAYVTFLAGNKDYWMGVVGLAKGLRKVKSAYPLVVAILPDV 65
Query: 66 SGKSLRSLRAAGWKTKWISRIRSP------------------------------------ 89
+ + L A G + I + P
Sbjct: 66 PEEHRQILLAQGCIIREIEPVYPPENKTGYSMAYYVINYSKLRIWEFVEYEKMIYLDGDI 125
Query: 90 -----------------FAKKDSYNEWNYSKLRFF------------FYPELSAAGNDKV 120
+A KD + E ++SK F +P S V
Sbjct: 126 QVFSNIDHLFDTPRGYLYAVKDCFCEISWSKTPQFKIGYCQQCPEKVTWPVESLGSPPPV 185
Query: 121 LFNSGVMVIEPSLCKFEDLMLKSFQVSSYN-GGDQGFLNEVFTWWHRLPKRINHLKVFSK 179
FN+G++V EP+L +EDL L+ Q+++ +Q FLNE FT ++ +L +
Sbjct: 186 YFNAGMLVFEPNLLTYEDL-LRVVQITTPTYFAEQDFLNEYFTDIYKPIPSTYNLVMAML 244
Query: 180 QDDKEHQVGDGLYAIHYL--GLKPW 202
EH D + IHY G KPW
Sbjct: 245 WRHPEHIDLDQISVIHYCANGSKPW 269
>sp|O15488|GLYG2_HUMAN Glycogenin-2 OS=Homo sapiens GN=GYG2 PE=1 SV=2
Length = 501
Score = 44.3 bits (103), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 36/105 (34%), Positives = 50/105 (47%), Gaps = 16/105 (15%)
Query: 111 ELSAAGND--KVLFNSGVMVIEPSLCKFEDLMLKSFQVSSYNGGDQGFLNEVFTWW---- 164
E SAA + FNSGV V +PSL + L+ + + S++G DQG LN F W
Sbjct: 152 EFSAAPDPGWPDCFNSGVFVFQPSLHTHKLLLQHAMEHGSFDGADQGLLNSFFRNWSTTD 211
Query: 165 -HR-LP----KRINHLKVFSKQDDKEHQVGDGLYAIHYLG-LKPW 202
H+ LP N + +S Q G +H+LG +KPW
Sbjct: 212 IHKHLPFIYNLSSNTMYTYSP---AFKQFGSSAKVVHFLGSMKPW 253
>sp|O22693|GOLS4_ARATH Galactinol synthase 4 OS=Arabidopsis thaliana GN=GOLS4 PE=2 SV=1
Length = 334
Score = 38.9 bits (89), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 28/93 (30%), Positives = 45/93 (48%), Gaps = 3/93 (3%)
Query: 24 REAYVTILHSSEAYVCGAIALAESIIQKNSSRDLVLLHDKSISGKSLRSLRAAGWKTKWI 83
R AYVT L + YV G + LA+ + + S+ LV+ + + LR+ G + I
Sbjct: 22 RRAYVTFLAGNGDYVKGVVGLAKGLRKVKSAYPLVVAMLPDVPEEHREILRSQGCVVREI 81
Query: 84 SRIRSPFAKKD---SYNEWNYSKLRFFFYPELS 113
+ P + + +Y NYSKLR + + E S
Sbjct: 82 EPVYPPDNQVEFAMAYYVLNYSKLRIWNFEEYS 114
Score = 33.1 bits (74), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 25/97 (25%), Positives = 47/97 (48%), Gaps = 4/97 (4%)
Query: 109 YPELSAAGNDKVLFNSGVMVIEPSLCKFEDLMLKSFQVSSYNG-GDQGFLNEVFTWWHRL 167
+PE + + FN+G+ V EPS +E L L++ +++ + +Q FLN F ++
Sbjct: 172 WPEDMESPPPPLYFNAGMFVFEPSPLTYESL-LQTLEITPPSPFAEQDFLNMFFEKVYKP 230
Query: 168 PKRINHLKVFSKQDDKEHQVGDGLYAIHYL--GLKPW 202
+ +L + E+ + + +HY G KPW
Sbjct: 231 IPLVYNLVLAMLWRHPENVELEKVKVVHYCAAGSKPW 267
>sp|Q9FFA1|GOLS5_ARATH Galactinol synthase 5 OS=Arabidopsis thaliana GN=GOLS5 PE=2 SV=1
Length = 333
Score = 38.5 bits (88), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 35/128 (27%), Positives = 57/128 (44%), Gaps = 16/128 (12%)
Query: 90 FAKKDSYNEWNYSKLRFF------------FYPELSAAGNDKVLFNSGVMVIEPSLCKFE 137
+A KD + E ++SK + +P S V FN+G++V P+L +E
Sbjct: 140 YAVKDCFCEVSWSKTPQYKIGYCQQSPEKVTWPVESLGAPPPVYFNAGMLVFGPNLVTYE 199
Query: 138 DLMLKSFQVSSYN-GGDQGFLNEVFTWWHRLPKRINHLKVFSKQDDKEHQVGDGLYAIHY 196
DL L+ Q+++ +Q FLN F ++ +L + EH D + +HY
Sbjct: 200 DL-LRVVQITTPTYFAEQDFLNIYFRDIYKPIPSTYNLVMAMLWRHPEHIDLDQISVVHY 258
Query: 197 L--GLKPW 202
G KPW
Sbjct: 259 CANGSKPW 266
>sp|Q8VZP6|GUX8_ARATH Putative glucuronosyltransferase PGSIP8 OS=Arabidopsis thaliana
GN=PGSIP8 PE=2 SV=1
Length = 497
Score = 35.8 bits (81), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 19/47 (40%), Positives = 29/47 (61%), Gaps = 3/47 (6%)
Query: 118 DKVLFNSGVMVIEPSLCKFEDLMLKSFQVSSYN--GGDQGFLNEVFT 162
+ +F++G+ V++PS+ F+D ML QV N G DQGFL F+
Sbjct: 189 NPCIFHTGLFVLQPSVEVFKD-MLHELQVGRKNPDGADQGFLVSYFS 234
>sp|Q5UNW1|YR707_MIMIV Uncharacterized protein R707 OS=Acanthamoeba polyphaga mimivirus
GN=MIMI_R707 PE=4 SV=1
Length = 281
Score = 35.4 bits (80), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 15/41 (36%), Positives = 26/41 (63%)
Query: 26 AYVTILHSSEAYVCGAIALAESIIQKNSSRDLVLLHDKSIS 66
AYVT+++ + Y+ GA+ L ++ Q N+ D V+L K +S
Sbjct: 5 AYVTVIYGNNIYLTGALVLGYTLQQTNTKYDRVILATKDVS 45
>sp|Q947G8|GOLS1_SOLLC Galactinol synthase 1 OS=Solanum lycopersicum GN=GOLS1 PE=2 SV=1
Length = 318
Score = 35.0 bits (79), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 31/104 (29%), Positives = 43/104 (41%), Gaps = 4/104 (3%)
Query: 14 AKLNYTTYHQREAYVTILHSSEAYVCGAIALAESIIQKNSSRDLVLLHDKSISGKSLRSL 73
K+ T AYVT L + YV G + LA+ +I+ S LV+ + + L
Sbjct: 9 TKMATTIQKSSCAYVTFLAGNGDYVKGVVGLAKGLIKAKSMYPLVVAILPDVPEEHRMIL 68
Query: 74 RAAGWKTKWISRIRSPFAKKD----SYNEWNYSKLRFFFYPELS 113
G K I + D SY NYSKLR + + E S
Sbjct: 69 TRHGCIVKEIEPLAPSLQSLDKYARSYYVLNYSKLRIWEFVEYS 112
>sp|F4JMI5|GUX7_ARATH Putative glucuronosyltransferase PGSIP7 OS=Arabidopsis thaliana
GN=PGSIP7 PE=3 SV=1
Length = 494
Score = 34.7 bits (78), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 19/54 (35%), Positives = 30/54 (55%), Gaps = 3/54 (5%)
Query: 111 ELSAAGNDKVLFNSGVMVIEPSLCKFEDLMLKSFQVSSYN--GGDQGFLNEVFT 162
+ A + +F++G+ V++PS+ F D ML +V N G DQGFL F+
Sbjct: 178 QFCAVFINPCIFHTGLFVLQPSMEVFRD-MLHELEVKRDNPDGADQGFLVSYFS 230
>sp|P36143|GLG1_YEAST Glycogenin-1 OS=Saccharomyces cerevisiae (strain ATCC 204508 /
S288c) GN=GLG1 PE=1 SV=4
Length = 616
Score = 33.9 bits (76), Expect = 0.68, Method: Composition-based stats.
Identities = 17/41 (41%), Positives = 24/41 (58%)
Query: 121 LFNSGVMVIEPSLCKFEDLMLKSFQVSSYNGGDQGFLNEVF 161
+FNSGVM++ P L F+ +S +G DQG LN+ F
Sbjct: 156 MFNSGVMMLIPDADTASVLQNYIFENTSIDGSDQGILNQFF 196
>sp|P47011|GLG2_YEAST Glycogenin-2 OS=Saccharomyces cerevisiae (strain ATCC 204508 /
S288c) GN=GLG2 PE=1 SV=1
Length = 380
Score = 33.9 bits (76), Expect = 0.84, Method: Compositional matrix adjust.
Identities = 39/149 (26%), Positives = 64/149 (42%), Gaps = 19/149 (12%)
Query: 35 EAYVCGAIALAESIIQKNSSRDLVLLHDKSISGKSLRSLRAAGWKTKWISRIRSPFAKKD 94
E + + E I+KN + +L LL +S L+ A W+ ++ A
Sbjct: 69 EIIIIEPLKDQEKSIEKNKA-NLELLKRPELSHTLLK---ARLWELVQFDQVLFLDADTL 124
Query: 95 SYNEWNYSKLRFFFYPELS----AAGND---KVLFNSGVMVIEPSL---CKFEDLMLKSF 144
N+ + LR YPE + AA D +FN+GV+++ P L +D ++K+
Sbjct: 125 PLNKEFFEILRL--YPEQTRFQIAAVPDIGWPDMFNTGVLLLIPDLDMATSLQDFLIKTV 182
Query: 145 QVSSYNGGDQGFLNEVFTWWHRLPKRINH 173
S +G DQG N+ F K + H
Sbjct: 183 ---SIDGADQGIFNQFFNPICNYSKEVLH 208
>sp|Q9FXB2|GOLS2_ARATH Galactinol synthase 2 OS=Arabidopsis thaliana GN=GOLS2 PE=1 SV=1
Length = 335
Score = 33.5 bits (75), Expect = 0.89, Method: Compositional matrix adjust.
Identities = 25/96 (26%), Positives = 42/96 (43%), Gaps = 2/96 (2%)
Query: 109 YPELSAAGNDKVLFNSGVMVIEPSLCKFEDLMLKSFQVSSYNGGDQGFLNEVFTWWHRLP 168
+PE + FN+G+ V EP+L + +L+ V +Q FLN F ++
Sbjct: 171 WPEAKLGPKPPLYFNAGMFVYEPNLSTYHNLLETVKIVPPTLFAEQDFLNMYFKDIYKPI 230
Query: 169 KRINHLKVFSKQDDKEHQVGDGLYAIHYL--GLKPW 202
+ +L + E+ D + +HY G KPW
Sbjct: 231 PPVYNLVLAMLWRHPENIELDQVKVVHYCAAGAKPW 266
>sp|A6ZQJ2|GLG2_YEAS7 Glycogenin-2 OS=Saccharomyces cerevisiae (strain YJM789) GN=GLG2
PE=3 SV=1
Length = 380
Score = 33.5 bits (75), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 39/149 (26%), Positives = 64/149 (42%), Gaps = 19/149 (12%)
Query: 35 EAYVCGAIALAESIIQKNSSRDLVLLHDKSISGKSLRSLRAAGWKTKWISRIRSPFAKKD 94
E + + E I+KN + +L LL +S L+ A W+ ++ A
Sbjct: 69 EIIIIEPLKDQEKSIEKNKA-NLELLKRPELSHTLLK---ARLWELVQFDQVLFLDADTL 124
Query: 95 SYNEWNYSKLRFFFYPELS----AAGND---KVLFNSGVMVIEPSL---CKFEDLMLKSF 144
N+ + LR YPE + AA D +FN+GV+++ P L +D ++K+
Sbjct: 125 PLNKDFFEILRL--YPEQTRFQIAAVPDIGWPDMFNTGVLLLIPDLDMATSLQDFLIKTV 182
Query: 145 QVSSYNGGDQGFLNEVFTWWHRLPKRINH 173
S +G DQG N+ F K + H
Sbjct: 183 ---SIDGADQGIFNQFFNPICNYSKEVLH 208
>sp|C7G304|GOLS2_SOLLC Galactinol synthase 2 OS=Solanum lycopersicum GN=GOLS2 PE=2 SV=1
Length = 338
Score = 32.7 bits (73), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 18/41 (43%), Positives = 26/41 (63%), Gaps = 2/41 (4%)
Query: 122 FNSGVMVIEPSLCKFEDLMLKSFQVSSYNG-GDQGFLNEVF 161
FN+G+ V EPSL ++DL LK+ +V+ +Q FLN F
Sbjct: 185 FNAGMFVYEPSLSTYDDL-LKTLKVTPPTPFAEQDFLNMYF 224
>sp|O31530|YETA_BACSU Uncharacterized protein YetA OS=Bacillus subtilis (strain 168)
GN=yetA PE=4 SV=2
Length = 857
Score = 32.7 bits (73), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 33/125 (26%), Positives = 57/125 (45%), Gaps = 13/125 (10%)
Query: 42 IALAESIIQKNSSRDLVLLHDKSISGKSLRSLRAAGWKTKWISRIRSPFAKKDSYNEWNY 101
+A+ E+ +K S+ +VLLH++S+S ++ +G K + +I +Y EW
Sbjct: 135 VAIRET--RKESAAKMVLLHERSVSFIKRAAIEQSG-PVKAVVKIEGVHVLHKTYEEWLP 191
Query: 102 SKLRFFFYPELSAAG--NDKVLFNSGVMVIEPSLCKFEDLMLKSFQVSSYN-----GGDQ 154
+R FY LS G + +++ SG + L DL L+ YN G++
Sbjct: 192 FVIRLTFYAGLSEIGLVHTQLIDRSGKLEFVKGLGIEFDLFLEG---EPYNRHFRFAGEK 248
Query: 155 GFLNE 159
G E
Sbjct: 249 GMYKE 253
>sp|A7A018|GLG1_YEAS7 Glycogenin-1 OS=Saccharomyces cerevisiae (strain YJM789) GN=GLG1
PE=2 SV=2
Length = 616
Score = 31.6 bits (70), Expect = 3.6, Method: Composition-based stats.
Identities = 16/41 (39%), Positives = 23/41 (56%)
Query: 121 LFNSGVMVIEPSLCKFEDLMLKSFQVSSYNGGDQGFLNEVF 161
+FNSGVM++ P L + +S +G DQG LN+ F
Sbjct: 156 MFNSGVMMLIPDTDTASVLQNYIIENTSIDGSDQGILNQFF 196
>sp|Q9XGN3|GOLS2_AJURE Galactinol synthase 2 (Fragment) OS=Ajuga reptans GN=GOLS2 PE=1
SV=1
Length = 292
Score = 31.6 bits (70), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 24/84 (28%), Positives = 40/84 (47%), Gaps = 4/84 (4%)
Query: 122 FNSGVMVIEPSLCKFEDLMLKSFQVSSYNG-GDQGFLNEVFTWWHRLPKRINHLKVFSKQ 180
FN+G+ V EPSL + DL L + +++ +Q FLN +R + +L +
Sbjct: 145 FNAGMFVYEPSLPTYHDL-LHTLKITPPTPFAEQDFLNMFLRDVYRPIPNVYNLVLAMLW 203
Query: 181 DDKEHQVGDGLYAIHYL--GLKPW 202
E+ + + +HY G KPW
Sbjct: 204 RHPENVNLEAVKVVHYCAAGSKPW 227
>sp|P46958|IDS2_YEAST IME2-dependent-signaling protein OS=Saccharomyces cerevisiae
(strain ATCC 204508 / S288c) GN=IDS2 PE=1 SV=1
Length = 469
Score = 31.2 bits (69), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 19/47 (40%), Positives = 27/47 (57%), Gaps = 1/47 (2%)
Query: 22 HQREAYVTILHSSEAYVCGA-IALAESIIQKNSSRDLVLLHDKSISG 67
+Q A++TILH S A A I LAES+ NS +L +LH ++
Sbjct: 182 NQSRAWITILHDSSAESLQAVIVLAESLKNVNSQYNLWVLHSSEVNA 228
>sp|O80518|GOLS3_ARATH Galactinol synthase 3 OS=Arabidopsis thaliana GN=GOLS3 PE=1 SV=1
Length = 334
Score = 31.2 bits (69), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 25/96 (26%), Positives = 41/96 (42%), Gaps = 2/96 (2%)
Query: 109 YPELSAAGNDKVLFNSGVMVIEPSLCKFEDLMLKSFQVSSYNGGDQGFLNEVFTWWHRLP 168
+PE + FN+G+ V EPSL + +L+ V +Q FLN F ++
Sbjct: 165 WPESELGPKPPLYFNAGMFVYEPSLPTYYNLLETLKVVPPTPFAEQDFLNMYFKDIYKPI 224
Query: 169 KRINHLKVFSKQDDKEHQVGDGLYAIHYL--GLKPW 202
+ +L + E+ + +HY G KPW
Sbjct: 225 PPVYNLVLAMLWRHPENIELNEAKVVHYCAAGAKPW 260
>sp|P09436|SYIC_YEAST Isoleucine--tRNA ligase, cytoplasmic OS=Saccharomyces cerevisiae
(strain ATCC 204508 / S288c) GN=ILS1 PE=1 SV=1
Length = 1072
Score = 30.8 bits (68), Expect = 5.9, Method: Compositional matrix adjust.
Identities = 17/51 (33%), Positives = 30/51 (58%), Gaps = 2/51 (3%)
Query: 28 VTILHSSEAYVCGAIALAESIIQKNSSRDLVLLHDKSISGKSLRSLRAAGW 78
+ ILHS E+Y+ AL II++ + RD+V+ D++ G +++ A W
Sbjct: 852 LVILHSDESYLKDVEALKNYIIEELNVRDVVITSDEAKYGVEYKAV--ADW 900
>sp|P19817|RFAJ_SALTY Lipopolysaccharide 1,2-glucosyltransferase OS=Salmonella
typhimurium (strain LT2 / SGSC1412 / ATCC 700720)
GN=rfaJ PE=3 SV=4
Length = 336
Score = 30.8 bits (68), Expect = 5.9, Method: Compositional matrix adjust.
Identities = 30/95 (31%), Positives = 43/95 (45%), Gaps = 14/95 (14%)
Query: 122 FNSGVMVI------EPSLCKFEDLMLKSFQVSSYNGGDQGFLNEVF-TWWHRLPKRINHL 174
FNSGV+ + E +L K L+L + S+ DQ LN + LP+ N +
Sbjct: 176 FNSGVVFVNLKLWKENALTKKAFLLLAGKEADSFKYPDQDVLNILLQDKVIFLPRPYNTI 235
Query: 175 -KVFSKQDDKEHQ-----VGDGLYAIHYLG-LKPW 202
+ S+ DK H+ + D IHY G KPW
Sbjct: 236 YTIKSELKDKSHKKYSNIINDNTILIHYTGATKPW 270
>sp|O22893|GOLS1_ARATH Galactinol synthase 1 OS=Arabidopsis thaliana GN=GOLS1 PE=1 SV=1
Length = 344
Score = 30.8 bits (68), Expect = 7.2, Method: Compositional matrix adjust.
Identities = 24/85 (28%), Positives = 45/85 (52%), Gaps = 6/85 (7%)
Query: 122 FNSGVMVIEPSLCKFEDLMLKSFQVSSYNG-GDQGFLNEVFTWWHR-LPKRINHLKVFSK 179
FN+G+ + EP+L +EDL L++ +++ +Q FLN F ++ +P N +
Sbjct: 192 FNAGMFLYEPNLETYEDL-LRTLKITPPTPFAEQDFLNMYFKKIYKPIPLVYNLVLAMLW 250
Query: 180 QDDKEHQVGDGLYAIHYL--GLKPW 202
+ + ++G + +HY G KPW
Sbjct: 251 RHPENVELG-KVKVVHYCAAGSKPW 274
>sp|Q9XW42|ANM7_CAEEL Protein arginine N-methyltransferase 7 OS=Caenorhabditis elegans
GN=prmt-7 PE=3 SV=2
Length = 647
Score = 30.4 bits (67), Expect = 7.9, Method: Compositional matrix adjust.
Identities = 22/69 (31%), Positives = 34/69 (49%), Gaps = 3/69 (4%)
Query: 95 SYNEWNYSKLRFFFYPE-LSAAGNDKVLFNSGVMVIEPSLCKFEDLMLKSFQVSSYNGGD 153
+ N WN+ LRF + E L D++ + V++ KFEDL + V + NG D
Sbjct: 453 AMNPWNH--LRFLYDVEVLKMMHGDELRVEPHMGVLKAIPEKFEDLQNIASDVGTVNGFD 510
Query: 154 QGFLNEVFT 162
F +E+ T
Sbjct: 511 LSFFDEIST 519
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.321 0.135 0.422
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 74,501,258
Number of Sequences: 539616
Number of extensions: 2824245
Number of successful extensions: 6091
Number of sequences better than 100.0: 34
Number of HSP's better than 100.0 without gapping: 16
Number of HSP's successfully gapped in prelim test: 18
Number of HSP's that attempted gapping in prelim test: 6058
Number of HSP's gapped (non-prelim): 40
length of query: 203
length of database: 191,569,459
effective HSP length: 112
effective length of query: 91
effective length of database: 131,132,467
effective search space: 11933054497
effective search space used: 11933054497
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 58 (26.9 bits)