BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 000842
(1258 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|Q0WL80|UGGG_ARATH UDP-glucose:glycoprotein glucosyltransferase OS=Arabidopsis thaliana
GN=UGGT PE=1 SV=1
Length = 1613
Score = 1918 bits (4968), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 940/1266 (74%), Positives = 1078/1266 (85%), Gaps = 13/1266 (1%)
Query: 1 MQEISQNFPSVVSSLSRMKLNDSIKDEIVANQRYMPPGKSLMALNGALINIEDIDLYLLI 60
MQEI+QNFPSVVSSLSRMKLN+SIKDEI++NQR +PPGK+L+ALNGAL+NIEDIDLY+L+
Sbjct: 353 MQEINQNFPSVVSSLSRMKLNESIKDEILSNQRMVPPGKALLALNGALLNIEDIDLYMLM 412
Query: 61 DLVHQELSLADQFSKLKIPRTITQKLLSTVPPAESSMFRVDFRSTHVQYLNNLEEDAMYK 120
DL HQELSLA+ FSKLKIP +KLL T P E +RVDFRS HV YLNNLEED MYK
Sbjct: 413 DLAHQELSLANHFSKLKIPDGAIRKLLLTTPLPEPDSYRVDFRSVHVTYLNNLEEDDMYK 472
Query: 121 RWRSNINEILMPVFPGQLRYIRKNLFHAVYVLDPATVCGLEVIDMIMSLYENHFPLRFGV 180
RWRSNINEILMP FPGQLRYIRKNLFHAVYV+DPAT CGLE I+ + SLYEN P+RFGV
Sbjct: 473 RWRSNINEILMPAFPGQLRYIRKNLFHAVYVIDPATACGLESIETLRSLYENQLPVRFGV 532
Query: 181 ILYSSKFIKSIEINGGELHSPVAEDDSPVNEDISSLIIRLFLFIKESHGTQTAFQFLSNV 240
ILYS++ IK+IE NGG++ S A ++ V ED+S+++IRLFL+IKE HG QTAFQFL N+
Sbjct: 533 ILYSTQLIKTIENNGGQIPSSDAVTNAQVKEDLSTMVIRLFLYIKEHHGIQTAFQFLGNL 592
Query: 241 NRLRMESADSADDDALEIHHVEGAFVETILPKAKTPPQDMLLKLEKEKTFMDQSQESSMF 300
N LR ESADS++ D +E HV+GAFVETILPK KT PQD+LLKL +E T + S+ SSMF
Sbjct: 593 NTLRTESADSSEAD-IEQEHVDGAFVETILPKVKTLPQDILLKLRQEHTLKEASEASSMF 651
Query: 301 VFKLGLTKLKCCLLMNGLVSES-SEEALLNAMNDELQRIQEQVYYGNINSYTDVLEKVLS 359
VFKLGL KLKC LMNGLV +S EE LLNAMN+EL +IQEQVYYG I S+T VL+K+LS
Sbjct: 652 VFKLGLAKLKCSFLMNGLVFDSVEEETLLNAMNEELPKIQEQVYYGQIESHTKVLDKLLS 711
Query: 360 ESGINRYNPQIITDAKVKPKFISLASSFLGRETELKDINYLHSPETVDDVKPVTHLLAVD 419
ESG++RYNPQII+ K KP+F+SLASS E+ L D+NYLHSPET +DVK VTHLLA D
Sbjct: 712 ESGLSRYNPQIISGGKNKPRFVSLASSTRKGESMLNDVNYLHSPETSEDVKYVTHLLAAD 771
Query: 420 VTSKKGMKLLHEGIRFLIGGSNGARLGVLFSASREADLPSIIFVKAFEITASTYSHKKKV 479
V +KKGMKLLHEG+R+LIGGS ARLGVLFS+S+ AD S++F+K FE TAS++SHK+KV
Sbjct: 772 VATKKGMKLLHEGVRYLIGGSKSARLGVLFSSSQNADPHSLLFIKFFEKTASSFSHKEKV 831
Query: 480 LEFLDQLCSFYERTYLLASSATADSTQAFIDKVCEFAEANGLSSKVYRASLPEYSKGKVR 539
L FLD+LC FYER YLL +S + S+Q FIDKV E A+ GLSSK YR+ L E ++
Sbjct: 832 LYFLDKLCLFYEREYLLKTSVESASSQMFIDKVLELADEYGLSSKAYRSCLVESVDEELL 891
Query: 540 KQLNKVVQFLHRQLGVESGANAVITNGRVTFPIDESTFLSHDLSLLESVEFKHRIKHIWE 599
K+L KV QFL +LG+ES ANA+I+NGRV FP+DE TFL DL LLES+EF R+K + E
Sbjct: 892 KRLTKVAQFLSWELGLESDANAIISNGRVIFPVDERTFLGQDLHLLESMEFNQRVKPVQE 951
Query: 600 IIEEVNWQETYPDIDPDMLTSKFVSDIILFVTSSMAMRDRSSESARFEILSAEYSAVVFN 659
IIE + WQ D+DPD+LTSK+ SD+ +FV+S+MA RDRSSESARFE+L++EYSAV+
Sbjct: 952 IIEGIEWQ----DVDPDLLTSKYFSDVFMFVSSAMATRDRSSESARFEVLNSEYSAVLLG 1007
Query: 660 SENSTIHIDAVIDPLSPTGQKLSSLLRVLQRYAQPSMRIVLNPMSSLVDIPLKNYYRYVV 719
+EN+TIHIDAVIDPLSPTGQKL+SLL+VLQ++ Q SMRIVLNPMSSLVDIPLKNYYRYV+
Sbjct: 1008 NENATIHIDAVIDPLSPTGQKLASLLQVLQKHVQTSMRIVLNPMSSLVDIPLKNYYRYVL 1067
Query: 720 PTMDDFSNTDYSISGPKAFFANMPLSKTLTMNLDVPEPWLVEPVIAVHDLDNILLEKLGD 779
P DD+S+T + + GPKAFFANMPLSKTLTMNLDVPEPWLVEPVIA+HDLDNILLE LGD
Sbjct: 1068 PNTDDYSSTGFDVDGPKAFFANMPLSKTLTMNLDVPEPWLVEPVIAIHDLDNILLENLGD 1127
Query: 780 TRTLQAVFELEALVLTGHCSEKDHEPPQGLQLILGTKSTPHLVDTLVMANLGYWQMKVSP 839
T TLQAVFE+E+LVLTGHC+EKDHE P+GLQLILGTK+ PHLVDTLVMANLGYWQMKVSP
Sbjct: 1128 TTTLQAVFEVESLVLTGHCAEKDHEAPRGLQLILGTKNRPHLVDTLVMANLGYWQMKVSP 1187
Query: 840 GVWYLQLAPGRSSELYVLKEDGNVNEDRSLSKRITINDLRGKVVHMEVVKKKGKENEKLL 899
GVWYLQLAPGRSSELY LK + ++D+S KRITI+DLRGKVVH+EVVK+KGKE+EKLL
Sbjct: 1188 GVWYLQLAPGRSSELYALKGGNDGSQDQSSLKRITIDDLRGKVVHLEVVKRKGKEHEKLL 1247
Query: 900 VSSDEDSHSQ--AEGHWNSNFLKWASGFIGGSEQSKKEKAAVDHGKVERHGKTINIFSIA 957
V SD D Q EG WNSNFLKWASGF+GG +QS K +H K R GKTINIFSIA
Sbjct: 1248 VPSDGDDAVQQNKEGSWNSNFLKWASGFVGGRQQSMKGGPDKEHEKGGRQGKTINIFSIA 1307
Query: 958 SGHLYERFLKIMILSVLKNTCRPVKFWFIKNYLSPQFKDVIPHMAQEYGFEYELITYKWP 1017
SGHLYERFLKIMILSVLKNT RPVKFWFIKNYLSPQFKDVIPHMAQEY FEYELITYKWP
Sbjct: 1308 SGHLYERFLKIMILSVLKNTNRPVKFWFIKNYLSPQFKDVIPHMAQEYNFEYELITYKWP 1367
Query: 1018 TWLHKQKEKQRIIWAYKILFLDVIFPLSLEKVIFVDADQVVRADMGELYDMDIKGRPLAY 1077
+WLHKQKEKQRIIWAYKILFLDVIFPLSLEKVIFVDADQ++R DMGELYDMDIKGRPLAY
Sbjct: 1368 SWLHKQKEKQRIIWAYKILFLDVIFPLSLEKVIFVDADQIIRTDMGELYDMDIKGRPLAY 1427
Query: 1078 TPFCDNNKDMDGYRFWRQGFWKDHLRGRPYHISALYVVDLKRFRETAAGDNLRVFYETLS 1137
TPFCDNN++MDGY+FW+QGFWK+HLRGRPYHISALYVVDL +FRETAAGDNLRVFYETLS
Sbjct: 1428 TPFCDNNREMDGYKFWKQGFWKEHLRGRPYHISALYVVDLVKFRETAAGDNLRVFYETLS 1487
Query: 1138 KDPNSLANLDQDLPNYAQHTVPIFSLPQEWLWCESWCGNATKSKAKTIDLCNNPMTKEPK 1197
KDPNSL+NLDQDLPNYAQHTVPIFSLPQEWLWCESWCGNATK+KA+TIDLCNNPMTKEPK
Sbjct: 1488 KDPNSLSNLDQDLPNYAQHTVPIFSLPQEWLWCESWCGNATKAKARTIDLCNNPMTKEPK 1547
Query: 1198 LQGARRIVSEWPDLDSEARQFTAKILGEEVVTLE---TPAPVGPMQTSGSDAS--SKGDL 1252
LQGARRIV+EWPDLD EAR+FTAKILGE+V E PA P +D S ++ DL
Sbjct: 1548 LQGARRIVTEWPDLDLEARKFTAKILGEDVELNEPVAAPATDKPNPLPSNDISEDTEQDL 1607
Query: 1253 ESKAEL 1258
ESKAEL
Sbjct: 1608 ESKAEL 1613
>sp|Q9JLA3|UGGG1_RAT UDP-glucose:glycoprotein glucosyltransferase 1 OS=Rattus norvegicus
GN=Uggt1 PE=1 SV=2
Length = 1551
Score = 738 bits (1905), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 461/1276 (36%), Positives = 701/1276 (54%), Gaps = 156/1276 (12%)
Query: 1 MQEISQNFPSVVSSLSRMKLNDSIKDEIVANQRY------MPPGKSLMALNGALINIEDI 54
M++ISQNFP+ ++++ ++ ++ E+ NQ+Y + PG S + +NG I+++
Sbjct: 346 MKDISQNFPTKARAITKTAVSAQLRAEVEENQKYFKGTIGLQPGDSALFINGLHIDLDTQ 405
Query: 55 DLYLLIDLVHQELSLADQFSKLKIPRTITQKLLS-TVPPAESSMFRVDFRSTHVQYLNNL 113
D++ L D + E + + +L I +L + P+E+ + VD RS + ++NNL
Sbjct: 406 DIFSLFDTLRNEARVMEGLHRLGIEGLSLHNILKLNIQPSETD-YAVDIRSPAISWVNNL 464
Query: 114 EEDAMYKRWRSNINEILMPVFPGQLRYIRKNLFHAVYVLDPATVCGLEVIDMIMSLYENH 173
E D+ Y W S++ E+L P FPG +R IRKNL + V+++DP E++ + NH
Sbjct: 465 EVDSRYNSWPSSLQELLRPTFPGVIRQIRKNLHNMVFIVDPVHETTAELVSIAEMFLSNH 524
Query: 174 FPLRFGVILYSSKFIKSIEINGGELHSPVAEDDSPVNEDISSLIIRLFLFIKESHGTQTA 233
PLR G I +N E D +D ++R + ++ + A
Sbjct: 525 IPLRIGFIFV---------VNDSE--------DVDGMQDAGVAVLRAYNYVGQEVDGYHA 567
Query: 234 FQFLSNV-NRLRMESADSADDDALEIHHVEGAFVETILPKAKTPPQDMLLKLEKEKTFMD 292
FQ L+ + N++R + +++ H V ++L K K P ++ L + +
Sbjct: 568 FQTLTQIYNKVRT-------GEKVKVEH-----VVSVLEK-KYPYVEVNSILGIDSAYDQ 614
Query: 293 QSQESSMFVFKLGLTKLKCCLLMNGLVSESS-------EEALLNAMNDELQRIQEQVYYG 345
+E+ + + G+ L +L NG+ E E ++ + + Q VY G
Sbjct: 615 NRKEARGYYEQTGVGPLP-VVLFNGMPFEKEQLDPDELETITMHKILETTTFFQRAVYLG 673
Query: 346 NINSYTDVLEKVLSESG-INRYNPQIITDAKVKPKFISLASS----------FLGRETEL 394
++ DV+E ++++ + R N +I+T K +++ L +S F ++
Sbjct: 674 ELSHDQDVVEYIMNQPNVVPRINSRILT---AKREYLDLTASNNFYVDDFARFSALDSRG 730
Query: 395 K------DINYL-----HSPETVDD--VKPVTHLLAVDVTSKKGMKLLHEGIRFLIGGSN 441
K +NYL S E DD ++PVT + D S G +LL++ I+ SN
Sbjct: 731 KTAAIANSMNYLTKKGMSSKEIYDDSFIRPVTFWIVGDFDSPSGRQLLYDAIKHQ-KTSN 789
Query: 442 GARLGVLFSASREADLPSIIFVKAFEITASTYSHKKKVLEFLDQLCSFYERTYLLASSAT 501
R+ ++ + SRE S +A + + T
Sbjct: 790 NVRISMINNPSREISDSSTPVSRAI---------------------------WAALQTQT 822
Query: 502 ADSTQAFIDKVCEFAEANGLSSKVYRASLPEYSKGKVRKQLNKVV-------------QF 548
++S + FI K+ + A L++ V + E+S G + L K V +
Sbjct: 823 SNSAKNFITKMVKEETAEALAAGV---DIGEFSVGGMDVSLFKEVFESSRMDFILSHALY 879
Query: 549 LHRQLGVESGANAVITNGRVTFPIDESTFLSH-DLSLLESVEFKHRIKHIWEIIEEVNWQ 607
L ++ G VI+NGR+ P+++S + D LLE++ K + I I+++ +
Sbjct: 880 CRDVLKLKKGQRVVISNGRIIGPLEDSELFNQDDFHLLENIILKTSGQKIKSHIQQLRVE 939
Query: 608 ETYPDIDPDMLTSKFVSDIILFVTSSMAMRDRSSESARFEILSAEYSAVVFNSENSTIHI 667
E SD+++ V + ++ + + ++ ++SA+ + +
Sbjct: 940 E------------DVASDLVMKVDALLSAQPKGEARIEYQFFEDKHSAIKLKPKEGETYY 987
Query: 668 D--AVIDPLSPTGQKLSSLLRVLQRYAQPSMRIVLNPMSSLVDIPLKNYYRYVVPTMDDF 725
D AV+DP++ Q+L+ LL VL + S+R+ +N S L D+PLK++YRYV+ F
Sbjct: 988 DVVAVVDPVTREAQRLAPLLLVLAQLINMSLRVFMNCQSKLSDMPLKSFYRYVLEPEISF 1047
Query: 726 SNTDYSISGPKAFFANMPLSKTLTMNLDVPEPWLVEPVIAVHDLDNILLEKLGDTRTLQA 785
+ + GP A F +MP S T+NL+ PE W+VE V +DLDNI LE++ + A
Sbjct: 1048 TADNSFAKGPIAKFLDMPQSPLFTLNLNTPESWMVESVRTPYDLDNIYLEEVDS--IVAA 1105
Query: 786 VFELEALVLTGHCSE-KDHEPPQGLQLILGTKSTPHLVDTLVMANLGYWQMKVSPGVWYL 844
+ELE L+L GHC + +PP+GLQ LGT + P VDT+VMANLGY+Q+K +PG W L
Sbjct: 1106 EYELEYLLLEGHCYDITTGQPPRGLQFTLGTSANPTTVDTIVMANLGYFQLKANPGAWIL 1165
Query: 845 QLAPGRSSELY-VLKEDGNVNEDRSLSKRITINDLRGKVVHMEVVKKKGKENEKLLVSSD 903
+L GRS ++Y + DG + + + +N+ + K++ ++V KK NE LL
Sbjct: 1166 RLRKGRSDDIYRIYSHDGTDSPPDANDVVVILNNFKSKIIKVKVQKKADMANEDLLSDGT 1225
Query: 904 EDSHSQAEGHWNSNFLKWASGFIG-GSEQSKKEKAAVDHGKVERHGKTINIFSIASGHLY 962
++ S G W+S KW GF G +E+ K++K + INIFS+ASGHLY
Sbjct: 1226 NENES---GFWDS--FKW--GFSGQKTEEVKQDKDDI-----------INIFSVASGHLY 1267
Query: 963 ERFLKIMILSVLKNTCRPVKFWFIKNYLSPQFKDVIPHMAQEYGFEYELITYKWPTWLHK 1022
ERFL+IM+LSVLKNT PVKFWF+KNYLSP FK+ IP+MA++Y F+YEL+ YKWP WLH+
Sbjct: 1268 ERFLRIMMLSVLKNTKTPVKFWFLKNYLSPTFKEFIPYMAKKYNFQYELVQYKWPRWLHQ 1327
Query: 1023 QKEKQRIIWAYKILFLDVIFPLSLEKVIFVDADQVVRADMGELYDMDIKGRPLAYTPFCD 1082
Q EKQRIIW YKILFLDV+FPL ++K +FVDADQ+VR D+ EL D ++ G P YTPFCD
Sbjct: 1328 QTEKQRIIWGYKILFLDVLFPLVVDKFLFVDADQIVRTDLKELRDFNLDGAPYGYTPFCD 1387
Query: 1083 NNKDMDGYRFWRQGFWKDHLRGRPYHISALYVVDLKRFRETAAGDNLRVFYETLSKDPNS 1142
+ ++MDGYRFW+ G+W HL GR YHISALYVVDLK+FR+ AAGD LR Y+ LS+DPNS
Sbjct: 1388 SRREMDGYRFWKSGYWASHLAGRKYHISALYVVDLKKFRKIAAGDRLRGQYQGLSQDPNS 1447
Query: 1143 LANLDQDLPNYAQHTVPIFSLPQEWLWCESWCGNATKSKAKTIDLCNNPMTKEPKLQGAR 1202
L+NLDQDLPN H VPI SLPQEWLWCE+WC +A+K +AKTIDLCNNPMTKEPKL+ A
Sbjct: 1448 LSNLDQDLPNNMIHQVPIKSLPQEWLWCETWCDDASKKRAKTIDLCNNPMTKEPKLEAAV 1507
Query: 1203 RIVSEWPDLDSEARQF 1218
RIV EW D D E +Q
Sbjct: 1508 RIVPEWQDYDQEIKQL 1523
>sp|Q6P5E4|UGGG1_MOUSE UDP-glucose:glycoprotein glucosyltransferase 1 OS=Mus musculus
GN=Uggt1 PE=1 SV=4
Length = 1551
Score = 738 bits (1905), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 465/1279 (36%), Positives = 712/1279 (55%), Gaps = 133/1279 (10%)
Query: 1 MQEISQNFPSVVSSLSRMKLNDSIKDEIVANQRY------MPPGKSLMALNGALINIEDI 54
M++ISQNFP+ ++++ ++ ++ E+ NQ+Y + PG S + +NG I+++
Sbjct: 346 MKDISQNFPTKARAITKTAVSAQLRAEVEENQKYFKGTIGLQPGDSALFINGLHIDLDTQ 405
Query: 55 DLYLLIDLVHQELSLADQFSKLKIPRTITQKLLS-TVPPAESSMFRVDFRSTHVQYLNNL 113
D++ L D + E + + +L I +L + P+E+ + VD RS + ++NNL
Sbjct: 406 DIFSLFDTLRNEARVMEGLHRLGIEGLSLHNILKLNIQPSETD-YAVDIRSPAISWVNNL 464
Query: 114 EEDAMYKRWRSNINEILMPVFPGQLRYIRKNLFHAVYVLDPATVCGLEVIDMIMSLYENH 173
E D+ Y W S++ E+L P FPG +R IRKNL + V+++DP E+I + NH
Sbjct: 465 EVDSRYNSWPSSLQELLRPTFPGVIRQIRKNLHNMVFIIDPVHETTAELISIAEMFLSNH 524
Query: 174 FPLRFGVILYSSKFIKSIEINGGELHSPVAEDDSPVNEDISSLIIRLFLFIKESHGTQTA 233
PLR G I +N E D +D ++R + ++ + A
Sbjct: 525 IPLRIGFIFV---------VNDSE--------DVDGMQDAGVAVLRAYNYVAQEVDGYHA 567
Query: 234 FQFLSNV-NRLRMESADSADDDALEIHHVEGAFVETILPKAKTPPQDMLLKLEKEKTFMD 292
FQ L+ + N++R + +++ HV ++L K K P ++ L + +
Sbjct: 568 FQTLTQIYNKVRT-------GETVKVEHVV-----SVLEK-KYPYVEVNSILGIDSAYDQ 614
Query: 293 QSQESSMFVFKLGLTKLKCCLLMNGLVSESS-------EEALLNAMNDELQRIQEQVYYG 345
+E+ + + G+ L +L NG+ E E ++ + + Q VY G
Sbjct: 615 NRKEARGYYEQTGVGPLPV-VLFNGMPFEKEQLDPDELETITMHKILETTTFFQRAVYLG 673
Query: 346 NINSYTDVLEKVLSESGI-NRYNPQIITDAKVKPKFISLASS----------FLGRETEL 394
++ DV+E ++++ + R N +I+T K +++ L +S F ++
Sbjct: 674 ELSHDQDVVEYIMNQPNVVPRINSRILT---AKREYLDLTASNNFYVDDFARFSALDSRG 730
Query: 395 K------DINYL-----HSPETVDD--VKPVTHLLAVDVTSKKGMKLLHEGIRFLIGGSN 441
K +NYL S E DD ++PVT + D S G +LL++ I+ SN
Sbjct: 731 KTAAIANSMNYLTKKGMSSKEIYDDSFIRPVTFWIVGDFDSPSGRQLLYDAIKHQ-KTSN 789
Query: 442 GARLGVLFSASRE-ADLPSIIFVKAFEITASTYSHKKKVLEFLDQLCSFYERTYLLASSA 500
R+ ++ + S+E +D + IF + + S K F+ ++ A
Sbjct: 790 NVRISMINNPSQEISDSSTPIFRAIWAALQTQASSSAK--NFITKM----------AKEE 837
Query: 501 TADSTQAFIDKVCEFAEANGLSSKVYRASLPEYSKGKVRKQLNKVVQFLHRQLGVESGAN 560
TA++ A +D + EF+ G+ +++ + ++ L+ + + L ++ G
Sbjct: 838 TAEALAAGVD-IAEFS-VGGMDVSLFKEV---FESSRMDFILSHAL-YCRDVLKLKKGQR 891
Query: 561 AVITNGRVTFPI-DESTFLSHDLSLLESVEFKHRIKHIWEIIEEVNWQETYPDIDPDMLT 619
VI+NGR+ P+ D F D LLE++ K + I I+++ +E
Sbjct: 892 VVISNGRIIGPLEDNELFNQDDFHLLENIILKTSGQKIKSHIQQLRVEE----------- 940
Query: 620 SKFVSDIILFVTSSMAMRDRSSESARFEILSAEYSAVVFNSENSTIHID--AVIDPLSPT 677
SD+++ V + ++ + + ++ ++SA+ + + D AV+DP++
Sbjct: 941 -DVASDLVMKVDALLSAQPKGEARIEYQFFEDKHSAIKLKPKEGETYYDVVAVVDPVTRE 999
Query: 678 GQKLSSLLRVLQRYAQPSMRIVLNPMSSLVDIPLKNYYRYVVPTMDDFSNTDYSISGPKA 737
Q+L+ LL VL + ++R+ +N S L D+PLK++YRYV+ F+ GP A
Sbjct: 1000 AQRLAPLLLVLTQLINMNLRVFMNCQSKLSDMPLKSFYRYVLEPEISFTADSSFAKGPIA 1059
Query: 738 FFANMPLSKTLTMNLDVPEPWLVEPVIAVHDLDNILLEKLGDTRTLQAVFELEALVLTGH 797
F +MP S T+NL+ PE W+VE V +DLDNI LE++ + A +ELE L+L GH
Sbjct: 1060 KFLDMPQSPLFTLNLNTPESWMVESVRTPYDLDNIYLEEVDSI--VAAEYELEYLLLEGH 1117
Query: 798 CSE-KDHEPPQGLQLILGTKSTPHLVDTLVMANLGYWQMKVSPGVWYLQLAPGRSSELY- 855
C + +PP+GLQ LGT + P +VDT+VMANLGY+Q+K +PG W L+L GRS ++Y
Sbjct: 1118 CYDITTGQPPRGLQFTLGTSANPTIVDTIVMANLGYFQLKANPGAWILRLRKGRSDDIYR 1177
Query: 856 VLKEDGNVNEDRSLSKRITINDLRGKVVHMEVVKKKGKENEKLLVSSDEDSHSQAEGHWN 915
+ DG + + + +N+ + K++ ++V KK NE LL ++ S G W+
Sbjct: 1178 IYSHDGTDSPPDANDVVVILNNFKSKIIKVKVQKKADMANEDLLSDGTNENES---GFWD 1234
Query: 916 SNFLKWASGFIG-GSEQSKKEKAAVDHGKVERHGKTINIFSIASGHLYERFLKIMILSVL 974
S KW GF G +E+ K++K + INIFS+ASGHLYERFL+IM+LSVL
Sbjct: 1235 S--FKW--GFSGQKAEEVKQDKDDI-----------INIFSVASGHLYERFLRIMMLSVL 1279
Query: 975 KNTCRPVKFWFIKNYLSPQFKDVIPHMAQEYGFEYELITYKWPTWLHKQKEKQRIIWAYK 1034
KNT PVKFWF+KNYLSP FK+ IP+MA++Y F+YEL+ YKWP WLH+Q EKQRIIW YK
Sbjct: 1280 KNTKTPVKFWFLKNYLSPTFKEFIPYMAKKYNFQYELVQYKWPRWLHQQTEKQRIIWGYK 1339
Query: 1035 ILFLDVIFPLSLEKVIFVDADQVVRADMGELYDMDIKGRPLAYTPFCDNNKDMDGYRFWR 1094
ILFLDV+FPL ++K +FVDADQ+VR D+ EL D ++ G P YTPFCD+ ++MDGYRFW+
Sbjct: 1340 ILFLDVLFPLVVDKFLFVDADQIVRTDLKELRDFNLDGAPYGYTPFCDSRREMDGYRFWK 1399
Query: 1095 QGFWKDHLRGRPYHISALYVVDLKRFRETAAGDNLRVFYETLSKDPNSLANLDQDLPNYA 1154
G+W HL GR YHISALYVVDLK+FR+ AAGD LR Y+ LS+DPNSL+NLDQDLPN
Sbjct: 1400 SGYWASHLAGRKYHISALYVVDLKKFRKIAAGDRLRGQYQGLSQDPNSLSNLDQDLPNNM 1459
Query: 1155 QHTVPIFSLPQEWLWCESWCGNATKSKAKTIDLCNNPMTKEPKLQGARRIVSEWPDLDSE 1214
H VPI SLPQEWLWCE+WC +A+K +AKTIDLCNNPMTKEPKL+ A RIV EW D D E
Sbjct: 1460 IHQVPIKSLPQEWLWCETWCDDASKKRAKTIDLCNNPMTKEPKLEAAVRIVPEWQDYDQE 1519
Query: 1215 ARQFTAKILGE-EVVTLET 1232
+Q E E+ TL T
Sbjct: 1520 IKQLQTLFQEEKELGTLHT 1538
>sp|Q9NYU2|UGGG1_HUMAN UDP-glucose:glycoprotein glucosyltransferase 1 OS=Homo sapiens
GN=UGGT1 PE=1 SV=3
Length = 1555
Score = 729 bits (1883), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 455/1270 (35%), Positives = 706/1270 (55%), Gaps = 128/1270 (10%)
Query: 1 MQEISQNFPSVVSSLSRMKLNDSIKDEIVANQRY------MPPGKSLMALNGALINIEDI 54
M+++SQNFP+ ++++ ++ ++ E+ NQ+Y + PG S + +NG ++++
Sbjct: 346 MKDLSQNFPTKARAITKTAVSSELRTEVEENQKYFKGTLGLQPGDSALFINGLHMDLDTQ 405
Query: 55 DLYLLIDLVHQELSLADQFSKLKIPRTITQKLLS-TVPPAESSMFRVDFRSTHVQYLNNL 113
D++ L D++ E + + +L I +L + P+E+ + VD RS + ++NNL
Sbjct: 406 DIFSLFDVLRNEARVMEGLHRLGIEGLSLHNVLKLNIQPSEAD-YAVDIRSPAISWVNNL 464
Query: 114 EEDAMYKRWRSNINEILMPVFPGQLRYIRKNLFHAVYVLDPATVCGLEVIDMIMSLYENH 173
E D+ Y W S++ E+L P FPG +R IRKNL + V+++DPA E+++ NH
Sbjct: 465 EVDSRYNSWPSSLQELLRPTFPGVIRQIRKNLHNMVFIVDPAHETTAELMNTAEMFLSNH 524
Query: 174 FPLRFGVILYSSKFIKSIEINGGELHSPVAEDDSPVN--EDISSLIIRLFLFIKESHGTQ 231
PLR G I V D V+ +D ++R + ++ +
Sbjct: 525 IPLRIGFIF-------------------VVNDSEDVDGMQDAGVAVLRAYNYVAQEVDDY 565
Query: 232 TAFQFLSNV-NRLRMESADSADDDALEIHHVEGAFVETILPKAKTPPQDMLLKLEKEKTF 290
AFQ L+++ N++R + +++ HV ++L K K P ++ L + +
Sbjct: 566 HAFQTLTHIYNKVRT-------GEKVKVEHVV-----SVLEK-KYPYVEVNSILGIDSAY 612
Query: 291 MDQSQESSMFVFKLGLTKLKCCLLMNGLVSESS-------EEALLNAMNDELQRIQEQVY 343
+E+ + + G+ L +L NG+ E E ++ + + Q VY
Sbjct: 613 DRNRKEARGYYEQTGVGPLPV-VLFNGMPFEREQLDPDELETITMHKILETTTFFQRAVY 671
Query: 344 YGNINSYTDVLEKVLSESGI-NRYNPQIITDAKVKPKFISLASSFL------------GR 390
G + DV+E ++++ + R N +I+T + + + F+ G+
Sbjct: 672 LGELPHDQDVVEYIMNQPNVVPRINSRILTAERDYLDLTASNNFFVDDYARFTILDSQGK 731
Query: 391 ETELKD-INYL-----HSPETVDD--VKPVTHLLAVDVTSKKGMKLLHEGIRFLIGGSNG 442
+ + +NYL S E DD ++PVT + D S G +LL++ I+ SN
Sbjct: 732 TAAVANSMNYLTKKGMSSKEIYDDSFIRPVTFWIVGDFDSPSGRQLLYDAIKHQ-KSSNN 790
Query: 443 ARLGVLFSASREADLPSIIFVKAFEITASTYSHKKKVLEFLDQLCSFYERTYLLASSATA 502
R+ ++ + ++E + +A T + F+ ++ + A
Sbjct: 791 VRISMINNPAKEISYENTQISRAIWAALQTQT-SNAAKNFITKM-----------AKEGA 838
Query: 503 DSTQAFIDKVCEFAEANGLSSKVYRASLPEYSKGKVRKQLNKVVQFLHRQLGVESGANAV 562
A + EF+ G+ +++ + K+ L+ V + L ++ G AV
Sbjct: 839 AEALAAGADIAEFS-VGGMDFSLFKEV---FESSKMDFILSHAV-YCRDVLKLKKGQRAV 893
Query: 563 ITNGRVTFPIDESTFLSHD-LSLLESVEFKHRIKHIWEIIEEVNWQETYPDIDPDMLTSK 621
I+NGR+ P+++S + D LLE++ K + I I+++ +E
Sbjct: 894 ISNGRIIGPLEDSELFNQDDFHLLENIILKTSGQKIKSHIQQLRVEE------------D 941
Query: 622 FVSDIILFVTSSMAMRDRSSESARFEILSAEYSAVVFNSENSTIHID--AVIDPLSPTGQ 679
SD+++ V + ++ + + ++ +SA+ + + D AV+DP++ Q
Sbjct: 942 VASDLVMKVDALLSAQPKGDPRIEYQFFEDRHSAIKLRPKEGETYFDVVAVVDPVTREAQ 1001
Query: 680 KLSSLLRVLQRYAQPSMRIVLNPMSSLVDIPLKNYYRYVVPTMDDFSNTDYSISGPKAFF 739
+L+ LL VL + ++R+ +N S L D+PLK++YRYV+ F++ + GP A F
Sbjct: 1002 RLAPLLLVLAQLINMNLRVFMNCQSKLSDMPLKSFYRYVLEPEISFTSDNSFAKGPIAKF 1061
Query: 740 ANMPLSKTLTMNLDVPEPWLVEPVIAVHDLDNILLEKLGDTRTLQAVFELEALVLTGHCS 799
+MP S T+NL+ PE W+VE V +DLDNI LE++ + A +ELE L+L GHC
Sbjct: 1062 LDMPQSPLFTLNLNTPESWMVESVRTPYDLDNIYLEEVDSV--VAAEYELEYLLLEGHCY 1119
Query: 800 E-KDHEPPQGLQLILGTKSTPHLVDTLVMANLGYWQMKVSPGVWYLQLAPGRSSELY-VL 857
+ +PP+GLQ LGT + P +VDT+VMANLGY+Q+K +PG W L+L GRS ++Y +
Sbjct: 1120 DITTGQPPRGLQFTLGTSANPVIVDTIVMANLGYFQLKANPGAWILRLRKGRSEDIYRIY 1179
Query: 858 KEDGNVNEDRSLSKRITINDLRGKVVHMEVVKKKGKENEKLLVSSDEDSHSQAEGHWNSN 917
DG + + I +N+ + K++ ++V KK NE LL SD S +++ G W+S
Sbjct: 1180 SHDGTDSPPDADEVVIVLNNFKSKIIKVKVQKKADMVNEDLL--SDGTSENES-GFWDS- 1235
Query: 918 FLKWASGFIG-GSEQSKKEKAAVDHGKVERHGKTINIFSIASGHLYERFLKIMILSVLKN 976
KW GF G +E+ K++K + INIFS+ASGHLYERFL+IM+LSVLKN
Sbjct: 1236 -FKW--GFTGQKTEEVKQDKDDI-----------INIFSVASGHLYERFLRIMMLSVLKN 1281
Query: 977 TCRPVKFWFIKNYLSPQFKDVIPHMAQEYGFEYELITYKWPTWLHKQKEKQRIIWAYKIL 1036
T PVKFWF+KNYLSP FK+ IP+MA EY F+YEL+ YKWP WLH+Q EKQRIIW YKIL
Sbjct: 1282 TKTPVKFWFLKNYLSPTFKEFIPYMANEYNFQYELVQYKWPRWLHQQTEKQRIIWGYKIL 1341
Query: 1037 FLDVIFPLSLEKVIFVDADQVVRADMGELYDMDIKGRPLAYTPFCDNNKDMDGYRFWRQG 1096
FLDV+FPL ++K +FVDADQ+VR D+ EL D ++ G P YTPFCD+ ++MDGYRFW+ G
Sbjct: 1342 FLDVLFPLVVDKFLFVDADQIVRTDLKELRDFNLDGAPYGYTPFCDSRREMDGYRFWKSG 1401
Query: 1097 FWKDHLRGRPYHISALYVVDLKRFRETAAGDNLRVFYETLSKDPNSLANLDQDLPNYAQH 1156
+W HL GR YHISALYVVDLK+FR+ AAGD LR Y+ LS+DPNSL+NLDQDLPN H
Sbjct: 1402 YWASHLAGRKYHISALYVVDLKKFRKIAAGDRLRGQYQGLSQDPNSLSNLDQDLPNNMIH 1461
Query: 1157 TVPIFSLPQEWLWCESWCGNATKSKAKTIDLCNNPMTKEPKLQGARRIVSEWPDLDSEAR 1216
VPI SLPQEWLWCE+WC +A+K +AKTIDLCNNPMTKEPKL+ A RIV EW D D E +
Sbjct: 1462 QVPIKSLPQEWLWCETWCDDASKKRAKTIDLCNNPMTKEPKLEAAVRIVPEWQDYDQEIK 1521
Query: 1217 QFTAKILGEE 1226
Q + E+
Sbjct: 1522 QLQIRFQKEK 1531
>sp|Q09332|UGGG_DROME UDP-glucose:glycoprotein glucosyltransferase OS=Drosophila
melanogaster GN=Ugt PE=1 SV=2
Length = 1548
Score = 703 bits (1814), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 455/1265 (35%), Positives = 679/1265 (53%), Gaps = 122/1265 (9%)
Query: 1 MQEISQNFPSVVSSLSRMKLNDSIKDEIVANQRYM-------PPGKSLMALNGALINIED 53
+Q + NFP + +L K+ D ++ E+ N PP +L +NG + +
Sbjct: 334 LQYTAHNFPMLARTLLAHKVTDGLRAEVKHNTEAFGRSLNVAPPDGALF-INGLFFDADT 392
Query: 54 IDLYLLIDLVHQELSLADQFSKLKIPRTITQKLLSTVPPAESSM-FRVDFRSTHVQYLNN 112
+DLY LI+ + E+ + + + ++ LL+ A S F +D R T VQ++N+
Sbjct: 393 MDLYSLIETLRSEMRVLESLHSNNVRGSLASSLLALDLTASSKKEFAIDIRDTAVQWVND 452
Query: 113 LEEDAMYKRWRSNINEILMPVFPGQLRYIRKNLFHAVYVLDPATVCGLEVIDMIMSLYEN 172
+E D Y+RW S++ ++L P FPG LR IRKN+F+ V V+D VI + S +
Sbjct: 453 IENDVQYRRWPSSVMDLLRPTFPGMLRNIRKNVFNLVLVVDALQPTARSVIKLSESFVIH 512
Query: 173 HFPLRFGVILYSSKFIKSIEINGGELHSPVAEDDSPVNEDISSLIIRLFLFIKESHGTQT 232
P+R G++ + + N L VA I + ++ + +
Sbjct: 513 QAPIRLGLVF------DARDANEDNLADYVA-------------ITCAYNYVSQKKDARA 553
Query: 233 AFQFLSNVNRLRMESADSADDDALEIHHVEGAFVETILPKAKTPPQDMLLKLEKEKTFMD 292
A FL+++ E+ D + + F KA+ LE++ T+
Sbjct: 554 ALSFLTDIYAAVGETKVVTKKDI--VKQLTKEFTSLSFAKAEE-------FLEEDSTYDY 604
Query: 293 QSQESSMFVFKLGLT-KLKCCLLMNGLVSESS--------EEALLNAMNDELQRIQEQVY 343
+ ++ F+ +LG K + L+NG+ S+ EEA+ + +Q+ VY
Sbjct: 605 GRELAAEFIQRLGFGDKGQPQALLNGVPMPSNVVTADSDFEEAIFTEIMTHTSNLQKAVY 664
Query: 344 YGNINSYTDVLEKVLSESGI-NRYNPQIITDAKVKPKFISLASSFLGRETELKDINYLHS 402
G + ++ ++++ + R N +I++ VK +L DIN + +
Sbjct: 665 KGELTDNDVAIDYLMNQPHVMPRLNQRILSQEDVK---------YL-------DINGV-A 707
Query: 403 PETVDDVKPVTHLLAVDVTSKKGMKLLHEGIRF---LIGGSNGARLGVLFSASREADLPS 459
+ + +V + L D+T+ L + G + LIG ++ L + A E D
Sbjct: 708 YKNLGNVGVLNRLSNRDMTATLMDNLKYFGGKKSTELIGRASLQFLTIWVFADLETDQGR 767
Query: 460 IIFVKAFEITASTYSHKKKVLEFLDQLCSFYERTY---LLASSATADSTQAFIDKVCEFA 516
+ A + S S + + + + R + A+ + TQA ++V ++
Sbjct: 768 DLLTHALDYVQSGESVRVAFIPNTESSSASSRRNLNRLVWAAMQSLPPTQA-TEQVLKWL 826
Query: 517 EANGLSSKVYRASLPEYSKGKVRKQLNKVVQFLHRQLGVESGANAVITNGRVTFPI--DE 574
+ K+ + E G L + + R LG+ VI NGR+ P+ DE
Sbjct: 827 KKP--KEKIEIPTQLEDILGSTELHLKMLRVYSQRVLGLNKSQRLVIGNGRLYGPLSSDE 884
Query: 575 STFLSHDLSLL---ESVEFKHRIKHIWEIIEEVNWQETYPDIDPDMLTSKFVSDIILFVT 631
S F S D +LL S+++ +++ + +E+ D++ +F SD +L +
Sbjct: 885 S-FDSADFALLARFSSLQYSDKVRQVL--------KESAQDVN-----EEFNSDTLLKLY 930
Query: 632 SSMAMRDRSSESARFEI---LSAEYSAVVFNSENSTI-HID--AVIDPLSPTGQKLSSLL 685
+S+ R + RF++ L ++S V + + H D AV+DP S QKL+ +L
Sbjct: 931 ASLLPRQTKT---RFKLPTDLKTDHSVVKLPPKQENLPHFDVAAVLDPASRAAQKLTPIL 987
Query: 686 RVLQRYAQPSMRIVLNPMSSLVDIPLKNYYRYVVPTMDDFSNTDYSISGPKAFFANMPLS 745
+L++ + + L P+ D+P+KN+YRYVV F GP A F+ +P +
Sbjct: 988 ILLRQVLNCQLNLYLIPVPQHSDMPVKNFYRYVVEPEVQFEANGGRSDGPLAKFSGLPAN 1047
Query: 746 KTLTMNLDVPEPWLVEPVIAVHDLDNILLEKLGDTRTLQAVFELEALVLTGHCSEK-DHE 804
LT L VPE WLVE V AV+DLDNI L +G + + F+LE L+L GHC +
Sbjct: 1048 PLLTQQLQVPENWLVEAVRAVYDLDNIKLTDIGGP--VHSEFDLEYLLLEGHCFDAASGA 1105
Query: 805 PPQGLQLILGTKSTPHLVDTLVMANLGYWQMKVSPGVWYLQLAPGRSSELYVLKEDGNVN 864
PP+GLQL+LGT+S P LVDT+VMANLGY+Q+K +PG W L+L G+S+++Y + N
Sbjct: 1106 PPRGLQLVLGTQSQPTLVDTIVMANLGYFQLKANPGAWSLRLREGKSADIYAISHIEGTN 1165
Query: 865 EDRSLSK---RITINDLRGKVVHMEVVKKKGKENEKLLVSSDEDSHSQAEGHWNSNFLKW 921
S ++ I LR VV + V KK G + +LL SD++ + G WNS
Sbjct: 1166 THHSAGSSEVQVLITSLRSHVVKLRVSKKPGMQQAELL--SDDNEQAAQSGMWNS----I 1219
Query: 922 ASGFIGGSEQSKKEKAAVDHGKVERHGKTINIFSIASGHLYERFLKIMILSVLKNTCRPV 981
AS F GGS +AA D +TINIFS+ASGHLYER L+IM++S+LK+T PV
Sbjct: 1220 ASSFGGGS----ANQAATDEDT-----ETINIFSVASGHLYERLLRIMMVSLLKHTKSPV 1270
Query: 982 KFWFIKNYLSPQFKDVIPHMAQEYGFEYELITYKWPTWLHKQKEKQRIIWAYKILFLDVI 1041
KFWF+KNYLSPQF D +PHMA EY F+YEL+ YKWP WLH+Q EKQR IW YKILFLDV+
Sbjct: 1271 KFWFLKNYLSPQFTDFLPHMASEYNFQYELVQYKWPRWLHQQTEKQRTIWGYKILFLDVL 1330
Query: 1042 FPLSLEKVIFVDADQVVRADMGELYDMDIKGRPLAYTPFCDNNKDMDGYRFWRQGFWKDH 1101
FPL++ K+IFVDAD +VR D+ ELYDMD+ G P AYTPFCD+ K+M+G+RFW+QG+W+ H
Sbjct: 1331 FPLNVRKIIFVDADAIVRTDIKELYDMDLGGAPYAYTPFCDSRKEMEGFRFWKQGYWRSH 1390
Query: 1102 LRGRPYHISALYVVDLKRFRETAAGDNLRVFYETLSKDPNSLANLDQDLPNYAQHTVPIF 1161
L GR YHISALYVVDLKRFR+ AAGD LR Y+ LS+DPNSL+NLDQDLPN H V I
Sbjct: 1391 LMGRRYHISALYVVDLKRFRKIAAGDRLRGQYQALSQDPNSLSNLDQDLPNNMIHQVAIK 1450
Query: 1162 SLPQEWLWCESWCGNATKSKAKTIDLCNNPMTKEPKLQGARRIVSEWPDLDSEARQFTAK 1221
SLP +WLWC++WC ++ AK IDLCNNP TKE KL A+RIV EW D D+E + ++
Sbjct: 1451 SLPDDWLWCQTWCSDSNFKTAKVIDLCNNPQTKEAKLTAAQRIVPEWKDYDAELKTLMSR 1510
Query: 1222 ILGEE 1226
I E
Sbjct: 1511 IEDHE 1515
>sp|Q9NYU1|UGGG2_HUMAN UDP-glucose:glycoprotein glucosyltransferase 2 OS=Homo sapiens
GN=UGGT2 PE=1 SV=4
Length = 1516
Score = 672 bits (1734), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 430/1259 (34%), Positives = 670/1259 (53%), Gaps = 133/1259 (10%)
Query: 1 MQEISQNFPSVVSSLSRMKLNDSIKDEIVANQR------YMPPGKSLMALNGALINIEDI 54
M++ISQNFP SL+R+ +N +++EI NQ+ + PG + + +NG ++++
Sbjct: 332 MKDISQNFPIKARSLTRIAVNQHMREEIKENQKDLQVRFKIQPGDARLFINGLRVDMDVY 391
Query: 55 DLYLLIDLVHQELSLADQFSKLKIPRTITQKLLSTVPPAESSMFRVDFRSTHVQYLNNLE 114
D + ++D++ E + + L I K L + +D R + + ++N+LE
Sbjct: 392 DAFSILDMLKLEGKMMNGLRNLGINGEDMSKFLKLNSHIWEYTYVLDIRHSSIMWINDLE 451
Query: 115 EDAMYKRWRSNINEILMPVFPGQLRYIRKNLFHAVYVLDPATVCGLEVIDMIMSLYENHF 174
D +Y W ++ ++L PVFPG + IR+N + V +DPA L+ I + Y +
Sbjct: 452 NDDLYITWPTSCQKLLKPVFPGSVPSIRRNFHNLVLFIDPAQEYTLDFIKLADVFYSHEV 511
Query: 175 PLRFGVILYSSKFIKSIEINGGELHSPVAEDDSPVNEDISSLIIRLFLFIKESHGTQTAF 234
PLR G + FI + +D+ D + R F +I E AF
Sbjct: 512 PLRIGFV-----FILN------------TDDEVDGANDAGVALWRAFNYIAEEFDISEAF 554
Query: 235 QFLSNVNRLRMESADSADDDALEIHHVEGAFVETILPKAKTPPQDMLLKLEKEKTFMDQS 294
++ + M D + L + +V+ T P ++ L + ++
Sbjct: 555 -----ISIVHMYQKVKKDQNILTVDNVKSVLQNTF------PHANIWDILGIHSKYDEER 603
Query: 295 QESSMFVFKLGLTKLKCCLLMNGLVSESSEE-------ALLNAMNDELQRIQEQVYYGNI 347
+ + F GL L L NG + E A+L M D +Q +V+ G +
Sbjct: 604 KAGASFYKMTGLGPLPQAL-YNGEPFKHEEMNIKELKMAVLQRMMDASVYLQREVFLGTL 662
Query: 348 NSYTDVLEKVLSESGI-NRYNPQIITDAKVKPKFISLASSFLGRETE------------- 393
N T+ ++ ++ + + R N I+ + ++++L S+ + + E
Sbjct: 663 NDRTNAIDFLMDRNNVVPRINTLIL---RTNQQYLNLISTSVTADVEDFSTFFFLDSQDK 719
Query: 394 ----LKDINYLHSPETVDD---VKPVTHLLAVDVTSKKGMKLLHEGIRFLIGGSNGARLG 446
K++ YL T DD + VT + D G KLL ++ + S +RLG
Sbjct: 720 SAVIAKNMYYL----TQDDESIISAVTLWIIADFDKPSGRKLLFNALKHM-KTSVHSRLG 774
Query: 447 VLFSASREADLPSIIFVKAFEITASTYSHKKKVLE-FLDQLCSFYERTYLLASSATADST 505
++++ + + + + + I A+ + K L FL QL + + + + D
Sbjct: 775 IIYNPTSKINEENTAISRG--ILAAFLTQKNMFLRSFLGQLA----KEEIATAIYSGDKI 828
Query: 506 QAFIDKVCEFAEANGLSSKVYRASLPEYSKGKVRKQLNKVVQFLHRQLGVESGANAVITN 565
+ F+ E + N K + + ++ F L + G +++N
Sbjct: 829 KTFL---IEGMDKNAFEKKYNTVGVNIFRTHQL---------FCQDVLKLRPGEMGIVSN 876
Query: 566 GRVTFPIDESTFLSHDLSLLESVEFKHRIKHIWEIIEEVNWQETYPDIDPDMLTSKFVSD 625
GR P+DE F + D LLE + F + + I I+E + + + +SD
Sbjct: 877 GRFLGPLDED-FYAEDFYLLEKITFSNLGEKIKGIVENMG------------INANNMSD 923
Query: 626 IILFVTSSMAMRDRSSESARFEILSAEYSAVVFNSENSTIHID--AVIDPLSPTGQKLSS 683
I+ V + M+ + + L +S + N + + + + A++DPL+ QK++
Sbjct: 924 FIMKVDALMSSVPKRASRYDVTFLRENHSVIKTNPQENDMFFNVIAIVDPLTREAQKMAQ 983
Query: 684 LLRVLQRYAQPSMRIVLNPMSSLVDIPLKNYYRYVV-PTMDDFSNTDYSISGPKAFFANM 742
LL VL + +++ +N L + PL+++YR+V+ P + +N D S GP A F ++
Sbjct: 984 LLVVLGKIINMKIKLFMNCRGRLSEAPLESFYRFVLEPELMSGAN-DVSSLGPVAKFLDI 1042
Query: 743 PLSKTLTMNLDVPEPWLVEPVIAVHDLDNILLEKLGDT-RTLQAVFELEALVLTGHCSEK 801
P S L +N+ PE WLVE V + DLDNI L DT +T+ A +ELE L+L G C +K
Sbjct: 1043 PESPLLILNMITPEGWLVETVHSNCDLDNI---HLKDTEKTVTAEYELEYLLLEGQCFDK 1099
Query: 802 -DHEPPQGLQLILGTKSTPHLVDTLVMANLGYWQMKVSPGVWYLQLAPGRSSELY-VLKE 859
+PP+GLQ LGTK+ P +VDT+VMA+ GY+Q+K +PG W L+L G+S ++Y ++
Sbjct: 1100 VTEQPPRGLQFTLGTKNKPAVVDTIVMAHHGYFQLKANPGAWILRLHQGKSEDIYQIVGH 1159
Query: 860 DGNVNEDRSLSKRITINDLRGKVVHMEVVKKKGKENEKLLVSSDEDSHSQAEGHWNSNFL 919
+G ++ + +N + K++ ++V K+ K E +L DE + +G W+S
Sbjct: 1160 EGTDSQADLEDIIVVLNSFKSKILKVKVKKETDKIKEDILTDEDE----KTKGLWDS--- 1212
Query: 920 KWASGFIGGSEQSKKEKAAVDHGKVERHGKTINIFSIASGHLYERFLKIMILSVLKNTCR 979
S + +++KKEK +NIFS+ASGHLYERFL+IM+LSVL+NT
Sbjct: 1213 -IKSFTVSLHKENKKEK------------DVLNIFSVASGHLYERFLRIMMLSVLRNTKT 1259
Query: 980 PVKFWFIKNYLSPQFKDVIPHMAQEYGFEYELITYKWPTWLHKQKEKQRIIWAYKILFLD 1039
PVKFW +KNYLSP FK+VIPHMA+EYGF YEL+ Y+WP WL +Q E+QRIIW YKILFLD
Sbjct: 1260 PVKFWLLKNYLSPTFKEVIPHMAKEYGFRYELVQYRWPRWLRQQTERQRIIWGYKILFLD 1319
Query: 1040 VIFPLSLEKVIFVDADQVVRADMGELYDMDIKGRPLAYTPFCDNNKDMDGYRFWRQGFWK 1099
V+FPL+++K+IFVDADQ+VR D+ EL D D+ G P YTPFCD+ ++MDGYRFW+ G+W
Sbjct: 1320 VLFPLAVDKIIFVDADQIVRHDLKELRDFDLDGAPYGYTPFCDSRREMDGYRFWKTGYWA 1379
Query: 1100 DHLRGRPYHISALYVVDLKRFRETAAGDNLRVFYETLSKDPNSLANLDQDLPNYAQHTVP 1159
HL R YHISALYVVDLK+FR AGD LR Y+ LS+DPNSL+NLDQDLPN + V
Sbjct: 1380 SHLLRRKYHISALYVVDLKKFRRIGAGDRLRSQYQALSQDPNSLSNLDQDLPNNMIYQVA 1439
Query: 1160 IFSLPQEWLWCESWCGNATKSKAKTIDLCNNPMTKEPKLQGARRIVSEWPDLDSEARQF 1218
I SLPQ+WLWCE+WC + +K +AKTIDLCNNP TKE KL+ A RIV EW + D+E RQ
Sbjct: 1440 IKSLPQDWLWCETWCDDESKQRAKTIDLCNNPKTKESKLKAAARIVPEWVEYDAEIRQL 1498
>sp|Q8T191|UGGG_DICDI Probable UDP-glucose:glycoprotein glucosyltransferase A
OS=Dictyostelium discoideum GN=ggtA PE=1 SV=2
Length = 1681
Score = 595 bits (1533), Expect = e-168, Method: Compositional matrix adjust.
Identities = 289/561 (51%), Positives = 384/561 (68%), Gaps = 9/561 (1%)
Query: 670 VIDPLSPTGQKLSSLLRVLQRYAQPSMRIVLNPMSSLVDIPLKNYYRYVVPTMDDFSNTD 729
+I+P + QKL ++R + ++LNP SL ++PLK YY YV+ +F+N +
Sbjct: 1081 IINPFNKVSQKLVPMVREFSNKLNIPVDVILNPPVSLSELPLKTYYTYVIKLSSEFNNEN 1140
Query: 730 YSISGPKAFFANMPLSKTLTMNLDVPEPWLVEPVIAVHDLDNILLEKLGDTRTLQAVFEL 789
+ P ++P + +T+ LD+P WLV+P+IA +DLDNI L+ LGD + L AV+EL
Sbjct: 1141 VLYNQPLGIATDIPEDRVVTLALDIPSSWLVQPIIAKYDLDNIRLKDLGDEQVLTAVYEL 1200
Query: 790 EALVLTGHCSEKDHE-PPQGLQLILG--TKSTPHLVDTLVMANLGYWQMKVSPGVWYLQL 846
E +V+ G ++ + P GL+L+L + T DT+VM N GY+Q+K +PG+W L +
Sbjct: 1201 ENIVIEGSANDMTTDNAPAGLELLLNPISTQTNKTQDTIVMNNFGYYQLKSNPGIWKLTI 1260
Query: 847 APGRSSELYVLKEDGNVNEDRSL----SKRITINDLRGKVVHMEVVKKKGKENEKLLVSS 902
APGRSS++ + + N E + + + I+ L + + VV+K G+E +L
Sbjct: 1261 APGRSSDIMDMVDHPNQKEKETFVIVPHRLVVIDSLYQSLSSLSVVRKAGQELRPILQPI 1320
Query: 903 DEDSHSQAEGHWNSNFLKWASGFIGGSEQSKKEKA-AVDHGKVERHGKTINIFSIASGHL 961
DE + + +SGF SK + +V + + + TI+IFS+ASGHL
Sbjct: 1321 DEYEKQKEQEKEQKLKQN-SSGFFSNLFSSKNDATDSVATHQKKSNLDTIHIFSVASGHL 1379
Query: 962 YERFLKIMILSVLKNTCRPVKFWFIKNYLSPQFKDVIPHMAQEYGFEYELITYKWPTWLH 1021
YERFLKIM+LSV+KNT P+KFWF+KNYLSP FK+ IP MA+EYGF+YEL+TYKWP WL
Sbjct: 1380 YERFLKIMMLSVVKNTESPIKFWFLKNYLSPAFKEFIPEMAKEYGFQYELVTYKWPWWLR 1439
Query: 1022 KQKEKQRIIWAYKILFLDVIFPLSLEKVIFVDADQVVRADMGELYDMDIKGRPLAYTPFC 1081
KQ EKQRIIW+YKILFLDV+FPL + K+IFVDADQVVR D+ EL+DMD+ G L YTPFC
Sbjct: 1440 KQTEKQRIIWSYKILFLDVLFPLDVPKIIFVDADQVVRTDLKELWDMDLHGASLGYTPFC 1499
Query: 1082 DNNKDMDGYRFWRQGFWKDHLRGRPYHISALYVVDLKRFRETAAGDNLRVFYETLSKDPN 1141
D+NKD +G+RFW+ G+W+ HL GR YHISALYVVDL RFR AAGD LR Y+ LS+DPN
Sbjct: 1500 DSNKDTEGFRFWKSGYWRQHLAGRSYHISALYVVDLVRFRRLAAGDQLRATYDQLSRDPN 1559
Query: 1142 SLANLDQDLPNYAQHTVPIFSLPQEWLWCESWCGNATKSKAKTIDLCNNPMTKEPKLQGA 1201
SLANLDQDLPNY QH V I SLPQEWLWCE+WC +KSKAKTIDLCNNP+TK PKL+ A
Sbjct: 1560 SLANLDQDLPNYLQHYVRIHSLPQEWLWCETWCDQESKSKAKTIDLCNNPLTKTPKLENA 1619
Query: 1202 RRIVSEWPDLDSEARQFTAKI 1222
RI+ EW LD+EA++F KI
Sbjct: 1620 VRIIDEWTTLDNEAKEFELKI 1640
Score = 64.3 bits (155), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 65/257 (25%), Positives = 123/257 (47%), Gaps = 18/257 (7%)
Query: 1 MQEISQNFPSVVSSLSRMKLNDSIKDEIVANQRYMPPGK-SLMALNGALINIEDIDLYLL 59
++ ISQ FP++ +SLS++ LN+S+K I +NQ+ +P + LNG LI+ ++ L
Sbjct: 371 LEYISQKFPTLSNSLSKITLNESLKSVIESNQKIIPSTTDQTLLLNGRLIDTNELSPIEL 430
Query: 60 IDLVHQELSLADQFSKLK-IPRTITQKLLSTVPPAESSMF----RVDFRSTHVQY--LNN 112
++ +E + + + Q ++S P + ++ + + LNN
Sbjct: 431 SRIILEEYEHSTTIQQQGPLSSKTVQDIISAQLPIRIQLLPTKEELELNGGNEPFVSLNN 490
Query: 113 LEEDAMYKRWRSNINEILM--PVFPGQLRYIRKNLFHAVYVLDPATVCGLEVIDMIMSLY 170
LE D +Y++W + ++ PV Q +IRKNL V VLD + E+I I +
Sbjct: 491 LELDYIYRQWEPKLQSSVLDKPVTSPQDIFIRKNLLTTVIVLDWNNINTFEIIPEIQEMV 550
Query: 171 ENH--FPLRFGVILYSSKFIKSIEINGGELHSPVAEDDSPVNEDISSLIIRLFLFIKESH 228
+ + P R +L+++K + N + +D++ ++FL IK S+
Sbjct: 551 QGNSLIPTRIQ-LLFNTKSNNNNNNNNNNNDQNSQTSNFIQGKDLA----KVFLTIKNSN 605
Query: 229 -GTQTAFQFLSNVNRLR 244
G + AF F++ +N +
Sbjct: 606 LGNRGAFFFITALNYFK 622
>sp|Q09140|UGGG_SCHPO UDP-glucose:glycoprotein glucosyltransferase OS=Schizosaccharomyces
pombe (strain 972 / ATCC 24843) GN=gpt1 PE=1 SV=2
Length = 1448
Score = 508 bits (1307), Expect = e-142, Method: Compositional matrix adjust.
Identities = 258/551 (46%), Positives = 352/551 (63%), Gaps = 32/551 (5%)
Query: 669 AVIDPLSPTGQKLSSLLRVLQRYAQPSMRIVLNPMSSLVDIPLKNYYRYVVPTMDDFSNT 728
AV+DPLS QK S++L + + +RI NP +L ++PL +YRY + +F
Sbjct: 897 AVLDPLSKDSQKWSAILEAVSKLNGVGVRIHFNPKQTLSELPLTRFYRYSISAEPEFDAL 956
Query: 729 DYSISGPKAFFANMPLSKTLTMNLDVPEPWLVEPVIAVHDLDNILLEKLGDTRTLQ---A 785
+ + F N+P LTM+++ + W V DL NI LE + L A
Sbjct: 957 GH-LEESYVEFDNLPADTLLTMDIEARDAWTVMQKDVDIDLFNIKLEHTSEAEALDSHTA 1015
Query: 786 VFELEALVLTGHCSEK-DHEPPQGLQLILGTKSTPHLVDTLVMANLGYWQMKVSPGVWYL 844
++EL+ +++ G+ E+ PP+G+QL LG + H+ DT+V++NLGY+Q+K +PGVW L
Sbjct: 1016 IYELKNILVQGYSQEEFRKSPPRGMQLKLGNLTNSHVTDTIVLSNLGYFQLKANPGVWTL 1075
Query: 845 QLAPGRSSELYVLKEDGNVNEDRSLSK-RITINDLRGKVVHMEVVKKKGKENEKLLVSSD 903
+ GRSS+ Y E ++N+ S ++ ++ G ++ + +K G E+ ++ D
Sbjct: 1076 EPMDGRSSQFY---EILSLNKKNSYKDPQVIVDSFEGVTLNPVMRRKPGFESADIM---D 1129
Query: 904 EDSHSQAEGHWNSNFLKWASGFIGGSEQSKKEKAAVDHGKVERHGKTINIFSIASGHLYE 963
ED S F K K ++ +R +INIFS+ASGHLYE
Sbjct: 1130 EDLSSH-------KFFD-------------KIKKSLSFFNFKRKEASINIFSVASGHLYE 1169
Query: 964 RFLKIMILSVLKNTCRPVKFWFIKNYLSPQFKDVIPHMAQEYGFEYELITYKWPTWLHKQ 1023
RFL IM SV+++T + VKFWFI+N+LSP FK IP +A++Y FEYE ITY WP WL KQ
Sbjct: 1170 RFLYIMTKSVIEHTDKKVKFWFIENFLSPSFKSSIPAIAKKYNFEYEYITYNWPHWLRKQ 1229
Query: 1024 KEKQRIIWAYKILFLDVIFPLSLEKVIFVDADQVVRADMGELYDMDIKGRPLAYTPFCDN 1083
+EKQR IW YKILFLDV+FPL L KVI+VDADQ+VRAD+ EL DMD+ G P YTP CD+
Sbjct: 1230 EEKQREIWGYKILFLDVLFPLELHKVIYVDADQIVRADLQELMDMDLHGAPYGYTPMCDS 1289
Query: 1084 NKDMDGYRFWRQGFWKDHLRGRPYHISALYVVDLKRFRETAAGDNLRVFYETLSKDPNSL 1143
++M+G+RFW++G+WK LRG YHISALYVVDL RFR+ AGD LR Y+ LS DPNSL
Sbjct: 1290 REEMEGFRFWKKGYWKKFLRGLKYHISALYVVDLDRFRKMGAGDLLRRQYQLLSADPNSL 1349
Query: 1144 ANLDQDLPNYAQHTVPIFSLPQEWLWCESWCGNATKSKAKTIDLCNNPMTKEPKLQGARR 1203
+NLDQDLPN+ QH +PI+SLPQ+WLWCE+WC + + AKTIDLC NP+TKE KL ARR
Sbjct: 1350 SNLDQDLPNHLQHLIPIYSLPQDWLWCETWCSDESLKTAKTIDLCQNPLTKEKKLDRARR 1409
Query: 1204 IVSEWPDLDSE 1214
VSEW D+E
Sbjct: 1410 QVSEWTSYDNE 1420
Score = 80.9 bits (198), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 91/402 (22%), Positives = 185/402 (46%), Gaps = 59/402 (14%)
Query: 1 MQEISQNFPSVVSSLSRMK--LNDSIKDEIVANQRYMPPGKSLMALNGALINIEDIDLYL 58
+E++Q+FP LS ND I++ +Y+P G + + LNG +++E+ D +
Sbjct: 297 FRELTQDFPIYAHYLSIQPDVSNDLIEELNQFQSQYVPEGINTIWLNGLSLDLEETDAFS 356
Query: 59 LIDLVHQELSLADQFSKLKIPRTITQKLLSTVPPA-ESSMFRV----------DFRSTHV 107
++ L+ +E + D+F L I + +++ A E S F+ D+++ H
Sbjct: 357 ILSLIKKEKDMFDRFEALGIKSSKVLDIVTNEAFANEDSDFKFVKFHCQDDIEDWKAIH- 415
Query: 108 QYLNNLEEDAMYKRWRSNINEILMPVFPGQLRYIRKNLFHAVYVLDPATVCGLEVIDMIM 167
++N +E + Y W +I +L P++PGQL + K L +Y + P++ L ++ ++
Sbjct: 416 -WVNEIESNPKYDNWPKSIQILLKPIYPGQLHMLGKQLHTVIYPIFPSSPSSLPLLSELI 474
Query: 168 SLYENHFPLRFGVILYSSKFIKSIEINGGELHSPVAEDDSPVNEDISSLIIRLFLFIKES 227
P++ G++ A DD ++ + + + F +I +
Sbjct: 475 QFSRRPSPVQTGMVC-------------------AANDD----DEFAQTVCKSFFYISKE 511
Query: 228 HGTQTAFQFLSNVNRLRMESADSADDDALEIHHVEGAFVETILPKAKTPPQDMLLKLEKE 287
GT +A +FL + S SAD +L +E LP ++ D L L+K+
Sbjct: 512 SGTDSALKFLYKC----LNSDSSADLYSL---------LEEHLPLSEH-DDDTLANLKKD 557
Query: 288 --KTFMDQ-SQESSMFVFKLGLTKLKCCLLMNGLV---SESSEEALLNAMNDELQRIQEQ 341
+F D +S+ +V +LG+ +++NG + E+ + ++ +++ +Q
Sbjct: 558 LSSSFFDHYMSKSNSWVNRLGIDSSASEVIVNGRIISHDENYDRSMYGIFLEDIPEVQIA 617
Query: 342 VYYGNINSYTDVLEKVLSESGINRYNPQIITDAKVKPKFISL 383
V G I+ ++L+ +L ++ + R NP + AK K I +
Sbjct: 618 VAEGKISEDDNLLDFILRDASLTR-NPLVYPSAKSSIKSIDI 658
>sp|P22023|KRE5_YEAST Killer toxin-resistance protein 5 OS=Saccharomyces cerevisiae (strain
ATCC 204508 / S288c) GN=KRE5 PE=1 SV=2
Length = 1365
Score = 94.0 bits (232), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 68/219 (31%), Positives = 104/219 (47%), Gaps = 16/219 (7%)
Query: 951 INIFSI-ASGHLYERFLKIMILSVLKNT--CRPVKFWFIKN-YLSPQFKDVIPHM--AQE 1004
INIF+I SG E MILS+L + V F+ + ++S + ++ + E
Sbjct: 1095 INIFTILESGPDEEERYMQMILSILSKCPETQKVNFFILDQPFISDTLRKSCEYINSSDE 1154
Query: 1005 YGFEYELITYKWPTWLHKQKEKQRIIWAYKILFLDVIFPLSLEKVIFVDADQVVRADMGE 1064
+ Y+WP WL Q+ R + LFLDV+ P ++ KV+++ +V
Sbjct: 1155 MRGNVIFLNYEWPQWLRPQRFSSRRRDVSRFLFLDVLLPQNISKVLYMSPTEVPLDPFDI 1214
Query: 1065 LYDMDIKGRPLAYTPFCDNNKDMDGYRFWRQGFWKDHLRGRP---YHISALYVVDLKRFR 1121
+K PL M G +W++G+W+ LR Y ++V+L+RFR
Sbjct: 1215 FQFQGLKRAPLGLF-------RMSGDGYWKEGYWEKMLRENNLEFYSTEPAFLVNLERFR 1267
Query: 1122 ETAAGDNLRVFYETLSKDPNSLANLDQDLPNYAQHTVPI 1160
E AGD R+ Y+ +S D SL N+ QDL N Q VPI
Sbjct: 1268 ELDAGDKYRIHYQRISTDAMSLVNIGQDLVNNLQLEVPI 1306
>sp|P43974|Y258_HAEIN Putative glycosyltransferase HI_0258 OS=Haemophilus influenzae
(strain ATCC 51907 / DSM 11121 / KW20 / Rd) GN=HI_0258
PE=3 SV=2
Length = 330
Score = 46.2 bits (108), Expect = 0.002, Method: Composition-based stats.
Identities = 31/128 (24%), Positives = 63/128 (49%), Gaps = 1/128 (0%)
Query: 949 KTINIFSIASGHLYERFLKIMILSVLKNTCRPVKFWFIKNYLSPQFKDVIPHMAQEYGFE 1008
+T+NI +S H Y +L + I S++KNT + + F+ + ++ + K +I ++A Y +
Sbjct: 37 QTMNII-FSSDHYYAPYLAVSIFSIIKNTPKKINFYILDMKINQENKTIINNLASAYSCK 95
Query: 1009 YELITYKWPTWLHKQKEKQRIIWAYKILFLDVIFPLSLEKVIFVDADQVVRADMGELYDM 1068
+ + + K I A + ++EK I++D D + + + EL+++
Sbjct: 96 VFFLPVCESDFQNFPKTIDYISLATYARLNLTKYIKNIEKAIYIDVDTLTNSSLQELWNI 155
Query: 1069 DIKGRPLA 1076
DI LA
Sbjct: 156 DITNYYLA 163
>sp|Q9M8J2|GATL4_ARATH Probable galacturonosyltransferase-like 4 OS=Arabidopsis thaliana
GN=GATL4 PE=2 SV=1
Length = 351
Score = 44.3 bits (103), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 46/176 (26%), Positives = 90/176 (51%), Gaps = 22/176 (12%)
Query: 962 YERFLKIMILSVLKNTCRP--VKFWFI-----KNYLSPQFKDVIPHMA-QEYGFEYELIT 1013
Y R +LS+L+++ P + F F+ +N L K P++ + Y F+ L+
Sbjct: 76 YLRGTMAAVLSLLQHSTCPENLSFHFLSLPHFENDLFTSIKSTFPYLNFKIYQFDPNLVR 135
Query: 1014 YKWPTWLHKQKEKQRIIWAYKILFLDVIFPLSLEKVIFVDADQVVRADMGELYDMDIKGR 1073
K + +Q Q + Y ++L I P S++++I++D+D VV D+ +L+ ++++G+
Sbjct: 136 SKISKSI-RQALDQPL--NYARIYLADIIPSSVDRIIYLDSDLVVVDDIEKLWHVEMEGK 192
Query: 1074 PLAYTPFCDNNKDMDGYRFWRQGFWKDH-----LRG-RP-YHISALYVVDLKRFRE 1122
+A +C N ++ + FW D L G RP Y + + VVD+ ++R+
Sbjct: 193 VVAAPEYCHANFT----HYFTRTFWSDPVLVKVLEGKRPCYFNTGVMVVDVNKWRK 244
>sp|P25148|GSPA_BACSU General stress protein A OS=Bacillus subtilis (strain 168) GN=gspA
PE=1 SV=2
Length = 286
Score = 43.1 bits (100), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 38/135 (28%), Positives = 65/135 (48%), Gaps = 6/135 (4%)
Query: 946 RHGKTINIFSIASGHLYERFLKIMILSVLKNTC--RPVKFWFIKNYLSPQFKDVIPHMAQ 1003
R + ++I S A + Y R L M +S+L N R VK + I + P K +
Sbjct: 2 RKDEIMHIVSCADDN-YARHLGGMFVSLLTNMDQEREVKLYVIDGGIKPDNKKRLEETTL 60
Query: 1004 EYGFEYELITYKWPTWLHKQKEKQRIIWA--YKILFLDVIFPLSLEKVIFVDADQVVRAD 1061
++G E + + H E I A Y+I D+I S++++I++D D +V D
Sbjct: 61 KFGVPIEFLEVDTNMYEHAV-ESSHITKAAYYRISIPDLIKDESIKRMIYIDCDALVLED 119
Query: 1062 MGELYDMDIKGRPLA 1076
+ +L+D+DI +A
Sbjct: 120 ISKLWDLDIAPYTVA 134
>sp|O04253|GATL6_ARATH Probable galacturonosyltransferase-like 6 OS=Arabidopsis thaliana
GN=GATL6 PE=2 SV=1
Length = 346
Score = 42.4 bits (98), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 36/147 (24%), Positives = 69/147 (46%), Gaps = 14/147 (9%)
Query: 1022 KQKEKQRIIWAYKILFLDVIFPLSLEKVIFVDADQVVRADMGELYDMDIKGRPLAYTPFC 1081
+Q +Q + +A L D++ P + +VI++D+D VV D+ +L+ + R + +C
Sbjct: 143 RQALEQPLNYARSYL-ADLLEP-CVNRVIYLDSDLVVVDDIAKLWKTSLGSRIIGAPEYC 200
Query: 1082 DNNKDMDGYRFWRQGFWKDH-----LRGRP--YHISALYVVDLKRFRETAAGDNLRVFYE 1134
N +++ GFW + RGR Y + + V+DLK++R + + E
Sbjct: 201 HANFT----KYFTGGFWSEERFSGTFRGRKPCYFNTGVMVIDLKKWRRGGYTKRIEKWME 256
Query: 1135 TLSKDP-NSLANLDQDLPNYAQHTVPI 1160
++ L +L L ++ H PI
Sbjct: 257 IQRRERIYELGSLPPFLLVFSGHVAPI 283
>sp|Q9Z1M7|LARGE_MOUSE Glycosyltransferase-like protein LARGE1 OS=Mus musculus GN=Large PE=1
SV=1
Length = 756
Score = 40.4 bits (93), Expect = 0.092, Method: Compositional matrix adjust.
Identities = 64/275 (23%), Positives = 119/275 (43%), Gaps = 32/275 (11%)
Query: 906 SHSQAEGHWNSNFLKWASGFIGGSEQSKKEKAAVDHGKVERHGKTINIFSIASGHLYERF 965
++S EG +S L+ +G + G+ ++ AV+ K E TI++ + +G+ R
Sbjct: 101 TYSMEEGTGDSENLR--AGIVAGNSSECGQQPAVE--KCE----TIHVAIVCAGYNASRD 152
Query: 966 LKIMILSVLKNTCRPVKFWFIKNYLSPQ-----FKD-VIPHMAQEYGFEYELITYKWPTW 1019
+ ++ SVL + P+ F I + ++ Q F+ ++P + ++ EL + +W
Sbjct: 153 VVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQTWMVPAVRVDFYNADELKSE--VSW 210
Query: 1020 LHKQKEKQRIIWAYKILFLDVIFPLSLEKVIFVDADQVVRADMGELYDMDIKGRPLAYTP 1079
+ + I+ L L P +LE+VI +D D D+ EL+ + K +
Sbjct: 211 IPNKHYSG--IYGLMKLVLTKTLPANLERVIVLDTDITFATDIAELWAVFHKFKGQQVLG 268
Query: 1080 FCDNNKDMDGYRFWRQG-FWKDH----LRGRPYHISALYVVDLKRFRETAAGDNLRVFYE 1134
+N D W G WK+H GR Y+ + ++ L + R+ R+ E
Sbjct: 269 LVENQSD------WYLGNLWKNHRPWPALGRGYNTGVILLL-LDKLRKMKWEQMWRLTAE 321
Query: 1135 TLSKDPNSLANLDQDLPNYA--QHTVPIFSLPQEW 1167
S + DQD+ N Q+ ++ LP W
Sbjct: 322 RELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFW 356
>sp|Q0V7R1|GATL3_ARATH Probable galacturonosyltransferase-like 3 OS=Arabidopsis thaliana
GN=GATL3 PE=2 SV=1
Length = 345
Score = 40.0 bits (92), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 52/212 (24%), Positives = 97/212 (45%), Gaps = 18/212 (8%)
Query: 922 ASGFIGGSEQSKKEKAAVDHGKVERHGKTINIFSIASGHLYERFLKIMILSVLKNTCRP- 980
A F G E SK DH H +I ++ +Y R + SVL++ P
Sbjct: 34 APAFRNGRECSKTTWIPSDH----EHNPSIIHIAMTLDAIYLRGSVAGVFSVLQHASCPE 89
Query: 981 -VKFWFIKNYL-SPQFKDVI----PHMAQE-YGFEYELITYKWPTWLHKQKEKQRIIWAY 1033
+ F FI + S + +I P++ Y F+ L+ K + + + ++ Y
Sbjct: 90 NIVFHFIATHRRSADLRRIISSTFPYLTYHIYHFDPNLVRSKISSSIRRALDQPL---NY 146
Query: 1034 KILFLDVIFPLSLEKVIFVDADQVVRADMGELYDMDIKGRPLAYTPFCDNN-KDMDGYRF 1092
++L + P+++ +VI+ D+D VV D+ +L+ +D++ + +C N + RF
Sbjct: 147 ARIYLADLLPIAVRRVIYFDSDLVVVDDVAKLWRIDLRRHVVGAPEYCHANFTNYFTSRF 206
Query: 1093 WRQGFWKDHLRGRP--YHISALYVVDLKRFRE 1122
W +K L+ R Y + + V+DL ++RE
Sbjct: 207 WSSQGYKSALKDRKPCYFNTGVMVIDLGKWRE 238
>sp|Q66PG3|LARGE_CHICK Glycosyltransferase-like protein LARGE1 OS=Gallus gallus GN=LARGE
PE=2 SV=1
Length = 756
Score = 40.0 bits (92), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 67/277 (24%), Positives = 122/277 (44%), Gaps = 36/277 (12%)
Query: 906 SHSQAEGHWNSNFLKWASGFIGGSEQSKKEKAAVDHGKVERHGKTINIFSIASGHLYERF 965
++S EG +S L+ +G + G+ ++ AV+ K E TI++ + +G+ R
Sbjct: 101 TYSMEEGTGDSESLR--AGIVAGNSSECGQQPAVE--KCE----TIHVAIVCAGYNASRD 152
Query: 966 LKIMILSVLKNTCRPVKFWFIKNYLSPQ-----FKD-VIPHMAQEYGFEYELITYKWPTW 1019
+ ++ SVL + P+ F I + ++ Q F+ ++P + ++ EL + +W
Sbjct: 153 VVTLVKSVLFHRRNPLHFHLIADAIAKQILATLFQTWMVPAVRIDFYDADELKSEV--SW 210
Query: 1020 LHKQKEKQRIIWAYKILFLDVIFPLSLEKVIFVDADQVVRADMGELYDMDIKGRPLAYTP 1079
+ + I+ L L P +LE+VI +D D D+ EL+ + K +
Sbjct: 211 IPNKHYSG--IYGLMKLVLTKTLPSNLERVIVLDTDITFATDIAELWAVFHKFKGQQVLG 268
Query: 1080 FCDNNKDMDGYRFWRQG-FWKDH----LRGRPYHISALYVVDLKRFRETAAGDNLRVFYE 1134
+N D W G WK+H GR Y+ + ++ L + R+ R+ E
Sbjct: 269 LVENQSD------WYLGNLWKNHRPWPALGRGYNTGVILLL-LDKLRKMKWEQMWRLTAE 321
Query: 1135 T--LSKDPNSLANLDQDLPNYA--QHTVPIFSLPQEW 1167
+S SLA DQD+ N Q+ ++ LP W
Sbjct: 322 RELMSMLSTSLA--DQDIFNAVIKQNPFLVYQLPCFW 356
>sp|Q9FWY9|GATL5_ARATH Probable galacturonosyltransferase-like 5 OS=Arabidopsis thaliana
GN=GATL5 PE=2 SV=1
Length = 361
Score = 39.7 bits (91), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 37/147 (25%), Positives = 70/147 (47%), Gaps = 14/147 (9%)
Query: 1022 KQKEKQRIIWAYKILFLDVIFPLSLEKVIFVDADQVVRADMGELYDMDIKGRPLAYTPFC 1081
+Q +Q + +A L D++ P +++VI++D+D VV D+ +L+ + R + +C
Sbjct: 158 RQALEQPLNYARNYL-ADLLEP-CVKRVIYLDSDLVVVDDIVKLWKTGLGQRTIGAPEYC 215
Query: 1082 DNNKDMDGYRFWRQGFWKDH-----LRGR--PYHISALYVVDLKRFRETAAGDNLRVFYE 1134
N +++ GFW D +GR Y + + V+DLK++R+ + + E
Sbjct: 216 HANFT----KYFTGGFWSDKRFNGTFKGRNPCYFNTGVMVIDLKKWRQFRFTKRIEKWME 271
Query: 1135 TLSKDP-NSLANLDQDLPNYAQHTVPI 1160
+ L +L L +A H PI
Sbjct: 272 IQKIERIYELGSLPPFLLVFAGHVAPI 298
>sp|Q66PG1|LARG2_DANRE Glycosyltransferase-like protein LARGE2 OS=Danio rerio GN=gyltl1b
PE=2 SV=1
Length = 750
Score = 39.3 bits (90), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 78/343 (22%), Positives = 140/343 (40%), Gaps = 46/343 (13%)
Query: 854 LYVLKEDGNVNEDRSLS-KRITINDLRGKVVHMEVVKKKGKENE------KLLVSSDEDS 906
LY+L GN RSL + +++ +V+ + +E E +L +S +
Sbjct: 25 LYLLV--GNSENGRSLLLSACLVESTEARLLERDVLASRVREVEEENRQIRLQLSQSQGL 82
Query: 907 HSQ-AEGHWNSNFLKW-ASGFIGGSEQSKKEKAAVDHGKVER-----HGKTINIFSIASG 959
Q AEG++ + +W AS G + + +H + R + +++ + +G
Sbjct: 83 AGQPAEGNYGNQ--QWVASADTGPEDVENTAEERANHSECSRSPTAEKCELLHVACVCAG 140
Query: 960 HLYERFLKIMILSVLKNTCRPVKFWFIKNYLSPQFKD------VIPHMAQEYGFEYELIT 1013
H R + ++ S+L + P+ F FI + ++ Q ++P + + EL +
Sbjct: 141 HNASRDVVTLVKSILFHRRNPLHFHFITDTVANQILSTLFQSWMVPSVQVSFYDADELKS 200
Query: 1014 YKWPTWLHKQKEKQRIIWAYKILFLDVIFPLSLEKVIFVDADQVVRADMGELYDMDIKGR 1073
+W+ + I+ L L P +L KVI +D D D+ EL+ + K
Sbjct: 201 E--VSWIPNKHYSG--IYGLMKLTLTKALPSNLSKVIVLDTDITFATDIAELWAIFRKFT 256
Query: 1074 PLAYTPFCDNNKDMDGYRFWRQG-FWKDH----LRGRPYHISALYVVDLKRFRETAAGDN 1128
+N D W G WK+H GR ++ + ++ L+R R
Sbjct: 257 EKQVIGLVENQSD------WYLGNLWKNHKPWPALGRGFNTGVI-LLYLERLRRMGWEQM 309
Query: 1129 LRVFYET--LSKDPNSLANLDQDLPN--YAQHTVPIFSLPQEW 1167
R+ E +S SLA DQD+ N Q+ V + LP W
Sbjct: 310 WRLTAERELMSMLSTSLA--DQDIFNAFIKQNPVLVHQLPCFW 350
>sp|Q0WV13|GAUTD_ARATH Probable galacturonosyltransferase 13 OS=Arabidopsis thaliana
GN=GAUT13 PE=2 SV=1
Length = 533
Score = 38.9 bits (89), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 35/133 (26%), Positives = 63/133 (47%), Gaps = 21/133 (15%)
Query: 1036 LFLDVIFPLSLEKVIFVDADQVVRADMGELYDMDIKGRPLAYTPFCDN------NKDMDG 1089
++L +FP +L+KV+F+D D V++ D+ L+D+D+ G+ C +K +
Sbjct: 335 IYLPELFP-NLDKVVFLDDDIVIQKDLSPLWDIDLNGKVNGAVETCRGEDVWVMSKRLRN 393
Query: 1090 YRFWRQGFWKDHLRGRPYHISALY---VVDLKRFRETAAGDNLRVFYETLSKDPNSLANL 1146
Y + HL P + Y + DL+ +R+T N+R Y + K+ NL
Sbjct: 394 YFNFSHPLIAKHL--DPEECAWAYGMNIFDLRTWRKT----NIRETYHSWLKE-----NL 442
Query: 1147 DQDLPNYAQHTVP 1159
+L + T+P
Sbjct: 443 KSNLTMWKLGTLP 455
>sp|O04536|GATL9_ARATH Probable galacturonosyltransferase-like 9 OS=Arabidopsis thaliana
GN=GATL9 PE=2 SV=1
Length = 390
Score = 38.5 bits (88), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 28/110 (25%), Positives = 53/110 (48%), Gaps = 12/110 (10%)
Query: 1037 FLDVIFPLSLEKVIFVDADQVVRADMGELYDMDIKGRPLAYTP-FCDNNKDMDGYRFWRQ 1095
+L I +++VI++D+D +V D+ +L++ + G + P +C N +++
Sbjct: 175 YLGDILDPCVDRVIYLDSDIIVVDDITKLWNTSLTGSRIIGAPEYCHANFT----KYFTS 230
Query: 1096 GFWKD-----HLRGRP--YHISALYVVDLKRFRETAAGDNLRVFYETLSK 1138
GFW D GR Y + + V+DL R+RE + L + + K
Sbjct: 231 GFWSDPALPGFFSGRKPCYFNTGVMVMDLVRWREGNYREKLETWMQIQKK 280
>sp|O95461|LARGE_HUMAN Glycosyltransferase-like protein LARGE1 OS=Homo sapiens GN=LARGE PE=1
SV=1
Length = 756
Score = 38.5 bits (88), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 63/275 (22%), Positives = 118/275 (42%), Gaps = 32/275 (11%)
Query: 906 SHSQAEGHWNSNFLKWASGFIGGSEQSKKEKAAVDHGKVERHGKTINIFSIASGHLYERF 965
++S EG +S L+ +G + G+ ++ V+ K E TI++ + +G+ R
Sbjct: 101 TYSMEEGTGDSENLR--AGIVAGNSSECGQQPVVE--KCE----TIHVAIVCAGYNASRD 152
Query: 966 LKIMILSVLKNTCRPVKFWFIKNYLSPQ-----FKD-VIPHMAQEYGFEYELITYKWPTW 1019
+ ++ SVL + P+ F I + ++ Q F+ ++P + ++ EL + +W
Sbjct: 153 VVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQTWMVPAVRVDFYNADELKSE--VSW 210
Query: 1020 LHKQKEKQRIIWAYKILFLDVIFPLSLEKVIFVDADQVVRADMGELYDMDIKGRPLAYTP 1079
+ + I+ L L P +LE+VI +D D D+ EL+ + K +
Sbjct: 211 IPNKHYSG--IYGLMKLVLTKTLPANLERVIVLDTDITFATDIAELWAVFHKFKGQQVLG 268
Query: 1080 FCDNNKDMDGYRFWRQG-FWKDH----LRGRPYHISALYVVDLKRFRETAAGDNLRVFYE 1134
+N D W G WK+H GR Y+ + ++ L + R+ R+ E
Sbjct: 269 LVENQSD------WYLGNLWKNHRPWPALGRGYNTGVILLL-LDKLRKMKWEQMWRLTAE 321
Query: 1135 TLSKDPNSLANLDQDLPNYA--QHTVPIFSLPQEW 1167
S + DQD+ N Q+ ++ LP W
Sbjct: 322 RELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFW 356
>sp|Q9LHD2|GATLA_ARATH Probable galacturonosyltransferase-like 10 OS=Arabidopsis thaliana
GN=GATL10 PE=2 SV=1
Length = 365
Score = 38.1 bits (87), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 27/106 (25%), Positives = 50/106 (47%), Gaps = 15/106 (14%)
Query: 1033 YKILFLDVIFPLSLEKVIFVDADQVVRADMGELYDMDIKG-RPLAYTPFCDNNKDMDGYR 1091
Y +L I + +VI++D+D +V D+ +L+ + + G R + +C N +
Sbjct: 157 YARSYLSEILSSCVSRVIYLDSDVIVVDDIQKLWKISLSGSRTIGAPEYCHANFT----K 212
Query: 1092 FWRQGFWKDH-------LRGRPYHISALYVVDLKRFRETAAGDNLR 1130
++ FW D + Y + + V+DL+R+RE GD R
Sbjct: 213 YFTDSFWSDQKLSSVFDSKTPCYFNTGVMVIDLERWRE---GDYTR 255
>sp|O48684|GATL8_ARATH Probable galacturonosyltransferase-like 8 OS=Arabidopsis thaliana
GN=GATL8 PE=2 SV=1
Length = 393
Score = 37.0 bits (84), Expect = 0.94, Method: Compositional matrix adjust.
Identities = 28/94 (29%), Positives = 50/94 (53%), Gaps = 12/94 (12%)
Query: 1037 FLDVIFPLSLEKVIFVDADQVVRADMGELYDMDIKGRPLAYTP-FCDNNKDMDGYRFWRQ 1095
+L I S+E+VI++D+D + D+ +L++ + G + P +C N +++
Sbjct: 178 YLGDILDRSVERVIYLDSDVITVDDITKLWNTVLTGSRVIGAPEYCHANFT----QYFTS 233
Query: 1096 GFWKD-HLRG-----RP-YHISALYVVDLKRFRE 1122
GFW D L G +P Y + + V+DL R+RE
Sbjct: 234 GFWSDPALPGLISGQKPCYFNTGVMVMDLVRWRE 267
>sp|Q8GWT1|GAUTE_ARATH Probable galacturonosyltransferase 14 OS=Arabidopsis thaliana
GN=GAUT14 PE=2 SV=1
Length = 532
Score = 36.6 bits (83), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 34/133 (25%), Positives = 63/133 (47%), Gaps = 21/133 (15%)
Query: 1036 LFLDVIFPLSLEKVIFVDADQVVRADMGELYDMDIKGRPLAYTPFCDN------NKDMDG 1089
+++ +FP +L+KV+F+D D VV+ D+ L+D+D+ G+ C +K +
Sbjct: 334 IYIPELFP-NLDKVVFLDDDIVVQGDLTPLWDVDLGGKVNGAVETCRGEDEWVMSKRLRN 392
Query: 1090 YRFWRQGFWKDHLRGRPYHISALY---VVDLKRFRETAAGDNLRVFYETLSKDPNSLANL 1146
Y + HL P + Y + DL+ +R+T N+R Y + ++ NL
Sbjct: 393 YFNFSHPLIAKHL--DPEECAWAYGMNIFDLQAWRKT----NIRETYHSWLRE-----NL 441
Query: 1147 DQDLPNYAQHTVP 1159
+L + T+P
Sbjct: 442 KSNLTMWKLGTLP 454
>sp|Q9FH36|GAUTC_ARATH Probable galacturonosyltransferase 12 OS=Arabidopsis thaliana
GN=GAUT12 PE=2 SV=1
Length = 535
Score = 36.6 bits (83), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 16/38 (42%), Positives = 27/38 (71%), Gaps = 1/38 (2%)
Query: 1036 LFLDVIFPLSLEKVIFVDADQVVRADMGELYDMDIKGR 1073
+ L +FP SL KV+F+D D V++ D+ L+D+D+ G+
Sbjct: 336 IHLPELFP-SLNKVVFLDDDIVIQTDLSPLWDIDMNGK 372
>sp|Q9QUS6|TRIM1_MOUSE Probable E3 ubiquitin-protein ligase MID2 OS=Mus musculus GN=Mid2
PE=2 SV=2
Length = 705
Score = 36.2 bits (82), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 22/53 (41%), Positives = 29/53 (54%)
Query: 434 RFLIGGSNGARLGVLFSASREADLPSIIFVKAFEITASTYSHKKKVLEFLDQL 486
RFL N A + +AS + +P I F AFE A +S +KK+LE LD L
Sbjct: 347 RFLQSAKNIAERVAMATASSQVLIPDINFNDAFENFALDFSREKKLLEGLDYL 399
>sp|Q9UJV3|TRIM1_HUMAN Probable E3 ubiquitin-protein ligase MID2 OS=Homo sapiens GN=MID2
PE=1 SV=3
Length = 735
Score = 36.2 bits (82), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 22/53 (41%), Positives = 29/53 (54%)
Query: 434 RFLIGGSNGARLGVLFSASREADLPSIIFVKAFEITASTYSHKKKVLEFLDQL 486
RFL N A + +AS + +P I F AFE A +S +KK+LE LD L
Sbjct: 347 RFLQSAKNIAERVAMATASSQVLIPDINFNDAFENFALDFSREKKLLEGLDYL 399
>sp|Q2HJ96|GL8D2_BOVIN Glycosyltransferase 8 domain-containing protein 2 OS=Bos taurus
GN=GLT8D2 PE=2 SV=1
Length = 350
Score = 35.8 bits (81), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 21/60 (35%), Positives = 32/60 (53%), Gaps = 4/60 (6%)
Query: 1037 FLDVIFPLSL---EKVIFVDADQVVRADMGELYDMDIK-GRPLAYTPFCDNNKDMDGYRF 1092
F+ PL + EKVI++D D +V+ D+ ELYD + G A++ CD D +R
Sbjct: 137 FVRFYLPLLIHQHEKVIYLDDDVIVQGDIQELYDTTLALGHAAAFSDDCDLPSSQDIHRL 196
>sp|Q0WQD2|GAUT3_ARATH Probable galacturonosyltransferase 3 OS=Arabidopsis thaliana GN=GAUT3
PE=2 SV=2
Length = 680
Score = 35.4 bits (80), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 14/38 (36%), Positives = 28/38 (73%), Gaps = 1/38 (2%)
Query: 1036 LFLDVIFPLSLEKVIFVDADQVVRADMGELYDMDIKGR 1073
+L ++P LEK++F+D D VV+ D+ L+++D++G+
Sbjct: 489 FYLPEVYP-KLEKILFLDDDIVVQKDLAPLWEIDMQGK 525
>sp|Q93ZX7|GAUT4_ARATH Probable galacturonosyltransferase 4 OS=Arabidopsis thaliana GN=GAUT4
PE=2 SV=1
Length = 616
Score = 35.0 bits (79), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 17/37 (45%), Positives = 25/37 (67%), Gaps = 1/37 (2%)
Query: 1036 LFLDVIFPLSLEKVIFVDADQVVRADMGELYDMDIKG 1072
+L IFP L KV+F+D D VV+ D+ L+ +D+KG
Sbjct: 425 FYLPEIFP-KLSKVLFLDDDIVVQKDLSGLWSVDLKG 460
>sp|Q640P4|GL8D2_MOUSE Glycosyltransferase 8 domain-containing protein 2 OS=Mus musculus
GN=Glt8d2 PE=2 SV=1
Length = 349
Score = 34.7 bits (78), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 21/60 (35%), Positives = 32/60 (53%), Gaps = 4/60 (6%)
Query: 1037 FLDVIFPLSL---EKVIFVDADQVVRADMGELYDMDIK-GRPLAYTPFCDNNKDMDGYRF 1092
F+ PL + EKVI++D D +V+ D+ ELYD + G A++ CD D +R
Sbjct: 137 FVRFYLPLLVHQHEKVIYLDDDVIVQGDIQELYDTTLALGHAAAFSDDCDLPSAQDIHRL 196
>sp|Q66PG4|LARG2_CHICK Glycosyltransferase-like protein LARGE2 OS=Gallus gallus GN=GYLTL1B
PE=2 SV=1
Length = 739
Score = 34.3 bits (77), Expect = 5.9, Method: Compositional matrix adjust.
Identities = 48/230 (20%), Positives = 94/230 (40%), Gaps = 38/230 (16%)
Query: 888 VKKKGKENEKLLVSSDEDSHSQAEGHWNSNFLKWASGF-----IGGSEQSKKEKAAVDHG 942
V++ +EN++L + + QAEG + +WA+ +GG + + A
Sbjct: 61 VRRVEEENQQLRMQLGQ---VQAEG--SDGNPQWAASAEDGPPLGGE---RNNRTACPEQ 112
Query: 943 KVERHGKTINIFSIASGHLYERFLKIMILSVLKNTCRPVKFWFIKNYLSPQFKDVIPHMA 1002
++ R + +++ + +GH R + ++ S+L + P+ F FI D + H
Sbjct: 113 RMVRKCELLHVAIVCAGHNASRDVVTLVKSILFHRKNPLHFHFI--------TDSVAHQI 164
Query: 1003 QEYGFEYELITYKWPTWLHKQKEKQRIIW--------AYKILFLDVI--FPLSLEKVIFV 1052
+ F+ ++ ++ + K + W Y ++ L + P +L KVI +
Sbjct: 165 LQTLFQSWMVPSIHVSFYNADDLKPEVSWIPNKHYSGIYGLMKLTLTKALPSNLSKVIVL 224
Query: 1053 DADQVVRADMGELYDMDIKGRPLAYTPFCDNNKDMDGYRFWRQG-FWKDH 1101
D D D+ EL+ + K +N D W G WK+H
Sbjct: 225 DTDITFATDIAELWAVFGKFSEKQVIGLVENQSD------WYLGNLWKNH 268
>sp|Q8VYF4|GATL7_ARATH Probable galacturonosyltransferase-like 7 OS=Arabidopsis thaliana
GN=GATL7 PE=2 SV=1
Length = 361
Score = 34.3 bits (77), Expect = 6.3, Method: Compositional matrix adjust.
Identities = 28/124 (22%), Positives = 54/124 (43%), Gaps = 12/124 (9%)
Query: 1045 SLEKVIFVDADQVVRADMGELYDMDIKGRPLAYTPFCDNNKDMDGYRFWRQGFWKDH--- 1101
+ +VI++D+D +V D+ +L+ + + + +C N +++ FW D
Sbjct: 178 CVRRVIYLDSDLIVVDDIAKLWMTKLGSKTIGAPEYCHANFT----KYFTPAFWSDERFS 233
Query: 1102 --LRGRP--YHISALYVVDLKRFRETAAGDNLRVFYETLSKDP-NSLANLDQDLPNYAQH 1156
GR Y + + V+DL+R+R + + + E D L +L L +A
Sbjct: 234 GAFSGRKPCYFNTGVMVMDLERWRRVGYTEVIEKWMEIQKSDRIYELGSLPPFLLVFAGE 293
Query: 1157 TVPI 1160
PI
Sbjct: 294 VAPI 297
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.318 0.134 0.392
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 456,438,790
Number of Sequences: 539616
Number of extensions: 19487509
Number of successful extensions: 49148
Number of sequences better than 100.0: 45
Number of HSP's better than 100.0 without gapping: 18
Number of HSP's successfully gapped in prelim test: 27
Number of HSP's that attempted gapping in prelim test: 49047
Number of HSP's gapped (non-prelim): 68
length of query: 1258
length of database: 191,569,459
effective HSP length: 129
effective length of query: 1129
effective length of database: 121,958,995
effective search space: 137691705355
effective search space used: 137691705355
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 67 (30.4 bits)