BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 000842
         (1258 letters)

Database: swissprot 
           539,616 sequences; 191,569,459 total letters

Searching..................................................done



>sp|Q0WL80|UGGG_ARATH UDP-glucose:glycoprotein glucosyltransferase OS=Arabidopsis thaliana
            GN=UGGT PE=1 SV=1
          Length = 1613

 Score = 1918 bits (4968), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 940/1266 (74%), Positives = 1078/1266 (85%), Gaps = 13/1266 (1%)

Query: 1    MQEISQNFPSVVSSLSRMKLNDSIKDEIVANQRYMPPGKSLMALNGALINIEDIDLYLLI 60
            MQEI+QNFPSVVSSLSRMKLN+SIKDEI++NQR +PPGK+L+ALNGAL+NIEDIDLY+L+
Sbjct: 353  MQEINQNFPSVVSSLSRMKLNESIKDEILSNQRMVPPGKALLALNGALLNIEDIDLYMLM 412

Query: 61   DLVHQELSLADQFSKLKIPRTITQKLLSTVPPAESSMFRVDFRSTHVQYLNNLEEDAMYK 120
            DL HQELSLA+ FSKLKIP    +KLL T P  E   +RVDFRS HV YLNNLEED MYK
Sbjct: 413  DLAHQELSLANHFSKLKIPDGAIRKLLLTTPLPEPDSYRVDFRSVHVTYLNNLEEDDMYK 472

Query: 121  RWRSNINEILMPVFPGQLRYIRKNLFHAVYVLDPATVCGLEVIDMIMSLYENHFPLRFGV 180
            RWRSNINEILMP FPGQLRYIRKNLFHAVYV+DPAT CGLE I+ + SLYEN  P+RFGV
Sbjct: 473  RWRSNINEILMPAFPGQLRYIRKNLFHAVYVIDPATACGLESIETLRSLYENQLPVRFGV 532

Query: 181  ILYSSKFIKSIEINGGELHSPVAEDDSPVNEDISSLIIRLFLFIKESHGTQTAFQFLSNV 240
            ILYS++ IK+IE NGG++ S  A  ++ V ED+S+++IRLFL+IKE HG QTAFQFL N+
Sbjct: 533  ILYSTQLIKTIENNGGQIPSSDAVTNAQVKEDLSTMVIRLFLYIKEHHGIQTAFQFLGNL 592

Query: 241  NRLRMESADSADDDALEIHHVEGAFVETILPKAKTPPQDMLLKLEKEKTFMDQSQESSMF 300
            N LR ESADS++ D +E  HV+GAFVETILPK KT PQD+LLKL +E T  + S+ SSMF
Sbjct: 593  NTLRTESADSSEAD-IEQEHVDGAFVETILPKVKTLPQDILLKLRQEHTLKEASEASSMF 651

Query: 301  VFKLGLTKLKCCLLMNGLVSES-SEEALLNAMNDELQRIQEQVYYGNINSYTDVLEKVLS 359
            VFKLGL KLKC  LMNGLV +S  EE LLNAMN+EL +IQEQVYYG I S+T VL+K+LS
Sbjct: 652  VFKLGLAKLKCSFLMNGLVFDSVEEETLLNAMNEELPKIQEQVYYGQIESHTKVLDKLLS 711

Query: 360  ESGINRYNPQIITDAKVKPKFISLASSFLGRETELKDINYLHSPETVDDVKPVTHLLAVD 419
            ESG++RYNPQII+  K KP+F+SLASS    E+ L D+NYLHSPET +DVK VTHLLA D
Sbjct: 712  ESGLSRYNPQIISGGKNKPRFVSLASSTRKGESMLNDVNYLHSPETSEDVKYVTHLLAAD 771

Query: 420  VTSKKGMKLLHEGIRFLIGGSNGARLGVLFSASREADLPSIIFVKAFEITASTYSHKKKV 479
            V +KKGMKLLHEG+R+LIGGS  ARLGVLFS+S+ AD  S++F+K FE TAS++SHK+KV
Sbjct: 772  VATKKGMKLLHEGVRYLIGGSKSARLGVLFSSSQNADPHSLLFIKFFEKTASSFSHKEKV 831

Query: 480  LEFLDQLCSFYERTYLLASSATADSTQAFIDKVCEFAEANGLSSKVYRASLPEYSKGKVR 539
            L FLD+LC FYER YLL +S  + S+Q FIDKV E A+  GLSSK YR+ L E    ++ 
Sbjct: 832  LYFLDKLCLFYEREYLLKTSVESASSQMFIDKVLELADEYGLSSKAYRSCLVESVDEELL 891

Query: 540  KQLNKVVQFLHRQLGVESGANAVITNGRVTFPIDESTFLSHDLSLLESVEFKHRIKHIWE 599
            K+L KV QFL  +LG+ES ANA+I+NGRV FP+DE TFL  DL LLES+EF  R+K + E
Sbjct: 892  KRLTKVAQFLSWELGLESDANAIISNGRVIFPVDERTFLGQDLHLLESMEFNQRVKPVQE 951

Query: 600  IIEEVNWQETYPDIDPDMLTSKFVSDIILFVTSSMAMRDRSSESARFEILSAEYSAVVFN 659
            IIE + WQ    D+DPD+LTSK+ SD+ +FV+S+MA RDRSSESARFE+L++EYSAV+  
Sbjct: 952  IIEGIEWQ----DVDPDLLTSKYFSDVFMFVSSAMATRDRSSESARFEVLNSEYSAVLLG 1007

Query: 660  SENSTIHIDAVIDPLSPTGQKLSSLLRVLQRYAQPSMRIVLNPMSSLVDIPLKNYYRYVV 719
            +EN+TIHIDAVIDPLSPTGQKL+SLL+VLQ++ Q SMRIVLNPMSSLVDIPLKNYYRYV+
Sbjct: 1008 NENATIHIDAVIDPLSPTGQKLASLLQVLQKHVQTSMRIVLNPMSSLVDIPLKNYYRYVL 1067

Query: 720  PTMDDFSNTDYSISGPKAFFANMPLSKTLTMNLDVPEPWLVEPVIAVHDLDNILLEKLGD 779
            P  DD+S+T + + GPKAFFANMPLSKTLTMNLDVPEPWLVEPVIA+HDLDNILLE LGD
Sbjct: 1068 PNTDDYSSTGFDVDGPKAFFANMPLSKTLTMNLDVPEPWLVEPVIAIHDLDNILLENLGD 1127

Query: 780  TRTLQAVFELEALVLTGHCSEKDHEPPQGLQLILGTKSTPHLVDTLVMANLGYWQMKVSP 839
            T TLQAVFE+E+LVLTGHC+EKDHE P+GLQLILGTK+ PHLVDTLVMANLGYWQMKVSP
Sbjct: 1128 TTTLQAVFEVESLVLTGHCAEKDHEAPRGLQLILGTKNRPHLVDTLVMANLGYWQMKVSP 1187

Query: 840  GVWYLQLAPGRSSELYVLKEDGNVNEDRSLSKRITINDLRGKVVHMEVVKKKGKENEKLL 899
            GVWYLQLAPGRSSELY LK   + ++D+S  KRITI+DLRGKVVH+EVVK+KGKE+EKLL
Sbjct: 1188 GVWYLQLAPGRSSELYALKGGNDGSQDQSSLKRITIDDLRGKVVHLEVVKRKGKEHEKLL 1247

Query: 900  VSSDEDSHSQ--AEGHWNSNFLKWASGFIGGSEQSKKEKAAVDHGKVERHGKTINIFSIA 957
            V SD D   Q   EG WNSNFLKWASGF+GG +QS K     +H K  R GKTINIFSIA
Sbjct: 1248 VPSDGDDAVQQNKEGSWNSNFLKWASGFVGGRQQSMKGGPDKEHEKGGRQGKTINIFSIA 1307

Query: 958  SGHLYERFLKIMILSVLKNTCRPVKFWFIKNYLSPQFKDVIPHMAQEYGFEYELITYKWP 1017
            SGHLYERFLKIMILSVLKNT RPVKFWFIKNYLSPQFKDVIPHMAQEY FEYELITYKWP
Sbjct: 1308 SGHLYERFLKIMILSVLKNTNRPVKFWFIKNYLSPQFKDVIPHMAQEYNFEYELITYKWP 1367

Query: 1018 TWLHKQKEKQRIIWAYKILFLDVIFPLSLEKVIFVDADQVVRADMGELYDMDIKGRPLAY 1077
            +WLHKQKEKQRIIWAYKILFLDVIFPLSLEKVIFVDADQ++R DMGELYDMDIKGRPLAY
Sbjct: 1368 SWLHKQKEKQRIIWAYKILFLDVIFPLSLEKVIFVDADQIIRTDMGELYDMDIKGRPLAY 1427

Query: 1078 TPFCDNNKDMDGYRFWRQGFWKDHLRGRPYHISALYVVDLKRFRETAAGDNLRVFYETLS 1137
            TPFCDNN++MDGY+FW+QGFWK+HLRGRPYHISALYVVDL +FRETAAGDNLRVFYETLS
Sbjct: 1428 TPFCDNNREMDGYKFWKQGFWKEHLRGRPYHISALYVVDLVKFRETAAGDNLRVFYETLS 1487

Query: 1138 KDPNSLANLDQDLPNYAQHTVPIFSLPQEWLWCESWCGNATKSKAKTIDLCNNPMTKEPK 1197
            KDPNSL+NLDQDLPNYAQHTVPIFSLPQEWLWCESWCGNATK+KA+TIDLCNNPMTKEPK
Sbjct: 1488 KDPNSLSNLDQDLPNYAQHTVPIFSLPQEWLWCESWCGNATKAKARTIDLCNNPMTKEPK 1547

Query: 1198 LQGARRIVSEWPDLDSEARQFTAKILGEEVVTLE---TPAPVGPMQTSGSDAS--SKGDL 1252
            LQGARRIV+EWPDLD EAR+FTAKILGE+V   E    PA   P     +D S  ++ DL
Sbjct: 1548 LQGARRIVTEWPDLDLEARKFTAKILGEDVELNEPVAAPATDKPNPLPSNDISEDTEQDL 1607

Query: 1253 ESKAEL 1258
            ESKAEL
Sbjct: 1608 ESKAEL 1613


>sp|Q9JLA3|UGGG1_RAT UDP-glucose:glycoprotein glucosyltransferase 1 OS=Rattus norvegicus
            GN=Uggt1 PE=1 SV=2
          Length = 1551

 Score =  738 bits (1905), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 461/1276 (36%), Positives = 701/1276 (54%), Gaps = 156/1276 (12%)

Query: 1    MQEISQNFPSVVSSLSRMKLNDSIKDEIVANQRY------MPPGKSLMALNGALINIEDI 54
            M++ISQNFP+   ++++  ++  ++ E+  NQ+Y      + PG S + +NG  I+++  
Sbjct: 346  MKDISQNFPTKARAITKTAVSAQLRAEVEENQKYFKGTIGLQPGDSALFINGLHIDLDTQ 405

Query: 55   DLYLLIDLVHQELSLADQFSKLKIPRTITQKLLS-TVPPAESSMFRVDFRSTHVQYLNNL 113
            D++ L D +  E  + +   +L I       +L   + P+E+  + VD RS  + ++NNL
Sbjct: 406  DIFSLFDTLRNEARVMEGLHRLGIEGLSLHNILKLNIQPSETD-YAVDIRSPAISWVNNL 464

Query: 114  EEDAMYKRWRSNINEILMPVFPGQLRYIRKNLFHAVYVLDPATVCGLEVIDMIMSLYENH 173
            E D+ Y  W S++ E+L P FPG +R IRKNL + V+++DP      E++ +      NH
Sbjct: 465  EVDSRYNSWPSSLQELLRPTFPGVIRQIRKNLHNMVFIVDPVHETTAELVSIAEMFLSNH 524

Query: 174  FPLRFGVILYSSKFIKSIEINGGELHSPVAEDDSPVNEDISSLIIRLFLFIKESHGTQTA 233
             PLR G I           +N  E        D    +D    ++R + ++ +      A
Sbjct: 525  IPLRIGFIFV---------VNDSE--------DVDGMQDAGVAVLRAYNYVGQEVDGYHA 567

Query: 234  FQFLSNV-NRLRMESADSADDDALEIHHVEGAFVETILPKAKTPPQDMLLKLEKEKTFMD 292
            FQ L+ + N++R         + +++ H     V ++L K K P  ++   L  +  +  
Sbjct: 568  FQTLTQIYNKVRT-------GEKVKVEH-----VVSVLEK-KYPYVEVNSILGIDSAYDQ 614

Query: 293  QSQESSMFVFKLGLTKLKCCLLMNGLVSESS-------EEALLNAMNDELQRIQEQVYYG 345
              +E+  +  + G+  L   +L NG+  E         E   ++ + +     Q  VY G
Sbjct: 615  NRKEARGYYEQTGVGPLP-VVLFNGMPFEKEQLDPDELETITMHKILETTTFFQRAVYLG 673

Query: 346  NINSYTDVLEKVLSESG-INRYNPQIITDAKVKPKFISLASS----------FLGRETEL 394
             ++   DV+E ++++   + R N +I+T    K +++ L +S          F   ++  
Sbjct: 674  ELSHDQDVVEYIMNQPNVVPRINSRILT---AKREYLDLTASNNFYVDDFARFSALDSRG 730

Query: 395  K------DINYL-----HSPETVDD--VKPVTHLLAVDVTSKKGMKLLHEGIRFLIGGSN 441
            K       +NYL      S E  DD  ++PVT  +  D  S  G +LL++ I+     SN
Sbjct: 731  KTAAIANSMNYLTKKGMSSKEIYDDSFIRPVTFWIVGDFDSPSGRQLLYDAIKHQ-KTSN 789

Query: 442  GARLGVLFSASREADLPSIIFVKAFEITASTYSHKKKVLEFLDQLCSFYERTYLLASSAT 501
              R+ ++ + SRE    S    +A                            +    + T
Sbjct: 790  NVRISMINNPSREISDSSTPVSRAI---------------------------WAALQTQT 822

Query: 502  ADSTQAFIDKVCEFAEANGLSSKVYRASLPEYSKGKVRKQLNKVV-------------QF 548
            ++S + FI K+ +   A  L++ V    + E+S G +   L K V              +
Sbjct: 823  SNSAKNFITKMVKEETAEALAAGV---DIGEFSVGGMDVSLFKEVFESSRMDFILSHALY 879

Query: 549  LHRQLGVESGANAVITNGRVTFPIDESTFLSH-DLSLLESVEFKHRIKHIWEIIEEVNWQ 607
                L ++ G   VI+NGR+  P+++S   +  D  LLE++  K   + I   I+++  +
Sbjct: 880  CRDVLKLKKGQRVVISNGRIIGPLEDSELFNQDDFHLLENIILKTSGQKIKSHIQQLRVE 939

Query: 608  ETYPDIDPDMLTSKFVSDIILFVTSSMAMRDRSSESARFEILSAEYSAVVFNSENSTIHI 667
            E               SD+++ V + ++ + +      ++    ++SA+    +    + 
Sbjct: 940  E------------DVASDLVMKVDALLSAQPKGEARIEYQFFEDKHSAIKLKPKEGETYY 987

Query: 668  D--AVIDPLSPTGQKLSSLLRVLQRYAQPSMRIVLNPMSSLVDIPLKNYYRYVVPTMDDF 725
            D  AV+DP++   Q+L+ LL VL +    S+R+ +N  S L D+PLK++YRYV+     F
Sbjct: 988  DVVAVVDPVTREAQRLAPLLLVLAQLINMSLRVFMNCQSKLSDMPLKSFYRYVLEPEISF 1047

Query: 726  SNTDYSISGPKAFFANMPLSKTLTMNLDVPEPWLVEPVIAVHDLDNILLEKLGDTRTLQA 785
            +  +    GP A F +MP S   T+NL+ PE W+VE V   +DLDNI LE++     + A
Sbjct: 1048 TADNSFAKGPIAKFLDMPQSPLFTLNLNTPESWMVESVRTPYDLDNIYLEEVDS--IVAA 1105

Query: 786  VFELEALVLTGHCSE-KDHEPPQGLQLILGTKSTPHLVDTLVMANLGYWQMKVSPGVWYL 844
             +ELE L+L GHC +    +PP+GLQ  LGT + P  VDT+VMANLGY+Q+K +PG W L
Sbjct: 1106 EYELEYLLLEGHCYDITTGQPPRGLQFTLGTSANPTTVDTIVMANLGYFQLKANPGAWIL 1165

Query: 845  QLAPGRSSELY-VLKEDGNVNEDRSLSKRITINDLRGKVVHMEVVKKKGKENEKLLVSSD 903
            +L  GRS ++Y +   DG  +   +    + +N+ + K++ ++V KK    NE LL    
Sbjct: 1166 RLRKGRSDDIYRIYSHDGTDSPPDANDVVVILNNFKSKIIKVKVQKKADMANEDLLSDGT 1225

Query: 904  EDSHSQAEGHWNSNFLKWASGFIG-GSEQSKKEKAAVDHGKVERHGKTINIFSIASGHLY 962
             ++ S   G W+S   KW  GF G  +E+ K++K  +           INIFS+ASGHLY
Sbjct: 1226 NENES---GFWDS--FKW--GFSGQKTEEVKQDKDDI-----------INIFSVASGHLY 1267

Query: 963  ERFLKIMILSVLKNTCRPVKFWFIKNYLSPQFKDVIPHMAQEYGFEYELITYKWPTWLHK 1022
            ERFL+IM+LSVLKNT  PVKFWF+KNYLSP FK+ IP+MA++Y F+YEL+ YKWP WLH+
Sbjct: 1268 ERFLRIMMLSVLKNTKTPVKFWFLKNYLSPTFKEFIPYMAKKYNFQYELVQYKWPRWLHQ 1327

Query: 1023 QKEKQRIIWAYKILFLDVIFPLSLEKVIFVDADQVVRADMGELYDMDIKGRPLAYTPFCD 1082
            Q EKQRIIW YKILFLDV+FPL ++K +FVDADQ+VR D+ EL D ++ G P  YTPFCD
Sbjct: 1328 QTEKQRIIWGYKILFLDVLFPLVVDKFLFVDADQIVRTDLKELRDFNLDGAPYGYTPFCD 1387

Query: 1083 NNKDMDGYRFWRQGFWKDHLRGRPYHISALYVVDLKRFRETAAGDNLRVFYETLSKDPNS 1142
            + ++MDGYRFW+ G+W  HL GR YHISALYVVDLK+FR+ AAGD LR  Y+ LS+DPNS
Sbjct: 1388 SRREMDGYRFWKSGYWASHLAGRKYHISALYVVDLKKFRKIAAGDRLRGQYQGLSQDPNS 1447

Query: 1143 LANLDQDLPNYAQHTVPIFSLPQEWLWCESWCGNATKSKAKTIDLCNNPMTKEPKLQGAR 1202
            L+NLDQDLPN   H VPI SLPQEWLWCE+WC +A+K +AKTIDLCNNPMTKEPKL+ A 
Sbjct: 1448 LSNLDQDLPNNMIHQVPIKSLPQEWLWCETWCDDASKKRAKTIDLCNNPMTKEPKLEAAV 1507

Query: 1203 RIVSEWPDLDSEARQF 1218
            RIV EW D D E +Q 
Sbjct: 1508 RIVPEWQDYDQEIKQL 1523


>sp|Q6P5E4|UGGG1_MOUSE UDP-glucose:glycoprotein glucosyltransferase 1 OS=Mus musculus
            GN=Uggt1 PE=1 SV=4
          Length = 1551

 Score =  738 bits (1905), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 465/1279 (36%), Positives = 712/1279 (55%), Gaps = 133/1279 (10%)

Query: 1    MQEISQNFPSVVSSLSRMKLNDSIKDEIVANQRY------MPPGKSLMALNGALINIEDI 54
            M++ISQNFP+   ++++  ++  ++ E+  NQ+Y      + PG S + +NG  I+++  
Sbjct: 346  MKDISQNFPTKARAITKTAVSAQLRAEVEENQKYFKGTIGLQPGDSALFINGLHIDLDTQ 405

Query: 55   DLYLLIDLVHQELSLADQFSKLKIPRTITQKLLS-TVPPAESSMFRVDFRSTHVQYLNNL 113
            D++ L D +  E  + +   +L I       +L   + P+E+  + VD RS  + ++NNL
Sbjct: 406  DIFSLFDTLRNEARVMEGLHRLGIEGLSLHNILKLNIQPSETD-YAVDIRSPAISWVNNL 464

Query: 114  EEDAMYKRWRSNINEILMPVFPGQLRYIRKNLFHAVYVLDPATVCGLEVIDMIMSLYENH 173
            E D+ Y  W S++ E+L P FPG +R IRKNL + V+++DP      E+I +      NH
Sbjct: 465  EVDSRYNSWPSSLQELLRPTFPGVIRQIRKNLHNMVFIIDPVHETTAELISIAEMFLSNH 524

Query: 174  FPLRFGVILYSSKFIKSIEINGGELHSPVAEDDSPVNEDISSLIIRLFLFIKESHGTQTA 233
             PLR G I           +N  E        D    +D    ++R + ++ +      A
Sbjct: 525  IPLRIGFIFV---------VNDSE--------DVDGMQDAGVAVLRAYNYVAQEVDGYHA 567

Query: 234  FQFLSNV-NRLRMESADSADDDALEIHHVEGAFVETILPKAKTPPQDMLLKLEKEKTFMD 292
            FQ L+ + N++R         + +++ HV      ++L K K P  ++   L  +  +  
Sbjct: 568  FQTLTQIYNKVRT-------GETVKVEHVV-----SVLEK-KYPYVEVNSILGIDSAYDQ 614

Query: 293  QSQESSMFVFKLGLTKLKCCLLMNGLVSESS-------EEALLNAMNDELQRIQEQVYYG 345
              +E+  +  + G+  L   +L NG+  E         E   ++ + +     Q  VY G
Sbjct: 615  NRKEARGYYEQTGVGPLPV-VLFNGMPFEKEQLDPDELETITMHKILETTTFFQRAVYLG 673

Query: 346  NINSYTDVLEKVLSESGI-NRYNPQIITDAKVKPKFISLASS----------FLGRETEL 394
             ++   DV+E ++++  +  R N +I+T    K +++ L +S          F   ++  
Sbjct: 674  ELSHDQDVVEYIMNQPNVVPRINSRILT---AKREYLDLTASNNFYVDDFARFSALDSRG 730

Query: 395  K------DINYL-----HSPETVDD--VKPVTHLLAVDVTSKKGMKLLHEGIRFLIGGSN 441
            K       +NYL      S E  DD  ++PVT  +  D  S  G +LL++ I+     SN
Sbjct: 731  KTAAIANSMNYLTKKGMSSKEIYDDSFIRPVTFWIVGDFDSPSGRQLLYDAIKHQ-KTSN 789

Query: 442  GARLGVLFSASRE-ADLPSIIFVKAFEITASTYSHKKKVLEFLDQLCSFYERTYLLASSA 500
              R+ ++ + S+E +D  + IF   +    +  S   K   F+ ++          A   
Sbjct: 790  NVRISMINNPSQEISDSSTPIFRAIWAALQTQASSSAK--NFITKM----------AKEE 837

Query: 501  TADSTQAFIDKVCEFAEANGLSSKVYRASLPEYSKGKVRKQLNKVVQFLHRQLGVESGAN 560
            TA++  A +D + EF+   G+   +++     +   ++   L+  + +    L ++ G  
Sbjct: 838  TAEALAAGVD-IAEFS-VGGMDVSLFKEV---FESSRMDFILSHAL-YCRDVLKLKKGQR 891

Query: 561  AVITNGRVTFPI-DESTFLSHDLSLLESVEFKHRIKHIWEIIEEVNWQETYPDIDPDMLT 619
             VI+NGR+  P+ D   F   D  LLE++  K   + I   I+++  +E           
Sbjct: 892  VVISNGRIIGPLEDNELFNQDDFHLLENIILKTSGQKIKSHIQQLRVEE----------- 940

Query: 620  SKFVSDIILFVTSSMAMRDRSSESARFEILSAEYSAVVFNSENSTIHID--AVIDPLSPT 677
                SD+++ V + ++ + +      ++    ++SA+    +    + D  AV+DP++  
Sbjct: 941  -DVASDLVMKVDALLSAQPKGEARIEYQFFEDKHSAIKLKPKEGETYYDVVAVVDPVTRE 999

Query: 678  GQKLSSLLRVLQRYAQPSMRIVLNPMSSLVDIPLKNYYRYVVPTMDDFSNTDYSISGPKA 737
             Q+L+ LL VL +    ++R+ +N  S L D+PLK++YRYV+     F+       GP A
Sbjct: 1000 AQRLAPLLLVLTQLINMNLRVFMNCQSKLSDMPLKSFYRYVLEPEISFTADSSFAKGPIA 1059

Query: 738  FFANMPLSKTLTMNLDVPEPWLVEPVIAVHDLDNILLEKLGDTRTLQAVFELEALVLTGH 797
             F +MP S   T+NL+ PE W+VE V   +DLDNI LE++     + A +ELE L+L GH
Sbjct: 1060 KFLDMPQSPLFTLNLNTPESWMVESVRTPYDLDNIYLEEVDSI--VAAEYELEYLLLEGH 1117

Query: 798  CSE-KDHEPPQGLQLILGTKSTPHLVDTLVMANLGYWQMKVSPGVWYLQLAPGRSSELY- 855
            C +    +PP+GLQ  LGT + P +VDT+VMANLGY+Q+K +PG W L+L  GRS ++Y 
Sbjct: 1118 CYDITTGQPPRGLQFTLGTSANPTIVDTIVMANLGYFQLKANPGAWILRLRKGRSDDIYR 1177

Query: 856  VLKEDGNVNEDRSLSKRITINDLRGKVVHMEVVKKKGKENEKLLVSSDEDSHSQAEGHWN 915
            +   DG  +   +    + +N+ + K++ ++V KK    NE LL     ++ S   G W+
Sbjct: 1178 IYSHDGTDSPPDANDVVVILNNFKSKIIKVKVQKKADMANEDLLSDGTNENES---GFWD 1234

Query: 916  SNFLKWASGFIG-GSEQSKKEKAAVDHGKVERHGKTINIFSIASGHLYERFLKIMILSVL 974
            S   KW  GF G  +E+ K++K  +           INIFS+ASGHLYERFL+IM+LSVL
Sbjct: 1235 S--FKW--GFSGQKAEEVKQDKDDI-----------INIFSVASGHLYERFLRIMMLSVL 1279

Query: 975  KNTCRPVKFWFIKNYLSPQFKDVIPHMAQEYGFEYELITYKWPTWLHKQKEKQRIIWAYK 1034
            KNT  PVKFWF+KNYLSP FK+ IP+MA++Y F+YEL+ YKWP WLH+Q EKQRIIW YK
Sbjct: 1280 KNTKTPVKFWFLKNYLSPTFKEFIPYMAKKYNFQYELVQYKWPRWLHQQTEKQRIIWGYK 1339

Query: 1035 ILFLDVIFPLSLEKVIFVDADQVVRADMGELYDMDIKGRPLAYTPFCDNNKDMDGYRFWR 1094
            ILFLDV+FPL ++K +FVDADQ+VR D+ EL D ++ G P  YTPFCD+ ++MDGYRFW+
Sbjct: 1340 ILFLDVLFPLVVDKFLFVDADQIVRTDLKELRDFNLDGAPYGYTPFCDSRREMDGYRFWK 1399

Query: 1095 QGFWKDHLRGRPYHISALYVVDLKRFRETAAGDNLRVFYETLSKDPNSLANLDQDLPNYA 1154
             G+W  HL GR YHISALYVVDLK+FR+ AAGD LR  Y+ LS+DPNSL+NLDQDLPN  
Sbjct: 1400 SGYWASHLAGRKYHISALYVVDLKKFRKIAAGDRLRGQYQGLSQDPNSLSNLDQDLPNNM 1459

Query: 1155 QHTVPIFSLPQEWLWCESWCGNATKSKAKTIDLCNNPMTKEPKLQGARRIVSEWPDLDSE 1214
             H VPI SLPQEWLWCE+WC +A+K +AKTIDLCNNPMTKEPKL+ A RIV EW D D E
Sbjct: 1460 IHQVPIKSLPQEWLWCETWCDDASKKRAKTIDLCNNPMTKEPKLEAAVRIVPEWQDYDQE 1519

Query: 1215 ARQFTAKILGE-EVVTLET 1232
             +Q       E E+ TL T
Sbjct: 1520 IKQLQTLFQEEKELGTLHT 1538


>sp|Q9NYU2|UGGG1_HUMAN UDP-glucose:glycoprotein glucosyltransferase 1 OS=Homo sapiens
            GN=UGGT1 PE=1 SV=3
          Length = 1555

 Score =  729 bits (1883), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 455/1270 (35%), Positives = 706/1270 (55%), Gaps = 128/1270 (10%)

Query: 1    MQEISQNFPSVVSSLSRMKLNDSIKDEIVANQRY------MPPGKSLMALNGALINIEDI 54
            M+++SQNFP+   ++++  ++  ++ E+  NQ+Y      + PG S + +NG  ++++  
Sbjct: 346  MKDLSQNFPTKARAITKTAVSSELRTEVEENQKYFKGTLGLQPGDSALFINGLHMDLDTQ 405

Query: 55   DLYLLIDLVHQELSLADQFSKLKIPRTITQKLLS-TVPPAESSMFRVDFRSTHVQYLNNL 113
            D++ L D++  E  + +   +L I       +L   + P+E+  + VD RS  + ++NNL
Sbjct: 406  DIFSLFDVLRNEARVMEGLHRLGIEGLSLHNVLKLNIQPSEAD-YAVDIRSPAISWVNNL 464

Query: 114  EEDAMYKRWRSNINEILMPVFPGQLRYIRKNLFHAVYVLDPATVCGLEVIDMIMSLYENH 173
            E D+ Y  W S++ E+L P FPG +R IRKNL + V+++DPA     E+++       NH
Sbjct: 465  EVDSRYNSWPSSLQELLRPTFPGVIRQIRKNLHNMVFIVDPAHETTAELMNTAEMFLSNH 524

Query: 174  FPLRFGVILYSSKFIKSIEINGGELHSPVAEDDSPVN--EDISSLIIRLFLFIKESHGTQ 231
             PLR G I                    V  D   V+  +D    ++R + ++ +     
Sbjct: 525  IPLRIGFIF-------------------VVNDSEDVDGMQDAGVAVLRAYNYVAQEVDDY 565

Query: 232  TAFQFLSNV-NRLRMESADSADDDALEIHHVEGAFVETILPKAKTPPQDMLLKLEKEKTF 290
             AFQ L+++ N++R         + +++ HV      ++L K K P  ++   L  +  +
Sbjct: 566  HAFQTLTHIYNKVRT-------GEKVKVEHVV-----SVLEK-KYPYVEVNSILGIDSAY 612

Query: 291  MDQSQESSMFVFKLGLTKLKCCLLMNGLVSESS-------EEALLNAMNDELQRIQEQVY 343
                +E+  +  + G+  L   +L NG+  E         E   ++ + +     Q  VY
Sbjct: 613  DRNRKEARGYYEQTGVGPLPV-VLFNGMPFEREQLDPDELETITMHKILETTTFFQRAVY 671

Query: 344  YGNINSYTDVLEKVLSESGI-NRYNPQIITDAKVKPKFISLASSFL------------GR 390
             G +    DV+E ++++  +  R N +I+T  +      +  + F+            G+
Sbjct: 672  LGELPHDQDVVEYIMNQPNVVPRINSRILTAERDYLDLTASNNFFVDDYARFTILDSQGK 731

Query: 391  ETELKD-INYL-----HSPETVDD--VKPVTHLLAVDVTSKKGMKLLHEGIRFLIGGSNG 442
               + + +NYL      S E  DD  ++PVT  +  D  S  G +LL++ I+     SN 
Sbjct: 732  TAAVANSMNYLTKKGMSSKEIYDDSFIRPVTFWIVGDFDSPSGRQLLYDAIKHQ-KSSNN 790

Query: 443  ARLGVLFSASREADLPSIIFVKAFEITASTYSHKKKVLEFLDQLCSFYERTYLLASSATA 502
             R+ ++ + ++E    +    +A      T +       F+ ++           +   A
Sbjct: 791  VRISMINNPAKEISYENTQISRAIWAALQTQT-SNAAKNFITKM-----------AKEGA 838

Query: 503  DSTQAFIDKVCEFAEANGLSSKVYRASLPEYSKGKVRKQLNKVVQFLHRQLGVESGANAV 562
                A    + EF+   G+   +++     +   K+   L+  V +    L ++ G  AV
Sbjct: 839  AEALAAGADIAEFS-VGGMDFSLFKEV---FESSKMDFILSHAV-YCRDVLKLKKGQRAV 893

Query: 563  ITNGRVTFPIDESTFLSHD-LSLLESVEFKHRIKHIWEIIEEVNWQETYPDIDPDMLTSK 621
            I+NGR+  P+++S   + D   LLE++  K   + I   I+++  +E             
Sbjct: 894  ISNGRIIGPLEDSELFNQDDFHLLENIILKTSGQKIKSHIQQLRVEE------------D 941

Query: 622  FVSDIILFVTSSMAMRDRSSESARFEILSAEYSAVVFNSENSTIHID--AVIDPLSPTGQ 679
              SD+++ V + ++ + +      ++     +SA+    +    + D  AV+DP++   Q
Sbjct: 942  VASDLVMKVDALLSAQPKGDPRIEYQFFEDRHSAIKLRPKEGETYFDVVAVVDPVTREAQ 1001

Query: 680  KLSSLLRVLQRYAQPSMRIVLNPMSSLVDIPLKNYYRYVVPTMDDFSNTDYSISGPKAFF 739
            +L+ LL VL +    ++R+ +N  S L D+PLK++YRYV+     F++ +    GP A F
Sbjct: 1002 RLAPLLLVLAQLINMNLRVFMNCQSKLSDMPLKSFYRYVLEPEISFTSDNSFAKGPIAKF 1061

Query: 740  ANMPLSKTLTMNLDVPEPWLVEPVIAVHDLDNILLEKLGDTRTLQAVFELEALVLTGHCS 799
             +MP S   T+NL+ PE W+VE V   +DLDNI LE++     + A +ELE L+L GHC 
Sbjct: 1062 LDMPQSPLFTLNLNTPESWMVESVRTPYDLDNIYLEEVDSV--VAAEYELEYLLLEGHCY 1119

Query: 800  E-KDHEPPQGLQLILGTKSTPHLVDTLVMANLGYWQMKVSPGVWYLQLAPGRSSELY-VL 857
            +    +PP+GLQ  LGT + P +VDT+VMANLGY+Q+K +PG W L+L  GRS ++Y + 
Sbjct: 1120 DITTGQPPRGLQFTLGTSANPVIVDTIVMANLGYFQLKANPGAWILRLRKGRSEDIYRIY 1179

Query: 858  KEDGNVNEDRSLSKRITINDLRGKVVHMEVVKKKGKENEKLLVSSDEDSHSQAEGHWNSN 917
              DG  +   +    I +N+ + K++ ++V KK    NE LL  SD  S +++ G W+S 
Sbjct: 1180 SHDGTDSPPDADEVVIVLNNFKSKIIKVKVQKKADMVNEDLL--SDGTSENES-GFWDS- 1235

Query: 918  FLKWASGFIG-GSEQSKKEKAAVDHGKVERHGKTINIFSIASGHLYERFLKIMILSVLKN 976
              KW  GF G  +E+ K++K  +           INIFS+ASGHLYERFL+IM+LSVLKN
Sbjct: 1236 -FKW--GFTGQKTEEVKQDKDDI-----------INIFSVASGHLYERFLRIMMLSVLKN 1281

Query: 977  TCRPVKFWFIKNYLSPQFKDVIPHMAQEYGFEYELITYKWPTWLHKQKEKQRIIWAYKIL 1036
            T  PVKFWF+KNYLSP FK+ IP+MA EY F+YEL+ YKWP WLH+Q EKQRIIW YKIL
Sbjct: 1282 TKTPVKFWFLKNYLSPTFKEFIPYMANEYNFQYELVQYKWPRWLHQQTEKQRIIWGYKIL 1341

Query: 1037 FLDVIFPLSLEKVIFVDADQVVRADMGELYDMDIKGRPLAYTPFCDNNKDMDGYRFWRQG 1096
            FLDV+FPL ++K +FVDADQ+VR D+ EL D ++ G P  YTPFCD+ ++MDGYRFW+ G
Sbjct: 1342 FLDVLFPLVVDKFLFVDADQIVRTDLKELRDFNLDGAPYGYTPFCDSRREMDGYRFWKSG 1401

Query: 1097 FWKDHLRGRPYHISALYVVDLKRFRETAAGDNLRVFYETLSKDPNSLANLDQDLPNYAQH 1156
            +W  HL GR YHISALYVVDLK+FR+ AAGD LR  Y+ LS+DPNSL+NLDQDLPN   H
Sbjct: 1402 YWASHLAGRKYHISALYVVDLKKFRKIAAGDRLRGQYQGLSQDPNSLSNLDQDLPNNMIH 1461

Query: 1157 TVPIFSLPQEWLWCESWCGNATKSKAKTIDLCNNPMTKEPKLQGARRIVSEWPDLDSEAR 1216
             VPI SLPQEWLWCE+WC +A+K +AKTIDLCNNPMTKEPKL+ A RIV EW D D E +
Sbjct: 1462 QVPIKSLPQEWLWCETWCDDASKKRAKTIDLCNNPMTKEPKLEAAVRIVPEWQDYDQEIK 1521

Query: 1217 QFTAKILGEE 1226
            Q   +   E+
Sbjct: 1522 QLQIRFQKEK 1531


>sp|Q09332|UGGG_DROME UDP-glucose:glycoprotein glucosyltransferase OS=Drosophila
            melanogaster GN=Ugt PE=1 SV=2
          Length = 1548

 Score =  703 bits (1814), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 455/1265 (35%), Positives = 679/1265 (53%), Gaps = 122/1265 (9%)

Query: 1    MQEISQNFPSVVSSLSRMKLNDSIKDEIVANQRYM-------PPGKSLMALNGALINIED 53
            +Q  + NFP +  +L   K+ D ++ E+  N           PP  +L  +NG   + + 
Sbjct: 334  LQYTAHNFPMLARTLLAHKVTDGLRAEVKHNTEAFGRSLNVAPPDGALF-INGLFFDADT 392

Query: 54   IDLYLLIDLVHQELSLADQFSKLKIPRTITQKLLSTVPPAESSM-FRVDFRSTHVQYLNN 112
            +DLY LI+ +  E+ + +      +  ++   LL+    A S   F +D R T VQ++N+
Sbjct: 393  MDLYSLIETLRSEMRVLESLHSNNVRGSLASSLLALDLTASSKKEFAIDIRDTAVQWVND 452

Query: 113  LEEDAMYKRWRSNINEILMPVFPGQLRYIRKNLFHAVYVLDPATVCGLEVIDMIMSLYEN 172
            +E D  Y+RW S++ ++L P FPG LR IRKN+F+ V V+D        VI +  S   +
Sbjct: 453  IENDVQYRRWPSSVMDLLRPTFPGMLRNIRKNVFNLVLVVDALQPTARSVIKLSESFVIH 512

Query: 173  HFPLRFGVILYSSKFIKSIEINGGELHSPVAEDDSPVNEDISSLIIRLFLFIKESHGTQT 232
              P+R G++        + + N   L   VA             I   + ++ +    + 
Sbjct: 513  QAPIRLGLVF------DARDANEDNLADYVA-------------ITCAYNYVSQKKDARA 553

Query: 233  AFQFLSNVNRLRMESADSADDDALEIHHVEGAFVETILPKAKTPPQDMLLKLEKEKTFMD 292
            A  FL+++     E+      D   +  +   F      KA+         LE++ T+  
Sbjct: 554  ALSFLTDIYAAVGETKVVTKKDI--VKQLTKEFTSLSFAKAEE-------FLEEDSTYDY 604

Query: 293  QSQESSMFVFKLGLT-KLKCCLLMNGLVSESS--------EEALLNAMNDELQRIQEQVY 343
              + ++ F+ +LG   K +   L+NG+   S+        EEA+   +      +Q+ VY
Sbjct: 605  GRELAAEFIQRLGFGDKGQPQALLNGVPMPSNVVTADSDFEEAIFTEIMTHTSNLQKAVY 664

Query: 344  YGNINSYTDVLEKVLSESGI-NRYNPQIITDAKVKPKFISLASSFLGRETELKDINYLHS 402
             G +      ++ ++++  +  R N +I++   VK         +L       DIN + +
Sbjct: 665  KGELTDNDVAIDYLMNQPHVMPRLNQRILSQEDVK---------YL-------DINGV-A 707

Query: 403  PETVDDVKPVTHLLAVDVTSKKGMKLLHEGIRF---LIGGSNGARLGVLFSASREADLPS 459
             + + +V  +  L   D+T+     L + G +    LIG ++   L +   A  E D   
Sbjct: 708  YKNLGNVGVLNRLSNRDMTATLMDNLKYFGGKKSTELIGRASLQFLTIWVFADLETDQGR 767

Query: 460  IIFVKAFEITASTYSHKKKVLEFLDQLCSFYERTY---LLASSATADSTQAFIDKVCEFA 516
             +   A +   S  S +   +   +   +   R     + A+  +   TQA  ++V ++ 
Sbjct: 768  DLLTHALDYVQSGESVRVAFIPNTESSSASSRRNLNRLVWAAMQSLPPTQA-TEQVLKWL 826

Query: 517  EANGLSSKVYRASLPEYSKGKVRKQLNKVVQFLHRQLGVESGANAVITNGRVTFPI--DE 574
            +      K+   +  E   G     L  +  +  R LG+      VI NGR+  P+  DE
Sbjct: 827  KKP--KEKIEIPTQLEDILGSTELHLKMLRVYSQRVLGLNKSQRLVIGNGRLYGPLSSDE 884

Query: 575  STFLSHDLSLL---ESVEFKHRIKHIWEIIEEVNWQETYPDIDPDMLTSKFVSDIILFVT 631
            S F S D +LL    S+++  +++ +         +E+  D++      +F SD +L + 
Sbjct: 885  S-FDSADFALLARFSSLQYSDKVRQVL--------KESAQDVN-----EEFNSDTLLKLY 930

Query: 632  SSMAMRDRSSESARFEI---LSAEYSAVVFNSENSTI-HID--AVIDPLSPTGQKLSSLL 685
            +S+  R   +   RF++   L  ++S V    +   + H D  AV+DP S   QKL+ +L
Sbjct: 931  ASLLPRQTKT---RFKLPTDLKTDHSVVKLPPKQENLPHFDVAAVLDPASRAAQKLTPIL 987

Query: 686  RVLQRYAQPSMRIVLNPMSSLVDIPLKNYYRYVVPTMDDFSNTDYSISGPKAFFANMPLS 745
             +L++     + + L P+    D+P+KN+YRYVV     F        GP A F+ +P +
Sbjct: 988  ILLRQVLNCQLNLYLIPVPQHSDMPVKNFYRYVVEPEVQFEANGGRSDGPLAKFSGLPAN 1047

Query: 746  KTLTMNLDVPEPWLVEPVIAVHDLDNILLEKLGDTRTLQAVFELEALVLTGHCSEK-DHE 804
              LT  L VPE WLVE V AV+DLDNI L  +G    + + F+LE L+L GHC +     
Sbjct: 1048 PLLTQQLQVPENWLVEAVRAVYDLDNIKLTDIGGP--VHSEFDLEYLLLEGHCFDAASGA 1105

Query: 805  PPQGLQLILGTKSTPHLVDTLVMANLGYWQMKVSPGVWYLQLAPGRSSELYVLKEDGNVN 864
            PP+GLQL+LGT+S P LVDT+VMANLGY+Q+K +PG W L+L  G+S+++Y +      N
Sbjct: 1106 PPRGLQLVLGTQSQPTLVDTIVMANLGYFQLKANPGAWSLRLREGKSADIYAISHIEGTN 1165

Query: 865  EDRSLSK---RITINDLRGKVVHMEVVKKKGKENEKLLVSSDEDSHSQAEGHWNSNFLKW 921
               S      ++ I  LR  VV + V KK G +  +LL  SD++  +   G WNS     
Sbjct: 1166 THHSAGSSEVQVLITSLRSHVVKLRVSKKPGMQQAELL--SDDNEQAAQSGMWNS----I 1219

Query: 922  ASGFIGGSEQSKKEKAAVDHGKVERHGKTINIFSIASGHLYERFLKIMILSVLKNTCRPV 981
            AS F GGS      +AA D        +TINIFS+ASGHLYER L+IM++S+LK+T  PV
Sbjct: 1220 ASSFGGGS----ANQAATDEDT-----ETINIFSVASGHLYERLLRIMMVSLLKHTKSPV 1270

Query: 982  KFWFIKNYLSPQFKDVIPHMAQEYGFEYELITYKWPTWLHKQKEKQRIIWAYKILFLDVI 1041
            KFWF+KNYLSPQF D +PHMA EY F+YEL+ YKWP WLH+Q EKQR IW YKILFLDV+
Sbjct: 1271 KFWFLKNYLSPQFTDFLPHMASEYNFQYELVQYKWPRWLHQQTEKQRTIWGYKILFLDVL 1330

Query: 1042 FPLSLEKVIFVDADQVVRADMGELYDMDIKGRPLAYTPFCDNNKDMDGYRFWRQGFWKDH 1101
            FPL++ K+IFVDAD +VR D+ ELYDMD+ G P AYTPFCD+ K+M+G+RFW+QG+W+ H
Sbjct: 1331 FPLNVRKIIFVDADAIVRTDIKELYDMDLGGAPYAYTPFCDSRKEMEGFRFWKQGYWRSH 1390

Query: 1102 LRGRPYHISALYVVDLKRFRETAAGDNLRVFYETLSKDPNSLANLDQDLPNYAQHTVPIF 1161
            L GR YHISALYVVDLKRFR+ AAGD LR  Y+ LS+DPNSL+NLDQDLPN   H V I 
Sbjct: 1391 LMGRRYHISALYVVDLKRFRKIAAGDRLRGQYQALSQDPNSLSNLDQDLPNNMIHQVAIK 1450

Query: 1162 SLPQEWLWCESWCGNATKSKAKTIDLCNNPMTKEPKLQGARRIVSEWPDLDSEARQFTAK 1221
            SLP +WLWC++WC ++    AK IDLCNNP TKE KL  A+RIV EW D D+E +   ++
Sbjct: 1451 SLPDDWLWCQTWCSDSNFKTAKVIDLCNNPQTKEAKLTAAQRIVPEWKDYDAELKTLMSR 1510

Query: 1222 ILGEE 1226
            I   E
Sbjct: 1511 IEDHE 1515


>sp|Q9NYU1|UGGG2_HUMAN UDP-glucose:glycoprotein glucosyltransferase 2 OS=Homo sapiens
            GN=UGGT2 PE=1 SV=4
          Length = 1516

 Score =  672 bits (1734), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 430/1259 (34%), Positives = 670/1259 (53%), Gaps = 133/1259 (10%)

Query: 1    MQEISQNFPSVVSSLSRMKLNDSIKDEIVANQR------YMPPGKSLMALNGALINIEDI 54
            M++ISQNFP    SL+R+ +N  +++EI  NQ+       + PG + + +NG  ++++  
Sbjct: 332  MKDISQNFPIKARSLTRIAVNQHMREEIKENQKDLQVRFKIQPGDARLFINGLRVDMDVY 391

Query: 55   DLYLLIDLVHQELSLADQFSKLKIPRTITQKLLSTVPPAESSMFRVDFRSTHVQYLNNLE 114
            D + ++D++  E  + +    L I      K L          + +D R + + ++N+LE
Sbjct: 392  DAFSILDMLKLEGKMMNGLRNLGINGEDMSKFLKLNSHIWEYTYVLDIRHSSIMWINDLE 451

Query: 115  EDAMYKRWRSNINEILMPVFPGQLRYIRKNLFHAVYVLDPATVCGLEVIDMIMSLYENHF 174
             D +Y  W ++  ++L PVFPG +  IR+N  + V  +DPA    L+ I +    Y +  
Sbjct: 452  NDDLYITWPTSCQKLLKPVFPGSVPSIRRNFHNLVLFIDPAQEYTLDFIKLADVFYSHEV 511

Query: 175  PLRFGVILYSSKFIKSIEINGGELHSPVAEDDSPVNEDISSLIIRLFLFIKESHGTQTAF 234
            PLR G +     FI +             +D+     D    + R F +I E      AF
Sbjct: 512  PLRIGFV-----FILN------------TDDEVDGANDAGVALWRAFNYIAEEFDISEAF 554

Query: 235  QFLSNVNRLRMESADSADDDALEIHHVEGAFVETILPKAKTPPQDMLLKLEKEKTFMDQS 294
                 ++ + M      D + L + +V+     T       P  ++   L     + ++ 
Sbjct: 555  -----ISIVHMYQKVKKDQNILTVDNVKSVLQNTF------PHANIWDILGIHSKYDEER 603

Query: 295  QESSMFVFKLGLTKLKCCLLMNGLVSESSEE-------ALLNAMNDELQRIQEQVYYGNI 347
            +  + F    GL  L   L  NG   +  E        A+L  M D    +Q +V+ G +
Sbjct: 604  KAGASFYKMTGLGPLPQAL-YNGEPFKHEEMNIKELKMAVLQRMMDASVYLQREVFLGTL 662

Query: 348  NSYTDVLEKVLSESGI-NRYNPQIITDAKVKPKFISLASSFLGRETE------------- 393
            N  T+ ++ ++  + +  R N  I+   +   ++++L S+ +  + E             
Sbjct: 663  NDRTNAIDFLMDRNNVVPRINTLIL---RTNQQYLNLISTSVTADVEDFSTFFFLDSQDK 719

Query: 394  ----LKDINYLHSPETVDD---VKPVTHLLAVDVTSKKGMKLLHEGIRFLIGGSNGARLG 446
                 K++ YL    T DD   +  VT  +  D     G KLL   ++ +   S  +RLG
Sbjct: 720  SAVIAKNMYYL----TQDDESIISAVTLWIIADFDKPSGRKLLFNALKHM-KTSVHSRLG 774

Query: 447  VLFSASREADLPSIIFVKAFEITASTYSHKKKVLE-FLDQLCSFYERTYLLASSATADST 505
            ++++ + + +  +    +   I A+  + K   L  FL QL     +  +  +  + D  
Sbjct: 775  IIYNPTSKINEENTAISRG--ILAAFLTQKNMFLRSFLGQLA----KEEIATAIYSGDKI 828

Query: 506  QAFIDKVCEFAEANGLSSKVYRASLPEYSKGKVRKQLNKVVQFLHRQLGVESGANAVITN 565
            + F+    E  + N    K     +  +   ++         F    L +  G   +++N
Sbjct: 829  KTFL---IEGMDKNAFEKKYNTVGVNIFRTHQL---------FCQDVLKLRPGEMGIVSN 876

Query: 566  GRVTFPIDESTFLSHDLSLLESVEFKHRIKHIWEIIEEVNWQETYPDIDPDMLTSKFVSD 625
            GR   P+DE  F + D  LLE + F +  + I  I+E +             + +  +SD
Sbjct: 877  GRFLGPLDED-FYAEDFYLLEKITFSNLGEKIKGIVENMG------------INANNMSD 923

Query: 626  IILFVTSSMAMRDRSSESARFEILSAEYSAVVFNSENSTIHID--AVIDPLSPTGQKLSS 683
             I+ V + M+   + +       L   +S +  N + + +  +  A++DPL+   QK++ 
Sbjct: 924  FIMKVDALMSSVPKRASRYDVTFLRENHSVIKTNPQENDMFFNVIAIVDPLTREAQKMAQ 983

Query: 684  LLRVLQRYAQPSMRIVLNPMSSLVDIPLKNYYRYVV-PTMDDFSNTDYSISGPKAFFANM 742
            LL VL +     +++ +N    L + PL+++YR+V+ P +   +N D S  GP A F ++
Sbjct: 984  LLVVLGKIINMKIKLFMNCRGRLSEAPLESFYRFVLEPELMSGAN-DVSSLGPVAKFLDI 1042

Query: 743  PLSKTLTMNLDVPEPWLVEPVIAVHDLDNILLEKLGDT-RTLQAVFELEALVLTGHCSEK 801
            P S  L +N+  PE WLVE V +  DLDNI    L DT +T+ A +ELE L+L G C +K
Sbjct: 1043 PESPLLILNMITPEGWLVETVHSNCDLDNI---HLKDTEKTVTAEYELEYLLLEGQCFDK 1099

Query: 802  -DHEPPQGLQLILGTKSTPHLVDTLVMANLGYWQMKVSPGVWYLQLAPGRSSELY-VLKE 859
               +PP+GLQ  LGTK+ P +VDT+VMA+ GY+Q+K +PG W L+L  G+S ++Y ++  
Sbjct: 1100 VTEQPPRGLQFTLGTKNKPAVVDTIVMAHHGYFQLKANPGAWILRLHQGKSEDIYQIVGH 1159

Query: 860  DGNVNEDRSLSKRITINDLRGKVVHMEVVKKKGKENEKLLVSSDEDSHSQAEGHWNSNFL 919
            +G  ++       + +N  + K++ ++V K+  K  E +L   DE    + +G W+S   
Sbjct: 1160 EGTDSQADLEDIIVVLNSFKSKILKVKVKKETDKIKEDILTDEDE----KTKGLWDS--- 1212

Query: 920  KWASGFIGGSEQSKKEKAAVDHGKVERHGKTINIFSIASGHLYERFLKIMILSVLKNTCR 979
               S  +   +++KKEK              +NIFS+ASGHLYERFL+IM+LSVL+NT  
Sbjct: 1213 -IKSFTVSLHKENKKEK------------DVLNIFSVASGHLYERFLRIMMLSVLRNTKT 1259

Query: 980  PVKFWFIKNYLSPQFKDVIPHMAQEYGFEYELITYKWPTWLHKQKEKQRIIWAYKILFLD 1039
            PVKFW +KNYLSP FK+VIPHMA+EYGF YEL+ Y+WP WL +Q E+QRIIW YKILFLD
Sbjct: 1260 PVKFWLLKNYLSPTFKEVIPHMAKEYGFRYELVQYRWPRWLRQQTERQRIIWGYKILFLD 1319

Query: 1040 VIFPLSLEKVIFVDADQVVRADMGELYDMDIKGRPLAYTPFCDNNKDMDGYRFWRQGFWK 1099
            V+FPL+++K+IFVDADQ+VR D+ EL D D+ G P  YTPFCD+ ++MDGYRFW+ G+W 
Sbjct: 1320 VLFPLAVDKIIFVDADQIVRHDLKELRDFDLDGAPYGYTPFCDSRREMDGYRFWKTGYWA 1379

Query: 1100 DHLRGRPYHISALYVVDLKRFRETAAGDNLRVFYETLSKDPNSLANLDQDLPNYAQHTVP 1159
             HL  R YHISALYVVDLK+FR   AGD LR  Y+ LS+DPNSL+NLDQDLPN   + V 
Sbjct: 1380 SHLLRRKYHISALYVVDLKKFRRIGAGDRLRSQYQALSQDPNSLSNLDQDLPNNMIYQVA 1439

Query: 1160 IFSLPQEWLWCESWCGNATKSKAKTIDLCNNPMTKEPKLQGARRIVSEWPDLDSEARQF 1218
            I SLPQ+WLWCE+WC + +K +AKTIDLCNNP TKE KL+ A RIV EW + D+E RQ 
Sbjct: 1440 IKSLPQDWLWCETWCDDESKQRAKTIDLCNNPKTKESKLKAAARIVPEWVEYDAEIRQL 1498


>sp|Q8T191|UGGG_DICDI Probable UDP-glucose:glycoprotein glucosyltransferase A
            OS=Dictyostelium discoideum GN=ggtA PE=1 SV=2
          Length = 1681

 Score =  595 bits (1533), Expect = e-168,   Method: Compositional matrix adjust.
 Identities = 289/561 (51%), Positives = 384/561 (68%), Gaps = 9/561 (1%)

Query: 670  VIDPLSPTGQKLSSLLRVLQRYAQPSMRIVLNPMSSLVDIPLKNYYRYVVPTMDDFSNTD 729
            +I+P +   QKL  ++R         + ++LNP  SL ++PLK YY YV+    +F+N +
Sbjct: 1081 IINPFNKVSQKLVPMVREFSNKLNIPVDVILNPPVSLSELPLKTYYTYVIKLSSEFNNEN 1140

Query: 730  YSISGPKAFFANMPLSKTLTMNLDVPEPWLVEPVIAVHDLDNILLEKLGDTRTLQAVFEL 789
               + P     ++P  + +T+ LD+P  WLV+P+IA +DLDNI L+ LGD + L AV+EL
Sbjct: 1141 VLYNQPLGIATDIPEDRVVTLALDIPSSWLVQPIIAKYDLDNIRLKDLGDEQVLTAVYEL 1200

Query: 790  EALVLTGHCSEKDHE-PPQGLQLILG--TKSTPHLVDTLVMANLGYWQMKVSPGVWYLQL 846
            E +V+ G  ++   +  P GL+L+L   +  T    DT+VM N GY+Q+K +PG+W L +
Sbjct: 1201 ENIVIEGSANDMTTDNAPAGLELLLNPISTQTNKTQDTIVMNNFGYYQLKSNPGIWKLTI 1260

Query: 847  APGRSSELYVLKEDGNVNEDRSL----SKRITINDLRGKVVHMEVVKKKGKENEKLLVSS 902
            APGRSS++  + +  N  E  +      + + I+ L   +  + VV+K G+E   +L   
Sbjct: 1261 APGRSSDIMDMVDHPNQKEKETFVIVPHRLVVIDSLYQSLSSLSVVRKAGQELRPILQPI 1320

Query: 903  DEDSHSQAEGHWNSNFLKWASGFIGGSEQSKKEKA-AVDHGKVERHGKTINIFSIASGHL 961
            DE    + +          +SGF      SK +   +V   + + +  TI+IFS+ASGHL
Sbjct: 1321 DEYEKQKEQEKEQKLKQN-SSGFFSNLFSSKNDATDSVATHQKKSNLDTIHIFSVASGHL 1379

Query: 962  YERFLKIMILSVLKNTCRPVKFWFIKNYLSPQFKDVIPHMAQEYGFEYELITYKWPTWLH 1021
            YERFLKIM+LSV+KNT  P+KFWF+KNYLSP FK+ IP MA+EYGF+YEL+TYKWP WL 
Sbjct: 1380 YERFLKIMMLSVVKNTESPIKFWFLKNYLSPAFKEFIPEMAKEYGFQYELVTYKWPWWLR 1439

Query: 1022 KQKEKQRIIWAYKILFLDVIFPLSLEKVIFVDADQVVRADMGELYDMDIKGRPLAYTPFC 1081
            KQ EKQRIIW+YKILFLDV+FPL + K+IFVDADQVVR D+ EL+DMD+ G  L YTPFC
Sbjct: 1440 KQTEKQRIIWSYKILFLDVLFPLDVPKIIFVDADQVVRTDLKELWDMDLHGASLGYTPFC 1499

Query: 1082 DNNKDMDGYRFWRQGFWKDHLRGRPYHISALYVVDLKRFRETAAGDNLRVFYETLSKDPN 1141
            D+NKD +G+RFW+ G+W+ HL GR YHISALYVVDL RFR  AAGD LR  Y+ LS+DPN
Sbjct: 1500 DSNKDTEGFRFWKSGYWRQHLAGRSYHISALYVVDLVRFRRLAAGDQLRATYDQLSRDPN 1559

Query: 1142 SLANLDQDLPNYAQHTVPIFSLPQEWLWCESWCGNATKSKAKTIDLCNNPMTKEPKLQGA 1201
            SLANLDQDLPNY QH V I SLPQEWLWCE+WC   +KSKAKTIDLCNNP+TK PKL+ A
Sbjct: 1560 SLANLDQDLPNYLQHYVRIHSLPQEWLWCETWCDQESKSKAKTIDLCNNPLTKTPKLENA 1619

Query: 1202 RRIVSEWPDLDSEARQFTAKI 1222
             RI+ EW  LD+EA++F  KI
Sbjct: 1620 VRIIDEWTTLDNEAKEFELKI 1640



 Score = 64.3 bits (155), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 65/257 (25%), Positives = 123/257 (47%), Gaps = 18/257 (7%)

Query: 1   MQEISQNFPSVVSSLSRMKLNDSIKDEIVANQRYMPPGK-SLMALNGALINIEDIDLYLL 59
           ++ ISQ FP++ +SLS++ LN+S+K  I +NQ+ +P      + LNG LI+  ++    L
Sbjct: 371 LEYISQKFPTLSNSLSKITLNESLKSVIESNQKIIPSTTDQTLLLNGRLIDTNELSPIEL 430

Query: 60  IDLVHQELSLADQFSKLK-IPRTITQKLLSTVPPAESSMF----RVDFRSTHVQY--LNN 112
             ++ +E   +    +   +     Q ++S   P    +      ++    +  +  LNN
Sbjct: 431 SRIILEEYEHSTTIQQQGPLSSKTVQDIISAQLPIRIQLLPTKEELELNGGNEPFVSLNN 490

Query: 113 LEEDAMYKRWRSNINEILM--PVFPGQLRYIRKNLFHAVYVLDPATVCGLEVIDMIMSLY 170
           LE D +Y++W   +   ++  PV   Q  +IRKNL   V VLD   +   E+I  I  + 
Sbjct: 491 LELDYIYRQWEPKLQSSVLDKPVTSPQDIFIRKNLLTTVIVLDWNNINTFEIIPEIQEMV 550

Query: 171 ENH--FPLRFGVILYSSKFIKSIEINGGELHSPVAEDDSPVNEDISSLIIRLFLFIKESH 228
           + +   P R   +L+++K   +   N           +    +D++    ++FL IK S+
Sbjct: 551 QGNSLIPTRIQ-LLFNTKSNNNNNNNNNNNDQNSQTSNFIQGKDLA----KVFLTIKNSN 605

Query: 229 -GTQTAFQFLSNVNRLR 244
            G + AF F++ +N  +
Sbjct: 606 LGNRGAFFFITALNYFK 622


>sp|Q09140|UGGG_SCHPO UDP-glucose:glycoprotein glucosyltransferase OS=Schizosaccharomyces
            pombe (strain 972 / ATCC 24843) GN=gpt1 PE=1 SV=2
          Length = 1448

 Score =  508 bits (1307), Expect = e-142,   Method: Compositional matrix adjust.
 Identities = 258/551 (46%), Positives = 352/551 (63%), Gaps = 32/551 (5%)

Query: 669  AVIDPLSPTGQKLSSLLRVLQRYAQPSMRIVLNPMSSLVDIPLKNYYRYVVPTMDDFSNT 728
            AV+DPLS   QK S++L  + +     +RI  NP  +L ++PL  +YRY +    +F   
Sbjct: 897  AVLDPLSKDSQKWSAILEAVSKLNGVGVRIHFNPKQTLSELPLTRFYRYSISAEPEFDAL 956

Query: 729  DYSISGPKAFFANMPLSKTLTMNLDVPEPWLVEPVIAVHDLDNILLEKLGDTRTLQ---A 785
             + +      F N+P    LTM+++  + W V       DL NI LE   +   L    A
Sbjct: 957  GH-LEESYVEFDNLPADTLLTMDIEARDAWTVMQKDVDIDLFNIKLEHTSEAEALDSHTA 1015

Query: 786  VFELEALVLTGHCSEK-DHEPPQGLQLILGTKSTPHLVDTLVMANLGYWQMKVSPGVWYL 844
            ++EL+ +++ G+  E+    PP+G+QL LG  +  H+ DT+V++NLGY+Q+K +PGVW L
Sbjct: 1016 IYELKNILVQGYSQEEFRKSPPRGMQLKLGNLTNSHVTDTIVLSNLGYFQLKANPGVWTL 1075

Query: 845  QLAPGRSSELYVLKEDGNVNEDRSLSK-RITINDLRGKVVHMEVVKKKGKENEKLLVSSD 903
            +   GRSS+ Y   E  ++N+  S    ++ ++   G  ++  + +K G E+  ++   D
Sbjct: 1076 EPMDGRSSQFY---EILSLNKKNSYKDPQVIVDSFEGVTLNPVMRRKPGFESADIM---D 1129

Query: 904  EDSHSQAEGHWNSNFLKWASGFIGGSEQSKKEKAAVDHGKVERHGKTINIFSIASGHLYE 963
            ED  S         F               K K ++     +R   +INIFS+ASGHLYE
Sbjct: 1130 EDLSSH-------KFFD-------------KIKKSLSFFNFKRKEASINIFSVASGHLYE 1169

Query: 964  RFLKIMILSVLKNTCRPVKFWFIKNYLSPQFKDVIPHMAQEYGFEYELITYKWPTWLHKQ 1023
            RFL IM  SV+++T + VKFWFI+N+LSP FK  IP +A++Y FEYE ITY WP WL KQ
Sbjct: 1170 RFLYIMTKSVIEHTDKKVKFWFIENFLSPSFKSSIPAIAKKYNFEYEYITYNWPHWLRKQ 1229

Query: 1024 KEKQRIIWAYKILFLDVIFPLSLEKVIFVDADQVVRADMGELYDMDIKGRPLAYTPFCDN 1083
            +EKQR IW YKILFLDV+FPL L KVI+VDADQ+VRAD+ EL DMD+ G P  YTP CD+
Sbjct: 1230 EEKQREIWGYKILFLDVLFPLELHKVIYVDADQIVRADLQELMDMDLHGAPYGYTPMCDS 1289

Query: 1084 NKDMDGYRFWRQGFWKDHLRGRPYHISALYVVDLKRFRETAAGDNLRVFYETLSKDPNSL 1143
             ++M+G+RFW++G+WK  LRG  YHISALYVVDL RFR+  AGD LR  Y+ LS DPNSL
Sbjct: 1290 REEMEGFRFWKKGYWKKFLRGLKYHISALYVVDLDRFRKMGAGDLLRRQYQLLSADPNSL 1349

Query: 1144 ANLDQDLPNYAQHTVPIFSLPQEWLWCESWCGNATKSKAKTIDLCNNPMTKEPKLQGARR 1203
            +NLDQDLPN+ QH +PI+SLPQ+WLWCE+WC + +   AKTIDLC NP+TKE KL  ARR
Sbjct: 1350 SNLDQDLPNHLQHLIPIYSLPQDWLWCETWCSDESLKTAKTIDLCQNPLTKEKKLDRARR 1409

Query: 1204 IVSEWPDLDSE 1214
             VSEW   D+E
Sbjct: 1410 QVSEWTSYDNE 1420



 Score = 80.9 bits (198), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 91/402 (22%), Positives = 185/402 (46%), Gaps = 59/402 (14%)

Query: 1   MQEISQNFPSVVSSLSRMK--LNDSIKDEIVANQRYMPPGKSLMALNGALINIEDIDLYL 58
            +E++Q+FP     LS      ND I++      +Y+P G + + LNG  +++E+ D + 
Sbjct: 297 FRELTQDFPIYAHYLSIQPDVSNDLIEELNQFQSQYVPEGINTIWLNGLSLDLEETDAFS 356

Query: 59  LIDLVHQELSLADQFSKLKIPRTITQKLLSTVPPA-ESSMFRV----------DFRSTHV 107
           ++ L+ +E  + D+F  L I  +    +++    A E S F+           D+++ H 
Sbjct: 357 ILSLIKKEKDMFDRFEALGIKSSKVLDIVTNEAFANEDSDFKFVKFHCQDDIEDWKAIH- 415

Query: 108 QYLNNLEEDAMYKRWRSNINEILMPVFPGQLRYIRKNLFHAVYVLDPATVCGLEVIDMIM 167
            ++N +E +  Y  W  +I  +L P++PGQL  + K L   +Y + P++   L ++  ++
Sbjct: 416 -WVNEIESNPKYDNWPKSIQILLKPIYPGQLHMLGKQLHTVIYPIFPSSPSSLPLLSELI 474

Query: 168 SLYENHFPLRFGVILYSSKFIKSIEINGGELHSPVAEDDSPVNEDISSLIIRLFLFIKES 227
                  P++ G++                     A DD    ++ +  + + F +I + 
Sbjct: 475 QFSRRPSPVQTGMVC-------------------AANDD----DEFAQTVCKSFFYISKE 511

Query: 228 HGTQTAFQFLSNVNRLRMESADSADDDALEIHHVEGAFVETILPKAKTPPQDMLLKLEKE 287
            GT +A +FL       + S  SAD  +L         +E  LP ++    D L  L+K+
Sbjct: 512 SGTDSALKFLYKC----LNSDSSADLYSL---------LEEHLPLSEH-DDDTLANLKKD 557

Query: 288 --KTFMDQ-SQESSMFVFKLGLTKLKCCLLMNGLV---SESSEEALLNAMNDELQRIQEQ 341
              +F D    +S+ +V +LG+      +++NG +    E+ + ++     +++  +Q  
Sbjct: 558 LSSSFFDHYMSKSNSWVNRLGIDSSASEVIVNGRIISHDENYDRSMYGIFLEDIPEVQIA 617

Query: 342 VYYGNINSYTDVLEKVLSESGINRYNPQIITDAKVKPKFISL 383
           V  G I+   ++L+ +L ++ + R NP +   AK   K I +
Sbjct: 618 VAEGKISEDDNLLDFILRDASLTR-NPLVYPSAKSSIKSIDI 658


>sp|P22023|KRE5_YEAST Killer toxin-resistance protein 5 OS=Saccharomyces cerevisiae (strain
            ATCC 204508 / S288c) GN=KRE5 PE=1 SV=2
          Length = 1365

 Score = 94.0 bits (232), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 68/219 (31%), Positives = 104/219 (47%), Gaps = 16/219 (7%)

Query: 951  INIFSI-ASGHLYERFLKIMILSVLKNT--CRPVKFWFIKN-YLSPQFKDVIPHM--AQE 1004
            INIF+I  SG   E     MILS+L      + V F+ +   ++S   +    ++  + E
Sbjct: 1095 INIFTILESGPDEEERYMQMILSILSKCPETQKVNFFILDQPFISDTLRKSCEYINSSDE 1154

Query: 1005 YGFEYELITYKWPTWLHKQKEKQRIIWAYKILFLDVIFPLSLEKVIFVDADQVVRADMGE 1064
                   + Y+WP WL  Q+   R     + LFLDV+ P ++ KV+++   +V       
Sbjct: 1155 MRGNVIFLNYEWPQWLRPQRFSSRRRDVSRFLFLDVLLPQNISKVLYMSPTEVPLDPFDI 1214

Query: 1065 LYDMDIKGRPLAYTPFCDNNKDMDGYRFWRQGFWKDHLRGRP---YHISALYVVDLKRFR 1121
                 +K  PL           M G  +W++G+W+  LR      Y     ++V+L+RFR
Sbjct: 1215 FQFQGLKRAPLGLF-------RMSGDGYWKEGYWEKMLRENNLEFYSTEPAFLVNLERFR 1267

Query: 1122 ETAAGDNLRVFYETLSKDPNSLANLDQDLPNYAQHTVPI 1160
            E  AGD  R+ Y+ +S D  SL N+ QDL N  Q  VPI
Sbjct: 1268 ELDAGDKYRIHYQRISTDAMSLVNIGQDLVNNLQLEVPI 1306


>sp|P43974|Y258_HAEIN Putative glycosyltransferase HI_0258 OS=Haemophilus influenzae
            (strain ATCC 51907 / DSM 11121 / KW20 / Rd) GN=HI_0258
            PE=3 SV=2
          Length = 330

 Score = 46.2 bits (108), Expect = 0.002,   Method: Composition-based stats.
 Identities = 31/128 (24%), Positives = 63/128 (49%), Gaps = 1/128 (0%)

Query: 949  KTINIFSIASGHLYERFLKIMILSVLKNTCRPVKFWFIKNYLSPQFKDVIPHMAQEYGFE 1008
            +T+NI   +S H Y  +L + I S++KNT + + F+ +   ++ + K +I ++A  Y  +
Sbjct: 37   QTMNII-FSSDHYYAPYLAVSIFSIIKNTPKKINFYILDMKINQENKTIINNLASAYSCK 95

Query: 1009 YELITYKWPTWLHKQKEKQRIIWAYKILFLDVIFPLSLEKVIFVDADQVVRADMGELYDM 1068
               +      + +  K    I  A         +  ++EK I++D D +  + + EL+++
Sbjct: 96   VFFLPVCESDFQNFPKTIDYISLATYARLNLTKYIKNIEKAIYIDVDTLTNSSLQELWNI 155

Query: 1069 DIKGRPLA 1076
            DI    LA
Sbjct: 156  DITNYYLA 163


>sp|Q9M8J2|GATL4_ARATH Probable galacturonosyltransferase-like 4 OS=Arabidopsis thaliana
            GN=GATL4 PE=2 SV=1
          Length = 351

 Score = 44.3 bits (103), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 46/176 (26%), Positives = 90/176 (51%), Gaps = 22/176 (12%)

Query: 962  YERFLKIMILSVLKNTCRP--VKFWFI-----KNYLSPQFKDVIPHMA-QEYGFEYELIT 1013
            Y R     +LS+L+++  P  + F F+     +N L    K   P++  + Y F+  L+ 
Sbjct: 76   YLRGTMAAVLSLLQHSTCPENLSFHFLSLPHFENDLFTSIKSTFPYLNFKIYQFDPNLVR 135

Query: 1014 YKWPTWLHKQKEKQRIIWAYKILFLDVIFPLSLEKVIFVDADQVVRADMGELYDMDIKGR 1073
             K    + +Q   Q +   Y  ++L  I P S++++I++D+D VV  D+ +L+ ++++G+
Sbjct: 136  SKISKSI-RQALDQPL--NYARIYLADIIPSSVDRIIYLDSDLVVVDDIEKLWHVEMEGK 192

Query: 1074 PLAYTPFCDNNKDMDGYRFWRQGFWKDH-----LRG-RP-YHISALYVVDLKRFRE 1122
             +A   +C  N       ++ + FW D      L G RP Y  + + VVD+ ++R+
Sbjct: 193  VVAAPEYCHANFT----HYFTRTFWSDPVLVKVLEGKRPCYFNTGVMVVDVNKWRK 244


>sp|P25148|GSPA_BACSU General stress protein A OS=Bacillus subtilis (strain 168) GN=gspA
            PE=1 SV=2
          Length = 286

 Score = 43.1 bits (100), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 38/135 (28%), Positives = 65/135 (48%), Gaps = 6/135 (4%)

Query: 946  RHGKTINIFSIASGHLYERFLKIMILSVLKNTC--RPVKFWFIKNYLSPQFKDVIPHMAQ 1003
            R  + ++I S A  + Y R L  M +S+L N    R VK + I   + P  K  +     
Sbjct: 2    RKDEIMHIVSCADDN-YARHLGGMFVSLLTNMDQEREVKLYVIDGGIKPDNKKRLEETTL 60

Query: 1004 EYGFEYELITYKWPTWLHKQKEKQRIIWA--YKILFLDVIFPLSLEKVIFVDADQVVRAD 1061
            ++G   E +      + H   E   I  A  Y+I   D+I   S++++I++D D +V  D
Sbjct: 61   KFGVPIEFLEVDTNMYEHAV-ESSHITKAAYYRISIPDLIKDESIKRMIYIDCDALVLED 119

Query: 1062 MGELYDMDIKGRPLA 1076
            + +L+D+DI    +A
Sbjct: 120  ISKLWDLDIAPYTVA 134


>sp|O04253|GATL6_ARATH Probable galacturonosyltransferase-like 6 OS=Arabidopsis thaliana
            GN=GATL6 PE=2 SV=1
          Length = 346

 Score = 42.4 bits (98), Expect = 0.022,   Method: Compositional matrix adjust.
 Identities = 36/147 (24%), Positives = 69/147 (46%), Gaps = 14/147 (9%)

Query: 1022 KQKEKQRIIWAYKILFLDVIFPLSLEKVIFVDADQVVRADMGELYDMDIKGRPLAYTPFC 1081
            +Q  +Q + +A   L  D++ P  + +VI++D+D VV  D+ +L+   +  R +    +C
Sbjct: 143  RQALEQPLNYARSYL-ADLLEP-CVNRVIYLDSDLVVVDDIAKLWKTSLGSRIIGAPEYC 200

Query: 1082 DNNKDMDGYRFWRQGFWKDH-----LRGRP--YHISALYVVDLKRFRETAAGDNLRVFYE 1134
              N      +++  GFW +       RGR   Y  + + V+DLK++R       +  + E
Sbjct: 201  HANFT----KYFTGGFWSEERFSGTFRGRKPCYFNTGVMVIDLKKWRRGGYTKRIEKWME 256

Query: 1135 TLSKDP-NSLANLDQDLPNYAQHTVPI 1160
               ++    L +L   L  ++ H  PI
Sbjct: 257  IQRRERIYELGSLPPFLLVFSGHVAPI 283


>sp|Q9Z1M7|LARGE_MOUSE Glycosyltransferase-like protein LARGE1 OS=Mus musculus GN=Large PE=1
            SV=1
          Length = 756

 Score = 40.4 bits (93), Expect = 0.092,   Method: Compositional matrix adjust.
 Identities = 64/275 (23%), Positives = 119/275 (43%), Gaps = 32/275 (11%)

Query: 906  SHSQAEGHWNSNFLKWASGFIGGSEQSKKEKAAVDHGKVERHGKTINIFSIASGHLYERF 965
            ++S  EG  +S  L+  +G + G+     ++ AV+  K E    TI++  + +G+   R 
Sbjct: 101  TYSMEEGTGDSENLR--AGIVAGNSSECGQQPAVE--KCE----TIHVAIVCAGYNASRD 152

Query: 966  LKIMILSVLKNTCRPVKFWFIKNYLSPQ-----FKD-VIPHMAQEYGFEYELITYKWPTW 1019
            +  ++ SVL +   P+ F  I + ++ Q     F+  ++P +  ++    EL +    +W
Sbjct: 153  VVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQTWMVPAVRVDFYNADELKSE--VSW 210

Query: 1020 LHKQKEKQRIIWAYKILFLDVIFPLSLEKVIFVDADQVVRADMGELYDMDIKGRPLAYTP 1079
            +  +      I+    L L    P +LE+VI +D D     D+ EL+ +  K +      
Sbjct: 211  IPNKHYSG--IYGLMKLVLTKTLPANLERVIVLDTDITFATDIAELWAVFHKFKGQQVLG 268

Query: 1080 FCDNNKDMDGYRFWRQG-FWKDH----LRGRPYHISALYVVDLKRFRETAAGDNLRVFYE 1134
              +N  D      W  G  WK+H      GR Y+   + ++ L + R+       R+  E
Sbjct: 269  LVENQSD------WYLGNLWKNHRPWPALGRGYNTGVILLL-LDKLRKMKWEQMWRLTAE 321

Query: 1135 TLSKDPNSLANLDQDLPNYA--QHTVPIFSLPQEW 1167
                   S +  DQD+ N    Q+   ++ LP  W
Sbjct: 322  RELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFW 356


>sp|Q0V7R1|GATL3_ARATH Probable galacturonosyltransferase-like 3 OS=Arabidopsis thaliana
            GN=GATL3 PE=2 SV=1
          Length = 345

 Score = 40.0 bits (92), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 52/212 (24%), Positives = 97/212 (45%), Gaps = 18/212 (8%)

Query: 922  ASGFIGGSEQSKKEKAAVDHGKVERHGKTINIFSIASGHLYERFLKIMILSVLKNTCRP- 980
            A  F  G E SK      DH     H  +I   ++    +Y R     + SVL++   P 
Sbjct: 34   APAFRNGRECSKTTWIPSDH----EHNPSIIHIAMTLDAIYLRGSVAGVFSVLQHASCPE 89

Query: 981  -VKFWFIKNYL-SPQFKDVI----PHMAQE-YGFEYELITYKWPTWLHKQKEKQRIIWAY 1033
             + F FI  +  S   + +I    P++    Y F+  L+  K  + + +  ++      Y
Sbjct: 90   NIVFHFIATHRRSADLRRIISSTFPYLTYHIYHFDPNLVRSKISSSIRRALDQPL---NY 146

Query: 1034 KILFLDVIFPLSLEKVIFVDADQVVRADMGELYDMDIKGRPLAYTPFCDNN-KDMDGYRF 1092
              ++L  + P+++ +VI+ D+D VV  D+ +L+ +D++   +    +C  N  +    RF
Sbjct: 147  ARIYLADLLPIAVRRVIYFDSDLVVVDDVAKLWRIDLRRHVVGAPEYCHANFTNYFTSRF 206

Query: 1093 WRQGFWKDHLRGRP--YHISALYVVDLKRFRE 1122
            W    +K  L+ R   Y  + + V+DL ++RE
Sbjct: 207  WSSQGYKSALKDRKPCYFNTGVMVIDLGKWRE 238


>sp|Q66PG3|LARGE_CHICK Glycosyltransferase-like protein LARGE1 OS=Gallus gallus GN=LARGE
            PE=2 SV=1
          Length = 756

 Score = 40.0 bits (92), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 67/277 (24%), Positives = 122/277 (44%), Gaps = 36/277 (12%)

Query: 906  SHSQAEGHWNSNFLKWASGFIGGSEQSKKEKAAVDHGKVERHGKTINIFSIASGHLYERF 965
            ++S  EG  +S  L+  +G + G+     ++ AV+  K E    TI++  + +G+   R 
Sbjct: 101  TYSMEEGTGDSESLR--AGIVAGNSSECGQQPAVE--KCE----TIHVAIVCAGYNASRD 152

Query: 966  LKIMILSVLKNTCRPVKFWFIKNYLSPQ-----FKD-VIPHMAQEYGFEYELITYKWPTW 1019
            +  ++ SVL +   P+ F  I + ++ Q     F+  ++P +  ++    EL +    +W
Sbjct: 153  VVTLVKSVLFHRRNPLHFHLIADAIAKQILATLFQTWMVPAVRIDFYDADELKSEV--SW 210

Query: 1020 LHKQKEKQRIIWAYKILFLDVIFPLSLEKVIFVDADQVVRADMGELYDMDIKGRPLAYTP 1079
            +  +      I+    L L    P +LE+VI +D D     D+ EL+ +  K +      
Sbjct: 211  IPNKHYSG--IYGLMKLVLTKTLPSNLERVIVLDTDITFATDIAELWAVFHKFKGQQVLG 268

Query: 1080 FCDNNKDMDGYRFWRQG-FWKDH----LRGRPYHISALYVVDLKRFRETAAGDNLRVFYE 1134
              +N  D      W  G  WK+H      GR Y+   + ++ L + R+       R+  E
Sbjct: 269  LVENQSD------WYLGNLWKNHRPWPALGRGYNTGVILLL-LDKLRKMKWEQMWRLTAE 321

Query: 1135 T--LSKDPNSLANLDQDLPNYA--QHTVPIFSLPQEW 1167
               +S    SLA  DQD+ N    Q+   ++ LP  W
Sbjct: 322  RELMSMLSTSLA--DQDIFNAVIKQNPFLVYQLPCFW 356


>sp|Q9FWY9|GATL5_ARATH Probable galacturonosyltransferase-like 5 OS=Arabidopsis thaliana
            GN=GATL5 PE=2 SV=1
          Length = 361

 Score = 39.7 bits (91), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 37/147 (25%), Positives = 70/147 (47%), Gaps = 14/147 (9%)

Query: 1022 KQKEKQRIIWAYKILFLDVIFPLSLEKVIFVDADQVVRADMGELYDMDIKGRPLAYTPFC 1081
            +Q  +Q + +A   L  D++ P  +++VI++D+D VV  D+ +L+   +  R +    +C
Sbjct: 158  RQALEQPLNYARNYL-ADLLEP-CVKRVIYLDSDLVVVDDIVKLWKTGLGQRTIGAPEYC 215

Query: 1082 DNNKDMDGYRFWRQGFWKDH-----LRGR--PYHISALYVVDLKRFRETAAGDNLRVFYE 1134
              N      +++  GFW D       +GR   Y  + + V+DLK++R+      +  + E
Sbjct: 216  HANFT----KYFTGGFWSDKRFNGTFKGRNPCYFNTGVMVIDLKKWRQFRFTKRIEKWME 271

Query: 1135 TLSKDP-NSLANLDQDLPNYAQHTVPI 1160
                +    L +L   L  +A H  PI
Sbjct: 272  IQKIERIYELGSLPPFLLVFAGHVAPI 298


>sp|Q66PG1|LARG2_DANRE Glycosyltransferase-like protein LARGE2 OS=Danio rerio GN=gyltl1b
            PE=2 SV=1
          Length = 750

 Score = 39.3 bits (90), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 78/343 (22%), Positives = 140/343 (40%), Gaps = 46/343 (13%)

Query: 854  LYVLKEDGNVNEDRSLS-KRITINDLRGKVVHMEVVKKKGKENE------KLLVSSDEDS 906
            LY+L   GN    RSL      +     +++  +V+  + +E E      +L +S  +  
Sbjct: 25   LYLLV--GNSENGRSLLLSACLVESTEARLLERDVLASRVREVEEENRQIRLQLSQSQGL 82

Query: 907  HSQ-AEGHWNSNFLKW-ASGFIGGSEQSKKEKAAVDHGKVER-----HGKTINIFSIASG 959
              Q AEG++ +   +W AS   G  +     +   +H +  R       + +++  + +G
Sbjct: 83   AGQPAEGNYGNQ--QWVASADTGPEDVENTAEERANHSECSRSPTAEKCELLHVACVCAG 140

Query: 960  HLYERFLKIMILSVLKNTCRPVKFWFIKNYLSPQFKD------VIPHMAQEYGFEYELIT 1013
            H   R +  ++ S+L +   P+ F FI + ++ Q         ++P +   +    EL +
Sbjct: 141  HNASRDVVTLVKSILFHRRNPLHFHFITDTVANQILSTLFQSWMVPSVQVSFYDADELKS 200

Query: 1014 YKWPTWLHKQKEKQRIIWAYKILFLDVIFPLSLEKVIFVDADQVVRADMGELYDMDIKGR 1073
                +W+  +      I+    L L    P +L KVI +D D     D+ EL+ +  K  
Sbjct: 201  E--VSWIPNKHYSG--IYGLMKLTLTKALPSNLSKVIVLDTDITFATDIAELWAIFRKFT 256

Query: 1074 PLAYTPFCDNNKDMDGYRFWRQG-FWKDH----LRGRPYHISALYVVDLKRFRETAAGDN 1128
                    +N  D      W  G  WK+H      GR ++   + ++ L+R R       
Sbjct: 257  EKQVIGLVENQSD------WYLGNLWKNHKPWPALGRGFNTGVI-LLYLERLRRMGWEQM 309

Query: 1129 LRVFYET--LSKDPNSLANLDQDLPN--YAQHTVPIFSLPQEW 1167
             R+  E   +S    SLA  DQD+ N    Q+ V +  LP  W
Sbjct: 310  WRLTAERELMSMLSTSLA--DQDIFNAFIKQNPVLVHQLPCFW 350


>sp|Q0WV13|GAUTD_ARATH Probable galacturonosyltransferase 13 OS=Arabidopsis thaliana
            GN=GAUT13 PE=2 SV=1
          Length = 533

 Score = 38.9 bits (89), Expect = 0.29,   Method: Compositional matrix adjust.
 Identities = 35/133 (26%), Positives = 63/133 (47%), Gaps = 21/133 (15%)

Query: 1036 LFLDVIFPLSLEKVIFVDADQVVRADMGELYDMDIKGRPLAYTPFCDN------NKDMDG 1089
            ++L  +FP +L+KV+F+D D V++ D+  L+D+D+ G+       C        +K +  
Sbjct: 335  IYLPELFP-NLDKVVFLDDDIVIQKDLSPLWDIDLNGKVNGAVETCRGEDVWVMSKRLRN 393

Query: 1090 YRFWRQGFWKDHLRGRPYHISALY---VVDLKRFRETAAGDNLRVFYETLSKDPNSLANL 1146
            Y  +       HL   P   +  Y   + DL+ +R+T    N+R  Y +  K+     NL
Sbjct: 394  YFNFSHPLIAKHL--DPEECAWAYGMNIFDLRTWRKT----NIRETYHSWLKE-----NL 442

Query: 1147 DQDLPNYAQHTVP 1159
              +L  +   T+P
Sbjct: 443  KSNLTMWKLGTLP 455


>sp|O04536|GATL9_ARATH Probable galacturonosyltransferase-like 9 OS=Arabidopsis thaliana
            GN=GATL9 PE=2 SV=1
          Length = 390

 Score = 38.5 bits (88), Expect = 0.32,   Method: Compositional matrix adjust.
 Identities = 28/110 (25%), Positives = 53/110 (48%), Gaps = 12/110 (10%)

Query: 1037 FLDVIFPLSLEKVIFVDADQVVRADMGELYDMDIKGRPLAYTP-FCDNNKDMDGYRFWRQ 1095
            +L  I    +++VI++D+D +V  D+ +L++  + G  +   P +C  N      +++  
Sbjct: 175  YLGDILDPCVDRVIYLDSDIIVVDDITKLWNTSLTGSRIIGAPEYCHANFT----KYFTS 230

Query: 1096 GFWKD-----HLRGRP--YHISALYVVDLKRFRETAAGDNLRVFYETLSK 1138
            GFW D        GR   Y  + + V+DL R+RE    + L  + +   K
Sbjct: 231  GFWSDPALPGFFSGRKPCYFNTGVMVMDLVRWREGNYREKLETWMQIQKK 280


>sp|O95461|LARGE_HUMAN Glycosyltransferase-like protein LARGE1 OS=Homo sapiens GN=LARGE PE=1
            SV=1
          Length = 756

 Score = 38.5 bits (88), Expect = 0.33,   Method: Compositional matrix adjust.
 Identities = 63/275 (22%), Positives = 118/275 (42%), Gaps = 32/275 (11%)

Query: 906  SHSQAEGHWNSNFLKWASGFIGGSEQSKKEKAAVDHGKVERHGKTINIFSIASGHLYERF 965
            ++S  EG  +S  L+  +G + G+     ++  V+  K E    TI++  + +G+   R 
Sbjct: 101  TYSMEEGTGDSENLR--AGIVAGNSSECGQQPVVE--KCE----TIHVAIVCAGYNASRD 152

Query: 966  LKIMILSVLKNTCRPVKFWFIKNYLSPQ-----FKD-VIPHMAQEYGFEYELITYKWPTW 1019
            +  ++ SVL +   P+ F  I + ++ Q     F+  ++P +  ++    EL +    +W
Sbjct: 153  VVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQTWMVPAVRVDFYNADELKSE--VSW 210

Query: 1020 LHKQKEKQRIIWAYKILFLDVIFPLSLEKVIFVDADQVVRADMGELYDMDIKGRPLAYTP 1079
            +  +      I+    L L    P +LE+VI +D D     D+ EL+ +  K +      
Sbjct: 211  IPNKHYSG--IYGLMKLVLTKTLPANLERVIVLDTDITFATDIAELWAVFHKFKGQQVLG 268

Query: 1080 FCDNNKDMDGYRFWRQG-FWKDH----LRGRPYHISALYVVDLKRFRETAAGDNLRVFYE 1134
              +N  D      W  G  WK+H      GR Y+   + ++ L + R+       R+  E
Sbjct: 269  LVENQSD------WYLGNLWKNHRPWPALGRGYNTGVILLL-LDKLRKMKWEQMWRLTAE 321

Query: 1135 TLSKDPNSLANLDQDLPNYA--QHTVPIFSLPQEW 1167
                   S +  DQD+ N    Q+   ++ LP  W
Sbjct: 322  RELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFW 356


>sp|Q9LHD2|GATLA_ARATH Probable galacturonosyltransferase-like 10 OS=Arabidopsis thaliana
            GN=GATL10 PE=2 SV=1
          Length = 365

 Score = 38.1 bits (87), Expect = 0.49,   Method: Compositional matrix adjust.
 Identities = 27/106 (25%), Positives = 50/106 (47%), Gaps = 15/106 (14%)

Query: 1033 YKILFLDVIFPLSLEKVIFVDADQVVRADMGELYDMDIKG-RPLAYTPFCDNNKDMDGYR 1091
            Y   +L  I    + +VI++D+D +V  D+ +L+ + + G R +    +C  N      +
Sbjct: 157  YARSYLSEILSSCVSRVIYLDSDVIVVDDIQKLWKISLSGSRTIGAPEYCHANFT----K 212

Query: 1092 FWRQGFWKDH-------LRGRPYHISALYVVDLKRFRETAAGDNLR 1130
            ++   FW D         +   Y  + + V+DL+R+RE   GD  R
Sbjct: 213  YFTDSFWSDQKLSSVFDSKTPCYFNTGVMVIDLERWRE---GDYTR 255


>sp|O48684|GATL8_ARATH Probable galacturonosyltransferase-like 8 OS=Arabidopsis thaliana
            GN=GATL8 PE=2 SV=1
          Length = 393

 Score = 37.0 bits (84), Expect = 0.94,   Method: Compositional matrix adjust.
 Identities = 28/94 (29%), Positives = 50/94 (53%), Gaps = 12/94 (12%)

Query: 1037 FLDVIFPLSLEKVIFVDADQVVRADMGELYDMDIKGRPLAYTP-FCDNNKDMDGYRFWRQ 1095
            +L  I   S+E+VI++D+D +   D+ +L++  + G  +   P +C  N      +++  
Sbjct: 178  YLGDILDRSVERVIYLDSDVITVDDITKLWNTVLTGSRVIGAPEYCHANFT----QYFTS 233

Query: 1096 GFWKD-HLRG-----RP-YHISALYVVDLKRFRE 1122
            GFW D  L G     +P Y  + + V+DL R+RE
Sbjct: 234  GFWSDPALPGLISGQKPCYFNTGVMVMDLVRWRE 267


>sp|Q8GWT1|GAUTE_ARATH Probable galacturonosyltransferase 14 OS=Arabidopsis thaliana
            GN=GAUT14 PE=2 SV=1
          Length = 532

 Score = 36.6 bits (83), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 34/133 (25%), Positives = 63/133 (47%), Gaps = 21/133 (15%)

Query: 1036 LFLDVIFPLSLEKVIFVDADQVVRADMGELYDMDIKGRPLAYTPFCDN------NKDMDG 1089
            +++  +FP +L+KV+F+D D VV+ D+  L+D+D+ G+       C        +K +  
Sbjct: 334  IYIPELFP-NLDKVVFLDDDIVVQGDLTPLWDVDLGGKVNGAVETCRGEDEWVMSKRLRN 392

Query: 1090 YRFWRQGFWKDHLRGRPYHISALY---VVDLKRFRETAAGDNLRVFYETLSKDPNSLANL 1146
            Y  +       HL   P   +  Y   + DL+ +R+T    N+R  Y +  ++     NL
Sbjct: 393  YFNFSHPLIAKHL--DPEECAWAYGMNIFDLQAWRKT----NIRETYHSWLRE-----NL 441

Query: 1147 DQDLPNYAQHTVP 1159
              +L  +   T+P
Sbjct: 442  KSNLTMWKLGTLP 454


>sp|Q9FH36|GAUTC_ARATH Probable galacturonosyltransferase 12 OS=Arabidopsis thaliana
            GN=GAUT12 PE=2 SV=1
          Length = 535

 Score = 36.6 bits (83), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 16/38 (42%), Positives = 27/38 (71%), Gaps = 1/38 (2%)

Query: 1036 LFLDVIFPLSLEKVIFVDADQVVRADMGELYDMDIKGR 1073
            + L  +FP SL KV+F+D D V++ D+  L+D+D+ G+
Sbjct: 336  IHLPELFP-SLNKVVFLDDDIVIQTDLSPLWDIDMNGK 372


>sp|Q9QUS6|TRIM1_MOUSE Probable E3 ubiquitin-protein ligase MID2 OS=Mus musculus GN=Mid2
           PE=2 SV=2
          Length = 705

 Score = 36.2 bits (82), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 22/53 (41%), Positives = 29/53 (54%)

Query: 434 RFLIGGSNGARLGVLFSASREADLPSIIFVKAFEITASTYSHKKKVLEFLDQL 486
           RFL    N A    + +AS +  +P I F  AFE  A  +S +KK+LE LD L
Sbjct: 347 RFLQSAKNIAERVAMATASSQVLIPDINFNDAFENFALDFSREKKLLEGLDYL 399


>sp|Q9UJV3|TRIM1_HUMAN Probable E3 ubiquitin-protein ligase MID2 OS=Homo sapiens GN=MID2
           PE=1 SV=3
          Length = 735

 Score = 36.2 bits (82), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 22/53 (41%), Positives = 29/53 (54%)

Query: 434 RFLIGGSNGARLGVLFSASREADLPSIIFVKAFEITASTYSHKKKVLEFLDQL 486
           RFL    N A    + +AS +  +P I F  AFE  A  +S +KK+LE LD L
Sbjct: 347 RFLQSAKNIAERVAMATASSQVLIPDINFNDAFENFALDFSREKKLLEGLDYL 399


>sp|Q2HJ96|GL8D2_BOVIN Glycosyltransferase 8 domain-containing protein 2 OS=Bos taurus
            GN=GLT8D2 PE=2 SV=1
          Length = 350

 Score = 35.8 bits (81), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 21/60 (35%), Positives = 32/60 (53%), Gaps = 4/60 (6%)

Query: 1037 FLDVIFPLSL---EKVIFVDADQVVRADMGELYDMDIK-GRPLAYTPFCDNNKDMDGYRF 1092
            F+    PL +   EKVI++D D +V+ D+ ELYD  +  G   A++  CD     D +R 
Sbjct: 137  FVRFYLPLLIHQHEKVIYLDDDVIVQGDIQELYDTTLALGHAAAFSDDCDLPSSQDIHRL 196


>sp|Q0WQD2|GAUT3_ARATH Probable galacturonosyltransferase 3 OS=Arabidopsis thaliana GN=GAUT3
            PE=2 SV=2
          Length = 680

 Score = 35.4 bits (80), Expect = 3.2,   Method: Compositional matrix adjust.
 Identities = 14/38 (36%), Positives = 28/38 (73%), Gaps = 1/38 (2%)

Query: 1036 LFLDVIFPLSLEKVIFVDADQVVRADMGELYDMDIKGR 1073
             +L  ++P  LEK++F+D D VV+ D+  L+++D++G+
Sbjct: 489  FYLPEVYP-KLEKILFLDDDIVVQKDLAPLWEIDMQGK 525


>sp|Q93ZX7|GAUT4_ARATH Probable galacturonosyltransferase 4 OS=Arabidopsis thaliana GN=GAUT4
            PE=2 SV=1
          Length = 616

 Score = 35.0 bits (79), Expect = 4.2,   Method: Compositional matrix adjust.
 Identities = 17/37 (45%), Positives = 25/37 (67%), Gaps = 1/37 (2%)

Query: 1036 LFLDVIFPLSLEKVIFVDADQVVRADMGELYDMDIKG 1072
             +L  IFP  L KV+F+D D VV+ D+  L+ +D+KG
Sbjct: 425  FYLPEIFP-KLSKVLFLDDDIVVQKDLSGLWSVDLKG 460


>sp|Q640P4|GL8D2_MOUSE Glycosyltransferase 8 domain-containing protein 2 OS=Mus musculus
            GN=Glt8d2 PE=2 SV=1
          Length = 349

 Score = 34.7 bits (78), Expect = 4.6,   Method: Compositional matrix adjust.
 Identities = 21/60 (35%), Positives = 32/60 (53%), Gaps = 4/60 (6%)

Query: 1037 FLDVIFPLSL---EKVIFVDADQVVRADMGELYDMDIK-GRPLAYTPFCDNNKDMDGYRF 1092
            F+    PL +   EKVI++D D +V+ D+ ELYD  +  G   A++  CD     D +R 
Sbjct: 137  FVRFYLPLLVHQHEKVIYLDDDVIVQGDIQELYDTTLALGHAAAFSDDCDLPSAQDIHRL 196


>sp|Q66PG4|LARG2_CHICK Glycosyltransferase-like protein LARGE2 OS=Gallus gallus GN=GYLTL1B
            PE=2 SV=1
          Length = 739

 Score = 34.3 bits (77), Expect = 5.9,   Method: Compositional matrix adjust.
 Identities = 48/230 (20%), Positives = 94/230 (40%), Gaps = 38/230 (16%)

Query: 888  VKKKGKENEKLLVSSDEDSHSQAEGHWNSNFLKWASGF-----IGGSEQSKKEKAAVDHG 942
            V++  +EN++L +   +    QAEG  +    +WA+       +GG    +  + A    
Sbjct: 61   VRRVEEENQQLRMQLGQ---VQAEG--SDGNPQWAASAEDGPPLGGE---RNNRTACPEQ 112

Query: 943  KVERHGKTINIFSIASGHLYERFLKIMILSVLKNTCRPVKFWFIKNYLSPQFKDVIPHMA 1002
            ++ R  + +++  + +GH   R +  ++ S+L +   P+ F FI         D + H  
Sbjct: 113  RMVRKCELLHVAIVCAGHNASRDVVTLVKSILFHRKNPLHFHFI--------TDSVAHQI 164

Query: 1003 QEYGFEYELITYKWPTWLHKQKEKQRIIW--------AYKILFLDVI--FPLSLEKVIFV 1052
             +  F+  ++     ++ +    K  + W         Y ++ L +    P +L KVI +
Sbjct: 165  LQTLFQSWMVPSIHVSFYNADDLKPEVSWIPNKHYSGIYGLMKLTLTKALPSNLSKVIVL 224

Query: 1053 DADQVVRADMGELYDMDIKGRPLAYTPFCDNNKDMDGYRFWRQG-FWKDH 1101
            D D     D+ EL+ +  K          +N  D      W  G  WK+H
Sbjct: 225  DTDITFATDIAELWAVFGKFSEKQVIGLVENQSD------WYLGNLWKNH 268


>sp|Q8VYF4|GATL7_ARATH Probable galacturonosyltransferase-like 7 OS=Arabidopsis thaliana
            GN=GATL7 PE=2 SV=1
          Length = 361

 Score = 34.3 bits (77), Expect = 6.3,   Method: Compositional matrix adjust.
 Identities = 28/124 (22%), Positives = 54/124 (43%), Gaps = 12/124 (9%)

Query: 1045 SLEKVIFVDADQVVRADMGELYDMDIKGRPLAYTPFCDNNKDMDGYRFWRQGFWKDH--- 1101
             + +VI++D+D +V  D+ +L+   +  + +    +C  N      +++   FW D    
Sbjct: 178  CVRRVIYLDSDLIVVDDIAKLWMTKLGSKTIGAPEYCHANFT----KYFTPAFWSDERFS 233

Query: 1102 --LRGRP--YHISALYVVDLKRFRETAAGDNLRVFYETLSKDP-NSLANLDQDLPNYAQH 1156
                GR   Y  + + V+DL+R+R     + +  + E    D    L +L   L  +A  
Sbjct: 234  GAFSGRKPCYFNTGVMVMDLERWRRVGYTEVIEKWMEIQKSDRIYELGSLPPFLLVFAGE 293

Query: 1157 TVPI 1160
              PI
Sbjct: 294  VAPI 297


  Database: swissprot
    Posted date:  Mar 23, 2013  2:32 AM
  Number of letters in database: 191,569,459
  Number of sequences in database:  539,616
  
Lambda     K      H
   0.318    0.134    0.392 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 456,438,790
Number of Sequences: 539616
Number of extensions: 19487509
Number of successful extensions: 49148
Number of sequences better than 100.0: 45
Number of HSP's better than 100.0 without gapping: 18
Number of HSP's successfully gapped in prelim test: 27
Number of HSP's that attempted gapping in prelim test: 49047
Number of HSP's gapped (non-prelim): 68
length of query: 1258
length of database: 191,569,459
effective HSP length: 129
effective length of query: 1129
effective length of database: 121,958,995
effective search space: 137691705355
effective search space used: 137691705355
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 67 (30.4 bits)