BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= psy3847
         (664 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|242008779|ref|XP_002425177.1| conserved hypothetical protein [Pediculus humanus corporis]
 gi|212508879|gb|EEB12439.1| conserved hypothetical protein [Pediculus humanus corporis]
          Length = 685

 Score =  704 bits (1817), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 340/523 (65%), Positives = 394/523 (75%), Gaps = 60/523 (11%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           P +   FY A++VV DVSWIPNKHYSGVYGL+KLTLPK+LP  L K +VLDTDV FATDI
Sbjct: 122 PQVDVRFYPAENVVPDVSWIPNKHYSGVYGLMKLTLPKILPRNLTKVVVLDTDVTFATDI 181

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LW L +++  +Q  GLVENQSDWYLGKLWKNHKPWPALGRG+NTGVILL + +LR + 
Sbjct: 182 AELWKLMTKMSDKQAFGLVENQSDWYLGKLWKNHKPWPALGRGYNTGVILLQMKRLRRLK 241

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           WA  WR IAEK L+T   TSLADQDI NAII ++P LVY LPCQWNVQLSDNTRS ELCY
Sbjct: 242 WAHLWRFIAEKDLVTMYSTSLADQDILNAIIKQYPELVYKLPCQWNVQLSDNTRS-ELCY 300

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQTTLQQAE 405
           TE+TDLK+IHWNSPKKLKV+NKH+EFFRNLYLTFLEYDGNLLRRELFGCN+++ ++ Q E
Sbjct: 301 TEVTDLKVIHWNSPKKLKVQNKHVEFFRNLYLTFLEYDGNLLRRELFGCNISRPSVPQQE 360

Query: 406 LSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEINPN 465
           L  LNEED CYD RR++++ LRTHLYFL+Y+YE S+DG DVTLVAQLSMDRL        
Sbjct: 361 LRKLNEEDQCYDFRRARISRLRTHLYFLDYDYEPSTDGADVTLVAQLSMDRL-------- 412

Query: 466 NVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNS 525
                                    QMVE L KHWEGPISLTLYMSDAEAQQFLSYA +S
Sbjct: 413 -------------------------QMVENLLKHWEGPISLTLYMSDAEAQQFLSYALSS 447

Query: 526 EALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSI 585
           E L SR+NIGYH+VYKEGNFYP+N LRNVAL QV TP+VFL DIDFLPMFGLY YLK+ +
Sbjct: 448 EILSSRKNIGYHIVYKEGNFYPVNLLRNVALQQVKTPFVFLTDIDFLPMFGLYEYLKTCV 507

Query: 586 RSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------HAPTN 621
           + +++      K LVVPAFETQRYR  FP S                        HAPTN
Sbjct: 508 QLLNL--ENSNKALVVPAFETQRYRITFPRSKADLLRMLDMGSLFTFRYDVWPKGHAPTN 565

Query: 622 FSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           +++W  AT+PY++ W PD+EPYIV  +D+P YDTRFVGFGWNK
Sbjct: 566 YTKWRTATSPYEVSWEPDYEPYIVVKKDIPLYDTRFVGFGWNK 608



 Score =  267 bits (683), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 121/168 (72%), Positives = 142/168 (84%)

Query: 5   EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
           +E  I  CE+I VAIVCAGYN++RS+VTLIKSILFY +NPL  H I D+VA +IL+TLF 
Sbjct: 58  KEAEISKCEIIHVAIVCAGYNASRSVVTLIKSILFYTRNPLQLHFIADSVAWHILETLFK 117

Query: 65  TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
           TWSVPQV+V FY A++VV DVSWIPNKHYSGVYGL+KLTLPK+LP  L K +VLDTDV F
Sbjct: 118 TWSVPQVDVRFYPAENVVPDVSWIPNKHYSGVYGLMKLTLPKILPRNLTKVVVLDTDVTF 177

Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           ATDIA+LW L +++  +Q  GLVENQSDWYLGKLWKNHKPWPALGRG+
Sbjct: 178 ATDIAELWKLMTKMSDKQAFGLVENQSDWYLGKLWKNHKPWPALGRGY 225


>gi|193669092|ref|XP_001944635.1| PREDICTED: glycosyltransferase-like protein LARGE1-like isoform 1
           [Acyrthosiphon pisum]
 gi|328699413|ref|XP_003240926.1| PREDICTED: glycosyltransferase-like protein LARGE1-like isoform 2
           [Acyrthosiphon pisum]
          Length = 709

 Score =  661 bits (1705), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 325/523 (62%), Positives = 380/523 (72%), Gaps = 63/523 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           P L   FY A+ ++ DVSWI NKHYSGVYGL+KL L +V P+TL + +VLDTD+ FATDI
Sbjct: 156 PQLEIEFYPAEKIIADVSWINNKHYSGVYGLMKLCLTQVFPDTLKQVLVLDTDITFATDI 215

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           AQLW++    ++ Q++GLVENQSDWYLGKLWK HKPWPALGRG+NTGVIL+DL KLR I 
Sbjct: 216 AQLWSMLKSFKKGQSLGLVENQSDWYLGKLWKKHKPWPALGRGYNTGVILMDLRKLRLIG 275

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W G WR +A K L+T  WTSLADQDIFN II E P L+Y +PCQWNVQLSDNTRS ELCY
Sbjct: 276 WNGLWRSVAVKTLVTHYWTSLADQDIFNTIIKERPELLYPMPCQWNVQLSDNTRS-ELCY 334

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQTTLQQAE 405
            E+ +LKIIHWNSPKKLKVKNKH+EFFRNLYLTFLEYDGNLL+RELFGCN + T     E
Sbjct: 335 AEVIELKIIHWNSPKKLKVKNKHVEFFRNLYLTFLEYDGNLLQRELFGCNYSSTIQDVPE 394

Query: 406 LSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEINPN 465
           +S   E+D CY+ RR+ + SLRTHLYFL++EY++S D  D+TLVAQLSMDRLQ       
Sbjct: 395 IS---EDDLCYEFRRAGMASLRTHLYFLDFEYQSSPDECDITLVAQLSMDRLQ------- 444

Query: 466 NVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNS 525
                                     MVEML ++W+GPISL+LYMSDAEAQQFLSYA NS
Sbjct: 445 --------------------------MVEMLCEYWDGPISLSLYMSDAEAQQFLSYAQNS 478

Query: 526 EALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSI 585
           E LR+R+NIGYHVVYKEG+FYPIN LRNVAL  V TPYVFL DIDFLPM GLY YLK  I
Sbjct: 479 EVLRNRKNIGYHVVYKEGDFYPINLLRNVALENVVTPYVFLSDIDFLPMSGLYSYLKKYI 538

Query: 586 RSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------HAPTN 621
             M++      K LVVPAFETQRYRTAFP +                        HA TN
Sbjct: 539 FIMNI--KSSDKALVVPAFETQRYRTAFPQTKADILRMLDDGALFSFRYHVWPKGHAATN 596

Query: 622 FSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           +++W  +T PY++ W PD+EPY+V   D+PRYDTRFVGFGWNK
Sbjct: 597 YAKWKGSTVPYKVSWEPDYEPYVVVRSDVPRYDTRFVGFGWNK 639



 Score =  254 bits (648), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 111/164 (67%), Positives = 136/164 (82%)

Query: 9   IPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSV 68
           I  CEVI+V IVCAGYNSTRS+VTLIKSILFYRKNPL FH + D ++  IL+TLF TW +
Sbjct: 96  IDKCEVIEVGIVCAGYNSTRSVVTLIKSILFYRKNPLRFHFVVDVISRKILKTLFDTWKI 155

Query: 69  PQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 128
           PQ+E+ FY A+ ++ DVSWI NKHYSGVYGL+KL L +V P+TL + +VLDTD+ FATDI
Sbjct: 156 PQLEIEFYPAEKIIADVSWINNKHYSGVYGLMKLCLTQVFPDTLKQVLVLDTDITFATDI 215

Query: 129 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           AQLW++    ++ Q++GLVENQSDWYLGKLWK HKPWPALGRG+
Sbjct: 216 AQLWSMLKSFKKGQSLGLVENQSDWYLGKLWKKHKPWPALGRGY 259


>gi|94733298|emb|CAK04442.1| novel protein similar to vertebrate discs, large homolog 3
           (neuroendocrine-dlg, Drosophila) (DLG3) [Danio rerio]
          Length = 750

 Score =  657 bits (1696), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 320/527 (60%), Positives = 379/527 (71%), Gaps = 65/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           P++   FY AD +  +VSWIPNKHYSG+YGL+KLTL K LP  L+K IVLDTD+ FATDI
Sbjct: 186 PSVQVSFYDADELKSEVSWIPNKHYSGIYGLMKLTLTKALPSNLSKVIVLDTDITFATDI 245

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LWA+F +  ++Q IGLVENQSDWYLG LWKNHKPWPALGRGFNTGVILL L +LR + 
Sbjct: 246 AELWAIFRKFTEKQVIGLVENQSDWYLGNLWKNHKPWPALGRGFNTGVILLYLERLRRMG 305

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   WR+ AE+ L++ L TSLADQDIFNA I ++P LV+ LPC WNVQLSD+TRS++ CY
Sbjct: 306 WEQMWRLTAERELMSMLSTSLADQDIFNAFIKQNPVLVHQLPCFWNVQLSDHTRSEQ-CY 364

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
           TE++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC    +   T L
Sbjct: 365 TEVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSQASSESTVL 424

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           QQA L  L+E+D CYD RR ++   R HLYFL+YEY  + DG D+TLVAQLSMDR     
Sbjct: 425 QQA-LEELDEDDQCYDFRRERIMLHRVHLYFLQYEYSPTDDGTDITLVAQLSMDR----- 478

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                       LQM+E + KHWEGPISL LYMSDAEAQQFL Y
Sbjct: 479 ----------------------------LQMLEAICKHWEGPISLALYMSDAEAQQFLRY 510

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
           A  SE L++R+N+GYH+VYKEG FYP+N +RNVAL  V+TPYVFL D+DFLPM+GLY YL
Sbjct: 511 AQASEVLKNRKNVGYHIVYKEGQFYPVNLVRNVALRNVNTPYVFLTDVDFLPMYGLYDYL 570

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           + SI  +DM     KK LVVPAFET RYR +FP S                        H
Sbjct: 571 RKSIVQLDM--ANTKKALVVPAFETLRYRLSFPKSKAELLSMLDMGTLYTFRYHVWTKGH 628

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           APTN+++W  ATTPY++EW  DFEPY+V  RD P YD RFVGFGWNK
Sbjct: 629 APTNYAKWRTATTPYKVEWEADFEPYVVVRRDCPEYDQRFVGFGWNK 675



 Score =  259 bits (663), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 115/166 (69%), Positives = 137/166 (82%)

Query: 7   PVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTW 66
           P    CE++ VA VCAG+N++R +VTL+KSILF+R+NPLHFH ITDTVA  IL TLF +W
Sbjct: 124 PTAEKCELLHVACVCAGHNASRDVVTLVKSILFHRRNPLHFHFITDTVANQILSTLFQSW 183

Query: 67  SVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 126
            VP V+VSFY AD +  +VSWIPNKHYSG+YGL+KLTL K LP  L+K IVLDTD+ FAT
Sbjct: 184 MVPSVQVSFYDADELKSEVSWIPNKHYSGIYGLMKLTLTKALPSNLSKVIVLDTDITFAT 243

Query: 127 DIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           DIA+LWA+F +  ++Q IGLVENQSDWYLG LWKNHKPWPALGRGF
Sbjct: 244 DIAELWAIFRKFTEKQVIGLVENQSDWYLGNLWKNHKPWPALGRGF 289


>gi|52218928|ref|NP_001004538.1| glycosyltransferase-like protein LARGE2 [Danio rerio]
 gi|82083909|sp|Q66PG1.1|LARG2_DANRE RecName: Full=Glycosyltransferase-like protein LARGE2; AltName:
           Full=Glycosyltransferase-like 1B
 gi|51872297|gb|AAU12252.1| glycosyltransferase-like 1B [Danio rerio]
 gi|190336631|gb|AAI62103.1| Glycosyltransferase-like 1b [Danio rerio]
          Length = 750

 Score =  657 bits (1696), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 320/527 (60%), Positives = 379/527 (71%), Gaps = 65/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           P++   FY AD +  +VSWIPNKHYSG+YGL+KLTL K LP  L+K IVLDTD+ FATDI
Sbjct: 186 PSVQVSFYDADELKSEVSWIPNKHYSGIYGLMKLTLTKALPSNLSKVIVLDTDITFATDI 245

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LWA+F +  ++Q IGLVENQSDWYLG LWKNHKPWPALGRGFNTGVILL L +LR + 
Sbjct: 246 AELWAIFRKFTEKQVIGLVENQSDWYLGNLWKNHKPWPALGRGFNTGVILLYLERLRRMG 305

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   WR+ AE+ L++ L TSLADQDIFNA I ++P LV+ LPC WNVQLSD+TRS++ CY
Sbjct: 306 WEQMWRLTAERELMSMLSTSLADQDIFNAFIKQNPVLVHQLPCFWNVQLSDHTRSEQ-CY 364

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
           TE++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC    +   T L
Sbjct: 365 TEVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSQASSESTVL 424

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           QQA L  L+E+D CYD RR ++   R HLYFL+YEY  + DG D+TLVAQLSMDR     
Sbjct: 425 QQA-LEELDEDDQCYDFRRERIMLHRVHLYFLQYEYSPTDDGTDITLVAQLSMDR----- 478

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                       LQM+E + KHWEGPISL LYMSDAEAQQFL Y
Sbjct: 479 ----------------------------LQMLEAICKHWEGPISLALYMSDAEAQQFLRY 510

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
           A  SE L++R+N+GYH+VYKEG FYP+N +RNVAL  V+TPYVFL D+DFLPM+GLY YL
Sbjct: 511 AQASEVLKNRKNVGYHIVYKEGQFYPVNLVRNVALRNVNTPYVFLTDVDFLPMYGLYDYL 570

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           + SI  +DM     KK LVVPAFET RYR +FP S                        H
Sbjct: 571 RKSIVQLDM--ANTKKALVVPAFETLRYRLSFPKSKAELLSMLDMGTLYTFRYHVWTKGH 628

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           APTN+++W  ATTPY++EW  DFEPY+V  RD P YD RFVGFGWNK
Sbjct: 629 APTNYAKWRTATTPYKVEWEADFEPYVVVRRDCPEYDQRFVGFGWNK 675



 Score =  259 bits (663), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 115/166 (69%), Positives = 137/166 (82%)

Query: 7   PVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTW 66
           P    CE++ VA VCAG+N++R +VTL+KSILF+R+NPLHFH ITDTVA  IL TLF +W
Sbjct: 124 PTAEKCELLHVACVCAGHNASRDVVTLVKSILFHRRNPLHFHFITDTVANQILSTLFQSW 183

Query: 67  SVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 126
            VP V+VSFY AD +  +VSWIPNKHYSG+YGL+KLTL K LP  L+K IVLDTD+ FAT
Sbjct: 184 MVPSVQVSFYDADELKSEVSWIPNKHYSGIYGLMKLTLTKALPSNLSKVIVLDTDITFAT 243

Query: 127 DIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           DIA+LWA+F +  ++Q IGLVENQSDWYLG LWKNHKPWPALGRGF
Sbjct: 244 DIAELWAIFRKFTEKQVIGLVENQSDWYLGNLWKNHKPWPALGRGF 289


>gi|410912556|ref|XP_003969755.1| PREDICTED: glycosyltransferase-like protein LARGE2-like [Takifugu
           rubripes]
          Length = 748

 Score =  652 bits (1682), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 321/527 (60%), Positives = 379/527 (71%), Gaps = 67/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           P++   FY AD +  +VSWIPNKHYSG+YGL+KLTL K LP  L K IVLDTD+ FATDI
Sbjct: 186 PSVQVSFYDADELKSEVSWIPNKHYSGIYGLMKLTLTKALPSDLRKVIVLDTDITFATDI 245

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LW +F +  ++Q IGLVENQSDWYLG LWKNHKPWPALGRGFNTGVILL L +LR I 
Sbjct: 246 AELWGIFRKFTEKQVIGLVENQSDWYLGNLWKNHKPWPALGRGFNTGVILLYLERLRRIG 305

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   WR+ AEK L++ L TSLADQDIFNA I ++P LV+ LPC WNVQLSD+TRS++ CY
Sbjct: 306 WEQMWRLTAEKELMSMLSTSLADQDIFNAFIKQNPVLVHQLPCFWNVQLSDHTRSEQ-CY 364

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQTTLQ--- 402
           TE++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC  +Q++ +   
Sbjct: 365 TEVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCP-SQSSPENIQ 423

Query: 403 -QAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
            QA L  L+E+D CYD RR +LT+ R HLYFL+YEY  + D  D+TLVAQLSMDR     
Sbjct: 424 LQAALEELDEDDQCYDFRRERLTAHRVHLYFLQYEYTPTED--DITLVAQLSMDR----- 476

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                       LQM+E + KHWEGPISL LYMSDAEAQQFL Y
Sbjct: 477 ----------------------------LQMLEAICKHWEGPISLALYMSDAEAQQFLRY 508

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
           A  SE L++R+N+GYH+VYKEG FYP+N +RNVAL    TPYVFL D+DFLPM+GLY YL
Sbjct: 509 AQASEVLKNRKNVGYHIVYKEGQFYPVNLVRNVALKNARTPYVFLTDVDFLPMYGLYEYL 568

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           + S+  MDM  H  KK LVVPAFET RYR +FP S                        H
Sbjct: 569 RKSVLQMDM-AH-TKKALVVPAFETLRYRLSFPKSKAELLSMLDMGTLYTFRYHVWPKGH 626

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           APTN+++W  ATTPY++EW PDFEPY+V  RD P YD RFVGFGWNK
Sbjct: 627 APTNYAKWRTATTPYKVEWEPDFEPYVVVRRDCPEYDQRFVGFGWNK 673



 Score =  257 bits (656), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 113/166 (68%), Positives = 136/166 (81%)

Query: 7   PVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTW 66
           PV+  CE+I VA VCAG+N++R +VTL+KS+LF+R+NPLHFH ITD VA  IL +LF +W
Sbjct: 124 PVVQKCELIHVACVCAGHNASRDVVTLVKSVLFHRRNPLHFHFITDAVANQILSSLFQSW 183

Query: 67  SVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 126
            VP V+VSFY AD +  +VSWIPNKHYSG+YGL+KLTL K LP  L K IVLDTD+ FAT
Sbjct: 184 MVPSVQVSFYDADELKSEVSWIPNKHYSGIYGLMKLTLTKALPSDLRKVIVLDTDITFAT 243

Query: 127 DIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           DIA+LW +F +  ++Q IGLVENQSDWYLG LWKNHKPWPALGRGF
Sbjct: 244 DIAELWGIFRKFTEKQVIGLVENQSDWYLGNLWKNHKPWPALGRGF 289


>gi|298684035|gb|ADI96197.1| LARGE-1 [Mastomys erythroleucus]
          Length = 671

 Score =  652 bits (1682), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 315/527 (59%), Positives = 379/527 (71%), Gaps = 65/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+G  FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ FATDI
Sbjct: 128 PAVGVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 187

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR + 
Sbjct: 188 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 247

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   WR+ AE+ L+  L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 248 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 306

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
            +++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC    ++    L
Sbjct: 307 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSETDVNSENL 366

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           Q+ +LS L+E+D CY+ RR + T  RTHLYFL YE+E S+D  DVTLVAQLSMDR     
Sbjct: 367 QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEFEPSADNTDVTLVAQLSMDR----- 420

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                       LQM+E + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 421 ----------------------------LQMLEAICKHWEGPISLALYLSDAEAQQFLRY 452

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
           A  SE L SR+N+GYH+VYKEG FYP+N LRNVA+  +STPY+FL DIDFLPM+GLY YL
Sbjct: 453 AQGSEVLMSRQNVGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLYEYL 512

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           + S+  +D+     KK ++VPAFET RYR +FP S                        H
Sbjct: 513 RKSVIQLDL--ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVXXKGH 570

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           APTNF++W  ATTPYQ+EW  DFEPY+V  RD P YD RFVGFGWNK
Sbjct: 571 APTNFAKWRTATTPYQVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 617



 Score =  254 bits (650), Expect = 8e-65,   Method: Compositional matrix adjust.
 Identities = 110/168 (65%), Positives = 136/168 (80%)

Query: 5   EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
           ++P +  CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A  IL TLF 
Sbjct: 64  QQPAVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 123

Query: 65  TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
           TW VP V V FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ F
Sbjct: 124 TWMVPAVGVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 183

Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 184 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 231


>gi|298684009|gb|ADI96184.1| LARGE-1 [Mastomys kollmannspergeri]
          Length = 671

 Score =  652 bits (1681), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 315/527 (59%), Positives = 379/527 (71%), Gaps = 65/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+G  FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ FATDI
Sbjct: 128 PAVGVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 187

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR + 
Sbjct: 188 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 247

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   WR+ AE+ L+  L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 248 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 306

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
            +++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC    ++    L
Sbjct: 307 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSETDVNSENL 366

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           Q+ +LS L+E+D CY+ RR + T  RTHLYFL YE+E S+D  DVTLVAQLSMDR     
Sbjct: 367 QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEFEPSADNTDVTLVAQLSMDR----- 420

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                       LQM+E + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 421 ----------------------------LQMLEAICKHWEGPISLALYLSDAEAQQFLRY 452

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
           A  SE L SR+N+GYH+VYKEG FYP+N LRNVA+  +STPY+FL DIDFLPM+GLY YL
Sbjct: 453 AQGSEVLMSRQNVGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLYEYL 512

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           + S+  +D+     KK ++VPAFET RYR +FP S                        H
Sbjct: 513 RKSVIQLDL--ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGH 570

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           APTNF++W  ATTPYQ+EW  DFEPY+V  RD P YD RFVGFGWNK
Sbjct: 571 APTNFAKWRTATTPYQVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 617



 Score =  250 bits (638), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 109/168 (64%), Positives = 135/168 (80%)

Query: 5   EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
           ++P +  CE I VAIVCAGYN++  +VTL+KS+LF+R+NPLHFHLI D++A  IL TLF 
Sbjct: 64  QQPDVQKCETIHVAIVCAGYNASXXVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 123

Query: 65  TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
           TW VP V V FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ F
Sbjct: 124 TWMVPAVGVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 183

Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 184 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 231


>gi|298684023|gb|ADI96191.1| LARGE-1 [Mastomys natalensis]
          Length = 671

 Score =  651 bits (1680), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 315/527 (59%), Positives = 379/527 (71%), Gaps = 65/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+G  FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ FATDI
Sbjct: 128 PAVGVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 187

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR + 
Sbjct: 188 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 247

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   WR+ AE+ L+  L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 248 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 306

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
            +++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC    ++    L
Sbjct: 307 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSETDVNSENL 366

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           Q+ +LS L+E+D CY+ RR + T  RTHLYFL YE+E S+D  DVTLVAQLSMDR     
Sbjct: 367 QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEFEPSADNTDVTLVAQLSMDR----- 420

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                       LQM+E + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 421 ----------------------------LQMLEAICKHWEGPISLALYLSDAEAQQFLRY 452

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
           A  SE L SR+N+GYH+VYKEG FYP+N LRNVA+  +STPY+FL DIDFLPM+GLY YL
Sbjct: 453 AQGSEVLMSRQNVGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLYEYL 512

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           + S+  +D+     KK ++VPAFET RYR +FP S                        H
Sbjct: 513 RKSVIQLDL--ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGH 570

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           APTNF++W  ATTPYQ+EW  DFEPY+V  RD P YD RFVGFGWNK
Sbjct: 571 APTNFAKWRTATTPYQVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 617



 Score =  254 bits (649), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 110/168 (65%), Positives = 136/168 (80%)

Query: 5   EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
           ++P +  CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A  IL TLF 
Sbjct: 64  QQPAVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 123

Query: 65  TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
           TW VP V V FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ F
Sbjct: 124 TWMVPAVGVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 183

Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 184 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 231


>gi|298684019|gb|ADI96189.1| LARGE-1 [Mastomys natalensis]
 gi|298684021|gb|ADI96190.1| LARGE-1 [Mastomys natalensis]
          Length = 671

 Score =  651 bits (1680), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 315/527 (59%), Positives = 379/527 (71%), Gaps = 65/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+G  FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ FATDI
Sbjct: 128 PAVGVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 187

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR + 
Sbjct: 188 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 247

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   WR+ AE+ L+  L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 248 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 306

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
            +++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC    ++    L
Sbjct: 307 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSETDVNSENL 366

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           Q+ +LS L+E+D CY+ RR + T  RTHLYFL YE+E S+D  DVTLVAQLSMDR     
Sbjct: 367 QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEFEPSADNTDVTLVAQLSMDR----- 420

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                       LQM+E + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 421 ----------------------------LQMLEAICKHWEGPISLALYLSDAEAQQFLRY 452

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
           A  SE L SR+N+GYH+VYKEG FYP+N LRNVA+  +STPY+FL DIDFLPM+GLY YL
Sbjct: 453 AQGSEVLMSRQNVGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLYEYL 512

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           + S+  +D+     KK ++VPAFET RYR +FP S                        H
Sbjct: 513 RKSVIQLDL--ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGH 570

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           APTNF++W  ATTPYQ+EW  DFEPY+V  RD P YD RFVGFGWNK
Sbjct: 571 APTNFAKWRTATTPYQVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 617



 Score =  254 bits (649), Expect = 9e-65,   Method: Compositional matrix adjust.
 Identities = 110/168 (65%), Positives = 136/168 (80%)

Query: 5   EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
           ++P +  CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A  IL TLF 
Sbjct: 64  QQPAVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 123

Query: 65  TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
           TW VP V V FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ F
Sbjct: 124 TWMVPAVGVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 183

Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 184 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 231


>gi|298684033|gb|ADI96196.1| LARGE-1 [Mastomys erythroleucus]
          Length = 671

 Score =  651 bits (1680), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 315/527 (59%), Positives = 379/527 (71%), Gaps = 65/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+G  FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ FATDI
Sbjct: 128 PAVGVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 187

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR + 
Sbjct: 188 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 247

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   WR+ AE+ L+  L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 248 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 306

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
            +++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC    ++    L
Sbjct: 307 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSETDVNSENL 366

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           Q+ +LS L+E+D CY+ RR + T  RTHLYFL YE+E S+D  DVTLVAQLSMDR     
Sbjct: 367 QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEFEPSADNTDVTLVAQLSMDR----- 420

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                       LQM+E + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 421 ----------------------------LQMLEAICKHWEGPISLALYLSDAEAQQFLRY 452

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
           A  SE L SR+N+GYH+VYKEG FYP+N LRNVA+  +STPY+FL DIDFLPM+GLY YL
Sbjct: 453 AQGSEVLMSRQNVGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLYEYL 512

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           + S+  +D+     KK ++VPAFET RYR +FP S                        H
Sbjct: 513 RKSVIQLDL--ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGH 570

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           APTNF++W  ATTPYQ+EW  DFEPY+V  RD P YD RFVGFGWNK
Sbjct: 571 APTNFAKWRTATTPYQVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 617



 Score =  252 bits (644), Expect = 4e-64,   Method: Compositional matrix adjust.
 Identities = 109/168 (64%), Positives = 135/168 (80%)

Query: 5   EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
           ++P +  CE I VA VCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A  IL TLF 
Sbjct: 64  QQPAVEKCETIHVAXVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 123

Query: 65  TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
           TW VP V V FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ F
Sbjct: 124 TWMVPAVGVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 183

Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 184 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 231


>gi|348530274|ref|XP_003452636.1| PREDICTED: glycosyltransferase-like protein LARGE2-like
           [Oreochromis niloticus]
          Length = 750

 Score =  650 bits (1678), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 318/527 (60%), Positives = 377/527 (71%), Gaps = 65/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           P++   FY AD +  +VSWIPNKHYSG+YGL+KLTL K LP  L+K IVLDTD+ FATDI
Sbjct: 186 PSVQVSFYDADELKSEVSWIPNKHYSGIYGLMKLTLTKALPSDLSKVIVLDTDITFATDI 245

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LW +F +   +Q IGLVENQSDWYLG LWKNHKPWPALGRGFNTGVILL L +LR I 
Sbjct: 246 AELWGIFRKFTDKQVIGLVENQSDWYLGNLWKNHKPWPALGRGFNTGVILLYLERLRRIG 305

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   WR+ AE+ L++ L TSLADQDIFNA I ++P LV+ LPC WNVQLSD+TRS++ CY
Sbjct: 306 WEQMWRLTAERELMSMLSTSLADQDIFNAFIKQNPVLVHQLPCFWNVQLSDHTRSEQ-CY 364

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQTTLQ--- 402
           TE++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC  TQ + +   
Sbjct: 365 TEVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCP-TQASPESIK 423

Query: 403 -QAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
            Q  L  L+E+D CYD RR +LT  R HLYFL+Y+Y  + D  D+TLVAQLSMDR     
Sbjct: 424 LQKALEELDEDDQCYDFRRERLTVHRVHLYFLQYDYTPTEDNTDITLVAQLSMDR----- 478

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                       LQM+E + KHWEGPISL LYMSDAEAQQFL Y
Sbjct: 479 ----------------------------LQMLEAICKHWEGPISLALYMSDAEAQQFLRY 510

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
           A  SE L++R+N+GYH+VYKEG FYP+N +RNVAL   +TPYVFL D+DFLPM+GLY YL
Sbjct: 511 AQASEVLKNRKNVGYHIVYKEGQFYPVNLVRNVALKNANTPYVFLTDVDFLPMYGLYDYL 570

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           + S+  MDM  H  KK LVVPAFET RYR +FP S                        H
Sbjct: 571 RKSVVQMDM-AH-TKKALVVPAFETLRYRLSFPKSKAELLSMLDMGTLYTFRYHVWPKGH 628

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           APTN+++W  ATTPY++EW PDFEPY+V  R+ P YD RFVGFGWNK
Sbjct: 629 APTNYAKWRTATTPYKVEWEPDFEPYVVVRRECPEYDQRFVGFGWNK 675



 Score =  257 bits (656), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 113/166 (68%), Positives = 136/166 (81%)

Query: 7   PVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTW 66
           P +  CE+I VA VCAG+N++R +VTL+KS+LF+R+NPLHFH ITDTVA  IL +LF +W
Sbjct: 124 PTVQKCELIHVACVCAGHNASRDVVTLVKSVLFHRRNPLHFHFITDTVANRILSSLFQSW 183

Query: 67  SVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 126
            VP V+VSFY AD +  +VSWIPNKHYSG+YGL+KLTL K LP  L+K IVLDTD+ FAT
Sbjct: 184 MVPSVQVSFYDADELKSEVSWIPNKHYSGIYGLMKLTLTKALPSDLSKVIVLDTDITFAT 243

Query: 127 DIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           DIA+LW +F +   +Q IGLVENQSDWYLG LWKNHKPWPALGRGF
Sbjct: 244 DIAELWGIFRKFTDKQVIGLVENQSDWYLGNLWKNHKPWPALGRGF 289


>gi|298684031|gb|ADI96195.1| LARGE-1 [Mastomys erythroleucus]
          Length = 671

 Score =  650 bits (1678), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 315/527 (59%), Positives = 379/527 (71%), Gaps = 65/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+G  FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ FATDI
Sbjct: 128 PAVGVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 187

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR + 
Sbjct: 188 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 247

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   WR+ AE+ L+  L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 248 WEQXWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 306

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
            +++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC    ++    L
Sbjct: 307 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSETDVNSENL 366

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           Q+ +LS L+E+D CY+ RR + T  RTHLYFL YE+E S+D  DVTLVAQLSMDR     
Sbjct: 367 QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEFEPSADNTDVTLVAQLSMDR----- 420

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                       LQM+E + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 421 ----------------------------LQMLEAICKHWEGPISLALYLSDAEAQQFLRY 452

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
           A  SE L SR+N+GYH+VYKEG FYP+N LRNVA+  +STPY+FL DIDFLPM+GLY YL
Sbjct: 453 AQGSEVLMSRQNVGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLYEYL 512

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           + S+  +D+     KK ++VPAFET RYR +FP S                        H
Sbjct: 513 RKSVIQLDL--ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGH 570

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           APTNF++W  ATTPYQ+EW  DFEPY+V  RD P YD RFVGFGWNK
Sbjct: 571 APTNFAKWRTATTPYQVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 617



 Score =  254 bits (649), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 110/168 (65%), Positives = 136/168 (80%)

Query: 5   EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
           ++P +  CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A  IL TLF 
Sbjct: 64  QQPAVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 123

Query: 65  TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
           TW VP V V FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ F
Sbjct: 124 TWMVPAVGVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 183

Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 184 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 231


>gi|260805248|ref|XP_002597499.1| hypothetical protein BRAFLDRAFT_281013 [Branchiostoma floridae]
 gi|229282764|gb|EEN53511.1| hypothetical protein BRAFLDRAFT_281013 [Branchiostoma floridae]
          Length = 677

 Score =  650 bits (1676), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 323/525 (61%), Positives = 382/525 (72%), Gaps = 63/525 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           P +   FY A+++ E+VSWIPNKHYSGVYGL+KL L + LP  L K IVLDTDV FATDI
Sbjct: 117 PGVTANFYSAENLKEEVSWIPNKHYSGVYGLMKLVLTRALPTDLEKVIVLDTDVTFATDI 176

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LW LF + + +++IGLVENQSDWYLGK+WKNH+PWPALGRGFNTGVILL L KLR + 
Sbjct: 177 AELWKLFKKFKGKKSIGLVENQSDWYLGKIWKNHRPWPALGRGFNTGVILLHLEKLRGMK 236

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   WR+ AE+ L++ L TSLADQDIFNA+I +HPYLV+ LPC WNVQLSDNT+S++ CY
Sbjct: 237 WMQMWRLTAERELMSMLSTSLADQDIFNAVIKQHPYLVHKLPCAWNVQLSDNTKSEQ-CY 295

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQTTLQ-QA 404
           TE+TDLK+IHWNSPKKL+VKNKH+EFF+N+YLTFLEYDGNLLRRELFGC   Q   +   
Sbjct: 296 TEVTDLKVIHWNSPKKLRVKNKHVEFFKNMYLTFLEYDGNLLRRELFGCEKVQHLDEVDQ 355

Query: 405 ELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEINP 464
           +L+ L+E+D CYD RR ++   RTHLYFL+Y YE S D NDVTLVAQLSMDR        
Sbjct: 356 KLNALDEDDQCYDFRRERVLMHRTHLYFLDYMYE-SVDDNDVTLVAQLSMDR-------- 406

Query: 465 NNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGN 524
                                    LQM+E L +HWEGPISL LYMSDAEAQQFL YA  
Sbjct: 407 -------------------------LQMLEALCRHWEGPISLALYMSDAEAQQFLRYALG 441

Query: 525 SEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSS 584
           SE L  R+N+GYH+VYKEG FYP+N LRN+AL QV+TPYVFL+DIDFLPM+ LYPYLK +
Sbjct: 442 SETLMQRKNVGYHIVYKEGQFYPVNLLRNIALQQVNTPYVFLMDIDFLPMYNLYPYLKKT 501

Query: 585 IRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------HAPT 620
           +  MDM     KK LVVPAFET RYR +FP S                        HAPT
Sbjct: 502 VGMMDMRRQ--KKALVVPAFETLRYRLSFPKSKAELLSMLDMGTLYTFRYHVWQKGHAPT 559

Query: 621 NFSRWVNATTPYQIEWAPDFEPYIVAHRD-LPRYDTRFVGFGWNK 664
           N+++W  ATTPY++ W PDFEPYIV  +D LP YDTRFVGFGWNK
Sbjct: 560 NYAKWRTATTPYRVNWEPDFEPYIVVRKDQLPEYDTRFVGFGWNK 604



 Score =  247 bits (631), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 115/167 (68%), Positives = 138/167 (82%)

Query: 6   EPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFST 65
           E  +P CE I+VAIVCAGYN+TR +VTL+KSILFYRKNPLHFH ++D  A  ILQTL  T
Sbjct: 54  EDQVPKCETIEVAIVCAGYNATRGVVTLVKSILFYRKNPLHFHFMSDRAATLILQTLLRT 113

Query: 66  WSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFA 125
           W+VP V  +FY A+++ E+VSWIPNKHYSGVYGL+KL L + LP  L K IVLDTDV FA
Sbjct: 114 WNVPGVTANFYSAENLKEEVSWIPNKHYSGVYGLMKLVLTRALPTDLEKVIVLDTDVTFA 173

Query: 126 TDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           TDIA+LW LF + + +++IGLVENQSDWYLGK+WKNH+PWPALGRGF
Sbjct: 174 TDIAELWKLFKKFKGKKSIGLVENQSDWYLGKIWKNHRPWPALGRGF 220


>gi|38494377|gb|AAH61506.1| Large protein, partial [Mus musculus]
          Length = 732

 Score =  649 bits (1674), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 314/527 (59%), Positives = 378/527 (71%), Gaps = 65/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ FATDI
Sbjct: 168 PAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 227

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR + 
Sbjct: 228 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 287

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   WR+ AE+ L+  L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 288 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 346

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
            +++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC    ++    L
Sbjct: 347 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSETDVNNENL 406

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           Q+ +LS L+E+D CY+ RR + T  RTHLYFL YE+E S+D  DVTLVAQLSMDR     
Sbjct: 407 QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEFEPSADNTDVTLVAQLSMDR----- 460

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                       LQM+E + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 461 ----------------------------LQMLEAICKHWEGPISLALYLSDAEAQQFLRY 492

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
           A  SE L SR+N+GYH+VYKEG FYP+N LRNVA+  +STPY+FL DIDFLPM+GLY YL
Sbjct: 493 AQGSEVLMSRQNVGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLYEYL 552

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           + S+  +D+     KK ++VPAFET RYR +FP S                        H
Sbjct: 553 RKSVIQLDL--ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGH 610

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           APTNF++W  ATTPYQ+EW  DFEPY+V  RD P YD RFVGFGWNK
Sbjct: 611 APTNFAKWRTATTPYQVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 657



 Score =  256 bits (653), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 110/168 (65%), Positives = 136/168 (80%)

Query: 5   EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
           ++P +  CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A  IL TLF 
Sbjct: 104 QQPAVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 163

Query: 65  TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
           TW VP V V FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ F
Sbjct: 164 TWMVPAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 223

Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 224 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 271


>gi|326912113|ref|XP_003202398.1| PREDICTED: glycosyltransferase-like protein LARGE1-like, partial
           [Meleagris gallopavo]
          Length = 720

 Score =  649 bits (1674), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 313/527 (59%), Positives = 378/527 (71%), Gaps = 65/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ FATDI
Sbjct: 156 PAVRIDFYDADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPSNLERVIVLDTDITFATDI 215

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR + 
Sbjct: 216 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 275

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   WR+ AE+ L++ L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 276 WEQMWRLTAERELMSMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 334

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
            +++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC    ++    L
Sbjct: 335 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEADVNSENL 394

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           Q+ +LS L+E+D CY+ RR + T  RTHLYFL YEYE +SD  DVTLVAQLSMDR     
Sbjct: 395 QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEYEPASDNTDVTLVAQLSMDR----- 448

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                       LQM+E + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 449 ----------------------------LQMLEAICKHWEGPISLALYLSDAEAQQFLRY 480

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
           A  SE L SR N+GYH+VYKEG FYP+N LRNVA+  +STPY+FL DIDFLPM+GLY YL
Sbjct: 481 AQGSEVLMSRHNVGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLYEYL 540

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           + S+  +D+     KK L++PAFET RYR +FP S                        H
Sbjct: 541 RKSVTQLDL--ANTKKALIIPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGH 598

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           APTNF++W  ATTPY++EW  DFEPY+V  +D P YD RFVGFGWNK
Sbjct: 599 APTNFAKWRTATTPYRVEWEADFEPYVVVRKDCPEYDRRFVGFGWNK 645



 Score =  255 bits (651), Expect = 6e-65,   Method: Compositional matrix adjust.
 Identities = 109/168 (64%), Positives = 135/168 (80%)

Query: 5   EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
           ++P +  CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D +A  IL TLF 
Sbjct: 92  QQPAVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADAIAKQILATLFQ 151

Query: 65  TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
           TW VP V + FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ F
Sbjct: 152 TWMVPAVRIDFYDADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPSNLERVIVLDTDITF 211

Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 212 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 259


>gi|298684015|gb|ADI96187.1| LARGE-1 [Mastomys kollmannspergeri]
          Length = 671

 Score =  649 bits (1674), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 314/527 (59%), Positives = 378/527 (71%), Gaps = 65/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+G  FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ FATDI
Sbjct: 128 PAVGVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 187

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR + 
Sbjct: 188 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 247

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   WR+ AE+ L+  L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 248 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 306

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
            +++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC    ++    L
Sbjct: 307 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSETDVNSENL 366

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           Q+ +LS L+E+D CY+ RR + T  RTHLYFL YE+E S+D  DVTLVAQLSMDR     
Sbjct: 367 QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEFEPSADNTDVTLVAQLSMDR----- 420

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                       LQM+  + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 421 ----------------------------LQMLXAICKHWEGPISLALYLSDAEAQQFLRY 452

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
           A  SE L SR+N+GYH+VYKEG FYP+N LRNVA+  +STPY+FL DIDFLPM+GLY YL
Sbjct: 453 AQGSEVLMSRQNVGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLYEYL 512

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           + S+  +D+     KK ++VPAFET RYR +FP S                        H
Sbjct: 513 RKSVIQLDL--ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGH 570

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           APTNF++W  ATTPYQ+EW  DFEPY+V  RD P YD RFVGFGWNK
Sbjct: 571 APTNFAKWRTATTPYQVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 617



 Score =  254 bits (649), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 110/168 (65%), Positives = 136/168 (80%)

Query: 5   EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
           ++P +  CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A  IL TLF 
Sbjct: 64  QQPAVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 123

Query: 65  TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
           TW VP V V FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ F
Sbjct: 124 TWMVPAVGVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 183

Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 184 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 231


>gi|449276407|gb|EMC84939.1| Glycosyltransferase-like protein LARGE1, partial [Columba livia]
          Length = 722

 Score =  649 bits (1674), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 314/527 (59%), Positives = 378/527 (71%), Gaps = 65/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ FATDI
Sbjct: 158 PAVRIDFYDADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPSNLERVIVLDTDITFATDI 217

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR + 
Sbjct: 218 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 277

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   WR+ AE+ L++ L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 278 WEQMWRLTAERELMSMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 336

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
            +++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC    ++    L
Sbjct: 337 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEADVNSENL 396

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           Q+ +LS L+E+D CY+ RR + T  RTHLYFL YEYE +SD  DVTLVAQLSMDR     
Sbjct: 397 QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEYEPASDNTDVTLVAQLSMDR----- 450

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                       LQM+E + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 451 ----------------------------LQMLEAICKHWEGPISLALYLSDAEAQQFLRY 482

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
           A  SE L SR N+GYH+VYKEG FYP+N LRNVA+  +STPY+FL DIDFLPM+GLY YL
Sbjct: 483 AQGSEVLMSRHNVGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLYEYL 542

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           + S+  +D+     KK L+VPAFET RYR +FP S                        H
Sbjct: 543 RKSVTQLDL--ANTKKALIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGH 600

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           APTNF++W  ATTPY++EW  DFEPY+V  +D P YD RFVGFGWNK
Sbjct: 601 APTNFAKWRTATTPYRVEWEADFEPYVVVRKDCPEYDRRFVGFGWNK 647



 Score =  255 bits (651), Expect = 6e-65,   Method: Compositional matrix adjust.
 Identities = 109/168 (64%), Positives = 135/168 (80%)

Query: 5   EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
           ++P +  CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D +A  IL TLF 
Sbjct: 94  QQPAVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADAIAKQILATLFQ 153

Query: 65  TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
           TW VP V + FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ F
Sbjct: 154 TWMVPAVRIDFYDADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPSNLERVIVLDTDITF 213

Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 214 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 261


>gi|6754506|ref|NP_034817.1| glycosyltransferase-like protein LARGE1 [Mus musculus]
 gi|22001690|sp|Q9Z1M7.1|LARGE_MOUSE RecName: Full=Glycosyltransferase-like protein LARGE1; AltName:
           Full=Acetylglucosaminyltransferase-like 1A
 gi|3954978|emb|CAA06945.1| acetylglucosaminyltransferase-like protein [Mus musculus]
 gi|116138324|gb|AAI25490.1| Like-glycosyltransferase [Mus musculus]
 gi|116138711|gb|AAI25488.1| Like-glycosyltransferase [Mus musculus]
 gi|148678872|gb|EDL10819.1| like-glycosyltransferase [Mus musculus]
          Length = 756

 Score =  649 bits (1673), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 314/527 (59%), Positives = 378/527 (71%), Gaps = 65/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ FATDI
Sbjct: 192 PAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 251

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR + 
Sbjct: 252 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 311

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   WR+ AE+ L+  L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 312 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 370

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
            +++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC    ++    L
Sbjct: 371 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSETDVNNENL 430

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           Q+ +LS L+E+D CY+ RR + T  RTHLYFL YE+E S+D  DVTLVAQLSMDR     
Sbjct: 431 QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEFEPSADNTDVTLVAQLSMDR----- 484

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                       LQM+E + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 485 ----------------------------LQMLEAICKHWEGPISLALYLSDAEAQQFLRY 516

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
           A  SE L SR+N+GYH+VYKEG FYP+N LRNVA+  +STPY+FL DIDFLPM+GLY YL
Sbjct: 517 AQGSEVLMSRQNVGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLYEYL 576

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           + S+  +D+     KK ++VPAFET RYR +FP S                        H
Sbjct: 577 RKSVIQLDL--ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGH 634

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           APTNF++W  ATTPYQ+EW  DFEPY+V  RD P YD RFVGFGWNK
Sbjct: 635 APTNFAKWRTATTPYQVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 681



 Score =  255 bits (652), Expect = 5e-65,   Method: Compositional matrix adjust.
 Identities = 110/168 (65%), Positives = 136/168 (80%)

Query: 5   EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
           ++P +  CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A  IL TLF 
Sbjct: 128 QQPAVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 187

Query: 65  TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
           TW VP V V FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ F
Sbjct: 188 TWMVPAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 247

Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 248 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 295


>gi|224095339|ref|XP_002199503.1| PREDICTED: glycosyltransferase-like protein LARGE1 [Taeniopygia
           guttata]
          Length = 756

 Score =  648 bits (1672), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 314/527 (59%), Positives = 378/527 (71%), Gaps = 65/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ FATDI
Sbjct: 192 PAVRIDFYDADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPSNLERVIVLDTDITFATDI 251

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR + 
Sbjct: 252 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 311

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   WR+ AE+ L++ L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 312 WEQMWRLTAERELMSMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 370

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
            +++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC    ++    L
Sbjct: 371 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEADVNSENL 430

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           Q+ +LS L+E+D CY+ RR + T  RTHLYFL YEYE +SD  DVTLVAQLSMDR     
Sbjct: 431 QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEYEPASDNTDVTLVAQLSMDR----- 484

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                       LQM+E + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 485 ----------------------------LQMLEAICKHWEGPISLALYLSDAEAQQFLRY 516

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
           A  SE L SR N+GYH+VYKEG FYP+N LRNVA+  +STPY+FL DIDFLPM+GLY YL
Sbjct: 517 AQGSEVLMSRHNVGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLYEYL 576

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           + S+  +D+     KK L+VPAFET RYR +FP S                        H
Sbjct: 577 RKSVIQLDL--ANTKKALIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGH 634

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           APTNF++W  ATTPY++EW  DFEPY+V  +D P YD RFVGFGWNK
Sbjct: 635 APTNFAKWRTATTPYRVEWEADFEPYVVVRKDCPEYDRRFVGFGWNK 681



 Score =  256 bits (654), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 110/168 (65%), Positives = 136/168 (80%)

Query: 5   EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
           ++PV+  CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D +A  IL TLF 
Sbjct: 128 QQPVVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADAIAKQILATLFQ 187

Query: 65  TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
           TW VP V + FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ F
Sbjct: 188 TWMVPAVRIDFYDADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPSNLERVIVLDTDITF 247

Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 248 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 295


>gi|28972313|dbj|BAC65610.1| mKIAA0609 protein [Mus musculus]
          Length = 740

 Score =  648 bits (1672), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 314/527 (59%), Positives = 378/527 (71%), Gaps = 65/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ FATDI
Sbjct: 176 PAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 235

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR + 
Sbjct: 236 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 295

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   WR+ AE+ L+  L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 296 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 354

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
            +++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC    ++    L
Sbjct: 355 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSETDVNNENL 414

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           Q+ +LS L+E+D CY+ RR + T  RTHLYFL YE+E S+D  DVTLVAQLSMDRLQM+E
Sbjct: 415 QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEFEPSADNTDVTLVAQLSMDRLQMLE 473

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                             + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 474 A---------------------------------ICKHWEGPISLALYLSDAEAQQFLRY 500

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
           A  SE L SR+N+GYH+VYKEG FYP+N LRNVA+  +STPY+FL DIDFLPM+GLY YL
Sbjct: 501 AQGSEVLMSRQNVGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLYEYL 560

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           + S+  +D+     KK ++VPAFET RYR +FP S                        H
Sbjct: 561 RKSVIQLDL--ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGH 618

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           APTNF++W  ATTPYQ+EW  DFEPY+V  RD P YD RFVGFGWNK
Sbjct: 619 APTNFAKWRTATTPYQVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 665



 Score =  255 bits (652), Expect = 5e-65,   Method: Compositional matrix adjust.
 Identities = 110/168 (65%), Positives = 136/168 (80%)

Query: 5   EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
           ++P +  CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A  IL TLF 
Sbjct: 112 QQPAVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 171

Query: 65  TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
           TW VP V V FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ F
Sbjct: 172 TWMVPAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 231

Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 232 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 279


>gi|298684045|gb|ADI96202.1| LARGE-1 [Mus musculus]
          Length = 671

 Score =  648 bits (1672), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 314/527 (59%), Positives = 378/527 (71%), Gaps = 65/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ FATDI
Sbjct: 128 PAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 187

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR + 
Sbjct: 188 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 247

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   WR+ AE+ L+  L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 248 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 306

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
            +++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC    ++    L
Sbjct: 307 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSETDVNNENL 366

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           Q+ +LS L+E+D CY+ RR + T  RTHLYFL YE+E S+D  DVTLVAQLSMDR     
Sbjct: 367 QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEFEPSADNTDVTLVAQLSMDR----- 420

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                       LQM+E + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 421 ----------------------------LQMLEAICKHWEGPISLALYLSDAEAQQFLRY 452

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
           A  SE L SR+N+GYH+VYKEG FYP+N LRNVA+  +STPY+FL DIDFLPM+GLY YL
Sbjct: 453 AQGSEVLMSRQNVGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLYEYL 512

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           + S+  +D+     KK ++VPAFET RYR +FP S                        H
Sbjct: 513 RKSVIQLDL--ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGH 570

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           APTNF++W  ATTPYQ+EW  DFEPY+V  RD P YD RFVGFGWNK
Sbjct: 571 APTNFAKWRTATTPYQVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 617



 Score =  255 bits (651), Expect = 6e-65,   Method: Compositional matrix adjust.
 Identities = 110/168 (65%), Positives = 136/168 (80%)

Query: 5   EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
           ++P +  CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A  IL TLF 
Sbjct: 64  QQPAVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 123

Query: 65  TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
           TW VP V V FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ F
Sbjct: 124 TWMVPAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 183

Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 184 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 231


>gi|355563615|gb|EHH20177.1| hypothetical protein EGK_02976, partial [Macaca mulatta]
          Length = 721

 Score =  647 bits (1670), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 313/527 (59%), Positives = 377/527 (71%), Gaps = 65/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ FATDI
Sbjct: 157 PAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 216

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR + 
Sbjct: 217 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 276

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   WR+ AE+ L+  L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 277 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 335

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
            +++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC    ++    L
Sbjct: 336 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEADVNSENL 395

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           Q+ +LS L+E+D CY+ RR + T  RTHLYFL YEYE ++D  DVTLVAQLSMDR     
Sbjct: 396 QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEYEPAADSTDVTLVAQLSMDR----- 449

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                       LQM+E + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 450 ----------------------------LQMLEAICKHWEGPISLALYLSDAEAQQFLRY 481

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
           A  SE L SR N+GYH+VYKEG FYP+N LRNVA+  +STPY+FL DIDFLPM+GLY YL
Sbjct: 482 AQGSEVLMSRHNVGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLYEYL 541

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           + S+  +D+     KK ++VPAFET RYR +FP S                        H
Sbjct: 542 RKSVIQLDL--ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGH 599

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           APTNF++W  ATTPY++EW  DFEPY+V  RD P YD RFVGFGWNK
Sbjct: 600 APTNFAKWRTATTPYRVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 646



 Score =  257 bits (657), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 111/168 (66%), Positives = 137/168 (81%)

Query: 5   EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
           ++PV+  CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A  IL TLF 
Sbjct: 93  QQPVVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 152

Query: 65  TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
           TW VP V V FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ F
Sbjct: 153 TWMVPAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 212

Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 213 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 260


>gi|298684013|gb|ADI96186.1| LARGE-1 [Mastomys kollmannspergeri]
          Length = 671

 Score =  647 bits (1670), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 313/527 (59%), Positives = 377/527 (71%), Gaps = 65/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+G  FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ FATDI
Sbjct: 128 PAVGVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 187

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR + 
Sbjct: 188 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 247

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   WR+ AE+ L+  L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 248 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 306

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
            +++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC    ++    L
Sbjct: 307 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSETDVNSENL 366

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           Q+ +LS L+E+D CY+ RR + T  RTHLYFL YE+E S+D  DVTLVAQLSMDR     
Sbjct: 367 QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEFEPSADNTDVTLVAQLSMDR----- 420

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                       LQM+E + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 421 ----------------------------LQMLEAICKHWEGPISLALYLSDAEAQQFLRY 452

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
           A  SE L SR+N+GYH+VYKEG FYP+N LRNV +  +STPY+FL DIDFLPM+GLY YL
Sbjct: 453 AQGSEVLMSRQNVGYHIVYKEGQFYPVNLLRNVXMKHISTPYMFLSDIDFLPMYGLYEYL 512

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           +  +  +D+     KK ++VPAFET RYR +FP S                        H
Sbjct: 513 RKXVIQLDL--ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGH 570

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           APTNF++W  ATTPYQ+EW  DFEPY+V  RD P YD RFVGFGWNK
Sbjct: 571 APTNFAKWRTATTPYQVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 617



 Score =  254 bits (649), Expect = 9e-65,   Method: Compositional matrix adjust.
 Identities = 110/168 (65%), Positives = 136/168 (80%)

Query: 5   EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
           ++P +  CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A  IL TLF 
Sbjct: 64  QQPAVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 123

Query: 65  TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
           TW VP V V FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ F
Sbjct: 124 TWMVPAVGVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 183

Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 184 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 231


>gi|297708696|ref|XP_002831094.1| PREDICTED: like-glycosyltransferase isoform 1 [Pongo abelii]
 gi|297708698|ref|XP_002831095.1| PREDICTED: like-glycosyltransferase isoform 2 [Pongo abelii]
          Length = 756

 Score =  647 bits (1670), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 313/527 (59%), Positives = 377/527 (71%), Gaps = 65/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ FATDI
Sbjct: 192 PAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 251

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR + 
Sbjct: 252 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 311

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   WR+ AE+ L+  L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 312 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 370

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
            +++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC    ++    L
Sbjct: 371 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEADVNSENL 430

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           Q+ +LS L+E+D CY+ RR + T  RTHLYFL YEYE ++D  DVTLVAQLSMDR     
Sbjct: 431 QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEYEPAADSTDVTLVAQLSMDR----- 484

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                       LQM+E + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 485 ----------------------------LQMLEAICKHWEGPISLALYLSDAEAQQFLRY 516

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
           A  SE L SR N+GYH+VYKEG FYP+N LRNVA+  +STPY+FL DIDFLPM+GLY YL
Sbjct: 517 AQGSEVLMSRHNVGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLYEYL 576

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           + S+  +D+     KK ++VPAFET RYR +FP S                        H
Sbjct: 577 RKSVIQLDL--ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGH 634

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           APTNF++W  ATTPY++EW  DFEPY+V  RD P YD RFVGFGWNK
Sbjct: 635 APTNFAKWRTATTPYRVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 681



 Score =  257 bits (656), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 111/168 (66%), Positives = 137/168 (81%)

Query: 5   EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
           ++PV+  CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A  IL TLF 
Sbjct: 128 QQPVVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 187

Query: 65  TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
           TW VP V V FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ F
Sbjct: 188 TWMVPAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 247

Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 248 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 295


>gi|332859455|ref|XP_003317212.1| PREDICTED: like-glycosyltransferase [Pan troglodytes]
 gi|397501767|ref|XP_003821547.1| PREDICTED: glycosyltransferase-like protein LARGE1 isoform 4 [Pan
           paniscus]
          Length = 688

 Score =  647 bits (1669), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 313/527 (59%), Positives = 377/527 (71%), Gaps = 65/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ FATDI
Sbjct: 124 PAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 183

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR + 
Sbjct: 184 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 243

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   WR+ AE+ L+  L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 244 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 302

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
            +++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC    ++    L
Sbjct: 303 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEADVNSENL 362

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           Q+ +LS L+E+D CY+ RR + T  RTHLYFL YEYE ++D  DVTLVAQLSMDR     
Sbjct: 363 QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEYEPAADSTDVTLVAQLSMDR----- 416

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                       LQM+E + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 417 ----------------------------LQMLEAICKHWEGPISLALYLSDAEAQQFLRY 448

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
           A  SE L SR N+GYH+VYKEG FYP+N LRNVA+  +STPY+FL DIDFLPM+GLY YL
Sbjct: 449 AQGSEVLMSRHNVGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLYEYL 508

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           + S+  +D+     KK ++VPAFET RYR +FP S                        H
Sbjct: 509 RKSVIQLDL--ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGH 566

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           APTNF++W  ATTPY++EW  DFEPY+V  RD P YD RFVGFGWNK
Sbjct: 567 APTNFAKWRTATTPYRVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 613



 Score =  257 bits (657), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 111/168 (66%), Positives = 137/168 (81%)

Query: 5   EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
           ++PV+  CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A  IL TLF 
Sbjct: 60  QQPVVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 119

Query: 65  TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
           TW VP V V FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ F
Sbjct: 120 TWMVPAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 179

Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 180 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 227


>gi|402884050|ref|XP_003905506.1| PREDICTED: glycosyltransferase-like protein LARGE1 isoform 4 [Papio
           anubis]
          Length = 688

 Score =  647 bits (1669), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 313/527 (59%), Positives = 377/527 (71%), Gaps = 65/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ FATDI
Sbjct: 124 PAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 183

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR + 
Sbjct: 184 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 243

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   WR+ AE+ L+  L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 244 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 302

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
            +++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC    ++    L
Sbjct: 303 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEADVNSENL 362

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           Q+ +LS L+E+D CY+ RR + T  RTHLYFL YEYE ++D  DVTLVAQLSMDR     
Sbjct: 363 QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEYEPAADSTDVTLVAQLSMDR----- 416

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                       LQM+E + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 417 ----------------------------LQMLEAICKHWEGPISLALYLSDAEAQQFLRY 448

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
           A  SE L SR N+GYH+VYKEG FYP+N LRNVA+  +STPY+FL DIDFLPM+GLY YL
Sbjct: 449 AQGSEVLMSRHNVGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLYEYL 508

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           + S+  +D+     KK ++VPAFET RYR +FP S                        H
Sbjct: 509 RKSVIQLDL--ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGH 566

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           APTNF++W  ATTPY++EW  DFEPY+V  RD P YD RFVGFGWNK
Sbjct: 567 APTNFAKWRTATTPYRVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 613



 Score =  257 bits (657), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 111/168 (66%), Positives = 137/168 (81%)

Query: 5   EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
           ++PV+  CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A  IL TLF 
Sbjct: 60  QQPVVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 119

Query: 65  TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
           TW VP V V FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ F
Sbjct: 120 TWMVPAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 179

Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 180 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 227


>gi|297708700|ref|XP_002831096.1| PREDICTED: like-glycosyltransferase isoform 3 [Pongo abelii]
          Length = 688

 Score =  647 bits (1669), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 313/527 (59%), Positives = 377/527 (71%), Gaps = 65/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ FATDI
Sbjct: 124 PAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 183

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR + 
Sbjct: 184 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 243

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   WR+ AE+ L+  L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 244 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 302

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
            +++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC    ++    L
Sbjct: 303 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEADVNSENL 362

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           Q+ +LS L+E+D CY+ RR + T  RTHLYFL YEYE ++D  DVTLVAQLSMDR     
Sbjct: 363 QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEYEPAADSTDVTLVAQLSMDR----- 416

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                       LQM+E + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 417 ----------------------------LQMLEAICKHWEGPISLALYLSDAEAQQFLRY 448

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
           A  SE L SR N+GYH+VYKEG FYP+N LRNVA+  +STPY+FL DIDFLPM+GLY YL
Sbjct: 449 AQGSEVLMSRHNVGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLYEYL 508

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           + S+  +D+     KK ++VPAFET RYR +FP S                        H
Sbjct: 509 RKSVIQLDL--ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGH 566

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           APTNF++W  ATTPY++EW  DFEPY+V  RD P YD RFVGFGWNK
Sbjct: 567 APTNFAKWRTATTPYRVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 613



 Score =  257 bits (657), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 111/168 (66%), Positives = 137/168 (81%)

Query: 5   EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
           ++PV+  CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A  IL TLF 
Sbjct: 60  QQPVVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 119

Query: 65  TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
           TW VP V V FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ F
Sbjct: 120 TWMVPAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 179

Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 180 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 227


>gi|296191746|ref|XP_002743761.1| PREDICTED: glycosyltransferase-like protein LARGE1 isoform 3
           [Callithrix jacchus]
          Length = 688

 Score =  647 bits (1669), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 313/527 (59%), Positives = 377/527 (71%), Gaps = 65/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ FATDI
Sbjct: 124 PAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 183

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR + 
Sbjct: 184 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 243

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   WR+ AE+ L+  L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 244 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 302

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
            +++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC    ++    L
Sbjct: 303 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEADVNSENL 362

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           Q+ +LS L+E+D CY+ RR + T  RTHLYFL YEYE ++D  DVTLVAQLSMDR     
Sbjct: 363 QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEYEPAADSTDVTLVAQLSMDR----- 416

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                       LQM+E + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 417 ----------------------------LQMLEAICKHWEGPISLALYLSDAEAQQFLRY 448

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
           A  SE L SR N+GYH+VYKEG FYP+N LRNVA+  +STPY+FL DIDFLPM+GLY YL
Sbjct: 449 AQGSEVLMSRHNVGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLYEYL 508

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           + S+  +D+     KK ++VPAFET RYR +FP S                        H
Sbjct: 509 RKSVIQLDL--ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGH 566

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           APTNF++W  ATTPY++EW  DFEPY+V  RD P YD RFVGFGWNK
Sbjct: 567 APTNFAKWRTATTPYRVEWEADFEPYVVVKRDCPEYDRRFVGFGWNK 613



 Score =  257 bits (657), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 111/168 (66%), Positives = 137/168 (81%)

Query: 5   EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
           ++PV+  CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A  IL TLF 
Sbjct: 60  QQPVVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 119

Query: 65  TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
           TW VP V V FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ F
Sbjct: 120 TWMVPAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 179

Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 180 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 227


>gi|403283126|ref|XP_003932978.1| PREDICTED: glycosyltransferase-like protein LARGE1 isoform 4
           [Saimiri boliviensis boliviensis]
          Length = 688

 Score =  647 bits (1669), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 313/527 (59%), Positives = 377/527 (71%), Gaps = 65/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ FATDI
Sbjct: 124 PAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 183

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR + 
Sbjct: 184 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 243

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   WR+ AE+ L+  L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 244 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 302

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
            +++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC    ++    L
Sbjct: 303 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEADVNSENL 362

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           Q+ +LS L+E+D CY+ RR + T  RTHLYFL YEYE ++D  DVTLVAQLSMDR     
Sbjct: 363 QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEYEPAADSTDVTLVAQLSMDR----- 416

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                       LQM+E + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 417 ----------------------------LQMLEAICKHWEGPISLALYLSDAEAQQFLRY 448

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
           A  SE L SR N+GYH+VYKEG FYP+N LRNVA+  +STPY+FL DIDFLPM+GLY YL
Sbjct: 449 AQGSEVLMSRHNVGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLYEYL 508

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           + S+  +D+     KK ++VPAFET RYR +FP S                        H
Sbjct: 509 RKSVIQLDL--ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGH 566

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           APTNF++W  ATTPY++EW  DFEPY+V  RD P YD RFVGFGWNK
Sbjct: 567 APTNFAKWRTATTPYRVEWEADFEPYVVVKRDCPEYDRRFVGFGWNK 613



 Score =  257 bits (656), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 111/168 (66%), Positives = 137/168 (81%)

Query: 5   EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
           ++PV+  CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A  IL TLF 
Sbjct: 60  QQPVVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 119

Query: 65  TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
           TW VP V V FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ F
Sbjct: 120 TWMVPAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 179

Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 180 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 227


>gi|4758664|ref|NP_004728.1| glycosyltransferase-like protein LARGE1 [Homo sapiens]
 gi|19924141|ref|NP_598397.1| glycosyltransferase-like protein LARGE1 [Homo sapiens]
 gi|114686090|ref|XP_001154608.1| PREDICTED: like-glycosyltransferase isoform 2 [Pan troglodytes]
 gi|114686092|ref|XP_001154669.1| PREDICTED: like-glycosyltransferase isoform 3 [Pan troglodytes]
 gi|397501761|ref|XP_003821544.1| PREDICTED: glycosyltransferase-like protein LARGE1 isoform 1 [Pan
           paniscus]
 gi|397501763|ref|XP_003821545.1| PREDICTED: glycosyltransferase-like protein LARGE1 isoform 2 [Pan
           paniscus]
 gi|397501765|ref|XP_003821546.1| PREDICTED: glycosyltransferase-like protein LARGE1 isoform 3 [Pan
           paniscus]
 gi|22001684|sp|O95461.1|LARGE_HUMAN RecName: Full=Glycosyltransferase-like protein LARGE1; AltName:
           Full=Acetylglucosaminyltransferase-like 1A
 gi|3954938|emb|CAA07571.1| acetylglucosaminyltransferase-like protein [Homo sapiens]
 gi|47678551|emb|CAG30396.1| LARGE [Homo sapiens]
 gi|55957087|emb|CAI17890.1| like-glycosyltransferase [Homo sapiens]
 gi|109451358|emb|CAK54540.1| LARGE [synthetic construct]
 gi|109451936|emb|CAK54839.1| LARGE [synthetic construct]
 gi|109659262|gb|AAI17426.1| Like-glycosyltransferase [Homo sapiens]
 gi|116496729|gb|AAI26405.1| Like-glycosyltransferase [Homo sapiens]
 gi|119580445|gb|EAW60041.1| like-glycosyltransferase, isoform CRA_a [Homo sapiens]
 gi|119580446|gb|EAW60042.1| like-glycosyltransferase, isoform CRA_a [Homo sapiens]
 gi|313883560|gb|ADR83266.1| like-glycosyltransferase (LARGE), transcript variant 2 [synthetic
           construct]
 gi|410216402|gb|JAA05420.1| like-glycosyltransferase [Pan troglodytes]
 gi|410216404|gb|JAA05421.1| like-glycosyltransferase [Pan troglodytes]
 gi|410267726|gb|JAA21829.1| like-glycosyltransferase [Pan troglodytes]
 gi|410267728|gb|JAA21830.1| like-glycosyltransferase [Pan troglodytes]
 gi|410291548|gb|JAA24374.1| like-glycosyltransferase [Pan troglodytes]
 gi|410291550|gb|JAA24375.1| like-glycosyltransferase [Pan troglodytes]
 gi|410331887|gb|JAA34890.1| like-glycosyltransferase [Pan troglodytes]
 gi|410331889|gb|JAA34891.1| like-glycosyltransferase [Pan troglodytes]
          Length = 756

 Score =  647 bits (1668), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 313/527 (59%), Positives = 377/527 (71%), Gaps = 65/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ FATDI
Sbjct: 192 PAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 251

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR + 
Sbjct: 252 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 311

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   WR+ AE+ L+  L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 312 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 370

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
            +++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC    ++    L
Sbjct: 371 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEADVNSENL 430

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           Q+ +LS L+E+D CY+ RR + T  RTHLYFL YEYE ++D  DVTLVAQLSMDR     
Sbjct: 431 QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEYEPAADSTDVTLVAQLSMDR----- 484

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                       LQM+E + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 485 ----------------------------LQMLEAICKHWEGPISLALYLSDAEAQQFLRY 516

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
           A  SE L SR N+GYH+VYKEG FYP+N LRNVA+  +STPY+FL DIDFLPM+GLY YL
Sbjct: 517 AQGSEVLMSRHNVGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLYEYL 576

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           + S+  +D+     KK ++VPAFET RYR +FP S                        H
Sbjct: 577 RKSVIQLDL--ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGH 634

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           APTNF++W  ATTPY++EW  DFEPY+V  RD P YD RFVGFGWNK
Sbjct: 635 APTNFAKWRTATTPYRVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 681



 Score =  257 bits (656), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 111/168 (66%), Positives = 137/168 (81%)

Query: 5   EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
           ++PV+  CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A  IL TLF 
Sbjct: 128 QQPVVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 187

Query: 65  TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
           TW VP V V FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ F
Sbjct: 188 TWMVPAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 247

Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 248 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 295


>gi|403283120|ref|XP_003932975.1| PREDICTED: glycosyltransferase-like protein LARGE1 isoform 1
           [Saimiri boliviensis boliviensis]
 gi|403283122|ref|XP_003932976.1| PREDICTED: glycosyltransferase-like protein LARGE1 isoform 2
           [Saimiri boliviensis boliviensis]
 gi|403283124|ref|XP_003932977.1| PREDICTED: glycosyltransferase-like protein LARGE1 isoform 3
           [Saimiri boliviensis boliviensis]
          Length = 756

 Score =  647 bits (1668), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 313/527 (59%), Positives = 377/527 (71%), Gaps = 65/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ FATDI
Sbjct: 192 PAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 251

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR + 
Sbjct: 252 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 311

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   WR+ AE+ L+  L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 312 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 370

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
            +++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC    ++    L
Sbjct: 371 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEADVNSENL 430

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           Q+ +LS L+E+D CY+ RR + T  RTHLYFL YEYE ++D  DVTLVAQLSMDR     
Sbjct: 431 QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEYEPAADSTDVTLVAQLSMDR----- 484

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                       LQM+E + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 485 ----------------------------LQMLEAICKHWEGPISLALYLSDAEAQQFLRY 516

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
           A  SE L SR N+GYH+VYKEG FYP+N LRNVA+  +STPY+FL DIDFLPM+GLY YL
Sbjct: 517 AQGSEVLMSRHNVGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLYEYL 576

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           + S+  +D+     KK ++VPAFET RYR +FP S                        H
Sbjct: 577 RKSVIQLDL--ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGH 634

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           APTNF++W  ATTPY++EW  DFEPY+V  RD P YD RFVGFGWNK
Sbjct: 635 APTNFAKWRTATTPYRVEWEADFEPYVVVKRDCPEYDRRFVGFGWNK 681



 Score =  257 bits (656), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 111/168 (66%), Positives = 137/168 (81%)

Query: 5   EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
           ++PV+  CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A  IL TLF 
Sbjct: 128 QQPVVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 187

Query: 65  TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
           TW VP V V FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ F
Sbjct: 188 TWMVPAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 247

Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 248 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 295


>gi|386781137|ref|NP_001247834.1| glycosyltransferase-like protein LARGE1 [Macaca mulatta]
 gi|402884044|ref|XP_003905503.1| PREDICTED: glycosyltransferase-like protein LARGE1 isoform 1 [Papio
           anubis]
 gi|402884046|ref|XP_003905504.1| PREDICTED: glycosyltransferase-like protein LARGE1 isoform 2 [Papio
           anubis]
 gi|402884048|ref|XP_003905505.1| PREDICTED: glycosyltransferase-like protein LARGE1 isoform 3 [Papio
           anubis]
 gi|355784935|gb|EHH65786.1| hypothetical protein EGM_02623 [Macaca fascicularis]
 gi|380817940|gb|AFE80844.1| glycosyltransferase-like protein LARGE1 [Macaca mulatta]
          Length = 756

 Score =  646 bits (1667), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 313/527 (59%), Positives = 377/527 (71%), Gaps = 65/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ FATDI
Sbjct: 192 PAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 251

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR + 
Sbjct: 252 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 311

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   WR+ AE+ L+  L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 312 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 370

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
            +++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC    ++    L
Sbjct: 371 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEADVNSENL 430

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           Q+ +LS L+E+D CY+ RR + T  RTHLYFL YEYE ++D  DVTLVAQLSMDR     
Sbjct: 431 QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEYEPAADSTDVTLVAQLSMDR----- 484

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                       LQM+E + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 485 ----------------------------LQMLEAICKHWEGPISLALYLSDAEAQQFLRY 516

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
           A  SE L SR N+GYH+VYKEG FYP+N LRNVA+  +STPY+FL DIDFLPM+GLY YL
Sbjct: 517 AQGSEVLMSRHNVGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLYEYL 576

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           + S+  +D+     KK ++VPAFET RYR +FP S                        H
Sbjct: 577 RKSVIQLDL--ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGH 634

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           APTNF++W  ATTPY++EW  DFEPY+V  RD P YD RFVGFGWNK
Sbjct: 635 APTNFAKWRTATTPYRVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 681



 Score =  256 bits (655), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 111/168 (66%), Positives = 137/168 (81%)

Query: 5   EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
           ++PV+  CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A  IL TLF 
Sbjct: 128 QQPVVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 187

Query: 65  TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
           TW VP V V FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ F
Sbjct: 188 TWMVPAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 247

Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 248 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 295


>gi|296191742|ref|XP_002743759.1| PREDICTED: glycosyltransferase-like protein LARGE1 isoform 1
           [Callithrix jacchus]
          Length = 756

 Score =  646 bits (1667), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 313/527 (59%), Positives = 377/527 (71%), Gaps = 65/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ FATDI
Sbjct: 192 PAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 251

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR + 
Sbjct: 252 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 311

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   WR+ AE+ L+  L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 312 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 370

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
            +++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC    ++    L
Sbjct: 371 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEADVNSENL 430

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           Q+ +LS L+E+D CY+ RR + T  RTHLYFL YEYE ++D  DVTLVAQLSMDR     
Sbjct: 431 QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEYEPAADSTDVTLVAQLSMDR----- 484

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                       LQM+E + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 485 ----------------------------LQMLEAICKHWEGPISLALYLSDAEAQQFLRY 516

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
           A  SE L SR N+GYH+VYKEG FYP+N LRNVA+  +STPY+FL DIDFLPM+GLY YL
Sbjct: 517 AQGSEVLMSRHNVGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLYEYL 576

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           + S+  +D+     KK ++VPAFET RYR +FP S                        H
Sbjct: 577 RKSVIQLDL--ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGH 634

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           APTNF++W  ATTPY++EW  DFEPY+V  RD P YD RFVGFGWNK
Sbjct: 635 APTNFAKWRTATTPYRVEWEADFEPYVVVKRDCPEYDRRFVGFGWNK 681



 Score =  257 bits (656), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 111/168 (66%), Positives = 137/168 (81%)

Query: 5   EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
           ++PV+  CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A  IL TLF 
Sbjct: 128 QQPVVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 187

Query: 65  TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
           TW VP V V FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ F
Sbjct: 188 TWMVPAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 247

Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 248 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 295


>gi|395819892|ref|XP_003783312.1| PREDICTED: glycosyltransferase-like protein LARGE1 [Otolemur
           garnettii]
          Length = 756

 Score =  646 bits (1666), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 314/527 (59%), Positives = 376/527 (71%), Gaps = 65/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ FATDI
Sbjct: 192 PAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 251

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR + 
Sbjct: 252 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 311

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   WR+ AE+ L+  L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 312 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 370

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
            +++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC    ++    L
Sbjct: 371 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEADVNSENL 430

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           Q+ +LS L+E+D CY+ RR + T  RTHLYFL YEYE  +D  DVTLVAQLSMDR     
Sbjct: 431 QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEYEPVADNTDVTLVAQLSMDR----- 484

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                       LQM+E + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 485 ----------------------------LQMLEAICKHWEGPISLALYLSDAEAQQFLRY 516

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
           A  SE L SR N+GYH+VYKEG FYP+N LRNVA+  +STPY+FL DIDFLPM+GLY YL
Sbjct: 517 AQGSEVLMSRHNVGYHIVYKEGQFYPVNLLRNVAMTHISTPYMFLSDIDFLPMYGLYEYL 576

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           + S+  +D+     KK L+VPAFET RYR +FP S                        H
Sbjct: 577 RKSVIQLDL--ANTKKALIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGH 634

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           APTNF++W  ATTPY++EW  DFEPY+V  RD P YD RFVGFGWNK
Sbjct: 635 APTNFAKWRTATTPYRVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 681



 Score =  257 bits (656), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 111/168 (66%), Positives = 137/168 (81%)

Query: 5   EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
           ++PV+  CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A  IL TLF 
Sbjct: 128 QQPVVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 187

Query: 65  TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
           TW VP V V FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ F
Sbjct: 188 TWMVPAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 247

Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 248 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 295


>gi|126339665|ref|XP_001366396.1| PREDICTED: glycosyltransferase-like protein LARGE1-like
           [Monodelphis domestica]
          Length = 756

 Score =  646 bits (1666), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 314/527 (59%), Positives = 377/527 (71%), Gaps = 65/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ FATDI
Sbjct: 192 PAVRVDFYDADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 251

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR + 
Sbjct: 252 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 311

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   WR+ AE+ L++ L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 312 WEQMWRLTAERELMSMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 370

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
            +++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC    ++    L
Sbjct: 371 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEADVNSENL 430

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           Q+ +LS L+E+D CY+ RR + T  RTHLYFL YEYE +SD  DVTLVAQLSMDR     
Sbjct: 431 QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEYEPTSDNTDVTLVAQLSMDR----- 484

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                       LQM+E + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 485 ----------------------------LQMLEAICKHWEGPISLALYLSDAEAQQFLRY 516

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
           A  SE L SR N+GYH+VYKEG FYP+N LRNVA+  +STPY+FL DIDFLPM+GL  YL
Sbjct: 517 AQGSEVLMSRHNVGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLCEYL 576

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           + S+  +D+     KK L+VPAFET RYR +FP S                        H
Sbjct: 577 RKSVIQLDL--ANTKKALIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGH 634

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           APTNF++W  ATTPY++EW  DFEPY+V  RD P YD RFVGFGWNK
Sbjct: 635 APTNFAKWRTATTPYRVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 681



 Score =  254 bits (650), Expect = 7e-65,   Method: Compositional matrix adjust.
 Identities = 110/168 (65%), Positives = 135/168 (80%)

Query: 5   EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
           ++P +  CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D +A  IL TLF 
Sbjct: 128 QQPAVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADAIAKQILATLFQ 187

Query: 65  TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
           TW VP V V FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ F
Sbjct: 188 TWMVPAVRVDFYDADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 247

Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 248 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 295


>gi|427792509|gb|JAA61706.1| Putative glycosyltransferase-like 1b, partial [Rhipicephalus
           pulchellus]
          Length = 699

 Score =  646 bits (1666), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 319/530 (60%), Positives = 387/530 (73%), Gaps = 67/530 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FYLAD++  +VSWIPNKHYSGVYGL+KL LPK LP+TL K IVLDTD+ FA+DI
Sbjct: 131 PAVEVSFYLADNLQGEVSWIPNKHYSGVYGLMKLVLPKALPDTLDKVIVLDTDITFASDI 190

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LW +F +   +Q IGLVENQSDWYLGKLWKNH+PWPALGRGFNTGVILL L +LR+ +
Sbjct: 191 AELWRIFHKYTSKQAIGLVENQSDWYLGKLWKNHRPWPALGRGFNTGVILLRLQRLRERN 250

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   W++IAEK L++  +TSLADQDIFNA++ +H  L++ LPCQWNVQLSDNT S ELCY
Sbjct: 251 WMQMWKLIAEKELISMWFTSLADQDIFNAVLKQHTELLFRLPCQWNVQLSDNTLS-ELCY 309

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQTTLQ--- 402
           +E+ +LKIIHWNSPKKLKVKNKH+EFFRNLYLTFLEYDGNLLRREL GCN T ++     
Sbjct: 310 SEVQELKIIHWNSPKKLKVKNKHVEFFRNLYLTFLEYDGNLLRRELIGCNGTTSSSAGVH 369

Query: 403 ----QAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQ 458
               Q  L+ L+E+DPCYD RR+++   RTHLY++EY+YE S +GNDVTLVAQLSMDRL 
Sbjct: 370 ASHVQDALNLLDEDDPCYDFRRARVAQHRTHLYYIEYDYEPSPEGNDVTLVAQLSMDRL- 428

Query: 459 MVEINPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQF 518
                                           QMVE + KHWEGPISL LYMSD+E QQF
Sbjct: 429 --------------------------------QMVEAVCKHWEGPISLALYMSDSEVQQF 456

Query: 519 LSYAGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLY 578
           LSY  +SE L+SR+N+GYH+VYK+G FYP+N LRNVAL QV+TP+VFL DIDFLPM GLY
Sbjct: 457 LSYTLSSEILQSRKNVGYHIVYKDGTFYPVNMLRNVALQQVNTPFVFLTDIDFLPMHGLY 516

Query: 579 PYLKSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS---------------------- 616
            YLK S+ ++ +     +K L+VPAFETQRYR +FP +                      
Sbjct: 517 EYLKRSVFALGL--ESSRKALIVPAFETQRYRLSFPKTKAELLSMLDMGTLFTFRYHVWQ 574

Query: 617 --HAPTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
             HAPTN+++W  ATTPY + W P+FEPY+V  RD+P YD RFVGFGWNK
Sbjct: 575 KGHAPTNYAKWRTATTPYLVNWEPNFEPYVVVKRDVPEYDKRFVGFGWNK 624



 Score =  275 bits (703), Expect = 6e-71,   Method: Compositional matrix adjust.
 Identities = 122/171 (71%), Positives = 147/171 (85%)

Query: 2   SSDEEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQT 61
             +E P +P CEV+QVAIVCAGYN++RS++TL+KSILFYRKNPLHFH I+D VA  +LQT
Sbjct: 64  GGNELPEVPKCEVLQVAIVCAGYNASRSVITLVKSILFYRKNPLHFHFISDAVAHLVLQT 123

Query: 62  LFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTD 121
           LF +W++P VEVSFYLAD++  +VSWIPNKHYSGVYGL+KL LPK LP+TL K IVLDTD
Sbjct: 124 LFRSWNMPAVEVSFYLADNLQGEVSWIPNKHYSGVYGLMKLVLPKALPDTLDKVIVLDTD 183

Query: 122 VIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           + FA+DIA+LW +F +   +Q IGLVENQSDWYLGKLWKNH+PWPALGRGF
Sbjct: 184 ITFASDIAELWRIFHKYTSKQAIGLVENQSDWYLGKLWKNHRPWPALGRGF 234


>gi|395538357|ref|XP_003771149.1| PREDICTED: glycosyltransferase-like protein LARGE1 [Sarcophilus
           harrisii]
          Length = 756

 Score =  645 bits (1665), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 313/527 (59%), Positives = 377/527 (71%), Gaps = 65/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  + + IVLDTD+ FATDI
Sbjct: 192 PAVRVDFYDADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANMERVIVLDTDITFATDI 251

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR + 
Sbjct: 252 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 311

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   WR+ AE+ L++ L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 312 WEQMWRLTAERELMSMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 370

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
            +++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC    ++    L
Sbjct: 371 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEADVNSENL 430

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           Q+ +LS L+E+D CY+ RR + T  RTHLYFL YEYE +SD  DVTLVAQLSMDR     
Sbjct: 431 QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEYEPTSDNTDVTLVAQLSMDR----- 484

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                       LQM+E + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 485 ----------------------------LQMLEAICKHWEGPISLALYLSDAEAQQFLRY 516

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
           A  SE L SR N+GYH+VYKEG FYP+N LRNVA+  +STPY+FL DIDFLPM+GL  YL
Sbjct: 517 AQGSEVLMSRHNVGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLCEYL 576

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           + S+  +D+     KK L+VPAFET RYR +FP S                        H
Sbjct: 577 RKSVIQLDL--ANTKKALIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGH 634

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           APTNF++W  ATTPY++EW  DFEPY+V  RD P YD RFVGFGWNK
Sbjct: 635 APTNFAKWRTATTPYRVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 681



 Score =  254 bits (649), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 109/168 (64%), Positives = 135/168 (80%)

Query: 5   EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
           ++P +  CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D +A  IL TLF 
Sbjct: 128 QQPAVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADAIAKQILATLFQ 187

Query: 65  TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
           TW VP V V FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  + + IVLDTD+ F
Sbjct: 188 TWMVPAVRVDFYDADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANMERVIVLDTDITF 247

Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 248 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 295


>gi|327272296|ref|XP_003220921.1| PREDICTED: glycosyltransferase-like protein LARGE1-like [Anolis
           carolinensis]
          Length = 756

 Score =  645 bits (1665), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 313/527 (59%), Positives = 377/527 (71%), Gaps = 65/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ FATDI
Sbjct: 192 PAVRIDFYDADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 251

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR + 
Sbjct: 252 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 311

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   WR+ AE+ L++ L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 312 WEQMWRLTAERELMSMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 370

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
            +++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC    ++    L
Sbjct: 371 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEADVNSENL 430

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           Q+ +LS L+E+D CY+ RR + T  RTHLYFL YEYE + D  DVTLVAQLSMDR     
Sbjct: 431 QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEYEPAPDNTDVTLVAQLSMDR----- 484

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                       LQM+E + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 485 ----------------------------LQMLEAICKHWEGPISLALYLSDAEAQQFLRY 516

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
           A  SE L SR N+GYH+VYKEG FYP+N LRNVA+  +STPY+FL DIDFLPM+GLY YL
Sbjct: 517 AQGSEVLMSRHNVGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLYEYL 576

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           + S+  +D+     KK L+VPAFET RYR +FP S                        H
Sbjct: 577 RKSVVQLDL--ANTKKALIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGH 634

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           APTNF++W  ATTPY++EW  DFEPY+V  +D P YD RFVGFGWNK
Sbjct: 635 APTNFAKWRTATTPYRVEWEADFEPYVVVRQDCPEYDRRFVGFGWNK 681



 Score =  254 bits (650), Expect = 8e-65,   Method: Compositional matrix adjust.
 Identities = 109/168 (64%), Positives = 135/168 (80%)

Query: 5   EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
           ++P +  CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D +A  IL TLF 
Sbjct: 128 QQPAVEKCEAIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADAIAKQILATLFQ 187

Query: 65  TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
           TW VP V + FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ F
Sbjct: 188 TWMVPAVRIDFYDADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 247

Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 248 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 295


>gi|344266604|ref|XP_003405370.1| PREDICTED: glycosyltransferase-like protein LARGE1 [Loxodonta
           africana]
          Length = 756

 Score =  645 bits (1664), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 313/527 (59%), Positives = 377/527 (71%), Gaps = 65/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ FATDI
Sbjct: 192 PAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPADLERVIVLDTDITFATDI 251

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR + 
Sbjct: 252 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 311

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   WR+ AE+ L+  L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 312 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 370

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
            +++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC    ++    L
Sbjct: 371 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEADVNSENL 430

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           Q+ +LS L+E+D CY+ RR + T  RTHLYFL YEYE +SD  DVTLVAQLSMDR     
Sbjct: 431 QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEYEPASDNTDVTLVAQLSMDR----- 484

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                       LQM+E + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 485 ----------------------------LQMLEAICKHWEGPISLALYLSDAEAQQFLRY 516

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
           A  S+ L SR N+GYH+VYKEG FYP+N LRNVA+  +STPY+FL DIDFLPM+GLY YL
Sbjct: 517 AQGSDVLMSRHNVGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLYEYL 576

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           + S+  +D+     KK ++VPAFET RYR +FP S                        H
Sbjct: 577 RKSVIQLDL--ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGH 634

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           APTNF++W  ATTPY++EW  DFEPY+V  RD P YD RFVGFGWNK
Sbjct: 635 APTNFAKWRTATTPYRVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 681



 Score =  257 bits (657), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 112/168 (66%), Positives = 137/168 (81%)

Query: 5   EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
           ++PV+  CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI DT+A  IL TLF 
Sbjct: 128 QQPVVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADTIAEQILATLFQ 187

Query: 65  TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
           TW VP V V FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ F
Sbjct: 188 TWMVPAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPADLERVIVLDTDITF 247

Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 248 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 295


>gi|291389926|ref|XP_002711463.1| PREDICTED: like-glycosyltransferase [Oryctolagus cuniculus]
          Length = 756

 Score =  645 bits (1663), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 312/527 (59%), Positives = 376/527 (71%), Gaps = 65/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ FATDI
Sbjct: 192 PAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 251

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR + 
Sbjct: 252 AELWAVFHKFKGQQALGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 311

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   WR+ AE+ L+  L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 312 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 370

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
            +++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC    ++    L
Sbjct: 371 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEADVNSENL 430

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           Q+ +LS L+E+D CY+ RR + T  RTHLYFL YEYE ++D  DVTLVAQLSMDR     
Sbjct: 431 QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEYEPAADSTDVTLVAQLSMDR----- 484

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                       LQM+E + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 485 ----------------------------LQMLEAICKHWEGPISLALYLSDAEAQQFLRY 516

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
           A  SE L SR N+GYH+VYKEG FYP+N LRNVA+  + TPY+FL DIDFLPM+GLY YL
Sbjct: 517 AQGSEVLMSRHNVGYHIVYKEGQFYPVNLLRNVAMKHIGTPYMFLSDIDFLPMYGLYEYL 576

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           + S+  +D+     KK ++VPAFET RYR +FP S                        H
Sbjct: 577 RKSVIQLDL--ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGH 634

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           APTNF++W  ATTPY++EW  DFEPY+V  RD P YD RFVGFGWNK
Sbjct: 635 APTNFAKWRTATTPYRVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 681



 Score =  256 bits (655), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 111/168 (66%), Positives = 137/168 (81%)

Query: 5   EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
           ++PV+  CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A  IL TLF 
Sbjct: 128 QQPVVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 187

Query: 65  TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
           TW VP V V FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ F
Sbjct: 188 TWMVPAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 247

Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 248 ATDIAELWAVFHKFKGQQALGLVENQSDWYLGNLWKNHRPWPALGRGY 295


>gi|432094353|gb|ELK25930.1| Glycosyltransferase-like protein LARGE1 [Myotis davidii]
          Length = 730

 Score =  644 bits (1662), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 312/527 (59%), Positives = 375/527 (71%), Gaps = 65/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ FATDI
Sbjct: 166 PAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 225

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR + 
Sbjct: 226 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 285

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   WR+ AE+ L+  L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 286 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 344

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
            +++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC    ++    L
Sbjct: 345 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEADVNSENL 404

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           Q+ +LS L+E+D CY+ RR + T  RTHLYFL YEYE ++D  DVTLVAQLSMDR     
Sbjct: 405 QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEYEPAADNTDVTLVAQLSMDR----- 458

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                       LQM+E + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 459 ----------------------------LQMLEAICKHWEGPISLALYLSDAEAQQFLRY 490

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
           A  SE L SR N+ YH+VYKEG FYP+N LRNVA+  V TPY+FL DIDFLPM+GLY YL
Sbjct: 491 AQGSEVLMSRHNVAYHIVYKEGQFYPVNLLRNVAMKHVGTPYMFLSDIDFLPMYGLYEYL 550

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           + S+  +D+     KK ++VPAFET RYR +FP S                        H
Sbjct: 551 RKSVIQLDL--ANSKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGH 608

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           APTNF++W  ATTPY++EW  DFEPY+V  RD P YD RFVGFGWNK
Sbjct: 609 APTNFAKWRTATTPYRVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 655



 Score =  256 bits (654), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 110/168 (65%), Positives = 137/168 (81%)

Query: 5   EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
           ++PV+  CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A  IL +LF 
Sbjct: 102 QQPVVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILASLFQ 161

Query: 65  TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
           TW VP V V FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ F
Sbjct: 162 TWMVPAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 221

Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 222 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 269


>gi|221040868|dbj|BAH12135.1| unnamed protein product [Homo sapiens]
          Length = 688

 Score =  644 bits (1661), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 312/527 (59%), Positives = 376/527 (71%), Gaps = 65/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY AD +  +VSWIPNKHYS +YGL+KL L K LP  L + IVLDTD+ FATDI
Sbjct: 124 PAVRVDFYNADELKSEVSWIPNKHYSEIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 183

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR + 
Sbjct: 184 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 243

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   WR+ AE+ L+  L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 244 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 302

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
            +++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC    ++    L
Sbjct: 303 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEADVNSENL 362

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           Q+ +LS L+E+D CY+ RR + T  RTHLYFL YEYE ++D  DVTLVAQLSMDR     
Sbjct: 363 QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEYEPAADSTDVTLVAQLSMDR----- 416

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                       LQM+E + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 417 ----------------------------LQMLEAICKHWEGPISLALYLSDAEAQQFLRY 448

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
           A  SE L SR N+GYH+VYKEG FYP+N LRNVA+  +STPY+FL DIDFLPM+GLY YL
Sbjct: 449 AQGSEVLMSRHNVGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLYEYL 508

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           + S+  +D+     KK ++VPAFET RYR +FP S                        H
Sbjct: 509 RKSVIQLDL--ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGH 566

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           APTNF++W  ATTPY++EW  DFEPY+V  RD P YD RFVGFGWNK
Sbjct: 567 APTNFAKWRTATTPYRVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 613



 Score =  254 bits (648), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 110/168 (65%), Positives = 136/168 (80%)

Query: 5   EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
           ++PV+  CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A  IL TLF 
Sbjct: 60  QQPVVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 119

Query: 65  TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
           TW VP V V FY AD +  +VSWIPNKHYS +YGL+KL L K LP  L + IVLDTD+ F
Sbjct: 120 TWMVPAVRVDFYNADELKSEVSWIPNKHYSEIYGLMKLVLTKTLPANLERVIVLDTDITF 179

Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 180 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 227


>gi|410965461|ref|XP_003989266.1| PREDICTED: glycosyltransferase-like protein LARGE1 [Felis catus]
          Length = 756

 Score =  644 bits (1661), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 313/527 (59%), Positives = 375/527 (71%), Gaps = 65/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ FATDI
Sbjct: 192 PAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 251

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR + 
Sbjct: 252 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 311

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   WR+ AE+ L+  L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 312 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 370

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
            +++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC    ++    L
Sbjct: 371 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEADVNSENL 430

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           Q+ +LS L+E+D CY+ RR + T  RTHLYFL YEYE +SD  DVTLVAQLSMDR     
Sbjct: 431 QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEYEPASDNTDVTLVAQLSMDR----- 484

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                       LQM+E + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 485 ----------------------------LQMLEAICKHWEGPISLALYLSDAEAQQFLRY 516

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
           A  SE L SR N+ YH+VYKEG FYP+N LRNVA+  V TPY+FL DIDFLPM+GLY YL
Sbjct: 517 AQGSEVLMSRHNVAYHIVYKEGQFYPVNLLRNVAMKHVGTPYMFLSDIDFLPMYGLYEYL 576

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           + S+  +D+     KK ++VPAFET RYR +FP S                        H
Sbjct: 577 RKSVIQLDL--ASTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGH 634

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           APTNF++W  ATTPY++EW  DFEPY+V  RD P YD RFVGFGWNK
Sbjct: 635 APTNFAKWRTATTPYRVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 681



 Score =  256 bits (655), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 111/168 (66%), Positives = 137/168 (81%)

Query: 5   EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
           ++PV+  CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A  IL TLF 
Sbjct: 128 QQPVVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 187

Query: 65  TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
           TW VP V V FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ F
Sbjct: 188 TWMVPAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 247

Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 248 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 295


>gi|417404394|gb|JAA48952.1| Putative glycosyltransferase-like protein large1 [Desmodus
           rotundus]
          Length = 756

 Score =  644 bits (1660), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 312/527 (59%), Positives = 375/527 (71%), Gaps = 65/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ FATDI
Sbjct: 192 PAVRVDFYNADELKTEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 251

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR + 
Sbjct: 252 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 311

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   WR+ AE+ L+  L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 312 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 370

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
            +++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC    ++    L
Sbjct: 371 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEADVNSENL 430

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           Q+ +LS L+E+D CY+ RR + T  RTHLYFL YEYE ++D  DVTLVAQLSMDR     
Sbjct: 431 QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEYEPAADNTDVTLVAQLSMDR----- 484

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                       LQM+E + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 485 ----------------------------LQMLEAICKHWEGPISLALYLSDAEAQQFLRY 516

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
           A  SE L  RRN+ YH+VYKEG FYP+N LRNVA+  V TPY+FL DIDFLPM+GLY YL
Sbjct: 517 AQGSEVLMGRRNVAYHIVYKEGQFYPVNLLRNVAMKHVGTPYMFLSDIDFLPMYGLYEYL 576

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           + S+  +D+     KK ++VPAFET RYR +FP S                        H
Sbjct: 577 RKSVIQLDL--ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGH 634

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           APTNF++W  ATTPY++EW  DFEPY+V  RD P YD RFVGFGWNK
Sbjct: 635 APTNFAKWRTATTPYRVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 681



 Score =  256 bits (655), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 111/168 (66%), Positives = 137/168 (81%)

Query: 5   EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
           ++PV+  CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A  IL TLF 
Sbjct: 128 QQPVVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 187

Query: 65  TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
           TW VP V V FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ F
Sbjct: 188 TWMVPAVRVDFYNADELKTEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 247

Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 248 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 295


>gi|345777120|ref|XP_531751.3| PREDICTED: like-glycosyltransferase [Canis lupus familiaris]
          Length = 756

 Score =  644 bits (1660), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 313/527 (59%), Positives = 375/527 (71%), Gaps = 65/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ FATDI
Sbjct: 192 PAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 251

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR + 
Sbjct: 252 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 311

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   WR+ AE+ L+  L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 312 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 370

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
            +++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC    ++    L
Sbjct: 371 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEADVNSENL 430

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           Q+ +LS L+E+D CY+ RR + T  RTHLYFL YEYE +SD  DVTLVAQLSMDR     
Sbjct: 431 QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEYEPASDNTDVTLVAQLSMDR----- 484

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                       LQM+E + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 485 ----------------------------LQMLEAICKHWEGPISLALYLSDAEAQQFLRY 516

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
           A  SE L SR N+ YH+VYKEG FYP+N LRNVA+  V TPY+FL DIDFLPM+GLY YL
Sbjct: 517 AQGSEVLMSRHNVAYHIVYKEGQFYPVNLLRNVAMKHVGTPYLFLSDIDFLPMYGLYEYL 576

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           + S+  +D+     KK ++VPAFET RYR +FP S                        H
Sbjct: 577 RKSVIQLDL--ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGH 634

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           APTNF++W  ATTPY++EW  DFEPY+V  RD P YD RFVGFGWNK
Sbjct: 635 APTNFAKWRTATTPYRVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 681



 Score =  256 bits (655), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 111/168 (66%), Positives = 137/168 (81%)

Query: 5   EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
           ++PV+  CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A  IL TLF 
Sbjct: 128 QQPVVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 187

Query: 65  TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
           TW VP V V FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ F
Sbjct: 188 TWMVPAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 247

Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 248 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 295


>gi|119580447|gb|EAW60043.1| like-glycosyltransferase, isoform CRA_b [Homo sapiens]
          Length = 693

 Score =  643 bits (1658), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 313/527 (59%), Positives = 377/527 (71%), Gaps = 65/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ FATDI
Sbjct: 129 PAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 188

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR + 
Sbjct: 189 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 248

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   WR+ AE+ L+  L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 249 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 307

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
            +++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC    ++    L
Sbjct: 308 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEADVNSENL 367

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           Q+ +LS L+E+D CY+ RR + T  RTHLYFL YEYE ++D  DVTLVAQLSMDRLQM+E
Sbjct: 368 QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEYEPAADSTDVTLVAQLSMDRLQMLE 426

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                             + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 427 A---------------------------------ICKHWEGPISLALYLSDAEAQQFLRY 453

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
           A  SE L SR N+GYH+VYKEG FYP+N LRNVA+  +STPY+FL DIDFLPM+GLY YL
Sbjct: 454 AQGSEVLMSRHNVGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLYEYL 513

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           + S+  +D+     KK ++VPAFET RYR +FP S                        H
Sbjct: 514 RKSVIQLDL--ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGH 571

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           APTNF++W  ATTPY++EW  DFEPY+V  RD P YD RFVGFGWNK
Sbjct: 572 APTNFAKWRTATTPYRVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 618



 Score =  203 bits (517), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 93/151 (61%), Positives = 113/151 (74%)

Query: 22  AGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQVEVSFYLADSV 81
           A + +T   V   +S    R+NPLHFHLI D++A  IL TLF TW VP V V FY AD +
Sbjct: 82  ASWQATAPSVGSSRSWRNARRNPLHFHLIADSIAEQILATLFQTWMVPAVRVDFYNADEL 141

Query: 82  VEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQR 141
             +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ FATDIA+LWA+F + + +
Sbjct: 142 KSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDIAELWAVFHKFKGQ 201

Query: 142 QTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 202 QVLGLVENQSDWYLGNLWKNHRPWPALGRGY 232


>gi|440903966|gb|ELR54547.1| Glycosyltransferase-like protein LARGE1, partial [Bos grunniens
           mutus]
          Length = 721

 Score =  642 bits (1656), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 311/527 (59%), Positives = 374/527 (70%), Gaps = 65/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ FATDI
Sbjct: 157 PAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 216

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR + 
Sbjct: 217 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 276

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   WR+ AE+ L+  L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 277 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 335

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
            +++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC    ++    L
Sbjct: 336 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEADVNSENL 395

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           Q+ +LS L+E+D CY+ RR + T  RTHLYFL YEYE +SD  DVTLVAQLSMDR     
Sbjct: 396 QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEYEPASDNTDVTLVAQLSMDR----- 449

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                       LQM+E + KHWEGP+SL LY+SDAEAQQFL Y
Sbjct: 450 ----------------------------LQMLEAICKHWEGPVSLALYLSDAEAQQFLRY 481

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
           A  SE L  R N+ YH+VYKEG FYP+N LRNVA+  V TPY+FL DIDFLPM+GLY YL
Sbjct: 482 AQGSEVLMGRHNVAYHIVYKEGQFYPVNLLRNVAMKHVGTPYMFLSDIDFLPMYGLYEYL 541

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           + S+  +D+     KK ++VPAFET RYR +FP S                        H
Sbjct: 542 RKSVIQLDL--ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGH 599

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           APTNF++W  ATTPY++EW  DFEPY+V  RD P YD RFVGFGWNK
Sbjct: 600 APTNFAKWRTATTPYRVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 646



 Score =  257 bits (656), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 111/168 (66%), Positives = 137/168 (81%)

Query: 5   EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
           ++PV+  CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A  IL TLF 
Sbjct: 93  QQPVVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 152

Query: 65  TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
           TW VP V V FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ F
Sbjct: 153 TWMVPAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 212

Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 213 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 260


>gi|351708238|gb|EHB11157.1| Glycosyltransferase-like protein LARGE1 [Heterocephalus glaber]
          Length = 759

 Score =  642 bits (1656), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 312/530 (58%), Positives = 377/530 (71%), Gaps = 68/530 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY A+ +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ FATDI
Sbjct: 192 PAVRVDFYNAEELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 251

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LWA+F R R +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR + 
Sbjct: 252 AELWAVFHRFRGQQALGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 311

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   WR+ AE+ L+  L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 312 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 370

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC-------NLTQ 398
            +++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC       + + 
Sbjct: 371 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEAGVADASN 430

Query: 399 TTLQQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQ 458
             LQ+ +LS L+E+D CY+ RR + T  RTHLYFL YE+E ++DG DVTLVAQLSMDR  
Sbjct: 431 KDLQK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEHEPATDGTDVTLVAQLSMDR-- 487

Query: 459 MVEINPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQF 518
                                          LQM+E + KHWEGPISL LY+SDAEAQQF
Sbjct: 488 -------------------------------LQMLEAICKHWEGPISLALYLSDAEAQQF 516

Query: 519 LSYAGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLY 578
           L YA  SE L  R N+GYH+VYKEG FYP+N LRNVA+  +STPY+FL DIDFLPM+GLY
Sbjct: 517 LRYAQGSEVLMGRHNVGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLY 576

Query: 579 PYLKSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS---------------------- 616
            YL+ S+  +D+     KK ++VPAFET RYR +FP S                      
Sbjct: 577 EYLRKSVIQLDL--ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWT 634

Query: 617 --HAPTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
             HAPTNF++W  ATTPY+++W  DFEPY+V  RD P YD RFVGFGWNK
Sbjct: 635 KGHAPTNFAKWRTATTPYRVQWEADFEPYVVLRRDCPEYDRRFVGFGWNK 684



 Score =  254 bits (648), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 110/168 (65%), Positives = 135/168 (80%)

Query: 5   EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
           ++P +  CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A  IL  LF 
Sbjct: 128 QQPAVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILAMLFH 187

Query: 65  TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
           TW VP V V FY A+ +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ F
Sbjct: 188 TWMVPAVRVDFYNAEELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 247

Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           ATDIA+LWA+F R R +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 248 ATDIAELWAVFHRFRGQQALGLVENQSDWYLGNLWKNHRPWPALGRGY 295


>gi|52138661|ref|NP_001004383.1| glycosyltransferase-like protein LARGE1 [Gallus gallus]
 gi|82083914|sp|Q66PG3.1|LARGE_CHICK RecName: Full=Glycosyltransferase-like protein LARGE1; AltName:
           Full=Acetylglucosaminyltransferase-like 1A
 gi|51872293|gb|AAU12250.1| glycosyltransferase-like 1A [Gallus gallus]
          Length = 756

 Score =  642 bits (1656), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 311/527 (59%), Positives = 376/527 (71%), Gaps = 65/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ FATDI
Sbjct: 192 PAVRIDFYDADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPSNLERVIVLDTDITFATDI 251

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR + 
Sbjct: 252 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 311

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   WR+ AE+ L++ L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 312 WEQMWRLTAERELMSMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 370

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
            +++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC    ++    L
Sbjct: 371 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEADVNSENL 430

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           Q+ +LS  +E+D CY+ RR + T  RTHLYFL YEYE +SD  DVTLVAQLSMDR     
Sbjct: 431 QK-QLSEPDEDDLCYEFRRERFTVHRTHLYFLHYEYEPASDNTDVTLVAQLSMDR----- 484

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                       LQM+E + KHWE PISL LY+SDAEAQQFL Y
Sbjct: 485 ----------------------------LQMLEAICKHWEDPISLALYLSDAEAQQFLRY 516

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
           A  SE L SR N+GYH+VYKEG FYP+N LRNVA+  +STPY+FL DIDFLPM+GLY YL
Sbjct: 517 AQGSEVLMSRHNVGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLYEYL 576

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           + S+  +D+     KK L++PAFET RYR +FP S                        H
Sbjct: 577 RKSVTQLDL--ANTKKALIIPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGH 634

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           APTNF++W  ATTPY++EW  DFEPY+V  +D P YD RFVGFGWNK
Sbjct: 635 APTNFAKWRTATTPYRVEWEADFEPYVVVRKDCPEYDRRFVGFGWNK 681



 Score =  254 bits (650), Expect = 9e-65,   Method: Compositional matrix adjust.
 Identities = 109/168 (64%), Positives = 135/168 (80%)

Query: 5   EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
           ++P +  CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D +A  IL TLF 
Sbjct: 128 QQPAVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADAIAKQILATLFQ 187

Query: 65  TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
           TW VP V + FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ F
Sbjct: 188 TWMVPAVRIDFYDADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPSNLERVIVLDTDITF 247

Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 248 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 295


>gi|432850590|ref|XP_004066824.1| PREDICTED: glycosyltransferase-like protein LARGE2-like [Oryzias
           latipes]
          Length = 756

 Score =  642 bits (1656), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 317/526 (60%), Positives = 373/526 (70%), Gaps = 65/526 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           P++   FY AD +  +VSWIPNKHYSG+YGL+KLTL K LP  L K IVLDTD+ FATDI
Sbjct: 186 PSVQVSFYDADELKSEVSWIPNKHYSGIYGLMKLTLTKALPPDLNKVIVLDTDITFATDI 245

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LW +F +   +Q IGLVENQSDWYLG LWKNHKPWPALGRGFNTGVILL L +LR IS
Sbjct: 246 AELWGIFRKFTDKQVIGLVENQSDWYLGNLWKNHKPWPALGRGFNTGVILLYLERLRRIS 305

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   WR+ AE+ L++ L TSLADQDIFNA I ++P LV+ LPC WNVQLSD+TRS++ CY
Sbjct: 306 WEQMWRLTAERELMSMLSTSLADQDIFNAFIKQNPVLVHQLPCFWNVQLSDHTRSEQ-CY 364

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC--NLTQTTLQ- 402
           TE++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC    +  ++Q 
Sbjct: 365 TEVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSQASPESIQL 424

Query: 403 QAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEI 462
           Q  L  L+E+D CYD RR +LT  R HL FL+YEY  + D  DVTLVAQLSMDR      
Sbjct: 425 QGALEELDEDDQCYDFRRERLTVHRVHLEFLQYEYTPTED--DVTLVAQLSMDR------ 476

Query: 463 NPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYA 522
                                      LQM+E + KHWEGPISL LYMSDAEAQQFL YA
Sbjct: 477 ---------------------------LQMLEAICKHWEGPISLALYMSDAEAQQFLRYA 509

Query: 523 GNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLK 582
             SE L++R+N+GYH+VYKEG FYP+N LRNVAL   +TPY FL D+DFLPM+GLY YL+
Sbjct: 510 QASEVLKNRKNVGYHIVYKEGQFYPVNLLRNVALKNANTPYTFLTDVDFLPMYGLYDYLR 569

Query: 583 SSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------HA 618
            S+  +DM     KK LVVPAFET RYR +FP S                        HA
Sbjct: 570 RSVVHLDM--RNSKKALVVPAFETLRYRLSFPKSKAELLSMLDMGTLYTFRYHVWPKGHA 627

Query: 619 PTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           PTN+++W  ATTPY++EW PDFEPY+V  R  P YD RFVGFGWNK
Sbjct: 628 PTNYAKWRTATTPYRVEWEPDFEPYVVVRRTCPEYDQRFVGFGWNK 673



 Score =  256 bits (653), Expect = 4e-65,   Method: Compositional matrix adjust.
 Identities = 114/166 (68%), Positives = 135/166 (81%)

Query: 7   PVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTW 66
           P I  CE+I VA VCAG+N++R +VTL+KS+LF+R+NPLHFH ITDTVA  IL +LF +W
Sbjct: 124 PTIQKCELIHVACVCAGHNASRDVVTLVKSVLFHRRNPLHFHFITDTVANRILSSLFRSW 183

Query: 67  SVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 126
            VP V+VSFY AD +  +VSWIPNKHYSG+YGL+KLTL K LP  L K IVLDTD+ FAT
Sbjct: 184 MVPSVQVSFYDADELKSEVSWIPNKHYSGIYGLMKLTLTKALPPDLNKVIVLDTDITFAT 243

Query: 127 DIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           DIA+LW +F +   +Q IGLVENQSDWYLG LWKNHKPWPALGRGF
Sbjct: 244 DIAELWGIFRKFTDKQVIGLVENQSDWYLGNLWKNHKPWPALGRGF 289


>gi|329663446|ref|NP_001192517.1| glycosyltransferase-like protein LARGE1 [Bos taurus]
 gi|296487413|tpg|DAA29526.1| TPA: like-glycosyltransferase [Bos taurus]
          Length = 756

 Score =  642 bits (1655), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 311/527 (59%), Positives = 374/527 (70%), Gaps = 65/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ FATDI
Sbjct: 192 PAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 251

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR + 
Sbjct: 252 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 311

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   WR+ AE+ L+  L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 312 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 370

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
            +++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC    ++    L
Sbjct: 371 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEADVNSENL 430

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           Q+ +LS L+E+D CY+ RR + T  RTHLYFL YEYE +SD  DVTLVAQLSMDR     
Sbjct: 431 QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEYEPASDNTDVTLVAQLSMDR----- 484

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                       LQM+E + KHWEGP+SL LY+SDAEAQQFL Y
Sbjct: 485 ----------------------------LQMLEAICKHWEGPVSLALYLSDAEAQQFLRY 516

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
           A  SE L  R N+ YH+VYKEG FYP+N LRNVA+  V TPY+FL DIDFLPM+GLY YL
Sbjct: 517 AQGSEVLMGRHNVAYHIVYKEGQFYPVNLLRNVAMKHVGTPYMFLSDIDFLPMYGLYEYL 576

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           + S+  +D+     KK ++VPAFET RYR +FP S                        H
Sbjct: 577 RKSVIQLDL--ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGH 634

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           APTNF++W  ATTPY++EW  DFEPY+V  RD P YD RFVGFGWNK
Sbjct: 635 APTNFAKWRTATTPYRVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 681



 Score =  256 bits (655), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 111/168 (66%), Positives = 137/168 (81%)

Query: 5   EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
           ++PV+  CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A  IL TLF 
Sbjct: 128 QQPVVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 187

Query: 65  TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
           TW VP V V FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ F
Sbjct: 188 TWMVPAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 247

Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 248 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 295


>gi|194226762|ref|XP_001498804.2| PREDICTED: like-glycosyltransferase [Equus caballus]
          Length = 756

 Score =  642 bits (1655), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 312/527 (59%), Positives = 375/527 (71%), Gaps = 65/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ FATDI
Sbjct: 192 PAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPPNLERVIVLDTDITFATDI 251

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR + 
Sbjct: 252 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 311

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   WR+ AE+ L+  L TSLADQDIFNA+I ++P+LV+ LPC WNVQLSD+TRS++ CY
Sbjct: 312 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVHKLPCFWNVQLSDHTRSEQ-CY 370

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
            +++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC    ++    L
Sbjct: 371 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEADVNSENL 430

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           Q+ +LS L+E+D CY+ RR + T  RTHLYFL YEYE +SD  DVTLVAQLSMDR     
Sbjct: 431 QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEYEPASDHTDVTLVAQLSMDR----- 484

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                       LQM+E + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 485 ----------------------------LQMLEAICKHWEGPISLALYLSDAEAQQFLRY 516

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
           A  SE L SR N+ YH+VYKEG FYP+N LRNVA+  V TPY+FL DIDFLPM+GLY YL
Sbjct: 517 AQGSEVLMSRHNVAYHIVYKEGQFYPVNLLRNVAMKHVGTPYMFLSDIDFLPMYGLYEYL 576

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           + S+  +D+     KK ++VPAFET RYR +FP S                        H
Sbjct: 577 RKSVIQLDL--ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGH 634

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           APTNF++W  ATTPY++EW  DFEPY+V  RD P YD RFVGFGWNK
Sbjct: 635 APTNFAKWRTATTPYRVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 681



 Score =  256 bits (655), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 111/168 (66%), Positives = 137/168 (81%)

Query: 5   EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
           ++PV+  CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A  IL TLF 
Sbjct: 128 QQPVVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 187

Query: 65  TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
           TW VP V V FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ F
Sbjct: 188 TWMVPAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPPNLERVIVLDTDITF 247

Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 248 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 295


>gi|301772300|ref|XP_002921570.1| PREDICTED: glycosyltransferase-like protein LARGE1-like [Ailuropoda
           melanoleuca]
          Length = 756

 Score =  642 bits (1655), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 312/527 (59%), Positives = 374/527 (70%), Gaps = 65/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY A+ +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ FATDI
Sbjct: 192 PAVRVDFYNAEELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 251

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR + 
Sbjct: 252 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 311

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   WR+ AE+ L+  L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 312 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 370

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
            +++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC    ++    L
Sbjct: 371 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEADVNSENL 430

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           Q+ +LS L+E+D CY+ RR + T  RTHLYFL YEYE +SD  DVTLVAQLSMDR     
Sbjct: 431 QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEYEPASDNTDVTLVAQLSMDR----- 484

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                       LQM+E + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 485 ----------------------------LQMLEAICKHWEGPISLALYLSDAEAQQFLRY 516

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
           A  SE L SR N+ YH+VYKEG FYP+N LRNVA+  V TPY+FL DIDFLPM+GLY YL
Sbjct: 517 AQGSEVLMSRHNVAYHIVYKEGQFYPVNLLRNVAMKHVGTPYMFLSDIDFLPMYGLYEYL 576

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           + S+  +D+     KK ++VPAFET RYR +FP S                        H
Sbjct: 577 RKSVIQLDL--ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGH 634

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           APTNF++W  ATTPY +EW  DFEPY+V  RD P YD RFVGFGWNK
Sbjct: 635 APTNFAKWRTATTPYHVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 681



 Score =  254 bits (650), Expect = 7e-65,   Method: Compositional matrix adjust.
 Identities = 110/168 (65%), Positives = 137/168 (81%)

Query: 5   EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
           ++PV+  CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A  IL TLF 
Sbjct: 128 QQPVVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFR 187

Query: 65  TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
           TW VP V V FY A+ +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ F
Sbjct: 188 TWMVPAVRVDFYNAEELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 247

Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 248 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 295


>gi|71682499|gb|AAI00400.1| Like-glycosyltransferase [Mus musculus]
          Length = 756

 Score =  641 bits (1654), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 311/527 (59%), Positives = 376/527 (71%), Gaps = 65/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ FATDI
Sbjct: 192 PAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 251

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LWA+F + + +Q +G VENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR + 
Sbjct: 252 AELWAVFHKFKGQQVLGSVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 311

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   WR+ AE+  +  L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 312 WEQMWRLTAEREHMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 370

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
            +++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC    ++    L
Sbjct: 371 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSETDVNNENL 430

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           Q+ +LS L+E++ CY+ RR + T  RTHLYFL YE+E S+D  DVTLVAQLSMDR     
Sbjct: 431 QK-QLSELDEDNLCYEFRRERFTVHRTHLYFLHYEFEPSADNTDVTLVAQLSMDR----- 484

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                       LQM+E + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 485 ----------------------------LQMLEAICKHWEGPISLALYLSDAEAQQFLRY 516

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
           A  SE L SR+N+GYH+VYKEG FYP+N LRNVA+  +STPY+FL DIDFLPM+GLY YL
Sbjct: 517 AQGSEVLMSRQNVGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLYEYL 576

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           + S+  +D+     KK ++VPAFET RYR +FP S                        H
Sbjct: 577 RKSVIQLDL--ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGH 634

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           APTNF++W  ATTPYQ+EW  DFEPY+V  RD P YD RFVGFGWNK
Sbjct: 635 APTNFAKWRTATTPYQVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 681



 Score =  253 bits (646), Expect = 3e-64,   Method: Compositional matrix adjust.
 Identities = 109/168 (64%), Positives = 135/168 (80%)

Query: 5   EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
           ++P +  CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A  IL TLF 
Sbjct: 128 QQPAVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 187

Query: 65  TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
           TW VP V V FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ F
Sbjct: 188 TWMVPAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 247

Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           ATDIA+LWA+F + + +Q +G VENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 248 ATDIAELWAVFHKFKGQQVLGSVENQSDWYLGNLWKNHRPWPALGRGY 295


>gi|348553098|ref|XP_003462364.1| PREDICTED: glycosyltransferase-like protein LARGE1-like [Cavia
           porcellus]
          Length = 759

 Score =  641 bits (1654), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 312/530 (58%), Positives = 375/530 (70%), Gaps = 68/530 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ FATDI
Sbjct: 192 PAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 251

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR + 
Sbjct: 252 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 311

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   WR+ AE+ L+  L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 312 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 370

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC-------NLTQ 398
            +++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC       + + 
Sbjct: 371 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEAGVADASS 430

Query: 399 TTLQQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQ 458
             LQ+ +LS L+E+D CY+ RR + T  RTHLYFL YEYE ++D  DVTLVAQLSMDR  
Sbjct: 431 KDLQK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEYEPAADSTDVTLVAQLSMDR-- 487

Query: 459 MVEINPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQF 518
                                          LQM+E + KHWEGPISL LY+SDAEAQQF
Sbjct: 488 -------------------------------LQMLEAICKHWEGPISLALYLSDAEAQQF 516

Query: 519 LSYAGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLY 578
           L YA  SE L  R N+GYH+VYKEG FYP+N LRNVAL  + TPY+FL DIDFLPM+GLY
Sbjct: 517 LRYAQGSEVLMGRHNVGYHIVYKEGQFYPVNLLRNVALKHIRTPYMFLSDIDFLPMYGLY 576

Query: 579 PYLKSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS---------------------- 616
            YL+ S+  +D+     KK ++VPAFET RYR +FP S                      
Sbjct: 577 EYLRKSVVQLDL--ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWT 634

Query: 617 --HAPTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
             HAPTNF++W  ATTPY++EW  DFEPY+V  RD P YD RFVGFGWNK
Sbjct: 635 KGHAPTNFAKWRTATTPYRVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 684



 Score =  254 bits (650), Expect = 9e-65,   Method: Compositional matrix adjust.
 Identities = 110/168 (65%), Positives = 136/168 (80%)

Query: 5   EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
           ++P +  CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A  IL TLF 
Sbjct: 128 QQPAVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFH 187

Query: 65  TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
           TW VP V V FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ F
Sbjct: 188 TWMVPAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 247

Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 248 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 295


>gi|354485819|ref|XP_003505079.1| PREDICTED: glycosyltransferase-like protein LARGE1 [Cricetulus
           griseus]
 gi|344238050|gb|EGV94153.1| Glycosyltransferase-like protein LARGE1 [Cricetulus griseus]
          Length = 756

 Score =  640 bits (1651), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 310/527 (58%), Positives = 375/527 (71%), Gaps = 65/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ FATDI
Sbjct: 192 PAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 251

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR + 
Sbjct: 252 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 311

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   WR+ AE+ L+  L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 312 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 370

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
            +++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC    ++    L
Sbjct: 371 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSETDVNSENL 430

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           Q+ +LS L+E+D CY+ RR + T  RTHLYFL YE+E ++D  DVTLVAQLSMDR     
Sbjct: 431 QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEFEPATDNTDVTLVAQLSMDR----- 484

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                       LQM+E + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 485 ----------------------------LQMLEAICKHWEGPISLALYLSDAEAQQFLRY 516

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
           A  SE L SR N+GYH+VYKEG FYP+N LRNVA+  +STPY+FL DIDFLPM+GLY YL
Sbjct: 517 AQGSEVLMSRHNVGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLYEYL 576

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           + S+  +D+     KK ++VPAFET RYR +FP S                        H
Sbjct: 577 RKSVIQLDL--ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGH 634

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           APTNF++W  AT PY++EW  DFEPY+V  RD   YD RFVGFGWNK
Sbjct: 635 APTNFAKWRTATMPYRVEWEADFEPYVVVRRDCSEYDRRFVGFGWNK 681



 Score =  255 bits (652), Expect = 5e-65,   Method: Compositional matrix adjust.
 Identities = 110/168 (65%), Positives = 136/168 (80%)

Query: 5   EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
           ++P +  CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A  IL TLF 
Sbjct: 128 QQPAVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 187

Query: 65  TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
           TW VP V V FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ F
Sbjct: 188 TWMVPAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 247

Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 248 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 295


>gi|390349971|ref|XP_781563.3| PREDICTED: glycosyltransferase-like protein LARGE1-like
           [Strongylocentrotus purpuratus]
          Length = 575

 Score =  640 bits (1651), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 318/530 (60%), Positives = 376/530 (70%), Gaps = 66/530 (12%)

Query: 163 KPW--PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVI 220
           K W  P +   FYLA++    V WIPNKHYSG+YGL+KL L ++LP  + KTIVLDTD+ 
Sbjct: 11  KTWDVPGVKVSFYLAENYKSHVEWIPNKHYSGLYGLMKLLLIEILPSNMEKTIVLDTDIT 70

Query: 221 FATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTK 280
           FA+DIA+LW LF R+  +QT+ LVENQSDWYLG LWK HKPWPALGRGFNTGVILLDL K
Sbjct: 71  FASDIAELWGLFQRMTSKQTLALVENQSDWYLGTLWKKHKPWPALGRGFNTGVILLDLQK 130

Query: 281 LRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRS 340
           LRD+ W   W + AEK L+++  T+LADQDI NA+I +HPYLVYTLPC WNVQLSDNTRS
Sbjct: 131 LRDLKWMQMWLLTAEKELMSQFSTALADQDIINAVIKQHPYLVYTLPCAWNVQLSDNTRS 190

Query: 341 DELCYTELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC--NLTQ 398
            ELCY E+TDLK IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC      
Sbjct: 191 -ELCYKEVTDLKAIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCTDGKAN 249

Query: 399 TTLQQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQ 458
            TL + +LS+L+E+D CY+ RR +L   RTHLY+L+Y+     D  DVTLVAQLSMDRLQ
Sbjct: 250 NTLHE-KLSSLDEDDQCYEFRREQLLMHRTHLYYLDYQLPKFLD-TDVTLVAQLSMDRLQ 307

Query: 459 MVEINPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQF 518
           M+E                                  + KHWEGPISL LYMSDAEAQQF
Sbjct: 308 MLEA---------------------------------ICKHWEGPISLALYMSDAEAQQF 334

Query: 519 LSYAGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLY 578
           + YA +SE L +R+NIGYHVVYK+G FYP+N LRN A++QV TPY+FL DIDFLPM+GLY
Sbjct: 335 IRYALSSETLMARKNIGYHVVYKDGQFYPVNYLRNTAMSQVKTPYMFLSDIDFLPMYGLY 394

Query: 579 PYLKSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS---------------------- 616
            YL+ +I  +DM      K L+VPAFET RYR  FP S                      
Sbjct: 395 DYLRKAIGMLDM--ATSNKALIVPAFETLRYRLNFPKSKADLLHMLDMGTLFTFRYHVWQ 452

Query: 617 --HAPTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
             HAPTNF+RW  ATTPYQ++W PDFEPY+V  +D P+YD RF+GFGWNK
Sbjct: 453 KGHAPTNFARWRTATTPYQVQWEPDFEPYVVVKKDCPKYDPRFLGFGWNK 502



 Score =  177 bits (449), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 79/115 (68%), Positives = 93/115 (80%)

Query: 58  ILQTLFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIV 117
           IL TLF TW VP V+VSFYLA++    V WIPNKHYSG+YGL+KL L ++LP  + KTIV
Sbjct: 5   ILSTLFKTWDVPGVKVSFYLAENYKSHVEWIPNKHYSGLYGLMKLLLIEILPSNMEKTIV 64

Query: 118 LDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           LDTD+ FA+DIA+LW LF R+  +QT+ LVENQSDWYLG LWK HKPWPALGRGF
Sbjct: 65  LDTDITFASDIAELWGLFQRMTSKQTLALVENQSDWYLGTLWKKHKPWPALGRGF 119


>gi|298684047|gb|ADI96203.1| LARGE-1 [Rattus losea]
          Length = 671

 Score =  638 bits (1645), Expect = e-180,   Method: Compositional matrix adjust.
 Identities = 311/527 (59%), Positives = 375/527 (71%), Gaps = 65/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ FATDI
Sbjct: 128 PAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 187

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR + 
Sbjct: 188 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 247

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   WR+ AE+ L+  L TSLADQDIFNA+I ++P+LV  LPC WNVQLSD+TRS++ CY
Sbjct: 248 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVXQLPCFWNVQLSDHTRSEQ-CY 306

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
            +++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC    ++    L
Sbjct: 307 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSETDVNSENL 366

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           Q+ +LS L+E+D CY+ RR + T  RTHLYFL YE+E S+D  DVTLVAQLSMDR     
Sbjct: 367 QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEFEPSADHTDVTLVAQLSMDR----- 420

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                       LQM+E + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 421 ----------------------------LQMLEAICKHWEGPISLALYLSDAEAQQFLXY 452

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
           A  SE L SR+N+GYH+VYKEG  Y +N LRNVA+  +STPY+FL DIDFLPM+GLY YL
Sbjct: 453 AQGSEVLMSRQNVGYHIVYKEGQXYXVNFLRNVAMKXISTPYMFLSDIDFLPMYGLYEYL 512

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           + S+  +D+     KK ++VPAFET RYR +FP S                        H
Sbjct: 513 RKSVIQLDL--ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGH 570

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           APTNF++W  ATTPYQ+EW  DFEPY+V  RD P YD RFVGFGWNK
Sbjct: 571 APTNFAKWRTATTPYQVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 617



 Score =  255 bits (652), Expect = 5e-65,   Method: Compositional matrix adjust.
 Identities = 110/168 (65%), Positives = 136/168 (80%)

Query: 5   EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
           ++P +  CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A  IL TLF 
Sbjct: 64  QQPAVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 123

Query: 65  TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
           TW VP V V FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ F
Sbjct: 124 TWMVPAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 183

Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 184 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 231


>gi|432941969|ref|XP_004082928.1| PREDICTED: glycosyltransferase-like protein LARGE1-like isoform 1
           [Oryzias latipes]
          Length = 756

 Score =  638 bits (1645), Expect = e-180,   Method: Compositional matrix adjust.
 Identities = 319/527 (60%), Positives = 375/527 (71%), Gaps = 65/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L K IVLDTD+ FATDI
Sbjct: 192 PAVRVAFYDADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPSDLQKVIVLDTDITFATDI 251

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRGFNTGVILL L +LR + 
Sbjct: 252 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILLLLDRLRKLR 311

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   WR+ AE+ L+  L TSLADQDIFNA+I ++P+LV+ LPC WNVQLSD+TRS E CY
Sbjct: 312 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVHQLPCFWNVQLSDHTRS-EKCY 370

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
            +++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC    +     L
Sbjct: 371 KDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEADHNSENL 430

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           Q+  L  L+E+DPCY+ RR + T  RTHLYFL YEYE SSDG DVTLVAQLSMDRLQM+E
Sbjct: 431 QKT-LMELDEDDPCYEFRRERFTVHRTHLYFLHYEYEPSSDGTDVTLVAQLSMDRLQMLE 489

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                             + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 490 A---------------------------------ICKHWEGPISLALYLSDAEAQQFLRY 516

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
           A  SE L SR N+GYH+VYKEG FYP+N LRNVA+ QV TPY+FL DIDFLPM+GLY YL
Sbjct: 517 AQGSEVLMSRGNVGYHIVYKEGQFYPVNLLRNVAMRQVHTPYMFLSDIDFLPMYGLYEYL 576

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           + S+  +DM     KK LVVPAFET RYR +FP S                        H
Sbjct: 577 RKSVVQLDM--ANTKKALVVPAFETLRYRLSFPKSKAELLSQLDMGTLFTFRYHVWTKGH 634

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           APTNF++W  AT PY+++W PDFEPY++  RD P YD RFVGFGWNK
Sbjct: 635 APTNFAKWRTATMPYRVQWEPDFEPYVMVRRDSPEYDRRFVGFGWNK 681



 Score =  257 bits (657), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 112/168 (66%), Positives = 136/168 (80%)

Query: 5   EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
           ++P +  CE I VAIVCAGYN++R +VTL+KS+LF+RKNPLHFH ITD++A  IL  LF 
Sbjct: 128 QQPTVDKCETIHVAIVCAGYNASRDVVTLVKSVLFHRKNPLHFHFITDSIAQKILSALFH 187

Query: 65  TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
           TW VP V V+FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L K IVLDTD+ F
Sbjct: 188 TWMVPAVRVAFYDADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPSDLQKVIVLDTDITF 247

Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRGF
Sbjct: 248 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGF 295


>gi|443688665|gb|ELT91285.1| hypothetical protein CAPTEDRAFT_168890 [Capitella teleta]
          Length = 710

 Score =  638 bits (1645), Expect = e-180,   Method: Compositional matrix adjust.
 Identities = 312/523 (59%), Positives = 372/523 (71%), Gaps = 61/523 (11%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           P +   FY AD +   VSWIPNKHYSGVYGL+KLTLP  LP+TL K IVLDTD+ FA+DI
Sbjct: 153 PTVTVTFYPADDLKSVVSWIPNKHYSGVYGLMKLTLPTTLPDTLNKVIVLDTDITFASDI 212

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A LW LF +   +Q +GLVENQSDWYLGK+WKNH+PWPALGRGFNTGVILLDL KLR I+
Sbjct: 213 ADLWNLFRKFTGKQAMGLVENQSDWYLGKIWKNHRPWPALGRGFNTGVILLDLEKLRAIN 272

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   WR+IAE+ L+T L T+LADQDIFNA+I +HP LVYTLPCQWNVQLSDNTRS++ CY
Sbjct: 273 WIQAWRLIAERELMTMLSTALADQDIFNAVIKQHPMLVYTLPCQWNVQLSDNTRSEQ-CY 331

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQTTLQQAE 405
           T+++DLKIIHWNSP KL+VKNKH+E FR+LYLTFL+YDGNLLRRELFGC          +
Sbjct: 332 TDVSDLKIIHWNSPHKLQVKNKHIEVFRSLYLTFLQYDGNLLRRELFGCGGESKPSLSNQ 391

Query: 406 LSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEINPN 465
           +   NE+D CYD RR +    RTHLY+L++ +    D NDVTLVAQLSMDRL        
Sbjct: 392 ILEANEDDECYDFRRERNLVHRTHLYYLDFNFTPDPD-NDVTLVAQLSMDRL-------- 442

Query: 466 NVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNS 525
                                    QM+EML +HWEGPISL LYMSD EAQQFL YA +S
Sbjct: 443 -------------------------QMLEMLCQHWEGPISLALYMSDTEAQQFLQYALSS 477

Query: 526 EALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSI 585
           E L SR+NIG+H+VYK+G FYP+N LRNVAL+Q ++PYV L DIDFLPMFGLY YL+ ++
Sbjct: 478 EILMSRKNIGFHIVYKDGQFYPVNYLRNVALSQATSPYVLLSDIDFLPMFGLYEYLRKAV 537

Query: 586 RSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------HAPTN 621
               +     KK LV+PAFETQRYR  FP S                        H+PTN
Sbjct: 538 SMTSVMQE--KKALVIPAFETQRYRLTFPTSKAELLTMLDMGTLFTFRYHVWPQGHSPTN 595

Query: 622 FSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           +++W  ATTPY++EW  DFEPY+V  R++P YDTRFVGFGWNK
Sbjct: 596 YAKWRTATTPYEVEWDQDFEPYVVVPREIPMYDTRFVGFGWNK 638



 Score =  259 bits (661), Expect = 4e-66,   Method: Compositional matrix adjust.
 Identities = 116/172 (67%), Positives = 140/172 (81%)

Query: 1   MSSDEEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQ 60
           +  ++ P +P CEVI VAIVCA YNS+R +VTL+KSILFY K+PLHFH I+DT+A  IL+
Sbjct: 85  LKCEQVPEVPKCEVIHVAIVCAQYNSSRDVVTLVKSILFYGKHPLHFHFISDTIAQRILE 144

Query: 61  TLFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDT 120
           T+F TW+VP V V+FY AD +   VSWIPNKHYSGVYGL+KLTLP  LP+TL K IVLDT
Sbjct: 145 TVFRTWAVPTVTVTFYPADDLKSVVSWIPNKHYSGVYGLMKLTLPTTLPDTLNKVIVLDT 204

Query: 121 DVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           D+ FA+DIA LW LF +   +Q +GLVENQSDWYLGK+WKNH+PWPALGRGF
Sbjct: 205 DITFASDIADLWNLFRKFTGKQAMGLVENQSDWYLGKIWKNHRPWPALGRGF 256


>gi|432941971|ref|XP_004082929.1| PREDICTED: glycosyltransferase-like protein LARGE1-like isoform 2
           [Oryzias latipes]
          Length = 755

 Score =  638 bits (1645), Expect = e-180,   Method: Compositional matrix adjust.
 Identities = 319/527 (60%), Positives = 375/527 (71%), Gaps = 65/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L K IVLDTD+ FATDI
Sbjct: 191 PAVRVAFYDADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPSDLQKVIVLDTDITFATDI 250

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRGFNTGVILL L +LR + 
Sbjct: 251 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILLLLDRLRKLR 310

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   WR+ AE+ L+  L TSLADQDIFNA+I ++P+LV+ LPC WNVQLSD+TRS E CY
Sbjct: 311 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVHQLPCFWNVQLSDHTRS-EKCY 369

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
            +++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC    +     L
Sbjct: 370 KDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEADHNSENL 429

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           Q+  L  L+E+DPCY+ RR + T  RTHLYFL YEYE SSDG DVTLVAQLSMDRLQM+E
Sbjct: 430 QKT-LMELDEDDPCYEFRRERFTVHRTHLYFLHYEYEPSSDGTDVTLVAQLSMDRLQMLE 488

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                             + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 489 A---------------------------------ICKHWEGPISLALYLSDAEAQQFLRY 515

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
           A  SE L SR N+GYH+VYKEG FYP+N LRNVA+ QV TPY+FL DIDFLPM+GLY YL
Sbjct: 516 AQGSEVLMSRGNVGYHIVYKEGQFYPVNLLRNVAMRQVHTPYMFLSDIDFLPMYGLYEYL 575

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           + S+  +DM     KK LVVPAFET RYR +FP S                        H
Sbjct: 576 RKSVVQLDM--ANTKKALVVPAFETLRYRLSFPKSKAELLSQLDMGTLFTFRYHVWTKGH 633

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           APTNF++W  AT PY+++W PDFEPY++  RD P YD RFVGFGWNK
Sbjct: 634 APTNFAKWRTATMPYRVQWEPDFEPYVMVRRDSPEYDRRFVGFGWNK 680



 Score =  257 bits (657), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 112/168 (66%), Positives = 136/168 (80%)

Query: 5   EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
           ++P +  CE I VAIVCAGYN++R +VTL+KS+LF+RKNPLHFH ITD++A  IL  LF 
Sbjct: 127 QQPTVDKCETIHVAIVCAGYNASRDVVTLVKSVLFHRKNPLHFHFITDSIAQKILSALFH 186

Query: 65  TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
           TW VP V V+FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L K IVLDTD+ F
Sbjct: 187 TWMVPAVRVAFYDADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPSDLQKVIVLDTDITF 246

Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRGF
Sbjct: 247 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGF 294


>gi|410918397|ref|XP_003972672.1| PREDICTED: glycosyltransferase-like protein LARGE1-like [Takifugu
           rubripes]
          Length = 755

 Score =  636 bits (1641), Expect = e-179,   Method: Compositional matrix adjust.
 Identities = 316/527 (59%), Positives = 377/527 (71%), Gaps = 65/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ FATDI
Sbjct: 191 PAVKVNFYDADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPSDLQRVIVLDTDITFATDI 250

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LWA+F + R +Q +GLVENQSDWYLG LWKNH+PWPALGRGFNTGVILL L +LR + 
Sbjct: 251 AELWAVFHKFRGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILLLLDRLRKLR 310

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   WR+ AE+ L++ L TSLADQDIFNA+I ++P+LV+ LPC WNVQLSD+TRS E CY
Sbjct: 311 WEQMWRLTAERELMSMLSTSLADQDIFNAVIKQNPFLVHQLPCFWNVQLSDHTRS-EKCY 369

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
            +++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC    +     L
Sbjct: 370 KDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSETDHNSENL 429

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           Q+  LS L+E+DPCY+ RR + T  RTHLYFL YEYEA+SD  DVTLVAQLSMDR     
Sbjct: 430 QKT-LSELDEDDPCYEFRRERFTVHRTHLYFLHYEYEAASDNTDVTLVAQLSMDR----- 483

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                       LQM+E + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 484 ----------------------------LQMLEAICKHWEGPISLALYLSDAEAQQFLRY 515

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
           A  S+ L SR N+GYH+VYKEG FYP+N LRNVA+ QV+TPY+FL DIDFLPM+GLY YL
Sbjct: 516 AQGSDVLMSRGNVGYHIVYKEGQFYPVNLLRNVAMQQVNTPYMFLSDIDFLPMYGLYEYL 575

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           + S+  +DM     KK LVVPAFET RYR ++P S                        H
Sbjct: 576 RKSVVQLDM--ANAKKALVVPAFETLRYRLSYPKSKAELLSQLDMGTLFTFRYHVWTKGH 633

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           APTNF++W  ATT Y+++W  DFEPY++  RD P YD RFVGFGWNK
Sbjct: 634 APTNFAKWRTATTAYRVQWEADFEPYVMVRRDSPEYDRRFVGFGWNK 680



 Score =  258 bits (659), Expect = 7e-66,   Method: Compositional matrix adjust.
 Identities = 111/168 (66%), Positives = 139/168 (82%)

Query: 5   EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
           ++P +  CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFH ITD++A +IL +LF 
Sbjct: 127 QQPAVDKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHFITDSIAQHILSSLFH 186

Query: 65  TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
           TW VP V+V+FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ F
Sbjct: 187 TWMVPAVKVNFYDADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPSDLQRVIVLDTDITF 246

Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           ATDIA+LWA+F + R +Q +GLVENQSDWYLG LWKNH+PWPALGRGF
Sbjct: 247 ATDIAELWAVFHKFRGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGF 294


>gi|426225215|ref|XP_004006763.1| PREDICTED: LOW QUALITY PROTEIN: glycosyltransferase-like protein
           LARGE1 [Ovis aries]
          Length = 758

 Score =  636 bits (1640), Expect = e-179,   Method: Compositional matrix adjust.
 Identities = 312/532 (58%), Positives = 375/532 (70%), Gaps = 70/532 (13%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ FATDI
Sbjct: 192 PAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 251

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR + 
Sbjct: 252 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 311

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   WR+ AE+ L+  L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 312 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 370

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
            +++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC    ++    L
Sbjct: 371 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEADVNSENL 430

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           Q+ +LS L+E+D CY+ RR + T  RTHLYFL YEYE +SD  DVTLVAQLSMDR     
Sbjct: 431 QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEYEPASDNTDVTLVAQLSMDR----- 484

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                       LQM+E + KHWEGP+SL LY+SDAEAQQFL Y
Sbjct: 485 ----------------------------LQMLEAICKHWEGPVSLALYLSDAEAQQFLRY 516

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
           A  SE L  R N+ YH+VYKEG FYP+N LRNVA+  VSTPY+FL DIDFLPM+GLY YL
Sbjct: 517 AQGSEVLMGRHNVAYHIVYKEGQFYPVNLLRNVAMKHVSTPYMFLSDIDFLPMYGLYEYL 576

Query: 582 K-----SSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS-------------------- 616
           +      S+  +D+     KK ++VPAFET RYR +FP S                    
Sbjct: 577 RXGRAGKSVIQLDL--ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHV 634

Query: 617 ----HAPTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
               HAPTNF++W  ATTPY++EW  DFEPY+V  RD P YD RFVGFGWNK
Sbjct: 635 WTKGHAPTNFAKWRTATTPYRVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 686



 Score =  256 bits (654), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 111/168 (66%), Positives = 137/168 (81%)

Query: 5   EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
           ++PV+  CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A  IL TLF 
Sbjct: 128 QQPVVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 187

Query: 65  TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
           TW VP V V FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ F
Sbjct: 188 TWMVPAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 247

Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 248 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 295


>gi|221043764|dbj|BAH13559.1| unnamed protein product [Homo sapiens]
          Length = 555

 Score =  634 bits (1634), Expect = e-179,   Method: Compositional matrix adjust.
 Identities = 307/512 (59%), Positives = 369/512 (72%), Gaps = 65/512 (12%)

Query: 181 DVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQT 240
           +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ FATDIA+LWA+F + + +Q 
Sbjct: 6   EVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDIAELWAVFHKFKGQQV 65

Query: 241 IGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLT 300
           +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR + W   WR+ AE+ L+ 
Sbjct: 66  LGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMKWEQMWRLTAERELMG 125

Query: 301 RLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWNSPK 360
            L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY +++DLK+IHWNSPK
Sbjct: 126 MLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CYRDVSDLKVIHWNSPK 184

Query: 361 KLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTLQQAELSNLNEEDPCY 416
           KL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC    ++    LQ+ +LS L+E+D CY
Sbjct: 185 KLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEADVNSENLQK-QLSELDEDDLCY 243

Query: 417 DLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEINPNNVDEYEASSDG 476
           + RR + T  RTHLYFL YEYE ++D  DVTLVAQLSMDRLQM+E               
Sbjct: 244 EFRRERFTVHRTHLYFLHYEYEPAADSTDVTLVAQLSMDRLQMLEA-------------- 289

Query: 477 NDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRNIGY 536
                              + KHWEGPISL LY+SDAEAQQFL YA  SE L SR N+GY
Sbjct: 290 -------------------ICKHWEGPISLALYLSDAEAQQFLRYAQGSEVLMSRHNVGY 330

Query: 537 HVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGK 596
           H+VYKEG FYP+N LRNVA+  +STPY+FL DIDFLPM+GLY YL+ S+  +D+     K
Sbjct: 331 HIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLYEYLRKSVIQLDL--ANTK 388

Query: 597 KVLVVPAFETQRYRTAFPAS------------------------HAPTNFSRWVNATTPY 632
           K ++VPAFET RYR +FP S                        HAPTNF++W  ATTPY
Sbjct: 389 KAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGHAPTNFAKWRTATTPY 448

Query: 633 QIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           ++EW  DFEPY+V  RD P YD RFVGFGWNK
Sbjct: 449 RVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 480



 Score =  149 bits (377), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 63/89 (70%), Positives = 76/89 (85%)

Query: 84  DVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQT 143
           +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ FATDIA+LWA+F + + +Q 
Sbjct: 6   EVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDIAELWAVFHKFKGQQV 65

Query: 144 IGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 66  LGLVENQSDWYLGNLWKNHRPWPALGRGY 94


>gi|348523141|ref|XP_003449082.1| PREDICTED: glycosyltransferase-like protein LARGE1-like isoform 2
           [Oreochromis niloticus]
          Length = 756

 Score =  630 bits (1625), Expect = e-178,   Method: Compositional matrix adjust.
 Identities = 314/527 (59%), Positives = 375/527 (71%), Gaps = 65/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ FATDI
Sbjct: 192 PAVRVDFYDADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPSDLQRVIVLDTDITFATDI 251

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRGFNTGVILL L +LR + 
Sbjct: 252 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILLLLDRLRKLR 311

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   WR+ AE+ L++ L TSLADQDIFNA+I ++P+LV+ LPC WNVQLSD+TRS E CY
Sbjct: 312 WEQMWRLTAERELMSMLSTSLADQDIFNAVIKQNPFLVHQLPCFWNVQLSDHTRS-EKCY 370

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
            +++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC    +     L
Sbjct: 371 KDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEADHNSENL 430

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           Q+  LS L+E+DPCY+ RR + T  RTHLYFL YEY+ SSD  DVTLVAQLSMDR     
Sbjct: 431 QKT-LSELDEDDPCYEFRRERFTVHRTHLYFLHYEYKPSSDNTDVTLVAQLSMDR----- 484

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                       LQM+E + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 485 ----------------------------LQMLEAICKHWEGPISLALYLSDAEAQQFLRY 516

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
           A  SE L SR N+GYH+VYKEG FYP+N LRNVA+  V+TPY+FL DIDFLPM+ LY YL
Sbjct: 517 AQGSEVLMSRGNVGYHIVYKEGQFYPVNLLRNVAMRHVNTPYMFLSDIDFLPMYSLYEYL 576

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           + S+  +DM     KK LVVPAFET RYR ++P S                        H
Sbjct: 577 RKSVVQLDM--ANTKKALVVPAFETLRYRLSYPKSKAELLSQLDMGTLFTFRYHVWTKGH 634

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           APTNF++W  ATTPY+++W  DFEPY++  RD P YD RFVGFGWNK
Sbjct: 635 APTNFAKWRTATTPYRVQWEADFEPYVMVRRDSPEYDRRFVGFGWNK 681



 Score =  256 bits (653), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 114/173 (65%), Positives = 138/173 (79%), Gaps = 2/173 (1%)

Query: 2   SSD--EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNIL 59
           SSD    PV+  CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFH ITD++A  IL
Sbjct: 123 SSDCVRHPVVDKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHFITDSIAQQIL 182

Query: 60  QTLFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLD 119
            +LF TW VP V V FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLD
Sbjct: 183 SSLFHTWMVPAVRVDFYDADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPSDLQRVIVLD 242

Query: 120 TDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           TD+ FATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRGF
Sbjct: 243 TDITFATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGF 295


>gi|348523139|ref|XP_003449081.1| PREDICTED: glycosyltransferase-like protein LARGE1-like isoform 1
           [Oreochromis niloticus]
          Length = 755

 Score =  630 bits (1624), Expect = e-178,   Method: Compositional matrix adjust.
 Identities = 314/527 (59%), Positives = 375/527 (71%), Gaps = 65/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ FATDI
Sbjct: 191 PAVRVDFYDADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPSDLQRVIVLDTDITFATDI 250

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRGFNTGVILL L +LR + 
Sbjct: 251 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILLLLDRLRKLR 310

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   WR+ AE+ L++ L TSLADQDIFNA+I ++P+LV+ LPC WNVQLSD+TRS E CY
Sbjct: 311 WEQMWRLTAERELMSMLSTSLADQDIFNAVIKQNPFLVHQLPCFWNVQLSDHTRS-EKCY 369

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
            +++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC    +     L
Sbjct: 370 KDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEADHNSENL 429

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           Q+  LS L+E+DPCY+ RR + T  RTHLYFL YEY+ SSD  DVTLVAQLSMDR     
Sbjct: 430 QKT-LSELDEDDPCYEFRRERFTVHRTHLYFLHYEYKPSSDNTDVTLVAQLSMDR----- 483

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                       LQM+E + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 484 ----------------------------LQMLEAICKHWEGPISLALYLSDAEAQQFLRY 515

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
           A  SE L SR N+GYH+VYKEG FYP+N LRNVA+  V+TPY+FL DIDFLPM+ LY YL
Sbjct: 516 AQGSEVLMSRGNVGYHIVYKEGQFYPVNLLRNVAMRHVNTPYMFLSDIDFLPMYSLYEYL 575

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           + S+  +DM     KK LVVPAFET RYR ++P S                        H
Sbjct: 576 RKSVVQLDM--ANTKKALVVPAFETLRYRLSYPKSKAELLSQLDMGTLFTFRYHVWTKGH 633

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           APTNF++W  ATTPY+++W  DFEPY++  RD P YD RFVGFGWNK
Sbjct: 634 APTNFAKWRTATTPYRVQWEADFEPYVMVRRDSPEYDRRFVGFGWNK 680



 Score =  256 bits (653), Expect = 4e-65,   Method: Compositional matrix adjust.
 Identities = 114/173 (65%), Positives = 138/173 (79%), Gaps = 2/173 (1%)

Query: 2   SSD--EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNIL 59
           SSD    PV+  CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFH ITD++A  IL
Sbjct: 122 SSDCVRHPVVDKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHFITDSIAQQIL 181

Query: 60  QTLFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLD 119
            +LF TW VP V V FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLD
Sbjct: 182 SSLFHTWMVPAVRVDFYDADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPSDLQRVIVLD 241

Query: 120 TDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           TD+ FATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRGF
Sbjct: 242 TDITFATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGF 294


>gi|291238057|ref|XP_002738950.1| PREDICTED: like-glycosyltransferase-like [Saccoglossus kowalevskii]
          Length = 749

 Score =  629 bits (1623), Expect = e-177,   Method: Compositional matrix adjust.
 Identities = 303/512 (59%), Positives = 366/512 (71%), Gaps = 62/512 (12%)

Query: 179 VEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQR 238
           V+DVSWIPNKHYSGVYGL+KL L KVLP+ + K IVLDTDV FATDIA+LW +F +L+ +
Sbjct: 201 VDDVSWIPNKHYSGVYGLMKLLLTKVLPKDMDKVIVLDTDVTFATDIAELWKIFRKLKNK 260

Query: 239 QTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFL 298
           + IGLVENQSDWYLG LW+ HKPWPALGRGFNTGVILLD+ KLR + W   WR+ AE+ L
Sbjct: 261 KAIGLVENQSDWYLGTLWRKHKPWPALGRGFNTGVILLDVKKLRKMKWMQMWRLTAEREL 320

Query: 299 LTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWNS 358
           ++ L TSLADQDIFNA+I +HP +V+ LPC WNVQLS+NTRS E CYTE++D+K+IHWNS
Sbjct: 321 MSMLSTSLADQDIFNAVIKQHPDIVHQLPCAWNVQLSENTRS-ETCYTEISDIKVIHWNS 379

Query: 359 PKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQTTLQQA--ELSNLNEEDPCY 416
           P+KL+VKN H+EFFRNLYLTFLEYDGNLL+RELFGC   Q   ++   E++ L+E+D CY
Sbjct: 380 PQKLRVKNNHIEFFRNLYLTFLEYDGNLLKRELFGCQAYQHKPKEVVKEITELDEDDQCY 439

Query: 417 DLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEINPNNVDEYEASSDG 476
           + RR KL + R HLY+L+Y+Y    DG DVTLVAQLSMDRLQM+E   N           
Sbjct: 440 EFRREKLLTHRIHLYYLDYQYIQDLDGTDVTLVAQLSMDRLQMIESICN----------- 488

Query: 477 NDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRNIGY 536
                                 HW GPIS  LYMSDAE QQFL YA +S+ L  R+N+GY
Sbjct: 489 ----------------------HWLGPISFALYMSDAEVQQFLRYALSSDILMKRKNVGY 526

Query: 537 HVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGK 596
           H+VYK+G FYP+N LRNVA+ QV+TPYV+L DIDFLPM+GLY YLK +I  MDM      
Sbjct: 527 HIVYKDGQFYPVNFLRNVAMKQVNTPYVYLSDIDFLPMYGLYEYLKKAIGMMDM--SKAN 584

Query: 597 KVLVVPAFETQRYRTAFPAS------------------------HAPTNFSRWVNATTPY 632
           K L+VPAFET RYR  FP S                        H+PTNF++W  ATTPY
Sbjct: 585 KALIVPAFETLRYRLTFPKSKAELLTMLDMGSLYTFRYHVWQKGHSPTNFAKWRTATTPY 644

Query: 633 QIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           +++W  DFEPY+V  +D P YDTRFVGFGWNK
Sbjct: 645 KVQWESDFEPYVVVKKDSPLYDTRFVGFGWNK 676



 Score =  230 bits (586), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 110/183 (60%), Positives = 133/183 (72%), Gaps = 14/183 (7%)

Query: 1   MSSDEEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQ 60
           + ++ E  +P CEVI VAIVCAGYN++R ++TLIKS+LFYRKNP+H H + D      L 
Sbjct: 112 ICTNVEENVPKCEVIHVAIVCAGYNASREVITLIKSVLFYRKNPIHLHFLADKPGQLTLS 171

Query: 61  TLFSTW-----------SVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLP 109
           TLF TW           S+ Q   SF L    V+DVSWIPNKHYSGVYGL+KL L KVLP
Sbjct: 172 TLFDTWDITGGMLIGESSLAQPIKSFNLT---VDDVSWIPNKHYSGVYGLMKLLLTKVLP 228

Query: 110 ETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALG 169
           + + K IVLDTDV FATDIA+LW +F +L+ ++ IGLVENQSDWYLG LW+ HKPWPALG
Sbjct: 229 KDMDKVIVLDTDVTFATDIAELWKIFRKLKNKKAIGLVENQSDWYLGTLWRKHKPWPALG 288

Query: 170 RGF 172
           RGF
Sbjct: 289 RGF 291


>gi|52219122|ref|NP_001004537.1| glycosyltransferase-like protein LARGE1 [Danio rerio]
 gi|82083911|sp|Q66PG2.1|LARGE_DANRE RecName: Full=Glycosyltransferase-like protein LARGE1; AltName:
           Full=Acetylglucosaminyltransferase-like 1A
 gi|51872295|gb|AAU12251.1| glycosyltransferase-like 1A [Danio rerio]
 gi|190336617|gb|AAI62096.1| Like-glycosyltransferase [Danio rerio]
          Length = 757

 Score =  629 bits (1622), Expect = e-177,   Method: Compositional matrix adjust.
 Identities = 312/527 (59%), Positives = 375/527 (71%), Gaps = 65/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY AD +  +VSWIPNKHYSG++GL+KL L K LP  L K IVLDTD+ FATDI
Sbjct: 193 PAVHVNFYDADELKSEVSWIPNKHYSGIHGLMKLVLTKTLPSDLEKVIVLDTDITFATDI 252

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LW +F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRGFNTGVILL L +LR + 
Sbjct: 253 AELWVVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILLLLDRLRKLK 312

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   WR+ AE+ L++ L TSLADQDIFNA+I ++P+LV+ LPC WNVQLSD+TRS E CY
Sbjct: 313 WEQMWRLTAERELMSMLSTSLADQDIFNAVIKQNPFLVHQLPCYWNVQLSDHTRS-EKCY 371

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
            +++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC    +     L
Sbjct: 372 KDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSETDHNSENL 431

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           Q+  LS L+E+DPCY+ RR + T  RTH+YFL YEYE + D  DVTLVAQLSMDR     
Sbjct: 432 QKT-LSELDEDDPCYEFRRERFTVHRTHVYFLHYEYEPTVDNTDVTLVAQLSMDR----- 485

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                       LQM+E + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 486 ----------------------------LQMLEAICKHWEGPISLALYLSDAEAQQFLRY 517

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
           A  SE L SR N+GYH+VYKEG FYP+N LRNVA+ QV+TPY+FL DIDFLPM+GLY YL
Sbjct: 518 AQGSEVLMSRSNVGYHIVYKEGQFYPVNLLRNVAMGQVNTPYMFLSDIDFLPMYGLYEYL 577

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           + S+  +DM     KK LVVPAFET RYR +FP S                        H
Sbjct: 578 RKSVVQLDM--GNTKKALVVPAFETLRYRLSFPKSKAELLSQLDMGTLFTFRYHVWTKGH 635

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           APT+F++W  ATTPY+++W  DFEPY++  R+ P YD RFVGFGWNK
Sbjct: 636 APTDFAKWRTATTPYRVQWEADFEPYVMVRRESPEYDRRFVGFGWNK 682



 Score =  253 bits (645), Expect = 3e-64,   Method: Compositional matrix adjust.
 Identities = 109/167 (65%), Positives = 136/167 (81%)

Query: 6   EPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFST 65
           +PV+  CE I +AIVCAGYN++R +VTL+KS+LF+R+NPLHFH+ITD++A  IL  LF T
Sbjct: 130 QPVVKKCETIHIAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHIITDSIARKILADLFHT 189

Query: 66  WSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFA 125
           W VP V V+FY AD +  +VSWIPNKHYSG++GL+KL L K LP  L K IVLDTD+ FA
Sbjct: 190 WMVPAVHVNFYDADELKSEVSWIPNKHYSGIHGLMKLVLTKTLPSDLEKVIVLDTDITFA 249

Query: 126 TDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           TDIA+LW +F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRGF
Sbjct: 250 TDIAELWVVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGF 296


>gi|52138697|ref|NP_001004404.1| glycosyltransferase-like protein LARGE2 [Gallus gallus]
 gi|82083916|sp|Q66PG4.1|LARG2_CHICK RecName: Full=Glycosyltransferase-like protein LARGE2; AltName:
           Full=Glycosyltransferase-like 1B
 gi|51872291|gb|AAU12249.1| glycosyltransferase-like 1B [Gallus gallus]
          Length = 739

 Score =  629 bits (1621), Expect = e-177,   Method: Compositional matrix adjust.
 Identities = 317/527 (60%), Positives = 373/527 (70%), Gaps = 65/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           P++   FY AD +  +VSWIPNKHYSG+YGL+KLTL K LP  L+K IVLDTD+ FATDI
Sbjct: 175 PSIHVSFYNADDLKPEVSWIPNKHYSGIYGLMKLTLTKALPSNLSKVIVLDTDITFATDI 234

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LWA+F +  ++Q IGLVENQSDWYLG LWKNHKPWPALGRGFNTGVILL L +LR + 
Sbjct: 235 AELWAVFGKFSEKQVIGLVENQSDWYLGNLWKNHKPWPALGRGFNTGVILLLLDRLRRLG 294

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   WR+ AE+ L++ L TSLADQDIFNA+I ++P LVY LPC WNVQLSD+TRS ELCY
Sbjct: 295 WEQMWRLTAERELMSMLSTSLADQDIFNAVIKQNPALVYRLPCFWNVQLSDHTRS-ELCY 353

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
           TE++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC    +     L
Sbjct: 354 TEVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCASLPSPPSDQL 413

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           QQA L  L+E+DPCYD RR  LT  R HL+FL+YE+ A  +  DVTLVAQLSMDR     
Sbjct: 414 QQA-LEELDEDDPCYDFRRQHLTQHRVHLFFLQYEFLALPNPTDVTLVAQLSMDR----- 467

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                       LQM+E + KHW GPISL LYMSDAEAQQFL Y
Sbjct: 468 ----------------------------LQMLEAICKHWAGPISLALYMSDAEAQQFLRY 499

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
           A  SE L +RRN+ YH+VYKEG FYPIN LRNVAL    TPYVFL DIDFLPM+GLY YL
Sbjct: 500 AQASEVLSARRNVAYHIVYKEGQFYPINLLRNVALANTQTPYVFLTDIDFLPMYGLYDYL 559

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           ++SI+ +++     K  L+VPAFET  YR  FP S                        H
Sbjct: 560 RNSIQQLELPHR--KAALIVPAFETLHYRLTFPKSKAELLSMLDMGSLYTFRYHVWPKGH 617

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           APT++++W  AT PY++ W PDFEPY+V  RD P+YD RFVGFGWNK
Sbjct: 618 APTDYAKWRTATVPYRVAWQPDFEPYVVVRRDCPKYDQRFVGFGWNK 664



 Score =  261 bits (667), Expect = 9e-67,   Method: Compositional matrix adjust.
 Identities = 116/171 (67%), Positives = 142/171 (83%)

Query: 2   SSDEEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQT 61
           +  E+ ++  CE++ VAIVCAG+N++R +VTL+KSILF+RKNPLHFH ITD+VA  ILQT
Sbjct: 108 ACPEQRMVRKCELLHVAIVCAGHNASRDVVTLVKSILFHRKNPLHFHFITDSVAHQILQT 167

Query: 62  LFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTD 121
           LF +W VP + VSFY AD +  +VSWIPNKHYSG+YGL+KLTL K LP  L+K IVLDTD
Sbjct: 168 LFQSWMVPSIHVSFYNADDLKPEVSWIPNKHYSGIYGLMKLTLTKALPSNLSKVIVLDTD 227

Query: 122 VIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           + FATDIA+LWA+F +  ++Q IGLVENQSDWYLG LWKNHKPWPALGRGF
Sbjct: 228 ITFATDIAELWAVFGKFSEKQVIGLVENQSDWYLGNLWKNHKPWPALGRGF 278


>gi|327259715|ref|XP_003214681.1| PREDICTED: glycosyltransferase-like protein LARGE2-like [Anolis
           carolinensis]
          Length = 739

 Score =  627 bits (1618), Expect = e-177,   Method: Compositional matrix adjust.
 Identities = 312/527 (59%), Positives = 370/527 (70%), Gaps = 65/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           P++   FY AD +  D+SWIPNKHYSG+YGL+KLTL K LP  L+K IVLDTD+ FATDI
Sbjct: 175 PSVHISFYDADDLKPDISWIPNKHYSGIYGLMKLTLTKALPSNLSKVIVLDTDITFATDI 234

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LWA+F +   +Q IGLVENQSDWYLG LWKNHKPWPALGRGFNTGVILL L +LR I 
Sbjct: 235 AELWAVFGKFSDKQVIGLVENQSDWYLGNLWKNHKPWPALGRGFNTGVILLLLERLRRIG 294

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   WR+ AE+ L++ L TSLADQDIFNA+I + P LVY LPC WNVQLSD+TRS++ CY
Sbjct: 295 WEQMWRLTAERELMSMLSTSLADQDIFNAVIKQSPALVYQLPCFWNVQLSDHTRSEQ-CY 353

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
           TE++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC    +LT   L
Sbjct: 354 TEVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCASLPSLTSGQL 413

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           QQA L  L+E+DPCYD RR  L   R HL+FL+Y++ AS+D  DVTL+AQLSMDR     
Sbjct: 414 QQA-LEELDEDDPCYDFRRQSLIQHRVHLFFLQYDFPASTDAADVTLIAQLSMDR----- 467

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                       LQM+E + KHW GPISL LY+SDAEAQQFL Y
Sbjct: 468 ----------------------------LQMLEAICKHWTGPISLALYLSDAEAQQFLRY 499

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
           A  SE L +RRN+ YH+VYKEG FYP+N LRNVAL    T +VFL DIDFLPM+GLY YL
Sbjct: 500 AQASEVLSNRRNVAYHIVYKEGQFYPVNFLRNVALKNAQTSHVFLTDIDFLPMYGLYDYL 559

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           ++S+  +++     K   +VPAFET  YR  FP S                        H
Sbjct: 560 RTSVVQLELPER--KAAFIVPAFETLHYRLTFPKSKAELVSMLDMGSLYTFRYHVWPQGH 617

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           APT++++W  AT PYQ+EW P FEPY+V  RD P YD RFVGFGWNK
Sbjct: 618 APTDYAKWRTATVPYQVEWQPHFEPYVVVRRDCPLYDQRFVGFGWNK 664



 Score =  264 bits (675), Expect = 9e-68,   Method: Compositional matrix adjust.
 Identities = 117/168 (69%), Positives = 141/168 (83%)

Query: 5   EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
           ++ ++  CE++ VAIVCAGYN++R +VTL+KSILF+RKNPLHFHLITD+VA  ILQTLF 
Sbjct: 111 KQRMVQKCELLHVAIVCAGYNASRDVVTLVKSILFHRKNPLHFHLITDSVARQILQTLFQ 170

Query: 65  TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
           +W VP V +SFY AD +  D+SWIPNKHYSG+YGL+KLTL K LP  L+K IVLDTD+ F
Sbjct: 171 SWMVPSVHISFYDADDLKPDISWIPNKHYSGIYGLMKLTLTKALPSNLSKVIVLDTDITF 230

Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           ATDIA+LWA+F +   +Q IGLVENQSDWYLG LWKNHKPWPALGRGF
Sbjct: 231 ATDIAELWAVFGKFSDKQVIGLVENQSDWYLGNLWKNHKPWPALGRGF 278


>gi|47209488|emb|CAF89604.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 571

 Score =  627 bits (1617), Expect = e-177,   Method: Compositional matrix adjust.
 Identities = 313/532 (58%), Positives = 367/532 (68%), Gaps = 86/532 (16%)

Query: 181 DVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQT 240
           +VSWIPNKHYSG+YGL+KLTLPK LP  L++ +VLDTD+ FATDIA+LW +F +   +Q 
Sbjct: 4   EVSWIPNKHYSGIYGLMKLTLPKALPAHLSRVVVLDTDITFATDIAELWGVFRKFTDKQV 63

Query: 241 IGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLT 300
           IGLVENQSDWYLG LWKNHKPWPALGRGFNTGVILL L +LR + W   WR+ AEK L++
Sbjct: 64  IGLVENQSDWYLGNLWKNHKPWPALGRGFNTGVILLHLERLRSLGWEQMWRLTAEKELMS 123

Query: 301 RLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWNSPK 360
            L TSLADQDIFNA I ++P LV+ LPC WNVQLSD+TRS++ CYTE++DLK+IHWNSPK
Sbjct: 124 MLSTSLADQDIFNAFIKQNPALVHQLPCFWNVQLSDHTRSEQ-CYTEVSDLKVIHWNSPK 182

Query: 361 KLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNL---TQTTLQQAELSNLNEEDPCYD 417
           KL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC      +T   QA L  L+E+D CYD
Sbjct: 183 KLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSPPGPETVQLQAALEELDEDDQCYD 242

Query: 418 LRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEINPNNVDEYEASSDGN 477
            RR +LT+ R HL+FL+Y+Y  + D  D+TLVAQLSMDRL                    
Sbjct: 243 FRRERLTAHRLHLFFLQYKYTPTED--DITLVAQLSMDRL-------------------- 280

Query: 478 DVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRNIGYH 537
                        QM+E + KHW GPISL LYMSDAEAQQFL YA  SE L++RRN+GYH
Sbjct: 281 -------------QMLEAICKHWAGPISLALYMSDAEAQQFLRYAQASEVLKNRRNVGYH 327

Query: 538 VVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKK 597
           VVYKEG FYP+N LRNVAL    TPYVFL D+DFLPM+GLY YL+  +  MDM  H  KK
Sbjct: 328 VVYKEGQFYPVNLLRNVALRNARTPYVFLTDVDFLPMYGLYEYLRKLVLQMDM-AH-TKK 385

Query: 598 VLVVPAFETQRYRTAFPAS----------------------------------------- 616
            LVVPAFET RYR +FP S                                         
Sbjct: 386 ALVVPAFETLRYRLSFPKSKAELLSMLDMGTLYTFRYRSFTLGPGGLQSDLPFCCCRYHV 445

Query: 617 ----HAPTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
               HAPTN+++W  ATTPY++EW PDFEPY+V  RD P YD RFVGFGWNK
Sbjct: 446 WPKGHAPTNYAKWRTATTPYRVEWEPDFEPYVVVRRDCPEYDQRFVGFGWNK 497



 Score =  155 bits (392), Expect = 7e-35,   Method: Compositional matrix adjust.
 Identities = 66/89 (74%), Positives = 77/89 (86%)

Query: 84  DVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQT 143
           +VSWIPNKHYSG+YGL+KLTLPK LP  L++ +VLDTD+ FATDIA+LW +F +   +Q 
Sbjct: 4   EVSWIPNKHYSGIYGLMKLTLPKALPAHLSRVVVLDTDITFATDIAELWGVFRKFTDKQV 63

Query: 144 IGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           IGLVENQSDWYLG LWKNHKPWPALGRGF
Sbjct: 64  IGLVENQSDWYLGNLWKNHKPWPALGRGF 92


>gi|224051108|ref|XP_002200170.1| PREDICTED: glycosyltransferase-like protein LARGE2 [Taeniopygia
           guttata]
          Length = 739

 Score =  624 bits (1608), Expect = e-176,   Method: Compositional matrix adjust.
 Identities = 314/527 (59%), Positives = 372/527 (70%), Gaps = 65/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           P++   FY AD +  +VSWIPNKHYSG+YGL+KLTL K LP  L+K IVLDTD+ FATDI
Sbjct: 175 PSVHVSFYNADDLKPEVSWIPNKHYSGIYGLMKLTLTKALPSNLSKVIVLDTDITFATDI 234

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LWA+F +  ++Q IGLVENQSDWYLG LWKNHKPWPALGRGFNTGVILL L +LR + 
Sbjct: 235 AELWAVFKKFSEKQVIGLVENQSDWYLGNLWKNHKPWPALGRGFNTGVILLLLDRLRRLG 294

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   WR+ AE+ L++ L TSLADQDIFNA+I + P LVY LPC WNVQLSD+TRS++ CY
Sbjct: 295 WEQMWRLTAERELMSMLSTSLADQDIFNAVIKQDPSLVYRLPCFWNVQLSDHTRSEQ-CY 353

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
           TEL+DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC    +  +  L
Sbjct: 354 TELSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCASLPSPPRDQL 413

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           QQA L  L+E+DPCYD R+  L   R HL+FL+YE+ A  +  DVTLVAQLSMDR     
Sbjct: 414 QQA-LEELDEDDPCYDFRQQHLMQHRIHLFFLQYEFLALPNPTDVTLVAQLSMDR----- 467

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                       LQM+E + KHW GPISL LYMSDAEAQQFL Y
Sbjct: 468 ----------------------------LQMLEAICKHWAGPISLALYMSDAEAQQFLRY 499

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
           A  SE L +RRN+ YH+VYKEG FYPIN LRNVAL    TPYVFL DIDFLPM+GLY YL
Sbjct: 500 AQASEVLSARRNVAYHIVYKEGQFYPINLLRNVALANTQTPYVFLTDIDFLPMYGLYDYL 559

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           ++SI+ +++     K  L+VPAFET  YR  FP S                        H
Sbjct: 560 RNSIQQLELPQR--KAALIVPAFETLHYRLTFPKSKAELLSMLDMGSLYTFRYHVWPKGH 617

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           APT++++W  AT PY++ W PDFEPY+V  RD P+YD RFVGFGWNK
Sbjct: 618 APTDYAKWRTATVPYRVAWQPDFEPYVVVRRDCPKYDQRFVGFGWNK 664



 Score =  260 bits (664), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 116/171 (67%), Positives = 141/171 (82%)

Query: 2   SSDEEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQT 61
           +  ++  +  CE++ VAIVCAG+N++R +VTL+KSILF+RKNPLHFH ITD+VA  ILQT
Sbjct: 108 ACPKQRTVHKCELLHVAIVCAGHNASRDVVTLVKSILFHRKNPLHFHFITDSVAHQILQT 167

Query: 62  LFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTD 121
           LF +W VP V VSFY AD +  +VSWIPNKHYSG+YGL+KLTL K LP  L+K IVLDTD
Sbjct: 168 LFQSWMVPSVHVSFYNADDLKPEVSWIPNKHYSGIYGLMKLTLTKALPSNLSKVIVLDTD 227

Query: 122 VIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           + FATDIA+LWA+F +  ++Q IGLVENQSDWYLG LWKNHKPWPALGRGF
Sbjct: 228 ITFATDIAELWAVFKKFSEKQVIGLVENQSDWYLGNLWKNHKPWPALGRGF 278


>gi|297708702|ref|XP_002831097.1| PREDICTED: like-glycosyltransferase isoform 4 [Pongo abelii]
          Length = 707

 Score =  615 bits (1587), Expect = e-173,   Method: Compositional matrix adjust.
 Identities = 297/503 (59%), Positives = 357/503 (70%), Gaps = 66/503 (13%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ FATDI
Sbjct: 192 PAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 251

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR + 
Sbjct: 252 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 311

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   WR+ AE+ L+  L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 312 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 370

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
            +++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC    ++    L
Sbjct: 371 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEADVNSENL 430

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           Q+ +LS L+E+D CY+ RR + T  RTHLYFL YEYE ++D  DVTLVAQLSMDR     
Sbjct: 431 QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEYEPAADSTDVTLVAQLSMDR----- 484

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                       LQM+E + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 485 ----------------------------LQMLEAICKHWEGPISLALYLSDAEAQQFLRY 516

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
           A  SE L SR N+GYH+VYKEG FYP+N LRNVA+  +STPY+FL DIDFLPM+GLY YL
Sbjct: 517 AQGSEVLMSRHNVGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLYEYL 576

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPASHAPTNFSRWVNATTPYQIEWAPDFE 641
           +  +                           +   HAPTNF++W  ATTPY++EW  DFE
Sbjct: 577 RYHV---------------------------WTKGHAPTNFAKWRTATTPYRVEWEADFE 609

Query: 642 PYIVAHRDLPRYDTRFVGFGWNK 664
           PY+V  RD P YD RFVGFGWNK
Sbjct: 610 PYVVVRRDCPEYDRRFVGFGWNK 632



 Score =  257 bits (656), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 111/168 (66%), Positives = 137/168 (81%)

Query: 5   EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
           ++PV+  CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A  IL TLF 
Sbjct: 128 QQPVVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 187

Query: 65  TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
           TW VP V V FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ F
Sbjct: 188 TWMVPAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 247

Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 248 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 295


>gi|332859457|ref|XP_003317213.1| PREDICTED: like-glycosyltransferase [Pan troglodytes]
 gi|397501769|ref|XP_003821548.1| PREDICTED: glycosyltransferase-like protein LARGE1 isoform 5 [Pan
           paniscus]
          Length = 707

 Score =  615 bits (1586), Expect = e-173,   Method: Compositional matrix adjust.
 Identities = 297/503 (59%), Positives = 357/503 (70%), Gaps = 66/503 (13%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ FATDI
Sbjct: 192 PAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 251

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR + 
Sbjct: 252 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 311

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   WR+ AE+ L+  L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 312 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 370

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
            +++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC    ++    L
Sbjct: 371 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEADVNSENL 430

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           Q+ +LS L+E+D CY+ RR + T  RTHLYFL YEYE ++D  DVTLVAQLSMDR     
Sbjct: 431 QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEYEPAADSTDVTLVAQLSMDR----- 484

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                       LQM+E + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 485 ----------------------------LQMLEAICKHWEGPISLALYLSDAEAQQFLRY 516

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
           A  SE L SR N+GYH+VYKEG FYP+N LRNVA+  +STPY+FL DIDFLPM+GLY YL
Sbjct: 517 AQGSEVLMSRHNVGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLYEYL 576

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPASHAPTNFSRWVNATTPYQIEWAPDFE 641
           +  +                           +   HAPTNF++W  ATTPY++EW  DFE
Sbjct: 577 RYHV---------------------------WTKGHAPTNFAKWRTATTPYRVEWEADFE 609

Query: 642 PYIVAHRDLPRYDTRFVGFGWNK 664
           PY+V  RD P YD RFVGFGWNK
Sbjct: 610 PYVVVRRDCPEYDRRFVGFGWNK 632



 Score =  256 bits (655), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 111/168 (66%), Positives = 137/168 (81%)

Query: 5   EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
           ++PV+  CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A  IL TLF 
Sbjct: 128 QQPVVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 187

Query: 65  TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
           TW VP V V FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ F
Sbjct: 188 TWMVPAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 247

Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 248 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 295


>gi|403283128|ref|XP_003932979.1| PREDICTED: glycosyltransferase-like protein LARGE1 isoform 5
           [Saimiri boliviensis boliviensis]
          Length = 707

 Score =  615 bits (1585), Expect = e-173,   Method: Compositional matrix adjust.
 Identities = 297/503 (59%), Positives = 357/503 (70%), Gaps = 66/503 (13%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ FATDI
Sbjct: 192 PAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 251

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR + 
Sbjct: 252 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 311

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   WR+ AE+ L+  L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 312 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 370

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
            +++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC    ++    L
Sbjct: 371 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEADVNSENL 430

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           Q+ +LS L+E+D CY+ RR + T  RTHLYFL YEYE ++D  DVTLVAQLSMDR     
Sbjct: 431 QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEYEPAADSTDVTLVAQLSMDR----- 484

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                       LQM+E + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 485 ----------------------------LQMLEAICKHWEGPISLALYLSDAEAQQFLRY 516

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
           A  SE L SR N+GYH+VYKEG FYP+N LRNVA+  +STPY+FL DIDFLPM+GLY YL
Sbjct: 517 AQGSEVLMSRHNVGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLYEYL 576

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPASHAPTNFSRWVNATTPYQIEWAPDFE 641
           +  +                           +   HAPTNF++W  ATTPY++EW  DFE
Sbjct: 577 RYHV---------------------------WTKGHAPTNFAKWRTATTPYRVEWEADFE 609

Query: 642 PYIVAHRDLPRYDTRFVGFGWNK 664
           PY+V  RD P YD RFVGFGWNK
Sbjct: 610 PYVVVKRDCPEYDRRFVGFGWNK 632



 Score =  256 bits (655), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 111/168 (66%), Positives = 137/168 (81%)

Query: 5   EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
           ++PV+  CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A  IL TLF 
Sbjct: 128 QQPVVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 187

Query: 65  TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
           TW VP V V FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ F
Sbjct: 188 TWMVPAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 247

Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 248 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 295


>gi|402884052|ref|XP_003905507.1| PREDICTED: glycosyltransferase-like protein LARGE1 isoform 5 [Papio
           anubis]
          Length = 707

 Score =  615 bits (1585), Expect = e-173,   Method: Compositional matrix adjust.
 Identities = 297/503 (59%), Positives = 357/503 (70%), Gaps = 66/503 (13%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ FATDI
Sbjct: 192 PAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 251

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR + 
Sbjct: 252 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 311

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   WR+ AE+ L+  L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 312 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 370

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
            +++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC    ++    L
Sbjct: 371 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEADVNSENL 430

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           Q+ +LS L+E+D CY+ RR + T  RTHLYFL YEYE ++D  DVTLVAQLSMDR     
Sbjct: 431 QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEYEPAADSTDVTLVAQLSMDR----- 484

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                       LQM+E + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 485 ----------------------------LQMLEAICKHWEGPISLALYLSDAEAQQFLRY 516

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
           A  SE L SR N+GYH+VYKEG FYP+N LRNVA+  +STPY+FL DIDFLPM+GLY YL
Sbjct: 517 AQGSEVLMSRHNVGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLYEYL 576

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPASHAPTNFSRWVNATTPYQIEWAPDFE 641
           +  +                           +   HAPTNF++W  ATTPY++EW  DFE
Sbjct: 577 RYHV---------------------------WTKGHAPTNFAKWRTATTPYRVEWEADFE 609

Query: 642 PYIVAHRDLPRYDTRFVGFGWNK 664
           PY+V  RD P YD RFVGFGWNK
Sbjct: 610 PYVVVRRDCPEYDRRFVGFGWNK 632



 Score =  256 bits (655), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 111/168 (66%), Positives = 137/168 (81%)

Query: 5   EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
           ++PV+  CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A  IL TLF 
Sbjct: 128 QQPVVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 187

Query: 65  TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
           TW VP V V FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ F
Sbjct: 188 TWMVPAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 247

Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 248 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 295


>gi|296191744|ref|XP_002743760.1| PREDICTED: glycosyltransferase-like protein LARGE1 isoform 2
           [Callithrix jacchus]
          Length = 707

 Score =  615 bits (1585), Expect = e-173,   Method: Compositional matrix adjust.
 Identities = 297/503 (59%), Positives = 357/503 (70%), Gaps = 66/503 (13%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ FATDI
Sbjct: 192 PAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 251

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR + 
Sbjct: 252 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 311

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   WR+ AE+ L+  L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 312 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 370

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
            +++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC    ++    L
Sbjct: 371 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEADVNSENL 430

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           Q+ +LS L+E+D CY+ RR + T  RTHLYFL YEYE ++D  DVTLVAQLSMDR     
Sbjct: 431 QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEYEPAADSTDVTLVAQLSMDR----- 484

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                       LQM+E + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 485 ----------------------------LQMLEAICKHWEGPISLALYLSDAEAQQFLRY 516

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
           A  SE L SR N+GYH+VYKEG FYP+N LRNVA+  +STPY+FL DIDFLPM+GLY YL
Sbjct: 517 AQGSEVLMSRHNVGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLYEYL 576

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPASHAPTNFSRWVNATTPYQIEWAPDFE 641
           +  +                           +   HAPTNF++W  ATTPY++EW  DFE
Sbjct: 577 RYHV---------------------------WTKGHAPTNFAKWRTATTPYRVEWEADFE 609

Query: 642 PYIVAHRDLPRYDTRFVGFGWNK 664
           PY+V  RD P YD RFVGFGWNK
Sbjct: 610 PYVVVKRDCPEYDRRFVGFGWNK 632



 Score =  256 bits (655), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 111/168 (66%), Positives = 137/168 (81%)

Query: 5   EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
           ++PV+  CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A  IL TLF 
Sbjct: 128 QQPVVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 187

Query: 65  TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
           TW VP V V FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ F
Sbjct: 188 TWMVPAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 247

Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 248 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 295


>gi|221040396|dbj|BAH11875.1| unnamed protein product [Homo sapiens]
          Length = 707

 Score =  612 bits (1577), Expect = e-172,   Method: Compositional matrix adjust.
 Identities = 296/503 (58%), Positives = 356/503 (70%), Gaps = 66/503 (13%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ FATDI
Sbjct: 192 PAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 251

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR + 
Sbjct: 252 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKME 311

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   WR+ AE+ L+  L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 312 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 370

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
            +++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRREL GC    ++    L
Sbjct: 371 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELSGCPSEADVNSENL 430

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           Q+ +LS L+E+D CY+ RR + T  RTHLYFL YEYE ++D  DVTLVAQLSMDR     
Sbjct: 431 QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEYEPAADSTDVTLVAQLSMDR----- 484

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                       LQM+E + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 485 ----------------------------LQMLEAICKHWEGPISLALYLSDAEAQQFLRY 516

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
           A  SE L SR N+GYH+VYKEG FYP+N LRNVA+  +STPY+FL DIDFLPM+GLY YL
Sbjct: 517 AQGSEVLMSRHNVGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLYEYL 576

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPASHAPTNFSRWVNATTPYQIEWAPDFE 641
           +  +                           +   HAPTNF++W  ATTPY++EW  DFE
Sbjct: 577 RYHV---------------------------WTKGHAPTNFAKWRTATTPYRVEWEADFE 609

Query: 642 PYIVAHRDLPRYDTRFVGFGWNK 664
           PY+V  RD P YD RFVGFGWNK
Sbjct: 610 PYVVVRRDCPEYDRRFVGFGWNK 632



 Score =  257 bits (656), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 111/168 (66%), Positives = 137/168 (81%)

Query: 5   EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
           ++PV+  CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A  IL TLF 
Sbjct: 128 QQPVVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 187

Query: 65  TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
           TW VP V V FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ F
Sbjct: 188 TWMVPAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 247

Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 248 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 295


>gi|391344075|ref|XP_003746329.1| PREDICTED: glycosyltransferase-like protein LARGE1-like
           [Metaseiulus occidentalis]
          Length = 713

 Score =  611 bits (1576), Expect = e-172,   Method: Compositional matrix adjust.
 Identities = 309/534 (57%), Positives = 361/534 (67%), Gaps = 68/534 (12%)

Query: 163 KPW--PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVI 220
           K W  P +   FYL D V +DV+WIPNKHYSGVYGL+KL L K LP +LAKTIVLDTDV 
Sbjct: 139 KTWEIPQVEYSFYLTDKVAKDVNWIPNKHYSGVYGLMKLVLTKALPLSLAKTIVLDTDVT 198

Query: 221 FATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTK 280
           FA DIAQLW LF        I LVENQSDWYLGKLWK+HKPWPALGRGFNTGVILL L K
Sbjct: 199 FACDIAQLWKLFDSFGPEHAIALVENQSDWYLGKLWKHHKPWPALGRGFNTGVILLHLGK 258

Query: 281 LRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRS 340
           LR   W   WR +AE+ L +  +TSLADQDIFNA+I+ +P L+  LPCQWNVQL DNT S
Sbjct: 259 LRSKGWDELWRNVAERELSSMYFTSLADQDIFNAVINSYPALLLPLPCQWNVQLGDNTLS 318

Query: 341 DELCYTELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQTT 400
            E+CY+E+ DLK+IHWNSPKKL+VKNKH+++FRNLYLTFLEYDGNLLRREL  CN +   
Sbjct: 319 -EVCYSEVEDLKVIHWNSPKKLRVKNKHVQYFRNLYLTFLEYDGNLLRRELISCNRSSPN 377

Query: 401 ----LQQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDR 456
                 Q +L  L+E+DPCY+ RR  +T  RTHLY++EYEY +S D   VTLV QLSMDR
Sbjct: 378 ESGPSSQDQLQGLDEDDPCYEFRRGNITRYRTHLYYMEYEYPSSPDY--VTLVTQLSMDR 435

Query: 457 LQMVEINPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQ 516
           L                                 QMVE L KHW GPISL LY+SD+E Q
Sbjct: 436 L---------------------------------QMVEALCKHWSGPISLALYLSDSEVQ 462

Query: 517 QFLSYAGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFG 576
           QFLS+  NSE L  RRNIGYH+VY++G  YP+N LRNVAL+Q  TPYVFL DIDFLPMFG
Sbjct: 463 QFLSFTLNSEVLSKRRNIGYHIVYRDGPLYPVNHLRNVALDQADTPYVFLSDIDFLPMFG 522

Query: 577 LYPYLKSSIR--SMDMHGHGGKKVLVVPAFETQRYRTAFP-------------------- 614
           L+  LK+ IR   MD    G  + LVVPAFET RYR  FP                    
Sbjct: 523 LFEALKNYIRFLRMDAPNMGLYRALVVPAFETHRYRVEFPTSKAKLLKMLDDGRVFTFRH 582

Query: 615 ----ASHAPTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
               A HA T+F+RW  ATTPY + W P FEPY+V  +++ R+D+RF+GFGWNK
Sbjct: 583 DVWCAGHAATDFARWKTATTPYDVNWQPHFEPYVVVRKEVVRFDSRFLGFGWNK 636



 Score =  265 bits (677), Expect = 6e-68,   Method: Compositional matrix adjust.
 Identities = 123/161 (76%), Positives = 135/161 (83%)

Query: 12  CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQV 71
           C+VI V+IVCAGYNSTRSLVTLIKS+LFYRKNPLH HL+TD+VA +IL TLF TW +PQV
Sbjct: 87  CQVIHVSIVCAGYNSTRSLVTLIKSVLFYRKNPLHLHLVTDSVAHSILSTLFKTWEIPQV 146

Query: 72  EVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQL 131
           E SFYL D V +DV+WIPNKHYSGVYGL+KL L K LP +LAKTIVLDTDV FA DIAQL
Sbjct: 147 EYSFYLTDKVAKDVNWIPNKHYSGVYGLMKLVLTKALPLSLAKTIVLDTDVTFACDIAQL 206

Query: 132 WALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           W LF        I LVENQSDWYLGKLWK+HKPWPALGRGF
Sbjct: 207 WKLFDSFGPEHAIALVENQSDWYLGKLWKHHKPWPALGRGF 247


>gi|47217228|emb|CAF96751.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 826

 Score =  603 bits (1554), Expect = e-169,   Method: Compositional matrix adjust.
 Identities = 310/548 (56%), Positives = 369/548 (67%), Gaps = 100/548 (18%)

Query: 181 DVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQT 240
           +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ FATDIA+LWA+F + + +Q 
Sbjct: 240 EVSWIPNKHYSGIYGLMKLVLTKTLPPDLHRVIVLDTDITFATDIAELWAVFHKFQGQQV 299

Query: 241 IGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLT 300
           +GLVENQSDWYLG LWKNH+PWPALGRGFNTGVILL L +LR + W   WR+ AE+ L++
Sbjct: 300 LGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILLLLDRLRKLRWEQMWRLTAERELMS 359

Query: 301 RLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKI------- 353
            L TSLADQDIFNA+I ++P+LV+ LPC WNVQLSD+TRS E CY +++DLK+       
Sbjct: 360 MLSTSLADQDIFNAVIKQNPFLVHQLPCFWNVQLSDHTRS-EKCYKDVSDLKVSPKTLHP 418

Query: 354 -----------------------IHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRE 390
                                  IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRE
Sbjct: 419 GSFLQGGPLNISPPSFSGSFAQVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRE 478

Query: 391 LFGC---------NLTQTTLQ-QAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEAS 440
           LFGC         N+     Q Q  LS L+E+DPCY+ RR + T  RTHLYFL YEYE S
Sbjct: 479 LFGCPSETDHNSENVALFCPQLQKTLSELDEDDPCYEFRRERFTVHRTHLYFLHYEYEPS 538

Query: 441 SDGNDVTLVAQLSMDRLQMVEINPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHW 500
                                            +DG DVTLVAQLSMDRLQM+E + KHW
Sbjct: 539 ---------------------------------ADGTDVTLVAQLSMDRLQMLEAICKHW 565

Query: 501 EGPISLTLYMSDAEAQQFLSYAGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVS 560
           EGPISL LY+SDAEAQQFL YA  S+ L SR N+GYH+VYKEG FYP+N LRNVA+ QV+
Sbjct: 566 EGPISLALYLSDAEAQQFLRYAQGSDVLMSRGNVGYHIVYKEGQFYPVNLLRNVAMQQVN 625

Query: 561 TPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS---- 616
           TPY+FL DIDFLPM+GLY YL+ S+  +DM     KK LVVPAFET RYR ++P S    
Sbjct: 626 TPYMFLSDIDFLPMYGLYEYLRKSVVQLDM--ANTKKALVVPAFETLRYRLSYPKSKAEL 683

Query: 617 --------------------HAPTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTR 656
                               HAPTNF++W  ATT Y+++W  DFEPY++  RD P YD R
Sbjct: 684 LSQLDMGTLFTFRYHVWTKGHAPTNFAKWRTATTAYRVQWEADFEPYVMVRRDSPEYDRR 743

Query: 657 FVGFGWNK 664
           FVGFGWNK
Sbjct: 744 FVGFGWNK 751



 Score =  237 bits (605), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 113/207 (54%), Positives = 141/207 (68%), Gaps = 36/207 (17%)

Query: 2   SSD--EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNIL 59
           SSD  ++P +  CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFH ITD++A  IL
Sbjct: 122 SSDCVQQPAVDKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHFITDSIAQQIL 181

Query: 60  QTLFSTWSVPQVEVSFYLADSVV----------------------------------EDV 85
            +LF TW VP V+V+FY AD +                                    +V
Sbjct: 182 SSLFHTWMVPAVKVNFYDADELKVRTNSSRFSQETAVLMWDKNYWCFFCLFVWFFSQSEV 241

Query: 86  SWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIG 145
           SWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ FATDIA+LWA+F + + +Q +G
Sbjct: 242 SWIPNKHYSGIYGLMKLVLTKTLPPDLHRVIVLDTDITFATDIAELWAVFHKFQGQQVLG 301

Query: 146 LVENQSDWYLGKLWKNHKPWPALGRGF 172
           LVENQSDWYLG LWKNH+PWPALGRGF
Sbjct: 302 LVENQSDWYLGNLWKNHRPWPALGRGF 328


>gi|156717856|ref|NP_001096468.1| glycosyltransferase-like 1B precursor [Xenopus (Silurana)
           tropicalis]
 gi|134025835|gb|AAI36205.1| LOC100125087 protein [Xenopus (Silurana) tropicalis]
          Length = 723

 Score =  593 bits (1530), Expect = e-167,   Method: Compositional matrix adjust.
 Identities = 317/538 (58%), Positives = 365/538 (67%), Gaps = 68/538 (12%)

Query: 155 LGKLWKNHKPW--PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKT 212
           LG L+K    W  P+L   FY A  +  DV+WIPNKHYSG+YGLLKLTL K LP  L+K 
Sbjct: 151 LGNLFKT---WMVPSLQISFYNASELKPDVAWIPNKHYSGIYGLLKLTLTKALPSDLSKV 207

Query: 213 IVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTG 272
           IVLDTD+ FATDIA+LWA+F +    Q +GLVENQSDWYLG LWKNHKPWPALGRGFNTG
Sbjct: 208 IVLDTDITFATDIAELWAIFKKFTGEQVLGLVENQSDWYLGNLWKNHKPWPALGRGFNTG 267

Query: 273 VILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNV 332
           VILL L KLR I W   WR+ AE+ L+  L TSLADQDIFNA+I   P LVY LPC WNV
Sbjct: 268 VILLLLDKLRLIGWEEMWRLTAERELMNMLSTSLADQDIFNAVIKSSPTLVYQLPCYWNV 327

Query: 333 QLSDNTRSDELCYTELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELF 392
           QLSD+TRS++ CY+EL DLK+IHWNSP KL+VKNKH+E FR LYLTFLEYDG+LLRREL 
Sbjct: 328 QLSDHTRSEQ-CYSELADLKVIHWNSPHKLRVKNKHVELFRTLYLTFLEYDGSLLRRELI 386

Query: 393 GC--NLTQTTLQQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVA 450
           GC     Q    QA LS L+EEDPCYD RR  L S R HL FL +      D  DVTLVA
Sbjct: 387 GCPSEGEQQGGSQAALSQLDEEDPCYDFRRESLASHRVHLSFLPH-VTPPPDPYDVTLVA 445

Query: 451 QLSMDRLQMVEINPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYM 510
           QLSMDRL                                 QM+E++ +HW+GP+SL LY+
Sbjct: 446 QLSMDRL---------------------------------QMLELICRHWDGPMSLALYL 472

Query: 511 SDAEAQQFLSYAGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDID 570
           SDAEAQQFL YA  SE L+SR N+ YHVVYKEG  YP+N LRNVAL    TPYVFL DID
Sbjct: 473 SDAEAQQFLRYAQASEVLQSRTNVAYHVVYKEGQLYPVNLLRNVALKNSQTPYVFLSDID 532

Query: 571 FLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS-------------- 616
           FLPM+GLY YL+ SI   D+   G  K L+VPAFET RYR +FP S              
Sbjct: 533 FLPMYGLYEYLRKSISQQDL--TGPPKALIVPAFETLRYRLSFPKSKAELLSMLDTGALY 590

Query: 617 ----------HAPTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
                     HAPT++++W  ATTPY++EWAPDFEPY+V  RD P YD RF+GFGWNK
Sbjct: 591 TFRYHVWEKGHAPTDYAKWRTATTPYRVEWAPDFEPYVVVRRDCPEYDQRFLGFGWNK 648



 Score =  236 bits (602), Expect = 3e-59,   Method: Compositional matrix adjust.
 Identities = 113/161 (70%), Positives = 133/161 (82%)

Query: 12  CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQV 71
           CE+I VAIVCAG+NS+R +VTL+KSILF+R+NPLH HLITD VAL +L  LF TW VP +
Sbjct: 104 CELIHVAIVCAGHNSSRDVVTLVKSILFHRRNPLHLHLITDAVALRVLGNLFKTWMVPSL 163

Query: 72  EVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQL 131
           ++SFY A  +  DV+WIPNKHYSG+YGLLKLTL K LP  L+K IVLDTD+ FATDIA+L
Sbjct: 164 QISFYNASELKPDVAWIPNKHYSGIYGLLKLTLTKALPSDLSKVIVLDTDITFATDIAEL 223

Query: 132 WALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           WA+F +    Q +GLVENQSDWYLG LWKNHKPWPALGRGF
Sbjct: 224 WAIFKKFTGEQVLGLVENQSDWYLGNLWKNHKPWPALGRGF 264


>gi|148224937|ref|NP_001083457.1| glycosyltransferase-like protein LARGE2-A precursor [Xenopus
           laevis]
 gi|82186763|sp|Q6PA90.1|LRG2A_XENLA RecName: Full=Glycosyltransferase-like protein LARGE2-A; AltName:
           Full=Glycosyltransferase-like 1B-A
 gi|38014447|gb|AAH60410.1| MGC68636 protein [Xenopus laevis]
          Length = 723

 Score =  593 bits (1529), Expect = e-166,   Method: Compositional matrix adjust.
 Identities = 313/525 (59%), Positives = 359/525 (68%), Gaps = 63/525 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           P+L   FY A  +  DV+WIPNKHYSG+YGLLKLTL K LP  L+K IVLDTD+ FATDI
Sbjct: 161 PSLQISFYNASELKPDVAWIPNKHYSGIYGLLKLTLTKALPSDLSKVIVLDTDITFATDI 220

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LWA+F +    Q +GLVENQSDWYLG LWKNHKPWPALGRGFNTGVILL L KLR I 
Sbjct: 221 AELWAIFKKFTGEQVLGLVENQSDWYLGNLWKNHKPWPALGRGFNTGVILLLLDKLRLIG 280

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   WR+ AE+ L+  L TSLADQDIFNA+I   P LVY LPC WNVQLSD+TRS++ CY
Sbjct: 281 WEEMWRLTAERELMNMLSTSLADQDIFNAVIKSSPTLVYQLPCYWNVQLSDHTRSEQ-CY 339

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC--NLTQTTLQQ 403
           +EL DLK+IHWNSP KL+VKNKH+E FR LYLTFLEYDG+LLRREL GC     Q    Q
Sbjct: 340 SELADLKVIHWNSPHKLRVKNKHVELFRTLYLTFLEYDGSLLRRELIGCPSEGEQQGGSQ 399

Query: 404 AELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEIN 463
           A LS L+EEDPCYD RR  + S R HL FL +      D  DVTLVAQLSMDRL      
Sbjct: 400 ATLSQLDEEDPCYDFRRESVASHRVHLSFLPH-LTPPPDPYDVTLVAQLSMDRL------ 452

Query: 464 PNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAG 523
                                      QM+E++ +HWEGP+SL LY+SDAEAQQFL YA 
Sbjct: 453 ---------------------------QMLELICRHWEGPMSLALYLSDAEAQQFLRYAQ 485

Query: 524 NSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKS 583
            SE L+SR NIGYHVVYKEG  YP+N LRNVAL    TPYVFL DIDFLPM+GLY YL+ 
Sbjct: 486 ASEVLQSRTNIGYHVVYKEGQLYPVNLLRNVALKNSHTPYVFLSDIDFLPMYGLYEYLRK 545

Query: 584 SIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------HAP 619
           SI   D    G  + L+VPAFET RYR +FP S                        HAP
Sbjct: 546 SIAQQD--PAGSPRALIVPAFETLRYRLSFPKSKADLLSMLDTGALYTFRYHVWEKGHAP 603

Query: 620 TNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           T++++W  ATTPY++EWAPDFEPY+V  RD P YD RF+GFGWNK
Sbjct: 604 TDYAKWRTATTPYRVEWAPDFEPYVVVRRDCPEYDQRFLGFGWNK 648



 Score =  236 bits (601), Expect = 3e-59,   Method: Compositional matrix adjust.
 Identities = 113/161 (70%), Positives = 134/161 (83%)

Query: 12  CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQV 71
           CE+I VAIVCAG+NS+R +VTL+KSILF+R+NPLH HLITD VAL +L  LF+TW VP +
Sbjct: 104 CELIHVAIVCAGHNSSRDVVTLVKSILFHRRNPLHLHLITDDVALRVLGNLFNTWMVPSL 163

Query: 72  EVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQL 131
           ++SFY A  +  DV+WIPNKHYSG+YGLLKLTL K LP  L+K IVLDTD+ FATDIA+L
Sbjct: 164 QISFYNASELKPDVAWIPNKHYSGIYGLLKLTLTKALPSDLSKVIVLDTDITFATDIAEL 223

Query: 132 WALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           WA+F +    Q +GLVENQSDWYLG LWKNHKPWPALGRGF
Sbjct: 224 WAIFKKFTGEQVLGLVENQSDWYLGNLWKNHKPWPALGRGF 264


>gi|147900424|ref|NP_001089877.1| glycosyltransferase-like protein LARGE2-B precursor [Xenopus
           laevis]
 gi|90185260|sp|Q32NJ7.1|LRG2B_XENLA RecName: Full=Glycosyltransferase-like protein LARGE2-B; AltName:
           Full=Glycosyltransferase-like 1B-B
 gi|80477206|gb|AAI08591.1| Gyltl1b protein [Xenopus laevis]
          Length = 723

 Score =  592 bits (1526), Expect = e-166,   Method: Compositional matrix adjust.
 Identities = 312/525 (59%), Positives = 360/525 (68%), Gaps = 63/525 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           P+L   FY A  +  DV+WIPNKHYSG++GLLKLTL K LP  L+K IVLDTD+ FATDI
Sbjct: 161 PSLTISFYNASELKPDVAWIPNKHYSGIFGLLKLTLTKALPSYLSKVIVLDTDITFATDI 220

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LWA+F +    Q +GLVENQSDWYLG LWKNHKPWPALGRGFNTGVILL L KLR I 
Sbjct: 221 AELWAIFKKFTGEQVLGLVENQSDWYLGNLWKNHKPWPALGRGFNTGVILLLLDKLRLIG 280

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   WR+ AE+ L+  L TSLADQDIFNA+I   P LVY LPC WNVQLSD+TRS++ CY
Sbjct: 281 WEEMWRLTAERELMNMLSTSLADQDIFNAVIKSSPTLVYQLPCYWNVQLSDHTRSEQ-CY 339

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC--NLTQTTLQQ 403
           +EL DLK+IHWNSP KL+VKNKH+E FR LYLTFLEYDG+LLRREL GC     Q    Q
Sbjct: 340 SELADLKVIHWNSPHKLRVKNKHVELFRTLYLTFLEYDGSLLRRELIGCPSEGEQQGGSQ 399

Query: 404 AELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEIN 463
           A LS L+EEDPCYD RR  L S R HL FL +    + D +DVTLVAQLSMDRL      
Sbjct: 400 AALSQLDEEDPCYDFRRESLASHRVHLSFLPH-LTPTPDPSDVTLVAQLSMDRL------ 452

Query: 464 PNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAG 523
                                      QM+E++ +HWEGP+SL LY+SDAEAQQFL YA 
Sbjct: 453 ---------------------------QMLELICRHWEGPMSLALYLSDAEAQQFLRYAQ 485

Query: 524 NSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKS 583
            SE L+SR NIGYHV+YKEG  YP+N LRNVAL    TPYVFL DIDFLPM+GLY  L+ 
Sbjct: 486 ASEVLQSRTNIGYHVIYKEGQLYPVNLLRNVALKNSHTPYVFLSDIDFLPMYGLYENLRK 545

Query: 584 SIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------HAP 619
           SI   D    G  K L+VPAFET RYR +FP S                        HAP
Sbjct: 546 SIAQQD--PTGSPKALIVPAFETLRYRLSFPKSKADLLSMLDTGALYTFRYHVWEKGHAP 603

Query: 620 TNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           TN+++W  ATTPY++EWAPDFEPY+V  +D P YD RF+GFGWNK
Sbjct: 604 TNYAKWRTATTPYRVEWAPDFEPYVVVRQDCPEYDQRFLGFGWNK 648



 Score =  234 bits (596), Expect = 1e-58,   Method: Compositional matrix adjust.
 Identities = 112/161 (69%), Positives = 134/161 (83%)

Query: 12  CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQV 71
           CE+I VAIVCAG+NS+R +VTL+KSILF+R+NPLH HLITD VAL +L+ LF+TW VP +
Sbjct: 104 CELIHVAIVCAGHNSSRDVVTLVKSILFHRRNPLHLHLITDDVALRVLRNLFNTWMVPSL 163

Query: 72  EVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQL 131
            +SFY A  +  DV+WIPNKHYSG++GLLKLTL K LP  L+K IVLDTD+ FATDIA+L
Sbjct: 164 TISFYNASELKPDVAWIPNKHYSGIFGLLKLTLTKALPSYLSKVIVLDTDITFATDIAEL 223

Query: 132 WALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           WA+F +    Q +GLVENQSDWYLG LWKNHKPWPALGRGF
Sbjct: 224 WAIFKKFTGEQVLGLVENQSDWYLGNLWKNHKPWPALGRGF 264


>gi|334331837|ref|XP_003341528.1| PREDICTED: LOW QUALITY PROTEIN: glycosyltransferase-like protein
           LARGE2-like [Monodelphis domestica]
          Length = 751

 Score =  585 bits (1507), Expect = e-164,   Method: Compositional matrix adjust.
 Identities = 297/527 (56%), Positives = 360/527 (68%), Gaps = 65/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           P++   FY AD +  +V+WIPNKHYSG+YGLLKL L + LP TL + IVLDTD+ FA+DI
Sbjct: 155 PSVLVSFYNADELKPEVAWIPNKHYSGIYGLLKLALLRALPPTLPQVIVLDTDITFASDI 214

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A LWA+F+R   +Q IGLVENQSDWYLG LWKNH+PWPALGRGFNTGVILL L +LR + 
Sbjct: 215 ADLWAVFARFSDKQVIGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILLQLERLRQVG 274

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   WR+ AE+ LLT L TSLADQDIFNA+I +HP+LV+ LPC WNVQLSD+TR+ E CY
Sbjct: 275 WEQMWRLTAERELLTMLATSLADQDIFNAVIKQHPWLVHPLPCSWNVQLSDHTRA-ERCY 333

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQTTLQQAE 405
           +E +DLK+IHWNSPKKL VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC  ++   +  +
Sbjct: 334 SEASDLKVIHWNSPKKLGVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCGGSEPQPEPGQ 393

Query: 406 ----LSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
                  L +++ CY+ R+ +LT  R HL FL +E  A  D +DVTLVAQLSMDRL    
Sbjct: 394 TRPATGELEDDEACYEFRQQRLTVHRVHLTFLPFEPPA-PDPHDVTLVAQLSMDRL---- 448

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                        QM+E L +HW GP+SL LY+SDAEAQQFL +
Sbjct: 449 -----------------------------QMLEALCRHWPGPMSLALYLSDAEAQQFLRF 479

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
           A  SE L +RR++ YH+VY+EG  YP+N LR VAL Q  TPYVFL DIDFLP F LY YL
Sbjct: 480 AQASEVLAARRDVAYHIVYREGPLYPVNLLRXVALGQARTPYVFLSDIDFLPAFSLYDYL 539

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           ++SI+ + +     K  LVVPAFET RYR  FP S                        H
Sbjct: 540 RTSIKELGLATR--KAALVVPAFETLRYRLNFPTSKKELLALLDAGSLYTFRYHVWPRGH 597

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
            PT+++RW  A  PY++EWA D+EPY+V  RD PRYD RFVGFGWNK
Sbjct: 598 EPTDYARWREAQAPYRVEWAADYEPYVVVARDCPRYDPRFVGFGWNK 644



 Score =  204 bits (519), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 104/175 (59%), Positives = 125/175 (71%), Gaps = 4/175 (2%)

Query: 2   SSDEEPVIPTCEV----IQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALN 57
           S  + P +P C+V    +Q+A    G           +     RKNPLHFHL+TDTVA +
Sbjct: 84  SQPQPPAVPKCQVSRGGLQMAEREGGATCCSQPFRTGRCGPRERKNPLHFHLVTDTVARS 143

Query: 58  ILQTLFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIV 117
           IL+TLFSTW VP V VSFY AD +  +V+WIPNKHYSG+YGLLKL L + LP TL + IV
Sbjct: 144 ILETLFSTWMVPSVLVSFYNADELKPEVAWIPNKHYSGIYGLLKLALLRALPPTLPQVIV 203

Query: 118 LDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           LDTD+ FA+DIA LWA+F+R   +Q IGLVENQSDWYLG LWKNH+PWPALGRGF
Sbjct: 204 LDTDITFASDIADLWAVFARFSDKQVIGLVENQSDWYLGNLWKNHRPWPALGRGF 258


>gi|241051730|ref|XP_002407451.1| conserved hypothetical protein [Ixodes scapularis]
 gi|215492218|gb|EEC01859.1| conserved hypothetical protein [Ixodes scapularis]
          Length = 636

 Score =  583 bits (1502), Expect = e-163,   Method: Compositional matrix adjust.
 Identities = 299/539 (55%), Positives = 371/539 (68%), Gaps = 81/539 (15%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FYLAD++  +VSWIPNKHYSGVYGL+KL LP+ LP+ L K IVLDTD+ FA+DI
Sbjct: 66  PAVEVSFYLADNLQGEVSWIPNKHYSGVYGLMKLVLPRALPDDLDKVIVLDTDITFASDI 125

Query: 226 AQLWALFSRLRQRQ---TIGLVENQSDWYLGKLWKNHKPWP-----ALGRGFNTGVILLD 277
           A+LW +F R   +Q   +  ++ + +      L ++  P P     + GRGFNTGVILL 
Sbjct: 126 AELWRIFHRFTAKQVPPSFQILPHNT-----FLTRDQIP-PIGRRLSQGRGFNTGVILLQ 179

Query: 278 LTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDN 337
           L KLR+ +W   WR+IAEK L++  +TSLADQDIFNA++ +H  L+Y LPCQWNVQL DN
Sbjct: 180 LQKLRERNWMQMWRLIAEKELISMWFTSLADQDIFNAMLKQHTDLLYRLPCQWNVQLGDN 239

Query: 338 TRSD-ELCYTELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNL 396
           T S+  LCY+E+ +LKIIHWNSPKKLKVKNKH+EFFRNLYLTFLEYDGNLLRREL GCN 
Sbjct: 240 TLSEASLCYSEVQELKIIHWNSPKKLKVKNKHVEFFRNLYLTFLEYDGNLLRRELIGCNG 299

Query: 397 TQTTLQ-------QAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLV 449
           + ++         Q  L+ L+E+DPCYD RR+++   RTHLY++EY+YE S +GNDVTLV
Sbjct: 300 SASSSAGAHASHLQDALNLLDEDDPCYDFRRARVAQHRTHLYYIEYDYEPSPEGNDVTLV 359

Query: 450 AQLSMDRLQMVEINPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLY 509
           AQLSMDRL                                 QMVE L KHWEGPISL LY
Sbjct: 360 AQLSMDRL---------------------------------QMVEALCKHWEGPISLALY 386

Query: 510 MSDAEAQQFLSYAGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDI 569
           MSD+E QQFLSY  +SE L+SR+N+GYH+VYK+G FYP+N LRNVAL QV+TP+VFL DI
Sbjct: 387 MSDSEVQQFLSYTLSSEILQSRKNVGYHIVYKDGTFYPVNLLRNVALQQVNTPFVFLTDI 446

Query: 570 DFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------- 616
           DFLPM GLY YLK S+ ++ +     +K L+VPAFETQRYR +FP S             
Sbjct: 447 DFLPMHGLYEYLKRSVFALGL--ESARKALIVPAFETQRYRLSFPKSKAELLSMLDMGTL 504

Query: 617 -----------HAPTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
                      HAPTNF++W  ATTPY++ W  +FEPY+V  RD+P YD RFVGFGWNK
Sbjct: 505 FTFRYHVWQKGHAPTNFAKWRTATTPYRVAWEANFEPYVVVKRDVPEYDRRFVGFGWNK 563



 Score =  209 bits (531), Expect = 6e-51,   Method: Compositional matrix adjust.
 Identities = 94/140 (67%), Positives = 116/140 (82%)

Query: 4   DEEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLF 63
           D  P +P CEV+QVAIVCAGYN++RS+VTL+KS+LFYRKNPLHFH ++D VA  +LQTLF
Sbjct: 1   DALPEVPKCEVLQVAIVCAGYNASRSVVTLVKSVLFYRKNPLHFHFVSDAVAHLVLQTLF 60

Query: 64  STWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVI 123
            TW+VP VEVSFYLAD++  +VSWIPNKHYSGVYGL+KL LP+ LP+ L K IVLDTD+ 
Sbjct: 61  RTWNVPAVEVSFYLADNLQGEVSWIPNKHYSGVYGLMKLVLPRALPDDLDKVIVLDTDIT 120

Query: 124 FATDIAQLWALFSRLRQRQT 143
           FA+DIA+LW +F R   +Q 
Sbjct: 121 FASDIAELWRIFHRFTAKQV 140


>gi|321476466|gb|EFX87427.1| hypothetical protein DAPPUDRAFT_192525 [Daphnia pulex]
          Length = 705

 Score =  582 bits (1499), Expect = e-163,   Method: Compositional matrix adjust.
 Identities = 288/524 (54%), Positives = 354/524 (67%), Gaps = 64/524 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           P +   FY AD +  DV+WIPNKHYSGVYGLLKLTLPKVLP+++ K IV D DV F+ DI
Sbjct: 154 PHMNFSFYSADELTSDVAWIPNKHYSGVYGLLKLTLPKVLPQSVNKVIVFDCDVTFSADI 213

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A LWA+ S     Q IGLVENQSDWYLGKLWKN++PWPALGRGFNTGVI +DL KLR ++
Sbjct: 214 ADLWAVISSFSSNQAIGLVENQSDWYLGKLWKNYEPWPALGRGFNTGVIAMDLKKLRSLN 273

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   W + AEK L++  +TSLADQDIFNA++ + P+LVY LPCQ+NVQLSDNTRS E CY
Sbjct: 274 WNQLWLLTAEKDLVSHYFTSLADQDIFNAVLFQEPHLVYRLPCQYNVQLSDNTRS-ESCY 332

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQTTLQQAE 405
           +E+ DLKIIHWNSPKKLKVKNKH +FFRNLY TF+ +DGN LRR+LF CN   +   Q+ 
Sbjct: 333 SEVADLKIIHWNSPKKLKVKNKHSDFFRNLYYTFVGFDGNFLRRQLFHCN--DSGFNQSS 390

Query: 406 LSNLN-EEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEINP 464
           + +L  E+D CY++R+++    RTHL++L Y+                            
Sbjct: 391 VDDLPVEDDACYEIRKARDAKYRTHLFYLSYD---------------------------- 422

Query: 465 NNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGN 524
                     D ND+T VAQLS+DRLQM++ L + WEGPISL LY+SD EA QF S+   
Sbjct: 423 ------AGKIDDNDITWVAQLSLDRLQMIDPLCRLWEGPISLALYLSDTEADQFHSFVTE 476

Query: 525 SEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSS 584
           S  L SR NIGYHVVYK+G+ YP+N LRNVAL Q STP+VFL DIDFLPM  LY  LK +
Sbjct: 477 SHYLNSRTNIGYHVVYKQGSLYPVNLLRNVALEQASTPFVFLCDIDFLPMPNLYTTLKKA 536

Query: 585 IRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------HAPT 620
           ++S+ +      K LVVPAFE+QRY+T  P +                        HA T
Sbjct: 537 VQSLKVATEN--KALVVPAFESQRYQTKVPRTKAEVIAALDMGDLLTFRYHDWALGHAAT 594

Query: 621 NFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           NF  W  AT PY+I+W P+FEPY+V  RD+ RYD RF+GFGWNK
Sbjct: 595 NFPMWRTATIPYKIKWEPNFEPYVVVRRDVTRYDPRFLGFGWNK 638



 Score =  252 bits (644), Expect = 4e-64,   Method: Compositional matrix adjust.
 Identities = 111/171 (64%), Positives = 135/171 (78%)

Query: 2   SSDEEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQT 61
           + D   ++  CE+I VAIVCAGY+++R++ TLIKS+LFYRKNP+H H I D +A NIL+T
Sbjct: 87  AKDASILVKRCEIIHVAIVCAGYDASRAVSTLIKSLLFYRKNPIHLHFIADAIAENILKT 146

Query: 62  LFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTD 121
           LF TW +P +  SFY AD +  DV+WIPNKHYSGVYGLLKLTLPKVLP+++ K IV D D
Sbjct: 147 LFQTWDIPHMNFSFYSADELTSDVAWIPNKHYSGVYGLLKLTLPKVLPQSVNKVIVFDCD 206

Query: 122 VIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           V F+ DIA LWA+ S     Q IGLVENQSDWYLGKLWKN++PWPALGRGF
Sbjct: 207 VTFSADIADLWAVISSFSSNQAIGLVENQSDWYLGKLWKNYEPWPALGRGF 257


>gi|410973795|ref|XP_003993333.1| PREDICTED: glycosyltransferase-like protein LARGE2 [Felis catus]
          Length = 702

 Score =  581 bits (1497), Expect = e-163,   Method: Compositional matrix adjust.
 Identities = 293/526 (55%), Positives = 354/526 (67%), Gaps = 63/526 (11%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+G  FY  + +   VSWIPNKHYSG+YGL+KL LP  LP  LA+ IVLDTDV FA+DI
Sbjct: 134 PAVGVSFYDLEELKPQVSWIPNKHYSGLYGLMKLVLPGALPPDLARVIVLDTDVTFASDI 193

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LWALF+    +Q IGLVENQSDWYLG LW+NH+PWPALGRGFNTGVILL L +LR   
Sbjct: 194 AELWALFAHFSDKQVIGLVENQSDWYLGNLWRNHRPWPALGRGFNTGVILLRLDRLRQAG 253

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   W++ A + LLT   TSLADQDIFNA+I EHP LV  LPC WNVQLSD+T + E CY
Sbjct: 254 WGQMWKLTATRELLTLPATSLADQDIFNAVIKEHPGLVQPLPCVWNVQLSDHTLA-ERCY 312

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQ---TTLQ 402
           +E +DLK+IHWNSPKKL+VKNKH+E+FRNLYLTFLEYDGNLLRRELFGC       T   
Sbjct: 313 SEASDLKVIHWNSPKKLRVKNKHVEYFRNLYLTFLEYDGNLLRRELFGCPSPPPPGTEQL 372

Query: 403 QAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEI 462
           Q  L+ L+EED C++ R+ +LT  R H+ FL +E       +DVTLVAQLSMDRL     
Sbjct: 373 QRALAQLDEEDACFEFRQQQLTVHRVHISFLPHE-PLPPRPHDVTLVAQLSMDRL----- 426

Query: 463 NPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYA 522
                                       QM+E L +HW GP+SL LY++DAEAQQFL + 
Sbjct: 427 ----------------------------QMLEALCRHWPGPMSLALYLTDAEAQQFLRFV 458

Query: 523 GNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLK 582
             S  L +R+++ YHVVY+EG  YP+N LRNVAL Q  TPYVFL DIDFLP + LY YL+
Sbjct: 459 EASAVLSARQDVAYHVVYREGPLYPVNELRNVALAQALTPYVFLSDIDFLPAYSLYDYLR 518

Query: 583 SSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------HA 618
           +SI  + +     K  LVVPAFET  YR +FP+S                        HA
Sbjct: 519 ASIEQLKLDSE-RKVALVVPAFETLHYRFSFPSSKAELLALLDSGSLYTFRYHEWPRGHA 577

Query: 619 PTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           PT+++RW  A TPY+++WA D+EPY+V  RD PRYD RFVGFGWNK
Sbjct: 578 PTDYARWREAQTPYRVQWAADYEPYVVVPRDCPRYDPRFVGFGWNK 623



 Score =  246 bits (629), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 112/163 (68%), Positives = 132/163 (80%)

Query: 10  PTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVP 69
           P  +++ VAIVCAG+NS+R L TL+KS+LFYRKNPLHFHL+TD VA NIL+ LF TW VP
Sbjct: 75  PPPQLLHVAIVCAGHNSSRDLNTLVKSVLFYRKNPLHFHLVTDAVARNILEMLFHTWMVP 134

Query: 70  QVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIA 129
            V VSFY  + +   VSWIPNKHYSG+YGL+KL LP  LP  LA+ IVLDTDV FA+DIA
Sbjct: 135 AVGVSFYDLEELKPQVSWIPNKHYSGLYGLMKLVLPGALPPDLARVIVLDTDVTFASDIA 194

Query: 130 QLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           +LWALF+    +Q IGLVENQSDWYLG LW+NH+PWPALGRGF
Sbjct: 195 ELWALFAHFSDKQVIGLVENQSDWYLGNLWRNHRPWPALGRGF 237


>gi|390470460|ref|XP_002755202.2| PREDICTED: glycosyltransferase-like protein LARGE2 [Callithrix
           jacchus]
          Length = 689

 Score =  580 bits (1496), Expect = e-163,   Method: Compositional matrix adjust.
 Identities = 296/527 (56%), Positives = 352/527 (66%), Gaps = 65/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY AD +   VSWIPNKHYSG+YGL+KL LP  LP  LA+ IVLDTD+ FA+DI
Sbjct: 118 PAVRVSFYNADELKPQVSWIPNKHYSGLYGLMKLVLPSTLPAELARVIVLDTDITFASDI 177

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LWALF+     Q IGLVENQSDWYLG LWKNH+PWPALGRGFNTGVILL L +LR   
Sbjct: 178 AELWALFAHFSDMQAIGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILLRLDRLRHAG 237

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   WR+ A + LLT   TSLADQDIFNA+I EHP LV  LPC WNVQLSD+T + E CY
Sbjct: 238 WEQMWRLTARQELLTLPATSLADQDIFNAVIKEHPGLVQPLPCVWNVQLSDHTLA-ERCY 296

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQTT----L 401
           +E +DLK+IHWNSPKKL+VKNKH+EFFRN YLTFL YDGNLLRR LFGC     +    L
Sbjct: 297 SEASDLKVIHWNSPKKLRVKNKHVEFFRNFYLTFLAYDGNLLRRGLFGCPSQPPSGAEQL 356

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           QQA L+ L+EEDPC++ R+ +LT  R H+ FL +E    S  +DVTLVAQLSMDRL    
Sbjct: 357 QQA-LAQLDEEDPCFEFRQQQLTVHRVHITFLPHE-PPPSRPHDVTLVAQLSMDRL---- 410

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                        QM+E L +HW GP+SL LY++DAEAQQFL +
Sbjct: 411 -----------------------------QMLEALCRHWPGPMSLALYLTDAEAQQFLRF 441

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
              S  L +R+++ YHVVY+EG  YP+N LRNVAL Q  TPYVFL DIDFLP + LY YL
Sbjct: 442 VEASPVLAARQDVAYHVVYREGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYDYL 501

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           ++SI  + + G   K  LVVPAFET  YR +FP S                        H
Sbjct: 502 RASIEQLGL-GSRRKAALVVPAFETLHYRFSFPRSKVELLALLDAGALYTFRYHEWPRGH 560

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           APT+++RW  A  PY ++W  D+EPY+V  RD PRYD RFVGFGWNK
Sbjct: 561 APTDYARWREAQAPYSVQWVADYEPYVVVPRDCPRYDPRFVGFGWNK 607



 Score =  254 bits (649), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 114/163 (69%), Positives = 133/163 (81%)

Query: 10  PTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVP 69
           P CE++ VAIVCAG+NS+R ++TL+KS+LFYRKNPLH HL+TD VA NIL+ LF TW VP
Sbjct: 59  PKCELLHVAIVCAGHNSSRDVITLVKSMLFYRKNPLHLHLVTDAVARNILEMLFHTWMVP 118

Query: 70  QVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIA 129
            V VSFY AD +   VSWIPNKHYSG+YGL+KL LP  LP  LA+ IVLDTD+ FA+DIA
Sbjct: 119 AVRVSFYNADELKPQVSWIPNKHYSGLYGLMKLVLPSTLPAELARVIVLDTDITFASDIA 178

Query: 130 QLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           +LWALF+     Q IGLVENQSDWYLG LWKNH+PWPALGRGF
Sbjct: 179 ELWALFAHFSDMQAIGLVENQSDWYLGNLWKNHRPWPALGRGF 221


>gi|291384909|ref|XP_002709120.1| PREDICTED: glycosyltransferase-like 1B [Oryctolagus cuniculus]
          Length = 721

 Score =  580 bits (1495), Expect = e-163,   Method: Compositional matrix adjust.
 Identities = 296/527 (56%), Positives = 357/527 (67%), Gaps = 65/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PAL   FY A+++   VSWIPNKHYSG+YGL+KL LP VLP  LA+ IVLDTDV FA+DI
Sbjct: 153 PALRVSFYDAEALKPQVSWIPNKHYSGLYGLMKLVLPAVLPPDLARVIVLDTDVTFASDI 212

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LWALF+    +Q IGLVENQSDWYLG LWKNH+PWPALGRGFNTGVILL L +L+   
Sbjct: 213 AELWALFAHFSDKQLIGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILLWLGRLQQAG 272

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   W++ A + LLT   TSLADQDIFNA+I EHP LV  LPC WNVQLSD+T + E CY
Sbjct: 273 WEQMWKLTARRELLTLPATSLADQDIFNAVIKEHPGLVQPLPCVWNVQLSDHTLA-ERCY 331

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
           +E +DLK+IHWNSPKKL+VKNKH EFFRN +LTFLEYDGNLLRRELFGC          L
Sbjct: 332 SEASDLKVIHWNSPKKLRVKNKHAEFFRNFHLTFLEYDGNLLRRELFGCPSQPPAGAEQL 391

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           QQA L+ L+EED C++ R+ +LT  R HL FL ++       +DVTLVAQLSMDRLQM+E
Sbjct: 392 QQA-LAQLDEEDACFEFRQQRLTVHRVHLTFLPHKPPPPRP-HDVTLVAQLSMDRLQMLE 449

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
           +                                 L +HW GP+SL LY++DAEAQQFL +
Sbjct: 450 V---------------------------------LCRHWPGPMSLALYLTDAEAQQFLRF 476

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
              S  L +R+++ YHVVY+EG  YP+N LRNVAL Q  TPYVFL DIDFLP + LY YL
Sbjct: 477 VEASPVLSARQDVAYHVVYREGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYDYL 536

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           ++SI  +++ G   K  LVVPAFET  YR +FP S                        H
Sbjct: 537 RASIEQLELGGR-RKAALVVPAFETLHYRFSFPTSKAELLALLDAGSLHTFRYHEWPQGH 595

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           APT+++RW  A  PY+++WA D+EPY+V  RD PRYD RFVGFGWNK
Sbjct: 596 APTDYARWREARVPYRVQWAADYEPYVVVPRDCPRYDPRFVGFGWNK 642



 Score =  260 bits (665), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 128/219 (58%), Positives = 159/219 (72%), Gaps = 5/219 (2%)

Query: 5   EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
           + P+ P CE++ VAIVCAG+NS+R +VTL+KS+LFYRKNPLH HL+TD VA NIL+ LF 
Sbjct: 89  QPPLPPKCELLHVAIVCAGHNSSREVVTLVKSMLFYRKNPLHLHLVTDAVARNILEMLFH 148

Query: 65  TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
           TW VP + VSFY A+++   VSWIPNKHYSG+YGL+KL LP VLP  LA+ IVLDTDV F
Sbjct: 149 TWLVPALRVSFYDAEALKPQVSWIPNKHYSGLYGLMKLVLPAVLPPDLARVIVLDTDVTF 208

Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFYLADSVVEDVSW 184
           A+DIA+LWALF+    +Q IGLVENQSDWYLG LWKNH+PWPALGRGF     ++    W
Sbjct: 209 ASDIAELWALFAHFSDKQLIGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILL----W 264

Query: 185 IPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 223
           +     +G   + KLT  + L  TL  T + D D+  A 
Sbjct: 265 LGRLQQAGWEQMWKLTARREL-LTLPATSLADQDIFNAV 302


>gi|109106604|ref|XP_001112766.1| PREDICTED: glycosyltransferase-like protein LARGE2-like isoform 1
           [Macaca mulatta]
          Length = 690

 Score =  580 bits (1494), Expect = e-162,   Method: Compositional matrix adjust.
 Identities = 299/527 (56%), Positives = 352/527 (66%), Gaps = 65/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY AD +   VSWIPNKHYSG+YGL+KL LP  LP  LA  IVLDTDV FA+DI
Sbjct: 120 PAVRVSFYNADELKSQVSWIPNKHYSGLYGLMKLVLPSALPTELAHVIVLDTDVTFASDI 179

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           ++LWALF+     Q IGLVENQSDWYLG LWKNH+PWPALGRGFNTGVILL L +LR   
Sbjct: 180 SELWALFAHFSDTQAIGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILLRLDRLRQAG 239

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   WR+ A + LLT   TSLADQDIFNA+I EHP LV  LPC WNVQLSD+T + E CY
Sbjct: 240 WEQMWRLTARRELLTLPATSLADQDIFNAVIKEHPGLVQPLPCVWNVQLSDHTLA-ERCY 298

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLT----QTTL 401
           +  +DLK+IHWNSPKKL+VKNKH+EFFRN YLTFLEYDGNLLRRELFGC          L
Sbjct: 299 SAASDLKVIHWNSPKKLQVKNKHVEFFRNFYLTFLEYDGNLLRRELFGCPSQPPPGAEQL 358

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           QQA LS LNEEDPC++ R+ +LT  R H+ FL +E       +DVTLVAQLSMDR     
Sbjct: 359 QQA-LSQLNEEDPCFEFRQQQLTVHRVHVTFLPHEPPPPR-PHDVTLVAQLSMDR----- 411

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                       LQM+E L +HW GP+SL LY++DAEAQQFL +
Sbjct: 412 ----------------------------LQMLEALCRHWPGPMSLALYLTDAEAQQFLRF 443

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
              S  L +R+++ YHVVY+EG  YP+N LRNVAL Q  TPYVFL DIDFLP + LY YL
Sbjct: 444 IEASPVLAARQDVAYHVVYREGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYDYL 503

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           ++SI  + + G   K  LVVPAFET  YR +FP S                        H
Sbjct: 504 RASIEQLGL-GSQRKAALVVPAFETLHYRFSFPHSKVELLALLDVGALYTFRYHEWPRGH 562

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           APT+++RW  A  PY+++WA D+EPY+V  RD PRYD RFVGFGWNK
Sbjct: 563 APTDYARWREAQDPYRVQWAADYEPYVVVPRDCPRYDPRFVGFGWNK 609



 Score =  251 bits (641), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 117/173 (67%), Positives = 135/173 (78%), Gaps = 3/173 (1%)

Query: 3   SDEEPVIP---TCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNIL 59
           SD  P  P    CE++ VAIVCAG+NS+R ++TL+KS+LFYRKNPLH HL+TD VA NIL
Sbjct: 51  SDCGPPPPRPSKCELLHVAIVCAGHNSSRDVITLVKSMLFYRKNPLHLHLVTDAVARNIL 110

Query: 60  QTLFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLD 119
           + LF TW VP V VSFY AD +   VSWIPNKHYSG+YGL+KL LP  LP  LA  IVLD
Sbjct: 111 EMLFHTWMVPAVRVSFYNADELKSQVSWIPNKHYSGLYGLMKLVLPSALPTELAHVIVLD 170

Query: 120 TDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           TDV FA+DI++LWALF+     Q IGLVENQSDWYLG LWKNH+PWPALGRGF
Sbjct: 171 TDVTFASDISELWALFAHFSDTQAIGLVENQSDWYLGNLWKNHRPWPALGRGF 223


>gi|109106606|ref|XP_001112795.1| PREDICTED: glycosyltransferase-like protein LARGE2-like isoform 2
           [Macaca mulatta]
 gi|109106608|ref|XP_001112825.1| PREDICTED: glycosyltransferase-like protein LARGE2-like isoform 3
           [Macaca mulatta]
          Length = 721

 Score =  580 bits (1494), Expect = e-162,   Method: Compositional matrix adjust.
 Identities = 299/527 (56%), Positives = 352/527 (66%), Gaps = 65/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY AD +   VSWIPNKHYSG+YGL+KL LP  LP  LA  IVLDTDV FA+DI
Sbjct: 151 PAVRVSFYNADELKSQVSWIPNKHYSGLYGLMKLVLPSALPTELAHVIVLDTDVTFASDI 210

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           ++LWALF+     Q IGLVENQSDWYLG LWKNH+PWPALGRGFNTGVILL L +LR   
Sbjct: 211 SELWALFAHFSDTQAIGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILLRLDRLRQAG 270

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   WR+ A + LLT   TSLADQDIFNA+I EHP LV  LPC WNVQLSD+T + E CY
Sbjct: 271 WEQMWRLTARRELLTLPATSLADQDIFNAVIKEHPGLVQPLPCVWNVQLSDHTLA-ERCY 329

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLT----QTTL 401
           +  +DLK+IHWNSPKKL+VKNKH+EFFRN YLTFLEYDGNLLRRELFGC          L
Sbjct: 330 SAASDLKVIHWNSPKKLQVKNKHVEFFRNFYLTFLEYDGNLLRRELFGCPSQPPPGAEQL 389

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           QQA LS LNEEDPC++ R+ +LT  R H+ FL +E       +DVTLVAQLSMDRL    
Sbjct: 390 QQA-LSQLNEEDPCFEFRQQQLTVHRVHVTFLPHEPPPPRP-HDVTLVAQLSMDRL---- 443

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                        QM+E L +HW GP+SL LY++DAEAQQFL +
Sbjct: 444 -----------------------------QMLEALCRHWPGPMSLALYLTDAEAQQFLRF 474

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
              S  L +R+++ YHVVY+EG  YP+N LRNVAL Q  TPYVFL DIDFLP + LY YL
Sbjct: 475 IEASPVLAARQDVAYHVVYREGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYDYL 534

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           ++SI  + + G   K  LVVPAFET  YR +FP S                        H
Sbjct: 535 RASIEQLGL-GSQRKAALVVPAFETLHYRFSFPHSKVELLALLDVGALYTFRYHEWPRGH 593

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           APT+++RW  A  PY+++WA D+EPY+V  RD PRYD RFVGFGWNK
Sbjct: 594 APTDYARWREAQDPYRVQWAADYEPYVVVPRDCPRYDPRFVGFGWNK 640



 Score =  251 bits (641), Expect = 9e-64,   Method: Compositional matrix adjust.
 Identities = 117/173 (67%), Positives = 135/173 (78%), Gaps = 3/173 (1%)

Query: 3   SDEEPVIP---TCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNIL 59
           SD  P  P    CE++ VAIVCAG+NS+R ++TL+KS+LFYRKNPLH HL+TD VA NIL
Sbjct: 82  SDCGPPPPRPSKCELLHVAIVCAGHNSSRDVITLVKSMLFYRKNPLHLHLVTDAVARNIL 141

Query: 60  QTLFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLD 119
           + LF TW VP V VSFY AD +   VSWIPNKHYSG+YGL+KL LP  LP  LA  IVLD
Sbjct: 142 EMLFHTWMVPAVRVSFYNADELKSQVSWIPNKHYSGLYGLMKLVLPSALPTELAHVIVLD 201

Query: 120 TDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           TDV FA+DI++LWALF+     Q IGLVENQSDWYLG LWKNH+PWPALGRGF
Sbjct: 202 TDVTFASDISELWALFAHFSDTQAIGLVENQSDWYLGNLWKNHRPWPALGRGF 254


>gi|403255509|ref|XP_003920468.1| PREDICTED: glycosyltransferase-like protein LARGE2 [Saimiri
           boliviensis boliviensis]
          Length = 694

 Score =  579 bits (1493), Expect = e-162,   Method: Compositional matrix adjust.
 Identities = 294/527 (55%), Positives = 354/527 (67%), Gaps = 65/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY AD +   VSWIPNKHYSG+YGL+KL LP  LP  LA+ IVLDTD+ FA+DI
Sbjct: 123 PAVRVSFYDADELKPQVSWIPNKHYSGLYGLMKLVLPGTLPAELARVIVLDTDITFASDI 182

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LWALF++    Q IGLVENQSDWYLG LWKNH+PWPALGRGFNTGVILL L +LR   
Sbjct: 183 AELWALFAQFSDTQAIGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILLRLDRLRQAG 242

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   W++ A + LLT   TSLADQDIFNA+I EHP LV  LPC WNVQLSD+T + E CY
Sbjct: 243 WEQMWKLTARRELLTLPATSLADQDIFNAVIKEHPGLVQPLPCVWNVQLSDHTLA-ERCY 301

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQTT----L 401
           +E +DLK+IHWNSPKKL+VKNKH+EFFRN+YLTFLEYDGNLLRR LFGC     +    L
Sbjct: 302 SEASDLKVIHWNSPKKLRVKNKHVEFFRNVYLTFLEYDGNLLRRGLFGCPSQPPSGAEQL 361

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           QQA L+ L+EEDPC++ ++ +LT  R H+ FL +E    S  +DVTLVAQLSMDRL    
Sbjct: 362 QQA-LTQLDEEDPCFEFQQQQLTVHRVHVTFLPHE-PPPSRPHDVTLVAQLSMDRL---- 415

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                        QM+E L +HW GP+SL LY++DAEAQQFL +
Sbjct: 416 -----------------------------QMLEALCRHWPGPMSLALYLTDAEAQQFLRF 446

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
              S  L +R+++ YHVVY+EG  YP+N LRNVAL Q  TPYVFL DIDFLP + LY YL
Sbjct: 447 VEASPVLSARQDVAYHVVYREGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYDYL 506

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           ++SI  + + G   K  LVVPAFET  Y  +FP S                        H
Sbjct: 507 RASIEQLGL-GSRRKAALVVPAFETLHYHFSFPRSKVELLALLDAGAVYTFRYHEWPRGH 565

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           APT+++RW  A  PY ++W  D+EPY+V  RD PRYD RFVGFGWNK
Sbjct: 566 APTDYARWREAQAPYGVQWVADYEPYVVVPRDCPRYDPRFVGFGWNK 612



 Score =  253 bits (647), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 113/161 (70%), Positives = 134/161 (83%)

Query: 12  CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQV 71
           CE++ VAIVCAG+NS+R ++TL+KS+LFYR+NPLH HL+TD VA NIL+TLF TW VP V
Sbjct: 66  CELLHVAIVCAGHNSSRDVITLVKSMLFYRRNPLHLHLVTDAVARNILETLFHTWMVPAV 125

Query: 72  EVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQL 131
            VSFY AD +   VSWIPNKHYSG+YGL+KL LP  LP  LA+ IVLDTD+ FA+DIA+L
Sbjct: 126 RVSFYDADELKPQVSWIPNKHYSGLYGLMKLVLPGTLPAELARVIVLDTDITFASDIAEL 185

Query: 132 WALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           WALF++    Q IGLVENQSDWYLG LWKNH+PWPALGRGF
Sbjct: 186 WALFAQFSDTQAIGLVENQSDWYLGNLWKNHRPWPALGRGF 226


>gi|395816209|ref|XP_003781599.1| PREDICTED: glycosyltransferase-like protein LARGE2 [Otolemur
           garnettii]
          Length = 687

 Score =  579 bits (1493), Expect = e-162,   Method: Compositional matrix adjust.
 Identities = 294/525 (56%), Positives = 351/525 (66%), Gaps = 63/525 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY AD +   VSWIPNKHYSG+YGL+KL LP  LP  LA+ IVLDTDV FA+DI
Sbjct: 121 PAVHVSFYDADELKHQVSWIPNKHYSGLYGLMKLVLPGALPPGLARVIVLDTDVTFASDI 180

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LWALF+    +Q IGLVENQS+WYLG LWKNH+PWPALGRGFNTGVILL L +LR   
Sbjct: 181 AELWALFAHFSDKQAIGLVENQSNWYLGNLWKNHRPWPALGRGFNTGVILLQLDRLRTAG 240

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   WR+ A + LLT   TSLADQDIFNA+I EHP LV  LPC WNVQLSD+T + E CY
Sbjct: 241 WEQMWRLTASRELLTLPATSLADQDIFNAVIKEHPGLVQPLPCIWNVQLSDHTLA-ERCY 299

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC--NLTQTTLQQ 403
           +E +DLK+IHWNSPKKL+VKNKH+ FFRN YLTF EYDGNLLRRELFGC        +QQ
Sbjct: 300 SEASDLKVIHWNSPKKLRVKNKHVAFFRNFYLTFTEYDGNLLRRELFGCPSQPLPGAVQQ 359

Query: 404 AELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEIN 463
           A L  L+EED C++ R+ +LT  R H+ FL +E   +   +DVTLV QLSMDRL      
Sbjct: 360 A-LEQLDEEDSCFEFRQQQLTVHRVHIAFLPHEL-PTPRPHDVTLVTQLSMDRL------ 411

Query: 464 PNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAG 523
                                      QM+E L +HW GP+SL LY++DAEAQQFL +  
Sbjct: 412 ---------------------------QMLEALCRHWPGPMSLALYLTDAEAQQFLRFVE 444

Query: 524 NSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKS 583
            S  L +R+++ YHVVY+EG  YP+N LRNVAL Q  TPYVFL DIDFLP + LY YL+S
Sbjct: 445 ASPVLSARQDVAYHVVYREGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYDYLRS 504

Query: 584 SIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------HAP 619
           SI  + + G   K  LVVPAFET  YR +FP S                        HAP
Sbjct: 505 SIEQLGL-GSQRKAALVVPAFETLHYRFSFPTSKEELLALLDAGALYTFRYHEWPRGHAP 563

Query: 620 TNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           T+++RW  A  PY+++WA D+EPY+V  RD PRYD RFVGFGWNK
Sbjct: 564 TDYTRWREAQAPYRVQWAADYEPYVVVPRDCPRYDPRFVGFGWNK 608



 Score =  256 bits (654), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 116/163 (71%), Positives = 134/163 (82%)

Query: 10  PTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVP 69
           P CE + VAIVCAG+NS+R L+TL+KSILFYRKNPLH HL+T+ VA NIL+TLF TW VP
Sbjct: 62  PRCERLHVAIVCAGHNSSRDLITLVKSILFYRKNPLHLHLVTNAVARNILETLFHTWMVP 121

Query: 70  QVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIA 129
            V VSFY AD +   VSWIPNKHYSG+YGL+KL LP  LP  LA+ IVLDTDV FA+DIA
Sbjct: 122 AVHVSFYDADELKHQVSWIPNKHYSGLYGLMKLVLPGALPPGLARVIVLDTDVTFASDIA 181

Query: 130 QLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           +LWALF+    +Q IGLVENQS+WYLG LWKNH+PWPALGRGF
Sbjct: 182 ELWALFAHFSDKQAIGLVENQSNWYLGNLWKNHRPWPALGRGF 224


>gi|405965905|gb|EKC31250.1| Glycosyltransferase-like protein LARGE1 [Crassostrea gigas]
          Length = 725

 Score =  579 bits (1492), Expect = e-162,   Method: Compositional matrix adjust.
 Identities = 293/538 (54%), Positives = 360/538 (66%), Gaps = 73/538 (13%)

Query: 154 YLGKLWKNHKPWPALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTI 213
           YL + W+     P +   FY AD V ED+ WI NKHYSG+YGLLKL LPK LP +L K I
Sbjct: 162 YLFETWR----VPDVKISFYHADKVKEDIEWIHNKHYSGIYGLLKLVLPKTLPTSLEKVI 217

Query: 214 VLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGV 273
           VLDTDV FATDIA+LW +FS L   + +GLVENQSDWYLGK+WKNH PWPALGRGFNTGV
Sbjct: 218 VLDTDVSFATDIAELWKMFSFLNGLKVLGLVENQSDWYLGKIWKNHIPWPALGRGFNTGV 277

Query: 274 ILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQ 333
           IL++L  LR+++W   W+ IA K L T  +T LADQDIFNA++ EHPY VY LPCQWNVQ
Sbjct: 278 ILMNLRVLREMNWMEMWKKIAVKELETMQYTQLADQDIFNAVLKEHPYFVYHLPCQWNVQ 337

Query: 334 LSDNTRSDELCYTELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFG 393
           LSDN++SD+ CY +L + KIIHWNSPKK KV+NKH+++FRNL+L F+EYDGNLLRRELFG
Sbjct: 338 LSDNSKSDK-CYNQLEEPKIIHWNSPKKFKVENKHVDYFRNLHLAFVEYDGNLLRRELFG 396

Query: 394 CNLTQT---TLQQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVA 450
           C  ++    T  Q E+S    ED C++ RR +    RTHL++L+Y+Y+   +  DVTLV 
Sbjct: 397 CEKSEDANPTASQEEVS----EDECFEFRREQSLVHRTHLFYLDYKYQP--EEYDVTLVT 450

Query: 451 QLSMDRLQMVEINPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYM 510
           QLS+DRLQM+E+  N                                  WEGPISL LY+
Sbjct: 451 QLSIDRLQMLELLSN---------------------------------RWEGPISLALYL 477

Query: 511 SDAEAQQFLSYAGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDID 570
           SDAEAQQFL YA +SE L  R NIGYH+VYK+G FYP+N LRNVALNQ  T ++FL DID
Sbjct: 478 SDAEAQQFLKYAQDSEILTKRTNIGYHIVYKDGQFYPVNYLRNVALNQSQTSHMFLSDID 537

Query: 571 FLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS-------------- 616
           FLPM GLY +LKS +  +DM      K LV+PAFETQRYR  FP +              
Sbjct: 538 FLPMPGLYNHLKSVVSRIDMEKEN--KALVIPAFETQRYRLEFPTTKADLLTKLDKEEIF 595

Query: 617 ----------HAPTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
                     H PTN+++W  A   Y ++W  D+EPYIV  + +P +D RFVGFGWNK
Sbjct: 596 TFRYDVWPQGHLPTNYTKWRTAKAMYTVKWERDYEPYIVVKKGIPLFDQRFVGFGWNK 653



 Score =  254 bits (649), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 115/164 (70%), Positives = 133/164 (81%)

Query: 9   IPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSV 68
           +P CEVI +AIVC+G+ STR +VTLIKS+LFYRKNPLHFH ++DT A  IL  LF TW V
Sbjct: 110 VPKCEVIHIAIVCSGHKSTRDVVTLIKSVLFYRKNPLHFHFLSDTTAQVILNYLFETWRV 169

Query: 69  PQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 128
           P V++SFY AD V ED+ WI NKHYSG+YGLLKL LPK LP +L K IVLDTDV FATDI
Sbjct: 170 PDVKISFYHADKVKEDIEWIHNKHYSGIYGLLKLVLPKTLPTSLEKVIVLDTDVSFATDI 229

Query: 129 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           A+LW +FS L   + +GLVENQSDWYLGK+WKNH PWPALGRGF
Sbjct: 230 AELWKMFSFLNGLKVLGLVENQSDWYLGKIWKNHIPWPALGRGF 273


>gi|332259858|ref|XP_003279001.1| PREDICTED: glycosyltransferase-like protein LARGE2 isoform 1
           [Nomascus leucogenys]
          Length = 720

 Score =  579 bits (1492), Expect = e-162,   Method: Compositional matrix adjust.
 Identities = 297/527 (56%), Positives = 354/527 (67%), Gaps = 65/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY AD +   VSWIPNKHYSG+YGL+KL LP  LP  LA+ IVLDTDV FA+DI
Sbjct: 152 PAVRVSFYHADELKPQVSWIPNKHYSGLYGLMKLVLPSALPAELARVIVLDTDVTFASDI 211

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           ++LWALF+     Q IGLVENQSDWYLG LWKNH+PWPALGRGFNTGVILL L +LR   
Sbjct: 212 SELWALFAHFSDTQAIGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILLRLDRLRQAG 271

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   WR+ A + LLT   TSLADQDIFNA+I EHP LV  LPC WNVQLSD+T + E CY
Sbjct: 272 WEQMWRLTARRELLTLPATSLADQDIFNAVIKEHPGLVQRLPCVWNVQLSDHTLA-ERCY 330

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLT----QTTL 401
           +E +DLK+IHWNSPKKL+VKNKH+EFFR+ YLTFLEYDGNLLRRELF C          L
Sbjct: 331 SEASDLKVIHWNSPKKLRVKNKHVEFFRSFYLTFLEYDGNLLRRELFVCPSQPPPGAEQL 390

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           QQA L+ L+EEDPC++ R+ +LT  R H+ FL +E       +DVTLVAQLSMDRL    
Sbjct: 391 QQA-LAQLDEEDPCFEFRQQQLTVHRVHVTFLPHEPPPPRP-HDVTLVAQLSMDRL---- 444

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                        QM+E L +HW GP+SL LY++DAEAQQFL +
Sbjct: 445 -----------------------------QMLEALCRHWPGPMSLALYLTDAEAQQFLHF 475

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
              S  L +R+++ YHVVY+EG  YP+N LRNVAL Q  TPYVFL DIDFLP + LY YL
Sbjct: 476 VEASPVLAARQDVAYHVVYREGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYDYL 535

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           ++SI  + + G   K  LVVPAFET  YR +FP S                        H
Sbjct: 536 RASIEQLGL-GSRRKAALVVPAFETLHYRFSFPHSKVELLASLDVGALYTFRYHEWPRGH 594

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           APT+++RW  A  PY+++WAPD+EPY+V  RD PRYD RFVGFGWNK
Sbjct: 595 APTDYARWRGAQDPYRVQWAPDYEPYVVVPRDCPRYDPRFVGFGWNK 641



 Score =  254 bits (649), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 114/161 (70%), Positives = 133/161 (82%)

Query: 12  CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQV 71
           CE++ VAIVCAG+NS+R ++TL+KS+LFYRKNPLH HL+TD VA NIL+TLF TW VP V
Sbjct: 95  CELLHVAIVCAGHNSSRDVITLVKSMLFYRKNPLHLHLVTDAVARNILETLFHTWMVPAV 154

Query: 72  EVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQL 131
            VSFY AD +   VSWIPNKHYSG+YGL+KL LP  LP  LA+ IVLDTDV FA+DI++L
Sbjct: 155 RVSFYHADELKPQVSWIPNKHYSGLYGLMKLVLPSALPAELARVIVLDTDVTFASDISEL 214

Query: 132 WALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           WALF+     Q IGLVENQSDWYLG LWKNH+PWPALGRGF
Sbjct: 215 WALFAHFSDTQAIGLVENQSDWYLGNLWKNHRPWPALGRGF 255


>gi|119588426|gb|EAW68020.1| glycosyltransferase-like 1B, isoform CRA_a [Homo sapiens]
          Length = 573

 Score =  578 bits (1490), Expect = e-162,   Method: Compositional matrix adjust.
 Identities = 296/527 (56%), Positives = 354/527 (67%), Gaps = 65/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY AD +   VSWIPNKHYSG+YGL+KL LP  LP  LA+ IVLDTDV FA+DI
Sbjct: 3   PAVRVSFYHADQLKPQVSWIPNKHYSGLYGLMKLVLPSALPAELARVIVLDTDVTFASDI 62

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           ++LWALF+     Q IGLVENQSDWYLG LWKNH+PWPALGRGFNTGVILL L +LR   
Sbjct: 63  SELWALFAHFSDTQAIGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILLRLDRLRQAG 122

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   WR+ A + LL+   TSLADQDIFNA+I EHP LV  LPC WNVQLSD+T + E CY
Sbjct: 123 WEQMWRLTARRELLSLPATSLADQDIFNAVIKEHPGLVQRLPCVWNVQLSDHTLA-ERCY 181

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLT----QTTL 401
           +E +DLK+IHWNSPKKL+VKNKH+EFFRN YLTFLEYDGNLLRRELF C          L
Sbjct: 182 SEASDLKVIHWNSPKKLRVKNKHVEFFRNFYLTFLEYDGNLLRRELFVCPSQPPPGAEQL 241

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           QQA L+ L+EEDPC++ R+ +LT  R H+ FL +E       +DVTLVAQLSMDR     
Sbjct: 242 QQA-LAQLDEEDPCFEFRQQQLTVHRVHVTFLPHEPPPPR-PHDVTLVAQLSMDR----- 294

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                       LQM+E L +HW GP+SL LY++DAEAQQFL +
Sbjct: 295 ----------------------------LQMLEALCRHWPGPMSLALYLTDAEAQQFLHF 326

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
              S  L +R+++ YHVVY+EG  YP+N LRNVAL Q  TPYVFL DIDFLP + LY YL
Sbjct: 327 VEASPVLAARQDVAYHVVYREGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYDYL 386

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           ++SI  + + G   K  LVVPAFET RYR +FP S                        H
Sbjct: 387 RASIEQLGL-GSRRKAALVVPAFETLRYRFSFPHSKVELLALLDAGTLYTFRYHEWPRGH 445

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           APT+++RW  A  PY+++WA ++EPY+V  RD PRYD RFVGFGWNK
Sbjct: 446 APTDYARWREAQAPYRVQWAANYEPYVVVPRDCPRYDPRFVGFGWNK 492



 Score =  166 bits (420), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 75/105 (71%), Positives = 84/105 (80%)

Query: 68  VPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATD 127
           VP V VSFY AD +   VSWIPNKHYSG+YGL+KL LP  LP  LA+ IVLDTDV FA+D
Sbjct: 2   VPAVRVSFYHADQLKPQVSWIPNKHYSGLYGLMKLVLPSALPAELARVIVLDTDVTFASD 61

Query: 128 IAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           I++LWALF+     Q IGLVENQSDWYLG LWKNH+PWPALGRGF
Sbjct: 62  ISELWALFAHFSDTQAIGLVENQSDWYLGNLWKNHRPWPALGRGF 106


>gi|402893700|ref|XP_003910029.1| PREDICTED: glycosyltransferase-like protein LARGE2 [Papio anubis]
          Length = 690

 Score =  578 bits (1490), Expect = e-162,   Method: Compositional matrix adjust.
 Identities = 297/527 (56%), Positives = 352/527 (66%), Gaps = 65/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY AD +   VSWIPNKHYSG+YGL+KL LP  LP  LA  IVLDTDV FA+DI
Sbjct: 120 PAVRVSFYNADELKSQVSWIPNKHYSGLYGLMKLVLPSALPAELAHVIVLDTDVTFASDI 179

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           ++LWALF+     Q IGLVENQSDWYLG LWKNH+PWPALGRGFNTGVILL L +LR   
Sbjct: 180 SELWALFAHFSDTQAIGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILLRLDRLRQAG 239

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   WR+ A + LLT   TSLADQDIFNA+I EHP LV  LPC WNVQLSD+T + E CY
Sbjct: 240 WEQMWRLTARRELLTLPATSLADQDIFNAVIKEHPGLVQPLPCVWNVQLSDHTLA-ERCY 298

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLT----QTTL 401
           +  +DLK+IHWNSPKKL+VKNKH+EFFRN YLTFLEYDGNLLRRELFGC          L
Sbjct: 299 SAASDLKVIHWNSPKKLQVKNKHVEFFRNFYLTFLEYDGNLLRRELFGCPSQPPPGAEQL 358

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           QQA L+ L+EEDPC++ R+ +LT  R H+ FL +E       +DVTLVAQLSMDRL    
Sbjct: 359 QQA-LAQLDEEDPCFEFRQQQLTVHRVHVTFLPHEPPPPR-PHDVTLVAQLSMDRL---- 412

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                        QM+E L +HW GP+SL LY++DAEAQQFL +
Sbjct: 413 -----------------------------QMLEALCRHWPGPMSLALYLTDAEAQQFLRF 443

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
              S  L +R+++ YHVVY+EG  YP+N LRNVAL Q  TPYVFL DIDFLP + LY YL
Sbjct: 444 IEASPVLAARQDVAYHVVYREGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYDYL 503

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           ++SI  + + G   K  LVVPAFET  YR +FP S                        H
Sbjct: 504 RASIEQLGL-GSQRKAALVVPAFETLHYRFSFPHSKVELLALLDVGALYTFRYHEWPRGH 562

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           APT+++RW  A  PY+++WA D+EPY+V  RD PRYD RFVGFGWNK
Sbjct: 563 APTDYARWREAQAPYRVQWAADYEPYVVVPRDCPRYDPRFVGFGWNK 609



 Score =  251 bits (641), Expect = 8e-64,   Method: Compositional matrix adjust.
 Identities = 117/173 (67%), Positives = 135/173 (78%), Gaps = 3/173 (1%)

Query: 3   SDEEPVIP---TCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNIL 59
           SD  P  P    CE++ VAIVCAG+NS+R ++TL+KS+LFYRKNPLH HL+TD VA NIL
Sbjct: 51  SDCGPPPPRPSKCELLHVAIVCAGHNSSRDVITLVKSMLFYRKNPLHLHLVTDAVARNIL 110

Query: 60  QTLFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLD 119
           + LF TW VP V VSFY AD +   VSWIPNKHYSG+YGL+KL LP  LP  LA  IVLD
Sbjct: 111 EMLFHTWMVPAVRVSFYNADELKSQVSWIPNKHYSGLYGLMKLVLPSALPAELAHVIVLD 170

Query: 120 TDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           TDV FA+DI++LWALF+     Q IGLVENQSDWYLG LWKNH+PWPALGRGF
Sbjct: 171 TDVTFASDISELWALFAHFSDTQAIGLVENQSDWYLGNLWKNHRPWPALGRGF 223


>gi|116235482|ref|NP_689525.3| glycosyltransferase-like protein LARGE2 precursor [Homo sapiens]
 gi|146345450|sp|Q8N3Y3.2|LARG2_HUMAN RecName: Full=Glycosyltransferase-like protein LARGE2; AltName:
           Full=Glycosyltransferase-like 1B
 gi|119588427|gb|EAW68021.1| glycosyltransferase-like 1B, isoform CRA_b [Homo sapiens]
          Length = 721

 Score =  577 bits (1488), Expect = e-162,   Method: Compositional matrix adjust.
 Identities = 296/527 (56%), Positives = 354/527 (67%), Gaps = 65/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY AD +   VSWIPNKHYSG+YGL+KL LP  LP  LA+ IVLDTDV FA+DI
Sbjct: 151 PAVRVSFYHADQLKPQVSWIPNKHYSGLYGLMKLVLPSALPAELARVIVLDTDVTFASDI 210

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           ++LWALF+     Q IGLVENQSDWYLG LWKNH+PWPALGRGFNTGVILL L +LR   
Sbjct: 211 SELWALFAHFSDTQAIGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILLRLDRLRQAG 270

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   WR+ A + LL+   TSLADQDIFNA+I EHP LV  LPC WNVQLSD+T + E CY
Sbjct: 271 WEQMWRLTARRELLSLPATSLADQDIFNAVIKEHPGLVQRLPCVWNVQLSDHTLA-ERCY 329

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLT----QTTL 401
           +E +DLK+IHWNSPKKL+VKNKH+EFFRN YLTFLEYDGNLLRRELF C          L
Sbjct: 330 SEASDLKVIHWNSPKKLRVKNKHVEFFRNFYLTFLEYDGNLLRRELFVCPSQPPPGAEQL 389

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           QQA L+ L+EEDPC++ R+ +LT  R H+ FL +E       +DVTLVAQLSMDRL    
Sbjct: 390 QQA-LAQLDEEDPCFEFRQQQLTVHRVHVTFLPHEPPPPRP-HDVTLVAQLSMDRL---- 443

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                        QM+E L +HW GP+SL LY++DAEAQQFL +
Sbjct: 444 -----------------------------QMLEALCRHWPGPMSLALYLTDAEAQQFLHF 474

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
              S  L +R+++ YHVVY+EG  YP+N LRNVAL Q  TPYVFL DIDFLP + LY YL
Sbjct: 475 VEASPVLAARQDVAYHVVYREGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYDYL 534

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           ++SI  + + G   K  LVVPAFET RYR +FP S                        H
Sbjct: 535 RASIEQLGL-GSRRKAALVVPAFETLRYRFSFPHSKVELLALLDAGTLYTFRYHEWPRGH 593

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           APT+++RW  A  PY+++WA ++EPY+V  RD PRYD RFVGFGWNK
Sbjct: 594 APTDYARWREAQAPYRVQWAANYEPYVVVPRDCPRYDPRFVGFGWNK 640



 Score =  254 bits (649), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 114/161 (70%), Positives = 133/161 (82%)

Query: 12  CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQV 71
           CE++ VAIVCAG+NS+R ++TL+KS+LFYRKNPLH HL+TD VA NIL+TLF TW VP V
Sbjct: 94  CELLHVAIVCAGHNSSRDVITLVKSMLFYRKNPLHLHLVTDAVARNILETLFHTWMVPAV 153

Query: 72  EVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQL 131
            VSFY AD +   VSWIPNKHYSG+YGL+KL LP  LP  LA+ IVLDTDV FA+DI++L
Sbjct: 154 RVSFYHADQLKPQVSWIPNKHYSGLYGLMKLVLPSALPAELARVIVLDTDVTFASDISEL 213

Query: 132 WALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           WALF+     Q IGLVENQSDWYLG LWKNH+PWPALGRGF
Sbjct: 214 WALFAHFSDTQAIGLVENQSDWYLGNLWKNHRPWPALGRGF 254


>gi|22713410|gb|AAH37291.1| GYLTL1B protein [Homo sapiens]
 gi|123981284|gb|ABM82471.1| glycosyltransferase-like 1B [synthetic construct]
 gi|123995351|gb|ABM85277.1| glycosyltransferase-like 1B [synthetic construct]
 gi|124000717|gb|ABM87867.1| glycosyltransferase-like 1B [synthetic construct]
          Length = 721

 Score =  577 bits (1487), Expect = e-162,   Method: Compositional matrix adjust.
 Identities = 296/527 (56%), Positives = 354/527 (67%), Gaps = 65/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY AD +   VSWIPNKHYSG+YGL+KL LP  LP  LA+ IVLDTDV FA+DI
Sbjct: 151 PAVRVSFYHADQLKPQVSWIPNKHYSGLYGLMKLVLPSALPAELARVIVLDTDVTFASDI 210

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           ++LWALF+     Q IGLVENQSDWYLG LWKNH+PWPALGRGFNTGVILL L +LR   
Sbjct: 211 SELWALFAHFSDTQAIGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILLRLDRLRQAG 270

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   WR+ A + LL+   TSLADQDIFNA+I EHP LV  LPC WNVQLSD+T + E CY
Sbjct: 271 WEQMWRLTARRELLSLPATSLADQDIFNAVIKEHPGLVQRLPCVWNVQLSDHTLA-ERCY 329

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLT----QTTL 401
           +E +DLK+IHWNSPKKL+VKNKH+EFFRN YLTFLEYDGNLLRRELF C          L
Sbjct: 330 SEASDLKVIHWNSPKKLRVKNKHVEFFRNFYLTFLEYDGNLLRRELFVCPSQPPPGAEQL 389

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           QQA L+ L+EEDPC++ R+ +LT  R H+ FL +E       +DVTLVAQLSMDRL    
Sbjct: 390 QQA-LAQLDEEDPCFEFRQQQLTVHRVHVTFLPHEPPPPRP-HDVTLVAQLSMDRL---- 443

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                        QM+E L +HW GP+SL LY++DAEAQQFL +
Sbjct: 444 -----------------------------QMLEALCRHWPGPMSLALYLTDAEAQQFLHF 474

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
              S  L +R+++ YHVVY+EG  YP+N LRNVAL Q  TPYVFL DIDFLP + LY YL
Sbjct: 475 VEASPVLAARQDVAYHVVYREGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYDYL 534

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           ++SI  + + G   K  LVVPAFET RYR +FP S                        H
Sbjct: 535 RASIEQLGL-GSRRKAALVVPAFETLRYRFSFPHSKVELLALLDAGTLYTFRYHEWPRGH 593

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           APT+++RW  A  PY+++WA ++EPY+V  RD PRYD RFVGFGWNK
Sbjct: 594 APTDYARWREAQAPYRVQWAANYEPYVVVPRDCPRYDPRFVGFGWNK 640



 Score =  254 bits (649), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 114/161 (70%), Positives = 133/161 (82%)

Query: 12  CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQV 71
           CE++ VAIVCAG+NS+R ++TL+KS+LFYRKNPLH HL+TD VA NIL+TLF TW VP V
Sbjct: 94  CELLHVAIVCAGHNSSRDVITLVKSMLFYRKNPLHLHLVTDAVARNILETLFHTWMVPAV 153

Query: 72  EVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQL 131
            VSFY AD +   VSWIPNKHYSG+YGL+KL LP  LP  LA+ IVLDTDV FA+DI++L
Sbjct: 154 RVSFYHADQLKPQVSWIPNKHYSGLYGLMKLVLPSALPAELARVIVLDTDVTFASDISEL 213

Query: 132 WALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           WALF+     Q IGLVENQSDWYLG LWKNH+PWPALGRGF
Sbjct: 214 WALFAHFSDTQAIGLVENQSDWYLGNLWKNHRPWPALGRGF 254


>gi|21755403|dbj|BAC04675.1| unnamed protein product [Homo sapiens]
          Length = 721

 Score =  577 bits (1487), Expect = e-162,   Method: Compositional matrix adjust.
 Identities = 296/527 (56%), Positives = 354/527 (67%), Gaps = 65/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY AD +   VSWIPNKHYSG+YGL+KL LP  LP  LA+ IVLDTDV FA+DI
Sbjct: 151 PAVRVSFYHADQLKPRVSWIPNKHYSGLYGLMKLVLPSALPAELARVIVLDTDVTFASDI 210

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           ++LWALF+     Q IGLVENQSDWYLG LWKNH+PWPALGRGFNTGVILL L +LR   
Sbjct: 211 SELWALFAHFSDTQAIGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILLRLDRLRQAG 270

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   WR+ A + LL+   TSLADQDIFNA+I EHP LV  LPC WNVQLSD+T + E CY
Sbjct: 271 WEQMWRLTARRELLSLPATSLADQDIFNAVIKEHPGLVQRLPCVWNVQLSDHTLA-ERCY 329

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLT----QTTL 401
           +E +DLK+IHWNSPKKL+VKNKH+EFFRN YLTFLEYDGNLLRRELF C          L
Sbjct: 330 SEASDLKVIHWNSPKKLRVKNKHVEFFRNFYLTFLEYDGNLLRRELFVCPSQPPPGAEQL 389

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           QQA L+ L+EEDPC++ R+ +LT  R H+ FL +E       +DVTLVAQLSMDRL    
Sbjct: 390 QQA-LAQLDEEDPCFEFRQQQLTVHRVHVTFLPHEPPPPRP-HDVTLVAQLSMDRL---- 443

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                        QM+E L +HW GP+SL LY++DAEAQQFL +
Sbjct: 444 -----------------------------QMLEALCRHWPGPMSLALYLTDAEAQQFLHF 474

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
              S  L +R+++ YHVVY+EG  YP+N LRNVAL Q  TPYVFL DIDFLP + LY YL
Sbjct: 475 VEASPVLAARQDVAYHVVYREGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYDYL 534

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           ++SI  + + G   K  LVVPAFET RYR +FP S                        H
Sbjct: 535 RASIEQLGL-GSRRKAALVVPAFETLRYRFSFPHSKVELLALLDAGTLYTFRYHEWPRGH 593

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           APT+++RW  A  PY+++WA ++EPY+V  RD PRYD RFVGFGWNK
Sbjct: 594 APTDYARWREAQAPYRVQWAANYEPYVVVPRDCPRYDPRFVGFGWNK 640



 Score =  254 bits (648), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 114/161 (70%), Positives = 133/161 (82%)

Query: 12  CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQV 71
           CE++ VAIVCAG+NS+R ++TL+KS+LFYRKNPLH HL+TD VA NIL+TLF TW VP V
Sbjct: 94  CELLHVAIVCAGHNSSRDVITLVKSMLFYRKNPLHLHLVTDAVARNILETLFHTWMVPAV 153

Query: 72  EVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQL 131
            VSFY AD +   VSWIPNKHYSG+YGL+KL LP  LP  LA+ IVLDTDV FA+DI++L
Sbjct: 154 RVSFYHADQLKPRVSWIPNKHYSGLYGLMKLVLPSALPAELARVIVLDTDVTFASDISEL 213

Query: 132 WALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           WALF+     Q IGLVENQSDWYLG LWKNH+PWPALGRGF
Sbjct: 214 WALFAHFSDTQAIGLVENQSDWYLGNLWKNHRPWPALGRGF 254


>gi|426368099|ref|XP_004051050.1| PREDICTED: glycosyltransferase-like protein LARGE2 isoform 1
           [Gorilla gorilla gorilla]
 gi|426368101|ref|XP_004051051.1| PREDICTED: glycosyltransferase-like protein LARGE2 isoform 2
           [Gorilla gorilla gorilla]
          Length = 721

 Score =  576 bits (1485), Expect = e-161,   Method: Compositional matrix adjust.
 Identities = 296/527 (56%), Positives = 353/527 (66%), Gaps = 65/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY AD +   VSWIPNKHYSG+YGL+KL LP  LP  LA+ IVLDTDV FA+DI
Sbjct: 151 PAVRVSFYHADQLKPQVSWIPNKHYSGLYGLMKLVLPSALPAELARVIVLDTDVTFASDI 210

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
            +LWALF+     Q IGLVENQSDWYLG LWKNH+PWPALGRGFNTGVILL L +LR   
Sbjct: 211 LELWALFAHFSDTQAIGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILLRLDRLRQAD 270

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   W++ A + LL+   TSLADQDIFNA+I EHP LV  LPC WNVQLSD+T + E CY
Sbjct: 271 WEQMWKLTARRELLSLPATSLADQDIFNAVIKEHPGLVQRLPCVWNVQLSDHTLA-ERCY 329

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLT----QTTL 401
           +E +DLK+IHWNSPKKL+VKNKH+EFFRN YLTFLEYDGNLLRRELF C          L
Sbjct: 330 SEASDLKVIHWNSPKKLRVKNKHVEFFRNFYLTFLEYDGNLLRRELFVCPSQPPPGAEQL 389

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           QQA L+ L+EEDPC++ R+ +LT  R H+ FL +E       +DVTLVAQLSMDRL    
Sbjct: 390 QQA-LAQLDEEDPCFEFRQQQLTVHRVHVTFLPHEPPPPRP-HDVTLVAQLSMDRL---- 443

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                        QM+E L +HW GP+SL LY++DAEAQQFL +
Sbjct: 444 -----------------------------QMLEALCRHWPGPMSLALYLTDAEAQQFLHF 474

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
              S  L +R+++ YHVVY+EG  YP+N LRNVAL Q  TPYVFL DIDFLP + LY YL
Sbjct: 475 VEASPVLAARQDVAYHVVYREGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYDYL 534

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           ++SI  + + G   K  LVVPAFET RYR +FP S                        H
Sbjct: 535 RASIEQLGL-GSRRKAALVVPAFETLRYRFSFPHSKVELLALLDAGTLYTFRYHEWPRGH 593

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           APT+++RW  A  PY+++WA D+EPY+V  RD PRYD RFVGFGWNK
Sbjct: 594 APTDYARWREAQAPYRVQWAADYEPYVVVPRDCPRYDPRFVGFGWNK 640



 Score =  253 bits (646), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 114/161 (70%), Positives = 132/161 (81%)

Query: 12  CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQV 71
           CE++ VAIVCAG+NS+R ++TL+KS+LFYRKNPLH HL+TD VA NIL+TLF TW VP V
Sbjct: 94  CELLHVAIVCAGHNSSRDVITLVKSMLFYRKNPLHLHLVTDAVARNILETLFHTWMVPAV 153

Query: 72  EVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQL 131
            VSFY AD +   VSWIPNKHYSG+YGL+KL LP  LP  LA+ IVLDTDV FA+DI +L
Sbjct: 154 RVSFYHADQLKPQVSWIPNKHYSGLYGLMKLVLPSALPAELARVIVLDTDVTFASDILEL 213

Query: 132 WALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           WALF+     Q IGLVENQSDWYLG LWKNH+PWPALGRGF
Sbjct: 214 WALFAHFSDTQAIGLVENQSDWYLGNLWKNHRPWPALGRGF 254


>gi|338712079|ref|XP_001915260.2| PREDICTED: LOW QUALITY PROTEIN: glycosyltransferase-like 1B [Equus
           caballus]
          Length = 695

 Score =  576 bits (1485), Expect = e-161,   Method: Compositional matrix adjust.
 Identities = 295/526 (56%), Positives = 352/526 (66%), Gaps = 63/526 (11%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY AD +   VSWIPNKHYSG+YGL+KL LP  LP  LA+ IVLDTDV FA+DI
Sbjct: 126 PAVRVSFYDADELKPQVSWIPNKHYSGLYGLMKLVLPGTLPLDLARVIVLDTDVTFASDI 185

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LWALF+     Q IGLVENQSDWYLG LW+NH+PWPALGRGFNTGVILL L +LR + 
Sbjct: 186 AELWALFAHFSDEQVIGLVENQSDWYLGNLWRNHRPWPALGRGFNTGVILLRLDRLRQVG 245

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   WR+ A + LLT   TSLADQDIFNA+I EHP LV  LPC WNVQLSD+T + E CY
Sbjct: 246 WEQMWRLTATRELLTLPATSLADQDIFNAVIKEHPGLVRPLPCVWNVQLSDHTLA-EGCY 304

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC-NLTQTTLQ-- 402
           +E +DLK+IHWNSPKKL+VKNKH+ FFRNLYLTFLEYDGNLLRRELFGC +L     +  
Sbjct: 305 SEASDLKVIHWNSPKKLRVKNKHVGFFRNLYLTFLEYDGNLLRRELFGCPSLPHPGAEQW 364

Query: 403 QAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEI 462
           Q  L+ L+ ED C + R+ +LT  R H+ FL +E        DVTLVAQLSMDRL     
Sbjct: 365 QRALAQLDGEDACSEFRQQQLTVHRVHVTFLPHELPPPRP-LDVTLVAQLSMDRL----- 418

Query: 463 NPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYA 522
                                       QM+E L +HW GP+SL LY++DAEAQQFL + 
Sbjct: 419 ----------------------------QMLEALCRHWPGPMSLALYLTDAEAQQFLRFV 450

Query: 523 GNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLK 582
             S  L +R+++ YHVVY+EG  YPIN LRNVAL Q  TPYVFL DIDFLP + LY YL+
Sbjct: 451 EASAVLSARQDVAYHVVYREGPLYPINQLRNVALAQALTPYVFLSDIDFLPAYSLYDYLR 510

Query: 583 SSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------HA 618
           +SI  +++ G   K  LVVPAFET  YR +FP S                        HA
Sbjct: 511 ASIEQLELGGR-RKAALVVPAFETLHYRFSFPRSKAEVLASLDAGALYTFRYHEWPRGHA 569

Query: 619 PTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           PT+++RW  A  PY+++WA D+EPY+V  RD PRYD RFVGFGWNK
Sbjct: 570 PTDYARWREAQAPYRVQWAADYEPYVVVPRDCPRYDPRFVGFGWNK 615



 Score =  256 bits (653), Expect = 4e-65,   Method: Compositional matrix adjust.
 Identities = 116/167 (69%), Positives = 135/167 (80%)

Query: 6   EPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFST 65
            P+ P CE++ VAIVCAG+NS+R +VTL+KSILFYRKNPLH HL+TD VA +IL+ LF T
Sbjct: 63  RPLAPKCELLHVAIVCAGHNSSRDVVTLVKSILFYRKNPLHLHLVTDAVARSILEMLFHT 122

Query: 66  WSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFA 125
           W VP V VSFY AD +   VSWIPNKHYSG+YGL+KL LP  LP  LA+ IVLDTDV FA
Sbjct: 123 WMVPAVRVSFYDADELKPQVSWIPNKHYSGLYGLMKLVLPGTLPLDLARVIVLDTDVTFA 182

Query: 126 TDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           +DIA+LWALF+     Q IGLVENQSDWYLG LW+NH+PWPALGRGF
Sbjct: 183 SDIAELWALFAHFSDEQVIGLVENQSDWYLGNLWRNHRPWPALGRGF 229


>gi|301773854|ref|XP_002922345.1| PREDICTED: LOW QUALITY PROTEIN: glycosyltransferase-like protein
           LARGE2-like [Ailuropoda melanoleuca]
          Length = 716

 Score =  576 bits (1484), Expect = e-161,   Method: Compositional matrix adjust.
 Identities = 293/526 (55%), Positives = 354/526 (67%), Gaps = 63/526 (11%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY A+ +   VSWIPNKHYSG+YGL+KL LP  LP  LA+ IVLDTDV FA+DI
Sbjct: 148 PAVWVSFYDAEELKPQVSWIPNKHYSGLYGLMKLVLPATLPPDLARVIVLDTDVTFASDI 207

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LWALF+    +Q IGLVENQSDWYLG LW+NH+PWPALGRGFNTGVILL L +LR   
Sbjct: 208 AELWALFAHFSDKQVIGLVENQSDWYLGNLWRNHRPWPALGRGFNTGVILLRLDRLRQAG 267

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   W++ A + LLT   TSLADQDIFNA+I EHP LV  LPC WNVQLSD+T + E CY
Sbjct: 268 WEQMWKLTATRELLTLPATSLADQDIFNAVIKEHPRLVQPLPCVWNVQLSDHTLA-ERCY 326

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQTTLQ--- 402
           +E +DLK+IHWNSPKKL+VKNKH++ FRNLYLTFLEYDGNLLRRELFGC     +     
Sbjct: 327 SEASDLKVIHWNSPKKLRVKNKHVDHFRNLYLTFLEYDGNLLRRELFGCPSPPPSGAEQV 386

Query: 403 QAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEI 462
           Q  L+ L+EED C++ R+ +LT  R H+ FL  E       +DVTLVAQLS+DRL     
Sbjct: 387 QRALAQLDEEDACFEFRQQQLTVHRVHVTFLPLEPPLPR-PHDVTLVAQLSIDRL----- 440

Query: 463 NPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYA 522
                                       QM+E L +HW GP+SL LY++DAEAQQFL + 
Sbjct: 441 ----------------------------QMLEALCRHWPGPMSLALYLTDAEAQQFLRFV 472

Query: 523 GNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLK 582
             S  L +R+N+ YHVVY+EG  YP+N LRNVAL Q  TPYVFL DIDFLP + LY YL+
Sbjct: 473 EASAVLSARQNVAYHVVYREGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYDYLR 532

Query: 583 SSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------HA 618
           +SI  + + G   K  LVVPAFET RYR +FP+S                        HA
Sbjct: 533 ASIEQLKL-GRERKAALVVPAFETLRYRFSFPSSKAELLALLDSGSLYTFRYHEWPQGHA 591

Query: 619 PTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           PT+++RW  A TPY+++WA D+EPY+V  RD PRYD RFVGFGWNK
Sbjct: 592 PTDYARWREAQTPYRVQWAADYEPYVVVPRDCPRYDPRFVGFGWNK 637



 Score =  248 bits (633), Expect = 9e-63,   Method: Compositional matrix adjust.
 Identities = 113/160 (70%), Positives = 133/160 (83%)

Query: 13  EVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQVE 72
           +++ VAIVCAG NS+R LVTL+KS+LFYRKNPLHFHL+TDT+A NIL+ LF TW VP V 
Sbjct: 92  QLLHVAIVCAGQNSSRDLVTLVKSVLFYRKNPLHFHLVTDTMARNILEMLFHTWMVPAVW 151

Query: 73  VSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLW 132
           VSFY A+ +   VSWIPNKHYSG+YGL+KL LP  LP  LA+ IVLDTDV FA+DIA+LW
Sbjct: 152 VSFYDAEELKPQVSWIPNKHYSGLYGLMKLVLPATLPPDLARVIVLDTDVTFASDIAELW 211

Query: 133 ALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           ALF+    +Q IGLVENQSDWYLG LW+NH+PWPALGRGF
Sbjct: 212 ALFAHFSDKQVIGLVENQSDWYLGNLWRNHRPWPALGRGF 251


>gi|193786298|dbj|BAG51581.1| unnamed protein product [Homo sapiens]
          Length = 690

 Score =  574 bits (1480), Expect = e-161,   Method: Compositional matrix adjust.
 Identities = 295/527 (55%), Positives = 353/527 (66%), Gaps = 65/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY AD +   VSWIPNKHYSG+YGL+KL LP  LP  LA+ IVLDTDV FA+DI
Sbjct: 120 PAVRVSFYHADQLKPQVSWIPNKHYSGLYGLMKLVLPSALPAELARVIVLDTDVTFASDI 179

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           ++LWALF+     Q IGLVENQSDWYLG LWKNH+PWPALGRGFNTGVILL L +LR   
Sbjct: 180 SELWALFAHFSDTQAIGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILLRLDRLRQAG 239

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   WR+ A + LL+   TSLADQDIFNA+I EHP LV  LPC WNVQLSD+T + E CY
Sbjct: 240 WEQMWRLTARRELLSLPATSLADQDIFNAVIKEHPGLVQRLPCVWNVQLSDHTLA-ERCY 298

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLT----QTTL 401
           +E +DLK+IHWNSPKKL+VKNKH+EFFRN YLTFLEYDGNLLRRELF C          L
Sbjct: 299 SEASDLKVIHWNSPKKLRVKNKHVEFFRNFYLTFLEYDGNLLRRELFVCPSQPPPGAEQL 358

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           QQA L+ L+EEDPC++ R+ +LT  R H+ FL +E       +DVTLVAQLSMDRL    
Sbjct: 359 QQA-LAQLDEEDPCFEFRQQQLTVHRVHVTFLPHEPPPPR-PHDVTLVAQLSMDRL---- 412

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                        QM+E L +HW GP+SL LY++DAEAQQFL +
Sbjct: 413 -----------------------------QMLEALCRHWPGPMSLALYLTDAEAQQFLHF 443

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
              S  L +R+++ YHVVY+EG  YP+N LRNVAL Q  TPYVFL DIDFLP + LY YL
Sbjct: 444 VEASPVLAARQDVAYHVVYREGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYDYL 503

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           ++SI  + + G   K  LVVPAFET RY  +FP S                        H
Sbjct: 504 RASIEQLGL-GSRRKAALVVPAFETLRYCFSFPHSKVELLALLDAGTLYTFRYHEWPRGH 562

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           APT+++RW  A  PY+++WA ++EPY+V  RD PRYD RFVGFGWNK
Sbjct: 563 APTDYARWREAQAPYRVQWAANYEPYVVVPRDCPRYDPRFVGFGWNK 609



 Score =  254 bits (649), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 114/161 (70%), Positives = 133/161 (82%)

Query: 12  CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQV 71
           CE++ VAIVCAG+NS+R ++TL+KS+LFYRKNPLH HL+TD VA NIL+TLF TW VP V
Sbjct: 63  CELLHVAIVCAGHNSSRDVITLVKSMLFYRKNPLHLHLVTDAVARNILETLFHTWMVPAV 122

Query: 72  EVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQL 131
            VSFY AD +   VSWIPNKHYSG+YGL+KL LP  LP  LA+ IVLDTDV FA+DI++L
Sbjct: 123 RVSFYHADQLKPQVSWIPNKHYSGLYGLMKLVLPSALPAELARVIVLDTDVTFASDISEL 182

Query: 132 WALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           WALF+     Q IGLVENQSDWYLG LWKNH+PWPALGRGF
Sbjct: 183 WALFAHFSDTQAIGLVENQSDWYLGNLWKNHRPWPALGRGF 223


>gi|397473583|ref|XP_003808287.1| PREDICTED: LOW QUALITY PROTEIN: glycosyltransferase-like protein
           LARGE2 [Pan paniscus]
          Length = 757

 Score =  574 bits (1480), Expect = e-161,   Method: Compositional matrix adjust.
 Identities = 296/527 (56%), Positives = 352/527 (66%), Gaps = 65/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY AD +   VSWIPNKHYSG+YGL+KL LP  LP  LA+ IVLDTDV FA+DI
Sbjct: 187 PAVRVSFYHADQLKPQVSWIPNKHYSGLYGLMKLVLPSALPAELARVIVLDTDVTFASDI 246

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           ++LWALF+     Q IGLVENQSDWYLG LWKNH+PWPALGRGFNTGVILL L +LR   
Sbjct: 247 SELWALFAHFSDTQAIGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILLRLDRLRQAG 306

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   WR+ A + LL+   TSLADQDIFNA+I EHP LV  LPC WNVQLSD+T + E CY
Sbjct: 307 WEQMWRLTARRELLSLPATSLADQDIFNAVIKEHPGLVQHLPCVWNVQLSDHTLA-ERCY 365

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLT----QTTL 401
           +E +DLK+IHWNSPKKL+VKNKH+EFFRN YLTFLEYD NLLRRELF C          L
Sbjct: 366 SEASDLKVIHWNSPKKLRVKNKHVEFFRNFYLTFLEYDXNLLRRELFVCPSQPPPGAEQL 425

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           QQA L+ L+EEDPC++ R+  LT  R H+ FL +E       +DVTLVAQLSMDRL    
Sbjct: 426 QQA-LAQLDEEDPCFEFRQQPLTVHRVHVTFLPHEPPPPRP-HDVTLVAQLSMDRL---- 479

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                        QM+E L +HW GP+SL LY++DAEAQQFL +
Sbjct: 480 -----------------------------QMLEALCRHWPGPMSLALYLTDAEAQQFLHF 510

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
              S  L +R+++ YHVVY+EG  YP+N LRNVAL Q  TPYVFL DIDFLP + LY YL
Sbjct: 511 VEASPVLAARQDVAYHVVYREGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYDYL 570

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           ++SI  + + G   K  LVVPAFET RYR +FP S                        H
Sbjct: 571 RASIEQLGL-GSWRKAALVVPAFETLRYRFSFPHSKVELLALLDAGTLYTFRYHEWPRGH 629

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           APT+++RW  A  PY+++WA D+EPY+V  RD PRYD RFVGFGWNK
Sbjct: 630 APTDYARWREAQAPYRVQWAADYEPYVVVPRDCPRYDPRFVGFGWNK 676



 Score =  252 bits (643), Expect = 5e-64,   Method: Compositional matrix adjust.
 Identities = 113/161 (70%), Positives = 132/161 (81%)

Query: 12  CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQV 71
           CE++ VAIVCAG+NS+R ++TL+KS+LFYRKNPLH HL+TD VA NIL+ LF TW VP V
Sbjct: 130 CELLHVAIVCAGHNSSRDVITLVKSMLFYRKNPLHLHLVTDAVARNILEMLFHTWMVPAV 189

Query: 72  EVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQL 131
            VSFY AD +   VSWIPNKHYSG+YGL+KL LP  LP  LA+ IVLDTDV FA+DI++L
Sbjct: 190 RVSFYHADQLKPQVSWIPNKHYSGLYGLMKLVLPSALPAELARVIVLDTDVTFASDISEL 249

Query: 132 WALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           WALF+     Q IGLVENQSDWYLG LWKNH+PWPALGRGF
Sbjct: 250 WALFAHFSDTQAIGLVENQSDWYLGNLWKNHRPWPALGRGF 290


>gi|355566575|gb|EHH22954.1| hypothetical protein EGK_06315, partial [Macaca mulatta]
 gi|355752188|gb|EHH56308.1| hypothetical protein EGM_05685, partial [Macaca fascicularis]
          Length = 647

 Score =  573 bits (1477), Expect = e-160,   Method: Compositional matrix adjust.
 Identities = 299/537 (55%), Positives = 352/537 (65%), Gaps = 75/537 (13%)

Query: 166 PALGRGFYLADSVVED----------VSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVL 215
           PA+   FY AD +             VSWIPNKHYSG+YGL+KL LP  LP  LA  IVL
Sbjct: 67  PAVRVSFYNADELKAGAQPFPAPPSCVSWIPNKHYSGLYGLMKLVLPSALPTELAHVIVL 126

Query: 216 DTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVIL 275
           DTDV FA+DI++LWALF+     Q IGLVENQSDWYLG LWKNH+PWPALGRGFNTGVIL
Sbjct: 127 DTDVTFASDISELWALFAHFSDTQAIGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVIL 186

Query: 276 LDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLS 335
           L L +LR   W   WR+ A + LLT   TSLADQDIFNA+I EHP LV  LPC WNVQLS
Sbjct: 187 LRLDRLRQAGWEQMWRLTARRELLTLPATSLADQDIFNAVIKEHPGLVQPLPCVWNVQLS 246

Query: 336 DNTRSDELCYTELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCN 395
           D+T + E CY+  +DLK+IHWNSPKKL+VKNKH+EFFRN YLTFLEYDGNLLRRELFGC 
Sbjct: 247 DHTLA-ERCYSAASDLKVIHWNSPKKLQVKNKHVEFFRNFYLTFLEYDGNLLRRELFGCP 305

Query: 396 LT----QTTLQQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQ 451
                    LQQA LS LNEEDPC++ R+ +LT  R H+ FL +E       +DVTLVAQ
Sbjct: 306 SQPPPGAEQLQQA-LSQLNEEDPCFEFRQQQLTVHRVHVTFLPHEPPPPR-PHDVTLVAQ 363

Query: 452 LSMDRLQMVEINPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMS 511
           LSMDR                                 LQM+E L +HW GP+SL LY++
Sbjct: 364 LSMDR---------------------------------LQMLEALCRHWPGPMSLALYLT 390

Query: 512 DAEAQQFLSYAGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDF 571
           DAEAQQFL +   S  L +R+++ YHVVY+EG  YP+N LRNVAL Q  TPYVFL DIDF
Sbjct: 391 DAEAQQFLRFIEASPVLAARQDVAYHVVYREGPLYPVNQLRNVALAQALTPYVFLSDIDF 450

Query: 572 LPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS--------------- 616
           LP + LY YL++SI  + + G   K  LVVPAFET  YR +FP S               
Sbjct: 451 LPAYSLYDYLRASIEQLGL-GSQRKAALVVPAFETLHYRFSFPHSKVELLALLDVGALYT 509

Query: 617 ---------HAPTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
                    HAPT+++RW  A  PY+++WA D+EPY+V  RD PRYD RFVGFGWNK
Sbjct: 510 FRYHEWPRGHAPTDYARWREAQAPYRVQWAADYEPYVVVPRDCPRYDPRFVGFGWNK 566



 Score =  237 bits (604), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 111/170 (65%), Positives = 130/170 (76%), Gaps = 10/170 (5%)

Query: 13  EVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQVE 72
           +++ VAIVCAG+NS+R ++TL+KS+LFYRKNPLH HL+TD VA NIL+ LF TW VP V 
Sbjct: 11  QLLHVAIVCAGHNSSRDVITLVKSMLFYRKNPLHLHLVTDAVARNILEMLFHTWMVPAVR 70

Query: 73  VSFYLADSVVED----------VSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDV 122
           VSFY AD +             VSWIPNKHYSG+YGL+KL LP  LP  LA  IVLDTDV
Sbjct: 71  VSFYNADELKAGAQPFPAPPSCVSWIPNKHYSGLYGLMKLVLPSALPTELAHVIVLDTDV 130

Query: 123 IFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
            FA+DI++LWALF+     Q IGLVENQSDWYLG LWKNH+PWPALGRGF
Sbjct: 131 TFASDISELWALFAHFSDTQAIGLVENQSDWYLGNLWKNHRPWPALGRGF 180


>gi|354469848|ref|XP_003497334.1| PREDICTED: glycosyltransferase-like protein LARGE2-like isoform 1
           [Cricetulus griseus]
          Length = 690

 Score =  573 bits (1476), Expect = e-160,   Method: Compositional matrix adjust.
 Identities = 288/527 (54%), Positives = 357/527 (67%), Gaps = 65/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY A+ +   +SWIPNKHYSG+YGL+KL LP +LP +LA+ IVLDTDV F++DI
Sbjct: 122 PAVVIRFYDAEELKPLISWIPNKHYSGLYGLMKLVLPSILPLSLARVIVLDTDVTFSSDI 181

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LWALF+    +Q +GLVENQSDWYLG LWKNH+PWPALGRGFNTGVILL L +LR   
Sbjct: 182 AELWALFAHFSDKQVVGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILLWLDRLRQTG 241

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   W++IA++ LLT + TSLADQD+FNA+I EHP LV+ LPC WNVQLSD+T + E CY
Sbjct: 242 WEQMWKLIAKQELLTLMATSLADQDVFNAVIKEHPELVHPLPCVWNVQLSDHTLA-ERCY 300

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
            E  DLK+IHWNSPKKL+VKNKH EFFRNL+LTFL YDG LLRRELFGC          L
Sbjct: 301 LEAADLKVIHWNSPKKLRVKNKHAEFFRNLHLTFLGYDGKLLRRELFGCPNQLPPGAEQL 360

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           QQA L+ L+EE+PC++ R+ +L+  R H+ FL ++       +DVTLVAQLSMDR     
Sbjct: 361 QQA-LAQLDEEEPCFEFRQQQLSVHRVHITFLPHQPPPPR-PHDVTLVAQLSMDR----- 413

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                       LQM+E L +HW GP+SL LY++DAEAQQFL +
Sbjct: 414 ----------------------------LQMLEALCRHWPGPMSLALYLTDAEAQQFLHF 445

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
              S  L +R+++ YH+VY+EG  YP+N LRNVAL Q  TPYVFL DIDFLP + LY YL
Sbjct: 446 VETSPVLSARKDVAYHIVYREGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYDYL 505

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           ++SI  +++ G   K  LVVPAFET  YR +FP S                        H
Sbjct: 506 RASIEQLEL-GSQRKAALVVPAFETLHYRLSFPNSKAELLTLLDAGSLYTFRYHEWPQGH 564

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           APT+++RW  A  PY ++WA D+EPY+V  RD PRYD RFVGFGWNK
Sbjct: 565 APTDYARWREAKAPYHVQWAADYEPYVVVPRDCPRYDPRFVGFGWNK 611



 Score =  253 bits (645), Expect = 3e-64,   Method: Compositional matrix adjust.
 Identities = 122/214 (57%), Positives = 155/214 (72%), Gaps = 5/214 (2%)

Query: 10  PTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVP 69
           P CE++ VAIVCAGYNS+R ++TL+KS+LFYRKNPLH HL+TD +A NIL+TLF TW VP
Sbjct: 63  PKCELLHVAIVCAGYNSSREVITLVKSLLFYRKNPLHLHLVTDAIARNILETLFRTWMVP 122

Query: 70  QVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIA 129
            V + FY A+ +   +SWIPNKHYSG+YGL+KL LP +LP +LA+ IVLDTDV F++DIA
Sbjct: 123 AVVIRFYDAEELKPLISWIPNKHYSGLYGLMKLVLPSILPLSLARVIVLDTDVTFSSDIA 182

Query: 130 QLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFYLADSVVEDVSWIPNKH 189
           +LWALF+    +Q +GLVENQSDWYLG LWKNH+PWPALGRGF     ++    W+    
Sbjct: 183 ELWALFAHFSDKQVVGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILL----WLDRLR 238

Query: 190 YSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 223
            +G   + KL   + L  TL  T + D DV  A 
Sbjct: 239 QTGWEQMWKLIAKQEL-LTLMATSLADQDVFNAV 271


>gi|311247925|ref|XP_003122879.1| PREDICTED: glycosyltransferase-like 1B [Sus scrofa]
          Length = 726

 Score =  572 bits (1475), Expect = e-160,   Method: Compositional matrix adjust.
 Identities = 295/527 (55%), Positives = 357/527 (67%), Gaps = 64/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY AD +   VSWIPNKHYSG+YGL+KL LP  LP  LA+ IVLDTDV FA++I
Sbjct: 157 PAVRVSFYDADELKPQVSWIPNKHYSGLYGLMKLVLPGALPPDLARVIVLDTDVTFASNI 216

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LWALF+    +Q IGLVENQSDWYLG LW+NH+PWPALGRGFNTGVILL L +LR   
Sbjct: 217 AELWALFAHFSDKQVIGLVENQSDWYLGNLWRNHRPWPALGRGFNTGVILLRLDRLRQAG 276

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   W++IA + LLT   TSLADQDIFNA+I EHP+LV  LPC WNVQLSD+T + E CY
Sbjct: 277 WEQMWKLIATQELLTLPATSLADQDIFNAVIKEHPWLVQPLPCVWNVQLSDHTLA-ERCY 335

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQTTLQ--- 402
           +E +DLK+IHWNSPKKL+VK KH+E FRNL+LTFLEYDGNLLRRELFGC           
Sbjct: 336 SEASDLKVIHWNSPKKLRVKTKHVESFRNLHLTFLEYDGNLLRRELFGCPCPPPPGAEQL 395

Query: 403 QAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEI 462
           Q  L+ L+EED C++ R+ +LT  R H+ FL ++        DVTLVAQLSMDR      
Sbjct: 396 QRALAQLDEEDACFEFRQQQLTVHRVHITFLPHKPPPPQ-PYDVTLVAQLSMDR------ 448

Query: 463 NPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYA 522
                                      LQM+E L +HW GP+SL LY++DAEAQQFL + 
Sbjct: 449 ---------------------------LQMLEALCRHWPGPMSLALYLTDAEAQQFLRFV 481

Query: 523 GNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLK 582
             S  L +R+N+ YHVVY+EG  YPIN LRNVAL Q  TPYVFL DIDFLP + LY YL+
Sbjct: 482 ETSAVLSARQNVAYHVVYREGPLYPINQLRNVALAQALTPYVFLSDIDFLPAYSLYHYLR 541

Query: 583 SSIRSMDMHGHGGKKV-LVVPAFETQRYRTAFPAS------------------------H 617
           +SI  +++ G GG+KV LVVPAFET  YR +FP+S                        H
Sbjct: 542 ASIEQLEL-GTGGRKVALVVPAFETLHYRFSFPSSKAELLALLDAGSLYTFRYHEWPRGH 600

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           APT+++RW  A  PY+++WA D+EPY+V  RD PRYD RFVGFGWNK
Sbjct: 601 APTDYARWREAQAPYRVQWAADYEPYVVVPRDCPRYDPRFVGFGWNK 647



 Score =  259 bits (661), Expect = 4e-66,   Method: Compositional matrix adjust.
 Identities = 116/167 (69%), Positives = 138/167 (82%)

Query: 6   EPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFST 65
            P+ P CE++QVAIVCAG+NS+R ++TL+KS+LFYRKNPLH HL+TD VA NIL+TLF T
Sbjct: 94  RPLPPKCELLQVAIVCAGHNSSRDVITLVKSMLFYRKNPLHLHLVTDAVARNILETLFHT 153

Query: 66  WSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFA 125
           W VP V VSFY AD +   VSWIPNKHYSG+YGL+KL LP  LP  LA+ IVLDTDV FA
Sbjct: 154 WMVPAVRVSFYDADELKPQVSWIPNKHYSGLYGLMKLVLPGALPPDLARVIVLDTDVTFA 213

Query: 126 TDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           ++IA+LWALF+    +Q IGLVENQSDWYLG LW+NH+PWPALGRGF
Sbjct: 214 SNIAELWALFAHFSDKQVIGLVENQSDWYLGNLWRNHRPWPALGRGF 260


>gi|156405409|ref|XP_001640724.1| predicted protein [Nematostella vectensis]
 gi|156227860|gb|EDO48661.1| predicted protein [Nematostella vectensis]
          Length = 746

 Score =  572 bits (1474), Expect = e-160,   Method: Compositional matrix adjust.
 Identities = 289/536 (53%), Positives = 359/536 (66%), Gaps = 73/536 (13%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY  +   + VSW+PNKHYSGV+GL+KL L + LPE L K IVLDTDVIFA+D+
Sbjct: 174 PAVQTSFYHLEKYKDRVSWVPNKHYSGVFGLMKLVLTEALPEVLDKVIVLDTDVIFASDV 233

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LW L  +L +++ IGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDL KLR ++
Sbjct: 234 AELWKLLDQLTKKKAIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLNKLRRMN 293

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   WR+ AEK L+T L T+LADQDIFNA+I   P+L++ LPC WNVQL DNTRS++ CY
Sbjct: 294 WKDKWRLTAEKELMTMLSTALADQDIFNAVIKNDPHLLHKLPCTWNVQLGDNTRSEQ-CY 352

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----------N 395
           TE+ +LK+IHWNSPKKL VKNKH +FFRNLYLTFLEYDGNLLRR LF C          N
Sbjct: 353 TEVNELKVIHWNSPKKLLVKNKHGDFFRNLYLTFLEYDGNLLRRALFHCEKEGDKTTPSN 412

Query: 396 LTQTTLQQ---AELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQL 452
            + T  +Q    +L  ++ EDPCY+ ++ +  + RTHLY+L+Y++      +DVTLVAQL
Sbjct: 413 NSLTASEQEIATKLPGVDGEDPCYEFKQEQHLTQRTHLYYLDYDFTVEYGPHDVTLVAQL 472

Query: 453 SMDRLQMVEINPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSD 512
           SMDRL                                 QMVE L ++WEGPISL LYMSD
Sbjct: 473 SMDRL---------------------------------QMVEALCQNWEGPISLALYMSD 499

Query: 513 AEAQQFLSYAGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFL 572
           +EAQQFL YA  S  L++R+NIGYH+V+KEG FYP+N LRN+AL    TP+VFL DIDFL
Sbjct: 500 SEAQQFLRYALGSPVLKARKNIGYHIVFKEGQFYPVNYLRNIALEHARTPFVFLTDIDFL 559

Query: 573 PMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS---------------- 616
           PM  L  Y+   + +  M      +  VVPAFET  Y+  FP +                
Sbjct: 560 PMPSLNVYINVRVNNKTMTLLS--QAYVVPAFETLHYKLEFPQTKSQLLSSWDLGTVFIF 617

Query: 617 --------HAPTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
                   HAPT++++W  ATT Y ++W  D+EPYIV  +D+ RYDTRFVGFGWNK
Sbjct: 618 RYNEWSKGHAPTDYAKWRTATTLYTVQWEEDYEPYIVVPKDIVRYDTRFVGFGWNK 673



 Score =  245 bits (626), Expect = 4e-62,   Method: Compositional matrix adjust.
 Identities = 109/165 (66%), Positives = 134/165 (81%)

Query: 8   VIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWS 67
            +P CEVI +A+VCAG++++R  VTLIKS+LFYR NPLHFH ++D+VA  IL TLF TW 
Sbjct: 113 AVPQCEVIHIAVVCAGHSASRQAVTLIKSLLFYRHNPLHFHFVSDSVAELILGTLFRTWG 172

Query: 68  VPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATD 127
           VP V+ SFY  +   + VSW+PNKHYSGV+GL+KL L + LPE L K IVLDTDVIFA+D
Sbjct: 173 VPAVQTSFYHLEKYKDRVSWVPNKHYSGVFGLMKLVLTEALPEVLDKVIVLDTDVIFASD 232

Query: 128 IAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           +A+LW L  +L +++ IGLVENQSDWYLGKLWKNHKPWPALGRGF
Sbjct: 233 VAELWKLLDQLTKKKAIGLVENQSDWYLGKLWKNHKPWPALGRGF 277


>gi|432090380|gb|ELK23806.1| Glycosyltransferase-like protein LARGE2, partial [Myotis davidii]
          Length = 632

 Score =  572 bits (1473), Expect = e-160,   Method: Compositional matrix adjust.
 Identities = 294/527 (55%), Positives = 353/527 (66%), Gaps = 65/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY A+ +   VSW+PNKHYSG+YGL+KL LP  LP  LA+ IVLDTDV FA+DI
Sbjct: 65  PAVRVSFYDANELKPQVSWVPNKHYSGLYGLMKLVLPGALPPDLARVIVLDTDVTFASDI 124

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LWALF+     Q IGLVENQSDWYLG LW+NHKPWPALGRGFNTGVILL L +LR   
Sbjct: 125 AELWALFAHFSDEQVIGLVENQSDWYLGNLWRNHKPWPALGRGFNTGVILLWLDRLRQAG 184

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   W++ A + LLT   TSLADQDIFNA+I  HP LV  LPC WNVQLSD+T + E CY
Sbjct: 185 WEQMWKLTATQELLTLPATSLADQDIFNAVIKGHPALVQPLPCVWNVQLSDHTLA-ERCY 243

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQ----TTL 401
            E  D+K IHWNSPKKL+VKNKH+EFFRNLYLTFLE+DGNLLRRELFGC   +      L
Sbjct: 244 LEAADIKAIHWNSPKKLRVKNKHVEFFRNLYLTFLEHDGNLLRRELFGCPSLRPPEAEQL 303

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           QQA L+ LNEED C++ R+ +LT  R H+ FL +E       +DVTLVAQLSMDR     
Sbjct: 304 QQA-LAQLNEEDACFEFRQQQLTVHRVHITFLPHEPPPPR-PHDVTLVAQLSMDR----- 356

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                       LQM+E L +HW GP+SL LY++DAEAQQFL +
Sbjct: 357 ----------------------------LQMLEALCRHWPGPMSLALYLTDAEAQQFLRF 388

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
              SE L +R+++ YHVVY+EG  YPIN +RNVAL Q  TPYVFL DIDFLP + LY YL
Sbjct: 389 VEASEVLSARQDVAYHVVYREGPLYPINQIRNVALAQALTPYVFLSDIDFLPAYSLYDYL 448

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           ++SI  +++ G   K  LVVPAFET  Y+ +FP S                        H
Sbjct: 449 RASIEQLEL-GSRRKAALVVPAFETLHYQFSFPTSKAELLSLLDAGSVYTFRYHEWPRGH 507

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           APT+++RW  A  PY+++WA D+EPY+V  RD PRYD RFVGFGWNK
Sbjct: 508 APTDYARWREAQAPYRVQWAVDYEPYVVVPRDCPRYDPRFVGFGWNK 554



 Score =  248 bits (632), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 122/216 (56%), Positives = 154/216 (71%), Gaps = 5/216 (2%)

Query: 8   VIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWS 67
            +P  +++ VAIVCAG++++R +VTL+KS+LFYRKNPLH HL+TD VA NIL+TLF TW 
Sbjct: 4   CLPRPQLLHVAIVCAGHHTSRDVVTLVKSLLFYRKNPLHLHLVTDAVARNILETLFHTWM 63

Query: 68  VPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATD 127
           +P V VSFY A+ +   VSW+PNKHYSG+YGL+KL LP  LP  LA+ IVLDTDV FA+D
Sbjct: 64  LPAVRVSFYDANELKPQVSWVPNKHYSGLYGLMKLVLPGALPPDLARVIVLDTDVTFASD 123

Query: 128 IAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFYLADSVVEDVSWIPN 187
           IA+LWALF+     Q IGLVENQSDWYLG LW+NHKPWPALGRGF     ++    W+  
Sbjct: 124 IAELWALFAHFSDEQVIGLVENQSDWYLGNLWRNHKPWPALGRGFNTGVILL----WLDR 179

Query: 188 KHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 223
              +G   + KLT  + L  TL  T + D D+  A 
Sbjct: 180 LRQAGWEQMWKLTATQEL-LTLPATSLADQDIFNAV 214


>gi|73983379|ref|XP_540758.2| PREDICTED: glycosyltransferase-like 1B [Canis lupus familiaris]
          Length = 721

 Score =  572 bits (1473), Expect = e-160,   Method: Compositional matrix adjust.
 Identities = 293/527 (55%), Positives = 353/527 (66%), Gaps = 65/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY A+ +   +SWIPNKHYSG+YGL+KL LP VLP  LA+ IVLDTDV FA+DI
Sbjct: 154 PAVRVSFYDAEELKPQISWIPNKHYSGLYGLMKLVLPGVLPPDLARVIVLDTDVTFASDI 213

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LWALF+     Q IGLVENQSDWYLG LW+NH+PWPALGRGFNTGVILL L +LR   
Sbjct: 214 AELWALFAHFSDEQVIGLVENQSDWYLGNLWRNHRPWPALGRGFNTGVILLWLDRLRQAG 273

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   W++ A + LLT   TSLADQDIFNA+I EHP LV  LPC WNVQLSD+T + E CY
Sbjct: 274 WEQMWKLTAGQELLTLPSTSLADQDIFNAVIKEHPGLVQPLPCVWNVQLSDHTLA-ERCY 332

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQTT----L 401
           +E  DLK+IHWNSPKKL+VKNKH+ +FRNLYLTFLEYDGNLLRRELFGC     +    L
Sbjct: 333 SEAADLKVIHWNSPKKLRVKNKHVAYFRNLYLTFLEYDGNLLRRELFGCPSPPPSGAKQL 392

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           QQA L++L+EED C++ R+ +L   R H+ FL +        +DVTLVAQLSMDRL    
Sbjct: 393 QQA-LAHLDEEDACFEFRQQQLMVHRVHVTFLPH-VPPPGRPHDVTLVAQLSMDRL---- 446

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                        QM+E L +HW GP+SL LY++DAEAQQFL +
Sbjct: 447 -----------------------------QMLEALCRHWPGPMSLALYLTDAEAQQFLRF 477

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
              S  L +R+++ YHVVY+EG  YP+N LRNVAL Q  TPYVFL DIDFLP F LY YL
Sbjct: 478 VEASAVLSARQDVAYHVVYREGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAFSLYDYL 537

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           ++SI  + + G   K  LVVPAFET  YR +FP+S                        H
Sbjct: 538 RASIEQLQL-GSEQKAALVVPAFETLHYRFSFPSSKAELLALLDSGALYTFRFHEWPRGH 596

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           A T++ RW  A TPY+++WA D+EPY+V  RD PRYD RFVGFGWNK
Sbjct: 597 AATDYVRWRQARTPYRVQWAADYEPYVVVPRDCPRYDPRFVGFGWNK 643



 Score =  260 bits (664), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 127/218 (58%), Positives = 156/218 (71%), Gaps = 5/218 (2%)

Query: 6   EPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFST 65
            P +P CE++ VA+VCAG+NS+R LVTL+KS+LFYRKNPLHFHL+TD  A NIL+ LF T
Sbjct: 91  RPPLPKCELLHVAVVCAGHNSSRDLVTLVKSVLFYRKNPLHFHLVTDATARNILEMLFHT 150

Query: 66  WSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFA 125
           W VP V VSFY A+ +   +SWIPNKHYSG+YGL+KL LP VLP  LA+ IVLDTDV FA
Sbjct: 151 WMVPAVRVSFYDAEELKPQISWIPNKHYSGLYGLMKLVLPGVLPPDLARVIVLDTDVTFA 210

Query: 126 TDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFYLADSVVEDVSWI 185
           +DIA+LWALF+     Q IGLVENQSDWYLG LW+NH+PWPALGRGF     ++    W+
Sbjct: 211 SDIAELWALFAHFSDEQVIGLVENQSDWYLGNLWRNHRPWPALGRGFNTGVILL----WL 266

Query: 186 PNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 223
                +G   + KLT  + L  TL  T + D D+  A 
Sbjct: 267 DRLRQAGWEQMWKLTAGQEL-LTLPSTSLADQDIFNAV 303


>gi|348558768|ref|XP_003465188.1| PREDICTED: glycosyltransferase-like protein LARGE2-like [Cavia
           porcellus]
          Length = 695

 Score =  570 bits (1470), Expect = e-160,   Method: Compositional matrix adjust.
 Identities = 297/529 (56%), Positives = 353/529 (66%), Gaps = 69/529 (13%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PALG  FY AD +   +SWIPNKHYSG+YGL+KL LP VLP  LA+ IVLDTDV  A+DI
Sbjct: 126 PALGVSFYDADQLKPHISWIPNKHYSGLYGLMKLVLPGVLPPGLAQVIVLDTDVTLASDI 185

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LWALF+    +Q IGLVENQSDWYLG LWKNH+PWPALGRGFNTGVILL L +LR + 
Sbjct: 186 AELWALFAYFSDKQVIGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILLWLDRLRQVG 245

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   W++ A + LLT+  TSLADQDIFNA+I EHP LV  LPC WNVQLSD+T + E CY
Sbjct: 246 WEQMWQLTARRELLTQPATSLADQDIFNAVIKEHPGLVQPLPCVWNVQLSDHTLA-ERCY 304

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQTT----L 401
            E  DLK+IHWNSPKKL+VK KH EFFRNLYLTFL YDGNLLRRELFGC          L
Sbjct: 305 LEAADLKVIHWNSPKKLRVKIKHAEFFRNLYLTFLGYDGNLLRRELFGCPSQSPAGAEQL 364

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           QQ+ L+ L+EED C++ R+ +LT  R H+ FL +E       +DVTLVAQLSMDR     
Sbjct: 365 QQS-LAQLDEEDACFEFRQQQLTVHRVHITFLPHEPPPPRP-HDVTLVAQLSMDR----- 417

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                       LQM+E L +HW GP+SL LY++DAEAQQFL +
Sbjct: 418 ----------------------------LQMLEALCRHWLGPMSLALYLTDAEAQQFLRF 449

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
              S  L +R+++ YHVVY+EG  YP+N LRNVAL Q  TPYVFL DIDFLP + LY YL
Sbjct: 450 VEASPVLSARQDVAYHVVYREGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYDYL 509

Query: 582 KSSIRSMDMHGHGG--KKVLVVPAFETQRYRTAFPAS----------------------- 616
           ++SI  +   G GG  K  LVVPAFET RYR +FP S                       
Sbjct: 510 RASIEQL---GLGGLRKAALVVPAFETPRYRFSFPKSKAELLALLDAGALYTFRYHEWPQ 566

Query: 617 -HAPTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
            HA T+++RW  A  PY+++WA D+EPY+V  RD P YD RFVGFGWNK
Sbjct: 567 GHAATDYARWREAQAPYRVQWAADYEPYVVVPRDCPHYDPRFVGFGWNK 615



 Score =  249 bits (636), Expect = 3e-63,   Method: Compositional matrix adjust.
 Identities = 113/163 (69%), Positives = 134/163 (82%)

Query: 10  PTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVP 69
           P C+++ VAIVCAG NS+R +VTL+KS+LFYRKNPLH HL+TD VA +IL+TLF TW VP
Sbjct: 67  PRCQLLHVAIVCAGLNSSREVVTLVKSMLFYRKNPLHLHLVTDAVAQSILETLFRTWMVP 126

Query: 70  QVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIA 129
            + VSFY AD +   +SWIPNKHYSG+YGL+KL LP VLP  LA+ IVLDTDV  A+DIA
Sbjct: 127 ALGVSFYDADQLKPHISWIPNKHYSGLYGLMKLVLPGVLPPGLAQVIVLDTDVTLASDIA 186

Query: 130 QLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           +LWALF+    +Q IGLVENQSDWYLG LWKNH+PWPALGRGF
Sbjct: 187 ELWALFAYFSDKQVIGLVENQSDWYLGNLWKNHRPWPALGRGF 229


>gi|351697353|gb|EHB00272.1| Glycosyltransferase-like protein LARGE2, partial [Heterocephalus
           glaber]
          Length = 630

 Score =  570 bits (1468), Expect = e-159,   Method: Compositional matrix adjust.
 Identities = 297/528 (56%), Positives = 351/528 (66%), Gaps = 67/528 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PALG  FY AD +   +SWIPNKHYSG+YGL+KL L  VLP  LA+ IVLDTDV FA DI
Sbjct: 62  PALGVSFYDADQLKPHISWIPNKHYSGLYGLMKLVLLSVLPPGLAQVIVLDTDVTFAADI 121

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LWALF+    +Q IGLVENQSDWYLG LWKNH+PWPALGRGFNTGVILL L +LR   
Sbjct: 122 AELWALFADFSDKQVIGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILLWLDRLRHEG 181

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   W++ A + LLT+  TSLADQDIFNA+I EHP LV +LPC WNVQLSD+T + E CY
Sbjct: 182 WEQMWQLTARRELLTQPATSLADQDIFNAVIKEHPGLVQSLPCVWNVQLSDHTLA-ERCY 240

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQTTLQ--- 402
            E  DLK+IHWNSPKKL+VKNK  EFFRNLYLTFL YDGNLLRRELFGC     T     
Sbjct: 241 LEAADLKVIHWNSPKKLQVKNKDAEFFRNLYLTFLGYDGNLLRRELFGCPSQPPTGAEEL 300

Query: 403 QAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEI 462
           Q  L+ L+EED C++ R+ +LT  R H+ FL ++       +DVTLVAQLSMDR      
Sbjct: 301 QHSLAQLDEEDTCFEFRQQQLTMHRVHITFLPHKPPPPQP-HDVTLVAQLSMDR------ 353

Query: 463 NPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYA 522
                                      LQM+E L +HW GP+SL LY++DAEAQQFL + 
Sbjct: 354 ---------------------------LQMLEALCRHWPGPMSLALYLTDAEAQQFLHFV 386

Query: 523 GNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLK 582
             S  L +R+++ YHVVY+EG  YP+N LRNVAL Q  TPYVFL DIDFLP   LY YL+
Sbjct: 387 EASPVLSARQDVAYHVVYREGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAHSLYDYLR 446

Query: 583 SSIRSMDMHGHGG--KKVLVVPAFETQRYRTAFPAS------------------------ 616
           +SI  +   G GG  K  LVVPAFET RYR +FP S                        
Sbjct: 447 ASIEQL---GLGGLRKAALVVPAFETLRYRFSFPRSKAELLDLLDAGALYTFRYHEWPWG 503

Query: 617 HAPTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           HAPT+++RW  A  PY+++WA D+EPY+V  RD PRYD RFVGFGWNK
Sbjct: 504 HAPTDYARWRAAQAPYRVQWAADYEPYVVVPRDCPRYDPRFVGFGWNK 551



 Score =  239 bits (609), Expect = 5e-60,   Method: Compositional matrix adjust.
 Identities = 110/160 (68%), Positives = 130/160 (81%)

Query: 13  EVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQVE 72
           +++ VAIVCAG NS+R +VTL+KS+LFYRKNPLH HL+TD VA +IL+ LF TW VP + 
Sbjct: 6   QLLHVAIVCAGLNSSREVVTLVKSMLFYRKNPLHLHLVTDAVARSILEMLFRTWMVPALG 65

Query: 73  VSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLW 132
           VSFY AD +   +SWIPNKHYSG+YGL+KL L  VLP  LA+ IVLDTDV FA DIA+LW
Sbjct: 66  VSFYDADQLKPHISWIPNKHYSGLYGLMKLVLLSVLPPGLAQVIVLDTDVTFAADIAELW 125

Query: 133 ALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           ALF+    +Q IGLVENQSDWYLG LWKNH+PWPALGRGF
Sbjct: 126 ALFADFSDKQVIGLVENQSDWYLGNLWKNHRPWPALGRGF 165


>gi|344247816|gb|EGW03920.1| Glycosyltransferase-like protein LARGE2 [Cricetulus griseus]
          Length = 692

 Score =  569 bits (1467), Expect = e-159,   Method: Compositional matrix adjust.
 Identities = 288/529 (54%), Positives = 357/529 (67%), Gaps = 67/529 (12%)

Query: 166 PALGRGFYLADSVVED--VSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 223
           PA+   FY A+ +     +SWIPNKHYSG+YGL+KL LP +LP +LA+ IVLDTDV F++
Sbjct: 122 PAVVIRFYDAEELKAGPLISWIPNKHYSGLYGLMKLVLPSILPLSLARVIVLDTDVTFSS 181

Query: 224 DIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRD 283
           DIA+LWALF+    +Q +GLVENQSDWYLG LWKNH+PWPALGRGFNTGVILL L +LR 
Sbjct: 182 DIAELWALFAHFSDKQVVGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILLWLDRLRQ 241

Query: 284 ISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDEL 343
             W   W++IA++ LLT + TSLADQD+FNA+I EHP LV+ LPC WNVQLSD+T + E 
Sbjct: 242 TGWEQMWKLIAKQELLTLMATSLADQDVFNAVIKEHPELVHPLPCVWNVQLSDHTLA-ER 300

Query: 344 CYTELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQT 399
           CY E  DLK+IHWNSPKKL+VKNKH EFFRNL+LTFL YDG LLRRELFGC         
Sbjct: 301 CYLEAADLKVIHWNSPKKLRVKNKHAEFFRNLHLTFLGYDGKLLRRELFGCPNQLPPGAE 360

Query: 400 TLQQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQM 459
            LQQA L+ L+EE+PC++ R+ +L+  R H+ FL ++       +DVTLVAQLSMDR   
Sbjct: 361 QLQQA-LAQLDEEEPCFEFRQQQLSVHRVHITFLPHQPPPPR-PHDVTLVAQLSMDR--- 415

Query: 460 VEINPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFL 519
                                         LQM+E L +HW GP+SL LY++DAEAQQFL
Sbjct: 416 ------------------------------LQMLEALCRHWPGPMSLALYLTDAEAQQFL 445

Query: 520 SYAGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYP 579
            +   S  L +R+++ YH+VY+EG  YP+N LRNVAL Q  TPYVFL DIDFLP + LY 
Sbjct: 446 HFVETSPVLSARKDVAYHIVYREGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYD 505

Query: 580 YLKSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS----------------------- 616
           YL++SI  +++ G   K  LVVPAFET  YR +FP S                       
Sbjct: 506 YLRASIEQLEL-GSQRKAALVVPAFETLHYRLSFPNSKAELLTLLDAGSLYTFRYHEWPQ 564

Query: 617 -HAPTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
            HAPT+++RW  A  PY ++WA D+EPY+V  RD PRYD RFVGFGWNK
Sbjct: 565 GHAPTDYARWREAKAPYHVQWAADYEPYVVVPRDCPRYDPRFVGFGWNK 613



 Score =  249 bits (636), Expect = 3e-63,   Method: Compositional matrix adjust.
 Identities = 122/216 (56%), Positives = 155/216 (71%), Gaps = 7/216 (3%)

Query: 10  PTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVP 69
           P CE++ VAIVCAGYNS+R ++TL+KS+LFYRKNPLH HL+TD +A NIL+TLF TW VP
Sbjct: 63  PKCELLHVAIVCAGYNSSREVITLVKSLLFYRKNPLHLHLVTDAIARNILETLFRTWMVP 122

Query: 70  QVEVSFYLADSVVED--VSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATD 127
            V + FY A+ +     +SWIPNKHYSG+YGL+KL LP +LP +LA+ IVLDTDV F++D
Sbjct: 123 AVVIRFYDAEELKAGPLISWIPNKHYSGLYGLMKLVLPSILPLSLARVIVLDTDVTFSSD 182

Query: 128 IAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFYLADSVVEDVSWIPN 187
           IA+LWALF+    +Q +GLVENQSDWYLG LWKNH+PWPALGRGF     ++    W+  
Sbjct: 183 IAELWALFAHFSDKQVVGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILL----WLDR 238

Query: 188 KHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 223
              +G   + KL   + L  TL  T + D DV  A 
Sbjct: 239 LRQTGWEQMWKLIAKQEL-LTLMATSLADQDVFNAV 273


>gi|329664130|ref|NP_001193121.1| glycosyltransferase-like protein LARGE2 precursor [Bos taurus]
 gi|296479677|tpg|DAA21792.1| TPA: glycosyltransferase-like 1B [Bos taurus]
          Length = 726

 Score =  567 bits (1461), Expect = e-159,   Method: Compositional matrix adjust.
 Identities = 292/527 (55%), Positives = 354/527 (67%), Gaps = 65/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY AD +   VSWIPNKHYSG+YGL+KL LP  LP  LA+ IVLDTDV FA++I
Sbjct: 158 PAVQISFYNADELKPQVSWIPNKHYSGLYGLMKLVLPSALPPDLARVIVLDTDVTFASNI 217

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LWALF+    +Q IGLVENQSDWYLG LW+NH+PWPALGRGFNTGVILL L +LR   
Sbjct: 218 AELWALFAHFSDKQAIGLVENQSDWYLGNLWRNHRPWPALGRGFNTGVILLRLDRLRQAG 277

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   W++ A + LLT   TSLADQDIFNA+I EHP+LV  LPC WNVQLSD+T + E CY
Sbjct: 278 WEHMWKLTATRELLTLPATSLADQDIFNAVIKEHPWLVQPLPCMWNVQLSDHTLA-ERCY 336

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQTT----L 401
           +E  DLK+IHWNSPKKL VKNKH+E FRNLYLTFLEYDGNLLRRELFGC          L
Sbjct: 337 SEAPDLKVIHWNSPKKLHVKNKHVESFRNLYLTFLEYDGNLLRRELFGCPCPPPPGAEQL 396

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           QQA L+ L E+D C++ R+ +LT  R H+ FL ++       +DVTLVAQLSM+RL    
Sbjct: 397 QQA-LAQLEEQDACFEFRQQQLTVHRVHITFLPHKPPPPQ-PHDVTLVAQLSMERL---- 450

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                        QM+E L +HW GP+SL LY++D EAQQFL +
Sbjct: 451 -----------------------------QMLEALCRHWPGPMSLALYLTDTEAQQFLRF 481

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
              S  L +R+++ YHVVY+EG+ YPIN LRNVAL Q  TPYVFL DIDFLP + LY YL
Sbjct: 482 VETSAVLSARQDVAYHVVYREGSLYPINQLRNVALAQALTPYVFLSDIDFLPAYSLYDYL 541

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           ++SI  +++ G   K  LVVPAFET  YR +FP+S                        H
Sbjct: 542 RASIEQLELDGR-RKAALVVPAFETLHYRFSFPSSKAELLTLLDAGSLYTFRYHEWPQGH 600

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           APT+++RW  A  PY+++WA D+EPY+V  RD PRYD RFVGFGWNK
Sbjct: 601 APTDYARWREAQAPYRVQWAADYEPYVVVPRDCPRYDPRFVGFGWNK 647



 Score =  255 bits (652), Expect = 5e-65,   Method: Compositional matrix adjust.
 Identities = 113/161 (70%), Positives = 136/161 (84%)

Query: 12  CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQV 71
           CE++QVAIVCAG+NS+R ++TL+KS+LFYRKNPLH HL+TD VA NIL+TLF TW VP V
Sbjct: 101 CELLQVAIVCAGHNSSRDVITLVKSLLFYRKNPLHLHLVTDAVARNILETLFHTWMVPAV 160

Query: 72  EVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQL 131
           ++SFY AD +   VSWIPNKHYSG+YGL+KL LP  LP  LA+ IVLDTDV FA++IA+L
Sbjct: 161 QISFYNADELKPQVSWIPNKHYSGLYGLMKLVLPSALPPDLARVIVLDTDVTFASNIAEL 220

Query: 132 WALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           WALF+    +Q IGLVENQSDWYLG LW+NH+PWPALGRGF
Sbjct: 221 WALFAHFSDKQAIGLVENQSDWYLGNLWRNHRPWPALGRGF 261


>gi|148695646|gb|EDL27593.1| glycosyltransferase-like 1B, isoform CRA_a [Mus musculus]
          Length = 629

 Score =  564 bits (1453), Expect = e-158,   Method: Compositional matrix adjust.
 Identities = 285/527 (54%), Positives = 352/527 (66%), Gaps = 65/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY A+ +   VSWIPNKHYSG+YGL+KL LP +LP +LA+ IVLDTDV F++DI
Sbjct: 61  PAVVVSFYDAEELKPLVSWIPNKHYSGLYGLMKLVLPSILPPSLARVIVLDTDVTFSSDI 120

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
            +LWALF     +Q +GLVENQSDWYLG LWKNH+PWPALGRGFNTGVILL L +L+   
Sbjct: 121 VELWALFDHFSDKQVVGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILLWLDRLQQTG 180

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   W++ A++ LLT + TSLADQDIFNA+I EHP+LV+ LPC WNVQLSD+TR+ E CY
Sbjct: 181 WEQMWKVTAKRELLTLMATSLADQDIFNAVIKEHPHLVHPLPCVWNVQLSDHTRA-ERCY 239

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
            E  DLK+IHWNSPKKL+VKNKH EFFRNL+LTFL YDG LLRRELFGC          L
Sbjct: 240 LEAADLKVIHWNSPKKLRVKNKHAEFFRNLHLTFLGYDGKLLRRELFGCPNQFPPGAEQL 299

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           QQA L+ L+EE+PC++ R+ +LT  R H+ FL ++       +DVTLVAQLSMDR     
Sbjct: 300 QQA-LTQLDEEEPCFEFRQQQLTVHRVHITFLPHQPPPPQ-PHDVTLVAQLSMDR----- 352

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                       LQM+E L +HW GP+SL LY++D EAQQFL +
Sbjct: 353 ----------------------------LQMLEALCRHWPGPMSLALYLTDEEAQQFLHF 384

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
              S  L  R+++ YHVVY++G  YP+N LRNVAL Q  TPYVFL DIDFLP + LY YL
Sbjct: 385 VETSPVLSMRKDVAYHVVYRDGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYDYL 444

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           ++SI  +++     K  LVVPAFET  YR +FP S                        H
Sbjct: 445 RASIEQLELDSR-RKTALVVPAFETLHYRFSFPNSKAELLTLLDAGSLHTFRYHEWPQGH 503

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           + T++SRW  A  PY ++W+ D+EPY+V  RD PRYD RFVGFGWNK
Sbjct: 504 SSTDYSRWREAQAPYSVQWSADYEPYVVVPRDCPRYDPRFVGFGWNK 550



 Score =  206 bits (524), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 104/183 (56%), Positives = 127/183 (69%), Gaps = 5/183 (2%)

Query: 41  RKNPLHFHLITDTVALNILQTLFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLL 100
           RKNPLH HLITD VA NIL+TLF TW VP V VSFY A+ +   VSWIPNKHYSG+YGL+
Sbjct: 33  RKNPLHLHLITDAVARNILETLFRTWMVPAVVVSFYDAEELKPLVSWIPNKHYSGLYGLM 92

Query: 101 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 160
           KL LP +LP +LA+ IVLDTDV F++DI +LWALF     +Q +GLVENQSDWYLG LWK
Sbjct: 93  KLVLPSILPPSLARVIVLDTDVTFSSDIVELWALFDHFSDKQVVGLVENQSDWYLGNLWK 152

Query: 161 NHKPWPALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVI 220
           NH+PWPALGRGF     ++    W+     +G   + K+T  + L  TL  T + D D+ 
Sbjct: 153 NHRPWPALGRGFNTGVILL----WLDRLQQTGWEQMWKVTAKREL-LTLMATSLADQDIF 207

Query: 221 FAT 223
            A 
Sbjct: 208 NAV 210


>gi|109730755|gb|AAI16716.1| Gyltl1b protein [Mus musculus]
          Length = 691

 Score =  563 bits (1452), Expect = e-158,   Method: Compositional matrix adjust.
 Identities = 285/527 (54%), Positives = 352/527 (66%), Gaps = 65/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY A+ +   VSWIPNKHYSG+YGL+KL LP +LP +LA+ IVLDTDV F++DI
Sbjct: 123 PAVVVSFYDAEELKPLVSWIPNKHYSGLYGLMKLVLPSILPPSLARVIVLDTDVTFSSDI 182

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
            +LWALF     +Q +GLVENQSDWYLG LWKNH+PWPALGRGFNTGVILL L +L+   
Sbjct: 183 VELWALFDHFSDKQVVGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILLWLDRLQQTG 242

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   W++ A++ LLT + TSLADQDIFNA+I EHP+LV+ LPC WNVQLSD+TR+ E CY
Sbjct: 243 WEQMWKVTAKRELLTLMATSLADQDIFNAVIKEHPHLVHPLPCVWNVQLSDHTRA-ERCY 301

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
            E  DLK+IHWNSPKKL+VKNKH EFFRNL+LTFL YDG LLRRELFGC          L
Sbjct: 302 LEAADLKVIHWNSPKKLRVKNKHAEFFRNLHLTFLGYDGKLLRRELFGCPNQFPPGAEQL 361

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           QQA L+ L+EE+PC++ R+ +LT  R H+ FL ++       +DVTLVAQLSMDR     
Sbjct: 362 QQA-LTQLDEEEPCFEFRQQQLTVHRVHITFLPHQPPPPQ-PHDVTLVAQLSMDR----- 414

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                       LQM+E L +HW GP+SL LY++D EAQQFL +
Sbjct: 415 ----------------------------LQMLEALCRHWPGPMSLALYLTDEEAQQFLHF 446

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
              S  L  R+++ YHVVY++G  YP+N LRNVAL Q  TPYVFL DIDFLP + LY YL
Sbjct: 447 VETSPVLSMRKDVAYHVVYRDGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYDYL 506

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           ++SI  +++     K  LVVPAFET  YR +FP S                        H
Sbjct: 507 RASIEQLELDSR-RKTALVVPAFETLHYRFSFPNSKAELLTLLDAGSLHTFRYHEWPQGH 565

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           + T++SRW  A  PY ++W+ D+EPY+V  RD PRYD RFVGFGWNK
Sbjct: 566 SSTDYSRWREAQAPYSVQWSADYEPYVVVPRDCPRYDPRFVGFGWNK 612



 Score =  252 bits (643), Expect = 6e-64,   Method: Compositional matrix adjust.
 Identities = 128/223 (57%), Positives = 158/223 (70%), Gaps = 6/223 (2%)

Query: 2   SSDEEPVIPTCEV-IQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQ 60
           SS +  + P CEV + VAIVCAGYNS+R ++TL KS+LFYRKNPLH HLITD VA NIL+
Sbjct: 55  SSPQLLLPPKCEVMLHVAIVCAGYNSSREIITLTKSLLFYRKNPLHLHLITDAVARNILE 114

Query: 61  TLFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDT 120
           TLF TW VP V VSFY A+ +   VSWIPNKHYSG+YGL+KL LP +LP +LA+ IVLDT
Sbjct: 115 TLFRTWMVPAVVVSFYDAEELKPLVSWIPNKHYSGLYGLMKLVLPSILPPSLARVIVLDT 174

Query: 121 DVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFYLADSVVE 180
           DV F++DI +LWALF     +Q +GLVENQSDWYLG LWKNH+PWPALGRGF     ++ 
Sbjct: 175 DVTFSSDIVELWALFDHFSDKQVVGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILL- 233

Query: 181 DVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 223
              W+     +G   + K+T  + L  TL  T + D D+  A 
Sbjct: 234 ---WLDRLQQTGWEQMWKVTAKREL-LTLMATSLADQDIFNAV 272


>gi|262231828|ref|NP_766258.2| glycosyltransferase-like protein LARGE2 isoform 1 precursor [Mus
           musculus]
 gi|81910390|sp|Q5XPT3.1|LARG2_MOUSE RecName: Full=Glycosyltransferase-like protein LARGE2; AltName:
           Full=Glycosyltransferase-like 1B
 gi|53830728|gb|AAU95213.1| glycosyltransferase-like 1B splice form 1 [Mus musculus]
 gi|66794531|gb|AAH96655.1| Gyltl1b protein [Mus musculus]
 gi|148695649|gb|EDL27596.1| glycosyltransferase-like 1B, isoform CRA_d [Mus musculus]
          Length = 690

 Score =  563 bits (1452), Expect = e-158,   Method: Compositional matrix adjust.
 Identities = 285/527 (54%), Positives = 352/527 (66%), Gaps = 65/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY A+ +   VSWIPNKHYSG+YGL+KL LP +LP +LA+ IVLDTDV F++DI
Sbjct: 122 PAVVVSFYDAEELKPLVSWIPNKHYSGLYGLMKLVLPSILPPSLARVIVLDTDVTFSSDI 181

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
            +LWALF     +Q +GLVENQSDWYLG LWKNH+PWPALGRGFNTGVILL L +L+   
Sbjct: 182 VELWALFDHFSDKQVVGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILLWLDRLQQTG 241

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   W++ A++ LLT + TSLADQDIFNA+I EHP+LV+ LPC WNVQLSD+TR+ E CY
Sbjct: 242 WEQMWKVTAKRELLTLMATSLADQDIFNAVIKEHPHLVHPLPCVWNVQLSDHTRA-ERCY 300

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
            E  DLK+IHWNSPKKL+VKNKH EFFRNL+LTFL YDG LLRRELFGC          L
Sbjct: 301 LEAADLKVIHWNSPKKLRVKNKHAEFFRNLHLTFLGYDGKLLRRELFGCPNQFPPGAEQL 360

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           QQA L+ L+EE+PC++ R+ +LT  R H+ FL ++       +DVTLVAQLSMDR     
Sbjct: 361 QQA-LTQLDEEEPCFEFRQQQLTVHRVHITFLPHQPPPPQ-PHDVTLVAQLSMDR----- 413

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                       LQM+E L +HW GP+SL LY++D EAQQFL +
Sbjct: 414 ----------------------------LQMLEALCRHWPGPMSLALYLTDEEAQQFLHF 445

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
              S  L  R+++ YHVVY++G  YP+N LRNVAL Q  TPYVFL DIDFLP + LY YL
Sbjct: 446 VETSPVLSMRKDVAYHVVYRDGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYDYL 505

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           ++SI  +++     K  LVVPAFET  YR +FP S                        H
Sbjct: 506 RASIEQLELDSR-RKTALVVPAFETLHYRFSFPNSKAELLTLLDAGSLHTFRYHEWPQGH 564

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           + T++SRW  A  PY ++W+ D+EPY+V  RD PRYD RFVGFGWNK
Sbjct: 565 SSTDYSRWREAQAPYSVQWSADYEPYVVVPRDCPRYDPRFVGFGWNK 611



 Score =  255 bits (652), Expect = 5e-65,   Method: Compositional matrix adjust.
 Identities = 127/222 (57%), Positives = 158/222 (71%), Gaps = 5/222 (2%)

Query: 2   SSDEEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQT 61
           SS +  + P CE++ VAIVCAGYNS+R ++TL KS+LFYRKNPLH HLITD VA NIL+T
Sbjct: 55  SSPQLLLPPKCEMLHVAIVCAGYNSSREIITLTKSLLFYRKNPLHLHLITDAVARNILET 114

Query: 62  LFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTD 121
           LF TW VP V VSFY A+ +   VSWIPNKHYSG+YGL+KL LP +LP +LA+ IVLDTD
Sbjct: 115 LFRTWMVPAVVVSFYDAEELKPLVSWIPNKHYSGLYGLMKLVLPSILPPSLARVIVLDTD 174

Query: 122 VIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFYLADSVVED 181
           V F++DI +LWALF     +Q +GLVENQSDWYLG LWKNH+PWPALGRGF     ++  
Sbjct: 175 VTFSSDIVELWALFDHFSDKQVVGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILL-- 232

Query: 182 VSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 223
             W+     +G   + K+T  + L  TL  T + D D+  A 
Sbjct: 233 --WLDRLQQTGWEQMWKVTAKREL-LTLMATSLADQDIFNAV 271


>gi|148695648|gb|EDL27595.1| glycosyltransferase-like 1B, isoform CRA_c [Mus musculus]
          Length = 642

 Score =  563 bits (1451), Expect = e-158,   Method: Compositional matrix adjust.
 Identities = 285/527 (54%), Positives = 352/527 (66%), Gaps = 65/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY A+ +   VSWIPNKHYSG+YGL+KL LP +LP +LA+ IVLDTDV F++DI
Sbjct: 74  PAVVVSFYDAEELKPLVSWIPNKHYSGLYGLMKLVLPSILPPSLARVIVLDTDVTFSSDI 133

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
            +LWALF     +Q +GLVENQSDWYLG LWKNH+PWPALGRGFNTGVILL L +L+   
Sbjct: 134 VELWALFDHFSDKQVVGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILLWLDRLQQTG 193

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   W++ A++ LLT + TSLADQDIFNA+I EHP+LV+ LPC WNVQLSD+TR+ E CY
Sbjct: 194 WEQMWKVTAKRELLTLMATSLADQDIFNAVIKEHPHLVHPLPCVWNVQLSDHTRA-ERCY 252

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
            E  DLK+IHWNSPKKL+VKNKH EFFRNL+LTFL YDG LLRRELFGC          L
Sbjct: 253 LEAADLKVIHWNSPKKLRVKNKHAEFFRNLHLTFLGYDGKLLRRELFGCPNQFPPGAEQL 312

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           QQA L+ L+EE+PC++ R+ +LT  R H+ FL ++       +DVTLVAQLSMDR     
Sbjct: 313 QQA-LTQLDEEEPCFEFRQQQLTVHRVHITFLPHQPPPPQ-PHDVTLVAQLSMDR----- 365

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                       LQM+E L +HW GP+SL LY++D EAQQFL +
Sbjct: 366 ----------------------------LQMLEALCRHWPGPMSLALYLTDEEAQQFLHF 397

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
              S  L  R+++ YHVVY++G  YP+N LRNVAL Q  TPYVFL DIDFLP + LY YL
Sbjct: 398 VETSPVLSMRKDVAYHVVYRDGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYDYL 457

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           ++SI  +++     K  LVVPAFET  YR +FP S                        H
Sbjct: 458 RASIEQLELDSR-RKTALVVPAFETLHYRFSFPNSKAELLTLLDAGSLHTFRYHEWPQGH 516

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           + T++SRW  A  PY ++W+ D+EPY+V  RD PRYD RFVGFGWNK
Sbjct: 517 SSTDYSRWREAQAPYSVQWSADYEPYVVVPRDCPRYDPRFVGFGWNK 563



 Score =  251 bits (641), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 123/212 (58%), Positives = 153/212 (72%), Gaps = 5/212 (2%)

Query: 12  CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQV 71
           C+++ VAIVCAGYNS+R ++TL KS+LFYRKNPLH HLITD VA NIL+TLF TW VP V
Sbjct: 17  CQMLHVAIVCAGYNSSREIITLTKSLLFYRKNPLHLHLITDAVARNILETLFRTWMVPAV 76

Query: 72  EVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQL 131
            VSFY A+ +   VSWIPNKHYSG+YGL+KL LP +LP +LA+ IVLDTDV F++DI +L
Sbjct: 77  VVSFYDAEELKPLVSWIPNKHYSGLYGLMKLVLPSILPPSLARVIVLDTDVTFSSDIVEL 136

Query: 132 WALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFYLADSVVEDVSWIPNKHYS 191
           WALF     +Q +GLVENQSDWYLG LWKNH+PWPALGRGF     ++    W+     +
Sbjct: 137 WALFDHFSDKQVVGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILL----WLDRLQQT 192

Query: 192 GVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 223
           G   + K+T  + L  TL  T + D D+  A 
Sbjct: 193 GWEQMWKVTAKREL-LTLMATSLADQDIFNAV 223


>gi|26346524|dbj|BAC36913.1| unnamed protein product [Mus musculus]
          Length = 629

 Score =  561 bits (1447), Expect = e-157,   Method: Compositional matrix adjust.
 Identities = 284/527 (53%), Positives = 351/527 (66%), Gaps = 65/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY A+ +   VSWIPN HYSG+YGL+KL LP +LP +LA+ IVLDTDV F++DI
Sbjct: 61  PAVVVSFYDAEELKPLVSWIPNNHYSGLYGLMKLVLPSILPPSLARVIVLDTDVTFSSDI 120

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
            +LWALF     +Q +GLVENQSDWYLG LWKNH+PWPALGRGFNTGVILL L +L+   
Sbjct: 121 VELWALFDHFSDKQVVGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILLWLDRLQQTG 180

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   W++ A++ LLT + TSLADQDIFNA+I EHP+LV+ LPC WNVQLSD+TR+ E CY
Sbjct: 181 WEQMWKVTAKRELLTLMATSLADQDIFNAVIKEHPHLVHPLPCVWNVQLSDHTRA-ERCY 239

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
            E  DLK+IHWNSPKKL+VKNKH EFFRNL+LTFL YDG LLRRELFGC          L
Sbjct: 240 LEAADLKVIHWNSPKKLRVKNKHAEFFRNLHLTFLGYDGKLLRRELFGCPNQFPPGAEQL 299

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           QQA L+ L+EE+PC++ R+ +LT  R H+ FL ++       +DVTLVAQLSMDR     
Sbjct: 300 QQA-LTQLDEEEPCFEFRQQQLTVHRVHITFLPHQPPPPQ-PHDVTLVAQLSMDR----- 352

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                       LQM+E L +HW GP+SL LY++D EAQQFL +
Sbjct: 353 ----------------------------LQMLEALCRHWPGPMSLALYLTDEEAQQFLHF 384

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
              S  L  R+++ YHVVY++G  YP+N LRNVAL Q  TPYVFL DIDFLP + LY YL
Sbjct: 385 VETSPVLSMRKDVAYHVVYRDGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYDYL 444

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           ++SI  +++     K  LVVPAFET  YR +FP S                        H
Sbjct: 445 RASIEQLELDSR-RKTALVVPAFETLHYRFSFPNSKAELLTLLDAGSLHTFRYHEWPQGH 503

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           + T++SRW  A  PY ++W+ D+EPY+V  RD PRYD RFVGFGWNK
Sbjct: 504 SSTDYSRWREAQAPYSVQWSADYEPYVVVPRDCPRYDPRFVGFGWNK 550



 Score =  204 bits (519), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 103/183 (56%), Positives = 126/183 (68%), Gaps = 5/183 (2%)

Query: 41  RKNPLHFHLITDTVALNILQTLFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLL 100
           RKNPLH HLITD VA NIL+TLF TW VP V VSFY A+ +   VSWIPN HYSG+YGL+
Sbjct: 33  RKNPLHLHLITDAVARNILETLFRTWMVPAVVVSFYDAEELKPLVSWIPNNHYSGLYGLM 92

Query: 101 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 160
           KL LP +LP +LA+ IVLDTDV F++DI +LWALF     +Q +GLVENQSDWYLG LWK
Sbjct: 93  KLVLPSILPPSLARVIVLDTDVTFSSDIVELWALFDHFSDKQVVGLVENQSDWYLGNLWK 152

Query: 161 NHKPWPALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVI 220
           NH+PWPALGRGF     ++    W+     +G   + K+T  + L  TL  T + D D+ 
Sbjct: 153 NHRPWPALGRGFNTGVILL----WLDRLQQTGWEQMWKVTAKREL-LTLMATSLADQDIF 207

Query: 221 FAT 223
            A 
Sbjct: 208 NAV 210


>gi|40018582|ref|NP_954538.1| glycosyltransferase-like protein LARGE2 [Rattus norvegicus]
 gi|81885334|sp|Q6P7A1.1|LARG2_RAT RecName: Full=Glycosyltransferase-like protein LARGE2; AltName:
           Full=Glycosyltransferase-like 1B
 gi|38197418|gb|AAH61762.1| Glycosyltransferase-like 1B [Rattus norvegicus]
 gi|149022667|gb|EDL79561.1| glycosyltransferase-like 1B, isoform CRA_a [Rattus norvegicus]
 gi|149022668|gb|EDL79562.1| glycosyltransferase-like 1B, isoform CRA_a [Rattus norvegicus]
 gi|149022669|gb|EDL79563.1| glycosyltransferase-like 1B, isoform CRA_a [Rattus norvegicus]
          Length = 690

 Score =  561 bits (1447), Expect = e-157,   Method: Compositional matrix adjust.
 Identities = 284/527 (53%), Positives = 351/527 (66%), Gaps = 64/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY A+ +   VSWIPNKHYSG+YGL+KL LP VLP +LA+ IVLDTDV F++DI
Sbjct: 121 PAVVVSFYDAEELKPLVSWIPNKHYSGLYGLMKLVLPSVLPLSLARVIVLDTDVTFSSDI 180

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
            +LWALF     +Q +GLVENQSDWYLG LWKNH+PWPALGRGFNTGVILL L +L+ I 
Sbjct: 181 MELWALFGHFSDKQVVGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILLWLDRLQQIG 240

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   W++ A++ LLT   TSLADQDIFNA+I EHP LV+ LPC WNVQLSD+T + E CY
Sbjct: 241 WEQMWKLTAKRELLTLTATSLADQDIFNAVIKEHPELVHPLPCVWNVQLSDHTLA-ERCY 299

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
            E  DLK+IHWNSPKKL+VKNKH EFFR+L+LTFL +DG LL RELFGC          L
Sbjct: 300 LEAADLKVIHWNSPKKLRVKNKHAEFFRDLHLTFLGFDGKLLCRELFGCPNQFPPGVEQL 359

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           QQA L+ L+EE+PC++ R+ +LT  R H+ FL ++       +DVTLVAQLSMDR     
Sbjct: 360 QQA-LAQLDEEEPCFEFRQQQLTVHRVHITFLSHQ-PPPPRPHDVTLVAQLSMDR----- 412

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                       LQM+E L +HW GP+SL LY++DAEAQQFL +
Sbjct: 413 ----------------------------LQMLEALCRHWRGPMSLALYLTDAEAQQFLRF 444

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
              S  L +R+++ YHVVY++G  YP+N LRNVAL Q  TPYVFL DIDFLP + LY YL
Sbjct: 445 VETSPVLSARKDVAYHVVYRDGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYDYL 504

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           ++SI  + +     K  LVVPAFET  YR +FP S                        H
Sbjct: 505 RASIEQLALGRRQRKAALVVPAFETLHYRFSFPNSKAELLTLLDAGSLHTFRYHEWPQGH 564

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           A T+++RW  A  PY+++W+ D+EPY+V  RD PRYD RFVGFGWNK
Sbjct: 565 ASTDYTRWREAQAPYRVQWSADYEPYVVVPRDCPRYDPRFVGFGWNK 611



 Score =  256 bits (654), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 127/214 (59%), Positives = 154/214 (71%), Gaps = 5/214 (2%)

Query: 10  PTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVP 69
           P CE++ VAIVCAGYNS+R ++TL+KS+LFYRKNPLH HLITD VA NIL+TLF TW VP
Sbjct: 62  PKCEMLHVAIVCAGYNSSREIITLMKSVLFYRKNPLHLHLITDAVARNILETLFRTWMVP 121

Query: 70  QVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIA 129
            V VSFY A+ +   VSWIPNKHYSG+YGL+KL LP VLP +LA+ IVLDTDV F++DI 
Sbjct: 122 AVVVSFYDAEELKPLVSWIPNKHYSGLYGLMKLVLPSVLPLSLARVIVLDTDVTFSSDIM 181

Query: 130 QLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFYLADSVVEDVSWIPNKH 189
           +LWALF     +Q +GLVENQSDWYLG LWKNH+PWPALGRGF     ++    W+    
Sbjct: 182 ELWALFGHFSDKQVVGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILL----WLDRLQ 237

Query: 190 YSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 223
             G   + KLT  + L  TL  T + D D+  A 
Sbjct: 238 QIGWEQMWKLTAKREL-LTLTATSLADQDIFNAV 270


>gi|149022670|gb|EDL79564.1| glycosyltransferase-like 1B, isoform CRA_b [Rattus norvegicus]
          Length = 621

 Score =  561 bits (1446), Expect = e-157,   Method: Compositional matrix adjust.
 Identities = 284/527 (53%), Positives = 351/527 (66%), Gaps = 64/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY A+ +   VSWIPNKHYSG+YGL+KL LP VLP +LA+ IVLDTDV F++DI
Sbjct: 52  PAVVVSFYDAEELKPLVSWIPNKHYSGLYGLMKLVLPSVLPLSLARVIVLDTDVTFSSDI 111

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
            +LWALF     +Q +GLVENQSDWYLG LWKNH+PWPALGRGFNTGVILL L +L+ I 
Sbjct: 112 MELWALFGHFSDKQVVGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILLWLDRLQQIG 171

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   W++ A++ LLT   TSLADQDIFNA+I EHP LV+ LPC WNVQLSD+T + E CY
Sbjct: 172 WEQMWKLTAKRELLTLTATSLADQDIFNAVIKEHPELVHPLPCVWNVQLSDHTLA-ERCY 230

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
            E  DLK+IHWNSPKKL+VKNKH EFFR+L+LTFL +DG LL RELFGC          L
Sbjct: 231 LEAADLKVIHWNSPKKLRVKNKHAEFFRDLHLTFLGFDGKLLCRELFGCPNQFPPGVEQL 290

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           QQA L+ L+EE+PC++ R+ +LT  R H+ FL ++       +DVTLVAQLSMDR     
Sbjct: 291 QQA-LAQLDEEEPCFEFRQQQLTVHRVHITFLSHQ-PPPPRPHDVTLVAQLSMDR----- 343

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                       LQM+E L +HW GP+SL LY++DAEAQQFL +
Sbjct: 344 ----------------------------LQMLEALCRHWRGPMSLALYLTDAEAQQFLRF 375

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
              S  L +R+++ YHVVY++G  YP+N LRNVAL Q  TPYVFL DIDFLP + LY YL
Sbjct: 376 VETSPVLSARKDVAYHVVYRDGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYDYL 435

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           ++SI  + +     K  LVVPAFET  YR +FP S                        H
Sbjct: 436 RASIEQLALGRRQRKAALVVPAFETLHYRFSFPNSKAELLTLLDAGSLHTFRYHEWPQGH 495

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           A T+++RW  A  PY+++W+ D+EPY+V  RD PRYD RFVGFGWNK
Sbjct: 496 ASTDYTRWREAQAPYRVQWSADYEPYVVVPRDCPRYDPRFVGFGWNK 542



 Score =  206 bits (523), Expect = 4e-50,   Method: Compositional matrix adjust.
 Identities = 106/183 (57%), Positives = 126/183 (68%), Gaps = 5/183 (2%)

Query: 41  RKNPLHFHLITDTVALNILQTLFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLL 100
           RKNPLH HLITD VA NIL+TLF TW VP V VSFY A+ +   VSWIPNKHYSG+YGL+
Sbjct: 24  RKNPLHLHLITDAVARNILETLFRTWMVPAVVVSFYDAEELKPLVSWIPNKHYSGLYGLM 83

Query: 101 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 160
           KL LP VLP +LA+ IVLDTDV F++DI +LWALF     +Q +GLVENQSDWYLG LWK
Sbjct: 84  KLVLPSVLPLSLARVIVLDTDVTFSSDIMELWALFGHFSDKQVVGLVENQSDWYLGNLWK 143

Query: 161 NHKPWPALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVI 220
           NH+PWPALGRGF     ++    W+      G   + KLT  + L  TL  T + D D+ 
Sbjct: 144 NHRPWPALGRGFNTGVILL----WLDRLQQIGWEQMWKLTAKREL-LTLTATSLADQDIF 198

Query: 221 FAT 223
            A 
Sbjct: 199 NAV 201


>gi|298683993|gb|ADI96176.1| LARGE-2 [Mastomys erythroleucus]
          Length = 653

 Score =  560 bits (1443), Expect = e-157,   Method: Compositional matrix adjust.
 Identities = 286/527 (54%), Positives = 351/527 (66%), Gaps = 65/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY A+ +   VSWIPNKHYSG+YGL+KL LP +LP +LA+ IVLDTDV F++DI
Sbjct: 109 PAVVVSFYDAEELKPLVSWIPNKHYSGLYGLMKLVLPSILPLSLARVIVLDTDVTFSSDI 168

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
            +LWALF R   +Q +GLVENQSDWYLG LWKNH+PWPALGRGFNTGVILL L +L+   
Sbjct: 169 VELWALFDRFSDKQVVGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILLWLDRLQQTG 228

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   W++ A++ L T   TSLADQDIFNA++ EHP LV+ LPC WNVQLSD+T + E CY
Sbjct: 229 WEQMWKLTAKRELHTLTATSLADQDIFNAVLKEHPNLVHPLPCVWNVQLSDHTLA-ERCY 287

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
            E  DLK+IHWNSPKKL+VKNKH EFFRNL+LTFL YDG LLRRELFGC          L
Sbjct: 288 LEAADLKVIHWNSPKKLRVKNKHAEFFRNLHLTFLGYDGKLLRRELFGCPNQFPPGAEQL 347

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           QQA L+ L+EE+PC++ R+ +LT  R H+ FL ++        DVTLVAQLSMDR     
Sbjct: 348 QQA-LAQLDEEEPCFEFRQQQLTVHRVHITFLPHQPPPPR-PYDVTLVAQLSMDR----- 400

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                       LQM+E L +HW GP+SL LY++DAEAQQFL +
Sbjct: 401 ----------------------------LQMLEALCRHWPGPMSLALYLTDAEAQQFLHF 432

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
              S  L +R+++ YHVVY++G  YP+N LRNVAL Q  TPYVFL DIDFLP + LY YL
Sbjct: 433 VETSPVLSARKDVAYHVVYQDGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYDYL 492

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           ++SI  +++ G   K  LVVPAFET  YR +FP S                        H
Sbjct: 493 RASIEQLEL-GSRRKAALVVPAFETLHYRFSFPNSKAELLTLLDAGSLYTFRYHEWPQGH 551

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
             TN+SRW  A  PY+++W+ D+EPY+V  RD PRYD RFVGFGWNK
Sbjct: 552 LSTNYSRWREAQAPYRVQWSADYEPYVVVPRDCPRYDPRFVGFGWNK 598



 Score =  258 bits (658), Expect = 9e-66,   Method: Compositional matrix adjust.
 Identities = 127/223 (56%), Positives = 162/223 (72%), Gaps = 6/223 (2%)

Query: 2   SSDEEPVIPT-CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQ 60
           +S  + ++P+ CE++ VAIVCAGYNS+R ++TL+KS+LFYRKNPLH HLITD +A NIL+
Sbjct: 41  ASSSQLLLPSKCEMLHVAIVCAGYNSSREIITLMKSLLFYRKNPLHLHLITDAIAKNILE 100

Query: 61  TLFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDT 120
           TLF TW VP V VSFY A+ +   VSWIPNKHYSG+YGL+KL LP +LP +LA+ IVLDT
Sbjct: 101 TLFRTWMVPAVVVSFYDAEELKPLVSWIPNKHYSGLYGLMKLVLPSILPLSLARVIVLDT 160

Query: 121 DVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFYLADSVVE 180
           DV F++DI +LWALF R   +Q +GLVENQSDWYLG LWKNH+PWPALGRGF     ++ 
Sbjct: 161 DVTFSSDIVELWALFDRFSDKQVVGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILL- 219

Query: 181 DVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 223
              W+     +G   + KLT  + L  TL  T + D D+  A 
Sbjct: 220 ---WLDRLQQTGWEQMWKLTAKREL-HTLTATSLADQDIFNAV 258


>gi|298684007|gb|ADI96183.1| LARGE-2 [Rattus losea]
          Length = 654

 Score =  560 bits (1442), Expect = e-156,   Method: Compositional matrix adjust.
 Identities = 283/527 (53%), Positives = 350/527 (66%), Gaps = 64/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY A+ +   VSWIPNKHYSG+YGL+KL LP VLP +L + IVLDTDV F++DI
Sbjct: 109 PAVVVSFYDAEELKPLVSWIPNKHYSGLYGLMKLVLPSVLPLSLTRVIVLDTDVTFSSDI 168

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
            +LWALF     +Q +GLVENQSDWYLG LWKNH+PWPALGRGFNTGVILL L +L+ I 
Sbjct: 169 MELWALFGHFSDKQVVGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILLWLDRLQQIG 228

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   W++ A++ LLT   TSLADQDIFNA+I EHP LV+ LPC WNVQLSD+T + E CY
Sbjct: 229 WEQMWKLTAKRELLTLTATSLADQDIFNAVIKEHPELVHPLPCVWNVQLSDHTLA-ERCY 287

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
            E  DLK+IHWNSPKKL+VKNKH EFFR+L+LTFL +DG LL RELFGC          L
Sbjct: 288 LEAADLKVIHWNSPKKLRVKNKHAEFFRDLHLTFLGFDGKLLCRELFGCPNQFPPGAEQL 347

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           QQA L+ L+EE+PC++ R+ +LT  R H+ FL ++       +DVTLVAQLSMDR     
Sbjct: 348 QQA-LAQLDEEEPCFEFRQQQLTVHRVHITFLSHQ-PPPPRPHDVTLVAQLSMDR----- 400

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                       LQM+E L +HW GP+SL LY++DAEAQQFL +
Sbjct: 401 ----------------------------LQMLEALCRHWRGPMSLALYLTDAEAQQFLRF 432

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
              S  L +R+++ YHVVY++G  YP+N LRNVAL Q  TPYVFL DIDFLP + LY YL
Sbjct: 433 VETSPVLSARKDVAYHVVYRDGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYDYL 492

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           ++SI  + +     K  LVVPAFET  YR +FP S                        H
Sbjct: 493 RASIEQLALGRRQRKAALVVPAFETLHYRFSFPNSKAELLTLLDAGSLHTFRYHEWPQGH 552

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           A T+++RW  A  PY+++W+ D+EPY+V  RD PRYD RFVGFGWNK
Sbjct: 553 ASTDYTRWREAQAPYRVQWSADYEPYVVVPRDCPRYDPRFVGFGWNK 599



 Score =  252 bits (643), Expect = 5e-64,   Method: Compositional matrix adjust.
 Identities = 127/217 (58%), Positives = 156/217 (71%), Gaps = 6/217 (2%)

Query: 8   VIPT-CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTW 66
           ++PT CE++ VAIVCAGYNS+R ++TL+KS+LFYRKNPLH HLITD VA NIL+TLF TW
Sbjct: 47  LLPTKCEMLHVAIVCAGYNSSREIITLMKSVLFYRKNPLHLHLITDAVARNILETLFRTW 106

Query: 67  SVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 126
            VP V VSFY A+ +   VSWIPNKHYSG+YGL+KL LP VLP +L + IVLDTDV F++
Sbjct: 107 MVPAVVVSFYDAEELKPLVSWIPNKHYSGLYGLMKLVLPSVLPLSLTRVIVLDTDVTFSS 166

Query: 127 DIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFYLADSVVEDVSWIP 186
           DI +LWALF     +Q +GLVENQSDWYLG LWKNH+PWPALGRGF     ++    W+ 
Sbjct: 167 DIMELWALFGHFSDKQVVGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILL----WLD 222

Query: 187 NKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 223
                G   + KLT  + L  TL  T + D D+  A 
Sbjct: 223 RLQQIGWEQMWKLTAKREL-LTLTATSLADQDIFNAV 258


>gi|298684001|gb|ADI96180.1| LARGE-2 [Mastomys erythroleucus]
          Length = 653

 Score =  559 bits (1440), Expect = e-156,   Method: Compositional matrix adjust.
 Identities = 285/527 (54%), Positives = 351/527 (66%), Gaps = 65/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY A+ +   VSWIPNKHYSG+YGL+KL LP +LP +LA+ IVLDTDV F++DI
Sbjct: 109 PAVVVSFYDAEELKPLVSWIPNKHYSGLYGLMKLVLPSILPLSLARVIVLDTDVTFSSDI 168

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
            +LWALF R   +Q +GLVENQSDWYLG LWKNH+PWPALGRGFNTGVILL L +L+   
Sbjct: 169 VELWALFDRFSDKQVVGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILLWLDRLQQTG 228

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   W++ A++ L T   TSLADQDIFNA++ EHP LV+ LPC WNVQLSD+T + E CY
Sbjct: 229 WEQMWKLTAKRELHTLTATSLADQDIFNAVLKEHPXLVHPLPCVWNVQLSDHTLA-ERCY 287

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
            E  DLK+IHWNSPKKL+VKNKH EFFRNL+LTFL YDG LLRRELFGC          L
Sbjct: 288 LEAADLKVIHWNSPKKLRVKNKHAEFFRNLHLTFLGYDGKLLRRELFGCPNQFPPGAEQL 347

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           QQA L+ L+EE+PC++ R+ +LT  R H+ FL ++        DVTLVAQLSMDR     
Sbjct: 348 QQA-LAQLDEEEPCFEFRQQQLTVHRVHITFLPHQPPPPR-PYDVTLVAQLSMDR----- 400

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                       LQM+E L +HW GP+SL LY++DAEAQQFL +
Sbjct: 401 ----------------------------LQMLEALCRHWPGPMSLALYLTDAEAQQFLHF 432

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
              S  L +R+++ YHVVY++G  YP+N LRNVAL Q  TPYVFL DIDFLP + LY YL
Sbjct: 433 VETSPVLSARKDVAYHVVYQDGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYDYL 492

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           ++SI  +++ G   K  LVVPAFET  YR +FP S                        H
Sbjct: 493 RASIEQLEL-GSRRKAALVVPAFETLHYRFSFPNSKAELLTLLDAGSLYTFRYHEWPQGH 551

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
             T++SRW  A  PY+++W+ D+EPY+V  RD PRYD RFVGFGWNK
Sbjct: 552 LSTDYSRWREAQAPYRVQWSADYEPYVVVPRDCPRYDPRFVGFGWNK 598



 Score =  257 bits (657), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 127/223 (56%), Positives = 162/223 (72%), Gaps = 6/223 (2%)

Query: 2   SSDEEPVIPT-CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQ 60
           +S  + ++P+ CE++ VAIVCAGYNS+R ++TL+KS+LFYRKNPLH HLITD +A NIL+
Sbjct: 41  ASSSQLLLPSKCEMLHVAIVCAGYNSSREIITLMKSLLFYRKNPLHLHLITDAIAKNILE 100

Query: 61  TLFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDT 120
           TLF TW VP V VSFY A+ +   VSWIPNKHYSG+YGL+KL LP +LP +LA+ IVLDT
Sbjct: 101 TLFRTWMVPAVVVSFYDAEELKPLVSWIPNKHYSGLYGLMKLVLPSILPLSLARVIVLDT 160

Query: 121 DVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFYLADSVVE 180
           DV F++DI +LWALF R   +Q +GLVENQSDWYLG LWKNH+PWPALGRGF     ++ 
Sbjct: 161 DVTFSSDIVELWALFDRFSDKQVVGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILL- 219

Query: 181 DVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 223
              W+     +G   + KLT  + L  TL  T + D D+  A 
Sbjct: 220 ---WLDRLQQTGWEQMWKLTAKREL-HTLTATSLADQDIFNAV 258


>gi|298683999|gb|ADI96179.1| LARGE-2 [Mastomys erythroleucus]
          Length = 653

 Score =  559 bits (1440), Expect = e-156,   Method: Compositional matrix adjust.
 Identities = 285/527 (54%), Positives = 351/527 (66%), Gaps = 65/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY A+ +   VSWIPNKHYSG+YGL+KL LP +LP +LA+ IVLDTDV F++DI
Sbjct: 109 PAVVVSFYDAEELKPLVSWIPNKHYSGLYGLMKLVLPSILPLSLARVIVLDTDVTFSSDI 168

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
            +LWALF R   +Q +GLVENQSDWYLG LWKNH+PWPALGRGFNTGVILL L +L+   
Sbjct: 169 VELWALFDRFSDKQVVGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILLWLDRLQQTG 228

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   W++ A++ L T   TSLADQDIFNA++ EHP LV+ LPC WNVQLSD+T + E CY
Sbjct: 229 WEQMWKLTAKRELHTLTATSLADQDIFNAVLKEHPNLVHPLPCVWNVQLSDHTLA-ERCY 287

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
            E  DLK+IHWNSPKKL+VKNKH EFFRNL+LTFL YDG LLRRELFGC          L
Sbjct: 288 LEAADLKVIHWNSPKKLRVKNKHAEFFRNLHLTFLGYDGKLLRRELFGCPNQFPPGAEQL 347

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           QQA L+ L+EE+PC++ R+ +LT  R H+ FL ++        DVTLVAQLSMDR     
Sbjct: 348 QQA-LAQLDEEEPCFEFRQQQLTVHRVHITFLPHQPPPPR-PYDVTLVAQLSMDR----- 400

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                       LQM+E L +HW GP+SL LY++DAEAQQFL +
Sbjct: 401 ----------------------------LQMLEALCRHWPGPMSLALYLTDAEAQQFLHF 432

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
              S  L +R+++ YHVVY++G  YP+N LRNVAL Q  TPYVFL DIDFLP + LY YL
Sbjct: 433 VETSPVLSARKDVAYHVVYQDGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYDYL 492

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           ++SI  +++ G   K  LVVPAFET  YR +FP S                        H
Sbjct: 493 RASIEQLEL-GSRRKAALVVPAFETLHYRFSFPNSKAELLTLLDAGSLYTFRYHEWPQGH 551

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
             T++SRW  A  PY+++W+ D+EPY+V  RD PRYD RFVGFGWNK
Sbjct: 552 LSTDYSRWREAQAPYRVQWSADYEPYVVVPRDCPRYDPRFVGFGWNK 598



 Score =  258 bits (658), Expect = 9e-66,   Method: Compositional matrix adjust.
 Identities = 127/223 (56%), Positives = 162/223 (72%), Gaps = 6/223 (2%)

Query: 2   SSDEEPVIPT-CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQ 60
           +S  + ++P+ CE++ VAIVCAGYNS+R ++TL+KS+LFYRKNPLH HLITD +A NIL+
Sbjct: 41  ASSSQLLLPSKCEMLHVAIVCAGYNSSREIITLMKSLLFYRKNPLHLHLITDAIAKNILE 100

Query: 61  TLFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDT 120
           TLF TW VP V VSFY A+ +   VSWIPNKHYSG+YGL+KL LP +LP +LA+ IVLDT
Sbjct: 101 TLFRTWMVPAVVVSFYDAEELKPLVSWIPNKHYSGLYGLMKLVLPSILPLSLARVIVLDT 160

Query: 121 DVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFYLADSVVE 180
           DV F++DI +LWALF R   +Q +GLVENQSDWYLG LWKNH+PWPALGRGF     ++ 
Sbjct: 161 DVTFSSDIVELWALFDRFSDKQVVGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILL- 219

Query: 181 DVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 223
              W+     +G   + KLT  + L  TL  T + D D+  A 
Sbjct: 220 ---WLDRLQQTGWEQMWKLTAKREL-HTLTATSLADQDIFNAV 258


>gi|298683995|gb|ADI96177.1| LARGE-2 [Mastomys erythroleucus]
          Length = 653

 Score =  558 bits (1439), Expect = e-156,   Method: Compositional matrix adjust.
 Identities = 285/527 (54%), Positives = 351/527 (66%), Gaps = 65/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY A+ +   VSWIPNKHYSG+YGL+KL LP +LP +LA+ IVLDTDV F++DI
Sbjct: 109 PAVVVSFYDAEELKPLVSWIPNKHYSGLYGLMKLVLPSILPLSLARVIVLDTDVTFSSDI 168

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
            +LWALF R   +Q +GLVENQSDWYLG LWKNH+PWPALGRGFNTGVILL L +L+   
Sbjct: 169 VELWALFDRFSDKQVVGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILLWLDRLQQTG 228

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   W++ A++ L T   TSLADQDIFNA++ EHP LV+ LPC WNVQLSD+T + E CY
Sbjct: 229 WEQMWKLTAKRELHTLTATSLADQDIFNAVLKEHPNLVHPLPCVWNVQLSDHTLA-ERCY 287

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
            E  DLK+IHWNSPKKL+VKNKH EFFRNL+LTFL YDG LLRRELFGC          L
Sbjct: 288 LEAADLKVIHWNSPKKLRVKNKHAEFFRNLHLTFLGYDGKLLRRELFGCPNQFPPGAEQL 347

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           QQA L+ L+EE+PC++ R+ +LT  R H+ FL ++        DVTLVAQLSMDR     
Sbjct: 348 QQA-LAQLDEEEPCFEFRQQQLTVHRVHITFLPHQPPPPR-PYDVTLVAQLSMDR----- 400

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                       LQM+E L +HW GP+SL LY++DAEAQQFL +
Sbjct: 401 ----------------------------LQMLEALCRHWPGPMSLALYLTDAEAQQFLHF 432

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
              S  L +R+++ YHVVY++G  YP+N LRNVAL Q  TPYVFL DIDFLP + LY YL
Sbjct: 433 VETSPVLSARKDVAYHVVYQDGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYDYL 492

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           ++SI  +++ G   K  LVVPAFET  YR +FP S                        H
Sbjct: 493 RASIEQLEL-GSRRKAALVVPAFETLHYRFSFPNSKAELLTLLDAGSLYTFRYHEWPQGH 551

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
             T++SRW  A  PY+++W+ D+EPY+V  RD PRYD RFVGFGWNK
Sbjct: 552 LSTDYSRWREAQAPYRVQWSADYEPYVVVPRDCPRYDPRFVGFGWNK 598



 Score =  258 bits (658), Expect = 9e-66,   Method: Compositional matrix adjust.
 Identities = 127/223 (56%), Positives = 162/223 (72%), Gaps = 6/223 (2%)

Query: 2   SSDEEPVIPT-CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQ 60
           +S  + ++P+ CE++ VAIVCAGYNS+R ++TL+KS+LFYRKNPLH HLITD +A NIL+
Sbjct: 41  ASSSQLLLPSKCEMLHVAIVCAGYNSSREIITLMKSLLFYRKNPLHLHLITDAIAKNILE 100

Query: 61  TLFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDT 120
           TLF TW VP V VSFY A+ +   VSWIPNKHYSG+YGL+KL LP +LP +LA+ IVLDT
Sbjct: 101 TLFRTWMVPAVVVSFYDAEELKPLVSWIPNKHYSGLYGLMKLVLPSILPLSLARVIVLDT 160

Query: 121 DVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFYLADSVVE 180
           DV F++DI +LWALF R   +Q +GLVENQSDWYLG LWKNH+PWPALGRGF     ++ 
Sbjct: 161 DVTFSSDIVELWALFDRFSDKQVVGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILL- 219

Query: 181 DVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 223
              W+     +G   + KLT  + L  TL  T + D D+  A 
Sbjct: 220 ---WLDRLQQTGWEQMWKLTAKREL-HTLTATSLADQDIFNAV 258


>gi|298683997|gb|ADI96178.1| LARGE-2 [Mastomys erythroleucus]
          Length = 653

 Score =  558 bits (1439), Expect = e-156,   Method: Compositional matrix adjust.
 Identities = 285/527 (54%), Positives = 351/527 (66%), Gaps = 65/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY A+ +   VSWIPNKHYSG+YGL+KL LP +LP +LA+ IVLDTDV F++DI
Sbjct: 109 PAVVVSFYDAEELKPLVSWIPNKHYSGLYGLMKLVLPSILPLSLARVIVLDTDVTFSSDI 168

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
            +LWALF R   +Q +GLVENQSDWYLG LWKNH+PWPALGRGFNTGVILL L +L+   
Sbjct: 169 VELWALFDRFSDKQVVGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILLWLDRLQQTG 228

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   W++ A++ L T   TSLADQDIFNA++ EHP LV+ LPC WNVQLSD+T + E CY
Sbjct: 229 WEQMWKLTAKRELHTLTATSLADQDIFNAVLKEHPNLVHPLPCVWNVQLSDHTLA-ERCY 287

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
            E  DLK+IHWNSPKKL+VKNKH EFFRNL+LTFL YDG LLRRELFGC          L
Sbjct: 288 LEAADLKVIHWNSPKKLRVKNKHAEFFRNLHLTFLGYDGKLLRRELFGCPNQFPPGAEQL 347

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           QQA L+ L+EE+PC++ R+ +LT  R H+ FL ++        DVTLVAQLSMDR     
Sbjct: 348 QQA-LAQLDEEEPCFEFRQQQLTVHRVHITFLPHQPPPPR-PYDVTLVAQLSMDR----- 400

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                       LQM+E L +HW GP+SL LY++DAEAQQFL +
Sbjct: 401 ----------------------------LQMLEALCRHWPGPMSLALYLTDAEAQQFLHF 432

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
              S  L +R+++ YHVVY++G  YP+N LRNVAL Q  TPYVFL DIDFLP + LY YL
Sbjct: 433 VETSPVLSARKDVAYHVVYQDGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYDYL 492

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           ++SI  +++ G   K  LVVPAFET  YR +FP S                        H
Sbjct: 493 RASIEQLEL-GSRRKAALVVPAFETLHYRFSFPNSKAELLTLLDAGSLYTFRYHEWPQGH 551

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
             T++SRW  A  PY+++W+ D+EPY+V  RD PRYD RFVGFGWNK
Sbjct: 552 LSTDYSRWREAQAPYRVQWSADYEPYVVVPRDCPRYDPRFVGFGWNK 598



 Score =  246 bits (628), Expect = 3e-62,   Method: Compositional matrix adjust.
 Identities = 122/212 (57%), Positives = 152/212 (71%), Gaps = 5/212 (2%)

Query: 12  CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQV 71
           CE++ VAI  A YNS+R ++TL+KS+LFYRKNPLH HLITD +A NIL+TLF TW VP V
Sbjct: 52  CEMLHVAIEWAEYNSSREIITLMKSLLFYRKNPLHLHLITDAIAKNILETLFRTWMVPAV 111

Query: 72  EVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQL 131
            VSFY A+ +   VSWIPNKHYSG+YGL+KL LP +LP +LA+ IVLDTDV F++DI +L
Sbjct: 112 VVSFYDAEELKPLVSWIPNKHYSGLYGLMKLVLPSILPLSLARVIVLDTDVTFSSDIVEL 171

Query: 132 WALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFYLADSVVEDVSWIPNKHYS 191
           WALF R   +Q +GLVENQSDWYLG LWKNH+PWPALGRGF     ++    W+     +
Sbjct: 172 WALFDRFSDKQVVGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILL----WLDRLQQT 227

Query: 192 GVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 223
           G   + KLT  + L  TL  T + D D+  A 
Sbjct: 228 GWEQMWKLTAKREL-HTLTATSLADQDIFNAV 258


>gi|298684003|gb|ADI96181.1| LARGE-2 [Mastomys erythroleucus]
          Length = 653

 Score =  558 bits (1438), Expect = e-156,   Method: Compositional matrix adjust.
 Identities = 284/527 (53%), Positives = 351/527 (66%), Gaps = 65/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY A+ +   VSWIPNKHYSG+YGL+KL LP +LP +LA+ I+LDTDV F++DI
Sbjct: 109 PAVVVSFYDAEELKPLVSWIPNKHYSGLYGLMKLVLPSILPLSLARVIILDTDVTFSSDI 168

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
            +LWALF R   +Q +GLVENQSDWYLG LWKNH+PWPALGRGFNTGVILL L +L+   
Sbjct: 169 VELWALFDRFSDKQVVGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILLWLDRLQQTG 228

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   W++ A++ L T   TSLADQDIFNA++ EHP LV+ LPC WNVQLSD+T + E CY
Sbjct: 229 WEQMWKLTAKRELHTLTATSLADQDIFNAVLKEHPNLVHPLPCVWNVQLSDHTLA-ERCY 287

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
            E  DLK+IHWNSPKKL+VKNKH EFFRNL+LTFL YDG LLRRELFGC          L
Sbjct: 288 LEAADLKVIHWNSPKKLRVKNKHAEFFRNLHLTFLGYDGKLLRRELFGCPNQFPPGAEQL 347

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           QQA L+ L+EE+PC++ R+ +LT  R H+ FL ++        DVTLVAQLSMDR     
Sbjct: 348 QQA-LAQLDEEEPCFEFRQQQLTVHRVHITFLPHQPPPPR-PYDVTLVAQLSMDR----- 400

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                       LQM+E L +HW GP+SL LY++DAEAQQFL +
Sbjct: 401 ----------------------------LQMLEALCRHWPGPMSLALYLTDAEAQQFLHF 432

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
              S  L +R+++ YHVVY++G  YP+N LRNVAL Q  TPYVFL DIDFLP + LY YL
Sbjct: 433 VETSPVLSARKDVAYHVVYQDGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYDYL 492

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           ++SI  +++ G   K  LVVPAFET  YR +FP S                        H
Sbjct: 493 RASIEQLEL-GSRRKAALVVPAFETLHYRFSFPNSKAELLTLLDAGSLYTFRYHEWPQGH 551

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
             T++SRW  A  PY+++W+ D+EPY+V  RD PRYD RFVGFGWNK
Sbjct: 552 LSTDYSRWREAQAPYRVQWSADYEPYVVVPRDCPRYDPRFVGFGWNK 598



 Score =  257 bits (657), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 126/223 (56%), Positives = 162/223 (72%), Gaps = 6/223 (2%)

Query: 2   SSDEEPVIPT-CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQ 60
           +S  + ++P+ CE++ VAIVCAGYNS+R ++TL+KS+LFYRKNPLH HLITD +A NIL+
Sbjct: 41  ASSSQLLLPSKCEMLHVAIVCAGYNSSREIITLMKSLLFYRKNPLHLHLITDAIAKNILE 100

Query: 61  TLFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDT 120
           TLF TW VP V VSFY A+ +   VSWIPNKHYSG+YGL+KL LP +LP +LA+ I+LDT
Sbjct: 101 TLFRTWMVPAVVVSFYDAEELKPLVSWIPNKHYSGLYGLMKLVLPSILPLSLARVIILDT 160

Query: 121 DVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFYLADSVVE 180
           DV F++DI +LWALF R   +Q +GLVENQSDWYLG LWKNH+PWPALGRGF     ++ 
Sbjct: 161 DVTFSSDIVELWALFDRFSDKQVVGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILL- 219

Query: 181 DVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 223
              W+     +G   + KLT  + L  TL  T + D D+  A 
Sbjct: 220 ---WLDRLQQTGWEQMWKLTAKREL-HTLTATSLADQDIFNAV 258


>gi|298683979|gb|ADI96169.1| LARGE-2 [Mastomys natalensis]
          Length = 653

 Score =  557 bits (1436), Expect = e-156,   Method: Compositional matrix adjust.
 Identities = 284/527 (53%), Positives = 350/527 (66%), Gaps = 65/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY A+ +   VSWIPNKHYSG+YGL+KL LP +LP +LA+ I+LDTDV F++DI
Sbjct: 109 PAVVVSFYDAEELKPLVSWIPNKHYSGLYGLMKLVLPSILPLSLARVIILDTDVTFSSDI 168

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
            +LWALF R   +Q +GLVENQSDWYLG LWKNH+PWPALGRGFNTGVILL L +L+   
Sbjct: 169 VELWALFDRFSDKQVVGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILLWLDRLQQTG 228

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   W++ A++ L T   TSLADQDIFNA++ EHP LV+ LPC WNVQLSD+T + E CY
Sbjct: 229 WEQMWKLTAKRELHTLTATSLADQDIFNAVLKEHPNLVHPLPCVWNVQLSDHTLA-ERCY 287

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
            E  DLK+IHWNSPKKL+VKNKH EFFR L+LTFL YDG LLRRELFGC          L
Sbjct: 288 LEAADLKVIHWNSPKKLRVKNKHAEFFRTLHLTFLGYDGKLLRRELFGCPNQFPPGAEQL 347

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           QQA L+ L+EE+PC++ R+ +LT  R H+ FL ++        DVTLVAQLSMDR     
Sbjct: 348 QQA-LAQLDEEEPCFEFRQQQLTVHRVHITFLPHQPPPPR-PYDVTLVAQLSMDR----- 400

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                       LQM+E L +HW GP+SL LY++DAEAQQFL +
Sbjct: 401 ----------------------------LQMLEALCRHWPGPMSLALYLTDAEAQQFLHF 432

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
              S  L +R+++ YHVVY++G  YP+N LRNVAL Q  TPYVFL DIDFLP + LY YL
Sbjct: 433 VETSPVLSARKDVAYHVVYQDGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYDYL 492

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           ++SI  +++ G   K  LVVPAFET  YR +FP S                        H
Sbjct: 493 RASIEQLEL-GSRRKAALVVPAFETLHYRFSFPNSKAELLTLLDAGSLYTFRYHEWPQGH 551

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
             TN+SRW  A  PY+++W+ D+EPY+V  RD PRYD RFVGFGWNK
Sbjct: 552 LSTNYSRWREAQAPYRVQWSADYEPYVVVPRDCPRYDPRFVGFGWNK 598



 Score =  257 bits (657), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 126/223 (56%), Positives = 162/223 (72%), Gaps = 6/223 (2%)

Query: 2   SSDEEPVIPT-CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQ 60
           +S  + ++P+ CE++ VAIVCAGYNS+R ++TL+KS+LFYRKNPLH HLITD +A NIL+
Sbjct: 41  ASSSQLLLPSKCEMLHVAIVCAGYNSSREIITLMKSLLFYRKNPLHLHLITDAIARNILE 100

Query: 61  TLFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDT 120
           TLF TW VP V VSFY A+ +   VSWIPNKHYSG+YGL+KL LP +LP +LA+ I+LDT
Sbjct: 101 TLFRTWMVPAVVVSFYDAEELKPLVSWIPNKHYSGLYGLMKLVLPSILPLSLARVIILDT 160

Query: 121 DVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFYLADSVVE 180
           DV F++DI +LWALF R   +Q +GLVENQSDWYLG LWKNH+PWPALGRGF     ++ 
Sbjct: 161 DVTFSSDIVELWALFDRFSDKQVVGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILL- 219

Query: 181 DVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 223
              W+     +G   + KLT  + L  TL  T + D D+  A 
Sbjct: 220 ---WLDRLQQTGWEQMWKLTAKREL-HTLTATSLADQDIFNAV 258


>gi|298683991|gb|ADI96175.1| LARGE-2 [Mastomys natalensis]
          Length = 653

 Score =  556 bits (1434), Expect = e-155,   Method: Compositional matrix adjust.
 Identities = 284/527 (53%), Positives = 350/527 (66%), Gaps = 65/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY A+ +   VSWIPNKHYSG+YGL+KL LP +LP +LA+ IVLDTDV F++DI
Sbjct: 109 PAVVVSFYDAEELKPLVSWIPNKHYSGLYGLMKLVLPSILPLSLARVIVLDTDVTFSSDI 168

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
            +LWALF R   +Q +GLVENQSDWYLG LWKNH+PWPALGRGFNTGVILL L +L+   
Sbjct: 169 VELWALFDRFSDKQVVGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILLWLDRLQQTG 228

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   W++ A++ L T   TSLADQDIFNA++ EHP LV+ LPC WNVQLSD+T + E CY
Sbjct: 229 WEQMWKLTAKRELHTLTATSLADQDIFNAVLKEHPNLVHPLPCVWNVQLSDHTLA-ERCY 287

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
            E  DLK+IHWNSPKKL+VKNKH EFFR L+LTFL YDG LLRRELFGC          L
Sbjct: 288 LEAADLKVIHWNSPKKLRVKNKHAEFFRTLHLTFLGYDGKLLRRELFGCPNQFPPGAEQL 347

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           QQA L+ L+EE+PC++ R+ +LT  R H+ FL ++        DVTLVAQLSMDR     
Sbjct: 348 QQA-LAQLDEEEPCFEFRQQQLTVHRVHITFLPHQPPPPR-PYDVTLVAQLSMDR----- 400

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                       LQM+E L +HW GP+SL LY++DAEAQQFL +
Sbjct: 401 ----------------------------LQMLEALCRHWPGPMSLALYLTDAEAQQFLHF 432

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
              S  L +R+++ YHVVY++G  YP+N LRNVAL Q  TPYVFL DIDFLP + LY YL
Sbjct: 433 VETSPVLSARKDVAYHVVYQDGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYDYL 492

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           ++SI  +++ G   K  LVVPAFET  YR +FP S                        H
Sbjct: 493 RASIEQLEL-GSRRKAALVVPAFETLHYRFSFPNSKAELLTLLDAGSLYTFRYHEWPQGH 551

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
             T++SRW  A  PY+++W+ D+EPY+V  RD PRYD RFVGFGWNK
Sbjct: 552 LSTDYSRWREAQAPYRVQWSADYEPYVVVPRDCPRYDPRFVGFGWNK 598



 Score =  259 bits (661), Expect = 4e-66,   Method: Compositional matrix adjust.
 Identities = 128/223 (57%), Positives = 162/223 (72%), Gaps = 6/223 (2%)

Query: 2   SSDEEPVIPT-CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQ 60
           +S  + ++P+ CE++ VAIVCAGYNS+R L+TL+KS+LFYRKNPLH HLITD +A NIL+
Sbjct: 41  ASSSQLLLPSKCEMLHVAIVCAGYNSSRELITLMKSLLFYRKNPLHLHLITDAIARNILE 100

Query: 61  TLFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDT 120
           TLF TW VP V VSFY A+ +   VSWIPNKHYSG+YGL+KL LP +LP +LA+ IVLDT
Sbjct: 101 TLFRTWMVPAVVVSFYDAEELKPLVSWIPNKHYSGLYGLMKLVLPSILPLSLARVIVLDT 160

Query: 121 DVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFYLADSVVE 180
           DV F++DI +LWALF R   +Q +GLVENQSDWYLG LWKNH+PWPALGRGF     ++ 
Sbjct: 161 DVTFSSDIVELWALFDRFSDKQVVGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILL- 219

Query: 181 DVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 223
              W+     +G   + KLT  + L  TL  T + D D+  A 
Sbjct: 220 ---WLDRLQQTGWEQMWKLTAKREL-HTLTATSLADQDIFNAV 258


>gi|431915739|gb|ELK16072.1| Glycosyltransferase-like protein LARGE2 [Pteropus alecto]
          Length = 673

 Score =  556 bits (1432), Expect = e-155,   Method: Compositional matrix adjust.
 Identities = 284/510 (55%), Positives = 343/510 (67%), Gaps = 63/510 (12%)

Query: 182 VSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTI 241
           VSWIPNKHYSG+YGL+KL LP  LP  LA+ IVLDTDV FA+DIA+LWALF+     Q I
Sbjct: 120 VSWIPNKHYSGLYGLMKLVLPGALPPALARVIVLDTDVTFASDIAELWALFAYFSDEQVI 179

Query: 242 GLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTR 301
           GLVENQSDWYLG LW+NH+PWPALGRGFNTGVILL L +LR   W   W++ A + LLT 
Sbjct: 180 GLVENQSDWYLGNLWRNHRPWPALGRGFNTGVILLRLDRLRQAGWEQMWKLTATQELLTL 239

Query: 302 LWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWNSPKK 361
             TSLADQDIFNAII  HP LV  LPC WNVQLSD+T + E CY+E  DLK+IHWNSPKK
Sbjct: 240 PATSLADQDIFNAIIKRHPGLVLPLPCIWNVQLSDHTLA-ERCYSEAADLKVIHWNSPKK 298

Query: 362 LKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC---NLTQTTLQQAELSNLNEEDPCYDL 418
           L+VKNKH+EFFRNL+LTFLEY+GNLLRRELFGC      +    Q  L++L+EED C + 
Sbjct: 299 LRVKNKHVEFFRNLHLTFLEYNGNLLRRELFGCPSPPPPEAEQLQRALAHLSEEDTCLEF 358

Query: 419 RRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEINPNNVDEYEASSDGND 478
           R+ +LT  R H+ FL +E       +DVTLVAQLSMDR                      
Sbjct: 359 RQQQLTVHRVHITFLPHEPPPPRP-HDVTLVAQLSMDR---------------------- 395

Query: 479 VTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRNIGYHV 538
                      LQM+E L +HW GP+SL LY++DAEAQQFL +   S  L +R+++ YHV
Sbjct: 396 -----------LQMLEALCRHWPGPMSLALYLTDAEAQQFLHFVEASAVLSARQDVAYHV 444

Query: 539 VYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKV 598
           VY+EG+ YPIN LRNVA+ Q  TPYVFL DID LP + LY YL++SI  +++ G G K  
Sbjct: 445 VYREGSLYPINQLRNVAMAQALTPYVFLSDIDLLPSYSLYDYLRASIEQLEL-GSGRKAA 503

Query: 599 LVVPAFETQRYRTAFPAS------------------------HAPTNFSRWVNATTPYQI 634
           LVVPAFET  Y  +FP S                        +APT+++RW  A  PY++
Sbjct: 504 LVVPAFETLHYHFSFPNSKAELLALLDAGSLYTFRSHEWPQGYAPTDYARWREAQAPYRV 563

Query: 635 EWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           +WA D+EPY+V  RD PRYD RFVGFGWNK
Sbjct: 564 QWAADYEPYVVVPRDCPRYDPRFVGFGWNK 593



 Score =  183 bits (464), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 88/166 (53%), Positives = 109/166 (65%), Gaps = 33/166 (19%)

Query: 7   PVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTW 66
           P+ P CE++ VA+VCAG+N++R ++TL+KS+LF+R                        W
Sbjct: 75  PLPPKCELLHVAMVCAGHNTSRDVITLVKSMLFFR-----------------------AW 111

Query: 67  SVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 126
            V + E            VSWIPNKHYSG+YGL+KL LP  LP  LA+ IVLDTDV FA+
Sbjct: 112 GVGRGE----------PQVSWIPNKHYSGLYGLMKLVLPGALPPALARVIVLDTDVTFAS 161

Query: 127 DIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           DIA+LWALF+     Q IGLVENQSDWYLG LW+NH+PWPALGRGF
Sbjct: 162 DIAELWALFAYFSDEQVIGLVENQSDWYLGNLWRNHRPWPALGRGF 207


>gi|298683983|gb|ADI96171.1| LARGE-2 [Mastomys natalensis]
          Length = 653

 Score =  555 bits (1431), Expect = e-155,   Method: Compositional matrix adjust.
 Identities = 284/527 (53%), Positives = 349/527 (66%), Gaps = 65/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY A+ +   VSWIPNKHYSG+YGL+KL LP +LP +LA+ IVLDTDV F++DI
Sbjct: 109 PAVVVSFYDAEELKPLVSWIPNKHYSGLYGLMKLVLPSILPLSLARVIVLDTDVTFSSDI 168

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
            +LWALF R   +Q +GLVENQSDWYLG LWKNH+PWPALGRGFNTGVILL L +L+   
Sbjct: 169 VELWALFDRFSDKQVVGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILLWLDRLQQTG 228

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   W++ A++ L T   TSLADQDIFNA++ EHP LV+ LPC WNVQLSD+T + E CY
Sbjct: 229 WEQMWKLTAKRELHTLTATSLADQDIFNAVLKEHPNLVHPLPCVWNVQLSDHTLA-ERCY 287

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
            E  DLK+IHWNSPKKL+VKNKH EFFR L+LTFL YDG LLRRELFGC          L
Sbjct: 288 LEAADLKVIHWNSPKKLRVKNKHAEFFRXLHLTFLGYDGKLLRRELFGCPNQFPPGAEQL 347

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           QQA L+ L+EE+PC++ R+ +LT  R H+ FL ++        DVTLVAQLSMDR     
Sbjct: 348 QQA-LAQLDEEEPCFEFRQQQLTVHRVHITFLPHQPPPPR-PYDVTLVAQLSMDR----- 400

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                       LQM+E L +HW GP+SL LY++DAEAQQFL +
Sbjct: 401 ----------------------------LQMLEALCRHWPGPMSLALYLTDAEAQQFLHF 432

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
              S  L +R+++ YHVVY++G  YP+N LRNVAL Q  TPYVFL DIDFLP + LY YL
Sbjct: 433 VETSPVLSARKDVAYHVVYQDGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYDYL 492

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           ++SI  +++ G   K  LVVPAFET  YR +FP S                        H
Sbjct: 493 RASIEQLEL-GSRRKAALVVPAFETLHYRFSFPNSKAELLTLLDAGSLYTFRYHEWPQGH 551

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
             T +SRW  A  PY+++W+ D+EPY+V  RD PRYD RFVGFGWNK
Sbjct: 552 LSTXYSRWREAQAPYRVQWSADYEPYVVVPRDCPRYDPRFVGFGWNK 598



 Score =  258 bits (659), Expect = 8e-66,   Method: Compositional matrix adjust.
 Identities = 127/223 (56%), Positives = 162/223 (72%), Gaps = 6/223 (2%)

Query: 2   SSDEEPVIPT-CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQ 60
           +S  + ++P+ CE++ VAIVCAGYNS+R ++TL+KS+LFYRKNPLH HLITD +A NIL+
Sbjct: 41  ASSSQLLLPSKCEMLHVAIVCAGYNSSREIITLMKSLLFYRKNPLHLHLITDAIARNILE 100

Query: 61  TLFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDT 120
           TLF TW VP V VSFY A+ +   VSWIPNKHYSG+YGL+KL LP +LP +LA+ IVLDT
Sbjct: 101 TLFRTWMVPAVVVSFYDAEELKPLVSWIPNKHYSGLYGLMKLVLPSILPLSLARVIVLDT 160

Query: 121 DVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFYLADSVVE 180
           DV F++DI +LWALF R   +Q +GLVENQSDWYLG LWKNH+PWPALGRGF     ++ 
Sbjct: 161 DVTFSSDIVELWALFDRFSDKQVVGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILL- 219

Query: 181 DVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 223
              W+     +G   + KLT  + L  TL  T + D D+  A 
Sbjct: 220 ---WLDRLQQTGWEQMWKLTAKREL-HTLTATSLADQDIFNAV 258


>gi|298683981|gb|ADI96170.1| LARGE-2 [Mastomys natalensis]
          Length = 653

 Score =  555 bits (1429), Expect = e-155,   Method: Compositional matrix adjust.
 Identities = 283/527 (53%), Positives = 350/527 (66%), Gaps = 65/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY A+ +   VSWIPNKHYSG+YGL+KL LP +LP +LA+ I+LDTDV F++DI
Sbjct: 109 PAVVVSFYDAEELKPLVSWIPNKHYSGLYGLMKLVLPSILPLSLARVIILDTDVTFSSDI 168

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
            +LWALF R   +Q +GLVENQSDWYLG LWKNH+PWPALGRGFNTGVILL L +L+   
Sbjct: 169 VELWALFDRFSDKQVVGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILLWLDRLQQTG 228

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   W++ A++ L T   TSLADQDIFNA++ EHP LV+ LPC WNVQLSD+T + E CY
Sbjct: 229 WEQMWKLTAKRELHTLTATSLADQDIFNAVLKEHPNLVHPLPCVWNVQLSDHTLA-ERCY 287

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
            E  DLK+IHWNSPKKL+VKNKH EFFR L+LTFL YDG LLRRELFGC          L
Sbjct: 288 LEAADLKVIHWNSPKKLRVKNKHAEFFRXLHLTFLGYDGKLLRRELFGCPNQFPPGAEQL 347

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           QQA L+ L+EE+PC++ R+ +LT  R H+ FL ++        DVTLVAQLSMDR     
Sbjct: 348 QQA-LAQLDEEEPCFEFRQQQLTVHRVHITFLPHQPPPPR-PYDVTLVAQLSMDR----- 400

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                       LQM+E L +HW GP+SL LY++DAEAQQFL +
Sbjct: 401 ----------------------------LQMLEALCRHWPGPMSLALYLTDAEAQQFLHF 432

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
              S  L +R+++ YHVVY++G  YP+N LRNVAL Q  TPYVFL DIDFLP + LY YL
Sbjct: 433 VETSPVLSARKDVAYHVVYQDGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYDYL 492

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           ++SI  +++ G   K  LVVPAFET  YR +FP S                        H
Sbjct: 493 RASIEQLEL-GSRRKAALVVPAFETLHYRFSFPNSKAELLTLLDAGSLYTFRYHEWPQGH 551

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
             T++SRW  A  PY+++W+ D+EPY+V  RD PRYD RFVGFGWNK
Sbjct: 552 LSTDYSRWREAQAPYRVQWSADYEPYVVVPRDCPRYDPRFVGFGWNK 598



 Score =  257 bits (656), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 126/223 (56%), Positives = 162/223 (72%), Gaps = 6/223 (2%)

Query: 2   SSDEEPVIPT-CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQ 60
           +S  + ++P+ CE++ VAIVCAGYNS+R ++TL+KS+LFYRKNPLH HLITD +A NIL+
Sbjct: 41  ASSSQLLLPSKCEMLHVAIVCAGYNSSREIITLMKSLLFYRKNPLHLHLITDAIARNILE 100

Query: 61  TLFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDT 120
           TLF TW VP V VSFY A+ +   VSWIPNKHYSG+YGL+KL LP +LP +LA+ I+LDT
Sbjct: 101 TLFRTWMVPAVVVSFYDAEELKPLVSWIPNKHYSGLYGLMKLVLPSILPLSLARVIILDT 160

Query: 121 DVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFYLADSVVE 180
           DV F++DI +LWALF R   +Q +GLVENQSDWYLG LWKNH+PWPALGRGF     ++ 
Sbjct: 161 DVTFSSDIVELWALFDRFSDKQVVGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILL- 219

Query: 181 DVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 223
              W+     +G   + KLT  + L  TL  T + D D+  A 
Sbjct: 220 ---WLDRLQQTGWEQMWKLTAKREL-HTLTATSLADQDIFNAV 258


>gi|298683989|gb|ADI96174.1| LARGE-2 [Mastomys natalensis]
          Length = 653

 Score =  554 bits (1428), Expect = e-155,   Method: Compositional matrix adjust.
 Identities = 283/527 (53%), Positives = 349/527 (66%), Gaps = 65/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY A+ +   VSWIPNKHYSG+YGL+KL LP +LP +LA+ IVLDTDV F++DI
Sbjct: 109 PAVVVSFYDAEELKPLVSWIPNKHYSGLYGLMKLVLPSILPLSLARVIVLDTDVTFSSDI 168

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
            +LWALF R   +Q +GLV NQSDWYLG LWKNH+PWPALGRGFNTGVILL L +L+   
Sbjct: 169 VELWALFDRFSDKQVVGLVXNQSDWYLGNLWKNHRPWPALGRGFNTGVILLWLDRLQQTG 228

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   W++ A++ L T   TSLADQDIFNA++ EHP LV+ LPC WNVQLSD+T + E CY
Sbjct: 229 WEQMWKLTAKRELHTLTATSLADQDIFNAVLKEHPNLVHPLPCVWNVQLSDHTLA-ERCY 287

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
            E  DLK+IHWNSPKKL+VKNKH EFFR L+LTFL YDG LLRRELFGC          L
Sbjct: 288 LEAADLKVIHWNSPKKLRVKNKHAEFFRTLHLTFLGYDGKLLRRELFGCPNQFPPGAEQL 347

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           QQA L+ L+EE+PC++ R+ +LT  R H+ FL ++        DVTLVAQLSMDR     
Sbjct: 348 QQA-LAQLDEEEPCFEFRQQQLTVHRVHITFLPHQPPPPR-PYDVTLVAQLSMDR----- 400

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                       LQM+E L +HW GP+SL LY++DAEAQQFL +
Sbjct: 401 ----------------------------LQMLEALCRHWPGPMSLALYLTDAEAQQFLHF 432

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
              S  L +R+++ YHVVY++G  YP+N LRNVAL Q  TPYVFL DIDFLP + LY YL
Sbjct: 433 VETSPVLSARKDVAYHVVYQDGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYDYL 492

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           ++SI  +++ G   K  LVVPAFET  YR +FP S                        H
Sbjct: 493 RASIEQLEL-GSRRKAALVVPAFETLHYRFSFPNSKAELLTLLDAGSLYTFRYHEWPQGH 551

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
             T++SRW  A  PY+++W+ D+EPY+V  RD PRYD RFVGFGWNK
Sbjct: 552 LSTDYSRWREAQAPYRVQWSADYEPYVVVPRDCPRYDPRFVGFGWNK 598



 Score =  256 bits (655), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 127/223 (56%), Positives = 161/223 (72%), Gaps = 6/223 (2%)

Query: 2   SSDEEPVIPT-CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQ 60
           +S  + ++P+ CE++ VAIVCAGYNS+R L+TL+KS+LFYRKNPLH HLITD +A NIL+
Sbjct: 41  ASSSQLLLPSKCEMLHVAIVCAGYNSSRELITLMKSLLFYRKNPLHLHLITDAIARNILE 100

Query: 61  TLFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDT 120
           TLF TW VP V VSFY A+ +   VSWIPNKHYSG+YGL+KL LP +LP +LA+ IVLDT
Sbjct: 101 TLFRTWMVPAVVVSFYDAEELKPLVSWIPNKHYSGLYGLMKLVLPSILPLSLARVIVLDT 160

Query: 121 DVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFYLADSVVE 180
           DV F++DI +LWALF R   +Q +GLV NQSDWYLG LWKNH+PWPALGRGF     ++ 
Sbjct: 161 DVTFSSDIVELWALFDRFSDKQVVGLVXNQSDWYLGNLWKNHRPWPALGRGFNTGVILL- 219

Query: 181 DVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 223
              W+     +G   + KLT  + L  TL  T + D D+  A 
Sbjct: 220 ---WLDRLQQTGWEQMWKLTAKREL-HTLTATSLADQDIFNAV 258


>gi|345326808|ref|XP_001510588.2| PREDICTED: glycosyltransferase-like protein LARGE1 [Ornithorhynchus
           anatinus]
          Length = 502

 Score =  554 bits (1427), Expect = e-155,   Method: Compositional matrix adjust.
 Identities = 271/455 (59%), Positives = 324/455 (71%), Gaps = 65/455 (14%)

Query: 238 RQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKF 297
           +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR + W   WR+ AE+ 
Sbjct: 10  QQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMKWEQMWRLTAERE 69

Query: 298 LLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
           L++ L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY +++DLK+IHWN
Sbjct: 70  LMSMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CYRDVSDLKVIHWN 128

Query: 358 SPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTLQQAELSNLNEED 413
           SPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC    ++    LQ+ +LS L+E+D
Sbjct: 129 SPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEADVNSENLQK-QLSELDEDD 187

Query: 414 PCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEINPNNVDEYEAS 473
            CY+ RR + T  RTHLYFL YEYE +SD  DVTLVAQLSMDRLQM+E            
Sbjct: 188 LCYEFRRERFTVHRTHLYFLHYEYEPASDNTDVTLVAQLSMDRLQMLEA----------- 236

Query: 474 SDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRN 533
                                 + KHWEGPISL LY+SDAEAQQFL YA  SE L SRRN
Sbjct: 237 ----------------------ICKHWEGPISLALYLSDAEAQQFLRYAQGSEVLMSRRN 274

Query: 534 IGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGH 593
           +GYH+VYKEG FYP+N LRNVA+  + TPY+FL DIDFLPM+GLY YL+ S+  +D+   
Sbjct: 275 VGYHIVYKEGQFYPVNLLRNVAMKHIGTPYMFLSDIDFLPMYGLYEYLRKSVIQLDL--A 332

Query: 594 GGKKVLVVPAFETQRYRTAFPAS------------------------HAPTNFSRWVNAT 629
             KK L+VPAFET RYR +FP S                        HAPTNF++W  AT
Sbjct: 333 NTKKALIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGHAPTNFAKWRTAT 392

Query: 630 TPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           TPY++EW  DFEPY+V  R+ P YD RFVGFGWNK
Sbjct: 393 TPYRVEWEADFEPYVVVRRNCPEYDRRFVGFGWNK 427



 Score = 68.2 bits (165), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 26/32 (81%), Positives = 30/32 (93%)

Query: 141 RQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 10  QQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 41


>gi|298683985|gb|ADI96172.1| LARGE-2 [Mastomys natalensis]
          Length = 653

 Score =  553 bits (1424), Expect = e-154,   Method: Compositional matrix adjust.
 Identities = 283/527 (53%), Positives = 348/527 (66%), Gaps = 65/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY A+ +   VSWIPNKHYSG+YGL+KL LP +LP +LA+ IVLDTDV F++DI
Sbjct: 109 PAVVVSFYDAEELKPLVSWIPNKHYSGLYGLMKLVLPSILPLSLARVIVLDTDVTFSSDI 168

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
            +LWALF R    Q +GLVE QSDWYLG LWKNH+PWPALGRGFNTGVILL L +L+   
Sbjct: 169 VELWALFDRFSDXQVVGLVEXQSDWYLGNLWKNHRPWPALGRGFNTGVILLWLDRLQQTG 228

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   W++ A++ L T   TSLADQDIFNA++ EHP LV+ LPC WNVQLSD+T + E CY
Sbjct: 229 WEQMWKLTAKRELHTLTATSLADQDIFNAVLKEHPNLVHPLPCVWNVQLSDHTLA-ERCY 287

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
            E  DLK+IHWNSPKKL+VKNKH EFFR L+LTFL YDG LLRRELFGC          L
Sbjct: 288 LEAADLKVIHWNSPKKLRVKNKHAEFFRTLHLTFLGYDGKLLRRELFGCPNQFPPGAEQL 347

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           QQA L+ L+EE+PC++ R+ +LT  R H+ FL ++        DVTLVAQLSMDR     
Sbjct: 348 QQA-LAQLDEEEPCFEFRQQQLTVHRVHITFLPHQPPPPR-PYDVTLVAQLSMDR----- 400

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                       LQM+E L +HW GP+SL LY++DAEAQQFL +
Sbjct: 401 ----------------------------LQMLEALCRHWPGPMSLALYLTDAEAQQFLHF 432

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
              S  L +R+++ YHVVY++G  YP+N LRNVAL Q  TPYVFL DIDFLP + LY YL
Sbjct: 433 VETSPVLSARKDVAYHVVYQDGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYDYL 492

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           ++SI  +++ G   K  LVVPAFET  YR +FP S                        H
Sbjct: 493 RASIEQLEL-GSRRKAALVVPAFETLHYRFSFPNSKAELLTLLDAGSLYTFRYHEWPQGH 551

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
             T++SRW  A  PY+++W+ D+EPY+V  RD PRYD RFVGFGWNK
Sbjct: 552 LSTDYSRWREAQAPYRVQWSADYEPYVVVPRDCPRYDPRFVGFGWNK 598



 Score =  255 bits (651), Expect = 7e-65,   Method: Compositional matrix adjust.
 Identities = 127/223 (56%), Positives = 160/223 (71%), Gaps = 6/223 (2%)

Query: 2   SSDEEPVIPT-CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQ 60
           +S  + ++P+ CE++ VAIVCAGYNS+R L+TL+KS+LFYRKNPLH HLITD +A NIL+
Sbjct: 41  ASSSQLLLPSKCEMLHVAIVCAGYNSSRELITLMKSLLFYRKNPLHLHLITDAIARNILE 100

Query: 61  TLFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDT 120
           TLF TW VP V VSFY A+ +   VSWIPNKHYSG+YGL+KL LP +LP +LA+ IVLDT
Sbjct: 101 TLFRTWMVPAVVVSFYDAEELKPLVSWIPNKHYSGLYGLMKLVLPSILPLSLARVIVLDT 160

Query: 121 DVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFYLADSVVE 180
           DV F++DI +LWALF R    Q +GLVE QSDWYLG LWKNH+PWPALGRGF     ++ 
Sbjct: 161 DVTFSSDIVELWALFDRFSDXQVVGLVEXQSDWYLGNLWKNHRPWPALGRGFNTGVILL- 219

Query: 181 DVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 223
              W+     +G   + KLT  + L  TL  T + D D+  A 
Sbjct: 220 ---WLDRLQQTGWEQMWKLTAKREL-HTLTATSLADQDIFNAV 258


>gi|298683973|gb|ADI96166.1| LARGE-2 [Mastomys kollmannspergeri]
          Length = 653

 Score =  551 bits (1421), Expect = e-154,   Method: Compositional matrix adjust.
 Identities = 283/527 (53%), Positives = 348/527 (66%), Gaps = 65/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY A+ +   VSWIPNKHYSG+YGL+KL LP +LP +LA+ IVLDTDV F +DI
Sbjct: 109 PAVVVSFYDAEELKPLVSWIPNKHYSGLYGLMKLVLPSILPLSLARVIVLDTDVTFFSDI 168

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
            +LWALF R    Q +GLVENQSDWYLG LWKNH+PWPALGRGFNTGVILL L +L+   
Sbjct: 169 VELWALFDRFSDXQVVGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILLWLDRLQQTG 228

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   W++ A++ LLT   TSLADQDIFN ++ EHP LV+ LPC WNVQLSD+T + E CY
Sbjct: 229 WEQMWKLTAKRELLTLTATSLADQDIFNVVLKEHPNLVHPLPCVWNVQLSDHTLA-ERCY 287

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
            E  DLK+IHWNSPKKL+VKNKH EFFRNL+LTFL YDG LLRRELFGC          L
Sbjct: 288 LEAADLKVIHWNSPKKLRVKNKHAEFFRNLHLTFLGYDGKLLRRELFGCPNQFPPGAEHL 347

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           QQA L+ ++EE+PC++ R+ +LT  R H+ FL ++        DVTLVAQLSMDR     
Sbjct: 348 QQA-LAQVDEEEPCFEFRQQQLTVHRVHITFLPHQPPPPR-PYDVTLVAQLSMDR----- 400

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                       LQM+E L +HW GP+SL LY++DAEAQ FL +
Sbjct: 401 ----------------------------LQMLEALCRHWPGPMSLALYLTDAEAQLFLHF 432

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
              S  L +R+++ YHVVY++G  YP+N LRNVAL Q  TPYVFL DIDFLP + LY YL
Sbjct: 433 VETSPVLSARKDVAYHVVYQDGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYDYL 492

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           ++SI  +++ G   K  LVVPAFET  YR +FP S                        H
Sbjct: 493 RASIEQLEL-GSRRKAALVVPAFETLHYRFSFPNSKAELLTLLDAGSLYTFRYXEWPQGH 551

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
             T++SRW  A  PY+++W+ D+EPY+V  RD PRYD RFVGFGWNK
Sbjct: 552 LSTDYSRWREAQAPYRVQWSADYEPYVVVPRDCPRYDPRFVGFGWNK 598



 Score =  254 bits (648), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 127/220 (57%), Positives = 159/220 (72%), Gaps = 6/220 (2%)

Query: 2   SSDEEPVIPT-CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQ 60
           +S  + ++P+ CE++ VAIVCAGYNS+R +VTL+KS+LFYRKNPLH HLITD +A NIL+
Sbjct: 41  ASSSQLLLPSKCEMLHVAIVCAGYNSSREIVTLMKSLLFYRKNPLHLHLITDAIARNILE 100

Query: 61  TLFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDT 120
           TLF TW VP V VSFY A+ +   VSWIPNKHYSG+YGL+KL LP +LP +LA+ IVLDT
Sbjct: 101 TLFRTWMVPAVVVSFYDAEELKPLVSWIPNKHYSGLYGLMKLVLPSILPLSLARVIVLDT 160

Query: 121 DVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFYLADSVVE 180
           DV F +DI +LWALF R    Q +GLVENQSDWYLG LWKNH+PWPALGRGF     ++ 
Sbjct: 161 DVTFFSDIVELWALFDRFSDXQVVGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILL- 219

Query: 181 DVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVI 220
              W+     +G   + KLT  + L  TL  T + D D+ 
Sbjct: 220 ---WLDRLQQTGWEQMWKLTAKREL-LTLTATSLADQDIF 255


>gi|298684005|gb|ADI96182.1| LARGE-2 [Mus musculus]
          Length = 653

 Score =  550 bits (1417), Expect = e-154,   Method: Compositional matrix adjust.
 Identities = 281/527 (53%), Positives = 349/527 (66%), Gaps = 65/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY A+ +   VSWI NKHYSG+YGL+KL LP +LP +LA+ IVLDTDV F++ I
Sbjct: 109 PAVVVXFYDAEELKPLVSWIXNKHYSGLYGLMKLVLPSILPXSLARVIVLDTDVTFSSXI 168

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
            +LWALF     +Q + LVENQSDWYLG LWKNH+PWPALGRGFNTGVILL L +L+   
Sbjct: 169 VELWALFDHFSDKQVVVLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILLWLDRLQQTG 228

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   W++ A++ LLT + TSLADQDIFNA+I EHP+LV+ LPC WNVQLSD+TR+ E CY
Sbjct: 229 WEQMWKVTAKRELLTLMATSLADQDIFNAVIKEHPHLVHPLPCVWNVQLSDHTRA-ERCY 287

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
            E  DLK+IHWNSPKKL+VKNKH EFFRNL+LTFL YDG LLRRELFGC          L
Sbjct: 288 LEAADLKVIHWNSPKKLRVKNKHAEFFRNLHLTFLGYDGKLLRRELFGCPNQFPPGAEQL 347

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           QQA L+ L+EE+PC++ R+ +LT  R H+ FL ++       +DVTLVAQLSMDR     
Sbjct: 348 QQA-LTQLDEEEPCFEFRQQQLTVHRVHITFLPHQPPPPQ-PHDVTLVAQLSMDR----- 400

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                       LQM+E L +HW GP+SL LY++D EAQQFL +
Sbjct: 401 ----------------------------LQMLEALCRHWPGPMSLALYLTDEEAQQFLHF 432

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
              S  L  R+++ YHVVY++G  YP+N LRNVAL Q  TPYVFL DIDFLP + LY YL
Sbjct: 433 VETSPVLSMRKDVAYHVVYRDGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYDYL 492

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           ++SI  +++     K  LVVPAFET  YR +FP S                        H
Sbjct: 493 RASIEQLELDSR-RKTALVVPAFETLHYRFSFPNSKAELLTLLDAGSLHTFRYHEWPQGH 551

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           + T++SRW  A  PY+++W+ D+EPY+V  RD P YD RFVGFGWNK
Sbjct: 552 SSTDYSRWREAQAPYRVQWSADYEPYVVVPRDCPXYDPRFVGFGWNK 598



 Score =  243 bits (619), Expect = 3e-61,   Method: Compositional matrix adjust.
 Identities = 123/222 (55%), Positives = 154/222 (69%), Gaps = 5/222 (2%)

Query: 2   SSDEEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQT 61
           SS +  + P CE++ VAIVCAGYNS+R ++TL KS+LFYRKNPLH HLITD VA NIL+T
Sbjct: 42  SSPQLLLPPKCEMLHVAIVCAGYNSSREIITLTKSLLFYRKNPLHLHLITDAVARNILET 101

Query: 62  LFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTD 121
           LF TW VP V V FY A+ +   VSWI NKHYSG+YGL+KL LP +LP +LA+ IVLDTD
Sbjct: 102 LFRTWMVPAVVVXFYDAEELKPLVSWIXNKHYSGLYGLMKLVLPSILPXSLARVIVLDTD 161

Query: 122 VIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFYLADSVVED 181
           V F++ I +LWALF     +Q + LVENQSDWYLG LWKNH+PWPALGRGF     ++  
Sbjct: 162 VTFSSXIVELWALFDHFSDKQVVVLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILL-- 219

Query: 182 VSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 223
             W+     +G   + K+T  + L  TL  T + D D+  A 
Sbjct: 220 --WLDRLQQTGWEQMWKVTAKREL-LTLMATSLADQDIFNAV 258


>gi|298683987|gb|ADI96173.1| LARGE-2 [Mastomys natalensis]
          Length = 653

 Score =  550 bits (1416), Expect = e-153,   Method: Compositional matrix adjust.
 Identities = 281/527 (53%), Positives = 347/527 (65%), Gaps = 65/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY A+ +   VSWIPNKHYSG+YGL+KL LP +LP +LA+ IVLDTDV F++DI
Sbjct: 109 PAVVVSFYDAEELKPLVSWIPNKHYSGLYGLMKLVLPSILPLSLARVIVLDTDVTFSSDI 168

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
            +LWALF R   +Q +GLV  Q DWYLG LWKNH+PWPALGRGFNTGVILL L +L+   
Sbjct: 169 VELWALFDRFSDKQVVGLVXXQXDWYLGNLWKNHRPWPALGRGFNTGVILLWLDRLQQTG 228

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   W++ A++ L T   TSLADQDIFNA++ EHP LV+ LPC WNVQLSD+T + E CY
Sbjct: 229 WEQMWKLTAKRELHTLTATSLADQDIFNAVLKEHPNLVHPLPCVWNVQLSDHTLA-ERCY 287

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
            E  DLK+IHWNSPKKL+VKNKH EFFR L+LTFL YDG LLRRELFGC          L
Sbjct: 288 LEAADLKVIHWNSPKKLRVKNKHAEFFRTLHLTFLGYDGKLLRRELFGCPNQFPPGAEQL 347

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           QQA L+ L+EE+PC++ R+ +LT  R H+ FL ++        DVTLVAQLSMDR     
Sbjct: 348 QQA-LAQLDEEEPCFEFRQQQLTVHRVHITFLPHQPPPPR-PYDVTLVAQLSMDR----- 400

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                       LQM+E L +HW GP+SL LY++DAEAQQFL +
Sbjct: 401 ----------------------------LQMLEALCRHWPGPMSLALYLTDAEAQQFLHF 432

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
              S  L +R+++ YHVVY++G  YP+N LRNVAL Q  TPYVFL DIDFLP + LY YL
Sbjct: 433 VETSPVLSARKDVAYHVVYQDGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYDYL 492

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           ++SI  +++ G   K  LVVPAFET  YR +FP S                        H
Sbjct: 493 RASIEQLEL-GSRRKAALVVPAFETLHYRFSFPNSKAELLTLLDAGSLYTFRYHEWPQGH 551

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
             T++SRW  A  PY+++W+ D+EPY+V  RD PRYD RFVGFGWNK
Sbjct: 552 LSTDYSRWREAQAPYRVQWSADYEPYVVVPRDCPRYDPRFVGFGWNK 598



 Score =  252 bits (643), Expect = 5e-64,   Method: Compositional matrix adjust.
 Identities = 125/223 (56%), Positives = 159/223 (71%), Gaps = 6/223 (2%)

Query: 2   SSDEEPVIPT-CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQ 60
           +S  + ++P+ CE++ VAIVCAGYNS+R L+TL+KS+LFYRKNPLH HLITD +A NIL+
Sbjct: 41  ASSSQLLLPSKCEMLHVAIVCAGYNSSRELITLMKSLLFYRKNPLHLHLITDAIARNILE 100

Query: 61  TLFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDT 120
           TLF TW VP V VSFY A+ +   VSWIPNKHYSG+YGL+KL LP +LP +LA+ IVLDT
Sbjct: 101 TLFRTWMVPAVVVSFYDAEELKPLVSWIPNKHYSGLYGLMKLVLPSILPLSLARVIVLDT 160

Query: 121 DVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFYLADSVVE 180
           DV F++DI +LWALF R   +Q +GLV  Q DWYLG LWKNH+PWPALGRGF     ++ 
Sbjct: 161 DVTFSSDIVELWALFDRFSDKQVVGLVXXQXDWYLGNLWKNHRPWPALGRGFNTGVILL- 219

Query: 181 DVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 223
              W+     +G   + KLT  + L  TL  T + D D+  A 
Sbjct: 220 ---WLDRLQQTGWEQMWKLTAKREL-HTLTATSLADQDIFNAV 258


>gi|281342977|gb|EFB18561.1| hypothetical protein PANDA_010467 [Ailuropoda melanoleuca]
          Length = 496

 Score =  550 bits (1416), Expect = e-153,   Method: Compositional matrix adjust.
 Identities = 270/455 (59%), Positives = 320/455 (70%), Gaps = 65/455 (14%)

Query: 238 RQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKF 297
           +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR + W   WR+ AE+ 
Sbjct: 4   QQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMKWEQMWRLTAERE 63

Query: 298 LLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
           L+  L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY +++DLK+IHWN
Sbjct: 64  LMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CYRDVSDLKVIHWN 122

Query: 358 SPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTLQQAELSNLNEED 413
           SPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC    ++    LQ+ +LS L+E+D
Sbjct: 123 SPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEADVNSENLQK-QLSELDEDD 181

Query: 414 PCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEINPNNVDEYEAS 473
            CY+ RR + T  RTHLYFL YEYE +SD  DVTLVAQLSMDRLQM+E            
Sbjct: 182 LCYEFRRERFTVHRTHLYFLHYEYEPASDNTDVTLVAQLSMDRLQMLEA----------- 230

Query: 474 SDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRN 533
                                 + KHWEGPISL LY+SDAEAQQFL YA  SE L SR N
Sbjct: 231 ----------------------ICKHWEGPISLALYLSDAEAQQFLRYAQGSEVLMSRHN 268

Query: 534 IGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGH 593
           + YH+VYKEG FYP+N LRNVA+  V TPY+FL DIDFLPM+GLY YL+ S+  +D+   
Sbjct: 269 VAYHIVYKEGQFYPVNLLRNVAMKHVGTPYMFLSDIDFLPMYGLYEYLRKSVIQLDL--A 326

Query: 594 GGKKVLVVPAFETQRYRTAFPAS------------------------HAPTNFSRWVNAT 629
             KK ++VPAFET RYR +FP S                        HAPTNF++W  AT
Sbjct: 327 NTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGHAPTNFAKWRTAT 386

Query: 630 TPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           TPY +EW  DFEPY+V  RD P YD RFVGFGWNK
Sbjct: 387 TPYHVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 421



 Score = 68.2 bits (165), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 26/32 (81%), Positives = 30/32 (93%)

Query: 141 RQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 4   QQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 35


>gi|298683977|gb|ADI96168.1| LARGE-2 [Mastomys kollmannspergeri]
          Length = 653

 Score =  549 bits (1415), Expect = e-153,   Method: Compositional matrix adjust.
 Identities = 282/527 (53%), Positives = 345/527 (65%), Gaps = 65/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY A+ +   VSWIPNKHYSG+YGL+KL LP +LP +LA+ IVLDTDV F +DI
Sbjct: 109 PAVVVSFYDAEELKPLVSWIPNKHYSGLYGLMKLVLPSILPLSLARVIVLDTDVTFFSDI 168

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
            +LWALF R   +Q +GLVENQSDWYLG LWKNH+PWPALGRGF TGVILL L +L+   
Sbjct: 169 VELWALFDRFSDKQVVGLVENQSDWYLGNLWKNHRPWPALGRGFXTGVILLWLDRLQQTG 228

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   W++ A++ LLT   TSLADQDIFN ++ EHP LV+ LPC WNVQLSD+T + E CY
Sbjct: 229 WEQMWKLTAKRELLTLTATSLADQDIFNVVLKEHPNLVHPLPCVWNVQLSDHTLA-ERCY 287

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
            E  DLK+IHWNSPKKL+VKNKH EFFRNL+LTFL YDG LLRRELFGC          L
Sbjct: 288 LEAADLKVIHWNSPKKLRVKNKHAEFFRNLHLTFLGYDGKLLRRELFGCPNQFPPGAEHL 347

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           QQA L+ L+EE+PC++ R+ +L   R H+ FL           DVTLVAQLSMDR     
Sbjct: 348 QQA-LAQLDEEEPCFEFRQQQLXVHRVHITFLP-HQPPPPRPYDVTLVAQLSMDR----- 400

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                       LQM+E L +HW GP+SL LY++DAEAQ FL +
Sbjct: 401 ----------------------------LQMLEALCRHWPGPMSLALYLTDAEAQLFLHF 432

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
              S  L +R+++ YHVVY++G  YP+N LRNVAL Q  TPYVFL DIDFLP + LY YL
Sbjct: 433 VETSPVLSARKDVAYHVVYQDGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYDYL 492

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           ++SI  +++ G   K  LVVPAFET  YR +FP S                        H
Sbjct: 493 RASIEQLEL-GSRRKAALVVPAFETLHYRFSFPNSKAELLTLLDAGSLYTFRYHEWPQGH 551

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
             T++SRW  A  PY+++W+ D+EPY+V  RD PRYD RFVGFGWNK
Sbjct: 552 LSTDYSRWREAQAPYRVQWSADYEPYVVVPRDCPRYDPRFVGFGWNK 598



 Score =  251 bits (640), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 126/220 (57%), Positives = 159/220 (72%), Gaps = 6/220 (2%)

Query: 2   SSDEEPVIPT-CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQ 60
           +S  + ++P+  E++ VAIVCAGYNS+R +VTL+KS+LFYRKNPLH HLITD +A NIL+
Sbjct: 41  ASSSQLLLPSKYEMLHVAIVCAGYNSSREIVTLMKSLLFYRKNPLHLHLITDAIARNILE 100

Query: 61  TLFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDT 120
           TLF TW VP V VSFY A+ +   VSWIPNKHYSG+YGL+KL LP +LP +LA+ IVLDT
Sbjct: 101 TLFRTWMVPAVVVSFYDAEELKPLVSWIPNKHYSGLYGLMKLVLPSILPLSLARVIVLDT 160

Query: 121 DVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFYLADSVVE 180
           DV F +DI +LWALF R   +Q +GLVENQSDWYLG LWKNH+PWPALGRGF     ++ 
Sbjct: 161 DVTFFSDIVELWALFDRFSDKQVVGLVENQSDWYLGNLWKNHRPWPALGRGFXTGVILL- 219

Query: 181 DVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVI 220
              W+     +G   + KLT  + L  TL  T + D D+ 
Sbjct: 220 ---WLDRLQQTGWEQMWKLTAKREL-LTLTATSLADQDIF 255


>gi|335288967|ref|XP_003126812.2| PREDICTED: glycosyltransferase-like protein LARGE1-like [Sus
           scrofa]
          Length = 618

 Score =  548 bits (1411), Expect = e-153,   Method: Compositional matrix adjust.
 Identities = 270/458 (58%), Positives = 321/458 (70%), Gaps = 65/458 (14%)

Query: 235 LRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIA 294
           L  +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR + W   WR+ A
Sbjct: 123 LTGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMKWEQMWRLTA 182

Query: 295 EKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKII 354
           E+ L+  L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY +++DLK+I
Sbjct: 183 ERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CYRDVSDLKVI 241

Query: 355 HWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTLQQAELSNLN 410
           HWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC    ++    LQ+ +LS L+
Sbjct: 242 HWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEADVNSENLQK-QLSELD 300

Query: 411 EEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEINPNNVDEY 470
           E+D CY+ RR + T  RTHLYFL YEYE +SD  DVTLVAQLSMDRLQM+E         
Sbjct: 301 EDDLCYEFRRERFTVHRTHLYFLHYEYEPASDNTDVTLVAQLSMDRLQMLEA-------- 352

Query: 471 EASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRS 530
                                    + KHWEGPISL LY+SDAEAQQFL YA  SE L  
Sbjct: 353 -------------------------ICKHWEGPISLALYLSDAEAQQFLRYAQGSEVLMG 387

Query: 531 RRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDM 590
           R N+ YH+VYKEG FYP+N LRNVA+  V TPY+FL DIDFLPM+GLY YL+ S+  +D+
Sbjct: 388 RHNVAYHIVYKEGQFYPVNLLRNVAMKHVGTPYMFLSDIDFLPMYGLYEYLRKSVIQLDL 447

Query: 591 HGHGGKKVLVVPAFETQRYRTAFPAS------------------------HAPTNFSRWV 626
                KK ++VPAFET RYR +FP S                        HAPTNF++W 
Sbjct: 448 --ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGHAPTNFAKWR 505

Query: 627 NATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
            ATTPY++EW  DFEPY+V  RD P YD RFVGFGWNK
Sbjct: 506 TATTPYRVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 543



 Score = 67.8 bits (164), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 27/35 (77%), Positives = 31/35 (88%)

Query: 138 LRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           L  +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 123 LTGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 157


>gi|168278675|dbj|BAG11217.1| glycosyltransferase-like protein LARGE1 [synthetic construct]
 gi|168985116|emb|CAQ09434.1| like-glycosyltransferase [Homo sapiens]
          Length = 704

 Score =  546 bits (1406), Expect = e-152,   Method: Compositional matrix adjust.
 Identities = 273/523 (52%), Positives = 332/523 (63%), Gaps = 109/523 (20%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ FATDI
Sbjct: 192 PAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 251

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR + 
Sbjct: 252 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 311

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   WR+ AE+ L+  L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 312 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 370

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQTTLQQAE 405
            +++DLK+                                                 Q +
Sbjct: 371 RDVSDLKL-------------------------------------------------QKQ 381

Query: 406 LSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEINPN 465
           LS L+E+D CY+ RR + T  RTHLYFL YEYE ++                        
Sbjct: 382 LSELDEDDLCYEFRRERFTVHRTHLYFLHYEYEPAA------------------------ 417

Query: 466 NVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNS 525
                    D  DVTLVAQLSMDRLQM+E + KHWEGPISL LY+SDAEAQQFL YA  S
Sbjct: 418 ---------DSTDVTLVAQLSMDRLQMLEAICKHWEGPISLALYLSDAEAQQFLRYAQGS 468

Query: 526 EALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSI 585
           E L SR N+GYH+VYKEG FYP+N LRNVA+  +STPY+FL DIDFLPM+GLY YL+ S+
Sbjct: 469 EVLMSRHNVGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLYEYLRKSV 528

Query: 586 RSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------HAPTN 621
             +D+     KK ++VPAFET RYR +FP S                        HAPTN
Sbjct: 529 IQLDL--ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGHAPTN 586

Query: 622 FSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           F++W  ATTPY++EW  DFEPY+V  RD P YD RFVGFGWNK
Sbjct: 587 FAKWRTATTPYRVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 629



 Score =  257 bits (656), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 111/168 (66%), Positives = 137/168 (81%)

Query: 5   EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
           ++PV+  CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A  IL TLF 
Sbjct: 128 QQPVVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 187

Query: 65  TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
           TW VP V V FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ F
Sbjct: 188 TWMVPAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 247

Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 248 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 295


>gi|58257733|dbj|BAA25535.3| KIAA0609 protein [Homo sapiens]
          Length = 769

 Score =  544 bits (1402), Expect = e-152,   Method: Compositional matrix adjust.
 Identities = 273/523 (52%), Positives = 332/523 (63%), Gaps = 109/523 (20%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ FATDI
Sbjct: 257 PAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 316

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR + 
Sbjct: 317 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 376

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   WR+ AE+ L+  L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 377 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 435

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQTTLQQAE 405
            +++DLK+                                                 Q +
Sbjct: 436 RDVSDLKL-------------------------------------------------QKQ 446

Query: 406 LSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEINPN 465
           LS L+E+D CY+ RR + T  RTHLYFL YEYE ++                        
Sbjct: 447 LSELDEDDLCYEFRRERFTVHRTHLYFLHYEYEPAA------------------------ 482

Query: 466 NVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNS 525
                    D  DVTLVAQLSMDRLQM+E + KHWEGPISL LY+SDAEAQQFL YA  S
Sbjct: 483 ---------DSTDVTLVAQLSMDRLQMLEAICKHWEGPISLALYLSDAEAQQFLRYAQGS 533

Query: 526 EALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSI 585
           E L SR N+GYH+VYKEG FYP+N LRNVA+  +STPY+FL DIDFLPM+GLY YL+ S+
Sbjct: 534 EVLMSRHNVGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLYEYLRKSV 593

Query: 586 RSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------HAPTN 621
             +D+     KK ++VPAFET RYR +FP S                        HAPTN
Sbjct: 594 IQLDL--ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGHAPTN 651

Query: 622 FSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           F++W  ATTPY++EW  DFEPY+V  RD P YD RFVGFGWNK
Sbjct: 652 FAKWRTATTPYRVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 694



 Score =  257 bits (656), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 111/168 (66%), Positives = 137/168 (81%)

Query: 5   EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
           ++PV+  CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A  IL TLF 
Sbjct: 193 QQPVVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 252

Query: 65  TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
           TW VP V V FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ F
Sbjct: 253 TWMVPAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 312

Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 313 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 360


>gi|298683975|gb|ADI96167.1| LARGE-2 [Mastomys kollmannspergeri]
          Length = 653

 Score =  543 bits (1399), Expect = e-151,   Method: Compositional matrix adjust.
 Identities = 280/527 (53%), Positives = 344/527 (65%), Gaps = 65/527 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY A+ +   VSWIPNKHYSG+YGL+KL LP +LP +LA+ IVLDTDV F +DI
Sbjct: 109 PAVVVSFYDAEELKPLVSWIPNKHYSGLYGLMKLVLPSILPLSLARVIVLDTDVTFFSDI 168

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
            +LWALF R    Q +GL E QS WYLG LWKNH+PWPALGRGFNTGVILL L +L+   
Sbjct: 169 VELWALFDRFSDXQXVGLXEXQSXWYLGNLWKNHRPWPALGRGFNTGVILLWLDRLQQTG 228

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   W++ A++ LLT   TSLADQDIFN ++ EHP LV+ LPC WNVQLSD+T + E CY
Sbjct: 229 WEQMWKLTAKRELLTLTATSLADQDIFNVVLKEHPNLVHPLPCVWNVQLSDHTLA-ERCY 287

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
            E  DLK+IHWNSPKKL+VKNKH EFFRNL+LTFL YDG LLRRELFGC          L
Sbjct: 288 LEAADLKVIHWNSPKKLRVKNKHAEFFRNLHLTFLGYDGKLLRRELFGCPNQFPPGAEHL 347

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           QQA L+ L+EE+PC++ R+ +L   R H+ FL ++        DVTLVAQLSMDR     
Sbjct: 348 QQA-LAQLDEEEPCFEFRQQQLXVHRVHITFLPHQPPPPR-PYDVTLVAQLSMDR----- 400

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                       LQM+E L +HW GP+SL LY++DAEAQ FL +
Sbjct: 401 ----------------------------LQMLEALCRHWPGPMSLALYLTDAEAQLFLHF 432

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
              S  L +R+++ YHVVY++G  YP+N LRNVAL Q  TPYVFL DIDFLP + LY YL
Sbjct: 433 VETSPVLSARKDVAYHVVYQDGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYDYL 492

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           ++SI  +++ G   K  LVVPAFET  YR +FP S                        H
Sbjct: 493 RASIEQLEL-GSRRKAALVVPAFETLHYRFSFPNSKAELLTLLDAGSLYTFRYHEWPQGH 551

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
             T++SRW  A  PY+++W+ D+EPY+V  RD PRYD RFVGFGWNK
Sbjct: 552 LSTDYSRWREAQAPYRVQWSADYEPYVVVPRDCPRYDPRFVGFGWNK 598



 Score =  246 bits (628), Expect = 3e-62,   Method: Compositional matrix adjust.
 Identities = 124/220 (56%), Positives = 156/220 (70%), Gaps = 6/220 (2%)

Query: 2   SSDEEPVIPT-CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQ 60
           +S  + ++P+ CE++ VAIVCAGYNS+R +VTL+KS+LFYRKNPLH HLITD +A NIL+
Sbjct: 41  ASSSQLLLPSKCEMLHVAIVCAGYNSSREIVTLMKSLLFYRKNPLHLHLITDAIARNILE 100

Query: 61  TLFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDT 120
           TLF TW VP V VSFY A+ +   VSWIPNKHYSG+YGL+KL LP +LP +LA+ IVLDT
Sbjct: 101 TLFRTWMVPAVVVSFYDAEELKPLVSWIPNKHYSGLYGLMKLVLPSILPLSLARVIVLDT 160

Query: 121 DVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFYLADSVVE 180
           DV F +DI +LWALF R    Q +GL E QS WYLG LWKNH+PWPALGRGF     ++ 
Sbjct: 161 DVTFFSDIVELWALFDRFSDXQXVGLXEXQSXWYLGNLWKNHRPWPALGRGFNTGVILL- 219

Query: 181 DVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVI 220
              W+     +G   + KLT  + L  TL  T + D D+ 
Sbjct: 220 ---WLDRLQQTGWEQMWKLTAKREL-LTLTATSLADQDIF 255


>gi|301627879|ref|XP_002943094.1| PREDICTED: glycosyltransferase-like protein LARGE1-like, partial
           [Xenopus (Silurana) tropicalis]
          Length = 540

 Score =  538 bits (1387), Expect = e-150,   Method: Compositional matrix adjust.
 Identities = 270/523 (51%), Positives = 333/523 (63%), Gaps = 109/523 (20%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY AD +  +VSWIPNKHYSG+YGL+KL L K LP +L + IVLDTD+ FATDI
Sbjct: 28  PAVRVDFYDADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPASLERVIVLDTDITFATDI 87

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LWA+F + +++Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR + 
Sbjct: 88  AELWAVFHKFKRQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 147

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   WR+ AE+ L++ L TSLADQDIFNA+I ++P+LV+ LPC WNVQLSD+TRS++ CY
Sbjct: 148 WEQMWRLTAERELMSMLSTSLADQDIFNAVIKQNPFLVHQLPCFWNVQLSDHTRSEQ-CY 206

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQTTLQQAE 405
            +++DLK+                                                 Q +
Sbjct: 207 RDVSDLKL-------------------------------------------------QKQ 217

Query: 406 LSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEINPN 465
           LS L+E+D CY+ RR + T  RTHLYFL YEY+ + D                       
Sbjct: 218 LSELDEDDLCYEFRRERFTVHRTHLYFLHYEYKPAGDDT--------------------- 256

Query: 466 NVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNS 525
                       DVTLVAQLSMDRLQM+E + KHWEGPISL LY+SDAEAQQFL YA  S
Sbjct: 257 ------------DVTLVAQLSMDRLQMLEAICKHWEGPISLALYLSDAEAQQFLRYAQGS 304

Query: 526 EALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSI 585
           E L +RRN+GYH+VYKEG FYP+N LRNVA+  VSTPY+FL DIDFLPM+GLY  L+ ++
Sbjct: 305 EVLLNRRNVGYHIVYKEGQFYPVNLLRNVAMKHVSTPYMFLSDIDFLPMYGLYESLRKAV 364

Query: 586 RSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------HAPTN 621
             +DM     KK L++PAFET RYR +FP S                        H PTN
Sbjct: 365 VQLDM--KITKKALIIPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGHGPTN 422

Query: 622 FSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           F++W  ATTPY++EW  DFEPY+V  RD P YD RFVGFGWNK
Sbjct: 423 FAKWRTATTPYRVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 465



 Score =  204 bits (518), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 89/131 (67%), Positives = 107/131 (81%)

Query: 42  KNPLHFHLITDTVALNILQTLFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLK 101
           +NPLHFHLI DT+A  IL TLF TW VP V V FY AD +  +VSWIPNKHYSG+YGL+K
Sbjct: 1   RNPLHFHLIADTIAKQILATLFQTWMVPAVRVDFYDADELKSEVSWIPNKHYSGIYGLMK 60

Query: 102 LTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKN 161
           L L K LP +L + IVLDTD+ FATDIA+LWA+F + +++Q +GLVENQSDWYLG LWKN
Sbjct: 61  LVLTKTLPASLERVIVLDTDITFATDIAELWAVFHKFKRQQVLGLVENQSDWYLGNLWKN 120

Query: 162 HKPWPALGRGF 172
           H+PWPALGRG+
Sbjct: 121 HRPWPALGRGY 131


>gi|344280770|ref|XP_003412155.1| PREDICTED: LOW QUALITY PROTEIN: glycosyltransferase-like protein
           LARGE2-like [Loxodonta africana]
          Length = 746

 Score =  538 bits (1385), Expect = e-150,   Method: Compositional matrix adjust.
 Identities = 284/532 (53%), Positives = 345/532 (64%), Gaps = 68/532 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY        VSWIPN HYSG+Y LLKL LP  LP  LA+ IVLDTDV FA+DI
Sbjct: 149 PAVRVSFYNTSKFQSQVSWIPNTHYSGLYELLKLVLPDALPPDLARVIVLDTDVTFASDI 208

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LWA F+    +Q IGLVENQS+WYLG LWKNH+PWPALGRGFNTGVILL L +LR   
Sbjct: 209 AELWAFFAHFSDKQMIGLVENQSNWYLGNLWKNHRPWPALGRGFNTGVILLWLDRLRQAG 268

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRS----- 340
           W   W++ A + LLT   TSLADQDIFNA+I EHP LV  LPC WNVQLSD+T +     
Sbjct: 269 WEQMWKVTARRLLLTLSSTSLADQDIFNAVIKEHPGLVLLLPCLWNVQLSDHTLALSDHT 328

Query: 341 -DELCYTELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLT-- 397
             E CY+E +DLK+IHWNSPKKL+VKNK +EFFRNL+LTFLEYDGNLLRRELFGC     
Sbjct: 329 LAERCYSEASDLKVIHWNSPKKLRVKNKDVEFFRNLHLTFLEYDGNLLRRELFGCPSQPP 388

Query: 398 -QTTLQQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDR 456
            Q   +Q  L++L+EED C++ ++ +LT  R HL FL  +       +DVTLVAQLS+DR
Sbjct: 389 PQAEQRQRALAHLDEEDACFEFQQQRLTVHRVHLTFLPLDLLPPE-LHDVTLVAQLSVDR 447

Query: 457 LQMVEINPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQ 516
           L                                 QM+E L  HW GP+SL LY++DAEAQ
Sbjct: 448 L---------------------------------QMLEALCSHWPGPMSLALYLTDAEAQ 474

Query: 517 QFLSYAGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFG 576
           QFL +   S+ L +R N+ YHVVY+EG  YP+N LRNVAL Q  T +VFL DIDFLP + 
Sbjct: 475 QFLRFVEASKVLSARSNVAYHVVYREGPLYPVNHLRNVALAQARTSHVFLSDIDFLPAYS 534

Query: 577 LYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS-------------------- 616
           LY YL++SI  +D+     K  LVVPAFE+  YR +FP S                    
Sbjct: 535 LYNYLRASIEQLDLSSQ-RKTALVVPAFESLHYRLSFPTSKAELLALLDAGSLHTFRYDV 593

Query: 617 ----HAPTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
               H PT+++RW  A  PYQ++WA D+EPY+V  R+ PRYD+RFVGFGWNK
Sbjct: 594 WPQGHTPTDYARWREAQAPYQVQWAADYEPYVVVPRECPRYDSRFVGFGWNK 645



 Score =  248 bits (634), Expect = 6e-63,   Method: Compositional matrix adjust.
 Identities = 113/165 (68%), Positives = 131/165 (79%)

Query: 8   VIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWS 67
           ++P CE++ VAIVCAGYNS+R +VTL+KS+LFYRKNPLH HL+TD VA NIL+TLF TW 
Sbjct: 88  LLPKCELLHVAIVCAGYNSSRDVVTLVKSMLFYRKNPLHLHLVTDEVARNILETLFHTWM 147

Query: 68  VPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATD 127
           VP V VSFY        VSWIPN HYSG+Y LLKL LP  LP  LA+ IVLDTDV FA+D
Sbjct: 148 VPAVRVSFYNTSKFQSQVSWIPNTHYSGLYELLKLVLPDALPPDLARVIVLDTDVTFASD 207

Query: 128 IAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           IA+LWA F+    +Q IGLVENQS+WYLG LWKNH+PWPALGRGF
Sbjct: 208 IAELWAFFAHFSDKQMIGLVENQSNWYLGNLWKNHRPWPALGRGF 252


>gi|449666871|ref|XP_002154954.2| PREDICTED: glycosyltransferase-like protein LARGE2-like [Hydra
           magnipapillata]
          Length = 727

 Score =  531 bits (1368), Expect = e-148,   Method: Compositional matrix adjust.
 Identities = 270/535 (50%), Positives = 343/535 (64%), Gaps = 77/535 (14%)

Query: 163 KPW--PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVI 220
           K W    +G  FY A+ +  DV WIPN HYSGVYGL+KLTL + LPE L+K IVLDTDV 
Sbjct: 171 KTWVLKQVGVSFYDAEKLKADVDWIPNTHYSGVYGLMKLTLTRALPEFLSKVIVLDTDVF 230

Query: 221 FATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTK 280
           F TD+A+LWA F+   + Q IGLVENQS WY GKLWK +K WPA+GRGFNTGV+L DL K
Sbjct: 231 FLTDLAELWAFFNNFTEDQAIGLVENQSQWYTGKLWKKYKIWPAIGRGFNTGVMLFDLQK 290

Query: 281 LRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRS 340
           LR   WA  WR+ AEK LL  L T LADQD+ NA + ++P +VY LPCQWN+QLSDNT S
Sbjct: 291 LRKFQWAHLWRLTAEKQLLNLLSTVLADQDVINAALKDNPQIVYKLPCQWNIQLSDNTES 350

Query: 341 DELCYTELTDLKIIHWNSPKK-LKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----- 394
            E CY +L +LK IHWNSP K    K KH+E+FRN+YLTFLEY+GNLLR+ +F C     
Sbjct: 351 -EYCYNKLIELKAIHWNSPNKHTGNKLKHVEYFRNMYLTFLEYNGNLLRKSIFSCLIPGE 409

Query: 395 NLTQTTLQQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSM 454
           N T+T+ ++    NL++ D CYD++  + T+ R HLY+++Y+Y+                
Sbjct: 410 NATETSKEK----NLSDNDQCYDIKIEQSTTRRVHLYYMDYDYKP--------------- 450

Query: 455 DRLQMVEINPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAE 514
                               +  DVTL+  LSMDR+Q++E+L +HWEGP+S++LY SDAE
Sbjct: 451 --------------------EVGDVTLMTHLSMDRIQILELLSQHWEGPMSISLYASDAE 490

Query: 515 AQQFLSYAGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPM 574
           AQQF  Y+  S  LR R+NIG H+VYK+G+ YP+N LRNVAL  V T YVFL DIDFLPM
Sbjct: 491 AQQFTRYSSVSSFLRKRKNIGLHIVYKDGDLYPVNYLRNVALEHVQTTYVFLSDIDFLPM 550

Query: 575 FGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------ 616
            GLY YL+      +M     KK +VVPAFET +Y+  +P S                  
Sbjct: 551 IGLYNYLREIANITNME----KKAVVVPAFETYQYKLDYPQSKKQLIEYWGSQKVDTFRG 606

Query: 617 ------HAPTNFSRWVNATTPYQIEWAPDFEPYIVAHR-DLPRYDTRFVGFGWNK 664
                 HA T+F RW  ++T Y++ W  DFEPYI+  +  +P YD RFVGFGWNK
Sbjct: 607 EVWQQGHAATDFQRWKTSSTDYEVRWEVDFEPYILVKKSSIPLYDMRFVGFGWNK 661



 Score =  233 bits (595), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 116/215 (53%), Positives = 151/215 (70%), Gaps = 5/215 (2%)

Query: 9   IPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSV 68
           +P C+VI +AI+CAGY  T+ +VTLIKSILFYR++PLHFH I+D    ++LQ LF TW +
Sbjct: 116 LPDCQVIHIAIICAGYKETQRVVTLIKSILFYRRHPLHFHFISDISGRHVLQVLFKTWVL 175

Query: 69  PQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 128
            QV VSFY A+ +  DV WIPN HYSGVYGL+KLTL + LPE L+K IVLDTDV F TD+
Sbjct: 176 KQVGVSFYDAEKLKADVDWIPNTHYSGVYGLMKLTLTRALPEFLSKVIVLDTDVFFLTDL 235

Query: 129 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFYLADSVVEDVSWIPNK 188
           A+LWA F+   + Q IGLVENQS WY GKLWK +K WPA+GRGF     ++ D+  +   
Sbjct: 236 AELWAFFNNFTEDQAIGLVENQSQWYTGKLWKKYKIWPAIGRGFNTG-VMLFDLQKLRKF 294

Query: 189 HYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 223
            ++ ++   +LT  K L   L+ T++ D DVI A 
Sbjct: 295 QWAHLW---RLTAEKQLLNLLS-TVLADQDVINAA 325


>gi|426246004|ref|XP_004016789.1| PREDICTED: LOW QUALITY PROTEIN: glycosyltransferase-like protein
           LARGE2, partial [Ovis aries]
          Length = 646

 Score =  530 bits (1366), Expect = e-148,   Method: Compositional matrix adjust.
 Identities = 280/513 (54%), Positives = 345/513 (67%), Gaps = 57/513 (11%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY AD +   VSWIPNKHYSG+YGL+KL LP  LP  LA+ IVLDTDV FA++I
Sbjct: 130 PAVQISFYDADELKPLVSWIPNKHYSGLYGLMKLVLPSALPPDLARVIVLDTDVTFASNI 189

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LWALF+    +Q IGLVENQSDWYLG LWKNH+PWPALGRGFNTGVILL L +LR   
Sbjct: 190 AELWALFAHFSDKQAIGLVENQSDWYLGNLWKNHRPWPALGRGFNTGVILLRLDRLRQAG 249

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   W++ A++  +T    +++ QDIFNAII EHP+LV  LPC WNVQLSD+T + E CY
Sbjct: 250 WEQMWKLTAKRGGVT---ATVSAQDIFNAIIKEHPWLVQPLPCMWNVQLSDHTLA-ERCY 305

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQTT----L 401
           +E +DL++IHWNSPKKL VKNKH+E FRNLYLTFLEY+GNLLRRELFGC          L
Sbjct: 306 SEASDLQVIHWNSPKKLHVKNKHVESFRNLYLTFLEYNGNLLRRELFGCPCPPPPGAEQL 365

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           QQA L+ L E+D C + R+ +L   R H+ FL ++       +DVTLVAQLSM+RL    
Sbjct: 366 QQA-LAQLEEQDACLEFRQQQLAVHRVHITFLPHKPPPPQP-HDVTLVAQLSMERL---- 419

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                        QM+E L +HW GP+SL LY++DAEAQQFL +
Sbjct: 420 -----------------------------QMLEALCRHWPGPMSLALYLTDAEAQQFLRF 450

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
              S  L +R+++ YHVVY+EG+ YPIN LRNVAL Q  TPYVFL DIDFLP + LY YL
Sbjct: 451 VETSAVLSARQDVAYHVVYREGSLYPINQLRNVALAQALTPYVFLSDIDFLPAYSLYSYL 510

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPASHAPT----------NFSRWVNATTP 631
           ++SI  +++ G   K  LVVPAFET  YR +FP+S A             F RW      
Sbjct: 511 RASIEQLELGGK-RKAALVVPAFETLHYRFSFPSSKAELLTLLDAGSLYTF-RWXRGH-- 566

Query: 632 YQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           ++++WA D+EPY+V  RD PRYD RFVGFGWNK
Sbjct: 567 WRVQWAADYEPYVVVPRDCPRYDPRFVGFGWNK 599



 Score =  254 bits (649), Expect = 9e-65,   Method: Compositional matrix adjust.
 Identities = 114/161 (70%), Positives = 136/161 (84%)

Query: 12  CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQV 71
           CE++QVAIVCAG+NS+R ++TL+KS+LFYRKNPLH HL+TD VA NIL+TLF TW VP V
Sbjct: 73  CELLQVAIVCAGHNSSRDVITLVKSLLFYRKNPLHLHLVTDAVARNILETLFHTWMVPAV 132

Query: 72  EVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQL 131
           ++SFY AD +   VSWIPNKHYSG+YGL+KL LP  LP  LA+ IVLDTDV FA++IA+L
Sbjct: 133 QISFYDADELKPLVSWIPNKHYSGLYGLMKLVLPSALPPDLARVIVLDTDVTFASNIAEL 192

Query: 132 WALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           WALF+    +Q IGLVENQSDWYLG LWKNH+PWPALGRGF
Sbjct: 193 WALFAHFSDKQAIGLVENQSDWYLGNLWKNHRPWPALGRGF 233


>gi|196015485|ref|XP_002117599.1| hypothetical protein TRIADDRAFT_32796 [Trichoplax adhaerens]
 gi|190579768|gb|EDV19857.1| hypothetical protein TRIADDRAFT_32796 [Trichoplax adhaerens]
          Length = 618

 Score =  518 bits (1333), Expect = e-144,   Method: Compositional matrix adjust.
 Identities = 256/528 (48%), Positives = 338/528 (64%), Gaps = 69/528 (13%)

Query: 171 GFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWA 230
            FY   S+ ++++WIPN+HYSG+YGL+KL L  +LP +L + IVLDTD+ F  DIA LW 
Sbjct: 60  SFYNLQSIKKEIAWIPNRHYSGIYGLMKLILITILPNSLKRVIVLDTDLTFNADIADLWY 119

Query: 231 LFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFW 290
            F  L + + +GLVENQSDWYLGKLWKNH PWPALGRGFNTGV+L++L+KLR  +W   W
Sbjct: 120 YFRYLTKDKILGLVENQSDWYLGKLWKNHTPWPALGRGFNTGVMLINLSKLRQNNWNYIW 179

Query: 291 RIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTD 350
           + + EK L T  +T+LADQDI NA+I  +P LVY LPC WN+QLSDNTRS + CY + ++
Sbjct: 180 KEVTEKQLKTLTYTTLADQDIINAVIKSYPKLVYRLPCSWNIQLSDNTRS-QTCYKKSSE 238

Query: 351 LKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC---NLTQTTLQQAEL- 406
           +KIIHWNSPKK+ V+NKH+++FRN+Y TF+E DGNLLRREL GC   NL  T+++   + 
Sbjct: 239 IKIIHWNSPKKINVRNKHIQYFRNIYQTFIELDGNLLRRELIGCEKSNLKPTSVRTRNMH 298

Query: 407 ------SNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMV 460
                 SN N  DPCY  R     S R H Y L+Y+Y+                      
Sbjct: 299 VQLKIVSNDNLCDPCYQFRVEAYISHRIHPYLLDYKYQ---------------------- 336

Query: 461 EINPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLS 520
              PN         D  DVTLV  LS+DRLQM++ L KHW GP+S+ LY++D +     S
Sbjct: 337 ---PN---------DNYDVTLVTHLSIDRLQMLDGLAKHWTGPMSIALYVTDRDVPSIRS 384

Query: 521 YAGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPY 580
           +  +S  L +R+NIGYH+VY +G+ YP+N LRN+AL+ ++T YVFLLDIDFLP    Y Y
Sbjct: 385 FIQHSNNLSNRKNIGYHIVYIQGDCYPVNKLRNIALDYITTDYVFLLDIDFLPSHNSYQY 444

Query: 581 LKSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------ 616
           L+     + +  +  K  L+VPAFETQRYR  FP S                        
Sbjct: 445 LRYYFDFLSLQHNFNKAALIVPAFETQRYRVNFPESKERLIALLNKQELFTFRYHLYTKA 504

Query: 617 HAPTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           H+PTN+++W  A   YQ++W  +FEPYIV  +++ RYD RF+GFGWNK
Sbjct: 505 HSPTNYTKWQTANETYQVQWQENFEPYIVVTKNVARYDERFIGFGWNK 552



 Score =  210 bits (534), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 94/158 (59%), Positives = 119/158 (75%)

Query: 15  IQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQVEVS 74
           I VAIV  GY+++R ++TLIKS+LFYR +PLHFH I D  A NIL   F TW +  V++S
Sbjct: 1   IHVAIVVGGYDASRDVITLIKSVLFYRNHPLHFHFIADPSAQNILSKHFFTWQLHHVDIS 60

Query: 75  FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 134
           FY   S+ ++++WIPN+HYSG+YGL+KL L  +LP +L + IVLDTD+ F  DIA LW  
Sbjct: 61  FYNLQSIKKEIAWIPNRHYSGIYGLMKLILITILPNSLKRVIVLDTDLTFNADIADLWYY 120

Query: 135 FSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           F  L + + +GLVENQSDWYLGKLWKNH PWPALGRGF
Sbjct: 121 FRYLTKDKILGLVENQSDWYLGKLWKNHTPWPALGRGF 158


>gi|441618397|ref|XP_003264907.2| PREDICTED: LOW QUALITY PROTEIN: glycosyltransferase-like protein
            LARGE1-like [Nomascus leucogenys]
          Length = 1173

 Score =  499 bits (1284), Expect = e-138,   Method: Compositional matrix adjust.
 Identities = 265/527 (50%), Positives = 324/527 (61%), Gaps = 94/527 (17%)

Query: 166  PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
            PA+   FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ FATDI
Sbjct: 638  PAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 697

Query: 226  AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
            A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTG     L+      
Sbjct: 698  AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGQPFSSLSLS---- 753

Query: 286  WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
                        LL +L    +D      ++++H  L  T      +   D+T       
Sbjct: 754  ----------FSLLRKLTFPQSDPHPSTFLLTQH--LAST-----GIIRKDDT------- 789

Query: 346  TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
              +  L +IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC    ++    L
Sbjct: 790  --IHSLDVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEADVNSENL 847

Query: 402  QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
            Q+ +LS L+E+D CY+ RR + T  RTHLYFL YEYE ++                    
Sbjct: 848  QK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEYEPAA-------------------- 886

Query: 462  INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                         D  DVTLVAQLSMDRLQM+E + KHWEGPISL LY+SDAEAQQFL Y
Sbjct: 887  -------------DSTDVTLVAQLSMDRLQMLEAICKHWEGPISLALYLSDAEAQQFLRY 933

Query: 522  AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
            A  SE L SR N+GYH+VYKEG FYP+N LRNVA+  +S PY+      F    GL  YL
Sbjct: 934  AQGSEVLMSRHNVGYHIVYKEGQFYPVNLLRNVAMKHISIPYMLTCATHFXATKGLXEYL 993

Query: 582  KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
            + S+  +D+     KK ++VPAFET RYR +FP S                        H
Sbjct: 994  RKSVIQLDLANT--KKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGH 1051

Query: 618  APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
            APTNF++W  ATTPY++EW  DFEPY+V  RD P YD RFVGFGWNK
Sbjct: 1052 APTNFAKWRTATTPYRVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 1098



 Score =  256 bits (654), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 111/168 (66%), Positives = 137/168 (81%)

Query: 5   EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
           ++PV+  CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A  IL TLF 
Sbjct: 574 QQPVVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 633

Query: 65  TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
           TW VP V V FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ F
Sbjct: 634 TWMVPAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 693

Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 694 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 741


>gi|56403760|emb|CAI29669.1| hypothetical protein [Pongo abelii]
          Length = 463

 Score =  491 bits (1265), Expect = e-136,   Method: Compositional matrix adjust.
 Identities = 244/425 (57%), Positives = 293/425 (68%), Gaps = 65/425 (15%)

Query: 268 GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLP 327
           G+NTGVILL L KLR + W   WR+ AE+ L+  L TSLADQDIFNA I ++P+LVY LP
Sbjct: 1   GYNTGVILLLLDKLRKMKWEQMWRLTAERELMGMLSTSLADQDIFNAAIKQNPFLVYQLP 60

Query: 328 CQWNVQLSDNTRSDELCYTELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLL 387
           C WNVQLSD+TRS++ CY +++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLL
Sbjct: 61  CFWNVQLSDHTRSEQ-CYRDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLL 119

Query: 388 RRELFGC----NLTQTTLQQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDG 443
           RRELFGC    ++    LQ+ +LS L+E+D CY+ RR + T  RTHLYFL YEYE ++  
Sbjct: 120 RRELFGCPSEADVNSENLQK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEYEPAA-- 176

Query: 444 NDVTLVAQLSMDRLQMVEINPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGP 503
                                          D  DVTLVAQLSMDRLQM+E + KHWEGP
Sbjct: 177 -------------------------------DSTDVTLVAQLSMDRLQMLEAICKHWEGP 205

Query: 504 ISLTLYMSDAEAQQFLSYAGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPY 563
           ISL LY+SDAEAQQFL YA  SE L SR N+GYH+VYKEG FYP+N LRNVA+  +STPY
Sbjct: 206 ISLALYLSDAEAQQFLRYAQGSEVLMSRHNVGYHIVYKEGQFYPVNLLRNVAMKHISTPY 265

Query: 564 VFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------- 616
           +FL D DFLPM+GLY YL+ S+  +D+     KK ++VPAFET RYR +FP S       
Sbjct: 266 MFLSDTDFLPMYGLYEYLRKSVIQLDL--ANTKKAMIVPAFETLRYRLSFPKSKAELLSM 323

Query: 617 -----------------HAPTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVG 659
                            HAPTNF++W  ATTPY++EW  DFEPY+V  RD P YD RFVG
Sbjct: 324 LDMGTLFTFRYHVWTKDHAPTNFAKWRTATTPYRVEWEADFEPYVVVRRDCPEYDRRFVG 383

Query: 660 FGWNK 664
           FGWNK
Sbjct: 384 FGWNK 388


>gi|354469850|ref|XP_003497335.1| PREDICTED: glycosyltransferase-like protein LARGE2-like isoform 2
           [Cricetulus griseus]
          Length = 655

 Score =  488 bits (1257), Expect = e-135,   Method: Compositional matrix adjust.
 Identities = 259/527 (49%), Positives = 326/527 (61%), Gaps = 100/527 (18%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY A+ +   +SWIPNKHYSG+YGL+KL LP +LP +LA+ IVLDTDV F++DI
Sbjct: 122 PAVVIRFYDAEELKPLISWIPNKHYSGLYGLMKLVLPSILPLSLARVIVLDTDVTFSSDI 181

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LWALF+                                     +GVILL L +LR   
Sbjct: 182 AELWALFAHF-----------------------------------SGVILLWLDRLRQTG 206

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   W++IA++ LLT + TSLADQD+FNA+I EHP LV+ LPC WNVQLSD+T + E CY
Sbjct: 207 WEQMWKLIAKQELLTLMATSLADQDVFNAVIKEHPELVHPLPCVWNVQLSDHTLA-ERCY 265

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTL 401
            E  DLK+IHWNSPKKL+VKNKH EFFRNL+LTFL YDG LLRRELFGC          L
Sbjct: 266 LEAADLKVIHWNSPKKLRVKNKHAEFFRNLHLTFLGYDGKLLRRELFGCPNQLPPGAEQL 325

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           QQA L+ L+EE+PC++ R+ +L+  R H+ FL ++       +DVTLVAQLSMDR     
Sbjct: 326 QQA-LAQLDEEEPCFEFRQQQLSVHRVHITFLPHQPPPPR-PHDVTLVAQLSMDR----- 378

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                       LQM+E L +HW GP+SL LY++DAEAQQFL +
Sbjct: 379 ----------------------------LQMLEALCRHWPGPMSLALYLTDAEAQQFLHF 410

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
              S  L +R+++ YH+VY+EG  YP+N LRNVAL Q  TPYVFL DIDFLP + LY YL
Sbjct: 411 VETSPVLSARKDVAYHIVYREGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYDYL 470

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           ++SI  +++ G   K  LVVPAFET  YR +FP S                        H
Sbjct: 471 RASIEQLEL-GSQRKAALVVPAFETLHYRLSFPNSKAELLTLLDAGSLYTFRYHEWPQGH 529

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           APT+++RW  A  PY ++WA D+EPY+V  RD PRYD RFVGFGWNK
Sbjct: 530 APTDYARWREAKAPYHVQWAADYEPYVVVPRDCPRYDPRFVGFGWNK 576



 Score =  188 bits (477), Expect = 8e-45,   Method: Compositional matrix adjust.
 Identities = 83/127 (65%), Positives = 106/127 (83%)

Query: 10  PTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVP 69
           P CE++ VAIVCAGYNS+R ++TL+KS+LFYRKNPLH HL+TD +A NIL+TLF TW VP
Sbjct: 63  PKCELLHVAIVCAGYNSSREVITLVKSLLFYRKNPLHLHLVTDAIARNILETLFRTWMVP 122

Query: 70  QVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIA 129
            V + FY A+ +   +SWIPNKHYSG+YGL+KL LP +LP +LA+ IVLDTDV F++DIA
Sbjct: 123 AVVIRFYDAEELKPLISWIPNKHYSGLYGLMKLVLPSILPLSLARVIVLDTDVTFSSDIA 182

Query: 130 QLWALFS 136
           +LWALF+
Sbjct: 183 ELWALFA 189


>gi|431905242|gb|ELK10287.1| Glycosyltransferase-like protein LARGE1, partial [Pteropus alecto]
          Length = 906

 Score =  483 bits (1244), Expect = e-133,   Method: Compositional matrix adjust.
 Identities = 250/458 (54%), Positives = 300/458 (65%), Gaps = 87/458 (18%)

Query: 236 RQR-QTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIA 294
           RQR QT+G  E +SD+                      VILL L KLR + W   WR+ A
Sbjct: 433 RQRTQTLGNAEAESDFR---------------------VILLLLDKLRKMKWEQMWRLTA 471

Query: 295 EKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKII 354
           E+ L+  L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY +++DLK+I
Sbjct: 472 ERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CYRDVSDLKVI 530

Query: 355 HWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTLQQAELSNLN 410
           HWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC    ++    LQ+ +LS L+
Sbjct: 531 HWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEADVNSENLQK-QLSELD 589

Query: 411 EEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEINPNNVDEY 470
           E+D CY+ RR + T  RTHLYFL YEYE +                              
Sbjct: 590 EDDLCYEFRRERFTVHRTHLYFLHYEYEPA------------------------------ 619

Query: 471 EASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRS 530
              SD  DVTLVAQLSMDRLQM+E + KHWEGP+SL LY+SDAEAQQFL YA  SE L S
Sbjct: 620 ---SDNTDVTLVAQLSMDRLQMLEAICKHWEGPVSLALYLSDAEAQQFLRYAQGSEVLMS 676

Query: 531 RRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDM 590
           R N+ YHVVYKEG FYP+N LRNVA+  V TPY+FL DIDFLPM+GLY YL+ S+  +D+
Sbjct: 677 RHNVAYHVVYKEGQFYPVNLLRNVAMKHVGTPYMFLSDIDFLPMYGLYEYLRKSVIQLDL 736

Query: 591 HGHGGKKVLVVPAFETQRYRTAFPAS------------------------HAPTNFSRWV 626
                KK ++VPAFET RYR +FP S                        HAPTNF++W 
Sbjct: 737 --ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGHAPTNFAKWR 794

Query: 627 NATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
            ATTPY++EW  DFEPY+V  RD P YD RFVGFGWNK
Sbjct: 795 TATTPYRVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 832



 Score =  261 bits (667), Expect = 8e-67,   Method: Compositional matrix adjust.
 Identities = 125/222 (56%), Positives = 155/222 (69%), Gaps = 11/222 (4%)

Query: 5   EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
           ++P +  CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A  IL TLF 
Sbjct: 93  QQPAVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 152

Query: 65  TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
           TW VP V V FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ F
Sbjct: 153 TWMVPAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 212

Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFYLADSVVEDVSW 184
           ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+    +   D S 
Sbjct: 213 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGTAHPRDPSP 272

Query: 185 IPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIA 226
           +P        GL    LP   P T   + VL  DVI +   A
Sbjct: 273 LP--------GLFFHQLPPAQPLT---SSVLSLDVISSRHCA 303



 Score =  166 bits (421), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 72/108 (66%), Positives = 87/108 (80%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ FATDI
Sbjct: 157 PAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 216

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGV 273
           A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTG 
Sbjct: 217 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGT 264


>gi|262231830|ref|NP_001160105.1| glycosyltransferase-like protein LARGE2 isoform 2 precursor [Mus
           musculus]
 gi|53830730|gb|AAU95214.1| glycosyltransferase-like 1B splice form 2 [Mus musculus]
          Length = 655

 Score =  483 bits (1243), Expect = e-133,   Method: Compositional matrix adjust.
 Identities = 256/527 (48%), Positives = 321/527 (60%), Gaps = 100/527 (18%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY A+ +   VSWIPNKHYSG+YGL+KL LP +LP +LA+ IVLDTDV F++DI
Sbjct: 122 PAVVVSFYDAEELKPLVSWIPNKHYSGLYGLMKLVLPSILPPSLARVIVLDTDVTFSSDI 181

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
            +LWALF                                      +GVILL L +L+   
Sbjct: 182 VELWALFDHF-----------------------------------SGVILLWLDRLQQTG 206

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   W++ A++ LLT + TSLADQDIFNA+I EHP+LV+ LPC WNVQLSD+TR+ E CY
Sbjct: 207 WEQMWKVTAKRELLTLMATSLADQDIFNAVIKEHPHLVHPLPCVWNVQLSDHTRA-ERCY 265

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCN----LTQTTL 401
            E  DLK+IHWNSPKKL+VKNKH EFFRNL+LTFL YDG LLRRELFGC          L
Sbjct: 266 LEAADLKVIHWNSPKKLRVKNKHAEFFRNLHLTFLGYDGKLLRRELFGCPNQFPPGAEQL 325

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           QQA L+ L+EE+PC++ R+ +LT  R H+ FL ++                         
Sbjct: 326 QQA-LTQLDEEEPCFEFRQQQLTVHRVHITFLPHQPPPPQ-------------------- 364

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                          +DVTLVAQLSMDRLQM+E L +HW GP+SL LY++D EAQQFL +
Sbjct: 365 --------------PHDVTLVAQLSMDRLQMLEALCRHWPGPMSLALYLTDEEAQQFLHF 410

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
              S  L  R+++ YHVVY++G  YP+N LRNVAL Q  TPYVFL DIDFLP + LY YL
Sbjct: 411 VETSPVLSMRKDVAYHVVYRDGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYDYL 470

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           ++SI  +++     K  LVVPAFET  YR +FP S                        H
Sbjct: 471 RASIEQLELDSR-RKTALVVPAFETLHYRFSFPNSKAELLTLLDAGSLHTFRYHEWPQGH 529

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           + T++SRW  A  PY ++W+ D+EPY+V  RD PRYD RFVGFGWNK
Sbjct: 530 SSTDYSRWREAQAPYSVQWSADYEPYVVVPRDCPRYDPRFVGFGWNK 576



 Score =  189 bits (480), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 87/126 (69%), Positives = 104/126 (82%)

Query: 10  PTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVP 69
           P CE++ VAIVCAGYNS+R ++TL KS+LFYRKNPLH HLITD VA NIL+TLF TW VP
Sbjct: 63  PKCEMLHVAIVCAGYNSSREIITLTKSLLFYRKNPLHLHLITDAVARNILETLFRTWMVP 122

Query: 70  QVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIA 129
            V VSFY A+ +   VSWIPNKHYSG+YGL+KL LP +LP +LA+ IVLDTDV F++DI 
Sbjct: 123 AVVVSFYDAEELKPLVSWIPNKHYSGLYGLMKLVLPSILPPSLARVIVLDTDVTFSSDIV 182

Query: 130 QLWALF 135
           +LWALF
Sbjct: 183 ELWALF 188


>gi|444511887|gb|ELV09961.1| Glycosyltransferase-like protein LARGE1 [Tupaia chinensis]
          Length = 436

 Score =  474 bits (1219), Expect = e-131,   Method: Compositional matrix adjust.
 Identities = 234/409 (57%), Positives = 283/409 (69%), Gaps = 65/409 (15%)

Query: 284 ISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDEL 343
           + W   WR+ AE+ L+  L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ 
Sbjct: 1   MKWEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ- 59

Query: 344 CYTELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQT 399
           CY +++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC    ++   
Sbjct: 60  CYRDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEADVNSE 119

Query: 400 TLQQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQM 459
            LQ+ +LS L+E+D CY+ RR +LT  RTHLYFL YEYE ++                  
Sbjct: 120 NLQK-QLSELDEDDLCYEFRRERLTVHRTHLYFLHYEYEPAT------------------ 160

Query: 460 VEINPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFL 519
                          D  DVTLVAQLSMDRLQM+E + KHWEGPISL LY+SDAEAQQFL
Sbjct: 161 ---------------DNTDVTLVAQLSMDRLQMLEAICKHWEGPISLALYLSDAEAQQFL 205

Query: 520 SYAGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYP 579
            YA  SE L SR N+GYH+VYKEG FYP+N LRNVA+  +STPY+FL DIDFLPM+GLY 
Sbjct: 206 RYAQGSEVLMSRHNVGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLYE 265

Query: 580 YLKSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS----------------------- 616
           YL+ S+  +D+     KK ++VPAFET RYR +FP S                       
Sbjct: 266 YLRKSVVQLDL--ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTK 323

Query: 617 -HAPTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
            HAPTNF++W  ATTPY++EW  DFEPY+V  RD P YD RFVGFGWNK
Sbjct: 324 GHAPTNFAKWRTATTPYRVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 372


>gi|322794621|gb|EFZ17629.1| hypothetical protein SINV_09453 [Solenopsis invicta]
          Length = 664

 Score =  470 bits (1209), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 249/523 (47%), Positives = 313/523 (59%), Gaps = 68/523 (13%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           P +   FY A+  V  VSWIPNKHYSGVYGLLKL LP  + E   + +VLDTDV   TD+
Sbjct: 108 PHVNVTFYEAEVWVPRVSWIPNKHYSGVYGLLKLILPDAMRED--RVLVLDTDVTILTDM 165

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
             LW +F      QT+GLVENQS+WY+  L    +PWPALGRGFNTGV+L+ L +LRD  
Sbjct: 166 YPLWKMFQMFSASQTLGLVENQSNWYVKALSYGQRPWPALGRGFNTGVMLMHLQRLRDGG 225

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           +A  W  + ++ L     TSLADQD+ NA+ISE P +VY + C WN+QLSD+T SD  CY
Sbjct: 226 FATSWESVTKRVLEHIPETSLADQDVINAVISEQPTVVYKIECTWNIQLSDHTISDT-CY 284

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQTTLQQAE 405
            +   + I+HWNSP+K  V NKH+  FR L+  FLE DGNLLR+ LFGC+  ++ L   E
Sbjct: 285 RDTNRINILHWNSPRKQDVHNKHVNEFRKLHRVFLEMDGNLLRKRLFGCDRHESALPYYE 344

Query: 406 LSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEINPN 465
           LS      PC +  +      RTH + LEYEY   S   DV LV Q S            
Sbjct: 345 LS------PCREFEKGATLLYRTHPFLLEYEYNVYSP-TDVALVTQCS------------ 385

Query: 466 NVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNS 525
                                M+R+ ++E L KHW G IS+ LY++DAE Q FL +   S
Sbjct: 386 ---------------------MERVPLLEALSKHWPGTISVALYLTDAEVQNFLDFVRGS 424

Query: 526 EALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSI 585
             LR+RRNI YHVVYK+G+ YPIN LRN A++ V T +VF LDIDFLP FGLY  + + I
Sbjct: 425 ADLRTRRNIAYHVVYKDGDLYPINYLRNTAVSYVPTSFVFQLDIDFLPQFGLYENIMNHI 484

Query: 586 RSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------HAPTN 621
             ++++    K  L+VPAFET+RYR  FPA+                        HA TN
Sbjct: 485 NRLNIN-ESDKVALIVPAFETERYRFTFPANKEELLKFLKRGILYTFRYHVWTQGHAATN 543

Query: 622 FSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           +S W N+T PYQ+ W PDFEPYIV  +  PRYDTRFVGFGWNK
Sbjct: 544 YSLWRNSTEPYQVSWEPDFEPYIVVSKSAPRYDTRFVGFGWNK 586



 Score =  203 bits (517), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 97/161 (60%), Positives = 116/161 (72%), Gaps = 2/161 (1%)

Query: 12  CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQV 71
           C+ I VA+VCAGYNST +LVT++KSILFYR NPLHFHL+ D +A   L TLF TW +P V
Sbjct: 51  CDTIHVAMVCAGYNSTFALVTVVKSILFYRTNPLHFHLLVDEIARKTLSTLFRTWDLPHV 110

Query: 72  EVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQL 131
            V+FY A+  V  VSWIPNKHYSGVYGLLKL LP  + E   + +VLDTDV   TD+  L
Sbjct: 111 NVTFYEAEVWVPRVSWIPNKHYSGVYGLLKLILPDAMRED--RVLVLDTDVTILTDMYPL 168

Query: 132 WALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           W +F      QT+GLVENQS+WY+  L    +PWPALGRGF
Sbjct: 169 WKMFQMFSASQTLGLVENQSNWYVKALSYGQRPWPALGRGF 209


>gi|270005097|gb|EFA01545.1| hypothetical protein TcasGA2_TC007106 [Tribolium castaneum]
          Length = 576

 Score =  468 bits (1203), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 244/526 (46%), Positives = 320/526 (60%), Gaps = 73/526 (13%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLP-ETLAKTIVLDTDVIFATD 224
           P     FY  +  ++DV W+PN HYSG++GLLKL  PK++P     K IVLDTD+ F  D
Sbjct: 20  PQSNTTFYDINDYIQDVRWVPNSHYSGIFGLLKLVFPKIIPLNVTKKIIVLDTDLTFVGD 79

Query: 225 IAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDI 284
           I +LW LF +   +Q +G+VENQSD+YLGK  K  KPWPA GRGFN+GV+L +L +LR +
Sbjct: 80  IIELWRLFEKFNNKQAVGIVENQSDYYLGKNTK-IKPWPASGRGFNSGVLLYNLERLRRL 138

Query: 285 SWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELC 344
            W   W  +A+K  +    T L DQDI NA++ +H  L+Y +PC WN QLSD+  S   C
Sbjct: 139 DWPSLWPSVAKKVAIIYGSTRLGDQDIINAVLLQHTDLLYQVPCFWNTQLSDHALSYN-C 197

Query: 345 YTELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQTTLQQA 404
           Y +   +K++HWNSPKK  V NK  ++FR LY TF+EY+GNLLRR+L+ C+ TQ  + + 
Sbjct: 198 YNKFR-VKVVHWNSPKKTHVLNKDGDYFRTLYQTFVEYNGNLLRRQLYYCDATQDAVNEP 256

Query: 405 ELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDG-NDVTLVAQLSMDRLQMVEIN 463
                 + D C +  R+K +  RT L+F EY++ A  +  NDVT VAQLS DRL      
Sbjct: 257 ------QTDLCAEFHRAKTSQWRTLLFFREYQHSAEENAKNDVTFVAQLSYDRL------ 304

Query: 464 PNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAG 523
                                      QMVE L K+W+GPISLT Y++D E QQ  ++  
Sbjct: 305 ---------------------------QMVEELVKYWQGPISLTFYVTDPEFQQCYNFIE 337

Query: 524 NSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKS 583
           NSE L+ R NI YH V+K+G ++PIN LRNV L  V+TP+VFL DIDFLPM GLY  LKS
Sbjct: 338 NSELLQDRTNIAYHAVFKDGEYHPINILRNVGLKNVATPFVFLADIDFLPMKGLYSVLKS 397

Query: 584 SIRSM-DMHGHGGKKVLVVPAFETQRYRTAFP------------------------ASHA 618
            + S+ DM      K L+VPAFETQRYR+  P                        A HA
Sbjct: 398 HLNSIHDMKA----KALIVPAFETQRYRSRVPKNKAQLLGMWEKKSIFPFRSDVWVAGHA 453

Query: 619 PTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           PTN+++W +A  PY+++W PDFEPYIV   D+  YD RF+GFGWNK
Sbjct: 454 PTNYTKWKSAIAPYKVKWEPDFEPYIVVSSDVTEYDNRFMGFGWNK 499



 Score =  142 bits (357), Expect = 8e-31,   Method: Compositional matrix adjust.
 Identities = 65/124 (52%), Positives = 87/124 (70%), Gaps = 2/124 (1%)

Query: 50  ITDTVALNILQTLFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLP 109
           + + V+  +L+TLF TW+VPQ   +FY  +  ++DV W+PN HYSG++GLLKL  PK++P
Sbjct: 1   MVNKVSEKVLKTLFDTWNVPQSNTTFYDINDYIQDVRWVPNSHYSGIFGLLKLVFPKIIP 60

Query: 110 -ETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPAL 168
                K IVLDTD+ F  DI +LW LF +   +Q +G+VENQSD+YLGK  K  KPWPA 
Sbjct: 61  LNVTKKIIVLDTDLTFVGDIIELWRLFEKFNNKQAVGIVENQSDYYLGKNTK-IKPWPAS 119

Query: 169 GRGF 172
           GRGF
Sbjct: 120 GRGF 123


>gi|156552489|ref|XP_001602136.1| PREDICTED: glycosyltransferase-like protein LARGE2 [Nasonia
           vitripennis]
          Length = 712

 Score =  467 bits (1202), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 246/523 (47%), Positives = 309/523 (59%), Gaps = 68/523 (13%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           P +   +Y A+  +  V+WIPNKHYSGVYGLLKL LP  + E  +K IVLDTDV    D+
Sbjct: 129 PHVAVSYYKAEEWIPKVAWIPNKHYSGVYGLLKLILPDAIRE--SKIIVLDTDVTVLNDV 186

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
             LW LF+   Q Q +GLVENQS+WY+     + +PWPALGRGFN+GV+L+DL KLR  S
Sbjct: 187 TLLWELFNNFNQEQVLGLVENQSNWYIKPSLYSTRPWPALGRGFNSGVMLMDLQKLRAKS 246

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           +A  W   A   L   L TSLADQDI NA+I  +P  VYT+ C WNVQLSD+  S E CY
Sbjct: 247 FAKLWEHTAAVVLKDILETSLADQDIINAVIKNNPKFVYTIDCIWNVQLSDHALS-ENCY 305

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQTTLQQAE 405
           +    L IIHWNSP+K  V++K+   FR ++  FLE DGNLLRR LFGC          E
Sbjct: 306 SNADQLNIIHWNSPRKQNVRHKYTHDFRKMHQVFLELDGNLLRRRLFGCQTND------E 359

Query: 406 LSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEINPN 465
            S LNE D C+  +       RTH +FLE+EY   +   D+ L  Q S            
Sbjct: 360 ASQLNERDLCFKFKEGATIIYRTHTFFLEFEYNIVNSA-DILLATQCS------------ 406

Query: 466 NVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNS 525
                                ++R+ ++E L KHW G IS+ LY+SD E Q FL +  NS
Sbjct: 407 ---------------------IERVALLEELAKHWPGVISIALYLSDNEVQSFLDFVQNS 445

Query: 526 EALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSI 585
           E LR+R+NI YH+VYKEG FYP+N LRNVA+  ++ PY+F +DIDFLP  GL+  L + I
Sbjct: 446 EELRNRKNIAYHIVYKEGEFYPVNYLRNVAMTHITLPYIFQIDIDFLPQIGLHRILMNYI 505

Query: 586 RSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------HAPTN 621
             ++      +  LV+PAFETQRYR  +P+S                        HAPTN
Sbjct: 506 IQLN-SSELQQIALVIPAFETQRYRFTYPSSKADLLKYWDHGVLYTFRYHVWPQGHAPTN 564

Query: 622 FSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           FS + NAT PY+I W PDFEPYIV  R  P YDTRF+GFGWNK
Sbjct: 565 FSVYRNATEPYEISWEPDFEPYIVVLRSAPTYDTRFIGFGWNK 607



 Score =  201 bits (512), Expect = 8e-49,   Method: Compositional matrix adjust.
 Identities = 94/161 (58%), Positives = 115/161 (71%), Gaps = 2/161 (1%)

Query: 12  CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQV 71
           C  I +AIVCAGYNST +LVT +KSILFYR  PLHFH++ D +AL  L  LF+TW +P V
Sbjct: 72  CSTIHIAIVCAGYNSTVTLVTTVKSILFYRTKPLHFHIMVDEIALKTLNVLFTTWDIPHV 131

Query: 72  EVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQL 131
            VS+Y A+  +  V+WIPNKHYSGVYGLLKL LP  + E  +K IVLDTDV    D+  L
Sbjct: 132 AVSYYKAEEWIPKVAWIPNKHYSGVYGLLKLILPDAIRE--SKIIVLDTDVTVLNDVTLL 189

Query: 132 WALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           W LF+   Q Q +GLVENQS+WY+     + +PWPALGRGF
Sbjct: 190 WELFNNFNQEQVLGLVENQSNWYIKPSLYSTRPWPALGRGF 230


>gi|307210698|gb|EFN87121.1| Glycosyltransferase-like protein LARGE1 [Harpegnathos saltator]
          Length = 682

 Score =  466 bits (1199), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 242/523 (46%), Positives = 315/523 (60%), Gaps = 68/523 (13%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           P +   FY A+  V  VSWIPNKHYSGVYGLLKL LP  + E   K +VLDTDV    D+
Sbjct: 126 PHVNITFYGAEVWVPRVSWIPNKHYSGVYGLLKLILPDAIRED--KVLVLDTDVTVLNDV 183

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           + LW +F +    QT+GL+ENQS+WY+  L    +PWPALGRGFNTGV+L+ L +LRD  
Sbjct: 184 SLLWGMFEKFSTGQTLGLIENQSNWYIKALSYGQRPWPALGRGFNTGVMLMHLQRLRDGK 243

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           +   W  +A++ L     TSLADQD+ NA+I+EHP +VY + C WN+QLSD T SD  CY
Sbjct: 244 FTSSWESVAKRVLGYIPKTSLADQDVINAVINEHPSIVYRIECTWNIQLSDRTISDT-CY 302

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQTTLQQAE 405
            + + + I+HWNSP+K  V+NKH+  FR L+  FLE DGNLLRR LFGC+  ++ L    
Sbjct: 303 RDSSRINILHWNSPRKQDVRNKHVNEFRKLHRVFLEMDGNLLRRRLFGCDKHESALP--- 359

Query: 406 LSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEINPN 465
               NE  PC +  +S     RTH + LEY+Y   S   DV LV Q              
Sbjct: 360 ---YNESSPCREFEKSASILYRTHPFLLEYDYNMYSP-MDVALVTQ-------------- 401

Query: 466 NVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNS 525
                               S++R+ ++E L KHW G IS+ +Y++DAE Q FL +   S
Sbjct: 402 -------------------CSVERIPLLEALSKHWPGTISVAIYLTDAEVQNFLDFVRGS 442

Query: 526 EALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSI 585
             LR+RRNI YHVVYK+G  YPIN LRN A++ VSTP++F LDIDFLP  GL+  +   I
Sbjct: 443 ADLRTRRNIAYHVVYKDGELYPINYLRNTAMSYVSTPFIFQLDIDFLPQLGLHENIMGYI 502

Query: 586 RSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------HAPTN 621
             +D++    K  L+VPAFET+RYR  FPA+                        HA TN
Sbjct: 503 NKLDIN-ESDKIALIVPAFETERYRFDFPANKEELLKFLKRGVLYTFRYHVWTQGHAATN 561

Query: 622 FSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           +S W +++ PY++ W PDFEPYIV  +  PRYDTRF+GFGWNK
Sbjct: 562 YSVWRSSSEPYEVSWEPDFEPYIVVSKSAPRYDTRFIGFGWNK 604



 Score =  203 bits (517), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 94/161 (58%), Positives = 117/161 (72%), Gaps = 2/161 (1%)

Query: 12  CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQV 71
           C+ + VA+VCAGYNST +LVT++KSILFYR NPLHFHL+ D +A   L TLF TW +P V
Sbjct: 69  CDTVHVAMVCAGYNSTLTLVTVVKSILFYRTNPLHFHLLVDEIARRTLSTLFRTWDLPHV 128

Query: 72  EVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQL 131
            ++FY A+  V  VSWIPNKHYSGVYGLLKL LP  + E   K +VLDTDV    D++ L
Sbjct: 129 NITFYGAEVWVPRVSWIPNKHYSGVYGLLKLILPDAIRED--KVLVLDTDVTVLNDVSLL 186

Query: 132 WALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           W +F +    QT+GL+ENQS+WY+  L    +PWPALGRGF
Sbjct: 187 WGMFEKFSTGQTLGLIENQSNWYIKALSYGQRPWPALGRGF 227


>gi|328788916|ref|XP_624139.2| PREDICTED: glycosyltransferase-like protein LARGE1-like [Apis
           mellifera]
          Length = 692

 Score =  464 bits (1194), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 238/523 (45%), Positives = 307/523 (58%), Gaps = 68/523 (13%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           P +   +Y A+  V  VSWIPNKHYSGVYGLLKL LP  + E   K +V DTDV    D+
Sbjct: 128 PHVNLTYYKAEKWVPKVSWIPNKHYSGVYGLLKLILPDAMRED--KVLVFDTDVTVLNDV 185

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           + LW +F +    Q +GL ENQS WY+  L    +PWPALGRGFNTGV+L+ L +LR   
Sbjct: 186 SLLWHIFEKFSSDQALGLTENQSHWYIKALSYGQRPWPALGRGFNTGVMLMHLQQLRKRK 245

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           +   W  + ++ L     TSLADQDI NA+I E P++VY + C WN+QLSD+T SD +CY
Sbjct: 246 FTSLWEAVTKRVLAHIPETSLADQDIINAVIKERPFIVYKIKCTWNIQLSDHTISD-ICY 304

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQTTLQQAE 405
            + + + IIHWNSP+K  V NKH+  FR L+  FLE DGNLLRR LFGC+  +T      
Sbjct: 305 RDTSQISIIHWNSPRKQDVYNKHINEFRKLHRVFLEMDGNLLRRRLFGCDKHET------ 358

Query: 406 LSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEINPN 465
           +S  NE  PC D  +      RTH + LEYEY   +   DV L+ Q              
Sbjct: 359 VSQYNESSPCRDFTKGATMLYRTHPFLLEYEYNVYTPA-DVALITQ-------------- 403

Query: 466 NVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNS 525
                               S++R+ ++E L KHW G IS+ LY++DAE Q FL +   S
Sbjct: 404 -------------------CSVERIPLLEDLSKHWPGTISVALYLTDAEVQNFLEFVRGS 444

Query: 526 EALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSI 585
             LR+R+NI YHVVYK+G  YPIN LRN+A++ +STPY+F LD+DFLP + L+  L + I
Sbjct: 445 IELRNRKNIAYHVVYKDGELYPINYLRNIAMSYISTPYIFQLDVDFLPQYRLHENLMNYI 504

Query: 586 RSMDMHGHGGKKVLVVPAFETQRYRTAFPA------------------------SHAPTN 621
             ++M     K  L+VPAFET+RYR  FPA                         HA TN
Sbjct: 505 VKLNM-SESDKVALIVPAFETERYRFTFPADKDELLKFLKRGVLYTFRYHVWTQGHAATN 563

Query: 622 FSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           ++ W N   PY++ W PDFEPYIV  R  PRYDTRF+GFGWNK
Sbjct: 564 YTYWRNTMEPYEVSWEPDFEPYIVVSRLAPRYDTRFIGFGWNK 606



 Score =  194 bits (493), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 88/161 (54%), Positives = 112/161 (69%), Gaps = 2/161 (1%)

Query: 12  CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQV 71
           CE + +A+VCAGYNST +LVT++KS+LFYR  PLHFHL+ D +A   L T+F TW +P V
Sbjct: 71  CETVHIAMVCAGYNSTFALVTVVKSVLFYRTKPLHFHLLVDEIAKRTLTTVFQTWDLPHV 130

Query: 72  EVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQL 131
            +++Y A+  V  VSWIPNKHYSGVYGLLKL LP  + E   K +V DTDV    D++ L
Sbjct: 131 NLTYYKAEKWVPKVSWIPNKHYSGVYGLLKLILPDAMRED--KVLVFDTDVTVLNDVSLL 188

Query: 132 WALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           W +F +    Q +GL ENQS WY+  L    +PWPALGRGF
Sbjct: 189 WHIFEKFSSDQALGLTENQSHWYIKALSYGQRPWPALGRGF 229


>gi|380019964|ref|XP_003693869.1| PREDICTED: glycosyltransferase-like protein LARGE1 [Apis florea]
          Length = 692

 Score =  463 bits (1192), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 239/523 (45%), Positives = 307/523 (58%), Gaps = 68/523 (13%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           P +   +Y A+  V  VSWIPNKHYSGVYGLLKL LP  L E   K +V DTDV    D+
Sbjct: 128 PHVNLTYYKAEKWVPKVSWIPNKHYSGVYGLLKLILPDALRED--KVLVFDTDVTVLNDV 185

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           + LW +F +    Q +GL ENQS WY+  L    +PWPALGRGFNTGV+L+ L +LR   
Sbjct: 186 SLLWHIFEKFSSDQALGLTENQSHWYIKALSYGQRPWPALGRGFNTGVMLMHLQQLRKRK 245

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           +   W  + ++ L     TSLADQDI NA+I E P++VY + C WN+QLSD+T SD +CY
Sbjct: 246 FTSLWEAVTKRVLAHIPETSLADQDIINAVIKERPFIVYKIKCTWNIQLSDHTISD-ICY 304

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQTTLQQAE 405
            + + + IIHWNSP+K  V NKH+  FR L+  FLE DGNLLRR LFGC+  +T      
Sbjct: 305 RDTSQISIIHWNSPRKQDVYNKHINEFRKLHRVFLEMDGNLLRRRLFGCDKHET------ 358

Query: 406 LSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEINPN 465
           +S  NE  PC D  +      RTH + LEYEY   +   DV L+ Q              
Sbjct: 359 VSQYNESSPCRDFTKGATMLYRTHPFLLEYEYNVYTPA-DVALITQ-------------- 403

Query: 466 NVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNS 525
                               S++R+ ++E L KHW G IS+ LY++DAE Q FL +   S
Sbjct: 404 -------------------CSVERIPLLEDLSKHWPGTISVALYLTDAEVQNFLEFVRGS 444

Query: 526 EALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSI 585
             LR+R+NI YHVVYK+G  YPIN LRN+A++ VSTPY+F LD+DFLP + L+  L + I
Sbjct: 445 IELRNRKNIAYHVVYKDGELYPINYLRNIAMSYVSTPYIFQLDVDFLPQYRLHETLMNYI 504

Query: 586 RSMDMHGHGGKKVLVVPAFETQRYRTAFPA------------------------SHAPTN 621
             +++     K  L+VPAFET+RYR  FPA                         HA TN
Sbjct: 505 VKLNI-SESDKIALIVPAFETERYRFTFPADKDELLKFLKRGVLYTFRYHVWTQGHAATN 563

Query: 622 FSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           ++ W N   PY++ W PDFEPYIV  R  PRYDTRF+GFGWNK
Sbjct: 564 YTYWRNTMEPYEVTWEPDFEPYIVVSRLAPRYDTRFIGFGWNK 606



 Score =  194 bits (494), Expect = 9e-47,   Method: Compositional matrix adjust.
 Identities = 89/161 (55%), Positives = 112/161 (69%), Gaps = 2/161 (1%)

Query: 12  CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQV 71
           CE + +A+VCAGYNST +LVT++KS+LFYR  PLHFHL+ D +A   L T+F TW +P V
Sbjct: 71  CETVHIAMVCAGYNSTFALVTVVKSVLFYRTKPLHFHLLVDEIAKRTLTTVFQTWDLPHV 130

Query: 72  EVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQL 131
            +++Y A+  V  VSWIPNKHYSGVYGLLKL LP  L E   K +V DTDV    D++ L
Sbjct: 131 NLTYYKAEKWVPKVSWIPNKHYSGVYGLLKLILPDALRED--KVLVFDTDVTVLNDVSLL 188

Query: 132 WALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           W +F +    Q +GL ENQS WY+  L    +PWPALGRGF
Sbjct: 189 WHIFEKFSSDQALGLTENQSHWYIKALSYGQRPWPALGRGF 229


>gi|350417177|ref|XP_003491294.1| PREDICTED: glycosyltransferase-like protein LARGE1-like [Bombus
           impatiens]
          Length = 689

 Score =  460 bits (1184), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 236/523 (45%), Positives = 308/523 (58%), Gaps = 68/523 (13%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           P +   +Y A+  V  VSWIPNKHYSGVYGLLKL LP  + E   K +V DTDV    D+
Sbjct: 125 PHVNLTYYKAEKWVPKVSWIPNKHYSGVYGLLKLILPDAMRED--KVLVFDTDVTVLNDV 182

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           + LW +F +    Q +GL ENQS WY+  L    +PWPALGRGFNTGV+L+ L +LR   
Sbjct: 183 SLLWQMFEKFTSDQALGLTENQSHWYIKALSYGQRPWPALGRGFNTGVMLMHLQQLRKRK 242

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           +   W  +A++ L+    TSLADQDI NA++ EH  +VY + C WN+QLSD+T SD +CY
Sbjct: 243 FMSLWETVAKRVLVHIPETSLADQDIINAVVKEHSSIVYKIQCTWNIQLSDHTISD-ICY 301

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQTTLQQAE 405
            + + + I+HWNSP+K  V NKH+  FR L+  FLE DGNLLR+ LFGC+  +T      
Sbjct: 302 RDTSQINIVHWNSPRKQDVYNKHINQFRKLHKVFLEMDGNLLRKRLFGCDKHET------ 355

Query: 406 LSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEINPN 465
           +S  NE   C +  R   T  RTH + LEYEY   +   DV L  Q              
Sbjct: 356 VSQYNESTLCREFTRGATTLYRTHPFLLEYEYNVYA-STDVALATQ-------------- 400

Query: 466 NVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNS 525
                               S++R+ ++E L KHW G IS+ LY++DAE Q FL +   S
Sbjct: 401 -------------------CSVERIPLLEDLSKHWPGTISVALYLTDAEVQNFLEFVRGS 441

Query: 526 EALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSI 585
             LR+R+NI YHVVYK+G  YPIN LRN A++ +STP++F LD+DFLP +GL+  L + I
Sbjct: 442 IELRNRKNIAYHVVYKDGELYPINYLRNTAMSYISTPFIFQLDVDFLPQYGLHESLMNYI 501

Query: 586 RSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------HAPTN 621
             +++     K  L+VPAFET+RYR  FPA+                        HA TN
Sbjct: 502 IKLNI-SESDKVALIVPAFETERYRFIFPANKDELLKFLKRGVLYTFRYHVWTQGHAATN 560

Query: 622 FSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           +S W N   PY++ W PDFEPYIV  R  PRYDTRF+GFGWNK
Sbjct: 561 YSYWRNTMEPYEVSWEPDFEPYIVVSRKAPRYDTRFIGFGWNK 603



 Score =  195 bits (496), Expect = 6e-47,   Method: Compositional matrix adjust.
 Identities = 90/161 (55%), Positives = 112/161 (69%), Gaps = 2/161 (1%)

Query: 12  CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQV 71
           CE I +A+VCAGYNST +LVT++KS+LFYR  PLHFHL+ D VA   L T+F TW +P V
Sbjct: 68  CETIHIAMVCAGYNSTFALVTVVKSVLFYRTKPLHFHLLVDEVANRTLTTVFQTWDLPHV 127

Query: 72  EVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQL 131
            +++Y A+  V  VSWIPNKHYSGVYGLLKL LP  + E   K +V DTDV    D++ L
Sbjct: 128 NLTYYKAEKWVPKVSWIPNKHYSGVYGLLKLILPDAMRED--KVLVFDTDVTVLNDVSLL 185

Query: 132 WALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           W +F +    Q +GL ENQS WY+  L    +PWPALGRGF
Sbjct: 186 WQMFEKFTSDQALGLTENQSHWYIKALSYGQRPWPALGRGF 226


>gi|340715041|ref|XP_003396029.1| PREDICTED: glycosyltransferase-like protein LARGE1-like [Bombus
           terrestris]
          Length = 689

 Score =  457 bits (1175), Expect = e-125,   Method: Compositional matrix adjust.
 Identities = 234/523 (44%), Positives = 307/523 (58%), Gaps = 68/523 (13%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           P +   +Y A+  V  VSWIPNKHYSGVYGLLKL LP  + E   K +V DTDV    D+
Sbjct: 125 PHVNLTYYKAEKWVPKVSWIPNKHYSGVYGLLKLILPDAMRED--KVLVFDTDVTVLNDV 182

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           + LW +F +    Q +GL ENQS WY+  L    +PWPALGRGFNTGV+L+ L +LR   
Sbjct: 183 SLLWQMFEKFTSDQALGLTENQSHWYIKALSYGQRPWPALGRGFNTGVMLMHLQQLRKRK 242

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           +   W  +A++ L+    TSLADQDI NA++ E+  +VY + C WN+QLSD+T SD +CY
Sbjct: 243 FMSLWETVAKRVLVHIPETSLADQDIINAVVKEYSSIVYKIQCTWNIQLSDHTISD-ICY 301

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQTTLQQAE 405
            + + + I+HWNSP+K  V NKH+  FR L+  FLE DGNLLR+ LFGC+      +   
Sbjct: 302 RDTSQINIVHWNSPRKQDVYNKHINQFRKLHKVFLEMDGNLLRKRLFGCD------KHEA 355

Query: 406 LSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEINPN 465
           +S  NE   C +  R   T  RTH + LEYEY   +   DV L  Q              
Sbjct: 356 VSEYNESALCREFTRGATTLYRTHPFLLEYEYNVYA-STDVALATQ-------------- 400

Query: 466 NVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNS 525
                               S++R+ ++E L KHW G IS+ LY++DAE Q FL +   S
Sbjct: 401 -------------------CSVERIPLLEDLSKHWPGTISVALYLTDAEVQNFLEFVRGS 441

Query: 526 EALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSI 585
             LR+R+NI YHVVYK+G  YPIN LRN A++ +STP++F LD+DFLP +GL+  L + I
Sbjct: 442 IELRNRKNIAYHVVYKDGELYPINYLRNTAMSYISTPFIFQLDVDFLPQYGLHESLMNYI 501

Query: 586 RSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------HAPTN 621
             +++     K  L+VPAFET+RYR  FPA+                        HA TN
Sbjct: 502 IKLNI-SESDKVALIVPAFETERYRFIFPANKDELLKFLKRGVLYTFRYHVWTQGHAATN 560

Query: 622 FSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           +S W N   PY++ W PDFEPYIV  R  PRYDTRF+GFGWNK
Sbjct: 561 YSYWRNTMEPYEVSWEPDFEPYIVVSRKAPRYDTRFIGFGWNK 603



 Score =  194 bits (494), Expect = 9e-47,   Method: Compositional matrix adjust.
 Identities = 89/161 (55%), Positives = 112/161 (69%), Gaps = 2/161 (1%)

Query: 12  CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQV 71
           CE I +A+VCAGYNST +LVT++KS+LFYR  PLHFHL+ D +A   L T+F TW +P V
Sbjct: 68  CETIHIAMVCAGYNSTFALVTVVKSVLFYRTKPLHFHLLVDEIANRTLTTVFQTWDLPHV 127

Query: 72  EVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQL 131
            +++Y A+  V  VSWIPNKHYSGVYGLLKL LP  + E   K +V DTDV    D++ L
Sbjct: 128 NLTYYKAEKWVPKVSWIPNKHYSGVYGLLKLILPDAMRED--KVLVFDTDVTVLNDVSLL 185

Query: 132 WALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           W +F +    Q +GL ENQS WY+  L    +PWPALGRGF
Sbjct: 186 WQMFEKFTSDQALGLTENQSHWYIKALSYGQRPWPALGRGF 226


>gi|332836402|ref|XP_003313074.1| PREDICTED: LOW QUALITY PROTEIN: glycosyltransferase-like protein
           LARGE2 [Pan troglodytes]
          Length = 639

 Score =  451 bits (1159), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 252/527 (47%), Positives = 308/527 (58%), Gaps = 116/527 (22%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY AD +   VSWIPNKHYSG+YGL+KL LP  LP  LA+ IVLDTDV FA+DI
Sbjct: 120 PAVRVSFYHADQLKPQVSWIPNKHYSGLYGLMKLVLPSALPAELARVIVLDTDVTFASDI 179

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           ++LWALF+     Q IGLVENQSDWYLG LWKNH+PWPALGRGFNTG      + L   S
Sbjct: 180 SELWALFAHFSDTQAIGLVENQSDWYLGNLWKNHRPWPALGRGFNTG----GDSALGSPS 235

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
                  +   ++ +R ++  +D      +I             WN              
Sbjct: 236 -------VRRPWMCSRCYSEASD---LRXVI------------HWN-------------- 259

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLT----QTTL 401
                       SPKKL+VKNKH+EFFRN YLTFLEYDGNLLRRELF C          L
Sbjct: 260 ------------SPKKLRVKNKHVEFFRNFYLTFLEYDGNLLRRELFVCPSQPPPGAEQL 307

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
           QQA L+ L+EEDPC++ R+ +LT  R H+ FL +E       +DVTLVAQLSMDRL    
Sbjct: 308 QQA-LAQLDEEDPCFEFRQQQLTVHRVHVTFLPHEPPPPRP-HDVTLVAQLSMDRL---- 361

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                        QM+E L +HW GP+SL LY++DAEAQQFL +
Sbjct: 362 -----------------------------QMLEALCRHWPGPMSLALYLTDAEAQQFLHF 392

Query: 522 AGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
              S  L +R+++ YHVVY+EG  YP+N LRNVAL Q  TPYVFL DIDFLP + LY YL
Sbjct: 393 VEASPVLAARQDVAYHVVYREGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYDYL 452

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------H 617
           ++SI  + + G   K  LVVPAFET RYR +FP S                        H
Sbjct: 453 RASIEQLGL-GSWRKAALVVPAFETLRYRFSFPHSKVELLALLDAGTLYTFRYHEWPRGH 511

Query: 618 APTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           APT+++RW  A  PY+++WA D+EPY+V  RD PRYD RFVGFGWNK
Sbjct: 512 APTDYARWREAQAPYRVQWAADYEPYVVVPRDCPRYDPRFVGFGWNK 558



 Score =  254 bits (649), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 120/185 (64%), Positives = 143/185 (77%), Gaps = 5/185 (2%)

Query: 12  CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQV 71
           CE++ VAIVCAG+NS+R ++TL+KS+LFYRKNPLH HL+TD VA NIL+TLF TW VP V
Sbjct: 63  CELLHVAIVCAGHNSSRDVITLVKSMLFYRKNPLHLHLVTDAVARNILETLFHTWMVPAV 122

Query: 72  EVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQL 131
            VSFY AD +   VSWIPNKHYSG+YGL+KL LP  LP  LA+ IVLDTDV FA+DI++L
Sbjct: 123 RVSFYHADQLKPQVSWIPNKHYSGLYGLMKLVLPSALPAELARVIVLDTDVTFASDISEL 182

Query: 132 WALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFYL-ADSVVEDVS----WIP 186
           WALF+     Q IGLVENQSDWYLG LWKNH+PWPALGRGF    DS +   S    W+ 
Sbjct: 183 WALFAHFSDTQAIGLVENQSDWYLGNLWKNHRPWPALGRGFNTGGDSALGSPSVRRPWMC 242

Query: 187 NKHYS 191
           ++ YS
Sbjct: 243 SRCYS 247


>gi|383855636|ref|XP_003703316.1| PREDICTED: glycosyltransferase-like protein LARGE1 [Megachile
           rotundata]
          Length = 689

 Score =  449 bits (1155), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 238/523 (45%), Positives = 309/523 (59%), Gaps = 68/523 (13%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           P +   +Y A+  V  VSWIPNKHYSGVYGLLKL LP  + E   K +V DTDV    D+
Sbjct: 126 PHVNLTYYKAERWVPKVSWIPNKHYSGVYGLLKLILPDAMRED--KVVVFDTDVTVVNDV 183

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
             LW +F +    Q +GL ENQS WY+  L    +PWPALGRGFNTGV+L+ L +LR   
Sbjct: 184 NLLWQMFEKFTNDQALGLTENQSHWYIKALSYGQRPWPALGRGFNTGVMLMHLQQLRTRK 243

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           +   W  + ++ L     TSLADQDI NA+I +HP +VY + C WN+QLSD+T SD+ CY
Sbjct: 244 FMSLWEAVTKRVLSYIPETSLADQDIINAVIKDHPSIVYKVECTWNIQLSDHTISDQ-CY 302

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQTTLQQAE 405
            + + + I+HWNSP+K  V NKH+  FR L+  FL+ DGNLLRR LFGC+      +   
Sbjct: 303 RDTSRINIVHWNSPRKQDVYNKHINEFRKLHKVFLDMDGNLLRRRLFGCD------KHEA 356

Query: 406 LSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEINPN 465
           +S  NE  PC +  R   T  RTH + LEYEY   +   DV LV Q S++R+ ++E    
Sbjct: 357 MSEYNESSPCREFTRGASTLYRTHPFLLEYEYNVYAP-TDVALVTQCSVERIPLLE---- 411

Query: 466 NVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNS 525
                       D++                 KHW G IS+ LY++DAE Q FL +   S
Sbjct: 412 ------------DIS-----------------KHWPGTISVALYLTDAEVQNFLEFVRGS 442

Query: 526 EALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSI 585
             LR R+NI YHVVYK+G  YPIN LRN+A++ VS+P++F LD+DFLP FGL+  L S I
Sbjct: 443 VELRKRKNIAYHVVYKDGELYPINYLRNIAMSYVSSPFIFQLDVDFLPQFGLHEALMSYI 502

Query: 586 RSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------HAPTN 621
             +++     K  L+VPAFET+RYR  FPA+                        HA TN
Sbjct: 503 SKLNI-TESDKVALIVPAFETERYRFTFPANKDELLKFLKRGVLYTFRYHVWTQGHAATN 561

Query: 622 FSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           +S W N   PY++ W PDFEPYIV  R  P+YDTRF+GFGWNK
Sbjct: 562 YSYWRNTMEPYEVSWEPDFEPYIVVSRLAPKYDTRFIGFGWNK 604



 Score =  180 bits (457), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 89/161 (55%), Positives = 111/161 (68%), Gaps = 2/161 (1%)

Query: 12  CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQV 71
           CE I +A+VCAGYNST +LVT++KS+LFYR  PLHFHL+ D +A   L T+F TW +P V
Sbjct: 69  CETIHIAMVCAGYNSTFALVTVVKSVLFYRTKPLHFHLLVDEIAKRTLTTVFRTWDLPHV 128

Query: 72  EVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQL 131
            +++Y A+  V  VSWIPNKHYSGVYGLLKL LP  + E   K +V DTDV    D+  L
Sbjct: 129 NLTYYKAERWVPKVSWIPNKHYSGVYGLLKLILPDAMRED--KVVVFDTDVTVVNDVNLL 186

Query: 132 WALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           W +F +    Q +GL ENQS WY+  L    +PWPALGRGF
Sbjct: 187 WQMFEKFTNDQALGLTENQSHWYIKALSYGQRPWPALGRGF 227


>gi|307174208|gb|EFN64853.1| Glycosyltransferase-like protein LARGE1 [Camponotus floridanus]
          Length = 613

 Score =  446 bits (1147), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 232/523 (44%), Positives = 308/523 (58%), Gaps = 68/523 (13%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           P +   FY A+  +  VSWIPN HYSGVYGLLKL LP  + E   K +VLDTDV    D+
Sbjct: 51  PHVNVTFYEAEVWMPKVSWIPNTHYSGVYGLLKLILPDAIQED--KILVLDTDVTVMNDV 108

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
             LW +F      + +GL+ENQS+WY+  L    +PWPALGRGFNTGV+++ L +LRD  
Sbjct: 109 YSLWKMFESFSTVEILGLIENQSNWYVKALSYGQRPWPALGRGFNTGVMMMHLRRLRDRK 168

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           +A  W  + ++ L     TSLADQD+ NA+I+E P +V+ + C WN+QLSD+T SD  CY
Sbjct: 169 FASLWESVTKRVLGYIPETSLADQDVINAVINEQPNIVHEIDCTWNIQLSDHTISDT-CY 227

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQTTLQQAE 405
            +   + I+HWNSP+K  V NKH+  FR L+  FLE DGNLLR+ LFGC+  +  L    
Sbjct: 228 RDTNRINILHWNSPRKQDVHNKHINEFRKLHRVFLEMDGNLLRKRLFGCDKHENVLP--- 284

Query: 406 LSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEINPN 465
               +E   C +  +      RT+ + LEYEY   S   DV LV Q              
Sbjct: 285 ---YDESSLCREFEKGAAILYRTYPFLLEYEYNVFS-SMDVALVTQ-------------- 326

Query: 466 NVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNS 525
                               S++R+ ++E L KHW G IS+ LY++DAE Q FL++   S
Sbjct: 327 -------------------CSVERIPLLETLAKHWPGTISVALYLTDAEVQYFLNFVRGS 367

Query: 526 EALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSI 585
             LR+R+NI YHVVYK+G  YPIN LRN+A++ VSTP++F LD+DFLP FGL+  L ++I
Sbjct: 368 IDLRTRKNIAYHVVYKDGELYPINYLRNIAISYVSTPFIFQLDVDFLPQFGLHENLMNNI 427

Query: 586 RSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------HAPTN 621
             +++     K  L+VPAFET+RYR  FP +                        HA TN
Sbjct: 428 NRLNI-NESDKIALIVPAFETERYRFTFPTNKAELLKFLKRGILYTFRYHVWTQGHAATN 486

Query: 622 FSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           +S W N+  PY++ W PDFEPYIV  +  PRYDTRFVGFGWNK
Sbjct: 487 YSFWQNSMEPYEVSWEPDFEPYIVVPKSAPRYDTRFVGFGWNK 529



 Score =  186 bits (473), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 88/154 (57%), Positives = 108/154 (70%), Gaps = 2/154 (1%)

Query: 19  IVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQVEVSFYLA 78
           +VCAGYNST +LVT++KSILFYR NPLHFHL+ D +A   L TLF TW +P V V+FY A
Sbjct: 1   MVCAGYNSTFALVTVVKSILFYRTNPLHFHLLVDEIARRTLSTLFQTWDLPHVNVTFYEA 60

Query: 79  DSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRL 138
           +  +  VSWIPN HYSGVYGLLKL LP  + E   K +VLDTDV    D+  LW +F   
Sbjct: 61  EVWMPKVSWIPNTHYSGVYGLLKLILPDAIQED--KILVLDTDVTVMNDVYSLWKMFESF 118

Query: 139 RQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
              + +GL+ENQS+WY+  L    +PWPALGRGF
Sbjct: 119 STVEILGLIENQSNWYVKALSYGQRPWPALGRGF 152


>gi|390368395|ref|XP_790103.3| PREDICTED: glycosyltransferase-like protein LARGE2-like, partial
           [Strongylocentrotus purpuratus]
          Length = 576

 Score =  432 bits (1110), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 215/335 (64%), Positives = 249/335 (74%), Gaps = 38/335 (11%)

Query: 211 KTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFN 270
           KTIVLDTD+ FA+DIA+LW LF R+  +QT+ LVENQSDWYLG LWK HKPWPALGRGFN
Sbjct: 278 KTIVLDTDITFASDIAELWGLFQRMTSKQTLALVENQSDWYLGTLWKKHKPWPALGRGFN 337

Query: 271 TGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQW 330
           TGVILLDL KLRD+ W   W + AEK L+++  T+LADQDI NA+I +HPYLVYTLPC W
Sbjct: 338 TGVILLDLQKLRDLKWMQMWLLTAEKELMSQFSTALADQDIINAVIKQHPYLVYTLPCAW 397

Query: 331 NVQLSDNTRSDELCYTELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRE 390
           NVQLSDNTRS ELCY E+TDLK IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRE
Sbjct: 398 NVQLSDNTRS-ELCYKEVTDLKAIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRE 456

Query: 391 LFGC--NLTQTTLQQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTL 448
           LFGC       TL + +LS+L+E+D CY+ RR +L   RTHLY+L+Y+     D  DVTL
Sbjct: 457 LFGCTDGKANNTLHE-KLSSLDEDDQCYEFRREQLLMHRTHLYYLDYQLPKFLD-TDVTL 514

Query: 449 VAQLSMDRLQMVEINPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTL 508
           VAQLSMDR                                 LQM+E + KHWEGPISL L
Sbjct: 515 VAQLSMDR---------------------------------LQMLEAICKHWEGPISLAL 541

Query: 509 YMSDAEAQQFLSYAGNSEALRSRRNIGYHVVYKEG 543
           YMSDAEAQQF+ YA +SE L +R+NIGYHVVYK+G
Sbjct: 542 YMSDAEAQQFIRYALSSETLMARKNIGYHVVYKDG 576



 Score =  192 bits (489), Expect = 4e-46,   Method: Compositional matrix adjust.
 Identities = 92/164 (56%), Positives = 109/164 (66%), Gaps = 32/164 (19%)

Query: 9   IPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSV 68
           IP CE I V+IVCAG+N++R ++ LIKSILFYRKNPLHFH I+D +   IL TLF TW V
Sbjct: 205 IPKCERIHVSIVCAGHNASRDVIILIKSILFYRKNPLHFHFISDEMGQLILSTLFKTWDV 264

Query: 69  PQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 128
           P V+VSFYLA++                                 KTIVLDTD+ FA+DI
Sbjct: 265 PGVKVSFYLAENY--------------------------------KTIVLDTDITFASDI 292

Query: 129 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           A+LW LF R+  +QT+ LVENQSDWYLG LWK HKPWPALGRGF
Sbjct: 293 AELWGLFQRMTSKQTLALVENQSDWYLGTLWKKHKPWPALGRGF 336


>gi|324506667|gb|ADY42842.1| Glycosyltransferase-like protein [Ascaris suum]
          Length = 692

 Score =  424 bits (1091), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 229/526 (43%), Positives = 305/526 (57%), Gaps = 66/526 (12%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           P++   FY +   +  VSWIPN+HYS  YGLLKL L  + PE L K +VLDTD++F  DI
Sbjct: 131 PSVNVIFYNSSLFINRVSWIPNRHYSREYGLLKLALTDIFPENLKKVVVLDTDLLFVEDI 190

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LWA +S++   Q + +VEN SDWY+     N KPWPA  RGFNTGV+L+DL KLR ++
Sbjct: 191 ARLWAFWSQMNSAQALAVVENLSDWYIVNASSNRKPWPAWNRGFNTGVMLMDLEKLRTMN 250

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   W+ +A   L     T LADQDI NA+IS  P +VY LPC+WN+Q+   +R + LC 
Sbjct: 251 WTLLWKEVAADNLKEFGATQLADQDIINAVISRKPEIVYKLPCEWNLQMGYQSRQN-LCP 309

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQTTLQQAE 405
             ++DLKI+HWNSPKK + +N +  FFR  + TF + DGNLLR E+  C+          
Sbjct: 310 ARISDLKIVHWNSPKKTRTRNPYAVFFRRHFQTFTQMDGNLLRTEMLRCS---------- 359

Query: 406 LSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEINPN 465
               NE      LR S+L S+        YEYE S DG      A+    R+Q+      
Sbjct: 360 ----NE------LRSSELRSV--------YEYEESDDGCGDLRHARNIKYRIQLF----- 396

Query: 466 NVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNS 525
            +  Y   SD  DVTL  QLSMDR  + E L   W GP+S  +Y+SD+E  Q + Y  ++
Sbjct: 397 -IRRYAHQSDDADVTLTTQLSMDRFVLFESLLAFWTGPVSAAIYLSDSELTQLMQYLADT 455

Query: 526 EALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSI 585
           +  R+R +I  H V+KEG  +PIN LRNVALN  +TP+VFL D+DFLPM GLY  ++  +
Sbjct: 456 KVFRNRTDIALHAVFKEGVHFPINYLRNVALNATNTPFVFLADVDFLPMPGLYELIRKRL 515

Query: 586 RS-MDMHGHGGKKVLVVPAFETQRYRTA-FPAS------------------------HAP 619
            S + + G    K  VVPAFE++ YR++  PAS                        HA 
Sbjct: 516 SSGLPLKG----KAFVVPAFESKGYRSSVVPASKAELLVMLDTGQVQIFRQDVWIQGHAS 571

Query: 620 TNFSRWVNATTPYQIEWAPDFEPYIVAHR-DLPRYDTRFVGFGWNK 664
           T++ RW    T Y + W  D+EPY+V  R + P YD RFVGFGWNK
Sbjct: 572 TDYDRWRTTQTEYSVAWRTDYEPYVVVARNETPPYDNRFVGFGWNK 617



 Score =  186 bits (473), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 86/161 (53%), Positives = 111/161 (68%)

Query: 12  CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQV 71
           CEV+QVAIVCAGY S+   VTL+KS+L++R++ +  H + D VA  IL TLF TW +P V
Sbjct: 74  CEVLQVAIVCAGYESSFRSVTLVKSLLYHRRHAITMHFVVDDVAAQILPTLFDTWQLPSV 133

Query: 72  EVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQL 131
            V FY +   +  VSWIPN+HYS  YGLLKL L  + PE L K +VLDTD++F  DIA+L
Sbjct: 134 NVIFYNSSLFINRVSWIPNRHYSREYGLLKLALTDIFPENLKKVVVLDTDLLFVEDIARL 193

Query: 132 WALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           WA +S++   Q + +VEN SDWY+     N KPWPA  RGF
Sbjct: 194 WAFWSQMNSAQALAVVENLSDWYIVNASSNRKPWPAWNRGF 234


>gi|357623999|gb|EHJ74927.1| hypothetical protein KGM_05471 [Danaus plexippus]
          Length = 692

 Score =  424 bits (1090), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 223/530 (42%), Positives = 313/530 (59%), Gaps = 77/530 (14%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           P +    Y A   + +V WIPN HYSGVY L+KL  P +LP+ L + IVLD+D+ F +D+
Sbjct: 138 PDVKYSCYDAQDRLSEVKWIPNNHYSGVYALVKLLFPSILPDKLEQVIVLDSDLTFLSDV 197

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRD-I 284
           +QLW +F  +   Q IGLVEN+S+WY  +  +    WPAL RG+NTGV+LLDL K+R  I
Sbjct: 198 SQLWHMFRNMSSLQFIGLVENESNWYTNQKTR----WPALRRGYNTGVMLLDLYKIRSLI 253

Query: 285 SWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELC 344
           SW   W+    + L     T+LADQD+ NAII  HP++VY + C +NVQ+S  T +   C
Sbjct: 254 SWTSVWQKTVNENLNRLKTTALADQDVINAIIKNHPHIVYDISCHYNVQMSTQTLAKN-C 312

Query: 345 YTE-LTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQTTLQQ 403
           Y E + ++KI+HWNSP K  ++ +  ++F+N++ +++ +DGNLLR +L  C+  +    +
Sbjct: 313 YGEDVKNIKILHWNSPSKYNIRIRDADYFKNIHQSYVNFDGNLLREKLHRCSQNEVVTYK 372

Query: 404 AELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEIN 463
                +N  D C   R ++   LRTH+Y+++Y Y  + D  DVTLV QLSMDR       
Sbjct: 373 -----MNHSDLCLSFRAAQRVKLRTHIYYMDYSY-MNVDNFDVTLVLQLSMDR------- 419

Query: 464 PNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAG 523
                                     LQ +E + K+WEGP+S  +Y+SD E  +  S   
Sbjct: 420 --------------------------LQFLERIVKYWEGPLSAAIYLSDCEVTKLESILR 453

Query: 524 N-SEALRSRRNIGYHVVYKEGNF-YPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
           + S  L  + NIGYH+V+K  +  YP+N LRNVAL  V+TPYVFL+D DF+PM GLY YL
Sbjct: 454 DWSSTLNIKNNIGYHLVFKHDSVHYPVNYLRNVALENVNTPYVFLMDADFVPMAGLYSYL 513

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTA---------------------------FP 614
           + SI+ ++   +  KK LVVPAFETQRYR +                           +P
Sbjct: 514 RESIKLIN--PYPQKKCLVVPAFETQRYRASPPRYKEELLSRLSIKHLGDVAPFRAREWP 571

Query: 615 ASHAPTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
             H  TN+ RW  AT PY+++W  D+EPY+VAHR +P+YDTRF GFGWNK
Sbjct: 572 RGHRATNYIRWSTATAPYEVDWQSDYEPYLVAHRSIPKYDTRFSGFGWNK 621



 Score =  164 bits (416), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 73/161 (45%), Positives = 107/161 (66%), Gaps = 5/161 (3%)

Query: 12  CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQV 71
           CE + +A+VC G   TR++  ++KS+L +R+NP+HFH + D  ++  L  LF TW +P V
Sbjct: 82  CETVHIALVCMG-KCTRNITPMLKSLLHHRQNPIHFHFVVDPESMRTLNKLFETWDLPDV 140

Query: 72  EVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQL 131
           + S Y A   + +V WIPN HYSGVY L+KL  P +LP+ L + IVLD+D+ F +D++QL
Sbjct: 141 KYSCYDAQDRLSEVKWIPNNHYSGVYALVKLLFPSILPDKLEQVIVLDSDLTFLSDVSQL 200

Query: 132 WALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           W +F  +   Q IGLVEN+S+WY  +  +    WPAL RG+
Sbjct: 201 WHMFRNMSSLQFIGLVENESNWYTNQKTR----WPALRRGY 237


>gi|198435392|ref|XP_002128775.1| PREDICTED: similar to glycosyltransferase-like 1B [Ciona
           intestinalis]
          Length = 682

 Score =  416 bits (1068), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 226/530 (42%), Positives = 306/530 (57%), Gaps = 73/530 (13%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           P     FY A   +++VSWIPNKHYSGVYGL+KLT+  VLP  + K IVLDTD+IFA D+
Sbjct: 123 PHFVVSFYQASDFIDEVSWIPNKHYSGVYGLMKLTILTVLPANVGKIIVLDTDLIFAHDV 182

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
            ++W  F +  + + IGLVENQSDWYLG LW+N+ PWPA+GRGFNTG+ILLD  KLR I 
Sbjct: 183 TEIWQYFDKFGENEAIGLVENQSDWYLGTLWENYTPWPAIGRGFNTGLILLDCMKLRAIE 242

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   W+ +A+  L   L T+LADQDI N+++S +  L++ LPC WN+QLSD+ R D LCY
Sbjct: 243 WQSKWKEVAQTDLKHFLSTALADQDIINSVLSHNASLLHVLPCYWNLQLSDHMRMD-LCY 301

Query: 346 TELT-DLKIIHWNSPKKLKVKNKHMEFFRNL---YLTFLEYDGNLLRRELFGCNLTQTTL 401
            +   ++K+IHWN+ +K    N+H E  ++L   + TF E +G  LR     C+     +
Sbjct: 302 KQHKHEVKVIHWNTRRKQ--NNRHDENIKSLTDVFRTFTEINGAFLRHPRITCSTDDPEI 359

Query: 402 QQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVE 461
            Q+   +  + D C DL  S  +  RTH+++L ++Y ++S   DV+++  L++       
Sbjct: 360 YQS--LDDTDFDMCSDLISSAASLHRTHIFYLPFDYHSNS-ATDVSIIVHLTI------- 409

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
                                     DRL + E + KHWEGPIS   + +DAEAQ  + +
Sbjct: 410 --------------------------DRLTVFEAMCKHWEGPISAAFFTTDAEAQTLVDF 443

Query: 522 AGNSEALRSRRNIGYHVVYKEGN--FYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYP 579
              S  L  R NIGYH+VYK  N   YPIN LRNVAL Q  T +VFL D+DF+PM GL+ 
Sbjct: 444 VKLSPVLMDRTNIGYHIVYKNSNHAVYPINLLRNVALKQAVTDHVFLYDVDFVPMPGLFT 503

Query: 580 YLKSSIRSMDMHGHGGKKVLVVPAFETQRYR-------------------TAF-----PA 615
           YL S +     H    K   +VPAFE+  Y+                   T+F     P 
Sbjct: 504 YLTSYLAE---HAKSPKTAYIVPAFESFWYKFKVPQTKSVLHKQIDRGVFTSFRSYMWPQ 560

Query: 616 SHAPTNFSRWVNATTPYQIEWAPDFEPYIVAHR-DLPRYDTRFVGFGWNK 664
            HA TNFSRW  A   Y + W  +FEPY+V  + +LP YD  F+GFGWNK
Sbjct: 561 GHAATNFSRWKTAKEAYPVRWETNFEPYVVVKKQNLPLYDQAFIGFGWNK 610



 Score =  215 bits (548), Expect = 5e-53,   Method: Compositional matrix adjust.
 Identities = 93/161 (57%), Positives = 120/161 (74%)

Query: 12  CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQV 71
           C  + + IVCAGY ++R +VTLIKS+LFYR + LH H I + V+  IL+ LF TW +P  
Sbjct: 66  CTTVHICIVCAGYTASRQVVTLIKSLLFYRTSKLHVHFIVNGVSRTILRNLFQTWDIPHF 125

Query: 72  EVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQL 131
            VSFY A   +++VSWIPNKHYSGVYGL+KLT+  VLP  + K IVLDTD+IFA D+ ++
Sbjct: 126 VVSFYQASDFIDEVSWIPNKHYSGVYGLMKLTILTVLPANVGKIIVLDTDLIFAHDVTEI 185

Query: 132 WALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           W  F +  + + IGLVENQSDWYLG LW+N+ PWPA+GRGF
Sbjct: 186 WQYFDKFGENEAIGLVENQSDWYLGTLWENYTPWPAIGRGF 226


>gi|339257814|ref|XP_003369093.1| glycosyl transferase family 8 [Trichinella spiralis]
 gi|316966736|gb|EFV51279.1| glycosyl transferase family 8 [Trichinella spiralis]
          Length = 763

 Score =  407 bits (1045), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 220/528 (41%), Positives = 300/528 (56%), Gaps = 69/528 (13%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           P++    Y  D  +ED++WIPN HYSG++GL KL LPK+L  ++ K +VLD D++F +DI
Sbjct: 99  PSVSLSIYRLDEALEDINWIPNGHYSGLFGLSKLILPKILSTSVTKVVVLDVDILFVSDI 158

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
            +LW   S+    Q +G+ ENQSDWYLG L+ ++KPWPALGRG+N+GVIL++L KLR I+
Sbjct: 159 FELWNFLSKFNDSQALGMTENQSDWYLGNLFLDYKPWPALGRGYNSGVILMNLLKLRAIN 218

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   ++ + ++ L T   T+LADQDIFNA+I   P LVY LPC +N+QL+DN R  ELC 
Sbjct: 219 WTELYKAVTKELLNTYNRTNLADQDIFNAVIQRLPALVYRLPCVYNLQLNDNARR-ELCS 277

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQTTLQQAE 405
            +  D+KIIHWN+ +K  ++N +   +R+LY  F++ DGN LR     C L         
Sbjct: 278 YDQADMKIIHWNNYRKPLLENPNESHYRSLYQQFVDLDGNFLRHINKNCKLETAKPSVEN 337

Query: 406 LSNLNEE--DPCYDLRRSKLTSLRTHLYF---LEYEYEASSDGNDVTLVAQLSMDRLQMV 460
           +SN  +   + C   + +     R HLYF   ++++ +     NDVTL+ QLS DRL   
Sbjct: 338 ISNNYDSFTNLCLPFKTASNLMHRMHLYFGNVVQFDLKL----NDVTLLTQLSFDRL--- 390

Query: 461 EINPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLS 520
                                   +S++RL +       W GPI   LY++DAEA Q  +
Sbjct: 391 ------------------------VSLNRLLI------QWTGPIIAVLYLTDAEAFQLAT 420

Query: 521 YAGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPY 580
           +  N + L  R N+  HVVYKEGN YP+N LRN+AL    TP+VF+ DIDF+P  GLY +
Sbjct: 421 FMEN-DFLNIRNNVIIHVVYKEGNLYPVNKLRNIALQSSITPFVFMNDIDFVPATGLYEH 479

Query: 581 LKSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------ 616
           L   I          KK  +VPAFE   Y    P S                        
Sbjct: 480 L-LQILPQQRDAIASKKAYIVPAFEVYSYHVIIPESKTDLLKSLDSGEIHIFRSKEWVKG 538

Query: 617 HAPTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           H  T++ RW NA  PY+I W+ DFEPY V  RD+P YD RFVGFGWNK
Sbjct: 539 HLATDYDRWRNALEPYKIAWSTDFEPYFVVRRDVPLYDERFVGFGWNK 586



 Score =  204 bits (520), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 103/225 (45%), Positives = 151/225 (67%), Gaps = 7/225 (3%)

Query: 12  CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDT-VALNILQTLFSTWSVPQ 70
           CEVI +A+V  G+ S   L+TL+KS+LFYR  PLHFH++TD+ ++  IL+TLFSTW++P 
Sbjct: 41  CEVIHIAVVSGGFQSRLQLLTLLKSVLFYRSTPLHFHIVTDSDLSEKILRTLFSTWNLPS 100

Query: 71  VEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQ 130
           V +S Y  D  +ED++WIPN HYSG++GL KL LPK+L  ++ K +VLD D++F +DI +
Sbjct: 101 VSLSIYRLDEALEDINWIPNGHYSGLFGLSKLILPKILSTSVTKVVVLDVDILFVSDIFE 160

Query: 131 LWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFYLADSVVEDVSWIPNKHY 190
           LW   S+    Q +G+ ENQSDWYLG L+ ++KPWPALGRG Y +  ++ ++  +   ++
Sbjct: 161 LWNFLSKFNDSQALGMTENQSDWYLGNLFLDYKPWPALGRG-YNSGVILMNLLKLRAINW 219

Query: 191 SGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRL 235
           + +Y      + K L  T  +T + D D IF   I +L AL  RL
Sbjct: 220 TELYK----AVTKELLNTYNRTNLADQD-IFNAVIQRLPALVYRL 259


>gi|220732255|emb|CAX15215.1| like-glycosyltransferase [Homo sapiens]
          Length = 325

 Score =  393 bits (1010), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 195/325 (60%), Positives = 236/325 (72%), Gaps = 41/325 (12%)

Query: 298 LLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
           L+  L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY +++DLK+IHWN
Sbjct: 1   LMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CYRDVSDLKVIHWN 59

Query: 358 SPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTLQQAELSNLNEED 413
           SPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC    ++    LQ+ +LS L+E+D
Sbjct: 60  SPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEADVNSENLQK-QLSELDEDD 118

Query: 414 PCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEINPNNVDEYEAS 473
            CY+ RR + T  RTHLYFL YEYE ++D  DVTLVAQLSMDRLQM+E            
Sbjct: 119 LCYEFRRERFTVHRTHLYFLHYEYEPAADSTDVTLVAQLSMDRLQMLEA----------- 167

Query: 474 SDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRN 533
                                 + KHWEGPISL LY+SDAEAQQFL YA  SE L SR N
Sbjct: 168 ----------------------ICKHWEGPISLALYLSDAEAQQFLRYAQGSEVLMSRHN 205

Query: 534 IGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGH 593
           +GYH+VYKEG FYP+N LRNVA+  +STPY+FL DIDFLPM+GLY YL+ S+  +D+   
Sbjct: 206 VGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLYEYLRKSVIQLDL--A 263

Query: 594 GGKKVLVVPAFETQRYRTAFPASHA 618
             KK ++VPAFET RYR +FP S A
Sbjct: 264 NTKKAMIVPAFETLRYRLSFPKSKA 288


>gi|320162741|gb|EFW39640.1| glycosyltransferase-like protein LARGE1 [Capsaspora owczarzaki ATCC
           30864]
          Length = 815

 Score =  378 bits (970), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 218/561 (38%), Positives = 302/561 (53%), Gaps = 110/561 (19%)

Query: 182 VSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTI 241
           V+ IP  HYSG +GL+KL L  +LPE++   + LDTD++F  D+A LW  FS   + Q  
Sbjct: 229 VASIPTVHYSGKFGLIKLVLANILPESVTHVLSLDTDLLFMRDVAHLWGHFSEFTEDQMF 288

Query: 242 GLVENQSDWYLGKLWKNHK-----------PWPALGRGFNTGVILLDLTKLRDISWAGFW 290
           GLVENQS+WY+ +  ++ K           PWPAL  GFNTGV+LL L +LR  +W   W
Sbjct: 289 GLVENQSEWYIEQEARSSKTTSASKLLPSLPWPALSHGFNTGVMLLRLDRLRLRAWDETW 348

Query: 291 RIIAEKFLLTR----LWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY- 345
             +A K L  R      T LADQD+ NA+      +VY LPC WN+QLSDN+ +      
Sbjct: 349 LSVAMKELALRPAGKQTTQLADQDVINAVALTQSSIVYRLPCTWNLQLSDNSLAAPCVIA 408

Query: 346 --------------------------TELTDLK-------IIHWNSPKKLKVKNKHMEFF 372
                                     T ++ LK       I+H+NSP+K+KV+N   E++
Sbjct: 409 ALNGHDSVKNWQVSTFAPEGNPSDPKTVISKLKNAPEAVGIVHFNSPRKMKVQNAFTEYY 468

Query: 373 RNLYLTFLEYDGNLLRRELFGCN---LTQTTLQQAELSN--LNEEDPCYDLRRSKLTSLR 427
           R+L+ TF + DG+ LRREL  C    L   T   A  S   +  +DPC + R +  T  R
Sbjct: 469 RHLHATFSQLDGSTLRRELISCRPELLDPATSSHAMHSEAEVESDDPCIEYRLAASTKPR 528

Query: 428 THLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEINPNNVDEYEASSDGNDVTLVAQLSM 487
           TH+Y+ +Y                          I P       A   G +VT ++Q+S 
Sbjct: 529 THVYYFDY--------------------------IPPT------AKETGQEVTWISQMSW 556

Query: 488 DRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRNIGYHVVYKEGNFYP 547
           DRL M+E + + W GP+S++LY++D +    + +   S+ LR+R+NI  H+V+KEG  +P
Sbjct: 557 DRLGMLEQIAERWSGPLSISLYLNDYDVDDLVDHFETSDILRTRKNIALHLVFKEGTLFP 616

Query: 548 INTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQ 607
           +N LRNVAL+  +TPYVFL DIDFLP   +  YL S   S+D+      K LVVPAFET 
Sbjct: 617 VNYLRNVALDHATTPYVFLADIDFLPSAQMATYLASMAASLDLDNKNLPKALVVPAFETF 676

Query: 608 RYRTAFPAS------------------------HAPTNFSRWVNATTPYQIEWAPDFEPY 643
            Y+ AFPA+                        H+PT+F+++  AT PY I+W  DFEPY
Sbjct: 677 HYKFAFPANKNKLNSMLDAESIVSFRANEWPRGHSPTDFNKFRLATAPYVIQWQEDFEPY 736

Query: 644 IVAHRDLPRYDTRFVGFGWNK 664
           I+  RD+PRYD RF+GFGWNK
Sbjct: 737 ILVRRDVPRYDPRFLGFGWNK 757



 Score =  166 bits (419), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 78/177 (44%), Positives = 106/177 (59%), Gaps = 11/177 (6%)

Query: 7   PVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTW 66
           P   TCE I + +VCAG+ S R +V L+KSILFYR NPLH H + D  +   + TL  TW
Sbjct: 151 PRAATCETIHIVVVCAGFRSVRMMVPLLKSILFYRTNPLHLHFVVDEASREPIGTLIETW 210

Query: 67  SVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 126
           ++ +VE+S Y        V+ IP  HYSG +GL+KL L  +LPE++   + LDTD++F  
Sbjct: 211 NLSRVELSLYDLAPFESKVASIPTVHYSGKFGLIKLVLANILPESVTHVLSLDTDLLFMR 270

Query: 127 DIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHK-----------PWPALGRGF 172
           D+A LW  FS   + Q  GLVENQS+WY+ +  ++ K           PWPAL  GF
Sbjct: 271 DVAHLWGHFSEFTEDQMFGLVENQSEWYIEQEARSSKTTSASKLLPSLPWPALSHGF 327


>gi|220732254|emb|CAX15214.1| like-glycosyltransferase [Homo sapiens]
          Length = 276

 Score =  359 bits (921), Expect = 3e-96,   Method: Compositional matrix adjust.
 Identities = 178/288 (61%), Positives = 212/288 (73%), Gaps = 39/288 (13%)

Query: 298 LLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
           L+  L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY +++DLK+IHWN
Sbjct: 1   LMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CYRDVSDLKVIHWN 59

Query: 358 SPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTLQQAELSNLNEED 413
           SPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC    ++    LQ+ +LS L+E+D
Sbjct: 60  SPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEADVNSENLQK-QLSELDEDD 118

Query: 414 PCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEINPNNVDEYEAS 473
            CY+ RR + T  RTHLYFL YEYE ++D  DVTLVAQLSMDR                 
Sbjct: 119 LCYEFRRERFTVHRTHLYFLHYEYEPAADSTDVTLVAQLSMDR----------------- 161

Query: 474 SDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRN 533
                           LQM+E + KHWEGPISL LY+SDAEAQQFL YA  SE L SR N
Sbjct: 162 ----------------LQMLEAICKHWEGPISLALYLSDAEAQQFLRYAQGSEVLMSRHN 205

Query: 534 IGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
           +GYH+VYKEG FYP+N LRNVA+  +STPY+FL DIDFLPM+GLY YL
Sbjct: 206 VGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLYEYL 253


>gi|312069691|ref|XP_003137800.1| glycosyl transferase family 8 protein [Loa loa]
 gi|307767035|gb|EFO26269.1| glycosyl transferase family 8 protein [Loa loa]
          Length = 698

 Score =  357 bits (917), Expect = 9e-96,   Method: Compositional matrix adjust.
 Identities = 203/531 (38%), Positives = 285/531 (53%), Gaps = 74/531 (13%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY A   ++  SWIPN+HYSG YGLLKL L  +LP  + K I LDTDV+   DI
Sbjct: 127 PAVKVTFYNASQYLDRFSWIPNRHYSGRYGLLKLILNDILPADVDKVIALDTDVLIMGDI 186

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYL-GKLWKNHKPWPALGRGFNTGVILLDLTKLRDI 284
           AQLW+ FS++   Q IGLVEN SDWYL  +       WPA GRGFN+GV+LLDL KLR++
Sbjct: 187 AQLWSFFSKMANLQAIGLVENLSDWYLFNRSTPQRTVWPAWGRGFNSGVMLLDLAKLRNM 246

Query: 285 SWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELC 344
           SW+  W   A K +       LADQD+ NA+I++H ++V  LPC+WN QL   ++ + LC
Sbjct: 247 SWSHIWEESASKNVKEYGPVELADQDVINAVINDHRWIVQKLPCEWNFQLGFQSQQN-LC 305

Query: 345 YTELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQTTLQQA 404
             E++ LK++HWNSP K ++ N++    R  Y +  + DG++ R     C    +  + +
Sbjct: 306 AVEISHLKLVHWNSPLKTRIVNRYAVLLRRYYDSVRDIDGSMFRSNTIHCRYG-SQREHS 364

Query: 405 ELSNLNE----EDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMV 460
            L  ++E    +D C ++R ++  + RT LY   Y + A+                    
Sbjct: 365 TLDKVSEYDDNDDGCSEIRHARWITYRTLLYVRPYNFTAA-------------------- 404

Query: 461 EINPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLS 520
              P NV             L+ Q S+DRL     L ++W GP+S  +Y++D+E    + 
Sbjct: 405 ---PKNV------------VLITQFSVDRLMHFNALLQYWTGPVSAAVYVTDSELSLLIQ 449

Query: 521 YAGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVS-TPYVFLLDIDFLPMFGLYP 579
           +  ++  L +R N+  H VYKEG +YPIN LRNVALN  +   +VFL D+DF P  GLY 
Sbjct: 450 FFDDT--LVNRTNVALHAVYKEGTYYPINYLRNVALNNSNDASFVFLADVDFTPAPGLYT 507

Query: 580 YLKSSIRSMDMHGHGGKKVLVVPAFE---------------------TQRY----RTAFP 614
            L   +   D      K+  VVPAFE                      +R     R+ + 
Sbjct: 508 MLYKKLVKTD---STNKRAFVVPAFEYTGNRVPAVPLTKNELLRELDARRMQIFRRSVWI 564

Query: 615 ASHAPTNFSRWVNATTPYQIEWAPDFEPYIVAHRD-LPRYDTRFVGFGWNK 664
             HA T++ RW +A   Y + W  D+EPY+V  R  LP YD RFVGFGWNK
Sbjct: 565 QGHAATDYDRWRHADQEYSVSWKADYEPYVVVRRSGLPPYDQRFVGFGWNK 615



 Score =  167 bits (422), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 85/172 (49%), Positives = 110/172 (63%), Gaps = 2/172 (1%)

Query: 3   SDEEPVIPTCEVIQVA-IVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQT 61
           SD E     C+V+ +A +VC     +  L TLIKSIL YR+  +  HL+ D + ++I+  
Sbjct: 60  SDNEHRSQHCQVLHLALVVCGAVQLSYPLSTLIKSILRYRQQAIVLHLLVDDITMSIISL 119

Query: 62  LFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTD 121
           LFSTW +P V+V+FY A   ++  SWIPN+HYSG YGLLKL L  +LP  + K I LDTD
Sbjct: 120 LFSTWRLPAVKVTFYNASQYLDRFSWIPNRHYSGRYGLLKLILNDILPADVDKVIALDTD 179

Query: 122 VIFATDIAQLWALFSRLRQRQTIGLVENQSDWYL-GKLWKNHKPWPALGRGF 172
           V+   DIAQLW+ FS++   Q IGLVEN SDWYL  +       WPA GRGF
Sbjct: 180 VLIMGDIAQLWSFFSKMANLQAIGLVENLSDWYLFNRSTPQRTVWPAWGRGF 231


>gi|426394261|ref|XP_004063418.1| PREDICTED: glycosyltransferase-like protein LARGE1 [Gorilla gorilla
           gorilla]
          Length = 780

 Score =  352 bits (904), Expect = 3e-94,   Method: Compositional matrix adjust.
 Identities = 160/229 (69%), Positives = 192/229 (83%), Gaps = 1/229 (0%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ FATDI
Sbjct: 292 PAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 351

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR + 
Sbjct: 352 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 411

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   WR+ AE+ L+  L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 412 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 470

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC 394
            +++DLK+IHWNSPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC
Sbjct: 471 RDVSDLKVIHWNSPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGC 519



 Score =  257 bits (656), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 111/168 (66%), Positives = 137/168 (81%)

Query: 5   EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
           ++PV+  CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A  IL TLF 
Sbjct: 228 QQPVVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 287

Query: 65  TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
           TW VP V V FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ F
Sbjct: 288 TWMVPAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 347

Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 348 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 395



 Score =  107 bits (267), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 51/109 (46%), Positives = 63/109 (57%), Gaps = 26/109 (23%)

Query: 580 YLKSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS----------------------- 616
           Y + S+  +D+     KK ++VPAFET RYR +FP S                       
Sbjct: 599 YFRKSVIQLDLAN--TKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTK 656

Query: 617 -HAPTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
            HAPTNF++W  ATTPY++EW  DFEPY+V  RD P YD RFVGFGWNK
Sbjct: 657 GHAPTNFAKWRTATTPYRVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 705


>gi|170589603|ref|XP_001899563.1| Glycosyl transferase family 8 protein [Brugia malayi]
 gi|158593776|gb|EDP32371.1| Glycosyl transferase family 8 protein [Brugia malayi]
          Length = 603

 Score =  344 bits (882), Expect = 1e-91,   Method: Compositional matrix adjust.
 Identities = 201/529 (37%), Positives = 278/529 (52%), Gaps = 81/529 (15%)

Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
           +Y A   ++  SWIPN+HYSG YG LKL L  +LP  + K I+LDTD +   DIAQLW+ 
Sbjct: 47  YYNASQYLDRFSWIPNRHYSGRYGFLKLILNDILPADVDKVIILDTDALIMDDIAQLWSF 106

Query: 232 FSRLRQRQTIGLVENQSDWYL-GKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFW 290
           FS++   Q IGL EN S+WYL  +       WPA GRGFN+GV+LLDLTKLR+I+W+  W
Sbjct: 107 FSKMTSLQAIGLAENLSNWYLINRNTSQRIVWPAWGRGFNSGVMLLDLTKLRNINWSHIW 166

Query: 291 RIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTD 350
           + IA K +       LADQD+ NA+I+ H ++V+ LPC+WN QL   ++ + LC  E++ 
Sbjct: 167 KEIASKNIKDYGPVELADQDVINAVINHHQWIVHKLPCEWNFQLGFQSQQN-LCPVEISR 225

Query: 351 LKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQTTLQQAELSNLN 410
           LK++HWNSP K +  N++    R  Y +  + DGN+ R  +  C+       Q E S L+
Sbjct: 226 LKLVHWNSPLKTRTVNRYAVLLRRYYDSMRDMDGNMFRSNIIHCHYD----SQREYSALD 281

Query: 411 E-------EDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEIN 463
           +       +D C ++R ++  + RT LY   Y + A     +V L+ Q SM         
Sbjct: 282 KVSEYDDNDDGCSEIRHARWITYRTLLYVRPYNFIAVP--KNVVLITQFSM--------- 330

Query: 464 PNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAG 523
                                   DRL     L ++W GP+S  +Y++D+E    + +  
Sbjct: 331 ------------------------DRLMHFNALLQYWTGPVSAAVYVTDSELSLLIQFF- 365

Query: 524 NSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVS-TPYVFLLDIDFLPM-FGLYPYL 581
             + L +R N+  H VYKE  +YPIN LRNVALN  +   +VFL D+DF+P   GLY  L
Sbjct: 366 -DDTLANRTNVALHAVYKESTYYPINYLRNVALNSSNDASFVFLADVDFIPRPGGLYATL 424

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFE---------------------TQRY----RTAFPAS 616
               + + M     K+  VVPAFE                      QR     R  +   
Sbjct: 425 ---CKKLVMTNSTNKRAFVVPAFEYMGNRVPVIPLTKNELLIELDAQRMQIFRRNKWIQG 481

Query: 617 HAPTNFSRWVNATTPYQIEWAPDFEPYIVA-HRDLPRYDTRFVGFGWNK 664
           H  TN+ RW  A   Y + W  D+EPYIV   R LP YD RFVGFGWNK
Sbjct: 482 HVATNYDRWRYADQEYSVSWRTDYEPYIVVRRRGLPPYDQRFVGFGWNK 530



 Score =  105 bits (263), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 51/103 (49%), Positives = 66/103 (64%), Gaps = 1/103 (0%)

Query: 71  VEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQ 130
           ++V +Y A   ++  SWIPN+HYSG YG LKL L  +LP  + K I+LDTD +   DIAQ
Sbjct: 43  LKVMYYNASQYLDRFSWIPNRHYSGRYGFLKLILNDILPADVDKVIILDTDALIMDDIAQ 102

Query: 131 LWALFSRLRQRQTIGLVENQSDWYL-GKLWKNHKPWPALGRGF 172
           LW+ FS++   Q IGL EN S+WYL  +       WPA GRGF
Sbjct: 103 LWSFFSKMTSLQAIGLAENLSNWYLINRNTSQRIVWPAWGRGF 145


>gi|355694034|gb|AER99533.1| glycosyltransferase-like 1B [Mustela putorius furo]
          Length = 266

 Score =  333 bits (854), Expect = 2e-88,   Method: Compositional matrix adjust.
 Identities = 158/229 (68%), Positives = 183/229 (79%), Gaps = 1/229 (0%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY A+ +   VSWIPNKHYSG+YGL+KL LP  LP  LA+ IVLDTDV FA+DI
Sbjct: 28  PAVRVSFYDAEELKPQVSWIPNKHYSGLYGLMKLVLPAALPPDLARVIVLDTDVTFASDI 87

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LWALF+     Q IGLVENQSDWYLG LW+NH+PWPALGRGFNTGVILL L +LR   
Sbjct: 88  AELWALFAHFSDEQVIGLVENQSDWYLGNLWRNHRPWPALGRGFNTGVILLRLDRLRQAG 147

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   W++ A + LLT   TSLADQDIFNA+I EHP LV TLPC WNVQLSD+T + E CY
Sbjct: 148 WGQMWKLTATRELLTLPATSLADQDIFNAVIKEHPELVQTLPCIWNVQLSDHTLA-ERCY 206

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC 394
           +E +DLK+IHWNSPKKL+VKNKH+ +FRNL+LTFL YDGNLLRRELFGC
Sbjct: 207 SEASDLKVIHWNSPKKLRVKNKHVGYFRNLHLTFLGYDGNLLRRELFGC 255



 Score =  202 bits (514), Expect = 5e-49,   Method: Compositional matrix adjust.
 Identities = 93/131 (70%), Positives = 105/131 (80%)

Query: 42  KNPLHFHLITDTVALNILQTLFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLK 101
           KNPLHFHL+TD VA NIL+ LF TW VP V VSFY A+ +   VSWIPNKHYSG+YGL+K
Sbjct: 1   KNPLHFHLVTDAVARNILEMLFYTWMVPAVRVSFYDAEELKPQVSWIPNKHYSGLYGLMK 60

Query: 102 LTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKN 161
           L LP  LP  LA+ IVLDTDV FA+DIA+LWALF+     Q IGLVENQSDWYLG LW+N
Sbjct: 61  LVLPAALPPDLARVIVLDTDVTFASDIAELWALFAHFSDEQVIGLVENQSDWYLGNLWRN 120

Query: 162 HKPWPALGRGF 172
           H+PWPALGRGF
Sbjct: 121 HRPWPALGRGF 131


>gi|395742848|ref|XP_002821888.2| PREDICTED: glycosyltransferase-like 1B [Pongo abelii]
          Length = 624

 Score =  328 bits (841), Expect = 6e-87,   Method: Compositional matrix adjust.
 Identities = 174/345 (50%), Positives = 213/345 (61%), Gaps = 64/345 (18%)

Query: 348 LTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLT----QTTLQQ 403
           L   ++IHWNSPKKL+VKNKH+EFFRN YLTFL YDGNLLRRELF C          LQQ
Sbjct: 235 LPGSQVIHWNSPKKLRVKNKHVEFFRNFYLTFLGYDGNLLRRELFVCPSQPPPGAEQLQQ 294

Query: 404 AELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEIN 463
           A L+ L+EEDPC++ R+ +LT  R H+ FL +E                           
Sbjct: 295 A-LAQLDEEDPCFEFRQQQLTVHRVHVTFLPHEPPPPRP--------------------- 332

Query: 464 PNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAG 523
                        +DVTLVAQLSMDRLQM+E L +HW GP+SL LY++DAEAQQFL +  
Sbjct: 333 -------------HDVTLVAQLSMDRLQMLEALCRHWPGPMSLALYLTDAEAQQFLHFVE 379

Query: 524 NSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKS 583
            S  L +R+++ YHVVY+EG  YP+N LRNVAL Q  TPYVFL DIDFLP + LY YL++
Sbjct: 380 ASPVLAARQDVAYHVVYREGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYNYLRA 439

Query: 584 SIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------HAP 619
           SI  + + G   K  LVVPAFET RYR +FP S                        HAP
Sbjct: 440 SIEQLGL-GSQRKAALVVPAFETLRYRFSFPHSKVELLALLDAGALYTFRYHEWPRGHAP 498

Query: 620 TNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           T+++RW  A  PY+++WA D+EPY+V  RD PRYD RFVGFGWNK
Sbjct: 499 TDYARWREAQAPYRVQWAADYEPYVVVPRDCPRYDPRFVGFGWNK 543



 Score =  253 bits (647), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 114/161 (70%), Positives = 132/161 (81%)

Query: 12  CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQV 71
           CE++ VAIVCAG+NS+R ++TL+KS+LFYRKNPLH HL+TD VA NIL+TLF TW VP V
Sbjct: 65  CELLHVAIVCAGHNSSRDVITLVKSMLFYRKNPLHLHLVTDAVARNILETLFHTWMVPAV 124

Query: 72  EVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQL 131
            VSFY AD +   VSWIPNKHYSG+YGL+KL LP  LP  LA+ IVLDTDV FA+DI+ L
Sbjct: 125 RVSFYHADELKPQVSWIPNKHYSGLYGLMKLVLPSALPAELARVIVLDTDVTFASDISAL 184

Query: 132 WALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           WALF+     Q IGLVENQSDWYLG LWKNH+PWPALGRGF
Sbjct: 185 WALFAHFSDTQAIGLVENQSDWYLGNLWKNHRPWPALGRGF 225



 Score =  166 bits (420), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 75/107 (70%), Positives = 84/107 (78%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY AD +   VSWIPNKHYSG+YGL+KL LP  LP  LA+ IVLDTDV FA+DI
Sbjct: 122 PAVRVSFYHADELKPQVSWIPNKHYSGLYGLMKLVLPSALPAELARVIVLDTDVTFASDI 181

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTG 272
           + LWALF+     Q IGLVENQSDWYLG LWKNH+PWPALGRGFNTG
Sbjct: 182 SALWALFAHFSDTQAIGLVENQSDWYLGNLWKNHRPWPALGRGFNTG 228


>gi|324511340|gb|ADY44727.1| Glycosyltransferase-like protein [Ascaris suum]
          Length = 477

 Score =  313 bits (803), Expect = 2e-82,   Method: Compositional matrix adjust.
 Identities = 165/378 (43%), Positives = 222/378 (58%), Gaps = 35/378 (9%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           P++   FY +   +  VSWIPN+HYS  YGLLKL L  + PE L K +VLDTD++F  DI
Sbjct: 131 PSVNVIFYNSSLFINRVSWIPNRHYSREYGLLKLALTDIFPENLKKVVVLDTDLLFVEDI 190

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LWA +S++   Q + +VEN SDWY+     N KPWPA  RGFNTGV+L+DL KLR ++
Sbjct: 191 ARLWAFWSQMNSAQALAVVENLSDWYIVNASSNRKPWPAWNRGFNTGVMLMDLEKLRTMN 250

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   W+ +A   L     T LADQDI NA+IS  P +VY LPC+WN+Q+   +R + LC 
Sbjct: 251 WTLLWKEVAADNLKEFGATQLADQDIINAVISRKPEIVYKLPCEWNLQMGYQSRQN-LCP 309

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQTTLQQAE 405
             ++DLKI+HWNSPKK + +N +  FFR  + TF + DGNLLR E+  C+          
Sbjct: 310 ARISDLKIVHWNSPKKTRTRNPYAVFFRRHFQTFTQMDGNLLRTEMLRCS---------- 359

Query: 406 LSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEINPN 465
               NE      LR S+L S+        YEYE S DG      A+    R+Q+      
Sbjct: 360 ----NE------LRSSELRSV--------YEYEESDDGCGDLRHARNIKYRIQLF----- 396

Query: 466 NVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNS 525
            +  Y   SD  DVTL  QLSMDR  + E L   W GP+S  +Y+SD+E  Q + Y  ++
Sbjct: 397 -IRRYAHQSDDADVTLTTQLSMDRFVLFESLLAFWTGPVSAAIYLSDSELTQLMQYLADT 455

Query: 526 EALRSRRNIGYHVVYKEG 543
           +  R+R +I  H V+KEG
Sbjct: 456 KVFRNRTDIALHAVFKEG 473



 Score =  187 bits (474), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 86/161 (53%), Positives = 111/161 (68%)

Query: 12  CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQV 71
           CEV+QVAIVCAGY S+   VTL+KS+L++R++ +  H + D VA  IL TLF TW +P V
Sbjct: 74  CEVLQVAIVCAGYESSFRSVTLVKSLLYHRRHAITMHFVVDDVAAQILPTLFDTWQLPSV 133

Query: 72  EVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQL 131
            V FY +   +  VSWIPN+HYS  YGLLKL L  + PE L K +VLDTD++F  DIA+L
Sbjct: 134 NVIFYNSSLFINRVSWIPNRHYSREYGLLKLALTDIFPENLKKVVVLDTDLLFVEDIARL 193

Query: 132 WALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           WA +S++   Q + +VEN SDWY+     N KPWPA  RGF
Sbjct: 194 WAFWSQMNSAQALAVVENLSDWYIVNASSNRKPWPAWNRGF 234


>gi|298684037|gb|ADI96198.1| LARGE-1 [Mastomys erythroleucus]
          Length = 671

 Score =  309 bits (791), Expect = 4e-81,   Method: Compositional matrix adjust.
 Identities = 153/286 (53%), Positives = 182/286 (63%), Gaps = 59/286 (20%)

Query: 403 QAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEI 462
           Q +LS L+E+D CY+ RR + T  RTHLYFL YE+E S+                     
Sbjct: 367 QKQLSELDEDDLCYEFRRERFTVHRTHLYFLHYEFEPSA--------------------- 405

Query: 463 NPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYA 522
                       D  DVTLVAQLSMDRLQM+E + KHWEGPISL LY+SDAEAQQFL YA
Sbjct: 406 ------------DNTDVTLVAQLSMDRLQMLEAICKHWEGPISLALYLSDAEAQQFLRYA 453

Query: 523 GNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLK 582
             SE L SR+N+GYH+VYKEG FYP+N LRNVA+  +STPY+FL DIDFLPM+GLY YL+
Sbjct: 454 QGSEVLMSRQNVGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLYEYLR 513

Query: 583 SSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------HA 618
            S+  +D+     KK ++VPAFET RYR +FP S                        HA
Sbjct: 514 KSVIQLDL--ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVXTKGHA 571

Query: 619 PTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           PTNF++W  ATTPYQ+EW  DFEPY+V  RD P YD RFVGFGWNK
Sbjct: 572 PTNFAKWRTATTPYQVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 617



 Score =  254 bits (650), Expect = 8e-65,   Method: Compositional matrix adjust.
 Identities = 110/168 (65%), Positives = 136/168 (80%)

Query: 5   EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
           ++P +  CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A  IL TLF 
Sbjct: 64  QQPAVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 123

Query: 65  TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
           TW VP V V FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ F
Sbjct: 124 TWMVPAVGVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 183

Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 184 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 231



 Score =  231 bits (589), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 108/180 (60%), Positives = 132/180 (73%), Gaps = 1/180 (0%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+G  FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ FATDI
Sbjct: 128 PAVGVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 187

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR + 
Sbjct: 188 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 247

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   WR+ AE+ L+  L TS  DQ    A+I    +L   LPC   VQLSD+TRS++ CY
Sbjct: 248 WEQMWRLTAERELMGMLSTSXXDQXXXXAVIXXXXFLXXXLPCXXXVQLSDHTRSEQ-CY 306


>gi|358337461|dbj|GAA55819.1| glycosyltransferase-like protein LARGE [Clonorchis sinensis]
          Length = 767

 Score =  308 bits (789), Expect = 6e-81,   Method: Compositional matrix adjust.
 Identities = 191/562 (33%), Positives = 269/562 (47%), Gaps = 91/562 (16%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           P +    Y       +VSWIP  HYSG +GL KL LP +LP T+ K I LD D++   DI
Sbjct: 156 PGVQFSLYSVHDYENEVSWIPTTHYSGTFGLCKLLLPNILPPTVEKVIGLDVDLLLNADI 215

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
            +LW  F R    Q IGLV NQS WYL      +  WPALG G+NTGV+LL L +++ + 
Sbjct: 216 RELWDCFDRFTHSQMIGLVRNQSPWYLQG--AQNTVWPALGYGYNTGVMLLHLQRMKLMR 273

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTR------ 339
           W   W+ +A+  L       LADQD+ NA++ + P  V  L C+WNVQL+   +      
Sbjct: 274 WKEAWQSVAKSTLQYLPHAPLADQDVINALLVQMPDAVNELACEWNVQLNSRAQPMQCPV 333

Query: 340 --------SDELCYTELTDLKIIHWNSPKKLKVKNKHMEF----------------FRNL 375
                   S +   + +  LKI H+NSP K ++ N+                    F   
Sbjct: 334 HWLTVQKPSPDNINSHIGQLKIAHYNSPVKPELMNQAFSAEDESSPGDIGLRLRTNFVQQ 393

Query: 376 YLTFLEYDGNLLR-----RELFGCNLTQTTLQQAELSNLNEEDPCYDLRRSKLTSLRTHL 430
           Y  F  +DG +L+     R    C   +T   QA L+       C +L        R H 
Sbjct: 394 YRFFQNFDGVILKHQVRERRSEECEDHETDEAQAPLAG-----ECSELIAQVQIHHRIHP 448

Query: 431 YFLEYEYEASSDGNDVTLVAQLSMDRLQMVEINPNNVDEYEASSDGNDVTLVAQLSMDRL 490
           +++  E +  +                      P    E        DVTLV+QL+ DR+
Sbjct: 449 FYMHCEIQPKA---------------------VPTKCAEGCVVDQDADVTLVSQLTFDRI 487

Query: 491 QMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRNIGYHVVYKEGNFYPINT 550
             VE + K WEGP+SL LY++D +A   + +   S+ L +R NIGYH+VY +G  YPIN 
Sbjct: 488 HRVEEIAKRWEGPMSLALYVTDRDAAVLVDFVSRSQLLVNRTNIGYHLVYVDGVLYPINK 547

Query: 551 LRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSI----RSMDMHGHGGKKVLVVPAFET 606
           LRNVA+    T ++F+LD+DF+P   +Y  LK+ I    R  +      +  LVVPAFET
Sbjct: 548 LRNVAMQFAQTKFLFILDVDFIPSPNMYRELKTIIERQLRGPNKETRTERWCLVVPAFET 607

Query: 607 QRYRTAFP------------------------ASHAPTNFSRWVNATTPYQIEWAPDFEP 642
                  P                        A H  T++ RW N +  Y++ W+ D+EP
Sbjct: 608 FGEHVQLPETKDALLAQWAQNSVVPFRHEIWTAGHMATDYERWKNTSDLYEVFWSADYEP 667

Query: 643 YIVAHRDLPRYDTRFVGFGWNK 664
           YIV  R++ ++D  FVGFGWNK
Sbjct: 668 YIVIPRNVLQFDEHFVGFGWNK 689



 Score =  130 bits (326), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 72/184 (39%), Positives = 92/184 (50%), Gaps = 28/184 (15%)

Query: 15  IQVAIVCAGYNSTRSLVTLIKSILFY--------------------------RKNPLHFH 48
           IQ+A+V  G  + R   TL+KSI+++                          R   LH H
Sbjct: 76  IQLALVVGGAQAARQTSTLLKSIVYFAQWTYNCSGRNRSHDRNKQKVSGDNSRFPQLHLH 135

Query: 49  LITDTVALNILQTLFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVL 108
           +I D  A   L +L  TW +P V+ S Y       +VSWIP  HYSG +GL KL LP +L
Sbjct: 136 MIVDRHAFISLNSLLQTWELPGVQFSLYSVHDYENEVSWIPTTHYSGTFGLCKLLLPNIL 195

Query: 109 PETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPAL 168
           P T+ K I LD D++   DI +LW  F R    Q IGLV NQS WYL      +  WPAL
Sbjct: 196 PPTVEKVIGLDVDLLLNADIRELWDCFDRFTHSQMIGLVRNQSPWYLQG--AQNTVWPAL 253

Query: 169 GRGF 172
           G G+
Sbjct: 254 GYGY 257


>gi|345317667|ref|XP_001521231.2| PREDICTED: glycosyltransferase-like protein LARGE2-like, partial
           [Ornithorhynchus anatinus]
          Length = 285

 Score =  308 bits (789), Expect = 6e-81,   Method: Compositional matrix adjust.
 Identities = 157/284 (55%), Positives = 192/284 (67%), Gaps = 42/284 (14%)

Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWNSPKKLKV 364
           +L   DIFNA+I +HP+LV+ LPC WNVQLSD+TRS++ CY+E++DLK+IHWNSPKKL+V
Sbjct: 37  TLTPGDIFNAVIKQHPWLVHPLPCSWNVQLSDHTRSEQ-CYSEVSDLKVIHWNSPKKLRV 95

Query: 365 KNKHMEFFRNLYLTFLEYDGNLLRRELFGCN-------LTQTTLQQAELSNLNEEDPCYD 417
           KNKH+EFFRNLYLTFLEYDGNLLR ELFGC+       + +    Q  L  L+E+D CY+
Sbjct: 96  KNKHVEFFRNLYLTFLEYDGNLLRGELFGCSGSPPPAAVRRPAQSQRALDELDEDDVCYE 155

Query: 418 LRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEINPNNVDEYEASSDGN 477
            RR +LT  R HL FL +E  A    +DVTLVAQLS DRL                    
Sbjct: 156 FRRQRLTVHRVHLTFLPHEPPAPRP-HDVTLVAQLSTDRL-------------------- 194

Query: 478 DVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRNIGYH 537
                        QM+E L +HW GP+SL LY+SDAEAQ+FL +A  SE L +RR++ YH
Sbjct: 195 -------------QMLEALCRHWPGPMSLALYLSDAEAQRFLRFAQASEVLAARRDVAYH 241

Query: 538 VVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
           VVY+EG  YP+N LRNVAL Q  TPYVFL DIDFLP + LY YL
Sbjct: 242 VVYREGPLYPVNLLRNVALAQARTPYVFLSDIDFLPAYSLYDYL 285


>gi|298684029|gb|ADI96194.1| LARGE-1 [Mastomys natalensis]
          Length = 671

 Score =  307 bits (786), Expect = 1e-80,   Method: Compositional matrix adjust.
 Identities = 152/284 (53%), Positives = 181/284 (63%), Gaps = 59/284 (20%)

Query: 405 ELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEINP 464
           +LS L+E+D CY+ RR + T  RTHLYFL YE+E S+                       
Sbjct: 369 QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEFEPSA----------------------- 405

Query: 465 NNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGN 524
                     D  DVTLVAQLSMDRLQM+E + KHWEGPISL LY+SDAEAQQFL YA  
Sbjct: 406 ----------DNTDVTLVAQLSMDRLQMLEAICKHWEGPISLALYLSDAEAQQFLRYAQG 455

Query: 525 SEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSS 584
           SE L SR+N+GYH+VYKEG FYP+N LRNVA+  +STPY+FL DIDFLPM+GLY YL+ S
Sbjct: 456 SEVLMSRQNVGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLYEYLRKS 515

Query: 585 IRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------HAPT 620
           +  +D+     KK ++VPAFET RYR +FP S                        HAPT
Sbjct: 516 VIQLDL--ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGHAPT 573

Query: 621 NFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           NF++W  ATTPYQ+EW  DFEPY+V  RD P YD RFVGFGWNK
Sbjct: 574 NFAKWRTATTPYQVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 617



 Score =  254 bits (649), Expect = 9e-65,   Method: Compositional matrix adjust.
 Identities = 110/168 (65%), Positives = 136/168 (80%)

Query: 5   EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
           ++P +  CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A  IL TLF 
Sbjct: 64  QQPAVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 123

Query: 65  TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
           TW VP V V FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ F
Sbjct: 124 TWMVPAVGVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 183

Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 184 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 231



 Score =  252 bits (644), Expect = 4e-64,   Method: Compositional matrix adjust.
 Identities = 115/175 (65%), Positives = 139/175 (79%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+G  FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ FATDI
Sbjct: 128 PAVGVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 187

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NT VILL L KLR + 
Sbjct: 188 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTXVILLLLDKLRKMK 247

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRS 340
           W   WR+ AE+ L+  L TSLADQDIFNA+I ++P+LVY LPC WNVQLS +TRS
Sbjct: 248 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSXHTRS 302


>gi|298684027|gb|ADI96193.1| LARGE-1 [Mastomys natalensis]
          Length = 671

 Score =  307 bits (786), Expect = 1e-80,   Method: Compositional matrix adjust.
 Identities = 152/284 (53%), Positives = 181/284 (63%), Gaps = 59/284 (20%)

Query: 405 ELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEINP 464
           +LS L+E+D CY+ RR + T  RTHLYFL YE+E S+                       
Sbjct: 369 QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEFEPSA----------------------- 405

Query: 465 NNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGN 524
                     D  DVTLVAQLSMDRLQM+E + KHWEGPISL LY+SDAEAQQFL YA  
Sbjct: 406 ----------DNTDVTLVAQLSMDRLQMLEAICKHWEGPISLALYLSDAEAQQFLRYAQG 455

Query: 525 SEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSS 584
           SE L SR+N+GYH+VYKEG FYP+N LRNVA+  +STPY+FL DIDFLPM+GLY YL+ S
Sbjct: 456 SEVLMSRQNVGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLYEYLRKS 515

Query: 585 IRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------HAPT 620
           +  +D+     KK ++VPAFET RYR +FP S                        HAPT
Sbjct: 516 VIQLDL--ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGHAPT 573

Query: 621 NFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           NF++W  ATTPYQ+EW  DFEPY+V  RD P YD RFVGFGWNK
Sbjct: 574 NFAKWRTATTPYQVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 617



 Score =  258 bits (659), Expect = 7e-66,   Method: Compositional matrix adjust.
 Identities = 117/177 (66%), Positives = 141/177 (79%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+G  FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ FATDI
Sbjct: 128 PAVGVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 187

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR + 
Sbjct: 188 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 247

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDE 342
           W   WR+ AE+ L+  L TSLADQDIFNA+I + P+LVY LPC WNVQLSD TRS++
Sbjct: 248 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQXPFLVYQLPCFWNVQLSDXTRSEQ 304



 Score =  254 bits (649), Expect = 9e-65,   Method: Compositional matrix adjust.
 Identities = 110/168 (65%), Positives = 136/168 (80%)

Query: 5   EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
           ++P +  CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A  IL TLF 
Sbjct: 64  QQPAVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 123

Query: 65  TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
           TW VP V V FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ F
Sbjct: 124 TWMVPAVGVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 183

Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 184 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 231


>gi|355698993|gb|AES00982.1| like-glycosyltransferase [Mustela putorius furo]
          Length = 339

 Score =  305 bits (782), Expect = 4e-80,   Method: Compositional matrix adjust.
 Identities = 156/301 (51%), Positives = 186/301 (61%), Gaps = 64/301 (21%)

Query: 392 FGC----NLTQTTLQQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVT 447
           FGC    ++    LQ+ +LS L+E+D CY+ RR + T  RTHLYFL YEYE +S      
Sbjct: 1   FGCPSEADVNSENLQK-QLSELDEDDLCYEFRRERFTVHRTHLYFLHYEYEPAS------ 53

Query: 448 LVAQLSMDRLQMVEINPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLT 507
                                      D  DVTLVAQLSMDRLQM+E + KHWEGPISL 
Sbjct: 54  ---------------------------DNTDVTLVAQLSMDRLQMLEAICKHWEGPISLA 86

Query: 508 LYMSDAEAQQFLSYAGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLL 567
           LY+SDAEAQQFL YA  SE L SR N+ YH+VYKEG FYP+N LRNVA+  V TPY+FL 
Sbjct: 87  LYLSDAEAQQFLRYAQGSEVLMSRHNVAYHIVYKEGQFYPVNLLRNVAMKHVGTPYMFLS 146

Query: 568 DIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS----------- 616
           DIDFLPM+GLY YL+ S+  +D+     KK ++VPAFET RYR +FP S           
Sbjct: 147 DIDFLPMYGLYEYLRKSVIQLDL--ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMG 204

Query: 617 -------------HAPTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWN 663
                        HAPTNF++W  ATTPY++EW  DFEPY+V  RD P YD RFVGFGWN
Sbjct: 205 TLFTFRYHVWTKGHAPTNFAKWRTATTPYRVEWEADFEPYVVVRRDCPEYDRRFVGFGWN 264

Query: 664 K 664
           K
Sbjct: 265 K 265


>gi|298684017|gb|ADI96188.1| LARGE-1 [Mastomys natalensis]
          Length = 671

 Score =  305 bits (780), Expect = 6e-80,   Method: Compositional matrix adjust.
 Identities = 151/282 (53%), Positives = 179/282 (63%), Gaps = 59/282 (20%)

Query: 407 SNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEINPNN 466
           S L+E+D CY+ RR + T  RTHLYFL YE+E S+                         
Sbjct: 371 SELDEDDLCYEFRRERFTVHRTHLYFLHYEFEPSA------------------------- 405

Query: 467 VDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSE 526
                   D  DVTLVAQLSMDRLQM+E + KHWEGPISL LY+SDAEAQQFL YA  SE
Sbjct: 406 --------DNTDVTLVAQLSMDRLQMLEAICKHWEGPISLALYLSDAEAQQFLRYAQGSE 457

Query: 527 ALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIR 586
            L SR+N+GYH+VYKEG FYP+N LRNVA+  +STPY+FL DIDFLPM+GLY YL+ S+ 
Sbjct: 458 VLMSRQNVGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLYEYLRKSVI 517

Query: 587 SMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------HAPTNF 622
            +D+     KK ++VPAFET RYR +FP S                        HAPTNF
Sbjct: 518 QLDL--ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFXYHXXTKGHAPTNF 575

Query: 623 SRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           ++W  ATTPYQ+EW  DFEPY+V  RD P YD RFVGFGWNK
Sbjct: 576 AKWRTATTPYQVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 617



 Score =  263 bits (671), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 119/184 (64%), Positives = 148/184 (80%), Gaps = 1/184 (0%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+G  FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ FATDI
Sbjct: 128 PAVGVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 187

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR + 
Sbjct: 188 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 247

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   WR+ AE+ L+  L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 248 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 306

Query: 346 TELT 349
            +++
Sbjct: 307 RDVS 310



 Score =  254 bits (650), Expect = 9e-65,   Method: Compositional matrix adjust.
 Identities = 110/168 (65%), Positives = 136/168 (80%)

Query: 5   EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
           ++P +  CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A  IL TLF 
Sbjct: 64  QQPAVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 123

Query: 65  TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
           TW VP V V FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ F
Sbjct: 124 TWMVPAVGVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 183

Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 184 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 231


>gi|298684039|gb|ADI96199.1| LARGE-1 [Mastomys erythroleucus]
          Length = 671

 Score =  303 bits (777), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 151/283 (53%), Positives = 179/283 (63%), Gaps = 59/283 (20%)

Query: 406 LSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEINPN 465
           LS L+E+D CY+ RR + T  RTHLYFL YE+E S+                        
Sbjct: 370 LSELDEDDLCYEFRRERFTVHRTHLYFLHYEFEPSA------------------------ 405

Query: 466 NVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNS 525
                    D  DVTLVAQLSMDRLQM+E + KHWEGPISL LY+SDAEAQQFL YA  S
Sbjct: 406 ---------DNTDVTLVAQLSMDRLQMLEAICKHWEGPISLALYLSDAEAQQFLRYAQGS 456

Query: 526 EALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSI 585
           E L SR+N+GYH+VYKEG F P+N LRNVA+  +STPY+FL DIDFLPM+GLY YL+ S+
Sbjct: 457 EVLMSRQNVGYHIVYKEGQFXPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLYEYLRKSV 516

Query: 586 RSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------HAPTN 621
             +D+     KK ++VPAFET RYR +FP S                        HAPTN
Sbjct: 517 IQLDL--ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGHAPTN 574

Query: 622 FSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           F++W  ATTPYQ+EW  DFEPY+V  RD P YD RFVGFGWNK
Sbjct: 575 FAKWRTATTPYQVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 617



 Score =  249 bits (636), Expect = 4e-63,   Method: Compositional matrix adjust.
 Identities = 108/168 (64%), Positives = 134/168 (79%)

Query: 5   EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
           ++P +  CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A  IL TLF 
Sbjct: 64  QQPAVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 123

Query: 65  TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
           TW VP V V FY AD +  +VSWIPNKHYSG+YGL+KL L   LP  L + IVLDTD+ F
Sbjct: 124 TWMVPAVGVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTXTLPANLERVIVLDTDITF 183

Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           ATDIA+LWA+F + + +Q +GLVENQSDWYL  LWKNH+PWPALGRG+
Sbjct: 184 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLXNLWKNHRPWPALGRGY 231



 Score =  240 bits (613), Expect = 1e-60,   Method: Compositional matrix adjust.
 Identities = 111/174 (63%), Positives = 135/174 (77%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+G  FY AD +  +VSWIPNKHYSG+YGL+KL L   LP  L + IVLDTD+ FATDI
Sbjct: 128 PAVGVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTXTLPANLERVIVLDTDITFATDI 187

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LWA+F + + +Q +GLVENQSDWYL  LWKNH+PWPALGRG+NTGVILL L KLR + 
Sbjct: 188 AELWAVFHKFKGQQVLGLVENQSDWYLXNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 247

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTR 339
           W   WR+ AE+ L+  L TSLADQDIFNA+I ++P+LVY LP   NVQLS +TR
Sbjct: 248 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPXXXNVQLSXHTR 301


>gi|298684043|gb|ADI96201.1| LARGE-1 [Mastomys erythroleucus]
          Length = 671

 Score =  300 bits (767), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 149/284 (52%), Positives = 178/284 (62%), Gaps = 59/284 (20%)

Query: 405 ELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEINP 464
           +LS L+E+D CY+ RR + T  RTHLYFL                               
Sbjct: 369 QLSELDEDDLCYEFRRERFTVHRTHLYFL------------------------------- 397

Query: 465 NNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGN 524
                +E  +D  DVTLVAQLSMDRLQM+E + KHWEGPISL LY+SDAEAQQFL YA  
Sbjct: 398 --XXXFEPXADNTDVTLVAQLSMDRLQMLEAICKHWEGPISLALYLSDAEAQQFLRYAQG 455

Query: 525 SEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSS 584
           SE L SR+N+GYH+VYKEG FYP+N LRNVA+  +STPY+FL DIDFLPM+GLY YL+ S
Sbjct: 456 SEVLMSRQNVGYHIVYKEGQFYPVNLLRNVAMKHISTPYMFLSDIDFLPMYGLYEYLRKS 515

Query: 585 IRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------HAPT 620
           +  +D+     KK ++VPAFET RYR +FP S                        HAPT
Sbjct: 516 VIQLDL--ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGHAPT 573

Query: 621 NFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           NF++W  ATTPYQ+EW  DFEPY+V  RD P YD RFVGFGWNK
Sbjct: 574 NFAKWRTATTPYQVEWEADFEPYVVVRRDCPEYDRRFVGFGWNK 617



 Score =  256 bits (655), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 116/177 (65%), Positives = 142/177 (80%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+G  FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ FATDI
Sbjct: 128 PAVGVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 187

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR + 
Sbjct: 188 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 247

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDE 342
           W   WR+ AE+ L+  L TSLADQDIFNA+I ++P+LVY LPC WNVQLS +TRS++
Sbjct: 248 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSXHTRSEQ 304



 Score =  254 bits (649), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 110/168 (65%), Positives = 136/168 (80%)

Query: 5   EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
           ++P +  CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A  IL TLF 
Sbjct: 64  QQPAVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 123

Query: 65  TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
           TW VP V V FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ F
Sbjct: 124 TWMVPAVGVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 183

Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 184 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 231


>gi|298684041|gb|ADI96200.1| LARGE-1 [Mastomys erythroleucus]
          Length = 671

 Score =  295 bits (756), Expect = 4e-77,   Method: Compositional matrix adjust.
 Identities = 147/279 (52%), Positives = 175/279 (62%), Gaps = 59/279 (21%)

Query: 410 NEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEINPNNVDE 469
           +E+D CY+ RR + T  RTHLYFL YE+E S+                            
Sbjct: 374 DEDDLCYEFRRERFTVHRTHLYFLHYEFEPSA---------------------------- 405

Query: 470 YEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALR 529
                D  DVTLVAQLSMDRLQM+E + KHWEGPISL LY+SDAEAQQFL YA  SE L 
Sbjct: 406 -----DNTDVTLVAQLSMDRLQMLEAICKHWEGPISLALYLSDAEAQQFLRYAQGSEVLM 460

Query: 530 SRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMD 589
           SR+N+GYH+VYKEG FYP+N LRNVA+  +STPY+ L DIDFLPM+GLY YL+ S+  +D
Sbjct: 461 SRQNVGYHIVYKEGQFYPVNLLRNVAMKHISTPYMXLSDIDFLPMYGLYEYLRKSVIQLD 520

Query: 590 MHGHGGKKVLVVPAFETQRYRTAFPAS------------------------HAPTNFSRW 625
           +     KK ++VPAFET RYR +FP S                        HAPTNF++W
Sbjct: 521 L--ANTKKAMIVPAFETLRYRLSFPKSKAELLSMLDMGTLFTFRYXVWTKGHAPTNFAKW 578

Query: 626 VNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
             ATTPYQ+EW  DFEPY+V  RD P  D RFVGFGWNK
Sbjct: 579 RTATTPYQVEWEADFEPYVVVRRDCPEXDRRFVGFGWNK 617



 Score =  258 bits (658), Expect = 9e-66,   Method: Compositional matrix adjust.
 Identities = 117/175 (66%), Positives = 141/175 (80%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+G  FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ FATDI
Sbjct: 128 PAVGVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 187

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR + 
Sbjct: 188 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 247

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRS 340
           W   WR+ AE+ L+  L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS
Sbjct: 248 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRS 302



 Score =  254 bits (649), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 110/168 (65%), Positives = 136/168 (80%)

Query: 5   EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
           ++P +  CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A  IL TLF 
Sbjct: 64  QQPAVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 123

Query: 65  TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
           TW VP V V FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ F
Sbjct: 124 TWMVPAVGVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 183

Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 184 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 231


>gi|21751142|dbj|BAC03909.1| unnamed protein product [Homo sapiens]
          Length = 274

 Score =  291 bits (744), Expect = 1e-75,   Method: Compositional matrix adjust.
 Identities = 154/284 (54%), Positives = 188/284 (66%), Gaps = 40/284 (14%)

Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWNSPKKLKV 364
            ++ QDIFNA+I EHP LV  LPC WNVQLSD+T + E CY+E +DLK+IHWNSPKKL+V
Sbjct: 2   GVSAQDIFNAVIKEHPGLVQRLPCVWNVQLSDHTLA-ERCYSEASDLKVIHWNSPKKLRV 60

Query: 365 KNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLT----QTTLQQAELSNLNEEDPCYDLRR 420
           KNKH+EFFRN YLTFLEYDGNLLRRELF C          LQQA L+ L+ EDPC++ R+
Sbjct: 61  KNKHVEFFRNFYLTFLEYDGNLLRRELFVCPSQPPPGAEQLQQA-LAQLDGEDPCFEFRQ 119

Query: 421 SKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEINPNNVDEYEASSDGNDVT 480
            +LT  R H+ FL +E       +DVTLVAQLSMDR                        
Sbjct: 120 QQLTVHRVHVTFLPHEPPPPR-PHDVTLVAQLSMDR------------------------ 154

Query: 481 LVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRNIGYHVVY 540
                    LQM+E L +HW GP+SL LY++DAEAQQFL +   S  L +R+++ YHVVY
Sbjct: 155 ---------LQMLEALCRHWPGPMSLALYLTDAEAQQFLHFVEASPVLAARQDVAYHVVY 205

Query: 541 KEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSS 584
           +EG  YP+N LRNVAL Q  TPYVFL DIDFLP + LY YL+ +
Sbjct: 206 REGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYDYLREA 249


>gi|168985115|emb|CAQ09433.1| like-glycosyltransferase [Homo sapiens]
          Length = 187

 Score =  271 bits (694), Expect = 6e-70,   Method: Compositional matrix adjust.
 Identities = 124/183 (67%), Positives = 153/183 (83%), Gaps = 1/183 (0%)

Query: 181 DVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQT 240
           +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ FATDIA+LWA+F + + +Q 
Sbjct: 6   EVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDIAELWAVFHKFKGQQV 65

Query: 241 IGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLT 300
           +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR + W   WR+ AE+ L+ 
Sbjct: 66  LGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMKWEQMWRLTAERELMG 125

Query: 301 RLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWNSPK 360
            L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY +++DLK+IHWNSPK
Sbjct: 126 MLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CYRDVSDLKVIHWNSPK 184

Query: 361 KLK 363
           KL+
Sbjct: 185 KLR 187



 Score =  147 bits (371), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 63/89 (70%), Positives = 76/89 (85%)

Query: 84  DVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQT 143
           +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ FATDIA+LWA+F + + +Q 
Sbjct: 6   EVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDIAELWAVFHKFKGQQV 65

Query: 144 IGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 66  LGLVENQSDWYLGNLWKNHRPWPALGRGY 94


>gi|198435803|ref|XP_002121835.1| PREDICTED: similar to like-glycosyltransferase [Ciona intestinalis]
          Length = 671

 Score =  270 bits (691), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 176/541 (32%), Positives = 264/541 (48%), Gaps = 66/541 (12%)

Query: 134 LFSRLRQRQTIGLVENQSDWYLGKLWKNHKPW--PALGRGFYLADSVVEDVSWIPNKHYS 191
           LF R        LV+N S   L +L+++   W  P L    Y   +V++DVSW+P  H+S
Sbjct: 85  LFHRKDSLHFHYLVDNVSKPILKELFRS---WDIPDLKVSMYEDIAVLDDVSWLPFTHHS 141

Query: 192 GVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWY 251
           G+  + KL + KVLP  + K IVLD+D++FATDIA+LW  F  + Q+Q  G+VENQSDWY
Sbjct: 142 GINSVYKLAILKVLPLYIDKVIVLDSDMVFATDIAELWLQFRHMDQQQAFGMVENQSDWY 201

Query: 252 LGKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDI 311
           LG L   +  WPA+GRG N+G++LLD  KLR  +W   W+  A++ +       LADQD+
Sbjct: 202 LGTLKFEYVVWPAIGRGLNSGMMLLDCEKLRRANWDSEWKETAQQGIKRFKTVPLADQDV 261

Query: 312 FNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELT-DLKIIHWNSPKKLKVKNKHME 370
            N  +  + +++Y L C WN QL    R  ELCY +   ++KI+HWN+ +K  + +  ++
Sbjct: 262 INLFLVNNKHMLYKLECNWNFQLP-YERKMELCYNDHKHNVKIVHWNNVRKHDINSDQIQ 320

Query: 371 FFRNLYLTFLEYDGNLLRRELFGCNLTQTTLQQAELSNLNEEDPCYDLRRSKLTSLRTHL 430
              NL+  FLE +   LR +  GC         A+ +   E + C        +  RTH+
Sbjct: 321 GLTNLFTIFLEMNSMSLRHQAPGCTQRGEVKLYADDNLTKETEICDKFTAQASSRHRTHI 380

Query: 431 YFLEYEYEASSDGNDVTLVAQLSMDRLQMVEINPNNVDEYEASSDGNDVTLVAQLSMDRL 490
           Y+L ++Y+ + D  D+TL                                 V  L +  +
Sbjct: 381 YYLPFKYQ-NKDSRDITL---------------------------------VTHLQLKDM 406

Query: 491 QMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRNIGYHVVYKEGN--FYPI 548
           Q  E L K W GPIS+ ++++D E  Q   +  +S+ L+ R NIGYH+V+   N    PI
Sbjct: 407 QTFETLCKTWTGPISVAIFVADRETDQLEHFIDSSDILKQRDNIGYHLVFNGVNEQVPPI 466

Query: 549 NTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQR 608
           + L N + +QV T Y+ + D+ +     LY  L +     D+     + V V+P FET  
Sbjct: 467 SYLLNTSASQVLTEYLLIYDVKYQQAQQLYDILYNYFSKFDV--LSLESVYVLPVFETTS 524

Query: 609 Y-------------------RTAFPASHAPTNFS--RWVNATTPYQIEWAPDFEPYIVAH 647
                                  +       +F   RW  ATTPY  +W     P I A 
Sbjct: 525 NVLDSTPNTKVSIVKQIAVGEIKYSNDQRANSFDIVRWATATTPYLTDWRNIGTPIIFAL 584

Query: 648 R 648
           R
Sbjct: 585 R 585



 Score =  182 bits (463), Expect = 4e-43,   Method: Compositional matrix adjust.
 Identities = 82/160 (51%), Positives = 114/160 (71%)

Query: 12  CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQV 71
           CE I V +VC GY + + +V L+KSILF+RK+ LHFH + D V+  IL+ LF +W +P +
Sbjct: 59  CETIHVFLVCTGYKTVKDMVVLVKSILFHRKDSLHFHYLVDNVSKPILKELFRSWDIPDL 118

Query: 72  EVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQL 131
           +VS Y   +V++DVSW+P  H+SG+  + KL + KVLP  + K IVLD+D++FATDIA+L
Sbjct: 119 KVSMYEDIAVLDDVSWLPFTHHSGINSVYKLAILKVLPLYIDKVIVLDSDMVFATDIAEL 178

Query: 132 WALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRG 171
           W  F  + Q+Q  G+VENQSDWYLG L   +  WPA+GRG
Sbjct: 179 WLQFRHMDQQQAFGMVENQSDWYLGTLKFEYVVWPAIGRG 218


>gi|157823495|ref|NP_001101909.1| glycosyltransferase-like protein LARGE1 [Rattus norvegicus]
 gi|149032476|gb|EDL87367.1| like-glycosyltransferase (predicted), isoform CRA_a [Rattus
           norvegicus]
          Length = 385

 Score =  264 bits (674), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 121/187 (64%), Positives = 150/187 (80%), Gaps = 1/187 (0%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ FATDI
Sbjct: 192 PAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 251

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR + 
Sbjct: 252 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 311

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W   WR+ AE+ L+  L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY
Sbjct: 312 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CY 370

Query: 346 TELTDLK 352
            +++DLK
Sbjct: 371 RDVSDLK 377



 Score =  254 bits (650), Expect = 7e-65,   Method: Compositional matrix adjust.
 Identities = 110/168 (65%), Positives = 136/168 (80%)

Query: 5   EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
           ++P +  CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A  IL TLF 
Sbjct: 128 QQPAVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 187

Query: 65  TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
           TW VP V V FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ F
Sbjct: 188 TWMVPAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 247

Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 248 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 295


>gi|392927212|ref|NP_509833.3| Protein LGE-1 [Caenorhabditis elegans]
 gi|92058691|sp|Q21389.3|LGE1_CAEEL RecName: Full=Glycosyltransferase-like protein LARGE
 gi|255068767|emb|CAA91997.4| Protein LGE-1 [Caenorhabditis elegans]
          Length = 631

 Score =  261 bits (668), Expect = 7e-67,   Method: Compositional matrix adjust.
 Identities = 176/522 (33%), Positives = 252/522 (48%), Gaps = 108/522 (20%)

Query: 182 VSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTI 241
           V+WIPN HYS  YGL KL +P+++   + K + +D D+IF T+I  LW  F      Q  
Sbjct: 113 VAWIPNSHYSKYYGLSKLLIPEIIGNDIGKIMFMDVDIIFQTNIFDLWKQFRNFNNSQVF 172

Query: 242 GLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTR 301
           G+VEN SDWYL K  K    WPALGRGFNTG+I+ DL KLR   WA  WR++A K+L   
Sbjct: 173 GMVENLSDWYLNKDGKK-SVWPALGRGFNTGIIMFDLDKLRKNGWASKWRVVANKYLRIH 231

Query: 302 LWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELC-YTELTDLKIIHWNSPK 360
             T+++DQDIFNA I ++P  +  +PC +N QL   T+S ELC  T L     +H+NS  
Sbjct: 232 GKTAMSDQDIFNAYIHDYPTEIIQIPCAYNYQLGALTKSKELCPETPLA----LHFNSQN 287

Query: 361 KLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQTTLQQAELSNLNEEDPCYDLRR 420
           K   KN    FF  +   F E DG+ L+R            ++      N++D C+    
Sbjct: 288 KTVGKN--YAFFDKIRKAFDEMDGSDLKR------------RRRSFKGNNQKDICH---- 329

Query: 421 SKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEINPNNVDEYEASSDGNDVT 480
                        EY                L +D  +++   PN +      +   ++ 
Sbjct: 330 -------------EY----------------LPLDNFRII---PNAIGRMTKPA---ELC 354

Query: 481 LVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRNIGYHVVY 540
           +V Q S DRL         W  PIS  +Y  D +    L  A     L +R +I  H+V+
Sbjct: 355 MVTQFSKDRLNHFLESANAWRHPISTAVYGKDKD---LLDIAKAVTEL-NRTDITIHLVF 410

Query: 541 KE-------GNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGH 593
           +E        + YPIN LRNVA+   +  Y+ + D+DF+ + G Y  +      +D  G+
Sbjct: 411 EEPTESWMLDSLYPINFLRNVAIEHANCKYILMTDVDFV-VLGDYGTI------IDQTGN 463

Query: 594 -GGKKVLVVPAFET----------------------------QRYR-TAFPASHAPTNFS 623
              K+VLV+PA E                             Q +R T +P+SH PTN S
Sbjct: 464 LKQKEVLVIPALEMTYPQLRLNLSNFLSRKDLVIEHLLNKTIQTFRETIWPSSHVPTNIS 523

Query: 624 RWVNATTPYQIEWAPDFEPYIVAHR-DLPRYDTRFVGFGWNK 664
           +W+ +   Y + +  ++EPY V  + + P YD RF GFGWNK
Sbjct: 524 KWIKSNRTYMVNYEKNYEPYFVIKKEECPFYDQRFGGFGWNK 565



 Score =  120 bits (300), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 63/166 (37%), Positives = 90/166 (54%), Gaps = 2/166 (1%)

Query: 8   VIPTCEVIQVAIVCAGYNSTRSL-VTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTW 66
             P  + I++A +  G   TR + +   KS+L Y  +    HLITD    + +  L ++W
Sbjct: 35  AFPEEDYIRLAYIIGGNFMTRLMFMQHFKSVLKYSDHFFRLHLITDENHRSDIHELMTSW 94

Query: 67  SVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 126
           ++   E  F+      + V+WIPN HYS  YGL KL +P+++   + K + +D D+IF T
Sbjct: 95  NISNCEWFFHNLTEFEKRVAWIPNSHYSKYYGLSKLLIPEIIGNDIGKIMFMDVDIIFQT 154

Query: 127 DIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           +I  LW  F      Q  G+VEN SDWYL K  K    WPALGRGF
Sbjct: 155 NIFDLWKQFRNFNNSQVFGMVENLSDWYLNKDGKK-SVWPALGRGF 199


>gi|167516064|ref|XP_001742373.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163778997|gb|EDQ92611.1| predicted protein [Monosiga brevicollis MX1]
          Length = 716

 Score =  255 bits (652), Expect = 5e-65,   Method: Compositional matrix adjust.
 Identities = 168/534 (31%), Positives = 254/534 (47%), Gaps = 72/534 (13%)

Query: 165 WPALGRGFYL--ADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFA 222
           WP  G  ++   AD+    V+WIP  H +G    LKL  P++LP+ +A+ +V+D D++  
Sbjct: 115 WPLAGVEWHAHPADAGQARVAWIPTLHKAGWVAHLKLVYPQLLPQ-VAQLVVIDLDLLVQ 173

Query: 223 TDIAQLWALFSRLRQRQTIGLVENQSDWYLGKL--WKNHKPWPALGRGFNTGVILLDLTK 280
            D+ QL  + +    +     V  QSDWYLG L   + +  WPA+ RG NTGV++L L +
Sbjct: 174 ADLCQLAGILAD-HPKALFAAVPQQSDWYLGTLPDTRENVVWPAIERGLNTGVMVLHLER 232

Query: 281 LRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRS 340
           +R   W   W+ +  + LL R  T LADQD++N +  EHP     L C WNVQL D++ +
Sbjct: 233 MRAWGWDQRWQSLVRRELLVRHHTGLADQDVYNLVALEHPETWTMLSCAWNVQLHDHSVT 292

Query: 341 DELCYTE--LTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQ 398
                    L + +++HWNSPKK         + R+L      + G              
Sbjct: 293 QHCDQPSHLLANARVLHWNSPKKYDAAFAEATYVRSLLAAAEAHSG-------------- 338

Query: 399 TTLQQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQ 458
                A  +      P YD   S L+             + S  G D    A    D ++
Sbjct: 339 ----LAASAQFTLRCPGYDAGGSSLS-------LAWAAVDTSWAGEDPAARA----DIVE 383

Query: 459 MVEINPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAE---A 515
              ++ NN     A   G+ +TLV  L+ DRL  ++     W+GP+   +Y++DA+    
Sbjct: 384 GGHLDENNPAATRAP--GSRITLVTHLTPDRLAALQQTLHQWDGPVHAAVYLADADLWRL 441

Query: 516 QQFLSYAGNSEALRSRRNIGYHVVYKEGN-FYPINTLRNVALNQVSTPYVFLLDIDFLPM 574
           +QFL      EA+    N+  HV +++G   YP+N LRN+ L+ V+T +VF+ D+D  P+
Sbjct: 442 RQFLEQVATLEAI----NLTLHVAFEDGGRRYPVNHLRNLLLDHVTTSHVFMADVDLCPL 497

Query: 575 FGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRYRT----------------------- 611
            G   +L+ ++ S    G   +  LVVPAFE   YR                        
Sbjct: 498 AGSRAHLEQAL-SASATGSAQRVALVVPAFEYVGYRAPTFLSLTTVQDLDLADIRPFRVD 556

Query: 612 AFPASHAPTNFSRWVNATTPYQIEWAPDFEPYIVAH-RDLPRYDTRFVGFGWNK 664
            +P  H PTN++R  +A+  YQ+ WAP +EPY+V      PRY     GFGWNK
Sbjct: 557 VWPRGHGPTNYTRLQSASDYYQVSWAPGYEPYVVLRTATAPRYPEALAGFGWNK 610



 Score = 87.8 bits (216), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 52/174 (29%), Positives = 86/174 (49%), Gaps = 11/174 (6%)

Query: 1   MSSDEEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQ 60
           +S  E+P       + V ++  G  +   L  ++K ++  R  PL  H++ D      + 
Sbjct: 57  VSCSEQP-------LHVGMIALGDQAPSELYRVLKDVVARRSCPLVLHVVCDPTTCEAVA 109

Query: 61  TLFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDT 120
            L  TW +  VE   + AD+    V+WIP  H +G    LKL  P++LP+ +A+ +V+D 
Sbjct: 110 ALKQTWPLAGVEWHAHPADAGQARVAWIPTLHKAGWVAHLKLVYPQLLPQ-VAQLVVIDL 168

Query: 121 DVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKL--WKNHKPWPALGRGF 172
           D++   D+ QL  + +    +     V  QSDWYLG L   + +  WPA+ RG 
Sbjct: 169 DLLVQADLCQLAGILAD-HPKALFAAVPQQSDWYLGTLPDTRENVVWPAIERGL 221


>gi|341903546|gb|EGT59481.1| hypothetical protein CAEBREN_08278 [Caenorhabditis brenneri]
          Length = 643

 Score =  255 bits (651), Expect = 6e-65,   Method: Compositional matrix adjust.
 Identities = 171/521 (32%), Positives = 259/521 (49%), Gaps = 94/521 (18%)

Query: 180 EDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQ 239
           + V WIPN HYS  YGL KL +P+++ + + K + +D D++F +DI  LW  F     +Q
Sbjct: 115 KKVKWIPNSHYSKYYGLTKLLIPEIMSDNVGKVMYVDIDIVFRSDIFYLWQKFRDFNNQQ 174

Query: 240 TIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLL 299
           +IG++EN SDWYL K  K    W A GRGFN+G+I+ DL KLR+ ++   WR +A K+L 
Sbjct: 175 SIGMIENLSDWYLNKDGKK-SVWTARGRGFNSGIIMFDLDKLREENFKEKWRSVANKYLK 233

Query: 300 TRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY-TELTDLKIIHWNS 358
               T+++DQDIFNA + ++P  + TLPC++N QL +  +S+ELC+ T L     +H+NS
Sbjct: 234 IHGKTTMSDQDIFNAYVHDYPNQIKTLPCEYNFQLGNLAKSEELCHETPLA----LHFNS 289

Query: 359 PKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQTTLQQAELSNLNEEDPCYDL 418
             K   KN         Y  +     N +R+E+ G + +                   DL
Sbjct: 290 QNKTVRKN---------YALY-----NAIRKEIDGIDGS-------------------DL 316

Query: 419 RRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEINPNNVDEYEASSDGND 478
           RR K  S    +  +E  Y   +     + V       LQ   I PN +   E ++    
Sbjct: 317 RRRKRRSFDA-MQKMEETYGMRNRNECSSYVP------LQNFRILPNALGRLEKTAH--- 366

Query: 479 VTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRNIGYHV 538
           + LV Q S+DRL       K W  PIS  +Y +D E    LS   ++    +R +I  H+
Sbjct: 367 LCLVTQFSIDRLHNFLENVKTWMHPISAAIYGNDEE----LSIIVDAIKALNRTDITIHM 422

Query: 539 VYKEGNF------YPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHG 592
           V++E +       YPIN LRN A+N  +  ++ + D+DF+ ++G    L+  +  M    
Sbjct: 423 VFREPDHQIISQTYPINFLRNFAMNHSNCEHILMADVDFM-IYGDDSSLQEQVMKM---- 477

Query: 593 HGGKKVLVVPAFET----------------QRYRTAF------------PASHAPTNFSR 624
              K +L++PAFET                +  R  F            P +H  TN + 
Sbjct: 478 -KEKDLLIIPAFETSDENITDVSKFPQTKEKVARGIFSGKVRIFREVIWPNAHNATNVAE 536

Query: 625 WVNATTPYQIEWAPDFEPYIVAHR-DLPRYDTRFVGFGWNK 664
           W+ A   Y +E+  ++EPY V  + + P YD RF GFGWNK
Sbjct: 537 WIIAEDTYTVEYGKNYEPYFVIRKSECPIYDQRFGGFGWNK 577



 Score =  112 bits (280), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 59/166 (35%), Positives = 88/166 (53%), Gaps = 2/166 (1%)

Query: 8   VIPTCEVIQVAIVCAGYNSTRSL-VTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTW 66
             P  + + +  +  G   TR L +   KS+L Y  +    H I D      +  L ++W
Sbjct: 39  AFPEEDHVSLVYILGGNFMTRLLFMQHFKSVLNYSDSNFRLHFIVDYENKEDVNDLMTSW 98

Query: 67  SVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 126
           +V  VE  F+   +  + V WIPN HYS  YGL KL +P+++ + + K + +D D++F +
Sbjct: 99  NVSNVEWFFHNLTNYEKKVKWIPNSHYSKYYGLTKLLIPEIMSDNVGKVMYVDIDIVFRS 158

Query: 127 DIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           DI  LW  F     +Q+IG++EN SDWYL K  K    W A GRGF
Sbjct: 159 DIFYLWQKFRDFNNQQSIGMIENLSDWYLNKDGKK-SVWTARGRGF 203


>gi|149032477|gb|EDL87368.1| like-glycosyltransferase (predicted), isoform CRA_b [Rattus
           norvegicus]
          Length = 328

 Score =  255 bits (651), Expect = 7e-65,   Method: Compositional matrix adjust.
 Identities = 110/168 (65%), Positives = 136/168 (80%)

Query: 5   EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
           ++P +  CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A  IL TLF 
Sbjct: 128 QQPAVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 187

Query: 65  TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
           TW VP V V FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ F
Sbjct: 188 TWMVPAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 247

Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 248 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 295



 Score =  166 bits (420), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 72/107 (67%), Positives = 87/107 (81%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ FATDI
Sbjct: 192 PAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 251

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTG 272
           A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTG
Sbjct: 252 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTG 298


>gi|343958624|dbj|BAK63167.1| glycosyltransferase-like protein LARGE1 [Pan troglodytes]
          Length = 286

 Score =  252 bits (643), Expect = 5e-64,   Method: Compositional matrix adjust.
 Identities = 118/201 (58%), Positives = 140/201 (69%), Gaps = 26/201 (12%)

Query: 488 DRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRNIGYHVVYKEGNFYP 547
            RLQM+E + KHWEGPISL LY+SDAEAQQFL YA  SE L SR N+GYH+VYKEG FYP
Sbjct: 13  SRLQMLEAICKHWEGPISLALYLSDAEAQQFLRYAQGSEVLMSRHNVGYHIVYKEGQFYP 72

Query: 548 INTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQ 607
           +N LRNVA+  +STPY+FL DIDFLPM+GLY YL+ S+  +D+     KK ++VPAFET 
Sbjct: 73  VNLLRNVAMKHISTPYMFLSDIDFLPMYGLYEYLRKSVIQLDL--ANTKKAMIVPAFETL 130

Query: 608 RYRTAFPAS------------------------HAPTNFSRWVNATTPYQIEWAPDFEPY 643
           RYR +FP S                        HAPTNF++W  ATTPY++EW  DFEPY
Sbjct: 131 RYRLSFPKSKAELLSMLDMGTLFTFRYHVWTKGHAPTNFAKWRTATTPYRVEWEADFEPY 190

Query: 644 IVAHRDLPRYDTRFVGFGWNK 664
           +V  RD P YD RFVGFGWNK
Sbjct: 191 VVVRRDCPEYDRRFVGFGWNK 211


>gi|298684025|gb|ADI96192.1| LARGE-1 [Mastomys natalensis]
          Length = 671

 Score =  251 bits (640), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 109/168 (64%), Positives = 135/168 (80%)

Query: 5   EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
           ++P +  CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A  IL TLF 
Sbjct: 64  QQPAVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 123

Query: 65  TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
           TW VP V V FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ F
Sbjct: 124 TWMVPAVGVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 183

Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALG G+
Sbjct: 184 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGXGY 231



 Score =  182 bits (463), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 82/127 (64%), Positives = 99/127 (77%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+G  FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ FATDI
Sbjct: 128 PAVGVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 187

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALG G+NTGVILL L KLR + 
Sbjct: 188 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGXGYNTGVILLLLDKLRKMK 247

Query: 286 WAGFWRI 292
           W    R+
Sbjct: 248 WEQMXRL 254



 Score =  107 bits (267), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 52/109 (47%), Positives = 63/109 (57%), Gaps = 26/109 (23%)

Query: 580 YLKSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS----------------------- 616
           YL+ S+  +D+     KK ++VPAFET RY  +FP S                       
Sbjct: 511 YLRKSVIQLDLAN--TKKAMIVPAFETLRYGLSFPKSKAELLSMLDMGTLFTFRYHVWTK 568

Query: 617 -HAPTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
            HAPTNF++W  ATTPYQ+EW  DFEPY+V  RD P YD RFVGFGWNK
Sbjct: 569 GHAPTNFAKWRTATTPYQVEWEADFEPYVVXRRDCPEYDRRFVGFGWNK 617


>gi|298684011|gb|ADI96185.1| LARGE-1 [Mastomys kollmannspergeri]
          Length = 302

 Score =  250 bits (638), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 114/175 (65%), Positives = 138/175 (78%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+G  FY AD +  +VSW PNKHYSG+YGL+KL L K LP  L +  VLDTD+ FATDI
Sbjct: 128 PAVGVDFYNADELKSEVSWNPNKHYSGIYGLMKLVLTKTLPANLERVTVLDTDITFATDI 187

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
           A+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+NTGVILL L KLR + 
Sbjct: 188 AELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGYNTGVILLLLDKLRKMK 247

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRS 340
           W   WR+ AE+ L+  L TSLADQDIFNA+I ++P+L Y LPC WNVQLSD+TRS
Sbjct: 248 WEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLXYQLPCFWNVQLSDHTRS 302



 Score =  249 bits (636), Expect = 4e-63,   Method: Compositional matrix adjust.
 Identities = 108/168 (64%), Positives = 134/168 (79%)

Query: 5   EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
           ++P +  CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A  IL TLF 
Sbjct: 64  QQPAVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 123

Query: 65  TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
           TW VP V V FY AD +  +VSW PNKHYSG+YGL+KL L K LP  L +  VLDTD+ F
Sbjct: 124 TWMVPAVGVDFYNADELKSEVSWNPNKHYSGIYGLMKLVLTKTLPANLERVTVLDTDITF 183

Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           ATDIA+LWA+F + + +Q +GLVENQSDWYLG LWKNH+PWPALGRG+
Sbjct: 184 ATDIAELWAVFHKFKGQQVLGLVENQSDWYLGNLWKNHRPWPALGRGY 231


>gi|355694037|gb|AER99534.1| glycosyltransferase-like 1B [Mustela putorius furo]
          Length = 282

 Score =  233 bits (595), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 113/205 (55%), Positives = 138/205 (67%), Gaps = 25/205 (12%)

Query: 484 QLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRNIGYHVVYKEG 543
           QLSMDRLQM+E L +HW GP+SL LY++DAEAQQFL +   S  L +R+N+ YHVVY+EG
Sbjct: 1   QLSMDRLQMLEALCRHWPGPMSLALYLTDAEAQQFLRFVEASAVLSARQNVAYHVVYREG 60

Query: 544 NFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPA 603
             YP+N LRNVAL Q  TPYVFL DIDFLP + LY YL++SI  + + G   K  LVVPA
Sbjct: 61  PLYPVNQLRNVALAQSLTPYVFLSDIDFLPAYSLYDYLRASIEQLKL-GSERKAALVVPA 119

Query: 604 FETQRYRTAFPAS------------------------HAPTNFSRWVNATTPYQIEWAPD 639
           FET  YR  FP+S                        HAPT+++RW  A TPY+++WA D
Sbjct: 120 FETLHYRFRFPSSKAELLALLDSGSLYTFRYHEWPRGHAPTDYARWREAQTPYRVQWAAD 179

Query: 640 FEPYIVAHRDLPRYDTRFVGFGWNK 664
           +EPY+V  RD PRYD RFVGFGWNK
Sbjct: 180 YEPYVVVPRDCPRYDPRFVGFGWNK 204


>gi|332019413|gb|EGI59897.1| Glycosyltransferase-like protein LARGE1 [Acromyrmex echinatior]
          Length = 636

 Score =  232 bits (592), Expect = 4e-58,   Method: Compositional matrix adjust.
 Identities = 121/281 (43%), Positives = 157/281 (55%), Gaps = 59/281 (20%)

Query: 408 NLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEINPNNV 467
           N NE +PC +  +      RTH + LEYEY   S                          
Sbjct: 313 NYNESNPCREFEKGANILYRTHPFLLEYEYNVYSPM------------------------ 348

Query: 468 DEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEA 527
                     DV LV Q S++R+ ++E L KHW G IS+ LY++DAE Q FL++   S  
Sbjct: 349 ----------DVALVTQCSIERIPLLEALSKHWPGTISVALYLTDAEVQNFLNFVRGSLD 398

Query: 528 LRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRS 587
           LR+RRNI YHVVYK+G+ YPIN LRN A++ V TP+VF LD+DFLP FGLY  + + I  
Sbjct: 399 LRTRRNIAYHVVYKDGDLYPINYLRNTAISYVPTPFVFQLDVDFLPQFGLYENIMNYINR 458

Query: 588 MDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------HAPTNFS 623
           ++++    K  L+VPAFET+RYR  FPA+                        HA TN+S
Sbjct: 459 LNIN-ESDKVALIVPAFETERYRFTFPANKDELLKFLKRGILYTFRYHVWTQGHAATNYS 517

Query: 624 RWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
            W N+T PYQ+ W PDFEPYI+  +  PRYDTRF+GFGWNK
Sbjct: 518 FWRNSTDPYQVSWEPDFEPYIIVSKSAPRYDTRFIGFGWNK 558



 Score =  201 bits (510), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 96/161 (59%), Positives = 115/161 (71%), Gaps = 2/161 (1%)

Query: 12  CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQV 71
           C+ I VA+VCAGYNST +LVT++KSILFYR NPLHFHL+ D +A   L TLF TW +P  
Sbjct: 73  CDTIHVAMVCAGYNSTFALVTVVKSILFYRTNPLHFHLLVDEIARKTLSTLFRTWDLPHA 132

Query: 72  EVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQL 131
            V+FY A+  V  VSWIPNKHYSGVYGLLKL LP  + E   K +VLDTDV   TD+  L
Sbjct: 133 NVTFYEAEIWVPRVSWIPNKHYSGVYGLLKLILPDAMRED--KVLVLDTDVTVLTDMHPL 190

Query: 132 WALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
           W +F R    + +GLVENQS+WY+  L    +PWPALGRGF
Sbjct: 191 WKMFERFSVDEALGLVENQSNWYVKALSYGQRPWPALGRGF 231



 Score =  192 bits (487), Expect = 7e-46,   Method: Compositional matrix adjust.
 Identities = 103/221 (46%), Positives = 135/221 (61%), Gaps = 6/221 (2%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           P     FY A+  V  VSWIPNKHYSGVYGLLKL LP  + E   K +VLDTDV   TD+
Sbjct: 130 PHANVTFYEAEIWVPRVSWIPNKHYSGVYGLLKLILPDAMRED--KVLVLDTDVTVLTDM 187

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
             LW +F R    + +GLVENQS+WY+  L    +PWPALGRGFNTGV+L+ L +LRD  
Sbjct: 188 HPLWKMFERFSVDEALGLVENQSNWYVKALSYGQRPWPALGRGFNTGVMLMHLRRLRDRK 247

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           ++  W  + ++ L     TSLADQD+ NA+I+E P +VY + C WN+QLSD+T SD +CY
Sbjct: 248 FSRSWENVTKRVLGHIPETSLADQDVINAVINEQPSIVYKIECTWNIQLSDHTLSD-ICY 306

Query: 346 TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNL 386
               D   I++N     +   K        +   LEY+ N+
Sbjct: 307 ---RDTNRINYNESNPCREFEKGANILYRTHPFLLEYEYNV 344


>gi|349805191|gb|AEQ18068.1| putative glycosyltransferase isoform 4 [Hymenochirus curtipes]
          Length = 279

 Score =  229 bits (585), Expect = 3e-57,   Method: Compositional matrix adjust.
 Identities = 113/187 (60%), Positives = 132/187 (70%), Gaps = 20/187 (10%)

Query: 478 DVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRNIGYH 537
           DVTLVAQLSMDRLQM+E++ +HWEGP+SL LY+SDAEAQQFL YA  SE L+SR N+GYH
Sbjct: 56  DVTLVAQLSMDRLQMLELICRHWEGPMSLALYLSDAEAQQFLRYAQASEVLQSRTNVGYH 115

Query: 538 VVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKK 597
           VVYKEG  YP+N LRNVAL    TPYVF  DIDFLPM+GLY                   
Sbjct: 116 VVYKEGQLYPVNLLRNVALKNAQTPYVF-SDIDFLPMYGLY------------------S 156

Query: 598 VLVVPAFETQRYRTAFPASHAPTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRF 657
           +L + A  T RY   +   HAPTN+++W  A TPY +EWAPDFEPY+V  RD P YD RF
Sbjct: 157 MLDMGALYTFRYHV-WEKGHAPTNYAKWRTAVTPYSVEWAPDFEPYVVVKRDCPEYDQRF 215

Query: 658 VGFGWNK 664
           +GFGWNK
Sbjct: 216 LGFGWNK 222



 Score = 57.4 bits (137), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 24/34 (70%), Positives = 30/34 (88%), Gaps = 1/34 (2%)

Query: 326 LPCQWNVQLSDNTRSDELCYTELTDLKIIHWNSP 359
           LPC WNVQLSD+TRS++ CY ++ DLK+IHWNSP
Sbjct: 22  LPCFWNVQLSDHTRSEQ-CYRDVADLKVIHWNSP 54



 Score = 47.0 bits (110), Expect = 0.030,   Method: Compositional matrix adjust.
 Identities = 18/20 (90%), Positives = 18/20 (90%)

Query: 148 ENQSDWYLGKLWKNHKPWPA 167
           E QSDWYLG LWKNHKPWPA
Sbjct: 1   EKQSDWYLGNLWKNHKPWPA 20


>gi|168985117|emb|CAQ09435.1| like-glycosyltransferase [Homo sapiens]
          Length = 183

 Score =  226 bits (577), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 110/162 (67%), Positives = 134/162 (82%), Gaps = 6/162 (3%)

Query: 298 LLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
           L+  L TSLADQDIFNA+I ++P+LVY LPC WNVQLSD+TRS++ CY +++DLK+IHWN
Sbjct: 1   LMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNVQLSDHTRSEQ-CYRDVSDLKVIHWN 59

Query: 358 SPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGC----NLTQTTLQQAELSNLNEED 413
           SPKKL+VKNKH+EFFRNLYLTFLEYDGNLLRRELFGC    ++    LQ+ +LS L+E+D
Sbjct: 60  SPKKLRVKNKHVEFFRNLYLTFLEYDGNLLRRELFGCPSEADVNSENLQK-QLSELDEDD 118

Query: 414 PCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMD 455
            CY+ RR + T  RTHLYFL YEYE ++D  DVTLVAQLSMD
Sbjct: 119 LCYEFRRERFTVHRTHLYFLHYEYEPAADSTDVTLVAQLSMD 160


>gi|350645709|emb|CCD59684.1| glycosyltransferase-related [Schistosoma mansoni]
          Length = 757

 Score =  226 bits (577), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 145/423 (34%), Positives = 210/423 (49%), Gaps = 76/423 (17%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           P     FY  +  +  VSWIP+ HYSG++GL KL +P++LPE++ K I LD D++  +D+
Sbjct: 273 PNFNVYFYQLELYLHKVSWIPSTHYSGLFGLSKLLIPEILPESVEKVINLDVDLLVNSDL 332

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
            +LWA F +    Q IGLV NQS WYL K   N   WPA G GFNTGV+LLDL +LR+  
Sbjct: 333 LELWAHFQKFNTSQMIGLVANQSPWYLRK--TNRIVWPAWGPGFNTGVMLLDLVRLRESH 390

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W+  W    +  L    ++ LADQDI NA + E P ++Y LPC+WNVQL+ N  + +LC 
Sbjct: 391 WSEHWYKTTKSALQHIPYSMLADQDIINAALVEIPSIIYQLPCEWNVQLTSNM-NIKLCA 449

Query: 346 TELTD---------------------LKIIHWNSPKK----------------LKVKNKH 368
              ++                     LKI H+N P K                L++K + 
Sbjct: 450 IIWSNDYSIHNVQSFHSNPFESFNYHLKIAHFNHPIKADMITLNEIIDDDQLRLRLKRQF 509

Query: 369 MEFFRNLYLTFLEYDGNLLRR--------ELFGCNLTQTTLQQAELSNLNEEDPCYDLRR 420
           M  +R     +  +DG +++         E +  +     + Q    N    D C +   
Sbjct: 510 MRMYR----YYQNFDGTIVQSPGKLSSCSEYYNPSTFNRKINQTSSKNTITIDGCEEFSE 565

Query: 421 SKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEINPNNVDEYEASSDGNDVT 480
             L + R H Y+ +Y +            +  +M   + V IN  N            V+
Sbjct: 566 DLLKTHRIHPYYYKYNF------------STFTMKDKENV-INMKN-----------HVS 601

Query: 481 LVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRNIGYHVVY 540
           LV+QL+ DRL  +E L   W GP+SL LY++D EA   + Y  NS  L++R NIGYH+VY
Sbjct: 602 LVSQLTFDRLHRLEELAIRWLGPMSLALYLTDREATLLIEYITNSRILQNRTNIGYHIVY 661

Query: 541 KEG 543
            +G
Sbjct: 662 VDG 664



 Score =  133 bits (334), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 76/198 (38%), Positives = 101/198 (51%), Gaps = 42/198 (21%)

Query: 15  IQVAIVCAGYNSTRSLVTLIKSILFYRK-------------------------------- 42
           I +++V  G  + R LVTLIKSIL  RK                                
Sbjct: 179 IDISLVIGGAQAARGLVTLIKSILLQRKWERSVEFSKFPRPATLKSTVTFNGKETIMDNV 238

Query: 43  --------NPLHFHLITDTVALNILQTLFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYS 94
                     ++ HLI+D  A   L TLF TW++P   V FY  +  +  VSWIP+ HYS
Sbjct: 239 NKHNECYIRHINLHLISDYHAFKSLSTLFDTWNIPNFNVYFYQLELYLHKVSWIPSTHYS 298

Query: 95  GVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWY 154
           G++GL KL +P++LPE++ K I LD D++  +D+ +LWA F +    Q IGLV NQS WY
Sbjct: 299 GLFGLSKLLIPEILPESVEKVINLDVDLLVNSDLLELWAHFQKFNTSQMIGLVANQSPWY 358

Query: 155 LGKLWKNHKPWPALGRGF 172
           L K   N   WPA G GF
Sbjct: 359 LRK--TNRIVWPAWGPGF 374



 Score = 45.4 bits (106), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 14/31 (45%), Positives = 22/31 (70%)

Query: 634 IEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           ++W+ D+EPY++  R    +D  F+GFGWNK
Sbjct: 665 VQWSADYEPYVIVSRHTTMFDEIFIGFGWNK 695


>gi|21706836|gb|AAH33922.1| Gyltl1b protein, partial [Mus musculus]
          Length = 285

 Score =  224 bits (570), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 108/207 (52%), Positives = 136/207 (65%), Gaps = 25/207 (12%)

Query: 482 VAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRNIGYHVVYK 541
           VAQLSMDRLQM+E L +HW GP+SL LY++D EAQQFL +   S  L  R+++ YHVVY+
Sbjct: 1   VAQLSMDRLQMLEALCRHWPGPMSLALYLTDEEAQQFLHFVETSPVLSMRKDVAYHVVYR 60

Query: 542 EGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVV 601
           +G  YP+N LRNVAL Q  TPYVFL DIDFLP + LY YL++SI  +++     K  LVV
Sbjct: 61  DGPLYPVNQLRNVALAQALTPYVFLSDIDFLPAYSLYDYLRASIEQLELDSR-RKTALVV 119

Query: 602 PAFETQRYRTAFPAS------------------------HAPTNFSRWVNATTPYQIEWA 637
           PAFET  YR +FP S                        H+ T++SRW  A  PY+++W+
Sbjct: 120 PAFETLHYRFSFPNSKAELLTLLDAGSLHTFRYHEWPQGHSSTDYSRWREAQAPYRVQWS 179

Query: 638 PDFEPYIVAHRDLPRYDTRFVGFGWNK 664
            D+EPY+V  RD PRYD RFVGFGWNK
Sbjct: 180 ADYEPYVVVPRDCPRYDPRFVGFGWNK 206


>gi|402590775|gb|EJW84705.1| glycosyltransferase, partial [Wuchereria bancrofti]
          Length = 354

 Score =  219 bits (558), Expect = 4e-54,   Method: Compositional matrix adjust.
 Identities = 108/231 (46%), Positives = 147/231 (63%), Gaps = 2/231 (0%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   +Y A   +   SWIPN+HYSG YG LKL L  +LP  + K I LDTD +   DI
Sbjct: 122 PAVKVIYYNASQYLGRFSWIPNRHYSGRYGFLKLILNDILPADVDKVITLDTDALIMDDI 181

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYL-GKLWKNHKPWPALGRGFNTGVILLDLTKLRDI 284
           AQLW+ FS++   Q IGLVEN S+WYL  +       WPA GRGFN+GV+LLDLTKLR+I
Sbjct: 182 AQLWSFFSKMTSLQAIGLVENLSNWYLINRNTSQRIVWPAWGRGFNSGVMLLDLTKLRNI 241

Query: 285 SWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELC 344
           SW+  W+ IA K +       LADQD+ NA+I+ H ++V+ LPC+WN QL   ++ D LC
Sbjct: 242 SWSYIWKEIASKNVKDYGPVELADQDVINAVINHHQWIVHKLPCEWNFQLGFQSQQD-LC 300

Query: 345 YTELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCN 395
             E++ LK++HWNSP K +  N++    R  Y +  + DGN+ R  +  C+
Sbjct: 301 PVEISRLKLVHWNSPLKTRTVNRYAVLLRRYYDSMRDMDGNMFRSNIIHCH 351



 Score =  155 bits (393), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 78/164 (47%), Positives = 104/164 (63%), Gaps = 2/164 (1%)

Query: 11  TCEVIQVA-IVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVP 69
            C+V+ +A +VC G + +  L TLIKSIL YR+  +  HL+ D + + I+  LF+TW +P
Sbjct: 63  NCQVLHLALVVCGGVHLSYPLSTLIKSILRYRQQAIVLHLLVDDITMRIISLLFNTWRLP 122

Query: 70  QVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIA 129
            V+V +Y A   +   SWIPN+HYSG YG LKL L  +LP  + K I LDTD +   DIA
Sbjct: 123 AVKVIYYNASQYLGRFSWIPNRHYSGRYGFLKLILNDILPADVDKVITLDTDALIMDDIA 182

Query: 130 QLWALFSRLRQRQTIGLVENQSDWYL-GKLWKNHKPWPALGRGF 172
           QLW+ FS++   Q IGLVEN S+WYL  +       WPA GRGF
Sbjct: 183 QLWSFFSKMTSLQAIGLVENLSNWYLINRNTSQRIVWPAWGRGF 226


>gi|256079175|ref|XP_002575865.1| glycosyltransferase-related [Schistosoma mansoni]
          Length = 757

 Score =  217 bits (552), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 141/432 (32%), Positives = 206/432 (47%), Gaps = 94/432 (21%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           P     FY  +  +  VSWIP+ HYSG++GL KL +P++LPE++ K I LD D++  +D+
Sbjct: 273 PNFNVYFYQLELYLHKVSWIPSTHYSGLFGLSKLLIPEILPESVEKVINLDVDLLVNSDL 332

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
            +LWA F +    Q IGLV NQS WYL K   N   WPA G GFNTGV+LLDL +LR+  
Sbjct: 333 LELWAHFQKFNTSQMIGLVANQSPWYLRK--TNRIVWPAWGPGFNTGVMLLDLVRLRESH 390

Query: 286 WAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           W+  W    +  L    ++ LADQDI NA + E P ++Y LPC+WNVQL+ N  + +LC 
Sbjct: 391 WSEHWYKTTKSALQHIPYSMLADQDIINAALVEIPSIIYQLPCEWNVQLTSNM-NIKLCA 449

Query: 346 TELTD---------------------LKIIHWNSPKK----------------LKVKNKH 368
              ++                     LKI H+N P K                L++K + 
Sbjct: 450 IIWSNDYSIHNVQSFHSNPFESFNYHLKIAHFNHPIKADMITLNEIIDDDQLRLRLKRQF 509

Query: 369 MEFFRNLYLTFLEYDGNLLRR--------ELFGCNLTQTTLQQAELSNLNEEDPCYDLRR 420
           M  +R     +  +DG +++         E +  +     + Q    N    D C +   
Sbjct: 510 MRMYR----YYQNFDGTIVQSPGKLSSCSEYYNPSTFNRKINQTSSKNTITIDGCEEFSE 565

Query: 421 SKLTSLRTHLYFLEYEYEASS---------DGNDVTLVAQLSMDRLQMVEINPNNVDEYE 471
             L + R H Y+ +Y +   +           N V+LV+QL+ DRL  +E          
Sbjct: 566 DLLKTHRIHPYYYKYNFSTFTMKDKENVINMKNHVSLVSQLTFDRLHRLE---------- 615

Query: 472 ASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSR 531
                                 E+  + W GP+SL LY++D EA   + Y  NS  L++R
Sbjct: 616 ----------------------ELAIR-WLGPMSLALYLTDREATLLIEYITNSRILQNR 652

Query: 532 RNIGYHVVYKEG 543
            NIGYH+VY +G
Sbjct: 653 TNIGYHIVYVDG 664



 Score =  132 bits (332), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 76/198 (38%), Positives = 100/198 (50%), Gaps = 42/198 (21%)

Query: 15  IQVAIVCAGYNSTRSLVTLIKSILFYRK-------------------------------- 42
           I +++V  G  + R LVTLIKSIL  RK                                
Sbjct: 179 IDISLVIGGAQAARGLVTLIKSILLQRKWERSVEFSKFPRPATLKSTVTFNGKETIMDNV 238

Query: 43  --------NPLHFHLITDTVALNILQTLFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYS 94
                     ++ HLI D  A   L TLF TW++P   V FY  +  +  VSWIP+ HYS
Sbjct: 239 NKHNECYIRHINLHLIIDYHAFKSLSTLFDTWNIPNFNVYFYQLELYLHKVSWIPSTHYS 298

Query: 95  GVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWY 154
           G++GL KL +P++LPE++ K I LD D++  +D+ +LWA F +    Q IGLV NQS WY
Sbjct: 299 GLFGLSKLLIPEILPESVEKVINLDVDLLVNSDLLELWAHFQKFNTSQMIGLVANQSPWY 358

Query: 155 LGKLWKNHKPWPALGRGF 172
           L K   N   WPA G GF
Sbjct: 359 LRK--TNRIVWPAWGPGF 374



 Score = 45.1 bits (105), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 14/31 (45%), Positives = 22/31 (70%)

Query: 634 IEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           ++W+ D+EPY++  R    +D  F+GFGWNK
Sbjct: 665 VQWSADYEPYVIVSRHTTMFDEIFIGFGWNK 695


>gi|444511886|gb|ELV09960.1| Glycosyltransferase-like protein LARGE1, partial [Tupaia chinensis]
          Length = 255

 Score =  196 bits (498), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 86/138 (62%), Positives = 108/138 (78%)

Query: 5   EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
           ++PV+  CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A  IL TLF 
Sbjct: 93  QQPVVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 152

Query: 65  TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
           TW VP V V FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ F
Sbjct: 153 TWMVPAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITF 212

Query: 125 ATDIAQLWALFSRLRQRQ 142
           ATDIA+LWA+F + +  Q
Sbjct: 213 ATDIAELWAVFHKFKASQ 230



 Score = 99.0 bits (245), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 44/74 (59%), Positives = 55/74 (74%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           PA+   FY AD +  +VSWIPNKHYSG+YGL+KL L K LP  L + IVLDTD+ FATDI
Sbjct: 157 PAVRVDFYNADELKSEVSWIPNKHYSGIYGLMKLVLTKTLPANLERVIVLDTDITFATDI 216

Query: 226 AQLWALFSRLRQRQ 239
           A+LWA+F + +  Q
Sbjct: 217 AELWAVFHKFKASQ 230


>gi|326428547|gb|EGD74117.1| Gyltl1b protein [Salpingoeca sp. ATCC 50818]
          Length = 830

 Score =  194 bits (494), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 171/591 (28%), Positives = 255/591 (43%), Gaps = 119/591 (20%)

Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
            Y  D  +  +SW+   HYSG Y LLKL LP +L + +   I LDTD+ F  DIA+LW +
Sbjct: 180 LYPGDRHLSAISWVETSHYSGQYPLLKLLLPTLLAD-VQHVIALDTDLTFRADIAELWRI 238

Query: 232 FSRLRQRQTIGLVENQSD-----------WYLGKLWKNHKPWPALGRGFNTGVILLDLTK 280
           F+     + IG+VEN S            W        H P           + +  L+ 
Sbjct: 239 FAMFEAEEAIGVVENLSGGERRGGRRLCIWRARARVSPHSP-----------LCMASLSC 287

Query: 281 LRDI-SWAGFWRIIAEKFL----LTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLS 335
            R   +WA  WR      L    +    T+LADQDI NA++++HP  +  L C WN QLS
Sbjct: 288 RRSQRNWANVWRDTTRAVLSDNRIGVTATALADQDIINAVLNQHPEWMLLLSCVWNFQLS 347

Query: 336 DNTRSDELCYTELTDLKIIHWNSPKK----LKVKNKHMEFFRNLYLTFLEYDGNLLRREL 391
           +++ +   C     D  ++HWN+  K      + +      R+ Y      DG LL    
Sbjct: 348 EHSLAQTQCSAR--DALVVHWNTRAKDMQATLLDDNAFRALRDAYEISRRVDGALLTSTT 405

Query: 392 F--GCNLTQTTLQQ--------------------------AELSNLNEEDPCYDLRRSKL 423
               C+   T +                             EL N  E+      R+S  
Sbjct: 406 LDVACDSHTTAVNNRQQQQQIQIQQQLQHQHQHQQQQQQQKELENQGEDGGGGHTRQSAH 465

Query: 424 TSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEINPNNVDE-------------Y 470
              R  L  L     +SS  + + ++ Q S+ R   V   P  +               Y
Sbjct: 466 RQPRPQLSALSSSSSSSSQSSILGVMGQTSIPRAVRVACPPTYLSALQAPLRVHPYFIAY 525

Query: 471 EASSDG-NDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALR 529
           +AS +  +++TLV   ++DRLQ    + + W+GPIS  +++S+ +A  F S A    AL 
Sbjct: 526 DASLETESEITLVTHFTLDRLQAFVAMAETWQGPISAAVFISEHDA--FTSLA----ALH 579

Query: 530 SRRNIGYHVVYKE----GNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSI 585
              N+  HVVY++    G+ YPIN LRN+A++   TPY   LD+DF     L+  L   +
Sbjct: 580 -LTNVALHVVYQDPDHDGSLYPINRLRNMAISLCRTPYFLHLDVDFSFRSDLHAVLGRHV 638

Query: 586 RSMDMH--------GHGGKKVLVVPAFETQRYR------------------------TAF 613
           R++ +         G      LVVPAFE+  Y+                        + +
Sbjct: 639 RAVSIGELNIEASMGAARGTALVVPAFESVSYKRHVPDTKERLLWELSTGQMQPFRVSEW 698

Query: 614 PASHAPTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           P  H+ T+F+RW  A  PY I+    FEPY+V  +  P +D  F+GFGWNK
Sbjct: 699 PQGHSATDFARWKQADAPYLIQHQSGFEPYLVLPKTAPPFDETFLGFGWNK 749



 Score =  103 bits (257), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 56/140 (40%), Positives = 81/140 (57%), Gaps = 1/140 (0%)

Query: 12  CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQV 71
           C+ + +A V  GY  +  L T++KSIL +R  P+H H+ITD   L +L  +  TW +   
Sbjct: 117 CDTVHLAAVSVGYAQSAHLRTMLKSILLHRSLPIHLHVITDDATLPVLDVMLGTWGIRAF 176

Query: 72  EVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQL 131
             + Y  D  +  +SW+   HYSG Y LLKL LP +L + +   I LDTD+ F  DIA+L
Sbjct: 177 NYTLYPGDRHLSAISWVETSHYSGQYPLLKLLLPTLLAD-VQHVIALDTDLTFRADIAEL 235

Query: 132 WALFSRLRQRQTIGLVENQS 151
           W +F+     + IG+VEN S
Sbjct: 236 WRIFAMFEAEEAIGVVENLS 255


>gi|60360244|dbj|BAD90366.1| mKIAA4105 protein [Mus musculus]
          Length = 249

 Score =  186 bits (473), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 106/243 (43%), Positives = 140/243 (57%), Gaps = 36/243 (14%)

Query: 376 YLTFLEYDGNLLRRELFGCNLTQTTLQQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEY 435
           +L F     ++L+  LF  +     LQQA L+ L+EE+PC++ R+ +LT  R H+ FL +
Sbjct: 28  WLAFGRSCPSVLKDMLFFRSWNPAQLQQA-LTQLDEEEPCFEFRQQQLTVHRVHITFLPH 86

Query: 436 EYEASSDGNDVTLVAQLSMDRLQMVEINPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEM 495
           +                                        +DVTLVAQLSMDRLQM+E 
Sbjct: 87  QPPPPQ----------------------------------PHDVTLVAQLSMDRLQMLEA 112

Query: 496 LFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVA 555
           L  HW GP+SL LY++D EAQQFL +   S  L  R+++ YHVVY++G  YP+N LRNVA
Sbjct: 113 LCGHWPGPMSLALYLTDEEAQQFLHFVETSPVLSMRKDVAYHVVYRDGPLYPVNQLRNVA 172

Query: 556 LNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPA 615
           L Q  TPYVFL DIDFLP + LY YL++SI  +++     K  LVVPAFET  YR +FP 
Sbjct: 173 LAQALTPYVFLSDIDFLPAYSLYDYLRASIEQLELDSR-RKTALVVPAFETLHYRFSFPN 231

Query: 616 SHA 618
           S A
Sbjct: 232 SKA 234


>gi|345324407|ref|XP_001507723.2| PREDICTED: glycosyltransferase-like protein LARGE2-like
           [Ornithorhynchus anatinus]
          Length = 542

 Score =  184 bits (466), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 84/124 (67%), Positives = 103/124 (83%)

Query: 13  EVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQVE 72
           +++ VAIVCAG+NS+R  VTL+KSILF+RKNPLHFHL+TD VA  IL+TLF +W VP V 
Sbjct: 32  QLLHVAIVCAGHNSSRDAVTLVKSILFHRKNPLHFHLVTDAVARLILETLFRSWMVPSVL 91

Query: 73  VSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLW 132
           VSFY AD +  +V+WIPNKHYSG+YGLLKLTLP+ LP  L + IVLDTD+ FA+DIA LW
Sbjct: 92  VSFYDADELKPEVAWIPNKHYSGIYGLLKLTLPRALPPDLPRVIVLDTDITFASDIADLW 151

Query: 133 ALFS 136
           A+FS
Sbjct: 152 AVFS 155



 Score =  103 bits (257), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 51/110 (46%), Positives = 64/110 (58%), Gaps = 26/110 (23%)

Query: 579 PYLKSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS---------------------- 616
           P+ ++SI  +D+     K  LVVPAFET RYR +FPAS                      
Sbjct: 228 PFPRASIEQLDLATR--KAALVVPAFETLRYRLSFPASKAELLTLLDAGSLYTFRYHVWP 285

Query: 617 --HAPTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
             HAPT+++RW  A  PY++ WA D+EPY+V  RD PRYD RFVGFGWNK
Sbjct: 286 RGHAPTDYARWREAPAPYRVAWAADYEPYVVVRRDCPRYDPRFVGFGWNK 335



 Score = 95.9 bits (237), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 42/62 (67%), Positives = 51/62 (82%)

Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
           FY AD +  +V+WIPNKHYSG+YGLLKLTLP+ LP  L + IVLDTD+ FA+DIA LWA+
Sbjct: 94  FYDADELKPEVAWIPNKHYSGIYGLLKLTLPRALPPDLPRVIVLDTDITFASDIADLWAV 153

Query: 232 FS 233
           FS
Sbjct: 154 FS 155


>gi|395543773|ref|XP_003773787.1| PREDICTED: glycosyltransferase-like protein LARGE2 [Sarcophilus
           harrisii]
          Length = 448

 Score =  181 bits (460), Expect = 9e-43,   Method: Compositional matrix adjust.
 Identities = 89/177 (50%), Positives = 115/177 (64%), Gaps = 36/177 (20%)

Query: 177 SVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLR 236
           S   +V+WIPNKHYSG+YGLLKLTLP+ LP +L + IVLDTD+ FA+DIA+LWA+F+R  
Sbjct: 22  STAPEVAWIPNKHYSGIYGLLKLTLPRALPPSLPRVIVLDTDITFASDIAELWAVFARF- 80

Query: 237 QRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEK 296
                                             +GVILL L +LR   W   WR+ AE+
Sbjct: 81  ----------------------------------SGVILLHLERLRQAGWEQMWRLTAER 106

Query: 297 FLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKI 353
            LLT L TSLADQD+FNA+I +HP+LV+ LPC WNVQLSD+TR+++ CY E +DLK+
Sbjct: 107 ELLTMLATSLADQDVFNAVIKQHPWLVHPLPCSWNVQLSDHTRAEQ-CYLEGSDLKV 162



 Score = 96.7 bits (239), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 46/93 (49%), Positives = 54/93 (58%), Gaps = 24/93 (25%)

Query: 596 KKVLVVPAFETQRYRTAFPAS------------------------HAPTNFSRWVNATTP 631
           K  LVVPAFET RYR +FP S                        H PT+++RW  A  P
Sbjct: 184 KAALVVPAFETLRYRLSFPTSKAELLSLLDDGSLYTFRYHVWPRGHEPTDYARWREARAP 243

Query: 632 YQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           Y++EWA D+EPY+V  RD PRYD RFVGFGWNK
Sbjct: 244 YRVEWAADYEPYVVVPRDCPRYDPRFVGFGWNK 276



 Score = 93.6 bits (231), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 39/59 (66%), Positives = 50/59 (84%)

Query: 80  SVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRL 138
           S   +V+WIPNKHYSG+YGLLKLTLP+ LP +L + IVLDTD+ FA+DIA+LWA+F+R 
Sbjct: 22  STAPEVAWIPNKHYSGIYGLLKLTLPRALPPSLPRVIVLDTDITFASDIAELWAVFARF 80


>gi|268578177|ref|XP_002644071.1| C. briggsae CBR-LGE-1 protein [Caenorhabditis briggsae]
          Length = 471

 Score =  177 bits (450), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 136/448 (30%), Positives = 206/448 (45%), Gaps = 107/448 (23%)

Query: 266 GRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYT 325
           GRGFNTGVI++DL +LRD+ +   WR +A K+L     T+++DQDIFNA + E+P  V T
Sbjct: 15  GRGFNTGVIMMDLDRLRDMDFKTKWRAVANKYLRIHGKTTMSDQDIFNAYLYEYPQEVKT 74

Query: 326 LPCQWNVQLSDNTRSDELCYTELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGN 385
           LPC WN QL   ++SD+LC+ E    + +H+NS  K   KN         YL +     N
Sbjct: 75  LPCAWNYQLGVLSKSDKLCFEE---PQALHFNSQNKTVRKN---------YLLY-----N 117

Query: 386 LLRRELFGCNLTQTTLQQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGND 445
            +R+E+ G + T+                     R + + + TH         +S   + 
Sbjct: 118 AVRKEIDGIDGTEL--------------------RKRRSLVSTH------PAPSSPTPSK 151

Query: 446 VTLVAQLSMDRLQMVEINPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPIS 505
                Q     LQ   + PN + + +   +  ++ LV Q S DRL       + W  PIS
Sbjct: 152 TNTCEQFL--PLQNFRVLPNALGKIK---ENPELCLVTQFSKDRLHSFVKNARKWNRPIS 206

Query: 506 LTLYMSDAEAQQFLSYAGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVF 565
             +Y  D++  +    A   +AL +R +I  H+V++E + YPIN LRNVA+N      + 
Sbjct: 207 AAIYGKDSDLAEI---ADVIKAL-NRSDIAIHMVFEEESTYPINFLRNVAINHTECVNML 262

Query: 566 LLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFET------------------- 606
           L D DF+ ++G Y  L+   R++       K+VLV+PAFET                   
Sbjct: 263 LADADFI-IYGDYKELEDQARNL-----TEKEVLVIPAFETTNASMTDVSKFPQTKKDVV 316

Query: 607 ----------------------------QRYR-TAFPASHAPTNFSRWVNATTPYQIEWA 637
                                       Q +R  ++P +H  TN   W+ A   Y + + 
Sbjct: 317 LSFFNKTVKIMLNPLRIFFTLPKTILKVQMFRGKSWPNAHNATNIVEWIAAEHVYSVNYN 376

Query: 638 PDFEPY-IVAHRDLPRYDTRFVGFGWNK 664
            ++EPY I+     P YD RF G+GWNK
Sbjct: 377 RNYEPYFIIKKTSFPMYDLRFGGYGWNK 404


>gi|291242113|ref|XP_002740954.1| PREDICTED: conserved hypothetical protein-like [Saccoglossus
           kowalevskii]
          Length = 656

 Score =  165 bits (417), Expect = 9e-38,   Method: Compositional matrix adjust.
 Identities = 136/519 (26%), Positives = 232/519 (44%), Gaps = 96/519 (18%)

Query: 185 IPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLV 244
           +P  + S  Y    L L K     + + + +D +VIF +D A++W+ F+     Q +G+V
Sbjct: 125 VPGVNCSTYYNTEFLELMKSPATRIPRLMYIDWNVIFLSDPAEIWSQFNNFAPGQAVGVV 184

Query: 245 ENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWT 304
                        +++P        NT ++LLD+  + +++       + +K  L     
Sbjct: 185 G-----------LSYRP-----EDMNTNLLLLDMNYVMELNLQD----VLDKIGLGSDNY 224

Query: 305 SLADQD-----IFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWNSP 359
            + D +        ++ + +P  ++ LPC +N+   D T S   CY ++ ++  + + SP
Sbjct: 225 DITDHNSKLHTTITSLSTTNPDALFLLPCNYNL---DFTNSSMDCYLQMGNM--LSFQSP 279

Query: 360 KKLK----VKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQTTLQQAELSNLNEEDPC 415
           +       +   + + FRN Y + L+YDGNL R ++ GC +     ++      N E  C
Sbjct: 280 ESFGNDPWLNIDYHQSFRNQYFSLLDYDGNLFRHKIIGCGINIARPEKIVKLFRNGELDC 339

Query: 416 YDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEINPNNVDEYEASSD 475
                 +   LRTH YF+         GN  T                       E  S 
Sbjct: 340 SVFFMRETMILRTHTYFI---------GNIPT-----------------------EKDSP 367

Query: 476 GNDVTLVAQLSMDRL-QMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRNI 534
             ++TL  Q + DR  QM+E +  +W GPIS+ +Y +D + Q   +    S+ L++R N+
Sbjct: 368 PIEITLSLQSTFDRFYQMLERIATYWTGPISVAVYATDTQVQLIPAMFEMSQILKNRNNV 427

Query: 535 GYHV--VYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHG 592
             HV  V +E   YP N +RN+A+ +V+T Y+++ +IDFL    LY  LK  ++     G
Sbjct: 428 AVHVGFVEREVGQYPFNHMRNIAMEEVNTTYLYIGEIDFLYKPTLYEELKDIVQWFVTSG 487

Query: 593 HGGK-KVLVVPAFETQRYRTA-------------------------FPASHAPTNFSRWV 626
            G + K LVVPA+E  R   +                         FP  H P+N+++ +
Sbjct: 488 GGMEMKALVVPAYEVNRLNVSYIPSTKAELIRLWDNEDLHLFHFWDFPQGHLPSNYTKIM 547

Query: 627 NATTPYQIEWAPDFEPYIV-AHRDLPRYDTRFVGFGWNK 664
               PY I W   +EPY+V  ++ LP +  RF G G +K
Sbjct: 548 LEDQPYPIIWEKGYEPYVVLLNKALPDFPLRFSGQGHDK 586



 Score = 63.9 bits (154), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 42/137 (30%), Positives = 65/137 (47%), Gaps = 21/137 (15%)

Query: 12  CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALN-ILQTLFSTWSVPQ 70
           C+ I + I     + +R+L T +KS+ FYR   LHFH I D      +L+TL+ TW+VP 
Sbjct: 68  CDDIHLLINIPFKDLSRNLTTTLKSLFFYRTTRLHFHYIVDQERTGYMLRTLYKTWNVPG 127

Query: 71  VEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQ 130
           V  S Y                    Y    L L K     + + + +D +VIF +D A+
Sbjct: 128 VNCSTY--------------------YNTEFLELMKSPATRIPRLMYIDWNVIFLSDPAE 167

Query: 131 LWALFSRLRQRQTIGLV 147
           +W+ F+     Q +G+V
Sbjct: 168 IWSQFNNFAPGQAVGVV 184


>gi|291242121|ref|XP_002740957.1| PREDICTED: glycosyltransferase-like 1b-like [Saccoglossus
           kowalevskii]
          Length = 630

 Score =  163 bits (413), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 123/490 (25%), Positives = 216/490 (44%), Gaps = 100/490 (20%)

Query: 209 LAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRG 268
           + K IVLD +VIF  DI++LW  F +         +E                 P++ R 
Sbjct: 134 IDKVIVLDLNVIFIRDISELWDEFRQFGDNNCFATIE-----------------PSIERF 176

Query: 269 FNTG-VILLDLTKL-RDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTL 326
             TG V+L+++  L R      FW    E  L +   +S   + I  AI ++    +  L
Sbjct: 177 DQTGNVVLIEIETLSRFADQTAFW----ENILQSNEQSSFTTK-IMKAIETQPGERLKLL 231

Query: 327 PCQWNVQLSDNTRSDELCYTELTDL---KIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYD 383
            C++NV ++ + +    C ++++D+    ++H       +     +  F ++Y + +E D
Sbjct: 232 ACKFNVDVNSHIQQ---CVSDISDIVSFNMVHVTEKSLNENDALFVHHFNSIYDSIVEGD 288

Query: 384 GNLLRRELFGCNLTQTTLQQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDG 443
           GNLLR  +  C ++ T   + + S + +        R +L  LR+H+++L     +  D 
Sbjct: 289 GNLLRNRMVKCPVSLTPKNKTKCSFVTD--------REQLV-LRSHMFYLGENITSPDDD 339

Query: 444 NDVTLVAQLSMDRLQMVEINPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGP 503
             +++  Q    R                                 +QM++ +  +W  P
Sbjct: 340 FSISMAMQTDFSRF--------------------------------VQMIDKISVYWTCP 367

Query: 504 ISLTLYMSDAEAQQFLSYAGNSEALRSRRNIGYHVVYKEGNF--YPINTLRNVALNQVST 561
           +S+T+Y +D EA+Q L +   S  L  R++I  H+ Y E  F  YP+N LRNVA    +T
Sbjct: 368 MSITIYATDEEARQLLKFRRISSTLSQRKDISIHISYVEREFEDYPVNHLRNVAQEGATT 427

Query: 562 PYVFLLDIDFLPMFGLYPYLKSSIRSM-DMHGHGGKKVLVVPAFETQR------------ 608
            Y+F+ +ID+     +   L++ +  + D      KK +VVPAFE+ R            
Sbjct: 428 QYIFITEIDYFYSLDICQNLQNLVSMIEDRERTMDKKAIVVPAFESNRLSDRMIPETKEE 487

Query: 609 -------------YRTAFPASHAPTNFSRWVNATTPYQIEWAPDFEPYIVAHRD-LPRYD 654
                        + T +P  H PTN+ RW++++  Y++ W   +EPY+V  R   P + 
Sbjct: 488 IVDKYLNEEYHYFHHTIYPGGHKPTNYERWIDSSEEYRVRWNRGYEPYLVLKRKHSPEFP 547

Query: 655 TRFVGFGWNK 664
             F G G+NK
Sbjct: 548 KIFSGRGYNK 557



 Score = 58.9 bits (141), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 38/141 (26%), Positives = 63/141 (44%), Gaps = 26/141 (18%)

Query: 12  CEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLIT-DTVALNILQTLFSTWSVPQ 70
           CE + + +  + Y   R   +L+KSI  + K+ L+FH+I+ D    +++ TL  TW +P 
Sbjct: 52  CEYVNILMYVSSYTEVRQFASLVKSITAHSKSSLYFHVISADEKIRSVVSTLMKTWLLPS 111

Query: 71  VEVSFYLADSVVE---DVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATD 127
           V  + Y  D  ++   D+   P                      + K IVLD +VIF  D
Sbjct: 112 VAYTCYSVDDFLDRSRDLWATP----------------------IDKVIVLDLNVIFIRD 149

Query: 128 IAQLWALFSRLRQRQTIGLVE 148
           I++LW  F +         +E
Sbjct: 150 ISELWDEFRQFGDNNCFATIE 170


>gi|308494773|ref|XP_003109575.1| CRE-LGE-1 protein [Caenorhabditis remanei]
 gi|308245765|gb|EFO89717.1| CRE-LGE-1 protein [Caenorhabditis remanei]
          Length = 434

 Score =  141 bits (355), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 121/438 (27%), Positives = 194/438 (44%), Gaps = 118/438 (26%)

Query: 275 LLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQL 334
           ++DL +LR+  W   WR +A K+L     T+++DQDIFNA I ++P  + +LPC++N QL
Sbjct: 1   MMDLDRLREGDWKSKWRAVANKYLKLHGKTTMSDQDIFNAYIHDYPTEIKSLPCEYNFQL 60

Query: 335 SDNTRSDELCY-TELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFG 393
              T+SDELC  T L     +H+NS  K   +N         Y+ +     N +R+E+ G
Sbjct: 61  GTLTKSDELCVETPLA----LHFNSQNKTVRRN---------YVVY-----NAVRKEIEG 102

Query: 394 CNLTQTTLQQA--------ELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGND 445
            + +    ++          +S  + +  C +     L + RT    L    E +     
Sbjct: 103 IDGSDLRRRRRSLKKRALPSVSKTDSKTSCEEYM--PLQNFRTLPNALGRLMETAK---- 156

Query: 446 VTLVAQLSMDRLQMVEINPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPIS 505
           + LV Q S DRL     N                                  KHW+ PIS
Sbjct: 157 LCLVTQFSKDRLDSFIENA---------------------------------KHWKQPIS 183

Query: 506 LTLYMSDAEAQQFLSYAGNSEALRS--RRNIGYHVVYKE-------GNFYPINTLRNVAL 556
             +Y +D +  + +      EA+++  R ++  H+V+KE        + YPIN LRN A+
Sbjct: 184 AAIYGTDQDMNEIV------EAVKTLNRPDLVLHMVFKEPTDQTMSQDIYPINYLRNTAI 237

Query: 557 NQVSTPYVFLLDIDFLPMFGLYPYLKS-SIRSMDMHGHGGKKVLVVPAFET--------- 606
              +   + + D+DF+    +Y   +   ++S D+     K+VLV+PAFET         
Sbjct: 238 KYSNCEKILMTDVDFM----IYEDTQDLVVQSKDL---KDKEVLVIPAFETAENNITDVT 290

Query: 607 ------QRYRTAF-------------PASHAPTNFSRWVNATTPYQIEWAPDFEPY-IVA 646
                 ++   AF             P++H  TN + W+ A   Y + +  ++EPY I+ 
Sbjct: 291 TFPHTKEQLVKAFFKKKINVFRGITWPSAHNSTNVAEWIIAEDVYSVNYEKNYEPYFIIK 350

Query: 647 HRDLPRYDTRFVGFGWNK 664
               P YD RF GFGWNK
Sbjct: 351 KSSCPMYDQRFGGFGWNK 368


>gi|10834722|gb|AAG23791.1|AF258588_1 PP5656 [Homo sapiens]
          Length = 257

 Score =  131 bits (330), Expect = 9e-28,   Method: Compositional matrix adjust.
 Identities = 66/134 (49%), Positives = 81/134 (60%), Gaps = 25/134 (18%)

Query: 555 ALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFP 614
            L Q  TPYVFL DIDFLP + LY YL++SI  + + G   K  LVVPAFET RYR +FP
Sbjct: 18  GLGQALTPYVFLSDIDFLPAYSLYDYLRASIEQLGL-GSRRKAALVVPAFETLRYRFSFP 76

Query: 615 AS------------------------HAPTNFSRWVNATTPYQIEWAPDFEPYIVAHRDL 650
            S                        HAPT+++RW  A  PY+++WA ++EPY+V  RD 
Sbjct: 77  HSKVELLALLDAGTLYTFRYHEWPRGHAPTDYARWREAQAPYRVQWAANYEPYVVVPRDC 136

Query: 651 PRYDTRFVGFGWNK 664
           PRYD RFVGFGWNK
Sbjct: 137 PRYDPRFVGFGWNK 150


>gi|443686400|gb|ELT89685.1| hypothetical protein CAPTEDRAFT_152623 [Capitella teleta]
          Length = 314

 Score =  130 bits (326), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 72/211 (34%), Positives = 105/211 (49%), Gaps = 28/211 (13%)

Query: 474 SDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRN 533
            D N +T V QL+ DRL+ +  + + W GP++   Y++  +   F+        L+ R N
Sbjct: 37  GDLNQITFVTQLTPDRLERLRRVAESWRGPMAANFYVTPKKIADFMHAYCTDPVLQKREN 96

Query: 534 IGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGH 593
           I  HV   EG FYP N +RN+A++ V T YVFL D+DF+P+ G+   ++  IR   +   
Sbjct: 97  IQLHVTQTEGLFYPSNIMRNIAVDPVKTKYVFLCDVDFVPIPGIEDIMQRYIRDGFV--- 153

Query: 594 GGKKVLVVPAFETQRYRTAFPAS------------------------HAPTNFSRWVNAT 629
              K++V+PAFE +    A P S                        H  T   RW  A 
Sbjct: 154 SDDKLIVIPAFEIKEKEIAHPKSKREVLDLLDKEKIQLMDIDHFDGAHKWTELDRWKTAE 213

Query: 630 TPYQIEWAPDFEPYIVAHRD-LPRYDTRFVG 659
            PY ++W   FEPY+V  +D  PRY+ RFVG
Sbjct: 214 EPYIVQWREYFEPYVVMPKDKTPRYNPRFVG 244


>gi|440797224|gb|ELR18319.1| N-acetyllactosaminide beta-1,3-N-acetylglucosaminyltransferase
           [Acanthamoeba castellanii str. Neff]
          Length = 807

 Score =  126 bits (316), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 73/230 (31%), Positives = 122/230 (53%), Gaps = 40/230 (17%)

Query: 473 SSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLY-MSDAEAQQFLSYAGNSEALRSR 531
           S D   V++  QL++DRL  + M+ ++W+GPIS  ++ M +AE  + L +    + L   
Sbjct: 488 SPDPKGVSIATQLTIDRLPSLRMMAQNWDGPISAAVFVMDEAEVDEVLYFRSTWDKLAD- 546

Query: 532 RNIGYHVVYKEGNF-------YPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSS 584
            NI +H+V  E N+       YP+NTLRN+AL +  T +  +LD DF+P    +  +   
Sbjct: 547 -NIDFHLVISERNYAYKTLGLYPVNTLRNIALARCRTEHALILDADFIPSPDSHSVIMDM 605

Query: 585 IRSMDMHGHGGKK------VLVVPAFE-----------------------TQRYRTAFPA 615
           +  +     G ++      V VVPAFE                        Q +   +  
Sbjct: 606 MPRLGRISEGAQEPDSPGVVWVVPAFEIEPGVPLPDSRKELVEMSRRDLIRQVHEVKWAP 665

Query: 616 SHAPTNFSRWVNATTPYQIEWAPDFEPYIVAHRD-LPRYDTRFVGFGWNK 664
           +HAPTN++ W+N++ PY+I + P++EPY++  RD +P YD RF+G+G++K
Sbjct: 666 AHAPTNYTAWMNSSEPYEITFRPEYEPYMMVPRDSVPLYDERFIGYGYDK 715


>gi|167523375|ref|XP_001746024.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163775295|gb|EDQ88919.1| predicted protein [Monosiga brevicollis MX1]
          Length = 306

 Score =  125 bits (315), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 75/234 (32%), Positives = 110/234 (47%), Gaps = 47/234 (20%)

Query: 458 QMVEINPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQ 517
           Q+V   P  V     + +   VTL+   + DRL +   L + +EGP+S  L+ +D EA +
Sbjct: 28  QIVRELPRFVCLERPALETGAVTLITAFTPDRLDLFLSLVRSYEGPVSAALFANDEEADR 87

Query: 518 FLSYAGNSEALRSRRNIGYHVVYKEG---NFYPINTLRNVALNQVSTPYVFLLDIDFLPM 574
                  +   R  R +  H++  +G      PIN LRN+A +   TP+VF +D+DF   
Sbjct: 88  VAQALAETSFARRHRRVCVHLLMPDGLAPGAVPINRLRNLAWDHAPTPWVFHVDVDF--- 144

Query: 575 FGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRYRTA---------------------- 612
                  ++S R+            VVPAFET +YR                        
Sbjct: 145 -------EASNRT----------AFVVPAFETPQYRLPRFGTKAALLEAWDLGLLRPFRM 187

Query: 613 --FPASHAPTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
             +P +H PT++ RW   T PY I+W PD+EPY++     PR+D RF GFGWNK
Sbjct: 188 DLWPVAHRPTDYHRWSGTTDPYAIDWQPDYEPYLLMSPHAPRFDERFSGFGWNK 241


>gi|326437988|gb|EGD83558.1| hypothetical protein PTSG_04163 [Salpingoeca sp. ATCC 50818]
          Length = 1024

 Score =  125 bits (315), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 83/271 (30%), Positives = 117/271 (43%), Gaps = 53/271 (19%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           P +   F      +++V+WIP +H +G YG  KL L + LP+ +   IVLDTD+    DI
Sbjct: 139 PCVSSTFRDITPALDEVAWIPTRHTAGAYGFAKLVLERALPD-VDHVIVLDTDLSVHADI 197

Query: 226 AQLWALF----------------------SRLRQ-------------RQTIGLVENQSDW 250
            QLW L                        RL               R+       QSDW
Sbjct: 198 GQLWRLAFFPSDTAHGSNNGNMHNNHNAGDRLHSLDDSTQQQQQQQQRRLFAAALQQSDW 257

Query: 251 YLGKL--WKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTR-LWTSLA 307
           YLG L    +   WPA+GRG NTGV+ L L  +R   W   W     + L +R   T+LA
Sbjct: 258 YLGTLPGIPSSLIWPAVGRGLNTGVLALHLAHMRQDGWRDLWETALRRQLASRRATTALA 317

Query: 308 DQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLK--------------I 353
           +QD+FN +  EH  L   LPC+WN+QL D+T +           +              I
Sbjct: 318 EQDVFNVVFLEHRELGGVLPCEWNLQLHDHTSAPACLMPAYQGHQHPRQYWLPPPRQPAI 377

Query: 354 IHWNSPKKLKVKNKHMEFFRNLYLTFLEYDG 384
           +HWNSP K +           ++ + +E +G
Sbjct: 378 LHWNSPLKQEADIPLRPHLMRMHQSAVEANG 408



 Score = 93.6 bits (231), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 52/150 (34%), Positives = 73/150 (48%), Gaps = 31/150 (20%)

Query: 546 YPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSM--DMHGHG-GKKVLVVP 602
           YPIN +RNVA ++ +T +V +LD DF+     Y  +++++     D      G    VVP
Sbjct: 716 YPINYMRNVAWDEATTEHVLMLDADFVISSNAYARIRATLHRQLNDRDARDTGATAFVVP 775

Query: 603 AFETQRYRTAFP------------------------ASHAPTNFSRWVNATT----PYQI 634
           A ET  Y    P                          HAPTN +RW+++T     PY++
Sbjct: 776 ALETDEYAFRVPDTVDGVRRLAKEGELRAFRSREWEKGHAPTNMTRWLSSTQQQQQPYRV 835

Query: 635 EWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           +W P +EPY+V     PRYD R  GFGWNK
Sbjct: 836 DWQPGYEPYLVLPATAPRYDERLSGFGWNK 865



 Score = 89.0 bits (219), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 61/197 (30%), Positives = 86/197 (43%), Gaps = 41/197 (20%)

Query: 16  QVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTW---SVPQVE 72
            VA+V AG ++ R     I+S+L +R   +H H++ D  A + LQ          +P V 
Sbjct: 83  HVAMVVAGVDAARGASITIRSLLQHRTVCVHLHIVCDAEARHSLQRFLQGGLQARLPCVS 142

Query: 73  VSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLW 132
            +F      +++V+WIP +H +G YG  KL L + LP+ +   IVLDTD+    DI QLW
Sbjct: 143 STFRDITPALDEVAWIPTRHTAGAYGFAKLVLERALPD-VDHVIVLDTDLSVHADIGQLW 201

Query: 133 ALF----------------------SRLRQ-------------RQTIGLVENQSDWYLGK 157
            L                        RL               R+       QSDWYLG 
Sbjct: 202 RLAFFPSDTAHGSNNGNMHNNHNAGDRLHSLDDSTQQQQQQQQRRLFAAALQQSDWYLGT 261

Query: 158 L--WKNHKPWPALGRGF 172
           L    +   WPA+GRG 
Sbjct: 262 LPGIPSSLIWPAVGRGL 278


>gi|443695311|gb|ELT96252.1| hypothetical protein CAPTEDRAFT_226558 [Capitella teleta]
          Length = 453

 Score =  125 bits (314), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 73/219 (33%), Positives = 110/219 (50%), Gaps = 25/219 (11%)

Query: 471 EASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRS 530
           +A+ DG D+T V  +S +R+ M+E +  HWEGP+S+ + +      Q +     SE+L+ 
Sbjct: 173 QATEDGRDITFVTHISYNRMYMLERIAAHWEGPMSVAIDILPENFDQLVENFCASESLQL 232

Query: 531 RRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDM 590
           + N+  H+  K+G FYP N +R VALN   T YVFL D DF+P+ G    +   ++   +
Sbjct: 233 KENLMIHLSLKQGEFYPANFIRTVALNGSCTNYVFLSDGDFVPVQGSELRMMEYVKEHGV 292

Query: 591 HGHGGKKVLVVPAFETQRYRTAF------------------------PASHAPTNFSRWV 626
              G K  +V+PAFE  R  T F                        P  H+ T F  W 
Sbjct: 293 LAPGEKHAVVLPAFELLRDGTPFPREKRDVIDLWKEKQLRPFHLREWPPGHSATLFPYWK 352

Query: 627 NATTPYQIEWAPDFEPYIVAHRD-LPRYDTRFVGFGWNK 664
            A   Y+I+W   FEPY++   D +P Y+  FVG  ++K
Sbjct: 353 IADKIYRIKWRRQFEPYLIMRSDEVPAYNQEFVGRFFDK 391


>gi|256072706|ref|XP_002572675.1| glycosyltransferase-related [Schistosoma mansoni]
 gi|360044230|emb|CCD81777.1| glycosyltransferase-related [Schistosoma mansoni]
          Length = 249

 Score =  122 bits (306), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 61/168 (36%), Positives = 99/168 (58%), Gaps = 12/168 (7%)

Query: 176 DSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRL 235
           D  ++ + WI +KH +G    +KL LP++LP ++ K IVLD D+I  T+I +LW  F R 
Sbjct: 5   DQTLDRLQWIVSKHPAGAKPFVKLLLPEILPPSVHKVIVLDLDMILNTNIVELWNHFERF 64

Query: 236 RQRQTIGLVENQSDWYLGKLWKNHKPWPAL-----GRGFNTGVILLDLTKLRDISWAGFW 290
           ++ Q IG+   Q+ ++       HK   +L     G G+N G++L DL+KLR + W   W
Sbjct: 65  QETQMIGIGLEQNPYF-------HKVMKSLISDWKGYGYNGGILLFDLSKLRLMMWNDIW 117

Query: 291 RIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNT 338
             I    + ++ +    +QD+ N II ++ +L+Y +PC+WNVQLSD +
Sbjct: 118 LSITAHLMQSKGYLVTGEQDVMNMIIFKYKHLLYEIPCEWNVQLSDGS 165



 Score = 64.7 bits (156), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 29/76 (38%), Positives = 48/76 (63%)

Query: 79  DSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRL 138
           D  ++ + WI +KH +G    +KL LP++LP ++ K IVLD D+I  T+I +LW  F R 
Sbjct: 5   DQTLDRLQWIVSKHPAGAKPFVKLLLPEILPPSVHKVIVLDLDMILNTNIVELWNHFERF 64

Query: 139 RQRQTIGLVENQSDWY 154
           ++ Q IG+   Q+ ++
Sbjct: 65  QETQMIGIGLEQNPYF 80


>gi|443733888|gb|ELU18085.1| hypothetical protein CAPTEDRAFT_135861, partial [Capitella teleta]
          Length = 270

 Score =  122 bits (306), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 66/196 (33%), Positives = 98/196 (50%), Gaps = 28/196 (14%)

Query: 485 LSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRNIGYHVVYKEGN 544
           L + R  ++E +  +WEGP S+  ++  ++     S    S+ L+ R+N+ +H+VY+ G 
Sbjct: 1   LLIGRFGVLEKMLSNWEGPASVVFHVKHSDLSVLDSLYTKSKVLKDRKNVSFHLVYRRGE 60

Query: 545 FYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAF 604
           FYPIN LRNV +NQ  TPY   +D+D +P  G+Y   K  I+   M     K  LV+P+F
Sbjct: 61  FYPINYLRNVGVNQSRTPYTIFIDVDLVPNPGIYQQAKVYIQQ--MLKPNLKVALVIPSF 118

Query: 605 ETQRYRTAFPAS-------------------------HAPTNFSRWVNATTPYQIEWAPD 639
           E    +  FP +                         H PTNF +W+ AT PY+  W   
Sbjct: 119 EAIEEKLTFPKNKAELLQQIQQNKTLPYKMHRGWNLGHKPTNFPKWLTATEPYEALWRDA 178

Query: 640 FEPYIVAHRDL-PRYD 654
           FEPY +   DL P Y+
Sbjct: 179 FEPYTITQTDLMPPYN 194


>gi|358332625|dbj|GAA51264.1| glycosyltransferase-like protein LARGE [Clonorchis sinensis]
          Length = 458

 Score =  116 bits (290), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 73/217 (33%), Positives = 108/217 (49%), Gaps = 34/217 (15%)

Query: 173 YLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALF 232
           Y A+  +  +  IPN H +GV+GLLKL +P +LPET++K IV+D D ++  D+ +LW  F
Sbjct: 154 YEAEKHLRVLKLIPNTHPAGVWGLLKLEIPSLLPETVSKVIVVDCDTLWNADVQELWLQF 213

Query: 233 SRLRQRQTIGLVENQS----DWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAG 288
           +   + Q++     QS    D   G   ++++P      G N GV L++L  LR+  W  
Sbjct: 214 NTFTENQSLSFAREQSADVGDCDTG--LRSNRP----EAGVNGGVALMNLENLRESGWWQ 267

Query: 289 FWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSD------------ 336
            WR    K LL        +Q ++NAI +  P+L   LPC+WN+QL D            
Sbjct: 268 LWRSTVAKILLAEPTLKEGEQTVYNAIRNYAPFLFGELPCEWNIQLWDQKAHECCPVIWP 327

Query: 337 --NTRSDELCY----------TELTDLKIIHWNSPKK 361
             N R    C             LT+ KIIH+NS +K
Sbjct: 328 DLNQRHKMQCKLGGSLSAAYPNGLTEAKIIHYNSAQK 364



 Score = 79.7 bits (195), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 45/164 (27%), Positives = 80/164 (48%), Gaps = 27/164 (16%)

Query: 15  IQVAIVCAGYNSTRSLVTLIKSILFYR---------------------------KNPLHF 47
           I +A+   G N+  +  +++KS+L++R                            +PL  
Sbjct: 66  IHIAVTLHGMNAIYNFRSMLKSLLYFRGTHNGGTYNCLLRKTSVTQCPKINRVTASPLVV 125

Query: 48  HLITDTVALNILQTLFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKV 107
           H+I D       +    +W+   V++  Y A+  +  +  IPN H +GV+GLLKL +P +
Sbjct: 126 HVILDESTKLGYEDHIKSWADDHVQLIEYEAEKHLRVLKLIPNTHPAGVWGLLKLEIPSL 185

Query: 108 LPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQS 151
           LPET++K IV+D D ++  D+ +LW  F+   + Q++     QS
Sbjct: 186 LPETVSKVIVVDCDTLWNADVQELWLQFNTFTENQSLSFAREQS 229


>gi|167523373|ref|XP_001746023.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163775294|gb|EDQ88918.1| predicted protein [Monosiga brevicollis MX1]
          Length = 1116

 Score =  113 bits (283), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 63/160 (39%), Positives = 94/160 (58%), Gaps = 7/160 (4%)

Query: 13   EVIQVAIVCAGYNSTRSLVTLIKSILFYR--KNPLHFHLITDTVALNILQTLFSTWSVPQ 70
            E I+VA+V  G N  R L  LIKSILF+     PL FH+ITD      L+ L+ +W +P 
Sbjct: 876  EAIEVAVVAVGSNHARRLQVLIKSILFHHLPPQPLRFHVITDHETAASLRHLYRSWRLPA 935

Query: 71   VEVSFYLADSVVE--DVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 128
            V+V FY   + ++  D+  +   HY+G Y  +KL +  +LP +L + +VLDTD++F   I
Sbjct: 936  VQVRFYSITAALQGVDLHGLETHHYAGRYAFVKLFVADLLPVSLERVMVLDTDLLFLGPI 995

Query: 129  AQLWALFSRLRQRQTIGLVENQSDWYL-GKLWKNHKPWPA 167
            A+LW  F          +V+N S+WY+ G+L +  +PWPA
Sbjct: 996  AELWDQFKGWSASAIFAVVDNFSEWYIPGRLQR--QPWPA 1033



 Score =  113 bits (282), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 67/182 (36%), Positives = 106/182 (58%), Gaps = 12/182 (6%)

Query: 166  PALGRGFYLADSVVE--DVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 223
            PA+   FY   + ++  D+  +   HY+G Y  +KL +  +LP +L + +VLDTD++F  
Sbjct: 934  PAVQVRFYSITAALQGVDLHGLETHHYAGRYAFVKLFVADLLPVSLERVMVLDTDLLFLG 993

Query: 224  DIAQLWALFSRLRQRQTIGLVENQSDWYL-GKLWKNHKPWPALGR-GFNTGVILLDLTKL 281
             IA+LW  F          +V+N S+WY+ G+L +  +PWPA    G NTGV LL L +L
Sbjct: 994  PIAELWDQFKGWSASAIFAVVDNFSEWYIPGRLQR--QPWPAPAPLGINTGVTLLHLARL 1051

Query: 282  RDISWAGFW-----RIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSD 336
            R  ++   W     R++A+   L   +  LADQD+ N +  ++P L++ LPC++N QLS+
Sbjct: 1052 RHQNFPKVWTSAVARVLADP-RLNITYAPLADQDVMNTVFYDNPTLLHRLPCRFNYQLSE 1110

Query: 337  NT 338
            NT
Sbjct: 1111 NT 1112


>gi|443717625|gb|ELU08592.1| hypothetical protein CAPTEDRAFT_191108 [Capitella teleta]
          Length = 364

 Score =  112 bits (280), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 73/214 (34%), Positives = 105/214 (49%), Gaps = 35/214 (16%)

Query: 477 NDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRNIGY 536
           +DVTLV QL++ R+  +E +   W+GP+S   ++ D      +    +++ L +R N+  
Sbjct: 87  HDVTLVTQLTISRVGRLEQVLDKWKGPVSAAFHV-DESPNDVIQAICDNKVLSNRNNLYI 145

Query: 537 HVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGK 596
           H++ K G  YPIN LRN+AL  VST + FL D+DF+P   LY   K  I S ++ G   K
Sbjct: 146 HLMRKTGRLYPINQLRNLALKCVSTGFSFLSDVDFIPHQNLYNESK-QIVSKNIEG-SKK 203

Query: 597 KVLVVPAFETQRYR-------TAFPAS------------------------HAPTNFSRW 625
           +  VV AF  Q  +         FP S                        H  TN++RW
Sbjct: 204 RAYVVAAFGVQEAKGTNKAKAVEFPESKEKLLKLWDSKMIEPVHFAKFIHGHYQTNYTRW 263

Query: 626 VNATTPYQIEWAPDFEPY-IVAHRDLPRYDTRFV 658
            NAT  Y I+W   +EPY IV   + P YD  F+
Sbjct: 264 RNATEIYPIQWYTHYEPYVIVKTTETPAYDGTFI 297


>gi|440804874|gb|ELR25737.1| fibronectin type III domain containing protein [Acanthamoeba
           castellanii str. Neff]
          Length = 642

 Score =  111 bits (278), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 65/226 (28%), Positives = 116/226 (51%), Gaps = 39/226 (17%)

Query: 473 SSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRR 532
           + + +DV + +Q+++DRL ++E + + W GPIS  + +S+ EA+    ++  + +   R+
Sbjct: 301 TPESDDVAIASQMTIDRLPVLESMAESWAGPISAVILLSNPEAELKQIHSLRNRSAVIRK 360

Query: 533 NIGYHVVYKEGN--------FYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSS 584
           N+ +H+V+ E N         YP+N+LRN A+ +  T +V +LDIDF+P        + +
Sbjct: 361 NVDFHLVHGEENVYALHTLGLYPVNSLRNEAVVKSRTAHVMVLDIDFIPSRD----AREN 416

Query: 585 IRSMDMHGHGGKKVLVVPAFETQRYRTA--------------------------FPASHA 618
           I  +    H  + V VV +FE   +                             FPA H 
Sbjct: 417 ILRLWDQLHEPQTVFVVASFEMDDFNMRDVPHTKDQLLRMVAQDRIRQVHKDKWFPA-HG 475

Query: 619 PTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           PT++  W +    YQ+ +  D+EPY +  +D+P +D RFVG+G++K
Sbjct: 476 PTDYEHWYSTRELYQVPYQRDYEPYYIVRKDVPLWDERFVGYGFDK 521


>gi|449017583|dbj|BAM80985.1| similar to glycosyltransferase [Cyanidioschyzon merolae strain 10D]
          Length = 505

 Score =  110 bits (275), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 74/255 (29%), Positives = 125/255 (49%), Gaps = 43/255 (16%)

Query: 453 SMDRLQMVEI-NPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMS 511
           ++D L++  +  P  +     +    +V+LV   S+ +L  +  L +HW  PIS  L + 
Sbjct: 180 ALDTLKLENVLGPQKIQRSHRALTEINVSLVTHASLSKLSELRRLTEHWPAPISCALVLP 239

Query: 512 D-AEAQQFLSYAGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDID 570
           D +++ Q L       + R   ++     Y+   +YP N  RN+AL+   T +VFL+D+D
Sbjct: 240 DVSQSHQVLDMYPKGRSDRVDIHLLLADDYERPTYYPFNAARNLALDNARTDWVFLVDVD 299

Query: 571 FLPMFGLYPYLKSSIR---SMDMHGHGGKKVLVVPAFET----QRYRTAFPAS------- 616
           F+P   L   ++ ++R    +       + VL+VPAFE     + ++T+ PA+       
Sbjct: 300 FVPSPNLVESVQRTLRRFPELREEMQQRRAVLIVPAFEKLQAREEHQTSLPATRAELVAE 359

Query: 617 -----------------HAPTNFSRWV---NATT------PYQIEWAPDFEPYIVAHR-D 649
                            H PT++ RW+   N T+      PY++ +   +EPYIVA+R  
Sbjct: 360 TRAGRIVPFHVSWYWPGHGPTDYIRWLADANETSEHEPHKPYRVRYRDGYEPYIVAYRHG 419

Query: 650 LPRYDTRFVGFGWNK 664
           LPRY+  FVG+GWNK
Sbjct: 420 LPRYEDSFVGYGWNK 434


>gi|345326810|ref|XP_003431086.1| PREDICTED: glycosyltransferase-like protein LARGE1-like
           [Ornithorhynchus anatinus]
          Length = 390

 Score =  109 bits (273), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 52/108 (48%), Positives = 70/108 (64%)

Query: 5   EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
           ++P +  CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D +A  IL TLF 
Sbjct: 269 QQPAVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADAIAKQILATLFQ 328

Query: 65  TWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETL 112
           TW VP V V FY AD +  ++  + +        L  +  P+ LP  L
Sbjct: 329 TWMVPAVRVDFYDADELKWNLEPLSSPVQESFAILENVLCPQSLPVCL 376


>gi|358255463|dbj|GAA57164.1| glycosyltransferase-like protein LARGE [Clonorchis sinensis]
          Length = 409

 Score =  108 bits (270), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 57/163 (34%), Positives = 85/163 (52%), Gaps = 5/163 (3%)

Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
            Y  D  +  VSWI N+H SGV  L K+ +P +LP ++ K IVLDTD I   DI +LW L
Sbjct: 156 LYDLDLYLPRVSWIRNQHCSGVGALSKMLVPDILPTSVNKAIVLDTDTILNADILELWKL 215

Query: 232 FSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWR 291
           F    Q Q  G+      W    +   +   P    G N+GV+L+ L+++R   W+  + 
Sbjct: 216 FDNFTQIQIFGMA-----WETDPVGPENAETPIPKSGVNSGVVLMYLSRMRRAKWSQLFE 270

Query: 292 IIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQL 334
               +FL      S+ +Q   N +I + P L + +PC+WN+QL
Sbjct: 271 WAVREFLFHYGLLSVGEQSALNTLIQQRPELYHRIPCEWNLQL 313



 Score = 81.3 bits (199), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 42/103 (40%), Positives = 56/103 (54%)

Query: 45  LHFHLITDTVALNILQTLFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTL 104
           +H HLI +   L+   +  S W V Q   + Y  D  +  VSWI N+H SGV  L K+ +
Sbjct: 126 IHLHLIVEQKLLSAAYSYVSQWRVAQFNWTLYDLDLYLPRVSWIRNQHCSGVGALSKMLV 185

Query: 105 PKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLV 147
           P +LP ++ K IVLDTD I   DI +LW LF    Q Q  G+ 
Sbjct: 186 PDILPTSVNKAIVLDTDTILNADILELWKLFDNFTQIQIFGMA 228


>gi|443712918|gb|ELU06002.1| hypothetical protein CAPTEDRAFT_208613 [Capitella teleta]
          Length = 403

 Score =  108 bits (270), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 69/216 (31%), Positives = 106/216 (49%), Gaps = 35/216 (16%)

Query: 479 VTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRNIGYHV 538
            TLV Q + DR   + M+ K+W+GPIS  +Y+   + + F+      E   +RRNI  H+
Sbjct: 130 TTLVTQFTPDRFNRLSMVAKNWDGPISAGIYLPVEKLESFVQDFCTDENTYNRRNIQIHI 189

Query: 539 VYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKK- 597
            +  G FYP+N LRN+AL+   T +VFL DID +PM G+      +I +  ++    K+ 
Sbjct: 190 TFTHGEFYPMNFLRNLALDAAMTSHVFLSDIDLVPMPGI-----DTIVAKYLYEEKFKRG 244

Query: 598 -VLVVPAFETQ------------------------RYR---TAFPASHAPTNFSRWVNAT 629
             LVV  FE                          +Y    T    + A TN++ W    
Sbjct: 245 QALVVACFEKTPNVETPLDKKQLLSLWSKHELEPCQYSFKWTKKRKAQAATNYALWRKTV 304

Query: 630 TPYQIEWAPDFEPYIVAHR-DLPRYDTRFVGFGWNK 664
             Y+I WAP++EP+I+  + D+P Y  +FVG   +K
Sbjct: 305 DIYRITWAPEYEPFIIMFKDDVPAYYPKFVGRNRDK 340


>gi|443719772|gb|ELU09782.1| hypothetical protein CAPTEDRAFT_220855 [Capitella teleta]
          Length = 472

 Score =  108 bits (270), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 72/220 (32%), Positives = 111/220 (50%), Gaps = 35/220 (15%)

Query: 475 DGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRNI 534
           +G+ VTLV Q+S +RL  +    KHW GP+S+ +Y+     + F+     +E+L  R NI
Sbjct: 190 NGSRVTLVTQVSFNRLFNLTTQAKHWPGPMSVAMYLPLFRLETFVMDYCEAESLNKRENI 249

Query: 535 GYHVVYKEGN-FYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGH 593
             H++  EG  F+P N +RN+A+  V T + F+ D D +P     P ++++I      G+
Sbjct: 250 QIHIMLDEGCYFFPTNVMRNLAMEYVYTSHSFITDGDHIPN----PNMENNIIKHAEDGY 305

Query: 594 -GGKKVLVVPAFE---------------------TQRYRTAF------PASHAPTNFSRW 625
             G  V+++PAFE                      ++Y   F      PA + PTN++ W
Sbjct: 306 LKGATVMIIPAFEYTVMPEGFTMVHDKEELMKHWDEKYLKPFYIDRFLPA-YGPTNYTYW 364

Query: 626 VNATTPYQIEWAPDFEPYIVAHRDLPR-YDTRFVGFGWNK 664
             A  PY IE+   FEPYIVA   L   Y+  F+G  +NK
Sbjct: 365 RTAEHPYTIEYGDQFEPYIVAETKLTHSYNNSFIGRFYNK 404


>gi|443687234|gb|ELT90283.1| hypothetical protein CAPTEDRAFT_177310 [Capitella teleta]
          Length = 361

 Score =  108 bits (269), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 71/201 (35%), Positives = 101/201 (50%), Gaps = 27/201 (13%)

Query: 479 VTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRNIGYHV 538
           +TLV QL  +RLQ +E L   W GP S+ LY SD EA  F  Y  ++  LR+  ++  H+
Sbjct: 46  ITLVTQLQPNRLQGLERLLMEWRGPASVALYASDEEAVDFAKYFRSNTKLRTLPDLVVHL 105

Query: 539 VYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKV 598
           V+K    YPIN LRNVA+ +V T Y  +LD+DF+P  GL  + ++    +          
Sbjct: 106 VFKRQKQYPINYLRNVAMREVLTQYHLMLDVDFVPNSGLAQHAQNIFEKLST-TEETPVA 164

Query: 599 LVVPAFETQR--YRTAFPAS-----------------------HAPTNFSRWVNATTPYQ 633
           LVVP FET    Y+ A   S                       H  T++++W  AT PY+
Sbjct: 165 LVVPTFETSNSFYKIAENRSEFLHHLDEGKVLSYHQTAQWNLGHRYTSYAKWKTATDPYK 224

Query: 634 IEWAPDFEPYIVAH-RDLPRY 653
           +  A  +EPY+V   +D P Y
Sbjct: 225 VYGAFGYEPYLVIRAQDSPEY 245


>gi|358340195|dbj|GAA48141.1| glycosyltransferase-like protein LARGE, partial [Clonorchis
           sinensis]
          Length = 695

 Score =  107 bits (268), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 59/205 (28%), Positives = 93/205 (45%), Gaps = 48/205 (23%)

Query: 185 IPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQT---- 240
           +PN H+SG+  ++KL +P +LP T+ K IV+D+D++F  ++ +LW LF R    Q     
Sbjct: 105 MPNSHHSGIPSMMKLLVPVILPSTVEKVIVMDSDMLFNHNVLELWELFYRFNSTQVSVEC 164

Query: 241 -IGLVENQSDWYLG---------------KLWKNHK------------------------ 260
            + +     DW                  +LW   K                        
Sbjct: 165 QVTVTARHFDWCTNLIKYNNTDFHEVPDTELWVTCKLEDLEYQKGELGSKVYGTVYTKKL 224

Query: 261 ----PWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAII 316
               P P   RG N G+ ++ LT+LR+  W   W+  ++KF         ADQDIFN +I
Sbjct: 225 TTVPPKPQQDRGLNGGLSIMHLTRLRENGWHDKWQEASKKFFERNTMLKQADQDIFNTVI 284

Query: 317 SEHPYLVYTLPCQWNVQLSDNTRSD 341
             +P + + +PC+WN+QL   T +D
Sbjct: 285 EMNPTMYFRIPCEWNIQLCSKTTAD 309


>gi|281212139|gb|EFA86299.1| putative glycosyltransferase [Polysphondylium pallidum PN500]
          Length = 393

 Score =  107 bits (266), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 56/213 (26%), Positives = 112/213 (52%), Gaps = 32/213 (15%)

Query: 479 VTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSD--AEAQQFLSYAGNSEALRSRRNIGY 536
           VT+V Q +++R+  + ++   W+ PIS ++++ D   E  Q      N+  L   + + +
Sbjct: 104 VTIVTQATVNRVDRIVLMADRWKAPISTSIFIRDPDTELDQLNQMIANNPILM--KFVDF 161

Query: 537 HVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGK 596
           H++Y     YP+N LRN+++    T YV ++D DF+P +G++ YL   ++ +D  G+  K
Sbjct: 162 HLLYANNTRYPVNNLRNLSIRNSHTDYVLVMDADFIPPYGMHDYL---LKYIDNTGNEEK 218

Query: 597 KVLVVPAFETQR------------------------YRTAFPASHAPTNFSRWVNATTPY 632
              +VP+F + +                             P  H+P++++RW   T PY
Sbjct: 219 VAYIVPSFSSSQDPRILPNDKPTLVEMVEKNLVNPSNLNVCPKCHSPSDYNRWFETTVPY 278

Query: 633 QIEWAPDFEPYIVAHRDLPR-YDTRFVGFGWNK 664
           +++++  +EPY++ +R+    +D RF G+G++K
Sbjct: 279 EVKYSWIYEPYLIFNRNNTELFDERFKGYGFDK 311


>gi|443726598|gb|ELU13717.1| hypothetical protein CAPTEDRAFT_190301 [Capitella teleta]
          Length = 449

 Score =  107 bits (266), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 73/229 (31%), Positives = 112/229 (48%), Gaps = 41/229 (17%)

Query: 472 ASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSR 531
           + +D   VTLV  LS+D L+ + +L  HW+GPIS  +++   +    +    + EAL SR
Sbjct: 146 SGADDGLVTLVTHLSLDHLESLAILIDHWQGPISAAVWVPKNKTASAVMGFCSIEALLSR 205

Query: 532 RNIGYHVVYKE------GNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSI 585
            NI  H+V ++        FYP N LRNVAL +  T YVFL D+D +PM  +   L   +
Sbjct: 206 DNILLHLVIEDHVFTFNKAFYPSNYLRNVALEEAKTIYVFLSDVDVIPMASMTRRLGFQL 265

Query: 586 RSMDMHGHGGKKVLVVPAFETQRYRTAFPASHAPTN-----------------FSRWVNA 628
               +   G K+VLVVP FE+    ++F  S  P +                 F R++  
Sbjct: 266 AKRPL---GSKQVLVVPTFES--LNSSFCCSTIPNSKIALLKLWAQHRIGPFYFKRYIPG 320

Query: 629 TTP------------YQIEWAPDFEPYIVAHRD-LPRYDTRFVGFGWNK 664
            +P            Y++ +   FEPYIV  R+  P+++T F G  ++K
Sbjct: 321 YSPIRYDIWKVAKELYKVNFHKQFEPYIVMQRETTPKFNTSFFGLYFDK 369


>gi|328865206|gb|EGG13592.1| putative glycosyltransferase [Dictyostelium fasciculatum]
          Length = 405

 Score =  106 bits (265), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 61/219 (27%), Positives = 112/219 (51%), Gaps = 31/219 (14%)

Query: 475 DGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYM-SDAEAQQFLSYAGNSEALRSRRN 533
           D + +T+V Q+S++RL+ +  +   W  PIS  +Y+ +D E     S   NS ++    +
Sbjct: 92  DHSKITIVTQVSVERLERIAWMADKWRAPISAAVYIKNDNEIHNVTSLINNSYSVTQFVD 151

Query: 534 IGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDM--- 590
           I  H+++     YP+NTLRN+++ + ST +V L+D DF+P   L+  +++ +  +++   
Sbjct: 152 I--HLLFANKTRYPVNTLRNLSIKKASTKWVLLMDADFIPPLDLHDTIQTHVDRLELDSD 209

Query: 591 HGHGGKKVLVVPAFETQRYR------------------------TAFPASHAPTNFSRWV 626
           H        VVP+F +   R                           P  H+PTN++RW 
Sbjct: 210 HNQNDLVSFVVPSFASSLPRFQLPDDKESFIQSVKSQTIVPTNLNVCPKCHSPTNYTRWY 269

Query: 627 NATTPYQIEWAPDFEPYIVAHRD-LPRYDTRFVGFGWNK 664
           N + PY++E+   +EP++V   D +  +D R  G+G++K
Sbjct: 270 NESIPYEVEYQWIYEPFLVFRTDQVEPFDERLKGYGFDK 308


>gi|90079249|dbj|BAE89304.1| unnamed protein product [Macaca fascicularis]
          Length = 215

 Score =  106 bits (265), Expect = 4e-20,   Method: Composition-based stats.
 Identities = 45/74 (60%), Positives = 58/74 (78%)

Query: 5   EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
           ++PV+  CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A  IL TLF 
Sbjct: 128 QQPVVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFQ 187

Query: 65  TWSVPQVEVSFYLA 78
           TW VP V V   L+
Sbjct: 188 TWMVPAVRVDSLLS 201


>gi|281342976|gb|EFB18560.1| hypothetical protein PANDA_010466 [Ailuropoda melanoleuca]
          Length = 162

 Score =  105 bits (263), Expect = 6e-20,   Method: Composition-based stats.
 Identities = 44/69 (63%), Positives = 56/69 (81%)

Query: 5   EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
           ++PV+  CE I VAIVCAGYN++R +VTL+KS+LF+R+NPLHFHLI D++A  IL TLF 
Sbjct: 93  QQPVVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHRRNPLHFHLIADSIAEQILATLFR 152

Query: 65  TWSVPQVEV 73
           TW VP V V
Sbjct: 153 TWMVPAVRV 161


>gi|443691848|gb|ELT93598.1| hypothetical protein CAPTEDRAFT_227063 [Capitella teleta]
          Length = 463

 Score =  103 bits (258), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 70/230 (30%), Positives = 108/230 (46%), Gaps = 40/230 (17%)

Query: 468 DEYEASSDGN--DVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNS 525
           D ++A  + N   +TLV QLS  RL+   +    W GPIS+ LY+        +      
Sbjct: 143 DVFDAKINPNISQITLVTQLSHVRLERFALHAAKWPGPISMALYVPQGMLLPTMLAWCRF 202

Query: 526 EALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSI 585
           + L +R NI  HV+ ++G FYP+N LRNVA+  V T   +++D D +PM G+   L   +
Sbjct: 203 KELCTRDNIQLHVIVEKGIFYPVNYLRNVAVKYVKTDLTYVVDADVMPMPGMEAMLHEHL 262

Query: 586 RSMDMHGHGGKKVLVVPAFET------------------------------QRYRTAFPA 615
           +    H     +VLV+P FE+                              +RY+ A+  
Sbjct: 263 QR---HPLNTTQVLVIPTFESYDGSCCDTMPKTKHEMIKLWDQGVVGPFYFERYKPAY-- 317

Query: 616 SHAPTNFSRWVNATTPYQIEWAPDFEPYIVAH-RDLPRYDTRFVGFGWNK 664
             AP  + RW  AT  Y + +   FEPYI+ H + +P + T F G  ++K
Sbjct: 318 --APIRYQRWKVATEAYIVPFHKQFEPYIIMHTKRVPAFHTGFFGLYFDK 365


>gi|390347596|ref|XP_001200797.2| PREDICTED: glycosyltransferase-like protein LARGE2-like
           [Strongylocentrotus purpuratus]
          Length = 560

 Score =  102 bits (254), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 69/230 (30%), Positives = 110/230 (47%), Gaps = 43/230 (18%)

Query: 477 NDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYM--------SDAEAQQFLSYAGNSEAL 528
           +DVTL+   S+DR+  V  +   W+ P S++LY+         D E ++   +      L
Sbjct: 261 DDVTLITHASLDRIDGVLRMLNSWDAPCSISLYLPIKADERDEDIEWKKIYVFK-KMRIL 319

Query: 529 RSRRNIGYHVVYKEG--NFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIR 586
           ++R +I   +VY     + YPIN +RNVA++Q  T ++ LLD DFLP   L+ +    + 
Sbjct: 320 QTRADIDLTLVYSNTMEDIYPINYMRNVAISQSKTKFILLLDADFLPSPTLHKHFLDVMA 379

Query: 587 SMDMHGHGGKK-VLVVPAF---ETQRYRTAFPA--------------------------S 616
              +     +K   VVPAF   E ++ +   P+                          S
Sbjct: 380 VWQVSEQRDRKTAFVVPAFDYIEEKQDQAGLPSTKDELISLMKQDLPAVSIFRFYESPLS 439

Query: 617 HAPTNFSRWVNATTPYQIEWAPD-FEPYIVAHRD-LPRYDTRFVGFGWNK 664
           H PTN+ +W  A  PY+I +  D +EPY++  +  +P YD RF G+G NK
Sbjct: 440 HRPTNYVQWYMANEPYKISYYEDKYEPYLILQKSGMPMYDERFTGYGMNK 489


>gi|332267349|ref|XP_003282645.1| PREDICTED: glycosyltransferase-like protein LARGE1-like, partial
           [Nomascus leucogenys]
          Length = 79

 Score =  101 bits (251), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 49/80 (61%), Positives = 63/80 (78%), Gaps = 1/80 (1%)

Query: 273 VILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNV 332
           VILL L KLR + W   WR+ AE+ L+  L TSLADQDIFNA+I ++P+LVY LPC WNV
Sbjct: 1   VILLLLDKLRKMKWEQMWRLTAERELMGMLSTSLADQDIFNAVIKQNPFLVYQLPCFWNV 60

Query: 333 QLSDNTRSDELCYTELTDLK 352
           QLSD+TRS++ CY +++DLK
Sbjct: 61  QLSDHTRSEQ-CYRDVSDLK 79


>gi|428184713|gb|EKX53567.1| hypothetical protein GUITHDRAFT_160868 [Guillardia theta CCMP2712]
          Length = 389

 Score = 99.4 bits (246), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 66/199 (33%), Positives = 94/199 (47%), Gaps = 19/199 (9%)

Query: 478 DVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRNIGYH 537
           DVT V QL+ DR+ + + + + W GPIS  LY  D           + E L  R  + +H
Sbjct: 159 DVTWVTQLTTDRIPVFKKMLERWTGPISAALYSQDVSKD-----TKDIEDLLDR--VDFH 211

Query: 538 VVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHG-GK 596
           +V      YP+NTLRNVA+N+ +T ++ L D+DF+P  G+Y   KS +  +   G    K
Sbjct: 212 IVGASQGLYPVNTLRNVAINKANTDFIILADVDFVPSEGIYEATKSFVEKLASTGAAHDK 271

Query: 597 KVLVVPAFETQRYRTAFPAS-----------HAPTNFSRWVNATTPYQIEWAPDFEPYIV 645
           KV V+PAFE        P +           H   N      A    Q +       Y  
Sbjct: 272 KVYVLPAFEIDGADQTVPPNFDELQKMGSRIHQVHNDKGRDIAHKYTQYDKRQIHTEYPT 331

Query: 646 AHRDLPRYDTRFVGFGWNK 664
           A R +PR+D RF G+G +K
Sbjct: 332 APRTIPRFDVRFFGYGNDK 350


>gi|66809567|ref|XP_638506.1| hypothetical protein DDB_G0284567 [Dictyostelium discoideum AX4]
 gi|74854201|sp|Q54PG8.1|GNT14_DICDI RecName: Full=Glycosyltransferase-like protein gnt14
 gi|60467115|gb|EAL65152.1| hypothetical protein DDB_G0284567 [Dictyostelium discoideum AX4]
          Length = 640

 Score = 99.0 bits (245), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 61/212 (28%), Positives = 105/212 (49%), Gaps = 28/212 (13%)

Query: 479 VTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRNIGYHV 538
           VT+V Q ++DRL  V  + + W  PIS++L++        L  A ++  + +     +H+
Sbjct: 348 VTIVTQTTVDRLYKVAAMAERWRSPISVSLFIKSQGDIDKLEKAISANKILATF-ADFHL 406

Query: 539 VYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKV 598
            Y     YP+N LRN+A+    T +V LLD+DF+P   L+ ++   I    ++       
Sbjct: 407 FYHNNTRYPVNNLRNLAIRNALTEHVLLLDVDFIPPSHLHDHIAHYINLNYLNQDDSLNA 466

Query: 599 LVVPAFETQRYRTAFPAS------------------------HAPTNFSRWVNATTPYQI 634
            V+P+F +       P S                        H+PT+++RW+ +T PY I
Sbjct: 467 FVIPSFSSNLQPKDIPDSKFEFIEMLTKNKIEPSNLKVCPKCHSPTDYTRWMTSTEPYAI 526

Query: 635 EWAPDFEPYIVAHRD--LPRYDTRFVGFGWNK 664
           E+   +EP++V +R   LP +D RF G+G++K
Sbjct: 527 EYHWIYEPFLVYNRSQTLP-FDERFKGYGFDK 557


>gi|358336370|dbj|GAA54896.1| glycosyltransferase-like protein LARGE [Clonorchis sinensis]
          Length = 399

 Score = 97.1 bits (240), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 57/175 (32%), Positives = 92/175 (52%), Gaps = 11/175 (6%)

Query: 182 VSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTI 241
           V+ IP ++++G   L KL  P +LPE + K I LD+DV+F  ++  LW  F + +  Q  
Sbjct: 157 VASIPTRYFAGTPPLYKLLAPYILPEDVTKVIALDSDVLFNYNVLDLWKEFGQFKPGQAF 216

Query: 242 GLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTR 301
           G V  Q  +      K   P+P    G NTGV+L+DL +LR I+W   WR    K +   
Sbjct: 217 GAVCEQITFCPYSCPKEDLPFPTF--GINTGVLLMDLAELRRINWWQLWRQELLKEVNEW 274

Query: 302 LWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHW 356
            +    +Q + + ++S++ +L + L C+WN+Q+    RSD +      +   IHW
Sbjct: 275 GYLPAGEQKVISQVMSKYNHLYHRLRCEWNIQI---FRSDGM------NCCPIHW 320



 Score = 73.9 bits (180), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 42/145 (28%), Positives = 69/145 (47%), Gaps = 2/145 (1%)

Query: 25  NSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQVEVSFYLADSVVED 84
           N   ++V   + +  YR  P+  H + D  A N + ++F  W    V +  Y  +  +  
Sbjct: 99  NRPSNIVCKHQPLEAYR--PVRLHWVADIDARNRILSIFEEWQPTDVRIFLYEYERYLPY 156

Query: 85  VSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTI 144
           V+ IP ++++G   L KL  P +LPE + K I LD+DV+F  ++  LW  F + +  Q  
Sbjct: 157 VASIPTRYFAGTPPLYKLLAPYILPEDVTKVIALDSDVLFNYNVLDLWKEFGQFKPGQAF 216

Query: 145 GLVENQSDWYLGKLWKNHKPWPALG 169
           G V  Q  +      K   P+P  G
Sbjct: 217 GAVCEQITFCPYSCPKEDLPFPTFG 241


>gi|260801759|ref|XP_002595763.1| hypothetical protein BRAFLDRAFT_117562 [Branchiostoma floridae]
 gi|229281010|gb|EEN51775.1| hypothetical protein BRAFLDRAFT_117562 [Branchiostoma floridae]
          Length = 554

 Score = 96.7 bits (239), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 75/234 (32%), Positives = 106/234 (45%), Gaps = 50/234 (21%)

Query: 477 NDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQ-----QFLSYAGNSEALRSR 531
           + VTLV Q+SMDR+ M++ + +HW+GP+++ LY+     +      +  Y  N +    +
Sbjct: 253 DTVTLVTQVSMDRIAMLDRIIRHWDGPMAVALYVPTTSQKPDTNHDWKRYYINKKLRHPK 312

Query: 532 RNIGYHVVYKEGNF----YPINTLRNVALNQVSTPYVFLLDIDFLPM------FGLYPYL 581
                 V+    N     YPIN LRN+AL  V + YVFLLD DF+P       F +YP  
Sbjct: 313 FRTFCSVLAVYANVVNDEYPINYLRNLALRMVKSKYVFLLDADFIPSPNFQDNFCVYP-- 370

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFE-----------------TQR----------YRTA-F 613
              +   D      +   VVPAFE                  Q+          +R+   
Sbjct: 371 --RVLINDQSDPTDRTAYVVPAFEFLEEQEDQPPPQDKEALKQQIFADDANILPFRSVES 428

Query: 614 PASHAPTNFSRWVNATTPYQIEWAPD-FEPYIVAHR--DLPRYDTRFVGFGWNK 664
           P SH PT++ RW      Y +    D FEPY+V      LP YD RF G+G NK
Sbjct: 429 PESHRPTDYQRWYAEEQVYPVSTYQDKFEPYLVLRMSPSLPLYDERFSGYGMNK 482


>gi|330799891|ref|XP_003287974.1| hypothetical protein DICPUDRAFT_152182 [Dictyostelium purpureum]
 gi|325081998|gb|EGC35495.1| hypothetical protein DICPUDRAFT_152182 [Dictyostelium purpureum]
          Length = 460

 Score = 96.3 bits (238), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 57/210 (27%), Positives = 105/210 (50%), Gaps = 25/210 (11%)

Query: 478 DVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSD-AEAQQFLSYAGNSEALRSRRNIGY 536
           ++T+V Q ++DR+  + M+   W  PIS  +Y+ + ++         NS ++    +I  
Sbjct: 118 EITIVTQFTIDRMDRIAMMADIWRAPISAAVYIRNQSDIDSVFKLVRNSFSVAQFVDI-- 175

Query: 537 HVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDM------ 590
           H +Y     YP+N LRN+AL    T +  LLD+DF+   G+Y YL+S ++ +D       
Sbjct: 176 HFLYSNQTRYPVNNLRNLALVNARTDWTLLLDVDFVTNEGMYEYLQSLLKEIDKNKLVSF 235

Query: 591 -----------HGHGGKKVLVVPAFETQRY----RTAFPASHAPTNFSRWVNATTPYQIE 635
                      +     KV ++   E        +   P  H PT++SRW +A  PY+++
Sbjct: 236 IIPSFSSSISHYDFPTGKVDLIQMVEAGDIKIINKNVCPRCHGPTDYSRWFSAVEPYKVD 295

Query: 636 WAPDFEPYIVAHRD-LPRYDTRFVGFGWNK 664
           +   +EP+++ ++  +  YD R  G+G++K
Sbjct: 296 YRWLYEPFLLYNKSQIELYDERLKGYGFDK 325


>gi|443732711|gb|ELU17338.1| hypothetical protein CAPTEDRAFT_85145, partial [Capitella teleta]
          Length = 241

 Score = 95.9 bits (237), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 65/210 (30%), Positives = 100/210 (47%), Gaps = 36/210 (17%)

Query: 482 VAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRNIGYHVVYK 541
           V QL++ RL     +   W+GP+S  +++ D      ++   +++ L  R N+  H++ K
Sbjct: 1   VTQLTLSRLGSFNEVLMRWQGPVSAAIHVEDP-VLNVINAVCHNQVLTERGNVYIHLMRK 59

Query: 542 EGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVV 601
            G  YP+N LRN+AL+ V+T YVFL D DF+P   LY  + + + S D   +  K+  VV
Sbjct: 60  IGKLYPVNQLRNLALSSVTTDYVFLTDADFIPNKDLYKSMLNVVASSDT--NRTKRAYVV 117

Query: 602 PAF------------------ETQR--------------YRTAFPASHAPTNFSRWVNAT 629
            AF                  ET+               + T F   H  TN++RW  AT
Sbjct: 118 AAFGVFEEKEKNKPAKFPDFPETRETLLQLWDSKAIEPVHFTKFIHGHYQTNYTRWRKAT 177

Query: 630 TPYQIEWAPDFEPYIVAHR-DLPRYDTRFV 658
             Y I W   +EPY+V    D+P Y+  F+
Sbjct: 178 ELYPITWFHHYEPYVVVRTIDVPLYEGAFL 207


>gi|358336371|dbj|GAA54897.1| glycosyltransferase-like protein LARGE [Clonorchis sinensis]
          Length = 400

 Score = 95.9 bits (237), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 62/209 (29%), Positives = 97/209 (46%), Gaps = 21/209 (10%)

Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
            Y  +  V  V  IP  HY+G   + K+ +P +LPE + K IVLD+DVIF  ++  LW  
Sbjct: 146 LYDYERFVSYVEAIPINHYAGTPAMYKILVPYILPEEVTKVIVLDSDVIFNHNVLDLWKE 205

Query: 232 FSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWR 291
           F   + +Q  G    Q  +       +    P    G N+GV+L++LT+LR I+W   W+
Sbjct: 206 FELFKPQQVFGAACEQITYCPHSCSSDDTNIPLF--GINSGVVLMNLTELRRINWWYQWQ 263

Query: 292 IIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQL-------------SDNT 338
               K +    +  +A+Q + N I+  +  L Y L C+WNVQ+              D  
Sbjct: 264 QEVLKEVTEFGYLPIAEQKVINEIMRRNIKLYYQLGCEWNVQVFKADGIQCCPVQWVDRL 323

Query: 339 RSDELCYTELTD------LKIIHWNSPKK 361
             +  C   L D       +I+H+N  +K
Sbjct: 324 PHEMYCKNTLDDANPREPARIVHYNIKRK 352



 Score = 72.0 bits (175), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 35/107 (32%), Positives = 53/107 (49%)

Query: 44  PLHFHLITDTVALNILQTLFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLT 103
           P+  H + D  A + +  +F  W    V +  Y  +  V  V  IP  HY+G   + K+ 
Sbjct: 115 PVRLHWVADVAARDRIIYIFEQWHPSDVHIFLYDYERFVSYVEAIPINHYAGTPAMYKIL 174

Query: 104 LPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQ 150
           +P +LPE + K IVLD+DVIF  ++  LW  F   + +Q  G    Q
Sbjct: 175 VPYILPEEVTKVIVLDSDVIFNHNVLDLWKEFELFKPQQVFGAACEQ 221


>gi|443710196|gb|ELU04494.1| hypothetical protein CAPTEDRAFT_221047 [Capitella teleta]
          Length = 304

 Score = 95.9 bits (237), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 65/202 (32%), Positives = 92/202 (45%), Gaps = 37/202 (18%)

Query: 495 MLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRNIGYHVVYKEGNFYPINTLRNV 554
           ML +HW GPIS ++Y+S+ +    +    N     +  NI  HV++  G+ YP N LRN 
Sbjct: 1   MLAQHWSGPISASVYVSNIQDSSMID---NLCQFSTASNIFIHVLHAVGDIYPANILRNF 57

Query: 555 ALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFE------TQR 608
           AL +V T YVFL D+DF+P F LY      I+  + H +      VV AFE      T  
Sbjct: 58  ALEEVFTEYVFLSDVDFMPDFSLYNASVDFIQHNERHLNSS--AFVVAAFEILNDSKTWG 115

Query: 609 YRTAFPAS-------------------------HAPTNFSRWVNATTPYQIEWAPDFEPY 643
               FP +                         H  TN+ +W  A   Y + W P +EPY
Sbjct: 116 QPVQFPRNKTELIDLWGAQKLQPFHFSKTKNYGHIETNYEKWTVAHEIYAVTWMPYYEPY 175

Query: 644 IVAHRD-LPRYDTRFVGFGWNK 664
           ++     +PRY   FVG  ++K
Sbjct: 176 VIVRTSTVPRYHNAFVGRYFDK 197


>gi|358342270|dbj|GAA49775.1| glycosyltransferase-like protein LARGE [Clonorchis sinensis]
          Length = 398

 Score = 95.1 bits (235), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 55/171 (32%), Positives = 91/171 (53%), Gaps = 6/171 (3%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           P L   FY           IPN +++G  G+ KL +P +LP  + +TI +DTD ++  ++
Sbjct: 138 PYLECSFYNLAEYQSFAERIPNTNHAGAAGIAKLWVPYILPHWVIRTIFVDTDTLWNENV 197

Query: 226 AQLWALFSRLRQRQTIGLV-ENQS-DWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRD 283
             LW +F      Q +G+  E QS D +  +  +   P      G N+G++L+ L+++R 
Sbjct: 198 ETLWKIFDNFTSTQMLGIAWEQQSQDAHCRRSSETSLP----VTGVNSGLVLMHLSRMRV 253

Query: 284 ISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQL 334
            +W          FL  R   S ADQDI+N++++  P  +Y+LPC+WNVQ+
Sbjct: 254 ANWGLITEQHIVSFLAVRKHLSQADQDIYNSVLALKPEWLYSLPCEWNVQV 304



 Score = 70.9 bits (172), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 44/166 (26%), Positives = 76/166 (45%), Gaps = 29/166 (17%)

Query: 15  IQVAIVCAGYNSTRSLVTLIKSILFYR----------------------------KNPLH 46
           I +  +  G  S    VT++KS+L+++                            +NP+H
Sbjct: 56  IHITQIVKGKKSAIRSVTMLKSLLYFQNHLNSASQECLRVEPFETSPCFTELHPAQNPIH 115

Query: 47  FHLITDTVALNILQTLFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPK 106
            H IT+     +L++     S+P +E SFY           IPN +++G  G+ KL +P 
Sbjct: 116 VHFITEYDLWPVLKSKLGNCSLPYLECSFYNLAEYQSFAERIPNTNHAGAAGIAKLWVPY 175

Query: 107 VLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLV-ENQS 151
           +LP  + +TI +DTD ++  ++  LW +F      Q +G+  E QS
Sbjct: 176 ILPHWVIRTIFVDTDTLWNENVETLWKIFDNFTSTQMLGIAWEQQS 221


>gi|156602774|ref|XP_001618715.1| hypothetical protein NEMVEDRAFT_v1g224874 [Nematostella vectensis]
 gi|156200021|gb|EDO26615.1| predicted protein [Nematostella vectensis]
          Length = 158

 Score = 94.4 bits (233), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 39/62 (62%), Positives = 50/62 (80%)

Query: 8   VIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWS 67
            +P CEVI +A+VCAG++++R  VTLIKS+LFYR NPLHFH ++D+VA  IL TLF TW 
Sbjct: 72  AVPQCEVIHIAVVCAGHSASRQAVTLIKSLLFYRHNPLHFHFVSDSVAELILGTLFRTWG 131

Query: 68  VP 69
           VP
Sbjct: 132 VP 133


>gi|328869315|gb|EGG17693.1| putative glycosyltransferase [Dictyostelium fasciculatum]
          Length = 407

 Score = 93.2 bits (230), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 62/232 (26%), Positives = 104/232 (44%), Gaps = 51/232 (21%)

Query: 479 VTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSR----RNI 534
           VT+V Q ++DR++ V  + K W+GPIS  +++ + +    L      + LR        +
Sbjct: 124 VTIVTQATVDRIERVVAMAKKWKGPISTAVFIKNPKDIPQLK-----KLLRKHWILAEYV 178

Query: 535 GYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSM------ 588
             H++Y     YP+N LRN+++    + YV L+D DF+P  G+  Y+ + I ++      
Sbjct: 179 DIHLLYANHTRYPVNNLRNLSIKYAQSDYVLLMDADFVPHLGMNEYVSAQIETLLQQQQQ 238

Query: 589 -----------DMHGHGGKKVLVVPAF---------------------ETQRYRTAF--- 613
                      +      K   VVP+F                     E + Y T     
Sbjct: 239 QQQNKHPLSSHEQQQQEEKIAFVVPSFASSENPNHLPNDKQSLLKMIKENKVYPTNLNVC 298

Query: 614 PASHAPTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLP-RYDTRFVGFGWNK 664
           P  H PTN+SRW      YQ+E+   +EPY++ +R     +D R  G+G++K
Sbjct: 299 PKCHTPTNYSRWYETDDLYQVEYRWIYEPYLIFNRSTCLAFDERLKGYGFDK 350


>gi|443682653|gb|ELT87166.1| hypothetical protein CAPTEDRAFT_200998 [Capitella teleta]
          Length = 201

 Score = 92.8 bits (229), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 56/159 (35%), Positives = 85/159 (53%), Gaps = 14/159 (8%)

Query: 473 SSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEAL---- 528
           S   ND+TLV QL+ DR+  + ++   W+GPIS ++Y  D E    LS  G    L    
Sbjct: 42  SQSSNDITLVTQLTSDRIDRLLLIMNIWKGPISASVY-PDTE----LSIRGEDICLLKKQ 96

Query: 529 RSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSM 588
            SR  +  H+V K G F+P+N LRN+AL+   T +VFL D+DF+P   LY   +++++  
Sbjct: 97  DSRCRVQLHLVQKSGVFFPVNKLRNIALDMAVTSHVFLTDVDFIPDQNLY---ENALQQ- 152

Query: 589 DMHGHGGKKVLVVPAFETQRYRTAFPASHAPTNFSRWVN 627
            +H   G + LV+PAFE    +        P+     +N
Sbjct: 153 -LHSLQGMQSLVIPAFEMIEMKNLVRKEDLPSGKPELLN 190


>gi|358342269|dbj|GAA49774.1| glycosyltransferase-like protein LARGE [Clonorchis sinensis]
          Length = 289

 Score = 91.7 bits (226), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 64/192 (33%), Positives = 92/192 (47%), Gaps = 26/192 (13%)

Query: 161 NHKPWPALGR------------GFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPET 208
           +HK WP L               FY  ++  +    IPN H+SG   + KL  P +LP  
Sbjct: 12  DHKLWPLLKSHLYRCELSYLKCSFYSLENYYDFAGRIPNTHHSGPTAMAKLWTPYILPHW 71

Query: 209 LAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLV-ENQSDWYLGKLWKNHKPWPAL-G 266
           L K I++D+D ++  +I  LW  F R    Q IG+  E QSD     L  N      +  
Sbjct: 72  LEKIILIDSDTLWNENIYTLWNHFDRFNSTQMIGIAWEQQSD----DLHCNRSSTMVIPS 127

Query: 267 RGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTS----LADQDIFNAIISEHPYL 322
            G N G++L+ L ++R++ W      I+E  ++ RL        +DQ I+N IIS  P  
Sbjct: 128 EGVNAGLVLMHLWRMRNVRWDA----ISELLVVARLERDKMLYQSDQSIYNDIISLKPEW 183

Query: 323 VYTLPCQWNVQL 334
            Y L C+WNVQL
Sbjct: 184 YYPLACEWNVQL 195



 Score = 70.9 bits (172), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 37/112 (33%), Positives = 59/112 (52%), Gaps = 1/112 (0%)

Query: 42  KNPLHFHLITDTVALNILQTLFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLK 101
           +NP+H H + D     +L++      +  ++ SFY  ++  +    IPN H+SG   + K
Sbjct: 2   QNPIHIHFVADHKLWPLLKSHLYRCELSYLKCSFYSLENYYDFAGRIPNTHHSGPTAMAK 61

Query: 102 LTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLV-ENQSD 152
           L  P +LP  L K I++D+D ++  +I  LW  F R    Q IG+  E QSD
Sbjct: 62  LWTPYILPHWLEKIILIDSDTLWNENIYTLWNHFDRFNSTQMIGIAWEQQSD 113


>gi|358332611|dbj|GAA51245.1| glycosyltransferase-like protein LARGE [Clonorchis sinensis]
          Length = 313

 Score = 91.7 bits (226), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 55/165 (33%), Positives = 79/165 (47%), Gaps = 28/165 (16%)

Query: 10  PTCEVIQVAIVCAGYNSTRSLVTLIKSILFYR-----KNP-------------------- 44
           PT   I +  V  G  +    +TL+KS+L+YR     K P                    
Sbjct: 76  PTPHTIHLVHVIKGEATLARSITLLKSVLYYRGSLRSKRPECHLRRIPSPHLCAKQRAIE 135

Query: 45  ---LHFHLITDTVALNILQTLFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLK 101
              +H HL+ +   L+   +  S W V ++  + Y  D  +  V+WIPNKH SGV  L K
Sbjct: 136 QIAIHLHLVVEQRLLSAAYSYVSQWRVARLNWTLYELDPYLPRVNWIPNKHGSGVAALSK 195

Query: 102 LTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGL 146
           + +P +LP  + K IVLDTD++F  DI +LW LF    Q Q  G+
Sbjct: 196 MLIPDMLPTNVTKAIVLDTDILFNADILELWKLFDDFDQSQKNGV 240



 Score = 79.3 bits (194), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 45/128 (35%), Positives = 64/128 (50%), Gaps = 26/128 (20%)

Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
            Y  D  +  V+WIPNKH SGV  L K+ +P +LP  + K IVLDTD++F  DI +LW L
Sbjct: 169 LYELDPYLPRVNWIPNKHGSGVAALSKMLIPDMLPTNVTKAIVLDTDILFNADILELWKL 228

Query: 232 FSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWR 291
           F    Q Q                 KN         G N+GV+L+ L+++R+  W+  + 
Sbjct: 229 FDDFDQSQ-----------------KN---------GVNSGVVLMHLSRMREAKWSELFD 262

Query: 292 IIAEKFLL 299
               K L+
Sbjct: 263 WAVRKVLI 270


>gi|384252903|gb|EIE26378.1| hypothetical protein COCSUDRAFT_64466 [Coccomyxa subellipsoidea
           C-169]
          Length = 389

 Score = 90.5 bits (223), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 68/250 (27%), Positives = 107/250 (42%), Gaps = 65/250 (26%)

Query: 475 DGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSR--- 531
           D ND+T++ Q S+DRL M+E     W+G  S+ +Y        ++  + N++ LR     
Sbjct: 45  DKNDITIITQCSLDRLYMLEEQCNTWQGLTSVAVYYP-----LYIFQSTNADRLREAITT 99

Query: 532 -RNIGYHVVYKEGNF------------------YPINTLRNVALNQVSTPYVFLLDIDFL 572
             N    +    G                    YP N LRN AL +  T  + LLD DFL
Sbjct: 100 VENFHEQIDRSHGKCRMDIVLLSEVIKPHDEWSYPYNALRNHALVRSETDVLLLLDADFL 159

Query: 573 PMFGLYPYLKSSIRSMDMHGH--GGKKVLVVPAFETQR---------------------- 608
              G++  L ++ ++  +       KK++V+PAFET R                      
Sbjct: 160 VSEGVHEALTATDKAAALMEDLTANKKIIVLPAFETPRTLSEETGKSYALQAQASTKAEL 219

Query: 609 -----------YRTAFPASHAPTNFSRWVNA--TTPYQIEWAPDFEPYI-VAHRDLPRYD 654
                      +   +P  H  +N+ +W ++  T PY I +   +EPY+ V  R +P YD
Sbjct: 220 KSMFDEGKIIQFADFYPVGHGSSNYKKWFSSEGTEPYPITYKTGYEPYVLVNRRTVPWYD 279

Query: 655 TRFVGFGWNK 664
            RF G+GW+K
Sbjct: 280 ERFRGYGWDK 289


>gi|443734790|gb|ELU18647.1| hypothetical protein CAPTEDRAFT_226416 [Capitella teleta]
          Length = 461

 Score = 90.5 bits (223), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 69/241 (28%), Positives = 106/241 (43%), Gaps = 53/241 (21%)

Query: 477 NDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRNIGY 536
           +DVTLV Q S+DRL     + + W G IS ++Y+S +    FL    +   L+ RRNI  
Sbjct: 145 DDVTLVTQFSVDRLDRFYEIAQLWPGTISASVYVSRSHVLAFLQQWRSLPHLKLRRNILL 204

Query: 537 HVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKS-----------SI 585
           H+V + G  YP N LRN+A+      +++L D+D LPM GL   L+S            I
Sbjct: 205 HMVTQTGPIYPSNALRNIAVKNSVGRFLYLTDVDILPMKGLQSRLRSHLDNSFAKSKKRI 264

Query: 586 RSMD--------------MHGHGGKKVLV-VPAFETQR---------------------- 608
           +++D              + G   K     VP FE                         
Sbjct: 265 QNVDDLVAKTHCWVLTTVIMGVAAKGAAAYVPTFERVHREGRGILPEDKAEVLQQWDAGR 324

Query: 609 ----YRTAFPASHAPTNFSRWVNATTPYQIEWAPDFEPYIVAHRD-LPRYDTRFVGFGWN 663
               Y T +  ++ P  +++W  +   Y++++   FEPYIVA RD +  Y  R  G  ++
Sbjct: 325 IRPFYSTRYQPAYLPILYNKWRRSNAEYKVKYYRQFEPYIVAARDAVHNYSERIPGLYFD 384

Query: 664 K 664
           K
Sbjct: 385 K 385


>gi|358333649|dbj|GAA52132.1| glycosyltransferase-like protein LARGE2 [Clonorchis sinensis]
          Length = 323

 Score = 89.7 bits (221), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 54/173 (31%), Positives = 90/173 (52%), Gaps = 10/173 (5%)

Query: 166 PALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           P L   FY  +S V+    IPN H +G   + KL +P +LP  + KTIV+D+D+++  + 
Sbjct: 65  PYLKCSFYDVESYVDFAQRIPNGHIAGPSAVAKLWIPYILPPWVIKTIVIDSDLLWNENA 124

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDIS 285
             LW +F + +  Q IGL   Q  +    +    K  P   RG N G++L+ L ++R+I 
Sbjct: 125 LMLWKIFEQFKSTQMIGLAWEQQSYDPLCIEDPAKLLPR--RGVNGGLVLMHLERMRNIR 182

Query: 286 WAGFWRIIAEKFLLTRL----WTSLADQDIFNAIISEHPYLVYTLPCQWNVQL 334
               W I  E+ +  RL         +Q++++ +I   P   Y +PC+WN+Q+
Sbjct: 183 ----WDITIEQIIHQRLNQVGILGEGEQEVYSFLIHLKPDWYYGIPCEWNIQV 231



 Score = 76.6 bits (187), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 39/111 (35%), Positives = 65/111 (58%), Gaps = 1/111 (0%)

Query: 42  KNPLHFHLITDTVALNILQTLFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLK 101
           +N +H HL+ D    ++L++     S+P ++ SFY  +S V+    IPN H +G   + K
Sbjct: 38  QNIIHMHLVCDRNLWSMLESQLQNCSLPYLKCSFYDVESYVDFAQRIPNGHIAGPSAVAK 97

Query: 102 LTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLV-ENQS 151
           L +P +LP  + KTIV+D+D+++  +   LW +F + +  Q IGL  E QS
Sbjct: 98  LWIPYILPPWVIKTIVIDSDLLWNENALMLWKIFEQFKSTQMIGLAWEQQS 148


>gi|26350809|dbj|BAC39041.1| unnamed protein product [Mus musculus]
          Length = 379

 Score = 89.7 bits (221), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 72/264 (27%), Positives = 106/264 (40%), Gaps = 79/264 (29%)

Query: 475 DGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQF---LSYAGNSEALRSR 531
           D NDV L    S+D L  +  L + WEGP+S++++ +  E  Q    L+YA +S     R
Sbjct: 91  DPNDVILATHASVDNLLHLSGLLERWEGPLSVSVFAATKEEAQLATVLAYALSSHCPEMR 150

Query: 532 RNIGYHVVY------------KEGNF------------------------------YPIN 549
             +  H+V             + G F                              YP N
Sbjct: 151 ARVAMHLVCPSRYEAAVPDPREPGEFALLRSCQEVFDKLARVAQPGINYALGTNTSYPNN 210

Query: 550 TLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRY 609
            LRN+A  + +  Y  ++D+D +P  GL+  L+     +D   H     LVVPAFE +R 
Sbjct: 211 LLRNLAREEAN--YALVIDVDMVPSEGLWRGLREM---LDQSNHWDGTALVVPAFEIRRS 265

Query: 610 RTA-----------------------FPASHAPTNFSRWVNATT------PYQIEWAPDF 640
           R                             HAPTN+SRWVN          Y + W   +
Sbjct: 266 RRMPMNKNELVQLYQVGEVRPFYYGLCTPCHAPTNYSRWVNLPEESLLRPAYVVPWRDPW 325

Query: 641 EPYIVAHRDLPRYDTRFVGFGWNK 664
           EP+ VA   +P +D RF  +G+N+
Sbjct: 326 EPFYVAGGKVPTFDERFRQYGFNR 349


>gi|443719771|gb|ELU09781.1| hypothetical protein CAPTEDRAFT_220854 [Capitella teleta]
          Length = 512

 Score = 89.7 bits (221), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 55/174 (31%), Positives = 92/174 (52%), Gaps = 5/174 (2%)

Query: 445 DVTLVAQLSMDRLQMVEINPNNVDEYEASSDGN-DVTLVAQLSMDRLQMVEMLFKHWEGP 503
           D   + +L+ +  +   +  +N+ +  A  DG+  V+ V   S DRL     + +HW G 
Sbjct: 84  DADKICKLTEEESEYDRVVLDNILDSGAPDDGDLRVSYVTHFSYDRLDKFLAVVQHWPGY 143

Query: 504 ISLTLYMSDAEAQQFLSYAGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPY 563
           IS  + +   + ++F++       L +R NIG H+V ++G FYP N  RN+AL++  T +
Sbjct: 144 ISAAVQIKADKVEEFVNNYCREPELAARDNIGIHLVIQKGIFYPYNHYRNIALDRAPTSH 203

Query: 564 VFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFE-TQRYRTAFPAS 616
           VF+ DID +P   L    + S++       G K+VLVVP+FE T+ +    P S
Sbjct: 204 VFVTDIDLVPSDDL---ERRSLQHFKEQKVGEKQVLVVPSFEITKEHGYTVPRS 254



 Score = 78.6 bits (192), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 48/146 (32%), Positives = 72/146 (49%), Gaps = 29/146 (19%)

Query: 545 FYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAF 604
           FYP N  RN+AL++  T +VF+ DID +P   L    + S++       G K+VLVVP+F
Sbjct: 298 FYPYNHYRNIALDRAPTSHVFVTDIDLVPSDDL---ERRSLQHFKEQKVGEKQVLVVPSF 354

Query: 605 ETQR-------------------------YRTAFPASHAPTNFSRWVNATTPYQIEWAPD 639
           E  +                         Y T +  ++ P N++ W  AT PY +++   
Sbjct: 355 EITKEHGYSVPRSKEDLLEIWDRGELQEFYATRYAPAYMPLNYTVWRTATEPYWVKYYDQ 414

Query: 640 FEPYIVAHR-DLPRYDTRFVGFGWNK 664
           FEPY++A +  L  Y  RFVG  +NK
Sbjct: 415 FEPYVIAEKAKLQPYTNRFVGRFYNK 440


>gi|157822857|ref|NP_001099794.1| N-acetyllactosaminide beta-1,3-N-acetylglucosaminyltransferase
           [Rattus norvegicus]
 gi|392344822|ref|XP_003749084.1| PREDICTED: N-acetyllactosaminide
           beta-1,3-N-acetylglucosaminyltransferase-like [Rattus
           norvegicus]
 gi|149062023|gb|EDM12446.1| UDP-GlcNAc:betaGal beta-1,3-N-acetylglucosaminyltransferase 6
           (predicted) [Rattus norvegicus]
          Length = 415

 Score = 89.4 bits (220), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 74/265 (27%), Positives = 107/265 (40%), Gaps = 81/265 (30%)

Query: 475 DGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQF---LSYAGNSEALRSR 531
           D NDV L    S+D L  +  L + WEGP+S++++ +  E  Q    L+YA +S     R
Sbjct: 91  DPNDVILATHASVDNLLHLSGLLERWEGPLSVSVFAATREEAQLATVLAYALSSHCPEMR 150

Query: 532 RNIGYHVVY------------KEGNF------------------------------YPIN 549
             +  H+V             + G F                              YP N
Sbjct: 151 ARVAMHLVCPSRYEAAVPDPREPGEFALLRSCQEVFDKLARVAQPGINYALGTNISYPNN 210

Query: 550 TLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSM-DMHGHGGKKVLVVPAFETQR 608
            LRN+A  + +  Y  ++D+D +P  GL+     S+R M D   H     LVVPAFE +R
Sbjct: 211 LLRNLAREEAN--YALVIDVDMVPSEGLW----RSLREMLDQSNHWDGTALVVPAFEIRR 264

Query: 609 YRTA-----------------------FPASHAPTNFSRWVNATT------PYQIEWAPD 639
            R                             HAPTN+SRWVN          Y + W   
Sbjct: 265 ARRMPMNKNELVQLYQVGEVRPFYYGLCTPCHAPTNYSRWVNLPEESLLRPAYVVPWRDP 324

Query: 640 FEPYIVAHRDLPRYDTRFVGFGWNK 664
           +EP+ VA   +P +D RF  +G+N+
Sbjct: 325 WEPFYVAGGKVPTFDERFRQYGFNR 349


>gi|443703882|gb|ELU01224.1| hypothetical protein CAPTEDRAFT_189662, partial [Capitella teleta]
          Length = 265

 Score = 89.4 bits (220), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 53/137 (38%), Positives = 76/137 (55%), Gaps = 14/137 (10%)

Query: 473 SSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEAL---- 528
           S   ND+TLV QL+ DR+  + ++   W+GPIS ++Y  D E    LS  G    L    
Sbjct: 107 SQSSNDITLVTQLTSDRIDRLLLIMNIWKGPISASVY-PDTE----LSIRGEEICLLKKQ 161

Query: 529 RSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSM 588
            SR  +  H+V K G F+P+N LRN+AL+   T +VFL D+DF+P   LY      + S+
Sbjct: 162 DSRCRVQLHLVQKSGVFFPVNKLRNIALDMAVTSHVFLTDVDFIPDQNLYANALQQLHSL 221

Query: 589 DMHGHGGKKVLVVPAFE 605
            +      + LV+PAFE
Sbjct: 222 QV-----MQSLVIPAFE 233


>gi|384486166|gb|EIE78346.1| hypothetical protein RO3G_03050 [Rhizopus delemar RA 99-880]
          Length = 402

 Score = 89.4 bits (220), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 65/230 (28%), Positives = 103/230 (44%), Gaps = 48/230 (20%)

Query: 478 DVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDA----EAQQFLSYAGNSEALRS--- 530
           DVTL++Q S++RL+    + + W+GPIS+ +Y++      E  QFL    N E   +   
Sbjct: 84  DVTLISQFSVNRLKSFIQVIEAWQGPISIAIYLTQPDDIDELIQFLKIPKNLEIYSALTI 143

Query: 531 ---RRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRS 587
              + N G+    +E   YPIN LRN+A+ + ST ++F+ D DF+P   LY +++S +  
Sbjct: 144 TLVKPNYGH----QEHLAYPINHLRNIAITESSTEFIFVTDADFVPSSNLYTFIRSRLVP 199

Query: 588 MDMHGHGGKK--VLVVPAFETQRYRTAFP-----------------------ASHAPTNF 622
             ++  G  +    V+P F         P                       A H PT  
Sbjct: 200 YVIYQSGKLQPTAWVIPCFAIHEAYAQLPIPNTYNELRKLVGQGVAYITDPGAGHGPT-L 258

Query: 623 SRWVNATTP--------YQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           +  +    P        Y++ +   +EPY V HR  P YD RF   G +K
Sbjct: 259 ATEIAMVRPLLMGNPLAYEVCYESQWEPYYVLHRSAPLYDVRFRNQGGDK 308


>gi|157111360|ref|XP_001651502.1| hypothetical protein AaeL_AAEL015300 [Aedes aegypti]
 gi|108868339|gb|EAT32564.1| AAEL015300-PA [Aedes aegypti]
          Length = 331

 Score = 89.4 bits (220), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 54/166 (32%), Positives = 82/166 (49%), Gaps = 18/166 (10%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +LP  +   + +D+D IF + + +LW++F      Q  G+     D   G  W 
Sbjct: 106 RLFLPSLLPH-IDSVLYVDSDTIFLSPVQELWSMFRNFNASQFAGMAPEHEDKNAG--WY 162

Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAI 315
           N     P  G  G N+GV+L++LT++RD  W      I +++ L  +W    DQDI N +
Sbjct: 163 NRFARHPYYGDLGVNSGVMLMNLTRMRDFKWEEHIMPIYKEYRLKLVW---GDQDIINIL 219

Query: 316 ISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTD----LKIIHWN 357
              HP ++Y  PC WN       R+D   Y  + D    +KIIH N
Sbjct: 220 FHHHPDMLYIFPCDWNY------RADHCMYMSVCDAPDGVKIIHGN 259


>gi|428181260|gb|EKX50124.1| hypothetical protein GUITHDRAFT_103938 [Guillardia theta CCMP2712]
          Length = 577

 Score = 89.4 bits (220), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 61/213 (28%), Positives = 103/213 (48%), Gaps = 37/213 (17%)

Query: 478 DVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRNIGYH 537
           DV+ V QL+ DRL   + +        S     + +E Q+ L        LR R  + +H
Sbjct: 181 DVSWVTQLTSDRLDGFKAMMARRANSCS----SASSELQRDLDLI---RPLRDR--VDFH 231

Query: 538 VVYKE---GNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHG 594
           +V  +   G  YP+NTLRN+A+++  T ++ L+D DF+P   +Y  L   +      G  
Sbjct: 232 LVLADLHPGRLYPVNTLRNLAMDKCRTQWMVLVDADFVPNDDVYDLLLPRLEPWGKTG-- 289

Query: 595 GKKVLVVPAFETQRYRTAFPA-----------------------SHAPTNFSRWVNATTP 631
           G ++ V+PAF+ Q      P                        +H  T++ RW  ++  
Sbjct: 290 GNEIYVLPAFQMQDKEKGIPRNKMELLSLGPAVSQVHPEKGRDIAHKYTDYERWRVSSEI 349

Query: 632 YQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           Y+I++   +EPY VA++ +PRY++ FVG+G +K
Sbjct: 350 YEIQYRMPYEPYYVANKRIPRYESAFVGYGNDK 382


>gi|428183316|gb|EKX52174.1| hypothetical protein GUITHDRAFT_102076 [Guillardia theta CCMP2712]
          Length = 688

 Score = 89.4 bits (220), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 61/224 (27%), Positives = 105/224 (46%), Gaps = 39/224 (17%)

Query: 476 GNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDA-EAQQFLSYAGNSEALRSRRNI 534
           G+D+++   L++DRL  ++  F+ W GP+   +Y+ +  E ++ +     +  +  R+N 
Sbjct: 366 GDDISISTHLTVDRLDRLQSYFRRWAGPMVACIYIRNKLEEKELVDRFVKALDVEGRKN- 424

Query: 535 GYHVVY---KEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMH 591
            YH++    ++   YP N LRN+AL  V + +VFLLDID +P  G Y  L+S I  + + 
Sbjct: 425 -YHLILANARDATTYPANILRNLALETVKSTFVFLLDIDLIPDPGFYKSLQSQINVLRVM 483

Query: 592 GHGGKKVLVVPAFE----TQRYRTAFPA--------------------------SHAPTN 621
                 V  +PAF+         T FP                           + +  N
Sbjct: 484 SKN--HVFTIPAFQGVGAAASTETQFPVDKDDLMTLLRAAKIKPILSGKDEFWPAFSCLN 541

Query: 622 FSRWVNATTPYQIEWAPDFEPYIV-AHRDLPRYDTRFVGFGWNK 664
           +  W NAT  Y  ++    EPY++ A   +P +D RF+ +G +K
Sbjct: 542 YDAWYNATQSYLADYRWPCEPYLLGATSSMPFFDERFIHYGNDK 585


>gi|157128205|ref|XP_001655090.1| hypothetical protein AaeL_AAEL011072 [Aedes aegypti]
 gi|108872658|gb|EAT36883.1| AAEL011072-PA [Aedes aegypti]
          Length = 373

 Score = 89.0 bits (219), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 54/166 (32%), Positives = 82/166 (49%), Gaps = 18/166 (10%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +LP  +   + +D+D IF + + +LW++F      Q  G+     D   G  W 
Sbjct: 148 RLFLPSLLPH-IDSVLYVDSDTIFLSPVQELWSMFRNFNASQFAGMAPEHEDKNAG--WY 204

Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAI 315
           N     P  G  G N+GV+L++LT++RD  W      I +++ L  +W    DQDI N +
Sbjct: 205 NRFARHPYYGDLGVNSGVMLMNLTRMRDFKWEEHIMPIYKEYRLKLVW---GDQDIINIL 261

Query: 316 ISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTD----LKIIHWN 357
              HP ++Y  PC WN       R+D   Y  + D    +KIIH N
Sbjct: 262 FHHHPDMLYIFPCDWNY------RADHCMYMSVCDAPDGVKIIHGN 301


>gi|354494706|ref|XP_003509476.1| PREDICTED: N-acetyllactosaminide
           beta-1,3-N-acetylglucosaminyltransferase [Cricetulus
           griseus]
 gi|344243237|gb|EGV99340.1| N-acetyllactosaminide beta-1,3-N-acetylglucosaminyltransferase
           [Cricetulus griseus]
          Length = 415

 Score = 89.0 bits (219), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 72/264 (27%), Positives = 106/264 (40%), Gaps = 79/264 (29%)

Query: 475 DGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQF---LSYAGNSEALRSR 531
           D NDV L    S+D L  +  L + WEGP+S++++ +  E  Q    L+YA +S     R
Sbjct: 91  DPNDVILATHASVDNLLHLSGLLERWEGPLSVSVFAATKEEAQMATVLAYALSSHCPEMR 150

Query: 532 RNIGYHVVY------------KEGNF------------------------------YPIN 549
             +  H+V             + G F                              YP N
Sbjct: 151 ARVAMHLVCPSRYEAAVPDPREPGEFALLRSCQEVFDKLARVAQPGTNYALGTNISYPNN 210

Query: 550 TLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRY 609
            LRN+A  + +  Y  ++D+D +P  GL+  L+  +   D   H     LVVPAFE +R 
Sbjct: 211 LLRNLAREEAN--YALVIDVDMVPSEGLWRRLREML---DQSNHWDGMALVVPAFEIRRA 265

Query: 610 RTA-----------------------FPASHAPTNFSRWVNATT------PYQIEWAPDF 640
           R                             HAPTN+SRWVN          Y + W   +
Sbjct: 266 RRMPMNKNELVQLYQVGEVRPFYYGLCTPCHAPTNYSRWVNLPEESLLRPAYVVPWRDPW 325

Query: 641 EPYIVAHRDLPRYDTRFVGFGWNK 664
           EP+ VA   +P +D RF  +G+N+
Sbjct: 326 EPFYVAGGKVPTFDERFRQYGFNR 349


>gi|148701142|gb|EDL33089.1| UDP-GlcNAc:betaGal beta-1,3-N-acetylglucosaminyltransferase 1,
           isoform CRA_a [Mus musculus]
          Length = 415

 Score = 89.0 bits (219), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 72/264 (27%), Positives = 106/264 (40%), Gaps = 79/264 (29%)

Query: 475 DGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQF---LSYAGNSEALRSR 531
           D NDV L    S+D L  +  L + WEGP+S++++ +  E  Q    L+YA +S     R
Sbjct: 91  DPNDVILATHASVDNLLHLSGLLERWEGPLSVSVFAATKEEAQLATVLAYALSSHCPEMR 150

Query: 532 RNIGYHVV------------YKEGNF------------------------------YPIN 549
             +  H+V             + G F                              YP N
Sbjct: 151 ARVAMHLVCPSRYEAAVPDPREPGEFALLRSCQEVFDKLARVAQPGINYALGTNTSYPNN 210

Query: 550 TLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRY 609
            LRN+A  + +  Y  ++D+D +P  GL+  L+     +D   H     LVVPAFE +R 
Sbjct: 211 LLRNLAREEAN--YALVIDVDMVPSEGLWRGLREM---LDQSNHWDGTALVVPAFEIRRS 265

Query: 610 RTA-----------------------FPASHAPTNFSRWVNATT------PYQIEWAPDF 640
           R                             HAPTN+SRWVN          Y + W   +
Sbjct: 266 RRMPMNKNELVQLYQVGEVRPFYYGLCTPCHAPTNYSRWVNLPEESLLRPAYVVPWRDPW 325

Query: 641 EPYIVAHRDLPRYDTRFVGFGWNK 664
           EP+ VA   +P +D RF  +G+N+
Sbjct: 326 EPFYVAGGKVPTFDERFRQYGFNR 349


>gi|402578443|gb|EJW72397.1| hypothetical protein WUBG_16696, partial [Wuchereria bancrofti]
          Length = 144

 Score = 88.6 bits (218), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 42/97 (43%), Positives = 62/97 (63%), Gaps = 3/97 (3%)

Query: 478 DVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRNIGYH 537
           +V L+ Q SMDRL     L ++W GP+S  +Y++D+E    + +  ++  L +R N+  H
Sbjct: 48  NVVLITQFSMDRLTHFNALLQYWTGPVSAAVYVTDSELSLLIQFFDDT--LANRTNVALH 105

Query: 538 VVYKEGNFYPINTLRNVALNQVS-TPYVFLLDIDFLP 573
            VYKE  +YPIN LRNVALN  +   +VFL D+DF+P
Sbjct: 106 AVYKESTYYPINYLRNVALNSSNDASFVFLADVDFIP 142


>gi|30039682|ref|NP_780592.1| N-acetyllactosaminide beta-1,3-N-acetylglucosaminyltransferase [Mus
           musculus]
 gi|67460548|sp|Q8BWP8.1|B3GN1_MOUSE RecName: Full=N-acetyllactosaminide
           beta-1,3-N-acetylglucosaminyltransferase; AltName:
           Full=I-beta-1,3-N-acetylglucosaminyltransferase;
           Short=iGnT; AltName: Full=Poly-N-acetyllactosamine
           extension enzyme; AltName: Full=UDP-GlcNAc:betaGal
           beta-1,3-N-acetylglucosaminyltransferase 1
 gi|26341082|dbj|BAC34203.1| unnamed protein product [Mus musculus]
 gi|47124409|gb|AAH69927.1| B3gnt1 protein [Mus musculus]
 gi|74182782|dbj|BAE34720.1| unnamed protein product [Mus musculus]
 gi|148701143|gb|EDL33090.1| UDP-GlcNAc:betaGal beta-1,3-N-acetylglucosaminyltransferase 1,
           isoform CRA_b [Mus musculus]
          Length = 415

 Score = 88.6 bits (218), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 72/264 (27%), Positives = 106/264 (40%), Gaps = 79/264 (29%)

Query: 475 DGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQF---LSYAGNSEALRSR 531
           D NDV L    S+D L  +  L + WEGP+S++++ +  E  Q    L+YA +S     R
Sbjct: 91  DPNDVILATHASVDNLLHLSGLLERWEGPLSVSVFAATKEEAQLATVLAYALSSHCPEMR 150

Query: 532 RNIGYHVVY------------KEGNF------------------------------YPIN 549
             +  H+V             + G F                              YP N
Sbjct: 151 ARVAMHLVCPSRYEAAVPDPREPGEFALLRSCQEVFDKLARVAQPGINYALGTNTSYPNN 210

Query: 550 TLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRY 609
            LRN+A  + +  Y  ++D+D +P  GL+  L+     +D   H     LVVPAFE +R 
Sbjct: 211 LLRNLAREEAN--YALVIDVDMVPSEGLWRGLREM---LDQSNHWDGTALVVPAFEIRRS 265

Query: 610 RTA-----------------------FPASHAPTNFSRWVNATT------PYQIEWAPDF 640
           R                             HAPTN+SRWVN          Y + W   +
Sbjct: 266 RRMPMNKNELVQLYQVGEVRPFYYGLCTPCHAPTNYSRWVNLPEESLLRPAYVVPWRDPW 325

Query: 641 EPYIVAHRDLPRYDTRFVGFGWNK 664
           EP+ VA   +P +D RF  +G+N+
Sbjct: 326 EPFYVAGGKVPTFDERFRQYGFNR 349


>gi|326427667|gb|EGD73237.1| hypothetical protein PTSG_04953 [Salpingoeca sp. ATCC 50818]
          Length = 625

 Score = 87.8 bits (216), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 65/234 (27%), Positives = 105/234 (44%), Gaps = 46/234 (19%)

Query: 477 NDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSE-----ALRSR 531
           + +++V Q+S+DRL ++  L   W GP+S+ ++ SD            +       L+  
Sbjct: 318 HPLSIVTQVSLDRLSLLRYLPSTWSGPVSMAVHHSDDSDGGDGGRESAAAWVPAFVLKHL 377

Query: 532 RNIGYH-------VVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSS 584
           R +           + + G  YPIN LRN A+   +T +VF++D DF+P   L      +
Sbjct: 378 RKLAPTQPWAATLTLGRTGEPYPINALRNAAIEGAATDFVFIMDADFVPTSNLKDQFALA 437

Query: 585 IRSMDMHGHG-GKKVLVVPAFET-QRYRTAF----------------------------- 613
              +D +G    +   VVPAFE  Q  R +                              
Sbjct: 438 AGFVDGNGDSVARTAFVVPAFEMLQGDRASIGVGDLPKTMEQLVAAMEAGTVRPFRESVS 497

Query: 614 PASHAPTNFSRWVNATTPYQIEWAPD-FEPYIVAHR--DLPRYDTRFVGFGWNK 664
           P SHA T++S W +A TPY +++  D +EPY++  +   LP +D  F G+G NK
Sbjct: 498 PLSHAATDYSYWRHAYTPYTVDYQVDLYEPYVIVQKTNHLPLFDPNFTGYGMNK 551


>gi|198425664|ref|XP_002127967.1| PREDICTED: similar to MGC83717 protein [Ciona intestinalis]
          Length = 397

 Score = 87.4 bits (215), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 78/316 (24%), Positives = 132/316 (41%), Gaps = 48/316 (15%)

Query: 64  STWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDV- 122
           ST +  +   +FY A++   +V     +  +G +   +    K + +  A     D  V 
Sbjct: 51  STQTFTKESKAFYRAENHPINVKSDREEQIAGTFEGKQAKAAKDVAQKHADEFKNDYHVL 110

Query: 123 -IFATDIAQLWALF----------SRLRQRQTIGL---VENQSDWYLGKLWKNHKPWPAL 168
            IFA    +L   F          ++  +R+T  L    E +   ++ + +K H   P  
Sbjct: 111 FIFAHKNPKLEKKFTTAFGSMFKHAKFNERETFHLHFVCEKEGKEFVQQYFKEHVSHPKF 170

Query: 169 GRGFYLADSV---------VEDV-SWIPNKH----YSGVYGLLKLTLPKVLPETLAKTIV 214
               +  D++         V+ + + + N H     + ++  L + LP ++PE + + + 
Sbjct: 171 NLKIHFHDTIKLARQVHPLVQGIQAMLGNDHPYFSETAIF-FLSMVLPNIMPENIHRLVQ 229

Query: 215 LDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLW---------KNHKPWPAL 265
           LD D+ F T+I  LW+ F    +   IG+       Y   LW         K  +P+PA 
Sbjct: 230 LDLDLKFVTNIRDLWSEFDYFTKDTVIGIGNENQPVYRHLLWQYRNENPNTKVGEPYPAG 289

Query: 266 GRGFNTGVILLDLTKLRDISW------AGFWRIIAEKFLLTRLWTSLADQDIFNAIISEH 319
             GFN+GV+LL L  +R  S       A   + I +K+        L DQD +  +  EH
Sbjct: 290 QPGFNSGVVLLHLDNMRTSSLYNSYMNATNLQRIVQKY---HFKGHLGDQDFYTLLAFEH 346

Query: 320 PYLVYTLPCQWNVQLS 335
           P L + LPC WN QL 
Sbjct: 347 PPLFHVLPCGWNKQLC 362



 Score = 51.6 bits (122), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 38/170 (22%), Positives = 70/170 (41%), Gaps = 25/170 (14%)

Query: 27  TRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTW-SVPQVEVSFYLADSV---- 81
           T +  ++ K   F  +   H H + +      +Q  F    S P+  +  +  D++    
Sbjct: 125 TTAFGSMFKHAKFNERETFHLHFVCEKEGKEFVQQYFKEHVSHPKFNLKIHFHDTIKLAR 184

Query: 82  -----VEDV-SWIPNKH----YSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQL 131
                V+ + + + N H     + ++  L + LP ++PE + + + LD D+ F T+I  L
Sbjct: 185 QVHPLVQGIQAMLGNDHPYFSETAIF-FLSMVLPNIMPENIHRLVQLDLDLKFVTNIRDL 243

Query: 132 WALFSRLRQRQTIGLVENQSDWYLGKLW---------KNHKPWPALGRGF 172
           W+ F    +   IG+       Y   LW         K  +P+PA   GF
Sbjct: 244 WSEFDYFTKDTVIGIGNENQPVYRHLLWQYRNENPNTKVGEPYPAGQPGF 293


>gi|355751935|gb|EHH56055.1| N-acetyllactosaminide beta-1,3-N-acetylglucosaminyltransferase
           [Macaca fascicularis]
          Length = 383

 Score = 87.0 bits (214), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 70/245 (28%), Positives = 101/245 (41%), Gaps = 62/245 (25%)

Query: 477 NDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQF---LSYAGNSEALRSRRN 533
           NDV L    S+D L  +  L + WEGP+S++++ +  E  Q    L YA +S     R  
Sbjct: 70  NDVILATHASVDNLLHLSGLLERWEGPLSVSVFAATKEEAQLATVLDYALSSHCPDMRAR 129

Query: 534 IGYHVV-----------------------YKEGN--FYPINTLRNVALNQVSTPYVFLLD 568
           +  H+V                       Y  G    YP N LRN+A    +  Y  ++D
Sbjct: 130 VAMHLVCPSRYEAAEVFDKLARVAQPGINYALGTNVSYPNNLLRNLAREGAN--YALVID 187

Query: 569 IDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRYRTA---------------- 612
           +D +P  GL+  L+     +D     G   LVVPAFE +R R                  
Sbjct: 188 VDMVPSEGLWRGLREM---LDQSNQWGGTALVVPAFEIRRARRMPMNKNELVQLYQVGEV 244

Query: 613 -------FPASHAPTNFSRWVNATT------PYQIEWAPDFEPYIVAHRDLPRYDTRFVG 659
                       APTN+SRWVN          Y + W   +EP+ VA   +P +D RF  
Sbjct: 245 RPFYYGLCTPCQAPTNYSRWVNLPEESLLRPAYVVPWQDPWEPFYVAGGKVPTFDERFRQ 304

Query: 660 FGWNK 664
           +G+N+
Sbjct: 305 YGFNR 309


>gi|444510201|gb|ELV09536.1| N-acetyllactosaminide beta-1,3-N-acetylglucosaminyltransferase
           [Tupaia chinensis]
          Length = 415

 Score = 86.7 bits (213), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 75/266 (28%), Positives = 107/266 (40%), Gaps = 83/266 (31%)

Query: 475 DGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQF---LSYAGNSEALRSR 531
           D NDV L    S+D L  +  L + WEGP+S++++ +  E  Q    L+YA +S     R
Sbjct: 91  DPNDVILATHASVDNLLHLSGLLERWEGPLSVSVFAATKEEAQLATVLAYALSSHCPDMR 150

Query: 532 RNIGYHVVY------------KEGNF------------------------------YPIN 549
             +  H+V             + G F                              YP N
Sbjct: 151 ARVAMHLVCPSRYEAAVPDPREPGEFALLRSCQEVFDKLARVAQPGINYALGTNVSYPNN 210

Query: 550 TLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSM-DMHGHGGKKVLVVPAFETQR 608
            LRN+A    +  Y  ++D+D +P  GL+     S+R M D     G   LVVPAFE +R
Sbjct: 211 LLRNLAREGAN--YALVIDVDMVPSEGLW----RSLREMLDQSNQWGGTALVVPAFEIRR 264

Query: 609 YRTAFPAS------------------------HAPTNFSRWVNATT------PYQIEWAP 638
            R   P S                         APTN+SRWVN          Y + W  
Sbjct: 265 ARR-MPMSKTELLQLYQVGEVRPFYYGLCTPCQAPTNYSRWVNLPEESLLRPAYVVPWQD 323

Query: 639 DFEPYIVAHRDLPRYDTRFVGFGWNK 664
            +EP+ VA   +P +D RF  +G+N+
Sbjct: 324 PWEPFYVAGGKVPPFDERFRQYGFNR 349


>gi|241133589|ref|XP_002404534.1| Gyltl1B protein, putative [Ixodes scapularis]
 gi|215493625|gb|EEC03266.1| Gyltl1B protein, putative [Ixodes scapularis]
          Length = 281

 Score = 86.3 bits (212), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 70/219 (31%), Positives = 95/219 (43%), Gaps = 46/219 (21%)

Query: 487 MDRLQMVEMLFKHWEGPISLTLYMSDAEAQ-----QFLSYAGNSEALR----SRRNIGYH 537
           MDRL ++E   + WEGP+SL +Y      Q     Q L      + L+    S  ++ Y 
Sbjct: 1   MDRLPVLEHTLQQWEGPVSLVIYAPVKAGQPEGDWQKLYVQKKLQKLKLHPDSHVSLAYG 60

Query: 538 VVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKK 597
           V       YPIN LRN+A+ Q +  ++FL D DF P    + +   S RS+    +  K 
Sbjct: 61  V--PGSGDYPINALRNLAIRQTNCEHLFLADADFQPSPDFHKHFLLSTRSV---SNTDKV 115

Query: 598 VLVVPAFE----------------------------TQRYR-TAFPASHAPTNFSRWVNA 628
             VVPAFE                             QR+R    P SH  T++ +W   
Sbjct: 116 AFVVPAFEYLELPQKSDGVAQTKEELMQLLHRQDPFVQRFRHDVTPDSHQSTDYWKWYRT 175

Query: 629 TTPYQIEWAPD-FEPYIVAHR--DLPRYDTRFVGFGWNK 664
             PY +    D FEPY+V      LP YD RF G+G NK
Sbjct: 176 DRPYPVRTFVDMFEPYMVVKNVPQLPLYDERFTGYGSNK 214


>gi|256083969|ref|XP_002578207.1| TATA-binding protein-associated phosphoprotein [Schistosoma
           mansoni]
          Length = 316

 Score = 86.3 bits (212), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 44/127 (34%), Positives = 69/127 (54%), Gaps = 7/127 (5%)

Query: 178 VVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQ 237
           + E +    + H +G    LKL + ++LP  + K IVLDTD++   DI+ LW+LF    +
Sbjct: 170 IKERIKSFKSSHPTGHKNYLKLLITQILPNEIRKVIVLDTDILLNDDISNLWSLFDEFNK 229

Query: 238 RQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFW-----RI 292
            Q IG+   Q+  +   +    + WP LG G+N+G++L+ LTKLR   W   W      +
Sbjct: 230 DQCIGIAAEQNPTFYSNM--GQRFWPTLGYGYNSGILLIHLTKLRARGWDEIWMKTAFNL 287

Query: 293 IAEKFLL 299
           I EK +L
Sbjct: 288 IEEKRIL 294



 Score = 63.5 bits (153), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 30/92 (32%), Positives = 49/92 (53%), Gaps = 2/92 (2%)

Query: 81  VVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQ 140
           + E +    + H +G    LKL + ++LP  + K IVLDTD++   DI+ LW+LF    +
Sbjct: 170 IKERIKSFKSSHPTGHKNYLKLLITQILPNEIRKVIVLDTDILLNDDISNLWSLFDEFNK 229

Query: 141 RQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
            Q IG+   Q+  +   +    + WP LG G+
Sbjct: 230 DQCIGIAAEQNPTFYSNM--GQRFWPTLGYGY 259


>gi|351710870|gb|EHB13789.1| N-acetyllactosaminide beta-1,3-N-acetylglucosaminyltransferase
           [Heterocephalus glaber]
          Length = 415

 Score = 86.3 bits (212), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 72/264 (27%), Positives = 105/264 (39%), Gaps = 79/264 (29%)

Query: 475 DGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQF---LSYAGNSEALRSR 531
           D NDV L    S+D L  +  L + WEGP+S++++ +  E  Q    L+YA +S     R
Sbjct: 91  DPNDVILATHASVDNLLHLSGLLERWEGPLSVSVFAATKEQAQLATVLAYALSSHCPDMR 150

Query: 532 RNIGYHVVY------------KEGNF------------------------------YPIN 549
             +  H+V             + G F                              YP N
Sbjct: 151 ARVAMHLVCPSRYEAAVPDPREPGEFALLRSCQEVFDKLARVTRPGINYALGTNVSYPNN 210

Query: 550 TLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRY 609
            LRN+A    +  Y  ++D+D +P  GL+  L+     +D     G   LVVPAFE +R 
Sbjct: 211 LLRNLAREGAN--YALVIDMDMVPSEGLWRGLREM---LDQSNQWGGTALVVPAFEIRRA 265

Query: 610 RTA-----------------------FPASHAPTNFSRWVNATT------PYQIEWAPDF 640
           R                             HAPTN+SRWVN          Y + W   +
Sbjct: 266 RRMPMNKAELLQLYQVGEVRPFYYGLCTPCHAPTNYSRWVNLPEESLLRPSYIVPWQDPW 325

Query: 641 EPYIVAHRDLPRYDTRFVGFGWNK 664
           EP+ VA   +P +D RF  +G+N+
Sbjct: 326 EPFYVAGGKVPAFDERFRQYGFNR 349


>gi|384486813|gb|EIE78993.1| hypothetical protein RO3G_03698 [Rhizopus delemar RA 99-880]
          Length = 415

 Score = 85.9 bits (211), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 68/229 (29%), Positives = 112/229 (48%), Gaps = 38/229 (16%)

Query: 471 EASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRS 530
           E   +  D+T+V  ++ +R+  +  L K ++GPIS T+++SD E  ++          +S
Sbjct: 98  EMDMESKDITIVTLVTRNRIPNLARLAKKYKGPISTTIHISDDEEGEYTLEVLEDTIEKS 157

Query: 531 ---RRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRS 587
              R+ +  H+V  E +   +N  RNVA     T YV LLDIDF P   L   + ++ ++
Sbjct: 158 SEMRKYVDVHIVRDEFD-RELNLWRNVAKLFARTDYVMLLDIDFYPCSNLRDNVLNNPKA 216

Query: 588 MDMHGHGGKKVLVVPAFETQR------YRTAFPAS-----------------------HA 618
           M +    G+  LV+PAFE  +      YRT FP +                       HA
Sbjct: 217 MSL-LESGEAALVIPAFEFSKQQDGLDYRT-FPKNKQELIMAYQDNKVEMFHSFWLPGHA 274

Query: 619 PTNFSRWVNA-TTPYQI-EWAPDFEPYIV-AHRDLPRYDTRFVGFGWNK 664
           PT++ RW++   + YQ+  +   +EPY++  ++  P  D RF G+G NK
Sbjct: 275 PTDYDRWMSTDDSIYQVTTYQHSYEPYVIFKNQGSPWCDERFAGYGSNK 323


>gi|291416316|ref|XP_002724391.1| PREDICTED: UDP-GlcNAc:betaGal
           beta-1,3-N-acetylglucosaminyltransferase 1 [Oryctolagus
           cuniculus]
          Length = 415

 Score = 85.5 bits (210), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 73/265 (27%), Positives = 106/265 (40%), Gaps = 81/265 (30%)

Query: 475 DGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQF---LSYAGNSEALRSR 531
           D NDV L    S+D L  +  L + WEGP+S++++ +  E  Q    L+YA +S     R
Sbjct: 91  DPNDVILATHASVDNLLHLSGLLERWEGPLSVSVFAATKEEAQLATVLAYALSSHCPDMR 150

Query: 532 RNIGYHVVY------------KEGNF------------------------------YPIN 549
             +  H+V             + G F                              YP N
Sbjct: 151 ARVAMHLVCPSRYEAAVPDPREPGEFALLRTCQEVFDKLARVAQPGVNYALGTNVSYPNN 210

Query: 550 TLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRY 609
            LRN+A    +  Y  ++D+D +P  GL+  L+     +D     G   LVVPAFE +R 
Sbjct: 211 LLRNLAREGAN--YALVIDVDMVPSEGLWRGLREM---LDQSSQWGGTALVVPAFEIRRA 265

Query: 610 RTAFPAS------------------------HAPTNFSRWVNATT------PYQIEWAPD 639
           R   P S                         APTN+SRWVN          Y + W   
Sbjct: 266 RR-MPMSKTELLQLYQVGEVRPFYYGLCTPCQAPTNYSRWVNLPEESLLRPAYVVPWQDP 324

Query: 640 FEPYIVAHRDLPRYDTRFVGFGWNK 664
           +EP+ VA   +P +D RF  +G+N+
Sbjct: 325 WEPFYVAGGKVPTFDERFRQYGFNR 349


>gi|395851721|ref|XP_003798401.1| PREDICTED: N-acetyllactosaminide
           beta-1,3-N-acetylglucosaminyltransferase [Otolemur
           garnettii]
          Length = 415

 Score = 85.5 bits (210), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 73/265 (27%), Positives = 106/265 (40%), Gaps = 81/265 (30%)

Query: 475 DGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQF---LSYAGNSEALRSR 531
           D NDV L    S+D L  +  L + WEGP+S++++ +  E  Q    L+YA +S     R
Sbjct: 91  DPNDVILATHASVDNLLHLSGLLERWEGPLSVSVFAATKEEAQLATVLAYALSSHCPDMR 150

Query: 532 RNIGYHVVY------------KEGNF------------------------------YPIN 549
             +  H+V             + G F                              YP N
Sbjct: 151 ARVAMHLVCPSRYEAAVPDPREPGEFALLRSCQEVFDKLARVAQPGINYALGTNVSYPNN 210

Query: 550 TLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRY 609
            LRN+A    +  Y  ++D+D +P  GL+  L+     +D     G   LVVPAFE +R 
Sbjct: 211 LLRNLAREGAN--YALVIDVDMVPSEGLWRGLREM---LDQSNQWGGTALVVPAFEIRRA 265

Query: 610 RTAFPAS------------------------HAPTNFSRWVNATT------PYQIEWAPD 639
           R   P S                         APTN+SRWVN          Y + W   
Sbjct: 266 RR-MPMSKTELLQLYQVGEVRPFYYGLCTPCQAPTNYSRWVNLPEESLLRPAYVVPWQDP 324

Query: 640 FEPYIVAHRDLPRYDTRFVGFGWNK 664
           +EP+ VA   +P +D RF  +G+N+
Sbjct: 325 WEPFYVAGGKVPTFDERFRQYGFNR 349


>gi|189442730|gb|AAI67687.1| LOC100170600 protein [Xenopus (Silurana) tropicalis]
          Length = 435

 Score = 84.7 bits (208), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 76/263 (28%), Positives = 110/263 (41%), Gaps = 80/263 (30%)

Query: 478 DVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDA------------------EAQQFL 519
           DV L +  S++ L  ++ L +HW+G ISL L+ S+A                    +Q +
Sbjct: 111 DVVLASHASVNNLGHLQDLVQHWDGRISLALFASNAVQAKLAVMFVYALSQLCPHVRQRV 170

Query: 520 SYA-----------------------GNSEALRSRR-NIGYHVVYKEGNF-YPINTLRNV 554
           S+                         N EA+ S+  ++G  VV   GN  YP N LRNV
Sbjct: 171 SFHLVCKSSDRVTFPELEDLSDFATLPNCEAVFSKAADMGIKVVNYAGNASYPNNLLRNV 230

Query: 555 ALNQV-STPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFE---TQR-- 608
           A   + S  YV ++DID +P  GL    +S   ++   G     V VVPAFE   T+R  
Sbjct: 231 ARAGIESAAYVLVVDIDMVPSEGL----RSGFVNLATGGVDQHLVFVVPAFEIRHTRRLP 286

Query: 609 ------------------YRTAFPASHAPTNFSRWVNA---------TTPYQIEWAPDFE 641
                             Y    P   APTN+S W+N             Y +EW   +E
Sbjct: 287 STKEELMRLYQVGEVRAFYEELCPRCQAPTNYSLWINLPEKKPAAKLGVAYVVEWKDPWE 346

Query: 642 PYIVAHRDLPRYDTRFVGFGWNK 664
           P+ +   D+P YD RF  +G+N+
Sbjct: 347 PFYIGRADVPAYDERFKQYGFNR 369


>gi|193786796|dbj|BAG52119.1| unnamed protein product [Homo sapiens]
          Length = 415

 Score = 84.7 bits (208), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 71/264 (26%), Positives = 105/264 (39%), Gaps = 79/264 (29%)

Query: 475 DGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQF---LSYAGNSEALRSR 531
           D NDV L +  S+D L  +  L + WEGP+S++++ +  E  Q    L+YA +S     R
Sbjct: 91  DPNDVILASHASVDNLLHLSGLLERWEGPLSVSVFAATKEEAQLATVLAYALSSHCPDMR 150

Query: 532 RNIGYHVVY------------KEGNF------------------------------YPIN 549
             +  H+V             + G F                              YP N
Sbjct: 151 ARVAMHLVCPSRYEAAVPDPREPGEFALLRSCQEVFDKLARVAQPGINYALGTNVSYPNN 210

Query: 550 TLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRY 609
            LRN+A    +  Y  ++D+D +P  GL+  L+     +D     G   LVVPAFE +R 
Sbjct: 211 LLRNLAREGAN--YALVIDVDMVPSEGLWRGLREM---LDQSNQWGGTALVVPAFEIRRA 265

Query: 610 RTA-----------------------FPASHAPTNFSRWVNATT------PYQIEWAPDF 640
           R                              APTN+SRWVN          Y + W   +
Sbjct: 266 RRMPMNKNELVQLYQVGEVRPFYYGLCTPCQAPTNYSRWVNLPEESLLRPAYVVPWQDPW 325

Query: 641 EPYIVAHRDLPRYDTRFVGFGWNK 664
           EP+ VA   +P +D RF  +G+N+
Sbjct: 326 EPFYVAGGKVPTFDERFRQYGFNR 349


>gi|426252530|ref|XP_004019962.1| PREDICTED: LOW QUALITY PROTEIN: N-acetyllactosaminide
           beta-1,3-N-acetylglucosaminyltransferase [Ovis aries]
          Length = 421

 Score = 84.7 bits (208), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 73/266 (27%), Positives = 105/266 (39%), Gaps = 81/266 (30%)

Query: 474 SDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQF---LSYAGNSEALRS 530
           SD NDV L    S+D L  +  L + WEGP+S++++ +  E  Q    L+YA +S     
Sbjct: 96  SDPNDVILATHASVDNLLHLSGLLERWEGPLSVSVFAATKEEAQLATVLTYALSSHCPDM 155

Query: 531 RRNIGYHVVY------------KEGNF------------------------------YPI 548
           R  +  H+V             + G F                              YP 
Sbjct: 156 RARVAMHLVCPSRYEAAVPDPREPGEFALLRSCQEVFDKLARVAQPGVNYALGTNVSYPN 215

Query: 549 NTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSM-DMHGHGGKKVLVVPAFETQ 607
           N LRN+A    +  Y  ++D+D +P  GL+     S+R M D         LVVPAFE +
Sbjct: 216 NLLRNLAREGAN--YALVIDVDMVPSEGLW----RSLREMLDQSKQWAGTALVVPAFEIR 269

Query: 608 RYRTA-----------------------FPASHAPTNFSRWVNAT------TPYQIEWAP 638
           R R                              APTN+SRWVN          Y + W  
Sbjct: 270 RARRMPMNKNELLQLYQVGEVRPFYYGLCTPCQAPTNYSRWVNLPEETLLRPAYVVPWQD 329

Query: 639 DFEPYIVAHRDLPRYDTRFVGFGWNK 664
            +EP+ VA   +P +D RF  +G+N+
Sbjct: 330 PWEPFYVAGGKVPTFDERFRQYGFNR 355


>gi|296218850|ref|XP_002807418.1| PREDICTED: LOW QUALITY PROTEIN: N-acetyllactosaminide
           beta-1,3-N-acetylglucosaminyltransferase [Callithrix
           jacchus]
          Length = 415

 Score = 84.3 bits (207), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 71/264 (26%), Positives = 104/264 (39%), Gaps = 79/264 (29%)

Query: 475 DGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQF---LSYAGNSEALRSR 531
           D NDV L    S+D L  +  L + WEGP+S++++ +  E  Q    L+YA +S     R
Sbjct: 91  DPNDVILATHASVDNLLHLSGLLERWEGPLSVSVFAATKEEAQLATVLAYALSSHCPDMR 150

Query: 532 RNIGYHVVY------------KEGNF------------------------------YPIN 549
             +  H+V             + G F                              YP N
Sbjct: 151 ARVAMHLVCPSRYEAAVPDPREPGEFALLRSCQEVFDKLARVAQPGINYALGTNVSYPNN 210

Query: 550 TLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRY 609
            LRN+A    +  Y  ++D+D +P  GL+  L+     +D     G   LVVPAFE +R 
Sbjct: 211 LLRNLAREGAN--YALVIDVDMVPSEGLWRGLREM---LDQSNQWGGTALVVPAFEIRRA 265

Query: 610 RTA-----------------------FPASHAPTNFSRWVNATT------PYQIEWAPDF 640
           R                              APTN+SRWVN          Y + W   +
Sbjct: 266 RRMPMNKNELVQLYQVGEVRPFYYGLCTPCQAPTNYSRWVNLPEESLLRPAYVVPWQDPW 325

Query: 641 EPYIVAHRDLPRYDTRFVGFGWNK 664
           EP+ VA   +P +D RF  +G+N+
Sbjct: 326 EPFYVAGGKVPTFDERFRQYGFNR 349


>gi|301619991|ref|XP_002939368.1| PREDICTED: n-acetyllactosaminide
           beta-1,3-N-acetylglucosaminyltransferase [Xenopus
           (Silurana) tropicalis]
          Length = 421

 Score = 84.3 bits (207), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 76/263 (28%), Positives = 110/263 (41%), Gaps = 80/263 (30%)

Query: 478 DVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDA------------------EAQQFL 519
           DV L +  S++ L  ++ L +HW+G ISL L+ S+A                    +Q +
Sbjct: 97  DVVLASHASVNNLGHLQDLVQHWDGRISLALFASNAVQAKLAVMFVYALSQLCPHVRQRV 156

Query: 520 SYA-----------------------GNSEALRSRR-NIGYHVVYKEGNF-YPINTLRNV 554
           S+                         N EA+ S+  ++G  VV   GN  YP N LRNV
Sbjct: 157 SFHLVCKSSDRVTFPELEDLSDFATLPNCEAVFSKAADMGIKVVNYAGNASYPNNLLRNV 216

Query: 555 ALNQV-STPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFE---TQR-- 608
           A   + S  YV ++DID +P  GL    +S   ++   G     V VVPAFE   T+R  
Sbjct: 217 ARAGIESAAYVLVVDIDMVPSEGL----RSGFVNLATGGVDQHLVFVVPAFEIRHTRRLP 272

Query: 609 ------------------YRTAFPASHAPTNFSRWVNA---------TTPYQIEWAPDFE 641
                             Y    P   APTN+S W+N             Y +EW   +E
Sbjct: 273 STKEELMRLYQVGEVRAFYEELCPRCQAPTNYSLWINLPEKKPAAKLGVAYVVEWKDPWE 332

Query: 642 PYIVAHRDLPRYDTRFVGFGWNK 664
           P+ +   D+P YD RF  +G+N+
Sbjct: 333 PFYIGRADVPAYDERFKQYGFNR 355


>gi|197101823|ref|NP_001126908.1| N-acetyllactosaminide beta-1,3-N-acetylglucosaminyltransferase
           [Pongo abelii]
 gi|67460445|sp|Q5R4S2.1|B3GN1_PONAB RecName: Full=N-acetyllactosaminide
           beta-1,3-N-acetylglucosaminyltransferase; AltName:
           Full=I-beta-1,3-N-acetylglucosaminyltransferase;
           Short=iGnT; AltName: Full=Poly-N-acetyllactosamine
           extension enzyme; AltName: Full=UDP-GlcNAc:betaGal
           beta-1,3-N-acetylglucosaminyltransferase 1
 gi|55733120|emb|CAH93244.1| hypothetical protein [Pongo abelii]
          Length = 415

 Score = 84.3 bits (207), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 71/264 (26%), Positives = 104/264 (39%), Gaps = 79/264 (29%)

Query: 475 DGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQF---LSYAGNSEALRSR 531
           D NDV L    S+D L  +  L + WEGP+S++++ +  E  Q    L+YA +S     R
Sbjct: 91  DPNDVILATHASVDNLLHLSGLLERWEGPLSVSVFAATKEEAQLATVLAYALSSHCPDMR 150

Query: 532 RNIGYHVVY------------KEGNF------------------------------YPIN 549
             +  H+V             + G F                              YP N
Sbjct: 151 ARVAMHLVCPSRYEAAVPDPREPGEFALLRSCQEVFDKLARVAQPGINYALGTNVSYPNN 210

Query: 550 TLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRY 609
            LRN+A    +  Y  ++D+D +P  GL+  L+     +D     G   LVVPAFE +R 
Sbjct: 211 LLRNLAREGAN--YALVIDVDMVPSEGLWRGLREM---LDQSNQWGGTALVVPAFEIRRA 265

Query: 610 RTA-----------------------FPASHAPTNFSRWVNATT------PYQIEWAPDF 640
           R                              APTN+SRWVN          Y + W   +
Sbjct: 266 RRMPMNKNELVQLYQVGEVRPFYYGLCTPCQAPTNYSRWVNLPEESLLRPAYVVPWQDPW 325

Query: 641 EPYIVAHRDLPRYDTRFVGFGWNK 664
           EP+ VA   +P +D RF  +G+N+
Sbjct: 326 EPFYVAGGKVPTFDERFRQYGFNR 349


>gi|5802984|ref|NP_006867.1| N-acetyllactosaminide beta-1,3-N-acetylglucosaminyltransferase
           [Homo sapiens]
 gi|114638684|ref|XP_508571.2| PREDICTED: N-acetyllactosaminide
           beta-1,3-N-acetylglucosaminyltransferase [Pan
           troglodytes]
 gi|397517017|ref|XP_003828716.1| PREDICTED: N-acetyllactosaminide
           beta-1,3-N-acetylglucosaminyltransferase [Pan paniscus]
 gi|426369296|ref|XP_004051629.1| PREDICTED: N-acetyllactosaminide
           beta-1,3-N-acetylglucosaminyltransferase [Gorilla
           gorilla gorilla]
 gi|23396461|sp|O43505.1|B3GN1_HUMAN RecName: Full=N-acetyllactosaminide
           beta-1,3-N-acetylglucosaminyltransferase; AltName:
           Full=I-beta-1,3-N-acetylglucosaminyltransferase;
           Short=iGnT; AltName: Full=Poly-N-acetyllactosamine
           extension enzyme; AltName: Full=UDP-GlcNAc:betaGal
           beta-1,3-N-acetylglucosaminyltransferase 1
 gi|2745741|gb|AAC39538.1| i-beta-1,3-N-acetylglucosaminyltransferase [Homo sapiens]
 gi|18314366|gb|AAH21965.1| UDP-GlcNAc:betaGal beta-1,3-N-acetylglucosaminyltransferase 1 [Homo
           sapiens]
 gi|66394590|gb|AAY46155.1| UDP-GlcNAc:betaGal beta-1,3-N-acetylglucosaminyltransferase 6 [Homo
           sapiens]
 gi|119594923|gb|EAW74517.1| UDP-GlcNAc:betaGal beta-1,3-N-acetylglucosaminyltransferase 6,
           isoform CRA_a [Homo sapiens]
 gi|119594924|gb|EAW74518.1| UDP-GlcNAc:betaGal beta-1,3-N-acetylglucosaminyltransferase 6,
           isoform CRA_a [Homo sapiens]
 gi|123984573|gb|ABM83632.1| UDP-GlcNAc:betaGal beta-1,3-N-acetylglucosaminyltransferase 1
           [synthetic construct]
 gi|123998563|gb|ABM86883.1| UDP-GlcNAc:betaGal beta-1,3-N-acetylglucosaminyltransferase 1
           [synthetic construct]
 gi|410208370|gb|JAA01404.1| UDP-GlcNAc:betaGal beta-1,3-N-acetylglucosaminyltransferase 1 [Pan
           troglodytes]
 gi|410247954|gb|JAA11944.1| UDP-GlcNAc:betaGal beta-1,3-N-acetylglucosaminyltransferase 1 [Pan
           troglodytes]
 gi|410301258|gb|JAA29229.1| UDP-GlcNAc:betaGal beta-1,3-N-acetylglucosaminyltransferase 1 [Pan
           troglodytes]
 gi|410353541|gb|JAA43374.1| UDP-GlcNAc:betaGal beta-1,3-N-acetylglucosaminyltransferase 1 [Pan
           troglodytes]
          Length = 415

 Score = 84.3 bits (207), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 71/264 (26%), Positives = 104/264 (39%), Gaps = 79/264 (29%)

Query: 475 DGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQF---LSYAGNSEALRSR 531
           D NDV L    S+D L  +  L + WEGP+S++++ +  E  Q    L+YA +S     R
Sbjct: 91  DPNDVILATHASVDNLLHLSGLLERWEGPLSVSVFAATKEEAQLATVLAYALSSHCPDMR 150

Query: 532 RNIGYHVVY------------KEGNF------------------------------YPIN 549
             +  H+V             + G F                              YP N
Sbjct: 151 ARVAMHLVCPSRYEAAVPDPREPGEFALLRSCQEVFDKLARVAQPGINYALGTNVSYPNN 210

Query: 550 TLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRY 609
            LRN+A    +  Y  ++D+D +P  GL+  L+     +D     G   LVVPAFE +R 
Sbjct: 211 LLRNLAREGAN--YALVIDVDMVPSEGLWRGLREM---LDQSNQWGGTALVVPAFEIRRA 265

Query: 610 RTA-----------------------FPASHAPTNFSRWVNATT------PYQIEWAPDF 640
           R                              APTN+SRWVN          Y + W   +
Sbjct: 266 RRMPMNKNELVQLYQVGEVRPFYYGLCTPCQAPTNYSRWVNLPEESLLRPAYVVPWQDPW 325

Query: 641 EPYIVAHRDLPRYDTRFVGFGWNK 664
           EP+ VA   +P +D RF  +G+N+
Sbjct: 326 EPFYVAGGKVPTFDERFRQYGFNR 349


>gi|403293582|ref|XP_003937792.1| PREDICTED: N-acetyllactosaminide
           beta-1,3-N-acetylglucosaminyltransferase [Saimiri
           boliviensis boliviensis]
          Length = 415

 Score = 84.3 bits (207), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 71/264 (26%), Positives = 104/264 (39%), Gaps = 79/264 (29%)

Query: 475 DGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQF---LSYAGNSEALRSR 531
           D NDV L    S+D L  +  L + WEGP+S++++ +  E  Q    L+YA +S     R
Sbjct: 91  DPNDVILATHASVDNLLHLSGLLERWEGPLSVSVFAATKEEAQLATVLAYALSSHCPDMR 150

Query: 532 RNIGYHVVY------------KEGNF------------------------------YPIN 549
             +  H+V             + G F                              YP N
Sbjct: 151 ARVAMHLVCPSRYEAAVPDPREPGEFALLRSCQEVFDKLARVAQPGINYALGTNVSYPNN 210

Query: 550 TLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRY 609
            LRN+A    +  Y  ++D+D +P  GL+  L+     +D     G   LVVPAFE +R 
Sbjct: 211 LLRNLAREGAN--YALVIDVDMVPSEGLWRGLREM---LDQSNQWGGTALVVPAFEIRRA 265

Query: 610 RTA-----------------------FPASHAPTNFSRWVNATT------PYQIEWAPDF 640
           R                              APTN+SRWVN          Y + W   +
Sbjct: 266 RRMPMNKNELVQLYQVGEVRPFYYGLCTPCQAPTNYSRWVNLPEESLLRPAYVVPWQDPW 325

Query: 641 EPYIVAHRDLPRYDTRFVGFGWNK 664
           EP+ VA   +P +D RF  +G+N+
Sbjct: 326 EPFYVAGGKVPTFDERFRQYGFNR 349


>gi|343958360|dbj|BAK63035.1| N-acetyllactosaminide beta-1,3-N-acetylglucosaminyltransferase [Pan
           troglodytes]
          Length = 415

 Score = 84.3 bits (207), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 71/264 (26%), Positives = 104/264 (39%), Gaps = 79/264 (29%)

Query: 475 DGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQF---LSYAGNSEALRSR 531
           D NDV L    S+D L  +  L + WEGP+S++++ +  E  Q    L+YA +S     R
Sbjct: 91  DPNDVILATHASVDNLLHLSGLLERWEGPLSVSVFAATKEEAQLATVLAYALSSHCPDMR 150

Query: 532 RNIGYHVVY------------KEGNF------------------------------YPIN 549
             +  H+V             + G F                              YP N
Sbjct: 151 ARVAVHLVCPSRYEAAVPDPREPGEFALLRSCQEVFDKLARVAQPGINYALGTNVSYPNN 210

Query: 550 TLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRY 609
            LRN+A    +  Y  ++D+D +P  GL+  L+     +D     G   LVVPAFE +R 
Sbjct: 211 LLRNLAREGAN--YALVIDVDMVPSEGLWRGLREM---LDQSNQWGGTALVVPAFEIRRA 265

Query: 610 RTA-----------------------FPASHAPTNFSRWVNATT------PYQIEWAPDF 640
           R                              APTN+SRWVN          Y + W   +
Sbjct: 266 RRMPMNKNELVQLYQVGEMRPFYYGLCTPCQAPTNYSRWVNLPEESLLRPAYVVPWQDPW 325

Query: 641 EPYIVAHRDLPRYDTRFVGFGWNK 664
           EP+ VA   +P +D RF  +G+N+
Sbjct: 326 EPFYVAGGKVPTFDERFRQYGFNR 349


>gi|328719955|ref|XP_001943688.2| PREDICTED: glucoside xylosyltransferase 1-like isoform 1
           [Acyrthosiphon pisum]
 gi|328719957|ref|XP_003246910.1| PREDICTED: glucoside xylosyltransferase 1-like isoform 2
           [Acyrthosiphon pisum]
          Length = 376

 Score = 84.0 bits (206), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 51/151 (33%), Positives = 78/151 (51%), Gaps = 16/151 (10%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLG--KL 255
           +L LP VL   L   + +DTD +F + +  +W  FS++   Q   +V    D  +G    
Sbjct: 155 RLFLPTVLNH-LDSILYVDTDTLFLSSVEDVWKHFSQMNSSQLAAMVPEHEDMNVGWYNR 213

Query: 256 WKNHKPWPALGR-GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNA 314
           + NH   P  G+ G N+GV+L++LT++R+  W      + EK+   RL+ +  DQDI N 
Sbjct: 214 FANH---PYYGKLGLNSGVMLMNLTRMREFDWISRLAPVLEKY---RLYLTWGDQDIINV 267

Query: 315 IISEHPYLVYTLPCQWNVQLSDNTRSDELCY 345
           I  +HP  VY  PC++N       RSD   Y
Sbjct: 268 IFHDHPDKVYVYPCRYNY------RSDHCMY 292



 Score = 42.4 bits (98), Expect = 0.90,   Method: Compositional matrix adjust.
 Identities = 44/169 (26%), Positives = 71/169 (42%), Gaps = 23/169 (13%)

Query: 13  EVIQVAIVCAGYNSTRSLVTLIKSILFYRKNP-LHFHLITDTVALNILQTLFST---WSV 68
           EV+  A++C   N T   + +IKS + +R N  L   +I +    NI Q    T   W +
Sbjct: 67  EVVIAAVICG--NRTSQALVMIKSAIVFRGNSDLRIVVIAEK---NIQQEFDETLRYWRL 121

Query: 69  PQVEVSFYLADSVVEDVSWIPNKHYSGVYGLL------KLTLPKVLPETLAKTIVLDTDV 122
              +   Y   S++      P +H      L       +L LP VL   L   + +DTD 
Sbjct: 122 LTNDSFSYEIHSII-----FPKEHSDEWRKLFQPCASERLFLPTVLNH-LDSILYVDTDT 175

Query: 123 IFATDIAQLWALFSRLRQRQTIGLVENQSDWYLG--KLWKNHKPWPALG 169
           +F + +  +W  FS++   Q   +V    D  +G    + NH  +  LG
Sbjct: 176 LFLSSVEDVWKHFSQMNSSQLAAMVPEHEDMNVGWYNRFANHPYYGKLG 224


>gi|402892745|ref|XP_003909569.1| PREDICTED: N-acetyllactosaminide
           beta-1,3-N-acetylglucosaminyltransferase [Papio anubis]
          Length = 415

 Score = 83.6 bits (205), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 70/264 (26%), Positives = 104/264 (39%), Gaps = 79/264 (29%)

Query: 475 DGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQF---LSYAGNSEALRSR 531
           + NDV L    S+D L  +  L + WEGP+S++++ +  E  Q    L+YA +S     R
Sbjct: 91  EPNDVILATHASVDNLLHLSGLLERWEGPLSVSVFAATKEEAQLATVLAYALSSHCPDMR 150

Query: 532 RNIGYHVVY------------KEGNF------------------------------YPIN 549
             +  H+V             + G F                              YP N
Sbjct: 151 ARVAMHLVCPSRYEAAVPDPREPGEFALLRSCQEVFDKLARVAQPGINYALGTNVSYPNN 210

Query: 550 TLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRY 609
            LRN+A    +  Y  ++D+D +P  GL+  L+     +D     G   LVVPAFE +R 
Sbjct: 211 LLRNLAREGAN--YALVIDVDMVPSEGLWRGLREM---LDQSNQWGSTALVVPAFEIRRA 265

Query: 610 RTA-----------------------FPASHAPTNFSRWVNATT------PYQIEWAPDF 640
           R                              APTN+SRWVN          Y + W   +
Sbjct: 266 RRMPMNKNELVQLYQVGEVRPFYYGLCTPCQAPTNYSRWVNLAEESLLRPAYVVPWQDPW 325

Query: 641 EPYIVAHRDLPRYDTRFVGFGWNK 664
           EP+ VA   +P +D RF  +G+N+
Sbjct: 326 EPFYVAGGKVPTFDERFRQYGFNR 349


>gi|410974574|ref|XP_003993719.1| PREDICTED: N-acetyllactosaminide
           beta-1,3-N-acetylglucosaminyltransferase [Felis catus]
          Length = 415

 Score = 83.6 bits (205), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 72/265 (27%), Positives = 106/265 (40%), Gaps = 81/265 (30%)

Query: 475 DGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQF---LSYAGNSEALRSR 531
           D NDV L    S+D L  +  L + WEGP+S++++ +  E  Q    L+YA +S     R
Sbjct: 91  DPNDVILATHASVDNLLHLSGLLERWEGPLSVSVFAATKEEAQLATVLTYALSSHCPDMR 150

Query: 532 RNIGYHVVY------------KEGNF------------------------------YPIN 549
             +  H+V             + G F                              YP N
Sbjct: 151 ARVAMHLVCPSRYEAAVPDPREPGEFALLRSCQEVFDKLARVAQPGINYALGTNVSYPNN 210

Query: 550 TLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRY 609
            LRN+A    +  Y  ++D+D +P  GL+  L+     +D         LVVPAFE +R 
Sbjct: 211 LLRNLAREGAN--YALVIDVDMVPSEGLWRGLREM---LDQSKQWAGTALVVPAFEIRRA 265

Query: 610 RTAFPAS------------------------HAPTNFSRWVNATT------PYQIEWAPD 639
           R   PA+                         APTN+SRWVN          Y + W   
Sbjct: 266 RR-MPANKNELLQLYQVGEVRPFYYGLCTPCQAPTNYSRWVNLPEETLLRPAYVVPWQDP 324

Query: 640 FEPYIVAHRDLPRYDTRFVGFGWNK 664
           +EP+ VA   +P +D RF  +G+N+
Sbjct: 325 WEPFYVAGGKVPTFDERFRQYGFNR 349


>gi|161078603|ref|NP_001097911.1| CG9996 [Drosophila melanogaster]
 gi|158030380|gb|AAF56394.2| CG9996 [Drosophila melanogaster]
 gi|162944842|gb|ABY20490.1| IP22168p [Drosophila melanogaster]
          Length = 362

 Score = 83.2 bits (204), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 49/162 (30%), Positives = 85/162 (52%), Gaps = 9/162 (5%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L   +   + +DTD++F + I+ +W  F +  + Q   L     +  +G  W 
Sbjct: 136 RLFLPSLLTH-VDSLLYVDTDILFLSPISDIWRFFKKFNETQMSALTPEHENENIG--WY 192

Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAI 315
           N     P  GR G N+GV+L++LT++R++ W      I +++ L  +W    DQDI N +
Sbjct: 193 NRFARHPFYGRLGVNSGVMLMNLTRMREMKWEQHIVSIHKEYKLRIIW---GDQDIINIL 249

Query: 316 ISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
              HP  +Y +PC++N +  D+     +C    T +K+IH N
Sbjct: 250 FYYHPDKLYIMPCEYNYR-PDHCMYMSICNMSQTGVKVIHGN 290


>gi|281208069|gb|EFA82247.1| putative glycosyltransferase [Polysphondylium pallidum PN500]
          Length = 391

 Score = 83.2 bits (204), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 54/206 (26%), Positives = 96/206 (46%), Gaps = 31/206 (15%)

Query: 485 LSMDRLQMVEMLFKHWEGPISLTLYM-SDAEAQQFLSYAGNSEALRSRRNIGYHVVYKEG 543
           +S+DRL+ +  +   W  PIS  +Y+ +D E +   +    S ++    +I  H++Y   
Sbjct: 70  VSVDRLERLAWMADKWRAPISAAVYIQNDTEIEAIANLIRGSYSVTQFVDI--HLLYANN 127

Query: 544 NFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPA 603
             YPINTLRN+++    T +V L+D DF+P   LY   K+ ++             V+P+
Sbjct: 128 TRYPINTLRNLSIKHAETDWVLLMDADFIPPISLY---KNIVQHTYKVPRDELTAFVIPS 184

Query: 604 FETQ--RYR----------------------TAFPASHAPTNFSRWVNATTPYQIEWAPD 639
           F +   R++                         P  H P+N++RW      Y+ ++   
Sbjct: 185 FASSLDRFKIPDDKLDLLKEIKKKTIVPTNLNVCPKCHTPSNYTRWYKEKDTYEAKYQWV 244

Query: 640 FEPYIVAHR-DLPRYDTRFVGFGWNK 664
           +EPY+V  + D   +D R  G+G++K
Sbjct: 245 YEPYLVYRKIDAEPFDERLKGYGFDK 270


>gi|452824557|gb|EME31559.1| glycosyltransferase-like protein LARGE [Galdieria sulphuraria]
          Length = 655

 Score = 83.2 bits (204), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 64/217 (29%), Positives = 111/217 (51%), Gaps = 30/217 (13%)

Query: 475 DGNDVTLVAQLSMDRLQMVEMLFKHWE-GPI-SLTLYMSDAEAQQFLSYAGNSEALRSRR 532
           + +++TL   LSM R+ +++ L K ++ GP+ ++    S+ E ++ L Y         R 
Sbjct: 384 NKSELTLAIPLSMHRMFILKELLKFYKTGPVVAVIAIQSEQEKKELLEYLSIIPN-NMRM 442

Query: 533 NIGYHVVYKE--GNFYPINTLRNVALNQVSTPYVFLLDIDFLPM---FGLYPYLKSSIRS 587
           N+ + +V      + +PIN LRN+AL   +T +V +LD+D  P+   F  +P +    + 
Sbjct: 443 NVEFVIVLISPWKDRFPINQLRNIALKYTTTDFVCVLDVDTFPVSSAFTAFPQIME--KE 500

Query: 588 MDMHGHGGKKVLVV-------------PAFE---TQRYRTAFP---ASHAPTNFSRWVNA 628
            ++     K+ LVV             P+ E   ++  +T FP   AS +P +F RW+  
Sbjct: 501 PELLPMNRKRCLVVTNWIASDSEQLVYPSVEDLKSKYLKTWFPYCEASQSPISFRRWLYD 560

Query: 629 TTPYQIEWAPDFEPY-IVAHRDLPRYDTRFVGFGWNK 664
              Y + + P+FEPY I+  RD   +D RF G+G+NK
Sbjct: 561 NNSYFVSFRPNFEPYCIMRTRDFILFDERFHGYGFNK 597


>gi|332249596|ref|XP_003273944.1| PREDICTED: N-acetyllactosaminide
           beta-1,3-N-acetylglucosaminyltransferase [Nomascus
           leucogenys]
          Length = 415

 Score = 83.2 bits (204), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 70/264 (26%), Positives = 104/264 (39%), Gaps = 79/264 (29%)

Query: 475 DGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQF---LSYAGNSEALRSR 531
           D NDV L    S+D L  +  L + WEGP+S++++ +  E  Q    L+YA +S     R
Sbjct: 91  DPNDVILATHASVDNLLHLSGLLERWEGPLSVSVFAATKEEAQLATVLAYALSSHCPDMR 150

Query: 532 RNIGYHVVY------------KEGNF------------------------------YPIN 549
             +  H+V             + G F                              YP N
Sbjct: 151 ARVAMHLVCPSRYEAAVPDPREPGEFALLRSCQEVFDKLARVAQPGINYALGTNVSYPNN 210

Query: 550 TLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRY 609
            LRN+A    +  Y  ++D+D +P  GL+  L+     +D     G   LVVPAF+ +R 
Sbjct: 211 LLRNLAREGAN--YALVIDVDMVPSEGLWRGLREM---LDQSNQWGGTALVVPAFQIRRA 265

Query: 610 RTA-----------------------FPASHAPTNFSRWVNATT------PYQIEWAPDF 640
           R                              APTN+SRWVN          Y + W   +
Sbjct: 266 RRMPMNKNELVQLYQVGEVRPFYYGLCTPCQAPTNYSRWVNLPEESLLRPAYVVPWQDPW 325

Query: 641 EPYIVAHRDLPRYDTRFVGFGWNK 664
           EP+ VA   +P +D RF  +G+N+
Sbjct: 326 EPFYVAGGKVPTFDERFRQYGFNR 349


>gi|109939728|sp|Q5EA01.2|B3GN1_BOVIN RecName: Full=N-acetyllactosaminide
           beta-1,3-N-acetylglucosaminyltransferase; AltName:
           Full=I-beta-1,3-N-acetylglucosaminyltransferase;
           Short=iGnT; AltName: Full=Poly-N-acetyllactosamine
           extension enzyme; AltName: Full=UDP-GlcNAc:betaGal
           beta-1,3-N-acetylglucosaminyltransferase 1
 gi|75773634|gb|AAI05207.1| UDP-GlcNAc:betaGal beta-1,3-N-acetylglucosaminyltransferase 2 [Bos
           taurus]
 gi|296471480|tpg|DAA13595.1| TPA: N-acetyllactosaminide beta-1,3-N-acetylglucosaminyltransferase
           [Bos taurus]
 gi|440899377|gb|ELR50680.1| N-acetyllactosaminide beta-1,3-N-acetylglucosaminyltransferase [Bos
           grunniens mutus]
          Length = 415

 Score = 82.8 bits (203), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 72/265 (27%), Positives = 104/265 (39%), Gaps = 81/265 (30%)

Query: 475 DGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQF---LSYAGNSEALRSR 531
           D NDV L    S+D L  +  L + WEGP+S++++ +  E  Q    L+YA +S     R
Sbjct: 91  DPNDVILATHASVDNLLHLSGLLERWEGPLSVSVFAATKEEAQLATVLTYALSSHCPDMR 150

Query: 532 RNIGYHVVY------------KEGNF------------------------------YPIN 549
             +  H+V             + G F                              YP N
Sbjct: 151 ARVAMHLVCPSRYEAAVPDPREPGEFALLRSCQEVFDKLARVAQPGVNYALGTNVSYPNN 210

Query: 550 TLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSM-DMHGHGGKKVLVVPAFETQR 608
            LRN+A    +  Y  ++D+D +P  GL+     S+R M D         LVVPAFE +R
Sbjct: 211 LLRNLAREGAN--YALVIDVDMVPSEGLW----RSLREMLDQSKQWAGTALVVPAFEIRR 264

Query: 609 YRTA-----------------------FPASHAPTNFSRWVNATT------PYQIEWAPD 639
            R                              APTN+SRWVN          Y + W   
Sbjct: 265 ARRMPMNKNELLQLYQVGEVRPFYYGLCTPCQAPTNYSRWVNLPEETLLRPAYVVPWQDP 324

Query: 640 FEPYIVAHRDLPRYDTRFVGFGWNK 664
           +EP+ VA   +P +D RF  +G+N+
Sbjct: 325 WEPFYVAGGKVPTFDERFRQYGFNR 349


>gi|77736590|ref|NP_001029980.1| N-acetyllactosaminide beta-1,3-N-acetylglucosaminyltransferase [Bos
           taurus]
 gi|59857901|gb|AAX08785.1| beta-1,3-N-acetylglucosaminyltransferase bGnT-6 [Bos taurus]
          Length = 415

 Score = 82.8 bits (203), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 72/265 (27%), Positives = 104/265 (39%), Gaps = 81/265 (30%)

Query: 475 DGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQF---LSYAGNSEALRSR 531
           D NDV L    S+D L  +  L + WEGP+S++++ +  E  Q    L+YA +S     R
Sbjct: 91  DPNDVILATHASVDNLLHLSGLLERWEGPLSVSVFAATKEEAQLATVLTYALSSHCPDMR 150

Query: 532 RNIGYHVVY------------KEGNF------------------------------YPIN 549
             +  H+V             + G F                              YP N
Sbjct: 151 ARVAMHLVCPSRYEAAVPDPREPGEFAPLRSCQEVFDKLARVAQPGVNYALGTNVSYPNN 210

Query: 550 TLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSM-DMHGHGGKKVLVVPAFETQR 608
            LRN+A    +  Y  ++D+D +P  GL+     S+R M D         LVVPAFE +R
Sbjct: 211 LLRNLAREGAN--YALVIDVDMVPSEGLW----RSLREMLDQSKQWAGTALVVPAFEIRR 264

Query: 609 YRTA-----------------------FPASHAPTNFSRWVNATT------PYQIEWAPD 639
            R                              APTN+SRWVN          Y + W   
Sbjct: 265 ARRMPMNKNELLQLYQVGEVRPFYYGLCTPCQAPTNYSRWVNLPEETLLRPAYVVPWQDP 324

Query: 640 FEPYIVAHRDLPRYDTRFVGFGWNK 664
           +EP+ VA   +P +D RF  +G+N+
Sbjct: 325 WEPFYVAGGKVPTFDERFRQYGFNR 349


>gi|61553937|gb|AAX46482.1| beta-1,3-N-acetylglucosaminyltransferase bGnT-6 [Bos taurus]
          Length = 388

 Score = 82.8 bits (203), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 72/265 (27%), Positives = 104/265 (39%), Gaps = 81/265 (30%)

Query: 475 DGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQF---LSYAGNSEALRSR 531
           D NDV L    S+D L  +  L + WEGP+S++++ +  E  Q    L+YA +S     R
Sbjct: 91  DPNDVILATHASVDNLLHLSGLLERWEGPLSVSVFAATKEEAQLATVLTYALSSHCPDMR 150

Query: 532 RNIGYHVVY------------KEGNF------------------------------YPIN 549
             +  H+V             + G F                              YP N
Sbjct: 151 ARVAMHLVCPSRYEAAVPDPREPGEFALLRSCQEVFDKLARVAQPGVNYALGTNVSYPNN 210

Query: 550 TLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSM-DMHGHGGKKVLVVPAFETQR 608
            LRN+A    +  Y  ++D+D +P  GL+     S+R M D         LVVPAFE +R
Sbjct: 211 LLRNLAREGAN--YALVIDVDMVPSEGLW----RSLREMLDQSKQWAGTALVVPAFEIRR 264

Query: 609 YRTA-----------------------FPASHAPTNFSRWVNATT------PYQIEWAPD 639
            R                              APTN+SRWVN          Y + W   
Sbjct: 265 ARRMPMNKNELLQLYQVGEVRPFYYGLCTPCQAPTNYSRWVNLPEETLLRPAYVVPWQDP 324

Query: 640 FEPYIVAHRDLPRYDTRFVGFGWNK 664
           +EP+ VA   +P +D RF  +G+N+
Sbjct: 325 WEPFYVAGGKVPTFDERFRQYGFNR 349


>gi|67970700|dbj|BAE01692.1| unnamed protein product [Macaca fascicularis]
          Length = 415

 Score = 82.8 bits (203), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 70/262 (26%), Positives = 103/262 (39%), Gaps = 79/262 (30%)

Query: 477 NDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQF---LSYAGNSEALRSRRN 533
           NDV L    S+D L  +  L + WEGP+S++++ +  E  Q    L+YA +S     R  
Sbjct: 93  NDVILATHASVDNLLHLSGLLERWEGPLSVSVFAATKEEAQLATVLAYALSSHCPDMRAR 152

Query: 534 IGYHVVY------------KEGNF------------------------------YPINTL 551
           +  H+V             + G F                              YP N L
Sbjct: 153 VAMHLVCPSRYEAAVPDPREPGEFALLRSCQEVFDKLARVAQPGINYALGTNVSYPNNLL 212

Query: 552 RNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRYRT 611
           RN+A    +  Y  ++D+D +P  GL+  L+     +D     G   LVVPAFE +R R 
Sbjct: 213 RNLAREGAN--YALVIDVDMVPSEGLWRGLREM---LDQSNQWGGTALVVPAFEIRRARR 267

Query: 612 A-----------------------FPASHAPTNFSRWVNATT------PYQIEWAPDFEP 642
                                        APTN+SRWVN          Y + W   +EP
Sbjct: 268 MPMNKNELVQLYQVGEVRPFYYGLCTPCQAPTNYSRWVNLPEESLLRPAYVVPWQDPWEP 327

Query: 643 YIVAHRDLPRYDTRFVGFGWNK 664
           + VA   +P +D RF  +G+N+
Sbjct: 328 FYVAGGKVPTFDERFRQYGFNR 349


>gi|358336521|dbj|GAA55008.1| glycosyltransferase-like protein LARGE [Clonorchis sinensis]
          Length = 213

 Score = 82.8 bits (203), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 44/160 (27%), Positives = 80/160 (50%), Gaps = 29/160 (18%)

Query: 13  EVIQVAIVCAGYNSTRSLVTLIKSILFYRK-----------------------------N 43
           + I +AI+ +G N    L+  +K++L+Y+                              N
Sbjct: 43  DTIDIAIILSGPNQEYQLMNFLKNLLYYQGRFNSSSEECKLNYTVQRDYVCPKQLTVLYN 102

Query: 44  PLHFHLITDTVALNILQTLFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLT 103
           PL+ HLI D++++  +  +  TW +  VE +FY  DS++  V  IP+ H +G+  L K+ 
Sbjct: 103 PLNIHLILDSLSMEFILNVIHTWRIRDVEFTFYHFDSLMRYVQAIPDIHMAGIAALAKIA 162

Query: 104 LPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQT 143
           +P+V+   + K I+LD+D++F+ +I  LW  F    + Q 
Sbjct: 163 VPEVISSGVNKVILLDSDMLFSENINLLWKEFENFNETQV 202



 Score = 53.5 bits (127), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 24/69 (34%), Positives = 41/69 (59%)

Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
           FY  DS++  V  IP+ H +G+  L K+ +P+V+   + K I+LD+D++F+ +I  LW  
Sbjct: 134 FYHFDSLMRYVQAIPDIHMAGIAALAKIAVPEVISSGVNKVILLDSDMLFSENINLLWKE 193

Query: 232 FSRLRQRQT 240
           F    + Q 
Sbjct: 194 FENFNETQV 202


>gi|90076258|dbj|BAE87809.1| unnamed protein product [Macaca fascicularis]
 gi|355566281|gb|EHH22660.1| N-acetyllactosaminide beta-1,3-N-acetylglucosaminyltransferase
           [Macaca mulatta]
 gi|380788399|gb|AFE66075.1| N-acetyllactosaminide beta-1,3-N-acetylglucosaminyltransferase
           [Macaca mulatta]
 gi|383411805|gb|AFH29116.1| N-acetyllactosaminide beta-1,3-N-acetylglucosaminyltransferase
           [Macaca mulatta]
 gi|384943476|gb|AFI35343.1| N-acetyllactosaminide beta-1,3-N-acetylglucosaminyltransferase
           [Macaca mulatta]
          Length = 415

 Score = 82.8 bits (203), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 70/264 (26%), Positives = 104/264 (39%), Gaps = 79/264 (29%)

Query: 475 DGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQF---LSYAGNSEALRSR 531
           + NDV L    S+D L  +  L + WEGP+S++++ +  E  Q    L+YA +S     R
Sbjct: 91  EPNDVILATHASVDNLLHLSGLLERWEGPLSVSVFAATKEEAQLATVLAYALSSHCPDMR 150

Query: 532 RNIGYHVVY------------KEGNF------------------------------YPIN 549
             +  H+V             + G F                              YP N
Sbjct: 151 ARVAMHLVCPSRYEAAVPDPREPGEFALLRSCQEVFDKLARVAQPGINYALGTNVSYPNN 210

Query: 550 TLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRY 609
            LRN+A    +  Y  ++D+D +P  GL+  L+     +D     G   LVVPAFE +R 
Sbjct: 211 LLRNLAREGAN--YALVIDVDMVPSEGLWRGLREM---LDQSNQWGGTALVVPAFEIRRA 265

Query: 610 RTA-----------------------FPASHAPTNFSRWVNATT------PYQIEWAPDF 640
           R                              APTN+SRWVN          Y + W   +
Sbjct: 266 RRMPMNKNELVQLYQVGEVRPFYYGLCTPCQAPTNYSRWVNLPEESLLRPAYVVPWQDPW 325

Query: 641 EPYIVAHRDLPRYDTRFVGFGWNK 664
           EP+ VA   +P +D RF  +G+N+
Sbjct: 326 EPFYVAGGKVPTFDERFRQYGFNR 349


>gi|432854536|ref|XP_004067949.1| PREDICTED: xyloside xylosyltransferase 1-like [Oryzias latipes]
          Length = 384

 Score = 82.4 bits (202), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 54/180 (30%), Positives = 86/180 (47%), Gaps = 22/180 (12%)

Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
           F + +++ +  S     +YS     L + +  ++PE+L   + LD D+ + T+I  L+  
Sbjct: 173 FPIVEAMQKHFSAGSGAYYSDAIFFLSVAMHHIMPESLTHIVQLDLDLKYRTNIRNLFQE 232

Query: 232 FSRLRQRQTIGLVENQSDWYLGKLWKNHK--PWPALGR-------GFNTGVILLDLTKLR 282
           F +      IG++      Y    W+  K  P   +G        GFN+GV+LLDL  +R
Sbjct: 233 FDKFPPGAVIGIIREMQPVYRHTFWQYRKENPQTKVGEPPPDGLPGFNSGVMLLDLQGMR 292

Query: 283 DISWAGFWRII--------AEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQL 334
             + A + +++        A+K+   R    L DQD F  I  EHP L Y+LPC WN QL
Sbjct: 293 --ASALYNQLLEPSNVAKLADKY---RFRGHLGDQDFFTMIGMEHPNLFYSLPCGWNRQL 347



 Score = 44.7 bits (104), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 30/157 (19%), Positives = 66/157 (42%), Gaps = 14/157 (8%)

Query: 29  SLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTW---SVPQVEVSFY--------- 76
           ++++++K   F +   L  H ++D  +  + Q +   +   +  + EV F+         
Sbjct: 113 AMLSMVKHGHFLKGEVLVLHFVSDKASQQLGQRMLQDFLQDATFKYEVLFHDVAELTQTL 172

Query: 77  --LADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 134
             + +++ +  S     +YS     L + +  ++PE+L   + LD D+ + T+I  L+  
Sbjct: 173 FPIVEAMQKHFSAGSGAYYSDAIFFLSVAMHHIMPESLTHIVQLDLDLKYRTNIRNLFQE 232

Query: 135 FSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRG 171
           F +      IG++      Y    W+  K  P    G
Sbjct: 233 FDKFPPGAVIGIIREMQPVYRHTFWQYRKENPQTKVG 269


>gi|194374501|dbj|BAG57146.1| unnamed protein product [Homo sapiens]
          Length = 408

 Score = 82.4 bits (202), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 70/264 (26%), Positives = 103/264 (39%), Gaps = 79/264 (29%)

Query: 475 DGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQF---LSYAGNSEALRSR 531
           D NDV L    S+D L  +  L + WEGP+S++++ +  E  Q    L+YA +S     R
Sbjct: 84  DPNDVILATHASVDNLLHLSGLLERWEGPLSVSVFAATKEEAQLATVLAYALSSHCPDMR 143

Query: 532 RNIGYHVVY------------KEGNF------------------------------YPIN 549
             +  H+V             + G F                              YP N
Sbjct: 144 ARVAMHLVCPSRYEAAVPDPREPGEFALLRSCQEVFDKLARVAQPGINYALGTNVSYPNN 203

Query: 550 TLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRY 609
            LRN+A    +  Y  ++D+D +P  GL+  L+     +D     G   LVVP FE +R 
Sbjct: 204 LLRNLAREGAN--YALVIDVDMVPSEGLWRGLREM---LDQSNQWGGTALVVPTFEIRRA 258

Query: 610 RTA-----------------------FPASHAPTNFSRWVNATT------PYQIEWAPDF 640
           R                              APTN+SRWVN          Y + W   +
Sbjct: 259 RRMPMNKNELVQLYQVGEVRPFYYGLCTPCQAPTNYSRWVNLPEESLLRPAYVVPWQDPW 318

Query: 641 EPYIVAHRDLPRYDTRFVGFGWNK 664
           EP+ VA   +P +D RF  +G+N+
Sbjct: 319 EPFYVAGGKVPTFDERFRQYGFNR 342


>gi|198434732|ref|XP_002132013.1| PREDICTED: similar to predicted protein [Ciona intestinalis]
          Length = 444

 Score = 82.4 bits (202), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 71/268 (26%), Positives = 108/268 (40%), Gaps = 84/268 (31%)

Query: 477 NDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYM---SDAEAQQFLSYAGNSEALRSRRN 533
           NDVTLV Q S++ L  +    + W+GPIS+ L++   S   A+  + +      LR    
Sbjct: 79  NDVTLVTQCSVNNLYHLIETLQLWKGPISVALFVPGSSIIHAELAIKFIKQCFPLRK--- 135

Query: 534 IGYHVVYKE--------------------------------------------GNFYPIN 549
           + +H+VY E                                            G  YP N
Sbjct: 136 VQFHLVYPEHHTPTFDGNNLEIWVGLLFANDRLRCKDLFETLSSIEGENYATWGIPYPQN 195

Query: 550 TLRNVALNQVSTPYVFLLDIDFLPMFGLYP-YLKSSIRSMDMHGHGGKKVLVV-PAFETQ 607
           TLRNVA   V T Y+ +LDID +P +GL   +L+  +R+   +     K+  + PAFE +
Sbjct: 196 TLRNVAKLAVQTEYLIVLDIDVMPSYGLRENFLEKVLRNRGRNKTAETKIAYIPPAFELK 255

Query: 608 ------------------------RYRTAFPASHAPTNFSRW------VNATTPYQIEWA 637
                                    Y T  P+ H P  +  W      +N    Y+ EW 
Sbjct: 256 INEQMPENKALLLTAWKSNQIRPFHYETC-PSCHKPARYDEWKAISTNLNLGIAYEAEWQ 314

Query: 638 PDFEPYIVAHRD-LPRYDTRFVGFGWNK 664
           P +EP+ +AH    P YD RF  +G+++
Sbjct: 315 PSWEPFYIAHTSHTPDYDERFKQYGYDR 342


>gi|158299248|ref|XP_319364.4| AGAP010187-PA [Anopheles gambiae str. PEST]
 gi|157014274|gb|EAA13820.4| AGAP010187-PA [Anopheles gambiae str. PEST]
          Length = 376

 Score = 82.4 bits (202), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 72/248 (29%), Positives = 113/248 (45%), Gaps = 32/248 (12%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L + L   + +D+D +F + + ++WA+F +    Q  G+     D   G  W 
Sbjct: 151 RLFLPSLL-QHLDAVLYVDSDTVFLSPVQEIWAMFQQFNATQFAGMAPEHEDMNAG--WY 207

Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAI 315
           N     P  G  G N+GV+L++LT++R+  W      I  ++ L  +W    DQDI N +
Sbjct: 208 NRFARHPYYGELGVNSGVMLMNLTRMREFRWEEHILPIYREYRLQLVW---GDQDILNVL 264

Query: 316 ISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTD----LKIIHWNSPKKLKVKNKHMEF 371
              +P  +Y  PC WN       R+D   Y  + D    +KIIH N   +    ++    
Sbjct: 265 FHYNPDRLYVFPCDWNY------RADHCMYMSVCDAPDGVKIIHGN---RGYFHSRAQPI 315

Query: 372 FRNLYLTFLEYDGNLLRRELFGCNLTQTTLQQAEL-SNLNEEDPCYDLRRSKLTSLRTHL 430
           F  +Y T  E+     R +++G N  +T  +   L SN N    C  L    L   R   
Sbjct: 316 FNLIYSTIEEF---TFRTDVYG-NFIRTIEESLVLPSNSN----CDKLLDKFLQDPRK-- 365

Query: 431 YFLEYEYE 438
           YF E +YE
Sbjct: 366 YFKENQYE 373



 Score = 39.3 bits (90), Expect = 7.8,   Method: Compositional matrix adjust.
 Identities = 37/155 (23%), Positives = 66/155 (42%), Gaps = 17/155 (10%)

Query: 15  IQVAIVCAGYNSTRSLVTLIKSILFYR-KNPLHFHLITD-TVALNILQTLFSTWSVPQVE 72
           I V +V        +L  +  +I+F R ++PL F +I +  + LN ++ L + W      
Sbjct: 63  ISVVVVACNERHNEALNMIKSAIIFNRNQSPLRFVIIAEEKLKLNFMEKL-NDWQEVTDR 121

Query: 73  VSFYLADSVVEDVSWIPNKHYSGVYGLLK------LTLPKVLPETLAKTIVLDTDVIFAT 126
           +  Y   S+       P  +      L K      L LP +L + L   + +D+D +F +
Sbjct: 122 IFTYEIHSLT-----FPRANKDEWRKLFKPCAAQRLFLPSLL-QHLDAVLYVDSDTVFLS 175

Query: 127 DIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKN 161
            + ++WA+F +    Q  G+     D   G  W N
Sbjct: 176 PVQEIWAMFQQFNATQFAGMAPEHEDMNAG--WYN 208


>gi|348535861|ref|XP_003455416.1| PREDICTED: uncharacterized protein C3orf21 homolog [Oreochromis
           niloticus]
          Length = 394

 Score = 82.0 bits (201), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 54/177 (30%), Positives = 83/177 (46%), Gaps = 16/177 (9%)

Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
           F + +++ +  S     +YS     L + +  ++PE+L + + LD D+ + T+I  L+  
Sbjct: 183 FPIVEAMQKHFSAGSGAYYSDAIFFLSVAMHHIMPESLTRIVQLDLDLKYRTNIRDLFQE 242

Query: 232 FSRLRQRQTIGLVENQSDWYLGKLW--KNHKPWPALGR-------GFNTGVILLDLTKLR 282
           F R      IG+       Y    W  +  KP   +G        GFN+GV+LLDL  +R
Sbjct: 243 FHRFPPGAVIGITREMQPVYRHTFWQYRKEKPRTRVGEPPPDGLPGFNSGVMLLDLGAMR 302

Query: 283 DISWAGFWRI-----IAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQL 334
             S A + ++     +A+     R    L DQD F  I  EHP L Y+L C WN QL
Sbjct: 303 --SSALYNKLLEPSNVAKLADQYRFRGHLGDQDFFTMIGMEHPELFYSLACGWNRQL 357



 Score = 45.4 bits (106), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 33/153 (21%), Positives = 65/153 (42%), Gaps = 16/153 (10%)

Query: 29  SLVTLIKSILFYRKNPLHFHLITDTVALNI----LQTLFSTWSVPQVEVSFY-------- 76
           ++++++K   F     L  H ++D  +  +    LQ LF   +  + EV F+        
Sbjct: 123 AMLSMVKHGHFLEGEVLVLHFVSDQASQELGKRMLQELFLDATF-KYEVVFHDVVALTQK 181

Query: 77  ---LADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWA 133
              + +++ +  S     +YS     L + +  ++PE+L + + LD D+ + T+I  L+ 
Sbjct: 182 LFPIVEAMQKHFSAGSGAYYSDAIFFLSVAMHHIMPESLTRIVQLDLDLKYRTNIRDLFQ 241

Query: 134 LFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWP 166
            F R      IG+       Y    W+  K  P
Sbjct: 242 EFHRFPPGAVIGITREMQPVYRHTFWQYRKEKP 274


>gi|344295836|ref|XP_003419616.1| PREDICTED: N-acetyllactosaminide
           beta-1,3-N-acetylglucosaminyltransferase [Loxodonta
           africana]
          Length = 415

 Score = 82.0 bits (201), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 72/265 (27%), Positives = 105/265 (39%), Gaps = 81/265 (30%)

Query: 475 DGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQF---LSYAGNSEALRSR 531
           D NDV L    S+D L  +  L + WEGP+S++++ +  E  Q    L+YA +S     R
Sbjct: 91  DPNDVILATHASVDNLLHLSGLLERWEGPMSVSVFAATKEEAQLATVLAYALSSHCPDMR 150

Query: 532 RNIGYHVVY------------KEGNF------------------------------YPIN 549
             +  H+V             + G F                              YP N
Sbjct: 151 SRVAMHLVCPSRYEAAVPDPREPGEFALLRSCQEVFDKLARVAQPGINYALGTNVSYPNN 210

Query: 550 TLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRY 609
            LRN+A    +  Y  ++D+D +P  GL+  L+     +D         LVVPAFE +R 
Sbjct: 211 LLRNLAREGAN--YALVIDVDMVPSEGLWRGLREL---LDQSRQWAGTALVVPAFEIRRA 265

Query: 610 RTAFPAS------------------------HAPTNFSRWVNATT------PYQIEWAPD 639
           R   P S                         APTN+SRWVN          Y + W   
Sbjct: 266 RR-MPVSKTELVQLYQVGEVRPFYYGLCTPCQAPTNYSRWVNLPEESLLRPSYVVPWQDP 324

Query: 640 FEPYIVAHRDLPRYDTRFVGFGWNK 664
           +EP+ VA   +P +D RF  +G+N+
Sbjct: 325 WEPFYVAGGKVPPFDERFRQYGFNR 349


>gi|72006259|ref|XP_782394.1| PREDICTED: N-acetyllactosaminide
           beta-1,3-N-acetylglucosaminyltransferase-like
           [Strongylocentrotus purpuratus]
          Length = 441

 Score = 82.0 bits (201), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 69/271 (25%), Positives = 105/271 (38%), Gaps = 84/271 (30%)

Query: 478 DVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEA--LRSRRNIG 535
           D+TL+   SM+ L  V  L   W+GPISL ++ S ++ +  +SY  + +   L  RRN+ 
Sbjct: 103 DITLITHCSMNNLHHVPDLVDRWKGPISLAVFGSLSDVKTIVSYIMSLKECILGVRRNVT 162

Query: 536 YHVVY----------------------------------------------KEGNFYPIN 549
           +H+V+                                               +G  YP N
Sbjct: 163 FHLVHPVNSVSEEAEIFGSVNNKRSIRVPLIHCSKLMEAIKASYSSVSNYAADGILYPNN 222

Query: 550 TLRNVALNQVSTPYVFLLDIDFLPMFGLYP----YLKSSIRSMDMHGHGGKKVLVVPAFE 605
            LRNV    + T + F+LDID LP  GL      ++K S           + V V+PAFE
Sbjct: 223 LLRNVGRANILTEFHFVLDIDMLPSVGLRENFLNFVKGSSSLWLDEDMVQQMVFVLPAFE 282

Query: 606 T-------------------------QRYRTAFPASHAPTNFSRWVNATT-------PYQ 633
                                     Q Y  A       T++ RW +  +        Y 
Sbjct: 283 MDDRAVSKVPGDKKELLSYLEKDRVRQFYVKACLYCQNHTDYRRWTSLQSFTGQLGVAYT 342

Query: 634 IEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           +EW P +EP+ ++  D P YD RF  +G+N+
Sbjct: 343 VEWKPMWEPFYISRSDAPLYDERFRQYGYNR 373


>gi|291237248|ref|XP_002738549.1| PREDICTED: like-glycosyltransferase-like [Saccoglossus kowalevskii]
          Length = 715

 Score = 82.0 bits (201), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 56/171 (32%), Positives = 84/171 (49%), Gaps = 19/171 (11%)

Query: 180 EDVSWIPNKHYSGVYGLLKLTLPK-VLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQR 238
           E +S   +K+Y      L + L K +LP  + K I++D+D+ F TDI +L+  F      
Sbjct: 511 EHISTGKHKYYMEAIFFLSVALHKNILPSEVKKLIMIDSDLKFRTDIKKLFQHFDNFETG 570

Query: 239 QTIGLVENQSDWYLG--KLWKNHKPWPALG-------RGFNTGVILLDLTKLRD-ISWAG 288
             IG+   Q   Y     L+++  P   +G        GFN+GV+LLDL +++  IS+  
Sbjct: 571 NVIGIAHEQQPVYRHTFSLYRSQNPGTLVGDPPPNGLSGFNSGVMLLDLERMQSSISYNA 630

Query: 289 FW-----RIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQL 334
           F      R +++K+        L DQD F  I  E+  L Y LPC WN QL
Sbjct: 631 FLDVDVVRALSDKY---SFKGHLGDQDFFTLISLENKNLFYVLPCTWNRQL 678



 Score = 41.2 bits (95), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 26/90 (28%), Positives = 43/90 (47%), Gaps = 3/90 (3%)

Query: 83  EDVSWIPNKHYSGVYGLLKLTLPK-VLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQR 141
           E +S   +K+Y      L + L K +LP  + K I++D+D+ F TDI +L+  F      
Sbjct: 511 EHISTGKHKYYMEAIFFLSVALHKNILPSEVKKLIMIDSDLKFRTDIKKLFQHFDNFETG 570

Query: 142 QTIGLVENQSDWYLG--KLWKNHKPWPALG 169
             IG+   Q   Y     L+++  P   +G
Sbjct: 571 NVIGIAHEQQPVYRHTFSLYRSQNPGTLVG 600


>gi|417400521|gb|JAA47198.1| Putative glycosyltransferase [Desmodus rotundus]
          Length = 415

 Score = 81.6 bits (200), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 70/264 (26%), Positives = 103/264 (39%), Gaps = 79/264 (29%)

Query: 475 DGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQF---LSYAGNSEALRSR 531
           D NDV L    S+D L  +  L + WEGP+S++++ +  E  Q    L+YA +S     R
Sbjct: 91  DPNDVILATHASVDNLLHLSGLLERWEGPLSVSVFAATKEEAQLATVLTYALSSHCPDMR 150

Query: 532 RNIGYHVVY------------KEGNF------------------------------YPIN 549
             +  H+V             + G F                              YP N
Sbjct: 151 ARVAMHLVCPSRYEAAVPDPREPGEFALLRSCQEVFDKLARVAQPGINYALGTNVSYPNN 210

Query: 550 TLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRY 609
            LRN+A    +  Y  ++D+D +P  GL+  L+     +D         LVVPAFE +R 
Sbjct: 211 LLRNLAREGAN--YALVIDVDMVPSEGLWRGLREM---LDQSKQWAGTALVVPAFEIRRA 265

Query: 610 RTA-----------------------FPASHAPTNFSRWVNATT------PYQIEWAPDF 640
           R                              APTN+SRWVN          Y + W   +
Sbjct: 266 RRMPTNKNELLQLYQVGEVRPFYYGLCTPCQAPTNYSRWVNLPEETLLRPAYVVPWQDPW 325

Query: 641 EPYIVAHRDLPRYDTRFVGFGWNK 664
           EP+ VA   +P +D RF  +G+N+
Sbjct: 326 EPFYVAGGKVPTFDERFRQYGFNR 349


>gi|311247200|ref|XP_003122534.1| PREDICTED: N-acetyllactosaminide
           beta-1,3-N-acetylglucosaminyltransferase-like [Sus
           scrofa]
          Length = 415

 Score = 81.6 bits (200), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 70/264 (26%), Positives = 103/264 (39%), Gaps = 79/264 (29%)

Query: 475 DGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQF---LSYAGNSEALRSR 531
           D NDV L    S+D L  +  L + WEGP+S++++ +  E  Q    L+YA +S     R
Sbjct: 91  DPNDVILATHASVDNLLHLSGLLERWEGPLSVSVFAATKEEAQLATVLTYALSSHCPDMR 150

Query: 532 RNIGYHVVY------------KEGNF------------------------------YPIN 549
             +  H+V             + G F                              YP N
Sbjct: 151 ARVAMHLVCPSRYEAAVPDPREPGEFALLRSCQEVFDKLARVAQPGINYALGTNVSYPNN 210

Query: 550 TLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRY 609
            LRN+A    +  Y  ++D+D +P  GL+  L+     +D         LVVPAFE +R 
Sbjct: 211 LLRNLAREGAN--YALVIDVDMVPSEGLWRGLREM---LDQSKQWAGTALVVPAFEIRRA 265

Query: 610 RTA-----------------------FPASHAPTNFSRWVNATT------PYQIEWAPDF 640
           R                              APTN+SRWVN          Y + W   +
Sbjct: 266 RRMPTNKNELLQLYQVGEVRPFYYGLCTPCQAPTNYSRWVNLPEETLLRPSYVVPWQDPW 325

Query: 641 EPYIVAHRDLPRYDTRFVGFGWNK 664
           EP+ VA   +P +D RF  +G+N+
Sbjct: 326 EPFYVAGGKVPTFDERFRQYGFNR 349


>gi|338712409|ref|XP_001916917.2| PREDICTED: LOW QUALITY PROTEIN: n-acetyllactosaminide
           beta-1,3-N-acetylglucosaminyltransferase-like [Equus
           caballus]
          Length = 396

 Score = 81.6 bits (200), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 70/264 (26%), Positives = 103/264 (39%), Gaps = 79/264 (29%)

Query: 475 DGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQF---LSYAGNSEALRSR 531
           D NDV L    S+D L  +  L + WEGP+S++++ +  E  Q    L+YA +S     R
Sbjct: 72  DPNDVILATHASVDNLLHLSGLLERWEGPLSVSVFAATKEEAQLATVLTYALSSHCPDMR 131

Query: 532 RNIGYHVVY------------KEGNF------------------------------YPIN 549
             +  H+V             + G F                              YP N
Sbjct: 132 ARVAMHLVCPSRYEAAVPDPREPGEFALLRSCQEVFDKLARVAQPGINYALGTNVSYPNN 191

Query: 550 TLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRY 609
            LRN+A    +  Y  ++D+D +P  GL+  L+     +D         LVVPAFE +R 
Sbjct: 192 LLRNLAREGAN--YALVIDVDMVPSEGLWRGLREM---LDQSKQWAGTALVVPAFEIRRA 246

Query: 610 RTA-----------------------FPASHAPTNFSRWVNATT------PYQIEWAPDF 640
           R                              APTN+SRWVN          Y + W   +
Sbjct: 247 RRMPTDKNELLQLYQVGEVRPFYYGLCTPCQAPTNYSRWVNLPEETLLRPAYVVPWQDPW 306

Query: 641 EPYIVAHRDLPRYDTRFVGFGWNK 664
           EP+ VA   +P +D RF  +G+N+
Sbjct: 307 EPFYVAGGKVPTFDERFRQYGFNR 330


>gi|391339560|ref|XP_003744116.1| PREDICTED: glycosyltransferase-like protein LARGE2-like
           [Metaseiulus occidentalis]
          Length = 286

 Score = 81.6 bits (200), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 66/217 (30%), Positives = 97/217 (44%), Gaps = 41/217 (18%)

Query: 487 MDRLQMVEMLFKHWEGPISLTLY-----MSDAEAQ-QFLSYAGNSEALRSRRNIGYHVVY 540
           MDRL ++E     W+ P+SL +Y     + +A+   Q L  +    A +   +    +VY
Sbjct: 1   MDRLGVLEHTLDQWDMPVSLVIYAPLKALRNADDDWQRLYISKKVTAFKLHPSSVVCLVY 60

Query: 541 KEG--NFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKV 598
                  YPIN+LRN+A+ Q  + ++ L D DF P   L P +   IR+        K  
Sbjct: 61  ASSVSEQYPINSLRNIAIKQAHSQFLLLADADFQPSPDLAPKVLPLIRNRT--KKFLKSA 118

Query: 599 LVVPAFETQRY----------------------------RTAFPASHAPTNFSRWVNATT 630
           +V+PAFE                                R   P SHA T++ RW  A  
Sbjct: 119 VVIPAFEYADVPHKSDVVAANKHELMQLIHAQGIVCPFRRQDAPQSHAATDYWRWYTARE 178

Query: 631 PYQIEWAPD-FEPYIVAHRD--LPRYDTRFVGFGWNK 664
           PY++    D FEPY+V  ++  LP Y  +FVG+G NK
Sbjct: 179 PYEVHLFSDKFEPYLVIEKNAHLPLYWEKFVGYGMNK 215


>gi|301762478|ref|XP_002916660.1| PREDICTED: n-acetyllactosaminide
           beta-1,3-N-acetylglucosaminyltransferase-like
           [Ailuropoda melanoleuca]
 gi|281350599|gb|EFB26183.1| hypothetical protein PANDA_004757 [Ailuropoda melanoleuca]
          Length = 415

 Score = 81.6 bits (200), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 70/264 (26%), Positives = 103/264 (39%), Gaps = 79/264 (29%)

Query: 475 DGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQF---LSYAGNSEALRSR 531
           D NDV L    S+D L  +  L + WEGP+S++++ +  E  Q    L+YA +S     R
Sbjct: 91  DPNDVILATHASVDNLLHLSGLLERWEGPLSVSVFAATKEEAQLATVLTYALSSHCPDMR 150

Query: 532 RNIGYHVVY------------KEGNF------------------------------YPIN 549
             +  H+V             + G F                              YP N
Sbjct: 151 ARVALHLVCPSRYEAAVPDPREPGEFALLRSCQEVFDKLARVAQPGINYALGTNVSYPNN 210

Query: 550 TLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRY 609
            LRN+A    +  Y  ++D+D +P  GL+  L+     +D         LVVPAFE +R 
Sbjct: 211 LLRNLAREGAN--YALVIDVDMVPSEGLWRGLREM---LDQSKQWAGTALVVPAFEIRRA 265

Query: 610 RTA-----------------------FPASHAPTNFSRWVNATT------PYQIEWAPDF 640
           R                              APTN+SRWVN          Y + W   +
Sbjct: 266 RRMPTNKNELLQLYQVGEVRPFYYGLCTPCQAPTNYSRWVNLPEESLLRPAYVVPWQDPW 325

Query: 641 EPYIVAHRDLPRYDTRFVGFGWNK 664
           EP+ VA   +P +D RF  +G+N+
Sbjct: 326 EPFYVAGGKVPTFDERFRQYGFNR 349


>gi|327286348|ref|XP_003227892.1| PREDICTED: uncharacterized protein C3orf21 homolog, partial [Anolis
           carolinensis]
          Length = 236

 Score = 81.3 bits (199), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 50/164 (30%), Positives = 75/164 (45%), Gaps = 18/164 (10%)

Query: 187 NKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVEN 246
             +YS     L + + +++P  +++ I++D DV +   I +L+  F R  +   IG+   
Sbjct: 29  GTYYSDAIFFLSVAMHRIMPAEISRLILVDLDVEYRASIQELFEEFDRFPEGAVIGIAHE 88

Query: 247 QSDWYLGKLWKNHK---------PWPALGRGFNTGVILLDLTKLR------DISWAGFWR 291
               Y    W+  K         P P    GFN+GV+LLDL  +R       +   G  R
Sbjct: 89  MQPVYRHLFWQFRKEHPGTRVGDPPPEGLPGFNSGVLLLDLPAMRASGLYNGLLEEGPLR 148

Query: 292 IIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLS 335
            +A++F   R    L DQD F     EHP L + LPC WN QL 
Sbjct: 149 ALAQRF---RFRGHLGDQDFFTLAGMEHPGLFHRLPCAWNRQLC 189



 Score = 40.4 bits (93), Expect = 3.3,   Method: Compositional matrix adjust.
 Identities = 39/174 (22%), Positives = 67/174 (38%), Gaps = 20/174 (11%)

Query: 69  PQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 128
           P VE    +   + E  S     +YS     L + + +++P  +++ I++D DV +   I
Sbjct: 8   PLVERLLPVIGPMQELFSAGAGTYYSDAIFFLSVAMHRIMPAEISRLILVDLDVEYRASI 67

Query: 129 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFYLADSVVEDVSWIPNK 188
            +L+  F R  +   IG+       Y    W+  K  P    G  + D   E +    + 
Sbjct: 68  QELFEEFDRFPEGAVIGIAHEMQPVYRHLFWQFRKEHP----GTRVGDPPPEGLPGFNS- 122

Query: 189 HYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIG 242
                 G+L L LP +    L   ++ +           L AL  R R R  +G
Sbjct: 123 ------GVLLLDLPAMRASGLYNGLLEE---------GPLRALAQRFRFRGHLG 161


>gi|355733055|gb|AES10900.1| N-acetyllactosaminide beta-1 [Mustela putorius furo]
          Length = 427

 Score = 81.3 bits (199), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 71/265 (26%), Positives = 105/265 (39%), Gaps = 81/265 (30%)

Query: 475 DGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQF---LSYAGNSEALRSR 531
           D NDV L    S+D L  +  L + WEGP+S++++ +  E  Q    L+YA +S     R
Sbjct: 104 DPNDVILATHASVDNLLHLSGLLERWEGPLSVSVFAATKEEAQLATVLTYALSSHCPDMR 163

Query: 532 RNIGYHVVY------------KEGNF------------------------------YPIN 549
             +  H+V             + G F                              YP N
Sbjct: 164 ARVAMHLVCPSRYEAAVPDPREPGEFALLRSCQEVFDKLARVAQPGINYALGTNVSYPNN 223

Query: 550 TLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRY 609
            LRN+A    +  Y  ++D+D +P  GL+  L+     +D         LVVPAFE +R 
Sbjct: 224 LLRNLAREGAN--YALVIDVDMVPSEGLWRGLREM---LDQSKQWAGTALVVPAFEIRRA 278

Query: 610 RTAFPAS------------------------HAPTNFSRWVNAT------TPYQIEWAPD 639
           R   P +                         APTN+SRWVN          Y + W   
Sbjct: 279 RR-MPTNKNELLQLYQVGEVRPFYYGLCTPCQAPTNYSRWVNLPEESLLRPAYVVPWQDP 337

Query: 640 FEPYIVAHRDLPRYDTRFVGFGWNK 664
           +EP+ VA   +P +D RF  +G+N+
Sbjct: 338 WEPFYVAGGKVPTFDERFRQYGFNR 362


>gi|345783078|ref|XP_533222.3| PREDICTED: N-acetyllactosaminide
           beta-1,3-N-acetylglucosaminyltransferase [Canis lupus
           familiaris]
          Length = 415

 Score = 81.3 bits (199), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 70/264 (26%), Positives = 102/264 (38%), Gaps = 79/264 (29%)

Query: 475 DGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQF---LSYAGNSEALRSR 531
           D NDV L    S+D L  +  L + WEGP+S+ ++ +  E  Q    L+YA +S     R
Sbjct: 91  DPNDVILATHASVDNLLHLSGLLERWEGPLSVAVFAATKEEAQLATVLTYALSSHCPDMR 150

Query: 532 RNIGYHVVY------------KEGNF------------------------------YPIN 549
             +  H+V             + G F                              YP N
Sbjct: 151 ARVAMHLVCPSRYEAAVPDPREPGEFALLRSCQEVFDKLARVAQPGINYALGTNVSYPNN 210

Query: 550 TLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRY 609
            LRN+A    +  Y  ++D+D +P  GL+  L+     +D         LVVPAFE +R 
Sbjct: 211 LLRNLAREGAN--YALVIDVDMVPSEGLWRGLREM---LDQSKQWAGTALVVPAFEIRRA 265

Query: 610 RTA-----------------------FPASHAPTNFSRWVNATT------PYQIEWAPDF 640
           R                              APTN+SRWVN          Y + W   +
Sbjct: 266 RRMPTNKNELLQLYQVGEVRPFYYGLCTPCQAPTNYSRWVNLPEESLLRPAYVVPWQDPW 325

Query: 641 EPYIVAHRDLPRYDTRFVGFGWNK 664
           EP+ VA   +P +D RF  +G+N+
Sbjct: 326 EPFYVAGGKVPTFDERFRQYGFNR 349


>gi|384501051|gb|EIE91542.1| hypothetical protein RO3G_16253 [Rhizopus delemar RA 99-880]
          Length = 417

 Score = 81.3 bits (199), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 67/231 (29%), Positives = 111/231 (48%), Gaps = 40/231 (17%)

Query: 471 EASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMS-DAEAQQFLSYAGNSEALR 529
           E   +  D+T+V  ++ +R+  +  L K ++GPIS T+++S D E +  L     +    
Sbjct: 98  EMDMESKDITIVTLVTRNRIPNLARLAKKYKGPISTTIHISDDKEGEYTLEILAQTMEKN 157

Query: 530 S--RRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRS 587
              R+ +  H+V  E +   +N  RNVA     T YV LLDIDF P   +   + ++ ++
Sbjct: 158 PDMRKYVDVHIVRDEFD-RELNLWRNVAKLFARTEYVMLLDIDFYPCTNIRKNVLNNPKA 216

Query: 588 MDMHGHGGKKVLVVPAFETQR------YRTAFPAS-----------------------HA 618
           M +    G+  LV+PAFE  +      YRT FP +                       HA
Sbjct: 217 MTLL-ESGEAALVIPAFEFSKQEDGLDYRT-FPKNKPDLIKAYKENKIEMFHSFWLPGHA 274

Query: 619 PTNFSRWVNAT---TPYQI-EWAPDFEPYIV-AHRDLPRYDTRFVGFGWNK 664
           PT++ RW++     + YQ+  +   +EPY++  ++  P  D RF G+G NK
Sbjct: 275 PTDYDRWISVEQDDSIYQVTTYQHSYEPYVIFKNQGSPWCDERFAGYGSNK 325


>gi|313232704|emb|CBY19374.1| unnamed protein product [Oikopleura dioica]
          Length = 578

 Score = 80.9 bits (198), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 67/267 (25%), Positives = 108/267 (40%), Gaps = 74/267 (27%)

Query: 469 EYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEAL 528
           E E+  D  DVTLV+Q S D L  +E L K W+GPIS+ ++  D +A   +S     + L
Sbjct: 73  ETESGVDAKDVTLVSQCSPDNLHHIEGLMKAWKGPISIAVFSPDRDA---VSAMFGIQIL 129

Query: 529 RS-----RRNIGYHVVY-----------------------------KEG-NF------YP 547
           R      RR++ +H++Y                              EG N+      YP
Sbjct: 130 RKCNADIRRHVTFHIMYPSSHPPRFDETVEDFTDDECSIAIEQLKESEGKNYARGELPYP 189

Query: 548 INTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQ 607
            N+LRN+A  ++ T   +L+DID +P   L        +   +      +  + PAFE +
Sbjct: 190 QNSLRNIARVKIKTDLFYLVDIDTVPSNDLRYQFTKFAKKRGLLDQTRLEAFITPAFEIR 249

Query: 608 RYRT-----------------------AFPASHAPTNFSRWV------NATTPYQIEWAP 638
            +                           P  H PT F  W            ++IE++ 
Sbjct: 250 SHHNLPANKDNVIRLIDAGEMRPFHYDTCPYCHKPTEFDHWYGYRGSDELEDAFEIEYSN 309

Query: 639 DFEPYIVAH-RDLPRYDTRFVGFGWNK 664
            FEP+ +A+    P +D RF  +G+++
Sbjct: 310 SFEPFYLANTHTAPLHDERFKAYGFDR 336


>gi|195504526|ref|XP_002099117.1| GE10743 [Drosophila yakuba]
 gi|194185218|gb|EDW98829.1| GE10743 [Drosophila yakuba]
          Length = 340

 Score = 80.9 bits (198), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 48/162 (29%), Positives = 84/162 (51%), Gaps = 9/162 (5%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L   +   + +DTD++F + I+ +W  F +  + Q   L     +  +G  W 
Sbjct: 114 RLFLPSLLTH-VDSLLYVDTDILFLSPISDIWRFFKKFNETQMSALTPEHENENIG--WY 170

Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAI 315
           N     P  GR G N+GV+L++LT++R++ W      I +++ L  +W    DQDI N +
Sbjct: 171 NRFARHPFYGRLGVNSGVMLMNLTRMREMKWEQHIVSIHKEYKLRIIW---GDQDIINIL 227

Query: 316 ISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
              HP  +Y +PC++N +  D+     +C      +K+IH N
Sbjct: 228 FYYHPDKLYIMPCEYNYR-PDHCMYMSICNMTHAGVKVIHGN 268


>gi|195354760|ref|XP_002043864.1| GM17754 [Drosophila sechellia]
 gi|194129102|gb|EDW51145.1| GM17754 [Drosophila sechellia]
          Length = 340

 Score = 80.9 bits (198), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 48/162 (29%), Positives = 84/162 (51%), Gaps = 9/162 (5%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L   +   + +DTD++F + I+ +W  F +  + Q   L     +  +G  W 
Sbjct: 114 RLFLPSLLTH-VDSLLYVDTDILFLSPISDIWRFFKKFNETQMSALTPEHENENIG--WY 170

Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAI 315
           N     P  GR G N+GV+L++LT++R++ W      I +++ L  +W    DQDI N +
Sbjct: 171 NRFARHPFYGRLGVNSGVMLMNLTRMREMKWEQHIVSIHKEYKLRIIW---GDQDIINIL 227

Query: 316 ISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
              HP  +Y +PC++N +  D+     +C      +K+IH N
Sbjct: 228 FYYHPDKLYIMPCEYNYR-PDHCMYMSICNMSHAGVKVIHGN 268


>gi|348565043|ref|XP_003468313.1| PREDICTED: N-acetyllactosaminide
           beta-1,3-N-acetylglucosaminyltransferase-like [Cavia
           porcellus]
          Length = 415

 Score = 80.9 bits (198), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 70/264 (26%), Positives = 103/264 (39%), Gaps = 79/264 (29%)

Query: 475 DGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQF---LSYAGNSEALRSR 531
           D +DV L    S+D L  +  L + WEGP+S++++ +  E  Q    L+YA +S     R
Sbjct: 91  DPSDVILATHASVDNLLHLSGLLERWEGPLSISVFAATKEEVQLATVLAYALSSHCPDMR 150

Query: 532 RNIGYHVVY------------KEGNF------------------------------YPIN 549
             +  H+V             + G F                              YP N
Sbjct: 151 ARVAMHLVCPSRYEASVPDPREPGEFALLRSCQEVFDKLAKVAQPGINYALGTNVSYPNN 210

Query: 550 TLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRY 609
            LRN+A    +  Y  ++D D +P  GL+  L+     +D     G   LVVPAFE +R 
Sbjct: 211 LLRNLAREGAN--YALVIDADMVPSEGLWRGLREM---LDQSNQWGGTALVVPAFEIRRA 265

Query: 610 RTA-----------------------FPASHAPTNFSRWVNATT------PYQIEWAPDF 640
           R                              APTN+SRWVN          Y + W   +
Sbjct: 266 RRMPTNKAELLQLYQVGEVRPFYYGLCTPCQAPTNYSRWVNLPEESLLRPAYVVPWQDPW 325

Query: 641 EPYIVAHRDLPRYDTRFVGFGWNK 664
           EP+ VA   +P +D RF  +G+N+
Sbjct: 326 EPFYVAGGKVPAFDERFRQYGFNR 349


>gi|302565470|ref|NP_001181411.1| N-acetyllactosaminide beta-1,3-N-acetylglucosaminyltransferase
           [Macaca mulatta]
          Length = 415

 Score = 80.5 bits (197), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 69/264 (26%), Positives = 103/264 (39%), Gaps = 79/264 (29%)

Query: 475 DGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQF---LSYAGNSEALRSR 531
           + NDV L    S+D L  +  L + WEGP+S++++ +  E  Q    L+Y  +S     R
Sbjct: 91  EPNDVILATHASVDNLLHLSGLLERWEGPLSVSVFAATKEEAQLATVLAYVLSSHCPDMR 150

Query: 532 RNIGYHVVY------------KEGNF------------------------------YPIN 549
             +  H+V             + G F                              YP N
Sbjct: 151 ARVAMHLVCPSRYEAAVPDPREPGEFALLRSCQEVFDKLARVAQPGINYALGTNVSYPNN 210

Query: 550 TLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRY 609
            LRN+A  +    Y  ++D+D +P  GL+  L+     +D     G   LVVPAFE +R 
Sbjct: 211 LLRNLA--REGAHYALVIDVDMVPSEGLWRGLREM---LDQSNQWGGTALVVPAFEIRRA 265

Query: 610 RTA-----------------------FPASHAPTNFSRWVNATT------PYQIEWAPDF 640
           R                              APTN+SRWVN          Y + W   +
Sbjct: 266 RRMPMNKNELVQLYQVGEVRPFYYGLCTPCQAPTNYSRWVNLPEESLLRPAYVVPWQDPW 325

Query: 641 EPYIVAHRDLPRYDTRFVGFGWNK 664
           EP+ VA   +P +D RF  +G+N+
Sbjct: 326 EPFYVAGGKVPTFDERFRQYGFNR 349


>gi|194909001|ref|XP_001981876.1| GG12290 [Drosophila erecta]
 gi|190656514|gb|EDV53746.1| GG12290 [Drosophila erecta]
          Length = 340

 Score = 80.5 bits (197), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 48/162 (29%), Positives = 84/162 (51%), Gaps = 9/162 (5%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L   +   + +DTD++F + I+ +W  F +  + Q   L     +  +G  W 
Sbjct: 114 RLFLPSLLTH-VDSLLYVDTDILFLSPISDIWRFFKKFNETQMSALTPEHENENIG--WY 170

Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAI 315
           N     P  GR G N+GV+L++LT++R++ W      I +++ L  +W    DQDI N +
Sbjct: 171 NRFARHPFYGRLGVNSGVMLMNLTRMREMKWEQHIVSIHKEYKLRIIW---GDQDIINIL 227

Query: 316 ISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
              HP  +Y +PC++N +  D+     +C      +K+IH N
Sbjct: 228 FYYHPDKLYIMPCEYNYR-PDHCMYMSICNMTHAGVKVIHGN 268


>gi|195453679|ref|XP_002073892.1| GK12907 [Drosophila willistoni]
 gi|194169977|gb|EDW84878.1| GK12907 [Drosophila willistoni]
          Length = 361

 Score = 80.1 bits (196), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 50/162 (30%), Positives = 83/162 (51%), Gaps = 9/162 (5%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L   +   + +DTDV+F + I+ LW  F +  + Q   L        +G  W 
Sbjct: 130 RLFLPSLL-TNVDSLLYVDTDVLFLSPISDLWTFFKKFNETQMAALTPEHESENIG--WY 186

Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAI 315
           N     P  G+ G N+GV+L++LT++R+ +W      I +++ L   W    DQDI N +
Sbjct: 187 NRFARHPFYGKLGVNSGVMLMNLTRMREFNWEKQILPIHKEYKLRITW---GDQDIINIL 243

Query: 316 ISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
              HP  +Y +PC++N +  D+     +C T    +K+IH N
Sbjct: 244 FYYHPDKLYVMPCEYNYR-PDHCMYMSVCSTSHLGIKLIHGN 284


>gi|397571327|gb|EJK47739.1| hypothetical protein THAOC_33522, partial [Thalassiosira oceanica]
          Length = 1892

 Score = 80.1 bits (196), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 86/313 (27%), Positives = 135/313 (43%), Gaps = 75/313 (23%)

Query: 397  TQTTLQQAELSNLNEEDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDR 456
            +   +QQ E+S+   +D   D+  S  T ++ HL     E  ASSD     L+  + M +
Sbjct: 1192 SSPQMQQVEMSDAWPKD-IEDIVTSWKTQVQNHLI----EAMASSD-----LINYIPMQQ 1241

Query: 457  LQMVEINPNNVDEYEASS--DGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAE 514
              ++      + E+ +S   +G DV +V Q S+DRL  +E   + W G  ++ +Y+S  E
Sbjct: 1242 SHVL------LQEWTSSKARNGIDVAIVTQCSLDRLASLEAQLRSWPGNSAVAVYVSSTE 1295

Query: 515  AQQFLSYAGNSEALRSRRNIGYHVVYKEG-----NFYPINTLRNVALNQVSTPYVFLLDI 569
            ++     A   +++ SR N    VV  EG       YPIN LRNVAL +    +     +
Sbjct: 1296 SKSDAIVA--IQSMVSRGNFNCAVVLVEGAPFDDEPYPINYLRNVALLESQRQH-----L 1348

Query: 570  DFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQR--------------------- 608
             F P       L  +I + +       KV+V+PAFE                        
Sbjct: 1349 RFHPTLEQSAALLDAILNQN-------KVVVIPAFECSEGVDCPETLPELHELVRDGRAD 1401

Query: 609  --YRTAFPASHAPTNFSR-WVNATT-------------PYQIEWAPDFEPYIVAHR-DLP 651
              +   FP  H PTNF   W+ + +              Y +E+   FEPY++    ++P
Sbjct: 1402 GFHLAHFPQGHGPTNFEEFWLVSQSLETSQDCAQAWQKSYSVEYERCFEPYVIMKSANVP 1461

Query: 652  RYDTRFVGFGWNK 664
             YD RF G+G NK
Sbjct: 1462 LYDERFTGYGLNK 1474


>gi|195573727|ref|XP_002104843.1| GD18236 [Drosophila simulans]
 gi|194200770|gb|EDX14346.1| GD18236 [Drosophila simulans]
          Length = 511

 Score = 80.1 bits (196), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 48/162 (29%), Positives = 84/162 (51%), Gaps = 9/162 (5%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L   +   + +DTD++F + I+ +W  F +  + Q   L     +  +G  W 
Sbjct: 285 RLFLPSLLTH-VDSLLYVDTDILFLSPISDIWRFFKKFNETQMSALTPEHENENIG--WY 341

Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAI 315
           N     P  GR G N+GV+L++LT++R++ W      I +++ L  +W    DQDI N +
Sbjct: 342 NRFARHPFYGRLGVNSGVMLMNLTRMREMKWEQQIVSIHKEYKLRIIW---GDQDIINIL 398

Query: 316 ISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
              HP  +Y +PC++N +  D+     +C      +K+IH N
Sbjct: 399 FYYHPDKLYIMPCEYNYR-PDHCMYMSICNMSHAGVKVIHGN 439


>gi|194742393|ref|XP_001953687.1| GF17104 [Drosophila ananassae]
 gi|190626724|gb|EDV42248.1| GF17104 [Drosophila ananassae]
          Length = 361

 Score = 79.7 bits (195), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 48/162 (29%), Positives = 85/162 (52%), Gaps = 9/162 (5%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L + +   + +DTD++F + I+ +W  F +  + Q   L     +  +G  W 
Sbjct: 137 RLFLPSLLTK-VDSLLYVDTDILFLSPISDIWQYFKKFNESQISALTPEHENENIG--WY 193

Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAI 315
           N     P  GR G N+GV+L++LT++R++ W      I +++ L  +W    DQDI N +
Sbjct: 194 NRFARHPFYGRLGVNSGVMLMNLTRMREMKWEQHILSIHKEYKLRIIW---GDQDIINIL 250

Query: 316 ISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
              HP  +Y +PC++N +  D+     +C      +K+IH N
Sbjct: 251 FYYHPDKLYIMPCEYNYR-PDHCMYMSICNMTHAGVKLIHGN 291


>gi|390338379|ref|XP_781007.3| PREDICTED: xyloside xylosyltransferase 1-like [Strongylocentrotus
           purpuratus]
          Length = 467

 Score = 79.7 bits (195), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 49/157 (31%), Positives = 80/157 (50%), Gaps = 16/157 (10%)

Query: 192 GVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWY 251
            ++ L  +   ++LPE + + I+LDTD++F +DI +L+  F   +    +G+V  Q   Y
Sbjct: 277 AIFFLSTMLHKELLPEYVHRVIMLDTDLVFMSDIKELFEHFDHFQGDNIMGVVHEQQPVY 336

Query: 252 --LGKLWKNHKPWPALGR-------GFNTGVILLDLTK-LRDISWAGFWRIIAEKFLLTR 301
             +  L+++  P   +G        GFN+GV+LLDL + L+ + +  + R   EK +  R
Sbjct: 337 RHIFSLYRSQNPGTRVGNPPPDGLTGFNSGVLLLDLDRMLQSVHFEDYMR--PEKVVELR 394

Query: 302 ----LWTSLADQDIFNAIISEHPYLVYTLPCQWNVQL 334
                   L DQD +  +  E   L Y LPC WN QL
Sbjct: 395 DKFHFKGHLGDQDFYTLVSMEKEDLFYNLPCSWNKQL 431



 Score = 43.5 bits (101), Expect = 0.33,   Method: Compositional matrix adjust.
 Identities = 21/77 (27%), Positives = 42/77 (54%), Gaps = 2/77 (2%)

Query: 95  GVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWY 154
            ++ L  +   ++LPE + + I+LDTD++F +DI +L+  F   +    +G+V  Q   Y
Sbjct: 277 AIFFLSTMLHKELLPEYVHRVIMLDTDLVFMSDIKELFEHFDHFQGDNIMGVVHEQQPVY 336

Query: 155 --LGKLWKNHKPWPALG 169
             +  L+++  P   +G
Sbjct: 337 RHIFSLYRSQNPGTRVG 353


>gi|72079522|ref|XP_780812.1| PREDICTED: xyloside xylosyltransferase 1-like [Strongylocentrotus
           purpuratus]
          Length = 400

 Score = 79.7 bits (195), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 46/146 (31%), Positives = 76/146 (52%), Gaps = 18/146 (12%)

Query: 204 VLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLG--KLWKNHKP 261
           +LP+ + + I+LDTD+ F +DI+ L+A F + +    +G+  +Q   Y    K ++   P
Sbjct: 223 ILPDFVHQIIMLDTDLKFLSDISGLFAHFDKFQGSNVMGIAHDQQPTYRDAFKAFRARNP 282

Query: 262 WPALGR-------GFNTGVILLDLTKLRDISWAGFW------RIIAEKFLLTRLWTSLAD 308
              +G        GFN+GV+LLD+ ++R      ++       +I+ K+L      SL D
Sbjct: 283 TTRVGSPAPDGLPGFNSGVVLLDIARMRSSQLYNYYLNPPVAELISRKYLFK---GSLGD 339

Query: 309 QDIFNAIISEHPYLVYTLPCQWNVQL 334
           QD ++ I +E   L Y LPC WN  L
Sbjct: 340 QDFYSLIGNERENLFYVLPCSWNRHL 365



 Score = 45.4 bits (106), Expect = 0.089,   Method: Compositional matrix adjust.
 Identities = 41/155 (26%), Positives = 70/155 (45%), Gaps = 19/155 (12%)

Query: 34  IKSILFYRKNPLHFHLITDTVA---LNILQTLFSTWSVPQVEVSFYLAD---------SV 81
           +KSIL     PL F  + D  +   L    T+FS     +VE+ +   D         S 
Sbjct: 133 MKSILEKSSIPLFFFFVVDEPSRLYLEETMTMFSRKYRSKVEIQYTFLDQETIAKKLASS 192

Query: 82  VEDVSWIPNKHYSGVYG----LLKLTLPK-VLPETLAKTIVLDTDVIFATDIAQLWALFS 136
           V  V  I + +  G Y      L + + + +LP+ + + I+LDTD+ F +DI+ L+A F 
Sbjct: 193 VSVVQQIVSGNTQGYYNQSIFFLSVAIHRGILPDFVHQIIMLDTDLKFLSDISGLFAHFD 252

Query: 137 RLRQRQTIGLVENQSDWYLG--KLWKNHKPWPALG 169
           + +    +G+  +Q   Y    K ++   P   +G
Sbjct: 253 KFQGSNVMGIAHDQQPTYRDAFKAFRARNPTTRVG 287


>gi|307106498|gb|EFN54743.1| hypothetical protein CHLNCDRAFT_58024 [Chlorella variabilis]
          Length = 492

 Score = 79.7 bits (195), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 74/279 (26%), Positives = 113/279 (40%), Gaps = 68/279 (24%)

Query: 447 TLVAQLSMDRLQMVEINPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISL 506
           T+V QL   ++Q V      +D  +  S   DVTLV QLS +RL M+E     W G IS 
Sbjct: 132 TIVEQLPQLKIQQV------LDSNKPRSP-QDVTLVTQLSFERLYMLEGQCDVWNGVISA 184

Query: 507 TLYMSDAEAQQFLSYAGNSEALR-------SRRNIGYHVVYKEGNF-------------- 545
            +Y++    +       +++  R        R+   +H + +                  
Sbjct: 185 AVYIALLNGKVVTVELNSNQDPRLTDITEVERKFKQFHQLAEAKGLCKLDLQLVSQTVES 244

Query: 546 ------YPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMD-MHGHGGKKV 598
                 YP+N +RN AL    T  V LLD+DF P   L   ++   +    M        
Sbjct: 245 IWLSALYPVNAMRNRALANARTDAVLLLDVDFWPSAELSELMQKPAKYESLMKTLQAGAA 304

Query: 599 LVVPAFET--------QRYRTA------------------------FPASHAPTNFSRWV 626
           +V+PA+ET        +  R A                        + A H  T++ +W+
Sbjct: 305 VVLPAYETGDPGDIGVEVAREAVLEGKDAAVKMFWDGRIKAFHTDRYEAGHRATDYKKWL 364

Query: 627 NATTPYQIEWAPDFEPYIVAHRD-LPRYDTRFVGFGWNK 664
            A+ PY I +   FEPY++  R  +P YD RFVG+  NK
Sbjct: 365 QASRPYNIMYEEGFEPYVIISRKFVPWYDERFVGYRKNK 403


>gi|443723788|gb|ELU12058.1| hypothetical protein CAPTEDRAFT_219773 [Capitella teleta]
          Length = 341

 Score = 79.3 bits (194), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 59/210 (28%), Positives = 92/210 (43%), Gaps = 24/210 (11%)

Query: 174 LADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFS 233
           + + + E  S+    +Y+     L   + +V+P T+ + I++D D+ F  DI QL+  FS
Sbjct: 132 IVEPMQERFSYKKGSYYNHALFFLSTAIHRVMPSTMHRAIMMDVDLKFRGDIKQLYDQFS 191

Query: 234 RLRQRQTIGLVENQSDWYLGKLWKNHK---------PWPALGRGFNTGVILLDLTKLRDI 284
              +   +GL       Y    W + +         P P    GFN+GV+LLDL ++R  
Sbjct: 192 LFDETHVMGLAREMQPVYRHLFWYHRRENPGTRVGDPPPNGLTGFNSGVLLLDLDRMRSS 251

Query: 285 SWAGFW------RIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLS--- 335
                         +A++F   R+   L DQD F  +   H  L Y LPC +N QLS   
Sbjct: 252 QLYNSMLERKNIESLADEF---RIKGHLGDQDFFTLLGMVHEELFYVLPCVFNRQLSSWW 308

Query: 336 -DNTRSD--ELCYTELTDLKIIHWNSPKKL 362
            D+   D   L +    D+ I H N   K+
Sbjct: 309 KDHGYKDVFHLYHDCEGDVLIYHGNGKTKI 338



 Score = 50.4 bits (119), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 34/157 (21%), Positives = 67/157 (42%), Gaps = 13/157 (8%)

Query: 28  RSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFST-------WSVPQVEVSFY---L 77
           +  +  + S+  +   PL+ H+I +  + N+ + + +        + +  ++V      L
Sbjct: 70  KQFILTVTSLFRHATIPLNIHIIGEEASQNLAEKIMAKNVPSTAKYKIVHLDVDHLATEL 129

Query: 78  ADSV---VEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 134
           AD V    E  S+    +Y+     L   + +V+P T+ + I++D D+ F  DI QL+  
Sbjct: 130 ADIVEPMQERFSYKKGSYYNHALFFLSTAIHRVMPSTMHRAIMMDVDLKFRGDIKQLYDQ 189

Query: 135 FSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRG 171
           FS   +   +GL       Y    W + +  P    G
Sbjct: 190 FSLFDETHVMGLAREMQPVYRHLFWYHRRENPGTRVG 226


>gi|443710195|gb|ELU04493.1| hypothetical protein CAPTEDRAFT_192232 [Capitella teleta]
          Length = 320

 Score = 78.6 bits (192), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 57/188 (30%), Positives = 85/188 (45%), Gaps = 35/188 (18%)

Query: 478 DVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRNIGYH 537
           DVTL+  L+ DR++ +  L ++W GPIS  +Y+ +      +    N    +SR NI  H
Sbjct: 123 DVTLMVHLTSDRIEKLLRLTENWSGPISAAIYVDNITDSLMIE---NICHFQSRPNIFIH 179

Query: 538 VVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKK 597
           +++K G  YP + +RN+AL +V T Y FL D+DF+P   +Y      I+S +        
Sbjct: 180 ILHKVGGIYPSSIIRNLALEEVFTEYAFLSDVDFMPAPSVYKTSVDLIKSKN-RLLSKTT 238

Query: 598 VLVVPAFETQRYRT------AFPAS-------------------------HAPTNFSRWV 626
             VV AFE +R  T       FP +                         H  TN+  W 
Sbjct: 239 AFVVAAFELKRNITPDGDAVKFPRNKSDLLALQNSTLIQPFHYHPKNHFGHFSTNYEAWN 298

Query: 627 NATTPYQI 634
            A T YQ+
Sbjct: 299 IANTTYQV 306


>gi|410921614|ref|XP_003974278.1| PREDICTED: xyloside xylosyltransferase 1-like [Takifugu rubripes]
          Length = 389

 Score = 78.6 bits (192), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 50/178 (28%), Positives = 81/178 (45%), Gaps = 18/178 (10%)

Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
           F + +++ +  S     +YS     L + + +++P++L + + LD D+ +  +I +L+  
Sbjct: 178 FPIVEAMQKHFSSGSGAYYSDAIFFLSVAMHRIMPDSLTRMVQLDLDLQYRVNIRELFQE 237

Query: 232 FSRLRQRQTIGLVENQSDWYLGKLWKNHK---------PWPALGRGFNTGVILLDLTKLR 282
           F +      IG+       Y    W+  K         P P    GFN+GV+LLDL  +R
Sbjct: 238 FHQFPPGAVIGIAREMQPVYRHTFWQYRKENPQTKVGEPHPDGLPGFNSGVMLLDLASMR 297

Query: 283 DISW------AGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQL 334
             S             +A+++   R    L DQD F  I  EHP L Y+L C WN QL
Sbjct: 298 ASSLYNQLLEPDHVAKLADQY---RFRGHLGDQDFFTMIGMEHPELFYSLACGWNRQL 352


>gi|289742285|gb|ADD19890.1| putative glycosyltransferase [Glossina morsitans morsitans]
          Length = 361

 Score = 78.2 bits (191), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 49/162 (30%), Positives = 80/162 (49%), Gaps = 9/162 (5%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L ET+   + +DTD++F + +A +W  F      Q   L     D  +G  W 
Sbjct: 134 RLFLPSIL-ETVDAALYVDTDILFLSPVADIWKFFKSFNGTQIAALSPEHEDANVG--WY 190

Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAI 315
           N     P  G  G N+GV+L++LT++R+  W      I +++ L   W    DQDI N +
Sbjct: 191 NRFARHPYYGPLGVNSGVMLMNLTRMREFKWEEKVLPIYKEYKLRLTW---GDQDIINIL 247

Query: 316 ISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
              H   +Y +PC++N +  D+     +C      +K+IH N
Sbjct: 248 FYYHSEKLYLMPCEYNYR-PDHCMYMNICNVSADGIKVIHGN 288



 Score = 40.0 bits (92), Expect = 4.6,   Method: Compositional matrix adjust.
 Identities = 35/150 (23%), Positives = 60/150 (40%), Gaps = 10/150 (6%)

Query: 17  VAIVCAGYNSTRSLVTLIKSILF-YRKNPLHFHLITDTVALNILQTLFSTWSVPQVEVSF 75
           VA+V  G      LV +  ++LF   +  L+F +  +   +       + W   + E  F
Sbjct: 49  VAVVVCGQRMQEVLVMIKSALLFNSSREKLYFLIFAEERLMISFNEKLNDWRAIRSEFDF 108

Query: 76  YLAD---SVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLW 132
            L      V  +  W   ++        +L LP +L ET+   + +DTD++F + +A +W
Sbjct: 109 RLLPLQFPVNSEREW---RNLFKPCAAQRLFLPSIL-ETVDAALYVDTDILFLSPVADIW 164

Query: 133 ALFSRLRQRQTIGLVENQSDWYLGKLWKNH 162
             F      Q   L     D  +G  W N 
Sbjct: 165 KFFKSFNGTQIAALSPEHEDANVG--WYNR 192


>gi|301616898|ref|XP_002937879.1| PREDICTED: uncharacterized protein C3orf21 homolog [Xenopus
           (Silurana) tropicalis]
          Length = 376

 Score = 78.2 bits (191), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 58/207 (28%), Positives = 94/207 (45%), Gaps = 24/207 (11%)

Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
           F + +++ +  S     +YS     L + L +++P+ + + I LD D+ F T++ +L+  
Sbjct: 165 FPIVEAMQKHFSSGSGTYYSDSIFFLSVALHQIMPKEIYRIIQLDLDLKFKTNVRELFEE 224

Query: 232 FSRLRQRQTIGLVENQSDWYLGKLWKNHK---------PWPALGRGFNTGVILLDLTKLR 282
           F        IG+ +     Y    W+  K         P P    GFN+GV+LL+L  +R
Sbjct: 225 FDNFMHMSIIGIAQEMQPVYRHTFWQYRKENPSTKVGEPPPDGMPGFNSGVMLLNLEAMR 284

Query: 283 ------DISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLS- 335
                 ++      + +AEK+        L DQD F+ I  EHP L Y L C WN QL  
Sbjct: 285 QSRLYNELLEPNAVQQLAEKY---HFKGHLGDQDFFSMIGMEHPELFYVLDCGWNRQLCT 341

Query: 336 ---DNTRSD--ELCYTELTDLKIIHWN 357
              D+   D  +  Y    ++KI+H N
Sbjct: 342 WWKDHGYMDVFDDYYRCEGNIKILHGN 368



 Score = 43.1 bits (100), Expect = 0.42,   Method: Compositional matrix adjust.
 Identities = 37/158 (23%), Positives = 67/158 (42%), Gaps = 16/158 (10%)

Query: 29  SLVTLIKSILFYRKNPLHFHLITDT----VALNILQTLFSTWSVPQVEVSFYLADSVVED 84
           +L +LIK   F     L  H ++D     +  ++L+ L    S  + +V F+    + E 
Sbjct: 105 ALSSLIKYGKFESNEVLCLHFVSDEPSKEIGKSLLKDLLQGASF-KYKVIFHDVGMLTEK 163

Query: 85  VSWIP---NKHYSGVYG--------LLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWA 133
           +  I     KH+S   G         L + L +++P+ + + I LD D+ F T++ +L+ 
Sbjct: 164 LFPIVEAMQKHFSSGSGTYYSDSIFFLSVALHQIMPKEIYRIIQLDLDLKFKTNVRELFE 223

Query: 134 LFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRG 171
            F        IG+ +     Y    W+  K  P+   G
Sbjct: 224 EFDNFMHMSIIGIAQEMQPVYRHTFWQYRKENPSTKVG 261


>gi|390597315|gb|EIN06715.1| hypothetical protein PUNSTDRAFT_72884 [Punctularia strigosozonata
           HHB-11173 SS5]
          Length = 382

 Score = 77.8 bits (190), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 44/130 (33%), Positives = 70/130 (53%), Gaps = 7/130 (5%)

Query: 13  EVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQ-- 70
           E +  ++V    +S      LIKSIL Y   P  FH++ D  A ++L++  S    P+  
Sbjct: 4   EEVHFSLVMFSADSGTEGAMLIKSILMYTTRPTAFHIVCDEGAQSVLESRISLVRRPRYD 63

Query: 71  VEVSFY-----LADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFA 125
           + V FY       ++ +     I  +HYSG+YGL+KL + ++LP T+ ++I +DTD IF 
Sbjct: 64  IRVVFYRVAREAMEARIRREGGIYTEHYSGIYGLMKLFIHEILPPTVKRSIFVDTDAIFI 123

Query: 126 TDIAQLWALF 135
           +D   LW  F
Sbjct: 124 SDPTLLWRRF 133



 Score = 53.5 bits (127), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 23/48 (47%), Positives = 34/48 (70%)

Query: 185 IPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALF 232
           I  +HYSG+YGL+KL + ++LP T+ ++I +DTD IF +D   LW  F
Sbjct: 86  IYTEHYSGIYGLMKLFIHEILPPTVKRSIFVDTDAIFISDPTLLWRRF 133


>gi|432091091|gb|ELK24303.1| N-acetyllactosaminide beta-1,3-N-acetylglucosaminyltransferase
           [Myotis davidii]
          Length = 415

 Score = 77.8 bits (190), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 71/268 (26%), Positives = 107/268 (39%), Gaps = 87/268 (32%)

Query: 475 DGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQF---LSYAGNSEALRSR 531
           D +DV L    S+D L  +  L + WEGP+S++++ +  E  Q    L+YA +S     R
Sbjct: 91  DPDDVILATHASVDNLVHLSGLLERWEGPLSVSVFAATKEEAQLATVLTYALSSHCPDIR 150

Query: 532 RNIGYHVVY------------KEGNF------------------------------YPIN 549
             +  H+V             + G F                              YP N
Sbjct: 151 ARVAMHLVCPSRYEAAVPDPREPGEFALLRSCQQVFDKLARVAQPGINYALGTNVSYPNN 210

Query: 550 TLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGK---KVLVVPAFET 606
            LRN+A    +  Y  ++D+D +P  GL+  L+      +M G   +     LVVPAFE 
Sbjct: 211 LLRNLAREGAN--YALVIDVDMVPSEGLWRGLR------EMLGQSKQWAGTALVVPAFEI 262

Query: 607 QRYRTAFPAS------------------------HAPTNFSRWVNATT------PYQIEW 636
           +R R   P +                         APTN+SRWVN          Y + W
Sbjct: 263 RRARR-IPTNKNELLKLYQVDEVQPFYYGLCTPCQAPTNYSRWVNLPEETLLRPAYVVPW 321

Query: 637 APDFEPYIVAHRDLPRYDTRFVGFGWNK 664
              +EP+ VA   +P +D RF  +G+N+
Sbjct: 322 QDPWEPFYVAGGKVPAFDERFRQYGFNR 349


>gi|358339090|dbj|GAA47215.1| glycosyltransferase-like protein LARGE [Clonorchis sinensis]
          Length = 419

 Score = 77.8 bits (190), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 42/139 (30%), Positives = 70/139 (50%), Gaps = 2/139 (1%)

Query: 195 GLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGK 254
            L  L  P ++ +++ K IVL  D++F  ++A+LW  F      Q +G V  Q D     
Sbjct: 190 ALQNLMAPYIVDQSVDKIIVLSPDLLFNENVAELWTHFEHFSSTQAVGAVCEQVDDCDQC 249

Query: 255 LWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNA 314
             K+ + +P    G NTG++LL L +LR   W G ++         +      +Q I N 
Sbjct: 250 CPKDDEQFPLY--GLNTGLVLLHLDRLRKADWRGLYKAELSSNQYVQESPGDLNQGILNT 307

Query: 315 IISEHPYLVYTLPCQWNVQ 333
           ++S+   ++Y LPC+WN+Q
Sbjct: 308 VLSKRMRMLYRLPCEWNIQ 326



 Score = 43.1 bits (100), Expect = 0.42,   Method: Compositional matrix adjust.
 Identities = 34/147 (23%), Positives = 57/147 (38%), Gaps = 15/147 (10%)

Query: 9   IPTCEV---IQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFST 65
           + TCE+   +   I+C   N+T                P+  HL+    A N     F  
Sbjct: 110 MKTCELDWSVPSGILCENINATPI------------AQPIMLHLVVHQKAWNKTLRRFQR 157

Query: 66  WSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFA 125
           W+  +  V  Y   + +        +       L  L  P ++ +++ K IVL  D++F 
Sbjct: 158 WNSTRFLVKLYEFKNELALRGTEAERSPRVQLALQNLMAPYIVDQSVDKIIVLSPDLLFN 217

Query: 126 TDIAQLWALFSRLRQRQTIGLVENQSD 152
            ++A+LW  F      Q +G V  Q D
Sbjct: 218 ENVAELWTHFEHFSSTQAVGAVCEQVD 244


>gi|170038922|ref|XP_001847296.1| GLT8D3 protein [Culex quinquefasciatus]
 gi|167862537|gb|EDS25920.1| GLT8D3 protein [Culex quinquefasciatus]
          Length = 375

 Score = 77.8 bits (190), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 51/166 (30%), Positives = 77/166 (46%), Gaps = 18/166 (10%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L   +   + +D+D IF   + +LW+LF      Q  G+     D   G  W 
Sbjct: 150 RLFLPSLLTH-IDSVLYVDSDTIFLAPVQELWSLFKTFNASQFAGMAPEHEDKNAG--WY 206

Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAI 315
           N     P  G  G N+GV+L++LT++R+  W      I  +F L  +W    DQDI N +
Sbjct: 207 NRFARHPYYGDLGVNSGVMLMNLTRMREFRWEQHIMPIYREFRLKLVW---GDQDIINIL 263

Query: 316 ISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTD----LKIIHWN 357
              HP  +   PC WN       R+D   Y  + +    +KI+H N
Sbjct: 264 FHHHPDRLLVFPCDWNY------RADHCMYMSVCEAPDGVKIVHGN 303


>gi|71834366|ref|NP_001025270.1| glucoside xylosyltransferase 1 precursor [Danio rerio]
 gi|82079018|sp|Q5SP46.1|GXLT1_DANRE RecName: Full=Glucoside xylosyltransferase 1; AltName:
           Full=Glycosyltransferase 8 domain-containing protein 3
 gi|190337605|gb|AAI63763.1| Glycosyltransferase 8 domain containing 3 [Danio rerio]
 gi|190339952|gb|AAI63768.1| Glycosyltransferase 8 domain containing 3 [Danio rerio]
          Length = 405

 Score = 77.4 bits (189), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 72/271 (26%), Positives = 118/271 (43%), Gaps = 40/271 (14%)

Query: 165 WPALGRGFYLADSVVEDVSWIPNKHYSGVYGLLK------LTLPKVLPETLAKTIVLDTD 218
           WPA  +G +    V+  +S+ P++H+     L K      L LP +L E L   + +DTD
Sbjct: 129 WPAFIQGKF--SYVLHPISF-PHEHHEEWSQLFKPCASQRLFLPMILRE-LDSLLYVDTD 184

Query: 219 VIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPW-PALGR-GFNTGVILL 276
           V+F   +  +W +       Q I +     +  +   W +   W P  G+ G N+GV+L+
Sbjct: 185 VLFLQPVELIWDMLMLFNSTQLIAMAPEHEEPRIA--WYSRFSWHPYYGKMGINSGVMLM 242

Query: 277 DLTKLR-----------DISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYT 325
           +LT++R            + W      + +K+ L   W    DQD+ N I   +P +VYT
Sbjct: 243 NLTRMRITQFKNDMTPVGLHWDELLMPLLQKYKLNITW---GDQDLINIIFHYNPEMVYT 299

Query: 326 LPCQWNVQLSDNTRSDELCYTELTDLKIIHWNSPKKLKVKNKHMEFFRNLYLTFLEYDGN 385
           LPC WN +             E   + ++H N   +    +     FR +Y  F +Y   
Sbjct: 300 LPCHWNYRPDHCIYGSNCAPAEDEGVFVLHGN---RGVFHSDKQPAFRAVYEAFEQYT-- 354

Query: 386 LLRRELFGCNLTQTTLQQAELSNLNEEDPCY 416
                 FG +L Q+ L + E++ LNE    Y
Sbjct: 355 ------FGEDLRQSLLSRLEVA-LNETTHTY 378


>gi|428166538|gb|EKX35512.1| hypothetical protein GUITHDRAFT_118323 [Guillardia theta CCMP2712]
          Length = 1696

 Score = 77.0 bits (188), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 60/207 (28%), Positives = 93/207 (44%), Gaps = 44/207 (21%)

Query: 476  GNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNS-EALRSR--- 531
             + V LV Q S+DRL  +E     W G  S+ LY+  ++ +Q      NS   +  R   
Sbjct: 1380 ASGVVLVTQASLDRLPRLEAQLLAWRGAASVALYVPWSDQRQETEDRLNSIRDMYGRLLD 1439

Query: 532  RNIGYHVV----------YKEGNFYPINTLRNVALNQ---VSTPYVFLLDIDFLPMFGLY 578
              +G  VV          ++  N YPIN LRN+ALN     +   V L+D+DFLP   L 
Sbjct: 1440 AGVGGVVVSILFGNSPSEHEYNNLYPINALRNLALNTSKFCAAELVLLVDVDFLPSKAL- 1498

Query: 579  PYLKSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPASHAPTNFSRWVNATTPYQIEWAP 638
                                  V   + + Y  A   + A        ++T PY++E+  
Sbjct: 1499 ----------------------VDKCQDEEYLAAMRQASA---IGSMFSSTEPYEVEYQS 1533

Query: 639  DFEPYIVAH-RDLPRYDTRFVGFGWNK 664
            ++EPY++ + + +P+YD RF G+G NK
Sbjct: 1534 NYEPYVIMNVKTVPKYDERFRGYGMNK 1560


>gi|156373903|ref|XP_001629549.1| predicted protein [Nematostella vectensis]
 gi|156216552|gb|EDO37486.1| predicted protein [Nematostella vectensis]
          Length = 304

 Score = 77.0 bits (188), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 56/183 (30%), Positives = 83/183 (45%), Gaps = 18/183 (9%)

Query: 183 SWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIG 242
           S+    +YS     L   L KV PE L K I+LD+D+ F  DI  L+  F +      +G
Sbjct: 104 SYKKGAYYSDPIFFLSTALHKVFPENLTKIIMLDSDLRFRNDINLLFQHFKKFESSNIMG 163

Query: 243 LVENQSDWY--LGKLWKNHKPWPALGR-------GFNTGVILLDLTKLRDISWAGFWRII 293
           L   Q   Y  +   ++++ P   +G        GFN+GV LL L  +R      + +II
Sbjct: 164 LATEQQPVYRHIFDTYRHNNPGTMVGEPPPNGITGFNSGVALLHLENMR--RSKTYNQII 221

Query: 294 AEKFLLT-----RLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTEL 348
           + +  +           L DQD ++ +  EH  L Y LPC WN QL    R +   Y ++
Sbjct: 222 SSEAAIDMAKKYNFKGHLGDQDFYSLLSVEHKDLFYILPCGWNRQLCSYWRDNG--YAKV 279

Query: 349 TDL 351
            DL
Sbjct: 280 FDL 282



 Score = 58.9 bits (141), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 45/172 (26%), Positives = 78/172 (45%), Gaps = 17/172 (9%)

Query: 15  IQVAIVCAGYNSTRSLVT--LIKSILFYRKNPLHFHLITDTVALNILQTLF--STWSVPQ 70
           I + +V A + S  ++ T   IKSI  +   P+ FH I D    ++++ +F  ++     
Sbjct: 18  ILMTLVKAYHGSPLAVSTEKCIKSICKHSIIPITFHFIVDKKGTDVVKGIFMRTSGHCTN 77

Query: 71  VEVSFYLADSVVEDV-----------SWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLD 119
            ++ F+ AD + E +           S+    +YS     L   L KV PE L K I+LD
Sbjct: 78  PDIVFHDADKLAERLKGIVQVLQRPFSYKKGAYYSDPIFFLSTALHKVFPENLTKIIMLD 137

Query: 120 TDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWY--LGKLWKNHKPWPALG 169
           +D+ F  DI  L+  F +      +GL   Q   Y  +   ++++ P   +G
Sbjct: 138 SDLRFRNDINLLFQHFKKFESSNIMGLATEQQPVYRHIFDTYRHNNPGTMVG 189


>gi|321452095|gb|EFX63567.1| hypothetical protein DAPPUDRAFT_204739 [Daphnia pulex]
          Length = 326

 Score = 77.0 bits (188), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 60/217 (27%), Positives = 99/217 (45%), Gaps = 27/217 (12%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L + +   + +DTD IF T    +W  F+++ + Q   +     D+  G  W 
Sbjct: 111 RLFLPSLL-QGVDSVLYMDTDTIFLTSPLNVWEHFNKMDEMQMAAVAPEHEDFSTG--WY 167

Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAI 315
           N     P  G  G N+GV+L++LT++R   W  +   I  ++    +W    DQDI N +
Sbjct: 168 NRFARHPYYGNLGVNSGVMLMNLTRMRHFGWEKYVVPIYHEYKSQIVW---GDQDIINIV 224

Query: 316 ISEHPYLVYTLPCQWNVQLSDNTRSDELCYT-ELTDLKIIHWNSPKKLKVKNKHMEFFRN 374
              HP  +Y  PC +N +  D+     +C T E   + ++H N   +    NK    F+ 
Sbjct: 225 FHFHPEKLYVYPCHYNYR-PDHCMYMSVCKTAEEQGIHVLHGN---RGSFHNKKQPAFKA 280

Query: 375 LYLTFLEYDGNLLRRELFGCNLTQTTLQQAELSNLNE 411
           +Y +  +Y         FG NL      Q  L+N+ E
Sbjct: 281 VYASIEQYS--------FGENL------QTLLTNMKE 303


>gi|326426427|gb|EGD71997.1| hypothetical protein PTSG_00013 [Salpingoeca sp. ATCC 50818]
          Length = 458

 Score = 77.0 bits (188), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 48/160 (30%), Positives = 76/160 (47%), Gaps = 25/160 (15%)

Query: 204 VLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWY---------LGK 254
           +LP+ + K IV+D DV+F  D+  LW +F   +  + IGLV  Q   +         L K
Sbjct: 271 LLPD-VDKAIVVDADVVFRVDVTLLWQIFRAFKTEELIGLVYEQQPTFRHTFAAYRKLNK 329

Query: 255 LWKNHKPWPALGR------GFNTGVILLDLTKLR------DISWAGFWRIIAEKFLLTRL 302
                 P  ++G       GFNTGV LL+L  +R      ++ +      +  K++++  
Sbjct: 330 RTHIGSPAASVGGVRAGLPGFNTGVALLNLAHMRASDLISELVYTDLVDHMTRKYMVSER 389

Query: 303 WTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDE 342
            T      +F  + +EHP+L + LPC+WN QL    R  E
Sbjct: 390 LTPAT---LFTLMAAEHPFLFHVLPCKWNRQLCQKMRQSE 426


>gi|196016771|ref|XP_002118236.1| hypothetical protein TRIADDRAFT_62267 [Trichoplax adhaerens]
 gi|190579211|gb|EDV19312.1| hypothetical protein TRIADDRAFT_62267 [Trichoplax adhaerens]
          Length = 1053

 Score = 76.6 bits (187), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 62/237 (26%), Positives = 108/237 (45%), Gaps = 46/237 (19%)

Query: 474 SDGND--VTLVAQLSMDRLQMVEMLFKHWEGPISLTLY----MSDAEAQQFLSYA-GNSE 526
           + G D  +T+  Q+++DRL  +E L + W+GPIS+ L+    +S    Q  L    G   
Sbjct: 253 TSGKDPFITIATQMTIDRLPTLERLVQRWQGPISVVLFVLNDISGVTVQDHLQQVLGKYH 312

Query: 527 ALRS-RRNIGYHVVY------KEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYP 579
            +   R+ I  H+V+      K+   YP+N LRN+A+    T +V+LLDID +P      
Sbjct: 313 QVDDFRKFIDLHIVFEDALNGKDSALYPVNFLRNLAIKFSKTSHVYLLDIDIIPSVSHEQ 372

Query: 580 YLKSSIR-SMDMHGH------GGKKVLVVPAFE-----------------TQRYRTAFP- 614
            + + ++ S D++          K   + P F+                  ++ R   P 
Sbjct: 373 AVAAIMKASHDINNREHNIKVCQKCAFITPLFKLKTDAVDNDIPREKNVLLEQIRKDNPP 432

Query: 615 ------ASHAPTNFSRWVNATTPYQIEWAPDFEPYIVAHRDL-PRYDTRFVGFGWNK 664
                  SHA T F  W ++++ ++  +  + EPY + +R L P  +  F G+G +K
Sbjct: 433 IEPFHAVSHAMTQFGIWFSSSSAFKSNYIENNEPYFIINRKLSPNINNIFSGYGRDK 489


>gi|410929415|ref|XP_003978095.1| PREDICTED: N-acetyllactosaminide
           beta-1,3-N-acetylglucosaminyltransferase-like [Takifugu
           rubripes]
          Length = 431

 Score = 75.9 bits (185), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 65/262 (24%), Positives = 102/262 (38%), Gaps = 77/262 (29%)

Query: 477 NDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYM--SDAEAQQFLSYAGNSEALRSRRNI 534
           N +TL    +++ L  +E L + W+ P+S+ ++    D      L YA +    + +  +
Sbjct: 107 NILTLATHTTINNLHHLESLLERWQNPLSVAIFAHGEDVRFATALVYALSFFCPQIQALV 166

Query: 535 GYHVV------------------------------------YK----EGNF-YPINTLRN 553
            +H+V                                    YK     GN  YP N LRN
Sbjct: 167 DFHLVCLSGETASFPEKDREHFAGLEDCASVFLRLEKHRDKYKNYDISGNVSYPNNLLRN 226

Query: 554 VALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQR----- 608
           VA +   + YV ++DID +P   L+    + +R    H   G +V V+PAFE +      
Sbjct: 227 VARSGTESAYVLVIDIDMMPSADLHQQFLAMVRK---HKPAGDEVFVLPAFEIRHARKIP 283

Query: 609 ------------------YRTAFPASHAPTNFSRWVNA--------TTPYQIEWAPDFEP 642
                             Y    P   APTN+SRWVN+           Y I W   +EP
Sbjct: 284 ANKAELVQLYQVGEVRPFYEELCPRCQAPTNYSRWVNSHVRETATLEVAYTITWVDPWEP 343

Query: 643 YIVAHRDLPRYDTRFVGFGWNK 664
           + +    +P YD  F  +G+N+
Sbjct: 344 FYIGPNSVPLYDENFKQYGFNR 365


>gi|320170259|gb|EFW47158.1| UDP-GlcNAc:betaGal beta-1,3-N-acetylglucosaminyltransferase 1
           [Capsaspora owczarzaki ATCC 30864]
          Length = 448

 Score = 75.9 bits (185), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 72/299 (24%), Positives = 121/299 (40%), Gaps = 87/299 (29%)

Query: 448 LVAQLSMDRLQMVEINPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLT 507
           +++QL +D      I+ + V   + + +  DVTLV+Q   D L     L ++W+GP+S+ 
Sbjct: 89  VLSQLPLDESSKYRIHMHYV-RRQIAIEQPDVTLVSQCDADHLHEAIGLAQYWDGPVSVA 147

Query: 508 LYMSDAEAQQFLSYAGNSEAL--RSRRNIGYHVVYKEG-------------------NF- 545
           +++      + L+      A   R R  + +H+VY E                    NF 
Sbjct: 148 VFIP-GNVYKALATLRQIRACHDRVRERVSFHLVYMENESVYEEVAASELPLEGACENFQ 206

Query: 546 -------------------------------YPINTLRNVALNQVSTPYVFLLDIDFLPM 574
                                          YP N LRNVA + VS+ ++F++DID  P 
Sbjct: 207 DPYEELRANNGTAVRPKLQQQKAPAKYDGVPYPNNFLRNVARSGVSSTFMFVIDIDMRPS 266

Query: 575 FGLYPYLKSSIRSMDMHGHGG-KKVLVVPAFETQR-----------------------YR 610
            GL  +    +R  D       +   VVP FE +                        Y 
Sbjct: 267 RGLRLWF---LRLADKRPRASLRTAFVVPTFELKAGAPIPRIRQQLIDTYLQGNARPFYE 323

Query: 611 TAFPASHAPTNFSRWVNATTP-----YQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
            A      PT+F +W++ T+P     Y+++W   +EP+ ++  +LP YD RF  +G+N+
Sbjct: 324 KACWKCQKPTDFDQWLSVTSPISKPTYKVDWKDPWEPFYISWTNLPLYDERFQRYGFNR 382


>gi|195151677|ref|XP_002016765.1| GL21941 [Drosophila persimilis]
 gi|194111822|gb|EDW33865.1| GL21941 [Drosophila persimilis]
          Length = 318

 Score = 75.5 bits (184), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 46/160 (28%), Positives = 81/160 (50%), Gaps = 9/160 (5%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L   +   + +DTD +F + I+++WA F +    Q   L     +  +G  W 
Sbjct: 95  RLFLPSLLT-NVDSLLYVDTDTLFLSPISEIWAFFKKFNDSQMAALTPEHENENIG--WY 151

Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAI 315
           N     P  GR G N+GV+L++LT++R+  W      I E++ L  +W    DQDI N  
Sbjct: 152 NRFARHPFYGRLGVNSGVMLMNLTRMREFHWEKHILSIHEEYKLRIIW---GDQDIINIF 208

Query: 316 ISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIH 355
              +P  ++ +PC++N +  D+      C    + +K++H
Sbjct: 209 FYYYPDKLFVMPCEYNYR-PDHCMYMSTCNMTHSGVKLMH 247



 Score = 40.4 bits (93), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 41/169 (24%), Positives = 74/169 (43%), Gaps = 16/169 (9%)

Query: 10  PTCEVIQVAIVCAGYNSTRSLVTLIKSILF-YRKNPLHFHLITDTVALNILQTLFSTWSV 68
           P+   + + +VC G     +LV +  +I+F Y K  L F + T+      L+   + W  
Sbjct: 2   PSRTPLYIVVVCCGQRVQETLVMIKSAIVFNYEKEYLKFLIFTEKEKDEELREKLTDWK- 60

Query: 69  PQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLK------LTLPKVLPETLAKTIVLDTDV 122
             +  SF+  D  +  +++ P+++      L K      L LP +L   +   + +DTD 
Sbjct: 61  -NILTSFF--DFEILPLNF-PSQNKIEWMNLFKPCAAQRLFLPSLLT-NVDSLLYVDTDT 115

Query: 123 IFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNH-KPWPALGR 170
           +F + I+++WA F +    Q   L     +  +G  W N     P  GR
Sbjct: 116 LFLSPISEIWAFFKKFNDSQMAALTPEHENENIG--WYNRFARHPFYGR 162


>gi|49903673|gb|AAH76793.1| MGC83717 protein [Xenopus laevis]
          Length = 367

 Score = 75.5 bits (184), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 50/178 (28%), Positives = 81/178 (45%), Gaps = 18/178 (10%)

Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
           F + +++ +  S     +YS     L + L +++P+ + + + LD D+ F T+I +L+  
Sbjct: 156 FPIVEAMQKHFSSGSGTYYSDSIFFLSVALHQIMPKEIYRIVQLDLDLKFKTNIRELFEE 215

Query: 232 FSRLRQRQTIGLVENQSDWYLGKLWKNHK---------PWPALGRGFNTGVILLDLTKLR 282
           F        IG+       Y    W+  +         P P    GFN+GV+LL+L  +R
Sbjct: 216 FDNFMHTSVIGIAPEMQPVYRHTFWQYRRENPLTKVGDPPPDGMPGFNSGVMLLNLEAMR 275

Query: 283 ------DISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQL 334
                 ++      + +AEK+        L DQD F+ I  EHP L Y L C WN QL
Sbjct: 276 LSKLYNELLEPNAVQQLAEKY---HFKGHLGDQDFFSMIGMEHPELFYVLDCGWNRQL 330


>gi|198477802|ref|XP_002136434.1| GA22542 [Drosophila pseudoobscura pseudoobscura]
 gi|198145185|gb|EDY71889.1| GA22542 [Drosophila pseudoobscura pseudoobscura]
          Length = 295

 Score = 75.5 bits (184), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 46/160 (28%), Positives = 81/160 (50%), Gaps = 9/160 (5%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L   +   + +DTD +F + I+++WA F +    Q   L     +  +G  W 
Sbjct: 72  RLFLPSLLT-NVDSLLYVDTDTLFLSPISEIWAFFKKFNDSQMAALTPEHENENIG--WY 128

Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAI 315
           N     P  GR G N+GV+L++LT++R+  W      I E++ L  +W    DQDI N  
Sbjct: 129 NRFARHPFYGRLGVNSGVMLMNLTRMREFHWEKHILSIHEEYKLRIIW---GDQDIINIF 185

Query: 316 ISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIH 355
              +P  ++ +PC++N +  D+      C    + +K++H
Sbjct: 186 FYYYPDKLFVMPCEYNYR-PDHCMYMSTCNMTHSGVKLMH 224


>gi|260822374|ref|XP_002606577.1| hypothetical protein BRAFLDRAFT_62546 [Branchiostoma floridae]
 gi|229291920|gb|EEN62587.1| hypothetical protein BRAFLDRAFT_62546 [Branchiostoma floridae]
          Length = 183

 Score = 75.5 bits (184), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 48/150 (32%), Positives = 66/150 (44%), Gaps = 18/150 (12%)

Query: 201 LPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK--- 257
           + KV P      + LD D+ F +DI +L+ +F        IG+ E Q   Y    W+   
Sbjct: 1   MHKVFPSDTKHIVQLDCDLKFRSDIRELFDMFDHFESDNVIGIAEEQQPVYRHTFWQYRN 60

Query: 258 NHK------PWPALGRGFNTGVILLDLTKLRDISW------AGFWRIIAEKFLLTRLWTS 305
            HK      P P    GFN+GV+LLDL K+R                ++E+F        
Sbjct: 61  EHKGTKVGDPPPDGQPGFNSGVLLLDLDKMRGSRLYNSLLNGTVVHAMSEEFHFK---GH 117

Query: 306 LADQDIFNAIISEHPYLVYTLPCQWNVQLS 335
           L DQD F  +   +P L + LPC WN QL 
Sbjct: 118 LGDQDYFTLLSYRYPQLFHVLPCTWNRQLC 147


>gi|353681747|ref|NP_001086551.2| xyloside xylosyltransferase 1 [Xenopus laevis]
          Length = 387

 Score = 75.1 bits (183), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 50/178 (28%), Positives = 81/178 (45%), Gaps = 18/178 (10%)

Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
           F + +++ +  S     +YS     L + L +++P+ + + + LD D+ F T+I +L+  
Sbjct: 176 FPIVEAMQKHFSSGSGTYYSDSIFFLSVALHQIMPKEIYRIVQLDLDLKFKTNIRELFEE 235

Query: 232 FSRLRQRQTIGLVENQSDWYLGKLWKNHK---------PWPALGRGFNTGVILLDLTKLR 282
           F        IG+       Y    W+  +         P P    GFN+GV+LL+L  +R
Sbjct: 236 FDNFMHTSVIGIAPEMQPVYRHTFWQYRRENPLTKVGDPPPDGMPGFNSGVMLLNLEAMR 295

Query: 283 ------DISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQL 334
                 ++      + +AEK+        L DQD F+ I  EHP L Y L C WN QL
Sbjct: 296 LSKLYNELLEPNAVQQLAEKY---HFKGHLGDQDFFSMIGMEHPELFYVLDCGWNRQL 350


>gi|195341842|ref|XP_002037514.1| GM18308 [Drosophila sechellia]
 gi|194132364|gb|EDW53932.1| GM18308 [Drosophila sechellia]
          Length = 408

 Score = 75.1 bits (183), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 62/245 (25%), Positives = 106/245 (43%), Gaps = 54/245 (22%)

Query: 468 DEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEA----QQFLSYA- 522
           ++Y  SS+G+ V L  Q S++RL  +  +  +W+G +S+ L+ +  E     Q F++Y  
Sbjct: 93  EQYVYSSEGSLVCLATQTSVERLNSLPQVAANWQGKMSVALFAAGPEEFVVLQYFVTYMR 152

Query: 523 -GNSEALRSRRNIGYHVVYK--------EGNFYPINTLRNVALNQVSTPYVFLLDIDFLP 573
            G  +     R +   + ++        + N YP N +RN+A     T YVFL DID +P
Sbjct: 153 LGKFDCQYPDRTLKALLKFRSLKTLQWRQRNTYPQNHMRNLARKGCQTKYVFLTDIDIVP 212

Query: 574 MFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS----------------- 616
                P L   +R+ +          V+P FE    R  FP S                 
Sbjct: 213 STNSVPQLNHFLRTANCTKSCA---YVIPTFEID-VRATFPRSKNALVRLIRKGLARPFH 268

Query: 617 -------HAPTNFSRWVNATTP----------YQIEWAPDFEPYIVAHRDLPRYDTRFVG 659
                     TNFS+W+++ T              E+   +EP+ ++  + P +D RF+G
Sbjct: 269 EKVFIYNQYATNFSKWLSSNTNDTEVSVSHIVTNFEFL--YEPFYISVDNAPAHDERFLG 326

Query: 660 FGWNK 664
           +G+ +
Sbjct: 327 YGFTR 331


>gi|449668715|ref|XP_002164899.2| PREDICTED: glucoside xylosyltransferase 2-like [Hydra
           magnipapillata]
          Length = 403

 Score = 75.1 bits (183), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 52/179 (29%), Positives = 87/179 (48%), Gaps = 20/179 (11%)

Query: 163 KPWPALGRGFYLAD-------SVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVL 215
           K WPA    + +         S V+   W   K    V    +L L  +L  ++   I +
Sbjct: 163 KTWPAWSNRWMIYSMYNIYYPSSVKKSQW---KKTFKVCATQRLFLSSILA-SIDSVIYI 218

Query: 216 DTDVIFATDIAQLWALFSRLRQRQTIGL---VENQSDWYLGKLWKNHKPWPALGRGFNTG 272
           DTDV+    I +LW + S++   Q IGL    E +++ +  K  +N    P    G N+G
Sbjct: 219 DTDVLMFKPIEKLWKMISQMNSNQIIGLAQECETKANCWYNKYSRNIYFNP---HGLNSG 275

Query: 273 VILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWN 331
           V+L++LT++R ++W    + I + + +++   +  DQDI N   ++HP  VY L C WN
Sbjct: 276 VMLMNLTRMRSLNWE---KAIVDIYKISKSNITYGDQDILNIYFAKHPNQVYLLSCTWN 331



 Score = 42.7 bits (99), Expect = 0.71,   Method: Compositional matrix adjust.
 Identities = 39/140 (27%), Positives = 62/140 (44%), Gaps = 9/140 (6%)

Query: 13  EVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVA--LNILQTLFSTWSVPQ 70
           E I +A V  G +  +  +  + SI       L+FH+ +D      N L+T +  WS   
Sbjct: 114 ESIHIATVACGNDKIQEALVALMSITLTTDYNLYFHIFSDNSKPFENQLKT-WPAWSNRW 172

Query: 71  VEVSFY--LADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 128
           +  S Y     S V+   W   K    V    +L L  +L  ++   I +DTDV+    I
Sbjct: 173 MIYSMYNIYYPSSVKKSQW---KKTFKVCATQRLFLSSILA-SIDSVIYIDTDVLMFKPI 228

Query: 129 AQLWALFSRLRQRQTIGLVE 148
            +LW + S++   Q IGL +
Sbjct: 229 EKLWKMISQMNSNQIIGLAQ 248


>gi|384486619|gb|EIE78799.1| hypothetical protein RO3G_03504 [Rhizopus delemar RA 99-880]
          Length = 330

 Score = 74.7 bits (182), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 62/227 (27%), Positives = 104/227 (45%), Gaps = 43/227 (18%)

Query: 475 DGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFL------SYAGNSEAL 528
           D  DVT+   ++ +R  ++  L   ++GPIS+ ++++D   +Q +       Y  N +  
Sbjct: 18  DSEDVTIATLVTHNRFPVLSRLATRYQGPISVAIHINDDATKQTILKDLHAMYETNPDM- 76

Query: 529 RSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSM 588
              R +  H+V  + +    N  RNVA     T Y+ +LD+DF         L+     M
Sbjct: 77  --GRFVDIHLVVDKLD-RQFNMWRNVAKFFARTEYIMMLDVDFHLCTDFRQSLRKHTDLM 133

Query: 589 DMHGHGGKKVLVVPAFE------TQRYRTAFPAS-----------------------HAP 619
           D+   G K  +VVPAFE       + +RT FP +                       H  
Sbjct: 134 DILRQG-KGAVVVPAFEYLAEEDGEDWRT-FPTTKEALIDIVQQEKIDMFHITWTRGHGS 191

Query: 620 TNFSRWVNATTPYQIE-WAPDFEPYIVAHRD-LPRYDTRFVGFGWNK 664
           TN+++W  +  PY++E +   +EPY++  ++  P  D RFVG+G NK
Sbjct: 192 TNYTKWYASNDPYKVEDYNYSYEPYVIYKKEGSPWCDERFVGYGANK 238


>gi|302844911|ref|XP_002953995.1| hypothetical protein VOLCADRAFT_94735 [Volvox carteri f.
           nagariensis]
 gi|300260807|gb|EFJ45024.1| hypothetical protein VOLCADRAFT_94735 [Volvox carteri f.
           nagariensis]
          Length = 360

 Score = 74.7 bits (182), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 49/169 (28%), Positives = 71/169 (42%), Gaps = 47/169 (27%)

Query: 543 GNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL------------KSSIRSMDM 590
               PINTLRN AL   STP V ++D+D  P   L   +              S+  +  
Sbjct: 136 AGLVPINTLRNAALLAASTPLVAMIDVDLAPSESLGAEVLRPLRLGGGGGGGGSVSELLR 195

Query: 591 HGHGGKKVLVVPAFETQR----------------------------------YRTAFPAS 616
                + V V+PA+ET +                                   + A+   
Sbjct: 196 TCSQQRAVWVLPAWETHKRLGREEGTRVAQMALSGDKVTLRRFQADELIHYFAKLAYARG 255

Query: 617 HAPTNFSRWVNATTPYQIEWAPDFEPYIVAHRDL-PRYDTRFVGFGWNK 664
           HA TNFSRW+++  PY + +  ++EP+ +  R L P YD RF G+GWNK
Sbjct: 256 HAATNFSRWLDSDVPYTVPYEVNYEPWFIGSRALMPPYDVRFRGYGWNK 304


>gi|405969672|gb|EKC34627.1| Glycosyltransferase 8 domain-containing protein 4 [Crassostrea
           gigas]
          Length = 367

 Score = 74.7 bits (182), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 53/188 (28%), Positives = 89/188 (47%), Gaps = 13/188 (6%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L +P +L   +   + +DTD++F + + ++W  FSR    Q I L     D   G  W 
Sbjct: 142 RLFIPSILT-NVDSLLYVDTDILFLSPLEEIWFHFSRFNSTQLIALAPESEDRQTG--WY 198

Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAI 315
           N     P  G  G N+GV+L++LT+LR+ +W    +   +++ L   W    DQD+ N  
Sbjct: 199 NRFARHPYFGELGVNSGVMLMNLTRLRNSTWLSSIKNYYKEYKLKLTW---GDQDLINVY 255

Query: 316 ISEHPYLVYTLPCQWNVQLSDNTRSDELCY-TELTDLKIIHWNSPKKLKVKNKHMEFFRN 374
              HP  +Y  PC+WN +  D+     +C   E     ++H +   +  + N     F+ 
Sbjct: 256 FHFHPDELYVYPCEWNYR-PDHCMYMSVCKGAERGGAYVLHGS---RRAMHNDKQPAFKA 311

Query: 375 LYLTFLEY 382
           +YL F  Y
Sbjct: 312 VYLAFNAY 319



 Score = 45.4 bits (106), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 35/148 (23%), Positives = 72/148 (48%), Gaps = 14/148 (9%)

Query: 15  IQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTW-SVPQVEV 73
           I +++V  G     S +TLIKS + + K+ +  H+  +    + LQ   ++W S+ + + 
Sbjct: 57  IHLSLVVCGDRGNES-ITLIKSAVLFTKSYIIIHIFAEEELHSALQYQLNSWPSLVRKKF 115

Query: 74  SFYLADSVVEDVSWIPNKHYSGVY---GLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQ 130
            ++L   +  D     N+ +  ++      +L +P +L   +   + +DTD++F + + +
Sbjct: 116 EYHLYGIIYPD----SNQEWKMLFKPCASQRLFIPSILT-NVDSLLYVDTDILFLSPLEE 170

Query: 131 LWALFSRLRQRQTIGLV----ENQSDWY 154
           +W  FSR    Q I L     + Q+ WY
Sbjct: 171 IWFHFSRFNSTQLIALAPESEDRQTGWY 198


>gi|358339091|dbj|GAA47216.1| glycosyltransferase-like protein LARGE [Clonorchis sinensis]
          Length = 308

 Score = 74.3 bits (181), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 38/103 (36%), Positives = 56/103 (54%), Gaps = 4/103 (3%)

Query: 188 KHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQ 247
           KHY+G   ++KL  P ++   + K IVLD D++F  DI +LW  FS+    + IG V  Q
Sbjct: 148 KHYAGYSAMMKLMAPYIVDSNVQKVIVLDVDLLFNDDIKELWKYFSQFEPGEAIGAVCEQ 207

Query: 248 SDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFW 290
           S    G+   +        +G NTGV+LLD+++LR   W   W
Sbjct: 208 S----GECQASCSSPAVPPQGINTGVMLLDVSELRAKEWRKMW 246



 Score = 57.8 bits (138), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 49/174 (28%), Positives = 77/174 (44%), Gaps = 11/174 (6%)

Query: 13  EVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHL---ITDTVALNILQTLFSTWSVP 69
           E+I    V  G ++    VTL+KSIL Y +     HL   +T  +    +  +  + S  
Sbjct: 61  EIIHTFSVLHGASAVYQAVTLLKSIL-YSQGQFKDHLPECLTGWIRHKDINCVHHSPSPR 119

Query: 70  QVEVSFYLAD----SVVEDVSWIPN---KHYSGVYGLLKLTLPKVLPETLAKTIVLDTDV 122
           Q     ++ D     ++ D+  I +   KHY+G   ++KL  P ++   + K IVLD D+
Sbjct: 120 QYLNMHWVVDQGSRGLLVDLKLIGHARTKHYAGYSAMMKLMAPYIVDSNVQKVIVLDVDL 179

Query: 123 IFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFYLAD 176
           +F  DI +LW  FS+    + IG V  QS            P   +  G  L D
Sbjct: 180 LFNDDIKELWKYFSQFEPGEAIGAVCEQSGECQASCSSPAVPPQGINTGVMLLD 233


>gi|332375717|gb|AEE62999.1| unknown [Dendroctonus ponderosae]
          Length = 366

 Score = 73.9 bits (180), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 49/161 (30%), Positives = 78/161 (48%), Gaps = 10/161 (6%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP VLP+  A  + +DTD +F T +  +W  F++    Q   L     D  +   W 
Sbjct: 141 RLFLPDVLPDVDA-LLYMDTDTLFLTPVESIWRYFNKFNASQMAALAPEHEDPAIA--WY 197

Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAI 315
           N     P  G+ G N+GV+L++LT++R   W  +   I +K+ L   W    DQDI N I
Sbjct: 198 NRFAKHPYYGKLGVNSGVMLMNLTRMRLFKWTEYVVPIYKKYKLHITW---GDQDIINII 254

Query: 316 ISEHPYLVYTLPCQWNVQLSDNTRSDELC-YTELTDLKIIH 355
              HP  ++   C++N +  D+     +C   E   + +IH
Sbjct: 255 FHYHPGKLFIYSCRYNYR-PDHCMYSSICKAAEKNGVAVIH 294



 Score = 40.8 bits (94), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 42/148 (28%), Positives = 62/148 (41%), Gaps = 17/148 (11%)

Query: 17  VAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQVEVSFY 76
           VA+V  G +     +TLIKS L + K  L+F +I +      L    S W     E +  
Sbjct: 57  VAVVACG-DRLNETLTLIKSALMFSKGYLNFIVIAEDKLKENLNEKLSEWK----EFTHN 111

Query: 77  LADSVVEDVSWIPNKHYSGVYGLLK------LTLPKVLPETLAKTIVLDTDVIFATDIAQ 130
                +  +++ P K  S    L K      L LP VLP+  A  + +DTD +F T +  
Sbjct: 112 GFSYRILPLNF-PTKDGSEWMKLFKPCAAQRLFLPDVLPDVDA-LLYMDTDTLFLTPVES 169

Query: 131 LWALFSRLRQRQTIGLVENQSD----WY 154
           +W  F++    Q   L     D    WY
Sbjct: 170 IWRYFNKFNASQMAALAPEHEDPAIAWY 197


>gi|196011271|ref|XP_002115499.1| hypothetical protein TRIADDRAFT_29667 [Trichoplax adhaerens]
 gi|190581787|gb|EDV21862.1| hypothetical protein TRIADDRAFT_29667 [Trichoplax adhaerens]
          Length = 365

 Score = 73.6 bits (179), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 65/266 (24%), Positives = 102/266 (38%), Gaps = 79/266 (29%)

Query: 478 DVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYM--SDAEAQQFL---------------- 519
           DVTLVA  ++++L ++  L + W GP+S+ ++   +DA   +F                 
Sbjct: 34  DVTLVAHCTVNKLHLLLPLAERWNGPMSIVVFAPGNDARMTRFAIDGLRQCSTYIRDYAT 93

Query: 520 -----------SYAGNSEALRSRRNIGY-----------------HVVYKEGNF------ 545
                       + G +  L + R + Y                 H  Y +G        
Sbjct: 94  FHLVYPTALLPPFDGGTGELFTEREMEYANFCPEVIPNRIKALSSHSSYDDGYVRATRYI 153

Query: 546 -YPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAF 604
            YP N LRN+  +   + YVF++DID +P  GLY  L + +    +     K   VVPAF
Sbjct: 154 PYPSNLLRNIGRDFAKSTYVFVIDIDMVPNKGLYHELINFLEEKKLFDSIEKNAYVVPAF 213

Query: 605 ETQRYR------------TAFPASHA-------PTNFSRWVNATT-------PYQIEWAP 638
           E    R             A P  H        P N+ RW  A +        Y + W  
Sbjct: 214 EINEKRDIPKNKAALREAAARPFYHEVCEKCQRPLNYDRWWKANSNKGVLGPAYTLNWKD 273

Query: 639 DFEPYIVAHRDLPRYDTRFVGFGWNK 664
            +EP+ +     P +D RF  +G+N+
Sbjct: 274 PWEPFYIIDSKSPSFDERFKQYGFNR 299


>gi|313232041|emb|CBY09152.1| unnamed protein product [Oikopleura dioica]
          Length = 510

 Score = 73.6 bits (179), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 63/255 (24%), Positives = 101/255 (39%), Gaps = 72/255 (28%)

Query: 477 NDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNS-EALRS----- 530
           NDVTL  Q S+D+L  +  +   ++GP+S++++    +A    +YA ++   LR      
Sbjct: 81  NDVTLATQGSLDKLSYLTGIADRFDGPLSVSIFCPGYDA----AYADDAIHMLRKCYPKI 136

Query: 531 RRNIGYHVVY---------------------------------KEGNFYPINTLRNVALN 557
             N+ +H+VY                                  E   +P N LRNVA +
Sbjct: 137 SENVMFHLVYPAEQPADLAYTTGWRDLECADFIYQLKAFTSTENEAIIFPHNVLRNVARS 196

Query: 558 QVSTPYVFLLDIDFLPMFGL-YPYLKSSIRSMDMHGHGGKKVLVVPAFETQ--------- 607
            V + +  LLDID  P   L   ++  +  +          V V+PAFE +         
Sbjct: 197 AVHSEFFLLLDIDLAPSLNLRQSFVDYANENNLWEEENDLVVYVLPAFEQKAGTKMPKDK 256

Query: 608 ---------------RYRTAFPASHAPTNFSRWVNATT---PYQIEWAPDFEPYIVAHRD 649
                             T FP  H P  +  W N +     ++  W   +EP+ +  R 
Sbjct: 257 DELINAIEKGNIRPFHNETCFPC-HRPEKYEHWKNTSEKFIAFEAPWEEAWEPFYIGRRS 315

Query: 650 LPRYDTRFVGFGWNK 664
           LP YD RF GFG+++
Sbjct: 316 LPYYDERFKGFGFDR 330


>gi|321473905|gb|EFX84871.1| hypothetical protein DAPPUDRAFT_46504 [Daphnia pulex]
          Length = 405

 Score = 73.6 bits (179), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 63/273 (23%), Positives = 102/273 (37%), Gaps = 77/273 (28%)

Query: 469 EYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEAL 528
           ++ +++  ++V L  Q S+DRL  +     HW G ISL +Y+ D E   F     +    
Sbjct: 23  QFHSTAVSHNVCLATQSSLDRLHWLGESSDHWRGAISLAIYLDDEEVGLFYKVLTHMRRC 82

Query: 529 RSR--RNIGYHVVYK--------------------------------------------- 541
            S    NI +HVV K                                             
Sbjct: 83  HSDLLANIAFHVVSKTETDDGDEGTSASAGHQSTPVNCTGPSEALTWLLKERPSLSAKWE 142

Query: 542 EGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVV 601
            G  YP N +RNVA     + YV LLD+D +P + +   L   +    +  H  K   VV
Sbjct: 143 SGRPYPQNLMRNVARKGCPSHYVLLLDVDVIPSYNMAESLAEFLDRPSVVAHCTKCAFVV 202

Query: 602 PAFE-----------TQR------------YRTAFPASHAPTNFSRWVNATTPYQIEWAP 638
           P +E           TQ             +  AF  +   +N +RW   +   Q+  + 
Sbjct: 203 PTYELDYSADFPQNKTQLLTLENQGRAQPFHHKAFRYNQFASNLTRWRQTSETDQVVISH 262

Query: 639 D-------FEPYIVAHRDLPRYDTRFVGFGWNK 664
           +       +EP+ +A  + P +D RF+G+G+ +
Sbjct: 263 NVTNYEFFYEPFYIARDEAPPHDERFLGYGFTR 295


>gi|346474050|gb|AEO36869.1| hypothetical protein [Amblyomma maculatum]
          Length = 388

 Score = 73.2 bits (178), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 44/138 (31%), Positives = 67/138 (48%), Gaps = 12/138 (8%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSD----WYLG 253
           +L LP +LP+  A  I +D DV+F + I +LW  F+ +       L     D    WY  
Sbjct: 155 RLFLPSMLPDVDA-VIYVDADVLFLSPIEKLWEHFASMNSSHLAALAPESEDYATNWY-- 211

Query: 254 KLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFN 313
           + +  H  +  LG   N+GV+L++LT++RD  W      +   +     W    DQD+ N
Sbjct: 212 RRFARHPFYQPLG--VNSGVMLMNLTRMRDFGWESRLGPLLHNYSRAISW---GDQDLLN 266

Query: 314 AIISEHPYLVYTLPCQWN 331
            + S HP  +   PCQWN
Sbjct: 267 ILFSAHPERLLLFPCQWN 284


>gi|348520260|ref|XP_003447646.1| PREDICTED: N-acetyllactosaminide
           beta-1,3-N-acetylglucosaminyltransferase-like
           [Oreochromis niloticus]
          Length = 431

 Score = 72.8 bits (177), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 44/154 (28%), Positives = 67/154 (43%), Gaps = 35/154 (22%)

Query: 543 GNF-YPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVV 601
           GN  YP N LRN+A     + Y+ ++D+D +P   L+    + I     H     +V V+
Sbjct: 215 GNISYPNNLLRNIARGGTESSYILVIDMDMMPSADLHEQFMAMIAK---HEPASDEVFVL 271

Query: 602 PAFETQR-----------------------YRTAFPASHAPTNFSRWVNA--------TT 630
           PAFE +                        Y    P   APTN+S+WVN+          
Sbjct: 272 PAFEIRHARKMPATKAELVQLYQVGEVRPFYEELCPRCQAPTNYSQWVNSHVRETGTLEV 331

Query: 631 PYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
            Y + W   +EP+ + HR +P YD  F  +G+N+
Sbjct: 332 AYTLTWVDPWEPFYIGHRSVPLYDENFKQYGFNR 365


>gi|384499295|gb|EIE89786.1| hypothetical protein RO3G_14497 [Rhizopus delemar RA 99-880]
          Length = 351

 Score = 72.8 bits (177), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 57/227 (25%), Positives = 100/227 (44%), Gaps = 45/227 (19%)

Query: 476 GNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFL------SYAGNSEALR 529
             D+TL   ++ DR  ++  L  H++GPIS T+Y++  + +  +      +Y  N +   
Sbjct: 96  SKDITLSTIVTNDRFPVLSRLASHYKGPISATVYVNGNDNKDTILQELSKTYRENPDM-- 153

Query: 530 SRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMD 589
            ++ +  H++    +    N  RNVA     T Y+ +LDIDF P      + +  +   D
Sbjct: 154 -KKYVDIHLILDAYD-RQFNMWRNVARLYARTDYIMMLDIDFFPCTD---FRQEILNDAD 208

Query: 590 MHGH--GGKKVLVVPAFETQR----------------------------YRTAFPASHAP 619
           +      G+  LV+PAFE                               +  A+   H  
Sbjct: 209 LLNRLDSGRTALVIPAFEYTHLEDGMDYRHFPRTKADLIQQVQKGKLGMFHRAWQKGHGS 268

Query: 620 TNFSRWVNATTPYQ-IEWAPDFEPYIVAHRD-LPRYDTRFVGFGWNK 664
           T++ +W NAT  Y+ +++   +EPY++  R   P  D RF+G+G NK
Sbjct: 269 TDYVKWYNATAHYRVVDYNYSYEPYVIFKRQGAPWCDERFIGYGANK 315


>gi|66820721|ref|XP_643936.1| hypothetical protein DDB_G0274741 [Dictyostelium discoideum AX4]
 gi|74857595|sp|Q555X4.1|GNT15_DICDI RecName: Full=Glycosyltransferase-like protein gnt15
 gi|60472310|gb|EAL70263.1| hypothetical protein DDB_G0274741 [Dictyostelium discoideum AX4]
          Length = 516

 Score = 72.8 bits (177), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 37/123 (30%), Positives = 64/123 (52%), Gaps = 5/123 (4%)

Query: 470 YEASSD--GNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSD-AEAQQFLSYAGNSE 526
           Y A+ D    D++++ Q ++DR   + M+   W  PIS  +Y++   +  +      NS 
Sbjct: 79  YRANGDLKKYDISIITQFTVDRFDRIAMMADKWRAPISAAVYITSFKDIDEVFKLVRNSF 138

Query: 527 ALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIR 586
           A+     +  H ++     YP+N LRN+AL    T +  LLD+DF+   G+Y YL S++ 
Sbjct: 139 AVTEF--VDLHFLFANKTRYPVNNLRNLALRNARTEWCLLLDVDFISPLGMYDYLHSTLE 196

Query: 587 SMD 589
            +D
Sbjct: 197 KLD 199



 Score = 47.0 bits (110), Expect = 0.030,   Method: Compositional matrix adjust.
 Identities = 19/52 (36%), Positives = 34/52 (65%), Gaps = 1/52 (1%)

Query: 614 PASHAPTNFSRWVNATTPYQIEWAPDFEPYIVAHRD-LPRYDTRFVGFGWNK 664
           P  H PTN+SRW  ++ PY +++   +EP+++ +R  +  YD R  G+G++K
Sbjct: 379 PKCHGPTNYSRWYLSSEPYLVQYKWIYEPFLLYNRSQIHDYDERLKGYGFDK 430


>gi|384490378|gb|EIE81600.1| hypothetical protein RO3G_06305 [Rhizopus delemar RA 99-880]
          Length = 260

 Score = 72.8 bits (177), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 58/217 (26%), Positives = 96/217 (44%), Gaps = 43/217 (19%)

Query: 485 LSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLS------YAGNSEALRSRRNIGYHV 538
           ++ +R +++  L  H++GPIS  ++++D   ++ +       YA N +     R +  H+
Sbjct: 5   VTYNRFKVLSRLASHYKGPISAAIHVNDDHTKEHIIKDLHQLYADNPDM---GRYVDLHL 61

Query: 539 VYKEGNF-YPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKK 597
           V    NF    N  RNVA     T Y+ +LD+DF         LK     M++    GK 
Sbjct: 62  VI--DNFDRQFNMWRNVAKFFARTNYIMMLDVDFYLCTDFRKSLKKQPHLMEL-LRSGKG 118

Query: 598 VLVVPAFE----------------------------TQRYRTAFPASHAPTNFSRWVNAT 629
            +VVPAFE                               +  ++   H  TN+ +W   T
Sbjct: 119 AIVVPAFEYIDESDGEDWNVFPKTKQELLVDVEAEKLDMFHRSWKRGHGSTNYEKWYKQT 178

Query: 630 TPYQI-EWAPDFEPYIVAHRD-LPRYDTRFVGFGWNK 664
            PY++ E+   +EPY++  ++  P  D RFVG+G NK
Sbjct: 179 KPYKVTEYNYSYEPYVIFKKEGTPWCDERFVGYGANK 215


>gi|149411403|ref|XP_001511493.1| PREDICTED: uncharacterized protein C3orf21 homolog [Ornithorhynchus
           anatinus]
          Length = 276

 Score = 72.4 bits (176), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 49/179 (27%), Positives = 81/179 (45%), Gaps = 18/179 (10%)

Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
           F + +S+ +  S     +YS     L + + +++P+ +++ I LD D+ + T+I +L+  
Sbjct: 65  FPIVESMQKHFSAGSGTYYSDSIFFLSVAMHQIMPKEISQIIQLDLDLKYKTNILELFEE 124

Query: 232 FSRLRQRQTIGLVENQSDWYLGKLWKNHK---------PWPALGRGFNTGVILLDLTKLR 282
           F    +   IG+       Y    W+  +         P P    GFN+GV+LLDL  +R
Sbjct: 125 FDNFSEGNVIGIAREMQPVYRHTFWQFRQENPKTRVGDPPPDGLPGFNSGVMLLDLEAMR 184

Query: 283 DISWAGFWRIIAEKFLLTRLWTS------LADQDIFNAIISEHPYLVYTLPCQWNVQLS 335
               +  +  + E   + +L         L DQD F  I  EHP L + L C WN QL 
Sbjct: 185 Q---SQLYNQLLEPARIQQLTEKYHFRGHLGDQDFFTMIGMEHPELFHVLDCTWNRQLC 240


>gi|324518766|gb|ADY47199.1| Glycosyltransferase-like protein [Ascaris suum]
          Length = 189

 Score = 72.4 bits (176), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 44/118 (37%), Positives = 59/118 (50%), Gaps = 31/118 (26%)

Query: 574 MFGLYPYLKSSIRS-MDMHGHGGKKVLVVPAFETQRYRTAF-PAS--------------- 616
           M GLY  ++  + S + + G    K  VVPAFE++ YR++  PAS               
Sbjct: 1   MPGLYELIRKRLSSGLPLKG----KAFVVPAFESKGYRSSVVPASKAELLVMLDTGQVQI 56

Query: 617 ---------HAPTNFSRWVNATTPYQIEWAPDFEPYIVAHR-DLPRYDTRFVGFGWNK 664
                    HA T++ RW    T Y + W  D+EPY+V  R + P YD RFVGFGWNK
Sbjct: 57  FRQDVWIQGHASTDYDRWRTTQTEYSVAWRTDYEPYVVVARNETPPYDNRFVGFGWNK 114


>gi|91090075|ref|XP_969798.1| PREDICTED: similar to GLT8D3 protein [Tribolium castaneum]
          Length = 370

 Score = 72.4 bits (176), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 43/138 (31%), Positives = 70/138 (50%), Gaps = 8/138 (5%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L + +   + +DTD +F T +  +W+ F ++   Q   L     D  +G  W 
Sbjct: 146 RLFLPNILKD-VDSLLYMDTDTLFLTPVENIWSYFHKMNSSQMAALAPEHEDPNVG--WY 202

Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAI 315
           N     P  G+ G N+GV+L++LT++R   W+ +   I  K+ L   W    DQDI N I
Sbjct: 203 NRFAKHPFYGKLGVNSGVMLMNLTRMRAFQWSEYVVPIYRKYKLKITW---GDQDIINII 259

Query: 316 ISEHPYLVYTLPCQWNVQ 333
              HP  +Y   C++N +
Sbjct: 260 FHYHPGKLYLYSCRFNFR 277


>gi|389749357|gb|EIM90534.1| hypothetical protein STEHIDRAFT_50943 [Stereum hirsutum FP-91666
           SS1]
          Length = 468

 Score = 72.4 bits (176), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 49/156 (31%), Positives = 76/156 (48%), Gaps = 13/156 (8%)

Query: 11  TCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVP- 69
           T E +  +++    +S      LIKS + Y   PL FH+I D  A N L+  F     P 
Sbjct: 104 TLEPVVFSLIMWSESSAIEGALLIKSAIMYTTRPLEFHIICDEEAQNHLEKKFLLLEHPA 163

Query: 70  -QVEVSFY------LADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDV 122
             + + FY      + D +  +   I   H +GV GL+KL + ++LP+++ K+I +DTD 
Sbjct: 164 QNILLRFYRVSRQSMTDRIQRE-GGIHTDHSAGVPGLMKLFIHEILPDSVRKSIYVDTDA 222

Query: 123 IFATDIAQLWALFSRLRQRQTIGLV----ENQSDWY 154
            F TD A LW  F  LR    I +     +N  +W+
Sbjct: 223 FFITDPALLWQTFDTLRPSTAISMPTHYDQNSPEWH 258



 Score = 58.2 bits (139), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 48/173 (27%), Positives = 73/173 (42%), Gaps = 38/173 (21%)

Query: 185 IPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLV 244
           I   H +GV GL+KL + ++LP+++ K+I +DTD  F TD A LW  F  LR    I + 
Sbjct: 188 IHTDHSAGVPGLMKLFIHEILPDSVRKSIYVDTDAFFITDPALLWQTFDTLRPSTAISMP 247

Query: 245 ----ENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDISW--AGFWRIIAEKFL 298
               +N  +W+                   + V+LLDL+KLR +    +  +R       
Sbjct: 248 THYDQNSPEWHHAS-------------KICSCVMLLDLSKLRSLRLMDSSIYRADPTSRY 294

Query: 299 LTRL-------------------WTSLADQDIFNAIISEHPYLVYTLPCQWNV 332
            + L                     +L DQ  F AI+S+ P +   L   W V
Sbjct: 295 PSALSPPLFRSLFGPPGPEDHYEGVALGDQGYFWAIVSQRPDIFEHLSFDWEV 347


>gi|219115597|ref|XP_002178594.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217410329|gb|EEC50259.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 417

 Score = 72.4 bits (176), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 67/242 (27%), Positives = 101/242 (41%), Gaps = 64/242 (26%)

Query: 478 DVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEA----QQFLSYAGNSEALRSRRN 533
           D TLV QLS DRL M+    K W G  +++L +   E+    +Q LS +G   AL     
Sbjct: 111 DFTLVTQLSDDRLAMMRPHCKRW-GKHTISLAIGTNESRDTVEQALSKSGCDTAL----- 164

Query: 534 IGYHVV--YKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMH 591
           I   +V  +     YP+N LRNVA++QV T +  ++D DF+    LY  L    +++   
Sbjct: 165 ITLSIVRDFDSDQKYPVNKLRNVAMSQVRTSHAVIIDADFVLSPNLYETLHLHNKTL--- 221

Query: 592 GHGGKKVLVVPAFETQR------------YRTAFPAS----------------------- 616
                  LV+P+FE ++            Y    P +                       
Sbjct: 222 AADSTNALVIPSFELRKACRRRNRRCITMYSAMVPRNKDGLLELYDPMTEDSAGYGIAQF 281

Query: 617 --------HAPTNFSRWVNATT----PYQIEWAPDFEPYIVAH--RDLPRYDTRFVGFGW 662
                   HA T ++ W +       P +   +  +EPY+V    RDLP +   FVG+G 
Sbjct: 282 DIRGNYHGHASTRYADWASQPAEQLLPIECVTSDRYEPYLVVRHCRDLPPFQEAFVGYGQ 341

Query: 663 NK 664
           NK
Sbjct: 342 NK 343


>gi|323450957|gb|EGB06836.1| hypothetical protein AURANDRAFT_65363 [Aureococcus anophagefferens]
          Length = 2492

 Score = 72.0 bits (175), Expect = 8e-10,   Method: Composition-based stats.
 Identities = 52/153 (33%), Positives = 79/153 (51%), Gaps = 11/153 (7%)

Query: 468 DEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEA 527
           D    +++GN VT+V Q ++DRL  V  + + W+GP+SL + +   +       A    A
Sbjct: 427 DAPSTTANGN-VTIVTQSTLDRLWNVGHICERWKGPLSLGMMVKHEDPSAVKRDAKRIMA 485

Query: 528 LRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYP--YLKSSI 585
                 +   VV K  + YP+N LRN+A ++ +TP++FLLD DF P  G Y   +L+ +I
Sbjct: 486 ACPEVRVSV-VVSKPEDKYPVNRLRNLAWSKATTPWIFLLDADFWP--GAYARFFLEQAI 542

Query: 586 RSMDMHGHGGKKVLVVPAFETQRYRTAFPASHA 618
                   G +  LVVPAF  Q +   F   HA
Sbjct: 543 ARSSF---GPRTALVVPAFAFQPHE--FLPGHA 570


>gi|432861231|ref|XP_004069565.1| PREDICTED: glucoside xylosyltransferase 1-like [Oryzias latipes]
          Length = 443

 Score = 72.0 bits (175), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 58/223 (26%), Positives = 95/223 (42%), Gaps = 30/223 (13%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L E +   + +DTD++F   +  +WAL S     Q   +     +  +G  W 
Sbjct: 208 RLFLPLILKE-VDSLLYVDTDILFLQPVEDIWALLSLFDSSQLAAMAPEHEEPRIG--WY 264

Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
           N     P  G+ G N+GV+L+++T+LR+           + W      + +K+ L   W 
Sbjct: 265 NRFARHPYYGKTGVNSGVMLMNMTRLREKYFKNDMTAVALKWEEMLMPLLQKYKLNITW- 323

Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWNSPKKLKV 364
              DQD+ N I   +P  +Y  PCQWN +             E   + I+H N   +   
Sbjct: 324 --GDQDLLNIIFHHNPESLYVFPCQWNYRPDHCMYGSNCAQAEREGVFILHGN---RGVY 378

Query: 365 KNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQTTLQQAELS 407
            N     F+ +Y T  +Y         FG NL  + L+  E +
Sbjct: 379 HNDKQPAFQAVYKTIQKYQ--------FGENLETSLLKPLEAA 413



 Score = 40.4 bits (93), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 36/152 (23%), Positives = 65/152 (42%), Gaps = 15/152 (9%)

Query: 16  QVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQVEVSF 75
            VA+V  G     +L T++KS + + K P+HF++  +    +  +    TW   +V+  F
Sbjct: 123 HVAVVACGPRLEETL-TMVKSAVLFSKKPVHFYIFAEDELHDSFRDALITWPR-EVKAKF 180

Query: 76  YLADSVVEDVSWIPNKHYSGVYGLLK------LTLPKVLPETLAKTIVLDTDVIFATDIA 129
                 +      P ++ +    L K      L LP +L E +   + +DTD++F   + 
Sbjct: 181 TFTIYPIT----FPKENANEWKKLFKPCASQRLFLPLILKE-VDSLLYVDTDILFLQPVE 235

Query: 130 QLWALFSRLRQRQTIGLVENQSDWYLGKLWKN 161
            +WAL S     Q   +     +  +G  W N
Sbjct: 236 DIWALLSLFDSSQLAAMAPEHEEPRIG--WYN 265


>gi|242001678|ref|XP_002435482.1| glycosyltransferase domain-containing protein, putative [Ixodes
           scapularis]
 gi|215498818|gb|EEC08312.1| glycosyltransferase domain-containing protein, putative [Ixodes
           scapularis]
          Length = 269

 Score = 71.6 bits (174), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 48/158 (30%), Positives = 79/158 (50%), Gaps = 7/158 (4%)

Query: 188 KHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQ 247
           K   G     +L LP +L E  A  + +D+D++F   + +LW++F  +   Q  GL  + 
Sbjct: 85  KELFGPCASQRLFLPSMLSEEDA-VLYVDSDIVFFRPVEELWSIFDNMDDMQLAGLAPDV 143

Query: 248 SDWYLGKLWKNHKPWPALGR-GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSL 306
            D Y G ++ ++      GR G N+GVIL++LT++R     G   I+            L
Sbjct: 144 ED-YNGSVYMHNWKTRYYGRYGLNSGVILMNLTRMRAY---GLESIVTNLMNKYHSVMKL 199

Query: 307 ADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELC 344
            DQD+ N +  + P  +Y LPC+W+V L  + R+   C
Sbjct: 200 PDQDLLNMVFHDDPGRLYELPCRWDV-LPPDCRAPASC 236



 Score = 38.9 bits (89), Expect = 9.7,   Method: Compositional matrix adjust.
 Identities = 39/147 (26%), Positives = 65/147 (44%), Gaps = 14/147 (9%)

Query: 14  VIQVAIVCAGYNSTRSLVTL--IKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQV 71
           V  V + CA    TR  +T+  +KS + + + PL   L  D   +  LQ        P  
Sbjct: 7   VTMVVVTCA----TRLELTMNNLKSAVSFSRAPLRLLLYADVENIKRLQDRVRIREWPSS 62

Query: 72  ---EVSFYLADSVVEDVSWIPN--KHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 126
               V++ L   +VE     P+  K   G     +L LP +L E  A  + +D+D++F  
Sbjct: 63  VLQRVTYSL--RLVEFPKKSPHNWKELFGPCASQRLFLPSMLSEEDA-VLYVDSDIVFFR 119

Query: 127 DIAQLWALFSRLRQRQTIGLVENQSDW 153
            + +LW++F  +   Q  GL  +  D+
Sbjct: 120 PVEELWSIFDNMDDMQLAGLAPDVEDY 146


>gi|47210788|emb|CAF91098.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 457

 Score = 71.6 bits (174), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 44/150 (29%), Positives = 67/150 (44%), Gaps = 34/150 (22%)

Query: 546 YPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFE 605
           YP N LRNVA +   + YV ++DID +P   L+ +  + I   +  GH   +V V+PAFE
Sbjct: 219 YPNNLLRNVARSGTESTYVLVIDIDMVPSADLHQHFLAMIARHEPAGH---EVFVLPAFE 275

Query: 606 TQR-----------------------YRTAFPASHAPTNFSRWVNA--------TTPYQI 634
            +                        Y    P   APTN+S+WVN+           Y +
Sbjct: 276 IRHARKIPASKAELVQLYQVGEVRPFYEELCPRCQAPTNYSQWVNSHVRETGTLEVAYTV 335

Query: 635 EWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
            W   +EP+ +    +P YD  F  +G+N+
Sbjct: 336 PWTDPWEPFYIGPNSVPLYDESFKQYGFNR 365


>gi|327289393|ref|XP_003229409.1| PREDICTED: LOW QUALITY PROTEIN: n-acetyllactosaminide
           beta-1,3-N-acetylglucosaminyltransferase-like [Anolis
           carolinensis]
          Length = 407

 Score = 71.6 bits (174), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 70/256 (27%), Positives = 96/256 (37%), Gaps = 75/256 (29%)

Query: 478 DVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDA------------------------ 513
           D+ LV   S+  L   + L + W+GP S+ ++   A                        
Sbjct: 92  DMVLVTHTSLGNLHHAQRLAERWQGPFSVAVFAPGAAEVRLAVTMIYAIAALCSPLRQLL 151

Query: 514 ---------EAQQFLS-------YAGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALN 557
                    +A  F S         G+  A  SR    +H  Y     YP N LRNVA  
Sbjct: 152 RLHLVCHADQAATFFSGDVARLETCGDVFAKLSRIGSRWHN-YGANASYPNNLLRNVAKK 210

Query: 558 QVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFE---TQR------ 608
            V   +  ++D+D +P  GL     +      M G     V VVPAFE   T+R      
Sbjct: 211 AVGDHWALVVDMDMVPSEGLREAFLALRNDETMPG-----VFVVPAFEIRHTRRIPGTKA 265

Query: 609 --------------YRTAFPASHAPTNFSRWVN--ATTP----YQIEWAPDFEPYIVAHR 648
                         Y    P   APTN+S+WVN  A  P    Y +EW   +EP+ V   
Sbjct: 266 ELLRLYQVGETRPFYEELCPRCQAPTNYSQWVNLPAGGPLRVAYDVEWRDPWEPFYVGPP 325

Query: 649 DLPRYDTRFVGFGWNK 664
             P YD RF  +G+N+
Sbjct: 326 SAPPYDERFKQYGFNR 341


>gi|402225614|gb|EJU05675.1| hypothetical protein DACRYDRAFT_113733 [Dacryopinax sp. DJM-731
           SS1]
          Length = 455

 Score = 71.2 bits (173), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 65/227 (28%), Positives = 105/227 (46%), Gaps = 42/227 (18%)

Query: 473 SSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRR 532
           S D + +T+   ++  R  ++  L  H+ GPIS+ +       Q  LS     E   S  
Sbjct: 132 SPDPSAITITTLVTPSRYPVLSRLASHYNGPISVAVPF-----QSPLSLPSLHEFYTSNP 186

Query: 533 NIGYHV-VYKEGNFYP--INTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMD 589
            I  HV ++   N  P   NT RNVA     T  + +LD+DF P       ++ S  +++
Sbjct: 187 AIVEHVDIHLILNSPPRAFNTWRNVARAFARTDTILMLDVDFYPCTPFAQRIQGSQIALN 246

Query: 590 MHGHGGKKVLVVPAFE-----TQRYRTAFPAS-----------------------HAPTN 621
           +  +    +LV+PAFE       +Y + FP +                       HA TN
Sbjct: 247 LLENNA--MLVLPAFEYASIDAGKYTSLFPVAKRDLLPLLENGEIGQFHGAWRPGHAATN 304

Query: 622 FSRWVNATTP--YQI-EWAPDFEPYIVAHRD-LPRYDTRFVGFGWNK 664
           +SR+ NAT+   Y++ ++ P +EPY++  R+  P  D RF+G+G NK
Sbjct: 305 YSRFANATSGEIYKVKDYHPSYEPYVIFQRENAPWCDERFIGYGANK 351


>gi|301772162|ref|XP_002921506.1| PREDICTED: uncharacterized protein C3orf21 homolog [Ailuropoda
           melanoleuca]
          Length = 339

 Score = 71.2 bits (173), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 57/208 (27%), Positives = 90/208 (43%), Gaps = 24/208 (11%)

Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
           F + +++ +  S     +YS     L + + +++P+ + + I LD D+ + T+I +L+  
Sbjct: 128 FPVVEAMQKHFSAGSGTYYSDSIFFLSVAMHQIMPKEILRIIQLDLDLKYKTNIRELFEE 187

Query: 232 FSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPA---------LGRGFNTGVILLDLTKLR 282
           F        IG+       Y    W+     P             GFN+GV+LL+L  +R
Sbjct: 188 FDNFLPGAIIGIAREMQPVYRHTFWQFRHENPKTRVGGPPPEGLPGFNSGVMLLNLEAMR 247

Query: 283 DISW------AGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLS- 335
             +        G  R +A+K+   R    L DQD F  I  EHP L + L C WN QL  
Sbjct: 248 QSALYSRLLEPGQVRQLADKY---RFRGHLGDQDFFTMIGMEHPELFHVLDCTWNRQLCT 304

Query: 336 ---DNTRSD--ELCYTELTDLKIIHWNS 358
              D+  SD  E  +     +KI H N 
Sbjct: 305 WWRDHGYSDVFEAYFRCEGHVKIYHGNC 332


>gi|403418633|emb|CCM05333.1| predicted protein [Fibroporia radiculosa]
          Length = 491

 Score = 71.2 bits (173), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 42/129 (32%), Positives = 64/129 (49%), Gaps = 7/129 (5%)

Query: 25  NSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVP--QVEVSFYLAD--- 79
           +S      L+KSI+ Y   P+ FH+I D VA   L+      + P   + V FY      
Sbjct: 124 DSAAEGAVLMKSIIMYSSVPIEFHVICDDVAQTFLEKRLRLVTRPARNILVRFYRLSWSS 183

Query: 80  --SVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSR 137
             + +E    I   H +G  GL+KL + ++LP ++ + I +DTD  F TD AQLW  F+ 
Sbjct: 184 MVARIEREGAITTDHSAGTPGLMKLFIHEILPPSVPRAIFIDTDAFFITDPAQLWDAFTH 243

Query: 138 LRQRQTIGL 146
           L+    I +
Sbjct: 244 LKPTTAIAM 252



 Score = 52.8 bits (125), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 24/65 (36%), Positives = 37/65 (56%)

Query: 179 VEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQR 238
           +E    I   H +G  GL+KL + ++LP ++ + I +DTD  F TD AQLW  F+ L+  
Sbjct: 188 IEREGAITTDHSAGTPGLMKLFIHEILPPSVPRAIFIDTDAFFITDPAQLWDAFTHLKPT 247

Query: 239 QTIGL 243
             I +
Sbjct: 248 TAIAM 252


>gi|443704974|gb|ELU01746.1| hypothetical protein CAPTEDRAFT_195175 [Capitella teleta]
          Length = 346

 Score = 71.2 bits (173), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 52/180 (28%), Positives = 85/180 (47%), Gaps = 21/180 (11%)

Query: 174 LADSVV---EDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWA 230
           LAD+V    E  S +   ++  +   +   + +V+P  + ++I++D D+ F  DI  L+ 
Sbjct: 135 LADTVYSMQELFSSVELPYFQNILFHISPAIHRVMPTFVHRSIMIDVDLKFLGDIRLLYD 194

Query: 231 LFSRLRQRQTIGLVENQSDWY--LGKLWKNHKPWPALGR-------GFNTGVILLDLTKL 281
            F        IG+   Q   Y  L K +++  P   +G        GFN+GVILLDL K+
Sbjct: 195 QFRHFNDTHLIGIANEQQPTYRHLFKEYRHQNPRTIIGDPPPNGLPGFNSGVILLDLEKM 254

Query: 282 RDISWAGFWRIIAEKFLLTRLWTS------LADQDIFNAIISEHPYLVYTLPCQWNVQLS 335
           R+   +  +  + EK  +  L         L DQD +  +   H  + Y LPC++N QLS
Sbjct: 255 RN---SHLYNTLLEKEKIEELAKHFMLSGYLGDQDFYTLVNLVHREMFYQLPCEFNRQLS 311


>gi|195996077|ref|XP_002107907.1| hypothetical protein TRIADDRAFT_18847 [Trichoplax adhaerens]
 gi|190588683|gb|EDV28705.1| hypothetical protein TRIADDRAFT_18847 [Trichoplax adhaerens]
          Length = 329

 Score = 71.2 bits (173), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 47/136 (34%), Positives = 68/136 (50%), Gaps = 8/136 (5%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L +T    I LDTD++F      LW  F++    Q + +     D  +   W 
Sbjct: 105 RLFLPYLLKKT-DSLIYLDTDILFMRPPEDLWNFFNKFNATQLVAVSPEGED--IRTSWY 161

Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAI 315
           N     P  G+ G N+ VIL++LT++R  +W      + +K+   R    L DQDI NAI
Sbjct: 162 NKFAQHPYYGKLGINSDVILMNLTRMRKANWNDKVTKVYQKY---RSSIVLGDQDILNAI 218

Query: 316 ISEHPYLVYTLPCQWN 331
              HP +VY L C WN
Sbjct: 219 FHFHPEMVYVLTCAWN 234


>gi|391325616|ref|XP_003737327.1| PREDICTED: N-acetyllactosaminide
           beta-1,3-N-acetylglucosaminyltransferase-like
           [Metaseiulus occidentalis]
          Length = 381

 Score = 71.2 bits (173), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 60/261 (22%), Positives = 94/261 (36%), Gaps = 74/261 (28%)

Query: 478 DVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRNIGYH 537
           DVTLV Q +++ L  +E L   W GP+S+ ++ +D            + + + R N+ +H
Sbjct: 54  DVTLVTQTTLNHLGHIESLAGGWGGPLSVAIFATDLPDVVEAIIQLRTCSSQVRHNVSFH 113

Query: 538 VVY----------------------------------------KEGNFYPINTLRNVALN 557
           +VY                                          G+ +P+N LRNV   
Sbjct: 114 LVYPLRSKSLSGHVHPSGASYLKDGGCATLVERLSDLSRKTNYDHGDEFPVNLLRNVGRK 173

Query: 558 QVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDM----HGHGGKKVLVVPAFETQR----- 608
              T +VF++D+D     GL        R   +    +  G  +V VVPAFE        
Sbjct: 174 NSRTQFVFVVDVDLKVNAGLRDAFVHFARVNSLFTQQYAEGPNEVFVVPAFEALETIQMP 233

Query: 609 ------------------YRTAFPASHAPTNFSRWVNATTP-------YQIEWAPDFEPY 643
                             Y       H  T++  W+ A          Y + W   +EP+
Sbjct: 234 KDKYELVDLLSNNQARPFYTELCHKCHKHTDYETWLKAPVDGNRLEVLYHVTWKDPWEPF 293

Query: 644 IVAHRDLPRYDTRFVGFGWNK 664
             AH   P YD RF  +G+N+
Sbjct: 294 YFAHNSAPLYDERFRQYGFNR 314


>gi|410970721|ref|XP_003991826.1| PREDICTED: xyloside xylosyltransferase 1, partial [Felis catus]
          Length = 259

 Score = 71.2 bits (173), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 57/208 (27%), Positives = 91/208 (43%), Gaps = 24/208 (11%)

Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
           F + +++ +  S     +YS     L + + +++P+ + + I LD D+ + T+I +L+  
Sbjct: 48  FPVVEAMQKHFSAGSGTYYSDSIFFLSVAMHQIMPKEILRIIQLDLDLKYKTNIRELFEE 107

Query: 232 FSRLRQRQTIGLVENQSDWYLGKLWK-NHK--------PWPALGRGFNTGVILLDLTKLR 282
           F        IG+       Y    W+  H+        P P    GFN+GV+LL+L  +R
Sbjct: 108 FDNFLPGAVIGIAREMQPVYRHTFWQFRHENPKTRVGSPPPEGLPGFNSGVMLLNLEAMR 167

Query: 283 DISWAGFW------RIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLS- 335
                G        + +A+K+        L DQD F  I  EHP L + L C WN QL  
Sbjct: 168 QSPLYGRLLEPSQVQQLADKY---HFRGHLGDQDFFTMIGMEHPELFHVLDCTWNRQLCT 224

Query: 336 ---DNTRSD--ELCYTELTDLKIIHWNS 358
              D+  SD  E  +     +KI H N 
Sbjct: 225 WWRDHGYSDVFEAYFRCEGHVKIYHGNC 252


>gi|390597324|gb|EIN06724.1| hypothetical protein PUNSTDRAFT_71944 [Punctularia strigosozonata
           HHB-11173 SS5]
          Length = 385

 Score = 71.2 bits (173), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 65/231 (28%), Positives = 103/231 (44%), Gaps = 42/231 (18%)

Query: 473 SSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMS----DAEAQQFLSYAGNSEAL 528
           S   +D+T+   ++ +RLQ +  L  H++GP+S  +++S     A A+   +  G   + 
Sbjct: 56  SPQKDDITITTIVTPNRLQRLADLVDHYQGPVSAAVHISTSNATACAEMLTALHGIYRSG 115

Query: 529 RSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDF--LPMFGLYPYLKSSIR 586
             RR +  H+V    +    NT RN+A     T YV LLD+DF     FG     K + R
Sbjct: 116 SLRRYLDVHLVLDPFD-RQFNTWRNIARLFARTDYVMLLDVDFWLCTDFGS-ALKKRASR 173

Query: 587 SMDMHGHGGKKVLVVPAFETQRYRTA-----FP-----------------------ASHA 618
            +     GG   LV+PAFE  +         FP                         H 
Sbjct: 174 EVVAKLDGGNAALVIPAFEYTKQEEGRDPSLFPRDKETLLSLVRQERLGMFHRSWAPGHN 233

Query: 619 PTNFSRWVNATTPYQI----EWAPDFEPYIVAHRDLPRY-DTRFVGFGWNK 664
            T+++R+  A  P Q+    ++   +EPY++  +D P + D RFVG+G NK
Sbjct: 234 STDYNRYY-AAPPGQVYRVTQYQAAYEPYVIFKKDGPPWCDERFVGYGGNK 283


>gi|195997643|ref|XP_002108690.1| hypothetical protein TRIADDRAFT_18350 [Trichoplax adhaerens]
 gi|190589466|gb|EDV29488.1| hypothetical protein TRIADDRAFT_18350, partial [Trichoplax
           adhaerens]
          Length = 323

 Score = 70.9 bits (172), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 48/164 (29%), Positives = 80/164 (48%), Gaps = 11/164 (6%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L +T A  I LDTD++F      LW  F +L   Q   +     D   G  W 
Sbjct: 95  RLFLPDILVKTDA-LIYLDTDILFMRPPEDLWQFFDKLNSTQLAAVSPEGEDAPTG--WY 151

Query: 258 N-HKPWPALGR-GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAI 315
           N +   P  G  G N+G++L++LT++R  +W     I+ + +   +      DQDI N +
Sbjct: 152 NKYARHPYYGELGINSGIMLMNLTRMRKANWND---IVLKLYQEYKSGIVFGDQDILNVV 208

Query: 316 ISEHPYLVYTLPCQWNVQLSDNTR--SDELCYTELTDLKIIHWN 357
              HP ++YT  C WN +  D+ R   ++  + +   + ++H N
Sbjct: 209 FHFHPEMLYTFTCDWNYR-PDHCRYGRNDCKHADKNGISLLHGN 251


>gi|281339157|gb|EFB14741.1| hypothetical protein PANDA_010392 [Ailuropoda melanoleuca]
          Length = 308

 Score = 70.9 bits (172), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 57/208 (27%), Positives = 90/208 (43%), Gaps = 24/208 (11%)

Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
           F + +++ +  S     +YS     L + + +++P+ + + I LD D+ + T+I +L+  
Sbjct: 98  FPVVEAMQKHFSAGSGTYYSDSIFFLSVAMHQIMPKEILRIIQLDLDLKYKTNIRELFEE 157

Query: 232 FSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPA---------LGRGFNTGVILLDLTKLR 282
           F        IG+       Y    W+     P             GFN+GV+LL+L  +R
Sbjct: 158 FDNFLPGAIIGIAREMQPVYRHTFWQFRHENPKTRVGGPPPEGLPGFNSGVMLLNLEAMR 217

Query: 283 DISW------AGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLS- 335
             +        G  R +A+K+   R    L DQD F  I  EHP L + L C WN QL  
Sbjct: 218 QSALYSRLLEPGQVRQLADKY---RFRGHLGDQDFFTMIGMEHPELFHVLDCTWNRQLCT 274

Query: 336 ---DNTRSD--ELCYTELTDLKIIHWNS 358
              D+  SD  E  +     +KI H N 
Sbjct: 275 WWRDHGYSDVFEAYFRCEGHVKIYHGNC 302


>gi|410908409|ref|XP_003967683.1| PREDICTED: glucoside xylosyltransferase 1-like [Takifugu rubripes]
          Length = 408

 Score = 70.5 bits (171), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 51/198 (25%), Positives = 88/198 (44%), Gaps = 22/198 (11%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L + +   + +DTD+IF   + ++WAL SR        +     +  +G  W 
Sbjct: 169 RLFLPLILKD-VDSLLYVDTDIIFLQPVEEIWALLSRFNSSHLAAMAPEHEEPRIG--WY 225

Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
           N     P  G+ G N+GV+L++LT++R+           + W      + +K+ L   W 
Sbjct: 226 NRFARHPYYGKTGINSGVMLMNLTRIREKLFKNDMTTMLLRWEEILMPLLQKYKLNITW- 284

Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWNSPKKLKV 364
              DQD+ N I   +P  ++  PCQWN +             E   + I+H N   +   
Sbjct: 285 --GDQDLLNIIFHHNPESLHVFPCQWNYRPDHCIYGSNCREAEHDGVFILHGN---RGVF 339

Query: 365 KNKHMEFFRNLYLTFLEY 382
            +     FR +Y  FL++
Sbjct: 340 HDDKQPAFRAVYEAFLKF 357



 Score = 43.5 bits (101), Expect = 0.39,   Method: Compositional matrix adjust.
 Identities = 35/149 (23%), Positives = 62/149 (41%), Gaps = 5/149 (3%)

Query: 13  EVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQVE 72
           E   +A+V  G     +L T++KS L   + PLHF++  +    +  Q   ++W      
Sbjct: 83  ERSHLAVVACGPRLEETL-TMLKSALLLSRKPLHFYIFAEEDLHDSFQNTLNSWPR-TTR 140

Query: 73  VSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLW 132
            SF +        +    K         +L LP +L + +   + +DTD+IF   + ++W
Sbjct: 141 FSFTIYPITFPSENGKEWKKLFKPCASQRLFLPLILKD-VDSLLYVDTDIIFLQPVEEIW 199

Query: 133 ALFSRLRQRQTIGLVENQSDWYLGKLWKN 161
           AL SR        +     +  +G  W N
Sbjct: 200 ALLSRFNSSHLAAMAPEHEEPRIG--WYN 226


>gi|348513043|ref|XP_003444052.1| PREDICTED: LOW QUALITY PROTEIN: glucoside xylosyltransferase 1-like
           [Oreochromis niloticus]
          Length = 442

 Score = 70.5 bits (171), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 59/223 (26%), Positives = 93/223 (41%), Gaps = 30/223 (13%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L E +   + +DTD+IF   +  +WAL S     Q   +     +  +G  W 
Sbjct: 201 RLFLPLILKE-VDSLLYVDTDIIFLQPVEDMWALLSHFNSSQLAAMAPEHEEPRIG--WY 257

Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
           N     P  G+ G N+GV+L+++T+LR+           + W      + +K+ L   W 
Sbjct: 258 NRFARHPYYGKTGINSGVMLMNMTRLREKFFKNDMTTVVLKWEEMLMPLLQKYKLNITW- 316

Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWNSPKKLKV 364
              DQD+ N I   +P  +Y  PCQWN +             E   + I+H N   +   
Sbjct: 317 --GDQDLLNIIFHHNPESLYVFPCQWNYRPDHCIYGSNCQQAEQEGVFILHGN---RGVY 371

Query: 365 KNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQTTLQQAELS 407
            +     FR +Y    +Y         FG NL  + L   E S
Sbjct: 372 HDDKQPAFRAVYEAIQKYQ--------FGDNLETSLLGPLETS 406



 Score = 42.7 bits (99), Expect = 0.60,   Method: Compositional matrix adjust.
 Identities = 37/155 (23%), Positives = 67/155 (43%), Gaps = 15/155 (9%)

Query: 13  EVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQVE 72
           E   +A+V  G       +T++KS + + K PLHF++  +       +    +W   +V 
Sbjct: 113 ERSHLAVVACG-PRLEETITMLKSAVLFSKKPLHFYIFAEDELHKSFRNTLESWPW-RVR 170

Query: 73  VSFYLADSVVEDVSWIPNKHYSGVYGLLK------LTLPKVLPETLAKTIVLDTDVIFAT 126
             F   +S +  +++ P ++      L K      L LP +L E +   + +DTD+IF  
Sbjct: 171 ARF---NSTIYPITF-PTENAREWKKLFKPCASQRLFLPLILKE-VDSLLYVDTDIIFLQ 225

Query: 127 DIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKN 161
            +  +WAL S     Q   +     +  +G  W N
Sbjct: 226 PVEDMWALLSHFNSSQLAAMAPEHEEPRIG--WYN 258


>gi|323451610|gb|EGB07487.1| hypothetical protein AURANDRAFT_71805 [Aureococcus anophagefferens]
          Length = 529

 Score = 70.5 bits (171), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 52/179 (29%), Positives = 86/179 (48%), Gaps = 11/179 (6%)

Query: 442 DGNDVTLVAQLSMDRLQMVEINPNNV------DEYEASSDGNDVTLVAQLSMDRLQMVEM 495
           +G    LVA  + + L+ +   P  V      D    +++GN VT+V Q + DRL  +  
Sbjct: 21  EGGRPALVASAAPENLRALSEWPGGVPPGCEPDAPSTTANGN-VTIVTQSTTDRLWNLGH 79

Query: 496 LFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVA 555
           + + W+GP+SL + ++  +       A    A     ++   +V +    YP+N LRN+A
Sbjct: 80  VCERWKGPLSLGMTVTPEDPGGVTQDAKRIMAACPEVHVSV-LVREPDQKYPVNRLRNLA 138

Query: 556 LNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFP 614
             + +TP++FLLD DF P       L+ +I        G +  LVVPAF     + A+P
Sbjct: 139 WAKATTPWIFLLDADFWPGADARFLLEQTIARSSF---GPRTALVVPAFALDLGKPAWP 194


>gi|195028392|ref|XP_001987060.1| GH21705 [Drosophila grimshawi]
 gi|193903060|gb|EDW01927.1| GH21705 [Drosophila grimshawi]
          Length = 463

 Score = 70.5 bits (171), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 54/199 (27%), Positives = 85/199 (42%), Gaps = 40/199 (20%)

Query: 174 LADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFS 233
           + D ++   S  PN +YS     L L L ++  ++L + I+ D D++F +DI  L+A F 
Sbjct: 232 ITDKLLPYFSSSPNSYYSDSLFFLSLGLHRIADKSLDRAILFDCDIVFRSDIRLLFAEFD 291

Query: 234 RLRQRQTIGLVENQSDWYLGKLWKNHKPWPA-----------LGR--------------- 267
           R    Q  GL    +  Y   L++    +P            LG                
Sbjct: 292 RFLPHQLFGLAPELTPVYRHILYRYRTRFPKSNFGNPYNPIQLGGKKNNENKLVSSRNRD 351

Query: 268 -----GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTS------LADQDIFNAII 316
                G N+GV+L+ L ++RD   +  +  +  +F + RL         L DQD F  + 
Sbjct: 352 RHGYPGLNSGVVLMLLNRIRD---SKTYIEMLTQFEVNRLVAKYSFKGHLGDQDFFTLLG 408

Query: 317 SEHPYLVYTLPCQWNVQLS 335
            E+P L+Y L C WN QL 
Sbjct: 409 YEYPNLIYRLDCVWNRQLC 427



 Score = 53.1 bits (126), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 41/150 (27%), Positives = 68/150 (45%), Gaps = 14/150 (9%)

Query: 36  SILFYRKNPLHFHLITDTVA----LNILQT---------LFSTWSVPQV-EVSFYLADSV 81
           S+L +    LH H+ITD  +    L ILQ          +++ + V    E+   + D +
Sbjct: 177 SLLKHTNAQLHLHIITDKKSEVSVLEILQRKIRKFKRILIYTMYDVKICSEIIQDITDKL 236

Query: 82  VEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQR 141
           +   S  PN +YS     L L L ++  ++L + I+ D D++F +DI  L+A F R    
Sbjct: 237 LPYFSSSPNSYYSDSLFFLSLGLHRIADKSLDRAILFDCDIVFRSDIRLLFAEFDRFLPH 296

Query: 142 QTIGLVENQSDWYLGKLWKNHKPWPALGRG 171
           Q  GL    +  Y   L++    +P    G
Sbjct: 297 QLFGLAPELTPVYRHILYRYRTRFPKSNFG 326


>gi|326925929|ref|XP_003209159.1| PREDICTED: uncharacterized protein C3orf21 homolog [Meleagris
           gallopavo]
          Length = 266

 Score = 70.5 bits (171), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 52/208 (25%), Positives = 91/208 (43%), Gaps = 24/208 (11%)

Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
           F + +++ +  S     +YS     L + + +++P+ + + I +D D+ +  +I  L+  
Sbjct: 55  FPIVEAMQKHFSAGSGTYYSDSIFFLSVAMHRIMPKEITQIIQVDLDLKYKANIRDLFEE 114

Query: 232 FSRLRQRQTIGLVENQSDWYLGKLWKNHK---------PWPALGRGFNTGVILLDLTKLR 282
           F   ++   IG+       Y    W+  +         P P    GFN+GV+LL+L  +R
Sbjct: 115 FDNFQEGAVIGIAREMQPVYRHTFWQYRRENPQTRVGEPPPDGLPGFNSGVLLLNLEAMR 174

Query: 283 ------DISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLS- 335
                  +   G  + + EK+        L DQD F  +  EHP L + L C WN QL  
Sbjct: 175 RSKLYNQLLEPGMVQKLTEKY---HFKGHLGDQDFFTMVGMEHPELFHVLDCTWNRQLCT 231

Query: 336 ---DNTRSD--ELCYTELTDLKIIHWNS 358
              D+  SD  +  +    ++KI H N 
Sbjct: 232 WWKDHGYSDVFDQYFRCDGEVKIYHGNC 259


>gi|449509892|ref|XP_002189200.2| PREDICTED: xyloside xylosyltransferase 1 [Taeniopygia guttata]
          Length = 313

 Score = 70.1 bits (170), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 52/200 (26%), Positives = 84/200 (42%), Gaps = 36/200 (18%)

Query: 169 GRGFYLADSVVEDVSWIP----------NKHYSGVYG--------LLKLTLPKVLPETLA 210
           GRG      +V DV+ +            KH+S   G         L + + +++P+ + 
Sbjct: 81  GRGLLRHPVIVHDVNELTEKLFPIVEAMQKHFSAGSGTYYSDSIFFLSVAMHRIMPKEIT 140

Query: 211 KTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLW---------KNHKP 261
           + I +D D+ + T+I  L+  F    +   IG+       Y    W         K  +P
Sbjct: 141 RIIQVDLDLKYKTNIRDLFDEFDNFPEGAVIGIAREMQPVYRHTFWQYRRENPQTKVGEP 200

Query: 262 WPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTS------LADQDIFNAI 315
            P    GFN+GV+LL+L  +R    +  +  + E  ++ +L         L DQD F  +
Sbjct: 201 PPDGLPGFNSGVLLLNLEAMRQ---SKLYNQLLEPTMVQKLTEKYHFKGHLGDQDFFTMV 257

Query: 316 ISEHPYLVYTLPCQWNVQLS 335
             EHP L + L C WN QL 
Sbjct: 258 GMEHPELFHVLDCTWNRQLC 277


>gi|156379954|ref|XP_001631720.1| predicted protein [Nematostella vectensis]
 gi|156218765|gb|EDO39657.1| predicted protein [Nematostella vectensis]
          Length = 389

 Score = 70.1 bits (170), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 56/185 (30%), Positives = 85/185 (45%), Gaps = 19/185 (10%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVE----NQSDWYLG 253
           +L LP +L E +   I +D D +F + +  LW  FS     Q   +      + + WY  
Sbjct: 170 RLFLPDILTE-VDSLIYMDIDTLFLSPVQWLWDQFSNFNSSQLASMTPEGEVSATGWY-- 226

Query: 254 KLWKNHKPWPALGR-GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIF 312
             +  H   P  G+ G N+GV+L++LT++R   W      I  K+     W    DQDI 
Sbjct: 227 NRFARH---PYYGKLGLNSGVMLMNLTRMRSFGWQEKILPIYNKYRFDITW---GDQDIL 280

Query: 313 NAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTD-LKIIHWNSPKKLKVKNKHMEF 371
           N +   HP LVY L C+WN + +D+      C T  T+ + I+H N   +    N     
Sbjct: 281 NILFHYHPELVYVLSCEWNYR-NDHCIYGNNCKTADTNGIYILHGN---RGVFTNNKQPV 336

Query: 372 FRNLY 376
           FR +Y
Sbjct: 337 FRAVY 341


>gi|47224410|emb|CAG08660.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 319

 Score = 70.1 bits (170), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 48/173 (27%), Positives = 78/173 (45%), Gaps = 19/173 (10%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L E +   + +DTD+IF   + ++WAL S     Q   +     +  +G  W 
Sbjct: 89  RLFLPMILKE-VDSLLYVDTDIIFLQPVEEIWALLSHFNSSQLAAMAPEHEEPRIG--WY 145

Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
           N     P  G+ G N+GV+L++LT+LR+           + W      + +K+ L   W 
Sbjct: 146 NRFARHPYFGKTGINSGVMLMNLTRLREKLFKNDMTTVLLRWDEILMPLLQKYKLNITW- 204

Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
              DQD+ N I   +P  ++  PCQWN +             E   + I+H N
Sbjct: 205 --GDQDLLNIIFYHNPESLHVFPCQWNYRPDHCIYGSNCKEAEHEGVFILHGN 255



 Score = 44.3 bits (103), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 37/150 (24%), Positives = 62/150 (41%), Gaps = 5/150 (3%)

Query: 13  EVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTW-SVPQV 71
           E   VA+V  G     +L T++KS +     PLHF++  +    +  Q   ++W    Q 
Sbjct: 1   ERSHVAVVACGPRLEETL-TMLKSAVLLSVKPLHFYIFAEDDLHDSFQNSLNSWPRTVQT 59

Query: 72  EVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQL 131
             SF +        +    K         +L LP +L E +   + +DTD+IF   + ++
Sbjct: 60  RFSFTIYPITFPSENGKEWKKLFKPCASQRLFLPMILKE-VDSLLYVDTDIIFLQPVEEI 118

Query: 132 WALFSRLRQRQTIGLVENQSDWYLGKLWKN 161
           WAL S     Q   +     +  +G  W N
Sbjct: 119 WALLSHFNSSQLAAMAPEHEEPRIG--WYN 146


>gi|405972582|gb|EKC37344.1| Uncharacterized protein C3orf21-like protein [Crassostrea gigas]
          Length = 371

 Score = 70.1 bits (170), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 51/156 (32%), Positives = 74/156 (47%), Gaps = 21/156 (13%)

Query: 196 LLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLV-ENQSDW-YLG 253
            L + L +V+P  L K I+LD D+ F  DI  L+ +F    +R  IG+  ENQ  + +L 
Sbjct: 180 FLSIVLHRVIP--LDKVIMLDVDLKFDEDIRLLYNIFDEFSERSIIGIARENQPVYRHLF 237

Query: 254 KLWKNHKPWPALG-------RGFNTGVILLDLTKLRD-------ISWAGFWRIIAEKFLL 299
             ++   P   +G        GFN+GV+LLDL K+R        I        + +K+L 
Sbjct: 238 HQYRGKNPSTRVGAPPPDGLTGFNSGVLLLDLEKMRKSQVYNSLIDSPKTLETLTQKYLF 297

Query: 300 TRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLS 335
                 L DQD +  +  EH  L + L C WN QL 
Sbjct: 298 K---GHLGDQDFYTLVSMEHEDLFHILLCSWNRQLC 330



 Score = 41.6 bits (96), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 40/150 (26%), Positives = 67/150 (44%), Gaps = 15/150 (10%)

Query: 33  LIKSILFYRKNPLHFHLITD----TVALNILQTLFSTWSVPQVEVSFYLADSVVEDVSWI 88
           L++SI       L FH+I D    T+A +I++    T +   +     +   V E +S +
Sbjct: 103 LVESISRKSSISLCFHVICDEDGKTIATSIIERAAPTETQIIIHDITIMTQKVKEQISQM 162

Query: 89  -------PNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQR 141
                     +Y      L + L +V+P  L K I+LD D+ F  DI  L+ +F    +R
Sbjct: 163 QTFFSHSSKSYYGDPLFFLSIVLHRVIP--LDKVIMLDVDLKFDEDIRLLYNIFDEFSER 220

Query: 142 QTIGLV-ENQSDW-YLGKLWKNHKPWPALG 169
             IG+  ENQ  + +L   ++   P   +G
Sbjct: 221 SIIGIARENQPVYRHLFHQYRGKNPSTRVG 250


>gi|195122692|ref|XP_002005845.1| GI20695 [Drosophila mojavensis]
 gi|193910913|gb|EDW09780.1| GI20695 [Drosophila mojavensis]
          Length = 463

 Score = 70.1 bits (170), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 54/204 (26%), Positives = 86/204 (42%), Gaps = 45/204 (22%)

Query: 174 LADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFS 233
           + D +    S  PN +YS     + L L ++   +L++ I+ D D++F +DI  L+A F 
Sbjct: 228 ITDKLSPYFSSAPNSYYSDSLFFMSLGLHRIADASLSRAILFDCDIVFRSDIRLLFAEFD 287

Query: 234 RLRQRQTIGLVENQSDWYLGKLWKNHKPWP-----------ALGR--------------- 267
           R    Q  GL    +  Y   L++    +P            +G+               
Sbjct: 288 RFLPNQLFGLAPELTPVYRHILYRYRTQFPESNFGNPYQPIQIGKNKNNNNKNKNNKLLT 347

Query: 268 ----------GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTS------LADQDI 311
                     G N+GV+L+ L ++RD S    +  I  ++ + RL T       L DQD 
Sbjct: 348 SRSQERHGYPGLNSGVVLILLNRIRDSST---YIEILTQYEINRLVTKYSFKGHLGDQDF 404

Query: 312 FNAIISEHPYLVYTLPCQWNVQLS 335
           F  +  E+P L+Y L C WN QL 
Sbjct: 405 FTLLGYEYPTLIYRLDCVWNRQLC 428



 Score = 53.1 bits (126), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 39/160 (24%), Positives = 69/160 (43%), Gaps = 28/160 (17%)

Query: 33  LIKSILFYRKNPLHFHLITD----TVALNILQT---------LFSTWSVPQVEVSFYLAD 79
            I S+  +    LH H+ITD       L ILQ          ++S +++        +  
Sbjct: 170 FIHSLFKHTNAQLHLHVITDKESEVAVLEILQRKVRRFKRVLIYSMYNIQ-------ICS 222

Query: 80  SVVEDV--------SWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQL 131
           ++++D+        S  PN +YS     + L L ++   +L++ I+ D D++F +DI  L
Sbjct: 223 NIIQDITDKLSPYFSSAPNSYYSDSLFFMSLGLHRIADASLSRAILFDCDIVFRSDIRLL 282

Query: 132 WALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRG 171
           +A F R    Q  GL    +  Y   L++    +P    G
Sbjct: 283 FAEFDRFLPNQLFGLAPELTPVYRHILYRYRTQFPESNFG 322


>gi|363737325|ref|XP_422707.3| PREDICTED: uncharacterized protein C3orf21 homolog [Gallus gallus]
          Length = 339

 Score = 70.1 bits (170), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 52/208 (25%), Positives = 91/208 (43%), Gaps = 24/208 (11%)

Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
           F + +++ +  S     +YS     L + + +++P+ + + I +D D+ +  +I  L+  
Sbjct: 128 FPIVEAMQKHFSAGSGTYYSDSIFFLSVAMHRIMPKEITQIIQVDLDLKYKANIRDLFEE 187

Query: 232 FSRLRQRQTIGLVENQSDWYLGKLWKNHK---------PWPALGRGFNTGVILLDLTKLR 282
           F   ++   IG+       Y    W+  +         P P    GFN+GV+LL+L  +R
Sbjct: 188 FDNFQEGAVIGIAREMQPVYRHTFWQYRRENPQTRVGEPPPDGLPGFNSGVLLLNLEAMR 247

Query: 283 ------DISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLS- 335
                  +   G  + + EK+        L DQD F  +  EHP L + L C WN QL  
Sbjct: 248 RSKLYNQLLEPGMVQKLTEKYHFK---GHLGDQDFFTMVGMEHPELFHVLDCTWNRQLCT 304

Query: 336 ---DNTRSD--ELCYTELTDLKIIHWNS 358
              D+  SD  +  +    ++KI H N 
Sbjct: 305 WWKDHGYSDVFDQYFRCEGEVKIYHGNC 332


>gi|21594165|gb|AAH31419.1| Expressed sequence AI480653 [Mus musculus]
          Length = 379

 Score = 70.1 bits (170), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 57/211 (27%), Positives = 93/211 (44%), Gaps = 30/211 (14%)

Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
           F + +++ +  S     +YS     L + + +++P+ + + I LD D+ + T+I +L+  
Sbjct: 168 FPVVEAMQKYFSAGSGTYYSDSIFFLSVAMHQIMPKEIPRIIQLDLDLKYKTNIRELFEE 227

Query: 232 FSRLRQRQTIGLVENQSDWYLGKLWK-NHK--------PWPALGRGFNTGVILLDLTKLR 282
           F        IG+       Y    W+  H+        P P    GFN+GV+LL+L  +R
Sbjct: 228 FDNFLPGAVIGIAREMQPVYRHTFWQFRHENPKTRVGDPPPEGLPGFNSGVMLLNLEAMR 287

Query: 283 ---------DISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQ 333
                    + SW    + +A+K+        L DQD F  I  EHP L + L C WN Q
Sbjct: 288 QSPLYSHLLEPSWV---QQLADKY---HFRGHLGDQDFFTMIGMEHPELFHVLDCTWNRQ 341

Query: 334 LS----DNTRSD--ELCYTELTDLKIIHWNS 358
           L     D+  SD  +  +     +KI H N 
Sbjct: 342 LCTWWRDHGYSDVFQAYFRCEGHVKIYHGNC 372


>gi|12805437|gb|AAH02191.1| B3gnt1 protein, partial [Mus musculus]
          Length = 241

 Score = 70.1 bits (170), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 47/148 (31%), Positives = 67/148 (45%), Gaps = 34/148 (22%)

Query: 546 YPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFE 605
           YP N LRN+A  + +  Y  ++D+D +P  GL+  L+     +D   H     LVVPAFE
Sbjct: 33  YPNNLLRNLAREEAN--YALVIDVDMVPSEGLWRGLREM---LDQSNHWDGTALVVPAFE 87

Query: 606 TQRYRTA-----------------------FPASHAPTNFSRWVNATT------PYQIEW 636
            +R R                             HAPTN+SRWVN          Y + W
Sbjct: 88  IRRSRRMPMNKNELVQLYQVGEVRPFYYGLCTPCHAPTNYSRWVNLPEESLLRPAYVVPW 147

Query: 637 APDFEPYIVAHRDLPRYDTRFVGFGWNK 664
              +EP+ VA   +P +D RF  +G+N+
Sbjct: 148 RDPWEPFYVAGGKVPTFDERFRQYGFNR 175


>gi|390349079|ref|XP_789869.3| PREDICTED: glucoside xylosyltransferase 1-like [Strongylocentrotus
           purpuratus]
          Length = 582

 Score = 69.7 bits (169), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 50/207 (24%), Positives = 92/207 (44%), Gaps = 19/207 (9%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +LP  +   + +DTD++F   + ++++ F +    Q   L        +G  W 
Sbjct: 351 RLFLPDLLP-NVDSLLYVDTDILFIRPVEEIYSFFKQFNSTQMAALAPEHEAKNIG--WY 407

Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAI 315
           N     P  G  G N+GV+L++LT++R   W      +  ++ L   W    DQD+ N +
Sbjct: 408 NRFARHPYYGETGMNSGVMLMNLTRMRRFGWTQTIVPLYHEYKLKITW---GDQDLINIL 464

Query: 316 ISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWNSPKKLKVKNKHMEFFRNL 375
             ++P  ++   C+WN +      S+    T    + +IH N   +    N     FR +
Sbjct: 465 FHDYPQGLFPYACEWNYRPDHCMYSNNCHRTNEHGIAVIHGN---RGVYHNMKQPAFRAI 521

Query: 376 YLTFLEYDGNLLRRELFGCNLTQTTLQ 402
           Y +F +Y         FG +L +  ++
Sbjct: 522 YESFRDY--------TFGADLEKGLIE 540



 Score = 44.7 bits (104), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 38/155 (24%), Positives = 69/155 (44%), Gaps = 16/155 (10%)

Query: 14  VIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLIT-DTVALNILQTLFSTWSVPQVE 72
           VI +++VC G  +  +LV ++KS     +  +HFH+   D +   I Q L +  SV + +
Sbjct: 263 VINLSVVCCGDRTNETLV-MLKSATVLTQQLMHFHIFAEDELQAGIAQRLNAWPSVYRRK 321

Query: 73  VSFYLADSVVEDVSWIPN------KHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 126
            +F L       +S+ P       K         +L LP +LP  +   + +DTD++F  
Sbjct: 322 FAFSL-----HPISFPPGEDPAKWKKVFKTCATQRLFLPDLLP-NVDSLLYVDTDILFIR 375

Query: 127 DIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKN 161
            + ++++ F +    Q   L        +G  W N
Sbjct: 376 PVEEIYSFFKQFNSTQMAALAPEHEAKNIG--WYN 408


>gi|256355176|ref|NP_941028.2| xyloside xylosyltransferase 1 [Mus musculus]
 gi|341940318|sp|Q3U4G3.2|XXLT1_MOUSE RecName: Full=Xyloside xylosyltransferase 1; AltName:
           Full=UDP-xylose:alpha-xyloside
           alpha-1,3-xylosyltransferase
          Length = 392

 Score = 69.7 bits (169), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 57/211 (27%), Positives = 93/211 (44%), Gaps = 30/211 (14%)

Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
           F + +++ +  S     +YS     L + + +++P+ + + I LD D+ + T+I +L+  
Sbjct: 181 FPVVEAMQKYFSAGSGTYYSDSIFFLSVAMHQIMPKEIPRIIQLDLDLKYKTNIRELFEE 240

Query: 232 FSRLRQRQTIGLVENQSDWYLGKLWK-NHK--------PWPALGRGFNTGVILLDLTKLR 282
           F        IG+       Y    W+  H+        P P    GFN+GV+LL+L  +R
Sbjct: 241 FDNFLPGAVIGIAREMQPVYRHTFWQFRHENPKTRVGDPPPEGLPGFNSGVMLLNLEAMR 300

Query: 283 ---------DISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQ 333
                    + SW    + +A+K+        L DQD F  I  EHP L + L C WN Q
Sbjct: 301 QSPLYSHLLEPSWV---QQLADKY---HFRGHLGDQDFFTMIGMEHPELFHVLDCTWNRQ 354

Query: 334 LS----DNTRSD--ELCYTELTDLKIIHWNS 358
           L     D+  SD  +  +     +KI H N 
Sbjct: 355 LCTWWRDHGYSDVFQAYFRCEGHVKIYHGNC 385


>gi|125829752|ref|XP_694206.2| PREDICTED: uncharacterized protein C3orf21 homolog [Danio rerio]
 gi|326680849|ref|XP_002663856.2| PREDICTED: uncharacterized protein C3orf21 homolog [Danio rerio]
          Length = 387

 Score = 69.7 bits (169), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 49/178 (27%), Positives = 77/178 (43%), Gaps = 18/178 (10%)

Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
           F + +++ +  S     +YS     L + + +++P  L + + LD D+ F ++I  L+  
Sbjct: 176 FPIVEAMQKHFSAGSGAYYSDAIFFLSVAMHRIMPAELKRIVQLDLDLKFRSNIRDLFQE 235

Query: 232 FSRLRQRQTIGLVENQSDWYLGKLWKNHK---------PWPALGRGFNTGVILLDLTKLR 282
           F+       IG+       Y    W+  K         P P    GFN+GV+LL+L  +R
Sbjct: 236 FNHFPSDAVIGIAREMQPVYRHTFWQYRKENPKSRVGDPPPDGLPGFNSGVMLLNLDSMR 295

Query: 283 DISWAGFWRIIAEKFLLTRLWTS------LADQDIFNAIISEHPYLVYTLPCQWNVQL 334
               +  +  + E   + RL         L DQD F  I  EH  L Y L C WN QL
Sbjct: 296 R---STVYNQLLEPERVARLADKYHFRGHLGDQDFFTMIGMEHTELFYPLTCGWNRQL 350


>gi|74178412|dbj|BAE32468.1| unnamed protein product [Mus musculus]
          Length = 392

 Score = 69.7 bits (169), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 57/211 (27%), Positives = 93/211 (44%), Gaps = 30/211 (14%)

Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
           F + +++ +  S     +YS     L + + +++P+ + + I LD D+ + T+I +L+  
Sbjct: 181 FPVVEAMQKYFSAGSGTYYSDSIFFLSVAMHQIMPKEIPRIIQLDLDLKYKTNIRELFEE 240

Query: 232 FSRLRQRQTIGLVENQSDWYLGKLWK-NHK--------PWPALGRGFNTGVILLDLTKLR 282
           F        IG+       Y    W+  H+        P P    GFN+GV+LL+L  +R
Sbjct: 241 FDNFLPGAVIGIAREMQPVYRHTFWQFRHENPKTRVGDPPPEGLPGFNSGVMLLNLEAMR 300

Query: 283 ---------DISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQ 333
                    + SW    + +A+K+        L DQD F  I  EHP L + L C WN Q
Sbjct: 301 QSPLYSHLLEPSWV---QQLADKY---HFRGHLGDQDFFTMIGMEHPELFHVLDCTWNRQ 354

Query: 334 LS----DNTRSD--ELCYTELTDLKIIHWNS 358
           L     D+  SD  +  +     +KI H N 
Sbjct: 355 LCTWWRDHGYSDVFQAYFRCEGHVKIYHGNC 385


>gi|242214080|ref|XP_002472865.1| predicted protein [Postia placenta Mad-698-R]
 gi|220728071|gb|EED81973.1| predicted protein [Postia placenta Mad-698-R]
          Length = 492

 Score = 69.7 bits (169), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 46/150 (30%), Positives = 76/150 (50%), Gaps = 10/150 (6%)

Query: 5   EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
           +EP  P  E +  +++    +S      L+KSIL Y   P+ FH++ D  A   L+    
Sbjct: 123 KEPPKPP-EAVVFSLIMFSEDSAVEGAMLMKSILMYTSMPVEFHIVCDEAAQAYLENRVR 181

Query: 65  TWSVP--QVEVSFYL------ADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTI 116
             + P   + V FYL      AD +  + + I   H +GV GL+KL + ++LP ++ + I
Sbjct: 182 LITRPAHDILVRFYLLSWQSMADRIDREGT-IMTDHSAGVPGLMKLFIHEILPPSVPRAI 240

Query: 117 VLDTDVIFATDIAQLWALFSRLRQRQTIGL 146
            +DTD +F +D A LW  F++L     I +
Sbjct: 241 FIDTDALFISDPALLWDRFAQLAPGAAIAI 270



 Score = 48.1 bits (113), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 22/59 (37%), Positives = 36/59 (61%)

Query: 185 IPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGL 243
           I   H +GV GL+KL + ++LP ++ + I +DTD +F +D A LW  F++L     I +
Sbjct: 212 IMTDHSAGVPGLMKLFIHEILPPSVPRAIFIDTDALFISDPALLWDRFAQLAPGAAIAI 270


>gi|270013726|gb|EFA10174.1| hypothetical protein TcasGA2_TC012364 [Tribolium castaneum]
          Length = 326

 Score = 69.7 bits (169), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 39/121 (32%), Positives = 61/121 (50%), Gaps = 7/121 (5%)

Query: 215 LDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNH-KPWPALGR-GFNTG 272
           +DTD +F T +  +W+ F ++   Q   L     D  +G  W N     P  G+ G N+G
Sbjct: 118 MDTDTLFLTPVENIWSYFHKMNSSQMAALAPEHEDPNVG--WYNRFAKHPFYGKLGVNSG 175

Query: 273 VILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNV 332
           V+L++LT++R   W+ +   I  K+ L   W    DQDI N I   HP  +Y   C++N 
Sbjct: 176 VMLMNLTRMRAFQWSEYVVPIYRKYKLKITW---GDQDIINIIFHYHPGKLYLYSCRFNF 232

Query: 333 Q 333
           +
Sbjct: 233 R 233


>gi|322792843|gb|EFZ16676.1| hypothetical protein SINV_09024 [Solenopsis invicta]
          Length = 359

 Score = 69.3 bits (168), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 45/136 (33%), Positives = 68/136 (50%), Gaps = 8/136 (5%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP VL +T A  + +DTD +F     ++W  F R+   Q   L     D   G  W 
Sbjct: 136 RLFLPTVLNDTDA-VLYVDTDTLFLAPPEKIWDEFKRMNSSQLAALSPEHEDPNTG--WY 192

Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAI 315
           N     P  G+ G N+GV+L++LT++R+  W  +   I +++ L   W    DQDI N I
Sbjct: 193 NRFAKHPYYGKLGVNSGVMLMNLTRMREFRWTDYVIPIHKEYKLKITW---GDQDIINII 249

Query: 316 ISEHPYLVYTLPCQWN 331
              HP  +Y   C++N
Sbjct: 250 FHYHPEKLYVYSCRYN 265



 Score = 40.0 bits (92), Expect = 4.3,   Method: Compositional matrix adjust.
 Identities = 38/155 (24%), Positives = 65/155 (41%), Gaps = 19/155 (12%)

Query: 11  TCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQ 70
           T EV    +VC   +     +T++KS L +   PL F ++ +   +       S W +  
Sbjct: 46  TSEVTICVVVCG--DRLHEALTMLKSALVFTSRPLQFIILAEQNLIPAFDEKLSEWRL-- 101

Query: 71  VEVSFYLADSVVEDVSWIPNKHYSGVYGLL-------KLTLPKVLPETLAKTIVLDTDVI 123
             +S    D +V  +++ P+     ++  L       +L LP VL +T A  + +DTD +
Sbjct: 102 --ISNKTFDFLVRPITF-PDGSDVAMWKKLFKPCAAQRLFLPTVLNDTDA-VLYVDTDTL 157

Query: 124 FATDIAQLWALFSRLRQRQTIGLVENQSD----WY 154
           F     ++W  F R+   Q   L     D    WY
Sbjct: 158 FLAPPEKIWDEFKRMNSSQLAALSPEHEDPNTGWY 192


>gi|293352262|ref|XP_344028.3| PREDICTED: xyloside xylosyltransferase 1-like [Rattus norvegicus]
          Length = 392

 Score = 69.3 bits (168), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 50/179 (27%), Positives = 81/179 (45%), Gaps = 18/179 (10%)

Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
           F + +++ +  S     +YS     L + + +++P+ + + I LD D+ + T+I +L+  
Sbjct: 181 FPVVEAMQKYFSAGSGTYYSDSIFFLSVAMHQIMPKEILRIIQLDLDLKYKTNIRELFEE 240

Query: 232 FSRLRQRQTIGLVENQSDWYLGKLWK-NHK--------PWPALGRGFNTGVILLDLTKLR 282
           F        IG+       Y    W+  H+        P P    GFN+GV+LL+L  +R
Sbjct: 241 FDNFLPGAVIGIAREMQPVYRHTFWQFRHENPKTRVGDPPPEGLPGFNSGVMLLNLEAMR 300

Query: 283 DISWAGF-----W-RIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLS 335
                G      W + +A+K+        L DQD F  I  EHP L + L C WN QL 
Sbjct: 301 QSPLYGHLLEPAWVQQLADKY---HFRGHLGDQDFFTMIGMEHPELFHVLDCTWNRQLC 356


>gi|66811776|ref|XP_640067.1| hypothetical protein DDB_G0282469 [Dictyostelium discoideum AX4]
 gi|74855026|sp|Q54SH2.1|GNT13_DICDI RecName: Full=Glycosyltransferase-like protein gnt13
 gi|60468084|gb|EAL66094.1| hypothetical protein DDB_G0282469 [Dictyostelium discoideum AX4]
          Length = 635

 Score = 69.3 bits (168), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 38/111 (34%), Positives = 63/111 (56%), Gaps = 8/111 (7%)

Query: 479 VTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSD-----AEAQQFLSYAGNS-EALRSRR 532
           +T+V Q ++DRL  V M+ + W+ P+S+ +++ D     AE ++   +  N   AL    
Sbjct: 112 ITIVTQTTVDRLSKVSMMAEKWKAPLSVAVFIKDSENPKAEIEKLEKHIENDYPALSYYS 171

Query: 533 NIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKS 583
           +I  H++      YP+N LRN+A+    T  VF++D DFLP  GL+ Y+ S
Sbjct: 172 DI--HILISNKTRYPVNNLRNLAIEHSRTDLVFIMDADFLPPLGLHDYILS 220



 Score = 50.1 bits (118), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 21/50 (42%), Positives = 37/50 (74%), Gaps = 3/50 (6%)

Query: 617 HAPTNFSRWVNATTPYQIEWAPDFEPYIVAHR--DLPRYDTRFVGFGWNK 664
           H+PTNF +W++A  PY++E+   FEPY+V ++  ++P +D R  G+G++K
Sbjct: 505 HSPTNFEKWMDAVEPYEVEYKWIFEPYLVFNKTQNIP-FDERLKGYGFDK 553


>gi|357614662|gb|EHJ69200.1| putative N-acetyllactosaminide
           beta-1,3-N-acetylglucosaminyltransferase [Danaus
           plexippus]
          Length = 459

 Score = 69.3 bits (168), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 62/267 (23%), Positives = 100/267 (37%), Gaps = 73/267 (27%)

Query: 468 DEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYA----- 522
           D Y  +S    V L  Q S++RL  +  +  HW GPIS+ ++++  E +   ++      
Sbjct: 107 DMYADASSNRRVCLATQSSIERLHELLGIAAHWTGPISVAVFVAGDELRLLRAFEMWLFR 166

Query: 523 --------------------------------GNSEALRSRRNIGYHVVYKEGNFYPINT 550
                                            N   L +       V ++  + YP N 
Sbjct: 167 CQPDVYARLALHVAMPAERPGVQSNVPNWARNCNVAPLATEERKSETVAWRARHPYPQNH 226

Query: 551 LRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRYR 610
           LRN+A     TPYVFL+D+D +P  G+   L S + S+           VVP +E  +  
Sbjct: 227 LRNLARKNCHTPYVFLVDVDIVPSRGMAEALDSFLSSVP---KCQMCAYVVPTYELDKRV 283

Query: 611 TAFPASHA------------------------PTNFSRWV----NATTPYQIEWAPD--- 639
             FP + +                         +NFSRW     N +    I        
Sbjct: 284 ANFPTNKSELLRLSRNKLAIPFHRKVFIYNQYASNFSRWESSGGNESLSTHISHTVTNFE 343

Query: 640 --FEPYIVAHRDLPRYDTRFVGFGWNK 664
             +EP+ VA   +P +D RF+G+G+ +
Sbjct: 344 LLYEPFYVAPDTVPAHDERFLGYGFTR 370


>gi|293340644|ref|XP_002724719.1| PREDICTED: xyloside xylosyltransferase 1-like [Rattus norvegicus]
          Length = 391

 Score = 68.9 bits (167), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 48/162 (29%), Positives = 74/162 (45%), Gaps = 18/162 (11%)

Query: 189 HYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQS 248
           +YS     L + + +++P+ + + I LD D+ + T+I +L+  F        IG+     
Sbjct: 197 YYSDSIFFLSVAMHQIMPKEILRIIQLDLDLKYKTNIRELFEEFDNFLPGAVIGIAREMQ 256

Query: 249 DWYLGKLWK-NHK--------PWPALGRGFNTGVILLDLTKLRDISWAGF-----W-RII 293
             Y    W+  H+        P P    GFN+GV+LL+L  +R     G      W + +
Sbjct: 257 PVYRHTFWQFRHENPKTRVGDPPPEGLPGFNSGVMLLNLEAMRQSPLYGHLLEPAWVQQL 316

Query: 294 AEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLS 335
           A+K+        L DQD F  I  EHP L + L C WN QL 
Sbjct: 317 ADKY---HFRGHLGDQDFFTMIGMEHPELFHVLDCTWNRQLC 355


>gi|307196398|gb|EFN77987.1| Glycosyltransferase 8 domain-containing protein 3 [Harpegnathos
           saltator]
          Length = 367

 Score = 68.9 bits (167), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 44/136 (32%), Positives = 68/136 (50%), Gaps = 8/136 (5%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP VL +T A  + +DTD +F     ++W  F ++   Q   L     D   G  W 
Sbjct: 145 RLFLPTVLNDTDA-VLYVDTDTLFLAPPEKIWDEFKKMNSSQLAALSPEHEDPNTG--WY 201

Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAI 315
           N     P  G+ G N+GV+L++LT++R+  W  +   I +++ L   W    DQDI N I
Sbjct: 202 NRFAKHPYYGKLGVNSGVMLMNLTRMREFRWTDYVIPIHKEYRLKITW---GDQDIINII 258

Query: 316 ISEHPYLVYTLPCQWN 331
              HP  +Y   C++N
Sbjct: 259 FHYHPEKLYVYSCRYN 274


>gi|354466014|ref|XP_003495471.1| PREDICTED: uncharacterized protein C3orf21 homolog, partial
           [Cricetulus griseus]
          Length = 254

 Score = 68.9 bits (167), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 49/179 (27%), Positives = 80/179 (44%), Gaps = 18/179 (10%)

Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
           F + +++ +  S     +YS     L + + ++LP+ + + I LD D+ + T+I +L+  
Sbjct: 43  FPIVEAMQKHFSAGSGTYYSDSIFFLSVAMHQILPKEILRIIQLDLDLKYKTNIRELFEE 102

Query: 232 FSRLRQRQTIGLVENQSDWYLGKLWK-NHK--------PWPALGRGFNTGVILLDLTKLR 282
           F        IG+       Y    W+  H+        P P    GFN+GV+LL+L  +R
Sbjct: 103 FDNFLPGAVIGIAREMQPVYRHTFWQFRHENPKTRVGDPPPEGLPGFNSGVMLLNLEAMR 162

Query: 283 D------ISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLS 335
                  +      + +A+K+        L DQD F  I  EHP L + L C WN QL 
Sbjct: 163 QSPLYSHLLEPARVQQLADKY---HFRGHLGDQDFFTMIGMEHPELFHVLDCTWNRQLC 218


>gi|307186594|gb|EFN72111.1| Glycosyltransferase 8 domain-containing protein 3 [Camponotus
           floridanus]
          Length = 368

 Score = 68.9 bits (167), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 56/196 (28%), Positives = 88/196 (44%), Gaps = 23/196 (11%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP VL +T A  + +DTD +F     ++W  F ++   Q   L     D   G  W 
Sbjct: 145 RLFLPTVLNDTDA-VLYVDTDTLFLAPPDKVWDEFKKMNSSQLAALSPEHEDPNTG--WY 201

Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAI 315
           N     P  G+ G N+GV+L++LT++R+  W      I +++ L   W    DQDI N I
Sbjct: 202 NRFAKHPYYGKLGVNSGVMLMNLTRMREFRWTDHVIPIHKEYKLKITW---GDQDIINII 258

Query: 316 ISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKII------HWNSPKKLKVKNKHM 369
              HP  +Y   C++N +  D+     +C     D  ++       ++SPK+        
Sbjct: 259 FHHHPEKLYVYSCRYNYR-PDHCMYMSVCAEAEKDGALVLHGSRGTFHSPKQPP------ 311

Query: 370 EFFRNLYLTFLEYDGN 385
             FR +Y    EY  N
Sbjct: 312 --FRAVYRAMQEYPLN 325



 Score = 40.8 bits (94), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 38/154 (24%), Positives = 62/154 (40%), Gaps = 17/154 (11%)

Query: 11  TCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQ 70
           T EV    +VC   +     +T++KS L +   PL F ++ D   +       S W +  
Sbjct: 55  TSEVTICVVVCG--DRLHEALTMLKSALVFANRPLQFVILADNNLIPAFSEKLSEWRL-- 110

Query: 71  VEVSFYLADSVVEDVSWIPNKHYSGVYGLLK------LTLPKVLPETLAKTIVLDTDVIF 124
             +S    D +V  +++      +    L K      L LP VL +T A  + +DTD +F
Sbjct: 111 --ISNKTFDFLVRPITFPDGSDVAMWKKLFKPCAAQRLFLPTVLNDTDA-VLYVDTDTLF 167

Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSD----WY 154
                ++W  F ++   Q   L     D    WY
Sbjct: 168 LAPPDKVWDEFKKMNSSQLAALSPEHEDPNTGWY 201


>gi|74147544|dbj|BAE38667.1| unnamed protein product [Mus musculus]
          Length = 392

 Score = 68.6 bits (166), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 56/211 (26%), Positives = 93/211 (44%), Gaps = 30/211 (14%)

Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
           F + +++ +  S     +YS     L + + +++P+ + + I LD D+ + T+I +L+  
Sbjct: 181 FPVVEAMQKYFSAGSGTYYSDSIFFLSVAMHQIMPKEIPRIIQLDLDLKYKTNIRELFEE 240

Query: 232 FSRLRQRQTIGLVENQSDWYLGKLWK-NHK--------PWPALGRGFNTGVILLDLTKLR 282
           F        IG+       Y    W+  H+        P P    GFN+GV+LL+L  +R
Sbjct: 241 FDNFLPGAVIGIAREMQPVYRHTFWQFRHENPKTRVGDPPPEGLPGFNSGVMLLNLEAMR 300

Query: 283 ---------DISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQ 333
                    + +W    + +A+K+        L DQD F  I  EHP L + L C WN Q
Sbjct: 301 QSPLYSHLLEPAWV---QQLADKY---HFRGHLGDQDFFTMIGMEHPELFHVLDCTWNRQ 354

Query: 334 LS----DNTRSD--ELCYTELTDLKIIHWNS 358
           L     D+  SD  +  +     +KI H N 
Sbjct: 355 LCTWWRDHGYSDVFQAYFRCEGHVKIYHGNC 385


>gi|194222714|ref|XP_001499067.2| PREDICTED: uncharacterized protein C3orf21 homolog [Equus caballus]
          Length = 281

 Score = 68.6 bits (166), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 55/208 (26%), Positives = 91/208 (43%), Gaps = 24/208 (11%)

Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
           F + +++ +  S     +YS     L + + +++P+ + + I LD D+ + T+I +L+  
Sbjct: 70  FPVVEAMQKHFSAGSGTYYSDSIFFLSVAMHQIMPKEILRIIQLDLDLKYKTNIRELFEE 129

Query: 232 FSRLRQRQTIGLVENQSDWYLGKLWK-NHK--------PWPALGRGFNTGVILLDLTKLR 282
           F        IG+       Y    W+  H+        P P    GFN+GV+LL+L  +R
Sbjct: 130 FDNFLPGAIIGIAHEMQPVYRHTFWQFRHENPKTRVGSPPPEGLPGFNSGVMLLNLEAMR 189

Query: 283 D------ISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLS- 335
                  +      + +A+K+        L DQD F  I  EHP L + L C WN QL  
Sbjct: 190 QSPLYSRLLEPAQVQQLADKY---HFRGHLGDQDFFTMIGMEHPELFHVLDCTWNRQLCT 246

Query: 336 ---DNTRSD--ELCYTELTDLKIIHWNS 358
              D+  SD  E  +     ++I H N 
Sbjct: 247 WWRDHGYSDVFEAYFRCEGHVRIYHGNC 274


>gi|428179147|gb|EKX48019.1| hypothetical protein GUITHDRAFT_136980 [Guillardia theta CCMP2712]
          Length = 946

 Score = 68.6 bits (166), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 64/247 (25%), Positives = 96/247 (38%), Gaps = 63/247 (25%)

Query: 479 VTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRNIGYHV 538
           VTLV QLS ++   + +L   W GP    + +  +++ +F  + G  E  +S  +I   V
Sbjct: 32  VTLVTQLSWNKFSSLLVLRPRWAGPCVAAVLVPSSQSSEF--HEGVQELQQSSHDITQIV 89

Query: 539 VYKEGNF----YPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSS---------- 584
              E       YPIN +RN+AL    T  VFL D+D LP       L+            
Sbjct: 90  AVWEDELHFERYPINAMRNIALAHAGTDLVFLCDVDQLPSVTFSALLQQEDILMLLWRWC 149

Query: 585 --------IRSMDMHGHGG---------------KKVLVVPAFETQR---YRT-AFPASH 617
                   + ++++   GG               KK+L     E  R   ++T  FP  H
Sbjct: 150 CVERNAIVVPALELKKGGGEAISLEDVLGDYEMAKKLLHEEDLEKTRVIGFQTEKFPTGH 209

Query: 618 APTNFSR-WVNATT--PYQIEWAPDFEPYIVAHRD-----------------LPRYDTRF 657
           A       W    +   Y IE+   FEPY++  R                  +P YD R+
Sbjct: 210 AAERVRELWTACESIEGYMIEYEEGFEPYVIVSRQSFLLFLAPSAEYCSRWMVPAYDERY 269

Query: 658 VGFGWNK 664
            G+G NK
Sbjct: 270 EGYGRNK 276


>gi|344240257|gb|EGV96360.1| Arf-GAP with coiled-coil, ANK repeat and PH domain-containing
           protein 2 [Cricetulus griseus]
          Length = 852

 Score = 68.6 bits (166), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 48/178 (26%), Positives = 80/178 (44%), Gaps = 18/178 (10%)

Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
           F + +++ +  S     +YS     L + + ++LP+ + + I LD D+ + T+I +L+  
Sbjct: 641 FPIVEAMQKHFSAGSGTYYSDSIFFLSVAMHQILPKEILRIIQLDLDLKYKTNIRELFEE 700

Query: 232 FSRLRQRQTIGLVENQSDWYLGKLW--KNHKPWPALGR-------GFNTGVILLDLTKLR 282
           F        IG+       Y    W  ++  P   +G        GFN+GV+LL+L  +R
Sbjct: 701 FDNFLPGAVIGIAREMQPVYRHTFWQFRHENPKTRVGDPPPEGLPGFNSGVMLLNLEAMR 760

Query: 283 D------ISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQL 334
                  +      + +A+K+        L DQD F  I  EHP L + L C WN QL
Sbjct: 761 QSPLYSHLLEPARVQQLADKY---HFRGHLGDQDFFTMIGMEHPELFHVLDCTWNRQL 815


>gi|302836423|ref|XP_002949772.1| hypothetical protein VOLCADRAFT_80793 [Volvox carteri f.
           nagariensis]
 gi|300265131|gb|EFJ49324.1| hypothetical protein VOLCADRAFT_80793 [Volvox carteri f.
           nagariensis]
          Length = 415

 Score = 68.2 bits (165), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 63/250 (25%), Positives = 98/250 (39%), Gaps = 56/250 (22%)

Query: 470 YEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYM-------------SDAEAQ 516
           + + S G  VTLV QLS +RL  ++     W GP++  LY+             S  + +
Sbjct: 70  WSSKSLGIPVTLVTQLSAERLDQLKAQCATWGGPLAAALYVPLHNPSATELLETSRQKLE 129

Query: 517 QFLSYA----------GNSEALRSRRNIGYHVVY--KEGNFYPINTLRNVALNQVSTPYV 564
             +++A           N    + R  + Y +    +    YP+N+LRN A     T  +
Sbjct: 130 AMIAHADELFRNSENNNNGHGCQLRLALIYELFADPRAQILYPVNSLRNYARLMADTDLI 189

Query: 565 FLLDIDFLPMFGLYPYLK--SSIRSMDMHGHGGKKVLVVPAFET---------------- 606
             +D+D +P   +   L    ++         G  V V+PAFET                
Sbjct: 190 ANIDVDMVPSLSISTALAEPQTLAEYTRACRSG-AVYVLPAFETHCGGTSYADSVAVDGK 248

Query: 607 -----------QRYRTAFPASHAPTNFSRWVNATTPYQIEWAPDFEPYIVAHR-DLPRYD 654
                      +R R   P SH  T + RW      Y I + P FEP+ ++ R     YD
Sbjct: 249 KAVIQGIKRCVRRMRPKAPFSHNSTKYERWFATDEAYSITYGPLFEPWFISWRWGTLWYD 308

Query: 655 TRFVGFGWNK 664
            R+ G+G NK
Sbjct: 309 YRYRGYGKNK 318


>gi|353232724|emb|CCD80079.1| putative tata-binding protein-associated phosphoprotein
           [Schistosoma mansoni]
          Length = 116

 Score = 68.2 bits (165), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 35/93 (37%), Positives = 52/93 (55%), Gaps = 7/93 (7%)

Query: 212 TIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNT 271
            IVLDTD++   DI+ LW+LF    + Q IG+   Q+  +   +    + WP LG G+N+
Sbjct: 4   VIVLDTDILLNDDISNLWSLFDEFNKDQCIGIAAEQNPTFYSNM--GQRFWPTLGYGYNS 61

Query: 272 GVILLDLTKLRDISWAGFW-----RIIAEKFLL 299
           G++L+ LTKLR   W   W      +I EK +L
Sbjct: 62  GILLIHLTKLRARGWDEIWMKTAFNLIEEKRIL 94



 Score = 46.6 bits (109), Expect = 0.040,   Method: Compositional matrix adjust.
 Identities = 21/58 (36%), Positives = 32/58 (55%), Gaps = 2/58 (3%)

Query: 115 TIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGF 172
            IVLDTD++   DI+ LW+LF    + Q IG+   Q+  +   +    + WP LG G+
Sbjct: 4   VIVLDTDILLNDDISNLWSLFDEFNKDQCIGIAAEQNPTFYSNM--GQRFWPTLGYGY 59


>gi|428163534|gb|EKX32600.1| hypothetical protein GUITHDRAFT_148516 [Guillardia theta CCMP2712]
          Length = 690

 Score = 68.2 bits (165), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 67/268 (25%), Positives = 119/268 (44%), Gaps = 64/268 (23%)

Query: 458 QMVEINPNNVDEYEASSDGNDVTLVAQLSMDR-LQMVEMLFKHWEGPISLTLYMSDAEAQ 516
           Q  E+ P+      +     +++L +  S+DR +QM+ ++ + W+GP+SL  Y    E +
Sbjct: 155 QSFEVPPSLQCCSSSHEQDLNISLASIGSIDRSIQMLHVMLQ-WDGPVSLVFYARSEEEE 213

Query: 517 QFLSYAGNS----EALRSRRNIGYHVV---YKEGNF-----YPINTLRNVALNQVSTPYV 564
           +F+S          A+RS R +   ++   + +G+      +P+N LR+VA+++  T  V
Sbjct: 214 RFISAFVEETMIPAAVRSCRELNLQLLSDCHDQGSPERVYPFPVNKLRSVAVSRAKTDLV 273

Query: 565 FLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAF-------------------- 604
             +D+DF+P       L+ ++ S       G  +LV+PAF                    
Sbjct: 274 LYIDVDFIPSADAARQLQQAMSSPSFDQAAG--LLVLPAFLTVWSSPWPSPMRVEDEGMQ 331

Query: 605 -------ETQR-------YRTAFP---ASHAPTNFSRW----VNATTPYQIEWAPDFEPY 643
                  E +R       Y    P   + H  T++ +W     N T P+Q+ ++  +EPY
Sbjct: 332 VEPLHLDELKRRCIAGDVYPPEMPFRMSPHGATDYLKWFHMPANRTEPFQVLYSMWYEPY 391

Query: 644 IVAHRDLPR-------YDTRFVGFGWNK 664
            V +R L R       +D RFV  G +K
Sbjct: 392 FVVNRRLWRGINHPGLFDERFVDAGPDK 419


>gi|126338874|ref|XP_001379360.1| PREDICTED: n-acetyllactosaminide
           beta-1,3-N-acetylglucosaminyltransferase-like
           [Monodelphis domestica]
          Length = 441

 Score = 68.2 bits (165), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 67/268 (25%), Positives = 106/268 (39%), Gaps = 90/268 (33%)

Query: 477 NDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQF---LSYAGNS---EALRS 530
           + V L    S+  L  V  L + W GP+++ ++ +  E  Q    L+YA  +   EA+R 
Sbjct: 118 DAVLLATHASLSNLGHVGTLLERWAGPLAVAVFAARPEEAQLATVLTYALGALCTEAVRD 177

Query: 531 RRNIGYHVV------------YKEGNF------------------------------YPI 548
           R  +  H+V             + G F                              YP 
Sbjct: 178 R--LTMHLVCPARTEAAVPDPREPGEFARISSCTDVFTSMGARSRSGRNYALATNVSYPN 235

Query: 549 NTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHG-GKKVLVVPAFETQ 607
           N LRNVA       +  ++D+D +P  GL+P     +R +   G G   + LVVPAFE +
Sbjct: 236 NLLRNVAREGAG--HALVVDVDMVPSEGLWP----GLRDLLAQGEGKAGRALVVPAFEIR 289

Query: 608 R-----------------------YRTAFPASHAPTNFSRWVNATTP--------YQIEW 636
           +                       Y    P   APTN++RW+  T P        Y++ W
Sbjct: 290 QGVRVPRTKIELMRFLKMGHVRPFYAELCPRCQAPTNYTRWL--TLPPGGPLRPAYRVTW 347

Query: 637 APDFEPYIVAHRDLPRYDTRFVGFGWNK 664
              +EP+ VA   +P ++ RF  +G+N+
Sbjct: 348 QDPWEPFYVAGGAVPSFNERFRQYGFNR 375


>gi|307104338|gb|EFN52592.1| hypothetical protein CHLNCDRAFT_138618 [Chlorella variabilis]
          Length = 308

 Score = 67.8 bits (164), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 49/162 (30%), Positives = 67/162 (41%), Gaps = 34/162 (20%)

Query: 537 HVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGK 596
           H      + YP N +RN AL    T  V LLD DF+P   +   +     S    G   +
Sbjct: 68  HCDAHSASLYPANAVRNRALRLAETEAVILLDADFVPSASMAARMSDQGYSAATLGLLQQ 127

Query: 597 KV-LVVPAFET------------QRYRTA--------------------FPASHAPTNFS 623
           ++ LV+PAF+T            Q  R+                     F   H  T+F 
Sbjct: 128 RLALVLPAFQTVGGGEEDAELALQVARSGKAALLEQCEVGELESFHSGHFSKGHRATDFG 187

Query: 624 RWVNATTPYQIEWAPDFEPYIVAHRD-LPRYDTRFVGFGWNK 664
           RW  A   Y++E+   FEPYI+ HR  +P YD R  G+  NK
Sbjct: 188 RWATAANVYEVEFQTGFEPYILVHRKHVPWYDERCRGYFVNK 229


>gi|66535277|ref|XP_624895.1| PREDICTED: glucoside xylosyltransferase 1-like [Apis mellifera]
          Length = 362

 Score = 67.8 bits (164), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 43/136 (31%), Positives = 67/136 (49%), Gaps = 8/136 (5%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP VL  T+   + +DTD +F     ++W  F ++   Q   L     D   G  W 
Sbjct: 142 RLFLPSVL-NTIDAVLYVDTDTLFLAPPEKIWDEFKKMNSSQLAALSPEHEDPNTG--WY 198

Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAI 315
           N     P  G+ G N+GV+L++LT++R+  W  +   I +++ L   W    DQDI N I
Sbjct: 199 NRFAKHPYYGKLGVNSGVMLMNLTRMREFRWIEYVIPIHKQYKLKITW---GDQDIINII 255

Query: 316 ISEHPYLVYTLPCQWN 331
              HP  +Y   C++N
Sbjct: 256 FHYHPEKLYVYSCRYN 271


>gi|115496384|ref|NP_001069467.1| uncharacterized protein C3orf21 homolog [Bos taurus]
 gi|112180740|gb|AAI14705.2| C1H3ORF21 protein [Bos taurus]
          Length = 393

 Score = 67.8 bits (164), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 56/208 (26%), Positives = 91/208 (43%), Gaps = 24/208 (11%)

Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
           F + +++ +  S     +YS     L + + +++P+ + + I LD D+ + T+I +L+  
Sbjct: 182 FPVVEAMQKHFSAGSGTYYSDSIFFLSVAMHQIMPKEILRIIQLDLDLKYKTNIRELFEE 241

Query: 232 FSRLRQRQTIGLVENQSDWYLGKLWK----NHK-----PWPALGRGFNTGVILLDLTKLR 282
           F        IG+       Y    W+    N K     P P    GFN+GV+LL+L  +R
Sbjct: 242 FDNFLPGAIIGIAREMQPVYRHTFWQFRHDNPKTRVGSPPPEGLPGFNSGVMLLNLEAMR 301

Query: 283 D------ISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLS- 335
                  +      + +A+K+        L DQD F  I  EHP L + L C WN QL  
Sbjct: 302 QSPLYDRLLEPAQVQQLADKY---HFRGHLGDQDFFTMIGMEHPELFHVLDCTWNRQLCT 358

Query: 336 ---DNTRSD--ELCYTELTDLKIIHWNS 358
              D+  SD  +  +     +KI H N 
Sbjct: 359 WWRDHGYSDVFDAYFRCEGHVKIYHGNC 386


>gi|332022944|gb|EGI63210.1| Glycosyltransferase 8 domain-containing protein 3 [Acromyrmex
           echinatior]
          Length = 367

 Score = 67.8 bits (164), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 44/136 (32%), Positives = 67/136 (49%), Gaps = 8/136 (5%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP VL +T A  + +DTD +F     ++W  F ++   Q   L     D   G  W 
Sbjct: 144 RLFLPTVLNDTDA-VLYVDTDTLFLAPPEKVWDNFKKMNSSQLAALSPEHEDPNTG--WY 200

Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAI 315
           N     P  G+ G N+GV+L++LT++R   W  +   I +++ L   W    DQDI N I
Sbjct: 201 NRFAKHPYYGKLGVNSGVMLMNLTRMRKFRWTDYVIPIHKEYKLKITW---GDQDIINII 257

Query: 316 ISEHPYLVYTLPCQWN 331
              HP  +Y   C++N
Sbjct: 258 FHYHPEKLYVYSCRYN 273



 Score = 43.5 bits (101), Expect = 0.40,   Method: Compositional matrix adjust.
 Identities = 38/163 (23%), Positives = 70/163 (42%), Gaps = 18/163 (11%)

Query: 3   SDEEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTL 62
           S + P  P+   + + +V  G +     +T++KS L +   PL F ++T+   +      
Sbjct: 45  SKKSPSTPSTSEVTICVVVCG-DRLHEALTMLKSALVFTSRPLQFIILTEQNLIPAFSEK 103

Query: 63  FSTWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLL-------KLTLPKVLPETLAKT 115
            S W +    +S    D +V  +++ P+     ++  L       +L LP VL +T A  
Sbjct: 104 LSEWRL----ISNKTFDFLVRPITF-PDGSDVAMWKKLFKPCAAQRLFLPTVLNDTDA-V 157

Query: 116 IVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSD----WY 154
           + +DTD +F     ++W  F ++   Q   L     D    WY
Sbjct: 158 LYVDTDTLFLAPPEKVWDNFKKMNSSQLAALSPEHEDPNTGWY 200


>gi|194756854|ref|XP_001960685.1| GF11373 [Drosophila ananassae]
 gi|190621983|gb|EDV37507.1| GF11373 [Drosophila ananassae]
          Length = 377

 Score = 67.8 bits (164), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 64/278 (23%), Positives = 108/278 (38%), Gaps = 87/278 (31%)

Query: 468 DEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEA----QQFLSYAG 523
           ++Y  +S+ N V L  Q S++RL  +  +  +W+G +S+ L+ +  E     Q F++Y  
Sbjct: 29  EQYADTSEANYVCLATQTSVERLNSLPQVSANWQGKMSVALFAAGHEEFVVLQYFVTYMR 88

Query: 524 NSEALRSRRNIGYHVV-------------------------------------------- 539
               +  R N  +H++                                            
Sbjct: 89  LC-FVNIRENATFHILVPRDYDKLPRVEAVPYNMKGKFDCQYPDRTMKALLKFRTLKTLQ 147

Query: 540 YKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVL 599
           +++ N YP N +RN+A     T YVFL DID +P     P L    R+ +          
Sbjct: 148 WRQRNTYPQNHMRNLARKGCQTKYVFLTDIDIVPSTNSVPQLNHFFRTANCSKLCA---Y 204

Query: 600 VVPAFETQRYRTAFPASHA------------------------PTNFSRWVNATTPYQIE 635
           V+P FE    R  FP S +                         TNFS+W++  T  + E
Sbjct: 205 VIPTFEID-IRAMFPRSKSGLLRLIKKGLARPFHEKVFIYNQYATNFSKWLSTNTN-ETE 262

Query: 636 WAPD---------FEPYIVAHRDLPRYDTRFVGFGWNK 664
            +           +EP+ VA  ++P +D RF+G+G+ +
Sbjct: 263 VSVSHIVTNFEFLYEPFYVAVDNVPAHDERFLGYGFTR 300


>gi|300120705|emb|CBK20259.2| unnamed protein product [Blastocystis hominis]
          Length = 322

 Score = 67.8 bits (164), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 61/233 (26%), Positives = 94/233 (40%), Gaps = 46/233 (19%)

Query: 473 SSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRR 532
           +S  +DVT  +Q +  R     +L + W GP+S+TL +   E          ++   SR 
Sbjct: 10  TSQVHDVTFTSQGTTVRYYYFPILLQRWTGPLSITLCLKPEEVSDINDQLA-ADNFDSRL 68

Query: 533 NIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHG 592
           +I  +V    G  YPIN LRN+A+  V T + +L D+D  P  GLY   ++ +R  +   
Sbjct: 69  SIVQYVATNPG-IYPINKLRNLAIRNVKTDHFWLTDLDMWPSQGLY---EAILRLPNSAL 124

Query: 593 HGGKKVLVVPAFETQRYRTA-------------FPASHAPTN------------------ 621
                 ++VPAFE      +             FP +    N                  
Sbjct: 125 QDEVLAIIVPAFEIHASSCSTFQACTDLPILPQFPNTKRELNRCISTRRCSTFRPWDQLH 184

Query: 622 ---FSRWVNAT-----TPYQIEWAPDFEPYIVAHRD--LPRYDTRFVGFGWNK 664
              F  W   +     TP +       EPY+V  +   LP +D RFV + +NK
Sbjct: 185 DYHFPEWYRESYTAELTPVRCFKGNTQEPYVVVKKSPFLPSFDERFVNYAYNK 237


>gi|256072700|ref|XP_002572672.1| glycosyltransferase-related [Schistosoma mansoni]
          Length = 165

 Score = 67.4 bits (163), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 44/137 (32%), Positives = 68/137 (49%), Gaps = 18/137 (13%)

Query: 266 GRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYT 325
           G G+N+GV+L+DL KLR + W   W    +  + T  + S  +QD+ N I+  +  + Y 
Sbjct: 20  GYGYNSGVVLMDLAKLRSLPWNHLWMSAIKFVMKTAGYLSAGEQDVINLIMLRNNEIFYE 79

Query: 326 LPCQWNVQLSDNTRSDELCYTELT--------DL------KIIHWN---SPKKLKVKNKH 368
           +PC+WN+QLS    +     + LT        DL      KI+H N    P+ L +KN  
Sbjct: 80  IPCEWNIQLSSGVDTHRCPVSWLTENSLLNSGDLIYGKQPKIVHLNHQVKPEDLDLKNI- 138

Query: 369 MEFFRNLYLTFLEYDGN 385
           +    N+ LT    +GN
Sbjct: 139 LNVDTNVSLTTYNGNGN 155


>gi|395330127|gb|EJF62511.1| hypothetical protein DICSQDRAFT_126280 [Dichomitus squalens
           LYAD-421 SS1]
          Length = 363

 Score = 67.4 bits (163), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 41/129 (31%), Positives = 64/129 (49%), Gaps = 7/129 (5%)

Query: 25  NSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQ--VEVSFYL--ADS 80
           +S      LIKSI+ Y  +PL FH+I D  A   L+      + PQ  + V FY    +S
Sbjct: 5   DSAAEGSILIKSIIMYSSSPLAFHIICDHDARQYLERRLRLLTHPQHDISVQFYRIPHES 64

Query: 81  VVEDV---SWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSR 137
           +V  +     +   H +GV GL+KL + ++LP ++ + I +DTD  F  D  QLW  F  
Sbjct: 65  MVARIQREGALHTDHSAGVPGLMKLFIHEILPPSVERAIFVDTDAFFIADPTQLWDRFDA 124

Query: 138 LRQRQTIGL 146
            +    + +
Sbjct: 125 FKPGAAVSM 133



 Score = 49.3 bits (116), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 30/96 (31%), Positives = 48/96 (50%), Gaps = 9/96 (9%)

Query: 189 HYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQS 248
           H +GV GL+KL + ++LP ++ + I +DTD  F  D  QLW  F   +    + +     
Sbjct: 79  HSAGVPGLMKLFIHEILPPSVERAIFVDTDAFFIADPTQLWDRFDAFKPGAAVSM-PYHP 137

Query: 249 DWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDI 284
           D Y  +       W    R   + V+LLDL +LR++
Sbjct: 138 DQYAPE-------WHHANR-ICSCVMLLDLKRLREL 165


>gi|198431243|ref|XP_002124020.1| PREDICTED: similar to glycosyltransferase-like 1B [Ciona
           intestinalis]
          Length = 406

 Score = 67.4 bits (163), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 61/259 (23%), Positives = 104/259 (40%), Gaps = 75/259 (28%)

Query: 475 DGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRS---- 530
           D  DVTLV   ++DRL  +  + + W GPIS+ +Y   A  Q    +    + LR     
Sbjct: 78  DPADVTLVMHGTVDRLYRLIEIAEVWSGPISMAVY---APMQDASFFDDAIDGLRLCWAT 134

Query: 531 -RRNIGYHVVY---------KEGNF-----------------------YPINTLRNVALN 557
            R+++ +H+V+         + G+F                       YP N LRNV  +
Sbjct: 135 LRKSVSFHLVHHVNSPANVTQLGSFVYLSCKDIIRRLRLVRPNAGRSQYPHNLLRNVGRS 194

Query: 558 QVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMH----GHGGKKVLVVPAFETQRY---- 609
              T +V  +D D  P   L    K  ++  ++         K + V+P FE +++    
Sbjct: 195 GALTNFVLHIDADLAPSRSLRGKFKELLKLHNLESGIAARSSKLLFVLPVFELRQFQNPV 254

Query: 610 ------------RTAFP-------ASHAPTNFSRWV--------NATTPYQIEWAPDFEP 642
                       R A P         H PT++ RW+        ++   Y++++  ++ P
Sbjct: 255 RTKQELAEAIKERNAIPFLQETCVPCHRPTDYDRWLQSLKSDSNDSEMSYKVDYVKEWTP 314

Query: 643 YIVAHRDLPRYDTRFVGFG 661
           + +  +  P YD RFV FG
Sbjct: 315 FFITSKHSPLYDDRFVAFG 333


>gi|302689521|ref|XP_003034440.1| glycosyltransferase family 8 protein [Schizophyllum commune H4-8]
 gi|300108135|gb|EFI99537.1| glycosyltransferase family 8 protein [Schizophyllum commune H4-8]
          Length = 427

 Score = 67.4 bits (163), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 45/146 (30%), Positives = 65/146 (44%), Gaps = 8/146 (5%)

Query: 8   VIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWS 67
           + P  E +  A +     S      LIKSIL Y  NP   H++        L++  S  +
Sbjct: 58  LAPALEPVVFAFMIWSRGSAEEGAQLIKSILLYNSNPTEIHIVCGEDERVYLESRLSLVT 117

Query: 68  VPQ--VEVSFYLAD-----SVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDT 120
            PQ  V V FY          +E    I   H +G  GL+KL L +++P T+ K I +DT
Sbjct: 118 SPQHDVRVVFYPVTLQEMLDRIEREGAIKTDHEAGTPGLMKLFLHEIMP-TVKKAIYVDT 176

Query: 121 DVIFATDIAQLWALFSRLRQRQTIGL 146
           D +F +D   LW +F   R    I +
Sbjct: 177 DALFISDPTILWRIFDTFRPSTAIAM 202



 Score = 48.1 bits (113), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 31/106 (29%), Positives = 49/106 (46%), Gaps = 10/106 (9%)

Query: 179 VEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQR 238
           +E    I   H +G  GL+KL L +++P T+ K I +DTD +F +D   LW +F   R  
Sbjct: 139 IEREGAIKTDHEAGTPGLMKLFLHEIMP-TVKKAIYVDTDALFISDPTILWRIFDTFRPS 197

Query: 239 QTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDI 284
             I +  +     +   W              + ++LLD  KLRD+
Sbjct: 198 TAIAMGYHPEQ--MAAAWH-------YASRICSCIMLLDFEKLRDM 234


>gi|443696721|gb|ELT97358.1| hypothetical protein CAPTEDRAFT_147975 [Capitella teleta]
          Length = 388

 Score = 67.4 bits (163), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 66/270 (24%), Positives = 109/270 (40%), Gaps = 76/270 (28%)

Query: 468 DEYEASSDGN----DVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEA-------- 515
           D ++AS +      DVTL++  S+  ++ +  L   W GP+S+++++  +E         
Sbjct: 55  DVFQASGETKTRILDVTLISHCSVHLIKHIPALVNAWHGPMSISIFLVSSELHTAVDMLL 114

Query: 516 ---------QQF--------LSYAG-----------NSEALRS---RRNIGYHVVYKEGN 544
                    Q+F        LS+             N + + S   +R +G      +G 
Sbjct: 115 IIHQCVPLFQKFVTVHFVLPLSHQKLTISKSSLLPINCDEMNSDLQKRVLGSRNYALDGV 174

Query: 545 FYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAF 604
            YP N LRNVAL    T Y  ++D+D LP   L     + ++S        +   V+PAF
Sbjct: 175 LYPNNLLRNVALRNALTEYHLVIDVDMLPSPNLRQLFTTFVKS--QFAPNNRTAFVIPAF 232

Query: 605 ETQRYRTAFP------------ASHAP------------TNFSRW-----VNATTP-YQI 634
           E     T FP             S  P            T++  W      +  +P Y +
Sbjct: 233 EMVTGGT-FPQTKQHLLDLWDQGSIQPFYLKACWKCQRFTDYDSWRKLPDSDGLSPGYDV 291

Query: 635 EWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           +W   +EP+ +A    PRYD RF  +G+N+
Sbjct: 292 DWKDPWEPFFIASWQAPRYDERFKQYGFNR 321


>gi|241608406|ref|XP_002405973.1| hypothetical protein IscW_ISCW019642 [Ixodes scapularis]
 gi|215500729|gb|EEC10223.1| hypothetical protein IscW_ISCW019642 [Ixodes scapularis]
          Length = 345

 Score = 67.4 bits (163), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 43/134 (32%), Positives = 64/134 (47%), Gaps = 12/134 (8%)

Query: 204 VLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSD----WYLGKLWKNH 259
           VLPE  A  + +D DVIF      LW+LF  +   Q   LV    D    WY  + +  H
Sbjct: 113 VLPEEDA-VLYVDADVIFVHPPEDLWSLFGAMNDSQLAALVPESEDYATNWY--RRFARH 169

Query: 260 KPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEH 319
             +  LG   N+GV+L++LT++R   W      +  K+     W    DQD+ N +  +H
Sbjct: 170 PFFEPLG--VNSGVMLMNLTRMRAFGWESLMGPLLTKYERNISW---GDQDLINIVFHDH 224

Query: 320 PYLVYTLPCQWNVQ 333
           P  +  L C+WN +
Sbjct: 225 PDKLLVLSCKWNFR 238


>gi|241672118|ref|XP_002411447.1| N-acetyllactosaminide beta-1,3-N-acetylglucosaminyltransferase,
           putative [Ixodes scapularis]
 gi|215504098|gb|EEC13592.1| N-acetyllactosaminide beta-1,3-N-acetylglucosaminyltransferase,
           putative [Ixodes scapularis]
          Length = 398

 Score = 67.4 bits (163), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 72/270 (26%), Positives = 108/270 (40%), Gaps = 87/270 (32%)

Query: 476 GNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQ----FLSYAGN-SEALRS 530
           G DV L  Q S+DRL  +  L + W GP+SL ++++++   Q    +LS   + SEA+++
Sbjct: 67  GGDVCLATQGSLDRLSWLVDLVQLWNGPVSLAVFVANSAQFQAVKVYLSLIRSCSEAIKA 126

Query: 531 R---------------------------------------RNIGYHVVYK---EGNFYPI 548
                                                   +N+G   V K   +   YP 
Sbjct: 127 NMRVDLVYPADENLTRKTVWSDELGSDFSCGTHAVLLKVLQNVGKTKVVKGSAKQMLYPQ 186

Query: 549 NTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGH---GGKKVLVVPAFE 605
           N LRNVA +  +  Y FL DID +P  GL+  L       D  G      K V VVP +E
Sbjct: 187 NHLRNVARDGCAQKYFFLTDIDIMPKPGLWEELS------DFFGRTLPCTKCVFVVPTYE 240

Query: 606 ---------TQR--------------YRTAFPASHAPTNFSRWVNAT-------TPYQIE 635
                    T+R              ++ +F  +   TN   W             Y ++
Sbjct: 241 MSETAAVPRTKRELLHLVRRKEARPFHQKSFIHNQYATNHGMWEGLKQRGDGLRAAYSVK 300

Query: 636 WAPDF-EPYIVAHRDLPRYDTRFVGFGWNK 664
               F EP+ VA + +PRYD RFVG+G+ +
Sbjct: 301 RYEFFYEPFYVAAKVVPRYDERFVGYGFTR 330


>gi|383864121|ref|XP_003707528.1| PREDICTED: glucoside xylosyltransferase 1-like [Megachile
           rotundata]
          Length = 367

 Score = 67.0 bits (162), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 45/139 (32%), Positives = 69/139 (49%), Gaps = 14/139 (10%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSD----WYLG 253
           +L LP VL ET A  + +DTD +F      +W  F ++   Q   L     D    WY  
Sbjct: 144 RLFLPTVLNETDA-VLYMDTDTLFLAPPETIWDEFKKMNSSQLAALSPEHEDPNTAWY-N 201

Query: 254 KLWKNHKPWPALGR-GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIF 312
           +  K+    P  G+ G N+GV+L++LT++R+  W  +   I +++ L   W    DQDI 
Sbjct: 202 RFAKH----PYYGKLGVNSGVMLMNLTRMREFRWIEYVIPIHKEYKLKITW---GDQDII 254

Query: 313 NAIISEHPYLVYTLPCQWN 331
           N I   HP  +Y   C++N
Sbjct: 255 NIIFHYHPEKLYVYSCRYN 273



 Score = 44.3 bits (103), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 40/156 (25%), Positives = 68/156 (43%), Gaps = 18/156 (11%)

Query: 10  PTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVP 69
           P  E + + +V  G     SL T++KS L + + PL F ++ +T  ++        W + 
Sbjct: 52  PIVEEVTICVVVCGDRLDESL-TMLKSALVFTRRPLQFIILAETDLISAFNEKLMEWKL- 109

Query: 70  QVEVSFYLADSVVEDVSWIPNKHYSGVYGLL-------KLTLPKVLPETLAKTIVLDTDV 122
              +S    D VV+ +++ P    + ++  L       +L LP VL ET A  + +DTD 
Sbjct: 110 ---MSNKTFDFVVKPITF-PEGSDAAMWKKLFKPCAAQRLFLPTVLNETDA-VLYMDTDT 164

Query: 123 IFATDIAQLWALFSRLRQRQTIGLVENQSD----WY 154
           +F      +W  F ++   Q   L     D    WY
Sbjct: 165 LFLAPPETIWDEFKKMNSSQLAALSPEHEDPNTAWY 200


>gi|426217674|ref|XP_004003078.1| PREDICTED: xyloside xylosyltransferase 1 [Ovis aries]
          Length = 393

 Score = 67.0 bits (162), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 56/208 (26%), Positives = 91/208 (43%), Gaps = 24/208 (11%)

Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
           F + +++ +  S     +YS     L + + +++P+ + + I LD D+ + T+I +L+  
Sbjct: 182 FPVVEAMQKHFSAGSGTYYSDSIFFLSVAMHQIMPKEILRIIQLDLDLKYKTNIRELFEE 241

Query: 232 FSRLRQRQTIGLVENQSDWYLGKLWK----NHK-----PWPALGRGFNTGVILLDLTKLR 282
           F        IG+       Y    W+    N K     P P    GFN+GV+LL+L  +R
Sbjct: 242 FDNFLPGAIIGIAREMQPVYRHTFWQFRHDNPKTRVGSPPPEGLPGFNSGVMLLNLEAMR 301

Query: 283 D------ISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLS- 335
                  +      + +A+K+        L DQD F  I  EHP L + L C WN QL  
Sbjct: 302 QSPLYDRLLEPAQVQQLADKY---HFRGHLGDQDFFTMIGMEHPELFHVLDCTWNRQLCT 358

Query: 336 ---DNTRSD--ELCYTELTDLKIIHWNS 358
              D+  SD  +  +     +KI H N 
Sbjct: 359 WWRDHGYSDVFDAYFRCEGHVKIYHGNC 386


>gi|440895294|gb|ELR47525.1| hypothetical protein M91_05743, partial [Bos grunniens mutus]
          Length = 226

 Score = 67.0 bits (162), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 56/208 (26%), Positives = 91/208 (43%), Gaps = 24/208 (11%)

Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
           F + +++ +  S     +YS     L + + +++P+ + + I LD D+ + T+I +L+  
Sbjct: 15  FPVVEAMQKHFSAGSGTYYSDSIFFLSVAMHQIMPKEILRIIQLDLDLKYKTNIRELFEE 74

Query: 232 FSRLRQRQTIGLVENQSDWYLGKLWK----NHK-----PWPALGRGFNTGVILLDLTKLR 282
           F        IG+       Y    W+    N K     P P    GFN+GV+LL+L  +R
Sbjct: 75  FDNFLPGAIIGIAREMQPVYRHTFWQFRHDNPKTRVGSPPPEGLPGFNSGVMLLNLEAMR 134

Query: 283 D------ISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLS- 335
                  +      + +A+K+        L DQD F  I  EHP L + L C WN QL  
Sbjct: 135 QSPLYDRLLEPAQVQQLADKY---HFRGHLGDQDFFTMIGMEHPELFHVLDCTWNRQLCT 191

Query: 336 ---DNTRSD--ELCYTELTDLKIIHWNS 358
              D+  SD  +  +     +KI H N 
Sbjct: 192 WWRDHGYSDVFDAYFRCEGHVKIYHGNC 219


>gi|242006902|ref|XP_002424281.1| conserved hypothetical protein [Pediculus humanus corporis]
 gi|212507681|gb|EEB11543.1| conserved hypothetical protein [Pediculus humanus corporis]
          Length = 334

 Score = 67.0 bits (162), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 46/156 (29%), Positives = 73/156 (46%), Gaps = 13/156 (8%)

Query: 205 LPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSD----WYLGKLWKNHK 260
           LP  +   + LDTD +    + Q+WA F  +   Q   L     D    WY    +  H 
Sbjct: 115 LPVNVDSVLYLDTDTLILGPVEQIWAHFKLMNSSQFAALAPEHEDPLTGWY--NRFARHP 172

Query: 261 PWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHP 320
            +  LG   NTGV+L++LT++R   W  +   I +++ L   W    DQDI N I   HP
Sbjct: 173 FYKPLG--VNTGVMLMNLTRMRKFEWERYLLPIYKQYKLKIPW---GDQDIINIIFHFHP 227

Query: 321 YLVYTLPCQWNVQLSDNTRSDELCYT-ELTDLKIIH 355
             +Y   C++N + +D+     +C T E   + ++H
Sbjct: 228 DKLYIYSCKYNYR-ADHCMYTSVCKTAEKEGIVVLH 262



 Score = 45.1 bits (105), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 38/148 (25%), Positives = 62/148 (41%), Gaps = 11/148 (7%)

Query: 13  EVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQVE 72
           E + + +V  G     +LV L+KS L + K  L F++ TD   +N+ Q   + W     +
Sbjct: 23  ETMSIVVVVCGDRWKETLV-LLKSALLFTKGHLQFYIFTDLELVNVFQRELNIWK-KDFK 80

Query: 73  VSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPK--VLPETLAKTIVLDTDVIFATDIAQ 130
            +F   D  + +V +   +  S    L K    +   LP  +   + LDTD +    + Q
Sbjct: 81  KNF---DYKILEVKFPSGEFESSWRKLFKPCSSQRLFLPVNVDSVLYLDTDTLILGPVEQ 137

Query: 131 LWALFSRLRQRQTIGLVENQSD----WY 154
           +WA F  +   Q   L     D    WY
Sbjct: 138 IWAHFKLMNSSQFAALAPEHEDPLTGWY 165


>gi|20130341|ref|NP_611886.1| CG3253, isoform A [Drosophila melanogaster]
 gi|386768550|ref|NP_001246488.1| CG3253, isoform B [Drosophila melanogaster]
 gi|7291745|gb|AAF47166.1| CG3253, isoform A [Drosophila melanogaster]
 gi|15010436|gb|AAK77266.1| GH04269p [Drosophila melanogaster]
 gi|220956256|gb|ACL90671.1| CG3253-PA [synthetic construct]
 gi|383302675|gb|AFH08241.1| CG3253, isoform B [Drosophila melanogaster]
          Length = 389

 Score = 66.6 bits (161), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 65/278 (23%), Positives = 107/278 (38%), Gaps = 87/278 (31%)

Query: 468 DEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEA----QQFLSYAG 523
           ++Y  SS+G+ V L  Q S++RL  +  +  +W+G +S+ L+ +  E     Q F++Y  
Sbjct: 41  EQYVDSSEGSLVCLATQTSVERLNSLPQVAANWQGKMSVALFAAGPEEFVVLQYFVTYMR 100

Query: 524 NSEALRSRRNIGYHVV-------------------------------------------- 539
              A   R N  +H++                                            
Sbjct: 101 LCFA-NIRENATFHLLTPRDFDKLPRVAALPLNMRGKFDCQYPDRTLKALLKFRSLKTLQ 159

Query: 540 YKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVL 599
           +++ N YP N +RN+A     T YVFL DID +P     P L    R+ +          
Sbjct: 160 WRQRNTYPQNHMRNLARKGCQTKYVFLTDIDIVPSTNSVPQLNHFFRTANCTKSCA---Y 216

Query: 600 VVPAFETQRYRTAFPAS------------------------HAPTNFSRWVNATTPYQIE 635
           V+P FE    R  FP S                           TNFS+W++  T  + E
Sbjct: 217 VIPTFEID-VRATFPRSKNALVRLIRKGLARPFHEKVFIYNQYATNFSKWLSPNTN-ETE 274

Query: 636 WAPD---------FEPYIVAHRDLPRYDTRFVGFGWNK 664
            +           +EP+ +A  + P +D RF+G+G+ +
Sbjct: 275 VSVSHVVTNFEFLYEPFYIAVDNAPAHDERFLGYGFTR 312


>gi|442754659|gb|JAA69489.1| Putative gt8 like 2 represents a subfamily protein of gt8 with
           unknown function [Ixodes ricinus]
          Length = 318

 Score = 66.6 bits (161), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 48/170 (28%), Positives = 84/170 (49%), Gaps = 11/170 (6%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +LP+  A  + LD D++    +  LW +F  +     I       D      ++
Sbjct: 87  RLFLPDMLPKEEA-VLYLDADILLLRPVEDLWRVFHSMGPSHLIAQSYEIEDMDTNP-YR 144

Query: 258 NHKPWPALGR-GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAII 316
           ++   P +   G N GV+ ++LT++R+ +W      + E++     W   ADQDI N I 
Sbjct: 145 HYGSVPYVKPYGVNAGVMPMNLTRMRNFAWGSRMEPLLEQYGSRLKW---ADQDIVNVIF 201

Query: 317 SEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWNSPKKLKVKN 366
           +E+P  ++ +PC WN +  DN    E C  E+  L  +H N  ++L +K+
Sbjct: 202 NENPDKLFLMPCTWNFR-RDNCEFHESCKGEVPGL--LHAN--QRLFIKD 246


>gi|395528648|ref|XP_003766439.1| PREDICTED: xyloside xylosyltransferase 1 [Sarcophilus harrisii]
          Length = 368

 Score = 66.6 bits (161), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 47/172 (27%), Positives = 74/172 (43%), Gaps = 26/172 (15%)

Query: 187 NKHYSGVYG--------LLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQR 238
            KH+S   G         L + + +++P+ +++ I LD D+   T+I +L+  F      
Sbjct: 164 QKHFSAGSGTYYGDSIFFLSVAMHQIMPKEISQIIQLDLDLKHKTNIRELFEEFDNFPPG 223

Query: 239 QTIGLVENQSDWYLGKLWKNHK---------PWPALGRGFNTGVILLDLTKLRDISWAGF 289
             IG+       Y    W+  +         P P    GFN+GV+LL+L  +R    +  
Sbjct: 224 AVIGIAREMQPVYRHTFWQFRRENPQTRVGDPPPDGLPGFNSGVMLLNLEAMRQ---SAL 280

Query: 290 WRIIAEKFLLT------RLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLS 335
           +R + E   L       R    L DQD F  +  E+P L + L C WN QL 
Sbjct: 281 YRRLLEPATLQQLTEKYRFQGHLGDQDFFTVVGMEYPELFHVLDCTWNRQLC 332


>gi|330846633|ref|XP_003295120.1| hypothetical protein DICPUDRAFT_51901 [Dictyostelium purpureum]
 gi|325074251|gb|EGC28356.1| hypothetical protein DICPUDRAFT_51901 [Dictyostelium purpureum]
          Length = 387

 Score = 66.6 bits (161), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 32/107 (29%), Positives = 63/107 (58%), Gaps = 4/107 (3%)

Query: 479 VTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMS--DAEAQQFLSYAGNSEALRSRRNIGY 536
           V++V Q ++DRL  V M+ + W+ P+S+++++     +  +      N  +L    ++  
Sbjct: 12  VSIVTQTTVDRLHKVSMMAEKWKAPMSVSVFIKKPSKDIPELKKKLQNYPSLVYYADL-- 69

Query: 537 HVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKS 583
           H+++     YP+N LRN+A+    T +VF++D DFLP  GL+ Y+++
Sbjct: 70  HLLFANNTRYPVNNLRNLAIENSRTDFVFIMDADFLPPLGLHDYIQN 116



 Score = 52.8 bits (125), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 24/50 (48%), Positives = 35/50 (70%), Gaps = 3/50 (6%)

Query: 617 HAPTNFSRWVNATTPYQIEWAPDFEPYIVAHR--DLPRYDTRFVGFGWNK 664
           H+PTNF +W+ A  PY+IE+   FEPY+V +R   LP +D R  G+G++K
Sbjct: 257 HSPTNFPKWIEAKEPYEIEYKWIFEPYLVFNRSESLP-FDERLKGYGFDK 305


>gi|68362842|ref|XP_708461.1| PREDICTED: n-acetyllactosaminide
           beta-1,3-N-acetylglucosaminyltransferase-like isoform 2
           [Danio rerio]
 gi|444246614|gb|AGD94656.1| beta-1,3-N-acetylglycosaminyltransferase 1 [Danio rerio]
          Length = 431

 Score = 66.6 bits (161), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 45/151 (29%), Positives = 65/151 (43%), Gaps = 36/151 (23%)

Query: 546 YPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFE 605
           YP N LRNVA       Y+ ++DID +P   L+    + +   +       +VLV+PAFE
Sbjct: 219 YPNNLLRNVARGGTDAAYILVIDIDMIPSANLHHQFVTMLMKREP---AADEVLVLPAFE 275

Query: 606 TQRYRTAFPAS------------------------HAPTNFSRWVNATTP--------YQ 633
            +  R   PAS                         APTN+S WVN  +         Y 
Sbjct: 276 IRHIRK-MPASKPELVQLYQVGEVRPFYDELCSRCQAPTNYSLWVNLASKSSGPLEVSYT 334

Query: 634 IEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           I W   +EP+ +  R +P YD  F  +G+N+
Sbjct: 335 INWVDPWEPFYIGARSVPLYDESFRQYGFNR 365


>gi|328855389|gb|EGG04516.1| family 49 glycosyltransferase [Melampsora larici-populina 98AG31]
          Length = 544

 Score = 66.6 bits (161), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 64/243 (26%), Positives = 101/243 (41%), Gaps = 52/243 (21%)

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY 521
           I P++  E+    D  D+T+   ++ +R ++   L +H++GPIS T+++S+         
Sbjct: 221 IGPHSDGEF----DQKDITITTLVTSNRFKVFAKLVEHYQGPISATIHVSNVPETLEPLL 276

Query: 522 AGNSEALRSRRN----IGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGL 577
           A      RS       +  H+V    +    N  RNVA     T YV +LD+DF      
Sbjct: 277 ASLETMYRSSPAMSTWVDIHLVVDSFD-RQFNMWRNVAKWFSRTSYVMMLDVDFWICTDF 335

Query: 578 Y------PYLKSSIRSMDMHGHGGKKVLVVPAFETQRYR-----TAFP------------ 614
                  PY  S + S       G+  LV+PAFE  +       T FP            
Sbjct: 336 RTRVLSNPYYLSLLES-------GQAALVIPAFEFTKQADGVNPTQFPNQKSTLKGLFQD 388

Query: 615 -----------ASHAPTNFSRWVNATTPYQ-IEWAPDFEPYIVAHRD-LPRYDTRFVGFG 661
                        H PT++ ++      YQ I +   +EPY++  +D  P  D RF+G+G
Sbjct: 389 GKIDMFHKSWIPGHGPTDYKQYFQTDQSYQVISYTHSYEPYVIYKKDGTPFCDERFIGYG 448

Query: 662 WNK 664
            NK
Sbjct: 449 GNK 451


>gi|302564007|ref|NP_001181005.1| xyloside xylosyltransferase 1 [Macaca mulatta]
          Length = 393

 Score = 66.6 bits (161), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 57/210 (27%), Positives = 89/210 (42%), Gaps = 28/210 (13%)

Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
           F + +++ +  S     +YS     L + + +++P+ + + I LD D+ F T+I +L+  
Sbjct: 182 FPVVEAMQKHFSAGSGTYYSDSIFFLSVAMHQIMPKEILRLIQLDLDLKFKTNIRELFEE 241

Query: 232 FSRLRQRQTIGLVENQSDWYLGKLWKNHKPWP---------ALGRGFNTGVILLDLTKLR 282
           F        IGL       Y    W+     P             GFN+GV+LL+L  +R
Sbjct: 242 FDSFLPGAIIGLAREMQPVYRHTFWQFRHENPQTRVGGPPPEGLPGFNSGVMLLNLEAMR 301

Query: 283 DISWAGFWRII--------AEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQL 334
                 + R++        A+K+        L DQD F  I  EHP L + L C WN QL
Sbjct: 302 QSPL--YSRLLEPAQVQQLADKY---HFRGHLGDQDFFTMIGMEHPRLFHVLDCTWNRQL 356

Query: 335 S----DNTRSD--ELCYTELTDLKIIHWNS 358
                D+  SD  E  +     +KI H N 
Sbjct: 357 CTWWRDHGYSDVFEAYFRCEGHVKIYHGNC 386


>gi|428180835|gb|EKX49701.1| hypothetical protein GUITHDRAFT_104664 [Guillardia theta CCMP2712]
          Length = 576

 Score = 66.6 bits (161), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 23/46 (50%), Positives = 34/46 (73%)

Query: 614 PASHAPTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVG 659
           P +HAP ++ RW+ AT PY + +  DF P+++  RD+PR+DTRFVG
Sbjct: 423 PRAHAPVDYERWIGATRPYAVTYQHDFAPFLLCRRDVPRFDTRFVG 468


>gi|219123437|ref|XP_002182031.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217406632|gb|EEC46571.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 527

 Score = 66.6 bits (161), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 69/265 (26%), Positives = 103/265 (38%), Gaps = 52/265 (19%)

Query: 449 VAQLSMDRLQMVEINPNNVDE---YEASSDGNDVTLVAQLSMDRLQMVEMLFKHW--EGP 503
           V+    D L  +    N  D    +  S D    TLV QLS DRL M+E   + W    P
Sbjct: 122 VSSTETDALPSIPFQTNFPDAPVCHSLSPDDVSYTLVTQLSQDRLWMMEHHCQRWGPSHP 181

Query: 504 ISLTLYMSDAEAQ---QFLSYAGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVS 560
           +S+ ++ +   A+   Q ++     E L S + +           YP+N LRN+A   V+
Sbjct: 182 MSIAVFTNQTVAEVRSQLVALGCAPEQLASVQTLPSTAAAVSD--YPVNVLRNLAFRAVT 239

Query: 561 TPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRY----------- 609
           T ++  +D+DF P   L+  L S  R         +  LV+PAF+ QR            
Sbjct: 240 TTHIVYVDVDFWPSADLHATL-SGARIRHALAQNERTALVIPAFQLQRQCRAWKECPDQN 298

Query: 610 -----------------RTAFP------ASHAPTNFSRWVNATTPYQIEWAP-----DFE 641
                            R AFP        H  T +  W+ +     +   P      +E
Sbjct: 299 VPVMPTHKAALERLSRNRQAFPFDPTNVGGHGSTKYRAWIKSQPDGVLLEIPCVLSNRYE 358

Query: 642 PYIVAHRD--LPRYDTRFVGFGWNK 664
           PY+V      LP +   F G+G NK
Sbjct: 359 PYLVVRYCDVLPPFQEAFSGYGKNK 383


>gi|193577758|ref|XP_001943061.1| PREDICTED: uncharacterized protein C3orf21 homolog [Acyrthosiphon
           pisum]
          Length = 344

 Score = 66.2 bits (160), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 51/159 (32%), Positives = 73/159 (45%), Gaps = 15/159 (9%)

Query: 189 HYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQS 248
           +YS     L   L  V P +L K IV+D D  F + I  L  +F      Q  GL    S
Sbjct: 151 YYSDDVFYLSPALHLVAPNSLDKAIVVDVDTAFQSTICDLHNVFDNFTNTQLFGLAPELS 210

Query: 249 DWYLGKLWK-------NHKPWPALG-RGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLT 300
             Y   LWK        +K  P +   G N+GV+L  L ++R    A + R+I+ +++ +
Sbjct: 211 PVYHHILWKWRNYRGDKYKTQPHINLNGLNSGVLLSYLERIRKS--AEYDRLISPEWISS 268

Query: 301 RLWT-----SLADQDIFNAIISEHPYLVYTLPCQWNVQL 334
            +        L DQD +  +  EHP L+Y L C WN QL
Sbjct: 269 VVQKYLFKGHLGDQDWYTLVSYEHPELIYRLSCGWNRQL 307


>gi|299469898|emb|CBN76752.1| Glycosyltransferase, family GT49 [Ectocarpus siliculosus]
          Length = 1372

 Score = 66.2 bits (160), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 72/300 (24%), Positives = 103/300 (34%), Gaps = 107/300 (35%)

Query: 461 EINPNNVD-----EYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEA 515
           E +P  +D     E       +D+T+  Q S+ RL  +E +   W G +S  +       
Sbjct: 440 EYHPKRIDVVARLESRHRRAESDITIFTQCSLSRLNRLERMCDLWTGVVSAAI------- 492

Query: 516 QQFLSYAGNSEALRSRRNIG--YHVVYKEG-------------------NFYPINTLRNV 554
              ++  G     R RR +   +  V   G                     YP+N+LRN 
Sbjct: 493 ---INGGGGHVDQRMRRKVSALHRRVESTGKCRLDICLCQPAPGLAARDGLYPVNSLRNA 549

Query: 555 ALNQVSTPYVFLLDIDFLPMFGLYPYLKSS---IRSMDMHGHGGK--------------- 596
           AL    T  V  +D+D +PM GL+  +       R ++M   GG+               
Sbjct: 550 ALRASRTDLVLSVDVDAMPMRGLHEVVSRPPVYARLLEM-CRGGRPPTHAAPAFPAASCS 608

Query: 597 --------------KVLVVPAFE--------TQRYRT----------------------- 611
                           LVVPA E          R RT                       
Sbjct: 609 GADTEAATLDTHPPTFLVVPALEFVGASSPDDPRIRTLLQEADNKRKAADLVVAGDLQAF 668

Query: 612 ---AFPASHAPTNFSRWVN---ATTPYQIEWAPDFEPY-IVAHRDLPRYDTRFVGFGWNK 664
               FP  H  TN  RW       TPY++ +A  +EPY +   R  P +D RF GFG +K
Sbjct: 669 HAECFPEGHRQTNLDRWTTLPVDATPYEVMYAEGYEPYGLFFRRHAPPFDERFRGFGLDK 728



 Score = 60.8 bits (146), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 51/179 (28%), Positives = 71/179 (39%), Gaps = 56/179 (31%)

Query: 542 EGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSS--IRSMDMHGHGGKKVL 599
           + ++YPIN LRN AL Q  T  V + D+DF P   L   L++S  I+S+          +
Sbjct: 112 KASWYPINQLRNRALAQADTDLVLICDVDFRPCRRLAHLLRASGEIKSIFKRVSCRLNCV 171

Query: 600 VVPAFE-----------------TQRYRTAFPA--------------------------- 615
           V+PAFE                  QR     P                            
Sbjct: 172 VLPAFEVSAARDDDDDGVSNEEAAQRAAIEMPEKSWWLKTLACKQELLKQWEQGRVVPFA 231

Query: 616 ------SHAPTNFSRWVNATTP---YQIEWAPDFEPYIVAHRDL-PRYDTRFVGFGWNK 664
                  H  TNF RW  AT     Y++ +   FEP+++ +R L P +D RF G+G NK
Sbjct: 232 SRVWAQGHRATNFDRWKAATDLSGWYEVAYEEGFEPFVIMNRLLVPAFDERFEGYGRNK 290


>gi|195489451|ref|XP_002092743.1| GE14358 [Drosophila yakuba]
 gi|194178844|gb|EDW92455.1| GE14358 [Drosophila yakuba]
          Length = 458

 Score = 66.2 bits (160), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 53/202 (26%), Positives = 85/202 (42%), Gaps = 46/202 (22%)

Query: 174 LADSVVEDV--------SWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           +  S+++D+        S  PN +YS     L L L ++  ++L + I+LD D++F +D+
Sbjct: 228 VCSSIIQDIAAKLSPYFSSAPNSYYSDSLFFLSLGLHRIADQSLNRAILLDCDIVFRSDV 287

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWP--ALGR---------------- 267
             L+  F      Q  GL    +  Y   L++    +P  + G                 
Sbjct: 288 RLLFNEFDNFLPHQLYGLAPELTPVYRHILYRYRVRYPKTSFGNPYYPINNEGSNQHSRV 347

Query: 268 -----GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWT---------SLADQDIFN 313
                G N+GV+LL L ++R+       +   EK   T + T          L DQD F 
Sbjct: 348 HHGYPGLNSGVVLLLLNRIRN------SKSYLEKLTHTEVHTLVAKYSFKGHLGDQDFFT 401

Query: 314 AIISEHPYLVYTLPCQWNVQLS 335
            +  E+P L+Y L C WN QL 
Sbjct: 402 LLGYEYPNLIYRLDCIWNRQLC 423



 Score = 50.4 bits (119), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 39/150 (26%), Positives = 67/150 (44%), Gaps = 14/150 (9%)

Query: 36  SILFYRKNPLHFHLITDTVA----LNILQTLFSTW--SVPQVEVSFYLADSVVEDV---- 85
           S+L +    LH H+ITD+ +    L ILQ     +  +V        +  S+++D+    
Sbjct: 181 SLLKHTSAQLHLHVITDSESQPSVLEILQRQIRRFRRTVIYTIYDVKVCSSIIQDIAAKL 240

Query: 86  ----SWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQR 141
               S  PN +YS     L L L ++  ++L + I+LD D++F +D+  L+  F      
Sbjct: 241 SPYFSSAPNSYYSDSLFFLSLGLHRIADQSLNRAILLDCDIVFRSDVRLLFNEFDNFLPH 300

Query: 142 QTIGLVENQSDWYLGKLWKNHKPWPALGRG 171
           Q  GL    +  Y   L++    +P    G
Sbjct: 301 QLYGLAPELTPVYRHILYRYRVRYPKTSFG 330


>gi|402861997|ref|XP_003895359.1| PREDICTED: xyloside xylosyltransferase 1 [Papio anubis]
          Length = 393

 Score = 66.2 bits (160), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 56/210 (26%), Positives = 89/210 (42%), Gaps = 28/210 (13%)

Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
           F + +++ +  S     +YS     L + + +++P+ + + I LD D+ F T++ +L+  
Sbjct: 182 FPVVEAMQKHFSAGSGTYYSDSIFFLSVAMHQIMPKEILRLIQLDLDLKFKTNVRELFEE 241

Query: 232 FSRLRQRQTIGLVENQSDWYLGKLWKNHKPWP---------ALGRGFNTGVILLDLTKLR 282
           F        IGL       Y    W+     P             GFN+GV+LL+L  +R
Sbjct: 242 FDSFLPGAIIGLAREMQPVYRHTFWQFRHENPQTRVGGPPPEGLPGFNSGVMLLNLEAMR 301

Query: 283 DISWAGFWRII--------AEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQL 334
                 + R++        A+K+        L DQD F  I  EHP L + L C WN QL
Sbjct: 302 QSPL--YSRLLEPAQVQQLADKY---HFRGHLGDQDFFTMIGMEHPRLFHVLDCTWNRQL 356

Query: 335 S----DNTRSD--ELCYTELTDLKIIHWNS 358
                D+  SD  E  +     +KI H N 
Sbjct: 357 CTWWRDHGYSDVFEAYFRCEGHIKIYHGNC 386


>gi|291236100|ref|XP_002737979.1| PREDICTED: UDP-GlcNAc:betaGal
           beta-1,3-N-acetylglucosaminyltransferase 1-like
           [Saccoglossus kowalevskii]
          Length = 888

 Score = 66.2 bits (160), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 68/279 (24%), Positives = 107/279 (38%), Gaps = 92/279 (32%)

Query: 477 NDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDA-----------------EAQQFL 519
           +DVT+V Q S + L ++    + W GPIS+ ++  +                  + Q+F 
Sbjct: 545 DDVTIVTQCSPNHLHLLFDQAEIWAGPISVAIFAPNPNITALYVVIAHLRHCSLKIQKFA 604

Query: 520 SY-------------AGNSEALRSRR-NIGYHVVYKEG---NF---------------YP 547
           S+               N E   SR   I  H    +    NF               YP
Sbjct: 605 SFHLVYPLRQMSQSRGFNKENYSSRPIEIAMHCTEVKDWIKNFDNLDHENYVLAGDVPYP 664

Query: 548 INTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKS--------SIRSMDMHGHGGKKVL 599
            N LRNVA     T Y+ +LDID +P  GL    KS        S+  ++M     + V 
Sbjct: 665 NNLLRNVAWQGSVTEYILVLDIDIIPSSGLLASFKSFDRRRKSLSLSHLEME-EPPETVY 723

Query: 600 VVPAFETQR--------------------------YRTAFPASHAPTNFSRW--VNATTP 631
           V+PAFE +                           Y          ++++RW    + +P
Sbjct: 724 VLPAFEIREGIDRIFTPMKKNELLTLWNEGDVRPFYEEVCSKCQIHSSYTRWRTFESESP 783

Query: 632 ------YQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
                 Y++EW   +EP+ +A R +P+YD RF  +G+N+
Sbjct: 784 DMMNIAYELEWKDPWEPFYIAKRHIPKYDERFKQYGFNR 822



 Score = 48.1 bits (113), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 54/250 (21%), Positives = 100/250 (40%), Gaps = 73/250 (29%)

Query: 477 NDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAE---------------AQQFLSY 521
           +DVT+V   S+  L  +  + + WEGPIS+ ++ +D                  +Q +++
Sbjct: 89  DDVTIVTHSSVSDLHHLVGMAERWEGPISVAVFSTDISLFASSVSNLRRCFPLVKQHVTF 148

Query: 522 -------AGNSE--------------ALRSRRNIGYHVVYKEGN-FYPINTLRNVALNQV 559
                   G +E               L   RN+       + N  YPIN L+N AL   
Sbjct: 149 HLVTTIAPGKAEISNDETILKVPCGMVLHVTRNLEDESYSSDDNPAYPINILQNEALKGS 208

Query: 560 STPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAF--------ETQRYRT 611
            T Y+  L++D +P   L    K    +++      + V ++P F        ET +Y++
Sbjct: 209 ITKYILTLNVDLMPNANLLKNFKLFTDAIEDDA-AIQNVYILPDFEICDDDVHETPKYKS 267

Query: 612 AF-------------------PASHAPTNFSRWV----NAT----TPYQIEWAPDFEPYI 644
                                 + H   N+++W     +AT    + Y++E+   ++P+ 
Sbjct: 268 EVLDLWREGKDLIRPYRYAMCRSCHNHGNYTKWRQILPSATNTLDSSYEVEFRYPWQPFY 327

Query: 645 VAHRDLPRYD 654
           ++ RD+P YD
Sbjct: 328 ISSRDVPLYD 337


>gi|195552258|ref|XP_002076411.1| GD15463 [Drosophila simulans]
 gi|194202060|gb|EDX15636.1| GD15463 [Drosophila simulans]
          Length = 389

 Score = 66.2 bits (160), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 64/279 (22%), Positives = 108/279 (38%), Gaps = 89/279 (31%)

Query: 468 DEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEA----QQFLSYAG 523
           ++Y  SS+G+ V L  Q S++RL  +  +  +W+G +S+ L+ +  E     Q F++Y  
Sbjct: 41  EQYVDSSEGSLVCLATQTSVERLNSLPQVAANWQGKMSVALFAAGPEEFVVLQYFVTYMR 100

Query: 524 NSEALRSRRNIGYHVV-------------------------------------------- 539
              A   R N  +H++                                            
Sbjct: 101 LCFA-NIRENATFHLLTPRDFDKLPRVAALPLNMRGKFDCQYPDRTLKALLKFRSLKTLQ 159

Query: 540 YKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVL 599
           +++ N YP N +RN+A     T YVFL DID +P     P L   +R+ +          
Sbjct: 160 WRQRNTYPQNHMRNLARKGCQTKYVFLTDIDIVPSTNSVPQLNHFLRTANCTKSCA---Y 216

Query: 600 VVPAFETQRYRTAFPAS------------------------HAPTNFSRWVNATTP---- 631
           V+P FE    R  FP S                           TNFS+W+++ T     
Sbjct: 217 VIPTFEID-IRATFPRSKNALVRLIRKGLARPFHEKVFIYNQYATNFSKWLSSNTNDTEV 275

Query: 632 ------YQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
                    E+   +EP+ ++  + P +D RF+G+G+ +
Sbjct: 276 SVSHIVTNFEFL--YEPFYISVDNAPAHDERFLGYGFTR 312


>gi|427785201|gb|JAA58052.1| Putative gt8 like 2 represents a subfamily protein of gt8 with
           unknown function [Rhipicephalus pulchellus]
          Length = 387

 Score = 66.2 bits (160), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 41/138 (29%), Positives = 67/138 (48%), Gaps = 12/138 (8%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSD----WYLG 253
           +L LP +LP+  A  I +D DV+F + I +LW  F+ +       L     D    WY  
Sbjct: 156 RLFLPSMLPDVDA-VIYVDADVLFLSPIEELWKHFTAMNSSHLAALAPESEDYATNWY-- 212

Query: 254 KLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFN 313
           + +  H  +  LG   N+GV+L++LT++R+  W      +  K+     W    DQD+ N
Sbjct: 213 RRFARHPFYQPLG--VNSGVMLMNLTRMREFGWEKRLGPLLRKYGSNISW---GDQDLLN 267

Query: 314 AIISEHPYLVYTLPCQWN 331
            +  +HP  +    C+WN
Sbjct: 268 ILFHDHPERLLLFSCRWN 285


>gi|358336522|dbj|GAA55009.1| glycosyltransferase-like protein LARGE [Clonorchis sinensis]
          Length = 230

 Score = 66.2 bits (160), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 36/86 (41%), Positives = 43/86 (50%), Gaps = 11/86 (12%)

Query: 263 PALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQ-----------DI 311
           P   RG NTGV+L+ L KLR +SW   W    +  L        ADQ           DI
Sbjct: 61  PMEKRGINTGVLLIHLQKLRKVSWHRMWMDTIQDLLKRTKNLKAADQVSAVRIYIHIQDI 120

Query: 312 FNAIISEHPYLVYTLPCQWNVQLSDN 337
           +NAI   HP L+Y LPC WNVQ+  N
Sbjct: 121 YNAIFGIHPELLYQLPCTWNVQVIRN 146


>gi|355747173|gb|EHH51787.1| hypothetical protein EGM_11230, partial [Macaca fascicularis]
          Length = 318

 Score = 66.2 bits (160), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 56/208 (26%), Positives = 88/208 (42%), Gaps = 24/208 (11%)

Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
           F + +++ +  S     +YS     L + + +++P+ + + I LD D+ F T+I +L+  
Sbjct: 107 FPVVEAMQKHFSAGSGTYYSDSIFFLSVAMHQIMPKEILRLIQLDLDLKFKTNIRELFEE 166

Query: 232 FSRLRQRQTIGLVENQSDWYLGKLWKNHKPWP---------ALGRGFNTGVILLDLTKLR 282
           F        IGL       Y    W+     P             GFN+GV+LL+L  +R
Sbjct: 167 FDSFLPGAIIGLAREMQPVYRHTFWQFRHENPQTRVGGPPPEGLPGFNSGVMLLNLEAMR 226

Query: 283 D------ISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLS- 335
                  +      + +A+K+        L DQD F  I  EHP L + L C WN QL  
Sbjct: 227 QSPLYSRLLEPAQVQQLADKY---HFRGHLGDQDFFTMIGMEHPRLFHVLDCTWNRQLCT 283

Query: 336 ---DNTRSD--ELCYTELTDLKIIHWNS 358
              D+  SD  E  +     +KI H N 
Sbjct: 284 WWRDHGYSDVFEAYFRCEGHVKIYHGNC 311


>gi|355560144|gb|EHH16872.1| hypothetical protein EGK_12239, partial [Macaca mulatta]
          Length = 308

 Score = 65.9 bits (159), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 56/208 (26%), Positives = 88/208 (42%), Gaps = 24/208 (11%)

Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
           F + +++ +  S     +YS     L + + +++P+ + + I LD D+ F T+I +L+  
Sbjct: 97  FPVVEAMQKHFSAGSGTYYSDSIFFLSVAMHQIMPKEILRLIQLDLDLKFKTNIRELFEE 156

Query: 232 FSRLRQRQTIGLVENQSDWYLGKLWKNHKPWP---------ALGRGFNTGVILLDLTKLR 282
           F        IGL       Y    W+     P             GFN+GV+LL+L  +R
Sbjct: 157 FDSFLPGAIIGLAREMQPVYRHTFWQFRHENPQTRVGGPPPEGLPGFNSGVMLLNLEAMR 216

Query: 283 D------ISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLS- 335
                  +      + +A+K+        L DQD F  I  EHP L + L C WN QL  
Sbjct: 217 QSPLYSRLLEPAQVQQLADKY---HFRGHLGDQDFFTMIGMEHPRLFHVLDCTWNRQLCT 273

Query: 336 ---DNTRSD--ELCYTELTDLKIIHWNS 358
              D+  SD  E  +     +KI H N 
Sbjct: 274 WWRDHGYSDVFEAYFRCEGHVKIYHGNC 301


>gi|16768190|gb|AAL28314.1| GH22974p [Drosophila melanogaster]
          Length = 458

 Score = 65.9 bits (159), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 52/196 (26%), Positives = 83/196 (42%), Gaps = 34/196 (17%)

Query: 174 LADSVVEDV--------SWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           +  S+++D+        S  PN +YS     L L L ++   TL + I+LD D++F +D+
Sbjct: 228 VCSSIIQDIAAKLSPYFSSTPNSYYSDSLFFLSLGLHRIADRTLNRAILLDCDIVFRSDV 287

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWP--ALGR---------------- 267
             L+  F      Q  GL    +  Y   L++    +P  + G                 
Sbjct: 288 RLLFNEFDNFLPHQLYGLAPELTPVYRHILYRYRVRYPKTSFGNPYYPINNEGGNQHSRV 347

Query: 268 -----GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWT---SLADQDIFNAIISEH 319
                G N+GV+LL L ++R+          +E   L   ++    L DQD F  +  E+
Sbjct: 348 HHGYPGLNSGVVLLLLNRIRNSKSYLEKLTHSEVHTLVAKYSFKGHLGDQDFFTLLGYEY 407

Query: 320 PYLVYTLPCQWNVQLS 335
           P L+Y L C WN QL 
Sbjct: 408 PNLIYRLDCIWNRQLC 423



 Score = 50.8 bits (120), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 40/150 (26%), Positives = 66/150 (44%), Gaps = 14/150 (9%)

Query: 36  SILFYRKNPLHFHLITDTVA----LNILQTLFSTW--SVPQVEVSFYLADSVVEDV---- 85
           S+L +    LH H+ITD+ +    L ILQ     +  +V        +  S+++D+    
Sbjct: 181 SLLKHTSAQLHLHVITDSESQPSVLEILQRQIRRFRRTVIYTIYDVKVCSSIIQDIAAKL 240

Query: 86  ----SWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQR 141
               S  PN +YS     L L L ++   TL + I+LD D++F +D+  L+  F      
Sbjct: 241 SPYFSSTPNSYYSDSLFFLSLGLHRIADRTLNRAILLDCDIVFRSDVRLLFNEFDNFLPH 300

Query: 142 QTIGLVENQSDWYLGKLWKNHKPWPALGRG 171
           Q  GL    +  Y   L++    +P    G
Sbjct: 301 QLYGLAPELTPVYRHILYRYRVRYPKTSFG 330


>gi|340726907|ref|XP_003401793.1| PREDICTED: glucoside xylosyltransferase 1-like [Bombus terrestris]
          Length = 364

 Score = 65.9 bits (159), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 43/136 (31%), Positives = 67/136 (49%), Gaps = 8/136 (5%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP VL +  A  + +DTD +F     ++W  F ++   Q   L     D   G  W 
Sbjct: 143 RLFLPSVLNDIDA-VLYVDTDTLFLAPPEKIWDEFKKMNSSQLAALSPEHEDPNTG--WY 199

Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAI 315
           N     P  G+ G N+GV+L++LT++R+  W  +   I +++ L   W    DQDI N I
Sbjct: 200 NRFAKHPYYGKLGVNSGVMLMNLTRMREFRWIEYVIPIHKEYKLKITW---GDQDIINII 256

Query: 316 ISEHPYLVYTLPCQWN 331
              HP  +Y   C++N
Sbjct: 257 FHYHPEKLYVYSCRYN 272


>gi|427778361|gb|JAA54632.1| Putative gt8 like 2 represents a subfamily protein of gt8 with
           unknown function [Rhipicephalus pulchellus]
          Length = 375

 Score = 65.9 bits (159), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 41/138 (29%), Positives = 67/138 (48%), Gaps = 12/138 (8%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSD----WYLG 253
           +L LP +LP+  A  I +D DV+F + I +LW  F+ +       L     D    WY  
Sbjct: 156 RLFLPSMLPDVDA-VIYVDADVLFLSPIEELWKHFTAMNSSHLAALAPESEDYATNWY-- 212

Query: 254 KLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFN 313
           + +  H  +  LG   N+GV+L++LT++R+  W      +  K+     W    DQD+ N
Sbjct: 213 RRFARHPFYQPLG--VNSGVMLMNLTRMREFGWEKRLGPLLRKYGSNISW---GDQDLLN 267

Query: 314 AIISEHPYLVYTLPCQWN 331
            +  +HP  +    C+WN
Sbjct: 268 ILFHDHPERLLLFSCRWN 285


>gi|152125897|sp|Q3UHH8.2|GXLT1_MOUSE RecName: Full=Glucoside xylosyltransferase 1; AltName:
           Full=Glycosyltransferase 8 domain-containing protein 3
 gi|148672339|gb|EDL04286.1| mCG19737 [Mus musculus]
 gi|219521061|gb|AAI71972.1| Glt8d3 protein [Mus musculus]
 gi|223461493|gb|AAI41303.1| Glt8d3 protein [Mus musculus]
          Length = 404

 Score = 65.9 bits (159), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 44/173 (25%), Positives = 77/173 (44%), Gaps = 19/173 (10%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L E +   + +DTD++F   +  +W+L  +    Q   +     +  +G  W 
Sbjct: 164 RLFLPLILKE-VDSLLYVDTDILFLRPVDDIWSLLKKFNSTQIAAMAPEHEEPRIG--WY 220

Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
           N     P  GR G N+GV+L+++T++R            + W      + +K+ L   W 
Sbjct: 221 NRFARHPYYGRTGVNSGVMLMNMTRMRRKYFKNDMTTARLQWGDILMPLLKKYKLNITW- 279

Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
              DQD+ N + S +P  ++  PCQWN +             E   + I+H N
Sbjct: 280 --GDQDLLNIVFSHNPESLFVFPCQWNYRPDHCIYGSNCREAEEEGVFILHGN 330



 Score = 42.4 bits (98), Expect = 0.83,   Method: Compositional matrix adjust.
 Identities = 36/154 (23%), Positives = 66/154 (42%), Gaps = 15/154 (9%)

Query: 13  EVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWS-VPQV 71
           E + +A+V  G       VT++KS L +   PLH H+  +    +  +   ++WS + + 
Sbjct: 78  EKMHLAVVACG-ERLEETVTMLKSALIFSIKPLHVHIFAEDQLHDSFKDRLASWSFLRRF 136

Query: 72  EVSFY----LADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATD 127
           + S Y      DS  +   W   K         +L LP +L E +   + +DTD++F   
Sbjct: 137 DYSLYPITFPGDSAAD---W---KKLFKPCASQRLFLPLILKE-VDSLLYVDTDILFLRP 189

Query: 128 IAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKN 161
           +  +W+L  +    Q   +     +  +G  W N
Sbjct: 190 VDDIWSLLKKFNSTQIAAMAPEHEEPRIG--WYN 221


>gi|380020565|ref|XP_003694153.1| PREDICTED: glucoside xylosyltransferase 1-like [Apis florea]
          Length = 364

 Score = 65.9 bits (159), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 42/136 (30%), Positives = 66/136 (48%), Gaps = 8/136 (5%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP VL   +   + +DTD +F     ++W  F ++   Q   L     D   G  W 
Sbjct: 144 RLFLPSVL-NNIDAVLYVDTDTLFLAPPEKIWDEFKKMNSSQLAALSPEHEDPNTG--WY 200

Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAI 315
           N     P  G+ G N+GV+L++LT++R+  W  +   I +++ L   W    DQDI N I
Sbjct: 201 NRFAKHPYYGKLGVNSGVMLMNLTRMREFRWIEYVIPIHKEYKLKITW---GDQDIINII 257

Query: 316 ISEHPYLVYTLPCQWN 331
              HP  +Y   C++N
Sbjct: 258 FHYHPEKLYVYSCRYN 273


>gi|348582762|ref|XP_003477145.1| PREDICTED: uncharacterized protein C3orf21 homolog [Cavia
           porcellus]
          Length = 393

 Score = 65.9 bits (159), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 55/208 (26%), Positives = 87/208 (41%), Gaps = 24/208 (11%)

Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
           F + +++ +  S     +YS     L + + +++P+ + + I LD D+ + T+I +L+  
Sbjct: 182 FPVVEAMQKHFSAGSGTYYSDSIFFLSVAMHQIMPKEILRIIQLDLDLKYKTNIRELFEE 241

Query: 232 FSRLRQRQTIGLVENQSDWYLGKLWKNHKPWP---------ALGRGFNTGVILLDLTKLR 282
           F        IG+       Y    W+  +  P             GFN+GV+LL L  +R
Sbjct: 242 FDNFLPGAVIGIAREMQPVYRHTFWQFRQENPKTRVGGPPPEGLPGFNSGVMLLSLEAMR 301

Query: 283 DISWAGFW------RIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLS- 335
                G        + +A K+        L DQD F  I  EHP L + L C WN QL  
Sbjct: 302 QSPLYGRLLEPEQVQQLANKY---HFRGHLGDQDFFTMIGMEHPELFHVLDCTWNRQLCT 358

Query: 336 ---DNTRSD--ELCYTELTDLKIIHWNS 358
              D+  SD  E  +     +KI H N 
Sbjct: 359 WWRDHGYSDVFEAYFRCEGHVKIYHGNC 386


>gi|332262844|ref|XP_003280468.1| PREDICTED: LOW QUALITY PROTEIN: xyloside xylosyltransferase 1
           [Nomascus leucogenys]
          Length = 394

 Score = 65.9 bits (159), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 56/210 (26%), Positives = 89/210 (42%), Gaps = 28/210 (13%)

Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
           F + +++ +  S     +YS     L + + +++P+ + + I LD D+ F T+I +L+  
Sbjct: 183 FPVVEAMQKHFSAGSGTYYSDSIFFLSVAMHQIMPKEILRIIQLDLDLKFKTNIRELFEE 242

Query: 232 FSRLRQRQTIGLVENQSDWYLGKLWKNHKPWP---------ALGRGFNTGVILLDLTKLR 282
           F        IG+ +     Y    W+     P             GFN+GV+LL L  +R
Sbjct: 243 FDSFLPGTIIGIAQEMQPVYRHTFWQFRHENPQTRVGGPPPEGLPGFNSGVMLLSLEAMR 302

Query: 283 DISWAGFWRII--------AEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQL 334
                 + R++        A+K+        L DQD F  I  EHP L + L C WN QL
Sbjct: 303 QSPL--YSRLLEPAQVQQLADKY---HFRGHLGDQDFFTMIGMEHPKLFHVLDCTWNRQL 357

Query: 335 S----DNTRSD--ELCYTELTDLKIIHWNS 358
                D+  SD  E  +     +KI H N 
Sbjct: 358 CTWWRDHGYSDVFEAYFRCEGHVKIYHGNC 387


>gi|380798379|gb|AFE71065.1| xyloside xylosyltransferase 1, partial [Macaca mulatta]
          Length = 272

 Score = 65.9 bits (159), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 56/208 (26%), Positives = 88/208 (42%), Gaps = 24/208 (11%)

Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
           F + +++ +  S     +YS     L + + +++P+ + + I LD D+ F T+I +L+  
Sbjct: 61  FPVVEAMQKHFSAGSGTYYSDSIFFLSVAMHQIMPKEILRLIQLDLDLKFKTNIRELFEE 120

Query: 232 FSRLRQRQTIGLVENQSDWYLGKLWKNHKPWP---------ALGRGFNTGVILLDLTKLR 282
           F        IGL       Y    W+     P             GFN+GV+LL+L  +R
Sbjct: 121 FDSFLPGAIIGLAREMQPVYRHTFWQFRHENPQTRVGGPPPEGLPGFNSGVMLLNLEAMR 180

Query: 283 D------ISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLS- 335
                  +      + +A+K+        L DQD F  I  EHP L + L C WN QL  
Sbjct: 181 QSPLYSRLLEPAQVQQLADKY---HFRGHLGDQDFFTMIGMEHPRLFHVLDCTWNRQLCT 237

Query: 336 ---DNTRSD--ELCYTELTDLKIIHWNS 358
              D+  SD  E  +     +KI H N 
Sbjct: 238 WWRDHGYSDVFEAYFRCEGHVKIYHGNC 265


>gi|195430896|ref|XP_002063484.1| GK21934 [Drosophila willistoni]
 gi|194159569|gb|EDW74470.1| GK21934 [Drosophila willistoni]
          Length = 429

 Score = 65.5 bits (158), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 50/190 (26%), Positives = 79/190 (41%), Gaps = 42/190 (22%)

Query: 183 SWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIG 242
           S  PN +YS     L L L ++   +L+K I+LD D++F +D+  L+  F      Q  G
Sbjct: 208 SSAPNSYYSDSLFFLSLGLHRIADTSLSKAILLDCDIVFRSDVRLLFGEFDNFLPHQLFG 267

Query: 243 LVENQSDWYLGKLWKNHKPWP--ALGR-----------------------------GFNT 271
           L    +  Y   L++    +P  + G                              G N+
Sbjct: 268 LAPELTPVYRHILYRYRVRFPKTSFGNPYYPMRFAKNNEIGTGDQANHIHEHHGYPGLNS 327

Query: 272 GVILLDLTKLRD-------ISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVY 324
           GV+LL L ++R+       ++     R++A+          L DQD F  +  E+P L+Y
Sbjct: 328 GVVLLLLNRIRESKTYVEALTSTEVHRLVAK----YSFKGHLGDQDFFTLLGYEYPNLIY 383

Query: 325 TLPCQWNVQL 334
            L C WN QL
Sbjct: 384 RLDCVWNRQL 393



 Score = 52.8 bits (125), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 40/150 (26%), Positives = 66/150 (44%), Gaps = 14/150 (9%)

Query: 36  SILFYRKNPLHFHLITD----TVALNILQTLFSTW--SVPQVEVSFYLADSVVEDV---- 85
           S+L +    LH H+ITD    T  L+ILQ     +  +V        +   +++D+    
Sbjct: 144 SLLKHTNAQLHLHVITDKDSQTSVLDILQKQIRRFRRTVIYTMYDVKVCSVIIQDIAAKL 203

Query: 86  ----SWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQR 141
               S  PN +YS     L L L ++   +L+K I+LD D++F +D+  L+  F      
Sbjct: 204 SPYFSSAPNSYYSDSLFFLSLGLHRIADTSLSKAILLDCDIVFRSDVRLLFGEFDNFLPH 263

Query: 142 QTIGLVENQSDWYLGKLWKNHKPWPALGRG 171
           Q  GL    +  Y   L++    +P    G
Sbjct: 264 QLFGLAPELTPVYRHILYRYRVRFPKTSFG 293


>gi|402591667|gb|EJW85596.1| hypothetical protein WUBG_03494 [Wuchereria bancrofti]
          Length = 181

 Score = 65.5 bits (158), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 38/96 (39%), Positives = 44/96 (45%), Gaps = 26/96 (27%)

Query: 595 GKKVLVVPAFE---------------------TQRY----RTAFPASHAPTNFSRWVNAT 629
            K+  VVPAFE                      QR     R  +   H  T++ RW NA 
Sbjct: 13  NKRAFVVPAFEYMSNQVPAIPLTKNELLIELDAQRMQIFRRNIWIQGHVATDYDRWRNAD 72

Query: 630 TPYQIEWAPDFEPYIVA-HRDLPRYDTRFVGFGWNK 664
             Y + W  D+EPYIV   R LP YD RFVGFGWNK
Sbjct: 73  QEYSVSWRTDYEPYIVVKRRGLPPYDQRFVGFGWNK 108


>gi|395839733|ref|XP_003792735.1| PREDICTED: xyloside xylosyltransferase 1 [Otolemur garnettii]
          Length = 393

 Score = 65.5 bits (158), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 54/205 (26%), Positives = 86/205 (41%), Gaps = 18/205 (8%)

Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
           F + +++ +  S     +YS     L + + +++P+ + + I +D D+ + T+I +L+  
Sbjct: 182 FPVVEAMQKHFSAGSGTYYSDSIFFLSIAMHQIMPKEILRVIQVDLDLKYKTNIRELFEE 241

Query: 232 FSRLRQRQTIGLVENQSDWYLGKLWKNHKPWP---------ALGRGFNTGVILLDLTKLR 282
           F        IG+       Y    W+  +  P             GFN+GV+LL+L  +R
Sbjct: 242 FDNFLPGAVIGIAREMQPVYRHTFWQFRRENPKTRVGGPPPEGLPGFNSGVMLLNLEAMR 301

Query: 283 DISWAGFWRIIAEKFLLT---RLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLS---- 335
                      A+   LT        L DQD F  I  EHP L + L C WN QL     
Sbjct: 302 QSPLYSHLLEPAQVQQLTDKYHFRGHLGDQDFFTMIGMEHPELFHVLDCTWNRQLCTWWR 361

Query: 336 DNTRSD--ELCYTELTDLKIIHWNS 358
           D+  SD  E  +     +KI H N 
Sbjct: 362 DHGYSDVFEAYFQCEGHVKIYHGNC 386


>gi|428175844|gb|EKX44732.1| hypothetical protein GUITHDRAFT_109510 [Guillardia theta CCMP2712]
          Length = 550

 Score = 65.5 bits (158), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 59/261 (22%), Positives = 98/261 (37%), Gaps = 72/261 (27%)

Query: 474 SDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALR--SR 531
           ++  +VT V   ++DRL  +  L  HW  P+S+ +Y    E ++ +S       L   S+
Sbjct: 89  NECENVTFVTSATIDRLPTLLNLLAHWSRPVSVAIYARTHEEERQISLFVRKYVLPWCSK 148

Query: 532 RNIGYHVVYKEGNF----------YPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL 581
           ++    +                 +P+N LR +A +   T  V   DID LP   L   +
Sbjct: 149 KHQSISICMLTACIDESRPPFILPFPVNRLRRLATSIAKTDLVIYADIDLLPSADLGSII 208

Query: 582 KSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS------------------------- 616
                S  + G   K VLV+P+F +       PAS                         
Sbjct: 209 MEKFLSRQVQG--WKDVLVIPSFRSSDESWPSPASILGSDNDVWIEAEPIKTDSLLERFL 266

Query: 617 --------------------------HAPTNFSRWVNATTPYQIEWAPDFEPYIVAHRD- 649
                                     HAPT+++RW NA + Y++++   +EPY+V +R  
Sbjct: 267 NETVTIPGLKELELPRGDLWFHSSVFHAPTDYARWFNAKSLYEVDYVLGYEPYVVLNRSD 326

Query: 650 ------LPRYDTRFVGFGWNK 664
                    +D  F  +GW+K
Sbjct: 327 WHGQHGWGLWDEAFDTWGWDK 347


>gi|195347329|ref|XP_002040206.1| GM16083 [Drosophila sechellia]
 gi|194135555|gb|EDW57071.1| GM16083 [Drosophila sechellia]
          Length = 417

 Score = 65.5 bits (158), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 51/196 (26%), Positives = 83/196 (42%), Gaps = 34/196 (17%)

Query: 174 LADSVVEDV--------SWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           +  S+++D+        S  PN +YS     L L L ++   +L + I+LD D++F +D+
Sbjct: 187 VCSSIIQDIAAKLSPYFSSTPNSYYSDSLFFLSLGLHRIADRSLNRAILLDCDIVFRSDV 246

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWP--ALGR---------------- 267
             L+  F      Q  GL    +  Y   L++    +P  + G                 
Sbjct: 247 RLLFNEFDNFLPHQLYGLAPELTPVYRHILYRYRVRYPKTSFGNPYYPINNEGGNQHSRV 306

Query: 268 -----GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWT---SLADQDIFNAIISEH 319
                G N+GV+LL L ++R+          +E   L   ++    L DQD F  +  E+
Sbjct: 307 HHGYPGLNSGVVLLLLNRIRNSKSYLEKLTHSEVHTLVAKYSFKGHLGDQDFFTLLGYEY 366

Query: 320 PYLVYTLPCQWNVQLS 335
           P L+Y L C WN QL 
Sbjct: 367 PNLIYRLDCIWNRQLC 382



 Score = 49.7 bits (117), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 40/154 (25%), Positives = 68/154 (44%), Gaps = 22/154 (14%)

Query: 36  SILFYRKNPLHFHLITDTVA----LNILQTLFS------TWSVPQVEVSFYLADSVVEDV 85
           S+L +    LH H+ITD+ +    L ILQ           +++  V+V      S+++D+
Sbjct: 140 SLLKHTSAQLHLHVITDSESQPSVLEILQRQIRRFRRTLIYTIYDVKV----CSSIIQDI 195

Query: 86  --------SWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSR 137
                   S  PN +YS     L L L ++   +L + I+LD D++F +D+  L+  F  
Sbjct: 196 AAKLSPYFSSTPNSYYSDSLFFLSLGLHRIADRSLNRAILLDCDIVFRSDVRLLFNEFDN 255

Query: 138 LRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRG 171
               Q  GL    +  Y   L++    +P    G
Sbjct: 256 FLPHQLYGLAPELTPVYRHILYRYRVRYPKTSFG 289


>gi|397472300|ref|XP_003807689.1| PREDICTED: xyloside xylosyltransferase 1 [Pan paniscus]
          Length = 393

 Score = 65.5 bits (158), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 56/210 (26%), Positives = 89/210 (42%), Gaps = 28/210 (13%)

Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
           F + +++ +  S     +YS     L + + +++P+ + + I LD D+ F T+I +L+  
Sbjct: 182 FPVVEAMQKHFSAGSGTYYSDSIFFLSVAMHQIMPKEILQIIQLDLDLKFKTNIRELFEE 241

Query: 232 FSRLRQRQTIGLVENQSDWYLGKLWKNHKPWP---------ALGRGFNTGVILLDLTKLR 282
           F        IG+       Y    W+     P             GFN+GV+LL+L  +R
Sbjct: 242 FDSFLPGAIIGIAREMQPVYRHTFWQFRHENPQTRVGGPPPEGLPGFNSGVMLLNLEAMR 301

Query: 283 DISWAGFWRII--------AEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQL 334
                 + R++        A+K+        L DQD F  I  EHP L + L C WN QL
Sbjct: 302 QSPL--YSRLLEPAQVQQLADKY---HFRGHLGDQDFFTMIGMEHPKLFHVLDCTWNRQL 356

Query: 335 S----DNTRSD--ELCYTELTDLKIIHWNS 358
                D+  SD  E  +     +KI H N 
Sbjct: 357 CTWWRDHGYSDVFEAYFRCEGHVKIYHGNC 386


>gi|256082479|ref|XP_002577483.1| glycosyltransferase-related [Schistosoma mansoni]
 gi|350644935|emb|CCD60351.1| glycosyltransferase-related [Schistosoma mansoni]
          Length = 118

 Score = 65.1 bits (157), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 35/96 (36%), Positives = 53/96 (55%), Gaps = 2/96 (2%)

Query: 201 LPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHK 260
           + ++LP T++K IVLD D++  TDI +LW  F      Q+IG+   Q+  +   + K + 
Sbjct: 2   ITEILPLTISKVIVLDIDILLNTDIIELWNYFDYFTGTQSIGIGLEQNPHFQEVMTKLNS 61

Query: 261 PWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEK 296
            W   G G+N GV+L  L+KLR   W   W  I +K
Sbjct: 62  NWK--GYGYNNGVLLFHLSKLRSTKWDYLWLNITKK 95



 Score = 45.1 bits (105), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 22/66 (33%), Positives = 36/66 (54%)

Query: 104 LPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHK 163
           + ++LP T++K IVLD D++  TDI +LW  F      Q+IG+   Q+  +   + K + 
Sbjct: 2   ITEILPLTISKVIVLDIDILLNTDIIELWNYFDYFTGTQSIGIGLEQNPHFQEVMTKLNS 61

Query: 164 PWPALG 169
            W   G
Sbjct: 62  NWKGYG 67


>gi|410221668|gb|JAA08053.1| chromosome 3 open reading frame 21 [Pan troglodytes]
          Length = 393

 Score = 65.1 bits (157), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 56/210 (26%), Positives = 89/210 (42%), Gaps = 28/210 (13%)

Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
           F + +++ +  S     +YS     L + + +++P+ + + I LD D+ F T+I +L+  
Sbjct: 182 FPVVEAMQKHFSAGSGTYYSDSIFFLSVAMHQIMPKEILQIIQLDLDLKFKTNIRELFEE 241

Query: 232 FSRLRQRQTIGLVENQSDWYLGKLWKNHKPWP---------ALGRGFNTGVILLDLTKLR 282
           F        IG+       Y    W+     P             GFN+GV+LL+L  +R
Sbjct: 242 FDSFLPGAIIGIAREMQPVYRHTFWQFRHENPQTRVGGPPPEGLPGFNSGVMLLNLEAMR 301

Query: 283 DISWAGFWRII--------AEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQL 334
                 + R++        A+K+        L DQD F  I  EHP L + L C WN QL
Sbjct: 302 QSPL--YSRLLEPAQVQQLADKY---HFRGHLGDQDFFTMIGMEHPKLFHVLDCTWNRQL 356

Query: 335 S----DNTRSD--ELCYTELTDLKIIHWNS 358
                D+  SD  E  +     +KI H N 
Sbjct: 357 CTWWRDHGYSDVFEAYFRCEGHVKIYHGNC 386


>gi|114591198|ref|XP_516961.2| PREDICTED: xyloside xylosyltransferase 1 isoform 2 [Pan
           troglodytes]
 gi|410247994|gb|JAA11964.1| chromosome 3 open reading frame 21 [Pan troglodytes]
 gi|410247996|gb|JAA11965.1| chromosome 3 open reading frame 21 [Pan troglodytes]
 gi|410247998|gb|JAA11966.1| chromosome 3 open reading frame 21 [Pan troglodytes]
 gi|410294112|gb|JAA25656.1| chromosome 3 open reading frame 21 [Pan troglodytes]
 gi|410294114|gb|JAA25657.1| chromosome 3 open reading frame 21 [Pan troglodytes]
 gi|410294116|gb|JAA25658.1| chromosome 3 open reading frame 21 [Pan troglodytes]
 gi|410294118|gb|JAA25659.1| chromosome 3 open reading frame 21 [Pan troglodytes]
 gi|410294120|gb|JAA25660.1| chromosome 3 open reading frame 21 [Pan troglodytes]
 gi|410335001|gb|JAA36447.1| chromosome 3 open reading frame 21 [Pan troglodytes]
          Length = 393

 Score = 65.1 bits (157), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 56/210 (26%), Positives = 89/210 (42%), Gaps = 28/210 (13%)

Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
           F + +++ +  S     +YS     L + + +++P+ + + I LD D+ F T+I +L+  
Sbjct: 182 FPVVEAMQKHFSAGSGTYYSDSIFFLSVAMHQIMPKEILQIIQLDLDLKFKTNIRELFEE 241

Query: 232 FSRLRQRQTIGLVENQSDWYLGKLWKNHKPWP---------ALGRGFNTGVILLDLTKLR 282
           F        IG+       Y    W+     P             GFN+GV+LL+L  +R
Sbjct: 242 FDSFLPGAIIGIAREMQPVYRHTFWQFRHENPQTRVGGPPPEGLPGFNSGVMLLNLEAMR 301

Query: 283 DISWAGFWRII--------AEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQL 334
                 + R++        A+K+        L DQD F  I  EHP L + L C WN QL
Sbjct: 302 QSPL--YSRLLEPAQVQQLADKY---HFRGHLGDQDFFTMIGMEHPKLFHVLDCTWNRQL 356

Query: 335 S----DNTRSD--ELCYTELTDLKIIHWNS 358
                D+  SD  E  +     +KI H N 
Sbjct: 357 CTWWRDHGYSDVFEAYFRCEGHVKIYHGNC 386


>gi|321472410|gb|EFX83380.1| hypothetical protein DAPPUDRAFT_210281 [Daphnia pulex]
          Length = 272

 Score = 65.1 bits (157), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 48/165 (29%), Positives = 73/165 (44%), Gaps = 26/165 (15%)

Query: 189 HYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQS 248
           +YS     + + LP ++P  + + I +D D  F   + +L   F     +Q +GL    S
Sbjct: 76  YYSQALFYVSVQLPDLIPSDVTRLIFVDVDTEFRDSVEKLNQQFEYFDDQQLLGLAPELS 135

Query: 249 DWYLGKLW--KNHKPWPALGR-------GFNTGVILLDLTKLRDISWAGFWRIIAEKFL- 298
             Y   L+  +N      LG        G+N+GV+L+ L K+R         +I +  L 
Sbjct: 136 PVYRHILYVYRNQHKTSLLGEPKGKGFPGYNSGVVLMKLDKMRQ-------SVIYQSLLT 188

Query: 299 ------LTRLWT---SLADQDIFNAIISEHPYLVYTLPCQWNVQL 334
                 LT  ++    L DQD +     E+P L YTLPC WN QL
Sbjct: 189 NSSLGNLTDKYSFKGHLGDQDFYTLFALEYPQLFYTLPCNWNRQL 233



 Score = 43.5 bits (101), Expect = 0.33,   Method: Compositional matrix adjust.
 Identities = 42/166 (25%), Positives = 67/166 (40%), Gaps = 25/166 (15%)

Query: 27  TRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVP---QVEVSFYLADSVVE 83
           TRSL+ L       +   LHFHL+TD  +  I + +    S      ++VS Y  D+ V+
Sbjct: 2   TRSLIKLCS-----KNIALHFHLVTDPSSWEIAKEIIHNESKKTQINIQVSAYFIDTFVK 56

Query: 84  DV-----------SWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLW 132
                        S     +YS     + + LP ++P  + + I +D D  F   + +L 
Sbjct: 57  QFVTTVQVLQQYFSVKQGTYYSQALFYVSVQLPDLIPSDVTRLIFVDVDTEFRDSVEKLN 116

Query: 133 ALFSRLRQRQTIGLVENQSDWYLGKL--WKNHKPWPAL----GRGF 172
             F     +Q +GL    S  Y   L  ++N      L    G+GF
Sbjct: 117 QQFEYFDDQQLLGLAPELSPVYRHILYVYRNQHKTSLLGEPKGKGF 162


>gi|326432410|gb|EGD77980.1| hypothetical protein PTSG_09613 [Salpingoeca sp. ATCC 50818]
          Length = 522

 Score = 65.1 bits (157), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 49/168 (29%), Positives = 79/168 (47%), Gaps = 32/168 (19%)

Query: 478 DVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRNIGYH 537
           DVTLV Q ++DR+ +VEML + W GP  +   + +A  +           LR+ ++  ++
Sbjct: 175 DVTLVTQGTLDRVHVVEMLCRAWRGPKVVVFAVREAGTEH--------PQLRTLQSTCHN 226

Query: 538 V------VYKEGNFY---------------PINTLRNVALNQVSTPYVFLLDIDFLPMFG 576
                  + +  +FY               PINTLRN+A++   T ++F  D+DFLP   
Sbjct: 227 AKIVAVPITESSDFYTSRFRASDPAAPPVIPINTLRNLAVDLAPTNFIFTCDMDFLPSAS 286

Query: 577 LYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPASHAPTNFSR 624
           LY  L +  R + +     +  LVVP +E  R   + P    PT F +
Sbjct: 287 LYDSLVT--RYLPLLAAIDRPALVVPHWELLRCGGSAPPP-VPTTFQQ 331


>gi|219115563|ref|XP_002178577.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217410312|gb|EEC50242.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 349

 Score = 65.1 bits (157), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 64/246 (26%), Positives = 96/246 (39%), Gaps = 71/246 (28%)

Query: 478 DVTLVAQLSMDRLQMVEMLFKHW-EGPISLTLYMSD--AEAQQFLSYAGNSEALRSRRNI 534
           D TLV QLS +RL M+    + W + PISL +  +      Q+ LS  G    + +   I
Sbjct: 42  DFTLVTQLSFNRLSMMRQHCERWGDHPISLAIGTTKNLETVQKALSQLGCKNEMITVSLI 101

Query: 535 GYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHG 594
                 +E   YP+N LRN+A++ + T +  ++D DF+   GL+  L         H H 
Sbjct: 102 SDFNSEEE---YPVNRLRNLAMSHIKTSHAVIIDADFVLSVGLFEIL---------HMHR 149

Query: 595 GK------KVLVVPAFETQR---------------------------YRTAFPA------ 615
                     LV+PAFE ++                           YR A         
Sbjct: 150 ATLAADYLNALVIPAFELRKVCEEQTLNCTALHIAMLPHNKDELLKLYRGATEVYSNSSS 209

Query: 616 -----------SHAPTNFSRWVNATT----PYQIEWAPDFEPYIVAH--RDLPRYDTRFV 658
                       HA T ++ W+        P +   +  +EPY+V    R LP +   FV
Sbjct: 210 VTQFNGRGNFHGHASTRYTDWITQPAEQLLPIECVTSDRYEPYLVVRHCRALPPFQKVFV 269

Query: 659 GFGWNK 664
           G+G NK
Sbjct: 270 GYGQNK 275


>gi|194754365|ref|XP_001959466.1| GF12890 [Drosophila ananassae]
 gi|190620764|gb|EDV36288.1| GF12890 [Drosophila ananassae]
          Length = 457

 Score = 65.1 bits (157), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 52/202 (25%), Positives = 84/202 (41%), Gaps = 46/202 (22%)

Query: 174 LADSVVEDV--------SWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           +  S+++D+        S  PN +YS     L L L ++   +L + I+LD D++F +D+
Sbjct: 227 VCSSIIQDIAARLSPYFSSTPNSYYSDSLFFLSLGLHRIADRSLNRAILLDCDIVFRSDV 286

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWP--ALGR---------------- 267
             L+  F      Q  GL    +  Y   L++    +P  + G                 
Sbjct: 287 RLLFNEFDNFLPHQLYGLAPELTPVYRHILYRYRVRYPKTSFGNPYYPMNNEAGNQHSHI 346

Query: 268 -----GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWT---------SLADQDIFN 313
                G N+GV+LL L ++R+       +   EK   T + +          L DQD F 
Sbjct: 347 HHGYPGLNSGVVLLLLNRIRN------SKAYLEKLTHTEVHSLVAKYSFKGHLGDQDFFT 400

Query: 314 AIISEHPYLVYTLPCQWNVQLS 335
            +  E+P L+Y L C WN QL 
Sbjct: 401 LLGYEYPNLIYRLDCIWNRQLC 422



 Score = 48.5 bits (114), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 38/150 (25%), Positives = 65/150 (43%), Gaps = 14/150 (9%)

Query: 36  SILFYRKNPLHFHLITDTVA----LNILQTLFSTW--SVPQVEVSFYLADSVVEDV---- 85
           S+L +    LH H+ITD  +    + ILQ     +  +V        +  S+++D+    
Sbjct: 180 SLLKHTSAQLHLHIITDNESQVSVMEILQRQVRRFRRTVIYTIYDVKVCSSIIQDIAARL 239

Query: 86  ----SWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQR 141
               S  PN +YS     L L L ++   +L + I+LD D++F +D+  L+  F      
Sbjct: 240 SPYFSSTPNSYYSDSLFFLSLGLHRIADRSLNRAILLDCDIVFRSDVRLLFNEFDNFLPH 299

Query: 142 QTIGLVENQSDWYLGKLWKNHKPWPALGRG 171
           Q  GL    +  Y   L++    +P    G
Sbjct: 300 QLYGLAPELTPVYRHILYRYRVRYPKTSFG 329


>gi|74002978|ref|XP_852253.1| PREDICTED: uncharacterized protein C3orf21 [Canis lupus familiaris]
          Length = 500

 Score = 65.1 bits (157), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 58/208 (27%), Positives = 89/208 (42%), Gaps = 24/208 (11%)

Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
           F + +++ +  S     +YS     L + + +++P+ + + I LD D+ + T+I +L+  
Sbjct: 289 FPVVEAMQKHFSAGSGTYYSDSIFFLSVAMHQIMPKEILRIIQLDLDLKYKTNIRELFEE 348

Query: 232 FSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPA---------LGRGFNTGVILLDLTKLR 282
           F        IG+       Y    W+     P             GFN+GV+LLDL  +R
Sbjct: 349 FDSFLPGAVIGIAREMQPVYRHTFWQFRHENPGTRVGGPPPEGLPGFNSGVMLLDLEAMR 408

Query: 283 DISWAGFWRIIAEKFLLTRLWTS------LADQDIFNAIISEHPYLVYTLPCQWNVQLS- 335
                G  R++ E   + RL         L DQD F  I  EHP L + L C WN QL  
Sbjct: 409 RSLLYG--RLL-EPAQVQRLADKYHFRGHLGDQDFFTMIGMEHPELFHVLDCTWNRQLCT 465

Query: 336 ---DNTRSD--ELCYTELTDLKIIHWNS 358
              D+  SD  E  +     +KI H N 
Sbjct: 466 WWRDHGYSDVFEAYFRCEGHVKIYHGNC 493


>gi|85701786|ref|NP_001028447.1| glucoside xylosyltransferase 1 [Mus musculus]
 gi|74184510|dbj|BAE27879.1| unnamed protein product [Mus musculus]
          Length = 372

 Score = 65.1 bits (157), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 44/173 (25%), Positives = 77/173 (44%), Gaps = 19/173 (10%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L E +   + +DTD++F   +  +W+L  +    Q   +     +  +G  W 
Sbjct: 164 RLFLPLILKE-VDSLLYVDTDILFLRPVDDIWSLLKKFNSTQIAAMAPEHEEPRIG--WY 220

Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
           N     P  GR G N+GV+L+++T++R            + W      + +K+ L   W 
Sbjct: 221 NRFARHPYYGRTGVNSGVMLMNMTRMRRKYFKNDMTTARLQWGDILMPLLKKYKLNITW- 279

Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
              DQD+ N + S +P  ++  PCQWN +             E   + I+H N
Sbjct: 280 --GDQDLLNIVFSHNPESLFVFPCQWNYRPDHCIYGSNCREAEEEGVFILHGN 330



 Score = 42.0 bits (97), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 36/154 (23%), Positives = 66/154 (42%), Gaps = 15/154 (9%)

Query: 13  EVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWS-VPQV 71
           E + +A+V  G       VT++KS L +   PLH H+  +    +  +   ++WS + + 
Sbjct: 78  EKMHLAVVACG-ERLEETVTMLKSALIFSIKPLHVHIFAEDQLHDSFKDRLASWSFLRRF 136

Query: 72  EVSFY----LADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATD 127
           + S Y      DS  +   W   K         +L LP +L E +   + +DTD++F   
Sbjct: 137 DYSLYPITFPGDSAAD---W---KKLFKPCASQRLFLPLILKE-VDSLLYVDTDILFLRP 189

Query: 128 IAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKN 161
           +  +W+L  +    Q   +     +  +G  W N
Sbjct: 190 VDDIWSLLKKFNSTQIAAMAPEHEEPRIG--WYN 221


>gi|449668189|ref|XP_002163320.2| PREDICTED: xyloside xylosyltransferase 1-like [Hydra
           magnipapillata]
          Length = 342

 Score = 65.1 bits (157), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 53/177 (29%), Positives = 79/177 (44%), Gaps = 21/177 (11%)

Query: 178 VVEDVSWIP-NKHYSGVYGLLKLTLPKVLP-ETLAKTIVLDTDVIFATDIAQLWALFSRL 235
           V++D+   P   +YS     + L L ++   + +   I++D D+   TDI  L+ +F ++
Sbjct: 134 VMQDLFSSPAGSYYSKTLFFIPLALHRINSLKNVTHLIMIDVDIELRTDIRYLYNIFLQM 193

Query: 236 RQRQTIGLVENQSDWY---LGKLWKNHK------PWPALGRGFNTGVILLDLTKLRDISW 286
              Q  GL   Q   Y        K HK      P P    GFN+GV L++LT LR    
Sbjct: 194 GPEQLYGLAREQQPTYRHITSAYRKRHKKTAVGNPPPFGLTGFNSGVKLINLTSLRQSKL 253

Query: 287 AGFW-------RIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSD 336
              +       + +A+KF        L DQD F  +   H  + Y LPCQWN QL +
Sbjct: 254 YNSYLDDPVKLKHLADKF---SFRGHLGDQDFFTLLSFLHKDMFYELPCQWNRQLCE 307


>gi|241618451|ref|XP_002406997.1| glycosyltransferase domain-containing protein, putative [Ixodes
           scapularis]
 gi|215500924|gb|EEC10418.1| glycosyltransferase domain-containing protein, putative [Ixodes
           scapularis]
          Length = 230

 Score = 65.1 bits (157), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 40/143 (27%), Positives = 67/143 (46%), Gaps = 12/143 (8%)

Query: 213 IVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSD----WYLGKLWKNHKPWPALGRG 268
           + +D D +F   + + W +F ++ +   + L     D    WY  +  K+  P P    G
Sbjct: 9   LYVDADTLFLNPVEEHWNVFEKMNESHLMALGSEGEDFAHNWYHRRA-KHPYPPPF---G 64

Query: 269 FNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPC 328
            N GV+ ++LT++R   W      + E++    LW    DQD+ N + S HP  ++ L C
Sbjct: 65  ANAGVMAMNLTRMRSFDWVSRLAPLREEYAGRLLW---GDQDLINILFSFHPDKLFMLTC 121

Query: 329 QWNVQLSDNTRSDELCYTELTDL 351
           +WN +  DN R  + C  E   L
Sbjct: 122 RWNYR-QDNCRFHDSCSGETPAL 143


>gi|194885984|ref|XP_001976525.1| GG22921 [Drosophila erecta]
 gi|190659712|gb|EDV56925.1| GG22921 [Drosophila erecta]
          Length = 458

 Score = 65.1 bits (157), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 53/202 (26%), Positives = 83/202 (41%), Gaps = 46/202 (22%)

Query: 174 LADSVVEDV--------SWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           +  S+++D+        S  PN +YS     L L L ++    L + I+LD D++F +D+
Sbjct: 228 VCSSIIQDIAAKLSPYFSSTPNSYYSDSLFFLSLGLHRIADRNLNRAILLDCDIVFRSDV 287

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWP--ALGR---------------- 267
             L+  F      Q  GL    +  Y   L++    +P  + G                 
Sbjct: 288 RLLFNEFDNFLPHQLYGLAPELTPVYRHILYRYRVRYPKTSFGNPYYPINNEGGNQHSRV 347

Query: 268 -----GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWT---------SLADQDIFN 313
                G N+GV+LL L ++R+       +   EK   T + T          L DQD F 
Sbjct: 348 HHGYPGLNSGVVLLLLNRIRN------SKSYLEKLTHTEVHTLVAKYSFKGHLGDQDFFT 401

Query: 314 AIISEHPYLVYTLPCQWNVQLS 335
            +  E+P L+Y L C WN QL 
Sbjct: 402 LLGYEYPNLIYRLDCIWNRQLC 423



 Score = 48.9 bits (115), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 39/150 (26%), Positives = 65/150 (43%), Gaps = 14/150 (9%)

Query: 36  SILFYRKNPLHFHLITDTVA----LNILQTLFSTW--SVPQVEVSFYLADSVVEDV---- 85
           S+L +    LH H+ITD+ +    L ILQ     +  +V        +  S+++D+    
Sbjct: 181 SLLKHTSAQLHLHVITDSESQPSVLEILQRQIRRFRRTVIYTIYDVKVCSSIIQDIAAKL 240

Query: 86  ----SWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQR 141
               S  PN +YS     L L L ++    L + I+LD D++F +D+  L+  F      
Sbjct: 241 SPYFSSTPNSYYSDSLFFLSLGLHRIADRNLNRAILLDCDIVFRSDVRLLFNEFDNFLPH 300

Query: 142 QTIGLVENQSDWYLGKLWKNHKPWPALGRG 171
           Q  GL    +  Y   L++    +P    G
Sbjct: 301 QLYGLAPELTPVYRHILYRYRVRYPKTSFG 330


>gi|24762499|ref|NP_611871.2| CG11388 [Drosophila melanogaster]
 gi|7291716|gb|AAF47138.1| CG11388 [Drosophila melanogaster]
 gi|372266220|gb|AEX91658.1| FI19481p1 [Drosophila melanogaster]
          Length = 458

 Score = 64.7 bits (156), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 51/196 (26%), Positives = 83/196 (42%), Gaps = 34/196 (17%)

Query: 174 LADSVVEDV--------SWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           +  S+++D+        S  PN +YS     L L L ++   +L + I+LD D++F +D+
Sbjct: 228 VCSSIIQDIAAKLSPYFSSTPNSYYSDSLFFLSLGLHRIADRSLNRAILLDCDIVFRSDV 287

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWP--ALGR---------------- 267
             L+  F      Q  GL    +  Y   L++    +P  + G                 
Sbjct: 288 RLLFNEFDNFLPHQLYGLAPELTPVYRHILYRYRVRYPKTSFGNPYYPINNEGGNQHSRV 347

Query: 268 -----GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWT---SLADQDIFNAIISEH 319
                G N+GV+LL L ++R+          +E   L   ++    L DQD F  +  E+
Sbjct: 348 HHGYPGLNSGVVLLLLNRIRNSKSYLEKLTHSEVHTLVAKYSFKGHLGDQDFFTLLGYEY 407

Query: 320 PYLVYTLPCQWNVQLS 335
           P L+Y L C WN QL 
Sbjct: 408 PNLIYRLDCIWNRQLC 423



 Score = 49.7 bits (117), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 39/150 (26%), Positives = 66/150 (44%), Gaps = 14/150 (9%)

Query: 36  SILFYRKNPLHFHLITDTVA----LNILQTLFSTW--SVPQVEVSFYLADSVVEDV---- 85
           S+L +    LH H+ITD+ +    L ILQ     +  +V        +  S+++D+    
Sbjct: 181 SLLKHTSAQLHLHVITDSESQPSVLEILQRQIRRFRRTVIYTIYDVKVCSSIIQDIAAKL 240

Query: 86  ----SWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQR 141
               S  PN +YS     L L L ++   +L + I+LD D++F +D+  L+  F      
Sbjct: 241 SPYFSSTPNSYYSDSLFFLSLGLHRIADRSLNRAILLDCDIVFRSDVRLLFNEFDNFLPH 300

Query: 142 QTIGLVENQSDWYLGKLWKNHKPWPALGRG 171
           Q  GL    +  Y   L++    +P    G
Sbjct: 301 QLYGLAPELTPVYRHILYRYRVRYPKTSFG 330


>gi|17512112|gb|AAH19036.1| Chromosome 3 open reading frame 21 [Homo sapiens]
          Length = 380

 Score = 64.7 bits (156), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 56/210 (26%), Positives = 89/210 (42%), Gaps = 28/210 (13%)

Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
           F + +++ +  S     +YS     L + + +++P+ + + I LD D+ F T+I +L+  
Sbjct: 169 FPIVEAMQKHFSAGLGTYYSDSIFFLSVAMHQIMPKEILQIIQLDLDLKFKTNIRELFEE 228

Query: 232 FSRLRQRQTIGLVENQSDWYLGKLWKNHKPWP---------ALGRGFNTGVILLDLTKLR 282
           F        IG+       Y    W+     P             GFN+GV+LL+L  +R
Sbjct: 229 FDSFLPGAIIGIAREMQPVYRHTFWQFRHENPQTRVGGPPPEGLPGFNSGVMLLNLEAMR 288

Query: 283 DISWAGFWRII--------AEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQL 334
                 + R++        A+K+        L DQD F  I  EHP L + L C WN QL
Sbjct: 289 QSPL--YSRLLEPAQVQQLADKY---HFRGHLGDQDFFTMIGMEHPKLFHVLDCTWNRQL 343

Query: 335 S----DNTRSD--ELCYTELTDLKIIHWNS 358
                D+  SD  E  +     +KI H N 
Sbjct: 344 CTWWRDHGYSDVFEAYFRCEGHVKIYHGNC 373


>gi|194886138|ref|XP_001976557.1| GG22943 [Drosophila erecta]
 gi|190659744|gb|EDV56957.1| GG22943 [Drosophila erecta]
          Length = 389

 Score = 64.7 bits (156), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 65/277 (23%), Positives = 104/277 (37%), Gaps = 85/277 (30%)

Query: 468 DEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEA----QQFLSYAG 523
           ++Y   +DG+ V L  Q S++RL  +  +  +W+G +S+ L+ +  E     Q F++Y  
Sbjct: 41  EQYADLADGSLVCLATQTSVERLNSLPQVAANWQGKMSVALFAAGPEEFVVLQYFVTYMR 100

Query: 524 NSEALRSRRNIGYHVV-------------------------------------------- 539
              A   R N  +H++                                            
Sbjct: 101 LCFA-NIRENATFHLLTPRDFDKLPRMAALPPNMKGKFDCQYPDRTLKALLKFRSLKTLQ 159

Query: 540 YKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVL 599
           +++ N YP N +RN+A     T YVFL DID +P     P L    R+ +          
Sbjct: 160 WRQRNTYPQNHMRNLARKGCQTKYVFLTDIDIVPSTNSVPQLNHFFRTANCSKSCA---Y 216

Query: 600 VVPAFETQRYRTAFPAS------------------------HAPTNFSRWV----NATTP 631
           V+P FE    R  FP S                           TNFS+W+    N T  
Sbjct: 217 VIPTFEID-VRATFPRSKNALVRLIRKGLARPFHEKVFIYNQYATNFSKWLSPNSNETEV 275

Query: 632 YQIEWAPDF----EPYIVAHRDLPRYDTRFVGFGWNK 664
                  +F    EP+ +A  + P +D RF+G+G+ +
Sbjct: 276 SVSHIVTNFEFLYEPFYIAVDNAPAHDERFLGYGFTR 312


>gi|353238557|emb|CCA70499.1| hypothetical protein PIIN_04436 [Piriformospora indica DSM 11827]
          Length = 443

 Score = 64.7 bits (156), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 64/240 (26%), Positives = 101/240 (42%), Gaps = 57/240 (23%)

Query: 473 SSDGN----DVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQ----------- 517
           S DG     D+TL+  ++ +R  ++  L K ++GPIS+ L+++++               
Sbjct: 111 SMDGTLAKEDITLLTIVTSNRFSVLGRLAKRYQGPISVALHVTESPRAHRDRILDELHEL 170

Query: 518 FLSYAGNSEALRSRRNIGYHVVYKEGNF-YPINTLRNVALNQVSTPYVFLLDIDFLPMFG 576
           +LS  G S      + +  H++    NF    N  RN A     T YV +LD+DF    G
Sbjct: 171 YLSTPGMS------KWVDVHLILD--NFDRQFNMWRNAARLFARTNYVMMLDVDFAVCTG 222

Query: 577 LYPYLKSSIRSMDMHGHGGKKVLVVPAFE----------------------------TQR 608
               L +S   M M    G    VVPAFE                             Q 
Sbjct: 223 FRQRLLASKEMMGML-RAGDAAFVVPAFEYVVAEDGGNEASFPRTKERLRRLHDAGKIQV 281

Query: 609 YRTAFPASHAPTNFSRWVNATTP--YQIE-WAPDFEPYIVAHRD-LPRYDTRFVGFGWNK 664
           +  A+   H  +++SR+  A     Y++  +   +EPY++  RD  P  D RF+G+G NK
Sbjct: 282 FHQAWEGGHNSSDYSRYFQAAPGDVYKVHAYQHAYEPYVIFKRDSAPWCDERFIGYGGNK 341


>gi|112181295|ref|NP_689744.3| xyloside xylosyltransferase 1 [Homo sapiens]
 gi|74751171|sp|Q8NBI6.1|XXLT1_HUMAN RecName: Full=Xyloside xylosyltransferase 1; AltName:
           Full=UDP-xylose:alpha-xyloside
           alpha-1,3-xylosyltransferase
 gi|22761774|dbj|BAC11694.1| unnamed protein product [Homo sapiens]
 gi|112180452|gb|AAH39067.2| C3orf21 protein [Homo sapiens]
 gi|119598434|gb|EAW78028.1| chromosome 3 open reading frame 21, isoform CRA_a [Homo sapiens]
 gi|119598435|gb|EAW78029.1| chromosome 3 open reading frame 21, isoform CRA_a [Homo sapiens]
          Length = 393

 Score = 64.3 bits (155), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 56/210 (26%), Positives = 89/210 (42%), Gaps = 28/210 (13%)

Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
           F + +++ +  S     +YS     L + + +++P+ + + I LD D+ F T+I +L+  
Sbjct: 182 FPIVEAMQKHFSAGLGTYYSDSIFFLSVAMHQIMPKEILQIIQLDLDLKFKTNIRELFEE 241

Query: 232 FSRLRQRQTIGLVENQSDWYLGKLWKNHKPWP---------ALGRGFNTGVILLDLTKLR 282
           F        IG+       Y    W+     P             GFN+GV+LL+L  +R
Sbjct: 242 FDSFLPGAIIGIAREMQPVYRHTFWQFRHENPQTRVGGPPPEGLPGFNSGVMLLNLEAMR 301

Query: 283 DISWAGFWRII--------AEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQL 334
                 + R++        A+K+        L DQD F  I  EHP L + L C WN QL
Sbjct: 302 QSPL--YSRLLEPAQVQQLADKY---HFRGHLGDQDFFTMIGMEHPKLFHVLDCTWNRQL 356

Query: 335 S----DNTRSD--ELCYTELTDLKIIHWNS 358
                D+  SD  E  +     +KI H N 
Sbjct: 357 CTWWRDHGYSDVFEAYFRCEGHVKIYHGNC 386


>gi|302834571|ref|XP_002948848.1| hypothetical protein VOLCADRAFT_89141 [Volvox carteri f.
           nagariensis]
 gi|300266039|gb|EFJ50228.1| hypothetical protein VOLCADRAFT_89141 [Volvox carteri f.
           nagariensis]
          Length = 547

 Score = 64.3 bits (155), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 48/175 (27%), Positives = 73/175 (41%), Gaps = 47/175 (26%)

Query: 536 YHVVYKE--GNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYL---KSSIRSMDM 590
           Y +V  E      PIN LRN A+    TP V ++D+D  P   L  +L   ++    +  
Sbjct: 239 YELVADEVMAALMPINALRNAAILAADTPLVAMVDVDLSPSLSLSQHLLSNRTRAAELQR 298

Query: 591 HGHGGKKVLVVPAFETQRYRTAFP------------------------------------ 614
                + V V+PAF+TQR R + P                                    
Sbjct: 299 RAEAERTVWVIPAFDTQR-RLSLPGREAVADLAVAVPHSAKGTRLVALWKKAEKIYPFAY 357

Query: 615 ----ASHAPTNFSRWVNATTPYQIEWAPDFEPYIVAHR-DLPRYDTRFVGFGWNK 664
               A H+ T++++W  +   YQ+ + P +EP+ +  R  LP YD RF G+ WNK
Sbjct: 358 DVYRAGHSATDYNKWFRSKLDYQVSYRPGYEPWFIGARMALPAYDARFRGYSWNK 412


>gi|328778044|ref|XP_393493.3| PREDICTED: n-acetyllactosaminide
           beta-1,3-N-acetylglucosaminyltransferase-like [Apis
           mellifera]
          Length = 495

 Score = 64.3 bits (155), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 64/272 (23%), Positives = 98/272 (36%), Gaps = 91/272 (33%)

Query: 479 VTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEA----QQFLSYAGNS-EALRSRRN 533
           V L  Q S+++L  +     HW GP+S+ LY++  E     Q++L Y     E +R R  
Sbjct: 158 VCLATQSSVEKLHSIVQAAHHWTGPMSVALYVAGDEEFEVLQKYLIYLRKCYEPIRERVA 217

Query: 534 IGYHV---------------------VYKEG-------------------NFYPINTLRN 553
               V                        EG                   N YP N +RN
Sbjct: 218 FSLAVPRVRPVKRQPREFDPPEMVDCAKPEGTLNELMNGVSNEQANWRIRNAYPQNHMRN 277

Query: 554 VALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRYRTAF 613
           +A     + YVFL D+D +P F L   L   +R+        K   V+P +E    R  F
Sbjct: 278 LARKNCQSDYVFLTDVDIVPSFNLTGALDEFLRA----DTCDKCAYVIPTYELDS-RVRF 332

Query: 614 PASHA------------------------PTNFSRWVNATTPYQIEWAPD---------- 639
           P +                           TNF+RW+  T+P   ++             
Sbjct: 333 PQNKTELVRLARKGLARPFHQKVFIHNQFATNFTRWILDTSPNHKKYGNMKSGKVYVSHD 392

Query: 640 -------FEPYIVAHRDLPRYDTRFVGFGWNK 664
                  +EP+ VA   +P +D RF+G+G+ +
Sbjct: 393 VTNFEFLYEPFYVAKDIVPAHDERFMGYGYTR 424


>gi|242012129|ref|XP_002426793.1| N-acetyllactosaminide beta-1,3-N-acetylglucosaminyltransferase,
           putative [Pediculus humanus corporis]
 gi|212510975|gb|EEB14055.1| N-acetyllactosaminide beta-1,3-N-acetylglucosaminyltransferase,
           putative [Pediculus humanus corporis]
          Length = 451

 Score = 64.3 bits (155), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 61/278 (21%), Positives = 107/278 (38%), Gaps = 83/278 (29%)

Query: 468 DEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSY------ 521
           +EY + S    V +  Q S+++L  +  + +HW GPIS+ +Y +  E    L +      
Sbjct: 108 EEYGSLSKKYMVCMATQSSIEKLHSIAQVSEHWTGPISVAIYAAGDEEYYLLHWYVHFLK 167

Query: 522 ---------------------------------AGNSEALRSRRNIGYHVV--------- 539
                                              N+   R  +N+   ++         
Sbjct: 168 NCYPWISKRVSFHLGVPKLRIPTGKYPENLKNLKINNYKCRDYKNVLKEIIKLRPVEIVK 227

Query: 540 YKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVL 599
           ++  N YP N +RN+A     + YVFL D+D +P F L   L   +++        K   
Sbjct: 228 WRMKNPYPQNHMRNLARKNCQSDYVFLTDVDIVPSFKLADNLNKFLKTAKKTCK-NKCAY 286

Query: 600 VVPAFETQRYRTAFPASHA------------------------PTNFSRWVN--ATTPYQ 633
           V+P +E    RT FP + +                         TNFS+W +   ++  +
Sbjct: 287 VIPTYELDS-RTDFPKNKSELLRLSQKGLARAFHQKVFIYNQFATNFSKWESFVDSSEDE 345

Query: 634 IEWAPD-------FEPYIVAHRDLPRYDTRFVGFGWNK 664
           I  + D       +EP+ VA   +P +D RF+G+G+ +
Sbjct: 346 IIISHDVTNFEFLYEPFYVAPDTVPEHDERFLGYGYTR 383


>gi|384248607|gb|EIE22091.1| hypothetical protein COCSUDRAFT_83482 [Coccomyxa subellipsoidea
           C-169]
          Length = 330

 Score = 64.3 bits (155), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 64/244 (26%), Positives = 101/244 (41%), Gaps = 24/244 (9%)

Query: 146 LVENQSDWYLGKLW--KNHKPWPALGRGFYLAD--SVVEDVSWI--PNKHYSGVYGLLKL 199
           LV   +D  + + W  +   PW  +   F   D  +V+E ++ I  P K Y G  GL ++
Sbjct: 63  LVAADADAEIVRSWLQRLRSPWQPVTYSFVTIDDGAVLERLAAIHGPLKGYMGYCGLARM 122

Query: 200 TLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNH 259
              + LP    + + LDTD +   D+ QLW  F      Q IG   + +   L      H
Sbjct: 123 FFTEFLPHE-ERVLWLDTDTVVNGDVCQLWPHFDCFNASQIIGFPGDPTIPNL------H 175

Query: 260 KPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSL---ADQDIFNAII 316
            P+     G  + V L+DLT++R   W G +   A +  L    + +    DQD+   + 
Sbjct: 176 HPF-----GLCSCVALMDLTRMRAAGWGGSFMAAALEAHLKVNPSEVFPYGDQDVLRIVQ 230

Query: 317 SEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTD--LKIIHWNSPKKLKVKNKHMEFFRN 374
             +P   Y LP  WNV +      D   +    D  + IIH+    K  + +    F R 
Sbjct: 231 RHYPDSAYRLPSGWNV-MPFCPAYDAAMHETAVDAFVGIIHFTCTVKQHIYSSMASFGRW 289

Query: 375 LYLT 378
            +L 
Sbjct: 290 AWLV 293



 Score = 48.1 bits (113), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 38/122 (31%), Positives = 57/122 (46%), Gaps = 12/122 (9%)

Query: 34  IKSILFYRKNPLHFHLITDT-----VALNILQTLFSTWSVPQVEVSFYLAD--SVVEDVS 86
           +KS+L   + PLH +++        +  + LQ L S W    V  SF   D  +V+E ++
Sbjct: 47  LKSLLRNARCPLHLYILVAADADAEIVRSWLQRLRSPWQ--PVTYSFVTIDDGAVLERLA 104

Query: 87  WI--PNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTI 144
            I  P K Y G  GL ++   + LP    + + LDTD +   D+ QLW  F      Q I
Sbjct: 105 AIHGPLKGYMGYCGLARMFFTEFLPHE-ERVLWLDTDTVVNGDVCQLWPHFDCFNASQII 163

Query: 145 GL 146
           G 
Sbjct: 164 GF 165


>gi|156545014|ref|XP_001600015.1| PREDICTED: glucoside xylosyltransferase 1-like isoform 1 [Nasonia
           vitripennis]
 gi|345479641|ref|XP_003423999.1| PREDICTED: glucoside xylosyltransferase 1-like isoform 2 [Nasonia
           vitripennis]
          Length = 385

 Score = 64.3 bits (155), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 43/136 (31%), Positives = 67/136 (49%), Gaps = 8/136 (5%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP VL +T A  + +DTD +F      +W  F ++   Q   L     D   G  W 
Sbjct: 157 RLFLPTVLNDTDA-ILYVDTDTLFLAPPEVIWDEFKKMNNVQLAALSPEHEDPNTG--WY 213

Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAI 315
           N     P  G+ G N+GV+L++LT++R+  W  +   I +++ L   W    DQDI N I
Sbjct: 214 NRFAKHPYYGKLGVNSGVMLMNLTRMREFKWTQYVIPIHKEYRLKITW---GDQDIINII 270

Query: 316 ISEHPYLVYTLPCQWN 331
              HP  ++   C++N
Sbjct: 271 FHYHPEKLHVYDCRYN 286



 Score = 41.6 bits (96), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 35/148 (23%), Positives = 61/148 (41%), Gaps = 7/148 (4%)

Query: 13  EVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWS-VPQV 71
           +V+ + +V  G +     +T++KS L + K PL F +I+D   +       + W  +   
Sbjct: 67  QVVVICVVACGADRLDESLTMLKSALVFAKRPLRFIVISDYGLIPAFHEKLTEWKQIVDK 126

Query: 72  EVSFYLADSVVEDVSWIPN-KHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQ 130
              F +      D+S +   +         +L LP VL +T A  + +DTD +F      
Sbjct: 127 TFDFIVRPVTFPDISDVTMWRKLFKPCAAQRLFLPTVLNDTDA-ILYVDTDTLFLAPPEV 185

Query: 131 LWALFSRLRQRQTIGLVENQSD----WY 154
           +W  F ++   Q   L     D    WY
Sbjct: 186 IWDEFKKMNNVQLAALSPEHEDPNTGWY 213


>gi|313233727|emb|CBY09897.1| unnamed protein product [Oikopleura dioica]
          Length = 505

 Score = 64.3 bits (155), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 69/290 (23%), Positives = 113/290 (38%), Gaps = 95/290 (32%)

Query: 457 LQMVEINPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQ 516
           ++M  I+P    EY A+    DVTLV   S + L  +  L + W GPIS+ ++    EA 
Sbjct: 129 IRMFFISP----EYFANWKFEDVTLVTMCSANHLHHIISLAERWSGPISVGVFGPGLEA- 183

Query: 517 QFLSYAGNS-EALRS-----RRNIGYHVVY------------------------------ 540
              S+A ++ E++++     R  + +H+VY                              
Sbjct: 184 ---SFATDAIESIKNCFPAVREQVLFHLVYPISLPADLSKSSGKLRFENCDDLRKQIKSY 240

Query: 541 -----KEGNF-----YPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKS-SIRSMD 589
                   N+     +P N LRN A     T Y+FL+D+D +P   L       +IR   
Sbjct: 241 GGDEDDRANYGGDIPFPHNVLRNAARRGAPTAYIFLIDVDVMPSLNLREDFNDFAIR--- 297

Query: 590 MHG-----HGGKKVLVVPAFETQRYRTAFPAS------------------------HAPT 620
            HG          V VVP FE ++  T  P +                        H P 
Sbjct: 298 -HGLYDKNENAMTVFVVPVFEKKKGSTC-PDNKRELAESCDKNEVRSFHEETCKWCHEPE 355

Query: 621 NFSRWVNATTPYQIE------WAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
              +W+      ++E      W   +EP+ +A R +P +D RF+ +G+++
Sbjct: 356 QPKKWLQIQGQSRLEISFNSSWTKSWEPFYIASRQVPYFDERFMVYGFDR 405


>gi|149616439|ref|XP_001516223.1| PREDICTED: N-acetyllactosaminide
           beta-1,3-N-acetylglucosaminyltransferase-like
           [Ornithorhynchus anatinus]
          Length = 412

 Score = 64.3 bits (155), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 63/267 (23%), Positives = 101/267 (37%), Gaps = 79/267 (29%)

Query: 472 ASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQF---LSYAGNSEAL 528
           A     D+ L    S+  L  +E L + W GP+++ ++ +  +  Q    L+YA  +   
Sbjct: 85  AEGVAGDLVLATHASLGNLPHLEGLVERWGGPVAVAIFAAGGKEAQLATVLAYALGTLCG 144

Query: 529 RSRRNIGYHVVYKEGN------------------------------------------FY 546
             R  +  H+V   G+                                           Y
Sbjct: 145 AVRARVTLHLVCPAGHEASVPEPREPGEFARLRSCPEVFAKLAAWAGAGPNYALGANVSY 204

Query: 547 PINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFET 606
           P N LRN+A  +    +  ++D+D +P  GL+  L   +             LVVPAFE 
Sbjct: 205 PNNLLRNLAREEAG--HALVVDVDMMPSEGLWAGLLELLALGAGRA---GTALVVPAFEI 259

Query: 607 QRYRT------------------AF-----PASHAPTNFSRWVNATT------PYQIEWA 637
           +R R                   AF     P   APTN+SRW+N  +       Y++ W 
Sbjct: 260 RRARRVPGTKAELLRLYQVGEVRAFYAELCPRCQAPTNYSRWLNLPSGGSLRPAYRVGWR 319

Query: 638 PDFEPYIVAHRDLPRYDTRFVGFGWNK 664
             +EP+ VA   +P +D RF  +G+N+
Sbjct: 320 DPWEPFYVAGGAVPTFDERFRQYGFNR 346


>gi|296487730|tpg|DAA29843.1| TPA: glucoside xylosyltransferase 1 isoform 1 [Bos taurus]
          Length = 409

 Score = 64.3 bits (155), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 44/173 (25%), Positives = 76/173 (43%), Gaps = 19/173 (10%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L E +   + +DTD++F   +  +WAL  +    Q   +     +  +G  W 
Sbjct: 169 RLFLPLILKE-VDSLLYVDTDILFLRPVDDIWALLKKFNSTQIAAMAPEHEEPRIG--WY 225

Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
           N     P  G+ G N+GV+L+++T++R            + W      + +K+ L   W 
Sbjct: 226 NRFARHPYYGKTGVNSGVMLMNMTRMRRKYFKNDMTTVRLRWGDILMPLLKKYKLNITW- 284

Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
              DQD+ N I   +P  ++  PCQWN +             E   + I+H N
Sbjct: 285 --GDQDLLNIIFFHNPESLFVFPCQWNYRPDHCIYGSNCQEAEEEGIFILHGN 335



 Score = 45.4 bits (106), Expect = 0.087,   Method: Compositional matrix adjust.
 Identities = 37/151 (24%), Positives = 64/151 (42%), Gaps = 9/151 (5%)

Query: 13  EVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQV- 71
           E + +A+V  G       VT++KS L +   PL FH+  +    +  + +  +WS  Q  
Sbjct: 83  ERVHLAVVACG-ERLEETVTMLKSALIFSIKPLQFHIFAEDQLHDSFKGILDSWSFLQTF 141

Query: 72  EVSFYLADSVVEDVS-WIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQ 130
             S Y      E+ + W   K         +L LP +L E +   + +DTD++F   +  
Sbjct: 142 NYSLYPITFPSENAAEW---KKLFKPCASQRLFLPLILKE-VDSLLYVDTDILFLRPVDD 197

Query: 131 LWALFSRLRQRQTIGLVENQSDWYLGKLWKN 161
           +WAL  +    Q   +     +  +G  W N
Sbjct: 198 IWALLKKFNSTQIAAMAPEHEEPRIG--WYN 226


>gi|296224892|ref|XP_002758256.1| PREDICTED: xyloside xylosyltransferase 1 [Callithrix jacchus]
          Length = 393

 Score = 64.3 bits (155), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 54/208 (25%), Positives = 88/208 (42%), Gaps = 24/208 (11%)

Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
           F + +++ +  S     +YS     L + + +++P+ + + I LD D+ + T+I +L+  
Sbjct: 182 FPVVEAMQKHFSAGSGTYYSDSIFFLSVAMHQIMPKEILRIIQLDLDLKYKTNIRELFEE 241

Query: 232 FSRLRQRQTIGLVENQSDWYLGKLWKNHKPWP---------ALGRGFNTGVILLDLTKLR 282
           F        IG+       Y    W+     P             GFN+GV+LL+L  +R
Sbjct: 242 FDSFLPGAIIGIAREMQPVYRHTFWQFRHENPQTRVGGPPPEGLPGFNSGVMLLNLEAMR 301

Query: 283 D------ISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLS- 335
                  +      + +A+K+        L DQD F  I  EHP L + L C WN QL  
Sbjct: 302 QSLLYSRLLEPAQVQQLADKY---HFRGHLGDQDFFTMIGMEHPKLFHVLDCTWNRQLCT 358

Query: 336 ---DNTRSD--ELCYTELTDLKIIHWNS 358
              D+  SD  E  +     +KI H N 
Sbjct: 359 WWRDHGYSDVFEAYFQCEGHVKIYHGNC 386


>gi|313238546|emb|CBY13597.1| unnamed protein product [Oikopleura dioica]
          Length = 427

 Score = 64.3 bits (155), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 63/265 (23%), Positives = 100/265 (37%), Gaps = 75/265 (28%)

Query: 474 SDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEAL---RS 530
           S   DVTL +Q S + L  V  L + W GPIS+ ++  D E+  F++ A  S      + 
Sbjct: 94  SQYEDVTLASQCSANHLHHVINLLERWSGPISIAVFAPDKESS-FVTDAILSMRKCWPQI 152

Query: 531 RRNIGYHVVYKEG-------------------------NF-------------YPINTLR 552
           R  + +H+VY  G                         NF             YP N LR
Sbjct: 153 RDKVTFHLVYPIGHTSDVSASTGGLEFNDCNELVDIISNFGGDDAGNYKGDIPYPHNVLR 212

Query: 553 NVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKK--VLVVPAFETQRYR 610
           NVA+   +T Y FL+DID LP   +             +     +  V VVP FE ++  
Sbjct: 213 NVAIKTSATKYTFLIDIDVLPNENIRERFNQMAIKKGYYDEDFDENVVFVVPCFEMKKGY 272

Query: 611 TA----------FPAS-------------HAPTNFSRWV--------NATTPYQIEWAPD 639
           T           F                H+P     W+         +   +   W  +
Sbjct: 273 TRSLNKKHLLEDFQKGLIRPFHNESCWWCHSPEEHPHWMAIPLSRNDQSKISFNATWNKN 332

Query: 640 FEPYIVAHRDLPRYDTRFVGFGWNK 664
           +EP+ +  +++P +D RF  +G+++
Sbjct: 333 WEPFYIGSKNVPLFDERFKQYGFDR 357


>gi|300123041|emb|CBK24048.2| unnamed protein product [Blastocystis hominis]
          Length = 290

 Score = 64.3 bits (155), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 46/152 (30%), Positives = 73/152 (48%), Gaps = 11/152 (7%)

Query: 464 PNNVDEYEASSDGNDV----TLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFL 519
           P + D    S+D  DV    T  +Q ++ R   +  + + W GP+S+  Y        FL
Sbjct: 136 PTDFDPVFHSTDAPDVEYDVTFCSQGTITRWGFMLEVSRRWRGPMSIVYYFPKENESYFL 195

Query: 520 SYAGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYP 579
           S    S  LR    + Y V+  +  ++PINTLRN A++ V T + +L D+D  P FG Y 
Sbjct: 196 SLKAESN-LRPNIIVDY-VIQTDPFYFPINTLRNRAIDNVKTTHFWLTDMDVWPSFGSY- 252

Query: 580 YLKSSIRSMDMH-GHGGKKVLVVPAFETQRYR 610
               +I  + +      K   +VP F+ +R+R
Sbjct: 253 ---DAIMHLPLEFFQDPKNAGIVPVFQMRRHR 281


>gi|170067015|ref|XP_001868312.1| N-acetyl lactosaminide beta-1,3-N-acetyl glucosaminyl transferase
           [Culex quinquefasciatus]
 gi|167863173|gb|EDS26556.1| N-acetyl lactosaminide beta-1,3-N-acetyl glucosaminyl transferase
           [Culex quinquefasciatus]
          Length = 500

 Score = 63.9 bits (154), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 64/274 (23%), Positives = 104/274 (37%), Gaps = 86/274 (31%)

Query: 474 SDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEA----QQFLSYAGNS-EAL 528
           S+   V L  Q S++++  +  +  HW GPIS  ++ +  +     Q +LSY  N  + +
Sbjct: 125 SERYSVCLATQSSLEKIYSLVQVSHHWSGPISAAIFAAGNDELYLLQIYLSYLRNCFKTI 184

Query: 529 RSRRNIGYHVVYKE------------GNF------------------------------- 545
           R R  + +H+   +            G+F                               
Sbjct: 185 RER--VSFHLALPKERAPTHLKSIHVGDFTKFDCAKPEATLNDLIKLRKADTNKWRIKNP 242

Query: 546 YPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDM--HGHGGKK-VLVVP 602
           YP N LRNVA     + YVFL D+D +P      +L   +R+      GHG ++   VVP
Sbjct: 243 YPQNHLRNVARKGCQSTYVFLTDVDIIPSVNFAEHLDKFLRTQRAPHGGHGAQQTAFVVP 302

Query: 603 AFETQRYRTAFPASHA------------------------PTNFSRW--------VNATT 630
            +E    R  FP +                           TNFSRW        +   +
Sbjct: 303 TYELDE-RVRFPRNKTDLIRLANKGLARPFHHKVFIYNQFATNFSRWQADIFDGEMTRVS 361

Query: 631 PYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
                +   +EP+ VA   +P +D RF+G+G+ +
Sbjct: 362 HNVTNFEFLYEPFYVAPDTVPPHDERFLGYGYTR 395


>gi|440905211|gb|ELR55623.1| Glucoside xylosyltransferase 1, partial [Bos grunniens mutus]
          Length = 369

 Score = 63.9 bits (154), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 45/173 (26%), Positives = 76/173 (43%), Gaps = 19/173 (10%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L E +   + +DTD++F   I  +WAL  +    Q   +     +  +G  W 
Sbjct: 129 RLFLPLILKE-VDSLLYVDTDILFLRPIDDIWALLKKFNSTQIAAMAPEHEEPRIG--WY 185

Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
           N     P  G+ G N+GV+L+++T++R            + W      + +K+ L   W 
Sbjct: 186 NRFARHPYYGKTGVNSGVMLMNMTRMRRKYFKNDMTTVRLRWGDILMPLLKKYKLNITW- 244

Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
              DQD+ N I   +P  ++  PCQWN +             E   + I+H N
Sbjct: 245 --GDQDLLNIIFFHNPESLFVFPCQWNYRPDHCIYGSNCQEAEEEGIFILHGN 295



 Score = 46.6 bits (109), Expect = 0.042,   Method: Compositional matrix adjust.
 Identities = 38/151 (25%), Positives = 64/151 (42%), Gaps = 9/151 (5%)

Query: 13  EVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQV- 71
           E + +A+V  G       VT++KS L +   PL FH+  +    +  + +  +WS  Q  
Sbjct: 43  ERVHLAVVACG-ERLEETVTMLKSALIFSIKPLQFHIFAEDQLHDSFKGILDSWSFLQTF 101

Query: 72  EVSFYLADSVVEDVS-WIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQ 130
             S Y      E+ + W   K         +L LP +L E +   + +DTD++F   I  
Sbjct: 102 NYSLYPITFPSENAAEW---KKLFKPCASQRLFLPLILKE-VDSLLYVDTDILFLRPIDD 157

Query: 131 LWALFSRLRQRQTIGLVENQSDWYLGKLWKN 161
           +WAL  +    Q   +     +  +G  W N
Sbjct: 158 IWALLKKFNSTQIAAMAPEHEEPRIG--WYN 186


>gi|403270494|ref|XP_003927212.1| PREDICTED: xyloside xylosyltransferase 1, partial [Saimiri
           boliviensis boliviensis]
          Length = 328

 Score = 63.9 bits (154), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 54/208 (25%), Positives = 88/208 (42%), Gaps = 24/208 (11%)

Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
           F + +++ +  S     +YS     L + + +++P+ + + I LD D+ + T+I +L+  
Sbjct: 117 FPVVEAMQKHFSAGSGTYYSDSIFFLSVAMHQIMPKDVLRIIQLDLDLKYMTNIRELFEE 176

Query: 232 FSRLRQRQTIGLVENQSDWYLGKLWKNHKPWP---------ALGRGFNTGVILLDLTKLR 282
           F        IG+       Y    W+     P             GFN+GV+LL+L  +R
Sbjct: 177 FDSFLPGAVIGIAREMQPVYRHTFWQFRHENPQTRVGGPPPEGLPGFNSGVMLLNLEAMR 236

Query: 283 D------ISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLS- 335
                  +      + +A+K+        L DQD F  I  EHP L + L C WN QL  
Sbjct: 237 QSLLYSRLLEPARVQQLADKY---HFRGHLGDQDFFTMIGMEHPKLFHVLDCTWNRQLCT 293

Query: 336 ---DNTRSD--ELCYTELTDLKIIHWNS 358
              D+  SD  E  +     +KI H N 
Sbjct: 294 WWRDHGYSDVFEAYFRCEGHVKIYHGNC 321


>gi|224002262|ref|XP_002290803.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220974225|gb|EED92555.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 575

 Score = 63.9 bits (154), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 46/133 (34%), Positives = 65/133 (48%), Gaps = 6/133 (4%)

Query: 480 TLVAQLSMDRLQMVEMLFKHW-EGPISLTLY---MSDAEAQQFLSYAGNSEALRSRRNIG 535
           TLV QLS DRL M+    K W E PIS+ ++    +D   +Q +    N+E L + + + 
Sbjct: 204 TLVTQLSNDRLWMIPYHCKRWGEHPISIAVFSNRRADDVKEQLVKEGCNAEYL-NVQTVS 262

Query: 536 YHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGG 595
                  G  YP+N LRN+AL+ V T +V   D+DF P   L+  L        M     
Sbjct: 263 KTKYDPTGTEYPVNILRNLALSAVKTTHVVYADVDFWPASNLHSILTEQNVKERM-ASDN 321

Query: 596 KKVLVVPAFETQR 608
           K   V+PAF+  R
Sbjct: 322 KLATVIPAFQMNR 334


>gi|342321325|gb|EGU13259.1| Glycosyltransferase family 49 protein [Rhodotorula glutinis ATCC
           204091]
          Length = 569

 Score = 63.9 bits (154), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 63/254 (24%), Positives = 110/254 (43%), Gaps = 76/254 (29%)

Query: 463 NPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYA 522
           +P+ +D      +  D+T+   ++ +R  + E L + + GP+S T+++++ +A       
Sbjct: 231 DPDRID-----FNKEDITITTLVTSNRFAVFERLVERYRGPMSATVHLTEGKA------- 278

Query: 523 GNSEALRS-----------RRNIGYHVVYK--EGNFYPINTLRNVALNQVSTPYVFLLDI 569
            +SE LR+           ++ +  H+VY   E  F   N  RNVA     T YV +LD+
Sbjct: 279 -HSEMLRALETMYTSSPLMKQYVDIHLVYDPFERQF---NMWRNVAKFFARTEYVMMLDV 334

Query: 570 DFLPMFGLYPYLKSSIRSMDMHG-------HGGKKVLVVPAFETQRYRTA-----FPAS- 616
           DF        +L +  RS  +          GG    VVPAFE  +         FP++ 
Sbjct: 335 DF--------WLCTDFRSRMLESPEVMQRLKGGMAAFVVPAFEFHKQADGVDPNTFPSTK 386

Query: 617 ----------------------HAPTNFSRWVNATTP--YQIE-WAPDFEPYIVAHRD-L 650
                                 H  TN++R+  A     Y+++ +   +EPY++  ++  
Sbjct: 387 EGLLQLVNDDKIGMFHKSWAPGHGSTNYTRYYAAKPGEVYRVQGYTHSYEPYVIFKKEGT 446

Query: 651 PRYDTRFVGFGWNK 664
           P  D RF+G+G NK
Sbjct: 447 PWCDERFIGYGGNK 460


>gi|326679949|ref|XP_002667034.2| PREDICTED: glucoside xylosyltransferase 1-like, partial [Danio
           rerio]
          Length = 405

 Score = 63.9 bits (154), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 43/173 (24%), Positives = 75/173 (43%), Gaps = 19/173 (10%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L + +   + +DTD++F   +  +W+  S   +     L     +  +G  W 
Sbjct: 218 RLFLPLILKQ-VDSVLYVDTDILFLRPVEDIWSFLSSFNRSHVAALAPEHEEPRMG--WY 274

Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLR-----------DISWAGFWRIIAEKFLLTRLWT 304
           N     P  G+ G N+GV+L+++T++R           D+SW+     +  K+ L   W 
Sbjct: 275 NRFARHPYYGKTGVNSGVMLINMTRIRHTHFKNDLSAMDLSWSDLLMPLLNKYKLNITW- 333

Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
              DQD+ N I   +P  +   PC WN +             E   + I+H N
Sbjct: 334 --GDQDLLNIIFHYNPESLLVFPCHWNYRPDHCIYGSNCAAAEQHGIYILHGN 384


>gi|393222913|gb|EJD08397.1| hypothetical protein FOMMEDRAFT_74587 [Fomitiporia mediterranea
           MF3/22]
          Length = 381

 Score = 63.9 bits (154), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 59/245 (24%), Positives = 101/245 (41%), Gaps = 53/245 (21%)

Query: 473 SSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAG--------- 523
           S    D+T+   ++ +R  + + L + + GPIS+T+++S AE  +  S            
Sbjct: 38  SHKKEDITIATLVTRNRFAVFKQLVERYRGPISVTVHVSRAELAESSSGGKGVNFLDALH 97

Query: 524 ---NSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPY 580
              +S  L S     + ++    N    N  RN A     T YV +LD+DF+P       
Sbjct: 98  ALYSSSPLMSALVDVHLILTPSRNDRQFNAWRNAARLFARTDYVMMLDVDFVPCTDFRTR 157

Query: 581 LKSSIRSMDMHG--HGGKKVLVVPAFETQRYRTA-----FPAS----------------- 616
           L  + ++  +H      +  LV+PAFE   ++       FP S                 
Sbjct: 158 LLHTPKTSPIHELLRRDRAALVIPAFEYVEHKDGTDARTFPTSKVELIKLYNESRIGMFH 217

Query: 617 ------HAPTNFSRWVNATTPYQI------EWAPDFEPYIVAHRD-----LPRYDTRFVG 659
                 H  T++ R++  + P ++       +   +EPY++  RD     +P  D RFVG
Sbjct: 218 ATWSPGHNSTDYERFLYHSQPGEVYKVSPAHYQHAYEPYVIFRRDGGERTIPWCDERFVG 277

Query: 660 FGWNK 664
           +G NK
Sbjct: 278 YGGNK 282


>gi|383851201|ref|XP_003701127.1| PREDICTED: xyloside xylosyltransferase 1-like [Megachile rotundata]
          Length = 356

 Score = 63.9 bits (154), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 47/165 (28%), Positives = 71/165 (43%), Gaps = 19/165 (11%)

Query: 186 PNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVE 245
           P  +YS     L L L ++ P       + D D  F  DI +L+  F+        GL  
Sbjct: 158 PGTYYSDALFFLSLGLHRIAPPEQNVAAMFDADTKFRKDIKELFEEFNSFGNEALFGLAP 217

Query: 246 NQSDWY--LGKLWKNHKPWPALGR--------GFNTGVILLDLTKLRDIS-WAGFWR--- 291
             +  Y  +  L++N  P    G         G+N+GV+L +L +LR+ S +    R   
Sbjct: 218 ELTPVYRHVLYLYRNKNPNTLFGEPASSGGYPGYNSGVVLFNLNRLRNSSIYHEIVRNES 277

Query: 292 --IIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQL 334
              I EK+        L DQD +  +  E P L++T+ C WN QL
Sbjct: 278 VNSITEKY---HFKGHLGDQDFYTLLGMERPELIHTIDCGWNKQL 319


>gi|219115037|ref|XP_002178314.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217410049|gb|EEC49979.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 402

 Score = 63.5 bits (153), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 66/241 (27%), Positives = 100/241 (41%), Gaps = 61/241 (25%)

Query: 478 DVTLVAQLSMDRLQMVEMLFKHWEG-PISLTLYMSD--AEAQQFLSYAGNSEALRSRRNI 534
           D TLV QLS  RL ++E   + W   PISL +  ++     ++ LS  G  + L +   +
Sbjct: 95  DFTLVTQLSPSRLAIMEQHCERWGNHPISLAIGTTEDIENIEKTLSEFGCQKNLVT---V 151

Query: 535 GYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHG 594
            +   +     YP+N LRN+A+++V T + F++D DF+    LY  L+   R+M    H 
Sbjct: 152 SFVSDFNSEGEYPVNRLRNLAMSRVQTSHAFVIDADFVLSTNLYQMLRLH-RAMLATDH- 209

Query: 595 GKKVLVVPAFETQ---------------------------------------------RY 609
               LVVPAFE Q                                             R 
Sbjct: 210 -LHTLVVPAFELQAVCNEQNENCTAKHLAMLPDTKNELVKQYWTAKDHVGPNPSVTQFRE 268

Query: 610 RTAFPASHAPTNFSRWVNATT----PYQIEWAPDFEPYIVAH--RDLPRYDTRFVGFGWN 663
           + AF A HA T +  W+        P +   +  +EPY+V    + L  +   FVG+G N
Sbjct: 269 QAAFHA-HASTRYEDWMTQPAEKLLPIECVTSDSYEPYLVVRPCQFLAPFQEAFVGYGQN 327

Query: 664 K 664
           K
Sbjct: 328 K 328


>gi|300798509|ref|NP_001180096.1| glucoside xylosyltransferase 1 [Bos taurus]
 gi|296487731|tpg|DAA29844.1| TPA: glucoside xylosyltransferase 1 isoform 2 [Bos taurus]
          Length = 440

 Score = 63.5 bits (153), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 44/173 (25%), Positives = 76/173 (43%), Gaps = 19/173 (10%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L E +   + +DTD++F   +  +WAL  +    Q   +     +  +G  W 
Sbjct: 200 RLFLPLILKE-VDSLLYVDTDILFLRPVDDIWALLKKFNSTQIAAMAPEHEEPRIG--WY 256

Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
           N     P  G+ G N+GV+L+++T++R            + W      + +K+ L   W 
Sbjct: 257 NRFARHPYYGKTGVNSGVMLMNMTRMRRKYFKNDMTTVRLRWGDILMPLLKKYKLNITW- 315

Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
              DQD+ N I   +P  ++  PCQWN +             E   + I+H N
Sbjct: 316 --GDQDLLNIIFFHNPESLFVFPCQWNYRPDHCIYGSNCQEAEEEGIFILHGN 366



 Score = 45.8 bits (107), Expect = 0.078,   Method: Compositional matrix adjust.
 Identities = 37/151 (24%), Positives = 64/151 (42%), Gaps = 9/151 (5%)

Query: 13  EVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQV- 71
           E + +A+V  G       VT++KS L +   PL FH+  +    +  + +  +WS  Q  
Sbjct: 114 ERVHLAVVACG-ERLEETVTMLKSALIFSIKPLQFHIFAEDQLHDSFKGILDSWSFLQTF 172

Query: 72  EVSFYLADSVVEDVS-WIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQ 130
             S Y      E+ + W   K         +L LP +L E +   + +DTD++F   +  
Sbjct: 173 NYSLYPITFPSENAAEW---KKLFKPCASQRLFLPLILKE-VDSLLYVDTDILFLRPVDD 228

Query: 131 LWALFSRLRQRQTIGLVENQSDWYLGKLWKN 161
           +WAL  +    Q   +     +  +G  W N
Sbjct: 229 IWALLKKFNSTQIAAMAPEHEEPRIG--WYN 257


>gi|307189145|gb|EFN73593.1| N-acetyllactosaminide beta-1,3-N-acetylglucosaminyltransferase
           [Camponotus floridanus]
          Length = 735

 Score = 63.5 bits (153), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 63/272 (23%), Positives = 102/272 (37%), Gaps = 91/272 (33%)

Query: 479 VTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEA----QQFLSYAGNS-EALRSR-- 531
           V L  Q S+++L  +  +  HW GP+S+ LY +  E     Q++L Y     E++R R  
Sbjct: 154 VCLATQTSVEKLHSLVQVALHWTGPMSVALYAAGDEEFEVLQRYLVYLRKCYESIRERVI 213

Query: 532 ---------------------------------RNIGYHVVYKEGNF-----YPINTLRN 553
                                                 H+  ++ N+     YP N +RN
Sbjct: 214 FSLAVPRMRPPKKQPRVFELPDIIDCAKPEATLNEFASHISSEQTNWRIRNVYPQNHMRN 273

Query: 554 VALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRYRTAF 613
           +A     T YVFL D+D +P F L   L   +R+ +      K   V+P +E    R  F
Sbjct: 274 LARKNCQTDYVFLTDVDIIPSFNLSVALDEFLRNDNC----DKCAYVIPTYELD-TRMRF 328

Query: 614 PASHA------------------------PTNFSRWVNATTPY----------QIEWAPD 639
           P +                           TNF+RW+   +P           ++  + D
Sbjct: 329 PQNKTELIRLARKGLARPFHQKVFIHNQFATNFTRWLQDVSPTHKYHENVRTGKVYVSHD 388

Query: 640 -------FEPYIVAHRDLPRYDTRFVGFGWNK 664
                  +EP+ VA    P +D RF+G+G+ +
Sbjct: 389 VTNFEFLYEPFYVAKDIAPSHDERFMGYGYTR 420


>gi|426343379|ref|XP_004038286.1| PREDICTED: xyloside xylosyltransferase 1 [Gorilla gorilla gorilla]
          Length = 247

 Score = 63.5 bits (153), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 55/208 (26%), Positives = 87/208 (41%), Gaps = 24/208 (11%)

Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
           F + +++ +  S     +YS     L + + +++P+ + + I LD D+ F T+I +L+  
Sbjct: 36  FPVVEAMQKHFSAGSGTYYSDSIFFLSVAMHQIMPKEILQIIQLDLDLKFKTNIRELFEE 95

Query: 232 FSRLRQRQTIGLVENQSDWYLGKLWKNHKPWP---------ALGRGFNTGVILLDLTKLR 282
           F        IG+       Y    W+     P             GFN+GV+LL L  +R
Sbjct: 96  FDSFLPGAIIGIAREMQPVYRHTFWQFRHENPQTRVGGPPPEGLPGFNSGVMLLSLEAMR 155

Query: 283 D------ISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLS- 335
                  +      + +A+K+        L DQD F  I  EHP L + L C WN QL  
Sbjct: 156 QSPLYSRLLEPAQVQQLADKY---HFRGHLGDQDFFTMIGMEHPKLFHVLDCTWNRQLCT 212

Query: 336 ---DNTRSD--ELCYTELTDLKIIHWNS 358
              D+  SD  E  +     +KI H N 
Sbjct: 213 WWRDHGYSDVFEAYFRCEGHVKIYHGNC 240


>gi|198458069|ref|XP_001360902.2| GA16964 [Drosophila pseudoobscura pseudoobscura]
 gi|198136211|gb|EAL25477.2| GA16964 [Drosophila pseudoobscura pseudoobscura]
          Length = 506

 Score = 63.5 bits (153), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 62/278 (22%), Positives = 107/278 (38%), Gaps = 87/278 (31%)

Query: 468 DEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEA----QQFLSY-- 521
           ++Y  SS+ + V L  Q S++RL  +  +  +W+GP+S+ ++ +  E     Q F++Y  
Sbjct: 46  EQYTESSESSLVCLATQTSVERLNSLPQVATNWQGPMSVAVFSASHEEFVVLQYFVTYMR 105

Query: 522 --------------------------AGNSEALRSRRNIGY---------------HVVY 540
                                        S  LR + +  Y                + +
Sbjct: 106 LCFPNIRENATFHLLTPPDYDKLPRLVSISFNLRGKFDCQYPDRTLKALLKLRSLKSLQW 165

Query: 541 KEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLV 600
           ++ N YP N +RN+A     T Y FL DID +P     P L   +++ +   +      V
Sbjct: 166 RQRNPYPQNHMRNLARKGCQTKYTFLTDIDIVPSTNSVPQLNQFLKTANCSKNCA---YV 222

Query: 601 VPAFETQRYRTAFPAS------------------------HAPTNFSRWVNATTP----- 631
           +P FE    R +FP S                           TNFS+W+++ T      
Sbjct: 223 IPTFEID-VRASFPRSKYGLQRLMKKGLARPFHEKVFIYNQYATNFSKWLSSNTNDTEVS 281

Query: 632 -----YQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
                   E+   +EP+ +A    P +D RF G+G+ +
Sbjct: 282 VSHVVTNFEFL--YEPFYIAIDSAPAHDERFTGYGFTR 317


>gi|302830866|ref|XP_002946999.1| hypothetical protein VOLCADRAFT_103232 [Volvox carteri f.
           nagariensis]
 gi|300268043|gb|EFJ52225.1| hypothetical protein VOLCADRAFT_103232 [Volvox carteri f.
           nagariensis]
          Length = 276

 Score = 63.2 bits (152), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 44/149 (29%), Positives = 66/149 (44%), Gaps = 29/149 (19%)

Query: 545 FYPINTLRNVALNQVSTPYVFLLDIDFLPMFGL-YPYLKSSIRSMDMHGHGGKKVLVVPA 603
            YP+NTLRN A     T  +  +D+D LP   + Y      I +    G    +  V PA
Sbjct: 48  LYPLNTLRNYARLMADTDLITNVDVDLLPSMSISYALADPGILANYTEGCRSGRAFVWPA 107

Query: 604 FET-----------------------QRYRTAF----PASHAPTNFSRWVNATTPYQIEW 636
           F+T                       Q+  T F    P  H  TN+ RW+ +   Y IE+
Sbjct: 108 FDTHCNGTEYADFIAMDGRPALNEGLQKCVTMFSYKNPVFHNATNYQRWLTSRETYDIEY 167

Query: 637 APDFEPYIVAHR-DLPRYDTRFVGFGWNK 664
             ++EP+ ++ R D P +D R+ G+G+NK
Sbjct: 168 NLEYEPWYLSWRWDTPWFDFRYRGYGYNK 196


>gi|403269346|ref|XP_003926708.1| PREDICTED: glucoside xylosyltransferase 1 isoform 2 [Saimiri
           boliviensis boliviensis]
          Length = 406

 Score = 63.2 bits (152), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 42/173 (24%), Positives = 76/173 (43%), Gaps = 19/173 (10%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L E +   + +DTD++F   +  +W+   +    Q   +     +  +G  W 
Sbjct: 166 RLFLPLILKE-VDSLLYVDTDILFLRPVDDIWSFLKKFNSTQIAAMAPEHEEPRIG--WY 222

Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
           N     P  G+ G N+GV+L+++T++R            + W      + +K+ L   W 
Sbjct: 223 NRFARHPYYGKTGVNSGVMLMNMTRMRRKYFKNDMTTVRLQWGDILMPLLKKYKLNITW- 281

Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
              DQD+ N + S +P  ++  PCQWN +             E   + I+H N
Sbjct: 282 --GDQDLLNIMFSHNPESLFVFPCQWNYRPDHCIYGSNCQEAEEDGIFILHGN 332


>gi|328778506|ref|XP_001120832.2| PREDICTED: uncharacterized protein C3orf21 homolog [Apis mellifera]
          Length = 357

 Score = 63.2 bits (152), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 45/164 (27%), Positives = 75/164 (45%), Gaps = 17/164 (10%)

Query: 186 PNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVE 245
           P  +YS     L L L ++ P       + D D  F TDI +L+  F+    +   GL  
Sbjct: 159 PGTYYSDALFFLSLGLHRIAPIEQNVAAMFDADTKFRTDIKELFEEFNTFGNQALFGLSP 218

Query: 246 NQSDWY--LGKLWKNHKPWPALGR--------GFNTGVILLDLTKLRDISWAGFWRIIAE 295
             +  Y  +  L++N  P    G         G+N+G++L +L KLR+ +   + +I+ +
Sbjct: 219 ELTPVYRHVLYLYRNKHPNTLFGEPASSGGYPGYNSGMVLFNLDKLRNSTV--YDKIVKK 276

Query: 296 KFLLT-----RLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQL 334
           + + T          L DQD +  +  E P L++T+ C WN QL
Sbjct: 277 ESVNTMTSKYHFKGHLGDQDFYTLLGMEKPELIHTIDCGWNRQL 320


>gi|357612376|gb|EHJ67946.1| putative GLT8D3 protein [Danaus plexippus]
          Length = 263

 Score = 63.2 bits (152), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 48/191 (25%), Positives = 87/191 (45%), Gaps = 18/191 (9%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSD----WYLG 253
           +L +P ++   +   + +DTD +F     +LW +F ++ + Q   +     +    WY  
Sbjct: 11  RLFIPNLITH-IDSMMYVDTDTLFLGPADRLWEVFYKMNRTQISAMALEDDNPNISWY-- 67

Query: 254 KLWKNHKPWPALGR-GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIF 312
             +  H   P  G+ G N+GV+L++LT++RD  W  +   I  K+   +L+    DQDI 
Sbjct: 68  PRFAKH---PFYGKYGLNSGVMLMNLTRMRDFGWVDYVTPIMLKW---KLYIPWGDQDII 121

Query: 313 NAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWNSPKKLKVKNKHMEFF 372
           N I   H   VY + C +N + SD     + C      + ++H +   +    N     F
Sbjct: 122 NIIFHYHENAVYVMTCNYNYR-SDQCVYGDACEPASHGVLVVHGS---RGVFHNNKQPAF 177

Query: 373 RNLYLTFLEYD 383
           + +Y    EY+
Sbjct: 178 QAVYRAINEYE 188


>gi|351713218|gb|EHB16137.1| Glycosyltransferase 8 domain-containing protein 3, partial
           [Heterocephalus glaber]
          Length = 338

 Score = 63.2 bits (152), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 42/173 (24%), Positives = 76/173 (43%), Gaps = 19/173 (10%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L E +   + +DTD++F   +  +W+L  +    Q   +     +  +G  W 
Sbjct: 98  RLFLPSILKE-VDSLLYVDTDILFLRPVDDIWSLLKKFNSTQIAAMAPEHEEPRIG--WY 154

Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
           N     P  G+ G N+GV+L+++T++R            + W      + +K+ L   W 
Sbjct: 155 NRFARHPYYGKTGVNSGVMLMNMTRMRKKYFKNDMTTVRLQWGDILMPLLKKYKLNITW- 213

Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
              DQD+ N +   +P  ++  PCQWN +             E   + I+H N
Sbjct: 214 --GDQDLLNIMFFHNPESLFVFPCQWNYRPDHCIYGSNCREAEEEGIFILHGN 264



 Score = 42.4 bits (98), Expect = 0.87,   Method: Compositional matrix adjust.
 Identities = 35/155 (22%), Positives = 64/155 (41%), Gaps = 9/155 (5%)

Query: 9   IPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSV 68
           I   E + +A+V  G       +T++KS L +   PL FH+  +    +  +    +WS 
Sbjct: 8   IQPIEKMHLAVVACG-ERLEETMTMLKSALIFSIKPLQFHIFAEDQLHDSFKARLDSWSF 66

Query: 69  PQV-EVSFYLADSVVEDVS-WIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 126
            Q    + Y      E+ + W   K         +L LP +L E +   + +DTD++F  
Sbjct: 67  LQTFNYTIYPITFPSENAAEW---KKLFKPCASQRLFLPSILKE-VDSLLYVDTDILFLR 122

Query: 127 DIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKN 161
            +  +W+L  +    Q   +     +  +G  W N
Sbjct: 123 PVDDIWSLLKKFNSTQIAAMAPEHEEPRIG--WYN 155


>gi|195430906|ref|XP_002063489.1| GK21937 [Drosophila willistoni]
 gi|194159574|gb|EDW74475.1| GK21937 [Drosophila willistoni]
          Length = 378

 Score = 63.2 bits (152), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 63/282 (22%), Positives = 106/282 (37%), Gaps = 94/282 (33%)

Query: 468 DEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEA----QQFLSYAG 523
           D+Y  SS+ + + L  Q S++RL  +  +  +W GPIS+ L+ +  E     Q F++Y  
Sbjct: 34  DQYTESSENSLICLATQSSVERLNSLAQVSANWYGPISVALFSASNEEFLILQYFITYMR 93

Query: 524 ----NSEALRSRRNIGYHVV---------------------------------------- 539
               N +A     N  +H++                                        
Sbjct: 94  LCFPNIKA-----NATFHILIPRDYEKLPKVVTLPFILKGKFDCRYPEKTLKALLKLRTV 148

Query: 540 ----YKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGG 595
               +++ N YP N +RN+A     + YVFL DID +P   +   L    +      +  
Sbjct: 149 KTLQWRQRNTYPQNHMRNLARKGCQSKYVFLTDIDIIPSTNIVAQLNDFFKGERCTKNCA 208

Query: 596 KKVLVVPAFETQRYRTAFPASHA------------------------PTNFSRW----VN 627
               V+P FE    R  FP S                           TNF++W    +N
Sbjct: 209 ---YVIPTFEID-VRAPFPRSKKSLLRLIRKGLARPFHEKVFIYNQYATNFTKWLSTNLN 264

Query: 628 ATTPYQIEWAPD-----FEPYIVAHRDLPRYDTRFVGFGWNK 664
            T   +I  +       +EP+ +A  ++P +D RF G+G+ +
Sbjct: 265 ETDVTKISHSVTNFEFLYEPFYIAMDNVPAHDERFTGYGFTR 306


>gi|296211440|ref|XP_002752411.1| PREDICTED: glucoside xylosyltransferase 1 isoform 2 [Callithrix
           jacchus]
          Length = 406

 Score = 63.2 bits (152), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 42/173 (24%), Positives = 76/173 (43%), Gaps = 19/173 (10%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L E +   + +DTD++F   +  +W+   +    Q   +     +  +G  W 
Sbjct: 166 RLFLPLILKE-VDSLLYVDTDILFLRPVDDIWSFLKKFNSTQIAAMAPEHEEPRIG--WY 222

Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
           N     P  G+ G N+GV+L+++T++R            + W      + +K+ L   W 
Sbjct: 223 NRFARHPYYGKTGVNSGVMLMNMTRMRRKYFKNDMTTVRLQWGDILMPLLKKYKLNITW- 281

Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
              DQD+ N + S +P  ++  PCQWN +             E   + I+H N
Sbjct: 282 --GDQDLLNIMFSHNPESLFVFPCQWNYRPDHCIYGSNCQEAEEDGIFILHGN 332



 Score = 39.3 bits (90), Expect = 7.3,   Method: Compositional matrix adjust.
 Identities = 34/155 (21%), Positives = 63/155 (40%), Gaps = 9/155 (5%)

Query: 9   IPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSV 68
           I   E I +A+V  G       +T++KS + +   PL FH+  +    +  +    +WS 
Sbjct: 76  IQPVEKIHLAVVACG-ERLEETMTMLKSAIIFSIRPLQFHIFAEDQLHHSFKGRLDSWSF 134

Query: 69  PQV-EVSFYLADSVVEDVS-WIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 126
            Q    + Y      E+ + W   K         +L LP +L E +   + +DTD++F  
Sbjct: 135 LQTFNYTLYPITFPSENAAEW---KKLFKPCASQRLFLPLILKE-VDSLLYVDTDILFLR 190

Query: 127 DIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKN 161
            +  +W+   +    Q   +     +  +G  W N
Sbjct: 191 PVDDIWSFLKKFNSTQIAAMAPEHEEPRIG--WYN 223


>gi|354594893|ref|ZP_09012930.1| hypothetical protein CIN_16260 [Commensalibacter intestini A911]
 gi|353671732|gb|EHD13434.1| hypothetical protein CIN_16260 [Commensalibacter intestini A911]
          Length = 608

 Score = 63.2 bits (152), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 58/207 (28%), Positives = 98/207 (47%), Gaps = 31/207 (14%)

Query: 183 SWIP-NKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTI 241
           +++P N+ Y  +    +L +  +LP+T+ K I +D+DVI   +IA+LW       Q   +
Sbjct: 358 NYLPLNREYISLNTYYRLVIQDILPKTVKKIIYIDSDVIVYGNIAELW---QEPLQDMCV 414

Query: 242 GLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTR 301
           G V ++     G   ++ +        FN G+++ D+ K++   +   ++   E F   R
Sbjct: 415 GAVLDE-----GGTLQSRRLSLEDNNYFNAGIMIFDIEKIKG-EFKDIFKTYFENFYKNR 468

Query: 302 LWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRS---DELCY--------TELTD 350
              +L DQDI N   +E   +V   P +WNV    NTR    +EL Y          L +
Sbjct: 469 DIITLQDQDILNITFAEKTKIV---PLRWNV----NTRMLGYNELDYKYTLKDAEAALQN 521

Query: 351 LKIIHWNSPKK-LKVKNKHMEFFRNLY 376
           + IIH+   +K  K+   H   FR+LY
Sbjct: 522 IGIIHYTDKRKPWKITCNHP--FRSLY 546



 Score = 41.2 bits (95), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 28/120 (23%), Positives = 60/120 (50%), Gaps = 11/120 (9%)

Query: 20  VCAGYNSTRS------LVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQVEV 73
           +C  ++   S      + +LIK+   + K  ++ +++ D       Q++ +      V +
Sbjct: 290 ICMSFDDNYSAHGDAVITSLIKNA--HPKQQINIYILHDEKLSRTNQSILTRSENQNVRI 347

Query: 74  SFYLADSVVEDVSWIP-NKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLW 132
            + L D  +   +++P N+ Y  +    +L +  +LP+T+ K I +D+DVI   +IA+LW
Sbjct: 348 HYILIDKKL--FNYLPLNREYISLNTYYRLVIQDILPKTVKKIIYIDSDVIVYGNIAELW 405


>gi|410918973|ref|XP_003972959.1| PREDICTED: glucoside xylosyltransferase 1-like [Takifugu rubripes]
          Length = 413

 Score = 63.2 bits (152), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 42/176 (23%), Positives = 77/176 (43%), Gaps = 25/176 (14%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSD----WYLG 253
           +L LP +L + +   + +D+D++F   +  LWA  S+    Q   +     +    WY  
Sbjct: 173 RLFLPLILKD-IDSIVYVDSDILFLQPMDHLWAFLSKFTPSQLAAMAPEHEEPRIAWY-- 229

Query: 254 KLWKNHKPWPALGR-GFNTGVILLDLTKLRDIS-----------WAGFWRIIAEKFLLTR 301
             +  H   P  GR G N+GV+L+++T++R +S           W      + +K+ L  
Sbjct: 230 SRFARH---PFYGRTGVNSGVMLMNMTRMRSVSFKNDMTSVGLQWEELLMPLLQKYKLNI 286

Query: 302 LWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
            W    DQD+ N +   +P  +   PC+WN +             E + + ++H N
Sbjct: 287 TW---GDQDLLNIVFHHNPERLLEFPCKWNYRPDHCIYGSNCASAEESGVYMLHGN 339


>gi|332022196|gb|EGI62513.1| N-acetyllactosaminide beta-1,3-N-acetylglucosaminyltransferase
           [Acromyrmex echinatior]
          Length = 486

 Score = 63.2 bits (152), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 62/272 (22%), Positives = 99/272 (36%), Gaps = 91/272 (33%)

Query: 479 VTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEA----QQFLSYAGNS-EALRSR-- 531
           V L  Q S+++L  +  +  HW GP+S+ LY +  E     Q++L Y     E++R R  
Sbjct: 126 VCLATQSSVEKLHSLVQVALHWTGPMSVALYAAGDEEFEVLQRYLVYLRRCYESIRERVI 185

Query: 532 --------------------------------------RNIGYHVVYKEGNFYPINTLRN 553
                                                 R       ++  N YP N +RN
Sbjct: 186 FSLAVPKMRTPKKQPRVFELPDRVDCAKPEATLNEFMSRVPNEQTNWRIRNVYPQNHMRN 245

Query: 554 VALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRYRTAF 613
           +A     T YVFL D+D +P F L   L   +R+     +  K   V+P +E    R  F
Sbjct: 246 LARKNCQTDYVFLTDVDIVPSFNLTVVLDEFLRN----DNCDKCAYVIPTYELD-TRVRF 300

Query: 614 PASHA------------------------PTNFSRWVNATTPYQIE-------------- 635
           P +                           TNF+RW+   +P  +               
Sbjct: 301 PPNKTELIRLATKGLARPFHQKVFIHNQFATNFTRWLQDVSPNHVHHENVKTGKVYISHD 360

Query: 636 ---WAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
              +   +EP+ VA   +P +D RF+G+G+ +
Sbjct: 361 VTNFEFLYEPFYVAKDIVPPHDERFMGYGYTR 392


>gi|426226725|ref|XP_004007489.1| PREDICTED: glucoside xylosyltransferase 1 [Ovis aries]
          Length = 486

 Score = 63.2 bits (152), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 43/173 (24%), Positives = 76/173 (43%), Gaps = 19/173 (10%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L E +   + +DTD++F   +  +W+L  +    Q   +     +  +G  W 
Sbjct: 246 RLFLPLILKE-VDSLLYVDTDILFLRPVDDIWSLLKKFNSTQIAAMAPEHEEPRIG--WY 302

Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
           N     P  G+ G N+GV+L+++T++R            + W      + +K+ L   W 
Sbjct: 303 NRFARHPYYGKTGVNSGVMLMNMTRMRRKYFKNDMTTVRLRWGDILMPLLKKYKLNITW- 361

Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
              DQD+ N I   +P  ++  PCQWN +             E   + I+H N
Sbjct: 362 --GDQDLLNIIFFHNPESLFVFPCQWNYRPDHCIYGSNCQEAEEEGIFILHGN 412



 Score = 45.1 bits (105), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 36/151 (23%), Positives = 64/151 (42%), Gaps = 9/151 (5%)

Query: 13  EVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQV- 71
           E + +A+V  G       VT++KS L +   PL FH+  +    +  + +  +WS  Q  
Sbjct: 160 ERVHLAVVACG-ERLEETVTMLKSALIFSIKPLQFHIFAEDQLHDSFKGILDSWSFLQTF 218

Query: 72  EVSFYLADSVVEDVS-WIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQ 130
             S Y      E+ + W   K         +L LP +L E +   + +DTD++F   +  
Sbjct: 219 NYSLYPITFPSENAAEW---KKLFKPCASQRLFLPLILKE-VDSLLYVDTDILFLRPVDD 274

Query: 131 LWALFSRLRQRQTIGLVENQSDWYLGKLWKN 161
           +W+L  +    Q   +     +  +G  W N
Sbjct: 275 IWSLLKKFNSTQIAAMAPEHEEPRIG--WYN 303


>gi|48686592|gb|AAT46050.1| hypothetical protein [Rattus norvegicus]
          Length = 319

 Score = 63.2 bits (152), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 44/173 (25%), Positives = 76/173 (43%), Gaps = 19/173 (10%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L + +   + +DTDV+F   +  +W+L  R    Q   +     +  +G  W 
Sbjct: 79  RLFLPLIL-KGVDSLLYVDTDVLFLRPVDDIWSLLERFNSTQIAAMAPEHEEPRVG--WY 135

Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
           N     P  GR G N+GV+L+++T++R            + W      + +K+ L   W 
Sbjct: 136 NRFARHPYYGRTGVNSGVMLMNMTRMRRKYFKNDMTTARLQWGDILMPLLKKYKLNITW- 194

Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
              DQD+ N +   +P  ++  PCQWN +             E   + I+H N
Sbjct: 195 --GDQDLLNIMFYHNPESLFVFPCQWNYRPDHCIYGSNCREAEEEGVFILHGN 245



 Score = 40.0 bits (92), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 31/131 (23%), Positives = 55/131 (41%), Gaps = 4/131 (3%)

Query: 31  VTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQVEVSFYLADSVVEDVSWIPN 90
           VT++KS L +   PLH H+  +    +  +    +WS  Q   ++ L        S +  
Sbjct: 10  VTMLKSALIFSIKPLHVHIFAEDQLHDSFKDRLDSWSFLQ-RFNYSLYPITFPSDSAMEW 68

Query: 91  KHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQ 150
           K         +L LP +L + +   + +DTDV+F   +  +W+L  R    Q   +    
Sbjct: 69  KKLFKPCASQRLFLPLIL-KGVDSLLYVDTDVLFLRPVDDIWSLLERFNSTQIAAMAPEH 127

Query: 151 SDWYLGKLWKN 161
            +  +G  W N
Sbjct: 128 EEPRVG--WYN 136


>gi|307200359|gb|EFN80611.1| Uncharacterized protein C3orf21-like protein [Harpegnathos
           saltator]
          Length = 368

 Score = 62.8 bits (151), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 44/164 (26%), Positives = 75/164 (45%), Gaps = 17/164 (10%)

Query: 186 PNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVE 245
           P  +YS     L L L ++ P       + D D  F  D+  L+  F+   +    GL  
Sbjct: 170 PGTYYSDALFFLSLGLHRIAPSDQKLAAMFDADTKFHRDVKDLFEEFNNFGEEALFGLAP 229

Query: 246 NQSDWY--LGKLWKNHKPWPALGR--------GFNTGVILLDLTKLRDISWAGFWRIIAE 295
             +  Y  +  L++N  P    G         G+N+G++L +L +LR  +   + +I+++
Sbjct: 230 ELTPVYRHVLYLYRNKHPNTLFGEPRHKGGYPGYNSGMVLFNLERLRKSNI--YKKIVSK 287

Query: 296 KFL--LTRLW---TSLADQDIFNAIISEHPYLVYTLPCQWNVQL 334
           K +  +T  +     L DQD +  +  E P L+YT+ C WN QL
Sbjct: 288 KNVDAMTEKYHFKGHLGDQDFYTLLGMEKPELIYTVDCGWNRQL 331


>gi|195151255|ref|XP_002016563.1| GL11650 [Drosophila persimilis]
 gi|194110410|gb|EDW32453.1| GL11650 [Drosophila persimilis]
          Length = 459

 Score = 62.8 bits (151), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 51/204 (25%), Positives = 83/204 (40%), Gaps = 48/204 (23%)

Query: 174 LADSVVEDV--------SWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           +  +++ED+        S  PN +YS     L L L ++   +L + I+LD D++F +D+
Sbjct: 227 VCSNIIEDIAAKLSPYFSSAPNSYYSDSLFFLSLGLHRIADASLNRAILLDCDIVFRSDV 286

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWP--ALGR---------------- 267
             L+  F      Q  GL    +  Y   L++    +P  + G                 
Sbjct: 287 RLLFNEFDHFLPHQLYGLAPELTPVYRHILYRYRVRYPKTSFGNPYYPMNHDMGNQHNHN 346

Query: 268 -------GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWT---------SLADQDI 311
                  G N+GV+LL   ++R+       +   EK   + + T          L DQD 
Sbjct: 347 HERHGYPGLNSGVVLLLFNRIRN------SKSYLEKLTNSEVQTLVAKYSFKGHLGDQDF 400

Query: 312 FNAIISEHPYLVYTLPCQWNVQLS 335
           F  +  E+P L+Y L C WN QL 
Sbjct: 401 FTLLGYEYPNLIYRLDCVWNRQLC 424



 Score = 49.3 bits (116), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 38/150 (25%), Positives = 65/150 (43%), Gaps = 14/150 (9%)

Query: 36  SILFYRKNPLHFHLITDTVA----LNILQTLFSTW--SVPQVEVSFYLADSVVEDV---- 85
           S+L +    LH H+ITD  +    + ILQ     +  +V        +  +++ED+    
Sbjct: 180 SLLKHTSAQLHLHVITDKESSPSVMEILQRQIRRFRRTVIYTMYDVKVCSNIIEDIAAKL 239

Query: 86  ----SWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQR 141
               S  PN +YS     L L L ++   +L + I+LD D++F +D+  L+  F      
Sbjct: 240 SPYFSSAPNSYYSDSLFFLSLGLHRIADASLNRAILLDCDIVFRSDVRLLFNEFDHFLPH 299

Query: 142 QTIGLVENQSDWYLGKLWKNHKPWPALGRG 171
           Q  GL    +  Y   L++    +P    G
Sbjct: 300 QLYGLAPELTPVYRHILYRYRVRYPKTSFG 329


>gi|213688380|ref|NP_001094357.1| glucoside xylosyltransferase 1 [Rattus norvegicus]
 gi|152125898|sp|Q6GX83.2|GXLT1_RAT RecName: Full=Glucoside xylosyltransferase 1; AltName:
           Full=Glycosyltransferase 8 domain-containing protein 3;
           AltName: Full=S33-D
          Length = 435

 Score = 62.8 bits (151), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 44/173 (25%), Positives = 76/173 (43%), Gaps = 19/173 (10%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L + +   + +DTDV+F   +  +W+L  R    Q   +     +  +G  W 
Sbjct: 195 RLFLPLIL-KGVDSLLYVDTDVLFLRPVDDIWSLLERFNSTQIAAMAPEHEEPRVG--WY 251

Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
           N     P  GR G N+GV+L+++T++R            + W      + +K+ L   W 
Sbjct: 252 NRFARHPYYGRTGVNSGVMLMNMTRMRRKYFKNDMTTARLQWGDILMPLLKKYKLNITW- 310

Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
              DQD+ N +   +P  ++  PCQWN +             E   + I+H N
Sbjct: 311 --GDQDLLNIMFYHNPESLFVFPCQWNYRPDHCIYGSNCREAEEEGVFILHGN 361



 Score = 44.3 bits (103), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 37/149 (24%), Positives = 64/149 (42%), Gaps = 8/149 (5%)

Query: 10  PTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVP 69
           PT E + +A+V  G +     VT++KS L +   PLH H+  +    +  +    +WS  
Sbjct: 107 PT-EKMHLAVVACG-DRLEETVTMLKSALIFSIKPLHVHIFAEDQLHDSFKDRLDSWSFL 164

Query: 70  QVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIA 129
           Q   ++ L        S +  K         +L LP +L + +   + +DTDV+F   + 
Sbjct: 165 Q-RFNYSLYPITFPSDSAMEWKKLFKPCASQRLFLPLIL-KGVDSLLYVDTDVLFLRPVD 222

Query: 130 QLWALFSRLRQRQTIGLV----ENQSDWY 154
            +W+L  R    Q   +     E +  WY
Sbjct: 223 DIWSLLERFNSTQIAAMAPEHEEPRVGWY 251


>gi|296211438|ref|XP_002752410.1| PREDICTED: glucoside xylosyltransferase 1 isoform 1 [Callithrix
           jacchus]
          Length = 437

 Score = 62.8 bits (151), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 42/173 (24%), Positives = 76/173 (43%), Gaps = 19/173 (10%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L E +   + +DTD++F   +  +W+   +    Q   +     +  +G  W 
Sbjct: 197 RLFLPLILKE-VDSLLYVDTDILFLRPVDDIWSFLKKFNSTQIAAMAPEHEEPRIG--WY 253

Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
           N     P  G+ G N+GV+L+++T++R            + W      + +K+ L   W 
Sbjct: 254 NRFARHPYYGKTGVNSGVMLMNMTRMRRKYFKNDMTTVRLQWGDILMPLLKKYKLNITW- 312

Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
              DQD+ N + S +P  ++  PCQWN +             E   + I+H N
Sbjct: 313 --GDQDLLNIMFSHNPESLFVFPCQWNYRPDHCIYGSNCQEAEEDGIFILHGN 363



 Score = 39.3 bits (90), Expect = 7.3,   Method: Compositional matrix adjust.
 Identities = 34/155 (21%), Positives = 63/155 (40%), Gaps = 9/155 (5%)

Query: 9   IPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSV 68
           I   E I +A+V  G       +T++KS + +   PL FH+  +    +  +    +WS 
Sbjct: 107 IQPVEKIHLAVVACG-ERLEETMTMLKSAIIFSIRPLQFHIFAEDQLHHSFKGRLDSWSF 165

Query: 69  PQV-EVSFYLADSVVEDVS-WIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 126
            Q    + Y      E+ + W   K         +L LP +L E +   + +DTD++F  
Sbjct: 166 LQTFNYTLYPITFPSENAAEW---KKLFKPCASQRLFLPLILKE-VDSLLYVDTDILFLR 221

Query: 127 DIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKN 161
            +  +W+   +    Q   +     +  +G  W N
Sbjct: 222 PVDDIWSFLKKFNSTQIAAMAPEHEEPRIG--WYN 254


>gi|410964153|ref|XP_003988620.1| PREDICTED: glucoside xylosyltransferase 1 [Felis catus]
          Length = 476

 Score = 62.4 bits (150), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 43/173 (24%), Positives = 76/173 (43%), Gaps = 19/173 (10%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L E +   + +DTD++F   +  +W+L  +    Q   +     +  +G  W 
Sbjct: 236 RLFLPLILKE-VDSLLYVDTDILFLRPVDDIWSLLKKFNSTQIAAMAPEHEEPRIG--WY 292

Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
           N     P  G+ G N+GV+L+++T++R            + W      + +K+ L   W 
Sbjct: 293 NRFARHPYYGKTGVNSGVMLMNMTRMRRKYFKNDMTAVRLRWGDILMPLLKKYKLNITW- 351

Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
              DQD+ N I   +P  ++  PCQWN +             E   + I+H N
Sbjct: 352 --GDQDLLNIIFFHNPESLFVFPCQWNYRPDHCIYGSNCQEAEEEGIFILHGN 402



 Score = 41.6 bits (96), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 33/149 (22%), Positives = 63/149 (42%), Gaps = 9/149 (6%)

Query: 15  IQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQV-EV 73
           + +A+V  G       +T++KS + +   PL FH+  +    +  + +  +WS  Q    
Sbjct: 152 MHLAVVACG-ERLEETITMLKSAIIFSIRPLQFHIFAEDQLHDSFKGILESWSFLQTFNY 210

Query: 74  SFYLADSVVEDVS-WIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLW 132
           S Y      E+ + W   K         +L LP +L E +   + +DTD++F   +  +W
Sbjct: 211 SLYPITFPSENAAEW---KKLFKPCASQRLFLPLILKE-VDSLLYVDTDILFLRPVDDIW 266

Query: 133 ALFSRLRQRQTIGLVENQSDWYLGKLWKN 161
           +L  +    Q   +     +  +G  W N
Sbjct: 267 SLLKKFNSTQIAAMAPEHEEPRIG--WYN 293


>gi|125808799|ref|XP_001360878.1| GA10968 [Drosophila pseudoobscura pseudoobscura]
 gi|54636050|gb|EAL25453.1| GA10968 [Drosophila pseudoobscura pseudoobscura]
          Length = 459

 Score = 62.4 bits (150), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 51/204 (25%), Positives = 83/204 (40%), Gaps = 48/204 (23%)

Query: 174 LADSVVEDV--------SWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 225
           +  +++ED+        S  PN +YS     L L L ++   +L + I+LD D++F +D+
Sbjct: 227 VCSNIIEDIAAKLSPYFSSAPNSYYSDSLFFLSLGLHRIADASLNRAILLDCDIVFRSDV 286

Query: 226 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWP--ALGR---------------- 267
             L+  F      Q  GL    +  Y   L++    +P  + G                 
Sbjct: 287 RLLFNEFDHFLPHQLYGLAPELTPVYRHILYRYRVRYPKTSFGNPYYPMNHEMGNQHNHN 346

Query: 268 -------GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWT---------SLADQDI 311
                  G N+GV+LL   ++R+       +   EK   + + T          L DQD 
Sbjct: 347 HERHGYPGLNSGVVLLLFNRIRNS------KSYLEKLTNSEVQTLVAKYSFKGHLGDQDF 400

Query: 312 FNAIISEHPYLVYTLPCQWNVQLS 335
           F  +  E+P L+Y L C WN QL 
Sbjct: 401 FTLLGYEYPNLIYRLDCVWNRQLC 424



 Score = 48.9 bits (115), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 38/150 (25%), Positives = 65/150 (43%), Gaps = 14/150 (9%)

Query: 36  SILFYRKNPLHFHLITDTVA----LNILQTLFSTW--SVPQVEVSFYLADSVVEDV---- 85
           S+L +    LH H+ITD  +    + ILQ     +  +V        +  +++ED+    
Sbjct: 180 SLLKHTSAQLHLHVITDKESSPSVMEILQRQIRRFRRTVIYTMYDVKVCSNIIEDIAAKL 239

Query: 86  ----SWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQR 141
               S  PN +YS     L L L ++   +L + I+LD D++F +D+  L+  F      
Sbjct: 240 SPYFSSAPNSYYSDSLFFLSLGLHRIADASLNRAILLDCDIVFRSDVRLLFNEFDHFLPH 299

Query: 142 QTIGLVENQSDWYLGKLWKNHKPWPALGRG 171
           Q  GL    +  Y   L++    +P    G
Sbjct: 300 QLYGLAPELTPVYRHILYRYRVRYPKTSFG 329


>gi|403269344|ref|XP_003926707.1| PREDICTED: glucoside xylosyltransferase 1 isoform 1 [Saimiri
           boliviensis boliviensis]
          Length = 437

 Score = 62.4 bits (150), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 42/173 (24%), Positives = 76/173 (43%), Gaps = 19/173 (10%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L E +   + +DTD++F   +  +W+   +    Q   +     +  +G  W 
Sbjct: 197 RLFLPLILKE-VDSLLYVDTDILFLRPVDDIWSFLKKFNSTQIAAMAPEHEEPRIG--WY 253

Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
           N     P  G+ G N+GV+L+++T++R            + W      + +K+ L   W 
Sbjct: 254 NRFARHPYYGKTGVNSGVMLMNMTRMRRKYFKNDMTTVRLQWGDILMPLLKKYKLNITW- 312

Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
              DQD+ N + S +P  ++  PCQWN +             E   + I+H N
Sbjct: 313 --GDQDLLNIMFSHNPESLFVFPCQWNYRPDHCIYGSNCQEAEEDGIFILHGN 363


>gi|55637891|ref|XP_522357.1| PREDICTED: glucoside xylosyltransferase 1 isoform 2 [Pan
           troglodytes]
 gi|410223168|gb|JAA08803.1| glucoside xylosyltransferase 1 [Pan troglodytes]
 gi|410250938|gb|JAA13436.1| glucoside xylosyltransferase 1 [Pan troglodytes]
 gi|410298480|gb|JAA27840.1| glucoside xylosyltransferase 1 [Pan troglodytes]
          Length = 409

 Score = 62.4 bits (150), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 42/173 (24%), Positives = 76/173 (43%), Gaps = 19/173 (10%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L E +   + +DTD++F   +  +W+L  +    Q   +     +  +G  W 
Sbjct: 169 RLFLPLILKE-VDSLLYVDTDILFLRPVDDIWSLLKKFNSTQIAAMAPEHEEPRIG--WY 225

Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
           N     P  G+ G N+GV+L+++T++R            + W      + +K+ L   W 
Sbjct: 226 NRFARHPYYGKTGVNSGVMLMNMTRMRRKYFKNDMTTVRLQWGDILMPLLKKYKLNITW- 284

Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
              DQD+ N +   +P  ++  PCQWN +             E   + I+H N
Sbjct: 285 --GDQDLLNIVFFHNPESLFVFPCQWNYRPDHCIYGSNCQEAEEEGIFILHGN 335


>gi|300175417|emb|CBK20728.2| unnamed protein product [Blastocystis hominis]
          Length = 311

 Score = 62.4 bits (150), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 36/130 (27%), Positives = 69/130 (53%), Gaps = 5/130 (3%)

Query: 478 DVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRNIGYH 537
           DVT  +Q S++R + ++ + + W+GP+S+  Y+   E + +      +  LR   ++ Y 
Sbjct: 63  DVTFASQASINRWEFMKQILRFWKGPVSIVYYLKPDEVETYKRLKEKT-PLRDNVHLAYM 121

Query: 538 VVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKK 597
           + +     YPIN LRN A++ ++T + +  D+D  P   LY  L    RS     +  K 
Sbjct: 122 IQHNNSQ-YPINKLRNKAISYITTSHYYFSDMDNWPSPNLYDELLRLPRSA---LNDDKL 177

Query: 598 VLVVPAFETQ 607
           +++VPA++ +
Sbjct: 178 IIIVPAYQIK 187


>gi|403169096|ref|XP_003328632.2| hypothetical protein PGTG_10591 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
 gi|375167800|gb|EFP84213.2| hypothetical protein PGTG_10591 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
          Length = 571

 Score = 62.4 bits (150), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 62/244 (25%), Positives = 101/244 (41%), Gaps = 51/244 (20%)

Query: 458 QMVEINPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQ 517
           Q  EINP             D+++   ++ +R ++   L ++++GPIS T+++SD+ A  
Sbjct: 240 QSGEINPK------------DISITTLVTSNRFKVFAKLVENYQGPISATIHVSDSPAIL 287

Query: 518 FLSYAGNSEALRSRRN----IGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLP 573
                   E   S ++    +  H+V    +    N  RNVA     T YV +LD+DF  
Sbjct: 288 KPLLESLDELYHSSKSMSTWVDIHLVVDSFD-RQFNMWRNVAKFFARTDYVMMLDVDFWI 346

Query: 574 MFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRYR-----TAFPA------------- 615
                  +  S   +D     G    V+PAFE  +         FP+             
Sbjct: 347 CTNFRQRILGSPELLDKLA-AGTSAFVIPAFEYTKQSDGVDPDGFPSDKQALMKLVSSGQ 405

Query: 616 ----------SHAPTNFSRWVNATTPYQI----EWAPDFEPYIVAHRD-LPRYDTRFVGF 660
                      H  TN++R+ +A    QI     ++  +EPY+V  +D  P  D RF+G+
Sbjct: 406 IGMFHKSWKPGHGSTNYTRFYHADQLDQIYPASGYSHSYEPYVVYKKDKTPFCDERFIGY 465

Query: 661 GWNK 664
           G NK
Sbjct: 466 GANK 469


>gi|198416660|ref|XP_002125009.1| PREDICTED: similar to predicted protein [Ciona intestinalis]
          Length = 363

 Score = 62.4 bits (150), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 67/278 (24%), Positives = 102/278 (36%), Gaps = 98/278 (35%)

Query: 478 DVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEA-----------QQF-------- 518
           DV+L  Q S++ L  +  L   W+GP+S ++++ + +A           Q F        
Sbjct: 29  DVSLATQCSVNHLHHLVELLDRWDGPVSCSVFVPNQDASFADDAIKRLRQCFPKINRHVS 88

Query: 519 ------------LSYAGN------SEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVS 560
                       LS  G       SE L    N GY      G  +P N LRN A + V 
Sbjct: 89  FHLVYPSTHIADLSLVGTWLELSCSELLVRLENFGYQNYDVAGISFPHNALRNAARSGVL 148

Query: 561 TPYVFLLDIDFLPMFGLY-PYLKSSIRS-------MDMHGHGGKKVLVVPAFETQRYRTA 612
           T YV L+DID +P  GL   +++ + R+       MD+  +      VVP FE       
Sbjct: 149 TQYVLLVDIDVMPNVGLRNQFIEFANRNSLFNTSVMDLTAY------VVPVFEVLA-GLE 201

Query: 613 FPASH------------------------------APTNFSRWV---------------- 626
           FP                                   ++F RW                 
Sbjct: 202 FPKDKKQLLNGLKMVPFGLFITKLAGGVIKLKTLTGGSHFHRWKTLFLNVQLLQLQGNGE 261

Query: 627 NATTPYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           N    +  EW   +EP+ +A RD+P +D RF  +G+++
Sbjct: 262 NLNVAFIAEWDKSWEPFYIARRDVPMFDERFKQYGFDR 299


>gi|196002501|ref|XP_002111118.1| hypothetical protein TRIADDRAFT_50070 [Trichoplax adhaerens]
 gi|190587069|gb|EDV27122.1| hypothetical protein TRIADDRAFT_50070 [Trichoplax adhaerens]
          Length = 171

 Score = 62.4 bits (150), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 42/136 (30%), Positives = 62/136 (45%), Gaps = 18/136 (13%)

Query: 215 LDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK-NHK--------PWPAL 265
           +D+D+ F TDI  L+ LF   +    IG+       Y    W+  H+        P P  
Sbjct: 1   MDSDLKFTTDIRNLFNLFHEFQPNNIIGIGRESQPVYRHVFWQYRHENPKTRVGDPPPHG 60

Query: 266 GRGFNTGVILLDLTKLRDISWAG------FWRIIAEKFLLTRLWTSLADQDIFNAIISEH 319
             GFN+GV+LLDL ++R     G          +A+K+        L DQD++  +  E+
Sbjct: 61  LTGFNSGVLLLDLERMRKSKLFGQVLDENLPENLAKKY---HFRGHLGDQDLYTLLHMEY 117

Query: 320 PYLVYTLPCQWNVQLS 335
             + Y LPC WN QL 
Sbjct: 118 NDMFYVLPCTWNRQLC 133


>gi|153791276|ref|NP_001093120.1| glucoside xylosyltransferase 1 isoform 2 [Homo sapiens]
 gi|24659591|gb|AAH39145.1| GLT8D3 protein [Homo sapiens]
 gi|119578236|gb|EAW57832.1| hCG39367, isoform CRA_b [Homo sapiens]
 gi|193785001|dbj|BAG54154.1| unnamed protein product [Homo sapiens]
          Length = 409

 Score = 62.4 bits (150), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 42/173 (24%), Positives = 76/173 (43%), Gaps = 19/173 (10%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L E +   + +DTD++F   +  +W+L  +    Q   +     +  +G  W 
Sbjct: 169 RLFLPLILKE-VDSLLYVDTDILFLRPVDDIWSLLKKFNSTQIAAMAPEHEEPRIG--WY 225

Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
           N     P  G+ G N+GV+L+++T++R            + W      + +K+ L   W 
Sbjct: 226 NRFARHPYYGKTGVNSGVMLMNMTRMRRKYFKNDMTTVRLQWGDILMPLLKKYKLNITW- 284

Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
              DQD+ N +   +P  ++  PCQWN +             E   + I+H N
Sbjct: 285 --GDQDLLNIVFFHNPESLFVFPCQWNYRPDHCIYGSNCQEAEEGGIFILHGN 335



 Score = 38.9 bits (89), Expect = 9.9,   Method: Compositional matrix adjust.
 Identities = 34/155 (21%), Positives = 64/155 (41%), Gaps = 9/155 (5%)

Query: 9   IPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSV 68
           I   E + +A+V  G     ++ T++KS + +   PL FH+  +    +  +     WS 
Sbjct: 79  IQPVEKMHLAVVACGERLEETM-TMLKSAIIFSIKPLQFHIFAEDQLHHSFKGRLDNWSF 137

Query: 69  PQV-EVSFYLADSVVEDVS-WIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 126
            Q    + Y      E+ + W   K         +L LP +L E +   + +DTD++F  
Sbjct: 138 LQTFNYTLYPITFPSENAAEW---KKLFKPCASQRLFLPLILKE-VDSLLYVDTDILFLR 193

Query: 127 DIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKN 161
            +  +W+L  +    Q   +     +  +G  W N
Sbjct: 194 PVDDIWSLLKKFNSTQIAAMAPEHEEPRIG--WYN 226


>gi|157113908|ref|XP_001657920.1| n-acetyllactosaminide beta-1,3-n-acetylglucosaminyltransferase
           [Aedes aegypti]
 gi|108877534|gb|EAT41759.1| AAEL006638-PA [Aedes aegypti]
          Length = 482

 Score = 62.4 bits (150), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 62/277 (22%), Positives = 103/277 (37%), Gaps = 85/277 (30%)

Query: 468 DEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEA----QQFLSYAG 523
           ++Y   S+   V L  Q S++++  +  +  HW GPIS  ++ +  +     Q +LSY  
Sbjct: 118 EKYPELSERFSVCLATQSSLEKIYSLVQVSHHWSGPISTAIFAAGNDELYLLQIYLSYLR 177

Query: 524 NS-EALRSRRNIGYHVVYKE------------GNF------------------------- 545
           N  + +R R  + +H+   +            G+F                         
Sbjct: 178 NCFKTIRER--VSFHLALPKDRAPTHLKSIHVGDFTKFDCAKPEATLNDLIKLRKVETNK 235

Query: 546 ------YPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVL 599
                 YP N LRNVA     + +VFL D+D +P      +L   +R+     H      
Sbjct: 236 WRIKNPYPQNHLRNVARKGCQSTHVFLTDVDIIPSVNFAEHLDKFLRNQRPQHH--PTAY 293

Query: 600 VVPAFETQRYRTAFPASHA------------------------PTNFSRWVNATTPYQIE 635
           VVP +E    R  FP +                           TNFSRW    +  ++ 
Sbjct: 294 VVPTYELDE-RVRFPRNKTDLIRLANKGLARPFHHKVFIYNQFATNFSRWQADVSDSEVT 352

Query: 636 WAPD--------FEPYIVAHRDLPRYDTRFVGFGWNK 664
                       +EP+ VA   +P +D RF+G+G+ +
Sbjct: 353 HVSHNVTNFEFLYEPFYVAPDTVPPHDERFLGYGYTR 389


>gi|260822427|ref|XP_002606603.1| hypothetical protein BRAFLDRAFT_209578 [Branchiostoma floridae]
 gi|229291947|gb|EEN62613.1| hypothetical protein BRAFLDRAFT_209578 [Branchiostoma floridae]
          Length = 381

 Score = 62.4 bits (150), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 67/270 (24%), Positives = 97/270 (35%), Gaps = 88/270 (32%)

Query: 478 DVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRS-----RR 532
           D TLV Q S+  L  +  L   W+GP+S+ ++     A       G    LR      R+
Sbjct: 30  DATLVTQCSVQNLYHIVELSVRWDGPVSVAIFAPGENAAFADDIVG---GLRRCYPLIRK 86

Query: 533 NIGYHVVY--------------------------------------KEGNFYPINTLRNV 554
            + +H+VY                                       E   YP N LRNV
Sbjct: 87  YVSFHLVYPTSREAIIYEESPTLLNTPCQDILHLAKYHKSATLNYGAEKEPYPNNLLRNV 146

Query: 555 ALNQVSTPYVFLLDIDFLP---------MFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFE 605
           A    ST ++F++DID LP          F L   L  S  S+ +  H    V VVPAFE
Sbjct: 147 ARQGSSTEFIFVVDIDMLPSEKLRTDFVHFALENRLFDSRTSL-VSDH-LDTVYVVPAFE 204

Query: 606 TQR-----------------------YRTAFPASHAPTNFSRWVNATT--------PYQI 634
            Q                        YR       + T++ +W  A +         Y +
Sbjct: 205 LQLNASIPKDKTELLEALQQGQARPFYRDICWKCQSSTDYDKWTAAKSSSKRRLDVAYNV 264

Query: 635 EWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
            W   +EP+ +    +P YD RF  +G+N+
Sbjct: 265 TWKDPWEPFYIGRWSVPLYDERFKQYGFNR 294


>gi|395841533|ref|XP_003793589.1| PREDICTED: glucoside xylosyltransferase 1 isoform 2 [Otolemur
           garnettii]
          Length = 406

 Score = 62.4 bits (150), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 43/173 (24%), Positives = 76/173 (43%), Gaps = 19/173 (10%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L E +   + +DTD++F   +  +W+L  +    Q   +     +  +G  W 
Sbjct: 166 RLFLPLILKE-VDSLLYVDTDILFLRPVDDIWSLLKKFNSTQIAAMAPEHEEPRIG--WY 222

Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
           N     P  G+ G N+GV+L+++T++R            + W      + +K+ L   W 
Sbjct: 223 NRFARHPYYGKTGVNSGVMLMNMTRIRRKYFKNDMTSVRLRWGDILMPLLKKYKLNITW- 281

Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
              DQD+ N I   +P  ++  PCQWN +             E   + I+H N
Sbjct: 282 --GDQDLLNIIFFHNPESLFVFPCQWNYRPDHCIYGSNCHEAEEEGIFILHGN 332


>gi|426372204|ref|XP_004053018.1| PREDICTED: glucoside xylosyltransferase 1 isoform 2 [Gorilla
           gorilla gorilla]
          Length = 409

 Score = 62.4 bits (150), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 42/173 (24%), Positives = 76/173 (43%), Gaps = 19/173 (10%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L E +   + +DTD++F   +  +W+L  +    Q   +     +  +G  W 
Sbjct: 169 RLFLPLILKE-IDSLLYVDTDILFLRPVDDIWSLLKKFNSTQIAAMAPEHEEPRIG--WY 225

Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
           N     P  G+ G N+GV+L+++T++R            + W      + +K+ L   W 
Sbjct: 226 NRFARHPYYGKTGVNSGVMLMNMTRMRRKYFKNDMTTVRLQWGDILMPLLKKYKLNITW- 284

Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
              DQD+ N +   +P  ++  PCQWN +             E   + I+H N
Sbjct: 285 --GDQDLLNIMFFHNPESLFVFPCQWNYRPDHCIYGSNCQEAEEEGIFILHGN 335



 Score = 39.3 bits (90), Expect = 7.0,   Method: Compositional matrix adjust.
 Identities = 34/155 (21%), Positives = 63/155 (40%), Gaps = 9/155 (5%)

Query: 9   IPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSV 68
           I   E + +A+V  G       +T++KS + +   PL FH+  +    +  +     WS 
Sbjct: 79  IQPVEKMHLAVVACG-ERLEETMTMLKSAIIFSIKPLQFHIFAEDQLHHSFKGRLDNWSF 137

Query: 69  PQV-EVSFYLADSVVEDVS-WIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 126
            Q    + Y      E+ + W   K         +L LP +L E +   + +DTD++F  
Sbjct: 138 LQTFNYTLYPITFPSENAAEW---KKLFKPCASQRLFLPLILKE-IDSLLYVDTDILFLR 193

Query: 127 DIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKN 161
            +  +W+L  +    Q   +     +  +G  W N
Sbjct: 194 PVDDIWSLLKKFNSTQIAAMAPEHEEPRIG--WYN 226


>gi|350411357|ref|XP_003489320.1| PREDICTED: uncharacterized protein C3orf21 homolog [Bombus
           impatiens]
          Length = 357

 Score = 62.4 bits (150), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 45/164 (27%), Positives = 76/164 (46%), Gaps = 17/164 (10%)

Query: 186 PNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVE 245
           P  +YS     L L L ++ P   +   + D D  F  DI +L+  F+    +   GL  
Sbjct: 159 PGTYYSDALFFLSLGLHRIAPAEQSVAAMFDVDTKFRKDIKELFEEFNSFGSQALFGLAP 218

Query: 246 NQSDWY--LGKLWKNHKPWPALGR--------GFNTGVILLDLTKLRDISWAGFWRIIAE 295
             +  Y  +  L+++  P    G         G+N+GV+L +L KLR+ S   +  I+  
Sbjct: 219 ELTPVYRHVLYLYRSKHPNTLFGEPASSGGYCGYNSGVVLFNLDKLRNSSV--YDAIVKR 276

Query: 296 KFL--LTRLW---TSLADQDIFNAIISEHPYLVYTLPCQWNVQL 334
           + +  +T+ +     L DQD +  +  E P L++T+ C WN QL
Sbjct: 277 ESVDAMTKKYHFKGHLGDQDFYTILGMEKPELIHTIDCGWNRQL 320


>gi|301772784|ref|XP_002921812.1| PREDICTED: glucoside xylosyltransferase 1-like [Ailuropoda
           melanoleuca]
          Length = 458

 Score = 62.4 bits (150), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 44/173 (25%), Positives = 78/173 (45%), Gaps = 19/173 (10%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L E +   + +DTD++F   +  +W+L  +    Q   +     +  +G  W 
Sbjct: 218 RLFLPLILKE-VDSLLYVDTDILFLRPVDDIWSLLKKFNSTQIAAMAPEHEEPRIG--WY 274

Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLR------DIS-----WAGFWRIIAEKFLLTRLWT 304
           N     P  G+ G N+GV+L+++T++R      D++     W      + +K+ L   W 
Sbjct: 275 NRFARHPYYGKTGVNSGVMLMNMTRMRRKYFKNDMTTVRLLWGDILMPLLKKYKLNITW- 333

Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
              DQD+ N I   +P  ++  PCQWN +             E   + I+H N
Sbjct: 334 --GDQDLLNIIFFHNPESLFVFPCQWNYRPDHCIYGSNCQEAEAEGIFILHGN 384



 Score = 42.0 bits (97), Expect = 0.97,   Method: Compositional matrix adjust.
 Identities = 34/149 (22%), Positives = 63/149 (42%), Gaps = 9/149 (6%)

Query: 15  IQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQV-EV 73
           + +A+V  G       VT++KS + +   PL FH+  +    +  + +  +WS  Q    
Sbjct: 134 MHLAVVACG-ERLEETVTMLKSAIIFSIRPLQFHIFAEDQLHDSFKGILDSWSFLQTFNY 192

Query: 74  SFYLADSVVEDVS-WIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLW 132
           S Y      E+ + W   K         +L LP +L E +   + +DTD++F   +  +W
Sbjct: 193 SLYPITFPSENAAEW---KKLFKPCASQRLFLPLILKE-VDSLLYVDTDILFLRPVDDIW 248

Query: 133 ALFSRLRQRQTIGLVENQSDWYLGKLWKN 161
           +L  +    Q   +     +  +G  W N
Sbjct: 249 SLLKKFNSTQIAAMAPEHEEPRIG--WYN 275


>gi|42415433|gb|AAS15686.1| GH27387p [Drosophila melanogaster]
          Length = 186

 Score = 62.4 bits (150), Expect = 9e-07,   Method: Composition-based stats.
 Identities = 33/96 (34%), Positives = 55/96 (57%), Gaps = 5/96 (5%)

Query: 263 PALGR-GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPY 321
           P  GR G N+GV+L++LT++R++ W      I +++ L  +W    DQDI N +   HP 
Sbjct: 23  PFYGRLGVNSGVMLMNLTRMREMKWEQHIVSIHKEYKLRIIW---GDQDIINILFYYHPD 79

Query: 322 LVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
            +Y +PC++N +  D+     +C    T +K+IH N
Sbjct: 80  KLYIMPCEYNYR-PDHCMYMSICNMSQTGVKVIHGN 114


>gi|156367099|ref|XP_001627257.1| predicted protein [Nematostella vectensis]
 gi|156214161|gb|EDO35157.1| predicted protein [Nematostella vectensis]
          Length = 325

 Score = 62.0 bits (149), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 58/265 (21%), Positives = 98/265 (36%), Gaps = 83/265 (31%)

Query: 477 NDVTLVAQLSMDRLQMVEMLFKHWEGPISLT-----LYMSDAEAQQFLSYAGNSEALRSR 531
           +DV LV   +++ L  +  + +HW+GPIS+      L++  A       +  N +    +
Sbjct: 3   DDVVLVTHCTVNHLHYLLDIVEHWQGPISVAVVVPGLHIVSAYKSMIGLHLCNKDV---Q 59

Query: 532 RNIGYHVVY--------------------------------------KEGNF-----YPI 548
           + + +HVVY                                         N+     YP 
Sbjct: 60  QRVTFHVVYPLSHPALDNRLSNYVRLARAELPLTCEEFMVTLQTGLANSANYGGEVPYPS 119

Query: 549 NTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQR 608
           N LRNV        +VF++DID +    LY    +  R  ++       V VVPAFE+Q 
Sbjct: 120 NVLRNVGREHSKAKFVFVVDIDMMCNGNLYKDFLNFSRENNLF---SMTVYVVPAFESQE 176

Query: 609 -----------------------YRTAFPASHAPTNFSRWVNATT------PYQIEWAPD 639
                                  Y  A     +PTN+ +W  + +       +   W   
Sbjct: 177 NLPPNTNKMELIKMWENGKVLPFYYQACRKCQSPTNYDKWKRSRSSSGLHVSFTRSWTDP 236

Query: 640 FEPYIVAHRDLPRYDTRFVGFGWNK 664
           +EP+ ++   +P YD RF  +G+N+
Sbjct: 237 WEPFYISLNSVPLYDGRFKQYGFNR 261


>gi|397511420|ref|XP_003826072.1| PREDICTED: glucoside xylosyltransferase 1 [Pan paniscus]
          Length = 394

 Score = 62.0 bits (149), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 42/173 (24%), Positives = 76/173 (43%), Gaps = 19/173 (10%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L E +   + +DTD++F   +  +W+L  +    Q   +     +  +G  W 
Sbjct: 154 RLFLPLILKE-VDSLLYVDTDILFLRPVDDIWSLLKKFNSTQIAAMAPEHEEPRIG--WY 210

Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
           N     P  G+ G N+GV+L+++T++R            + W      + +K+ L   W 
Sbjct: 211 NRFARHPYYGKTGVNSGVMLMNMTRMRRKYFKNDMTTVRLQWGDILMPLLKKYKLNITW- 269

Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
              DQD+ N +   +P  ++  PCQWN +             E   + I+H N
Sbjct: 270 --GDQDLLNIVFFHNPESLFVFPCQWNYRPDHCIYGSNCQEAEEEGIFILHGN 320



 Score = 39.3 bits (90), Expect = 7.5,   Method: Compositional matrix adjust.
 Identities = 34/155 (21%), Positives = 63/155 (40%), Gaps = 9/155 (5%)

Query: 9   IPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSV 68
           I   E + +A+V  G       +T++KS + +   PL FH+  +    +  +     WS 
Sbjct: 64  IQPVEKMHLAVVACG-ERLEETMTMLKSAIIFSIKPLQFHIFAEDQLHHSFKGRLDNWSF 122

Query: 69  PQV-EVSFYLADSVVEDVS-WIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 126
            Q    + Y      E+ + W   K         +L LP +L E +   + +DTD++F  
Sbjct: 123 LQTFNYTLYPITFPSENAAEW---KKLFKPCASQRLFLPLILKE-VDSLLYVDTDILFLR 178

Query: 127 DIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKN 161
            +  +W+L  +    Q   +     +  +G  W N
Sbjct: 179 PVDDIWSLLKKFNSTQIAAMAPEHEEPRIG--WYN 211


>gi|380796193|gb|AFE69972.1| glucoside xylosyltransferase 1 isoform 1, partial [Macaca mulatta]
          Length = 375

 Score = 62.0 bits (149), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 42/173 (24%), Positives = 76/173 (43%), Gaps = 19/173 (10%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L E +   + +DTD++F   +  +W+L  +    Q   +     +  +G  W 
Sbjct: 135 RLFLPLILKE-VDSLLYVDTDILFLRPVDDIWSLLKKFNSTQIAAMAPEHEEPRIG--WY 191

Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
           N     P  G+ G N+GV+L+++T++R            + W      + +K+ L   W 
Sbjct: 192 NRFARHPYYGKTGVNSGVMLMNMTRMRRKYFKNDMTTVRLQWGDILMPLLKKYKLNITW- 250

Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
              DQD+ N +   +P  ++  PCQWN +             E   + I+H N
Sbjct: 251 --GDQDLLNIVFFHNPESLFVFPCQWNYRPDHCIYGSNCQEAEEEGIFILHGN 301



 Score = 39.7 bits (91), Expect = 5.6,   Method: Compositional matrix adjust.
 Identities = 33/155 (21%), Positives = 64/155 (41%), Gaps = 9/155 (5%)

Query: 9   IPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSV 68
           I   E + +A+V  G       +T++KS + +   PL FH+  +    +  +    +W+ 
Sbjct: 45  IQPVEKMHLAVVACG-ERLEETMTMLKSAIIFSMKPLQFHIFAEDQLHHSFKDRLDSWAF 103

Query: 69  PQV-EVSFYLADSVVEDVS-WIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 126
            Q    + Y      E+ + W   K         +L LP +L E +   + +DTD++F  
Sbjct: 104 LQTFNYTLYPITFPSENAAEW---KKLFKPCASQRLFLPLILKE-VDSLLYVDTDILFLR 159

Query: 127 DIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKN 161
            +  +W+L  +    Q   +     +  +G  W N
Sbjct: 160 PVDDIWSLLKKFNSTQIAAMAPEHEEPRIG--WYN 192


>gi|340729673|ref|XP_003403121.1| PREDICTED: uncharacterized protein C3orf21 homolog [Bombus
           terrestris]
          Length = 357

 Score = 62.0 bits (149), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 45/164 (27%), Positives = 76/164 (46%), Gaps = 17/164 (10%)

Query: 186 PNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVE 245
           P  +YS     L L L ++ P   +   + D D  F  DI +L+  F+    +   GL  
Sbjct: 159 PGTYYSDALFFLSLGLHRIAPAEQSVAAMFDVDTKFRKDIKELFEEFNTFGNQALFGLAP 218

Query: 246 NQSDWY--LGKLWKNHKPWPALGR--------GFNTGVILLDLTKLRDISWAGFWRIIAE 295
             +  Y  +  L+++  P    G         G+N+GV+L +L KLR+ S   +  I+  
Sbjct: 219 ELTPVYRHVLYLYRSKHPNTLFGEPASSGGYHGYNSGVVLFNLDKLRNSSV--YDAIVKR 276

Query: 296 KFL--LTRLW---TSLADQDIFNAIISEHPYLVYTLPCQWNVQL 334
           + +  +T+ +     L DQD +  +  E P L++T+ C WN QL
Sbjct: 277 ESVDAMTKKYHFKGHLGDQDFYTILGMEKPELIHTIDCGWNRQL 320


>gi|187469806|gb|AAI67068.1| Glt8d3 protein [Rattus norvegicus]
          Length = 359

 Score = 62.0 bits (149), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 44/173 (25%), Positives = 76/173 (43%), Gaps = 19/173 (10%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L + +   + +DTDV+F   +  +W+L  R    Q   +     +  +G  W 
Sbjct: 119 RLFLPLIL-KGVDSLLYVDTDVLFLRPVDDIWSLLERFNSTQIAAMAPEHEEPRVG--WY 175

Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
           N     P  GR G N+GV+L+++T++R            + W      + +K+ L   W 
Sbjct: 176 NRFARHPYYGRTGVNSGVMLMNMTRMRRKYFKNDMTTARLQWGDILMPLLKKYKLNITW- 234

Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
              DQD+ N +   +P  ++  PCQWN +             E   + I+H N
Sbjct: 235 --GDQDLLNIMFYHNPESLFVFPCQWNYRPDHCIYGSNCREAEEEGVFILHGN 285



 Score = 44.7 bits (104), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 35/146 (23%), Positives = 62/146 (42%), Gaps = 7/146 (4%)

Query: 13  EVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQVE 72
           E + +A+V  G +     VT++KS L +   PLH H+  +    +  +    +WS  Q  
Sbjct: 33  EKMHLAVVACG-DRLEETVTMLKSALIFSIKPLHVHIFAEDQLHDSFKDRLDSWSFLQ-R 90

Query: 73  VSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLW 132
            ++ L        S +  K         +L LP +L + +   + +DTDV+F   +  +W
Sbjct: 91  FNYSLYPITFPSDSAMEWKKLFKPCASQRLFLPLIL-KGVDSLLYVDTDVLFLRPVDDIW 149

Query: 133 ALFSRLRQRQTIGLV----ENQSDWY 154
           +L  R    Q   +     E +  WY
Sbjct: 150 SLLERFNSTQIAAMAPEHEEPRVGWY 175


>gi|302564381|ref|NP_001180786.1| glucoside xylosyltransferase 1 [Macaca mulatta]
          Length = 440

 Score = 62.0 bits (149), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 42/173 (24%), Positives = 76/173 (43%), Gaps = 19/173 (10%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L E +   + +DTD++F   +  +W+L  +    Q   +     +  +G  W 
Sbjct: 200 RLFLPLILKE-VDSLLYVDTDILFLRPVDDIWSLLKKFNSTQIAAMAPEHEEPRIG--WY 256

Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
           N     P  G+ G N+GV+L+++T++R            + W      + +K+ L   W 
Sbjct: 257 NRFARHPYYGKTGVNSGVMLMNMTRMRRKYFKNDMTTVRLQWGDILMPLLKKYKLNITW- 315

Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
              DQD+ N +   +P  ++  PCQWN +             E   + I+H N
Sbjct: 316 --GDQDLLNIVFFHNPESLFVFPCQWNYRPDHCIYGSNCQEAEEEGIFILHGN 366



 Score = 40.0 bits (92), Expect = 4.3,   Method: Compositional matrix adjust.
 Identities = 34/155 (21%), Positives = 64/155 (41%), Gaps = 9/155 (5%)

Query: 9   IPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSV 68
           I   E + +A+V  G       +T++KS + +   PL FH+  +    +  +    +WS 
Sbjct: 110 IQPVEKMHLAVVACG-ERLEETMTMLKSAIIFSMKPLQFHIFAEDQLHHSFKDRLDSWSF 168

Query: 69  PQV-EVSFYLADSVVEDVS-WIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 126
            Q    + Y      E+ + W   K         +L LP +L E +   + +DTD++F  
Sbjct: 169 LQTFNYTLYPITFPSENAAEW---KKLFKPCASQRLFLPLILKE-VDSLLYVDTDILFLR 224

Query: 127 DIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKN 161
            +  +W+L  +    Q   +     +  +G  W N
Sbjct: 225 PVDDIWSLLKKFNSTQIAAMAPEHEEPRIG--WYN 257


>gi|395541130|ref|XP_003772500.1| PREDICTED: glucoside xylosyltransferase 1-like [Sarcophilus
           harrisii]
          Length = 410

 Score = 62.0 bits (149), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 42/173 (24%), Positives = 76/173 (43%), Gaps = 19/173 (10%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L E +   + +DTD++F   +  +W+L  +    Q   +     +  +G  W 
Sbjct: 170 RLFLPLILKE-VDSLLYVDTDILFLRPVDDIWSLLKKFNSTQIAAMAPEHEEPRIG--WY 226

Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
           N     P  G+ G N+GV+L+++T++R            + W      + +K+ L   W 
Sbjct: 227 NRFARHPYYGKTGINSGVMLMNMTRMRRKYFKNDMTTARLQWGEILMPLLKKYKLNITW- 285

Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
              DQD+ N +   +P  ++  PCQWN +             E   + I+H N
Sbjct: 286 --GDQDLLNIMFFHNPESLFVFPCQWNYRPDHCIYGSNCREAEEEGIFILHGN 336



 Score = 39.3 bits (90), Expect = 6.9,   Method: Compositional matrix adjust.
 Identities = 30/153 (19%), Positives = 61/153 (39%), Gaps = 9/153 (5%)

Query: 11  TCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQ 70
           + + + +A+V  G       +T+IKS + +   PL FH+  +    +  + +   W   Q
Sbjct: 82  SSDTMHLAVVACG-ERLEETITMIKSAIIFSIQPLQFHIFAEDQLHHSFKNILDGWKFLQ 140

Query: 71  V--EVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 128
               + + +         W   K         +L LP +L E +   + +DTD++F   +
Sbjct: 141 TFNYIIYPITFPTENAAEW---KKLFKPCASQRLFLPLILKE-VDSLLYVDTDILFLRPV 196

Query: 129 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKN 161
             +W+L  +    Q   +     +  +G  W N
Sbjct: 197 DDIWSLLKKFNSTQIAAMAPEHEEPRIG--WYN 227


>gi|384489769|gb|EIE80991.1| hypothetical protein RO3G_05696 [Rhizopus delemar RA 99-880]
          Length = 392

 Score = 62.0 bits (149), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 56/226 (24%), Positives = 99/226 (43%), Gaps = 47/226 (20%)

Query: 478 DVTLVAQLSMDRLQMVEMLFKHWEGPISLTLY-MSDAEAQQFLS-----YAGNSEALRSR 531
           D+TL   ++ DR  ++  L  +++GPIS  ++ M + E  + +      Y  N +    +
Sbjct: 44  DITLATLVTRDRFHVLSRLATNYQGPISAAIHVMDNKEKTKTIGELNRIYKSNPDM---Q 100

Query: 532 RNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSI---RSM 588
             +  H++  + +    N  RN A     T Y+ +LD+D    F L    + SI     +
Sbjct: 101 AFVDVHLIVDKYD-RQFNMWRNTAKLFTRTDYLMMLDVD----FHLCTDFRKSIWENAEL 155

Query: 589 DMHGHGGKKVLVVPAFE--------------TQR--------------YRTAFPASHAPT 620
                 GK   VVPAFE              T++              + +A+ + H  T
Sbjct: 156 AAKLKSGKTAFVVPAFEYLDQKDGLDWRTFPTEKKKVLKEVEEAKLDSFHSAWVSGHGAT 215

Query: 621 NFSRWVNATTPYQI-EWAPDFEPYIVAHRD-LPRYDTRFVGFGWNK 664
           N++RW +    Y + E+   +EPY++  ++  P  D RF+G+G NK
Sbjct: 216 NYTRWYSTNQAYPVTEYEYSYEPYVIYKKEGTPWCDERFIGYGANK 261


>gi|167525928|ref|XP_001747298.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163774133|gb|EDQ87765.1| predicted protein [Monosiga brevicollis MX1]
          Length = 434

 Score = 62.0 bits (149), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 42/150 (28%), Positives = 64/150 (42%), Gaps = 33/150 (22%)

Query: 546 YPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFE 605
           YP N LRN+A +Q +T +  +LD+D +P   +   ++ ++ +   HG   +   VVPAFE
Sbjct: 222 YPNNLLRNLARDQATTLFTMVLDVDIVPSHNIAKAVEQAVAA--HHGSYTRLAFVVPAFE 279

Query: 606 -TQRYRTAFPA------------------------SHAPTNFSRWVNATT-----PYQIE 635
                 T  PA                            TN+ +W           Y++ 
Sbjct: 280 LDTSTATTMPAHRDELLQMLEAGTARPFYQEVCWKCQRHTNYEKWQQGPMQITELEYEVS 339

Query: 636 WAPDFEP-YIVAHRDLPRYDTRFVGFGWNK 664
           W   +EP YI   R  PRYD RF  +G+N+
Sbjct: 340 WHDPWEPFYIARERSCPRYDVRFRQYGFNR 369


>gi|195489522|ref|XP_002092774.1| GE14380 [Drosophila yakuba]
 gi|194178875|gb|EDW92486.1| GE14380 [Drosophila yakuba]
          Length = 389

 Score = 62.0 bits (149), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 61/278 (21%), Positives = 106/278 (38%), Gaps = 87/278 (31%)

Query: 468 DEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEA----QQFLSYAG 523
           ++Y  +++ + V L  Q S++RL  +  +  +W+G +S+ L+ +  E     Q F++Y  
Sbjct: 41  EQYADTTESSLVCLATQTSVERLNSLPQVAANWQGKMSVALFAAGPEEFVVLQYFVTYMR 100

Query: 524 NSEALRSRRNIGYHVV-------------------------------------------- 539
              A   R N  +H++                                            
Sbjct: 101 LCFA-NIRENATFHLLTPRDFDKLPRMSALPPNMKGKFDCQYPDRTLKALLKFRSLKTLQ 159

Query: 540 YKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVL 599
           +++ N YP N +RN+A     T YVFL DID +P     P L    R+ +          
Sbjct: 160 WRQRNTYPQNHMRNLARKGCQTKYVFLTDIDIVPSTNSVPQLNHFFRTANCTKSCA---Y 216

Query: 600 VVPAFETQRYRTAFPASHA------------------------PTNFSRWVNATTPYQIE 635
           V+P FE    R  FP S                           TNFS+W++  T  + E
Sbjct: 217 VIPTFEID-VRATFPRSKTALVRLIRKGLARPFHEKVFIYNQYATNFSKWLSPNTN-ETE 274

Query: 636 WAPD---------FEPYIVAHRDLPRYDTRFVGFGWNK 664
            +           +EP+ ++  + P +D RF+G+G+ +
Sbjct: 275 VSVSHIVTNFEFLYEPFYISVDNAPAHDERFLGYGFTR 312


>gi|348580283|ref|XP_003475908.1| PREDICTED: glucoside xylosyltransferase 1-like isoform 2 [Cavia
           porcellus]
          Length = 406

 Score = 62.0 bits (149), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 42/173 (24%), Positives = 76/173 (43%), Gaps = 19/173 (10%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L E +   + +DTD++F   +  +W+L  +    Q   +     +  +G  W 
Sbjct: 166 RLFLPLILKE-VDSLLYVDTDILFLRPVDDIWSLLKKFNSTQIAAMAPEHEEPRIG--WY 222

Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
           N     P  G+ G N+GV+L+++T++R            + W      + +K+ L   W 
Sbjct: 223 NRFARHPYYGKTGVNSGVMLMNMTRMRRKYFKNDMTPVRLQWGEILMPLLKKYKLNITW- 281

Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
              DQD+ N +   +P  ++  PCQWN +             E   + I+H N
Sbjct: 282 --GDQDLLNIMFFHNPESLFVFPCQWNYRPDHCIYGSNCQEAEEEGIFILHGN 332



 Score = 41.6 bits (96), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 35/155 (22%), Positives = 64/155 (41%), Gaps = 9/155 (5%)

Query: 9   IPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSV 68
           I   E + +A+V  G       +T++KS L +   PL FH+  +    +  +    +WS 
Sbjct: 76  IQPVEKMHLAVVACG-ERLEETMTMLKSALIFSIKPLQFHIFAEDQLHDSFKARLDSWSF 134

Query: 69  PQV-EVSFYLADSVVEDVS-WIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 126
            Q    + Y      E+ + W   K         +L LP +L E +   + +DTD++F  
Sbjct: 135 LQTFNYTLYPITFPSENAAEW---KKLFKPCASQRLFLPLILKE-VDSLLYVDTDILFLR 190

Query: 127 DIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKN 161
            +  +W+L  +    Q   +     +  +G  W N
Sbjct: 191 PVDDIWSLLKKFNSTQIAAMAPEHEEPRIG--WYN 223


>gi|332206948|ref|XP_003252557.1| PREDICTED: glucoside xylosyltransferase 1, partial [Nomascus
           leucogenys]
          Length = 417

 Score = 62.0 bits (149), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 43/173 (24%), Positives = 76/173 (43%), Gaps = 19/173 (10%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L E +   + +DTD++F   I  +W+L  +    Q   +     +  +G  W 
Sbjct: 177 RLFLPLILKE-VDSLLYVDTDILFLRPIDDIWSLLKKFNSTQIAAMAPEHEEPRIG--WY 233

Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
           N     P  G+ G N+GV+L+++T++R            + W      + +K+ L   W 
Sbjct: 234 NRFARHPYYGKTGVNSGVMLMNMTRMRRKYFKNDMTTVRLQWGDILMPLLKKYKLNITW- 292

Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
              DQD+ N +   +P  ++  PCQWN +             E   + I+H N
Sbjct: 293 --GDQDLLNIMFFHNPESLFVFPCQWNYRPDHCIYGSNCQEAEEEGIFILHGN 343



 Score = 39.7 bits (91), Expect = 5.1,   Method: Compositional matrix adjust.
 Identities = 35/155 (22%), Positives = 63/155 (40%), Gaps = 9/155 (5%)

Query: 9   IPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSV 68
           I   E + +A+V  G       +T++KS + +   PL FH+  +    +  +     WS 
Sbjct: 87  IQPVEKMHLAVVACG-ERLEETMTMLKSAIIFSIKPLQFHIFAEDQLHHSFKGRLDDWSF 145

Query: 69  PQV-EVSFYLADSVVEDVS-WIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 126
            Q    + Y      E+ + W   K         +L LP +L E +   + +DTD++F  
Sbjct: 146 LQTFNYTLYPITFPSENAAEW---KKLFKPCASQRLFLPLILKE-VDSLLYVDTDILFLR 201

Query: 127 DIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKN 161
            I  +W+L  +    Q   +     +  +G  W N
Sbjct: 202 PIDDIWSLLKKFNSTQIAAMAPEHEEPRIG--WYN 234


>gi|432114562|gb|ELK36410.1| Glucoside xylosyltransferase 1 [Myotis davidii]
          Length = 392

 Score = 62.0 bits (149), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 43/173 (24%), Positives = 76/173 (43%), Gaps = 19/173 (10%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L E +   + +DTD++F   +  +W+L  +    Q   +     +  +G  W 
Sbjct: 152 RLFLPLILKE-VDSLLYVDTDILFLRPVDDIWSLLKKFNSTQIAAMAPEHEEPRIG--WY 208

Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
           N     P  G+ G N+GV+L+++T++R            + W      + +K+ L   W 
Sbjct: 209 NRFARHPYYGKTGVNSGVMLMNMTRIRGKYFKNDMTTVRMRWGDILMPLLKKYKLNITW- 267

Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
              DQD+ N I   +P  ++  PCQWN +             E   + I+H N
Sbjct: 268 --GDQDLLNIIFFHNPESLFVFPCQWNYRPDHCIYGSNCREAEEEGIFILHGN 318



 Score = 47.0 bits (110), Expect = 0.032,   Method: Compositional matrix adjust.
 Identities = 37/157 (23%), Positives = 66/157 (42%), Gaps = 9/157 (5%)

Query: 7   PVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTW 66
           P +   E + +A+V  G       +T++KS + +   PL FH+  +    +  + +  TW
Sbjct: 60  PKVQPVERMHLAVVACG-ERLEETITMLKSAIIFSIKPLQFHIFAEDQLHDSFKGILDTW 118

Query: 67  SVPQV-EVSFYLADSVVED-VSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 124
           S  Q    S Y      E+ V W   K         +L LP +L E +   + +DTD++F
Sbjct: 119 SFLQTFNYSLYPISFPSENAVEW---KKLFKPCASQRLFLPLILKE-VDSLLYVDTDILF 174

Query: 125 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKN 161
              +  +W+L  +    Q   +     +  +G  W N
Sbjct: 175 LRPVDDIWSLLKKFNSTQIAAMAPEHEEPRIG--WYN 209


>gi|281341850|gb|EFB17434.1| hypothetical protein PANDA_010726 [Ailuropoda melanoleuca]
          Length = 369

 Score = 62.0 bits (149), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 44/173 (25%), Positives = 78/173 (45%), Gaps = 19/173 (10%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L E +   + +DTD++F   +  +W+L  +    Q   +     +  +G  W 
Sbjct: 129 RLFLPLILKE-VDSLLYVDTDILFLRPVDDIWSLLKKFNSTQIAAMAPEHEEPRIG--WY 185

Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLR------DIS-----WAGFWRIIAEKFLLTRLWT 304
           N     P  G+ G N+GV+L+++T++R      D++     W      + +K+ L   W 
Sbjct: 186 NRFARHPYYGKTGVNSGVMLMNMTRMRRKYFKNDMTTVRLLWGDILMPLLKKYKLNITW- 244

Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
              DQD+ N I   +P  ++  PCQWN +             E   + I+H N
Sbjct: 245 --GDQDLLNIIFFHNPESLFVFPCQWNYRPDHCIYGSNCQEAEAEGIFILHGN 295



 Score = 42.0 bits (97), Expect = 0.97,   Method: Compositional matrix adjust.
 Identities = 34/149 (22%), Positives = 63/149 (42%), Gaps = 9/149 (6%)

Query: 15  IQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQV-EV 73
           + +A+V  G       VT++KS + +   PL FH+  +    +  + +  +WS  Q    
Sbjct: 45  MHLAVVACG-ERLEETVTMLKSAIIFSIRPLQFHIFAEDQLHDSFKGILDSWSFLQTFNY 103

Query: 74  SFYLADSVVEDVS-WIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLW 132
           S Y      E+ + W   K         +L LP +L E +   + +DTD++F   +  +W
Sbjct: 104 SLYPITFPSENAAEW---KKLFKPCASQRLFLPLILKE-VDSLLYVDTDILFLRPVDDIW 159

Query: 133 ALFSRLRQRQTIGLVENQSDWYLGKLWKN 161
           +L  +    Q   +     +  +G  W N
Sbjct: 160 SLLKKFNSTQIAAMAPEHEEPRIG--WYN 186


>gi|71051513|gb|AAH30023.1| Glycosyltransferase 8 domain containing 3 [Homo sapiens]
          Length = 440

 Score = 62.0 bits (149), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 42/173 (24%), Positives = 76/173 (43%), Gaps = 19/173 (10%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L E +   + +DTD++F   +  +W+L  +    Q   +     +  +G  W 
Sbjct: 200 RLFLPLILKE-VNSLLYVDTDILFLRPVDDIWSLLKKFNSTQIAAMAPEHEEPRIG--WY 256

Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
           N     P  G+ G N+GV+L+++T++R            + W      + +K+ L   W 
Sbjct: 257 NRFARHPYYGKTGVNSGVMLMNMTRMRRKYFKNDMTTVRLQWGDILMPLLKKYKLNITW- 315

Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
              DQD+ N +   +P  ++  PCQWN +             E   + I+H N
Sbjct: 316 --GDQDLLNIVFFHNPESLFVFPCQWNYRPDHCIYGSNCQEAEEGGIFILHGN 366



 Score = 38.9 bits (89), Expect = 9.8,   Method: Compositional matrix adjust.
 Identities = 34/155 (21%), Positives = 63/155 (40%), Gaps = 9/155 (5%)

Query: 9   IPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSV 68
           I   E + +A+V  G       +T++KS + +   PL FH+  +    +  +     WS 
Sbjct: 110 IQPVEKMHLAVVACG-ERLEETMTMLKSAIIFSIKPLQFHIFAEDQLHHSFKGRLDNWSF 168

Query: 69  PQV-EVSFYLADSVVEDVS-WIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 126
            Q    + Y      E+ + W   K         +L LP +L E +   + +DTD++F  
Sbjct: 169 LQTFNYTLYPITFPSENAAEW---KKLFKPCASQRLFLPLILKE-VNSLLYVDTDILFLR 224

Query: 127 DIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKN 161
            +  +W+L  +    Q   +     +  +G  W N
Sbjct: 225 PVDDIWSLLKKFNSTQIAAMAPEHEEPRIG--WYN 257


>gi|114645289|ref|XP_001167787.1| PREDICTED: glucoside xylosyltransferase 1 isoform 1 [Pan
           troglodytes]
 gi|410223170|gb|JAA08804.1| glucoside xylosyltransferase 1 [Pan troglodytes]
 gi|410298482|gb|JAA27841.1| glucoside xylosyltransferase 1 [Pan troglodytes]
          Length = 440

 Score = 62.0 bits (149), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 42/173 (24%), Positives = 76/173 (43%), Gaps = 19/173 (10%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L E +   + +DTD++F   +  +W+L  +    Q   +     +  +G  W 
Sbjct: 200 RLFLPLILKE-VDSLLYVDTDILFLRPVDDIWSLLKKFNSTQIAAMAPEHEEPRIG--WY 256

Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
           N     P  G+ G N+GV+L+++T++R            + W      + +K+ L   W 
Sbjct: 257 NRFARHPYYGKTGVNSGVMLMNMTRMRRKYFKNDMTTVRLQWGDILMPLLKKYKLNITW- 315

Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
              DQD+ N +   +P  ++  PCQWN +             E   + I+H N
Sbjct: 316 --GDQDLLNIVFFHNPESLFVFPCQWNYRPDHCIYGSNCQEAEEEGIFILHGN 366


>gi|355564140|gb|EHH20640.1| Glucoside xylosyltransferase 1, partial [Macaca mulatta]
 gi|355786013|gb|EHH66196.1| Glucoside xylosyltransferase 1, partial [Macaca fascicularis]
          Length = 369

 Score = 61.6 bits (148), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 42/173 (24%), Positives = 76/173 (43%), Gaps = 19/173 (10%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L E +   + +DTD++F   +  +W+L  +    Q   +     +  +G  W 
Sbjct: 129 RLFLPLILKE-VDSLLYVDTDILFLRPVDDIWSLLKKFNSTQIAAMAPEHEEPRIG--WY 185

Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
           N     P  G+ G N+GV+L+++T++R            + W      + +K+ L   W 
Sbjct: 186 NRFARHPYYGKTGVNSGVMLMNMTRMRRKYFKNDMTTVRLQWGDILMPLLKKYKLNITW- 244

Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
              DQD+ N +   +P  ++  PCQWN +             E   + I+H N
Sbjct: 245 --GDQDLLNIVFFHNPESLFVFPCQWNYRPDHCIYGSNCQEAEEEGIFILHGN 295



 Score = 40.4 bits (93), Expect = 3.2,   Method: Compositional matrix adjust.
 Identities = 34/155 (21%), Positives = 64/155 (41%), Gaps = 9/155 (5%)

Query: 9   IPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSV 68
           I   E + +A+V  G       +T++KS + +   PL FH+  +    +  +    +WS 
Sbjct: 39  IQPVEKMHLAVVACG-ERLEETMTMLKSAIIFSMKPLQFHIFAEDQLHHSFKDRLDSWSF 97

Query: 69  PQV-EVSFYLADSVVEDVS-WIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 126
            Q    + Y      E+ + W   K         +L LP +L E +   + +DTD++F  
Sbjct: 98  LQTFNYTLYPITFPSENAAEW---KKLFKPCASQRLFLPLILKE-VDSLLYVDTDILFLR 153

Query: 127 DIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKN 161
            +  +W+L  +    Q   +     +  +G  W N
Sbjct: 154 PVDDIWSLLKKFNSTQIAAMAPEHEEPRIG--WYN 186


>gi|73997151|ref|XP_543730.2| PREDICTED: glucoside xylosyltransferase 1 [Canis lupus familiaris]
          Length = 410

 Score = 61.6 bits (148), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 43/173 (24%), Positives = 76/173 (43%), Gaps = 19/173 (10%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L E +   + +DTD++F   +  +W+L  +    Q   +     +  +G  W 
Sbjct: 170 RLFLPLILKE-VDSLLYVDTDILFLRPVDDIWSLLKKFNSTQIAAMAPEHEEPRIG--WY 226

Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
           N     P  G+ G N+GV+L+++T++R            + W      + +K+ L   W 
Sbjct: 227 NRFARHPYYGKTGVNSGVMLMNMTRMRRKYFKNDMTTVRLRWGDMLMPLLKKYKLNITW- 285

Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
              DQD+ N I   +P  ++  PCQWN +             E   + I+H N
Sbjct: 286 --GDQDLLNIIFFHNPESLFVFPCQWNYRPDHCIYGSNCQEAEEEGIFILHGN 336



 Score = 41.2 bits (95), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 33/149 (22%), Positives = 63/149 (42%), Gaps = 9/149 (6%)

Query: 15  IQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQV-EV 73
           + +A+V  G       +T++KS + +   PL FH+  +    +  + +  +WS  Q    
Sbjct: 86  MHLAVVACG-ERLEETITMLKSAIIFSIRPLQFHIFAEDQLHDSFKGILDSWSFLQTFNY 144

Query: 74  SFYLADSVVED-VSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLW 132
           S Y      E+ + W   K         +L LP +L E +   + +DTD++F   +  +W
Sbjct: 145 SLYPITFPSENALEW---KKLFKPCASQRLFLPLILKE-VDSLLYVDTDILFLRPVDDIW 200

Query: 133 ALFSRLRQRQTIGLVENQSDWYLGKLWKN 161
           +L  +    Q   +     +  +G  W N
Sbjct: 201 SLLKKFNSTQIAAMAPEHEEPRIG--WYN 227


>gi|431899796|gb|ELK07743.1| Glycosyltransferase 8 domain-containing protein 4 [Pteropus alecto]
          Length = 317

 Score = 61.6 bits (148), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 42/172 (24%), Positives = 74/172 (43%), Gaps = 17/172 (9%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L + +   + +DTDV+F   +  +W L  +    Q   +        +G  + 
Sbjct: 70  RLFLPMILKD-VDSLLYVDTDVLFLRPVDDIWKLLRQFNSTQLAAMAPEHEIPKIG-WYS 127

Query: 258 NHKPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWTS 305
                P  G  G N+GV+L++LT++R            ++W     ++ +K+     W  
Sbjct: 128 RFAQHPFYGSSGINSGVMLMNLTRIRSAQFKNSVIPTGLAWEDMLYLLYQKYKSAITW-- 185

Query: 306 LADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
             DQD+ N I   +P  +Y  PCQWN +             EL  + ++H N
Sbjct: 186 -GDQDLLNIIFYFNPECLYVFPCQWNYRPDHCMYGSNCKEAELEGVSVLHGN 236


>gi|432920667|ref|XP_004079976.1| PREDICTED: N-acetyllactosaminide
           beta-1,3-N-acetylglucosaminyltransferase-like [Oryzias
           latipes]
          Length = 431

 Score = 61.6 bits (148), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 39/150 (26%), Positives = 63/150 (42%), Gaps = 34/150 (22%)

Query: 546 YPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFE 605
           YP N LRN+A +   + Y+ ++DID +P   L+    + +           +V V+PAFE
Sbjct: 219 YPNNLLRNIARSGTESSYILVIDIDMMPSADLHQQFLNLVMKRQP---TSDEVFVLPAFE 275

Query: 606 TQR-----------------------YRTAFPASHAPTNFSRWVNA--------TTPYQI 634
            +                        Y    P   APTN+S+WVN            Y +
Sbjct: 276 IRHARKIPANKAELVQLYQVGEVRPFYEELCPRCQAPTNYSQWVNNHVRGIGTLEVAYTL 335

Query: 635 EWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
            W   +EP+ +  R +P Y+  F  +G+N+
Sbjct: 336 TWVDPWEPFYIGPRTVPLYNENFKQYGFNR 365


>gi|153791362|ref|NP_775872.1| glucoside xylosyltransferase 1 isoform 1 [Homo sapiens]
 gi|269849602|sp|Q4G148.2|GXLT1_HUMAN RecName: Full=Glucoside xylosyltransferase 1; AltName:
           Full=Glycosyltransferase 8 domain-containing protein 3
 gi|119578237|gb|EAW57833.1| hCG39367, isoform CRA_c [Homo sapiens]
          Length = 440

 Score = 61.6 bits (148), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 42/173 (24%), Positives = 76/173 (43%), Gaps = 19/173 (10%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L E +   + +DTD++F   +  +W+L  +    Q   +     +  +G  W 
Sbjct: 200 RLFLPLILKE-VDSLLYVDTDILFLRPVDDIWSLLKKFNSTQIAAMAPEHEEPRIG--WY 256

Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
           N     P  G+ G N+GV+L+++T++R            + W      + +K+ L   W 
Sbjct: 257 NRFARHPYYGKTGVNSGVMLMNMTRMRRKYFKNDMTTVRLQWGDILMPLLKKYKLNITW- 315

Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
              DQD+ N +   +P  ++  PCQWN +             E   + I+H N
Sbjct: 316 --GDQDLLNIVFFHNPESLFVFPCQWNYRPDHCIYGSNCQEAEEGGIFILHGN 366


>gi|313213019|emb|CBY36900.1| unnamed protein product [Oikopleura dioica]
          Length = 427

 Score = 61.6 bits (148), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 61/265 (23%), Positives = 100/265 (37%), Gaps = 75/265 (28%)

Query: 474 SDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEAL---RS 530
           S   DVTL +Q S + L  V  L + W GP+S+ ++  D ++  F++ A  S      + 
Sbjct: 94  SQYEDVTLASQCSANHLHHVINLLERWSGPMSIAVFAPDKDSS-FVTDAILSMRKCWPQI 152

Query: 531 RRNIGYHVVYKEG-------------------------NF-------------YPINTLR 552
           R  + +H+VY  G                         NF             YP N LR
Sbjct: 153 RDKVTFHLVYPIGHTSDVSASTGGLEFNDCNELVDIISNFGGDDAGNYKGDIPYPHNVLR 212

Query: 553 NVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKK--VLVVPAFETQRYR 610
           NVA+   +T Y FL+DID LP   +             +     +  V VVP FE ++  
Sbjct: 213 NVAIKTSATKYTFLIDIDVLPNENIRERFNQMAIKKGYYDEDFDENVVFVVPCFEMKKGY 272

Query: 611 TA----------FPAS-------------HAPTNFSRWV--------NATTPYQIEWAPD 639
           T           F                H+P     W+         +   +   W  +
Sbjct: 273 TRSLNKKHLLEDFQKGLIRPFHNESCWWCHSPEEHPHWMAIPLSRNDQSKISFNATWNKN 332

Query: 640 FEPYIVAHRDLPRYDTRFVGFGWNK 664
           +EP+ +  +++P +D RF  +G+++
Sbjct: 333 WEPFYIGSKNVPLFDERFKQYGFDR 357


>gi|149017615|gb|EDL76619.1| similar to CG9996-PA, isoform CRA_b [Rattus norvegicus]
          Length = 362

 Score = 61.6 bits (148), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 40/158 (25%), Positives = 68/158 (43%), Gaps = 18/158 (11%)

Query: 213 IVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNH-KPWPALGR-GFN 270
           + +DTDV+F   +  +W+L  R    Q   +     +  +G  W N     P  GR G N
Sbjct: 136 LYVDTDVLFLRPVDDIWSLLERFNSTQIAAMAPEHEEPRVG--WYNRFARHPYYGRTGVN 193

Query: 271 TGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEH 319
           +GV+L+++T++R            + W      + +K+ L   W    DQD+ N +   +
Sbjct: 194 SGVMLMNMTRMRRKYFKNDMTTARLQWGDILMPLLKKYKLNITW---GDQDLLNIMFYHN 250

Query: 320 PYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
           P  ++  PCQWN +             E   + I+H N
Sbjct: 251 PESLFVFPCQWNYRPDHCIYGSNCREAEEEGVFILHGN 288


>gi|426372202|ref|XP_004053017.1| PREDICTED: glucoside xylosyltransferase 1 isoform 1 [Gorilla
           gorilla gorilla]
          Length = 440

 Score = 61.6 bits (148), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 42/173 (24%), Positives = 76/173 (43%), Gaps = 19/173 (10%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L E +   + +DTD++F   +  +W+L  +    Q   +     +  +G  W 
Sbjct: 200 RLFLPLILKE-IDSLLYVDTDILFLRPVDDIWSLLKKFNSTQIAAMAPEHEEPRIG--WY 256

Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
           N     P  G+ G N+GV+L+++T++R            + W      + +K+ L   W 
Sbjct: 257 NRFARHPYYGKTGVNSGVMLMNMTRMRRKYFKNDMTTVRLQWGDILMPLLKKYKLNITW- 315

Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
              DQD+ N +   +P  ++  PCQWN +             E   + I+H N
Sbjct: 316 --GDQDLLNIMFFHNPESLFVFPCQWNYRPDHCIYGSNCQEAEEEGIFILHGN 366



 Score = 38.9 bits (89), Expect = 8.0,   Method: Compositional matrix adjust.
 Identities = 34/155 (21%), Positives = 63/155 (40%), Gaps = 9/155 (5%)

Query: 9   IPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSV 68
           I   E + +A+V  G       +T++KS + +   PL FH+  +    +  +     WS 
Sbjct: 110 IQPVEKMHLAVVACG-ERLEETMTMLKSAIIFSIKPLQFHIFAEDQLHHSFKGRLDNWSF 168

Query: 69  PQV-EVSFYLADSVVEDVS-WIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 126
            Q    + Y      E+ + W   K         +L LP +L E +   + +DTD++F  
Sbjct: 169 LQTFNYTLYPITFPSENAAEW---KKLFKPCASQRLFLPLILKE-IDSLLYVDTDILFLR 224

Query: 127 DIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKN 161
            +  +W+L  +    Q   +     +  +G  W N
Sbjct: 225 PVDDIWSLLKKFNSTQIAAMAPEHEEPRIG--WYN 257


>gi|431901432|gb|ELK08458.1| Glycosyltransferase 8 domain-containing protein 3, partial
           [Pteropus alecto]
          Length = 369

 Score = 61.6 bits (148), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 43/173 (24%), Positives = 76/173 (43%), Gaps = 19/173 (10%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L E +   + +DTD++F   +  +W+L  +    Q   +     +  +G  W 
Sbjct: 129 RLFLPLILKE-VDSLLYVDTDILFLRPVDDIWSLLKKFNSTQIAAMAPEHEEPRIG--WY 185

Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
           N     P  G+ G N+GV+L+++T++R            + W      + +K+ L   W 
Sbjct: 186 NRFARHPYYGKTGVNSGVMLMNMTRIRRKYFKNDMTTARLRWGDILMPLLKKYKLNITW- 244

Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
              DQD+ N I   +P  ++  PCQWN +             E   + I+H N
Sbjct: 245 --GDQDLLNIIFFHNPESLFVFPCQWNYRPDHCIYGSNCQEAEEEGIFILHGN 295



 Score = 42.0 bits (97), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 32/153 (20%), Positives = 63/153 (41%), Gaps = 5/153 (3%)

Query: 9   IPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSV 68
           I   + I +A+V  G       +T++KS + +   PL FH+  +    +  + +   WS 
Sbjct: 39  IQPVQRIHLAVVACG-ERLEETITMLKSAIIFSIKPLQFHIFAEDQLHDSFRGILDNWSF 97

Query: 69  PQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 128
            Q   ++ L   +    +    K         +L LP +L E +   + +DTD++F   +
Sbjct: 98  LQT-FNYSLYPIIFPSENAAEWKKLFKPCASQRLFLPLILKE-VDSLLYVDTDILFLRPV 155

Query: 129 AQLWALFSRLRQRQTIGLVENQSDWYLGKLWKN 161
             +W+L  +    Q   +     +  +G  W N
Sbjct: 156 DDIWSLLKKFNSTQIAAMAPEHEEPRIG--WYN 186


>gi|355736744|gb|AES12096.1| hypothetical protein [Mustela putorius furo]
          Length = 226

 Score = 61.6 bits (148), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 54/210 (25%), Positives = 88/210 (41%), Gaps = 26/210 (12%)

Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
           F + +++ +  S     +YS     L + + +++P+ + + I LD D+ + T+I +L+  
Sbjct: 14  FPVVEAMQKHFSAGSGTYYSDSIFFLSVAMHQIMPKEILRIIQLDLDLKYKTNIRELFEE 73

Query: 232 FSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPA---------LGRGFNTGVILLDLTKLR 282
           F        IG+       Y    W+     P             GFN+GV+LL+L  +R
Sbjct: 74  FDNFLPGAVIGIAREMQPVYRHTFWQFRHENPKTRVGGPPPEGLPGFNSGVMLLNLEAMR 133

Query: 283 DISWAGFWRIIAEKFLLTRLWTS------LADQDIFNAIISEHPYLVYTLPCQWNVQLS- 335
               +  +  + E   + +L         L DQD F  I  EHP L + L C WN QL  
Sbjct: 134 Q---SPLYSRLLEPTQVQQLADKYHFRGHLGDQDFFTMIGMEHPELFHVLDCTWNRQLCT 190

Query: 336 -----DNTRSD--ELCYTELTDLKIIHWNS 358
                D+  SD  E  +     +KI H N 
Sbjct: 191 WTWWRDHGYSDVFEAYFRCEGHVKIYHGNC 220


>gi|395841531|ref|XP_003793588.1| PREDICTED: glucoside xylosyltransferase 1 isoform 1 [Otolemur
           garnettii]
          Length = 437

 Score = 61.6 bits (148), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 43/173 (24%), Positives = 76/173 (43%), Gaps = 19/173 (10%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L E +   + +DTD++F   +  +W+L  +    Q   +     +  +G  W 
Sbjct: 197 RLFLPLILKE-VDSLLYVDTDILFLRPVDDIWSLLKKFNSTQIAAMAPEHEEPRIG--WY 253

Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
           N     P  G+ G N+GV+L+++T++R            + W      + +K+ L   W 
Sbjct: 254 NRFARHPYYGKTGVNSGVMLMNMTRIRRKYFKNDMTSVRLRWGDILMPLLKKYKLNITW- 312

Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
              DQD+ N I   +P  ++  PCQWN +             E   + I+H N
Sbjct: 313 --GDQDLLNIIFFHNPESLFVFPCQWNYRPDHCIYGSNCHEAEEEGIFILHGN 363


>gi|393242337|gb|EJD49855.1| hypothetical protein AURDEDRAFT_112338 [Auricularia delicata
           TFB-10046 SS5]
          Length = 442

 Score = 61.2 bits (147), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 46/136 (33%), Positives = 71/136 (52%), Gaps = 9/136 (6%)

Query: 10  PTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS--TWS 67
           P  E I +++V  G +S      ++KSIL     P+  H++    A+ +LQ      T  
Sbjct: 60  PEEEKIVISMVMFGRSSAEEGQHMLKSILMRASAPVELHIVCSDDAVPVLQRRLDLVTRP 119

Query: 68  VPQVEVSFY--LADSVVE--DVSWIPNK-HYSGVYGLLKLTLPKVLPETLAKTIVLDTDV 122
           V  V+V FY    D+VV   + + I NK H++GV GL K+ + +++P  LA  + +DTD 
Sbjct: 120 VHDVDVVFYPLTEDAVVTRGERAGIGNKRHHAGVGGLAKMFIHELVPAPLA--LYVDTDA 177

Query: 123 IFATDIAQLWALFSRL 138
           +F TD   LW    RL
Sbjct: 178 MFLTDPLHLWHTARRL 193


>gi|241848448|ref|XP_002415636.1| conserved hypothetical protein [Ixodes scapularis]
 gi|215509850|gb|EEC19303.1| conserved hypothetical protein [Ixodes scapularis]
          Length = 387

 Score = 61.2 bits (147), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 52/161 (32%), Positives = 75/161 (46%), Gaps = 22/161 (13%)

Query: 190 YSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSD 249
           YS     L L L +VL  +L + I+LD D+   +D+  L   F+   +   +G+   Q  
Sbjct: 192 YSNALFFLSLGLHRVL--SLRRLILLDIDLRLESDVWLLHRHFALFPETALVGMAHEQQP 249

Query: 250 WYLGKLWKNHK----------PWPALGRGFNTGVILLDLTKL------RDISWAGFWRII 293
            YL  L+  H+          P P    GFN+GV+LLDL ++      R++S       +
Sbjct: 250 TYL-HLFHEHRKRHPETRCSLPLPQGNPGFNSGVLLLDLHRMALSSAFRNLSSREGVEYL 308

Query: 294 AEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQL 334
             K+   R    L DQD +  +  E P LVY LPC WN QL
Sbjct: 309 TSKY---RFQGHLGDQDFYTVLSCERPELVYALPCTWNRQL 346


>gi|344255262|gb|EGW11366.1| Glycosyltransferase 8 domain-containing protein 3 [Cricetulus
           griseus]
          Length = 345

 Score = 61.2 bits (147), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 42/173 (24%), Positives = 76/173 (43%), Gaps = 19/173 (10%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L E +   + +DTD++F   +  +W+L  +    Q   +     +  +G  W 
Sbjct: 105 RLFLPLILKE-VDSLLYVDTDILFLRPVDDIWSLLKKFNSTQIAAMAPEHEEPRIG--WY 161

Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
           N     P  G+ G N+GV+L+++T++R            + W      + +K+ L   W 
Sbjct: 162 NRFARHPYYGKTGVNSGVMLMNMTRMRRKYFKNDMTTARLQWGDILMPLLKKYKLNITW- 220

Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
              DQD+ N +   +P  ++  PCQWN +             E   + I+H N
Sbjct: 221 --GDQDLLNIMFFHNPESLFVFPCQWNYRPDHCIYGSNCREAEEEGVFILHGN 271



 Score = 46.6 bits (109), Expect = 0.047,   Method: Compositional matrix adjust.
 Identities = 37/152 (24%), Positives = 64/152 (42%), Gaps = 6/152 (3%)

Query: 10  PTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVP 69
           PT E + +A+V  G       VT++KS L +   PL FH+  +    N  +    +WS  
Sbjct: 17  PT-EKMHLAVVACG-ERLEETVTMLKSALIFSIKPLQFHIFAEDQLHNSFKERLDSWSFL 74

Query: 70  QVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIA 129
           Q   ++ L        S +  K         +L LP +L E +   + +DTD++F   + 
Sbjct: 75  Q-RFNYTLYSITFPSDSAVEWKKLFKPCASQRLFLPLILKE-VDSLLYVDTDILFLRPVD 132

Query: 130 QLWALFSRLRQRQTIGLVENQSDWYLGKLWKN 161
            +W+L  +    Q   +     +  +G  W N
Sbjct: 133 DIWSLLKKFNSTQIAAMAPEHEEPRIG--WYN 162


>gi|297691568|ref|XP_002823152.1| PREDICTED: glucoside xylosyltransferase 1 [Pongo abelii]
          Length = 437

 Score = 61.2 bits (147), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 42/173 (24%), Positives = 76/173 (43%), Gaps = 19/173 (10%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L E +   + +DTD++F   +  +W+L  +    Q   +     +  +G  W 
Sbjct: 197 RLFLPLILKE-VDSLLYVDTDILFLRPVDDIWSLLKKFNSTQIAAMAPEHEEPRIG--WY 253

Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
           N     P  G+ G N+GV+L+++T++R            + W      + +K+ L   W 
Sbjct: 254 NRFARHPYYGKTGVNSGVMLMNMTRMRRKYFKNDMTTVRLQWGDILMPLLKKYKLNITW- 312

Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
              DQD+ N +   +P  ++  PCQWN +             E   + I+H N
Sbjct: 313 --GDQDLLNIMFFHNPESLFVFPCQWNYRPDHCIYGSNCQEAEEEGIFILHGN 363



 Score = 39.3 bits (90), Expect = 6.7,   Method: Compositional matrix adjust.
 Identities = 34/155 (21%), Positives = 64/155 (41%), Gaps = 9/155 (5%)

Query: 9   IPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSV 68
           I   E + +A+V  G       +T++KS + +   PL FH+  +    +  +    +WS 
Sbjct: 107 IQPVEKMHLAVVACG-ERLEETMTMLKSAIIFSIKPLQFHIFAEDQLHHSFKGRLDSWSF 165

Query: 69  PQV-EVSFYLADSVVEDVS-WIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 126
            Q    + Y      E+ + W   K         +L LP +L E +   + +DTD++F  
Sbjct: 166 LQTFNYTLYPITFPSENAAEW---KKLFKPCASQRLFLPLILKE-VDSLLYVDTDILFLR 221

Query: 127 DIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKN 161
            +  +W+L  +    Q   +     +  +G  W N
Sbjct: 222 PVDDIWSLLKKFNSTQIAAMAPEHEEPRIG--WYN 254


>gi|348580281|ref|XP_003475907.1| PREDICTED: glucoside xylosyltransferase 1-like isoform 1 [Cavia
           porcellus]
          Length = 437

 Score = 61.2 bits (147), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 42/173 (24%), Positives = 76/173 (43%), Gaps = 19/173 (10%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L E +   + +DTD++F   +  +W+L  +    Q   +     +  +G  W 
Sbjct: 197 RLFLPLILKE-VDSLLYVDTDILFLRPVDDIWSLLKKFNSTQIAAMAPEHEEPRIG--WY 253

Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
           N     P  G+ G N+GV+L+++T++R            + W      + +K+ L   W 
Sbjct: 254 NRFARHPYYGKTGVNSGVMLMNMTRMRRKYFKNDMTPVRLQWGEILMPLLKKYKLNITW- 312

Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
              DQD+ N +   +P  ++  PCQWN +             E   + I+H N
Sbjct: 313 --GDQDLLNIMFFHNPESLFVFPCQWNYRPDHCIYGSNCQEAEEEGIFILHGN 363



 Score = 41.6 bits (96), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 35/155 (22%), Positives = 64/155 (41%), Gaps = 9/155 (5%)

Query: 9   IPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSV 68
           I   E + +A+V  G       +T++KS L +   PL FH+  +    +  +    +WS 
Sbjct: 107 IQPVEKMHLAVVACG-ERLEETMTMLKSALIFSIKPLQFHIFAEDQLHDSFKARLDSWSF 165

Query: 69  PQV-EVSFYLADSVVEDVS-WIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 126
            Q    + Y      E+ + W   K         +L LP +L E +   + +DTD++F  
Sbjct: 166 LQTFNYTLYPITFPSENAAEW---KKLFKPCASQRLFLPLILKE-VDSLLYVDTDILFLR 221

Query: 127 DIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKN 161
            +  +W+L  +    Q   +     +  +G  W N
Sbjct: 222 PVDDIWSLLKKFNSTQIAAMAPEHEEPRIG--WYN 254


>gi|147906982|ref|NP_001089592.1| glucoside xylosyltransferase 1 [Xenopus laevis]
 gi|68533770|gb|AAH99025.1| MGC115517 protein [Xenopus laevis]
          Length = 324

 Score = 61.2 bits (147), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 55/223 (24%), Positives = 96/223 (43%), Gaps = 31/223 (13%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L E +   + +DTD++F   +  +W+        Q   +     +  +G  W 
Sbjct: 85  RLFLPLILTE-VDSLLYVDTDILFLRPLDDIWSFLGGFNATQIAAMAPEHEEPRIG--WY 141

Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
           N     P  G+ G N+GV+L+++T++R            + W      + +K+ L   W 
Sbjct: 142 NRFARHPYYGKTGINSGVMLMNMTRIRKKHFKNDMTTVRLHWGDLLLPLLKKYKLNITW- 200

Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWNSPKKLKV 364
              DQD+ N + S +P  ++  PCQWN +  D+      C+     + I+H N   +   
Sbjct: 201 --GDQDLLNIMFSHNPESLFVFPCQWNYR-PDHCIYGSNCHEAEDGIFILHGN---RGVY 254

Query: 365 KNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQTTLQQAELS 407
            ++    FR +Y  F  Y         FG +L Q+ L   EL 
Sbjct: 255 HDRKQPAFRAVYEAFRNYS--------FGHDLRQSLLYPLELE 289



 Score = 42.0 bits (97), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 37/180 (20%), Positives = 78/180 (43%), Gaps = 10/180 (5%)

Query: 15  IQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQ-VEV 73
           + +A+V  G +     +T++KS L + K  L FH+  +       +   ++W+ P+  + 
Sbjct: 1   MHLAVVACG-DRLEETITMLKSALLFSKKLLQFHIFAEDQLHYDFKAALNSWAFPEKFKY 59

Query: 74  SFYLADSVVED-VSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLW 132
           + Y     +E+ V W   K    +    +L LP +L E +   + +DTD++F   +  +W
Sbjct: 60  TLYPITFPLENAVEW---KKLFKLCASQRLFLPLILTE-VDSLLYVDTDILFLRPLDDIW 115

Query: 133 ALFSRLRQRQTIGLVENQSDWYLGKLWKNH-KPWPALGRGFYLADSVVEDVSWIPNKHYS 191
           +        Q   +     +  +G  W N     P  G+    +  ++ +++ I  KH+ 
Sbjct: 116 SFLGGFNATQIAAMAPEHEEPRIG--WYNRFARHPYYGKTGINSGVMLMNMTRIRKKHFK 173


>gi|291392421|ref|XP_002712730.1| PREDICTED: glycosyltransferase 8 domain containing 3 [Oryctolagus
           cuniculus]
          Length = 680

 Score = 60.8 bits (146), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 41/173 (23%), Positives = 75/173 (43%), Gaps = 19/173 (10%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L E +   + +DTD++F   +  +W+L  +    Q   +     +  +G  W 
Sbjct: 292 RLFLPLILKE-VDSLLYVDTDILFLRPVDDIWSLLKKFNSTQIAAMAPEHEEPRIG--WY 348

Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
           N     P  G  G N+GV+L+++T++R            + W      + +K+ L   W 
Sbjct: 349 NRFARHPYYGETGVNSGVMLMNMTRMRRKYFKNDMTTARLRWGDILVPLLKKYKLNITW- 407

Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
              DQD+ N +   +P  ++  PCQWN +             E   + ++H N
Sbjct: 408 --GDQDLLNIMFFHNPESLFVFPCQWNYRPDHCIYGSNCREAEEEGISVLHGN 458


>gi|156537600|ref|XP_001607722.1| PREDICTED: uncharacterized protein C3orf21 homolog [Nasonia
           vitripennis]
          Length = 372

 Score = 60.8 bits (146), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 40/163 (24%), Positives = 74/163 (45%), Gaps = 13/163 (7%)

Query: 186 PNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVE 245
           P  +YS     L L L ++ P+     +++D D  F  D+ +L+  F    ++   GL  
Sbjct: 174 PGTYYSDALFFLSLGLHRIAPQDQDTAVMIDADTKFRVDVKKLFKEFDHFGKQALFGLGP 233

Query: 246 NQSDWYLGKL--WKNHKPWPALGR--------GFNTGVILLDLTKLRD-ISWAGFWRIIA 294
             +  Y   L  +++  P   +G         G+N+GV+LL+L  LR+ + +     + +
Sbjct: 234 ELTPVYRHVLYVYRSKNPKTIVGEPGYLGGYPGYNSGVVLLNLKGLRESLEYDQIVSMDS 293

Query: 295 EKFLLTR--LWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLS 335
            + L+ +      L DQD +  +  E P  ++ + C WN QL 
Sbjct: 294 VEHLVEKYEFKGHLGDQDFYTLLGMERPEFIHAIDCGWNRQLC 336



 Score = 42.0 bits (97), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 37/165 (22%), Positives = 70/165 (42%), Gaps = 18/165 (10%)

Query: 28  RSLVTLIKSILFYRKNPLHFHLITD----TVALNILQTLFSTWSVPQVEVSFYLADSV-- 81
           R     + S+L Y    + FH+ITD     +A  +++  F   +  +++V +Y    +  
Sbjct: 99  RKFRMFVDSLLKYSTGGVVFHVITDEESKIIAEKVIEYNFLV-NKKKMKVQYYNVHKLAI 157

Query: 82  -VEDV--------SWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLW 132
            +ED+        S  P  +YS     L L L ++ P+     +++D D  F  D+ +L+
Sbjct: 158 EIEDIVSVMSPHFSSKPGTYYSDALFFLSLGLHRIAPQDQDTAVMIDADTKFRVDVKKLF 217

Query: 133 ALFSRLRQRQTIGLVENQSDWYLGKL--WKNHKPWPALGRGFYLA 175
             F    ++   GL    +  Y   L  +++  P   +G   YL 
Sbjct: 218 KEFDHFGKQALFGLGPELTPVYRHVLYVYRSKNPKTIVGEPGYLG 262


>gi|380017376|ref|XP_003692633.1| PREDICTED: xyloside xylosyltransferase 1-like [Apis florea]
          Length = 357

 Score = 60.8 bits (146), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 44/164 (26%), Positives = 74/164 (45%), Gaps = 17/164 (10%)

Query: 186 PNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVE 245
           P  +YS     L L L ++ P       + D D  F  DI +L+  F+    +   GL  
Sbjct: 159 PGTYYSDALFFLSLGLHRIAPIEQKVAAMFDADTKFRKDIKELFEEFNTFGNQALFGLSP 218

Query: 246 NQSDWY--LGKLWKNHKPWPALGR--------GFNTGVILLDLTKLRDISWAGFWRIIAE 295
             +  Y  +  L++N  P    G         G+N+G++L +L KLR+ +   + +I+ +
Sbjct: 219 ELTPVYRHVLYLYRNKHPNTLFGEPASSGGYPGYNSGMVLFNLDKLRNSTV--YDKIVKK 276

Query: 296 KFLLT-----RLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQL 334
           + + T          L DQD +  +  E P L++T+ C WN QL
Sbjct: 277 ESVDTMTSKYHFKGHLGDQDFYTLLGMEKPELIHTIDCGWNRQL 320


>gi|149017614|gb|EDL76618.1| similar to CG9996-PA, isoform CRA_a [Rattus norvegicus]
          Length = 393

 Score = 60.8 bits (146), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 40/158 (25%), Positives = 68/158 (43%), Gaps = 18/158 (11%)

Query: 213 IVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNH-KPWPALGR-GFN 270
           + +DTDV+F   +  +W+L  R    Q   +     +  +G  W N     P  GR G N
Sbjct: 167 LYVDTDVLFLRPVDDIWSLLERFNSTQIAAMAPEHEEPRVG--WYNRFARHPYYGRTGVN 224

Query: 271 TGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEH 319
           +GV+L+++T++R            + W      + +K+ L   W    DQD+ N +   +
Sbjct: 225 SGVMLMNMTRMRRKYFKNDMTTARLQWGDILMPLLKKYKLNITW---GDQDLLNIMFYHN 281

Query: 320 PYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
           P  ++  PCQWN +             E   + I+H N
Sbjct: 282 PESLFVFPCQWNYRPDHCIYGSNCREAEEEGVFILHGN 319


>gi|344266729|ref|XP_003405432.1| PREDICTED: glucoside xylosyltransferase 1-like [Loxodonta africana]
          Length = 580

 Score = 60.8 bits (146), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 42/173 (24%), Positives = 76/173 (43%), Gaps = 19/173 (10%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L E +   + +DTD++F   +  +W+L  +    Q   +     +  +G  W 
Sbjct: 340 RLFLPLILKE-VDSLLYVDTDILFLRPVDDIWSLLKKFNSTQIAAMAPEHEEPRIG--WY 396

Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
           N     P  G+ G N+GV+L+++T++R            + W      + +K+ L   W 
Sbjct: 397 NRFARHPYYGKTGVNSGVMLMNMTRMRRKYFKNDMTTVRLRWGDILMPLLKKYKLNITW- 455

Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
              DQD+ N +   +P  ++  PCQWN +             E   + I+H N
Sbjct: 456 --GDQDLLNIMFFHNPESLFVFPCQWNYRPDHCIYGSNCQEAEKEGIFILHGN 506



 Score = 41.2 bits (95), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 32/150 (21%), Positives = 64/150 (42%), Gaps = 7/150 (4%)

Query: 9   IPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSV 68
           I T E + +A+V  G       +T++KS + +   PL FH+  +    +  + +  +W +
Sbjct: 250 IHTVERMHLAVVACG-ERLEETITMLKSAIIFSIKPLQFHIFAEDQLHDNFKGILDSWLL 308

Query: 69  PQV-EVSFYLADSVVEDVS-WIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 126
            Q    + Y      E+ + W   K         +L LP +L E +   + +DTD++F  
Sbjct: 309 LQTFNYTLYPITFPSENAAEW---KKLFKPCASQRLFLPLILKE-VDSLLYVDTDILFLR 364

Query: 127 DIAQLWALFSRLRQRQTIGLVENQSDWYLG 156
            +  +W+L  +    Q   +     +  +G
Sbjct: 365 PVDDIWSLLKKFNSTQIAAMAPEHEEPRIG 394


>gi|159155696|gb|AAI54725.1| LOC100127708 protein [Xenopus (Silurana) tropicalis]
          Length = 366

 Score = 60.8 bits (146), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 55/223 (24%), Positives = 95/223 (42%), Gaps = 31/223 (13%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L E +   + +DTD++F   +  +W+        Q   +     +  +G  W 
Sbjct: 127 RLFLPLILTE-VDSLLYVDTDILFLRPLDDIWSFLGGFNSTQIAAMAPEHEEPRIG--WY 183

Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
           N     P  G+ G N+GV+L+++T++R            + W      + +K+ L   W 
Sbjct: 184 NRFARHPYYGKTGINSGVMLMNMTRIRKKHFKNDMTAVRLHWGDLLLPLLKKYKLNITW- 242

Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWNSPKKLKV 364
              DQD+ N + S +P  ++  PCQWN +  D+      C+     + I+H N   +   
Sbjct: 243 --GDQDLLNIMFSHNPESLFVFPCQWNYR-PDHCIYGSNCHEAEGGIFILHGN---RGVY 296

Query: 365 KNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQTTLQQAELS 407
            ++    FR +Y  F  Y         FG NL  + L   EL 
Sbjct: 297 HDRKQPAFRAVYDAFRNYT--------FGDNLFHSLLYPLELE 331



 Score = 39.7 bits (91), Expect = 5.2,   Method: Compositional matrix adjust.
 Identities = 36/186 (19%), Positives = 74/186 (39%), Gaps = 6/186 (3%)

Query: 7   PVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTW 66
           PV    E + +A+V  G +     +T++KS L +    L FH+  +       +   + W
Sbjct: 35  PVKHGTETMHLAVVACG-DRLEETITMLKSALLFSTKFLQFHIFAEDQLHYEFKATLNRW 93

Query: 67  SVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFAT 126
           + P+ +  + L        +    K         +L LP +L E +   + +DTD++F  
Sbjct: 94  AFPE-KFKYILYPITFPSENAAEWKKLFKPCASQRLFLPLILTE-VDSLLYVDTDILFLR 151

Query: 127 DIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNH-KPWPALGRGFYLADSVVEDVSWI 185
            +  +W+        Q   +     +  +G  W N     P  G+    +  ++ +++ I
Sbjct: 152 PLDDIWSFLGGFNSTQIAAMAPEHEEPRIG--WYNRFARHPYYGKTGINSGVMLMNMTRI 209

Query: 186 PNKHYS 191
             KH+ 
Sbjct: 210 RKKHFK 215


>gi|256082475|ref|XP_002577481.1| glycosyltransferase-related [Schistosoma mansoni]
 gi|350644937|emb|CCD60353.1| glycosyltransferase-related [Schistosoma mansoni]
          Length = 237

 Score = 60.8 bits (146), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 32/86 (37%), Positives = 46/86 (53%), Gaps = 2/86 (2%)

Query: 213 IVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTG 272
           IVLDTD++  TDI +LW  F    Q Q+IG+   Q+  +   + K    W   G G+N G
Sbjct: 142 IVLDTDILLNTDIIELWNHFENFSQTQSIGIGLEQNPHFQDVMPKLESDWE--GYGYNNG 199

Query: 273 VILLDLTKLRDISWAGFWRIIAEKFL 298
           V+L DL+K+R   W   W ++    L
Sbjct: 200 VLLFDLSKIRLTLWNELWIVMTRNVL 225



 Score = 40.0 bits (92), Expect = 4.0,   Method: Compositional matrix adjust.
 Identities = 20/54 (37%), Positives = 28/54 (51%)

Query: 116 IVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALG 169
           IVLDTD++  TDI +LW  F    Q Q+IG+   Q+  +   + K    W   G
Sbjct: 142 IVLDTDILLNTDIIELWNHFENFSQTQSIGIGLEQNPHFQDVMPKLESDWEGYG 195


>gi|345319859|ref|XP_001507088.2| PREDICTED: glucoside xylosyltransferase 1-like, partial
           [Ornithorhynchus anatinus]
          Length = 253

 Score = 60.8 bits (146), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 38/169 (22%), Positives = 72/169 (42%), Gaps = 18/169 (10%)

Query: 202 PKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNH-K 260
           PK++ + +   + +DTD++F   +  +W+        Q   +     +  +G  W N   
Sbjct: 16  PKLILKEVDSLLYVDTDILFLRPVDDIWSFLKEFNSTQIAAMAPEHEEPRIG--WYNRFA 73

Query: 261 PWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWTSLAD 308
             P  G+ G N+GV+L+++T++R            + W      + +K+ L   W    D
Sbjct: 74  RHPYYGKTGINSGVMLMNMTRMRRKYFKNDMTTARLQWGDILMPLLKKYKLNITW---GD 130

Query: 309 QDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
           QD+ N +   +P  ++  PCQWN +             E   + I+H N
Sbjct: 131 QDLLNIMFFHNPESLFVFPCQWNYRPDHCIYGSNCQEAEEEGIFILHGN 179


>gi|354504228|ref|XP_003514179.1| PREDICTED: glucoside xylosyltransferase 1-like [Cricetulus griseus]
          Length = 486

 Score = 60.8 bits (146), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 42/173 (24%), Positives = 76/173 (43%), Gaps = 19/173 (10%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L E +   + +DTD++F   +  +W+L  +    Q   +     +  +G  W 
Sbjct: 246 RLFLPLILKE-VDSLLYVDTDILFLRPVDDIWSLLKKFNSTQIAAMAPEHEEPRIG--WY 302

Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
           N     P  G+ G N+GV+L+++T++R            + W      + +K+ L   W 
Sbjct: 303 NRFARHPYYGKTGVNSGVMLMNMTRMRRKYFKNDMTTARLQWGDILMPLLKKYKLNITW- 361

Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
              DQD+ N +   +P  ++  PCQWN +             E   + I+H N
Sbjct: 362 --GDQDLLNIMFFHNPESLFVFPCQWNYRPDHCIYGSNCREAEEEGVFILHGN 412



 Score = 45.4 bits (106), Expect = 0.096,   Method: Compositional matrix adjust.
 Identities = 35/149 (23%), Positives = 62/149 (41%), Gaps = 5/149 (3%)

Query: 13  EVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQVE 72
           E + +A+V  G       VT++KS L +   PL FH+  +    N  +    +WS  Q  
Sbjct: 160 EKMHLAVVACG-ERLEETVTMLKSALIFSIKPLQFHIFAEDQLHNSFKERLDSWSFLQ-R 217

Query: 73  VSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLW 132
            ++ L        S +  K         +L LP +L E +   + +DTD++F   +  +W
Sbjct: 218 FNYTLYSITFPSDSAVEWKKLFKPCASQRLFLPLILKE-VDSLLYVDTDILFLRPVDDIW 276

Query: 133 ALFSRLRQRQTIGLVENQSDWYLGKLWKN 161
           +L  +    Q   +     +  +G  W N
Sbjct: 277 SLLKKFNSTQIAAMAPEHEEPRIG--WYN 303


>gi|313231957|emb|CBY09069.1| unnamed protein product [Oikopleura dioica]
          Length = 384

 Score = 60.8 bits (146), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 64/277 (23%), Positives = 105/277 (37%), Gaps = 96/277 (34%)

Query: 477 NDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRS------ 530
           NDVT+  Q S + L  V  L++ W GPIS+ ++  + EA    S+A  ++A+ S      
Sbjct: 18  NDVTVATQCSANHLHHVVKLWERWSGPISVAVFAPNLEA----SFA--TDAILSMQKCWP 71

Query: 531 --RRNIGYHVVYK-------------------------------------EGNF-YPINT 550
             ++ + +H+VY                                      +G+  YP N 
Sbjct: 72  DMKKRVTFHLVYPTQHKADVSESTGTLDYNSCEELRQIIHDYGGDDSQNYKGDIPYPHNV 131

Query: 551 LRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHG----HGGKKVLVVPAFE- 605
           LRN A    +T +VFL+D+D +P   +             HG     G K V VVP FE 
Sbjct: 132 LRNAARKGAATEFVFLIDVDVMPSLHMREGFNEFANKFK-HGLYEEKGDKAVFVVPVFEI 190

Query: 606 ------------------TQRYRTAFPAS-----HAPTNFSRWVNATTPYQIEWAPD--- 639
                             T++    F  +     H P +   W+       +E A +   
Sbjct: 191 LKGHQCPDDKADLKDAQMTRKEVRPFHNATCRWCHEPEDHDNWLMKPKNEHMEVAFNATW 250

Query: 640 ------------FEPYIVAHRDLPRYDTRFVGFGWNK 664
                       +EP+ +A R +P +D RF  +G+++
Sbjct: 251 NKIGFDCLYLLSWEPFYIARRSVPLFDERFKQYGFDR 287


>gi|242012431|ref|XP_002426936.1| N-acetyllactosaminide beta-1,3-N-acetylglucosaminyltransferase,
           putative [Pediculus humanus corporis]
 gi|212511165|gb|EEB14198.1| N-acetyllactosaminide beta-1,3-N-acetylglucosaminyltransferase,
           putative [Pediculus humanus corporis]
          Length = 305

 Score = 60.8 bits (146), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 42/149 (28%), Positives = 67/149 (44%), Gaps = 33/149 (22%)

Query: 546 YPINTLRNVALNQVSTPYVFLLDIDFLPMFGLY-PYLKSSIRSMDMHGHGGKKVLVVPAF 604
           YP+N LRN+A +Q  T Y+  +DID LP   L   +LKS +            V VVP F
Sbjct: 94  YPVNFLRNLAKSQSVTDYILTIDIDMLPSENLRNDFLKSKLVE---KYQWNDVVFVVPIF 150

Query: 605 ETQR-----------------------YRTAFPASHAPTNFSRWV--NATTP----YQIE 635
           E +                        Y        + TN++ W+  N T+     Y++E
Sbjct: 151 EIEHDVQMPQSKNDLLECVKMGKARPFYLEPCEKCQSLTNYTEWLKNNKTSGIEILYEVE 210

Query: 636 WAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           W   +EP+ ++++ +P +D RF  +G N+
Sbjct: 211 WKYPWEPFFISNKKVPNFDERFKQYGMNR 239


>gi|195122672|ref|XP_002005835.1| GI18863 [Drosophila mojavensis]
 gi|193910903|gb|EDW09770.1| GI18863 [Drosophila mojavensis]
          Length = 385

 Score = 60.5 bits (145), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 61/276 (22%), Positives = 103/276 (37%), Gaps = 83/276 (30%)

Query: 468 DEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEA 527
           ++Y  SS  + + L  Q S++RL  +  +  +W GP+S+ L+ +  E    L Y      
Sbjct: 46  EQYTESSKISLICLATQSSVERLYSLPQVAANWIGPMSVALFSAGNEEYIVLKYFVTYMR 105

Query: 528 L---RSRRNIGYHVVY-------------------------------------------- 540
           L     + N  +H+++                                            
Sbjct: 106 LCFPNVKANATFHILFPQEYESLPGHLTEPLNIQKKFNCRNPEKTLKFLLKLRTAKTLEW 165

Query: 541 KEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLV 600
           ++ N YP N +RN+A     + YVFL DID +P       L +  +++D   +      V
Sbjct: 166 RKRNVYPQNHMRNLARKGCQSKYVFLTDIDIIPSVNSVVQLNNFFKTVDCEKNCA---YV 222

Query: 601 VPAFETQRYRTAFPASHA------------------------PTNFSRWV----NATTPY 632
           +P FE    R +FP S                           TNFS+W+    N T+  
Sbjct: 223 IPTFEID-VRASFPQSKESLLALTKKGLARPFHEKVFIYNQYATNFSKWLTLKGNETSVS 281

Query: 633 QIEWAPDF----EPYIVAHRDLPRYDTRFVGFGWNK 664
                 +F    EP+ +A   +P +D RF G+G+ +
Sbjct: 282 ISHIVTNFEFLYEPFYIALDTVPIHDERFTGYGFTR 317


>gi|432942747|ref|XP_004083054.1| PREDICTED: glucoside xylosyltransferase 1-like [Oryzias latipes]
          Length = 420

 Score = 60.5 bits (145), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 44/198 (22%), Positives = 84/198 (42%), Gaps = 22/198 (11%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L + +   + +D+D++F   + +LW   +R  + Q   +     +  +   W 
Sbjct: 178 RLFLPLILKD-VDSVVYVDSDILFLRPVDELWGFLTRFDRSQLAAMAPEHEEPRIA--WY 234

Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
           N     P  GR G N+GV+L+++T++R            + W      + +K+ L   W 
Sbjct: 235 NRFAMHPFYGRTGVNSGVMLMNMTRMRSAYFKNDMTPVGLLWEELLMPLLQKYKLNITW- 293

Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWNSPKKLKV 364
              DQD+ N I   +P  +   PC+WN +             E   + ++H N   +   
Sbjct: 294 --GDQDLLNIIFHHNPERLLEFPCKWNYRPDHCIYGSNCASAEEDGVCVMHGN---RGVY 348

Query: 365 KNKHMEFFRNLYLTFLEY 382
            ++    FR +Y    +Y
Sbjct: 349 HDQKQPAFRAVYQAIKKY 366


>gi|426200366|gb|EKV50290.1| hypothetical protein AGABI2DRAFT_115347 [Agaricus bisporus var.
           bisporus H97]
          Length = 487

 Score = 60.5 bits (145), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 45/152 (29%), Positives = 73/152 (48%), Gaps = 16/152 (10%)

Query: 1   MSSDEEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQ 60
           +S ++EP     E +   +V     S      L+KSIL Y  +P  FH++ D  A   L+
Sbjct: 116 LSENQEP-----EPVVFVLVVWSRPSAIEGALLVKSILMYNTSPSEFHIVCDDDAEEFLR 170

Query: 61  TLFSTWSVPQ--VEVSFY------LADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETL 112
           + F+    PQ  V V FY      + D +  + + I   H +G+ GL+KL L +++P  +
Sbjct: 171 SRFALVHHPQRDVLVRFYKLTWQSMLDRIQREGT-ISTDHSAGLPGLMKLFLHEIVP--V 227

Query: 113 AKTIVLDTDVIFATDIAQLWALFSRLRQRQTI 144
            + I +DTD  F +D   LW +F   +    I
Sbjct: 228 RRGIYIDTDAFFISDPTLLWGVFKDFKASTAI 259


>gi|412992029|emb|CCO20755.1| glycosyltransferase family 8 [Bathycoccus prasinos]
          Length = 369

 Score = 60.5 bits (145), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 49/166 (29%), Positives = 68/166 (40%), Gaps = 34/166 (20%)

Query: 213 IVLDTDVIFATDIAQLWALFSRLRQRQTIGLVE----NQSDWYLGKLWKNHKPWPALGR- 267
           I +DTDV+F      LW +FS     Q  G+          +Y  +L KNH P       
Sbjct: 159 IYVDTDVLFLKSPLLLWNIFSCFSPLQKFGMANETNGGTGGYYRKRLSKNHVPCSQGSCK 218

Query: 268 --------------GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFN 313
                         G N+GV L++LT++R   +    RI    F          DQD++N
Sbjct: 219 CKNGLRCKRFYGLYGLNSGVALMNLTRIRRSDFQSRIRIAMTSFKF------YGDQDVYN 272

Query: 314 AIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTD--LKIIHWN 357
            + +E    V+ LPC WN+      RSD  C     D  L I+H N
Sbjct: 273 HLFAERND-VHVLPCIWNI------RSDSRCDFAKNDESLGILHGN 311


>gi|391343843|ref|XP_003746215.1| PREDICTED: N-acetyllactosaminide
           beta-1,3-N-acetylglucosaminyltransferase-like
           [Metaseiulus occidentalis]
          Length = 453

 Score = 60.1 bits (144), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 66/286 (23%), Positives = 108/286 (37%), Gaps = 94/286 (32%)

Query: 467 VDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEA----QQFLSYA 522
           V E + S+  + V L    SMDRL  +  + +HW+GP+S+ +++   E     +QF+   
Sbjct: 108 VAEDQESTSRSQVCLATLSSMDRLLGLMDIAQHWDGPVSIAVFIRTTEELQALRQFIRIL 167

Query: 523 G-NSEALRSRRNIGYHVVYKEGN------------------------------------- 544
           G  S+A+R   N  +H+V    N                                     
Sbjct: 168 GICSDAVRL--NFAFHLVLPISNRSRIDLHAKDIVSRSSESTSCADQLKSLLPAPPPVPM 225

Query: 545 -----------FYPINTLRNVALN---QVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDM 590
                       +P N LRNVA +   ++ + + FL DID +P     P L S +     
Sbjct: 226 KSLKNVDLLKLLFPQNHLRNVARDGCERMGSRFFFLTDIDIIPK----PALASQLNEFLQ 281

Query: 591 HGHGGKKVLVVPAFETQRY-----------------------RTAFPASHAPTNFSRW-- 625
                K++ VVP +E   +                       +  F  +   TN   W  
Sbjct: 282 PRQASKEIFVVPTYEMHDHAEMPQSKTDLLKMVSVGYARPFHKIVFIHNQYATNHDLWER 341

Query: 626 VNATTPYQIEWAPD-------FEPYIVAHRDLPRYDTRFVGFGWNK 664
           +   +P +I  A         +EP+ VA   +PR+D RFVG+G+ +
Sbjct: 342 IPRKSPDKIHVAYKINNYEFFYEPFYVAPFSVPRHDERFVGYGFTR 387


>gi|449481334|ref|XP_002193848.2| PREDICTED: glucoside xylosyltransferase 1 [Taeniopygia guttata]
          Length = 629

 Score = 60.1 bits (144), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 51/221 (23%), Positives = 89/221 (40%), Gaps = 30/221 (13%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L + +   + +DTD++F   +  +W+   +    Q   +     +  +G  W 
Sbjct: 389 RLFLPLIL-KNVDSLLYVDTDILFLRPVDDIWSFLRKFNSTQIAAMAPEHEEPRIG--WY 445

Query: 258 NH-KPWPALG-RGFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
           N     P  G  G N+GV+L+++T++R            + W      + +K+ L   W 
Sbjct: 446 NRFARHPYYGVTGINSGVMLMNMTRIRRKYFKNDMTPVRLQWGEILMPLLKKYKLNITW- 504

Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWNSPKKLKV 364
              DQD+ N +   +P  +Y  PCQWN +             E   + I+H N   +   
Sbjct: 505 --GDQDLLNIMFFHNPESLYVFPCQWNYRPDHCIYGSNCKAAEEEGIFILHGN---RGVY 559

Query: 365 KNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQTTLQQAE 405
            +     FR +Y     Y         FG +L  + LQ  E
Sbjct: 560 HDDKQPTFRAVYEAIKNYS--------FGDDLVHSLLQPLE 592


>gi|170027891|ref|XP_001841830.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167868300|gb|EDS31683.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 412

 Score = 60.1 bits (144), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 74/323 (22%), Positives = 119/323 (36%), Gaps = 86/323 (26%)

Query: 33  LIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQVEVSFY--------------LA 78
            +KS+L Y   P+H H+ITD  +    + L         +V FY              + 
Sbjct: 121 FLKSLLKYTTIPMHLHVITDEQSEGSAEELIREQIAHYRKVVFYTLYDVKDCAEKISDIV 180

Query: 79  DSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRL 138
            S++   ++    +YS    LL L L +++   + + I++D DV+F + + +L+  F   
Sbjct: 181 KSMLPLFNYKTGSYYSDALFLLSLGLHRIVDVNMKRAILIDCDVVFRSSVKELFDQFDHF 240

Query: 139 RQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFYLADSVVEDVSWIPNKHYSGVYGLLK 198
                 GL    +  YL  L+K            Y A           N H   ++G   
Sbjct: 241 GPEHLFGLAPELTPVYLHILYK------------YRAQ----------NSH--TLFG--- 273

Query: 199 LTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKN 258
              P  L +T   TI              +   FSR  ++   G                
Sbjct: 274 --TPYYLNKTTTTTIG-----------NSIGGFFSRTAKKTKHG---------------- 304

Query: 259 HKPWPALGRGFNTGVILLDLTKLR------DISWAGFWRIIAEKFLLTRLWTSLADQDIF 312
              +P    G N+GV++L L ++R      +I      + I EK+        L DQD +
Sbjct: 305 ---YP----GLNSGVVMLQLDRIRQSRLYEEIIKPDSVKNIVEKY---SFKGHLGDQDFY 354

Query: 313 NAIISEHPYLVYTLPCQWNVQLS 335
             +  E P L+Y L C WN QL 
Sbjct: 355 TLMGFEFPGLIYRLDCVWNRQLC 377


>gi|449272005|gb|EMC82135.1| Glycosyltransferase 8 domain-containing protein 3, partial [Columba
           livia]
          Length = 368

 Score = 60.1 bits (144), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 42/173 (24%), Positives = 74/173 (42%), Gaps = 19/173 (10%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L + +   + +DTDV+F   +  +W+   +    Q   +     +  +G  W 
Sbjct: 128 RLFLPLIL-KNVDSLLYVDTDVLFLRPVDDIWSFLGKFNSTQIAAMAPEHEEPRIG--WY 184

Query: 258 NH-KPWPALG-RGFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
           N     P  G  G N+GV+L+++T++R            + W      + +K+ L   W 
Sbjct: 185 NRFARHPYYGVTGINSGVMLMNMTRIRRKYFKNDMTPVRLQWGEILMPLLKKYKLNITW- 243

Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
              DQD+ N +   +P  +Y  PCQWN +             E   + I+H N
Sbjct: 244 --GDQDLLNIMFFHNPESLYVFPCQWNYRPDHCIYGSNCKEAEEEGIFILHGN 294


>gi|432858866|ref|XP_004068977.1| PREDICTED: glucoside xylosyltransferase 2-like [Oryzias latipes]
          Length = 429

 Score = 60.1 bits (144), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 42/172 (24%), Positives = 72/172 (41%), Gaps = 17/172 (9%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L + +   + +DTDV+F   +  +W       + Q   +        +G  + 
Sbjct: 184 RLFLPVILKD-VDSLLYVDTDVLFLRPMDDIWGFLKSFNETQLAAMAPEHEVPKIG-WYS 241

Query: 258 NHKPWPALG-RGFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWTS 305
                P  G  G N+GV+L++LTK+R            +SW      + +K+     W  
Sbjct: 242 RFARHPFYGVTGVNSGVMLMNLTKIRSTLFKNSMISSGLSWGDLLHPLYQKYKSHITW-- 299

Query: 306 LADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
             DQD+ N I  ++P  ++  PCQWN +             E   + I+H N
Sbjct: 300 -GDQDLLNIIFHDNPECLFIFPCQWNYRPDHCMYGSNCKAAEEEGVSILHGN 350


>gi|340729238|ref|XP_003402913.1| PREDICTED: n-acetyllactosaminide
           beta-1,3-N-acetylglucosaminyltransferase-like [Bombus
           terrestris]
          Length = 492

 Score = 60.1 bits (144), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 65/277 (23%), Positives = 96/277 (34%), Gaps = 91/277 (32%)

Query: 474 SDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEA----QQFLSYAGNS-EAL 528
           S    V L  Q S+++L  V     HW GP+S+ LY +  E     Q++L Y     E +
Sbjct: 150 SQAYRVCLATQSSVEKLHSVVQAAHHWTGPMSVALYAAGDEEFEVLQKYLIYLRKCYEPI 209

Query: 529 RSRRNIGYHV---------------------VYKEG-------------------NFYPI 548
           R R      V                        EG                   N YP 
Sbjct: 210 RERVAFSLAVPRVRPPKKQPREFELPEVVDCAKPEGTLNELMNGISNEQTNWRIRNVYPQ 269

Query: 549 NTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQR 608
           N +RN+A     + YVFL D+D +P F L   L   +R+        K   V+P +E   
Sbjct: 270 NHMRNLARKNCQSDYVFLTDVDIVPSFNLTGALDEFLRT----DTCDKCAYVIPTYELDS 325

Query: 609 YRTAFPASHA------------------------PTNFSRWVNATTPYQIEWAPD----- 639
            R  FP +                           TNF+RW+  T+     +        
Sbjct: 326 -RVRFPQNKTELVRLARKGLARPFHQKVFIHNQFATNFTRWILDTSVNHKNFGNVKTGKV 384

Query: 640 ------------FEPYIVAHRDLPRYDTRFVGFGWNK 664
                       +EP+ VA   +P +D RF+G+G+ +
Sbjct: 385 YISHDVTNFEFLYEPFYVAKDIVPAHDERFMGYGYTR 421


>gi|307174885|gb|EFN65151.1| Uncharacterized protein C3orf21-like protein [Camponotus
           floridanus]
          Length = 371

 Score = 60.1 bits (144), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 43/166 (25%), Positives = 69/166 (41%), Gaps = 19/166 (11%)

Query: 186 PNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVE 245
           P  +YS     L L + K+ P       + D D  F  D+  L+  F+   +    GL  
Sbjct: 173 PGTYYSDALFFLSLGMHKIAPSHQKLAAMFDADTKFRRDVKDLFEEFNNFGKEALFGLAP 232

Query: 246 NQSDWY--LGKLWKNHKPWPALGR--------GFNTGVILLDLTKLR------DISWAGF 289
             +  Y  +  L++   P    G         G+N+G++L +L +LR      +I     
Sbjct: 233 ELTPVYRHVLYLYRTKHPTTMFGEPRHKGGYPGYNSGMVLFNLERLRKSLEYNEILTKNS 292

Query: 290 WRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLS 335
             I+ EK+        L DQD +  +  E P L++T+ C WN QL 
Sbjct: 293 VDIMTEKY---HFKGHLGDQDFYTLLGMERPELIHTVDCGWNRQLC 335


>gi|157120271|ref|XP_001653581.1| hypothetical protein AaeL_AAEL001590 [Aedes aegypti]
 gi|108883094|gb|EAT47319.1| AAEL001590-PA [Aedes aegypti]
          Length = 418

 Score = 60.1 bits (144), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 69/322 (21%), Positives = 120/322 (37%), Gaps = 89/322 (27%)

Query: 33  LIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQVEVSFYL----------ADSVV 82
            +KS+L Y   P+H H+ITD  +    + L         +V FY              +V
Sbjct: 130 FLKSLLKYTTIPIHLHVITDEQSETSAEELIREQIAHYRKVVFYTLYDVKDCAEKTSDIV 189

Query: 83  EDV----SWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRL 138
           +++    ++    +YS    L+ L L +++   + + I++D DV+F + + +L+  F+R 
Sbjct: 190 KNMLPLFNYKTGSYYSDALFLISLGLHRIVDVNMKRAIIIDCDVVFRSSVKELFDEFNRF 249

Query: 139 RQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFYLADSVVEDVSWIPNKHYSGVYGLLK 198
                 GL    +  YL  L+K     P                          ++G   
Sbjct: 250 SSDHLFGLAPELTPVYLHILFKYRAQNPHT------------------------IFG--- 282

Query: 199 LTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKN 258
                  P  L KT + ++          +   FSR  ++   G                
Sbjct: 283 ------TPYFLNKTAIGNS----------VGGFFSRTAKKTKHG---------------- 310

Query: 259 HKPWPALGRGFNTGVILLDLTKLR------DISWAGFWRIIAEKFLLTRLWTSLADQDIF 312
              +P L    N+GV++L L ++R      +I      + + EK+        L DQD +
Sbjct: 311 ---YPGL----NSGVVMLRLDRIRQSRLYEEIIKQESVKNMVEKY---SFKGHLGDQDFY 360

Query: 313 NAIISEHPYLVYTLPCQWNVQL 334
             +  E P L+Y L C WN QL
Sbjct: 361 TLMGFEFPGLIYRLDCVWNRQL 382


>gi|443710199|gb|ELU04497.1| hypothetical protein CAPTEDRAFT_192236 [Capitella teleta]
          Length = 225

 Score = 60.1 bits (144), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 45/151 (29%), Positives = 67/151 (44%), Gaps = 33/151 (21%)

Query: 546 YPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFE 605
           YP + +RN+AL +V T Y FL D+DF+P   +Y      I+S +          VV AFE
Sbjct: 38  YPSSIIRNLALEEVFTEYAFLSDVDFMPAPSVYKTSVDLIKSKN-RLLSKTTAFVVAAFE 96

Query: 606 TQRYRT------AFPAS-------------------------HAPTNFSRWVNATTPYQI 634
            +R  T       FP +                         H  TN+  W  A T Y+ 
Sbjct: 97  LKRNTTPDGDAVKFPRNKSDLLALQNSTLIQPFHYHRERHYGHFSTNYEAWNIAKTTYKA 156

Query: 635 EWAPDFEPYIVAHR-DLPRYDTRFVGFGWNK 664
           +W   +EPY++ +   +PRY+  FVG  ++K
Sbjct: 157 KWVGLYEPYVIVNTCAVPRYNKAFVGRYFDK 187


>gi|349604448|gb|AEP99996.1| Glycosyltransferase 8 domain-containing protein 3-like protein,
           partial [Equus caballus]
          Length = 257

 Score = 59.7 bits (143), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 42/173 (24%), Positives = 75/173 (43%), Gaps = 19/173 (10%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L E +   + +DTD++F   +  +W+L  +    Q   +     +  +G  W 
Sbjct: 17  RLFLPLILKE-VDSLLYVDTDILFLRPVDDIWSLLKKFNSTQIAAMAPEHEEPRIG--WY 73

Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
           N     P  G+ G N+GV+L+++T++R            + W      + +K+ L   W 
Sbjct: 74  NRFARHPYYGKTGVNSGVMLMNMTRMRRKYFKNDMTTVRLRWGDILMPLLKKYKLNITWG 133

Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
               QD+ N I   +P  ++  PCQWN +             E   + I+H N
Sbjct: 134 C---QDLLNIIFFHNPESLFVFPCQWNYRPDHCIYGSNCQEAEEEGIFILHGN 183


>gi|348526548|ref|XP_003450781.1| PREDICTED: glucoside xylosyltransferase 1-like [Oreochromis
           niloticus]
          Length = 409

 Score = 59.7 bits (143), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 45/176 (25%), Positives = 74/176 (42%), Gaps = 25/176 (14%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSD----WYLG 253
           +L LP +L + +   + +D+D++F   +  LW L S     Q   +     +    WY  
Sbjct: 170 RLFLPLILKD-VDSIVYVDSDILFLQPVDLLWRLLSDFNSSQLAAMAPEHEEPRIAWY-- 226

Query: 254 KLWKNHKPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTR 301
             +  H   P  GR G N+GV+L++LT++R            + W      + +K+ L  
Sbjct: 227 NRFARH---PFYGRTGINSGVMLMNLTRMRSMFFKNDMTSVGLRWEELLMPLLQKYKLNI 283

Query: 302 LWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
            W    DQD+ N I   +P  +   PCQWN +             E   + I+H N
Sbjct: 284 TW---GDQDLLNIIFHYNPESLLEFPCQWNYRPDHCIYGSNCASAEEDGIYILHGN 336


>gi|423295529|ref|ZP_17273656.1| hypothetical protein HMPREF1070_02321 [Bacteroides ovatus
           CL03T12C18]
 gi|392672453|gb|EIY65921.1| hypothetical protein HMPREF1070_02321 [Bacteroides ovatus
           CL03T12C18]
          Length = 324

 Score = 59.7 bits (143), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 43/135 (31%), Positives = 67/135 (49%), Gaps = 11/135 (8%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP+VLP TL K + LD D+I    I  LW +  +     TIG VE+  +  +     
Sbjct: 95  RLMLPEVLPVTLDKILYLDCDIIVNGRIESLWNIDLKY---YTIGAVED--NIVISSEAP 149

Query: 258 NHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIIS 317
               +P     FN GV+L++L+ +RDI +     +  E+ L   ++    DQDI N ++ 
Sbjct: 150 RRLGYPVQSSYFNAGVMLMNLSLMRDIQFTKNAFVYIEQHLKEIVY---HDQDILNVLLY 206

Query: 318 EHPYLVYTLPCQWNV 332
           +       LP +WNV
Sbjct: 207 DQKLF---LPIKWNV 218


>gi|71896989|ref|NP_001025907.1| glucoside xylosyltransferase 1 [Gallus gallus]
 gi|82082155|sp|Q5ZKI6.1|GXLT1_CHICK RecName: Full=Glucoside xylosyltransferase 1; AltName:
           Full=Glycosyltransferase 8 domain-containing protein 3
 gi|53130856|emb|CAG31757.1| hypothetical protein RCJMB04_10i5 [Gallus gallus]
          Length = 433

 Score = 59.7 bits (143), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 42/173 (24%), Positives = 75/173 (43%), Gaps = 19/173 (10%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L + +   + +DTD++F   +  +W+   +    Q   +     +  +G  W 
Sbjct: 193 RLFLPLILKD-VDSLLYVDTDILFLRPVDDIWSFLRKFDSTQIAAMAPEHEEPRIG--WY 249

Query: 258 NH-KPWPALG-RGFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
           N     P  G  G N+GV+L+++T++R            + WA     + +K+ L   W 
Sbjct: 250 NRFARHPYYGVTGINSGVMLMNMTRIRRKYFKNDMTSVRLRWAEILMPLLKKYKLNITW- 308

Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
              DQD+ N +   +P  +Y  PCQWN +             E   + I+H N
Sbjct: 309 --GDQDLLNIMFFHNPESLYVFPCQWNYRPDHCIYGSNCKEAEEEGIFILHGN 359


>gi|198424870|ref|XP_002119916.1| PREDICTED: similar to glycosyltransferase 8 domain containing 4
           [Ciona intestinalis]
          Length = 401

 Score = 59.7 bits (143), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 43/154 (27%), Positives = 68/154 (44%), Gaps = 25/154 (16%)

Query: 13  EVIQVAIVCAG---YNSTRSL-------VTLIKSILFYRKNPLHFHLITDTVALNILQTL 62
           EV+ +A V  G   Y   R +       +T+IKS   +  NPLH H+ T+   +++ QT 
Sbjct: 49  EVMHLAAVSCGRPNYQGIRDIKKPMDQTLTMIKSASLFAHNPLHIHIFTENDMISMFQTE 108

Query: 63  FSTW----------SVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETL 112
              W          S+  ++   YL  +++    W   K +    G  +L LP VL +  
Sbjct: 109 IDAWPARVRSRVTYSIRPIDYERYLPPTLI--TQW---KKWYKPCGSFRLLLPMVLKDVT 163

Query: 113 AKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGL 146
              I  D+DVIF   I +LWA    + +RQ   +
Sbjct: 164 DAAIYSDSDVIFIKPIDELWAQLKEMDERQVAAI 197



 Score = 57.0 bits (136), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 56/207 (27%), Positives = 81/207 (39%), Gaps = 21/207 (10%)

Query: 195 GLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLG- 253
           G  +L LP VL +     I  D+DVIF   I +LWA    + +RQ   +         G 
Sbjct: 149 GSFRLLLPMVLKDVTDAAIYSDSDVIFIKPIDELWAQLKEMDERQVAAISPTAGHPLGGS 208

Query: 254 KLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFW-----------RIIAEKFLLTRL 302
           KL +N           NTGV++L+ T++    W  F            R+     LL   
Sbjct: 209 KLNENFISHSTGLFQINTGVMVLNFTRMLSSDWLLFHDEHKTDQSELRRVKYGIDLLLPY 268

Query: 303 WTSLA-----DQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELC-YTELTDLKIIHW 356
           +         DQ + N +   +P L+  LPC WN + +     D  C   E      +H 
Sbjct: 269 YDEYKEKAEHDQKLLNIMFHFNPSLLRQLPCSWNFKNNFCQDEDNTCSVAERGGAGAVHG 328

Query: 357 NSPKKLKVKNKHMEFFRNLYLTFLEYD 383
            +       N     FR LY +FL+YD
Sbjct: 329 ITSAFFNDVNPT---FRALYESFLQYD 352


>gi|159466980|ref|XP_001691676.1| hypothetical protein CHLREDRAFT_206250 [Chlamydomonas reinhardtii]
 gi|158279022|gb|EDP04784.1| hypothetical protein CHLREDRAFT_206250 [Chlamydomonas reinhardtii]
          Length = 496

 Score = 59.7 bits (143), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 27/53 (50%), Positives = 35/53 (66%), Gaps = 1/53 (1%)

Query: 613 FPASHAPTNFSRWVNATTPYQIEWAPDFEPYIVAHRD-LPRYDTRFVGFGWNK 664
           FPA H+ TNFSRW  A  PY + +  +FEP+ VA RD +P +D R  G+G NK
Sbjct: 84  FPAGHSATNFSRWFRAAEPYTVRYVRNFEPWFVAGRDSVPWHDVRLRGYGQNK 136


>gi|47227367|emb|CAF96916.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 322

 Score = 59.7 bits (143), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 44/172 (25%), Positives = 74/172 (43%), Gaps = 17/172 (9%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L +  A  + +DTDV+F   +  +W+LF      Q   +        +G  + 
Sbjct: 92  RLFLPVILKDVDA-LLYVDTDVLFLRPMDDIWSLFRSFNSTQLAAMAPEHEVPKIG-WYS 149

Query: 258 NHKPWPALG-RGFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWTS 305
                P  G  G N+GV+L++LT++R            +SW      + +K+     W  
Sbjct: 150 RFARHPFYGVTGVNSGVMLMNLTRIRRATFKNSMIPARLSWEDLLHPLYQKYKNHITW-- 207

Query: 306 LADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
             DQD+ N I  ++P  ++  PCQWN +             E   + I+H N
Sbjct: 208 -GDQDLLNIIFHDNPECLFIFPCQWNYRPDHCMYGSNCRGAEKDGVSILHGN 258


>gi|55727735|emb|CAH90618.1| hypothetical protein [Pongo abelii]
          Length = 164

 Score = 59.3 bits (142), Expect = 6e-06,   Method: Composition-based stats.
 Identities = 23/37 (62%), Positives = 32/37 (86%)

Query: 5   EEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYR 41
           ++PV+  CE I VAIVCAGYN++R +VTL+KS+LF+R
Sbjct: 128 QQPVVEKCETIHVAIVCAGYNASRDVVTLVKSVLFHR 164


>gi|383859939|ref|XP_003705449.1| PREDICTED: N-acetyllactosaminide
           beta-1,3-N-acetylglucosaminyltransferase-like [Megachile
           rotundata]
          Length = 489

 Score = 59.3 bits (142), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 65/277 (23%), Positives = 99/277 (35%), Gaps = 91/277 (32%)

Query: 474 SDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEA----QQFLSYAGNS-EAL 528
           S    V L  Q S+++L  +     HW GP+S+ LY +  E     Q++L Y     E +
Sbjct: 147 SQAYRVCLATQSSVEKLHSLVQAAHHWTGPMSVALYAAGDEEFDVLQKYLVYLRRCYEPI 206

Query: 529 RSRRNIGYHV---------------------VYKEG-------------------NFYPI 548
           R R +    V                        EG                   N YP 
Sbjct: 207 RERVSFSLAVPRNRPPKKQPVDHELPDTVDCAKPEGTLNQLMNGISNEQTNWRIHNVYPQ 266

Query: 549 NTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQR 608
           N +RN+A     T YVFL D+D +P + L   L   +R+        K   V+P +E   
Sbjct: 267 NHMRNLARKNCQTDYVFLTDVDIVPSYNLTGALDEFLRT----DSCDKCAYVIPTYELDS 322

Query: 609 YRTAFPASHA------------------------PTNFSRWVNATT----------PYQI 634
            R  FP +                           TNF+RW+  T             ++
Sbjct: 323 -RVRFPQNKTELVRLARKGLARPFHQKVFIHNQFATNFTRWILDTALNHRNFENVRSGKV 381

Query: 635 EWAPD-------FEPYIVAHRDLPRYDTRFVGFGWNK 664
             + D       +EP+ VA   +P +D RF+G+G+ +
Sbjct: 382 YISHDVTNFEFLYEPFYVARDIVPPHDERFMGYGYTR 418


>gi|327273109|ref|XP_003221325.1| PREDICTED: LOW QUALITY PROTEIN: glucoside xylosyltransferase 1-like
           [Anolis carolinensis]
          Length = 442

 Score = 59.3 bits (142), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 40/173 (23%), Positives = 76/173 (43%), Gaps = 19/173 (10%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L + +   + +DTD++F   +  +W+   ++   Q   +     +  +G  W 
Sbjct: 202 RLFLPLILSD-VDSLLYVDTDILFLRPVDDIWSFLKKMNSSQIAAMAPEHEEPRIG--WY 258

Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
           N     P  G+ G N+GV+L+++T++R            + W      + +K+ L   W 
Sbjct: 259 NRFARHPYYGKTGINSGVMLMNMTRIRRKYFKNDMTTVRLRWEQILMPLLKKYKLNITW- 317

Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
              DQD+ N +   +P  ++  PCQWN +             E   + I+H N
Sbjct: 318 --GDQDLLNIMFFHNPESLFVFPCQWNYRPDHCIYGSNCQQAEKEGIFILHGN 368


>gi|427783557|gb|JAA57230.1| Putative gt8 like 2 represents a subfamily protein of gt8 with
           unknown function [Rhipicephalus pulchellus]
          Length = 384

 Score = 59.3 bits (142), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 48/202 (23%), Positives = 90/202 (44%), Gaps = 30/202 (14%)

Query: 157 KLWKNHKPWPALGRGFYLADSVVEDVS------WIPNKHYSGVYGLLK------LTLPKV 204
           ++ + +  WP         DSV + VS      W P ++Y   + + +      L L ++
Sbjct: 109 RMRREYASWP---------DSVKKRVSCDVMPVWFPKENYYQWWAMFRPCTTQRLFLTRM 159

Query: 205 LPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPA 264
           LP   A  + +DTDVIF   I   W  F  + + Q  G+     ++ +   +   K    
Sbjct: 160 LPNEDA-VLYVDTDVIFVHPIEDFWRKFYAMNEWQMAGMAPETENYTMN--YYTTKALHP 216

Query: 265 LGRGF--NTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYL 322
             + F  N G+++++LT++R          + ++F     W   ADQD+ N + S +P  
Sbjct: 217 FVQPFALNAGLLMMNLTRMRAFDLENRVMRLKQEFEGRIPW---ADQDLLNIVFSRYPQG 273

Query: 323 VYTLPCQWNVQLSDNTRSDELC 344
           ++T  C+WN +  ++   + LC
Sbjct: 274 IFTFTCRWNFR-GEHCHGEALC 294


>gi|402221737|gb|EJU01805.1| hypothetical protein DACRYDRAFT_116245 [Dacryopinax sp. DJM-731
           SS1]
          Length = 876

 Score = 59.3 bits (142), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 39/139 (28%), Positives = 66/139 (47%), Gaps = 7/139 (5%)

Query: 14  VIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVP--QV 71
            I+ A+V  G ++    VT+IK IL +   P+H H++ D  A+  +    +    P   +
Sbjct: 167 TIEFALVIMGESAAVEAVTMIKGILMHISTPVHLHIVCDEAAMYYIDDRLTLVHRPAYDI 226

Query: 72  EVSFYLADS----VVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATD 127
            VS+ L  S         + I + +++G  GL K+ L ++L ET  + I +D D +F  D
Sbjct: 227 HVSYNLLSSSQIWARGARAGIGSTYHAGAGGLAKVFLHEILTET-KRAIFIDIDTLFTVD 285

Query: 128 IAQLWALFSRLRQRQTIGL 146
              LW  F + +  Q   L
Sbjct: 286 PLLLWKEFDKFKPGQVFAL 304


>gi|410920623|ref|XP_003973783.1| PREDICTED: glucoside xylosyltransferase 2-like [Takifugu rubripes]
          Length = 442

 Score = 59.3 bits (142), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 43/172 (25%), Positives = 73/172 (42%), Gaps = 17/172 (9%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L   +   + +DTDV+F   +  +W+LF      Q   +        +G  + 
Sbjct: 197 RLFLPVIL-RDVDSLLYVDTDVLFLRPMDDIWSLFRSFNATQLAAMAPEHEVPKIG-WYS 254

Query: 258 NHKPWPALG-RGFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWTS 305
                P  G  G N+GV+L++LT++R            +SW      + +K+     W  
Sbjct: 255 RFARHPFYGVTGVNSGVMLMNLTRIRSAAFKNSMIPARLSWEDLLHPLYQKYKNHITW-- 312

Query: 306 LADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
             DQD+ N I  ++P  ++  PCQWN +             E   + I+H N
Sbjct: 313 -GDQDLLNIIFHDNPECLFIFPCQWNYRPDHCMYGSNCRGAEKDGVSILHGN 363


>gi|301618797|ref|XP_002938793.1| PREDICTED: glucoside xylosyltransferase 2-like [Xenopus (Silurana)
           tropicalis]
          Length = 422

 Score = 59.3 bits (142), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 41/172 (23%), Positives = 75/172 (43%), Gaps = 17/172 (9%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L + +   + +DTDV+F   +  +WA   R    Q   +        +G  + 
Sbjct: 177 RLFLPMLL-QDVDSLLYVDTDVLFLRPLDHIWAFLRRFNDTQLAAMAPEHEISKIG-WYS 234

Query: 258 NHKPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWTS 305
                P  G  G N+GV+L++LT++R+           ++W      + +K+   + + +
Sbjct: 235 RFARHPFYGTTGVNSGVMLMNLTRIRNQQFKNNMIPAGLTWEEMLHPLYQKY---KNYIT 291

Query: 306 LADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
             DQD+ N I   +P ++Y  PC WN +             E   + I+H N
Sbjct: 292 WGDQDLLNIIFYFNPEMLYVFPCHWNYRPDHCMYGSNCKAAEEEGVSILHGN 343


>gi|443717909|gb|ELU08746.1| hypothetical protein CAPTEDRAFT_217021 [Capitella teleta]
          Length = 365

 Score = 59.3 bits (142), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 49/206 (23%), Positives = 84/206 (40%), Gaps = 19/206 (9%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L +P +L   +   + +DTD++F   I  +W  F      Q   L         G  W 
Sbjct: 145 RLFIPDIL-RDVDSVLYVDTDILFMRPIEDVWDHFKHFNSSQIAALSPEHESPNAG--WY 201

Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAI 315
           N     P  G+ G N+GV+L++LT++R + W        +++ L   W    DQD+ N  
Sbjct: 202 NRFARHPYYGKLGVNSGVMLMNLTRMRQVGWLDHIIPYYKEYKLKITW---GDQDLINIF 258

Query: 316 ISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWNSPKKLKVKNKHMEFFRNL 375
               P  +Y   C WN +             E   + ++H     +    N+    F+++
Sbjct: 259 FHYFPDKLYIYNCDWNYRPDHCMYMSNCKQAESNGVSVLHGC---RGVYHNEKQPVFKSI 315

Query: 376 YLTFLEYDGNLLRRELFGCNLTQTTL 401
           Y  F +Y         FG +L++T L
Sbjct: 316 YEAFRDYK--------FGDDLSKTLL 333


>gi|409080739|gb|EKM81099.1| hypothetical protein AGABI1DRAFT_71889 [Agaricus bisporus var.
           burnettii JB137-S8]
          Length = 430

 Score = 59.3 bits (142), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 63/260 (24%), Positives = 115/260 (44%), Gaps = 64/260 (24%)

Query: 462 INPNNVDEYEASSDG----NDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQ 517
           + P+ +  Y   + G    +D+T+ + ++ +R +++  L + ++GPIS+TL++++A    
Sbjct: 72  MKPSQIIPYYLRASGVFHKHDITITSIITSNRFEILARLAQRYQGPISVTLHINNATE-- 129

Query: 518 FLSYAGNSEALRSRRN--------IGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDI 569
             +     E+L +  N        +  H+V    +    NT RN+A     T YV +LD+
Sbjct: 130 --NVPHLLESLHNLYNTTPNMAVYVDVHLVLDTFD-RQFNTWRNIARLFARTEYVMMLDV 186

Query: 570 DFLPMFGLYPYLKSSIRSMDMHG------------HGGKKVLVVPAFETQRYRTA----- 612
           DF         L++ +R +D +G            + G    VVPAFE   Y+       
Sbjct: 187 DFYICTDFRETLRNLLR-VDGNGEIGTGENIREMFNNGVAAFVVPAFEYISYKEGREYKT 245

Query: 613 FPAS-----------------------HAPTNFSRWVNATTPYQI----EWAPDFEPYIV 645
           FP +                       H  T++ R+ +A  P +I    ++   +EPY +
Sbjct: 246 FPKTKEHLMQLVQQRRIGMFHRVWAPGHNSTDYDRFYSA-KPGEIYKVTQFHSAYEPYTI 304

Query: 646 AHRDLPRY-DTRFVGFGWNK 664
             +D P + D RF+G+G NK
Sbjct: 305 FRKDGPPWCDERFIGYGGNK 324


>gi|395544504|ref|XP_003774150.1| PREDICTED: N-acetyllactosaminide
           beta-1,3-N-acetylglucosaminyltransferase [Sarcophilus
           harrisii]
          Length = 358

 Score = 58.9 bits (141), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 59/220 (26%), Positives = 87/220 (39%), Gaps = 48/220 (21%)

Query: 480 TLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDA-EAQQFLSYAGNSE---ALRSRRNIG 535
            L  Q   DRL M      H   P      + D  E  +F   +  S+   +L +R   G
Sbjct: 86  ALCTQAVRDRLTM------HLVCPARTEAAVPDPREPGEFARISSCSDVFTSLGARSRSG 139

Query: 536 YHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGG 595
            +        YP N LRNVA       +  ++D D +P  GL+  L   +   +      
Sbjct: 140 RNYALAANVSYPNNLLRNVAREGAG--HALVVDADMVPSEGLWTGLLELLALGEGKA--- 194

Query: 596 KKVLVVPAFETQR-----------------------YRTAFPASHAPTNFSRWVNATTP- 631
            + LVVPAFE ++                       Y    P   APTN++RW+  T P 
Sbjct: 195 GRALVVPAFEIRQGVRVPRTKMELMRFLKMGHVRPFYAELCPRCQAPTNYTRWL--TLPP 252

Query: 632 -------YQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
                  Y++ W   +EP+ VA   +P YD RF  +G+N+
Sbjct: 253 GGPLRPAYRVTWQDPWEPFYVAGGAVPAYDERFRQYGFNR 292


>gi|148222987|ref|NP_001086684.1| glucoside xylosyltransferase 2 precursor [Xenopus laevis]
 gi|82182587|sp|Q6DE37.1|GXLT2_XENLA RecName: Full=Glucoside xylosyltransferase 2; AltName:
           Full=Glycosyltransferase 8 domain-containing protein 4
 gi|50416401|gb|AAH77307.1| MGC80221 protein [Xenopus laevis]
          Length = 423

 Score = 58.9 bits (141), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 41/172 (23%), Positives = 75/172 (43%), Gaps = 17/172 (9%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L + +   + +DTDV+F   +  +WA   R    Q   +        +G  + 
Sbjct: 178 RLFLPMLL-QDVDSLLYVDTDVLFLRPLDHVWAFLRRFNDTQLAAMAPEHEISKIG-WYS 235

Query: 258 NHKPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWTS 305
                P  G  G N+GV+L++LT++R+           ++W      + +K+   + + +
Sbjct: 236 RFARHPFYGTTGVNSGVMLMNLTRIRNQHFKNNMIPAGLTWEEMLHPLYQKY---KNYIT 292

Query: 306 LADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
             DQD+ N I   +P ++Y  PC WN +             E   + I+H N
Sbjct: 293 WGDQDLLNIIFYFNPEMLYVFPCHWNYRPDHCMYGSNCKAAEEEGVSILHGN 344


>gi|393229277|gb|EJD36903.1| hypothetical protein AURDEDRAFT_174070 [Auricularia delicata
           TFB-10046 SS5]
          Length = 412

 Score = 58.9 bits (141), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 36/127 (28%), Positives = 63/127 (49%), Gaps = 8/127 (6%)

Query: 18  AIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQVEVSFYL 77
           A+V  G  S      ++KSIL     P+  H+I    +  ++Q   +  + P+ +V    
Sbjct: 67  AMVMFGAGSAEEGQHMLKSILMQSSRPIEVHIICSNDSAPVIQQRLALVTRPRHDVRAVF 126

Query: 78  ADSVVEDV------SWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQL 131
                E +      + I +KH++GV GL K+ + +++P    + + +DTD +FATD   L
Sbjct: 127 YPVTPEAILARAARAGIGSKHHAGVGGLAKMLIHELIPAE--RALYVDTDAMFATDPVLL 184

Query: 132 WALFSRL 138
           W  F+RL
Sbjct: 185 WRAFTRL 191



 Score = 45.1 bits (105), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 21/51 (41%), Positives = 34/51 (66%), Gaps = 2/51 (3%)

Query: 185 IPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRL 235
           I +KH++GV GL K+ + +++P    + + +DTD +FATD   LW  F+RL
Sbjct: 143 IGSKHHAGVGGLAKMLIHELIPAE--RALYVDTDAMFATDPVLLWRAFTRL 191


>gi|350417157|ref|XP_003491284.1| PREDICTED: N-acetyllactosaminide
           beta-1,3-N-acetylglucosaminyltransferase-like [Bombus
           impatiens]
          Length = 492

 Score = 58.9 bits (141), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 64/277 (23%), Positives = 95/277 (34%), Gaps = 91/277 (32%)

Query: 474 SDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEA----QQFLSYAGNS-EAL 528
           S    V L  Q S+++L  V     HW GP+S+ LY +  E     Q++L Y     E +
Sbjct: 150 SQAYRVCLATQSSVEKLHSVVQAAHHWTGPMSVALYAAGDEEFEVLQKYLIYLRKCYEPI 209

Query: 529 RSRRNIG---------------------YHVVYKEG-------------------NFYPI 548
           R R                               EG                   N YP 
Sbjct: 210 RERVAFSLAXXXVRPPKRQPREFELPEVVDCAKPEGTLNELMNGISNEQTNWRIRNVYPQ 269

Query: 549 NTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQR 608
           N +RN+A     + YVFL D+D +P F L   L   +R+        K   V+P +E   
Sbjct: 270 NHMRNLARKNCQSDYVFLTDVDIVPSFNLTGALDEFLRT----DTCDKCAYVIPTYELDS 325

Query: 609 YRTAFPASHA------------------------PTNFSRWVNATTPYQIEWAPD----- 639
            R  FP +                           TNF+RW+  T+     +        
Sbjct: 326 -RVRFPQNKTELVRLARKGLARPFHQKVFIHNQFATNFTRWILDTSVNHKNFGNVKTGKV 384

Query: 640 ------------FEPYIVAHRDLPRYDTRFVGFGWNK 664
                       +EP+ VA   +P +D RF+G+G+ +
Sbjct: 385 YISHDVTNFEFLYEPFYVAKDIVPAHDERFMGYGYTR 421


>gi|47204580|emb|CAG04707.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 184

 Score = 58.9 bits (141), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 44/150 (29%), Positives = 63/150 (42%), Gaps = 25/150 (16%)

Query: 209 LAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGK---------LWKNH 259
           L + + LD D+ +  +I +L+  F R      +G+       Y  +          W+  
Sbjct: 1   LTRIVQLDLDLKYRENIRELFQEFHRFPPGAVMGIAREMQPVYSTEESATPDTHTFWQYR 60

Query: 260 K---------PWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLT-----RLWTS 305
           K         P P    GFN+GV+LLDL  +R  S   + +++  + L       R    
Sbjct: 61  KENPQTKVGEPHPDGLPGFNSGVMLLDLAAMRASSL--YNQLLEPEHLAQLAAQYRFRGH 118

Query: 306 LADQDIFNAIISEHPYLVYTLPCQWNVQLS 335
           L DQD F  I  EHP L Y L C WN QL 
Sbjct: 119 LGDQDFFTMIGMEHPELFYPLACGWNRQLC 148


>gi|301606488|ref|XP_002932855.1| PREDICTED: glucoside xylosyltransferase 1 isoform 2 [Xenopus
           (Silurana) tropicalis]
          Length = 456

 Score = 58.9 bits (141), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 54/222 (24%), Positives = 95/222 (42%), Gaps = 31/222 (13%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L E +   + +DTD++F   +  +W+        Q   +     +  +G  W 
Sbjct: 217 RLFLPLILTE-VDSLLYVDTDILFLRPLDDIWSFLGGFNSTQIAAMAPEHEEPRIG--WY 273

Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
           N     P  G+ G N+GV+L+++T++R            + W      + +K+ L   W 
Sbjct: 274 NRFARHPYYGKTGINSGVMLMNMTRIRKKHFKNDMTAVRLHWGDLLLPLLKKYKLNITW- 332

Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWNSPKKLKV 364
              DQD+ N + S +P  ++  PCQWN +  D+      C+     + I+H N   +   
Sbjct: 333 --GDQDLLNIMFSHNPESLFVFPCQWNYR-PDHCIYGSNCHEAEGGIFILHGN---RGVY 386

Query: 365 KNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQTTLQQAEL 406
            ++    FR +Y  F  Y         FG +L  + L   EL
Sbjct: 387 HDRKQPAFRAVYDAFRNYR--------FGDDLFHSLLYPLEL 420


>gi|38348470|ref|NP_941014.1| glucoside xylosyltransferase 2 precursor [Mus musculus]
 gi|81873556|sp|Q810K9.1|GXLT2_MOUSE RecName: Full=Glucoside xylosyltransferase 2; AltName:
           Full=Glycosyltransferase 8 domain-containing protein 4
 gi|29437118|gb|AAH49816.1| Glycosyltransferase 8 domain containing 4 [Mus musculus]
          Length = 444

 Score = 58.9 bits (141), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 43/176 (24%), Positives = 74/176 (42%), Gaps = 25/176 (14%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQS----DWYLG 253
           +L LP +L + +   + +DTDV+F   +  +W L  +    Q   +          WY  
Sbjct: 197 RLFLPAIL-KDVDSLLYVDTDVLFLRPVDDIWKLLRQFNSTQLAAMAPEHEIPKIGWY-- 253

Query: 254 KLWKNHKPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTR 301
             +  H   P  G  G N+GV+L++LT++R+           ++W      + +K+    
Sbjct: 254 SRFARH---PFYGSAGVNSGVMLMNLTRIRNTQFKNSLIPAGLAWEEMLLPLYQKYKSAI 310

Query: 302 LWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
            W    DQD+ N I   +P  +Y  PCQWN +             E   + ++H N
Sbjct: 311 TW---GDQDLLNIIFYYNPECLYVFPCQWNYRPDHCMYGSNCKEAEREGVSVLHGN 363


>gi|157120962|ref|XP_001653726.1| hypothetical protein AaeL_AAEL009190 [Aedes aegypti]
 gi|108874756|gb|EAT38981.1| AAEL009190-PA [Aedes aegypti]
          Length = 403

 Score = 58.5 bits (140), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 46/189 (24%), Positives = 78/189 (41%), Gaps = 40/189 (21%)

Query: 183 SWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIG 242
           ++    +YS    L+ L L +++   + + I++D DV+F + + +L+  F+R       G
Sbjct: 182 NYKTGSYYSDALFLISLGLHRIVDVNMKRAIIIDCDVVFRSSVKELFDEFNRFSSDHLFG 241

Query: 243 LVENQSDWYLGKLWKNHKPWP-------------ALGR------------------GFNT 271
           L    +  YL  L+K     P             A+G                   G N+
Sbjct: 242 LAPELTPVYLHILFKYRAQNPHTLFGTPYFLNKTAIGNSVGGFFSRTAKKTKHGYPGLNS 301

Query: 272 GVILLDLTKLR------DISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYT 325
           GV++L L ++R      +I      + + EK+        L DQD +  +  E P L+Y 
Sbjct: 302 GVVMLRLDRIRQSRLYEEIIKQESVKNMVEKY---SFKGHLGDQDFYTLMGFEFPGLIYR 358

Query: 326 LPCQWNVQL 334
           L C WN QL
Sbjct: 359 LDCVWNRQL 367



 Score = 51.2 bits (121), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 37/165 (22%), Positives = 72/165 (43%), Gaps = 16/165 (9%)

Query: 33  LIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQVEVSFYL----------ADSVV 82
            +KS+L Y   P+H H+ITD  +    + L         +V FY              +V
Sbjct: 115 FLKSLLKYTTIPIHLHVITDEQSETSAEELIREQIAHYRKVVFYTLYDVKDCAEKTSDIV 174

Query: 83  EDV----SWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRL 138
           +++    ++    +YS    L+ L L +++   + + I++D DV+F + + +L+  F+R 
Sbjct: 175 KNMLPLFNYKTGSYYSDALFLISLGLHRIVDVNMKRAIIIDCDVVFRSSVKELFDEFNRF 234

Query: 139 RQRQTIGLVENQSDWYLGKLWK--NHKPWPALGRGFYLADSVVED 181
                 GL    +  YL  L+K     P    G  ++L  + + +
Sbjct: 235 SSDHLFGLAPELTPVYLHILFKYRAQNPHTLFGTPYFLNKTAIGN 279


>gi|66828487|ref|XP_647598.1| hypothetical protein DDB_G0268160 [Dictyostelium discoideum AX4]
 gi|74859220|sp|Q55FD5.1|GNT12_DICDI RecName: Full=Glycosyltransferase-like protein gnt12
 gi|60475598|gb|EAL73533.1| hypothetical protein DDB_G0268160 [Dictyostelium discoideum AX4]
          Length = 550

 Score = 58.5 bits (140), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 44/187 (23%), Positives = 84/187 (44%), Gaps = 44/187 (23%)

Query: 473 SSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRR 532
           + DG ++++V  + ++ ++ + M+   W  PIS ++ + +      + Y     +L    
Sbjct: 125 NEDGFEISIVTHVPIENMEKIAMIADIWRAPISASVLIKNKNDIDSV-YKLIRNSLSVSE 183

Query: 533 NIGYHVVY-------------------------KEGN----FYPINTLRNVALNQVSTPY 563
            + +H +Y                         +E N    +YPIN+LRN++L    T +
Sbjct: 184 FVDFHFLYWNEDSDSIINNNNNNIINKNKINTNEEDNLNYYYYPINSLRNLSLKNSKTDW 243

Query: 564 VFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVL-------VVPAFETQRYRTAFPAS 616
           +  +DID+LP +G+Y YL+   R++       KK++       VVP+F+     T+    
Sbjct: 244 ILTIDIDYLPNYGIYQYLE---RTLYTSLQPSKKLINSDLVSFVVPSFQL----TSISTK 296

Query: 617 HAPTNFS 623
            APT  S
Sbjct: 297 TAPTTIS 303


>gi|397567044|gb|EJK45361.1| hypothetical protein THAOC_36024 [Thalassiosira oceanica]
          Length = 735

 Score = 58.5 bits (140), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 59/240 (24%), Positives = 95/240 (39%), Gaps = 57/240 (23%)

Query: 474 SDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMS------DAEAQQFLSYAGNSEA 527
           +D    T+V QLS +R+ ++E     W+GP+SL ++          E ++          
Sbjct: 431 ADSISYTMVTQLSTNRMWLMERQCSVWKGPMSLGIFTDRNMTSLQDELEKLGCDVDRIAI 490

Query: 528 LRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRS 587
            R RR+      YK  + YP+N LRN+AL+ V T + F  DIDF+    +   L SS  +
Sbjct: 491 ERVRRD-----AYKSED-YPVNILRNLALSNVKTSHFFYTDIDFMLSRDVESSL-SSPEA 543

Query: 588 MDMHGHGGKKVLVVPAFE-----TQRYRTAF----------------------------- 613
           + +     K+ +VVPAF      +  Y+                                
Sbjct: 544 LAILSKDPKQTIVVPAFMANAHCSDDYKDCIRPKASEIPSTKPELLRGMLKGDVKVFDVE 603

Query: 614 --PASHAPTNFSRWVNATTPYQIEWAP-----DFEPYIVAH--RDLPRYDTRFVGFGWNK 664
             P  H  T +  W+    P  I   P      +EPY+ A    ++P +   F G+  +K
Sbjct: 604 TNPGGHGSTRYDVWMEQ-GPNTIATIPCLSSDRYEPYVAAQYCHEMPPFQESFQGYSSDK 662



 Score = 57.0 bits (136), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 58/234 (24%), Positives = 100/234 (42%), Gaps = 53/234 (22%)

Query: 478 DVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRNIGYH 537
           D+TL  QL  +RL +++     W+  ISL + +SD   +   S + + +++  R NI   
Sbjct: 111 DITLTVQLDEERLWVMKHHCLAWKRQISLAV-LSD---RMKSSISNDLKSMGCRGNIILS 166

Query: 538 VV----YKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGH 593
            +    YK    YP+N LRN+A+   +T ++  +D DF    G    L +      +   
Sbjct: 167 TLEASKYKRDE-YPVNLLRNIAIQASTTSHIVFVDADFFMPKGTDEVLMARTIKEKL-AR 224

Query: 594 GGKKVLVVPAFE---------------TQR----------------------YRTAFPAS 616
             K+ LV+PAF+               +Q+                      +    P  
Sbjct: 225 DAKQALVLPAFQLNYKCSDTTSADQCRSQKLEAMPQNIIDLHRGLDDGSVSMFDPKNPHG 284

Query: 617 HAPTNFSRW--VNATTPYQIEW--APDFEPYIVAHR--DLPRYDTRFVGFGWNK 664
           H  T +++W   +A++   I    +  +EPY+V  R  D+P +   F G+GWNK
Sbjct: 285 HDSTGYNKWRKQSASSVEDISCINSERYEPYLVVRRCEDMPPFPEAFAGYGWNK 338


>gi|326911414|ref|XP_003202054.1| PREDICTED: glucoside xylosyltransferase 1-like [Meleagris
           gallopavo]
          Length = 382

 Score = 58.2 bits (139), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 41/173 (23%), Positives = 74/173 (42%), Gaps = 19/173 (10%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L + +   + +DTD++F   +  +W+   +    Q   +     +  +G  W 
Sbjct: 142 RLFLPLILKD-VDSLLYVDTDILFLRPVDDIWSFLRKFNSTQIAAMAPEHEEPRIG--WY 198

Query: 258 NH-KPWPALG-RGFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
           N     P  G  G N+GV+L+++T++R            + W      + +K+ L   W 
Sbjct: 199 NRFARHPYYGVTGINSGVMLMNMTRIRRKYFKNDMTTVRLRWGEILMPLLKKYKLNITW- 257

Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
              DQD+ N +   +P  +Y  PCQWN +             E   + I+H N
Sbjct: 258 --GDQDLLNIMFFHNPESLYVFPCQWNYRPDHCIYGSNCKEAEEEGIFILHGN 308


>gi|301606486|ref|XP_002932854.1| PREDICTED: glucoside xylosyltransferase 1 isoform 1 [Xenopus
           (Silurana) tropicalis]
          Length = 487

 Score = 58.2 bits (139), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 54/222 (24%), Positives = 95/222 (42%), Gaps = 31/222 (13%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L E +   + +DTD++F   +  +W+        Q   +     +  +G  W 
Sbjct: 248 RLFLPLILTE-VDSLLYVDTDILFLRPLDDIWSFLGGFNSTQIAAMAPEHEEPRIG--WY 304

Query: 258 NH-KPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWT 304
           N     P  G+ G N+GV+L+++T++R            + W      + +K+ L   W 
Sbjct: 305 NRFARHPYYGKTGINSGVMLMNMTRIRKKHFKNDMTAVRLHWGDLLLPLLKKYKLNITW- 363

Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWNSPKKLKV 364
              DQD+ N + S +P  ++  PCQWN +  D+      C+     + I+H N   +   
Sbjct: 364 --GDQDLLNIMFSHNPESLFVFPCQWNYR-PDHCIYGSNCHEAEGGIFILHGN---RGVY 417

Query: 365 KNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQTTLQQAEL 406
            ++    FR +Y  F  Y         FG +L  + L   EL
Sbjct: 418 HDRKQPAFRAVYDAFRNYR--------FGDDLFHSLLYPLEL 451


>gi|300175814|emb|CBK21357.2| unnamed protein product [Blastocystis hominis]
          Length = 615

 Score = 58.2 bits (139), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 71/291 (24%), Positives = 119/291 (40%), Gaps = 51/291 (17%)

Query: 418 LRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEINPNNVDEYEASSDGN 477
           +RRS  +  R     +E  +E    G    ++ + + +R +    N +    + A     
Sbjct: 251 VRRSADSPARCDGCKVETPFELRVSGGSALILRKFAGNREEFTIYN-STFYRFPAEDAPF 309

Query: 478 DVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRNIGYH 537
           DVT V   + DRL ++  L   WEGP+ +      +E + F+ Y  N+  L +R     +
Sbjct: 310 DVTGVCVTTRDRLTVLTQLLYRWEGPLVIVFNGVVSEEKAFIDYL-NTHFLPARLTTLLY 368

Query: 538 VVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKK 597
           ++ + G F+P+N LRN+A+  + T +  +LD+D  P    Y  L     S+      G+ 
Sbjct: 369 LMPQRG-FFPVNLLRNLAIRNIRTTHFQVLDMDLWPSPNTYRELLRLPPSL----RDGRV 423

Query: 598 VLVVPAFETQR----------YRTAFPA-SHAPTN------------------------- 621
             ++P F   R          Y  A  A  H P+N                         
Sbjct: 424 AAILPVFFFDRKQVLNRCKSFYDCALLALEHLPSNKTALEACLYAKACLSSKPGIRTHLY 483

Query: 622 -FSRW--VNATTPY-QIE-WAPDF-EPYIVAH--RDLPRYDTRFVGFGWNK 664
               W  + A  P+ Q+  +  DF EPY++       P +D RFV +G+NK
Sbjct: 484 VLPDWFTIPADIPFVQVRCFLTDFQEPYVLLRYAPTTPLFDERFVNYGYNK 534


>gi|358060807|dbj|GAA93578.1| hypothetical protein E5Q_00222 [Mixia osmundae IAM 14324]
          Length = 565

 Score = 58.2 bits (139), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 56/230 (24%), Positives = 104/230 (45%), Gaps = 40/230 (17%)

Query: 472 ASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSD--AEAQQFLSYAGN-SEAL 528
           A  +  D+T+   ++ +R +++  L + ++GP+S+ +++ +  AE ++ L+   +  E+ 
Sbjct: 243 AEDNKRDITITTLVTANRFKVLTDLVEKYQGPVSVCIHVVNIPAEREELLTSLRDIYESH 302

Query: 529 RSRRN-IGYHVVYKEGNF-YPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIR 586
            + +N +  H+V    NF    N  RNVA     T YV +LD+DF         + +S  
Sbjct: 303 EAMKNWVDVHLVID--NFDRQFNMWRNVAKYFARTDYVMMLDVDFWICTDFRHRILASPD 360

Query: 587 SMDMHGHGGKKVLVVPAFETQR----------------------------YRTAFPASHA 618
            +D     G    VVPAFE  +                            +  ++   H 
Sbjct: 361 IIDKLS-AGTAAFVVPAFEFTKQVDGLDSKTFPTEKKDLLELVEGGKIGMFHKSWAPGHG 419

Query: 619 PTNFSRWVNATTP--YQIE-WAPDFEPYIVAHRD-LPRYDTRFVGFGWNK 664
            TN++R+  +     Y++  +   +EPYI+  R+  P  D RF+G+G NK
Sbjct: 420 STNYTRFYESKEGEIYRVHGYTHSYEPYIIYKREGTPYCDERFIGYGGNK 469


>gi|291394020|ref|XP_002713234.1| PREDICTED: glucoside xylosyltransferase 2 [Oryctolagus cuniculus]
          Length = 501

 Score = 58.2 bits (139), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 43/176 (24%), Positives = 73/176 (41%), Gaps = 25/176 (14%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQS----DWYLG 253
           +L LP +L + +   + +DTDV+F   +  +W L  +    Q   +          WY  
Sbjct: 253 RLFLPVIL-KDVDSLLYVDTDVLFLRPVDDIWKLLRKFNSTQLAAMAPEHEIPKIGWY-- 309

Query: 254 KLWKNHKPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTR 301
             +  H   P  G  G N+GV+L++LT++R            ++W      + +K+    
Sbjct: 310 SRFARH---PFYGSAGVNSGVMLMNLTRIRSTQFKNSVIPTGLAWGDMLYPLYQKYKNAI 366

Query: 302 LWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
            W    DQD+ N I   +P  +Y  PCQWN +             E   + ++H N
Sbjct: 367 TW---GDQDLLNIIFYFNPECLYVFPCQWNYRPDHCMYGSNCKEAEREGVSVLHGN 419


>gi|395824579|ref|XP_003785540.1| PREDICTED: glucoside xylosyltransferase 2 [Otolemur garnettii]
          Length = 319

 Score = 57.8 bits (138), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 41/172 (23%), Positives = 72/172 (41%), Gaps = 17/172 (9%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L + +   + +DTDV+F   +  +W L  +    Q   +        +G  + 
Sbjct: 72  RLFLPVIL-KDVDSLLYVDTDVLFLRPVDDIWKLLRQFNSTQLAAMAPEHETPKIG-WYS 129

Query: 258 NHKPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWTS 305
                P  G  G N+GV+L++LT++R            ++W      + +K+     W  
Sbjct: 130 RFARHPFYGSAGLNSGVMLMNLTRIRSAQFKNSMIPTGLAWEDMLYPLYQKYKNAITW-- 187

Query: 306 LADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
             DQD+ N I   +P  +Y  PCQWN +             E   + ++H N
Sbjct: 188 -GDQDLLNIIFYFNPECLYVFPCQWNYRPDHCMYGSNCKEAEREGVSVLHGN 238


>gi|189237113|ref|XP_971629.2| PREDICTED: similar to conserved hypothetical protein [Tribolium
           castaneum]
 gi|270007231|gb|EFA03679.1| hypothetical protein TcasGA2_TC013781 [Tribolium castaneum]
          Length = 351

 Score = 57.8 bits (138), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 44/184 (23%), Positives = 82/184 (44%), Gaps = 22/184 (11%)

Query: 174 LADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFS 233
           + D +    S  P  +YS     + L L ++ P +    ++LD D+ F  DI  ++  F 
Sbjct: 133 IVDVMTPHFSSKPGTYYSDALFYISLGLYRIAPPSQRTALLLDCDLYFKKDIRLVFEEFD 192

Query: 234 RLRQRQTIGL----------------VENQS---DWYLGKLWKNHKPWPALGRGFNTGVI 274
           + +     GL                V+  S   ++Y  K   ++   P   +G+N+GV+
Sbjct: 193 KFKPTALFGLAPELTPVYRHVLQTYKVKKNSTFGEYYHSKAISSNMQHPRGFQGYNSGVV 252

Query: 275 LLDLTKLRDIS-WAGFWRIIAEKFLLT--RLWTSLADQDIFNAIISEHPYLVYTLPCQWN 331
           L++L+ +R    ++   R    ++L +  +    L DQD +  I  E+P L+ T+ C +N
Sbjct: 253 LINLSAIRKSKEYSQIIRNETVQYLTSKYKFRGHLGDQDFYTLIGFEYPDLIQTINCGFN 312

Query: 332 VQLS 335
            QL 
Sbjct: 313 RQLC 316


>gi|452823675|gb|EME30683.1| glycosyltransferase-like protein LARGE [Galdieria sulphuraria]
          Length = 380

 Score = 57.8 bits (138), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 46/147 (31%), Positives = 68/147 (46%), Gaps = 6/147 (4%)

Query: 189 HYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGL-VENQ 247
           HY G   L KL LP  L       +VLDTD+I   D+ +LW  +   RQ   +G+   ++
Sbjct: 168 HYHGKEALWKLLLPSFLQSD--NLLVLDTDMIILRDVLELWHRWRLTRQSVHLGMPCMSR 225

Query: 248 SDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLT-RLWTSL 306
            + +L  LW+   P         +G +L+   K+R  +W         K L     WT++
Sbjct: 226 DNSWLYPLWREIFPENGTLHCM-SGTLLMRPRKMRQENWINVVASSTNKILEDFPGWTAI 284

Query: 307 -ADQDIFNAIISEHPYLVYTLPCQWNV 332
            ADQDIFN +    P  V  + C+WN 
Sbjct: 285 SADQDIFNLVAKSIPEKVTNIGCEWNC 311



 Score = 53.5 bits (127), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 48/165 (29%), Positives = 76/165 (46%), Gaps = 18/165 (10%)

Query: 15  IQVAIVCAGYNSTRSLVTLIKSILFYRKNP---LHFHLITDTVALNILQTLFSTWSVPQV 71
           I VAIV  G N   ++  +I S++    +P    HFHLI+D   LN  +  ++ W+    
Sbjct: 78  IDVAIVSIGENDILNMKRMISSLVM-SSSPNVKFHFHLISDKKGLNAAKNWYTLWNSACA 136

Query: 72  EVSFY------LADSVVEDVSWIPNK-----HYSGVYGLLKLTLPKVLPETLAKTIVLDT 120
             + Y      L+ ++ +  S    K     HY G   L KL LP  L       +VLDT
Sbjct: 137 VWNIYDIESSKLSKALDQFHSLTGGKMRTAVHYHGKEALWKLLLPSFLQSD--NLLVLDT 194

Query: 121 DVIFATDIAQLWALFSRLRQRQTIGL-VENQSDWYLGKLWKNHKP 164
           D+I   D+ +LW  +   RQ   +G+   ++ + +L  LW+   P
Sbjct: 195 DMIILRDVLELWHRWRLTRQSVHLGMPCMSRDNSWLYPLWREIFP 239


>gi|312373082|gb|EFR20905.1| hypothetical protein AND_18321 [Anopheles darlingi]
          Length = 410

 Score = 57.8 bits (138), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 46/184 (25%), Positives = 73/184 (39%), Gaps = 41/184 (22%)

Query: 189 HYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQS 248
           +YS    LL L L +++   + + I++D DV+F   + +L+  F      Q  GL    +
Sbjct: 194 YYSDALFLLSLGLHRIVDRNMRRAILIDCDVVFRASVKELFDQFDNFSPDQLFGLAPELT 253

Query: 249 DWYLGKL--WKNHKPWPALGR------------------------------GFNTGVILL 276
             Y   L  ++ + P    G                               G N+GV++L
Sbjct: 254 PVYRHVLSRYRMNNPHTIFGSPYYLDSAGLSEAASVVSPPSGEKARRHGYPGLNSGVVML 313

Query: 277 DLTKLR------DISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQW 330
            L ++R      +I      R +AEK+        L DQD +  +  E P L+Y L C W
Sbjct: 314 HLDRIRRSRLYEEIIKESTVRNMAEKY---SFQGHLGDQDFYTLMGFEFPGLIYRLDCVW 370

Query: 331 NVQL 334
           N QL
Sbjct: 371 NRQL 374



 Score = 46.2 bits (108), Expect = 0.054,   Method: Compositional matrix adjust.
 Identities = 48/216 (22%), Positives = 91/216 (42%), Gaps = 26/216 (12%)

Query: 30  LVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQVEVSFY-------LADSVV 82
           L   ++S+L Y    LH H+ITD  +    + L           +FY        A+ + 
Sbjct: 118 LELFLRSLLKYSTIELHLHIITDEQSERSAEELIRQQIERFHRTAFYTLYDVRDCAEKIS 177

Query: 83  EDV-SWIP------NKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALF 135
           + V S +P        +YS    LL L L +++   + + I++D DV+F   + +L+  F
Sbjct: 178 DIVHSMMPLFNYRTGSYYSDALFLLSLGLHRIVDRNMRRAILIDCDVVFRASVKELFDQF 237

Query: 136 SRLRQRQTIGLVENQSDWYLGKL--WKNHKPWPALGRGFYLADSVVEDVSWI-------- 185
                 Q  GL    +  Y   L  ++ + P    G  +YL  + + + + +        
Sbjct: 238 DNFSPDQLFGLAPELTPVYRHVLSRYRMNNPHTIFGSPYYLDSAGLSEAASVVSPPSGEK 297

Query: 186 PNKH-YSGV-YGLLKLTLPKVLPETLAKTIVLDTDV 219
             +H Y G+  G++ L L ++    L + I+ ++ V
Sbjct: 298 ARRHGYPGLNSGVVMLHLDRIRRSRLYEEIIKESTV 333


>gi|242219718|ref|XP_002475635.1| predicted protein [Postia placenta Mad-698-R]
 gi|220725156|gb|EED79156.1| predicted protein [Postia placenta Mad-698-R]
          Length = 337

 Score = 57.8 bits (138), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 35/108 (32%), Positives = 57/108 (52%), Gaps = 9/108 (8%)

Query: 39  FYRKNPLHFHLITDTVALNILQTLFSTWSVP--QVEVSFYL------ADSVVEDVSWIPN 90
            Y   P+ FH++ D  A   L+      + P   + V FYL      AD +  + + I  
Sbjct: 1   MYTSMPVEFHIVCDEAAQAYLENRVRLITRPAHDILVRFYLLSWQSMADRIDREGT-IMT 59

Query: 91  KHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRL 138
            H +GV GL+KL + ++LP ++ + I +DTD +F +D A LW  F++L
Sbjct: 60  DHSAGVPGLMKLFIHEILPPSVPRAIFIDTDALFISDPALLWDRFAQL 107



 Score = 47.4 bits (111), Expect = 0.026,   Method: Compositional matrix adjust.
 Identities = 23/62 (37%), Positives = 40/62 (64%), Gaps = 1/62 (1%)

Query: 174 LADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFS 233
           +AD +  + + I   H +GV GL+KL + ++LP ++ + I +DTD +F +D A LW  F+
Sbjct: 47  MADRIDREGT-IMTDHSAGVPGLMKLFIHEILPPSVPRAIFIDTDALFISDPALLWDRFA 105

Query: 234 RL 235
           +L
Sbjct: 106 QL 107


>gi|158854046|ref|NP_001103670.1| glucoside xylosyltransferase 2 precursor [Danio rerio]
 gi|141795891|gb|AAI34948.1| Glt8d4 protein [Danio rerio]
          Length = 441

 Score = 57.8 bits (138), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 42/172 (24%), Positives = 71/172 (41%), Gaps = 17/172 (9%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L + +   + +DTDV+F   +  +W         Q   +        +G  + 
Sbjct: 195 RLFLPVIL-KDVDSLLYVDTDVLFLRPMDDIWKFLKAFNSTQLAAMAPEHEIPKIG-WYS 252

Query: 258 NHKPWPALG-RGFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWTS 305
                P  G  G N+GV+L++LT++R+           +SW      + +K+     W  
Sbjct: 253 RFARHPYYGVTGVNSGVMLMNLTRIRNTLFRNSMIASGLSWENLLHPLYQKYKNHITW-- 310

Query: 306 LADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
             DQD+ N I   +P  +Y  PCQWN +             E   + I+H N
Sbjct: 311 -GDQDLLNIIFHYNPECLYIFPCQWNYRPDHCMYGSNCKAAEEEGVSILHGN 361


>gi|428181726|gb|EKX50589.1| hypothetical protein GUITHDRAFT_103814 [Guillardia theta CCMP2712]
          Length = 294

 Score = 57.4 bits (137), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 43/122 (35%), Positives = 56/122 (45%), Gaps = 19/122 (15%)

Query: 478 DVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRS-RRNIGY 536
           DVTLV Q S+DRL  +      W+G +S  +Y    E +Q  +   N EALR   R I  
Sbjct: 77  DVTLVTQGSLDRLSRLRDQALSWKGSLSAAIYFRHHE-RQGENLEKNLEALRELHREIEM 135

Query: 537 ----------------HVVYKE-GNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYP 579
                               KE    YPIN LRNVA+    T  + ++DIDF+P  G + 
Sbjct: 136 LGSCRLRISLLFALDPETADKEYDTLYPINALRNVAVRMSYTDLIMVIDIDFVPSKGFWE 195

Query: 580 YL 581
           YL
Sbjct: 196 YL 197


>gi|336414250|ref|ZP_08594596.1| hypothetical protein HMPREF1017_01704 [Bacteroides ovatus
           3_8_47FAA]
 gi|335933362|gb|EGM95364.1| hypothetical protein HMPREF1017_01704 [Bacteroides ovatus
           3_8_47FAA]
          Length = 324

 Score = 57.4 bits (137), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 41/135 (30%), Positives = 66/135 (48%), Gaps = 11/135 (8%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP+VLP TL K + LD D+I  + I  LW +  +      IG VE+  +  +     
Sbjct: 95  RLMLPEVLPVTLDKILYLDCDIIVNSKIESLWNIDLKY---YAIGAVED--NIVISSEAP 149

Query: 258 NHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIIS 317
               +P     FN GV+L++L+ +RD  +     +  E+ L   ++    DQDI N ++ 
Sbjct: 150 RRLGYPVQSSYFNAGVMLMNLSLMRDTQFTKNAFVYIEQHLKEIVY---HDQDILNVLLY 206

Query: 318 EHPYLVYTLPCQWNV 332
           +       LP +WNV
Sbjct: 207 DQKLF---LPIKWNV 218


>gi|395516491|ref|XP_003762421.1| PREDICTED: glucoside xylosyltransferase 2 [Sarcophilus harrisii]
          Length = 452

 Score = 57.4 bits (137), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 43/176 (24%), Positives = 76/176 (43%), Gaps = 25/176 (14%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQS----DWYLG 253
           +L LP +L + +   + +DTDV+F   I  +W L  +    Q   +          WY  
Sbjct: 204 RLFLPVIL-KDVDSLLYVDTDVLFLRPIDDIWRLLKQFNSTQLAAMAPEHEIPKIGWY-- 260

Query: 254 KLWKNHKPWPALG-RGFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTR 301
             +  H   P  G  G N+GV+L++LT++R+           ++W      + +K+   +
Sbjct: 261 SRFARH---PYYGMTGVNSGVMLMNLTRIRNTQFKNSMIPTGLAWEDMLYPLYQKY---K 314

Query: 302 LWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
            + +  DQD+ N I   +P  +Y  PCQWN +             E   + ++H N
Sbjct: 315 NYITWGDQDLLNIIFYFNPECLYVFPCQWNYRPDHCMYGSNCKGAEQEGVSVLHGN 370


>gi|302680991|ref|XP_003030177.1| glycosyltransferase family 49 protein [Schizophyllum commune H4-8]
 gi|300103868|gb|EFI95274.1| glycosyltransferase family 49 protein [Schizophyllum commune H4-8]
          Length = 446

 Score = 57.4 bits (137), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 58/229 (25%), Positives = 98/229 (42%), Gaps = 47/229 (20%)

Query: 476 GNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQ------QFLSYAGNSEALR 529
            ND+T+   ++ +R  +   L + ++GPIS+ +++ +  +          +   +S  + 
Sbjct: 88  ANDITMTTLITSNRFHVFSRLVERYQGPISVAIHVKNVTSHVDDLLASLHTLYASSPLMA 147

Query: 530 SRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIR-SM 588
           +  ++   V   +  F   NT RNVA     T YV +LDID    F L    +  IR S 
Sbjct: 148 AHVDVHLIVDVFDRQF---NTWRNVARFFARTDYVMMLDID----FSLCTDFRRVIRESA 200

Query: 589 DMHGHGGKKV-LVVPAFETQRYRTA-----FPAS-----------------------HAP 619
           D+ G   + V  V+ AFE            FP +                       H  
Sbjct: 201 DLRGRLDEGVAFVITAFEYTNLEEGTNQEVFPRTKEDLLDLVNVGRIGMFHASWQPGHNS 260

Query: 620 TNFSRWVNATTP--YQI-EWAPDFEPYIVAHRDLPRY-DTRFVGFGWNK 664
           T ++++  A     Y++  + P +EPYIV  +D P + D RF+G+G NK
Sbjct: 261 TEYAKYYGAAPGEVYEVTRYQPAYEPYIVMKKDGPPWCDERFIGYGGNK 309


>gi|428179611|gb|EKX48481.1| hypothetical protein GUITHDRAFT_105628 [Guillardia theta CCMP2712]
          Length = 953

 Score = 57.4 bits (137), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 50/178 (28%), Positives = 70/178 (39%), Gaps = 64/178 (35%)

Query: 546 YPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGH-GGKKVLVVPAF 604
           +PIN LR ++     T  V   D+D L   GL     S+IR     G  G  ++LV+P+F
Sbjct: 571 FPINMLRKISCAIARTDLVLYADVDMLASDGL----ASNIRQAYQDGKVGPTELLVIPSF 626

Query: 605 ET--------------QRYRTAFPAS---------------------------------- 616
           +T              QR R   PAS                                  
Sbjct: 627 KTSGSWPREAGISLEEQRIRVE-PASISMEALRLNFEGRQVWIPGLDMGMWSTTGRWHEA 685

Query: 617 ---HAPTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPR-------YDTRFVGFGWNK 664
              HA T + RW+ A   YQ+++   +EPY+V +R   R       YD RFV +GW+K
Sbjct: 686 GVFHAGTEYDRWMEAEETYQVKYILGYEPYVVVNRSAWRGSHGVGLYDDRFVHWGWDK 743


>gi|397629521|gb|EJK69396.1| hypothetical protein THAOC_09354 [Thalassiosira oceanica]
          Length = 489

 Score = 57.0 bits (136), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 58/235 (24%), Positives = 94/235 (40%), Gaps = 46/235 (19%)

Query: 475 DGNDV--TLVAQLSMDRLQMVEMLFKHW-EGPISLTLY--MSDAEAQQFLSYAGNSEALR 529
           D  DV  TLV+QLS DRL MV    + W + PIS+ ++   S  + +  L   G  E   
Sbjct: 102 DAADVSYTLVSQLSNDRLWMVRYHCERWGDNPISIVVFSDRSSDDIKTQLIREGCYEPGL 161

Query: 530 SRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMD 589
           + + +        G  YP+N LRN+A++ V T ++   DIDF P   L+ ++ S     +
Sbjct: 162 TVQVVSKSRYDPTGKEYPVNILRNMAISAVKTSHIVYADIDFWPSADLH-HILSKDEMKE 220

Query: 590 MHGHGGKKVLVVPAFE------------------------------TQRYRTAFP----A 615
                 K   ++P F+                               +R  + F      
Sbjct: 221 RFAKDAKLATIIPVFQMFRRCKEYKDCQEKNIPVMPKDKDSLINLIKKREASTFDPTNIG 280

Query: 616 SHAPTNFSRWVNATTPYQIEW----APDFEPYIVAH--RDLPRYDTRFVGFGWNK 664
            H  T +  W + +    ++     +  +EPY+      DLP +   F G+G NK
Sbjct: 281 GHGSTKYITWRDQSPATFVDLPCIKSNRYEPYLTFRYCSDLPPFQEGFTGYGKNK 335


>gi|449474226|ref|XP_002189241.2| PREDICTED: glucoside xylosyltransferase 2 [Taeniopygia guttata]
          Length = 334

 Score = 57.0 bits (136), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 42/172 (24%), Positives = 73/172 (42%), Gaps = 17/172 (9%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L + +   + +DTDV+F   I  +W L       Q   +        +G  + 
Sbjct: 87  RLFLPVILKD-VDSLLYVDTDVLFLRPIDDIWHLLREFNSTQLAAMAPEHEIPKIG-WYS 144

Query: 258 NHKPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWTS 305
                P  G  G N+GV+L++LT++R            ++W      + +K+   + + +
Sbjct: 145 RFARHPYYGTTGLNSGVMLMNLTRIRSARFKNSLIPGGLTWEEMLYPLYQKY---KSYIT 201

Query: 306 LADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
             DQD+ N I   +P  +Y  PCQWN +             E   + I+H N
Sbjct: 202 WGDQDLLNIIFYFNPECLYVFPCQWNYRPDHCMYGSNCRAAEEEGVSILHGN 253


>gi|149022671|gb|EDL79565.1| glycosyltransferase-like 1B, isoform CRA_c [Rattus norvegicus]
          Length = 123

 Score = 57.0 bits (136), Expect = 3e-05,   Method: Composition-based stats.
 Identities = 22/32 (68%), Positives = 29/32 (90%)

Query: 10 PTCEVIQVAIVCAGYNSTRSLVTLIKSILFYR 41
          P CE++ VAIVCAGYNS+R ++TL+KS+LFYR
Sbjct: 62 PKCEMLHVAIVCAGYNSSREIITLMKSVLFYR 93


>gi|338714503|ref|XP_001493752.2| PREDICTED: glucoside xylosyltransferase 2 [Equus caballus]
          Length = 317

 Score = 57.0 bits (136), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 41/172 (23%), Positives = 72/172 (41%), Gaps = 17/172 (9%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L + +   + +DTDV+F   +  +W L  +    Q   +        +G  + 
Sbjct: 70  RLFLPVIL-KDVDSLLYVDTDVLFLRPVDDIWKLLRQFNSTQLAAMAPEHEIPKIG-WYS 127

Query: 258 NHKPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWTS 305
                P  G  G N+GV+L++LT++R            ++W      + +K+     W  
Sbjct: 128 RFARHPFYGSTGLNSGVMLMNLTRIRSAQFKNSMIPTGLAWEDMLYPLYQKYKNAITW-- 185

Query: 306 LADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
             DQD+ N I   +P  +Y  PCQWN +             E   + ++H N
Sbjct: 186 -GDQDLLNIIFYFNPECLYVFPCQWNYRPDHCMYGSNCKEAEREGVSVLHGN 236


>gi|426197653|gb|EKV47580.1| hypothetical protein AGABI2DRAFT_204862 [Agaricus bisporus var.
           bisporus H97]
          Length = 430

 Score = 57.0 bits (136), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 62/260 (23%), Positives = 115/260 (44%), Gaps = 64/260 (24%)

Query: 462 INPNNVDEYEASSDG----NDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQ 517
           + P+ +  Y   + G    +D+T+ + ++ +R +++  L + ++GPIS+TL++++A    
Sbjct: 72  MKPSQIIPYYLRASGVFHKHDITITSIITSNRFEILARLAQRYQGPISVTLHINNATE-- 129

Query: 518 FLSYAGNSEALRSRRN--------IGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDI 569
             +     E+L +  N        +  H+V    +    NT RN+A     T YV +LD+
Sbjct: 130 --NVPHLLESLHNLYNTTPNMAVYVDVHLVLDTFD-RQFNTWRNIARLFARTEYVMMLDV 186

Query: 570 DFLPMFGLYPYLKSSIRSMDMHG------------HGGKKVLVVPAFETQRYRTA----- 612
           DF         L++ ++ +D +G            + G    VVPAFE   Y+       
Sbjct: 187 DFYICTDFRETLQNLLQ-VDGNGEIGEGENIREMFNNGVAAFVVPAFEYINYKEGREYKT 245

Query: 613 FPAS-----------------------HAPTNFSRWVNATTPYQI----EWAPDFEPYIV 645
           FP +                       H  T++ R+ +A  P +I    ++   +EPY +
Sbjct: 246 FPKTKEHLMQLVQQRRIGMFHRVWAPGHNSTDYDRFYSA-KPGEIYKVTQFHSAYEPYTI 304

Query: 646 AHRDLPRY-DTRFVGFGWNK 664
             +D P + D RF+G+G NK
Sbjct: 305 FRKDGPPWCDERFIGYGGNK 324


>gi|301757713|ref|XP_002914707.1| PREDICTED: glucoside xylosyltransferase 2-like [Ailuropoda
           melanoleuca]
          Length = 431

 Score = 57.0 bits (136), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 41/172 (23%), Positives = 72/172 (41%), Gaps = 17/172 (9%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L + +   + +DTDV+F   +  +W L  +    Q   +        +G  + 
Sbjct: 184 RLFLPVIL-KDVDSLLYVDTDVLFLRPVDDIWKLLRQFNATQLAAMAPEHEIPKIG-WYS 241

Query: 258 NHKPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWTS 305
                P  G  G N+GV+L++LT++R            ++W      + +K+     W  
Sbjct: 242 RFARHPFYGSAGVNSGVMLMNLTRIRSAQFKNSMIPTGLAWEDMLYPLYQKYKNAITW-- 299

Query: 306 LADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
             DQD+ N I   +P  +Y  PCQWN +             E   + ++H N
Sbjct: 300 -GDQDLLNIIFYFNPECLYVFPCQWNYRPDHCMYGSNCKAAEREGVSVLHGN 350


>gi|354465598|ref|XP_003495266.1| PREDICTED: glucoside xylosyltransferase 2 [Cricetulus griseus]
          Length = 376

 Score = 57.0 bits (136), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 41/172 (23%), Positives = 73/172 (42%), Gaps = 17/172 (9%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L + +   + +DTDV+F   +  +W L  +    Q   +        +G  + 
Sbjct: 129 RLFLPVILKD-VDSLLYVDTDVLFLRPVDDIWKLLRQFNSTQLAAMAPEHEIPKIG-WYS 186

Query: 258 NHKPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWTS 305
                P  G  G N+GV+L++LT++R+           ++W      + +K+     W  
Sbjct: 187 RFARHPFYGSTGVNSGVMLMNLTRIRNTQFKNSLIPTGLAWEDMLLPLYQKYKNAITW-- 244

Query: 306 LADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
             DQD+ N I   +P  +Y  PCQWN +             E   + ++H N
Sbjct: 245 -GDQDLLNIIFYFNPECLYVFPCQWNYRPDHCMYGSNCKEAEREGVSVLHGN 295


>gi|317418931|emb|CBN80969.1| Glycosyltransferase 8 domain-containing protein 4 [Dicentrarchus
           labrax]
          Length = 448

 Score = 56.6 bits (135), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 42/172 (24%), Positives = 72/172 (41%), Gaps = 17/172 (9%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L + +   + +DTDV+F   +  +W+L       Q   +        +G  + 
Sbjct: 203 RLFLPVIL-KDVDSLLYVDTDVLFLRPMDDIWSLLKSFNSTQLAAMAPEHEVPKIG-WYS 260

Query: 258 NHKPWPALG-RGFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWTS 305
                P  G  G N+GV+L++LT++R            +SW      + +K+     W  
Sbjct: 261 RFARHPFYGVTGVNSGVMLMNLTRIRSTMFKNSMIPSGLSWEDLLHPLYQKYKNHITW-- 318

Query: 306 LADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
             DQD+ N I   +P  ++  PCQWN +             E   + I+H N
Sbjct: 319 -GDQDLLNIIFHYNPECLFIFPCQWNYRPDHCMYGSNCKGAEEEGVSILHGN 369


>gi|348575275|ref|XP_003473415.1| PREDICTED: glucoside xylosyltransferase 2-like [Cavia porcellus]
          Length = 478

 Score = 56.6 bits (135), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 43/176 (24%), Positives = 73/176 (41%), Gaps = 25/176 (14%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQS----DWYLG 253
           +L LP +L + +   + +DTDV+F   +  +W L  +    Q   +          WY  
Sbjct: 231 RLFLPVIL-KDVDSLLYVDTDVLFLRPVDDIWKLLRQFNSTQLAAMAPEHEIPKIGWY-- 287

Query: 254 KLWKNHKPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTR 301
             +  H   P  G  G N+GV+L++LT++R            ++W      + +K+    
Sbjct: 288 SRFARH---PFYGSAGVNSGVMLMNLTRIRSTQFKNSMGPTGLAWEDMLYPLYQKYKNAI 344

Query: 302 LWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
            W    DQD+ N I   +P  +Y  PCQWN +             E   + ++H N
Sbjct: 345 TW---GDQDLLNIIFYFNPECLYVFPCQWNYRPDHCMYGSNCKEAEREGVSVLHGN 397


>gi|109472410|ref|XP_001066466.1| PREDICTED: glucoside xylosyltransferase 2-like [Rattus norvegicus]
 gi|392340057|ref|XP_003753976.1| PREDICTED: glucoside xylosyltransferase 2-like [Rattus norvegicus]
          Length = 444

 Score = 56.6 bits (135), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 43/176 (24%), Positives = 73/176 (41%), Gaps = 25/176 (14%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQS----DWYLG 253
           +L LP +L + +   + +DTDV+F   +  +W L  +    Q   +          WY  
Sbjct: 197 RLFLPVIL-KDVDSLLYVDTDVLFLRPVDDIWKLLRQFNSTQLAAMAPEHEIPKIGWY-- 253

Query: 254 KLWKNHKPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTR 301
             +  H   P  G  G N+GV+L++LT++R            ++W      + +K+    
Sbjct: 254 SRFARH---PFYGSAGVNSGVMLMNLTRIRSTQFKNSLIPTGLAWEEMLLPLYQKYKNAI 310

Query: 302 LWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
            W    DQD+ N I   +P  +Y  PCQWN +             E   + ++H N
Sbjct: 311 TW---GDQDLLNIIFYFNPECLYVFPCQWNYRPDHCMYGSNCKEAEREGVSVLHGN 363


>gi|397610368|gb|EJK60794.1| hypothetical protein THAOC_18795 [Thalassiosira oceanica]
          Length = 565

 Score = 56.6 bits (135), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 47/162 (29%), Positives = 73/162 (45%), Gaps = 18/162 (11%)

Query: 476 GNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYM------SDAEAQQFLSYAGNSEALR 529
           G  VTLV Q S+DR+ ++      W  PI L +Y+        A+    +S   +     
Sbjct: 217 GVSVTLVVQSSIDRIWLLSETCSRWTEPIVLVVYLQWTMLNEKAQKSTAISQIVDVNMAC 276

Query: 530 SRRNIGYHVVYKEGN-----FYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSS 584
            +  +  HV     N      YP+N +RN  L+ V T +V +LD+D +P   L    K +
Sbjct: 277 PQMTVVPHVHDDRDNGEGTTTYPVNIMRNRGLDVVDTSHVLILDVDLIPSGDLSHVAKDN 336

Query: 585 I-RSMDMHGHGGK-----KVLVVPAFETQRYRTAFPASHAPT 620
           +   + M    G      K LVVPAFE + +R   P ++A +
Sbjct: 337 LMDEVSMTETNGDFQVPLKALVVPAFERKVHRDP-PCTNAAS 377


>gi|221194679|ref|ZP_03567736.1| glycosyl transferase family 8 [Atopobium rimae ATCC 49626]
 gi|221185583|gb|EEE17973.1| glycosyl transferase family 8 [Atopobium rimae ATCC 49626]
          Length = 358

 Score = 56.6 bits (135), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 66/245 (26%), Positives = 107/245 (43%), Gaps = 41/245 (16%)

Query: 139 RQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFYLADSVVEDVSWIPNKHYSGVYGLLK 198
           R    I L  + S   +  L +  +  P +G GF   D+ + D+  +P+  +       +
Sbjct: 45  RNYDVIVLTRDLSPTSMITLTRQCQVTPNVGIGFLDVDAALGDIE-LPHHGHFRPETYFR 103

Query: 199 LTLPKVLPETLAKTIVLDTDVIFATDIAQLWA------LFSRLRQRQTIGLVENQSDWYL 252
           L  P +LP  +AK + LD+D++   D+A+L+       L +  R   TIG ++   D  +
Sbjct: 104 LLAPSLLP-NVAKAVYLDSDLVVNHDVAELFDIDVTGYLVAATRDADTIGQIDGY-DSTV 161

Query: 253 GKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFL---LTRLWTSLADQ 309
           G   K+          F  GVIL++L ++R         I  E+FL     R W  L DQ
Sbjct: 162 GPYLKDELGMNDPHDYFQAGVILMNLAEIRT-------SIAPEEFLKVATMRNWRWL-DQ 213

Query: 310 DIFNAIISEHPYLVYTLPCQWN--------------VQLSDNTRSDELCYTEL-TDLKII 354
           D+ N +I+ H YL   +P +WN               Q   + R +   Y E  +DL I+
Sbjct: 214 DVLNRLINGH-YL--RIPMKWNYLADWQFLRRTHIVAQAPKDVREE---YDEARSDLCIV 267

Query: 355 HWNSP 359
           H+  P
Sbjct: 268 HYAGP 272


>gi|335299327|ref|XP_003132373.2| PREDICTED: glucoside xylosyltransferase 2 [Sus scrofa]
          Length = 317

 Score = 56.6 bits (135), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 41/172 (23%), Positives = 72/172 (41%), Gaps = 17/172 (9%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L + +   + +DTDV+F   +  +W L  +    Q   +        +G  + 
Sbjct: 70  RLFLPVIL-KDVDSLLYVDTDVLFLRPVDDIWKLLRQFNSTQLAAMAPEHEIPKIG-WYS 127

Query: 258 NHKPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWTS 305
                P  G  G N+GV+L++LT++R            ++W      + +K+     W  
Sbjct: 128 RFARHPFYGSAGVNSGVMLMNLTRIRSAQFKNSMIPTGLAWEDMLYPLYQKYKNAITW-- 185

Query: 306 LADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
             DQD+ N I   +P  +Y  PCQWN +             E   + ++H N
Sbjct: 186 -GDQDLLNIIFYFNPECLYVFPCQWNYRPDHCMYGSNCREAEQEGVSVLHGN 236


>gi|384248337|gb|EIE21821.1| hypothetical protein COCSUDRAFT_56271 [Coccomyxa subellipsoidea
           C-169]
          Length = 359

 Score = 56.6 bits (135), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 42/133 (31%), Positives = 60/133 (45%), Gaps = 15/133 (11%)

Query: 209 LAKTIVLDTDVIFATDIAQLWALFSRL-RQRQTIGLVEN----QSDWYLGKLWKNHKPWP 263
           L K I +DTD ++  D    W  F  +   R   GL EN       WY+      + P P
Sbjct: 182 LDKVIYVDTDTLWLDDAVFWWTHFKHVGTMRAAFGLAENAGSGNGSWYI------NAPIP 235

Query: 264 ALG-RGFNTGVILLDLTKLRDISW-AGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPY 321
             G RG N G ++  L  +R  ++ A    IIA    L +L   L DQD+ N    +HP 
Sbjct: 236 HHGERGVNAGTMMASLAAIRASNFTAERDAIIANYLPLNQL--PLGDQDVLNIYGHDHPD 293

Query: 322 LVYTLPCQWNVQL 334
            +Y +PC +N + 
Sbjct: 294 QIYVMPCIFNFRF 306


>gi|426249850|ref|XP_004018661.1| PREDICTED: glucoside xylosyltransferase 2 [Ovis aries]
          Length = 454

 Score = 56.6 bits (135), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 43/176 (24%), Positives = 73/176 (41%), Gaps = 25/176 (14%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQS----DWYLG 253
           +L LP +L + +   + +DTDV+F   +  +W L  +    Q   +          WY  
Sbjct: 207 RLFLPVIL-KDVDSLLYVDTDVLFLRPVDDIWKLLRQFNSTQLAAMAPEHEIPKIGWY-- 263

Query: 254 KLWKNHKPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTR 301
             +  H   P  G  G N+GV+L++LT++R            ++W      + +K+    
Sbjct: 264 SRFARH---PFYGSAGVNSGVMLMNLTRIRSAQFKNSMISTGLAWEDMLYPLYQKYKNAI 320

Query: 302 LWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
            W    DQD+ N I   +P  +Y  PCQWN +             E   + ++H N
Sbjct: 321 TW---GDQDLLNIIFYFNPECLYVFPCQWNYRPDHCMYGSNCREAEREGVSVLHGN 373


>gi|334335508|ref|XP_001373334.2| PREDICTED: glucoside xylosyltransferase 2-like [Monodelphis
           domestica]
          Length = 704

 Score = 56.6 bits (135), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 43/176 (24%), Positives = 76/176 (43%), Gaps = 25/176 (14%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQS----DWYLG 253
           +L LP +L + +   + +DTDV+F   I  +W L  +    Q   +          WY  
Sbjct: 456 RLFLPVIL-KDVDSLLYVDTDVLFLRPIDDIWRLLKQFNSTQLAAMAPEHEIPKIGWY-- 512

Query: 254 KLWKNHKPWPALG-RGFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTR 301
             +  H   P  G  G N+GV+L++LT++R+           ++W      + +K+   +
Sbjct: 513 SRFARH---PYYGMTGVNSGVMLMNLTRIRNTQFKNSLIPTGLAWEDMLYPLYQKY---K 566

Query: 302 LWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
            + +  DQD+ N I   +P  +Y  PCQWN +             E   + ++H N
Sbjct: 567 NYITWGDQDLLNIIFYFNPECLYVFPCQWNYRPDHCMYGSNCKGAEQEGVSVLHGN 622


>gi|281353070|gb|EFB28654.1| hypothetical protein PANDA_002618 [Ailuropoda melanoleuca]
          Length = 354

 Score = 56.6 bits (135), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 41/172 (23%), Positives = 72/172 (41%), Gaps = 17/172 (9%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L + +   + +DTDV+F   +  +W L  +    Q   +        +G  + 
Sbjct: 107 RLFLPVIL-KDVDSLLYVDTDVLFLRPVDDIWKLLRQFNATQLAAMAPEHEIPKIG-WYS 164

Query: 258 NHKPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWTS 305
                P  G  G N+GV+L++LT++R            ++W      + +K+     W  
Sbjct: 165 RFARHPFYGSAGVNSGVMLMNLTRIRSAQFKNSMIPTGLAWEDMLYPLYQKYKNAITW-- 222

Query: 306 LADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
             DQD+ N I   +P  +Y  PCQWN +             E   + ++H N
Sbjct: 223 -GDQDLLNIIFYFNPECLYVFPCQWNYRPDHCMYGSNCKAAEREGVSVLHGN 273


>gi|393235330|gb|EJD42886.1| hypothetical protein AURDEDRAFT_152720 [Auricularia delicata
           TFB-10046 SS5]
          Length = 412

 Score = 56.2 bits (134), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 37/130 (28%), Positives = 63/130 (48%), Gaps = 6/130 (4%)

Query: 15  IQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVP--QVE 72
           + + +V  G  S R    ++KS L     PL  H++    A+  L       + P   V 
Sbjct: 58  VVLVLVMFGLASAREGQHMLKSALMRSSRPLDVHIVCSPDAVPFLHARLDLVTRPARDVF 117

Query: 73  VSFY--LADSVVEDV--SWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDI 128
           V FY    D +      + + +KH++G+ GL+K+ L ++LP ++ + + +DTD  FA+D 
Sbjct: 118 VQFYPITPDKIATRAQRAGVSSKHHAGLGGLVKIFLHELLPPSVPRALYVDTDAFFASDP 177

Query: 129 AQLWALFSRL 138
             LW    RL
Sbjct: 178 YLLWQTLLRL 187



 Score = 44.3 bits (103), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 19/51 (37%), Positives = 34/51 (66%)

Query: 185 IPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRL 235
           + +KH++G+ GL+K+ L ++LP ++ + + +DTD  FA+D   LW    RL
Sbjct: 137 VSSKHHAGLGGLVKIFLHELLPPSVPRALYVDTDAFFASDPYLLWQTLLRL 187


>gi|409041947|gb|EKM51432.1| glycosyltransferase family 8 protein [Phanerochaete carnosa
           HHB-10118-sp]
          Length = 459

 Score = 56.2 bits (134), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 43/159 (27%), Positives = 69/159 (43%), Gaps = 12/159 (7%)

Query: 7   PVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTW 66
           P  P+ + +  A++    +S      L+KSI+ Y   P+H ++I D  A   ++   +  
Sbjct: 103 PTAPS-QPVTFALIMWSEDSASEGAILLKSIMMYTSVPVHVYIICDEQAQAHIEKRLALV 161

Query: 67  SVPQ--VEVSFYLAD-----SVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLD 119
             P+  + V FY          VE    I   H +G  GL+KL + ++LP  + + I +D
Sbjct: 162 RRPRHDLHVFFYRLSWQAMLGRVEREGSISTVHAAGTPGLMKLFIHEILPTHVRRAIFVD 221

Query: 120 TDVIFATDIAQLWALFSRLRQRQTIGLVEN----QSDWY 154
           TD  F TD   LW  F        I L  +      +WY
Sbjct: 222 TDAFFTTDPLLLWRQFGTFNATTAIALPSHPEMFAPEWY 260



 Score = 44.7 bits (104), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 45/172 (26%), Positives = 68/172 (39%), Gaps = 27/172 (15%)

Query: 179 VEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQR 238
           VE    I   H +G  GL+KL + ++LP  + + I +DTD  F TD   LW  F      
Sbjct: 184 VEREGSISTVHAAGTPGLMKLFIHEILPTHVRRAIFVDTDAFFTTDPLLLWRQFGTFNAT 243

Query: 239 QTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRD--------------- 283
             I L  +  + +  + +  +K          + V+LLDL + R                
Sbjct: 244 TAIAL-PSHPEMFAPEWYDANK--------ICSCVMLLDLERFRTLRLMDSTHYRTAGLP 294

Query: 284 -ISWAGFWRIIAEKFLLTRLW--TSLADQDIFNAIISEHPYLVYTLPCQWNV 332
            ++ A F  +  E    T  +   +L DQ  + AII   P L   L   W V
Sbjct: 295 ALAPATFRALFGEPDAATGHFADAALGDQTFWWAIIQGRPELFKHLHYDWEV 346


>gi|270001550|gb|EEZ97997.1| hypothetical protein TcasGA2_TC000395 [Tribolium castaneum]
          Length = 432

 Score = 56.2 bits (134), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 65/279 (23%), Positives = 100/279 (35%), Gaps = 92/279 (32%)

Query: 468 DEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISL-TLYMSDAEAQQFLSYAG--N 524
           D + A SD +   L  Q S++++  +  +  +W GPISL T   SD E    L Y     
Sbjct: 92  DHFAALSDVHKTCLATQSSLEKIASIVEVTANWNGPISLATFAASDDELNSLLLYILFLR 151

Query: 525 SEALRSRRNIGYHVVY----------------KEGNF----------------------- 545
               + R  + +H  Y                KE N                        
Sbjct: 152 DCFAKIREQVSFHFAYPKDHRPRNIDIDFDKLKEMNCANPSGVLQQLVKQYLKGNKWRTK 211

Query: 546 --YPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMH----GHGGKKVL 599
             YP N LRN+A     + +VFL D+D +P        K  +  +D+        G    
Sbjct: 212 LPYPQNHLRNLARKNCQSKFVFLTDVDIIPS-------KDMVEKLDLFLGRVKCSGLCAY 264

Query: 600 VVPAFETQRYRTAFPASHA------------------------PTNFSRW---------V 626
           V+P +E    R  FP +                           TNF++W         V
Sbjct: 265 VIPTYEIDE-RVRFPPNKTELIRLANKGLARPFHHKVFIYNQFATNFTKWQSTSNESPEV 323

Query: 627 NATTPY-QIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
           + + P    E+   +EP+ VA   +P +D RFVG+G+ +
Sbjct: 324 HISHPVTNFEFL--YEPFYVAPDTVPPHDERFVGYGYTR 360


>gi|391229413|ref|ZP_10265619.1| LPS:glycosyltransferase [Opitutaceae bacterium TAV1]
 gi|391219074|gb|EIP97494.1| LPS:glycosyltransferase [Opitutaceae bacterium TAV1]
          Length = 323

 Score = 56.2 bits (134), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 43/137 (31%), Positives = 67/137 (48%), Gaps = 15/137 (10%)

Query: 197 LKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLW 256
            +  +P++ PE   K I LD D++   D+ +LW +      +   G+VE   D    K +
Sbjct: 88  FRFLVPQIRPE-FRKAIYLDCDLVVQDDLKKLWDVEP---GQNYAGVVE---DLLPRKDY 140

Query: 257 KNHKPWPALGRGFNTGVILLDLTKLR-DISWAGFWRIIAEKFLLTRLWTSLADQDIFNAI 315
           ++HK    + R FN G++LL+L ++R D S   F  I  +     R W   ADQD+ N  
Sbjct: 141 RSHKARIGIRRYFNAGMLLLNLEQIRQDFSVETFLSIERK----NRAWFLFADQDVLNFA 196

Query: 316 ISEHPYLVYTLPCQWNV 332
            +     V  LP +WNV
Sbjct: 197 FANR---VIYLPLRWNV 210


>gi|359078230|ref|XP_002697017.2| PREDICTED: glucoside xylosyltransferase 2 [Bos taurus]
          Length = 317

 Score = 56.2 bits (134), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 41/172 (23%), Positives = 71/172 (41%), Gaps = 17/172 (9%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L   +   + +DTDV+F   +  +W L  +    Q   +        +G  + 
Sbjct: 70  RLFLPVIL-RDVDSLLYVDTDVLFLRPVDDIWKLLRQFNSTQLAAMAPEHEIPKIG-WYS 127

Query: 258 NHKPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWTS 305
                P  G  G N+GV+L++LT++R            ++W      + +K+     W  
Sbjct: 128 RFARHPFYGSAGVNSGVMLMNLTRIRSAQFKNSMISTGLAWEDMLYPLYQKYKNAITW-- 185

Query: 306 LADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
             DQD+ N I   +P  +Y  PCQWN +             E   + ++H N
Sbjct: 186 -GDQDLLNIIFYFNPECLYVFPCQWNYRPDHCMYGSNCREAEREGVSVLHGN 236


>gi|363738785|ref|XP_414430.2| PREDICTED: glucoside xylosyltransferase 2 [Gallus gallus]
          Length = 451

 Score = 56.2 bits (134), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 41/172 (23%), Positives = 74/172 (43%), Gaps = 17/172 (9%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L + +   + +DTDV+F   I  +W +       Q   +        +G  + 
Sbjct: 204 RLFLPVILKD-VDSLLYVDTDVLFLRPIDDIWHILKEFNSTQLAAMAPEHEIPKIG-WYS 261

Query: 258 NHKPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWTS 305
                P  G  G N+GV+L++LT++R+           ++W      + +K+   + + +
Sbjct: 262 RFARHPYYGTTGVNSGVMLMNLTRIRNTQFRNSMIPSGLTWEEMLYPLYQKY---KNYIT 318

Query: 306 LADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
             DQD+ N I   +P  +Y  PCQWN +             E   + I+H N
Sbjct: 319 WGDQDLLNIIFYFNPECLYVFPCQWNYRPDHCMYGSNCKGAEEEGVSILHGN 370


>gi|449283522|gb|EMC90138.1| Glycosyltransferase 8 domain-containing protein 4, partial [Columba
           livia]
          Length = 350

 Score = 56.2 bits (134), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 41/172 (23%), Positives = 74/172 (43%), Gaps = 17/172 (9%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L + +   + +DTDV+F   I  +W +       Q   +        +G  + 
Sbjct: 103 RLFLPVILKD-VDSLLYVDTDVLFLRPIDDIWHILKEFNSTQLAAMAPEHEIPKIG-WYS 160

Query: 258 NHKPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWTS 305
                P  G  G N+GV+L++LT++R+           ++W      + +K+   + + +
Sbjct: 161 RFARHPYYGTTGVNSGVMLMNLTRIRNTQFKNSIIPSGLTWEEMLYPLYQKY---KNYIT 217

Query: 306 LADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
             DQD+ N I   +P  +Y  PCQWN +             E   + I+H N
Sbjct: 218 WGDQDLLNIIFYFNPECLYVFPCQWNYRPDHCMYGSNCKGAEEEGVSILHGN 269


>gi|384248338|gb|EIE21822.1| hypothetical protein COCSUDRAFT_56272 [Coccomyxa subellipsoidea
           C-169]
          Length = 275

 Score = 56.2 bits (134), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 40/130 (30%), Positives = 59/130 (45%), Gaps = 8/130 (6%)

Query: 207 ETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQT-IGLVENQSDWYLGKLWKNHKPWPAL 265
           + L K I +DTD ++  D    W  F+ ++ +    G+ E  +    G  W  +   P  
Sbjct: 151 QDLDKVIYVDTDTLWLDDPVWWWTHFAHMKTKHAAFGMAEESNS---GGSWYTNAQIPHH 207

Query: 266 G-RGFNTGVILLDLTKLRDISW-AGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLV 323
           G RG N GV++  L  +R  ++ A    IIA    L  L   L DQDI N     HP   
Sbjct: 208 GARGLNAGVLMASLATVRASNFTAEREAIIAHYLPLGHL--PLGDQDILNIYGHNHPEQF 265

Query: 324 YTLPCQWNVQ 333
           Y +PC +N +
Sbjct: 266 YAMPCMFNFR 275


>gi|296475003|tpg|DAA17118.1| TPA: hypothetical protein BOS_20881 [Bos taurus]
          Length = 354

 Score = 56.2 bits (134), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 41/172 (23%), Positives = 71/172 (41%), Gaps = 17/172 (9%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L   +   + +DTDV+F   +  +W L  +    Q   +        +G  + 
Sbjct: 107 RLFLPVIL-RDVDSLLYVDTDVLFLRPVDDIWKLLRQFNSTQLAAMAPEHEIPKIG-WYS 164

Query: 258 NHKPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWTS 305
                P  G  G N+GV+L++LT++R            ++W      + +K+     W  
Sbjct: 165 RFARHPFYGSAGVNSGVMLMNLTRIRSAQFKNSMISTGLAWEDMLYPLYQKYKNAITW-- 222

Query: 306 LADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
             DQD+ N I   +P  +Y  PCQWN +             E   + ++H N
Sbjct: 223 -GDQDLLNIIFYFNPECLYVFPCQWNYRPDHCMYGSNCREAEREGVSVLHGN 273


>gi|345786246|ref|XP_853462.2| PREDICTED: glucoside xylosyltransferase 2 [Canis lupus familiaris]
          Length = 317

 Score = 56.2 bits (134), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 41/172 (23%), Positives = 72/172 (41%), Gaps = 17/172 (9%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L + +   + +DTDV+F   +  +W L  +    Q   +        +G  + 
Sbjct: 70  RLFLPVIL-KDVDSLLYVDTDVLFLRPVDDIWKLLRQFNATQLAAMAPEHEIPKIG-WYS 127

Query: 258 NHKPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWTS 305
                P  G  G N+GV+L++LT++R            ++W      + +K+     W  
Sbjct: 128 RFARHPFYGSAGVNSGVMLMNLTRIRSAQFKNSMIPTGLAWEDMLYPLYQKYKNAITW-- 185

Query: 306 LADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
             DQD+ N I   +P  +Y  PCQWN +             E   + ++H N
Sbjct: 186 -GDQDLLNIIFYFNPEYLYVFPCQWNYRPDHCMYGSNCKEAEREGVSVLHGN 236


>gi|410951581|ref|XP_003982473.1| PREDICTED: glucoside xylosyltransferase 2 [Felis catus]
          Length = 433

 Score = 56.2 bits (134), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 41/172 (23%), Positives = 72/172 (41%), Gaps = 17/172 (9%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L + +   + +DTDV+F   +  +W L  +    Q   +        +G  + 
Sbjct: 186 RLFLPVIL-KDVDSLLYVDTDVLFLRPVDDIWKLLRQFNATQLAAMAPEHEIPKIG-WYS 243

Query: 258 NHKPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWTS 305
                P  G  G N+GV+L++LT++R            ++W      + +K+     W  
Sbjct: 244 RFARHPFYGSAGVNSGVMLMNLTRIRSAQFKNSMIPTGLAWEDMLYPLYQKYKNAITW-- 301

Query: 306 LADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
             DQD+ N I   +P  +Y  PCQWN +             E   + ++H N
Sbjct: 302 -GDQDLLNIIFYFNPECLYVFPCQWNYRPDHCMYGSNCKEAEREGVSVLHGN 352


>gi|167518187|ref|XP_001743434.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163778533|gb|EDQ92148.1| predicted protein [Monosiga brevicollis MX1]
          Length = 600

 Score = 55.8 bits (133), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 38/127 (29%), Positives = 58/127 (45%), Gaps = 21/127 (16%)

Query: 477 NDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEAL-RSRRN-- 533
           +D+TLV Q + DRL +  ML   W GP      + D +           EA+ R+ R+  
Sbjct: 223 DDITLVTQATFDRLDVFGMLANAWPGPKVAVFAVFDFDEASHAKAQVQIEAIIRTARDWT 282

Query: 534 ---IGYHVVYKEGNFY---------------PINTLRNVALNQVSTPYVFLLDIDFLPMF 575
              I  + V  + ++Y               PIN  RN+A++   T +VF  D+DF+P  
Sbjct: 283 NVKILVYPVTTKDDYYTDRYRRHDPASEPQLPINVFRNLAVDHARTNFVFTCDMDFVPSS 342

Query: 576 GLYPYLK 582
            LYP L+
Sbjct: 343 TLYPKLR 349


>gi|432119874|gb|ELK38654.1| Glucoside xylosyltransferase 2 [Myotis davidii]
          Length = 392

 Score = 55.8 bits (133), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 37/157 (23%), Positives = 64/157 (40%), Gaps = 16/157 (10%)

Query: 213 IVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGR-GFNT 271
           + +DTDV+F   +  +W L  +    Q   +        +G  +      P  G  G N+
Sbjct: 159 LYVDTDVLFLRPVDDIWKLLRQFNSTQLAAMAPEHEIPKIG-WYSRFAQHPFYGSAGVNS 217

Query: 272 GVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHP 320
           GV+L++LT++R            ++W      + +K+     W    DQD+ N I   +P
Sbjct: 218 GVMLMNLTRIRSAQFKNSMIPTGLAWEDMLYPLYQKYKNAITW---GDQDLLNIIFYFNP 274

Query: 321 YLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
             +Y  PC WN +             EL  + ++H N
Sbjct: 275 ERLYVFPCHWNYRPDHCMYGSNCKRAELEGVSVLHGN 311


>gi|344248898|gb|EGW05002.1| Glycosyltransferase 8 domain-containing protein 4 [Cricetulus
           griseus]
          Length = 539

 Score = 55.8 bits (133), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 42/174 (24%), Positives = 74/174 (42%), Gaps = 21/174 (12%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLG--KL 255
           +L LP +L + +   + +DTDV+F   +  +W L  +    Q   +        +G    
Sbjct: 70  RLFLPVIL-KDVDSLLYVDTDVLFLRPVDDIWKLLRQFNSTQLAAMAPEHEIPKIGWYSR 128

Query: 256 WKNHKPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLW 303
           +  H   P  G  G N+GV+L++LT++R+           ++W      + +K+     W
Sbjct: 129 FARH---PFYGSTGVNSGVMLMNLTRIRNTQFKNSLIPTGLAWEDMLLPLYQKYKNAITW 185

Query: 304 TSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
               DQD+ N I   +P  +Y  PCQWN +             E   + ++H N
Sbjct: 186 ---GDQDLLNIIFYFNPECLYVFPCQWNYRPDHCMYGSNCKEAEREGVSVLHGN 236


>gi|440891388|gb|ELR45103.1| Glucoside xylosyltransferase 2, partial [Bos grunniens mutus]
          Length = 354

 Score = 55.8 bits (133), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 41/172 (23%), Positives = 71/172 (41%), Gaps = 17/172 (9%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L   +   + +DTDV+F   +  +W L  +    Q   +        +G  + 
Sbjct: 107 RLFLPVIL-RDVDSLLYVDTDVLFLRPVDDIWKLLRQFNSTQLAAMAPEHEIPKIG-WYS 164

Query: 258 NHKPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWTS 305
                P  G  G N+GV+L++LT++R            ++W      + +K+     W  
Sbjct: 165 RFARHPFYGSAGVNSGVMLMNLTRIRSAQFKNSMISTGLAWEDMLYPLYQKYKNAITW-- 222

Query: 306 LADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
             DQD+ N I   +P  +Y  PCQWN +             E   + ++H N
Sbjct: 223 -GDQDLLNIIFYFNPECLYVFPCQWNYRPDHCMYGSNCREAEREGVSVLHGN 273


>gi|326928149|ref|XP_003210244.1| PREDICTED: glucoside xylosyltransferase 2-like [Meleagris
           gallopavo]
          Length = 463

 Score = 55.8 bits (133), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 41/172 (23%), Positives = 74/172 (43%), Gaps = 17/172 (9%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L + +   + +DTDV+F   I  +W +       Q   +        +G  + 
Sbjct: 216 RLFLPVIL-KDVDSLLYVDTDVLFLRPIDDIWHILKEFNSTQLAAMAPEHEIPKIG-WYS 273

Query: 258 NHKPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWTS 305
                P  G  G N+GV+L++LT++R+           ++W      + +K+   + + +
Sbjct: 274 RFARHPYYGTTGVNSGVMLMNLTRIRNTQFRNSVIPSGLTWEEMLYPLYQKY---KNYIT 330

Query: 306 LADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
             DQD+ N I   +P  +Y  PCQWN +             E   + I+H N
Sbjct: 331 WGDQDLLNIIFYFNPECLYVFPCQWNYRPDHCMYGSNCKGAEEEGVSILHGN 382


>gi|355694022|gb|AER99529.1| glucoside xylosyltransferase 2 [Mustela putorius furo]
          Length = 334

 Score = 55.8 bits (133), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 41/172 (23%), Positives = 72/172 (41%), Gaps = 17/172 (9%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L + +   + +DTDV+F   +  +W L  +    Q   +        +G  + 
Sbjct: 88  RLFLPVILKD-VDSLLYVDTDVLFLRPVDDIWKLLRQFNATQLAAMAPEHEIPKIG-WYS 145

Query: 258 NHKPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWTS 305
                P  G  G N+GV+L++LT++R            ++W      + +K+     W  
Sbjct: 146 RFARHPFYGSAGVNSGVMLMNLTRIRSTQFKNSMIPTGLAWEDMLYPLYQKYKNAITW-- 203

Query: 306 LADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
             DQD+ N I   +P  +Y  PCQWN +             E   + ++H N
Sbjct: 204 -GDQDLLNIIFYFNPECLYVFPCQWNYRPDHCMYGSNCKEAEREGVSVLHGN 254


>gi|308494925|ref|XP_003109651.1| hypothetical protein CRE_07260 [Caenorhabditis remanei]
 gi|308245841|gb|EFO89793.1| hypothetical protein CRE_07260 [Caenorhabditis remanei]
          Length = 164

 Score = 55.8 bits (133), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 32/108 (29%), Positives = 51/108 (47%), Gaps = 1/108 (0%)

Query: 9   IPTCEVIQVAIVCAGYNSTRSL-VTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWS 67
            P  + + +A V  G   TR + +   KS+L Y  +    + I D      +  L ++W+
Sbjct: 55  FPEEDHVSLAYVLGGNFMTRLMFMQHFKSVLNYSDHYFRLYFIVDDENREDVNDLMTSWN 114

Query: 68  VPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKT 115
           +   E  FY        V WIPN HYS  YGL KL +P+++ + + K 
Sbjct: 115 ITNCEWFFYNLTQYDSRVKWIPNSHYSKYYGLSKLLIPEIISDNVGKV 162



 Score = 39.7 bits (91), Expect = 4.6,   Method: Compositional matrix adjust.
 Identities = 15/31 (48%), Positives = 21/31 (67%)

Query: 182 VSWIPNKHYSGVYGLLKLTLPKVLPETLAKT 212
           V WIPN HYS  YGL KL +P+++ + + K 
Sbjct: 132 VKWIPNSHYSKYYGLSKLLIPEIISDNVGKV 162


>gi|327266150|ref|XP_003217869.1| PREDICTED: glucoside xylosyltransferase 2-like [Anolis
           carolinensis]
          Length = 334

 Score = 55.8 bits (133), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 41/172 (23%), Positives = 74/172 (43%), Gaps = 17/172 (9%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L + +   + +DTDV+F   I  +W   +     Q   +        +G  + 
Sbjct: 87  RLFLPMILKD-VDSLLYVDTDVLFLRPIDDIWRFLTAFNSTQLAAMAPEHEIPKIG-WYS 144

Query: 258 NHKPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWTS 305
                P  G  G N+GV+L++LT++R+           ++W      + +K+   + + +
Sbjct: 145 RFARHPYYGTTGVNSGVMLMNLTRIRNAQFKNSMIPTGLTWEEMLYPLYQKY---KNYIT 201

Query: 306 LADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
             DQD+ N I   +P  +Y  PCQWN +             E   + I+H N
Sbjct: 202 WGDQDLLNIIFYFNPECLYLFPCQWNYRPDHCMYGSNCKGAEEEGISILHGN 253


>gi|158285361|ref|XP_308268.3| AGAP007604-PA [Anopheles gambiae str. PEST]
 gi|157019955|gb|EAA03928.3| AGAP007604-PA [Anopheles gambiae str. PEST]
          Length = 416

 Score = 55.8 bits (133), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 48/198 (24%), Positives = 78/198 (39%), Gaps = 50/198 (25%)

Query: 183 SWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIG 242
           ++    +YS    LL L L +++   + + I++D DV+F   + +L+  F +    Q  G
Sbjct: 187 NYRSGSYYSDALFLLSLGLHRIVDRNMRRAILIDCDVVFRASVKELFDQFDQFAPDQLFG 246

Query: 243 LVEN---------------------QSDWYLGKLWKNHKPWPALGR-------------- 267
           L                         S +YL K+       P LG+              
Sbjct: 247 LAPELTPVYRHVLSRYRMNNPQTSFGSPYYLDKVPAAKDMQP-LGKDVPRGKKAPASEPR 305

Query: 268 -----GFNTGVILLDLTKLR------DISWAGFWRIIAEKFLLTRLWTSLADQDIFNAII 316
                G N+GV++L L ++R      +I      + +AEK+        L DQD +  + 
Sbjct: 306 HHGYPGLNSGVVMLHLDRIRRSRIYEEIIKESTVKNMAEKY---SFQGHLGDQDFYTLMG 362

Query: 317 SEHPYLVYTLPCQWNVQL 334
            E P L+Y L C WN QL
Sbjct: 363 FEFPGLIYRLDCVWNRQL 380



 Score = 45.8 bits (107), Expect = 0.079,   Method: Compositional matrix adjust.
 Identities = 44/203 (21%), Positives = 79/203 (38%), Gaps = 34/203 (16%)

Query: 6   EPVIPTCEVIQVAIVCAGYNSTRSLVTLI------------------KSILFYRKNPLHF 47
           +P     ++ +V  V  G N T   + LI                  +S+L Y    LH 
Sbjct: 75  DPAAAASKLKRVKYVSKGTNDTEYNIFLIYTKESQNMILHSQLELFLRSLLKYSTIELHL 134

Query: 48  HLITDTVALNILQTLFSTWSVPQVEVSFY-------LADSVVEDV-------SWIPNKHY 93
           H+ITD  +    + L           +FY        A+ + + V       ++    +Y
Sbjct: 135 HVITDEQSERSAEELIKQQIERFHRTAFYTLYDVRDCAEKISDIVHGMMPLFNYRSGSYY 194

Query: 94  SGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDW 153
           S    LL L L +++   + + I++D DV+F   + +L+  F +    Q  GL    +  
Sbjct: 195 SDALFLLSLGLHRIVDRNMRRAILIDCDVVFRASVKELFDQFDQFAPDQLFGLAPELTPV 254

Query: 154 YLGKL--WKNHKPWPALGRGFYL 174
           Y   L  ++ + P  + G  +YL
Sbjct: 255 YRHVLSRYRMNNPQTSFGSPYYL 277


>gi|443711810|gb|ELU05398.1| hypothetical protein CAPTEDRAFT_177574 [Capitella teleta]
          Length = 221

 Score = 55.5 bits (132), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 25/53 (47%), Positives = 34/53 (64%), Gaps = 1/53 (1%)

Query: 613 FPASHAPTNFSRWVNATTPYQIEWAPDFEPYIVAH-RDLPRYDTRFVGFGWNK 664
           FPA+H PTNF++W  A+ PY+I + P FEPY+V     +P Y   F+G   NK
Sbjct: 78  FPAAHMPTNFTKWRTASIPYRIHYGPYFEPYVVVSVAAIPPYVEEFLGRQMNK 130


>gi|296225683|ref|XP_002758628.1| PREDICTED: glucoside xylosyltransferase 2 [Callithrix jacchus]
          Length = 443

 Score = 55.5 bits (132), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 44/176 (25%), Positives = 72/176 (40%), Gaps = 25/176 (14%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQS----DWYLG 253
           +L LP +L + +   + +DTDV+F   +  +W L       Q   +          WY  
Sbjct: 196 RLFLPVIL-KDVDSLLYVDTDVLFLRPVDDIWKLLRLFNSTQLAAMAPEHEIPKIGWY-- 252

Query: 254 KLWKNHKPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTR 301
             +  H   P  G  G N+GV+L++LT++R            ++W      + +K+    
Sbjct: 253 SRFARH---PFYGSAGVNSGVMLMNLTRIRSTQFKNSMIPTGLAWEDMLYPLYQKYKNAI 309

Query: 302 LWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
            W    DQD+ N I   +P  +Y  PCQWN +             E   + I+H N
Sbjct: 310 TW---GDQDLLNIIFYFNPECLYVFPCQWNYRPDHCMYGSNCREAEHEGVSILHGN 362


>gi|402859592|ref|XP_003894235.1| PREDICTED: glucoside xylosyltransferase 2, partial [Papio anubis]
          Length = 405

 Score = 55.5 bits (132), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 43/176 (24%), Positives = 72/176 (40%), Gaps = 25/176 (14%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQS----DWYLG 253
           +L LP +L + +   + +DTDV+F   +  +W L       Q   +          WY  
Sbjct: 158 RLFLPVIL-KDVDSLLYVDTDVLFLRPVDDIWKLLRLFNSSQLAAMAPEHEIPKIGWY-- 214

Query: 254 KLWKNHKPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTR 301
             +  H   P  G  G N+GV+L++LT++R            ++W      + +K+    
Sbjct: 215 SRFARH---PFYGSAGVNSGVMLMNLTRIRSTQFKNSMIPTGLAWEDMLYPLYQKYKNAI 271

Query: 302 LWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
            W    DQD+ N I   +P  +Y  PCQWN +             E   + ++H N
Sbjct: 272 TW---GDQDLLNIIFYFNPECLYVFPCQWNYRPDHCMYGSNCREAEREGVSVLHGN 324


>gi|390605133|gb|EIN14524.1| hypothetical protein PUNSTDRAFT_140795 [Punctularia strigosozonata
           HHB-11173 SS5]
          Length = 519

 Score = 55.5 bits (132), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 43/162 (26%), Positives = 75/162 (46%), Gaps = 11/162 (6%)

Query: 3   SDEEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTL 62
           S E+   P  + I  +++  G  + R    ++KSIL +   P+ FH++    A+ +L++ 
Sbjct: 42  STEQAGGPDPDTIVFSLLMLGATTAREGSQMMKSILMHATRPVDFHIVCTPDAVPVLESP 101

Query: 63  FSTWSVPQ--VEVSFYLADS--VVEDV--SWIPNKHYSGVYGLLKLTLPKVLPETLAKTI 116
            S    P   + V  Y+ D     E V  S + N +   +Y L+KL L +++P    K I
Sbjct: 102 LSLIHNPAHGIRVRLYVVDEQYAFERVRRSGVGNPN---MYQLVKLFLHEIIPAE--KAI 156

Query: 117 VLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKL 158
            +DTD +F+ D A LW  F R        +   +   + G +
Sbjct: 157 YVDTDALFSYDPAALWREFERFTPTTIAAITAKEEAGHAGSI 198


>gi|26343315|dbj|BAC35314.1| unnamed protein product [Mus musculus]
 gi|148695647|gb|EDL27594.1| glycosyltransferase-like 1B, isoform CRA_b [Mus musculus]
          Length = 118

 Score = 55.5 bits (132), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 24/40 (60%), Positives = 32/40 (80%)

Query: 2  SSDEEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYR 41
          SS +  + P CE++ VAIVCAGYNS+R ++TL KS+LFYR
Sbjct: 55 SSPQLLLPPKCEMLHVAIVCAGYNSSREIITLTKSLLFYR 94


>gi|348510584|ref|XP_003442825.1| PREDICTED: glucoside xylosyltransferase 2-like [Oreochromis
           niloticus]
          Length = 450

 Score = 55.1 bits (131), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 42/172 (24%), Positives = 72/172 (41%), Gaps = 17/172 (9%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L + +   + +DTDV+F   +  +W L     + Q   +        +G  + 
Sbjct: 205 RLFLPVIL-KDVDSLLYVDTDVLFLRPMDDIWNLLKSFNRTQLAAMAPEHEVPKIG-WYS 262

Query: 258 NHKPWPALG-RGFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWTS 305
                P  G  G N+GV+L++LT++R            +SW      + +K+     W  
Sbjct: 263 RFARHPFYGVTGVNSGVMLMNLTRIRSTLFKNSMIPTGLSWEDLLHPLYQKYKNHITW-- 320

Query: 306 LADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
             DQD+ N I   +P  ++  PCQWN +             E   + I+H N
Sbjct: 321 -GDQDLLNIIFHYNPERLFIFPCQWNYRPDHCMYGSNCKGAEEEGVSILHGN 371


>gi|332231527|ref|XP_003264947.1| PREDICTED: glucoside xylosyltransferase 2 [Nomascus leucogenys]
          Length = 443

 Score = 55.1 bits (131), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 43/176 (24%), Positives = 72/176 (40%), Gaps = 25/176 (14%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQS----DWYLG 253
           +L LP +L + +   + +DTDV+F   +  +W L       Q   +          WY  
Sbjct: 196 RLFLPVIL-KDVDSLLYVDTDVLFLRPVDDIWKLLRLFNSTQLAAMAPEHEIPKIGWY-- 252

Query: 254 KLWKNHKPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTR 301
             +  H   P  G  G N+GV+L++LT++R            ++W      + +K+    
Sbjct: 253 SRFARH---PFYGSAGVNSGVMLMNLTRIRSTQFKNSMIPTGLAWEDMLYPLYQKYKNAI 309

Query: 302 LWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
            W    DQD+ N I   +P  +Y  PCQWN +             E   + ++H N
Sbjct: 310 TW---GDQDLLNIIFYFNPECLYVFPCQWNYRPDHCMYGSNCREAEHEGVSVLHGN 362


>gi|122937187|ref|NP_001073862.1| glucoside xylosyltransferase 2 precursor [Homo sapiens]
 gi|152125899|sp|A0PJZ3.2|GXLT2_HUMAN RecName: Full=Glucoside xylosyltransferase 2; AltName:
           Full=Glycosyltransferase 8 domain-containing protein 4
          Length = 443

 Score = 55.1 bits (131), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 43/176 (24%), Positives = 72/176 (40%), Gaps = 25/176 (14%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQS----DWYLG 253
           +L LP +L + +   + +DTDV+F   +  +W L       Q   +          WY  
Sbjct: 196 RLFLPVIL-KDVDSLLYVDTDVLFLRPVDDIWKLLRLFNSTQLAAMAPEHEIPKIGWY-- 252

Query: 254 KLWKNHKPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTR 301
             +  H   P  G  G N+GV+L++LT++R            ++W      + +K+    
Sbjct: 253 SRFARH---PFYGSAGVNSGVMLMNLTRIRSTQFKNSMIPTGLAWEDMLYPLYQKYKNAI 309

Query: 302 LWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
            W    DQD+ N I   +P  +Y  PCQWN +             E   + ++H N
Sbjct: 310 TW---GDQDLLNIIFYFNPECLYVFPCQWNYRPDHCMYGSNCREAEHEGVSVLHGN 362


>gi|194677265|ref|XP_583887.4| PREDICTED: glucoside xylosyltransferase 2 [Bos taurus]
          Length = 251

 Score = 55.1 bits (131), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 38/167 (22%), Positives = 68/167 (40%), Gaps = 16/167 (9%)

Query: 203 KVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPW 262
           +V+   +   + +DTDV+F   +  +W L  +    Q   +        +G  +      
Sbjct: 8   RVILRDVDSLLYVDTDVLFLRPVDDIWKLLRQFNSTQLAAMAPEHEIPKIG-WYSRFARH 66

Query: 263 PALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWTSLADQD 310
           P  G  G N+GV+L++LT++R            ++W      + +K+     W    DQD
Sbjct: 67  PFYGSAGVNSGVMLMNLTRIRSAQFKNSMISTGLAWEDMLYPLYQKYKNAITW---GDQD 123

Query: 311 IFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
           + N I   +P  +Y  PCQWN +             E   + ++H N
Sbjct: 124 LLNIIFYFNPECLYVFPCQWNYRPDHCMYGSNCREAEREGVSVLHGN 170


>gi|389741822|gb|EIM83010.1| hypothetical protein STEHIDRAFT_84404, partial [Stereum hirsutum
           FP-91666 SS1]
          Length = 340

 Score = 55.1 bits (131), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 60/228 (26%), Positives = 98/228 (42%), Gaps = 60/228 (26%)

Query: 488 DRLQMVEMLFKHWEGPISLTLYM----------SDAEAQQFLSYAGNSEALRSRRNIGYH 537
           +RLQ++E L   ++GP+S+TL++           ++      S   +S AL +R +I  H
Sbjct: 31  NRLQVLEKLAARYQGPLSVTLHLYLHPNGSLSSLNSSILSLHSLIRSSPALAARADI--H 88

Query: 538 VVYKEGNFYP--INTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHG--- 592
           +V      +P   NT RN+A     T Y+ LLD+D    F +    +  +R+M   G   
Sbjct: 89  LVVSP---FPRAFNTWRNIARFLARTSYILLLDVD----FAMCTAWRDPVRAMLQSGGEM 141

Query: 593 ----HGGKKVLVVPAFETQRYRTA-----FPA-----------------------SHAPT 620
                 G+  LV+PAFE  R R       FP+                        H  T
Sbjct: 142 AKRMREGRAALVLPAFEYARARDGKDQMLFPSDKQALLEQVHSHKILPFHASWAQGHNST 201

Query: 621 NFSRWVNATTP--YQI-EWAPDFEPYIVAHRDLPRY-DTRFVGFGWNK 664
           ++ R+        Y++  +   +EPY++  +D   + D RF G+G NK
Sbjct: 202 DYDRFYRTQLGEIYRVARYQSAYEPYVIVRKDAGGWCDERFNGYGANK 249


>gi|426341237|ref|XP_004035955.1| PREDICTED: glucoside xylosyltransferase 2, partial [Gorilla gorilla
           gorilla]
          Length = 403

 Score = 55.1 bits (131), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 43/176 (24%), Positives = 72/176 (40%), Gaps = 25/176 (14%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQS----DWYLG 253
           +L LP +L + +   + +DTDV+F   +  +W L       Q   +          WY  
Sbjct: 156 RLFLPVIL-KDVDSLLYVDTDVLFLRPVDDIWKLLRLFNSTQLAAMAPEHEIPKIGWY-- 212

Query: 254 KLWKNHKPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTR 301
             +  H   P  G  G N+GV+L++LT++R            ++W      + +K+    
Sbjct: 213 SRFARH---PFYGSAGVNSGVMLMNLTRIRSTQFKNSMIPTGLAWEDMLYPLYQKYKNAI 269

Query: 302 LWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
            W    DQD+ N I   +P  +Y  PCQWN +             E   + ++H N
Sbjct: 270 TW---GDQDLLNIIFYFNPECLYVFPCQWNYRPDHCMYGSNCREAEHEGVSVLHGN 322


>gi|403297281|ref|XP_003939503.1| PREDICTED: glucoside xylosyltransferase 2 [Saimiri boliviensis
           boliviensis]
          Length = 317

 Score = 55.1 bits (131), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 42/172 (24%), Positives = 71/172 (41%), Gaps = 17/172 (9%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L + +   + +DTDV+F   +  +W L       Q   +        +G  + 
Sbjct: 70  RLFLPVILKD-VDSLLYVDTDVLFLRPVDDIWKLLRLFNSTQLAAMAPEHEIPKIG-WYS 127

Query: 258 NHKPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWTS 305
                P  G  G N+GV+L++LT++R            ++W      + +K+     W  
Sbjct: 128 RFARHPFYGSAGVNSGVMLMNLTRIRSTQFKNSMIPTGLAWEDMLYPLYQKYKNAITW-- 185

Query: 306 LADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
             DQD+ N I   +P  +Y  PCQWN +             E   + I+H N
Sbjct: 186 -GDQDLLNIIFYFNPECLYIFPCQWNYRPDHCMYGSNCREAEHEGVSILHGN 236


>gi|358057892|dbj|GAA96137.1| hypothetical protein E5Q_02798 [Mixia osmundae IAM 14324]
          Length = 443

 Score = 55.1 bits (131), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 66/260 (25%), Positives = 102/260 (39%), Gaps = 63/260 (24%)

Query: 462 INPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEM-LFKHWEGPISLTLYMSDAEAQQFLS 520
           + P  V  YEA      VTLV+ ++MDRL+   + +   W G IS+++Y++    +  L 
Sbjct: 65  VMPALVPTYEAPR----VTLVSHMTMDRLKAYGVSVVSRWAGSISISVYLTQQTDELDLV 120

Query: 521 YAGNSEALRSRRNIGYHVVY-----KEGNF-----YPINTLRNVALNQVSTPYVFLLDID 570
               +E   +  N+   +V       E +      YPIN LRN+A++     YV + D D
Sbjct: 121 EFFRNEQKTAWENVTITIVRPSYAPDEASLLHRLRYPINRLRNLAISVAPAAYVLMTDAD 180

Query: 571 FLPMFGLY--------PYLKSSIRSMDMHGHGG----KKVLVVPAFETQRYRTAFPAS-- 616
           F+P   ++        P L S I +     H      K  +V+P F      T F +S  
Sbjct: 181 FVPSPNMHEIITRHGLPLLDSKITAWPS-AHDPRMIIKHAIVIPCFVASPGATQFASSQD 239

Query: 617 ---------------------HAPTNFSRWVNATT---------PYQIEWAPDFEPY--I 644
                                H PT   R +              Y + + P++EPY  +
Sbjct: 240 ELKAHLGARPALLSLTDPTIGHGPTAPHRLIKTQAYEDSPISAWSYSVGFEPEWEPYYLL 299

Query: 645 VAHRDLPRYDTRFVGFGWNK 664
             H  L  YD RF   G +K
Sbjct: 300 EKHSHL-LYDERFTDQGGDK 318


>gi|344276496|ref|XP_003410044.1| PREDICTED: glucoside xylosyltransferase 2-like [Loxodonta africana]
          Length = 442

 Score = 55.1 bits (131), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 41/175 (23%), Positives = 71/175 (40%), Gaps = 23/175 (13%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQS----DWYLG 253
           +L LP +L + +   + +DTDV+F   +  +W L       Q   +          WY  
Sbjct: 195 RLFLPVIL-KDVDSLLYVDTDVLFLRPVDDIWKLLRHFNSTQLAAMAPEHEIPKIGWY-- 251

Query: 254 KLWKNHKPWPALGRGFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRL 302
             +  H  + A G   N+GV+L++LT++R            ++W      + +K+     
Sbjct: 252 SRFARHPFYGAAG--VNSGVMLMNLTRIRSTQFKNSLISTGLAWEDMLYPLYQKYKNAIT 309

Query: 303 WTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
           W    DQD+ N I   +P  +Y  PC WN +             E   + ++H N
Sbjct: 310 W---GDQDLLNIIFYFNPECLYVFPCHWNYRPDHCMYGSNCKEAEREGVSVLHGN 361


>gi|242000568|ref|XP_002434927.1| conserved hypothetical protein [Ixodes scapularis]
 gi|215498257|gb|EEC07751.1| conserved hypothetical protein [Ixodes scapularis]
          Length = 173

 Score = 55.1 bits (131), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 27/122 (22%), Positives = 57/122 (46%), Gaps = 5/122 (4%)

Query: 212 TIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNT 271
            + +D+D++F   + +LW+    +  +Q + +  +  D  +    +  K    L  G N 
Sbjct: 8   VLYVDSDIVFLRPVEELWSYLGAMDSQQMVAMAPHAEDANVSAYTRYAKHPFVLPYGVNA 67

Query: 272 GVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWN 331
           G++L++LT++R+ +W      + EK+          DQ + N +    P  +  L C+WN
Sbjct: 68  GLMLMNLTRMRNFAWESRLEPMYEKY-----NRPFGDQALVNIVFHFFPEKLMMLSCKWN 122

Query: 332 VQ 333
            +
Sbjct: 123 FR 124


>gi|355746568|gb|EHH51182.1| hypothetical protein EGM_10516, partial [Macaca fascicularis]
          Length = 354

 Score = 55.1 bits (131), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 41/172 (23%), Positives = 71/172 (41%), Gaps = 17/172 (9%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L + +   + +DTDV+F   +  +W L       Q   +        +G  + 
Sbjct: 107 RLFLPVIL-KDVDSLLYVDTDVLFLRPVDDIWKLLRLFNSTQLAAMAPEHEIPKIG-WYS 164

Query: 258 NHKPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWTS 305
                P  G  G N+GV+L++LT++R            ++W      + +K+     W  
Sbjct: 165 RFARHPFYGSAGVNSGVMLMNLTRIRSTQFKNSMIPTGLAWEDMLYPLYQKYKNAITW-- 222

Query: 306 LADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
             DQD+ N I   +P  +Y  PCQWN +             E   + ++H N
Sbjct: 223 -GDQDLLNIIFYFNPECLYVFPCQWNYRPDHCMYGSNCREAEHEGVSVLHGN 273


>gi|119585929|gb|EAW65525.1| hCG1998746, isoform CRA_b [Homo sapiens]
          Length = 364

 Score = 55.1 bits (131), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 41/172 (23%), Positives = 71/172 (41%), Gaps = 17/172 (9%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L + +   + +DTDV+F   +  +W L       Q   +        +G  + 
Sbjct: 117 RLFLPVIL-KDVDSLLYVDTDVLFLRPVDDIWKLLRLFNSTQLAAMAPEHEIPKIG-WYS 174

Query: 258 NHKPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWTS 305
                P  G  G N+GV+L++LT++R            ++W      + +K+     W  
Sbjct: 175 RFARHPFYGSAGVNSGVMLMNLTRIRSTQFKNSMIPTGLAWEDMLYPLYQKYKNAITW-- 232

Query: 306 LADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
             DQD+ N I   +P  +Y  PCQWN +             E   + ++H N
Sbjct: 233 -GDQDLLNIIFYFNPECLYVFPCQWNYRPDHCMYGSNCREAEHEGVSVLHGN 283


>gi|302565234|ref|NP_001180620.1| glucoside xylosyltransferase 2 precursor [Macaca mulatta]
          Length = 443

 Score = 55.1 bits (131), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 41/172 (23%), Positives = 71/172 (41%), Gaps = 17/172 (9%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L + +   + +DTDV+F   +  +W L       Q   +        +G  + 
Sbjct: 196 RLFLPVIL-KDVDSLLYVDTDVLFLRPVDDIWKLLRLFNSTQLAAMAPEHEIPKIG-WYS 253

Query: 258 NHKPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWTS 305
                P  G  G N+GV+L++LT++R            ++W      + +K+     W  
Sbjct: 254 RFARHPFYGSAGVNSGVMLMNLTRIRSTQFKNSMIPTGLAWEDMLYPLYQKYKNAITW-- 311

Query: 306 LADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
             DQD+ N I   +P  +Y  PCQWN +             E   + ++H N
Sbjct: 312 -GDQDLLNIIFYFNPECLYVFPCQWNYRPDHCMYGSNCREAEHEGVSVLHGN 362


>gi|355559502|gb|EHH16230.1| hypothetical protein EGK_11484, partial [Macaca mulatta]
          Length = 366

 Score = 54.7 bits (130), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 41/172 (23%), Positives = 71/172 (41%), Gaps = 17/172 (9%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L + +   + +DTDV+F   +  +W L       Q   +        +G  + 
Sbjct: 119 RLFLPVIL-KDVDSLLYVDTDVLFLRPVDDIWKLLRLFNSTQLAAMAPEHEIPKIG-WYS 176

Query: 258 NHKPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWTS 305
                P  G  G N+GV+L++LT++R            ++W      + +K+     W  
Sbjct: 177 RFARHPFYGSAGVNSGVMLMNLTRIRSTQFKNSMIPTGLAWEDMLYPLYQKYKNAITW-- 234

Query: 306 LADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
             DQD+ N I   +P  +Y  PCQWN +             E   + ++H N
Sbjct: 235 -GDQDLLNIIFYFNPECLYVFPCQWNYRPDHCMYGSNCREAEHEGVSVLHGN 285


>gi|332817310|ref|XP_516584.3| PREDICTED: glucoside xylosyltransferase 2 [Pan troglodytes]
 gi|397489677|ref|XP_003815849.1| PREDICTED: glucoside xylosyltransferase 2 [Pan paniscus]
 gi|118341485|gb|AAI27734.1| GLT8D4 protein [Homo sapiens]
          Length = 317

 Score = 54.7 bits (130), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 41/172 (23%), Positives = 71/172 (41%), Gaps = 17/172 (9%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L + +   + +DTDV+F   +  +W L       Q   +        +G  + 
Sbjct: 70  RLFLPVIL-KDVDSLLYVDTDVLFLRPVDDIWKLLRLFNSTQLAAMAPEHEIPKIG-WYS 127

Query: 258 NHKPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWTS 305
                P  G  G N+GV+L++LT++R            ++W      + +K+     W  
Sbjct: 128 RFARHPFYGSAGVNSGVMLMNLTRIRSTQFKNSMIPTGLAWEDMLYPLYQKYKNAITW-- 185

Query: 306 LADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
             DQD+ N I   +P  +Y  PCQWN +             E   + ++H N
Sbjct: 186 -GDQDLLNIIFYFNPECLYVFPCQWNYRPDHCMYGSNCREAEHEGVSVLHGN 236


>gi|395733536|ref|XP_002813566.2| PREDICTED: glucoside xylosyltransferase 2 [Pongo abelii]
          Length = 317

 Score = 54.7 bits (130), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 41/172 (23%), Positives = 71/172 (41%), Gaps = 17/172 (9%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L + +   + +DTDV+F   +  +W L       Q   +        +G  + 
Sbjct: 70  RLFLPVIL-KDVDSLLYVDTDVLFLRPVDDIWKLLRLFNSTQLAAMAPEHEIPKIG-WYS 127

Query: 258 NHKPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWTS 305
                P  G  G N+GV+L++LT++R            ++W      + +K+     W  
Sbjct: 128 RFARHPFYGSAGVNSGVMLMNLTRIRSTQFKNSMIPTGLAWEDMLYPLYQKYKNAITW-- 185

Query: 306 LADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
             DQD+ N I   +P  +Y  PCQWN +             E   + ++H N
Sbjct: 186 -GDQDLLNIIFYFNPECLYVFPCQWNYRPDHCMYGSNCREAEHEGVSVLHGN 236


>gi|345482774|ref|XP_001599812.2| PREDICTED: N-acetyllactosaminide
           beta-1,3-N-acetylglucosaminyltransferase-like [Nasonia
           vitripennis]
          Length = 518

 Score = 54.7 bits (130), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 65/277 (23%), Positives = 97/277 (35%), Gaps = 91/277 (32%)

Query: 474 SDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEA----QQFLSYAGNS-EAL 528
           S+ + V L  Q S+++L  +  +   W G IS+ LY +  E     Q++L Y  +  E +
Sbjct: 176 SESSRVCLATQTSLEKLSSLVQVVHQWTGTISVALYAAGDEEFEVLQRYLEYMRHCYEPI 235

Query: 529 RSRRNIGYHVVYK---------------------EG-------------------NFYPI 548
           R R      V                        EG                   N YP 
Sbjct: 236 RERVTFSLAVPRNRPPRAQPRYYPPDGLLDCQKPEGTLNQMTRAISNEQTNWRIRNVYPQ 295

Query: 549 NTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQR 608
           N +RN+A     T YVFL D+D +P   L   L   +          K   V+P +E   
Sbjct: 296 NHMRNLARKNCQTDYVFLTDVDIVPSTNLSRVLDEFL----GQEKCDKCAYVIPTYELD- 350

Query: 609 YRTAFPASHA------------------------PTNFSRWVNATTP------------- 631
            R  FP + +                         TNF+RW+   TP             
Sbjct: 351 ARVRFPQNKSELIRLAKKGLARPFHQKVFIHNQFATNFTRWMLDATPGYKAHENLKMGKA 410

Query: 632 YQIEWAPDF----EPYIVAHRDLPRYDTRFVGFGWNK 664
           Y      +F    EP+ VA   +P +D RF+G+G+ +
Sbjct: 411 YVSHDVTNFEFLYEPFYVARDLVPPHDERFMGYGYTR 447


>gi|386265848|ref|YP_005829340.1| glycosyltransferase [Haemophilus influenzae R2846]
 gi|309973084|gb|ADO96285.1| Probable glycosyltransferase [Haemophilus influenzae R2846]
          Length = 301

 Score = 54.7 bits (130), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 39/146 (26%), Positives = 72/146 (49%), Gaps = 13/146 (8%)

Query: 189 HYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQS 248
           H      L +L L ++LP+ + + I LD D+I    I +LW +           L+   S
Sbjct: 77  HIQSTASLFRLYLHQILPQHIERVIYLDIDLIIHQAIDELWDI------NLEDSLIAGVS 130

Query: 249 DWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLAD 308
           D+++  LW++  P+    +  N+GV+L++L K R+ +   ++   A K+       S  D
Sbjct: 131 DFFIEYLWEH--PFCETQQYINSGVMLINLNKWRENNIEQYFIEFAAKYGKN---LSFGD 185

Query: 309 QDIFNAIISEHPYLVYTLPCQWNVQL 334
           QD+ N  I  +  L+  L  ++N+Q+
Sbjct: 186 QDVINFSIPTN--LIKLLSVKFNIQV 209



 Score = 39.7 bits (91), Expect = 5.9,   Method: Compositional matrix adjust.
 Identities = 33/140 (23%), Positives = 62/140 (44%), Gaps = 12/140 (8%)

Query: 25  NSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQVEVSFYLADSVVED 84
           N    L T++KS L Y  N ++ ++I D +    L+ L       + + + Y     +  
Sbjct: 10  NFASHLDTVLKS-LCYHHNNINIYVIHDGIPAESLEKL--KMHCAKFDNTLYYIQFNINQ 66

Query: 85  VSW---IPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQR 141
            S+   +   H      L +L L ++LP+ + + I LD D+I    I +LW +       
Sbjct: 67  FSFPTVMSPAHIQSTASLFRLYLHQILPQHIERVIYLDIDLIIHQAIDELWDI------N 120

Query: 142 QTIGLVENQSDWYLGKLWKN 161
               L+   SD+++  LW++
Sbjct: 121 LEDSLIAGVSDFFIEYLWEH 140


>gi|443682654|gb|ELT87167.1| hypothetical protein CAPTEDRAFT_200999 [Capitella teleta]
          Length = 177

 Score = 54.3 bits (129), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 36/97 (37%), Positives = 50/97 (51%), Gaps = 18/97 (18%)

Query: 517 QFLSYA--------GNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLD 568
           QFL+Y          N       RNIG+  V     F+P+NTLRN+AL+   T +VFL D
Sbjct: 12  QFLTYKYHRRRSPLTNRLCFDHLRNIGWGHV-----FFPVNTLRNIALDMAVTSHVFLTD 66

Query: 569 IDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFE 605
           +DF+P   LY      + S+ +      + LV+PAFE
Sbjct: 67  VDFIPDQNLYENALQQLHSLQV-----MQSLVIPAFE 98


>gi|313232027|emb|CBY09138.1| unnamed protein product [Oikopleura dioica]
          Length = 380

 Score = 54.3 bits (129), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 43/153 (28%), Positives = 68/153 (44%), Gaps = 15/153 (9%)

Query: 192 GVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWY 251
           G     +L LP+VL + +   + +D+DV+F      LW+ F +  + Q  GL   +  W 
Sbjct: 125 GPCASFRLFLPEVLSD-VDSVVYVDSDVLFLAPPQDLWSHFEKFTEHQVGGLAP-RVGWD 182

Query: 252 LGKLWKNHKPWPALGRGF---NTGVILLDLTKLRDISWAGFWRIIAEKF-----LLTRLW 303
                 N       G G    N+GV L++LT++R+  +     I    F     LL  L+
Sbjct: 183 FKVPGSNPNFILRAGAGMTQVNSGVFLMNLTRMREPVFNTAESISLNSFKWDKDLLLPLY 242

Query: 304 -----TSLADQDIFNAIISEHPYLVYTLPCQWN 331
                    DQ++ N     +P L+Y LPC+WN
Sbjct: 243 YRNKKDMYGDQNLINTAFHYNPELIYFLPCRWN 275


>gi|428163092|gb|EKX32183.1| hypothetical protein GUITHDRAFT_121643 [Guillardia theta CCMP2712]
          Length = 1349

 Score = 54.3 bits (129), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 59/260 (22%), Positives = 100/260 (38%), Gaps = 82/260 (31%)

Query: 479  VTLVAQLSMDRLQMVEMLFKHWEGPISLTLYM-SDAEAQQFLSYAGNSEAL-RSRRNIGY 536
            VTLV   ++DR  ++  +  HW+  +++  Y  S ++ QQ   +  +S     S R    
Sbjct: 887  VTLVTVATVDRAVVLVNVAAHWKEDMAVAFYARSQSDEQQIRRFVRDSLGPWFSSRGRQL 946

Query: 537  HVVY--------KEGNF--YPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYP-----YL 581
             +V          E  F  +PIN LR ++     +  V   D+D +P   L       Y 
Sbjct: 947  EIVMLTLCPDADAEARFAVFPINMLRKISCAIAKSDLVLYSDVDMIPSDSLAHQIRLLYQ 1006

Query: 582  KSSIRSMDMHGHGGKKVLVVPAFETQR----------------YRT-------------- 611
             S++ S        +++LVVP+F++                   RT              
Sbjct: 1007 TSTVTS--------QQLLVVPSFKSNGAWPPYAHVELHNKSLLVRTHAVTFLDLVTSFDA 1058

Query: 612  ------AFPAS--------------HAPTNFSRWVNATTPYQIEWAPDFEPYIVAHRD-- 649
                  A P                HAPT +SRW N +  Y++++   +EPY+V +R   
Sbjct: 1059 CQVNIPAMPCGLWSPSHRWTEAGVYHAPTEYSRWFNTSVAYEVDYITGYEPYVVVNRSSW 1118

Query: 650  -----LPRYDTRFVGFGWNK 664
                    YD  ++ +GW+K
Sbjct: 1119 MGRHGAGMYDDGYISWGWDK 1138


>gi|241784637|ref|XP_002414405.1| glycosyltransferase domain-containing protein, putative [Ixodes
           scapularis]
 gi|215508616|gb|EEC18070.1| glycosyltransferase domain-containing protein, putative [Ixodes
           scapularis]
          Length = 231

 Score = 54.3 bits (129), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 36/130 (27%), Positives = 60/130 (46%), Gaps = 6/130 (4%)

Query: 203 KVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPW 262
           ++LP+     + +D+D +F   +  LW  F ++  +Q   +  +  D  L     N    
Sbjct: 1   RLLPQE-DGVLYVDSDAVFFLSVEDLWNYFGQMNDQQLAVVFSDVEDEALTIY--NEWEM 57

Query: 263 PALGR-GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPY 321
           P   R G N GV+L++LT++R+           +K+    L     DQDI N +   HP 
Sbjct: 58  PHYKRYGINAGVMLMNLTRMRNSGLDALLLSWMDKYHHKVL--DFHDQDIINTVFHHHPE 115

Query: 322 LVYTLPCQWN 331
            V+  PC+WN
Sbjct: 116 KVFHGPCKWN 125


>gi|119585931|gb|EAW65527.1| hCG2042744 [Homo sapiens]
          Length = 324

 Score = 54.3 bits (129), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 43/176 (24%), Positives = 72/176 (40%), Gaps = 25/176 (14%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQS----DWYLG 253
           +L LP +L + +   + +DTDV+F   +  +W L       Q   +          WY  
Sbjct: 117 RLFLPVIL-KDVDSLLYVDTDVLFLRPVDDIWKLLRLFNSTQLAAMAPEHEIPKIGWY-- 173

Query: 254 KLWKNHKPWPALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTR 301
             +  H   P  G  G N+GV+L++LT++R            ++W      + +K+    
Sbjct: 174 SRFARH---PFYGSAGVNSGVMLMNLTRIRSTQFKNSMIPTGLAWEDMLYPLYQKYKNAI 230

Query: 302 LWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
            W    DQD+ N I   +P  +Y  PCQWN +             E   + ++H N
Sbjct: 231 TW---GDQDLLNIIFYFNPECLYVFPCQWNYRPDHCMYGSNCREAEHEGVSVLHGN 283


>gi|145632851|ref|ZP_01788584.1| fumarate hydratase [Haemophilus influenzae 3655]
 gi|144986507|gb|EDJ93073.1| fumarate hydratase [Haemophilus influenzae 3655]
          Length = 277

 Score = 53.9 bits (128), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 40/147 (27%), Positives = 73/147 (49%), Gaps = 13/147 (8%)

Query: 189 HYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQS 248
           H      L +L L ++LP+ + + I LD D+I    I +LW +           L+   S
Sbjct: 77  HIQSTASLFRLYLHQILPQHIERVIYLDMDLIIHQAIDELWDI------NLEDSLIAGVS 130

Query: 249 DWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLAD 308
           D++   LW++  P+    +  NTGV+L++L K R+ +   ++   A K+    ++    D
Sbjct: 131 DFFSEYLWEH--PFYEKQQYINTGVMLINLNKWRENNIEQYFIEYAAKYGEFFVY---GD 185

Query: 309 QDIFNAIISEHPYLVYTLPCQWNVQLS 335
           QD+ N  I  +  L+  LP ++N+Q+ 
Sbjct: 186 QDVINFSIPTN--LIKLLPVKFNIQVK 210


>gi|299751516|ref|XP_001830315.2| hypothetical protein CC1G_01951 [Coprinopsis cinerea okayama7#130]
 gi|298409410|gb|EAU91462.2| hypothetical protein CC1G_01951 [Coprinopsis cinerea okayama7#130]
          Length = 499

 Score = 53.9 bits (128), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 68/275 (24%), Positives = 119/275 (43%), Gaps = 64/275 (23%)

Query: 444 NDVTLVAQLSMD-----RLQMVEINPNNVDEYEASSDG----NDVTLVAQLSMDRLQMVE 494
           +D  L+   S+D     +     + P+ V  Y   + G    ++VT+ + ++ DRL +  
Sbjct: 141 HDTGLLRSTSVDDVLFSKAFATSMRPSQVVPYFYRATGPFSEDEVTITSIITSDRLDVFA 200

Query: 495 MLFKHWEGPISLTLYMSDAEAQ---QFLSY----AGNSEALRSRRNIGYHVVYKEGNFYP 547
            L + ++GPIS+T ++ ++  Q   Q L Y       SE +    +I  H+V    +   
Sbjct: 201 RLVERYQGPISVTFHVKNSTQQQVSQILDYLQKIVTASETVVRFADI--HLVLDSFD-RQ 257

Query: 548 INTLRNVALNQVSTPYVFLLDIDF--LPMFGLY----PYLKSSIRSMDMHGHGGKKVLVV 601
            NT RN+A     T +V +LDIDF     F L     P++++ +         G    V+
Sbjct: 258 FNTWRNIARLFARTNFVMMLDIDFYLCTDFRLTLRNNPWVRAKLEE-------GLSAFVI 310

Query: 602 PAFETQRYR-----TAFPAS-----------------------HAPTNFSRWVNATTP-- 631
           PAFE    R     + FP +                       H  T++ ++ NA     
Sbjct: 311 PAFEYTDPRQEKGFSTFPTTKQDLLSKIKAGTISMFHASWKPGHNSTDYPKFYNAPAGDV 370

Query: 632 YQI-EWAPDFEPYIVAHRDLPRY-DTRFVGFGWNK 664
           Y++  +   +EPY++  +  P + + RFVG+G NK
Sbjct: 371 YKVTTYQSAYEPYVIFKKAGPPWCEERFVGYGGNK 405


>gi|159485548|ref|XP_001700806.1| hypothetical protein CHLREDRAFT_187466 [Chlamydomonas reinhardtii]
 gi|158281305|gb|EDP07060.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 363

 Score = 53.9 bits (128), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 41/153 (26%), Positives = 63/153 (41%), Gaps = 39/153 (25%)

Query: 545 FYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAF 604
             PIN+LRNVAL    TP V ++D+D  P  GL   L   +  +         + V+PA+
Sbjct: 117 LVPINSLRNVALLAARTPLVAMIDVDLAPCEGLAGRLLRDVSRVQGLQAERDAMWVLPAW 176

Query: 605 E------------------------------TQRYRT----AFPASHAPTNFSRWVNATT 630
           E                              + R RT     +   H  T+FSRW+ +  
Sbjct: 177 ESSPQLAQAEAEAVVQAARTGDKPRLAALWSSHRIRTFSEDVYARGHNATDFSRWLRSPK 236

Query: 631 PYQIEWAPDFEPYIVAHRDL----PRYDTRFVG 659
           PY +++   +EP+ +  R+     P +D RF G
Sbjct: 237 PYVVQYETGYEPWGIVAREHYLQWP-FDERFRG 268


>gi|422293685|gb|EKU20985.1| hypothetical protein NGA_2018100, partial [Nannochloropsis gaditana
           CCMP526]
          Length = 262

 Score = 53.9 bits (128), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 33/104 (31%), Positives = 56/104 (53%), Gaps = 11/104 (10%)

Query: 478 DVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSD--AEAQQFLSYAGNSE--------A 527
           D TLV+Q S DR+ M+  + + WEGPI + ++M    AE   F++     +        +
Sbjct: 160 DFTLVSQSSPDRIWMLPHICERWEGPIIVAIFMHGFTAEHDAFMANLKCPQLQCLPVHPS 219

Query: 528 LRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDF 571
           + +  +I    V+   N YP+N LRN+A+++V T +  L D+D 
Sbjct: 220 VPASSSIPAS-VWAAPNLYPVNALRNLAISEVRTTHFLLTDMDL 262


>gi|242020666|ref|XP_002430773.1| conserved hypothetical protein [Pediculus humanus corporis]
 gi|212515970|gb|EEB18035.1| conserved hypothetical protein [Pediculus humanus corporis]
          Length = 356

 Score = 53.9 bits (128), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 54/203 (26%), Positives = 80/203 (39%), Gaps = 34/203 (16%)

Query: 186 PNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVE 245
           P+ +YS     + L L K+  +     I++D D     D+  L+  F+     + IG+  
Sbjct: 161 PDSYYSDALFFISLGLHKIADDQ-NYGIMIDVDTKIQIDVKLLFDEFNNFSSNELIGIAP 219

Query: 246 NQSDWYLGKLWKNH---------KPWPALGR-GFNTGVILLDLTKLRDISWAGFWRIIAE 295
            QS  Y   L+            KP    G  G N+GV+LL   +LR         ++  
Sbjct: 220 EQSPVYRHVLYMYRAKYRDTNFGKPLSQKGNPGVNSGVLLLHFERLRK-------SLLYN 272

Query: 296 KFLLTR----------LWTSLADQDIFNAIISEHPYLVYTLPCQWNVQL----SDNTRSD 341
           K LL                L DQD +  +  EH  L++ L C WN QL    S +   D
Sbjct: 273 KLLLPEQVKRLVDKYMFQGHLGDQDWYTLVSQEHKKLIHLLDCTWNRQLCQWWSHHGYED 332

Query: 342 --ELCYTELTDLKIIHWNSPKKL 362
              + Y    D+KI H N   K+
Sbjct: 333 IFNMFYHCDGDVKIYHGNCNTKI 355


>gi|358332613|dbj|GAA51247.1| glycosyltransferase-like protein LARGE, partial [Clonorchis
           sinensis]
          Length = 159

 Score = 53.5 bits (127), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 35/128 (27%), Positives = 55/128 (42%), Gaps = 31/128 (24%)

Query: 10  PTCEVIQVAIVCAGYNSTRSLVTLIKSILFYR---------------------------- 41
           PT  ++ V    A +  +R   TL+KS+ +Y+                            
Sbjct: 32  PTVHLVHVLKGEAAFKRSR---TLLKSVFYYQGRLRSNRSECSLPRTPSPKPCAQQRKPE 88

Query: 42  KNPLHFHLITDTVALNILQTLFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLK 101
           + P+H HLI +    + +    S W V Q + + Y  D  +  VSWI N HY+GV  L K
Sbjct: 89  RTPIHLHLIAERKLWSTINDSVSQWLVQQFQWTLYDLDDYMSRVSWISNGHYAGVVTLAK 148

Query: 102 LTLPKVLP 109
           + +P +LP
Sbjct: 149 MIVPDILP 156


>gi|219115613|ref|XP_002178602.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217410337|gb|EEC50267.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 293

 Score = 53.5 bits (127), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 43/171 (25%), Positives = 69/171 (40%), Gaps = 57/171 (33%)

Query: 546 YPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYP--YLKSSIRSMDMHGHGGKKVLVVPA 603
           YP+N LRN+A++QV T +   +D DF+   GL+   YL  ++ + D      +  +V+PA
Sbjct: 54  YPVNKLRNLAMSQVKTSHAVFIDADFVLSAGLFETLYLHRALLAAD-----ERNAIVIPA 108

Query: 604 FETQRY------------------------------------RTAFPA--------SHAP 619
           FE ++                                     R+A            HA 
Sbjct: 109 FELRKVCETTSYECLTTHLAMLPRDKEELLHLHKKSIESNGGRSAILQFNGLSNIHGHAS 168

Query: 620 TNFSRWVNATT----PYQIEWAPDFEPYIVAH--RDLPRYDTRFVGFGWNK 664
           T ++ WV        P +   +  +EPY+V    R+LP +   F G+G NK
Sbjct: 169 TRYNDWVTQPAEQLLPIECVTSDRYEPYLVVRQCRNLPPFQEAFAGYGQNK 219


>gi|302847664|ref|XP_002955366.1| hypothetical protein VOLCADRAFT_96251 [Volvox carteri f.
           nagariensis]
 gi|300259438|gb|EFJ43666.1| hypothetical protein VOLCADRAFT_96251 [Volvox carteri f.
           nagariensis]
          Length = 464

 Score = 53.5 bits (127), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 40/155 (25%), Positives = 58/155 (37%), Gaps = 35/155 (22%)

Query: 545 FYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHG-HGGKKVLVVPA 603
            YP+N LRN A  QV T  + ++D+D      L   +    R          ++  V+PA
Sbjct: 200 LYPVNYLRNYARMQVRTRLLAMIDVDMYVSRTLSEEMAREGRIAHYESLCAERRATVLPA 259

Query: 604 FETQR---------------------------------YRTAFPASHAPTNFSRWVNATT 630
           FE  R                                     FP  H PT++ RW   + 
Sbjct: 260 FEPTRPGDKGRGMALKITQVTKSELASMHGRNKEAIQFKLRVFPRGHTPTDYVRWFTESQ 319

Query: 631 PYQIEWAPDFEP-YIVAHRDLPRYDTRFVGFGWNK 664
           PY + +   +EP +I     +P YD  F G+G NK
Sbjct: 320 PYPVSYKRFYEPWFITCDEVMPWYDVDFRGYGMNK 354


>gi|145634288|ref|ZP_01789998.1| fumarate hydratase [Haemophilus influenzae PittAA]
 gi|145268268|gb|EDK08262.1| fumarate hydratase [Haemophilus influenzae PittAA]
          Length = 310

 Score = 53.5 bits (127), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 40/147 (27%), Positives = 73/147 (49%), Gaps = 13/147 (8%)

Query: 189 HYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQS 248
           H      L +L L ++LP+ + + I LD D+I    I +LW +           L+   S
Sbjct: 77  HIQSSASLFRLYLHQILPQHIERVIYLDIDLIIHQAIDELWDI------NLEDSLIAGVS 130

Query: 249 DWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLAD 308
           D++   LW++  P+    +  NTGV+L++L K R+ +   ++   A K+    ++    D
Sbjct: 131 DFFSEYLWEH--PFYEKQQYINTGVMLINLNKWRENNIEQYFIEYAAKYGEFFVY---GD 185

Query: 309 QDIFNAIISEHPYLVYTLPCQWNVQLS 335
           QD+ N  I  +  L+  LP ++N+Q+ 
Sbjct: 186 QDVINFSIPTN--LIKLLPVKFNIQVK 210


>gi|387199040|gb|AFJ68883.1| hypothetical protein NGATSA_2018100, partial [Nannochloropsis
           gaditana CCMP526]
          Length = 216

 Score = 53.5 bits (127), Expect = 4e-04,   Method: Composition-based stats.
 Identities = 33/103 (32%), Positives = 56/103 (54%), Gaps = 11/103 (10%)

Query: 478 DVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMS--DAEAQQFLSYAGNSE--------A 527
           D TLV+Q S DR+ M+  + + WEGPI + ++M    AE   F++     +        +
Sbjct: 114 DFTLVSQSSPDRIWMLPHICERWEGPIIVAIFMHGFTAEHDAFMANLKCPQLQCLPVHPS 173

Query: 528 LRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDID 570
           + +  +I    V+   N YP+N LRN+A+++V T +  L D+D
Sbjct: 174 VPASSSIPAS-VWAAPNLYPVNALRNLAISEVRTTHFLLTDMD 215


>gi|428174702|gb|EKX43596.1| hypothetical protein GUITHDRAFT_140359 [Guillardia theta CCMP2712]
          Length = 882

 Score = 53.5 bits (127), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 48/178 (26%), Positives = 68/178 (38%), Gaps = 64/178 (35%)

Query: 546 YPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGH-GGKKVLVVPAF 604
           +PIN LR ++     T  V   D+D L       +L S+IR     G  G  ++LV+P+F
Sbjct: 498 FPINMLRKISCAIARTDLVLYADVDMLA----SDWLASNIRQAYQDGKVGPTELLVIPSF 553

Query: 605 --------------ETQRYRTAFPAS---------------------------------- 616
                         E QR R   P S                                  
Sbjct: 554 KSSEAWPMEAGISLEQQRMRVE-PVSISMDALRLNFQRRQVWIPGLVGGMWSTAGRWHES 612

Query: 617 ---HAPTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPR-------YDTRFVGFGWNK 664
              HA T + RW+ A   YQ+++   +EPY+V +R   R       YD RFV +GW+K
Sbjct: 613 GVFHAGTEYDRWMEAEETYQVKYILGYEPYVVVNRTAWRGSHGVGLYDDRFVHWGWDK 670


>gi|167517295|ref|XP_001742988.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163778087|gb|EDQ91702.1| predicted protein [Monosiga brevicollis MX1]
          Length = 408

 Score = 53.5 bits (127), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 48/159 (30%), Positives = 72/159 (45%), Gaps = 41/159 (25%)

Query: 546 YPINTLRNVALN-QVSTP---YVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVL-V 600
           YPIN LRN A+    S+P   ++F LD DFLP   L   L +++ + +  G    ++  V
Sbjct: 52  YPINALRNAAMQLGPSSPAADFMFYLDADFLPSPDLPEALAAAVAAAETDGEDVARIAWV 111

Query: 601 VPAFETQ------------------------------RYRTAFPASHAPTNFSRWVNATT 630
           VPAFE +                              R R + P SH  TN+++++N   
Sbjct: 112 VPAFELKGQSGPTVDRLLPASMDALRSQYAADQVVPFRSRES-PLSHRATNYAQFLNPEN 170

Query: 631 P---YQIEWAPDFEPYIVAHR--DLPRYDTRFVGFGWNK 664
               Y IE+   FEPY+      +LP +  +F G+G NK
Sbjct: 171 RQPYYLIEYEDKFEPYVALRNNPELPLFAEQFEGYGLNK 209


>gi|229844138|ref|ZP_04464279.1| aspartate-semialdehyde dehydrogenase [Haemophilus influenzae
           6P18H1]
 gi|229813132|gb|EEP48820.1| aspartate-semialdehyde dehydrogenase [Haemophilus influenzae
           6P18H1]
          Length = 293

 Score = 53.1 bits (126), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 40/147 (27%), Positives = 73/147 (49%), Gaps = 13/147 (8%)

Query: 189 HYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQS 248
           H      L +L L ++LP+ + + I LD D+I    I +LW +           L+   S
Sbjct: 77  HIQSSASLFRLYLHQILPQHIERVIYLDIDLIIHQAIDELWDI------NLEDSLIAGVS 130

Query: 249 DWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLAD 308
           D++   LW++  P+    +  NTGV+L++L K R+ +   ++   A K+    ++    D
Sbjct: 131 DFFSEYLWEH--PFYEKQQYINTGVMLINLNKWRENNIEQYFIEYAAKYGEFFVY---GD 185

Query: 309 QDIFNAIISEHPYLVYTLPCQWNVQLS 335
           QD+ N  I  +  L+  LP ++N+Q+ 
Sbjct: 186 QDVINFSIPTN--LIKLLPVKFNIQVK 210


>gi|148825938|ref|YP_001290691.1| aspartate-semialdehyde dehydrogenase [Haemophilus influenzae
           PittEE]
 gi|148716098|gb|ABQ98308.1| aspartate-semialdehyde dehydrogenase [Haemophilus influenzae
           PittEE]
          Length = 300

 Score = 53.1 bits (126), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 40/147 (27%), Positives = 73/147 (49%), Gaps = 13/147 (8%)

Query: 189 HYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQS 248
           H      L +L L ++LP+ + + I LD D+I    I +LW +           L+   S
Sbjct: 77  HIQSSASLFRLYLHQILPQHIERVIYLDIDLIIHQAIDELWDI------NLEDSLIAGVS 130

Query: 249 DWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLAD 308
           D++   LW++  P+    +  NTGV+L++L K R+ +   ++   A K+    ++    D
Sbjct: 131 DFFSEYLWEH--PFYEKQQYINTGVMLINLNKWRENNIEQYFIEYAAKYGEFFVY---GD 185

Query: 309 QDIFNAIISEHPYLVYTLPCQWNVQLS 335
           QD+ N  I  +  L+  LP ++N+Q+ 
Sbjct: 186 QDVINFSIPTN--LIKLLPVKFNIQVK 210


>gi|223993881|ref|XP_002286624.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220977939|gb|EED96265.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 378

 Score = 53.1 bits (126), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 43/170 (25%), Positives = 75/170 (44%), Gaps = 22/170 (12%)

Query: 476 GNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRS----- 530
           G  VTLV Q S+DR+ ++      W  PI L +Y+        +  A   E++       
Sbjct: 22  GASVTLVIQCSLDRVWLLSETCSRWPDPIVLVVYLPLDTVLDAVDRATVIESIAEVMTGC 81

Query: 531 -RRNIGYHVV-----YKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSS 584
            +  +  H+        + + YP+N +RN  L+ V+T +V ++D+D +P   L    K +
Sbjct: 82  PQMKVVLHISDTSNGKDDTSTYPVNIMRNRGLDAVTTSHVLIMDVDLIPSADLSHVAKDN 141

Query: 585 -IRSMDMHGHGGKK---------VLVVPAFETQRYRTAFPASHAPTNFSR 624
            +  + M     KK          +VVPAFE ++  T  P   +  ++ R
Sbjct: 142 LVDQITMLNQTQKKGGEMEIPMNAIVVPAFE-RKIETPCPDIESCKSYLR 190


>gi|153807295|ref|ZP_01959963.1| hypothetical protein BACCAC_01573 [Bacteroides caccae ATCC 43185]
 gi|149130415|gb|EDM21625.1| glycosyltransferase, family 8 [Bacteroides caccae ATCC 43185]
          Length = 310

 Score = 53.1 bits (126), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 42/149 (28%), Positives = 72/149 (48%), Gaps = 26/149 (17%)

Query: 190 YSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVE--NQ 247
           Y  +   L+L +P++LP  + K + +D+D+IF  DI+ L+   S +      G+ +  NQ
Sbjct: 79  YPSLATYLRLFIPQLLPFEVDKALYVDSDIIFKKDISALYD--SDITNYALAGMEDAPNQ 136

Query: 248 SDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWA----GFWRIIAEKFLLTRLW 303
           +   LG        +P     FN G +LL++  LRD+ +      + R   EK +L    
Sbjct: 137 NALRLG--------FPESDLYFNAGFVLLNVKYLRDMDFTNKAMAYIRDCREKIVLH--- 185

Query: 304 TSLADQDIFNAIISEHPYLVYTLPCQWNV 332
               DQD+ NA++      V  +P +WN+
Sbjct: 186 ----DQDVLNALLHGK---VLFVPIKWNM 207


>gi|290975672|ref|XP_002670566.1| predicted protein [Naegleria gruberi]
 gi|284084126|gb|EFC37822.1| predicted protein [Naegleria gruberi]
          Length = 378

 Score = 53.1 bits (126), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 42/139 (30%), Positives = 73/139 (52%), Gaps = 11/139 (7%)

Query: 13  EVIQVAIVCAGYNSTRSLVTL-IKSILFYR--KNPLHFHLITDTVALNIL-QTLFSTWSV 68
           + + + IV  G  +T    TL IKSIL        +  ++ TD   +N L + +F+   V
Sbjct: 73  KTLSITIVSGGNRTTSDETTLLIKSILLNHVGNTQIELNIATDLYNMNFLIENVFNRIKV 132

Query: 69  --PQVEVSFYLADSVVEDVSW----IPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDV 122
              Q+ V+F + +    D       IP  H+SG +G++KL L KV  +T+ +TI +DTD+
Sbjct: 133 YRTQLNVNFKIVNLTRIDEECNELKIPIVHHSGKWGMVKLFLDKVF-DTVKRTIFVDTDM 191

Query: 123 IFATDIAQLWALFSRLRQR 141
           +F T+   L++ F + ++ 
Sbjct: 192 VFGTNPDLLFSEFDKFKEE 210



 Score = 47.8 bits (112), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 42/163 (25%), Positives = 71/163 (43%), Gaps = 24/163 (14%)

Query: 185 IPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLV 244
           IP  H+SG +G++KL L KV  +T+ +TI +DTD++F T+   L++ F + ++       
Sbjct: 158 IPIVHHSGKWGMVKLFLDKVF-DTVKRTIFVDTDMVFGTNPDLLFSEFDKFKEETLFS-- 214

Query: 245 ENQSDWYLGKLWKNHKPWPALGRGFNTGVILL-DLTKLRDISWAGFWRIIA--EKF---- 297
                      W     W + G       I L D+ K R +++   + I A  E F    
Sbjct: 215 -----------WTREPNWESKGPNHVCSCIFLWDMKKTRKVNYLKEFAIPASIEIFGFDE 263

Query: 298 ---LLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDN 337
               +++   + +DQD   A+    P     L   WN+    N
Sbjct: 264 KTQQISKEKRNGSDQDFLFALNKMKPNFFNELDPSWNLANCQN 306


>gi|219121206|ref|XP_002185831.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|209582680|gb|ACI65301.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 1347

 Score = 53.1 bits (126), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 46/173 (26%), Positives = 79/173 (45%), Gaps = 15/173 (8%)

Query: 448  LVAQLSMDRLQMVEINPNNVDEYEASS----DGNDVTLVAQLSMDRLQMVEMLFKHWEGP 503
            LV + + DR++ +      V+E +  +    D    TLV Q S +RL ++    K W  P
Sbjct: 835  LVVRKAGDRIRHLGNTDLFVNETKCPANLIPDDVQTTLVIQSSSNRLWILAETCKRWNDP 894

Query: 504  ISLTLYMSDAEAQQFLS--YAGNSEALRSRRNIGYHVVYKEGN--FYPINTLRNVALNQV 559
            I   + ++  E    +S   +G  +     + I Y +   E     YP+N LRN+ L+ V
Sbjct: 895  IIAVIALTSTEDHDEVSALLSGWDDKCPHLQVIVYQMDTDEEKPEMYPVNRLRNIGLDAV 954

Query: 560  STPYVFLLDIDFLPMFGLYPYLKSSIRS-------MDMHGHGGKKVLVVPAFE 605
             T +V + D+DF+P   L   +++++          D      +  +VVPAFE
Sbjct: 955  HTSHVLVADVDFIPAQDLARLIRTTLLDRARLRVLKDDMIPKDQDAIVVPAFE 1007


>gi|300122415|emb|CBK22986.2| unnamed protein product [Blastocystis hominis]
          Length = 251

 Score = 53.1 bits (126), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 37/129 (28%), Positives = 64/129 (49%), Gaps = 8/129 (6%)

Query: 479 VTLVAQLSMDRLQMVEMLFKHWEGPISLTLYM-SDAEAQQFLSYAGNSEALRSRRNIGYH 537
           + LV    + R+     L + W GP+S+ +Y+ S  E +Q       +   RS     Y 
Sbjct: 11  IPLVFPFRIQRVFFFPHLVQRWSGPMSVAVYLKSHIEKEQVTQLLAQTSLPRSITLSFY- 69

Query: 538 VVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHG-HGGK 596
            + K+   YP N LRN+AL+++ T + +++D+D  P  GLY     S+  +D    +   
Sbjct: 70  -ITKDPTEYPYNKLRNIALSKIVTSHFWVMDMDMWPCDGLY----ESLHQLDSKFLNDDH 124

Query: 597 KVLVVPAFE 605
             ++VP+FE
Sbjct: 125 LAVIVPSFE 133


>gi|229846746|ref|ZP_04466853.1| putative glycosyl transferase, glycosyl transferase family 8
           protein [Haemophilus influenzae 7P49H1]
 gi|229810235|gb|EEP45954.1| putative glycosyl transferase, glycosyl transferase family 8
           protein [Haemophilus influenzae 7P49H1]
          Length = 312

 Score = 52.8 bits (125), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 39/147 (26%), Positives = 72/147 (48%), Gaps = 13/147 (8%)

Query: 189 HYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQS 248
           H      L +L L ++LP+ + + I LD D+I    I +LW +           L+   S
Sbjct: 77  HIQSTASLFRLYLHQILPQHIERVIYLDMDLIIHQAIDELWDI------NLEDSLIAGVS 130

Query: 249 DWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLAD 308
           D++   LW++  P+    +  NTGV+L++L K R+ +   ++   A K+    ++    D
Sbjct: 131 DFFSEYLWEH--PFYEKQQYINTGVMLINLNKWRENNIEQYFIEYAAKYGKNFVY---GD 185

Query: 309 QDIFNAIISEHPYLVYTLPCQWNVQLS 335
           QD+ N  I  +   +  LP ++N+Q+ 
Sbjct: 186 QDVINFSIPTNR--IKLLPVKFNIQVK 210


>gi|336372368|gb|EGO00707.1| glycosyltransferase family 49 protein [Serpula lacrymans var.
           lacrymans S7.3]
          Length = 337

 Score = 52.8 bits (125), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 53/246 (21%), Positives = 112/246 (45%), Gaps = 48/246 (19%)

Query: 462 INPNNVDE--YEASSDGN--DVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQ- 516
           + PN +    Y AS   N  D+T+   ++ +R ++   L + ++GPIS+T+++    +  
Sbjct: 15  MQPNKIVPFFYRASGTFNPEDITITTLVTSNRFKVFAQLVERYQGPISVTVHIKSDSSHT 74

Query: 517 -----QFLSYAGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDF 571
                +      +S ++ +  ++  H+V    +   +NT RN+A     T +V +LD+DF
Sbjct: 75  TALLNELHKLYASSPSMATFVDV--HLVLDPFD-RQLNTWRNIARFFARTDFVMMLDVDF 131

Query: 572 LPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQR----------------------- 608
           +        +++S +++      G   LVVPAFE  +                       
Sbjct: 132 VVCTDFRAAVRAS-KAVMARLSEGYTALVVPAFEYLKQADGADQKKFPRDKNSLLSLVKA 190

Query: 609 -----YRTAFPASHAPTNFSRWVNATTPYQI----EWAPDFEPYIVAHRDLPRY-DTRFV 658
                +  ++   H  T+++R+  ++ P ++    ++   +EPY++  +  P + D RF+
Sbjct: 191 KKIDMFHRSWAPGHNSTDYARFY-SSQPGEVYKVTQYHSAYEPYVIFKKTGPPWCDERFI 249

Query: 659 GFGWNK 664
           G+G NK
Sbjct: 250 GYGGNK 255


>gi|409350114|ref|ZP_11233352.1| Lipopolysaccharide biosynthesis proteins, LPS:glycosyltransferases
           [Lactobacillus equicursoris CIP 110162]
 gi|407877661|emb|CCK85410.1| Lipopolysaccharide biosynthesis proteins, LPS:glycosyltransferases
           [Lactobacillus equicursoris CIP 110162]
          Length = 574

 Score = 52.8 bits (125), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 47/176 (26%), Positives = 81/176 (46%), Gaps = 31/176 (17%)

Query: 195 GLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGK 254
           GL ++ LP +L E + + I LD D++   D+ +LW   ++    ++IG V +        
Sbjct: 263 GLFRMLLPDLLLE-VKQVIYLDADIMVNCDLVELW---NQHLYGKSIGAVVDSG------ 312

Query: 255 LWKNHKPWPALGRG-------FNTGVILLDLTKLRDIS--WAGFWRIIAEKFLLTRLWTS 305
           + K       +GRG       FN+GV+L++L ++R+    W  F   I +    +  W  
Sbjct: 313 IKKGRYTSKIIGRGLVEKDKYFNSGVLLMNLNRIREKGNLWEKFLSFIEKN---SNAW-- 367

Query: 306 LADQDIFNAIISEHPYLVYTLPCQWNV----QLSDNTRSDELCYTELTDLKIIHWN 357
           L DQD  N I  +    +Y L  +WN     + +DN +     Y    D +I ++N
Sbjct: 368 LIDQDALNCIFKDD---IYFLNSKWNTLIIDERADNPKLQPAIYHFAGDKRIYYYN 420


>gi|432118136|gb|ELK38021.1| hypothetical protein MDA_GLEAN10011661 [Myotis davidii]
          Length = 241

 Score = 52.8 bits (125), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 48/171 (28%), Positives = 69/171 (40%), Gaps = 24/171 (14%)

Query: 209 LAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPA---- 264
           + + I LD D+ + T+I +L+  F        IG+       Y    W+     P     
Sbjct: 67  ILQIIQLDLDLKYKTNIRELFEEFDNFLPGAIIGIAREMQPVYRHTFWQFRHENPKTRVG 126

Query: 265 -----LGRGFNTGVILLDLTKLRDISWAGFW------RIIAEKFLLTRLWTSLADQDIFN 313
                   GFN+GV+LL L  +R     G        + +A+K+        L DQD F 
Sbjct: 127 GPPPEGLPGFNSGVMLLHLEAMRQSPLYGRLLEPAQVQQLADKY---HFRGHLGDQDFFT 183

Query: 314 AIISEHPYLVYTLPCQWNVQLS----DNTRSD--ELCYTELTDLKIIHWNS 358
            I  EHP L + L C WN QL     D+  SD  E  +     ++I H N 
Sbjct: 184 MIGMEHPELFHVLDCTWNRQLCTWWRDHGYSDVFEAYFRCEGHVRIYHGNC 234


>gi|198432242|ref|XP_002119989.1| PREDICTED: similar to glycosyltransferase 8 domain containing 4
           [Ciona intestinalis]
          Length = 396

 Score = 52.8 bits (125), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 50/223 (22%), Positives = 92/223 (41%), Gaps = 29/223 (13%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L   +   + +DTD++F + +  +WA F +    Q   +     +  +G  + 
Sbjct: 164 RLFLPNIL-TNVDSLLYVDTDILFLSPVDDIWAKFKKFNSTQLSAMSPEHENPRVG-WYS 221

Query: 258 NHKPWPALG-RGFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWTS 305
                P  G  G N+GV+L+++T+LR            + W      +  K+     W  
Sbjct: 222 RFARHPYYGDTGINSGVMLINMTRLRSADFKNDVQKSKLKWDNLLLPLFRKYSSNLTW-- 279

Query: 306 LADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTD-LKIIH--------- 355
             DQDI N I   +P ++Y + C  N +  D+    ++C + + + + ++H         
Sbjct: 280 -GDQDILNIIFHHNPEMLYVMSCYLNYR-PDHCMYMQMCGSAVENGVSVMHGCRRSFHND 337

Query: 356 -WNSPKKLKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLT 397
            W S K +    K   F+  L     E   NL +     C L+
Sbjct: 338 KWPSFKAVYKAFKDYSFYNPLSSLLQEIKKNLDKSPHSQCKLS 380


>gi|16552621|dbj|BAB71355.1| unnamed protein product [Homo sapiens]
 gi|119598436|gb|EAW78030.1| chromosome 3 open reading frame 21, isoform CRA_b [Homo sapiens]
          Length = 190

 Score = 52.8 bits (125), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 49/171 (28%), Positives = 70/171 (40%), Gaps = 24/171 (14%)

Query: 209 LAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPA---- 264
           + + I LD D+ F T+I +L+  F        IG+       Y    W+     P     
Sbjct: 16  ILQIIQLDLDLKFKTNIRELFEEFDSFLPGAIIGIAREMQPVYRHTFWQFRHENPQTRVG 75

Query: 265 -----LGRGFNTGVILLDLTKLRD------ISWAGFWRIIAEKFLLTRLWTSLADQDIFN 313
                   GFN+GV+LL+L  +R       +      + +A+K+        L DQD F 
Sbjct: 76  GPPPEGLPGFNSGVMLLNLEAMRQSPLYSRLLEPAQVQQLADKYHFR---GHLGDQDFFT 132

Query: 314 AIISEHPYLVYTLPCQWNVQLS----DNTRSD--ELCYTELTDLKIIHWNS 358
            I  EHP L + L C WN QL     D+  SD  E  +     +KI H N 
Sbjct: 133 MIGMEHPKLFHVLDCTWNRQLCTWWRDHGYSDVFEAYFRCEGHVKIYHGNC 183


>gi|428162720|gb|EKX31837.1| hypothetical protein GUITHDRAFT_149039 [Guillardia theta CCMP2712]
          Length = 937

 Score = 52.4 bits (124), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 63/270 (23%), Positives = 102/270 (37%), Gaps = 72/270 (26%)

Query: 463 NPNNVDEYEASSDGN-DVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDA----EAQQ 517
           +P N+ +      G  DVTL    ++DRL  +     HW   +++ +Y   A    E ++
Sbjct: 466 HPVNLTQQHLGKAGALDVTLTTTATLDRLVTLISTAAHWGDNMAVAMYARSAVEQGEIRR 525

Query: 518 FLS------YAGNSEALRSRRNIGYHVVYKEGN--FYPINTLRNVALNQVSTPYVFLLDI 569
           F+       +A   + L              GN   +PIN LR +      T  +   D+
Sbjct: 526 FVMEILGPWFATRGKTLEVVLLSACPDPLNTGNDLVFPINMLRKICCAIAKTELILFTDV 585

Query: 570 DFLPMFGLYPYLKSSIRSMDMHGH-GGKKVLVVPAFETQ--------------------- 607
           D  P  GL     +SI+     G  G   +LV+PAF +                      
Sbjct: 586 DMQPSDGL----AASIQLAYSRGQLGPMDLLVLPAFRSNELWPLDVEVDVNEGNLSVNTF 641

Query: 608 -----------------------RYRTAFPAS---HAPTNFSRWVNATTPYQIEWAPDFE 641
                                  R  T++  +   H PT++ RW++A   Y+  +   +E
Sbjct: 642 SILFRDLRQHFATCSVYVPGIDCRGTTSWSGAGVFHMPTDYDRWLDAEDLYEASYLLGYE 701

Query: 642 PYIVAHRD--LPR-----YDTRFVGFGWNK 664
           PY+V +R   L R     YD R+V +GW+K
Sbjct: 702 PYVVLNRSSWLGRHGAGVYDDRYVFWGWDK 731


>gi|47205429|emb|CAG01190.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 192

 Score = 52.4 bits (124), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 32/81 (39%), Positives = 41/81 (50%), Gaps = 7/81 (8%)

Query: 260 KPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLT-----RLWTSLADQDIFNA 314
           +P P    GFN+GV+LLDL  +R  S   + +++  + L       R    L DQD F  
Sbjct: 78  EPHPDGLPGFNSGVMLLDLAAMRASSL--YNQLLEPEHLAQLAAQYRFRGHLGDQDFFTM 135

Query: 315 IISEHPYLVYTLPCQWNVQLS 335
           I  EHP L Y L C WN QL 
Sbjct: 136 IGMEHPELFYPLACGWNRQLC 156


>gi|68250084|ref|YP_249196.1| glycosyl transferase family protein [Haemophilus influenzae
           86-028NP]
 gi|68058283|gb|AAX88536.1| putative glycosyl transferase, glycosyl transferase family 8
           protein [Haemophilus influenzae 86-028NP]
          Length = 312

 Score = 52.4 bits (124), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 39/147 (26%), Positives = 72/147 (48%), Gaps = 13/147 (8%)

Query: 189 HYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQS 248
           H      L +L L ++LP+ + + I LD D+I    I +LW +           L+   S
Sbjct: 77  HIQSTASLFRLYLHQILPQHIERVIYLDMDLIIHQAIDELWDI------NLEDSLIAGVS 130

Query: 249 DWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLAD 308
           D++   LW++  P+    +  N GV+L++L K R+ +   ++   A K+    ++    D
Sbjct: 131 DFFSEYLWEH--PFYEKQQYINAGVMLINLNKWRENNIEQYFIEYAAKYGEFFVY---GD 185

Query: 309 QDIFNAIISEHPYLVYTLPCQWNVQLS 335
           QD+ N  I  +  L+  LP ++N+Q+ 
Sbjct: 186 QDVINFSIPTN--LIKLLPVKFNIQVK 210


>gi|198417085|ref|XP_002124537.1| PREDICTED: similar to glycosyltransferase 8 domain containing 4
           [Ciona intestinalis]
          Length = 390

 Score = 52.4 bits (124), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 51/187 (27%), Positives = 87/187 (46%), Gaps = 19/187 (10%)

Query: 188 KHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQ 247
           K++ G     +L LP VL +T    + +DTDVIF T   +LW  F     RQ   L   +
Sbjct: 135 KNWWGPCASFRLFLPTVLDKT-DSVLYVDTDVIFLTPPEELWRHFYLFNDRQVAALAP-R 192

Query: 248 SDWYLGKLWKNHKPWPALGRG----FNTGVILLDLTKLRDISWA----GFWRIIAEKFLL 299
             W   ++  ++  +  +  G     N+GV L++LT++R   +A       +I   K LL
Sbjct: 193 VGWSF-QVPNDNANFIRMQDGKKTQVNSGVFLMNLTRMRQPVFATESESRQKISWNKKLL 251

Query: 300 TRLWTS-----LADQDIFNAIISEHPYLVYTLPCQWNV--QLSDNTRSDELCYTELTD-L 351
             L+         DQ++ N +   +P L+Y LPC++N   +   +   +  C +  +D  
Sbjct: 252 FPLYRKHKEDMYGDQNLINLVFHYNPDLIYFLPCKYNYHHKFCFDAYRERWCTSAESDGA 311

Query: 352 KIIHWNS 358
            +IH N+
Sbjct: 312 AVIHGNA 318



 Score = 45.4 bits (106), Expect = 0.093,   Method: Compositional matrix adjust.
 Identities = 39/148 (26%), Positives = 68/148 (45%), Gaps = 12/148 (8%)

Query: 8   VIPTCEVIQVAIV-CAGYNST--RSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFS 64
           +  T E + +A+  C G + T  +   T+IKS + + ++ +H H+ T+ +  +  +  F+
Sbjct: 47  IFQTDETMHLAMAACRGKSLTIVKEATTMIKSAIIFSRSRIHVHIFTENLE-DEFRKEFA 105

Query: 65  TWSVPQVEVSFYLADSVV-----EDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLD 119
           TW +            +V     E++  + N  + G     +L LP VL +T    + +D
Sbjct: 106 TWPLSAKRKFLLTIHPLVYPLDPEELKVMKN--WWGPCASFRLFLPTVLDKT-DSVLYVD 162

Query: 120 TDVIFATDIAQLWALFSRLRQRQTIGLV 147
           TDVIF T   +LW  F     RQ   L 
Sbjct: 163 TDVIFLTPPEELWRHFYLFNDRQVAALA 190


>gi|240995558|ref|XP_002404618.1| N-acetyllactosaminide beta-1,3-N-acetylglucosaminyltransferase,
           putative [Ixodes scapularis]
 gi|215491620|gb|EEC01261.1| N-acetyllactosaminide beta-1,3-N-acetylglucosaminyltransferase,
           putative [Ixodes scapularis]
          Length = 315

 Score = 52.0 bits (123), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 45/169 (26%), Positives = 65/169 (38%), Gaps = 40/169 (23%)

Query: 532 RNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRS---- 587
           R++ +   Y  G  YP+N LRNVA     T +V +LD+D LP  GL+       R     
Sbjct: 133 RSLRFPENYALGVSYPVNLLRNVARRNSQTEFVLVLDVDLLPNDGLHESFVDFARENRLF 192

Query: 588 MDMHGHGGKKVLVVPAFETQRYRTAFPASHAPTNFSRWVNA--TTPYQIE---------- 635
           ++ H    K V V PAFE    R   P+     +  + V A    P+ +E          
Sbjct: 193 VESH-RDDKTVYVAPAFEV---RDGVPSPRDKASLLQRVEAMDARPFYLELCWKCQKHTD 248

Query: 636 --------------------WAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
                               W   +EP+       P YD RF  +G+N+
Sbjct: 249 YEAWQREPPGQRLAVLFEVLWRDPWEPFYFGRNSAPLYDERFRQYGFNR 297


>gi|195586289|ref|XP_002082910.1| GD11824 [Drosophila simulans]
 gi|194194919|gb|EDX08495.1| GD11824 [Drosophila simulans]
          Length = 978

 Score = 52.0 bits (123), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 41/158 (25%), Positives = 66/158 (41%), Gaps = 26/158 (16%)

Query: 204 VLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWP 263
           +   +L + I+LD D++F +D+  L+  F      Q  GL    +  Y   L++    +P
Sbjct: 786 IADRSLNRAILLDCDIVFRSDVRLLFNEFDNFLPHQLYGLAPELTPVYRHILYRYRVRYP 845

Query: 264 --ALGR---------------------GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLT 300
             + G                      G N+GV+LL L ++R+          +E   L 
Sbjct: 846 KTSFGNPYYPINNEGGNQHSRVHHGYPGLNSGVVLLLLNRIRNSKSYLEKLTHSEVHTLV 905

Query: 301 RLWT---SLADQDIFNAIISEHPYLVYTLPCQWNVQLS 335
             ++    L DQD F  +  E+P L+Y L C WN QL 
Sbjct: 906 AKYSFKGHLGDQDFFTLLGYEYPNLIYRLDCIWNRQLC 943


>gi|424766329|ref|ZP_18193682.1| glycosyltransferase, family 8 [Enterococcus faecium TX1337RF]
 gi|402411068|gb|EJV43448.1| glycosyltransferase, family 8 [Enterococcus faecium TX1337RF]
          Length = 302

 Score = 52.0 bits (123), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 57/205 (27%), Positives = 102/205 (49%), Gaps = 33/205 (16%)

Query: 188 KHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQ 247
           K++      +K+ L  +L + + K I LD+DV+   DI +LW+  +++     +G + N 
Sbjct: 77  KNHVSKAAYIKINLDTILSD-IDKVIYLDSDVLINDDIKELWSYSNKI---HYLGAIINP 132

Query: 248 SDWYLGKLW---KNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRI-IAEKFLLTRLW 303
              Y  +++   K+HK        FN+GV++L+L  +R   ++   ++ + EK  LTR  
Sbjct: 133 GYDYDNRVFGVSKDHKT-------FNSGVMMLNLKNIRKHEFSKKLKLFLDEKGHLTR-- 183

Query: 304 TSLADQDIFNAIISEHPYLVYTLPCQWNVQ---LSDNTRSDELCYTELTDLK----IIHW 356
             L DQ  FNA+  +       LP +WNVQ      +++  ++    L DL+    IIH+
Sbjct: 184 --LNDQAAFNAVFLDWQ----LLPEKWNVQYVFYMKSSKELDIDSIHLKDLRKNPSIIHF 237

Query: 357 NSPKK-LKVKNKHMEFFRNLYLTFL 380
            S  K  + +N H   F+  Y ++L
Sbjct: 238 TSNSKPWQYRNAHP--FKKKYYSYL 260


>gi|391338768|ref|XP_003743727.1| PREDICTED: xyloside xylosyltransferase 1-like [Metaseiulus
           occidentalis]
          Length = 295

 Score = 52.0 bits (123), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 36/118 (30%), Positives = 55/118 (46%), Gaps = 17/118 (14%)

Query: 209 LAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKL--WKNHKPWPALG 266
           L + I LDTD+ F  DI +LW +F    +   IG+ ++    Y  K   ++   P   +G
Sbjct: 171 LDRAIFLDTDLRFHDDIRKLWDIFPEFTEENIIGIADDLQPMYREKFADYREKNPETDVG 230

Query: 267 ------RGFNTGVILLDLTKLRDIS---W---AGFWRIIAEKFLLTRLWTSLADQDIF 312
                 +GFNTGV+L+ L ++R+ S   W       R + EK+        L  QD F
Sbjct: 231 SPHPGKQGFNTGVVLMHLGRMRNSSKYNWNFDEDHLRSLQEKYFFN---GDLGHQDFF 285


>gi|241248532|ref|XP_002402932.1| N-acetyllactosaminide beta-1,3-N-acetylglucosaminyltransferase,
           putative [Ixodes scapularis]
 gi|215496429|gb|EEC06069.1| N-acetyllactosaminide beta-1,3-N-acetylglucosaminyltransferase,
           putative [Ixodes scapularis]
          Length = 456

 Score = 51.6 bits (122), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 42/174 (24%), Positives = 65/174 (37%), Gaps = 42/174 (24%)

Query: 479 VTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFL---SYAGNSEALRSRRNIG 535
           VTL  Q + + LQ    L  HW+GP+S+  Y    + +  L   +Y         R N+ 
Sbjct: 131 VTLTTQATHEFLQHAPTLCSHWQGPVSVAAYSPGTDYEIVLNKIAYLRQCGDPCVRANMT 190

Query: 536 YHVVYKEGN-------------------------------------FYPINTLRNVALNQ 558
           +H+VY   +                                      YPIN LRN A  +
Sbjct: 191 WHIVYDREHRPARLQNATDFLASWNGSCSSDVMTREYAEYRKAHKMTYPINVLRNAARTR 250

Query: 559 VSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRYRTA 612
             T YV   DI+  P   + P     ++  ++      +V ++P FE Q  +TA
Sbjct: 251 AKTRYVLASDIELYPSGNIIPRFMRLLKQPNL--TQPHQVFILPIFEVQAKQTA 302


>gi|194378082|dbj|BAG57791.1| unnamed protein product [Homo sapiens]
          Length = 186

 Score = 51.6 bits (122), Expect = 0.001,   Method: Composition-based stats.
 Identities = 37/123 (30%), Positives = 52/123 (42%), Gaps = 34/123 (27%)

Query: 572 LPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPAS--------------- 616
           +P  GL+  L+     +D     G   LVVPAFE +R R   P +               
Sbjct: 2   VPSEGLWRGLR---EMLDQSNQWGGTALVVPAFEIRRARR-MPMNKNELVQLYQVGEVRP 57

Query: 617 ---------HAPTNFSRWVNATT------PYQIEWAPDFEPYIVAHRDLPRYDTRFVGFG 661
                     APTN+SRWVN          Y + W   +EP+ VA   +P +D RF  +G
Sbjct: 58  FYYGLCTPCQAPTNYSRWVNLPEESLLRPAYVVPWQDPWEPFYVAGGKVPTFDERFRQYG 117

Query: 662 WNK 664
           +N+
Sbjct: 118 FNR 120


>gi|322787502|gb|EFZ13590.1| hypothetical protein SINV_13647 [Solenopsis invicta]
          Length = 324

 Score = 51.2 bits (121), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 46/181 (25%), Positives = 69/181 (38%), Gaps = 50/181 (27%)

Query: 479 VTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEA----QQFLSYAGNS-EALRSR-- 531
           V L  Q S+++L  +  +  HW GP+S+ LY +  E     Q++L Y     E++R R  
Sbjct: 115 VCLATQSSVEKLHSLVQVALHWTGPMSVALYAAGDEEFEVLQRYLVYLRKCYESIRERVI 174

Query: 532 --------------------------------------RNIGYHVVYKEGNFYPINTLRN 553
                                                 R       ++  N YP N +RN
Sbjct: 175 FSLAVPKMRTPKKQPRVFELPDIVDCAKPEATLNEFMTRVPTEQTNWRIRNVYPQNHMRN 234

Query: 554 VALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRYRTAF 613
           +A     T YVFL D+D +P F L   L   +R    + +  K   V+P +E    R  F
Sbjct: 235 LARKNCQTDYVFLTDVDIVPSFNLTVVLDEFLR----NDNCDKCAYVIPTYEID-TRVRF 289

Query: 614 P 614
           P
Sbjct: 290 P 290


>gi|430860484|ref|ZP_19478083.1| hypothetical protein OI5_03257 [Enterococcus faecium E1573]
 gi|430551882|gb|ELA91632.1| hypothetical protein OI5_03257 [Enterococcus faecium E1573]
          Length = 302

 Score = 51.2 bits (121), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 57/205 (27%), Positives = 101/205 (49%), Gaps = 33/205 (16%)

Query: 188 KHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQ 247
           K++      +K+ L  +L + + K I LD+DV+   DI +LW+  +++     +G + N 
Sbjct: 77  KNHVSKAAYIKINLDTILSD-IDKVIYLDSDVLINDDIKELWSYSNKI---HYLGAIINP 132

Query: 248 SDWYLGKLW---KNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWR-IIAEKFLLTRLW 303
              Y  +++   K+HK        FN+GV++L+L  +R   ++   +  + EK  LTR  
Sbjct: 133 GYDYDNRVFGVSKDHKT-------FNSGVMMLNLKNIRKHEFSKKLKFFLDEKGHLTR-- 183

Query: 304 TSLADQDIFNAIISEHPYLVYTLPCQWNVQ---LSDNTRSDELCYTELTDLK----IIHW 356
             L DQ  FNA+  +       LP +WNVQ      +++  ++    L DL+    IIH+
Sbjct: 184 --LNDQAAFNAVFLDWQ----LLPEKWNVQYVFYMKSSKELDIDSIHLKDLRKNPSIIHF 237

Query: 357 NSPKK-LKVKNKHMEFFRNLYLTFL 380
            S  K  + +N H   F+  Y ++L
Sbjct: 238 TSNSKPWQYRNAHP--FKKKYYSYL 260


>gi|330996534|ref|ZP_08320416.1| glycosyltransferase, family 8 [Paraprevotella xylaniphila YIT
           11841]
 gi|329573090|gb|EGG54709.1| glycosyltransferase, family 8 [Paraprevotella xylaniphila YIT
           11841]
          Length = 309

 Score = 51.2 bits (121), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 39/140 (27%), Positives = 65/140 (46%), Gaps = 18/140 (12%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +  L  +LPE++ + + LD D++   DI  LW   + L     + +VE+     L +   
Sbjct: 90  RCFLSALLPESIDRVLYLDCDIVILGDITPLWR--TPLDAHTGVAVVEDTGCKELQRY-- 145

Query: 258 NHKPWPALGRGFNTGVILLDLTKLRDISWA----GFWRIIAEKFLLTRLWTSLADQDIFN 313
               +PA    FN+GV+L++L   R+   A     ++R   E+ +         DQD+ N
Sbjct: 146 EILQYPAEDSYFNSGVLLINLAYWREHHIAQACVDYYRAYPERIIFN-------DQDLLN 198

Query: 314 AIISEHPYLVYTLPCQWNVQ 333
            ++  H  LV     QWNVQ
Sbjct: 199 CVLHRHKTLV---GLQWNVQ 215


>gi|326202604|ref|ZP_08192472.1| glycosyl transferase family 8 [Clostridium papyrosolvens DSM 2782]
 gi|325987188|gb|EGD48016.1| glycosyl transferase family 8 [Clostridium papyrosolvens DSM 2782]
          Length = 502

 Score = 51.2 bits (121), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 40/149 (26%), Positives = 63/149 (42%), Gaps = 16/149 (10%)

Query: 190 YSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLW-----ALFSRLRQRQTIGLV 244
           Y G     ++ +P+++  ++ K I LD D++   DI +LW       F    +   I + 
Sbjct: 81  YFGYVTYFRIFIPEIVEASVRKVIYLDCDIVIKGDIRKLWENDISEYFVAAVEDVGIDIG 140

Query: 245 ENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWT 304
            N        + K H   P  G+ FN GV+L++L K R          I +  +  R   
Sbjct: 141 GN-----FATMVKKHIGIPRKGKYFNAGVLLINLDKWRADKTT---ETIRKYLIENREKI 192

Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQ 333
             ADQD  NA+  +       LP +WN Q
Sbjct: 193 YFADQDGLNAVFKDR---WLKLPIEWNQQ 218


>gi|392591311|gb|EIW80639.1| glycosyltransferase family 49 protein, partial [Coniophora puteana
           RWD-64-598 SS2]
          Length = 355

 Score = 50.8 bits (120), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 54/228 (23%), Positives = 96/228 (42%), Gaps = 45/228 (19%)

Query: 477 NDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLS------YAGNSEALRS 530
            D+TL   ++  R  + + L + ++GPIS+ ++   + A   ++      +   + + R 
Sbjct: 49  EDITLATLVTHSRFTVFKRLVERYDGPISVAVHHEWSHAHPAVASQLRALHVLYTSSPRF 108

Query: 531 RRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDM 590
              +  H+V        +NT RNVA     T  V +LD+DF     +    + ++ ++ +
Sbjct: 109 AERVDVHLVLAPHE-RALNTWRNVARQLARTGLVMMLDVDF----AVCTDFRGALLNLGV 163

Query: 591 --HGHGGKKVLVVPAFETQRYRTA-----FP-----------------------ASHAPT 620
                      +VPAFE  R R       FP                         H  T
Sbjct: 164 VGARDAAGAGAIVPAFEYTRLRDGMDSRTFPRDKQSLLALVHSGKLDMFHRSWQPGHNST 223

Query: 621 NFSRWVNATTP--YQIE-WAPDFEPYIVAHRDLPRY-DTRFVGFGWNK 664
           +++R+  A     Y+++ +   +EPY+V  RD P + D RFVG+G NK
Sbjct: 224 DYTRYYAAAPGEVYKVDRYQAAYEPYVVFRRDGPPWCDERFVGYGGNK 271


>gi|298706849|emb|CBJ25813.1| Glycosyltransferase, family GT49 [Ectocarpus siliculosus]
          Length = 1541

 Score = 50.8 bits (120), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 63/248 (25%), Positives = 84/248 (33%), Gaps = 84/248 (33%)

Query: 479  VTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDA----------EAQQFLSYAGNS--E 526
            VTLV Q S DRL  V      W G  SL +++  A          E ++           
Sbjct: 1289 VTLVTQCSTDRLPAVLEQALRWGGDTSLAVHVPSAPLSAKDETMSEIRRLCQRVDQEVRA 1348

Query: 527  ALRSRRNIGYHVVYKEGN------------FYPINTLRNVALNQV------STPYVFLLD 568
            A   R+ +   V   EG              YP+NTLRN+AL Q           VFL+D
Sbjct: 1349 APNRRKQLSVDVAILEGAEADPARHDHCGPLYPVNTLRNLALIQARDDSLHPAQAVFLVD 1408

Query: 569  IDFLPMFGLYPYLKSSIRSMDMHGHGGKK--VLVVPAFE------TQRYRTAFPAS---- 616
             D LP   L   L S      ++     +   +V+P  E      + R+  + P+S    
Sbjct: 1409 CDCLPSEHLLEELHSQEVQDRLNAESSARPAAVVIPCLEFTPGSVSARHDKSIPSSVQKV 1468

Query: 617  --------------------HAPTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRYDTR 656
                                H PT+F R                        DLP YD R
Sbjct: 1469 VEMLAEGTAQGFHVDYFFKGHGPTDFLRKT----------------------DLPMYDER 1506

Query: 657  FVGFGWNK 664
            F G+G NK
Sbjct: 1507 FRGYGMNK 1514


>gi|227872037|ref|ZP_03990417.1| lipopolysaccharide biosynthesis protein, LPS:glycosyltransferase
           [Oribacterium sinus F0268]
 gi|227842114|gb|EEJ52364.1| lipopolysaccharide biosynthesis protein, LPS:glycosyltransferase
           [Oribacterium sinus F0268]
          Length = 324

 Score = 50.8 bits (120), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 39/139 (28%), Positives = 65/139 (46%), Gaps = 18/139 (12%)

Query: 196 LLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKL 255
           LL+L +P  LPET+ K + LD D +    I  L+ L  RL  +  +G+    S      +
Sbjct: 88  LLRLFIPSTLPETITKALYLDADTVVLQSILSLYHL--RLGDK-LLGMAPEPS------I 138

Query: 256 WKNHKPWPALGRG---FNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIF 312
           +K HK + +L      +N GV+L++L+ LR+    G        + +        DQDI 
Sbjct: 139 YKKHKEFLSLAEESPYYNAGVMLMNLSLLRE---EGMEEKCLRYYQMKEGQLPFNDQDIL 195

Query: 313 NAIISEHPYLVYTLPCQWN 331
           N +       + +LP ++N
Sbjct: 196 NMVCKGR---IRSLPQRFN 211



 Score = 40.8 bits (94), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 35/129 (27%), Positives = 61/129 (47%), Gaps = 14/129 (10%)

Query: 45  LHFHLITDTVALNILQTLFSTWSVPQVEVSFY-LADSVVEDVSWIPNKHYSGVYG---LL 100
           LHFH+++  +     + L         ++SFY L + + E    +P   ++G +    LL
Sbjct: 31  LHFHILSLGIEEESKEKLRQYVETEGQKISFYDLEEKLSEWKEKLPAL-FTGKFSKATLL 89

Query: 101 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 160
           +L +P  LPET+ K + LD D +    I  L+ L  RL  +  +G+    S      ++K
Sbjct: 90  RLFIPSTLPETITKALYLDADTVVLQSILSLYHL--RLGDK-LLGMAPEPS------IYK 140

Query: 161 NHKPWPALG 169
            HK + +L 
Sbjct: 141 KHKEFLSLA 149


>gi|300122829|emb|CBK23836.2| unnamed protein product [Blastocystis hominis]
          Length = 343

 Score = 50.4 bits (119), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 38/148 (25%), Positives = 70/148 (47%), Gaps = 22/148 (14%)

Query: 475 DGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQ---------------FL 519
           D +DVTL+  L++ R   +  L K W+  I++ +++++ E  +               F+
Sbjct: 169 DYDDVTLITILTLVRKDRLPYLMKRWKHRINMVIFVTENEIDELREVIDSYKSYKRITFI 228

Query: 520 SYAGNSEALRSRRNIGYH---VVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFG 576
            Y   S      ++I YH    VY+    +PIN LR++A+  + T + F+ DID      
Sbjct: 229 LYIVKSRIGSRYQSIYYHGKKPVYRNNTIFPINVLRDLAIEFIDTTHYFVSDIDVFSSDT 288

Query: 577 LYPYLKSSIRSMDMHGHGGKKVLVVPAF 604
           LY  L+   +++       K VL++ +F
Sbjct: 289 LYSTLQIHNQTLK----DPKAVLLLKSF 312


>gi|300176055|emb|CBK23366.2| unnamed protein product [Blastocystis hominis]
          Length = 343

 Score = 50.4 bits (119), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 38/148 (25%), Positives = 70/148 (47%), Gaps = 22/148 (14%)

Query: 475 DGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQ---------------FL 519
           D +DVTL+  L++ R   +  L K W+  I++ +++++ E  +               F+
Sbjct: 169 DYDDVTLITILTLVRKDRLPYLMKRWKHRINMVIFVTENEIDELREVIDSYKSYKRITFI 228

Query: 520 SYAGNSEALRSRRNIGYH---VVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFG 576
            Y   S      ++I YH    VY+    +PIN LR++A+  + T + F+ DID      
Sbjct: 229 LYIVKSRIGSRYQSIYYHGKKPVYRNNTIFPINVLRDLAIEFIDTTHYFVSDIDVFSSDT 288

Query: 577 LYPYLKSSIRSMDMHGHGGKKVLVVPAF 604
           LY  L+   +++       K VL++ +F
Sbjct: 289 LYSTLQIHNQTLK----DPKAVLLLKSF 312


>gi|332881420|ref|ZP_08449070.1| glycosyltransferase, family 8 [Capnocytophaga sp. oral taxon 329
           str. F0087]
 gi|357045165|ref|ZP_09106802.1| glycosyltransferase, family 8 [Paraprevotella clara YIT 11840]
 gi|332680796|gb|EGJ53743.1| glycosyltransferase, family 8 [Capnocytophaga sp. oral taxon 329
           str. F0087]
 gi|355531748|gb|EHH01144.1| glycosyltransferase, family 8 [Paraprevotella clara YIT 11840]
          Length = 309

 Score = 50.1 bits (118), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 39/140 (27%), Positives = 64/140 (45%), Gaps = 18/140 (12%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +  L  +LPE + + + LD D++   DI  LW   + L     + +VE+     L +   
Sbjct: 90  RCFLSALLPEDIDRVLYLDCDIVILGDITPLWR--TPLDAHTGVAVVEDTGCKELQRY-- 145

Query: 258 NHKPWPALGRGFNTGVILLDLTKLRDISWA----GFWRIIAEKFLLTRLWTSLADQDIFN 313
               +PA    FN+GV+L++L   R+   A     ++R   E+ +         DQD+ N
Sbjct: 146 EILQYPAEDSYFNSGVLLINLVYWREHHIAQACVDYYRTYPERIIFN-------DQDLLN 198

Query: 314 AIISEHPYLVYTLPCQWNVQ 333
            ++  H  LV     QWNVQ
Sbjct: 199 CVLHRHKTLV---GLQWNVQ 215


>gi|347532720|ref|YP_004839483.1| glycosyl transferase [Roseburia hominis A2-183]
 gi|345502868|gb|AEN97551.1| glycosyl transferase [Roseburia hominis A2-183]
          Length = 333

 Score = 50.1 bits (118), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 49/164 (29%), Positives = 69/164 (42%), Gaps = 30/164 (18%)

Query: 190 YSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVE---- 245
           Y  V    +L +P++  E   + + LD D++  +DIAQL   F    Q + +G V     
Sbjct: 91  YLSVMTYARLLIPQIF-ENFERVVYLDCDMVCNSDIAQL---FHADMQDKPLGAVADTVL 146

Query: 246 NQSDWY------LGKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLL 299
           N   W+        +  K        GR FN GVIL D+ +LR+           EK L 
Sbjct: 147 NMEAWHNPNSEETKQYLKETVGITTEGRYFNGGVILFDIDRLRE---------DGEKLLA 197

Query: 300 T---RLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRS 340
               R W   ADQD+ N I  E    V+    QWNV +  N + 
Sbjct: 198 CARERQW-RWADQDVLNHIYKER---VFYFDLQWNVIVISNLKQ 237


>gi|257784421|ref|YP_003179638.1| glycosyl transferase family protein [Atopobium parvulum DSM 20469]
 gi|257472928|gb|ACV51047.1| glycosyl transferase family 8 [Atopobium parvulum DSM 20469]
          Length = 358

 Score = 50.1 bits (118), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 51/170 (30%), Positives = 79/170 (46%), Gaps = 23/170 (13%)

Query: 171 GFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWA 230
           GF   D+ + D+  +P+  +       +L  P +LP  + K I LD+D++  TDIA+L+ 
Sbjct: 77  GFLDVDAALGDIE-LPHHGHFRPETYYRLLAPSLLP-NVNKAIYLDSDLVVNTDIAELYD 134

Query: 231 ------LFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDI 284
                 L    R   TIG ++   D  +G   KN          F  GVIL++L ++R  
Sbjct: 135 IDITGYLVGATRDADTIGQIDGY-DATVGPYLKNELGMDDPHDYFQAGVILMNLEEIRK- 192

Query: 285 SWAGFWRIIAEKFL---LTRLWTSLADQDIFNAIISEHPYLVYTLPCQWN 331
                 +I  E+FL     R W  L DQD+ N  ++ H YL   +  +WN
Sbjct: 193 ------QISPEEFLKVSTMRTWRWL-DQDVLNRFVNGH-YLRINM--KWN 232


>gi|326486345|gb|ADZ76178.1| putative sugar transferase [Campylobacter jejuni subsp. jejuni]
          Length = 456

 Score = 49.3 bits (116), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 42/151 (27%), Positives = 71/151 (47%), Gaps = 21/151 (13%)

Query: 188 KHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQ 247
           KHY+  +   +L L  +LPE   K + LD D++  +D+ +L+ L     Q   I    + 
Sbjct: 111 KHYATYF---RLYLGSILPENCEKCLYLDVDMLVFSDLRELFCL---DLQDYIIAASPDI 164

Query: 248 SDW----YLGKLWKNHKPWPALGRG---FNTGVILLDLTKLRDISWAGFWRIIAEKFLLT 300
             W    ++GK  K++     + +G   FN G +L++L + R  S       +A  +   
Sbjct: 165 KQWPGSLFVGKSKKDNVRDLIIEKGPLYFNAGFMLINLKQWRLESLEKRMVNVANSYTF- 223

Query: 301 RLWTSLADQDIFNAIISEHPYLVYTLPCQWN 331
                + DQDI N +I++    V  LPC+WN
Sbjct: 224 ----EVGDQDILNYVINKK---VIILPCKWN 247


>gi|350410076|ref|XP_003488938.1| PREDICTED: N-acetyllactosaminide
           beta-1,3-N-acetylglucosaminyltransferase-like [Bombus
           impatiens]
          Length = 449

 Score = 49.3 bits (116), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 56/241 (23%), Positives = 83/241 (34%), Gaps = 68/241 (28%)

Query: 478 DVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQ--------------------- 516
            VTL    + D++  +  L + WEGP+SL ++    +A                      
Sbjct: 126 QVTLCTHATADQVYGIVELARRWEGPLSLAVFTPGLDAGIAVALLDRACRCEPEMYKVSV 185

Query: 517 -------------QFLSYAGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPY 563
                        Q     G+  A   +R  G     +    YPIN  RNVA  Q +T  
Sbjct: 186 HLVFPASRPPALGQITRIHGDCAASDLQRGDGGTERKRRSMIYPINVARNVARIQANTTR 245

Query: 564 VFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQR--------------- 608
           V + DI+ LP   L       +R        G+ V VVP FE +                
Sbjct: 246 VLVTDIELLPSQRLASGFMEIVREKTPK---GRIVFVVPVFEIESNEVPPLTKKELLSAT 302

Query: 609 ------YRTAFPASHAP--TNFSRWVNATTPYQIEWAPDFEPYIVAHRDLP--RYDTRFV 658
                 Y   F  SH       SRW+    P ++      +P I+  R+ P  R++  F+
Sbjct: 303 KAGLAVYFHRFLCSHCQRFPGLSRWLIRPDPGKV------KPLIITKREYPHHRWEPVFI 356

Query: 659 G 659
           G
Sbjct: 357 G 357


>gi|195028428|ref|XP_001987078.1| GH20168 [Drosophila grimshawi]
 gi|193903078|gb|EDW01945.1| GH20168 [Drosophila grimshawi]
          Length = 208

 Score = 49.3 bits (116), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 39/146 (26%), Positives = 60/146 (41%), Gaps = 36/146 (24%)

Query: 551 LRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRYR 610
           +RN+A     T YVFL DID +P       L +  +++   G+      V+P FE    R
Sbjct: 1   MRNIARKGCQTKYVFLTDIDIIPSLNSVVELNNFFKTVICEGYCA---YVIPTFEID-VR 56

Query: 611 TAFPAS------------------------HAPTNFSRWVN---ATTPYQIEWAPD---- 639
            AFP S                           TNFS+W++   + T  +I         
Sbjct: 57  AAFPRSKDLLRGLIKKGLARPFHEKVFIYNQYATNFSKWLSTNRSETAVRISHTVTNFEF 116

Query: 640 -FEPYIVAHRDLPRYDTRFVGFGWNK 664
            +EP+ +A    P +D RF G+G+ +
Sbjct: 117 LYEPFYIAIDSAPAHDERFTGYGFTR 142


>gi|326486369|gb|ADZ76201.1| putative sugar transferase [Campylobacter jejuni subsp. jejuni]
          Length = 456

 Score = 49.3 bits (116), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 42/151 (27%), Positives = 71/151 (47%), Gaps = 21/151 (13%)

Query: 188 KHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQ 247
           KHY+  +   +L L  +LPE   K + LD D++  +D+ +L+ L     Q   I    + 
Sbjct: 111 KHYATYF---RLYLGSILPENCEKCLYLDVDMLVFSDLRELFCL---DLQDYIIAASPDI 164

Query: 248 SDW----YLGKLWKNHKPWPALGRG---FNTGVILLDLTKLRDISWAGFWRIIAEKFLLT 300
             W    ++GK  K++     + +G   FN G +L++L + R  S       +A  +   
Sbjct: 165 KQWPGSLFVGKSKKDNVRDLIIEKGPLYFNAGFMLINLKQWRLESLEKRMVNVANSYTF- 223

Query: 301 RLWTSLADQDIFNAIISEHPYLVYTLPCQWN 331
                + DQDI N +I++    V  LPC+WN
Sbjct: 224 ----EVGDQDILNYVINKK---VIILPCKWN 247


>gi|384244587|gb|EIE18087.1| hypothetical protein COCSUDRAFT_55028 [Coccomyxa subellipsoidea
           C-169]
          Length = 136

 Score = 49.3 bits (116), Expect = 0.007,   Method: Composition-based stats.
 Identities = 41/131 (31%), Positives = 59/131 (45%), Gaps = 17/131 (12%)

Query: 250 WYLGKLWKNHKPWPALGR-GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLAD 308
           WY G      K  P  G  G N GV+LL+L +LR+  +A     I   F   +    L D
Sbjct: 14  WYKGGF---SKSLPFYGEYGLNAGVLLLNLDELRESHFAQERDKIIRHFHPKKA-LPLGD 69

Query: 309 QDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWNSPKKLKVKNKH 368
           QD+ NA  S++P  ++ + C +N       RSD  CY       I+H N      +KN+ 
Sbjct: 70  QDVLNAFASKYPTRLHVMSCVFNF------RSDSACYKGFP--AILHGNR----NLKNEF 117

Query: 369 MEFFRNLYLTF 379
              + +LY  F
Sbjct: 118 SSTYSSLYKLF 128


>gi|300175712|emb|CBK21255.2| unnamed protein product [Blastocystis hominis]
          Length = 499

 Score = 49.3 bits (116), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 43/165 (26%), Positives = 72/165 (43%), Gaps = 26/165 (15%)

Query: 478 DVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRNIGYH 537
           D T V   ++DRL+  ++L + W+GP S+ +++   +      +  ++  +   R + Y 
Sbjct: 293 DFTYVTHTTVDRLRYFKVLRELWKGPFSIAIFLRRWDVNTVDRWVRSNSDVPHLRLLFYV 352

Query: 538 VVYK---------------------EGNF-YPINTLRNVALNQVSTPYVFLLDIDFLPMF 575
           V                        EG F YPIN LR++A+  V T +   LD+D  P +
Sbjct: 353 VPSNLHFPKTDYSLWFDRSSMRKITEGAFLYPINLLRDIAIMNVVTTHYINLDMDLWPSY 412

Query: 576 GLYPYLKSSIRSMDMHGHGGKKVLVVPAFE-TQRYRTAFPASHAP 619
             Y  L    R +       K V ++PAF+ T+R+R     S  P
Sbjct: 413 STYQNLHRIPREL---LDPWKNVFILPAFQLTERFRNRILQSGNP 454


>gi|419633523|ref|ZP_14165956.1| glycosyl transferase family protein [Campylobacter jejuni subsp.
           jejuni LMG 23269]
 gi|419667488|ref|ZP_14197457.1| glycosyl transferase family protein [Campylobacter jejuni subsp.
           jejuni 1997-10]
 gi|419688439|ref|ZP_14216762.1| glycosyl transferase family protein [Campylobacter jejuni subsp.
           jejuni 1854]
 gi|380612022|gb|EIB31561.1| glycosyl transferase family protein [Campylobacter jejuni subsp.
           jejuni LMG 23269]
 gi|380645936|gb|EIB62940.1| glycosyl transferase family protein [Campylobacter jejuni subsp.
           jejuni 1997-10]
 gi|380665463|gb|EIB81031.1| glycosyl transferase family protein [Campylobacter jejuni subsp.
           jejuni 1854]
          Length = 483

 Score = 48.9 bits (115), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 42/151 (27%), Positives = 71/151 (47%), Gaps = 21/151 (13%)

Query: 188 KHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQ 247
           KHY+  +   +L L  +LPE   K + LD D++  +D+ +L+ L     Q   I    + 
Sbjct: 111 KHYATYF---RLYLGSILPENCEKCLYLDVDMLVFSDLRELFCL---DLQDYIIAASPDI 164

Query: 248 SDW----YLGKLWKNHKPWPALGRG---FNTGVILLDLTKLRDISWAGFWRIIAEKFLLT 300
             W    ++GK  K++     + +G   FN G +L++L + R  S       +A  +   
Sbjct: 165 KQWPGSLFVGKSKKDNVRDLIIEKGPLYFNAGFMLINLKQWRLESLEKRMVNVANSYTF- 223

Query: 301 RLWTSLADQDIFNAIISEHPYLVYTLPCQWN 331
                + DQDI N +I++    V  LPC+WN
Sbjct: 224 ----EVGDQDILNYVINKK---VIILPCKWN 247


>gi|341881070|gb|EGT37005.1| hypothetical protein CAEBREN_10005 [Caenorhabditis brenneri]
          Length = 607

 Score = 48.9 bits (115), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 43/179 (24%), Positives = 75/179 (41%), Gaps = 41/179 (22%)

Query: 468 DEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEA 527
           + +  S D   V+LV   + D L+  E + K W+GPIS+ +++ D ++QQ L Y  +   
Sbjct: 256 ESFRVSDDLEPVSLVTHGTSDMLKHFENMPKMWDGPISIGVFI-DFDSQQVLQYLTDLHQ 314

Query: 528 LRSR--RNIGYHVVYKEGNF------------------------------------YPIN 549
              +  R +  H  Y++  F                                    YP N
Sbjct: 315 CDEQFSRKVTVHFAYRKSAFQKECLIRESPKTNTPCQKFLRNKVSYAKKIPSPFFLYPYN 374

Query: 550 TLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQR 608
            +RN+A     +   FL+D D +   G+   +K+    M + G+  K VL+V  FE+++
Sbjct: 375 LMRNLARKGAKSDLHFLMDGDMIVSNGMAEKVKTIANEM-LDGN-QKNVLLVRRFESEK 431


>gi|427412560|ref|ZP_18902752.1| hypothetical protein HMPREF9282_00159 [Veillonella ratti
           ACS-216-V-Col6b]
 gi|425716367|gb|EKU79351.1| hypothetical protein HMPREF9282_00159 [Veillonella ratti
           ACS-216-V-Col6b]
          Length = 310

 Score = 48.9 bits (115), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 46/190 (24%), Positives = 81/190 (42%), Gaps = 59/190 (31%)

Query: 197 LKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALF-------------------SRLRQ 237
           L+L +PK+LP+++ K I  DTD++   D+A+LW L                    SR  +
Sbjct: 87  LRLMIPKLLPDSVRKAIYFDTDLVVIGDVAELWQLSLDGHPVGATVDLGIMSSKRSRREK 146

Query: 238 RQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAG--FWRIIAE 295
            ++IGL  N+SD Y                 FN+G++++D+++ R  ++       I A 
Sbjct: 147 HESIGL--NESDDY-----------------FNSGMMVIDVSRWRVENYGTEVLTEITAH 187

Query: 296 KFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNV---------QLSDNTRSDELCYT 346
           +F          DQD  N +   +      LP +WN+         ++  + R  +  + 
Sbjct: 188 QF-------RHHDQDGLNKVFKNN---WQELPLRWNIIPPVFSLPLKILCSGRWRKKAFE 237

Query: 347 ELTDLKIIHW 356
            L    +IHW
Sbjct: 238 ALKSPAVIHW 247



 Score = 44.7 bits (104), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 42/161 (26%), Positives = 71/161 (44%), Gaps = 27/161 (16%)

Query: 17  VAIVCAGY-NSTRSLVTLIKSILFYRKNP--LHFHLITDTVALNILQTLFSTWSVPQVEV 73
           +AIV A   N  +       SIL +   P  +  ++++D ++    Q + +T    +  V
Sbjct: 4   IAIVLAADDNYAQHAAVTAASILLHTTEPQRVTLYILSDGISEIKQQKIEATIKDLKGRV 63

Query: 74  SFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWA 133
                D   E +    + H S     L+L +PK+LP+++ K I  DTD++   D+A+LW 
Sbjct: 64  QLIPVDG--EAIKGFTSGHISKA-AYLRLMIPKLLPDSVRKAIYFDTDLVVIGDVAELWQ 120

Query: 134 LF-------------------SRLRQRQTIGLVENQSDWYL 155
           L                    SR  + ++IGL  N+SD Y 
Sbjct: 121 LSLDGHPVGATVDLGIMSSKRSRREKHESIGL--NESDDYF 159


>gi|218442752|ref|YP_002381072.1| glycosyl transferase family 8 [Cyanothece sp. PCC 7424]
 gi|218175110|gb|ACK73842.1| glycosyl transferase family 8 [Cyanothece sp. PCC 7424]
          Length = 347

 Score = 48.5 bits (114), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 50/176 (28%), Positives = 80/176 (45%), Gaps = 25/176 (14%)

Query: 197 LKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQS--DWYLGK 254
            +L LP++LP  + + I LD+D++   ++A LW         Q +G     +  D+    
Sbjct: 99  FRLLLPELLPTQVERVIYLDSDLVVEGNLANLW--------EQELGNCPAVAVQDYVFPY 150

Query: 255 LWKNHKPWPALGRGFNT-----GVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQ 309
           +    K +  LG   NT     GV+L+++ + R        R I E          LADQ
Sbjct: 151 VCNGLKTYQQLGLASNTPYCNAGVMLINIKQWR---IEALNRKILEYIRKFYDLVYLADQ 207

Query: 310 DIFNAIISEHPYLVYTLPCQWNVQLSD--NTRSDELCYTE--LTDLKIIHWNSPKK 361
           D  NA+I+    L   L  +WNVQ+    N + D LC  +  + D  I+H+ +P K
Sbjct: 208 DGINALIANRFKL---LDLKWNVQIFGVYNGKIDLLCKPKELIRDAFILHFTTPIK 260


>gi|308494240|ref|XP_003109309.1| hypothetical protein CRE_08184 [Caenorhabditis remanei]
 gi|308246722|gb|EFO90674.1| hypothetical protein CRE_08184 [Caenorhabditis remanei]
          Length = 389

 Score = 48.5 bits (114), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 45/177 (25%), Positives = 71/177 (40%), Gaps = 43/177 (24%)

Query: 479 VTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRS--RRNIGY 536
           +TL    + + ++ +E    +W+GPISL L++ D  +Q+ L Y  +        RR +  
Sbjct: 66  ITLAVHGTPEMMETIERKPSNWDGPISLALFI-DYHSQRALEYLSDVHRCDQEFRRKVTV 124

Query: 537 HVVYK----------------------------------EGNF--YPINTLRNVALNQVS 560
           H  ++                                   G+F  YPIN +RN+A     
Sbjct: 125 HFAFRLSPFQKSCPLINVTSHHQDCKEFLKNRAKYRNEIAGSFQLYPINLMRNIARRGAK 184

Query: 561 TPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPASH 617
           +   F+ DID +   G    +K    S +M     KKVLVV  FE+    T  P +H
Sbjct: 185 SDIHFIADIDMIMSEGFATKVKQI--SNEMIDGKSKKVLVVRRFESN--HTTIPTNH 237


>gi|428162183|gb|EKX31362.1| hypothetical protein GUITHDRAFT_122441 [Guillardia theta CCMP2712]
          Length = 737

 Score = 48.5 bits (114), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 55/257 (21%), Positives = 93/257 (36%), Gaps = 74/257 (28%)

Query: 479 VTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLS-YAGNSEAL-RSRRNIGY 536
           VT+    ++DR  ++     HW+  +++  Y    E ++ L  Y  +S     S RN   
Sbjct: 281 VTMATISTVDRATVLMNTAAHWKEDMAVAFYARSPEDERDLRRYVADSLGPWFSSRNKSL 340

Query: 537 HVVY------KEGNF----YPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIR 586
             V       +E       +PIN LR +A +   T  +   D+D +    +  +++    
Sbjct: 341 EAVMLSVCPNREDPLKILVFPINMLRKIACSIAKTDLILYTDVDMIASDDIASHIRQVYE 400

Query: 587 SMDMHGHGGKKVLVVPAFE-----------------TQRYRTAFPAS------------- 616
              +     K +LVVP+F+                 TQ   T  P +             
Sbjct: 401 GTRLKP---KDLLVVPSFKGSSSWPPVAEIQRGNNTTQVQVTTHPVTVADVIEHFIHCRV 457

Query: 617 ----------------------HAPTNFSRWVNATTPYQIEWAPDFEPYIVAHRD----- 649
                                 HAPT++ RW+     Y++ +   +EPY V +R      
Sbjct: 458 HIPGMRCGMWTLASRWSEDGVFHAPTSYDRWMEEKEVYEVSYVVGYEPYFVVNRSAWEGR 517

Query: 650 --LPRYDTRFVGFGWNK 664
                YD RFV +GW+K
Sbjct: 518 HGAGIYDDRFVHWGWDK 534


>gi|159476896|ref|XP_001696547.1| hypothetical protein CHLREDRAFT_187123 [Chlamydomonas reinhardtii]
 gi|158282772|gb|EDP08524.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 417

 Score = 48.5 bits (114), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 55/242 (22%), Positives = 92/242 (38%), Gaps = 61/242 (25%)

Query: 479 VTLVAQLSMDRLQMVEMLFKHWEG-PISLTLYMSDAEA---------QQFLSYAGNS-EA 527
           VT+    ++DRL M+E   K + G P +  +Y+   +          +Q ++ A +  +A
Sbjct: 31  VTIFTTATLDRLDMLETQCKSYPGGPHAAAVYVPLVQEHGGELTIPNEQLIALAESRVQA 90

Query: 528 LRSRRN-----IGYHVVY--------KEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPM 574
           L  R          HV+            +  PIN LRN AL  V TP + ++D+D    
Sbjct: 91  LFDRMEGIVNACHMHVLLLYEVTSDPALAHLTPINGLRNAALLAVRTPLLVMVDVDLCVS 150

Query: 575 FGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRYRTA---------------------- 612
             L  +L        +         V+PA++     +                       
Sbjct: 151 ASLVRFLSEPSNGDKLVASSSNTFWVLPAWDVSAGLSRVEIDHIAESALAGDKMTLAKLW 210

Query: 613 ------------FPASHAPTNFSRWVNATTPYQIEWAPDFEPYIVAHRD---LPRYDTRF 657
                       F   HAPTN++RW+ +   Y I+++  +EP+ +  RD   +  YD RF
Sbjct: 211 ASGRLHWFGQLYFQLGHAPTNYTRWLLSRKSYLIDYSIGYEPWGILSRDRQVVVPYDARF 270

Query: 658 VG 659
            G
Sbjct: 271 RG 272


>gi|412991019|emb|CCO18391.1| predicted protein [Bathycoccus prasinos]
          Length = 601

 Score = 48.1 bits (113), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 56/261 (21%), Positives = 99/261 (37%), Gaps = 73/261 (27%)

Query: 476 GNDVTLVAQLSMDRLQMV-EMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRN- 533
             ++ L A  ++DRL+ V   L + W G +S+ +  S+ + ++ +      E  R+R   
Sbjct: 179 AREICLFAPSTLDRLKSVLPRLAQTWSGTMSVAVLASEEDVRREMFGTSTREFTRNRLTV 238

Query: 534 IGYHVVYKEGNFYPINTLRNVALN----QVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMD 589
           I    + +    +P+N LRN+AL+     ++  YV L D+DF  +F   P  +      +
Sbjct: 239 IAVDPLPEYETRFPVNALRNLALSGCKRTLNATYVVLHDVDF-EIFPDAPSKELLEEIEE 297

Query: 590 MHGHGGKKVLVVPAF--------ETQRYRTA----------------------------- 612
           +     K  LV+P+F          + +R A                             
Sbjct: 298 VLKPNAKHGLVLPSFTADDRYLRRVETFRKAHDFSRESLDVLNSFNRKEKLVELIKEYGI 357

Query: 613 --------FPASHAPTNFSRWVNATTPYQIEWAPDFEPYIVAHR---------------- 648
                   +P +HA TN S+W++     + + A    PY V                   
Sbjct: 358 VESFREKYWPVAHAATNVSKWLSMIIAEKKKDAEKHRPYSVGANHLGSRHPYYYEPWIIL 417

Query: 649 -----DLPRYDTRFVGFGWNK 664
                D+P +D  FV +G+NK
Sbjct: 418 RADVDDMPAFDESFVTYGFNK 438


>gi|237750590|ref|ZP_04581070.1| conserved hypothetical protein [Helicobacter bilis ATCC 43879]
 gi|229373680|gb|EEO24071.1| conserved hypothetical protein [Helicobacter bilis ATCC 43879]
          Length = 364

 Score = 48.1 bits (113), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 47/198 (23%), Positives = 86/198 (43%), Gaps = 45/198 (22%)

Query: 162 HKPWPALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 221
           +K +P   R + L+DS+ + +  + N + +      +L +   LP+ +   + LD D+I 
Sbjct: 73  NKIYPCEFRVYTLSDSIFQGLPKLNNNYLA----YFRLKIASCLPQDIKTCLYLDVDMIC 128

Query: 222 ATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRG------------- 268
             DI +   +F    Q +  G+V          L  +H+ +  L R              
Sbjct: 129 VADIRE---IFYTDLQGKICGVV----------LVPDHQQYCVLKRNSAIGDEFVFNAST 175

Query: 269 -FNTGVILLDLTKLRDISWAGFWRIIAEKFL--LTRLWTSLADQDIFNAIISEHPYLVYT 325
            FN+G++L+D+ + R  +       + +K L    +    L DQD  NA++ +H   +  
Sbjct: 176 YFNSGLMLIDVEQYRKYN-------VEQKCLEWFEQYVPVLLDQDALNAVLGDH---ICA 225

Query: 326 LPCQWN--VQLSDNTRSD 341
           LP +WN  V+L    R D
Sbjct: 226 LPLEWNFFVELLKYKRQD 243


>gi|308494062|ref|XP_003109220.1| hypothetical protein CRE_08183 [Caenorhabditis remanei]
 gi|308246633|gb|EFO90585.1| hypothetical protein CRE_08183 [Caenorhabditis remanei]
          Length = 326

 Score = 48.1 bits (113), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 54/221 (24%), Positives = 84/221 (38%), Gaps = 56/221 (25%)

Query: 479 VTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRS--RRNIGY 536
           +TL    + + ++ +E    +W+GPISL L++ D  +Q+ L Y  +        RR +  
Sbjct: 27  ITLAVHGTPEMMETIEKKPSNWDGPISLALFI-DYHSQRALEYLSDVHRCDQEFRRKMTA 85

Query: 537 HVVYK----------------------------------EGNF--YPINTLRNVALNQVS 560
           H  ++                                   G+F  YPIN +RN+A     
Sbjct: 86  HFTFRLSPFQKSCPLINVTSHHQDCKEFLKNRAKYRNEIAGSFQLYPINLMRNIARRGAK 145

Query: 561 TPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRYRTAFPASHAPT 620
           +   F+ DID +   G    +K    S +M     KKVLVV  F  Q +   F   H   
Sbjct: 146 SDIHFIADIDMIMSEGFATKVKRI--SNEMIDGKSKKVLVVRVF--QFHHRHFFVGHQIP 201

Query: 621 NFSRWVNATT----------PYQIEWAPDFEPYIVAHRDLP 651
           N S W   +           PYQ      +E  ++ HR+ P
Sbjct: 202 NISYWFEVSNASDTISAWEIPYQ---GSVWEVQVILHRNDP 239


>gi|194211847|ref|XP_001488961.2| PREDICTED: LOW QUALITY PROTEIN: glucoside xylosyltransferase 1
           [Equus caballus]
          Length = 376

 Score = 48.1 bits (113), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 35/150 (23%), Positives = 60/150 (40%), Gaps = 18/150 (12%)

Query: 221 FATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNH-KPWPALGR-GFNTGVILLDL 278
           F   +  +W+   R    Q   +     +  +G  W N     P  G+ G N+GV+L+++
Sbjct: 158 FLRPVDDIWSNGGRFNSTQIAAMAPEHEEPRIG--WYNRFARHPYYGKTGVNSGVMLMNM 215

Query: 279 TKLRD-----------ISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLP 327
           T++R            + W      + +K+ L   W    DQD+ N I   +P  ++  P
Sbjct: 216 TRMRRKYFKNDMTTVRLRWGDILMPLLKKYKLNITW---GDQDLLNIIFFHNPESLFVFP 272

Query: 328 CQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
           CQWN +             E   + I+H N
Sbjct: 273 CQWNYRPDHCIYGSNCQEAEEEGIFILHGN 302


>gi|148665328|gb|EDK97744.1| mCG126098 [Mus musculus]
          Length = 182

 Score = 48.1 bits (113), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 31/84 (36%), Positives = 41/84 (48%), Gaps = 15/84 (17%)

Query: 261 PWPALGRGFNTGVILLDLTKLR---------DISWAGFWRIIAEKFLLTRLWTSLADQDI 311
           P P    GFN+GV+LL+L  +R         + SW    + +A+K+        L DQD 
Sbjct: 69  PPPEGLPGFNSGVMLLNLEAMRQSPLYSHLLEPSWV---QQLADKY---HFRGHLGDQDF 122

Query: 312 FNAIISEHPYLVYTLPCQWNVQLS 335
           F  I  EHP L + L C WN QL 
Sbjct: 123 FTMIGMEHPELFHVLDCTWNRQLC 146


>gi|299472732|emb|CBN80300.1| Glycosyltransferase, family GT49 [Ectocarpus siliculosus]
          Length = 280

 Score = 47.8 bits (112), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 44/161 (27%), Positives = 60/161 (37%), Gaps = 42/161 (26%)

Query: 546 YPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFE 605
           YP+N LRNVA+  V T +  + DID  P    Y  L    R         +  +V  AF 
Sbjct: 19  YPVNRLRNVAVKAVKTSHFLMTDIDIWPDVNAYSALHMRYRLETERVEDARSAIVFSAFS 78

Query: 606 TQR---------YRTAFPASHA------PTNFSRWVNAT------TPYQIEW-------- 636
            +R         Y    P + A       TN     +A       T Y   W        
Sbjct: 79  RRRFCEEEDCLQYANDVPETIADLKPCLETNTCGRFDAANQSGQGTAYVYYWRSMLRQPN 138

Query: 637 -----------APDFEPYIVAHRD--LPRYDTRFVGFGWNK 664
                      +  FEPY+V  +   LP++D RF+G+G NK
Sbjct: 139 QRTMEHILCFKSHRFEPYLVVRKTALLPKFDERFLGYGKNK 179


>gi|336407713|ref|ZP_08588209.1| hypothetical protein HMPREF1018_00224 [Bacteroides sp. 2_1_56FAA]
 gi|335944792|gb|EGN06609.1| hypothetical protein HMPREF1018_00224 [Bacteroides sp. 2_1_56FAA]
          Length = 311

 Score = 47.8 bits (112), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 49/194 (25%), Positives = 87/194 (44%), Gaps = 27/194 (13%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVEN---QSDWYLGK 254
           +  L ++LP  + K + +D D++   DI++ W   + + Q   IG +E+     + Y  +
Sbjct: 89  RCLLSRILPVNIDKILYMDCDIVVLNDISEFWN--TDITQ-YAIGCIEDIGSDEEEYYSR 145

Query: 255 LWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNA 314
           L  + K        FN GV+L++L   R+    G   +  + FL         DQD+ NA
Sbjct: 146 LQYDKK-----YSYFNAGVLLINLKYWREHKIDG---MCEQYFLAHSDRIRFNDQDLLNA 197

Query: 315 IISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLK-------IIHWNSPKKLKVKNK 367
           ++ +    V   P +WNVQ +   R+      E + LK       I+H+ + K     + 
Sbjct: 198 LLYKDKLFV---PFRWNVQDTFYRRTYSHKVKEHSGLKEALLHPAILHYTNKKPWNYDSM 254

Query: 368 H---MEFFRNLYLT 378
           H    E+F+ L +T
Sbjct: 255 HPLKQEYFKYLDMT 268


>gi|300119982|emb|CBK19536.2| unnamed protein product [Blastocystis hominis]
          Length = 429

 Score = 47.8 bits (112), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 32/144 (22%), Positives = 65/144 (45%), Gaps = 13/144 (9%)

Query: 464 PNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAG 523
           P+N  +Y+ ++      +V     +RL  +      ++GPI L +Y +  +    + Y  
Sbjct: 163 PSNDQKYDVTA------IVTAFGYNRLFALRQFMYRYQGPIVLVIYATSTQEVHLVRYIS 216

Query: 524 NSEALRSRRNIGYHVV---YKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPY 580
            +  +  R  I +++V    K    +PIN LRN+A+  + T +  +LD+D       Y  
Sbjct: 217 -THFIPKRVTILFYLVSRYLKSSTVFPINRLRNLAIRNIRTTHFLILDMDLRLSLNTYKE 275

Query: 581 LKSSIRSMDMHGHGGKKVLVVPAF 604
           + S  + +    H  +  +++P F
Sbjct: 276 VMSLPQFL---YHSNRSAVILPVF 296


>gi|419622112|ref|ZP_14155354.1| glycosyl transferase family protein [Campylobacter jejuni subsp.
           jejuni LMG 23216]
 gi|380600476|gb|EIB20813.1| glycosyl transferase family protein [Campylobacter jejuni subsp.
           jejuni LMG 23216]
          Length = 483

 Score = 47.4 bits (111), Expect = 0.024,   Method: Compositional matrix adjust.
 Identities = 41/151 (27%), Positives = 71/151 (47%), Gaps = 21/151 (13%)

Query: 188 KHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQ 247
           KHY+  +   +L L  +LP+   K + LD D++  +D+ +L+ L     Q   I    + 
Sbjct: 111 KHYATYF---RLYLGSILPKNCEKCLYLDVDMLVFSDLRELFCL---DLQDYIIAASPDI 164

Query: 248 SDW----YLGKLWKNHKPWPALGRG---FNTGVILLDLTKLRDISWAGFWRIIAEKFLLT 300
             W    ++GK  K++     + +G   FN G +L++L + R  S       +A  +   
Sbjct: 165 KQWPGSLFIGKSKKDNVRDLIIEKGPLYFNAGFMLINLKQWRLESLEKRMINVANSYTF- 223

Query: 301 RLWTSLADQDIFNAIISEHPYLVYTLPCQWN 331
                + DQDI N +I++    V  LPC+WN
Sbjct: 224 ----EVGDQDILNYVINKK---VIILPCKWN 247


>gi|300176754|emb|CBK24419.2| unnamed protein product [Blastocystis hominis]
          Length = 369

 Score = 47.4 bits (111), Expect = 0.026,   Method: Compositional matrix adjust.
 Identities = 46/235 (19%), Positives = 92/235 (39%), Gaps = 52/235 (22%)

Query: 478 DVT-LVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRNIGY 536
           DVT +V     +RL  +      ++GPI L +Y +  +    + Y   +  +  R  I +
Sbjct: 110 DVTAIVTAFGYNRLFALRQFMYRYQGPIVLVIYATSTQEVHLVRYIS-THFIPKRVTILF 168

Query: 537 HVVYK---EGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGH 593
           ++V +       +PIN LRN+A+  + T +  +LD+D       Y  + S  + +    H
Sbjct: 169 YLVSRYLQSSTVFPINRLRNLAIRNIRTTHFLILDMDLRLSLNTYKEVMSLPQFL---YH 225

Query: 594 GGKKVLVVPAF--ETQRYRTAFPASHAPTNFSRWVNATTPYQI----------------- 634
             +  +++P F  ++++      ++ + +  S W+      ++                 
Sbjct: 226 SNRSAVILPVFFYKSKQILAHCSSTESCSYLSNWIQPENKLELIECINRKICISNKNNIR 285

Query: 635 -------EWAPD----------------FEPYIVAHRD--LPRYDTRFVGFGWNK 664
                  EW                    EPY++      LP ++ RFV +G+NK
Sbjct: 286 THMYVMPEWFTTSKESHVSRVRCFITNFMEPYLMVKYSPTLPLFEERFVNYGYNK 340


>gi|332018122|gb|EGI58734.1| Uncharacterized protein C3orf21-like protein [Acromyrmex
           echinatior]
          Length = 358

 Score = 47.4 bits (111), Expect = 0.026,   Method: Compositional matrix adjust.
 Identities = 28/107 (26%), Positives = 48/107 (44%), Gaps = 10/107 (9%)

Query: 186 PNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVE 245
           P  +YS     + L L ++ P      I+ D D  F  D+  L+  F++  +    GL  
Sbjct: 168 PGTYYSDALFFISLGLHRIAPNHQKLAIMFDADTKFRRDVKDLFEEFNKFGKEALFGLAP 227

Query: 246 NQSDWY--LGKLWKNHKPWPALGR--------GFNTGVILLDLTKLR 282
             +  Y  +  L++N  P    G         G+N+G++LL+L +LR
Sbjct: 228 ELTPVYRHVLYLYRNKHPTTMFGEPRHKGGYPGYNSGIVLLNLERLR 274


>gi|300175964|emb|CBK22181.2| unnamed protein product [Blastocystis hominis]
          Length = 429

 Score = 47.4 bits (111), Expect = 0.027,   Method: Compositional matrix adjust.
 Identities = 50/238 (21%), Positives = 89/238 (37%), Gaps = 58/238 (24%)

Query: 478 DVT-LVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRNIGY 536
           DVT +V     +RL  +      ++GPI L +Y +  +    + Y   +  +  R  I +
Sbjct: 170 DVTAIVTAFGYNRLFALRQFMYRYQGPIVLVIYATSTQEVHLVRYIS-THFIPKRVTILF 228

Query: 537 HVV---YKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGH 593
           ++V    K    +PIN LRN+A+  + T +  +LD+D       Y  + S  + +    H
Sbjct: 229 YLVSRYLKSSTVFPINRLRNLAIRNIRTTHFLILDMDLRLSLNTYKEVMSLPQFL---YH 285

Query: 594 GGKKVLVVPAFETQRYRTAFPASHAPTN-----FSRWVNATTPYQI-------------- 634
             +  +++P F    Y+     +H  T       S W+      ++              
Sbjct: 286 SNRSAVILPVF---LYKGKQILAHCSTTESCSYLSNWIQPENKLELIECINRKICISNKN 342

Query: 635 ----------EWAPD----------------FEPYIVAHRD--LPRYDTRFVGFGWNK 664
                     EW                    EPY++      LP ++ RFV +G+NK
Sbjct: 343 NIRTHMYVMPEWFTTSKESHVSRVRCFITNFMEPYLMVKYSPTLPLFEERFVNYGYNK 400


>gi|298480509|ref|ZP_06998706.1| lipopolysaccharide 1,2-glucosyltransferase/general stress protein
           [Bacteroides sp. D22]
 gi|298273330|gb|EFI14894.1| lipopolysaccharide 1,2-glucosyltransferase/general stress protein
           [Bacteroides sp. D22]
          Length = 311

 Score = 47.4 bits (111), Expect = 0.028,   Method: Compositional matrix adjust.
 Identities = 38/147 (25%), Positives = 67/147 (45%), Gaps = 22/147 (14%)

Query: 190 YSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSD 249
           Y  +   L+L +P++LP  + K + +D+D+IF  DI+ L+           +    ++  
Sbjct: 79  YPSLATYLRLFIPQLLPLNVEKALYIDSDIIFNGDISDLYNCDIANYALAGMEDAPHRQP 138

Query: 250 WYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWA----GFWRIIAEKFLLTRLWTS 305
             LG        +P     FN G +LL++  LRDI++      + R    K +       
Sbjct: 139 ERLG--------YPIEDLYFNAGFLLLNIKYLRDINFVDMALSYIRDCHHKIV------- 183

Query: 306 LADQDIFNAIISEHPYLVYTLPCQWNV 332
           L DQD+ NA++      V  +P +WN+
Sbjct: 184 LHDQDVLNALLHGR---VRFVPIKWNM 207


>gi|302853592|ref|XP_002958310.1| hypothetical protein VOLCADRAFT_108086 [Volvox carteri f.
            nagariensis]
 gi|300256335|gb|EFJ40603.1| hypothetical protein VOLCADRAFT_108086 [Volvox carteri f.
            nagariensis]
          Length = 1182

 Score = 47.4 bits (111), Expect = 0.028,   Method: Compositional matrix adjust.
 Identities = 21/50 (42%), Positives = 31/50 (62%), Gaps = 1/50 (2%)

Query: 616  SHAPTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRY-DTRFVGFGWNK 664
             H  T++S+W N T PY +EW+  +EP++V  R    + D RF G+G NK
Sbjct: 1035 GHKNTDYSKWFNTTEPYVVEWSGRYEPWVVVDRLSSSWADARFRGYGKNK 1084


>gi|340720433|ref|XP_003398643.1| PREDICTED: n-acetyllactosaminide
           beta-1,3-N-acetylglucosaminyltransferase-like [Bombus
           terrestris]
          Length = 449

 Score = 47.4 bits (111), Expect = 0.029,   Method: Compositional matrix adjust.
 Identities = 56/241 (23%), Positives = 84/241 (34%), Gaps = 70/241 (29%)

Query: 479 VTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQ---------------------- 516
           VTL    + D++  +  L + WEGP+SL ++    +A                       
Sbjct: 127 VTLCTHATADQVYGIVELARRWEGPLSLAVFTPGLDAGIAVALLDRACRCEPEMYKVSVH 186

Query: 517 ------------QFLSYAGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYV 564
                       +     G+  A   +R  G     +    YPIN  RNVA  Q +T  V
Sbjct: 187 LVFPASRPPALGRITRIHGDCAASDLQRGDGATERKRRNMIYPINVARNVARIQANTTRV 246

Query: 565 FLLDIDFLPMFGLYPYLKSSIRSMDMHGH-GGKKVLVVPAFETQR--------------- 608
            + DI+ LP       L S    +D      G+ V VVP FE +                
Sbjct: 247 LVTDIELLP----SQRLASGFMEIDREKTPKGRIVFVVPVFEIESNEVPPLTKKELLLAT 302

Query: 609 ------YRTAFPASHAP--TNFSRWVNATTPYQIEWAPDFEPYIVAHRDLP--RYDTRFV 658
                 Y   F  SH       SRW+    P ++      +P I+  R+ P  R++  F+
Sbjct: 303 KAGLAVYFHRFLCSHCQRFPGLSRWLIRPDPGKV------KPLIITKREYPYHRWEPVFI 356

Query: 659 G 659
           G
Sbjct: 357 G 357


>gi|300123773|emb|CBK25045.2| unnamed protein product [Blastocystis hominis]
          Length = 343

 Score = 47.0 bits (110), Expect = 0.029,   Method: Compositional matrix adjust.
 Identities = 37/148 (25%), Positives = 69/148 (46%), Gaps = 22/148 (14%)

Query: 475 DGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQ---------------FL 519
           D +DVTL+  L++ R   +  L K W   I++ +++++ E  +               F+
Sbjct: 169 DYDDVTLITILTLVRKDRLPYLMKRWRHRINMVIFVTENEIDELREVIDSYKSYKRITFI 228

Query: 520 SYAGNSEALRSRRNIGYHV---VYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFG 576
            Y   S      ++I Y+    VY+    +PIN LR++A+  + T + F+ DID      
Sbjct: 229 LYIVKSRIGSRYQSIYYNEKKPVYRNNTIFPINVLRDLAIEFIDTTHYFVSDIDVFSSDT 288

Query: 577 LYPYLKSSIRSMDMHGHGGKKVLVVPAF 604
           LY  L+   +++       K VL++ +F
Sbjct: 289 LYSTLQIHNQTLK----DPKAVLLLKSF 312


>gi|300120974|emb|CBK21216.2| unnamed protein product [Blastocystis hominis]
          Length = 343

 Score = 47.0 bits (110), Expect = 0.031,   Method: Compositional matrix adjust.
 Identities = 37/148 (25%), Positives = 69/148 (46%), Gaps = 22/148 (14%)

Query: 475 DGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQ---------------FL 519
           D +DVTL+  L++ R   +  L K W+  I++ +++++ E  +               F+
Sbjct: 169 DYDDVTLITILTLVRKDRLPYLMKRWKHRINMVIFVTENEIDELREVIDSYKSYKRITFI 228

Query: 520 SYAGNSEALRSRRNIGY---HVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFG 576
            Y   S      ++I Y     VY+    +PIN LR++A+  + T + F+ DID      
Sbjct: 229 LYIVKSRIGSRYQSIYYDEKKPVYRNNTIFPINVLRDLAIEFIDTTHYFVSDIDVFSSDT 288

Query: 577 LYPYLKSSIRSMDMHGHGGKKVLVVPAF 604
           LY  L+   +++       K VL++ +F
Sbjct: 289 LYSTLQIHNQTLK----DPKAVLLLKSF 312


>gi|300176749|emb|CBK24414.2| unnamed protein product [Blastocystis hominis]
          Length = 343

 Score = 47.0 bits (110), Expect = 0.032,   Method: Compositional matrix adjust.
 Identities = 37/148 (25%), Positives = 69/148 (46%), Gaps = 22/148 (14%)

Query: 475 DGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQ---------------FL 519
           D +DVTL+  L++ R   +  L K W   I++ +++++ E  +               F+
Sbjct: 169 DYDDVTLITILTLVRKDRLPYLMKRWRHRINMVIFVTENEIDELREVIDSYKSYKRITFI 228

Query: 520 SYAGNSEALRSRRNIGYHV---VYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFG 576
            Y   S      ++I Y+    VY+    +PIN LR++A+  + T + F+ DID      
Sbjct: 229 LYIVKSRIGSRYQSIYYNEKKPVYRNNTIFPINVLRDLAIEFIDTTHYFVSDIDVFSSDT 288

Query: 577 LYPYLKSSIRSMDMHGHGGKKVLVVPAF 604
           LY  L+   +++       K VL++ +F
Sbjct: 289 LYSTLQIHNQTLK----DPKAVLLLKSF 312


>gi|300119989|emb|CBK19543.2| unnamed protein product [Blastocystis hominis]
          Length = 343

 Score = 47.0 bits (110), Expect = 0.033,   Method: Compositional matrix adjust.
 Identities = 37/148 (25%), Positives = 69/148 (46%), Gaps = 22/148 (14%)

Query: 475 DGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQ---------------FL 519
           D +DVTL+  L++ R   +  L K W+  I++ +++++ E  +               F+
Sbjct: 169 DYDDVTLITILTLVRKDRLPYLMKRWKHRINMVIFVTENEIDELREVIDSYKSYKRITFI 228

Query: 520 SYAGNSEALRSRRNIGY---HVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFG 576
            Y   S      ++I Y     VY+    +PIN LR++A+  + T + F+ DID      
Sbjct: 229 LYIVKSRIGSRYQSIYYDEKKPVYRNNTIFPINVLRDLAIEFIDTTHYFVSDIDVFSSDT 288

Query: 577 LYPYLKSSIRSMDMHGHGGKKVLVVPAF 604
           LY  L+   +++       K VL++ +F
Sbjct: 289 LYSTLQIHNQTLK----DPKAVLLLKSF 312


>gi|148673701|gb|EDL05648.1| mCG51561 [Mus musculus]
          Length = 230

 Score = 47.0 bits (110), Expect = 0.035,   Method: Compositional matrix adjust.
 Identities = 39/161 (24%), Positives = 69/161 (42%), Gaps = 10/161 (6%)

Query: 13  EVIQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWS-VPQV 71
           E + +A+V  G       VT++KS L +   PLH H+  +    N  +   ++WS + + 
Sbjct: 38  EKMHLAVVACG-ERLEETVTMLKSALIFSIKPLHVHIFAEDQLHNSFKDRLASWSFLRRF 96

Query: 72  EVSFYLADSVVEDVS-WIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQ 130
           + S Y      +  + W   K         +L LP +L E +   + +DTD++F   +  
Sbjct: 97  DYSLYPITFPCDSAADW---KKLFKPCASQRLFLPLILKE-VDSLLYVDTDILFLPPVDD 152

Query: 131 LWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPW-PALGR 170
           +W+L  +    Q   +     +  +G  W N   W P  GR
Sbjct: 153 IWSLLKKFNSTQIAAMAPEHKEPRIG--WYNRFAWHPYYGR 191



 Score = 46.2 bits (108), Expect = 0.059,   Method: Compositional matrix adjust.
 Identities = 26/87 (29%), Positives = 47/87 (54%), Gaps = 5/87 (5%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L LP +L E +   + +DTD++F   +  +W+L  +    Q   +     +  +G  W 
Sbjct: 124 RLFLPLILKE-VDSLLYVDTDILFLPPVDDIWSLLKKFNSTQIAAMAPEHKEPRIG--WY 180

Query: 258 NHKPW-PALGR-GFNTGVILLDLTKLR 282
           N   W P  GR G N+GV+L+++T++R
Sbjct: 181 NRFAWHPYYGRTGVNSGVMLMNMTRMR 207


>gi|300123357|emb|CBK24630.2| unnamed protein product [Blastocystis hominis]
          Length = 343

 Score = 46.6 bits (109), Expect = 0.039,   Method: Compositional matrix adjust.
 Identities = 30/125 (24%), Positives = 53/125 (42%), Gaps = 6/125 (4%)

Query: 458 QMVEINPNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQ 517
           +MV+ + N  D Y    D  D+TLV   S    ++     + WEGPI + +  +  E  Q
Sbjct: 85  KMVQSSSNTFDLY--PDDTFDITLVTYASASMYELAPKYCRQWEGPIVIVVSYNQNEESQ 142

Query: 518 FLSYAGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGL 577
              +  +  +  +   I + +        P+N ++N  +  V T +    + D +P   L
Sbjct: 143 LTQFLASIASFPAVSVIPFPITAVS----PVNAMKNAGIAAVRTTHYVFAEFDLVPSSNL 198

Query: 578 YPYLK 582
           Y  LK
Sbjct: 199 YSVLK 203


>gi|300123772|emb|CBK25044.2| unnamed protein product [Blastocystis hominis]
          Length = 429

 Score = 46.6 bits (109), Expect = 0.039,   Method: Compositional matrix adjust.
 Identities = 32/131 (24%), Positives = 60/131 (45%), Gaps = 8/131 (6%)

Query: 478 DVT-LVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRNIGY 536
           DVT +V     +RL  +      ++GPI L +Y +  +    + Y   +  +  R  I +
Sbjct: 170 DVTAIVTAFGYNRLFALRQFMYRYQGPIVLVIYATSTQEVHLVRYIS-THFIPKRVTILF 228

Query: 537 HVV---YKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGH 593
           ++V    K    +PIN LRN+A+  + T +  +LD+D       Y  + S  + +    H
Sbjct: 229 YLVSRYLKSSTVFPINRLRNLAIRNIRTTHFLILDMDLRLSVNTYKEVMSLPQFL---YH 285

Query: 594 GGKKVLVVPAF 604
             +  +++P F
Sbjct: 286 SNRSAVILPVF 296


>gi|291527266|emb|CBK92852.1| Lipopolysaccharide biosynthesis proteins, LPS:glycosyltransferases
           [Eubacterium rectale M104/1]
          Length = 723

 Score = 46.6 bits (109), Expect = 0.039,   Method: Compositional matrix adjust.
 Identities = 52/188 (27%), Positives = 86/188 (45%), Gaps = 22/188 (11%)

Query: 196 LLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSR---LRQRQTIGLVENQSDWYL 252
           + ++ LP ++P+ L K I LD+D+   TDI +LW L      L   Q    + N   W  
Sbjct: 429 MFRIMLPDIMPD-LKKIIYLDSDLFVNTDIEELWNLNIDNYCLAAAQDCSTIRN---W-- 482

Query: 253 GKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIF 312
           G  +       +  R FN GV+ ++L  +R  + + F ++I   +L     T L DQD  
Sbjct: 483 GTPYAVAAGQTSRDRYFNAGVLCMNLDNIRK-NGSLFQQVI--DYLNDNPRTWLPDQDAL 539

Query: 313 NAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWNSPKKLKVKNKHMEFF 372
           NAI S    L+     +WN  + +  +++     E  + KI H+ +   L + + + E  
Sbjct: 540 NAIFSGKTLLIDE---KWNYFIDEARKNN-----EKAEKKIYHYAAT--LLMLHTNNEID 589

Query: 373 RNLYLTFL 380
           R  Y T L
Sbjct: 590 RAYYFTIL 597



 Score = 40.4 bits (93), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 32/117 (27%), Positives = 55/117 (47%), Gaps = 29/117 (24%)

Query: 32  TLIKSILFYRKNPLHFHLITDTVALNILQTLFSTWSVPQVEVSFYLADSVVEDVSWIPNK 91
           T ++SI+   K P+ FH++ D     I +   S           ++AD+   D+ +    
Sbjct: 362 TTMQSIVENTKAPIVFHILHDDTLNEINKNKLS-----------FIADNSGNDIEF---H 407

Query: 92  HY-SGVYG-------------LLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 134
           H+ S V+G             + ++ LP ++P+ L K I LD+D+   TDI +LW L
Sbjct: 408 HFNSDVFGTFADSMNRFAIGTMFRIMLPDIMPD-LKKIIYLDSDLFVNTDIEELWNL 463


>gi|431910233|gb|ELK13306.1| N-acetyllactosaminide beta-1,3-N-acetylglucosaminyltransferase
           [Pteropus alecto]
          Length = 148

 Score = 46.6 bits (109), Expect = 0.044,   Method: Composition-based stats.
 Identities = 21/54 (38%), Positives = 30/54 (55%), Gaps = 6/54 (11%)

Query: 617 HAPTNFSRWVNATT------PYQIEWAPDFEPYIVAHRDLPRYDTRFVGFGWNK 664
            APTN+SRWVN          Y + W   +EP+ VA   +P +D RF  +G+N+
Sbjct: 29  QAPTNYSRWVNLPEETLLRPAYVVPWQDPWEPFYVAGGKVPTFDERFRQYGFNR 82


>gi|300176052|emb|CBK23363.2| unnamed protein product [Blastocystis hominis]
          Length = 316

 Score = 46.6 bits (109), Expect = 0.046,   Method: Compositional matrix adjust.
 Identities = 35/139 (25%), Positives = 60/139 (43%), Gaps = 22/139 (15%)

Query: 478 DVT-LVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRNIGY 536
           DVT +V     +RL  +      ++GPI L +Y +  +    + Y   +  +  R  I +
Sbjct: 110 DVTAIVTAFGYNRLFALRQFMYRYQGPIVLVIYATSTQEVHLVRYIS-THFIPKRVTILF 168

Query: 537 HVV---YKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGH 593
           ++V    K    +PIN LRN+A+  + T +  +LD+D       Y               
Sbjct: 169 YLVSRYLKSSTVFPINRLRNLAIRNIRTTHFLILDMDLRLSLNTY--------------- 213

Query: 594 GGKKVLVVPAFETQRYRTA 612
             K+V+ +P F  Q  R+A
Sbjct: 214 --KEVMSLPQFLYQSNRSA 230


>gi|300121109|emb|CBK21490.2| unnamed protein product [Blastocystis hominis]
          Length = 343

 Score = 46.2 bits (108), Expect = 0.052,   Method: Compositional matrix adjust.
 Identities = 37/148 (25%), Positives = 68/148 (45%), Gaps = 22/148 (14%)

Query: 475 DGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQ---------------FL 519
           D +DVTL+  L++ R      L K W+  I++ +++++ E  +               F+
Sbjct: 169 DYDDVTLITILTLVRKDRFPYLMKRWKHRINMVIFVTENEIDELREVIDSYKSYKRITFI 228

Query: 520 SYAGNSEALRSRRNIGY---HVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFG 576
            Y   S      ++I Y     VY+    +PIN LR++A+  + T + F+ DID      
Sbjct: 229 LYIVKSRIGSRYQSIYYDEKKPVYRNNTIFPINVLRDLAIEFIDTTHYFVSDIDVFSSDT 288

Query: 577 LYPYLKSSIRSMDMHGHGGKKVLVVPAF 604
           LY  L+   +++       K VL++ +F
Sbjct: 289 LYSTLQIHNQTLK----DPKAVLLLKSF 312


>gi|444709933|gb|ELW50928.1| hypothetical protein TREES_T100021730 [Tupaia chinensis]
          Length = 405

 Score = 46.2 bits (108), Expect = 0.052,   Method: Compositional matrix adjust.
 Identities = 47/169 (27%), Positives = 66/169 (39%), Gaps = 27/169 (15%)

Query: 211 KTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWP------- 263
           + I LD D+ + T+I +L+  F        IG+       Y    W+     P       
Sbjct: 236 RIIQLDLDLKYKTNIRELFEEFDNFLPAAVIGIAREMQPVYRHTFWQFRHENPKTRVGGP 295

Query: 264 --ALGRGFNTGVILLDLTKLRDISWAGFW------RIIAEKFLLTRLWTSLADQDIFNAI 315
                 GFN+G    +L  +R  S  G        + +A+K+        L DQD F  I
Sbjct: 296 PPEGLPGFNSG---FNLEAMRQSSLYGRLLEPAKVQQLADKY---HFRGHLGDQDFFTMI 349

Query: 316 ISEHPYLVYTLPCQWNVQLS----DNTRSD--ELCYTELTDLKIIHWNS 358
             EHP L + L C WN QL     D+  SD  E  +     +KI H N 
Sbjct: 350 GMEHPELFHVLDCTWNRQLCTWWRDHGYSDIFEAYFRCEGHVKIYHGNC 398


>gi|300121608|emb|CBK22126.2| unnamed protein product [Blastocystis hominis]
          Length = 224

 Score = 46.2 bits (108), Expect = 0.052,   Method: Compositional matrix adjust.
 Identities = 37/139 (26%), Positives = 65/139 (46%), Gaps = 22/139 (15%)

Query: 478 DVTLVA-QLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRNIGY 536
           DVT +A     +RL  +      ++GPI L +Y +  +    + Y  ++  +  R  I +
Sbjct: 71  DVTAIATAFGYNRLFALRQFMYRYQGPIVLVIYATSTQEVHLVRYI-STHFIPKRVTILF 129

Query: 537 HVV---YKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGH 593
           ++V    K    +PIN LRN+A+  + T +  +LD+D          L+ S+ +      
Sbjct: 130 YLVSRYLKSSTVFPINRLRNLAIRNIRTTHFLILDMD----------LRLSLNTY----- 174

Query: 594 GGKKVLVVPAFETQRYRTA 612
             K+VL +P F  Q  R+A
Sbjct: 175 --KEVLSLPQFLYQSNRSA 191


>gi|300176048|emb|CBK23359.2| unnamed protein product [Blastocystis hominis]
          Length = 376

 Score = 46.2 bits (108), Expect = 0.053,   Method: Compositional matrix adjust.
 Identities = 35/139 (25%), Positives = 60/139 (43%), Gaps = 22/139 (15%)

Query: 478 DVT-LVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRNIGY 536
           DVT +V     +RL  +      ++GPI L +Y +  +    + Y   +  +  R  I +
Sbjct: 170 DVTAIVTAFGYNRLFALRQFMYRYQGPIVLVIYATSTQEIHLVRYIS-THFIPKRVTILF 228

Query: 537 HVV---YKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGH 593
           ++V    K    +PIN LRN+A+  + T +  +LD+D       Y               
Sbjct: 229 YLVSRYLKSSTVFPINRLRNLAIRNIRTTHFLILDMDLRLSLNTY--------------- 273

Query: 594 GGKKVLVVPAFETQRYRTA 612
             K+V+ +P F  Q  R+A
Sbjct: 274 --KEVMSLPQFLYQSNRSA 290


>gi|345313001|ref|XP_001516834.2| PREDICTED: uncharacterized protein C3orf21 homolog, partial
           [Ornithorhynchus anatinus]
          Length = 111

 Score = 46.2 bits (108), Expect = 0.054,   Method: Compositional matrix adjust.
 Identities = 28/74 (37%), Positives = 37/74 (50%), Gaps = 9/74 (12%)

Query: 268 GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTS------LADQDIFNAIISEHPY 321
           GFN+GV+LLDL  +R    +  +  + E   + +L         L DQD F  I  EHP 
Sbjct: 25  GFNSGVMLLDLEAMRQ---SQLYNQLLEPARIQQLTEKYHFRGHLGDQDFFTMIGMEHPE 81

Query: 322 LVYTLPCQWNVQLS 335
           L + L C WN QL 
Sbjct: 82  LFHVLDCTWNRQLC 95


>gi|336385107|gb|EGO26254.1| glycosyltransferase family 49 protein [Serpula lacrymans var.
           lacrymans S7.9]
          Length = 355

 Score = 45.8 bits (107), Expect = 0.066,   Method: Compositional matrix adjust.
 Identities = 54/264 (20%), Positives = 110/264 (41%), Gaps = 66/264 (25%)

Query: 462 INPNNVDE--YEASSDGN--DVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQ- 516
           + PN +    Y AS   N  D+T+   ++ +R ++   L + ++GPIS+T+++    +  
Sbjct: 15  MQPNKIVPFFYRASGTFNPEDITITTLVTSNRFKVFAQLVERYQGPISVTVHIKSDSSHT 74

Query: 517 -----QFLSYAGNSEALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDF 571
                +      +S ++ +  ++  H+V    +   +NT RN+A     T +V +LD+DF
Sbjct: 75  TALLNELHKLYASSPSMATFVDV--HLVLDPFD-RQLNTWRNIARFFARTDFVMMLDVDF 131

Query: 572 LPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQR----------------------- 608
           +        +++S   M      G   LVVPAFE  +                       
Sbjct: 132 VVCTDFRAAVRASKAVMARLSE-GYTALVVPAFEYLKQADGADQKKFPRDKNVCISIHSS 190

Query: 609 -----------------------YRTAFPASHAPTNFSRWVNATTPYQI----EWAPDFE 641
                                  +  ++   H  T+++R+  ++ P ++    ++   +E
Sbjct: 191 LSLTHFSLLQSLLSLVKAKKIDMFHRSWAPGHNSTDYARFY-SSQPGEVYKVTQYHSAYE 249

Query: 642 PYIVAHRDLPRY-DTRFVGFGWNK 664
           PY++  +  P + D RF+G+G NK
Sbjct: 250 PYVIFKKTGPPWCDERFIGYGGNK 273


>gi|392356257|ref|XP_003752296.1| PREDICTED: glucoside xylosyltransferase 1-like [Rattus norvegicus]
          Length = 197

 Score = 45.8 bits (107), Expect = 0.069,   Method: Compositional matrix adjust.
 Identities = 27/107 (25%), Positives = 47/107 (43%), Gaps = 15/107 (14%)

Query: 263 PALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWTSLADQD 310
           P  G+ G N+GV+L+++T++R            + W      + +K+ L   W    DQD
Sbjct: 20  PYYGKTGVNSGVMLMNMTRMRRKYFKNDMTTVRLQWGDILMPLLKKYKLNITW---GDQD 76

Query: 311 IFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
           + N +   +P  ++  PCQWN +             E   + I+H N
Sbjct: 77  LLNIVFFHNPESLFVFPCQWNYRPDHCIYGSNCQEAEEGGIFILHGN 123


>gi|33988262|gb|AAH15597.2| GLT8D3 protein [Homo sapiens]
          Length = 202

 Score = 45.8 bits (107), Expect = 0.072,   Method: Compositional matrix adjust.
 Identities = 27/107 (25%), Positives = 47/107 (43%), Gaps = 15/107 (14%)

Query: 263 PALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWTSLADQD 310
           P  G+ G N+GV+L+++T++R            + W      + +K+ L   W    DQD
Sbjct: 25  PYYGKTGVNSGVMLMNMTRMRRKYFKNDMTTVRLQWGDILMPLLKKYKLNITW---GDQD 81

Query: 311 IFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
           + N +   +P  ++  PCQWN +             E   + I+H N
Sbjct: 82  LLNIVFFHNPESLFVFPCQWNYRPDHCIYGSNCQEAEEGGIFILHGN 128


>gi|159475561|ref|XP_001695887.1| hypothetical protein CHLREDRAFT_149054 [Chlamydomonas reinhardtii]
 gi|158275447|gb|EDP01224.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 181

 Score = 45.8 bits (107), Expect = 0.073,   Method: Composition-based stats.
 Identities = 20/45 (44%), Positives = 25/45 (55%), Gaps = 3/45 (6%)

Query: 617 HAPTNFSRWVNATTPYQIEWAPDFEPY--IVAHRDLPRYDTRFVG 659
           H P N  RW  A  PY ++WA  FEP+  +  HRD P YD  + G
Sbjct: 121 HGPINHKRWATAKEPYTVKWAQAFEPWGILARHRD-PGYDEVWPG 164


>gi|238924869|ref|YP_002938385.1| Glycosyltransferase Family 2 modular protein [Eubacterium rectale
           ATCC 33656]
 gi|238876544|gb|ACR76251.1| Glycosyltransferase Family 2 modular protein [Eubacterium rectale
           ATCC 33656]
          Length = 723

 Score = 45.8 bits (107), Expect = 0.074,   Method: Compositional matrix adjust.
 Identities = 51/188 (27%), Positives = 86/188 (45%), Gaps = 22/188 (11%)

Query: 196 LLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSR---LRQRQTIGLVENQSDWYL 252
           + ++ LP ++P+ L K I LD+D+   TDI +LW L      L   Q    + N   W  
Sbjct: 429 MFRIMLPDIMPD-LKKIIYLDSDLFVNTDIEELWNLNIDNYCLAAAQDCSTIRN---W-- 482

Query: 253 GKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIF 312
           G  +       +  R FN GV+ ++L  +R  + + F +++   +L     T L DQD  
Sbjct: 483 GTPYAVAAGQTSRDRYFNAGVLCMNLDNIRK-NGSLFQQVM--DYLSDNPRTWLPDQDAL 539

Query: 313 NAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWNSPKKLKVKNKHMEFF 372
           NAI S    L+     +WN  + +  +++     E  + KI H+ +   L + + + E  
Sbjct: 540 NAIFSGKTLLIDE---KWNYFIDEARKNN-----EKAEKKIYHYAAT--LLMLHTNNEID 589

Query: 373 RNLYLTFL 380
           R  Y T L
Sbjct: 590 RAYYFTIL 597


>gi|265764909|ref|ZP_06093184.1| lipopolysaccharide 1,2-glucosyltransferase [Bacteroides sp. 2_1_16]
 gi|263254293|gb|EEZ25727.1| lipopolysaccharide 1,2-glucosyltransferase [Bacteroides sp. 2_1_16]
          Length = 311

 Score = 45.8 bits (107), Expect = 0.076,   Method: Compositional matrix adjust.
 Identities = 48/194 (24%), Positives = 87/194 (44%), Gaps = 27/194 (13%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVEN---QSDWYLGK 254
           +  L ++LP  + K + +D D++   DI++ W   + + Q   IG +E+     + Y  +
Sbjct: 89  RCLLSRILPVNIDKILYIDCDIVVLNDISEFWN--TDITQ-YAIGCIEDIGSDEEEYYSR 145

Query: 255 LWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNA 314
           L  + K        FN GV+L++L   R+        +  + FL         DQD+ NA
Sbjct: 146 LQYDKK-----YSYFNAGVLLINLKYWRE---HKIDEMCEQYFLAHSDRIRFNDQDLLNA 197

Query: 315 IISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLK-------IIHWNSPKKLKVKNK 367
           ++ ++   V   P +WNVQ +   R+      E + LK       I+H+ + K     + 
Sbjct: 198 LLYKNKLFV---PFRWNVQDTFYRRTYSHKVKEHSGLKEALLHPAILHYTNKKPWNYDSM 254

Query: 368 H---MEFFRNLYLT 378
           H    E+F+ L +T
Sbjct: 255 HPLKQEYFKYLDMT 268


>gi|296491302|tpg|DAA33365.1| TPA: hypothetical protein LOC533703 [Bos taurus]
          Length = 333

 Score = 45.8 bits (107), Expect = 0.077,   Method: Compositional matrix adjust.
 Identities = 31/120 (25%), Positives = 56/120 (46%), Gaps = 9/120 (7%)

Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
           F + +++ +  S     +YS     L + + +++P+ + + I LD D+ + T+I +L+  
Sbjct: 182 FPVVEAMQKHFSAGSGTYYSDSIFFLSVAMHQIMPKEILRIIQLDLDLKYKTNIRELFEE 241

Query: 232 FSRLRQRQTIGLVENQSDWYLGKLWK----NHK-----PWPALGRGFNTGVILLDLTKLR 282
           F        IG+       Y    W+    N K     P P    GFN+GV+LL+L  +R
Sbjct: 242 FDNFLPGAIIGIAREMQPVYRHTFWQFRHDNPKTRVGSPPPEGLPGFNSGVMLLNLEAMR 301


>gi|119578235|gb|EAW57831.1| hCG39367, isoform CRA_a [Homo sapiens]
          Length = 177

 Score = 45.8 bits (107), Expect = 0.081,   Method: Compositional matrix adjust.
 Identities = 27/107 (25%), Positives = 47/107 (43%), Gaps = 15/107 (14%)

Query: 263 PALGR-GFNTGVILLDLTKLRD-----------ISWAGFWRIIAEKFLLTRLWTSLADQD 310
           P  G+ G N+GV+L+++T++R            + W      + +K+ L   W    DQD
Sbjct: 20  PYYGKTGVNSGVMLMNMTRMRRKYFKNDMTTVRLQWGDILMPLLKKYKLNITW---GDQD 76

Query: 311 IFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
           + N +   +P  ++  PCQWN +             E   + I+H N
Sbjct: 77  LLNIVFFHNPESLFVFPCQWNYRPDHCIYGSNCQEAEEGGIFILHGN 123


>gi|291526000|emb|CBK91587.1| Lipopolysaccharide biosynthesis proteins, LPS:glycosyltransferases
           [Eubacterium rectale DSM 17629]
          Length = 723

 Score = 45.4 bits (106), Expect = 0.087,   Method: Compositional matrix adjust.
 Identities = 52/186 (27%), Positives = 85/186 (45%), Gaps = 22/186 (11%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSR---LRQRQTIGLVENQSDWYLGK 254
           ++ LP ++P+ L K I LD+D+   TDI +LW L      L   Q    + N   W  G 
Sbjct: 431 RIMLPDIMPD-LKKIIYLDSDLFVNTDIEELWNLNIDNYCLAAAQDCSTIRN---W--GT 484

Query: 255 LWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNA 314
            +       +  R FN GV+ ++L  +R  + + F ++I   +L     T L DQD  NA
Sbjct: 485 PYAVAAGQTSRDRYFNVGVLCMNLDNIRK-NGSLFQQVI--DYLNDNPRTWLPDQDALNA 541

Query: 315 IISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWNSPKKLKVKNKHMEFFRN 374
           I S    L+     +WN  + +  +++     E  + KI H+ +   L + + + E  R 
Sbjct: 542 IFSGKTLLIDE---KWNYFIDEARKNN-----EKAEKKIYHYAAT--LLMLHTNNEIDRA 591

Query: 375 LYLTFL 380
            Y T L
Sbjct: 592 YYFTIL 597


>gi|329964475|ref|ZP_08301529.1| glycosyltransferase, family 8 [Bacteroides fluxus YIT 12057]
 gi|328524875|gb|EGF51927.1| glycosyltransferase, family 8 [Bacteroides fluxus YIT 12057]
          Length = 316

 Score = 45.4 bits (106), Expect = 0.091,   Method: Compositional matrix adjust.
 Identities = 53/199 (26%), Positives = 90/199 (45%), Gaps = 22/199 (11%)

Query: 172 FYL-ADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWA 230
           FYL  + +++D S   + H S +    ++ L  +LP  L+K + LD D++    I +LW 
Sbjct: 67  FYLIGEEILQDCSIYGDSHIS-LATYYRIFLCSILPADLSKALYLDCDLVVLGSINELWN 125

Query: 231 LFSRLRQRQTIGLVENQSDWYLGKLWKNHK-PWPALGRGFNTGVILLDLTKLRDISWAGF 289
             + + Q   +  VE   D + GK     +  + +    FN GV+L++L   R++ + G 
Sbjct: 126 --TDISQ-YAVACVE---DMWSGKPDNYERLHYASSDSYFNAGVLLVNLDYWRELDFEGL 179

Query: 290 WRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQ---LSDNTR----SDE 342
                +      ++    DQD+ NA++ +       LP +WNVQ   L    R    S  
Sbjct: 180 AMAYIKAHHSELVFN---DQDVLNALLHDRKLF---LPFRWNVQDGFLRRKRRIRQESIA 233

Query: 343 LCYTELTDLKIIHWNSPKK 361
           +   EL    IIH+   KK
Sbjct: 234 MLDEELKSPVIIHYTGGKK 252


>gi|300122825|emb|CBK23832.2| unnamed protein product [Blastocystis hominis]
          Length = 316

 Score = 45.4 bits (106), Expect = 0.093,   Method: Compositional matrix adjust.
 Identities = 34/152 (22%), Positives = 65/152 (42%), Gaps = 27/152 (17%)

Query: 464 PNNVDEYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAG 523
           P+N  +Y+ ++      +V     +RL  +      ++GPI L +Y +  +    + Y  
Sbjct: 103 PSNDQKYDVTA------IVTAFGYNRLFALRQFMYRYQGPIVLVIYATSTQEVHLVRYIS 156

Query: 524 NSEALRSRRNIGYHVVYK---EGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPY 580
            +  +  R  I +++V +       +PIN LRN+A+  + T +  +LD+D       Y  
Sbjct: 157 -THFIPKRVTILFYLVSRYLQSSTVFPINRLRNLAIRNIRTTHFLILDMDLRLSLNTY-- 213

Query: 581 LKSSIRSMDMHGHGGKKVLVVPAFETQRYRTA 612
                          K+V+ +P F  Q  R+A
Sbjct: 214 ---------------KEVMSLPQFLYQSNRSA 230


>gi|427412548|ref|ZP_18902740.1| hypothetical protein HMPREF9282_00147 [Veillonella ratti
           ACS-216-V-Col6b]
 gi|425716355|gb|EKU79339.1| hypothetical protein HMPREF9282_00147 [Veillonella ratti
           ACS-216-V-Col6b]
          Length = 606

 Score = 45.4 bits (106), Expect = 0.097,   Method: Compositional matrix adjust.
 Identities = 50/197 (25%), Positives = 76/197 (38%), Gaps = 21/197 (10%)

Query: 173 YLADSVVEDVSWIP-NKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
           Y  D     +S +P N+ Y  +    +L +  ++     K I +D+DVI A DI  LW  
Sbjct: 347 YFVDVDARSLSGLPLNRAYISINTYYRLLIHDLIDAD--KIIYIDSDVIVADDILNLWNF 404

Query: 232 FSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWR 291
              +      G ++       G +        A     N GV++ +L +++   +    R
Sbjct: 405 --DVTDACIAGALDEG-----GVMQSRRLSLGANSNYINAGVLVFNLKEIK-ARYKDPLR 456

Query: 292 IIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLS-------DNTRSDELC 344
           +  E +   R   SL DQDI N         +  LP +WNV          D   S    
Sbjct: 457 LYLETYYFNRDLISLQDQDILNLAFKNE---IKVLPLKWNVNGRIFEVNELDFKYSKADI 513

Query: 345 YTELTDLKIIHWNSPKK 361
              L DL IIH+   KK
Sbjct: 514 NEALNDLGIIHYTDHKK 530


>gi|300175897|emb|CBK21893.2| unnamed protein product [Blastocystis hominis]
          Length = 276

 Score = 45.4 bits (106), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 45/198 (22%), Positives = 73/198 (36%), Gaps = 73/198 (36%)

Query: 537 HVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPY----------LKSSIR 586
           H+V   GN YPIN LRN+A++ V+T + F+ D+D  P+ G+ P+             +I 
Sbjct: 35  HLVLAYGNDYPINYLRNLAIDAVATSHFFVADMDVWPV-GISPFPLFSDFVGDVYNQAIS 93

Query: 587 SMDMH-------------------GHGGKKVLVVPAFETQRYRTAFPASHAPT------- 620
            ++M                     +     ++VP +E  R    F  +  PT       
Sbjct: 94  ILNMQPDPNYSGPINFNPAMVGSVANRNTTAIIVPVYEYMRACNKF--ATCPTMRENEIP 151

Query: 621 ----NFSRWVNATTPYQIEWAPDF----------------------------EPYIVAHR 648
                  R +  +T  Q     D                             EPY++  +
Sbjct: 152 ATKKALRRCLETSTCRQFRQGQDLHNYYFDEWLDSNYTQPLTLVKCFVKDKQEPYVILKK 211

Query: 649 --DLPRYDTRFVGFGWNK 664
              +PR+D RFV +G+NK
Sbjct: 212 TEGMPRFDERFVNYGYNK 229


>gi|423269915|ref|ZP_17248887.1| hypothetical protein HMPREF1079_01969 [Bacteroides fragilis
           CL05T00C42]
 gi|423272630|ref|ZP_17251577.1| hypothetical protein HMPREF1080_00230 [Bacteroides fragilis
           CL05T12C13]
 gi|392700761|gb|EIY93923.1| hypothetical protein HMPREF1079_01969 [Bacteroides fragilis
           CL05T00C42]
 gi|392708707|gb|EIZ01812.1| hypothetical protein HMPREF1080_00230 [Bacteroides fragilis
           CL05T12C13]
          Length = 311

 Score = 45.4 bits (106), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 48/194 (24%), Positives = 86/194 (44%), Gaps = 27/194 (13%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVEN---QSDWYLGK 254
           +  L ++LP  + K + +D D++   DI++ W   + + Q   IG +E+     + Y  +
Sbjct: 89  RCLLSRILPVNIDKILYMDCDIVVLNDISEFWD--TDITQ-YAIGCIEDIGSDEEEYYSR 145

Query: 255 LWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNA 314
           L  + K        FN GV+L++L   R+        +  + FL         DQD+ NA
Sbjct: 146 LQYDKK-----YSYFNAGVLLINLKYWRE---HKIDEMCEQYFLAHSDRIRFNDQDLLNA 197

Query: 315 IISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLK-------IIHWNSPKKLKVKNK 367
           ++ +    V   P +WNVQ +   R+      E + LK       I+H+ + K     + 
Sbjct: 198 LLYKDKLFV---PFRWNVQDTFYRRTYSHKVKEHSGLKEALLHPAILHYTNKKPWNYDSM 254

Query: 368 H---MEFFRNLYLT 378
           H    E+F+ L +T
Sbjct: 255 HPLKQEYFKYLDMT 268


>gi|237751063|ref|ZP_04581543.1| glycosyl transferase [Helicobacter bilis ATCC 43879]
 gi|229373508|gb|EEO23899.1| glycosyl transferase [Helicobacter bilis ATCC 43879]
          Length = 365

 Score = 45.4 bits (106), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 30/98 (30%), Positives = 53/98 (54%), Gaps = 13/98 (13%)

Query: 46  HFHLITDTVA------LNILQTLFSTWSVPQVEVSFYLADSVVEDV---SWIPNKHYSGV 96
           HFH+ITD++A       +ILQT  +     Q+E    ++D   +D+    +   + Y+  
Sbjct: 35  HFHVITDSIAKKTLEQFHILQTTLNDIYPCQIEAHI-ISDEDFKDLPKWGYEEAQQYAAY 93

Query: 97  YGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 134
           Y   ++ L   LP+ + K + LDTD++  TD+ +L+AL
Sbjct: 94  Y---RVKLVDFLPKNVDKCLYLDTDMLVLTDLRELFAL 128


>gi|60679779|ref|YP_209923.1| lipopolysaccharide 1,2-glucosyltransferase/general stress protein
           [Bacteroides fragilis NCTC 9343]
 gi|375356564|ref|YP_005109335.1| putative lipopolysaccharide 1,2-glucosyltransferase/general stress
           protein [Bacteroides fragilis 638R]
 gi|383116528|ref|ZP_09937276.1| hypothetical protein BSHG_1402 [Bacteroides sp. 3_2_5]
 gi|423248186|ref|ZP_17229202.1| hypothetical protein HMPREF1066_00212 [Bacteroides fragilis
           CL03T00C08]
 gi|423253135|ref|ZP_17234066.1| hypothetical protein HMPREF1067_00710 [Bacteroides fragilis
           CL03T12C07]
 gi|423259422|ref|ZP_17240345.1| hypothetical protein HMPREF1055_02622 [Bacteroides fragilis
           CL07T00C01]
 gi|423263604|ref|ZP_17242607.1| hypothetical protein HMPREF1056_00294 [Bacteroides fragilis
           CL07T12C05]
 gi|423282507|ref|ZP_17261392.1| hypothetical protein HMPREF1204_00930 [Bacteroides fragilis HMW
           615]
 gi|60491213|emb|CAH05961.1| putative lipopolysaccharide 1,2-glucosyltransferase/general stress
           protein [Bacteroides fragilis NCTC 9343]
 gi|251948201|gb|EES88483.1| hypothetical protein BSHG_1402 [Bacteroides sp. 3_2_5]
 gi|301161244|emb|CBW20782.1| putative lipopolysaccharide 1,2-glucosyltransferase/general stress
           protein [Bacteroides fragilis 638R]
 gi|387777002|gb|EIK39102.1| hypothetical protein HMPREF1055_02622 [Bacteroides fragilis
           CL07T00C01]
 gi|392657035|gb|EIY50672.1| hypothetical protein HMPREF1067_00710 [Bacteroides fragilis
           CL03T12C07]
 gi|392660293|gb|EIY53907.1| hypothetical protein HMPREF1066_00212 [Bacteroides fragilis
           CL03T00C08]
 gi|392707026|gb|EIZ00146.1| hypothetical protein HMPREF1056_00294 [Bacteroides fragilis
           CL07T12C05]
 gi|404582075|gb|EKA86770.1| hypothetical protein HMPREF1204_00930 [Bacteroides fragilis HMW
           615]
          Length = 311

 Score = 45.1 bits (105), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 48/194 (24%), Positives = 86/194 (44%), Gaps = 27/194 (13%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVEN---QSDWYLGK 254
           +  L ++LP  + K + +D D++   DI++ W   + + Q   IG +E+     + Y  +
Sbjct: 89  RCLLSRILPVNIDKILYIDCDIVVLNDISEFWN--TDITQ-YAIGCIEDIGSDEEEYYSR 145

Query: 255 LWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNA 314
           L  + K        FN GV+L++L   R+        +  + FL         DQD+ NA
Sbjct: 146 LQYDKK-----YSYFNAGVLLINLKYWRE---HKIDEMCEQYFLAHSDRIRFNDQDLLNA 197

Query: 315 IISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLK-------IIHWNSPKKLKVKNK 367
           ++ +    V   P +WNVQ +   R+      E + LK       I+H+ + K     + 
Sbjct: 198 LLYKDKLFV---PFRWNVQDTFYRRTYSHKVKEHSGLKEALLHPAILHYTNKKPWNYDSM 254

Query: 368 H---MEFFRNLYLT 378
           H    E+F+ L +T
Sbjct: 255 HPLKQEYFKYLDMT 268


>gi|420236654|ref|ZP_14741135.1| glycosyl transferase, family 8 [Parascardovia denticolens IPLA
           20019]
 gi|391880108|gb|EIT88604.1| glycosyl transferase, family 8 [Parascardovia denticolens IPLA
           20019]
          Length = 324

 Score = 45.1 bits (105), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 44/166 (26%), Positives = 69/166 (41%), Gaps = 25/166 (15%)

Query: 197 LKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL-----FSRLRQRQTIGLVENQSDWY 251
            +L L ++ P+ + K I LD D +   DIAQL+ +     F        +   E  S + 
Sbjct: 99  FRLFLSEMFPD-IDKAIYLDADTVTNVDIAQLYRIDLGDNFVAAVNDNFVAAGEETSYYT 157

Query: 252 LGKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDI 311
           L  L      +       N+GV+L++L  +R+   AGF     +      + +  ADQD 
Sbjct: 158 LNALGIPSSEY------VNSGVLLMNLKAMRE---AGFVDHFVKLLNAYHVESIAADQDY 208

Query: 312 FNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
            N I       +  L  +WN  ++D T   E         KIIH+N
Sbjct: 209 LNVICRGR---ILMLGYEWNTMMADGTSGPEHP-------KIIHYN 244


>gi|218130745|ref|ZP_03459549.1| hypothetical protein BACEGG_02336 [Bacteroides eggerthii DSM 20697]
 gi|217987089|gb|EEC53420.1| general stress protein A [Bacteroides eggerthii DSM 20697]
          Length = 321

 Score = 45.1 bits (105), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 43/193 (22%), Positives = 90/193 (46%), Gaps = 23/193 (11%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKL-- 255
           +L +P++LP  + KT  LD D++    + + W   ++++ +    + +  +D+       
Sbjct: 104 RLFIPQILPLNIKKTFYLDCDLLVLHPLREFWN--TKMQNKGVAAIADQWTDYIEAATRL 161

Query: 256 -WKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNA 314
            ++N + +      FN GV+LL+L  LR+ ++         K     ++    DQD+ N 
Sbjct: 162 KYRNDREY------FNAGVLLLNLEYLRNHNFTNNAIDFVTKHANDIVY---HDQDVLNK 212

Query: 315 IISEHPYLVYTLPCQWNV-QLSDNTRSDELCYTELTDLK----IIHWNSPKKLKVKNKHM 369
           +I E+  +   +P +WNV     N +   +    + D +    IIH+ +P K   ++   
Sbjct: 213 LIGENRII---MPVKWNVCSFKINDKIPHIYNATMNDARKDPYIIHFFAPIKPWNQDSSH 269

Query: 370 EFFRNLYLTFLEY 382
             +R+ Y  FL++
Sbjct: 270 P-YRSYYYYFLQF 281


>gi|53711516|ref|YP_097508.1| lipopolysaccharide 1,2-glucosyltransferase [Bacteroides fragilis
           YCH46]
 gi|52214381|dbj|BAD46974.1| lipopolysaccharide 1,2-glucosyltransferase [Bacteroides fragilis
           YCH46]
          Length = 311

 Score = 45.1 bits (105), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 48/194 (24%), Positives = 86/194 (44%), Gaps = 27/194 (13%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVEN---QSDWYLGK 254
           +  L ++LP  + K + +D D++   DI++ W   + + Q   IG +E+     + Y  +
Sbjct: 89  RCLLSRILPVNIDKILYIDCDIVVLNDISEFWD--TDITQ-YAIGCIEDIGSDEEEYYSR 145

Query: 255 LWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNA 314
           L  + K        FN GV+L++L   R+        +  + FL         DQD+ NA
Sbjct: 146 LQYDKK-----YSYFNAGVLLINLKYWRE---HKIDEMCEQYFLAHSDRIRFNDQDLLNA 197

Query: 315 IISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLK-------IIHWNSPKKLKVKNK 367
           ++ +    V   P +WNVQ +   R+      E + LK       I+H+ + K     + 
Sbjct: 198 LLYKDKLFV---PFRWNVQDTFYRRTYSHKVKEHSGLKEALLHPAILHYTNKKPWNYDSM 254

Query: 368 H---MEFFRNLYLT 378
           H    E+F+ L +T
Sbjct: 255 HPLKQEYFKYLDMT 268


>gi|302848747|ref|XP_002955905.1| hypothetical protein VOLCADRAFT_96805 [Volvox carteri f.
           nagariensis]
 gi|300258873|gb|EFJ43106.1| hypothetical protein VOLCADRAFT_96805 [Volvox carteri f.
           nagariensis]
          Length = 1206

 Score = 45.1 bits (105), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 23/54 (42%), Positives = 31/54 (57%), Gaps = 2/54 (3%)

Query: 613 FPASHAPTNFSRWV-NATTPYQIEWAPDFEPYIVAHRD-LPRYDTRFVGFGWNK 664
           FPA H  T++ RW   A   Y I +A  FEP+ +A R  +P +D R  G+G NK
Sbjct: 888 FPAGHGATDYLRWFKQANEAYDISYARKFEPWFMAGRQVVPWHDVRLRGYGHNK 941


>gi|302828420|ref|XP_002945777.1| hypothetical protein VOLCADRAFT_127353 [Volvox carteri f.
           nagariensis]
 gi|217416170|dbj|BAH03159.1| InvC [Volvox carteri f. nagariensis]
 gi|300268592|gb|EFJ52772.1| hypothetical protein VOLCADRAFT_127353 [Volvox carteri f.
           nagariensis]
          Length = 401

 Score = 45.1 bits (105), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 51/243 (20%), Positives = 87/243 (35%), Gaps = 62/243 (25%)

Query: 479 VTLVAQLSMDRLQMVEMLFKHWEG-PISLTLYMSDAEA---------------------- 515
           VT+    ++DRL M++   +++ G P    +Y+   +A                      
Sbjct: 24  VTMFTTATLDRLDMLDTQCRNYRGGPHVAAVYLPLVQAANKSGLTNENEQAVRTAEEKLQ 83

Query: 516 QQFLSYAGNSEALRSRRNIGYHVVYKEGNFY--PINTLRNVALNQVSTPYVFLLDIDFLP 573
           + F      +EA        Y V       Y  PIN LRN AL    T  V ++D+D   
Sbjct: 84  ELFYRMEATTEACHMHILFLYEVTSDSALAYLTPINALRNAALLASQTQLVMMIDVDLCV 143

Query: 574 MFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRY------------------------ 609
              L  ++  S     +         V+PA++  +                         
Sbjct: 144 SNTLVQFVSESKYIDQLVRMSSNSFWVLPAWDVNKQLPQEQINEVAASAASGTKMQLVKL 203

Query: 610 ----------RTAFPASHAPTNFSRWVNATTPYQIEWAPDFEPYIVAHRD---LPRYDTR 656
                     +  F   H+PTN++RW+ +   Y I+++  +EP+ +  RD   +  YD R
Sbjct: 204 WLNGSLNWFGQHYFQLGHSPTNYTRWLMSRRSYVIDYSIGYEPWGLVSRDRMAIVPYDAR 263

Query: 657 FVG 659
           F G
Sbjct: 264 FRG 266


>gi|357623047|gb|EHJ74352.1| hypothetical protein KGM_22246 [Danaus plexippus]
          Length = 398

 Score = 45.1 bits (105), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 30/75 (40%), Positives = 38/75 (50%), Gaps = 7/75 (9%)

Query: 541 KEGNF-YPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSM--DMHGHGG-- 595
           +E N  YPIN  RNVA N   T Y  + DI+ +P  GL P   + +R +  D     G  
Sbjct: 167 RENNMIYPINVGRNVARNASRTNYFIVSDIELVPSNGLAPKFLAMLRKLMGDKKRDEGCI 226

Query: 596 --KKVLVVPAFETQR 608
             K V VVP FE +R
Sbjct: 227 FSKTVFVVPLFEVER 241


>gi|317474579|ref|ZP_07933853.1| glycosyl transferase family 8 [Bacteroides eggerthii 1_2_48FAA]
 gi|316909260|gb|EFV30940.1| glycosyl transferase family 8 [Bacteroides eggerthii 1_2_48FAA]
          Length = 313

 Score = 44.7 bits (104), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 39/141 (27%), Positives = 64/141 (45%), Gaps = 21/141 (14%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +  LP +LP  L+K I LD+D++    I ++W   + L      G+ E +S        K
Sbjct: 88  RCMLPSLLPSQLSKAIYLDSDILVLDSIKEIWN--TDLNNIAIAGIEEARS--------K 137

Query: 258 NHKPWPALG-----RGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIF 312
             K    LG     R  N GV+L++L   R  +     R    K +   L+    DQD+ 
Sbjct: 138 EDKHCDRLGYAPSYRYINAGVLLINLDYWRKYNIEEKCRQYYAKNIDRMLYN---DQDLL 194

Query: 313 NAIISEHPYLVYTLPCQWNVQ 333
           NA++ +   ++   P ++NVQ
Sbjct: 195 NALLYDKKAVI---PTRYNVQ 212


>gi|294786378|ref|ZP_06751632.1| glycosyltransferase [Parascardovia denticolens F0305]
 gi|315225938|ref|ZP_07867726.1| glycosyltransferase [Parascardovia denticolens DSM 10105 = JCM
           12538]
 gi|294485211|gb|EFG32845.1| glycosyltransferase [Parascardovia denticolens F0305]
 gi|315120070|gb|EFT83202.1| glycosyltransferase [Parascardovia denticolens DSM 10105 = JCM
           12538]
          Length = 324

 Score = 44.7 bits (104), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 44/166 (26%), Positives = 69/166 (41%), Gaps = 25/166 (15%)

Query: 197 LKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL-----FSRLRQRQTIGLVENQSDWY 251
            +L L ++ P+ + K I LD D +   DIAQL+ +     F        +   E  S + 
Sbjct: 99  FRLFLSEMFPD-IDKAIYLDADTVTNVDIAQLYRIDLGDNFFAAVNDNFVAAGEETSYYT 157

Query: 252 LGKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDI 311
           L  L      +       N+GV+L++L  +R+   AGF     +      + +  ADQD 
Sbjct: 158 LNALGIPSSEY------VNSGVLLMNLKAMRE---AGFVDHFVKLLNAYHVESIAADQDY 208

Query: 312 FNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
            N I       +  L  +WN  ++D T   E         KIIH+N
Sbjct: 209 LNVICRGR---ILMLGYEWNTMMADGTSGPEHP-------KIIHYN 244


>gi|431910232|gb|ELK13305.1| N-acetyllactosaminide beta-1,3-N-acetylglucosaminyltransferase
           [Pteropus alecto]
          Length = 221

 Score = 44.7 bits (104), Expect = 0.15,   Method: Composition-based stats.
 Identities = 23/72 (31%), Positives = 38/72 (52%), Gaps = 3/72 (4%)

Query: 471 EASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQF---LSYAGNSEA 527
           + + D NDV L    S+D L  +  L + WEGP+S++++ +  E  Q    L+YA +S  
Sbjct: 85  KTTMDPNDVILATHASVDNLLHLSGLLERWEGPLSVSVFAATKEEAQLATVLTYALSSHC 144

Query: 528 LRSRRNIGYHVV 539
              R  +  H+V
Sbjct: 145 PDMRARVAMHLV 156


>gi|42518147|ref|NP_964077.1| hypothetical protein LJ0061 [Lactobacillus johnsonii NCC 533]
 gi|227889100|ref|ZP_04006905.1| lipopolysaccharide biosynthesis glycosyltransferase [Lactobacillus
           johnsonii ATCC 33200]
 gi|385825016|ref|YP_005861358.1| hypothetical protein [Lactobacillus johnsonii DPC 6026]
 gi|41582431|gb|AAS08043.1| hypothetical protein LJ_0061 [Lactobacillus johnsonii NCC 533]
 gi|227850329|gb|EEJ60415.1| lipopolysaccharide biosynthesis glycosyltransferase [Lactobacillus
           johnsonii ATCC 33200]
 gi|329666460|gb|AEB92408.1| hypothetical protein LJP_0069c [Lactobacillus johnsonii DPC 6026]
          Length = 316

 Score = 44.7 bits (104), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 40/163 (24%), Positives = 71/163 (43%), Gaps = 20/163 (12%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L +P + P+   K + LD D I  TDIA+L   ++          V + S  ++  L  
Sbjct: 94  RLFIPNLFPQ-YDKAVYLDADTIICTDIAEL---YNTEIGDNMFASVPDMSIRFIKPLQV 149

Query: 258 NHKPWPAL---GRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNA 314
             K    +    +  N GVIL ++ + RD  +   +  + EK+    +     DQ   N 
Sbjct: 150 YIKECQGIFPPEKYINNGVILFNMKEFRDKKFVDKFYSLIEKYHFDNI---DPDQAYMNE 206

Query: 315 IISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
           I  +    +Y LP +W+   +++         E+ D KI+H+N
Sbjct: 207 ICEDK---IYHLPLEWDAMPNEHM-------DEIKDPKIVHYN 239


>gi|417838481|ref|ZP_12484719.1| glycosyl transferase family 8 domain containing protein
           [Lactobacillus johnsonii pf01]
 gi|338762024|gb|EGP13293.1| glycosyl transferase family 8 domain containing protein
           [Lactobacillus johnsonii pf01]
          Length = 316

 Score = 44.7 bits (104), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 40/163 (24%), Positives = 71/163 (43%), Gaps = 20/163 (12%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L +P + P+   K + LD D I  TDIA+L   ++          V + S  ++  L  
Sbjct: 94  RLFIPNLFPQ-YDKAVYLDADTIICTDIAEL---YNTEIGDNMFASVPDMSIRFIKPLQV 149

Query: 258 NHKPWPAL---GRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNA 314
             K    +    +  N GVIL ++ + RD  +   +  + EK+    +     DQ   N 
Sbjct: 150 YIKECQGIFPPEKYINNGVILFNMKEFRDKKFVDKFYSLIEKYHFDNI---DPDQAYMNE 206

Query: 315 IISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
           I  +    +Y LP +W+   +++         E+ D KI+H+N
Sbjct: 207 ICEDK---IYHLPLEWDAMPNEHM-------DEIKDPKIVHYN 239


>gi|356499719|ref|XP_003518684.1| PREDICTED: probable galacturonosyltransferase-like 3-like [Glycine
           max]
          Length = 342

 Score = 44.7 bits (104), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 28/94 (29%), Positives = 50/94 (53%), Gaps = 11/94 (11%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVE----NQSDWYLG 253
           ++ L  +LP T+ + I  D+D+I   D+A+LW++    R    +G  E    N ++++  
Sbjct: 144 RMYLADLLPATVRRIIYFDSDLIVVDDVAKLWSIDLHAR---VLGAPEYCHANFTNYFTH 200

Query: 254 KLWKNHKPWPALGRG----FNTGVILLDLTKLRD 283
           + W N     +  R     FNTGV+++DL K R+
Sbjct: 201 RFWSNPSYAASFKRRDACYFNTGVMVIDLWKWRE 234


>gi|365175698|ref|ZP_09363125.1| hypothetical protein HMPREF1006_01070 [Synergistes sp. 3_1_syn1]
 gi|363612266|gb|EHL63811.1| hypothetical protein HMPREF1006_01070 [Synergistes sp. 3_1_syn1]
          Length = 341

 Score = 44.3 bits (103), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 39/132 (29%), Positives = 63/132 (47%), Gaps = 19/132 (14%)

Query: 196 LLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKL 255
           L +L +P+++ E   K + LD D+I   DIA+LW   +     +TIG V +   W L  +
Sbjct: 94  LFRLLIPQLINEK--KILYLDCDLIIDLDIAELW---NEPLDGKTIGAVCDV--WSLDYI 146

Query: 256 ------WKNHKPWPALG----RGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTS 305
                 W+    W  +G      FN GV ++DL ++RD     F + +   +   +   +
Sbjct: 147 KGTPVPWRYALAWEIMGIRNDSYFNAGVTIMDLERIRD--RYDFLKEVETFYDKYKKCIT 204

Query: 306 LADQDIFNAIIS 317
           LADQD  N I +
Sbjct: 205 LADQDCLNYIFA 216


>gi|297672811|ref|XP_002814481.1| PREDICTED: uncharacterized protein LOC100458876 [Pongo abelii]
          Length = 570

 Score = 44.3 bits (103), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 34/101 (33%), Positives = 47/101 (46%), Gaps = 15/101 (14%)

Query: 269 FNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTS------LADQDIFNAIISEHPYL 322
           FN+GV+LL+L  +R    +  +  + E   + +L         L DQD F  I  EHP L
Sbjct: 465 FNSGVMLLNLEAMRQ---SPLYSRLLEPAQVQQLADKYHFRGHLGDQDFFTMIGMEHPKL 521

Query: 323 VYTLPCQWNVQLS----DNTRSD--ELCYTELTDLKIIHWN 357
            + L C WN QL     D+  SD  E  +     +KI H N
Sbjct: 522 FHVLDCTWNRQLCTWWRDHGYSDVFEAYFRCEGHVKIYHGN 562


>gi|81299339|ref|YP_399547.1| lipopolysaccharide biosynthesis proteins LPS [Synechococcus
           elongatus PCC 7942]
 gi|81168220|gb|ABB56560.1| Lipopolysaccharide biosynthesis proteins LPS [Synechococcus
           elongatus PCC 7942]
          Length = 329

 Score = 44.3 bits (103), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 48/193 (24%), Positives = 77/193 (39%), Gaps = 9/193 (4%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLV-ENQSDW-YLGKL 255
           ++ LP+V P+ L + +  DTDV+   D A L         +     V  ++  W Y  K 
Sbjct: 139 RVWLPQVFPD-LTRILYFDTDVVLLEDPAILDQQAGDFNDQIFFAAVPHSRPAWLYFKKP 197

Query: 256 WKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAI 315
           W+ H    A+G  FN+GV++ DL    +  +      +         +    D+ + NA 
Sbjct: 198 WRAHSYIKAMGTTFNSGVMVTDLRFWTEAVYQRIQAALDRDRQFRYRFLEPGDEALLNAC 257

Query: 316 ISEHPYLVYTLPCQWNVQLSDNTR-SDELCYTELTDLKIIHWNSPKKLKVKNKHMEFFRN 374
              +      LP +WN     N R    L   +  +  IIHW S    K  N H   + +
Sbjct: 258 FPNY----RALPKRWNRCGYGNARFVARLLACDPQEAAIIHW-SGGHHKPWNTHDIIYGD 312

Query: 375 LYLTFLEYDGNLL 387
           L+  +    G LL
Sbjct: 313 LWRRYANLPGLLL 325


>gi|366054099|ref|ZP_09451821.1| glycosyl transferase family protein [Lactobacillus suebicus KCTC
           3549]
          Length = 316

 Score = 44.3 bits (103), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 44/166 (26%), Positives = 76/166 (45%), Gaps = 27/166 (16%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L +  + P+   K I LD D+I   DI+   +LF +      +G V   +D ++  L  
Sbjct: 97  RLFIADMFPQ-YDKAIYLDADIIVTGDIS---SLFDKNIDNNLMGAV---NDRFIA-LDP 148

Query: 258 NHKPWPALGRG------FNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDI 311
           N   +     G       N+GV+LL+L ++R  + +G +  +  +F      T   DQD 
Sbjct: 149 NGSSYAREAIGVDGSLYVNSGVLLLNLDEIRQFNLSGRFLDLLRRF---HFDTIAPDQDY 205

Query: 312 FNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
            NAI  +    ++ L   WN Q +       +    +TD++IIH+N
Sbjct: 206 LNAICQDR---IFQLDPAWNYQTA-------VLDDSVTDIQIIHFN 241


>gi|56751002|ref|YP_171703.1| hypothetical protein syc0993_d [Synechococcus elongatus PCC 6301]
 gi|56685961|dbj|BAD79183.1| unknown protein [Synechococcus elongatus PCC 6301]
          Length = 250

 Score = 44.3 bits (103), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 48/193 (24%), Positives = 77/193 (39%), Gaps = 9/193 (4%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLV-ENQSDW-YLGKL 255
           ++ LP+V P+ L + +  DTDV+   D A L         +     V  ++  W Y  K 
Sbjct: 60  RVWLPQVFPD-LTRILYFDTDVVLLEDPAILDQQAGDFNDQIFFAAVPHSRPAWLYFKKP 118

Query: 256 WKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAI 315
           W+ H    A+G  FN+GV++ DL    +  +      +         +    D+ + NA 
Sbjct: 119 WRAHSYIKAMGTTFNSGVMVTDLRFWTEAVYQRIQAALDRDRQFRYRFLEPGDEALLNAC 178

Query: 316 ISEHPYLVYTLPCQWNVQLSDNTR-SDELCYTELTDLKIIHWNSPKKLKVKNKHMEFFRN 374
              +      LP +WN     N R    L   +  +  IIHW S    K  N H   + +
Sbjct: 179 FPNY----RALPKRWNRCGYGNARFVARLLACDPQEAAIIHW-SGGHHKPWNTHDIIYGD 233

Query: 375 LYLTFLEYDGNLL 387
           L+  +    G LL
Sbjct: 234 LWRRYANLPGLLL 246


>gi|30689156|ref|NP_189474.2| putative galacturonosyltransferase-like 10 [Arabidopsis thaliana]
 gi|75273232|sp|Q9LHD2.1|GATLA_ARATH RecName: Full=Probable galacturonosyltransferase-like 10; AltName:
           Full=Galactinol synthase 8; Short=AtGolS8; Short=GolS-8
 gi|11994580|dbj|BAB02626.1| glycosyl transferase-like protein [Arabidopsis thaliana]
 gi|44917577|gb|AAS49113.1| At3g28340 [Arabidopsis thaliana]
 gi|51971391|dbj|BAD44360.1| unknown protein [Arabidopsis thaliana]
 gi|332643914|gb|AEE77435.1| putative galacturonosyltransferase-like 10 [Arabidopsis thaliana]
          Length = 365

 Score = 44.3 bits (103), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 28/91 (30%), Positives = 46/91 (50%), Gaps = 10/91 (10%)

Query: 201 LPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVE----NQSDWYLGKLW 256
           L ++L   +++ I LD+DVI   DI +LW +   L   +TIG  E    N + ++    W
Sbjct: 162 LSEILSSCVSRVIYLDSDVIVVDDIQKLWKI--SLSGSRTIGAPEYCHANFTKYFTDSFW 219

Query: 257 KNHKPWPALGRG----FNTGVILLDLTKLRD 283
            + K            FNTGV+++DL + R+
Sbjct: 220 SDQKLSSVFDSKTPCYFNTGVMVIDLERWRE 250



 Score = 40.0 bits (92), Expect = 4.6,   Method: Compositional matrix adjust.
 Identities = 36/143 (25%), Positives = 64/143 (44%), Gaps = 9/143 (6%)

Query: 28  RSLVTLIKSILFYRKNP--LHFHLI-TDTVALNILQTLFSTWSVPQVEVSFYLADSVVED 84
           R  V+ + SIL +   P  + FH I + T   ++ +TL S +     +V  +   +V   
Sbjct: 83  RGTVSAVHSILKHTSCPENIFFHFIASGTSQGSLAKTLSSVFPSLSFKVYTFEETTVKNL 142

Query: 85  VSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTI 144
           +S    +         +  L ++L   +++ I LD+DVI   DI +LW +   L   +TI
Sbjct: 143 ISSSIRQALDSPLNYARSYLSEILSSCVSRVIYLDSDVIVVDDIQKLWKI--SLSGSRTI 200

Query: 145 GLVE----NQSDWYLGKLWKNHK 163
           G  E    N + ++    W + K
Sbjct: 201 GAPEYCHANFTKYFTDSFWSDQK 223


>gi|86152043|ref|ZP_01070256.1| putative sugar transferase [Campylobacter jejuni subsp. jejuni
           260.94]
 gi|315124873|ref|YP_004066877.1| sugar transferase [Campylobacter jejuni subsp. jejuni ICDCCJ07001]
 gi|56783473|emb|CAI38726.1| putative sugar transferase [Campylobacter jejuni]
 gi|85841151|gb|EAQ58400.1| putative sugar transferase [Campylobacter jejuni subsp. jejuni
           260.94]
 gi|315018595|gb|ADT66688.1| putative sugar transferase [Campylobacter jejuni subsp. jejuni
           ICDCCJ07001]
          Length = 459

 Score = 43.9 bits (102), Expect = 0.25,   Method: Compositional matrix adjust.
 Identities = 40/179 (22%), Positives = 84/179 (46%), Gaps = 34/179 (18%)

Query: 165 WPALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATD 224
           +P+  + +++ D++  D + IP KH       L+L L  +L + + K + LD D++   D
Sbjct: 63  YPSEIKIYHINDNLFYDYN-IP-KHEGSYNAYLRLMLASILSKDIKKCLYLDVDMLVLGD 120

Query: 225 IAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPAL------------GRGFNTG 272
           I++L+ L  + +    + ++++              PWP L            G  FN+G
Sbjct: 121 ISELFDLDLKDKVFAAVFILKH--------------PWPNLNSKDSSEIFYIYGSHFNSG 166

Query: 273 VILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWN 331
           ++L++L   R+ +     R ++  F+         D+ + NAI+S+    +++L  +WN
Sbjct: 167 LMLINLDAWREKNIES--RSLS--FIKNYYVPYAVDEYVLNAILSKDD--IFSLKLEWN 219


>gi|418087438|ref|ZP_12724607.1| hypothetical protein SPAR87_1223 [Streptococcus pneumoniae GA47033]
 gi|353758454|gb|EHD39046.1| hypothetical protein SPAR87_1223 [Streptococcus pneumoniae GA47033]
          Length = 1077

 Score = 43.9 bits (102), Expect = 0.25,   Method: Compositional matrix adjust.
 Identities = 43/173 (24%), Positives = 77/173 (44%), Gaps = 30/173 (17%)

Query: 151 SDWYLGKLWKNHKPWPALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLA 210
           SDW+  +L    +   ++  G Y+ D++  DV W   +H S +    +  +P+++ E   
Sbjct: 41  SDWF-HELKIQVEKLGSVVEGIYIGDTI--DVEWKTQEHISPI-AYARYLIPRLIKE--E 94

Query: 211 KTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFN 270
           + I LD+D+I   D++ L+ L              +  D+ +  +        A G GFN
Sbjct: 95  RVIYLDSDIIVNGDLSSLFEL--------------DFGDYSIAAVRD------ADGNGFN 134

Query: 271 TGVILLDLTKLR--DISWAGFWRIIAEKFLLTRLWTSL--ADQDIFNAIISEH 319
           +GV+++D  K R  D++   F + +     L    T     DQ IFN +   H
Sbjct: 135 SGVLVIDSQKWREKDVTSILFDKTVEYMSYLDHTDTDRFNGDQTIFNLVFQNH 187


>gi|297818430|ref|XP_002877098.1| hypothetical protein ARALYDRAFT_905093 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297322936|gb|EFH53357.1| hypothetical protein ARALYDRAFT_905093 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 364

 Score = 43.9 bits (102), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 28/91 (30%), Positives = 46/91 (50%), Gaps = 10/91 (10%)

Query: 201 LPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVE----NQSDWYLGKLW 256
           L ++L   +++ I LD+DVI   DI +LW +   L   +TIG  E    N + ++    W
Sbjct: 161 LSEILSSCVSRVIYLDSDVIVVDDIQKLWKI--SLSGSRTIGAPEYCHANFTKYFTESFW 218

Query: 257 KNHKPWPALGRG----FNTGVILLDLTKLRD 283
            + K            FNTGV+++DL + R+
Sbjct: 219 SDRKLSSVFDSKTPCYFNTGVMVIDLDRWRE 249


>gi|384110005|ref|ZP_10010852.1| Lipopolysaccharide biosynthesis protein [Treponema sp. JC4]
 gi|383868445|gb|EID84097.1| Lipopolysaccharide biosynthesis protein [Treponema sp. JC4]
          Length = 350

 Score = 43.9 bits (102), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 43/168 (25%), Positives = 71/168 (42%), Gaps = 33/168 (19%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +  +P + P+   K I LD D+I   DI++L+           I L  N       +++ 
Sbjct: 95  RFFIPNLFPK-YKKIIYLDCDIIVKGDISELY----------NIDLGNNYVAAAPEEVFI 143

Query: 258 NHKPWPALGRG---------FNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLAD 308
            H  W    RG         FN GV+LL+  ++     A  +  +AEK+     +  + D
Sbjct: 144 LHPNWLNYPRGALGIEPEDYFNAGVMLLNTERMLKDKIAEKFVALAEKY----TFRIVQD 199

Query: 309 QDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHW 356
           +D  N ++ +H  +   LP  WN     N   D+       +LKIIH+
Sbjct: 200 EDYLNVLLKDHKKI---LPLDWNKTAFKNDAYDD------KNLKIIHY 238


>gi|357445497|ref|XP_003593026.1| UDP-glucose:glycoprotein glucosyltransferase [Medicago truncatula]
 gi|355482074|gb|AES63277.1| UDP-glucose:glycoprotein glucosyltransferase [Medicago truncatula]
          Length = 1650

 Score = 43.9 bits (102), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 39/141 (27%), Positives = 68/141 (48%), Gaps = 18/141 (12%)

Query: 199  LTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSD------WYL 252
            L L  + P +L K I +D D I  TD+ +L+ +  + R        +N  +      W  
Sbjct: 1431 LFLDVIFPLSLEKVIFVDADQIVRTDMGELYDMDLKGRPLAYTPFCDNNREMDGYRFWRQ 1490

Query: 253  GKLWKNHKPWPALGRGFN-TGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLA--DQ 309
            G  WK+H      GR ++ + + ++DL K R+ +     R+  E   L++   SLA  DQ
Sbjct: 1491 G-FWKDH----LRGRPYHISALYVVDLKKFRETAAGDNLRVFYET--LSKDPNSLANLDQ 1543

Query: 310  DIFNAIISEHPYLVYTLPCQW 330
            D+ N   ++H   +++LP +W
Sbjct: 1544 DLPN--YAQHTVPIFSLPQEW 1562


>gi|381150345|ref|ZP_09862214.1| LPS:glycosyltransferase [Methylomicrobium album BG8]
 gi|380882317|gb|EIC28194.1| LPS:glycosyltransferase [Methylomicrobium album BG8]
          Length = 311

 Score = 43.9 bits (102), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 33/112 (29%), Positives = 54/112 (48%), Gaps = 16/112 (14%)

Query: 181 DVSWIPNKHYSGVYGLLKLT-----LPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRL 235
           +V   P + +S +  + ++T     +P V PET++K + LD D++   DIA L       
Sbjct: 71  EVDMKPFREFSTIAHISRITFARFLIPDVFPETVSKVLYLDADILVLDDIAPL------C 124

Query: 236 RQRQTIGLVENQSDWYLGKLWKNHKPW----PALGRGFNTGVILLDLTKLRD 283
           R      L+   +D YL    K  +P     P +   FN GV+L+DL + R+
Sbjct: 125 RMELNGALLGAVTD-YLDACLKRGEPLFAAVPRVSNYFNAGVLLIDLGRWRE 175


>gi|423331375|ref|ZP_17309159.1| hypothetical protein HMPREF1075_01172 [Parabacteroides distasonis
           CL03T12C09]
 gi|409230671|gb|EKN23533.1| hypothetical protein HMPREF1075_01172 [Parabacteroides distasonis
           CL03T12C09]
          Length = 301

 Score = 43.9 bits (102), Expect = 0.29,   Method: Compositional matrix adjust.
 Identities = 51/198 (25%), Positives = 91/198 (45%), Gaps = 24/198 (12%)

Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
           FY  D  V   +  P   +  +    +L +  +LP+++ K + LD DV+    +  LW  
Sbjct: 61  FYTVDKKV--FANCPISRHITLATYFRLIMTDILPKSVEKVLYLDCDVVVRHSLRSLWDT 118

Query: 232 FSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWA-GFW 290
             +       G++ + S   + +++   +  P+LG  FN GV+L++L   R+ + +  F+
Sbjct: 119 DIK---SYAAGVIPDMSIDDI-RIYNRLQYSPSLGY-FNAGVLLVNLRYWRENNLSESFF 173

Query: 291 RIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQ--------LSDNTRSDE 342
            II +     R      DQD+ N ++ E   +  TLP ++NVQ        L   T  DE
Sbjct: 174 EIINKYPERLR----YHDQDVLNIVLKE---IKLTLPMKYNVQHGYFFKDPLISRTYRDE 226

Query: 343 LCYTELTDLKIIHWNSPK 360
                +TD  I+H++  K
Sbjct: 227 R-EQAITDPVILHYSGSK 243


>gi|339448960|ref|ZP_08652516.1| bifunctional glycosyl transferase family protein [Lactobacillus
           fructivorans KCTC 3543]
          Length = 290

 Score = 43.9 bits (102), Expect = 0.29,   Method: Compositional matrix adjust.
 Identities = 59/200 (29%), Positives = 89/200 (44%), Gaps = 33/200 (16%)

Query: 205 LPETLAK--TIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK---NH 259
           +PE + +   + LD D+I   DI  LW   + L  +  +G VEN    YL +L +   + 
Sbjct: 103 IPEEVKRPRILYLDADMICDGDITGLWQ--TDLGGK-VVGAVENAG--YLDRLREMGVSE 157

Query: 260 KPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEH 319
           KP    GR FN G++L+D  K ++   +   R +A             DQD  NAI + +
Sbjct: 158 KP----GRYFNAGLLLIDTKKWKEQGISQRARNLANDHPEI---LRFQDQDALNAIFNGN 210

Query: 320 PYLVYTLPCQWNVQLS--------DNTRSDELCYTE-LTDLKIIHW-NSPKKLKVKNKHM 369
                +LP ++NVQ +          T S      E L    IIH+ N  K   ++N H+
Sbjct: 211 ---WQSLPSKYNVQSNLVKGKYRKSGTESGRRSQQEALNQPVIIHYTNFDKPWLIRNGHL 267

Query: 370 EFFRNLYLTFLEYDGNLLRR 389
              R+LY    EY   LL +
Sbjct: 268 HPLRSLY---DEYQNKLLNQ 284


>gi|366165519|ref|ZP_09465274.1| glycosyl transferase family protein [Acetivibrio cellulolyticus
           CD2]
          Length = 484

 Score = 43.9 bits (102), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 48/182 (26%), Positives = 77/182 (42%), Gaps = 21/182 (11%)

Query: 186 PNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVE 245
           P   Y+  Y   ++ LP++L  ++ K + LD D++   DIA+LW           +  VE
Sbjct: 80  PCASYATYY---RIFLPELLDSSIEKVLYLDCDIVVKGDIAKLW---ETDITGNYLAAVE 133

Query: 246 NQSDWYLGKLWKNHKPWPALGRG---FNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRL 302
           +    Y G+  K  K   ++ R    FN GV++++L   R     G    I +  +    
Sbjct: 134 DVGVEYSGEFGKKVKENLSMDRKDIYFNAGVLIINLDLWRQ---HGISDKICDFLIQNPD 190

Query: 303 WTSLADQDIFNAIISEHPYLVYTLPCQWNVQLS------DNTRSDELCYTELTDLKIIHW 356
               ADQD  NA++S        L   WN Q++      D    D+     L +  IIH+
Sbjct: 191 KAPFADQDGLNAVLSGK---WVPLSLLWNQQVALWEHFDDGKPLDQEMLESLHNPFIIHY 247

Query: 357 NS 358
            S
Sbjct: 248 TS 249


>gi|255014941|ref|ZP_05287067.1| putative glucosyltransferase [Bacteroides sp. 2_1_7]
 gi|410105347|ref|ZP_11300255.1| hypothetical protein HMPREF0999_04027 [Parabacteroides sp. D25]
 gi|409232557|gb|EKN25403.1| hypothetical protein HMPREF0999_04027 [Parabacteroides sp. D25]
          Length = 301

 Score = 43.9 bits (102), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 51/198 (25%), Positives = 91/198 (45%), Gaps = 24/198 (12%)

Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
           FY  D  V   +  P   +  +    +L +  +LP+++ K + LD DV+    +  LW  
Sbjct: 61  FYTVDKKV--FANCPISRHITLATYFRLIMTDILPKSVEKVLYLDCDVVVRHSLRSLWDT 118

Query: 232 FSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWA-GFW 290
             +       G++ + S   + +++   +  P+LG  FN GV+L++L   R+ + +  F+
Sbjct: 119 DIK---SYAAGVIPDMSIDDI-RIYNRLQYSPSLGY-FNAGVLLVNLRYWRENNLSESFF 173

Query: 291 RIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQ--------LSDNTRSDE 342
            II +     R      DQD+ N ++ E   +  TLP ++NVQ        L   T  DE
Sbjct: 174 EIINKYPERLR----YHDQDVLNIVLKE---IKLTLPMKYNVQHGYFFKDPLISRTYRDE 226

Query: 343 LCYTELTDLKIIHWNSPK 360
                +TD  I+H++  K
Sbjct: 227 R-EQAITDPVILHYSGSK 243


>gi|356495990|ref|XP_003516853.1| PREDICTED: probable galacturonosyltransferase-like 3-like [Glycine
           max]
          Length = 338

 Score = 43.9 bits (102), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 29/95 (30%), Positives = 52/95 (54%), Gaps = 13/95 (13%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVE----NQSDWYLG 253
           ++ L  +LP T+ + I  D+D+I   D+A+LW++    R    +G  E    N ++++  
Sbjct: 140 RMYLADLLPATVRRIIYFDSDLIVVDDVAKLWSIDLHAR---VLGAPEYCHANFTNYFTH 196

Query: 254 KLWKNHKPWPALGRG-----FNTGVILLDLTKLRD 283
           + W N   + A  +G     FNTGV+++DL K R+
Sbjct: 197 RFWSN-PSYAASFKGRDACYFNTGVMVIDLWKWRE 230


>gi|357146607|ref|XP_003574051.1| PREDICTED: LOW QUALITY PROTEIN: UDP-glucose:glycoprotein
            glucosyltransferase-like [Brachypodium distachyon]
          Length = 1604

 Score = 43.9 bits (102), Expect = 0.31,   Method: Compositional matrix adjust.
 Identities = 38/141 (26%), Positives = 68/141 (48%), Gaps = 18/141 (12%)

Query: 199  LTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSD------WYL 252
            L L  + P +L K I +D D I  TD+ +L+ +  + R        +N  +      W  
Sbjct: 1385 LFLDVIFPLSLRKVIFVDADQIVRTDMGELYDMNLKGRPLAYTPFCDNNKEMDGYRFWKQ 1444

Query: 253  GKLWKNHKPWPALGRGFN-TGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLA--DQ 309
            G  WK+H      GR ++ + + ++DL K R  +     R++ E   L++   SL+  DQ
Sbjct: 1445 G-FWKDH----LRGRPYHISALYVVDLAKFRQTAAGDTLRVVYET--LSKDPNSLSNLDQ 1497

Query: 310  DIFNAIISEHPYLVYTLPCQW 330
            D+ N   ++H   +++LP +W
Sbjct: 1498 DLPN--YAQHTVPIFSLPQEW 1516


>gi|324497845|gb|ADY39571.1| putative N-acetyllactosaminide [Hottentotta judaicus]
          Length = 232

 Score = 43.9 bits (102), Expect = 0.32,   Method: Compositional matrix adjust.
 Identities = 38/161 (23%), Positives = 61/161 (37%), Gaps = 41/161 (25%)

Query: 484 QLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSR------RNIGYH 537
           Q + + L  V +L   W+ PIS+ +Y   A    F+S       LR         N+ +H
Sbjct: 2   QGTYEFLHHVNILCSRWQAPISVAVY---APGDDFISAHLTISYLRQCGPPCIVENVTWH 58

Query: 538 VVY------------------------------KEGNFYPINTLRNVALNQVSTPYVFLL 567
           ++Y                              K+G  YPIN  RNVA  +  T Y+F  
Sbjct: 59  MIYDTDFPPEERKSVVKPINCSDSLNLSSSYKTKKGLLYPINVARNVARLKAETYYIFSS 118

Query: 568 DIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQR 608
           DI+  P  G+       ++  +       ++  VP FE ++
Sbjct: 119 DIELYPSIGIVTQFLDMVQKEE--NSETLRIYAVPVFEIKK 157


>gi|284047872|ref|YP_003398211.1| glycosyl transferase family protein [Acidaminococcus fermentans DSM
           20731]
 gi|283952093|gb|ADB46896.1| glycosyl transferase family 8 [Acidaminococcus fermentans DSM
           20731]
          Length = 309

 Score = 43.5 bits (101), Expect = 0.33,   Method: Compositional matrix adjust.
 Identities = 39/173 (22%), Positives = 75/173 (43%), Gaps = 25/173 (14%)

Query: 197 LKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLW 256
           L+L +P+++P+ + + I LDTD++   DI +LW +     Q + +G V +       ++ 
Sbjct: 87  LRLLIPELVPQAVHRVIYLDTDLVVLDDIQELWEMDL---QGKPVGAVPDLGILASSRMR 143

Query: 257 KNHKPWPALGRG---FNTGVILLDLTKLRDISWAG-FWRIIAEKFLLTRLWTSLADQDIF 312
           +  +    +  G   FN+GV++++L   R+  +     R + E            DQD  
Sbjct: 144 RQKEETLGIQEGKLYFNSGVMVMELEAWREKQYGDQVIRCVEEGNFRHH------DQDGL 197

Query: 313 NAIISEHPYLVYTLPCQWN---------VQLSDNTRSDELCYTELTDLKIIHW 356
           N +  ++      LP +WN         V++   +R   L    L    + HW
Sbjct: 198 NKVFQDN---WQPLPLRWNVIPPVFTLPVKVLKKSRWRNLALEALERPAVFHW 247


>gi|378823526|ref|ZP_09846149.1| glycosyltransferase, family 8 [Sutterella parvirubra YIT 11816]
 gi|378597650|gb|EHY30915.1| glycosyltransferase, family 8 [Sutterella parvirubra YIT 11816]
          Length = 309

 Score = 43.5 bits (101), Expect = 0.35,   Method: Compositional matrix adjust.
 Identities = 29/117 (24%), Positives = 54/117 (46%), Gaps = 4/117 (3%)

Query: 19  IVCAGYNSTRSLVTLIKSILFYRKNP--LHFHLITDTVALNILQTLFSTWSVPQVEVSFY 76
           + C+  N  +     ++SI+ +   P  + F+ I D ++      L  T       ++F 
Sbjct: 7   VFCSDDNFAQHSAVTMRSIVEHCSRPADIDFYYIHDGISEEKQVRLRRTIEEFGATLTFL 66

Query: 77  LADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWA 133
             D  V   ++I   HY  +    +L +P  LP+ + + I LD D++   D+A+LWA
Sbjct: 67  TPDVSVFKDAFI--NHYYSLASYFRLLVPSCLPQEVHRCIYLDGDMVVDGDVAELWA 121


>gi|47222079|emb|CAG12105.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 416

 Score = 43.5 bits (101), Expect = 0.37,   Method: Compositional matrix adjust.
 Identities = 53/227 (23%), Positives = 92/227 (40%), Gaps = 58/227 (25%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSD----WYLG 253
           +L LP +L + +   + +D+D++F   +  LWA  S+    Q   +     +    WY  
Sbjct: 119 RLFLPLIL-KDVDSIVYVDSDILFLQPMDDLWAFLSKFTPSQLAAMAPEHEEPRIAWY-- 175

Query: 254 KLWKNHKPWPALGR-GFNTGVILLDLTKLRDI-----------SWAGFWRIIAEKFLLTR 301
             +  H   P  GR G N+GV+L+++T++R +            W      + +K+ L  
Sbjct: 176 SRFARH---PFYGRTGVNSGVMLMNMTRMRTVVFQNDMTSVPLRWEELLMPLLQKYKLNI 232

Query: 302 LWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWNSPKK 361
            W    DQD+ N I         T+  + +  +S+      +C+              + 
Sbjct: 233 TW---GDQDLLNIIFP------ITILVRLHTAVSNYASVCSVCF--------------QA 269

Query: 362 LKVKNKHMEFFRNLYLTFLEYDGNLLRRELFGCNLTQTTLQQAELSN 408
           LK KN +      L+L F+        REL G +L      Q ELS+
Sbjct: 270 LKFKNCNSRIHFPLFLFFVP-------RELAGVSL------QVELSS 303


>gi|336426113|ref|ZP_08606126.1| hypothetical protein HMPREF0994_02132 [Lachnospiraceae bacterium
           3_1_57FAA_CT1]
 gi|336011071|gb|EGN41039.1| hypothetical protein HMPREF0994_02132 [Lachnospiraceae bacterium
           3_1_57FAA_CT1]
          Length = 352

 Score = 43.5 bits (101), Expect = 0.37,   Method: Compositional matrix adjust.
 Identities = 39/154 (25%), Positives = 68/154 (44%), Gaps = 24/154 (15%)

Query: 181 DVSWIPNKHYSGVYGL------------LKLTLPKVLPETLAKTIVLDTDVIFATDIAQL 228
           D+ WIP K  S    +             +L +  VLP  L + + LD D++    + +L
Sbjct: 82  DIQWIPAKDISEELNMDVVVDRGSLSQYARLFVSSVLPADLERVLYLDCDIVINQSLDEL 141

Query: 229 WALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAG 288
           W L   +  +    L +  S WY   +  + KP   +   FN+GV+L+DL + ++     
Sbjct: 142 WNL--DMHGKTIAALKDAFSKWYRANI--DLKPTDIM---FNSGVMLIDLKRWKEQKIEK 194

Query: 289 FWRIIAEKFLLTRLW-TSLADQDIFNAIISEHPY 321
             R++  KF+ ++       DQ   NA++S   Y
Sbjct: 195 --RLM--KFIASKNGRIQQGDQGALNAVLSHDTY 224


>gi|171058066|ref|YP_001790415.1| glycosyl transferase family protein [Leptothrix cholodnii SP-6]
 gi|170775511|gb|ACB33650.1| glycosyl transferase family 8 [Leptothrix cholodnii SP-6]
          Length = 316

 Score = 43.5 bits (101), Expect = 0.37,   Method: Compositional matrix adjust.
 Identities = 39/136 (28%), Positives = 53/136 (38%), Gaps = 10/136 (7%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L +  +LP  L + + LDTD++   D   L  L         +G V +  D  L     
Sbjct: 96  RLLMADLLPAELERVLYLDTDILVLGD---LLPLMRTELDGAILGAVRDGLDAELKSTSP 152

Query: 258 NHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIIS 317
                P +   FN GV+L+DL + R    AG     A   L+    T  ADQD  N    
Sbjct: 153 APTGMPDVCDYFNAGVLLIDLARWR----AGRVSAAARDHLVAHPQTPFADQDALNVACD 208

Query: 318 EHPYLVYTLPCQWNVQ 333
            H      L   WN Q
Sbjct: 209 GH---WKPLAAHWNFQ 221


>gi|295691931|ref|YP_003600541.1| lipopolysaccharide biosynthesis glycosyltransferase [Lactobacillus
           crispatus ST1]
 gi|295030037|emb|CBL49516.1| Lipopolysaccharide biosynthesis glycosyltransferase [Lactobacillus
           crispatus ST1]
          Length = 317

 Score = 43.5 bits (101), Expect = 0.38,   Method: Compositional matrix adjust.
 Identities = 41/164 (25%), Positives = 72/164 (43%), Gaps = 22/164 (13%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L +P++ P+   K + LD D I  TDIA+++ +              + S  Y+  L K
Sbjct: 95  RLFIPELFPQ-YDKAVYLDADTIICTDIAKMYEI---DIADNMFASCPDLSIRYMPLLQK 150

Query: 258 NHKP----WPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFN 313
             K     +PA  +  N GVIL ++   RD  +   +  +  K+    +     DQ   N
Sbjct: 151 YIKECQGIFPA-EKYINNGVILFNMKAFRDKKFVDKFYYLMNKYHFDNV---DPDQAYMN 206

Query: 314 AIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
            I  +    +Y LP +W+   +++         E+ D KI+H+N
Sbjct: 207 EICEDE---IYHLPKEWDAMPNESI-------PEIQDPKIVHYN 240


>gi|296084616|emb|CBI25666.3| unnamed protein product [Vitis vinifera]
          Length = 503

 Score = 43.5 bits (101), Expect = 0.39,   Method: Compositional matrix adjust.
 Identities = 28/95 (29%), Positives = 46/95 (48%), Gaps = 8/95 (8%)

Query: 47  FHLITDTVALNILQTLF-------STWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGL 99
           FH++TD + L  +Q +F       S   V  VE   +L  S V  +  + N  Y  +   
Sbjct: 250 FHVVTDKMNLGAMQVMFKMRDYNGSHIEVKAVEDYKFLNSSYVPVLRQLENPKYLSMLNH 309

Query: 100 LKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 134
           L+  LP++ P+ L + + LD DV+   D+  LW +
Sbjct: 310 LRFYLPEMYPK-LHRILFLDDDVVVQRDLTGLWKI 343


>gi|428176520|gb|EKX45404.1| hypothetical protein GUITHDRAFT_163273 [Guillardia theta CCMP2712]
          Length = 1204

 Score = 43.5 bits (101), Expect = 0.42,   Method: Compositional matrix adjust.
 Identities = 32/131 (24%), Positives = 58/131 (44%), Gaps = 15/131 (11%)

Query: 486 SMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRNIGYHV------V 539
           ++DR  ++  L   W GP+S+  Y S  E ++ +      E +R+ ++    +       
Sbjct: 189 TVDRAVVIYNLAHRWRGPLSVVFYTSTEEEREIVRTFVREELIRASKSYCKSLWVQLVSD 248

Query: 540 YKEGN------FYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGH 593
            K+ N       +P+N LR  A+N +   YV  +D DFLP       ++  ++  D    
Sbjct: 249 CKDKNQPDDQFSFPVNKLRFEAINVLPHDYVLYVDADFLPSDNALVEIERGLKEFD---E 305

Query: 594 GGKKVLVVPAF 604
               +LV+P F
Sbjct: 306 LESALLVIPCF 316


>gi|365853174|ref|ZP_09393469.1| glycosyltransferase, family 8 [Lactobacillus parafarraginis F0439]
 gi|363713150|gb|EHL96796.1| glycosyltransferase, family 8 [Lactobacillus parafarraginis F0439]
          Length = 316

 Score = 43.5 bits (101), Expect = 0.42,   Method: Compositional matrix adjust.
 Identities = 39/147 (26%), Positives = 71/147 (48%), Gaps = 24/147 (16%)

Query: 197 LKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIG--LVENQSDWYLGK 254
            +L + ++ P+ L K I LD D +   DIA+L+ +        ++G  L+    D+++G 
Sbjct: 95  FRLFIAELFPD-LDKAIYLDADTVILDDIAKLYDV--------SLGENLIGAVPDYFIGH 145

Query: 255 L-----WKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLA-D 308
                 + N        +  N+GV+L++L  +RD  +A  +  +  ++     + SLA D
Sbjct: 146 TPETIDYANQAVGIDSQKYVNSGVLLMNLKVMRDCHFADHFLDLLNRY----HFKSLAPD 201

Query: 309 QDIFNAIISEHPYLVYTLPCQWNVQLS 335
           QD  NAI  +    +  L   WNVQ++
Sbjct: 202 QDYMNAIARKR---ICYLDSSWNVQIT 225


>gi|224108539|ref|XP_002314884.1| glycosyltransferase, CAZy family GT8 [Populus trichocarpa]
 gi|222863924|gb|EEF01055.1| glycosyltransferase, CAZy family GT8 [Populus trichocarpa]
          Length = 342

 Score = 43.5 bits (101), Expect = 0.42,   Method: Compositional matrix adjust.
 Identities = 31/117 (26%), Positives = 59/117 (50%), Gaps = 16/117 (13%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVE----NQSDWYLG 253
           ++ L  +LP ++ + I  D+D+I   D+A+LW +         +G  E    N ++++  
Sbjct: 145 RIYLADLLPMSVRRIIYFDSDLILVDDVAKLWNIN---LGAHVLGAPEYCHANFTNYFNS 201

Query: 254 KLWKNHKPWPALGRG-----FNTGVILLDLTKLRDISWA---GFWRIIAEKFLLTRL 302
           + W N     +L RG     FNTGV+++DL K R+  +     +W  + +K+ +  L
Sbjct: 202 RFWSNSACAASL-RGRRACYFNTGVMVIDLGKWREGKYTERLEYWMKVQKKYRIYEL 257


>gi|293380058|ref|ZP_06626154.1| glycosyl transferase family 8 [Lactobacillus crispatus 214-1]
 gi|312977025|ref|ZP_07788774.1| glycosyltransferase [Lactobacillus crispatus CTV-05]
 gi|423319858|ref|ZP_17297733.1| hypothetical protein HMPREF9250_02166 [Lactobacillus crispatus
           FB049-03]
 gi|423320134|ref|ZP_17298006.1| hypothetical protein HMPREF9249_00006 [Lactobacillus crispatus
           FB077-07]
 gi|290923372|gb|EFE00279.1| glycosyl transferase family 8 [Lactobacillus crispatus 214-1]
 gi|310896353|gb|EFQ45418.1| glycosyltransferase [Lactobacillus crispatus CTV-05]
 gi|405586879|gb|EKB60623.1| hypothetical protein HMPREF9250_02166 [Lactobacillus crispatus
           FB049-03]
 gi|405609037|gb|EKB81940.1| hypothetical protein HMPREF9249_00006 [Lactobacillus crispatus
           FB077-07]
          Length = 317

 Score = 43.1 bits (100), Expect = 0.44,   Method: Compositional matrix adjust.
 Identities = 41/164 (25%), Positives = 72/164 (43%), Gaps = 22/164 (13%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L +P++ P+   K + LD D I  TDIA+++ +              + S  Y+  L K
Sbjct: 95  RLFIPELFPQ-YDKAVYLDADTIICTDIAKMYEI---DIADNMFASCPDLSIRYMPLLQK 150

Query: 258 NHKP----WPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFN 313
             K     +PA  +  N GVIL ++   RD  +   +  +  K+    +     DQ   N
Sbjct: 151 YIKECQGIFPA-EKYINNGVILFNMKAFRDKKFVDKFYYLMNKYHFDNV---DPDQAYMN 206

Query: 314 AIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
            I  +    +Y LP +W+   +++         E+ D KI+H+N
Sbjct: 207 EICEDK---IYHLPKEWDAMPNESI-------PEIQDPKIVHYN 240


>gi|313236397|emb|CBY11715.1| unnamed protein product [Oikopleura dioica]
          Length = 275

 Score = 43.1 bits (100), Expect = 0.45,   Method: Compositional matrix adjust.
 Identities = 36/165 (21%), Positives = 69/165 (41%), Gaps = 34/165 (20%)

Query: 473 SSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEA-------QQFLSYAGNS 525
           + D  D+TL    ++  ++ +  L + W+GP+S++L++   +A       ++F   +G  
Sbjct: 86  AGDYEDITLATFATVVDIRQLWKLARKWQGPMSISLFVDGDQATIAHIAIEKFRKCSGLD 145

Query: 526 EALRSRRNIGYHVVY-----------------------KEGNFYPINTLRNVALNQVSTP 562
           +++     + +H+VY                       + G   P N LRN+A +  ST 
Sbjct: 146 DSI----PLHFHLVYSTSLEALTDDARLAELQQRIDFAQTGLRLPKNLLRNIAKDSASTD 201

Query: 563 YVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQ 607
           +V   DI   P   L           ++     K+  V+PAFE +
Sbjct: 202 FVLSADISLSPNDNLRENFLKFANKENLFDATSKQAFVIPAFEIK 246


>gi|428186170|gb|EKX55021.1| hypothetical protein GUITHDRAFT_99661 [Guillardia theta CCMP2712]
          Length = 1551

 Score = 43.1 bits (100), Expect = 0.46,   Method: Compositional matrix adjust.
 Identities = 36/142 (25%), Positives = 69/142 (48%), Gaps = 8/142 (5%)

Query: 193  VYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYL 252
            ++G   L L  + P ++ K I +D+D +  +D+ QLW +  R R        +++ +   
Sbjct: 1362 IWGYKILMLDVLFPLSVPKIIYIDSDQVVRSDLKQLWDMNLRGRPYAYTPFCDDKREIEG 1421

Query: 253  GKLWKNHKPWPALG--RGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLA--D 308
             + WK       LG  +   + + ++DL + R I      R++  +  L+R   SLA  D
Sbjct: 1422 YRFWKQGFWQTHLGDMKYHISALYVVDLNRFRAIGAGDELRVVYSQ--LSRDPNSLANLD 1479

Query: 309  QDIFNAIISEHPYLVYTLPCQW 330
            QD+ N   ++H   +++LP +W
Sbjct: 1480 QDLPN--YAQHSVPIFSLPQEW 1499


>gi|256844221|ref|ZP_05549707.1| conserved hypothetical protein [Lactobacillus crispatus 125-2-CHN]
 gi|256613299|gb|EEU18502.1| conserved hypothetical protein [Lactobacillus crispatus 125-2-CHN]
          Length = 317

 Score = 43.1 bits (100), Expect = 0.46,   Method: Compositional matrix adjust.
 Identities = 41/164 (25%), Positives = 72/164 (43%), Gaps = 22/164 (13%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L +P++ P+   K + LD D I  TDIA+++ +              + S  Y+  L K
Sbjct: 95  RLFIPELFPQ-YDKAVYLDADTIICTDIAKMYEI---DIADNMFASCPDLSIRYMPLLQK 150

Query: 258 NHKP----WPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFN 313
             K     +PA  +  N GVIL ++   RD  +   +  +  K+    +     DQ   N
Sbjct: 151 YIKECQGIFPA-EKYINNGVILFNMKAFRDKKFVDKFYYLMNKYHFDNV---DPDQAYMN 206

Query: 314 AIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
            I  +    +Y LP +W+   +++         E+ D KI+H+N
Sbjct: 207 EICEDK---IYHLPKEWDAMPNESI-------PEIQDPKIVHYN 240


>gi|296110269|ref|YP_003620650.1| lipopolysaccharide glycosyltransferase [Leuconostoc kimchii IMSNU
           11154]
 gi|339490576|ref|YP_004705081.1| putative lipopolysaccharide glycosyltransferase [Leuconostoc sp.
           C2]
 gi|295831800|gb|ADG39681.1| putative lipopolysaccharide glycosyltransferase [Leuconostoc
           kimchii IMSNU 11154]
 gi|338852248|gb|AEJ30458.1| putative lipopolysaccharide glycosyltransferase [Leuconostoc sp.
           C2]
          Length = 277

 Score = 43.1 bits (100), Expect = 0.48,   Method: Compositional matrix adjust.
 Identities = 39/142 (27%), Positives = 67/142 (47%), Gaps = 11/142 (7%)

Query: 197 LKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLW 256
            +L   + LP+TL + + LD D++    I  LW L  +    Q I    +     + +  
Sbjct: 87  FRLLCGEYLPKTLTRILYLDPDILVINPIRSLWTLDMK---NQMIAAASHNGLTGVSQTI 143

Query: 257 KNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAII 316
            N + +    + FN+GV+L+DL K+R+           +++   RL   L DQDI N + 
Sbjct: 144 NNVR-FQNKNKYFNSGVMLMDLNKMREKVKLDDITSAIDRY-AKRL--ILPDQDILNYLY 199

Query: 317 SEHPYLVYTLPCQ-WNVQLSDN 337
           S+H   + ++P + WN    DN
Sbjct: 200 SDH---ILSIPEEIWNYDTRDN 218


>gi|242066908|ref|XP_002454743.1| hypothetical protein SORBIDRAFT_04g036540 [Sorghum bicolor]
 gi|241934574|gb|EES07719.1| hypothetical protein SORBIDRAFT_04g036540 [Sorghum bicolor]
          Length = 1568

 Score = 43.1 bits (100), Expect = 0.49,   Method: Compositional matrix adjust.
 Identities = 38/141 (26%), Positives = 67/141 (47%), Gaps = 18/141 (12%)

Query: 199  LTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSD------WYL 252
            L L  + P +L K I +D D I   D+ +L+ +  + R        +N  D      W  
Sbjct: 1349 LFLDVIFPLSLRKVIFVDADQIVRADMGELYDMNLKGRPLAYTPFCDNNKDMDGYRFWKQ 1408

Query: 253  GKLWKNHKPWPALGRGFN-TGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLA--DQ 309
            G  WK+H      GR ++ + + ++DL K R  +     R+  E+  L++   SL+  DQ
Sbjct: 1409 G-FWKDH----LRGRPYHISALYVVDLAKFRQTASGDTLRVFYEQ--LSKDPNSLSNLDQ 1461

Query: 310  DIFNAIISEHPYLVYTLPCQW 330
            D+ N   ++H   +++LP +W
Sbjct: 1462 DLPN--YAQHTVPIFSLPQEW 1480


>gi|395243686|ref|ZP_10420670.1| Putative glucosyl transferase [Lactobacillus hominis CRBIP 24.179]
 gi|394484101|emb|CCI81678.1| Putative glucosyl transferase [Lactobacillus hominis CRBIP 24.179]
          Length = 316

 Score = 43.1 bits (100), Expect = 0.50,   Method: Compositional matrix adjust.
 Identities = 43/193 (22%), Positives = 82/193 (42%), Gaps = 24/193 (12%)

Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLL----KLTLPKVLPETLAKTIVLDTDVIFATDIAQ 227
           F+++D +V+ +      +  G +  +    +L +P + P+   K + LD D I  TDI++
Sbjct: 64  FHISDDMVQPIHKGEENYLRGQFFTMSIFYRLFIPDLFPQ-YNKAVYLDADTIICTDISE 122

Query: 228 LWALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPAL---GRGFNTGVILLDLTKLRDI 284
           ++ +              + S  Y+  L K  K    +    +  N GVIL ++   RD 
Sbjct: 123 MYDI---EIGDNMFASCPDLSIRYMPLLQKYIKECQGIFPPEKYINNGVILFNMKAFRDK 179

Query: 285 SWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELC 344
            +   +  + EK+    L     DQ   N I  +    +Y L  +W+   +++       
Sbjct: 180 KFVDKFYYLMEKYHFDNL---DPDQAYMNEICEDK---IYHLDKEWDAMPNESM------ 227

Query: 345 YTELTDLKIIHWN 357
             E+ D KI+H+N
Sbjct: 228 -PEIKDPKIVHYN 239


>gi|414159380|ref|ZP_11415666.1| hypothetical protein HMPREF9310_00040 [Staphylococcus simulans
           ACS-120-V-Sch1]
 gi|410884382|gb|EKS32208.1| hypothetical protein HMPREF9310_00040 [Staphylococcus simulans
           ACS-120-V-Sch1]
          Length = 280

 Score = 43.1 bits (100), Expect = 0.53,   Method: Compositional matrix adjust.
 Identities = 42/165 (25%), Positives = 75/165 (45%), Gaps = 21/165 (12%)

Query: 197 LKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLW 256
            +L   ++LPE++ K + LD D++    I  LW L     +R       ++      +  
Sbjct: 88  FRLACGELLPESVTKVLYLDPDILVINTIDDLWNLN---LERNIFAAATHEGFTKFSQGM 144

Query: 257 KNHKPWPALGRG-FNTGVILLDLTKLR-DISWAGFWRIIAE--KFLLTRLWTSLADQDIF 312
            N +     G+  FN+GV+L+D+ K R +IS    +  I +  ++LL      L DQD+ 
Sbjct: 145 NNLRLGTKQGQVYFNSGVMLIDVEKARQEISMDDIFNFIEKNNQYLL------LPDQDVM 198

Query: 313 NAIISEHPYLVYTLPCQ-WNVQLSDNTRSDELCYTELTDLKIIHW 356
           N++  +    +  +P + WN     +TR   + YT+       HW
Sbjct: 199 NSLYGDK---IKEIPEEIWNY----DTRQANMYYTKSVGKYDTHW 236


>gi|227879001|ref|ZP_03996898.1| glucosyl transferase, partial [Lactobacillus crispatus JV-V01]
 gi|227861406|gb|EEJ69028.1| glucosyl transferase [Lactobacillus crispatus JV-V01]
          Length = 287

 Score = 43.1 bits (100), Expect = 0.53,   Method: Compositional matrix adjust.
 Identities = 41/164 (25%), Positives = 72/164 (43%), Gaps = 22/164 (13%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L +P++ P+   K + LD D I  TDIA+++ +              + S  Y+  L K
Sbjct: 65  RLFIPELFPQ-YDKAVYLDADTIICTDIAKMYEIDI---ADNMFASCPDLSIRYMPLLQK 120

Query: 258 NHKP----WPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFN 313
             K     +PA  +  N GVIL ++   RD  +   +  +  K+    +     DQ   N
Sbjct: 121 YIKECQGIFPA-EKYINNGVILFNMKAFRDKKFVDKFYYLMNKYHFDNV---DPDQAYMN 176

Query: 314 AIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
            I  +    +Y LP +W+   +++         E+ D KI+H+N
Sbjct: 177 EICEDK---IYHLPKEWDAMPNESI-------PEIQDPKIVHYN 210


>gi|262047884|ref|ZP_06020832.1| glucosyl transferase [Lactobacillus crispatus MV-3A-US]
 gi|260571828|gb|EEX28401.1| glucosyl transferase [Lactobacillus crispatus MV-3A-US]
          Length = 288

 Score = 42.7 bits (99), Expect = 0.55,   Method: Compositional matrix adjust.
 Identities = 41/164 (25%), Positives = 72/164 (43%), Gaps = 22/164 (13%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L +P++ P+   K + LD D I  TDIA+++ +              + S  Y+  L K
Sbjct: 66  RLFIPELFPQ-YDKAVYLDADTIICTDIAKMYEIDI---ADNMFASCPDLSIRYMPLLQK 121

Query: 258 NHKP----WPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFN 313
             K     +PA  +  N GVIL ++   RD  +   +  +  K+    +     DQ   N
Sbjct: 122 YIKECQGIFPA-EKYINNGVILFNMKAFRDKKFVDKFYYLMNKYHFDNV---DPDQAYMN 177

Query: 314 AIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
            I  +    +Y LP +W+   +++         E+ D KI+H+N
Sbjct: 178 EICEDK---IYHLPKEWDAMPNESI-------PEIQDPKIVHYN 211


>gi|255564294|ref|XP_002523144.1| transferase, transferring glycosyl groups, putative [Ricinus
           communis]
 gi|223537706|gb|EEF39329.1| transferase, transferring glycosyl groups, putative [Ricinus
           communis]
          Length = 340

 Score = 42.7 bits (99), Expect = 0.57,   Method: Compositional matrix adjust.
 Identities = 31/94 (32%), Positives = 48/94 (51%), Gaps = 11/94 (11%)

Query: 201 LPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVE----NQSDWYLGKLW 256
           L  +L   + + I LD+D++   DIA+LW   + L  R TIG  E    N + ++    W
Sbjct: 170 LADLLEPCVRRVIYLDSDLVVVDDIAKLWN--TNLGSR-TIGAPEYCHANFTKYFTSSFW 226

Query: 257 KNHKPWPAL-GRG---FNTGVILLDLTKLRDISW 286
            N +      GR    FNTGV+++DL K R + +
Sbjct: 227 SNKRFSSTFSGRKPCYFNTGVMVIDLVKWRRVGY 260


>gi|256849379|ref|ZP_05554812.1| conserved hypothetical protein [Lactobacillus crispatus MV-1A-US]
 gi|256714155|gb|EEU29143.1| conserved hypothetical protein [Lactobacillus crispatus MV-1A-US]
          Length = 286

 Score = 42.7 bits (99), Expect = 0.58,   Method: Compositional matrix adjust.
 Identities = 41/164 (25%), Positives = 72/164 (43%), Gaps = 22/164 (13%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L +P++ P+   K + LD D I  TDIA+++ +              + S  Y+  L K
Sbjct: 64  RLFIPELFPQ-YDKAVYLDADTIICTDIAKMYEIDI---ADNMFASCPDLSIRYMPLLQK 119

Query: 258 NHKP----WPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFN 313
             K     +PA  +  N GVIL ++   RD  +   +  +  K+    +     DQ   N
Sbjct: 120 YIKECQGIFPA-EKYINNGVILFNMKAFRDKKFVDKFYYLMNKYHFDNV---DPDQAYMN 175

Query: 314 AIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
            I  +    +Y LP +W+   +++         E+ D KI+H+N
Sbjct: 176 EICEDK---IYHLPKEWDAMPNESI-------PEIQDPKIVHYN 209


>gi|423349485|ref|ZP_17327141.1| hypothetical protein HMPREF9156_00679 [Scardovia wiggsiae F0424]
 gi|393703033|gb|EJD65235.1| hypothetical protein HMPREF9156_00679 [Scardovia wiggsiae F0424]
          Length = 319

 Score = 42.7 bits (99), Expect = 0.59,   Method: Compositional matrix adjust.
 Identities = 41/173 (23%), Positives = 78/173 (45%), Gaps = 39/173 (22%)

Query: 197 LKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWA------LFSRLR------QRQTIGLV 244
            +L +P++ P+ + K I LD+D +  TD+A+L++      L + +        ++T   V
Sbjct: 96  FRLFIPEMFPD-IDKAIYLDSDTVINTDVAELYSYDLGDNLVAGVHDVFMSANKETTDYV 154

Query: 245 ENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWT 304
           EN     LG   +++          N+G++L++L  +R+  +   +  +  K+    +  
Sbjct: 155 ENA----LGMPVRDY---------INSGMLLMNLKAMREEHFTEHFVRLLRKY---HVEC 198

Query: 305 SLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
           +  DQD  N I      L   L  +WN  ++D T   +         KIIH+N
Sbjct: 199 AAVDQDYLNVICRGRILL---LAREWNTMMTDGTAGSDHP-------KIIHYN 241


>gi|356500226|ref|XP_003518934.1| PREDICTED: probable galacturonosyltransferase-like 7-like [Glycine
           max]
          Length = 357

 Score = 42.7 bits (99), Expect = 0.61,   Method: Compositional matrix adjust.
 Identities = 31/95 (32%), Positives = 51/95 (53%), Gaps = 11/95 (11%)

Query: 201 LPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVE----NQSDWYLGKLW 256
           L  +L   + + I LD+D++   DIA+LW+  + L  R TIG  E    N + ++    W
Sbjct: 168 LADLLEPCVERVIYLDSDLVLVDDIAKLWS--TSLGSR-TIGAPEYCHANFTKYFTAGFW 224

Query: 257 KNHKPWPAL-GRG---FNTGVILLDLTKLRDISWA 287
            + +   A  GR    FNTGV+++DL + R I ++
Sbjct: 225 SDMRFASAFAGRRPCYFNTGVMVIDLVRWRKIGYS 259


>gi|283956813|ref|ZP_06374287.1| hypothetical protein C1336_000310011 [Campylobacter jejuni subsp.
           jejuni 1336]
 gi|283791674|gb|EFC30469.1| hypothetical protein C1336_000310011 [Campylobacter jejuni subsp.
           jejuni 1336]
          Length = 396

 Score = 42.7 bits (99), Expect = 0.61,   Method: Compositional matrix adjust.
 Identities = 45/186 (24%), Positives = 81/186 (43%), Gaps = 33/186 (17%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGK--- 254
           ++ +P++      K I  D+DVIF  DI+ L+  F+ L  ++ IG   + +  Y  +   
Sbjct: 105 RIFIPEIF-SNFKKVIYCDSDVIFKADISHLF--FTDLNNKE-IGACRDIAALYAYRKRE 160

Query: 255 -LWK-------NHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSL 306
            +W+       +   + ++   FN+GVI+ D+ K   +        + +   +  L+   
Sbjct: 161 TIWQQNIGNNFDKINFRSISDYFNSGVIVFDIVKCIQMKTVSKCLTVIKN--IDNLY--F 216

Query: 307 ADQDIFNAIISEHPYLVYTLPCQWN------VQLSDN-----TRSDELCYTELTDLKIIH 355
            DQD+ N +   H   V+ LP +WN      ++  DN      +     Y   T  KIIH
Sbjct: 217 PDQDVLNIVFCGH---VHFLPLEWNFLWTTYIEYRDNFMYLPKKIINEIYRAKTKPKIIH 273

Query: 356 WNSPKK 361
           + S  K
Sbjct: 274 YISETK 279


>gi|224068414|ref|XP_002302739.1| glycosyltransferase, CAZy family GT8 [Populus trichocarpa]
 gi|222844465|gb|EEE82012.1| glycosyltransferase, CAZy family GT8 [Populus trichocarpa]
          Length = 367

 Score = 42.7 bits (99), Expect = 0.61,   Method: Compositional matrix adjust.
 Identities = 35/104 (33%), Positives = 53/104 (50%), Gaps = 20/104 (19%)

Query: 201 LPKVLPETLAKTIVLDTDVIFATDIAQLWA--LFSRLRQRQTIGLVE----NQSDWYLGK 254
           L  +L   + + I LD+D++   DIA+LW   L SR+     IG  E    N + ++   
Sbjct: 178 LADLLEPCVKRVIYLDSDLVVVDDIAKLWTTNLGSRI-----IGAPEYCHANFTKYFTAD 232

Query: 255 LWKNHKPWPALGRG-----FNTGVILLDLTKLRDISWAGFWRII 293
            W + K +    RG     FNTGV+++DL K R   WAG+ + I
Sbjct: 233 FWSD-KRFSGTFRGRKPCYFNTGVMVIDLVKWR---WAGYTKRI 272


>gi|323452188|gb|EGB08063.1| hypothetical protein AURANDRAFT_64432 [Aureococcus anophagefferens]
          Length = 2067

 Score = 42.7 bits (99), Expect = 0.62,   Method: Composition-based stats.
 Identities = 44/152 (28%), Positives = 59/152 (38%), Gaps = 15/152 (9%)

Query: 197  LKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTI--------GLVENQS 248
            +KL LP++L  T+   + LDTDV+FA D A LW                    G V    
Sbjct: 1300 VKLFLPQILDPTIRWAVYLDTDVVFAADPAALWLGAVGTLAAAGAAVAAAAEHGDVAPSC 1359

Query: 249  DWYLGKLWKNHKPWPALGRGF-----NTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLW 303
            +      +       AL R F     NTGV+ LD    RD  W      +      T+  
Sbjct: 1360 EGSAEHYYNQSAAARALPRAFPDDGANTGVLALDAAAARDAGWGPTLAAVMADLAATKAN 1419

Query: 304  TS--LADQDIFNAIISEHPYLVYTLPCQWNVQ 333
             S  L DQ +FN       + V  LPC+ N +
Sbjct: 1420 GSLDLGDQSVFNVAAGGDQFTVARLPCEANFR 1451



 Score = 39.7 bits (91), Expect = 5.9,   Method: Composition-based stats.
 Identities = 17/33 (51%), Positives = 23/33 (69%)

Query: 100  LKLTLPKVLPETLAKTIVLDTDVIFATDIAQLW 132
            +KL LP++L  T+   + LDTDV+FA D A LW
Sbjct: 1300 VKLFLPQILDPTIRWAVYLDTDVVFAADPAALW 1332


>gi|429209656|ref|ZP_19200885.1| glycosyl transferase, family 8 [Rhodobacter sp. AKP1]
 gi|428187382|gb|EKX55965.1| glycosyl transferase, family 8 [Rhodobacter sp. AKP1]
          Length = 334

 Score = 42.7 bits (99), Expect = 0.65,   Method: Compositional matrix adjust.
 Identities = 39/151 (25%), Positives = 73/151 (48%), Gaps = 15/151 (9%)

Query: 187 NKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVEN 246
           ++H S     L+   P+VLPE + + + LD D+I   D+A++ ++  R    + +    +
Sbjct: 75  DRHLSAA-AYLRFLAPEVLPEAVERVLYLDCDLIVLDDVAKILSIDLR---GKAVAAAPD 130

Query: 247 QSDWYLGKLWKNHKPWPALGRGF-NTGVILLDLTKLR--DISWAGFWRIIAEKFLLTRLW 303
                  +  + H     L R + N+GV+L+DL + R   +S   F  +     LL R  
Sbjct: 131 LGWKDAAQAARFHTLGIPLDRAYVNSGVLLMDLGRWRRDGLSQKLFDYVARHGSLLLR-- 188

Query: 304 TSLADQDIFNAIISEHPYLVYTLPCQWNVQL 334
               DQD  NA++++    ++ L  +WN+Q+
Sbjct: 189 ---HDQDALNAVLADD---IHLLDRRWNLQV 213


>gi|440800079|gb|ELR21122.1| hypothetical protein ACA1_283190 [Acanthamoeba castellanii str.
           Neff]
          Length = 563

 Score = 42.7 bits (99), Expect = 0.70,   Method: Compositional matrix adjust.
 Identities = 42/144 (29%), Positives = 70/144 (48%), Gaps = 10/144 (6%)

Query: 469 EYEASSDGNDVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDA-EAQQFLSYAGNSEA 527
           EY A++  +DVTLV  LS+D+L ++  L  +W+GPIS  + + +       +S   N  A
Sbjct: 223 EYAATA--SDVTLVTHLSLDQLSLLSDLASYWKGPISAAVLLPNQLSLGDLVSAHNNDPA 280

Query: 528 LRSRRNIGYHVVYKE---GNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSS 584
           L+    +  H  Y +    + YP   LR  A + V++ YV  +D+++     L    +  
Sbjct: 281 LKQWATL--HATYPDKVRSSGYPSGFLRKAATDAVTSTYVLQMDVNYRTEDSLRSDAERF 338

Query: 585 IRSMDMHGHGGKKVLVVPAFETQR 608
           + S+D        V VVP+F   R
Sbjct: 339 VASLDWTNE--DAVYVVPSFAVSR 360


>gi|227498918|ref|ZP_03929057.1| general stress protein A [Acidaminococcus sp. D21]
 gi|352683476|ref|YP_004895459.1| general stress protein A [Acidaminococcus intestini RyC-MR95]
 gi|226904369|gb|EEH90287.1| general stress protein A [Acidaminococcus sp. D21]
 gi|350278129|gb|AEQ21319.1| general stress protein A [Acidaminococcus intestini RyC-MR95]
          Length = 311

 Score = 42.7 bits (99), Expect = 0.71,   Method: Compositional matrix adjust.
 Identities = 41/143 (28%), Positives = 65/143 (45%), Gaps = 22/143 (15%)

Query: 197 LKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLW 256
           L+L + K+LP  +AK +  DTD++   D+A+LW           IG V++       ++ 
Sbjct: 87  LRLLIAKLLPLAVAKALYFDTDLVVKDDVAKLWDF---PLDGHPIGAVKDFGIMASSRMR 143

Query: 257 KNHKPWPALGRG---FNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLA----DQ 309
           +       L  G   FN+GV+++DL   R   +        EK L  +  TS A    DQ
Sbjct: 144 RQKAESLGLPLGAPYFNSGVMIMDLAAFRKEGY-------GEKVL--QCVTSHAYRHHDQ 194

Query: 310 DIFNAIISEHPYLVYTLPCQWNV 332
           D  N +   +  +   LP +WNV
Sbjct: 195 DGLNKVFMGNWSI---LPLRWNV 214



 Score = 40.0 bits (92), Expect = 4.0,   Method: Compositional matrix adjust.
 Identities = 25/90 (27%), Positives = 44/90 (48%), Gaps = 3/90 (3%)

Query: 43  NPLHFHLITDTVALNILQTLFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKL 102
            PLHF+L  D +     + +  T    +  V+F     +  D ++     +   Y  L+L
Sbjct: 33  RPLHFYLFDDGITGVKKEMVERTIKDLKGLVTFIDTKGIAVD-AYTSGHIHKAAY--LRL 89

Query: 103 TLPKVLPETLAKTIVLDTDVIFATDIAQLW 132
            + K+LP  +AK +  DTD++   D+A+LW
Sbjct: 90  LIAKLLPLAVAKALYFDTDLVVKDDVAKLW 119


>gi|195382571|ref|XP_002050003.1| GJ20432 [Drosophila virilis]
 gi|194144800|gb|EDW61196.1| GJ20432 [Drosophila virilis]
          Length = 208

 Score = 42.4 bits (98), Expect = 0.73,   Method: Compositional matrix adjust.
 Identities = 36/146 (24%), Positives = 57/146 (39%), Gaps = 36/146 (24%)

Query: 551 LRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRYR 610
           +RN+A     + YVFL DID +P       L +  +++   G+      V+P FE    R
Sbjct: 1   MRNIARKGCQSKYVFLTDIDIIPSINSVVLLNNFFKTVTCDGNCA---YVIPTFEID-VR 56

Query: 611 TAFPASHA------------------------PTNFSRWV----NATTPYQIEWAPDF-- 640
             FP +                           TNFS+W+    N T         +F  
Sbjct: 57  APFPRTKENLLGLIKKGLARPFHEKVFIYNQYATNFSKWLSSHRNETVVRISHTVTNFEF 116

Query: 641 --EPYIVAHRDLPRYDTRFVGFGWNK 664
             EP+ +   + P +D RF G+G+ +
Sbjct: 117 LYEPFYITIDNAPIHDERFTGYGFTR 142


>gi|418056880|ref|ZP_12694931.1| glycosyl transferase family 2 [Hyphomicrobium denitrificans 1NES1]
 gi|353207652|gb|EHB73059.1| glycosyl transferase family 2 [Hyphomicrobium denitrificans 1NES1]
          Length = 674

 Score = 42.4 bits (98), Expect = 0.76,   Method: Compositional matrix adjust.
 Identities = 37/121 (30%), Positives = 58/121 (47%), Gaps = 13/121 (10%)

Query: 195 GLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGK 254
            LLK  LP++LP    + + LD D+I   D++ L+  FS +      G++ +    Y   
Sbjct: 94  ALLKFALPELLPHH-DRVLYLDGDLIVRDDLSDLF--FSDI-DGYVAGVISDSGQIYFKH 149

Query: 255 LWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNA 314
            W        +G  FN+GV+LLDL ++R  +      I A+K        SL DQ+ FN 
Sbjct: 150 EWVRR-----VGNYFNSGVMLLDLKEMRRSNVTELL-IKAKK---ENCDGSLLDQNAFNI 200

Query: 315 I 315
           +
Sbjct: 201 V 201


>gi|328958101|ref|YP_004375487.1| hypothetical protein CAR_c18140 [Carnobacterium sp. 17-4]
 gi|328674425|gb|AEB30471.1| hypothetical protein CAR_c18140 [Carnobacterium sp. 17-4]
          Length = 284

 Score = 42.4 bits (98), Expect = 0.76,   Method: Compositional matrix adjust.
 Identities = 47/184 (25%), Positives = 76/184 (41%), Gaps = 29/184 (15%)

Query: 162 HKPWPALGRGFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIF 221
           H  +P   +  +L + VV       N++YS +    +L  P VLP  L + + LD D+I 
Sbjct: 57  HTFFPVDCKKLFLEEVVV-------NRYYS-IEMYYRLLAPFVLPIELERILYLDPDIIN 108

Query: 222 ATDIAQLW------ALFSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVIL 275
               +  +       LF           ++  ++  LG L        +    FNTGVIL
Sbjct: 109 LRSFSSFYKQDFEGKLFVATTHDYATKWIQPINNIRLGTL--------SSEDYFNTGVIL 160

Query: 276 LDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAI----ISEHPYLVYTLPCQWN 331
           ++LT +R          I     + + W  L DQDIFN +    + E  + +Y L  ++ 
Sbjct: 161 MNLTLIRSTRNL---EEITTAIKINKNWLVLPDQDIFNHLYWNEVKEADWRIYNLDARFY 217

Query: 332 VQLS 335
            QL 
Sbjct: 218 SQLK 221


>gi|402912676|ref|XP_003918876.1| PREDICTED: LOW QUALITY PROTEIN: glucoside xylosyltransferase 1-like
           [Papio anubis]
          Length = 405

 Score = 42.4 bits (98), Expect = 0.84,   Method: Compositional matrix adjust.
 Identities = 43/176 (24%), Positives = 74/176 (42%), Gaps = 23/176 (13%)

Query: 195 GLLKLTLPKVLPETLAKTIVLDTD-VIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLG 253
           GL +L+LP  L     K +   TD   F   +  +W+L  +    Q   +     +  +G
Sbjct: 166 GLQRLSLPLFL-----KXVDSHTDSXFFLRPVDTIWSLLKKFHSTQIAAMAPEPKEPLIG 220

Query: 254 KLWKNHKPWPALGR-GFNTGVILLDLTKLR------DISWAGFWRIIAEKFLLT-----R 301
             +      P  G+ G N+GV+L ++T +R      D++     R+     L+T     +
Sbjct: 221 -WYSRFARHPYRGKAGGNSGVMLTNMTGMRRKYFKNDVTTV---RLQXGDILMTWLKIYK 276

Query: 302 LWTSLADQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
           L  +  DQD+ + +   +P   +  PCQWN  L D+      C      + I+H N
Sbjct: 277 LNMTWXDQDLLDIMXFHNPESPFVSPCQWNYHL-DHCLYGSSCQEAEEGIFILHGN 331


>gi|225447947|ref|XP_002268972.1| PREDICTED: UDP-glucose:glycoprotein glucosyltransferase-like [Vitis
            vinifera]
          Length = 1611

 Score = 42.4 bits (98), Expect = 0.84,   Method: Compositional matrix adjust.
 Identities = 38/143 (26%), Positives = 66/143 (46%), Gaps = 22/143 (15%)

Query: 199  LTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSD------WYL 252
            L L  + P +L K I +D D I   D+ +L+ +  + R        +N  D      W  
Sbjct: 1389 LFLDVIFPLSLEKVIFVDADQIVRADMGELYDMDIKGRPLAYTPFCDNNKDMDGYRFWRQ 1448

Query: 253  GKLWKNH---KPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLA-- 307
            G  WK+H   KP+        + + ++DL K R+ +     R+  E   L++   SL+  
Sbjct: 1449 G-FWKDHLRGKPYHI------SALYVVDLVKFRETAAGDNLRVFYET--LSKDPNSLSNL 1499

Query: 308  DQDIFNAIISEHPYLVYTLPCQW 330
            DQD+ N   ++H   +++LP +W
Sbjct: 1500 DQDLPN--FAQHTVPIFSLPQEW 1520


>gi|297838947|ref|XP_002887355.1| UDP-glucose:glycoprotein glucosyltransferase [Arabidopsis lyrata
            subsp. lyrata]
 gi|297333196|gb|EFH63614.1| UDP-glucose:glycoprotein glucosyltransferase [Arabidopsis lyrata
            subsp. lyrata]
          Length = 1616

 Score = 42.4 bits (98), Expect = 0.84,   Method: Compositional matrix adjust.
 Identities = 38/141 (26%), Positives = 67/141 (47%), Gaps = 18/141 (12%)

Query: 199  LTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSD------WYL 252
            L L  + P +L K I +D D I  TD+ +L+ +  + R        +N  +      W  
Sbjct: 1388 LFLDVIFPLSLEKVIFVDADQIIRTDMGELYDMDIKGRPLAYTPFCDNNREMDGYRFWRQ 1447

Query: 253  GKLWKNHKPWPALGRGFN-TGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLA--DQ 309
            G  WK H      GR ++ + + ++DL K R+ +     R+  E   L++   SL+  DQ
Sbjct: 1448 G-FWKEH----LRGRPYHISALYVVDLVKFRETAAGDNLRVFYET--LSKDPNSLSNLDQ 1500

Query: 310  DIFNAIISEHPYLVYTLPCQW 330
            D+ N   ++H   +++LP +W
Sbjct: 1501 DLPN--YAQHTVPIFSLPQEW 1519


>gi|253826760|ref|ZP_04869645.1| glycosyl transferase [Helicobacter canadensis MIT 98-5491]
 gi|253510166|gb|EES88825.1| glycosyl transferase [Helicobacter canadensis MIT 98-5491]
          Length = 397

 Score = 42.4 bits (98), Expect = 0.84,   Method: Compositional matrix adjust.
 Identities = 55/215 (25%), Positives = 93/215 (43%), Gaps = 40/215 (18%)

Query: 163 KPWPALGRGFYLADSVVEDVSWIP--NKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVI 220
           K +P     + L D +  + S +P  N +Y   Y   +L +   LP ++ + + LD D+I
Sbjct: 61  KIYPCTLTIYILDDQLFREYS-MPTLNGNYLAYY---RLKIGSALPLSIKRCVYLDVDMI 116

Query: 221 FATDIAQLWALFSRLRQRQTIGLV-ENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLT 279
              D+ +   LF    Q +  G+V E+ S        + +KP    G  FN+G++L+DL 
Sbjct: 117 VLGDLRE---LFEVDLQGKICGVVMEHHSQKIYKPKNQAYKPINITGSYFNSGMLLVDLD 173

Query: 280 KLRDISWAGFWR---IIAEKFLLTRLWT-SLADQDIFNAIISEHPYLVYTLPCQWNVQLS 335
                     WR   I    F + + +  S  DQDI N ++S   + V     +WN+ + 
Sbjct: 174 ---------LWRQENIEDRAFEIGKNYHYSFHDQDILNIVLSGKTHKV---GIEWNLMVC 221

Query: 336 --------DNTRSDELCY------TELTDLKIIHW 356
                   D    D+L Y      + L + KI+H+
Sbjct: 222 VYYRAICKDEKGRDKLPYYRKDFNSALRNPKILHY 256


>gi|413939392|gb|AFW73943.1| hypothetical protein ZEAMMB73_654737 [Zea mays]
          Length = 417

 Score = 42.4 bits (98), Expect = 0.87,   Method: Compositional matrix adjust.
 Identities = 36/139 (25%), Positives = 62/139 (44%), Gaps = 14/139 (10%)

Query: 199 LTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSD------WYL 252
           L L  + P +L K I +D D I   D+ +L+ +  + R        +N  D      W  
Sbjct: 201 LFLDVIFPLSLRKVIFVDADQIVRADMGELYDMNLKGRPLAYTPFCDNNKDMDGFRFWKQ 260

Query: 253 GKLWKNHKPWPALGRGFN-TGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDI 311
           G  WK+H      GR ++ + + ++DL K R  +     R+  E+        S  DQD+
Sbjct: 261 G-FWKDH----LRGRPYHISALYVVDLAKFRQTASGDTLRVFYEQLSKDPNSLSNLDQDL 315

Query: 312 FNAIISEHPYLVYTLPCQW 330
            N   ++H   +++LP +W
Sbjct: 316 PN--YAQHTVPIFSLPQEW 332


>gi|147846492|emb|CAN79512.1| hypothetical protein VITISV_014158 [Vitis vinifera]
          Length = 648

 Score = 42.4 bits (98), Expect = 0.87,   Method: Compositional matrix adjust.
 Identities = 25/88 (28%), Positives = 44/88 (50%), Gaps = 8/88 (9%)

Query: 47  FHLITDTVALNILQTLFSTWSVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPK 106
           FH++TD ++   ++  F   S  +V +        VE++    N  Y  +   L+  LP+
Sbjct: 409 FHIVTDKLSFAAMKMWFLVNSPAKVTIQ-------VENIDDFKNPKYLSMLNHLRFYLPE 461

Query: 107 VLPETLAKTIVLDTDVIFATDIAQLWAL 134
           V P+ L K + LD D++   D+  LW+L
Sbjct: 462 VYPK-LEKILFLDDDIVVQKDLTPLWSL 488


>gi|145337405|ref|NP_177278.3| UDP-glucose:glycoprotein glucosyltransferase [Arabidopsis thaliana]
 gi|122236068|sp|Q0WL80.1|UGGG_ARATH RecName: Full=UDP-glucose:glycoprotein glucosyltransferase; AltName:
            Full=EMS-mutagenized BRI1 suppressor 1; AltName:
            Full=Protein PRIORITY IN SWEET LIFE 2; Flags: Precursor
 gi|110740466|dbj|BAF02127.1| putative UDP-glucose:glycoprotein glucosyltransferase [Arabidopsis
            thaliana]
 gi|332197053|gb|AEE35174.1| UDP-glucose:glycoprotein glucosyltransferase [Arabidopsis thaliana]
          Length = 1613

 Score = 42.4 bits (98), Expect = 0.88,   Method: Compositional matrix adjust.
 Identities = 38/141 (26%), Positives = 67/141 (47%), Gaps = 18/141 (12%)

Query: 199  LTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSD------WYL 252
            L L  + P +L K I +D D I  TD+ +L+ +  + R        +N  +      W  
Sbjct: 1386 LFLDVIFPLSLEKVIFVDADQIIRTDMGELYDMDIKGRPLAYTPFCDNNREMDGYKFWKQ 1445

Query: 253  GKLWKNHKPWPALGRGFN-TGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLA--DQ 309
            G  WK H      GR ++ + + ++DL K R+ +     R+  E   L++   SL+  DQ
Sbjct: 1446 G-FWKEH----LRGRPYHISALYVVDLVKFRETAAGDNLRVFYET--LSKDPNSLSNLDQ 1498

Query: 310  DIFNAIISEHPYLVYTLPCQW 330
            D+ N   ++H   +++LP +W
Sbjct: 1499 DLPN--YAQHTVPIFSLPQEW 1517


>gi|334183827|ref|NP_001185370.1| UDP-glucose:glycoprotein glucosyltransferase [Arabidopsis thaliana]
 gi|332197054|gb|AEE35175.1| UDP-glucose:glycoprotein glucosyltransferase [Arabidopsis thaliana]
          Length = 1614

 Score = 42.4 bits (98), Expect = 0.89,   Method: Compositional matrix adjust.
 Identities = 38/141 (26%), Positives = 67/141 (47%), Gaps = 18/141 (12%)

Query: 199  LTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSD------WYL 252
            L L  + P +L K I +D D I  TD+ +L+ +  + R        +N  +      W  
Sbjct: 1387 LFLDVIFPLSLEKVIFVDADQIIRTDMGELYDMDIKGRPLAYTPFCDNNREMDGYKFWKQ 1446

Query: 253  GKLWKNHKPWPALGRGFN-TGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLA--DQ 309
            G  WK H      GR ++ + + ++DL K R+ +     R+  E   L++   SL+  DQ
Sbjct: 1447 G-FWKEH----LRGRPYHISALYVVDLVKFRETAAGDNLRVFYET--LSKDPNSLSNLDQ 1499

Query: 310  DIFNAIISEHPYLVYTLPCQW 330
            D+ N   ++H   +++LP +W
Sbjct: 1500 DLPN--YAQHTVPIFSLPQEW 1518


>gi|298204497|emb|CBI23772.3| unnamed protein product [Vitis vinifera]
          Length = 1715

 Score = 42.4 bits (98), Expect = 0.90,   Method: Compositional matrix adjust.
 Identities = 38/143 (26%), Positives = 66/143 (46%), Gaps = 22/143 (15%)

Query: 199  LTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSD------WYL 252
            L L  + P +L K I +D D I   D+ +L+ +  + R        +N  D      W  
Sbjct: 1493 LFLDVIFPLSLEKVIFVDADQIVRADMGELYDMDIKGRPLAYTPFCDNNKDMDGYRFWRQ 1552

Query: 253  GKLWKNH---KPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLA-- 307
            G  WK+H   KP+        + + ++DL K R+ +     R+  E   L++   SL+  
Sbjct: 1553 G-FWKDHLRGKPYHI------SALYVVDLVKFRETAAGDNLRVFYET--LSKDPNSLSNL 1603

Query: 308  DQDIFNAIISEHPYLVYTLPCQW 330
            DQD+ N   ++H   +++LP +W
Sbjct: 1604 DQDLPN--FAQHTVPIFSLPQEW 1624


>gi|21751080|dbj|BAC03899.1| unnamed protein product [Homo sapiens]
          Length = 187

 Score = 42.4 bits (98), Expect = 0.90,   Method: Compositional matrix adjust.
 Identities = 34/102 (33%), Positives = 47/102 (46%), Gaps = 15/102 (14%)

Query: 269 FNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTS------LADQDIFNAIISEHPYL 322
           FN+GV+LL+L  +R    +  +  + E   + +L         L DQD F  I  EHP L
Sbjct: 82  FNSGVMLLNLEAMRQ---SPLYSRLLEPAQVQQLADKYHFRGHLGDQDFFTMIGMEHPKL 138

Query: 323 VYTLPCQWNVQLS----DNTRSD--ELCYTELTDLKIIHWNS 358
            + L C WN QL     D+  SD  E  +     +KI H N 
Sbjct: 139 FHVLDCTWNRQLCTWWRDHGYSDVFEAYFRCEGHVKIYHGNC 180


>gi|323453752|gb|EGB09623.1| hypothetical protein AURANDRAFT_63327 [Aureococcus anophagefferens]
          Length = 2161

 Score = 42.4 bits (98), Expect = 0.91,   Method: Composition-based stats.
 Identities = 40/155 (25%), Positives = 62/155 (40%), Gaps = 20/155 (12%)

Query: 195  GLLKLTLPKVLPET----LAKTIVLDTDVIFATDIAQLW--ALFSRLRQRQTIGLVENQS 248
             ++KL LP +L +     L   + +DTDV+F  D  ++W  A+ +               
Sbjct: 1196 AVVKLFLPSLLAKNDTAGLGAAVYVDTDVLFLDDPQRVWVDAVGALAASGAVAAAAAEHG 1255

Query: 249  DWYLGKLWKNH--------KPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLT 300
            D   G    +H         P      G NTGV+ L+ T+ R + +    R++     L 
Sbjct: 1256 DLEAGCSSTDHFYDHEERETPRAFADDGANTGVVALNFTRARAVGYEA--RLVEALLDLD 1313

Query: 301  R----LWTSLADQDIFNAIISEHPYLVYTLPCQWN 331
            R       SL DQ +FN         V  LPC++N
Sbjct: 1314 RSGGERALSLGDQSVFNVAAGNASLDVARLPCEFN 1348


>gi|224128416|ref|XP_002320324.1| predicted protein [Populus trichocarpa]
 gi|222861097|gb|EEE98639.1| predicted protein [Populus trichocarpa]
          Length = 368

 Score = 42.0 bits (97), Expect = 0.95,   Method: Compositional matrix adjust.
 Identities = 32/91 (35%), Positives = 48/91 (52%), Gaps = 13/91 (14%)

Query: 201 LPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVE----NQSDWYLGKLW 256
           L  +L   + + I LD+D++   DIA+LWA  + L  R TIG  E    N + ++    W
Sbjct: 178 LADLLETCVKRVIYLDSDLVVVDDIAKLWA--TNLGSR-TIGAPEYCHANFTKYFTSGFW 234

Query: 257 KNHKPWPALGRG-----FNTGVILLDLTKLR 282
            + K +    RG     FNTGV+++DL K R
Sbjct: 235 SD-KRFSGAFRGRKPCYFNTGVMVIDLVKWR 264


>gi|300362623|ref|ZP_07058799.1| glycosyltransferase [Lactobacillus gasseri JV-V03]
 gi|300353614|gb|EFJ69486.1| glycosyltransferase [Lactobacillus gasseri JV-V03]
          Length = 316

 Score = 42.0 bits (97), Expect = 0.97,   Method: Compositional matrix adjust.
 Identities = 39/163 (23%), Positives = 70/163 (42%), Gaps = 20/163 (12%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L +P + P+   K + LD D I  TDIA+L+   +          V + S  ++  L  
Sbjct: 94  RLFIPNLFPQ-YDKAVYLDADTIICTDIAELY---NTEIGDNMFASVPDMSIRFIKPLQV 149

Query: 258 NHKPWPAL---GRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNA 314
             K    +    +  N GVIL ++   RD  +   +  + EK+    +     DQ   N 
Sbjct: 150 YIKECQGIFPPEKYINNGVILFNMKAFRDKKFVDKFYSLIEKYHFDNI---DPDQAYMNE 206

Query: 315 IISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
           I  +    +Y LP +W+   +++         E+ + KI+H+N
Sbjct: 207 ICEDK---IYHLPLEWDAMPNEHM-------DEIKNPKIVHYN 239


>gi|313142229|ref|ZP_07804422.1| glycosyl transferase family protein [Helicobacter canadensis MIT
           98-5491]
 gi|313131260|gb|EFR48877.1| glycosyl transferase family protein [Helicobacter canadensis MIT
           98-5491]
          Length = 357

 Score = 42.0 bits (97), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 55/215 (25%), Positives = 93/215 (43%), Gaps = 40/215 (18%)

Query: 163 KPWPALGRGFYLADSVVEDVSWIP--NKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVI 220
           K +P     + L D +  + S +P  N +Y   Y   +L +   LP ++ + + LD D+I
Sbjct: 21  KIYPCTLTIYILDDQLFREYS-MPTLNGNYLAYY---RLKIGSALPLSIKRCVYLDVDMI 76

Query: 221 FATDIAQLWALFSRLRQRQTIGLV-ENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLT 279
              D+ +   LF    Q +  G+V E+ S        + +KP    G  FN+G++L+DL 
Sbjct: 77  VLGDLRE---LFEVDLQGKICGVVMEHHSQKIYKPKNQAYKPINITGSYFNSGMLLVDLD 133

Query: 280 KLRDISWAGFWR---IIAEKFLLTRLWT-SLADQDIFNAIISEHPYLVYTLPCQWNVQLS 335
                     WR   I    F + + +  S  DQDI N ++S   + V     +WN+ + 
Sbjct: 134 ---------LWRQENIEDRAFEIGKNYHYSFHDQDILNIVLSGKTHKV---GIEWNLMVC 181

Query: 336 --------DNTRSDELCY------TELTDLKIIHW 356
                   D    D+L Y      + L + KI+H+
Sbjct: 182 VYYRAICKDEKGRDKLPYYRKDFNSALRNPKILHY 216


>gi|241569994|ref|XP_002402644.1| hypothetical protein IscW_ISCW009544 [Ixodes scapularis]
 gi|215502040|gb|EEC11534.1| hypothetical protein IscW_ISCW009544 [Ixodes scapularis]
          Length = 85

 Score = 42.0 bits (97), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 21/78 (26%), Positives = 38/78 (48%), Gaps = 8/78 (10%)

Query: 213 IVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSD----WYLGKLWKNHKPWPALGRG 268
           + +D D +F + + +LW++F ++ +     L     D    WY         P+PA   G
Sbjct: 9   LYVDADTLFLSPVEELWSVFEKMNESHLTALTYETEDVRTNWYQQ---HGKHPYPA-PFG 64

Query: 269 FNTGVILLDLTKLRDISW 286
            N GV+ ++LT++R   W
Sbjct: 65  VNAGVMPMNLTRMRSFDW 82


>gi|116628739|ref|YP_813911.1| lipopolysaccharide biosynthesis glycosyltransferase [Lactobacillus
           gasseri ATCC 33323]
 gi|311111532|ref|ZP_07712929.1| glycosyltransferase [Lactobacillus gasseri MV-22]
 gi|116094321|gb|ABJ59473.1| Lipopolysaccharide biosynthesis glycosyltransferase [Lactobacillus
           gasseri ATCC 33323]
 gi|311066686|gb|EFQ47026.1| glycosyltransferase [Lactobacillus gasseri MV-22]
          Length = 317

 Score = 42.0 bits (97), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 39/163 (23%), Positives = 70/163 (42%), Gaps = 20/163 (12%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L +P + P+   K + LD D I  TDIA+L+   +          V + S  ++  L  
Sbjct: 95  RLFIPNLFPQ-YDKAVYLDADTIICTDIAELY---NTEIGDNMFASVPDMSIRFIKPLQV 150

Query: 258 NHKPWPAL---GRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNA 314
             K    +    +  N GVIL ++   RD  +   +  + EK+    +     DQ   N 
Sbjct: 151 YIKECQGIFPPEKYINNGVILFNMKAFRDKKFVDKFYSLIEKYHFDNI---DPDQAYMNE 207

Query: 315 IISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
           I  +    +Y LP +W+   +++         E+ + KI+H+N
Sbjct: 208 ICEDK---IYHLPLEWDAMPNEHM-------DEIKNPKIVHYN 240


>gi|145297263|ref|YP_001140104.1| glycosyl transferase family protein [Aeromonas salmonicida subsp.
           salmonicida A449]
 gi|418362015|ref|ZP_12962660.1| glycosyl transferase family protein [Aeromonas salmonicida subsp.
           salmonicida 01-B526]
 gi|142850035|gb|ABO88356.1| glycosyl transferase family 8 [Aeromonas salmonicida subsp.
           salmonicida A449]
 gi|356686830|gb|EHI51422.1| glycosyl transferase family protein [Aeromonas salmonicida subsp.
           salmonicida 01-B526]
          Length = 321

 Score = 42.0 bits (97), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 37/140 (26%), Positives = 64/140 (45%), Gaps = 21/140 (15%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLG-KLW 256
           +  +P VL + + K + +D D+I   D++ LW++           +V   SD  LG    
Sbjct: 91  RFAIPNVL-QNIEKVLFIDADMIAVGDVSSLWSI------EMGEAVVAVVSDHILGYDKE 143

Query: 257 KNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEK---FLLTRLWTSLADQDIFN 313
           K  +   + G+ FN G +L+DL K R+       + I+E+    L+        DQD  N
Sbjct: 144 KQQERGISSGKYFNAGFMLMDLDKWRE-------KNISEQALGLLIDNNGFEHNDQDALN 196

Query: 314 AIISEHPYLVYTLPCQWNVQ 333
            I+ +  + + T   +WN Q
Sbjct: 197 IILEKKTFYLDT---KWNAQ 213


>gi|297809943|ref|XP_002872855.1| hypothetical protein ARALYDRAFT_490360 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297318692|gb|EFH49114.1| hypothetical protein ARALYDRAFT_490360 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 346

 Score = 42.0 bits (97), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 31/93 (33%), Positives = 49/93 (52%), Gaps = 17/93 (18%)

Query: 201 LPKVLPETLAKTIVLDTDVIFATDIAQLW--ALFSRLRQRQTIGLVE----NQSDWYLGK 254
           L  +L   +++ I LD+D++   DIA+LW  +L SR+     IG  E    N + ++ G 
Sbjct: 157 LADLLEPCVSRVIYLDSDLVVVDDIAKLWKTSLGSRI-----IGAPEYCHANFTKYFTGG 211

Query: 255 LWKNHKPWPALGRG-----FNTGVILLDLTKLR 282
            W   + +    RG     FNTGV+++DL K R
Sbjct: 212 FWSEER-FSGAFRGRKPCYFNTGVMVIDLKKWR 243


>gi|282852386|ref|ZP_06261728.1| glycosyltransferase family 8 [Lactobacillus gasseri 224-1]
 gi|282556128|gb|EFB61748.1| glycosyltransferase family 8 [Lactobacillus gasseri 224-1]
          Length = 316

 Score = 42.0 bits (97), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 39/163 (23%), Positives = 70/163 (42%), Gaps = 20/163 (12%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L +P + P+   K + LD D I  TDIA+L+   +          V + S  ++  L  
Sbjct: 94  RLFIPNLFPQ-YDKAVYLDADTIICTDIAELY---NTEIGDNMFASVPDMSIRFIKPLQV 149

Query: 258 NHKPWPAL---GRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNA 314
             K    +    +  N GVIL ++   RD  +   +  + EK+    +     DQ   N 
Sbjct: 150 YIKECQGIFPPEKYINNGVILFNMKAFRDKKFVDKFYSLIEKYHFDNI---DPDQAYMNE 206

Query: 315 IISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
           I  +    +Y LP +W+   +++         E+ + KI+H+N
Sbjct: 207 ICEDK---IYHLPLEWDAMPNEHM-------DEIKNPKIVHYN 239


>gi|356534929|ref|XP_003536003.1| PREDICTED: probable galacturonosyltransferase-like 7-like [Glycine
           max]
          Length = 359

 Score = 42.0 bits (97), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 31/95 (32%), Positives = 51/95 (53%), Gaps = 11/95 (11%)

Query: 201 LPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVE----NQSDWYLGKLW 256
           L  +L   + + I LD+D++   DIA+LW+  + L  R TIG  E    N + ++    W
Sbjct: 170 LADLLEPCVERVIYLDSDLVVVDDIAKLWS--TSLGSR-TIGAPEYCHANFTKYFTAAFW 226

Query: 257 KNHKPWPAL-GRG---FNTGVILLDLTKLRDISWA 287
            + +   A  GR    FNTGV+++DL + R I ++
Sbjct: 227 SDTRFARAFAGRRPCYFNTGVMVIDLVRWRRIGYS 261


>gi|449523017|ref|XP_004168521.1| PREDICTED: UDP-glucose:glycoprotein glucosyltransferase-like
           [Cucumis sativus]
          Length = 1056

 Score = 42.0 bits (97), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 38/143 (26%), Positives = 66/143 (46%), Gaps = 22/143 (15%)

Query: 199 LTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSD------WYL 252
           L L  + P +L K I +D D I  TD+ +L+ +  + +        +N  D      W  
Sbjct: 835 LFLDVIFPLSLEKVIFVDADQIVRTDMGELYDMDIKGKPLAYTPFCDNNKDMDGYRFWRQ 894

Query: 253 GKLWKNH---KPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLA-- 307
           G  WK H   KP+        + + ++DL K R+ +     R+  E   L++   SL+  
Sbjct: 895 G-FWKEHLRGKPYHI------SALYVVDLKKFRETAAGDNLRVFYES--LSKDPNSLSNL 945

Query: 308 DQDIFNAIISEHPYLVYTLPCQW 330
           DQD+ N   ++H   +++LP +W
Sbjct: 946 DQDLPN--YAQHTVPIFSLPQEW 966


>gi|358332612|dbj|GAA51246.1| glycosyltransferase-like protein LARGE2 [Clonorchis sinensis]
          Length = 156

 Score = 41.6 bits (96), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 19/59 (32%), Positives = 32/59 (54%)

Query: 276 LDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQL 334
           + L+++R+  W   +   A+  L       + +Q + N II E P L Y +PC+W+VQL
Sbjct: 1   MQLSRMREAGWWDMFIEEAKNILSVYRALPIGEQSVLNNIILERPELYYRVPCEWHVQL 59


>gi|238852953|ref|ZP_04643352.1| lipopolysaccharide biosynthesis glycosyltransferase [Lactobacillus
           gasseri 202-4]
 gi|238834403|gb|EEQ26641.1| lipopolysaccharide biosynthesis glycosyltransferase [Lactobacillus
           gasseri 202-4]
          Length = 316

 Score = 41.6 bits (96), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 39/163 (23%), Positives = 70/163 (42%), Gaps = 20/163 (12%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           +L +P + P+   K + LD D I  TDIA+L+   +          V + S  ++  L  
Sbjct: 94  RLFIPNLFPQ-YDKAVYLDADTIICTDIAELY---NTEIGDNMFASVPDMSIRFIKPLQV 149

Query: 258 NHKPWPAL---GRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNA 314
             K    +    +  N GVIL ++   RD  +   +  + EK+    +     DQ   N 
Sbjct: 150 YIKECQGIFPPEKYINNGVILFNMKAFRDKKFVDKFYSLIEKYHFDNI---DPDQAYMNE 206

Query: 315 IISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWN 357
           I  +    +Y LP +W+   +++         E+ + KI+H+N
Sbjct: 207 ICEDK---IYHLPLEWDAMPNEHM-------DEIKNPKIVHYN 239


>gi|357486289|ref|XP_003613432.1| Glycosyltransferase CAZy family GT8 [Medicago truncatula]
 gi|355514767|gb|AES96390.1| Glycosyltransferase CAZy family GT8 [Medicago truncatula]
          Length = 395

 Score = 41.6 bits (96), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 27/95 (28%), Positives = 50/95 (52%), Gaps = 18/95 (18%)

Query: 201 LPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVE----NQSDWYLGKLW 256
           L  +L   + + I LD+D++   DI++LW++  ++  ++ IG  E    N + ++  + W
Sbjct: 173 LGDMLDTCVERVIYLDSDIVVVDDISKLWSV--KMDAKKVIGAPEYCHANFTKYFTDEFW 230

Query: 257 KNHKPWPALGRG--------FNTGVILLDLTKLRD 283
            +    P L R         FNTGV+++DL K R+
Sbjct: 231 ND----PLLSRVFKARKACYFNTGVMVMDLMKWRE 261


>gi|404369533|ref|ZP_10974867.1| hypothetical protein CSBG_03347 [Clostridium sp. 7_2_43FAA]
 gi|226914520|gb|EEH99721.1| hypothetical protein CSBG_03347 [Clostridium sp. 7_2_43FAA]
          Length = 273

 Score = 41.6 bits (96), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 36/125 (28%), Positives = 56/125 (44%), Gaps = 25/125 (20%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWA------LFSRLRQRQTIGLVENQSDWY 251
           +L  P +LP+++ K + LD D++    I  LW       +F+        G++   +D  
Sbjct: 90  RLLAPLILPKSIKKILYLDPDILIINSIRPLWETELGNYIFAAASHVGVTGVI---NDIN 146

Query: 252 LGKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTS---LAD 308
             +L  +H         +N+GV+L+DLTK R I        + E F   R       L D
Sbjct: 147 RVRLRVDHD-------YYNSGVMLMDLTKARSIV------NVEEIFQCVREHKEELLLPD 193

Query: 309 QDIFN 313
           QDIFN
Sbjct: 194 QDIFN 198


>gi|303273630|ref|XP_003056175.1| glycosyltransferase family 24 protein [Micromonas pusilla CCMP1545]
 gi|226462259|gb|EEH59551.1| glycosyltransferase family 24 protein [Micromonas pusilla CCMP1545]
          Length = 1657

 Score = 41.6 bits (96), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 38/137 (27%), Positives = 66/137 (48%), Gaps = 10/137 (7%)

Query: 199  LTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWKN 258
            L L  + P TL K I +D D +  +++ +LW +  R          +N  +    + WK 
Sbjct: 1452 LFLDVLFPLTLNKVIFVDADQVVRSNLKELWEMDLRGAPYAYTPFCDNNPEMEGYRFWK- 1510

Query: 259  HKPWPA--LGRGFN-TGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLA--DQDIFN 313
            H  W     G+ ++ + + ++DL   R  +     R+I E   L++  +SLA  DQD+ N
Sbjct: 1511 HGFWQTHLAGKPYHISALYVVDLETFRHTAAGDKLRLIYET--LSKDPSSLANLDQDLPN 1568

Query: 314  AIISEHPYLVYTLPCQW 330
               ++H   ++TLP QW
Sbjct: 1569 --YAQHQVPIFTLPQQW 1583


>gi|301309800|ref|ZP_07215739.1| putative lipopolysaccharide 1,2-glucosyltransferase/general stress
           protein [Bacteroides sp. 20_3]
 gi|423340274|ref|ZP_17318013.1| hypothetical protein HMPREF1059_03938 [Parabacteroides distasonis
           CL09T03C24]
 gi|300831374|gb|EFK62005.1| putative lipopolysaccharide 1,2-glucosyltransferase/general stress
           protein [Bacteroides sp. 20_3]
 gi|409227709|gb|EKN20605.1| hypothetical protein HMPREF1059_03938 [Parabacteroides distasonis
           CL09T03C24]
          Length = 301

 Score = 41.6 bits (96), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 42/163 (25%), Positives = 78/163 (47%), Gaps = 15/163 (9%)

Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
           FY  D  V   +  P   +  +    +L +  +LP+++ K + LD DV+    +  LW  
Sbjct: 61  FYTVDKKV--FANCPISRHITLATYFRLIMTDILPKSVEKVLYLDCDVVVRHSLRSLWDT 118

Query: 232 FSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWA-GFW 290
             +       G++ + S   + +++   +  P+LG  FN GV+L++L   R+ + +  F+
Sbjct: 119 DIK---SYAAGVIPDMSIDDI-RIYNRLQYSPSLGY-FNAGVLLVNLRYWRENNLSESFF 173

Query: 291 RIIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNVQ 333
            II +     R      DQD+ N ++ E   +  TLP ++NVQ
Sbjct: 174 EIINKYPERLR----YHDQDVLNIVLKE---IKLTLPLKYNVQ 209


>gi|413939391|gb|AFW73942.1| hypothetical protein ZEAMMB73_654737 [Zea mays]
          Length = 388

 Score = 41.6 bits (96), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 36/139 (25%), Positives = 62/139 (44%), Gaps = 14/139 (10%)

Query: 199 LTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSD------WYL 252
           L L  + P +L K I +D D I   D+ +L+ +  + R        +N  D      W  
Sbjct: 201 LFLDVIFPLSLRKVIFVDADQIVRADMGELYDMNLKGRPLAYTPFCDNNKDMDGFRFWKQ 260

Query: 253 GKLWKNHKPWPALGRGFN-TGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDI 311
           G  WK+H      GR ++ + + ++DL K R  +     R+  E+        S  DQD+
Sbjct: 261 G-FWKDH----LRGRPYHISALYVVDLAKFRQTASGDTLRVFYEQLSKDPNSLSNLDQDL 315

Query: 312 FNAIISEHPYLVYTLPCQW 330
            N   ++H   +++LP +W
Sbjct: 316 PN--YAQHTVPIFSLPQEW 332


>gi|321464173|gb|EFX75183.1| hypothetical protein DAPPUDRAFT_306884 [Daphnia pulex]
          Length = 498

 Score = 41.6 bits (96), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 39/165 (23%), Positives = 61/165 (36%), Gaps = 49/165 (29%)

Query: 496 LFKHWEGPISLTLYM--SDAE-AQQFLSYAGNSEALRSRRNIGYHVVYKEGNF------- 545
           L + W+GP+S+ +Y   SD E A + ++Y         R+ + +H+V  E +F       
Sbjct: 180 LTRRWDGPLSVAVYTPGSDYESAIRSMAYLRQCTGPDIRQRVSFHLVLDERHFPTVLRKL 239

Query: 546 --------------------------------------YPINTLRNVALNQVSTPYVFLL 567
                                                 YP+N LRNVA     T Y+   
Sbjct: 240 HSQQGPPTTRSPLQTSAVNGLSTATPETQDNKRTMNLTYPVNLLRNVARQNAPTHYILAS 299

Query: 568 DIDFLPMFGLYPYLKSSIRSMDMHGHGGK-KVLVVPAFETQRYRT 611
           D +  P   +       +R  +   H G+ KV V+P FE  R +T
Sbjct: 300 DAELYPSLDMVRLFFDMLRRNESVLHLGRPKVFVLPIFEVYRNQT 344


>gi|449457101|ref|XP_004146287.1| PREDICTED: UDP-glucose:glycoprotein glucosyltransferase-like [Cucumis
            sativus]
          Length = 1575

 Score = 41.6 bits (96), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 38/143 (26%), Positives = 66/143 (46%), Gaps = 22/143 (15%)

Query: 199  LTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSD------WYL 252
            L L  + P +L K I +D D I  TD+ +L+ +  + +        +N  D      W  
Sbjct: 1359 LFLDVIFPLSLEKVIFVDADQIVRTDMGELYDMDIKGKPLAYTPFCDNNKDMDGYRFWRQ 1418

Query: 253  GKLWKNH---KPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLA-- 307
            G  WK H   KP+        + + ++DL K R+ +     R+  E   L++   SL+  
Sbjct: 1419 G-FWKEHLRGKPYHI------SALYVVDLKKFRETAAGDNLRVFYES--LSKDPNSLSNL 1469

Query: 308  DQDIFNAIISEHPYLVYTLPCQW 330
            DQD+ N   ++H   +++LP +W
Sbjct: 1470 DQDLPN--YAQHTVPIFSLPQEW 1490


>gi|390603232|gb|EIN12624.1| hypothetical protein PUNSTDRAFT_59955 [Punctularia strigosozonata
           HHB-11173 SS5]
          Length = 332

 Score = 41.6 bits (96), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 28/109 (25%), Positives = 50/109 (45%), Gaps = 7/109 (6%)

Query: 44  PLHFHLITDTVALNILQTLFSTWSVPQVEVSFYLADSVVEDVS------WIPNKHYSGVY 97
           P  FH+I    A+  ++   + +S P   V         +D+S       +   H +G+ 
Sbjct: 6   PTVFHIICSQDAIPTIEEKLALFSRPAYPVRATFYPLTQDDISRRAARAGVGTSHEAGMG 65

Query: 98  GLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGL 146
           GL+K  + +VL   + + I +DTD++F  D A LW  F  + + Q +  
Sbjct: 66  GLVKTFIHEVL-HDVERIIFVDTDMLFLVDPALLWREFDAMDENQMVAF 113


>gi|15217851|ref|NP_171772.1| galacturonosyltransferase 5 [Arabidopsis thaliana]
 gi|42571307|ref|NP_973744.1| galacturonosyltransferase 5 [Arabidopsis thaliana]
 gi|75172933|sp|Q9FWY9.1|GATL5_ARATH RecName: Full=Probable galacturonosyltransferase-like 5
 gi|9972380|gb|AAG10630.1|AC022521_8 Unknown protein [Arabidopsis thaliana]
 gi|25083419|gb|AAN72073.1| Unknown protein [Arabidopsis thaliana]
 gi|26452192|dbj|BAC43184.1| unknown protein [Arabidopsis thaliana]
 gi|31711862|gb|AAP68287.1| At1g02720 [Arabidopsis thaliana]
 gi|332189341|gb|AEE27462.1| galacturonosyltransferase 5 [Arabidopsis thaliana]
 gi|332189342|gb|AEE27463.1| galacturonosyltransferase 5 [Arabidopsis thaliana]
          Length = 361

 Score = 41.2 bits (95), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 31/91 (34%), Positives = 48/91 (52%), Gaps = 13/91 (14%)

Query: 201 LPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVE----NQSDWYLGKLW 256
           L  +L   + + I LD+D++   DI +LW   + L QR TIG  E    N + ++ G  W
Sbjct: 172 LADLLEPCVKRVIYLDSDLVVVDDIVKLWK--TGLGQR-TIGAPEYCHANFTKYFTGGFW 228

Query: 257 KNHKPWPALGRG-----FNTGVILLDLTKLR 282
            + K +    +G     FNTGV+++DL K R
Sbjct: 229 SD-KRFNGTFKGRNPCYFNTGVMVIDLKKWR 258



 Score = 38.9 bits (89), Expect = 9.5,   Method: Compositional matrix adjust.
 Identities = 38/169 (22%), Positives = 77/169 (45%), Gaps = 11/169 (6%)

Query: 2   SSDEEPVIPTCEVIQVAIVCAGYNSTRSLVTLIKSILFYRKNP--LHFHLITDTVALNIL 59
           SS E+  +    ++ VAI     +  R  +  + SIL +   P  + FH +  + + N+ 
Sbjct: 68  SSGEDSGVCNPNLVHVAIT-LDIDYLRGSIAAVNSILQHSMCPQSVFFHFLVSSESQNLE 126

Query: 60  QTLFSTW-SVPQVEVSFYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVL 118
             + ST+  +  +++ ++  ++V   +S    +         +  L  +L   + + I L
Sbjct: 127 SLIRSTFPKLTNLKIYYFAPETVQSLISSSVRQALEQPLNYARNYLADLLEPCVKRVIYL 186

Query: 119 DTDVIFATDIAQLWALFSRLRQRQTIGLVE----NQSDWYLGKLWKNHK 163
           D+D++   DI +LW   + L QR TIG  E    N + ++ G  W + +
Sbjct: 187 DSDLVVVDDIVKLWK--TGLGQR-TIGAPEYCHANFTKYFTGGFWSDKR 232


>gi|357473531|ref|XP_003607050.1| Glycosyltransferase CAZy family GT8 [Medicago truncatula]
 gi|357496195|ref|XP_003618386.1| Glycosyltransferase CAZy family GT8 [Medicago truncatula]
 gi|355493401|gb|AES74604.1| Glycosyltransferase CAZy family GT8 [Medicago truncatula]
 gi|355508105|gb|AES89247.1| Glycosyltransferase CAZy family GT8 [Medicago truncatula]
 gi|388506006|gb|AFK41069.1| unknown [Medicago truncatula]
          Length = 371

 Score = 41.2 bits (95), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 29/94 (30%), Positives = 49/94 (52%), Gaps = 14/94 (14%)

Query: 201 LPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVE----NQSDWYLGKLW 256
           L  +L E + + I LD+DVI   DI  LW +   L   + IG  E    N + ++  + W
Sbjct: 172 LADLLEECVERVIYLDSDVIVVDDIQDLWKV--SLTDSKVIGAPEYCHANFTRYFSYEFW 229

Query: 257 KNHKPWPALGRG-------FNTGVILLDLTKLRD 283
            +++ +  + +G       FNTGV+++DL K R+
Sbjct: 230 SSYE-FSEVFKGRKNRPCYFNTGVMVMDLMKWRE 262


>gi|225437483|ref|XP_002274175.1| PREDICTED: probable galacturonosyltransferase-like 7-like [Vitis
           vinifera]
          Length = 450

 Score = 41.2 bits (95), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 32/94 (34%), Positives = 48/94 (51%), Gaps = 11/94 (11%)

Query: 201 LPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVE----NQSDWYLGKLW 256
           L  +L   + + I LD+D+I   DI +LW+  + L  R TIG  E    N + ++  K W
Sbjct: 261 LADLLEPCVRRVIYLDSDLIVVDDIYKLWS--TSLGTR-TIGAPEYCHANFTRYFTDKFW 317

Query: 257 KNHKPWPAL-GRG---FNTGVILLDLTKLRDISW 286
              + +    GR    FNTGVI++DL K R   +
Sbjct: 318 SEKRYYGTFDGRKPCYFNTGVIVIDLAKWRRFGF 351


>gi|7362746|emb|CAB83116.1| putative protein [Arabidopsis thaliana]
          Length = 357

 Score = 41.2 bits (95), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 47/178 (26%), Positives = 78/178 (43%), Gaps = 32/178 (17%)

Query: 201 LPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVE----NQSDWYLGKLW 256
           L  +L   + + I LD+D+I   DIA+LW   ++L  + TIG  E    N + ++    W
Sbjct: 167 LADLLEPCVRRVIYLDSDLIVVDDIAKLW--MTKLGSK-TIGAPEYCHANFTKYFTPAFW 223

Query: 257 KNHKPWPAL-GRG---FNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIF 312
            + +   A  GR    FNTGV+++DL + R +   G+  +I +       W  +   D  
Sbjct: 224 SDERFSGAFSGRKPCYFNTGVMVMDLERWRRV---GYTEVIEK-------WMEIQKSDRI 273

Query: 313 NAIISEHPYL------VYTLPCQWNVQ--LSDNTRSDELCYT-ELTDLKIIHWNSPKK 361
             + S  P+L      V  +  +WN      DN R    C       + ++HW+   K
Sbjct: 274 YELGSLPPFLLVFAGEVAPIEHRWNQHGLGGDNVRGS--CRDLHPGPVSLLHWSGSGK 329


>gi|297843072|ref|XP_002889417.1| hypothetical protein ARALYDRAFT_470231 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297335259|gb|EFH65676.1| hypothetical protein ARALYDRAFT_470231 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 361

 Score = 41.2 bits (95), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 31/91 (34%), Positives = 48/91 (52%), Gaps = 13/91 (14%)

Query: 201 LPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVE----NQSDWYLGKLW 256
           L  +L   + + I LD+D++   DI +LW   + L QR TIG  E    N + ++ G  W
Sbjct: 172 LADLLEPCVKRVIYLDSDLVVVDDIVKLWK--TGLGQR-TIGAPEYCHANFTKYFTGGFW 228

Query: 257 KNHKPWPALGRG-----FNTGVILLDLTKLR 282
            + K +    +G     FNTGV+++DL K R
Sbjct: 229 SD-KRFNGTFKGRNPCYFNTGVMVIDLKKWR 258


>gi|147771376|emb|CAN62993.1| hypothetical protein VITISV_021618 [Vitis vinifera]
          Length = 367

 Score = 41.2 bits (95), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 32/90 (35%), Positives = 47/90 (52%), Gaps = 11/90 (12%)

Query: 201 LPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVE----NQSDWYLGKLW 256
           L  +L   + + I LD+D+I   DI +LW+  + L  R TIG  E    N + ++  K W
Sbjct: 178 LADLLEPCVRRVIYLDSDLIVVDDIYKLWS--TSLGTR-TIGAPEYCHANFTRYFTDKFW 234

Query: 257 KNHKPWPAL-GRG---FNTGVILLDLTKLR 282
              + +    GR    FNTGVI++DL K R
Sbjct: 235 SEKRYYGTFDGRKPCYFNTGVIVIDLAKWR 264


>gi|339007385|ref|ZP_08639960.1| sporulation kinase E [Brevibacillus laterosporus LMG 15441]
 gi|338776594|gb|EGP36122.1| sporulation kinase E [Brevibacillus laterosporus LMG 15441]
          Length = 599

 Score = 41.2 bits (95), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 65/228 (28%), Positives = 98/228 (42%), Gaps = 39/228 (17%)

Query: 359 PKKLKVKNK--HMEFFRNLYLTFLEYDGNLLRRELFGCNL-----TQTTLQQAE-LSNLN 410
           P +L+V+N   H  F RN  +  L  DGN+L   L G N+     T+  LQQ+E LS + 
Sbjct: 326 PIELRVRNHDGHFLFIRNSCIPMLSKDGNILGFALIGENIHHQKQTENLLQQSEKLSLIG 385

Query: 411 E--EDPCYDLRRSKLTSLRTHLYFLEYEYEASSDGNDVTLVAQLSMDRLQMVEINPNNVD 468
           E      +++ R+ LT+LR     L+   E S+   D+ L     +DR+  +     N  
Sbjct: 386 ELAAGIAHEI-RNPLTALRGFTQLLQTHPEDSNKYTDIMLK---ELDRIHFIA----NEL 437

Query: 469 EYEASSDG-----NDVTLVAQLSMDRLQMVEMLFK-----HWEGPISLTLYMSDAEAQQF 518
              A   G     ND+  + Q  +  L+   +L       HWE  I   L       Q F
Sbjct: 438 LLLAKPQGYQLAPNDICSLLQDVITLLESQAILANVTISTHWEPDIPSLLCEPRQLKQVF 497

Query: 519 LSYAGNS-EALRSRRNIG-------YHVVYK---EGNFYPINTLRNVA 555
           ++   NS EA+ +   I        +HVV +   EG   P++TL  V 
Sbjct: 498 INIIKNSVEAMPAGGCIHIEVTKLEHHVVIRFSDEGCGIPLSTLEKVG 545


>gi|356528410|ref|XP_003532796.1| PREDICTED: UDP-glucose:glycoprotein glucosyltransferase-like [Glycine
            max]
          Length = 1676

 Score = 41.2 bits (95), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 38/143 (26%), Positives = 66/143 (46%), Gaps = 22/143 (15%)

Query: 199  LTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSD------WYL 252
            L L  + P +L K I +D D +   D+  L+ +  R +        +N  +      W  
Sbjct: 1455 LFLDVIFPLSLEKVIFVDADQVVRADMGVLYDMDIRGKPLAYTPFCDNNKEMDGYRFWRQ 1514

Query: 253  GKLWKNH---KPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLA-- 307
            G  WK+H   KP+        + + ++DL K R+ +     R+I E   L++   SLA  
Sbjct: 1515 G-FWKDHLRGKPYHI------SALYVVDLKKFRETAAGDNLRVIYET--LSKDPNSLANL 1565

Query: 308  DQDIFNAIISEHPYLVYTLPCQW 330
            DQD+ N   ++H   +++LP +W
Sbjct: 1566 DQDLPN--YAQHTVPIFSLPQEW 1586


>gi|159487158|ref|XP_001701602.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158271543|gb|EDO97360.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 377

 Score = 41.2 bits (95), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 17/51 (33%), Positives = 29/51 (56%), Gaps = 1/51 (1%)

Query: 615 ASHAPTNFSRWVNATTPYQIEWAPDFEPYIVAHRDLPRY-DTRFVGFGWNK 664
           + H PT+++ W  +  PY ++W   +EP++   R    + D RF G+G NK
Sbjct: 236 SGHRPTDYTAWFRSAEPYPVQWQERYEPWVFVDRLGSSWADARFRGYGKNK 286


>gi|20258800|gb|AAM13982.1| putative glycosyl transferase [Arabidopsis thaliana]
          Length = 313

 Score = 40.8 bits (94), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 31/93 (33%), Positives = 48/93 (51%), Gaps = 17/93 (18%)

Query: 201 LPKVLPETLAKTIVLDTDVIFATDIAQLW--ALFSRLRQRQTIGLVE----NQSDWYLGK 254
           L  +L   + + I LD+D++   DIA+LW  +L SR+     IG  E    N + ++ G 
Sbjct: 157 LADLLEPCVNRVIYLDSDLVVVDDIAKLWKTSLGSRI-----IGAPEYCHANFTKYFTGG 211

Query: 255 LWKNHKPWPALGRG-----FNTGVILLDLTKLR 282
            W   + +    RG     FNTGV+++DL K R
Sbjct: 212 FWSEER-FSGTFRGRKPCYFNTGVMVIDLKKWR 243


>gi|20089086|ref|NP_615161.1| hypothetical protein MA0188 [Methanosarcina acetivorans C2A]
 gi|19913948|gb|AAM03641.1| hypothetical protein (multi-domain) [Methanosarcina acetivorans
           C2A]
          Length = 914

 Score = 40.8 bits (94), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 42/128 (32%), Positives = 56/128 (43%), Gaps = 20/128 (15%)

Query: 93  YSGVYGLLKLTLPKVLPETLAK-------TIVLDTDVIFATDIAQ-----LWALFSRLRQ 140
           Y GV G+ K +L K LP  L K       T V+ T + FAT+  +     L  L S+LRQ
Sbjct: 43  YYGVAGIGKTSLRKELPALLEKHNESDLHTRVIWTSIDFATEQYKQPYKFLEVLSSQLRQ 102

Query: 141 RQTIGLVENQSDWYLGKLWKNHKPWPALGRGFYLADSVVEDV-----SWIPNKHYSGVYG 195
           +  +    N  D  L   WK   P   L +  Y   S+V D+      ++P      VY 
Sbjct: 103 KYNVKF--NSFDIALATYWKKINPHNPLVKENYSEGSIVHDILDVCDEFVPANLIPNVYN 160

Query: 196 LLKLTLPK 203
           L K  LPK
Sbjct: 161 LTK-NLPK 167


>gi|366165525|ref|ZP_09465280.1| putative glycosyl transferase (general stress protein) [Acetivibrio
           cellulolyticus CD2]
          Length = 482

 Score = 40.8 bits (94), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 45/180 (25%), Positives = 78/180 (43%), Gaps = 16/180 (8%)

Query: 190 YSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSD 249
           Y G     ++ + ++   ++ K I LD D+I   DIA+LW              V  ++D
Sbjct: 81  YFGQVTFFRIFVTELFDPSVEKIIFLDCDMIIKGDIAELWETDVSGYYAAAGEDVGIEND 140

Query: 250 WYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLT-RLWTSLAD 308
              G   K         + FN GV+++++T  R+ +  G        +LLT R      D
Sbjct: 141 GLFGTQHKRSLGIKRKSKYFNAGVMVINMTMWRNHNIPG----QTSDYLLTHRNEIKFPD 196

Query: 309 QDIFNAIISEHPYLVYTLPCQWN----VQLSDNTR---SDELCYTELTDLKIIHWNSPKK 361
           QD  NA++ +   L++    +WN    +QL    +    D+L    + +  IIH++ P K
Sbjct: 197 QDALNAVLCDKWKLLHP---KWNQVATLQLFYKKKWVIRDDLL-EAVHNPAIIHYSEPSK 252


>gi|431918394|gb|ELK17619.1| hypothetical protein PAL_GLEAN10007398 [Pteropus alecto]
          Length = 148

 Score = 40.8 bits (94), Expect = 2.2,   Method: Composition-based stats.
 Identities = 26/72 (36%), Positives = 35/72 (48%), Gaps = 9/72 (12%)

Query: 269 FNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTS------LADQDIFNAIISEHPYL 322
           FN+GV+LL L  +R    +  +  + E   + +L         L DQD F  I  EHP L
Sbjct: 43  FNSGVMLLKLEAMRQ---SPLYSRLLEPAQVQQLADKYHFRGHLGDQDFFTMIGMEHPEL 99

Query: 323 VYTLPCQWNVQL 334
            + L C WN QL
Sbjct: 100 FHVLDCTWNRQL 111


>gi|222623395|gb|EEE57527.1| hypothetical protein OsJ_07839 [Oryza sativa Japonica Group]
          Length = 1597

 Score = 40.8 bits (94), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 37/141 (26%), Positives = 66/141 (46%), Gaps = 18/141 (12%)

Query: 199  LTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSD------WYL 252
            L L  + P +L K I +D D I   D+ +L+ +  + R        +N  +      W  
Sbjct: 1381 LFLDVIFPLSLRKVIFVDADQIVRADMGELYDMNLKGRPLAYTPFCDNNKEMDGYRFWKQ 1440

Query: 253  GKLWKNHKPWPALGRGFN-TGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLA--DQ 309
            G  WK+H      GR ++ + + ++DL K R  +     R+  E   L++   SL+  DQ
Sbjct: 1441 G-FWKDH----LRGRPYHISALYVVDLAKFRQTASGDTLRVFYET--LSKDPNSLSNLDQ 1493

Query: 310  DIFNAIISEHPYLVYTLPCQW 330
            D+ N   ++H   +++LP +W
Sbjct: 1494 DLPN--YAQHTVPIFSLPQEW 1512


>gi|22331906|ref|NP_191825.2| putative galacturonosyltransferase-like 7 [Arabidopsis thaliana]
 gi|75161472|sp|Q8VYF4.1|GATL7_ARATH RecName: Full=Probable galacturonosyltransferase-like 7
 gi|18175835|gb|AAL59936.1| unknown protein [Arabidopsis thaliana]
 gi|20465549|gb|AAM20257.1| unknown protein [Arabidopsis thaliana]
 gi|23397213|gb|AAN31889.1| unknown protein [Arabidopsis thaliana]
 gi|332646856|gb|AEE80377.1| putative galacturonosyltransferase-like 7 [Arabidopsis thaliana]
          Length = 361

 Score = 40.8 bits (94), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 47/178 (26%), Positives = 78/178 (43%), Gaps = 32/178 (17%)

Query: 201 LPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVE----NQSDWYLGKLW 256
           L  +L   + + I LD+D+I   DIA+LW   ++L  + TIG  E    N + ++    W
Sbjct: 171 LADLLEPCVRRVIYLDSDLIVVDDIAKLW--MTKLGSK-TIGAPEYCHANFTKYFTPAFW 227

Query: 257 KNHKPWPAL-GRG---FNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIF 312
            + +   A  GR    FNTGV+++DL + R +   G+  +I +       W  +   D  
Sbjct: 228 SDERFSGAFSGRKPCYFNTGVMVMDLERWRRV---GYTEVIEK-------WMEIQKSDRI 277

Query: 313 NAIISEHPYL------VYTLPCQWNVQ--LSDNTRSDELCYT-ELTDLKIIHWNSPKK 361
             + S  P+L      V  +  +WN      DN R    C       + ++HW+   K
Sbjct: 278 YELGSLPPFLLVFAGEVAPIEHRWNQHGLGGDNVRGS--CRDLHPGPVSLLHWSGSGK 333


>gi|297743944|emb|CBI36914.3| unnamed protein product [Vitis vinifera]
          Length = 364

 Score = 40.8 bits (94), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 32/90 (35%), Positives = 47/90 (52%), Gaps = 11/90 (12%)

Query: 201 LPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVE----NQSDWYLGKLW 256
           L  +L   + + I LD+D+I   DI +LW+  + L  R TIG  E    N + ++  K W
Sbjct: 151 LADLLEPCVRRVIYLDSDLIVVDDIYKLWS--TSLGTR-TIGAPEYCHANFTRYFTDKFW 207

Query: 257 KNHKPWPAL-GRG---FNTGVILLDLTKLR 282
              + +    GR    FNTGVI++DL K R
Sbjct: 208 SEKRYYGTFDGRKPCYFNTGVIVIDLAKWR 237


>gi|218191315|gb|EEC73742.1| hypothetical protein OsI_08377 [Oryza sativa Indica Group]
          Length = 1673

 Score = 40.8 bits (94), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 37/141 (26%), Positives = 66/141 (46%), Gaps = 18/141 (12%)

Query: 199  LTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSD------WYL 252
            L L  + P +L K I +D D I   D+ +L+ +  + R        +N  +      W  
Sbjct: 1457 LFLDVIFPLSLRKVIFVDADQIVRADMGELYDMNLKGRPLAYTPFCDNNKEMDGYRFWKQ 1516

Query: 253  GKLWKNHKPWPALGRGFN-TGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLA--DQ 309
            G  WK+H      GR ++ + + ++DL K R  +     R+  E   L++   SL+  DQ
Sbjct: 1517 G-FWKDH----LRGRPYHISALYVVDLAKFRQTASGDTLRVFYET--LSKDPNSLSNLDQ 1569

Query: 310  DIFNAIISEHPYLVYTLPCQW 330
            D+ N   ++H   +++LP +W
Sbjct: 1570 DLPN--YAQHTVPIFSLPQEW 1588


>gi|50251355|dbj|BAD28382.1| putative UDP-glucose:glycoprotein glucosyltransferase [Oryza sativa
            Japonica Group]
          Length = 1626

 Score = 40.8 bits (94), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 37/141 (26%), Positives = 66/141 (46%), Gaps = 18/141 (12%)

Query: 199  LTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSD------WYL 252
            L L  + P +L K I +D D I   D+ +L+ +  + R        +N  +      W  
Sbjct: 1410 LFLDVIFPLSLRKVIFVDADQIVRADMGELYDMNLKGRPLAYTPFCDNNKEMDGYRFWKQ 1469

Query: 253  GKLWKNHKPWPALGRGFN-TGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLA--DQ 309
            G  WK+H      GR ++ + + ++DL K R  +     R+  E   L++   SL+  DQ
Sbjct: 1470 G-FWKDH----LRGRPYHISALYVVDLAKFRQTASGDTLRVFYET--LSKDPNSLSNLDQ 1522

Query: 310  DIFNAIISEHPYLVYTLPCQW 330
            D+ N   ++H   +++LP +W
Sbjct: 1523 DLPN--YAQHTVPIFSLPQEW 1541


>gi|359405620|ref|ZP_09198369.1| glycosyltransferase, family 8 [Prevotella stercorea DSM 18206]
 gi|357557990|gb|EHJ39504.1| glycosyltransferase, family 8 [Prevotella stercorea DSM 18206]
          Length = 308

 Score = 40.8 bits (94), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 51/203 (25%), Positives = 84/203 (41%), Gaps = 45/203 (22%)

Query: 197 LKLTLPKVLPETLAKTIVLDTDVIFATDIAQL--WALFSRLRQRQTIGLVENQSDWYLGK 254
            KL +P++LP T  + + LD D+I   DI  L  W L     Q + I   E+  D    K
Sbjct: 85  FKLLIPQLLPATEKRCLFLDVDMIVYDDITSLYHWNL-----QDKVIAAAEDMPDCITFK 139

Query: 255 LWKNHKPWPALGR-----GFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLA-- 307
                   P LG        N+GV++ DL K         WR++ ++  +    T++   
Sbjct: 140 --------PRLGLKDTDFCINSGVMVCDLEK---------WRLMEQEQPIMAYATNIISI 182

Query: 308 ---DQDIFNAIISEHPYLVYTLPCQWNVQLSDNTRSDELCYTELTDLK-------IIHWN 357
              +QD+      +    +  LP +WN+      R  ++    L +L+       IIH+ 
Sbjct: 183 IVNEQDVLACYFKDK---IALLPIRWNMTTFYFNRKPKIFSKYLPELEAAKRFPGIIHFA 239

Query: 358 SPKKLKVKNKHMEFFRNLYLTFL 380
           +P K   K+     +R LY  +L
Sbjct: 240 APIKPWFKDCQHP-YRKLYKKYL 261


>gi|15235247|ref|NP_192122.1| putative galacturonosyltransferase-like 6 [Arabidopsis thaliana]
 gi|30679061|ref|NP_849285.1| putative galacturonosyltransferase-like 6 [Arabidopsis thaliana]
 gi|79324977|ref|NP_001031573.1| putative galacturonosyltransferase-like 6 [Arabidopsis thaliana]
 gi|75317781|sp|O04253.1|GATL6_ARATH RecName: Full=Probable galacturonosyltransferase-like 6; AltName:
           Full=Like glycosyl transferase 10
 gi|2104536|gb|AAC78704.1| predicted glycosyl transferase [Arabidopsis thaliana]
 gi|7268597|emb|CAB80706.1| predicted glycosyl transferase [Arabidopsis thaliana]
 gi|24030376|gb|AAN41350.1| putative glycosyl transferase [Arabidopsis thaliana]
 gi|26453088|dbj|BAC43620.1| putative glycosyl transferase [Arabidopsis thaliana]
 gi|110738563|dbj|BAF01207.1| glycosyl transferase like protein [Arabidopsis thaliana]
 gi|332656727|gb|AEE82127.1| putative galacturonosyltransferase-like 6 [Arabidopsis thaliana]
 gi|332656728|gb|AEE82128.1| putative galacturonosyltransferase-like 6 [Arabidopsis thaliana]
 gi|332656729|gb|AEE82129.1| putative galacturonosyltransferase-like 6 [Arabidopsis thaliana]
          Length = 346

 Score = 40.8 bits (94), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 31/93 (33%), Positives = 48/93 (51%), Gaps = 17/93 (18%)

Query: 201 LPKVLPETLAKTIVLDTDVIFATDIAQLW--ALFSRLRQRQTIGLVE----NQSDWYLGK 254
           L  +L   + + I LD+D++   DIA+LW  +L SR+     IG  E    N + ++ G 
Sbjct: 157 LADLLEPCVNRVIYLDSDLVVVDDIAKLWKTSLGSRI-----IGAPEYCHANFTKYFTGG 211

Query: 255 LWKNHKPWPALGRG-----FNTGVILLDLTKLR 282
            W   + +    RG     FNTGV+++DL K R
Sbjct: 212 FWSEER-FSGTFRGRKPCYFNTGVMVIDLKKWR 243


>gi|255566847|ref|XP_002524407.1| transferase, transferring glycosyl groups, putative [Ricinus
           communis]
 gi|223536368|gb|EEF38018.1| transferase, transferring glycosyl groups, putative [Ricinus
           communis]
          Length = 344

 Score = 40.4 bits (93), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 30/117 (25%), Positives = 59/117 (50%), Gaps = 16/117 (13%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVE----NQSDWYLG 253
           +  L  +LP ++ + I  D+D+I   D+A+LW +   +     +G  E    N ++++  
Sbjct: 147 RFYLADLLPASVPRIIYFDSDLIVVDDVAKLWNINLGV---HVLGAPEYCHANFTNYFNS 203

Query: 254 KLWKNHKPWPALGRG-----FNTGVILLDLTKLRDISWA---GFWRIIAEKFLLTRL 302
           + W N + + A  R      FNTGV+++DL K R+  +     +W  + +K+ +  L
Sbjct: 204 RFWSN-EGYAASFRERRACYFNTGVMVIDLMKWREGKYREKLEYWMKVQKKYRIYEL 259


>gi|428175077|gb|EKX43969.1| hypothetical protein GUITHDRAFT_140108 [Guillardia theta CCMP2712]
          Length = 919

 Score = 40.4 bits (93), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 40/175 (22%), Positives = 64/175 (36%), Gaps = 60/175 (34%)

Query: 547 PINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFET 606
           P N LR +A +   +  V   D+D L    L  +++ + R     G G  ++LV+P+F +
Sbjct: 544 PTNMLRKIACSIAKSNMVLNADVDMLASDELADHIRKA-RGDGWMGRG--ELLVIPSFRS 600

Query: 607 -----QRYRTAF--------PAS------------------------------------- 616
                +  +T F        PA+                                     
Sbjct: 601 AGSWPRAVKTHFDSHQLTVKPAAITMEDLKAALLRCELLLPLLGCGMWSPSTRWSENAVF 660

Query: 617 HAPTNFSRWVNATTPYQIEWAPDFEPYIVAHR-------DLPRYDTRFVGFGWNK 664
           HAPT + RW+ A   Y++++   +EP  V  R           YD RF   GW+K
Sbjct: 661 HAPTEYGRWMEAEEVYEVDYLLGYEPSFVVDRRAWVGGQGAGMYDDRFEQGGWDK 715


>gi|21554300|gb|AAM63375.1| putative glycosyl transferase [Arabidopsis thaliana]
          Length = 346

 Score = 40.4 bits (93), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 31/93 (33%), Positives = 48/93 (51%), Gaps = 17/93 (18%)

Query: 201 LPKVLPETLAKTIVLDTDVIFATDIAQLW--ALFSRLRQRQTIGLVE----NQSDWYLGK 254
           L  +L   + + I LD+D++   DIA+LW  +L SR+     IG  E    N + ++ G 
Sbjct: 157 LADLLEPCVNRVIYLDSDLVVVDDIAKLWKTSLGSRI-----IGAPEYCYANFTKYFTGG 211

Query: 255 LWKNHKPWPALGRG-----FNTGVILLDLTKLR 282
            W   + +    RG     FNTGV+++DL K R
Sbjct: 212 FWSEER-FSGTFRGRKPCYFNTGVMVIDLKKWR 243


>gi|424665842|ref|ZP_18102878.1| hypothetical protein HMPREF1205_01717 [Bacteroides fragilis HMW
           616]
 gi|404574095|gb|EKA78846.1| hypothetical protein HMPREF1205_01717 [Bacteroides fragilis HMW
           616]
          Length = 304

 Score = 40.4 bits (93), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 45/195 (23%), Positives = 81/195 (41%), Gaps = 29/195 (14%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVE----NQSDWYLG 253
           +     +LPE + K + LD D++   DI++ W   +      ++  VE    N+ + Y  
Sbjct: 89  RCMFSAILPEEVDKVLYLDCDIVILGDISEYW---NTDMSNYSVACVEDIGSNEDERY-- 143

Query: 254 KLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFN 313
            + K  K +      FN GV+L++L   R+           + FL         DQD+ N
Sbjct: 144 DILKYDKSFSY----FNAGVLLINLRYWRE---HKIDEQCEQYFLRYPERIRFNDQDLLN 196

Query: 314 AIISEHPYLVYTLPCQWNVQ-------LSDNTRSDELCYTELTDLKIIHWNSPKKLKVKN 366
           A++ E    V   P +WN+Q            +  +  + +L    I+H+ + K     +
Sbjct: 197 AVLHESKLFV---PLKWNMQDGFYRYGADKGVKDKKQLHQQLLHPVILHYTNKKPWNYDS 253

Query: 367 KH---MEFFRNLYLT 378
            H    E+F+ L +T
Sbjct: 254 MHPLKGEYFKYLDMT 268


>gi|356511033|ref|XP_003524236.1| PREDICTED: UDP-glucose:glycoprotein glucosyltransferase-like [Glycine
            max]
          Length = 1647

 Score = 40.4 bits (93), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 38/143 (26%), Positives = 65/143 (45%), Gaps = 22/143 (15%)

Query: 199  LTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSD------WYL 252
            L L  + P +L K I +D D +   D+  L+ +  R +        +N  +      W  
Sbjct: 1426 LFLDVIFPLSLEKVIFVDADQVVRADMGVLYDMDIRGKPLAYTPFCDNNKEMDGYRFWRQ 1485

Query: 253  GKLWKNH---KPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLA-- 307
            G  W +H   KP+        + + ++DL K R+ +     R+I E   L+R   SLA  
Sbjct: 1486 G-FWNDHLQGKPYHI------SALYVVDLKKFRETAAGDNLRVIYET--LSRDPNSLANL 1536

Query: 308  DQDIFNAIISEHPYLVYTLPCQW 330
            DQD+ N   ++H   +++LP +W
Sbjct: 1537 DQDLPN--YAQHTVPIFSLPQEW 1557


>gi|242041219|ref|XP_002468004.1| hypothetical protein SORBIDRAFT_01g037870 [Sorghum bicolor]
 gi|241921858|gb|EER95002.1| hypothetical protein SORBIDRAFT_01g037870 [Sorghum bicolor]
          Length = 371

 Score = 40.4 bits (93), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 29/94 (30%), Positives = 50/94 (53%), Gaps = 11/94 (11%)

Query: 201 LPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVE----NQSDWYLGKLW 256
           L  +L   + + I LD+D++   D+A+LW   + L  R T+G  E    N + ++ G+ W
Sbjct: 175 LADLLEPCVRRVIYLDSDLVLVDDVAKLWR--TDLGGR-TVGAPEYCHANFTKYFTGRFW 231

Query: 257 KNHK-PWPALGRG---FNTGVILLDLTKLRDISW 286
            + +     +GR    FNTGV++LDL + R   +
Sbjct: 232 SDQRFAGTFVGRRPCYFNTGVMVLDLERWRQAGY 265


>gi|423281223|ref|ZP_17260134.1| hypothetical protein HMPREF1203_04351 [Bacteroides fragilis HMW
           610]
 gi|404583387|gb|EKA88068.1| hypothetical protein HMPREF1203_04351 [Bacteroides fragilis HMW
           610]
          Length = 305

 Score = 40.4 bits (93), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 45/195 (23%), Positives = 81/195 (41%), Gaps = 29/195 (14%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVE----NQSDWYLG 253
           +     +LPE + K + LD D++   DI++ W   +      ++  VE    N+ + Y  
Sbjct: 89  RCMFSAILPEEVDKVLYLDCDIVILGDISEYW---NTDMSNYSVACVEDIGSNEDERY-- 143

Query: 254 KLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFN 313
            + K  K +      FN GV+L++L   R+           + FL         DQD+ N
Sbjct: 144 DILKYDKSFSY----FNAGVLLINLRYWRE---HKIDEQCEQYFLRYPERIRFNDQDLLN 196

Query: 314 AIISEHPYLVYTLPCQWNVQ-------LSDNTRSDELCYTELTDLKIIHWNSPKKLKVKN 366
           A++ E    V   P +WN+Q            +  +  + +L    I+H+ + K     +
Sbjct: 197 AVLHESKLFV---PLKWNMQDGFYRYGADKGVKDKKQLHQQLLHPVILHYTNKKPWNYDS 253

Query: 367 KH---MEFFRNLYLT 378
            H    E+F+ L +T
Sbjct: 254 MHPLKGEYFKYLDMT 268


>gi|219363085|ref|NP_001137113.1| uncharacterized protein LOC100217291 precursor [Zea mays]
 gi|194698410|gb|ACF83289.1| unknown [Zea mays]
 gi|413952525|gb|AFW85174.1| hypothetical protein ZEAMMB73_350653 [Zea mays]
          Length = 372

 Score = 40.4 bits (93), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 46/183 (25%), Positives = 73/183 (39%), Gaps = 38/183 (20%)

Query: 201 LPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVE----NQSDWYLGKLW 256
           L  +LP  + + I LD+DV+ A D+ +LW   +RL     +   E    N S ++    W
Sbjct: 159 LADLLPRCVPRAIYLDSDVLAADDVRRLWE--TRLPAAAVVAAPEYCHANFSRYFTPAFW 216

Query: 257 KNHKPWPALGRG-----------FNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTS 305
            +    P LG             FNTGV+++DL + R    AG +R   E+      W  
Sbjct: 217 SD----PVLGARVFAGRRRPPCYFNTGVMVIDLRRWR----AGNYRQRIER------WME 262

Query: 306 LADQDIFNAIISEHPYL------VYTLPCQWNVQ-LSDNTRSDELCYTELTDLKIIHWNS 358
           +  Q     + S  P+L      V  +  +WN   L  N             + ++HW+ 
Sbjct: 263 IQKQKRIYELGSLPPFLLVFAGEVEAVDHRWNQHGLGGNNVHGSCRPLHAGPVSLMHWSG 322

Query: 359 PKK 361
             K
Sbjct: 323 KGK 325


>gi|298480796|ref|ZP_06998991.1| lipopolysaccharide 1,2-glucosyltransferase/general stress protein
           [Bacteroides sp. D22]
 gi|298272819|gb|EFI14385.1| lipopolysaccharide 1,2-glucosyltransferase/general stress protein
           [Bacteroides sp. D22]
          Length = 312

 Score = 40.4 bits (93), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 45/163 (27%), Positives = 73/163 (44%), Gaps = 15/163 (9%)

Query: 172 FYLADSVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
           FY  DS + +   I  +    +Y   +L L  +L +++ + + LD+D+I    I  LW  
Sbjct: 62  FYEVDSNILNNLPISTRFRKSIY--YRLLLDNILDKSIERILYLDSDIIVRDSIEALW-- 117

Query: 232 FSRLRQRQTIGLVENQSDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWR 291
            +       +G V +QS   +     N    P L   FN+GV+L+++ K R  +     R
Sbjct: 118 -NENIDEYVLGAVLDQSCDDIRNF--NRTKLPYLSDYFNSGVLLININKWRAFNIGK--R 172

Query: 292 IIAEKFLLTRLWTSL-ADQDIFNAIISEHPYLVYTLPCQWNVQ 333
            I  K++     + L  DQD  N I S    +   LP  +NVQ
Sbjct: 173 CI--KYISENPESCLYPDQDALNVITSNSHKI---LPLCFNVQ 210


>gi|443710200|gb|ELU04498.1| hypothetical protein CAPTEDRAFT_192237 [Capitella teleta]
          Length = 221

 Score = 40.4 bits (93), Expect = 3.2,   Method: Compositional matrix adjust.
 Identities = 22/66 (33%), Positives = 37/66 (56%), Gaps = 3/66 (4%)

Query: 478 DVTLVAQLSMDRLQMVEMLFKHWEGPISLTLYMSDAEAQQFLSYAGNSEALRSRRNIGYH 537
           DVTL+  L+ DR++ +  L ++W GPIS  +Y+ +      +    N    +SR NI  H
Sbjct: 138 DVTLMVHLTSDRIEKLIRLTENWSGPISAAIYVDNITDSLMIE---NMCHFQSRPNIYIH 194

Query: 538 VVYKEG 543
           +++K G
Sbjct: 195 ILHKVG 200


>gi|300176891|emb|CBK25460.2| unnamed protein product [Blastocystis hominis]
          Length = 234

 Score = 40.4 bits (93), Expect = 3.2,   Method: Compositional matrix adjust.
 Identities = 24/72 (33%), Positives = 36/72 (50%), Gaps = 8/72 (11%)

Query: 543 GNFYPINTLRNVALNQVSTPYVFLLDIDFLPM-------FGLYPYLKSSIRSMDMH-GHG 594
           GN + IN LRN+AL++ ST +VF+ D D LP        F     L+ +  S+  H    
Sbjct: 147 GNSFRINALRNLALSKASTTHVFIADADILPACTQNESNFDFTDNLRDTFLSLPFHLLQD 206

Query: 595 GKKVLVVPAFET 606
             +  +VP F+ 
Sbjct: 207 SAQAFIVPLFDV 218


>gi|297850896|ref|XP_002893329.1| hypothetical protein ARALYDRAFT_472678 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297339171|gb|EFH69588.1| hypothetical protein ARALYDRAFT_472678 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 389

 Score = 40.4 bits (93), Expect = 3.3,   Method: Compositional matrix adjust.
 Identities = 29/92 (31%), Positives = 46/92 (50%), Gaps = 12/92 (13%)

Query: 201 LPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVE----NQSDWYLGKLW 256
           L  +L  ++ + I LD+DVI   DI +LW   + L   + IG  E    N + ++    W
Sbjct: 175 LGDILDRSVERVIYLDSDVITVDDITKLWN--TVLTGSRVIGAPEYCHANFTQYFTSGFW 232

Query: 257 KNHKPWPALGRG-----FNTGVILLDLTKLRD 283
            +    P L  G     FNTGV+++DL + R+
Sbjct: 233 SD-PALPGLISGQKPCYFNTGVMVMDLVRWRE 263


>gi|313147947|ref|ZP_07810140.1| glycosyl transferase [Bacteroides fragilis 3_1_12]
 gi|313136714|gb|EFR54074.1| glycosyl transferase [Bacteroides fragilis 3_1_12]
          Length = 311

 Score = 40.4 bits (93), Expect = 3.3,   Method: Compositional matrix adjust.
 Identities = 45/195 (23%), Positives = 81/195 (41%), Gaps = 29/195 (14%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVE----NQSDWYLG 253
           +     +LPE + K + LD D++   DI++ W   +      ++  VE    N+ + Y  
Sbjct: 96  RCMFSAILPEEVDKVLYLDCDIVILGDISEYW---NTDMSNYSVACVEDIGSNEDERY-- 150

Query: 254 KLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFN 313
            + K  K +      FN GV+L++L   R+           + FL         DQD+ N
Sbjct: 151 DILKYDKSFSY----FNAGVLLINLRYWRE---HKIDEQCEQYFLRYPERIRFNDQDLLN 203

Query: 314 AIISEHPYLVYTLPCQWNVQ-------LSDNTRSDELCYTELTDLKIIHWNSPKKLKVKN 366
           A++ E    V   P +WN+Q            +  +  + +L    I+H+ + K     +
Sbjct: 204 AVLHESKLFV---PLKWNMQDGFYRYGADKGVKDKKQLHQQLLHPVILHYTNKKPWNYDS 260

Query: 367 KH---MEFFRNLYLT 378
            H    E+F+ L +T
Sbjct: 261 MHPLKGEYFKYLDMT 275


>gi|326803152|ref|YP_004320970.1| glycosyltransferase family protein [Aerococcus urinae
           ACS-120-V-Col10a]
 gi|326650610|gb|AEA00793.1| glycosyltransferase, family 8 [Aerococcus urinae ACS-120-V-Col10a]
          Length = 274

 Score = 40.4 bits (93), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 32/132 (24%), Positives = 59/132 (44%), Gaps = 22/132 (16%)

Query: 205 LPETLAKTIVLDTDVIFATDIAQLWA------LFSRLRQRQTIGLVENQSDWYLGKLWKN 258
           LP+ L K + LD D++   D++ L+       L+      +   + +  +   LG    +
Sbjct: 91  LPDQLEKILYLDADILCINDVSPLYETELGNNLYGAASHAKLTKITDQINKLRLGNSQAD 150

Query: 259 HKPWPALGRGFNTGVILLDLTKLRD-ISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIIS 317
           H         +N+GV+L++L  +R  +     +  I E     R    L DQD+ NA+ S
Sbjct: 151 HY--------YNSGVLLMNLKAIRQKVKAEDIFDFIHE----NRYQLILPDQDVLNALYS 198

Query: 318 EHPYLVYTLPCQ 329
           ++   +Y +P Q
Sbjct: 199 QY---IYDVPDQ 207


>gi|357464109|ref|XP_003602336.1| Glycosyltransferase CAZy family GT8 [Medicago truncatula]
 gi|355491384|gb|AES72587.1| Glycosyltransferase CAZy family GT8 [Medicago truncatula]
          Length = 343

 Score = 40.0 bits (92), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 27/95 (28%), Positives = 49/95 (51%), Gaps = 13/95 (13%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVE----NQSDWYLG 253
           ++ L  ++P T  + I  D+D+I   D+A+LW++         +G  E    N + ++  
Sbjct: 144 RMYLADLVPATAQRIIYFDSDLIVVDDVAKLWSI---DLGNHVLGAPEYCHANFTTYFTH 200

Query: 254 KLWKNHKPWPALGRG-----FNTGVILLDLTKLRD 283
           + W N   + A  +G     FNTGV+++DL K R+
Sbjct: 201 RFWSN-PSYSASFKGREACYFNTGVMVIDLWKWRE 234


>gi|308807441|ref|XP_003081031.1| Predicted glycosyltransferase (ISS) [Ostreococcus tauri]
 gi|116059493|emb|CAL55200.1| Predicted glycosyltransferase (ISS) [Ostreococcus tauri]
          Length = 476

 Score = 40.0 bits (92), Expect = 3.7,   Method: Compositional matrix adjust.
 Identities = 22/59 (37%), Positives = 36/59 (61%), Gaps = 8/59 (13%)

Query: 613 FPASHAPTNFSRWVNATT--PYQIEWAPD---FEPYIVAHRD--LPRYDTRFVGFGWNK 664
           FP +HAP+N+++W +  T  PY++        +EPY++   D  LP +D  FV +G+NK
Sbjct: 260 FPRAHAPSNYNQWFSNVTCEPYRVAAHKHPWYYEPYVILRADMALP-FDESFVTYGFNK 317


>gi|291222851|ref|XP_002731428.1| PREDICTED: glycosyltransferase 8 domain-containing protein 4-like
           [Saccoglossus kowalevskii]
          Length = 238

 Score = 40.0 bits (92), Expect = 3.7,   Method: Compositional matrix adjust.
 Identities = 31/131 (23%), Positives = 63/131 (48%), Gaps = 14/131 (10%)

Query: 15  IQVAIVCAGYNSTRSLVTLIKSILFYRKNPLHFHLITDTVALNILQTLFSTW-SVPQVEV 73
           I +++V  G  +  +L+ L  ++LF R  PLHFH+  +    +I +     W S  + ++
Sbjct: 66  IHLSVVACGDRANETLIMLKSAVLFTRA-PLHFHIFAEDELHSIFRNRLELWPSKYRAKI 124

Query: 74  SFYLADSVVEDVSWIPNKHYSGVYGLLK------LTLPKVLPETLAKTIVLDTDVIFATD 127
            +      +  +++ P ++      L K      L LP VL + +   + +DTDV+F   
Sbjct: 125 KY-----DIYSITFPPGENAMEWKKLFKPCASQRLFLPDVLTD-VDSILYVDTDVLFLFP 178

Query: 128 IAQLWALFSRL 138
           + ++W  F++ 
Sbjct: 179 LDEIWEFFNKF 189


>gi|189465050|ref|ZP_03013835.1| hypothetical protein BACINT_01394 [Bacteroides intestinalis DSM
           17393]
 gi|189437324|gb|EDV06309.1| general stress protein A [Bacteroides intestinalis DSM 17393]
          Length = 301

 Score = 40.0 bits (92), Expect = 4.0,   Method: Compositional matrix adjust.
 Identities = 41/167 (24%), Positives = 75/167 (44%), Gaps = 23/167 (13%)

Query: 172 FYLAD-SVVEDVSWIPNKHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWA 230
           FY  D S+++D       H S +   L+L + +++P  + K + LD D+I    I +LW 
Sbjct: 63  FYDVDYSLIKDFPIRKQDHLS-LAAYLRLFMSELIPSNINKILYLDCDLIVVDSIKELW- 120

Query: 231 LFSRLRQRQTIGLVENQSDWYLGKLWKNHKP----WPALGRGFNTGVILLDLTKLRDISW 286
              +      +  VE +S       +    P    +P     FN+GV+L++L K R+  +
Sbjct: 121 --EKNIDNIAVAAVEERSP------FDTESPVTLKYPVEYSYFNSGVMLINLQKWREKKF 172

Query: 287 AGFWR-IIAEKFLLTRLWTSLADQDIFNAIISEHPYLVYTLPCQWNV 332
               +  IA  +   +    L DQD+ NA++ +    +     +WN+
Sbjct: 173 VEACKSYIASNYENIK----LHDQDVLNALLYKEKQFI---SIRWNL 212


>gi|449469050|ref|XP_004152234.1| PREDICTED: probable galacturonosyltransferase-like 7-like [Cucumis
           sativus]
 gi|449515897|ref|XP_004164984.1| PREDICTED: probable galacturonosyltransferase-like 7-like [Cucumis
           sativus]
          Length = 367

 Score = 40.0 bits (92), Expect = 4.0,   Method: Compositional matrix adjust.
 Identities = 31/91 (34%), Positives = 48/91 (52%), Gaps = 11/91 (12%)

Query: 201 LPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVE----NQSDWYLGKLW 256
           L  +L   + K I LD+D+I   DI +LW   + L +  TIG  E    N S ++  + W
Sbjct: 178 LAGLLESCVKKVIYLDSDLIVVDDIRKLWT--TNLGE-WTIGAPEYCHANFSKYFTTRFW 234

Query: 257 KNHKPWPAL-GRG---FNTGVILLDLTKLRD 283
            + + +    GR    FNTGV+++DL K R+
Sbjct: 235 SDERFFGTFAGRKPCYFNTGVMVIDLVKWRN 265


>gi|307108997|gb|EFN57236.1| expressed protein, partial [Chlorella variabilis]
          Length = 429

 Score = 40.0 bits (92), Expect = 4.1,   Method: Compositional matrix adjust.
 Identities = 19/46 (41%), Positives = 27/46 (58%), Gaps = 1/46 (2%)

Query: 620 TNFSRWVNATTPYQIEWAPDFEPYIVAHRD-LPRYDTRFVGFGWNK 664
           T+ SRW   +  Y+I +   FEPY++  R  +P YD RF G+  NK
Sbjct: 340 TDHSRWFETSISYRIPYEEGFEPYVLVQRRFVPWYDERFKGYRKNK 385


>gi|52076753|dbj|BAD45664.1| putative Avr9/Cf-9 rapidly elicited protein 231 [Oryza sativa
           Japonica Group]
 gi|125554743|gb|EAZ00349.1| hypothetical protein OsI_22365 [Oryza sativa Indica Group]
          Length = 371

 Score = 40.0 bits (92), Expect = 4.3,   Method: Compositional matrix adjust.
 Identities = 33/113 (29%), Positives = 53/113 (46%), Gaps = 25/113 (22%)

Query: 201 LPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVE----NQSDWYLGKLW 256
           L  +LP  + + I LD+DV+   D+ +LW   +RL     +   E    N S ++    W
Sbjct: 150 LADLLPRCVPRAIYLDSDVLAVDDVRRLWE--TRLPAAAVVAAPEYCHANFSRYFTPAFW 207

Query: 257 KNHKPWPALGRG-----------FNTGVILLDLTKLRDISWAGFWRIIAEKFL 298
            +    P LGR            FNTGV+++DL + R    AG +R   E+++
Sbjct: 208 SD----PGLGRRVFAGRRRPPCYFNTGVMVIDLRRWR----AGNYRHRIERWM 252


>gi|226324415|ref|ZP_03799933.1| hypothetical protein COPCOM_02196 [Coprococcus comes ATCC 27758]
 gi|225206863|gb|EEG89217.1| glycosyltransferase, family 8 [Coprococcus comes ATCC 27758]
          Length = 348

 Score = 40.0 bits (92), Expect = 4.5,   Method: Compositional matrix adjust.
 Identities = 40/150 (26%), Positives = 65/150 (43%), Gaps = 26/150 (17%)

Query: 197 LKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWA------LFSRLRQRQTIGL---VENQ 247
            +L +P++L +   K + +D+D++   DIA+L+A      L +  +   T GL    E  
Sbjct: 99  FRLLMPQILAD-YDKAVYIDSDLVVNADIAELYATDVDGYLLAAAKDADTAGLYNGFEPN 157

Query: 248 SDWYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAE--KFLLTRLWTS 305
              Y+  + K  KP+      F  GVI+ +L + R           AE  KF  +  W  
Sbjct: 158 KKKYMDTILKIKKPY----EYFQAGVIVFNLAEFRKTY------TTAEMLKFAASYEW-E 206

Query: 306 LADQDIFNAIISEHPYLVYTLPCQWNVQLS 335
           L DQD+ N +       V  +   WNV + 
Sbjct: 207 LLDQDVLNYLAQGR---VKFVDMAWNVMVD 233


>gi|366165534|ref|ZP_09465289.1| putative glycosyl transferase (general stress protein) [Acetivibrio
           cellulolyticus CD2]
          Length = 491

 Score = 39.7 bits (91), Expect = 4.7,   Method: Compositional matrix adjust.
 Identities = 44/181 (24%), Positives = 77/181 (42%), Gaps = 18/181 (9%)

Query: 190 YSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSD 249
           Y G     ++ +  +   ++ K + LD D+I   DIA+LW           +  V  ++D
Sbjct: 81  YFGQATFFRIFVTDLFDPSVEKIVFLDCDMIIKGDIAELWKTDVSGYYMAAVEDVGLEND 140

Query: 250 WYLGKLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLW-TSLAD 308
              G   K         + FN GV+++++T  R+ +     R     +LLT      L D
Sbjct: 141 GLYGIQHKRSLGIKRRSKYFNAGVMVINMTLWRNHNIPDRTR----NYLLTHHNDVKLPD 196

Query: 309 QDIFNAIISEHPYLVYTLPCQWNVQLSDN--------TRSDELCYTELTDLKIIHWNSPK 360
           QD  NA++ ++  L++    +WN Q +           R D L    + +  IIH++ P 
Sbjct: 197 QDALNAVLCDNWKLLHP---KWNQQATLQLFYKKKWVIREDLL--EAVHNPAIIHYSEPS 251

Query: 361 K 361
           K
Sbjct: 252 K 252


>gi|195116431|ref|XP_002002758.1| GI11232 [Drosophila mojavensis]
 gi|193913333|gb|EDW12200.1| GI11232 [Drosophila mojavensis]
          Length = 458

 Score = 39.7 bits (91), Expect = 4.7,   Method: Compositional matrix adjust.
 Identities = 34/149 (22%), Positives = 54/149 (36%), Gaps = 38/149 (25%)

Query: 546 YPINTLRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRS----MDMHGHGGKKVLVV 601
           YPIN  RN+A   V+T ++F  DI+  P  GL       +      + +  +   +V  +
Sbjct: 233 YPINVGRNIARKAVNTHFIFACDIELFPSLGLIEQFLDMVHRNRSVLALSPNQRPRVYPL 292

Query: 602 PAFET-----------------QRYRTAF------PASHAPTNFSRWVNATTPYQIEWA- 637
           P FE                  Q+ R          + H    + RW+N TTP   E   
Sbjct: 293 PVFEIEENEQVPNDKLELLDLLQKRRAQLFHAKVCASCHKVPGYQRWINRTTPIPDELQL 352

Query: 638 ----------PDFEPYIVAHRDLPRYDTR 656
                       +EP+ ++    P +D R
Sbjct: 353 FSMTLRQDTFRHWEPFYISDNKEPVFDER 381


>gi|330996530|ref|ZP_08320412.1| glycosyltransferase, family 8 [Paraprevotella xylaniphila YIT
           11841]
 gi|329573086|gb|EGG54705.1| glycosyltransferase, family 8 [Paraprevotella xylaniphila YIT
           11841]
          Length = 637

 Score = 39.7 bits (91), Expect = 5.1,   Method: Compositional matrix adjust.
 Identities = 34/138 (24%), Positives = 59/138 (42%), Gaps = 11/138 (7%)

Query: 196 LLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKL 255
             +  L  +LP +++K + LD DV+  + +  LW           + L      + +  +
Sbjct: 87  FFRCLLSSILPLSVSKVLYLDCDVLVLSSLYGLWET-----DLTGVALAGVPDSFTVNPV 141

Query: 256 WKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAI 315
                 + +    FN GV+LL+L   R     G  R+ AE + +        DQD+ N++
Sbjct: 142 HCRRLHYASSYNYFNGGVLLLNLEYWRA---HGVERLCAEHYRMYPDRIVYNDQDLLNSL 198

Query: 316 ISEHPYLVYTLPCQWNVQ 333
           + E   L   L  +WNVQ
Sbjct: 199 LHERKRL---LDMKWNVQ 213


>gi|145350151|ref|XP_001419480.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144579712|gb|ABO97773.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 471

 Score = 39.7 bits (91), Expect = 5.2,   Method: Compositional matrix adjust.
 Identities = 22/61 (36%), Positives = 33/61 (54%), Gaps = 9/61 (14%)

Query: 613 FPASHAPTNFSRWVNATT-----PYQIEWAPD---FEPYIVAHRDLP-RYDTRFVGFGWN 663
           FP +HAPTN++ W   TT      Y++        +EPY++   DL   +D  FV +G+N
Sbjct: 253 FPIAHAPTNYTAWFENTTTGADSTYRVATPKHPWYYEPYVIVRADLALPFDESFVQYGFN 312

Query: 664 K 664
           K
Sbjct: 313 K 313


>gi|237747745|ref|ZP_04578225.1| lipopolysaccharide 3-alpha-galactosyltransferase [Oxalobacter
           formigenes OXCC13]
 gi|229379107|gb|EEO29198.1| lipopolysaccharide 3-alpha-galactosyltransferase [Oxalobacter
           formigenes OXCC13]
          Length = 307

 Score = 39.7 bits (91), Expect = 5.3,   Method: Compositional matrix adjust.
 Identities = 40/170 (23%), Positives = 80/170 (47%), Gaps = 18/170 (10%)

Query: 195 GLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVEN-QSDWYLG 253
             L+  +P++L     + + LD D+I   +I+ L+ L   + + + + +VE+  S+ YL 
Sbjct: 88  AFLRFFIPELLQGLTDRALYLDADIICINNISDLFHL--EMDENEILAVVEDIDSETYLN 145

Query: 254 KLWKNHKPWPALGRGFNTGVILLDLTKL-RDISWAGFWRIIAEKFLLTRLWTSLADQDIF 312
           +     K      R FN+GV+++D+ K  ++  +     ++ EK        +L DQD  
Sbjct: 146 ENASFQK------RYFNSGVLMMDIEKWNKNNVYGQLLSVLNEK----GSGFNLIDQDAL 195

Query: 313 NAIISEHPYLVYTLPCQWNVQL-SDNTRSDELCYTELTDLKIIHWNSPKK 361
           N ++ +    V+ L   WN  + ++     +  Y+   + K IH+  P K
Sbjct: 196 NLVMIDK---VHYLDNIWNYMINAEQLDKKKEKYSVPENAKFIHFVGPVK 242


>gi|222635310|gb|EEE65442.1| hypothetical protein OsJ_20804 [Oryza sativa Japonica Group]
          Length = 341

 Score = 39.7 bits (91), Expect = 5.6,   Method: Compositional matrix adjust.
 Identities = 33/113 (29%), Positives = 53/113 (46%), Gaps = 25/113 (22%)

Query: 201 LPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVE----NQSDWYLGKLW 256
           L  +LP  + + I LD+DV+   D+ +LW   +RL     +   E    N S ++    W
Sbjct: 114 LADLLPRCVPRAIYLDSDVLAVDDVRRLWE--TRLPAAAVVAAPEYCHANFSRYFTPAFW 171

Query: 257 KNHKPWPALGRG-----------FNTGVILLDLTKLRDISWAGFWRIIAEKFL 298
            +    P LGR            FNTGV+++DL + R    AG +R   E+++
Sbjct: 172 SD----PGLGRRVFAGRRRPPCYFNTGVMVIDLRRWR----AGNYRHRIERWM 216


>gi|254421706|ref|ZP_05035424.1| Glycosyl transferase family 8 [Synechococcus sp. PCC 7335]
 gi|196189195|gb|EDX84159.1| Glycosyl transferase family 8 [Synechococcus sp. PCC 7335]
          Length = 298

 Score = 39.7 bits (91), Expect = 5.9,   Method: Compositional matrix adjust.
 Identities = 40/167 (23%), Positives = 69/167 (41%), Gaps = 21/167 (12%)

Query: 207 ETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVEN--QSDWYLGKLWKNHKPWPA 264
           E L + I LDTD+I   DIA+L+A    L +    G + +     +Y     K  +  P 
Sbjct: 112 EDLERVIYLDTDLIVLGDIAELYAYTKALDEHCYFGSIPHFYPCIFYFSNFMKMREEIPK 171

Query: 265 LGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFNAIISEHPYL-- 322
             + FN GV   +L+   + ++      ++          +L D+ +FN +  +  YL  
Sbjct: 172 FKQTFNAGVWFTNLSFWNEKTYERLNYYLSLDAKSNYKLYTLGDEPVFNLMFKD--YLQA 229

Query: 323 --------VYTLPCQWNVQLSDNTRSDELCYTELTDLKIIHWNSPKK 361
                     T P   N+ L+   +        L++ K+IHW+ P K
Sbjct: 230 DKNWNRCGYGTHPAVTNLFLASGEKF-------LSEAKLIHWSGPFK 269


>gi|218673597|ref|ZP_03523266.1| putative glycosyltransferase protein [Rhizobium etli GR56]
          Length = 311

 Score = 39.3 bits (90), Expect = 6.1,   Method: Compositional matrix adjust.
 Identities = 39/123 (31%), Positives = 60/123 (48%), Gaps = 21/123 (17%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           ++ LP+ LP+T  + + LD D++  T + QLW   +       IG V    D++L    +
Sbjct: 88  RILLPQFLPQTCDRALYLDGDILVLTALEQLW---NTDLGDAVIGAVP---DYWLDSAAR 141

Query: 258 NHKPWPALG----RGFNTGVILLDLTKLRDISWAGFWRIIAEKFL--LTRLWTS-LADQD 310
           N  P    G    R FN G++L+DL K R+         I+E+ L  L R  T+  +DQD
Sbjct: 142 N-GPGATGGARVKRYFNAGILLIDLAKWRN-------ERISERSLDYLERFPTTEYSDQD 193

Query: 311 IFN 313
             N
Sbjct: 194 ALN 196


>gi|90657605|gb|ABD96904.1| hypothetical protein [Cleome spinosa]
          Length = 528

 Score = 39.3 bits (90), Expect = 6.2,   Method: Compositional matrix adjust.
 Identities = 29/94 (30%), Positives = 47/94 (50%), Gaps = 8/94 (8%)

Query: 141 RQTIGLVENQSDWYLGKLWKNHKPWPALGRGFYLADSVVEDVSWIPNKHYSGVYGLL--- 197
           +    +V ++ +    K+W   +P   LGRG ++   VVED  ++ N  Y  V   L   
Sbjct: 280 KHVFHVVTDRMNLAAMKVWFKMRP---LGRGAHIEIKVVEDFKFL-NSSYVPVLRQLESA 335

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL 231
           K  LP++ P+ L K + LD DV+   D+  LW +
Sbjct: 336 KFYLPEMYPK-LHKILFLDDDVVVQKDLTGLWKI 368


>gi|295398746|ref|ZP_06808760.1| family 8 glycosyl transferase [Aerococcus viridans ATCC 11563]
 gi|294973009|gb|EFG48822.1| family 8 glycosyl transferase [Aerococcus viridans ATCC 11563]
          Length = 283

 Score = 39.3 bits (90), Expect = 6.2,   Method: Compositional matrix adjust.
 Identities = 53/200 (26%), Positives = 83/200 (41%), Gaps = 38/200 (19%)

Query: 197 LKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWAL---FSRLRQRQTIGLVENQSDWYLG 253
            +L   ++LP+ + + I LD D++    + +LW        L     +GL +  +     
Sbjct: 93  FRLLAGEILPKEMKRVIYLDPDILVVNPLLELWQTDLEGHMLAAATHVGLTDVSTRVNQV 152

Query: 254 KLWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDIFN 313
           +L  +H         +N+GV+++DL K R+I     W  IA+      L   L DQDI N
Sbjct: 153 RLDVDHA-------YYNSGVMVIDLDKAREIV---KWSDIAQMIEKYNLLLFLPDQDILN 202

Query: 314 AIISEHPYLVYT--LPCQ-WNVQLSDNTRSDELCYTELTDLKIIHW-----------NSP 359
                H Y  YT  +P + WN     +TR     +T+      IHW             P
Sbjct: 203 -----HLYGKYTKEIPEEIWNY----DTRKYMRYFTKSLTQHDIHWVMANTSILHFCGGP 253

Query: 360 KKLKVKNKHMEFFRNLYLTF 379
           K     +KH   F +LYL +
Sbjct: 254 KPW--DDKHDNRFTSLYLNY 271


>gi|404328965|ref|ZP_10969413.1| hypothetical protein SvinD2_02667 [Sporolactobacillus vineae DSM
           21990 = SL153]
          Length = 255

 Score = 39.3 bits (90), Expect = 6.4,   Method: Compositional matrix adjust.
 Identities = 40/142 (28%), Positives = 66/142 (46%), Gaps = 19/142 (13%)

Query: 195 GLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGK 254
              ++ +P +L + + K + LD+D+I   D+  LW    +LR    IG VE+  D    +
Sbjct: 86  AFYRIVIPDLLDKKIKKAVYLDSDLIIKDDLLNLWN-NEQLRH-YFIGAVEDPVDVTGIR 143

Query: 255 LWKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLA--DQDIF 312
           L       P   + FN+GV+L++L K R    +   RI+     + R  + +   DQD  
Sbjct: 144 L-------PKKYKYFNSGVMLMNLEKWRKNKTSQ--RILQ---FIRRHSSKIMWWDQDAL 191

Query: 313 NAIISEHPYLVYTLPCQWNVQL 334
           NAI+       + L   WN Q+
Sbjct: 192 NAILYNK---WFELGYTWNFQV 210


>gi|115447761|ref|NP_001047660.1| Os02g0664200 [Oryza sativa Japonica Group]
 gi|113537191|dbj|BAF09574.1| Os02g0664200, partial [Oryza sativa Japonica Group]
          Length = 242

 Score = 39.3 bits (90), Expect = 6.5,   Method: Compositional matrix adjust.
 Identities = 35/139 (25%), Positives = 61/139 (43%), Gaps = 14/139 (10%)

Query: 199 LTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSD------WYL 252
           L L  + P +L K I +D D I   D+ +L+ +  + R        +N  +      W  
Sbjct: 26  LFLDVIFPLSLRKVIFVDADQIVRADMGELYDMNLKGRPLAYTPFCDNNKEMDGYRFWKQ 85

Query: 253 GKLWKNHKPWPALGRGFN-TGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLADQDI 311
           G  WK+H      GR ++ + + ++DL K R  +     R+  E         S  DQD+
Sbjct: 86  G-FWKDH----LRGRPYHISALYVVDLAKFRQTASGDTLRVFYETLSKDPNSLSNLDQDL 140

Query: 312 FNAIISEHPYLVYTLPCQW 330
            N   ++H   +++LP +W
Sbjct: 141 PN--YAQHTVPIFSLPQEW 157


>gi|198275401|ref|ZP_03207932.1| hypothetical protein BACPLE_01564 [Bacteroides plebeius DSM 17135]
 gi|198271737|gb|EDY96007.1| glycosyltransferase, family 8 [Bacteroides plebeius DSM 17135]
          Length = 308

 Score = 39.3 bits (90), Expect = 7.1,   Method: Compositional matrix adjust.
 Identities = 38/144 (26%), Positives = 68/144 (47%), Gaps = 26/144 (18%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVE----NQSDWYLG 253
           +  L ++LP  + + + LD D++   DI++ W   + L     +  VE    N+   Y  
Sbjct: 89  RCILSELLPADIDRLLYLDCDIVIVGDISEYWN--TPLDDETGVAAVEDMGCNEPARY-- 144

Query: 254 KLWKNHKPWPALGRGFNTGVILLDLTKLR--DISWA--GFWRIIAEKFLLTRLWTSLADQ 309
           ++ K    +P     FN GV+L++L   R  D++ A   ++    E+ L         DQ
Sbjct: 145 EILK----YPMEDSYFNAGVLLINLDYWRKNDVAHACVDYFHKYPERILFN-------DQ 193

Query: 310 DIFNAIISEHPYLVYTLPCQWNVQ 333
           D+ N+I+ ++  LV     +WNVQ
Sbjct: 194 DLLNSILHKNKILV---DLKWNVQ 214


>gi|424790536|ref|ZP_18217073.1| glycosyltransferase, family 8 [Enterococcus faecium V689]
 gi|424955172|ref|ZP_18370024.1| glycosyltransferase, family 8 [Enterococcus faecium R494]
 gi|425039651|ref|ZP_18444168.1| glycosyltransferase, family 8 [Enterococcus faecium 513]
 gi|402920658|gb|EJX41154.1| glycosyltransferase, family 8 [Enterococcus faecium V689]
 gi|402934695|gb|EJX54015.1| glycosyltransferase, family 8 [Enterococcus faecium R494]
 gi|403015200|gb|EJY28128.1| glycosyltransferase, family 8 [Enterococcus faecium 513]
          Length = 280

 Score = 38.9 bits (89), Expect = 8.0,   Method: Compositional matrix adjust.
 Identities = 51/199 (25%), Positives = 87/199 (43%), Gaps = 36/199 (18%)

Query: 197 LKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSR---LRQRQTIGLVENQSDWYLG 253
            +L   ++LP+ + + I LD D++    +  LW        L     +GL +  +     
Sbjct: 90  FRLLAGEILPKEMKRVIYLDPDILVVNPLLDLWQTDLEGYMLAAATHVGLTDVSTRVNQM 149

Query: 254 KLWKNHKPWPALGRGFNTGVILLDLTKLRD-ISWAGFWRIIAEKFLLTRLWTSLADQDIF 312
           +L  +H         +N+GV+++DL K R+ I W+   ++I EK+ L  +   L DQDI 
Sbjct: 150 RLDVDHA-------YYNSGVMVIDLDKAREIIKWSDIAQMI-EKYNLLLI---LPDQDIL 198

Query: 313 NAIISEHPYLVYTLPCQ-WNVQLSDNTRSDELCYTELTDLKIIHW-----------NSPK 360
           N +  ++      +P + WN     +TR     +T+      IHW             PK
Sbjct: 199 NRLYGKY---TKEIPEEIWNY----DTRKYMRYFTKSLAQHDIHWVMAHTAILHFCGGPK 251

Query: 361 KLKVKNKHMEFFRNLYLTF 379
                +KH   F +LYLT+
Sbjct: 252 PW--DDKHDNRFTSLYLTY 268


>gi|224117396|ref|XP_002317564.1| glycosyltransferase, CAZy family GT8 [Populus trichocarpa]
 gi|222860629|gb|EEE98176.1| glycosyltransferase, CAZy family GT8 [Populus trichocarpa]
          Length = 532

 Score = 38.9 bits (89), Expect = 8.2,   Method: Compositional matrix adjust.
 Identities = 21/83 (25%), Positives = 42/83 (50%), Gaps = 3/83 (3%)

Query: 190 YSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSD 249
           Y+ V   +++ LP++ P +L K + LD D++  TD++ LW +    +    +     Q  
Sbjct: 325 YNSVMNHIRIHLPELFP-SLNKVVFLDDDIVVQTDLSPLWDIDMNGKVNGAVETCRGQDK 383

Query: 250 WYLGKLWKNHKPW--PALGRGFN 270
           + + K  KN+  +  P + + FN
Sbjct: 384 FVMSKRLKNYLNFSHPLIAKNFN 406


>gi|224141469|ref|XP_002324094.1| glycosyltransferase [Populus trichocarpa]
 gi|222867096|gb|EEF04227.1| glycosyltransferase [Populus trichocarpa]
          Length = 528

 Score = 38.9 bits (89), Expect = 8.7,   Method: Compositional matrix adjust.
 Identities = 17/70 (24%), Positives = 38/70 (54%), Gaps = 1/70 (1%)

Query: 93  YSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSD 152
           Y  +   L++ +P++ P +L K + LD DV+   D++ LW +  + +    +   + + +
Sbjct: 319 YISILNHLRIYIPELFP-SLDKVVFLDDDVVIQRDLSPLWEIDLKGKVNGAVETCKGEDE 377

Query: 153 WYLGKLWKNH 162
           W + K +KN+
Sbjct: 378 WVMSKHFKNY 387



 Score = 38.9 bits (89), Expect = 8.7,   Method: Compositional matrix adjust.
 Identities = 17/70 (24%), Positives = 38/70 (54%), Gaps = 1/70 (1%)

Query: 190 YSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSD 249
           Y  +   L++ +P++ P +L K + LD DV+   D++ LW +  + +    +   + + +
Sbjct: 319 YISILNHLRIYIPELFP-SLDKVVFLDDDVVIQRDLSPLWEIDLKGKVNGAVETCKGEDE 377

Query: 250 WYLGKLWKNH 259
           W + K +KN+
Sbjct: 378 WVMSKHFKNY 387


>gi|218706579|ref|YP_002414098.1| putative Capsule biosynthesis protein [Escherichia coli UMN026]
 gi|300900284|ref|ZP_07118463.1| hypothetical protein HMPREF9552_04335 [Escherichia coli MS 198-1]
 gi|419934881|ref|ZP_14451973.1| putative Capsule biosynthesis protein [Escherichia coli 576-1]
 gi|432403347|ref|ZP_19646092.1| hypothetical protein WEK_03550 [Escherichia coli KTE26]
 gi|432462309|ref|ZP_19704444.1| hypothetical protein A15I_03183 [Escherichia coli KTE204]
 gi|432632844|ref|ZP_19868765.1| hypothetical protein A1UW_03231 [Escherichia coli KTE80]
 gi|432767333|ref|ZP_20001727.1| hypothetical protein A1S9_00118 [Escherichia coli KTE50]
 gi|432963419|ref|ZP_20152838.1| hypothetical protein A15E_03777 [Escherichia coli KTE202]
 gi|433049396|ref|ZP_20236734.1| hypothetical protein WII_03332 [Escherichia coli KTE120]
 gi|433064414|ref|ZP_20251326.1| hypothetical protein WIO_03240 [Escherichia coli KTE125]
 gi|433069285|ref|ZP_20256061.1| hypothetical protein WIQ_03168 [Escherichia coli KTE128]
 gi|218433676|emb|CAR14591.1| putative Capsule biosynthesis protein [Escherichia coli UMN026]
 gi|300356174|gb|EFJ72044.1| hypothetical protein HMPREF9552_04335 [Escherichia coli MS 198-1]
 gi|388406150|gb|EIL66559.1| putative Capsule biosynthesis protein [Escherichia coli 576-1]
 gi|430923733|gb|ELC44466.1| hypothetical protein WEK_03550 [Escherichia coli KTE26]
 gi|430986553|gb|ELD03120.1| hypothetical protein A15I_03183 [Escherichia coli KTE204]
 gi|431167973|gb|ELE68227.1| hypothetical protein A1UW_03231 [Escherichia coli KTE80]
 gi|431322497|gb|ELG10082.1| hypothetical protein A1S9_00118 [Escherichia coli KTE50]
 gi|431471994|gb|ELH51886.1| hypothetical protein A15E_03777 [Escherichia coli KTE202]
 gi|431562785|gb|ELI36028.1| hypothetical protein WII_03332 [Escherichia coli KTE120]
 gi|431579442|gb|ELI52025.1| hypothetical protein WIO_03240 [Escherichia coli KTE125]
 gi|431580783|gb|ELI53338.1| hypothetical protein WIQ_03168 [Escherichia coli KTE128]
          Length = 330

 Score = 38.9 bits (89), Expect = 8.7,   Method: Compositional matrix adjust.
 Identities = 22/62 (35%), Positives = 30/62 (48%), Gaps = 6/62 (9%)

Query: 551 LRNVALNQVSTPYVFLLDIDFLPMFGLYPYLKSSIRSMDMHGHGGKKVLVVPAFETQRYR 610
           LRN A   V + YV LLD+D  P F L+   K  I+S      G K   ++P     +Y 
Sbjct: 77  LRNQAFKSVESEYVILLDVDIYPDFNLFEKYKEKIKS------GIKPFYILPCLYLTKYG 130

Query: 611 TA 612
           T+
Sbjct: 131 TS 132


>gi|405376632|ref|ZP_11030585.1| LPS:glycosyltransferase [Rhizobium sp. CF142]
 gi|397326770|gb|EJJ31082.1| LPS:glycosyltransferase [Rhizobium sp. CF142]
          Length = 322

 Score = 38.9 bits (89), Expect = 8.7,   Method: Compositional matrix adjust.
 Identities = 36/124 (29%), Positives = 61/124 (49%), Gaps = 23/124 (18%)

Query: 198 KLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSDWYLGKLWK 257
           ++ +P+ LPET ++ + LD D++  T + QLW +         IG V    D++L  +  
Sbjct: 94  RILMPQFLPETCSRVLYLDGDILVLTALEQLWNIDL---GEAVIGAVP---DYWLDNVVS 147

Query: 258 NHKPWPA---LGRGFNTGVILLDLTKLRDISWAGFWRI--IAEKFL--LTRLWTS-LADQ 309
           + +       + R FN G++L+DL +         WR   I+E+ L  L R  T+  +DQ
Sbjct: 148 SGRGATGGARVERYFNAGILLIDLAR---------WRAERISERSLDYLDRFPTTEYSDQ 198

Query: 310 DIFN 313
           D  N
Sbjct: 199 DALN 202


>gi|418935516|ref|ZP_13489286.1| glycosyl transferase family 8 [Rhizobium sp. PDO1-076]
 gi|375057763|gb|EHS53917.1| glycosyl transferase family 8 [Rhizobium sp. PDO1-076]
          Length = 306

 Score = 38.9 bits (89), Expect = 8.7,   Method: Compositional matrix adjust.
 Identities = 38/146 (26%), Positives = 67/146 (45%), Gaps = 11/146 (7%)

Query: 190 YSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQSD 249
           Y  V  + +L +  ++PET+++ + LD D +    +A +  LFS       +G V++   
Sbjct: 81  YWSVATIARLYMDTLVPETISRLVYLDADTL---AVASIVPLFSLDMGGHPVGAVDDCLM 137

Query: 250 WYLGKL--WKNHKPWPALGRGFNTGVILLDLTKLRDISWAGFWRIIAEKFLLTRLWTSLA 307
            +  K+   K+     A  R FN GV+L D + L + +     +   E FL         
Sbjct: 138 AFPDKMSERKSRIGMAAEARYFNAGVLLFDWSVLPETN---VLKEAREIFLKDPERYPFK 194

Query: 308 DQDIFNAIISEHPYLVYTLPCQWNVQ 333
           DQD+ N +++E+      L  +WN Q
Sbjct: 195 DQDVLNVVLAEN---WLALDPRWNTQ 217


>gi|313225951|emb|CBY21094.1| unnamed protein product [Oikopleura dioica]
          Length = 330

 Score = 38.9 bits (89), Expect = 8.7,   Method: Compositional matrix adjust.
 Identities = 41/141 (29%), Positives = 65/141 (46%), Gaps = 28/141 (19%)

Query: 542 EGNFYPINTLRNVALNQVSTPYVFLLDIDFLPMFGLY--PYLK-------SSIRSMDMHG 592
           E +++P N LRNVAL       +FL ++  L    LY  P  +       +S   +D + 
Sbjct: 141 EESWFPNNLLRNVALKPKIE--IFLKNMSDLSNKDLYLIPVFEKESSTQINSFPQLDENY 198

Query: 593 HGGKKVLVVPAFETQRYRTAFPASHAPTNFSRWVNATTP--------YQIEWAPDFEPYI 644
             GK V +  A+ T+         ++  +F +W   T          Y+I++   FEPY 
Sbjct: 199 QNGK-VQMFHAWCTE-------WCYSFYDFPKWFEKTKKNPKSTEVGYKIKYKLSFEPYY 250

Query: 645 VAHRD-LPRYDTRFVGFGWNK 664
           +   D +P YD RF G+G+NK
Sbjct: 251 LGRLDAVPFYDIRFRGYGYNK 271


>gi|302840832|ref|XP_002951962.1| hypothetical protein VOLCADRAFT_92485 [Volvox carteri f.
           nagariensis]
 gi|300262863|gb|EFJ47067.1| hypothetical protein VOLCADRAFT_92485 [Volvox carteri f.
           nagariensis]
          Length = 550

 Score = 38.9 bits (89), Expect = 8.7,   Method: Compositional matrix adjust.
 Identities = 25/74 (33%), Positives = 38/74 (51%), Gaps = 10/74 (13%)

Query: 594 GGKKVLVVPAFETQRYRT----AFPASHAPTNFSRWVNATTPYQIEWAPDFEPYIVAHRD 649
           G KK L +  ++T+  RT     FP  H  T++ RW++   PY I +   +EP+ +  R 
Sbjct: 342 GDKKRLQM-LWQTRHLRTFSEDIFPQGHNATDYRRWLSTDQPYTIRYGRGYEPWGITSRK 400

Query: 650 ----LPRYDTRFVG 659
               +P YD RF G
Sbjct: 401 QYTWMP-YDVRFRG 413


>gi|300770875|ref|ZP_07080752.1| family 2 glycosyl transferase [Sphingobacterium spiritivorum ATCC
           33861]
 gi|300762148|gb|EFK58967.1| family 2 glycosyl transferase [Sphingobacterium spiritivorum ATCC
           33861]
          Length = 271

 Score = 38.9 bits (89), Expect = 8.7,   Method: Compositional matrix adjust.
 Identities = 17/47 (36%), Positives = 28/47 (59%)

Query: 526 EALRSRRNIGYHVVYKEGNFYPINTLRNVALNQVSTPYVFLLDIDFL 572
           E  R R NI    ++ E N + ++ +RN A++QVS PY+  +D D +
Sbjct: 53  EKFRLRTNISIQHIWHEDNGFQLSQIRNKAISQVSNPYIIQIDGDVI 99


>gi|209877503|ref|XP_002140193.1| glycosyl transferase family 8 protein [Cryptosporidium muris RN66]
 gi|209555799|gb|EEA05844.1| glycosyl transferase family 8 protein, putative [Cryptosporidium
           muris RN66]
          Length = 304

 Score = 38.9 bits (89), Expect = 9.9,   Method: Compositional matrix adjust.
 Identities = 29/101 (28%), Positives = 50/101 (49%), Gaps = 11/101 (10%)

Query: 188 KHYSGVYGLLKLTLPKVLPETLAKTIVLDTDVIFATDIAQLWALFSRLRQRQTIGLVENQ 247
           KH S    + +L LP ++ +++ K + LDTDVI  T + +L+     +      G+V   
Sbjct: 129 KHVSEA-TMCRLLLPNIIDKSIDKLLYLDTDVIVNTPLRELFG----ININSQCGIVARS 183

Query: 248 SDW--YLGKLWKNHKPWPAL----GRGFNTGVILLDLTKLR 282
           S     + +  K  K +P +     + FN GV+L+ L +LR
Sbjct: 184 STKADLINEWLKKDKIYPHIIYNGTKSFNAGVLLISLNELR 224


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.322    0.137    0.426 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 10,707,228,967
Number of Sequences: 23463169
Number of extensions: 457256966
Number of successful extensions: 933385
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 382
Number of HSP's successfully gapped in prelim test: 661
Number of HSP's that attempted gapping in prelim test: 929761
Number of HSP's gapped (non-prelim): 2398
length of query: 664
length of database: 8,064,228,071
effective HSP length: 149
effective length of query: 515
effective length of database: 8,863,183,186
effective search space: 4564539340790
effective search space used: 4564539340790
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 80 (35.4 bits)