RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy15555
         (488 letters)



>gnl|CDD|99978 cd03806, GT1_ALG11_like, This family is most closely related to the
           GT1 family of glycosyltransferases. ALG11 in yeast is
           involved in adding the final 1,2-linked Man to the
           Man5GlcNAc2-PP-Dol synthesized on the cytosolic face of
           the ER. The deletion analysis of ALG11 was shown to
           block the early steps of core biosynthesis that takes
           place on the cytoplasmic face of the ER and lead to a
           defect in the assembly of lipid-linked oligosaccharides.
          Length = 419

 Score =  638 bits (1649), Expect = 0.0
 Identities = 227/419 (54%), Positives = 294/419 (70%), Gaps = 1/419 (0%)

Query: 53  KTVAFFHPYCNAGGGGERVLWTAVLALHQKYPDYKIYIYTGDVDASPSEIIKRAHQRFNI 112
            TV FFHPYCNAGGGGERVLW AV AL ++YP+  + IYTGD+DA+P EI+++   RFNI
Sbjct: 1   NTVGFFHPYCNAGGGGERVLWCAVRALQKRYPNNIVVIYTGDLDATPEEILEKVESRFNI 60

Query: 113 VLPDQVINFVYLYRRKFVEASLYPYFTLLGQSIGSMILGVEALLSFQPDIYIDTMGYAFT 172
            L    I F  L  RK VEAS YP FTLLGQ++GSMILG+EALL   PDI+IDTMGY FT
Sbjct: 61  ELDRPRIVFFLLKYRKLVEASTYPRFTLLGQALGSMILGLEALLKLVPDIFIDTMGYPFT 120

Query: 173 YPLFSYIGGSKVACYIHYPTITKEMLTRVARRVITHNNSQRVANNPILTSFKLFYYKVFA 232
           YPL   +GG  V  Y+HYPTI+ +ML +V  R  ++NNS  +A +P+L+  KL YY++FA
Sbjct: 121 YPLVRLLGGCPVGAYVHYPTISTDMLQKVRSREASYNNSATIARSPVLSKAKLLYYRLFA 180

Query: 233 LLYSHVGKYSDIIMVNSSWTEEHVIQLWNCQLKTYKLYPPCDTEDLKKITHSKTDGPVKI 292
            LY   G ++D++MVNS+WT  H+  LW    K   +YPPCD E+L K+   +     +I
Sbjct: 181 FLYGLAGSFADVVMVNSTWTRNHIRSLWKRNTKPSIVYPPCDVEELLKLPLDEKTRENQI 240

Query: 293 ISVAQFRPEKDHPLQLRAMYQLRQIISEELWDNLKLIFIGSTRNEEDEVCVKDMQDLCKH 352
           +S+AQFRPEK+HPLQLRA  +L + + EE+ + +KL+ IGS RNE+DE  V+D++ L K 
Sbjct: 241 LSIAQFRPEKNHPLQLRAFAKLLKRLPEEIKEKIKLVLIGSCRNEDDEKRVEDLKLLAKE 300

Query: 353 LSLENNVEFKVNLPYEDMKKEFSEGLIGLHAMWNEHFGIGIVECMAAGLIMIAHKSGGPK 412
           L LE+ VEF VN P+E++ +E S   IGLH MWNEHFGIG+VE MAAGLI +AH SGGP 
Sbjct: 301 LGLEDKVEFVVNAPFEELLEELSTASIGLHTMWNEHFGIGVVEYMAAGLIPLAHASGGPL 360

Query: 413 MDIVIEDPETCRNGFLACDEVEYAQTIKLILHLSQDTKTRISQNAVSSVDRFSMEEFKN 471
           +DIV+        GFLA    EYA+ I+ IL LS++ + RI + A SSV RFS EEF+ 
Sbjct: 361 LDIVVPWDGG-PTGFLASTAEEYAEAIEKILSLSEEERLRIRRAARSSVKRFSDEEFER 418


>gnl|CDD|215511 PLN02949, PLN02949, transferase, transferring glycosyl groups.
          Length = 463

 Score =  484 bits (1248), Expect = e-169
 Identities = 198/474 (41%), Positives = 280/474 (59%), Gaps = 19/474 (4%)

Query: 10  VMWLVFL-FYSILALLLLSIIVLPLSVLLFKYYVSKKRKSYNVLKTVAFFHPYCNAGGGG 68
            +WL+     + + LLL++I +  L            R   +  + V FFHPY N GGGG
Sbjct: 2   AIWLILYHLLTSIVLLLVAIALSVL------------RARRSRKRAVGFFHPYTNDGGGG 49

Query: 69  ERVLWTAVLALHQKYPDYKIYIYTGDVDASPSEIIKRAHQRFNIVLPDQVINFVYLYRRK 128
           ERVLW AV A+ ++ PD    IYTGD DASP  +  RA  RF + L       V+L +RK
Sbjct: 50  ERVLWCAVRAIQEENPDLDCVIYTGDHDASPDSLAARARDRFGVELL-SPPKVVHLRKRK 108

Query: 129 FVEASLYPYFTLLGQSIGSMILGVEALLSFQPDIYIDTMGYAFTYPLFSYIGGSKVACYI 188
           ++E   YP FT++GQS+GS+ L  EAL  F P  + DT GYAFTYPL + + G KV CY 
Sbjct: 109 WIEEETYPRFTMIGQSLGSVYLAWEALCKFTPLYFFDTSGYAFTYPL-ARLFGCKVVCYT 167

Query: 189 HYPTITKEMLTRVARRVITHNNSQRVANNPILTSFKLFYYKVFALLYSHVGKYSDIIMVN 248
           HYPTI+ +M++RV  R   +NN   +A +  L++ K+ YY+ FA +Y  VG+ + + MVN
Sbjct: 168 HYPTISSDMISRVRDRSSMYNNDASIARSFWLSTCKILYYRAFAWMYGLVGRCAHLAMVN 227

Query: 249 SSWTEEHVIQLWNCQLKTYKLYPPCDTEDLKKITHSKTDGPVKIISVAQFRPEKDHPLQL 308
           SSWT+ H+  LW    +  ++YPPCDT  L+ +   +++ P  IISVAQFRPEK H LQL
Sbjct: 228 SSWTKSHIEALWRIPERIKRVYPPCDTSGLQALPLERSEDPPYIISVAQFRPEKAHALQL 287

Query: 309 RAMYQLRQIISEELWDNLKLIFIGSTRNEEDEVCVKDMQDLCKHLSLENNVEFKVNLPYE 368
            A     + +  ++    KL F+GS RN+EDE  ++ ++D  K L L+ +VEF  N+ Y 
Sbjct: 288 EAFALALEKLDADV-PRPKLQFVGSCRNKEDEERLQKLKDRAKELGLDGDVEFHKNVSYR 346

Query: 369 DMKKEFSEGLIGLHAMWNEHFGIGIVECMAAGLIMIAHKSGGPKMDIVI-EDPETCRNGF 427
           D+ +     + GLH+M +EHFGI +VE MAAG + IAH S GPKMDIV+ ED +  + GF
Sbjct: 347 DLVRLLGGAVAGLHSMIDEHFGISVVEYMAAGAVPIAHNSAGPKMDIVLDEDGQ--QTGF 404

Query: 428 LACDEVEYAQTIKLILHLSQDTKTRISQNAVSSVDRFSMEEFKNGFLTFTQPLF 481
           LA    EYA  I  +L + +  +  I+  A    +RFS + F   F    +P+ 
Sbjct: 405 LATTVEEYADAILEVLRMRETERLEIAAAARKRANRFSEQRFNEDFKDAIRPIL 458


>gnl|CDD|99977 cd03805, GT1_ALG2_like, This family is most closely related to the
           GT1 family of glycosyltransferases.  ALG2, a
           1,3-mannosyltransferase, in yeast catalyzes the
           mannosylation of Man(2)GlcNAc(2)-dolichol diphosphate
           and Man(1)GlcNAc(2)-dolichol diphosphate to form
           Man(3)GlcNAc(2)-dolichol diphosphate. A deficiency of
           this enzyme causes an abnormal accumulation of
           Man1GlcNAc2-PP-dolichol and Man2GlcNAc2-PP-dolichol,
           which is associated with a type of congenital disorders
           of glycosylation (CDG), designated CDG-Ii, in humans.
          Length = 392

 Score =   99 bits (250), Expect = 6e-23
 Identities = 114/445 (25%), Positives = 185/445 (41%), Gaps = 85/445 (19%)

Query: 53  KTVAFFHPYCNAG-GGGERVLWTAVLALHQKYPDYKIYIYTGDVDAS--PSEIIKRAHQR 109
             VAF HP    G GG ER++  A LAL  +   +++ IYT   D S    E        
Sbjct: 1   LRVAFIHPDL--GIGGAERLVVDAALALQSR--GHEVTIYTSHHDPSHCFEETK------ 50

Query: 110 FNIVLPDQVINFVYLYRRKFVEASLYPYFTLLGQSIGSMILGVEALLSF---QPDIYI-D 165
            +  LP +V       R  ++  S++  F +L   +  M+     LL     + D++I D
Sbjct: 51  -DGTLPVRV-------RGDWLPRSIFGRFHILCAYL-RMLYLALYLLLLPDEKYDVFIVD 101

Query: 166 TMGYAFTYPLFSYIGGSKVACYIHYPTITKEMLTRVARRVITHNNSQRVANNPILTSFKL 225
            +  +   PL      SK+  Y H+P                    Q +A     +  K 
Sbjct: 102 QV--SACVPLLKLFSPSKILFYCHFP-------------------DQLLAQRG--SLLKR 138

Query: 226 FYYKVFALLYSHVGKYSDIIMVNSSWTEEHVIQLWNCQLKTYK---LYPPCDTEDLKKIT 282
            Y K F  L       +D I+VNS++T     + +   L       +YP  DT+  +  +
Sbjct: 139 LYRKPFDWLEEFTTGMADKIVVNSNFTASVFKKTFP-SLAKNPREVVYPCVDTDSFESTS 197

Query: 283 HSKTDGPVKI-------ISVAQFRPEKDHPLQLRAMYQLRQIISEELWDNLKLIFIGS-- 333
                G +         +S+ +F  +K+  L + A   L+  ++E  + N++L+  G   
Sbjct: 198 EDPDPGLLIPKSGKKTFLSINRFERKKNIALAIEAFAILKDKLAE--FKNVRLVIAGGYD 255

Query: 334 TRNEEDEVCVKDMQDLC-KHLSLENNVEFKVNLPYEDMKKE--FSEGLIGLHAMWNEHFG 390
            R  E+   ++++Q L  + L LE+ V F  ++   D +KE   S     L+   NEHFG
Sbjct: 256 PRVAENVEYLEELQRLAEELLLLEDQVIFLPSIS--DSQKELLLSSARALLYTPSNEHFG 313

Query: 391 IGIVECMAAGLIMIAHKSGGPKMDIVIEDPETCRN---GFLACD--EVEYAQTIKLILHL 445
           I  +E M AG  +IA  SGGP         ET  +   GFL C+    E+A+ +  + + 
Sbjct: 314 IVPLEAMYAGKPVIACNSGGPL--------ETVVDGETGFL-CEPTPEEFAEAMLKLAN- 363

Query: 446 SQDTKTRISQNAVSSV-DRFSMEEF 469
             D   R+       V ++FS E F
Sbjct: 364 DPDLADRMGAAGRKRVKEKFSTEAF 388


>gnl|CDD|223515 COG0438, RfaG, Glycosyltransferase [Cell envelope biogenesis, outer
           membrane].
          Length = 381

 Score = 89.6 bits (221), Expect = 2e-19
 Identities = 64/367 (17%), Positives = 126/367 (34%), Gaps = 26/367 (7%)

Query: 118 VINFVYLYRRKFVEASLYPYFTLLGQSIGSMILGVEALLSFQPDIYIDTMGYAFTYPLFS 177
           V+      R       +        + +G  +L + ++      + +          L  
Sbjct: 21  VLELAKALRLLGRGHEVLVIAPEKLEGLGIEVLRLPSVSLPLLIVRLRPALLRLLLLLKR 80

Query: 178 YIGGSKVACYIHYPTITKEMLTRVARR------VITHNNSQRVANNPILTSFKLFYYKVF 231
            +    +  +         +   + +       V  H    R+   P L         + 
Sbjct: 81  LLPYDIIHAHSLLLAPGGLLALLLLKLLGIPLVVTLHGLIPRILLLPRLLLLLGLLRLLL 140

Query: 232 ALLYSHVGKYSDIIMVNSSWTEEHVIQLWNCQLKTYKLYPPCDTEDLKKITH--SKTDGP 289
             L   +   +D ++  S   +E +  L     K   +    DTE             G 
Sbjct: 141 KRLKKALRLLADRVIAVSPALKELLEALGV-PNKIVVIPNGIDTEKFAPARIGLLPEGGK 199

Query: 290 VKIISVAQFRPEKDHPLQLRAMYQLRQIISEELWDNLKLIFIGSTRNEEDEVCVKDMQDL 349
             ++ V +  PEK   L + A  +L++        ++KL+ +G      +E     ++ L
Sbjct: 200 FVVLYVGRLDPEKGLDLLIEAAAKLKKRG-----PDIKLVIVGDGPERREE-----LEKL 249

Query: 350 CKHLSLENNVEFKVNLPYEDMKKEFSEGLIGLHAMWNEHFGIGIVECMAAGLIMIAHKSG 409
            K L LE+NV+F   +P E++ +  +   + +    +E FG+ ++E MAAG  +IA   G
Sbjct: 250 AKKLGLEDNVKFLGYVPDEELAELLASADVFVLPSLSEGFGLVLLEAMAAGTPVIASDVG 309

Query: 410 GPKMDIVIEDPETCRNGFLAC--DEVEYAQTIKLILHLSQDTKTRISQNAVSSVDRFSME 467
           G   ++V +       G L    D  E A  ++ +L   +  +           + FS E
Sbjct: 310 GIP-EVVEDG----ETGLLVPPGDVEELADALEQLLEDPELREELGEAARERVEEEFSWE 364

Query: 468 EFKNGFL 474
                 L
Sbjct: 365 RIAEQLL 371


>gnl|CDD|99974 cd03801, GT1_YqgM_like, This family is most closely related to the
           GT1 family of glycosyltransferases and named after YqgM
           in Bacillus licheniformis about which little is known.
           Glycosyltransferases catalyze the transfer of sugar
           moieties from activated donor molecules to specific
           acceptor molecules, forming glycosidic bonds. The
           acceptor molecule can be a lipid, a protein, a
           heterocyclic compound, or another carbohydrate residue.
           This group of glycosyltransferases is most closely
           related to the previously defined glycosyltransferase
           family 1 (GT1). The members of this family may transfer
           UDP, ADP, GDP, or CMP linked sugars. The diverse
           enzymatic activities among members of this family
           reflect a wide range of biological functions. The
           protein structure available for this family has the GTB
           topology, one of the two protein topologies observed for
           nucleotide-sugar-dependent glycosyltransferases. GTB
           proteins have distinct N- and C- terminal domains each
           containing a typical Rossmann fold. The two domains have
           high structural homology despite minimal sequence
           homology. The large cleft that separates the two domains
           includes the catalytic center and permits a high degree
           of flexibility. The members of this family are found
           mainly in certain bacteria and archaea.
          Length = 374

 Score = 87.1 bits (216), Expect = 1e-18
 Identities = 79/426 (18%), Positives = 143/426 (33%), Gaps = 75/426 (17%)

Query: 55  VAFFHPYCN-AGGGGERVLWTAVLALHQKYPDYKIYIYTGDVDASPSEIIKRAHQRFNIV 113
           +    P    + GG ER +     AL  +   +++ + T      P             V
Sbjct: 2   ILLVTPEYPPSVGGAERHVLELARALAAR--GHEVTVLTPGDGGLP---------DEEEV 50

Query: 114 LPDQVINFVYLYRRKFVEASLYPYFTLLGQSIGSMILGVEALLSFQPDIYIDTMGYAFTY 173
               V+    L R + +   L     L              L   + D+      +A  +
Sbjct: 51  GGIVVVRPPPLLRVRRLLLLLLLALRLR-----------RLLRRERFDVV-----HAHDW 94

Query: 174 PLFSYIGGSKVACYIHYPTITKEMLTRVARRVITHNNSQRVANNPILTSFKLFYYKVFAL 233
                   +                      V+T +  +       L         +   
Sbjct: 95  LALLAAALAARL--------------LGIPLVLTVHGLEFGRPGNELGLLLKLARALERR 140

Query: 234 LYSHVGKYSDIIMVNSSWTEEHVIQLWNCQL-KTYKLYPPCDTE-------DLKKITHSK 285
                 + +D I+  S  T E + +L      K   +    DTE         ++     
Sbjct: 141 AL----RRADRIIAVSEATREELRELGGVPPEKITVIPNGVDTERFRPAPRAARRRLGIP 196

Query: 286 TDGPVKIISVAQFRPEKDHPLQLRAMYQLRQIISEELWDNLKLIFIGSTRNEEDEVCVKD 345
            D PV I+ V +  P K   L L A+ +LR+        +++L+ +G      +E     
Sbjct: 197 EDEPV-ILFVGRLVPRKGVDLLLEALAKLRKEYP-----DVRLVIVG-DGPLREE----- 244

Query: 346 MQDLCKHLSLENNVEFKVNLPYEDMKKEFSEGLIGLHAMWNEHFGIGIVECMAAGLIMIA 405
           ++ L   L L + V F   +P ED+   ++   + +     E FG+ ++E MAAGL ++A
Sbjct: 245 LEALAAELGLGDRVTFLGFVPDEDLPALYAAADVFVLPSLYEGFGLVLLEAMAAGLPVVA 304

Query: 406 HKSGGPKMDIVIEDPETCRNGFLA--CDEVEYAQTIKLILHLSQDTKTRISQNAVSSV-D 462
              GG   ++V +       G L    D    A+ I  +L    + + R+ + A   V +
Sbjct: 305 SDVGGIP-EVVEDG----ETGLLVPPGDPEALAEAILRLLD-DPELRRRLGEAARERVAE 358

Query: 463 RFSMEE 468
           RFS + 
Sbjct: 359 RFSWDR 364


>gnl|CDD|215979 pfam00534, Glycos_transf_1, Glycosyl transferases group 1.
           Mutations in this domain of human PIGA lead to disease
           (Paroxysmal Nocturnal haemoglobinuria). Members of this
           family transfer activated sugars to a variety of
           substrates, including glycogen, Fructose-6-phosphate and
           lipopolysaccharides. Members of this family transfer
           UDP, ADP, GDP or CMP linked sugars. The eukaryotic
           glycogen synthases may be distant members of this
           family.
          Length = 158

 Score = 80.0 bits (198), Expect = 9e-18
 Identities = 50/172 (29%), Positives = 81/172 (47%), Gaps = 19/172 (11%)

Query: 288 GPVKIISVAQFRPEKDHPLQLRAMYQLRQIISEELWDNLKLIFIGSTRNEEDEVCVKDMQ 347
               I+ V +  PEK   L L A   L++        NLKL+ +G    EE +     ++
Sbjct: 1   DKPIILFVGRLVPEKGLDLLLEAFALLKEQH-----PNLKLVIVGD-GEEEKK-----LK 49

Query: 348 DLCKHLSLENNVEFKVNLPYEDMKKEFSEGLIGLHAMWNEHFGIGIVECMAAGLIMIAHK 407
            L   L LE+NV F   +P ED+ + +    + +     E FG+ ++E MAAG+ +IA  
Sbjct: 50  KLALKLGLEDNVIFVGFVPDEDLIELYRIADLFVLPSRYEGFGLVLLEAMAAGVPVIATD 109

Query: 408 SGGPKMDIVIEDPETCRNGFL--ACDEVEYAQTIKLILHLSQDTKTRISQNA 457
            GGP  +IV +       G L    D    A+ I+ +L   ++ + R+ +NA
Sbjct: 110 VGGPA-EIVKDG----ETGLLVDPGDAEALAEAIEKLLK-DEELRERLGENA 155


>gnl|CDD|99976 cd03804, GT1_wbaZ_like, This family is most closely related to the
           GT1 family of glycosyltransferases.  wbaZ in Salmonella
           enterica has been shown to possess the mannosyl
           transferase activity. The members of this family are
           found in certain bacteria and Archaea.
          Length = 351

 Score = 68.3 bits (168), Expect = 2e-12
 Identities = 88/431 (20%), Positives = 138/431 (32%), Gaps = 94/431 (21%)

Query: 55  VAFFHPYCNAGGGGERVLWTAVLALHQKYPDYKIY-IYTGDVDASPSEIIKRAHQRFNIV 113
           VA  H +    GGGE+V    V AL + +PD  I+ +            +K+    F   
Sbjct: 2   VAIVHDWLVNIGGGEKV----VEALARLFPDADIFTLVDDPDKLPRLLRLKKIRTSFIQK 57

Query: 114 LPDQVINFVYLYRRKFVEASLYPYFTLLGQSIGSMILGVEALLSFQPDIYIDTMGYAFTY 173
           LP          RR++     Y            M L +E       D+ I +  +A   
Sbjct: 58  LP--------FARRRYR---KYLPL---------MPLAIEQFDLSGYDLVISS-SHAVAK 96

Query: 174 PLFSYIGGSKVACYIHYPTITKEMLTRVARRVITHNNSQRVANNPILTSFKLFY-YKVFA 232
            + +      + CY H P            R         +  + +     L        
Sbjct: 97  GVITRPDQLHI-CYCHTPM-----------RYAWDLYHDYLKESGLGKRLALRLLLHYLR 144

Query: 233 LLYSHVGKYSDIIMVNSSWTEEHV---IQLWNCQLKTYK-----LYPPCDTEDLKKITHS 284
           +         D  + NS      V   I+      K Y      +YPP DT+   + T +
Sbjct: 145 IWDRRSAARVDYFIANS----RFVARRIK------KYYGRDATVIYPPVDTD---RFTPA 191

Query: 285 KTDGPVKIISVAQFRPEKDHPLQLRAMYQLRQIISEELWDNLKLIFIGSTRNEEDEVCVK 344
           +       +SV +  P K   L + A  +L            +L+ IG    E D +  K
Sbjct: 192 EEKEDY-YLSVGRLVPYKRIDLAIEAFNKL----------GKRLVVIGDGP-ELDRLRAK 239

Query: 345 DMQDLCKHLSLENNVEFKVNLPYEDMKKEFSEGLIGLHAMWNEHFGIGIVECMAAGLIMI 404
                        NV F   +  ++  ++             E FGI  VE MA+G  +I
Sbjct: 240 ----------AGPNVTFLGRVS-DEELRDLYARARAFLFPAEEDFGIVPVEAMASGTPVI 288

Query: 405 AHKSGGPKMDIVIEDPETCRNGFL-ACDEVE-YAQTIKLILHLSQDTKTRISQNAVSSVD 462
           A+  GG   + VI+       G L     VE  A  ++            I  +A    +
Sbjct: 289 AYGKGGAL-ETVIDGV----TGILFEEQTVESLAAAVERFEKNEDFDPQAIRAHA----E 339

Query: 463 RFSMEEFKNGF 473
           RFS   F+   
Sbjct: 340 RFSESRFREKI 350


>gnl|CDD|99971 cd03798, GT1_wlbH_like, This family is most closely related to the
           GT1 family of glycosyltransferases. wlbH in Bordetella
           parapertussis has been shown to be required for the
           biosynthesis of a trisaccharide that, when attached to
           the B. pertussis lipopolysaccharide (LPS) core (band B),
           generates band A LPS.
          Length = 377

 Score = 67.8 bits (166), Expect = 3e-12
 Identities = 77/432 (17%), Positives = 149/432 (34%), Gaps = 83/432 (19%)

Query: 51  VLKTVAFFHPYCNAGGGGERVLWTAVLALHQKYPDYKIYIYTGDVDASPSEIIKRAHQRF 110
           +L   + + P  N  GGG   +     AL +     ++ +          +++     R 
Sbjct: 1   ILVISSLYPPPNN--GGGGIFVKELARALAK--RGVEVTVLAPG--PWGPKLLDLLKGRL 54

Query: 111 NIVLPDQVINFVYLYRRKFVEASLYPYFTLLGQSIGSMILGVEALLSFQPDI----YIDT 166
             V    V+  V    +  +       + L  +++  ++     L  F+PD+    +   
Sbjct: 55  VGVERLPVLLPVVPLLKGPL------LYLLAARALLKLL----KLKRFRPDLIHAHFAYP 104

Query: 167 MGYAFTYPLFSYIGGSKVACYIHYPTITKEMLTRVARRVITHNNSQRVANNPILTSFKLF 226
            G+A    L     G  +   +H   +      R+ R ++                    
Sbjct: 105 DGFAAA--LLKRKLGIPLVVTLHGSDVNLLPRKRLLRALLRR------------------ 144

Query: 227 YYKVFALLYSHVGKYSDIIMVNSSWTEEHVIQLWNCQLKTY--------KLYPPCDTEDL 278
                        + +D ++  S    + +  L     K          + + P D  + 
Sbjct: 145 -----------ALRRADAVIAVSEALADELKALGIDPEKVTVIPNGVDTERFSPADRAEA 193

Query: 279 KKITHSKTDGPVKIISVAQFRPEKDHPLQLRAMYQLRQIISEELWDNLKLIFIGSTRNEE 338
           +K      +    I+ V +  P K     + A+ +L +        ++ L+ +G     E
Sbjct: 194 RK--LGLPEDKKVILFVGRLVPRKGIDYLIEALARLLKKR-----PDVHLVIVGDGPLRE 246

Query: 339 DEVCVKDMQDLCKHLSLENNVEFKVNLPYEDMKKEFSEGLIGLHAMWNEHFGIGIVECMA 398
                   + L   L LE+ V F   +P+E++   ++   + +     E FG+ ++E MA
Sbjct: 247 AL------EALAAELGLEDRVTFLGAVPHEEVPAYYAAADVFVLPSLREGFGLVLLEAMA 300

Query: 399 AGLIMIAHKSGGPKMDIVIEDPETCRNGFL--ACDEVEYAQTIKLILHLSQDTKTRISQN 456
            GL ++A   GG     +I D E   NG L    D    A+ I  +L    D   R+ + 
Sbjct: 301 CGLPVVATDVGGIPE--IITDGE---NGLLVPPGDPEALAEAILRLL---ADPWLRLGRA 352

Query: 457 AVSSV-DRFSME 467
           A   V +RFS E
Sbjct: 353 ARRRVAERFSWE 364


>gnl|CDD|99982 cd03811, GT1_WabH_like, This family is most closely related to the
           GT1 family of glycosyltransferases. WabH in Klebsiella
           pneumoniae has been shown to transfer a GlcNAc residue
           from UDP-GlcNAc onto the acceptor GalUA residue in the
           cellular outer core.
          Length = 353

 Score = 65.8 bits (161), Expect = 9e-12
 Identities = 80/434 (18%), Positives = 140/434 (32%), Gaps = 101/434 (23%)

Query: 55  VAFFHPYCNAGGGGERVLWTAVLALHQKYPDYKIYIYTGDVDASPSEIIKRAHQRFNIVL 114
           + F  P    GGG ERVL    LA       Y + +           ++ R    +  +L
Sbjct: 2   ILFVIP-SLGGGGAERVL--LNLANGLDKRGYDVTL-----------VVLRDEGDYLELL 47

Query: 115 PDQVINFVYLYRRKFVEASLYPYFTLLGQSIGSMILGVEALLSFQPDIYIDTMG-YAFTY 173
           P  V        +      L     L              L   +PD+ I  +       
Sbjct: 48  PSNVKLIPVRVLKLKSLRDLLAILRL-----------RRLLRKEKPDVVISHLTTTPNVL 96

Query: 174 PLFSYIGGSKVACYIHYPTITKEMLTRVARRVITHNNSQRVANNPILTSFKLFYYKVFAL 233
            L +   G+K+  + H                    + +      +L   +  Y +    
Sbjct: 97  ALLAARLGTKLIVWEH-----------------NSLSLELKRKLRLLLLIRKLYRRA--- 136

Query: 234 LYSHVGKYSDIIMVNSSWTEEHVIQLWNCQL-KTYKLYPPCDTEDLKK-----ITHSKTD 287
                    D I+  S   +E +++L      K   +Y P D E+++      +      
Sbjct: 137 ---------DKIVAVSEGVKEDLLKLLGIPPDKIEVIYNPIDIEEIRALAEEPLELGIPP 187

Query: 288 GPVKIISVAQFRPEKDHPLQLRAMYQLRQIISEELWDNLKLIFIGSTRNEEDEVCVKDMQ 347
               I++V +  P+K     +RA   LR+        + +L+ +G       E    +++
Sbjct: 188 DGPVILAVGRLSPQKGFDTLIRAFALLRK-----EGPDARLVILG--DGPLRE----ELE 236

Query: 348 DLCKHLSLENNVEF--KVNLPYEDMKK-------EFSEGLIGLHAMWNEHFGIGIVECMA 398
            L K L L + V F    + PY  +K           EG           F   ++E MA
Sbjct: 237 ALAKELGLADRVHFLGFQSNPYPYLKAADLFVLSSRYEG-----------FPNVLLEAMA 285

Query: 399 AGLIMIAHKSGGPKMDIVIEDPETCRNGFLA--CDEVEYAQTIKLILHLSQDTKTRISQN 456
            G  ++A    GP+ +I+ +      NG L    DE   A     +L L  D + R  + 
Sbjct: 286 LGTPVVATDCPGPR-EILEDG----ENGLLVPVGDEAALAAAALALLDLLLDPELR--ER 338

Query: 457 AVSSVDRFSMEEFK 470
             ++       E+ 
Sbjct: 339 LAAAARERVAREYS 352


>gnl|CDD|99973 cd03800, GT1_Sucrose_synthase, This family is most closely related
           to the GT1 family of glycosyltransferases. The
           sucrose-phosphate synthases in this family may be unique
           to plants and photosynthetic bacteria. This enzyme
           catalyzes the synthesis of sucrose 6-phosphate from
           fructose 6-phosphate and uridine 5'-diphosphate-glucose,
           a key regulatory step of sucrose metabolism. The
           activity of this enzyme is regulated by phosphorylation
           and moderated by the concentration of various
           metabolites and light.
          Length = 398

 Score = 62.2 bits (152), Expect = 2e-10
 Identities = 52/237 (21%), Positives = 94/237 (39%), Gaps = 31/237 (13%)

Query: 234 LYSHVGKYSDIIMVNSSWTEEHVIQLWNCQLKTYKLYPPCDTEDLKKITHSKTDGPVKII 293
           LYS  G Y   I          V+      +   +  P    E  +       D P  I+
Sbjct: 178 LYSLYGAYPRRI---------RVVP---PGVDLERFTPYGRAEARRARLLRDPDKPR-IL 224

Query: 294 SVAQFRPEKDHPLQLRAMYQLRQIISEELWDNLKLIFIGSTRNEEDEVCVKDMQDLCKHL 353
           +V +  P K     +RA  +L      EL +   L+ +G  R++   +  +++++L + L
Sbjct: 225 AVGRLDPRKGIDTLIRAYAELP-----ELRERANLVIVGGPRDDILAMDEEELRELAREL 279

Query: 354 SLENNVEFKVNLPYEDMKKEFSEGLIGLHAM--WNEHFGIGIVECMAAGLIMIAHKSGGP 411
            + + V+F   +  ED+   +      +       E FG+  +E MA GL ++A   GGP
Sbjct: 280 GVIDRVDFPGRVSREDLPALYRAA--DVFVNPALYEPFGLTALEAMACGLPVVATAVGGP 337

Query: 412 KMDIVIEDPETCRNGFLA--CDEVEYAQTIKLILHLSQDTKTRISQNAVSSV-DRFS 465
           + DIV         G L    D    A  ++ +L      + R+S+  +     R++
Sbjct: 338 R-DIV----VDGVTGLLVDPRDPEALAAALRRLLT-DPALRRRLSRAGLRRARARYT 388


>gnl|CDD|99959 cd01635, Glycosyltransferase_GTB_type, Glycosyltransferases
           catalyze the transfer of sugar moieties from activated
           donor molecules to specific acceptor molecules, forming
           glycosidic bonds. The acceptor molecule can be a lipid,
           a protein, a heterocyclic compound, or another
           carbohydrate residue. The structures of the formed
           glycoconjugates are extremely diverse, reflecting a wide
           range of biological functions. The members of this
           family share a common GTB topology, one of the two
           protein topologies observed for
           nucleotide-sugar-dependent glycosyltransferases. GTB
           proteins have distinct N- and C- terminal domains each
           containing a typical Rossmann fold. The two domains have
           high structural homology despite minimal sequence
           homology. The large cleft that separates the two domains
           includes the catalytic center and permits a high degree
           of flexibility.
          Length = 229

 Score = 59.9 bits (145), Expect = 3e-10
 Identities = 35/136 (25%), Positives = 55/136 (40%), Gaps = 17/136 (12%)

Query: 294 SVAQFRPEKDHPLQLRAMYQLRQIISEELWDNLKLIFIGSTRNEEDEVCVKDMQDLCKHL 353
            V +  PEK     + A   L++        +LKL+  G     E       +++L   L
Sbjct: 109 FVGRLAPEKGLDDLIEAFALLKERGP-----DLKLVIAGDGPEREY------LEELLAAL 157

Query: 354 SLENNVEFKVNLPYEDMKKEFSEGL-IGLHAMWNEHFGIGIVECMAAGLIMIAHKSGGPK 412
            L + V F   L  E++         + +     E FG+ ++E MA GL +IA   GGP 
Sbjct: 158 LLLDRVIFLGGLDPEELLALLLAAADVFVLPSLREGFGLVVLEAMACGLPVIATDVGGPP 217

Query: 413 MDIVIEDPETCRNGFL 428
            +IV +       G L
Sbjct: 218 -EIVEDG----LTGLL 228


>gnl|CDD|99987 cd03817, GT1_UGDG_like, This family is most closely related to the
           GT1 family of glycosyltransferases.
           UDP-glucose-diacylglycerol glucosyltransferase (UGDG;
           also known as 1,2-diacylglycerol 3-glucosyltransferase)
           catalyzes the transfer of glucose from UDP-glucose to
           1,2-diacylglycerol forming
           3-D-glucosyl-1,2-diacylglycerol.
          Length = 374

 Score = 59.2 bits (144), Expect = 1e-09
 Identities = 54/201 (26%), Positives = 97/201 (48%), Gaps = 21/201 (10%)

Query: 270 YPPCDTEDLKKITHSKTDGPVKIISVAQFRPEKDHPLQLRAMYQLRQIISEELWDNLKLI 329
           + P D +D ++      D PV ++ V +   EK+    +RA  +L      +   ++KL+
Sbjct: 184 FEPVDGDDERRKLGIPEDEPV-LLYVGRLAKEKNIDFLIRAFARLL-----KEEPDVKLV 237

Query: 330 FIGSTRNEEDEVCVKDMQDLCKHLSLENNVEFKVNLPYEDMKKEFSEGLIGLHAMWNEHF 389
            +G    E      +++++L + L L + V F   +P E++   +    + + A   E  
Sbjct: 238 IVGDG-PER-----EELEELARELGLADRVIFTGFVPREELPDYYKAADLFVFASTTETQ 291

Query: 390 GIGIVECMAAGLIMIAHKSGGPKMDIVIEDPETCRNGFLACDEVEYAQTIKLILHLSQD- 448
           G+ ++E MAAGL ++A  + G   D+V  D E   NGFL     E     + +L L QD 
Sbjct: 292 GLVLLEAMAAGLPVVAVDAPGLP-DLV-ADGE---NGFLFPPGDEAL--AEALLRLLQDP 344

Query: 449 -TKTRISQNAVSSVDRFSMEE 468
             + R+S+NA  S ++FS  +
Sbjct: 345 ELRRRLSKNAEESAEKFSFAK 365


>gnl|CDD|99990 cd03820, GT1_amsD_like, This family is most closely related to the
           GT1 family of glycosyltransferases. AmSD in Erwinia
           amylovora has been shown to be involved in the
           biosynthesis of amylovoran, the acidic exopolysaccharide
           acting as a virulence factor. This enzyme may be
           responsible for the formation of  galactose alpha-1,6
           linkages in amylovoran.
          Length = 348

 Score = 56.1 bits (136), Expect = 1e-08
 Identities = 70/412 (16%), Positives = 151/412 (36%), Gaps = 88/412 (21%)

Query: 63  NAGGGGERVLWTAVLALHQKYPDYKIYIYTGDVDASPSEIIKRAHQRFNIVLPDQVINFV 122
              GG ERVL     AL +K   +++ I +          + +    F  + P   +  +
Sbjct: 10  GNAGGAERVLSNLANALAEK--GHEVTIIS----------LDKGEPPFYELDPKIKVIDL 57

Query: 123 YLYRRKFVEASLYPYFTLLGQSIGSMILGVEALLSFQPDIYIDTMGYAFTYPLFSYIGGS 182
              R   + A       L            + L + +PD+ I  +    T    + +G  
Sbjct: 58  GDKRDSKLLARFKKLRRLR-----------KLLKNNKPDVVISFLTSLLT--FLASLG-- 102

Query: 183 KVACYIHYPTITKEMLTRVARRVIT-HNNSQRVANNPILTSFKLFYYKVFALLYSHVGKY 241
                            ++ + +++ HN+                      LL   + + 
Sbjct: 103 ----------------LKIVKLIVSEHNSP-----------DAYKKRLRRLLLRRLLYRR 135

Query: 242 SDIIMVNSSWTEEHVIQLWNCQLKTYKLYPPCDTEDLKKITHSKTDGPVKIISVAQFRPE 301
           +D ++V +   E+  +           +  P      ++ +        +I++V +  P+
Sbjct: 136 ADAVVVLTE--EDRALYYKKFNKNVVVIPNPL-PFPPEEPSSDLKS--KRILAVGRLVPQ 190

Query: 302 KDHPLQLRAMYQLRQIISEELWDNLKLIFIGSTRNEEDEVCVKDMQDLCKHLSLENNVEF 361
           K   L + A  ++ +   +  W   KL  +G    E +      ++ L K L LE+ V  
Sbjct: 191 KGFDLLIEAWAKIAKKHPD--W---KLRIVGDG-PEREA-----LEALIKELGLEDRVIL 239

Query: 362 KVNLPYEDMKKEFSEGLIGLHAM---WNEHFGIGIVECMAAGLIMIAHK-SGGPKMDIVI 417
                 +++++ +++    +  +   + E F + ++E MA GL +I+     GP  +I+ 
Sbjct: 240 LG--FTKNIEEYYAKA--SIFVLTSRF-EGFPMVLLEAMAFGLPVISFDCPTGPS-EIIE 293

Query: 418 EDPETCRNGFL--ACDEVEYAQTIKLILHLSQDTKTRISQNAVSSVDRFSME 467
           +      NG L    D    A+ +  ++   ++ + R+  NA  S +RFS+E
Sbjct: 294 DG----VNGLLVPNGDVEALAEALLRLME-DEELRKRMGANARESAERFSIE 340


>gnl|CDD|99981 cd03809, GT1_mtfB_like, This family is most closely related to the
           GT1 family of glycosyltransferases. mtfB
           (mannosyltransferase B) in E. coli has been shown to
           direct the growth of the O9-specific polysaccharide
           chain. It transfers two mannoses into the position 3 of
           the previously synthesized polysaccharide.
          Length = 365

 Score = 50.1 bits (120), Expect = 1e-06
 Identities = 56/323 (17%), Positives = 89/323 (27%), Gaps = 66/323 (20%)

Query: 101 EIIKRAHQRFNIVLPDQVINFVYLYRRKFVEASLYPYFTLLGQSIGSMILGVEALLSFQP 160
            ++K   +   ++LP      +   R         P   L G            LL    
Sbjct: 27  ALLKLDPEEVLLLLPGAPGLLLLPLRAALRLLLRLPRRLLWGLLFLLRAGDRLLLLLLGL 86

Query: 161 DIYIDTMGYAFTYPLFSYIGGSKVACYIH--YPTITKEMLTRVARRVITHNNSQRVANNP 218
           D+       A    L     G  V   IH   P    E                      
Sbjct: 87  DLLHSPHNTAPLLRLR----GVPVVVTIHDLIPLRFPE---------------------- 120

Query: 219 ILTSFKLFYYKVFALLYSHVGKYSDIIMVNSSWTEEHVIQLWNCQL-KTYKLYPPCD--- 274
               F   + + F  L     + +D I+  S  T+  +++       K   +    D   
Sbjct: 121 ---YFSPGFRRYFRRLLRRALRRADAIITVSEATKRDLLRYLGVPPDKIVVIPLGVDPRF 177

Query: 275 ----TEDLKKITHSKTDGPVKIISVAQFRPEKDHPLQLRAMYQLRQIISEELWDNLKLIF 330
                E            P   + V    P K+    L A  +L          + KL+ 
Sbjct: 178 RPPPAEAEVLRALYLLPRPY-FLYVGTIEPRKNLERLLEAFARLP-----AKGPDPKLVI 231

Query: 331 IGSTRNEEDEVCVKDMQDLCKHLSLENNVEFKVNLPYEDMKKEFSEGLIGL--HAM---- 384
           +G      +E+         + L L + V F   +  E+        L  L   A     
Sbjct: 232 VGKRGWLNEEL-----LARLRELGLGDRVRFLGYVSDEE--------LAALYRGARAFVF 278

Query: 385 --WNEHFGIGIVECMAAGLIMIA 405
               E FG+ ++E MA G  +IA
Sbjct: 279 PSLYEGFGLPVLEAMACGTPVIA 301


>gnl|CDD|99980 cd03808, GT1_cap1E_like, This family is most closely related to the
           GT1 family of glycosyltransferases. cap1E in
           Streptococcus pneumoniae is required for the synthesis
           of type 1 capsular polysaccharides.
          Length = 359

 Score = 49.5 bits (119), Expect = 2e-06
 Identities = 49/254 (19%), Positives = 94/254 (37%), Gaps = 41/254 (16%)

Query: 229 KVFALLYSHVGKYSDIIMVNSSWTEEHVIQLWNCQLKTYKLYPPCDTE-------DLKKI 281
           +++ LL     +++D ++  +    E    L         +     T        DL + 
Sbjct: 126 RLYLLLERLALRFTDKVIFQN----EDDRDL----ALKLGIIKKKKTVLIPGSGVDLDRF 177

Query: 282 THSKT---DGPVKIISVAQFRPEKDHPLQLRAMYQLRQIISEELWDNLKLIFIGSTRNEE 338
           + S     +     + VA+   +K     L A   L+         N++L+ +G    +E
Sbjct: 178 SPSPEPIPEDDPVFLFVARLLKDKGIDELLEAARILKAK-----GPNVRLLLVGDG--DE 230

Query: 339 DEVCVKDMQDLCKHLSLENNVEFKVNLPY-EDMKKEFSEGLIGLHAMWNEHFGIGIVECM 397
           +           + L LE  VEF   L + +D+ +  +   + +   + E     ++E M
Sbjct: 231 EN---PAAILEIEKLGLEGRVEF---LGFRDDVPELLAAADVFVLPSYREGLPRVLLEAM 284

Query: 398 AAGLIMIAHKSGGPKMDIVIEDPETCRNGFLAC--DEVEYAQTIKLILHLSQDTKTRISQ 455
           A G  +IA    G + + VI+      NGFL    D    A  I+ ++    + + R+ Q
Sbjct: 285 AMGRPVIATDVPGCR-EAVIDG----VNGFLVPPGDAEALADAIERLIE-DPELRARMGQ 338

Query: 456 NAVSSV-DRFSMEE 468
            A     + F  E 
Sbjct: 339 AARKRAEEEFDEEI 352


>gnl|CDD|99998 cd04949, GT1_gtfA_like, This family is most closely related to the
           GT1 family of glycosyltransferases and is named after
           gtfA in Streptococcus gordonii, where it plays a role in
           the O-linked glycosylation of GspB, a cell surface
           glycoprotein involved in platelet binding.  In general
           glycosyltransferases catalyze the transfer of sugar
           moieties from activated donor molecules to specific
           acceptor molecules, forming glycosidic bonds. The
           acceptor molecule can be a lipid, a protein, a
           heterocyclic compound, or another carbohydrate residue.
           This group of glycosyltransferases is most closely
           related to the previously defined glycosyltransferase
           family 1 (GT1). The members of this family may transfer
           UDP, ADP, GDP, or CMP linked sugars. The diverse
           enzymatic activities among members of this family
           reflect a wide range of biological functions. The
           protein structure available for this family has the GTB
           topology, one of the two protein topologies observed for
           nucleotide-sugar-dependent glycosyltransferases. GTB
           proteins have distinct N- and C- terminal domains each
           containing a typical Rossmann fold. The two domains have
           high structural homology despite minimal sequence
           homology. The large cleft that separates the two domains
           includes the catalytic center and permits a high degree
           of flexibility. The members of this family are found in
           bacteria.
          Length = 372

 Score = 49.1 bits (118), Expect = 2e-06
 Identities = 69/280 (24%), Positives = 128/280 (45%), Gaps = 45/280 (16%)

Query: 198 LTRVARRVIT-HNNSQRVANNPILTSFKLFYYKVFALLYSHVGKYSDIIMVNSSWTEE-- 254
           + + A+ V+  H+N     N+P+ +    FY  VF     ++ K   +I+     TE+  
Sbjct: 118 MKKAAKVVVVLHSNHVSDNNDPVHSLINNFYEYVF----ENLDKVDGVIVA----TEQQK 169

Query: 255 HVIQLWNCQLKTYKLYPPCDT------EDLKKITHSKTDGPVKIISVAQFRPEKDHPLQL 308
             +Q      K +  Y P  T      + LK     K   P KII+VA+  PEK     +
Sbjct: 170 QDLQ------KQFGNYNPIYTIPVGSIDPLKLPAQFKQRKPHKIITVARLAPEKQLDQLI 223

Query: 309 RAMYQLRQIISEELWDNLKLIFIGSTRNEEDEVCVKDMQDLCKHLSLENNVEFKVNLPYE 368
           +A  ++ + + +       L   G   +EE++     +++L + L LE+ V  K      
Sbjct: 224 KAFAKVVKQVPD-----ATLDIYGYG-DEEEK-----LKELIEELGLEDYVFLKGYTR-- 270

Query: 369 DMKKEFSEGLIGLHAMWNEHFGIGIVECMAAGLIMIAHKSG-GPKMDIVIEDPETCRNGF 427
           D+ + + +  + L    +E FG+ ++E ++ GL +I++    GP  +I IED E   NG+
Sbjct: 271 DLDEVYQKAQLSLLTSQSEGFGLSLMEALSHGLPVISYDVNYGP-SEI-IEDGE---NGY 325

Query: 428 LA--CDEVEYAQTIKLILHLSQDTKTRISQNAVSSVDRFS 465
           L    D    A+ I  +L+  +  + + S+ A  + +R+S
Sbjct: 326 LVPKGDIEALAEAIIELLNDPKLLQ-KFSEAAYENAERYS 364


>gnl|CDD|100003 cd05844, GT1_like_7, Glycosyltransferases catalyze the transfer of
           sugar moieties from activated donor molecules to
           specific acceptor molecules, forming glycosidic bonds.
           The acceptor molecule can be a lipid, a protein, a
           heterocyclic compound, or another carbohydrate residue.
           This group of glycosyltransferases is most closely
           related to the previously defined glycosyltransferase
           family 1 (GT1). The members of this family may transfer
           UDP, ADP, GDP, or CMP linked sugars. The diverse
           enzymatic activities among members of this family
           reflect a wide range of biological functions. The
           protein structure available for this family has the GTB
           topology, one of the two protein topologies observed for
           nucleotide-sugar-dependent glycosyltransferases. GTB
           proteins have distinct N- and C- terminal domains each
           containing a typical Rossmann fold. The two domains have
           high structural homology despite minimal sequence
           homology. The large cleft that separates the two domains
           includes the catalytic center and permits a high degree
           of flexibility.
          Length = 367

 Score = 46.1 bits (110), Expect = 3e-05
 Identities = 37/177 (20%), Positives = 67/177 (37%), Gaps = 52/177 (29%)

Query: 289 PVKIISVAQFRPEKDHPLQLRAMYQLRQIISEELWDNLKLIFIGS--TRNEEDEVCVKDM 346
           P +I+ V +F  +K   L L A  +L +         ++L+ IG               +
Sbjct: 188 PPRILFVGRFVEKKGPLLLLEAFARLAR-----RVPEVRLVIIGDGPLLAA--------L 234

Query: 347 QDLCKHLSLENNVEFKVNLPYEDMKKEF-----------------SEGLIGLHAMWNEHF 389
           + L + L L   V F    P+ ++++                   +EGL           
Sbjct: 235 EALARALGLGGRVTFLGAQPHAEVRELMRRARIFLQPSVTAPSGDAEGL----------- 283

Query: 390 GIGIVECMAAGLIMIAHKSGG-PKMDIVIEDPETCRNGFLAC--DEVEYAQTIKLIL 443
            + ++E  A+G+ ++A + GG P+    +ED ET   G L    D    A  +  +L
Sbjct: 284 PVVLLEAQASGVPVVATRHGGIPE---AVEDGET---GLLVPEGDVAALAAALGRLL 334


>gnl|CDD|100000 cd04951, GT1_WbdM_like, This family is most closely related to the
           GT1 family of glycosyltransferases and is named after
           WbdM in Escherichia coli. In general
           glycosyltransferases catalyze the transfer of sugar
           moieties from activated donor molecules to specific
           acceptor molecules, forming glycosidic bonds. The
           acceptor molecule can be a lipid, a protein, a
           heterocyclic compound, or another carbohydrate residue.
           This group of glycosyltransferases is most closely
           related to the previously defined glycosyltransferase
           family 1 (GT1). The members of this family may transfer
           UDP, ADP, GDP, or CMP linked sugars. The diverse
           enzymatic activities among members of this family
           reflect a wide range of biological functions. The
           protein structure available for this family has the GTB
           topology, one of the two protein topologies observed for
           nucleotide-sugar-dependent glycosyltransferases. GTB
           proteins have distinct N- and C- terminal domains each
           containing a typical Rossmann fold. The two domains have
           high structural homology despite minimal sequence
           homology. The large cleft that separates the two domains
           includes the catalytic center and permits a high degree
           of flexibility. The members of this family are found in
           bacteria.
          Length = 360

 Score = 44.2 bits (105), Expect = 9e-05
 Identities = 50/219 (22%), Positives = 89/219 (40%), Gaps = 35/219 (15%)

Query: 263 QLKTYKLYPPCDTEDLKKITHSK---------TDGPVKIISVAQFRPEKDHPLQLRAMYQ 313
             K++ +Y   DT+  +K    +          +    I++V +    KD+P  L+A  +
Sbjct: 153 ANKSFVVYNGIDTDRFRKDPARRLKIRNALGVKNDTFVILAVGRLVEAKDYPNLLKAFAK 212

Query: 314 LRQIISEELWDNLKLIFIGSTRNEEDEVCVKDMQDLCKHLSLENNVEFKVNL-PYEDMKK 372
           L          ++KL+  G             ++ L K L L N V+    L   +D+  
Sbjct: 213 LLSDYL-----DIKLLIAGD--GPLR----ATLERLIKALGLSNRVKL---LGLRDDIAA 258

Query: 373 EFSEG-LIGLHAMWNEHFGIGIVECMAAGLIMIAHKSGGPKMDIVIEDPETCRNGFL-AC 430
            ++   L  L + W E FG+ + E MA  L ++A  +GG +   V+ D     +G +   
Sbjct: 259 YYNAADLFVLSSAW-EGFGLVVAEAMACELPVVATDAGGVRE--VVGD-----SGLIVPI 310

Query: 431 DEVE-YAQTIKLILHLSQDTKTRISQNAVSSVDRFSMEE 468
            + E  A  I  IL +S + +  I       V +FS+  
Sbjct: 311 SDPEALANKIDEILKMSGEERDIIGARRERIVKKFSINS 349


>gnl|CDD|99979 cd03807, GT1_WbnK_like, This family is most closely related to the
           GT1 family of glycosyltransferases. WbnK in Shigella
           dysenteriae has been shown to be involved in the type 7
           O-antigen biosynthesis.
          Length = 365

 Score = 43.0 bits (102), Expect = 2e-04
 Identities = 43/181 (23%), Positives = 71/181 (39%), Gaps = 27/181 (14%)

Query: 292 IISVAQFRPEKDHPLQLRAMYQLRQIISEELWDNLKLIFIGSTRNEEDEVCVKDMQDLCK 351
           I  VA+  P+KDH   LRA   L +        N +L+ +G      D     +++ L  
Sbjct: 196 IGIVARLHPQKDHATLLRAAALLLKKFP-----NARLLLVG------DGPDRANLELLAL 244

Query: 352 H-LSLENNVEFKVNL-PYEDMKKEFSEGL-IGLHAMWNEHFGIGIVECMAAGLIMIAHKS 408
             L LE+ V     L    D+       L + + +  +E F   ++E MA GL ++A   
Sbjct: 245 KELGLEDKVIL---LGERSDV-PALLNALDVFVLSSLSEGFPNVLLEAMACGLPVVATDV 300

Query: 409 GGPKMDIVIEDPETCRNGFL--ACDEVEYAQTIKLILHLSQDTKTRISQNAVSSVDRFSM 466
           G    ++V +       GFL    D    A+ I+ +L      +           + FS+
Sbjct: 301 GDNA-ELVGD------TGFLVPPGDPEALAEAIEALLADPALRQALGEAARERIEENFSI 353

Query: 467 E 467
           E
Sbjct: 354 E 354


>gnl|CDD|99972 cd03799, GT1_amsK_like, This is a family of GT1
           glycosyltransferases found specifically in certain
           bacteria. amsK in Erwinia amylovora, has been reported
           to be involved in the biosynthesis of amylovoran, a
           exopolysaccharide acting as a virulence factor.
          Length = 355

 Score = 42.2 bits (100), Expect = 4e-04
 Identities = 43/226 (19%), Positives = 87/226 (38%), Gaps = 28/226 (12%)

Query: 242 SDIIMVNSSWTEEHVIQLWNCQL-KTYKLYPPCDTEDLKKITHSKTDGPVKIISVAQFRP 300
           +D ++  S +  + +I+L  C   K + ++   D E            P++I+SV +   
Sbjct: 131 ADFVVAISEYNRQQLIRLLGCDPDKIHVVHCGVDLERFPPRPPPPPGEPLRILSVGRLVE 190

Query: 301 EKDHPLQLRAMYQLRQIISEELWDNLKLIFIGSTRNEEDEVCVKDMQDLCKHLSLENNVE 360
           +K     L A+  L+         + +L  +G     +      +++ L   L LE+ V 
Sbjct: 191 KKGLDYLLEALALLKDR-----GIDFRLDIVGDGPLRD------ELEALIAELGLEDRVT 239

Query: 361 FKVNLPYEDMKKEFSEG----LIGLHAMWNEHFGIGIV--ECMAAGLIMIAHKSGG-PKM 413
                  E++++         L  + A   +  G+ +V  E MA GL +I+    G P  
Sbjct: 240 LLGAKSQEEVRELLRAADLFVLPSVTAADGDREGLPVVLMEAMAMGLPVISTDVSGIP-- 297

Query: 414 DIVIEDPETCRNGFLA--CDEVEYAQTIKLILHLSQDTKTRISQNA 457
           ++V +       G L    D    A  I+ +L   +  +  + +  
Sbjct: 298 ELVEDG----ETGLLVPPGDPEALADAIERLLDDPELRR-EMGEAG 338


>gnl|CDD|234438 TIGR03999, thiol_BshA, N-acetyl-alpha-D-glucosaminyl L-malate
           synthase BshA.  Members of this protein family are BshA,
           a glycosyltransferase required for bacillithiol
           biosynthesis. This enzyme combines UDP-GlcNAc and
           L-malate to form N-acetyl-alpha-D-glucosaminyl L-malate
           synthase. Bacillithiol is a low-molecular-weight thiol,
           an analog of glutathione and mycothiol, and is found
           largely in the Firmicutes [Biosynthesis of cofactors,
           prosthetic groups, and carriers, Glutathione and
           analogs].
          Length = 374

 Score = 41.1 bits (97), Expect = 0.001
 Identities = 53/212 (25%), Positives = 86/212 (40%), Gaps = 46/212 (21%)

Query: 270 YPPCDTEDLKKITHSKTDGPVKIISVAQFRPEKDHPLQLRAMYQLRQIISEELWDNLKLI 329
           Y   +   LK+   +  D  V +I ++ FRP K     +    +++Q +        KL+
Sbjct: 181 YRRKNDPALKRKLGAPEDEKV-LIHISNFRPVKRVEDVIEVFARVQQEVPA------KLL 233

Query: 330 FIGSTRNEEDEVCVKDMQDLCKHLSLENNVEF--KVN-----LPYEDMKKEFSEGLIGLH 382
            +G    +  E      + L + L L + V F  K +     L   D+    SE      
Sbjct: 234 LVG----DGPERS--PAEQLVRELGLTDRVLFLGKQDDVAELLSISDLFLLPSE------ 281

Query: 383 AMWNEHFGIGIVECMAAGLIMIAHKSGG-PKMDIVIEDPETCRNGFLA----CDEV-EYA 436
               E FG+  +E MA G+ +IA  +GG P+   V+E   T   GFL      + + EYA
Sbjct: 282 ---KESFGLAALEAMACGVPVIASNAGGIPE---VVEHGVT---GFLCDVGDVETMAEYA 332

Query: 437 QTIKLILHLSQDTKTRISQNAVSSV-DRFSME 467
            +   +L   ++   R S  A     +RF  E
Sbjct: 333 IS---LLE-DEELLQRFSAAARERAKERFDSE 360


>gnl|CDD|99968 cd03794, GT1_wbuB_like, This family is most closely related to the
           GT1 family of glycosyltransferases. wbuB in E. coli is
           involved in the biosynthesis of the O26 O-antigen.  It
           has been proposed to function as an
           N-acetyl-L-fucosamine (L-FucNAc) transferase.
          Length = 394

 Score = 40.7 bits (96), Expect = 0.001
 Identities = 34/169 (20%), Positives = 68/169 (40%), Gaps = 30/169 (17%)

Query: 308 LRAMYQLRQIISEELWDNLKLIFIGSTRNEEDEVCVKDMQDLCKHLSLENNVEFKVNLPY 367
           L A   L+         +++ + +G      D    +++++L K L L +NV F   +P 
Sbjct: 239 LEAAALLKDR------PDIRFLIVG------DGPEKEELKELAKALGL-DNVTFLGRVPK 285

Query: 368 EDMKKEFSEGLIGLHAMWNEHFGIGIV-----ECMAAGLIMIAHKSGGPKMDIVIEDPET 422
           E++ +  +   +GL  +       G+      E MAAG  ++A  S   +   ++E+   
Sbjct: 286 EELPELLAAADVGLVPLKPGPAFEGVSPSKLFEYMAAGKPVLA--SVDGESAELVEEAGA 343

Query: 423 CRNGFL--ACDEVEYAQTIKLILHLSQDTKTRISQNAVSSVDRFSMEEF 469
              G +    D    A  I  +L    + +  + +N      R+  E+F
Sbjct: 344 ---GLVVPPGDPEALAAAILELL-DDPEERAEMGENGR----RYVEEKF 384


>gnl|CDD|131525 TIGR02472, sucr_P_syn_N, sucrose-phosphate synthase, putative,
           glycosyltransferase domain.  This family consists of the
           N-terminal regions, or in some cases the entirety, of
           bacterial proteins closely related to plant
           sucrose-phosphate synthases (SPS). The C-terminal domain
           (TIGR02471), found with most members of this family,
           resembles both bona fide plant sucrose-phosphate
           phosphatases (SPP) and the SPP-like domain of plant SPS.
           At least two members of this family lack the SPP-like
           domain, which may have binding or regulatory rather than
           enzymatic activity by analogy to plant SPS. This enzyme
           produces sucrose 6-phosphate and UDP from UDP-glucose
           and D-fructose 6-phosphate, and may be encoded near the
           gene for fructokinase.
          Length = 439

 Score = 38.2 bits (89), Expect = 0.009
 Identities = 27/80 (33%), Positives = 42/80 (52%), Gaps = 10/80 (12%)

Query: 387 EHFGIGIVECMAAGLIMIAHKSGGPKMDIVIEDPETCRNGFLACDEVEYAQTIKLILH-- 444
           E FG+ ++E  A GL ++A   GGP+ DI+      CRNG L   +V   + I   L   
Sbjct: 351 EPFGLTLLEAAACGLPIVATDDGGPR-DII----ANCRNGLLV--DVLDLEAIASALEDA 403

Query: 445 LSQDTKTRI-SQNAVSSVDR 463
           LS  ++ ++ S+N +  V R
Sbjct: 404 LSDSSQWQLWSRNGIEGVRR 423


>gnl|CDD|99985 cd03814, GT1_like_2, This family is most closely related to the GT1
           family of glycosyltransferases. Glycosyltransferases
           catalyze the transfer of sugar moieties from activated
           donor molecules to specific acceptor molecules, forming
           glycosidic bonds. The acceptor molecule can be a lipid,
           a protein, a heterocyclic compound, or another
           carbohydrate residue. This group of glycosyltransferases
           is most closely related to the previously defined
           glycosyltransferase family 1 (GT1). The members of this
           family may transfer UDP, ADP, GDP, or CMP linked sugars.
           The diverse enzymatic activities among members of this
           family reflect a wide range of biological functions. The
           protein structure available for this family has the GTB
           topology, one of the two protein topologies observed for
           nucleotide-sugar-dependent glycosyltransferases. GTB
           proteins have distinct N- and C- terminal domains each
           containing a typical Rossmann fold. The two domains have
           high structural homology despite minimal sequence
           homology. The large cleft that separates the two domains
           includes the catalytic center and permits a high degree
           of flexibility. The members of this family are found
           mainly in bacteria and eukaryotes.
          Length = 364

 Score = 38.0 bits (89), Expect = 0.010
 Identities = 27/90 (30%), Positives = 46/90 (51%), Gaps = 8/90 (8%)

Query: 387 EHFGIGIVECMAAGLIMIAHKSGGPKMDIVIEDPETCRNGFLA--CDEVEYAQTIKLILH 444
           E FG+ ++E MA+GL ++A  +GGP    ++ D E   NG L    D   +A  +  +L 
Sbjct: 277 ETFGLVVLEAMASGLPVVAPDAGGPAD--IVTDGE---NGLLVEPGDAEAFAAALAALLA 331

Query: 445 LSQDTKTRISQNAVSSVDRFSMEEFKNGFL 474
             +  + R++  A +  +R S E F +  L
Sbjct: 332 DPELRR-RMAARARAEAERRSWEAFLDNLL 360


>gnl|CDD|100002 cd04962, GT1_like_5, This family is most closely related to the GT1
           family of glycosyltransferases. Glycosyltransferases
           catalyze the transfer of sugar moieties from activated
           donor molecules to specific acceptor molecules, forming
           glycosidic bonds. The acceptor molecule can be a lipid,
           a protein, a heterocyclic compound, or another
           carbohydrate residue. This group of glycosyltransferases
           is most closely related to the previously defined
           glycosyltransferase family 1 (GT1). The members of this
           family may transfer UDP, ADP, GDP, or CMP linked sugars.
           The diverse enzymatic activities among members of this
           family reflect a wide range of biological functions. The
           protein structure available for this family has the GTB
           topology, one of the two protein topologies observed for
           nucleotide-sugar-dependent glycosyltransferases. GTB
           proteins have distinct N- and C- terminal domains each
           containing a typical Rossmann fold. The two domains have
           high structural homology despite minimal sequence
           homology. The large cleft that separates the two domains
           includes the catalytic center and permits a high degree
           of flexibility. The members of this family are found
           mainly in bacteria, while some of them are also found in
           Archaea and eukaryotes.
          Length = 371

 Score = 36.4 bits (85), Expect = 0.026
 Identities = 50/208 (24%), Positives = 85/208 (40%), Gaps = 38/208 (18%)

Query: 270 YPPCDTEDLKKITHSKTDGPVKIISVAQFRPEKDHPLQLRAMYQLRQIISEELWDNLKLI 329
           + P   E LK+   +     V +I ++ FRP K     +R   ++R+ +        +L+
Sbjct: 179 FRPKPDEALKRRLGAPEGEKV-LIHISNFRPVKRIDDVIRIFAKVRKEV------PARLL 231

Query: 330 FIGSTRNEEDEVCVKDMQDLCKHLSLENNVEF--KVN-----LPYEDMKKEFSEGLIGLH 382
            +G              + L + L L+++V F  K +     L   D+    SE      
Sbjct: 232 LVGDGPERSP------AERLARELGLQDDVLFLGKQDHVEELLSIADLFLLPSE------ 279

Query: 383 AMWNEHFGIGIVECMAAGLIMIAHKSGG-PKMDIVIEDPETCRNGFLA-CDEVEYAQTIK 440
               E FG+  +E MA G+ ++A  +GG P+   V++  ET   GFL    +VE      
Sbjct: 280 ---KESFGLAALEAMACGVPVVASNAGGIPE---VVKHGET---GFLVDVGDVEAMAEYA 330

Query: 441 LILHLSQDTKTRISQNAVSSV-DRFSME 467
           L L    +     S+ A +   +RF  E
Sbjct: 331 LSLLEDDELWQEFSRAARNRAAERFDSE 358


>gnl|CDD|99975 cd03802, GT1_AviGT4_like, This family is most closely related to
           the GT1 family of glycosyltransferases. aviGT4 in
           Streptomyces viridochromogenes has been shown to be
           involved in biosynthesis of oligosaccharide antibiotic
           avilamycin A. Inactivation of aviGT4 resulted in a
           mutant that accumulated a novel avilamycin derivative
           lacking the terminal eurekanate residue.
          Length = 335

 Score = 34.9 bits (81), Expect = 0.087
 Identities = 23/84 (27%), Positives = 35/84 (41%), Gaps = 17/84 (20%)

Query: 385 WNEHFGIGIVECMAAGLIMIAHKSGG-PKMDIVIEDPETCRNGFLACDEVEYAQTIKLIL 443
           W E FG+ ++E MA G  +IA + G  P  ++V +       GFL     E A  +    
Sbjct: 253 WEEPFGLVMIEAMACGTPVIAFRRGAVP--EVVEDG----VTGFLVDSVEELAAAVARAD 306

Query: 444 HLSQDTKTRISQNAV--SSVDRFS 465
                   R+ + A    +  RFS
Sbjct: 307 --------RLDRAACRRRAERRFS 322


>gnl|CDD|233879 TIGR02468, sucrsPsyn_pln, sucrose phosphate synthase/possible
           sucrose phosphate phosphatase, plant.  Members of this
           family are sucrose-phosphate synthases of plants. This
           enzyme is known to exist in multigene families in
           several species of both monocots and dicots. The
           N-terminal domain is the glucosyltransferase domain.
           Members of this family also have a variable linker
           region and a C-terminal domain that resembles sucrose
           phosphate phosphatase (SPP) (EC 3.1.3.24) (see
           TIGR01485), the next and final enzyme of sucrose
           biosynthesis. The SPP-like domain likely serves a
           binding and not a catalytic function, as the reported
           SPP is always encoded by a distinct protein.
          Length = 1050

 Score = 34.0 bits (78), Expect = 0.19
 Identities = 15/29 (51%), Positives = 21/29 (72%), Gaps = 1/29 (3%)

Query: 387 EHFGIGIVECMAAGLIMIAHKSGGPKMDI 415
           E FG+ ++E  A GL M+A K+GGP +DI
Sbjct: 582 EPFGLTLIEAAAHGLPMVATKNGGP-VDI 609


>gnl|CDD|222322 pfam13692, Glyco_trans_1_4, Glycosyl transferases group 1. 
          Length = 134

 Score = 31.9 bits (73), Expect = 0.28
 Identities = 21/92 (22%), Positives = 39/92 (42%), Gaps = 10/92 (10%)

Query: 355 LENNVEFKVNLPY-EDMKKEFSEGLIGLHAM-WNEHFGIGIVECMAAGLIMIAHKSGGPK 412
           L  NV F   L + ED+    +   + L  + +     + ++E +AAGL ++A   G   
Sbjct: 50  LAPNVHF---LGFVEDLAALLASADVALAPLRFGAGSPLKLLEALAAGLPVVATDIGAEG 106

Query: 413 MDIVIEDPETCRNGFLACDEVEYAQTIKLILH 444
           +   +          +A D  E+A+ I  +L 
Sbjct: 107 LPEDLGWG-----VLVADDPEEFAEAIVRLLA 133


>gnl|CDD|215469 PLN02871, PLN02871, UDP-sulfoquinovose:DAG
           sulfoquinovosyltransferase.
          Length = 465

 Score = 33.1 bits (76), Expect = 0.37
 Identities = 47/238 (19%), Positives = 104/238 (43%), Gaps = 36/238 (15%)

Query: 241 YSDIIMVNSS----------WTEEHVIQLWNCQLKTYKLYPP-CDTEDLKKITHSKTDGP 289
            +D+ +V S            T  + I++WN  + +   +P     E   +++  + + P
Sbjct: 205 AADLTLVTSPALGKELEAAGVTAANRIRVWNKGVDSESFHPRFRSEEMRARLSGGEPEKP 264

Query: 290 VKIISVAQFRPEKDHPLQLRAMYQLRQIISEELWDNLKLIFIGSTRNEEDEVCVKDMQDL 349
           + I+ V +   EK+       +  L++++  E     +L F+G     E+         L
Sbjct: 265 L-IVYVGRLGAEKN-------LDFLKRVM--ERLPGARLAFVGDGPYREE---------L 305

Query: 350 CKHLSLENNVEFKVNLPYEDMKKEFSEGLIGLHAMWNEHFGIGIVECMAAGLIMIAHKSG 409
            K  +    V F   L  +++ + ++ G + +    +E  G  ++E MA+G+ ++A ++G
Sbjct: 306 EKMFAGTPTV-FTGMLQGDELSQAYASGDVFVMPSESETLGFVVLEAMASGVPVVAARAG 364

Query: 410 GPKMDIVIEDPETCRNGFLA--CDEVEYAQTIKLILHLSQDTKTRISQNAVSSVDRFS 465
           G   DI+  D E  + GFL    D  +  + ++ +L    + + R+   A   V+++ 
Sbjct: 365 GIP-DIIPPDQEG-KTGFLYTPGDVDDCVEKLETLLA-DPELRERMGAAAREEVEKWD 419


>gnl|CDD|238929 cd01967, Nitrogenase_MoFe_alpha_like, Nitrogenase_MoFe_alpha_like:
           Nitrogenase MoFe protein, alpha subunit_like. The
           nitrogenase enzyme catalyzes the ATP-dependent reduction
           of dinitrogen to ammonia.  Three genetically distinct
           types of nitrogenase systems are known to exist: a
           molybdenum-dependent  nitrogenase (Mo-nitrogenase), a
           vanadium dependent nitrogenase (V-nitrogenase), and an
           iron-only nitrogenase (Fe-nitrogenase). These
           nitrogenase systems consist of component 1 (MoFe
           protein, VFe protein or, FeFe protein respectively) and,
           component 2 (Fe protein). This group contains the alpha
           subunit of component 1 of all three different forms. The
           most widespread and best characterized of these systems
           is the Mo-nitrogenase. MoFe is an alpha2beta2 tetramer,
           the alternative nitrogenases are alpha2beta2delta2
           hexamers having  alpha and beta subunits similar to the
           alpha and beta subunits of MoFe.  The role of the delta
           subunit is unknown. For MoFe, each alphabeta pair of
           subunits contains one P-cluster (located at the
           alphabeta interface) and, one molecule of iron
           molybdenum cofactor (FeMoco) contained within the alpha
           subunit. The Fe protein is a homodimer which contains, a
           single [4Fe-4S] cluster from which electrons are
           transferred  to the P-cluster of the MoFe and in turn,
           to FeMoCo the site of substrate reduction. The
           V-nitrogenase requires an iron-vanadium cofactor
           (FeVco), the iron only-nitrogenase an iron only cofactor
           (FeFeco). These cofactors are analogous to the FeMoco.
           The V-nitrogenase has P clusters identical to those of
           MoFe. In addition to N2, nitrogenase also catalyzes the
           reduction of a variety of other substrates such as
           acetylene  The V-nitrogenase differs from the Mo-
           nitrogenase in that it produces free hydrazine, as a
           minor product during  dinitrogen reduction and, ethane
           as a minor product during acetylene reduction.
          Length = 406

 Score = 33.0 bits (76), Expect = 0.37
 Identities = 15/58 (25%), Positives = 29/58 (50%), Gaps = 13/58 (22%)

Query: 67  GGERVLWTAVLALHQKYPDYKIYIYTG--------DVDASPSEIIKRAHQRFNI-VLP 115
           GGE+ L  A+   ++++P   I++Y+         D++A    + K A +   I V+P
Sbjct: 70  GGEKKLKKAIKEAYERFPPKAIFVYSTCPTGLIGDDIEA----VAKEASKELGIPVIP 123


>gnl|CDD|233881 TIGR02470, sucr_synth, sucrose synthase.  This model represents
           sucrose synthase, an enzyme that, despite its name,
           generally uses rather produces sucrose. Sucrose plus UDP
           (or ADP) becomes D-fructose plus UDP-glucose (or
           ADP-glucose), which is then available for cell wall (or
           starch) biosynthesis. The enzyme is homologous to
           sucrose phosphate synthase, which catalyzes the
           penultimate step in sucrose synthesis. Sucrose synthase
           is found, so far, exclusively in plants and
           cyanobacteria [Energy metabolism, Biosynthesis and
           degradation of polysaccharides].
          Length = 784

 Score = 32.7 bits (75), Expect = 0.52
 Identities = 20/81 (24%), Positives = 34/81 (41%), Gaps = 12/81 (14%)

Query: 387 EHFGIGIVECMAAGLIMIAHKSGGPKMDIVIEDPETCRNGFLACDEVEYAQTIKLILHLS 446
           E FG+ ++E M  GL   A + GGP    +I+D     +GF   D     +  + I+   
Sbjct: 654 EAFGLTVLEAMTCGLPTFATRFGGPLE--IIQDG---VSGFHI-DPYHGEEAAEKIVDFF 707

Query: 447 QDTKT------RISQNAVSSV 461
           +          +ISQ  +  +
Sbjct: 708 EKCDEDPSYWQKISQGGLQRI 728


>gnl|CDD|132132 TIGR03088, stp2, sugar transferase, PEP-CTERM/EpsH1 system
           associated.  Members of this family include a match to
           the pfam00534 Glycosyl transferases group 1 domain.
           Nearly all are found in species that encode the
           PEP-CTERM/exosortase system predicted to act in protein
           sorting in a number of Gram-negative bacteria. In
           particular, these transferases are found proximal to a
           particular variant of exosortase, EpsH1, which appears
           to travel with a conserved group of genes summarized by
           Genome Property GenProp0652. The nature of the sugar
           transferase reaction catalyzed by members of this clade
           is unknown and may conceivably be variable with respect
           to substrate by species, but we hypothesize a conserved
           substrate.
          Length = 374

 Score = 31.6 bits (72), Expect = 0.90
 Identities = 13/46 (28%), Positives = 24/46 (52%), Gaps = 1/46 (2%)

Query: 287 DGPVKIISVAQFRPEKDHPLQLRAMYQLRQIISEELWDNLKLIFIG 332
           D  V + +V + +  KD P  +RA   L + +  E  + L+L+ +G
Sbjct: 192 DESVVVGTVGRLQAVKDQPTLVRAFALLVRQLP-EGAERLRLVIVG 236


>gnl|CDD|99984 cd03813, GT1_like_3, This family is most closely related to the GT1
           family of glycosyltransferases. Glycosyltransferases
           catalyze the transfer of sugar moieties from activated
           donor molecules to specific acceptor molecules, forming
           glycosidic bonds. The acceptor molecule can be a lipid,
           a protein, a heterocyclic compound, or another
           carbohydrate residue. This group of glycosyltransferases
           is most closely related to the previously defined
           glycosyltransferase family 1 (GT1). The members of this
           family may transfer UDP, ADP, GDP, or CMP linked sugars.
           The diverse enzymatic activities among members of this
           family reflect a wide range of biological functions. The
           protein structure available for this family has the GTB
           topology, one of the two protein topologies observed for
           nucleotide-sugar-dependent glycosyltransferases. GTB
           proteins have distinct N- and C- terminal domains each
           containing a typical Rossmann fold. The two domains have
           high structural homology despite minimal sequence
           homology. The large cleft that separates the two domains
           includes the catalytic center and permits a high degree
           of flexibility. The members of this family are found
           mainly in bacteria, while some of them are also found in
           Archaea and eukaryotes.
          Length = 475

 Score = 31.8 bits (73), Expect = 0.93
 Identities = 26/104 (25%), Positives = 46/104 (44%), Gaps = 30/104 (28%)

Query: 308 LRAMYQLRQIISEELWDNLKLIFIGSTRNEEDEVCVKDMQDLCKHLSLENNVEF--KVNL 365
           +RA   +R+ I      + +   IG T  +ED    ++ ++L + L LE+NV+F    N+
Sbjct: 312 IRAAAIVRKKI-----PDAEGWVIGPT--DEDPEYAEECRELVESLGLEDNVKFTGFQNV 364

Query: 366 PYEDMKKEF--------SEGLIGLHAMWNEHFGIGIVECMAAGL 401
             ++   +         SEG             + I+E MAAG+
Sbjct: 365 --KEYLPKLDVLVLTSISEG-----------QPLVILEAMAAGI 395


>gnl|CDD|132490 TIGR03449, mycothiol_MshA, D-inositol-3-phosphate
           glycosyltransferase.  Members of this protein family,
           found exclusively in the Actinobacteria, are MshA, the
           glycosyltransferase of mycothiol biosynthesis. Mycothiol
           replaces glutathione in these species.
          Length = 405

 Score = 30.9 bits (70), Expect = 1.5
 Identities = 30/124 (24%), Positives = 53/124 (42%), Gaps = 8/124 (6%)

Query: 346 MQDLCKHLSLENNVEFKVNLPYEDMKKEFSEGLIGLHAMWNEHFGIGIVECMAAGLIMIA 405
           + +L   L + + V F    P E++   +    +     +NE FG+  +E  A G  ++A
Sbjct: 272 LIELAAELGIADRVRFLPPRPPEELVHVYRAADVVAVPSYNESFGLVAMEAQACGTPVVA 331

Query: 406 HKSGGPKMDIVIEDPETCRNGFL--ACDEVEYAQTIKLILHLSQDTKTRISQNAVSSVDR 463
            + GG  + + + D ET   G L    D  ++A  +  +L     T+ R+   AV     
Sbjct: 332 ARVGG--LPVAVADGET---GLLVDGHDPADWADALARLL-DDPRTRIRMGAAAVEHAAG 385

Query: 464 FSME 467
           FS  
Sbjct: 386 FSWA 389


>gnl|CDD|215118 PLN02177, PLN02177, glycerol-3-phosphate acyltransferase.
          Length = 497

 Score = 31.0 bits (70), Expect = 1.6
 Identities = 24/95 (25%), Positives = 40/95 (42%), Gaps = 15/95 (15%)

Query: 130 VEASLYPYFTLLGQSIGSMILGVEALLSFQPDIYIDTMGYAFTYPLFSYIGGSKVACYIH 189
           +  S +PY+ L+    GS++  +  LLS  P +Y       FTY   S     K   +I 
Sbjct: 35  ISRSAFPYYLLVALEAGSLLRALILLLSV-PFVY-------FTYLFISESLAIKTFVFIA 86

Query: 190 YPTITKEMLTRVARRVI-------THNNSQRVANN 217
           +  +    +  V+R V+        H  + RV N+
Sbjct: 87  FAGLKIRDIELVSRSVLPKFYAEDVHPETWRVFNS 121


>gnl|CDD|99970 cd03796, GT1_PIG-A_like, This family is most closely related to the
           GT1 family of glycosyltransferases. Phosphatidylinositol
           glycan-class A (PIG-A), an X-linked gene in humans, is
           necessary for the synthesis of
           N-acetylglucosaminyl-phosphatidylinositol, a very early
           intermediate in glycosyl phosphatidylinositol
           (GPI)-anchor biosynthesis. The GPI-anchor is an
           important cellular structure that facilitates the
           attachment of many proteins to cell surfaces. Somatic
           mutations in PIG-A have been associated with Paroxysmal
           Nocturnal Hemoglobinuria (PNH), an acquired
           hematological disorder.
          Length = 398

 Score = 31.0 bits (71), Expect = 1.6
 Identities = 18/56 (32%), Positives = 32/56 (57%)

Query: 355 LENNVEFKVNLPYEDMKKEFSEGLIGLHAMWNEHFGIGIVECMAAGLIMIAHKSGG 410
           L++ VE    +P+E ++    +G I L+    E F I IVE  + GL++++ + GG
Sbjct: 248 LQDRVELLGAVPHERVRDVLVQGHIFLNTSLTEAFCIAIVEAASCGLLVVSTRVGG 303


>gnl|CDD|216655 pfam01708, Gemini_mov, Geminivirus putative movement protein.
          This family consists of putative movement proteins from
          Maize streak and wheat dwarf virus.
          Length = 92

 Score = 28.9 bits (65), Expect = 1.7
 Identities = 12/43 (27%), Positives = 20/43 (46%), Gaps = 3/43 (6%)

Query: 6  WKSVVMWLVFLFYSILALLLLSIIVLPLSVLLFKYYVSKKRKS 48
          W  V    +F   ++  L LL   VL   +L+ K   +K+ +S
Sbjct: 34 WSRVGEIAIFTLVAVGVLYLLYTWVLKDLILVLK---AKRGRS 73


>gnl|CDD|99989 cd03819, GT1_WavL_like, This family is most closely related to the
           GT1 family of glycosyltransferases. WavL in Vibrio
           cholerae has been shown to be involved in the
           biosynthesis of the lipopolysaccharide core.
          Length = 355

 Score = 29.9 bits (68), Expect = 3.0
 Identities = 46/248 (18%), Positives = 87/248 (35%), Gaps = 34/248 (13%)

Query: 213 RVANNPILTSFKLFYYKVFALLYSHVGKYSDIIMVNSSWTEEHVIQLWNC---QLKT--- 266
           R    P +T+   FY       Y+ +    D ++  S++  +H+ + +     +++    
Sbjct: 98  RRTRPPFVTTVHGFYS--VNFRYNAIMARGDRVIAVSNFIADHIRENYGVDPDRIRVIPR 155

Query: 267 ---YKLYPPC---DTEDLKKITHSKTDGPVKIIS-VAQFRPEKDHPLQLRAMYQLRQIIS 319
                 + P        L             +I    +    K   + + A+ +L+    
Sbjct: 156 GVDLDRFDPGAVPPERILALAREWPLPKGKPVILLPGRLTRWKGQEVFIEALARLK---- 211

Query: 320 EELWDNLKLIFIGSTRNEEDEVCVKDMQDLCKHLSLENNVEFKVNLPY-EDMKKEFSEGL 378
            +   ++ L+ +G            ++ +L K L L++ V F   + +  DM   ++   
Sbjct: 212 -KDDPDVHLLIVGDA-QGRRFY-YAELLELIKRLGLQDRVTF---VGHCSDMPAAYALAD 265

Query: 379 IGLHA-MWNEHFGIGIVECMAAGLIMIAHKSGGPKMDIVIEDPETCRNGFL--ACDEVEY 435
           I + A    E FG   VE  A G  +IA   GG   + V         G L    D    
Sbjct: 266 IVVSASTEPEAFGRTAVEAQAMGRPVIASDHGGA-RETVRPGET----GLLVPPGDAEAL 320

Query: 436 AQTIKLIL 443
           AQ +  IL
Sbjct: 321 AQALDQIL 328


>gnl|CDD|178213 PLN02603, PLN02603, asparaginyl-tRNA synthetase.
          Length = 565

 Score = 29.9 bits (67), Expect = 3.6
 Identities = 15/46 (32%), Positives = 20/46 (43%)

Query: 40  YYVSKKRKSYNVLKTVAFFHPYCNAGGGGERVLWTAVLALHQKYPD 85
           Y + KKR S   L+T A   P  N  G   RV      A H+ + +
Sbjct: 197 YPIQKKRVSREFLRTKAHLRPRTNTFGAVARVRNALAYATHKFFQE 242


>gnl|CDD|236051 PRK07565, PRK07565, dihydroorotate dehydrogenase 2; Reviewed.
          Length = 334

 Score = 29.4 bits (67), Expect = 4.5
 Identities = 14/47 (29%), Positives = 23/47 (48%), Gaps = 13/47 (27%)

Query: 130 VEASLYPYFTLLGQSIGSMI-----LGVEALLSF----QPDIYIDTM 167
           V   L PYF+    ++ +M       G + L+ F    QPDI ++T+
Sbjct: 168 VAVKLSPYFS----NLANMAKRLDAAGADGLVLFNRFYQPDIDLETL 210


>gnl|CDD|215073 PLN00142, PLN00142, sucrose synthase.
          Length = 815

 Score = 29.6 bits (67), Expect = 4.7
 Identities = 14/30 (46%), Positives = 16/30 (53%)

Query: 387 EHFGIGIVECMAAGLIMIAHKSGGPKMDIV 416
           E FG+ +VE M  GL   A   GGP   IV
Sbjct: 677 EAFGLTVVEAMTCGLPTFATCQGGPAEIIV 706


>gnl|CDD|223563 COG0489, Mrp, ATPases involved in chromosome partitioning [Cell
           division and chromosome partitioning].
          Length = 265

 Score = 28.9 bits (65), Expect = 4.8
 Identities = 9/58 (15%), Positives = 21/58 (36%)

Query: 274 DTEDLKKITHSKTDGPVKIISVAQFRPEKDHPLQLRAMYQLRQIISEELWDNLKLIFI 331
           +    + +        +K++S+    P    P  L     + Q++ + LW     + I
Sbjct: 115 ELLAGEALEPVIQHDGIKVLSILPLGPVPVIPRGLLGSKAMLQLLEDVLWGEYDYVII 172


>gnl|CDD|234093 TIGR03025, EPS_sugtrans, exopolysaccharide biosynthesis polyprenyl
           glycosylphosphotransferase.  Members of this family are
           generally found near other genes involved in the
           biosynthesis of a variety of exopolysaccharides. These
           proteins consist of two fused domains, an N-terminal
           hydrophobic domain of generally low conservation and a
           highly conserved C-terminal sugar transferase domain
           (pfam02397). Characterized and partially characterized
           members of this subfamily include Salmonella WbaP
           (originally RfbP) , E. coli WcaJ , Methylobacillus EpsB,
           Xanthomonas GumD, Vibrio CpsA, Erwinia AmsG, Group B
           Streptococcus CpsE (originally CpsD), and Streptococcus
           suis Cps2E. Each of these is believed to act in
           transferring the sugar from, for instance, UDP-glucose
           or UDP-galactose, to a lipid carrier such as
           undecaprenyl phosphate as the first (priming) step in
           the synthesis of an oligosaccharide "block". This
           function is encoded in the C-terminal domain. The
           liposaccharide is believed to be subsequently
           transferred through a "flippase" function from the
           cytoplasmic to the periplasmic face of the inner
           membrane by the N-terminal domain. Certain closely
           related transferase enzymes such as Sinorhizobium ExoY
           and Lactococcus EpsD lack the N-terminal domain and are
           not found by this model.
          Length = 445

 Score = 29.5 bits (67), Expect = 5.0
 Identities = 27/100 (27%), Positives = 42/100 (42%), Gaps = 17/100 (17%)

Query: 9   VVMWLVFLF----YSILALLL---LSIIVLPLSVLLFKYYVSKKRKSYNVLKTVAFFHPY 61
           +++ L FL     +S L LLL   L++++L L  LL +  + + RK    L+ V      
Sbjct: 75  LLLALAFLLKGFDFSRLVLLLWFVLALVLLLLWRLLLRRLLRRLRKRGKNLRRVLIV--- 131

Query: 62  CNAGGGGERVLWTAVLALHQKYPDYKIYIYTGDVDASPSE 101
               G GE     A          Y++    G VD  PS+
Sbjct: 132 ----GTGEAAEELAKALSRNPALGYRV---VGFVDDRPSD 164


>gnl|CDD|240090 cd04739, DHOD_like, Dihydroorotate dehydrogenase (DHOD) like
           proteins.  DHOD catalyzes the oxidation of
           (S)-dihydroorotate to orotate. This is the fourth step
           and the only redox reaction in the de novo biosynthesis
           of UMP, the precursor of all pyrimidine nucleotides.
           DHOD requires FMN as co-factor. DHOD divides into class
           1 and class 2 based on their amino acid sequences and
           cellular location. Members of class 1 are cytosolic
           enzymes and multimers while class 2 enzymes are membrane
           associated and monomeric. The class 1 enzymes can be
           further divided into subtypes 1A and 1B which are
           homodimers and heterotetrameric proteins, respectively. 
           This subgroup has the conserved FMN binding site, but
           lacks some catalytic residues and may therefore be
           inactive.
          Length = 325

 Score = 29.1 bits (66), Expect = 5.0
 Identities = 13/47 (27%), Positives = 22/47 (46%), Gaps = 13/47 (27%)

Query: 130 VEASLYPYFTLLGQSIGSMI-----LGVEALLSF----QPDIYIDTM 167
           V   L P+F+    ++  M       G + L+ F    QPDI ++T+
Sbjct: 166 VAVKLSPFFS----ALAHMAKQLDAAGADGLVLFNRFYQPDIDLETL 208


>gnl|CDD|222495 pfam14012, DUF4229, Protein of unknown function (DUF4229).  This
          family of integral membrane proteins is functionally
          uncharacterized. This family of proteins is found in
          bacteria. Proteins in this family are typically between
          95 and 122 amino acids in length.
          Length = 65

 Score = 26.8 bits (60), Expect = 5.1
 Identities = 13/36 (36%), Positives = 21/36 (58%), Gaps = 5/36 (13%)

Query: 9  VVMWLVFLFYSILALLLLS-----IIVLPLSVLLFK 39
          VV+  V L   +L  +L++     ++ LPLS LLF+
Sbjct: 13 VVLTAVILLLGVLLGVLVAALLALVVALPLSYLLFR 48


>gnl|CDD|220281 pfam09531, Ndc1_Nup, Nucleoporin protein Ndc1-Nup.  Ndc1 is a
          nucleoporin protein that is a component of the Nuclear
          Pore Complex, and, in fungi, also of the Spindle Pole
          Body. It consists of six transmembrane segments, three
          lumenal loops, both concentrated at the N-terminus and
          cytoplasmic domains largely at the C-terminus, all of
          which are well conserved.
          Length = 557

 Score = 29.2 bits (66), Expect = 5.2
 Identities = 10/51 (19%), Positives = 19/51 (37%)

Query: 6  WKSVVMWLVFLFYSILALLLLSIIVLPLSVLLFKYYVSKKRKSYNVLKTVA 56
          W S    L    Y+ +  LLL + +L + +L         R   + +  + 
Sbjct: 34 WSSGSWSLFPSGYTGIRSLLLFLALLLIGILRKNQLHVGYRPYSSRIAQIL 84


>gnl|CDD|131258 TIGR02203, MsbA_lipidA, lipid A export permease/ATP-binding protein
           MsbA.  This family consists of a single polypeptide
           chain transporter in the ATP-binding cassette (ABC)
           transporter family, MsbA, which exports lipid A. It may
           also act in multidrug resistance. Lipid A, a part of
           lipopolysaccharide, is found in the outer leaflet of the
           outer membrane of most Gram-negative bacteria. Members
           of this family are restricted to the Proteobacteria
           (although lipid A is more broadly distributed) and often
           are clustered with lipid A biosynthesis genes [Cell
           envelope, Biosynthesis and degradation of surface
           polysaccharides and lipopolysaccharides, Transport and
           binding proteins, Other].
          Length = 571

 Score = 29.3 bits (66), Expect = 5.3
 Identities = 9/40 (22%), Positives = 21/40 (52%)

Query: 8   SVVMWLVFLFYSILALLLLSIIVLPLSVLLFKYYVSKKRK 47
           +V+   + L Y    L L+ +++LP+  +L +    + R+
Sbjct: 141 TVIGLFIVLLYYSWQLTLIVVVMLPVLSILMRRVSKRLRR 180


>gnl|CDD|214864 smart00860, SMI1_KNR4, SMI1 / KNR4 family.  Proteins in this family
           are involved in the regulation of 1,3-beta-glucan
           synthase activity and cell-wall formation.
          Length = 127

 Score = 28.0 bits (62), Expect = 6.4
 Identities = 14/91 (15%), Positives = 24/91 (26%), Gaps = 1/91 (1%)

Query: 101 EIIKRAHQRFNIVLPDQVINFVYLYRRKFVEASLYPYFTLLGQSIGSMILGVEALL-SFQ 159
           E I    ++  I LP+    F+ LY    +  S       L       +           
Sbjct: 5   EEIAELEKKLGIKLPEDYKEFLLLYNGGELGGSAELPLGGLSLLDLLELEAAIEDQEEID 64

Query: 160 PDIYIDTMGYAFTYPLFSYIGGSKVACYIHY 190
            DI  +           +  GG  +   +  
Sbjct: 65  SDIEEEEYALPGWLIFIADGGGGGILIDLDD 95


>gnl|CDD|236598 PRK09631, PRK09631, DNA topoisomerase IV subunit A; Provisional.
          Length = 635

 Score = 28.9 bits (65), Expect = 6.4
 Identities = 17/58 (29%), Positives = 27/58 (46%), Gaps = 4/58 (6%)

Query: 84  PDYKIYIYTGDVDASPSEIIKRAHQRFNIVLPDQVINFVYLYR---RKFVEASLYPYF 138
           PD K +I   +V     +I     Q F+I+  D+  NF Y+ R    KF+   +Y + 
Sbjct: 516 PD-KTFIDKKNVYVGVVDINNSKEQVFSIIYRDKKDNFYYVKRFKIDKFILDKVYRFL 572


>gnl|CDD|172161 PRK13595, ubiA, prenyltransferase; Provisional.
          Length = 292

 Score = 28.7 bits (64), Expect = 6.9
 Identities = 24/82 (29%), Positives = 34/82 (41%), Gaps = 7/82 (8%)

Query: 5   LWKSVVMWLV-FLFYSILALLLLSIIVLPLSVLLFKYYVS-----KKRKSYNVLKTVAFF 58
           L ++V++    FL Y  L L   + ++L L   LF  Y       K R   + L   A+ 
Sbjct: 97  LLRAVLLLNAPFLLYLALLLPPAATLLLLLYAALFVGYSLPPLRFKARPFLDGLSNAAYA 156

Query: 59  HPYC-NAGGGGERVLWTAVLAL 79
            P    A   G  V W  +LAL
Sbjct: 157 LPLALPALALGAPVPWPPLLAL 178


>gnl|CDD|238930 cd01968, Nitrogenase_NifE_I, Nitrogenase_NifE_I: a subgroup of the
           NifE subunit of the NifEN complex: NifE forms an
           alpha2beta2 tetramer with NifN.  NifE and NifN are
           structurally homologous to nitrogenase MoFe protein
           alpha and beta subunits respectively.  NifEN
           participates in the synthesis of the iron-molybdenum
           cofactor (FeMoco) of the MoFe protein.  NifB-co (an iron
           and sulfur containing precursor of the FeMoco) from NifB
           is transferred to the NifEN complex where it is further
           processed to FeMoco. The NifEN bound precursor of FeMoco
           has been identified as a molybdenum-free, iron- and
           sulfur- containing analog of FeMoco. It has been
           suggested that this NifEN bound precursor also acts as a
           cofactor precursor in nitrogenase systems which require
           a cofactor other than FeMoco: i.e. iron-vanadium
           cofactor (FeVco) or iron only cofactor (FeFeco).
          Length = 410

 Score = 28.8 bits (65), Expect = 7.8
 Identities = 15/57 (26%), Positives = 29/57 (50%), Gaps = 12/57 (21%)

Query: 67  GGERVLWTAVLALHQKYPDYKIYIY----TG----DVDASPSEIIKRAHQRFNIVLP 115
           GGE+ L+ A+L + ++Y    +++Y          D+DA    + K A ++F I + 
Sbjct: 69  GGEKKLYKAILEIIERYHPKAVFVYSTCVVALIGDDIDA----VCKTASEKFGIPVI 121


>gnl|CDD|206204 pfam14034, Spore_YtrH, Sporulation protein YtrH.  This family of
           proteins is involved in sporulation. It may contribute
           to the formation and stability of the thick
           peptidoglycan layer between the two membranes of the
           spore, known as the cortex. In Bacillus subtilis its
           expression is regulated by sigma-E.
          Length = 102

 Score = 27.1 bits (61), Expect = 8.7
 Identities = 10/28 (35%), Positives = 16/28 (57%), Gaps = 1/28 (3%)

Query: 134 LYPYFTLLGQSIG-SMILGVEALLSFQP 160
           +  +F   G  IG S+I G+ A L+ +P
Sbjct: 4   ILSFFIAFGVVIGGSLIGGLGAFLTGEP 31


>gnl|CDD|166000 PLN02359, PLN02359, ethanolaminephosphotransferase.
          Length = 389

 Score = 28.6 bits (64), Expect = 8.9
 Identities = 20/71 (28%), Positives = 29/71 (40%), Gaps = 12/71 (16%)

Query: 113 VLPDQVINFVYLYRRKFVE----------ASLYPYFTLLGQSIGSMILGVEALLSFQPDI 162
           V+P    N   +Y  K V+          A LYP+ TLLG  +    L    L+   P +
Sbjct: 230 VIPTVAFNVSNVY--KVVQARKGSMLLALAMLYPFVTLLGGVLIWDYLSPSDLMRNYPHL 287

Query: 163 YIDTMGYAFTY 173
            +   G AF +
Sbjct: 288 VVLGTGLAFGF 298


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.325    0.139    0.425 

Gapped
Lambda     K      H
   0.267   0.0580    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 25,256,398
Number of extensions: 2502218
Number of successful extensions: 4253
Number of sequences better than 10.0: 1
Number of HSP's gapped: 4212
Number of HSP's successfully gapped: 110
Length of query: 488
Length of database: 10,937,602
Length adjustment: 101
Effective length of query: 387
Effective length of database: 6,457,848
Effective search space: 2499187176
Effective search space used: 2499187176
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.6 bits)
S2: 61 (27.6 bits)