RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy15582
         (477 letters)



>gnl|CDD|223071 PHA03392, egt, ecdysteroid UDP-glucosyltransferase; Provisional.
          Length = 507

 Score =  230 bits (590), Expect = 6e-70
 Identities = 131/497 (26%), Positives = 236/497 (47%), Gaps = 62/497 (12%)

Query: 18  LILMAFLLTVESANVLIICPTPSYSHQVPFIAIGKELVRRGHTVTMIGTDPL----KEPP 73
           L+L+  L  V +A +L + PTP+YSH   F    + L  RGH VT+I             
Sbjct: 9   LLLLLLLSGVRAARILAVFPTPAYSHHSVFKVYVEALAERGHNVTVIKPTLRVYYASHLC 68

Query: 74  VNYTDIDLSFSYKYFKPQLQKGEVLPDAVDNQRRLTGYEFIVN----IGRITIAYTEDQL 129
            N T+ID S S +YFK +L K      AV  +R +      V     +G + +    DQ 
Sbjct: 69  GNITEIDASLSVEYFK-KLVKSS----AVFRKRGVVADSSTVTADNYMGLVRM--ISDQF 121

Query: 130 KSQQMQQFFKYIDENHVKFDLIIYEGLL-HTAYLGFL----------------------- 165
               ++     I   + KFDL++ E  L +      L                       
Sbjct: 122 DLPNVKNL---IANKNNKFDLLVTEAFLDYPLVFSHLFGDAPVIQISSGYGLAENFETMG 178

Query: 166 ---------PKLGYT--QSMTLMERMNNLFM--QLYSKFYIRSRLMKKQDEIMERYFGTR 212
                    P L  +   ++ + E +N ++   +LY++F   S L  +Q++++++ FG  
Sbjct: 179 AVSRHPVYYPNLWRSKFGNLNVWETINEIYTELRLYNEF---SLLADEQNKLLKQQFGPD 235

Query: 213 GLSGKQLEENKTLLFISTSWLLTYPRPVFPNTILLGPIHLNN--PKPLPQNLKDWIEGAK 270
             + ++L     LLF++   +    RPV P+   LG +HL+   P+PL   L++++  + 
Sbjct: 236 TPTIRELRNRVQLLFVNVHPVFDNNRPVPPSVQYLGGLHLHKKPPQPLDDYLEEFLNNST 295

Query: 271 DGVIYFSLGTNMQSASLQEDKRKAIVDSFKQFPRHRIIWKWEEDI-LPDLPSNVICRKWL 329
           +GV+Y S G+++ +  +  +  + ++ +FK+ P   ++WK++ ++   +LP+NV+ +KW 
Sbjct: 296 NGVVYVSFGSSIDTNDMDNEFLQMLLRTFKKLPY-NVLWKYDGEVEAINLPANVLTQKWF 354

Query: 330 PQHDILAHPKVKLFIMQGGLQSSQEAIHFGVPMIGIPFFADQDTNVRKLESMDVARFLEY 389
           PQ  +L H  VK F+ QGG+QS+ EAI   VPM+G+P   DQ  N  K   + + R L+ 
Sbjct: 355 PQRAVLKHKNVKAFVTQGGVQSTDEAIDALVPMVGLPMMGDQFYNTNKYVELGIGRALDT 414

Query: 390 ENITAETLVTLMKSILYNETVYRKSQVYSKLSNTQMMSPKDTAVWWIEYVLKAEGNVDHL 449
             ++A  LV  +  ++ N    +  +    L   Q M+P   A+W+ E+V++ +     L
Sbjct: 415 VTVSAAQLVLAIVDVIENPKYRKNLKELRHLIRHQPMTPLHKAIWYTEHVIRNKHGNTSL 474

Query: 450 KYNLDQIPWYQYYLVDL 466
           K     + +  Y++  +
Sbjct: 475 KTKAANVSYSDYFMSYI 491


>gnl|CDD|201077 pfam00201, UDPGT, UDP-glucoronosyl and UDP-glucosyl transferase. 
          Length = 500

 Score =  177 bits (451), Expect = 5e-50
 Identities = 95/300 (31%), Positives = 161/300 (53%), Gaps = 7/300 (2%)

Query: 174 MTLMERMNNLFMQLYSKFYIRSRLMKKQDEIMERYFGTRGLSGKQLEENKTLLFISTSWL 233
           MT  ER+ N+ + LY  F+ + R  KK D+      G      + + +    L +   W 
Sbjct: 180 MTFGERVKNMLIMLYFDFWFQ-RFPKKWDQFASELLGRPVTLPELMSKASAWL-LRNYWD 237

Query: 234 LTYPRPVFPNTILLGPIHLNNPKPLPQNLKDWIEGAKD-GVIYFSLGTNMQSASLQEDKR 292
           L +PRP+ PN   +G ++    KPLPQ ++ +++ + + GV+ FSLG+ M S  + E+K 
Sbjct: 238 LEFPRPLLPNMEFIGGLNCKPAKPLPQEMEAFVQSSGEHGVVVFSLGS-MVSN-IPEEKA 295

Query: 293 KAIVDSFKQFPRHRIIWKWEEDILPDLPSNVICRKWLPQHDILAHPKVKLFIMQGGLQSS 352
             I  +  Q P+ +++W+++      L  N    KWLPQ+D+L HPK + F+   G    
Sbjct: 296 NEIASALAQIPQ-KVLWRFDGTKPSTLGRNTRLVKWLPQNDLLGHPKTRAFVTHAGSNGV 354

Query: 353 QEAIHFGVPMIGIPFFADQDTNVRKLESMDVARFLEYENITAETLVTLMKSILYNETVYR 412
            EAI  GVPM+G+P F DQ  N + +E+   A  L    +T+E L+  +K+++ + +   
Sbjct: 355 YEAICHGVPMVGMPLFGDQMDNAKHMEAKGAAVTLNVLTMTSEDLLNALKTVINDPSYKE 414

Query: 413 KSQVYSKLSNTQMMSPKDTAVWWIEYVLKAEGNVDHLKYNLDQIPWYQYYLVDLAGIFIA 472
                S + + Q + P D AV+WIE+V++ +G   HL+     + WYQY+ +D+ G  +A
Sbjct: 415 NIMRLSSIHHDQPVKPLDRAVFWIEFVMRHKG-AKHLRPAAHDLTWYQYHSLDVIGFLLA 473


>gnl|CDD|99960 cd03784, GT1_Gtf_like, This family includes the Gtfs, a group of
           homologous glycosyltransferases involved in the final
           stages of the biosynthesis of antibiotics vancomycin and
           related chloroeremomycin. Gtfs transfer sugar moieties
           from an activated NDP-sugar donor to the oxidatively
           cross-linked heptapeptide core of vancomycin group
           antibiotics. The core structure is important for the
           bioactivity of the antibiotics.
          Length = 401

 Score =  105 bits (264), Expect = 9e-25
 Identities = 74/400 (18%), Positives = 121/400 (30%), Gaps = 56/400 (14%)

Query: 31  NVLIICPTPSYSHQVPFIAIGKELVRRGHTVTMIGTDPLKE-------PPVNY-TDIDLS 82
            VLI     S     P +A+   L   GH V +       +         V    D D  
Sbjct: 2   RVLITT-IGSRGDVQPLVALAWALRAAGHEVRVATPPEFADLVEAAGLEFVPVGGDPD-- 58

Query: 83  FSYKYFKPQLQKGEVLPDAVDNQRRLTGYEFIVNIGRITIAYTEDQLKSQQMQQFFKYID 142
                   +L         +           +  + R   A  +D + + +         
Sbjct: 59  --------ELLASPERNAGLLLLGPGLLLGALRLLRREAEAMLDDLVAAARDWGP----- 105

Query: 143 ENHVKFDLIIYEGLLHTAYLGFL--PKLG------YTQSMTLMERMNNLFMQLYSKFYIR 194
                 DL++ + L   A+ G +    LG           T          +   + Y  
Sbjct: 106 ------DLVVADPL---AFAGAVAAEALGIPAVRLLLGPDTPTSAFPPPLGRANLRLYAL 156

Query: 195 SRLMKKQDEIMERY------FGTRGLSGKQLEENKTLLFISTSWL-LTYPRPVFPNTILL 247
                 QD +           G   LS     +   L   S + L      P F      
Sbjct: 157 LEAELWQDLLGAWLRARRRRLGLPPLSLLDGSDVPELYGFSPAVLPPPPDWPRFDLVTGY 216

Query: 248 GPIHLNNPKPLPQNLKDWIEGAKDGVIYFSLGTNMQSASLQEDKRKAIVDSFKQFPRHRI 307
           G   +    P P  L  ++  A    +Y   G+ M      E   +  V+        R 
Sbjct: 217 GFRDVPYNGPPPPELWLFLA-AGRPPVYVGFGS-MVVRDP-EALARLDVE-AVATLGQRA 272

Query: 308 IWK--WEEDILPDLPSNVICRKWLPQHDILAHPKVKLFIMQGGLQSSQEAIHFGVPMIGI 365
           I    W      DLP NV    ++P   +L  P+    +  GG  ++  A+  GVP + +
Sbjct: 273 ILSLGWGGLGAEDLPDNVRVVDFVPHDWLL--PRCAAVVHHGGAGTTAAALRAGVPQLVV 330

Query: 366 PFFADQDTNVRKLESMDVARFLEYENITAETLVTLMKSIL 405
           PFF DQ     ++  +     L+   +TAE L   ++ +L
Sbjct: 331 PFFGDQPFWAARVAELGAGPALDPRELTAERLAAALRRLL 370


>gnl|CDD|233407 TIGR01426, MGT, glycosyltransferase, MGT family.  This model
           describes the MGT (macroside glycosyltransferase)
           subfamily of the UDP-glucuronosyltransferase family.
           Members include a number of glucosyl transferases for
           macrolide antibiotic inactivation, but also include
           transferases of glucose-related sugars for macrolide
           antibiotic production [Cellular processes, Toxin
           production and resistance].
          Length = 392

 Score =  100 bits (251), Expect = 4e-23
 Identities = 92/396 (23%), Positives = 142/396 (35%), Gaps = 87/396 (21%)

Query: 39  PSYSHQVPFIAIGKELVRRGHTVTMIGTDPLKEP-------PVNYTDIDLSFSYKYFKPQ 91
           P++ H  P + + +ELV RGH VT   T+   E         V Y               
Sbjct: 4   PAHGHVNPTLGVVEELVARGHRVTYATTEEFAERVEAAGAEFVLYGSAL----------- 52

Query: 92  LQKGEVLPDAVDNQRRLTGYEFIVNIGRITIAYTEDQLKSQQMQQFFKYIDENHVKFDLI 151
                      DN    T  E I  I ++ +   ED L   Q+++   Y  +   + DLI
Sbjct: 53  --------PPPDNPPENTEEEPIDIIEKL-LDEAEDVL--PQLEE--AYKGD---RPDLI 96

Query: 152 IYEGLLHTAYLGFLPKLGYTQSMTLMERMNNLFMQLYSKFYIRSRLMKKQ--------DE 203
           +Y+    T  L             L  + +   +  +  F       +          +E
Sbjct: 97  VYDIASWTGRL-------------LARKWDVPVISSFPTFAANEEFEEMVSPAGEGSAEE 143

Query: 204 IMERYFGTRGLSGKQ---LEENKT-------LLFISTSWLLTY-PRPVFPNT-------I 245
                 G      +    LEE+         L        L Y P+   P          
Sbjct: 144 GAIAERGLAEYVARLSALLEEHGITTPPVEFLAAPRRDLNLVYTPKAFQPAGETFDDSFT 203

Query: 246 LLGPIHLNNPKPLPQNLKDW-IEGAKDGVIYFSLGTNMQSASLQEDKRKAIVDSFKQFPR 304
            +GP          +    W   G    V+  SLGT     + Q    +  V++F+    
Sbjct: 204 FVGP-----CIGDRKEDGSWERPGDGRPVVLISLGT---VFNNQPSFYRTCVEAFRDLDW 255

Query: 305 HRII---WKWEEDILPDLPSNVICRKWLPQHDILAHPKVKLFIMQGGLQSSQEAIHFGVP 361
           H ++      +   L +LP NV  R+W+PQ +IL   K   FI  GG+ S+ EA+  GVP
Sbjct: 256 HVVLSVGRGVDPADLGELPPNVEVRQWVPQLEIL--KKADAFITHGGMNSTMEALFNGVP 313

Query: 362 MIGIPFFADQDTNVRKLESMDVARFLEYENITAETL 397
           M+ +P  ADQ    R++  + + R L  E +TAE L
Sbjct: 314 MVAVPQGADQPMTARRIAELGLGRHLPPEEVTAEKL 349


>gnl|CDD|224732 COG1819, COG1819, Glycosyl transferases, related to
           UDP-glucuronosyltransferase [Carbohydrate transport and
           metabolism / Signal transduction mechanisms].
          Length = 406

 Score = 97.1 bits (242), Expect = 6e-22
 Identities = 80/426 (18%), Positives = 145/426 (34%), Gaps = 63/426 (14%)

Query: 32  VLIICPTPSYSHQVPFIAIGKELVRRGHTVTMIGTDPLKEPPVNYTDIDLSFSYKYFKPQ 91
           + ++C   +Y H  P +A+GKEL RRGH V                      S   FK  
Sbjct: 5   LFVVCG--AYGHVNPCLALGKELRRRGHEVVFA-------------------STGKFKEF 43

Query: 92  LQK-GEVLPDAVDNQRRLTGYEFIVNIGR-ITIAYTEDQLKSQQMQQFFKYIDENHVKFD 149
           ++  G            L   +      +       + +   +++ +  + + E  +  D
Sbjct: 44  VEAAGLAFVAYPIRDSELATEDGKFAGVKSFRRLLQQFKKLIRELLELLREL-EPDLVVD 102

Query: 150 LIIYEGLLHTAYLG-------------------FLPKLGYTQSMTLMERMNNLFMQLYSK 190
                  L    LG                    LP +G    + +          +   
Sbjct: 103 DARLSLGLAARLLGIPVVGINVAPYTPLPAAGLPLPPVGIAGKLPIPLY-PLPPRLVRPL 161

Query: 191 FYIRSRLMKKQDEIMERYFGTRGLSGKQLEENKTLLFISTSWL--LTYPRP-VFPNTILL 247
            + RS L K                 +       L    T  L       P + P    +
Sbjct: 162 IFARSWLPKLVVRRNLGLELGLPNIRRLFASGPLLEIAYTDVLFPPGDRLPFIGP---YI 218

Query: 248 GPIHLNNPKPLPQNLKDWIEGAKDGVIYFSLGTNMQSASLQEDKRKAIVDSFKQFPRHRI 307
           GP+           L  WI   +  ++Y SLGT         +    ++++        I
Sbjct: 219 GPLLGEAA----NELPYWIPADRP-IVYVSLGTV----GNAVELLAIVLEALADLDVRVI 269

Query: 308 IWKWEEDI-LPDLPSNVICRKWLPQHDILAHPKVKLFIMQGGLQSSQEAIHFGVPMIGIP 366
           +        L ++P NVI   ++PQ ++L  P+    I  GG  ++ EA++ GVP++ IP
Sbjct: 270 VSLGGARDTLVNVPDNVIVADYVPQLELL--PRADAVIHHGGAGTTSEALYAGVPLVVIP 327

Query: 367 FFADQDTNVRKLESMDVARFLEYENITAETLVTLMKSILYNETVYRKSQVYSKLSNTQMM 426
             ADQ  N  ++E +     L +E +T E L   +  +L +++  R ++  ++    +  
Sbjct: 328 DGADQPLNAERVEELGAGIALPFEELTEERLRAAVNEVLADDSYRRAAERLAEEFKEE-D 386

Query: 427 SPKDTA 432
            P   A
Sbjct: 387 GPAKAA 392


>gnl|CDD|178032 PLN02410, PLN02410, UDP-glucoronosyl/UDP-glucosyl transferase
           family protein.
          Length = 451

 Score = 64.3 bits (156), Expect = 5e-11
 Identities = 52/161 (32%), Positives = 72/161 (44%), Gaps = 28/161 (17%)

Query: 239 PVFPNTILLGPIHL--NNPKPLPQNLKDWIE----GAKDGVIYFSLGTNMQSASLQEDKR 292
           PV+P    +GP+HL  + P  L +  K  IE      K+ VI+ SLG    S +L E   
Sbjct: 230 PVYP----IGPLHLVASAPTSLLEENKSCIEWLNKQKKNSVIFVSLG----SLALMEINE 281

Query: 293 KAIVDSFKQFPRHRIIW----------KWEEDILPDLPSNVICR----KWLPQHDILAHP 338
                S       + +W          +W E +  +    +  R    KW PQ ++L+HP
Sbjct: 282 VMETASGLDSSNQQFLWVIRPGSVRGSEWIESLPKEFSKIISGRGYIVKWAPQKEVLSHP 341

Query: 339 KVKLFIMQGGLQSSQEAIHFGVPMIGIPFFADQDTNVRKLE 379
            V  F    G  S+ E+I  GVPMI  PF +DQ  N R LE
Sbjct: 342 AVGGFWSHCGWNSTLESIGEGVPMICKPFSSDQKVNARYLE 382


>gnl|CDD|215084 PLN00164, PLN00164, glucosyltransferase; Provisional.
          Length = 480

 Score = 57.8 bits (140), Expect = 7e-09
 Identities = 24/60 (40%), Positives = 31/60 (51%), Gaps = 1/60 (1%)

Query: 326 RKWLPQHDILAHPKVKLFIMQGGLQSSQEAIHFGVPMIGIPFFADQDTNVRKL-ESMDVA 384
             W PQ +ILAH  V  F+   G  S  E++  GVPM   P +A+Q  N  +L   M VA
Sbjct: 344 PTWAPQKEILAHAAVGGFVTHCGWNSVLESLWHGVPMAPWPLYAEQHLNAFELVADMGVA 403


>gnl|CDD|215112 PLN02167, PLN02167, UDP-glycosyltransferase family protein.
          Length = 475

 Score = 56.0 bits (135), Expect = 2e-08
 Identities = 62/246 (25%), Positives = 105/246 (42%), Gaps = 48/246 (19%)

Query: 203 EIMERYFGTRGL---SGKQLEENKTLLFISTSWLLTYPRPVFPNTILLGPIHLNNPKPLP 259
           EI ER+   +G+   S  +LE N    F  +     YP PV+P    +GPI     +  P
Sbjct: 208 EIAERFPEAKGILVNSFTELEPNAFDYF--SRLPENYP-PVYP----VGPILSLKDRTSP 260

Query: 260 -------QNLKDWIEGAKDG-VIYFSLGTNMQSASLQEDKRKAIVDSFKQFPRHRIIWKW 311
                    +  W++   +  V++   G+     SL   + K I  +  +    R +W  
Sbjct: 261 NLDSSDRDRIMRWLDDQPESSVVFLCFGS---LGSLPAPQIKEIAQAL-ELVGCRFLWSI 316

Query: 312 E---------EDILPD------LPSNVICRKWLPQHDILAHPKVKLFIMQGGLQSSQEAI 356
                      + LP+      +   ++C  W PQ +ILAH  +  F+   G  S  E++
Sbjct: 317 RTNPAEYASPYEPLPEGFMDRVMGRGLVC-GWAPQVEILAHKAIGGFVSHCGWNSVLESL 375

Query: 357 HFGVPMIGIPFFADQDTN----VRKLESMDVARFLEY-----ENITAETLVTLMKSILYN 407
            FGVP+   P +A+Q  N    V++L  + V   L+Y     E + A+ +   ++S++  
Sbjct: 376 WFGVPIATWPMYAEQQLNAFTMVKEL-GLAVELRLDYVSAYGEIVKADEIAGAVRSLMDG 434

Query: 408 ETVYRK 413
           E V RK
Sbjct: 435 EDVPRK 440


>gnl|CDD|178572 PLN02992, PLN02992, coniferyl-alcohol glucosyltransferase.
          Length = 481

 Score = 53.1 bits (127), Expect = 2e-07
 Identities = 54/222 (24%), Positives = 87/222 (39%), Gaps = 40/222 (18%)

Query: 239 PVFPNTILLGPIHLNNPKPLPQNLKDWI-EGAKDGVIYFSLGTNMQSASLQEDKRKAIVD 297
           PV+P   L  PI           + DW+ +   + V+Y S G+     SL   +   +  
Sbjct: 233 PVYPIGPLCRPI---QSSKTDHPVLDWLNKQPNESVLYISFGS---GGSLSAKQLTELAW 286

Query: 298 SFKQFPRHRIIWKWE-------------------EDILPD-LPSNVICR---------KW 328
             +   + R +W                       D  P+ LP   + R          W
Sbjct: 287 GLEM-SQQRFVWVVRPPVDGSACSAYFSANGGETRDNTPEYLPEGFVSRTHDRGFVVPSW 345

Query: 329 LPQHDILAHPKVKLFIMQGGLQSSQEAIHFGVPMIGIPFFADQDTNVRKL-ESMDVARFL 387
            PQ +ILAH  V  F+   G  S+ E++  GVPMI  P FA+Q+ N   L + + +A   
Sbjct: 346 APQAEILAHQAVGGFLTHCGWSSTLESVVGGVPMIAWPLFAEQNMNAALLSDELGIAVRS 405

Query: 388 E--YENITAETLVTLMKSILYNETVYRKSQVYSKLSNTQMMS 427
           +   E I+   +  L++ ++  E      +   KL +T  MS
Sbjct: 406 DDPKEVISRSKIEALVRKVMVEEEGEEMRRKVKKLRDTAEMS 447


>gnl|CDD|177830 PLN02173, PLN02173, UDP-glucosyl transferase family protein.
          Length = 449

 Score = 52.0 bits (124), Expect = 4e-07
 Identities = 30/125 (24%), Positives = 60/125 (48%), Gaps = 10/125 (8%)

Query: 264 DWIEGAKDG-VIYFSLGTNMQSASLQEDKRKAIVDSFKQFPRHRIIWKWEEDILPDLPSN 322
           DW++    G V+Y + G+  + +S Q ++  + + +F       ++   EE  LP     
Sbjct: 256 DWLDKRPQGSVVYIAFGSMAKLSSEQMEEIASAISNFSYL---WVVRASEESKLPPGFLE 312

Query: 323 VICR------KWLPQHDILAHPKVKLFIMQGGLQSSQEAIHFGVPMIGIPFFADQDTNVR 376
            + +      KW PQ  +L++  +  F+   G  S+ E +  GVPM+ +P + DQ  N +
Sbjct: 313 TVDKDKSLVLKWSPQLQVLSNKAIGCFMTHCGWNSTMEGLSLGVPMVAMPQWTDQPMNAK 372

Query: 377 KLESM 381
            ++ +
Sbjct: 373 YIQDV 377


>gnl|CDD|215465 PLN02863, PLN02863, UDP-glucoronosyl/UDP-glucosyl transferase
           family protein.
          Length = 477

 Score = 51.4 bits (123), Expect = 6e-07
 Identities = 45/191 (23%), Positives = 80/191 (41%), Gaps = 25/191 (13%)

Query: 240 VFPNTILLGPIHLNNPKPLP-QNLKDWIEGAKDG-VIYFSLGTNMQSASLQEDKRKAIVD 297
           +  +    G +    P  +   ++  W++  +D  V+Y   G+      L +++ +A+  
Sbjct: 250 LPLSGEKSGLMERGGPSSVSVDDVMTWLDTCEDHKVVYVCFGSQ---VVLTKEQMEALAS 306

Query: 298 SFKQFPRHRIIWKWEEDILPD-----LPSN---------VICRKWLPQHDILAHPKVKLF 343
             ++   H  IW  +E +  +     +PS          ++ R W PQ  IL+H  V  F
Sbjct: 307 GLEKSGVH-FIWCVKEPVNEESDYSNIPSGFEDRVAGRGLVIRGWAPQVAILSHRAVGAF 365

Query: 344 IMQGGLQSSQEAIHFGVPMIGIPFFADQDTNVRKL-ESMDVA-RFLEYENI---TAETLV 398
           +   G  S  E +  GVPM+  P  ADQ  N   L + + VA R  E  +    + E   
Sbjct: 366 LTHCGWNSVLEGLVAGVPMLAWPMAADQFVNASLLVDELKVAVRVCEGADTVPDSDELAR 425

Query: 399 TLMKSILYNET 409
             M+S+  N+ 
Sbjct: 426 VFMESVSENQV 436


>gnl|CDD|177857 PLN02207, PLN02207, UDP-glycosyltransferase.
          Length = 468

 Score = 49.7 bits (118), Expect = 2e-06
 Identities = 44/162 (27%), Positives = 74/162 (45%), Gaps = 27/162 (16%)

Query: 241 FPNTILLGPIHLNNPKPLPQN-------LKDWIEGAKDGVIYFSLGTNMQSASLQEDKRK 293
           +P+   +GPI     +P P+        L  W++   +  + F    +M    L+    K
Sbjct: 237 YPSVYAVGPIFDLKAQPHPEQDLARRDELMKWLDDQPEASVVFLCFGSM--GRLRGPLVK 294

Query: 294 AIVDSFKQFPRHRIIW--KWEE----DILPD------LPSNVICRKWLPQHDILAHPKVK 341
            I     +  ++R +W  + EE    D+LP+          +IC  W PQ +ILAH  V 
Sbjct: 295 EIAHGL-ELCQYRFLWSLRTEEVTNDDLLPEGFLDRVSGRGMIC-GWSPQVEILAHKAVG 352

Query: 342 LFIMQGGLQSSQEAIHFGVPMIGIPFFADQDTN----VRKLE 379
            F+   G  S  E++ FGVP++  P +A+Q  N    V++L+
Sbjct: 353 GFVSHCGWNSIVESLWFGVPIVTWPMYAEQQLNAFLMVKELK 394


>gnl|CDD|215293 PLN02534, PLN02534, UDP-glycosyltransferase.
          Length = 491

 Score = 49.1 bits (117), Expect = 4e-06
 Identities = 22/52 (42%), Positives = 30/52 (57%)

Query: 323 VICRKWLPQHDILAHPKVKLFIMQGGLQSSQEAIHFGVPMIGIPFFADQDTN 374
           ++ + W PQ  IL+HP +  F+   G  S+ E I  GVPMI  P FA+Q  N
Sbjct: 346 LLIKGWAPQVLILSHPAIGGFLTHCGWNSTIEGICSGVPMITWPLFAEQFLN 397


>gnl|CDD|178170 PLN02555, PLN02555, limonoid glucosyltransferase.
          Length = 480

 Score = 48.3 bits (115), Expect = 7e-06
 Identities = 21/52 (40%), Positives = 30/52 (57%)

Query: 327 KWLPQHDILAHPKVKLFIMQGGLQSSQEAIHFGVPMIGIPFFADQDTNVRKL 378
           +W PQ  +LAHP V  F+   G  S+ EA+  GVP++  P + DQ T+   L
Sbjct: 343 QWCPQEKVLAHPSVACFVTHCGWNSTMEALSSGVPVVCFPQWGDQVTDAVYL 394


>gnl|CDD|215127 PLN02210, PLN02210, UDP-glucosyl transferase.
          Length = 456

 Score = 47.7 bits (113), Expect = 8e-06
 Identities = 39/162 (24%), Positives = 72/162 (44%), Gaps = 24/162 (14%)

Query: 238 RPVFPNTILLGPIHLNNPKPLPQNLK------------DWIEG-AKDGVIYFSLGTNMQS 284
           +PV P   L+ P  L + +    + K            +W++  A+  V+Y S G+ ++S
Sbjct: 223 KPVIPIGPLVSPFLLGDDEEETLDGKNLDMCKSDDCCMEWLDKQARSSVVYISFGSMLES 282

Query: 285 ASLQEDKRKAIVDSFKQ--FPRHRIIWKWEE----DILPDL--PSNVICRKWLPQHDILA 336
               E++ + I  + K    P   +I   E+     +L ++      +  +W PQ  IL+
Sbjct: 283 L---ENQVETIAKALKNRGVPFLWVIRPKEKAQNVQVLQEMVKEGQGVVLEWSPQEKILS 339

Query: 337 HPKVKLFIMQGGLQSSQEAIHFGVPMIGIPFFADQDTNVRKL 378
           H  +  F+   G  S+ E +  GVP++  P + DQ  + R L
Sbjct: 340 HMAISCFVTHCGWNSTIETVVAGVPVVAYPSWTDQPIDARLL 381


>gnl|CDD|215304 PLN02554, PLN02554, UDP-glycosyltransferase family protein.
          Length = 481

 Score = 47.9 bits (114), Expect = 9e-06
 Identities = 19/47 (40%), Positives = 26/47 (55%)

Query: 328 WLPQHDILAHPKVKLFIMQGGLQSSQEAIHFGVPMIGIPFFADQDTN 374
           W PQ  +LA P +  F+   G  S  E++ FGVPM   P +A+Q  N
Sbjct: 349 WAPQVAVLAKPAIGGFVTHCGWNSILESLWFGVPMAAWPLYAEQKFN 395


>gnl|CDD|177813 PLN02152, PLN02152, indole-3-acetate beta-glucosyltransferase.
          Length = 455

 Score = 46.2 bits (109), Expect = 2e-05
 Identities = 19/52 (36%), Positives = 29/52 (55%)

Query: 328 WLPQHDILAHPKVKLFIMQGGLQSSQEAIHFGVPMIGIPFFADQDTNVRKLE 379
           W  Q ++L H  V  F+   G  SS E++  GVP++  P ++DQ  N + LE
Sbjct: 334 WCSQIEVLRHRAVGCFVTHCGWSSSLESLVLGVPVVAFPMWSDQPANAKLLE 385


>gnl|CDD|215247 PLN02448, PLN02448, UDP-glycosyltransferase family protein.
          Length = 459

 Score = 45.4 bits (108), Expect = 4e-05
 Identities = 19/50 (38%), Positives = 24/50 (48%)

Query: 328 WLPQHDILAHPKVKLFIMQGGLQSSQEAIHFGVPMIGIPFFADQDTNVRK 377
           W  Q  +L H  V  F    G  S+ EA+  GVPM+  P F DQ  N + 
Sbjct: 330 WCDQLKVLCHSSVGGFWTHCGWNSTLEAVFAGVPMLTFPLFWDQPLNSKL 379


>gnl|CDD|178589 PLN03015, PLN03015, UDP-glucosyl transferase.
          Length = 470

 Score = 44.7 bits (105), Expect = 8e-05
 Identities = 28/91 (30%), Positives = 48/91 (52%), Gaps = 5/91 (5%)

Query: 323 VICRKWLPQHDILAHPKVKLFIMQGGLQSSQEAIHFGVPMIGIPFFADQDTNVRKL-ESM 381
           ++  +W PQ +IL+H  +  F+   G  S  E++  GVP++  P +A+Q  N   L E +
Sbjct: 337 LVVTQWAPQVEILSHRSIGGFLSHCGWSSVLESLTKGVPIVAWPLYAEQWMNATLLTEEI 396

Query: 382 DVA-RFLEYEN---ITAETLVTLMKSILYNE 408
            VA R  E  +   I  E + +L++ I+  E
Sbjct: 397 GVAVRTSELPSEKVIGREEVASLVRKIVAEE 427


>gnl|CDD|178275 PLN02670, PLN02670, transferase, transferring glycosyl groups.
          Length = 472

 Score = 42.2 bits (99), Expect = 5e-04
 Identities = 39/165 (23%), Positives = 66/165 (40%), Gaps = 25/165 (15%)

Query: 236 YPRPVFPNTILLGPIHLNNPKPLPQ-----NLKDWIEGAK-DGVIYFSLGTNMQSASLQE 289
           Y +P+ P   L   I  +             +K+W++  + + V+Y +LGT    ASL+ 
Sbjct: 237 YRKPIIPIGFLPPVIEDDEEDDTIDVKGWVRIKEWLDKQRVNSVVYVALGT---EASLRR 293

Query: 290 DKRKAIVDSFKQFPRHRIIW---------KWEEDILPD------LPSNVICRKWLPQHDI 334
           ++   +    ++       W         +   ++LPD          +I   W+PQ  I
Sbjct: 294 EEVTELALGLEK-SETPFFWVLRNEPGTTQNALEMLPDGFEERVKGRGMIHVGWVPQVKI 352

Query: 335 LAHPKVKLFIMQGGLQSSQEAIHFGVPMIGIPFFADQDTNVRKLE 379
           L+H  V  F+   G  S  E + FG  +I  P   +Q  N R L 
Sbjct: 353 LSHESVGGFLTHCGWNSVVEGLGFGRVLILFPVLNEQGLNTRLLH 397


>gnl|CDD|178581 PLN03004, PLN03004, UDP-glycosyltransferase.
          Length = 451

 Score = 42.0 bits (98), Expect = 5e-04
 Identities = 19/52 (36%), Positives = 28/52 (53%)

Query: 323 VICRKWLPQHDILAHPKVKLFIMQGGLQSSQEAIHFGVPMIGIPFFADQDTN 374
           ++ + W PQ  +L H  V  F+   G  S  EA+  GVPM+  P +A+Q  N
Sbjct: 336 MVVKSWAPQVPVLNHKAVGGFVTHCGWNSILEAVCAGVPMVAWPLYAEQRFN 387


>gnl|CDD|178584 PLN03007, PLN03007, UDP-glucosyltransferase family protein.
          Length = 482

 Score = 39.1 bits (91), Expect = 0.005
 Identities = 35/125 (28%), Positives = 51/125 (40%), Gaps = 19/125 (15%)

Query: 265 WIEGAK-DGVIYFSLGTNMQSASLQEDKRKAIVDSFKQFPRHRIIW--------KWEEDI 315
           W++  K D VIY S G+    AS + ++   I     +      IW          +E+ 
Sbjct: 278 WLDSKKPDSVIYLSFGS---VASFKNEQLFEIAAGL-EGSGQNFIWVVRKNENQGEKEEW 333

Query: 316 LPD------LPSNVICRKWLPQHDILAHPKVKLFIMQGGLQSSQEAIHFGVPMIGIPFFA 369
           LP+          +I R W PQ  IL H     F+   G  S  E +  G+PM+  P  A
Sbjct: 334 LPEGFEERTKGKGLIIRGWAPQVLILDHQATGGFVTHCGWNSLLEGVAAGLPMVTWPVGA 393

Query: 370 DQDTN 374
           +Q  N
Sbjct: 394 EQFYN 398


>gnl|CDD|215029 PLN00016, PLN00016, RNA-binding protein; Provisional.
          Length = 378

 Score = 37.8 bits (88), Expect = 0.011
 Identities = 18/56 (32%), Positives = 26/56 (46%), Gaps = 6/56 (10%)

Query: 26  TVESANVLIICPTPSYSHQVPFIAIGKELVRRGHTVTMI-----GTDPLKEPPVNY 76
            VE   VLI+  T S  H      + KELV+ GH VT+       +  +K+ P + 
Sbjct: 49  AVEKKKVLIV-NTNSGGHAFIGFYLAKELVKAGHEVTLFTRGKEPSQKMKKEPFSR 103


>gnl|CDD|223779 COG0707, MurG, UDP-N-acetylglucosamine:LPS N-acetylglucosamine
           transferase [Cell envelope biogenesis, outer membrane].
          Length = 357

 Score = 36.9 bits (86), Expect = 0.018
 Identities = 17/60 (28%), Positives = 26/60 (43%), Gaps = 4/60 (6%)

Query: 354 EAIHFGVPMIGIPF----FADQDTNVRKLESMDVARFLEYENITAETLVTLMKSILYNET 409
           E +  GVP I +P+       Q+ N + LE    A  +    +T E L  L+  +L N  
Sbjct: 267 ELLALGVPAILVPYPPGADGHQEYNAKFLEKAGAALVIRQSELTPEKLAELILRLLSNPE 326


>gnl|CDD|177807 PLN00414, PLN00414, glycosyltransferase family protein.
          Length = 446

 Score = 34.2 bits (78), Expect = 0.15
 Identities = 25/82 (30%), Positives = 43/82 (52%), Gaps = 4/82 (4%)

Query: 328 WLPQHDILAHPKVKLFIMQGGLQSSQEAIHFGVPMIGIPFFADQDTNVRKL-ESMDVARF 386
           W+ Q  IL+HP V  F+   G  S  E++     ++ IP  ADQ    R L E ++V+  
Sbjct: 319 WVEQPLILSHPSVGCFVNHCGFGSMWESLVSDCQIVFIPQLADQVLITRLLTEELEVSVK 378

Query: 387 LEYEN---ITAETLVTLMKSIL 405
           ++ E+    + E+L   +KS++
Sbjct: 379 VQREDSGWFSKESLRDTVKSVM 400


>gnl|CDD|99968 cd03794, GT1_wbuB_like, This family is most closely related to
          the GT1 family of glycosyltransferases. wbuB in E. coli
          is involved in the biosynthesis of the O26 O-antigen.
          It has been proposed to function as an
          N-acetyl-L-fucosamine (L-FucNAc) transferase.
          Length = 394

 Score = 34.2 bits (79), Expect = 0.15
 Identities = 13/41 (31%), Positives = 19/41 (46%), Gaps = 3/41 (7%)

Query: 31 NVLIIC---PTPSYSHQVPFIAIGKELVRRGHTVTMIGTDP 68
           +LI+    P            + +ELV+RGH VT+I   P
Sbjct: 1  KILILSQYFPPELGGGAFRTTELAEELVKRGHEVTVITGSP 41


>gnl|CDD|217329 pfam03033, Glyco_transf_28, Glycosyltransferase family 28
          N-terminal domain.  The glycosyltransferase family 28
          includes monogalactosyldiacylglycerol synthase (EC
          2.4.1.46) and UDP-N-acetylglucosamine transferase (EC
          2.4.1.-). This N-terminal domain contains the acceptor
          binding site and likely membrane association site. This
          family also contains a large number of proteins that
          probably have quite distinct activities.
          Length = 136

 Score = 32.7 bits (75), Expect = 0.17
 Identities = 12/26 (46%), Positives = 15/26 (57%), Gaps = 1/26 (3%)

Query: 43 HQVPFIAIGKELVRRGHTVTMIGTDP 68
          H  P +A+   L RRGH V + GT P
Sbjct: 11 HVFPAVALAWALRRRGHEVRL-GTPP 35


>gnl|CDD|178364 PLN02764, PLN02764, glycosyltransferase family protein.
          Length = 453

 Score = 33.9 bits (77), Expect = 0.19
 Identities = 22/68 (32%), Positives = 33/68 (48%), Gaps = 5/68 (7%)

Query: 323 VICRKWLPQHDILAHPKVKLFIMQGGLQSSQEAIHFGVPMIGIPFFADQDTNVRKLE--- 379
           V+   W+ Q  IL+HP V  F+   G  S  E++     ++ +P   DQ  N R L    
Sbjct: 319 VVWGGWVQQPLILSHPSVGCFVSHCGFGSMWESLLSDCQIVLVPQLGDQVLNTRLLSDEL 378

Query: 380 --SMDVAR 385
             S++VAR
Sbjct: 379 KVSVEVAR 386


>gnl|CDD|233284 TIGR01133, murG, undecaprenyldiphospho-muramoylpentapeptide
           beta-N-acetylglucosaminyltransferase.  RM 8449890 RT The
           final step of peptidoglycan subunit assembly in
           Escherichia coli occurs in the cytoplasm. RA Bupp K, van
           Heijenoort J. RL J Bacteriol 1993 Mar;175(6):1841-3
           [Cell envelope, Biosynthesis and degradation of murein
           sacculus and peptidoglycan].
          Length = 348

 Score = 32.6 bits (75), Expect = 0.40
 Identities = 16/58 (27%), Positives = 27/58 (46%), Gaps = 3/58 (5%)

Query: 354 EAIHFGVPMIGIPF-FA--DQDTNVRKLESMDVARFLEYENITAETLVTLMKSILYNE 408
           E    GVP I IP+ +A  DQ  N + LE +     +  + +  E L+  +  +L + 
Sbjct: 265 ELAAAGVPAILIPYPYAADDQYYNAKFLEDLGAGLVIRQKELLPEKLLEALLKLLLDP 322


>gnl|CDD|177858 PLN02208, PLN02208, glycosyltransferase family protein.
          Length = 442

 Score = 32.7 bits (74), Expect = 0.40
 Identities = 24/87 (27%), Positives = 42/87 (48%), Gaps = 4/87 (4%)

Query: 323 VICRKWLPQHDILAHPKVKLFIMQGGLQSSQEAIHFGVPMIGIPFFADQDTNVRKL-ESM 381
           V+   W+ Q  IL HP +  F+   G  +  E++     M+ IPF +DQ    R + E  
Sbjct: 313 VVWGGWVQQPLILDHPSIGCFVNHCGPGTIWESLVSDCQMVLIPFLSDQVLFTRLMTEEF 372

Query: 382 DVARFLEYEN---ITAETLVTLMKSIL 405
           +V+  +  E     + E+L   +KS++
Sbjct: 373 EVSVEVSREKTGWFSKESLSNAIKSVM 399


>gnl|CDD|227535 COG5210, COG5210, GTPase-activating protein [General function
           prediction only].
          Length = 496

 Score = 32.9 bits (75), Expect = 0.45
 Identities = 33/182 (18%), Positives = 61/182 (33%), Gaps = 39/182 (21%)

Query: 159 TAYLGFLPKLGYTQSMT-----LMERMNN------LFMQLYSKFYIRS----------RL 197
            AY  + P++GY Q M      L+  + +        ++L   + +            R 
Sbjct: 290 KAYSLYNPEVGYVQGMNFLAAPLLLVLESEEQAFWCLVKLLKNYGLPGYFLKNLSGLHRD 349

Query: 198 MKKQDEIMERYFGTRGLSGKQLEENKTLLFISTSWLLTYPRPVFPNTILLGPIHLNNPKP 257
           +K  D+++E       L    L E   LL  +  W LT     FP    L          
Sbjct: 350 LKVLDDLVEEL--DPELYEHLLREGVVLLMFAFRWFLTLFVREFPLEYAL---------- 397

Query: 258 LPQNLKD--WIEG-AKDGVIYFSLGTNMQSASLQEDKRKAIVDSFKQFPRHRIIWKWEED 314
               + D  ++EG +    +  ++   ++   L+ D  + +    KQ   H     W   
Sbjct: 398 ---RIWDCLFLEGSSMLFQLALAILKLLRDKLLKLDSDELLDLLLKQLFLHSGKEAWSSI 454

Query: 315 IL 316
           + 
Sbjct: 455 LK 456


>gnl|CDD|227017 COG4671, COG4671, Predicted glycosyl transferase [General function
           prediction only].
          Length = 400

 Score = 31.6 bits (72), Expect = 0.81
 Identities = 19/68 (27%), Positives = 34/68 (50%), Gaps = 3/68 (4%)

Query: 341 KLFIMQGGLQSSQEAIHFGVPMIGIPFFA---DQDTNVRKLESMDVARFLEYENITAETL 397
           +L +  GG  +  E + FG P + +P  A   +Q    ++LE + +   L  EN+T + L
Sbjct: 296 RLVVSMGGYNTVCEILSFGKPALIVPRAAPREEQLIRAQRLEELGLVDVLLPENLTPQNL 355

Query: 398 VTLMKSIL 405
              +K+ L
Sbjct: 356 ADALKAAL 363


>gnl|CDD|234474 TIGR04113, cas_csx17, CRISPR-associated protein Csx17, subtype
           Dpsyc.  Members of this protein family are found
           exclusively in CRISPR-associated (cas) type I system
           gene clusters of the Dpsyc subtype. Markers for that
           type include a variant form of cas3 (model TIGR02621)
           and the GSU0054-like protein family (model TIGR02165).
           This family occurs in less than half of known Dpsyc
           clusters.
          Length = 704

 Score = 32.0 bits (73), Expect = 0.83
 Identities = 17/65 (26%), Positives = 26/65 (40%), Gaps = 11/65 (16%)

Query: 255 PKPLPQNLKDWIEGAKDGVIYFSLGTNMQSASLQEDKRKAIVDSFKQF-----PRHRIIW 309
            +PLP    +W+    DG   F +   +  AS+ +      V   + F     P  R  W
Sbjct: 450 VRPLPLLSPEWVLADDDGSPEFRIALAL--ASIHDGP----VGPIRSFLAPVEPGRRGRW 503

Query: 310 KWEED 314
            WEE+
Sbjct: 504 VWEEE 508


>gnl|CDD|233077 TIGR00661, MJ1255, conserved hypothetical protein.  This model
           represents nearly the full length of MJ1255 from
           Methanococcus jannaschii and of an unpublished protein
           from Vibrio cholerae, as well as the C-terminal half of
           a protein from Methanobacterium thermoautotrophicum. A
           small region (~50 amino acids) within the domain appears
           related to a family of sugar transferases [Hypothetical
           proteins, Conserved].
          Length = 321

 Score = 31.4 bits (71), Expect = 1.1
 Identities = 19/50 (38%), Positives = 23/50 (46%), Gaps = 2/50 (4%)

Query: 342 LFIMQGGLQSSQEAIHFGVPMIGIPFFA--DQDTNVRKLESMDVARFLEY 389
           L I  GG     EA+  G P+I IP     +Q  N  KLE +     LEY
Sbjct: 250 LVITHGGFSLISEALSLGKPLIVIPDLGQFEQGNNAVKLEDLGCGIALEY 299


>gnl|CDD|215305 PLN02562, PLN02562, UDP-glycosyltransferase.
          Length = 448

 Score = 31.4 bits (71), Expect = 1.1
 Identities = 39/166 (23%), Positives = 66/166 (39%), Gaps = 36/166 (21%)

Query: 234 LTYPRPVFPNTILLGPIH------LNNPKPLPQNLK--DWIEGAK-DGVIYFSLGT---- 280
            +Y     P  + +GP+H      +  P    +++    W++  K + VIY S G+    
Sbjct: 227 ASYNNGQNPQILQIGPLHNQEATTITKPSFWEEDMSCLGWLQEQKPNSVIYISFGSWVSP 286

Query: 281 ----NMQSASLQEDKRKAIVDSFKQFPRHRIIW----KWEEDILPDLPSNVICR----KW 328
               N+++ +L      A+  S + F     IW     W E + P     V  +     W
Sbjct: 287 IGESNVRTLAL------ALEASGRPF-----IWVLNPVWREGLPPGYVERVSKQGKVVSW 335

Query: 329 LPQHDILAHPKVKLFIMQGGLQSSQEAIHFGVPMIGIPFFADQDTN 374
            PQ ++L H  V  ++   G  S+ EAI     ++  P   DQ  N
Sbjct: 336 APQLEVLKHQAVGCYLTHCGWNSTMEAIQCQKRLLCYPVAGDQFVN 381


>gnl|CDD|222200 pfam13528, Glyco_trans_1_3, Glycosyl transferase family 1. 
          Length = 317

 Score = 31.1 bits (71), Expect = 1.3
 Identities = 13/59 (22%), Positives = 26/59 (44%), Gaps = 2/59 (3%)

Query: 341 KLFIMQGGLQSSQEAIHFGVPMIGIPFFA--DQDTNVRKLESMDVARFLEYENITAETL 397
              I   G +   EA++ G P++ +P     +Q +N   LE +     ++ E++    L
Sbjct: 251 SAVICNAGFELLSEALYLGKPLLLVPLDGQFEQTSNALYLERLGYGIVMDMEDLDPAVL 309


>gnl|CDD|99961 cd03785, GT1_MurG, MurG is an N-acetylglucosaminyltransferase, the
           last enzyme involved in the intracellular phase of
           peptidoglycan biosynthesis. It transfers
           N-acetyl-D-glucosamine (GlcNAc) from UDP-GlcNAc to the
           C4 hydroxyl of a lipid-linked N-acetylmuramoyl
           pentapeptide (NAM). The resulting disaccharide is then
           transported across the cell membrane, where it is
           polymerized into NAG-NAM cell-wall repeat structure.
           MurG belongs to the GT-B structural superfamily of
           glycoslytransferases, which have characteristic N- and
           C-terminal domains, each containing a typical Rossmann
           fold. The two domains have high structural homology
           despite minimal sequence homology.  The large cleft that
           separates the two domains includes the catalytic center
           and permits a high degree of flexibility.
          Length = 350

 Score = 30.9 bits (71), Expect = 1.6
 Identities = 17/59 (28%), Positives = 24/59 (40%), Gaps = 4/59 (6%)

Query: 354 EAIHFGVPMIGIPF-FA---DQDTNVRKLESMDVARFLEYENITAETLVTLMKSILYNE 408
           E    G+P I IP  +A    Q  N R L     A  +  E +T E L   +  +L + 
Sbjct: 267 ELAALGLPAILIPLPYAADDHQTANARALVKAGAAVLIPQEELTPERLAAALLELLSDP 325



 Score = 28.2 bits (64), Expect = 9.3
 Identities = 10/24 (41%), Positives = 14/24 (58%)

Query: 43 HQVPFIAIGKELVRRGHTVTMIGT 66
          H  P +A+ +EL  RG  V  +GT
Sbjct: 12 HIFPALALAEELRERGAEVLFLGT 35


>gnl|CDD|225462 COG2910, COG2910, Putative NADH-flavin reductase [General
          function prediction only].
          Length = 211

 Score = 29.6 bits (67), Expect = 2.8
 Identities = 10/24 (41%), Positives = 13/24 (54%)

Query: 49 AIGKELVRRGHTVTMIGTDPLKEP 72
           I KE ++RGH VT I  +  K  
Sbjct: 15 RILKEALKRGHEVTAIVRNASKLA 38


>gnl|CDD|112137 pfam03308, ArgK, ArgK protein.  The ArgK protein acts as an
          ATPase enzyme and as a kinase, and phosphorylates
          periplasmic binding proteins involved in the LAO
          (lysine, arginine, ornithine)/AO transport systems.
          Length = 267

 Score = 29.6 bits (67), Expect = 3.0
 Identities = 11/20 (55%), Positives = 14/20 (70%)

Query: 49 AIGKELVRRGHTVTMIGTDP 68
          A+G EL RRGH V ++  DP
Sbjct: 48 ALGMELRRRGHRVAVLAVDP 67


>gnl|CDD|217893 pfam04101, Glyco_tran_28_C, Glycosyltransferase family 28
           C-terminal domain.  The glycosyltransferase family 28
           includes monogalactosyldiacylglycerol synthase (EC
           2.4.1.46) and UDP-N-acetylglucosamine transferase (EC
           2.4.1.-). Structural analysis suggests the C-terminal
           domain contains the UDP-GlcNAc binding site.
          Length = 167

 Score = 28.8 bits (65), Expect = 4.0
 Identities = 16/60 (26%), Positives = 23/60 (38%), Gaps = 4/60 (6%)

Query: 354 EAIHFGVPMIGIP----FFADQDTNVRKLESMDVARFLEYENITAETLVTLMKSILYNET 409
           E +  G P I +P        QD N  +L     A  L  + +T E LV  +  +L    
Sbjct: 87  ELLALGKPAILVPRPKAAGEHQDNNALELVKAGAALVLLQKELTPEKLVEALLKLLLKPL 146


>gnl|CDD|187705 cd09767, Csx17_I-U, CRISPR/Cas system-associated protein Csx17.
           CRISPR (Clustered Regularly Interspaced Short
           Palindromic Repeats) and associated Cas proteins
           comprise a system for heritable host defense by
           prokaryotic cells against phage and other foreign DNA;
           Large proteins; Predicted subunit of the Cascade
           complex;.
          Length = 652

 Score = 29.3 bits (66), Expect = 5.0
 Identities = 20/62 (32%), Positives = 29/62 (46%), Gaps = 3/62 (4%)

Query: 254 NPKPLPQNLKDWIEGAKDGVIYFSLGTNMQSASLQEDKRKAIVDSFKQFPRHRII-WKWE 312
             +PLP    DWIE A D    F +   +  AS+++  +K    S    P +R + WKW 
Sbjct: 437 VTRPLPGLSPDWIEAADDLSPEFRIAAAL--ASIRDGGKKGDRTSLGPAPINRRLRWKWV 494

Query: 313 ED 314
           E 
Sbjct: 495 EP 496


>gnl|CDD|240645 cd12168, Mand_dh_like, D-Mandelate Dehydrogenase-like
           dehydrogenases.  D-Mandelate dehydrogenase (D-ManDH),
           identified as an enzyme that interconverts
           benzoylformate and D-mandelate, is a D-2-hydroxyacid
           dehydrogenase family member that catalyzes the
           conversion of c3-branched 2-ketoacids. D-ManDH exhibits
           broad substrate specificities for 2-ketoacids with large
           hydrophobic side chains, particularly those with
           C3-branched side chains. 2-hydroxyacid dehydrogenases
           catalyze the conversion of a wide variety of D-2-hydroxy
           acids to their corresponding keto acids. The general
           mechanism is (R)-lactate + acceptor to pyruvate +
           reduced acceptor. Glycerate dehydrogenase catalyzes the
           reaction (R)-glycerate + NAD+ to hydroxypyruvate + NADH
           + H+. Formate/glycerate and related dehydrogenases of
           the D-specific 2-hydroxyacid dehydrogenase superfamily
           include groups such as formate dehydrogenase, glycerate
           dehydrogenase, L-alanine dehydrogenase, and
           S-adenosylhomocysteine hydrolase. Despite often low
           sequence identity, these proteins typically have a
           characteristic arrangement of 2 similar subdomains of
           the alpha/beta Rossmann fold NAD+ binding form. The NAD+
           binding domain is inserted within the linear sequence of
           the mostly N-terminal catalytic domain, which has a
           similar domain structure to the internal NAD binding
           domain. Structurally, these domains are connected by
           extended alpha helices and create a cleft in which NAD
           is bound, primarily to the C-terminal portion of the 2nd
           (internal) domain.
          Length = 321

 Score = 28.7 bits (65), Expect = 6.1
 Identities = 19/84 (22%), Positives = 34/84 (40%), Gaps = 12/84 (14%)

Query: 246 LLGPIHLNNPKPLPQNLKDWIEGAKDG-VIYFSLGTNMQ-SASLQEDKRKAIVDSFKQFP 303
           LLG               +W E +    VIY + GT  +   +L+E K    V  ++ F 
Sbjct: 6   LLGDPI--------HAHDEWKELSSIAEVIYPTSGTREEFIEALKEGKYGDFVAIYRTFG 57

Query: 304 RHRIIWKWEEDILPDLPSNV--IC 325
                  ++E+++  LP ++  I 
Sbjct: 58  SAGETGPFDEELISPLPPSLKIIA 81


>gnl|CDD|129833 TIGR00750, lao, LAO/AO transport system ATPase.  In E. coli,
          mutation of this kinase blocks phosphorylation of two
          transporter system periplasmic binding proteins and
          consequently inhibits those transporters. This kinase
          is also found in Gram-positive bacteria, archaea, and
          the roundworm C. elegans. It may have a more general,
          but still unknown function. Mutations have also been
          found that do not phosphorylate the periplasmic binding
          proteins, yet still allow transport. The ATPase
          activity of this protein seems to be necessary, however
          [Transport and binding proteins, Amino acids, peptides
          and amines, Regulatory functions, Protein
          interactions].
          Length = 300

 Score = 29.0 bits (65), Expect = 6.2
 Identities = 11/20 (55%), Positives = 13/20 (65%)

Query: 49 AIGKELVRRGHTVTMIGTDP 68
          A+G EL RRG  V +I  DP
Sbjct: 53 ALGMELRRRGLRVAVIAVDP 72


>gnl|CDD|233649 TIGR01945, rnfC, electron transport complex, RnfABCDGE type, C
           subunit.  The six subunit complex RnfABCDGE in
           Rhodobacter capsulatus encodes an apparent NADH
           oxidoreductase responsible for electron transport to
           nitrogenase, necessary for nitrogen fixation. A closely
           related complex in E. coli, RsxABCDGE (Reducer of SoxR),
           reduces the 2Fe-2S-containing superoxide sensor SoxR,
           active as a transcription factor when oxidized. This
           family of putative NADH oxidoreductase complexes exists
           in many of the same species as the related NQR, a
           Na(+)-translocating NADH-quinone reductase, but is
           distinct. This model describes the C subunit [Energy
           metabolism, Electron transport].
          Length = 435

 Score = 28.5 bits (64), Expect = 8.6
 Identities = 14/47 (29%), Positives = 19/47 (40%), Gaps = 8/47 (17%)

Query: 320 PSNVICRKWLPQHDILAHPKVKLFIMQGGLQSSQEAIHFGVPMIGIP 366
           P N+      P  DILA          GG +   E +  G PM+G+ 
Sbjct: 296 PKNLWVLIGTPVSDILAF--------CGGFREKPERLIMGGPMMGLA 334


>gnl|CDD|99974 cd03801, GT1_YqgM_like, This family is most closely related to the
           GT1 family of glycosyltransferases and named after YqgM
           in Bacillus licheniformis about which little is known.
           Glycosyltransferases catalyze the transfer of sugar
           moieties from activated donor molecules to specific
           acceptor molecules, forming glycosidic bonds. The
           acceptor molecule can be a lipid, a protein, a
           heterocyclic compound, or another carbohydrate residue.
           This group of glycosyltransferases is most closely
           related to the previously defined glycosyltransferase
           family 1 (GT1). The members of this family may transfer
           UDP, ADP, GDP, or CMP linked sugars. The diverse
           enzymatic activities among members of this family
           reflect a wide range of biological functions. The
           protein structure available for this family has the GTB
           topology, one of the two protein topologies observed for
           nucleotide-sugar-dependent glycosyltransferases. GTB
           proteins have distinct N- and C- terminal domains each
           containing a typical Rossmann fold. The two domains have
           high structural homology despite minimal sequence
           homology. The large cleft that separates the two domains
           includes the catalytic center and permits a high degree
           of flexibility. The members of this family are found
           mainly in certain bacteria and archaea.
          Length = 374

 Score = 28.5 bits (64), Expect = 8.7
 Identities = 15/86 (17%), Positives = 29/86 (33%), Gaps = 3/86 (3%)

Query: 31  NVLIICPTPSYSH---QVPFIAIGKELVRRGHTVTMIGTDPLKEPPVNYTDIDLSFSYKY 87
            +L++ P    S    +   + + + L  RGH VT++       P        +      
Sbjct: 1   KILLVTPEYPPSVGGAERHVLELARALAARGHEVTVLTPGDGGLPDEEEVGGIVVVRPPP 60

Query: 88  FKPQLQKGEVLPDAVDNQRRLTGYEF 113
                +   +L  A+  +R L    F
Sbjct: 61  LLRVRRLLLLLLLALRLRRLLRRERF 86


>gnl|CDD|215261 PLN02470, PLN02470, acetolactate synthase.
          Length = 585

 Score = 28.5 bits (64), Expect = 8.9
 Identities = 13/30 (43%), Positives = 18/30 (60%), Gaps = 5/30 (16%)

Query: 45  VPFIAIGKELVRRGHTVTMIGTDPLKEPPV 74
           VP +AI  ++ RR     MIGTD  +E P+
Sbjct: 103 VPLVAITGQVPRR-----MIGTDAFQETPI 127


>gnl|CDD|153133 cd01583, IPMI, 3-isopropylmalate dehydratase catalyzes the
           isomerization between 2-isopropylmalate and
           3-isopropylmalate.  Aconatase-like catalytic domain of
           3-isopropylmalate dehydratase and related
           uncharacterized proteins. 3-isopropylmalate dehydratase
           catalyzes the isomerization between 2-isopropylmalate
           and 3-isopropylmalate, via the formation of
           2-isopropylmaleate 3-isopropylmalate. IPMI is involved
           in fungal and bacterial leucine biosynthesis and is also
           found in eukaryotes.
          Length = 382

 Score = 28.3 bits (64), Expect = 9.2
 Identities = 14/53 (26%), Positives = 25/53 (47%), Gaps = 5/53 (9%)

Query: 125 TEDQLKSQQMQQFFKYIDENHVKFDLIIYEGLLHTAYLGFLPKLGYTQ-SMTL 176
           T D   ++Q++   K+  E  + F  +  +G+ H      LP+ G T   MT+
Sbjct: 40  TPDIKAAEQVKTLRKFAKEFGINFFDVGRQGICHVI----LPEKGLTLPGMTI 88


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.323    0.139    0.418 

Gapped
Lambda     K      H
   0.267   0.0723    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 24,998,082
Number of extensions: 2472771
Number of successful extensions: 2381
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2363
Number of HSP's successfully gapped: 71
Length of query: 477
Length of database: 10,937,602
Length adjustment: 101
Effective length of query: 376
Effective length of database: 6,457,848
Effective search space: 2428150848
Effective search space used: 2428150848
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 61 (27.3 bits)