RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy14941
         (513 letters)



>gnl|CDD|223071 PHA03392, egt, ecdysteroid UDP-glucosyltransferase; Provisional.
          Length = 507

 Score =  258 bits (662), Expect = 3e-80
 Identities = 154/521 (29%), Positives = 256/521 (49%), Gaps = 56/521 (10%)

Query: 11  LISAAILLMCIRIQQCSKILAIFPTPAKSHQIIFDTILVELYQRGHDLTVITQYPETLVH 70
           +I   +LL+ +   + ++ILA+FPTPA SH  +F   +  L +RGH++TVI   P   V+
Sbjct: 5   IIILLLLLLLLSGVRAARILAVFPTPAYSHHSVFKVYVEALAERGHNVTVIK--PTLRVY 62

Query: 71  Y-----ERMKVLDIKGTHTY------NSTIEDIYELSADSIKRIHIN-------FIDQEI 112
           Y       +  +D   +  Y      +S +     + ADS      N         DQ  
Sbjct: 63  YASHLCGNITEIDASLSVEYFKKLVKSSAVFRKRGVVADSSTVTADNYMGLVRMISDQ-- 120

Query: 113 QVENIFLHENMKS-IWNMENKYDLIITEMFLTDAFLVIPYLY-KVPYISIASSTLHPQHS 170
                F   N+K+ I N  NK+DL++TE FL D  LV  +L+   P I I+S     ++ 
Sbjct: 121 -----FDLPNVKNLIANKNNKFDLLVTEAFL-DYPLVFSHLFGDAPVIQISSGYGLAENF 174

Query: 171 ERLGLPD-NPSYIP---------SYVSAYTDHMSFTERLSNTFVGLYYKWYYDYKSHGAA 220
           E +G    +P Y P           V    + +    RL N F  L              
Sbjct: 175 ETMGAVSRHPVYYPNLWRSKFGNLNVWETINEIYTELRLYNEFSLL----------ADEQ 224

Query: 221 NRIIHKYF-PEIPRIQDLLNTCSLTLVNTHHTINIARPLPANVVEIGGIH--VKPAKKLN 277
           N+++ + F P+ P I++L N   L  VN H   +  RP+P +V  +GG+H   KP + L+
Sbjct: 225 NKLLKQQFGPDTPTIRELRNRVQLLFVNVHPVFDNNRPVPPSVQYLGGLHLHKKPPQPLD 284

Query: 278 EEMERFLNESHNGVIYFSMGSMLKTSSFPPDKFKAFLKAFSKIPQRVLWKFEDNDTSIFK 337
           + +E FLN S NGV+Y S GS + T+    +  +  L+ F K+P  VLWK++    +I  
Sbjct: 285 DYLEEFLNNSTNGVVYVSFGSSIDTNDMDNEFLQMLLRTFKKLPYNVLWKYDGEVEAINL 344

Query: 338 PYKNIRTSSWMPQRDIFAHPNMKLFISHGGLLGITEAVYEGIPVLGIPVFGDQWANIKKL 397
           P  N+ T  W PQR +  H N+K F++ GG+    EA+   +P++G+P+ GDQ+ N  K 
Sbjct: 345 P-ANVLTQKWFPQRAVLKHKNVKAFVTQGGVQSTDEAIDALVPMVGLPMMGDQFYNTNKY 403

Query: 398 ESLKAGKLLPYLEITEETVSDA-LKIVLSPEYKENAEDLGTRFRDRPQSPLEVAIYWIEY 456
             L  G+ L  + ++   +  A + ++ +P+Y++N ++L    R +P +PL  AI++ E+
Sbjct: 404 VELGIGRALDTVTVSAAQLVLAIVDVIENPKYRKNLKELRHLIRHQPMTPLHKAIWYTEH 463

Query: 457 VIKYNG-AYHLQSAAVKLTWYQYYLLDIALVLIVGLVSVCF 496
           VI+       L++ A  +++  Y++  I + L+   V    
Sbjct: 464 VIRNKHGNTSLKTKAANVSYSDYFMSYILVPLVTFTVMNHL 504


>gnl|CDD|201077 pfam00201, UDPGT, UDP-glucoronosyl and UDP-glucosyl transferase. 
          Length = 500

 Score =  244 bits (624), Expect = 1e-74
 Identities = 156/496 (31%), Positives = 252/496 (50%), Gaps = 42/496 (8%)

Query: 28  KILAIFPTPAKSHQIIFDTILVELYQRGHDLTVITQYPETLVHYERMKVLDIKGTHTYNS 87
           K+L ++P    SH +    IL+EL QRGH++TV+   P   +        ++K   TY  
Sbjct: 2   KVL-VWPM-DGSHWMNMKGILLELVQRGHEVTVLR--PSASILIGPALPSNLKFE-TY-P 55

Query: 88  TIEDIYELSADSIKRIHINFIDQE-----------IQVENIFLHENMKS-IWNM------ 129
           T     EL     KR+   F++             +Q  +     + K  + N       
Sbjct: 56  TSYTKEELENLFPKRVMNWFMEAAEAGTVWSYFSALQEYSDGARVSCKELVGNKFLMTKL 115

Query: 130 -ENKYDLIITEMFLTDAFLVIPYLYKVPYI----SIASSTLHPQHSERLGLPDNPSYIPS 184
            E+ +D+++ +       L+   L  +P +     +       +     GLP  PSY+P 
Sbjct: 116 QESSFDVVLADPVWPCGALLAELL-HIPTVYSLRFVPGYAAE-KADG--GLPAPPSYVPV 171

Query: 185 YVSAYTDHMSFTERLSNTFVGLYYK-WYYDYKSHGAANRIIHKYFPEIPRIQDLLNTCSL 243
            +S  +D M+F ER+ N  + LY+  W+  +              P    + +L++  S 
Sbjct: 172 RLSDLSDGMTFGERVKNMLIMLYFDFWFQRFPKKWDQFASELLGRPVT--LPELMSKASA 229

Query: 244 TLVNTHHTINIARPLPANVVEIGGIHVKPAKKLNEEMERFLNES-HNGVIYFSMGSMLKT 302
            L+  +  +   RPL  N+  IGG++ KPAK L +EME F+  S  +GV+ FS+GSM+  
Sbjct: 230 WLLRNYWDLEFPRPLLPNMEFIGGLNCKPAKPLPQEMEAFVQSSGEHGVVVFSLGSMV-- 287

Query: 303 SSFPPDKFKAFLKAFSKIPQRVLWKFEDNDTSIFKPYKNIRTSSWMPQRDIFAHPNMKLF 362
           S+ P +K      A ++IPQ+VLW+F+    S      N R   W+PQ D+  HP  + F
Sbjct: 288 SNIPEEKANEIASALAQIPQKVLWRFDGTKPSTLGR--NTRLVKWLPQNDLLGHPKTRAF 345

Query: 363 ISHGGLLGITEAVYEGIPVLGIPVFGDQWANIKKLESLKAGKLLPYLEITEETVSDALKI 422
           ++H G  G+ EA+  G+P++G+P+FGDQ  N K +E+  A   L  L +T E + +ALK 
Sbjct: 346 VTHAGSNGVYEAICHGVPMVGMPLFGDQMDNAKHMEAKGAAVTLNVLTMTSEDLLNALKT 405

Query: 423 VLS-PEYKENAEDLGTRFRDRPQSPLEVAIYWIEYVIKYNGAYHLQSAAVKLTWYQYYLL 481
           V++ P YKEN   L +   D+P  PL+ A++WIE+V+++ GA HL+ AA  LTWYQY+ L
Sbjct: 406 VINDPSYKENIMRLSSIHHDQPVKPLDRAVFWIEFVMRHKGAKHLRPAAHDLTWYQYHSL 465

Query: 482 DIALVLIVGLVSVCFV 497
           D+   L+  + +V F+
Sbjct: 466 DVIGFLLACVATVAFI 481


>gnl|CDD|224732 COG1819, COG1819, Glycosyl transferases, related to
           UDP-glucuronosyltransferase [Carbohydrate transport and
           metabolism / Signal transduction mechanisms].
          Length = 406

 Score =  103 bits (259), Expect = 6e-24
 Identities = 81/420 (19%), Positives = 144/420 (34%), Gaps = 56/420 (13%)

Query: 28  KILAIFPTPAKSHQIIFDTILVELYQRGHDLTVITQYPETLVHYERMKVLDIKGTHTYNS 87
           KIL      A  H      +  EL +RGH++   +   +     E      +        
Sbjct: 3   KIL-FVVCGAYGHVNPCLALGKELRRRGHEVVFAS-TGKFKEFVEAA---GLAFVAYPIR 57

Query: 88  TIEDIYELSADSIKRIHINFIDQEIQVENIFLHENMKSIWNMENKYDLIITEMFLTDAFL 147
             E   E    +  +     + Q+ +     L E ++ +       DL++ +  L+    
Sbjct: 58  DSELATEDGKFAGVKSFRRLL-QQFKKLIRELLELLRELEP-----DLVVDDARLSLGLA 111

Query: 148 VIPYLYKVPYISI--ASSTLHPQHSERLGLPDNPSYIPSYVSAYTDHMSFTERLSNTFVG 205
               L  +P + I  A  T  P     L        +P  +      +      +     
Sbjct: 112 AR--LLGIPVVGINVAPYTPLPAAGLPLPPVGIAGKLPIPLYPLPPRLVRPLIFA----- 164

Query: 206 LYYKWYYDYKSHGAANRIIHKYFPEIPRIQDLLNTCSLTLVNTHHTINIA---------R 256
                           R        +P I          L  +   + IA          
Sbjct: 165 -------RSWLPKLVVRRNLGLELGLPNI--------RRLFASGPLLEIAYTDVLFPPGD 209

Query: 257 PLPANVVEIGGIHVKPAKKLNEEMERFLNESHNGVIYFSMGSMLKTSSFPPDKFKAFLKA 316
            LP     IG +  + A   NE       +    ++Y S+G++              L+A
Sbjct: 210 RLPFIGPYIGPLLGEAA---NELPYWIPADR--PIVYVSLGTVGNAVEL----LAIVLEA 260

Query: 317 FSKIPQRVLWKFEDNDTSIFKPYKNIRTSSWMPQRDIFAHPNMKLFISHGGLLGITEAVY 376
            + +  RV+        ++     N+  + ++PQ ++    +    I HGG    +EA+Y
Sbjct: 261 LADLDVRVIVSLGGARDTLVNVPDNVIVADYVPQLELLPRAD--AVIHHGGAGTTSEALY 318

Query: 377 EGIPVLGIPVFGDQWANIKKLESLKAGKLLPYLEITEETVSDALKIVLS-PEYKENAEDL 435
            G+P++ IP   DQ  N +++E L AG  LP+ E+TEE +  A+  VL+   Y+  AE L
Sbjct: 319 AGVPLVVIPDGADQPLNAERVEELGAGIALPFEELTEERLRAAVNEVLADDSYRRAAERL 378


>gnl|CDD|99960 cd03784, GT1_Gtf_like, This family includes the Gtfs, a group of
           homologous glycosyltransferases involved in the final
           stages of the biosynthesis of antibiotics vancomycin and
           related chloroeremomycin. Gtfs transfer sugar moieties
           from an activated NDP-sugar donor to the oxidatively
           cross-linked heptapeptide core of vancomycin group
           antibiotics. The core structure is important for the
           bioactivity of the antibiotics.
          Length = 401

 Score = 91.7 bits (228), Expect = 5e-20
 Identities = 59/312 (18%), Positives = 98/312 (31%), Gaps = 32/312 (10%)

Query: 132 KYDLIITEMFLTDAFLVIPYLYKVPYISIASSTLHPQHSERLGLPDNPSYIPSYVSAYTD 191
             DL++ +       +    L  +P + +      P            S  P  +     
Sbjct: 104 GPDLVVADPLAFAGAVAAEAL-GIPAVRLLLGPDTP-----------TSAFPPPLGRAN- 150

Query: 192 HMSFTERLSNTFVGLYYKWYYD-YKSHGAANRIIHKYFPEIPRIQDLLNTCSLTLVNTHH 250
              +    +  +  L   W     +  G     +      +P +                
Sbjct: 151 LRLYALLEAELWQDLLGAWLRARRRRLGLPPLSLLDGSD-VPEL--------YGFSPAVL 201

Query: 251 TINIARPLPANVVEIGGIHVKPAKKLNEEMERFLNESHNGVIYFSMGSMLKTSSFPPDKF 310
                 P    V   G   V        E+  FL  +    +Y   GSM      P    
Sbjct: 202 PPPPDWPRFDLVTGYGFRDVPYNGPPPPELWLFL-AAGRPPVYVGFGSM--VVRDPEALA 258

Query: 311 KAFLKAFSKIPQRVLWKFEDNDTSIFKPYKNIRTSSWMPQRDIFAHPNMKLFISHGGLLG 370
           +  ++A + + QR +               N+R   ++P   +   P     + HGG   
Sbjct: 259 RLDVEAVATLGQRAILSLGWGGLGAEDLPDNVRVVDFVPHDWLL--PRCAAVVHHGGAGT 316

Query: 371 ITEAVYEGIPVLGIPVFGDQ--WANIKKLESLKAGKLLPYLEITEETVSDALKIVLSPEY 428
              A+  G+P L +P FGDQ  WA    +  L AG  L   E+T E ++ AL+ +L P  
Sbjct: 317 TAAALRAGVPQLVVPFFGDQPFWAAR--VAELGAGPALDPRELTAERLAAALRRLLDPPS 374

Query: 429 KENAEDLGTRFR 440
           +  A  L  R R
Sbjct: 375 RRRAAALLRRIR 386


>gnl|CDD|233407 TIGR01426, MGT, glycosyltransferase, MGT family.  This model
           describes the MGT (macroside glycosyltransferase)
           subfamily of the UDP-glucuronosyltransferase family.
           Members include a number of glucosyl transferases for
           macrolide antibiotic inactivation, but also include
           transferases of glucose-related sugars for macrolide
           antibiotic production [Cellular processes, Toxin
           production and resistance].
          Length = 392

 Score = 77.0 bits (190), Expect = 3e-15
 Identities = 40/153 (26%), Positives = 72/153 (47%), Gaps = 8/153 (5%)

Query: 291 VIYFSMGSMLKTSSFPPDKFKAFLKAFSKIPQRVLWKFEDN-DTSIFKPY-KNIRTSSWM 348
           V+  S+G++       P  ++  ++AF  +   V+       D +       N+    W+
Sbjct: 227 VVLISLGTVFNNQ---PSFYRTCVEAFRDLDWHVVLSVGRGVDPADLGELPPNVEVRQWV 283

Query: 349 PQRDIFAHPNMKLFISHGGLLGITEAVYEGIPVLGIPVFGDQWANIKKLESLKAGKLLPY 408
           PQ +I    +   FI+HGG+    EA++ G+P++ +P   DQ    +++  L  G+ LP 
Sbjct: 284 PQLEILKKAD--AFITHGGMNSTMEALFNGVPMVAVPQGADQPMTARRIAELGLGRHLPP 341

Query: 409 LEITEETVSDALKIVLS-PEYKENAEDLGTRFR 440
            E+T E + +A+  VLS P Y E    +    R
Sbjct: 342 EEVTAEKLREAVLAVLSDPRYAERLRKMRAEIR 374


>gnl|CDD|215112 PLN02167, PLN02167, UDP-glycosyltransferase family protein.
          Length = 475

 Score = 52.1 bits (125), Expect = 4e-07
 Identities = 32/129 (24%), Positives = 58/129 (44%), Gaps = 16/129 (12%)

Query: 278 EEMERFLNES-HNGVIYFSMGSMLKTSSFPPDKFKAFLKAFSKIPQRVLWKFEDNDTSIF 336
           + + R+L++   + V++   GS+    S P  + K   +A   +  R LW    N     
Sbjct: 268 DRIMRWLDDQPESSVVFLCFGSL---GSLPAPQIKEIAQALELVGCRFLWSIRTNPAEYA 324

Query: 337 KPYKNI------RTS------SWMPQRDIFAHPNMKLFISHGGLLGITEAVYEGIPVLGI 384
            PY+ +      R         W PQ +I AH  +  F+SH G   + E+++ G+P+   
Sbjct: 325 SPYEPLPEGFMDRVMGRGLVCGWAPQVEILAHKAIGGFVSHCGWNSVLESLWFGVPIATW 384

Query: 385 PVFGDQWAN 393
           P++ +Q  N
Sbjct: 385 PMYAEQQLN 393


>gnl|CDD|215084 PLN00164, PLN00164, glucosyltransferase; Provisional.
          Length = 480

 Score = 50.4 bits (121), Expect = 1e-06
 Identities = 15/48 (31%), Positives = 30/48 (62%)

Query: 346 SWMPQRDIFAHPNMKLFISHGGLLGITEAVYEGIPVLGIPVFGDQWAN 393
           +W PQ++I AH  +  F++H G   + E+++ G+P+   P++ +Q  N
Sbjct: 345 TWAPQKEILAHAAVGGFVTHCGWNSVLESLWHGVPMAPWPLYAEQHLN 392


>gnl|CDD|178589 PLN03015, PLN03015, UDP-glucosyl transferase.
          Length = 470

 Score = 49.7 bits (118), Expect = 2e-06
 Identities = 23/94 (24%), Positives = 51/94 (54%), Gaps = 5/94 (5%)

Query: 345 SSWMPQRDIFAHPNMKLFISHGGLLGITEAVYEGIPVLGIPVFGDQWANIKKLE-----S 399
           + W PQ +I +H ++  F+SH G   + E++ +G+P++  P++ +QW N   L      +
Sbjct: 340 TQWAPQVEILSHRSIGGFLSHCGWSSVLESLTKGVPIVAWPLYAEQWMNATLLTEEIGVA 399

Query: 400 LKAGKLLPYLEITEETVSDALKIVLSPEYKENAE 433
           ++  +L     I  E V+  ++ +++ E +E  +
Sbjct: 400 VRTSELPSEKVIGREEVASLVRKIVAEEDEEGQK 433


>gnl|CDD|178170 PLN02555, PLN02555, limonoid glucosyltransferase.
          Length = 480

 Score = 47.9 bits (114), Expect = 9e-06
 Identities = 30/97 (30%), Positives = 47/97 (48%), Gaps = 12/97 (12%)

Query: 347 WMPQRDIFAHPNMKLFISHGGLLGITEAVYEGIPVLGIPVFGDQWANIKKL-ESLKAG-- 403
           W PQ  + AHP++  F++H G     EA+  G+PV+  P +GDQ  +   L +  K G  
Sbjct: 344 WCPQEKVLAHPSVACFVTHCGWNSTMEALSSGVPVVCFPQWGDQVTDAVYLVDVFKTGVR 403

Query: 404 --------KLLPYLEITEETVSDALKIVLSPEYKENA 432
                   KL+   E+  E + +A     + E K+NA
Sbjct: 404 LCRGEAENKLITREEV-AECLLEATVGEKAAELKQNA 439


>gnl|CDD|215127 PLN02210, PLN02210, UDP-glucosyl transferase.
          Length = 456

 Score = 47.7 bits (113), Expect = 1e-05
 Identities = 31/119 (26%), Positives = 52/119 (43%), Gaps = 12/119 (10%)

Query: 280 MERFLNESHNGVIYFSMGSMLKTSSFPPDKFKAFLKAFSKIPQRVLW------KFEDNDT 333
           ME    ++ + V+Y S GSML++     ++ +   KA        LW      K ++   
Sbjct: 260 MEWLDKQARSSVVYISFGSMLESLE---NQVETIAKALKNRGVPFLWVIRPKEKAQNVQV 316

Query: 334 --SIFKPYKNIRTSSWMPQRDIFAHPNMKLFISHGGLLGITEAVYEGIPVLGIPVFGDQ 390
              + K  + +    W PQ  I +H  +  F++H G     E V  G+PV+  P + DQ
Sbjct: 317 LQEMVKEGQGVVLE-WSPQEKILSHMAISCFVTHCGWNSTIETVVAGVPVVAYPSWTDQ 374


>gnl|CDD|215247 PLN02448, PLN02448, UDP-glycosyltransferase family protein.
          Length = 459

 Score = 46.5 bits (111), Expect = 2e-05
 Identities = 35/110 (31%), Positives = 49/110 (44%), Gaps = 6/110 (5%)

Query: 289 NGVIYFSMGSMLKTSSFPPDKFKAFLKAFSKIPQRVLWKFEDNDTSIFKPY--KNIRTSS 346
             V+Y S+GS L  SS   D+  A L+  S +  R LW     + S  K           
Sbjct: 274 GSVLYVSLGSFLSVSSAQMDEIAAGLRD-SGV--RFLWVARG-EASRLKEICGDMGLVVP 329

Query: 347 WMPQRDIFAHPNMKLFISHGGLLGITEAVYEGIPVLGIPVFGDQWANIKK 396
           W  Q  +  H ++  F +H G     EAV+ G+P+L  P+F DQ  N K 
Sbjct: 330 WCDQLKVLCHSSVGGFWTHCGWNSTLEAVFAGVPMLTFPLFWDQPLNSKL 379


>gnl|CDD|178032 PLN02410, PLN02410, UDP-glucoronosyl/UDP-glucosyl transferase
           family protein.
          Length = 451

 Score = 46.2 bits (109), Expect = 3e-05
 Identities = 44/183 (24%), Positives = 74/183 (40%), Gaps = 27/183 (14%)

Query: 240 TCSLTLVNTHHTIN------IARPLPANVVEIGGIHV---KPAKKLNEE---MERFLNES 287
           T S  ++NT   +       + + L   V  IG +H+    P   L E    +E    + 
Sbjct: 203 TASSVIINTASCLESSSLSRLQQQLQIPVYPIGPLHLVASAPTSLLEENKSCIEWLNKQK 262

Query: 288 HNGVIYFSMGSMLKTSSFPPDKFKAFLKAFSKIPQRVLWKFEDNDT-------SIFKPYK 340
            N VI+ S+GS+   +    ++            Q+ LW              S+ K + 
Sbjct: 263 KNSVIFVSLGSL---ALMEINEVMETASGLDSSNQQFLWVIRPGSVRGSEWIESLPKEFS 319

Query: 341 NIRTS-----SWMPQRDIFAHPNMKLFISHGGLLGITEAVYEGIPVLGIPVFGDQWANIK 395
            I +       W PQ+++ +HP +  F SH G     E++ EG+P++  P   DQ  N +
Sbjct: 320 KIISGRGYIVKWAPQKEVLSHPAVGGFWSHCGWNSTLESIGEGVPMICKPFSSDQKVNAR 379

Query: 396 KLE 398
            LE
Sbjct: 380 YLE 382


>gnl|CDD|223779 COG0707, MurG, UDP-N-acetylglucosamine:LPS N-acetylglucosamine
           transferase [Cell envelope biogenesis, outer membrane].
          Length = 357

 Score = 45.0 bits (107), Expect = 6e-05
 Identities = 27/87 (31%), Positives = 40/87 (45%), Gaps = 9/87 (10%)

Query: 361 LFISHGGLLGITEAVYEGIPVLGIP----VFGDQWANIKKLESLKAGKLLPYLEITEETV 416
           L IS  G L I E +  G+P + +P      G Q  N K LE   A  ++   E+T E +
Sbjct: 255 LVISRAGALTIAELLALGVPAILVPYPPGADGHQEYNAKFLEKAGAALVIRQSELTPEKL 314

Query: 417 SDALKIVLSP-----EYKENAEDLGTR 438
           ++ +  +LS         ENA+ LG  
Sbjct: 315 AELILRLLSNPEKLKAMAENAKKLGKP 341


>gnl|CDD|178572 PLN02992, PLN02992, coniferyl-alcohol glucosyltransferase.
          Length = 481

 Score = 44.6 bits (105), Expect = 9e-05
 Identities = 18/49 (36%), Positives = 28/49 (57%)

Query: 345 SSWMPQRDIFAHPNMKLFISHGGLLGITEAVYEGIPVLGIPVFGDQWAN 393
            SW PQ +I AH  +  F++H G     E+V  G+P++  P+F +Q  N
Sbjct: 343 PSWAPQAEILAHQAVGGFLTHCGWSSTLESVVGGVPMIAWPLFAEQNMN 391


>gnl|CDD|177813 PLN02152, PLN02152, indole-3-acetate beta-glucosyltransferase.
          Length = 455

 Score = 43.9 bits (103), Expect = 1e-04
 Identities = 34/138 (24%), Positives = 60/138 (43%), Gaps = 22/138 (15%)

Query: 285 NESHNGVIYFSMGSMLKTSSFPPDKFKAFLKAFSKIPQRVLW----------KFEDNDTS 334
           +++ + VIY S G+M++ S     + +   +A  +  +  LW          K E  + +
Sbjct: 257 SKTESSVIYVSFGTMVELSK---KQIEELARALIEGKRPFLWVITDKLNREAKIEGEEET 313

Query: 335 IFKPYKNIRTS--------SWMPQRDIFAHPNMKLFISHGGLLGITEAVYEGIPVLGIPV 386
             +     R          SW  Q ++  H  +  F++H G     E++  G+PV+  P+
Sbjct: 314 EIEKIAGFRHELEEVGMIVSWCSQIEVLRHRAVGCFVTHCGWSSSLESLVLGVPVVAFPM 373

Query: 387 FGDQWANIKKLE-SLKAG 403
           + DQ AN K LE   K G
Sbjct: 374 WSDQPANAKLLEEIWKTG 391


>gnl|CDD|178581 PLN03004, PLN03004, UDP-glycosyltransferase.
          Length = 451

 Score = 43.1 bits (101), Expect = 2e-04
 Identities = 27/88 (30%), Positives = 42/88 (47%), Gaps = 9/88 (10%)

Query: 306 PPDKFKAFLKAFSKIPQRVLWKFEDNDTSIFKPYKNIRTSSWMPQRDIFAHPNMKLFISH 365
           PP+  K  L   S +P+  L + ED         K +   SW PQ  +  H  +  F++H
Sbjct: 309 PPELEKTELDLKSLLPEGFLSRTED---------KGMVVKSWAPQVPVLNHKAVGGFVTH 359

Query: 366 GGLLGITEAVYEGIPVLGIPVFGDQWAN 393
            G   I EAV  G+P++  P++ +Q  N
Sbjct: 360 CGWNSILEAVCAGVPMVAWPLYAEQRFN 387


>gnl|CDD|177857 PLN02207, PLN02207, UDP-glycosyltransferase.
          Length = 468

 Score = 42.7 bits (100), Expect = 4e-04
 Identities = 19/56 (33%), Positives = 34/56 (60%), Gaps = 4/56 (7%)

Query: 347 WMPQRDIFAHPNMKLFISHGGLLGITEAVYEGIPVLGIPVFGDQWAN----IKKLE 398
           W PQ +I AH  +  F+SH G   I E+++ G+P++  P++ +Q  N    +K+L+
Sbjct: 339 WSPQVEILAHKAVGGFVSHCGWNSIVESLWFGVPIVTWPMYAEQQLNAFLMVKELK 394


>gnl|CDD|215304 PLN02554, PLN02554, UDP-glycosyltransferase family protein.
          Length = 481

 Score = 42.5 bits (100), Expect = 4e-04
 Identities = 33/136 (24%), Positives = 57/136 (41%), Gaps = 24/136 (17%)

Query: 279 EMERFLNES-HNGVIYFSMGSMLKTSSFPPDKFKAFLKAFSKIPQRVLWKFEDNDTSIFK 337
           E+ R+L+E     V++   GSM     F  ++ +    A  +   R LW       +I K
Sbjct: 263 EILRWLDEQPPKSVVFLCFGSM---GGFSEEQAREIAIALERSGHRFLWSLRRASPNIMK 319

Query: 338 P----YKNI----------RTS------SWMPQRDIFAHPNMKLFISHGGLLGITEAVYE 377
                + N+          RT        W PQ  + A P +  F++H G   I E+++ 
Sbjct: 320 EPPGEFTNLEEILPEGFLDRTKDIGKVIGWAPQVAVLAKPAIGGFVTHCGWNSILESLWF 379

Query: 378 GIPVLGIPVFGDQWAN 393
           G+P+   P++ +Q  N
Sbjct: 380 GVPMAAWPLYAEQKFN 395


>gnl|CDD|215465 PLN02863, PLN02863, UDP-glucoronosyl/UDP-glucosyl transferase
           family protein.
          Length = 477

 Score = 42.2 bits (99), Expect = 6e-04
 Identities = 19/59 (32%), Positives = 30/59 (50%), Gaps = 1/59 (1%)

Query: 347 WMPQRDIFAHPNMKLFISHGGLLGITEAVYEGIPVLGIPVFGDQWANIKKL-ESLKAGK 404
           W PQ  I +H  +  F++H G   + E +  G+P+L  P+  DQ+ N   L + LK   
Sbjct: 350 WAPQVAILSHRAVGAFLTHCGWNSVLEGLVAGVPMLAWPMAADQFVNASLLVDELKVAV 408


>gnl|CDD|215293 PLN02534, PLN02534, UDP-glycosyltransferase.
          Length = 491

 Score = 41.8 bits (98), Expect = 7e-04
 Identities = 20/58 (34%), Positives = 33/58 (56%), Gaps = 1/58 (1%)

Query: 347 WMPQRDIFAHPNMKLFISHGGLLGITEAVYEGIPVLGIPVFGDQWANIKKL-ESLKAG 403
           W PQ  I +HP +  F++H G     E +  G+P++  P+F +Q+ N K + E L+ G
Sbjct: 351 WAPQVLILSHPAIGGFLTHCGWNSTIEGICSGVPMITWPLFAEQFLNEKLIVEVLRIG 408


>gnl|CDD|217893 pfam04101, Glyco_tran_28_C, Glycosyltransferase family 28
           C-terminal domain.  The glycosyltransferase family 28
           includes monogalactosyldiacylglycerol synthase (EC
           2.4.1.46) and UDP-N-acetylglucosamine transferase (EC
           2.4.1.-). Structural analysis suggests the C-terminal
           domain contains the UDP-GlcNAc binding site.
          Length = 167

 Score = 39.6 bits (93), Expect = 0.001
 Identities = 27/88 (30%), Positives = 40/88 (45%), Gaps = 7/88 (7%)

Query: 361 LFISHGGLLGITEAVYEGIPVLGIP----VFGDQWANIKKLESLKAGKLLPYLEITEETV 416
           L IS  G   I E +  G P + +P        Q  N  +L    A  +L   E+T E +
Sbjct: 75  LVISRAGAGTIAELLALGKPAILVPRPKAAGEHQDNNALELVKAGAALVLLQKELTPEKL 134

Query: 417 SDAL-KIVLSPE--YKENAEDLGTRFRD 441
            +AL K++L P   Y+ N    G+R +D
Sbjct: 135 VEALLKLLLKPLRLYEMNKAAKGSRLKD 162


>gnl|CDD|215305 PLN02562, PLN02562, UDP-glycosyltransferase.
          Length = 448

 Score = 41.0 bits (96), Expect = 0.001
 Identities = 28/113 (24%), Positives = 42/113 (37%), Gaps = 8/113 (7%)

Query: 289 NGVIYFSMGSMLKTSSFPPDKFKAFLKAFSKIPQRVLWKFEDNDTSIFKP------YKNI 342
           N VIY S GS +  S       +    A     +  +W           P       K  
Sbjct: 273 NSVIYISFGSWV--SPIGESNVRTLALALEASGRPFIWVLNPVWREGLPPGYVERVSKQG 330

Query: 343 RTSSWMPQRDIFAHPNMKLFISHGGLLGITEAVYEGIPVLGIPVFGDQWANIK 395
           +  SW PQ ++  H  +  +++H G     EA+     +L  PV GDQ+ N  
Sbjct: 331 KVVSWAPQLEVLKHQAVGCYLTHCGWNSTMEAIQCQKRLLCYPVAGDQFVNCA 383


>gnl|CDD|177830 PLN02173, PLN02173, UDP-glucosyl transferase family protein.
          Length = 449

 Score = 40.8 bits (95), Expect = 0.001
 Identities = 30/125 (24%), Positives = 56/125 (44%), Gaps = 21/125 (16%)

Query: 291 VIYFSMGSMLKTSSFPPDKFKAFLKAFS-----------KIPQRVLWKFEDNDTSIFKPY 339
           V+Y + GSM K SS   ++  + +  FS           K+P   L   + + + + K  
Sbjct: 266 VVYIAFGSMAKLSSEQMEEIASAISNFSYLWVVRASEESKLPPGFLETVDKDKSLVLK-- 323

Query: 340 KNIRTSSWMPQRDIFAHPNMKLFISHGGLLGITEAVYEGIPVLGIPVFGDQWANIKKLES 399
                  W PQ  + ++  +  F++H G     E +  G+P++ +P + DQ  N K ++ 
Sbjct: 324 -------WSPQLQVLSNKAIGCFMTHCGWNSTMEGLSLGVPMVAMPQWTDQPMNAKYIQD 376

Query: 400 L-KAG 403
           + K G
Sbjct: 377 VWKVG 381


>gnl|CDD|233077 TIGR00661, MJ1255, conserved hypothetical protein.  This model
           represents nearly the full length of MJ1255 from
           Methanococcus jannaschii and of an unpublished protein
           from Vibrio cholerae, as well as the C-terminal half of
           a protein from Methanobacterium thermoautotrophicum. A
           small region (~50 amino acids) within the domain appears
           related to a family of sugar transferases [Hypothetical
           proteins, Conserved].
          Length = 321

 Score = 40.2 bits (94), Expect = 0.001
 Identities = 24/55 (43%), Positives = 31/55 (56%), Gaps = 2/55 (3%)

Query: 358 NMKLFISHGGLLGITEAVYEGIPVLGIPVFG--DQWANIKKLESLKAGKLLPYLE 410
           N +L I+HGG   I+EA+  G P++ IP  G  +Q  N  KLE L  G  L Y E
Sbjct: 247 NAELVITHGGFSLISEALSLGKPLIVIPDLGQFEQGNNAVKLEDLGCGIALEYKE 301


>gnl|CDD|233284 TIGR01133, murG, undecaprenyldiphospho-muramoylpentapeptide
           beta-N-acetylglucosaminyltransferase.  RM 8449890 RT The
           final step of peptidoglycan subunit assembly in
           Escherichia coli occurs in the cytoplasm. RA Bupp K, van
           Heijenoort J. RL J Bacteriol 1993 Mar;175(6):1841-3
           [Cell envelope, Biosynthesis and degradation of murein
           sacculus and peptidoglycan].
          Length = 348

 Score = 38.0 bits (89), Expect = 0.010
 Identities = 27/84 (32%), Positives = 38/84 (45%), Gaps = 8/84 (9%)

Query: 361 LFISHGGLLGITEAVYEGIPVLGIP---VFGDQWANIKKLESLKAGKLLPYLEIT-EETV 416
           L IS  G   + E    G+P + IP      DQ+ N K LE L AG ++   E+  E+ +
Sbjct: 253 LVISRAGASTVAELAAAGVPAILIPYPYAADDQYYNAKFLEDLGAGLVIRQKELLPEKLL 312

Query: 417 SDALKIVLSPE----YKENAEDLG 436
              LK++L P       E A  L 
Sbjct: 313 EALLKLLLDPANLEAMAEAARKLA 336


>gnl|CDD|178584 PLN03007, PLN03007, UDP-glucosyltransferase family protein.
          Length = 482

 Score = 37.1 bits (86), Expect = 0.018
 Identities = 41/163 (25%), Positives = 64/163 (39%), Gaps = 24/163 (14%)

Query: 289 NGVIYFSMGSMLKTSSFPPDKFKAFLKAFSKIPQRVLWKFEDNDTSIFKPY--------- 339
           + VIY S GS+   +SF  ++            Q  +W    N+    K           
Sbjct: 285 DSVIYLSFGSV---ASFKNEQLFEIAAGLEGSGQNFIWVVRKNENQGEKEEWLPEGFEER 341

Query: 340 ---KNIRTSSWMPQRDIFAHPNMKLFISHGGLLGITEAVYEGIPVLGIPVFGDQWANIKK 396
              K +    W PQ  I  H     F++H G   + E V  G+P++  PV  +Q+ N K 
Sbjct: 342 TKGKGLIIRGWAPQVLILDHQATGGFVTHCGWNSLLEGVAAGLPMVTWPVGAEQFYNEKL 401

Query: 397 LE-------SLKAGKLLPYLE--ITEETVSDALKIVLSPEYKE 430
           +        S+ A KL+      I+ E V  A++ V+  E  E
Sbjct: 402 VTQVLRTGVSVGAKKLVKVKGDFISREKVEKAVREVIVGEEAE 444


>gnl|CDD|178275 PLN02670, PLN02670, transferase, transferring glycosyl groups.
          Length = 472

 Score = 36.8 bits (85), Expect = 0.026
 Identities = 24/99 (24%), Positives = 48/99 (48%), Gaps = 4/99 (4%)

Query: 347 WMPQRDIFAHPNMKLFISHGGLLGITEAVYEGIPVLGIPVFGDQWANIKKLESLKAGKLL 406
           W+PQ  I +H ++  F++H G   + E +  G  ++  PV  +Q  N + L   K G  +
Sbjct: 346 WVPQVKILSHESVGGFLTHCGWNSVVEGLGFGRVLILFPVLNEQGLNTRLLHGKKLGLEV 405

Query: 407 PYLE----ITEETVSDALKIVLSPEYKENAEDLGTRFRD 441
           P  E     T ++V++++++ +  +  E   D     R+
Sbjct: 406 PRDERDGSFTSDSVAESVRLAMVDDAGEEIRDKAKEMRN 444


>gnl|CDD|178364 PLN02764, PLN02764, glycosyltransferase family protein.
          Length = 453

 Score = 36.2 bits (83), Expect = 0.037
 Identities = 28/106 (26%), Positives = 52/106 (49%), Gaps = 6/106 (5%)

Query: 347 WMPQRDIFAHPNMKLFISHGGLLGITEAVYEGIPVLGIPVFGDQWANIKKL-ESLKAGKL 405
           W+ Q  I +HP++  F+SH G   + E++     ++ +P  GDQ  N + L + LK    
Sbjct: 324 WVQQPLILSHPSVGCFVSHCGFGSMWESLLSDCQIVLVPQLGDQVLNTRLLSDELKVSVE 383

Query: 406 LPYLE---ITEETVSDALKIVLS--PEYKENAEDLGTRFRDRPQSP 446
           +   E    ++E++ DA+  V+    E     +   T++R+   SP
Sbjct: 384 VAREETGWFSKESLRDAINSVMKRDSEIGNLVKKNHTKWRETLASP 429


>gnl|CDD|146802 pfam04354, ZipA_C, ZipA, C-terminal FtsZ-binding domain.  This
           family represents the ZipA C-terminal domain. ZipA is
           involved in septum formation in bacterial cell division.
           Its C-terminal domain binds FtsZ, a major component of
           the bacterial septal ring. The structure of this domain
           is an alpha-beta fold with three alpha helices and a
           beta sheet of six antiparallel beta strands. The major
           loops protruding from the beta sheet surface are thought
           to form a binding site for FtsZ.
          Length = 130

 Score = 32.9 bits (76), Expect = 0.12
 Identities = 9/32 (28%), Positives = 17/32 (53%)

Query: 282 RFLNESHNGVIYFSMGSMLKTSSFPPDKFKAF 313
           R  +   +G + FS+ +M+K  +F  D  + F
Sbjct: 40  RHEDLDGSGPVLFSVANMVKPGTFDLDNMEQF 71


>gnl|CDD|222200 pfam13528, Glyco_trans_1_3, Glycosyl transferase family 1. 
          Length = 317

 Score = 33.8 bits (78), Expect = 0.20
 Identities = 24/107 (22%), Positives = 45/107 (42%), Gaps = 10/107 (9%)

Query: 324 VLWKFEDNDTSIFKPYKNIRTSSWMPQRDIFAHPNM-------KLFISHGGLLGITEAVY 376
           +L  F D +  ++ P K       +  R  F+              I + G   ++EA+Y
Sbjct: 209 LLKPFPDYEFIVYGPNKEEGREGNLTFRP-FSEEGFLRDLAECSAVICNAGFELLSEALY 267

Query: 377 EGIPVLGIPVFG--DQWANIKKLESLKAGKLLPYLEITEETVSDALK 421
            G P+L +P+ G  +Q +N   LE L  G ++   ++    +   L+
Sbjct: 268 LGKPLLLVPLDGQFEQTSNALYLERLGYGIVMDMEDLDPAVLRRFLE 314


>gnl|CDD|177858 PLN02208, PLN02208, glycosyltransferase family protein.
          Length = 442

 Score = 32.7 bits (74), Expect = 0.46
 Identities = 24/98 (24%), Positives = 46/98 (46%), Gaps = 9/98 (9%)

Query: 347 WMPQRDIFAHPNMKLFISHGGLLGITEAVYEGIPVLGIPVFGDQWANIKKL-ESLKAGKL 405
           W+ Q  I  HP++  F++H G   I E++     ++ IP   DQ    + + E  +    
Sbjct: 318 WVQQPLILDHPSIGCFVNHCGPGTIWESLVSDCQMVLIPFLSDQVLFTRLMTEEFEVSVE 377

Query: 406 LPYLE---ITEETVSDALKIVLSPEYKENAEDLGTRFR 440
           +   +    ++E++S+A+K V+  +      DLG   R
Sbjct: 378 VSREKTGWFSKESLSNAIKSVMDKD-----SDLGKLVR 410


>gnl|CDD|225087 COG2176, PolC, DNA polymerase III, alpha subunit (gram-positive
           type) [DNA replication, recombination, and repair].
          Length = 1444

 Score = 33.1 bits (76), Expect = 0.49
 Identities = 12/44 (27%), Positives = 22/44 (50%), Gaps = 4/44 (9%)

Query: 246 VNTHHT-INIARPLPANVVEIGGI---HVKPAKKLNEEMERFLN 285
           ++     I   RPL A + E+ GI    ++ A ++ E +E+F  
Sbjct: 454 IDKFQFFIKPGRPLSATITELTGITDEMLENAPEIEEVLEKFRE 497


>gnl|CDD|241494 cd13340, PH-GRAM_MTMR5, Myotubularian (MTM) related 5 protein
           (MTMR5) Pleckstrin Homology-Glucosyltransferases,
           Rab-like GTPase activators and Myotubularins (PH-GRAM)
           domain.  MTMR5 (also called SBF1/SET binding factor 1)
           is a catalytically inactive phosphatase that plays a
           role as an adapter for the phosphatase myotubularin to
           regulate myotubularintracellular location. It lacks
           several amino acids in the dsPTPase catalytic pocket
           which renders it catalytically inactive as a
           phosphatase. MTMR5 is the most well-studied inactive
           member of this family and has been implicated in
           cellular growth control and oncogenic transformation.
           MTMR5 and MTMR13 contain an N-terminal DENN domain, a
           PH-GRAM domain, an inactive PTP domain, a SET
           interaction domain, a coiled-coil domain, and a
           C-terminal PH domain. Myotubularin-related proteins are
           a subfamily of protein tyrosine phosphatases (PTPs) that
           dephosphorylate D3-phosphorylated inositol lipids.
           Mutations in this family cause the human neuromuscular
           disorders myotubular myopathy and type 4B
           Charcot-Marie-Tooth syndrome. 6 of the 13 MTMRs (MTMRs
           5, 9-13) contain naturally occurring substitutions of
           residues required for catalysis by PTP family enzymes.
           Although these proteins are predicted to be
           enzymatically inactive, they are thought to function as
           antagonists of endogenous phosphatase activity or
           interaction modules. Most MTMRs contain a N-terminal
           PH-GRAM domain, a Rac-induced recruitment domain (RID)
           domain, a PTP domain (which may be active or inactive),
           a SET-interaction domain, and a C-terminal coiled-coil
           region. In addition some members contain DENN domain
           N-terminal to the PH-GRAM domain and FYVE, PDZ, and PH
           domains C-terminal to the coiled-coil region. The GRAM
           domain, found in myotubularins, glucosyltransferases,
           and other putative membrane-associated proteins, is part
           of a larger motif with a pleckstrin homology (PH) domain
           fold. The PH domain family possesses multiple functions
           including the ability to bind phosphoinositides via its
           beta1/beta2, beta3/beta4, and beta6/beta7 connecting
           loops and to other proteins. However, no
           phosphoinositide binding sites have been found for the
           MTMRs to date.
          Length = 185

 Score = 31.0 bits (70), Expect = 0.94
 Identities = 24/69 (34%), Positives = 34/69 (49%), Gaps = 3/69 (4%)

Query: 395 KKLESLKAGKLL-PYLEITEETVSDALKIVLSPEYKENAEDLGTRFRDRPQSPLEVAIYW 453
           K+L  ++  KLL P L   EE V D L++ L P+ +E A   G      P  P E AI+ 
Sbjct: 15  KRLPPIQKPKLLRPTLLPGEECVMDGLRVYLLPDGREEAA--GGSIGGPPLLPAEGAIFL 72

Query: 454 IEYVIKYNG 462
             Y + + G
Sbjct: 73  TTYRVIFKG 81


>gnl|CDD|177807 PLN00414, PLN00414, glycosyltransferase family protein.
          Length = 446

 Score = 31.1 bits (70), Expect = 1.4
 Identities = 13/44 (29%), Positives = 24/44 (54%)

Query: 347 WMPQRDIFAHPNMKLFISHGGLLGITEAVYEGIPVLGIPVFGDQ 390
           W+ Q  I +HP++  F++H G   + E++     ++ IP   DQ
Sbjct: 319 WVEQPLILSHPSVGCFVNHCGFGSMWESLVSDCQIVFIPQLADQ 362


>gnl|CDD|129010 smart00771, ZipA_C, ZipA, C-terminal domain (FtsZ-binding).
           C-terminal domain of ZipA, a component of cell division
           in E.coli. It interacts with the FtsZ protein in one of
           the initial steps of septum formation. The structure of
           this domain is composed of three alpha-helices and a
           beta-sheet consisting of six antiparallel beta-strands.
          Length = 131

 Score = 29.6 bits (67), Expect = 1.9
 Identities = 9/32 (28%), Positives = 17/32 (53%)

Query: 282 RFLNESHNGVIYFSMGSMLKTSSFPPDKFKAF 313
           R  + + +G + FS+ +M+K  +F  D    F
Sbjct: 41  RHDDLAGSGPVLFSLANMVKPGTFDLDNMDNF 72


>gnl|CDD|233397 TIGR01405, polC_Gram_pos, DNA polymerase III, alpha chain,
           Gram-positive type.  This model describes a polypeptide
           chain of DNA polymerase III. Full-length homologs of
           this protein are restricted to the Gram-positive
           lineages, including the Mycoplasmas. This protein is
           designated alpha chain and given the gene symbol polC,
           but is not a full-length homolog of other polC genes.
           The N-terminal region of about 200 amino acids is rich
           in low-complexity sequence, poorly alignable, and not
           included n this model [DNA metabolism, DNA replication,
           recombination, and repair].
          Length = 1213

 Score = 30.8 bits (70), Expect = 2.2
 Identities = 12/43 (27%), Positives = 21/43 (48%), Gaps = 4/43 (9%)

Query: 246 VNTHHT-INIARPLPANVVEIGGI---HVKPAKKLNEEMERFL 284
           ++     I    PL A V E+ GI    ++ A ++ E +E+F 
Sbjct: 223 IDKFQFFIKPHEPLSAFVTELTGITQDMLENAPEIEEVLEKFK 265


>gnl|CDD|178499 PLN02911, PLN02911, inositol-phosphate phosphatase.
          Length = 296

 Score = 30.1 bits (68), Expect = 2.7
 Identities = 31/96 (32%), Positives = 46/96 (47%), Gaps = 13/96 (13%)

Query: 360 KLFISHGGLLGITEA-VYEGIPVLGI---PVFGDQWANIKKLESLKAGKLLPYLEITE-- 413
           K FI+   L G   A +Y+G PVLGI   PV  ++W  +    +   G+     EI+   
Sbjct: 121 KSFITGKPLFGTLIALLYKGKPVLGIIDQPVLKERWVGVAGRATTLNGE-----EISTRS 175

Query: 414 -ETVSDALKIVLSPE-YKENAEDLGTRFRDRPQSPL 447
             ++ DA     SP  +  +AED   R RD+ + PL
Sbjct: 176 CASLKDAYLYTTSPHMFSGDAEDAFARVRDKVKVPL 211


>gnl|CDD|187832 cd09701, Cas10_III, CRISPR/Cas system-associated protein Cas10.
           CRISPR (Clustered Regularly Interspaced Short
           Palindromic Repeats) and associated Cas proteins
           comprise a system for heritable host defense by
           prokaryotic cells against phage and other foreign DNA;
           Multidomain protein with permuted HD nuclease domain,
           inactivated palm domain and Zn-ribbon; signature gene
           for type III.
          Length = 909

 Score = 30.5 bits (69), Expect = 3.1
 Identities = 12/48 (25%), Positives = 18/48 (37%), Gaps = 2/48 (4%)

Query: 207 YYKWYYDYKSHGAANRIIHK--YFPEIPRIQDLLNTCSLTLVNTHHTI 252
            ++  Y         R + +  Y  +I RI DL +  S  L N    I
Sbjct: 791 RFEICYIEDESDRRRRDLKRPFYLDDIDRITDLWDVLSRLLSNQQIQI 838


>gnl|CDD|225858 COG3321, COG3321, Polyketide synthase modules and related proteins
           [Secondary metabolites biosynthesis, transport, and
           catabolism].
          Length = 1061

 Score = 30.1 bits (68), Expect = 4.0
 Identities = 19/99 (19%), Positives = 31/99 (31%), Gaps = 17/99 (17%)

Query: 378 GIPVLGIPVFGDQWANIKKLESLKAGK-LLPYLEITEETVSDALKIVLSPEYKENAEDLG 436
              V   P  G QWA          G+ L     +      DAL+ +L           G
Sbjct: 525 KKTVFVFPGQGSQWAG--------MGRELYALEPVFASAF-DALEALLHRLL-------G 568

Query: 437 TRFRDRPQSPLEVAIYWIEYVIKYNGAYHLQSAAVKLTW 475
               +   +P    +  I++      A  +  AA+  +W
Sbjct: 569 FSLPEVIFAPDYPFLESIDFAQPALFAVSVALAALWRSW 607


>gnl|CDD|233080 TIGR00665, DnaB, replicative DNA helicase.  This model describes
          the helicase DnaB, a homohexameric protein required for
          DNA replication. The homohexamer can form a ring around
          a single strand of DNA near a replication fork. An
          intein of > 400 residues is found at a conserved
          location in DnaB of Synechocystis PCC6803, Rhodothermus
          marinus (both experimentally confirmed), and
          Mycobacterium tuberculosis. The intein removes itself
          by a self-splicing reaction. The seed alignment
          contains inteins so that the model built from the seed
          alignment will model a low cost at common intein
          insertion sites [DNA metabolism, DNA replication,
          recombination, and repair].
          Length = 434

 Score = 29.7 bits (68), Expect = 4.2
 Identities = 10/25 (40%), Positives = 19/25 (76%), Gaps = 1/25 (4%)

Query: 38 KSHQIIFDTILVELYQRGHDLTVIT 62
           +HQ+IF+ IL +LY++G  + ++T
Sbjct: 38 PAHQLIFEAIL-DLYEKGEPIDLVT 61


>gnl|CDD|240631 cd12154, FDH_GDH_like, Formate/glycerate dehydrogenases,
          D-specific 2-hydroxy acid dehydrogenases and related
          dehydrogenases.  The formate/glycerate dehydrogenase
          like family contains a diverse group of enzymes such as
          formate dehydrogenase (FDH), glycerate dehydrogenase
          (GDH), D-lactate dehydrogenase, L-alanine
          dehydrogenase, and S-Adenosylhomocysteine hydrolase,
          that share a common 2-domain structure. Despite often
          low sequence identity, these proteins typically have a
          characteristic arrangement of 2 similar domains of the
          alpha/beta Rossmann fold NAD+ binding form. The NAD(P)
          binding domain is inserted within the linear sequence
          of the mostly N-terminal catalytic domain.
          Structurally, these domains are connected by extended
          alpha helices and create a cleft in which NAD(P) is
          bound, primarily to the C-terminal portion of the 2nd
          (internal) domain. While many members of this family
          are dimeric, alanine DH is hexameric and
          phosphoglycerate DH is tetrameric. 2-hydroxyacid
          dehydrogenases are enzymes that catalyze the conversion
          of a wide variety of D-2-hydroxy acids to their
          corresponding keto acids. The general mechanism is
          (R)-lactate + acceptor to pyruvate + reduced acceptor.
          Formate dehydrogenase (FDH) catalyzes the
          NAD+-dependent oxidation of formate ion to carbon
          dioxide with the concomitant reduction of NAD+ to NADH.
          FDHs of this family contain no metal ions or prosthetic
          groups. Catalysis occurs though direct transfer of a
          hydride ion to NAD+ without the stages of acid-base
          catalysis typically found in related dehydrogenases.
          Length = 310

 Score = 29.1 bits (65), Expect = 4.8
 Identities = 16/67 (23%), Positives = 28/67 (41%), Gaps = 5/67 (7%)

Query: 21 IRIQQCSKILAIFPTPAKSHQIIFDTILVELYQRGHDLTVI--TQYPETLVHYERMKVLD 78
          +R++  + I A F   A    +    I+V L +    L V+   + P T   Y  ++ L 
Sbjct: 31 VRVETGAGIGAGFADQA---YVQAGAIVVTLAKALWSLDVVLKVKEPLTNAEYALIQKLG 87

Query: 79 IKGTHTY 85
           +   TY
Sbjct: 88 DRLLFTY 94


>gnl|CDD|241493 cd13339, PH-GRAM_MTMR13, Myotubularian (MTM) related 13 protein
           Pleckstrin Homology-Glucosyltransferases, Rab-like
           GTPase activators and Myotubularins (PH-GRAM) domain.
           MTMR13 (also called SBF2/SET binding factor 2) is a
           catalytically inactive phosphatase that plays a role as
           an adapter for the phosphatase myotubularin to regulate
           myotubularintracellular location. It contains a Leu
           residue instead of a conserved Cys residue in the
           dsPTPase catalytic loop which renders it catalytically
           inactive as a phosphatase. MTMR13 has high sequence
           similarity to MTMR5 and has recently been shown to be a
           second gene mutated in type 4B Charcot-Marie-Tooth
           syndrome. Both MTMR5 and MTMR13 contain an N-terminal
           DENN domain, a PH-GRAM domain, an inactive PTP domain, a
           SET interaction domain, a coiled-coil domain, and a
           C-terminal PH domain. Myotubularin-related proteins are
           a subfamily of protein tyrosine phosphatases (PTPs) that
           dephosphorylate D3-phosphorylated inositol lipids.
           Mutations in this family cause the human neuromuscular
           disorders myotubular myopathy and type 4B
           Charcot-Marie-Tooth syndrome. 6 of the 13 MTMRs (MTMRs
           5, 9-13) contain naturally occurring substitutions of
           residues required for catalysis by PTP family enzymes.
           Although these proteins are predicted to be
           enzymatically inactive, they are thought to function as
           antagonists of endogenous phosphatase activity or
           interaction modules. Most MTMRs contain a N-terminal
           PH-GRAM domain, a Rac-induced recruitment domain (RID)
           domain, a PTP domain (which may be active or inactive),
           a SET-interaction domain, and a C-terminal coiled-coil
           region. In addition some members contain DENN domain
           N-terminal to the PH-GRAM domain and FYVE, PDZ, and PH
           domains C-terminal to the coiled-coil region. The GRAM
           domain, found in myotubularins, glucosyltransferases,
           and other putative membrane-associated proteins, is part
           of a larger motif with a pleckstrin homology (PH) domain
           fold. The PH domain family possesses multiple functions
           including the ability to bind phosphoinositides via its
           beta1/beta2, beta3/beta4, and beta6/beta7 connecting
           loops and to other proteins. However, no
           phosphoinositide binding sites have been found for the
           MTMRs to date.
          Length = 184

 Score = 28.8 bits (64), Expect = 5.0
 Identities = 21/72 (29%), Positives = 35/72 (48%), Gaps = 3/72 (4%)

Query: 395 KKLESLKAGKLL-PYLEITEETVSDALKIVLSPEYKENAEDLGTRFRDRPQSPLEVAIYW 453
           ++L  ++  K+L P L   EE V + L+++L P+ +E A   G         P E A++ 
Sbjct: 14  RRLPPIQKPKILRPALLPGEEFVCEGLRVLLDPDGREEAT--GGLLGGPHILPAEGALFL 71

Query: 454 IEYVIKYNGAYH 465
             Y I + G  H
Sbjct: 72  TTYRIIFKGTPH 83


>gnl|CDD|173850 cd01161, VLCAD, Very long chain acyl-CoA dehydrogenase.  VLCAD is
           an acyl-CoA dehydrogenase (ACAD), which is found in the
           mitochondria of eukaryotes and in some bacteria.  It
           catalyzes the alpha,beta dehydrogenation of the
           corresponding trans-enoyl-CoA by FAD, which becomes
           reduced. The reduced form of ACAD is reoxidized in the
           oxidative half-reaction by electron-transferring
           flavoprotein (ETF), from which the electrons are
           transferred to the mitochondrial respiratory chain
           coupled with ATP synthesis. VLCAD acts as a homodimer.
          Length = 409

 Score = 29.0 bits (65), Expect = 6.0
 Identities = 20/72 (27%), Positives = 32/72 (44%), Gaps = 5/72 (6%)

Query: 363 ISHGGLLGITEAVYEGIPVLGIPVFGDQWANIKKLESLKAGKLLPYLEITEETV-SDALK 421
           +       +T   ++ I   GI +FG +    K L  L +G+ +    +TE +  SDA  
Sbjct: 96  VGMDLGFSVTLGAHQSIGFKGILLFGTEAQKEKYLPKLASGEWIAAFALTEPSSGSDAAS 155

Query: 422 I----VLSPEYK 429
           I    VLS + K
Sbjct: 156 IRTTAVLSEDGK 167


>gnl|CDD|169414 PRK08381, PRK08381, putative monovalent cation/H+ antiporter
           subunit F; Reviewed.
          Length = 87

 Score = 27.0 bits (60), Expect = 7.5
 Identities = 14/34 (41%), Positives = 22/34 (64%), Gaps = 1/34 (2%)

Query: 469 AAVKLTWYQYYLLDIALVLI-VGLVSVCFVLKYL 501
           A + + W +YYLLD+A+VL+ V  V    + KY+
Sbjct: 50  AILSVLWEEYYLLDLAIVLLMVNAVGGLILAKYM 83


>gnl|CDD|99968 cd03794, GT1_wbuB_like, This family is most closely related to the
           GT1 family of glycosyltransferases. wbuB in E. coli is
           involved in the biosynthesis of the O26 O-antigen.  It
           has been proposed to function as an
           N-acetyl-L-fucosamine (L-FucNAc) transferase.
          Length = 394

 Score = 28.8 bits (65), Expect = 7.5
 Identities = 30/145 (20%), Positives = 52/145 (35%), Gaps = 31/145 (21%)

Query: 28  KILAI---FPTPAKSHQIIFDTILVELYQRGHDLTVIT---QYPETLVHYERMKVLDIKG 81
           KIL +   FP            +  EL +RGH++TVIT    YP     Y+  K  ++ G
Sbjct: 1   KILILSQYFPPELGGGAFRTTELAEELVKRGHEVTVITGSPNYPSG-KIYKGYKREEVDG 59

Query: 82  -----THTYNSTIEDIYELSADSIKRIHINFIDQEIQVENIFLHENMKSIWNMENKYDLI 136
                                  +KR+ +N++   +          + ++     + D+I
Sbjct: 60  VRVHRVPLPPYK-------KNGLLKRL-LNYLSFALSA--------LLALLKRRRRPDVI 103

Query: 137 I-TEMFLTDAFLVIPY--LYKVPYI 158
           I T   L  A   +    L   P++
Sbjct: 104 IATSPPLLIALAALLLARLKGAPFV 128


>gnl|CDD|99961 cd03785, GT1_MurG, MurG is an N-acetylglucosaminyltransferase, the
           last enzyme involved in the intracellular phase of
           peptidoglycan biosynthesis. It transfers
           N-acetyl-D-glucosamine (GlcNAc) from UDP-GlcNAc to the
           C4 hydroxyl of a lipid-linked N-acetylmuramoyl
           pentapeptide (NAM). The resulting disaccharide is then
           transported across the cell membrane, where it is
           polymerized into NAG-NAM cell-wall repeat structure.
           MurG belongs to the GT-B structural superfamily of
           glycoslytransferases, which have characteristic N- and
           C-terminal domains, each containing a typical Rossmann
           fold. The two domains have high structural homology
           despite minimal sequence homology.  The large cleft that
           separates the two domains includes the catalytic center
           and permits a high degree of flexibility.
          Length = 350

 Score = 28.6 bits (65), Expect = 8.7
 Identities = 26/85 (30%), Positives = 37/85 (43%), Gaps = 9/85 (10%)

Query: 361 LFISHGGLLGITEAVYEGIPVLGIP----VFGDQWANIKKLESLKAGKLLPYLEITEETV 416
           L IS  G   + E    G+P + IP        Q AN + L    A  L+P  E+T E +
Sbjct: 255 LVISRAGASTVAELAALGLPAILIPLPYAADDHQTANARALVKAGAAVLIPQEELTPERL 314

Query: 417 SDAL-KIVLSPEY----KENAEDLG 436
           + AL +++  PE      E A  L 
Sbjct: 315 AAALLELLSDPERLKAMAEAARSLA 339


>gnl|CDD|238142 cd00231, ZipA, ZipA C-terminal domain. ZipA, a membrane-anchored
           protein, is one of at least nine essential gene products
           necessary for assembly of the septal ring which mediates
           cell division in E.coli. ZipA and FtsA directly bind
           FtsZ, a homolog of eukaryotic tubulins, at the
           prospective division site, followed by the sequential
           addition of FtsK, FtsQ, FtsL, FtsW, FtsI, and FtsN.
           ZipA contains three domains: a short N-terminal
           membrane-anchored domain, a central P/Q domain that is
           rich in proline and glutamine and a C-terminal domain,
           which comprises almost half the protein.
          Length = 130

 Score = 27.3 bits (61), Expect = 9.2
 Identities = 12/32 (37%), Positives = 19/32 (59%)

Query: 282 RFLNESHNGVIYFSMGSMLKTSSFPPDKFKAF 313
           R L+ S +G + FS+ +M+K  +F PD    F
Sbjct: 40  RHLSLSGSGPVLFSVANMVKPGTFDPDNMADF 71


>gnl|CDD|212130 cd11665, LamB_like, Aspergillus nidulans lactam utilization protein
           LamB and similar proteins.  This eukaryotic and
           bacterial subfamily of the LamB/YbgL family, includes
           Aspergillus nidulans protein LamB. The lamb gene locates
           at the lam locus of Aspergillus nidulans, consisting of
           two divergently transcribed genes, lamA and lamB, needed
           for the utilization of lactams such as 2-pyrrolidinone.
           Both genes are under the control of the positive
           regulatory gene amdR and are subject to carbon and
           nitrogen metabolite repression. Although the exact
           molecular function of lamb encoding protein LamB is
           unknown, it might be required for conversion of
           exogenous 2-pyrrolidinone to endogenous GABA.
          Length = 238

 Score = 28.0 bits (63), Expect = 9.2
 Identities = 12/31 (38%), Positives = 15/31 (48%), Gaps = 7/31 (22%)

Query: 365 HGGLLGITE-------AVYEGIPVLGIPVFG 388
           HG L G+         AV +   V G+PVFG
Sbjct: 109 HGALYGMAARDEEVARAVADAAKVFGVPVFG 139


>gnl|CDD|182528 PRK10535, PRK10535, macrolide transporter ATP-binding /permease
           protein; Provisional.
          Length = 648

 Score = 28.5 bits (64), Expect = 9.3
 Identities = 20/56 (35%), Positives = 31/56 (55%), Gaps = 8/56 (14%)

Query: 29  ILAIFPTPA-KSH---QIIFDTILVELYQRGHDLTVITQYPETLVHYERMKVLDIK 80
           ILA  PT A  SH   +++   IL +L  RGH + ++T  P+     ER  V++I+
Sbjct: 166 ILADEPTGALDSHSGEEVM--AILHQLRDRGHTVIIVTHDPQVAAQAER--VIEIR 217


>gnl|CDD|114135 pfam05394, AvrB_AvrC, Avirulence protein.  This family consists of
           several avirulence proteins from Pseudomonas syringae
           and Xanthomonas campestris.
          Length = 326

 Score = 28.3 bits (63), Expect = 9.4
 Identities = 16/59 (27%), Positives = 29/59 (49%), Gaps = 1/59 (1%)

Query: 182 IP-SYVSAYTDHMSFTERLSNTFVGLYYKWYYDYKSHGAANRIIHKYFPEIPRIQDLLN 239
           IP + +S  +D  +  +RL++ +  L  +   D  + G  N I H     +P+I+D L 
Sbjct: 180 IPLTTISVSSDDSAIRDRLADLYKDLRRQGLKDLMARGEPNMITHTSAEYVPKIKDHLE 238


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.322    0.138    0.416 

Gapped
Lambda     K      H
   0.267   0.0747    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 26,723,547
Number of extensions: 2667027
Number of successful extensions: 2603
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2584
Number of HSP's successfully gapped: 75
Length of query: 513
Length of database: 10,937,602
Length adjustment: 101
Effective length of query: 412
Effective length of database: 6,457,848
Effective search space: 2660633376
Effective search space used: 2660633376
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 61 (27.2 bits)