RPS-BLAST 2.2.22 [Sep-27-2009] Database: CddB 21,608 sequences; 5,994,473 total letters Searching..................................................done Query= gi|254780328|ref|YP_003064741.1| UDP-glucose 4-epimerase [Candidatus Liberibacter asiaticus str. psy62] (333 letters) >gnl|CDD|130247 TIGR01179, galE, UDP-glucose-4-epimerase. This enzyme interconverts UDP-glucose and UDP-galactose. A set of related proteins, some of which are tentatively identified as UDP-glucose-4-epimerase in Thermotoga maritima, Bacillus halodurans, and several archaea, but deeply branched from this set and lacking experimental evidence, are excluded from this model and described separately. Length = 328 Score = 372 bits (957), Expect = e-104 Identities = 146/325 (44%), Positives = 201/325 (61%), Gaps = 8/325 (2%) Query: 5 NVLVVGGAGYIGAHTCRVLYERGFLPIVLDNLSSGHAEFVLWG------PLEQVDICDYT 58 +LV GGAGYIG+HT R L E G +VLDNLS+G E + G + D+ D Sbjct: 1 KILVTGGAGYIGSHTVRQLLESGHEVVVLDNLSNGSPEALKRGERITRVTFVEGDLRDRE 60 Query: 59 NLRAVFAKYQPASVMHFAGLTNISESVKNPSLFYEINIKGSFNLIATAIESNVRRFIFSS 118 L +F +++ +V+HFAGL + ESV++P +Y N+ + NL+ ++ V++FIFSS Sbjct: 61 LLDRLFEEHKIDAVIHFAGLIAVGESVQDPLKYYRNNVVNTLNLLEAMQQTGVKKFIFSS 120 Query: 119 TCATYGIPHNTIITENDPQESITPYGYTKYVVERELLQHNKVN-GLRSVVLRYFNAAGAT 177 + A YG P + I+E+ P I PYG +K + ER L +K + GL V+LRYFN AGA Sbjct: 121 SAAVYGEPSSIPISEDSPLGPINPYGRSKLMSERILRDLSKADPGLSYVILRYFNVAGAD 180 Query: 178 LDSIIGEWHNPETHVIPLAIKTAMGYQNSFKVFGQDYATRDGTCLRDYIHVLDLANAHIM 237 + IGE TH+IP A + A+G ++ +FG DY T DGTC+RDYIHV+DLA+AH+ Sbjct: 181 PEGTIGEDPPGITHLIPYACQVAVGKRDKLTIFGTDYPTPDGTCVRDYIHVMDLADAHLA 240 Query: 238 ALEYLINQGDSIAINLGTGTGITVKEIISTIQSMYECAFPITYESRRIGDPPSLVADNKK 297 ALEYL+N G+S NLG G G +V E+I + + FP+ RR GDP SLVAD K Sbjct: 241 ALEYLLNGGESHVYNLGYGQGFSVLEVIEAFKKVSGVDFPVELAPRRPGDPASLVADASK 300 Query: 298 AKKILGWNPKY-KLRDIIESAWNWH 321 ++ LGW PKY L II++AW W Sbjct: 301 IRRELGWQPKYTDLEIIIKTAWRWE 325 >gnl|CDD|177883 PLN02240, PLN02240, UDP-glucose 4-epimerase. Length = 352 Score = 257 bits (660), Expect = 2e-69 Identities = 125/343 (36%), Positives = 181/343 (52%), Gaps = 21/343 (6%) Query: 1 MENKNVLVVGGAGYIGAHTCRVLYERGFLPIVLDNLSSGHAEFV-----LWGPL------ 49 + + +LV GGAGYIG+HT L G+ +V+DNL + E + L G L Sbjct: 3 LMGRTILVTGGAGYIGSHTVLQLLLAGYKVVVIDNLDNSSEEALRRVKELAGDLGDNLVF 62 Query: 50 EQVDICDYTNLRAVFAKYQPASVMHFAGLTNISESVKNPSLFYEINIKGSFNLIATAIES 109 +VD+ D L VFA + +V+HFAGL + ESV P L+Y+ N+ G+ NL+ + Sbjct: 63 HKVDLRDKEALEKVFASTRFDAVIHFAGLKAVGESVAKPLLYYDNNLVGTINLLEVMAKH 122 Query: 110 NVRRFIFSSTCATYGIPHNTIITENDPQESITPYGYTKYVVEREL--LQHNKVNGLRSVV 167 ++ +FSS+ YG P TE P + PYG TK +E + + + ++ Sbjct: 123 GCKKLVFSSSATVYGQPEEVPCTEEFPLSATNPYGRTKLFIEEICRDIHASDPE-WKIIL 181 Query: 168 LRYFNAAGATLDSIIGEWHNPE---THVIPLAIKTAMGYQNSFKVFGQDYATRDGTCLRD 224 LRYFN GA IGE +P+ +++P + A+G + VFG DY T+DGT +RD Sbjct: 182 LRYFNPVGAHPSGRIGE--DPKGIPNNLMPYVQQVAVGRRPELTVFGNDYPTKDGTGVRD 239 Query: 225 YIHVLDLANAHIMALEYLINQGDS--IAINLGTGTGITVKEIISTIQSMYECAFPITYES 282 YIHV+DLA+ HI AL L D A NLGTG G +V E+++ + P+ Sbjct: 240 YIHVMDLADGHIAALRKLFTDPDIGCEAYNLGTGKGTSVLEMVAAFEKASGKKIPLKLAP 299 Query: 283 RRIGDPPSLVADNKKAKKILGWNPKYKLRDIIESAWNWHLKYP 325 RR GD + A +KA+K LGW KY + ++ WNW K P Sbjct: 300 RRPGDAEEVYASTEKAEKELGWKAKYGIDEMCRDQWNWASKNP 342 >gnl|CDD|182639 PRK10675, PRK10675, UDP-galactose-4-epimerase; Provisional. Length = 338 Score = 211 bits (540), Expect = 1e-55 Identities = 119/339 (35%), Positives = 179/339 (52%), Gaps = 25/339 (7%) Query: 5 NVLVVGGAGYIGAHTCRVLYERGFLPIVLDNLSSGHAEFV-----LWGP---LEQVDICD 56 VLV GG+GYIG+HTC L + G ++LDNL + + L G + DI + Sbjct: 2 RVLVTGGSGYIGSHTCVQLLQNGHDVVILDNLCNSKRSVLPVIERLGGKHPTFVEGDIRN 61 Query: 57 YTNLRAVFAKYQPASVMHFAGLTNISESVKNPSLFYEINIKGSFNLIATAIESNVRRFIF 116 L + + +V+HFAGL + ESV+ P +Y+ N+ G+ LI+ +NV+ IF Sbjct: 62 EALLTEILHDHAIDTVIHFAGLKAVGESVQKPLEYYDNNVNGTLRLISAMRAANVKNLIF 121 Query: 117 SSTCATYG----IPHNTIITENDPQESITPYGYTKYVVERELLQHNKVNGLRSVV-LRYF 171 SS+ YG IP+ PQ +PYG +K +VE+ L K S+ LRYF Sbjct: 122 SSSATVYGDQPKIPYVESFPTGTPQ---SPYGKSKLMVEQILTDLQKAQPDWSIALLRYF 178 Query: 172 NAAGATLDSIIGEWHNPE---THVIPLAIKTAMGYQNSFKVFGQDYATRDGTCLRDYIHV 228 N GA +GE +P+ +++P + A+G ++S +FG DY T DGT +RDYIHV Sbjct: 179 NPVGAHPSGDMGE--DPQGIPNNLMPYIAQVAVGRRDSLAIFGNDYPTEDGTGVRDYIHV 236 Query: 229 LDLANAHIMALEYLINQGDSIAINLGTGTGITVKEIISTIQSMYECAFPITYE--SRRIG 286 +DLA+ H+ A+E L N+ NLG G G +V ++++ C P+ Y RR G Sbjct: 237 MDLADGHVAAMEKLANKPGVHIYNLGAGVGSSVLDVVNAFSK--ACGKPVNYHFAPRREG 294 Query: 287 DPPSLVADNKKAKKILGWNPKYKLRDIIESAWNWHLKYP 325 D P+ AD KA + L W L ++ + W+W ++P Sbjct: 295 DLPAYWADASKADRELNWRVTRTLDEMAQDTWHWQSRHP 333 >gnl|CDD|130249 TIGR01181, dTDP_gluc_dehyt, dTDP-glucose 4,6-dehydratase. This protein is related to UDP-glucose 4-epimerase (GalE) and likewise has an NAD cofactor. Length = 317 Score = 114 bits (288), Expect = 3e-26 Identities = 94/333 (28%), Positives = 152/333 (45%), Gaps = 35/333 (10%) Query: 5 NVLVVGGAGYIGAHTCR-VLYERG-FLPIVLDNLS-SGHAEFV--LWGP----LEQVDIC 55 +LV GGAG+IG++ R +L E IVLD L+ +G+ E + L + DI Sbjct: 1 RILVTGGAGFIGSNFVRYILNEHPDAEVIVLDKLTYAGNLENLADLEDNPRYRFVKGDIG 60 Query: 56 DYTNLRAVFAKYQPASVMHFAGLTNISESVKNPSLFYEINIKGSFNLIATAIESNVR-RF 114 D + +F ++QP +V+HFA +++ S+ P+ F E N+ G++ L+ + RF Sbjct: 61 DRELVSRLFTEHQPDAVVHFAAESHVDRSISGPAAFIETNVVGTYTLLEAVRKYWHEFRF 120 Query: 115 IFSSTCATYGIPHNT-IITENDPQESITPYGYTKYVVERELLQHNKVNGLRSVVLRYFNA 173 ST YG TE P +PY +K + + +++ GL +++ R N Sbjct: 121 HHISTDEVYGDLEKGDAFTETTPLAPSSPYSASKAASDHLVRAYHRTYGLPALITRCSNN 180 Query: 174 AGATLDSIIGEWHNPETHVIPLAIKTA-MGYQNSFKVFGQDYATRDGTCLRDYIHVLDLA 232 G + PE +IPL I A G V+G DG +RD+++V D Sbjct: 181 YGP--------YQFPEK-LIPLMITNALAG--KPLPVYG------DGQQVRDWLYVED-- 221 Query: 233 NAHIMALEYLINQGDSIAI-NLGTGTGITVKEIISTI-QSMYECAFPITYESRRIGDPPS 290 H A+ ++ +G N+G G T E++ TI + + + IT+ R G Sbjct: 222 --HCRAIYLVLEKGRVGETYNIGGGNERTNLEVVETILELLGKDEDLITHVEDRPGHDRR 279 Query: 291 LVADNKKAKKILGWNPKYKLRDIIESAWNWHLK 323 D K K+ LGW PKY + + W+L Sbjct: 280 YAIDASKIKRELGWAPKYTFEEGLRKTVQWYLD 312 >gnl|CDD|182313 PRK10217, PRK10217, dTDP-glucose 4,6-dehydratase; Provisional. Length = 355 Score = 98.2 bits (244), Expect = 2e-21 Identities = 92/355 (25%), Positives = 147/355 (41%), Gaps = 60/355 (16%) Query: 4 KNVLVVGGAGYIGAHTCR-VLYERGFLPIVLDNLS-----------SGHAEFVLWGPLEQ 51 + +L+ GGAG+IG+ R ++ E +V+D L+ + F E+ Sbjct: 2 RKILITGGAGFIGSALVRYIINETSDAVVVVDKLTYAGNLMSLAPVAQSERFAF----EK 57 Query: 52 VDICDYTNLRAVFAKYQPASVMHFAGLTNISESVKNPSLFYEINIKGSFNLIATA----- 106 VDICD L VF ++QP VMH A +++ S+ P+ F E NI G++ L+ A Sbjct: 58 VDICDRAELARVFTEHQPDCVMHLAAESHVDRSIDGPAAFIETNIVGTYTLLEAARAYWN 117 Query: 107 ----IESNVRRFIFSSTCATYGIPHNT--IITENDPQESITPYGYTKYVVERELLQHNKV 160 + + RF ST YG H+T TE P +PY +K + + + Sbjct: 118 ALTEDKKSAFRFHHISTDEVYGDLHSTDDFFTETTPYAPSSPYSASKASSDHLVRAWLRT 177 Query: 161 NGLRSVVLRYFNAAGATLDSIIGEWHNPETHVIPLAIKTAMGYQNSFKVFGQDYATRDGT 220 GL +++ N G +H PE +IPL I A+ + V+G +G Sbjct: 178 YGLPTLITNCSNN--------YGPYHFPEK-LIPLMILNALAGK-PLPVYG------NGQ 221 Query: 221 CLRDYIHVLDLANAHIMALEYLINQGD-SIAINLGTGTGITVKEIISTIQSMYECAFP-- 277 +RD+++V D H AL + G N+G +++ TI + E P Sbjct: 222 QIRDWLYVED----HARALYCVATTGKVGETYNIGGHNERKNLDVVETICELLEELAPNK 277 Query: 278 ----------ITYESRRIGDPPSLVADNKKAKKILGWNPKYKLRDIIESAWNWHL 322 IT+ + R G D K + LGW P+ + W+L Sbjct: 278 PQGVAHYRDLITFVADRPGHDLRYAIDASKIARELGWLPQETFESGMRKTVQWYL 332 >gnl|CDD|178326 PLN02725, PLN02725, GDP-4-keto-6-deoxymannose-3,5-epimerase-4-reductase. Length = 306 Score = 96.3 bits (240), Expect = 9e-21 Identities = 77/328 (23%), Positives = 148/328 (45%), Gaps = 40/328 (12%) Query: 7 LVVGGAGYIGAHTCRVLYERGFLPIVLDNLSSGHAEFVLWGPLEQVDICDYTNLRAVFAK 66 V G G +G+ R L GF +VL H E +D+ ++ A FAK Sbjct: 1 FVAGHRGLVGSAIVRKLEALGFTNLVLRT----HKE---------LDLTRQADVEAFFAK 47 Query: 67 YQPASVMHFAG-LTNISESVKNPSLFYEINIKGSFNLIATAIESNVRRFIF-SSTC---- 120 +P V+ A + I ++ P+ F N++ N+I A V++ +F S+C Sbjct: 48 EKPTYVILAAAKVGGIHANMTYPADFIRENLQIQTNVIDAAYRHGVKKLLFLGSSCIYPK 107 Query: 121 -ATYGIPHNTIITENDPQESITPYGYTKYVVERELLQHNKVNGLRSVVLRY-FNAAGATL 178 A IP ++T P+ + Y K K+ ++ ++Y ++A Sbjct: 108 FAPQPIPETALLT-GPPEPTNEWYAIAKIAGI-------KMC--QAYRIQYGWDAISGMP 157 Query: 179 DSIIGEWHN--PE-THVIPLAIKTAMGYQNSFKVFGQDYATRDGTCLRDYIHVLDLANAH 235 ++ G N PE +HVIP I+ ++ + G+ LR+++HV DLA+A Sbjct: 158 TNLYGPHDNFHPENSHVIPALIRR--FHEAKANGAPEVVVWGSGSPLREFLHVDDLADAV 215 Query: 236 IMALEYLINQGDSIAINLGTGTGITVKEIISTIQSMYECAFPITYESRRIGDPPSLVADN 295 + + + +N+G+G +T+KE+ ++ + + +++ + P + D+ Sbjct: 216 VFLMR---RYSGAEHVNVGSGDEVTIKELAELVKEVVGFEGELVWDTSKPDGTPRKLMDS 272 Query: 296 KKAKKILGWNPKYKLRDIIESAWNWHLK 323 K + LGW+PK+ L+D ++ + W+L+ Sbjct: 273 SKLRS-LGWDPKFSLKDGLQETYKWYLE 299 >gnl|CDD|182230 PRK10084, PRK10084, dTDP-glucose 4,6 dehydratase; Provisional. Length = 352 Score = 86.4 bits (214), Expect = 9e-18 Identities = 92/353 (26%), Positives = 148/353 (41%), Gaps = 56/353 (15%) Query: 6 VLVVGGAGYIGAHTCR-VLYERGFLPIVLDNLS-SGHAEFVLWGP------LEQVDICDY 57 +LV GGAG+IG+ R ++ + +D L+ +G+ E + E DICD Sbjct: 3 ILVTGGAGFIGSAVVRHIINNTQDSVVNVDKLTYAGNLESLADVSDSERYVFEHADICDR 62 Query: 58 TNLRAVFAKYQPASVMHFAGLTNISESVKNPSLFYEINIKGSFNLIATAIE--------- 108 L +FA++QP +VMH A +++ S+ P+ F E NI G++ L+ A Sbjct: 63 AELDRIFAQHQPDAVMHLAAESHVDRSITGPAAFIETNIVGTYVLLEAARNYWSALDEDK 122 Query: 109 SNVRRFIFSSTCATYG-IPHN---------TIITENDPQESITPYGYTKYVVERELLQHN 158 N RF ST YG +PH + TE +PY +K + + Sbjct: 123 KNAFRFHHISTDEVYGDLPHPDEVENSEELPLFTETTAYAPSSPYSASKASSDHLVRAWL 182 Query: 159 KVNGLRSVVLRYFNAAGATLDSIIGEWHNPETHVIPLAIKTAMGYQNSFKVFGQDYATRD 218 + GL ++V N G +H PE +IPL I A+ + ++G+ Sbjct: 183 RTYGLPTIVTNCSNN--------YGPYHFPEK-LIPLVILNALEGK-PLPIYGK------ 226 Query: 219 GTCLRDYIHVLDLANAHIMALEYLINQGD-SIAINLGTGTGITVKEIISTIQSMYECAFP 277 G +RD+++V D H AL ++ +G N+G +++ TI + + P Sbjct: 227 GDQIRDWLYVED----HARALYKVVTEGKAGETYNIGGHNEKKNLDVVLTICDLLDEIVP 282 Query: 278 --------ITYESRRIGDPPSLVADNKKAKKILGWNPKYKLRDIIESAWNWHL 322 ITY + R G D K + LGW P+ I W+L Sbjct: 283 KATSYREQITYVADRPGHDRRYAIDASKISRELGWKPQETFESGIRKTVEWYL 335 >gnl|CDD|162758 TIGR02197, heptose_epim, ADP-L-glycero-D-manno-heptose-6-epimerase. This family consists of examples of ADP-L-glycero-D-mannoheptose-6-epimerase, an enzyme involved in biosynthesis of the inner core of lipopolysaccharide (LPS) for Gram-negative bacteria. This enzyme is homologous to UDP-glucose 4-epimerase (TIGR01179) and belongs to the NAD dependent epimerase/dehydratase family (pfam01370). Length = 314 Score = 77.3 bits (191), Expect = 5e-15 Identities = 76/290 (26%), Positives = 125/290 (43%), Gaps = 38/290 (13%) Query: 6 VLVVGGAGYIGAHTCRVLYERGFLPI-VLDNLSSGHAEFVLWGPLEQVDICDYTN----- 59 ++V GGAG+IG++ + L ERG I V+DNL GH + L + I DY + Sbjct: 1 IIVTGGAGFIGSNLVKALNERGITDILVVDNLRDGHK----FLNLADLVIADYIDKEDFL 56 Query: 60 --LRAVFAKYQPASVMHFAGLTNISESVKNPSLFYEINIKGSFNLIATAIESNVRRFIFS 117 L A + ++ H ++ +E+ + E N + S L+ E + FI++ Sbjct: 57 DRLEKG-AFGKIEAIFHQGACSDTTET--DGEYMMENNYQYSKRLLDWCAEKGIP-FIYA 112 Query: 118 STCATYGIPHNTIITENDPQESITPYGYTK-----YVVERELLQHN--KVNGLRSVVLRY 170 S+ ATYG + + + YGY+K YV R L + +V G LRY Sbjct: 113 SSAATYGDGEAGFREGRELERPLNVYGYSKFLFDQYVRRRVLPEALSAQVVG-----LRY 167 Query: 171 FNAAGATLDSIIGEWHNPETHVIPLAIKTAMGYQNSFKVFGQDYATRDGTCLRDYIHVLD 230 FN G E+H + + + + + K+F +DG LRD+++V D Sbjct: 168 FNVYGP------REYHKGKMASVAFHLFNQIKAGGNVKLFKSSEGFKDGEQLRDFVYVKD 221 Query: 231 LANAHIMALEYLINQGDSIAINLGTGTGITVKEIISTIQSMYECAFPITY 280 + + ++ LE G S NLGTG + ++ + I Y Sbjct: 222 VVDVNLWLLE----NGVSGIFNLGTGRARSFNDLADAVFKALGKDEKIEY 267 >gnl|CDD|185103 PRK15181, PRK15181, Vi polysaccharide biosynthesis protein TviC; Provisional. Length = 348 Score = 73.6 bits (180), Expect = 6e-14 Identities = 80/341 (23%), Positives = 142/341 (41%), Gaps = 40/341 (11%) Query: 4 KNVLVVGGAGYIGAHTCRVLYERGFLPIVLDNLSSGHAEFV----------LWGPLE--Q 51 K L+ G AG+IG+ L I LDN S+G+ + W Q Sbjct: 16 KRWLITGVAGFIGSGLLEELLFLNQTVIGLDNFSTGYQHNLDDVRTSVSEEQWSRFIFIQ 75 Query: 52 VDICDYTNLRAVFAKYQPASVMHFAGLTNISESVKNPSLFYEINIKGSFNLIATAIESNV 111 DI +T+ + V+H A L ++ S+K+P NI G N++ A +++V Sbjct: 76 GDIRKFTDCQKACKNVD--YVLHQAALGSVPRSLKDPIATNSANIDGFLNMLTAARDAHV 133 Query: 112 RRFIFSSTCATYGIPHNTIITENDPQESITPYGYTKYVVERELLQHNKVNGLRSVVLRYF 171 F ++++ +TYG + E ++PY TKYV E + ++ LRYF Sbjct: 134 SSFTYAASSSTYGDHPDLPKIEERIGRPLSPYAVTKYVNELYADVFARSYEFNAIGLRYF 193 Query: 172 NAAGATLDSIIGEWHNPE---THVIPLAIKTAMGYQNSFKVFGQDYATRDGTCLRDYIHV 228 N + G NP + VIP I + + + Y DG+ RD+ ++ Sbjct: 194 N--------VFGRRQNPNGAYSAVIPRWILSLLKDEPI-------YINGDGSTSRDFCYI 238 Query: 229 LDLANAHIMALEYLINQGDSIAINLGTGTGITVKEIISTIQSMYEC-------AFPITYE 281 ++ A++++ + N+ G ++ E+ I+ A PI Y+ Sbjct: 239 ENVIQANLLSATTNDLASKNKVYNVAVGDRTSLNELYYLIRDGLNLWRNEQSRAEPI-YK 297 Query: 282 SRRIGDPPSLVADNKKAKKILGWNPKYKLRDIIESAWNWHL 322 R GD AD K K L + P++ +++ ++ W++ Sbjct: 298 DFRDGDVKHSQADITKIKTFLSYEPEFDIKEGLKQTLKWYI 338 >gnl|CDD|163279 TIGR03466, HpnA, hopanoid-associated sugar epimerase. The sequences in this family are members of the pfam01370 superfamily of NAD-dependent epimerases and dehydratases typically acting on nucleotide-sugar substrates. The genes of the family modeled here are generally in the same locus with genes involved in the biosynthesis and elaboration of hopene, the cyclization product of the polyisoprenoid squalene. Length = 328 Score = 72.7 bits (179), Expect = 1e-13 Identities = 68/252 (26%), Positives = 109/252 (43%), Gaps = 52/252 (20%) Query: 6 VLVVGGAGYIGAHTCRVLYERGFLPIVL-------DNLSSGHAEFVLWGPLEQVDICDYT 58 VLV G G++G+ R+L E+G VL NL E V G D+ D Sbjct: 3 VLVTGATGFVGSAVVRLLLEQGEEVRVLVRPTSDRRNLEGLDVEIVE-G-----DLRDPA 56 Query: 59 NLRAVFAKYQPASVMHFAGLTNISESVKNPSLFYEINIKGSFNLIATAIESNVRRFIFSS 118 +LR A + ++ H A + +P Y N++G+ NL+ A+E+ V R +++S Sbjct: 57 SLRKAVAGCR--ALFHVAADYRLW--APDPEEMYAANVEGTRNLLRAALEAGVERVVYTS 112 Query: 119 TCATYGIPHN-TIITENDP---QESITPYGYTKYVVERELLQHNKVNGLRSVVLRYFNAA 174 + AT G+ + T E P + I Y +K++ E+ L+ GL V++ Sbjct: 113 SVATLGVRGDGTPADETTPSSLDDMIGHYKRSKFLAEQAALEMAAEKGLPVVIV------ 166 Query: 175 GATLDSIIGEWHNPETHVIPLAIK-TAMGYQNSFKVFGQDYATRD-----GTCLRDYIHV 228 NP T + P IK T G ++ D+ T L + +HV Sbjct: 167 ------------NPSTPIGPRDIKPTPTG-----RII-VDFLNGKMPAYVDTGL-NLVHV 207 Query: 229 LDLANAHIMALE 240 D+A H++ALE Sbjct: 208 DDVAEGHLLALE 219 >gnl|CDD|165812 PLN02166, PLN02166, dTDP-glucose 4,6-dehydratase. Length = 436 Score = 67.0 bits (163), Expect = 7e-12 Identities = 78/321 (24%), Positives = 134/321 (41%), Gaps = 43/321 (13%) Query: 6 VLVVGGAGYIGAHTCRVLYERGFLPIVLDNLSSGHAEFV--LWG----PLEQVDICDYTN 59 ++V GGAG++G+H L RG IV+DN +G E + L+G L + D+ + Sbjct: 123 IVVTGGAGFVGSHLVDKLIGRGDEVIVIDNFFTGRKENLVHLFGNPRFELIRHDVVEPIL 182 Query: 60 LRA--VFAKYQPASVMHFAGLTNISESVKNPSLFYEINIKGSFNLIATAIESNVRRFIFS 117 L ++ PAS +H+ NP + N+ G+ N++ A RF+ + Sbjct: 183 LEVDQIYHLACPASPVHYK---------YNPVKTIKTNVMGTLNMLGLAKRVGA-RFLLT 232 Query: 118 STCATYGIP-----HNTIITENDPQESITPYGYTKYVVERELLQHNKVNGLRSVVLRYFN 172 ST YG P T +P + Y K E + +++ G+ + R FN Sbjct: 233 STSEVYGDPLEHPQKETYWGNVNPIGERSCYDEGKRTAETLAMDYHRGAGVEVRIARIFN 292 Query: 173 AAGATLDSIIGEWHNPETHVIPLAIKTAMGYQNSFKVFGQDYATRDGTCLRDYIHVLDLA 232 G + G ++ + I+ + V+G DG R + +V DL Sbjct: 293 TYGPRMCLDDGR---VVSNFVAQTIR-----KQPMTVYG------DGKQTRSFQYVSDLV 338 Query: 233 NAHIMALEYLINQGDSIA-INLGTGTGITVKEIISTIQSMYECAFPITYESRRIGDPPSL 291 + + +E G+ + NLG T+ E+ ++ + + I ++ DP Sbjct: 339 DGLVALME-----GEHVGPFNLGNPGEFTMLELAEVVKETIDSSATIEFKPNTADDPHKR 393 Query: 292 VADNKKAKKILGWNPKYKLRD 312 D KAK++L W PK LR+ Sbjct: 394 KPDISKAKELLNWEPKISLRE 414 >gnl|CDD|177856 PLN02206, PLN02206, UDP-glucuronate decarboxylase. Length = 442 Score = 63.5 bits (154), Expect = 8e-11 Identities = 82/320 (25%), Positives = 130/320 (40%), Gaps = 43/320 (13%) Query: 6 VLVVGGAGYIGAHTCRVLYERGFLPIVLDNLSSGHAEFVLW---GP---LEQVDICDYTN 59 V+V GGAG++G+H L RG IV+DN +G E V+ P L + D+ + Sbjct: 122 VVVTGGAGFVGSHLVDRLMARGDSVIVVDNFFTGRKENVMHHFSNPNFELIRHDVVEPIL 181 Query: 60 LRA--VFAKYQPASVMHFAGLTNISESVKNPSLFYEINIKGSFNLIATAIESNVRRFIFS 117 L ++ PAS +H+ NP + N+ G+ N++ A R F+ + Sbjct: 182 LEVDQIYHLACPASPVHYK---------FNPVKTIKTNVVGTLNMLGLAKRVGAR-FLLT 231 Query: 118 STCATYGIP--HNTIIT---ENDPQESITPYGYTKYVVERELLQHNKVNGLRSVVLRYFN 172 ST YG P H + T +P + Y K E + +++ + + R FN Sbjct: 232 STSEVYGDPLQHPQVETYWGNVNPIGVRSCYDEGKRTAETLTMDYHRGANVEVRIARIFN 291 Query: 173 AAGATLDSIIGEWHNPETHVIPLAIKTAMGYQNSFKVFGQDYATRDGTCLRDYIHVLDLA 232 G + G V+ + A+ + V+G DG R + V DL Sbjct: 292 TYGPRMCIDDGR-------VVSNFVAQAL-RKEPLTVYG------DGKQTRSFQFVSDLV 337 Query: 233 NAHIMALEYLINQGDSIA-INLGTGTGITVKEIISTIQSMYECAFPITYESRRIGDPPSL 291 + +E G+ + NLG T+ E+ +Q + I + DP Sbjct: 338 EGLMRLME-----GEHVGPFNLGNPGEFTMLELAKVVQETIDPNAKIEFRPNTEDDPHKR 392 Query: 292 VADNKKAKKILGWNPKYKLR 311 D KAK++LGW PK LR Sbjct: 393 KPDITKAKELLGWEPKVSLR 412 >gnl|CDD|162953 TIGR02622, CDP_4_6_dhtase, CDP-glucose 4,6-dehydratase. Members of this protein family are CDP-glucose 4,6-dehydratase from a variety of Gram-negative and Gram-positive bacteria. Members typically are encoded next to a gene that encodes a glucose-1-phosphate cytidylyltransferase, which produces the substrate, CDP-D-glucose, used by this enzyme to produce CDP-4-keto-6-deoxyglucose. Length = 349 Score = 55.8 bits (135), Expect = 1e-08 Identities = 64/305 (20%), Positives = 106/305 (34%), Gaps = 60/305 (19%) Query: 53 DICDYTNLRAVFAKYQPASVMHFAGLTNISESVKNPSLFYEINIKGSFNLIATAIESNVR 112 DI D LR A+++P V H A + +S +P +E N+ G+ NL+ Sbjct: 60 DIRDAAKLRKAIAEFKPEIVFHLAAQPLVRKSYADPLETFETNVMGTVNLLEAI------ 113 Query: 113 RFIFSSTCATYGIPHNTIIT---------------ENDPQESITPYGYTKYVVE------ 151 R I S A +T E DP PY +K E Sbjct: 114 RAI-GSVKAV------VNVTSDKCYRNDEWVWGYRETDPLGGHDPYSSSKACAELVIASY 166 Query: 152 -RELLQHNKVNGLRSVVLRYFNAAGATLDSIIGEWHNPETHVIPLAIKTAMGYQNSFKVF 210 +G++ R N G G+W E +IP I+ Sbjct: 167 RSSFFGVANFHGIKIASARAGNVIGG------GDW--AEDRLIPDVIRAFSS-------- 210 Query: 211 GQDYATRDGTCLRDYIHVLDLANAHIMALEYLINQGDSI--AINLG--TGTGITVKEIIS 266 + R+ R + HVL+ + +++ E L A N G V E++ Sbjct: 211 NKIVIIRNPDATRPWQHVLEPLSGYLLLAEKLFTGQAEFAGAWNFGPRASDNARVVELVV 270 Query: 267 TIQSMYECAFPITYESRRIGDPPS----LVADNKKAKKILGWNPKYKLRDIIESAWNWHL 322 + +E + P L D+ KA+ +LGW+P++ L + + +W+ Sbjct: 271 DALEFWWGD-DAEWEDDSDLNHPHEARLLKLDSSKARTLLGWHPRWGLEEAVSRTVDWYK 329 Query: 323 KYPRS 327 + R Sbjct: 330 AWLRG 334 >gnl|CDD|177902 PLN02260, PLN02260, probable rhamnose biosynthetic enzyme. Length = 668 Score = 55.9 bits (135), Expect = 1e-08 Identities = 69/253 (27%), Positives = 103/253 (40%), Gaps = 43/253 (16%) Query: 2 ENKNVLVVGGAGYIGAHTCRVLYER--GFLPIVLDNL-------------SSGHAEFVLW 46 E KN+L+ G AG+I +H L + +VLD L SS + +FV Sbjct: 5 EPKNILITGAAGFIASHVANRLIRNYPDYKIVVLDKLDYCSNLKNLNPSKSSPNFKFVK- 63 Query: 47 GPLEQVDICDYTNLRAVFAKYQPASVMHFAGLTNISESVKNPSLFYEINIKGSFNLI-AT 105 G + D+ +Y + ++MHFA T++ S N F + NI G+ L+ A Sbjct: 64 GDIASADLVNY-----LLITEGIDTIMHFAAQTHVDNSFGNSFEFTKNNIYGTHVLLEAC 118 Query: 106 AIESNVRRFIFSSTCATYGIPHNTIITENDPQESI---TPYGYTKYVVERELLQHNKVNG 162 + +RRFI ST YG N + PY TK E ++ + + G Sbjct: 119 KVTGQIRRFIHVSTDEVYGETDEDADVGNHEASQLLPTNPYSATKAGAEMLVMAYGRSYG 178 Query: 163 LRSVVLRYFNAAGATLDSIIGEWHNPETHVIPLAIKTAM-GYQNSFKVFGQDYATRDGTC 221 L + R N G PE +IP I AM G + G DG+ Sbjct: 179 LPVITTRGNNVYGPN--------QFPEK-LIPKFILLAMQGKP--LPIHG------DGSN 221 Query: 222 LRDYIHVLDLANA 234 +R Y++ D+A A Sbjct: 222 VRSYLYCEDVAEA 234 >gnl|CDD|178298 PLN02695, PLN02695, GDP-D-mannose-3',5'-epimerase. Length = 370 Score = 54.4 bits (131), Expect = 4e-08 Identities = 76/336 (22%), Positives = 127/336 (37%), Gaps = 44/336 (13%) Query: 2 ENKNVLVVGGAGYIGAHTCRVLYERGFLPIVLDNLSSGHA-------EFVLWGPLEQVDI 54 E + + G G+I +H R L G I D + H EF L L ++ Sbjct: 20 EKLRICITGAGGFIASHIARRLKAEGHYIIASDWKKNEHMSEDMFCHEFHL-VDLRVMEN 78 Query: 55 CDYTNLRAVFAKYQPASVMHFAGLTNISESVKNPSLFYEINIKGSFNLIATAIESNVRRF 114 C + V + A+ M G + SV + Y N SFN++ A + V+RF Sbjct: 79 CLKV-TKGVDHVFNLAADMGGMGFIQSNHSV----IMYN-NTMISFNMLEAARINGVKRF 132 Query: 115 IFSSTCATYG----IPHNTIITEND--PQESITPYGYTKYVVERELLQHNKVNGLRSVVL 168 ++S+ Y + N + E+D P E YG K E + K G+ + Sbjct: 133 FYASSACIYPEFKQLETNVSLKESDAWPAEPQDAYGLEKLATEELCKHYTKDFGIECRIG 192 Query: 169 RYFNAAGATLDSIIGEWHNPETHVIPLAIKTAMGYQNSFKVFGQDYATRDGTCLRDYIH- 227 R+ N G G W + A+ + F+++G TR T + + + Sbjct: 193 RFHNIYGP-----FGTWKGGREKAPAAFCRKALTSTDEFEMWGDGKQTRSFTFIDECVEG 247 Query: 228 VLDLANAHIMALEYLINQGDSIAINLGTGTGITVKEIISTIQSMYECAFPITYESRRIGD 287 VL L + +N+G+ +++ E+ S PI + I Sbjct: 248 VLRLTKSDF-----------REPVNIGSDEMVSMNEMAEIALSFENKKLPI----KHIPG 292 Query: 288 PPSLV---ADNKKAKKILGWNPKYKLRDIIESAWNW 320 P + +DN K+ LGW P +L+D + + W Sbjct: 293 PEGVRGRNSDNTLIKEKLGWAPTMRLKDGLRITYFW 328 >gnl|CDD|182998 PRK11150, rfaD, ADP-L-glycero-D-mannoheptose-6-epimerase; Provisional. Length = 308 Score = 52.0 bits (125), Expect = 2e-07 Identities = 69/274 (25%), Positives = 121/274 (44%), Gaps = 60/274 (21%) Query: 6 VLVVGGAGYIGAHTCRVLYERGFLPI-VLDNLSSGHAEFVLWGPLEQVDICDYTNLRAVF 64 ++V GGAG+IG++ + L ++G I V+DNL G +FV L +DI DY + + F Sbjct: 2 IIVTGGAGFIGSNIVKALNDKGITDILVVDNLKDG-TKFV---NLVDLDIADYMD-KEDF 56 Query: 65 AKYQPASVMHFAGLTNISESVKNPSLFYE----------------INIKGSFNLIATAIE 108 A +M +I ++F+E N + S L+ +E Sbjct: 57 L----AQIMAGDDFGDIE------AIFHEGACSSTTEWDGKYMMDNNYQYSKELLHYCLE 106 Query: 109 SNVRRFIFSSTCATYGIPHNTIITENDPQESITPYGYTKYVVE---RELLQH--NKVNGL 163 + F+++S+ ATYG + I E + ++ + YGY+K++ + R++L +++ G Sbjct: 107 REIP-FLYASSAATYGGRTDDFIEEREYEKPLNVYGYSKFLFDEYVRQILPEANSQICG- 164 Query: 164 RSVVLRYFNAAGATLDSIIGEWHNPETHVIPLAIKTAMGYQNSFKVF-GQDYATRDGTCL 222 RYFN G E H + + + + K+F G + Sbjct: 165 ----FRYFNVYGPR------EGHKGSMASVAFHLNNQLNNGENPKLFEGSE------NFK 208 Query: 223 RDYIHVLDLANAHIMALEYLINQGDSIAINLGTG 256 RD+++V D+A ++ E G S N GTG Sbjct: 209 RDFVYVGDVAAVNLWFWE----NGVSGIFNCGTG 238 >gnl|CDD|178184 PLN02572, PLN02572, UDP-sulfoquinovose synthase. Length = 442 Score = 45.6 bits (108), Expect = 2e-05 Identities = 40/157 (25%), Positives = 62/157 (39%), Gaps = 28/157 (17%) Query: 2 ENKNVLVVGGAGYIGAHTCRVLYERGFLPIVLDNLS-----------------SGHAEFV 44 + K V+V+GG GY G T L +RG+ ++DNL S H Sbjct: 46 KKKKVMVIGGDGYCGWATALHLSKRGYEVAIVDNLCRRLFDHQLGLDSLTPIASIHERVR 105 Query: 45 LWGP-------LEQVDICDYTNLRAVFAKYQPASVMHFAGLTNISESVKNPS-LFYEI-- 94 W L DICD+ L F ++P +V+HF + S+ + S + Sbjct: 106 RWKEVSGKEIELYVGDICDFEFLSEAFKSFEPDAVVHFGEQRSAPYSMIDRSRAVFTQHN 165 Query: 95 NIKGSFNLIATAIESNVR-RFIFSSTCATYGIPHNTI 130 N+ G+ N++ E + T YG P+ I Sbjct: 166 NVIGTLNVLFAIKEFAPDCHLVKLGTMGEYGTPNIDI 202 >gnl|CDD|132628 TIGR03589, PseB, UDP-N-acetylglucosamine 4,6-dehydratase. This enzyme catalyzes the first step in the biosynthesis of pseudaminic acid, the conversion of UDP-N-acetylglucosamine to UDP-4-keto-6-deoxy-N-acetylglucosamine. These sequences are members of the broader pfam01073 (3-beta hydroxysteroid dehydrogenase/isomerase family) family. Length = 324 Score = 42.4 bits (100), Expect = 2e-04 Identities = 46/193 (23%), Positives = 73/193 (37%), Gaps = 44/193 (22%) Query: 3 NKNVLVVGGAGYIGAHTCRVLYERGFLPIVLDNLSSGHA------EFVLWG-----PLEQ 51 NK++L+ GG G G + F+ +L+N + E W P Sbjct: 4 NKSILITGGTGSFG---------KAFISRLLENYNPKKIIIYSRDELKQWEMQQKFPAPC 54 Query: 52 V-----DICDYTNLRAVFAKYQPASVMHFAGLTNISESVKNPSLFYEINIKGSFNLIATA 106 + D+ D L V+H A L + + NP NI G+ N+I A Sbjct: 55 LRFFIGDVRDKERLTRALRGVD--YVVHAAALKQVPAAEYNPFECIRTNINGAQNVIDAA 112 Query: 107 IESNVRRFIFSSTCATYGIPHNTIITENDPQESITPYGYTKYVVERELLQHNKVNG---L 163 I++ V+R + ST + I YG TK ++ + N ++G Sbjct: 113 IDNGVKRVVALST--------------DKAANPINLYGATKLASDKLFVAANNISGSKGT 158 Query: 164 RSVVLRYFNAAGA 176 R V+RY N G+ Sbjct: 159 RFSVVRYGNVVGS 171 >gnl|CDD|177862 PLN02214, PLN02214, cinnamoyl-CoA reductase. Length = 342 Score = 41.7 bits (97), Expect = 2e-04 Identities = 63/265 (23%), Positives = 94/265 (35%), Gaps = 63/265 (23%) Query: 4 KNVLVVGGAGYIGAHTCRVLYERGFL------------PIVLDNLSSGHAEFVLWGPLEQ 51 K V V G GYI + ++L ERG+ L L G +L + Sbjct: 11 KTVCVTGAGGYIASWIVKILLERGYTVKGTVRNPDDPKNTHLRELEGGKERLILC----K 66 Query: 52 VDICDYTNLRAVFAKYQPASVMHFAGLTNISESVKNPSLFYEINIKGSFNLIATAIESNV 111 D+ DY L+A V H A S +P E + G+ +I A E+ V Sbjct: 67 ADLQDYEALKAAIDGCD--GVFHTA-----SPVTDDPEQMVEPAVNGAKFVINAAAEAKV 119 Query: 112 RRFIFSSTC-ATYGIPHNTIITENDPQESITP---------------YGYTKYVVERELL 155 +R + +S+ A Y P+ DP+ + Y Y K V E+ Sbjct: 120 KRVVITSSIGAVYMDPN------RDPEAVVDESCWSDLDFCKNTKNWYCYGKMVAEQAAW 173 Query: 156 QHNKVNGLRSVVLRYFNAAGATLDSIIGEWHNPETHVIPLAIKTAMGYQNSFKVFGQDYA 215 + K G+ VVL G L I + HV+ +A Y N Sbjct: 174 ETAKEKGVDLVVLNPVLVLGPPLQPTI---NASLYHVLKYLTGSAKTYAN---------- 220 Query: 216 TRDGTCLRDYIHVLDLANAHIMALE 240 + Y+ V D+A AH++ E Sbjct: 221 -----LTQAYVDVRDVALAHVLVYE 240 >gnl|CDD|178484 PLN02896, PLN02896, cinnamyl-alcohol dehydrogenase. Length = 353 Score = 41.3 bits (97), Expect = 4e-04 Identities = 72/358 (20%), Positives = 137/358 (38%), Gaps = 71/358 (19%) Query: 8 VVGGAGYIGAHTCRVLYERGFL-PIVLDNLSSGHAEFVLWGPLEQV-----DICDYTNLR 61 V G GYIG+ ++L +RG+ L + + W +++ D+ + + Sbjct: 15 VTGATGYIGSWLVKLLLQRGYTVHATLRDPAKSLHLLSKWKEGDRLRLFRADLQEEGSFD 74 Query: 62 AVF----AKYQPASVMHF---AGLTNISESVKNPSLFYEINIKGSFNLIATAIESN-VRR 113 + A+ M F + NI E V+ S + IKG+ N++ + ++S V+R Sbjct: 75 EAVKGCDGVFHVAASMEFDVSSDHNNIEEYVQ--SKVIDPAIKGTLNVLKSCLKSKTVKR 132 Query: 114 FIFSSTCATY----------GIPHNTIITENDPQESITPYGY----TKYVVERELLQHNK 159 +F+S+ +T + T T D + G+ +K + E ++ K Sbjct: 133 VVFTSSISTLTAKDSNGRWRAVVDETCQTPIDHVWNTKASGWVYVLSKLLTEEAAFKYAK 192 Query: 160 VNGLRSVVLRYFNAAGATLDS------------IIGEWHNPETHVIPLAIKTAMGYQNSF 207 NG+ V + AG L I G+ + I A+ + MG Sbjct: 193 ENGIDLVSVITTTVAGPFLTPSVPSSIQVLLSPITGD---SKLFSILSAVNSRMG----- 244 Query: 208 KVFGQDYATRDGTCLRDYIHVLDLANAHIMALEYLINQGDSIAINLGTGTGITVKEIIST 267 + +H+ D+ +AHI +E +G I + E+I+ Sbjct: 245 -------------SIA-LVHIEDICDAHIFLMEQTKAEGRYIC----CVDSYDMSELINH 286 Query: 268 IQSMYECAFPITYESRRI-GDPPSLVADNKKAKKILGWNPKYKLRDIIESAWNWHLKY 324 + Y C+ G PS ++ +KK + LG+ KY + +II+ + + + Sbjct: 287 LSKEYPCSNIQVRLDEEKRGSIPSEIS-SKKLRD-LGFEYKYGIEEIIDQTIDCCVDH 342 >gnl|CDD|180299 PRK05872, PRK05872, short chain dehydrogenase; Provisional. Length = 296 Score = 40.3 bits (95), Expect = 8e-04 Identities = 31/146 (21%), Positives = 54/146 (36%), Gaps = 31/146 (21%) Query: 1 MENKNVLVVGGAGYIGAHTCRVLYERGFLPIVLD-----------NLSSGHAEFVLWGPL 49 + K V+V G A IGA R L+ RG ++D L + Sbjct: 7 LAGKVVVVTGAARGIGAELARRLHARGAKLALVDLEEAELAALAAELGGDDRVLTV---- 62 Query: 50 EQVDICDYTNLRA----VFAKYQPAS-VMHFAGLTNISESVK--NPSLF---YEINIKGS 99 D+ D ++A ++ V+ AG+ + SV +P F ++N+ G Sbjct: 63 -VADVTDLAAMQAAAEEAVERFGGIDVVVANAGIAS-GGSVAQVDPDAFRRVIDVNLLGV 120 Query: 100 FNLIATAIESNVRR----FIFSSTCA 121 F+ + + + + R SS A Sbjct: 121 FHTVRATLPALIERRGYVLQVSSLAA 146 >gnl|CDD|162255 TIGR01214, rmlD, dTDP-4-dehydrorhamnose reductase. This enzyme catalyzes the last of 4 steps in making dTDP-rhamnose, a precursor of LPS core antigen, O-antigen, etc. Length = 287 Score = 39.7 bits (93), Expect = 0.001 Identities = 31/164 (18%), Positives = 62/164 (37%), Gaps = 19/164 (11%) Query: 6 VLVVGGAGYIGAHTCRVLYERGFLPIVLDNLSSGHAEFVLWGPLEQVDICDYTNLRAVFA 65 +L+ G G +G + L G + + L + Q+D+ D L + Sbjct: 2 ILITGANGQLGRELVQQLSPEGRVVVALTS--------------SQLDLTDPEALERLLR 47 Query: 66 KYQPASVMHFAGLTNISESVKNPSLFYEINIKGSFNLIATAIESNVRRFIFSSTCATYGI 125 +P +V++ A T++ + +P + +N NL A R + ST + Sbjct: 48 AIRPDAVVNTAAYTDVDGAESDPEKAFAVNALAPQNLARAAARHGA-RLVHISTDYVFDG 106 Query: 126 PHNTIITENDPQESITPYGYTKYVVERELLQHNKVNGLRSVVLR 169 E+D + YG +K E+ + G ++++R Sbjct: 107 EGKRPYREDDATNPLNVYGQSKLAGEQAIRAA----GPNALIVR 146 >gnl|CDD|178259 PLN02653, PLN02653, GDP-mannose 4,6-dehydratase. Length = 340 Score = 39.4 bits (92), Expect = 0.001 Identities = 62/289 (21%), Positives = 110/289 (38%), Gaps = 60/289 (20%) Query: 53 DICDYTNLRAVFAKYQPASVMHFAGLTNISESVKNPSLFYEINIKGSFNL---IATAIES 109 D+ D ++LR +P V + A ++++ S + P ++ G+ L + + Sbjct: 68 DLSDASSLRRWLDDIKPDEVYNLAAQSHVAVSFEMPDYTADVVATGALRLLEAVRLHGQE 127 Query: 110 NVR--RFIFSSTCATYGIPHNTIITENDPQESITP-YGYTKYVVERELLQHNKVNGLRSV 166 R ++ + + YG + PQ TP + + Y V + VN Sbjct: 128 TGRQIKYYQAGSSEMYG-------STPPPQSETTPFHPRSPYAVAKVAAHWYTVN----- 175 Query: 167 VLRYFNAAGATL-DSIIGEWHNP---ETHV---IPLAI-KTAMGYQNSFKVF-GQDYATR 217 Y A G + I+ +P E V I A+ + +G Q K+F G A+R Sbjct: 176 ---YREAYGLFACNGILFNHESPRRGENFVTRKITRAVGRIKVGLQK--KLFLGNLDASR 230 Query: 218 DGTCLRDYIHVLDLANAHIMALEYLINQGDSIAINLGTGTGITVKEIISTIQSMYECAFP 277 D DY+ A+ ++ Q + T TV+E + E AF Sbjct: 231 DWGFAGDYVE----------AMWLMLQQEKPDDYVVATEESHTVEEFL-------EEAFG 273 Query: 278 ---ITYESRRIGDP----PS----LVADNKKAKKILGWNPKYKLRDIIE 315 + ++ DP P+ L D KA+++LGW PK +++ Sbjct: 274 YVGLNWKDHVEIDPRYFRPAEVDNLKGDASKAREVLGWKPKVGFEQLVK 322 >gnl|CDD|129055 smart00822, PKS_KR, This enzymatic domain is part of bacterial polyketide synthases and catalyses the first step in the reductive modification of the beta-carbonyl centres in the growing polyketide chain. It uses NADPH to reduce the keto group to a hydroxy group. Length = 180 Score = 38.6 bits (91), Expect = 0.002 Identities = 37/151 (24%), Positives = 53/151 (35%), Gaps = 39/151 (25%) Query: 6 VLVVGGAGYIGAHTCRVLYERG-----FL---PIVLDNLSSGHAEFVLWGPLEQVDICDY 57 L+ GG G +G R L ERG L + AE G V CD Sbjct: 3 YLITGGLGGLGLELARWLAERGARHLVLLSRSGPDAPGAAELLAELEALGAEVTVVACDV 62 Query: 58 TN-------LRAVFAKYQP-ASVMHFAG------LTNISE-------SVKNPSLFYEINI 96 + L A+ A+ P V+H AG L N++ + K + Sbjct: 63 ADRAALAAALAAIPARLGPLRGVIHAAGVLDDGLLANLTPERFAAVLAPK---------V 113 Query: 97 KGSFNLIATAIESNVRRFI-FSSTCATYGIP 126 G++NL + + F+ FSS G P Sbjct: 114 DGAWNLHELTRDLPLDFFVLFSSVAGVLGNP 144 >gnl|CDD|162378 TIGR01472, gmd, GDP-mannose 4,6-dehydratase. Excluded from this model are members of the clade that score poorly because of highly dervied (phylogenetically long-branch) sequences, e.g. Aneurinibacillus thermoaerophilus Gmd, described as a bifunctional GDP-mannose 4,6-dehydratase/GDP-6-deoxy-D-lyxo-4-hexulose reductase (PUBMED:11096116). Length = 343 Score = 38.7 bits (90), Expect = 0.002 Identities = 62/293 (21%), Positives = 106/293 (36%), Gaps = 34/293 (11%) Query: 53 DICDYTNLRAVFAKYQPASVMHFAGLTNISESVKNPSLFYEINIKGSFNLI-ATAIESNV 111 D+ D +NLR + + +P + + A +++ S + P +++ G+ L+ A + Sbjct: 63 DLTDSSNLRRIIDEIKPTEIYNLAAQSHVKVSFEIPEYTADVDGIGTLRLLEAVRTLGLI 122 Query: 112 R--RFIFSSTCATYGIPHNTIITENDPQESITPYGYTK-----YVVE-RELLQHNKVNGL 163 + +F +ST YG E P +PY K V RE VNG+ Sbjct: 123 KSVKFYQASTSELYGKVQEIPQNETTPFYPRSPYAAAKLYAHWITVNYREAYGLFAVNGI 182 Query: 164 RSVVLRYFNAAGATLDSIIGEWHNPETHVIPLAI-KTAMGYQNSFKVFGQDYATRDGTCL 222 FN GE N T I A K +G Q G A RD Sbjct: 183 ------LFNHE----SPRRGE--NFVTRKITRAAAKIKLGLQEKL-YLGNLDAKRDWGHA 229 Query: 223 RDYIHVLDLANAHIMALEYLINQGDSIAIN-------LGTGTGI----TVKEIISTIQSM 271 +DY+ + L +Y+I G++ ++ G + + + Sbjct: 230 KDYVEAMWLMLQQDKPDDYVIATGETHSVREFVEVSFEYIGKTLNWKDKGINEVGRCKET 289 Query: 272 YECAFPITYESRRIGDPPSLVADNKKAKKILGWNPKYKLRDIIESAWNWHLKY 324 + I R + L+ D KAK+ LGW P+ +++ L+ Sbjct: 290 GKVHVEIDPRYFRPTEVDLLLGDATKAKEKLGWKPEVSFEKLVKEMVEEDLEL 342 >gnl|CDD|183375 PRK11908, PRK11908, NAD-dependent epimerase/dehydratase family protein; Provisional. Length = 347 Score = 38.5 bits (90), Expect = 0.002 Identities = 45/197 (22%), Positives = 84/197 (42%), Gaps = 30/197 (15%) Query: 4 KNVLVVGGAGYIGAHTC-RVL----YERGFLPIVLDNLSS--GHAEF-VLWGPLE-QVDI 54 K VL++G G+IG H R+L +E + + D L H G + + Sbjct: 2 KKVLILGVNGFIGHHLSKRILETTDWEVYGMDMQTDRLGDLVNHPRMHFFEGDITINKEW 61 Query: 55 CDYTNLRAVFAKYQPASVMHFAGLTNISESVKNPSLFYEINIKGSFNLIATAIESNVRRF 114 +Y +++ ++ + + VK P +E++ + + ++ +A++ + Sbjct: 62 IEY-HVKKCDV------ILPLVAIATPATYVKQPLRVFELDFEANLPIVRSAVKYG-KHL 113 Query: 115 IFSSTCATYGIPHNTIITENDPQESITPYG--------Y--TKYVVERELLQHNKVNGLR 164 +F ST YG+ + E DP+ S YG Y +K +++R + + GL Sbjct: 114 VFPSTSEVYGMCPDE---EFDPEASPLVYGPINKPRWIYACSKQLMDRVIWAYGMEEGLN 170 Query: 165 SVVLRYFNAAGATLDSI 181 + R FN G LDSI Sbjct: 171 FTLFRPFNWIGPGLDSI 187 >gnl|CDD|178567 PLN02986, PLN02986, cinnamyl-alcohol dehydrogenase family protein. Length = 322 Score = 38.1 bits (88), Expect = 0.003 Identities = 68/302 (22%), Positives = 117/302 (38%), Gaps = 71/302 (23%) Query: 4 KNVLVVGGAGYIGAHTCRVLYERGF--------------------LPIVLDNLSSGHAEF 43 K V V G +GYI + ++L RG+ L + L A+ Sbjct: 6 KLVCVTGASGYIASWIVKLLLLRGYTVKATVRDLTDRKKTEHLLALDGAKERLKLFKADL 65 Query: 44 VLWGPLEQ-VDICDYTNLRAVFAKYQPASVMHFAGLTNISESVKNPSLFYEINIKGSFNL 102 + EQ ++ CD AVF P F + + + +P+L KG+ N+ Sbjct: 66 LEESSFEQAIEGCD-----AVFHTASPV----FFTVKDPQTELIDPAL------KGTINV 110 Query: 103 IATAIE-SNVRRFIFSSTCATY-----GIPHNTIITE---NDP---QESITPYGYTKYVV 150 + T E +V+R I +S+ A I N ++ E +DP +E+ Y +K + Sbjct: 111 LNTCKETPSVKRVILTSSTAAVLFRQPPIEANDVVDETFFSDPSLCRETKNWYPLSKILA 170 Query: 151 ERELLQHNKVNGLRSVVLRYFNAAGATLDSIIGEWHNPETHVIPLAIKTAMGYQNSFKVF 210 E + K NG+ VVL G L + +++ + + N +F Sbjct: 171 ENAAWEFAKDNGIDMVVLNPGFICGPLLQPTLN-----------FSVELIVDFINGKNLF 219 Query: 211 GQDYATRDGTCLRDYIHVLDLANAHIMALEYLINQGDSIAINLGTGTGITVKEIISTIQS 270 + ++ V D+A AHI ALE G I G ++V +II ++ Sbjct: 220 NNRFYR--------FVDVRDVALAHIKALETPSANGRYII----DGPIMSVNDIIDILRE 267 Query: 271 MY 272 ++ Sbjct: 268 LF 269 >gnl|CDD|149205 pfam07993, NAD_binding_4, Male sterility protein. This family represents the C-terminal region of the male sterility protein in a number of arabidopsis and drosophila. A sequence-related jojoba acyl CoA reductase is also included. Length = 245 Score = 37.2 bits (87), Expect = 0.006 Identities = 18/107 (16%), Positives = 34/107 (31%), Gaps = 14/107 (13%) Query: 72 VMHFAGLTNISESVKNPSLFYEINIKGSFNLIATAIESNVRRFIFSSTCATYGIPHNTI- 130 ++H A N E + N+ G+ ++ A + F ST G + Sbjct: 89 IIHNAATVNFVEPYSD---LRATNVLGTREVLRLAKQMKKLPFHHVSTAYVNGERGGLLE 145 Query: 131 --------ITENDPQESITPYGYTKYVVERELLQHNKVNGLRSVVLR 169 Y +K++ E+ + + GL V+ R Sbjct: 146 EKPYPLDEDEPALLGGLPNGYTQSKWLAEQLVREAAG--GLPVVIYR 190 >gnl|CDD|180183 PRK05653, fabG, 3-ketoacyl-(acyl-carrier-protein) reductase; Validated. Length = 246 Score = 34.4 bits (80), Expect = 0.041 Identities = 15/78 (19%), Positives = 25/78 (32%), Gaps = 19/78 (24%) Query: 1 MENKNVLVVGGAGYIGAHTCRVLYERGFLPIVLD-------------NLSSGHAEFVLWG 47 ++ K LV G + IG L G ++ D + G A + Sbjct: 3 LQGKTALVTGASRGIGRAIALRLAADGAKVVIYDSNEEAAEALAAELRAAGGEARVL--- 59 Query: 48 PLEQVDICDYTNLRAVFA 65 D+ D +RA+ Sbjct: 60 ---VFDVSDEAAVRALIE 74 >gnl|CDD|178047 PLN02427, PLN02427, UDP-apiose/xylose synthase. Length = 386 Score = 34.4 bits (79), Expect = 0.043 Identities = 56/263 (21%), Positives = 92/263 (34%), Gaps = 57/263 (21%) Query: 110 NVRRFIFSSTCATYGIPHNTIITENDP----------QESITP------------YGYTK 147 N +R I STC YG + + ++ P +E +P Y K Sbjct: 127 NNKRLIHFSTCEVYGKTIGSFLPKDHPLRQDPAFYVLKEDESPCIFGSIEKQRWSYACAK 186 Query: 148 YVVERELLQHNKVNGLRSVVLRYFNAAGATLDSIIGEWHNPETHVIPLAIKTAMGYQNSF 207 ++ER + NGL ++R FN W P IP + G Sbjct: 187 QLIERLIYAEGAENGLEFTIVRPFN------------WIGPRMDFIPGIDGPSEGVPRVL 234 Query: 208 KVFGQDYATRD-------GTCLRDYIHVLDLANAHIMALEYLINQGDSIAINLGT-GTGI 259 F + R+ G R ++++ D A ++ +E I N+G + Sbjct: 235 ACFSNNLLRREPLKLVDGGQSQRTFVYIKDAIEAVLLMIENPARANGHI-FNVGNPNNEV 293 Query: 260 TVKEIISTIQSMY-----ECAFPIT--------YESRRIGDPPSLVADNKKAKKILGWNP 306 TV+++ + +Y E A + D + D K LGWNP Sbjct: 294 TVRQLAEMMTEVYAKVSGEPALEEPTVDVSSKEFYGEGYDDSDKRIPDMTIINKQLGWNP 353 Query: 307 KYKLRDIIESAWNW-HLKYPRSL 328 K L D++ES + H Y ++ Sbjct: 354 KTSLWDLLESTLTYQHKTYAEAI 376 >gnl|CDD|169699 PRK09186, PRK09186, flagellin modification protein A; Provisional. Length = 256 Score = 33.8 bits (78), Expect = 0.064 Identities = 18/79 (22%), Positives = 32/79 (40%), Gaps = 13/79 (16%) Query: 1 MENKNVLVVGGAGYIGAHTCRVLYERGFLPIV-----------LDNLSSGHAEFVLWGPL 49 ++ K +L+ G G IG+ + + E G + I L++L L L Sbjct: 2 LKGKTILITGAGGLIGSALVKAILEAGGIVIAADIDKEALNELLESLGKEFKSKKL--SL 59 Query: 50 EQVDICDYTNLRAVFAKYQ 68 ++DI D +L +K Sbjct: 60 VELDITDQESLEEFLSKSA 78 >gnl|CDD|162515 TIGR01746, Thioester-redct, thioester reductase domain. It has been suggested that a NADP-binding motif can be found in the N-terminal portion of this domain that may form a Rossman-type fold. Length = 367 Score = 33.5 bits (77), Expect = 0.069 Identities = 24/121 (19%), Positives = 45/121 (37%), Gaps = 12/121 (9%) Query: 71 SVMHFAGLTNISESVKNPSLFYEINIKGSFNLIATAIESNVRRFIFSSTCATYGIPHNTI 130 +++H L N V S N+ G+ ++ A + + ST + + Sbjct: 91 TIVHNGALVNW---VYPYSELRAANVLGTREVLRLAASGRAKPLHYVSTISVLAAIDLST 147 Query: 131 ITENDP-----QESITPYGYTKYVVERELLQHNKVNGLRSVVLRYFNAAGATLDSIIGEW 185 +TE+D Y +K+V E L++ GL ++R G +S G Sbjct: 148 VTEDDAIVTPPPGLAGGYAQSKWVAEL-LVREASDRGLPVTIVRPGRILG---NSYTGAI 203 Query: 186 H 186 + Sbjct: 204 N 204 >gnl|CDD|181389 PRK08324, PRK08324, short chain dehydrogenase; Validated. Length = 681 Score = 33.7 bits (78), Expect = 0.073 Identities = 27/114 (23%), Positives = 42/114 (36%), Gaps = 19/114 (16%) Query: 6 VLVVGGAGYIGAHTCRVLYERGFLPIVLD----NLSSGHAEFVLWGPLEQVDICDYTNLR 61 LV G AG IG T + L G ++ D + AE V CD T+ Sbjct: 425 ALVTGAAGGIGKATAKRLAAEGACVVLADLDEEAAEAAAAELGGPDRALGV-ACDVTDEA 483 Query: 62 AVFAKYQPASVMHFAGL-------------TNISESVKNPSLFYEINIKGSFNL 102 AV A ++ A + F G+ S ++ +++N G F + Sbjct: 484 AVQAAFEEA-ALAFGGVDIVVSNAGIAISGPIEETSDEDWRRSFDVNATGHFLV 536 >gnl|CDD|132250 TIGR03206, benzo_BadH, 2-hydroxycyclohexanecarboxyl-CoA dehydrogenase. Members of this protein family are the enzyme 2-hydroxycyclohexanecarboxyl-CoA dehydrogenase. The enzymatic properties were confirmed experimentally in Rhodopseudomonas palustris; the enzyme is homotetrameric, and not sensitive to oxygen. This enzyme is part of proposed pathway for degradation of benzoyl-CoA to 3-hydroxypimeloyl-CoA that differs from the analogous in Thauera aromatica. It also may occur in degradation of the non-aromatic compound cyclohexane-1-carboxylate. Length = 250 Score = 33.4 bits (76), Expect = 0.086 Identities = 18/44 (40%), Positives = 23/44 (52%), Gaps = 1/44 (2%) Query: 1 MENKNVLVVGGAGYIGAHTCRVLYERGFLPIVLDNLSSGHAEFV 44 +++K +V GG G IG TCR E G V D L+ AE V Sbjct: 1 LKDKTAIVTGGGGGIGGATCRRFAEEGAKVAVFD-LNREAAEKV 43 >gnl|CDD|168574 PRK06484, PRK06484, short chain dehydrogenase; Validated. Length = 520 Score = 33.3 bits (76), Expect = 0.098 Identities = 25/115 (21%), Positives = 47/115 (40%), Gaps = 14/115 (12%) Query: 6 VLVVGGAGYIGAHTCRVLYERGFLPIVLDNLSSGHAEF--VLWGP--LEQVDICDYTNLR 61 V + GGA IG G +++D + G + L Q DI D + Sbjct: 272 VAITGGARGIGRAVADRFAAAGDRLLIIDRDAEGAKKLAEALGDEHLSVQADITDEAAVE 331 Query: 62 AVFA----KYQPASVM-HFAG-----LTNISESVKNPSLFYEINIKGSFNLIATA 106 + FA ++ V+ + AG ++ +S ++ + Y++N+ G+F A Sbjct: 332 SAFAQIQARWGRLDVLVNNAGIAEVFKPSLEQSAEDFTRVYDVNLSGAFACARAA 386 >gnl|CDD|181668 PRK09135, PRK09135, pteridine reductase; Provisional. Length = 249 Score = 33.0 bits (76), Expect = 0.11 Identities = 13/42 (30%), Positives = 20/42 (47%) Query: 1 MENKNVLVVGGAGYIGAHTCRVLYERGFLPIVLDNLSSGHAE 42 K L+ GGA IGA R L+ G+ + + S+ A+ Sbjct: 4 DSAKVALITGGARRIGAAIARTLHAAGYRVAIHYHRSAAEAD 45 >gnl|CDD|178263 PLN02657, PLN02657, 3,8-divinyl protochlorophyllide a 8-vinyl reductase. Length = 390 Score = 32.8 bits (75), Expect = 0.14 Identities = 31/129 (24%), Positives = 52/129 (40%), Gaps = 13/129 (10%) Query: 2 ENKNVLVVGGAGYIGAHTCRVLYERGFLPIVLDNLSSG-------HAEFVLWGPLEQV-- 52 ++ VLVVG GYIG R L RG+ + + SG E V Sbjct: 59 KDVTVLVVGATGYIGKFVVRELVRRGYNVVAVAREKSGIRGKNGKEDTKKELPGAEVVFG 118 Query: 53 DICDYTNLRAVFAKYQPASVMHFAGLTNISESVKNPSLFYEINIKGSFNLIATAIESNVR 112 D+ D +LR V + + L + + VK+ ++I+ + + N + E + Sbjct: 119 DVTDADSLRKVLFSEGDPVDVVVSCLASRTGGVKDS---WKIDYQATKNSLDAGREVGAK 175 Query: 113 RFI-FSSTC 120 F+ S+ C Sbjct: 176 HFVLLSAIC 184 >gnl|CDD|181508 PRK08628, PRK08628, short chain dehydrogenase; Provisional. Length = 258 Score = 32.6 bits (75), Expect = 0.15 Identities = 13/35 (37%), Positives = 20/35 (57%) Query: 1 MENKNVLVVGGAGYIGAHTCRVLYERGFLPIVLDN 35 +++K V+V GGA IGA L E G +P++ Sbjct: 5 LKDKVVIVTGGASGIGAAISLRLAEEGAIPVIFGR 39 >gnl|CDD|178569 PLN02989, PLN02989, cinnamyl-alcohol dehydrogenase family protein. Length = 325 Score = 31.5 bits (71), Expect = 0.34 Identities = 63/291 (21%), Positives = 108/291 (37%), Gaps = 48/291 (16%) Query: 4 KNVLVVGGAGYIGAHTCRVLYERGFLPIVL-----DNLSSGHAEFVLWGPLEQVDICDYT 58 K V V G +GYI + ++L RG+ D + H L G E++ + Sbjct: 6 KVVCVTGASGYIASWIVKLLLFRGYTINATVRDPKDRKKTDHL-LALDGAKERLKLFKAD 64 Query: 59 NLRAVFAKYQPA-----SVMHFAGLTNISESVKNPSLFYEINIKGSFNLIATAIE-SNVR 112 L ++ A +V H A I+ + G+ N++ T + S+V+ Sbjct: 65 LLDE--GSFELAIDGCETVFHTASPVAITVKTDPQVELINPAVNGTINVLRTCTKVSSVK 122 Query: 113 RFIFSSTCATYGIPH-----NTIITEN---DP---QESITPYGYTKYVVERELLQHNKVN 161 R I +S+ A P N ++ E +P +E Y +K + E + K N Sbjct: 123 RVILTSSMAAVLAPETKLGPNDVVDETFFTNPSFAEERKQWYVLSKTLAEDAAWRFAKDN 182 Query: 162 GLRSVVLRYFNAAGATLDSIIGEWHNPETHVIPLAIKTAMGYQNSFKVFGQDYATRDGTC 221 + +VL + G P + I M +N F T Sbjct: 183 EIDLIVL--------NPGLVTGPILQPTLNFSVAVIVELMKGKNPFN-----------TT 223 Query: 222 LRDYIHVLDLANAHIMALEYLINQGDSIAINLGTGTGITVKEIISTIQSMY 272 ++ V D+A AH+ ALE G I G +T+K+I + ++ + Sbjct: 224 HHRFVDVRDVALAHVKALETPSANGRYII----DGPVVTIKDIENVLREFF 270 >gnl|CDD|184593 PRK14258, PRK14258, phosphate ABC transporter ATP-binding protein; Provisional. Length = 261 Score = 31.2 bits (70), Expect = 0.42 Identities = 15/30 (50%), Positives = 19/30 (63%), Gaps = 1/30 (3%) Query: 289 PSLVADN-KKAKKILGWNPKYKLRDIIESA 317 P V DN KI+GW PK ++ DI+ESA Sbjct: 101 PMSVYDNVAYGVKIVGWRPKLEIDDIVESA 130 >gnl|CDD|162529 TIGR01777, yfcH, conserved hypothetical protein TIGR01777. This model represents a clade of proteins of unknown function including the E. coli yfcH protein. Length = 292 Score = 31.1 bits (71), Expect = 0.47 Identities = 33/174 (18%), Positives = 60/174 (34%), Gaps = 23/174 (13%) Query: 6 VLVVGGAGYIGAHTCRVLYERGFLPIVLDNLSSGHAEFVLWGPLEQVDICDYTNLRAVFA 65 +L+ GG G+IG + L + G +L A + + A Sbjct: 1 ILITGGTGFIGRALTQRLTKDGHEVTILTRSPPAGAN-------TKWEGYKPWAPLAESE 53 Query: 66 KYQPA-SVMHFAGLTNISESVKNPS---LFYEINIKGSFNLIATAIESNVRR---FIFSS 118 + A +V++ AG I++ + I + + AI + ++ FI +S Sbjct: 54 ALEGADAVINLAGE-PIADKRWTEERKQEIRDSRIDTT-RALVEAIAAAEQKPKVFISAS 111 Query: 119 TCATYGIPHNTIITENDPQESITPYGYTKYVV---ERELLQHNKVNGLRSVVLR 169 YG + + TE D + + E + G R V+LR Sbjct: 112 AVGYYGTSEDRVFTEEDSP---AGDDFLAELCRDWEEAAQAAEDL-GTRVVLLR 161 >gnl|CDD|180462 PRK06198, PRK06198, short chain dehydrogenase; Provisional. Length = 260 Score = 30.7 bits (70), Expect = 0.53 Identities = 39/141 (27%), Positives = 57/141 (40%), Gaps = 41/141 (29%) Query: 2 ENKNVLVVGGAGYIGAHTCRVLYERGFLPIVL------------DNLSSGHAE--FVLWG 47 + K LV GG +GA R ERG +V+ L + A+ FV Sbjct: 5 DGKVALVTGGTQGLGAAIARAFAERGAAGLVICGRNAEKGEAQAAELEALGAKAVFV--- 61 Query: 48 PLEQVDICDYTNLRAVFAKYQPASVMHF---------AGLT---NISESVKNPSLF---Y 92 Q D+ D + R V A A+ F AGLT I ++ +P LF + Sbjct: 62 ---QADLSDVEDCRRVVA----AADEAFGRLDALVNAAGLTDRGTILDT--SPELFDRHF 112 Query: 93 EINIKGSFNLIATAIESNVRR 113 +N++ F L+ AI+ RR Sbjct: 113 AVNVRAPFFLMQEAIKLMRRR 133 >gnl|CDD|181045 PRK07578, PRK07578, short chain dehydrogenase; Provisional. Length = 199 Score = 30.6 bits (70), Expect = 0.56 Identities = 25/86 (29%), Positives = 38/86 (44%), Gaps = 18/86 (20%) Query: 6 VLVVGGAGYIGAHTCRVLYERGFLPIVLDNLSSGHAEFVLWGPLEQVDICDYTNLRAVFA 65 +LV+G +G IG L +R ++ SSG QVDI D ++RA+F Sbjct: 3 ILVIGASGTIGRAVVAELSKRH--EVITAGRSSGDV---------QVDITDPASIRALFE 51 Query: 66 KYQP-------ASVMHFAGLTNISES 84 K A +HFA L +++ Sbjct: 52 KVGKVDAVVSAAGKVHFAPLAEMTDE 77 >gnl|CDD|149646 pfam08659, KR, KR domain. This enzymatic domain is part of bacterial polyketide synthases and catalyses the first step in the reductive modification of the beta-carbonyl centres in the growing polyketide chain. It uses NADPH to reduce the keto group to a hydroxy group. Length = 181 Score = 30.6 bits (70), Expect = 0.56 Identities = 11/22 (50%), Positives = 12/22 (54%) Query: 6 VLVVGGAGYIGAHTCRVLYERG 27 LV GG G +G R L ERG Sbjct: 3 YLVTGGLGGLGLELARWLAERG 24 >gnl|CDD|181299 PRK08220, PRK08220, 2,3-dihydroxybenzoate-2,3-dehydrogenase; Validated. Length = 252 Score = 30.6 bits (70), Expect = 0.62 Identities = 23/118 (19%), Positives = 35/118 (29%), Gaps = 23/118 (19%) Query: 1 MENKNVLVVGGAGYIGAHTCRVLYERGFLPIVLD----NLSSGHAEFVLWGPLEQVDICD 56 K V V G A IG E G I D +D+ D Sbjct: 6 FSGKTVWVTGAAQGIGYAVALAFVEAGAKVIGFDQAFLTQEDYPFA------TFVLDVSD 59 Query: 57 YTNLRAV----FAKYQP-------ASVMHFAGLTNISESVKNPSLFYEINIKGSFNLI 103 + V A+ P A ++ ++S+ + +N G+FNL Sbjct: 60 AAAVAQVCQRLLAETGPLDVLVNAAGILRMGATDSLSDEDWQQTF--AVNAGGAFNLF 115 >gnl|CDD|180126 PRK05557, fabG, 3-ketoacyl-(acyl-carrier-protein) reductase; Validated. Length = 248 Score = 29.8 bits (68), Expect = 1.0 Identities = 9/28 (32%), Positives = 12/28 (42%) Query: 1 MENKNVLVVGGAGYIGAHTCRVLYERGF 28 +E K LV G + IG L +G Sbjct: 3 LEGKVALVTGASRGIGRAIAERLAAQGA 30 >gnl|CDD|178268 PLN02662, PLN02662, cinnamyl-alcohol dehydrogenase family protein. Length = 322 Score = 28.5 bits (64), Expect = 2.4 Identities = 11/28 (39%), Positives = 17/28 (60%) Query: 1 MENKNVLVVGGAGYIGAHTCRVLYERGF 28 E K V V G +GYI + ++L +RG+ Sbjct: 2 GEGKVVCVTGASGYIASWLVKLLLQRGY 29 Score = 27.4 bits (61), Expect = 5.5 Identities = 9/16 (56%), Positives = 12/16 (75%) Query: 225 YIHVLDLANAHIMALE 240 ++ V D+ANAHI A E Sbjct: 225 WVDVRDVANAHIQAFE 240 >gnl|CDD|183775 PRK12826, PRK12826, 3-ketoacyl-(acyl-carrier-protein) reductase; Reviewed. Length = 251 Score = 28.3 bits (64), Expect = 2.9 Identities = 13/44 (29%), Positives = 16/44 (36%) Query: 1 MENKNVLVVGGAGYIGAHTCRVLYERGFLPIVLDNLSSGHAEFV 44 +E + LV G A IG L G IV+D A Sbjct: 4 LEGRVALVTGAARGIGRAIAVRLAADGAEVIVVDICGDDAAATA 47 >gnl|CDD|163114 TIGR03031, cas_csx12, CRISPR-associated protein, Csx12 family. Members of this family of CRISPR-associated (cas) protein are found, so far, in CRISPR/cas loci in Wolinella succinogenes DSM 1740, Legionella pneumophila str. Paris, and Francisella tularensis, where the last probably is an example of a degenerate CRISPR locus, having neither repeats nor a functional Cas1. The characteristic repeat length is 37 base pairs and period is about 72. One region of this large protein shows sequence similarity to PFAM model pfam01844, HNH endonuclease. Length = 802 Score = 28.4 bits (63), Expect = 2.9 Identities = 9/38 (23%), Positives = 14/38 (36%) Query: 265 ISTIQSMYECAFPITYESRRIGDPPSLVADNKKAKKIL 302 I + F I E DP L ++ K K++ Sbjct: 545 IEEARKNSGNTFKIDIEEALSKDPEHLTSNEKALLKLI 582 >gnl|CDD|184511 PRK14106, murD, UDP-N-acetylmuramoyl-L-alanyl-D-glutamate synthetase; Provisional. Length = 450 Score = 28.0 bits (63), Expect = 3.1 Identities = 13/34 (38%), Positives = 18/34 (52%), Gaps = 1/34 (2%) Query: 1 MENKNVLVVGGAGYIGAHTCRVLYERGFLPIVLD 34 ++ K VLVV GAG G + L + G I+ D Sbjct: 3 LKGKKVLVV-GAGVSGLALAKFLKKLGAKVILTD 35 >gnl|CDD|180804 PRK07041, PRK07041, short chain dehydrogenase; Provisional. Length = 230 Score = 28.1 bits (63), Expect = 3.4 Identities = 20/70 (28%), Positives = 27/70 (38%), Gaps = 6/70 (8%) Query: 7 LVVGGAGYIGAHTCRVLYERG-FLPIV---LDNLSSGHAEFVLWGPLEQV--DICDYTNL 60 LVVGG+ IG R G + I D L++ P+ DI D + Sbjct: 1 LVVGGSSGIGLALARAFAAEGARVTIASRSRDRLAAAARALGGGAPVRTAALDITDEAAV 60 Query: 61 RAVFAKYQPA 70 A FA+ P Sbjct: 61 DAFFAEAGPF 70 >gnl|CDD|181335 PRK08264, PRK08264, short chain dehydrogenase; Validated. Length = 238 Score = 27.9 bits (63), Expect = 3.7 Identities = 10/27 (37%), Positives = 12/27 (44%) Query: 1 MENKNVLVVGGAGYIGAHTCRVLYERG 27 ++ K VLV G IG L RG Sbjct: 4 IKGKVVLVTGANRGIGRAFVEQLLARG 30 >gnl|CDD|181239 PRK08125, PRK08125, bifunctional UDP-glucuronic acid decarboxylase/UDP-4-amino-4-deoxy-L-arabinose formyltransferase; Validated. Length = 660 Score = 27.6 bits (62), Expect = 4.1 Identities = 27/106 (25%), Positives = 51/106 (48%), Gaps = 8/106 (7%) Query: 83 ESVKNPSLFYEINIKGSFNLIATAIESNVRRFIFSSTCATYGIPHNTIITENDPQESITP 142 E +NP +E++ + + +I ++ N +R IF ST YG+ + E+ + P Sbjct: 397 EYTRNPLRVFELDFEENLKIIRYCVKYN-KRIIFPSTSEVYGMCTDKYFDEDTSNLIVGP 455 Query: 143 -------YGYTKYVVERELLQHNKVNGLRSVVLRYFNAAGATLDSI 181 Y +K +++R + + + GLR + R FN G LD++ Sbjct: 456 INKQRWIYSVSKQLLDRVIWAYGEKEGLRFTLFRPFNWMGPRLDNL 501 Score = 26.5 bits (59), Expect = 9.5 Identities = 7/28 (25%), Positives = 18/28 (64%) Query: 296 KKAKKILGWNPKYKLRDIIESAWNWHLK 323 + A+++L W PK +++ I+ ++ L+ Sbjct: 624 RNARRLLDWEPKIDMQETIDETLDFFLR 651 >gnl|CDD|183776 PRK12827, PRK12827, short chain dehydrogenase; Provisional. Length = 249 Score = 27.8 bits (62), Expect = 4.3 Identities = 32/160 (20%), Positives = 50/160 (31%), Gaps = 53/160 (33%) Query: 4 KNVLVVGGAGYIGAHTCRVLYERGFLPIVLDNLSS-----------------GHAEFVLW 46 + VL+ GG+G +G L G IVLD G A + Sbjct: 7 RRVLITGGSGGLGRAIAVRLAADGADVIVLDIHPMRGRAEADAVAAGIEAAGGKALGL-- 64 Query: 47 GPLEQVDICDYTNLRAVFAKYQPASVMHFAGLTNISESVKNPSLF--------------- 91 D+ D+ RA A V F L + V N + Sbjct: 65 ----AFDVRDFAATRAALD----AGVEEFGRLDIL---VNNAGIATDAAFAELSIEEWDD 113 Query: 92 -YEINIKGSFNLIATAIESNVR-----RFIFSSTCATYGI 125 ++N+ G FN+ A+ +R R + + G+ Sbjct: 114 VIDVNLDGFFNVTQAALPPMIRARRGGRIVN--IASVAGV 151 >gnl|CDD|184103 PRK13512, PRK13512, coenzyme A disulfide reductase; Provisional. Length = 438 Score = 27.4 bits (61), Expect = 4.6 Identities = 13/27 (48%), Positives = 14/27 (51%), Gaps = 1/27 (3%) Query: 1 MENKNVLVVGGAGYIGAHTCRVLYERG 27 + LVVG AGYI LYERG Sbjct: 146 NQVDKALVVG-AGYISLEVLENLYERG 171 >gnl|CDD|178819 PRK00045, hemA, glutamyl-tRNA reductase; Reviewed. Length = 423 Score = 27.5 bits (62), Expect = 5.0 Identities = 15/44 (34%), Positives = 23/44 (52%), Gaps = 1/44 (2%) Query: 1 MENKNVLVVGGAGYIGAHTCRVLYERGFLPIVLDNLSSGHAEFV 44 + K VLV+ GAG +G + L E+G I + N + AE + Sbjct: 180 LSGKKVLVI-GAGEMGELVAKHLAEKGVRKITVANRTLERAEEL 222 >gnl|CDD|180879 PRK07201, PRK07201, short chain dehydrogenase; Provisional. Length = 657 Score = 27.2 bits (61), Expect = 6.1 Identities = 18/66 (27%), Positives = 25/66 (37%), Gaps = 4/66 (6%) Query: 2 ENKNVLVVGGAGYIGAHTCRVLYERGFLPIVL----DNLSSGHAEFVLWGPLEQVDICDY 57 K VL+ G + IG T + E G ++ + L AE G CD Sbjct: 370 VGKVVLITGASSGIGRATAIKVAEAGATVFLVARNGEALDELVAEIRAKGGTAHAYTCDL 429 Query: 58 TNLRAV 63 T+ AV Sbjct: 430 TDSAAV 435 >gnl|CDD|177655 PLN00016, PLN00016, RNA-binding protein; Provisional. Length = 378 Score = 27.4 bits (61), Expect = 6.3 Identities = 9/16 (56%), Positives = 12/16 (75%) Query: 297 KAKKILGWNPKYKLRD 312 KAK+ LGW PK+ L + Sbjct: 326 KAKEELGWTPKFDLVE 341 >gnl|CDD|163268 TIGR03443, alpha_am_amid, L-aminoadipate-semialdehyde dehydrogenase. Members of this protein family are L-aminoadipate-semialdehyde dehydrogenase (EC 1.2.1.31), product of the LYS2 gene. It is also called alpha-aminoadipate reductase. In fungi, lysine is synthesized via aminoadipate. Currently, all members of this family are fungal. Length = 1389 Score = 27.0 bits (60), Expect = 7.9 Identities = 7/25 (28%), Positives = 11/25 (44%) Query: 2 ENKNVLVVGGAGYIGAHTCRVLYER 26 V + G G++G+ R L R Sbjct: 970 TPITVFLTGATGFLGSFILRDLLTR 994 >gnl|CDD|178616 PLN03064, PLN03064, alpha,alpha-trehalose-phosphate synthase (UDP-forming); Provisional. Length = 934 Score = 26.7 bits (59), Expect = 8.1 Identities = 20/71 (28%), Positives = 33/71 (46%), Gaps = 12/71 (16%) Query: 21 RVLYERGFLPIVLDNLS-----SGHAEFVLWG-----PLEQVDICDYT-NLRAVFAKYQP 69 + L E+ +P+ LD +G+ +LW L Q D T + ++ FA Y+ Sbjct: 157 KALAEKRCIPVFLDEEIVHQYYNGYCNNILWPLFHYLGLPQEDRLATTRSFQSQFAAYKK 216 Query: 70 ASVMHFAGLTN 80 A+ M FA + N Sbjct: 217 ANQM-FADVVN 226 >gnl|CDD|183489 PRK12384, PRK12384, sorbitol-6-phosphate dehydrogenase; Provisional. Length = 259 Score = 26.5 bits (59), Expect = 8.6 Identities = 12/33 (36%), Positives = 17/33 (51%) Query: 2 ENKNVLVVGGAGYIGAHTCRVLYERGFLPIVLD 34 N+ +V+GG +GA C L E G+ V D Sbjct: 1 MNQVAVVIGGGQTLGAFLCHGLAEEGYRVAVAD 33 >gnl|CDD|129984 TIGR00906, 2A0303, cationic amino acid transport permease. Length = 557 Score = 26.7 bits (59), Expect = 9.2 Identities = 18/53 (33%), Positives = 22/53 (41%), Gaps = 3/53 (5%) Query: 96 IKGS--FNLIATAIESNVRRFIFSSTCATYGIPHNTIITENDPQESITPYGYT 146 +K S N I TAI V F+ + T N ITE PYG+T Sbjct: 180 VKESAWVNKIFTAINILVLLFVIIAG-FTKADVANWSITEEKGAGGFMPYGFT 231 >gnl|CDD|183833 PRK12939, PRK12939, short chain dehydrogenase; Provisional. Length = 250 Score = 26.5 bits (59), Expect = 9.3 Identities = 32/154 (20%), Positives = 58/154 (37%), Gaps = 33/154 (21%) Query: 1 MENKNVLVVGGAGYIGAHTCRVLYERGFLPIVLDNL-------------SSGHAEFVLWG 47 + K LV G A +GA L E G D L + G A + Sbjct: 5 LAGKRALVTGAARGLGAAFAEALAEAGATVAFNDGLAAEARELAAALEAAGGRAHAI--- 61 Query: 48 PLEQVDICDYTNLRAVFAKYQPA-----SVMHFAGLTNISESVK-NPSLF---YEINIKG 98 D+ D +++ F A +++ AG+TN + + + + +N++G Sbjct: 62 ---AADLADPASVQRFFDAAAAALGGLDGLVNNAGITNSKSATELDIDTWDAVMNVNVRG 118 Query: 99 SFNLIATA----IESNVRRFI-FSSTCATYGIPH 127 +F ++ A +S R + +S A +G P Sbjct: 119 TFLMLRAALPHLRDSGRGRIVNLASDTALWGAPK 152 >gnl|CDD|181126 PRK07806, PRK07806, short chain dehydrogenase; Provisional. Length = 248 Score = 26.6 bits (59), Expect = 9.4 Identities = 10/25 (40%), Positives = 13/25 (52%) Query: 3 NKNVLVVGGAGYIGAHTCRVLYERG 27 K LV G + IGA T ++L G Sbjct: 6 GKTALVTGSSRGIGADTAKILAGAG 30 Database: CddB Posted date: Feb 4, 2011 9:54 PM Number of letters in database: 5,994,473 Number of sequences in database: 21,608 Lambda K H 0.320 0.136 0.413 Gapped Lambda K H 0.267 0.0686 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 21608 Number of Hits to DB: 5,476,730 Number of extensions: 348226 Number of successful extensions: 839 Number of sequences better than 10.0: 1 Number of HSP's gapped: 795 Number of HSP's successfully gapped: 89 Length of query: 333 Length of database: 5,994,473 Length adjustment: 94 Effective length of query: 239 Effective length of database: 3,963,321 Effective search space: 947233719 Effective search space used: 947233719 Neighboring words threshold: 11 Window for multiple hits: 40 X1: 16 ( 7.4 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.8 bits) S2: 57 (25.8 bits)