RPS-BLAST 2.2.22 [Sep-27-2009] Database: CddA 21,609 sequences; 6,263,737 total letters Searching..................................................done Query= gi|254780328|ref|YP_003064741.1| UDP-glucose 4-epimerase [Candidatus Liberibacter asiaticus str. psy62] (333 letters) >gnl|CDD|31284 COG1087, GalE, UDP-glucose 4-epimerase [Cell envelope biogenesis, outer membrane]. Length = 329 Score = 405 bits (1043), Expect = e-114 Identities = 144/327 (44%), Positives = 203/327 (62%), Gaps = 4/327 (1%) Query: 4 KNVLVVGGAGYIGAHTCRVLYERGFLPIVLDNLSSGHAEFVLW--GPLEQVDICDYTNLR 61 VLV GGAGYIG+HT R L + G +VLDNLS+GH +L + D+ D L Sbjct: 1 MKVLVTGGAGYIGSHTVRQLLKTGHEVVVLDNLSNGHKIALLKLQFKFYEGDLLDRALLT 60 Query: 62 AVFAKYQPASVMHFAGLTNISESVKNPSLFYEINIKGSFNLIATAIESNVRRFIFSSTCA 121 AVF + + +V+HFA ++ ESV+NP +Y+ N+ G+ NLI +++ V++FIFSST A Sbjct: 61 AVFEENKIDAVVHFAASISVGESVQNPLKYYDNNVVGTLNLIEAMLQTGVKKFIFSSTAA 120 Query: 122 TYGIPHNTIITENDPQESITPYGYTKYVVERELLQHNKVNGLRSVVLRYFNAAGATLDSI 181 YG P + I+E P I PYG +K + E L K N + V+LRYFN AGA D Sbjct: 121 VYGEPTTSPISETSPLAPINPYGRSKLMSEEILRDAAKANPFKVVILRYFNVAGACPDGT 180 Query: 182 IGEWHNPETHVIPLAIKTAMGYQNSFKVFGQDYATRDGTCLRDYIHVLDLANAHIMALEY 241 +G+ + T +IP+A + A+G ++ +FG DY T+DGTC+RDYIHV DLA+AH++AL+Y Sbjct: 181 LGQRYPGATLLIPVAAEAALGKRDKLFIFGDDYDTKDGTCIRDYIHVDDLADAHVLALKY 240 Query: 242 LINQGDSIAINLGTGTGITVKEIISTIQSMYECAFPITYESRRIGDPPSLVADNKKAKKI 301 L G + NLG+G G +V E+I + + P+ RR GDP LVAD+ KA++I Sbjct: 241 LKEGGSNNIFNLGSGNGFSVLEVIEAAKKVTGRDIPVEIAPRRAGDPAILVADSSKARQI 300 Query: 302 LGWNPKYK-LRDIIESAWNWH-LKYPR 326 LGW P Y L DII+ AW+WH ++ Sbjct: 301 LGWQPTYDDLEDIIKDAWDWHQQRHGD 327 >gnl|CDD|36585 KOG1371, KOG1371, KOG1371, UDP-glucose 4-epimerase/UDP-sulfoquinovose synthase [Cell wall/membrane/envelope biogenesis]. Length = 343 Score = 264 bits (676), Expect = 3e-71 Identities = 123/334 (36%), Positives = 183/334 (54%), Gaps = 12/334 (3%) Query: 4 KNVLVVGGAGYIGAHTCRVLYERGFLPIVLDNLSSGHAEFV-----LWG-----PLEQVD 53 K+VLV GGAGYIG+HT L +RG+ +++DNL++ + E + L G + D Sbjct: 3 KHVLVTGGAGYIGSHTVLALLKRGYGVVIVDNLNNSYLESLKRVRQLLGEGKSVFFVEGD 62 Query: 54 ICDYTNLRAVFAKYQPASVMHFAGLTNISESVKNPSLFYEINIKGSFNLIATAIESNVRR 113 + D L +F++ + +VMHFA L + ES++NP +Y NI G+ NL+ NV+ Sbjct: 63 LNDAEALEKLFSEVKFDAVMHFAALAAVGESMENPLSYYHNNIAGTLNLLEVMKAHNVKA 122 Query: 114 FIFSSTCATYGIPHNTIITENDPQESIT-PYGYTKYVVERELLQHNKVNGLRSVVLRYFN 172 +FSS+ YG+P ITE DP + T PYG TK +E + +NK G + LRYFN Sbjct: 123 LVFSSSATVYGLPTKVPITEEDPTDQPTNPYGKTKKAIEEIIHDYNKAYGWKVTGLRYFN 182 Query: 173 AAGATLDSIIGEWHNPETHVIPLAIK-TAMGYQNSFKVFGQDYATRDGTCLRDYIHVLDL 231 GA IGE + + + A+G + + +V G+DY T DGT +RDYIHVLDL Sbjct: 183 VIGAHPSGRIGEAPLGIPNNLLPYVFQVAIGRRPNLQVVGRDYTTIDGTIVRDYIHVLDL 242 Query: 232 ANAHIMALEYLINQGDSIAINLGTGTGITVKEIISTIQSMYECAFPITYESRRIGDPPSL 291 A+ H+ AL L + NLGTG G +V E+++ + RR GD + Sbjct: 243 ADGHVAALGKLRGAAEFGVYNLGTGKGSSVLELVTAFEKALGVKIKKKVVPRRNGDVAFV 302 Query: 292 VADNKKAKKILGWNPKYKLRDIIESAWNWHLKYP 325 A+ KA++ LGW KY L+++++ W W + P Sbjct: 303 YANPSKAQRELGWKAKYGLQEMLKDLWRWQKQNP 336 >gnl|CDD|144821 pfam01370, Epimerase, NAD dependent epimerase/dehydratase family. This family of proteins utilize NAD as a cofactor. The proteins in this family use nucleotide-sugar substrates for a variety of chemical reactions. Length = 235 Score = 181 bits (461), Expect = 2e-46 Identities = 76/238 (31%), Positives = 120/238 (50%), Gaps = 16/238 (6%) Query: 6 VLVVGGAGYIGAHTCRVLYERGFLPIVLDNLSSGHAEFVLWG---PLEQVDICDYTNLRA 62 +LV GG G+IG+ R L + G+ IVL + D+ D L Sbjct: 1 ILVTGGTGFIGSALVRRLLQEGYEVIVL-GRRRRSESLNTGRIRFRFHEGDLTDPDALER 59 Query: 63 VFAKYQPASVMHFAGLTNISESVKNPSLFYEINIKGSFNLIATAIESNVRRFIFSSTCAT 122 + A+ QP +V+H A + + S ++P+ F N+ G+ NL+ A + V+RF+F+S+ Sbjct: 60 LLAEVQPDAVIHLAAQSGVGASFEDPAEFIRANVLGTLNLLEAARRAGVKRFVFASSSEV 119 Query: 123 YGIPHNTIITENDPQESITPYGYTKYVVERELLQHNKVNGLRSVVLRYFNAAGATLDSII 182 YG + ITE+ P ++PY K ER +L + + GLR+V+LR FN G Sbjct: 120 YGDVADPPITEDTPLGPLSPYAAAKLAGERLVLAYARAYGLRAVILRLFNVYGP------ 173 Query: 183 GEWHNPETHVIPLAIKTAMGYQNSFKVFGQDYATRDGTCLRDYIHVLDLANAHIMALE 240 G+ THVIP I+ + + + G DGT RD+++V D+A A ++ALE Sbjct: 174 GDNDTFVTHVIPALIRRILEGKPEILLLG------DGTQRRDFLYVDDVARAILLALE 225 >gnl|CDD|30800 COG0451, WcaG, Nucleoside-diphosphate-sugar epimerases [Cell envelope biogenesis, outer membrane / Carbohydrate transport and metabolism]. Length = 314 Score = 174 bits (442), Expect = 2e-44 Identities = 93/331 (28%), Positives = 153/331 (46%), Gaps = 24/331 (7%) Query: 4 KNVLVVGGAGYIGAHTCRVLYERGFLPIVLDNLSSGHAEFVLWGPLEQVDICDYTNLRAV 63 +LV GGAG+IG+H L G LD L G + +D+ D + + Sbjct: 1 MRILVTGGAGFIGSHLVERLLAAGHDVRGLDRLRDGLDPLLSGVEFVVLDLTDRDLVDEL 60 Query: 64 FAKYQPASVMHFAGLTNISESVK-NPSLFYEINIKGSFNLIATAIESNVRRFIFSSTCAT 122 A V+H A +++ +S +P+ F ++N+ G+ NL+ A + V+RF+F+S+ + Sbjct: 61 AKGVPDA-VIHLAAQSSVPDSNASDPAEFLDVNVDGTLNLLEAARAAGVKRFVFASSVSV 119 Query: 123 -YGIPHNTIITEND-PQESITPYGYTKYVVERELLQHNKVNGLRSVVLRYFNAAGATLDS 180 YG P I E+ P + PYG +K E+ L + ++ GL V+LR FN G Sbjct: 120 VYGDPPPLPIDEDLGPPRPLNPYGVSKLAAEQLLRAYARLYGLPVVILRPFNVYGP---- 175 Query: 181 IIGEWHNPETHVIPLAIKTAMGYQNSFKVFGQDYATRDGTCLRDYIHVLDLANAHIMALE 240 G+ + + V+ I+ + + + G DG+ RD+++V D+A+A ++ALE Sbjct: 176 --GDKPDLSSGVVSAFIRQLLKGEPIIVIGG------DGSQTRDFVYVDDVADALLLALE 227 Query: 241 YLINQGDSIAINLGTGTG-ITVKEIISTIQSMYECAFPITY---ESRRIGDPPSLVADNK 296 D N+G+GT ITV+E+ + P+ RR + D Sbjct: 228 N----PDGGVFNIGSGTAEITVRELAEAVAEAVGSKAPLIVYIPLGRRGDLREGKLLDIS 283 Query: 297 KAKKILGWNPKYKLRDIIESAWNWHLKYPRS 327 KA+ LGW PK L + + W LK Sbjct: 284 KARAALGWEPKVSLEEGLADTLEWLLKKLEL 314 >gnl|CDD|31285 COG1088, RfbB, dTDP-D-glucose 4,6-dehydratase [Cell envelope biogenesis, outer membrane]. Length = 340 Score = 124 bits (312), Expect = 4e-29 Identities = 89/337 (26%), Positives = 151/337 (44%), Gaps = 38/337 (11%) Query: 4 KNVLVVGGAGYIGAHTCRVLYERG--FLPIVLDNLS-SGHAEFV--LWGP----LEQVDI 54 +LV GGAG+IG++ R + + + LD L+ +G+ E + + Q DI Sbjct: 1 MKILVTGGAGFIGSNFVRYILNKHPDDHVVNLDKLTYAGNLENLADVEDSPRYRFVQGDI 60 Query: 55 CDYTNLRAVFAKYQPASVMHFAGLTNISESVKNPSLFYEINIKGSFNLIATAIESNVR-R 113 CD + +F +YQP +V+HFA +++ S+ P+ F + N+ G++ L+ A + + R Sbjct: 61 CDRELVDRLFKEYQPDAVVHFAAESHVDRSIDGPAPFIQTNVVGTYTLLEAARKYWGKFR 120 Query: 114 FIFSSTCATYG-IPHN-TIITENDPQESITPYGYTKYVVERELLQHNKVNGLRSVVLRYF 171 F ST YG + + TE P +PY +K + + + + GL + + R Sbjct: 121 FHHISTDEVYGDLGLDDDAFTETTPYNPSSPYSASKAASDLLVRAYVRTYGLPATITRCS 180 Query: 172 NAAGATLDSIIGEWHNPETHVIPLAIKTAMGYQNSFKVFGQDYATRDGTCLRDYIHVLDL 231 N G + PE +IPL I A+ V+G DG +RD+++V D Sbjct: 181 NNYGP--------YQFPEK-LIPLMIINAL-LGKPLPVYG------DGLQIRDWLYVED- 223 Query: 232 ANAHIMALEYLINQGDSIAI-NLGTGTGITVKEIISTIQSMYE-----CAFPITYESRRI 285 H A++ ++ +G N+G G T E++ TI + IT+ R Sbjct: 224 ---HCRAIDLVLTKGKIGETYNIGGGNERTNLEVVKTICELLGKDKPDYRDLITFVEDRP 280 Query: 286 GDPPSLVADNKKAKKILGWNPKYKLRDIIESAWNWHL 322 G D K K+ LGW P+ + +W+L Sbjct: 281 GHDRRYAIDASKIKRELGWRPQETFETGLRKTVDWYL 317 >gnl|CDD|36642 KOG1429, KOG1429, KOG1429, dTDP-glucose 4-6-dehydratase/UDP-glucuronic acid decarboxylase [Carbohydrate transport and metabolism, Cell wall/membrane/envelope biogenesis]. Length = 350 Score = 101 bits (253), Expect = 3e-22 Identities = 78/336 (23%), Positives = 123/336 (36%), Gaps = 49/336 (14%) Query: 2 ENKNVLVVGGAGYIGAHTCRVLYERGFLPIVLDNLSSGHAE-----FVLW-GPLEQVDIC 55 +N +L+ GGAG+IG+H L G I LDN +G E L + D+ Sbjct: 26 QNLRILITGGAGFIGSHLVDKLMTEGHEVIALDNYFTGRKENLEHWIGHPNFELIRHDVV 85 Query: 56 DYTNLRA--VFAKYQPASVMHFAGLTNISESVKNPSLFYEINIKGSFNLIATAIESNVRR 113 + ++ PAS H+ NP + N+ G+ N++ A R Sbjct: 86 EPLLKEVDQIYHLAAPASPPHY---------KYNPVKTIKTNVIGTLNMLGLAKRVG-AR 135 Query: 114 FIFSSTCATYGIPHNTIITEN-----DPQESITPYGYTKYVVERELLQHNKVNGLRSVVL 168 F+ +ST YG P E +P + Y K V E ++K G+ + Sbjct: 136 FLLASTSEVYGDPLVHPQVETYWGNVNPIGPRSCYDEGKRVAETLCYAYHKQEGIEVRIA 195 Query: 169 RYFNAAGATLDSIIGEWHNPETHVIPLAIKTAMGYQ----NSFKVFGQDYATRDGTCLRD 224 R FN + P H+ + + Q V+G DG R Sbjct: 196 RIFN------------TYGPRMHMDDGRVVSNFIAQALRGEPLTVYG------DGKQTRS 237 Query: 225 YIHVLDLANAHIMALEYLINQGDSIAINLGTGTGITVKEIISTIQSMYECAFPITYESRR 284 + +V DL L L+ +N+G T+ E+ ++ + I + Sbjct: 238 FQYVSDLVE----GLLRLMESDYRGPVNIGNPGEFTMLELAEMVKELIGPVSEIEFVENG 293 Query: 285 IGDPPSLVADNKKAKKILGWNPKYKLRDIIESAWNW 320 DP D KAK+ LGW PK LR+ + + Sbjct: 294 PDDPRKRKPDITKAKEQLGWEPKVSLREGLPLTVTY 329 >gnl|CDD|35966 KOG0747, KOG0747, KOG0747, Putative NAD+-dependent epimerases [Carbohydrate transport and metabolism]. Length = 331 Score = 98.1 bits (244), Expect = 3e-21 Identities = 89/349 (25%), Positives = 144/349 (41%), Gaps = 44/349 (12%) Query: 2 ENKNVLVVGGAGYIGAHTCRVLYER--GFLPIVLDNLSSGHAEFVLWGPLE-------QV 52 + KNVL+ GGAG+IG++ L ++ + + LD L L + Sbjct: 5 KEKNVLITGGAGFIGSNFINYLVDKYPDYKFVNLDKLDYCSNLKNLEPVRNSPNYKFVEG 64 Query: 53 DICDYTNLRAVFAKYQPASVMHFAGLTNISESVKNPSLFYEINIKGSFNLIATAIES-NV 111 DI D + +F + +V+HFA T++ S + F + NI + L+ S N+ Sbjct: 65 DIADADLVLYLFETEEIDTVIHFAAQTHVDRSFGDSFEFTKNNILSTHVLLEAVRVSGNI 124 Query: 112 RRFIFSSTCATYG-IPHNTIITENDPQESITPYGYTKYVVERELLQHNKVNGLRSVVLRY 170 RRF+ ST YG + ++ E PY +K E + + + GL V R Sbjct: 125 RRFVHVSTDEVYGDSDEDAVVGEASLLNPTNPYAASKAAAEMLVRSYGRSYGLPVVTTRM 184 Query: 171 FNAAGATLDSIIGEWHNPETHVIPLAIKTAMGYQNSFKVFGQDYATR-DGTCLRDYIHVL 229 N G ++ +IP IK AM G++Y DG R Y++V Sbjct: 185 NNVYGP------NQYP---EKLIPKFIKLAMR--------GKEYPIHGDGLQTRSYLYVE 227 Query: 230 DLANAHIMALEYLINQGDSIAI-NLGTGTGITVKEIISTIQSMYE--------CAFPITY 280 D++ A LE +G+ I N+GT + V ++ I ++E F Sbjct: 228 DVSEAFKAVLE----KGELGEIYNIGTDDEMRVIDLAKDICELFEKRLPNIDTEPFIFFV 283 Query: 281 ESRRIGDPPSLVADNKKAKKILGWNPKYKLRDIIESAWNWHLKYPRSLS 329 E R D + D++K KK LGW P + + W+ K + +S Sbjct: 284 EDRPYNDLRYFL-DDEKIKK-LGWRPTTPWEEGLRKTIEWYTKNFKDVS 330 >gnl|CDD|36643 KOG1430, KOG1430, KOG1430, C-3 sterol dehydrogenase/3-beta-hydroxysteroid dehydrogenase and related dehydrogenases [Lipid transport and metabolism, Amino acid transport and metabolism]. Length = 361 Score = 93.5 bits (232), Expect = 7e-20 Identities = 77/368 (20%), Positives = 137/368 (37%), Gaps = 59/368 (16%) Query: 1 MENKNVLVVGGAGYIGAHTCRVLYERGFLP--IVLD----NLSSGHAEFVLWGPLEQVDI 54 + +VLV GG+G++G H + L E V+D + V + Sbjct: 2 EKKLSVLVTGGSGFLGQHLVQALLENELKLEIRVVDKTPTQSNLPAELTGFRSGRVTVIL 61 Query: 55 CDYTNLRAVFAKYQPASVMHFAGLTNISESVKNPSLFYEINIKGSFNLIATAIESNVRRF 114 D + ++ +Q A V+H A + L +N+ G+ N+I E V+R Sbjct: 62 GDLLDANSISNAFQGAVVVHCAASPVPDFVENDRDLAMRVNVNGTLNVIEACKELGVKRL 121 Query: 115 IFSSTCATYGIPHNTIITEND---PQESITPYGYTKYVVERELLQHNKVNGLRSVVLRYF 171 I++S+ I + P + I PYG +K + E+ +L+ N + L + LR Sbjct: 122 IYTSSAYVVFGGEPIINGDESLPYPLKHIDPYGESKALAEKLVLEANGSDDLYTCALR-- 179 Query: 172 NAAGATLDSIIGEWHNPETHVIPLAIKTAMGYQNSFKVFGQDYATRDGTCLRDYIHVLDL 231 I G + ++P ++ FK DG L D+ + ++ Sbjct: 180 ------PPGIYGP---GDKRLLPKIVEALKNGGFLFK-------IGDGENLNDFTYGENV 223 Query: 232 ANAHIMALEYLINQGDSIAI---NLGTGTGITVKEIISTI---------QSMYECAF--- 276 A AHI+A L+++ S+ + T + + +S + S+ F Sbjct: 224 AWAHILAARALLDKSPSVNGQFYFITDDTPVRFFDFLSPLVKALGYCLPSSIKLPLFLSY 283 Query: 277 ------PITYESRRIGDPP------SLVA-----DNKKAKKILGWNPKYKLRDIIESAWN 319 I Y R P +L+ +KAK+ LG+ P L + I+ + Sbjct: 284 FLAYLLEIVYFLLRPYQPILTRFRVALLGVTRTFSIEKAKRELGYKPLVSLEEAIQRTIH 343 Query: 320 WHLKYPRS 327 W S Sbjct: 344 WVASESDS 351 >gnl|CDD|36644 KOG1431, KOG1431, KOG1431, GDP-L-fucose synthetase [Carbohydrate transport and metabolism, Posttranslational modification, protein turnover, chaperones]. Length = 315 Score = 86.6 bits (214), Expect = 1e-17 Identities = 72/334 (21%), Positives = 136/334 (40%), Gaps = 41/334 (12%) Query: 4 KNVLVVGGAGYIGAHTCRVLYERGFLPIVLDNLSSGHAEFVLWGPLEQVDICDYTNLRAV 63 K +LV GG G +G+ +V+ E+GF + G + D+ + + RA+ Sbjct: 2 KKILVTGGTGLVGSAIVKVVQEQGFDDENW--VFIGSKD---------ADLTNLADTRAL 50 Query: 64 FAKYQPASVMHFAGLTN-ISESVKNPSLFYEINIKGSFNLIATAIESNVRRFIFS-STC- 120 F +P V+H A + + + F N++ + N++ +A E V++ + STC Sbjct: 51 FESEKPTHVIHLAAMVGGLFHNNTYNLDFIRKNLQINDNVLHSAHEHGVKKVVSCLSTCI 110 Query: 121 ----ATYGIPHNTIITENDPQESITPYGYTKYVVERELLQHNKVNGLRSVVLRYFNAAGA 176 +Y I + P S Y Y K +++ + + + +G + N G Sbjct: 111 FPDKTSYPIDETMVHN-GPPHPSNFGYSYAKRMIDVQNQAYRQQHGRDYTSVIPTNVFGP 169 Query: 177 TLDSIIGEWHNPE-THVIPLAIKTAMGYQNS----FKVFGQDYATRDGTCLRDYIHVLDL 231 + NPE +HV+P I + + V+G G+ LR +I+ DL Sbjct: 170 H------DNFNPENSHVLPALIHRFHEAKRNGTDELTVWGS------GSPLRQFIYSDDL 217 Query: 232 ANAHIMAL-EYLINQGDSIAINLGTGTGITVKEIISTIQSMYECAFPITYESRRIGDPPS 290 A+ I L EY + I +++G +T++E + + + +++ + Sbjct: 218 ADLFIWVLREY--EGVEPIILSVGESDEVTIREAAEAVVEAVDFTGKLVWDTTKSDGQFK 275 Query: 291 LVADNKKAKKILGWNPKY-KLRDIIESAWNWHLK 323 A N K + L + K+ L I W+L Sbjct: 276 KTASNSKLRS-LLPDFKFTPLEQAISETVQWYLD 308 >gnl|CDD|36715 KOG1502, KOG1502, KOG1502, Flavonol reductase/cinnamoyl-CoA reductase [Defense mechanisms]. Length = 327 Score = 86.5 bits (214), Expect = 1e-17 Identities = 64/342 (18%), Positives = 125/342 (36%), Gaps = 60/342 (17%) Query: 1 MENKNVLVVGGAGYIGAHTCRVLYERGFLPIVL-----DNLSSGH-AEFVLWGPLEQV-- 52 E K V V G +G+IG+ ++L RG+ D + H + ++ Sbjct: 4 DEGKKVCVTGASGFIGSWIVKLLLSRGYTVRGTVRDPEDEKKTEHLRKLEGAKERLKLFK 63 Query: 53 -DICDYTNLRAVFAK----YQPASVMHFAGLTNISESVKNPSLFYEINIKGSFNLIATAI 107 D+ D + + AS + F + + +KG+ N++ Sbjct: 64 ADLLDEGSFDKAIDGCDGVFHTASPVDF-------DLEDPEKELIDPAVKGTKNVLEACK 116 Query: 108 ESN-VRRFIFSSTCATYG-----IPHNTIITE---NDPQ---ESITPYGYTKYVVERELL 155 ++ V+R +++S+ A I N+++ E +D Y +K + E+ Sbjct: 117 KTKSVKRVVYTSSTAAVRYNGPNIGENSVVDEESWSDLDFCRCKKLWYALSKTLAEKAAW 176 Query: 156 QHNKVNGLRSVVLRYFNAAGATLDSIIGEWHNPETHVIPLA--IKTAMGYQNSFKVFGQD 213 + K NGL V + G L + + + L IK +F + Sbjct: 177 EFAKENGLDLVTINPGLVFGPGLQP------SLNSSLNALLKLIKGLAETYPNFWLA--- 227 Query: 214 YATRDGTCLRDYIHVLDLANAHIMALEYLINQGDSIAINLGTGTGITVKEIISTIQSMY- 272 ++ V D+A AH++ALE +G I G +++KEI ++ ++ Sbjct: 228 -----------FVDVRDVALAHVLALEKPSAKGRYIC----VGEVVSIKEIADILRELFP 272 Query: 273 ECAFPITYESRRIGDPPSLVADNKKAKKILGWNPKYKLRDII 314 + P G S ++K K + G+ L + + Sbjct: 273 DYPIPKKNAEEHEGFLTSFKVSSEKLKSLGGFK-FRPLEETL 313 >gnl|CDD|31286 COG1089, Gmd, GDP-D-mannose dehydratase [Cell envelope biogenesis, outer membrane]. Length = 345 Score = 73.4 bits (180), Expect = 8e-14 Identities = 71/363 (19%), Positives = 128/363 (35%), Gaps = 64/363 (17%) Query: 2 ENKNVLVVGGAGYIGAHTCRVLYERGFLPIVLDNLSSGHAEFVLWGP------------- 48 K L+ G G G++ +L E+G+ + + + F Sbjct: 1 MGKVALITGITGQDGSYLAELLLEKGY---EVHGIKRRSSSFNTPRIHLYEDPHLNDPRL 57 Query: 49 -LEQVDICDYTNLRAVFAKYQPASVMHFAGLTNISESVKNPSLFYEINIKGSFNLI-ATA 106 L D+ D +NL + + QP + + A +++ S + P +++ G+ L+ A Sbjct: 58 HLHYGDLTDSSNLLRILEEVQPDEIYNLAAQSHVGVSFEQPEYTADVDAIGTLRLLEAIR 117 Query: 107 IESNVR-RFIFSSTCATYGIPHNTIITENDPQESITPYGYTKYVVERELLQHNKVNGLRS 165 I + RF +ST YG+ E P +PY K + + + GL + Sbjct: 118 ILGEKKTRFYQASTSELYGLVQEIPQKETTPFYPRSPYAVAKLYAYWITVNYRESYGLFA 177 Query: 166 VVLRYFNAAGATLDSIIGEWHNPETHV---IPLAI-KTAMGYQNSFKVFGQDYATRDGTC 221 FN ET V I A+ + +G Q+ + G A RD Sbjct: 178 CNGILFNHESPL---------RGETFVTRKITRAVARIKLGLQDKLYL-GNLDAKRDWGH 227 Query: 222 LRDYIHVLDLANAHIMALEYLINQGDSIAINLGTGTGITVKEIISTIQSM--YECAFPIT 279 +DY+ + L ++ Q + + TG +V+E + M + + T Sbjct: 228 AKDYVEAMWL----------MLQQEEPDDYVIATGETHSVREFVELAFEMVGIDLEWEGT 277 Query: 280 YESRRIGDP-------------------PSLVADNKKAKKILGWNPKYKLRDIIESAWNW 320 + D L+ D KAK+ LGW P+ L +++ Sbjct: 278 GVDEKGVDAKTGKIIVEIDPRYFRPAEVDLLLGDPTKAKEKLGWRPEVSLEELVREMVEA 337 Query: 321 HLK 323 L+ Sbjct: 338 DLE 340 >gnl|CDD|31288 COG1091, RfbD, dTDP-4-dehydrorhamnose reductase [Cell envelope biogenesis, outer membrane]. Length = 281 Score = 64.1 bits (156), Expect = 5e-11 Identities = 37/165 (22%), Positives = 63/165 (38%), Gaps = 20/165 (12%) Query: 5 NVLVVGGAGYIGAHTCRVLYERGFLPIVLDNLSSGHAEFVLWGPLEQVDICDYTNLRAVF 64 +L+ G G +G R LP + +++ AE +DI D + V Sbjct: 2 KILITGANGQLGTELRR------ALPGEFEVIATDRAE---------LDITDPDAVLEVI 46 Query: 65 AKYQPASVMHFAGLTNISESVKNPSLFYEINIKGSFNLIATAIESNVRRFIFSSTCATYG 124 + +P V++ A T + ++ P L + +N G+ NL A E R + ST + Sbjct: 47 RETRPDVVINAAAYTAVDKAESEPELAFAVNATGAENLARAAAEVGA-RLVHISTDYVFD 105 Query: 125 IPHNTIITENDPQESITPYGYTKYVVERELLQHNKVNGLRSVVLR 169 E D + YG +K E + R ++LR Sbjct: 106 GEKGGPYKETDTPNPLNVYGRSKLAGEEAVRAAGP----RHLILR 146 >gnl|CDD|144605 pfam01073, 3Beta_HSD, 3-beta hydroxysteroid dehydrogenase/isomerase family. The enzyme 3 beta-hydroxysteroid dehydrogenase/5-ene-4-ene isomerase (3 beta-HSD) catalyses the oxidation and isomerisation of 5-ene-3 beta-hydroxypregnene and 5-ene-hydroxyandrostene steroid precursors into the corresponding 4-ene-ketosteroids necessary for the formation of all classes of steroid hormones. Length = 280 Score = 63.5 bits (155), Expect = 9e-11 Identities = 63/266 (23%), Positives = 106/266 (39%), Gaps = 46/266 (17%) Query: 7 LVVGGAGYIGAHTCRVLYERGFLPIV----LDNLSSGHAEFVLWGPLE-----QVDICDY 57 LV GG G++G H R+L G L V L +F L+ + D+ D Sbjct: 1 LVTGGGGFLGRHIVRLLLREGELQEVRVFDLRFSPELLEDF---SKLQVITYIEGDVTDK 57 Query: 58 TNLRAVFAKYQPAS-VMHFAGLTNISESVKNPSLFYEINIKGSFNLIATAIESNVRRFIF 116 +LR Q + V+H A + ++ ++N+KG+ N++ +++ VR ++ Sbjct: 58 QDLRRAL---QGSDVVIHTAAIIDVF-GKAYRDTIMKVNVKGTQNVLDACVKAGVRVLVY 113 Query: 117 SSTC-ATYGIPHNTII---TENDPQESI--TPYGYTKYVVERELLQHN---KVNG--LRS 165 +S+ + I E P ES PY +K + E+ +L+ N NG L + Sbjct: 114 TSSMEVVGPNSYGQPIVNGDETTPYESTHQDPYPESKALAEKLVLKANGSTLKNGGRLYT 173 Query: 166 VVLRYFNAAGATLDSIIGEWHNPETHVIPLAIKTAMGYQNSFKVFGQDYATRDGTCLRDY 225 LR I GE +P K + T D L D Sbjct: 174 CALR--------PAGIFGE-GDPFLFP---------FLVRLLKNGLAKFRTGDKNVLSDR 215 Query: 226 IHVLDLANAHIMALEYLINQGDSIAI 251 ++V ++A AHI+A L + + +I Sbjct: 216 VYVGNVAWAHILAARALQDPKKASSI 241 >gnl|CDD|31283 COG1086, COG1086, Predicted nucleoside-diphosphate sugar epimerases [Cell envelope biogenesis, outer membrane / Carbohydrate transport and metabolism]. Length = 588 Score = 60.3 bits (146), Expect = 7e-10 Identities = 45/197 (22%), Positives = 73/197 (37%), Gaps = 47/197 (23%) Query: 2 ENKNVLVVGGAGYIGAHTCRVLYERGFLPIVL-----DNLSSGHAEFVLWGPLEQV---- 52 K VLV GG G IG+ CR + + I+L L E P ++ Sbjct: 249 TGKTVLVTGGGGSIGSELCRQILKFNPKEIILFSRDEYKLYLIDMELREKFPELKLRFYI 308 Query: 53 -DICDYTNLRAVFAKYQPASVMHFAGLT-------NISESVKNPSLFYEINIKGSFNLIA 104 D+ D + ++ V H A L N E++K N+ G+ N+ Sbjct: 309 GDVRDRDRVERAMEGHKVDIVFHAAALKHVPLVEYNPEEAIKT-------NVLGTENVAE 361 Query: 105 TAIESNVRRFIFSSTCATYGIPHNTIITENDPQESITP---YGYTKYVVERELLQ---HN 158 AI++ V++F+ ST +++ P G TK + E+ + Sbjct: 362 AAIKNGVKKFVLIST-----------------DKAVNPTNVMGATKRLAEKLFQAANRNV 404 Query: 159 KVNGLRSVVLRYFNAAG 175 G R V+R+ N G Sbjct: 405 SGTGTRFCVVRFGNVLG 421 >gnl|CDD|37985 KOG2774, KOG2774, KOG2774, NAD dependent epimerase [General function prediction only]. Length = 366 Score = 56.6 bits (136), Expect = 1e-08 Identities = 74/330 (22%), Positives = 122/330 (36%), Gaps = 52/330 (15%) Query: 6 VLVVGGAGYIGAHTCRVL-YERGFLPIVLDNLSSGHAEFVLWGPLEQVDICDYTNLRAVF 64 VL+ G G +G +L Y G ++L ++ A GP +DI D +L + Sbjct: 47 VLITGSLGQLGRGLASLLRYMYGSECVILSDIVKPPANVTDVGPYIYLDILDQKSLEEIV 106 Query: 65 AKYQPASVMHFAG-LTNISESVKNPSLFYEINIKGSFNLIATAIESNVRRFIFSSTCATY 123 + ++HF+ L+ + E+ N L ++NI+G N++ A + ++ F+ ST + Sbjct: 107 VNKRIDWLVHFSALLSAVGET--NVPLALQVNIRGVHNILQVAAKHKLKVFV-PSTIGAF 163 Query: 124 GIPHNTIITENDP----QESITPYGYTKYVVER--ELLQHNKVNGLRSV----VLRYFNA 173 G T P Q T YG +K E E H RS+ ++ Sbjct: 164 G---PTSPRNPTPDLTIQRPRTIYGVSKVHAELLGEYFNHRFGVDFRSMRFPGIISATKP 220 Query: 174 AGATLDSIIGEWH---NPETHVIPLAIKT--AMGYQNSFKVFGQDYATRDGTCLRDYIHV 228 G T D I ++ H L T M Y D C+ I + Sbjct: 221 GGGTTDYAIAIFYDALQKGKHTCYLRPDTRLPMMY--------------DTDCMASVIQL 266 Query: 229 LDLANAHIMALEYLINQGDSIAINLGTGTGITVKEIISTIQSMYECAFPITYE--SRR-I 285 L + + Y + TG T +EI I+ + F I Y+ +R+ I Sbjct: 267 LAADSQSLKRRTYNV-----------TGFSFTPEEIADAIRRVMP-GFEIDYDICTRQSI 314 Query: 286 GDPPSLVADNKKAKKILGWNPKYKLRDIIE 315 D + D+ +A+ L II Sbjct: 315 ADSWPMSLDDSEARTEWHEKHSLHLLSIIS 344 >gnl|CDD|145720 pfam02719, Polysacc_synt_2, Polysaccharide biosynthesis protein. This is a family of diverse bacterial polysaccharide biosynthesis proteins including the CapD protein, WalL protein mannosyl-transferase and several putative epimerases (e.g. WbiI). Length = 280 Score = 54.4 bits (132), Expect = 4e-08 Identities = 35/133 (26%), Positives = 52/133 (39%), Gaps = 28/133 (21%) Query: 6 VLVVGGAGYIGAHTCRVLYERGFLP---IVLD----NLSSGHAEFVLWGPLEQV-----D 53 VLV GG G IG+ CR + + P I+ L E ++ D Sbjct: 1 VLVTGGGGSIGSELCRQILKFN--PKKIILFSRDEFKLYEIEQELREEYNDPKLRFFIGD 58 Query: 54 ICDYTNLRAVFAKYQPASVMHFAGLT-------NISESVKNPSLFYEINIKGSFNLIATA 106 + D L +Y +V H A L N E++K N+ G+ N+ A Sbjct: 59 VRDRERLERAMEEYGVDTVFHAAALKHVPLVEYNPMEAIKT-------NVLGTENVAEAA 111 Query: 107 IESNVRRFIFSST 119 IE+ V +F+ ST Sbjct: 112 IENGVEKFVLIST 124 >gnl|CDD|113104 pfam04321, RmlD_sub_bind, RmlD substrate binding domain. L-rhamnose is a saccharide required for the virulence of some bacteria. Its precursor, dTDP-L-rhamnose, is synthesized by four different enzymes the final one of which is RmlD. The RmlD substrate binding domain is responsible for binding a sugar nucleotide. Length = 284 Score = 46.8 bits (112), Expect = 7e-06 Identities = 44/168 (26%), Positives = 68/168 (40%), Gaps = 27/168 (16%) Query: 6 VLVVGGAGYIGAHTCRVLYERGFLPIVLDNLSSGHAEFVLWGPLEQVDICDYTNLRAVFA 65 +LV G G +G R+L ERG + LD ++D+ D + A+ Sbjct: 1 ILVTGANGQLGRELTRLLAERGVEVVALDR--------------PELDLTDPEAVAALVR 46 Query: 66 KYQPASVMHFAGLTNISESVKNPSLFYEINIKGSFNLIATAIESNVRRFIFSST----CA 121 + +P V++ A T + ++ P L Y +N G NL I ST Sbjct: 47 EARPDVVVNAAAYTAVDKAESEPELAYAVNALGPGNLAEACAARGA-PLIHISTDYVFDG 105 Query: 122 TYGIPHNTIITENDPQESITPYGYTKYVVERELLQHNKVNGLRSVVLR 169 G P+ E+DP + YG TK E+ +L N R ++LR Sbjct: 106 AKGGPY----REDDPTGPLNVYGRTKLAGEQAVLAANP----RHLILR 145 >gnl|CDD|143883 pfam00106, adh_short, short chain dehydrogenase. This family contains a wide variety of dehydrogenases. Length = 167 Score = 46.4 bits (111), Expect = 1e-05 Identities = 39/146 (26%), Positives = 58/146 (39%), Gaps = 24/146 (16%) Query: 4 KNVLVVGGAGYIGAHTCRVLYERGFLPIVLD---NLSSGHAEFV--LWGPLEQVDI--CD 56 VL+ GG G +G R L G +VL + G AE V L +V + CD Sbjct: 1 GTVLITGGTGGLGLALARWLAAEGARHLVLVSRRGDAPGAAELVAELEALGAEVTVAACD 60 Query: 57 YTN-------LRAVFAKYQP-ASVMHFAGL---TNISESVKNPSLF---YEINIKGSFNL 102 + L A+ A P V+H AG+ + E P F + G++NL Sbjct: 61 VADRDALAALLAALPAALGPLDGVVHNAGVLDDGPLEEL--TPERFERVLAPKVTGAWNL 118 Query: 103 IATAIESNVRRFI-FSSTCATYGIPH 127 ++ ++ F+ FSS G P Sbjct: 119 HELTLDLDLGAFVLFSSVAGVLGSPG 144 >gnl|CDD|39489 KOG4288, KOG4288, KOG4288, Predicted oxidoreductase [General function prediction only]. Length = 283 Score = 38.1 bits (88), Expect = 0.003 Identities = 39/176 (22%), Positives = 61/176 (34%), Gaps = 34/176 (19%) Query: 1 MENKNVLVVGGAGYIGAHTCRVLYERGF-LPIVLDN----LSSGHAEFVLWGPLEQVDIC 55 +E + LV+GG + G+ + + I+ +N S +V W Sbjct: 50 VEVEWTLVLGGNPFSGSEVLKNATNVVHSVGILSENENKQTLSSWPTYVSWHRGNSF--- 106 Query: 56 DYTNLRAVFAKYQPASVMHFAGLTNISESVKNPSLFYEINIKGSFNLIATAIESNVRRFI 115 +N + P V G N L IN + N + A ++ V RF+ Sbjct: 107 -SSNPNKL-KLSGPTFVYEMMG------GFGNIILMDRINGTANINAVKAAAKAGVPRFV 158 Query: 116 FSSTCATYGIPHNTIITENDPQESITPYGY--TKYVVERELLQHNKVNGLRSVVLR 169 + S +G+P P GY K E ELL K R ++LR Sbjct: 159 YISA-HDFGLPPL------------IPRGYIEGKREAEAELL---KKFRFRGIILR 198 >gnl|CDD|31231 COG1028, FabG, Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) [Secondary metabolites biosynthesis, transport, and catabolism / General function prediction only]. Length = 251 Score = 37.5 bits (86), Expect = 0.005 Identities = 41/265 (15%), Positives = 67/265 (25%), Gaps = 62/265 (23%) Query: 1 MENKNVLVVGGAGYIGAHTCRVLYERGFLPIVLDNLSSGHAEFVLWGPLEQVDI------ 54 + K LV G + IG R L G +V S A L +++ Sbjct: 3 LSGKVALVTGASSGIGRAIARALAREGARVVVAARRSEEEAAEALAAAIKEAGGGRAAAV 62 Query: 55 -CDYTNLRAVFAKYQPASVMHF---------AGLTNISESVKNPSL-----FYEINIKGS 99 D ++ A+ F AG+ ++ + ++N+ G+ Sbjct: 63 AADVSDDEESVEALVAAAEEEFGRIDILVNNAGIAGPDAPLEELTEEDWDRVIDVNLLGA 122 Query: 100 FNLIATAIESNVRRFIF--SSTCATYGIPHNTIITENDPQESITPYGYTKYVVE---REL 154 F L A+ ++ I SS G P Y +K + + L Sbjct: 123 FLLTRAALPLMKKQRIVNISSVAGLGGPPGQ------------AAYAASKAALIGLTKAL 170 Query: 155 LQHNKVNGLRSVVLRYFNAAGATLDSIIGEWHNPETHVIPLAIKTAMGYQNSFKVFGQDY 214 G+R V P I T M Sbjct: 171 ALELAPRGIRVNA------------------------VAPGYIDTPMTAALESAELEALK 206 Query: 215 ATRDGTCLRDYIHVLDLANAHIMAL 239 L ++A A Sbjct: 207 RLAARIPLGRLGTPEEVAAAVAFLA 231 >gnl|CDD|39371 KOG4169, KOG4169, KOG4169, 15-hydroxyprostaglandin dehydrogenase and related dehydrogenases [Lipid transport and metabolism, General function prediction only]. Length = 261 Score = 36.4 bits (84), Expect = 0.012 Identities = 30/113 (26%), Positives = 46/113 (40%), Gaps = 15/113 (13%) Query: 3 NKNVLVVGGAGYIGAHTCRVLYERG-FLPIVLDNLSSGHAEFVLWGPLEQVDI----CDY 57 KN LV GGAG IG T + L E+G + ++ D+ + A L V + CD Sbjct: 5 GKNALVTGGAGGIGLATSKALLEKGIKVLVIDDSEENPEAIAKLQAINPSVSVIFIKCDV 64 Query: 58 TN---LRAVFAKYQPASVMHFAGLTNISESVKNPSLFYEINIKGSFNLIATAI 107 TN L A F K A I + + + + + + N+ T + Sbjct: 65 TNRGDLEAAFDK-------ILATFGTIDILINGAGILDDKDWERTINVNLTGV 110 >gnl|CDD|31046 COG0702, COG0702, Predicted nucleoside-diphosphate-sugar epimerases [Cell envelope biogenesis, outer membrane / Carbohydrate transport and metabolism]. Length = 275 Score = 34.9 bits (79), Expect = 0.034 Identities = 21/115 (18%), Positives = 33/115 (28%), Gaps = 9/115 (7%) Query: 4 KNVLVVGGAGYIGAHTCRVLYERGFLPIVLDNLSSGHAEFVLWGPLEQVDICDYTNLRAV 63 +LV G G++G R L RG A + D+ D +L A Sbjct: 1 MKILVTGATGFVGGAVVRELLARGHEVRAAVRNPEAAAALAGGVEVVLGDLRDPKSLVAG 60 Query: 64 FAKYQPASVMHFAGLTNISESVKNPSLFYEINIKGSFNLIATAIESNVRRFIFSS 118 G+ IS + F + + A V+ + S Sbjct: 61 AK--------GVDGVLLISGLLDGSDAFRAVQVTAVVRAAEAAGAG-VKHGVSLS 106 >gnl|CDD|31287 COG1090, COG1090, Predicted nucleoside-diphosphate sugar epimerase [General function prediction only]. Length = 297 Score = 34.8 bits (80), Expect = 0.036 Identities = 37/172 (21%), Positives = 60/172 (34%), Gaps = 21/172 (12%) Query: 6 VLVVGGAGYIGAHTCRVLYERGFLPIVLDNLSSGHAEFVLWGPLEQVDICDYTNLRAVFA 65 +L+ GG G IG L + G +L A L + + + A Sbjct: 1 ILITGGTGLIGRALTARLRKGGHQVTILTR-RPPKASQNLHPNVTLWEGLADALTLGIDA 59 Query: 66 KYQPASVMHFAGLTNISESVKNPSLFYEI---NIKGS---FNLIATAIESNVRRFIFSST 119 V++ AG I+E EI I + LIA + E+ + I +S Sbjct: 60 ------VINLAG-EPIAERRWTEKQKEEIRQSRINTTEKLVELIAAS-ETKPKVLISASA 111 Query: 120 CATYGIPHNTIITENDPQESITPYGYTKYVVER--ELLQHNKVNGLRSVVLR 169 YG + ++TE P + + + E + G R V+LR Sbjct: 112 VGYYGHSGDRVVTEESP----PGDDFLAQLCQDWEEEALQAQQLGTRVVLLR 159 >gnl|CDD|114110 pfam05368, NmrA, NmrA-like family. NmrA is a negative transcriptional regulator involved in the post-translational modification of the transcription factor AreA. NmrA is part of a system controlling nitrogen metabolite repression in fungi. This family only contains a few sequences as iteration results in significant matches to other Rossmann fold families. Length = 232 Score = 32.6 bits (75), Expect = 0.17 Identities = 24/123 (19%), Positives = 39/123 (31%), Gaps = 31/123 (25%) Query: 6 VLVVGGAGYIGAHTCRVLYERGFLPI----------VLDNLSSGHAEFVLWGPLEQVDIC 55 +LV G GY G R + G P+ + +L + E V + D+ Sbjct: 1 ILVFGATGYQGGSVVRASLKAGH-PVRALVRDPKSELAKSLKAAGVELV------EGDLD 53 Query: 56 DYTNLRAVFAKYQPASVMHFAGLTNISESVKNPSLFYEINIKGSFNLIATAIESNVRRFI 115 D+ +L + T + I+ L A E+ V+ FI Sbjct: 54 DHESLVEALKGVDVVFSV-----TG---------FWLSKEIEDGKKLADAAKEAGVKHFI 99 Query: 116 FSS 118 S Sbjct: 100 PSE 102 >gnl|CDD|36417 KOG1203, KOG1203, KOG1203, Predicted dehydrogenase [Carbohydrate transport and metabolism]. Length = 411 Score = 31.5 bits (71), Expect = 0.36 Identities = 11/28 (39%), Positives = 16/28 (57%) Query: 1 MENKNVLVVGGAGYIGAHTCRVLYERGF 28 + VLVVG G +G ++L +RGF Sbjct: 77 KKPTTVLVVGATGKVGRRIVKILLKRGF 104 >gnl|CDD|37958 KOG2747, KOG2747, KOG2747, Histone acetyltransferase (MYST family) [Chromatin structure and dynamics]. Length = 396 Score = 31.2 bits (70), Expect = 0.44 Identities = 22/88 (25%), Positives = 31/88 (35%), Gaps = 17/88 (19%) Query: 239 LEYLINQGDS-IAIN-LGTGTGITVKEIISTIQSM----YECAFPITYESRRIGDPPSLV 292 LE L I+I + TGI +IIST+QS+ Y + I+ S L Sbjct: 319 LELLRKHRGEHISIKEISKETGIRPDDIISTLQSLNMIKYYKGYIISICSDDK-----LE 373 Query: 293 ADNKKAKKILGWNPKYKLRDIIESAWNW 320 + W K R + W Sbjct: 374 DHKRN------WAKFKKPRLLDPDCLLW 395 >gnl|CDD|177093 CHL00194, ycf39, Ycf39; Provisional. Length = 317 Score = 31.1 bits (71), Expect = 0.47 Identities = 10/35 (28%), Positives = 16/35 (45%) Query: 85 VKNPSLFYEINIKGSFNLIATAIESNVRRFIFSST 119 + +I+ G LI A + ++RFIF S Sbjct: 76 PSDLYNAKQIDWDGKLALIEAAKAAKIKRFIFFSI 110 >gnl|CDD|33946 COG4221, COG4221, Short-chain alcohol dehydrogenase of unknown specificity [General function prediction only]. Length = 246 Score = 30.6 bits (69), Expect = 0.69 Identities = 35/176 (19%), Positives = 58/176 (32%), Gaps = 45/176 (25%) Query: 1 MENKNVLVVGGAGYIGAHTCRVLYERGFLPIVL------------DNLSSGHAEFVLWGP 48 ++ K L+ G + IG T R L E G +VL D + +G A + Sbjct: 4 LKGKVALITGASSGIGEATARALAEAGA-KVVLAARREERLEALADEIGAGAALAL---- 58 Query: 49 LEQVDICDY----TNLRAVFAKYQPASVM-HFAGLTNISE----SVKNPSLFYEINIKGS 99 +D+ D + A+ ++ ++ + AGL + + + N+KG Sbjct: 59 --ALDVTDRAAVEAAIEALPEEFGRIDILVNNAGLALGDPLDEADLDDWDRMIDTNVKGL 116 Query: 100 FNLIATAIESNVRR-----FIFSSTCATYGIPHNTIITENDPQESITPYGYTKYVV 150 N + V R S Y P + YG TK V Sbjct: 117 LNGTRAVLPGMVERKSGHIINLGSIAGRYPYPGGAV------------YGATKAAV 160 >gnl|CDD|36415 KOG1201, KOG1201, KOG1201, Hydroxysteroid 17-beta dehydrogenase 11 [Secondary metabolites biosynthesis, transport and catabolism]. Length = 300 Score = 30.2 bits (68), Expect = 0.73 Identities = 33/199 (16%), Positives = 60/199 (30%), Gaps = 43/199 (21%) Query: 2 ENKNVLVVGGAGYIGAHTCRVLYERGFLPIVLDNLSSGHAEFV----LWGPLE--QVDIC 55 + VL+ GG +G +RG ++ D G+ E V G + DI Sbjct: 37 SGEIVLITGGGSGLGRLIALEFAKRGAKLVLWDINKQGNEETVKEIRKIGEAKAYTCDIS 96 Query: 56 DYTNLRAVFAKYQP-----------ASVMHFAGLTNISESVKNPSLFYEINIKGSFNLIA 104 D + + K + A ++ L + S+ + +++N F Sbjct: 97 DREEIYRLAKKVKKEVGDVDILVNNAGIVTGKKLLDCSD--EEIQKTFDVNTIAHFWTTK 154 Query: 105 T---AIESNVRRFI--FSSTCATYGIPHNTIITENDPQESITPYGYTKYVV-------ER 152 + N I +S +G + Y +K+ Sbjct: 155 AFLPKMLENNNGHIVTIASVAGLFGPA------------GLADYCASKFAAVGFHESLSM 202 Query: 153 ELLQHNKVNGLRSVVLRYF 171 EL K ++V YF Sbjct: 203 ELRALGKDGIKTTLVCPYF 221 >gnl|CDD|36419 KOG1205, KOG1205, KOG1205, Predicted dehydrogenase [Secondary metabolites biosynthesis, transport and catabolism]. Length = 282 Score = 29.5 bits (66), Expect = 1.2 Identities = 9/27 (33%), Positives = 13/27 (48%) Query: 1 MENKNVLVVGGAGYIGAHTCRVLYERG 27 + K VL+ G + IG L +RG Sbjct: 10 LAGKVVLITGASSGIGEALAYELAKRG 36 >gnl|CDD|133172 cd05040, PTKc_Ack_like, Catalytic domain of the Protein Tyrosine Kinase, Activated Cdc42-associated kinase. Protein Tyrosine Kinase (PTK) family; Activated Cdc42-associated kinase (Ack) subfamily; catalytic (c) domain. Ack subfamily members include Ack1, thirty-eight-negative kinase 1 (Tnk1), and similar proteins. The PTKc family is part of a larger superfamily that includes the catalytic domains of other kinases such as protein serine/threonine kinases, RIO kinases, and phosphoinositide 3-kinase (PI3K). PTKs catalyze the transfer of the gamma-phosphoryl group from ATP to tyrosine (tyr) residues in protein substrates. Ack subfamily members are cytoplasmic (or nonreceptor) tyr kinases containing an N-terminal catalytic domain, an SH3 domain, a Cdc42-binding CRIB domain, and a proline-rich region. They are mainly expressed in brain and skeletal tissues and are involved in the regulation of cell adhesion and growth, receptor degradation, and axonal guidance. Ack1 is also associated with androgen-independent prostate cancer progression. Tnk1 regulates TNFalpha signaling and may play an important role in cell death. Length = 257 Score = 29.7 bits (67), Expect = 1.3 Identities = 8/17 (47%), Positives = 10/17 (58%) Query: 183 GEWHNPETHVIPLAIKT 199 GEW VIP+A+K Sbjct: 14 GEWSTSGGKVIPVAVKC 30 >gnl|CDD|36422 KOG1208, KOG1208, KOG1208, Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) [Secondary metabolites biosynthesis, transport and catabolism]. Length = 314 Score = 28.8 bits (64), Expect = 2.1 Identities = 11/28 (39%), Positives = 12/28 (42%) Query: 1 MENKNVLVVGGAGYIGAHTCRVLYERGF 28 + K LV G IG T R L RG Sbjct: 33 LSGKVALVTGATSGIGFETARELALRGA 60 >gnl|CDD|31009 COG0665, DadA, Glycine/D-amino acid oxidases (deaminating) [Amino acid transport and metabolism]. Length = 387 Score = 28.7 bits (63), Expect = 2.3 Identities = 13/54 (24%), Positives = 22/54 (40%), Gaps = 6/54 (11%) Query: 1 MENKNVLVVGGAGYIGAHTCRVLYERGFLPIVLD-----NLSSGHAEFVLWGPL 49 +V+++GG G +G L ERG VL+ ++G + P Sbjct: 2 SMKMDVVIIGG-GIVGLSAAYYLAERGADVTVLEAGEAGGGAAGRNAGGILAPW 54 >gnl|CDD|30896 COG0550, TopA, Topoisomerase IA [DNA replication, recombination, and repair]. Length = 570 Score = 28.7 bits (64), Expect = 2.7 Identities = 12/54 (22%), Positives = 24/54 (44%), Gaps = 5/54 (9%) Query: 258 GITVKEIISTIQSMYECAFPITY---ESRRIGDPPSLVADNKKAKKILGWNPKY 308 G + K+ + Q +YE ITY +S R+ +A+ + + + +Y Sbjct: 274 GFSAKKTMDIAQKLYEGHGLITYPRTDSTRL--SEEALAEARLYILAIAYGKEY 325 >gnl|CDD|39552 KOG4351, KOG4351, KOG4351, Uncharacterized conserved protein [Function unknown]. Length = 244 Score = 28.5 bits (63), Expect = 2.9 Identities = 10/37 (27%), Positives = 15/37 (40%) Query: 99 SFNLIATAIESNVRRFIFSSTCATYGIPHNTIITEND 135 SF L I + + SST +P ++ I E Sbjct: 194 SFELSDIPIVQELEQLPVSSTPCDSPVPIDSPIIEKP 230 >gnl|CDD|39164 KOG3961, KOG3961, KOG3961, Uncharacterized conserved protein [Function unknown]. Length = 262 Score = 28.1 bits (62), Expect = 3.6 Identities = 15/38 (39%), Positives = 21/38 (55%), Gaps = 3/38 (7%) Query: 21 RVLYERGFLPIVLDNLSSGHAEFVLWG-PLEQVDICDY 57 R +Y RG LPI L++ S G + W E++D C Y Sbjct: 80 RAIYSRGDLPISLEHGSKGRK--LQWECDPEKLDYCPY 115 >gnl|CDD|31275 COG1077, MreB, Actin-like ATPase involved in cell morphogenesis [Cell division and chromosome partitioning]. Length = 342 Score = 27.9 bits (62), Expect = 3.8 Identities = 16/59 (27%), Positives = 25/59 (42%), Gaps = 7/59 (11%) Query: 249 IAINLGTG-TGITVKEIISTIQSMYECAFPITYESRRIGDPPSLVADNKKAKKILGWNP 306 I I+LGT T + VK + P G ++A ++AK++LG P Sbjct: 9 IGIDLGTANTLVYVKGKGIVLN------EPSVVAIESEGKTKVVLAVGEEAKQMLGRTP 61 >gnl|CDD|176124 cd08433, PBP2_Nac, The C-teminal substrate binding domain of LysR-like nitrogen assimilation control (NAC) protein, contains the type 2 periplasmic binding fold. The NAC is a LysR-type transcription regulator that activates expression of operons such as hut (histidine utilization) and ure (urea utilization), allowing use of non-preferred (poor) nitrogen sources, and represses expression of operons, such as glutamate dehydrogenase (gdh), allowing assimilation of the preferred nitrogen source. The expression of the nac gene is fully dependent on the nitrogen regulatory system (NTR) and the sigma54-containing RNA polymerase (sigma54-RNAP). In response to nitrogen starvation, NTR system activates the expression of nac, and NAC activates the expression of hut, ure, and put (proline utilization). NAC is not involved in the transcription of Sigma70-RNAP operons such as glnA, which directly respond by the NTR system, but activates the transcription of sigma70-RNAP dependent operons such as hut. Hence, NAC allows the coupling of sigma70-RNAP dependent operons to the sigma54-RNAP dependent NTR system. This substrate-binding domain has significant homology to the type 2 periplasmic binding proteins (PBP2), which are responsible for the uptake of a variety of substrates such as phosphate, sulfate, polysaccharides, lysine/arginine/ornithine, and histidine. The PBP2 bind their ligand in the cleft between these domains in a manner resembling a Venus flytrap. After binding their specific ligand with high affinity, they can interact with a cognate membrane transport complex comprised of two integral membrane domains and two cytoplasmically located ATPase domains. This interaction triggers the ligand translocation across the cytoplasmic membrane energized by ATP hydrolysis. Length = 198 Score = 27.9 bits (63), Expect = 3.8 Identities = 11/38 (28%), Positives = 18/38 (47%), Gaps = 6/38 (15%) Query: 152 RELLQHNKV-----NGLRSVVLRYFNAAGATLDSIIGE 184 EL + + +GLR +V AG TL+ ++ E Sbjct: 93 AELARLPLILPSRGHGLRRLVDEAAARAGLTLN-VVVE 129 >gnl|CDD|38353 KOG3143, KOG3143, KOG3143, Imidazoleglycerol-phosphate dehydratase [Amino acid transport and metabolism]. Length = 219 Score = 27.0 bits (59), Expect = 7.1 Identities = 21/60 (35%), Positives = 26/60 (43%), Gaps = 5/60 (8%) Query: 176 ATLDSIIGEWHNPETHVIPL--AIKTAMGYQNSFKVFGQDYATRDGTCLRDYIHVLDLAN 233 AT D I + H E I L A K A+G + K FG A D R V+DL+ Sbjct: 80 ATGDLHIDDHHTNEDVAIALGTAFKKALGERKGVKRFGDFTAPLDEALSRV---VVDLSG 136 >gnl|CDD|133342 cd04142, RRP22, RRP22 subfamily. RRP22 (Ras-related protein on chromosome 22) subfamily consists of proteins that inhibit cell growth and promote caspase-independent cell death. Unlike most Ras proteins, RRP22 is down-regulated in many human tumor cells due to promoter methylation. RRP22 localizes to the nucleolus in a GTP-dependent manner, suggesting a novel function in modulating transport of nucleolar components. Most Ras proteins contain a lipid modification site at the C-terminus, with a typical sequence motif CaaX, where a = an aliphatic amino acid and X = any amino acid. Lipid binding is essential for membrane attachment, a key feature of most Ras proteins. Like most Ras family proteins, RRP22 is farnesylated. Length = 198 Score = 27.1 bits (60), Expect = 7.2 Identities = 14/55 (25%), Positives = 27/55 (49%), Gaps = 13/55 (23%) Query: 281 ESRRIG--DPPSLVADNKKAKKILGWNPKYKLRDIIESAW-----------NWHL 322 E+R G +PP +V NK+ ++ + P++ L ++ +W NWH+ Sbjct: 108 ETRPAGNKEPPIVVVGNKRDQQRHRFAPRHVLSVLVRKSWKCGYLECSAKYNWHI 162 >gnl|CDD|30648 COG0300, DltE, Short-chain dehydrogenases of various substrate specificities [General function prediction only]. Length = 265 Score = 26.8 bits (59), Expect = 7.8 Identities = 18/73 (24%), Positives = 29/73 (39%), Gaps = 8/73 (10%) Query: 1 MENKNVLVVGGAGYIGAHTCRVLYERGFLPIVL----DNLSSGHAEFVLWGPLE----QV 52 M+ K L+ G + IGA + L RG+ I++ D L + E +E Sbjct: 4 MKGKTALITGASSGIGAELAKQLARRGYNLILVARREDKLEALAKELEDKTGVEVEVIPA 63 Query: 53 DICDYTNLRAVFA 65 D+ D L + Sbjct: 64 DLSDPEALERLED 76 >gnl|CDD|133452 cd05213, NAD_bind_Glutamyl_tRNA_reduct, NADP-binding domain of glutamyl-tRNA reductase. Glutamyl-tRNA reductase catalyzes the conversion of glutamyl-tRNA to glutamate-1-semialdehyde, initiating the synthesis of tetrapyrrole. Whereas tRNAs are generally associated with peptide bond formation in protein translation, here the tRNA activates glutamate in the initiation of tetrapyrrole biosynthesis in archaea, plants and many bacteria. In the first step, activated glutamate is reduced to glutamate-1-semi-aldehyde via the NADPH dependent glutamyl-tRNA reductase. Glutamyl-tRNA reductase forms a V-shaped dimer. Each monomer has 3 domains: an N-terminal catalytic domain, a classic nucleotide binding domain, and a C-terminal dimerization domain. Although the representative structure 1GPJ lacks a bound NADPH, a theoretical binding pocket has been described. (PMID 11172694). Amino acid dehydrogenase (DH)-like NAD(P)-binding domains are members of the Rossmann fold superfamily and include glutamate, leucine, and phenylalanine DHs, methylene tetrahydrofolate DH, methylene-tetrahydromethanopterin DH, methylene-tetrahydropholate DH/cyclohydrolase, Shikimate DH-like proteins, malate oxidoreductases, and glutamyl tRNA reductase. Amino acid DHs catalyze the deamination of amino acids to keto acids with NAD(P)+ as a cofactor. The NAD(P)-binding Rossmann fold superfamily includes a wide variety of protein families including NAD(P)- binding domains of alcohol DHs, tyrosine-dependent oxidoreductases, glyceraldehyde-3-phosphate DH, lactate/malate DHs, formate/glycerate DHs, siroheme synthases, 6-phosphogluconate DH, amino acid DHs, repressor rex, NAD-binding potassium channel domain, CoA-binding, and ornithine cyclodeaminase-like domains. These domains have an alpha-beta-alpha configuration. NAD binding involves numerous hydrogen and van der Waals contacts. Length = 311 Score = 26.8 bits (60), Expect = 8.5 Identities = 14/41 (34%), Positives = 21/41 (51%), Gaps = 1/41 (2%) Query: 2 ENKNVLVVGGAGYIGAHTCRVLYERGFLPIVLDNLSSGHAE 42 + K VLV+G AG +G + L +G I + N + AE Sbjct: 177 KGKKVLVIG-AGEMGELAAKHLAAKGVAEITIANRTYERAE 216 >gnl|CDD|32328 COG2145, ThiM, Hydroxyethylthiazole kinase, sugar kinase family [Coenzyme metabolism]. Length = 265 Score = 26.7 bits (59), Expect = 9.6 Identities = 14/71 (19%), Positives = 30/71 (42%), Gaps = 12/71 (16%) Query: 221 CLRDYIHVLDLANA-------HIMA-----LEYLINQGDSIAINLGTGTGITVKEIISTI 268 C+ + + AN +MA +E D++ IN+GT + ++ + + I Sbjct: 19 CITNDVVQNFTANGLLALGASPVMADAPEEVEEFAKIADALLINIGTLSAERIQAMRAAI 78 Query: 269 QSMYECAFPIT 279 ++ E P+ Sbjct: 79 KAANESGKPVV 89 >gnl|CDD|146441 pfam03801, Ndc80_HEC, HEC/Ndc80p family. Members of this family are components of the mitotic spindle. It has been shown that Ndc80/HEC from yeast is part of a complex called the Ndc80p complex. This complex is thought to bind to the microtubules of the spindle. Length = 159 Score = 26.5 bits (59), Expect = 9.8 Identities = 11/36 (30%), Positives = 17/36 (47%) Query: 239 LEYLINQGDSIAINLGTGTGITVKEIISTIQSMYEC 274 LE+L+ G I I+ T T K+ I + +Y Sbjct: 47 LEFLVENGFEIEISPKTLKSPTQKDFIEIFKFLYNR 82 Database: CddA Posted date: Feb 4, 2011 9:38 PM Number of letters in database: 6,263,737 Number of sequences in database: 21,609 Lambda K H 0.320 0.136 0.413 Gapped Lambda K H 0.267 0.0752 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Sequences: 21609 Number of Hits to DB: 4,089,495 Number of extensions: 215018 Number of successful extensions: 548 Number of sequences better than 10.0: 1 Number of HSP's gapped: 509 Number of HSP's successfully gapped: 50 Length of query: 333 Length of database: 6,263,737 Length adjustment: 94 Effective length of query: 239 Effective length of database: 4,232,491 Effective search space: 1011565349 Effective search space used: 1011565349 Neighboring words threshold: 11 Window for multiple hits: 40 X1: 16 ( 7.4 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.8 bits) S2: 58 (26.1 bits)