RPS-BLAST 2.2.22 [Sep-27-2009]
Database: CddA
21,609 sequences; 6,263,737 total letters
Searching..................................................done
Query= gi|254780328|ref|YP_003064741.1| UDP-glucose 4-epimerase
[Candidatus Liberibacter asiaticus str. psy62]
(333 letters)
>gnl|CDD|31284 COG1087, GalE, UDP-glucose 4-epimerase [Cell envelope biogenesis,
outer membrane].
Length = 329
Score = 405 bits (1043), Expect = e-114
Identities = 144/327 (44%), Positives = 203/327 (62%), Gaps = 4/327 (1%)
Query: 4 KNVLVVGGAGYIGAHTCRVLYERGFLPIVLDNLSSGHAEFVLW--GPLEQVDICDYTNLR 61
VLV GGAGYIG+HT R L + G +VLDNLS+GH +L + D+ D L
Sbjct: 1 MKVLVTGGAGYIGSHTVRQLLKTGHEVVVLDNLSNGHKIALLKLQFKFYEGDLLDRALLT 60
Query: 62 AVFAKYQPASVMHFAGLTNISESVKNPSLFYEINIKGSFNLIATAIESNVRRFIFSSTCA 121
AVF + + +V+HFA ++ ESV+NP +Y+ N+ G+ NLI +++ V++FIFSST A
Sbjct: 61 AVFEENKIDAVVHFAASISVGESVQNPLKYYDNNVVGTLNLIEAMLQTGVKKFIFSSTAA 120
Query: 122 TYGIPHNTIITENDPQESITPYGYTKYVVERELLQHNKVNGLRSVVLRYFNAAGATLDSI 181
YG P + I+E P I PYG +K + E L K N + V+LRYFN AGA D
Sbjct: 121 VYGEPTTSPISETSPLAPINPYGRSKLMSEEILRDAAKANPFKVVILRYFNVAGACPDGT 180
Query: 182 IGEWHNPETHVIPLAIKTAMGYQNSFKVFGQDYATRDGTCLRDYIHVLDLANAHIMALEY 241
+G+ + T +IP+A + A+G ++ +FG DY T+DGTC+RDYIHV DLA+AH++AL+Y
Sbjct: 181 LGQRYPGATLLIPVAAEAALGKRDKLFIFGDDYDTKDGTCIRDYIHVDDLADAHVLALKY 240
Query: 242 LINQGDSIAINLGTGTGITVKEIISTIQSMYECAFPITYESRRIGDPPSLVADNKKAKKI 301
L G + NLG+G G +V E+I + + P+ RR GDP LVAD+ KA++I
Sbjct: 241 LKEGGSNNIFNLGSGNGFSVLEVIEAAKKVTGRDIPVEIAPRRAGDPAILVADSSKARQI 300
Query: 302 LGWNPKYK-LRDIIESAWNWH-LKYPR 326
LGW P Y L DII+ AW+WH ++
Sbjct: 301 LGWQPTYDDLEDIIKDAWDWHQQRHGD 327
>gnl|CDD|36585 KOG1371, KOG1371, KOG1371, UDP-glucose
4-epimerase/UDP-sulfoquinovose synthase [Cell
wall/membrane/envelope biogenesis].
Length = 343
Score = 264 bits (676), Expect = 3e-71
Identities = 123/334 (36%), Positives = 183/334 (54%), Gaps = 12/334 (3%)
Query: 4 KNVLVVGGAGYIGAHTCRVLYERGFLPIVLDNLSSGHAEFV-----LWG-----PLEQVD 53
K+VLV GGAGYIG+HT L +RG+ +++DNL++ + E + L G + D
Sbjct: 3 KHVLVTGGAGYIGSHTVLALLKRGYGVVIVDNLNNSYLESLKRVRQLLGEGKSVFFVEGD 62
Query: 54 ICDYTNLRAVFAKYQPASVMHFAGLTNISESVKNPSLFYEINIKGSFNLIATAIESNVRR 113
+ D L +F++ + +VMHFA L + ES++NP +Y NI G+ NL+ NV+
Sbjct: 63 LNDAEALEKLFSEVKFDAVMHFAALAAVGESMENPLSYYHNNIAGTLNLLEVMKAHNVKA 122
Query: 114 FIFSSTCATYGIPHNTIITENDPQESIT-PYGYTKYVVERELLQHNKVNGLRSVVLRYFN 172
+FSS+ YG+P ITE DP + T PYG TK +E + +NK G + LRYFN
Sbjct: 123 LVFSSSATVYGLPTKVPITEEDPTDQPTNPYGKTKKAIEEIIHDYNKAYGWKVTGLRYFN 182
Query: 173 AAGATLDSIIGEWHNPETHVIPLAIK-TAMGYQNSFKVFGQDYATRDGTCLRDYIHVLDL 231
GA IGE + + + A+G + + +V G+DY T DGT +RDYIHVLDL
Sbjct: 183 VIGAHPSGRIGEAPLGIPNNLLPYVFQVAIGRRPNLQVVGRDYTTIDGTIVRDYIHVLDL 242
Query: 232 ANAHIMALEYLINQGDSIAINLGTGTGITVKEIISTIQSMYECAFPITYESRRIGDPPSL 291
A+ H+ AL L + NLGTG G +V E+++ + RR GD +
Sbjct: 243 ADGHVAALGKLRGAAEFGVYNLGTGKGSSVLELVTAFEKALGVKIKKKVVPRRNGDVAFV 302
Query: 292 VADNKKAKKILGWNPKYKLRDIIESAWNWHLKYP 325
A+ KA++ LGW KY L+++++ W W + P
Sbjct: 303 YANPSKAQRELGWKAKYGLQEMLKDLWRWQKQNP 336
>gnl|CDD|144821 pfam01370, Epimerase, NAD dependent epimerase/dehydratase family.
This family of proteins utilize NAD as a cofactor. The
proteins in this family use nucleotide-sugar substrates
for a variety of chemical reactions.
Length = 235
Score = 181 bits (461), Expect = 2e-46
Identities = 76/238 (31%), Positives = 120/238 (50%), Gaps = 16/238 (6%)
Query: 6 VLVVGGAGYIGAHTCRVLYERGFLPIVLDNLSSGHAEFVLWG---PLEQVDICDYTNLRA 62
+LV GG G+IG+ R L + G+ IVL + D+ D L
Sbjct: 1 ILVTGGTGFIGSALVRRLLQEGYEVIVL-GRRRRSESLNTGRIRFRFHEGDLTDPDALER 59
Query: 63 VFAKYQPASVMHFAGLTNISESVKNPSLFYEINIKGSFNLIATAIESNVRRFIFSSTCAT 122
+ A+ QP +V+H A + + S ++P+ F N+ G+ NL+ A + V+RF+F+S+
Sbjct: 60 LLAEVQPDAVIHLAAQSGVGASFEDPAEFIRANVLGTLNLLEAARRAGVKRFVFASSSEV 119
Query: 123 YGIPHNTIITENDPQESITPYGYTKYVVERELLQHNKVNGLRSVVLRYFNAAGATLDSII 182
YG + ITE+ P ++PY K ER +L + + GLR+V+LR FN G
Sbjct: 120 YGDVADPPITEDTPLGPLSPYAAAKLAGERLVLAYARAYGLRAVILRLFNVYGP------ 173
Query: 183 GEWHNPETHVIPLAIKTAMGYQNSFKVFGQDYATRDGTCLRDYIHVLDLANAHIMALE 240
G+ THVIP I+ + + + G DGT RD+++V D+A A ++ALE
Sbjct: 174 GDNDTFVTHVIPALIRRILEGKPEILLLG------DGTQRRDFLYVDDVARAILLALE 225
>gnl|CDD|30800 COG0451, WcaG, Nucleoside-diphosphate-sugar epimerases [Cell
envelope biogenesis, outer membrane / Carbohydrate
transport and metabolism].
Length = 314
Score = 174 bits (442), Expect = 2e-44
Identities = 93/331 (28%), Positives = 153/331 (46%), Gaps = 24/331 (7%)
Query: 4 KNVLVVGGAGYIGAHTCRVLYERGFLPIVLDNLSSGHAEFVLWGPLEQVDICDYTNLRAV 63
+LV GGAG+IG+H L G LD L G + +D+ D + +
Sbjct: 1 MRILVTGGAGFIGSHLVERLLAAGHDVRGLDRLRDGLDPLLSGVEFVVLDLTDRDLVDEL 60
Query: 64 FAKYQPASVMHFAGLTNISESVK-NPSLFYEINIKGSFNLIATAIESNVRRFIFSSTCAT 122
A V+H A +++ +S +P+ F ++N+ G+ NL+ A + V+RF+F+S+ +
Sbjct: 61 AKGVPDA-VIHLAAQSSVPDSNASDPAEFLDVNVDGTLNLLEAARAAGVKRFVFASSVSV 119
Query: 123 -YGIPHNTIITEND-PQESITPYGYTKYVVERELLQHNKVNGLRSVVLRYFNAAGATLDS 180
YG P I E+ P + PYG +K E+ L + ++ GL V+LR FN G
Sbjct: 120 VYGDPPPLPIDEDLGPPRPLNPYGVSKLAAEQLLRAYARLYGLPVVILRPFNVYGP---- 175
Query: 181 IIGEWHNPETHVIPLAIKTAMGYQNSFKVFGQDYATRDGTCLRDYIHVLDLANAHIMALE 240
G+ + + V+ I+ + + + G DG+ RD+++V D+A+A ++ALE
Sbjct: 176 --GDKPDLSSGVVSAFIRQLLKGEPIIVIGG------DGSQTRDFVYVDDVADALLLALE 227
Query: 241 YLINQGDSIAINLGTGTG-ITVKEIISTIQSMYECAFPITY---ESRRIGDPPSLVADNK 296
D N+G+GT ITV+E+ + P+ RR + D
Sbjct: 228 N----PDGGVFNIGSGTAEITVRELAEAVAEAVGSKAPLIVYIPLGRRGDLREGKLLDIS 283
Query: 297 KAKKILGWNPKYKLRDIIESAWNWHLKYPRS 327
KA+ LGW PK L + + W LK
Sbjct: 284 KARAALGWEPKVSLEEGLADTLEWLLKKLEL 314
>gnl|CDD|31285 COG1088, RfbB, dTDP-D-glucose 4,6-dehydratase [Cell envelope
biogenesis, outer membrane].
Length = 340
Score = 124 bits (312), Expect = 4e-29
Identities = 89/337 (26%), Positives = 151/337 (44%), Gaps = 38/337 (11%)
Query: 4 KNVLVVGGAGYIGAHTCRVLYERG--FLPIVLDNLS-SGHAEFV--LWGP----LEQVDI 54
+LV GGAG+IG++ R + + + LD L+ +G+ E + + Q DI
Sbjct: 1 MKILVTGGAGFIGSNFVRYILNKHPDDHVVNLDKLTYAGNLENLADVEDSPRYRFVQGDI 60
Query: 55 CDYTNLRAVFAKYQPASVMHFAGLTNISESVKNPSLFYEINIKGSFNLIATAIESNVR-R 113
CD + +F +YQP +V+HFA +++ S+ P+ F + N+ G++ L+ A + + R
Sbjct: 61 CDRELVDRLFKEYQPDAVVHFAAESHVDRSIDGPAPFIQTNVVGTYTLLEAARKYWGKFR 120
Query: 114 FIFSSTCATYG-IPHN-TIITENDPQESITPYGYTKYVVERELLQHNKVNGLRSVVLRYF 171
F ST YG + + TE P +PY +K + + + + GL + + R
Sbjct: 121 FHHISTDEVYGDLGLDDDAFTETTPYNPSSPYSASKAASDLLVRAYVRTYGLPATITRCS 180
Query: 172 NAAGATLDSIIGEWHNPETHVIPLAIKTAMGYQNSFKVFGQDYATRDGTCLRDYIHVLDL 231
N G + PE +IPL I A+ V+G DG +RD+++V D
Sbjct: 181 NNYGP--------YQFPEK-LIPLMIINAL-LGKPLPVYG------DGLQIRDWLYVED- 223
Query: 232 ANAHIMALEYLINQGDSIAI-NLGTGTGITVKEIISTIQSMYE-----CAFPITYESRRI 285
H A++ ++ +G N+G G T E++ TI + IT+ R
Sbjct: 224 ---HCRAIDLVLTKGKIGETYNIGGGNERTNLEVVKTICELLGKDKPDYRDLITFVEDRP 280
Query: 286 GDPPSLVADNKKAKKILGWNPKYKLRDIIESAWNWHL 322
G D K K+ LGW P+ + +W+L
Sbjct: 281 GHDRRYAIDASKIKRELGWRPQETFETGLRKTVDWYL 317
>gnl|CDD|36642 KOG1429, KOG1429, KOG1429, dTDP-glucose
4-6-dehydratase/UDP-glucuronic acid decarboxylase
[Carbohydrate transport and metabolism, Cell
wall/membrane/envelope biogenesis].
Length = 350
Score = 101 bits (253), Expect = 3e-22
Identities = 78/336 (23%), Positives = 123/336 (36%), Gaps = 49/336 (14%)
Query: 2 ENKNVLVVGGAGYIGAHTCRVLYERGFLPIVLDNLSSGHAE-----FVLW-GPLEQVDIC 55
+N +L+ GGAG+IG+H L G I LDN +G E L + D+
Sbjct: 26 QNLRILITGGAGFIGSHLVDKLMTEGHEVIALDNYFTGRKENLEHWIGHPNFELIRHDVV 85
Query: 56 DYTNLRA--VFAKYQPASVMHFAGLTNISESVKNPSLFYEINIKGSFNLIATAIESNVRR 113
+ ++ PAS H+ NP + N+ G+ N++ A R
Sbjct: 86 EPLLKEVDQIYHLAAPASPPHY---------KYNPVKTIKTNVIGTLNMLGLAKRVG-AR 135
Query: 114 FIFSSTCATYGIPHNTIITEN-----DPQESITPYGYTKYVVERELLQHNKVNGLRSVVL 168
F+ +ST YG P E +P + Y K V E ++K G+ +
Sbjct: 136 FLLASTSEVYGDPLVHPQVETYWGNVNPIGPRSCYDEGKRVAETLCYAYHKQEGIEVRIA 195
Query: 169 RYFNAAGATLDSIIGEWHNPETHVIPLAIKTAMGYQ----NSFKVFGQDYATRDGTCLRD 224
R FN + P H+ + + Q V+G DG R
Sbjct: 196 RIFN------------TYGPRMHMDDGRVVSNFIAQALRGEPLTVYG------DGKQTRS 237
Query: 225 YIHVLDLANAHIMALEYLINQGDSIAINLGTGTGITVKEIISTIQSMYECAFPITYESRR 284
+ +V DL L L+ +N+G T+ E+ ++ + I +
Sbjct: 238 FQYVSDLVE----GLLRLMESDYRGPVNIGNPGEFTMLELAEMVKELIGPVSEIEFVENG 293
Query: 285 IGDPPSLVADNKKAKKILGWNPKYKLRDIIESAWNW 320
DP D KAK+ LGW PK LR+ + +
Sbjct: 294 PDDPRKRKPDITKAKEQLGWEPKVSLREGLPLTVTY 329
>gnl|CDD|35966 KOG0747, KOG0747, KOG0747, Putative NAD+-dependent epimerases
[Carbohydrate transport and metabolism].
Length = 331
Score = 98.1 bits (244), Expect = 3e-21
Identities = 89/349 (25%), Positives = 144/349 (41%), Gaps = 44/349 (12%)
Query: 2 ENKNVLVVGGAGYIGAHTCRVLYER--GFLPIVLDNLSSGHAEFVLWGPLE-------QV 52
+ KNVL+ GGAG+IG++ L ++ + + LD L L +
Sbjct: 5 KEKNVLITGGAGFIGSNFINYLVDKYPDYKFVNLDKLDYCSNLKNLEPVRNSPNYKFVEG 64
Query: 53 DICDYTNLRAVFAKYQPASVMHFAGLTNISESVKNPSLFYEINIKGSFNLIATAIES-NV 111
DI D + +F + +V+HFA T++ S + F + NI + L+ S N+
Sbjct: 65 DIADADLVLYLFETEEIDTVIHFAAQTHVDRSFGDSFEFTKNNILSTHVLLEAVRVSGNI 124
Query: 112 RRFIFSSTCATYG-IPHNTIITENDPQESITPYGYTKYVVERELLQHNKVNGLRSVVLRY 170
RRF+ ST YG + ++ E PY +K E + + + GL V R
Sbjct: 125 RRFVHVSTDEVYGDSDEDAVVGEASLLNPTNPYAASKAAAEMLVRSYGRSYGLPVVTTRM 184
Query: 171 FNAAGATLDSIIGEWHNPETHVIPLAIKTAMGYQNSFKVFGQDYATR-DGTCLRDYIHVL 229
N G ++ +IP IK AM G++Y DG R Y++V
Sbjct: 185 NNVYGP------NQYP---EKLIPKFIKLAMR--------GKEYPIHGDGLQTRSYLYVE 227
Query: 230 DLANAHIMALEYLINQGDSIAI-NLGTGTGITVKEIISTIQSMYE--------CAFPITY 280
D++ A LE +G+ I N+GT + V ++ I ++E F
Sbjct: 228 DVSEAFKAVLE----KGELGEIYNIGTDDEMRVIDLAKDICELFEKRLPNIDTEPFIFFV 283
Query: 281 ESRRIGDPPSLVADNKKAKKILGWNPKYKLRDIIESAWNWHLKYPRSLS 329
E R D + D++K KK LGW P + + W+ K + +S
Sbjct: 284 EDRPYNDLRYFL-DDEKIKK-LGWRPTTPWEEGLRKTIEWYTKNFKDVS 330
>gnl|CDD|36643 KOG1430, KOG1430, KOG1430, C-3 sterol
dehydrogenase/3-beta-hydroxysteroid dehydrogenase and
related dehydrogenases [Lipid transport and metabolism,
Amino acid transport and metabolism].
Length = 361
Score = 93.5 bits (232), Expect = 7e-20
Identities = 77/368 (20%), Positives = 137/368 (37%), Gaps = 59/368 (16%)
Query: 1 MENKNVLVVGGAGYIGAHTCRVLYERGFLP--IVLD----NLSSGHAEFVLWGPLEQVDI 54
+ +VLV GG+G++G H + L E V+D + V +
Sbjct: 2 EKKLSVLVTGGSGFLGQHLVQALLENELKLEIRVVDKTPTQSNLPAELTGFRSGRVTVIL 61
Query: 55 CDYTNLRAVFAKYQPASVMHFAGLTNISESVKNPSLFYEINIKGSFNLIATAIESNVRRF 114
D + ++ +Q A V+H A + L +N+ G+ N+I E V+R
Sbjct: 62 GDLLDANSISNAFQGAVVVHCAASPVPDFVENDRDLAMRVNVNGTLNVIEACKELGVKRL 121
Query: 115 IFSSTCATYGIPHNTIITEND---PQESITPYGYTKYVVERELLQHNKVNGLRSVVLRYF 171
I++S+ I + P + I PYG +K + E+ +L+ N + L + LR
Sbjct: 122 IYTSSAYVVFGGEPIINGDESLPYPLKHIDPYGESKALAEKLVLEANGSDDLYTCALR-- 179
Query: 172 NAAGATLDSIIGEWHNPETHVIPLAIKTAMGYQNSFKVFGQDYATRDGTCLRDYIHVLDL 231
I G + ++P ++ FK DG L D+ + ++
Sbjct: 180 ------PPGIYGP---GDKRLLPKIVEALKNGGFLFK-------IGDGENLNDFTYGENV 223
Query: 232 ANAHIMALEYLINQGDSIAI---NLGTGTGITVKEIISTI---------QSMYECAF--- 276
A AHI+A L+++ S+ + T + + +S + S+ F
Sbjct: 224 AWAHILAARALLDKSPSVNGQFYFITDDTPVRFFDFLSPLVKALGYCLPSSIKLPLFLSY 283
Query: 277 ------PITYESRRIGDPP------SLVA-----DNKKAKKILGWNPKYKLRDIIESAWN 319
I Y R P +L+ +KAK+ LG+ P L + I+ +
Sbjct: 284 FLAYLLEIVYFLLRPYQPILTRFRVALLGVTRTFSIEKAKRELGYKPLVSLEEAIQRTIH 343
Query: 320 WHLKYPRS 327
W S
Sbjct: 344 WVASESDS 351
>gnl|CDD|36644 KOG1431, KOG1431, KOG1431, GDP-L-fucose synthetase [Carbohydrate
transport and metabolism, Posttranslational
modification, protein turnover, chaperones].
Length = 315
Score = 86.6 bits (214), Expect = 1e-17
Identities = 72/334 (21%), Positives = 136/334 (40%), Gaps = 41/334 (12%)
Query: 4 KNVLVVGGAGYIGAHTCRVLYERGFLPIVLDNLSSGHAEFVLWGPLEQVDICDYTNLRAV 63
K +LV GG G +G+ +V+ E+GF + G + D+ + + RA+
Sbjct: 2 KKILVTGGTGLVGSAIVKVVQEQGFDDENW--VFIGSKD---------ADLTNLADTRAL 50
Query: 64 FAKYQPASVMHFAGLTN-ISESVKNPSLFYEINIKGSFNLIATAIESNVRRFIFS-STC- 120
F +P V+H A + + + F N++ + N++ +A E V++ + STC
Sbjct: 51 FESEKPTHVIHLAAMVGGLFHNNTYNLDFIRKNLQINDNVLHSAHEHGVKKVVSCLSTCI 110
Query: 121 ----ATYGIPHNTIITENDPQESITPYGYTKYVVERELLQHNKVNGLRSVVLRYFNAAGA 176
+Y I + P S Y Y K +++ + + + +G + N G
Sbjct: 111 FPDKTSYPIDETMVHN-GPPHPSNFGYSYAKRMIDVQNQAYRQQHGRDYTSVIPTNVFGP 169
Query: 177 TLDSIIGEWHNPE-THVIPLAIKTAMGYQNS----FKVFGQDYATRDGTCLRDYIHVLDL 231
+ NPE +HV+P I + + V+G G+ LR +I+ DL
Sbjct: 170 H------DNFNPENSHVLPALIHRFHEAKRNGTDELTVWGS------GSPLRQFIYSDDL 217
Query: 232 ANAHIMAL-EYLINQGDSIAINLGTGTGITVKEIISTIQSMYECAFPITYESRRIGDPPS 290
A+ I L EY + I +++G +T++E + + + +++ +
Sbjct: 218 ADLFIWVLREY--EGVEPIILSVGESDEVTIREAAEAVVEAVDFTGKLVWDTTKSDGQFK 275
Query: 291 LVADNKKAKKILGWNPKY-KLRDIIESAWNWHLK 323
A N K + L + K+ L I W+L
Sbjct: 276 KTASNSKLRS-LLPDFKFTPLEQAISETVQWYLD 308
>gnl|CDD|36715 KOG1502, KOG1502, KOG1502, Flavonol reductase/cinnamoyl-CoA
reductase [Defense mechanisms].
Length = 327
Score = 86.5 bits (214), Expect = 1e-17
Identities = 64/342 (18%), Positives = 125/342 (36%), Gaps = 60/342 (17%)
Query: 1 MENKNVLVVGGAGYIGAHTCRVLYERGFLPIVL-----DNLSSGH-AEFVLWGPLEQV-- 52
E K V V G +G+IG+ ++L RG+ D + H + ++
Sbjct: 4 DEGKKVCVTGASGFIGSWIVKLLLSRGYTVRGTVRDPEDEKKTEHLRKLEGAKERLKLFK 63
Query: 53 -DICDYTNLRAVFAK----YQPASVMHFAGLTNISESVKNPSLFYEINIKGSFNLIATAI 107
D+ D + + AS + F + + +KG+ N++
Sbjct: 64 ADLLDEGSFDKAIDGCDGVFHTASPVDF-------DLEDPEKELIDPAVKGTKNVLEACK 116
Query: 108 ESN-VRRFIFSSTCATYG-----IPHNTIITE---NDPQ---ESITPYGYTKYVVERELL 155
++ V+R +++S+ A I N+++ E +D Y +K + E+
Sbjct: 117 KTKSVKRVVYTSSTAAVRYNGPNIGENSVVDEESWSDLDFCRCKKLWYALSKTLAEKAAW 176
Query: 156 QHNKVNGLRSVVLRYFNAAGATLDSIIGEWHNPETHVIPLA--IKTAMGYQNSFKVFGQD 213
+ K NGL V + G L + + + L IK +F +
Sbjct: 177 EFAKENGLDLVTINPGLVFGPGLQP------SLNSSLNALLKLIKGLAETYPNFWLA--- 227
Query: 214 YATRDGTCLRDYIHVLDLANAHIMALEYLINQGDSIAINLGTGTGITVKEIISTIQSMY- 272
++ V D+A AH++ALE +G I G +++KEI ++ ++
Sbjct: 228 -----------FVDVRDVALAHVLALEKPSAKGRYIC----VGEVVSIKEIADILRELFP 272
Query: 273 ECAFPITYESRRIGDPPSLVADNKKAKKILGWNPKYKLRDII 314
+ P G S ++K K + G+ L + +
Sbjct: 273 DYPIPKKNAEEHEGFLTSFKVSSEKLKSLGGFK-FRPLEETL 313
>gnl|CDD|31286 COG1089, Gmd, GDP-D-mannose dehydratase [Cell envelope biogenesis,
outer membrane].
Length = 345
Score = 73.4 bits (180), Expect = 8e-14
Identities = 71/363 (19%), Positives = 128/363 (35%), Gaps = 64/363 (17%)
Query: 2 ENKNVLVVGGAGYIGAHTCRVLYERGFLPIVLDNLSSGHAEFVLWGP------------- 48
K L+ G G G++ +L E+G+ + + + F
Sbjct: 1 MGKVALITGITGQDGSYLAELLLEKGY---EVHGIKRRSSSFNTPRIHLYEDPHLNDPRL 57
Query: 49 -LEQVDICDYTNLRAVFAKYQPASVMHFAGLTNISESVKNPSLFYEINIKGSFNLI-ATA 106
L D+ D +NL + + QP + + A +++ S + P +++ G+ L+ A
Sbjct: 58 HLHYGDLTDSSNLLRILEEVQPDEIYNLAAQSHVGVSFEQPEYTADVDAIGTLRLLEAIR 117
Query: 107 IESNVR-RFIFSSTCATYGIPHNTIITENDPQESITPYGYTKYVVERELLQHNKVNGLRS 165
I + RF +ST YG+ E P +PY K + + + GL +
Sbjct: 118 ILGEKKTRFYQASTSELYGLVQEIPQKETTPFYPRSPYAVAKLYAYWITVNYRESYGLFA 177
Query: 166 VVLRYFNAAGATLDSIIGEWHNPETHV---IPLAI-KTAMGYQNSFKVFGQDYATRDGTC 221
FN ET V I A+ + +G Q+ + G A RD
Sbjct: 178 CNGILFNHESPL---------RGETFVTRKITRAVARIKLGLQDKLYL-GNLDAKRDWGH 227
Query: 222 LRDYIHVLDLANAHIMALEYLINQGDSIAINLGTGTGITVKEIISTIQSM--YECAFPIT 279
+DY+ + L ++ Q + + TG +V+E + M + + T
Sbjct: 228 AKDYVEAMWL----------MLQQEEPDDYVIATGETHSVREFVELAFEMVGIDLEWEGT 277
Query: 280 YESRRIGDP-------------------PSLVADNKKAKKILGWNPKYKLRDIIESAWNW 320
+ D L+ D KAK+ LGW P+ L +++
Sbjct: 278 GVDEKGVDAKTGKIIVEIDPRYFRPAEVDLLLGDPTKAKEKLGWRPEVSLEELVREMVEA 337
Query: 321 HLK 323
L+
Sbjct: 338 DLE 340
>gnl|CDD|31288 COG1091, RfbD, dTDP-4-dehydrorhamnose reductase [Cell envelope
biogenesis, outer membrane].
Length = 281
Score = 64.1 bits (156), Expect = 5e-11
Identities = 37/165 (22%), Positives = 63/165 (38%), Gaps = 20/165 (12%)
Query: 5 NVLVVGGAGYIGAHTCRVLYERGFLPIVLDNLSSGHAEFVLWGPLEQVDICDYTNLRAVF 64
+L+ G G +G R LP + +++ AE +DI D + V
Sbjct: 2 KILITGANGQLGTELRR------ALPGEFEVIATDRAE---------LDITDPDAVLEVI 46
Query: 65 AKYQPASVMHFAGLTNISESVKNPSLFYEINIKGSFNLIATAIESNVRRFIFSSTCATYG 124
+ +P V++ A T + ++ P L + +N G+ NL A E R + ST +
Sbjct: 47 RETRPDVVINAAAYTAVDKAESEPELAFAVNATGAENLARAAAEVGA-RLVHISTDYVFD 105
Query: 125 IPHNTIITENDPQESITPYGYTKYVVERELLQHNKVNGLRSVVLR 169
E D + YG +K E + R ++LR
Sbjct: 106 GEKGGPYKETDTPNPLNVYGRSKLAGEEAVRAAGP----RHLILR 146
>gnl|CDD|144605 pfam01073, 3Beta_HSD, 3-beta hydroxysteroid dehydrogenase/isomerase
family. The enzyme 3 beta-hydroxysteroid
dehydrogenase/5-ene-4-ene isomerase (3 beta-HSD)
catalyses the oxidation and isomerisation of 5-ene-3
beta-hydroxypregnene and 5-ene-hydroxyandrostene steroid
precursors into the corresponding 4-ene-ketosteroids
necessary for the formation of all classes of steroid
hormones.
Length = 280
Score = 63.5 bits (155), Expect = 9e-11
Identities = 63/266 (23%), Positives = 106/266 (39%), Gaps = 46/266 (17%)
Query: 7 LVVGGAGYIGAHTCRVLYERGFLPIV----LDNLSSGHAEFVLWGPLE-----QVDICDY 57
LV GG G++G H R+L G L V L +F L+ + D+ D
Sbjct: 1 LVTGGGGFLGRHIVRLLLREGELQEVRVFDLRFSPELLEDF---SKLQVITYIEGDVTDK 57
Query: 58 TNLRAVFAKYQPAS-VMHFAGLTNISESVKNPSLFYEINIKGSFNLIATAIESNVRRFIF 116
+LR Q + V+H A + ++ ++N+KG+ N++ +++ VR ++
Sbjct: 58 QDLRRAL---QGSDVVIHTAAIIDVF-GKAYRDTIMKVNVKGTQNVLDACVKAGVRVLVY 113
Query: 117 SSTC-ATYGIPHNTII---TENDPQESI--TPYGYTKYVVERELLQHN---KVNG--LRS 165
+S+ + I E P ES PY +K + E+ +L+ N NG L +
Sbjct: 114 TSSMEVVGPNSYGQPIVNGDETTPYESTHQDPYPESKALAEKLVLKANGSTLKNGGRLYT 173
Query: 166 VVLRYFNAAGATLDSIIGEWHNPETHVIPLAIKTAMGYQNSFKVFGQDYATRDGTCLRDY 225
LR I GE +P K + T D L D
Sbjct: 174 CALR--------PAGIFGE-GDPFLFP---------FLVRLLKNGLAKFRTGDKNVLSDR 215
Query: 226 IHVLDLANAHIMALEYLINQGDSIAI 251
++V ++A AHI+A L + + +I
Sbjct: 216 VYVGNVAWAHILAARALQDPKKASSI 241
>gnl|CDD|31283 COG1086, COG1086, Predicted nucleoside-diphosphate sugar epimerases
[Cell envelope biogenesis, outer membrane / Carbohydrate
transport and metabolism].
Length = 588
Score = 60.3 bits (146), Expect = 7e-10
Identities = 45/197 (22%), Positives = 73/197 (37%), Gaps = 47/197 (23%)
Query: 2 ENKNVLVVGGAGYIGAHTCRVLYERGFLPIVL-----DNLSSGHAEFVLWGPLEQV---- 52
K VLV GG G IG+ CR + + I+L L E P ++
Sbjct: 249 TGKTVLVTGGGGSIGSELCRQILKFNPKEIILFSRDEYKLYLIDMELREKFPELKLRFYI 308
Query: 53 -DICDYTNLRAVFAKYQPASVMHFAGLT-------NISESVKNPSLFYEINIKGSFNLIA 104
D+ D + ++ V H A L N E++K N+ G+ N+
Sbjct: 309 GDVRDRDRVERAMEGHKVDIVFHAAALKHVPLVEYNPEEAIKT-------NVLGTENVAE 361
Query: 105 TAIESNVRRFIFSSTCATYGIPHNTIITENDPQESITP---YGYTKYVVERELLQ---HN 158
AI++ V++F+ ST +++ P G TK + E+ +
Sbjct: 362 AAIKNGVKKFVLIST-----------------DKAVNPTNVMGATKRLAEKLFQAANRNV 404
Query: 159 KVNGLRSVVLRYFNAAG 175
G R V+R+ N G
Sbjct: 405 SGTGTRFCVVRFGNVLG 421
>gnl|CDD|37985 KOG2774, KOG2774, KOG2774, NAD dependent epimerase [General
function prediction only].
Length = 366
Score = 56.6 bits (136), Expect = 1e-08
Identities = 74/330 (22%), Positives = 122/330 (36%), Gaps = 52/330 (15%)
Query: 6 VLVVGGAGYIGAHTCRVL-YERGFLPIVLDNLSSGHAEFVLWGPLEQVDICDYTNLRAVF 64
VL+ G G +G +L Y G ++L ++ A GP +DI D +L +
Sbjct: 47 VLITGSLGQLGRGLASLLRYMYGSECVILSDIVKPPANVTDVGPYIYLDILDQKSLEEIV 106
Query: 65 AKYQPASVMHFAG-LTNISESVKNPSLFYEINIKGSFNLIATAIESNVRRFIFSSTCATY 123
+ ++HF+ L+ + E+ N L ++NI+G N++ A + ++ F+ ST +
Sbjct: 107 VNKRIDWLVHFSALLSAVGET--NVPLALQVNIRGVHNILQVAAKHKLKVFV-PSTIGAF 163
Query: 124 GIPHNTIITENDP----QESITPYGYTKYVVER--ELLQHNKVNGLRSV----VLRYFNA 173
G T P Q T YG +K E E H RS+ ++
Sbjct: 164 G---PTSPRNPTPDLTIQRPRTIYGVSKVHAELLGEYFNHRFGVDFRSMRFPGIISATKP 220
Query: 174 AGATLDSIIGEWH---NPETHVIPLAIKT--AMGYQNSFKVFGQDYATRDGTCLRDYIHV 228
G T D I ++ H L T M Y D C+ I +
Sbjct: 221 GGGTTDYAIAIFYDALQKGKHTCYLRPDTRLPMMY--------------DTDCMASVIQL 266
Query: 229 LDLANAHIMALEYLINQGDSIAINLGTGTGITVKEIISTIQSMYECAFPITYE--SRR-I 285
L + + Y + TG T +EI I+ + F I Y+ +R+ I
Sbjct: 267 LAADSQSLKRRTYNV-----------TGFSFTPEEIADAIRRVMP-GFEIDYDICTRQSI 314
Query: 286 GDPPSLVADNKKAKKILGWNPKYKLRDIIE 315
D + D+ +A+ L II
Sbjct: 315 ADSWPMSLDDSEARTEWHEKHSLHLLSIIS 344
>gnl|CDD|145720 pfam02719, Polysacc_synt_2, Polysaccharide biosynthesis protein.
This is a family of diverse bacterial polysaccharide
biosynthesis proteins including the CapD protein, WalL
protein mannosyl-transferase and several putative
epimerases (e.g. WbiI).
Length = 280
Score = 54.4 bits (132), Expect = 4e-08
Identities = 35/133 (26%), Positives = 52/133 (39%), Gaps = 28/133 (21%)
Query: 6 VLVVGGAGYIGAHTCRVLYERGFLP---IVLD----NLSSGHAEFVLWGPLEQV-----D 53
VLV GG G IG+ CR + + P I+ L E ++ D
Sbjct: 1 VLVTGGGGSIGSELCRQILKFN--PKKIILFSRDEFKLYEIEQELREEYNDPKLRFFIGD 58
Query: 54 ICDYTNLRAVFAKYQPASVMHFAGLT-------NISESVKNPSLFYEINIKGSFNLIATA 106
+ D L +Y +V H A L N E++K N+ G+ N+ A
Sbjct: 59 VRDRERLERAMEEYGVDTVFHAAALKHVPLVEYNPMEAIKT-------NVLGTENVAEAA 111
Query: 107 IESNVRRFIFSST 119
IE+ V +F+ ST
Sbjct: 112 IENGVEKFVLIST 124
>gnl|CDD|113104 pfam04321, RmlD_sub_bind, RmlD substrate binding domain.
L-rhamnose is a saccharide required for the virulence of
some bacteria. Its precursor, dTDP-L-rhamnose, is
synthesized by four different enzymes the final one of
which is RmlD. The RmlD substrate binding domain is
responsible for binding a sugar nucleotide.
Length = 284
Score = 46.8 bits (112), Expect = 7e-06
Identities = 44/168 (26%), Positives = 68/168 (40%), Gaps = 27/168 (16%)
Query: 6 VLVVGGAGYIGAHTCRVLYERGFLPIVLDNLSSGHAEFVLWGPLEQVDICDYTNLRAVFA 65
+LV G G +G R+L ERG + LD ++D+ D + A+
Sbjct: 1 ILVTGANGQLGRELTRLLAERGVEVVALDR--------------PELDLTDPEAVAALVR 46
Query: 66 KYQPASVMHFAGLTNISESVKNPSLFYEINIKGSFNLIATAIESNVRRFIFSST----CA 121
+ +P V++ A T + ++ P L Y +N G NL I ST
Sbjct: 47 EARPDVVVNAAAYTAVDKAESEPELAYAVNALGPGNLAEACAARGA-PLIHISTDYVFDG 105
Query: 122 TYGIPHNTIITENDPQESITPYGYTKYVVERELLQHNKVNGLRSVVLR 169
G P+ E+DP + YG TK E+ +L N R ++LR
Sbjct: 106 AKGGPY----REDDPTGPLNVYGRTKLAGEQAVLAANP----RHLILR 145
>gnl|CDD|143883 pfam00106, adh_short, short chain dehydrogenase. This family
contains a wide variety of dehydrogenases.
Length = 167
Score = 46.4 bits (111), Expect = 1e-05
Identities = 39/146 (26%), Positives = 58/146 (39%), Gaps = 24/146 (16%)
Query: 4 KNVLVVGGAGYIGAHTCRVLYERGFLPIVLD---NLSSGHAEFV--LWGPLEQVDI--CD 56
VL+ GG G +G R L G +VL + G AE V L +V + CD
Sbjct: 1 GTVLITGGTGGLGLALARWLAAEGARHLVLVSRRGDAPGAAELVAELEALGAEVTVAACD 60
Query: 57 YTN-------LRAVFAKYQP-ASVMHFAGL---TNISESVKNPSLF---YEINIKGSFNL 102
+ L A+ A P V+H AG+ + E P F + G++NL
Sbjct: 61 VADRDALAALLAALPAALGPLDGVVHNAGVLDDGPLEEL--TPERFERVLAPKVTGAWNL 118
Query: 103 IATAIESNVRRFI-FSSTCATYGIPH 127
++ ++ F+ FSS G P
Sbjct: 119 HELTLDLDLGAFVLFSSVAGVLGSPG 144
>gnl|CDD|39489 KOG4288, KOG4288, KOG4288, Predicted oxidoreductase [General
function prediction only].
Length = 283
Score = 38.1 bits (88), Expect = 0.003
Identities = 39/176 (22%), Positives = 61/176 (34%), Gaps = 34/176 (19%)
Query: 1 MENKNVLVVGGAGYIGAHTCRVLYERGF-LPIVLDN----LSSGHAEFVLWGPLEQVDIC 55
+E + LV+GG + G+ + + I+ +N S +V W
Sbjct: 50 VEVEWTLVLGGNPFSGSEVLKNATNVVHSVGILSENENKQTLSSWPTYVSWHRGNSF--- 106
Query: 56 DYTNLRAVFAKYQPASVMHFAGLTNISESVKNPSLFYEINIKGSFNLIATAIESNVRRFI 115
+N + P V G N L IN + N + A ++ V RF+
Sbjct: 107 -SSNPNKL-KLSGPTFVYEMMG------GFGNIILMDRINGTANINAVKAAAKAGVPRFV 158
Query: 116 FSSTCATYGIPHNTIITENDPQESITPYGY--TKYVVERELLQHNKVNGLRSVVLR 169
+ S +G+P P GY K E ELL K R ++LR
Sbjct: 159 YISA-HDFGLPPL------------IPRGYIEGKREAEAELL---KKFRFRGIILR 198
>gnl|CDD|31231 COG1028, FabG, Dehydrogenases with different specificities (related
to short-chain alcohol dehydrogenases) [Secondary
metabolites biosynthesis, transport, and catabolism /
General function prediction only].
Length = 251
Score = 37.5 bits (86), Expect = 0.005
Identities = 41/265 (15%), Positives = 67/265 (25%), Gaps = 62/265 (23%)
Query: 1 MENKNVLVVGGAGYIGAHTCRVLYERGFLPIVLDNLSSGHAEFVLWGPLEQVDI------ 54
+ K LV G + IG R L G +V S A L +++
Sbjct: 3 LSGKVALVTGASSGIGRAIARALAREGARVVVAARRSEEEAAEALAAAIKEAGGGRAAAV 62
Query: 55 -CDYTNLRAVFAKYQPASVMHF---------AGLTNISESVKNPSL-----FYEINIKGS 99
D ++ A+ F AG+ ++ + ++N+ G+
Sbjct: 63 AADVSDDEESVEALVAAAEEEFGRIDILVNNAGIAGPDAPLEELTEEDWDRVIDVNLLGA 122
Query: 100 FNLIATAIESNVRRFIF--SSTCATYGIPHNTIITENDPQESITPYGYTKYVVE---REL 154
F L A+ ++ I SS G P Y +K + + L
Sbjct: 123 FLLTRAALPLMKKQRIVNISSVAGLGGPPGQ------------AAYAASKAALIGLTKAL 170
Query: 155 LQHNKVNGLRSVVLRYFNAAGATLDSIIGEWHNPETHVIPLAIKTAMGYQNSFKVFGQDY 214
G+R V P I T M
Sbjct: 171 ALELAPRGIRVNA------------------------VAPGYIDTPMTAALESAELEALK 206
Query: 215 ATRDGTCLRDYIHVLDLANAHIMAL 239
L ++A A
Sbjct: 207 RLAARIPLGRLGTPEEVAAAVAFLA 231
>gnl|CDD|39371 KOG4169, KOG4169, KOG4169, 15-hydroxyprostaglandin dehydrogenase
and related dehydrogenases [Lipid transport and
metabolism, General function prediction only].
Length = 261
Score = 36.4 bits (84), Expect = 0.012
Identities = 30/113 (26%), Positives = 46/113 (40%), Gaps = 15/113 (13%)
Query: 3 NKNVLVVGGAGYIGAHTCRVLYERG-FLPIVLDNLSSGHAEFVLWGPLEQVDI----CDY 57
KN LV GGAG IG T + L E+G + ++ D+ + A L V + CD
Sbjct: 5 GKNALVTGGAGGIGLATSKALLEKGIKVLVIDDSEENPEAIAKLQAINPSVSVIFIKCDV 64
Query: 58 TN---LRAVFAKYQPASVMHFAGLTNISESVKNPSLFYEINIKGSFNLIATAI 107
TN L A F K A I + + + + + + N+ T +
Sbjct: 65 TNRGDLEAAFDK-------ILATFGTIDILINGAGILDDKDWERTINVNLTGV 110
>gnl|CDD|31046 COG0702, COG0702, Predicted nucleoside-diphosphate-sugar epimerases
[Cell envelope biogenesis, outer membrane / Carbohydrate
transport and metabolism].
Length = 275
Score = 34.9 bits (79), Expect = 0.034
Identities = 21/115 (18%), Positives = 33/115 (28%), Gaps = 9/115 (7%)
Query: 4 KNVLVVGGAGYIGAHTCRVLYERGFLPIVLDNLSSGHAEFVLWGPLEQVDICDYTNLRAV 63
+LV G G++G R L RG A + D+ D +L A
Sbjct: 1 MKILVTGATGFVGGAVVRELLARGHEVRAAVRNPEAAAALAGGVEVVLGDLRDPKSLVAG 60
Query: 64 FAKYQPASVMHFAGLTNISESVKNPSLFYEINIKGSFNLIATAIESNVRRFIFSS 118
G+ IS + F + + A V+ + S
Sbjct: 61 AK--------GVDGVLLISGLLDGSDAFRAVQVTAVVRAAEAAGAG-VKHGVSLS 106
>gnl|CDD|31287 COG1090, COG1090, Predicted nucleoside-diphosphate sugar epimerase
[General function prediction only].
Length = 297
Score = 34.8 bits (80), Expect = 0.036
Identities = 37/172 (21%), Positives = 60/172 (34%), Gaps = 21/172 (12%)
Query: 6 VLVVGGAGYIGAHTCRVLYERGFLPIVLDNLSSGHAEFVLWGPLEQVDICDYTNLRAVFA 65
+L+ GG G IG L + G +L A L + + + A
Sbjct: 1 ILITGGTGLIGRALTARLRKGGHQVTILTR-RPPKASQNLHPNVTLWEGLADALTLGIDA 59
Query: 66 KYQPASVMHFAGLTNISESVKNPSLFYEI---NIKGS---FNLIATAIESNVRRFIFSST 119
V++ AG I+E EI I + LIA + E+ + I +S
Sbjct: 60 ------VINLAG-EPIAERRWTEKQKEEIRQSRINTTEKLVELIAAS-ETKPKVLISASA 111
Query: 120 CATYGIPHNTIITENDPQESITPYGYTKYVVER--ELLQHNKVNGLRSVVLR 169
YG + ++TE P + + + E + G R V+LR
Sbjct: 112 VGYYGHSGDRVVTEESP----PGDDFLAQLCQDWEEEALQAQQLGTRVVLLR 159
>gnl|CDD|114110 pfam05368, NmrA, NmrA-like family. NmrA is a negative
transcriptional regulator involved in the
post-translational modification of the transcription
factor AreA. NmrA is part of a system controlling
nitrogen metabolite repression in fungi. This family
only contains a few sequences as iteration results in
significant matches to other Rossmann fold families.
Length = 232
Score = 32.6 bits (75), Expect = 0.17
Identities = 24/123 (19%), Positives = 39/123 (31%), Gaps = 31/123 (25%)
Query: 6 VLVVGGAGYIGAHTCRVLYERGFLPI----------VLDNLSSGHAEFVLWGPLEQVDIC 55
+LV G GY G R + G P+ + +L + E V + D+
Sbjct: 1 ILVFGATGYQGGSVVRASLKAGH-PVRALVRDPKSELAKSLKAAGVELV------EGDLD 53
Query: 56 DYTNLRAVFAKYQPASVMHFAGLTNISESVKNPSLFYEINIKGSFNLIATAIESNVRRFI 115
D+ +L + T + I+ L A E+ V+ FI
Sbjct: 54 DHESLVEALKGVDVVFSV-----TG---------FWLSKEIEDGKKLADAAKEAGVKHFI 99
Query: 116 FSS 118
S
Sbjct: 100 PSE 102
>gnl|CDD|36417 KOG1203, KOG1203, KOG1203, Predicted dehydrogenase [Carbohydrate
transport and metabolism].
Length = 411
Score = 31.5 bits (71), Expect = 0.36
Identities = 11/28 (39%), Positives = 16/28 (57%)
Query: 1 MENKNVLVVGGAGYIGAHTCRVLYERGF 28
+ VLVVG G +G ++L +RGF
Sbjct: 77 KKPTTVLVVGATGKVGRRIVKILLKRGF 104
>gnl|CDD|37958 KOG2747, KOG2747, KOG2747, Histone acetyltransferase (MYST family)
[Chromatin structure and dynamics].
Length = 396
Score = 31.2 bits (70), Expect = 0.44
Identities = 22/88 (25%), Positives = 31/88 (35%), Gaps = 17/88 (19%)
Query: 239 LEYLINQGDS-IAIN-LGTGTGITVKEIISTIQSM----YECAFPITYESRRIGDPPSLV 292
LE L I+I + TGI +IIST+QS+ Y + I+ S L
Sbjct: 319 LELLRKHRGEHISIKEISKETGIRPDDIISTLQSLNMIKYYKGYIISICSDDK-----LE 373
Query: 293 ADNKKAKKILGWNPKYKLRDIIESAWNW 320
+ W K R + W
Sbjct: 374 DHKRN------WAKFKKPRLLDPDCLLW 395
>gnl|CDD|177093 CHL00194, ycf39, Ycf39; Provisional.
Length = 317
Score = 31.1 bits (71), Expect = 0.47
Identities = 10/35 (28%), Positives = 16/35 (45%)
Query: 85 VKNPSLFYEINIKGSFNLIATAIESNVRRFIFSST 119
+ +I+ G LI A + ++RFIF S
Sbjct: 76 PSDLYNAKQIDWDGKLALIEAAKAAKIKRFIFFSI 110
>gnl|CDD|33946 COG4221, COG4221, Short-chain alcohol dehydrogenase of unknown
specificity [General function prediction only].
Length = 246
Score = 30.6 bits (69), Expect = 0.69
Identities = 35/176 (19%), Positives = 58/176 (32%), Gaps = 45/176 (25%)
Query: 1 MENKNVLVVGGAGYIGAHTCRVLYERGFLPIVL------------DNLSSGHAEFVLWGP 48
++ K L+ G + IG T R L E G +VL D + +G A +
Sbjct: 4 LKGKVALITGASSGIGEATARALAEAGA-KVVLAARREERLEALADEIGAGAALAL---- 58
Query: 49 LEQVDICDY----TNLRAVFAKYQPASVM-HFAGLTNISE----SVKNPSLFYEINIKGS 99
+D+ D + A+ ++ ++ + AGL + + + N+KG
Sbjct: 59 --ALDVTDRAAVEAAIEALPEEFGRIDILVNNAGLALGDPLDEADLDDWDRMIDTNVKGL 116
Query: 100 FNLIATAIESNVRR-----FIFSSTCATYGIPHNTIITENDPQESITPYGYTKYVV 150
N + V R S Y P + YG TK V
Sbjct: 117 LNGTRAVLPGMVERKSGHIINLGSIAGRYPYPGGAV------------YGATKAAV 160
>gnl|CDD|36415 KOG1201, KOG1201, KOG1201, Hydroxysteroid 17-beta dehydrogenase 11
[Secondary metabolites biosynthesis, transport and
catabolism].
Length = 300
Score = 30.2 bits (68), Expect = 0.73
Identities = 33/199 (16%), Positives = 60/199 (30%), Gaps = 43/199 (21%)
Query: 2 ENKNVLVVGGAGYIGAHTCRVLYERGFLPIVLDNLSSGHAEFV----LWGPLE--QVDIC 55
+ VL+ GG +G +RG ++ D G+ E V G + DI
Sbjct: 37 SGEIVLITGGGSGLGRLIALEFAKRGAKLVLWDINKQGNEETVKEIRKIGEAKAYTCDIS 96
Query: 56 DYTNLRAVFAKYQP-----------ASVMHFAGLTNISESVKNPSLFYEINIKGSFNLIA 104
D + + K + A ++ L + S+ + +++N F
Sbjct: 97 DREEIYRLAKKVKKEVGDVDILVNNAGIVTGKKLLDCSD--EEIQKTFDVNTIAHFWTTK 154
Query: 105 T---AIESNVRRFI--FSSTCATYGIPHNTIITENDPQESITPYGYTKYVV-------ER 152
+ N I +S +G + Y +K+
Sbjct: 155 AFLPKMLENNNGHIVTIASVAGLFGPA------------GLADYCASKFAAVGFHESLSM 202
Query: 153 ELLQHNKVNGLRSVVLRYF 171
EL K ++V YF
Sbjct: 203 ELRALGKDGIKTTLVCPYF 221
>gnl|CDD|36419 KOG1205, KOG1205, KOG1205, Predicted dehydrogenase [Secondary
metabolites biosynthesis, transport and catabolism].
Length = 282
Score = 29.5 bits (66), Expect = 1.2
Identities = 9/27 (33%), Positives = 13/27 (48%)
Query: 1 MENKNVLVVGGAGYIGAHTCRVLYERG 27
+ K VL+ G + IG L +RG
Sbjct: 10 LAGKVVLITGASSGIGEALAYELAKRG 36
>gnl|CDD|133172 cd05040, PTKc_Ack_like, Catalytic domain of the Protein Tyrosine
Kinase, Activated Cdc42-associated kinase. Protein
Tyrosine Kinase (PTK) family; Activated Cdc42-associated
kinase (Ack) subfamily; catalytic (c) domain. Ack
subfamily members include Ack1, thirty-eight-negative
kinase 1 (Tnk1), and similar proteins. The PTKc family
is part of a larger superfamily that includes the
catalytic domains of other kinases such as protein
serine/threonine kinases, RIO kinases, and
phosphoinositide 3-kinase (PI3K). PTKs catalyze the
transfer of the gamma-phosphoryl group from ATP to
tyrosine (tyr) residues in protein substrates. Ack
subfamily members are cytoplasmic (or nonreceptor) tyr
kinases containing an N-terminal catalytic domain, an
SH3 domain, a Cdc42-binding CRIB domain, and a
proline-rich region. They are mainly expressed in brain
and skeletal tissues and are involved in the regulation
of cell adhesion and growth, receptor degradation, and
axonal guidance. Ack1 is also associated with
androgen-independent prostate cancer progression. Tnk1
regulates TNFalpha signaling and may play an important
role in cell death.
Length = 257
Score = 29.7 bits (67), Expect = 1.3
Identities = 8/17 (47%), Positives = 10/17 (58%)
Query: 183 GEWHNPETHVIPLAIKT 199
GEW VIP+A+K
Sbjct: 14 GEWSTSGGKVIPVAVKC 30
>gnl|CDD|36422 KOG1208, KOG1208, KOG1208, Dehydrogenases with different
specificities (related to short-chain alcohol
dehydrogenases) [Secondary metabolites biosynthesis,
transport and catabolism].
Length = 314
Score = 28.8 bits (64), Expect = 2.1
Identities = 11/28 (39%), Positives = 12/28 (42%)
Query: 1 MENKNVLVVGGAGYIGAHTCRVLYERGF 28
+ K LV G IG T R L RG
Sbjct: 33 LSGKVALVTGATSGIGFETARELALRGA 60
>gnl|CDD|31009 COG0665, DadA, Glycine/D-amino acid oxidases (deaminating) [Amino
acid transport and metabolism].
Length = 387
Score = 28.7 bits (63), Expect = 2.3
Identities = 13/54 (24%), Positives = 22/54 (40%), Gaps = 6/54 (11%)
Query: 1 MENKNVLVVGGAGYIGAHTCRVLYERGFLPIVLD-----NLSSGHAEFVLWGPL 49
+V+++GG G +G L ERG VL+ ++G + P
Sbjct: 2 SMKMDVVIIGG-GIVGLSAAYYLAERGADVTVLEAGEAGGGAAGRNAGGILAPW 54
>gnl|CDD|30896 COG0550, TopA, Topoisomerase IA [DNA replication, recombination,
and repair].
Length = 570
Score = 28.7 bits (64), Expect = 2.7
Identities = 12/54 (22%), Positives = 24/54 (44%), Gaps = 5/54 (9%)
Query: 258 GITVKEIISTIQSMYECAFPITY---ESRRIGDPPSLVADNKKAKKILGWNPKY 308
G + K+ + Q +YE ITY +S R+ +A+ + + + +Y
Sbjct: 274 GFSAKKTMDIAQKLYEGHGLITYPRTDSTRL--SEEALAEARLYILAIAYGKEY 325
>gnl|CDD|39552 KOG4351, KOG4351, KOG4351, Uncharacterized conserved protein
[Function unknown].
Length = 244
Score = 28.5 bits (63), Expect = 2.9
Identities = 10/37 (27%), Positives = 15/37 (40%)
Query: 99 SFNLIATAIESNVRRFIFSSTCATYGIPHNTIITEND 135
SF L I + + SST +P ++ I E
Sbjct: 194 SFELSDIPIVQELEQLPVSSTPCDSPVPIDSPIIEKP 230
>gnl|CDD|39164 KOG3961, KOG3961, KOG3961, Uncharacterized conserved protein
[Function unknown].
Length = 262
Score = 28.1 bits (62), Expect = 3.6
Identities = 15/38 (39%), Positives = 21/38 (55%), Gaps = 3/38 (7%)
Query: 21 RVLYERGFLPIVLDNLSSGHAEFVLWG-PLEQVDICDY 57
R +Y RG LPI L++ S G + W E++D C Y
Sbjct: 80 RAIYSRGDLPISLEHGSKGRK--LQWECDPEKLDYCPY 115
>gnl|CDD|31275 COG1077, MreB, Actin-like ATPase involved in cell morphogenesis
[Cell division and chromosome partitioning].
Length = 342
Score = 27.9 bits (62), Expect = 3.8
Identities = 16/59 (27%), Positives = 25/59 (42%), Gaps = 7/59 (11%)
Query: 249 IAINLGTG-TGITVKEIISTIQSMYECAFPITYESRRIGDPPSLVADNKKAKKILGWNP 306
I I+LGT T + VK + P G ++A ++AK++LG P
Sbjct: 9 IGIDLGTANTLVYVKGKGIVLN------EPSVVAIESEGKTKVVLAVGEEAKQMLGRTP 61
>gnl|CDD|176124 cd08433, PBP2_Nac, The C-teminal substrate binding domain of
LysR-like nitrogen assimilation control (NAC) protein,
contains the type 2 periplasmic binding fold. The NAC
is a LysR-type transcription regulator that activates
expression of operons such as hut (histidine
utilization) and ure (urea utilization), allowing use of
non-preferred (poor) nitrogen sources, and represses
expression of operons, such as glutamate dehydrogenase
(gdh), allowing assimilation of the preferred nitrogen
source. The expression of the nac gene is fully
dependent on the nitrogen regulatory system (NTR) and
the sigma54-containing RNA polymerase (sigma54-RNAP). In
response to nitrogen starvation, NTR system activates
the expression of nac, and NAC activates the expression
of hut, ure, and put (proline utilization). NAC is not
involved in the transcription of Sigma70-RNAP operons
such as glnA, which directly respond by the NTR system,
but activates the transcription of sigma70-RNAP
dependent operons such as hut. Hence, NAC allows the
coupling of sigma70-RNAP dependent operons to the
sigma54-RNAP dependent NTR system. This
substrate-binding domain has significant homology to the
type 2 periplasmic binding proteins (PBP2), which are
responsible for the uptake of a variety of substrates
such as phosphate, sulfate, polysaccharides,
lysine/arginine/ornithine, and histidine. The PBP2 bind
their ligand in the cleft between these domains in a
manner resembling a Venus flytrap. After binding their
specific ligand with high affinity, they can interact
with a cognate membrane transport complex comprised of
two integral membrane domains and two cytoplasmically
located ATPase domains. This interaction triggers the
ligand translocation across the cytoplasmic membrane
energized by ATP hydrolysis.
Length = 198
Score = 27.9 bits (63), Expect = 3.8
Identities = 11/38 (28%), Positives = 18/38 (47%), Gaps = 6/38 (15%)
Query: 152 RELLQHNKV-----NGLRSVVLRYFNAAGATLDSIIGE 184
EL + + +GLR +V AG TL+ ++ E
Sbjct: 93 AELARLPLILPSRGHGLRRLVDEAAARAGLTLN-VVVE 129
>gnl|CDD|38353 KOG3143, KOG3143, KOG3143, Imidazoleglycerol-phosphate dehydratase
[Amino acid transport and metabolism].
Length = 219
Score = 27.0 bits (59), Expect = 7.1
Identities = 21/60 (35%), Positives = 26/60 (43%), Gaps = 5/60 (8%)
Query: 176 ATLDSIIGEWHNPETHVIPL--AIKTAMGYQNSFKVFGQDYATRDGTCLRDYIHVLDLAN 233
AT D I + H E I L A K A+G + K FG A D R V+DL+
Sbjct: 80 ATGDLHIDDHHTNEDVAIALGTAFKKALGERKGVKRFGDFTAPLDEALSRV---VVDLSG 136
>gnl|CDD|133342 cd04142, RRP22, RRP22 subfamily. RRP22 (Ras-related protein on
chromosome 22) subfamily consists of proteins that
inhibit cell growth and promote caspase-independent cell
death. Unlike most Ras proteins, RRP22 is
down-regulated in many human tumor cells due to promoter
methylation. RRP22 localizes to the nucleolus in a
GTP-dependent manner, suggesting a novel function in
modulating transport of nucleolar components. Most Ras
proteins contain a lipid modification site at the
C-terminus, with a typical sequence motif CaaX, where a
= an aliphatic amino acid and X = any amino acid. Lipid
binding is essential for membrane attachment, a key
feature of most Ras proteins. Like most Ras family
proteins, RRP22 is farnesylated.
Length = 198
Score = 27.1 bits (60), Expect = 7.2
Identities = 14/55 (25%), Positives = 27/55 (49%), Gaps = 13/55 (23%)
Query: 281 ESRRIG--DPPSLVADNKKAKKILGWNPKYKLRDIIESAW-----------NWHL 322
E+R G +PP +V NK+ ++ + P++ L ++ +W NWH+
Sbjct: 108 ETRPAGNKEPPIVVVGNKRDQQRHRFAPRHVLSVLVRKSWKCGYLECSAKYNWHI 162
>gnl|CDD|30648 COG0300, DltE, Short-chain dehydrogenases of various substrate
specificities [General function prediction only].
Length = 265
Score = 26.8 bits (59), Expect = 7.8
Identities = 18/73 (24%), Positives = 29/73 (39%), Gaps = 8/73 (10%)
Query: 1 MENKNVLVVGGAGYIGAHTCRVLYERGFLPIVL----DNLSSGHAEFVLWGPLE----QV 52
M+ K L+ G + IGA + L RG+ I++ D L + E +E
Sbjct: 4 MKGKTALITGASSGIGAELAKQLARRGYNLILVARREDKLEALAKELEDKTGVEVEVIPA 63
Query: 53 DICDYTNLRAVFA 65
D+ D L +
Sbjct: 64 DLSDPEALERLED 76
>gnl|CDD|133452 cd05213, NAD_bind_Glutamyl_tRNA_reduct, NADP-binding domain of
glutamyl-tRNA reductase. Glutamyl-tRNA reductase
catalyzes the conversion of glutamyl-tRNA to
glutamate-1-semialdehyde, initiating the synthesis of
tetrapyrrole. Whereas tRNAs are generally associated
with peptide bond formation in protein translation, here
the tRNA activates glutamate in the initiation of
tetrapyrrole biosynthesis in archaea, plants and many
bacteria. In the first step, activated glutamate is
reduced to glutamate-1-semi-aldehyde via the NADPH
dependent glutamyl-tRNA reductase. Glutamyl-tRNA
reductase forms a V-shaped dimer. Each monomer has 3
domains: an N-terminal catalytic domain, a classic
nucleotide binding domain, and a C-terminal dimerization
domain. Although the representative structure 1GPJ lacks
a bound NADPH, a theoretical binding pocket has been
described. (PMID 11172694). Amino acid dehydrogenase
(DH)-like NAD(P)-binding domains are members of the
Rossmann fold superfamily and include glutamate,
leucine, and phenylalanine DHs, methylene
tetrahydrofolate DH, methylene-tetrahydromethanopterin
DH, methylene-tetrahydropholate DH/cyclohydrolase,
Shikimate DH-like proteins, malate oxidoreductases, and
glutamyl tRNA reductase. Amino acid DHs catalyze the
deamination of amino acids to keto acids with NAD(P)+ as
a cofactor. The NAD(P)-binding Rossmann fold superfamily
includes a wide variety of protein families including
NAD(P)- binding domains of alcohol DHs,
tyrosine-dependent oxidoreductases,
glyceraldehyde-3-phosphate DH, lactate/malate DHs,
formate/glycerate DHs, siroheme synthases,
6-phosphogluconate DH, amino acid DHs, repressor rex,
NAD-binding potassium channel domain, CoA-binding, and
ornithine cyclodeaminase-like domains. These domains
have an alpha-beta-alpha configuration. NAD binding
involves numerous hydrogen and van der Waals contacts.
Length = 311
Score = 26.8 bits (60), Expect = 8.5
Identities = 14/41 (34%), Positives = 21/41 (51%), Gaps = 1/41 (2%)
Query: 2 ENKNVLVVGGAGYIGAHTCRVLYERGFLPIVLDNLSSGHAE 42
+ K VLV+G AG +G + L +G I + N + AE
Sbjct: 177 KGKKVLVIG-AGEMGELAAKHLAAKGVAEITIANRTYERAE 216
>gnl|CDD|32328 COG2145, ThiM, Hydroxyethylthiazole kinase, sugar kinase family
[Coenzyme metabolism].
Length = 265
Score = 26.7 bits (59), Expect = 9.6
Identities = 14/71 (19%), Positives = 30/71 (42%), Gaps = 12/71 (16%)
Query: 221 CLRDYIHVLDLANA-------HIMA-----LEYLINQGDSIAINLGTGTGITVKEIISTI 268
C+ + + AN +MA +E D++ IN+GT + ++ + + I
Sbjct: 19 CITNDVVQNFTANGLLALGASPVMADAPEEVEEFAKIADALLINIGTLSAERIQAMRAAI 78
Query: 269 QSMYECAFPIT 279
++ E P+
Sbjct: 79 KAANESGKPVV 89
>gnl|CDD|146441 pfam03801, Ndc80_HEC, HEC/Ndc80p family. Members of this family
are components of the mitotic spindle. It has been shown
that Ndc80/HEC from yeast is part of a complex called
the Ndc80p complex. This complex is thought to bind to
the microtubules of the spindle.
Length = 159
Score = 26.5 bits (59), Expect = 9.8
Identities = 11/36 (30%), Positives = 17/36 (47%)
Query: 239 LEYLINQGDSIAINLGTGTGITVKEIISTIQSMYEC 274
LE+L+ G I I+ T T K+ I + +Y
Sbjct: 47 LEFLVENGFEIEISPKTLKSPTQKDFIEIFKFLYNR 82
Database: CddA
Posted date: Feb 4, 2011 9:38 PM
Number of letters in database: 6,263,737
Number of sequences in database: 21,609
Lambda K H
0.320 0.136 0.413
Gapped
Lambda K H
0.267 0.0752 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 21609
Number of Hits to DB: 4,089,495
Number of extensions: 215018
Number of successful extensions: 548
Number of sequences better than 10.0: 1
Number of HSP's gapped: 509
Number of HSP's successfully gapped: 50
Length of query: 333
Length of database: 6,263,737
Length adjustment: 94
Effective length of query: 239
Effective length of database: 4,232,491
Effective search space: 1011565349
Effective search space used: 1011565349
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 58 (26.1 bits)