RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy8218
(228 letters)
>gnl|CDD|213293 cd05927, LC-FACS_euk, Eukaryotic long-chain fatty acid CoA
synthetase (LC-FACS). The members of this family are
eukaryotic fatty acid CoA synthetases that activate
fatty acids with chain lengths of 12 to 20. LC-FACS
catalyzes the formation of fatty acyl-CoA in a two-step
reaction: the formation of a fatty acyl-AMP molecule as
an intermediate, and the formation of a fatty acyl-CoA.
This is a required step before free fatty acids can
participate in most catabolic and anabolic reactions.
Organisms tend to have multiple isoforms of LC-FACS
genes with multiple splice variants. For example, nine
genes are found in Arabidopsis and six genes are
expressed in mammalian cells.
Length = 539
Score = 336 bits (864), Expect = e-114
Identities = 108/226 (47%), Positives = 149/226 (65%), Gaps = 4/226 (1%)
Query: 1 MSTGRAGAPTTVCDIRLVDWDEGNYRVCDKPFPRGEILIGGSNISPGYYKNPTKTKEDFF 60
+TG G P C+I+LVD E Y D P PRGEI I G N+ GYYKNP KT E F
Sbjct: 318 TTTGHVGPPLPCCEIKLVDVPEMGYFAKDDP-PRGEICIRGPNVFKGYYKNPEKTAEAF- 375
Query: 61 DEEGKRWFRTGDIGELHQDGVIRIIDRKKDLVKLQAGEYVSLGKVEAELKTCPVIENICV 120
DE+G WF TGDIGE DG ++IIDRKK+L KL GEYV+L K+E K+ P+++ ICV
Sbjct: 376 DEDG--WFHTGDIGEWLPDGTLKIIDRKKNLFKLSQGEYVALEKLENIYKSSPLVDQICV 433
Query: 121 YGDSRKDYTVALVVPNPSRLKEIAEKLNLDSLTFQQLCTHPEIEKQVLVELQEHGKKSNL 180
YGDS K + VA+VVP+ L++ A + F++LC + +++K +L +L E GK++ L
Sbjct: 434 YGDSLKSFLVAIVVPDEDVLEKWAAENGGGGGDFEELCNNKKLKKAILKDLNEIGKENGL 493
Query: 181 ERFEIPNAVKLCSEVWSPDMGLVTAAFKLKRKDIQERYQHEINRMY 226
+ FEI A+ L E ++P+ GL+T FKLKR +++RY+ EI+ MY
Sbjct: 494 KGFEIVKAIHLTPEPFTPENGLLTPTFKLKRPQLKKRYKKEIDEMY 539
>gnl|CDD|215217 PLN02387, PLN02387, long-chain-fatty-acid-CoA ligase family
protein.
Length = 696
Score = 290 bits (744), Expect = 1e-94
Identities = 114/228 (50%), Positives = 153/228 (67%), Gaps = 1/228 (0%)
Query: 1 MSTGRAGAPTTVCDIRLVDWDEGNYRVCDKPFPRGEILIGGSNISPGYYKNPTKTKEDF- 59
S GR G P C ++LV W+EG Y + DKP PRGEI+IGG +++ GY+KN KT E +
Sbjct: 469 TSVGRVGPPLPCCYVKLVSWEEGGYLISDKPMPRGEIVIGGPSVTLGYFKNQEKTDEVYK 528
Query: 60 FDEEGKRWFRTGDIGELHQDGVIRIIDRKKDLVKLQAGEYVSLGKVEAELKTCPVIENIC 119
DE G RWF TGDIG+ H DG + IIDRKKD+VKLQ GEYVSLGKVEA L P ++NI
Sbjct: 529 VDERGMRWFYTGDIGQFHPDGCLEIIDRKKDIVKLQHGEYVSLGKVEAALSVSPYVDNIM 588
Query: 120 VYGDSRKDYTVALVVPNPSRLKEIAEKLNLDSLTFQQLCTHPEIEKQVLVELQEHGKKSN 179
V+ D Y VALVVP+ L++ A+K +D F +LC E K+V L + K +
Sbjct: 589 VHADPFHSYCVALVVPSQQALEKWAKKAGIDYSNFAELCEKEEAVKEVQQSLSKAAKAAR 648
Query: 180 LERFEIPNAVKLCSEVWSPDMGLVTAAFKLKRKDIQERYQHEINRMYA 227
LE+FEIP +KL E W+P+ GLVTAA KLKR+ I+++++ ++ ++Y
Sbjct: 649 LEKFEIPAKIKLLPEPWTPESGLVTAALKLKREQIRKKFKDDLKKLYE 696
>gnl|CDD|178337 PLN02736, PLN02736, long-chain acyl-CoA synthetase.
Length = 651
Score = 190 bits (485), Expect = 7e-57
Identities = 87/226 (38%), Positives = 125/226 (55%), Gaps = 3/226 (1%)
Query: 2 STGRAGAPTTVCDIRLVDWDEGNYRVCDKPFPRGEILIGGSNISPGYYKNPTKTKEDFFD 61
+G G+P C+++LVD E NY D+P+PRGEI + G I GYYK+ +T+E D
Sbjct: 426 LSGHVGSPNPACEVKLVDVPEMNYTSEDQPYPRGEICVRGPIIFKGYYKDEVQTRE-VID 484
Query: 62 EEGKRWFRTGDIGELHQDGVIRIIDRKKDLVKLQAGEYVSLGKVEAELKTCPVIENICVY 121
E+G W TGDIG G ++IIDRKK++ KL GEY++ K+E C + VY
Sbjct: 485 EDG--WLHTGDIGLWLPGGRLKIIDRKKNIFKLAQGEYIAPEKIENVYAKCKFVAQCFVY 542
Query: 122 GDSRKDYTVALVVPNPSRLKEIAEKLNLDSLTFQQLCTHPEIEKQVLVELQEHGKKSNLE 181
GDS VA+VV +P LK A + +QLC P + VL ++ G+++ L
Sbjct: 543 GDSLNSSLVAVVVVDPEVLKAWAASEGIKYEDLKQLCNDPRVRAAVLADMDAVGREAQLR 602
Query: 182 RFEIPNAVKLCSEVWSPDMGLVTAAFKLKRKDIQERYQHEINRMYA 227
FE AV L E ++ + GL+T FK+KR + + I+ MYA
Sbjct: 603 GFEFAKAVTLVPEPFTVENGLLTPTFKVKRPQAKAYFAKAISDMYA 648
>gnl|CDD|223953 COG1022, FAA1, Long-chain acyl-CoA synthetases (AMP-forming) [Lipid
metabolism].
Length = 613
Score = 185 bits (472), Expect = 3e-55
Identities = 84/217 (38%), Positives = 123/217 (56%), Gaps = 15/217 (6%)
Query: 21 DEGNYRVCDKPFP--------RGEILIGGSNISPGYYKNPTKTKEDFFDEEGKRWFRTGD 72
D KP P GEIL+ G N+ GYYKNP T E F E+G WFRTGD
Sbjct: 399 DRFVLGTVGKPLPGIEVKIADDGEILVRGPNVMKGYYKNPEATAE-AFTEDG--WFRTGD 455
Query: 73 IGELHQDGVIRIIDRKKDLVKLQAGEYVSLGKVEAELKTCPVIENICVYGDSRKDYTVAL 132
+GEL +DG + I RKK+L+KL G+ ++ +E++L P+IE ICV GD +K + VAL
Sbjct: 456 LGELDEDGYLVITGRKKELIKLSNGKNIAPEPIESKLAKSPLIEQICVVGDDKK-FLVAL 514
Query: 133 VVPNPSRLKEIAEKLNLD-SLTFQQLCTHPEIEKQVLVELQEHGKKSNLERFEIPNAVKL 191
+VP+ L++ AE LN S + ++L P++ K +L + + K+ L FE L
Sbjct: 515 IVPDFDALEKWAESLNKVISASREELARDPKLLKLILPRVNKGNKR--LFGFEQIKKFVL 572
Query: 192 CSEVWSPDMGLVTAAFKLKRKDIQERYQHEINRMYAS 228
+ ++P+ G +T KLKR I +RY+ EI +Y+
Sbjct: 573 LPKEFTPENGELTPTLKLKRHVILDRYKDEIEAVYSR 609
>gnl|CDD|178049 PLN02430, PLN02430, long-chain-fatty-acid-CoA ligase.
Length = 660
Score = 174 bits (442), Expect = 1e-50
Identities = 93/224 (41%), Positives = 134/224 (59%), Gaps = 6/224 (2%)
Query: 4 GRAGAPTTVCDIRLVDWDEGNYRVCDKPFPRGEILIGGSNISPGYYKNPTKTKEDFFDEE 63
G GAP ++RL + E Y +P PRGEI + G + GYYKNP T+E D
Sbjct: 436 GTVGAPAVYNELRLEEVPEMGYDPLGEP-PRGEICVRGKCLFSGYYKNPELTEEVMKDG- 493
Query: 64 GKRWFRTGDIGELHQDGVIRIIDRKKDLVKLQAGEYVSLGKVEAELKTCPVIENICVYGD 123
WF TGDIGE+ +GV++IIDRKK+L+KL GEYV+L +E P++E+I VYGD
Sbjct: 494 ---WFHTGDIGEILPNGVLKIIDRKKNLIKLSQGEYVALEYLENVYGQNPIVEDIWVYGD 550
Query: 124 SRKDYTVALVVPNPSRLKEIAEKLNLDSLTFQQLCTHPEIEKQVLVELQEHGKKSNLERF 183
S K VA+VVPN + A K N + +F++LC+ PE+++ +L EL+ +K+ L F
Sbjct: 551 SFKSMLVAVVVPNEENTNKWA-KDNGFTGSFEELCSLPELKEHILSELKSTAEKNKLRGF 609
Query: 184 EIPNAVKLCSEVWSPDMGLVTAAFKLKRKDIQERYQHEINRMYA 227
E V L ++ + + LVTA K +R ++ + YQ EI+ MY
Sbjct: 610 EYIKGVILETKPFDVERDLVTATLKKRRNNLLKYYQVEIDEMYR 653
>gnl|CDD|213275 cd05907, VL_LC_FACS_like, Long-chain fatty acid CoA synthetases and
Bubblegum-like very long-chain fatty acid CoA
synthetases. This family includes long-chain fatty acid
(C12-C20) CoA synthetases and Bubblegum-like very
long-chain (>C20) fatty acid CoA synthetases. FACS
catalyzes the formation of fatty acyl-CoA in a two-step
reaction: the formation of a fatty acyl-AMP molecule as
an intermediate, and the formation of a fatty acyl-CoA.
Eukaryotes generally have multiple isoforms of LC-FACS
genes with multiple splice variants. For example, nine
genes are found in Arabidopsis and six genes are
expressed in mammalian cells. Drosophila melanogaster
mutant bubblegum (BGM) has elevated levels of
very-long-chain fatty acids (VLCFA) caused by a
defective gene later named bubblegum. The human homolog
(hsBG) of bubblegum has been characterized as a very
long chain fatty acid CoA synthetase that functions
specifically in the brain; hsBG may play a central role
in brain VLCFA metabolism and myelinogenesis. Free fatty
acids must be "activated" to their CoA thioesters before
participating in most catabolic and anabolic reactions.
Length = 456
Score = 164 bits (418), Expect = 2e-48
Identities = 72/211 (34%), Positives = 104/211 (49%), Gaps = 20/211 (9%)
Query: 4 GRAGAPTTVCDIRLVDWDEGNYRVCDKPFPRGEILIGGSNISPGYYKNPTKTKEDFFDEE 63
G G P ++R+ + GEIL+ G N+ GYYKNP T E DE+
Sbjct: 266 GTVGKPLPGVEVRIAE--------------DGEILVRGPNVMKGYYKNPEATAEAL-DED 310
Query: 64 GKRWFRTGDIGELHQDGVIRIIDRKKDLVKLQAGEYVSLGKVEAELKTCPVIENICVYGD 123
G W TGDIG L +DG + I DRKKDL+ G+ ++ +E LK P I V GD
Sbjct: 311 G--WLHTGDIGRLDEDGFLVITDRKKDLIVTAGGKNIAPQPIENALKASPYISQAVVVGD 368
Query: 124 SRKDYTVALVVPNPSRLKEIAEKLNLDSLTFQQLCTHPEIEKQVLVELQEHGKKSNLERF 183
R Y AL+V +P L++ AE+ L T+ L PE+ + E++E + L R
Sbjct: 369 DRP-YLTALIVLDPEALEKWAEQHGLPFTTYADLAEDPEVRALIRKEVEEANAR--LARV 425
Query: 184 EIPNAVKLCSEVWSPDMGLVTAAFKLKRKDI 214
E L + +S + G +T KL+R+ +
Sbjct: 426 EQIKKFVLLPKEFSIEDGELTPTMKLRRRVV 456
>gnl|CDD|240316 PTZ00216, PTZ00216, acyl-CoA synthetase; Provisional.
Length = 700
Score = 163 bits (415), Expect = 1e-46
Identities = 77/229 (33%), Positives = 124/229 (54%), Gaps = 8/229 (3%)
Query: 1 MSTGRAGAPTTVCDIRLVDWDEGNYRVCDKPFPRGEILIGGSNISPGYYKNPTKTKEDFF 60
+ G +++L+D +E Y+ D P PRGEIL+ G + GYYK T+E
Sbjct: 476 LEPNAVGQLLKGVEMKLLDTEE--YKHTDTPEPRGEILLRGPFLFKGYYKQEELTRE-VL 532
Query: 61 DEEGKRWFRTGDIGELHQDGVIRIIDRKKDLVKLQAGEYVSLGKVEAELKTCPVIEN--I 118
DE+G WF TGD+G + +G +RII R K L K GEY++L +EA ++ +
Sbjct: 533 DEDG--WFHTGDVGSIAANGTLRIIGRVKALAKNCLGEYIALEALEALYGQNELVVPNGV 590
Query: 119 CVYGDSRKDYTVALVVPNPSRLKEIAEKLNLDSLTFQQLCTHPEIEKQVLVELQEHGKKS 178
CV + Y ALV+ + ++ A++ ++ + + PE +K+ LQE + +
Sbjct: 591 CVLVHPARSYICALVLTDEAKAMAFAKEHGIEG-EYPAILKDPEFQKKATESLQETARAA 649
Query: 179 NLERFEIPNAVKLCSEVWSPDMGLVTAAFKLKRKDIQERYQHEINRMYA 227
+ FEI V++ S+ W+P+ G++TAA KLKR+ I ERY I ++A
Sbjct: 650 GRKSFEIVRHVRVLSDEWTPENGVLTAAMKLKRRVIDERYADLIKELFA 698
>gnl|CDD|178452 PLN02861, PLN02861, long-chain-fatty-acid-CoA ligase.
Length = 660
Score = 158 bits (402), Expect = 4e-45
Identities = 83/224 (37%), Positives = 122/224 (54%), Gaps = 6/224 (2%)
Query: 4 GRAGAPTTVCDIRLVDWDEGNYRVCDKPFPRGEILIGGSNISPGYYKNPTKTKEDFFDEE 63
G G P T + RL E Y PRGEI + G+ + GY+K T+E D
Sbjct: 436 GTVGVPMTTIEARLESVPEMGYDALSD-VPRGEICLRGNTLFSGYHKRQDLTEEVLIDG- 493
Query: 64 GKRWFRTGDIGELHQDGVIRIIDRKKDLVKLQAGEYVSLGKVEAELKTCPVIENICVYGD 123
WF TGDIGE +G ++IIDRKK++ KL GEYV++ +E CP+I +I VYG+
Sbjct: 494 ---WFHTGDIGEWQPNGAMKIIDRKKNIFKLSQGEYVAVENLENTYSRCPLIASIWVYGN 550
Query: 124 SRKDYTVALVVPNPSRLKEIAEKLNLDSLTFQQLCTHPEIEKQVLVELQEHGKKSNLERF 183
S + + VA+VVP+ L++ A N F+ LC + + K +L EL GKK L F
Sbjct: 551 SFESFLVAVVVPDRQALEDWAANNNKTG-DFKSLCKNLKARKYILDELNSTGKKLQLRGF 609
Query: 184 EIPNAVKLCSEVWSPDMGLVTAAFKLKRKDIQERYQHEINRMYA 227
E+ A+ L + + L+T FKLKR + + Y+ I+++Y+
Sbjct: 610 EMLKAIHLEPNPFDIERDLITPTFKLKRPQLLKYYKDCIDQLYS 653
>gnl|CDD|166255 PLN02614, PLN02614, long-chain acyl-CoA synthetase.
Length = 666
Score = 128 bits (324), Expect = 3e-34
Identities = 78/225 (34%), Positives = 116/225 (51%), Gaps = 6/225 (2%)
Query: 4 GRAGAPTTVCDIRLVDWDEGNYRVCDKPFPRGEILIGGSNISPGYYKNPTKTKEDFFDEE 63
G G P DIRL E Y PRGEI I G + GYYK TKE D
Sbjct: 439 GTVGPPVPNVDIRLESVPEMEYDALAST-PRGEICIRGKTLFSGYYKREDLTKEVLIDG- 496
Query: 64 GKRWFRTGDIGELHQDGVIRIIDRKKDLVKLQAGEYVSLGKVEAELKTCPVIENICVYGD 123
W TGD+GE +G ++IIDRKK++ KL GEYV++ +E ++++ VYG+
Sbjct: 497 ---WLHTGDVGEWQPNGSMKIIDRKKNIFKLSQGEYVAVENIENIYGEVQAVDSVWVYGN 553
Query: 124 SRKDYTVALVVPNPSRLKEIAEKLNLDSLTFQQLCTHPEIEKQVLVELQEHGKKSNLERF 183
S + + VA+ PN L+ A + N S + LC + + ++ +L EL + K+ ++ F
Sbjct: 554 SFESFLVAIANPNQQILERWAAE-NGVSGDYNALCQNEKAKEFILGELVKMAKEKKMKGF 612
Query: 184 EIPNAVKLCSEVWSPDMGLVTAAFKLKRKDIQERYQHEINRMYAS 228
EI A+ L + + L+T FK KR + + YQ I+ MY +
Sbjct: 613 EIIKAIHLDPVPFDMERDLLTPTFKKKRPQLLKYYQSVIDEMYKT 657
>gnl|CDD|213298 cd05932, LC_FACS_bac, Bacterial long-chain fatty acid CoA
synthetase (LC-FACS), including Marinobacter
hydrocarbonoclasticus isoprenoid Coenzyme A synthetase.
The members of this family are bacterial long-chain
fatty acid CoA synthetases. Marinobacter
hydrocarbonoclasticus isoprenoid Coenzyme A synthetase
in this family is involved in the synthesis of
isoprenoid wax ester storage compounds when grown on
phytol as the sole carbon source. LC-FACS catalyzes the
formation of fatty acyl-CoA in a two-step reaction: the
formation of a fatty acyl-AMP molecule as an
intermediate, and the formation of a fatty acyl-CoA.
Free fatty acids must be "activated" to their CoA
thioesters before participating in most catabolic and
anabolic reactions.
Length = 504
Score = 124 bits (313), Expect = 4e-33
Identities = 64/187 (34%), Positives = 90/187 (48%), Gaps = 28/187 (14%)
Query: 35 GEILIGGSNISPGYYKNPTKTKEDFFDEEGKRWFRTGDIGELHQDGVIRIIDRKKDLVKL 94
GEILI + GYYK+P KT E F E+G W TGD GE+ +DG ++I R K+L K
Sbjct: 343 GEILIRSPGLMSGYYKDPEKTAEAF-TEDG--WLHTGDKGEIDEDGFLKITGRVKELFKT 399
Query: 95 QAGEYVSLGKVEAELKTCPVIENICVYGDSRKDYTVALVVPNPS-RLKEIAEKLNLDSLT 153
G+YV+ +E L P +E +CV G VALVV + + E E
Sbjct: 400 SKGKYVAPAPIENLLSANPHVEQVCVVGSGLPQ-PVALVVLSEAADPLEREE-------- 450
Query: 154 FQQLCTHPEIEKQVLVELQEHGKKSN--LERFEIPNAVKLCSEVWSPDMGLVTAAFKLKR 211
V LQ+ K N LE E +A+ + E W+ + GL+T K+KR
Sbjct: 451 -------------VEESLQQTLAKVNSELESHERLSALVVVKEAWTVENGLLTPTLKIKR 497
Query: 212 KDIQERY 218
+++ Y
Sbjct: 498 NVVEKHY 504
>gnl|CDD|215954 pfam00501, AMP-binding, AMP-binding enzyme.
Length = 412
Score = 114 bits (288), Expect = 5e-30
Identities = 40/120 (33%), Positives = 62/120 (51%), Gaps = 6/120 (5%)
Query: 1 MSTGRAGAPTTVCDIRLVDWDEGNYRVCDKPFPRGEILIGGSNISPGYYKNPTKTKEDFF 60
+ G G P +++++D +EG P GE+ I G ++ GY +P T E F
Sbjct: 299 VKPGSIGRPLPGVEVKVLD-EEGEP---VPPGEVGELCIRGPGVARGYLNDPELTAERF- 353
Query: 61 DEEGKRWFRTGDIGELHQDGVIRIIDRKKDLVKLQAGEYVSLGKVEAELKTCPVIENICV 120
E+G +RTGD+G +DG + I+ RK D VK+ GE + G++EA L P + V
Sbjct: 354 VEDGWGMYRTGDLGRWDEDGYLEILGRKDDQVKI-RGERIEPGEIEAVLLEHPGVAEAAV 412
>gnl|CDD|223395 COG0318, CaiC, Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases
II [Lipid metabolism / Secondary metabolites
biosynthesis, transport, and catabolism].
Length = 534
Score = 108 bits (272), Expect = 2e-27
Identities = 65/227 (28%), Positives = 95/227 (41%), Gaps = 41/227 (18%)
Query: 4 GRAGAPTTVCDIRLVDWDEGNYRVCDKPFPRGEILIGGSNISPGYYKNPTKTKEDFFDEE 63
G G P ++R+VD D G P GEI + G N+ GY+ P T E FDE+
Sbjct: 346 GSVGRPLPGVEVRIVDPDGGE----VLPGEVGEIWVRGPNVMKGYWNRPEATAE-AFDED 400
Query: 64 GKRWFRTGDIGELHQDGVIRIIDRKKDLVKLQAGEYVSLGKVEAELKTCPVIENICVYG- 122
G W RTGD+G + +DG + I+ R KDL+ GE + ++EA L P + V G
Sbjct: 401 G--WLRTGDLGYVDEDGYLYIVGRLKDLIIS-GGENIYPEEIEAVLAEHPAVAEAAVVGV 457
Query: 123 -DSRKDYT-VALVVPNPSRLKEIAEKLNLDSLTFQQLCTHPEIEKQVLVELQEHGKKSNL 180
D R VA+VV P + EL+ +K L
Sbjct: 458 PDERWGERVVAVVVLKP-----------------------GGDAELTAEELRAFLRK-RL 493
Query: 181 ERFEIPNAVKLCSEVWSPDMGLVTAAFKLKRKDIQERYQHEINRMYA 227
+++P V D TA+ K+ R+ ++E Y+ E
Sbjct: 494 ALYKVP------RIVVFVDELPRTASGKIDRRALREEYRAEPRLDLK 534
>gnl|CDD|213310 cd05945, DltA, D-alanine:D-alanyl carrier protein ligase (DltA).
DltA belongs to the class I AMP-forming adenylation
domain superfamily, which also includes acetyl-CoA
synthetase, luciferase, and the adenylation domains of
non-ribosomal peptide synthetases. It catalyzes the two-step
activation reaction of D-alanine: the formation of a
substrate-AMP molecule as an intermediate, and then the
transfer of the amino acid adenylate to teichoic acid in
the biosynthesis of lipoteichoic acid (LTA) and wall
teichoic acid (WTA) in gram-positive bacteria.
Length = 447
Score = 104 bits (263), Expect = 3e-26
Identities = 49/146 (33%), Positives = 74/146 (50%), Gaps = 10/146 (6%)
Query: 7 GAPTTVCDIRLVDWDEGNYRVCDKPFPRGEILIGGSNISPGYYKNPTKTKEDFFDEEGKR 66
G P ++D ++G P GE++I G +SPGY NP KT + FF +EG+R
Sbjct: 275 GYPKPGLRALILD-EDGR---PVPPGEEGELVIAGPQVSPGYLNNPEKTAKAFFQDEGQR 330
Query: 67 WFRTGDIGELHQDGVIRIIDRKKDLVKLQAGEYVSLGKVEAELKTCPVIENICV---YGD 123
W+RTGD+ L DG++ + RK +KL G + L ++EA L+ P +E V
Sbjct: 331 WYRTGDLVYLEDDGLLVYLGRKDFQIKLH-GYRIELEEIEAALRALPGVEEAVVLPVPKG 389
Query: 124 SRKDYTVALVVPNPSRL--KEIAEKL 147
+ VA VV L K + ++L
Sbjct: 390 EKVVRLVAFVVGKEGALDTKALKKEL 415
>gnl|CDD|213284 cd05917, FACL_like_2, Uncharacterized subfamily of fatty acid CoA
ligase (FACL). Fatty acyl-CoA ligases catalyze the
ATP-dependent activation of fatty acids in a two-step
reaction. The carboxylate substrate first reacts with
ATP to form an acyl-adenylate intermediate, which then
reacts with CoA to produce an acyl-CoA ester. This is a
required step before free fatty acids can participate in
most catabolic and anabolic reactions.
Length = 347
Score = 93.0 bits (232), Expect = 3e-22
Identities = 60/217 (27%), Positives = 93/217 (42%), Gaps = 47/217 (21%)
Query: 3 TGRAGAPTTVCDIRLVDWDEGNYRVCDKPFP---RGEILIGGSNISPGYYKNPTKTKEDF 59
G G P ++++VD +G P GEI + G ++ GYY +P T E
Sbjct: 172 PGTVGRPLPGVEVKIVD-PDGG------EVPPGEVGEICVRGYSVMKGYYNDPEATAEAI 224
Query: 60 FDEEGKRWFRTGDIGELHQDGVIRIIDRKKDLVKLQAGEYVSLGKVEAELKTCPVIENIC 119
D +G W TGD+G + +DG +RI+ R KD++ + GE + ++E L T P +
Sbjct: 225 -DADG--WLHTGDLGYMDEDGYLRIVGRIKDMII-RGGENIYPAEIEEALLTHPAVAEAA 280
Query: 120 VYG--DSRKDYTVALVVPNPSRLKEIAEKLNLDSLTFQQLCTHPEIEKQVLVELQEHGKK 177
V G D R VA V LK A LT + EL +
Sbjct: 281 VVGVPDERLGEVVAAFV----VLKPGAT------LTEE--------------ELIAF-CR 315
Query: 178 SNLERFEIPNAVKLCSEVWSPDMGLVTAAFKLKRKDI 214
L RF++P V+ E+ TA+ K+++ +
Sbjct: 316 GRLARFKVPRYVRFVDELP------RTASGKIQKFKL 346
>gnl|CDD|213307 cd05941, MCS, Malonyl-CoA synthetase (MCS). MCS catalyzes the
formation of malonyl-CoA in a two-step reaction
consisting of the adenylation of malonate with ATP,
followed by malonyl transfer from malonyl-AMP to CoA.
Malonic acid and its derivatives are the building blocks
of polyketides and malonyl-CoA serves as the substrate
of polyketide synthases. Malonyl-CoA synthetase has
broad substrate tolerance and can activate a variety of
malonic acid derivatives. MCS may play an important role
in biosynthesis of polyketides, the important secondary
metabolites with therapeutic and agrochemical utility.
Length = 430
Score = 91.1 bits (227), Expect = 2e-21
Identities = 58/204 (28%), Positives = 85/204 (41%), Gaps = 35/204 (17%)
Query: 4 GRAGAPTTVCDIRLVDWDEGNYRVCDKPFPRGEILIGGSNISPGYYKNPTKTKEDFFDEE 63
G G P ++R+VD D G + GEI + G N+ Y+ P T E F E+
Sbjct: 258 GTVGLPLPGVEVRIVDEDGGEPLAAGEV---GEIQVRGPNVFSEYWNKPEATAEAF-TED 313
Query: 64 GKRWFRTGDIGELHQDGVIRIIDRKKDLVKLQAGEYVSLGKVEAELKTCPVIENICVYGD 123
G WF+TGD+G + +DG RI+ RK D + G VS ++E L P + + V G
Sbjct: 314 G--WFKTGDVGVVDEDGYYRILGRKSDDIIKSGGYKVSALEIEEALLEHPGVAEVAVIGV 371
Query: 124 SRKDY---TVALVVPNPSRLKEIAEKLNLDSLTFQQLCTHPEIEKQVLVELQEHGKKSNL 180
D+ VA+VVP +LT +L + L
Sbjct: 372 PDPDWGEAVVAVVVPEAGA-----------ALTLAELRAW---------------AREKL 405
Query: 181 ERFEIPNAVKLCSEVWSPDMGLVT 204
++IP V + E+ MG V
Sbjct: 406 APYKIPKRVIVVDELPRNAMGKVQ 429
>gnl|CDD|236315 PRK08633, PRK08633, 2-acyl-glycerophospho-ethanolamine
acyltransferase; Validated.
Length = 1146
Score = 90.8 bits (226), Expect = 5e-21
Identities = 47/151 (31%), Positives = 66/151 (43%), Gaps = 22/151 (14%)
Query: 15 IRLVDWDEGNYRVCDKPFPRGE---ILIGGSNISPGYYKNPTKTKEDFFDEEGKRWFRTG 71
+R+VD + + P GE ILIGG + GY +P KT E D +G W+ TG
Sbjct: 971 VRIVDPETF------EELPPGEDGLILIGGPQVMKGYLGDPEKTAEVIKDIDGIGWYVTG 1024
Query: 72 DIGELHQDGVIRIIDRKKDLVKLQAGEYVSLGKVEAELKTCPVIEN----ICVYG--DSR 125
D G L +DG + I DR K+ GE V LG VE EL + V D +
Sbjct: 1025 DKGHLDEDGFLTITDRYSRFAKI-GGEMVPLGAVEEELAK--ALGGEEVVFAVTAVPDEK 1081
Query: 126 KDYTVALVVPN----PSRLKEIAEKLNLDSL 152
K + ++ LK ++ L +L
Sbjct: 1082 KGEKLVVLHTCGAEDVEELKRAIKESGLPNL 1112
>gnl|CDD|213270 cd04433, AFD_class_I, Adenylate forming domain, Class I. This
family includes acyl- and aryl-CoA ligases, as well as
the adenylation domain of nonribosomal peptide
synthetases and firefly luciferases. The
adenylate-forming enzymes catalyze an ATP-dependent
two-step reaction to first activate a carboxylate
substrate as an adenylate and then transfer the
carboxylate to the pantetheine group of either coenzyme
A or an acyl-carrier protein. The active site of the
domain is located at the interface of a large N-terminal
subdomain and a smaller C-terminal subdomain.
Length = 338
Score = 87.0 bits (216), Expect = 3e-20
Identities = 47/135 (34%), Positives = 70/135 (51%), Gaps = 11/135 (8%)
Query: 6 AGAPTTVCDIRLVDWDEGNYRVCDKPFPRGEILIGGSNISPGYYKNPTKTKEDFFDEEGK 65
G P ++R+VD +EG P GE+++ G + GY+ NP +T E+G
Sbjct: 171 VGRPVPGVEVRVVD-EEGKPL---PPGEVGELVVRGPWVMKGYWNNPPETTA-AATEDG- 224
Query: 66 RWFRTGDIGELHQDGVIRIIDRKKDLVKLQAGEYVSLGKVEAELKTCPVIENICVYG--- 122
W+RTGD+G L ++G + I R KDL+K+ GE V +VE+ L P + V G
Sbjct: 225 -WYRTGDLGYLDEEGYLYITGRSKDLIKV-GGENVYPAEVESVLLQHPAVAEAAVVGVPD 282
Query: 123 DSRKDYTVALVVPNP 137
+ R + VA VV P
Sbjct: 283 EDRGERIVAFVVLRP 297
>gnl|CDD|213271 cd05903, CHC_CoA_lg, Cyclohexanecarboxylate-CoA ligase (also called
cyclohex-1-ene-1-carboxylate:CoA ligase).
Cyclohexanecarboxylate-CoA ligase activates the
aliphatic ring compound, cyclohexanecarboxylate, for
degradation. It catalyzes the synthesis of
cyclohexanecarboxylate-CoA thioesters in a two-step
reaction involving the formation of
cyclohexanecarboxylate-AMP anhydride, followed by the
nucleophilic substitution of AMP by CoA.
Length = 437
Score = 86.1 bits (214), Expect = 1e-19
Identities = 54/163 (33%), Positives = 77/163 (47%), Gaps = 32/163 (19%)
Query: 4 GRAGAPTTVCDIRLVDWDEGNYRVCDKPFPR---GEILIGGSNISPGYYKNPTKTKEDFF 60
GR ++R+VD D+GN P GE+ + G + GY P T+ F
Sbjct: 265 GRPVPG---VEVRIVD-DDGN------DVPPGEEGELQVRGPQLFLGYLDPPDNTEA--F 312
Query: 61 DEEGKRWFRTGDIGELHQDGVIRIIDRKKDLVKLQAGEYVSLGKVEAELKTCPVIENICV 120
++G WFRTGD+G L DG +RI RKKD++ ++ GE +S ++E L P + + V
Sbjct: 313 TDDG--WFRTGDLGRLDADGYLRITGRKKDII-IRGGENISAREIEDLLLRHPAVAEVAV 369
Query: 121 YG--DSRKDYTV-ALVVPNPSRLKEIAEKLNLDSLTFQQLCTH 160
D R V A+VVP P SLT +L H
Sbjct: 370 VAMPDERLGERVCAVVVPAPGA-----------SLTLAELTEH 401
>gnl|CDD|213301 cd05935, LC_FACS_like, Putative long-chain fatty acid CoA ligase.
The members of this family are putative long-chain fatty
acyl-CoA synthetases, which catalyze the ATP-dependent
activation of fatty acids in a two-step reaction. The
carboxylate substrate first reacts with ATP to form an
acyl-adenylate intermediate, which then reacts with CoA
to produce an acyl-CoA ester. Fatty acyl-CoA synthetases
are responsible for fatty acid degradation as well as
physiological regulation of cellular functions via the
production of fatty acyl-CoA esters.
Length = 430
Score = 85.5 bits (212), Expect = 2e-19
Identities = 48/142 (33%), Positives = 73/142 (51%), Gaps = 7/142 (4%)
Query: 7 GAPTTVCDIRLVDWDEGNYRVCDKPFPRGEILIGGSNISPGYYKNPTKTKEDFFDEEGKR 66
G P D R++D + G P GEI++ G + GY+ P T E F + +GKR
Sbjct: 254 GIPVFGTDARVIDPETGEEL---PPGEEGEIVVRGPQVFKGYWNRPEATAESFIELDGKR 310
Query: 67 WFRTGDIGELHQDGVIRIIDRKKDLVKLQAGEYVSLGKVEAELKTCPVIENICVYG--DS 124
+FRTGD+G + ++G +DR K ++ + +G V +VEA L P + +CV G D
Sbjct: 311 FFRTGDLGYIDEEGYFFFLDRVKRMINV-SGYKVWPAEVEALLYQHPAVLEVCVIGRPDP 369
Query: 125 RKDYTV-ALVVPNPSRLKEIAE 145
R+ V A VV P ++ E
Sbjct: 370 RRGEVVKAFVVLKPDYRGKVTE 391
>gnl|CDD|240370 PTZ00342, PTZ00342, acyl-CoA synthetase; Provisional.
Length = 746
Score = 85.2 bits (211), Expect = 4e-19
Identities = 68/222 (30%), Positives = 108/222 (48%), Gaps = 22/222 (9%)
Query: 2 STGRAGAPTTVCDIRLVDWDEGNYRVCDKPFPRGEILIGGSNISPGYYKNPTKTKEDFFD 61
S G +P T ++ W+ Y+ D P+GE+LI +I GY+ +TK F +
Sbjct: 514 SIGGPISPNT--KYKVRTWE--TYKATDTL-PKGELLIKSDSIFSGYFLEKEQTKNAFTE 568
Query: 62 EEGKRWFRTGDIGELHQDGVIRIIDRKKDLVKLQAGEYVSLGKVEAELKTCPVIENICVY 121
+ +F+TGDI +++++G + +DR K LVKL GEY+ + I VY
Sbjct: 569 D---GYFKTGDIVQINKNGSLTFLDRSKGLVKLSQGEYIETDMLNNLYSQISFINFCVVY 625
Query: 122 GDSRKDYTVALVVPNPSR----LKE--------IAEKLNLDSLTFQQLCTHPEIEKQVLV 169
GD D +A++ + LK+ I EK L+ LT + + + V
Sbjct: 626 GDDSMDGPLAIISVDKYLLFKCLKDDNMLESTGINEKNYLEKLTDETI-NNNIYVDYVKG 684
Query: 170 ELQEHGKKSNLERFEIPNAVKLCSEVWSPDMGLVTAAFKLKR 211
++ E KK+NL R+ I N + L S+VW + L T FK+KR
Sbjct: 685 KMLEVYKKTNLNRYNIINDIYLTSKVWDTNNYL-TPTFKVKR 725
>gnl|CDD|213322 cd12114, A_NRPS_TlmIV_like, The adenylation domain of nonribosomal
peptide synthetases (NRPS), including Streptoalloteichus
tallysomycin biosynthesis genes. The adenylation (A)
domain of NRPS recognizes a specific amino acid or
hydroxy acid and activates it as an (amino) acyl
adenylate by hydrolysis of ATP. The activated acyl
moiety then forms a thioester to the enzyme-bound
cofactor phosphopantetheine of a peptidyl carrier
protein domain. NRPSs are large multifunctional enzymes
which synthesize many therapeutically useful peptides in
bacteria and fungi via a template-directed, nucleic acid
independent nonribosomal mechanism. These natural
products include antibiotics, immunosuppressants, plant
and animal toxins, and enzyme inhibitors. NRPS has a
distinct modular structure in which each module is
responsible for the recognition, activation, and in some
cases, modification of a single amino acid residue of
the final peptide product. The modules can be subdivided
into domains that catalyze specific biochemical
reactions. This family includes the TLM biosynthetic
gene cluster from Streptoalloteichus that consists of
nine NRPS genes; the N-terminal module of TlmVI (NRPS-5)
and the starter module of BlmVI (NRPS-5) are comprised
of the acyl CoA ligase (AL) and acyl carrier protein
(ACP)-like domains, which are thought to be involved in
the biosynthesis of the beta-aminoalaninamide moiety.
Length = 476
Score = 84.2 bits (209), Expect = 7e-19
Identities = 38/106 (35%), Positives = 53/106 (50%), Gaps = 4/106 (3%)
Query: 35 GEILIGGSNISPGYYKNPTKTKEDFF-DEEGKRWFRTGDIGELHQDGVIRIIDRKKDLVK 93
GE+ IGG ++ GY+ +P T E F G+R +RTGD+G DG I + R VK
Sbjct: 327 GELYIGGVGVALGYWGDPELTAERFITHRTGERLYRTGDLGRYRPDGTIEFLGRADHQVK 386
Query: 94 LQAGEYVSLGKVEAELKTCPVIENICV--YGDSRKDYTVALVVPNP 137
++ G V LG++EA L P ++ V GD A VV
Sbjct: 387 IR-GYRVELGEIEAALARHPGVQRAVVVVVGDGGAKALAAFVVAED 431
>gnl|CDD|213279 cd05911, Firefly_Luc_like, Firefly luciferase of light emitting
insects and 4-Coumarate-CoA Ligase (4CL). This family
contains two functionally unique groups of proteins; one
group is insect firefly luciferases and the other is
plant 4-coumarate:coenzyme A ligases. However, they
share significant sequence similarity in spite of their
functional diversity. Luciferase catalyzes the
production of light in the presence of MgATP, molecular
oxygen, and luciferin. In the first step, luciferin is
activated by acylation of its carboxylate group with
ATP, resulting in an enzyme-bound luciferyl adenylate.
In the second step, luciferyl adenylate reacts with
molecular oxygen, producing an enzyme-bound excited
state product (Luc=O*) and releasing AMP. This
excited-state product then decays to the ground state
(Luc=O), emitting a quantum of visible light.
4-coumarate:coenzyme A ligase is a key enzyme in the
phenylpropanoid metabolic pathway for monolignol and
flavonoid biosynthesis. It catalyzes the synthesis of
hydroxycinnamate-CoA thioesters in a two-step reaction,
involving the formation of hydroxycinnamate-AMP
anhydride and then the nucleophilic substitution of AMP
by CoA. The phenylpropanoid pathway is one of the most
important secondary metabolism pathways in plants and
hydroxycinnamate-CoA thioesters are the precursors of
lignin and other important phenylpropanoids.
Length = 487
Score = 83.8 bits (208), Expect = 1e-18
Identities = 49/139 (35%), Positives = 67/139 (48%), Gaps = 13/139 (9%)
Query: 2 STGRAGAPTTVCDIRLVDWDEGNYRVCDKPFPRGEILIGGSNISPGYYKNPTKTKEDFFD 61
S GR ++VD ++G + P RGE+ + G I GY NP TKE D
Sbjct: 317 SVGRLVPNVEA---KIVD-EDGGKSL--GPNERGELCVKGPQIMKGYLNNPEATKE-TID 369
Query: 62 EEGKRWFRTGDIGELHQDGVIRIIDRKKDLVKLQAGEYVSLGKVEAELKTCPVIENICVY 121
E+G W TGDIG +DG I+DRKK+L+K + G V ++EA L P + + V
Sbjct: 370 EDG--WLHTGDIGYFDEDGNFYIVDRKKELIKYK-GYQVPPAELEAVLLEHPKVADAAVI 426
Query: 122 GDSRKDYT---VALVVPNP 137
G + A VV P
Sbjct: 427 GIPDEVAGELPRAYVVLKP 445
>gnl|CDD|181644 PRK09088, PRK09088, acyl-CoA synthetase; Validated.
Length = 488
Score = 83.3 bits (206), Expect = 1e-18
Identities = 50/119 (42%), Positives = 64/119 (53%), Gaps = 8/119 (6%)
Query: 4 GRAGAPTTVCDIRLVDWDEGNYRVCDKPFPRGEILIGGSNISPGYYKNPTKTKEDFFDEE 63
G AG PT R+VD D+GN C P GE+L+ G N+SPGY++ P T F +
Sbjct: 306 GAAGIPTPTVQTRVVD-DQGND--CPAGVP-GELLLRGPNLSPGYWRRPQATAR-AFTGD 360
Query: 64 GKRWFRTGDIGELHQDGVIRIIDRKKDLVKLQAGEYVSLGKVEAELKTCPVIENICVYG 122
G WFRTGDI DG ++DRKKD+ + GE V ++EA L P I V G
Sbjct: 361 G--WFRTGDIARRDADGFFWVVDRKKDMF-ISGGENVYPAEIEAVLADHPGIRECAVVG 416
>gnl|CDD|213292 cd05926, FACL_fum10p_like, Subfamily of fatty acid CoA ligase
(FACL) similar to Fum10p of Gibberella moniliformis.
FACL catalyzes the formation of fatty acyl-CoA in a
two-step reaction: the formation of a fatty acyl-AMP
molecule as an intermediate, followed by the formation
of a fatty acyl-CoA. This is a required step before free
fatty acids can participate in most catabolic and
anabolic reactions. Fum10p is a fatty acid CoA ligase
involved in the synthesis of fumonisin, a polyketide
mycotoxin, in Gibberella moniliformis.
Length = 345
Score = 81.5 bits (202), Expect = 3e-18
Identities = 58/216 (26%), Positives = 90/216 (41%), Gaps = 45/216 (20%)
Query: 4 GRAGAPTTVCDIRLVDWDEGNYRVCDKPFPRGEILIGGSNISPGYYKNPTKTKEDFFDEE 63
G G P ++ ++D D+G P GEI+I G N++ GY NP +E F D
Sbjct: 172 GSVGRPA-GVEVAILD-DDGRPL---PPGQEGEIVIRGPNVTAGYLNNPEANREAFRD-- 224
Query: 64 GKRWFRTGDIGELHQDGVIRIIDRKKDLVKLQAGEYVSLGKVEAELKTCPVIENICVYGD 123
WFRTGD+G L +DG + + R K+L+ + GE +S +VE L P + V+G
Sbjct: 225 --GWFRTGDLGYLDEDGYLFLTGRIKELIN-RGGEKISPREVEEVLLRHPAVAEAVVFGV 281
Query: 124 SRKDY---TVALVVPNPSRLKEIAEKLNLDSLTFQQLCTHPEIEKQVLVELQEHGKKSNL 180
+ Y A VV I EL++ +K L
Sbjct: 282 PDELYGEEVAAAVVLRADSKVTIE-------------------------ELRDFARK-RL 315
Query: 181 ERFEIPNAVKLCSEVWSPDMGLVTAAFKLKRKDIQE 216
F++P + + P A K++R+ + E
Sbjct: 316 AAFKVPKKILFVDAL--P----KGATGKIQRRKLAE 345
>gnl|CDD|213280 cd05912, OSB_CoA_lg, O-succinylbenzoate-CoA ligase (also known as
O-succinylbenzoate-CoA synthase, OSB-CoA synthetase, or
MenE). O-succinylbenzoic acid-CoA synthase catalyzes
the coenzyme A (CoA)- and ATP-dependent conversion of
o-succinylbenzoic acid to o-succinylbenzoyl-CoA. The
reaction is the fourth step of the biosynthesis pathway
of menaquinone (vitamin K2). In certain bacteria,
menaquinone is used during fumarate reduction in
anaerobic respiration. In cyanobacteria, the product of
the menaquinone pathway is phylloquinone
(2-methyl-3-phytyl-1,4-naphthoquinone), a molecule used
exclusively as an electron transfer cofactor in
Photosystem 1. In green sulfur bacteria and
heliobacteria, menaquinones are used as loosely bound
secondary electron acceptors in the photosynthetic
reaction center.
Length = 407
Score = 81.5 bits (202), Expect = 5e-18
Identities = 47/187 (25%), Positives = 74/187 (39%), Gaps = 45/187 (24%)
Query: 35 GEILIGGSNISPGYYKNPTKTKEDFFDEEGKRWFRTGDIGELHQDGVIRIIDRKKDLVKL 94
GEIL+ G ++ GY T DE+G WF TGD+G L +G + ++ R+ DL+ +
Sbjct: 261 GEILVRGPSLFLGYLPQGGLTPP--LDEDG--WFHTGDLGYLDAEGYLYVLGRRDDLI-I 315
Query: 95 QAGEYVSLGKVEAELKTCPVIENICVYGDSRKDYT-----VALVVPNPSRLKEIAEKLNL 149
GE + ++EA L P +E V G D VA VVPN
Sbjct: 316 SGGENIYPEEIEAVLLQHPAVEEAAVVG--VPDDEWGQRPVAFVVPN------------- 360
Query: 150 DSLTFQQLCTHPEIEKQVLVELQEHGKKSNLERFEIPNAVKLCSEVWSPDMGLVTAAFKL 209
+ + ELQ L +++ P E+ A+ K+
Sbjct: 361 ----------DDPVSVE---ELQAFLADK-LAKYKRPKRWYPLPEL------PRNASGKI 400
Query: 210 KRKDIQE 216
R +++
Sbjct: 401 DRAALRQ 407
>gnl|CDD|102207 PRK06145, PRK06145, acyl-CoA synthetase; Validated.
Length = 497
Score = 79.5 bits (196), Expect = 3e-17
Identities = 53/155 (34%), Positives = 82/155 (52%), Gaps = 21/155 (13%)
Query: 2 STGRAGAPTTVCDIRLVDWDEGNYRVCDKPFPRGEILIGGSNISPGYYKNPTKTKEDFFD 61
STGRA A +IR+ D G + P +GEI + G ++ GY+K+P KT E F+
Sbjct: 320 STGRALAHV---EIRIAD-GAGRWL---PPNMKGEICMRGPKVTKGYWKDPEKTAEAFYG 372
Query: 62 EEGKRWFRTGDIGELHQDGVIRIIDRKKDLVKLQAGEYVSLGKVEAELKTCPVIENICVY 121
+ WFR+GD+G L ++G + + DRKKD++ + GE ++ +VE + P + V
Sbjct: 373 D----WFRSGDVGYLDEEGFLYLTDRKKDMI-ISGGENIASSEVERVIYELPEVAEAAVI 427
Query: 122 G--DSR-KDYTVALVVPNPSRLKEIAEKLNLDSLT 153
G D R + A+VV NP L L++L
Sbjct: 428 GVHDDRWGERITAVVVLNP------GATLTLEALD 456
>gnl|CDD|213327 cd12119, ttLC_FACS_AlkK_like, Fatty acyl-CoA synthetases similar to
LC-FACS from Thermus thermophilus. This family includes
fatty acyl-CoA synthetases that can activate
medium-chain to long-chain fatty acids. They catalyze
the ATP-dependent acylation of fatty acids in a two-step
reaction. The carboxylate substrate first reacts with
ATP to form an acyl-adenylate intermediate, which then
reacts with CoA to produce an acyl-CoA ester. The fatty
acyl-CoA synthetases are responsible for fatty acid
degradation as well as physiological regulation of
cellular functions via the production of fatty acyl-CoA
esters. The fatty acyl-CoA synthetase from Thermus
thermophilus in this family was shown to activate the
long-chain fatty acid myristic acid, while another
member in this family, the AlkK protein identified from
Pseudomonas oleovorans, targets medium chain fatty
acids. This family also includes uncharacterized FACS
proteins.
Length = 517
Score = 79.2 bits (196), Expect = 4e-17
Identities = 46/126 (36%), Positives = 66/126 (52%), Gaps = 21/126 (16%)
Query: 1 MSTGRAGAPTTVCDIRLVDWDEGNYRVCDKPFPR-----GEILIGGSNISPGYYKNPTKT 55
GR P ++R+VD DEGN P GE+++ G ++ YYK+P KT
Sbjct: 341 AKQGR---PVPGVELRIVD-DEGN------ELPWDGKTVGELVVRGPWVTGSYYKDPEKT 390
Query: 56 KEDFFDEEGKRWFRTGDIGELHQDGVIRIIDRKKDLVKLQAGEYVSLGKVEAELKTCP-V 114
+E D WFRTGD+ + +DG I+I DR KD++K GE++S ++E L P V
Sbjct: 391 EELTRD----GWFRTGDVAVIDEDGYIQITDRAKDVIK-SGGEWISSVELENALMAHPAV 445
Query: 115 IENICV 120
E V
Sbjct: 446 AEAAVV 451
>gnl|CDD|235146 PRK03640, PRK03640, O-succinylbenzoic acid--CoA ligase;
Provisional.
Length = 483
Score = 78.5 bits (194), Expect = 7e-17
Identities = 39/119 (32%), Positives = 63/119 (52%), Gaps = 10/119 (8%)
Query: 4 GRAGAPTTVCDIRLVDWDEGNYRVCDKPFPRGEILIGGSNISPGYYKNPTKTKEDFFDEE 63
G AG P C++++ + V PF GEI++ G N++ GY T+E F D
Sbjct: 307 GSAGKPLFPCELKIEK----DGVVV-PPFEEGEIVVKGPNVTKGYLNREDATRETFQDG- 360
Query: 64 GKRWFRTGDIGELHQDGVIRIIDRKKDLVKLQAGEYVSLGKVEAELKTCPVIENICVYG 122
WF+TGDIG L ++G + ++DR+ DL+ + GE + ++E L + P + V G
Sbjct: 361 ---WFKTGDIGYLDEEGFLYVLDRRSDLI-ISGGENIYPAEIEEVLLSHPGVAEAGVVG 415
>gnl|CDD|236235 PRK08314, PRK08314, long-chain-fatty-acid--CoA ligase; Validated.
Length = 546
Score = 77.7 bits (192), Expect = 1e-16
Identities = 48/137 (35%), Positives = 70/137 (51%), Gaps = 13/137 (9%)
Query: 7 GAPTTVCDIRLVDWDEGNYRVCDKPFPR---GEILIGGSNISPGYYKNPTKTKEDFFDEE 63
G PT D R++D + + P GEI++ G + GY+ P T E F + +
Sbjct: 360 GIPTFGVDARVIDPETL------EELPPGEVGEIVVHGPQVFKGYWNRPEATAEAFIEID 413
Query: 64 GKRWFRTGDIGELHQDGVIRIIDRKKDLVKLQAGEYVSLGKVEAELKTCPVIENICVYG- 122
GKR+FRTGD+G + ++G I DR K ++ +G V +VE L P I+ CV
Sbjct: 414 GKRFFRTGDLGRMDEEGYFFITDRLKRMIN-ASGFKVWPAEVENLLYKHPAIQEACVIAT 472
Query: 123 -DSRKDYTV-ALVVPNP 137
D R+ TV A+VV P
Sbjct: 473 PDPRRGETVKAVVVLRP 489
>gnl|CDD|213277 cd05909, AAS_C, C-terminal domain of the acyl-acyl carrier protein
synthetase (also called 2-acylglycerophosphoethanolamine
acyltransferase, Aas). Acyl-acyl carrier protein
synthase (Aas) is a membrane protein responsible for a
minor pathway of incorporating exogenous fatty acids
into membrane phospholipids. Its in vitro activity is
characterized by the ligation of free fatty acids
between 8 and 18 carbons in length to the acyl carrier
protein sulfhydryl group (ACP-SH) in the presence of ATP
and Mg2+. However, its in vivo function is as a
2-acylglycerophosphoethanolamine (2-acyl-GPE)
acyltransferase. The reaction occurs in two steps: the
acyl chain is first esterified to acyl carrier protein
(ACP) via a thioester bond, followed by a second step
where the acyl chain is transferred to a
2-acyllysophospholipid, thus completing the
transacylation reaction. This model represents the
C-terminal domain of the enzyme, which belongs to the
class I adenylate-forming enzyme family, including
acyl-CoA synthetases.
Length = 489
Score = 76.9 bits (190), Expect = 3e-16
Identities = 49/191 (25%), Positives = 81/191 (42%), Gaps = 38/191 (19%)
Query: 14 DIRLVDWDEGNYRVCDKPFPRGE---ILIGGSNISPGYYKNPTKTKEDFFDEEGKRWFRT 70
++R+V + + P GE +L+ G N+ GY N KT + G W+ T
Sbjct: 321 EVRIVSPETH------EELPIGEGGLLLVRGPNVMSGYLNNEEKTS--EVEVLGDGWYDT 372
Query: 71 GDIGELHQDGVIRIIDRKKDLVKLQAGEYVSLGKVEAELKTCPVIENICVYGDSRKDYTV 130
GDIG++ +DG + I+ R K K+ GE VSL VE L + + V
Sbjct: 373 GDIGKIDEDGFLTIVGRLKRFAKI-GGEMVSLTAVEELLS--------EILPEDSLHAAV 423
Query: 131 ALVVPNPSRLKEIAEKLNLDSLTFQQLCTHPEIEKQVLVELQEHGKKSNLERFEIPNAVK 190
VP+ + EK+ L L T +++ + EL +++ L +P +
Sbjct: 424 E--VPDEKK----GEKIVL-------LTTTEDLDPE---ELNAILREAGLPNLAVPRKIL 467
Query: 191 LCSEVWSPDMG 201
E+ P +G
Sbjct: 468 HVDEI--PLLG 476
>gnl|CDD|213302 cd05936, FC-FACS_FadD_like, Prokaryotic long-chain fatty acid CoA
synthetases similar to Escherichia coli FadD. This
subfamily of the AMP-forming adenylation family contains
Escherichia coli FadD and similar prokaryotic fatty acid
CoA synthetases. FadD was characterized as a long-chain
fatty acid CoA synthetase. The gene fadD is regulated by
the fatty acid regulatory protein FadR. Fatty acid CoA
synthetase catalyzes the formation of fatty acyl-CoA in
a two-step reaction: the formation of a fatty acyl-AMP
molecule as an intermediate, followed by the formation
of a fatty acyl-CoA. This is a required step before free
fatty acids can participate in most catabolic and
anabolic reactions.
Length = 468
Score = 76.4 bits (189), Expect = 4e-16
Identities = 43/130 (33%), Positives = 64/130 (49%), Gaps = 18/130 (13%)
Query: 14 DIRLVDWDEGNYRVCDKPFPR---GEILIGGSNISPGYYKNPTKTKEDFFDEEGKRWFRT 70
++++VD DEGN P GE+++ G + GY+ P +T E D W RT
Sbjct: 305 EVKIVD-DEGN------ELPPGEVGELVVRGPQVMKGYWNRPEETAEVLTD----GWLRT 353
Query: 71 GDIGELHQDGVIRIIDRKKDLVKLQAGEYVSLGKVEAELKTCPVIENICVYG--DSRKDY 128
GDIG + +DG I+DRKKD++ + G V ++E L + P + V G D +
Sbjct: 354 GDIGYMDEDGYFYIVDRKKDMI-IVGGFNVYPREIEEVLYSHPAVLEAAVVGVPDPYRGE 412
Query: 129 TV-ALVVPNP 137
V A VV
Sbjct: 413 AVKAFVVLKE 422
>gnl|CDD|213300 cd05934, FACL_DitJ_like, Uncharacterized subfamily of fatty acid
CoA ligase (FACL). Fatty acyl-CoA ligases catalyze the
ATP-dependent activation of fatty acids in a two-step
reaction. The carboxylate substrate first reacts with
ATP to form an acyl-adenylate intermediate, which then
reacts with CoA to produce an acyl-CoA ester. This is a
required step before free fatty acids can participate in
most catabolic and anabolic reactions. Members of this
family include DitJ from Pseudomonas and similar
proteins.
Length = 421
Score = 75.7 bits (187), Expect = 5e-16
Identities = 41/143 (28%), Positives = 66/143 (46%), Gaps = 21/143 (14%)
Query: 4 GRAGAPTTVCDIRLVDWDEGNYRVCDKPFP---RGEILIGGSN---ISPGYYKNPTKTKE 57
G G P ++R+VD ++G P GE+++ + GYY P T +
Sbjct: 245 GSCGRPRPGVEVRIVD-EDGR------EVPPGEVGELVVRPRRPWAMFKGYYGMPEATAK 297
Query: 58 DFFDEEGKRWFRTGDIGELHQDGVIRIIDRKKDLVKLQAGEYVSLGKVEAELKTCPVIEN 117
+ + WF TGD G +DG + +DRKKD ++ + GE +S +VEA + P +
Sbjct: 298 AWRNG----WFHTGDRGRRDEDGFLYFVDRKKDAIRRR-GENISSYEVEAAILAHPAVAE 352
Query: 118 ICVYG---DSRKDYTVALVVPNP 137
V + +D A+VV P
Sbjct: 353 AAVVAVPSELGEDEVKAVVVLRP 375
>gnl|CDD|213299 cd05933, ACSBG_like, Bubblegum-like very long-chain fatty acid CoA
synthetase (VL-FACS). This family of very long-chain
fatty acid CoA synthetases is named bubblegum because
Drosophila melanogaster mutant bubblegum (BGM) has
elevated levels of very-long-chain fatty acids (VLCFA)
caused by a defective gene of this family. The human
homolog (hsBG) has been characterized as a very long
chain fatty acid CoA synthetase that functions
specifically in the brain; hsBG may play a central role
in brain VLCFA metabolism and myelinogenesis. VL-FACS is
involved in the first reaction step of very long chain
fatty acid degradation. It catalyzes the formation of
fatty acyl-CoA in a two-step reaction: the formation of
a fatty acyl-AMP molecule as an intermediate, and the
formation of a fatty acyl-CoA. Free fatty acids must be
"activated" to their CoA thioesters before participating
in most catabolic and anabolic reactions.
Length = 594
Score = 75.5 bits (186), Expect = 8e-16
Identities = 64/215 (29%), Positives = 97/215 (45%), Gaps = 34/215 (15%)
Query: 35 GEILIGGSNISPGYYKNPTKTKEDFFDEEGKRWFRTGDIGELHQDGVIRIIDRKKDLVKL 94
GEI I G ++ GY KT+E DE+G W +GD+G L DG + I R K+L+
Sbjct: 391 GEICIWGRHVFMGYLNMEEKTEEAI-DEDG--WLHSGDLGFLDDDGFLYITGRIKELIIT 447
Query: 95 QAGEYVSLGKVEAELKT-CPVIENICVYGDSRKDYTVALVVPNPSRLKEIAEKLN---LD 150
GE V +E +K P+I N + GD RK ++ L LK + LD
Sbjct: 448 AGGENVPPVPIEEAVKKELPIISNAMLVGDKRKFLSMLLT------LKCEVDPETGEPLD 501
Query: 151 SLTFQ--QLCTHPEIEKQVLVEL------------QEHGKKSNLERFEIPNAVKLCSEV- 195
+LT + + C + E+ +E K+ N E I NA K+ V
Sbjct: 502 NLTEEAIEFCRLLGSHATTVSEILAGKDPLVYEAIEEGIKRVNKE--AISNAQKVQKWVI 559
Query: 196 ----WSPDMGLVTAAFKLKRKDIQERYQHEINRMY 226
+S G + KLKR + ++Y+ EI+++Y
Sbjct: 560 LEKDFSVPGGELGPTMKLKRPVVAKKYKDEIDKLY 594
>gnl|CDD|213324 cd12116, A_NRPS_Ta1_like, The adenylation domain of nonribosomal
peptide synthetases (NRPS), including salinosporamide A
polyketide synthase. The adenylation (A) domain of NRPS
recognizes a specific amino acid or hydroxy acid and
activates it as an (amino) acyl adenylate by hydrolysis
of ATP. The activated acyl moiety then forms a thioester
to the enzyme-bound cofactor phosphopantetheine of a
peptidyl carrier protein domain. NRPSs are large
multifunctional enzymes which synthesize many
therapeutically useful peptides in bacteria and fungi
via a template-directed, nucleic acid independent
nonribosomal mechanism. These natural products include
antibiotics, immunosuppressants, plant and animal
toxins, and enzyme inhibitors. NRPS has a distinct
modular structure in which each module is responsible
for the recognition, activation, and in some cases,
modification of a single amino acid residue of the final
peptide product. The modules can be subdivided into
domains that catalyze specific biochemical reactions.
This family includes the myxovirescin (TA) antibiotic
biosynthetic gene in Myxococcus xanthus; TA production
plays a role in predation. It also includes the
salinosporamide A polyketide synthase which is involved
in the biosynthesis of salinosporamide A, a marine
microbial metabolite whose chlorine atom is crucial for
potent proteasome inhibition and anticancer activity.
Length = 438
Score = 74.9 bits (185), Expect = 9e-16
Identities = 37/111 (33%), Positives = 54/111 (48%), Gaps = 8/111 (7%)
Query: 30 KPFPR---GEILIGGSNISPGYYKNPTKTKE----DFFDEEGKRWFRTGDIGELHQDGVI 82
+P P GE+ IGG ++ GY P T E D FD+ G R +RTGD+ DG +
Sbjct: 277 QPVPPGVPGELYIGGDGVARGYLGRPELTAERFVPDPFDDPGGRLYRTGDLVRWRPDGRL 336
Query: 83 RIIDRKKDLVKLQAGEYVSLGKVEAELKTCPVIENICVYGDSRKDYTVALV 133
+ R D VK++ G + LG++EA L+ P + + V LV
Sbjct: 337 EYLGRADDQVKIR-GFRIELGEIEAALRAHPGVADAVVVVREEGPGDQRLV 386
>gnl|CDD|235730 PRK06187, PRK06187, long-chain-fatty-acid--CoA ligase; Validated.
Length = 521
Score = 74.5 bits (184), Expect = 1e-15
Identities = 44/144 (30%), Positives = 66/144 (45%), Gaps = 23/144 (15%)
Query: 2 STGRAGAPTTVCDIRLVDWDEGNYRVCDKPFPR-----GEILIGGSNISPGYYKNPTKTK 56
S GR V + R+VD D+G+ P GEI++ G + GY+ P T
Sbjct: 340 SAGRPLP--GV-EARIVD-DDGD------ELPPDGGEVGEIIVRGPWLMQGYWNRPEATA 389
Query: 57 EDFFDEEGKRWFRTGDIGELHQDGVIRIIDRKKDLVKLQAGEYVSLGKVEAELKTCPVIE 116
E W TGD+G + +DG + I DR KD++ GE + ++E L P +
Sbjct: 390 ETIDGG----WLHTGDVGYIDEDGYLYITDRIKDVII-SGGENIYPRELEDALYGHPAVA 444
Query: 117 NICVYG--DSR-KDYTVALVVPNP 137
+ V G D + + VA+VV P
Sbjct: 445 EVAVIGVPDEKWGERPVAVVVLKP 468
>gnl|CDD|235531 PRK05605, PRK05605, long-chain-fatty-acid--CoA ligase; Validated.
Length = 573
Score = 74.3 bits (183), Expect = 2e-15
Identities = 51/147 (34%), Positives = 75/147 (51%), Gaps = 20/147 (13%)
Query: 1 MSTGR----AGAPTTVCDIRLVDWDEGNYRVCDKPFP---RGEILIGGSNISPGYYKNPT 53
MS R G P ++R+VD ++ D+ P GE+L+ G + GY+ P
Sbjct: 382 MSDDRRPGYVGVPFPDTEVRIVDPED-----PDETMPDGEEGELLVRGPQVFKGYWNRPE 436
Query: 54 KTKEDFFDEEGKRWFRTGDIGELHQDGVIRIIDRKKDLVKLQAGEYVSLGKVEAELKTCP 113
+T + F D WFRTGD+ + +DG IRI+DR K+L+ + G V +VE L+ P
Sbjct: 437 ETAKSFLDG----WFRTGDVVVMEEDGFIRIVDRIKELI-ITGGFNVYPAEVEEVLREHP 491
Query: 114 VIENICVYGDSRKD---YTVALVVPNP 137
+E+ V G R+D VA VV P
Sbjct: 492 GVEDAAVVGLPREDGSEEVVAAVVLEP 518
>gnl|CDD|213295 cd05929, BACL_like, Bacterial Bile acid CoA ligases and similar
proteins. Bile acid-Coenzyme A ligase catalyzes the
formation of bile acid-CoA conjugates in a two-step
reaction: the formation of a bile acid-AMP molecule as
an intermediate, followed by the formation of a bile
acid-CoA. This ligase requires a bile acid with a free
carboxyl group, ATP, Mg2+, and CoA for synthesis of the
final bile acid-CoA conjugate. The bile acid-CoA
ligation is believed to be the initial step in the bile
acid 7alpha-dehydroxylation pathway in the intestinal
bacterium Eubacterium sp.
Length = 342
Score = 73.3 bits (181), Expect = 3e-15
Identities = 40/122 (32%), Positives = 61/122 (50%), Gaps = 15/122 (12%)
Query: 4 GRAGAPTTVCDIRLVDWDEGNYRVCDKPFPR---GEILIGGSNISPGYYKNPTKTKEDFF 60
G G P ++R+VD ++G P GEI++ G + GY+ P T E
Sbjct: 169 GSVGRPVPGVEVRIVD-EDGR------ELPPGEVGEIVVRGPAVMAGYWNRPEATAEALR 221
Query: 61 DEEGKRWFRTGDIGELHQDGVIRIIDRKKDLVKLQAGEYVSLGKVEAELKTCPVIENICV 120
D W TGD+G L +DG + I+DRKKD++ + GE + +VE L P + ++ V
Sbjct: 222 DG----WLHTGDLGYLDEDGYLYIVDRKKDMI-ISGGENIYPAEVENVLLAHPAVADVAV 276
Query: 121 YG 122
G
Sbjct: 277 IG 278
>gnl|CDD|213288 cd05921, FCS, Feruloyl-CoA synthetase (FCS). Feruloyl-CoA
synthetase is an essential enzyme in the ferulic acid
degradation pathway and enables some proteobacteria to
grow on media containing ferulic acid as the sole
carbon source. It catalyzes the transfer of CoA to the
carboxyl group of ferulic acid, which then forms
feruloyl-CoA in the presence of ATP and Mg2+. The
resulting feruloyl-CoA is further degraded to vanillin
and acetyl-CoA. Feruloyl-CoA synthetase (FCS) is a
subfamily of the adenylate-forming enzymes superfamily.
Length = 559
Score = 73.4 bits (181), Expect = 4e-15
Identities = 60/240 (25%), Positives = 103/240 (42%), Gaps = 44/240 (18%)
Query: 3 TGRAGAPTTVCDIRLVDWDEGNYRVCDKPFPRG---EILIGGSNISPGYYKNPTKTKEDF 59
+G G P +++LV P G E+ + G N++PGY+++P T E
Sbjct: 346 SGVIGLPAPGTELKLV--------------PNGGKLEVRVKGPNVTPGYWRDPELTAEA- 390
Query: 60 FDEEGKRWFRTGDIGELHQD------GVI---RIIDRKKDLVKLQAGEYVSLGKVEAEL- 109
FDEEG ++R GD D G++ R+ + D KL +G +VS+G + L
Sbjct: 391 FDEEG--FYRIGDAVRF-VDPDDPERGLVFDGRVAE---DF-KLASGTWVSVGALRVALV 443
Query: 110 -KTCPVIENICVYGDSRKDYTVALVVPNPSRLKEIAEKLNLDSLTFQQLCTHPEIEKQVL 168
P++++ V G R D LV PN + + +A + + HP + + +
Sbjct: 444 AALAPLVQDAVVAGHDR-DEIGLLVFPNLAACRRLAGLAEASAE---DVLAHPAVREALA 499
Query: 169 VELQEHGK-KSNLERFEIPNAVKLCSEVWSPDMGLVTAAFKLKRKDIQERYQHEINRMYA 227
L H + + R L +E S D G +T + ++ + R + R+YA
Sbjct: 500 ARLAAHNRAATGSTRVT---RALLLAEPPSIDAGEITDKGYINQRAVLARRAALVERLYA 556
>gnl|CDD|213326 cd12118, ttLC_FACS_AEE21_like, Fatty acyl-CoA synthetases similar
to LC-FACS from Thermus thermophilus and Arabidopsis.
This family includes fatty acyl-CoA synthetases that can
activate medium to long-chain fatty acids. These enzymes
catalyze the ATP-dependent acylation of fatty acids in a
two-step reaction. The carboxylate substrate first
reacts with ATP to form an acyl-adenylate intermediate,
which then reacts with CoA to produce an acyl-CoA ester.
Fatty acyl-CoA synthetases are responsible for fatty
acid degradation as well as physiological regulation of
cellular functions via the production of fatty acyl-CoA
esters. The fatty acyl-CoA synthetase from Thermus
thermophilus in this family has been shown to activate
the long-chain fatty acid myristic acid. Also included
in this family are acyl activating enzymes from
Arabidopsis, which contains a large number of proteins
from this family with up to 63 different genes, many of
which are uncharacterized.
Length = 520
Score = 72.7 bits (179), Expect = 6e-15
Identities = 41/115 (35%), Positives = 59/115 (51%), Gaps = 17/115 (14%)
Query: 5 RAGAP-TTVCDIRLVDWDEGNYRVCDKPFPR-----GEILIGGSNISPGYYKNPTKTKED 58
R G + ++ +VD + K PR GEI++ G+ + GYYKNP T+E
Sbjct: 344 RQGVRYVGLEEVDVVDPETM------KDVPRDGKTIGEIVMRGNTVMKGYYKNPEATEEA 397
Query: 59 FFDEEGKRWFRTGDIGELHQDGVIRIIDRKKDLVKLQAGEYVSLGKVEAELKTCP 113
F G WF +GD+ +H DG I I DR KD++ + GE +S +VE L P
Sbjct: 398 F--AGG--WFHSGDLAVVHPDGYIEIKDRSKDII-ISGGENISSIEVEGVLYKHP 447
>gnl|CDD|213325 cd12117, A_NRPS_Srf_like, The adenylation domain of nonribosomal
peptide synthetases (NRPS), including Bacillus subtilis
termination module Surfactin (SrfA-C). The adenylation
(A) domain of NRPS recognizes a specific amino acid or
hydroxy acid and activates it as an (amino) acyl
adenylate by hydrolysis of ATP. The activated acyl
moiety then forms a thioester to the enzyme-bound
cofactor phosphopantetheine of a peptidyl carrier
protein domain. NRPSs are large multifunctional enzymes
which synthesize many therapeutically useful peptides in
bacteria and fungi via a template-directed, nucleic acid
independent nonribosomal mechanism. These natural
products include antibiotics, immunosuppressants, plant
and animal toxins, and enzyme inhibitors. NRPS has a
distinct modular structure in which each module is
responsible for the recognition, activation, and, in
some cases, modification of a single amino acid residue
of the final peptide product. The modules can be
subdivided into domains that catalyze specific
biochemical reactions. This family includes the
adenylation domain of the Bacillus subtilis termination
module (Surfactin domain, SrfA-C) which recognizes a
specific amino acid building block, which is then
activated and transferred to the terminal thiol of the
4'-phosphopantetheine (Ppan) arm of the downstream
peptidyl carrier protein (PCP) domain.
Length = 474
Score = 72.6 bits (179), Expect = 7e-15
Identities = 37/110 (33%), Positives = 52/110 (47%), Gaps = 7/110 (6%)
Query: 35 GEILIGGSNISPGYYKNPTKTKEDFFD---EEGKRWFRTGDIGELHQDGVIRIIDRKKDL 91
GE+ +GG ++ GY P T E F G+R +RTGD+ DG I + R
Sbjct: 324 GELYVGGDGLALGYLNRPELTAERFVPDPFGPGERLYRTGDLARWRPDGNIEFLGRIDRQ 383
Query: 92 VKLQAGEYVSLGKVEAELKTCP-VIENI-CVYGDSRKDYT-VALVVPNPS 138
VK++ G + LG++E L P V E V D+ D VA VV +
Sbjct: 384 VKIR-GFRIELGEIEQALLQHPGVREAAVLVREDAAGDKRLVAYVVARGA 432
>gnl|CDD|235313 PRK04813, PRK04813, D-alanine--poly(phosphoribitol) ligase subunit
1; Provisional.
Length = 503
Score = 72.2 bits (178), Expect = 8e-15
Identities = 38/106 (35%), Positives = 56/106 (52%), Gaps = 5/106 (4%)
Query: 35 GEILIGGSNISPGYYKNPTKTKEDFFDEEGKRWFRTGDIGELHQDGVIRIIDRKKDLVKL 94
GEI+I G ++S GY NP KT E FF +G+ + TGD G L +DG++ R +KL
Sbjct: 345 GEIVISGPSVSKGYLNNPEKTAEAFFTFDGQPAYHTGDAGYL-EDGLLFYQGRIDFQIKL 403
Query: 95 QAGEYVSLGKVEAELKTCPVIENICV---YGDSRKDYTVALVVPNP 137
G + L ++E L+ +E+ V D + Y +A VVP
Sbjct: 404 -NGYRIELEEIEQNLRQSSYVESAVVVPYNKDHKVQYLIAYVVPKE 448
>gnl|CDD|233550 TIGR01733, AA-adenyl-dom, amino acid adenylation domain. This
model represents a domain responsible for the specific
recognition of amino acids and activation as adenylyl
amino acids. The reaction catalyzed is aa + ATP ->
aa-AMP + PPi. These domains are usually found as
components of multi-domain non-ribosomal peptide
synthetases and are usually called "A-domains" in that
context (for a review, see ). A-domains are almost
invariably followed by "T-domains" (thiolation domains,
pfam00550) to which the amino acid adenylate is
transferred as a thiol-ester to a bound pantetheine
cofactor with the release of AMP (these are also called
peptide carrier proteins, or PCPs). When the A-domain
does not represent the first module (corresponding to
the first amino acid in the product molecule) it is
usually preceded by a "C-domain" (condensation domain,
pfam00668) which catalyzes the ligation of two amino
acid thiol-esters from neighboring modules. This domain
is a subset of the AMP-binding domain found in Pfam
(pfam00501) which also hits substrate--CoA ligases and
luciferases. Sequences scoring in between trusted and
noise for this model may be ambiguous as to whether they
activate amino acids or other molecules lacking an alpha
amino group.
Length = 409
Score = 72.3 bits (178), Expect = 8e-15
Identities = 31/91 (34%), Positives = 45/91 (49%), Gaps = 6/91 (6%)
Query: 35 GEILIGGSNISPGYYKNPTKTKEDFFD-----EEGKRWFRTGDIGELHQDGVIRIIDRKK 89
GE+ IGG ++ GY P T E F +G R +RTGD+ DG + + R
Sbjct: 320 GELYIGGPGVARGYLNRPELTAERFVPDPFAGGDGARLYRTGDLVRYLPDGNLEFLGRID 379
Query: 90 DLVKLQAGEYVSLGKVEAELKTCPVIENICV 120
D VK++ G + LG++EA L P + V
Sbjct: 380 DQVKIR-GYRIELGEIEAALLRHPGVREAVV 409
>gnl|CDD|237144 PRK12582, PRK12582, acyl-CoA synthetase; Provisional.
Length = 624
Score = 72.0 bits (177), Expect = 1e-14
Identities = 57/211 (27%), Positives = 91/211 (43%), Gaps = 40/211 (18%)
Query: 36 EILIGGSNISPGYYKNPTKTKEDFFDEEGKRWFRTGDIGELHQDGVIRIIDRK---KDLV 92
E+ + G N++PGY+K+P T FDEEG ++R GD R +D K L+
Sbjct: 425 EVRVKGPNVTPGYHKDPELT-AAAFDEEG--FYRLGDAA--------RFVDPDDPEKGLI 473
Query: 93 ---------KLQAGEYVSLGKVEAE-LKTC-PVIENICVYGDSRKDYTVALVVPNPSRLK 141
KL G +VS+G + + + C PVI + V G R + L PNP+ +
Sbjct: 474 FDGRVAEDFKLSTGTWVSVGTLRPDAVAACSPVIHDAVVAGQDR-AFIGLLAWPNPAACR 532
Query: 142 EIAEKLNLDSLTFQQLCTHPEIEKQVLVELQEH-----GKKSNLERFEIPNAVKLCSEVW 196
++A + + HP + + L H G S + R L +E
Sbjct: 533 QLAGD---PDAAPEDVVKHPAVLAILREGLSAHNAEAGGSSSRIARA------LLMTEPP 583
Query: 197 SPDMGLVTAAFKLKRKDIQERYQHEINRMYA 227
S D G +T + ++ + ER + R+YA
Sbjct: 584 SIDAGEITDKGYINQRAVLERRAALVERLYA 614
>gnl|CDD|171539 PRK12492, PRK12492, long-chain-fatty-acid--CoA ligase; Provisional.
Length = 562
Score = 71.8 bits (176), Expect = 1e-14
Identities = 49/136 (36%), Positives = 70/136 (51%), Gaps = 13/136 (9%)
Query: 4 GRAGAPTTVCDIRLVDWDEGNYRVCDKPF-PRGEILIGGSNISPGYYKNPTKTKEDFFDE 62
G G P ++++D D+GN + P RGE+ I G + GY++ P T E D
Sbjct: 386 GTVGIPVPGTALKVID-DDGN----ELPLGERGELCIKGPQVMKGYWQQPEATAE-ALDA 439
Query: 63 EGKRWFRTGDIGELHQDGVIRIIDRKKDLVKLQAGEYVSLGKVEAELKTCPVIENICVYG 122
EG WF+TGDI + DG +RI+DRKKDL+ + +G V ++E + P + N G
Sbjct: 440 EG--WFKTGDIAVIDPDGFVRIVDRKKDLI-IVSGFNVYPNEIEDVVMAHPKVANCAAIG 496
Query: 123 --DSRKDYTVAL-VVP 135
D R V L VV
Sbjct: 497 VPDERSGEAVKLFVVA 512
>gnl|CDD|235625 PRK05852, PRK05852, acyl-CoA synthetase; Validated.
Length = 534
Score = 71.5 bits (175), Expect = 2e-14
Identities = 46/141 (32%), Positives = 67/141 (47%), Gaps = 13/141 (9%)
Query: 1 MSTGRAGAPTTVCDIRLVDWDEGNYRVCDKPFPRGEILIGGSNISPGYYKNPTKTKEDFF 60
+STG G T IR+V D GE+ + G+ + GY +PT T +F
Sbjct: 352 VSTGLVGRSTGA-QIRIVGSDGLPL----PAGAVGEVWLRGTTVVRGYLGDPTITAANFT 406
Query: 61 DEEGKRWFRTGDIGELHQDGVIRIIDRKKDLVKLQAGEYVSLGKVEAELKTCPVIENICV 120
D W RTGD+G L G + I R K+L+ + GE +S +VE L + P + V
Sbjct: 407 DG----WLRTGDLGSLSAAGDLSIRGRIKELIN-RGGEKISPERVEGVLASHPNVMEAAV 461
Query: 121 YGDSRKDY--TV-ALVVPNPS 138
+G + Y V A++VP S
Sbjct: 462 FGVPDQLYGEAVAAVIVPRES 482
>gnl|CDD|181011 PRK07514, PRK07514, malonyl-CoA synthase; Validated.
Length = 504
Score = 71.4 bits (176), Expect = 2e-14
Identities = 44/130 (33%), Positives = 68/130 (52%), Gaps = 18/130 (13%)
Query: 15 IRLVDWDEGNYRVCDKPFPRGE---ILIGGSNISPGYYKNPTKTKEDFFDEEGKRWFRTG 71
+R+ D + G P GE I + G N+ GY++ P KT E+F +G +F TG
Sbjct: 332 LRVTDPETG------AELPPGEIGMIEVKGPNVFKGYWRMPEKTAEEF-RADG--FFITG 382
Query: 72 DIGELHQDGVIRIIDRKKDLVKLQAGEYVSLGKVEAELKTCP-VIENICVYGDSRKDY-- 128
D+G++ + G + I+ R KDL+ + G V +VE E+ P V+E+ V G D+
Sbjct: 383 DLGKIDERGYVHIVGRGKDLI-ISGGYNVYPKEVEGEIDELPGVVES-AVIGVPHPDFGE 440
Query: 129 -TVALVVPNP 137
A+VVP P
Sbjct: 441 GVTAVVVPKP 450
>gnl|CDD|213285 cd05918, A_NRPS_SidN3_like, The adenylation (A) domain of
siderophore-synthesizing nonribosomal peptide
synthetases (NRPS). The adenylation (A) domain of NRPS
recognizes a specific amino acid or hydroxy acid and
activates it as an (amino) acyl adenylate by hydrolysis
of ATP. The activated acyl moiety then forms a thioester
to the enzyme-bound cofactor phosphopantetheine of a
peptidyl carrier protein domain. This family of
siderophore-synthesizing NRPS includes the third
adenylation domain of SidN from the endophytic fungus
Neotyphodium lolii, ferrichrome siderophore synthetase,
HC-toxin synthetase, and enniatin synthase. NRPSs are
large multifunctional enzymes which synthesize many
therapeutically useful peptides. These natural products
include antibiotics, immunosuppressants, plant and
animal toxins, and enzyme inhibitors. NRPS has a
distinct modular structure in which each module is
responsible for the recognition, activation, and in some
cases, modification of a single amino acid residue of
the final peptide product. The modules can be subdivided
into domains that catalyze specific biochemical
reactions.
Length = 447
Score = 70.7 bits (174), Expect = 3e-14
Identities = 35/120 (29%), Positives = 55/120 (45%), Gaps = 10/120 (8%)
Query: 7 GAPTTVCDIRLVDWDEGNYRVCDKPFPRGEILIGGSNISPGYYKNPTKTKEDFF------ 60
G P ++D + GE+ IGG ++ GY P T E F
Sbjct: 261 GRPLPNYTCWVLD--PDLEPLVPIGAV-GELCIGGPGVARGYLNRPELTAEKFIPDPFWL 317
Query: 61 DEEGKRWFRTGDIGELHQDGVIRIIDRKKDLVKLQAGEYVSLGKVEAELKTCPVIENICV 120
+ G R +RTGD+ +DG + + RK D VK++ G+ + LG++EA L+ P + V
Sbjct: 318 NNPGGRIYRTGDLVRYLEDGSLEFLGRKDDQVKIR-GQRIELGEIEAVLRALPGVVVAAV 376
>gnl|CDD|236175 PRK08180, PRK08180, feruloyl-CoA synthase; Reviewed.
Length = 614
Score = 69.9 bits (172), Expect = 6e-14
Identities = 61/238 (25%), Positives = 101/238 (42%), Gaps = 46/238 (19%)
Query: 7 GAPTTVCDIRLVDWDEGNYRVCDKPFPRGEILIGGSNISPGYYKNPTKTKEDFFDEEGKR 66
G P C+++LV + E+ + G N++PGY++ P T E FDEEG
Sbjct: 395 GLPAPGCEVKLVPVGG-----------KLEVRVKGPNVTPGYWRAPELTAE-AFDEEG-- 440
Query: 67 WFRTGDIGELHQ----------DGVIRIIDRKKDLVKLQAGEYVSLGKVEAEL--KTCPV 114
++R+GD DG RI + D KL +G +VS+G + A P+
Sbjct: 441 YYRSGDAVRFVDPADPERGLMFDG--RIAE---DF-KLSSGTWVSVGPLRARAVSAGAPL 494
Query: 115 IENICVYGDSRKDYTVALVVPNPSRLKEIAEKLNLDSLTFQQLCTHPEIE---KQVLVEL 171
++++ + G R D LV PN + +A L SL ++ HP + ++ L L
Sbjct: 495 VQDVVITGHDR-DEIGLLVFPNLDACRRLAGLLADASLA--EVLAHPAVRAAFRERLARL 551
Query: 172 QEHGKKSN--LERFEIPNAVKLCSEVWSPDMGLVTAAFKLKRKDIQERYQHEINRMYA 227
S+ + R L E S D G +T + ++ + R + +YA
Sbjct: 552 NAQATGSSTRVAR------ALLLDEPPSLDAGEITDKGYINQRAVLARRAALVEALYA 603
>gnl|CDD|213276 cd05908, A_NRPS_MycA_like, The adenylation domain of nonribosomal
peptide synthetases (NRPS) similar to mycosubtilin
synthase subunit A (MycA). The adenylation (A) domain
of NRPS recognizes a specific amino acid or hydroxy acid
and activates it as an (amino)-acyl adenylate by hydrolysis
of ATP. The activated acyl moiety then forms a thioester
to the enzyme-bound cofactor phosphopantetheine of a
peptidyl carrier protein domain. This family includes
NRPS similar to mycosubtilin synthase subunit A (MycA).
Mycosubtilin, which is characterized by a beta-amino
fatty acid moiety linked to the circular heptapeptide
Asn-Tyr-Asn-Gln-Pro-Ser-Asn, belongs to the iturin
family of lipopeptide antibiotics. The mycosubtilin
synthase subunit A (MycA) combines functional domains
derived from peptide synthetases, amino transferases,
and fatty acid synthases. Nonribosomal peptide
synthetases are large multifunction enzymes that
synthesize many therapeutically useful peptides. NRPS
has a distinct modular structure in which each module is
responsible for the recognition, activation, and, in
some cases, modification of a single amino acid residue
of the final peptide product. The modules can be
subdivided into domains that catalyze specific
biochemical reactions.
Length = 499
Score = 67.7 bits (166), Expect = 3e-13
Identities = 32/86 (37%), Positives = 47/86 (54%), Gaps = 8/86 (9%)
Query: 7 GAPTTVCDIRLVDWDEGNYRVCDKPFPRGEILIGGSNISPGYYKNPTKTKEDFFDEEGKR 66
G+P C+IR+VD + + D+ G I I G N++ GYY NP TK +G
Sbjct: 317 GSPIDGCEIRIVD--DAGEVLPDRTI--GHIQIKGDNVTSGYYNNPEATK-KSITPDG-- 369
Query: 67 WFRTGDIGELHQDGVIRIIDRKKDLV 92
W +TGD+G L +G + I R KD++
Sbjct: 370 WLKTGDLGFLR-NGRLVITGRAKDII 394
>gnl|CDD|236169 PRK08162, PRK08162, acyl-CoA synthetase; Validated.
Length = 545
Score = 66.9 bits (164), Expect = 6e-13
Identities = 35/82 (42%), Positives = 47/82 (57%), Gaps = 10/82 (12%)
Query: 30 KPFPR-----GEILIGGSNISPGYYKNPTKTKEDFFDEEGKRWFRTGDIGELHQDGVIRI 84
+P P GEI+ G+ + GY KNP T+E F G WF TGD+ LH DG I+I
Sbjct: 379 QPVPADGETIGEIMFRGNIVMKGYLKNPKATEEAF--AGG--WFHTGDLAVLHPDGYIKI 434
Query: 85 IDRKKDLVKLQAGEYVSLGKVE 106
DR KD++ + GE +S +VE
Sbjct: 435 KDRSKDII-ISGGENISSIEVE 455
>gnl|CDD|168170 PRK05677, PRK05677, long-chain-fatty-acid--CoA ligase; Validated.
Length = 562
Score = 66.7 bits (163), Expect = 7e-13
Identities = 38/106 (35%), Positives = 61/106 (57%), Gaps = 12/106 (11%)
Query: 9 PTTVCDIRLVDWDEGNYRVCDKPF-PRGEILIGGSNISPGYYKNPTKTKEDFFDEEGKRW 67
P+T+C +++D D+GN + P GE+ + G + GY++ P T E D +G W
Sbjct: 385 PSTLC--KVID-DDGN----ELPLGEVGELCVKGPQVMKGYWQRPEATDE-ILDSDG--W 434
Query: 68 FRTGDIGELHQDGVIRIIDRKKDLVKLQAGEYVSLGKVEAELKTCP 113
+TGDI + +DG +RI+DRKKD++ L +G V ++E L P
Sbjct: 435 LKTGDIALIQEDGYMRIVDRKKDMI-LVSGFNVYPNELEDVLAALP 479
>gnl|CDD|233551 TIGR01734, D-ala-DACP-lig, D-alanine--poly(phosphoribitol) ligase,
subunit 1. This model represents the enzyme (also
called D-alanine-D-alanyl carrier protein ligase) which
activates D-alanine as an adenylate via the reaction
D-ala + ATP -> D-ala-AMP + PPi, and further catalyzes
the condensation of the amino acid adenylate with the
D-alanyl carrier protein (D-ala-ACP). The D-alanine is
then further transferred to teichoic acid in the
biosynthesis of lipoteichoic acid (LTA) and wall
teichoic acid (WTA) in Gram-positive bacteria, both
polysaccharides [Cell envelope, Biosynthesis and
degradation of murein sacculus and peptidoglycan].
Length = 502
Score = 66.7 bits (163), Expect = 7e-13
Identities = 39/108 (36%), Positives = 56/108 (51%), Gaps = 6/108 (5%)
Query: 34 RGEILIGGSNISPGYYKNPTKTKEDFFDEEGKRWFRTGDIGELHQDGVIRIIDRKKDLVK 93
+GEI+I G ++S GY NP KT E FF EG+ +RTGD G + DG + R +K
Sbjct: 342 KGEIVIVGPSVSKGYLNNPEKTAEAFFSHEGQPAYRTGDAGTI-TDGQLFYQGRLDFQIK 400
Query: 94 LQAGEYVSLGKVEAELKTCPVIEN---ICVYGDSRK-DYTVALVVPNP 137
L G + L +E L+ IE+ + Y K +Y +A +VP
Sbjct: 401 LH-GYRIELEDIEFNLRQSSYIESAVVVPKYNKDHKVEYLIAAIVPET 447
>gnl|CDD|181381 PRK08316, PRK08316, acyl-CoA synthetase; Validated.
Length = 523
Score = 66.1 bits (162), Expect = 1e-12
Identities = 44/122 (36%), Positives = 64/122 (52%), Gaps = 15/122 (12%)
Query: 4 GRAGAPTTVCDIRLVDWDEGNYRVCDKPFPR---GEILIGGSNISPGYYKNPTKTKEDFF 60
G AG P + R+VD D+GN GEI+ + GY+ +P KT E F
Sbjct: 341 GSAGRPVLNVETRVVD-DDGN------DVAPGEVGEIVHRSPQLMLGYWDDPEKTAEAFR 393
Query: 61 DEEGKRWFRTGDIGELHQDGVIRIIDRKKDLVKLQAGEYVSLGKVEAELKTCPVIENICV 120
G WF +GD+G + ++G I ++DRKKD++K GE V+ +VE L T P + + V
Sbjct: 394 G--G--WFHSGDLGVMDEEGYITVVDRKKDMIK-TGGENVASREVEEALYTHPAVAEVAV 448
Query: 121 YG 122
G
Sbjct: 449 IG 450
>gnl|CDD|223951 COG1020, EntF, Non-ribosomal peptide synthetase modules and related
proteins [Secondary metabolites biosynthesis, transport,
and catabolism].
Length = 642
Score = 66.1 bits (161), Expect = 1e-12
Identities = 28/79 (35%), Positives = 40/79 (50%), Gaps = 4/79 (5%)
Query: 35 GEILIGGSNISPGYYKNPTKTKEDFFDEEGKRWFRTGDIGELHQDGVIRIIDRKKDLVKL 94
GE+ I G ++ GY P T E F R +RTGD+ DG + + RK VK+
Sbjct: 568 GELYIAGLGLALGYLNRPDLTAERFIA---LRLYRTGDLARPLADGALEYLGRKDSQVKI 624
Query: 95 QAGEYVSLGKVEAELKTCP 113
+ G + LG++EA L P
Sbjct: 625 R-GFRIELGEIEAALAEQP 642
>gnl|CDD|213282 cd05914, FACL_like_3, Uncharacterized subfamily of fatty acid CoA
ligase (FACL). Fatty acyl-CoA ligases catalyze the
ATP-dependent activation of fatty acids in a two-step
reaction. The carboxylate substrate first reacts with
ATP to form an acyl-adenylate intermediate, which then
reacts with CoA to produce an acyl-CoA ester. This is a
required step before free fatty acids can participate in
most catabolic and anabolic reactions.
Length = 448
Score = 65.8 bits (161), Expect = 1e-12
Identities = 40/108 (37%), Positives = 59/108 (54%), Gaps = 11/108 (10%)
Query: 35 GEILIGGSNISPGYYKNPTKTKEDFFDEEGKRWFRTGDIGELHQDGVIRIIDRKKDLVKL 94
GEIL+ GS + GY P T + W+ TGD+G L ++G + I RKK+L+
Sbjct: 305 GEILVRGSLML-GYLGEPPATDD---------WWATGDLGHLDEEGYLYINGRKKNLIIT 354
Query: 95 QAGEYVSLGKVEAELKTCPVIENICVYGDSRKDYTVALVVPNPSRLKE 142
G VS VE+EL+ P I V+GD++ + VAL+VP P+ +
Sbjct: 355 SFGRNVSPEWVESELQQAPAIAQAVVFGDAQP-FLVALIVPAPNISDD 401
>gnl|CDD|213272 cd05904, 4CL, 4-Coumarate-CoA Ligase (4CL). 4-Coumarate:coenzyme A
ligase is a key enzyme in the phenylpropanoid metabolic
pathway for monolignol and flavonoid biosynthesis. It
catalyzes the synthesis of hydroxycinnamate-CoA
thioesters in a two-step reaction, involving the
formation of hydroxycinnamate-AMP anhydride and the
nucleophilic substitution of AMP by CoA. The
phenylpropanoid pathway is one of the most important
secondary metabolism pathways in plants and
hydroxycinnamate-CoA thioesters are the precursors of
lignin and other important phenylpropanoids.
Length = 504
Score = 65.4 bits (160), Expect = 2e-12
Identities = 43/135 (31%), Positives = 67/135 (49%), Gaps = 17/135 (12%)
Query: 15 IRLVDWDEGNYRVCDKPFP---RGEILIGGSNISPGYYKNPTKTKEDFFDEEGKRWFRTG 71
++VD + G + P GE+ + G + GY NP T E D++G W TG
Sbjct: 341 AKIVDPETG------ESLPPNQPGELWVRGPQVMKGYLNNPEATAETI-DKDG--WLHTG 391
Query: 72 DIGELHQDGVIRIIDRKKDLVKLQAGEYVSLGKVEAELKTCPVIENICVYGDSRKD---Y 128
D+G +DG + I+DR K+L+K + G V+ ++EA L + P I + V ++
Sbjct: 392 DLGYFDEDGYLFIVDRLKELIKYK-GFQVAPAELEALLLSHPEIADAAVIPYPDEEAGEI 450
Query: 129 TVALVVPNP-SRLKE 142
+A VV P S L E
Sbjct: 451 PMAFVVRQPGSELSE 465
>gnl|CDD|237054 PRK12316, PRK12316, peptide synthase; Provisional.
Length = 5163
Score = 65.7 bits (160), Expect = 2e-12
Identities = 40/118 (33%), Positives = 56/118 (47%), Gaps = 9/118 (7%)
Query: 35 GEILIGGSNISPGYYKNPTKTKEDF----FDEEGKRWFRTGDIGELHQDGVIRIIDRKKD 90
GE+ +GG ++ GY + P T E F F G R +RTGD+ DGVI + R
Sbjct: 4894 GELYLGGEGVARGYLERPALTAERFVPDPFGAPGGRLYRTGDLARYRADGVIDYLGRVDH 4953
Query: 91 LVKLQAGEYVSLGKVEAELKTCPVIEN---ICVYGDSRKDYTVALVVPNPSRLKEIAE 145
VK++ G + LG++EA L+ P + I G K V VVP L + E
Sbjct: 4954 QVKIR-GFRIELGEIEARLREHPAVREAVVIAQEGAVGKQL-VGYVVPQDPALADADE 5009
Score = 60.0 bits (145), Expect = 2e-10
Identities = 37/111 (33%), Positives = 50/111 (45%), Gaps = 8/111 (7%)
Query: 31 PFP---RGEILIGGSNISPGYYKNPTKTKEDFFDEE---GKRWFRTGDIGELHQDGVIRI 84
P P GE+ + G ++ GY+ P T E F G+R +RTGD+ DGVI
Sbjct: 845 PVPVGVLGELYLAGRGLARGYHGRPGLTAERFVPSPFVAGERMYRTGDLARYRADGVIEY 904
Query: 85 IDRKKDLVKLQAGEYVSLGKVEAELKTCPVIENICVYGDSRKDYTVALVVP 135
R VKL+ G + LG++EA L P + V K V VV
Sbjct: 905 AGRIDHQVKLR-GLRIELGEIEARLLEHPWVREAAVLAVDGKQL-VGYVVL 953
Score = 57.3 bits (138), Expect = 2e-09
Identities = 37/107 (34%), Positives = 52/107 (48%), Gaps = 5/107 (4%)
Query: 35 GEILIGGSNISPGYYKNPTKTKEDFFDE---EGKRWFRTGDIGELHQDGVIRIIDRKKDL 91
GE+ +GG ++ GY+ P T E F + G+R +RTGD+ DGVI I R
Sbjct: 3390 GELYLGGEGLARGYHNRPGLTAERFVPDPFVPGERLYRTGDLARYRADGVIEYIGRVDHQ 3449
Query: 92 VKLQAGEYVSLGKVEAELKTCPVIENICVYGDSRKDYTVALVVPNPS 138
VK++ G + LG++EA L P + V VA VVP
Sbjct: 3450 VKIR-GFRIELGEIEARLLEHPWVREAVVLAVD-GRQLVAYVVPEDE 3494
Score = 55.7 bits (134), Expect = 6e-09
Identities = 39/108 (36%), Positives = 55/108 (50%), Gaps = 9/108 (8%)
Query: 35 GEILIGGSNISPGYYKNPTKTKEDF----FDEEGKRWFRTGDIGELHQDGVIRIIDRKKD 90
GE+ +GG ++ GY P T E F F G+R +RTGD+ DGV+ + R
Sbjct: 2346 GELYLGGEGLARGYLNRPGLTAERFVPDPFSASGERLYRTGDLARYRADGVVEYLGRIDH 2405
Query: 91 LVKLQAGEYVSLGKVEAELKTCPVIENICVY---GDSRKDYTVALVVP 135
VK++ G + LG++EA L+ P + V G S K VA VVP
Sbjct: 2406 QVKIR-GFRIELGEIEARLQAHPAVREAVVVAQDGASGKQL-VAYVVP 2451
>gnl|CDD|213296 cd05930, A_NRPS, The adenylation domain of nonribosomal peptide
synthetases (NRPS). The adenylation (A) domain of NRPS
recognizes a specific amino acid or hydroxy acid and
activates it as an (amino) acyl adenylate by hydrolysis
of ATP. The activated acyl moiety then forms a thioester
bond to the enzyme-bound cofactor phosphopantetheine of
a peptidyl carrier protein domain. NRPSs are large
multifunctional enzymes which synthesize many
therapeutically useful peptides in bacteria and fungi
via a template-directed, nucleic acid independent
nonribosomal mechanism. These natural products include
antibiotics, immunosuppressants, plant and animal
toxins, and enzyme inhibitors. NRPS has a distinct
modular structure in which each module is responsible
for the recognition, activation, and in some cases,
modification of a single amino acid residue of the final
peptide product. The modules can be subdivided into
domains that catalyze specific biochemical reactions.
Length = 445
Score = 64.4 bits (158), Expect = 4e-12
Identities = 39/118 (33%), Positives = 56/118 (47%), Gaps = 8/118 (6%)
Query: 35 GEILIGGSNISPGYYKNPTKTKEDF----FDEEGKRWFRTGDIGELHQDGVIRIIDRKKD 90
GE+ IGG+ ++ GY P T E F F G+R +RTGD+ DG + + R D
Sbjct: 292 GELYIGGAGVARGYLNRPELTAERFVPDPFGGPGERLYRTGDLARWLPDGNLEFLGRADD 351
Query: 91 LVKLQAGEYVSLGKVEAELKTCPVIEN--ICVYGDSRKD-YTVALVVPNPSRLKEIAE 145
VK+ G + LG++EA L P + + D + VA VVP + AE
Sbjct: 352 QVKI-RGYRIELGEIEAALLAHPGVREAVVVAREDGAGEKRLVAYVVPAAGAELDAAE 408
>gnl|CDD|169098 PRK07786, PRK07786, long-chain-fatty-acid--CoA ligase; Validated.
Length = 542
Score = 64.0 bits (156), Expect = 5e-12
Identities = 44/153 (28%), Positives = 70/153 (45%), Gaps = 18/153 (11%)
Query: 4 GRAGAPTTVCDIRLVDWDEGNYRVCDKPFPR-GEILIGGSNISPGYYKNPTKTKEDFFDE 62
G G R+VD + N D P GEI+ + GY+ NP T E F
Sbjct: 345 GSVGKVIPTVAARVVD-ENMN----DVPVGEVGEIVYRAPTLMSGYWNNPEATAEAF--- 396
Query: 63 EGKRWFRTGDIGELHQDGVIRIIDRKKDLVKLQAGEYVSLGKVEAELKTCPVIENICVYG 122
G WF +GD+ ++G + ++DRKKD++ + GE + +VE L + P I + V G
Sbjct: 397 AGG-WFHSGDLVRQDEEGYVWVVDRKKDMI-ISGGENIYCAEVENVLASHPDIVEVAVIG 454
Query: 123 DSRKDYT---VALVVPNPS----RLKEIAEKLN 148
+ + + VA+ L+++AE L
Sbjct: 455 RADEKWGEVPVAVAAVRNDDAALTLEDLAEFLT 487
>gnl|CDD|236359 PRK08974, PRK08974, long-chain-fatty-acid--CoA ligase; Validated.
Length = 560
Score = 64.3 bits (157), Expect = 5e-12
Identities = 33/90 (36%), Positives = 50/90 (55%), Gaps = 8/90 (8%)
Query: 3 TGRAGAPTTVCDIRLVDWDEGNYRVCDKPFPRGEILIGGSNISPGYYKNPTKTKEDFFDE 62
+G G P +I+LVD D+GN +P GE+ + G + GY++ P T E D
Sbjct: 377 SGSIGLPVPSTEIKLVD-DDGNEVPPGEP---GELWVKGPQVMLGYWQRPEATDEVIKDG 432
Query: 63 EGKRWFRTGDIGELHQDGVIRIIDRKKDLV 92
W TGDI + ++G +RI+DRKKD++
Sbjct: 433 ----WLATGDIAVMDEEGFLRIVDRKKDMI 458
>gnl|CDD|215576 PLN03102, PLN03102, acyl-activating enzyme; Provisional.
Length = 579
Score = 63.9 bits (155), Expect = 7e-12
Identities = 39/87 (44%), Positives = 51/87 (58%), Gaps = 6/87 (6%)
Query: 35 GEILIGGSNISPGYYKNPTKTKEDFFDEEGKRWFRTGDIGELHQDGVIRIIDRKKDLVKL 94
GEI+I GS+I GY KNP T E F + G W TGD+G +H DG + I DR KD++ +
Sbjct: 393 GEIVIKGSSIMKGYLKNPKATSEAF--KHG--WLNTGDVGVIHPDGHVEIKDRSKDII-I 447
Query: 95 QAGEYVSLGKVEAELKTCP-VIENICV 120
GE +S +VE L P V+E V
Sbjct: 448 SGGENISSVEVENVLYKYPKVLETAVV 474
>gnl|CDD|236072 PRK07656, PRK07656, long-chain-fatty-acid--CoA ligase; Validated.
Length = 513
Score = 63.8 bits (156), Expect = 7e-12
Identities = 25/58 (43%), Positives = 36/58 (62%), Gaps = 3/58 (5%)
Query: 35 GEILIGGSNISPGYYKNPTKTKEDFFDEEGKRWFRTGDIGELHQDGVIRIIDRKKDLV 92
GE+L+ G N+ GYY +P T D +G W TGD+G L ++G + I+DRKKD+
Sbjct: 364 GELLVRGPNVMKGYYDDPEATAAAI-DADG--WLHTGDLGRLDEEGYLYIVDRKKDMF 418
>gnl|CDD|168698 PRK06839, PRK06839, acyl-CoA synthetase; Validated.
Length = 496
Score = 63.0 bits (153), Expect = 1e-11
Identities = 37/138 (26%), Positives = 62/138 (44%), Gaps = 12/138 (8%)
Query: 4 GRAGAPTTVCDIRLVDWDEGNYRVCDKPFPRGEILIGGSNISPGYYKNPTKTKEDFFDEE 63
G G P CD L+D ++ + GE+LI G N+ Y+ P T+E D
Sbjct: 317 GSIGKPVLFCDYELIDENKNKV----EVGEVGELLIRGPNVMKEYWNRPDATEETIQDG- 371
Query: 64 GKRWFRTGDIGELHQDGVIRIIDRKKDLVKLQAGEYVSLGKVEAELKTCPVIENICVYGD 123
W TGD+ + +DG + I+ RKK+++ + GE + +VE + + + V G
Sbjct: 372 ---WLCTGDLARVDEDGFVYIVGRKKEMI-ISGGENIYPLEVEQVINKLSDVYEVAVVGR 427
Query: 124 SRKDY---TVALVVPNPS 138
+ +A +V S
Sbjct: 428 QHVKWGEIPIAFIVKKSS 445
>gnl|CDD|213289 cd05922, FACL_like_6, Uncharacterized subfamily of fatty acid CoA
ligase (FACL). Fatty acyl-CoA ligases catalyze the
ATP-dependent activation of fatty acids in a two-step
reaction. The carboxylate substrate first reacts with
ATP to form an acyl-adenylate intermediate, which then
reacts with CoA to produce an acyl-CoA ester. This is a
required step before free fatty acids can participate in
most catabolic and anabolic reactions.
Length = 350
Score = 62.6 bits (153), Expect = 1e-11
Identities = 42/182 (23%), Positives = 79/182 (43%), Gaps = 36/182 (19%)
Query: 13 CDIRLVDWDEGNYRVCDKPFPRGEILIGGSNISPGYYKNPTKTKEDF--FDEEGKRWFRT 70
++ +VD ++GN R P GE++ G+N+ GY+ +P T E G+ T
Sbjct: 180 VELWVVD-EDGN-RC--APGEVGELVHRGANVMKGYWNDPEATAERLRPGPLPGEIVLYT 235
Query: 71 GDIGELHQDGVIRIIDRKKDLVKLQAGEYVSLGKVEAELKTCPVIENICVYGDSRKDYTV 130
GD+ + ++G + + RK D++K G VS ++E + P++ V G
Sbjct: 236 GDLVRMDEEGYLYFVGRKDDMIKT-RGYRVSPTEIEEVICAHPLVAEAAVIG-------- 286
Query: 131 ALVVPNPS---RLKEIAEKLNLDSLTFQQLCTHPEIEKQVLVELQEHGKKSNLERFEIPN 187
VP+P + + + +SL EK++L ++H L + +P
Sbjct: 287 ---VPDPLLGQAIVAVVVSADGESLD----------EKELLKHCRKH-----LPNYMVPA 328
Query: 188 AV 189
+
Sbjct: 329 EI 330
>gnl|CDD|236108 PRK07824, PRK07824, O-succinylbenzoic acid--CoA ligase;
Provisional.
Length = 358
Score = 62.8 bits (153), Expect = 1e-11
Identities = 40/114 (35%), Positives = 54/114 (47%), Gaps = 10/114 (8%)
Query: 35 GEILIGGSNISPGYYKNPTKTKEDFFDEEGKRWFRTGDIGELHQDGVIRIIDRKKDLVKL 94
G I +GG ++ GY +NP D F E G WFRT D+G L DGV+ ++ R D +
Sbjct: 208 GRIALGGPTLAKGY-RNPVD--PDPFAEPG--WFRTDDLGALD-DGVLTVLGRADDAIS- 260
Query: 95 QAGEYVSLGKVEAELKTCPVIENICVYG--DSR-KDYTVALVVPNPSRLKEIAE 145
G V VEA L T P + + V+G D R VA VV + +
Sbjct: 261 TGGLTVLPQVVEAALATHPAVADCAVFGLPDDRLGQRVVAAVVGDGGPAPTLEA 314
>gnl|CDD|213323 cd12115, A_NRPS_Sfm_like, The adenylation domain of nonribosomal
peptide synthetases (NRPS), including Saframycin A gene
cluster from Streptomyces lavendulae. The adenylation
(A) domain of NRPS recognizes a specific amino acid or
hydroxy acid and activates it as an (amino) acyl
adenylate by hydrolysis of ATP. The activated acyl
moiety then forms a thioester to the enzyme-bound
cofactor phosphopantetheine of a peptidyl carrier
protein domain. NRPSs are large multifunctional enzymes
which synthesize many therapeutically useful peptides in
bacteria and fungi via a template-directed, nucleic acid
independent nonribosomal mechanism. These natural
products include antibiotics, immunosuppressants, plant
and animal toxins, and enzyme inhibitors. NRPS has a
distinct modular structure in which each module is
responsible for the recognition, activation, and in some
cases, modification of a single amino acid residue of
the final peptide product. The modules can be subdivided
into domains that catalyze specific biochemical
reactions. This family includes the saframycin A gene
cluster from Streptomyces lavendulae which implicates
the NRPS system for assembling the unusual tetrapeptidyl
skeleton in an iterative manner. It also includes
saframycin Mx1 produced by Myxococcus xanthus NRPS.
Length = 449
Score = 62.3 bits (152), Expect = 2e-11
Identities = 44/156 (28%), Positives = 70/156 (44%), Gaps = 22/156 (14%)
Query: 1 MSTGRAGAPTTVCDIRLVDWDEGNYRVCDKPFPR---GEILIGGSNISPGYYKNPTKTKE 57
S GR A T ++D + + P P GE+ IGG+ ++ GY P T E
Sbjct: 268 PSIGRPLAGTQA---YVLD---AHLQ----PVPVGVPGELYIGGAGVARGYLGRPELTAE 317
Query: 58 DF-----FDEEGKRWFRTGDIGELHQDGVIRIIDRKKDLVKLQAGEYVSLGKVEAELKTC 112
F G+R +RTGD+ DG + + R D VK++ G + LG++EA L++
Sbjct: 318 RFLPNPFAGSPGERLYRTGDLVRRRADGQLEYLGRIDDQVKVR-GFRIELGEIEAALRSI 376
Query: 113 PVIENICVY---GDSRKDYTVALVVPNPSRLKEIAE 145
P + V + VA +V +P +I +
Sbjct: 377 PGVAEAVVVAIGDAAGDRQLVAYIVADPGAAIDIED 412
>gnl|CDD|213287 cd05920, 23DHB-AMP_lg, 2,3-dihydroxybenzoate-AMP ligase.
2,3-dihydroxybenzoate-AMP ligase activates
2,3-dihydroxybenzoate (DHB) by ligation of AMP from ATP
with the release of pyrophosphate. However, it can also
catalyze the ATP-PPi exchange for 2,3-DHB analogs, such
as salicylic acid (o-hydroxybenzoate), as well as 2,4-DHB
and 2,5-DHB, but with less efficiency. Proteins in this
family are the stand-alone adenylation components of
non-ribosomal peptide synthases (NRPSs) involved in the
biosynthesis of siderophores, which are low molecular
weight iron-chelating compounds synthesized by many
bacteria to aid in the acquisition of this vital trace
element. In Escherichia coli, the
2,3-dihydroxybenzoate-AMP ligase is called EntE, the
adenylation component of the enterobactin NRPS system.
Length = 483
Score = 62.2 bits (152), Expect = 2e-11
Identities = 36/111 (32%), Positives = 56/111 (50%), Gaps = 14/111 (12%)
Query: 15 IRLVDWDEGNYRVCDKPFPRGEI---LIGGSNISPGYYKNPTKTKEDFFDEEGKRWFRTG 71
IR+VD D+G P P GE+ L G GYY+ P F ++G ++RTG
Sbjct: 320 IRIVD-DQGE------PVPPGEVGELLTRGPYTIRGYYRAPEHNARAF-TDDG--FYRTG 369
Query: 72 DIGELHQDGVIRIIDRKKDLVKLQAGEYVSLGKVEAELKTCPVIENICVYG 122
D+ + DG R++ R KD + + GE +S ++E L + P + + V G
Sbjct: 370 DLVRIDADGYYRVVGRIKDQIN-RGGEKISPEEIENLLLSHPAVADAAVVG 419
>gnl|CDD|213297 cd05931, FAAL, Fatty acyl-AMP ligase (FAAL). FAAL belongs to the
class I adenylate forming enzyme family and is
homologous to fatty acyl-coenzyme A (CoA) ligases
(FACLs). However, FAALs produce only the acyl adenylate
and are unable to perform the thioester-forming
reaction, while FACLs perform a two-step catalytic
reaction; AMP ligation followed by CoA ligation using
ATP and CoA as cofactors. FAALs have insertion motifs
between the N-terminal and C-terminal subdomains that
distinguish them from the FACLs. This insertion motif
precludes the binding of CoA, thus preventing CoA
ligation. It has been suggested that the acyl adenylates
serve as substrates for multifunctional polyketide
synthases to permit synthesis of complex lipids such as
phthiocerol dimycocerosate, sulfolipids, mycolic acids,
and mycobactin.
Length = 547
Score = 62.2 bits (152), Expect = 2e-11
Identities = 31/91 (34%), Positives = 44/91 (48%), Gaps = 11/91 (12%)
Query: 7 GAPTTVCDIRLVDWDEGNYRVCDKPFPR--GEILIGGSNISPGYYKNPTKTKEDFF---D 61
G P ++R+VD E R GEI + G +++ GY+ P T E F
Sbjct: 358 GRPLPGHEVRIVD-PETR-RPLP---DGEVGEIWVRGPSVAAGYWNRPEATAETFGARLA 412
Query: 62 EEGKRWFRTGDIGELHQDGVIRIIDRKKDLV 92
+ W RTGD+G LH DG + + R KDL+
Sbjct: 413 TDEGGWLRTGDLGFLH-DGELYVTGRLKDLI 442
>gnl|CDD|132249 TIGR03205, pimA, dicarboxylate--CoA ligase PimA. PimA, a member of
a large family of acyl-CoA ligases, is found in a
characteristic operon pimFABCDE for the metabolism of
pimelate and related compounds. It is found, so far, in
Bradyrhizobium japonicum and several strains of
Rhodopseudomonas palustris. PimA from R. palustris was
shown to be active as a CoA ligase for C(7) to C(14)
dicarboxylates and fatty acids.
Length = 541
Score = 61.9 bits (150), Expect = 3e-11
Identities = 39/129 (30%), Positives = 62/129 (48%), Gaps = 16/129 (12%)
Query: 30 KPFPRGE---ILIGGSNISPGYYKNPTKTKEDFFDEEGKRWFRTGDIGELHQDGVIRIID 86
K P GE + I G N++ GY+ P ++ E F + F TGDIG + DG ++D
Sbjct: 386 KVLPPGEVGELRIRGPNVTRGYWNRPEESAEAFVGDR----FLTGDIGYMDTDGYFFLVD 441
Query: 87 RKKDLVKLQAGEYVSLGKVEAELKTCPVIENICVYG---DSRKDYTVALVVPNPSRLKEI 143
RKKD++ + G V +E + P ++ + V G R + A V +L+
Sbjct: 442 RKKDMI-ISGGFNVYPQMIEQAIYEHPGVQEVIVIGIPDQYRGEAAKAFV-----KLRPG 495
Query: 144 AEKLNLDSL 152
A+ +LD L
Sbjct: 496 AKPFSLDEL 504
>gnl|CDD|178097 PLN02479, PLN02479, acetate-CoA ligase.
Length = 567
Score = 61.8 bits (150), Expect = 3e-11
Identities = 36/91 (39%), Positives = 50/91 (54%), Gaps = 10/91 (10%)
Query: 30 KPFPR-----GEILIGGSNISPGYYKNPTKTKEDFFDEEGKRWFRTGDIGELHQDGVIRI 84
KP P GEI++ G+ + GY KNP +E F G WF +GD+G H DG I I
Sbjct: 393 KPVPADGKTMGEIVMRGNMVMKGYLKNPKANEEAF--ANG--WFHSGDLGVKHPDGYIEI 448
Query: 85 IDRKKDLVKLQAGEYVSLGKVEAELKTCPVI 115
DR KD++ + GE +S +VE + T P +
Sbjct: 449 KDRSKDII-ISGGENISSLEVENVVYTHPAV 478
>gnl|CDD|213309 cd05944, FACL_like_4, Uncharacterized subfamily of fatty acid CoA
ligase (FACL). Fatty acyl-CoA ligases catalyze the
ATP-dependent activation of fatty acids in a two-step
reaction. The carboxylate substrate first reacts with
ATP to form an acyl-adenylate intermediate, which then
reacts with CoA to produce an acyl-CoA ester. This is a
required step before free fatty acids can participate in
most catabolic and anabolic reactions.
Length = 359
Score = 61.6 bits (150), Expect = 3e-11
Identities = 25/84 (29%), Positives = 36/84 (42%), Gaps = 5/84 (5%)
Query: 9 PTTVCDIRLVDWDEGNYRVCDKPFPRGEILIGGSNISPGYYKNPTKTKEDFFDEEGKRWF 68
P T + +D R C P G + I G N+ PGY N ++ W
Sbjct: 182 PYTRVRVAKLDAGGALGRDCA-PGEVGVLAIRGPNVFPGY-LNDAHNAGARLED---GWL 236
Query: 69 RTGDIGELHQDGVIRIIDRKKDLV 92
TGD+G + DG + + R KDL+
Sbjct: 237 NTGDLGRIDADGYLWLTGRAKDLI 260
>gnl|CDD|162605 TIGR01923, menE, O-succinylbenzoate-CoA ligase. This model
represents an enzyme, O-succinylbenzoate-CoA ligase,
which is involved in the fourth step of the menaquinone
biosynthesis pathway. O-succinylbenzoate-CoA ligase,
together with menB - naphthoate synthase, take
2-succinylbenzoate and convert it into 1,4-di-hydroxy-2-
naphthoate [Biosynthesis of cofactors, prosthetic groups,
and carriers, Menaquinone and ubiquinone].
Length = 436
Score = 61.3 bits (149), Expect = 4e-11
Identities = 40/138 (28%), Positives = 64/138 (46%), Gaps = 29/138 (21%)
Query: 1 MSTGRAGAPTTVCDIRL-VDWDEGNYRVCDKPFPRGEILIGGSNISPGYYKNPTKTKEDF 59
+ G P +I++ VD EG+ GEI++ G+N+ GY
Sbjct: 268 HARPDVGRPLAGREIKIKVDNKEGH----------GEIMVKGANLMKGYLYQ--GELTPA 315
Query: 60 FDEEGKRWFRTGDIGELHQDGVIRIIDRKKDLVKLQAGEYVSLGKVEAELKTCPVIENIC 119
F+++G WF TGDIGEL +G + ++ R+ DL+ + GE + ++E L P I+
Sbjct: 316 FEQQG--WFNTGDIGELDGEGFLYVLGRRDDLI-ISGGENIYPEEIETVLYQHPGIQ--- 369
Query: 120 VYGDSRKDYTVALVVPNP 137
A+VVP P
Sbjct: 370 ----------EAVVVPKP 377
>gnl|CDD|213286 cd05919, BCL_like, Benzoate CoA ligase (BCL) and similar adenylate
forming enzymes. This family contains benzoate CoA
ligase (BCL) and related ligases that catalyze the
acylation of benzoate derivatives, 2-aminobenzoate and
4-hydroxybenzoate. Aromatic compounds represent the
second most abundant class of organic carbon compounds
after carbohydrates. Xenobiotic aromatic compounds are
also a major class of man-made pollutants. Some bacteria
use benzoate as the sole source of carbon and energy
through benzoate degradation. Benzoate degradation
starts with its activation to benzoyl-CoA by benzoate
CoA ligase. The reaction catalyzed by benzoate CoA
ligase proceeds via a two-step process; the first
ATP-dependent step forms an acyl-AMP intermediate, and
the second step forms the acyl-CoA ester with release of
the AMP.
Length = 436
Score = 60.8 bits (148), Expect = 6e-11
Identities = 43/137 (31%), Positives = 62/137 (45%), Gaps = 12/137 (8%)
Query: 4 GRAGAPTTVCDIRLVDWDEGNYRVCDKPFPRGEILIGGSNISPGYYKNPTKTKEDFFDEE 63
G G P + RLVD + G G++ + G +++PGY+ P KT+ D
Sbjct: 260 GTTGRPVPGYEARLVD-ESGREVPAGTV---GDLWVRGPSLAPGYWNLPEKTQRTLRDG- 314
Query: 64 GKRWFRTGDIGELHQDGVIRIIDRKKDLVKLQAGEYVSLGKVEAELKTCPVIENICVYGD 123
W RTGD DG R R D++K+ +G++VS +VEA L P + V
Sbjct: 315 ---WLRTGDRFSRDADGWYRYQGRADDMIKV-SGQWVSPLEVEAALGEHPAVAEAAVVAV 370
Query: 124 SRKD---YTVALVVPNP 137
+D A VVP P
Sbjct: 371 PDEDGLVRLKAFVVPRP 387
>gnl|CDD|171961 PRK13295, PRK13295, cyclohexanecarboxylate-CoA ligase; Reviewed.
Length = 547
Score = 60.5 bits (147), Expect = 9e-11
Identities = 45/139 (32%), Positives = 66/139 (47%), Gaps = 19/139 (13%)
Query: 7 GAPTTVCDIRLVDWDEGNYRVCDKPFPRGEI---LIGGSNISPGYYKNPTKTKEDFFDEE 63
G P ++R+VD D P P G+I + G + GY K P D +
Sbjct: 369 GCPLPGVEVRVVDAD-------GAPLPAGQIGRLQVRGCSNFGGYLKRPQLNGTDA---D 418
Query: 64 GKRWFRTGDIGELHQDGVIRIIDRKKDLVKLQAGEYVSLGKVEAELKTCPVIENICV--Y 121
G WF TGD+ + DG IRI R KD++ ++ GE + + ++EA L P I + + Y
Sbjct: 419 G--WFDTGDLARIDADGYIRISGRSKDVI-IRGGENIPVVEIEALLYRHPAIAQVAIVAY 475
Query: 122 GDSR-KDYTVALVVPNPSR 139
D R + A VVP P +
Sbjct: 476 PDERLGERACAFVVPRPGQ 494
>gnl|CDD|213274 cd05906, A_NRPS_TubE_like, The adenylation domain (A domain) of a
family of nonribosomal peptide synthetases (NRPSs)
synthesizing toxins and antitumor agents. The
adenylation (A) domain of NRPS recognizes a specific
amino acid or hydroxy acid and activates it as an
(amino)-acyl adenylate by hydrolysis of ATP. The
activated acyl moiety then forms a thioester to the
enzyme-bound cofactor phosphopantetheine of a peptidyl
carrier protein domain. This family includes NRPSs that
synthesize toxins and antitumor agents; for example,
TubE for Tubulysine, CrpA for cryptophycin, TdiA for
terrequinone A, KtzG for kutzneride, and Vlm1/Vlm2 for
Valinomycin. Nonribosomal peptide synthetases are large
multifunctional enzymes which synthesize many
therapeutically useful peptides. NRPS has a distinct
modular structure in which each module is responsible
for the recognition, activation, and, in some cases,
modification of a single amino acid residue of the final
peptide product. The modules can be subdivided into
domains that catalyze specific biochemical reactions.
Length = 560
Score = 60.4 bits (147), Expect = 1e-10
Identities = 35/89 (39%), Positives = 47/89 (52%), Gaps = 14/89 (15%)
Query: 7 GAPTTVCDIRLVDWDEGNYRVCDKPFPRGEI---LIGGSNISPGYYKNPTKTKEDFFDEE 63
G P IR+VD D P GE+ + G ++ GYY+NP E F E+
Sbjct: 377 GRPIPGVAIRIVDED-------GALLPEGEVGRLQVRGPTVTSGYYRNPEANAE-AFTED 428
Query: 64 GKRWFRTGDIGELHQDGVIRIIDRKKDLV 92
G WFRTGD+G LH DG + I R+KD++
Sbjct: 429 G--WFRTGDLGFLH-DGRLTITGREKDMI 454
>gnl|CDD|235722 PRK06164, PRK06164, acyl-CoA synthetase; Validated.
Length = 540
Score = 59.8 bits (145), Expect = 2e-10
Identities = 37/125 (29%), Positives = 51/125 (40%), Gaps = 13/125 (10%)
Query: 22 EGNYRVCDKPFPR-------GEILIGGSNISPGYYKNPTKTKEDFFDEEGKRWFRTGDIG 74
E R D GEI I ++ GY NP T D+ +FRTGD+G
Sbjct: 358 EARVRARDPQDGALLPDGESGEIEIRAPSLMRGYLDNPDATARALTDDG---YFRTGDLG 414
Query: 75 ELHQDGVIRIIDRKKDLVKLQAGEYVSLGKVEAELKTCPVIENICVYGDSRKDYT--VAL 132
DG R D ++L G V+ ++E L+ P + V G +R T VA
Sbjct: 415 YTRGDGQFVYQTRMGDSLRL-GGFLVNPAEIEHALEALPGVAAAQVVGATRDGKTVPVAF 473
Query: 133 VVPNP 137
V+P
Sbjct: 474 VIPTD 478
>gnl|CDD|236096 PRK07787, PRK07787, acyl-CoA synthetase; Validated.
Length = 471
Score = 59.6 bits (145), Expect = 2e-10
Identities = 46/143 (32%), Positives = 64/143 (44%), Gaps = 22/143 (15%)
Query: 4 GRAGAPTTVCDIRLVDWDEGNYRVCDKPFPR-----GEILIGGSNISPGYYKNPTKTKED 58
G G P + RLVD ++G P P GE+ + G + GY P T
Sbjct: 293 GWVGLPLAGVETRLVD-EDGG------PVPHDGETVGELQVRGPTLFDGYLNRPDATAAA 345
Query: 59 FFDEEGKRWFRTGDIGELHQDGVIRIIDRKK-DLVKLQAGEY-VSLGKVEAELKTCPVIE 116
F +G WFRTGD+ + DG+ RI+ R+ DL+K +G Y + G++E L P +
Sbjct: 346 F-TADG--WFRTGDVAVVDPDGMHRIVGRESTDLIK--SGGYRIGAGEIETALLGHPGVR 400
Query: 117 NICVYGDSRKDY---TVALVVPN 136
V G D VA VV
Sbjct: 401 EAAVVGVPDDDLGQRIVAYVVGA 423
>gnl|CDD|180393 PRK06087, PRK06087, short chain acyl-CoA synthetase; Reviewed.
Length = 547
Score = 59.4 bits (144), Expect = 2e-10
Identities = 43/139 (30%), Positives = 62/139 (44%), Gaps = 17/139 (12%)
Query: 6 AGAPTTVCDIRLVDWDEGNYRVCDKPFPRG---EILIGGSNISPGYYKNPTKTKEDFFDE 62
G +I++VD K P G E G N+ GY P T DE
Sbjct: 357 DGYAAAGVEIKVVDEAR-------KTLPPGCEGEEASRGPNVFMGYLDEPELTAR-ALDE 408
Query: 63 EGKRWFRTGDIGELHQDGVIRIIDRKKDLVKLQAGEYVSLGKVEAELKTCPVIENICVYG 122
EG W+ +GD+ + + G I+I RKKD++ ++ GE +S +VE L P I + CV
Sbjct: 409 EG--WYYSGDLCRMDEAGYIKITGRKKDII-VRGGENISSREVEDILLQHPKIHDACVVA 465
Query: 123 --DSR-KDYTVALVVPNPS 138
D R + + A VV
Sbjct: 466 MPDERLGERSCAYVVLKAP 484
>gnl|CDD|235923 PRK07059, PRK07059, Long-chain-fatty-acid--CoA ligase; Validated.
Length = 557
Score = 59.3 bits (144), Expect = 2e-10
Identities = 44/133 (33%), Positives = 63/133 (47%), Gaps = 22/133 (16%)
Query: 9 PTTVCDIRLVDWDEGNYRVCDKPFPR-GEILIGGSNISPGYYKNPTKTK----EDFFDEE 63
P+T IR D D+GN D P GEI I G + GY+ P +T D F
Sbjct: 387 PSTEVSIR--D-DDGN----DLPLGEPGEICIRGPQVMAGYWNRPDETAKVMTADGF--- 436
Query: 64 GKRWFRTGDIGELHQDGVIRIIDRKKDLVKLQAGEYVSLGKVEAELKTCPVIENICVYG- 122
FRTGD+G + + G +I+DRKKD++ L +G V ++E + + P + + G
Sbjct: 437 ----FRTGDVGVMDERGYTKIVDRKKDMI-LVSGFNVYPNEIEEVVASHPGVLEVAAVGV 491
Query: 123 -DSRKDYTVALVV 134
D V L V
Sbjct: 492 PDEHSGEAVKLFV 504
>gnl|CDD|237145 PRK12583, PRK12583, acyl-CoA synthetase; Provisional.
Length = 558
Score = 59.4 bits (144), Expect = 2e-10
Identities = 35/106 (33%), Positives = 58/106 (54%), Gaps = 7/106 (6%)
Query: 35 GEILIGGSNISPGYYKNPTKTKEDFFDEEGKRWFRTGDIGELHQDGVIRIIDRKKDLVKL 94
GE+ G ++ GY+ NP T E DE+G W TGD+ + + G +RI+ R KD++ +
Sbjct: 400 GELCTRGYSVMKGYWNNPEATAESI-DEDG--WMHTGDLATMDEQGYVRIVGRSKDMI-I 455
Query: 95 QAGEYVSLGKVEAELKTCPVIENICVYGDSRKDY---TVALVVPNP 137
+ GE + ++E L T P + ++ V+G + Y VA V +P
Sbjct: 456 RGGENIYPREIEEFLFTHPAVADVQVFGVPDEKYGEEIVAWVRLHP 501
>gnl|CDD|237108 PRK12467, PRK12467, peptide synthase; Provisional.
Length = 3956
Score = 59.4 bits (144), Expect = 3e-10
Identities = 40/108 (37%), Positives = 56/108 (51%), Gaps = 7/108 (6%)
Query: 35 GEILIGGSNISPGYYKNPTKTKEDF----FDEEGKRWFRTGDIGELHQDGVIRIIDRKKD 90
GE+ IGG ++ GY++ P+ T E F F G R +RTGD+ DGVI + R
Sbjct: 3436 GELYIGGVGLARGYHQRPSLTAERFVADPFSGSGGRLYRTGDLARYRADGVIEYLGRIDH 3495
Query: 91 LVKLQAGEYVSLGKVEAELKTCP-VIENICVYGDSRK-DYTVALVVPN 136
VK++ G + LG++EA L P V E + + D VA VVP
Sbjct: 3496 QVKIR-GFRIELGEIEARLLQHPSVREAVVLARDGAGGKQLVAYVVPA 3542
Score = 57.5 bits (139), Expect = 1e-09
Identities = 37/107 (34%), Positives = 54/107 (50%), Gaps = 7/107 (6%)
Query: 35 GEILIGGSNISPGYYKNPTKTKEDF----FDEEGKRWFRTGDIGELHQDGVIRIIDRKKD 90
GE+ IGG+ ++ GY++ P T E F F +G R +RTGD+ DGVI + R
Sbjct: 855 GELYIGGAGLARGYHRRPALTAERFVPDPFGADGGRLYRTGDLARYRADGVIEYLGRMDH 914
Query: 91 LVKLQAGEYVSLGKVEAELKTCPVIENICV--YGDSRKDYTVALVVP 135
VK++ G + LG++EA L P + V VA +VP
Sbjct: 915 QVKIR-GFRIELGEIEARLLAQPGVREAVVLAQPGDAGLQLVAYLVP 960
Score = 56.7 bits (137), Expect = 2e-09
Identities = 42/125 (33%), Positives = 58/125 (46%), Gaps = 12/125 (9%)
Query: 31 PFPR---GEILIGGSNISPGYYKNPTKTKEDF----FDEEGKRWFRTGDIGELHQDGVIR 83
P P GE+ +GG ++ GY P T E F F G R +RTGD+ DGVI
Sbjct: 1912 PVPIGVAGELYLGGVGLARGYLNRPALTAERFVADPFGTVGSRLYRTGDLARYRADGVIE 1971
Query: 84 IIDRKKDLVKLQAGEYVSLGKVEAELKTCPVIENICVY---GDSRKDYTVALVVPNPSRL 140
+ R VK++ G + LG++EA L+ + V G + K VA VVP L
Sbjct: 1972 YLGRIDHQVKIR-GFRIELGEIEARLREQGGVREAVVIAQDGANGKQL-VAYVVPTDPGL 2029
Query: 141 KEIAE 145
+ E
Sbjct: 2030 VDDDE 2034
>gnl|CDD|180988 PRK07470, PRK07470, acyl-CoA synthetase; Validated.
Length = 528
Score = 58.1 bits (141), Expect = 5e-10
Identities = 43/140 (30%), Positives = 60/140 (42%), Gaps = 32/140 (22%)
Query: 4 GRAGAPTTVCDIRLVDWDEGNYRVCDKPFPRGEILIGGSNISPGYYKNPTKTKEDFFDEE 63
G G T ++++ D DEG P GEI + G + GYY NP + F D
Sbjct: 340 GTCGFERTGMEVQIQD-DEGREL---PPGETGEICVIGPAVFAGYYNNPEANAKAFRDG- 394
Query: 64 GKRWFRTGDIGELHQDGVIRIIDRKKDLVKLQAGEYVSLG------KVEAELKTCPVIEN 117
WFRTGD+G L G + I R D+ Y+S G ++E +L T P +
Sbjct: 395 ---WFRTGDLGHLDARGFLYITGRASDM-------YISGGSNVYPREIEEKLLTHPAVSE 444
Query: 118 ICVYGDSRKDYTVALVVPNP 137
+ V G VP+P
Sbjct: 445 VAVLG-----------VPDP 453
>gnl|CDD|180167 PRK05620, PRK05620, long-chain-fatty-acid--CoA ligase; Validated.
Length = 576
Score = 58.3 bits (141), Expect = 6e-10
Identities = 35/133 (26%), Positives = 61/133 (45%), Gaps = 23/133 (17%)
Query: 35 GEILIGGSNISPGYYKNPTKTK---------------EDFFDEEGKRWFRTGDIGELHQD 79
GEI + G+ ++ YY +PT+ D F +G W RTGD+G + +D
Sbjct: 386 GEIQVRGNWVTASYYHSPTEEGGGAASTFRGEDVEDANDRFTADG--WLRTGDVGSVTRD 443
Query: 80 GVIRIIDRKKDLVKLQAGEYVSLGKVEAELKTCPVIENICVYGDSRKDY-----TVALVV 134
G + I DR +D+++ GE++ ++E + P + V G + V ++
Sbjct: 444 GFLTIHDRARDVIR-SGGEWIYSAQLENYIMAAPEVVECAVIGYPDDKWGERPLAVTVLA 502
Query: 135 PNPSRLKEIAEKL 147
P +E AE+L
Sbjct: 503 PGIEPTRETAERL 515
>gnl|CDD|215137 PLN02246, PLN02246, 4-coumarate--CoA ligase.
Length = 537
Score = 58.1 bits (141), Expect = 6e-10
Identities = 50/150 (33%), Positives = 73/150 (48%), Gaps = 24/150 (16%)
Query: 5 RAGAPTTV---CDIRLVDWDEGNYRVCDKPFPR---GEILIGGSNISPGYYKNPTKTKED 58
++G+ TV ++++VD + G PR GEI I G I GY +P T
Sbjct: 354 KSGSCGTVVRNAELKIVDPETG------ASLPRNQPGEICIRGPQIMKGYLNDPEATANT 407
Query: 59 FFDEEGKRWFRTGDIGELHQDGVIRIIDRKKDLVKLQAGEYVSLGKVEAELKTCPVIENI 118
D++G W TGDIG + D + I+DR K+L+K + G V+ ++EA L + P I +
Sbjct: 408 I-DKDG--WLHTGDIGYIDDDDELFIVDRLKELIKYK-GFQVAPAELEALLISHPSIADA 463
Query: 119 CVYGDSRKDYT-----VALVV-PNPSRLKE 142
V KD VA VV N S + E
Sbjct: 464 AVVP--MKDEVAGEVPVAFVVRSNGSEITE 491
>gnl|CDD|215189 PLN02330, PLN02330, 4-coumarate--CoA ligase-like 1.
Length = 546
Score = 56.9 bits (137), Expect = 1e-09
Identities = 43/128 (33%), Positives = 65/128 (50%), Gaps = 10/128 (7%)
Query: 14 DIRLVDWDEGNYRVCDKPFPRGEILIGGSNISPGYYKNPTKTKEDFFDEEGKRWFRTGDI 73
+++ +D D G + P GE+ + + GYY N +T DE+G W TGDI
Sbjct: 371 EVKFIDPDTGRSLPKNTP---GELCVRSQCVMQGYYNNKEETDRTI-DEDG--WLHTGDI 424
Query: 74 GELHQDGVIRIIDRKKDLVKLQAGEYVSLGKVEAELKTCPVIENICVY---GDSRKDYTV 130
G + DG I I+DR K+L+K + G V+ ++EA L T P +E+ V + +
Sbjct: 425 GYIDDDGDIFIVDRIKELIKYK-GFQVAPAELEAILLTHPSVEDAAVVPLPDEEAGEIPA 483
Query: 131 ALVVPNPS 138
A VV NP
Sbjct: 484 ACVVINPK 491
>gnl|CDD|181195 PRK08008, caiC, putative crotonobetaine/carnitine-CoA ligase;
Validated.
Length = 517
Score = 57.0 bits (138), Expect = 1e-09
Identities = 38/109 (34%), Positives = 56/109 (51%), Gaps = 10/109 (9%)
Query: 35 GEILIGG---SNISPGYYKNPTKTKEDFFDEEGKRWFRTGDIGELHQDGVIRIIDRKKDL 91
GEI I G I YY +P T + E W TGD G + ++G +DR+ ++
Sbjct: 367 GEICIKGVPGKTIFKEYYLDPKATAKVL---EADGWLHTGDTGYVDEEGFFYFVDRRCNM 423
Query: 92 VKLQAGEYVSLGKVEAELKTCPVIENICVYG--DSRKDYTV-ALVVPNP 137
+K + GE VS ++E + T P I++I V G DS +D + A VV N
Sbjct: 424 IK-RGGENVSCVELENIIATHPKIQDIVVVGIKDSIRDEAIKAFVVLNE 471
>gnl|CDD|235724 PRK06178, PRK06178, acyl-CoA synthetase; Validated.
Length = 567
Score = 56.6 bits (137), Expect = 2e-09
Identities = 39/137 (28%), Positives = 62/137 (45%), Gaps = 17/137 (12%)
Query: 7 GAPTTVCDIRLVDWDEGNYRVCDKPFP---RGEILIGGSNISPGYYKNPTKTKEDFFDEE 63
G P + ++ D++ G + P GEI++ ++ GY+ P T E D
Sbjct: 390 GLPVPGTEFKICDFETG------ELLPLGAEGEIVVRTPSLLKGYWNKPEATAEALRDG- 442
Query: 64 GKRWFRTGDIGELHQDGVIRIIDRKKDLVKLQAGEYVSLGKVEAELKTCPVIENICVYG- 122
W TGDIG++ + G + + R+K+++K+ G V +VEA L P + V G
Sbjct: 443 ---WLHTGDIGKIDEQGFLHYLGRRKEMLKVN-GMSVFPSEVEALLGQHPAVLGSAVVGR 498
Query: 123 -DSRK-DYTVALVVPNP 137
D K VA V P
Sbjct: 499 PDPDKGQVPVAFVQLKP 515
>gnl|CDD|223442 COG0365, Acs, Acyl-coenzyme A synthetases/AMP-(fatty) acid ligases
[Lipid metabolism].
Length = 528
Score = 56.1 bits (136), Expect = 3e-09
Identities = 43/146 (29%), Positives = 66/146 (45%), Gaps = 26/146 (17%)
Query: 14 DIRLVDWDEGN-------YRVCDKPFPRGEILIGGSNISPGYYKNPTKTKEDFFDEEGKR 66
+R VD DEGN V P+P ++ Y+ +P + KE +F R
Sbjct: 351 AVRRVD-DEGNPVPPGVGELVVRLPWPG---------MALTYWNDPERYKEAYFG----R 396
Query: 67 WFRTGDIGELHQDGVIRIIDRKKDLVKLQAGEYVSLGKVEAELKTCP-VIENICV-YGDS 124
W+RTGD E +DG + R D++K+ +G+ + ++E+ L P V E V D
Sbjct: 397 WYRTGDWAERDEDGYFWLHGRSDDVIKV-SGKRIGPLEIESVLLAHPAVAEAAVVGVPDP 455
Query: 125 RKDYTV-ALVVPNPS-RLKEIAEKLN 148
K V A VV E+AE++
Sbjct: 456 GKGQIVLAFVVLAAGVEPNELAEEIR 481
>gnl|CDD|235673 PRK06018, PRK06018, putative acyl-CoA synthetase; Provisional.
Length = 542
Score = 55.9 bits (135), Expect = 3e-09
Identities = 29/95 (30%), Positives = 49/95 (51%), Gaps = 12/95 (12%)
Query: 8 APTTVCDIRLVDWDEGNYRVCD-KPFPRGEILIGGSNISPGYYKNPTKTKEDFFDEEGKR 66
P V ++++ D D G D K F G + + G ++ YY+ + D++G
Sbjct: 361 PPFGV-EMKITD-DAGKELPWDGKTF--GRLKVRGPAVAAAYYRVDGE----ILDDDG-- 410
Query: 67 WFRTGDIGELHQDGVIRIIDRKKDLVKLQAGEYVS 101
+F TGD+ + G +RI DR KD++K GE++S
Sbjct: 411 FFDTGDVATIDAYGYMRITDRSKDVIK-SGGEWIS 444
>gnl|CDD|213291 cd05924, FACL_like_5, Uncharacterized subfamily of fatty acid CoA
ligase (FACL). Fatty acyl-CoA ligases catalyze the
ATP-dependent activation of fatty acids in a two-step
reaction. The carboxylate substrate first reacts with
ATP to form an acyl-adenylate intermediate, which then
reacts with CoA to produce an acyl-CoA ester. This is a
required step before free fatty acids can participate in
most catabolic and anabolic reactions.
Length = 365
Score = 55.8 bits (135), Expect = 3e-09
Identities = 31/90 (34%), Positives = 43/90 (47%), Gaps = 3/90 (3%)
Query: 47 GYYKNPTKTKEDFFDEEGKRWFRTGDIGELHQDGVIRIIDRKKDLVKLQAGEYVSLGKVE 106
GYYK+P KT + F + +G R+ GD + DG I ++ R + GE V +VE
Sbjct: 227 GYYKDPEKTAKTFREIDGVRYAVPGDFARVEADGTITLLGRGSVCIN-TGGEKVFPEEVE 285
Query: 107 AELKTCPVIENICVYG--DSRKDYTVALVV 134
LK P + + V G D R V VV
Sbjct: 286 EALKAHPDVADALVVGVPDERWGQRVVAVV 315
>gnl|CDD|132252 TIGR03208, cyc_hxne_CoA_lg, cyclohexanecarboxylate-CoA ligase.
Members of this protein family are
cyclohexanecarboxylate-CoA ligase. This enzyme prepares
the aliphatic ring compound, cyclohexanecarboxylate, for
dehydrogenation and then degradation by a pathway also
used in benzoyl-CoA degradation in Rhodopseudomonas
palustris.
Length = 538
Score = 55.7 bits (134), Expect = 3e-09
Identities = 39/137 (28%), Positives = 64/137 (46%), Gaps = 19/137 (13%)
Query: 7 GAPTTVCDIRLVDWDEGNYRVCDKPFPRGEI---LIGGSNISPGYYKNPTKTKEDFFDEE 63
G P +++++D + +GE L+ G + GY K P + D E
Sbjct: 367 GRPLPGVEVKVIDAN-------GAKLSQGETGRLLVRGCSNFGGYLKRP---HLNSTDAE 416
Query: 64 GKRWFRTGDIGELHQDGVIRIIDRKKDLVKLQAGEYVSLGKVEAELKTCPVIENICV--Y 121
G WF TGD+ +G IRI R KD++ ++ GE + + ++E L P + + + Y
Sbjct: 417 G--WFDTGDLAFQDAEGYIRINGRSKDVI-IRGGENIPVVEIENLLYQHPAVAQVAIVAY 473
Query: 122 GDSR-KDYTVALVVPNP 137
D R + A+VVP P
Sbjct: 474 PDERLGERACAVVVPKP 490
>gnl|CDD|235731 PRK06188, PRK06188, acyl-CoA synthetase; Validated.
Length = 524
Score = 55.8 bits (135), Expect = 4e-09
Identities = 43/144 (29%), Positives = 62/144 (43%), Gaps = 30/144 (20%)
Query: 6 AGAPTTVCDIRLVDWDEGNYRVCDKPFPR---GEILIGGSNISPGYYKNPTKTKEDFFDE 62
G PT + L+D D + + GEI + G + GY+ P +T E F D
Sbjct: 341 CGRPTPGLRVALLDED-------GREVAQGEVGEICVRGPLVMDGYWNRPEETAEAFRDG 393
Query: 63 EGKRWFRTGDIGELHQDGVIRIIDRKKDLVKLQAGEYVSLG------KVEAELKTCPVIE 116
W TGD+ +DG I+DRKKD++ V+ G +VE L P +
Sbjct: 394 ----WLHTGDVAREDEDGFYYIVDRKKDMI-------VTGGFNVFPREVEDVLAEHPAVA 442
Query: 117 NICVYG--DSRKDYTV-ALVVPNP 137
+ V G D + V A+VV P
Sbjct: 443 QVAVIGVPDEKWGEAVTAVVVLRP 466
>gnl|CDD|215312 PLN02574, PLN02574, 4-coumarate--CoA ligase-like.
Length = 560
Score = 55.2 bits (133), Expect = 5e-09
Identities = 41/125 (32%), Positives = 61/125 (48%), Gaps = 10/125 (8%)
Query: 16 RLVDWDEGNYRVCDKPFPRGEILIGGSNISPGYYKNPTKTKEDFFDEEGKRWFRTGDIGE 75
++VDW G P GE+ I G + GY NP T+ D++G W RTGDI
Sbjct: 386 KVVDWSTGC---LLPPGNCGELWIQGPGVMKGYLNNPKATQSTI-DKDG--WLRTGDIAY 439
Query: 76 LHQDGVIRIIDRKKDLVKLQAGEYVSLGKVEAELKTCPVIENICVYGDSRK---DYTVAL 132
+DG + I+DR K+++K + G ++ +EA L + P I + V K + VA
Sbjct: 440 FDEDGYLYIVDRLKEIIKYK-GFQIAPADLEAVLISHPEIIDAAVTAVPDKECGEIPVAF 498
Query: 133 VVPNP 137
VV
Sbjct: 499 VVRRQ 503
>gnl|CDD|181546 PRK08751, PRK08751, putative long-chain fatty acyl CoA ligase;
Provisional.
Length = 560
Score = 55.3 bits (133), Expect = 5e-09
Identities = 32/102 (31%), Positives = 53/102 (51%), Gaps = 6/102 (5%)
Query: 35 GEILIGGSNISPGYYKNPTKTKEDFFDEEGKRWFRTGDIGELHQDGVIRIIDRKKDLVKL 94
GE+ I G + GY+K P +T + D +G W TGDI + + G + I+DRKKD++ L
Sbjct: 409 GELCIKGPQVMKGYWKRPEETAK-VMDADG--WLHTGDIARMDEQGFVYIVDRKKDMI-L 464
Query: 95 QAGEYVSLGKVEAELKTCPVIENICVYG--DSRKDYTVALVV 134
+G V ++E + P + + G D + V +V+
Sbjct: 465 VSGFNVYPNEIEDVIAMMPGVLEVAAVGVPDEKSGEIVKVVI 506
>gnl|CDD|213306 cd05940, FATP_FACS, Fatty acid transport proteins (FATP) play dual
roles as fatty acid transporters and its activation
enzymes. Fatty acid transport protein (FATP) transports
long-chain or very-long-chain fatty acids across the
plasma membrane. FATPs also have fatty acid CoA
synthetase activity, thus playing dual roles as fatty
acid transporters and its activation enzymes. At least
five copies of FATPs are identified in mammalian cells.
This family also includes prokaryotic FATPs. FATPs are
the key players in the trafficking of exogenous fatty
acids into the cell and in intracellular fatty acid
homeostasis.
Length = 444
Score = 54.9 bits (133), Expect = 7e-09
Identities = 36/130 (27%), Positives = 50/130 (38%), Gaps = 30/130 (23%)
Query: 14 DIRLVDWD----------EGNYRVCDKPFPRGEI--LIGGSNISP-----GYYKNPTKTK 56
I +V +D G P GE+ L+G I+ GY + K
Sbjct: 253 AIAVVKYDVETEEPIRDANGFCIKV----PPGEVGLLLG--EITDRNPFDGYTDDEATEK 306
Query: 57 EDFFD--EEGKRWFRTGDIGELHQDGVIRIIDRKKDL--VKLQAGEYVSLGKVEAELKTC 112
+ D ++G +F TGD+ G +DR D K GE VS +VE L
Sbjct: 307 KILRDVFKKGDAYFNTGDLVRRDGFGYFYFVDRLGDTFRWK---GENVSTTEVEEVLAKH 363
Query: 113 PVIENICVYG 122
P +E VYG
Sbjct: 364 PGVEEANVYG 373
>gnl|CDD|235564 PRK05691, PRK05691, peptide synthase; Validated.
Length = 4334
Score = 55.2 bits (133), Expect = 8e-09
Identities = 30/90 (33%), Positives = 49/90 (54%), Gaps = 5/90 (5%)
Query: 35 GEILIGGSNISPGYYKNPTKTKEDF----FDEEGKRWFRTGDIGELHQDGVIRIIDRKKD 90
GE+ +GG+ ++ GY+ P T E F F +G R +RTGD+ L DG++ + R
Sbjct: 2533 GELYVGGAGLAQGYHDRPGLTAERFVADPFAADGGRLYRTGDLVRLRADGLVEYVGRIDH 2592
Query: 91 LVKLQAGEYVSLGKVEAELKTCPVIENICV 120
VK++ G + LG++E+ L P + V
Sbjct: 2593 QVKIR-GFRIELGEIESRLLEHPAVREAVV 2621
Score = 54.4 bits (131), Expect = 1e-08
Identities = 23/58 (39%), Positives = 38/58 (65%), Gaps = 1/58 (1%)
Query: 35 GEILIGGSNISPGYYKNPTKTKEDFFDEEGKRWFRTGDIGELHQDGVIRIIDRKKDLV 92
GEI G +I+ GY++NP + + F + +G+ W RTGD+G L +DG + + R KD++
Sbjct: 398 GEIWASGPSIAHGYWRNPEASAKTFVEHDGRTWLRTGDLGFL-RDGELFVTGRLKDML 454
Score = 53.6 bits (129), Expect = 3e-08
Identities = 34/105 (32%), Positives = 48/105 (45%), Gaps = 11/105 (10%)
Query: 26 RVCD---KPFPRG---EILIGGSNISPGYYKNPTKTKEDF----FDEEGKRWFRTGDIGE 75
RV D P G E+ IGG+ ++ GY P T E F E+G R +RTGD
Sbjct: 1455 RVLDAELNLLPPGVAGELCIGGAGLARGYLGRPALTAERFVPDPLGEDGARLYRTGDRAR 1514
Query: 76 LHQDGVIRIIDRKKDLVKLQAGEYVSLGKVEAELKTCPVIENICV 120
+ DG + + R VKL+ G V +++A L P + V
Sbjct: 1515 WNADGALEYLGRLDQQVKLR-GFRVEPEEIQARLLAQPGVAQAAV 1558
Score = 49.4 bits (118), Expect = 6e-07
Identities = 31/107 (28%), Positives = 50/107 (46%), Gaps = 7/107 (6%)
Query: 35 GEILIGGSNISPGYYKNPTKTKEDF----FDEEGKRWFRTGDIGELHQDGVIRIIDRKKD 90
GE+ + G+ + GY +P +T F F G+R +RTGD+ DGV+ + R
Sbjct: 4067 GELCVAGTGVGRGYVGDPLRTALAFVPHPFGAPGERLYRTGDLARRRSDGVLEYVGRIDH 4126
Query: 91 LVKLQAGEYVSLGKVEAELKTCPVIENICVYGDSR--KDYTVALVVP 135
VK++ G + LG++EA L + V + V +VP
Sbjct: 4127 QVKIR-GYRIELGEIEARLHEQAEVREAAVAVQEGVNGKHLVGYLVP 4172
>gnl|CDD|213312 cd05959, BCL_4HBCL, Benzoate CoA ligase (BCL) and
4-Hydroxybenzoate-Coenzyme A Ligase (4-HBA-CoA ligase).
Benzoate CoA ligase and 4-hydroxybenzoate-coenzyme A
ligase catalyze the first activating step for benzoate
and 4-hydroxybenzoate catabolic pathways, respectively.
Although these two enzymes share very high sequence
homology, they have their own substrate preference. The
reaction proceeds via a two-step process; the first
ATP-dependent step forms the substrate-AMP intermediate,
while the second step forms the acyl-CoA ester,
releasing the AMP. Aromatic compounds represent the
second most abundant class of organic carbon compounds
after carbohydrates. Some bacteria can use benzoic acid
or benzenoid compounds as the sole source of carbon and
energy through degradation. Benzoate CoA ligase and
4-hydroxybenzoate-Coenzyme A ligase are key enzymes of
this process.
Length = 506
Score = 54.6 bits (132), Expect = 9e-09
Identities = 41/140 (29%), Positives = 61/140 (43%), Gaps = 18/140 (12%)
Query: 4 GRAGAPTTVCDIRLVDWDEGNYRVCDKPFPRGEI---LIGGSNISPGYYKNPTKTKEDFF 60
G +G P +++LVD ++G GEI + G + + GY+ KT+E F
Sbjct: 330 GTSGKPVPGYEVKLVD-EDGE------EVADGEIGELWVRGDSSAAGYWNRREKTRETFV 382
Query: 61 DEEGKRWFRTGDIGELHQDGVIRIIDRKKDLVKLQAGEYVSLGKVEAELKTCPVIENICV 120
E W RTGD +DG R D++K+ +G +VS +VE L P + V
Sbjct: 383 GE----WTRTGDKYYRDEDGYYWYCGRSDDMLKV-SGIWVSPFEVEDALLQHPAVLEAAV 437
Query: 121 YGDSRKD---YTVALVVPNP 137
G +D A VV
Sbjct: 438 VGAEDEDGLTKPKAFVVLKD 457
>gnl|CDD|215464 PLN02860, PLN02860, o-succinylbenzoate-CoA ligase.
Length = 563
Score = 54.4 bits (131), Expect = 1e-08
Identities = 45/167 (26%), Positives = 75/167 (44%), Gaps = 23/167 (13%)
Query: 26 RVC-DKPFPRGEILIGGSNISPGYYKNPTKTKEDFFDEEGKRWFRTGDIGELHQDGVIRI 84
++ D+ G IL G ++ GY+ ++T ++ W TGDIG + + G + +
Sbjct: 376 KIGLDESSRVGRILTRGPHVMLGYWGQNSETASVLSNDG---WLDTGDIGWIDKAGNLWL 432
Query: 85 IDRKKDLVKLQAGEYVSLGKVEAELKTCPVIENICVYG--DSRKDYTVALVVPNPSRLKE 142
I R D +K GE V +VEA L P + ++ V G DSR V V RL++
Sbjct: 433 IGRSNDRIK-TGGENVYPEEVEAVLSQHPGVASVVVVGVPDSRLTEMVVACV----RLRD 487
Query: 143 IAEKLNLDSLTFQQLCTHPEIEKQVLV---ELQEHGKKSNLERFEIP 186
+ + +K + + L+ H ++ NL RF+IP
Sbjct: 488 GWIWSDNE---------KENAKKNLTLSSETLRHHCREKNLSRFKIP 525
>gnl|CDD|236236 PRK08315, PRK08315, AMP-binding domain protein; Validated.
Length = 559
Score = 53.3 bits (129), Expect = 2e-08
Identities = 28/92 (30%), Positives = 47/92 (51%), Gaps = 18/92 (19%)
Query: 10 TTV------CDIRLVDWDEGNYRVCDKPFPR---GEILIGGSNISPGYYKNPTKTKEDFF 60
TTV ++++VD + G PR GE+ G ++ GY+ +P KT E
Sbjct: 371 TTVGRALPHLEVKIVDPETGE------TVPRGEQGELCTRGYSVMKGYWNDPEKTAE-AI 423
Query: 61 DEEGKRWFRTGDIGELHQDGVIRIIDRKKDLV 92
D +G W TGD+ + ++G + I+ R KD++
Sbjct: 424 DADG--WMHTGDLAVMDEEGYVNIVGRIKDMI 453
>gnl|CDD|213278 cd05910, FACL_like_1, Uncharacterized subfamily of fatty acid CoA
ligase (FACL). Fatty acyl-CoA ligases catalyze the
ATP-dependent activation of fatty acids in a two-step
reaction. The carboxylate substrate first reacts with
ATP to form an acyl-adenylate intermediate, which then
reacts with CoA to produce an acyl-CoA ester. This is a
required step before free fatty acids can participate in
most catabolic and anabolic reactions.
Length = 455
Score = 52.8 bits (127), Expect = 3e-08
Identities = 33/133 (24%), Positives = 49/133 (36%), Gaps = 3/133 (2%)
Query: 35 GEILIGGSNISPGYYKNPTKTKEDFFDEEGKRWFRTGDIGELHQDGVIRIIDRKKDLVKL 94
GEI++ G +++ YY P TK + + W R GD+G G + RK V+
Sbjct: 299 GEIIVSGPHVTREYYNRPEATKLAKISDGNRIWHRMGDLGYFDDQGRLWFCGRKAHRVE- 357
Query: 95 QAGEYVSLGKVEAELKTCPVIENICVYG--DSRKDYTVALVVPNPSRLKEIAEKLNLDSL 152
AG + VE P + + G V +V P P L
Sbjct: 358 TAGGTLFTVPVEQVFNRHPGVRRSALVGVGKPGTQAPVLVVEPMPPVLDRRKRLEGRLRA 417
Query: 153 TFQQLCTHPEIEK 165
T +Q EI+
Sbjct: 418 TAKQAPATVEIQA 430
>gnl|CDD|236043 PRK07529, PRK07529, AMP-binding domain protein; Validated.
Length = 632
Score = 52.3 bits (126), Expect = 5e-08
Identities = 27/91 (29%), Positives = 39/91 (42%), Gaps = 7/91 (7%)
Query: 4 GRAGAPTTVCDIRLVDWDE-GNY-RVCDKPFPRGEILIGGSNISPGYYKNPTKTKEDFFD 61
G G +R+V D+ G Y R C G + I G N+ GY K + +
Sbjct: 386 GSVGLRLPYQRVRVVILDDAGRYLRDCA-VDEVGVLCIAGPNVFSGYL-EAAHNKGLWLE 443
Query: 62 EEGKRWFRTGDIGELHQDGVIRIIDRKKDLV 92
+ W TGD+G + DG + R KDL+
Sbjct: 444 D---GWLNTGDLGRIDADGYFWLTGRAKDLI 471
>gnl|CDD|233803 TIGR02262, benz_CoA_lig, benzoate-CoA ligase family. Characterized
members of this protein family include benzoate-CoA
ligase, 4-hydroxybenzoate-CoA ligase,
2-aminobenzoate-CoA ligase, etc. Members are related to
fatty acid and acetate CoA ligases.
Length = 508
Score = 51.4 bits (123), Expect = 9e-08
Identities = 42/137 (30%), Positives = 64/137 (46%), Gaps = 12/137 (8%)
Query: 4 GRAGAPTTVCDIRLVDWDEGNYRVCDKPFPRGEILIGGSNISPGYYKNPTKTKEDFFDEE 63
G +G P +RLV D G +P GE+LI G + + Y+ N KT++ F E
Sbjct: 330 GTSGKPVPGYRLRLVG-DGGQDVAAGEP---GELLISGPSSATMYWNNRAKTRDTFQGE- 384
Query: 64 GKRWFRTGDIGELHQDGVIRIIDRKKDLVKLQAGEYVSLGKVEAELKTCPVIENICVYGD 123
W R+GD + DG R D++K+ +G YVS ++E+ L P + V G
Sbjct: 385 ---WTRSGDKYVRNDDGSYTYAGRTDDMLKV-SGIYVSPFEIESALIQHPAVLEAAVVGV 440
Query: 124 SRKDYTV---ALVVPNP 137
+D + A +V P
Sbjct: 441 EDEDGLIKPKAFIVLRP 457
>gnl|CDD|236100 PRK07798, PRK07798, acyl-CoA synthetase; Validated.
Length = 533
Score = 51.4 bits (124), Expect = 1e-07
Identities = 34/106 (32%), Positives = 45/106 (42%), Gaps = 15/106 (14%)
Query: 33 PRGEILIGGSNISPGYYKNPTKTKEDFFDEEGKRWFRTGDIGELHQDGVIRIIDRKKDLV 92
G I G +I GYYK+P KT E F +G R+ GD + DG I ++ R +
Sbjct: 376 EIGWIARRG-HIPLGYYKDPEKTAETFPTIDGVRYAIPGDRARVEADGTITLLGR-GSVC 433
Query: 93 KLQAGEYVSLGKVEAELKTCPVIENICVYGDSRKDYTVALVVPNPS 138
GE V +VE LK P + + ALVV P
Sbjct: 434 INTGGEKVFPEEVEEALKAHPDVAD-------------ALVVGVPD 466
>gnl|CDD|235865 PRK06814, PRK06814, acylglycerophosphoethanolamine acyltransferase;
Provisional.
Length = 1140
Score = 51.5 bits (124), Expect = 1e-07
Identities = 26/76 (34%), Positives = 38/76 (50%), Gaps = 8/76 (10%)
Query: 35 GEILIGGSNISPGYYK--NPTKTKEDFFDEEGKRWFRTGDIGELHQDGVIRIIDRKKDLV 92
G + + G N+ GY + NP + +G W+ TGDI + ++G I I R K
Sbjct: 982 GRLFVRGPNVMLGYLRAENPGVLEPP---ADG--WYDTGDIVTIDEEGFITIKGRAKRFA 1036
Query: 93 KLQAGEYVSLGKVEAE 108
K+ AGE +SL VE
Sbjct: 1037 KI-AGEMISLAAVEEL 1051
>gnl|CDD|236217 PRK08279, PRK08279, long-chain-acyl-CoA synthetase; Validated.
Length = 600
Score = 51.4 bits (124), Expect = 1e-07
Identities = 34/128 (26%), Positives = 52/128 (40%), Gaps = 28/128 (21%)
Query: 15 IRLVDWDE----------GNYRVCDKPFPRGEI--LIGGSNISPGY----YKNPTKTKE- 57
+V +D G GE+ LIG I+ Y +P +++
Sbjct: 375 YAIVKYDVDTGEPVRDADGRCIKV----KPGEVGLLIGR--ITDRGPFDGYTDPEASEKK 428
Query: 58 ---DFFDEEGKRWFRTGDIGELHQDGVIRIIDRKKDLVKLQAGEYVSLGKVEAELKTCPV 114
D F ++G WF TGD+ G + +DR D + + GE V+ +VE L P
Sbjct: 429 ILRDVF-KKGDAWFNTGDLMRDDGFGHAQFVDRLGDTFRWK-GENVATTEVENALSGFPG 486
Query: 115 IENICVYG 122
+E VYG
Sbjct: 487 VEEAVVYG 494
>gnl|CDD|213318 cd05971, MACS_like_3, Uncharacterized subfamily of medium-chain
acyl-CoA synthetase (MACS). MACS catalyzes the two-step
activation of medium chain fatty acids (containing 4-12
carbons). The carboxylate substrate first reacts with
ATP to form an acyl-adenylate intermediate, which then
reacts with CoA to produce an acyl-CoA ester. MACS
enzymes are localized to mitochondria.
Length = 439
Score = 51.2 bits (123), Expect = 1e-07
Identities = 26/94 (27%), Positives = 37/94 (39%), Gaps = 8/94 (8%)
Query: 47 GYYKNPTKTKEDFFDEEGKRWFRTGDIGELHQDGVIRIIDRKKDLVKLQAGEYVSLGKVE 106
GY+ NP T F W TGD+G DG + R D++K +G + ++E
Sbjct: 301 GYWNNPEATAAKFAG----DWLLTGDLGRRDADGYLWFKGRADDVIK-SSGYRIGPAEIE 355
Query: 107 AELKTCPVIENICVYG---DSRKDYTVALVVPNP 137
L P + V G R + A VV
Sbjct: 356 ECLLKHPAVLEAAVVGVPDPERGEIVKAFVVLAE 389
>gnl|CDD|213317 cd05970, MACS_AAE_MA_like, Medium-chain acyl-CoA synthetase (MACS)
of AAE_MA like. MACS catalyzes the two-step activation
of medium chain fatty acids (containing 4-12 carbons).
The carboxylate substrate first reacts with ATP to form
an acyl-adenylate intermediate, which then reacts with
CoA to produce an acyl-CoA ester. This family of MACS
enzymes is found in archaea and bacteria. It is
represented by the acyl-adenylating enzyme from
Methanosarcina acetivorans (AAE_MA). AAE_MA is most
active with propionate, butyrate, and the branched
analogs: 2-methyl-propionate, butyrate, and pentanoate.
The specific activity is weaker for smaller or larger
acids.
Length = 537
Score = 50.5 bits (121), Expect = 2e-07
Identities = 57/219 (26%), Positives = 99/219 (45%), Gaps = 40/219 (18%)
Query: 4 GRAGAPTTVCDIRLVDWDEGNYRVCDKPFPRGEILIGGSNISP-----GYYKNPTKTKED 58
G G P+ DI ++D D + C+ GEI+I S+ P GYY++P +T E
Sbjct: 353 GSMGKPSPGYDIDIIDPDG---KSCE-VGEEGEIVIRTSDGKPLGLFMGYYRDPERTAEV 408
Query: 59 FFDEEGKRWFRTGDIGELHQDGVIRIIDRKKDLVKLQAGEYVSLGKVEAELKTCPVIENI 118
+ D ++ TGD + +DG + + R DL+K +G + +VE+ L P +
Sbjct: 409 WHDG----YYHTGDTAWMDEDGYLWFVGRADDLIK-SSGYRIGPFEVESALIQHPAVLEC 463
Query: 119 CVYGDSRKDYTVALVVPNPSRLKEIAEKLNLDSLTFQQLCTHPEIEKQVLVELQEHGKKS 178
V G VP+P R + + + L E +++ ELQ+H KK
Sbjct: 464 AVTG-----------VPDPIRGQVVKATIVLTKGY--------EPSEELKKELQDHVKK- 503
Query: 179 NLERFEIPNAVKLCSEVWSPDMGLVTAAFKLKRKDIQER 217
++ P ++ E+ P T + K++R +I+E+
Sbjct: 504 VTAPYKYPRIIEFVDEL--PK----TISGKIRRVEIREK 536
>gnl|CDD|180666 PRK06710, PRK06710, long-chain-fatty-acid--CoA ligase; Validated.
Length = 563
Score = 50.0 bits (119), Expect = 3e-07
Identities = 38/136 (27%), Positives = 58/136 (42%), Gaps = 19/136 (13%)
Query: 4 GRAGAPTTVCDIRLVDWDEGNYRVCDKPFPRGEILIGGSNISPGYYKNPTKTKEDFFDEE 63
G G P + ++ + G P GEI++ G I GY+ P +T D
Sbjct: 376 GSIGVPWPDTEAMIMSLETGEAL---PPGEIGEIVVKGPQIMKGYWNKPEETAAVLQDG- 431
Query: 64 GKRWFRTGDIGELHQDGVIRIIDRKKDLVKLQAGEYVSLGKVEAELKTCPVIENICVYGD 123
W TGD+G + +DG + DRKKD++ + +G V +VE L Y
Sbjct: 432 ---WLHTGDVGYMDEDGFFYVKDRKKDMI-VASGFNVYPREVEEVL-----------YEH 476
Query: 124 SRKDYTVALVVPNPSR 139
+ V + VP+P R
Sbjct: 477 EKVQEVVTIGVPDPYR 492
>gnl|CDD|236019 PRK07445, PRK07445, O-succinylbenzoic acid--CoA ligase; Reviewed.
Length = 452
Score = 49.2 bits (118), Expect = 5e-07
Identities = 32/124 (25%), Positives = 52/124 (41%), Gaps = 19/124 (15%)
Query: 33 PRGEILIGGSNISPGYYKNPTKTKEDFFDEEGKRWFRTGDIGELHQDGVIRIIDRKKDLV 92
G I I +++ GYY D +G F T D+G L G + I+ R +
Sbjct: 300 QTGNITIQAQSLALGYYPQ-------ILDSQGI--FETDDLGYLDAQGYLHILGRNSQKI 350
Query: 93 KLQAGEYVSLGKVEAELKTCPVIENICVYGDSRKDY---TVALVVPN-----PSRLKE-I 143
+ GE V +VEA + +++++CV G + A+ VP LK I
Sbjct: 351 -ITGGENVYPAEVEAAILATGLVQDVCVLGLPDPHWGEVVTAIYVPKDPSISLEELKTAI 409
Query: 144 AEKL 147
++L
Sbjct: 410 KDQL 413
>gnl|CDD|181109 PRK07769, PRK07769, long-chain-fatty-acid--CoA ligase; Validated.
Length = 631
Score = 49.3 bits (118), Expect = 5e-07
Identities = 25/72 (34%), Positives = 32/72 (44%), Gaps = 15/72 (20%)
Query: 35 GEILIGGSNISPGYYKNPTKTKEDFFD--------------EEGKRWFRTGDIGELHQDG 80
GEI + G+NI GY+ P +T F + + W RTGD G DG
Sbjct: 419 GEIWLHGNNIGTGYWGKPEETAATFQNILKSRLSESHAEGAPDDALWVRTGDYGVYF-DG 477
Query: 81 VIRIIDRKKDLV 92
+ I R KDLV
Sbjct: 478 ELYITGRVKDLV 489
>gnl|CDD|236668 PRK10252, entF, enterobactin synthase subunit F; Provisional.
Length = 1296
Score = 49.3 bits (118), Expect = 7e-07
Identities = 32/132 (24%), Positives = 60/132 (45%), Gaps = 23/132 (17%)
Query: 35 GEILIGGSNISPGYYKNPTKTKEDFFDE---EGKRWFRTGDIGELHQDGVIRIIDRKKDL 91
G++ + G ++ GY P T F + G+R +RTGD+ DG + + R D
Sbjct: 803 GDLYLTGIQLAQGYLGRPDLTASRFIADPFAPGERMYRTGDVARWLDDGAVEYLGRSDDQ 862
Query: 92 VKLQAGEYVSLGKVEAELKTCPVIENI----CVY-------GDSRKDYTVALVVPNPSRL 140
+K++ G+ + LG+++ ++ P +E CV GD+R+ V +V
Sbjct: 863 LKIR-GQRIELGEIDRAMQALPDVEQAVTHACVINQAAATGGDARQ--LVGYLVS----- 914
Query: 141 KEIAEKLNLDSL 152
+ L+ +L
Sbjct: 915 -QSGLPLDTSAL 925
>gnl|CDD|236443 PRK09274, PRK09274, peptide synthase; Provisional.
Length = 552
Score = 48.7 bits (117), Expect = 8e-07
Identities = 28/105 (26%), Positives = 39/105 (37%), Gaps = 15/105 (14%)
Query: 4 GRAGAPTTVCDIRLVD-----WDEGNYRVCDKPFPR---GEILIGGSNISPGYYKNPTKT 55
GR V I + D WD+ GEI++ G ++ YY P T
Sbjct: 355 GRPVDGVEVRIIAISDAPIPEWDD------ALRLATGEIGEIVVAGPMVTRSYYNRPEAT 408
Query: 56 KED-FFDEEGKRWFRTGDIGELHQDGVIRIIDRKKDLVKLQAGEY 99
+ D +G W R GD+G L G + RK V+ G
Sbjct: 409 RLAKIPDGQGDVWHRMGDLGYLDAQGRLWFCGRKAHRVETAGGTL 453
>gnl|CDD|213320 cd05973, MACS_like_2, Uncharacterized subfamily of medium-chain
acyl-CoA synthetase (MACS). MACS catalyzes the two-step
activation of medium chain fatty acids (containing 4-12
carbons). The carboxylate substrate first reacts with
ATP to form an acyl-adenylate intermediate, which then
reacts with CoA to produce an acyl-CoA ester. MACS
enzymes are localized to mitochondria.
Length = 440
Score = 48.6 bits (116), Expect = 8e-07
Identities = 29/93 (31%), Positives = 40/93 (43%), Gaps = 16/93 (17%)
Query: 47 GYYKNPTKTKEDFFDEEGKRWFRTGDIGELHQDGVIRIIDRKKDLVKLQAGEYVSLGKVE 106
GY+ +P KT E RW+ TGD+ E +DG I R D++ AG + VE
Sbjct: 302 GYWDDPEKTAELIAG----RWYVTGDLVERDEDGYFWFIGRADDVII-SAGYRIGPFDVE 356
Query: 107 AELKTCPVIENICVYGDSRKDYTVALVVPNPSR 139
+ L P + V G VP+P R
Sbjct: 357 SALLEHPAVAEAAVVG-----------VPDPLR 378
>gnl|CDD|236091 PRK07768, PRK07768, long-chain-fatty-acid--CoA ligase; Validated.
Length = 545
Score = 47.7 bits (114), Expect = 2e-06
Identities = 29/92 (31%), Positives = 46/92 (50%), Gaps = 20/92 (21%)
Query: 7 GAPTTVCDIRLVDWDEGNYRVCDKPFPR--GEILIGGSNISPGYYKNPTKTKEDFF---- 60
G P ++R+VD ++G V PR G I + G +++PGY D F
Sbjct: 363 GPPLPGLEVRVVD-EDGQ--VLP---PRGVGVIELRGESVTPGY------LTMDGFIPAQ 410
Query: 61 DEEGKRWFRTGDIGELHQDGVIRIIDRKKDLV 92
D +G W TGD+G L ++G + + R KD++
Sbjct: 411 DADG--WLDTGDLGYLTEEGEVVVCGRVKDVI 440
>gnl|CDD|233807 TIGR02275, DHB_AMP_lig, 2,3-dihydroxybenzoate-AMP ligase. Proteins
in this family belong to the AMP-binding enzyme family
(pfam00501). Members activate 2,3-dihydroxybenzoate
(DHB) by ligation of AMP from ATP with the release of
pyrophosphate; many are involved in synthesis of
siderophores such as enterobactin, vibriobactin,
vulnibactin, etc. The most closely related protein
believed to differ in function activates salicylate
rather than DHB [Transport and binding proteins, Cations
and iron carrying compounds].
Length = 526
Score = 47.1 bits (112), Expect = 3e-06
Identities = 33/99 (33%), Positives = 49/99 (49%), Gaps = 14/99 (14%)
Query: 14 DIRLVDWDEGNYRVCDKPFPRGE---ILIGGSNISPGYYKNPTKTKEDFFDEEGKRWFRT 70
++R+VD D GN P GE +L G GYYK P FD EG ++ T
Sbjct: 362 EVRVVD-DHGN------PVAPGETGMLLTRGPYTFRGYYKAPEHNAA-AFDAEG--FYYT 411
Query: 71 GDIGELHQDGVIRIIDRKKDLVKLQAGEYVSLGKVEAEL 109
GD+ L +G I ++ R KD + + GE ++ ++E L
Sbjct: 412 GDLVRLTPEGYIVVVGRAKDQIN-RGGEKIAAEEIENLL 449
>gnl|CDD|213303 cd05937, FATP_chFAT1_like, Uncharacterized subfamily of
bifunctional fatty acid transporter/very-long-chain
acyl-CoA synthetase in fungi. Fatty acid transport
protein (FATP) transports long-chain or very-long-chain
fatty acids across the plasma membrane. FATPs also have
fatty acid CoA synthetase activity, thus playing dual
roles as fatty acid transporters and its activation
enzymes. FATPs are the key players in the trafficking of
exogenous fatty acids into the cell and in intracellular
fatty acid homeostasis. Members of this family are
fungal FATPs, including FAT1 from Cochliobolus
heterostrophus.
Length = 468
Score = 47.1 bits (112), Expect = 3e-06
Identities = 41/154 (26%), Positives = 56/154 (36%), Gaps = 24/154 (15%)
Query: 46 PGYYKNPTKTKE----DFFDEEGKRWFRTGDIGELHQDGVIRIIDRKKDLVKLQAGEYVS 101
GY KN T+ D F +G W+RTGD+ DG +DR D + + E VS
Sbjct: 315 QGYLKNEDATESKLLRDVF-RKGDIWYRTGDLLRQDADGRWYFLDRLGDTFRWK-SENVS 372
Query: 102 LGKVEAELKTCPVIENICVYGDSRKDYTVALVVPNPSRLKEIAEKLNLDSLTFQQLCTHP 161
G+V L P + VYG V R A L S
Sbjct: 373 TGEVADVLGAIPSVAEANVYG--------VKVPGYDGRAGCAAITLEESSA--------V 416
Query: 162 EIEKQVLVELQEHGKKSNLERFEIPNAVKLCSEV 195
E E L + + L + +P ++L EV
Sbjct: 417 ETEF-TKNLLAKL-ARKRLPSYAVPLFLRLTEEV 448
>gnl|CDD|235719 PRK06155, PRK06155, crotonobetaine/carnitine-CoA ligase;
Provisional.
Length = 542
Score = 46.7 bits (111), Expect = 3e-06
Identities = 26/75 (34%), Positives = 38/75 (50%), Gaps = 5/75 (6%)
Query: 47 GYYKNPTKTKEDFFDEEGKRWFRTGDIGELHQDGVIRIIDRKKDLVKLQAGEYVSLGKVE 106
GY+ P KT E + + WF TGD DG R +DR KD ++ + GE +S +VE
Sbjct: 385 GYFGMPEKTVEAWRN----LWFHTGDRVVRDADGWFRFVDRIKDAIRRR-GENISSFEVE 439
Query: 107 AELKTCPVIENICVY 121
L + P + V+
Sbjct: 440 QVLLSHPAVAAAAVF 454
>gnl|CDD|235908 PRK07008, PRK07008, long-chain-fatty-acid--CoA ligase; Validated.
Length = 539
Score = 45.9 bits (109), Expect = 7e-06
Identities = 28/94 (29%), Positives = 47/94 (50%), Gaps = 12/94 (12%)
Query: 14 DIRLVDWDEGNYRVCD-KPFPRGEILIGGSNISPGYYKNPTKTKEDFFDEEGKRWFRTGD 72
D+++V D+G D K F G++ + G + Y++ D WF TGD
Sbjct: 366 DMKIVG-DDGRELPWDGKAF--GDLQVRGPWVIDRYFRGDASPLVD-------GWFPTGD 415
Query: 73 IGELHQDGVIRIIDRKKDLVKLQAGEYVSLGKVE 106
+ + DG ++I DR KD++K GE++S +E
Sbjct: 416 VATIDADGFMQITDRSKDVIK-SGGEWISSIDIE 448
>gnl|CDD|171527 PRK12476, PRK12476, putative fatty-acid--CoA ligase; Provisional.
Length = 612
Score = 45.5 bits (108), Expect = 9e-06
Identities = 23/73 (31%), Positives = 33/73 (45%), Gaps = 16/73 (21%)
Query: 35 GEILIGGSNISPGYYKNPTKTKEDFFD---------------EEGKRWFRTGDIGELHQD 79
GEI + G NI GY+ P +T+ F + W RTGD+G ++ D
Sbjct: 430 GEIWLHGDNIGRGYWGRPEETERTFGAKLQSRLAEGSHADGAADDGTWLRTGDLG-VYLD 488
Query: 80 GVIRIIDRKKDLV 92
G + I R DL+
Sbjct: 489 GELYITGRIADLI 501
>gnl|CDD|213319 cd05972, MACS_like, Medium-chain acyl-CoA synthetase (MACS or
ACSM). MACS catalyzes the two-step activation of medium
chain fatty acids (containing 4-12 carbons). The
carboxylate substrate first reacts with ATP to form an
acyl-adenylate intermediate, which then reacts with CoA
to produce an acyl-CoA ester. The acyl-CoA is a key
intermediate in many important biosynthetic and
catabolic processes.
Length = 430
Score = 45.4 bits (108), Expect = 1e-05
Identities = 45/178 (25%), Positives = 71/178 (39%), Gaps = 32/178 (17%)
Query: 4 GRAGAPTTVCDIRLVDWDEGNYRVCDKPFPRGEILIGGSNISPG----YYKNPTKTKEDF 59
G G P + ++D DEGN + G+I + PG Y K+P KT+
Sbjct: 249 GSMGRPAPGYRVAIID-DEGNELPPGEE---GDIAVRVKPRPPGLFRGYLKDPEKTEATI 304
Query: 60 FDEEGKRWFRTGDIGELHQDGVIRIIDRKKDLVKLQAGEYVSLGKVEAELKTCPVIENIC 119
+ W+ TGD +DG + R D++K +G + +VE+ L P +
Sbjct: 305 RGD----WYLTGDRAIKDEDGYFWFVGRADDVIK-SSGYRIGPFEVESALIEHPAVAEAA 359
Query: 120 VYGDSRKDYTVALVVPNPSRLKEIAEKLNLDSLTFQQLCTHPEIEKQVLVELQEHGKK 177
V G P+P R + + F L + +++ ELQEH KK
Sbjct: 360 VVG-----------SPDPVRGEVVK--------AFVVLAPGYQPSEELAKELQEHVKK 398
>gnl|CDD|213290 cd05923, CBAL, 4-Chlorobenzoate-CoA ligase (CBAL). CBAL catalyzes
the conversion of 4-chlorobenzoate (4-CB) to
4-chlorobenzoyl-coenzyme A (4-CB-CoA) by the two-step
adenylation and thioester-forming reactions.
4-Chlorobenzoate (4-CBA) is an environmental pollutant
derived from microbial breakdown of aromatic pollutants,
such as polychlorinated biphenyls (PCBs), DDT, and
certain herbicides. The 4-CBA degrading pathway converts
4-CBA to the metabolite 4-hydroxybenzoate (4-HBA),
allowing some soil-dwelling microbes to utilize 4-CBA as
an alternate carbon source. This pathway consists of
three chemical steps catalyzed by 4-CBA-CoA ligase,
4-CBA-CoA dehalogenase, and 4HBA-CoA thioesterase in
sequential reactions.
Length = 495
Score = 44.9 bits (106), Expect = 1e-05
Identities = 30/94 (31%), Positives = 43/94 (45%), Gaps = 8/94 (8%)
Query: 47 GYYKNPTKTKEDFFDEEGKRWFRTGDIGELHQDGVIRIIDRKKDLVKLQAGEYVSLGKVE 106
GY P T E D W+RT D+ + G +RI+ R D++ + GE + +VE
Sbjct: 361 GYLNQPQATAEKLQD----GWYRTSDVAVVDPSGTVRILGRVDDMI-ISGGENIHPSEVE 415
Query: 107 AELKTCPVIENICVYG--DSRKDYTV-ALVVPNP 137
L P + + V G D R +V A VVP
Sbjct: 416 RVLGRAPGVTEVVVIGLPDERWGQSVTACVVPRE 449
>gnl|CDD|223952 COG1021, EntE, Peptide arylation enzymes [Secondary metabolites
biosynthesis, transport, and catabolism].
Length = 542
Score = 44.0 bits (104), Expect = 3e-05
Identities = 33/125 (26%), Positives = 54/125 (43%), Gaps = 28/125 (22%)
Query: 14 DIRLVDWDEGNYRVCDKPFPRGEI---LIGGSNISPGYYKNPTKTKEDFFDEEGKRWFRT 70
++R+VD +GN P GE+ L G GYY+ P + FD +G ++R+
Sbjct: 369 EVRVVD-ADGN------PVAPGEVGELLTRGPYTIRGYYRAP-EHNARAFDADG--FYRS 418
Query: 71 GDIGELHQDGVIRIIDRKKDLVKLQAGEYVSLGKVEAELKTCPVI--------------E 116
GD+ DG + + R KD + + GE ++ +VE L P + E
Sbjct: 419 GDLVRRDPDGYLVVEGRVKDQIN-RGGEKIAAEEVENLLLRHPAVHDAALVAMPDELLGE 477
Query: 117 NICVY 121
C +
Sbjct: 478 KSCAF 482
>gnl|CDD|235624 PRK05850, PRK05850, acyl-CoA synthetase; Validated.
Length = 578
Score = 43.8 bits (104), Expect = 3e-05
Identities = 23/70 (32%), Positives = 36/70 (51%), Gaps = 17/70 (24%)
Query: 35 GEILIGGSNISPGYYKNPTKTKEDFFDEEGKR------------WFRTGDIGELHQDGVI 82
GEI + G N++ GY++ P +T+ F G W RTGD+G + +G +
Sbjct: 398 GEIWVHGDNVAAGYWQKPEETERTF----GATLVDPSPGTPEGPWLRTGDLGFIS-EGEL 452
Query: 83 RIIDRKKDLV 92
I+ R KDL+
Sbjct: 453 FIVGRIKDLL 462
>gnl|CDD|181207 PRK08043, PRK08043, bifunctional acyl-[acyl carrier protein]
synthetase/2-acylglycerophosphoethanolamine
acyltransferase; Validated.
Length = 718
Score = 43.9 bits (104), Expect = 4e-05
Identities = 28/78 (35%), Positives = 38/78 (48%), Gaps = 7/78 (8%)
Query: 35 GEILIGGSNISPGYYK--NP----TKTKEDFFDEEGKRWFRTGDIGELHQDGVIRIIDRK 88
G + + G NI GY + P T E+ E + W+ TGDI + G ++I R
Sbjct: 554 GRLQLKGPNIMNGYLRVEKPGVLEVPTAENARGEMERGWYDTGDIVRFDEQGFVQIQGRA 613
Query: 89 KDLVKLQAGEYVSLGKVE 106
K K+ AGE VSL VE
Sbjct: 614 KRFAKI-AGEMVSLEMVE 630
>gnl|CDD|183506 PRK12406, PRK12406, long-chain-fatty-acid--CoA ligase; Provisional.
Length = 509
Score = 43.5 bits (103), Expect = 4e-05
Identities = 42/144 (29%), Positives = 67/144 (46%), Gaps = 25/144 (17%)
Query: 4 GRAGAPTTVCDIRLVDWDEGNYRVCDKPFPRGEILIGGSNIS--PG-----YYKNPTKTK 56
G G ++R VD ++G +P P+GEI G S G Y+ P K
Sbjct: 324 GTVGKAAPGAELRFVD-EDG------RPLPQGEI---GEIYSRIAGNPDFTYHNKPEKRA 373
Query: 57 EDFFDEEGKRWFRTGDIGELHQDGVIRIIDRKKDLVKLQAGEYVSLGKVEAELKTCPVIE 116
E D G + +GD+G L DG + + DRK+D+V + G + ++EA L P +
Sbjct: 374 E--IDRGG--FITSGDVGYLDADGYLFLCDRKRDMV-ISGGVNIYPAEIEAVLHAVPGVH 428
Query: 117 NICVYGDSRKDY---TVALVVPNP 137
+ V+G ++ +A+V P P
Sbjct: 429 DCAVFGIPDAEFGEALMAVVEPQP 452
>gnl|CDD|237374 PRK13388, PRK13388, acyl-CoA synthetase; Provisional.
Length = 540
Score = 43.1 bits (102), Expect = 5e-05
Identities = 31/107 (28%), Positives = 45/107 (42%), Gaps = 9/107 (8%)
Query: 35 GEIL-IGGSNISPGYYKNPTKTKEDFFDEEGKRWFRTGDIGELHQDGVIRIIDRKKDLVK 93
GE++ G+ GYY NP T E + +GD+ DG I R D ++
Sbjct: 352 GELVNTAGAGFFEGYYNNPEATAERMRHG----MYWSGDLAYRDADGWIYFAGRTADWMR 407
Query: 94 LQAGEYVSLGKVEAELKTCPVIENICVYG--DSRK-DYTVALVVPNP 137
+ GE +S +E L P I + VY D R D +A +V
Sbjct: 408 VD-GENLSAAPIERILLRHPAINRVAVYAVPDERVGDQVMAALVLRD 453
>gnl|CDD|211788 TIGR03098, ligase_PEP_1, acyl-CoA ligase (AMP-forming), exosortase
A-associated. This group of proteins contains an
AMP-binding domain (pfam00501) associated with acyl
CoA-ligases. These proteins are generally found in
genomes containing the exosortase/PEP-CTERM protein
export system, specifically the type 1 variant of this
system described by the Genome Property GenProp0652.
When found in this context they are invariably present
next to a decarboxylase enzyme. A number of sequences
from Burkholderia species also hit this model, but the
genomic context is obviously different. The hypothesis
of a constant substrate for this family is only strong
where the exosortase context is present.
Length = 517
Score = 42.5 bits (100), Expect = 9e-05
Identities = 27/115 (23%), Positives = 50/115 (43%), Gaps = 16/115 (13%)
Query: 35 GEILIGGSNISPGYYKNPTKTKEDF-----FDEEGKR-----WFRTGDIGELHQDGVIRI 84
GE++ G+ ++ GY+ +P KT E F F E W +GD ++G +
Sbjct: 357 GELVHRGALVAMGYWNDPEKTAERFRPLPPFPGELHLPELAVW--SGDTVRRDEEGFLYF 414
Query: 85 IDRKKDLVKLQAGEYVSLGKVEAELKTCPVIENICVYG---DSRKDYTVALVVPN 136
+ R+ +++K +G VS +VE ++ +G + V +V P
Sbjct: 415 VGRRDEMIK-TSGYRVSPTEVEEVAYATGLVAEAVAFGVPDPTLGQAIVLVVTPP 468
>gnl|CDD|213316 cd05969, MACS_like_4, Uncharacterized subfamily of Acetyl-CoA
synthetase like family (ACS). This family is most
similar to acetyl-CoA synthetase. Acetyl-CoA synthetase
(ACS) catalyzes the formation of acetyl-CoA from
acetate, CoA, and ATP. Synthesis of acetyl-CoA is
carried out in a two-step reaction. In the first step,
the enzyme catalyzes the synthesis of acetyl-AMP
intermediate from acetate and ATP. In the second step,
acetyl-AMP reacts with CoA to produce acetyl-CoA. This
enzyme is only present in bacteria.
Length = 443
Score = 42.4 bits (100), Expect = 9e-05
Identities = 40/156 (25%), Positives = 62/156 (39%), Gaps = 12/156 (7%)
Query: 4 GRAGAPTTVCDIRLVDWDEGNYRVCDKPFPRGEILI--GGSNISPGYYKNPTKTKEDFFD 61
G G P + +++ D P GE+ + G ++ GY N + F D
Sbjct: 260 GSMGRPLPGIEAAVIERDGDGLTPVTGPGQVGELALKPGWPSMFRGYLGNEERYASSFVD 319
Query: 62 EEGKRWFRTGDIGELHQDGVIRIIDRKKDLVKLQAGEYVSLGKVEAELKTCPVIENICVY 121
W+ TGD+ +DG + R D++K AG V +VE+ L P + V
Sbjct: 320 G----WYLTGDLAYRDEDGYFWFVGRADDVIK-TAGHLVGPFEVESALMEHPAVAEAGVI 374
Query: 122 GDS---RKDYTVALVVPNPSRLKEIAEKLNLDSLTF 154
G + A V P E +E+L + LTF
Sbjct: 375 GKPDPVAGEIVKAFVSLRPGF--EPSEELRRELLTF 408
>gnl|CDD|236215 PRK08276, PRK08276, long-chain-fatty-acid--CoA ligase; Validated.
Length = 502
Score = 41.0 bits (97), Expect = 3e-04
Identities = 22/75 (29%), Positives = 32/75 (42%), Gaps = 13/75 (17%)
Query: 26 RVCD---KPFPRGEI-LI----GGSNISPGYYKNPTKTKEDFFDEEGKRWFRTGDIGELH 77
R+ D P GEI + G Y+ +P KT W GD+G L
Sbjct: 326 RILDEDGNELPPGEIGTVYFEMDGYPFE--YHNDPEKTAAA---RNPHGWVTVGDVGYLD 380
Query: 78 QDGVIRIIDRKKDLV 92
+DG + + DRK D++
Sbjct: 381 EDGYLYLTDRKSDMI 395
>gnl|CDD|213283 cd05915, ttLC_FACS_like, Fatty acyl-CoA synthetases similar to
LC-FACS from Thermus thermophilus. This family includes
fatty acyl-CoA synthetases that can activate
medium-chain to long-chain fatty acids. They catalyze
the ATP-dependent acylation of fatty acids in a two-step
reaction. The carboxylate substrate first reacts with
ATP to form an acyl-adenylate intermediate, which then
reacts with CoA to produce an acyl-CoA ester. Fatty
acyl-CoA synthetases are responsible for fatty acid
degradation as well as physiological regulation of
cellular functions via the production of fatty acyl-CoA
esters. The fatty acyl-CoA synthetase from Thermus
thermophilus in this family has been shown to activate
the long-chain fatty acid myristic acid, while another
member in this family, the AlkK protein identified in
Pseudomonas oleovorans, targets medium chain fatty
acids. This family also includes an uncharacterized
subgroup of FACS.
Length = 509
Score = 40.5 bits (94), Expect = 4e-04
Identities = 28/106 (26%), Positives = 43/106 (40%), Gaps = 6/106 (5%)
Query: 37 ILIGGSNISPGYYKNPTKTKEDFFDEEGKRWFRTGDIGELHQDGVIRIIDRKKDLVKLQA 96
+ + G I+ GYY N T+ +FRTGDI ++G + I DR KDL+K
Sbjct: 363 VQLKGPWITGGYYGNEEATRSALTP---DGFFRTGDIAVWDEEGYVEIKDRLKDLIKS-G 418
Query: 97 GEYVSLGKVEAELKTCPVIENICVYGDSRKDYTVALVVPNPSRLKE 142
GE++S +E L P ++ V +
Sbjct: 419 GEWISSVDLENALMGHPKVKEAAVVA--IPHPKWQERPLAVVVPRG 462
>gnl|CDD|236363 PRK09029, PRK09029, O-succinylbenzoic acid--CoA ligase;
Provisional.
Length = 458
Score = 38.7 bits (91), Expect = 0.002
Identities = 27/107 (25%), Positives = 51/107 (47%), Gaps = 21/107 (19%)
Query: 14 DIRLVDWDEGNYRVCDKPFPRGEILIGGSNISPGYYKNPTKTKEDFFDEEGKRWFRTGDI 73
+++LVD GEI + G++++ GY++ ++EG WF T D
Sbjct: 299 EVKLVD---------------GEIWLRGASLALGYWRQGQLV--PLVNDEG--WFATRDR 339
Query: 74 GELHQDGVIRIIDRKKDLVKLQAGEYVSLGKVEAELKTCPVIENICV 120
GE Q+G + I+ R +L GE + ++E + P+++ + V
Sbjct: 340 GEW-QNGELTILGRLDNLF-FSGGEGIQPEEIERVINQHPLVQQVFV 384
>gnl|CDD|236803 PRK10946, entE, enterobactin synthase subunit E; Provisional.
Length = 536
Score = 38.4 bits (90), Expect = 0.002
Identities = 32/107 (29%), Positives = 51/107 (47%), Gaps = 24/107 (22%)
Query: 14 DIRLVDWDEGNYRVCDKPFPRGEI---LIGGSNISPGYYKNPTKTKEDFFDEEGKRWFRT 70
++ + D +GN P P+GE+ + G GYYK+P FD G ++ +
Sbjct: 364 EVWVAD-ADGN------PLPQGEVGRLMTRGPYTFRGYYKSPQHNAS-AFDANG--FYCS 413
Query: 71 GDIGELHQDGVIRIIDRKKDLVKLQAGEYVSLGKVEAELKTCPVIEN 117
GD+ + DG I ++ R+KD + + GE K+ AE IEN
Sbjct: 414 GDLVSIDPDGYITVVGREKDQIN-RGGE-----KIAAE-----EIEN 449
>gnl|CDD|213294 cd05928, MACS_euk, Eukaryotic Medium-chain acyl-CoA synthetase
(MACS or ACSM). MACS catalyzes the two-step activation
of medium chain fatty acids (containing 4-12 carbons).
The carboxylate substrate first reacts with ATP to form
an acyl-adenylate intermediate, which then reacts with
CoA to produce an acyl-CoA ester. The acyl-CoA is a key
intermediate in many important biosynthetic and
catabolic processes. MACS enzymes are localized to
mitochondria. Two murine MACS family proteins are found
in liver and kidney. In rodents, a MACS member is
detected particularly in the olfactory epithelium and is
called O-MACS. O-MACS demonstrates substrate preference
for the fatty acid lengths of C6-C12.
Length = 530
Score = 37.9 bits (88), Expect = 0.003
Identities = 54/222 (24%), Positives = 83/222 (37%), Gaps = 44/222 (19%)
Query: 4 GRAGAPTTVCDIRLVDWDEGNYRVCDKPFPRGEILIGGSNISP-----GYYKNPTKTKED 58
G G + D++++D D GN P G+I I P Y NP KT
Sbjct: 343 GSMGKASPPYDVQIID-DNGNVL---PPGTEGDIGIRVKPTRPFCLFSCYVDNPEKTAAT 398
Query: 59 FFDEEGKRWFRTGDIGELHQDGVIRIIDRKKDLVKLQAGEYVSLGKVEAELKTCPVIENI 118
G + TGD G + +DG + R D++ +G + +VE+ L P +
Sbjct: 399 ---IRGDFYI-TGDRGIMDEDGYFWFVGRADDVI-NSSGYRIGPFEVESALIEHPAVVES 453
Query: 119 CVYGDS---RKDYTVALVVPNPSRLKEIAEKLNLDSLTFQQLCTHPEIEKQVLVELQEHG 175
V R + A VV P L E+L ELQEH
Sbjct: 454 AVVSSPDPIRGEVVKAFVVLAPQFLSHDPEQLTK--------------------ELQEHV 493
Query: 176 KKSNLERFEIPNAVKLCSEVWSPDMGLVTAAFKLKRKDIQER 217
K ++ P V+ E+ P T K+KR +++++
Sbjct: 494 KSVTAP-YKYPRKVEFVQEL--PK----TITGKIKRNELRDK 528
>gnl|CDD|139531 PRK13383, PRK13383, acyl-CoA synthetase; Provisional.
Length = 516
Score = 37.7 bits (87), Expect = 0.003
Identities = 37/137 (27%), Positives = 56/137 (40%), Gaps = 21/137 (15%)
Query: 7 GAPTTVCDIRLVDWDEGNYRVCDKPF-PR--GEILIGGSNISPGYYKNPTKTKEDFFDEE 63
G P C +R++D + ++P PR G I +GG Y K D
Sbjct: 348 GKPVAGCPVRILDRN-------NRPVGPRVTGRIFVGGELAGTRYTDGGGKAVVDGMTS- 399
Query: 64 GKRWFRTGDIGELHQDGVIRIIDRKKDLVKLQAGEYVSLGKVEAELKTCPVIENICVYGD 123
TGD+G L G + I+ R+ D++ + GE V VE L P + + V G
Sbjct: 400 ------TGDMGYLDNAGRLFIVGREDDMI-ISGGENVYPRAVENALAAHPAVADNAVIGV 452
Query: 124 SRKDY---TVALVVPNP 137
+ + A VV +P
Sbjct: 453 PDERFGHRLAAFVVLHP 469
>gnl|CDD|213311 cd05958, ABCL, 2-aminobenzoate-CoA ligase (ABCL). ABCL catalyzes
the initial step in the 2-aminobenzoate aerobic
degradation pathway by activating 2-aminobenzoate to
2-aminobenzoyl-CoA. The reaction is carried out via a
two-step process; the first step is ATP-dependent and
forms a 2-aminobenzoyl-AMP intermediate, and the second
step forms the 2-aminobenzoyl-CoA ester and releases the
AMP. 2-Aminobenzoyl-CoA is further converted to
2-amino-5-oxo-cyclohex-1-ene-1-carbonyl-CoA catalyzed by
2-aminobenzoyl-CoA monooxygenase/reductase. ABCL has
been purified from cells aerobically grown with
2-aminobenzoate as sole carbon, energy, and nitrogen
source, and has been characterized as a monomer.
Length = 487
Score = 37.5 bits (87), Expect = 0.004
Identities = 39/148 (26%), Positives = 59/148 (39%), Gaps = 30/148 (20%)
Query: 1 MSTGRAGAPTTVCDIRLVDWDEGNYRVCDKPFPRGEILIGGSNISPGYY--KNPTKTKED 58
G G P + R+VD D+GN PRGE G + PT +
Sbjct: 310 AKPGATGKPVPGYEARIVD-DQGN------EVPRGE---------AGRLAVRGPTGCR-- 351
Query: 59 FFDEEGKR------WFRTGDIGELHQDGVIRIIDRKKDLVKLQAGEYVSLGKVEAELKTC 112
+ D+E ++ W TGDI +DG + R D++ + AG ++ +VE L T
Sbjct: 352 YLDDERQQEYVRDGWNVTGDIFRQDEDGYFHYVARSDDMI-VSAGYNIAAPEVEDALLTH 410
Query: 113 PVIENICVYG---DSRKDYTVALVVPNP 137
P + V G + R A +V
Sbjct: 411 PDVAECAVIGVPDEERGQIVCAHIVLRD 438
>gnl|CDD|180533 PRK06334, PRK06334, long chain fatty acid--[acyl-carrier-protein]
ligase; Validated.
Length = 539
Score = 36.3 bits (84), Expect = 0.011
Identities = 30/107 (28%), Positives = 50/107 (46%), Gaps = 14/107 (13%)
Query: 31 PFPRGE---ILIGGSNISPGYYKNPTKTKEDFFDEEGKRWFRTGDIGELHQDGVIRIIDR 87
P GE +L G+++ GY + F + G+ W+ TGD+G + + G + + R
Sbjct: 374 PVSSGETGLVLTRGTSLFSGYLGE--DFGQGFVELGGETWYVTGDLGYVDRHGELFLKGR 431
Query: 88 KKDLVKLQAGEYVSLGKVEAELKTCPVIENICVYGDSRKDYTVALVV 134
VK+ E VSL +E+ L +E +G + D+ LVV
Sbjct: 432 LSRFVKI-GAEMVSLEALESIL-----MEG---FGQNAADHAGPLVV 469
>gnl|CDD|180289 PRK05851, PRK05851, long-chain-fatty-acid--[acyl-carrier-protein]
ligase; Validated.
Length = 525
Score = 35.9 bits (83), Expect = 0.012
Identities = 21/63 (33%), Positives = 32/63 (50%), Gaps = 10/63 (15%)
Query: 35 GEILIGGSNISPGYYKNPTKTKEDFFDEEGKRWFRTGDIGELHQDGVIRIIDRKKDLVKL 94
GEI I G+++ GY +D WF TGD+G L G++ + R K+L+ +
Sbjct: 373 GEIEIRGASMMSGYLGQAPIDPDD--------WFPTGDLGYLVDGGLV-VCGRAKELITV 423
Query: 95 QAG 97
AG
Sbjct: 424 -AG 425
>gnl|CDD|180293 PRK05857, PRK05857, acyl-CoA synthetase; Validated.
Length = 540
Score = 35.8 bits (82), Expect = 0.016
Identities = 16/46 (34%), Positives = 23/46 (50%), Gaps = 4/46 (8%)
Query: 47 GYYKNPTKTKEDFFDEEGKRWFRTGDIGELHQDGVIRIIDRKKDLV 92
GY+ NP +T E D W TGD+ E +DG I R +++
Sbjct: 387 GYWNNPERTAEVLIDG----WVNTGDLLERREDGFFYIKGRSSEMI 428
>gnl|CDD|213321 cd05974, MACS_like_1, Uncharacterized subfamily of medium-chain
acyl-CoA synthetase (MACS). MACS catalyzes the two-step
activation of medium chain fatty acids (containing 4-12
carbons). The carboxylate substrate first reacts with
ATP to form an acyl-adenylate intermediate, which then
reacts with CoA to produce an acyl-CoA ester. MACS
enzymes are localized to mitochondria.
Length = 433
Score = 35.4 bits (82), Expect = 0.016
Identities = 27/97 (27%), Positives = 41/97 (42%), Gaps = 13/97 (13%)
Query: 1 MSTGRAGAPTTVCDIRLVDWDEGNYRVCDKPFPRGEILIGGSN----ISPGYYKNPTKTK 56
+ G G P + L+D DEG + P GEI + + + GY +P KT
Sbjct: 248 VKPGSMGRPLPGYRVVLLD-DEGK----EIPVTEGEIALDLGDRPIGLMLGYMGDPEKT- 301
Query: 57 EDFFDEEGKRWFRTGDIGELHQDGVIRIIDRKKDLVK 93
F ++RTGD +DG + + R D+ K
Sbjct: 302 AAAFRGG---YYRTGDKAYRDEDGYLWFVGRADDVFK 335
>gnl|CDD|236071 PRK07638, PRK07638, acyl-CoA synthetase; Validated.
Length = 487
Score = 34.8 bits (80), Expect = 0.028
Identities = 16/58 (27%), Positives = 30/58 (51%), Gaps = 1/58 (1%)
Query: 65 KRWFRTGDIGELHQDGVIRIIDRKKDLVKLQAGEYVSLGKVEAELKTCPVIENICVYG 122
W D+G ++G I I+ R+K+++ L G + ++E+ L P ++ I V G
Sbjct: 360 DGWMTVRDVGYEDEEGFIYIVGREKNMI-LFGGINIFPEEIESVLHEHPAVDEIVVIG 416
>gnl|CDD|184022 PRK13391, PRK13391, acyl-CoA synthetase; Provisional.
Length = 511
Score = 34.7 bits (80), Expect = 0.032
Identities = 31/109 (28%), Positives = 47/109 (43%), Gaps = 16/109 (14%)
Query: 23 GNYRVCD---KPFPRGEI----LIGGSNISPGYYKNPTKTKEDFFDEEGKRWFRTGDIGE 75
G+ + D P GE GG Y +P KT E + W GDIG
Sbjct: 336 GDLHILDDDGAELPPGEPGTIWFEGGRPFE--YLNDPAKTAEARHPDGT--WSTVGDIGY 391
Query: 76 LHQDGVIRIIDRKKDLVKLQAGEYVSLGKVEAE--LKTCPVIENICVYG 122
+ +DG + + DR ++ + G V++ EAE L T P + + V+G
Sbjct: 392 VDEDGYLYLTDRAAFMI-ISGG--VNIYPQEAENLLITHPKVADAAVFG 437
>gnl|CDD|234212 TIGR03443, alpha_am_amid, L-aminoadipate-semialdehyde
dehydrogenase. Members of this protein family are
L-aminoadipate-semialdehyde dehydrogenase (EC 1.2.1.31),
product of the LYS2 gene. It is also called
alpha-aminoadipate reductase. In fungi, lysine is
synthesized via aminoadipate. Currently, all members of
this family are fungal.
Length = 1389
Score = 34.3 bits (79), Expect = 0.058
Identities = 29/110 (26%), Positives = 44/110 (40%), Gaps = 27/110 (24%)
Query: 35 GEILIGGSNISPGYYKNPTKTKEDFF-------------DEEGKRW------------FR 69
GEI + ++ GY P E F D+E + +R
Sbjct: 622 GEIYVRAGGLAEGYLGLPELNAEKFVNNWFVDPSHWIDLDKENNKPEREFWLGPRDRLYR 681
Query: 70 TGDIGELHQDGVIRIIDRKKDLVKLQAGEYVSLGKVEAELKTCP-VIENI 118
TGD+G DG + R D VK++ G + LG+++ L P V EN+
Sbjct: 682 TGDLGRYLPDGNVECCGRADDQVKIR-GFRIELGEIDTHLSQHPLVRENV 730
>gnl|CDD|205769 pfam13591, MerR_2, MerR HTH family regulatory protein.
Length = 84
Score = 31.3 bits (72), Expect = 0.090
Identities = 9/24 (37%), Positives = 13/24 (54%)
Query: 152 LTFQQLCTHPEIEKQVLVELQEHG 175
+T + C IE+ L+EL E G
Sbjct: 1 ITITEFCERTGIEESFLIELVELG 24
>gnl|CDD|147830 pfam05892, Tricho_coat, Trichovirus coat protein. This family
consists of several coat proteins which are specific to
the ssRNA positive-strand, no DNA stage viruses such as
the Trichovirus and Vitivirus.
Length = 195
Score = 32.0 bits (73), Expect = 0.15
Identities = 18/59 (30%), Positives = 26/59 (44%), Gaps = 1/59 (1%)
Query: 123 DSRKDYTVALVVPNPSRLKEIAEKLNLDSLTFQQLCTHPEIE-KQVLVELQEHGKKSNL 180
D R ++ VV + N+ +TF+QLC E + LV L+ G SNL
Sbjct: 72 DVRGKINLSEVVNKIKTFAVTSSDGNIKGMTFRQLCEPFAEEARDFLVILKSLGVYSNL 130
>gnl|CDD|213314 cd05967, PrpE, Propionyl-CoA synthetase (PrpE). PrpE catalyzes the
first step of the 2-methylcitric acid cycle for
propionate catabolism. It activates propionate to
propionyl-CoA in a two-step reaction, which proceeds
through a propionyl-AMP intermediate and requires ATP
and Mg2+. In Salmonella enterica, the PrpE protein is
required for growth of S. enterica on propionate and can
substitute for the acetyl-CoA synthetase (Acs) enzyme
during growth on acetate. PrpE can also activate
acetate, 3HP, and butyrate to their corresponding
CoA-thioesters, although with less efficiency.
Length = 607
Score = 32.7 bits (75), Expect = 0.16
Identities = 37/152 (24%), Positives = 68/152 (44%), Gaps = 15/152 (9%)
Query: 3 TGRAGAPTTVCDIRLVDWDEGNYRVCDKPFPRGEILIGGSNISPGY----YKNPTKTKED 58
G G P D++++D + G + P G I+I + PG + + + K+
Sbjct: 400 PGSPGKPVPGYDVQVLD-ETGE-EL--GPGELGNIVIKLP-LPPGCLLTLWGDDERFKKL 454
Query: 59 FFDEEGKRWFRTGDIGELHQDGVIRIIDRKKDLVKLQAGEYVSLGKVEAELKTCPVIENI 118
+ + ++ TGD G +DG + ++ R D++ + AG +S G++E + P +
Sbjct: 455 YL-NKFPGYYDTGDSGYKDEDGYLFVMGRTDDVINV-AGHRLSTGEMEESVLKHPDVAEC 512
Query: 119 CVYG--DSRK-DYTVALVVPNPSRLKEIAEKL 147
V G D K + LVV K A++L
Sbjct: 513 AVVGVRDELKGQVPLGLVVLKDD-CKIDADQL 543
>gnl|CDD|213305 cd05939, hsFATP4_like, Fatty acid transport proteins (FATP),
including FATP4 and FATP1, and similar proteins. Fatty
acid transport protein (FATP) transports long-chain or
very-long-chain fatty acids across the plasma membrane.
At least five copies of FATPs are identified in
mammalian cells. This family includes FATP4, FATP1, and
homologous proteins. Each FATP has unique patterns of
tissue distribution. FATP4 is mainly expressed in the
brain, testis, colon and kidney. FATPs also have fatty
acid CoA synthetase activity, thus playing dual roles as
fatty acid transporters and its activation enzymes.
FATPs are the key players in the trafficking of
exogenous fatty acids into the cell and in intracellular
fatty acid homeostasis.
Length = 474
Score = 32.0 bits (73), Expect = 0.26
Identities = 43/169 (25%), Positives = 66/169 (39%), Gaps = 45/169 (26%)
Query: 54 KTKEDFFDEEGKRWFRTGDIGELHQD--GVIRIIDRKKDLVKLQAGEYVSLGKVEAELKT 111
K D F ++G F +GD+ L D G + DR D + + GE VS +VE L
Sbjct: 338 KIARDVF-KKGDSAFLSGDV--LVMDELGYLYFKDRTGDTFRWK-GENVSTTEVEGILSN 393
Query: 112 CPVIENICVYG-----DSRKDYTVALVVPNPSRLKEIAEKLNLDSLTFQQLCTHPEIEKQ 166
+E++ VYG + A+V P K++LD +
Sbjct: 394 VLGLEDVVVYGVEVPGVEGRAGMAAIVDPE--------RKVDLDRFS------------A 433
Query: 167 VLVELQEHGKKSNLERFEIPNAVKLCSEVWSPDMGLVTAAFKLKRKDIQ 215
VL +L + P ++L EV T FKL++ D+Q
Sbjct: 434 VL--------AKSLPPYARPQFIRLLPEVDK------TGTFKLQKTDLQ 468
>gnl|CDD|139538 PRK13390, PRK13390, acyl-CoA synthetase; Provisional.
Length = 501
Score = 31.5 bits (71), Expect = 0.32
Identities = 29/133 (21%), Positives = 51/133 (38%), Gaps = 19/133 (14%)
Query: 23 GNYRVCD---KPFPRGEI---LIGGSNISPGYYKNPTKTKEDFFDEEGKRWFRTGDIGEL 76
G+ +CD P G I + Y +P KT W GD+G +
Sbjct: 331 GDLHICDDDGNELPAGRIGTVYFERDRLPFRYLNDPEKTAAAQHPAH-PFWTTVGDLGSV 389
Query: 77 HQDGVIRIIDRKKDLVKLQAGEYVSLGKVEAELKTCPVIENICVYGDSRKDYTVALVVPN 136
+DG + + DRK ++ + G + + E L P + ++ V G VP+
Sbjct: 390 DEDGYLYLADRKSFMI-ISGGVNIYPQETENALTMHPAVHDVAVIG-----------VPD 437
Query: 137 PSRLKEIAEKLNL 149
P +++ + L
Sbjct: 438 PEMGEQVKAVIQL 450
>gnl|CDD|213304 cd05938, hsFATP2a_ACSVL_like, Fatty acid transport proteins (FATP)
including hsFATP2, hsFATP5, and hsFATP6, and similar
proteins. Fatty acid transport proteins (FATP) of this
family transport long-chain or very-long-chain fatty
acids across the plasma membrane. At least five copies
of FATPs are identified in mammalian cells. This family
includes hsFATP2, hsFATP5, and hsFATP6, and similar
proteins. Each FATP has unique patterns of tissue
distribution. These FATPs also have fatty acid CoA
synthetase activity, thus playing dual roles as fatty
acid transporters and its activation enzymes. The hsFATP
proteins exist in two splice variants; the b variant,
lacking exon 3, has no acyl-CoA synthetase activity.
FATPs are key players in the trafficking of exogenous
fatty acids into the cell and in intracellular fatty
acid homeostasis.
Length = 535
Score = 30.8 bits (70), Expect = 0.60
Identities = 29/101 (28%), Positives = 45/101 (44%), Gaps = 11/101 (10%)
Query: 31 PFPRGE--ILIGGSN-ISP--GYYKNPTKTKE----DFFDEEGKRWFRTGDIGELHQDGV 81
P +GE +LI SP GY T++ D F ++G +F TGD+ +
Sbjct: 341 PVGKGEPGLLISKITSQSPFLGYAGPRELTEKKLLRDVF-KKGDVYFNTGDLLVQDRQNF 399
Query: 82 IRIIDRKKDLVKLQAGEYVSLGKVEAELKTCPVIENICVYG 122
+ DR D + + GE V+ +V L I+ + VYG
Sbjct: 400 LYFHDRTGDTFRWK-GENVATTEVADILTMVDFIQEVNVYG 439
>gnl|CDD|226713 COG4262, COG4262, Predicted spermidine synthase with an N-terminal
membrane domain [General function prediction only].
Length = 508
Score = 30.5 bits (69), Expect = 0.72
Identities = 15/49 (30%), Positives = 25/49 (51%), Gaps = 3/49 (6%)
Query: 126 KDYTVALVVPNPSRLKEIAEKLNL---DSLTFQQLCTHPEIEKQVLVEL 171
Y +LV P S ++ L L D L ++L +P++E+ LV+L
Sbjct: 273 YRYHESLVYPALSSVRGARSVLVLGGGDGLALRELLKYPQVEQITLVDL 321
>gnl|CDD|237733 PRK14499, PRK14499, molybdenum cofactor biosynthesis protein
MoaC/MOSC-domain-containing protein; Provisional.
Length = 308
Score = 29.1 bits (65), Expect = 1.9
Identities = 16/51 (31%), Positives = 26/51 (50%), Gaps = 2/51 (3%)
Query: 144 AEKLNLDSLTFQ--QLCTHPEIEKQVLVELQEHGKKSNLERFEIPNAVKLC 192
AE + ++L Q L T +I V++E+ + GKK + EI V +C
Sbjct: 230 AENITTENLDLQKISLGTKLKIGDNVVLEISQIGKKCHGSGCEIARQVGVC 280
>gnl|CDD|151654 pfam11212, DUF2999, Protein of unknown function (DUF2999). This
family of proteins with unknown function appears to be
restricted to Gammaproteobacteria.
Length = 82
Score = 27.4 bits (61), Expect = 2.2
Identities = 10/21 (47%), Positives = 15/21 (71%)
Query: 130 VALVVPNPSRLKEIAEKLNLD 150
+ LV+ NP+ +KE E+L LD
Sbjct: 50 MMLVMQNPALIKEAVEELGLD 70
>gnl|CDD|130582 TIGR01519, plasmod_dom_1, Plasmodium falciparum uncharacterized
domain. This model represents an uncharacterized domain
present in roughly eight hypothetical proteins of the
malaria parasite Plasmodium falciparum.
Length = 70
Score = 26.7 bits (59), Expect = 2.6
Identities = 12/56 (21%), Positives = 25/56 (44%), Gaps = 1/56 (1%)
Query: 163 IEKQVLVELQEHGKKSNLERFEIPNAVKLCSEVWSPDMGLVTAAFKLKRKDIQERY 218
I L++ H K S + +PN++ L + +P + ++ KR + +Y
Sbjct: 16 IFIDKLIDNNVHNKGSIISEDVVPNSLLLSKAL-TPLTAIPILSYITKRINFLNKY 70
>gnl|CDD|240047 cd04691, Nudix_Hydrolase_32, Members of the Nudix hydrolase
superfamily catalyze the hydrolysis of NUcleoside
DIphosphates linked to other moieties, X. Enzymes
belonging to this superfamily require a divalent cation,
such as Mg2+ or Mn2+, for their activity and contain a
highly conserved 23-residue nudix motif
(GX5EX7REUXEEXGU, where U = I, L or V), which functions
as a metal binding and catalytic site. Substrates of
nudix hydrolases include intact and oxidatively damaged
nucleoside triphosphates, dinucleoside polyphosphates,
nucleotide-sugars and dinucleotide enzymes. These
substrates are metabolites or cell signaling molecules
that require regulation during different stages of the
cell cycle or during periods of stress. In general, the
role of the nudix hydrolase is to sanitize the
nucleotide pools and to maintain cell viability, thereby
serving as surveillance & "house-cleaning" enzymes.
Substrate specificity is used to define families within
the superfamily. Differences in substrate specificity
are determined by the N-terminal extension or by
residues in variable loop regions. Mechanistically,
substrate hydrolysis occurs by a nucleophilic
substitution reaction, with variation in the numbers and
roles of divalent cations required.
Length = 117
Score = 27.5 bits (61), Expect = 2.9
Identities = 13/31 (41%), Positives = 20/31 (64%), Gaps = 2/31 (6%)
Query: 140 LKEIAEKLNLDSLTFQQLC--THPEIEKQVL 168
L+E+ E+L +D L++ LC HP E Q+L
Sbjct: 47 LREVQEELGVDPLSYTYLCSLYHPTSELQLL 77
>gnl|CDD|234701 PRK00252, alaS, alanyl-tRNA synthetase; Reviewed.
Length = 865
Score = 28.5 bits (65), Expect = 3.0
Identities = 13/36 (36%), Positives = 16/36 (44%), Gaps = 4/36 (11%)
Query: 71 GDIGELHQDGVIRIIDRKKDLVKLQAGEYVSLGKVE 106
GD G + GV + D +K G V GKVE
Sbjct: 496 GDTGVIEDGGVFEVTDTQKP----PNGLIVHRGKVE 527
>gnl|CDD|181506 PRK08622, PRK08622, galactose-6-phosphate isomerase subunit LacB;
Reviewed.
Length = 171
Score = 27.7 bits (62), Expect = 4.3
Identities = 10/28 (35%), Positives = 13/28 (46%), Gaps = 4/28 (14%)
Query: 50 KNPTKTKEDFFDEEGKRWFRTGDIGELH 77
N + FFDE ++W D GE H
Sbjct: 147 PNADQANPHFFDEFLEKW----DRGEYH 170
>gnl|CDD|237845 PRK14874, PRK14874, aspartate-semialdehyde dehydrogenase;
Provisional.
Length = 334
Score = 27.8 bits (63), Expect = 4.3
Identities = 15/37 (40%), Positives = 18/37 (48%), Gaps = 5/37 (13%)
Query: 112 CPVIENICVYGDSRKDYTVALVVP--NPSRLKEIAEK 146
VI+N + R D V LVVP NP L E +K
Sbjct: 88 AVVIDNSSAF---RMDPDVPLVVPEVNPEALAEHRKK 121
>gnl|CDD|233435 TIGR01488, HAD-SF-IB, Haloacid Dehalogenase superfamily, subfamily
IB, phosphoserine phosphatase-like. This model
represents a subfamily of the Haloacid Dehalogenase
superfamily of aspartate-nucleophile hydrolases.
Subfamily IA, B, C and D are distinguished from the rest
of the superfamily by the presence of a variable domain
between the first and second conserved catalytic motifs.
In subfamilies IA and IB, this domain consists of an
alpha-helical bundle. It was necessary to model these
two subfamilies separately, breaking them at an
apparent phylogenetic bifurcation, so that the resulting
model(s) are not so broadly defined that members of
subfamily III (which lack the variable domain) are
included. Subfamily IA includes the enzyme phosphoserine
phosphatase (TIGR00338) as well as three hypothetical
equivalogs. Many members of these hypothetical
equivalogs have been annotated as PSPase-like or
PSPase-family proteins. In particular, the hypothetical
equivalog which appears to be most closely related to
PSPase contains only Archaea (while TIGR00338 contains
only eukaryotes and bacteria) of which some are
annotated as PSPases. Although this is a reasonable
conjecture, none of these sequences has sufficient
evidence for this assignment. If such should be found,
this model should be retired while the PSPase model
should be broadened to include these sequences [Unknown
function, Enzymes of unknown specificity].
Length = 177
Score = 27.3 bits (61), Expect = 4.5
Identities = 19/132 (14%), Positives = 37/132 (28%), Gaps = 13/132 (9%)
Query: 53 TKTKEDFFDEE-----GKRWFRTGDIGELHQDGVIRIIDRKKDLVKLQAGEYVSLGKVEA 107
T T++D + G ++ L G I D + L E
Sbjct: 9 TLTRQDSLIDLLAKLLGTNDEV-IELTRLAPSGRISFEDALGRRLALLHRSRSEEVAKEF 67
Query: 108 ELKTCPVIENI--CVYGDSRKDYTVALVVPNPSRL-KEIAEKLNLDSLTFQQLCTHPEIE 164
+ + + + +V + +AEKL +D + E +
Sbjct: 68 LARQVALRPGARELISWLKERGIDTVIVSGGFDFFVEPVAEKLGIDDV----FANRLEFD 123
Query: 165 KQVLVELQEHGK 176
L+ G+
Sbjct: 124 DNGLLTGPIEGQ 135
>gnl|CDD|224982 COG2071, COG2071, Predicted glutamine amidotransferases [General
function prediction only].
Length = 243
Score = 27.7 bits (62), Expect = 5.4
Identities = 17/57 (29%), Positives = 25/57 (43%), Gaps = 13/57 (22%)
Query: 35 GEILIGGSNISPGYY-KNPTKTKEDFFDEEGKRWFRTGDIGELHQDGVIRI-IDRKK 89
G IL GGSN+ P Y + P++ + E D EL +IR ++R
Sbjct: 63 GLILTGGSNVDPSLYGEEPSEKDGPYDPER--------DAFEL---ALIRAALERGI 108
>gnl|CDD|116864 pfam08280, HTH_Mga, M protein trans-acting positive regulator (MGA)
HTH domain. Mga is a DNA-binding protein that activates
the expression of several important virulence genes in
group A streptococcus in response to changing
environmental conditions.
Length = 59
Score = 25.3 bits (56), Expect = 6.7
Identities = 12/37 (32%), Positives = 18/37 (48%), Gaps = 1/37 (2%)
Query: 125 RKDYTVALVVPNPS-RLKEIAEKLNLDSLTFQQLCTH 160
RK + L+ N S L E+A++L L LT +
Sbjct: 6 RKLKLLELLTENKSITLDELAKQLGLSELTLKSYIHE 42
>gnl|CDD|237551 PRK13902, alaS, alanyl-tRNA synthetase; Provisional.
Length = 900
Score = 27.5 bits (62), Expect = 6.8
Identities = 10/24 (41%), Positives = 12/24 (50%), Gaps = 7/24 (29%)
Query: 69 RTGDIG-------ELHQDGVIRII 85
TG+IG E QDGV R+
Sbjct: 704 NTGEIGPIKILKTERIQDGVERLE 727
>gnl|CDD|143609 cd07585, nitrilase_7, Uncharacterized subgroup of the nitrilase
superfamily (putative class 13 nitrilases). The
nitrilase superfamily is comprised of nitrile- or
amide-hydrolyzing enzymes and amide-condensing enzymes,
which depend on a Glu-Lys-Cys catalytic triad. This
superfamily has been classified in the literature based
on global and structure based sequence analysis into
thirteen different enzyme classes (referred to as 1-13).
Class 13 represents proteins that at the time were
difficult to place in a distinct similarity group; this
subgroup represents either a new class or one that was
included previously in class 13. Members of this
superfamily generally form homomeric complexes, the
basic building block of which is a homodimer.
Length = 261
Score = 27.3 bits (61), Expect = 7.0
Identities = 10/35 (28%), Positives = 13/35 (37%), Gaps = 2/35 (5%)
Query: 117 NICVYGDSRKDYTVALVVPNPS--RLKEIAEKLNL 149
I Y R A V PS L ++A + L
Sbjct: 42 CITGYTHVRALSREAEVPDGPSTQALSDLARRYGL 76
>gnl|CDD|225891 COG3355, COG3355, Predicted transcriptional regulator
[Transcription].
Length = 126
Score = 26.5 bits (59), Expect = 7.1
Identities = 17/41 (41%), Positives = 20/41 (48%), Gaps = 3/41 (7%)
Query: 118 ICVYGDSRKDYTV--ALVVPN-PSRLKEIAEKLNLDSLTFQ 155
CVYG S D V AL+ N P + E+AE LN T
Sbjct: 20 KCVYGLSELDVEVYKALLEENGPLTVDELAEILNRSRSTVY 60
>gnl|CDD|200251 TIGR03214, ura-cupin, putative allantoin catabolism protein. This
model represents a protein containing a tandem
arrangement of cupin domains (N-terminal part of
pfam07883 and C-terminal more distantly related to
pfam00190). This protein is found in the vicinity of
genes involved in the catabolism of allantoin, a
breakdown product of urate and sometimes of urate
iteslf. The distribution of pathway components in the
genomes in which this family is observed suggests that
the function is linked to the allantoate catabolism to
glyoxylate pathway (GenProp0686) since it is sometimes
found in genomes lacking any elements of the
xanthine-to-allantoin pathways (e.g. in Enterococcus
faecalis).
Length = 252
Score = 27.0 bits (60), Expect = 7.5
Identities = 14/35 (40%), Positives = 20/35 (57%), Gaps = 4/35 (11%)
Query: 141 KEIAEKLNLDSLTFQQLCTHPEIEKQVLVELQEHG 175
KE+A +N+ L+F+ +HP IE V EHG
Sbjct: 165 KELAFDMNVHILSFEPGASHPYIETHV----MEHG 195
>gnl|CDD|223642 COG0568, RpoD, DNA-directed RNA polymerase, sigma subunit
(sigma70/sigma32) [Transcription].
Length = 342
Score = 27.3 bits (61), Expect = 7.6
Identities = 13/47 (27%), Positives = 23/47 (48%), Gaps = 6/47 (12%)
Query: 136 NPSRLKEIAEKLNLDSLTFQQLCTHPEIEKQVLVELQEHGKKSNLER 182
P L+E+ E+ + +Q IE + L +L+ H ++S L R
Sbjct: 298 EPKTLEELGEEFGISRERVRQ------IEAKALRKLRRHPERSALLR 338
>gnl|CDD|225928 COG3393, COG3393, Predicted acetyltransferase [General function
prediction only].
Length = 268
Score = 26.7 bits (59), Expect = 9.7
Identities = 19/84 (22%), Positives = 30/84 (35%), Gaps = 7/84 (8%)
Query: 51 NPTKTKEDFFDEEGKRWFRTGDIGELHQDGVIRIIDRKKDLVKLQAGEYVSLGKVEAELK 110
P D+ G+ R + L+ G R + D G+ V+ + AE
Sbjct: 146 IPEVGLRATLDDFGRADSRKEAVAVLNALGRSRTYFLEGD------GKIVAKAETAAENP 199
Query: 111 TCPVIENICVYGDSR-KDYTVALV 133
I + + + R K Y ALV
Sbjct: 200 AYAQINGVYTHPEYRGKGYATALV 223
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.317 0.137 0.411
Gapped
Lambda K H
0.267 0.0849 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 11,947,395
Number of extensions: 1138753
Number of successful extensions: 1557
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1406
Number of HSP's successfully gapped: 188
Length of query: 228
Length of database: 10,937,602
Length adjustment: 94
Effective length of query: 134
Effective length of database: 6,768,326
Effective search space: 906955684
Effective search space used: 906955684
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 57 (25.5 bits)