RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy613
(545 letters)
>gnl|CDD|225858 COG3321, COG3321, Polyketide synthase modules and related proteins
[Secondary metabolites biosynthesis, transport, and
catabolism].
Length = 1061
Score = 619 bits (1599), Expect = 0.0
Identities = 232/535 (43%), Positives = 305/535 (57%), Gaps = 13/535 (2%)
Query: 11 DTNTIAIIGIGGKFPGSKNINDFWRKLENNEDAITEVPTSRWDWKAIYGDPHLESGKTKV 70
IAIIG+ +FPG+ + +FW L+ D ITEVP RWD A Y GK+
Sbjct: 2 LIEPIAIIGMACRFPGADSPEEFWDLLKEGRDEITEVPADRWDVDAYYDPDPTVPGKSYS 61
Query: 71 KWGGFLYDADCFDANFFGISPAEAEVMDPQLRLFIETTWAALEDAGYPPSKLSGSKTAIF 130
+WGGFL D D FDA FFGISP EAE MDPQ RL +E W ALEDAG P L GS T +F
Sbjct: 62 RWGGFLDDVDDFDALFFGISPREAEAMDPQQRLLLEVAWEALEDAGIYPDSLRGSATGVF 121
Query: 131 AGVSTADYKDILNEARHKGLVKSLAEPFPFMIANRVSYLFNFHGPSEVIDTACSSSLIAI 190
AG S ADY +L ++ + A R+SY+ GPS +DTACSSSL+A+
Sbjct: 122 AGASVADYLLLLLADDEAEPEYAITGNSSSVAAGRISYVLGLSGPSVTVDTACSSSLVAV 181
Query: 191 NRAIESLHLKNCDLALAGGVNILASPNITIASSKAGLLSENGRCMTFDQRANGYVRSEGV 250
+ A +SL L CDLALAGGVN++ SP + S G+LS +GRC FD A+GYVR EG
Sbjct: 182 HLACQSLRLGECDLALAGGVNLVLSPESSYLFSAGGMLSPDGRCKAFDADADGYVRGEGA 241
Query: 251 GVILLKPLKNAIIDNDHIYGIFRGNFENHGGHSSSPTSPNMLAQKQLLIDVYRRANINPY 310
GV++LK L +A D D IY + RG+ N G S+ T+PN+ AQ ++ + A I+P
Sbjct: 242 GVVVLKRLSDAERDGDRIYAVIRGSAVNQDGRSNGLTAPNLEAQADVIREALADAGIDPA 301
Query: 311 TINYIEAHGTGTKLGDPIEVNGLKSAFSELHEYYKTPLLKPYCGLGSVKANIGHLEAASG 370
T+ Y+EAHGTGT LGDPIE N L + + E C +GSVK+NIGHLEAA+G
Sbjct: 302 TVQYVEAHGTGTPLGDPIEANALGAVYGEGAP-------AQPCAIGSVKSNIGHLEAAAG 354
Query: 371 VIGVIKVLLMLKYKKIPGNPHLKIPNSYLKLDNTPFYLVNKTCDWIQLDNNIPRRAGVSS 430
+ G+IK L LK+ IP H PN + D++PF + + W PRRAGVSS
Sbjct: 355 IAGLIKTALALKHGYIPPTLHFDTPNPEIDFDSSPFVVPTEATPWP--TGGGPRRAGVSS 412
Query: 431 FGVGGSNVHVIIEEYRKNIKTKYIKENTVLVRIIMLSAKTKKSLKEYVILLLEFIIKEKN 490
FG GG+N HVI+EE ++ R+++LSAKT + L L + + +
Sbjct: 413 FGFGGTNAHVILEEAPPRAEST----IPSSPRLLVLSAKTAERLAATAPRLADRLELQGG 468
Query: 491 NFSLCDLAYTLQVGREAMKYRLAIYVNSYEDLIKKLQDYLNKKITNGIYTNFSKN 545
SL D+AYTLQ GR ++RLA+ N E+L L+ + K +
Sbjct: 469 LLSLADVAYTLQAGRPHFEHRLAVVANDREELEAGLRAFAAGKAKALSGVGADDS 523
>gnl|CDD|238429 cd00833, PKS, polyketide synthases (PKSs) polymerize simple fatty
acids into a large variety of different products, called
polyketides, by successive decarboxylating Claisen
condensations. PKSs can be divided into 2 groups,
modular type I PKSs consisting of one or more large
multifunctional proteins and iterative type II PKSs,
complexes of several monofunctional subunits.
Length = 421
Score = 585 bits (1510), Expect = 0.0
Identities = 206/430 (47%), Positives = 273/430 (63%), Gaps = 13/430 (3%)
Query: 15 IAIIGIGGKFPGSKNINDFWRKLENNEDAITEVPTSRWDWKAIYGDPHLESGKTKVKWGG 74
IAI+G+ +FPG+ + ++FW L DAI+E+P RWD Y DP + GKT + GG
Sbjct: 3 IAIVGMACRFPGAADPDEFWENLLEGRDAISEIPEDRWDADGYYPDPG-KPGKTYTRRGG 61
Query: 75 FLYDADCFDANFFGISPAEAEVMDPQLRLFIETTWAALEDAGYPPSKLSGSKTAIFAGVS 134
FL D D FDA FFGISP EAE MDPQ RL +E W ALEDAGY P L+GS+T +F G S
Sbjct: 62 FLDDVDAFDAAFFGISPREAEAMDPQQRLLLEVAWEALEDAGYSPESLAGSRTGVFVGAS 121
Query: 135 TADYKDILNEARHKGLVKSLAEPF--PFMIANRVSYLFNFHGPSEVIDTACSSSLIAINR 192
++DY ++L AR + + A +ANR+SY F+ GPS +DTACSSSL+A++
Sbjct: 122 SSDYLELL--ARDPDEIDAYAATGTSRAFLANRISYFFDLRGPSLTVDTACSSSLVALHL 179
Query: 193 AIESLHLKNCDLALAGGVNILASPNITIASSKAGLLSENGRCMTFDQRANGYVRSEGVGV 252
A +SL CDLAL GGVN++ SP++ + SKAG+LS +GRC FD A+GYVR EGVGV
Sbjct: 180 ACQSLRSGECDLALVGGVNLILSPDMFVGFSKAGMLSPDGRCRPFDADADGYVRGEGVGV 239
Query: 253 ILLKPLKNAIIDNDHIYGIFRGNFENHGGHSSSPTSPNMLAQKQLLIDVYRRANINPYTI 312
++LK L +A+ D D IY + RG+ N G + T+P+ AQ L+ Y RA ++P I
Sbjct: 240 VVLKRLSDALRDGDRIYAVIRGSAVNQDGRTKGITAPSGEAQAALIRRAYARAGVDPSDI 299
Query: 313 NYIEAHGTGTKLGDPIEVNGLKSAFSELHEYYKTPLLKPYCGLGSVKANIGHLEAASGVI 372
+Y+EAHGTGT LGDPIEV L F +GSVK+NIGHLEAA+G+
Sbjct: 300 DYVEAHGTGTPLGDPIEVEALAKVFGGSRS------ADQPLLIGSVKSNIGHLEAAAGLA 353
Query: 373 GVIKVLLMLKYKKIPGNPHLKIPNSYLKLDNTPFYLVNKTCDWIQLDNNIPRRAGVSSFG 432
G+IKV+L L++ IP N H + PN + + +P + + W PRRAGVSSFG
Sbjct: 354 GLIKVVLALEHGVIPPNLHFETPNPKIDFEESPLRVPTEARPWPA--PAGPRRAGVSSFG 411
Query: 433 VGGSNVHVII 442
GG+N HVI+
Sbjct: 412 FGGTNAHVIL 421
>gnl|CDD|214836 smart00825, PKS_KS, Beta-ketoacyl synthase. The structure of
beta-ketoacyl synthase is similar to that of the
thiolase family and also chalcone synthase. The active
site of beta-ketoacyl synthase is located between the N
and C-terminal domains.
Length = 298
Score = 313 bits (804), Expect = e-103
Identities = 115/285 (40%), Positives = 150/285 (52%), Gaps = 74/285 (25%)
Query: 15 IAIIGIGGKFPGSKNINDFWRKLENNEDAITEVPTSRWDWKAIYGDPHLESGKTKVKWGG 74
IAI+G+ +FPG+ + +FW L +G
Sbjct: 1 IAIVGMSCRFPGADDPEEFWD--------------------------LLLAG-------- 26
Query: 75 FLYDADCFDANFFGISPAEAEVMDPQLRLFIETTWAALEDAGYPPSKLSGSKTAIFAGVS 134
L D D FDA FFGISP EAE MDPQ RL +E W ALEDAG P L GS+T +F GVS
Sbjct: 27 -LDDVDLFDAAFFGISPREAEAMDPQQRLLLEVAWEALEDAGIDPESLRGSRTGVFVGVS 85
Query: 135 TADYKDILNEARHKGLVKSLAEPFPFMIANRVSYLFNFHGPSEVIDTACSSSLIAINRAI 194
++DY S +DTACSSSL+A++ A
Sbjct: 86 SSDY-------------------------------------SVTVDTACSSSLVALHLAC 108
Query: 195 ESLHLKNCDLALAGGVNILASPNITIASSKAGLLSENGRCMTFDQRANGYVRSEGVGVIL 254
+SL CD+ALAGGVN++ SP+ + S+AG+LS +GRC TFD A+GYVR EGVGV++
Sbjct: 109 QSLRSGECDMALAGGVNLILSPDTFVGLSRAGMLSPDGRCKTFDASADGYVRGEGVGVVV 168
Query: 255 LKPLKNAIIDNDHIYGIFRGNFENHGGHSSSPTSPNMLAQKQLLI 299
LK L +A+ D D I + RG+ N G S+ T+P+ A QLLI
Sbjct: 169 LKRLSDALRDGDPILAVIRGSAVNQDGRSNGITAPSGPA--QLLI 211
Score = 152 bits (387), Expect = 4e-42
Identities = 47/92 (51%), Positives = 62/92 (67%), Gaps = 2/92 (2%)
Query: 353 CGLGSVKANIGHLEAASGVIGVIKVLLMLKYKKIPGNPHLKIPNSYLKLDNTPFYLVNKT 412
+GSVK+NIGHLEAA+GV G+IKV+L LK+ IP H + PN ++ L+ +P + +
Sbjct: 209 LLIGSVKSNIGHLEAAAGVAGLIKVVLALKHGVIPPTLHFETPNPHIDLEESPLRVPTEL 268
Query: 413 CDWIQLDNNIPRRAGVSSFGVGGSNVHVIIEE 444
W PRRAGVSSFG GG+N HVI+EE
Sbjct: 269 TPW--PPPGRPRRAGVSSFGFGGTNAHVILEE 298
>gnl|CDD|215723 pfam00109, ketoacyl-synt, Beta-ketoacyl synthase, N-terminal
domain. The structure of beta-ketoacyl synthase is
similar to that of the thiolase family (pfam00108) and
also chalcone synthase. The active site of beta-ketoacyl
synthase is located between the N and C-terminal
domains. The N-terminal domain contains most of the
structures involved in dimer formation and also the
active site cysteine.
Length = 243
Score = 269 bits (691), Expect = 2e-87
Identities = 105/253 (41%), Positives = 132/253 (52%), Gaps = 19/253 (7%)
Query: 15 IAIIGIGGKFPGSKNINDFWRKLENNEDAITEVPTSRWDWKAIYGDPHLESGKTKVKWGG 74
+AI G+G +FPG +FW L DAI E P D +Y + G+
Sbjct: 4 VAITGMGCRFPGGVGPEEFWELLLAGRDAIREFPA---DLSGLYPPSRVA-GEIYG---- 55
Query: 75 FLYDADCFDANFFGISPAEAEVMDPQLRLFIETTWAALEDAGYPPSKLSGSK-TAIFAGV 133
FDA FFGISP EAE MDPQ RL +E W ALEDAG P+ L GS T +F G
Sbjct: 56 -----FDFDAAFFGISPREAEAMDPQQRLALEAAWEALEDAGLDPASLRGSDRTGVFVGS 110
Query: 134 STADYKDILNEARHKGLVKSLAEPFPF----MIANRVSYLFNFHGPSEVIDTACSSSLIA 189
+ DY ++ G + + A R+SY GPS +DTACSSSL+A
Sbjct: 111 GSGDYAELQALDSAGGPRRVSPYLTGAWMPSVAAGRISYRLGLRGPSVTVDTACSSSLVA 170
Query: 190 INRAIESLHLKNCDLALAGGVNILASPNITIASSKAG-LLSENGRCMTFDQRANGYVRSE 248
++ A+ S+ CDLALAGGV +P S AG LLS +G C FD A+G+VR E
Sbjct: 171 LHAAVRSIRRGECDLALAGGVEAPLTPGGFAGFSAAGALLSPDGPCKAFDPFADGFVRGE 230
Query: 249 GVGVILLKPLKNA 261
GVG +LLK L A
Sbjct: 231 GVGAVLLKELSEA 243
>gnl|CDD|234022 TIGR02813, omega_3_PfaA, polyketide-type polyunsaturated fatty acid
synthase PfaA. Members of the seed for this alignment
are involved in omega-3 polyunsaturated fatty acid
biosynthesis, such as the protein PfaA from the
eicosapentaenoic acid biosynthesis operon in
Photobacterium profundum strain SS9. PfaA is encoded
together with PfaB, PfaC, and PfaD, and the functions of
the individual polypeptides have not yet been described.
More distant homologs of PfaA, also included with the
reach of this model, appear to be involved in
polyketide-like biosynthetic mechanisms of
polyunsaturated fatty acid biosynthesis, an alternative
to the more familiar iterated mechanism of chain
extension and desaturation, and in most cases are
encoded near genes for homologs of PfaB, PfaC, and/or
PfaD.
Length = 2582
Score = 262 bits (671), Expect = 3e-76
Identities = 159/492 (32%), Positives = 243/492 (49%), Gaps = 41/492 (8%)
Query: 15 IAIIGIGGKFPGSKNINDFWRKLENNEDAITEVPTSRWDWKAIYGDPHLESGKTKVKWGG 74
IAI+G+ F S+ +N FW + DAIT+VP+ W Y E+ K+ K GG
Sbjct: 9 IAIVGMASIFANSRYLNKFWDLIFEKIDAITDVPSDHWAKDDYYDSDKSEADKSYCKRGG 68
Query: 75 FLYDADCFDANFFGISPAEAEVMDPQLRLFIETTWAALEDAGYPPS-------------- 120
FL + D F+ FG+ P E+ D L + L DAG P
Sbjct: 69 FLPEVD-FNPMEFGLPPNILELTDISQLLSLVVAKEVLNDAGLPDGYDRDKIGITLGVGG 127
Query: 121 --KLSGSKTA---------IF--AGVSTADYKDILNEARHKGLVKSLAEPFPFMIAN--- 164
K S S A +F +GV D ++L + + FP + N
Sbjct: 128 GQKQSSSLNARLQYPVLKKVFKASGVEDED-SEMLIKKFQDQYIHWEENSFPGSLGNVIS 186
Query: 165 -RVSYLFNFHGPSEVIDTACSSSLIAINRAIESLHLKNCDLALAGGVNILASPNITIASS 223
R++ F+ G + V+D AC+ SL AI A+ L ++ + GGV SP + ++ S
Sbjct: 187 GRIANRFDLGGMNCVVDAACAGSLAAIRMALSELLEGRSEMMITGGVCTDNSPFMYMSFS 246
Query: 224 KAGLLSENGRCMTFDQRANGYVRSEGVGVILLKPLKNAIIDNDHIYGIFRGNFENHGGHS 283
K + N FD + G + EG+G++ LK L++A D D IY + +G + G
Sbjct: 247 KTPAFTTNEDIQPFDIDSKGMMIGEGIGMMALKRLEDAERDGDRIYAVIKGVGASSDGKF 306
Query: 284 SSPTSPNMLAQKQLLIDVYRRANINPYTINYIEAHGTGTKLGDPIEVNGLKSAFSELHEY 343
S +P Q + L Y A P+T IEAHGTGT GD E GL S FS+ ++
Sbjct: 307 KSIYAPRPEGQAKALKRAYDDAGFAPHTCGLIEAHGTGTAAGDVAEFGGLVSVFSQDNDQ 366
Query: 344 YKTPLLKPYCGLGSVKANIGHLEAASGVIGVIKVLLMLKYKKIPGNPHLKIPNSYLKLDN 403
K + LGSVK+ IGH ++ +G G+IK +L L +K +P ++ PN L ++N
Sbjct: 367 ------KQHIALGSVKSQIGHTKSTAGTAGMIKAVLALHHKVLPPTINVDQPNPKLDIEN 420
Query: 404 TPFYLVNKTCDWIQLDNNIPRRAGVSSFGVGGSNVHVIIEEYR-KNIKTKYIKENTVLVR 462
+PFYL +T W+Q ++ PRRAG+SSFG GG+N H+++EEY K+ + ++ V +
Sbjct: 421 SPFYLNTETRPWMQREDGTPRRAGISSFGFGGTNFHMVLEEYSPKHQRDDQYRQRAV-AQ 479
Query: 463 IIMLSAKTKKSL 474
++ +A +K+L
Sbjct: 480 TLLFTAANEKAL 491
>gnl|CDD|238430 cd00834, KAS_I_II, Beta-ketoacyl-acyl carrier protein (ACP)
synthase (KAS), type I and II. KASs are responsible for
the elongation steps in fatty acid biosynthesis. KASIII
catalyses the initial condensation and KAS I and II
catalyze further elongation steps by Claisen
condensation of malonyl-acyl carrier protein (ACP) with
acyl-ACP.
Length = 406
Score = 181 bits (461), Expect = 1e-51
Identities = 119/451 (26%), Positives = 176/451 (39%), Gaps = 70/451 (15%)
Query: 15 IAIIGIGGKFPGSKNINDFWRKLENNEDAITEVPTSRWDWKAIYGDPHLESGKTKVKWGG 74
+ I G+G P + +FW L I P +R+D + G
Sbjct: 3 VVITGLGAVTPLGNGVEEFWEALLAGRSGIR--PITRFDASGFP-----------SRIAG 49
Query: 75 FLYDADCFDANFFGISPAEAEVMDPQLRLFIETTWAALEDAGYPPSKLSGSKTAIFAGVS 134
+ D D D + E MD + + AL DAG P +L + + G
Sbjct: 50 EVPDFDPEDY----LDRKELRRMDRFAQFALAAAEEALADAGLDPEELDPERIGVVIGSG 105
Query: 135 TADYKDILNEAR---HKGLVKSLAEPFPFMIANR----VSYLFNFHGPSEVIDTACSSSL 187
I R KG + P + N V+ GP+ + TAC+S
Sbjct: 106 IGGLATIEEAYRALLEKGPRRVSPFFVPMALPNMAAGQVAIRLGLRGPNYTVSTACASGA 165
Query: 188 IAINRAIESLHLKNCDLALAGGVNILASPNITIAS-SKAGLLSENG-----RCMTFDQRA 241
AI A + L D+ +AGG L +P +T+A + LS FD+
Sbjct: 166 HAIGDAARLIRLGRADVVIAGGAEALITP-LTLAGFAALRALSTRNDDPEKASRPFDKDR 224
Query: 242 NGYVRSEGVGVILLKPLKNAIIDNDHIYGIFRGNFENHGGHSS---SPTSPN------ML 292
+G+V EG GV++L+ L++A IY G G SS T+P+
Sbjct: 225 DGFVLGEGAGVLVLESLEHAKARGAKIYAEILG-----YGASSDAYHITAPDPDGEGAAR 279
Query: 293 AQKQLLIDVYRRANINPYTINYIEAHGTGTKLGDPIEVNGLKSAFSELHEYYKTPLLKPY 352
A + L A ++P I+YI AHGT T L D E +K F E K P+
Sbjct: 280 AMRAAL----ADAGLSPEDIDYINAHGTSTPLNDAAESKAIKRVFGE--HAKKVPVS--- 330
Query: 353 CGLGSVKANIGHLEAASGVIGVIKVLLMLKYKKIPGNPHLKIPNSYLKLDNTPFYLVNKT 412
S K+ GHL A+G + I LL L+ +P +L+ P+ LD Y+ N+
Sbjct: 331 ----STKSMTGHLLGAAGAVEAIATLLALRDGVLPPTINLEEPDPECDLD----YVPNEA 382
Query: 413 CDWIQLDNNIPRRAGVS-SFGVGGSNVHVII 442
+ P R +S SFG GG N ++
Sbjct: 383 REA-------PIRYALSNSFGFGGHNASLVF 406
>gnl|CDD|238421 cd00825, decarbox_cond_enzymes, decarboxylating condensing enzymes;
Family of enzymes that catalyze the formation of a new
carbon-carbon bond by a decarboxylating Claisen-like
condensation reaction. Members are involved in the
synthesis of fatty acids and polyketides, a diverse
group of natural products. Both pathways are an
iterative series of additions of small carbon units,
usually acetate, to a nascent acyl group. There are 2
classes of decarboxylating condensing enzymes, which can
be distinguished by sequence similarity, type of active
site residues and type of primer units (acetyl CoA or
acyl carrier protein (ACP) linked units).
Length = 332
Score = 167 bits (425), Expect = 2e-47
Identities = 86/344 (25%), Positives = 149/344 (43%), Gaps = 27/344 (7%)
Query: 102 RLFIETTWAALEDAGYPPSKLSGSKTAIFAGVSTADYKDILNEA---RHKGLVKSLAEPF 158
L E A+ DAG + G + + A R G F
Sbjct: 13 ILGFEAAERAIADAGLSREYQKNPIVGVVVGTGGGSPRFQVFGADAMRAVGPYVVTKAMF 72
Query: 159 PFMIANRVSYLFNFHGPSEVIDTACSSSLIAINRAIESLHLKNCDLALAGGVNILASPNI 218
P + +++ HGP+ + AC+ SL A++ A +++ D+ LAGG LA+P
Sbjct: 73 PG-ASGQIATPLGIHGPAYDVSAACAGSLHALSLAADAVQNGKQDIVLAGGSEELAAPMD 131
Query: 219 TIASSKAGLLSENGRCMTFDQRANGYVRSEGVGVILLKPLKNAIIDNDHIYGIFRGNFEN 278
+ L + TFD A+G+V +G G ++++ L++A+ HIY G
Sbjct: 132 CEFDAMGALSTPEKASRTFDAAADGFVFGDGAGALVVEELEHALARGAHIYAEIVGTAAT 191
Query: 279 HGGHSSSPTSPNMLAQKQLLIDVYRRANINPYTINYIEAHGTGTKLGDPIEVNGLKSAFS 338
G +P+ + + A + + I+Y+ AHGTGT +GD E+ L+S F
Sbjct: 192 IDGAGMGAFAPSAEGLARAAKEALAVAGLTVWDIDYLVAHGTGTPIGDVKELKLLRSEFG 251
Query: 339 ELHEYYKTPLLKPYCGLGSVKANIGHLEAASGVIGVIKVLLMLKYKKIPGNPHLKIPNSY 398
K + + KA G+L +A+ V+ V + +LML++ IP + H+
Sbjct: 252 ----------DKSP-AVSATKAMTGNLSSAAVVLAVDEAVLMLEHGFIPPSIHI------ 294
Query: 399 LKLDNTPFYLVNKTCDWIQLDNNIPRRAGVSSFGVGGSNVHVII 442
+LD +V +T R A ++ FG+GG+N +++
Sbjct: 295 EELDEAGLNIVTETTP------RELRTALLNGFGLGGTNATLVL 332
>gnl|CDD|238424 cd00828, elong_cond_enzymes, "elongating" condensing enzymes are a
subclass of decarboxylating condensing enzymes,
including beta-ketoacyl [ACP] synthase, type I and II
and polyketide synthases.They are characterized by the
utlization of acyl carrier protein (ACP) thioesters as
primer substrates, as well as the nature of their active
site residues.
Length = 407
Score = 147 bits (372), Expect = 4e-39
Identities = 102/448 (22%), Positives = 165/448 (36%), Gaps = 63/448 (14%)
Query: 15 IAIIGIGGKFP---GSKNINDFWRKLENNEDAITEVPTSRWDWKAIYGDPHLESGKTKVK 71
+ I GIG P G + +FW L I V L+S +
Sbjct: 3 VVITGIGVVSPHGEGCDEVEEFWEALREGRSGIAPVA-------------RLKSRFDRGV 49
Query: 72 WGGFLYDADCFDANFFGISPAEA---EVMDPQLRLFIETTWAALEDAGY-PPSKLSGSKT 127
G I +A ++D L + T AL DAG P ++ S+
Sbjct: 50 AG---------QIPTGDIPGWDAKRTGIVDRTTLLALVATEEALADAGITDPYEVHPSEV 100
Query: 128 AIFAGVSTADY----KDILNEARHKG-LVKSLAEPFPFMIANRVSYLFNF-HGPSEVIDT 181
+ G + +AR V P +A V+ L HGP +
Sbjct: 101 GVVVGSGMGGLRFLRRGGKLDARAVNPYVSPKWMLSPNTVAGWVNILLLSSHGPIKTPVG 160
Query: 182 ACSSSLIAINRAIESLHLKNCDLALAGGVNILASPNITIASSKAGLLSE-----NGRCMT 236
AC+++L A++ A+E++ D+ + GGV + G LS
Sbjct: 161 ACATALEALDLAVEAIRSGKADIVVVGGVEDP-LEEGLSGFANMGALSTAEEEPEEMSRP 219
Query: 237 FDQRANGYVRSEGVGVILLKPLKNAIIDNDHIYGIFRGNFENH--GGHSSSPTSPNMLAQ 294
FD+ +G+V +EG GV++L+ + A+ IYG G G S +
Sbjct: 220 FDETRDGFVEAEGAGVLVLERAELALARGAPIYGRVAGTASTTDGAGRSVPAGGKGIARA 279
Query: 295 KQLLIDVYRRANINPYTINYIEAHGTGTKLGDPIEVNGLKSAFSELHEYYKTPLLKPYCG 354
+ + +A ++ ++ I AHGT T D E + L PL P
Sbjct: 280 IRTAL---AKAGLSLDDLDVISAHGTSTPANDVAESRAIAEVAGALGA----PL--P--- 327
Query: 355 LGSVKANIGHLEAASGVIGVIKVLLMLKYKKIPGNPHLKIPNSYLKLDNTPFYLVNKTCD 414
+ + KA GH + A+G + +I L L++ IP +L + P
Sbjct: 328 VTAQKALFGHSKGAAGALQLIGALQSLEHGLIPPTANLDDVD--------PDVEHLSVVG 379
Query: 415 WIQLDNNIPRRAGVSSFGVGGSNVHVII 442
+ N R A V++FG GGSN +++
Sbjct: 380 LSRDLNLKVRAALVNAFGFGGSNAALVL 407
>gnl|CDD|223381 COG0304, FabB, 3-oxoacyl-(acyl-carrier-protein) synthase [Lipid
metabolism / Secondary metabolites biosynthesis,
transport, and catabolism].
Length = 412
Score = 145 bits (367), Expect = 3e-38
Identities = 112/455 (24%), Positives = 167/455 (36%), Gaps = 78/455 (17%)
Query: 15 IAIIGIGGKFPGSKNINDFWRKL---ENNEDAITEVPTSRWDWKAIYGDPHLESGKTKVK 71
+ I G+G + + W L ++ IT S VK
Sbjct: 5 VVITGLGIVSSLGNGVEEVWAALLAGKSGIRPITRFDASGLG----------------VK 48
Query: 72 WGGFLYDADCFDANFFGISPAEAEVMDPQLRLFIETTWAALEDAGYPPSKLSGSKTAIFA 131
G + D I+ E MD +L + ALEDAG + +
Sbjct: 49 IAG---EIKDLDD---QIAKKERRFMDRFSQLAVVAAVEALEDAGLDNELNVDMRVGVAI 102
Query: 132 GVSTADYKDILNEA---RHKGLVKSLAEPF------PFMIANRVSYLFNFHGPSEVIDTA 182
G +DI + +GL K ++ PF P + A V+ +F GP+ TA
Sbjct: 103 GSGIGGLEDIEFDLDALLLEGLRKRIS-PFLVPKMLPNLAAGNVAIVFGLKGPNYTPVTA 161
Query: 183 CSSSLIAINRAIESLHLKNCDLALAGGVNILASPNITIAS-SKAGLLS-----ENGRCMT 236
C++ AI A+ + L D+ +AGG +P + IA LS
Sbjct: 162 CATGAHAIGDAVRLIRLGKADVVIAGGAEAAITP-LGIAGFEAMRALSTRNDDPEKASRP 220
Query: 237 FDQRANGYVRSEGVGVILLKPLKNAIIDNDHIYGIFRGNFENHGG-HSSSP---TSPNML 292
FD+ +G+V EG G ++L+ L++A+ IY G H ++P +
Sbjct: 221 FDKNRDGFVIGEGAGALVLEELEHALARGAKIYAEIVGYGTTSDAYHITAPAPDGEGAIR 280
Query: 293 AQKQLLIDVYRRANINPYTINYIEAHGTGTKLGDPIEVNGLKSAFSELHEYYKTPLLKPY 352
A + L A + P I+YI AHGT T D E +K F E H
Sbjct: 281 AMRAAL----ADAGLTPEDIDYINAHGTSTPANDKAESLAIKRVFGE-HAK------SLP 329
Query: 353 CGLGSVKANIGHLEAASGVIGVIKVLLMLKYKKIPGNPHLKIPNSYLKLDNTPFYLVNKT 412
S K+ GH A+G + I LL L+ IP +L P D L
Sbjct: 330 V--SSTKSLTGHTLGAAGAVEAIISLLALRDGIIPPTINLDNP------DPEAADL---- 377
Query: 413 CDWIQLDNNIPRRAGV-----SSFGVGGSNVHVII 442
D + N R V +SFG GG+N ++
Sbjct: 378 -DVV---PNEARTGAVRAALSNSFGFGGTNASLVF 408
>gnl|CDD|217236 pfam02801, Ketoacyl-synt_C, Beta-ketoacyl synthase, C-terminal
domain. The structure of beta-ketoacyl synthase is
similar to that of the thiolase family (pfam00108) and
also chalcone synthase. The active site of beta-ketoacyl
synthase is located between the N and C-terminal
domains.
Length = 119
Score = 135 bits (343), Expect = 3e-38
Identities = 53/126 (42%), Positives = 73/126 (57%), Gaps = 8/126 (6%)
Query: 269 YGIFRGNFENHGG-HSSSPTSPNMLAQKQLLIDVYRRANINPYTINYIEAHGTGTKLGDP 327
Y + RG+ N G + T+PN AQ + + A ++P ++Y+EAHGTGT LGDP
Sbjct: 1 YAVIRGSAVNQDGAAHNGLTAPNGPAQARAIRAALADAGLDPEDVDYVEAHGTGTPLGDP 60
Query: 328 IEVNGLKSAFSELHEYYKTPLLKPYCGLGSVKANIGHLEAASGVIGVIKVLLMLKYKKIP 387
IE LK+ F PL +GSVK+NIGHLEAA+GV G+IK +L L++ IP
Sbjct: 61 IEAEALKAVFGP--GRDSQPLP-----VGSVKSNIGHLEAAAGVAGLIKAVLALRHGVIP 113
Query: 388 GNPHLK 393
+L
Sbjct: 114 PTLNLD 119
>gnl|CDD|235653 PRK05952, PRK05952, 3-oxoacyl-(acyl carrier protein) synthase II;
Reviewed.
Length = 381
Score = 111 bits (280), Expect = 1e-26
Identities = 74/264 (28%), Positives = 111/264 (42%), Gaps = 40/264 (15%)
Query: 182 ACSSSLIAINRAIESLHLKNCDLALAGGVNILASPNITIAS-SKAGLLSENGRCMTFDQR 240
AC++ L AI + +E + C +AG V +P +T+A + G L++ G FD++
Sbjct: 145 ACATGLWAIAQGVELIQTGQCQRVIAGAVEAPITP-LTLAGFQQMGALAKTG-AYPFDRQ 202
Query: 241 ANGYVRSEGVGVILLKPLKNAIIDNDHIYGIFRG-NFENHGGHSSSPTSPN---MLAQKQ 296
G V EG +++L+ + A IYG G H S+P + A +Q
Sbjct: 203 REGLVLGEGGAILVLESAELAQKRGAKIYGQILGFGLTCDAYHMSAPEPDGKSAIAAIQQ 262
Query: 297 LLIDVYRRANINPYTINYIEAHGTGTKLGDPIEVNGLKSAFSELHEYYKTPLLKPYCGLG 356
L R+ + P I+YI AHGT T+L D E N +++ F +
Sbjct: 263 CL----ARSGLTPEDIDYIHAHGTATRLNDQREANLIQALFP------------HRVAVS 306
Query: 357 SVKANIGHLEAASGVIGVIKVLLMLKYKKIPGNPHLKIPNSYLKLDNTP--FYLVNKTCD 414
S K GH ASG +GV LL L+++++P L+ P L L N C
Sbjct: 307 STKGATGHTLGASGALGVAFSLLALRHQQLPPCVGLQEPEFDLNFVRQAQQSPLQNVLC- 365
Query: 415 WIQLDNNIPRRAGVSSFGVGGSNV 438
SFG GG N
Sbjct: 366 --------------LSFGFGGQNA 375
>gnl|CDD|200247 TIGR03150, fabF, beta-ketoacyl-acyl-carrier-protein synthase II.
3-oxoacyl-[acyl-carrier-protein] synthase 2 (KAS-II,
FabF) is involved in the condensation step of fatty acid
biosynthesis in which the malonyl donor group is
decarboxylated and the resulting carbanion used to
attack and extend the acyl group attached to the acyl
carrier protein. Most genomes encoding fatty acid
biosynthesis contain a number of condensing enzymes,
often of all three types: 1, 2 and 3. Synthase 2 is
mechanistically related to synthase 1 (KAS-I, FabB)
containing a number of absolutely conserved catalytic
residues in common. This model is based primarily on
genes which are found in apparent operons with other
essential genes of fatty acid biosynthesis
(GenProp0681). The large gap between the trusted cutoff
and the noise cutoff contains many genes which are not
found adjacent to genes of the fatty acid pathway in
genomes that often also contain a better hit to this
model. These genes may be involved in other processes
such as polyketide biosyntheses. Some genomes contain
more than one above-trusted hit to this model which may
result from recent paralogous expansions. Second hits to
this model which are not next to other fatty acid
biosynthesis genes may be involved in other processes.
FabB sequences should fall well below the noise cutoff
of this model [Fatty acid and phospholipid metabolism,
Biosynthesis].
Length = 407
Score = 111 bits (280), Expect = 2e-26
Identities = 119/459 (25%), Positives = 178/459 (38%), Gaps = 92/459 (20%)
Query: 17 IIGIGGKFPGSKNINDFWRKLENNE---DAITEVPTSRWDWKAIYGDPHLESGKTKVKWG 73
+ G+G P + +FW L + IT S VK
Sbjct: 5 VTGLGAVTPLGNGVEEFWENLLAGKSGIGPITRFDAS----------------DLPVKIA 48
Query: 74 GFLYDADCFDANFFGISPAEAEVMDPQLRLFIETTWAALEDAGYPPSKLSGSKTAIFAGV 133
G + D FD + I EA MD ++ + A+ED+G + + + G
Sbjct: 49 GEVKD---FDPEDY-IDKKEARRMDRFIQYALAAAKEAVEDSGLDIEEEDAERVGVIIGS 104
Query: 134 STADYKDILNEAR---HKGLVKSLAEPF--PFMIAN----RVSYLFNFHGPSEVIDTACS 184
+ I + KG + PF P I N ++S + GP+ + TAC+
Sbjct: 105 GIGGLETIEEQHIVLLEKGPRR--VSPFFIPMSIINMAAGQISIRYGAKGPNHAVVTACA 162
Query: 185 SSLIAINRAIESLHLKNCDLALAGGVNILASPNITIA---SSKAGLLSENGR----CMTF 237
+ AI A + + D+ +AGG +P + IA + KA L + N F
Sbjct: 163 TGTHAIGDAFRLIQRGDADVMIAGGAEAAITP-LGIAGFAAMKA-LSTRNDDPEKASRPF 220
Query: 238 DQRANGYVRSEGVGVILLKPLKNAIIDNDHIYGIFRGNFENHGGHSSSP----TSPN--- 290
D+ +G+V EG GV++L+ L++A IY E G S T+P
Sbjct: 221 DKDRDGFVMGEGAGVLVLEELEHAKARGAKIYA------EIVGYGMSGDAYHITAPAPEG 274
Query: 291 ---MLAQKQLLIDVYRRANINPYTINYIEAHGTGTKLGDPIEVNGLKSAFSELHEYYKTP 347
A + L + A INP ++YI AHGT T LGD E +K F + H YK
Sbjct: 275 EGAARAMRAAL----KDAGINPEDVDYINAHGTSTPLGDKAETKAIKRVFGD-HA-YKLA 328
Query: 348 LLKPYCGLGSVKANIGHLEAASGVIGVIKVLLMLKYKKIPGNPHLKIPNSYLKLDNTPFY 407
+ S K+ GHL A+G I I +L L+ +P +L P+ LD P
Sbjct: 329 -------VSSTKSMTGHLLGAAGAIEAIFTVLALRDGIVPPTINLDNPDPECDLDYVP-- 379
Query: 408 LVNKTCDWIQLDNNIPRRAGVS-----SFGVGGSNVHVI 441
N R A + SFG GG N ++
Sbjct: 380 -------------NEAREAKIDYALSNSFGFGGHNASLV 405
>gnl|CDD|240245 PTZ00050, PTZ00050, 3-oxoacyl-acyl carrier protein synthase;
Provisional.
Length = 421
Score = 108 bits (272), Expect = 2e-25
Identities = 97/431 (22%), Positives = 154/431 (35%), Gaps = 45/431 (10%)
Query: 29 NINDFWRKL---ENNEDAITEVPTSRWDWKAIYGDPHLESGKTKVKWGGFLYDADCFDAN 85
W L ++ +TE P D + + ++ FD +
Sbjct: 8 GAESTWEALIAGKSGIRKLTEFPKFLPDCIPEQKALENLVAAMPCQIAAEVDQSE-FDPS 66
Query: 86 FFGISPAEAEVMDPQLRLFIETTWAALEDAGY-PPSKLSGSKTAIFAGV---STADYKDI 141
F + E + AL DA S+ + + G S AD D
Sbjct: 67 DFAPTKRE----SRATHFAMAAAREALADAKLDILSEKDQERIGVNIGSGIGSLADLTDE 122
Query: 142 LNEARHKGLVKSLAEPFPFMIANR----VSYLFNFHGPSEVIDTACSSSLIAINRAIESL 197
+ KG + P ++ N V+ GPS TAC++ I A +
Sbjct: 123 MKTLYEKGHSRVSPYFIPKILGNMAAGLVAIKHKLKGPSGSAVTACATGAHCIGEAFRWI 182
Query: 198 HLKNCDLALAGGVNILASPNITIASSKAGLLSE--NGR----CMTFDQRANGYVRSEGVG 251
D+ + GG +P S+ L N FD+ G+V EG G
Sbjct: 183 KYGEADIMICGGTEASITPVSFAGFSRMRALCTKYNDDPQRASRPFDKDRAGFVMGEGAG 242
Query: 252 VILLKPLKNAIIDNDHIYGIFRGNFENHGGHSSSPTSPN----MLAQKQLLIDVYRRANI 307
+++L+ L++A+ IY RG + H + P+ + L D ANI
Sbjct: 243 ILVLEELEHALRRGAKIYAEIRGYGSSSDAHHITAPHPDGRGARRCMENALKD---GANI 299
Query: 308 NPYTINYIEAHGTGTKLGDPIEVNGLKSAFSELHEYYKTPLLKPYCGLGSVKANIGHLEA 367
N ++Y+ AH T T +GD IE+ +K F P + S K +GHL
Sbjct: 300 NINDVDYVNAHATSTPIGDKIELKAIKKVFG--------DSGAPKLYVSSTKGGLGHLLG 351
Query: 368 ASGVIGVIKVLLMLKYKKIPGNPHLKIPNSYLKLDNTPFYLVNKTCDWIQLDNNIPRRAG 427
A+G + I +L L + IP +L+ P++ L+ + KT
Sbjct: 352 AAGAVESIVTILSLYEQIIPPTINLENPDAECDLN----LVQGKT----AHPLQSIDAVL 403
Query: 428 VSSFGVGGSNV 438
+SFG GG N
Sbjct: 404 STSFGFGGVNT 414
>gnl|CDD|215449 PLN02836, PLN02836, 3-oxoacyl-[acyl-carrier-protein] synthase.
Length = 437
Score = 98.7 bits (246), Expect = 4e-22
Identities = 97/350 (27%), Positives = 147/350 (42%), Gaps = 48/350 (13%)
Query: 111 ALEDAGYPPSKLSGS-KTAIFAGVSTADYKDILNEARHKGLVKSLAEPFPF--------M 161
AL DA + PS+ +T + G DIL EA K L PF M
Sbjct: 104 ALSDARWLPSEDEAKERTGVSIGGGIGSITDIL-EAAQLICEKRLRRLSPFFVPRILINM 162
Query: 162 IANRVSYLFNFHGPSEVIDTACSSSLIAINRAIESLHLKNCDLALAGGVNILASPNITIA 221
A VS + F GP+ TAC++ +I A + + D+ +AGG ++IA
Sbjct: 163 AAGHVSIRYGFQGPNHAAVTACATGAHSIGDAFRMIQFGDADVMVAGGTESSIDA-LSIA 221
Query: 222 S-SKAGLLS--------ENGRCMTFDQRANGYVRSEGVGVILLKPLKNAIIDNDHIYGIF 272
S++ LS E R FD +G+V EG GV++L+ L++A IY
Sbjct: 222 GFSRSRALSTKFNSCPTEASR--PFDCDRDGFVIGEGAGVLVLEELEHAKRRGAKIYAEV 279
Query: 273 RGNFENHGGHSSSPTSPN----MLAQKQLLIDVYRRANINPYTINYIEAHGTGTKLGDPI 328
RG + H + + +LA + L +++ ++P ++Y+ AH T T LGD +
Sbjct: 280 RGYGMSGDAHHITQPHEDGRGAVLAMTRAL----QQSGLHPNQVDYVNAHATSTPLGDAV 335
Query: 329 EVNGLKSAFSELHEYYKTPLLKPYCGLGSVKANIGHLEAASGVIGVIKVLLMLKYKKIPG 388
E +K+ FSE H S K GHL A+G + I +L + + P
Sbjct: 336 EARAIKTVFSE-HATSGG------LAFSSTKGATGHLLGAAGAVEAIFSVLAIHHGIAPP 388
Query: 389 NPHLKIPNSYLKLDNTPFYLVNKTCDWIQLDNNIPRRAGVS-SFGVGGSN 437
+L+ P+ P L RA +S SFG GG+N
Sbjct: 389 TLNLERPDPIFDDGFVP--LTASKAM--------LIRAALSNSFGFGGTN 428
>gnl|CDD|181539 PRK08722, PRK08722, 3-oxoacyl-(acyl carrier protein) synthase II;
Reviewed.
Length = 414
Score = 97.8 bits (243), Expect = 6e-22
Identities = 109/450 (24%), Positives = 185/450 (41%), Gaps = 62/450 (13%)
Query: 15 IAIIGIGGKFPGSKNINDFWRKLENNEDAITEVPTSRWDWKAIYGDPHLESGKTKVKWGG 74
+ + G+G P + W+ L + I + H ++ ++ G
Sbjct: 6 VVVTGMGMLSPVGNTVESSWKALLAGQSGIVNIE-------------HFDTTNFSTRFAG 52
Query: 75 FLYDADCFDANFFGISPAEAEVMDPQLRLFIETTWAALEDAGYPPSKLSGSK--TAIFAG 132
+ D +C + +S +A MD ++ I AL+D+G ++ + + AI +G
Sbjct: 53 LVKDFNCEEY----MSKKDARKMDLFIQYGIAAGIQALDDSGLEVTEENAHRIGVAIGSG 108
Query: 133 VSTADYKDILNEARHKGLV-KSLAEPFPF--------MIANRVSYLFNFHGPSEVIDTAC 183
+ L EA H+ LV K + PF MIA +S + GP+ I TAC
Sbjct: 109 IGGLG----LIEAGHQALVEKGPRKVSPFFVPSTIVNMIAGNLSIMRGLRGPNIAISTAC 164
Query: 184 SSSLIAINRAIESLHLKNCDLALAGGVNILASPNITIASSKAGLLSENGR-----CMTFD 238
++ L I A + + D +AGG ++P A LS +D
Sbjct: 165 TTGLHNIGHAARMIAYGDADAMVAGGAEKASTPLGMAGFGAAKALSTRNDEPQKASRPWD 224
Query: 239 QRANGYVRSEGVGVILLKPLKNAIIDNDHIYGIFRG-NFENHGGHSSSPT---SPNMLAQ 294
+ +G+V +G G+++L+ ++A IY G H +SP+ S LA
Sbjct: 225 KDRDGFVLGDGAGMMVLEEYEHAKARGAKIYAELVGFGMSGDAYHMTSPSEDGSGGALAM 284
Query: 295 KQLLIDVYRRANINPYTINYIEAHGTGTKLGDPIEVNGLKSAFSELHEYYKTPLLKPYCG 354
+ + R A + I Y+ AHGT T GD E+ G+K A E K L+
Sbjct: 285 EAAM----RDAGVTGEQIGYVNAHGTSTPAGDVAEIKGIKRALGE--AGSKQVLVS---- 334
Query: 355 LGSVKANIGHLEAASGVIGVIKVLLMLKYKKIPGNPHLKIPNSYLKLDNTPFYLVNKTCD 414
S K+ GHL A+G + I ++ L + +P +L P L +D P + +
Sbjct: 335 --STKSMTGHLLGAAGSVEAIITVMSLVDQIVPPTINLDDPEEGLDIDLVPH--TARKVE 390
Query: 415 WIQLDNNIPRRAGVSSFGVGGSNVHVIIEE 444
++ A +SFG GG+N +I ++
Sbjct: 391 SMEY-------AICNSFGFGGTNGSLIFKK 413
>gnl|CDD|235781 PRK06333, PRK06333, 3-oxoacyl-(acyl carrier protein) synthase II;
Reviewed.
Length = 424
Score = 97.8 bits (244), Expect = 7e-22
Identities = 112/469 (23%), Positives = 184/469 (39%), Gaps = 90/469 (19%)
Query: 15 IAIIGIGGKFPGSKNINDFWRKLENNEDAITEVPTSRWDWKAIYGDPHLESGKTKVKWGG 74
I + G+G P + FW++L + I + P G K GG
Sbjct: 6 IVVTGMGAVSPLGCGVETFWQRLLAGQSGIRTLT----------DFP---VGDLATKIGG 52
Query: 75 FLYD--ADC---FDANFFGISPAEAEVMDPQLRLFIETTWAA----LEDAGYPPSKLSGS 125
+ D D FD + + + P + MD FI AA L AG+ P L
Sbjct: 53 QVPDLAEDAEAGFDPDRY-LDPKDQRKMDR----FILFAMAAAKEALAQAGWDPDTLEDR 107
Query: 126 K---TAIFAGV----STADYKDILNEARHKGLVKSLAEPF--PFMIAN----RVSYLFNF 172
+ T I +GV + A+ L+ + L PF P + N VS + F
Sbjct: 108 ERTATIIGSGVGGFPAIAEAVRTLDSRGPRRL-----SPFTIPSFLTNMAAGHVSIRYGF 162
Query: 173 HGPSEVIDTACSSSLIAINRAIESLHLKNCDLALAGGVNILASPNIT----IASSKAGLL 228
GP TAC++ + AI A + D+A+ GG A+ + A+++A L
Sbjct: 163 KGPLGAPVTACAAGVQAIGDAARLIRSGEADVAVCGGTE--AAIDRVSLAGFAAARA--L 218
Query: 229 SENGR------CMTFDQRANGYVRSEGVGVILLKPLKNAIIDNDHIYGIFRGNFENHGG- 281
S FD+ +G+V EG G+++++ L++A+ G +G
Sbjct: 219 STRFNDAPEQASRPFDRDRDGFVMGEGAGILVIETLEHALARGAPPLAELVG----YGTS 274
Query: 282 ----HSSSPTSPNMLAQKQLLIDVYRRANINPYTINYIEAHGTGTKLGDPIEVNGLKSAF 337
H ++ A++ +LI + R+A I P + ++ AH T T +GD EV +K F
Sbjct: 275 ADAYHMTAGPEDGEGARRAMLIAL-RQAGIPPEEVQHLNAHATSTPVGDLGEVAAIKKVF 333
Query: 338 SELHEYYKTPLLKPYCGLGSVKANIGHLEAASGVIGVIKVLLMLKYKKIPGNPHLKIPNS 397
+ + S K+ GHL A+G + I +L L+ + P +L+ P+
Sbjct: 334 GHV---SGL-------AVSSTKSATGHLLGAAGGVEAIFTILALRDQIAPPTLNLENPDP 383
Query: 398 YLK-LDNTPFYLVNKTCDWIQLDNNIPRRAGVSSFGVGGSNVHVIIEEY 445
+ LD NK A + FG GG N ++ +
Sbjct: 384 AAEGLDVVA----NKARPMDM------DYALSNGFGFGGVNASILFRRW 422
>gnl|CDD|235987 PRK07314, PRK07314, 3-oxoacyl-(acyl carrier protein) synthase II;
Reviewed.
Length = 411
Score = 97.2 bits (243), Expect = 1e-21
Identities = 108/392 (27%), Positives = 165/392 (42%), Gaps = 75/392 (19%)
Query: 89 ISPAEAEVMDPQLRLFIETTWAALEDAGYPPSKLSGSKT--AIFAGV----STADYKDIL 142
+S EA MD ++ I A+EDAG ++ + + I +G+ + + L
Sbjct: 61 MSRKEARRMDRFIQYGIAAAKQAVEDAGLEITEENADRIGVIIGSGIGGLETIEEQHITL 120
Query: 143 NEARHKGLVKSLAEPF--PFMIAN----RVSYLFNFHGPSEVIDTACSSSLIAINRAIES 196
E KG + PF P I N VS + GP+ I TAC++ AI A
Sbjct: 121 LE---KGPRR--VSPFFVPMAIINMAAGHVSIRYGAKGPNHSIVTACATGAHAIGDAARL 175
Query: 197 LHLKNCDLALAGGVNILASPNITIA---SSKAGLLSEN----GRCMTFDQRANGYVRSEG 249
+ + D+ +AGG +P + IA +++A L + N FD+ +G+V EG
Sbjct: 176 IAYGDADVMVAGGAEAAITP-LGIAGFAAARA-LSTRNDDPERASRPFDKDRDGFVMGEG 233
Query: 250 VGVILLKPLKNAIIDNDHIYGIFRGNFENHG-GHSSSP---TSPNM------LAQKQLLI 299
G+++L+ L++A IY E G G + T+P A K L
Sbjct: 234 AGILVLEELEHAKARGAKIYA------EVVGYGMTGDAYHMTAPAPDGEGAARAMKLAL- 286
Query: 300 DVYRRANINPYTINYIEAHGTGTKLGDPIEVNGLKSAFSELHEYYKTPLLKPYCGLGSVK 359
+ A INP I+YI AHGT T GD E +K F E YK + S K
Sbjct: 287 ---KDAGINPEDIDYINAHGTSTPAGDKAETQAIKRVFGE--HAYKVA-------VSSTK 334
Query: 360 ANIGHLEAASGVIGVIKVLLMLKYKKIPGNPHLKIPNSYLKLDNTPFYLVNKTCDWIQLD 419
+ GHL A+G + I +L ++ + IP +L P+ LD P
Sbjct: 335 SMTGHLLGAAGAVEAIFSVLAIRDQVIPPTINLDNPDEECDLDYVP-------------- 380
Query: 420 NNIPRRAGV-----SSFGVGGSNVHVIIEEYR 446
N R + +SFG GG+N ++ + Y
Sbjct: 381 -NEARERKIDYALSNSFGFGGTNASLVFKRYE 411
>gnl|CDD|235817 PRK06501, PRK06501, 3-oxoacyl-(acyl carrier protein) synthase II;
Reviewed.
Length = 425
Score = 95.1 bits (237), Expect = 5e-21
Identities = 92/322 (28%), Positives = 136/322 (42%), Gaps = 44/322 (13%)
Query: 136 ADYKDILNEARHKGLVKSLAEPFPF-MIANRVSYLFNFHGPSEVIDTACSSSLIAINRAI 194
Y +L AR G +L E F F IA+R++ F G + TAC+S AI +
Sbjct: 128 PSYDRLLRAARG-GRFDALHERFQFGSIADRLADRFGTRGLPISLSTACASGATAIQLGV 186
Query: 195 ESLHLKNCDLALAGGVNILASPNITIASSKAGLLSEN-----GRCMTFDQRANGYVRSEG 249
E++ D AL + S I S LS F + +G+V +EG
Sbjct: 187 EAIRRGETDRALCIATDGSVSAEALIRFSLLSALSTQNDPPEKASKPFSKDRDGFVMAEG 246
Query: 250 VGVILLKPLKNAIIDNDHIYGIFRGNFEN----HGGHSSSPTSPNMLAQKQLLIDVYRRA 305
G ++L+ L++A+ I GI G E H SS SP + A + L D A
Sbjct: 247 AGALVLESLESAVARGAKILGIVAGCGEKADSFHRTRSSPDGSPAIGAIRAALAD----A 302
Query: 306 NINPYTINYIEAHGTGTKLGDPIEVNGLKSAFSELHEYYKTPLLKPYCGLGSVKANIGHL 365
+ P I+YI AHGT T D +E GL + F E P + S K+ IGH
Sbjct: 303 GLTPEQIDYINAHGTSTPENDKMEYLGLSAVFGERLA--SIP-------VSSNKSMIGHT 353
Query: 366 EAASGVIGVIKVLLMLKYKKIPGNPHLKIPNSYLKLDNTPFYLVNKTCDWIQLDNNIPRR 425
A+G + + LL ++ ++P + P+ + LD P N+ R
Sbjct: 354 LTAAGAVEAVFSLLTIQTGRLPPTINYDNPDPAIPLDVVP---------------NVARD 398
Query: 426 AGVS-----SFGVGGSNVHVII 442
A V+ SFG GG N +++
Sbjct: 399 ARVTAVLSNSFGFGGQNASLVL 420
>gnl|CDD|238201 cd00327, cond_enzymes, Condensing enzymes; Family of enzymes that
catalyze a (decarboxylating or non-decarboxylating)
Claisen-like condensation reaction. Members are share
strong structural similarity, and are involved in the
synthesis and degradation of fatty acids, and the
production of polyketides, a diverse group of natural
products.
Length = 254
Score = 88.7 bits (220), Expect = 8e-20
Identities = 61/342 (17%), Positives = 112/342 (32%), Gaps = 97/342 (28%)
Query: 102 RLFIETTWAALEDAGYPPSKLSGSKTAIFAGVSTADYKDILNEARHKGLVKSLAEPFPFM 161
L E A+ DAG G + G + + F
Sbjct: 9 ELGFEAAEQAIADAGLS----KGPIVGVIVGTTGGSGE------------------FSG- 45
Query: 162 IANRVSYLFNF-HGPSEVIDTACSSSLIAINRAIESLHLKNCDLALAGGVNILASPNITI 220
A +++Y GP+ ++ AC++ L A+ A++ + D+ LAGG
Sbjct: 46 AAGQLAYHLGISGGPAYSVNQACATGLTALALAVQQVQNGKADIVLAGGSEE-------- 97
Query: 221 ASSKAGLLSENGRCMTFDQRANGYVRSEGVGVILLKPLKNAIIDNDHIYGIFRGNFENHG 280
+V +G +++ ++A+ H
Sbjct: 98 -----------------------FVFGDGAAAAVVESEEHALRRGAHPQAEIVS-TAATF 133
Query: 281 GHSSSPTSPNMLAQKQLLIDVYRRANINPYTINYIEAHGTGTKLGDPIEVNGLKSAFSEL 340
+S + + + A + P I+Y+EAHGTGT +GD +E+ +
Sbjct: 134 DGASMVPAVSGEGLARAARKALEGAGLTPSDIDYVEAHGTGTPIGDAVELALGLDPDG-V 192
Query: 341 HEYYKTPLLKPYCGLGSVKANIGHLEAASGVIGVIKVLLMLKYKKIPGNPHLKIPNSYLK 400
+ + GH A+G+ + ++LLML+++ IP P
Sbjct: 193 RSP----------AVSATLIMTGHPLGAAGLAILDELLLMLEHEFIPPTP---------- 232
Query: 401 LDNTPFYLVNKTCDWIQLDNNIPRRAGVSSFGVGGSNVHVII 442
PR + FG+GG+N V++
Sbjct: 233 --------------------REPRTVLLLGFGLGGTNAAVVL 254
>gnl|CDD|236265 PRK08439, PRK08439, 3-oxoacyl-(acyl carrier protein) synthase II;
Reviewed.
Length = 406
Score = 86.3 bits (214), Expect = 4e-18
Identities = 102/421 (24%), Positives = 166/421 (39%), Gaps = 84/421 (19%)
Query: 66 GKTKVKWGGFLYDADCFDANFFGISPA-------EAEVMDPQ--------LRLFIETTWA 110
G+ +K FDA+ F + A EVMDP+ ++L ++
Sbjct: 29 GECGIK------KITLFDASDFPVQIAGEITDFDPTEVMDPKEVKKADRFIQLGLKAARE 82
Query: 111 ALEDAGYPPSKLSGSKTAIFA-----GVSTADYKDILNEARHKGLVKSLAEPFPF----- 160
A++DAG+ P +L + + + G+ + I+ + + PF
Sbjct: 83 AMKDAGFLPEELDAERFGVSSASGIGGLPNIEKNSIICFEKGPRKIS------PFFIPSA 136
Query: 161 ---MIANRVSYLFNFHGPSEVIDTACSSSLIAINRAIESLHLKNCDLALAGGVNILASPN 217
M+ +S GP+ TAC++ AI A++++ L D L G P
Sbjct: 137 LVNMLGGFISIEHGLKGPNLSSVTACAAGTHAIIEAVKTIMLGGADKMLVVGAESAICP- 195
Query: 218 ITI---ASSKAGLLSENGRCMT----FDQRANGYVRSEGVGVILLKPLKNAIIDNDHIYG 270
+ I A+ KA L + N FD+ +G+V EG G ++L+ ++A IY
Sbjct: 196 VGIGGFAAMKA-LSTRNDDPKKASRPFDKDRDGFVMGEGAGALVLEEYESAKKRGAKIYA 254
Query: 271 IFRGNFENHGGHSSSPTSPNMLAQKQLLIDVYRRANINPYTINYIEAHGTGTKLGDPIEV 330
G E+ G ++ TSP + + A NP I+YI AHGT T D E
Sbjct: 255 EIIGFGES--GDANHITSPAPEGPLRAMKAALEMAG-NP-KIDYINAHGTSTPYNDKNET 310
Query: 331 NGLKSAFSELHEYYKTPLLKPYCGL-GSVKANIGHLEAASGVIGVIKVLLMLKYKKIPGN 389
LK F K S K IGH A+G I + ++ ++ +P
Sbjct: 311 AALKELFGS----------KEKVPPVSSTKGQIGHCLGAAGAIEAVISIMAMRDGILPPT 360
Query: 390 PHLKIPNSYLKLDNTPFYLVNKTCDWIQLDNNIPRRAGV-----SSFGVGGSNVHVIIEE 444
+ + P+ LD P N+ R+A + +SFG GG+N VI ++
Sbjct: 361 INQETPDPECDLDYIP---------------NVARKAELNVVMSNSFGFGGTNGVVIFKK 405
Query: 445 Y 445
Sbjct: 406 V 406
>gnl|CDD|180839 PRK07103, PRK07103, polyketide beta-ketoacyl:acyl carrier protein
synthase; Validated.
Length = 410
Score = 85.1 bits (211), Expect = 1e-17
Identities = 65/287 (22%), Positives = 121/287 (42%), Gaps = 36/287 (12%)
Query: 166 VSYLFNFHGPSEVIDTACSSSLIAI---NRAIESLHLKNCDLALAGGVNILASPNITIAS 222
S F G + A +S +A+ R ++S + C +A+ G + L+
Sbjct: 150 CSEQFGIRGEGFTVGGASASGQLAVIQAARLVQSGSVDAC-IAV-GALMDLSYWECQALR 207
Query: 223 SKAGLLSENGR------CMTFDQRANGYVRSEGVGVILLKPLKNAIIDNDHIYGIFRGNF 276
S + S+ C FDQ +G++ E G ++L+ ++A Y G
Sbjct: 208 SLGAMGSDRFADEPEAACRPFDQDRDGFIYGEACGAVVLESAESARRRGARPYAKLLGWS 267
Query: 277 ENHGGHSSSPTSPNMLAQKQLLIDVYRRANINPYTINYIEAHGTGTKLGDPIEVNGLKSA 336
+ P++ + +++ RRA + P I+Y+ HGTG+ LGD E+ L ++
Sbjct: 268 MRLDANRG--PDPSLEGEMRVIRAALRRAGLGPEDIDYVNPHGTGSPLGDETELAALFAS 325
Query: 337 FSELHEYYKTPLLKPYCGLGSVKANIGHLEAASGVIGVIKVLLMLKYKKIPGNPHLKIPN 396
L + + + K+ GH +A+G++ +I LL ++ + + +L P
Sbjct: 326 G----------LAHAW--INATKSLTGHGLSAAGIVELIATLLQMRAGFLHPSRNLDEP- 372
Query: 397 SYLKLDNTPFYLVNKTCDWIQLDNNIPRRAGVSSFGVGGSNVHVIIE 443
+D F V T + ++ R A SFG GG N +++E
Sbjct: 373 ----IDER-FRWVGSTAESARI-----RYALSLSFGFGGINTALVLE 409
>gnl|CDD|236398 PRK09185, PRK09185, 3-oxoacyl-(acyl carrier protein) synthase I;
Reviewed.
Length = 392
Score = 76.8 bits (190), Expect = 4e-15
Identities = 71/306 (23%), Positives = 101/306 (33%), Gaps = 75/306 (24%)
Query: 159 PFMIANRVSYLFNFHGPSEVIDTACSSSLIAINRAIESLHLKNCDLALAGGVNIL----- 213
+A+ + GP+ I TACSSS A L CD A+ GGV+ L
Sbjct: 136 LGSLADFLRAYLGLSGPAYTISTACSSSAKVFASARRLLEAGLCDAAIVGGVDSLCRLTL 195
Query: 214 ---ASPNITIASSKAGLLSENGRCMTFDQRANGYVRSEGVGVILL-KPLKNAII------ 263
S LS C F +G E LL + A+
Sbjct: 196 NGFNS---------LESLS-PQPCRPFSANRDGINIGEAAAFFLLEREDDAAVALLGVGE 245
Query: 264 --DNDHIYGIFRGNFENHGGHSSSPTSPN----MLAQKQLLIDVYRRANINPYTINYIEA 317
D H+ S+P P +LA +Q L A + P I YI
Sbjct: 246 SSDAHHM---------------SAP-HPEGLGAILAMQQAL----ADAGLAPADIGYINL 285
Query: 318 HGTGTKLGDPIEVNGLKSAFSELHEYYKTPLLKPYCGLGSVKANIGHLEAASGVIGVIKV 377
HGT T L D +E + + F + P S K GH A+G +
Sbjct: 286 HGTATPLNDAMESRAVAAVFGD-----GVP-------CSSTKGLTGHTLGAAGAVEAAIC 333
Query: 378 LLMLKYKKIPGNPHLKIPNSYLKLDNTPFYLVNKTCDWIQLDNNIPRRAGVS-SFGVGGS 436
L L++ P + P+ L ++ + R +S SF GG+
Sbjct: 334 WLALRHGLPPHGWNTGQPDPALPP-----------LYLVENAQALAIRYVLSNSFAFGGN 382
Query: 437 NVHVII 442
N +I
Sbjct: 383 NCSLIF 388
>gnl|CDD|215421 PLN02787, PLN02787, 3-oxoacyl-[acyl-carrier-protein] synthase II.
Length = 540
Score = 73.9 bits (181), Expect = 8e-14
Identities = 100/385 (25%), Positives = 163/385 (42%), Gaps = 59/385 (15%)
Query: 89 ISPAEAEVMDPQLRLFIETTWAALEDAGYPP---SKLSGSKTAIFAGVSTADYKDILNEA 145
++P ++ MD + + AL D G +L +K + G + K + N+A
Sbjct: 188 VAPKLSKRMDKFMLYLLTAGKKALADGGITEDVMKELDKTKCGVLIGSAMGGMK-VFNDA 246
Query: 146 RHKGLVKSL------AEPF--PFMIANRVSYLF----NFHGPSEVIDTACSSSLIAINRA 193
+++L PF PF N S + + GP+ I TAC++S I A
Sbjct: 247 -----IEALRISYRKMNPFCVPFATTNMGSAMLAMDLGWMGPNYSISTACATSNFCILNA 301
Query: 194 IESLHLKNCDLALAGGVNILASPNITI-----ASSKAGLLSENGRCMT-----FDQRANG 243
+ D+ L GG + + I I + +A LS+ T +D +G
Sbjct: 302 ANHIIRGEADVMLCGGSD---AAIIPIGLGGFVACRA--LSQRNDDPTKASRPWDMNRDG 356
Query: 244 YVRSEGVGVILLKPLKNAIIDNDHIYGIFRG-NFENHGGHSSSPTSPNMLAQKQLLID-V 301
+V EG GV+LL+ L++A +IY F G +F H + P A L I+
Sbjct: 357 FVMGEGAGVLLLEELEHAKKRGANIYAEFLGGSFTCDAYHMTEPHPEG--AGVILCIEKA 414
Query: 302 YRRANINPYTINYIEAHGTGTKLGDPIEVNGLKSAFSELHEYYKTPLLKPYCGLGSVKAN 361
++ ++ +NYI AH T TK GD E L F + P L+ + S K+
Sbjct: 415 LAQSGVSKEDVNYINAHATSTKAGDLKEYQALMRCFGQ------NPELR----VNSTKSM 464
Query: 362 IGHLEAASGVIGVIKVLLMLKYKKIPGNPHLKIPNSYLKLDNTPFYLVNKTCDWIQLDNN 421
IGHL A+G + I + ++ + N +L+ P S + LV + +LD
Sbjct: 465 IGHLLGAAGAVEAIATVQAIRTGWVHPNINLENPESGVDTK----VLVGPKKE--RLDIK 518
Query: 422 IPRRAGVSSFGVGGSNVHVIIEEYR 446
+ A +SFG GG N ++ Y+
Sbjct: 519 V---ALSNSFGFGGHNSSILFAPYK 540
>gnl|CDD|173154 PRK14691, PRK14691, 3-oxoacyl-(acyl carrier protein) synthase II;
Provisional.
Length = 342
Score = 72.1 bits (176), Expect = 1e-13
Identities = 60/246 (24%), Positives = 106/246 (43%), Gaps = 16/246 (6%)
Query: 161 MIANRVSYLFNFHGPSEVIDTACSSSLIAINRAIESLHLKNCDLALAGGVNILASPNITI 220
+ A VS +F GP TAC++ + AI A+ + D+AL GG +
Sbjct: 69 LAAGHVSIKHHFKGPIGAPVTACAAGVQAIGDAVRMIRNNEADVALCGGAEAVIDTVSLA 128
Query: 221 ASSKAGLLSENGRCMT------FDQRANGYVRSEGVGVILLKPLKNAIIDNDHIYGIFRG 274
+ A LS + FD +G+V EG G+++++ L++A+ G
Sbjct: 129 GFAAARALSTHFNSTPEKASRPFDTARDGFVMGEGAGLLIIEELEHALARGAKPLAEIVG 188
Query: 275 NFENHGGHSSSPTSPNMLAQKQLLIDVYRRANINPYTINYIEAHGTGTKLGDPIEVNGLK 334
+ + + + + + + R+A I P + ++ AH T T +GD E+N +K
Sbjct: 189 YGTSADAYHMTSGAEDGDGAYRAMKIALRQAGITPEQVQHLNAHATSTPVGDLGEINAIK 248
Query: 335 SAFSELHEYYKTPLLKPYCGLGSVKANIGHLEAASGVIGVIKVLLMLKYKKIPGNPHLKI 394
F E + T S K+ GHL A+G + I +L L+ + +P +L+
Sbjct: 249 HLFGESNALAIT----------STKSATGHLLGAAGGLETIFTVLALRDQIVPATLNLEN 298
Query: 395 PNSYLK 400
P+ K
Sbjct: 299 PDPAAK 304
>gnl|CDD|181184 PRK07967, PRK07967, 3-oxoacyl-(acyl carrier protein) synthase I;
Reviewed.
Length = 406
Score = 72.4 bits (178), Expect = 1e-13
Identities = 85/362 (23%), Positives = 147/362 (40%), Gaps = 62/362 (17%)
Query: 111 ALEDAGYPPSKLSGSKTAIFAGVSTADYKDILNEARHKGLVKSLAEPFPFMI----ANRV 166
A+ DAG ++S +T + AG ++ + A + P+ + A+ V
Sbjct: 82 AIADAGLSEEQVSNPRTGLIAGSGGGSTRNQVEAADAMRGPRGPKRVGPYAVTKAMASTV 141
Query: 167 SYL----FNFHGPSEVIDTACSSSLIAINRAIESLHLKNCDLALAGGVNILASPNITIAS 222
S F G + I +AC++S I A+E + L D+ AGG L ++
Sbjct: 142 SACLATPFKIKGVNYSISSACATSAHCIGNAVEQIQLGKQDIVFAGGGEEL-DWEMSCLF 200
Query: 223 SKAGLLSEN--------GRCMTFDQRANGYVRSEGVGVILLKPLKNAIIDNDHIYGIFRG 274
G LS R +D +G+V + G GV++++ L++A+ IY G
Sbjct: 201 DAMGALSTKYNDTPEKASR--AYDANRDGFVIAGGGGVVVVEELEHALARGAKIYAEIVG 258
Query: 275 NFENHGGHSSSPTSPNMLA---------QKQLLIDVYRRANINPYTINYIEAHGTGTKLG 325
G+ +M+A + L V I+YI HGT T +G
Sbjct: 259 YGATSDGY-------DMVAPSGEGAVRCMQMALATVDTP-------IDYINTHGTSTPVG 304
Query: 326 DPIEVNGLKSAFSELHEYYKTPLLKPYCGLGSVKANIGHLEAASGVIGVIKVLLMLKYKK 385
D E+ ++ F + K+P + + K+ GH A+GV I LLM+++
Sbjct: 305 DVKELGAIREVFGD-----KSP------AISATKSLTGHSLGAAGVQEAIYSLLMMEHGF 353
Query: 386 IPGNPHLKIPNSYLKLDNTPFYLVNKTCDWIQLDNNIPRRAGVSSFGVGGSNVHVIIEEY 445
I + + I + P +V +T D +L + +SFG GG+N ++ Y
Sbjct: 354 IAPSAN--IEELDPQAAGMP--IVTETTDNAELTTVMS-----NSFGFGGTNATLVFRRY 404
Query: 446 RK 447
+
Sbjct: 405 KG 406
>gnl|CDD|181657 PRK09116, PRK09116, 3-oxoacyl-(acyl carrier protein) synthase II;
Reviewed.
Length = 405
Score = 66.2 bits (162), Expect = 1e-11
Identities = 82/281 (29%), Positives = 109/281 (38%), Gaps = 41/281 (14%)
Query: 111 ALEDAGY--PPSKLSGSKTAIFAGVSTADYKDI------LNEARHKGL-VKSLAEPFPFM 161
ALEDAG P L+ + I G ST I L E G+ + P
Sbjct: 84 ALEDAGLLGDPI-LTDGRMGIAYGSSTGSTDPIGAFGTMLLEGSMSGITATTYVRMMPHT 142
Query: 162 IANRVSYLFNFHGPSEVIDT--ACSSSLIAINRAIESLHLKNCDLALAGGVNILASPNIT 219
A V F G VI T AC+S I A E++ + LAGG L
Sbjct: 143 TAVNVGLFFGLKG--RVIPTSSACTSGSQGIGYAYEAIKYGYQTVMLAGGAEELCPTEAA 200
Query: 220 I------ASSKAGLLSENGRCMTFDQRANGYVRSEGVGVILLKPLKNAIIDNDHIYGIFR 273
+ S++ R FD +G V EG G ++L+ L++A IY
Sbjct: 201 VFDTLFATSTRNDAPELTPR--PFDANRDGLVIGEGAGTLVLEELEHAKARGATIYAEIV 258
Query: 274 GNFEN-HGGHSSSPTSPNM-LAQKQLLIDVYRRANINPYTINYIEAHGTGTKLGDPIEVN 331
G N G H + P + M +A + L D A + P I Y+ AHGT T GD E
Sbjct: 259 GFGTNSDGAHVTQPQAETMQIAMELALKD----AGLAPEDIGYVNAHGTATDRGDIAESQ 314
Query: 332 GLKSAFSELHEYYKTPL--LKPYCG--LGSVKANIGHLEAA 368
+ F + P+ LK Y G LG+ G LEA
Sbjct: 315 ATAAVFGA-----RMPISSLKSYFGHTLGAC----GALEAW 346
>gnl|CDD|236129 PRK07910, PRK07910, 3-oxoacyl-(acyl carrier protein) synthase II;
Reviewed.
Length = 418
Score = 62.1 bits (151), Expect = 2e-10
Identities = 70/270 (25%), Positives = 112/270 (41%), Gaps = 39/270 (14%)
Query: 181 TACSSSLIAINRAIESLHLKNCDLALAGGVN--ILASPNITIASSKAGLLSEN----GRC 234
+AC+S AI +A + L D+A+ GGV I A P A + + + N G C
Sbjct: 169 SACASGSEAIAQAWRQIVLGEADIAICGGVETRIEAVPIAGFAQMRIVMSTNNDDPAGAC 228
Query: 235 MTFDQRANGYVRSEGVGVILLKPLKNAIIDNDHIYGIFRG-NFENHGGHSSSPTSPNMLA 293
FD+ +G+V EG +++++ ++A +I G + + G H +P PN
Sbjct: 229 RPFDKDRDGFVFGEGGALMVIETEEHAKARGANILARIMGASITSDGFHMVAP-DPNGER 287
Query: 294 QKQLLIDVYRRANINPYTINYIEAHGTGTKLGDPIEVNGLKSAFSELHEYYKTPLLKPYC 353
+ A + P I+++ AH TGT +GD E + +A P
Sbjct: 288 AGHAMTRAIELAGLTPGDIDHVNAHATGTSVGDVAEGKAINNALGGHRPAVYAP------ 341
Query: 354 GLGSVKANIGHLEAASGVIGVIKVLLMLKYKKIPGNPHLKIPNSYLKLDNTPFYLVNKTC 413
K+ +GH A G + I +L L+ IP +L+ + + LD +V
Sbjct: 342 -----KSALGHSVGAVGAVESILTVLALRDGVIPPTLNLENLDPEIDLD-----VVAGE- 390
Query: 414 DWIQLDNNIPRR-----AGVSSFGVGGSNV 438
PR A +SFG GG NV
Sbjct: 391 ---------PRPGNYRYAINNSFGFGGHNV 411
>gnl|CDD|238428 cd00832, CLF, Chain-length factor (CLF) is a factor required for
polyketide chain initiation of aromatic
antibiotic-producing polyketide synthases (PKSs) of
filamentous bacteria. CLFs have been shown to have
decarboxylase activity towards malonyl-acyl carrier
protein (ACP). CLFs are similar to other elongation
ketosynthase domains, but their active site cysteine is
replaced by a conserved glutamine.
Length = 399
Score = 57.8 bits (140), Expect = 6e-09
Identities = 84/368 (22%), Positives = 130/368 (35%), Gaps = 60/368 (16%)
Query: 98 DPQLRLFIETTWAALEDAGYPPSKLSGSKTAIFAGVSTA-----------DYKDILNE-A 145
D RL + AL DAG P+ L GV TA + + + ++
Sbjct: 69 DRMTRLALAAADWALADAGVDPAALPPYD----MGVVTASAAGGFEFGQRELQKLWSKGP 124
Query: 146 RHKGLVKSLAEPFPFMIAN--RVSYLFNFHGPSEVIDTACSSSLIAINRAIESLHLKNCD 203
RH +S A F N ++S GPS V+ + L A+ +A + +
Sbjct: 125 RHVSAYQSFAW---FYAVNTGQISIRHGMRGPSGVVVAEQAGGLDALAQARRLVR-RGTP 180
Query: 204 LALAGGVNILASPNITIASSKAGLLSENGR----CMTFDQRANGYVRSEGVGVILLKPLK 259
L ++GGV+ P +A +G LS + + FD A GYV EG +++L+
Sbjct: 181 LVVSGGVDSALCPWGWVAQLSSGRLSTSDDPARAYLPFDAAAAGYVPGEGGAILVLEDAA 240
Query: 260 NAIIDNDHIYGIFRGNFENHGGHSSSPTSPNMLAQKQLLIDVYRRANINPYTINYIEAHG 319
A +YG G S P + +L + A + P ++ + A
Sbjct: 241 AARERGARVYGEIAGYAATFDPPPGSGRPPGLARAIRLALA---DAGLTPEDVDVVFADA 297
Query: 320 TGTKLGDPIEVNGLKSAFSELHEYYKTPLLKPYCGLGSVKANIGHLEAASGVIGVIKVLL 379
G D E L + F P+ P K G L A + V LL
Sbjct: 298 AGVPELDRAEAAALAAVFGP----RGVPVTAP-------KTMTGRLYAGGAPLDVATALL 346
Query: 380 MLKYKKIPGNPHLKIPNSYLKLDNTPFYLVNKTCDWIQLDNNIPRRAGVS-----SFGVG 434
L+ IP ++ LD LV PR A + + G G
Sbjct: 347 ALRDGVIPPTVNVTDVPPAYGLD-----LVTGR----------PRPAALRTALVLARGRG 391
Query: 435 GSNVHVII 442
G N +++
Sbjct: 392 GFNSALVV 399
>gnl|CDD|215722 pfam00108, Thiolase_N, Thiolase, N-terminal domain. Thiolase is
reported to be structurally related to beta-ketoacyl
synthase (pfam00109), and also chalcone synthase.
Length = 262
Score = 39.6 bits (93), Expect = 0.002
Identities = 27/101 (26%), Positives = 39/101 (38%), Gaps = 21/101 (20%)
Query: 110 AALEDAGYPPSKLSGSKTAIFAGVSTADYKDILNEARHKGLVKSLAEPFPFMIANRVSYL 169
AALE AG P + I V A N AR L + + P + N+V
Sbjct: 34 AALERAGVKPEDVDE---VIMGNVLQAGEGQ--NPARQAALKAGIPDSVPAVTINKV--- 85
Query: 170 FNFHGPSEVIDTACSSSLIAINRAIESLHLKNCDLALAGGV 210
C S L A+ A +++ + D+ +AGGV
Sbjct: 86 -------------CGSGLKAVALAAQAIRAGDADIVVAGGV 113
>gnl|CDD|238425 cd00829, SCP-x_thiolase, Thiolase domain associated with sterol
carrier protein (SCP)-x isoform and related proteins;
SCP-2 has multiple roles in intracellular lipid
circulation and metabolism. The N-terminal presequence
in the SCP-x isoform represents a peroxisomal
3-ketacyl-Coa thiolase specific for branched-chain acyl
CoAs, which is proteolytically cleaved from the sterol
carrier protein.
Length = 375
Score = 37.2 bits (87), Expect = 0.018
Identities = 24/131 (18%), Positives = 43/131 (32%), Gaps = 24/131 (18%)
Query: 87 FGISPAEAEVMDPQLRLFIETTWAALEDAGYPPSKLSGSKTAIFAGVSTADYKDILNEAR 146
G++P L L E AAL+DAG + DI +A
Sbjct: 3 VGMTPFGRRSDRSPLELAAEAARAALDDAG-------------------LEPADI--DAV 41
Query: 147 HKGLVKSLAEPFPFMIANRVSYLFNFHG-PSEVIDTACSSSLIAINRAIESLHLKNCDLA 205
+ + F ++ G P+ ++ A +S A+ A ++ D+
Sbjct: 42 V--VGNAAGGRFQSFPGALIAEYLGLLGKPATRVEAAGASGSAAVRAAAAAIASGLADVV 99
Query: 206 LAGGVNILASP 216
L G ++
Sbjct: 100 LVVGAEKMSDV 110
>gnl|CDD|187656 cd08953, KR_2_SDR_x, ketoreductase (KR), subgroup 2, complex (x)
SDRs. Ketoreductase, a module of the multidomain
polyketide synthase (PKS), has 2 subdomains, each
corresponding to a SDR family monomer. The C-terminal
subdomain catalyzes the NADPH-dependent reduction of the
beta-carbonyl of a polyketide to a hydroxyl group, a
step in the biosynthesis of polyketides, such as
erythromycin. The N-terminal subdomain, an interdomain
linker, is a truncated Rossmann fold which acts to
stabilizes the catalytic subdomain. Unlike typical SDRs,
the isolated domain does not oligomerize but is composed
of 2 subdomains, each resembling an SDR monomer. The
active site resembles that of typical SDRs, except that
the usual positions of the catalytic Asn and Tyr are
swapped, so that the canonical YXXXK motif changes to
YXXXN. Modular PKSs are multifunctional structures in
which the makeup recapitulates that found in (and may
have evolved from) FAS. Polyketide synthesis also
proceeds via the addition of 2-carbon units as in fatty
acid synthesis. The complex SDR NADP-binding motif,
GGXGXXG, is often present, but is not strictly conserved
in each instance of the module. This subfamily includes
both KR domains of the Bacillus subtilis Pks J,-L, and
PksM, and all three KR domains of PksN, components of
the megacomplex bacillaene synthase, which synthesizes
the antibiotic bacillaene. SDRs are a functionally
diverse family of oxidoreductases that have a single
domain with a structurally conserved Rossmann fold
(alpha/beta folding pattern with a central beta-sheet),
an NAD(P)(H)-binding region, and a structurally diverse
C-terminal region. Classical SDRs are typically about
250 residues long, while extended SDRs are approximately
350 residues. Sequence identity between different SDR
enzymes are typically in the 15-30% range, but the
enzymes share the Rossmann fold NAD-binding motif and
characteristic NAD-binding and catalytic sequence
patterns. These enzymes catalyze a wide range of
activities including the metabolism of steroids,
cofactors, carbohydrates, lipids, aromatic compounds,
and amino acids, and act in redox sensing. Classical
SDRs have an TGXXX[AG]XG cofactor binding motif and a
YXXXK active site motif, with the Tyr residue of the
active site motif serving as a critical catalytic
residue (Tyr-151, human prostaglandin dehydrogenase
(PGDH) numbering). In addition to the Tyr and Lys, there
is often an upstream Ser (Ser-138, PGDH numbering)
and/or an Asn (Asn-107, PGDH numbering) contributing to
the active site; while substrate binding is in the
C-terminal region, which determines specificity. The
standard reaction mechanism is a 4-pro-S hydride
transfer and proton relay involving the conserved Tyr
and Lys, a water molecule stabilized by Asn, and
nicotinamide. Extended SDRs have additional elements in
the C-terminal region, and typically have a TGXXGXXG
cofactor binding motif. Complex (multidomain) SDRs such
as ketoreductase domains of fatty acid synthase have a
GGXGXXG NAD(P)-binding motif and an altered active site
motif (YXXXN). Fungal type KRs have a TGXXXGX(1-2)G
NAD(P)-binding motif. Some atypical SDRs have lost
catalytic activity and/or have an unusual NAD(P)-binding
motif and missing or unusual active site residues.
Reactions catalyzed within the SDR family include
isomerization, decarboxylation, epimerization, C=N bond
reduction, dehydratase activity, dehalogenation,
Enoyl-CoA reduction, and carbonyl-alcohol
oxidoreduction.
Length = 436
Score = 37.0 bits (86), Expect = 0.026
Identities = 14/19 (73%), Positives = 16/19 (84%)
Query: 496 DLAYTLQVGREAMKYRLAI 514
DLAYTLQVGREAM+ R +
Sbjct: 12 DLAYTLQVGREAMEERRRL 30
>gnl|CDD|223261 COG0183, PaaJ, Acetyl-CoA acetyltransferase [Lipid metabolism].
Length = 392
Score = 35.0 bits (81), Expect = 0.083
Identities = 12/32 (37%), Positives = 20/32 (62%)
Query: 179 IDTACSSSLIAINRAIESLHLKNCDLALAGGV 210
++ AC+S L A+ A +++ D+ LAGGV
Sbjct: 88 VNRACASGLAAVRLAAQAIASGEADVVLAGGV 119
>gnl|CDD|233642 TIGR01930, AcCoA-C-Actrans, acetyl-CoA acetyltransferases. This
model represents a large family of enzymes which
catalyze the thiolysis of a linear fatty acid CoA (or
acetoacetyl-CoA) using a second CoA molecule to produce
acetyl-CoA and a CoA-ester product two carbons shorter
(or, alternatively, the condensation of two molecules of
acetyl-CoA to produce acetoacetyl-CoA and CoA). This
enzyme is also known as "thiolase", "3-ketoacyl-CoA
thiolase", "beta-ketothiolase" and "Fatty oxidation
complex beta subunit". When catalyzing the degradative
reaction on fatty acids the corresponding EC number is
2.3.1.16. The condensation reaction corresponds to
2.3.1.9. Note that the enzymes which catalyze the
condensation are generally not involved in fatty acid
biosynthesis, which is carried out by a decarboxylating
condensation of acetyl and malonyl esters of acyl
carrier proteins. Rather, this activity may produce
acetoacetyl-CoA for pathways such as IPP biosynthesis in
the absence of sufficient fatty acid oxidation [Fatty
acid and phospholipid metabolism, Other].
Length = 386
Score = 33.7 bits (78), Expect = 0.25
Identities = 11/41 (26%), Positives = 20/41 (48%)
Query: 170 FNFHGPSEVIDTACSSSLIAINRAIESLHLKNCDLALAGGV 210
P+ ++ C+S L A+ A + + D+ +AGGV
Sbjct: 70 LPESVPAYTVNRQCASGLQAVILAAQLIRAGEADVVVAGGV 110
>gnl|CDD|238383 cd00751, thiolase, Thiolase are ubiquitous enzymes that catalyze
the reversible thiolytic cleavage of 3-ketoacyl-CoA into
acyl-CoA and acetyl-CoA, a 2-step reaction involving a
covalent intermediate formed with a catalytic cysteine.
They are found in prokaryotes and eukaryotes (cytosol,
microbodies and mitochondria). There are 2 functional
different classes: thiolase-I (3-ketoacyl-CoA thiolase)
and thiolase-II (acetoacetyl-CoA thiolase). Thiolase-I
can cleave longer fatty acid molecules and plays an
important role in the beta-oxidative degradation of
fatty acids. Thiolase-II has a high substrate
specificity. Although it can cleave acetoacyl-CoA, its
main function is the synthesis of acetoacyl-CoA from two
molecules of acetyl-CoA, which gives it importance in
several biosynthetic pathways.
Length = 386
Score = 33.2 bits (77), Expect = 0.31
Identities = 13/40 (32%), Positives = 20/40 (50%), Gaps = 4/40 (10%)
Query: 175 PSEV----IDTACSSSLIAINRAIESLHLKNCDLALAGGV 210
P V ++ C S L A+ A +S+ D+ +AGGV
Sbjct: 72 PESVPATTVNRVCGSGLQAVALAAQSIAAGEADVVVAGGV 111
>gnl|CDD|236388 PRK09133, PRK09133, hypothetical protein; Provisional.
Length = 472
Score = 32.3 bits (74), Expect = 0.68
Identities = 16/50 (32%), Positives = 21/50 (42%), Gaps = 6/50 (12%)
Query: 241 ANGYVRSEGVGVILLKPLKNAIIDNDHIYGIFRGNFENHGGHSSSPTSPN 290
G + +G KP+ + + Y FR N GGHSS PT N
Sbjct: 204 GGGTLDEDG------KPVLLTVQAGEKTYADFRLEVTNPGGHSSRPTKDN 247
>gnl|CDD|181625 PRK09051, PRK09051, beta-ketothiolase; Provisional.
Length = 394
Score = 32.2 bits (74), Expect = 0.69
Identities = 11/28 (39%), Positives = 17/28 (60%)
Query: 183 CSSSLIAINRAIESLHLKNCDLALAGGV 210
C S L AI A +++ L + D+A+ GG
Sbjct: 90 CGSGLQAIVSAAQAILLGDADVAIGGGA 117
>gnl|CDD|132846 cd07207, Pat_ExoU_VipD_like, ExoU and VipD-like proteins; homologus
to patatin, cPLA2, and iPLA2. ExoU, a 74-kDa enzyme, is
a potent virulence factor of Pseudomonas aeruginosa. One
of the pathogenic mechanisms of P. aeruginosa is to
induce cytotoxicity by the injection of effector
proteins (e.g. ExoU) using the type III secretion (T3S)
system. ExoU is homologus to patatin and also has the
conserved catalytic residues of mammalian
calcium-independent (iPLA2) and cytosolic (cPLA2) PLA2.
In vitro, ExoU cytotoxity is blocked by the inhibitor of
cytosolic and Ca2-independent phospholipase A2 (cPLA2
and iPLA2) enzymes, suggesting that phospholipase A2
inhibitors may represent a novel mode of treatment for
acute P. aeruginosa infections. ExoU requires eukaryotic
superoxide dismutase as a cofactor and cleaves
phosphatidylcholine and phosphatidylethanolamine in
vitro. VipD, a 69-kDa cytosolic protein, belongs to the
members of Legionella pneumophila family and is
homologus to ExoU from Pseudomonas. Even though VipD
shows high sequence similarity with several functional
regions of ExoU (e.g. oxyanion hole, active site serine,
active site aspartate), it has been shown to have no
phospholipase activity. This family includes ExoU from
Pseudomonas aeruginosa and VipD of Legionella
pneumophila.
Length = 194
Score = 31.5 bits (72), Expect = 0.74
Identities = 23/77 (29%), Positives = 31/77 (40%), Gaps = 13/77 (16%)
Query: 109 WAALEDAGYPPSKLSGSKT-AIFA-----GVSTADYKDILNE-------ARHKGLVKSLA 155
ALE+AG +++G+ AI A G S AD KDIL E GL+ L
Sbjct: 18 LKALEEAGILKKRVAGTSAGAITAALLALGYSAADIKDILKETDFAKLLDSPVGLLFLLP 77
Query: 156 EPFPFMIANRVSYLFNF 172
F + L +
Sbjct: 78 SLFKEGGLYKGDALEEW 94
>gnl|CDD|235715 PRK06147, PRK06147, 3-oxoacyl-(acyl carrier protein) synthase;
Validated.
Length = 348
Score = 31.9 bits (73), Expect = 0.78
Identities = 33/181 (18%), Positives = 58/181 (32%), Gaps = 25/181 (13%)
Query: 110 AALEDA--GYPPSKLSGSKTAIFAGVSTADYKDILNEARHKGLVKSLAEPFPFMIANRVS 167
A+ +A G P S+ + V+ + + L E + R+
Sbjct: 71 PAIAEALEGLPAL--DASEAPLLLCVAEEERPGRPPD---------LEERLLRELEARLG 119
Query: 168 YLFNFHGPSEVIDTACSSSLIAINRAIESLHLKNCDLALAGGVN-ILASPNITIASSKAG 226
S VI S +A+ +A + C L GV+ +L P + ++
Sbjct: 120 --LRLEPGSAVIARGRVSGAVALAQARRLIAAGGCPRVLVAGVDSLLTGPTLAHYEARDR 177
Query: 227 LLSENGRCMTFDQRANGYVRSEGVGVILLKPLKNAIIDNDHIYGIFRGNFENH-GGHSSS 285
LL+ Q +NG++ E +LL + G+ G G
Sbjct: 178 LLTS--------QNSNGFIPGEAAAAVLLGRPAGGEAPGLPLLGLGLGREPAPVGESEDL 229
Query: 286 P 286
P
Sbjct: 230 P 230
>gnl|CDD|132849 cd07210, Pat_hypo_W_succinogenes_WS1459_like, Hypothetical patatin
similar to WS1459 of Wolinella succinogenes.
Patatin-like phospholipase. This family predominantly
consists of bacterial patatin glycoproteins. The patatin
protein accounts for up to 40% of the total soluble
protein in potato tubers. Patatin is a storage protein,
but it also has the enzymatic activity of a lipid acyl
hydrolase, catalyzing the cleavage of fatty acids from
membrane lipids. Members of this family have also been
found in vertebrates.
Length = 221
Score = 31.5 bits (72), Expect = 0.86
Identities = 13/41 (31%), Positives = 21/41 (51%), Gaps = 6/41 (14%)
Query: 108 TWAALEDAGYPPSKLSGSKT-----AIFA-GVSTADYKDIL 142
AAL + G PS +SG+ +FA G+S + ++L
Sbjct: 18 FLAALLEMGLEPSAISGTSAGALVGGLFASGISPDEMAELL 58
>gnl|CDD|219836 pfam08429, PLU-1, PLU-1-like protein. Sequences in this family
bear similarity to the central region of PLU-1. This is
a nuclear protein that may have a role in DNA-binding
and transcription, and is closely associated with the
malignant phenotype of breast cancer. This region is
found in various other Jumonji/ARID domain-containing
proteins (see pfam02373, pfam01388).
Length = 335
Score = 31.1 bits (71), Expect = 1.5
Identities = 23/72 (31%), Positives = 31/72 (43%), Gaps = 14/72 (19%)
Query: 465 MLSAKTKKSLKEYVILLLEFIIKEKNNFSLCDLAYTLQVGREAMKYRLAIYVNSYEDLIK 524
+LS + K SLKE LL E EK F L DL L ++ V E ++
Sbjct: 10 LLSEEPKPSLKELRTLLSEG---EKIKFPLPDLLERL---KDF--------VQEAESWVE 55
Query: 525 KLQDYLNKKITN 536
K Q L++K
Sbjct: 56 KAQQLLSRKQQT 67
>gnl|CDD|240424 PTZ00455, PTZ00455, 3-ketoacyl-CoA thiolase; Provisional.
Length = 438
Score = 31.0 bits (70), Expect = 1.7
Identities = 16/52 (30%), Positives = 27/52 (51%), Gaps = 5/52 (9%)
Query: 175 PSEVIDTACSSSLIAINRAIESLHLKNCDLALAGGVNILASPNITIASSKAG 226
P+ ++ AC+S +A+ A E+L D+AL GV + T S++ G
Sbjct: 112 PAMRVEGACASGGLAVQSAWEALLAGTSDIALVVGVEVQ-----TTVSARVG 158
>gnl|CDD|222937 PHA02864, PHA02864, hypothetical protein; Provisional.
Length = 240
Score = 30.6 bits (69), Expect = 2.0
Identities = 28/107 (26%), Positives = 46/107 (42%), Gaps = 7/107 (6%)
Query: 437 NVHVIIEEYRKNIKTKYIKENTVLVRIIMLSAKTKKSLKEYVILLLEFIIKEKNNFSLCD 496
NV II++ N+ K N + II+ S+ + + + + K+ NN C
Sbjct: 85 NVCKIIKDENNNLLIKSNYLNKKINYIILDKVFKNHSIDDIIYMYFNWR-KKYNNIVSCG 143
Query: 497 LAYTLQVGREAMKYRLAIYVNSYEDLIKKLQDY-LNKKITNGIYTNF 542
+V +E MKY Y+D+ K + ++ LN K IY F
Sbjct: 144 -----KVFKELMKYDEIAKKQYYKDIHKDINNFKLNNKYKINIYEKF 185
>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
(vWA) domain [General function prediction only].
Length = 4600
Score = 30.7 bits (69), Expect = 2.7
Identities = 12/76 (15%), Positives = 22/76 (28%), Gaps = 19/76 (25%)
Query: 444 EYRKNIKTKYIKENTVLVRIIMLS----------AKTKKSLKEYVILLLEFIIKEKNNFS 493
E + + I + L R I LS K KK + ++ + I + ++
Sbjct: 3133 EVSLRNEDQLITKLINLWRKIELSKWGNLYRGEFRKGKKLNMKRLVPYIASIFR-RDFIW 3191
Query: 494 --------LCDLAYTL 501
L
Sbjct: 3192 MRKFTQRNNDKKESVL 3207
>gnl|CDD|190397 pfam02713, DUF220, Domain of unknown function DUF220. This is
family consists of a region in several Arabidopsis
thaliana hypothetical proteins none of which have any
known function. The aligned region contains two cysteine
residues.
Length = 74
Score = 27.8 bits (62), Expect = 2.9
Identities = 15/55 (27%), Positives = 27/55 (49%), Gaps = 15/55 (27%)
Query: 438 VHVIIEEYRKNIKTKYIKENTVLVRIIMLSAKTK---------------KSLKEY 477
+H+I++E RK++ KY KE + +++ S K + KS +EY
Sbjct: 7 IHLIVDENRKDLTAKYKKEKMMFMKVFEGSWKVEPLYVDSERLCKQRKPKSREEY 61
>gnl|CDD|234141 TIGR03185, DNA_S_dndD, DNA sulfur modification protein DndD. This
model describes the DndB protein encoded by an operon
associated with a sulfur-containing modification to DNA.
The operon is sporadically distributed in bacteria, much
like some restriction enzyme operons. DndD is described
as a putative ATPase. The small number of examples known
so far include species from among the Firmicutes,
Actinomycetes, Proteobacteria, and Cyanobacteria [DNA
metabolism, Restriction/modification].
Length = 650
Score = 30.4 bits (69), Expect = 3.2
Identities = 19/44 (43%), Positives = 24/44 (54%), Gaps = 2/44 (4%)
Query: 442 IEEYRKNIKTKY-IKENTV-LVRIIMLSAKTKKSLKEYVILLLE 483
IE RK + K K N L R I ++ K KK+LKE+ LLE
Sbjct: 458 IEALRKTLDEKTKQKINAFELERAITIADKAKKTLKEFREKLLE 501
>gnl|CDD|132880 cd06561, AlkD_like, A new structural DNA glycosylase. This domain
represents a new and uncharacterized structural
superfamily of DNA glycosylases that form an alpha-alpha
superhelix fold that are not belong to the identified
five structural DNA glycosylase superfamilies (UDG,
AAG/MNPG, MutM/Fpg and helix-hairpin-helix). DNA
glycosylases removing alkylated base residues have been
identified in all organisms investigated and may be
universally present in nature. DNA glycosylases catalyze
the first step in Base Excision Repair (BER) pathway by
cleaving damaged DNA bases within double strand DNA to
produce an abasic site. The resulting abasic site is
further processed by AP endonuclease, phosphodiesterase,
DNA polymerases, and DNA ligase functions to restore the
DNA to an undamaged state. All glycosylase examined to
date utilize a similar strategy for binding DNA and base
flipping despite their structural diversity.
Length = 197
Score = 29.2 bits (66), Expect = 3.8
Identities = 15/62 (24%), Positives = 24/62 (38%), Gaps = 8/62 (12%)
Query: 444 EYRKNIKTKYI--------KENTVLVRIIMLSAKTKKSLKEYVILLLEFIIKEKNNFSLC 495
E++K K + E + + L KK LKE + E I+ +N+ L
Sbjct: 30 EFKKEDKLEEDHELAEALWHEEIREAQYLALDLLDKKELKEEDLERFEPWIEYIDNWDLV 89
Query: 496 DL 497
D
Sbjct: 90 DS 91
>gnl|CDD|218454 pfam05132, RNA_pol_Rpc4, RNA polymerase III RPC4. Specific
subunit for Pol III, the tRNA specific polymerase.
Length = 131
Score = 28.8 bits (65), Expect = 3.9
Identities = 7/16 (43%), Positives = 10/16 (62%)
Query: 65 SGKTKVKWGGFLYDAD 80
SGK K+K G ++D
Sbjct: 77 SGKVKLKLGDVVFDVS 92
>gnl|CDD|227198 COG4861, COG4861, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 345
Score = 29.9 bits (67), Expect = 3.9
Identities = 15/48 (31%), Positives = 20/48 (41%), Gaps = 3/48 (6%)
Query: 203 DLALAGGVNILASPNITIASSKAGLLSENGRCMTFDQRANGYVRSEGV 250
LA AG ++ P + S A L E G D N Y+R G+
Sbjct: 71 TLAGAGSPLLVVGPRLH--PSSAETLRERG-LWFIDGAGNAYLRHPGL 115
>gnl|CDD|204301 pfam09735, Nckap1, Membrane-associated apoptosis protein.
Expression of this protein was found to be markedly
reduced in patients with Alzheimer's disease. It is
involved in the regulation of actin polymerisation in
the brain as part of a WAVE2 signalling complex.
Length = 1118
Score = 29.7 bits (67), Expect = 5.0
Identities = 14/55 (25%), Positives = 25/55 (45%), Gaps = 1/55 (1%)
Query: 478 VILLLEFIIKEKNNFSLCDLAYTLQVG-REAMKYRLAIYVNSYEDLIKKLQDYLN 531
+++LL I K L + A+ +Q G + RL + Y+ +KKL +
Sbjct: 134 LMILLSRIEDRKAVLGLYNAAHEMQHGQSDCSFPRLGQMILDYDPPLKKLHEEFV 188
>gnl|CDD|191183 pfam05066, RNA_pol_delta, DNA-directed RNA polymerase delta
subunit. The delta protein is a dispensable subunit of
Bacillus subtilis RNA polymerase (RNAP) that has major
effects on the biochemical properties of the purified
enzyme. In the presence of delta, RNAP displays an
increased specificity of transcription, a decreased
affinity for nucleic acids, and an increased efficiency
of RNA synthesis because of enhanced recycling. The
delta protein, contains two distinct regions, an
N-terminal domain and a glutamate and aspartate
residue-rich carboxyl-terminal region.
Length = 91
Score = 27.6 bits (62), Expect = 5.2
Identities = 14/54 (25%), Positives = 27/54 (50%), Gaps = 10/54 (18%)
Query: 481 LLEFIIKEKNNFSLCDLAYT-LQVGREAMKYRLAIYVNSYEDLIKKLQDYLNKK 533
L +F +EK+ SL ++AY L+ + M +++DL+ ++Q L
Sbjct: 2 LKQFTKEEKDELSLIEVAYEILKEKGKPM---------TFDDLVNEIQKLLGIS 46
>gnl|CDD|221815 pfam12864, DUF3822, Protein of unknown function (DUF3822). This is
a family of uncharacterized bacterial proteins. However,
structural-similarity searches indicate the family takes
on an actin-like ATPase fold.
Length = 248
Score = 29.2 bits (66), Expect = 5.6
Identities = 16/100 (16%), Positives = 35/100 (35%), Gaps = 19/100 (19%)
Query: 452 KYIKENTVLVRIIMLSAKTKKSLKEYVIL---LLEFIIKEKNNFSLC---------DLAY 499
++I + + L+ ++ ++ K YV + + E L D Y
Sbjct: 147 EFIHQASPLLEYLLALSRNGTEKKLYVHFEKESFDLFVFENGKLLLANSFEYKTEEDFLY 206
Query: 500 -------TLQVGREAMKYRLAIYVNSYEDLIKKLQDYLNK 532
L + E + L + E+L ++L+ Y+
Sbjct: 207 YLLFVWEQLGLDPEEDELHLTGEITEDEELYEELRKYIRN 246
>gnl|CDD|172759 PRK14271, PRK14271, phosphate ABC transporter ATP-binding protein;
Provisional.
Length = 276
Score = 28.9 bits (64), Expect = 5.8
Identities = 16/49 (32%), Positives = 24/49 (48%), Gaps = 3/49 (6%)
Query: 121 KLSG---SKTAIFAGVSTADYKDILNEARHKGLVKSLAEPFPFMIANRV 166
K+SG S + G S +Y+D+L R G++ PFP I + V
Sbjct: 73 KVSGYRYSGDVLLGGRSIFNYRDVLEFRRRVGMLFQRPNPFPMSIMDNV 121
>gnl|CDD|235534 PRK05618, PRK05618, 50S ribosomal protein L25/general stress
protein Ctc; Reviewed.
Length = 197
Score = 28.6 bits (65), Expect = 6.0
Identities = 10/28 (35%), Positives = 17/28 (60%), Gaps = 2/28 (7%)
Query: 195 ESLHLKNCDLALAGGVNILASPNITIAS 222
+S+H+ DL L GV +L P+ +A+
Sbjct: 156 DSIHVS--DLKLPEGVKLLDDPDEVVAT 181
>gnl|CDD|173163 PRK14700, PRK14700, recombination factor protein RarA; Provisional.
Length = 300
Score = 29.2 bits (65), Expect = 6.2
Identities = 24/62 (38%), Positives = 34/62 (54%), Gaps = 8/62 (12%)
Query: 98 DPQ-LRLFIETTWAALEDAGYPPSKLSGSKTAIFAGV---STADYKDILNEARHKGLVKS 153
DPQ LR+ ++ W A E G P +L ++ AI+ V S A YK + A+ + LVKS
Sbjct: 178 DPQALRVAMDA-WNAYEKLGMPEGRLVLAQAAIYLAVAPKSNACYKAL---AQAQQLVKS 233
Query: 154 LA 155
L
Sbjct: 234 LG 235
>gnl|CDD|180563 PRK06445, PRK06445, acetyl-CoA acetyltransferase; Provisional.
Length = 394
Score = 28.9 bits (65), Expect = 7.2
Identities = 11/36 (30%), Positives = 19/36 (52%)
Query: 175 PSEVIDTACSSSLIAINRAIESLHLKNCDLALAGGV 210
P+ +D C+SSL ++ + D+ +AGGV
Sbjct: 87 PAMAVDRQCASSLTTVSIGAMEIATGMADIVIAGGV 122
>gnl|CDD|131499 TIGR02446, FadI, fatty oxidation complex, beta subunit FadI. This
subunit of the FadJI complex has acetyl-CoA
C-acyltransferase (EC 2.3.1.16) activity, and is also
known as beta-ketothiolase and fatty oxidation complex,
beta subunit, and YfcY. This protein is almost always
located adjacent to FadJ (TIGR02440). The FadJI complex
is needed for anaerobic beta-oxidation of short-chain
fatty acids in E. coli [Fatty acid and phospholipid
metabolism, Degradation].
Length = 430
Score = 28.8 bits (64), Expect = 7.6
Identities = 16/70 (22%), Positives = 31/70 (44%)
Query: 171 NFHGPSEVIDTACSSSLIAINRAIESLHLKNCDLALAGGVNILASPNITIASSKAGLLSE 230
N H + + AC++S + ES+ D+ +AGG + + I ++ A L +
Sbjct: 81 NVHTDAYSVTRACATSFQSAVNVAESIMAGAIDIGIAGGADSSSVLPIGVSKKLAASLVD 140
Query: 231 NGRCMTFDQR 240
+ T Q+
Sbjct: 141 LNKARTLGQK 150
>gnl|CDD|130343 TIGR01276, thiB, thiamine ABC transporter, periplasmic binding
protein. This model finds the thiamine (and thiamine
pyrophosphate) ABC transporter periplasmic binding
protein ThiB in proteobacteria. Completed genomes having
this protein (E. coli, Vibrio cholera, Haemophilus
influenzae) also have the permease ThiP, described by
TIGRFAMs equivalog model TIGR01253 [Transport and
binding proteins, Other].
Length = 309
Score = 28.5 bits (63), Expect = 8.4
Identities = 14/46 (30%), Positives = 23/46 (50%), Gaps = 1/46 (2%)
Query: 36 KLENNEDAITEVPTSRWDWKAIYGDPHLES-GKTKVKWGGFLYDAD 80
KL+N ++ E+ S +W+ IY DP + G + W +Y D
Sbjct: 115 KLKNPPQSLKELVESDQNWRVIYQDPRTSTPGLGLLLWMQKVYGDD 160
>gnl|CDD|181242 PRK08131, PRK08131, acetyl-CoA acetyltransferase; Provisional.
Length = 401
Score = 28.6 bits (64), Expect = 9.0
Identities = 23/91 (25%), Positives = 33/91 (36%), Gaps = 28/91 (30%)
Query: 132 GVSTADYKDIL------------NEARHKGLVKSLAEPFPFMIANRVSYLFNFHGPSEVI 179
G D +D++ N AR+ L+ L P NR+
Sbjct: 42 GFPGDDIEDVILGCTNQAGEDSRNVARNALLLAGLPVTVPGQTVNRL------------- 88
Query: 180 DTACSSSLIAINRAIESLHLKNCDLALAGGV 210
C+S L A+ A ++ DL LAGGV
Sbjct: 89 ---CASGLAAVIDAARAITCGEGDLYLAGGV 116
>gnl|CDD|216490 pfam01420, Methylase_S, Type I restriction modification DNA
specificity domain. This domain is also known as the
target recognition domain (TRD).
Restriction-modification (R-M) systems protect a
bacterial cell against invasion of foreign DNA by
endonucleolytic cleavage of DNA that lacks a site
specific modification. The host genome is protected from
cleavage by methylation of specific nucleotides in the
target sites. In type I systems, both restriction and
modification activities are present in one heteromeric
enzyme complex composed of one DNA specificity subunit
(this family), two modification (M) subunits and two
restriction (R) subunits.
Length = 167
Score = 27.7 bits (62), Expect = 9.9
Identities = 12/81 (14%), Positives = 29/81 (35%), Gaps = 20/81 (24%)
Query: 433 VGGSNVHVIIEEYRKNIKTKYIKENTVLV--------------------RIIMLSAKTKK 472
+ +++ + IK K N++L+ + +L K +
Sbjct: 35 ITAGDLNNGVIGGVGYIKKKIFPGNSILISSNGSIGYVFYRDKPFFANQDVKVLIPKNNE 94
Query: 473 SLKEYVILLLEFIIKEKNNFS 493
L +++ L L+ I+K+
Sbjct: 95 LLNKFLYLFLKTILKKLKKLK 115
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.318 0.137 0.404
Gapped
Lambda K H
0.267 0.0940 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 28,412,289
Number of extensions: 2854580
Number of successful extensions: 2819
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2720
Number of HSP's successfully gapped: 83
Length of query: 545
Length of database: 10,937,602
Length adjustment: 102
Effective length of query: 443
Effective length of database: 6,413,494
Effective search space: 2841177842
Effective search space used: 2841177842
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 61 (26.9 bits)