RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= 007699
(592 letters)
>gnl|CDD|215244 PLN02445, PLN02445, anthranilate synthase component I.
Length = 523
Score = 1090 bits (2820), Expect = 0.0
Identities = 422/523 (80%), Positives = 466/523 (89%), Gaps = 7/523 (1%)
Query: 77 FSEASKRANLVPLYRCIFSDHLTPVVAYRCLVQEDDREAPSFLFESVEPGVRVSNVGRYS 136
F EA+K NLVPLYR IFSDHLTPV+AYRCLV+EDDREAPSFLFESVEPG + SNVGRYS
Sbjct: 1 FKEAAKGGNLVPLYRRIFSDHLTPVLAYRCLVKEDDREAPSFLFESVEPGSQSSNVGRYS 60
Query: 137 VVGAQPVMEVIVKDNNVTIMDHEKGSLVEEVVDDPMEIPRKISEDWKPQIIDELPEAFCG 196
VVGAQP ME++ K+N VTIMDHEKG+ EE+V+DPMEIPR+ISE W PQ+ID LP+ FCG
Sbjct: 61 VVGAQPAMEIVAKENKVTIMDHEKGTRTEEIVEDPMEIPRRISEKWNPQLIDGLPDVFCG 120
Query: 197 GWVGYFSYDTVRYVEKKKLPFSKAPHDDRSLADIHLGLYNDVLVFDHVEKKVYVIHWVRL 256
GWVGYFSYDTVRYVEKKKLPFS AP DDR+L DIHLGLY+DV+VFDHVEKK YVIHWVRL
Sbjct: 121 GWVGYFSYDTVRYVEKKKLPFSGAPEDDRNLPDIHLGLYDDVIVFDHVEKKAYVIHWVRL 180
Query: 257 DQHSSVQKAYAEGLEHLEKLVAR-------KVITRSIDLHTHHFGPPLKKSNMTSEAYKN 309
D++SSV++AY +G++ LE LV+R K+ S+ L T+ FGP L+KSNMTSE YKN
Sbjct: 181 DRYSSVEEAYEDGMKRLEALVSRLQDINPPKLSPGSVKLSTNQFGPSLEKSNMTSEEYKN 240
Query: 310 AVLEAKEHIQAGDIFQIVLSQRFERRTFADPFEVYRALRVVNPSPYMTYLQARGCILVAS 369
AVL+AKEHI AGDIFQIVLSQRFERRTFADPFEVYRALR+VNPSPYM YLQARGCILVAS
Sbjct: 241 AVLQAKEHILAGDIFQIVLSQRFERRTFADPFEVYRALRIVNPSPYMIYLQARGCILVAS 300
Query: 370 SPEILTRVKKNKIVNRPLAGTVRRGRTTEEDEMLETQLLKDAKQCAEHVMLVDLGRNDVG 429
SPEILTRVKKNKIVNRPLAGT RRG+T EED+ LE LL D KQCAEH+MLVDLGRNDVG
Sbjct: 301 SPEILTRVKKNKIVNRPLAGTRRRGKTPEEDKALEKDLLADEKQCAEHIMLVDLGRNDVG 360
Query: 430 KVARSGSVKVEKLMNVERYSHVMHISSTITGELQDRLSCWDALRAALPVGTVSGAPKVKA 489
KV+++GSVKVEKLMN+ERYSHVMHISST+TGEL D L+ WDALRAALPVGTVSGAPKV+A
Sbjct: 361 KVSKAGSVKVEKLMNIERYSHVMHISSTVTGELLDHLTSWDALRAALPVGTVSGAPKVRA 420
Query: 490 MELIDELEVNRRGPYSGGFGGVSFTGDMDIALALRTMVFQTGTRYDTMYSYKDARKRREW 549
MELIDELEV RRGPYSGGFGGVSFTGDMDIALALRTMVF T RYDTMYSYKD RREW
Sbjct: 421 MELIDELEVTRRGPYSGGFGGVSFTGDMDIALALRTMVFPTAARYDTMYSYKDTNSRREW 480
Query: 550 VAYLQAGAGIVADSDPDDEHRECQNKAAGLARAIDLAESAFIK 592
VA+LQAGAGIVADSDP+DE+REC NKAAGLARAIDLAESAF+K
Sbjct: 481 VAHLQAGAGIVADSDPEDEYRECVNKAAGLARAIDLAESAFVK 523
>gnl|CDD|233026 TIGR00564, trpE_most, anthranilate synthase component I,
non-proteobacterial lineages. This enzyme resembles
some other chorismate-binding enzymes, including
para-aminobenzoate synthase (pabB) and isochorismate
synthase. There is a fairly deep split between two sets,
seen in the pattern of gaps as well as in amino acid
sequence differences. Archaeal enzymes have been
excluded from this model (and are now found in
TIGR01820) as have a clade of enzymes which constitute a
TrpE paralog which may have PabB activity (TIGR01824).
This allows the B. subtilus paralog which has been shown
to have PabB activity to score below trusted to this
model. This model contains sequences from gram-positive
bacteria, certain proteobacteria, cyanobacteria, plants,
fungi and assorted other bacteria.A second family of
TrpE enzymes is modelled by TIGR00565. The breaking of
the TrpE family into these diverse models allows for the
separation of the models for the related enzyme, PabB
[Amino acid biosynthesis, Aromatic amino acid family].
Length = 454
Score = 613 bits (1583), Expect = 0.0
Identities = 233/493 (47%), Positives = 291/493 (59%), Gaps = 42/493 (8%)
Query: 95 SDHLTPVVAYRCLVQEDDREAPSFLFESVEPGVRVSNVGRYSVVGAQPVMEVIVKDNNVT 154
+D LTP+ AY L Q SFL ES EPG S GRYS +G PV+ + +
Sbjct: 1 ADTLTPISAYLKLAQP-----GSFLLESAEPG---SERGRYSFIGLNPVLTIKTEGGTEY 52
Query: 155 IMDHEKGSLVEEVVDDPMEIPRKISEDWKPQIIDELPEAFCGGWVGYFSYDTVRYVEKKK 214
+V +P++ R + E + ELP F GG VGY YDTVR EK
Sbjct: 53 ---LGADDRRSGIVGNPLDEIRDVMETFAQHSDPELPIPFTGGAVGYLGYDTVRLFEKIT 109
Query: 215 LPFSKAPHDDRSLADIHLGLYNDVLVFDHVEKKVYVIHWVRLDQHSSVQKAYAEGLEHLE 274
LP P D +L D +L L +D ++FDHV K+Y+IH R S A A LE
Sbjct: 110 LP----PPDPLNLPDAYLMLCDDFIIFDHVTDKLYLIHNNRTTASRS---AKAAADARLE 162
Query: 275 KLVARKVITRSIDLHTHHFGPPLK---KSNMTSEAYKNAVLEAKEHIQAGDIFQIVLSQR 331
LVA + + L P SN E Y+ V +AKE+I+AGDIFQ+VLSQR
Sbjct: 163 ALVAD--LQDPL-LPEVPVPYPAALSFTSNYEKEEYEANVAKAKEYIKAGDIFQVVLSQR 219
Query: 332 FERRTFADPFEVYRALRVVNPSPYMTYLQARGCILVASSPEILTRVKKNKIVNRPLAGTV 391
FE +T A PFE+YR LR+VNPSPYM YL +V SSPE+L +V +I RP+AGT
Sbjct: 220 FEAKTEAPPFELYRVLRIVNPSPYMYYLDFGDFQIVGSSPELLVKVTGGRITTRPIAGTR 279
Query: 392 RRGRTTEEDEMLETQLLKDAKQCAEHVMLVDLGRNDVGKVARSGSVKVEKLMNVERYSHV 451
+RG T EEDE L +LL D K+ AEH+MLVDLGRND+G+V GSV+V + M +ERYSHV
Sbjct: 280 KRGATPEEDEALAEELLADEKERAEHLMLVDLGRNDIGRVCEPGSVEVPEFMKIERYSHV 339
Query: 452 MHISSTITGELQDRLSCWDALRAALPVGTVSGAPKVKAMELIDELEVNRRGPYSGGFGGV 511
MHI ST+ G L+D L+ DALRA P GTVSGAPK++AMELIDELE +RG Y G G +
Sbjct: 340 MHIVSTVEGRLKDGLTAIDALRATFPAGTVSGAPKIRAMELIDELEPEKRGIYGGAVGYL 399
Query: 512 SFTGDMDIALALRTMVFQTGTRYDTMYSYKDARKRREWVAYLQAGAGIVADSDPDDEHRE 571
SF GDMD A+A+RTMV ++ AY+QAG GIVADSDP+ E+ E
Sbjct: 400 SFDGDMDTAIAIRTMVV------------------KDGKAYVQAGGGIVADSDPEAEYEE 441
Query: 572 CQNKAAGLARAID 584
NKA L RAI+
Sbjct: 442 TLNKARALLRAIE 454
>gnl|CDD|223225 COG0147, TrpE, Anthranilate/para-aminobenzoate synthases component
I [Amino acid transport and metabolism / Coenzyme
metabolism].
Length = 462
Score = 493 bits (1270), Expect = e-171
Identities = 217/502 (43%), Positives = 286/502 (56%), Gaps = 46/502 (9%)
Query: 88 PLYRCIFSDHLTPVVAYRCLVQEDDREAPSFLFESVEPGVRVSNVGRYSVVGAQPVMEVI 147
P +D TP+ Y L R +FL ES E + GRYS++G P++ +
Sbjct: 1 PSLLSFTADLETPLSLYLKLAASRPR---AFLLESAEIYEKY---GRYSIIGLDPLLRLR 54
Query: 148 VKDNNVTIMDHEKGSLVEEVVDDPMEIPRKISE-DWKPQIIDELPEAFCGGWVGYFSYDT 206
+ V + E+ + +++ DP+E R + E + F GG VGYFSYD
Sbjct: 55 AFGDEVISANGEELAKELDLLADPLEELRSLLEFVAPRAALPNSEPPFQGGLVGYFSYDL 114
Query: 207 VRYVEKKKLPFSKAPHDDRSLADIHLGLYNDVLVFDHVEKKVYVIHWVRLDQHSSVQKAY 266
VRY + D GLY++VLVFDH + K+Y+I
Sbjct: 115 VRYFDLPP----LIAEAPLDFPDALFGLYDEVLVFDHQKGKLYLI--------------- 155
Query: 267 AEGLEHLEKLVARKVITRSIDLHTHHFGPPLK-KSNMTSEAYKNAVLEAKEHIQAGDIFQ 325
A G E LE+L+AR L P + +SN+ EAY+ AV +AKE+I+AGDI+Q
Sbjct: 156 AFGAERLEQLLARL-EDALAPLPEGDPPLPREVQSNLDREAYEEAVRKAKEYIRAGDIYQ 214
Query: 326 IVLSQRFERRTFADPFEVYRALRVVNPSPYMTYLQARGCILVASSPEILTRVKKNKIVNR 385
+VLS+RFE DP +YR LR NPSPYM +L+ LV +SPE+ +V N+I R
Sbjct: 215 VVLSRRFEAPCDGDPLALYRRLRQRNPSPYMFFLRLGDFTLVGASPELFVKVDGNRIETR 274
Query: 386 PLAGTVRRGRTTEEDEMLETQLLKDAKQCAEHVMLVDLGRNDVGKVARSGSVKVEKLMNV 445
P+AGT RG EEDE LE +LL D K+ AEH+MLVDL RND+G+V GSVKV +LM V
Sbjct: 275 PIAGTRPRGADPEEDEALEAELLNDEKERAEHLMLVDLARNDLGRVCEPGSVKVPELMEV 334
Query: 446 ERYSHVMHISSTITGELQDRLSCWDALRAALPVGTVSGAPKVKAMELIDELEVNRRGPYS 505
ERYSHVMH+ ST+TG L+ L DALRA P GTV+GAPKV+AME+I+ELE + RG Y
Sbjct: 335 ERYSHVMHLVSTVTGRLKPGLDALDALRALFPAGTVTGAPKVRAMEIIEELEPSPRGIYG 394
Query: 506 GGFGGVSFTGDMDIALALRTMVFQTGTRYDTMYSYKDARKRREWVAYLQAGAGIVADSDP 565
G G +SF GD+D A+A+RT + G AY+QAGAGIVADSDP
Sbjct: 395 GAVGYLSFNGDLDFAIAIRTAELKDGR------------------AYVQAGAGIVADSDP 436
Query: 566 DDEHRECQNKAAGLARAIDLAE 587
+ E+ E NKA L RA++LAE
Sbjct: 437 EAEYEETLNKARALLRALELAE 458
>gnl|CDD|237431 PRK13570, PRK13570, anthranilate synthase component I; Provisional.
Length = 455
Score = 464 bits (1196), Expect = e-160
Identities = 206/496 (41%), Positives = 284/496 (57%), Gaps = 49/496 (9%)
Query: 88 PLYRCIFSDHLTPVVAYRCLVQEDDREAPSFLFESVEPGVRVSNVGRYSVVGAQPVMEVI 147
+ + I D LTP+ AY L + FL ES+ R GRYS++ PV E+
Sbjct: 5 RVIKEINGDTLTPISAYMRLKGKH-----KFLLESIP---RDKEKGRYSIIAYNPVFEIK 56
Query: 148 VKDNNVTIMDHEKGSLVEEVVDDPMEIPRKISEDWKPQIIDELPEAFCGGWVGYFSYDTV 207
+ I + EK + DP++ ++ K Q+ ELP FCGG +GY YD +
Sbjct: 57 SYGGELYIGNGEK------IDGDPLDFLEEVIV--KSQVDSELP--FCGGAIGYVGYDVI 106
Query: 208 RYVEKKKLPFSKAPHDDRSLADIHLGLYNDVLVFDHVEKKVYVIHWVRLDQHSS--VQKA 265
R E + P D + D+H LY +++DH ++K+ ++ R S ++KA
Sbjct: 107 RLYEN--IG--DIPEDTIGIPDMHFFLYESFIIYDHKKEKLIFVYDNRYSDRSEEELEKA 162
Query: 266 YAEGLEHLEKLVARKVITRSIDLHTHHFGPPLKKSNMTSEAYKNAVLEAKEHIQAGDIFQ 325
LE L++ + I+L F KSN+T E + V +AKE+I+AGDIFQ
Sbjct: 163 LNVVLEELKQPA--EAEHELIELSKLSF-----KSNITKEEFCGMVEKAKEYIRAGDIFQ 215
Query: 326 IVLSQRFERRTFADPFEVYRALRVVNPSPYMTYLQARGCILVASSPEILTRVKKNKIVNR 385
+VLSQR DPF+ YR LRV NPSPY+ Y+ ++ SSPE L VK +K+
Sbjct: 216 VVLSQRLSAEFTGDPFDYYRKLRVTNPSPYLYYIDFGDYQVIGSSPESLVSVKGDKVTTN 275
Query: 386 PLAGTVRRGRTTEEDEMLETQLLKDAKQCAEHVMLVDLGRNDVGKVARSGSVKVEKLMNV 445
P+AGT RG+T EEDE L +LL D K+ AEH MLVDLGRND+GK++ +GSVKV K M V
Sbjct: 276 PIAGTRPRGKTKEEDEALAKELLSDEKERAEHRMLVDLGRNDIGKISETGSVKVTKYMEV 335
Query: 446 ERYSHVMHISSTITGELQDRLSCWDALRAALPVGTVSGAPKVKAMELIDELEVNRRGPYS 505
E+Y HVMH+ S ++G L+ L+ +DAL+A LP GTVSGAPK++AME I ELE +RG Y+
Sbjct: 336 EKYRHVMHLVSEVSGTLRPGLTAFDALKATLPAGTVSGAPKIRAMERIYELENEKRGVYA 395
Query: 506 GGFGGVSFTGDMDIALALRTMVFQTGTRYDTMYSYKDARKRREWVAYLQAGAGIVADSDP 565
G G +S G+MD A+A+RTMV + G AY+QAGAGIV DSDP
Sbjct: 396 GAVGYLSANGNMDFAIAIRTMVLKNGK------------------AYVQAGAGIVYDSDP 437
Query: 566 DDEHRECQNKAAGLAR 581
++E++E NKA L
Sbjct: 438 ENEYQETLNKAKALLE 453
>gnl|CDD|184146 PRK13565, PRK13565, anthranilate synthase component I; Provisional.
Length = 490
Score = 454 bits (1170), Expect = e-155
Identities = 221/508 (43%), Positives = 284/508 (55%), Gaps = 44/508 (8%)
Query: 85 NLVPLYRCIFSDHLTPVVAYRCLVQEDDREAP-SFLFESVEPGVRVSNVGRYSVVG--AQ 141
N +PL +D TP+ Y L AP S+L ESV G R GRYS +G A+
Sbjct: 15 NRIPLVAEALADLDTPLSLYLKLAD-----APYSYLLESVVGGERF---GRYSFIGLPAR 66
Query: 142 PVMEVIVKDNNVTIMDHEKGSLVEEV-VDDPMEIPRKISEDWKPQIIDELPEAFCGGWVG 200
V+ V V G +VE V DP+ +K + LP FCGG VG
Sbjct: 67 TVLRVRGHTVEVV----TDGQVVETHDVGDPLAFIEAFQARFKVALRPGLPR-FCGGLVG 121
Query: 201 YFSYDTVRYVEKKKLPFSKAPHDDRSLADIHLGLYNDVLVFDHVEKKVYVIHWVRLDQHS 260
YF YDTVRY+E + A D DI L L ++ V D++ K+Y+I V D
Sbjct: 122 YFGYDTVRYIEPRLAN--TAKPDPLGTPDILLLLSEELAVIDNLSGKLYLI--VYAD--P 175
Query: 261 SVQKAYAEGLEHLEKLVARKVITRSIDL-HTHHFGPPLKKSNMTSEAYKNAVLEAKEHIQ 319
+ +AY + L +L AR + + + T S T E Y AV +AKE+I
Sbjct: 176 AQPEAYERAKQRLRELRAR--LRQPVAPPVTSASSRTEFVSEFTKEDYLAAVRKAKEYIA 233
Query: 320 AGDIFQIVLSQRFERRTFADPFEVYRALRVVNPSPYMTYLQARGCILVASSPEILTRVKK 379
AGD Q+V SQR + A P +YRALR +NPSPYM + +V SSPEIL R +
Sbjct: 234 AGDCMQVVPSQRLSKPFRASPLSLYRALRSLNPSPYMYFYNFGDFHVVGSSPEILVRQED 293
Query: 380 NKIVNRPLAGTVRRGRTTEEDEMLETQLLKDAKQCAEHVMLVDLGRNDVGKVARSGSVKV 439
+ RP+AGT RG T EED LET+LL D K+ AEHVML+DLGRNDVG+VA +GSVKV
Sbjct: 294 RIVTVRPIAGTRPRGATPEEDLALETELLADPKEIAEHVMLIDLGRNDVGRVAETGSVKV 353
Query: 440 EKLMNVERYSHVMHISSTITGELQDRLSCWDALRAALPVGTVSGAPKVKAMELIDELEVN 499
+ M +ERYSHVMHI S + G+L+ L+ D LRA P GT+SGAPKV+AME+IDELE
Sbjct: 354 TEKMVIERYSHVMHIVSNVEGKLKPGLTNMDVLRATFPAGTLSGAPKVRAMEIIDELEPV 413
Query: 500 RRGPYSGGFGGVSFTGDMDIALALRTMVFQTGTRYDTMYSYKDARKRREWVAYLQAGAGI 559
+RG Y G G +SF GDMD+A+A+RT V + G Y+QAGAGI
Sbjct: 414 KRGIYGGAVGYLSFNGDMDLAIAIRTAVIKDGN------------------LYVQAGAGI 455
Query: 560 VADSDPDDEHRECQNKAAGLARAIDLAE 587
VADS P+ E +E +NKA + RA + AE
Sbjct: 456 VADSVPELEWQETENKARAVLRAAEQAE 483
>gnl|CDD|233586 TIGR01820, TrpE-arch, anthranilate synthase component I, archaeal
clade. This model represents an archaeal clade of
anthranilate synthase component I enzymes. This enzyme
is responsible for the first step of tryptophan
biosynthesis from chorismate. The Sulfolobus enzyme has
been reported to be part of a gene cluster for Trp
biosynthesis [Amino acid biosynthesis, Aromatic amino
acid family].
Length = 435
Score = 412 bits (1062), Expect = e-140
Identities = 195/488 (39%), Positives = 251/488 (51%), Gaps = 56/488 (11%)
Query: 100 PVVAYRCLVQEDDREAPSFLFESVEPGVRVSNVGRYSVVGAQPVMEVIVKDNNVTIMDHE 159
P+ Y+ + + D +FL ES E + S RYS +G P V + +
Sbjct: 1 PLELYKAIRADGDY---AFLLESAE---KPSKKARYSFIGWDPEFVVRI---------NG 45
Query: 160 KGSLVE--EVVDDPMEIPRKISEDWKPQIIDELPEAFCGGWVGYFSYDTVR-YVEKKKLP 216
KG VE D ++ R K I F GG VGY +YD VR Y E
Sbjct: 46 KGKSVEGIPEDGDVVDKLRNAFPKLKGINIPGEDRRFKGGLVGYIAYDAVRDYWEGIVDL 105
Query: 217 FSKAPHDDRSLADIHLGLYNDVLVFDHVEKKVYVIHWVRLDQHSSVQKAYAEGLEHLEKL 276
KA +Y + +V+DH+E KVY + + E LE++
Sbjct: 106 KRKAE----DWPPAEFFIYPNTIVYDHLEGKVYYV-------------STPEPEAELERI 148
Query: 277 VARKVITRSIDLHTHHFGPPLKKSNMTSEAYKNAVLEAKEHIQAGDIFQIVLSQRFERRT 336
V R R+ D + + E ++ AV EAKE+I AGDIFQ+VLS+ +E R
Sbjct: 149 VER--AKRATDPGEAGVSFEGESLSDREE-FEEAVEEAKEYIFAGDIFQVVLSREYEYRL 205
Query: 337 FADPFEVYRALRVVNPSPYMTYLQARGCILVASSPEILTRVKKNKIVNRPLAGTVRRGRT 396
DPFE+Y LR +NPSPYM L+ LV SSPE L RV+ + P+AGT RG T
Sbjct: 206 DGDPFELYYNLREINPSPYMFLLKFGDRYLVGSSPETLVRVEGRTVETNPIAGTRPRGAT 265
Query: 397 TEEDEMLETQLLKDAKQCAEHVMLVDLGRNDVGKVARSGSVKVEKLMNVERYSHVMHISS 456
EEDE L +LL D K+ AEHVMLVDL RNDV KV+ GSVKV + M VE+YSHV HI S
Sbjct: 266 PEEDERLAKELLSDEKERAEHVMLVDLARNDVRKVSEPGSVKVPEFMYVEKYSHVQHIES 325
Query: 457 TITGELQDRLSCWDALRAALPVGTVSGAPKVKAMELIDELEVNRRGPYSGGFGGVSFTGD 516
T+ G L+ +DALRA P GT+SGAPK++AME+IDELE RG Y GG G S+ GD
Sbjct: 326 TVIGTLKKDYDAFDALRATFPAGTLSGAPKIRAMEIIDELEKEPRGVYGGGVGYFSWNGD 385
Query: 517 MDIALALRTMVFQTGTRYDTMYSYKDARKRREWVAYLQAGAGIVADSDPDDEHRECQNKA 576
D A+A+RT +QAGAGIVADS P+ E E +NK
Sbjct: 386 ADFAIAIRTAEIDKDK------------------LRIQAGAGIVADSIPEREFEETENKM 427
Query: 577 AGLARAID 584
+ +AI
Sbjct: 428 KAVLKAIG 435
>gnl|CDD|184150 PRK13569, PRK13569, anthranilate synthase component I; Provisional.
Length = 506
Score = 411 bits (1058), Expect = e-138
Identities = 197/536 (36%), Positives = 286/536 (53%), Gaps = 50/536 (9%)
Query: 70 LASDASGFSEASKRANLVPLYRCIFSDHLTPVVAYRCLVQEDDREAPSFLFESVEPGVRV 129
+D + F E S +P+ F+D LTP+ L ++ EA +L ES +
Sbjct: 2 SQTDFTSFLEDSNEFRTIPIVETFFADTLTPIQ----LFEKLQDEA-VYLLESKD---DE 53
Query: 130 SNVGRYSVVGAQPVMEVIVKDNNVTIMDHEKGSLVEEVVDDPMEIPRKISEDWKPQIID- 188
S RYS +G P + + ++ + D L E + + + + ++
Sbjct: 54 SPWSRYSFIGLNPFLTLEEENGTFSAKDENGNELAT--APTLKEAFQWMEQTLDVKPLEL 111
Query: 189 ELPEAFCGGWVGYFSYDTVRYVEKKKLPFSKAPH-DDRSLADIHLGLYNDVLVFDHVEKK 247
++P F GG VGY SYD + +EK H D + H ++ +DH K+
Sbjct: 112 DIP--FTGGAVGYLSYDAISLIEK-----VPKHHSRDTEMPTCHFFFCETLIAYDHETKE 164
Query: 248 VYVIHWVRLDQHSSVQK---AYAEGLEHLEKLVARKVITRSIDLHTHHFGPPLKK----- 299
++ IH+VRL+ + ++ Y E +E L+ + + R ++
Sbjct: 165 LHFIHYVRLNGQETEEEKIEKYKEAQAEIETLIEK--LARRKAEKELLLPADSERTVSFE 222
Query: 300 ---SNMTSEAYKNAVLEAKEHIQAGDIFQIVLSQRFERRTFADPFEVYRALRVVNPSPYM 356
SN E + V + KE+I+AGDIFQ VLSQRFE FE+YR LR+VNPSPYM
Sbjct: 223 GVTSNYEKEQFLRDVEKIKEYIKAGDIFQAVLSQRFEIPVSVGGFELYRVLRMVNPSPYM 282
Query: 357 TYLQARGCILVASSPEILTRVKKNKIVNRPLAGTVRRGRTTEEDEMLETQLLKDAKQCAE 416
Y++ +V SSPE L +V + P+AGT RRG EEDE L +LL D K+ AE
Sbjct: 283 FYMKLDDVEIVGSSPERLIQVHNRHLEIHPIAGTRRRGADAEEDERLAKELLADEKERAE 342
Query: 417 HVMLVDLGRNDVGKVARSGSVKVEKLMNVERYSHVMHISSTITGELQDRLSCWDALRAAL 476
H MLVDL RND+G+VA GSV+V L+ + ++SHVMH+ S +TGEL++ + DAL +A
Sbjct: 343 HYMLVDLARNDIGRVAEYGSVEVPVLLEIGKFSHVMHLISKVTGELKEGVHPIDALLSAF 402
Query: 477 PVGTVSGAPKVKAMELIDELEVNRRGPYSGGFGGVSFTGDMDIALALRTMVFQTGTRYDT 536
P GTVSGAPK++AM++++ELE RG Y+G + F G++D +A+RTMV + G
Sbjct: 403 PAGTVSGAPKIRAMQILNELEPTARGTYAGAIAYIGFDGNIDSCIAIRTMVVKDG----- 457
Query: 537 MYSYKDARKRREWVAYLQAGAGIVADSDPDDEHRECQNKAAGLARAIDLAESAFIK 592
VAY+QAGAGIVADS P+ E E +NKA+ L +AI LAE F K
Sbjct: 458 -------------VAYIQAGAGIVADSVPELEWEETRNKASALLKAIQLAERLFAK 500
>gnl|CDD|184152 PRK13571, PRK13571, anthranilate synthase component I; Provisional.
Length = 506
Score = 391 bits (1006), Expect = e-130
Identities = 212/541 (39%), Positives = 285/541 (52%), Gaps = 54/541 (9%)
Query: 62 TATAPATKLASDASGFSEASKRANLVPLYRCIFSDHLTPVVAYRCLVQEDDREAPSFLFE 121
A AT + F + +VP+ R + +D TPV AYR L +FL E
Sbjct: 1 MADGAAT---TSREDFRALAAEHRVVPVTRKVLADSETPVGAYRKLAAN---RPGTFLLE 54
Query: 122 SVEPGVRVSNVGRYSVVGAQPVMEVIVKDNNVTIMDHEKGSLVEEVV--DDPMEIPRKIS 179
S E G S R+S +G + V+D G+ DP+ R
Sbjct: 55 SAENGRSWS---RWSFIGVGSPAALTVRDGEA----VWLGTPPAGAPTGGDPLAALRATL 107
Query: 180 EDWKPQIIDELPEAFCGGWVGYFSYDTVRYVEKKKLPFSKAPHDDRSLADIHLGLYNDVL 239
E + LP GG VG+ YD VR +E+ LP + DD L ++ L L D+
Sbjct: 108 ELLATPRLPGLP-PLTGGMVGFLGYDAVRRLER--LP--ELAVDDLGLPEMLLLLATDLA 162
Query: 240 VFDHVEKKVYVI----HWVRLDQHSSVQKAYAEGLEHLEKLVARKVITRSIDLHTHHFGP 295
DH E + +I +W D+ V AY + + L+ + A + + + F
Sbjct: 163 AVDHHEGTITLIANAVNWNGTDER--VDAAYDDAVARLDVMTAA--LAQPLPSTVATFSR 218
Query: 296 PLK--KSNMTSEAYKNAVLEAKEHIQAGDIFQIVLSQRFERRTFADPFEVYRALRVVNPS 353
P+ ++ T E + AV + E I+AG+ FQ+V SQRFE T ADP +VYR LRV NPS
Sbjct: 219 PVPEFRAQRTVEEFGAAVEKLVEEIRAGEAFQVVPSQRFEMDTTADPLDVYRVLRVTNPS 278
Query: 354 PYMTYLQ------ARGCILVASSPEILTRVKKNKIVNRPLAGTVRRGRTTEEDEMLETQL 407
PYM L+ +V SSPE L V + P+AGT RG T EED +LE +L
Sbjct: 279 PYMYLLRVPNSDGGTDFSIVGSSPEALVTVTDGRATTHPIAGTRWRGATPEEDALLEKEL 338
Query: 408 LKDAKQCAEHVMLVDLGRNDVGKVARSGSVKVEKLMNVERYSHVMHISSTITGELQDRLS 467
L D K+ AEH+MLVDLGRND+G+V R G+V+V ++ERYSHVMH+ ST+TGEL + +
Sbjct: 339 LADPKERAEHLMLVDLGRNDLGRVCRPGTVRVVDFSHIERYSHVMHLVSTVTGELAEGRT 398
Query: 468 CWDALRAALPVGTVSGAPKVKAMELIDELEVNRRGPYSGGFGGVSFTGDMDIALALRTMV 527
DA+ A P GT+SGAPKV+AMELI+ELE RRG Y G G + F GD D A+A+RT +
Sbjct: 399 ALDAVTACFPAGTLSGAPKVRAMELIEELEPTRRGLYGGVVGYLDFAGDADTAIAIRTAL 458
Query: 528 FQTGTRYDTMYSYKDARKRREWVAYLQAGAGIVADSDPDDEHRECQNKAAGLARAIDLAE 587
+ GT AY+QAG G+VADSDPD E E +NKAA + RAI AE
Sbjct: 459 MRDGT------------------AYVQAGGGVVADSDPDYEDNEARNKAAAVLRAIAAAE 500
Query: 588 S 588
+
Sbjct: 501 T 501
>gnl|CDD|184154 PRK13573, PRK13573, anthranilate synthase component I; Provisional.
Length = 503
Score = 372 bits (957), Expect = e-123
Identities = 204/527 (38%), Positives = 275/527 (52%), Gaps = 42/527 (7%)
Query: 70 LASDASGFSEASKRANLVPLYRCIFSDHLTPVVAYRCLVQEDDREAPSFLFESVEPG-VR 128
L D F A +Y + +D TPV L +F+ ESV G VR
Sbjct: 3 LTPDFDAFERAYDAGENQVVYTRLAADLDTPVSLMLKLA---GARKDAFMLESVTGGEVR 59
Query: 129 VSNVGRYSVVGAQP--VMEVIVKDNNVTIMDHEKGSLVEEVVDDPMEIPRKISEDWKPQI 186
GRYS++G +P + + + E + P++ R + + + +
Sbjct: 60 ----GRYSIIGMKPDLIWRCRGQQARINREARFDRDAFEPLEGHPLDSLRALIAESRIDM 115
Query: 187 IDELPEAFCGGWVGYFSYDTVRYVEKKKLPFSKAPHDDRSLADIHLGLYNDVLVFDHVEK 246
+LP A G GY YD +R VE LP D L D L + V V D V+
Sbjct: 116 PADLPPA-AAGLFGYLGYDMIRLVEH--LP--DVNPDPLGLPDAVLMRPSVVAVLDGVKG 170
Query: 247 KVYVIHWVRLDQHSSVQKAYAEGLEHLEKLV---ARKVITRSIDL-HTHHFGPPLKKSNM 302
+V V+ + S + AYA+ E + V R + D G P SN
Sbjct: 171 EVTVVAPAWVSSGLSARAAYAQAAERVMDAVRDLERALPAAQRDFGEAAQVGEP--VSNF 228
Query: 303 TSEAYKNAVLEAKEHIQAGDIFQIVLSQRFERRTFADPFEVYRALRVVNPSPYMTYLQAR 362
T E YK AV +AK++I+AGDIFQ+V SQR+ + PF +YR+LR NPSP+M +
Sbjct: 229 THEGYKAAVEKAKDYIRAGDIFQVVPSQRWAQDFRLPPFALYRSLRRTNPSPFMFFFNFG 288
Query: 363 GCILVASSPEILTRVKKNKIVNRPLAGTVRRGRTTEEDEMLETQLLKDAKQCAEHVMLVD 422
G +V +SPEIL R++ ++ RP+AGT RG T EED LE LL D K+ AEH+ML+D
Sbjct: 289 GFQVVGASPEILVRLRDGEVTIRPIAGTRPRGATPEEDRALEADLLADKKELAEHLMLLD 348
Query: 423 LGRNDVGKVARSGSVKVEKLMNVERYSHVMHISSTITGELQDRLSCWDALRAALPVGTVS 482
LGRNDVG+VA+ G+V+ + +ERYSHVMHI S + GEL + AL A LP GTVS
Sbjct: 349 LGRNDVGRVAKIGTVRPTEKFIIERYSHVMHIVSNVVGELAEGEDALSALLAGLPAGTVS 408
Query: 483 GAPKVKAMELIDELEVNRRGPYSGGFGGVSFTGDMDIALALRTMVFQTGTRYDTMYSYKD 542
GAPKV+AME+IDELE +RG Y GG G + G+MD+ +ALRT V + T
Sbjct: 409 GAPKVRAMEIIDELEPEKRGVYGGGVGYFAANGEMDMCIALRTAVVKDET---------- 458
Query: 543 ARKRREWVAYLQAGAGIVADSDPDDEHRECQNKAAGLARAIDLAESA 589
Y+QAG G+V DSDP+ E++E NKA L RA AE A
Sbjct: 459 --------LYIQAGGGVVYDSDPEAEYQETVNKARALRRA---AEDA 494
>gnl|CDD|215913 pfam00425, Chorismate_bind, chorismate binding enzyme. This family
includes the catalytic regions of the chorismate binding
enzymes anthranilate synthase, isochorismate synthase,
aminodeoxychorismate synthase and para-aminobenzoate
synthase.
Length = 254
Score = 309 bits (795), Expect = e-102
Identities = 126/273 (46%), Positives = 162/273 (59%), Gaps = 23/273 (8%)
Query: 305 EAYKNAVLEAKEHIQAGDIFQIVLSQRFERRT--FADPFEVYRALRVVNPSPYMTYLQAR 362
E Y AV +AKE I+AGD++++VLS+R E DP +YR LR NP+PY L+
Sbjct: 3 EDYAAAVEKAKEAIRAGDLYKVVLSRRLELPLSSPIDPLALYRRLRARNPAPYAFLLELG 62
Query: 363 GCILVASSPEILTRVKKNKIVNRPLAGTVRRGRTTEEDEMLETQLLKDAKQCAEHVMLVD 422
+ +SPE L V+ +I RPLAGT RG EEDE L +LL K+ AEH+M+VD
Sbjct: 63 D--FLGASPERLLSVRGGRITTRPLAGTRPRGEDPEEDEALAAELLASEKERAEHLMVVD 120
Query: 423 LGRNDVGKVARSGSVKVEKLMNVERYSHVMHISSTITGELQDRLSCWDALRAALPVGTVS 482
L RND+G+V + GSVKV +L VERY +V H+ STITG L+ LS D L A P G V+
Sbjct: 121 LIRNDLGRVCK-GSVKVPELPEVERYGNVQHLVSTITGRLKPGLSLLDLLAALHPTGAVT 179
Query: 483 GAPKVKAMELIDELEVNRRGPYSGGFGGVSFTGDMDIALALRTMVFQTGTRYDTMYSYKD 542
GAPK +AME+I ELE RG Y+G G + G+ D A+A+RT + G
Sbjct: 180 GAPKKRAMEIIAELEPFDRGLYAGAVGWLDPDGNGDFAVAIRTALIDNGR---------- 229
Query: 543 ARKRREWVAYLQAGAGIVADSDPDDEHRECQNK 575
A L AGAGIVADSDP+ E E + K
Sbjct: 230 --------ARLYAGAGIVADSDPEAEWAETELK 254
>gnl|CDD|237432 PRK13572, PRK13572, anthranilate synthase component I; Provisional.
Length = 435
Score = 316 bits (811), Expect = e-102
Identities = 176/493 (35%), Positives = 257/493 (52%), Gaps = 65/493 (13%)
Query: 96 DHLTPVVAYRCLVQEDDREAPSFLFESVEPGVRVSNVGRYSVVGAQPVMEVIVKDNNVTI 155
D++ P+ Y L E F+ ES E G R + RY+ + A P E +V+ N T
Sbjct: 7 DYVNPLKLYSVLRDEGY----PFILESAEKGQRKA---RYTYISANP--EFMVRIGNKTK 57
Query: 156 MDHEKGSLVEEVVDDPMEIPRKISEDWKPQIIDELPEAFCGGWVGYFSYDTVR-YVEKKK 214
+D E S E + + F GG+VGY +YD V Y+ K
Sbjct: 58 VDGETISKESNPFKALKENFKITQSG----------DRFTGGFVGYIAYDAVHNYIGGKI 107
Query: 215 LPFSKAPHDDRSLADIHLGLYNDVLVFDHVEKKVYVIHWVRLDQHSSVQKAYAEGLEHLE 274
S G Y+ V V+DHV +K Y S+ E L + E
Sbjct: 108 EEPSV------------FGYYDHVFVYDHVTRKFYFH---------SLNNN-PEELFNAE 145
Query: 275 KLVARKVITRSIDLHTHHFGPPLKKSNMTSEAYKNAVLEAKEHIQAGDIFQIVLSQRFER 334
K+V + + ++ G + + E + V +AKE+I +GD+FQ+VLS+ +
Sbjct: 146 KIVEK---AKRFEIEEEDGGSEVLGCDADREEFVEMVEKAKEYIYSGDVFQVVLSREYRL 202
Query: 335 RTFADPFEVYRALRVVNPSPYMTYLQARGCILVASSPEILTRVKKNKIVNRPLAGTVRRG 394
+T PF++YR LR +NPSPYM +L +V +SPE + V+ N + P+AGT RG
Sbjct: 203 KTDLSPFQLYRNLREINPSPYM-FLLEFDKDVVGASPETMASVENNILKINPIAGTAPRG 261
Query: 395 RTTEEDEMLETQLLKDAKQCAEHVMLVDLGRNDVGKVARSGSVKVEKLMNVERYSHVMHI 454
+T EED+ L LL D K+ AEHVMLVDL RNDV KV++SGSV++E+ +V +YSHV HI
Sbjct: 262 KTEEEDKKLAEALLSDEKERAEHVMLVDLARNDVRKVSKSGSVRLERFFDVVKYSHVQHI 321
Query: 455 SSTITGELQDRLSCWDALRAALPVGTVSGAPKVKAMELIDELEVNRRGPYSGGFGGVSFT 514
S + GEL++ + +DA+ AA P GT++GAPK +AME+IDELE +RR Y G G S +
Sbjct: 322 ESEVVGELKEDSTMFDAIEAAFPAGTLTGAPKFRAMEIIDELEKSRRKVYGGAVGYFSNS 381
Query: 515 GDMDIALALRTMVFQTGTRYDTMYSYKDARKRREWVAYLQAGAGIVADSDPDDEHRECQN 574
G+ D+A+A+R + V ++AGAGIVADS P+ E E +
Sbjct: 382 GNADLAIAIRMAEI-------------------DKVCRVRAGAGIVADSVPEKEFYETER 422
Query: 575 KAAGLARAIDLAE 587
K A + +A+ +
Sbjct: 423 KMAAVLKALGVVN 435
>gnl|CDD|130883 TIGR01824, PabB-clade2, aminodeoxychorismate synthase, component I,
clade 2. This clade of sequences is more closely
related to TrpE (anthranilate synthase,
TIGR00564/TIGR01820/TIGR00565) than to the better
characterized group of PabB enzymes
(TIGR00553/TIGR01823). This clade includes one
characterized enzyme from Lactococcus and the conserved
function across the clade is supported by these pieces
of evidence: 1) all genomes with a member in this clade
also have a separate TrpE gene, 2) none of these genomes
contain an aparrent PabB from any of the other PabB
clades, 3) none of these sequences are found in a region
of the genome in association with other Trp biosynthesis
genes, 4) all of these genomes aparrently contain most
if not all of the steps of the folate biosynthetic
pathway (for which PABA is a precursor). Many of the
sequences hit by this model are annotated as TrpE
enzymes, however, we believe that all members of this
clade are, in fact, PabB. The sequences from Bacillus
halodurans and subtilus which score below the trusted
cutoff for this model are also likely to be PabB
enzymes, but are too closely related to TrpE to be
separated at this time.
Length = 355
Score = 287 bits (735), Expect = 5e-92
Identities = 133/392 (33%), Positives = 190/392 (48%), Gaps = 45/392 (11%)
Query: 196 GGWVGYFSYDTVRYVEKKKLPFSKAPHDDRSLADIHLGLYNDVLVFDHVEKKVYVIH--- 252
GG +G+ +YD R +E D Y + DH + V +
Sbjct: 1 GGRLGWLAYDVARRLE----GIPDLGTSDGGWPVAADFRYEAAVARDHQRQIVALATVPA 56
Query: 253 -WVRLDQHSSVQKAYAEGLEHLEKLVARKVITRSIDLHTHHFGPPLKKSNMTS--EAYKN 309
SS Q L + GP + AY+
Sbjct: 57 ETEGEFATSSDQLPAVAAATSLP---------------SPDVGPLPVDLEASIDRAAYET 101
Query: 310 AVLEAKEHIQAGDIFQIVLSQRFERRTFA--DPFEVYRALRVVNPSPYMTYLQARGCILV 367
V K++I+AGD+FQ LS+R A DP +++ ALR NP+PY YL+ G +
Sbjct: 102 GVRRIKDYIRAGDVFQANLSRRLTAPIAADVDPLQLFLALRAPNPAPYAIYLEEPGVDVA 161
Query: 368 ASSPEILTRVKKNKIVNRPLAGTVRRGRTTEEDEMLETQLLKDAKQCAEHVMLVDLGRND 427
++SPE+ + + RP+AGT RG T ED L +LL+ K AEHVM+VDL RND
Sbjct: 162 SASPELFLAREGRVVQTRPIAGTRPRGATLAEDGALAAELLQHDKDRAEHVMIVDLERND 221
Query: 428 VGKVARSGSVKVEKLMNVERYSHVMHISSTITGELQDRLSCWDALRAALPVGTVSGAPKV 487
+G+V +G+V+V +L VE YSHV H+ S +TG L++ D +RA P G+++GAPKV
Sbjct: 222 LGRVCATGTVRVPELCAVESYSHVHHLVSRVTGRLREGAGLADLIRALFPGGSITGAPKV 281
Query: 488 KAMELIDELEVNRRGPYSGGFGGVSFTGDMDIALALRTMVFQTGTRYDTMYSYKDARKRR 547
+AME+IDELE RGPY+G G + G+ D+ + +RT + G +
Sbjct: 282 RAMEIIDELEPQPRGPYTGSVGWIDADGNADLNILIRT-LEGGGAQL------------- 327
Query: 548 EWVAYLQAGAGIVADSDPDDEHRECQNKAAGL 579
+ + GAGIVADSDP E E + KA L
Sbjct: 328 ----HFRTGAGIVADSDPAGEWDETEAKARAL 355
>gnl|CDD|184155 PRK13574, PRK13574, anthranilate synthase component I; Provisional.
Length = 420
Score = 288 bits (739), Expect = 1e-91
Identities = 168/485 (34%), Positives = 241/485 (49%), Gaps = 79/485 (16%)
Query: 100 PVVAYRCLVQEDDREAPSFLFESVEPGVRVSNVGRYSVVGAQPVMEVIVKDNNVTIMDHE 159
P ++C+ ++ L ES+ RYSV+
Sbjct: 12 PFEVFKCIERDFKVAG---LLESIGGP---QYKARYSVIAWG-----------------T 48
Query: 160 KGSLVEEVVDDPMEIPRKISEDWKPQIIDELPEAFCGGWVGYFSYDTVRYVEKKKLPFSK 219
G L ++ DDP+ I ++ K + ++P F GG +GY SYD VR+ EK +
Sbjct: 49 NGYL--KIHDDPVNI---LNSYLKDLKLVDIPGLFKGGMIGYISYDAVRFWEKIR-DLKP 102
Query: 220 APHDDRSLADIHLGLYNDVLVFDHVEKKVYVIHWVRLDQHSSVQKAYAEGLEHLEKLVAR 279
A D + ++++++DH E KVYV G
Sbjct: 103 AAED---WPYAEFFIPDNIIIYDHNEGKVYV-----------------NG---------- 132
Query: 280 KVITRSIDLHTHHFGPPLKKSNMTSEAYKNAVLEAKEHIQAGDIFQIVLSQRFERRTF-A 338
+ + F ++ Y+ V E+ E+I++G IFQ+VLS RF R F
Sbjct: 133 DLSSVGGCGDMGEFKISFYDESLNKNNYEKIVSESLEYIRSGYIFQVVLS-RFYRYLFSG 191
Query: 339 DPFEVYRALRVVNPSPYMTYLQARGCILVASSPEILTRVKKNKIVNRPLAGTVRRGRTTE 398
DP +Y LR +NPSPYM YL+ L+ SSPE+L RV+ N + P+AGT RG E
Sbjct: 192 DPLRIYYNLRRINPSPYMFYLKFDERYLIGSSPELLFRVQDNIVETYPIAGTRPRGSDQE 251
Query: 399 EDEMLETQLLKDAKQCAEHVMLVDLGRNDVGKVARSGSVKVEKLMNVERYSHVMHISSTI 458
ED LE +L+ K AEH+MLVDL RND+GKV G+V+V +LM VE+YSHV HI S +
Sbjct: 252 EDLKLELELMNSEKDKAEHLMLVDLARNDLGKVCVPGTVRVPELMYVEKYSHVQHIVSKV 311
Query: 459 TGELQDRLSCWDALRAALPVGTVSGAPKVKAMELIDELEVNRRGPYSGGFGGVSFTGDMD 518
G L+ + + D L+A P GTVSGAPK AM +I+ LE +RGPY+G G +S G+ +
Sbjct: 312 IGTLKKKYNALDVLKATFPAGTVSGAPKPMAMNIIETLEEYKRGPYAGAVGFISADGNAE 371
Query: 519 IALALRTMVFQTGTRYDTMYSYKDARKRREWVAYLQAGAGIVADSDPDDEHRECQNKAAG 578
A+A+R T + KD + +QAGAGIV DS+P+ E+ E ++K
Sbjct: 372 FAIAIR-----------TAFLNKDLLR-------IQAGAGIVYDSNPESEYFETEHKLRA 413
Query: 579 LARAI 583
L AI
Sbjct: 414 LKTAI 418
>gnl|CDD|184148 PRK13567, PRK13567, anthranilate synthase component I; Provisional.
Length = 468
Score = 283 bits (726), Expect = 3e-89
Identities = 154/459 (33%), Positives = 233/459 (50%), Gaps = 44/459 (9%)
Query: 133 GRYSVVGAQPVMEVIVKDNNVTI-MDHEKGSLVEEVVDDPMEIPRKISEDWKPQIIDELP 191
GRYSVV + + ++ +++ E + E + + + + + LP
Sbjct: 37 GRYSVVIFDIYGTLTLDNDVLSVSTLKESYQITERPYHYLTTKINEDYHNIQDEQLKSLP 96
Query: 192 EAFCGGWVGYFSYDTVRYVEKKKLPFSKAPHDDRSLADIHLGLYNDVLVFDHVEKKVYVI 251
F G+VG S+D VR+ E KL +D D+ L + V VFDH + ++Y+I
Sbjct: 97 --FISGYVGTCSFDLVRH-EFPKL--QSIQLEDHKQHDVRLYMVEQVYVFDHYKDELYII 151
Query: 252 HWVRLDQHSSVQKAYAEGLEHLEKLVARKV-----ITRSIDLHTHHFGPPLKKSNMTSEA 306
+Q S+ K LE V + + I + F +SN++ E
Sbjct: 152 ---ATNQFSNSTK------SDLENRVNKSIEDLTKIQPFMPTQDFDFKTKEIQSNISEER 202
Query: 307 YKNAVLEAKEHIQAGDIFQIVLSQRFE-RRTFAD-----PFEVYRALRVVNPSPYMTYLQ 360
+ + KE I GD+FQ+V S+ ++ + F++Y+ L+ NPSPYM YL
Sbjct: 203 FIEMIQYFKEKITEGDMFQVVPSRIYKYAHHASQHLNQLSFQLYQNLKRQNPSPYMYYLN 262
Query: 361 ARGCILVASSPEILTRVKKNKIVNRPLAGTVRRGRTTEEDEMLETQLLKDAKQCAEHVML 420
+V SSPE VK + P+AGT++RG TT+ D QLL D K+C+EH ML
Sbjct: 263 IDQPYIVGSSPESFVSVKDQIVTTNPIAGTIQRGETTQIDNENMKQLLNDPKECSEHRML 322
Query: 421 VDLGRNDVGKVARSGSVKVEKLMNVERYSHVMHISSTITGELQDRLSCWDALRAALPVGT 480
VDLGRND+ +V++ G+ K+ KLM +E+Y HVMHI S +TG++ LS + LP GT
Sbjct: 323 VDLGRNDIHRVSKIGTSKITKLMVIEKYEHVMHIVSEVTGKINQNLSPMTVIANLLPTGT 382
Query: 481 VSGAPKVKAMELIDELEVNRRGPYSGGFGGVSFTGDMDIALALRTMVFQTGTRYDTMYSY 540
VSGAPK++A+E I E ++RG YSGG G ++ ++D ALA+RTM+
Sbjct: 383 VSGAPKLRAIERIYEQYPHKRGVYSGGVGYINCNHNLDFALAIRTMMID----------- 431
Query: 541 KDARKRREWVAYLQAGAGIVADSDPDDEHRECQNKAAGL 579
E ++AG G+V DS P+ E E + KA L
Sbjct: 432 -------EQYINVEAGCGVVYDSIPEKELNETKLKAKSL 463
>gnl|CDD|235651 PRK05940, PRK05940, anthranilate synthase component I-like protein;
Validated.
Length = 463
Score = 282 bits (723), Expect = 7e-89
Identities = 149/413 (36%), Positives = 225/413 (54%), Gaps = 48/413 (11%)
Query: 184 PQIIDELPEAFCGGWVGYFSYDTVRYVEK------KKLPFSKAP-HDDRSLADIHLGLYN 236
+ + LP F GGW+G+ YD +E+ LPF A ++ S A
Sbjct: 88 SALPEHLP--FTGGWLGWLGYDLAWEIERLPHLNPDPLPFPVAYWYEPESFA-------- 137
Query: 237 DVLVFDHVEKKVYVIHWVRLDQHSSVQKAYAEGLEHLEKLVARKVITRSIDLHTHHFGPP 296
+ DH E+ +++ + L+ LE+ + + T DL PP
Sbjct: 138 ---ILDHQEQILWL------------AASDPSQLDRLEQQLEQP--TPEPDLPLDLRTPP 180
Query: 297 LKKSNMTSE-AYKNAVLEAKEHIQAGDIFQIVLSQRFERRTFADPFEVYRALRVVNPSPY 355
T++ Y+ AV +AK++IQAGDIFQ LS RF+ T AD +++YR L+ +NPSP+
Sbjct: 181 SSLIFYTTQQEYEAAVRQAKKYIQAGDIFQANLSLRFQTTTSADSWQIYRRLQQINPSPF 240
Query: 356 MTYLQARGCILVASSPEILTRVKKNKIVNRPLAGTVRRGRTTEEDEMLETQLLKDAKQCA 415
+Y + +V+ SPE L +++ N+ RP+AGT RG+T ED+ L +LL + K+ A
Sbjct: 241 ASYWRTPWGDVVSCSPERLVQLQGNQAQTRPIAGTRPRGKTPAEDQQLAEELLSNIKERA 300
Query: 416 EHVMLVDLGRNDVGKVARSGSVKVEKLMNVERYSHVMHISSTITGELQDRLSCWDALRAA 475
EH+MLVDL RND+G+V + GSV+V++L+ +ERYSHV+H+ S + G LQ D +RA
Sbjct: 301 EHIMLVDLERNDLGRVCQWGSVEVDELLTIERYSHVIHLVSNVVGTLQPNRDAIDLIRAL 360
Query: 476 LPVGTVSGAPKVKAMELIDELEVNRRGPYSGGFGGVSFTGDMDIALALRTMVFQTGTRYD 535
P GT++G PKV+ ME+I+ELE RR + G G + G++D+ + +RT+
Sbjct: 361 FPGGTITGCPKVRCMEIIEELEPVRRNLFYGSCGYLDQRGNLDLNILIRTL--------- 411
Query: 536 TMYSYKDARKRREWVAYLQAGAGIVADSDPDDEHRECQNKAAGLARAIDLAES 588
+Y+ W Q GAGIVADSDP+ E E KA A++L S
Sbjct: 412 -LYTPLSRGLSTIWG---QVGAGIVADSDPEKEWLESLQKAKAQLAALNLVRS 460
>gnl|CDD|233020 TIGR00553, pabB, aminodeoxychorismate synthase, component I,
bacterial clade. Members of this family,
aminodeoxychorismate synthase, component I (PabB), were
designated para-aminobenzoate synthase component I until
it was recognized that PabC, a lyase, completes the
pathway of PABA synthesis. This family is closely
related to anthranilate synthase component I (trpE), and
both act on chorismate. The clade of PabB enzymes
represented by this model includes sequences from
Gram-positive and alpha and gamma Proteobacteria as well
as Chlorobium, Nostoc, Fusobacterium and Arabidopsis. A
closely related clade of fungal PabB enzymes is
identified by TIGR01823, while another bacterial clade
of potential PabB enzymes is more closely related to
TrpE (TIGR01824) [Biosynthesis of cofactors, prosthetic
groups, and carriers, Folic acid].
Length = 328
Score = 261 bits (668), Expect = 2e-82
Identities = 126/380 (33%), Positives = 181/380 (47%), Gaps = 54/380 (14%)
Query: 198 WVGYFSYDTVRYVEKKKLPFSKAPHDDRSLADIHLGLYNDVLVFDHVEKKVYVIHWVRLD 257
VGY SY+ D Y+ L+ DH +R
Sbjct: 1 LVGYLSYEAGP--------------------DAAFEPYDAALLADHRRT-----PLLRFL 35
Query: 258 QHSSVQKAYAEGLEHLEKLVARKVITRSIDLHTHHFGPPLKKSNMTSEAYKNAVLEAKEH 317
V+ +E + A +S MT Y A+ + +++
Sbjct: 36 VFERVEAQPRAAVEAEDDAPAD---------RQAPTSDI--QSEMTRAEYGEAIDQLQDY 84
Query: 318 IQAGDIFQIVLSQRFERRTFADPFEVYRALRVVNPSPYMTYLQARGCILVASSPEILTRV 377
I+AGD +Q L+Q+F DP +R LR P+P+ +L +++ SPE+ +
Sbjct: 85 IRAGDCYQANLTQQFHATWDGDPLAAFRKLRRRQPAPFSAFLDLGDGAILSLSPELFFSI 144
Query: 378 KKNKIVNRPLAGTVRRGRTTEEDEMLETQLLKDAKQCAEHVMLVDLGRNDVGKVARSGSV 437
++I RP+ GT+ RG +ED + L + AK AE++M+VDL RND+G++A GSV
Sbjct: 145 DGSEIETRPIKGTLPRGADPQEDRAQASALAESAKDRAENLMIVDLLRNDLGRIAEVGSV 204
Query: 438 KVEKLMNVERYSHVMHISSTITGELQDRLSCWDALRAALPVGTVSGAPKVKAMELIDELE 497
KV +L VE Y V + STIT L++ L+ D RA P G+++GAPKV+AME+IDELE
Sbjct: 205 KVPELFVVETYPTVHQLVSTITARLREDLTLSDLFRALFPGGSITGAPKVRAMEIIDELE 264
Query: 498 VNRRGPYSGGFGGVSFTGDMDIALALRTMVFQTGTRYDTMYSYKDARKRREWVAYLQAGA 557
RG Y G G +S GDMD +A+RT+ G A G
Sbjct: 265 PQPRGVYCGAIGYLSPEGDMDFNVAIRTLTLDGGR------------------AVYGVGG 306
Query: 558 GIVADSDPDDEHRECQNKAA 577
GIVADSDP+ E+REC KAA
Sbjct: 307 GIVADSDPEAEYRECLLKAA 326
>gnl|CDD|237428 PRK13564, PRK13564, anthranilate synthase component I; Provisional.
Length = 520
Score = 255 bits (655), Expect = 4e-78
Identities = 145/431 (33%), Positives = 203/431 (47%), Gaps = 75/431 (17%)
Query: 185 QIIDELPEAFCGGWVGYFSYDTVRYVEKKKLPFSKAPHDDRS------LADIHLGLYNDV 238
E EA G G F+YD V E P + P + LA+ +
Sbjct: 135 NTPKEEREALFLG--GLFAYDLVAGFE----PLPQLPAGNNCPDYCFYLAET-------L 181
Query: 239 LVFDHVEKKVYVIHWVRLDQHSSVQKAYAEGLEHLEKLVARKVITRSIDLHTHHFGPPLK 298
LV DH +K + ++ A L L++ + P
Sbjct: 182 LVIDHQKKSA-RLQASLFTPDEEEKQRLAARLAQLKQQLT----------QPAPPLPVTS 230
Query: 299 KSNMT------SEAYKNAVLEAKEHIQAGDIFQIVLSQRFERRTF----ADPFEVYRALR 348
+M E + V + KEHI+AGDIFQ+V S R F P YR L+
Sbjct: 231 VPDMEVSVNISDEEFCAVVRKLKEHIRAGDIFQVVPS-----RRFSLPCPSPLAAYRVLK 285
Query: 349 VVNPSPYMTYLQARGCILVASSPEILTRVKKNKIVNR----PLAGTVRRGRTT------E 398
NPSPYM Y+Q L +SPE + +K + + P+AGT RGR +
Sbjct: 286 KSNPSPYMFYMQDEDFTLFGASPE--SALKYDASSRQVEIYPIAGTRPRGRRADGSIDRD 343
Query: 399 EDEMLETQLLKDAKQCAEHVMLVDLGRNDVGKVARSGSVKVEKLMNVERYSHVMHISSTI 458
D +E +L D K+ AEH+MLVDL RND+ ++ + GS V L+ V+RYSHVMH+ S +
Sbjct: 344 LDSRIELELRTDHKELAEHLMLVDLARNDLARICQPGSRYVADLLKVDRYSHVMHLVSRV 403
Query: 459 TGELQDRLSCWDALRAALPVGTVSGAPKVKAMELIDELEVNRRGPYSGGFGGVSFTGDMD 518
GEL+ L A RA + +GT++GAPKV+AM+LI E+E RRG Y G G ++ GD+D
Sbjct: 404 VGELRHDLDALHAYRACMNMGTLTGAPKVRAMQLIREVEGQRRGSYGGAVGYLTGHGDLD 463
Query: 519 IALALRTMVFQTGTRYDTMYSYKDARKRREWVAYLQAGAGIVADSDPDDEHRECQNKAAG 578
+ +R+ + G +A +QAGAG+V DSDP E E +NKA
Sbjct: 464 TCIVIRSAFVENG------------------IATVQAGAGVVLDSDPQSEADETRNKAQA 505
Query: 579 LARAIDLAESA 589
+ RAI A A
Sbjct: 506 VLRAIATAHHA 516
>gnl|CDD|236371 PRK09070, PRK09070, hypothetical protein; Validated.
Length = 447
Score = 242 bits (619), Expect = 8e-74
Identities = 145/473 (30%), Positives = 222/473 (46%), Gaps = 55/473 (11%)
Query: 118 FLFESVEPGVRVSNVGRYSVVGAQPVMEVIVKDNNVTIMDHEKGSLVEEVVDDPMEIPRK 177
L ES G + GR+ V+ + + D + +G ++ + D + R
Sbjct: 26 ALLESSASG---TAQGRWDVLLLAQ-GKCLRLDPDGVTRQLLEGDFLDAL-DAAWQAERV 80
Query: 178 ISEDWKPQIIDELPEAFCGGWVGYFSYDTVRYVEKKKLPFSKAP-HDDRSLADIHLGLYN 236
+ LP F GGW Y+ VE P K P D + L
Sbjct: 81 PHDGE-----SSLP--FRGGWAVLLDYELAGQVE----PILKLPMRTDGLPLALALRAPA 129
Query: 237 DVLVFDHVEKKVYVIHWVRLDQHSSVQKAYAEGLEHLEKLVARKVITRSIDLHTHHFGPP 296
VL D + ++ + L+ +E +A + + P
Sbjct: 130 AVLR-DRHSGRCVLV----------AEPGREHLLDQIEADLAACAALPPLPVWL----AP 174
Query: 297 LKKSNMTSEAYKNAVLEAKEHIQAGDIFQIVLSQRFERRTFA---DPFEVYRALRVVNPS 353
E + + V ++I+AGD+FQ+ LS+ + + FA DP +Y LR NP+
Sbjct: 175 QAVEEDPPERFTDGVERVLDYIRAGDVFQVNLSRAW-QAQFANAVDPAALYARLRAANPA 233
Query: 354 PYMTYLQARGCILVASSPEILTRVKKNKIVNRPLAGTVRRGRTTEEDEMLETQLLKDAKQ 413
P+ A G +V+SSPE L V+ + RP+AGT R ++D L +L+ K+
Sbjct: 234 PFSGLFVAAGRAIVSSSPERLVSVQGGVVQTRPIAGTRPRF-AGDDDAALIRELVGHPKE 292
Query: 414 CAEHVMLVDLGRNDVGKVARSGSVKVEKLMNVERYSHVMHISSTITGELQDRLSCWDALR 473
AEHVML+DL RND+G++ GSV+V++LM VE Y+HV HI S + G L+D ++ + +R
Sbjct: 293 RAEHVMLIDLERNDLGRICAPGSVEVDELMTVESYAHVHHIVSNVRGRLRDGVTPGEVIR 352
Query: 474 AALPVGTVSGAPKVKAMELIDELEVNRRGPYSGGFGGVSFTGDMDIALALRTMVFQTGTR 533
A P GT++G PKV+ M++I ELE RG Y+G FG ++ GDMD+ + +RT
Sbjct: 353 AVFPGGTITGCPKVRCMQIIAELEQTPRGAYTGSFGYLNRDGDMDLNILIRTAE------ 406
Query: 534 YDTMYSYKDARKRREWVAYLQAGAGIVADSDPDDEHRECQNKAAGLARAIDLA 586
+ R + GAGIV DSDP+ E E + KA GL RA++ A
Sbjct: 407 -------VQGNQVR-----FRTGAGIVVDSDPERELDETRAKARGLLRALEQA 447
>gnl|CDD|185362 PRK15465, pabB, aminodeoxychorismate synthase subunit I;
Provisional.
Length = 453
Score = 228 bits (582), Expect = 2e-68
Identities = 140/444 (31%), Positives = 225/444 (50%), Gaps = 47/444 (10%)
Query: 134 RYSVVGAQPVMEVIVKDNNVTIMDHEKGSLVEEVVDDPMEIPRKI--SEDWKPQIIDELP 191
R+ +V A P+ + + + EK DDP+++ +++ D +P ++LP
Sbjct: 45 RFDIVVADPICTLTTFGKETVVSESEK---RTTTTDDPLQVLQQVLDRADIRPTHNEDLP 101
Query: 192 EAFCGGWVGYFSYDTVRYVEKKKLPFSKAPHDDRSLADIHLGLYNDVLVFDHVEKKVYVI 251
F GG +G F YD R E LP + D L D+ +G+Y+ L+ DH + V ++
Sbjct: 102 --FQGGALGLFGYDLGRRFES--LP--EIAEQDIVLPDMAVGIYDWALIVDHQRQTVSLL 155
Query: 252 HWVRLDQHSSVQKAYAEGLEHLEKLVARKVITRSIDLHTHHFGPPLKKSNMTSEAYKNAV 311
++ + L +++ + T + +SNMT E Y
Sbjct: 156 -------------SHNDVNARRAWLESQQFSPQEDFTLTSDW-----QSNMTREQYGEKF 197
Query: 312 LEAKEHIQAGDIFQIVLSQRFERRTFADPFEVYRALRVVNPSPYMTYLQARGCILVASSP 371
+ +E++ +GD +Q+ L+QRF D ++ + L N +P+ +L+ +++ SP
Sbjct: 198 RQVQEYLHSGDCYQVNLAQRFHATYSGDEWQAFLQLNQANRAPFSAFLRLEQGAILSLSP 257
Query: 372 EILTRVKKNKIVNRPLAGTVRRGRTTEEDEMLETQLLKDAKQCAEHVMLVDLGRNDVGKV 431
E ++I RP+ GT+ R +ED +L AK AE++M+VDL RND+G+V
Sbjct: 258 ERFILCDNSEIQTRPIKGTLPRLPDPQEDSKQAEKLANSAKDRAENLMIVDLMRNDIGRV 317
Query: 432 ARSGSVKVEKLMNVERYSHVMHISSTITGELQDRLSCWDALRAALPVGTVSGAPKVKAME 491
A +GSVKV +L VE + V H+ STIT L ++L D LRAA P G+++GAPKV+AME
Sbjct: 318 AVAGSVKVPELFVVEPFPAVHHLVSTITARLPEQLHASDLLRAAFPGGSITGAPKVRAME 377
Query: 492 LIDELEVNRRGPYSGGFGGVSFTGDMDIALALRTMVFQTGTRYDTMYSYKDARKRREWVA 551
+IDELE RR + G G +SF G+MD ++ +RT+ G
Sbjct: 378 IIDELEPQRRNAWCGSIGYLSFCGNMDTSITIRTLTAINGQ------------------I 419
Query: 552 YLQAGAGIVADSDPDDEHRECQNK 575
Y AG GIVADS + E++E +K
Sbjct: 420 YCSAGGGIVADSQEEAEYQETFDK 443
>gnl|CDD|129656 TIGR00565, trpE_proteo, anthranilate synthase component I,
proteobacterial subset. This enzyme resembles some
other chorismate-binding enzymes, including
para-aminobenzoate synthase (pabB) and isochorismate
synthase. There is a fairly deep split between two sets,
seen in the pattern of gaps as well as in amino acid
sequence differences. This group includes proteobacteria
such as E. coli and Helicobacter pylori but also the
gram-positive organism Corynebacterium glutamicum. The
second group includes eukaryotes, archaea, and most
other bacterial lineages; sequences from the second
group may resemble pabB more closely than other trpE
from this group [Amino acid biosynthesis, Aromatic amino
acid family].
Length = 498
Score = 227 bits (581), Expect = 1e-67
Identities = 136/400 (34%), Positives = 193/400 (48%), Gaps = 53/400 (13%)
Query: 200 GYFSYDTVRYVEKKKLPFSKAPHDDRSLADIHLGLYNDVLVFDHVEKKVYVIHWVRLDQH 259
G FSYD V E LP KA + + D L ++V DH +K
Sbjct: 131 GLFSYDLVAGFED--LPHLKA--KNNNCPDFCFYLAETLIVIDHQKKST----------- 175
Query: 260 SSVQKAYAEGLEHLEKLVARKVITRSIDLHTHHFGPPLK------KSNMTSEAYKNAVLE 313
+Q + ++L AR + P + N + + V
Sbjct: 176 -RIQASCFAERFEKQRLQARLDLLEQQKTIKADPVPVKSVPSMEVECNQSDSEFGGVVRS 234
Query: 314 AKEHIQAGDIFQIVLSQRFERRTFADPFEVYRALRVVNPSPYMTYLQARGCILVASSPEI 373
++ I+AG+IFQ+V S+RF P Y L+ NPSPYM Y+Q IL +SPE
Sbjct: 235 LQKAIRAGEIFQVVPSRRFSLPC-PSPLAAYYVLKKSNPSPYMFYMQDNDFILFGASPE- 292
Query: 374 LTRVKKNKIVNR----PLAGTVRRGRTT------EEDEMLETQLLKDAKQCAEHVMLVDL 423
+ +K + + + P+AGT RGR + D +E L D K+ AEH+MLVDL
Sbjct: 293 -SALKYDALSRQIEIYPIAGTRPRGRDADGNIDRDLDSRIELDLRTDHKELAEHLMLVDL 351
Query: 424 GRNDVGKVARSGSVKVEKLMNVERYSHVMHISSTITGELQDRLSCWDALRAALPVGTVSG 483
RND+ +V GS V L V+RYS+VMH+ S + GEL+ L A RA + +GT+SG
Sbjct: 352 ARNDLARVCTPGSRYVADLTKVDRYSYVMHLVSRVVGELRHDLDALHAYRACMNMGTLSG 411
Query: 484 APKVKAMELIDELEVNRRGPYSGGFGGVSFTGDMDIALALRTMVFQTGTRYDTMYSYKDA 543
APK++AM+LI + E RRG Y G G ++ GD+D + +R+ + G
Sbjct: 412 APKIRAMQLIYQAEGQRRGSYGGAVGYLTSHGDLDTCIVIRSAFVENG------------ 459
Query: 544 RKRREWVAYLQAGAGIVADSDPDDEHRECQNKAAGLARAI 583
+A +QAGAGIV DS P E E +NKA + RAI
Sbjct: 460 ------IATVQAGAGIVLDSVPQSEADETRNKARAVLRAI 493
>gnl|CDD|215481 PLN02889, PLN02889, oxo-acid-lyase/anthranilate synthase.
Length = 918
Score = 208 bits (530), Expect = 9e-58
Identities = 136/412 (33%), Positives = 207/412 (50%), Gaps = 51/412 (12%)
Query: 188 DELPEAFCGGWVGYFSYDTVRYVEKKKLPF----SKAPHDDRSLADIHLGLYNDVLVFDH 243
+ LP F GG+VGY YD VE + S P AD +V+V DH
Sbjct: 532 EGLPFDFHGGYVGYIGYDL--KVECG-MASNRHKSTTPDACFFFAD-------NVVVIDH 581
Query: 244 VEKKVYVIHWVRLDQHSSVQKAYAEGLE----HLEKLVARKVITRSIDLHTHHFGPPLKK 299
VY++ L + S+ + + E L+ RK+ ++ T P K
Sbjct: 582 HYDDVYIL---SLHEGSTATTQWLDDTEQKLLGLKASATRKLEVQTSPTATF---SPSKA 635
Query: 300 S---NMTSEAYKNAVLEAKEHIQAGDIFQIVLSQRFERRTF-ADPFEVYRALRVVNPSPY 355
+ + E Y V + ++I+ G+ +++ L+ + +R D +Y LR NP+PY
Sbjct: 636 GFLADKSREQYIKDVQKCLKYIKDGESYELCLTTQMRKRIGEIDSLGLYLHLREKNPAPY 695
Query: 356 MTYL---QARGCILVASSPEILTRVKKNKIVN-RPLAGTVRRGRTTEEDEMLETQLLKDA 411
+L CI +SSPE ++ +N ++ +P+ GT+ RG T EEDE L+ QL
Sbjct: 696 AAWLNFSNENLCI-CSSSPERFLKLDRNGMLEAKPIKGTIARGSTPEEDEQLKLQLQYSE 754
Query: 412 KQCAEHVMLVDLGRNDVGKVARSGSVKVEKLMNVERYSHVMHISSTITGELQDRLSCWDA 471
K AE++M+VDL RND+G+V GSV V LM+VE Y+ V + STI G+ + +S D
Sbjct: 755 KDQAENLMIVDLLRNDLGRVCEPGSVHVPNLMDVESYTTVHTMVSTIRGKKRSNMSPVDC 814
Query: 472 LRAALPVGTVSGAPKVKAMELIDELEVNRRGPYSGGFGGVSFTGDMDIALALRTMVFQTG 531
+RAA P G+++GAPK+++MEL+D LE + RG YSG G S+ D+ + +RT+V G
Sbjct: 815 VRAAFPGGSMTGAPKLRSMELLDSLESSSRGIYSGSIGFFSYNQTFDLNIVIRTVVIHEG 874
Query: 532 TRYDTMYSYKDARKRREWVAYLQAGAGIVADSDPDDEHRECQNKAAGLARAI 583
A + AG IVA S+P+DE+ E K A A+
Sbjct: 875 E------------------ASIGAGGAIVALSNPEDEYEEMILKTRAPANAV 908
>gnl|CDD|236035 PRK07508, PRK07508, aminodeoxychorismate synthase; Provisional.
Length = 378
Score = 191 bits (487), Expect = 2e-55
Identities = 91/276 (32%), Positives = 131/276 (47%), Gaps = 18/276 (6%)
Query: 302 MTSEAYKNAVLEAKEHIQAGDIFQIVLSQRFERRTFADPFEVYRALRVVNPSPYMTYLQA 361
Y HI+AGD +Q L+ + R DP ++ AL P Y +
Sbjct: 109 WDFADYAQRFERLHRHIRAGDCYQANLTFPLDARWGGDPLALFWALAARQPVGYGALVDL 168
Query: 362 RGCILVASSPEILTRVK-KNKIVNRPLAGTVRRGRTTEEDEMLETQLLKDAKQCAEHVML 420
G ++++ SPE+ RV + I P+ GT RG T ED L LL D K AE+ M+
Sbjct: 169 GGPVILSRSPELFFRVDGEGWIETHPMKGTAPRGATPAEDARLRAALLNDEKNQAENRMI 228
Query: 421 VDLGRNDVGKVARSGSVKVEKLMNVERYSHVMHISSTITGELQDRLSCWDALRAALPVGT 480
VDL RND+ +++ GS+ V +L ++E Y V + S + L L D A P G+
Sbjct: 229 VDLLRNDISRISEVGSLDVPELFDIETYPTVHQMVSRVRARLLPGLGLADIFAALFPCGS 288
Query: 481 VSGAPKVKAMELIDELEVNRRGPYSGGFGGVSFTGDMDIALALRTMVFQTGTRYDTMYSY 540
++GAPK++AME++ ELE R Y G G ++ G M +A+RT+ G R
Sbjct: 289 ITGAPKIRAMEILRELEPGPRDLYCGAIGWIAPDGRMRFNVAIRTLSLFPGGR------- 341
Query: 541 KDARKRREWVAYLQAGAGIVADSDPDDEHRECQNKA 576
A G GIV DS + E+ EC KA
Sbjct: 342 ----------AVFNVGGGIVFDSTAEAEYEECLLKA 367
>gnl|CDD|237429 PRK13566, PRK13566, anthranilate synthase; Provisional.
Length = 720
Score = 190 bits (486), Expect = 2e-52
Identities = 127/399 (31%), Positives = 185/399 (46%), Gaps = 49/399 (12%)
Query: 197 GWVGYFSYDTVRYVE--KKKLPFSKAPHDDRSLADIHLGLYNDVLVFDHVEKKVYVIHWV 254
G G F YD E ++KLP P D R D+ L L +++LV DH + +V
Sbjct: 156 GLYGAFGYDLAFQFEPIEQKLP---RPDDQR---DLVLYLPDEILVVDHYAARAWVD--- 206
Query: 255 RLDQHSSVQKAYAEGLEHLEKLVARKVITRSIDLHTHHFGPPLKKSNMTSEAYKNAVLEA 314
R + +V EGL T ++ Y V +A
Sbjct: 207 RYE--FAVGGVSTEGLPR---------ETAPSPYKPTT--ARPGFADHAPGEYAALVEKA 253
Query: 315 KEHIQAGDIFQIVLSQRFERRTFADPFEVYRALRVVNPSPYMTYLQ-ARGCILVASSPEI 373
KE + GD+F++V Q F P E++R L+ +NPSPY ++ G LV +SPE+
Sbjct: 254 KESFRRGDLFEVVPGQTFYEPCERSPSEIFRRLKEINPSPYGFFINLGDGEYLVGASPEM 313
Query: 374 LTRVKKNKIVNRPLAGTVRRGRTTEEDEMLETQLLKDAKQCAEHVMLVDLGRNDVGKVAR 433
RV+ ++ P++GT++RG D +LL K +E M D+ RND +V
Sbjct: 314 FVRVEGRRVETCPISGTIKRGADAIGDAEQIRKLLNSKKDESELTMCTDVDRNDKSRVCE 373
Query: 434 SGSVKVEKLMNVERYSHVMHISSTITGELQDRLSCWDALRAALP---VGTVSGAPKVKAM 490
GSVKV +E YS ++H + G L+ DAL A L TV+GAPK+ AM
Sbjct: 374 PGSVKVIGRRQIEMYSRLIHTVDHVEGRLRPGF---DALDAFLTHAWAVTVTGAPKLWAM 430
Query: 491 ELIDELEVNRRGPYSGGFGGVSFTGDMDIALALRTMVFQTGTRYDTMYSYKDARKRREWV 550
+ I++ E + R Y G G V F GDM+ L LRT R KD V
Sbjct: 431 QFIEDHERSPRRWYGGAVGMVGFDGDMNTGLTLRT------IR------IKDG------V 472
Query: 551 AYLQAGAGIVADSDPDDEHRECQNKAAGLARAIDLAESA 589
A ++ GA ++ DSDP+ E E + KA+ L +A+ A+
Sbjct: 473 AEVRVGATLLFDSDPEAEEAETELKASALLQALRGAKPK 511
>gnl|CDD|130874 TIGR01815, TrpE-clade3, anthranilate synthase, alpha
proteobacterial clade. This model represents a small
clade of anthranilate synthases from alpha
proteobacteria and Nostoc (a cyanobacterium). This
enzyme is the first step in the pathway for the
biosynthesis of tryprophan from chorismate [Amino acid
biosynthesis, Aromatic amino acid family].
Length = 717
Score = 189 bits (482), Expect = 5e-52
Identities = 120/394 (30%), Positives = 175/394 (44%), Gaps = 39/394 (9%)
Query: 197 GWVGYFSYDTVRYVEKKKLPFSKAPHDDRSLADIHLGLYNDVLVFDHVEKKVYVIHWVRL 256
G G F YD E + + P D R D+ L L ++++V D ++ + +
Sbjct: 146 GLYGAFGYDLAFQFEPIRQRLER-PDDQR---DLVLYLPDELVVVDPYAGLARLVAYDFI 201
Query: 257 DQHSSVQKAYAEGLEHLEKLVARKVITRSIDLHTHHFGPPLKKSNMTSEAYKNAVLEAKE 316
S + G R R PP + Y V AK
Sbjct: 202 TAAGSTEGLECGG---------RDHPYRPDTN-----APP--GCDHAPGEYARLVESAKA 245
Query: 317 HIQAGDIFQIVLSQRFERRTFADPFEVYRALRVVNPSPYMTYLQ-ARGCILVASSPEILT 375
+ GD+F++V Q F P V+R L+ +NPSPY ++ RG LV +SPE+
Sbjct: 246 AFRRGDLFEVVPGQTFAEPCEDAPSSVFRRLKAINPSPYEFFVNLGRGEYLVGASPEMFV 305
Query: 376 RVKKNKIVNRPLAGTVRRGRTTEEDEMLETQLLKDAKQCAEHVMLVDLGRNDVGKVARSG 435
RV ++ P++GT+ RG D +LL AK AE M D+ RND +V G
Sbjct: 306 RVAGRRVETCPISGTIARGADAIGDAAQILRLLNSAKDEAELTMCTDVDRNDKSRVCEPG 365
Query: 436 SVKVEKLMNVERYSHVMHISSTITGELQDRLSCWDALRAALPVGTVSGAPKVKAMELIDE 495
SVKV +E YS ++H + G L+ + DA + TV+GAPK AM+ I++
Sbjct: 366 SVKVIGRRQIELYSRLIHTVDHVEGRLRSGMDALDAFLSHSWAVTVTGAPKRWAMQFIED 425
Query: 496 LEVNRRGPYSGGFGGVSFTGDMDIALALRTMVFQTGTRYDTMYSYKDARKRREWVAYLQA 555
E + R Y G FG + F G M+ L LRT+ G +A ++A
Sbjct: 426 TEQSPRRWYGGAFGRLGFNGGMNTGLTLRTIRMADG------------------IAEVRA 467
Query: 556 GAGIVADSDPDDEHRECQNKAAGLARAIDLAESA 589
GA ++ DSDPD E E + KAA AI A++A
Sbjct: 468 GATLLYDSDPDAEEAETRLKAAAFRDAIRRAKAA 501
>gnl|CDD|102361 PRK06404, PRK06404, anthranilate synthase component I; Reviewed.
Length = 351
Score = 179 bits (456), Expect = 3e-51
Identities = 101/278 (36%), Positives = 146/278 (52%), Gaps = 29/278 (10%)
Query: 299 KSNMTSEAYKNAVLEAKEHIQAGDIFQIVLSQRFERRTFADPFEVYRALRVVNPSPYMTY 358
K N + + E E I+AG++ Q+V+S+ FE D E + S Y+ Y
Sbjct: 99 KGNYNDISLSLKIKELIELIRAGEVLQVVISREFEANI--DFKEKLSEFINNDRSRYVFY 156
Query: 359 LQARGCILVASSPEILTRVKKNKIVNRPLAGTVRRGRTTEEDEMLETQLLKDAKQCAEHV 418
+ +V SSPE + V N I P+AGT +D++L +LL K EH
Sbjct: 157 YRFGKYRVVGSSPENVFTVNGNIINVDPIAGTY-------DDKILSNELLNSEKDKLEHR 209
Query: 419 MLVDLGRNDVGKVARSGSVKVEKLMNVERYSHVMHISSTITGELQDRLSCWDALRAALPV 478
ML+DL RND+ K A G++ V+K+M +E +S V H+ S +T + + S D L + P
Sbjct: 210 MLLDLARNDLSKFADIGTLNVDKVMKIEEFSSVKHLVSQVTAKFSNA-SYRDILASMFPA 268
Query: 479 GTVSGAPKVKAMELIDELEVNRRGPYSGGFGGVSFTGDMDIALALRTMVFQTGTRYDTMY 538
GTVSG+PK +A+E+I++ E RGPY G G +S G D+AL +R T Y
Sbjct: 269 GTVSGSPKERAIEIINKYEETPRGPYGGAIGIIS-KGYTDMALVIR-----------TAY 316
Query: 539 SYKDARKRREWVAYLQAGAGIVADSDPDDEHRECQNKA 576
S+ + + R AGAGIV DSDP+DE E +KA
Sbjct: 317 SHGNGFRVR-------AGAGIVKDSDPEDEVNEIYSKA 347
>gnl|CDD|233588 TIGR01823, PabB-fungal, aminodeoxychorismate synthase, fungal
clade. This model represents the fungal clade of a
para-aminobenzoate synthesis enzyme,
aminodeoxychorismate synthase, which acts on chorismate
in a pathway that yields PABA, a precursor of folate.
Length = 742
Score = 167 bits (423), Expect = 6e-44
Identities = 123/527 (23%), Positives = 218/527 (41%), Gaps = 84/527 (15%)
Query: 90 YRCIFSDHLTPVVAYRCLVQEDDREAPSFLFESVEPGVRVSNVGRYSVVGAQPVMEVIVK 149
Y F P + + + P F+ S GRYS++ +
Sbjct: 252 YVKQFEVSEDPKLTFEICNIIRE---PKFVMSSSV------ITGRYSIIALPNSASQVFT 302
Query: 150 DNN---VTIMDHEKGSLVEE----VVDDPMEIPRKISEDW-------KPQIID----ELP 191
T + + + + + ++ S+ W + + ID E+P
Sbjct: 303 HYGAMLKTTVHYWQDTEISYTRLKKCLSGVDSDLDKSQFWITLGKFMENKKIDNPHREIP 362
Query: 192 EAFCGGWVGYFSYDTVRYVEKKKLPFSKAPHDDRSL-ADIHLGLYNDVLVFDHVEKKVYV 250
F GG VG Y+ + + + + D+ SL D L N +V DH + K+YV
Sbjct: 363 --FIGGLVGILGYEIGSDLSTQYIACGRCNDDENSLVPDAKLVFINRSIVIDHKQGKLYV 420
Query: 251 IHWVRLDQHSSVQKAYAEGLEHLEKLVARKVITRSIDLHTHH--FGPPLKKSNMT---SE 305
S+ + LE +L V ++I + P +T E
Sbjct: 421 Q---------SLDNTFPVALEWSGELRDSFVRKKNIKQSLSWPFYLPEEIDFVITFPDKE 471
Query: 306 AYKNAVLEAKEHIQAGDIFQIVLSQRFERRTF-------ADPFEVYRALRVVNPSPYMTY 358
Y A ++++ AGD +++ L+ + T + +E+Y+ LR NP+P+ +
Sbjct: 472 DYAKAFKACQDYLHAGDSYEMCLTTQ----TKVVPPAVISPDWEIYQRLRQRNPAPFSGF 527
Query: 359 LQARGCILVASSPEILTRVKKN-KIVNRPLAGTVRRGRTTEEDEMLE--TQLLKDAKQCA 415
+ + I +++SPE V + RP+ GTV++G LE ++LK K+
Sbjct: 528 FRLKHIIFLSTSPEKFLEVGMDTHAKLRPIKGTVKKG----PQMNLEKARRILKTPKEMG 583
Query: 416 EHVMLVDLGRNDVGKVARSGSVKVEKLMNVERYSHVMHISSTITGELQ------DRLSCW 469
E++M++DL RND+ ++ V VE+LM+VE ++ V + S + R S
Sbjct: 584 ENLMILDLIRNDLYELVPKNDVHVEELMSVEEHATVYQLVSVVKAHGLTSASKKTRYSGI 643
Query: 470 DALRAALPVGTVSGAPKVKAMELIDELEVNRRGPYSGGFGGVSFTGDMDIALALRTMVFQ 529
D L+ +LP G+++GAPK ++++L+ ++E RG YSG G G+ D ++ +R
Sbjct: 644 DVLKHSLPPGSMTGAPKKRSVQLLQDVEGGARGIYSGVTGYWDVNGNGDFSVNIR----- 698
Query: 530 TGTRYDTMYSYKDARKRREWVAYLQAGAGIVADSDPDDEHRECQNKA 576
+SY R + AG + S P+ E E NK
Sbjct: 699 ------CAFSYNGGTSWR-----IGAGGAVTVLSTPEGELEEMYNKL 734
>gnl|CDD|235634 PRK05877, PRK05877, aminodeoxychorismate synthase component I;
Provisional.
Length = 405
Score = 159 bits (404), Expect = 3e-43
Identities = 93/287 (32%), Positives = 140/287 (48%), Gaps = 28/287 (9%)
Query: 305 EAYKNAVLEAKEHIQAGDIFQIVLSQRFERRTFADPFEVYRALRVVNPSP-YMTYLQARG 363
A+++ VL E I AG+++Q + +F P + + V +P YL
Sbjct: 142 AAHRDGVLACLEAIAAGEVYQACVCTQFTGTVTGSPLDFFADG-VARTAPARAAYLAGDW 200
Query: 364 CILVASSPEILTRVKKNKIVNRPLAGTVRRGRTTEEDEMLETQLLKDAKQCAEHVMLVDL 423
+ + SPE+ R + + + + P+ GT+ L AK AE++M+VDL
Sbjct: 201 GAVASLSPELFLRRRGSVVTSSPIKGTLPLDADPSA-------LRASAKDVAENIMIVDL 253
Query: 424 GRNDVGKVARSGSVKVEKLMNVERYSHVMHISSTITGELQDRLSCWDALRAALPVGTVSG 483
RND+G+VAR+G+V V +L+ V V H+ ST++ ++ D L D L A P +V+G
Sbjct: 254 VRNDLGRVARTGTVTVPELLVVRPAPGVWHLVSTVSAQVPDELPMSDLLDATFPPASVTG 313
Query: 484 APKVKAMELIDELEVNRRGPYSGGFGGVSFTGDMDIALALRTMVFQT-GTRYDTMYSYKD 542
PK++A ELI + E RRG Y G G S ++ +A+RT+ F G
Sbjct: 314 TPKLRARELISQWEPVRRGIYCGTVGLASPVAGCELNVAIRTVEFDADGN---------- 363
Query: 543 ARKRREWVAYLQAGAGIVADSDPDDEHRECQNKAAGLARAIDLAESA 589
A L G GI ADSDPD E +EC +KAA + A SA
Sbjct: 364 --------AVLGVGGGITADSDPDAEWQECLHKAAPIVGLPAAATSA 402
>gnl|CDD|233014 TIGR00543, isochor_syn, isochorismate synthases. This enzyme
interconverts chorismate and isochorismate. In E. coli,
different loci encode isochorismate synthases for the
pathways of menaquinone biosynthesis and enterobactin
biosynthesis (via salicilate) and fail to complement
each other. Among isochorismate synthases, the
N-terminal domain is poorly conserved [Biosynthesis of
cofactors, prosthetic groups, and carriers, Menaquinone
and ubiquinone].
Length = 351
Score = 146 bits (370), Expect = 4e-39
Identities = 87/310 (28%), Positives = 144/310 (46%), Gaps = 25/310 (8%)
Query: 278 ARKVITRSIDLHTHHFGPPLKKSNMTSEAYKNAVLEAKEHIQAGDIFQIVLSQRFERRTF 337
A R + + A++ AV EA E+I+ G + ++VL+ R F
Sbjct: 63 AVSSGIRPLRALPEQMTTLTTGEDPDKAAWRTAVEEALENIRQGPLDKVVLA-RALTLKF 121
Query: 338 ADPFEVY---RALRVVNPSPYMTYLQ-ARGCILVASSPEILTRVKKNKIVNRPLAGTVRR 393
AD + LR P+ Y+ L+ +G + + ++PE L +K +++ LAGT R
Sbjct: 122 ADDIDPIAVLANLRQQYPNAYIFLLEPPQGGVFLGATPERLLSREKGELLTEALAGTAPR 181
Query: 394 GRTTEEDEMLETQLLKDAKQCAEHVMLVDLGRNDVGKVARSGSVKVEKLMNVERYSHVMH 453
EED L LLKD K EH ++V+ R + + S+ V + + + ++V H
Sbjct: 182 SADPEEDRKLGELLLKDDKNLREHRLVVEYIRRRLQPICT--SLDVSETPELLKLANVQH 239
Query: 454 ISSTITGELQDRLSCWDALRAALPVGTVSGAPKVKAMELIDELEVNRRGPYSGGFGGVSF 513
+ + I+ L+D S D L+ P V G P+ +A++ I E E RG Y+ G +
Sbjct: 240 LYTPISARLKDGDSLLDLLKQLHPTPAVGGLPREEALDFIREHEPFDRGLYAAPLGWLDG 299
Query: 514 TGDMDIALALRTMVFQTGTRYDTMYSYKDARKRREWVAYLQAGAGIVADSDPDDEHRECQ 573
G+ + A+ +R+ + +D + L AGAGIVADSDP+ E E +
Sbjct: 300 EGNGEFAVGIRSALV------------EDGQ------VRLYAGAGIVADSDPESEWEETE 341
Query: 574 NKAAGLARAI 583
K + RA+
Sbjct: 342 LKLQTMLRAL 351
>gnl|CDD|132533 TIGR03494, salicyl_syn, salicylate synthase. Members of this
protein family are salicylate synthases, bifunctional
enzymes that make salicylate, in two steps, from
chorismate. Members are homologous to anthranilate
synthase component I from Trp biosynthesis. Members
typically are found in gene regions associated with
siderophore or other secondary metabolite biosynthesis.
Length = 425
Score = 146 bits (369), Expect = 3e-38
Identities = 103/321 (32%), Positives = 140/321 (43%), Gaps = 34/321 (10%)
Query: 267 AEGLEHLEKLVARKVITRSIDLHTHHFGPPLKKS-----NMTSEAYKNAVLEAKEHIQAG 321
A L +LVA T I PL ++ AY+ V A I AG
Sbjct: 127 AGERRRLCRLVAEGTTTTQI--------APLPQARAVDTATDPSAYRARVARAVAEIAAG 178
Query: 322 DIFQIVLSQRFERRTFADPFEVYRALRVVNPSPYMTYLQARGCI-LVASSPEILTRVKKN 380
+++LS+ D R N +P ++L G I + SPE++ V+ +
Sbjct: 179 RYHKVILSRAVPLPFAIDFPATLLLGRRHN-TPVRSFLLRLGGIEALGFSPELVMSVRAD 237
Query: 381 -KIVNRPLAGTVRRGRTTEEDEMLETQLLKDAKQCAEHVMLVDLGRNDVGKVARSGSVKV 439
K+V PLAGT G E D+ L +LL D+K+ EH + V ++ +V G+V V
Sbjct: 238 GKVVTEPLAGTRALGGGPEHDKQLRDELLSDSKEIVEHAISVKEAIEELEQVCEPGTVVV 297
Query: 440 EKLMNVERYSHVMHISSTITGELQDRLSCWDALRAALPVGTVSGAPKVKAMELIDELEVN 499
E M V V H+ ST++G+L WDA P T SG PK A+E I LE
Sbjct: 298 EDFMTVRERGSVQHLGSTVSGQLAPSKDAWDAFEVLFPAITASGIPKAAALEAIMRLEKT 357
Query: 500 RRGPYSGGFGGVSFTGDMDIALALRTMVFQTGTRYDTMYSYKDARKRREWVAYLQAGAGI 559
RG YSG + G +D AL LR +Q R W LQAGAGI
Sbjct: 358 PRGLYSGAVLLLDADGTLDAALVLRA-AYQ--------------DSGRTW---LQAGAGI 399
Query: 560 VADSDPDDEHRECQNKAAGLA 580
+A S P+ E E K A +A
Sbjct: 400 IAQSTPERELTETCEKLASIA 420
>gnl|CDD|235932 PRK07093, PRK07093, para-aminobenzoate synthase component I;
Validated.
Length = 323
Score = 136 bits (346), Expect = 5e-36
Identities = 87/292 (29%), Positives = 139/292 (47%), Gaps = 53/292 (18%)
Query: 297 LKKSNMTSEAYKNAVLEAKEHIQAGDIFQIVLSQRFERRTFADPFEVYRALRVVNPSPYM 356
L+K ++ E Y+ +E IQAG+ + + L+ T E+++A + + Y
Sbjct: 66 LQKEPISFEEYQQGFELVQEEIQAGNSYLLNLTYPTPIETNLSLEEIFQASK----AKYK 121
Query: 357 TYLQARGCILVASSPEILTRVKKNKIVNRPLAGTVRRGRTTEEDEML---ETQLLKDAKQ 413
+ + V SPE R++ NKI P+ GT+ D L E +LL D K+
Sbjct: 122 LLFKDQ---FVCFSPEPFVRIEDNKISTYPMKGTI--------DASLPNAEEKLLNDEKE 170
Query: 414 CAEHVMLVDLGRNDVGKVARSGSVKVEKLMNVER-YSH---VMHISSTITGELQDRLSCW 469
AEH +VDL RND+ VA++ V+V + +++ ++ ++ SS I+G L + W
Sbjct: 171 FAEHATIVDLLRNDLSMVAKN--VRVTRFRYIDKIKTNKGEILQTSSEISGTLPEN---W 225
Query: 470 -----DALRAALPVGTVSGAPKVKAMELIDELEVNRRGPYSGGFGGVSFTG-DMDIALAL 523
D L LP G+++GAPK K +E+I++ E RG Y+G FG F G +D A+ +
Sbjct: 226 QENIGDILAKLLPAGSITGAPKEKTVEIIEQAEGYERGFYTGVFG--YFDGESLDSAVMI 283
Query: 524 RTMVFQTGTRYDTMYSYKDARKRREWVAYLQAGAGIVADSDPDDEHRECQNK 575
R + Q Y ++G GI DSD DE+ E K
Sbjct: 284 R-FIEQENDGL-----------------YFKSGGGITIDSDLKDEYNELIQK 317
>gnl|CDD|218224 pfam04715, Anth_synt_I_N, Anthranilate synthase component I, N
terminal region. Anthranilate synthase (EC:4.1.3.27)
catalyzes the first step in the biosynthesis of
tryptophan. Component I catalyzes the formation of
anthranilate using ammonia and chorismate. The catalytic
site lies in the adjacent region, described in the
chorismate binding enzyme family (pfam00425). This
region is involved in feedback inhibition by tryptophan.
This family also contains a region of Para-aminobenzoate
synthase component I (EC 4.1.3.-).
Length = 141
Score = 127 bits (321), Expect = 1e-34
Identities = 59/156 (37%), Positives = 78/156 (50%), Gaps = 16/156 (10%)
Query: 96 DHLTPVVAYRCLVQEDDREAPSFLFESVEPGVRVSNVGRYSVVGAQPVMEVIVKDNNVTI 155
D LTPV + L E +FL ES E G GRYS +G P+ + K +
Sbjct: 1 DSLTPVELFLRLRGEGH----AFLLESAEGGE-----GRYSFIGLDPLATIKAKGGETEL 51
Query: 156 MDHEKGSLVEEVVDDPMEIPRKISEDWK-PQIIDELPEAFCGGWVGYFSYDTVRYVEKKK 214
D E L+ DP + R++ ++ P+ D F GG VGYF YD VRY+E K
Sbjct: 52 SDDEGERLIAG---DPFDALRELLARFRIPEAPDPGLPPFSGGLVGYFGYDLVRYLE-PK 107
Query: 215 LPFSKAPHDDRSLADIHLGLYNDVLVFDHVEKKVYV 250
LP AP D L D GLY+ +LVFDH E+K+ +
Sbjct: 108 LPD--APDDLNELPDAVFGLYDTLLVFDHQEQKLTL 141
>gnl|CDD|224091 COG1169, MenF, Isochorismate synthase [Coenzyme metabolism /
Secondary metabolites biosynthesis, transport, and
catabolism].
Length = 423
Score = 132 bits (334), Expect = 1e-33
Identities = 82/287 (28%), Positives = 129/287 (44%), Gaps = 32/287 (11%)
Query: 305 EAYKNAVLEAKEHIQAGDIFQIVLSQRFERRTFADPFEVY---RALRVVNPSPY--MTYL 359
+ V +A I G++ ++VL++ + TF P + LR NP+ Y + L
Sbjct: 157 ADWLQLVEQALALIAQGELDKVVLARALDL-TFDAPIDAAALLARLRAQNPNCYHFLVAL 215
Query: 360 QARGCILVASSPEILTRVKKNKIVNRPLAGTVRRGRTTEEDEMLETQLLKDAKQCAEHVM 419
G + +SPE L R + ++V LAG+ RG ED L LL DAK EH +
Sbjct: 216 GDGGA-FLGASPERLVRRRGGQLVTEALAGSAPRGADPVEDAQLGNWLLADAKNLHEHQL 274
Query: 420 LVDLGRNDVGKVARSGSVKVEKLMNVE--RYSHVMHISSTITGELQDRLSCWDALRAAL- 476
+VD D+ + +++ + + V H+ + I+ +L+D L AL
Sbjct: 275 VVD----DIRQRLEPLCEELDVPSPPQLIKLRKVQHLRTPISAQLKDPSVTALDLAKALH 330
Query: 477 PVGTVSGAPKVKAMELIDELEVNRRGPYSGGFGGVSFTGDMDIALALRTMVFQTGTRYDT 536
P V G P+ A++ I E E RG Y+G G G+ + +A+R+ +
Sbjct: 331 PTPAVGGLPREAALQFIREHEPFDRGWYAGPVGWCDSEGNGEFVVAIRSALI-------- 382
Query: 537 MYSYKDARKRREWVAYLQAGAGIVADSDPDDEHRECQNKAAGLARAI 583
+ L AGAGIVA SDP++E RE K A + RA+
Sbjct: 383 ----SGNQ------VRLFAGAGIVAGSDPEEEWRETDLKLATMLRAL 419
>gnl|CDD|169151 PRK07912, PRK07912, salicylate synthase MbtI; Reviewed.
Length = 449
Score = 129 bits (327), Expect = 2e-32
Identities = 91/281 (32%), Positives = 129/281 (45%), Gaps = 31/281 (11%)
Query: 307 YKNAVLEAKEHIQAGDIFQIVLSQRFERRTFADPFEV-----YRALRVVNPSPYMTYLQA 361
Y++ V A I AG +++LS+ E PF V YR R N +P ++L
Sbjct: 186 YRDRVAVAVAEIAAGRYHKVILSRCVEV-----PFAVDFPATYRLGRRHN-TPVRSFLLR 239
Query: 362 RGCILVAS-SPEILTRVKKNKIV-NRPLAGTVRRGRTTEEDEMLETQLLKDAKQCAEHVM 419
G I SPE++T V+ + +V PLAGT GR D + L ++K+ EH +
Sbjct: 240 LGGIRALGYSPELVTAVRADGVVITEPLAGTRAFGRGAAIDRLARDDLESNSKEIVEHAI 299
Query: 420 LVDLGRNDVGKVARSGSVKVEKLMNVERYSHVMHISSTITGELQDRLSCWDALRAALPVG 479
V ++ ++A GS V M V V H+ ST+ G L DAL A P
Sbjct: 300 SVRSSLAEITEIAEPGSAAVIDFMTVRERGSVQHLGSTVRGRLDASSDRMDALEALFPAV 359
Query: 480 TVSGAPKVKAMELIDELEVNRRGPYSGGFGGVSFTGDMDIALALRTMVFQTGTRYDTMYS 539
T SG PK ++ I L+ RG YSG +S G +D AL LR +Q G R
Sbjct: 360 TASGIPKAAGVDAIFRLDEAPRGLYSGAVVMLSADGGLDAALTLRA-AYQVGGR------ 412
Query: 540 YKDARKRREWVAYLQAGAGIVADSDPDDEHRECQNKAAGLA 580
+L+AGAGI+ +S+P+ E E K + LA
Sbjct: 413 -----------TWLRAGAGIIEESEPEREFEETCEKLSTLA 442
>gnl|CDD|102546 PRK06772, PRK06772, salicylate synthase Irp9; Reviewed.
Length = 434
Score = 124 bits (312), Expect = 1e-30
Identities = 91/291 (31%), Positives = 134/291 (46%), Gaps = 19/291 (6%)
Query: 290 THHFGPPLKKSNMTSEAYKNAVLEAKEHIQAGDIFQIVLSQRFERRTFADPFEVYRALRV 349
T P + + EAYK V A I+ G+ ++++S+ + D R
Sbjct: 158 TTQNAPLAVDTALNGEAYKQQVARAVAEIRRGEYVKVIVSRAIPLPSRIDMPATLLYGRQ 217
Query: 350 VNPSPYMTYLQARGCILVASSPEILTRVKKNKIVNRPLAGTVRRGRTTEEDEMLETQLLK 409
N + G + SPE++ V NK+V PLAGT R E ++ E +LL
Sbjct: 218 ANTPVRSFMFRQEGREALGFSPELVMSVTGNKVVTEPLAGTRDRMGNPEHNKAKEAELLH 277
Query: 410 DAKQCAEHVMLVDLGRNDVGKVARSGSVKVEKLMNVERYSHVMHISSTITGELQDRLSCW 469
D+K+ EH++ V ++ V + GSV VE LM+V + V H+ S ++G+L + W
Sbjct: 278 DSKEVLEHILSVKEAIAELEAVCQPGSVVVEDLMSVRQRGSVQHLGSGVSGQLAENKDAW 337
Query: 470 DALRAALPVGTVSGAPKVKAMELIDELEVNRRGPYSGGFGGVSFTGDMDIALALRTMVFQ 529
DA P T SG PK A+ I ++E R YSG + T D AL LR+ VFQ
Sbjct: 338 DAFTVLFPSITASGIPKNAALNAIMQIEKTPRELYSGAILLLDDT-RFDAALVLRS-VFQ 395
Query: 530 TGTRYDTMYSYKDARKRREWVAYLQAGAGIVADSDPDDEHRECQNKAAGLA 580
R ++QAGAGI+A S P+ E E + K A +A
Sbjct: 396 DSQR-----------------CWIQAGAGIIAQSTPERELTETREKLASIA 429
>gnl|CDD|235920 PRK07054, PRK07054, salicylate biosynthesis isochorismate synthase;
Validated.
Length = 475
Score = 109 bits (273), Expect = 2e-25
Identities = 79/296 (26%), Positives = 132/296 (44%), Gaps = 24/296 (8%)
Query: 297 LKKSNMTSEAYKNAVLEAKEHIQAGDIFQIVLSQRFERRTFADPFE---VYRALRVVNPS 353
L+ S + + +++ V A + I+ G ++VL+ R + +A P + R LR+ +P
Sbjct: 187 LRASALQAREWQHEVRRAVDAIRGGAFGKVVLA-RDVLQQYARPVAIGPLLRRLRLRDPH 245
Query: 354 PYMTYLQARGCILVASSPEILTRVKKNKIVNRPLAGTVRRGRTTEEDEMLETQLLKDAKQ 413
++ + + ++PE L RV + LAGT+ RG ED L L+ AK
Sbjct: 246 AHLFAFRRGNACFLGATPERLVRVAAGDLHTHALAGTIARGADPAEDARLGAALMASAKD 305
Query: 414 CAEHVMLVDLGRNDVGKVARSGSVKVEKLMNVERYSHVMHISSTITGELQDRLSCWDALR 473
EH ++VD R S ++ + ++ R + H+S+ I L + +
Sbjct: 306 RLEHALVVDAIRA--ALAPLSRALDIPDQPSLHRLPRLQHLSTPIRATLAPDATLLQVVA 363
Query: 474 AALPVGTVSGAPKVKAMELIDELEVNRRGPYSGGFGGVSFTGDMDIALALRTMVFQTGTR 533
A P V G P+ A++ I E RG Y+ G + G+ D A+ALR+ + G
Sbjct: 364 ALHPTPAVGGHPRAAALDYIRAHEGFDRGWYAAPIGWLDAHGNGDFAVALRSALITGGA- 422
Query: 534 YDTMYSYKDARKRREWVAYLQAGAGIVADSDPDDEHRECQNKAAGLARAIDLAESA 589
L AG GIVADS+P E+RE K +G+ A+ ++A
Sbjct: 423 -----------------CRLFAGCGIVADSEPASEYRETCLKLSGMREALRARDAA 461
>gnl|CDD|235886 PRK06923, PRK06923, isochorismate synthase DhbC; Validated.
Length = 399
Score = 84.4 bits (209), Expect = 2e-17
Identities = 76/296 (25%), Positives = 130/296 (43%), Gaps = 33/296 (11%)
Query: 305 EAYKNAVLEAKEHIQAGDIFQIVLSQRFERRTFADPFEVYRALRVV---NPSPYMTYL-- 359
E Y N V + IQ GD+ +IVLS+ + ++ + + LR + N Y +
Sbjct: 122 EVYMNGVKQGIAKIQDGDLKKIVLSRSLDV-KSSEKIDKQKLLRELAEHNKHGYTFAVNL 180
Query: 360 ----QARGCILVASSPEILTRVKKNKIVNRPLAGTVRRGRTTEEDEMLETQLLKDAKQCA 415
L+ +SPE+L ++++ PLAG+ R ED+ +LL K
Sbjct: 181 PKDENENSKTLIGASPELLVSRHGMQVISNPLAGSRPRSDDPVEDKRRAEELLSSPKDLH 240
Query: 416 EHVMLVDLGRNDVGKVARSGSV-KVEKLMNVERYSHVMHISSTITGELQDRLSCWDALRA 474
EH ++V+ + + V + +++ E + H+S+ + GEL+D + L
Sbjct: 241 EHAVVVEAVAAALRPYCHTLHVPEKPSVIHTEA---MWHLSTEVKGELKDPNTSSLELAI 297
Query: 475 AL-PVGTVSGAPKVKAMELIDELEVNRRGPYSGGFGGVSFTGDMDIALALRTMVFQTGTR 533
AL P V G P +A E I ++E R ++G G GD + + +R +
Sbjct: 298 ALHPTPAVCGTPTEEAREAIQQIEPFDREFFTGMLGWSDLNGDGEWIVTIRCAEVE---- 353
Query: 534 YDTMYSYKDARKRREWVAYLQAGAGIVADSDPDDEHRECQNKAAGLARAIDLAESA 589
+ R L AGAG+VA+S P+DE E K + +A+ L +S+
Sbjct: 354 --------ENTLR------LYAGAGVVAESKPEDELAETSAKFQTMLKAMGLNDSS 395
>gnl|CDD|178383 PLN02786, PLN02786, isochorismate synthase.
Length = 533
Score = 84.8 bits (210), Expect = 2e-17
Identities = 79/304 (25%), Positives = 130/304 (42%), Gaps = 33/304 (10%)
Query: 297 LKKSNMTSEAYKN-AVLEAKEHIQAG--DIFQIVL--SQRFERRTFADPFEVYRALRVVN 351
L K+++ S+ + AV +A + I+ + ++VL S R T DP L+V
Sbjct: 251 LSKNHVPSKGAWHLAVNKALQIIKRKSSPLKKVVLARSSRIITDTDIDPIAWLACLQVEG 310
Query: 352 PSPYMTYLQ-ARGCILVASSPEILTRVKKNKIVNRPLAGTVRRGRTTEEDEMLETQLLKD 410
+ Y LQ + ++PE L K + + LA T RG ++ D +E LL
Sbjct: 311 QNAYQFCLQPPDAPAFIGNTPEQLFHRKGLGVCSEALAATRPRGGSSARDLQIELDLLTS 370
Query: 411 AKQCAEHVMLVDLGRNDVGKVARSGSVKVEKLMNVERYSHVMHISSTITGELQDRLSCWD 470
K E ++ + R + + V VE + + + V H+ + + G L+ +D
Sbjct: 371 PKDDLEFSIVRENIREKLEAICDR--VVVEPHKAIRKLARVQHLYAQLAGRLRSEDDEFD 428
Query: 471 ALRAALPVGTVSGAPKVKAMELIDELEVNRRGPYSGG---FGGVSFTGDMDIALALRTMV 527
L A P V G P +A LI E E RG Y+G FGG G+ + A+ +R+ +
Sbjct: 429 ILAALHPTPAVCGHPTEEARLLIAETESFDRGMYAGPVGWFGG----GESEFAVGIRSAL 484
Query: 528 FQTGTRYDTMYSYKDARKRREWVAYLQAGAGIVADSDPDDEHRECQNKAAGLARAIDLAE 587
+ G A + AG GIV S+P E E + K + ++++ E
Sbjct: 485 VEKGLG-----------------ALIYAGTGIVEGSNPSSEWNELELKISQFTKSLEH-E 526
Query: 588 SAFI 591
SA
Sbjct: 527 SALS 530
>gnl|CDD|184974 PRK15012, PRK15012, menaquinone-specific isochorismate synthase;
Provisional.
Length = 431
Score = 64.1 bits (156), Expect = 8e-11
Identities = 92/343 (26%), Positives = 141/343 (41%), Gaps = 58/343 (16%)
Query: 254 VRLDQHS--SVQKAYAEGLEHLEKLVARKVITRSIDLHTHHFGPPLKKSNMTSE------ 305
+RL S S+Q + E L LV+ K + P L T E
Sbjct: 123 LRLTLFSESSLQHDAIQAKEFLATLVSIKPL------------PGLH-LTTTREQHWPDK 169
Query: 306 -AYKNAVLEAKEHIQAGDIFQIVLSQRFERRTFADPFE---VYRALRVVNPSPYMTYL-- 359
+ + A + I G++ ++VL+ R FA P + A R +N + Y Y+
Sbjct: 170 TGWTQLIELATKTIAEGELDKVVLA-RATDLHFASPVNAAAMMAASRRLNLNCYHFYMAF 228
Query: 360 QARGCILVASSPEILTRVKKNKIVNRPLAGTVRRGRTTEEDEMLETQLLKDAKQCAEHVM 419
A L SSPE L R + + LAGTV ++ + L L+ D K E+++
Sbjct: 229 DAENAFL-GSSPERLWRRRDKALRTEALAGTVANHPDDKQAQQLGEWLMADDKNQRENML 287
Query: 420 LVDLGRNDVGKVARSGSVKVEKL-MNVERYSHVMHISSTITGELQ--DRLSCWDALRAAL 476
+V+ D+ + ++ + ++ L V R V H+ I L D + C L+
Sbjct: 288 VVE----DICQRLQADTQTLDVLPPQVLRLRKVQHLRRCIWTSLNKADDVICLHQLQ--- 340
Query: 477 PVGTVSGAPKVKAMELIDELEVNRRGPYSGGFGGVSFTGDMDIALALRTMVFQTGTRYDT 536
P V+G P+ A + I E R Y+G G +S + ++LR+
Sbjct: 341 PTAAVAGLPRDLARQFIARHEPFTREWYAGSAGYLS-LQQSEFCVSLRSA---------- 389
Query: 537 MYSYKDARKRREWVAYLQAGAGIVADSDPDDEHRECQNKAAGL 579
K V L AGAGIV SDP+ E +E NKAAGL
Sbjct: 390 --------KVSGNVVRLYAGAGIVRGSDPEQEWQEIDNKAAGL 424
>gnl|CDD|184977 PRK15016, PRK15016, isochorismate synthase EntC; Provisional.
Length = 391
Score = 48.7 bits (116), Expect = 5e-06
Identities = 68/260 (26%), Positives = 111/260 (42%), Gaps = 37/260 (14%)
Query: 322 DIFQIVLSQRFERRTFADPFEVYRALRVV--NPSPYMTYLQ-ARGCILVASSPEILTRVK 378
+ ++VLS+ + T A R++ NP Y ++ A G +L+ +SPE+L R
Sbjct: 144 QVDKVVLSRLIDITTDAAIDSGALLERLIAQNPVSYNFHVPLADGGVLLGASPELLLRKD 203
Query: 379 KNKIVNRPLAGTVRRGRTTEEDEMLE----TQLLKDAKQCAEHVMLVDLGRNDVGKVARS 434
+ + PLAG+ RR + DE+L+ +LL K EH ++ + + + RS
Sbjct: 204 GERFSSLPLAGSARR----QPDEVLDREAGNRLLASEKDRHEHELVTQAMKEVLRE--RS 257
Query: 435 GSVKVEKLMNVERYSHVMHISSTITGELQDRLSCWDALRAAL---PVGTVSGAPKVKAME 491
+ V + + H+++ G+ + +AL A P +SG P A +
Sbjct: 258 SELHVPSSPQLITTPTLWHLATPFEGKANAQE---NALTLACLLHPTPALSGFPHQAAKQ 314
Query: 492 LIDELEVNRRGPYSGGFGGVSFTGDMDIALALRTMVFQTGTRYDTMYSYKDARKRREWVA 551
+I ELE R + G G G+ + + +R A+ R V
Sbjct: 315 VIAELEPFDRELFGGIVGWCDSEGNGEWVVTIRC-----------------AKLRENQVR 357
Query: 552 YLQAGAGIVADSDPDDEHRE 571
L AGAGIV S P E RE
Sbjct: 358 -LFAGAGIVPASSPLGEWRE 376
>gnl|CDD|214631 smart00350, MCM, minichromosome maintenance proteins.
Length = 509
Score = 38.4 bits (90), Expect = 0.011
Identities = 41/205 (20%), Positives = 66/205 (32%), Gaps = 45/205 (21%)
Query: 18 RLVPPSHRLSLVPVT--VTRINLPKSAATVSTVKCCVSRQTTTT---TSTATAPATKLAS 72
R + H LV ++ VTR + + ++ C T + T P
Sbjct: 6 RELRADHLGKLVRISGIVTRTSGVRPKLKRASFTCEKCGATLGPEIQSGRETEPTVCPPR 65
Query: 73 DASGFSEASKRANLVPLYRCIFSDHLTPVVAYRCLVQEDDREAPS-------------FL 119
+ S + R F D + +QE E P L
Sbjct: 66 ECQ-----SPTPFSLNHERSTFIDF------QKIKLQESPEEVPVGQLPRSVDVILDGDL 114
Query: 120 FESVEPGVRVSNVGRYSVVGAQ---------PVMEVIVKDNNVTIMDHEKGSLVEEVV-- 168
+ +PG RV G Y V PV ++ N+V +D+++
Sbjct: 115 VDKAKPGDRVEVTGIYRNVPYGFKLNTVKGLPVFATYIEANHVRKLDYKRSFEDSSFSVQ 174
Query: 169 ---DDPMEIPRKISEDWKPQIIDEL 190
D+ E RK+S+D P I + L
Sbjct: 175 SLSDEEEEEIRKLSKD--PDIYERL 197
>gnl|CDD|220957 pfam11054, Surface_antigen, Sporozoite TA4 surface antigen. This
family of proteins is a Eukaryotic family of surface
antigens. One of the better characterized members of the
family is the sporulated TA4 antigen. The TA4 gene
encodes a single polypeptide of 25 kDa which contains a
17 and a 8 kD polypeptide.
Length = 258
Score = 29.8 bits (67), Expect = 3.4
Identities = 14/41 (34%), Positives = 18/41 (43%), Gaps = 4/41 (9%)
Query: 39 PKSAATVSTVKCCVSRQTTTTTSTATAPATKLASDASGFSE 79
P S+AT C V T TTSTA +L D ++
Sbjct: 164 PSSSATAD---CRVVTCTQATTSTAPGGR-RLQGDGDSETK 200
>gnl|CDD|215123 PLN02192, PLN02192, 3-ketoacyl-CoA synthase.
Length = 511
Score = 29.9 bits (67), Expect = 4.0
Identities = 20/55 (36%), Positives = 29/55 (52%), Gaps = 10/55 (18%)
Query: 307 YKNAVLEAKEHIQAGD-IFQIVLSQRFERRTFADPFEVYRALRVVNPS----PYM 356
Y+ A EAK I+ GD +QI F+ + V++ALR VNP+ P+M
Sbjct: 446 YELAYSEAKGRIKKGDRTWQIAFGSGFKCNS-----AVWKALRTVNPAKEKNPWM 495
>gnl|CDD|187837 cd09706, Csf2_U, CRISPR/Cas system-associated RAMP superfamily
protein Csf2. CRISPR (Clustered Regularly Interspaced
Short Palindromic Repeats) and associated Cas proteins
comprise a system for heritable host defense by
prokaryotic cells against phage and other foreign DNA;
RAMP superfamily protein; Contains several motifs
similar to Cas7 family.
Length = 328
Score = 29.5 bits (66), Expect = 4.3
Identities = 17/43 (39%), Positives = 19/43 (44%), Gaps = 1/43 (2%)
Query: 471 ALRAALPVGTVSGAPKVKAMELIDELEVNRRGPYSGGFGGVSF 513
AL L G V+G P K M + E R PY G FGG
Sbjct: 88 ALYNVLQCGAVTGNPDGKDMT-LGEYRQARDDPYFGLFGGGPK 129
>gnl|CDD|234112 TIGR03115, cas_csf2, CRISPR type AFERR-associated protein Csf2.
Members of this family show up near CRISPR repeats in
Acidithiobacillus ferrooxidans ATCC 23270, Azoarcus sp.
EbN1, and Rhodoferax ferrireducens DSM 15236. In the
latter two species, the CRISPR/cas locus is found on a
plasmid. This family is one of several characteristic of
a type of CRISPR-associated (cas) gene cluster we
designate Aferr after A. ferrooxidans, where it is both
chromosomal and the only type of cas gene cluster found.
The gene is designated csf2 (CRISPR/cas Subtype as in A.
ferrooxidans protein 2), as it lies second closest to
the repeats [Mobile and extrachromosomal element
functions, Other].
Length = 344
Score = 29.5 bits (66), Expect = 4.6
Identities = 17/43 (39%), Positives = 19/43 (44%), Gaps = 1/43 (2%)
Query: 471 ALRAALPVGTVSGAPKVKAMELIDELEVNRRGPYSGGFGGVSF 513
AL L G V+G P K M + E R PY G FGG
Sbjct: 88 ALYNVLQCGAVTGNPDGKDMT-LGEYRQARDDPYFGLFGGGPK 129
>gnl|CDD|226406 COG3889, COG3889, Predicted solute binding protein [General
function prediction only].
Length = 872
Score = 29.8 bits (67), Expect = 5.1
Identities = 14/64 (21%), Positives = 27/64 (42%), Gaps = 1/64 (1%)
Query: 5 TAATSMQSLSFSNRLVPPSHRLSLVPVTVTRINLPKSAATVSTVKCCVSRQTTTTTSTAT 64
T + S S + + V +T T + ++ S + QT+T+T+T T
Sbjct: 781 TKTETTLSYSAYSN-TSILIETTSVVITKTVTQTQTTTSSPSPTQTTSPTQTSTSTTTTT 839
Query: 65 APAT 68
+P+
Sbjct: 840 SPSQ 843
>gnl|CDD|234400 TIGR03924, T7SS_EccC_a, type VII secretion protein EccCa. This
model represents the N-terminal domain or EccCa subunit
of the type VII secretion protein EccC as found in the
Actinobacteria. Type VII secretion is defined more
broadly as including secretion systems for ESAT-6-like
proteins in the Firmicutes as well as in the
Actinobacteria, but this family does not show close
homologs in the Firmicutes [Protein fate, Protein and
peptide secretion and trafficking].
Length = 661
Score = 29.6 bits (67), Expect = 5.8
Identities = 21/61 (34%), Positives = 24/61 (39%), Gaps = 17/61 (27%)
Query: 471 ALRAALPV-GTVSGAPKVKAMELIDELEVNRRGPYSGGFGGVSFTGDMDIALAL-RTMVF 528
ALR L TV P V+ RG F +S GD D A AL R M+
Sbjct: 156 ALRRFLRAHSTVPDLPVA----------VSLRG-----FARISLVGDRDQARALARAMLC 200
Query: 529 Q 529
Q
Sbjct: 201 Q 201
>gnl|CDD|234210 TIGR03439, methyl_EasF, probable methyltransferase domain, EasF
family. This model represents an uncharacterized domain
of about 300 amino acids with homology to
S-adenosylmethionine-dependent methyltransferases.
Proteins with this domain are exclusively fungal. A few,
such as EasF from Neotyphodium lolii, are associated
with the biosynthesis of ergot alkaloids, a class of
fungal secondary metabolites. EasF may, in fact, be the
AdoMet:dimethylallyltryptophan N-methyltransferase, the
enzyme that follows tryptophan dimethylallyltransferase
(DMATS) in ergot alkaloid biosynthesis. Several other
members of this family, including mug158 (meiotically
up-regulated gene 158 protein) from Schizosaccharomyces
pombe, contain an additional uncharacterized domain
DUF323 (pfam03781).
Length = 319
Score = 29.1 bits (66), Expect = 6.9
Identities = 12/37 (32%), Positives = 18/37 (48%)
Query: 396 TTEEDEMLETQLLKDAKQCAEHVMLVDLGRNDVGKVA 432
T +E E+L+ A MLV+LG ++ KV
Sbjct: 56 TNDEIEILKKHSSDIAASIPSGSMLVELGSGNLRKVG 92
>gnl|CDD|151390 pfam10943, DUF2632, Protein of unknown function (DUF2632). This is
a family of membrane proteins with unknown function.
Length = 233
Score = 28.8 bits (64), Expect = 7.3
Identities = 14/39 (35%), Positives = 23/39 (58%)
Query: 37 NLPKSAATVSTVKCCVSRQTTTTTSTATAPATKLASDAS 75
N P +AA ++ +K CV+ Q T ++T A K+A+ S
Sbjct: 171 NKPLNAAQIAALKICVNGQWFAYTRSSTTSAAKVAAANS 209
>gnl|CDD|188209 TIGR02337, HpaR, homoprotocatechuate degradation operon regulator,
HpaR. This Helix-Turn-Helix transcriptional regulator
is a member of the MarR family (pfam01047) and is found
in association with operons for the degradation of
4-hydroxyphenylacetic acid via homoprotocatechuate.
Length = 118
Score = 27.7 bits (62), Expect = 7.7
Identities = 15/44 (34%), Positives = 24/44 (54%), Gaps = 2/44 (4%)
Query: 344 YRALRVVNPSPYM--TYLQARGCILVASSPEILTRVKKNKIVNR 385
+R LR++ M T L + CIL S IL R++++ +V R
Sbjct: 31 WRILRILAEQGSMEFTQLANQACILRPSLTGILARLERDGLVTR 74
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.318 0.132 0.384
Gapped
Lambda K H
0.267 0.0754 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 30,251,188
Number of extensions: 3002030
Number of successful extensions: 2759
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2615
Number of HSP's successfully gapped: 59
Length of query: 592
Length of database: 10,937,602
Length adjustment: 102
Effective length of query: 490
Effective length of database: 6,413,494
Effective search space: 3142612060
Effective search space used: 3142612060
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 62 (27.6 bits)