BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 039074
(534 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|O49931|TIC55_PEA Protein TIC 55, chloroplastic OS=Pisum sativum GN=TIC55 PE=1 SV=1
Length = 553
Score = 883 bits (2282), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 420/522 (80%), Positives = 458/522 (87%), Gaps = 12/522 (2%)
Query: 22 TPPTTNVIISLNKTPKSRK-----CHAVTDRSSTSTV----GDHKVLVGPASAEERRGER 72
T P++N S NK SR+ C A +T+ D KVLVGP+S +ER+GER
Sbjct: 35 TNPSSN--FSFNKALSSRRRKQAWCVAAAADVKDATLLDGEEDQKVLVGPSSEQERKGER 92
Query: 73 QVADYDWTEEWYPLYLTKDVPDDAPLGLTVFDQQIVLYKDGNGELRCYQDRCPHRLAKLS 132
+VADYDWTEEWYPLYLTK+VP DAPLGL V+D+ IVL++DGN + +CY+DRCPHRLAKLS
Sbjct: 93 EVADYDWTEEWYPLYLTKNVPHDAPLGLKVYDKNIVLFRDGNDQFQCYEDRCPHRLAKLS 152
Query: 133 EGQLIDGRLECLYHGWQFEGEGKCVKIPQLPADAKIPRSACVRTYEVKESQGVVWVWMSQ 192
EGQLIDGRLECLYHGWQFEGEGKCVKIPQLPADAKIP+SACV+TYEV++SQGV+WVWMS+
Sbjct: 153 EGQLIDGRLECLYHGWQFEGEGKCVKIPQLPADAKIPKSACVKTYEVRDSQGVLWVWMSR 212
Query: 193 KTPPNPDKLPWFENFARPGFQDVSTIHELPYDHSILLENLMDPAHIPISHDRTDWTAKRE 252
KTPPN K+PWFENFARPGFQD+ST HELPYDHSILLENLMDPAH+PISHDRTDW+AKRE
Sbjct: 213 KTPPNVSKIPWFENFARPGFQDISTTHELPYDHSILLENLMDPAHVPISHDRTDWSAKRE 272
Query: 253 DAQPLGFEVTERTDRGFAGRWGKEKDEPLPNFLRFEAPCVLQNNRELVDSKTGEKHYFTG 312
DAQ LGFEVTERTDRGFAG WG+EKD PNFLRFEAPCVLQNNRE+VD K GE ++F+G
Sbjct: 273 DAQALGFEVTERTDRGFAGWWGREKDGSKPNFLRFEAPCVLQNNREIVD-KNGEINHFSG 331
Query: 313 LFLCRPTGQGKSMLIVRFGATKRSPLAKLFPKWYFHQNASKVFEQDMGFLSSQNEVLMKE 372
LFLCRPTGQGKSMLIVRFGATKRSPL KLFP+WYFHQNASKVFEQDMGFLSSQNE+L+KE
Sbjct: 332 LFLCRPTGQGKSMLIVRFGATKRSPLIKLFPEWYFHQNASKVFEQDMGFLSSQNEILLKE 391
Query: 373 TVPTKELYLNLRSSDTWVAEYRKWMDKVGHGMPYHFGHSTISLPKVPAVVEHAPAGLVAG 432
VPTKELYLNL+SSDTWVAEYRKWMDKVGHGMPYHFGHSTISLP+ PAVVEHAPAGLVAG
Sbjct: 392 KVPTKELYLNLKSSDTWVAEYRKWMDKVGHGMPYHFGHSTISLPEEPAVVEHAPAGLVAG 451
Query: 433 VSASSPAKGAIGTMHAPNLANRYFRHVIHCKGCSSVIKAFSTWKNSLSVVAAALTVLAIL 492
+SASSPAKG IGTMHAPNLANRYFRHVIHCKGCSS IKAF WKN LS V AL LAIL
Sbjct: 452 LSASSPAKGGIGTMHAPNLANRYFRHVIHCKGCSSAIKAFQIWKNVLSGVVVALAALAIL 511
Query: 493 ASGRQWKAFCLASASLCLAGVYACSTAIAMNTTNFIRTHRRL 534
SGRQWK LASASLC GVYACSTAIAMNTTNFIR HRRL
Sbjct: 512 VSGRQWKVLLLASASLCSVGVYACSTAIAMNTTNFIRVHRRL 553
>sp|Q9SK50|TIC55_ARATH Protein TIC 55, chloroplastic OS=Arabidopsis thaliana GN=TIC55 PE=2
SV=1
Length = 539
Score = 799 bits (2063), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 385/487 (79%), Positives = 422/487 (86%), Gaps = 6/487 (1%)
Query: 49 STSTVGDHKVLVGPASAEERRGERQVADYDWTEEWYPLYLTKDVPDDAPLGLTVFDQQIV 108
S T G VL+ P EE+R E VADYDWTEEWYPLYLTK+VP+DAPLGLTV+D+QIV
Sbjct: 58 SDQTEGGGDVLLNPE--EEKRVE--VADYDWTEEWYPLYLTKNVPEDAPLGLTVYDRQIV 113
Query: 109 LYKDGNGELRCYQDRCPHRLAKLSEGQLIDGRLECLYHGWQFEGEGKCVKIPQLPADAKI 168
LYKDG G LRCY+DRCPHRLAKLSEGQLIDGRLECLYHGWQFEGEGKCVKIPQLPA AKI
Sbjct: 114 LYKDGEGTLRCYEDRCPHRLAKLSEGQLIDGRLECLYHGWQFEGEGKCVKIPQLPASAKI 173
Query: 169 PRSACVRTYEVKESQGVVWVWMSQKTPPNPDKLPWFENFARPGFQDVSTIHELPYDHSIL 228
P++ACV+TYEVK+SQGVVWVWMS KTPPNP+KLPWFENFARPGF D+ST HELPYDHSIL
Sbjct: 174 PKAACVKTYEVKDSQGVVWVWMSTKTPPNPEKLPWFENFARPGFFDISTTHELPYDHSIL 233
Query: 229 LENLMDPAHIPISHDRTDWTAKREDAQPLGFEVTERTDRGFAGRWGKEKDEPL-PNFLRF 287
LENLMDPAH+PISHDRTD+TAKREDAQPL FEVTER++RGFAG WG+EK+ N LRF
Sbjct: 234 LENLMDPAHVPISHDRTDFTAKREDAQPLVFEVTERSNRGFAGTWGREKEGGKGSNLLRF 293
Query: 288 EAPCVLQNNRELVDSKTGEKHYFTGLFLCRPTGQGKSMLIVRFGATKRSPLAKLFPKWYF 347
+APCVLQNNRE + K G K+YF+GLFLCRPTGQGKSMLIVRFG TKRSPL + P+W++
Sbjct: 294 DAPCVLQNNREF-EGKDGVKNYFSGLFLCRPTGQGKSMLIVRFGVTKRSPLVSVLPQWFW 352
Query: 348 HQNASKVFEQDMGFLSSQNEVLMKETVPTKELYLNLRSSDTWVAEYRKWMDKVGHGMPYH 407
HQNA KVFEQDMGFLSSQNEVLMKE VPTK+LYLNL+SSDTWVAEYRKWMDKVGHGMPYH
Sbjct: 353 HQNACKVFEQDMGFLSSQNEVLMKEKVPTKDLYLNLKSSDTWVAEYRKWMDKVGHGMPYH 412
Query: 408 FGHSTISLPKVPAVVEHAPAGLVAGVSASSPAKGAIGTMHAPNLANRYFRHVIHCKGCSS 467
FGH TISLPKVP VVEHAPAGL+A +SAS PAKG IGTMHAPNLANRYFRH+IHC+ CS+
Sbjct: 413 FGHRTISLPKVPPVVEHAPAGLIAALSASYPAKGGIGTMHAPNLANRYFRHIIHCRSCSN 472
Query: 468 VIKAFSTWKNSLSVVAAALTVLAILASGRQWKAFCLASASLCLAGVYACSTAIAMNTTNF 527
VIK+F WKN LS A ALT LAIL RQWKA L SA+LC A Y C AI +NT NF
Sbjct: 473 VIKSFELWKNILSATAVALTALAILVVSRQWKAVLLGSAALCSAAAYTCLRAINLNTNNF 532
Query: 528 IRTHRRL 534
IRTHRRL
Sbjct: 533 IRTHRRL 539
>sp|Q9FYC2|PAO_ARATH Pheophorbide a oxygenase, chloroplastic OS=Arabidopsis thaliana
GN=PAO PE=1 SV=1
Length = 537
Score = 167 bits (423), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 142/507 (28%), Positives = 226/507 (44%), Gaps = 82/507 (16%)
Query: 64 SAEERRGERQV--------ADYDWTEEWYPLYLTKDVPDDAPLGLTVFDQQIVLYKDGNG 115
S EE+R E + +++ W + WYP+ L +D+ + P + + +VL+ D N
Sbjct: 61 STEEKRIEEEYGGDKEEEGSEFKWRDHWYPVSLVEDLDPNVPTPFQLLGRDLVLWFDRND 120
Query: 116 E-LRCYQDRCPHRLAKLSEGQLID-GRLECLYHGWQFEGEGKCVKIPQLPADA------K 167
+ + D CPHRLA LSEG+L + G L+C YHGW F G G C +IPQ K
Sbjct: 121 QKWAAFDDLCPHRLAPLSEGRLDENGHLQCSYHGWSFGGCGSCTRIPQAATSGPEARAVK 180
Query: 168 IPRSACVRTYEVKESQGVVWVWMSQK-----TPPNPDKLPWFENFARPGFQDVSTIHELP 222
PR AC + SQG+++VW + P +LP ++F +P F V+ +L
Sbjct: 181 SPR-ACAIKFPTMVSQGLLFVWPDENGWDRANSIEPPRLP--DDFDKPEFSTVTIQRDLF 237
Query: 223 YDHSILLENLMDPAHIPISHDRTDWTAKREDAQPLGFEVTERTDRGFAGRWGKEKDEPLP 282
Y + L+EN+ DP+HI +H + T +R+ A+PL F+V GF G D P
Sbjct: 238 YGYDTLMENVSDPSHIDFAHHKV--TGRRDRAKPLPFKVESSGPWGFQ---GANDDSPRI 292
Query: 283 NFLRFEAPCVLQNNRELVDSK---TGEKHYFTGLFLCR---PTGQGKSMLIV-----RFG 331
+F APC N EL D+K G + + +++C P GK+ IV F
Sbjct: 293 T-AKFVAPCYSMNKIEL-DAKLPIVGNQKWV--IWICSFNIPMAPGKTRSIVCSARNFFQ 348
Query: 332 ATKRSPL-AKLFPKWYFHQNASKVFEQDMGFLSSQNEVLMKETVPTKELYLNLR------ 384
+ P ++ P+WY H ++ V++ DM L Q +V + +++ + + +N +
Sbjct: 349 FSVPGPAWWQVVPRWYEHWTSNLVYDGDMIVLQGQEKVFLAKSMESPDYDVNKQYTKLTF 408
Query: 385 ---SSDTWVAEYRKWMDKVGHGMPYHFGHSTISLPKVPAVVEHAPAGLVAGVSASSPAKG 441
+D +V +R W+ + G P FG ST S +P+ V
Sbjct: 409 TPTQADRFVLAFRNWLRRHGKSQPEWFG-STPSNQPLPSTV------------------- 448
Query: 442 AIGTMHAPNLANRYFRHVIHCKGCSSVIKAFSTWKNSLSVVAAALTVLAILASGRQWKAF 501
+ + +R+ +H C C +F K L A + S Q +
Sbjct: 449 ----LTKRQMLDRFDQHTQVCSSCKGAYNSFQILKKFLVGATVFWAATAGVPSDVQIR-L 503
Query: 502 CLASASLCLAGVYACSTAIAMNTTNFI 528
LA SL A A + A+ NF+
Sbjct: 504 VLAGLSLISA---ASAYALHEQEKNFV 527
>sp|Q8W496|PTC52_ARATH Protochlorophyllide-dependent translocon component 52,
chloroplastic OS=Arabidopsis thaliana GN=PTC52 PE=2 SV=1
Length = 559
Score = 155 bits (393), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 124/512 (24%), Positives = 206/512 (40%), Gaps = 121/512 (23%)
Query: 77 YDWTEEWYPLYLTKDVPDDAPLGLTVFDQQIVLYKDGN-GELRCYQDRCPHRLAKLSEGQ 135
+DW WYP+ D+ P G V +V++ D N + + D CPHRLA LS+G+
Sbjct: 79 FDWYANWYPVMPICDLDKKVPHGKKVMGIDLVVWWDRNEKQWKVMDDTCPHRLAPLSDGR 138
Query: 136 LID-GRLECLYHGWQFEGEGKCVKIPQLPADA---KIPRSACVRTYEVKESQGVVWVWMS 191
+ GRL+C+YHGW F G G C IPQ P D + ACV Y ++W W +
Sbjct: 139 IDQWGRLQCVYHGWCFNGSGDCKLIPQAPPDGPPVHTFKQACVAVYPSTVQHEIIWFWPN 198
Query: 192 Q----KTPPNPDKLPWFENFARPGFQDVSTIHELPYDHSILLENLMDPAHIPISH----- 242
K +K P+ P F + ++PY + +L+ENLMDPAH+P +H
Sbjct: 199 SDPKYKNIIETNKPPYIPELEDPSFTKLMGNRDIPYGYDVLVENLMDPAHVPYAHYGLMR 258
Query: 243 ----------------------------DRTDWTAKREDAQPLGFEVTERTDRGFAGR-- 272
++ D RE +PL V + ++GF +
Sbjct: 259 FPKPKGKYIICISNSCFNPFTNLQILLAEKID----REGGKPLEINVKKLDNKGFFSKQE 314
Query: 273 WGKEKDEPLPNFLRFEAPCVLQNNRELVDSKTGE-------------KHYFTGLFLCRPT 319
WG + F APCV +++ + + + E K + +F+C P
Sbjct: 315 WG---------YSNFIAPCVYRSSTDPLPEQEHEYPAPAASDKAALSKRRLSLIFICIPV 365
Query: 320 GQGKSMLIVRFGATKRSPLAKLFPKWYFHQNASKVFEQDMGFLSSQNEVLMKET------ 373
G+S LI F + K+ P+W FH + + + D+ L + +++
Sbjct: 366 SPGRSRLIWTFPRNFGVFIDKIVPRWVFHIGQNTILDSDLHLLHVEERKILERGPENWQK 425
Query: 374 ---VPTKELYLNLRSSDTWVAEYRKWMDKVGHG-MPYHFGHSTISLPKVPAVVEHAPAGL 429
+PTK SD V +R+W +K + + LP P +
Sbjct: 426 ACFIPTK--------SDANVVTFRRWFNKYSEARVDWRGKFDPFLLPPTPPREQ------ 471
Query: 430 VAGVSASSPAKGAIGTMHAPNLANRYFRHVIHCKGCSSVIKAFSTWKNSLSVV--AAALT 487
L +RY+ HV +C C K N+L V+ A++
Sbjct: 472 ---------------------LFDRYWSHVENCSSCKKAHKYL----NALEVILQIASVA 506
Query: 488 VLAILASGRQWKAFCLASASLCLAGVYACSTA 519
++ ++A +Q +A ++ +A V + + +
Sbjct: 507 MIGVMAVLKQTTMSNVARIAVLVAAVLSFAAS 538
>sp|Q9ZWM5|CAO_CHLRE Chlorophyllide a oxygenase, chloroplastic OS=Chlamydomonas
reinhardtii GN=CAO PE=2 SV=2
Length = 645
Score = 125 bits (315), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 90/338 (26%), Positives = 153/338 (45%), Gaps = 51/338 (15%)
Query: 83 WYPLYLTKDVPDDAPLGLTVFDQQIVLYKDGNGELRCYQDRCPHRLAKLSEGQLIDGRLE 142
WYP + +P D + +F + V+++D G+ C +D C HR LS G++++G++
Sbjct: 305 WYPAEFSARLPKDTLVPFELFGEPWVMFRDEKGQPSCIRDECAHRGCPLSLGKVVEGQVM 364
Query: 143 CLYHGWQFEGEGKCVKIPQLPADAKIPRSACVRTYEVKESQGVVWVWMSQKTPPNPDKLP 202
C YHGW+F G+G C K+P P R+ V E G +WVW P + LP
Sbjct: 365 CPYHGWEFNGDGACTKMPSTP----FCRNVGVAALPCAEKDGFIWVWPGDGLP--AETLP 418
Query: 203 WFENFARP--GFQDVSTIH-ELPYDHSILLENLMDPAHIPISHDRTDWTAKREDAQPLGF 259
+FA+P GF + I ++P +H +L+ENL+D AH P +H T+ P+
Sbjct: 419 ---DFAQPPEGFLIHAEIMVDVPVEHGLLIENLLDLAHAPFTH-----TSTFARGWPVPD 470
Query: 260 EVTERTDRGFAGRWGKEKDEPLPNFLRFEAPCV------LQNNRELVDSKTGE--KHYFT 311
V ++ +G W +P P + F+ PC+ L +++ T K++
Sbjct: 471 FVKFHANKALSGFW-----DPYPIDMAFQPPCMTLSTIGLAQPGKIMRGVTASQCKNHLH 525
Query: 312 GLFLCRPTGQGKSMLIVRFGATKRSPLAKLFPKWYFH---------QNASKVFEQDMGFL 362
L +C P+ +G + L+ R F W H Q A++V +D+ +
Sbjct: 526 QLHVCMPSKKGHTRLLYRMSLD--------FLPWMRHVPFIDRIWKQVAAQVLGEDLVLV 577
Query: 363 SSQNEVLMKETVPTKELYLNLRSSDTWVAEYRKWMDKV 400
Q + +++ + N D YR+W + V
Sbjct: 578 LGQQDRMLR----GGSNWSNPAPYDKLAVRYRRWRNGV 611
>sp|Q9XJ38|CAO_DUNSA Chlorophyllide a oxygenase, chloroplastic OS=Dunaliella salina
GN=CAO PE=2 SV=1
Length = 463
Score = 113 bits (282), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 92/359 (25%), Positives = 148/359 (41%), Gaps = 42/359 (11%)
Query: 61 GPASAEERRGERQVADYDWTEE-WYPLYLTKDVPDDAPLGLTVFDQQIVLYKDGNGELRC 119
GP + RR + D WYP K + + +F VL++D + C
Sbjct: 111 GPKPKDSRRLRSSLELEDGLRNFWYPTEFAKKLEPGMMVPFDLFGVPWVLFRDEHSAPTC 170
Query: 120 YQDRCPHRLAKLSEGQLIDGRLECLYHGWQFEGEGKCVKIPQLPADAKIPRSACVRTYEV 179
+D C HR LS G++I+G ++C YHGW+F+G G C K+P ++ V
Sbjct: 171 IKDSCAHRACPLSLGKVINGHVQCPYHGWEFDGSGACTKMPS----TRMCHGVGVAALPC 226
Query: 180 KESQGVVWVWMSQKTPPNPDKLPWFENFARPGFQDV--STIHELPYDHSILLENLMDPAH 237
E G VWVW PP+ + A P DV + ++P +H +L+ENL+D AH
Sbjct: 227 VEKDGFVWVWPGDGPPPDLPP----DFTAPPAGYDVHAEIMVDVPVEHGLLMENLLDLAH 282
Query: 238 IPISHDRTDWTAKREDAQPLGFEVTERTDRGFAGRWGKEKDEPLPNFLRFEAPCVL---- 293
P +H T P+ V + AG W +P P + F PC+
Sbjct: 283 APFTH-----TTTFARGWPIPEAVRFHATKMLAGDW-----DPYPISMSFNPPCIALSTI 332
Query: 294 ---QNNRELVDSKTGE-KHYFTGLFLCRPTGQGKSMLIVR-----FGATKRSPLAKLFPK 344
Q + + K E K + L +C P+ +G + L+ R +G K P +
Sbjct: 333 GLSQPGKIMRGYKAEECKRHLHQLHVCMPSKEGHTRLLYRMSLDFWGWAKHVPFVDVL-- 390
Query: 345 WYFHQNASKVFEQDMGFLSSQNEVLMKETVPTKELYLNLRSSDTWVAEYRKWMDKVGHG 403
+ + A +V +D+ + Q + + + + D YR+W + V G
Sbjct: 391 --WKKIAGQVLGEDLVLVLGQQARM----IGGDDTWCTPMPYDKLAVRYRRWRNMVADG 443
>sp|Q8S7E1|CAO_ORYSJ Chlorophyllide a oxygenase, chloroplastic OS=Oryza sativa subsp.
japonica GN=CAO PE=2 SV=1
Length = 541
Score = 107 bits (266), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 91/336 (27%), Positives = 152/336 (45%), Gaps = 41/336 (12%)
Query: 83 WYPLYLTKDVPDDAPLGLTVFDQQIVLYKDGNGELRCYQDRCPHRLAKLSEGQLIDGRLE 142
WYP+ + D+ DD + + F++Q V+++ +G C + C HR L G + +GR++
Sbjct: 220 WYPVAFSSDLKDDTMVPIDCFEEQWVIFRGKDGRPGCVMNTCAHRACPLHLGSVNEGRIQ 279
Query: 143 CLYHGWQFEGEGKCVKIPQLPADAKIPRSACVRTYEVKESQGVVWVWMSQKTPPN--PDK 200
C YHGW++ +GKC K+P + +R+ E +G+VW+W P + P
Sbjct: 280 CPYHGWEYSTDGKCEKMPSTKM-----LNVRIRSLPCFEQEGMVWIWPGNDPPKSTIPSL 334
Query: 201 LPWFENFARPGFQ-DVSTIHELPYDHSILLENLMDPAHIPISHDRTDWTAKREDAQPLGF 259
LP GF + ELP +H +LL+NL+D AH P +H T AK L
Sbjct: 335 LP------PSGFTIHAEIVMELPVEHGLLLDNLLDLAHAPFTHTST--FAKGWSVPSLVK 386
Query: 260 EVTERTDRGFAGRWGKEKDEPLPNFLRFEAPCVLQNNRELVDSKTGE---------KHYF 310
+T + G G W +P P + F PC++ + + SK G+ +
Sbjct: 387 FLTPSS--GLQGYW-----DPYPIDMEFRPPCMVLSTIGI--SKPGKLEGKSTKQCSTHL 437
Query: 311 TGLFLCRPTGQGKSMLIVRFGATKRSPLAKLFPKWY--FHQNASKVFEQDMGFLSSQNEV 368
L +C P+ + K+ L+ R + +P K P + + A KV +D+ + Q E
Sbjct: 438 HQLHICLPSSRNKTRLLYRM-SLDFAPWIKHVPFMHILWSHFAEKVLNEDLRLVLGQQER 496
Query: 369 LMKETVPTKELYLNLRSSDTWVAEYRKWMDKVGHGM 404
++ ++ S D YR W D + G+
Sbjct: 497 MINGA----NVWNWPVSYDKLGIRYRLWRDAIERGV 528
>sp|Q9MBA1|CAO_ARATH Chlorophyllide a oxygenase, chloroplastic OS=Arabidopsis thaliana
GN=CAO PE=1 SV=1
Length = 536
Score = 100 bits (250), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 99/388 (25%), Positives = 177/388 (45%), Gaps = 45/388 (11%)
Query: 28 VIISLNKTPKSRKCHAVT-DRSSTSTVGDHKVLVGPASAEERRGERQVADYDWTEEWYPL 86
V+ L+K S AV DR T+T + GP + ++ WYP+
Sbjct: 174 VVTELDKPSSSTTASAVELDREKTNTGAKSLNVSGPVPPYSP----HLKNF-----WYPV 224
Query: 87 YLTKDVPDDAPLGLTVFDQQIVLYKDGNGELRCYQDRCPHRLAKLSEGQLIDGRLECLYH 146
T D+ D + + F+Q V+++ +G+ C ++ C HR L G + +GR++C YH
Sbjct: 225 AFTADLKHDTMVPIECFEQPWVIFRGEDGKPGCVRNTCAHRACPLDLGTVNEGRIQCPYH 284
Query: 147 GWQFEGEGKCVKIPQLP-ADAKIPRSACVRTYEVKESQGVVWVWMSQKTPPNPDKLPWFE 205
GW++ +G+C K+P KI C+ E +G++W+W + PP P LP +
Sbjct: 285 GWEYSTDGECKKMPSTKLLKVKIKSLPCL------EQEGMIWIWPGDE-PPAP-ILPSLQ 336
Query: 206 NFARPGFQ-DVSTIHELPYDHSILLENLMDPAHIPISHDRTDWTAKREDAQPLGFEVTER 264
+ GF + +LP +H +LL+NL+D AH P +H T AK L +T
Sbjct: 337 PPS--GFLIHAELVMDLPVEHGLLLDNLLDLAHAPFTHTST--FAKGWSVPSLVKFLTPT 392
Query: 265 TDRGFAGRWGKEKDEPLPNFLRFEAPCVLQNNREL-----VDSKTGEK--HYFTGLFLCR 317
+ G G W +P P + F+ PC++ + + ++ K+ ++ + L +C
Sbjct: 393 S--GLQGYW-----DPYPIDMEFKPPCIVLSTIGISKPGKLEGKSTQQCATHLHQLHVCL 445
Query: 318 PTGQGKSMLIVRFGATKRSPLAKLFP--KWYFHQNASKVFEQDMGFLSSQNEVLMKETVP 375
P+ + K+ L+ R + +P+ K P + + A +V +D+ + Q E ++
Sbjct: 446 PSSKNKTRLLYRM-SLDFAPILKNLPFMEHLWRHFAEQVLNEDLRLVLGQQERMLNGA-- 502
Query: 376 TKELYLNLRSSDTWVAEYRKWMDKVGHG 403
++ + D YR W + V G
Sbjct: 503 --NIWNLPVAYDKLGVRYRLWRNAVDRG 528
>sp|Q9AHG3|TSAM2_COMTE Putative toluene-4-sulfonate monooxygenase system iron-sulfur
subunit TsaM2 OS=Comamonas testosteroni GN=tsaM2 PE=5
SV=1
Length = 346
Score = 96.3 bits (238), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 61/197 (30%), Positives = 91/197 (46%), Gaps = 6/197 (3%)
Query: 83 WYPLYLTKDVPDDAPLGLTVFDQQIVLYKDGNGELRCYQDRCPHRLAKLSEGQLIDGRLE 142
WY + D PL T ++++VL++ +G +DRC HRLA LS G + D +
Sbjct: 7 WYVAGMATDC-SRKPLARTFLNEKVVLFRTHDGHAVALEDRCCHRLAPLSLGDVEDAGIR 65
Query: 143 CLYHGWQFEGEGKCVKIPQLPADAKIPRSACVRTYEVKESQGVVWVWMSQKTPPNPDKL- 201
C YHG F G CV+I P +IP CVR + + E G++W+WM NPD +
Sbjct: 66 CRYHGMVFNASGACVEI---PGQEQIPPGMCVRRFPLVERHGLLWIWMGDPARANPDDIV 122
Query: 202 PWFENFARPGFQDVSTIHELPYDHSILLENLMDPAHIPISHDRTDWTAKREDAQPLGFEV 261
N A D IH ++ ++++NL+D H+ H T T +P+
Sbjct: 123 DELWNGAPEWRTDSGYIH-YQANYQLIVDNLLDFTHLAWVHPTTLGTDSAASLKPVIERD 181
Query: 262 TERTDRGFAGRWGKEKD 278
T T + RW D
Sbjct: 182 TTGTGKLTITRWYLNDD 198
>sp|O05616|VANA_PSEUH Vanillate O-demethylase oxygenase subunit OS=Pseudomonas sp.
(strain HR199 / DSM 7063) GN=vanA PE=3 SV=1
Length = 354
Score = 91.7 bits (226), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 50/160 (31%), Positives = 82/160 (51%), Gaps = 5/160 (3%)
Query: 83 WYPLYLTKDVPDDAPLGLTVFDQQIVLYKDGNGELRCYQDRCPHRLAKLSEGQLIDGRLE 142
WY + T D D PLG + +++IV Y+ G + +D CPHR A LS G + DG+L
Sbjct: 7 WY-VACTPDEIADKPLGRQICNEKIVFYRGPEGRVAAVEDFCPHRGAPLSLGFVRDGKLI 65
Query: 143 CLYHGWQFEGEGKCVKIPQLPADAKIPRSACVRTYEVKESQGVVWVWMSQKTPPNPDKLP 202
C YHG + EGK + +P ++ C+++Y V+E G +WVW + +P +
Sbjct: 66 CGYHGLEMGCEGKTLAMP----GQRVQGFPCIKSYAVEERYGFIWVWPGDRELADPALIH 121
Query: 203 WFENFARPGFQDVSTIHELPYDHSILLENLMDPAHIPISH 242
E P + ++ + D+ ++++NLMD H H
Sbjct: 122 HLEWADNPEWAYGGGLYHIACDYRLMIDNLMDLTHETYVH 161
>sp|P94679|TSAM1_COMTE Toluene-4-sulfonate monooxygenase system iron-sulfur subunit TsaM1
OS=Comamonas testosteroni GN=tsaM1 PE=1 SV=1
Length = 347
Score = 90.5 bits (223), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 53/196 (27%), Positives = 90/196 (45%), Gaps = 3/196 (1%)
Query: 83 WYPLYLTKDVPDDAPLGLTVFDQQIVLYKDGNGELRCYQDRCPHRLAKLSEGQLIDGRLE 142
WY ++P + T+ ++ ++LY+D G + ++RC HR A L G+ +
Sbjct: 7 WYVAAWDTEIPAEGLFHRTLLNEPVLLYRDTQGRVVALENRCCHRSAPLHIGRQEGDCVR 66
Query: 143 CLYHGWQFEGEGKCVKIPQLPADAKIPRSACVRTYEVKESQGVVWVWMSQKTPPNPDKLP 202
CLYHG +F G CV+I P +IP C+++Y V E +VW+WM NPD +
Sbjct: 67 CLYHGLKFNPSGACVEI---PGQEQIPPKTCIKSYPVVERNRLVWIWMGDPARANPDDIV 123
Query: 203 WFENFARPGFQDVSTIHELPYDHSILLENLMDPAHIPISHDRTDWTAKREDAQPLGFEVT 262
+ P ++ ++ ++++NL+D H+ H T T +P+ T
Sbjct: 124 DYFWHDSPEWRMKPGYIHYQANYKLIVDNLLDFTHLAWVHPTTLGTDSAASLKPVIERDT 183
Query: 263 ERTDRGFAGRWGKEKD 278
T + RW D
Sbjct: 184 TGTGKLTITRWYLNDD 199
>sp|Q44256|CBAA_COMTE 3-chlorobenzoate-3,4-dioxygenase oxygenase subunit OS=Comamonas
testosteroni GN=cbaA PE=3 SV=1
Length = 432
Score = 85.9 bits (211), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 42/114 (36%), Positives = 65/114 (57%), Gaps = 1/114 (0%)
Query: 81 EEWYP-LYLTKDVPDDAPLGLTVFDQQIVLYKDGNGELRCYQDRCPHRLAKLSEGQLIDG 139
E W P L T+ +P+ L + +++V +++ +G + RCPHR L G++ +G
Sbjct: 25 EYWIPALKSTELEAGGSPVRLLLLGEKLVAFREPSGAVGVMDSRCPHRGVSLFMGRVEEG 84
Query: 140 RLECLYHGWQFEGEGKCVKIPQLPADAKIPRSACVRTYEVKESQGVVWVWMSQK 193
L C+YHGW+F EGKCV +P + + + S V Y VKE GVVWV+M +
Sbjct: 85 GLRCVYHGWKFSAEGKCVDMPSVRPEDEFKNSVRVARYPVKEMAGVVWVYMGTR 138
>sp|P12609|VANA_PSES9 Vanillate O-demethylase oxygenase subunit OS=Pseudomonas sp.
(strain ATCC 19151) GN=vanA PE=3 SV=1
Length = 329
Score = 82.4 bits (202), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 46/172 (26%), Positives = 81/172 (47%), Gaps = 7/172 (4%)
Query: 102 VFDQQIVLYKDGNGELRCYQDRCPHRLAKLSEGQLIDGRLECLYHGWQFEGEGKCVKIPQ 161
+ ++++V+Y+ + +D CPHR A LS G + DG+L C YHG + +G+ +P
Sbjct: 2 ICNERMVIYRGAGQRVAALEDFCPHRGAPLSLGSIQDGKLVCGYHGLVMDCDGRTASMPA 61
Query: 162 LPADAKIPRSACVRTYEVKESQGVVWVWMSQKTPPNPDKLPWFENFARPGFQDVSTIHEL 221
++ C+R + +E G +WVW +P +P E P + ++ +
Sbjct: 62 ----QRVQAFPCIRAFPAQERHGFIWVWPGDAALADPALIPHLEWAENPAWAYGGGLYHI 117
Query: 222 PYDHSILLENLMDPAHIPISHDRTDWTAKREDAQPLGFEVTERTDRGFAGRW 273
D+ ++++NLMD H H + K D P+ V DR GR+
Sbjct: 118 ACDYRLMIDNLMDLTHETYVH-ASSIGQKEIDEAPVSTRV--EGDRLITGRF 166
>sp|Q05183|PHT3_PSEPU Phthalate 4,5-dioxygenase oxygenase subunit OS=Pseudomonas putida
GN=pht3 PE=2 SV=1
Length = 439
Score = 73.6 bits (179), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 37/124 (29%), Positives = 65/124 (52%), Gaps = 7/124 (5%)
Query: 83 WYPLYLTKDV--PDDAPLGLTVFDQQIVLYKDGNGELRCYQDRCPHRLAKLSEGQLIDGR 140
W P+ L ++V PD P+ +F + +V+++D +G + + CPHR L G+ +
Sbjct: 27 WTPVCLLEEVSEPDGTPVRARLFGEDLVVFRDTDGRVGVMDEYCPHRRVSLIYGRNENSG 86
Query: 141 LECLYHGWQFEGEGKCVKIPQLPADAKIPRSACVRTYEVKESQGVVWVWMSQKTPPNPDK 200
L CLYHGW+ + +G V++ PA + + + Y+ +E G VW +M + D
Sbjct: 87 LRCLYHGWKMDVDGNVVEMVSEPAASNMCQKVKHTAYKTREWGGFVWAYMGPQ-----DA 141
Query: 201 LPWF 204
+P F
Sbjct: 142 IPEF 145
>sp|Q8G8B6|CARAA_PSERE Carbazole 1,9a-dioxygenase, terminal oxygenase component CarAa
OS=Pseudomonas resinovorans GN=carAa PE=1 SV=1
Length = 384
Score = 71.2 bits (173), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 59/205 (28%), Positives = 95/205 (46%), Gaps = 20/205 (9%)
Query: 83 WYPLYLTKDVPDDAPLGLTVFDQQIVLYKDGNGELRCYQDRCPHRLAKLSEGQLIDGR-- 140
WYP+ +K++ + P L + + +++ + +G+L C +DRC HR +LS +
Sbjct: 29 WYPVMFSKEIDEGEPKTLKLLGENLLVNRI-DGKLYCLKDRCLHRGVQLSVKVECKTKST 87
Query: 141 LECLYHGWQFEGE-GKCVKIPQLPADAKIPRSACVRTYEVKESQGVVWVWMSQKTPP--N 197
+ C YH W + E G I P A+I R ++TY V+E++G V++++ PP
Sbjct: 88 ITCWYHAWTYRWEDGVLCDILTNPTSAQIGRQK-LKTYPVQEAKGCVFIYLGDGDPPPLA 146
Query: 198 PDKLPWFENFARPGFQDVSTIHELPYDHSILLENLMDPAHIPISHDRTDWTAKRED-AQP 256
D P NF + + + + + +EN DP+HI I D K D A P
Sbjct: 147 RDTPP---NFLDDDMEILGKNQIIKSNWRLAVENGFDPSHIYIHKDSI--LVKDNDLALP 201
Query: 257 LGF-------EVTERTDRGFAGRWG 274
LGF + T D GR G
Sbjct: 202 LGFAPGGDRKQQTRVVDDDVVGRKG 226
>sp|P71875|KSHA_MYCTU 3-ketosteroid-9-alpha-monooxygenase oxygenase subunit
OS=Mycobacterium tuberculosis GN=kshA PE=1 SV=2
Length = 386
Score = 71.2 bits (173), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 41/152 (26%), Positives = 71/152 (46%), Gaps = 16/152 (10%)
Query: 48 SSTSTVGDHKVLVGPASAEERRGERQVADYDWTEEWYPLYLTKDVPDDAPLGLTVFDQQI 107
+ TS VG ++ G RG W+ L + KD + P G+ F ++
Sbjct: 3 TDTSGVGVREIDAGALPTRYARG------------WHCLGVAKDYLEGKPHGVEAFGTKL 50
Query: 108 VLYKDGNGELRCYQDRCPHRLAKLSEGQLIDGRLECLYHGWQFEGEGKCVKIPQLPADAK 167
V++ D +G+L+ C H LSEG + + C +H W++ G+G+C +P +
Sbjct: 51 VVFADSHGDLKVLDGYCRHMGGDLSEGTVKGDEVACPFHDWRWGGDGRCKLVPYA---RR 107
Query: 168 IPRSACVRTYEVKESQGVVWVWMSQK-TPPNP 198
PR A R++ G+++VW + PP+P
Sbjct: 108 TPRMARTRSWTTDVRSGLLFVWHDHEGNPPDP 139
>sp|D5IGG0|CARAA_SPHSX Carbazole 1,9a-dioxygenase, terminal oxygenase component CarAa
OS=Sphingomonas sp. GN=carAa PE=1 SV=1
Length = 378
Score = 68.9 bits (167), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 54/185 (29%), Positives = 90/185 (48%), Gaps = 17/185 (9%)
Query: 83 WYPLYLTKDVPDDAPLGLTVFDQQIVLYKDGNGELRCYQDRCPHRLAKLSEGQ--LIDGR 140
WYP+ L ++ + P+ + + ++I+L + G G++ QDRC HR LS+
Sbjct: 29 WYPVRLASEIAEGTPVPVKLLGEKILLNRVG-GKVYAIQDRCLHRGVTLSDRVECYSKNT 87
Query: 141 LECLYHGWQFE-GEGKCVKIPQLPADAKIPRSACVRTYEVKESQGVVWVWMSQKTPPNPD 199
+ C YHGW + +G+ V I P +I R A ++T+ V+E++G+++V++ P
Sbjct: 88 ISCWYHGWTYRWDDGRLVDILTNPGSVQIGRRA-LKTFPVEEAKGLIFVYVGDGEP---- 142
Query: 200 KLPWFENFARPGFQD----VSTIHELPYDHSIL-LENLMDPAHIPISHDRTDWTAKREDA 254
P E+ PGF D + H L + L EN D H+ I H + +
Sbjct: 143 -TPLIEDVP-PGFLDENRAIHGQHRLVASNWRLGAENGFDAGHVLI-HKNSILVKGNDII 199
Query: 255 QPLGF 259
PLGF
Sbjct: 200 LPLGF 204
>sp|A0R4R3|KSHA_MYCS2 3-ketosteroid-9-alpha-monooxygenase oxygenase subunit
OS=Mycobacterium smegmatis (strain ATCC 700084 /
mc(2)155) GN=kshA PE=1 SV=1
Length = 383
Score = 68.2 bits (165), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 41/152 (26%), Positives = 68/152 (44%), Gaps = 16/152 (10%)
Query: 49 STSTVGDHKVLVGPASAEERRGERQVADYDWTEEWYPLYLTKDVPDDAPLGLTVFDQQIV 108
+T TVG ++ G RG W+ L K+ D P + +F ++V
Sbjct: 2 ATETVGIREIDTGALPDRYARG------------WHCLGPVKNFSDGKPHSVNIFGTKLV 49
Query: 109 LYKDGNGELRCYQDRCPHRLAKLSEGQLIDGRLECLYHGWQFEGEGKCVKIPQLPADAKI 168
++ D GEL C H LS+G + + C +H W++ G+GKC +P +
Sbjct: 50 VFADSKGELNVLDAYCRHMGGDLSKGTVKGDEVACPFHDWRWGGDGKCKLVPYA---KRT 106
Query: 169 PRSACVRTYEVKESQGVVWVWMSQK-TPPNPD 199
PR A R++ G+++VW + PP P+
Sbjct: 107 PRLARTRSWHTDVRGGLLFVWHDHEGNPPQPE 138
>sp|Q52185|POBA_PSEPS Phenoxybenzoate dioxygenase subunit alpha OS=Pseudomonas
pseudoalcaligenes GN=pobA PE=2 SV=1
Length = 409
Score = 67.8 bits (164), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 53/172 (30%), Positives = 76/172 (44%), Gaps = 13/172 (7%)
Query: 83 WYPLYLTKDVPDDAPLGLTVFDQQIVLYKDGNGELRCYQDRCPHRLAKLSEGQLIDGRLE 142
W P+ L+ DV D P + + + +VL++D G RC HR L G + + +
Sbjct: 45 WQPVALSADV-TDRPQMVRILGEDLVLFRDKAGRPGLLYPRCMHRGTSLYYGHVEEAGIR 103
Query: 143 CLYHGWQFEGEGKCVKIPQLPADAKIPRSACVRTYEVKESQGVVWVWMS--QKTP--PNP 198
C YHGW F +G C+ P P +A Y V+E G+V+ +M +K P P
Sbjct: 104 CCYHGWLFAVDGTCLNQPCEPEGGLRREAARQPWYPVEERYGLVFAYMGPPEKKPVLPRY 163
Query: 199 DKLPWFE-----NFARPGFQDVSTIHE---LPYDHSILLENLMDPAHIPISH 242
D L E GF + E +PY EN+MDP H+ I H
Sbjct: 164 DILEDLEEGEFIEVISGGFVSYADHVEDPNVPYHWLQNWENIMDPYHVYILH 215
>sp|Q84BZ3|ANDAC_BURCE Anthranilate 1,2-dioxygenase large subunit OS=Burkholderia cepacia
GN=andAc PE=1 SV=1
Length = 423
Score = 58.5 bits (140), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 46/177 (25%), Positives = 76/177 (42%), Gaps = 18/177 (10%)
Query: 83 WYPLYLTKDVPDDAPLGLT-VFDQQIVLYKDGNGELRCYQDRCPHRLAKLSEGQLIDGRL 141
W + L ++P+ T V D +V+ + +G L + +RC HR A++ +
Sbjct: 53 WNFVALEAEIPNAGDFKSTFVGDTPVVVTRTEDGALSAWVNRCAHRGAQVCRKSRGNASS 112
Query: 142 E-CLYHGWQFEGEGKCVKIP---------QLPADAKIPRSACVRTYEVKESQGVVWVWMS 191
C+YH W F+ EG + +P +PAD P+ +R V +G+V+ S
Sbjct: 113 HTCVYHQWSFDNEGNLLGVPFRRGQKGMTGMPADFD-PKQHGLRKLRVDSYRGLVFATFS 171
Query: 192 QKTPPNPDKL-----PWFEN-FARPGFQDVSTIHELPYDHSILLENLMDPAHIPISH 242
P PD L PW + F +P T + + +EN+ DP H + H
Sbjct: 172 DDVAPLPDYLGAQMRPWIDRIFHKPIEYLGCTRQYSKSNWKLYMENVKDPYHASMLH 228
>sp|Q17938|DAF36_CAEEL Cholesterol desaturase daf-36 OS=Caenorhabditis elegans GN=daf-36
PE=1 SV=2
Length = 428
Score = 55.5 bits (132), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 31/110 (28%), Positives = 56/110 (50%), Gaps = 5/110 (4%)
Query: 83 WYPLYLTKDVPDDAPLGLTVFDQQIVLYKDGNGELRCYQDRCPHRLAKLSEGQLI--DGR 140
WY + ++ + ++ + +TV Q + L + +G + CPH A + G + D
Sbjct: 81 WYCVCESEKLANNQIMEITVLGQFLSLIRSESGAVYITDSYCPHIGANFNIGGRVVRDNC 140
Query: 141 LECLYHGWQFEGE-GKCVKIPQLPADAKIPRSACVRTYEVKESQGVVWVW 189
++C +HGW F E GKCV++P + +IP A V T+ E +++W
Sbjct: 141 IQCPFHGWIFSAETGKCVEVPY--DEGRIPEQAKVTTWPCIERNNNIYLW 188
>sp|P42436|NASE_BACSU Assimilatory nitrite reductase [NAD(P)H] small subunit OS=Bacillus
subtilis (strain 168) GN=nasE PE=2 SV=1
Length = 106
Score = 48.9 bits (115), Expect = 1e-04, Method: Composition-based stats.
Identities = 26/83 (31%), Positives = 43/83 (51%), Gaps = 11/83 (13%)
Query: 98 LGLTVF--DQQIVLYKDGNGELRCYQDRCPHRLAKLSEGQLIDGRLECLYHGWQFEGEGK 155
LG TV+ D+++ ++K +G +R ++RCPH+ L+EG + + C H W+ E
Sbjct: 21 LGKTVYIEDKELAVFKLSDGSIRAIENRCPHKGGVLAEGIVSGQYVFCPMHDWKISLEDG 80
Query: 156 CVKIPQLPADAKIPRSACVRTYE 178
V+ P CV+TYE
Sbjct: 81 IVQEPD---------HGCVKTYE 94
>sp|Q3C1E3|TPDA1_COMSP Terephthalate 1,2-dioxygenase, terminal oxygenase component subunit
alpha 1 OS=Comamonas sp. GN=tphA2I PE=1 SV=1
Length = 413
Score = 47.8 bits (112), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 44/179 (24%), Positives = 71/179 (39%), Gaps = 18/179 (10%)
Query: 81 EEWYPLYLTKDVPDDAPLGLTVFDQQ-IVLYKDGNGELRCYQDRCPHR--LAKLSEGQLI 137
E W L L ++P T + IV+ +D + E+ +++RC HR L L +
Sbjct: 38 EVWNYLCLESEIPGAGDFRTTFAGETPIVVVRDADQEIYAFENRCAHRGALIALEKSGRT 97
Query: 138 DGRLECLYHGWQFEGEGKCVKIP---QLPADAKIPRSACV-----RTYEVKESQGVVWVW 189
D +C+YH W + +G + + +P S C R V G+V+
Sbjct: 98 DS-FQCVYHAWSYNRQGDLTGVAFEKGVKGQGGMPASFCKEEHGPRKLRVAVFCGLVFGS 156
Query: 190 MSQKTPPNPDKL--PWFENFARPGFQDVSTI----HELPYDHSILLENLMDPAHIPISH 242
S+ P D L E R + V I +LP + + EN+ D H + H
Sbjct: 157 FSEDVPSIEDYLGPEICERIERVLHKPVEVIGRFTQKLPNNWKLYFENVKDSYHASLLH 215
>sp|Q3C1D5|TPDA2_COMSP Terephthalate 1,2-dioxygenase, terminal oxygenase component subunit
alpha 2 OS=Comamonas sp. GN=tphA2II PE=1 SV=1
Length = 413
Score = 47.8 bits (112), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 44/179 (24%), Positives = 71/179 (39%), Gaps = 18/179 (10%)
Query: 81 EEWYPLYLTKDVPDDAPLGLTVFDQQ-IVLYKDGNGELRCYQDRCPHR--LAKLSEGQLI 137
E W L L ++P T + IV+ +D + E+ +++RC HR L L +
Sbjct: 38 EVWNYLCLESEIPGAGDFRTTFAGETPIVVVRDADQEIYAFENRCAHRGALIALEKSGRT 97
Query: 138 DGRLECLYHGWQFEGEGKCVKIP---QLPADAKIPRSACV-----RTYEVKESQGVVWVW 189
D +C+YH W + +G + + +P S C R V G+V+
Sbjct: 98 D-SFQCVYHAWSYNRQGDLTGVAFEKGVKGQGGMPASFCKEEHGPRKLRVAVFCGLVFGS 156
Query: 190 MSQKTPPNPDKL--PWFENFARPGFQDVSTI----HELPYDHSILLENLMDPAHIPISH 242
S+ P D L E R + V I +LP + + EN+ D H + H
Sbjct: 157 FSEDVPSIEDYLGPEICERIERVLHKPVEVIGRFTQKLPNNWKLYFENVKDSYHASLLH 215
>sp|Q84BZ1|ANDAB_BURCE Anthranilate 1,2-dioxygenase ferredoxin subunit OS=Burkholderia
cepacia GN=andAb PE=1 SV=1
Length = 108
Score = 44.3 bits (103), Expect = 0.002, Method: Composition-based stats.
Identities = 33/110 (30%), Positives = 48/110 (43%), Gaps = 15/110 (13%)
Query: 82 EWYPLYLTKDVPDDAPLGLTVFDQQIVLYKDGNGELRCYQDRCPHRLAKLSEGQLIDGRL 141
EW+PL + +D P + I +++ G+ EL D C H A+LSEG + DG +
Sbjct: 8 EWHPLGAIDEFTEDEPAARVAGQKPIAVFRIGD-ELFAMHDLCSHGHARLSEGYVEDGCV 66
Query: 142 ECLYHGWQFE---GEGKCVKIPQLPADAKIPRSACVRTYEVKESQGVVWV 188
EC H + G KC P + VR Y ++ G V V
Sbjct: 67 ECPLHQGLIDIRTGAPKCA-----------PITEPVRVYPIRIVDGQVEV 105
>sp|Q51493|NDOA_PSEAI Naphthalene 1,2-dioxygenase system ferredoxin subunit
OS=Pseudomonas aeruginosa GN=ndoA PE=3 SV=3
Length = 104
Score = 44.3 bits (103), Expect = 0.003, Method: Composition-based stats.
Identities = 32/111 (28%), Positives = 53/111 (47%), Gaps = 16/111 (14%)
Query: 80 TEEWYPLYLTKDVPDDAPLGLTVFDQQIVLYKDGNGELRCYQDRCPHRLAKLSEGQLIDG 139
TE+W ++P+ LG+TV +++ LY + GE+ + C H A++S+G L
Sbjct: 2 TEKWIDAVALYEIPEGDVLGVTVEGKELALY-EVEGEIYATDNLCTHGAARMSDGFLEGR 60
Query: 140 RLECLYHGWQFE---GEGKCVKIPQLPADAKIPRSACVRTYEVK-ESQGVV 186
+EC H +F+ G C + Q ++TY VK E Q V+
Sbjct: 61 EIECPLHQGRFDVCTGRALCAPVTQ-----------NIKTYPVKIEGQRVM 100
>sp|P0ABR7|YEAW_ECOLI Putative dioxygenase subunit alpha YeaW OS=Escherichia coli (strain
K12) GN=yeaW PE=3 SV=1
Length = 374
Score = 43.1 bits (100), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 46/206 (22%), Positives = 79/206 (38%), Gaps = 42/206 (20%)
Query: 102 VFDQQIVLYKDGNGELRCYQDRCPHRLAKLSEGQ-LIDGRLECLYHGWQFEGEG------ 154
+ + IVL + + LR + + CPHR +L G+ + C YH W F+ +G
Sbjct: 67 IIGESIVLVRGRDKVLRAFYNVCPHRGHQLLSGEGKAKNVITCPYHAWAFKLDGNLAHAR 126
Query: 155 KCVKIPQLPADAKIPRSACVRTYEVKESQGVVWVWMSQKTPPNPDKLPWFENFARPGFQD 214
C + +D A + ++E G V++ M D+LP +
Sbjct: 127 NCENVANFDSD-----KAQLVPVRLEEYAGFVFINMDPNATSVEDQLP---GLGAKVLEA 178
Query: 215 VSTIHEL----------PYDHSILLENLMDPAHIPISH---------DRT------DWTA 249
+H+L P + +++N ++ H +H DR +WT
Sbjct: 179 CPEVHDLKLAARFTTRTPANWKNIVDNYLECYHCGPAHPGFSDSVQVDRYWHTMHGNWTL 238
Query: 250 KREDAQP--LGFEVTERTDRGFAGRW 273
+ A+P F+ E TD F G W
Sbjct: 239 QYGFAKPSEQSFKFEEGTDAAFHGFW 264
>sp|P0ABR8|YEAW_ECO57 Putative dioxygenase subunit alpha YeaW OS=Escherichia coli O157:H7
GN=yeaW PE=3 SV=1
Length = 374
Score = 43.1 bits (100), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 46/206 (22%), Positives = 79/206 (38%), Gaps = 42/206 (20%)
Query: 102 VFDQQIVLYKDGNGELRCYQDRCPHRLAKLSEGQ-LIDGRLECLYHGWQFEGEG------ 154
+ + IVL + + LR + + CPHR +L G+ + C YH W F+ +G
Sbjct: 67 IIGESIVLVRGRDKVLRAFYNVCPHRGHQLLSGEGKAKNVITCPYHAWAFKLDGNLAHAR 126
Query: 155 KCVKIPQLPADAKIPRSACVRTYEVKESQGVVWVWMSQKTPPNPDKLPWFENFARPGFQD 214
C + +D A + ++E G V++ M D+LP +
Sbjct: 127 NCENVANFDSD-----KAQLVPVRLEEYAGFVFINMDPNATSVEDQLP---GLGAKVLEA 178
Query: 215 VSTIHEL----------PYDHSILLENLMDPAHIPISH---------DRT------DWTA 249
+H+L P + +++N ++ H +H DR +WT
Sbjct: 179 CPEVHDLKLAARFTTRTPANWKNIVDNYLECYHCGPAHPGFSDSVQVDRYWHTMHGNWTL 238
Query: 250 KREDAQP--LGFEVTERTDRGFAGRW 273
+ A+P F+ E TD F G W
Sbjct: 239 QYGFAKPSEQSFKFEEGTDAAFHGFW 264
>sp|Q9SZR0|CHMO_ARATH Choline monooxygenase, chloroplastic OS=Arabidopsis thaliana
GN=At4g29890 PE=2 SV=2
Length = 422
Score = 42.7 bits (99), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 18/61 (29%), Positives = 29/61 (47%)
Query: 104 DQQIVLYKDGNGELRCYQDRCPHRLAKLSEGQLIDGRLECLYHGWQFEGEGKCVKIPQLP 163
D V+ +D NG++ + + C H + L+ G CLYHGW + G VK ++
Sbjct: 118 DVDFVVCRDENGKIHAFHNVCSHHASILASGNGRKSCFVCLYHGWTYSLSGSLVKATRMS 177
Query: 164 A 164
Sbjct: 178 G 178
>sp|P07769|BENA_ACIAD Benzoate 1,2-dioxygenase subunit alpha OS=Acinetobacter sp. (strain
ADP1) GN=benA PE=3 SV=2
Length = 461
Score = 42.4 bits (98), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 27/92 (29%), Positives = 44/92 (47%), Gaps = 3/92 (3%)
Query: 77 YDWTEEWYPLYLTKDVPDDAPLGLTVFDQQ-IVLYKDGNGELRCYQDRCPHRLAKLSEGQ 135
Y + W L +P++ T +Q I++ ++ NGEL + C HR A+L +
Sbjct: 47 YIFEGNWVYLAHESQIPNNNDYYTTYIGRQPILIARNRNGELNAMINACSHRGAQLCRHK 106
Query: 136 LIDGR-LECLYHGWQFEGEGKCVKIPQLPADA 166
+ C +HGW F GK +K+ P+DA
Sbjct: 107 RGNKTTYTCPFHGWTFNNSGKLLKVKD-PSDA 137
>sp|O52381|NAGAB_RALSP Naphthalene 1,2-dioxygenase/salicylate 5-hydroxylase systems,
ferredoxin component OS=Ralstonia sp. GN=nagAb PE=1 SV=1
Length = 104
Score = 42.0 bits (97), Expect = 0.013, Method: Composition-based stats.
Identities = 28/104 (26%), Positives = 48/104 (46%), Gaps = 15/104 (14%)
Query: 80 TEEWYPLYLTKDVPDDAPLGLTVFDQQIVLYKDGNGELRCYQDRCPHRLAKLSEGQLIDG 139
T+ W D+P+ +G+ V ++I LY + GE+ + C H A++S+G L
Sbjct: 2 TQNWIDAACLDDIPEGDVVGVKVNGKEIALY-EVEGEIYATDNLCTHGAARMSDGFLEGR 60
Query: 140 RLECLYHGWQFE---GEGKCVKIPQLPADAKIPRSACVRTYEVK 180
+EC H +F+ G+ C P + ++TY VK
Sbjct: 61 EIECPLHQGRFDVCTGKALCT-----------PLTKDIKTYPVK 93
>sp|P0A111|NDOB_PSEU8 Naphthalene 1,2-dioxygenase subunit alpha OS=Pseudomonas sp.
(strain C18) GN=doxB PE=1 SV=1
Length = 449
Score = 41.2 bits (95), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 42/180 (23%), Positives = 73/180 (40%), Gaps = 30/180 (16%)
Query: 86 LYLTKD----VPDD---APLGLTVFDQQIVLYKDGNGELRCYQDRCPHRLAKLSEGQLID 138
L+LT D P D A +G+ ++++ + +G +R + + C HR L + +
Sbjct: 40 LFLTHDSLIPAPGDYVTAKMGI----DEVIVSRQNDGSIRAFLNVCRHRGKTLVSVEAGN 95
Query: 139 GR-LECLYHGWQFEGEGKCVKIP-QLPADAKIPRSACVRTYEVKESQ---GVVWVWMSQK 193
+ C YHGW F G+ +P + + C+ EV + G ++ Q+
Sbjct: 96 AKGFVCSYHGWGFGSNGELQSVPFEKDLYGESLNKKCLGLKEVARVESFHGFIYGCFDQE 155
Query: 194 TPPNPDKLPWFENFARPGFQDVSTIHELPYDHSILLENLMDPAHIPISHDRTDWTAKRED 253
PP D L D + E + HS LE + P + I + +W A E+
Sbjct: 156 APPLMDYLG-----------DAAWYLEPMFKHSGGLELVGPPGKVVI---KANWKAPAEN 201
>sp|P0A110|NDOB_PSEPU Naphthalene 1,2-dioxygenase subunit alpha OS=Pseudomonas putida
GN=ndoB PE=1 SV=1
Length = 449
Score = 41.2 bits (95), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 42/180 (23%), Positives = 73/180 (40%), Gaps = 30/180 (16%)
Query: 86 LYLTKD----VPDD---APLGLTVFDQQIVLYKDGNGELRCYQDRCPHRLAKLSEGQLID 138
L+LT D P D A +G+ ++++ + +G +R + + C HR L + +
Sbjct: 40 LFLTHDSLIPAPGDYVTAKMGI----DEVIVSRQNDGSIRAFLNVCRHRGKTLVSVEAGN 95
Query: 139 GR-LECLYHGWQFEGEGKCVKIP-QLPADAKIPRSACVRTYEVKESQ---GVVWVWMSQK 193
+ C YHGW F G+ +P + + C+ EV + G ++ Q+
Sbjct: 96 AKGFVCSYHGWGFGSNGELQSVPFEKDLYGESLNKKCLGLKEVARVESFHGFIYGCFDQE 155
Query: 194 TPPNPDKLPWFENFARPGFQDVSTIHELPYDHSILLENLMDPAHIPISHDRTDWTAKRED 253
PP D L D + E + HS LE + P + I + +W A E+
Sbjct: 156 APPLMDYLG-----------DAAWYLEPMFKHSGGLELVGPPGKVVI---KANWKAPAEN 201
>sp|Q8GI16|CARAC_PSERE Ferredoxin CarAc OS=Pseudomonas resinovorans GN=carAc PE=1 SV=1
Length = 107
Score = 40.8 bits (94), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 26/87 (29%), Positives = 43/87 (49%), Gaps = 9/87 (10%)
Query: 107 IVLYKDGNGELRCYQDRCPHRLAKLSEGQLIDGRLECLYHGWQFEGEGKCVKIPQLPADA 166
+ +Y+ G+ + +D C H +A LSEG L +EC +HG F C +P A
Sbjct: 30 LAVYRVGD-QFYATEDTCTHGIASLSEGTLDGDVIECPFHGGAFN---VCTGMP-----A 80
Query: 167 KIPRSACVRTYEVKESQGVVWVWMSQK 193
P + + +EV+ +G V+V +K
Sbjct: 81 SSPCTVPLGVFEVEVKEGEVYVAGEKK 107
>sp|O22553|CHMO_BETVU Choline monooxygenase, chloroplastic OS=Beta vulgaris GN=CMO PE=2
SV=1
Length = 446
Score = 39.7 bits (91), Expect = 0.057, Method: Compositional matrix adjust.
Identities = 17/53 (32%), Positives = 27/53 (50%)
Query: 106 QIVLYKDGNGELRCYQDRCPHRLAKLSEGQLIDGRLECLYHGWQFEGEGKCVK 158
+ ++ +DG GEL + + C HR + L+ G C YHGW + +G K
Sbjct: 151 EYLVSRDGQGELHAFHNVCTHRASILACGSGKKSCFVCPYHGWVYGLDGSLAK 203
>sp|P0A186|NDOA_PSEU8 Naphthalene 1,2-dioxygenase system ferredoxin subunit
OS=Pseudomonas sp. (strain C18) GN=doxA PE=3 SV=2
Length = 104
Score = 39.7 bits (91), Expect = 0.062, Method: Composition-based stats.
Identities = 28/104 (26%), Positives = 48/104 (46%), Gaps = 15/104 (14%)
Query: 80 TEEWYPLYLTKDVPDDAPLGLTVFDQQIVLYKDGNGELRCYQDRCPHRLAKLSEGQLIDG 139
T +W D+ + LG+TV +++ LY + GE+ + C H A++S+G L
Sbjct: 2 TVKWIEAVALSDILEGDVLGVTVEGKELALY-EVEGEIYATDNLCTHGSARMSDGYLEGR 60
Query: 140 RLECLYHGWQFE---GEGKCVKIPQLPADAKIPRSACVRTYEVK 180
+EC H +F+ G+ C + Q ++TY VK
Sbjct: 61 EIECPLHQGRFDVCTGKALCAPVTQ-----------NIKTYPVK 93
>sp|P0A185|NDOA_PSEPU Naphthalene 1,2-dioxygenase system ferredoxin subunit
OS=Pseudomonas putida GN=ndoA PE=1 SV=2
Length = 104
Score = 39.7 bits (91), Expect = 0.062, Method: Composition-based stats.
Identities = 28/104 (26%), Positives = 48/104 (46%), Gaps = 15/104 (14%)
Query: 80 TEEWYPLYLTKDVPDDAPLGLTVFDQQIVLYKDGNGELRCYQDRCPHRLAKLSEGQLIDG 139
T +W D+ + LG+TV +++ LY + GE+ + C H A++S+G L
Sbjct: 2 TVKWIEAVALSDILEGDVLGVTVEGKELALY-EVEGEIYATDNLCTHGSARMSDGYLEGR 60
Query: 140 RLECLYHGWQFE---GEGKCVKIPQLPADAKIPRSACVRTYEVK 180
+EC H +F+ G+ C + Q ++TY VK
Sbjct: 61 EIECPLHQGRFDVCTGKALCAPVTQ-----------NIKTYPVK 93
>sp|O07824|NDOB_PSEFL Naphthalene 1,2-dioxygenase subunit alpha OS=Pseudomonas
fluorescens GN=ndoB PE=3 SV=1
Length = 449
Score = 38.9 bits (89), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 41/180 (22%), Positives = 74/180 (41%), Gaps = 30/180 (16%)
Query: 86 LYLTKDV----PDD---APLGLTVFDQQIVLYKDGNGELRCYQDRCPHRLAKLSEGQLID 138
L+LT D P D A +G+ ++++ + +G +R + + C HR L + +
Sbjct: 40 LFLTHDSLIPSPGDYVTAKMGI----DEVIVSRQSDGSIRAFLNVCRHRGKTLVNAEAGN 95
Query: 139 GR-LECLYHGWQFEGEGKCVKIP---QLPADAKIPRSACVRTYEVKES-QGVVWVWMSQK 193
+ C YHGW F G+ +P +L ++ + ++ ES G ++ Q+
Sbjct: 96 AKGFVCSYHGWGFGSNGELQSVPFEKELYGESLNKKCLGLKEVARVESFHGFIYGCFDQE 155
Query: 194 TPPNPDKLPWFENFARPGFQDVSTIHELPYDHSILLENLMDPAHIPISHDRTDWTAKRED 253
P D L D + E + HS LE + P + I + +W A E+
Sbjct: 156 APSLMDYLG-----------DAAWYLEPIFKHSGGLELVGPPGKVVI---KANWKAPAEN 201
>sp|Q93XE1|CHMO_AMATR Choline monooxygenase, chloroplastic OS=Amaranthus tricolor GN=CMO
PE=2 SV=1
Length = 442
Score = 38.9 bits (89), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 19/74 (25%), Positives = 35/74 (47%)
Query: 106 QIVLYKDGNGELRCYQDRCPHRLAKLSEGQLIDGRLECLYHGWQFEGEGKCVKIPQLPAD 165
+ ++ +DG G++ + + C HR + L+ G C YHGW F +G +K +
Sbjct: 147 EYLVCRDGQGKVHAFHNVCTHRASILACGTGKKSCFVCPYHGWVFGLDGSLMKATKTENQ 206
Query: 166 AKIPRSACVRTYEV 179
P+ + T +V
Sbjct: 207 VFDPKELGLVTLKV 220
>sp|Q51494|NDOB_PSEAI Naphthalene 1,2-dioxygenase subunit alpha OS=Pseudomonas aeruginosa
GN=ndoB PE=3 SV=1
Length = 449
Score = 38.5 bits (88), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 40/180 (22%), Positives = 73/180 (40%), Gaps = 30/180 (16%)
Query: 86 LYLTKDV----PDD---APLGLTVFDQQIVLYKDGNGELRCYQDRCPHRLAKLSEGQLID 138
L+LT D P D A +G+ ++++ + +G +R + + C HR L + +
Sbjct: 40 LFLTHDSLIPSPGDYVTAKMGV----DEVIVSRQNDGSIRAFLNVCRHRGKTLVHAEAGN 95
Query: 139 GR-LECLYHGWQFEGEGKCVKIP---QLPADAKIPRSACVRTYEVKES-QGVVWVWMSQK 193
+ C YHGW F G+ +P +L +A + ++ ES G ++ ++
Sbjct: 96 AKGFVCSYHGWGFGANGELQSVPFEKELYGEALDKKCMGLKEVARVESFHGFIYGCFDEE 155
Query: 194 TPPNPDKLPWFENFARPGFQDVSTIHELPYDHSILLENLMDPAHIPISHDRTDWTAKRED 253
P D + D E + HS LE + P + I + +W A E+
Sbjct: 156 APSLKDYMG-----------DAGWYLEPMFKHSGGLELIGPPGKVII---KANWKAPAEN 201
>sp|P23099|XYLX_PSEPU Toluate 1,2-dioxygenase subunit alpha OS=Pseudomonas putida GN=xylX
PE=3 SV=1
Length = 454
Score = 38.5 bits (88), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 28/102 (27%), Positives = 47/102 (46%), Gaps = 4/102 (3%)
Query: 83 WYPLYLTKDVPDDAPLGLTVFDQQ-IVLYKDGNGELRCYQDRCPHRLAKLSEGQLIDGRL 141
W L +P+ T +Q I + ++ +GEL + + C HR A L + +
Sbjct: 50 WIYLAHESQIPEKNDYYTTQMGRQPIFITRNKDGELNAFVNACSHRGATLCRFRSGNKAT 109
Query: 142 E-CLYHGWQFEGEGKCVKIPQLPADAKIPRS-ACVRTYEVKE 181
C +HGW F GK +K+ P A P S C ++++K+
Sbjct: 110 HTCSFHGWTFSNSGKLLKVKD-PKGAGYPDSFDCDGSHDLKK 150
>sp|Q83K39|HCAE_SHIFL 3-phenylpropionate/cinnamic acid dioxygenase subunit alpha
OS=Shigella flexneri GN=hcaE PE=3 SV=1
Length = 453
Score = 38.5 bits (88), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 18/76 (23%), Positives = 37/76 (48%), Gaps = 3/76 (3%)
Query: 107 IVLYKDGNGELRCYQDRCPHRLAKLSEGQLIDGR-LECLYHGWQFEGEGKCVKIPQLPAD 165
+V+ + +G ++ + ++C HR ++S + R C YHGW + G+ + +P P
Sbjct: 68 VVVVRQKDGSIKAFLNQCRHRAMRVSYADCGNSRAFTCPYHGWSYGINGELIDVPLEP-- 125
Query: 166 AKIPRSACVRTYEVKE 181
P+ C + + E
Sbjct: 126 RAYPQGLCKSHWGLNE 141
>sp|Q62059|CSPG2_MOUSE Versican core protein OS=Mus musculus GN=Vcan PE=1 SV=2
Length = 3357
Score = 38.5 bits (88), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 17/49 (34%), Positives = 25/49 (51%)
Query: 98 LGLTVFDQQIVLYKDGNGELRCYQDRCPHRLAKLSEGQLIDGRLECLYH 146
+G+ D++I + DG GE + D P L K+SE L G L +H
Sbjct: 832 IGINGKDKEIPSFTDGGGEYTLFPDGTPKPLEKVSEEDLASGELTVTFH 880
>sp|Q0T1Y1|HCAE_SHIF8 3-phenylpropionate/cinnamic acid dioxygenase subunit alpha
OS=Shigella flexneri serotype 5b (strain 8401) GN=hcaE
PE=3 SV=1
Length = 453
Score = 38.1 bits (87), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 18/76 (23%), Positives = 37/76 (48%), Gaps = 3/76 (3%)
Query: 107 IVLYKDGNGELRCYQDRCPHRLAKLSEGQLIDGR-LECLYHGWQFEGEGKCVKIPQLPAD 165
+V+ + +G ++ + ++C HR ++S + R C YHGW + G+ + +P P
Sbjct: 68 VVVVRQKDGSIKAFLNQCRHRAMRVSYADCGNTRAFTCPYHGWSYGINGELIDVPLEP-- 125
Query: 166 AKIPRSACVRTYEVKE 181
P+ C + + E
Sbjct: 126 RAYPQGLCKSHWGLNE 141
>sp|A8A344|HCAE_ECOHS 3-phenylpropionate/cinnamic acid dioxygenase subunit alpha
OS=Escherichia coli O9:H4 (strain HS) GN=hcaE PE=3 SV=1
Length = 453
Score = 38.1 bits (87), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 18/76 (23%), Positives = 37/76 (48%), Gaps = 3/76 (3%)
Query: 107 IVLYKDGNGELRCYQDRCPHRLAKLSEGQLIDGR-LECLYHGWQFEGEGKCVKIPQLPAD 165
+V+ + +G ++ + ++C HR ++S + R C YHGW + G+ + +P P
Sbjct: 68 VVVVRQKDGSIKAFLNQCRHRAMRVSYADCGNTRAFTCPYHGWSYGINGELIDVPLEP-- 125
Query: 166 AKIPRSACVRTYEVKE 181
P+ C + + E
Sbjct: 126 RAYPQGLCKSHWGLNE 141
>sp|Q3YZ15|HCAE_SHISS 3-phenylpropionate/cinnamic acid dioxygenase subunit alpha
OS=Shigella sonnei (strain Ss046) GN=hcaE PE=3 SV=1
Length = 453
Score = 38.1 bits (87), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 18/76 (23%), Positives = 37/76 (48%), Gaps = 3/76 (3%)
Query: 107 IVLYKDGNGELRCYQDRCPHRLAKLSEGQLIDGR-LECLYHGWQFEGEGKCVKIPQLPAD 165
+V+ + +G ++ + ++C HR ++S + R C YHGW + G+ + +P P
Sbjct: 68 VVVVRQKDGSIKAFLNQCRHRAMRVSYADCGNTRAFTCPYHGWSYGINGELIDVPLEP-- 125
Query: 166 AKIPRSACVRTYEVKE 181
P+ C + + E
Sbjct: 126 RAYPQGLCKSHWGLNE 141
>sp|Q31XV2|HCAE_SHIBS 3-phenylpropionate/cinnamic acid dioxygenase subunit alpha
OS=Shigella boydii serotype 4 (strain Sb227) GN=hcaE
PE=3 SV=1
Length = 453
Score = 38.1 bits (87), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 18/76 (23%), Positives = 37/76 (48%), Gaps = 3/76 (3%)
Query: 107 IVLYKDGNGELRCYQDRCPHRLAKLSEGQLIDGR-LECLYHGWQFEGEGKCVKIPQLPAD 165
+V+ + +G ++ + ++C HR ++S + R C YHGW + G+ + +P P
Sbjct: 68 VVVVRQKDGSIKAFLNQCRHRAMRVSYADCGNTRAFTCPYHGWSYGINGELIDVPLEP-- 125
Query: 166 AKIPRSACVRTYEVKE 181
P+ C + + E
Sbjct: 126 RAYPQGLCKSHWGLNE 141
>sp|P0ABR5|HCAE_ECOLI 3-phenylpropionate/cinnamic acid dioxygenase subunit alpha
OS=Escherichia coli (strain K12) GN=hcaE PE=1 SV=1
Length = 453
Score = 38.1 bits (87), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 18/76 (23%), Positives = 37/76 (48%), Gaps = 3/76 (3%)
Query: 107 IVLYKDGNGELRCYQDRCPHRLAKLSEGQLIDGR-LECLYHGWQFEGEGKCVKIPQLPAD 165
+V+ + +G ++ + ++C HR ++S + R C YHGW + G+ + +P P
Sbjct: 68 VVVVRQKDGSIKAFLNQCRHRAMRVSYADCGNTRAFTCPYHGWSYGINGELIDVPLEP-- 125
Query: 166 AKIPRSACVRTYEVKE 181
P+ C + + E
Sbjct: 126 RAYPQGLCKSHWGLNE 141
>sp|P0ABR6|HCAE_ECO57 3-phenylpropionate/cinnamic acid dioxygenase subunit alpha
OS=Escherichia coli O157:H7 GN=hcaE PE=3 SV=1
Length = 453
Score = 38.1 bits (87), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 18/76 (23%), Positives = 37/76 (48%), Gaps = 3/76 (3%)
Query: 107 IVLYKDGNGELRCYQDRCPHRLAKLSEGQLIDGR-LECLYHGWQFEGEGKCVKIPQLPAD 165
+V+ + +G ++ + ++C HR ++S + R C YHGW + G+ + +P P
Sbjct: 68 VVVVRQKDGSIKAFLNQCRHRAMRVSYADCGNTRAFTCPYHGWSYGINGELIDVPLEP-- 125
Query: 166 AKIPRSACVRTYEVKE 181
P+ C + + E
Sbjct: 126 RAYPQGLCKSHWGLNE 141
>sp|Q96NN9|AIFM3_HUMAN Apoptosis-inducing factor 3 OS=Homo sapiens GN=AIFM3 PE=1 SV=1
Length = 605
Score = 37.7 bits (86), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 19/45 (42%), Positives = 26/45 (57%), Gaps = 1/45 (2%)
Query: 106 QIVLYKDGNGELRCYQDRCPHRLAKLSEGQLIDGRLECLYHGWQF 150
+++L KD NGE +CPH A L +G L GR+ C +HG F
Sbjct: 92 KVLLVKD-NGEFHALGHKCPHYGAPLVKGVLSRGRVRCPWHGACF 135
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.320 0.135 0.426
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 212,147,497
Number of Sequences: 539616
Number of extensions: 9507912
Number of successful extensions: 19259
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 29
Number of HSP's successfully gapped in prelim test: 54
Number of HSP's that attempted gapping in prelim test: 19163
Number of HSP's gapped (non-prelim): 92
length of query: 534
length of database: 191,569,459
effective HSP length: 122
effective length of query: 412
effective length of database: 125,736,307
effective search space: 51803358484
effective search space used: 51803358484
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 64 (29.3 bits)