BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 027965
(216 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|Q949R4|DIOXL_ARATH 4,5-DOPA dioxygenase extradiol-like protein OS=Arabidopsis thaliana
GN=At4g15093 PE=2 SV=1
Length = 269
Score = 298 bits (762), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 146/203 (71%), Positives = 165/203 (81%), Gaps = 3/203 (1%)
Query: 17 VMDTFFISHGSPTLSIDESLPARGFLQAWQAKVFSQRPNSILVISAHWDTDFPSVNVVQR 76
V TFF+SHGSPTLSID+SL AR F ++W KV Q+P SILVISAHWDT FPSVN V R
Sbjct: 4 VNQTFFLSHGSPTLSIDDSLEARQFFKSWTQKVLPQKPKSILVISAHWDTKFPSVNTVLR 63
Query: 77 NDTIHDFYGFPKQMYDLKYPAPGAPELAKRVKDLL-KASGIKHVNEDRKRGLDHGAWVPL 135
N+TIHDF GFP MY LKY APGA EL KRVK+LL K G+K V+ED KRGLDHGAWVPL
Sbjct: 64 NNTIHDFSGFPDPMYKLKYEAPGAIELGKRVKELLMKEGGMKRVDEDTKRGLDHGAWVPL 123
Query: 136 MLMYPEADIPVCQLSVQMHHTGTYHYNIGKALAPLKEEGVLIIGSGSATHNLRALQFESS 195
MLMYPEADIP+CQLSVQ + G+YHYN+GKALA LK+EGVLIIGSGSATHNLR L F +
Sbjct: 124 MLMYPEADIPICQLSVQSNQNGSYHYNMGKALASLKDEGVLIIGSGSATHNLRKLDFNIT 183
Query: 196 SIS--SWALEFDNWLKDALLEGR 216
S WALEFD+WL+D+LL+GR
Sbjct: 184 DGSPVPWALEFDHWLRDSLLQGR 206
>sp|Q7XA48|DODA_PORGR 4,5-DOPA dioxygenase extradiol OS=Portulaca grandiflora GN=DODA
PE=2 SV=1
Length = 271
Score = 232 bits (592), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 110/208 (52%), Positives = 139/208 (66%)
Query: 9 IAAKSRLSVMDTFFISHGSPTLSIDESLPARGFLQAWQAKVFSQRPNSILVISAHWDTDF 68
+ +S ++FF+SHG+P + DES AR FL W+ VF +P SILV+SAHW+TD
Sbjct: 1 MGVGKEVSFKESFFLSHGNPAMLADESFIARNFLLGWKKNVFPVKPKSILVVSAHWETDV 60
Query: 69 PSVNVVQRNDTIHDFYGFPKQMYDLKYPAPGAPELAKRVKDLLKASGIKHVNEDRKRGLD 128
P V+ Q + I+DF P M+ +KYPAPG P+LAKRV++LL A G K D +RG D
Sbjct: 61 PCVSAGQYPNVIYDFTEVPASMFQMKYPAPGCPKLAKRVQELLIAGGFKSAKLDEERGFD 120
Query: 129 HGAWVPLMLMYPEADIPVCQLSVQMHHTGTYHYNIGKALAPLKEEGVLIIGSGSATHNLR 188
H +WVPL +M PEADIPVCQLSVQ T+H+N+G+ALAPLK EGVL IGSG A H
Sbjct: 121 HSSWVPLSMMCPEADIPVCQLSVQPGLDATHHFNVGRALAPLKGEGVLFIGSGGAVHPSD 180
Query: 189 ALQFESSSISSWALEFDNWLKDALLEGR 216
++ WA EFD WL+DALLEGR
Sbjct: 181 DTPHWFDGVAPWAAEFDQWLEDALLEGR 208
>sp|P24197|YGID_ECOLI Uncharacterized protein YgiD OS=Escherichia coli (strain K12)
GN=ygiD PE=1 SV=3
Length = 262
Score = 151 bits (382), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 77/197 (39%), Positives = 120/197 (60%), Gaps = 8/197 (4%)
Query: 18 MDTFFISHGSPTLSIDESLPARGFLQAWQAKVFS-QRPNSILVISAHWDTDFPSVNVVQR 76
M F+ HGSP ++++L + ++WQ + RP +I+V+SAHW T V ++
Sbjct: 6 MPALFLGHGSPMNVLEDNL----YTRSWQKLGMTLPRPQAIVVVSAHWFTRGTGVTAMET 61
Query: 77 NDTIHDFYGFPKQMYDLKYPAPGAPELAKRVKDLLKASGIKHVNEDRKRGLDHGAWVPLM 136
TIHDF GFP+ +YD YPAPG+P LA+R+ +LL I + G DHG+W L+
Sbjct: 62 PPTIHDFGGFPQALYDTHYPAPGSPALAQRLVELLAP--IPVTLDKEAWGFDHGSWGVLI 119
Query: 137 LMYPEADIPVCQLSVQMHHTGTYHYNIGKALAPLKEEGVLIIGSGSATHNLRALQFES-S 195
MYP+ADIP+ QLS+ +H+ +G+ LA L++EG++++ SG+ HNLR +++ S
Sbjct: 120 KMYPDADIPMVQLSIDSSKPAAWHFEMGRKLAALRDEGIMLVASGNVVHNLRTVKWHGDS 179
Query: 196 SISSWALEFDNWLKDAL 212
S WA F+ ++K L
Sbjct: 180 SPYPWATSFNEYVKANL 196
>sp|O74741|DIOXL_SCHPO 4,5-DOPA dioxygenase extradiol-like protein OS=Schizosaccharomyces
pombe (strain 972 / ATCC 24843) GN=SPBC1709.16c PE=3
SV=2
Length = 297
Score = 85.1 bits (209), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 63/208 (30%), Positives = 99/208 (47%), Gaps = 24/208 (11%)
Query: 17 VMDTFFISHGSPTL-----SIDESLPARGFLQAWQAKVFSQ---------RPNSILVISA 62
++ +++HGSP L S DE L + Q RP I+V SA
Sbjct: 22 LIPALYLAHGSPFLMLPQSSDDEVFSNDSKLGGLHYQFLEQLGPFLLEKFRPKGIIVFSA 81
Query: 63 HWDTDFPSVNVVQRNDT---IHDFYGFPKQMYDLKYPAPGAPELAKRVKDLLKASGIKHV 119
H+++ SV V R+D +D+YGFP +Y +K+ + G+ +A ++ LK I
Sbjct: 82 HYESR-GSVEVYSRDDENPLFYDYYGFPDYLYQIKFHSKGSKRIADQIISALKEYQIPAK 140
Query: 120 NEDRKRGLDHGAWVPLMLMYPEA-DIPVCQLSVQMHHTGTYH-YNIGKALAPLKEEGVLI 177
RGLDHG +VP +M+P+ +IP+ ++S MH Y +G+AL L++E LI
Sbjct: 141 TVSGDRGLDHGVFVPFKIMFPDGLNIPLIEVS--MHTLDPMQLYKVGQALQSLRKE-YLI 197
Query: 178 IGSGSATHNLRALQ-FESSSISSWALEF 204
+ G H L F + + EF
Sbjct: 198 VSGGLNIHTFEDLSAFNEDTAADGYKEF 225
>sp|Q05353|HPCB_ECOLX 3,4-dihydroxyphenylacetate 2,3-dioxygenase OS=Escherichia coli
GN=hpcB PE=1 SV=2
Length = 276
Score = 38.9 bits (89), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 34/166 (20%), Positives = 70/166 (42%), Gaps = 6/166 (3%)
Query: 55 NSILVISAHWDTDFP-SVNVVQRNDTIHDFYGFPKQMYDLKYPAPGAPELAKRVKDLLKA 113
++I+V HW + +N + ++ P + D+ Y G PEL + + D
Sbjct: 49 DTIIVFDTHWLVNSAYHINCADHFEGVYTSNELPHFIRDMTYNYEGNPELGQLIADEALK 108
Query: 114 SGIK-HVNEDRKRGLDHGAWVPLMLMYPEADIPVCQLSVQMH-HTGTYHYNIGKALAPLK 171
G++ + L++G+ VP+ M + V +S H +G+ +
Sbjct: 109 LGVRAKAHNIPSLKLEYGSVVPMRYMNEDKRFKVVSISAFCTVHDFADSRKLGERIVKAI 168
Query: 172 EE---GVLIIGSGSATHNLRALQFESSSISSWALEFDNWLKDALLE 214
E+ V ++ SGS +H Q ++S+ EFD + + +++
Sbjct: 169 EQYDGTVAVLASGSLSHRFIDDQRAEEGMNSYTREFDRQMDERVVK 214
>sp|O28617|Y1656_ARCFU Uncharacterized protein AF_1656 OS=Archaeoglobus fulgidus (strain
ATCC 49558 / VC-16 / DSM 4304 / JCM 9628 / NBRC 100126)
GN=AF_1656 PE=4 SV=1
Length = 144
Score = 37.4 bits (85), Expect = 0.083, Method: Compositional matrix adjust.
Identities = 32/106 (30%), Positives = 56/106 (52%), Gaps = 14/106 (13%)
Query: 61 SAHWDTD-FPSVNVVQRNDTIH-DFYGFPKQ--MYDLKY---PAPGAPELAKRVKDLLKA 113
S HW ++ +P+V+ V++ + DFYG P + YD+ + APG P+L+ D ++
Sbjct: 36 SGHWASEIYPTVSFVKKESPVKIDFYGKPNEEYSYDVIFRIVDAPGIPKLSWDGDDFVQF 95
Query: 114 SGIKHVNEDRKRGL-----DHGAWV-PLMLMYPEADIPVCQLSVQM 153
S K+VN D + G+ +G +V L + YP V ++ V +
Sbjct: 96 SLRKNVN-DWEVGMHLSPQTNGTYVGELTISYPSYPYKVVEIPVTI 140
>sp|Q82CD3|SSUB2_STRAW Aliphatic sulfonates import ATP-binding protein SsuB 2
OS=Streptomyces avermitilis (strain ATCC 31267 / DSM
46492 / JCM 5070 / NCIMB 12804 / NRRL 8165 / MA-4680)
GN=ssuB2 PE=3 SV=1
Length = 250
Score = 31.6 bits (70), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 24/80 (30%), Positives = 38/80 (47%), Gaps = 9/80 (11%)
Query: 143 DIPVCQLSVQMHHTGTYHYNIGKALAPLKEEGVLIIGSGSATHNLR-ALQFESSSISSWA 201
D+P Q + + H+G+ + +A+A L E + GSG T R ++ F+ S + W
Sbjct: 34 DLPAGQFTALLGHSGSGKSTLLRAVAGLDHE---VTGSGQLTAPERVSVVFQDSRLLPWR 90
Query: 202 LEFDNWL-----KDALLEGR 216
DN L K+A GR
Sbjct: 91 RVLDNVLLGTDGKEAAERGR 110
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.320 0.135 0.414
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 84,868,619
Number of Sequences: 539616
Number of extensions: 3558996
Number of successful extensions: 8130
Number of sequences better than 100.0: 7
Number of HSP's better than 100.0 without gapping: 4
Number of HSP's successfully gapped in prelim test: 3
Number of HSP's that attempted gapping in prelim test: 8120
Number of HSP's gapped (non-prelim): 8
length of query: 216
length of database: 191,569,459
effective HSP length: 113
effective length of query: 103
effective length of database: 130,592,851
effective search space: 13451063653
effective search space used: 13451063653
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 59 (27.3 bits)