BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 025580
(250 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|Q949R4|DIOXL_ARATH 4,5-DOPA dioxygenase extradiol-like protein OS=Arabidopsis thaliana
GN=At4g15093 PE=2 SV=1
Length = 269
Score = 298 bits (763), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 146/203 (71%), Positives = 165/203 (81%), Gaps = 3/203 (1%)
Query: 51 VMDTFFISHGSPTLSIDESLPARGFLQAWQAKVFSQRPNSILVISAHWDTDFPSVNVVQR 110
V TFF+SHGSPTLSID+SL AR F ++W KV Q+P SILVISAHWDT FPSVN V R
Sbjct: 4 VNQTFFLSHGSPTLSIDDSLEARQFFKSWTQKVLPQKPKSILVISAHWDTKFPSVNTVLR 63
Query: 111 NDTIHDFYGFPKQMYDLKYPAPGAPELAKRVKDLL-KASGIKHVNEDRKRGLDHGAWVPL 169
N+TIHDF GFP MY LKY APGA EL KRVK+LL K G+K V+ED KRGLDHGAWVPL
Sbjct: 64 NNTIHDFSGFPDPMYKLKYEAPGAIELGKRVKELLMKEGGMKRVDEDTKRGLDHGAWVPL 123
Query: 170 MLMYPEADIPVCQLSVQMHHTGTYHYNIGKALAPLKEEGVLIIGSGSATHNLRALQFESS 229
MLMYPEADIP+CQLSVQ + G+YHYN+GKALA LK+EGVLIIGSGSATHNLR L F +
Sbjct: 124 MLMYPEADIPICQLSVQSNQNGSYHYNMGKALASLKDEGVLIIGSGSATHNLRKLDFNIT 183
Query: 230 SIS--SWALEFDNWLKDALLEGR 250
S WALEFD+WL+D+LL+GR
Sbjct: 184 DGSPVPWALEFDHWLRDSLLQGR 206
>sp|Q7XA48|DODA_PORGR 4,5-DOPA dioxygenase extradiol OS=Portulaca grandiflora GN=DODA
PE=2 SV=1
Length = 271
Score = 233 bits (595), Expect = 8e-61, Method: Compositional matrix adjust.
Identities = 111/207 (53%), Positives = 139/207 (67%)
Query: 44 GQQSRLSVMDTFFISHGSPTLSIDESLPARGFLQAWQAKVFSQRPNSILVISAHWDTDFP 103
G +S ++FF+SHG+P + DES AR FL W+ VF +P SILV+SAHW+TD P
Sbjct: 2 GVGKEVSFKESFFLSHGNPAMLADESFIARNFLLGWKKNVFPVKPKSILVVSAHWETDVP 61
Query: 104 SVNVVQRNDTIHDFYGFPKQMYDLKYPAPGAPELAKRVKDLLKASGIKHVNEDRKRGLDH 163
V+ Q + I+DF P M+ +KYPAPG P+LAKRV++LL A G K D +RG DH
Sbjct: 62 CVSAGQYPNVIYDFTEVPASMFQMKYPAPGCPKLAKRVQELLIAGGFKSAKLDEERGFDH 121
Query: 164 GAWVPLMLMYPEADIPVCQLSVQMHHTGTYHYNIGKALAPLKEEGVLIIGSGSATHNLRA 223
+WVPL +M PEADIPVCQLSVQ T+H+N+G+ALAPLK EGVL IGSG A H
Sbjct: 122 SSWVPLSMMCPEADIPVCQLSVQPGLDATHHFNVGRALAPLKGEGVLFIGSGGAVHPSDD 181
Query: 224 LQFESSSISSWALEFDNWLKDALLEGR 250
++ WA EFD WL+DALLEGR
Sbjct: 182 TPHWFDGVAPWAAEFDQWLEDALLEGR 208
>sp|P24197|YGID_ECOLI Uncharacterized protein YgiD OS=Escherichia coli (strain K12)
GN=ygiD PE=1 SV=3
Length = 262
Score = 151 bits (382), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 77/197 (39%), Positives = 120/197 (60%), Gaps = 8/197 (4%)
Query: 52 MDTFFISHGSPTLSIDESLPARGFLQAWQAKVFS-QRPNSILVISAHWDTDFPSVNVVQR 110
M F+ HGSP ++++L + ++WQ + RP +I+V+SAHW T V ++
Sbjct: 6 MPALFLGHGSPMNVLEDNL----YTRSWQKLGMTLPRPQAIVVVSAHWFTRGTGVTAMET 61
Query: 111 NDTIHDFYGFPKQMYDLKYPAPGAPELAKRVKDLLKASGIKHVNEDRKRGLDHGAWVPLM 170
TIHDF GFP+ +YD YPAPG+P LA+R+ +LL I + G DHG+W L+
Sbjct: 62 PPTIHDFGGFPQALYDTHYPAPGSPALAQRLVELLAP--IPVTLDKEAWGFDHGSWGVLI 119
Query: 171 LMYPEADIPVCQLSVQMHHTGTYHYNIGKALAPLKEEGVLIIGSGSATHNLRALQFES-S 229
MYP+ADIP+ QLS+ +H+ +G+ LA L++EG++++ SG+ HNLR +++ S
Sbjct: 120 KMYPDADIPMVQLSIDSSKPAAWHFEMGRKLAALRDEGIMLVASGNVVHNLRTVKWHGDS 179
Query: 230 SISSWALEFDNWLKDAL 246
S WA F+ ++K L
Sbjct: 180 SPYPWATSFNEYVKANL 196
>sp|O74741|DIOXL_SCHPO 4,5-DOPA dioxygenase extradiol-like protein OS=Schizosaccharomyces
pombe (strain 972 / ATCC 24843) GN=SPBC1709.16c PE=3
SV=2
Length = 297
Score = 84.3 bits (207), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 60/193 (31%), Positives = 94/193 (48%), Gaps = 23/193 (11%)
Query: 51 VMDTFFISHGSPTL-----SIDESLPARGFLQAWQAKVFSQ---------RPNSILVISA 96
++ +++HGSP L S DE L + Q RP I+V SA
Sbjct: 22 LIPALYLAHGSPFLMLPQSSDDEVFSNDSKLGGLHYQFLEQLGPFLLEKFRPKGIIVFSA 81
Query: 97 HWDTDFPSVNVVQRNDT---IHDFYGFPKQMYDLKYPAPGAPELAKRVKDLLKASGIKHV 153
H+++ SV V R+D +D+YGFP +Y +K+ + G+ +A ++ LK I
Sbjct: 82 HYESR-GSVEVYSRDDENPLFYDYYGFPDYLYQIKFHSKGSKRIADQIISALKEYQIPAK 140
Query: 154 NEDRKRGLDHGAWVPLMLMYPEA-DIPVCQLSVQMHHTGTYH-YNIGKALAPLKEEGVLI 211
RGLDHG +VP +M+P+ +IP+ ++S MH Y +G+AL L++E LI
Sbjct: 141 TVSGDRGLDHGVFVPFKIMFPDGLNIPLIEVS--MHTLDPMQLYKVGQALQSLRKE-YLI 197
Query: 212 IGSGSATHNLRAL 224
+ G H L
Sbjct: 198 VSGGLNIHTFEDL 210
>sp|Q05353|HPCB_ECOLX 3,4-dihydroxyphenylacetate 2,3-dioxygenase OS=Escherichia coli
GN=hpcB PE=1 SV=2
Length = 276
Score = 37.7 bits (86), Expect = 0.071, Method: Compositional matrix adjust.
Identities = 34/166 (20%), Positives = 70/166 (42%), Gaps = 6/166 (3%)
Query: 89 NSILVISAHWDTDFP-SVNVVQRNDTIHDFYGFPKQMYDLKYPAPGAPELAKRVKDLLKA 147
++I+V HW + +N + ++ P + D+ Y G PEL + + D
Sbjct: 49 DTIIVFDTHWLVNSAYHINCADHFEGVYTSNELPHFIRDMTYNYEGNPELGQLIADEALK 108
Query: 148 SGIK-HVNEDRKRGLDHGAWVPLMLMYPEADIPVCQLSV-QMHHTGTYHYNIGKALAPLK 205
G++ + L++G+ VP+ M + V +S H +G+ +
Sbjct: 109 LGVRAKAHNIPSLKLEYGSVVPMRYMNEDKRFKVVSISAFCTVHDFADSRKLGERIVKAI 168
Query: 206 EE---GVLIIGSGSATHNLRALQFESSSISSWALEFDNWLKDALLE 248
E+ V ++ SGS +H Q ++S+ EFD + + +++
Sbjct: 169 EQYDGTVAVLASGSLSHRFIDDQRAEEGMNSYTREFDRQMDERVVK 214
>sp|O28617|Y1656_ARCFU Uncharacterized protein AF_1656 OS=Archaeoglobus fulgidus (strain
ATCC 49558 / VC-16 / DSM 4304 / JCM 9628 / NBRC 100126)
GN=AF_1656 PE=4 SV=1
Length = 144
Score = 37.4 bits (85), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 32/106 (30%), Positives = 56/106 (52%), Gaps = 14/106 (13%)
Query: 95 SAHWDTD-FPSVNVVQRNDTIH-DFYGFPKQ--MYDLKY---PAPGAPELAKRVKDLLKA 147
S HW ++ +P+V+ V++ + DFYG P + YD+ + APG P+L+ D ++
Sbjct: 36 SGHWASEIYPTVSFVKKESPVKIDFYGKPNEEYSYDVIFRIVDAPGIPKLSWDGDDFVQF 95
Query: 148 SGIKHVNEDRKRGL-----DHGAWV-PLMLMYPEADIPVCQLSVQM 187
S K+VN D + G+ +G +V L + YP V ++ V +
Sbjct: 96 SLRKNVN-DWEVGMHLSPQTNGTYVGELTISYPSYPYKVVEIPVTI 140
>sp|B7K0A2|DNLJ_CYAP8 DNA ligase OS=Cyanothece sp. (strain PCC 8801) GN=ligA PE=3 SV=1
Length = 674
Score = 32.0 bits (71), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 49/163 (30%), Positives = 69/163 (42%), Gaps = 39/163 (23%)
Query: 36 ANPSNATRGQQSRL--SVMDTFFISHGSPTLSI--DESLPARGFLQAWQAKV-FSQRPNS 90
ANP NA G +L V+ + + TL + D SL ++ W K+ F PN
Sbjct: 200 ANPRNAAAGTLRQLDSKVVSERRLQFFAYTLHLPEDNSLSSQWESLEWLKKMGFLVNPNC 259
Query: 91 ILVIS--------AHWDT---DFPSVN---VVQRND-TIHDFYGF----PKQMYDLKYPA 131
L + HW+T D P + VV+ ND + + GF P+ LKYPA
Sbjct: 260 QLCPTLEAVADYFRHWETARHDLPYMTDGVVVKINDYALQNQLGFTQKFPRWAIALKYPA 319
Query: 132 PGAPELAKRVKDLLKASGIKHVNEDRKRGLDHGAWVPLMLMYP 174
AP RVK+++ VN R GA P+ +M P
Sbjct: 320 EEAP---TRVKEII-------VNVGRT-----GAVTPMAIMEP 347
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.322 0.136 0.418
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 95,959,144
Number of Sequences: 539616
Number of extensions: 4013096
Number of successful extensions: 9076
Number of sequences better than 100.0: 8
Number of HSP's better than 100.0 without gapping: 4
Number of HSP's successfully gapped in prelim test: 4
Number of HSP's that attempted gapping in prelim test: 9066
Number of HSP's gapped (non-prelim): 9
length of query: 250
length of database: 191,569,459
effective HSP length: 115
effective length of query: 135
effective length of database: 129,513,619
effective search space: 17484338565
effective search space used: 17484338565
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 60 (27.7 bits)