BLAST Results

Query Summary

Your job contains 1 sequence.

Parameters
Threshold: 0.001
Maximum number of alignments shown: 100
BLAST filter: on

Query Sequence

>020012
MMYSLRRVMAQIGRSQTTVKRIILYSPAHFTRPFSSRAHQFTKATTKTTTSKDVVWPKPS
EIPFQVKVANSVNLIGHVDAPVQFQTSSDGKHWAGTVIVQHAASHSLWIPILFEGDLAHI
ASSHLKKDDHVHIAGQLTADPPAIEGQANVQVMVHSLNLIEPTSQKRMFFVSKKQEAATV
DHSVKISSSKKDGDSALSSWRDLLDNPEQWRDYRSDKLKGLVKPRYPDFKRKDGTLPLWL
NSAPDWVLSELEGVVFDKSKPVLDDQTRKSNYVKKSKGVVFDKSKPVLDDQTQKSNYVKK
SKVDDLWKDLVENPDKWWDNRLDKVILYNLCF

High Scoring Gene Products

Symbol, full name Information P value
OSB3
AT5G44785
protein from Arabidopsis thaliana 1.1e-56
PTAC9
AT4G20010
protein from Arabidopsis thaliana 1.9e-51
OSB4
AT1G31010
protein from Arabidopsis thaliana 5.1e-37
OSB1
Organellar Single-stranded
protein from Arabidopsis thaliana 4.1e-16

Back to top

Raw Blast Data

BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]

Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.

Reference:  Gish, W. (1996-2006) http://blast.wustl.edu

Query=  020012
        (332 letters)

Database:  go_20130330-seqdb.fasta
           368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done

                                                                     Smallest
                                                                       Sum
                                                              High  Probability
Sequences producing High-scoring Segment Pairs:              Score  P(N)      N

TAIR|locus:505006672 - symbol:OSB3 "organellar single-str...   476  1.1e-56   2
TAIR|locus:2119767 - symbol:PTAC9 "plastid transcriptiona...   534  1.9e-51   1
TAIR|locus:2015721 - symbol:OSB4 "organellar single-stran...   326  5.1e-37   2
TAIR|locus:2015353 - symbol:OSB1 "Organellar Single-stran...   202  4.1e-16   1


>TAIR|locus:505006672 [details] [associations]
            symbol:OSB3 "organellar single-stranded DNA binding
            protein 3" species:3702 "Arabidopsis thaliana" [GO:0003697
            "single-stranded DNA binding" evidence=IEA] [GO:0008150
            "biological_process" evidence=ND] [GO:0009507 "chloroplast"
            evidence=ISM;IDA] [GO:0005739 "mitochondrion" evidence=IDA]
            InterPro:IPR000424 PROSITE:PS50935 GO:GO:0005739 EMBL:CP002688
            GO:GO:0009507 GO:GO:0003697 Gene3D:2.40.50.140 InterPro:IPR012340
            SUPFAM:SSF50249 IPI:IPI00535730 RefSeq:NP_974885.1 UniGene:At.19937
            ProteinModelPortal:F4KBP3 PRIDE:F4KBP3 EnsemblPlants:AT5G44785.2
            GeneID:834508 KEGG:ath:AT5G44785 OMA:EISNWIN Uniprot:F4KBP3
        Length = 442

 Score = 476 (172.6 bits), Expect = 1.1e-56, Sum P(2) = 1.1e-56
 Identities = 102/246 (41%), Positives = 151/246 (61%)

Query:    59 PSEIPFQVKVANSVNLIGHVDAPVQFQTSSDGKHWAGTVIVQHAASHS--LWIPILFEGD 116
             P +I ++ +++N +NLIG V+ PVQF   SDGK WAGTVI Q + S S   WIPI+FEGD
Sbjct:    71 PKKIEYKPEISNWINLIGFVEQPVQFGPCSDGKFWAGTVISQRSGSKSSNFWIPIIFEGD 130

Query:   117 LAHIASSHLKKDDHVHIAGQLTAD--PPAIE-GQANVQVMVHSLNLIEPTSQKRMFFVSK 173
             LA IA  H+KK+D +H++G+L  D  PP +   Q+NVQVMV +LN ++  +         
Sbjct:   131 LAKIAVQHVKKEDRIHVSGKLFIDSPPPNVTYSQSNVQVMVQNLNFVQAATSTTKTISPP 190

Query:   174 KQEAATVDHSVKISSSKKDGDSALS-SWRDLLDNPEQWRDYRSDKLKGLVKPRYPDFKRK 232
             ++E  ++      S   K  D   S SW+ L++NP++W D+R +K  GLVKP +PDFK K
Sbjct:   191 EKEVTSIKKKPARSKKVKVIDEETSNSWKHLIENPKEWLDHRGNKANGLVKPGHPDFKMK 250

Query:   233 DGTLPLWLNSAPDWVLSELEGVVFDKSKPVLDDQTRKSNYVKKS--KGVVFDKSKPVLDD 290
              G L LWL++APDW L +LE + FD   P  + +  +   + +   K +V +  K  LD+
Sbjct:   251 VGGLSLWLSTAPDWALLKLEELKFDVLVPKGNIKLNQLKDIGEESWKDLVQNPDK-WLDN 309

Query:   291 QTQKSN 296
             ++ K+N
Sbjct:   310 RSDKTN 315

 Score = 158 (60.7 bits), Expect = 1.1e-17, Sum P(2) = 1.1e-17
 Identities = 35/97 (36%), Positives = 58/97 (59%)

Query:   183 SVKISSSKKDGDSALSSWRDLLDNPEQWRDYRSDKLKGLVKPRYPDFKRKDGTLPLWLNS 242
             ++K++  K  G+    SW+DL+ NP++W D RSDK    VK  YPDFK K+    LW+ +
Sbjct:   282 NIKLNQLKDIGEE---SWKDLVQNPDKWLDNRSDKTN--VK--YPDFKHKETGEALWMTN 334

Query:   243 APDWVLSELEGVVFDKSKPVLDDQTRKSNY-VKKSKG 278
             +P WVLS+L  +  ++ +P + ++  +    V+  KG
Sbjct:   335 SPIWVLSKLPPLKKNQERPFMSNKVSQLELDVEVPKG 371

 Score = 134 (52.2 bits), Expect = 5.0e-06, P = 5.0e-06
 Identities = 32/100 (32%), Positives = 54/100 (54%)

Query:   164 SQKRMFFVSKKQEAATVDHSVKISSSKKDGDSALSSWRDLLDNPEQWRDYRSDKLKGLVK 223
             +Q+R F +S K     +D  V   + K+     +  W++L++NP +W D R DK      
Sbjct:   349 NQERPF-MSNKVSQLELDVEVPKGNLKQLKREEI--WKNLVENPSKWWDNRLDKRN---- 401

Query:   224 PRYPDFKRKDGTLPLWLNSAPDWVLSELEGVVFDKSKPVL 263
             P+ PDFK K+    LW+  +P W LS+L  +  ++ +PV+
Sbjct:   402 PKGPDFKHKETGEALWIGDSPTWALSKLPPLKKNQERPVM 441

 Score = 125 (49.1 bits), Expect = 1.1e-56, Sum P(2) = 1.1e-56
 Identities = 22/37 (59%), Positives = 29/37 (78%)

Query:   288 LDDQTQKSNYVKKSKVDDLWKDLVENPDKWWDNRLDK 324
             LD +  K N +K+ K +++WK+LVENP KWWDNRLDK
Sbjct:   364 LDVEVPKGN-LKQLKREEIWKNLVENPSKWWDNRLDK 399

 Score = 39 (18.8 bits), Expect = 6.2e-05, Sum P(2) = 6.2e-05
 Identities = 14/59 (23%), Positives = 23/59 (38%)

Query:   151 QVMVHSLNLIEPTSQKRMFFVSKKQEAATVDHSVKISSSKKDGDSALSSWRDLLDNPEQ 209
             Q+ V S  +I    +K +  VS K               K +    +S+W +L+   EQ
Sbjct:    34 QIRVFSATVISGGGKKPLAKVSVKPPLNVATEKESTPPKKIEYKPEISNWINLIGFVEQ 92


>TAIR|locus:2119767 [details] [associations]
            symbol:PTAC9 "plastid transcriptionally active 9"
            species:3702 "Arabidopsis thaliana" [GO:0003697 "single-stranded
            DNA binding" evidence=IEA] [GO:0008150 "biological_process"
            evidence=ND] [GO:0009507 "chloroplast" evidence=ISM] [GO:0009295
            "nucleoid" evidence=IDA] [GO:0009508 "plastid chromosome"
            evidence=IDA] InterPro:IPR000424 PROSITE:PS50935 GO:GO:0009507
            EMBL:CP002687 GenomeReviews:CT486007_GR EMBL:AL021637 EMBL:AL161552
            GO:GO:0003697 Gene3D:2.40.50.140 InterPro:IPR012340 SUPFAM:SSF50249
            EMBL:AK118242 EMBL:BT026461 IPI:IPI00529933 IPI:IPI00656999
            PIR:T04885 RefSeq:NP_001031674.1 RefSeq:NP_567593.1 UniGene:At.1899
            ProteinModelPortal:Q8GXH3 SMR:Q8GXH3 STRING:Q8GXH3 PaxDb:Q8GXH3
            PRIDE:Q8GXH3 EnsemblPlants:AT4G20010.1 GeneID:827746
            KEGG:ath:AT4G20010 TAIR:At4g20010 eggNOG:NOG326142
            HOGENOM:HOG000114724 InParanoid:O49428 OMA:PNEIAYE PhylomeDB:Q8GXH3
            ProtClustDB:CLSN2688130 Genevestigator:Q8GXH3 GO:GO:0009508
            Uniprot:Q8GXH3
        Length = 371

 Score = 534 (193.0 bits), Expect = 1.9e-51, P = 1.9e-51
 Identities = 113/230 (49%), Positives = 150/230 (65%)

Query:    57 PKPSEIPFQVKVANSVNLIGHVDAPVQFQTSSDGKHWAGTVIVQHAASHS--LWIPILFE 114
             P+P+EI ++ +VAN VNLIG VD PVQF+ SSDGK WAGTVI Q +AS S   WIPI+FE
Sbjct:    86 PRPNEIAYESEVANWVNLIGFVDQPVQFEASSDGKFWAGTVISQRSASDSSGFWIPIIFE 145

Query:   115 GDLAHIASSHLKKDDHVHIAGQLTAD--PPAIE-GQANVQVMVHSLNLIEPTSQKRM-FF 170
             GDLA  A+ ++ KDD +H++G+L  D  PP +   QANVQV+V +LN I+P S     F 
Sbjct:   146 GDLAKTAARYVSKDDQIHVSGKLFIDSPPPNMTYAQANVQVLVQNLNFIQPMSPSPSPFM 205

Query:   171 VSKKQEAATVDHSVKISSSKKDG--DSALSSWRDLLDNPEQWRDYRSDKLKGLVKPRYPD 228
             V    E        + + +K+D   D A  SW  L++NP++W D+R +K+ GLVKPR+PD
Sbjct:   206 VMSSSEKEESGIKKQPARAKQDIVIDEASDSWNHLIENPKEWWDHRENKVNGLVKPRHPD 265

Query:   229 FKRKDGTLPLWLNSAPDWVLSELEGVVFDKSKPVLDDQTRKSNYVKKSKG 278
             FK KD +  LWLN AP+WVL +LEG+ FD   P       K+  VK+ KG
Sbjct:   266 FKSKDSSFSLWLNKAPNWVLPKLEGLEFDVLVP-------KARVVKQLKG 308

 Score = 132 (51.5 bits), Expect = 6.2e-06, P = 6.2e-06
 Identities = 26/56 (46%), Positives = 34/56 (60%)

Query:   199 SWRDLLDNPEQWRDYRSDKLKGLVKPRYPDFKRKDGTLPLWLNSAPDWVLSELEGV 254
             SW+DL+ NP++W D R DK       + PDFK K+    LWLN +P WVL +L  V
Sbjct:   311 SWKDLVQNPDKWWDNRIDKRNA----KAPDFKHKETGEALWLNESPTWVLPKLPPV 362

 Score = 130 (50.8 bits), Expect = 1.0e-05, P = 1.0e-05
 Identities = 27/77 (35%), Positives = 44/77 (57%)

Query:   249 SELEGVVFDKSKPVLDDQTRKSNYVKKSKGVVFDKSKPV-LDDQTQKSNYVKKSKVDDLW 307
             +++ G+V  +        +  S ++ K+   V  K + +  D    K+  VK+ K ++ W
Sbjct:   253 NKVNGLVKPRHPDFKSKDSSFSLWLNKAPNWVLPKLEGLEFDVLVPKARVVKQLKGEESW 312

Query:   308 KDLVENPDKWWDNRLDK 324
             KDLV+NPDKWWDNR+DK
Sbjct:   313 KDLVQNPDKWWDNRIDK 329


>TAIR|locus:2015721 [details] [associations]
            symbol:OSB4 "organellar single-stranded DNA binding
            protein 4" species:3702 "Arabidopsis thaliana" [GO:0003697
            "single-stranded DNA binding" evidence=IEA] [GO:0008150
            "biological_process" evidence=ND] [GO:0009507 "chloroplast"
            evidence=ISM] InterPro:IPR000424 PROSITE:PS50935 EMBL:CP002684
            GenomeReviews:CT485782_GR GO:GO:0009507 GO:GO:0003697
            Gene3D:2.40.50.140 InterPro:IPR012340 SUPFAM:SSF50249 EMBL:AC000107
            HOGENOM:HOG000114724 ProtClustDB:CLSN2688130 EMBL:AY086580
            IPI:IPI00528208 PIR:G86435 RefSeq:NP_564370.1 UniGene:At.40440
            ProteinModelPortal:Q9FYJ2 SMR:Q9FYJ2 PaxDb:Q9FYJ2 PRIDE:Q9FYJ2
            EnsemblPlants:AT1G31010.1 GeneID:839987 KEGG:ath:AT1G31010
            TAIR:At1g31010 eggNOG:NOG304474 InParanoid:Q9FYJ2 OMA:VISHEPS
            PhylomeDB:Q9FYJ2 Genevestigator:Q9FYJ2 Uniprot:Q9FYJ2
        Length = 360

 Score = 326 (119.8 bits), Expect = 2.1e-29, P = 2.1e-29
 Identities = 86/234 (36%), Positives = 124/234 (52%)

Query:   103 ASHSLWIPILFEGDLAHIASSHLKKDDHVHIAGQLTAD---PPAIEGQANVQ-------- 151
             +S + WIP+LFEGDLAH A+S+LKK+D VHI GQ+  D     A   QA+VQ        
Sbjct:   115 SSSNFWIPVLFEGDLAHTANSYLKKNDRVHITGQILGDVIQSGANSDQAHVQLFKSFHGS 174

Query:   152 ----VMVHSLNLIEPTSQKRMFFVSKKQEAATVDHSVKISSSKKDGDSALSSWRDLLDNP 207
                 VMV  L+ IE +        +  Q    + HS  +   ++ G +    W DL+D P
Sbjct:   175 FSHQVMVRDLHYIEGSKAMPKVLPTLDQNEGVLKHSASVQRGREFGTNL---WFDLVDKP 231

Query:   208 EQWRDYRSDKLKGLVKPRYPDFKRKDGTLPLWLNSAPDWVLSELEGVVFDKSKPVLDDQT 267
              +W DYR  K  G V P++PDFK+KDG+  LWLN+AP  +LSEL+ V FD  K     + 
Sbjct:   232 NEWCDYREMKQNGSVNPKHPDFKKKDGSQALWLNNAPTEILSELKDVKFDIPKYAKQPKA 291

Query:   268 RKSNYVKKSKGVVFDKSK---PVLDDQTQKS-NYVKKSKVDDLWKDLVENPDKW 317
              + ++    K +V + +K     +D +T KS ++  K     LW  L ++P  W
Sbjct:   292 GEESW----KDLVDNMNKWWDNRVDKRTPKSPDFKHKETGVGLW--LSDSPS-W 338

 Score = 280 (103.6 bits), Expect = 5.1e-37, Sum P(2) = 5.1e-37
 Identities = 57/109 (52%), Positives = 74/109 (67%)

Query:    56 WPKPSEIPFQVKVANSVNLIGHVDAPVQFQTSSDGKHWAGTVIVQHAASHS--------- 106
             WP+P E+P+Q K+ANS++LIG+V  PVQF ++ DGK WAGTVI    +S S         
Sbjct:    59 WPRPMEVPYQPKIANSIDLIGYVHQPVQFDSTLDGKFWAGTVISHEPSSDSKSESDSSSN 118

Query:   107 LWIPILFEGDLAHIASSHLKKDDHVHIAGQLTAD---PPAIEGQANVQV 152
              WIP+LFEGDLAH A+S+LKK+D VHI GQ+  D     A   QA+VQ+
Sbjct:   119 FWIPVLFEGDLAHTANSYLKKNDRVHITGQILGDVIQSGANSDQAHVQL 167

 Score = 134 (52.2 bits), Expect = 5.1e-37, Sum P(2) = 5.1e-37
 Identities = 28/67 (41%), Positives = 39/67 (58%)

Query:   185 KISSSKKDGDSALSSWRDLLDNPEQWRDYRSDKLKGLVKPRYPDFKRKDGTLPLWLNSAP 244
             K +   K G+    SW+DL+DN  +W D R DK      P+ PDFK K+  + LWL+ +P
Sbjct:   284 KYAKQPKAGEE---SWKDLVDNMNKWWDNRVDKRT----PKSPDFKHKETGVGLWLSDSP 336

Query:   245 DWVLSEL 251
              WVL +L
Sbjct:   337 SWVLEKL 343

 Score = 103 (41.3 bits), Expect = 9.0e-34, Sum P(2) = 9.0e-34
 Identities = 17/29 (58%), Positives = 23/29 (79%)

Query:   297 YVKKSKV-DDLWKDLVENPDKWWDNRLDK 324
             Y K+ K  ++ WKDLV+N +KWWDNR+DK
Sbjct:   285 YAKQPKAGEESWKDLVDNMNKWWDNRVDK 313

 Score = 65 (27.9 bits), Expect = 8.6e-30, Sum P(2) = 8.6e-30
 Identities = 19/59 (32%), Positives = 31/59 (52%)

Query:   268 RKSNYVKKSKGVVFDKSKPVLDDQT---QKSNYVKKSKV--DDLWKDLVENPDKWWDNR 321
             R  +Y++ SK +   K  P LD      + S  V++ +    +LW DLV+ P++W D R
Sbjct:   182 RDLHYIEGSKAM--PKVLPTLDQNEGVLKHSASVQRGREFGTNLWFDLVDKPNEWCDYR 238


>TAIR|locus:2015353 [details] [associations]
            symbol:OSB1 "Organellar Single-stranded" species:3702
            "Arabidopsis thaliana" [GO:0003697 "single-stranded DNA binding"
            evidence=IEA;IDA] [GO:0009507 "chloroplast" evidence=ISM]
            [GO:0000002 "mitochondrial genome maintenance" evidence=IMP]
            [GO:0005739 "mitochondrion" evidence=IDA] [GO:0045910 "negative
            regulation of DNA recombination" evidence=IMP] [GO:0048046
            "apoplast" evidence=IDA] [GO:0006355 "regulation of transcription,
            DNA-dependent" evidence=RCA] [GO:0043687 "post-translational
            protein modification" evidence=RCA] [GO:0045893 "positive
            regulation of transcription, DNA-dependent" evidence=RCA]
            InterPro:IPR000424 PROSITE:PS50935 EMBL:CP002684
            GenomeReviews:CT485782_GR GO:GO:0005739 GO:GO:0048046 GO:GO:0003697
            GO:GO:0000002 InterPro:IPR012340 SUPFAM:SSF50249 EMBL:AC012463
            EMBL:AC007519 GO:GO:0045910 EMBL:BT005269 EMBL:AK119114
            IPI:IPI00522831 RefSeq:NP_175203.2 UniGene:At.38566
            ProteinModelPortal:Q9SX99 STRING:Q9SX99 EnsemblPlants:AT1G47720.1
            GeneID:841183 KEGG:ath:AT1G47720 TAIR:At1g47720 eggNOG:NOG311682
            HOGENOM:HOG000092997 InParanoid:Q9SX99 OMA:SAVYHHA PhylomeDB:Q9SX99
            ProtClustDB:CLSN2918339 Genevestigator:Q9SX99 Uniprot:Q9SX99
        Length = 261

 Score = 202 (76.2 bits), Expect = 4.1e-16, P = 4.1e-16
 Identities = 68/233 (29%), Positives = 107/233 (45%)

Query:    55 VWPKPSEIPFQVKVANSVNLIGHVDAPVQFQTSSDGKHWAGTVI-VQHAAS--HSLWIPI 111
             ++ KP     +  + NSV+L+G VD  +Q   +   +    T++ V+   +   S  I +
Sbjct:    41 LFKKPLSTKLKFNLVNSVSLMGFVDRSIQVMNTGPDRFGVFTILRVKDPLNPNRSFRISL 100

Query:   112 LFEGDLAHIASSHLKKDDHVHIAGQLTA----DPPAIEG-QANVQVMVHSLNLIE-PTSQ 165
                  +A    +HLK +DH+ ++G+L +          G   + QV V  +N +  P S 
Sbjct:   101 RMWDAMARTCIAHLKLNDHILVSGRLESYSKSSSDVYSGLNLDYQVKVAEVNYVAAPPSH 160

Query:   166 KRMFFVSKKQEAATVDHSVKISSSKKDGDSALSSWRDLLDNPEQWRDYRSDKLKGLVKPR 225
                  +SK  +  T D    I  SKKD    +  W+    NP  W D R +K      P+
Sbjct:   161 VLDSQISKNPKTKTEDD---IEESKKD---EIYLWQVFFSNPYDWWDNRRNKKN----PK 210

Query:   226 YPDFKRKDGTLPLWLNS-APDWVLSELEGVVFDKSKPVLDDQ-TRK---SNYV 273
              PDFK KD    LWL S  PDW+   LE  +FD+     D++ TR+   S+Y+
Sbjct:   211 QPDFKHKDTGEALWLCSDLPDWITRRLE--LFDQKNRFYDEEKTRRDRLSDYI 261


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.317   0.132   0.404    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      332       321   0.00085  116 3  11 22  0.41    34
                                                     33  0.45    37


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  4
  No. of states in DFA:  620 (66 KB)
  Total size of DFA:  242 KB (2128 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  27.25u 0.10s 27.35t   Elapsed:  00:00:01
  Total cpu time:  27.25u 0.10s 27.35t   Elapsed:  00:00:01
  Start:  Fri May 10 20:11:42 2013   End:  Fri May 10 20:11:43 2013

Back to top