BLAST Results

Query Summary

Your job contains 1 sequence.

Parameters
Threshold: 0.001
Maximum number of alignments shown: 100
BLAST filter: on

Query Sequence

>035140
MNFRHTLYPSYKNNRPPTPDTMVQGLQYLKASIKAMSIKVIEIQVVSPNKDSQILSHSLC
LLRIAPRGFELV

High Scoring Gene Products

Symbol, full name            Information                                          P value
AT3G52050                    protein from Arabidopsis thaliana                    4.2e-23
ECH_0080, DNA polymerase I   protein from Ehrlichia chaffeensis str. Arkansas     8.1e-05
GSU_0541, DNA polymerase I   protein from Geobacter sulfurreducens PCA            0.00022

Raw Blast Data

BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]

Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.

Reference:  Gish, W. (1996-2006) http://blast.wustl.edu

Query=  035140
        (72 letters)

Database:  go_20130330-seqdb.fasta
           368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done

                                                                     Smallest
                                                                       Sum
                                                              High  Probability
Sequences producing High-scoring Segment Pairs:              Score  P(N)      N

TAIR|locus:2083775 - symbol:AT3G52050 species:3702 "Arabi...   211  4.2e-23   2
TIGR_CMR|ECH_0080 - symbol:ECH_0080 "DNA polymerase I" sp...   107  8.1e-05   1
TIGR_CMR|GSU_0541 - symbol:GSU_0541 "DNA polymerase I" sp...    90  0.00022   2


>TAIR|locus:2083775
            symbol:AT3G52050 species:3702 "Arabidopsis thaliana"
            [GO:0003677 "DNA binding" evidence=IEA] [GO:0003824 "catalytic
            activity" evidence=IEA;ISS] [GO:0005634 "nucleus" evidence=ISM]
            [GO:0008409 "5'-3' exonuclease activity" evidence=IEA]
            InterPro:IPR002421 InterPro:IPR008918 InterPro:IPR020045
            InterPro:IPR020046 Pfam:PF01367 Pfam:PF02739 SMART:SM00279
            SMART:SM00475 EMBL:CP002686 GO:GO:0003677 GO:GO:0008152
            EMBL:AL049711 GO:GO:0008409 eggNOG:COG0258 SUPFAM:SSF47807
            IPI:IPI00846770 RefSeq:NP_001078270.1 UniGene:At.50269
            ProteinModelPortal:A8MQG8 SMR:A8MQG8 STRING:A8MQG8 PaxDb:A8MQG8
            PRIDE:A8MQG8 EnsemblPlants:AT3G52050.3 GeneID:824368
            KEGG:ath:AT3G52050 TAIR:At3g52050 HOGENOM:HOG000015113 OMA:HDGVPYG
            PhylomeDB:A8MQG8 ProtClustDB:CLSN2680900 Genevestigator:A8MQG8
            Uniprot:A8MQG8
        Length = 448

 Score = 211 (79.3 bits), Expect = 4.2e-23, Sum P(2) = 4.2e-23
 Identities = 39/43 (90%), Positives = 43/43 (100%)

Query:     1 MNFRHTLYPSYKNNRPPTPDTMVQGLQYLKASIKAMSIKVIEI 43
             MNFRHTLYP+YK+NRPPTPDT+VQGLQYLKASIKAMSIKVIE+
Sbjct:   202 MNFRHTLYPAYKSNRPPTPDTIVQGLQYLKASIKAMSIKVIEV 244

 Score = 83 (34.3 bits), Expect = 4.2e-23, Sum P(2) = 4.2e-23
 Identities = 19/31 (61%), Positives = 25/31 (80%)

Query:    42 EIQVVSPNKDS-QILSHSLCLLRIAPRGFEL 71
             +++VVSP+KD  QILS SL LLR+ PRG E+
Sbjct:   266 KVRVVSPDKDFFQILSPSLRLLRLTPRGSEM 296


>TIGR_CMR|ECH_0080
            symbol:ECH_0080 "DNA polymerase I" species:205920
            "Ehrlichia chaffeensis str. Arkansas" [GO:0003887 "DNA-directed DNA
            polymerase activity" evidence=ISS] [GO:0006260 "DNA replication"
            evidence=ISS] InterPro:IPR001098 InterPro:IPR002298
            InterPro:IPR002421 InterPro:IPR003583 InterPro:IPR008918
            InterPro:IPR012337 InterPro:IPR018320 InterPro:IPR019760
            InterPro:IPR020045 InterPro:IPR020046 Pfam:PF00476 Pfam:PF01367
            Pfam:PF02739 PRINTS:PR00868 PROSITE:PS00447 SMART:SM00278
            SMART:SM00279 SMART:SM00475 SMART:SM00482 GO:GO:0003677
            EMBL:CP000236 GenomeReviews:CP000236_GR GO:GO:0006260 GO:GO:0006281
            SUPFAM:SSF53098 GO:GO:0003887 GO:GO:0008409 eggNOG:COG0258
            SUPFAM:SSF47807 KO:K02335 OMA:EQRRYAK TIGRFAMs:TIGR00593
            HOGENOM:HOG000020999 RefSeq:YP_506910.1 ProteinModelPortal:Q2GI23
            STRING:Q2GI23 GeneID:3927608 KEGG:ech:ECH_0080 PATRIC:20575721
            BioCyc:ECHA205920:GJNR-80-MONOMER Uniprot:Q2GI23
        Length = 944

 Score = 107 (42.7 bits), Expect = 8.1e-05, P = 8.1e-05
 Identities = 25/64 (39%), Positives = 33/64 (51%)

Query:     2 NFRHTLYPSYKNNRPPTPDTMVQGLQYLKASIKAMSIKVIEIQVVSPNKDSQI--LSHSL 59
             NFRH +YP YK NRP  PD ++     L+ ++ A +I   E  VV    D  I  LS   
Sbjct:    63 NFRHNIYPEYKGNRPKLPDDLIPQFSLLREAVNAFNIASEE--VVGYEADDVIATLSKKY 120

Query:    60 CLLR 63
             C L+
Sbjct:   121 CKLQ 124


>TIGR_CMR|GSU_0541
            symbol:GSU_0541 "DNA polymerase I" species:243231
            "Geobacter sulfurreducens PCA" [GO:0003887 "DNA-directed DNA
            polymerase activity" evidence=ISS] [GO:0006260 "DNA replication"
            evidence=ISS] [GO:0006308 "DNA catabolic process" evidence=ISS]
            [GO:0008409 "5'-3' exonuclease activity" evidence=ISS]
            InterPro:IPR001098 InterPro:IPR002298 InterPro:IPR002421
            InterPro:IPR002562 InterPro:IPR003583 InterPro:IPR008918
            InterPro:IPR012337 InterPro:IPR018320 InterPro:IPR019760
            InterPro:IPR020045 InterPro:IPR020046 Pfam:PF00476 Pfam:PF01367
            Pfam:PF01612 Pfam:PF02739 PRINTS:PR00868 PROSITE:PS00447
            SMART:SM00278 SMART:SM00279 SMART:SM00474 SMART:SM00475
            SMART:SM00482 GO:GO:0003677 GO:GO:0006260 GO:GO:0006281
            GO:GO:0005622 EMBL:AE017180 GenomeReviews:AE017180_GR
            SUPFAM:SSF53098 GO:GO:0008408 GO:GO:0003887 GO:GO:0008409
            SUPFAM:SSF47807 HOGENOM:HOG000020998 KO:K02335 OMA:EQRRYAK
            ProtClustDB:PRK05755 TIGRFAMs:TIGR00593 HSSP:P00582
            RefSeq:NP_951599.1 ProteinModelPortal:Q74FR5 SMR:Q74FR5
            GeneID:2685802 KEGG:gsu:GSU0541 PATRIC:22023837
            BioCyc:GSUL243231:GH27-525-MONOMER Uniprot:Q74FR5
        Length = 891

 Score = 90 (36.7 bits), Expect = 0.00022, Sum P(2) = 0.00022
 Identities = 17/42 (40%), Positives = 26/42 (61%)

Query:     3 FRHTLYPSYKNNRPPTPDTMVQGLQYLKASIKAMSIKVIEIQ 44
             FR  +YP YK NR   PD +V  +  +K  ++A SI V+E++
Sbjct:    66 FRTEIYPDYKANRAAMPDDLVPQIGPIKEMVRAFSIPVLELE 107

 Score = 36 (17.7 bits), Expect = 0.00022, Sum P(2) = 0.00022
 Identities = 9/20 (45%), Positives = 11/20 (55%)

Query:    51 DSQILSHSLCLLRIAPRGFE 70
             D+  L H L L R+A  G E
Sbjct:   465 DAAWLLHRLFLPRVAEAGME 484


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.321   0.135   0.390    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0       72        72   0.00091  102 3  11 22  0.46    28
                                                     29  0.43    29
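
Note on bit scores: the bit values printed beside each raw score in the alignments above can be reproduced from the Lambda and K values in this table using the standard Karlin-Altschul conversion, bits = (Lambda * S - ln K) / ln 2. The sketch below is illustrative only. It assumes the gapped row (Q=9,R=2; Lambda = 0.244, K = 0.0300) is the one in effect, an assumption that happens to match every score reported here. The function name `bit_score` is hypothetical and not part of any BLAST tool.

```python
import math


def bit_score(raw_score, lam, k):
    """Convert a raw alignment score S to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


# Gapped parameters copied from the "Q=9,R=2" row of the table above
# (assumed to be the parameters used for the reported bit scores).
LAMBDA_GAPPED = 0.244
K_GAPPED = 0.0300

# Raw scores and reported bit scores copied from the HSP lines in this report.
reported = {211: 79.3, 83: 34.3, 107: 42.7, 90: 36.7, 36: 17.7}

for raw, bits in reported.items():
    computed = bit_score(raw, LAMBDA_GAPPED, K_GAPPED)
    print(f"raw={raw:>3}  computed={computed:.1f}  reported={bits}")
```

Using the ungapped BLOSUM62 row instead (Lambda = 0.321, K = 0.135) gives roughly 100.6 bits for the raw score of 211, which does not match the reported 79.3, so the gapped parameters appear to be the relevant ones.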


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  3
  No. of states in DFA:  470 (50 KB)
  Total size of DFA:  84 KB (2065 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  9.41u 0.11s 9.52t   Elapsed:  00:00:01
  Total cpu time:  9.41u 0.11s 9.52t   Elapsed:  00:00:01
  Start:  Fri May 10 05:54:36 2013   End:  Fri May 10 05:54:37 2013
