BLAST Results

Query Summary

Your job contains 1 sequence.

Parameters
Threshold: 0.001
Maximum number of alignments shown: 100
BLAST filter: on

Query Sequence

>021267
MSTSLSSCYVATVLHINQFKQPNPKPWAYCFSRLGHSVSHCRVPWSCKVVSDRLPAISNA
IAKDSKSFTEDETESYDWEDQEDVEEDAGSPWEGAIIYKRNPSITHLEYCTTLERLGLGK
LSTEVSRSRASAMGLRVTKAVKDYPNGTPVQISIDVTKKKQKLRLDGIIRTVLTLGCNRC
GEPAAQSVFSDFSVLLSEQPIEEPEIIHIGMMFGEDKSKSSTGNGSEEEDDDASIDWDDR
LYFPLEEKEIDISKNIRDMVHLEITINVICDPSCKGICLKCGTNLNTSTCNCSKEEVKGK
TYGPLGNLRKQMERR

High Scoring Gene Products

Symbol, full name Information P value
AT3G19810 protein from Arabidopsis thaliana 2.9e-80
GSU1598
Uncharacterized protein
protein from Geobacter sulfurreducens PCA 2.8e-07
GSU_1598
conserved hypothetical protein
protein from Geobacter sulfurreducens PCA 2.8e-07
CHY_1454
Putative uncharacterized protein
protein from Carboxydothermus hydrogenoformans Z-2901 8.0e-05
CHY_1454
conserved hypothetical protein
protein from Carboxydothermus hydrogenoformans Z-2901 8.0e-05

Back to top

Raw Blast Data

BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]

Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.

Reference:  Gish, W. (1996-2006) http://blast.wustl.edu

Query=  021267
        (315 letters)

Database:  go_20130330-seqdb.fasta
           368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done

                                                                     Smallest
                                                                       Sum
                                                              High  Probability
Sequences producing High-scoring Segment Pairs:              Score  P(N)      N

TAIR|locus:2092266 - symbol:AT3G19810 "AT3G19810" species...   806  2.9e-80   1
UNIPROTKB|Q74CS4 - symbol:GSU1598 "Uncharacterized protei...   108  2.8e-07   2
TIGR_CMR|GSU_1598 - symbol:GSU_1598 "conserved hypothetic...   108  2.8e-07   2
UNIPROTKB|Q3AC48 - symbol:CHY_1454 "Putative uncharacteri...   111  8.0e-05   1
TIGR_CMR|CHY_1454 - symbol:CHY_1454 "conserved hypothetic...   111  8.0e-05   1


>TAIR|locus:2092266 [details] [associations]
            symbol:AT3G19810 "AT3G19810" species:3702 "Arabidopsis
            thaliana" [GO:0003674 "molecular_function" evidence=ND] [GO:0008150
            "biological_process" evidence=ND] [GO:0009507 "chloroplast"
            evidence=ISM] EMBL:CP002686 GenomeReviews:BA000014_GR EMBL:AB025631
            InterPro:IPR003772 Pfam:PF02620 UniGene:At.19008 UniGene:At.66494
            EMBL:AY088199 EMBL:BT026117 EMBL:AK226924 IPI:IPI00537927
            RefSeq:NP_566649.1 STRING:Q9LT27 PRIDE:Q9LT27
            EnsemblPlants:AT3G19810.1 GeneID:821518 KEGG:ath:AT3G19810
            TAIR:At3g19810 eggNOG:NOG285927 HOGENOM:HOG000239635
            InParanoid:Q9LT27 OMA:VHLEITI PhylomeDB:Q9LT27
            ProtClustDB:CLSN2688576 Genevestigator:Q9LT27 Uniprot:Q9LT27
        Length = 321

 Score = 806 (288.8 bits), Expect = 2.9e-80, P = 2.9e-80
 Identities = 153/264 (57%), Positives = 196/264 (74%)

Query:    53 RLPAISNA-IAKDSKSFTEDETESYDWEDQEDVEEDAGSPWEGAIIYKRNPSITHLEYCT 111
             +LP ++ + I    +SFTE  T   DWEDQE++E D GSPWEG+++Y+RN S+TH+EYCT
Sbjct:    59 KLPRLAKSRILVSQESFTETSTIDMDWEDQEEIE-DTGSPWEGSVMYRRNASVTHVEYCT 117

Query:   112 TLERLGLGKLSTEVSRSRASAMGLRVTKAVKDYPNGTPVQISIDVTKKKQKLRLDGIIRT 171
             TLERLGLG+LST+VS+ RASAMGLRVTK VKDYP+GTPVQ+S+DV +KK+KLRLDGI+RT
Sbjct:   118 TLERLGLGRLSTDVSKKRASAMGLRVTKDVKDYPDGTPVQVSVDVIRKKKKLRLDGIVRT 177

Query:   172 VLTLGCNRCGEPAAQSVFSDFSVLLSXXXXXXXXXXXXGMMFGEDKSKSSTGNGSXXXXX 231
             V+TLGCNRCGE   +S+FS+FS+LL+            G  FG DK +    + +     
Sbjct:   178 VITLGCNRCGESTGESIFSNFSLLLTEEPVEEPDVIDLGFTFGNDKEEGEDDDDNDDSWI 237

Query:   232 XXXXXXXXRLYFPLEEKEIDISKNIRDMVHLEITINVICDPSCKGICLKCGTNLNTSTCN 291
                     +L+FP E KEIDISK+IRD+VHLEITI  ICD +CKG+CLKCG NLN   C+
Sbjct:   238 DWED----KLHFPPEVKEIDISKHIRDLVHLEITITAICDSACKGMCLKCGANLNKRKCD 293

Query:   292 CSKEEVKGKTYGPLGNLRKQMERR 315
             C +EE K K YGPLGNLR+QM+++
Sbjct:   294 CGREE-KDKGYGPLGNLREQMQQK 316


>UNIPROTKB|Q74CS4 [details] [associations]
            symbol:GSU1598 "Uncharacterized protein" species:243231
            "Geobacter sulfurreducens PCA" [GO:0003674 "molecular_function"
            evidence=ND] [GO:0005575 "cellular_component" evidence=ND]
            [GO:0008150 "biological_process" evidence=ND] EMBL:AE017180
            GenomeReviews:AE017180_GR KO:K07040 InterPro:IPR003772 Pfam:PF02620
            RefSeq:NP_952649.1 GeneID:2687282 KEGG:gsu:GSU1598 PATRIC:22026041
            HOGENOM:HOG000132293 OMA:PVKPLCR ProtClustDB:CLSK828431
            BioCyc:GSUL243231:GH27-1542-MONOMER Uniprot:Q74CS4
        Length = 179

 Score = 108 (43.1 bits), Expect = 2.8e-07, Sum P(2) = 2.8e-07
 Identities = 17/45 (37%), Positives = 27/45 (60%)

Query:   249 EIDISKNIRDMVHLEITINVICDPSCKGICLKCGTNLNTSTCNCS 293
             EID +  I + V +E+ +  +C  SC+G+C  CG +LN   C C+
Sbjct:   117 EIDFAPEIAEQVIMELPLKPLCHESCRGLCPVCGVDLNEQECTCA 161

 Score = 61 (26.5 bits), Expect = 2.8e-07, Sum P(2) = 2.8e-07
 Identities = 18/62 (29%), Positives = 30/62 (48%)

Query:   134 GLRVTKAVKDYPNGTPVQISIDVTKKKQKLRLDGIIRTVLTLGCNRC-G--EPAAQSVFS 190
             GL   +   D    +PV + + V ++   +R+ G +   + L C+RC G  E    SVF+
Sbjct:    26 GLTAVQESGDCEFLSPVTVELTVAREYDHIRVKGNLSARIRLNCSRCLGDFETDLASVFT 85

Query:   191 DF 192
              F
Sbjct:    86 IF 87


>TIGR_CMR|GSU_1598 [details] [associations]
            symbol:GSU_1598 "conserved hypothetical protein"
            species:243231 "Geobacter sulfurreducens PCA" [GO:0003674
            "molecular_function" evidence=ND] [GO:0005575 "cellular_component"
            evidence=ND] [GO:0008150 "biological_process" evidence=ND]
            EMBL:AE017180 GenomeReviews:AE017180_GR KO:K07040
            InterPro:IPR003772 Pfam:PF02620 RefSeq:NP_952649.1 GeneID:2687282
            KEGG:gsu:GSU1598 PATRIC:22026041 HOGENOM:HOG000132293 OMA:PVKPLCR
            ProtClustDB:CLSK828431 BioCyc:GSUL243231:GH27-1542-MONOMER
            Uniprot:Q74CS4
        Length = 179

 Score = 108 (43.1 bits), Expect = 2.8e-07, Sum P(2) = 2.8e-07
 Identities = 17/45 (37%), Positives = 27/45 (60%)

Query:   249 EIDISKNIRDMVHLEITINVICDPSCKGICLKCGTNLNTSTCNCS 293
             EID +  I + V +E+ +  +C  SC+G+C  CG +LN   C C+
Sbjct:   117 EIDFAPEIAEQVIMELPLKPLCHESCRGLCPVCGVDLNEQECTCA 161

 Score = 61 (26.5 bits), Expect = 2.8e-07, Sum P(2) = 2.8e-07
 Identities = 18/62 (29%), Positives = 30/62 (48%)

Query:   134 GLRVTKAVKDYPNGTPVQISIDVTKKKQKLRLDGIIRTVLTLGCNRC-G--EPAAQSVFS 190
             GL   +   D    +PV + + V ++   +R+ G +   + L C+RC G  E    SVF+
Sbjct:    26 GLTAVQESGDCEFLSPVTVELTVAREYDHIRVKGNLSARIRLNCSRCLGDFETDLASVFT 85

Query:   191 DF 192
              F
Sbjct:    86 IF 87


>UNIPROTKB|Q3AC48 [details] [associations]
            symbol:CHY_1454 "Putative uncharacterized protein"
            species:246194 "Carboxydothermus hydrogenoformans Z-2901"
            [GO:0003674 "molecular_function" evidence=ND] [GO:0005575
            "cellular_component" evidence=ND] [GO:0008150 "biological_process"
            evidence=ND] EMBL:CP000141 GenomeReviews:CP000141_GR eggNOG:COG1399
            HOGENOM:HOG000246820 InterPro:IPR003772 Pfam:PF02620
            RefSeq:YP_360286.1 STRING:Q3AC48 GeneID:3728761 KEGG:chy:CHY_1454
            PATRIC:21276039 OMA:YPIKKGY BioCyc:CHYD246194:GJCN-1453-MONOMER
            Uniprot:Q3AC48
        Length = 166

 Score = 111 (44.1 bits), Expect = 8.0e-05, P = 8.0e-05
 Identities = 20/52 (38%), Positives = 30/52 (57%)

Query:   250 IDISKNIRDMVHLEITINV----ICDPSCKGICLKCGTNLNTSTCNCSKEEV 297
             +D   N+ ++V  E  +N+    +C   CKG+C  CG NLN   C+CS EE+
Sbjct:    98 VDFKINLDELVFEETVLNLPLKPVCHHDCKGLCPVCGENLNERECSCSHEEI 149


>TIGR_CMR|CHY_1454 [details] [associations]
            symbol:CHY_1454 "conserved hypothetical protein"
            species:246194 "Carboxydothermus hydrogenoformans Z-2901"
            [GO:0003674 "molecular_function" evidence=ND] [GO:0005575
            "cellular_component" evidence=ND] [GO:0008150 "biological_process"
            evidence=ND] EMBL:CP000141 GenomeReviews:CP000141_GR eggNOG:COG1399
            HOGENOM:HOG000246820 InterPro:IPR003772 Pfam:PF02620
            RefSeq:YP_360286.1 STRING:Q3AC48 GeneID:3728761 KEGG:chy:CHY_1454
            PATRIC:21276039 OMA:YPIKKGY BioCyc:CHYD246194:GJCN-1453-MONOMER
            Uniprot:Q3AC48
        Length = 166

 Score = 111 (44.1 bits), Expect = 8.0e-05, P = 8.0e-05
 Identities = 20/52 (38%), Positives = 30/52 (57%)

Query:   250 IDISKNIRDMVHLEITINV----ICDPSCKGICLKCGTNLNTSTCNCSKEEV 297
             +D   N+ ++V  E  +N+    +C   CKG+C  CG NLN   C+CS EE+
Sbjct:    98 VDFKINLDELVFEETVLNLPLKPVCHHDCKGLCPVCGENLNERECSCSHEEI 149


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.316   0.132   0.398    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      315       290   0.00089  115 3  11 22  0.41    34
                                                     33  0.43    37


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  5
  No. of states in DFA:  611 (65 KB)
  Total size of DFA:  218 KB (2120 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  23.92u 0.17s 24.09t   Elapsed:  00:00:01
  Total cpu time:  23.92u 0.17s 24.09t   Elapsed:  00:00:01
  Start:  Sat May 11 10:23:59 2013   End:  Sat May 11 10:24:00 2013

Back to top