BLAST Results

Query Summary

Your job contains 1 sequence.

Parameters
Threshold: 0.001
Maximum number of alignments shown: 100
BLAST filter: on

Query Sequence

>026265
MGAEHLIINREASQAALILGLQPAALIDHVARVDWSLLDQIPGERGGSIPVAIEELEHIL
SEVKTHILDEPSPIKTIAGGSVTNTIRGLSVGFGVPCGLIGAYGDDQQGQLFVSNMQFSG
VDVSRLRMKRGPTGQCVCLVDASGNRTMRPCLSNAVKIQADELIAEDVKGSKWLVLRFGM
FNFEVIQAAIRIAKQEGLSVSMDLASFEMVRNFRTPLLQLLESGDVDLCFANEDEAAELV
R

High Scoring Gene Products

Symbol, full name Information P value
AT1G19600 protein from Arabidopsis thaliana 6.0e-87
DET_0711
carbohydrate kinase, PfkB family
protein from Dehalococcoides ethenogenes 195 3.9e-07
SPO_3582
kinase, pfkB family
protein from Ruegeria pomeroyi DSS-3 3.6e-06
gsk
Gsk
protein from Escherichia coli K-12 6.8e-05
NARA5
GENES NECESSARY FOR THE ACHIEVEMENT OF RUBISCO ACCUMULATION 5
protein from Arabidopsis thaliana 0.00039

Back to top

Raw Blast Data

BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]

Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.

Reference:  Gish, W. (1996-2006) http://blast.wustl.edu

Query=  026265
        (241 letters)

Database:  go_20130330-seqdb.fasta
           368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done

                                                                     Smallest
                                                                       Sum
                                                              High  Probability
Sequences producing High-scoring Segment Pairs:              Score  P(N)      N

TAIR|locus:2013094 - symbol:AT1G19600 species:3702 "Arabi...   869  6.0e-87   1
TIGR_CMR|DET_0711 - symbol:DET_0711 "carbohydrate kinase,...   138  3.9e-07   1
TIGR_CMR|SPO_3582 - symbol:SPO_3582 "kinase, pfkB family"...   130  3.6e-06   1
UNIPROTKB|P0AEW6 - symbol:gsk "Gsk" species:83333 "Escher...   121  6.8e-05   1
TAIR|locus:2137782 - symbol:NARA5 "GENES NECESSARY FOR TH...   115  0.00039   1


>TAIR|locus:2013094 [details] [associations]
            symbol:AT1G19600 species:3702 "Arabidopsis thaliana"
            [GO:0005737 "cytoplasm" evidence=ISM] [GO:0016301 "kinase activity"
            evidence=ISS] Pfam:PF00294 EMBL:CP002684 GenomeReviews:CT485782_GR
            GO:GO:0016301 eggNOG:COG0524 InterPro:IPR011611 EMBL:AC024609
            EMBL:AC025808 HSSP:Q9TVW2 EMBL:BT004821 EMBL:AK227761
            IPI:IPI00525669 PIR:F86328 RefSeq:NP_173390.1 UniGene:At.26831
            ProteinModelPortal:Q9LN35 SMR:Q9LN35 PRIDE:Q9LN35
            EnsemblPlants:AT1G19600.1 GeneID:838547 KEGG:ath:AT1G19600
            TAIR:At1g19600 HOGENOM:HOG000235949 InParanoid:Q9LN35 OMA:LEECCKV
            PhylomeDB:Q9LN35 ProtClustDB:PLN02379 Genevestigator:Q9LN35
            Uniprot:Q9LN35
        Length = 355

 Score = 869 (311.0 bits), Expect = 6.0e-87, P = 6.0e-87
 Identities = 171/228 (75%), Positives = 196/228 (85%)

Query:    14 QAALILGLQPAALIDHVARVDWSLLDQIPGERGGSIPVAIEELEHILSEVKTHILDEPSP 73
             +A L+LGLQPAALID+VA VDWSLLDQIPG+RGGSI V  +ELEH+L E+  HI    +P
Sbjct:    11 EAPLVLGLQPAALIDNVAPVDWSLLDQIPGDRGGSIAVQKDELEHMLKELDAHI--SVAP 68

Query:    74 IKTIAGGSVTNTIRGLSVGFGVPCGLIGAYGDDQQGQLFVSNMQFSGVDVSRLRMKRGPT 133
             +K +AGGSVTNT+RGLSVGFGV  G+IGAYGDD+QGQLFVSNM FSGV +SRLR K+G T
Sbjct:    69 LKKMAGGSVTNTVRGLSVGFGVATGIIGAYGDDEQGQLFVSNMGFSGVSISRLRKKKGST 128

Query:   134 GQCVCLVDASGNRTMRPCLSNAVKIQADELIAEDVKGSKWLVLRFGMFNFEVIQAAIRIA 193
              QCVCLVD SGNRTMRPCLS+AVKIQADEL  ED  GSKWLVLR+ + N +VIQAAIR A
Sbjct:   129 AQCVCLVDDSGNRTMRPCLSSAVKIQADELSKEDFTGSKWLVLRYAVLNLQVIQAAIRFA 188

Query:   194 KQEGLSVSMDLASFEMVRNFRTPLLQLLESGDVDLCFANEDEAAELVR 241
             KQEGLSVS+DLASFEMVRN ++ L QLLESG++DLCFANEDEAAEL+R
Sbjct:   189 KQEGLSVSLDLASFEMVRNSKSELRQLLESGNIDLCFANEDEAAELLR 236


>TIGR_CMR|DET_0711 [details] [associations]
            symbol:DET_0711 "carbohydrate kinase, PfkB family"
            species:243164 "Dehalococcoides ethenogenes 195" [GO:0005975
            "carbohydrate metabolic process" evidence=ISS] [GO:0019200
            "carbohydrate kinase activity" evidence=ISS] Pfam:PF00294
            GO:GO:0016301 EMBL:CP000027 GenomeReviews:CP000027_GR
            eggNOG:COG0524 InterPro:IPR011611 HOGENOM:HOG000235950 KO:K00852
            RefSeq:YP_181452.1 ProteinModelPortal:Q3Z8J7 STRING:Q3Z8J7
            GeneID:3229948 KEGG:det:DET0711 PATRIC:21608485
            ProtClustDB:CLSK837320 BioCyc:DETH243164:GJNF-712-MONOMER
            Uniprot:Q3Z8J7
        Length = 328

 Score = 138 (53.6 bits), Expect = 3.9e-07, P = 3.9e-07
 Identities = 38/103 (36%), Positives = 56/103 (54%)

Query:    47 GSIPVA-IEELEHILSEVKTHILDEPSPIKTIAGGSVTNTIRGLSVGFGVPCGLIGAYGD 105
             G++ V  I  +E IL + +  + +  S      GGS  NTI GLS   G+ CG IGA G+
Sbjct:    12 GAVHVDHIYTVEQILHDGECVVTESAS----FPGGSGANTIHGLS-RLGLKCGFIGAVGN 66

Query:   106 DQQGQLFVSNMQFSGVDVSRLRMKR-GPTGQCVCLVDASGNRT 147
             D  G L +S+ + +G+D   L +K    TG  V L ++SG R+
Sbjct:    67 DPDGNLLISDFENAGIDNRYLLVKNEAQTGSVVSLSESSGRRS 109


>TIGR_CMR|SPO_3582 [details] [associations]
            symbol:SPO_3582 "kinase, pfkB family" species:246200
            "Ruegeria pomeroyi DSS-3" [GO:0008152 "metabolic process"
            evidence=ISS] [GO:0016301 "kinase activity" evidence=ISS]
            Pfam:PF00294 EMBL:CP000031 GenomeReviews:CP000031_GR GO:GO:0016301
            InterPro:IPR011611 OMA:DPFCVER HOGENOM:HOG000235949
            RefSeq:YP_168777.1 ProteinModelPortal:Q5LMI1 GeneID:3193718
            KEGG:sil:SPO3582 PATRIC:23380641 ProtClustDB:CLSK934217
            Uniprot:Q5LMI1
        Length = 328

 Score = 130 (50.8 bits), Expect = 3.6e-06, P = 3.6e-06
 Identities = 54/220 (24%), Positives = 93/220 (42%)

Query:    25 ALIDHVARVDWSLLDQIPGERGGSIPVAIEELEHILSEVKTHILDEPSPIKTIAGGSVTN 84
             A++D +++ D S LD +  E+G    +  E  E + + +K  +    +P     GGSV N
Sbjct:    12 AVVDVISQADDSFLDLMGIEKGIMQLIEQERGEVLYASMKERV---QTP-----GGSVAN 63

Query:    85 TIRGLSVGFGVPCGLIGAYGDDQQGQLFVSNMQFSGVD-VSRLRMKRGPTGQCVCLVDAS 143
             TI G     G+    IG   DD  G  +   M   G+D V+        T + +  V   
Sbjct:    64 TIAGAGA-LGLDTAFIGRVRDDALGHFYADAMNEGGIDFVNPPVAGELATSRSMIFVSPD 122

Query:   144 GNRTMRPCLSNAVKIQADELIAEDVKGSKWLVLRFGMFNFEVIQAAIRIA----KQEGLS 199
             G R+M   L  +  + + ++  E    +K + L   +F+ +  ++A R A    +  G  
Sbjct:   123 GERSMNTYLGISTTLSSADVPQEVTGNAKLMFLEGYLFDHDAGKSAFREAARATRAAGGK 182

Query:   200 VSMDLASFEMVRNFRTPLLQLLESGDVDLCFANEDEAAEL 239
               + ++    V   R   L L++  D+D    NE E   L
Sbjct:   183 AGIAISDPFCVDRHRADFLSLIQH-DLDFVIGNEAEIQSL 221


>UNIPROTKB|P0AEW6 [details] [associations]
            symbol:gsk "Gsk" species:83333 "Escherichia coli K-12"
            [GO:0032264 "IMP salvage" evidence=IEA] [GO:0016310
            "phosphorylation" evidence=IEA] [GO:0005524 "ATP binding"
            evidence=IEA] [GO:0000166 "nucleotide binding" evidence=IEA]
            [GO:0008906 "inosine kinase activity" evidence=IEA;IDA]
            InterPro:IPR002173 PROSITE:PS00583 PROSITE:PS00584
            UniPathway:UPA00591 Pfam:PF00294 GO:GO:0005524 EMBL:U00096
            EMBL:AP009048 GenomeReviews:AP009048_GR GenomeReviews:U00096_GR
            EMBL:U82664 eggNOG:COG0524 InterPro:IPR011611 EMBL:D90259
            EMBL:D00798 EMBL:L35149 GO:GO:0032264 PIR:JQ0812 RefSeq:NP_415010.1
            RefSeq:YP_488768.1 ProteinModelPortal:P0AEW6 DIP:DIP-48148N
            IntAct:P0AEW6 MINT:MINT-1264409 PRIDE:P0AEW6
            EnsemblBacteria:EBESCT00000000647 EnsemblBacteria:EBESCT00000014520
            GeneID:12930856 GeneID:946584 KEGG:ecj:Y75_p0464 KEGG:eco:b0477
            PATRIC:32116111 EchoBASE:EB1094 EcoGene:EG11102
            HOGENOM:HOG000278653 KO:K00892 OMA:IRNTNGA ProtClustDB:PRK15074
            BioCyc:EcoCyc:GSK-MONOMER BioCyc:ECOL316407:JW0466-MONOMER
            BioCyc:MetaCyc:GSK-MONOMER Genevestigator:P0AEW6 GO:GO:0008906
            Uniprot:P0AEW6
        Length = 434

 Score = 121 (47.7 bits), Expect = 6.8e-05, P = 6.8e-05
 Identities = 68/245 (27%), Positives = 107/245 (43%)

Query:    13 SQAALILGLQPAALIDHVARVDWSLLDQIPGERGGSIPVAIEELEHILSEVKTHILDEPS 72
             + AA ++G+    L+D  A+VD   +++     G S+ +  +  E +  E+K   L    
Sbjct:    31 TSAAWVVGIDQT-LVDIEAKVDDEFIERYGLSAGHSLVIEDDVAEALYQELKQKNLITHQ 89

Query:    73 PIKTIAGGSVTNTIRGLSVG-------FGVPCGLI--GAYGDDQQGQLFVSNMQFSGVDV 123
                  AGG++ NT+   SV         GV C  I  G+Y        ++ N   S  D+
Sbjct:    90 ----FAGGTIGNTMHNYSVLADDRSVLLGVMCSNIEIGSYA-----YRYLCNTS-SRTDL 139

Query:   124 SRLRMKRGPTGQCVCLVDASGNRTMRPCLSNAVKIQADELIAEDV-KGSKWLVLRFGMFN 182
             + L+   GP G+C  L+  SG RT      +  +++A E I EDV  G+  LVL   +  
Sbjct:   140 NYLQGVDGPIGRCFTLIGESGERTFAISPGHMNQLRA-ESIPEDVIAGASALVLTSYLVR 198

Query:   183 F-------EVIQAAIRIAKQEGLSVSMDLAS-FEMVRNFRTPLLQLLESGDVDLCFANED 234
                     E    AI  AK+  + V + L + F +  N +    Q L+   V +   NED
Sbjct:   199 CKPGEPMPEATMKAIEYAKKYNVPVVLTLGTKFVIAENPQW-WQQFLKD-HVSILAMNED 256

Query:   235 EAAEL 239
             EA  L
Sbjct:   257 EAEAL 261


>TAIR|locus:2137782 [details] [associations]
            symbol:NARA5 "GENES NECESSARY FOR THE  ACHIEVEMENT OF
            RUBISCO ACCUMULATION 5" species:3702 "Arabidopsis thaliana"
            [GO:0005739 "mitochondrion" evidence=ISM] [GO:0016301 "kinase
            activity" evidence=ISS] [GO:0016773 "phosphotransferase activity,
            alcohol group as acceptor" evidence=IEA] [GO:0009507 "chloroplast"
            evidence=IDA] [GO:0015979 "photosynthesis" evidence=IMP]
            [GO:0009902 "chloroplast relocation" evidence=RCA] [GO:0010027
            "thylakoid membrane organization" evidence=RCA] [GO:0019288
            "isopentenyl diphosphate biosynthetic process,
            mevalonate-independent pathway" evidence=RCA] [GO:0034660 "ncRNA
            metabolic process" evidence=RCA] [GO:0042793 "transcription from
            plastid promoter" evidence=RCA] InterPro:IPR002173 PROSITE:PS00584
            Pfam:PF00294 GO:GO:0009507 EMBL:CP002687 GO:GO:0016301
            GO:GO:0016773 InterPro:IPR011611 GO:GO:0015979 HSSP:Q9TVW2
            EMBL:AY052355 EMBL:AY059658 IPI:IPI00532244 RefSeq:NP_567780.1
            UniGene:At.2798 ProteinModelPortal:Q940Y6 SMR:Q940Y6 IntAct:Q940Y6
            STRING:Q940Y6 EnsemblPlants:AT4G27600.1 GeneID:828871
            KEGG:ath:AT4G27600 TAIR:At4g27600 InParanoid:Q940Y6 OMA:DPFCVER
            PhylomeDB:Q940Y6 ProtClustDB:PLN02813 Genevestigator:Q940Y6
            Uniprot:Q940Y6
        Length = 471

 Score = 115 (45.5 bits), Expect = 0.00039, P = 0.00039
 Identities = 54/230 (23%), Positives = 93/230 (40%)

Query:    18 ILGLQPAALIDHVARVDWSLLDQIPGERGGSIPVAIEELEHILSEVKTHILDEPSPIKTI 77
             +LGL  A ++D    VD   L ++  E+G    +  EE   +L       +D  S  K  
Sbjct:   123 VLGLGQA-MVDFSGVVDDEFLKKLGLEKGTRKLINHEERGKLLQA-----MDGCS-YKAA 175

Query:    78 AGGSVTNTIRGLS-VGFGVPC------GLIGAYGDDQQGQLFVSNMQFSGVDVSRLRMKR 130
             AGGS++NT+  L+ +G    C       + G+ G D  G  + + ++ + V+     +K 
Sbjct:   176 AGGSLSNTLVALARLGSQSICDRPLNVAMAGSIGGDPLGSFYGTKLRRANVNFLSAPIKD 235

Query:   131 GPTGQCVCLVDASGNRTMRPCLSNAVKIQADELIAEDVKGSKWLVLRFGMFNF----EVI 186
             G TG  + L      RTM      +  +  D  +A  +  +   V+   +F        I
Sbjct:   236 GTTGTVIVLTTPDAQRTMLAYQGTSSVVNYDSCLASLIAKTNVFVVEGYLFELPDTIRTI 295

Query:   187 QAAIRIAKQEGLSVSMDLASFEMVRNFRTPLLQLLESGDVDLCFANEDEA 236
               A   A + G  V++  +    +         ++     D+ FAN DEA
Sbjct:   296 TKACEEAHRNGALVAVTASDVSCIERHYDDFWDIV-GNYADIVFANSDEA 344


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.321   0.137   0.398    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      241       241   0.00094  113 3  11 22  0.41    33
                                                     32  0.44    36


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  5
  No. of states in DFA:  602 (64 KB)
  Total size of DFA:  170 KB (2100 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  19.48u 0.19s 19.67t   Elapsed:  00:00:01
  Total cpu time:  19.48u 0.19s 19.67t   Elapsed:  00:00:01
  Start:  Fri May 10 05:23:06 2013   End:  Fri May 10 05:23:07 2013

Back to top