BLAST Results

Query Summary

Your job contains 1 sequence.

Parameters
Threshold: 0.001
Maximum number of alignments shown: 100
BLAST filter: on

Query Sequence

>017067
MKTSSTSCSLLNSLRFSTAITVSKKSYYHYQATQLRNHNCLSPFPSFSSTFPRNYNFHGK
CLHVNPFSAFSSSSGHDSQNPPRDLAVLLEVDGVLVDAYRFGNRQAFNVAFQKLGLDCAN
WTAPIYTDLLRKSAGDEDRMLVLFFNRIGWPTSVPTNEKKAFVKNVLQEKKNALDEFLAS
KDAPLRPGVEDFVDDAYNEGIPLIVLTAYGKSGDRIARSVVEKLGSERISKIKIVGNEEV
ERSLYGQFVLGKGISSGVDEQLATEARKAVSAQKQEIAEEVASMLKLSVDIDTSSPESLD
KIVAALRAGAEYAEKPVRNCFLIAGSQSGVAGAQRIGMPCVVMRSSLTSRAEFPSANAVM
DGFGGADLTISKLRHSQW

High Scoring Gene Products

Symbol, full name    Information                          P value
AT5G45170            protein from Arabidopsis thaliana    3.5e-107
AT3G48420            protein from Arabidopsis thaliana    3.4e-18
AT4G39970            protein from Arabidopsis thaliana    1.6e-09
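
The summary rows above can be checked against the Threshold value listed under Parameters. The following is a minimal sketch, assuming each row is a symbol, a free-text description, and a trailing P value separated by whitespace; the names `Hit`, `parse_hit_row`, `ROWS`, and `THRESHOLD` are illustrative and not part of the BLAST output.

```python
from typing import NamedTuple


class Hit(NamedTuple):
    symbol: str
    description: str
    p_value: float


def parse_hit_row(row: str) -> Hit:
    """Split one summary row into symbol, description, and P value."""
    # First whitespace-delimited token is the symbol.
    symbol, rest = row.split(None, 1)
    # Last whitespace-delimited token is the P value; the middle is free text.
    description, p_text = rest.rsplit(None, 1)
    return Hit(symbol, description, float(p_text))


# Threshold copied from the Parameters section of this report.
THRESHOLD = 0.001

# Rows copied from the High Scoring Gene Products table.
ROWS = [
    "AT5G45170 protein from Arabidopsis thaliana 3.5e-107",
    "AT3G48420 protein from Arabidopsis thaliana 3.4e-18",
    "AT4G39970 protein from Arabidopsis thaliana 1.6e-09",
]

hits = [parse_hit_row(row) for row in ROWS]
# Keep only the symbols whose P value is at or below the threshold.
passing = [hit.symbol for hit in hits if hit.p_value <= THRESHOLD]
print(passing)
```

All three listed hits fall below the threshold, which agrees with the "# of database sequences satisfying E: 3" line in the Statistics block.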


Raw Blast Data

BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]

Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.

Reference:  Gish, W. (1996-2006) http://blast.wustl.edu

Query=  017067
        (378 letters)

Database:  go_20130330-seqdb.fasta
           368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done

                                                                     Smallest
                                                                       Sum
                                                              High  Probability
Sequences producing High-scoring Segment Pairs:              Score  P(N)      N

TAIR|locus:2153348 - symbol:AT5G45170 "AT5G45170" species...  1060  3.5e-107  1
TAIR|locus:2101165 - symbol:AT3G48420 species:3702 "Arabi...   229  3.4e-18   1
TAIR|locus:2140050 - symbol:AT4G39970 species:3702 "Arabi...   125  1.6e-09   2


>TAIR|locus:2153348
            symbol:AT5G45170 "AT5G45170" species:3702 "Arabidopsis
            thaliana" [GO:0003674 "molecular_function" evidence=ND] [GO:0008150
            "biological_process" evidence=ND] [GO:0009507 "chloroplast"
            evidence=ISM;IDA] [GO:0009570 "chloroplast stroma" evidence=IDA]
            [GO:0015979 "photosynthesis" evidence=RCA] EMBL:CP002688
            GenomeReviews:BA000015_GR GO:GO:0009570 Gene3D:3.40.50.1000
            InterPro:IPR023214 SUPFAM:SSF56784 EMBL:BT011752 EMBL:AK222127
            IPI:IPI00533029 RefSeq:NP_199330.2 UniGene:At.27698
            ProteinModelPortal:Q6NMA9 STRING:Q6NMA9 PaxDb:Q6NMA9 PRIDE:Q6NMA9
            EnsemblPlants:AT5G45170.1 GeneID:834553 KEGG:ath:AT5G45170
            TAIR:At5g45170 eggNOG:NOG312108 HOGENOM:HOG000006280
            InParanoid:Q6NMA9 OMA:AFNVAFQ PhylomeDB:Q6NMA9
            ProtClustDB:CLSN2680441 Genevestigator:Q6NMA9 Uniprot:Q6NMA9
        Length = 372

 Score = 1060 (378.2 bits), Expect = 3.5e-107, P = 3.5e-107
 Identities = 200/317 (63%), Positives = 252/317 (79%)

Query:    59 GKCLHVNPXXXXXXXXGHDSQNPPRDLAVLLEVDGVLVDAYRFGNRQAFNVAFQKLGLDC 118
             GKCL +            +  NP  + AV+LEVD V++D +   NRQAFNVAFQKLGLDC
Sbjct:    53 GKCLRLQRFSSICLSASREDVNPSEEFAVILEVDRVMIDTWS-SNRQAFNVAFQKLGLDC 111

Query:   119 ANWTAPIYTDLLRKSAGDEDRMLVLFFNRIGWPTSVPTNEKKAFVKNVLQEKKNALDEFL 178
             ANW  P+Y+DLLRK A DE++ML+L+FN+IGWP+S+PT+EK +FVK+VL+EKKNA+DEFL
Sbjct:   112 ANWPEPVYSDLLRKGAADEEKMLLLYFNQIGWPSSLPTSEKASFVKSVLREKKNAMDEFL 171

Query:   179 ASKDAPLRPGVEDFVDDAYNEGIPLIVLTAYGKSGDRIARSVVEKLGSERISKIKIVGNE 238
              SK  PLR GV++F+D+AY E +P+ ++TAY KSGD++A S+VE LG ER+  +K++G+ 
Sbjct:   172 ISKSLPLRSGVQEFIDNAYAEKVPVAIVTAYCKSGDKVALSIVEMLGQERLPNVKVIGDN 231

Query:   239 EVERSLYGQFVLGKGISSGVDEQLATEARKAVSAQKQEIAEEVASMLKLSVDIDTSSPES 298
             EVE+S+YGQ VLGKG+SS ++EQL  E +KA SA+KQ IAEEVASMLKLSVDIDT+S E 
Sbjct:   232 EVEQSMYGQLVLGKGVSSSLEEQLVKEVKKAASAEKQRIAEEVASMLKLSVDIDTTSSER 291

Query:   299 LDKIVAALRAGAEYAEKPVRNCFLIAGSQSGVAGAQRIGMPCVVMRSSLTSRAEFPSANA 358
             L+KIV ALRA AE+   PV NC L+AGSQ GV+ A+ IGMPCVVMRSSLT+R EFPSA  
Sbjct:   292 LEKIVVALRAAAEHIGLPVNNCVLVAGSQPGVSAAKMIGMPCVVMRSSLTARGEFPSAKG 351

Query:   359 VMDGFGGADLTISKLRH 375
             VMDGFGGADLTI KLR+
Sbjct:   352 VMDGFGGADLTIPKLRN 368


>TAIR|locus:2101165
            symbol:AT3G48420 species:3702 "Arabidopsis thaliana"
            [GO:0003824 "catalytic activity" evidence=IEA] [GO:0008152
            "metabolic process" evidence=IEA] [GO:0009507 "chloroplast"
            evidence=ISM;IDA] [GO:0016787 "hydrolase activity"
            evidence=IEA;ISS] [GO:0009941 "chloroplast envelope" evidence=IDA]
            [GO:0009570 "chloroplast stroma" evidence=IDA] [GO:0006098
            "pentose-phosphate shunt" evidence=RCA] [GO:0010027 "thylakoid
            membrane organization" evidence=RCA] [GO:0010103 "stomatal complex
            morphogenesis" evidence=RCA] [GO:0016117 "carotenoid biosynthetic
            process" evidence=RCA] [GO:0019252 "starch biosynthetic process"
            evidence=RCA] [GO:0019684 "photosynthesis, light reaction"
            evidence=RCA] InterPro:IPR006402 GO:GO:0009570 EMBL:CP002686
            Gene3D:3.40.50.1000 InterPro:IPR023214 SUPFAM:SSF56784
            GO:GO:0009941 GO:GO:0016787 Pfam:PF13419 TIGRFAMs:TIGR01509
            ProtClustDB:PLN02779 EMBL:AF370250 EMBL:AY063066 EMBL:AK118118
            EMBL:AK175866 EMBL:AK176795 IPI:IPI00532424 RefSeq:NP_566903.1
            UniGene:At.3168 ProteinModelPortal:Q94K71 SMR:Q94K71 STRING:Q94K71
            PRIDE:Q94K71 ProMEX:Q94K71 EnsemblPlants:AT3G48420.1 GeneID:824000
            KEGG:ath:AT3G48420 TAIR:At3g48420 InParanoid:Q94K71 OMA:HREAFNE
            PhylomeDB:Q94K71 ArrayExpress:Q94K71 Genevestigator:Q94K71
            Uniprot:Q94K71
        Length = 319

 Score = 229 (85.7 bits), Expect = 3.4e-18, P = 3.4e-18
 Identities = 55/160 (34%), Positives = 86/160 (53%)

Query:    86 AVLLEVDGVLVDAYRFGNRQAFNVAFQKLGLDCANWTAPIYTDLLRKSAGDEDRMLVLFF 145
             A+L + DGVLVD  + G+R +FN  F++  L+   W   +Y +LL K  G ++RM   +F
Sbjct:    78 ALLFDCDGVLVDTEKDGHRISFNDTFKERDLN-VTWDVDLYGELL-KIGGGKERMTA-YF 134

Query:   146 NRIGWPTSVPTNE--KKAFVKNVLQEKKNALDEFLASKDAPLRPGVEDFVDDAYNEGIPL 203
             N++GWP   P +E  +K F+  + ++K       +  K  PLRPGV   VD A   G+ +
Sbjct:   135 NKVGWPEKAPKDEAERKEFIAGLHKQKTELFMVLIEKKLLPLRPGVAKLVDQALTNGVKV 194

Query:   204 IVLTAYGKSGDRIARSVVE-KLGSERISKIKIVGNEEVER 242
              V +    S ++   ++V   LG ER  KIKI   + V +
Sbjct:   195 AVCST---SNEKAVSAIVSCLLGPERAEKIKIFAGDVVPK 231


>TAIR|locus:2140050
            symbol:AT4G39970 species:3702 "Arabidopsis thaliana"
            [GO:0003824 "catalytic activity" evidence=IEA] [GO:0008152
            "metabolic process" evidence=IEA] [GO:0009507 "chloroplast"
            evidence=ISM;IDA] [GO:0016787 "hydrolase activity"
            evidence=IEA;ISS] [GO:0009941 "chloroplast envelope" evidence=IDA]
            [GO:0009570 "chloroplast stroma" evidence=IDA] [GO:0019761
            "glucosinolate biosynthetic process" evidence=RCA]
            InterPro:IPR006402 GO:GO:0009570 EMBL:CP002687
            GenomeReviews:CT486007_GR Gene3D:3.40.50.1000 InterPro:IPR023214
            SUPFAM:SSF56784 GO:GO:0009941 GO:GO:0016787 Pfam:PF13419
            TIGRFAMs:TIGR01509 eggNOG:COG0637 HOGENOM:HOG000248341
            EMBL:AK175831 EMBL:AK175865 EMBL:AK176082 IPI:IPI00527587
            RefSeq:NP_568077.1 UniGene:At.43709 UniGene:At.68472
            ProteinModelPortal:Q680K2 SMR:Q680K2 STRING:Q680K2 PaxDb:Q680K2
            PRIDE:Q680K2 EnsemblPlants:AT4G39970.1 GeneID:830158
            KEGG:ath:AT4G39970 TAIR:At4g39970 InParanoid:Q680K2 OMA:ADTESAH
            PhylomeDB:Q680K2 ProtClustDB:PLN02779 Genevestigator:Q680K2
            Uniprot:Q680K2
        Length = 316

 Score = 125 (49.1 bits), Expect = 1.6e-09, Sum P(2) = 1.6e-09
 Identities = 44/147 (29%), Positives = 68/147 (46%)

Query:    81 PPRDL-AVLLEVDGVLVDAYRFGNRQAFNVAFQKLGLDCA-------NWTAPIYTDLLRK 132
             P R L A++ + DGV++++    +RQA+N AF    + C        +W+   Y      
Sbjct:    59 PLRSLEALIFDCDGVILESENL-HRQAYNDAFSHFDVRCPPSSSESLDWSLEFYDKFQNL 117

Query:   133 SAGDEDRMLVLFFNRIGWPTSV-----PTNEK-KAFVKNVLQE-KKNALDEFLASKDAPL 185
               G + +M   +F   GWPTS      P N+  +A + + LQ+ K     E + S     
Sbjct:   118 VGGGKPKMR-WYFKENGWPTSTIFDSPPQNDDDRAKLIDTLQDWKTERYKEIIKSGSVEP 176

Query:   186 RPGVEDFVDDAYNEGIPLIVLTAYGKS 212
             RPGV   +D+A   G  L V +A  KS
Sbjct:   177 RPGVIRLMDEAKAAGKKLAVCSAATKS 203

 Score = 80 (33.2 bits), Expect = 1.6e-09, Sum P(2) = 1.6e-09
 Identities = 22/70 (31%), Positives = 33/70 (47%)

Query:   290 DIDTSSPESLDKIVAALRAGAEYAEKPVRNCFLIAGSQSGVAGAQRIGMPCVVMRSSLTS 349
             D+    P+    I AA + G       V++C ++  S  G+  A + GM CV+  +S TS
Sbjct:   229 DVKEKKPDPSIYITAAEKLGVS-----VKDCLVVEDSVIGLQAATKAGMSCVITYTSSTS 283

Query:   350 RAEFPSANAV 359
                F  A AV
Sbjct:   284 DQNFNDAIAV 293
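
The "Identities" and "Positives" lines in the alignments above give a match count, an alignment length, and a percentage. In this report the percentages appear to be truncated rather than rounded (86/160 is shown as 53%, not 54%). The sketch below parses such a line and reproduces the reported figure with integer division; the helper names `parse_stats` and `truncated_percent` are hypothetical, and the truncation rule is inferred from the numbers in this output, not taken from BLAST documentation.

```python
import re

# Matches e.g. "Identities = 55/160 (34%)" or "Positives = 86/160 (53%)".
STAT_FIELD = re.compile(r"(Identities|Positives) = (\d+)/(\d+) \((\d+)%\)")


def parse_stats(line: str) -> dict:
    """Return {label: (matches, length, reported_percent)} for one stats line."""
    return {
        label: (int(matches), int(length), int(percent))
        for label, matches, length, percent in STAT_FIELD.findall(line)
    }


def truncated_percent(matches: int, length: int) -> int:
    """Percentage with the fractional part dropped (assumed reporting rule)."""
    return matches * 100 // length


# Line copied from the AT3G48420 alignment in this report.
line = " Identities = 55/160 (34%), Positives = 86/160 (53%)"
stats = parse_stats(line)

for label, (matches, length, reported) in stats.items():
    print(label, reported, truncated_percent(matches, length))
```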


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.317   0.133   0.383    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      378       347   0.00098  116 3  11 22  0.41    34
                                                     33  0.45    37
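
The bit scores shown in parentheses in the alignment headers can be related to the raw scores through the standard Karlin-Altschul normalization, S' = (Lambda * S - ln K) / ln 2. The sketch below assumes that the Q=9,R=2 row of the table above (Lambda = 0.244, K = 0.0300) holds the gapped parameters that were applied; the function name `bit_score` is illustrative only.

```python
import math

# "As Used" values from the Q=9,R=2 row of the Parameters table (assumed gapped).
GAPPED_LAMBDA = 0.244
GAPPED_K = 0.0300


def bit_score(raw_score: float, lam: float = GAPPED_LAMBDA, k: float = GAPPED_K) -> float:
    """Convert a raw alignment score to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


# Raw scores of the four HSPs reported in this output.
for raw in (1060, 229, 125, 80):
    print(raw, round(bit_score(raw), 1))
# 1060 378.2
# 229 85.7
# 125 49.1
# 80 33.2
```

Under that assumption the computed values agree with the reported 378.2, 85.7, 49.1, and 33.2 bits.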


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  3
  No. of states in DFA:  611 (65 KB)
  Total size of DFA:  211 KB (2118 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  29.80u 0.06s 29.86t   Elapsed:  00:00:02
  Total cpu time:  29.80u 0.06s 29.86t   Elapsed:  00:00:02
  Start:  Fri May 10 10:37:52 2013   End:  Fri May 10 10:37:54 2013
