BLAST Results

Query Summary

Your job contains 1 sequence.

Parameters
Threshold: 0.001
Maximum number of alignments shown: 100
BLAST filter: on

Query Sequence

>020992
MQMDDSRIAMKEKLEFVGIIKEALKIPIKNPKFIFTILLTSFPLFCSLCLHETMFQRALI
ESLRSQADDPPPHEGFVITVRSSGPSLYAFRDWIGNFISRKLLVQSFTLLAIIHLFDLLN
TIITVSTASAIYAEEKEMMTKPFIENKVALFKGPLITSIYALLLNSLALVGLFTLSINIY
VMWSAFTFFTLTYMLLFVALFTKYIEWSAVWNTGIVISILEQNKHGDVALGVSAYISRGS
RKRGFLIMLVFFAWSFGLRLSSLCVGWQKGNVVIEVILAQACLDCLGSVMKWVAFMIYFY
DCKKRFIEKKFDGSAEGKA

High Scoring Gene Products

Symbol, full name Information P value
AT1G23850 protein from Arabidopsis thaliana 7.4e-22
AT1G23840 protein from Arabidopsis thaliana 6.2e-21
AT1G23830 protein from Arabidopsis thaliana 3.1e-17
AT2G18680 protein from Arabidopsis thaliana 5.9e-11
AT2G18690 protein from Arabidopsis thaliana 2.0e-09

Back to top

Raw Blast Data

BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]

Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.

Reference:  Gish, W. (1996-2006) http://blast.wustl.edu

Query=  020992
        (319 letters)

Database:  go_20130330-seqdb.fasta
           368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done

                                                                     Smallest
                                                                       Sum
                                                              High  Probability
Sequences producing High-scoring Segment Pairs:              Score  P(N)      N

TAIR|locus:2199832 - symbol:AT1G23850 "AT1G23850" species...   194  7.4e-22   2
TAIR|locus:2034913 - symbol:AT1G23840 species:3702 "Arabi...   202  6.2e-21   2
TAIR|locus:2034908 - symbol:AT1G23830 "AT1G23830" species...   174  3.1e-17   2
TAIR|locus:2054005 - symbol:AT2G18680 "AT2G18680" species...   171  5.9e-11   1
TAIR|locus:2054016 - symbol:AT2G18690 "AT2G18690" species...   160  2.0e-09   1


>TAIR|locus:2199832 [details] [associations]
            symbol:AT1G23850 "AT1G23850" species:3702 "Arabidopsis
            thaliana" [GO:0003674 "molecular_function" evidence=ND] [GO:0005575
            "cellular_component" evidence=ND] [GO:0008150 "biological_process"
            evidence=ND] EMBL:CP002684 EMBL:AC002423 EMBL:AK175203
            EMBL:AK175422 EMBL:AK176914 IPI:IPI00544060 RefSeq:NP_564207.1
            UniGene:At.41517 PRIDE:Q9LRA9 EnsemblPlants:AT1G23850.1
            GeneID:838996 KEGG:ath:AT1G23850 TAIR:At1g23850 InParanoid:Q9LRA9
            OMA:CTESSNG PhylomeDB:Q9LRA9 ProtClustDB:CLSN2687931
            Genevestigator:Q9LRA9 Uniprot:Q9LRA9
        Length = 354

 Score = 194 (73.4 bits), Expect = 7.4e-22, Sum P(2) = 7.4e-22
 Identities = 42/123 (34%), Positives = 72/123 (58%)

Query:   198 VALFTKYIEWSAVWNTGIVISILEQNKHGDV-----ALGVSAYISRGSRKRGFLIMLVFF 252
             +A+F+K   WSA WN G+V+S+LE+ ++G       AL +S+   +G  KRG  +MLVF 
Sbjct:   224 LAMFSK---WSAGWNMGLVVSVLEEEENGQSIYGTDALTLSSNYGKGHEKRGLQVMLVFL 280

Query:   253 AWSFGLRLSSLCVGWQK---GNVVIEVILAQACLDCLGSVMKWVAFMIYFYDCKKRFIEK 309
              ++  +R+   C    +   GN V+        + C+G+++KWVA ++++ DC+   +EK
Sbjct:   281 VFAIAMRMPCFCFKCTESSNGNRVLYTSFYVGLI-CVGNMIKWVACVVFYEDCRTSVLEK 339

Query:   310 KFD 312
             K D
Sbjct:   340 KGD 342

 Score = 112 (44.5 bits), Expect = 7.4e-22, Sum P(2) = 7.4e-22
 Identities = 45/159 (28%), Positives = 72/159 (45%)

Query:    10 MKEKLEFVGIIKEALKIPIKNPKFIFTILLTSFPLFCSLCLHETMFQRALIESLRSQADD 69
             ++E L F+ I+K A K+   N   +  + L S PLFC L   E   Q  +  SL SQ   
Sbjct:     3 IEEDLGFINILKRATKLLCGNINLVLFLFLCSLPLFCFLIFFELSLQTTV--SLASQ--- 57

Query:    70 PPPHEGFVITVRSSGPSLYAFRDW--IGNFISRKLLVQSFTLLAIIH-LFDLLNTIITVS 126
                   +++   ++    Y  +D   + N I   LL+Q+F L    + L DL  T   VS
Sbjct:    58 ------YLVRQLTNWDYYYVPQDASVLENLIP--LLIQTFLLYLFPYGLIDLFTTTTIVS 109

Query:   127 TASAIYAEEKEMMT-KPFIENKVAL----FKGPLITSIY 160
              +  ++  E+E +     +   V +     +G LITS+Y
Sbjct:   110 ASWTVHTSEEEPLRFGQLVRRTVEICQNRLEGCLITSLY 148


>TAIR|locus:2034913 [details] [associations]
            symbol:AT1G23840 species:3702 "Arabidopsis thaliana"
            [GO:0005739 "mitochondrion" evidence=ISM] EMBL:CP002684
            GenomeReviews:CT485782_GR EMBL:AC005990 ProtClustDB:CLSN2687931
            EMBL:AY037188 EMBL:AY142035 IPI:IPI00548446 PIR:H86372
            RefSeq:NP_564206.1 UniGene:At.10950 STRING:Q9ZUB1
            EnsemblPlants:AT1G23840.1 GeneID:838995 KEGG:ath:AT1G23840
            TAIR:At1g23840 eggNOG:NOG273871 HOGENOM:HOG000153084
            InParanoid:Q9ZUB1 OMA:NINLALF PhylomeDB:Q9ZUB1
            Genevestigator:Q9ZUB1 Uniprot:Q9ZUB1
        Length = 338

 Score = 202 (76.2 bits), Expect = 6.2e-21, Sum P(2) = 6.2e-21
 Identities = 47/130 (36%), Positives = 72/130 (55%)

Query:   198 VALFTKYIEWSAVWNTGIVISILEQNK-----HGDVALGVSAYISRGSRKRGFLIMLVFF 252
             + L  K+ +WSA WN  +V+S+LE+ +     +G  AL +SA+  RG  KR F +MLVF 
Sbjct:   207 IVLAAKFSKWSAGWNISMVVSVLEEEEDSKGIYGSSALSLSAWYLRGQEKRDFWMMLVFL 266

Query:   253 AWSFGLRLSSL---CVGWQKGNVVIEVILAQACLDCLGSVMKWVAFMIYFYDCKKRFIEK 309
               +   R+  L   C     GN V+   L    L C+G+V+KWV+ +++++DC  R + K
Sbjct:   267 VGALVTRMPCLYYKCSESLSGNGVLYTGL-YVSLICVGNVVKWVSCVVWYHDCNTRVLRK 325

Query:   310 KFDGSAEGKA 319
             K D     KA
Sbjct:   326 KGDVEIGSKA 335

 Score = 89 (36.4 bits), Expect = 6.2e-21, Sum P(2) = 6.2e-21
 Identities = 54/216 (25%), Positives = 89/216 (41%)

Query:    11 KEKLEFVGIIKEALKIPIKNPKFIFTILLTSFPLFCSLCLHETMFQRALIESLRSQADDP 70
             +EKL  + ++K ALK+   N      + L S PLFC L   E   Q  +  SL S     
Sbjct:     6 EEKLSVIELLKRALKLLFGNINLALFLFLCSLPLFCFLIFFELSLQTTV--SLAST---- 59

Query:    71 PPHEGFVITVRSSGPSLYAFRDWIGNFISRKLLVQSFTLLAIIHLFDLLNTIITVSTASA 130
                  ++  + +S   L +  D +   I   LL   F    I+ L  L  T I  +++ A
Sbjct:    60 -----YISKLVNSEEDL-SENDLLPWLIQTTLLY--FFPYTILDL--LTTTTIVAASSIA 109

Query:   131 IYAEEKEM-----MTKPF--IENKVALFKGPLITSIYAXXXXXXXXXXXFTLSINIYVMW 183
               +EE+ +     + + F   +NKV    G LITS+Y            F+ S  IY+ +
Sbjct:   110 YTSEEEPLGLLYLVGRSFKLCQNKVG---GCLITSLYVLLLSTSVFLGLFSGS-TIYLYF 165

Query:   184 SAXXXXXXXXXXXXVALFTKYIEWSAVWNTGIVISI 219
             ++            V    +++E + V    +V+ I
Sbjct:   166 ASLTLEQQIFFNQAVVQDQRFLEQAVVLLDVVVVLI 201


>TAIR|locus:2034908 [details] [associations]
            symbol:AT1G23830 "AT1G23830" species:3702 "Arabidopsis
            thaliana" [GO:0003674 "molecular_function" evidence=ND] [GO:0005739
            "mitochondrion" evidence=ISM] [GO:0008150 "biological_process"
            evidence=ND] EMBL:CP002684 GenomeReviews:CT485782_GR EMBL:AC005990
            ProtClustDB:CLSN2687931 HOGENOM:HOG000153084 EMBL:AY058100
            EMBL:AY098950 IPI:IPI00531196 PIR:G86372 RefSeq:NP_564205.1
            UniGene:At.10951 UniGene:At.22492 STRING:Q9ZUB2
            EnsemblPlants:AT1G23830.1 GeneID:838994 KEGG:ath:AT1G23830
            TAIR:At1g23830 eggNOG:NOG73609 InParanoid:Q9ZUB2 OMA:PYLSREY
            PhylomeDB:Q9ZUB2 Genevestigator:Q9ZUB2 Uniprot:Q9ZUB2
        Length = 345

 Score = 174 (66.3 bits), Expect = 3.1e-17, Sum P(2) = 3.1e-17
 Identities = 38/121 (31%), Positives = 65/121 (53%)

Query:   198 VALFTKYIEWSAVWNTGIVISILEQNK-----HGDVALGVSAYISRGSRKRGFLIMLVFF 252
             + L  KY +WS+ WN G+V+S+LE+++     +G  AL +S +  +G  KR   +ML+F 
Sbjct:   217 IVLTAKYSKWSSGWNMGLVVSVLEEDEDGQGIYGGDALSLSGWYRKGHEKRDLWLMLMFL 276

Query:   253 AWSFGLRLSSLCVGWQ-KGNVVIEVILAQACLDCLGSVMKWVAFMIYFYDCKKRFIEKKF 311
              +    R+  L       GN V+        + C+G+++KWV  +  ++DCK   + KK 
Sbjct:   277 VFGLATRMPCLYSKCSASGNGVMYTGFYVGLI-CVGNLLKWVTCLACYHDCKTMVLRKKR 335

Query:   312 D 312
             D
Sbjct:   336 D 336

 Score = 94 (38.1 bits), Expect = 3.1e-17, Sum P(2) = 3.1e-17
 Identities = 41/162 (25%), Positives = 74/162 (45%)

Query:    11 KEKLEFVGIIKEALKIPIKNPKFIFTILLTSFPLFCSLCLHETMFQRAL------IESLR 64
             +EKL  + ++K ALK+   N   +  + L S PLF  L   E   Q  +      +  L 
Sbjct:     6 EEKLSVIELLKRALKLLFGNINLLLFLCLCSLPLFFFLIFFELSLQTTVYLTSQFLWKLL 65

Query:    65 SQADDPPPHEGFVITVRSSGPSLYAFRDWIGNFISRKLLVQSFTLLAIIH-LFDLLNTII 123
                +D P ++  +I+ + +   L +   W         L+Q+F L    + + DLL T  
Sbjct:    66 ILGEDLPENDLILISEKKN--DLIS---W---------LIQTFLLYFFPYTILDLLTTTT 111

Query:   124 TVSTASAIYAEEKEMMTKPF-IENKVALFK----GPLITSIY 160
              V+ +S +Y  ++E +   + +E  + + +    G LITS+Y
Sbjct:   112 IVAASSIVYTSKEEPLGLLYLVERSIKICQNRVGGCLITSLY 153


>TAIR|locus:2054005 [details] [associations]
            symbol:AT2G18680 "AT2G18680" species:3702 "Arabidopsis
            thaliana" [GO:0003674 "molecular_function" evidence=ND] [GO:0008150
            "biological_process" evidence=ND] [GO:0005783 "endoplasmic
            reticulum" evidence=IDA] [GO:0030968 "endoplasmic reticulum
            unfolded protein response" evidence=RCA] GO:GO:0005783
            EMBL:CP002685 UniGene:At.13211 UniGene:At.73918 IPI:IPI00530803
            RefSeq:NP_028489.2 EnsemblPlants:AT2G18680.1 GeneID:816383
            KEGG:ath:AT2G18680 OMA:ASALTHK ArrayExpress:F4IRF0 Uniprot:F4IRF0
        Length = 287

 Score = 171 (65.3 bits), Expect = 5.9e-11, P = 5.9e-11
 Identities = 56/210 (26%), Positives = 95/210 (45%)

Query:    97 FISRKLLVQS-FTLLAIIHLFDLLNTIITVSTASAIYAEEKEMMTKPFIENKVALFKGPL 155
             F+  +  V S +  +A+  + +LL+T++ V  ASA+  ++     K F    +  +KGPL
Sbjct:    55 FVDFRQFVNSLYIFIAVSSIINLLSTLVMVH-ASALTHKDDSFEIKDFPILTLKYWKGPL 113

Query:   156 ITSIYAXXXXXXXXXXXFTLSINIYVMWSAXXXXXXXXXXXXVALFTKYIEWSA-VWNTG 214
             +T+ Y            F +  +I V +S               +F  +  + A VWN  
Sbjct:   114 VTNFYIVLFSLGYWFLFFIVLFSI-VFFSTKLDSLAAKSRALFIVFAVFESYLAIVWNLS 172

Query:   215 IVISILEQNKHGDVALGVSAYISRGSRKRGFLIMLVFFAWSFGLRLSSLCVGWQKG-NVV 273
             +VISILE   +G  ALG +A I +G + + FL+ L F   SFGL      V W    +V 
Sbjct:   173 MVISILEDT-YGIQALGKAAKIVKGMKPKLFLLNLFFGLLSFGLVQILRLVDWSSSFSVT 231

Query:   274 IEVILAQACLDCLGSVMKWVAFMIYFYDCK 303
             +   L       +  + + V + + ++ CK
Sbjct:   232 LTTGLVLMSSVFVVRMFQLVTYTVAYFQCK 261


>TAIR|locus:2054016 [details] [associations]
            symbol:AT2G18690 "AT2G18690" species:3702 "Arabidopsis
            thaliana" [GO:0003674 "molecular_function" evidence=ND] [GO:0008150
            "biological_process" evidence=ND] [GO:0009507 "chloroplast"
            evidence=ISM] [GO:0016020 "membrane" evidence=IDA] [GO:0006569
            "tryptophan catabolic process" evidence=RCA] [GO:0006612 "protein
            targeting to membrane" evidence=RCA] [GO:0009684 "indoleacetic acid
            biosynthetic process" evidence=RCA] [GO:0009863 "salicylic acid
            mediated signaling pathway" evidence=RCA] [GO:0010200 "response to
            chitin" evidence=RCA] [GO:0010363 "regulation of plant-type
            hypersensitive response" evidence=RCA] [GO:0043069 "negative
            regulation of programmed cell death" evidence=RCA] [GO:0050832
            "defense response to fungus" evidence=RCA] EMBL:CP002685
            GenomeReviews:CT485783_GR GO:GO:0016020 EMBL:AC005724
            IPI:IPI00530999 PIR:D84567 RefSeq:NP_028490.1 UniGene:At.13212
            UniGene:At.67780 STRING:Q9ZV49 PRIDE:Q9ZV49
            EnsemblPlants:AT2G18690.1 GeneID:816384 KEGG:ath:AT2G18690
            TAIR:At2g18690 HOGENOM:HOG000015111 InParanoid:Q9ZV49 OMA:DVEYMAL
            PhylomeDB:Q9ZV49 ProtClustDB:CLSN2688238 ArrayExpress:Q9ZV49
            Genevestigator:Q9ZV49 InterPro:IPR010380 Pfam:PF06161
            Uniprot:Q9ZV49
        Length = 322

 Score = 160 (61.4 bits), Expect = 2.0e-09, P = 2.0e-09
 Identities = 73/299 (24%), Positives = 132/299 (44%)

Query:    15 EFVGIIKEALKIPIKNPKFIFTILLTSFPLFCSLCLHETMFQRALIESLRSQADDPPPHE 74
             + V I+ E+ K+ +KN K +F++L+  FPL  + CL    F    +  +  +  +     
Sbjct:    12 DVVAILNESRKLFLKNKKLMFSVLV--FPLLLN-CL--VYFLNIFV--IVPEITNLILEA 64

Query:    75 GFVITVRSSGPSLYAFRDWIGNFIS-RKLLVQSFTLLAIIHLFDLLNTIITVSTASAIYA 133
               + +   + P  YA R  +  F   R+ +  S+   A+  + +L + ++ V  ASAI  
Sbjct:    65 SLLPSTDPTSPE-YAAR-LMRVFTDFRQFVGSSYIFAAVSSIINLFSVLVIVH-ASAITL 121

Query:   134 EEKEMMTKPFIENKVALFKGPLITSIYAXXXXXXXXXXXFTLSINIYVMWSAXXXXXXXX 193
             +++    K F    +  +KGPL+T  Y            F +   I +++S         
Sbjct:   122 KDENFNIKDFPVLSLKSWKGPLVTYFYIALFSLGFGFLFFIILCPI-LLFSIKSGSVENI 180

Query:   194 XXXXVA------LFTKYIEWSAV-WNTGIVISILEQNKHGDVALGVSAYISRGSRKRGFL 246
                 V       +FT    + A+ WN  +VISILE++ +G  ALG +A I +G + + FL
Sbjct:   181 GFLAVEAGVLLIIFTVSQSYFAIYWNLSMVISILEES-YGFQALGKAAKIVKGMKTKLFL 239

Query:   247 IMLVFFAWSFGLR--LSSLCVGWQKGNVVIEVILAQACLDCLGSVMKWVAFMIYFYDCK 303
             + L F   + GL   L  + +G     V +       CL     + + V + + ++ CK
Sbjct:   240 LNLFFGLLASGLAQILQLINMGRSLA-VTLTTGFVLVCLVFAVRMFQLVTYTVAYFQCK 297


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.327   0.140   0.427    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      319       296   0.00093  115 3  11 22  0.42    33
                                                     33  0.44    37


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  5
  No. of states in DFA:  619 (66 KB)
  Total size of DFA:  221 KB (2120 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  21.20u 0.08s 21.28t   Elapsed:  00:00:24
  Total cpu time:  21.21u 0.08s 21.29t   Elapsed:  00:00:25
  Start:  Fri May 10 20:01:32 2013   End:  Fri May 10 20:01:57 2013

Back to top