BLAST Results

Query Summary

Your job contains 1 sequence.

Parameters
Threshold: 0.001
Maximum number of alignments shown: 100
BLAST filter: on

Query Sequence

>018114
MASARAFTLSRITTNTNTNTGASFNPLHLPRAPFSRNHHHHHDVIRWPTRRGLFPASYSA
TVSCLSSGGGVSNDDFVSTRKSNFDRGFRVIANMLKRIEPLDNSVISKGISESAKDSMKQ
TISSMLGLLPSDQFSITVRLSKQPLHSLLVSSIITGISLMRNFDISVDGLKRLNFSVEGE
VLDKHCEESENEGGEISVEDLEISPQVLGDLSHDALNYIQKLQSDLSNVKEELNSMKHKN
MLMESDKTCRNSLLEYLRFLDPYMVKELSQPSSIEVEEIIHQLVQNILQRFFKDDASNNF
KGHSIFTNAENLEEVNNENCHSIDTSRDYLAKLLFWCMLLGHHLRGLENRLHLTCAVGLL

High Scoring Gene Products

Symbol, full name Information P value
AT5G14970 protein from Arabidopsis thaliana 2.4e-85
AT2G14910 protein from Arabidopsis thaliana 3.3e-31
AT1G63610 protein from Arabidopsis thaliana 7.3e-12

Back to top

Raw Blast Data

BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]

Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.

Reference:  Gish, W. (1996-2006) http://blast.wustl.edu

Query=  018114
        (360 letters)

Database:  go_20130330-seqdb.fasta
           368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done

                                                                     Smallest
                                                                       Sum
                                                              High  Probability
Sequences producing High-scoring Segment Pairs:              Score  P(N)      N

TAIR|locus:2147875 - symbol:AT5G14970 "AT5G14970" species...   854  2.4e-85   1
TAIR|locus:2060495 - symbol:AT2G14910 species:3702 "Arabi...   343  3.3e-31   1
TAIR|locus:2026669 - symbol:AT1G63610 "AT1G63610" species...   152  7.3e-12   2


>TAIR|locus:2147875 [details] [associations]
            symbol:AT5G14970 "AT5G14970" species:3702 "Arabidopsis
            thaliana" [GO:0000023 "maltose metabolic process" evidence=RCA]
            [GO:0015996 "chlorophyll catabolic process" evidence=RCA]
            [GO:0019252 "starch biosynthetic process" evidence=RCA]
            EMBL:CP002688 EMBL:AL391146 InterPro:IPR008479 Pfam:PF05542
            EMBL:AY140088 EMBL:BT008744 IPI:IPI00535231 PIR:T51442
            RefSeq:NP_197001.1 UniGene:At.9888 STRING:Q9LFQ8
            EnsemblPlants:AT5G14970.1 GeneID:831349 KEGG:ath:AT5G14970
            TAIR:At5g14970 InParanoid:Q9LFQ8 OMA:LWNAEYR PhylomeDB:Q9LFQ8
            ProtClustDB:CLSN2916828 ArrayExpress:Q9LFQ8 Genevestigator:Q9LFQ8
            Uniprot:Q9LFQ8
        Length = 355

 Score = 854 (305.7 bits), Expect = 2.4e-85, P = 2.4e-85
 Identities = 204/372 (54%), Positives = 253/372 (68%)

Query:     2 ASARAF-TLSRITTNTNTNTGASFNPLHLPRAPFSRNHHHHHDVIRWPTRRGLFPASYSA 60
             ASARAF  LSR+T  +          LH P  P S + H     + +   R +   S SA
Sbjct:     4 ASARAFFMLSRVTDLSKKKL-----ILHQP--PPSSSPHR----LPYAPNRAV---SSSA 49

Query:    61 TVSCLSSGGGVSNDD-FVSTRKSNFDRGFRVIANMLKRIEPLDNSVISKGISESAKDSMK 119
              +SCLS GGGVS+DD +VSTR+S  DRGF VIAN++ RI+PLD SVISKG+S+SAKDSMK
Sbjct:    50 VISCLS-GGGVSSDDSYVSTRRSKLDRGFAVIANLVNRIQPLDTSVISKGLSDSAKDSMK 108

Query:   120 QTISSMLGLLPSDQFSITVRLSKQPLHSLLVSSIITG---------ISLMRNFDISVDGL 170
             QTISSMLGLLPSDQFS++V +S+QPL+ LL+SSIITG         +SL RNFDI +D  
Sbjct:   109 QTISSMLGLLPSDQFSVSVTISEQPLYRLLISSIITGYTLWNAEYRVSLRRNFDIPIDPR 168

Query:   171 KRLNFSVEGEVLDKHCEESENEGGEISVEDLE-ISPQVLGDLSHDALNYIQKLQSDLSNV 229
             K        + +    E+  +E     VE+ E +SPQV GDLS +AL+YIQ LQS+LS++
Sbjct:   169 KEEEDQSSKDNVRFGSEKGMSEDLGNCVEEFERLSPQVFGDLSPEALSYIQLLQSELSSM 228

Query:   230 KEELNSMKHKNMLMESDKTCRNSLLEYLRFLDPYMVKELSQPSSIEVEEIIHQLVQNILQ 289
             KEEL+S K K + +E +K  RN LL+YLR LDP MV ELSQ SS EVEEI++QLVQN+L+
Sbjct:   229 KEELDSQKKKALRIECEKGNRNDLLDYLRSLDPEMVTELSQLSSPEVEEIVNQLVQNVLE 288

Query:   290 RFFKDDASNNF-KGHSIFTXXXXXXXXXXXXCHSIDTSRDYLAKLLFWCMLLGHHLRGLE 348
             R F+D  ++NF +   I T               +DTSRDYLAKLLFWCMLLGHHLRGLE
Sbjct:   289 RLFEDQTTSNFMQNPGIRTTEGGDGTG-----RKVDTSRDYLAKLLFWCMLLGHHLRGLE 343

Query:   349 NRLHLTCAVGLL 360
             NRLHL+C VGLL
Sbjct:   344 NRLHLSCVVGLL 355


>TAIR|locus:2060495 [details] [associations]
            symbol:AT2G14910 species:3702 "Arabidopsis thaliana"
            [GO:0009507 "chloroplast" evidence=ISM] [GO:0015996 "chlorophyll
            catabolic process" evidence=RCA] EMBL:AC005396 EMBL:CP002685
            GenomeReviews:CT485783_GR InterPro:IPR008479 Pfam:PF05542
            EMBL:AY063898 EMBL:AY081268 EMBL:AY096503 IPI:IPI00531401
            PIR:H84522 RefSeq:NP_179097.1 UniGene:At.25001
            EnsemblPlants:AT2G14910.1 GeneID:815980 KEGG:ath:AT2G14910
            TAIR:At2g14910 HOGENOM:HOG000240235 InParanoid:O82329 OMA:IESLWEP
            PhylomeDB:O82329 ProtClustDB:CLSN2683471 ArrayExpress:O82329
            Genevestigator:O82329 Uniprot:O82329
        Length = 386

 Score = 343 (125.8 bits), Expect = 3.3e-31, P = 3.3e-31
 Identities = 88/236 (37%), Positives = 136/236 (57%)

Query:    70 GVSNDDFVSTRKSNFDRGFRVIANMLKRIEPLDNSVISKGISESAKDSMKQTISSMLGLL 129
             G S DDF     S   +   V++++++ IEPLD S+I K +  +  D+MK+TIS MLGLL
Sbjct:    61 GFSLDDFTLHSDSRSPKKC-VLSDLIQEIEPLDVSLIQKDVPVTTLDAMKRTISGMLGLL 119

Query:   130 PSDQFSITVRLSKQPLHSLLVSSIITGISLM---------RNFDISVDGL-----KRLNF 175
             PSD+F + +    +PL  LLVSS++TG +L          +N D+S  GL     +   +
Sbjct:   120 PSDRFQVHIESLWEPLSKLLVSSMMTGYTLRNAEYRLFLEKNLDMSGGGLDSHASENTEY 179

Query:   176 SVEGEVLDKHCEESENEGGEISVEDLEISPQVLGDLSHDALNYIQKLQSDLSNVKEELNS 235
              +EG   D+    S+ +    ++ +  I  + LG +S +A  YI +LQS LS+VK+EL  
Sbjct:   180 DMEGTFPDEDHVSSKRDSRTQNLSET-IDEEGLGRVSSEAQEYILRLQSQLSSVKKELQE 238

Query:   236 MKHKNMLMESDKTC---RNSLLEYLRFLDPYMVKELSQPSSIEVEEIIHQLVQNIL 288
             M+ KN  ++  +     +N LL+YLR L P  V ELS+P++ EV+E IH +V  +L
Sbjct:   239 MRRKNAALQMQQFVGEEKNDLLDYLRSLQPEKVAELSEPAAPEVKETIHSVVHGLL 294

 Score = 264 (98.0 bits), Expect = 7.8e-23, P = 7.8e-23
 Identities = 71/192 (36%), Positives = 102/192 (53%)

Query:   175 FSVEGEVLDKHCEESENEGGEISVEDLEISPQVLGDLSHDALNYIQKLQSDLSNVKEELN 234
             + +EG   D+    S+ +    ++ +  I  + LG +S +A  YI +LQS LS+VK+EL 
Sbjct:   179 YDMEGTFPDEDHVSSKRDSRTQNLSET-IDEEGLGRVSSEAQEYILRLQSQLSSVKKELQ 237

Query:   235 SMKHKNMLMESDKTC---RNSLLEYLRFLDPYMVKELSQPSSIEVEEIIHQLVQNIL--- 288
              M+ KN  ++  +     +N LL+YLR L P  V ELS+P++ EV+E IH +V  +L   
Sbjct:   238 EMRRKNAALQMQQFVGEEKNDLLDYLRSLQPEKVAELSEPAAPEVKETIHSVVHGLLATL 297

Query:   289 --QRFFKDDASN-----NFKGHSIFTXXXXXXXXXXXXCHSIDTSRDYLAKLLFWCMLLG 341
               +   K  AS        K  S                  I  +RDYLA+LLFWCMLLG
Sbjct:   298 SPKMHSKFPASEVPPTETVKAKSDEDCAELVENTSLQFQPLISLTRDYLARLLFWCMLLG 357

Query:   342 HHLRGLENRLHL 353
             H+LRGLE R+ L
Sbjct:   358 HYLRGLEYRMEL 369


>TAIR|locus:2026669 [details] [associations]
            symbol:AT1G63610 "AT1G63610" species:3702 "Arabidopsis
            thaliana" [GO:0003674 "molecular_function" evidence=ND] [GO:0008150
            "biological_process" evidence=ND] [GO:0009507 "chloroplast"
            evidence=ISM;IDA] [GO:0009570 "chloroplast stroma" evidence=IDA]
            [GO:0010207 "photosystem II assembly" evidence=RCA] EMBL:CP002684
            GO:GO:0009570 InterPro:IPR008479 Pfam:PF05542 IPI:IPI00535010
            RefSeq:NP_974078.1 UniGene:At.36100 PRIDE:F4I3N6
            EnsemblPlants:AT1G63610.2 GeneID:842666 KEGG:ath:AT1G63610
            OMA:MFRNAQY PhylomeDB:F4I3N6 Uniprot:F4I3N6
        Length = 341

 Score = 152 (58.6 bits), Expect = 7.3e-12, Sum P(2) = 7.3e-12
 Identities = 51/201 (25%), Positives = 99/201 (49%)

Query:    90 VIANMLKRIEPLDNSVISKGISESAKDSMKQTISSMLGLLPSDQFSITVRLSKQPLHSLL 149
             ++   ++ ++P    +  K   +   ++M+QT+++M+G LP   F++TV    + L  L+
Sbjct:    89 ILLEYVQNVKPEFMEMFVKRAPKHVVEAMRQTVTNMIGTLPPQFFAVTVTSVAENLAQLM 148

Query:   150 VSSIITGISLMRNFDISVDGLKRLNFSVEGEVLD-KHCEESENEGGEISVEDLEIS-PQV 207
             +S ++TG  + RN    ++  + L      E  D K  +E    G + +V    I    V
Sbjct:   149 MSVLMTGY-MFRNAQYRLELQQSLEQVALPEPRDQKGGDEDYAPGTQKNVSGEVIRWNNV 207

Query:   208 LGDLSHDALNYIQKLQSDLSNVKEELNSMKHKNMLMESDKTCRNSLLEYLRFLDPYMVKE 267
              G    DA  YI+ L++++    EELN    +    +     +N +LEYL+ L+P  +KE
Sbjct:   208 SGPEKIDAKKYIELLEAEI----EELNRQVGRKSANQ-----QNEILEYLKSLEPQNLKE 258

Query:   268 LSQPSSIEVEEIIHQLVQNIL 288
             L+  +  +V   ++  V+ +L
Sbjct:   259 LTSTAGEDVAVAMNTFVKRLL 279

 Score = 73 (30.8 bits), Expect = 7.3e-12, Sum P(2) = 7.3e-12
 Identities = 14/35 (40%), Positives = 23/35 (65%)

Query:   324 DTSRDYLAKLLFWCMLLGHHLRGLENRLHLTCAVG 358
             +TS   LAKLL+W M++G+ +R +E R  +   +G
Sbjct:   293 ETSAADLAKLLYWLMVVGYSIRNIEVRFDMERVLG 327


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.319   0.133   0.385    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      360       348   0.00099  116 3  11 22  0.39    34
                                                     33  0.45    37


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  3
  No. of states in DFA:  597 (63 KB)
  Total size of DFA:  213 KB (2119 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  35.15u 0.12s 35.27t   Elapsed:  00:00:01
  Total cpu time:  35.16u 0.12s 35.28t   Elapsed:  00:00:01
  Start:  Fri May 10 19:08:16 2013   End:  Fri May 10 19:08:17 2013

Back to top