BLAST Results

Query Summary

Your job contains 1 sequence.

Parameters
Threshold: 0.001
Maximum number of alignments shown: 100
BLAST filter: on

Query Sequence

>007079
MASQLSHYPRATGHRANPPLIFTTRRTTPQQINFWSRRTGAKVGVSNSEGGGSYLDMWQK
AVDRDRKEIEFQKIAGSLAESGDVDGNEGGGGRDLTEQLEKKSEEFSKILDVSKEERDRI
QRLQVIDRAAAAIAAARAILEEKNGSVVKNGESSGTAEVSRFVKKNSESSGAAEISPFVK
NSESNGTAEVPERGALSAGIFVPRSGTPGNRTPAPGPDFWSWSPPEDDDRDMRDVRDLQM
AEKSSVYPTPVNPVVEKARSVDILPIPFESKLSEPKPDPLLPPFQSLLGVEKEEVSETNL
ETPSLEEERDLGALFSAHAAEAAHALDKVDELATRGINPDGSRWWKETGIEQRPDGVVCR
WTMTRGVSADEALEWQEKFWEAADELGHKELGSEKSGRDATGNVWREFWTESMWQNQGLV
HLEKTADKWGKNGNGDEWQEKWWEHYDASGKAEKWAHKWCSIDPNTQLDAGHAHVWHERW
GEKYDGHGGSMKYTDKWAERCEGDGWSKWGDKWDENFDPNSHGVKQGETWWAGKYGERWN
RTWGERHNGSGWVHKYGKSSSGELWDTHEQQETWYERFPHFGFYHCFDNSVQLREVRKPS
EFQEEPFEIQDKRSELQEP

High Scoring Gene Products

Symbol, full name Information P value
AT3G55760 protein from Arabidopsis thaliana 1.0e-164
AT1G42430 protein from Arabidopsis thaliana 4.8e-62
KTF1
AT5G04290
protein from Arabidopsis thaliana 0.00037

Back to top

Raw Blast Data

BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]

Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.

Reference:  Gish, W. (1996-2006) http://blast.wustl.edu

Query=  007079
        (619 letters)

Database:  go_20130330-seqdb.fasta
           368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done

                                                                     Smallest
                                                                       Sum
                                                              High  Probability
Sequences producing High-scoring Segment Pairs:              Score  P(N)      N

TAIR|locus:2079001 - symbol:AT3G55760 species:3702 "Arabi...  1603  1.0e-164  1
TAIR|locus:2035898 - symbol:AT1G42430 "AT1G42430" species...   634  4.8e-62   1
TAIR|locus:2179979 - symbol:KTF1 "AT5G04290" species:3702...   141  0.00037   3


>TAIR|locus:2079001 [details] [associations]
            symbol:AT3G55760 species:3702 "Arabidopsis thaliana"
            [GO:0009507 "chloroplast" evidence=IDA] [GO:0009570 "chloroplast
            stroma" evidence=IDA] GO:GO:0009570 EMBL:CP002686
            GenomeReviews:BA000014_GR EMBL:BT020590 IPI:IPI00536838
            RefSeq:NP_001190098.1 RefSeq:NP_191135.1 RefSeq:NP_850708.1
            UniGene:At.1705 ProteinModelPortal:Q5EAH9 IntAct:Q5EAH9
            STRING:Q5EAH9 PaxDb:Q5EAH9 PRIDE:Q5EAH9 EnsemblPlants:AT3G55760.1
            EnsemblPlants:AT3G55760.2 EnsemblPlants:AT3G55760.3 GeneID:824742
            KEGG:ath:AT3G55760 TAIR:At3g55760 eggNOG:NOG137712
            HOGENOM:HOG000243874 InParanoid:Q5EAH9 OMA:GWVHKYG PhylomeDB:Q5EAH9
            ProtClustDB:CLSN2683991 Genevestigator:Q5EAH9 Uniprot:Q5EAH9
        Length = 578

 Score = 1603 (569.3 bits), Expect = 1.0e-164, P = 1.0e-164
 Identities = 279/452 (61%), Positives = 335/452 (74%)

Query:   153 SSGTAEVSRFVKKNSESSGAAEISPFVKNSESNGTAEVPERGAL---SAGIFVPRSGTPG 209
             ++     +R +  ++ S    E  P   N+ ++   E P+   L   S  ++VPRS T G
Sbjct:   132 AAAAISAARAILASNNSGDGKEGFPNEDNTVTSEVTETPKNAKLGMWSRTVYVPRSETSG 191

Query:   210 NRTPAPGPDFWSWSPPEXXXXXXXXXXXLQMAEKSSVYPTPVNPVVEKARSVDILPIPFE 269
               TP  GPDFWSW+PP+           LQ  EK + +PT  NPV+EK +S D L IP+E
Sbjct:   192 TETP--GPDFWSWTPPQGSEISSVD---LQAVEKPAEFPTLPNPVLEKDKSADSLSIPYE 246

Query:   270 SKLSEPKPDPLLPPFQSLLGVEKEEVSETNLETPSLEEERDLGALFSXXXXXXXXXLDKV 329
             S LS  +    +PPF+SL+ V KE  +ET   + +L  E DL  + S         LD +
Sbjct:   247 SMLSSERHSFTIPPFESLIEVRKE--AETKPSSETLSTEHDLDLISSANAEEVARVLDSL 304

Query:   330 DELATRGINPDGSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQEKFWEAADELGHK 389
             DE +T G++ DG +WWK+TG+E+RPDGVVCRWTM RGV+AD  +EWQ+K+WEA+D+ G K
Sbjct:   305 DESSTHGVSEDGLKWWKQTGVEKRPDGVVCRWTMIRGVTADGVVEWQDKYWEASDDFGFK 364

Query:   390 ELGSEKSGRDATGNVWREFWTESMWQNQGLVHLEKTADKWGKNGNGDEWQEKWWEHYDAS 449
             ELGSEKSGRDATGNVWREFW ESM Q  G+VH+EKTADKWGK+G GDEWQEKWWEHYDA+
Sbjct:   365 ELGSEKSGRDATGNVWREFWRESMSQENGVVHMEKTADKWGKSGQGDEWQEKWWEHYDAT 424

Query:   450 GKAEKWAHKWCSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYTDKWAERCEGDGWSKW 509
             GK+EKWAHKWCSID NT LDAGHAHVWHERWGEKYDG GGS KYTDKWAER  GDGW KW
Sbjct:   425 GKSEKWAHKWCSIDRNTPLDAGHAHVWHERWGEKYDGQGGSTKYTDKWAERWVGDGWDKW 484

Query:   510 GDKWDENFDPNSHGVKQGETWWAGKYGERWNRTWGERHNGSGWVHKYGKSSSGELWDTHE 569
             GDKWDENF+P++ GVKQGETWW GK+G+RWNR+WGE HNGSGWVHKYGKSSSGE WDTH 
Sbjct:   485 GDKWDENFNPSAQGVKQGETWWEGKHGDRWNRSWGEGHNGSGWVHKYGKSSSGEHWDTHV 544

Query:   570 QQETWYERFPHFGFYHCFDNSVQLREVRKPSE 601
              QETWYE+FPHFGF+HCFDNSVQLR V+KPS+
Sbjct:   545 PQETWYEKFPHFGFFHCFDNSVQLRAVKKPSD 576

 Score = 386 (140.9 bits), Expect = 7.5e-34, P = 7.5e-34
 Identities = 160/574 (27%), Positives = 238/574 (41%)

Query:    22 FTTRRTTPQQINFWSRRTGAKV-GVSNSEGGGSYLDMWQKAVDRDRKEIEFQKIAGSLAE 80
             FT   T+ + +     RTG ++  VSN EG  SYLDMW+ AVDR++KE  F+KIA ++  
Sbjct:    35 FTAPVTSRRSLR--GSRTGVRILRVSN-EGRESYLDMWKNAVDREKKEKAFEKIAENVVA 91

Query:    81 SXXXXXXXXXXXXXLTEQLEKKSEEFSKILDVSKEERDRIQRLQVIDXXXXXXXXXXXXL 140
                               LEKKS+EF KIL+VS EERDRIQR+QV+D            L
Sbjct:    92 VDGEKEKGG--------DLEKKSDEFQKILEVSVEERDRIQRMQVVDRAAAAISAARAIL 143

Query:   141 EEKNGSVVKNG----ESSGTAEVSRFVKKNSESSGAAEISPFVKNSESNGTAEVPERGAL 196
                N    K G    +++ T+EV+    KN++  G    + +V  SE++GT E P     
Sbjct:   144 ASNNSGDGKEGFPNEDNTVTSEVTE-TPKNAKL-GMWSRTVYVPRSETSGT-ETPGPDFW 200

Query:   197 SAGIFVPRSGTPGNRTPAPGPDFWSWSPPEXXXXXXXXXXXLQMAEKSSVYPTPVNPVVE 256
             S   + P  G+  +       +     P E            + A+  S+   P   ++ 
Sbjct:   201 S---WTPPQGSEISSVDLQAVE----KPAEFPTLPNPVLEKDKSADSLSI---PYESMLS 250

Query:   257 KARSVDILPIPFESKLSEPKPDPLLPPFQSLLGVEKEE--VSETNLET-----PSLEEER 309
               R    +P PFES L E + +    P    L  E +   +S  N E       SL+E  
Sbjct:   251 SERHSFTIP-PFES-LIEVRKEAETKPSSETLSTEHDLDLISSANAEEVARVLDSLDESS 308

Query:   310 DLGALFSXXXXXXXXXLDKVDE------LATRGINPDGSRWWKETGIEQRPD-GVVCRWT 362
               G             ++K  +         RG+  DG   W++   E   D G     +
Sbjct:   309 THGVSEDGLKWWKQTGVEKRPDGVVCRWTMIRGVTADGVVEWQDKYWEASDDFGFKELGS 368

Query:   363 MTRGVSADEALEWQEKFWEAA--DELG--HKELGSEKSGRDATGNVWREFWTESMWQNQG 418
                G  A   + W+E FW  +   E G  H E  ++K G+   G+ W+E W E  +   G
Sbjct:   369 EKSGRDATGNV-WRE-FWRESMSQENGVVHMEKTADKWGKSGQGDEWQEKWWEH-YDATG 425

Query:   419 LVHLEKTADKW---GKN-----GNGDEWQEKWWEHYDASGKAEKWAHKWCSIDPNTQLDA 470
                 EK A KW    +N     G+   W E+W E YD  G + K+  KW         D 
Sbjct:   426 --KSEKWAHKWCSIDRNTPLDAGHAHVWHERWGEKYDGQGGSTKYTDKWAERWVGDGWDK 483

Query:   471 GHAHVWHERWGEKYDGHGGSMKYTDKWAERCEGDGWSKWGDKWDENFDPNSHGVKQGETW 530
                  W ++W E ++     +K  + W E   GD W++    W E  + +    K G++ 
Sbjct:   484 -----WGDKWDENFNPSAQGVKQGETWWEGKHGDRWNR---SWGEGHNGSGWVHKYGKS- 534

Query:   531 WAGKYGERWN-----RTWGERHNGSGWVHKYGKS 559
                  GE W+      TW E+    G+ H +  S
Sbjct:   535 ---SSGEHWDTHVPQETWYEKFPHFGFFHCFDNS 565


>TAIR|locus:2035898 [details] [associations]
            symbol:AT1G42430 "AT1G42430" species:3702 "Arabidopsis
            thaliana" [GO:0003674 "molecular_function" evidence=ND] [GO:0005575
            "cellular_component" evidence=ND] [GO:0008150 "biological_process"
            evidence=ND] EMBL:CP002684 IPI:IPI00525753 RefSeq:NP_174971.5
            UniGene:At.39108 PRIDE:F4I9G2 DNASU:840847
            EnsemblPlants:AT1G42430.1 GeneID:840847 KEGG:ath:AT1G42430
            OMA:ANEKDWG Uniprot:F4I9G2
        Length = 426

 Score = 634 (228.2 bits), Expect = 4.8e-62, P = 4.8e-62
 Identities = 123/280 (43%), Positives = 171/280 (61%)

Query:   326 LDKVDELATR-GINPDGSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQEKFWEAAD 384
             +D ++E     G N DGS W++E+G +   +G  CRW+   G S D + EW E +WE +D
Sbjct:   134 IDLLNENVNEAGTNEDGSSWFRESGHDLGDNGYRCRWSRMGGRSHDGSSEWTETWWEKSD 193

Query:   385 ELGHKELGSEKSGRDATGNVWREFWTESMWQNQ--GLVHLEKTADKWGKNGNGDE-WQEK 441
               G+KELG EKSG+++ G+ W E W E + Q++   L  +E++A K  K+G  +  W EK
Sbjct:   194 WTGYKELGVEKSGKNSEGDSWWETWQEVLHQDEWSNLARIERSAQKQAKSGTENAGWYEK 253

Query:   442 WWEHYDASGKAEKWAHKWCSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYTDKWAERC 501
             WWE YDA G  EK AHK+  ++  +         W E+WGE YDG G  +K+TDKWAE  
Sbjct:   254 WWEKYDAKGWTEKGAHKYGRLNEQS---------WWEKWGEHYDGRGSVLKWTDKWAETE 304

Query:   502 EGDGWSKWGDKWDENFDPNSHGVKQGETWWAGKYGERWNRTWGERHNGSGWVHKYGKSSS 561
              G   +KWGDKW+E F  +  G +QGETW      +RW+RTWGE H G+G VHKYGKS++
Sbjct:   305 LG---TKWGDKWEEKFF-SGIGSRQGETWHVSPNSDRWSRTWGEEHFGNGKVHKYGKSTT 360

Query:   562 GELWDTHEQQETWYERFPHFGFYHCFDNSVQLREVRKPSE 601
             GE WD    +ET+YE  PH+G+     +S QL  ++ P E
Sbjct:   361 GESWDIVVDEETYYEAEPHYGWADVVGDSTQLLSIQ-PRE 399


>TAIR|locus:2179979 [details] [associations]
            symbol:KTF1 "AT5G04290" species:3702 "Arabidopsis
            thaliana" [GO:0000166 "nucleotide binding" evidence=ISS]
            [GO:0005634 "nucleus" evidence=ISM] [GO:0006306 "DNA methylation"
            evidence=IMP] [GO:0030422 "production of siRNA involved in RNA
            interference" evidence=IMP] InterPro:IPR017071 EMBL:CP002688
            GO:GO:0006357 GO:GO:0030422 GO:GO:0006306 GO:GO:0032784
            InterPro:IPR005824 SMART:SM00739 InterPro:IPR005100
            PANTHER:PTHR11125:SF7 Pfam:PF03439 IPI:IPI00544683
            RefSeq:NP_196049.1 UniGene:At.54715 ProteinModelPortal:F4JW79
            SMR:F4JW79 IntAct:F4JW79 PRIDE:F4JW79 EnsemblPlants:AT5G04290.1
            GeneID:830308 KEGG:ath:AT5G04290 OMA:SSWGKKD Uniprot:F4JW79
        Length = 1493

 Score = 141 (54.7 bits), Expect = 0.00037, Sum P(3) = 0.00037
 Identities = 64/253 (25%), Positives = 102/253 (40%)

Query:   328 KVDELATRGINPDGSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQEKFWEAADELG 387
             K D  A+ G   DG  W K+     + DG         G   D    W++KF +     G
Sbjct:   947 KGDGAASWGKKDDGGSWGKKDD-GNKDDGGSSWGKKDDGQKDDGGSSWEKKF-DGGSSWG 1004

Query:   388 HKELGSEKSGR-DATGNVW-REFWTESMW--QNQGLVHLEKTAD---KWGKNGNGDE-WQ 439
              K+ G    G+ D  G++W ++    S W  ++ G     K  D    WGK  +G+  W 
Sbjct:  1005 KKDDGGSSWGKKDDGGSLWGKKDDGGSSWGKEDDGGSLWGKKDDGESSWGKKDDGESSWG 1064

Query:   440 EKWWEHYDASGKAEKWAHKWCSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYTDKWAE 499
             +K  +   + GK ++  +   + D   +   G       R G    G G S   ++  A 
Sbjct:  1065 KKD-DGGSSWGKKDEGGYSEQTFDRGGR-GFGGRRGGGRRGGRDQFGRGSSFGNSEDPAP 1122

Query:   500 RCEGDGWSKWGDKWDENFDPNSHGVKQ----GETWWAGKYGERWNRTWGERHNGSGWVHK 555
               +  G S WG K D +   +S G +     G +W  GK       +WG++++GSG    
Sbjct:  1123 WSKPSGGSSWG-KQDGDGGGSSWGKENDAGGGSSW--GKQDNGVGSSWGKQNDGSGGGSS 1179

Query:   556 YGKSSS---GELW 565
             +GK +    G  W
Sbjct:  1180 WGKQNDAGGGSSW 1192

 Score = 41 (19.5 bits), Expect = 0.00037, Sum P(3) = 0.00037
 Identities = 11/43 (25%), Positives = 24/43 (55%)

Query:   145 GSVVKNGESSGT-AEVSRFVKKNSESSGAAEISPFVKNSESNG 186
             G+V   G++S +  E S + K+ + +S  A++  +  +  S+G
Sbjct:   784 GTVSGWGDTSASNVEASSWEKQGASTSNVADLGSWGTHGGSSG 826

 Score = 40 (19.1 bits), Expect = 0.00047, Sum P(3) = 0.00047
 Identities = 10/46 (21%), Positives = 19/46 (41%)

Query:   169 SSGAAEISPFVKNSESNGTAEVPERGALSAGIFVPRSGTPGNRTPA 214
             S   +++SP V +  ++  A        ++    P    P  +TPA
Sbjct:   735 SKPTSDVSPTVADDNTSAWANAAAENKPASASDQPGGWNPWGKTPA 780

 Score = 39 (18.8 bits), Expect = 0.00037, Sum P(3) = 0.00037
 Identities = 10/20 (50%), Positives = 13/20 (65%)

Query:   101 KKSEEFSKILDVSKEERDRI 120
             K S  F K  D+++EE DRI
Sbjct:    92 KSSFVFPKEEDLNEEEFDRI 111


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.311   0.130   0.418    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      619       574   0.00080  120 3  11 23  0.50    34
                                                     36  0.45    37


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  3
  No. of states in DFA:  629 (67 KB)
  Total size of DFA:  431 KB (2199 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  53.80u 0.10s 53.90t   Elapsed:  00:00:03
  Total cpu time:  53.80u 0.10s 53.90t   Elapsed:  00:00:03
  Start:  Mon May 20 22:14:47 2013   End:  Mon May 20 22:14:50 2013

Back to top