BLAST Results

Query Summary

Your job contains 1 sequence.

Parameters
Threshold: 0.001
Maximum number of alignments shown: 100
BLAST filter: on

Query Sequence

>006998
MASQLSHYPRATGHRANPPLIFTTRRTTPQQINFWSRRTGAKVGVSNSEGGGSYLDMWQK
AVDRDRKEIEFQKIAGSLAESGDVDGNEGGGGRDLTEQLEKKSEEFSKILDVSKEERDRI
QRLQVIDRAAAAIAAARAILEEKNGSVVKNGESSGTAEVSRFVKKNSESSGAAEISPFVK
NSESNGTAEVPERDSSGALSAGIFVPRSGTPGNRTPAPGPDFWSWSPPEDDDRDMRDVRD
LQMAEKSSVYPTPVNPVVEKARSVDILPIPFESKLSEPKPDPLLPPFQSLLGVEKEEVSE
TNLETPSLEEERDLGALFSAHAAEAAHALDKVDELATRGINPDGSRWWKETGIEQRPDGV
VCRWTMTRGVSADEALEWQEKFWEAADELGHKELGSEKSGRDATGNVWREFWTESMWQNQ
GLVHLEKTADKWGKNGNGDEWQEKWWEHYDASGKAEKWAHKWCSIDPNTQLDAGHAHVWH
ERWGEKYDGHGGSMKYTDKWAERCEGDGWSKWGDKWDENFDPNSHGVKQGETWWAGKYGE
RWNRTWGERHNGSGWVHKYGKSSSGELWDTHEQQETWYERFPHFGFYHCFDNSVQLREVR
KPSEFQEEPFEIQDKRSELQEP

High Scoring Gene Products

Symbol, full name     Information                          P value
AT3G55760             protein from Arabidopsis thaliana    1.1e-165
AT1G42430             protein from Arabidopsis thaliana    4.8e-62
KTF1, AT5G04290       protein from Arabidopsis thaliana    0.00030


Raw Blast Data

BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]

Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.

Reference:  Gish, W. (1996-2006) http://blast.wustl.edu

Query=  006998
        (622 letters)

Database:  go_20130330-seqdb.fasta
           368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done

                                                                     Smallest
                                                                       Sum
                                                              High  Probability
Sequences producing High-scoring Segment Pairs:              Score  P(N)      N

TAIR|locus:2079001 - symbol:AT3G55760 species:3702 "Arabi...  1612  1.1e-165  1
TAIR|locus:2035898 - symbol:AT1G42430 "AT1G42430" species...   634  4.8e-62   1
TAIR|locus:2179979 - symbol:KTF1 "AT5G04290" species:3702...   141  0.00030   3
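
The one-line hit summaries above have a regular layout: a description (possibly truncated with "..."), then the high score, the smallest sum probability P(N), and N. A minimal sketch of pulling those fields out in Python follows. The names parse_hit_line and HIT_LINE are hypothetical helpers written for illustration and are not part of WU-BLAST; the pattern assumes the three numeric columns are always the last three whitespace-separated fields on the line.

```python
import re

# Hypothetical helper (not part of WU-BLAST): matches one summary line of the
# "Sequences producing High-scoring Segment Pairs" table.
# Assumption: the line ends with three numeric fields: Score, P(N), N.
HIT_LINE = re.compile(
    r"^(?P<desc>.+?)\s+(?P<score>\d+)\s+(?P<p>\d\S*)\s+(?P<n>\d+)\s*$"
)

def parse_hit_line(line):
    """Return a dict for a hit summary line, or None if the line does not match."""
    m = HIT_LINE.match(line)
    if m is None:
        return None
    return {
        # First whitespace-separated token, e.g. "TAIR|locus:2079001"
        "id": m.group("desc").split()[0],
        "score": int(m.group("score")),
        "p_value": float(m.group("p")),
        "n": int(m.group("n")),
    }

if __name__ == "__main__":
    example = 'TAIR|locus:2079001 - symbol:AT3G55760 species:3702 "Arabi...  1612  1.1e-165  1'
    print(parse_hit_line(example))
```

Header and blank lines return None, so the function can be mapped over every line of the report and the None results discarded.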


>TAIR|locus:2079001
            symbol:AT3G55760 species:3702 "Arabidopsis thaliana"
            [GO:0009507 "chloroplast" evidence=IDA] [GO:0009570 "chloroplast
            stroma" evidence=IDA] GO:GO:0009570 EMBL:CP002686
            GenomeReviews:BA000014_GR EMBL:BT020590 IPI:IPI00536838
            RefSeq:NP_001190098.1 RefSeq:NP_191135.1 RefSeq:NP_850708.1
            UniGene:At.1705 ProteinModelPortal:Q5EAH9 IntAct:Q5EAH9
            STRING:Q5EAH9 PaxDb:Q5EAH9 PRIDE:Q5EAH9 EnsemblPlants:AT3G55760.1
            EnsemblPlants:AT3G55760.2 EnsemblPlants:AT3G55760.3 GeneID:824742
            KEGG:ath:AT3G55760 TAIR:At3g55760 eggNOG:NOG137712
            HOGENOM:HOG000243874 InParanoid:Q5EAH9 OMA:GWVHKYG PhylomeDB:Q5EAH9
            ProtClustDB:CLSN2683991 Genevestigator:Q5EAH9 Uniprot:Q5EAH9
        Length = 578

 Score = 1612 (572.5 bits), Expect = 1.1e-165, P = 1.1e-165
 Identities = 279/452 (61%), Positives = 335/452 (74%)

Query:   153 SSGTAEVSRFVKKNSESSGAAEISPFVKNSESNGTAEVPERDSSGALSAGIFVPRSGTPG 212
             ++     +R +  ++ S    E  P   N+ ++   E P+    G  S  ++VPRS T G
Sbjct:   132 AAAAISAARAILASNNSGDGKEGFPNEDNTVTSEVTETPKNAKLGMWSRTVYVPRSETSG 191

Query:   213 NRTPAPGPDFWSWSPPEXXXXXXXXXXXLQMAEKSSVYPTPVNPVVEKARSVDILPIPFE 272
               TP  GPDFWSW+PP+           LQ  EK + +PT  NPV+EK +S D L IP+E
Sbjct:   192 TETP--GPDFWSWTPPQGSEISSVD---LQAVEKPAEFPTLPNPVLEKDKSADSLSIPYE 246

Query:   273 SKLSEPKPDPLLPPFQSLLGVEKEEVSETNLETPSLEEERDLGALFSXXXXXXXXXLDKV 332
             S LS  +    +PPF+SL+ V KE  +ET   + +L  E DL  + S         LD +
Sbjct:   247 SMLSSERHSFTIPPFESLIEVRKE--AETKPSSETLSTEHDLDLISSANAEEVARVLDSL 304

Query:   333 DELATRGINPDGSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQEKFWEAADELGHK 392
             DE +T G++ DG +WWK+TG+E+RPDGVVCRWTM RGV+AD  +EWQ+K+WEA+D+ G K
Sbjct:   305 DESSTHGVSEDGLKWWKQTGVEKRPDGVVCRWTMIRGVTADGVVEWQDKYWEASDDFGFK 364

Query:   393 ELGSEKSGRDATGNVWREFWTESMWQNQGLVHLEKTADKWGKNGNGDEWQEKWWEHYDAS 452
             ELGSEKSGRDATGNVWREFW ESM Q  G+VH+EKTADKWGK+G GDEWQEKWWEHYDA+
Sbjct:   365 ELGSEKSGRDATGNVWREFWRESMSQENGVVHMEKTADKWGKSGQGDEWQEKWWEHYDAT 424

Query:   453 GKAEKWAHKWCSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYTDKWAERCEGDGWSKW 512
             GK+EKWAHKWCSID NT LDAGHAHVWHERWGEKYDG GGS KYTDKWAER  GDGW KW
Sbjct:   425 GKSEKWAHKWCSIDRNTPLDAGHAHVWHERWGEKYDGQGGSTKYTDKWAERWVGDGWDKW 484

Query:   513 GDKWDENFDPNSHGVKQGETWWAGKYGERWNRTWGERHNGSGWVHKYGKSSSGELWDTHE 572
             GDKWDENF+P++ GVKQGETWW GK+G+RWNR+WGE HNGSGWVHKYGKSSSGE WDTH 
Sbjct:   485 GDKWDENFNPSAQGVKQGETWWEGKHGDRWNRSWGEGHNGSGWVHKYGKSSSGEHWDTHV 544

Query:   573 QQETWYERFPHFGFYHCFDNSVQLREVRKPSE 604
              QETWYE+FPHFGF+HCFDNSVQLR V+KPS+
Sbjct:   545 PQETWYEKFPHFGFFHCFDNSVQLRAVKKPSD 576

 Score = 383 (139.9 bits), Expect = 1.9e-33, P = 1.9e-33
 Identities = 161/577 (27%), Positives = 237/577 (41%)

Query:    22 FTTRRTTPQQINFWSRRTGAKV-GVSNSEGGGSYLDMWQKAVDRDRKEIEFQKIAGSLAE 80
             FT   T+ + +     RTG ++  VSN EG  SYLDMW+ AVDR++KE  F+KIA ++  
Sbjct:    35 FTAPVTSRRSLR--GSRTGVRILRVSN-EGRESYLDMWKNAVDREKKEKAFEKIAENVVA 91

Query:    81 SXXXXXXXXXXXXXLTEQLEKKSEEFSKILDVSKEERDRIQRLQVIDXXXXXXXXXXXXL 140
                               LEKKS+EF KIL+VS EERDRIQR+QV+D            L
Sbjct:    92 VDGEKEKGG--------DLEKKSDEFQKILEVSVEERDRIQRMQVVDRAAAAISAARAIL 143

Query:   141 EEKNGSVVKNG----ESSGTAEVSRFVKKNSESSGAAEISPFVKNSESNGTAEVPERDSS 196
                N    K G    +++ T+EV+    KN++  G    + +V  SE++GT E P  D  
Sbjct:   144 ASNNSGDGKEGFPNEDNTVTSEVTE-TPKNAKL-GMWSRTVYVPRSETSGT-ETPGPD-- 198

Query:   197 GALSAGIFVPRSGTPGNRTPAPGPDFWSWSPPEXXXXXXXXXXXLQMAEKSSVYPTPVNP 256
                    F   S TP   +     D  +   P            L+  + +     P   
Sbjct:   199 -------FW--SWTPPQGSEISSVDLQAVEKP--AEFPTLPNPVLEKDKSADSLSIPYES 247

Query:   257 VVEKARSVDILPIPFESKLSEPKPDPLLPPFQSLLGVEKEE--VSETNLET-----PSLE 309
             ++   R    +P PFES L E + +    P    L  E +   +S  N E       SL+
Sbjct:   248 MLSSERHSFTIP-PFES-LIEVRKEAETKPSSETLSTEHDLDLISSANAEEVARVLDSLD 305

Query:   310 EERDLGALFSXXXXXXXXXLDKVDE------LATRGINPDGSRWWKETGIEQRPD-GVVC 362
             E    G             ++K  +         RG+  DG   W++   E   D G   
Sbjct:   306 ESSTHGVSEDGLKWWKQTGVEKRPDGVVCRWTMIRGVTADGVVEWQDKYWEASDDFGFKE 365

Query:   363 RWTMTRGVSADEALEWQEKFWEAA--DELG--HKELGSEKSGRDATGNVWREFWTESMWQ 418
               +   G  A   + W+E FW  +   E G  H E  ++K G+   G+ W+E W E  + 
Sbjct:   366 LGSEKSGRDATGNV-WRE-FWRESMSQENGVVHMEKTADKWGKSGQGDEWQEKWWEH-YD 422

Query:   419 NQGLVHLEKTADKW---GKN-----GNGDEWQEKWWEHYDASGKAEKWAHKWCSIDPNTQ 470
               G    EK A KW    +N     G+   W E+W E YD  G + K+  KW        
Sbjct:   423 ATG--KSEKWAHKWCSIDRNTPLDAGHAHVWHERWGEKYDGQGGSTKYTDKWAERWVGDG 480

Query:   471 LDAGHAHVWHERWGEKYDGHGGSMKYTDKWAERCEGDGWSKWGDKWDENFDPNSHGVKQG 530
              D      W ++W E ++     +K  + W E   GD W++    W E  + +    K G
Sbjct:   481 WDK-----WGDKWDENFNPSAQGVKQGETWWEGKHGDRWNR---SWGEGHNGSGWVHKYG 532

Query:   531 ETWWAGKYGERWN-----RTWGERHNGSGWVHKYGKS 562
             ++      GE W+      TW E+    G+ H +  S
Sbjct:   533 KS----SSGEHWDTHVPQETWYEKFPHFGFFHCFDNS 565


>TAIR|locus:2035898
            symbol:AT1G42430 "AT1G42430" species:3702 "Arabidopsis
            thaliana" [GO:0003674 "molecular_function" evidence=ND] [GO:0005575
            "cellular_component" evidence=ND] [GO:0008150 "biological_process"
            evidence=ND] EMBL:CP002684 IPI:IPI00525753 RefSeq:NP_174971.5
            UniGene:At.39108 PRIDE:F4I9G2 DNASU:840847
            EnsemblPlants:AT1G42430.1 GeneID:840847 KEGG:ath:AT1G42430
            OMA:ANEKDWG Uniprot:F4I9G2
        Length = 426

 Score = 634 (228.2 bits), Expect = 4.8e-62, P = 4.8e-62
 Identities = 123/280 (43%), Positives = 171/280 (61%)

Query:   329 LDKVDELATR-GINPDGSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQEKFWEAAD 387
             +D ++E     G N DGS W++E+G +   +G  CRW+   G S D + EW E +WE +D
Sbjct:   134 IDLLNENVNEAGTNEDGSSWFRESGHDLGDNGYRCRWSRMGGRSHDGSSEWTETWWEKSD 193

Query:   388 ELGHKELGSEKSGRDATGNVWREFWTESMWQNQ--GLVHLEKTADKWGKNGNGDE-WQEK 444
               G+KELG EKSG+++ G+ W E W E + Q++   L  +E++A K  K+G  +  W EK
Sbjct:   194 WTGYKELGVEKSGKNSEGDSWWETWQEVLHQDEWSNLARIERSAQKQAKSGTENAGWYEK 253

Query:   445 WWEHYDASGKAEKWAHKWCSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYTDKWAERC 504
             WWE YDA G  EK AHK+  ++  +         W E+WGE YDG G  +K+TDKWAE  
Sbjct:   254 WWEKYDAKGWTEKGAHKYGRLNEQS---------WWEKWGEHYDGRGSVLKWTDKWAETE 304

Query:   505 EGDGWSKWGDKWDENFDPNSHGVKQGETWWAGKYGERWNRTWGERHNGSGWVHKYGKSSS 564
              G   +KWGDKW+E F  +  G +QGETW      +RW+RTWGE H G+G VHKYGKS++
Sbjct:   305 LG---TKWGDKWEEKFF-SGIGSRQGETWHVSPNSDRWSRTWGEEHFGNGKVHKYGKSTT 360

Query:   565 GELWDTHEQQETWYERFPHFGFYHCFDNSVQLREVRKPSE 604
             GE WD    +ET+YE  PH+G+     +S QL  ++ P E
Sbjct:   361 GESWDIVVDEETYYEAEPHYGWADVVGDSTQLLSIQ-PRE 399


>TAIR|locus:2179979
            symbol:KTF1 "AT5G04290" species:3702 "Arabidopsis
            thaliana" [GO:0000166 "nucleotide binding" evidence=ISS]
            [GO:0005634 "nucleus" evidence=ISM] [GO:0006306 "DNA methylation"
            evidence=IMP] [GO:0030422 "production of siRNA involved in RNA
            interference" evidence=IMP] InterPro:IPR017071 EMBL:CP002688
            GO:GO:0006357 GO:GO:0030422 GO:GO:0006306 GO:GO:0032784
            InterPro:IPR005824 SMART:SM00739 InterPro:IPR005100
            PANTHER:PTHR11125:SF7 Pfam:PF03439 IPI:IPI00544683
            RefSeq:NP_196049.1 UniGene:At.54715 ProteinModelPortal:F4JW79
            SMR:F4JW79 IntAct:F4JW79 PRIDE:F4JW79 EnsemblPlants:AT5G04290.1
            GeneID:830308 KEGG:ath:AT5G04290 OMA:SSWGKKD Uniprot:F4JW79
        Length = 1493

 Score = 141 (54.7 bits), Expect = 0.00031, Sum P(3) = 0.00030
 Identities = 64/253 (25%), Positives = 102/253 (40%)

Query:   331 KVDELATRGINPDGSRWWKETGIEQRPDGVVCRWTMTRGVSADEALEWQEKFWEAADELG 390
             K D  A+ G   DG  W K+     + DG         G   D    W++KF +     G
Sbjct:   947 KGDGAASWGKKDDGGSWGKKDD-GNKDDGGSSWGKKDDGQKDDGGSSWEKKF-DGGSSWG 1004

Query:   391 HKELGSEKSGR-DATGNVW-REFWTESMW--QNQGLVHLEKTAD---KWGKNGNGDE-WQ 442
              K+ G    G+ D  G++W ++    S W  ++ G     K  D    WGK  +G+  W 
Sbjct:  1005 KKDDGGSSWGKKDDGGSLWGKKDDGGSSWGKEDDGGSLWGKKDDGESSWGKKDDGESSWG 1064

Query:   443 EKWWEHYDASGKAEKWAHKWCSIDPNTQLDAGHAHVWHERWGEKYDGHGGSMKYTDKWAE 502
             +K  +   + GK ++  +   + D   +   G       R G    G G S   ++  A 
Sbjct:  1065 KKD-DGGSSWGKKDEGGYSEQTFDRGGR-GFGGRRGGGRRGGRDQFGRGSSFGNSEDPAP 1122

Query:   503 RCEGDGWSKWGDKWDENFDPNSHGVKQ----GETWWAGKYGERWNRTWGERHNGSGWVHK 558
               +  G S WG K D +   +S G +     G +W  GK       +WG++++GSG    
Sbjct:  1123 WSKPSGGSSWG-KQDGDGGGSSWGKENDAGGGSSW--GKQDNGVGSSWGKQNDGSGGGSS 1179

Query:   559 YGKSSS---GELW 568
             +GK +    G  W
Sbjct:  1180 WGKQNDAGGGSSW 1192

 Score = 42 (19.8 bits), Expect = 0.00031, Sum P(3) = 0.00030
 Identities = 14/56 (25%), Positives = 28/56 (50%)

Query:   145 GSVVKNGESSGT-AEVSRFVKKNSESSGAAEISPFVKNSESNGTAEVPERDSSGAL 199
             G+V   G++S +  E S + K+ + +S  A++  +  +  S+G  +  E    G L
Sbjct:   784 GTVSGWGDTSASNVEASSWEKQGASTSNVADLGSWGTHGGSSGGNKQDEDSVWGKL 839

 Score = 39 (18.8 bits), Expect = 0.00031, Sum P(3) = 0.00030
 Identities = 10/20 (50%), Positives = 13/20 (65%)

Query:   101 KKSEEFSKILDVSKEERDRI 120
             K S  F K  D+++EE DRI
Sbjct:    92 KSSFVFPKEEDLNEEEFDRI 111

 Score = 38 (18.4 bits), Expect = 0.00075, Sum P(3) = 0.00075
 Identities = 11/34 (32%), Positives = 17/34 (50%)

Query:   167 SESSGAAEISPFVKNSESNGTAEVPERDSSGALS 200
             SESS   E S + K   S+G +    +D + + S
Sbjct:   843 SESSQKKEESSWGKKGGSDGESSWGNKDGNSSAS 876


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.311   0.130   0.417    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a
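
The bit scores printed next to each raw score in this report (for example "Score = 1612 (572.5 bits)") appear to be consistent with the standard Karlin-Altschul conversion, bits = (Lambda * S - ln K) / ln 2, evaluated with the Lambda and K from the Q=9,R=2 row above. That is an observation from the numbers in this report, not a statement taken from the WU-BLAST documentation. A minimal sketch, where bit_score is a hypothetical helper name:

```python
import math

# Assumption: the gapped parameters from the "Q=9,R=2" row of this report.
LAMBDA_GAPPED = 0.244
K_GAPPED = 0.0300

def bit_score(raw_score, lam=LAMBDA_GAPPED, k=K_GAPPED):
    """Convert a raw alignment score to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)

if __name__ == "__main__":
    # Raw scores taken from the HSPs listed in this report.
    for raw in (1612, 634, 383, 141):
        print(raw, round(bit_score(raw), 1))
```

Bit scores are normalized for the scoring system, so they can be compared across searches that used different matrices or gap penalties, which raw scores cannot.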

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      622       577   0.00081  120 3  11 23  0.37    35
                                                     36  0.45    37


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  3
  No. of states in DFA:  629 (67 KB)
  Total size of DFA:  431 KB (2199 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  48.94u 0.10s 49.04t   Elapsed:  00:00:02
  Total cpu time:  48.94u 0.10s 49.04t   Elapsed:  00:00:02
  Start:  Sat May 11 09:55:43 2013   End:  Sat May 11 09:55:45 2013
