BLAST Results

Query Summary

Your job contains 1 sequence.

Parameters
Threshold: 0.001
Maximum number of alignments shown: 100
BLAST filter: on

Query Sequence

>023774
MERSTPVRKPHTSTADLLVWSETPPSDSPAQASSTRSSVRGQPSDGISKVVFGGQVTDEE
VESLNRRKPCSGYKMKEMTGSGIFAAGAENDESESGSANPTPNNKTGLRMYQQAIAGISH
ISFGEEDSISPKKPTTLPEVAKQRELSGTLESESEAKLKKQISDAKSKELSGHDIFAPPP
EILPRPAPAGDQSSISSTEEPVMKTSKKIYDKKFSELSGNDIFKGDVPPSSAEKPLSVAK
LREMSGSNIFADGKVESRDYLGGVRKPPGGESSIALV

High Scoring Gene Products

Symbol, full name                  Information                        P value
AT1G35780                          protein from Arabidopsis thaliana  1.7e-84
AT4G39860                          protein from Arabidopsis thaliana  4.9e-53
AT2G22270                          protein from Arabidopsis thaliana  7.8e-23
ALKBH2, homolog of E. coli alkB    protein from Arabidopsis thaliana  4.9e-05


Raw Blast Data

BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]

Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.

Reference:  Gish, W. (1996-2006) http://blast.wustl.edu

Query=  023774
        (277 letters)

Database:  go_20130330-seqdb.fasta
           368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done

                                                                     Smallest
                                                                       Sum
                                                              High  Probability
Sequences producing High-scoring Segment Pairs:              Score  P(N)      N

TAIR|locus:2011375 - symbol:AT1G35780 "AT1G35780" species...   846  1.7e-84   1
TAIR|locus:2140000 - symbol:AT4G39860 "AT4G39860" species...   549  4.9e-53   1
TAIR|locus:2060425 - symbol:AT2G22270 "AT2G22270" species...   264  7.8e-23   1
TAIR|locus:2060430 - symbol:ALKBH2 "homolog of E. coli al...   121  4.9e-05   1
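
The summary lines above follow a fixed layout (identifier, symbol, truncated description, high score, smallest sum probability P(N), and N), so they can be pulled into records with a regular expression. The following is a minimal sketch, not part of the BLAST output: the names HIT_LINE and parse_hit_lines are hypothetical, and the pattern assumes every hit line contains " - symbol:" and ends with the three numeric columns, as the four lines in this report do.

```python
import re

# Assumed layout (taken from this report only):
#   <id> - symbol:<symbol> <truncated description>   <score>  <P(N)>  <N>
HIT_LINE = re.compile(
    r"^(?P<id>\S+)\s+-\s+symbol:(?P<symbol>\S+)\s.*?\s+"
    r"(?P<score>\d+)\s+(?P<p>[0-9.]+(?:[eE][+-]?\d+)?)\s+(?P<n>\d+)\s*$"
)


def parse_hit_lines(text):
    """Return one dict per summary line that matches the assumed layout."""
    hits = []
    for line in text.splitlines():
        match = HIT_LINE.match(line.strip())
        if match:
            hits.append({
                "id": match.group("id"),
                "symbol": match.group("symbol"),
                "score": int(match.group("score")),
                "p": float(match.group("p")),
                "n": int(match.group("n")),
            })
    return hits


# The four summary lines copied from this report.
SAMPLE = '''
TAIR|locus:2011375 - symbol:AT1G35780 "AT1G35780" species...   846  1.7e-84   1
TAIR|locus:2140000 - symbol:AT4G39860 "AT4G39860" species...   549  4.9e-53   1
TAIR|locus:2060425 - symbol:AT2G22270 "AT2G22270" species...   264  7.8e-23   1
TAIR|locus:2060430 - symbol:ALKBH2 "homolog of E. coli al...   121  4.9e-05   1
'''

hits = parse_hit_lines(SAMPLE)
for hit in hits:
    print(hit["symbol"], hit["score"], hit["p"])
```

Lines that do not match the pattern (headers, blank lines) are skipped silently, which may or may not be the desired behavior for other reports.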


>TAIR|locus:2011375
            symbol:AT1G35780 "AT1G35780" species:3702 "Arabidopsis
            thaliana" [GO:0003674 "molecular_function" evidence=ND] [GO:0005634
            "nucleus" evidence=ISM] [GO:0008150 "biological_process"
            evidence=ND] EMBL:CP002684 GenomeReviews:CT485782_GR EMBL:AC021198
            EMBL:AF428333 EMBL:AY063902 EMBL:AY122925 IPI:IPI00542942
            PIR:A86480 RefSeq:NP_564463.1 UniGene:At.25479 UniGene:At.74755
            PaxDb:Q9LP18 PRIDE:Q9LP18 EnsemblPlants:AT1G35780.1 GeneID:840483
            KEGG:ath:AT1G35780 TAIR:At1g35780 eggNOG:NOG327187
            HOGENOM:HOG000239183 InParanoid:Q9LP18 OMA:SGNNVFK PhylomeDB:Q9LP18
            ProtClustDB:CLSN2685884 ArrayExpress:Q9LP18 Genevestigator:Q9LP18
            InterPro:IPR025131 Pfam:PF13266 Uniprot:Q9LP18
        Length = 286

 Score = 846 (302.9 bits), Expect = 1.7e-84, P = 1.7e-84
 Identities = 182/289 (62%), Positives = 216/289 (74%)

Query:     1 MERSTPVRKPHTSTADLLVWSETPPSDSPAQASSTRSSVRG-QPSDGISKVVFGGQVTDE 59
             ME++TPVRKPH STADLL W E  P +SPA  SS RS+ R  QPSDGISKVVFGGQVTDE
Sbjct:     1 MEKNTPVRKPHMSTADLLTWPENQPFESPAAVSS-RSAARSHQPSDGISKVVFGGQVTDE 59

Query:    60 EVESLNRRKPCSGYKMKEMTGSGIFAAGAENDESESGSANPTPNNKTGLRMYQQAIAGI- 118
             EVESLN+RKPCS YKMKE+TGSGIF+   END+SE  SAN   N K+  R +QQ  A I 
Sbjct:    60 EVESLNKRKPCSNYKMKEITGSGIFSVYEENDDSELASANSATNGKS--RTFQQPPAAIM 117

Query:   119 SHISFGEEDSISPKKPTTLPEVAKQRELSGTLESESEAKLKKQISDAKSKELSGHDIFX- 177
             SHISFGEE+ ++PKKP T+PEVAKQRELSGTLE +S+AKL KQ SDAK KELSGH+IF  
Sbjct:   118 SHISFGEEEIVTPKKPATVPEVAKQRELSGTLEYQSDAKLNKQFSDAKCKELSGHNIFAP 177

Query:   178 ---XXXXXXXXXXXXGDQSSISSTE-EP--VMKTSKKIYDKKFSELSGNDIFKGDVP-PS 230
                             D   +  ++ +P   +KT+KKI D+KF++LSGN++FK DV  PS
Sbjct:   178 PPEIKLRPTVRALAYKDNFDLGESDTKPDGELKTAKKIADRKFTDLSGNNVFKSDVSSPS 237

Query:   231 SA--EKPLSVAKLREMSGSNIFADGKVESRDYLGGVRKPPGGESSIALV 277
             SA  E+ LS AKL+E+SG++IFAD K +SRDY GGVRKPPGGESSIALV
Sbjct:   238 SATAERLLSTAKLKEISGNDIFADAKAQSRDYFGGVRKPPGGESSIALV 286


>TAIR|locus:2140000
            symbol:AT4G39860 "AT4G39860" species:3702 "Arabidopsis
            thaliana" [GO:0003674 "molecular_function" evidence=ND] [GO:0005634
            "nucleus" evidence=ISM;IDA] [GO:0008150 "biological_process"
            evidence=ND] [GO:0005737 "cytoplasm" evidence=IDA] [GO:0000280
            "nuclear division" evidence=RCA] [GO:0000911 "cytokinesis by cell
            plate formation" evidence=RCA] [GO:0006342 "chromatin silencing"
            evidence=RCA] [GO:0007000 "nucleolus organization" evidence=RCA]
            [GO:0008283 "cell proliferation" evidence=RCA] [GO:0016572 "histone
            phosphorylation" evidence=RCA] [GO:0051567 "histone H3-K9
            methylation" evidence=RCA] GO:GO:0005634 GO:GO:0005737
            EMBL:CP002687 EMBL:AL161596 EMBL:AL035708 ProtClustDB:CLSN2685884
            InterPro:IPR025131 Pfam:PF13266 IPI:IPI00547768 PIR:T06092
            RefSeq:NP_195696.1 UniGene:At.27171 STRING:Q9SMR7 PRIDE:Q9SMR7
            EnsemblPlants:AT4G39860.1 GeneID:830145 KEGG:ath:AT4G39860
            TAIR:At4g39860 InParanoid:Q9SMR7 OMA:GYKLKEM PhylomeDB:Q9SMR7
            Genevestigator:Q9SMR7 Uniprot:Q9SMR7
        Length = 299

 Score = 549 (198.3 bits), Expect = 4.9e-53, P = 4.9e-53
 Identities = 124/256 (48%), Positives = 163/256 (63%)

Query:     1 MERSTPVRKPHTSTADLLVWSETPPSDSPAQASSTRSSVRGQPSDGISKVVFGGQVTDEE 60
             MER+TPVR PHTSTADLL WSETPP    +  S+ RS    QPSDGISK++ GGQ+TDEE
Sbjct:     1 MERNTPVRNPHTSTADLLSWSETPPPPHHSTPSAARSH---QPSDGISKILGGGQITDEE 57

Query:    61 VESLNRRKPCSGYKMKEMTGSGIFAAGAENDESESGS-ANPTPNNKTGLRMYQQAIAGIS 119
              +SLN+ K CSGYK+KEMTGSGIF      D+ + GS ++ T + KTGLR YQQ + G+S
Sbjct:    58 AQSLNKLKNCSGYKLKEMTGSGIFT-----DKGKVGSESDATTDPKTGLRYYQQTLNGMS 112

Query:   120 HISFGEEDSISPKKPTTLPEVAKQRELSGTLESESEAKLKKQISDAKSKELSGHDIFXXX 179
              ISF  + ++SPKKPTTL EVAKQRELSG L +E++ K  KQIS AK +E+SGHDIF   
Sbjct:   113 QISFSADGNVSPKKPTTLTEVAKQRELSGNLLTEADLKSNKQISSAKIEEISGHDIFAPP 172

Query:   180 XXXXXXXXXXGDQSSISSTE--EPV---MKTSKKIYDKKFSELSGNDIFKGDVPPSSAEK 234
                         Q +  + +  EP    ++TS K+ +    +   N +F  + P     K
Sbjct:   173 SEIQPRSLVAAQQEARGNRDMGEPAPRNLRTSVKVSNPAGGQ--SNILFSEE-PVVKTSK 229

Query:   235 PLSVAKLREMSGSNIF 250
              +   K +E++G+ IF
Sbjct:   230 KIHNQKFQELTGNGIF 245

 Score = 332 (121.9 bits), Expect = 4.9e-30, P = 4.9e-30
 Identities = 66/88 (75%), Positives = 76/88 (86%)

Query:   190 GDQSSISSTEEPVMKTSKKIYDKKFSELSGNDIFKGDVPPSSAEKPLSVAKLREMSGSNI 249
             G QS+I  +EEPV+KTSKKI+++KF EL+GN IFKGD  P SA+K LS AKLREMSG+NI
Sbjct:   212 GGQSNILFSEEPVVKTSKKIHNQKFQELTGNGIFKGDESPGSADKQLSSAKLREMSGNNI 271

Query:   250 FADGKVESRDYLGGVRKPPGGESSIALV 277
             FADGK ESRDY GGVRKPPGGESSI+LV
Sbjct:   272 FADGKSESRDYFGGVRKPPGGESSISLV 299


>TAIR|locus:2060425
            symbol:AT2G22270 "AT2G22270" species:3702 "Arabidopsis
            thaliana" [GO:0003674 "molecular_function" evidence=ND] [GO:0005634
            "nucleus" evidence=ISM] [GO:0008150 "biological_process"
            evidence=ND] EMBL:CP002685 GenomeReviews:CT485783_GR EMBL:AC007168
            HOGENOM:HOG000239183 ProtClustDB:CLSN2685884 InterPro:IPR025131
            Pfam:PF13266 IPI:IPI00539460 PIR:G84610 RefSeq:NP_565531.1
            UniGene:At.21395 PRIDE:Q9SID9 EnsemblPlants:AT2G22270.1
            GeneID:816760 KEGG:ath:AT2G22270 TAIR:At2g22270 eggNOG:NOG309446
            InParanoid:Q9SID9 OMA:RIHYHQD PhylomeDB:Q9SID9 ArrayExpress:Q9SID9
            Genevestigator:Q9SID9 Uniprot:Q9SID9
        Length = 328

 Score = 264 (98.0 bits), Expect = 7.8e-23, P = 7.8e-23
 Identities = 74/192 (38%), Positives = 101/192 (52%)

Query:    93 SESGSANPTPNNKTGLRMYQQAIAGISHISFGEEDSISPKKPTTLPEVAKQRELSG-TLE 151
             S SG  N TP   T L    +    +S     + DS   KK  +     K + +SG  + 
Sbjct:   141 SFSGEENVTPKKPTTLNEAAKQ-KELSRTVETQADSKCKKKQISN---TKNKAMSGHDIF 196

Query:   152 SESEAKLKKQISDAKSKELSGH---DIFXXXXXXXXXXXXXGDQSSISSTEEPVMKTSKK 208
             +  E++ ++    A   E+ G+   +               G  S+   +EE V+K+SKK
Sbjct:   197 ASPESQPRRLFGGATQSEVKGNKNTEESAPRSSRASVKTSNGQSSNRLFSEEHVVKSSKK 256

Query:   209 IYDKK--FSELSGNDIFKGD-VPPSSAEKPLSVAKLREMSGSNIFADGKVESRDYLGGVR 265
             I+++K  F  L+ N IFK D +PP  +EK  S AK REMSG NIFADGK E RDY GG R
Sbjct:   257 IHNQKSQFQGLTSNGIFKSDKIPPGYSEKMQSSAKKREMSGHNIFADGKSEYRDYYGGAR 316

Query:   266 KPPGGESSIALV 277
             +PPGGESSI+LV
Sbjct:   317 RPPGGESSISLV 328

 Score = 214 (80.4 bits), Expect = 1.7e-17, P = 1.7e-17
 Identities = 60/180 (33%), Positives = 97/180 (53%)

Query:    81 SGIFAAGAENDESESGSANPTPNNKTGLRMYQQAIAGISHISFGEEDSISPKKPTTLPEV 140
             S I  +G EN  +     +  PN++T +  +Q      S ISF  E++++PKKPTTL E 
Sbjct:   104 SQISFSGEENATTPMNGKDD-PNHQTRIHYHQDQR---SQISFSGEENVTPKKPTTLNEA 159

Query:   141 AKQRELSGTLESESEAKLKK-QISDAKSKELSGHDIFXXXXXXXXXXXXXGDQSSIS--- 196
             AKQ+ELS T+E+++++K KK QIS+ K+K +SGHDIF               QS +    
Sbjct:   160 AKQKELSRTVETQADSKCKKKQISNTKNKAMSGHDIFASPESQPRRLFGGATQSEVKGNK 219

Query:   197 STEEPVMKTSKKIYDKKFSELSGNDIFKGD-VPPSSAEKPLSVAKLREMSGSNIFADGKV 255
             +TEE   ++S+    K  +  S N +F  + V  SS +     ++ + ++ + IF   K+
Sbjct:   220 NTEESAPRSSRASV-KTSNGQSSNRLFSEEHVVKSSKKIHNQKSQFQGLTSNGIFKSDKI 278

 Score = 199 (75.1 bits), Expect = 1.0e-15, P = 1.0e-15
 Identities = 70/177 (39%), Positives = 97/177 (54%)

Query:    10 PHTSTADLLVWSETPPSDSPAQASSTRSSVRGQPSDGISKVVFGG-QVTDEEVESLN--- 65
             PH STADLL WSE    D    A+  RS+   QPSDG++ V+ GG Q+T+ E +SLN   
Sbjct:     8 PHHSTADLLSWSEIRRPDYSTAAN--RSN---QPSDGMNDVLGGGGQITNAETKSLNTNV 62

Query:    66 -RRKPCSGYKMKEMTGSGIFAAGAENDESESGSANPTPNNKTGLRMYQQAIAGISHISF- 123
               RK CSG+K+KEMTGS IF        S+ G  +P  N++T +  +Q  +   S ISF 
Sbjct:    63 SHRKNCSGHKLKEMTGSDIF--------SDDGKYDP--NHQTRIHYHQDQL---SQISFS 109

Query:   124 GEEDSISPKK----P---TTLPEVAKQR-ELSGTLESESEAKLKKQISDA-KSKELS 171
             GEE++ +P      P   T +     QR ++S + E     K    +++A K KELS
Sbjct:   110 GEENATTPMNGKDDPNHQTRIHYHQDQRSQISFSGEENVTPKKPTTLNEAAKQKELS 166


>TAIR|locus:2060430
            symbol:ALKBH2 "homolog of E. coli alkB" species:3702
            "Arabidopsis thaliana" [GO:0005634 "nucleus" evidence=ISM]
            [GO:0008152 "metabolic process" evidence=ISS] [GO:0016491
            "oxidoreductase activity" evidence=ISS] [GO:0006281 "DNA repair"
            evidence=IDA] [GO:0035514 "DNA demethylase activity" evidence=IDA]
            InterPro:IPR005123 PROSITE:PS51471 EMBL:CP002685
            GenomeReviews:CT485783_GR GO:GO:0006281 GO:GO:0016706
            eggNOG:COG3145 Pfam:PF13532 HOGENOM:HOG000207105 EMBL:AC007168
            InterPro:IPR025131 Pfam:PF13266 IPI:IPI00524253 PIR:F84610
            RefSeq:NP_565530.1 UniGene:At.39463 ProteinModelPortal:Q9SIE0
            SMR:Q9SIE0 DNASU:816759 EnsemblPlants:AT2G22260.1 GeneID:816759
            KEGG:ath:AT2G22260 TAIR:At2g22260 InParanoid:Q9SIE0 OMA:SFGCERD
            PhylomeDB:Q9SIE0 ProtClustDB:CLSN2688336 ArrayExpress:Q9SIE0
            Genevestigator:Q9SIE0 GO:GO:0035514 Uniprot:Q9SIE0
        Length = 314

 Score = 121 (47.7 bits), Expect = 4.9e-05, P = 4.9e-05
 Identities = 28/60 (46%), Positives = 40/60 (66%)

Query:    33 SSTRSSVRGQPS-DGISKVVFGGQVTDEEVESLNRRKPCSGYKMKEMTGSGIFAAGAEND 91
             +ST ++   QPS DGIS     GQ+T+EE ESL  +K CSG+K+KE+T S  F+   ++D
Sbjct:     6 NSTAANRSNQPSSDGISD----GQITNEEAESLINKKNCSGHKLKEVTDSDTFSDNGKDD 61


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.304   0.124   0.336    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      277       264   0.00092  114 3  11 23  0.37    35
                                                     32  0.50    36
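
The bit scores printed next to each raw score can be checked against the Lambda and K values listed above using the standard Karlin-Altschul conversion, bits = (Lambda * S - ln K) / ln 2. The sketch below is an illustration, not part of the BLAST output. It assumes the gapped values on the Q=9,R=2 row (Lambda = 0.244, K = 0.0300) are the ones applied; that assumption is inferred from the arithmetic, since those values reproduce the reported bits for all four top scores. The relation P = 1 - exp(-E) is the usual Poisson conversion and explains why P and Expect print identically at these magnitudes. The function names are hypothetical.

```python
import math


def bit_score(raw_score, lam, k):
    """Convert a raw alignment score to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def p_from_e(expect):
    """Poisson P-value for at least one hit: 1 - exp(-E)."""
    return -math.expm1(-expect)


# Assumption: the gapped row (Q=9,R=2) of the Parameters table applies.
LAMBDA_GAPPED = 0.244
K_GAPPED = 0.0300

# Raw scores and the bit scores printed in this report.
reported = {846: 302.9, 549: 198.3, 264: 98.0, 121: 47.7}

for raw, bits in reported.items():
    computed = bit_score(raw, LAMBDA_GAPPED, K_GAPPED)
    print(f"S={raw}: computed {computed:.1f} bits, reported {bits} bits")
```

For very small E, 1 - exp(-E) is approximately E, so the two statistics only diverge visibly as E approaches 1.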


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  4
  No. of states in DFA:  595 (63 KB)
  Total size of DFA:  164 KB (2096 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  25.49u 0.11s 25.60t   Elapsed:  00:00:02
  Total cpu time:  25.49u 0.11s 25.60t   Elapsed:  00:00:02
  Start:  Fri May 10 20:22:16 2013   End:  Fri May 10 20:22:18 2013
