BLAST Results

Query Summary

Your job contains 1 sequence.

Parameters
Threshold: 0.001
Maximum number of alignments shown: 100
BLAST filter: on

Query Sequence

>026032
MATPVRKSHVSTSDLLTWPEAPSSDSSHPPASAPRSHQPSDGVSKVLFGGQITDEEAQSL
NKKKPCSGYKLKEINGSGIFVANGENGASESDAGNRNNRTSVRVYQQAMNGISQISFSAE
ETVSPKKPTSVPEVAKQRELSGSLQSESDLKTKKQISDAKFKEISGHDIFSPAPEIQPRS
LAAARSLESKESKDMGEPAPRNVRTSVKVSNVSYFCYARNFLFSLYSFHLNLFFCFLKRL
HCFY

High Scoring Gene Products

Symbol, full name Information P value
AT4G39860 protein from Arabidopsis thaliana 6.4e-60
AT1G35780 protein from Arabidopsis thaliana 4.2e-47
AT2G22270 protein from Arabidopsis thaliana 4.3e-22
ALKBH2
homolog of E. coli alkB
protein from Arabidopsis thaliana 3.8e-05

Back to top

Raw Blast Data

BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]

Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.

Reference:  Gish, W. (1996-2006) http://blast.wustl.edu

Query=  026032
        (244 letters)

Database:  go_20130330-seqdb.fasta
           368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done

                                                                     Smallest
                                                                       Sum
                                                              High  Probability
Sequences producing High-scoring Segment Pairs:              Score  P(N)      N

TAIR|locus:2140000 - symbol:AT4G39860 "AT4G39860" species...   614  6.4e-60   1
TAIR|locus:2011375 - symbol:AT1G35780 "AT1G35780" species...   493  4.2e-47   1
TAIR|locus:2060425 - symbol:AT2G22270 "AT2G22270" species...   257  4.3e-22   1
TAIR|locus:2060430 - symbol:ALKBH2 "homolog of E. coli al...   120  3.8e-05   1


>TAIR|locus:2140000 [details] [associations]
            symbol:AT4G39860 "AT4G39860" species:3702 "Arabidopsis
            thaliana" [GO:0003674 "molecular_function" evidence=ND] [GO:0005634
            "nucleus" evidence=ISM;IDA] [GO:0008150 "biological_process"
            evidence=ND] [GO:0005737 "cytoplasm" evidence=IDA] [GO:0000280
            "nuclear division" evidence=RCA] [GO:0000911 "cytokinesis by cell
            plate formation" evidence=RCA] [GO:0006342 "chromatin silencing"
            evidence=RCA] [GO:0007000 "nucleolus organization" evidence=RCA]
            [GO:0008283 "cell proliferation" evidence=RCA] [GO:0016572 "histone
            phosphorylation" evidence=RCA] [GO:0051567 "histone H3-K9
            methylation" evidence=RCA] GO:GO:0005634 GO:GO:0005737
            EMBL:CP002687 EMBL:AL161596 EMBL:AL035708 ProtClustDB:CLSN2685884
            InterPro:IPR025131 Pfam:PF13266 IPI:IPI00547768 PIR:T06092
            RefSeq:NP_195696.1 UniGene:At.27171 STRING:Q9SMR7 PRIDE:Q9SMR7
            EnsemblPlants:AT4G39860.1 GeneID:830145 KEGG:ath:AT4G39860
            TAIR:At4g39860 InParanoid:Q9SMR7 OMA:GYKLKEM PhylomeDB:Q9SMR7
            Genevestigator:Q9SMR7 Uniprot:Q9SMR7
        Length = 299

 Score = 614 (221.2 bits), Expect = 6.4e-60, P = 6.4e-60
 Identities = 129/209 (61%), Positives = 155/209 (74%)

Query:     3 TPVRKSHVSTSDLLTWXXXXXXXXXXXXXXXXXXXXXXDGVSKVLFGGQITDEEAQSLNK 62
             TPVR  H ST+DLL+W                      DG+SK+L GGQITDEEAQSLNK
Sbjct:     5 TPVRNPHTSTADLLSWSETPPPPHHSTPSAARSHQPS-DGISKILGGGQITDEEAQSLNK 63

Query:    63 KKPCSGYKLKEINGSGIFVANGENGASESDAGNRNNRTSVRVYQQAMNGISQISFSAEET 122
              K CSGYKLKE+ GSGIF   G+ G SESDA   + +T +R YQQ +NG+SQISFSA+  
Sbjct:    64 LKNCSGYKLKEMTGSGIFTDKGKVG-SESDA-TTDPKTGLRYYQQTLNGMSQISFSADGN 121

Query:   123 VSPKKPTSVPEVAKQRELSGSLQSESDLKTKKQISDAKFKEISGHDIFSPAPEIQPRSLA 182
             VSPKKPT++ EVAKQRELSG+L +E+DLK+ KQIS AK +EISGHDIF+P  EIQPRSL 
Sbjct:   122 VSPKKPTTLTEVAKQRELSGNLLTEADLKSNKQISSAKIEEISGHDIFAPPSEIQPRSLV 181

Query:   183 AARSLESKESKDMGEPAPRNVRTSVKVSN 211
             AA+  E++ ++DMGEPAPRN+RTSVKVSN
Sbjct:   182 AAQQ-EARGNRDMGEPAPRNLRTSVKVSN 209

 Score = 115 (45.5 bits), Expect = 0.00014, P = 0.00014
 Identities = 50/142 (35%), Positives = 71/142 (50%)

Query:    47 LFGGQITDEEAQSLNKKKPCSGYKLKEINGSGIFVANGENG-----ASESDA-GNR---- 96
             L G  +T+ + +S NK+   S  K++EI+G  IF    E       A++ +A GNR    
Sbjct:   139 LSGNLLTEADLKS-NKQ--ISSAKIEEISGHDIFAPPSEIQPRSLVAAQQEARGNRDMGE 195

Query:    97 ----NNRTSVRVYQQAMNGISQISFSAEETVSPKKPTSVPEVAKQRELSGS---LQSESD 149
                 N RTSV+V   A  G S I FS E  V   K        K +EL+G+      ES 
Sbjct:   196 PAPRNLRTSVKVSNPA-GGQSNILFSEEPVVKTSKKI---HNQKFQELTGNGIFKGDESP 251

Query:   150 LKTKKQISDAKFKEISGHDIFS 171
                 KQ+S AK +E+SG++IF+
Sbjct:   252 GSADKQLSSAKLREMSGNNIFA 273


>TAIR|locus:2011375 [details] [associations]
            symbol:AT1G35780 "AT1G35780" species:3702 "Arabidopsis
            thaliana" [GO:0003674 "molecular_function" evidence=ND] [GO:0005634
            "nucleus" evidence=ISM] [GO:0008150 "biological_process"
            evidence=ND] EMBL:CP002684 GenomeReviews:CT485782_GR EMBL:AC021198
            EMBL:AF428333 EMBL:AY063902 EMBL:AY122925 IPI:IPI00542942
            PIR:A86480 RefSeq:NP_564463.1 UniGene:At.25479 UniGene:At.74755
            PaxDb:Q9LP18 PRIDE:Q9LP18 EnsemblPlants:AT1G35780.1 GeneID:840483
            KEGG:ath:AT1G35780 TAIR:At1g35780 eggNOG:NOG327187
            HOGENOM:HOG000239183 InParanoid:Q9LP18 OMA:SGNNVFK PhylomeDB:Q9LP18
            ProtClustDB:CLSN2685884 ArrayExpress:Q9LP18 Genevestigator:Q9LP18
            InterPro:IPR025131 Pfam:PF13266 Uniprot:Q9LP18
        Length = 286

 Score = 493 (178.6 bits), Expect = 4.2e-47, P = 4.2e-47
 Identities = 108/227 (47%), Positives = 141/227 (62%)

Query:     3 TPVRKSHVSTSDLLTWXXXXXXXXXXXXXXXXXXXXX--XDGVSKVLFGGQITDEEAQSL 60
             TPVRK H+ST+DLLTW                        DG+SKV+FGGQ+TDEE +SL
Sbjct:     5 TPVRKPHMSTADLLTWPENQPFESPAAVSSRSAARSHQPSDGISKVVFGGQVTDEEVESL 64

Query:    61 NKKKPCSGYKLKEINGSGIFVANGENGASESDAGNRNNRTSVRVYQQAMNGI-SQISFSA 119
             NK+KPCS YK+KEI GSGIF    EN  SE  + N       R +QQ    I S ISF  
Sbjct:    65 NKRKPCSNYKMKEITGSGIFSVYEENDDSELASANSATNGKSRTFQQPPAAIMSHISFGE 124

Query:   120 EETVSPKKPTSVPEVAKQRELSGSLQSESDLKTKKQISDAKFKEISGHDIFSPAPEIQPR 179
             EE V+PKKP +VPEVAKQRELSG+L+ +SD K  KQ SDAK KE+SGH+IF+P PEI+ R
Sbjct:   125 EEIVTPKKPATVPEVAKQRELSGTLEYQSDAKLNKQFSDAKCKELSGHNIFAPPPEIKLR 184

Query:   180 SLAAARSLESKESKDMGEPAPR---NVRTSVKVSNVSYFCYARNFLF 223
                  R+L  K++ D+GE   +    ++T+ K+++  +   + N +F
Sbjct:   185 P--TVRALAYKDNFDLGESDTKPDGELKTAKKIADRKFTDLSGNNVF 229


>TAIR|locus:2060425 [details] [associations]
            symbol:AT2G22270 "AT2G22270" species:3702 "Arabidopsis
            thaliana" [GO:0003674 "molecular_function" evidence=ND] [GO:0005634
            "nucleus" evidence=ISM] [GO:0008150 "biological_process"
            evidence=ND] EMBL:CP002685 GenomeReviews:CT485783_GR EMBL:AC007168
            HOGENOM:HOG000239183 ProtClustDB:CLSN2685884 InterPro:IPR025131
            Pfam:PF13266 IPI:IPI00539460 PIR:G84610 RefSeq:NP_565531.1
            UniGene:At.21395 PRIDE:Q9SID9 EnsemblPlants:AT2G22270.1
            GeneID:816760 KEGG:ath:AT2G22270 TAIR:At2g22270 eggNOG:NOG309446
            InParanoid:Q9SID9 OMA:RIHYHQD PhylomeDB:Q9SID9 ArrayExpress:Q9SID9
            Genevestigator:Q9SID9 Uniprot:Q9SID9
        Length = 328

 Score = 257 (95.5 bits), Expect = 4.3e-22, P = 4.3e-22
 Identities = 63/133 (47%), Positives = 87/133 (65%)

Query:    83 NGENGASESDAG--NRNNRTSVRVYQQAMNGISQISFSAEETVSPKKPTSVPEVAKQREL 140
             +GE  A+    G  + N++T +  +Q      SQISFS EE V+PKKPT++ E AKQ+EL
Sbjct:   109 SGEENATTPMNGKDDPNHQTRIHYHQDQR---SQISFSGEENVTPKKPTTLNEAAKQKEL 165

Query:   141 SGSLQSESDLKTKK-QISDAKFKEISGHDIFSPAPEIQPRSL-AAARSLESKESKDMGEP 198
             S ++++++D K KK QIS+ K K +SGHDIF+ +PE QPR L   A   E K +K+  E 
Sbjct:   166 SRTVETQADSKCKKKQISNTKNKAMSGHDIFA-SPESQPRRLFGGATQSEVKGNKNTEES 224

Query:   199 APRNVRTSVKVSN 211
             APR+ R SVK SN
Sbjct:   225 APRSSRASVKTSN 237

 Score = 146 (56.5 bits), Expect = 2.3e-08, P = 2.3e-08
 Identities = 55/196 (28%), Positives = 86/196 (43%)

Query:     9 HVSTSDLLTWXXXXXXXXXXXXXXXXXXXXXXDGVSKVLFGG-QITDEEAQSLNK----K 63
             H ST+DLL+W                      DG++ VL GG QIT+ E +SLN     +
Sbjct:     9 HHSTADLLSWSEIRRPDYSTAANRSNQPS---DGMNDVLGGGGQITNAETKSLNTNVSHR 65

Query:    64 KPCSGYKLKEINGSGIFVANGENGASESDAGNRNNRTSVRVYQQAMNGISQISFSAEETV 123
             K CSG+KLKE+ GS IF  +G+         + N++T +  +Q  +   SQISFS EE  
Sbjct:    66 KNCSGHKLKEMTGSDIFSDDGKY--------DPNHQTRIHYHQDQL---SQISFSGEENA 114

Query:   124 S-PKKPTSVPEVAKQRELSGSLQSESDLKTKKQISDAK---FKEISGHDIFSPAPEIQPR 179
             + P      P    +       +S+     ++ ++  K     E +     S   E Q  
Sbjct:   115 TTPMNGKDDPNHQTRIHYHQDQRSQISFSGEENVTPKKPTTLNEAAKQKELSRTVETQAD 174

Query:   180 SLAAARSLESKESKDM 195
             S    + + + ++K M
Sbjct:   175 SKCKKKQISNTKNKAM 190


>TAIR|locus:2060430 [details] [associations]
            symbol:ALKBH2 "homolog of E. coli alkB" species:3702
            "Arabidopsis thaliana" [GO:0005634 "nucleus" evidence=ISM]
            [GO:0008152 "metabolic process" evidence=ISS] [GO:0016491
            "oxidoreductase activity" evidence=ISS] [GO:0006281 "DNA repair"
            evidence=IDA] [GO:0035514 "DNA demethylase activity" evidence=IDA]
            InterPro:IPR005123 PROSITE:PS51471 EMBL:CP002685
            GenomeReviews:CT485783_GR GO:GO:0006281 GO:GO:0016706
            eggNOG:COG3145 Pfam:PF13532 HOGENOM:HOG000207105 EMBL:AC007168
            InterPro:IPR025131 Pfam:PF13266 IPI:IPI00524253 PIR:F84610
            RefSeq:NP_565530.1 UniGene:At.39463 ProteinModelPortal:Q9SIE0
            SMR:Q9SIE0 DNASU:816759 EnsemblPlants:AT2G22260.1 GeneID:816759
            KEGG:ath:AT2G22260 TAIR:At2g22260 InParanoid:Q9SIE0 OMA:SFGCERD
            PhylomeDB:Q9SIE0 ProtClustDB:CLSN2688336 ArrayExpress:Q9SIE0
            Genevestigator:Q9SIE0 GO:GO:0035514 Uniprot:Q9SIE0
        Length = 314

 Score = 120 (47.3 bits), Expect = 3.8e-05, P = 3.8e-05
 Identities = 36/90 (40%), Positives = 46/90 (51%)

Query:    41 DGVSKVLFGGQITDEEAQSLNKKKPCSGYKLKEINGSGIFVANGENGASESDAGNRNNRT 100
             DG+S     GQIT+EEA+SL  KK CSG+KLKE+  S  F    +NG  +SD   R +  
Sbjct:    19 DGISD----GQITNEEAESLINKKNCSGHKLKEVTDSDTF---SDNGKDDSDTKKRFH-- 69

Query:   101 SVRVYQQAMNGISQISFSAEETVSPKKPTS 130
                 Y Q    +S  S  A E+ S     S
Sbjct:    70 ----YHQDQRRMSLTSIVAVESPSSSNAPS 95


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.313   0.128   0.362    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      244       222   0.00096  112 3  11 23  0.43    33
                                                     32  0.40    36


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  4
  No. of states in DFA:  583 (62 KB)
  Total size of DFA:  152 KB (2092 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  25.71u 0.10s 25.81t   Elapsed:  00:00:01
  Total cpu time:  25.71u 0.10s 25.81t   Elapsed:  00:00:01
  Start:  Thu May  9 16:21:39 2013   End:  Thu May  9 16:21:40 2013

Back to top