BLAST Results

Query Summary

Your job contains 1 sequence.

Parameters
Threshold: 0.001
Maximum number of alignments shown: 100
BLAST filter: on

Query Sequence

>022179
MATPVRKSHVSTSDLLTWPEAPSSDSSHPPASAPRSHQPSDGVSKVLFGGQITDEEAQSL
NKKKPCSGYKLKEINGSGIFVANGENGASESDAGNRNNRTSVRVYQQAMNGISQISFSAE
ETVSPKKPTSVPEVAKQRELSGSLQSESDLKTKKQISDAKFKEISGHDIFSPAPEIQPRS
LAAARSLESKESKDMGEPAPRNVRTSVKVSNPAGGQSNILFGEEPVVKTSKKIHNQKFAE
LTGNDIFKGDVPPGSAEKPLSNAKLREMSGSNIFADEKVESRDYFGGVRKPPGGESSISL
V

High Scoring Gene Products

Symbol, full name Information P value
AT4G39860 protein from Arabidopsis thaliana 3.3e-102
AT2G22270 protein from Arabidopsis thaliana 7.2e-52
AT1G35780 protein from Arabidopsis thaliana 5.2e-49
ALKBH2
homolog of E. coli alkB
protein from Arabidopsis thaliana 7.4e-05

Back to top

Raw Blast Data

BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]

Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.

Reference:  Gish, W. (1996-2006) http://blast.wustl.edu

Query=  022179
        (301 letters)

Database:  go_20130330-seqdb.fasta
           368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done

                                                                     Smallest
                                                                       Sum
                                                              High  Probability
Sequences producing High-scoring Segment Pairs:              Score  P(N)      N

TAIR|locus:2140000 - symbol:AT4G39860 "AT4G39860" species...  1013  3.3e-102  1
TAIR|locus:2060425 - symbol:AT2G22270 "AT2G22270" species...   538  7.2e-52   1
TAIR|locus:2011375 - symbol:AT1G35780 "AT1G35780" species...   511  5.2e-49   1
TAIR|locus:2060430 - symbol:ALKBH2 "homolog of E. coli al...   120  7.4e-05   1


>TAIR|locus:2140000 [details] [associations]
            symbol:AT4G39860 "AT4G39860" species:3702 "Arabidopsis
            thaliana" [GO:0003674 "molecular_function" evidence=ND] [GO:0005634
            "nucleus" evidence=ISM;IDA] [GO:0008150 "biological_process"
            evidence=ND] [GO:0005737 "cytoplasm" evidence=IDA] [GO:0000280
            "nuclear division" evidence=RCA] [GO:0000911 "cytokinesis by cell
            plate formation" evidence=RCA] [GO:0006342 "chromatin silencing"
            evidence=RCA] [GO:0007000 "nucleolus organization" evidence=RCA]
            [GO:0008283 "cell proliferation" evidence=RCA] [GO:0016572 "histone
            phosphorylation" evidence=RCA] [GO:0051567 "histone H3-K9
            methylation" evidence=RCA] GO:GO:0005634 GO:GO:0005737
            EMBL:CP002687 EMBL:AL161596 EMBL:AL035708 ProtClustDB:CLSN2685884
            InterPro:IPR025131 Pfam:PF13266 IPI:IPI00547768 PIR:T06092
            RefSeq:NP_195696.1 UniGene:At.27171 STRING:Q9SMR7 PRIDE:Q9SMR7
            EnsemblPlants:AT4G39860.1 GeneID:830145 KEGG:ath:AT4G39860
            TAIR:At4g39860 InParanoid:Q9SMR7 OMA:GYKLKEM PhylomeDB:Q9SMR7
            Genevestigator:Q9SMR7 Uniprot:Q9SMR7
        Length = 299

 Score = 1013 (361.7 bits), Expect = 3.3e-102, P = 3.3e-102
 Identities = 208/299 (69%), Positives = 237/299 (79%)

Query:     3 TPVRKSHVSTSDLLTWXXXXXXXXXXXXXXXXXXXXXXDGVSKVLFGGQITDEEAQSLNK 62
             TPVR  H ST+DLL+W                      DG+SK+L GGQITDEEAQSLNK
Sbjct:     5 TPVRNPHTSTADLLSWSETPPPPHHSTPSAARSHQPS-DGISKILGGGQITDEEAQSLNK 63

Query:    63 KKPCSGYKLKEINGSGIFVANGENGASESDAGNRNNRTSVRVYQQAMNGISQISFSAEET 122
              K CSGYKLKE+ GSGIF   G+ G SESDA   + +T +R YQQ +NG+SQISFSA+  
Sbjct:    64 LKNCSGYKLKEMTGSGIFTDKGKVG-SESDA-TTDPKTGLRYYQQTLNGMSQISFSADGN 121

Query:   123 VSPKKPTSVPEVAKQRELSGSLQSESDLKTKKQISDAKFKEISGHDIFSPAPEIQPRSLA 182
             VSPKKPT++ EVAKQRELSG+L +E+DLK+ KQIS AK +EISGHDIF+P  EIQPRSL 
Sbjct:   122 VSPKKPTTLTEVAKQRELSGNLLTEADLKSNKQISSAKIEEISGHDIFAPPSEIQPRSLV 181

Query:   183 AARSLESKESKDMGEPAPRNVRTSVKVSNPAGGQSNILFGEEPVVKTSKKIHNQKFAELT 242
             AA+  E++ ++DMGEPAPRN+RTSVKVSNPAGGQSNILF EEPVVKTSKKIHNQKF ELT
Sbjct:   182 AAQQ-EARGNRDMGEPAPRNLRTSVKVSNPAGGQSNILFSEEPVVKTSKKIHNQKFQELT 240

Query:   243 GNDIFKGDVPPGSAEKPLSNAKLREMSGSNIFADEKVESRDYFGGVRKPPGGESSISLV 301
             GN IFKGD  PGSA+K LS+AKLREMSG+NIFAD K ESRDYFGGVRKPPGGESSISLV
Sbjct:   241 GNGIFKGDESPGSADKQLSSAKLREMSGNNIFADGKSESRDYFGGVRKPPGGESSISLV 299


>TAIR|locus:2060425 [details] [associations]
            symbol:AT2G22270 "AT2G22270" species:3702 "Arabidopsis
            thaliana" [GO:0003674 "molecular_function" evidence=ND] [GO:0005634
            "nucleus" evidence=ISM] [GO:0008150 "biological_process"
            evidence=ND] EMBL:CP002685 GenomeReviews:CT485783_GR EMBL:AC007168
            HOGENOM:HOG000239183 ProtClustDB:CLSN2685884 InterPro:IPR025131
            Pfam:PF13266 IPI:IPI00539460 PIR:G84610 RefSeq:NP_565531.1
            UniGene:At.21395 PRIDE:Q9SID9 EnsemblPlants:AT2G22270.1
            GeneID:816760 KEGG:ath:AT2G22270 TAIR:At2g22270 eggNOG:NOG309446
            InParanoid:Q9SID9 OMA:RIHYHQD PhylomeDB:Q9SID9 ArrayExpress:Q9SID9
            Genevestigator:Q9SID9 Uniprot:Q9SID9
        Length = 328

 Score = 538 (194.4 bits), Expect = 7.2e-52, P = 7.2e-52
 Identities = 126/226 (55%), Positives = 156/226 (69%)

Query:    83 NGENGASESDAG--NRNNRTSVRVYQQAMNGISQISFSAEETVSPKKPTSVPEVAKQREL 140
             +GE  A+    G  + N++T +  +Q      SQISFS EE V+PKKPT++ E AKQ+EL
Sbjct:   109 SGEENATTPMNGKDDPNHQTRIHYHQDQR---SQISFSGEENVTPKKPTTLNEAAKQKEL 165

Query:   141 SGSLQSESDLKTKK-QISDAKFKEISGHDIFSPAPEIQPRSL-AAARSLESKESKDMGEP 198
             S ++++++D K KK QIS+ K K +SGHDIF+ +PE QPR L   A   E K +K+  E 
Sbjct:   166 SRTVETQADSKCKKKQISNTKNKAMSGHDIFA-SPESQPRRLFGGATQSEVKGNKNTEES 224

Query:   199 APRNVRTSVKVSNPAGGQSNILFGEEPVVKTSKKIHNQK--FAELTGNDIFKGD-VPPGS 255
             APR+ R SVK SN  G  SN LF EE VVK+SKKIHNQK  F  LT N IFK D +PPG 
Sbjct:   225 APRSSRASVKTSN--GQSSNRLFSEEHVVKSSKKIHNQKSQFQGLTSNGIFKSDKIPPGY 282

Query:   256 AEKPLSNAKLREMSGSNIFADEKVESRDYFGGVRKPPGGESSISLV 301
             +EK  S+AK REMSG NIFAD K E RDY+GG R+PPGGESSISLV
Sbjct:   283 SEKMQSSAKKREMSGHNIFADGKSEYRDYYGGARRPPGGESSISLV 328

 Score = 152 (58.6 bits), Expect = 1.5e-08, P = 1.5e-08
 Identities = 72/290 (24%), Positives = 122/290 (42%)

Query:     9 HVSTSDLLTWXXXXXXXXXXXXXXXXXXXXXXDGVSKVLFGG-QITDEEAQSLNK----K 63
             H ST+DLL+W                      DG++ VL GG QIT+ E +SLN     +
Sbjct:     9 HHSTADLLSWSEIRRPDYSTAANRSNQPS---DGMNDVLGGGGQITNAETKSLNTNVSHR 65

Query:    64 KPCSGYKLKEINGSGIFVANGENGASESDAGNRNNRTSVRVYQQAMNGISQISFSAEETV 123
             K CSG+KLKE+ GS IF  +G+         + N++T +  +Q  +   SQISFS EE  
Sbjct:    66 KNCSGHKLKEMTGSDIFSDDGKY--------DPNHQTRIHYHQDQL---SQISFSGEENA 114

Query:   124 S-PKKPTSVPEVAKQRELSGSLQSESDLKTKKQISDAK---FKEISGHDIFSPAPEIQPR 179
             + P      P    +       +S+     ++ ++  K     E +     S   E Q  
Sbjct:   115 TTPMNGKDDPNHQTRIHYHQDQRSQISFSGEENVTPKKPTTLNEAAKQKELSRTVETQAD 174

Query:   180 SLAAARSLESKESKDMG--------EPAPRNVRTSVKVSNPAGGQSNILFGEEPVVKTSK 231
             S    + + + ++K M         E  PR +      S   G ++     EE   ++S+
Sbjct:   175 SKCKKKQISNTKNKAMSGHDIFASPESQPRRLFGGATQSEVKGNKNT----EESAPRSSR 230

Query:   232 KIHNQKFAELTGNDIFKGDVPPGSAEKPLSNAK--LREMSGSNIFADEKV 279
                     + + N +F  +    S++K + N K   + ++ + IF  +K+
Sbjct:   231 ASVKTSNGQ-SSNRLFSEEHVVKSSKK-IHNQKSQFQGLTSNGIFKSDKI 278


>TAIR|locus:2011375 [details] [associations]
            symbol:AT1G35780 "AT1G35780" species:3702 "Arabidopsis
            thaliana" [GO:0003674 "molecular_function" evidence=ND] [GO:0005634
            "nucleus" evidence=ISM] [GO:0008150 "biological_process"
            evidence=ND] EMBL:CP002684 GenomeReviews:CT485782_GR EMBL:AC021198
            EMBL:AF428333 EMBL:AY063902 EMBL:AY122925 IPI:IPI00542942
            PIR:A86480 RefSeq:NP_564463.1 UniGene:At.25479 UniGene:At.74755
            PaxDb:Q9LP18 PRIDE:Q9LP18 EnsemblPlants:AT1G35780.1 GeneID:840483
            KEGG:ath:AT1G35780 TAIR:At1g35780 eggNOG:NOG327187
            HOGENOM:HOG000239183 InParanoid:Q9LP18 OMA:SGNNVFK PhylomeDB:Q9LP18
            ProtClustDB:CLSN2685884 ArrayExpress:Q9LP18 Genevestigator:Q9LP18
            InterPro:IPR025131 Pfam:PF13266 Uniprot:Q9LP18
        Length = 286

 Score = 511 (184.9 bits), Expect = 5.2e-49, P = 5.2e-49
 Identities = 117/257 (45%), Positives = 155/257 (60%)

Query:     3 TPVRKSHVSTSDLLTWXXXXXXXXXXXXXXXXXXXXX--XDGVSKVLFGGQITDEEAQSL 60
             TPVRK H+ST+DLLTW                        DG+SKV+FGGQ+TDEE +SL
Sbjct:     5 TPVRKPHMSTADLLTWPENQPFESPAAVSSRSAARSHQPSDGISKVVFGGQVTDEEVESL 64

Query:    61 NKKKPCSGYKLKEINGSGIFVANGENGASESDAGNRNNRTSVRVYQQAMNGI-SQISFSA 119
             NK+KPCS YK+KEI GSGIF    EN  SE  + N       R +QQ    I S ISF  
Sbjct:    65 NKRKPCSNYKMKEITGSGIFSVYEENDDSELASANSATNGKSRTFQQPPAAIMSHISFGE 124

Query:   120 EETVSPKKPTSVPEVAKQRELSGSLQSESDLKTKKQISDAKFKEISGHDIFSPAPEIQPR 179
             EE V+PKKP +VPEVAKQRELSG+L+ +SD K  KQ SDAK KE+SGH+IF+P PEI+ R
Sbjct:   125 EEIVTPKKPATVPEVAKQRELSGTLEYQSDAKLNKQFSDAKCKELSGHNIFAPPPEIKLR 184

Query:   180 SLAAARSLESKESKDMGEPAPR---NVRTSVKVSNP--AGGQSNILFGEE---PVVKTSK 231
                  R+L  K++ D+GE   +    ++T+ K+++        N +F  +   P   T++
Sbjct:   185 P--TVRALAYKDNFDLGESDTKPDGELKTAKKIADRKFTDLSGNNVFKSDVSSPSSATAE 242

Query:   232 KI-HNQKFAELTGNDIF 247
             ++    K  E++GNDIF
Sbjct:   243 RLLSTAKLKEISGNDIF 259

 Score = 260 (96.6 bits), Expect = 2.1e-22, P = 2.1e-22
 Identities = 77/208 (37%), Positives = 113/208 (54%)

Query:   104 VYQQAMNGISQISFSAEETVSPKKPT-SVPEVAKQRELS-GSLQSESDLKTKKQISDAKF 161
             VY++  N  S+++ SA    + K  T   P  A    +S G  +  +  K       AK 
Sbjct:    86 VYEE--NDDSELA-SANSATNGKSRTFQQPPAAIMSHISFGEEEIVTPKKPATVPEVAKQ 142

Query:   162 KEISGHDIFSPAPEIQPRSLAAARSLESKESKDMGEPAPRNVRTSVKVSNPAGGQSNILF 221
             +E+SG   +    ++  +  + A+  E         P    +R +V+       + N   
Sbjct:   143 RELSGTLEYQSDAKLN-KQFSDAKCKELSGHNIFAPPPEIKLRPTVRA---LAYKDNFDL 198

Query:   222 GEEPV-----VKTSKKIHNQKFAELTGNDIFKGDVP-PGSA--EKPLSNAKLREMSGSNI 273
             GE        +KT+KKI ++KF +L+GN++FK DV  P SA  E+ LS AKL+E+SG++I
Sbjct:   199 GESDTKPDGELKTAKKIADRKFTDLSGNNVFKSDVSSPSSATAERLLSTAKLKEISGNDI 258

Query:   274 FADEKVESRDYFGGVRKPPGGESSISLV 301
             FAD K +SRDYFGGVRKPPGGESSI+LV
Sbjct:   259 FADAKAQSRDYFGGVRKPPGGESSIALV 286


>TAIR|locus:2060430 [details] [associations]
            symbol:ALKBH2 "homolog of E. coli alkB" species:3702
            "Arabidopsis thaliana" [GO:0005634 "nucleus" evidence=ISM]
            [GO:0008152 "metabolic process" evidence=ISS] [GO:0016491
            "oxidoreductase activity" evidence=ISS] [GO:0006281 "DNA repair"
            evidence=IDA] [GO:0035514 "DNA demethylase activity" evidence=IDA]
            InterPro:IPR005123 PROSITE:PS51471 EMBL:CP002685
            GenomeReviews:CT485783_GR GO:GO:0006281 GO:GO:0016706
            eggNOG:COG3145 Pfam:PF13532 HOGENOM:HOG000207105 EMBL:AC007168
            InterPro:IPR025131 Pfam:PF13266 IPI:IPI00524253 PIR:F84610
            RefSeq:NP_565530.1 UniGene:At.39463 ProteinModelPortal:Q9SIE0
            SMR:Q9SIE0 DNASU:816759 EnsemblPlants:AT2G22260.1 GeneID:816759
            KEGG:ath:AT2G22260 TAIR:At2g22260 InParanoid:Q9SIE0 OMA:SFGCERD
            PhylomeDB:Q9SIE0 ProtClustDB:CLSN2688336 ArrayExpress:Q9SIE0
            Genevestigator:Q9SIE0 GO:GO:0035514 Uniprot:Q9SIE0
        Length = 314

 Score = 120 (47.3 bits), Expect = 7.5e-05, P = 7.4e-05
 Identities = 36/90 (40%), Positives = 46/90 (51%)

Query:    41 DGVSKVLFGGQITDEEAQSLNKKKPCSGYKLKEINGSGIFVANGENGASESDAGNRNNRT 100
             DG+S     GQIT+EEA+SL  KK CSG+KLKE+  S  F    +NG  +SD   R +  
Sbjct:    19 DGISD----GQITNEEAESLINKKNCSGHKLKEVTDSDTF---SDNGKDDSDTKKRFH-- 69

Query:   101 SVRVYQQAMNGISQISFSAEETVSPKKPTS 130
                 Y Q    +S  S  A E+ S     S
Sbjct:    70 ----YHQDQRRMSLTSIVAVESPSSSNAPS 95


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.306   0.126   0.342    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      301       279   0.00081  115 3  11 23  0.37    35
                                                     33  0.41    37


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  4
  No. of states in DFA:  573 (61 KB)
  Total size of DFA:  158 KB (2094 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  27.29u 0.18s 27.47t   Elapsed:  00:00:01
  Total cpu time:  27.29u 0.18s 27.47t   Elapsed:  00:00:01
  Start:  Sat May 11 09:09:42 2013   End:  Sat May 11 09:09:43 2013

Back to top