BLAST Results

Query Summary

Your job contains 1 sequence.

Parameters
Threshold: 0.001
Maximum number of alignments shown: 100
BLAST filter: on

Query Sequence

>021942
MERSTPVRKPHTSTADLLVWSETPPSDSPAQASSTRSSVRGQPSDGISKVVFGGQVTDEE
VESLNRRKPCSGYKMKEMTGSGIFAAGAENDESESGSANPTPNNKTGLRMYQQAIAGISH
ISFGEEDSISPKKPTTLPEVAKQRELSGTLESESEAKLKKQISDAKSKELSGHDIFAPPP
EILPRPAVRALALKENFNLGDSAPQDVQTSVGVLTPAGDQSSISSTEEPVMKTSKKIYDK
KFSELSGNDIFKGDVPPSSAEKPLSVAKLREMSGSNIFADGKVESRDYLGGVRKPPGGES
SIALV

High Scoring Gene Products

Symbol, full name                    Information                         P value
AT4G39860                            protein from Arabidopsis thaliana   2.3e-94
AT1G35780                            protein from Arabidopsis thaliana   4.6e-73
AT2G22270                            protein from Arabidopsis thaliana   2.2e-43
MLIP, Uncharacterized protein        protein from Bos taurus             5.8e-05
ALKBH2, homolog of E. coli alkB      protein from Arabidopsis thaliana   7.0e-05


Raw Blast Data

BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]

Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.

Reference:  Gish, W. (1996-2006) http://blast.wustl.edu

Query=  021942
        (305 letters)

Database:  go_20130330-seqdb.fasta
           368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done

                                                                     Smallest
                                                                       Sum
                                                              High  Probability
Sequences producing High-scoring Segment Pairs:              Score  P(N)      N

TAIR|locus:2140000 - symbol:AT4G39860 "AT4G39860" species...   939  2.3e-94   1
TAIR|locus:2011375 - symbol:AT1G35780 "AT1G35780" species...   738  4.6e-73   1
TAIR|locus:2060425 - symbol:AT2G22270 "AT2G22270" species...   458  2.2e-43   1
UNIPROTKB|E1BQ40 - symbol:MLIP "Uncharacterized protein" ...    95  5.8e-05   2
TAIR|locus:2060430 - symbol:ALKBH2 "homolog of E. coli al...   121  7.0e-05   1
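The three numeric columns in the summary lines above (High Score, P(N), N) sit at the right edge of each line, so they can be recovered by splitting on whitespace from the right. The following is a minimal Python sketch under that assumption. The function name parse_hit_line and the dictionary keys are hypothetical and not part of any BLAST tool; the sample lines are copied from this report.

```python
def parse_hit_line(line):
    """Split one summary line into identifier and numeric columns.

    Assumes the last three whitespace-separated fields are
    High Score, P(N) and N, as in the table above.
    """
    head, score, p_value, n = line.rsplit(None, 3)
    # The identifier precedes " - symbol:", e.g. "TAIR|locus:2140000".
    identifier = head.split(" - ", 1)[0]
    database, accession = identifier.split("|", 1)
    return {
        "database": database,
        "accession": accession,
        "score": int(score),
        "p_value": float(p_value),
        "n": int(n),
    }


sample_lines = [
    'TAIR|locus:2140000 - symbol:AT4G39860 "AT4G39860" species...   939  2.3e-94   1',
    'UNIPROTKB|E1BQ40 - symbol:MLIP "Uncharacterized protein" ...    95  5.8e-05   2',
]
hits = [parse_hit_line(line) for line in sample_lines]
for hit in hits:
    print(hit["database"], hit["accession"], hit["score"], hit["p_value"], hit["n"])
```

Splitting from the right avoids having to interpret the truncated description text in the middle of each line.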


>TAIR|locus:2140000
            symbol:AT4G39860 "AT4G39860" species:3702 "Arabidopsis
            thaliana" [GO:0003674 "molecular_function" evidence=ND] [GO:0005634
            "nucleus" evidence=ISM;IDA] [GO:0008150 "biological_process"
            evidence=ND] [GO:0005737 "cytoplasm" evidence=IDA] [GO:0000280
            "nuclear division" evidence=RCA] [GO:0000911 "cytokinesis by cell
            plate formation" evidence=RCA] [GO:0006342 "chromatin silencing"
            evidence=RCA] [GO:0007000 "nucleolus organization" evidence=RCA]
            [GO:0008283 "cell proliferation" evidence=RCA] [GO:0016572 "histone
            phosphorylation" evidence=RCA] [GO:0051567 "histone H3-K9
            methylation" evidence=RCA] GO:GO:0005634 GO:GO:0005737
            EMBL:CP002687 EMBL:AL161596 EMBL:AL035708 ProtClustDB:CLSN2685884
            InterPro:IPR025131 Pfam:PF13266 IPI:IPI00547768 PIR:T06092
            RefSeq:NP_195696.1 UniGene:At.27171 STRING:Q9SMR7 PRIDE:Q9SMR7
            EnsemblPlants:AT4G39860.1 GeneID:830145 KEGG:ath:AT4G39860
            TAIR:At4g39860 InParanoid:Q9SMR7 OMA:GYKLKEM PhylomeDB:Q9SMR7
            Genevestigator:Q9SMR7 Uniprot:Q9SMR7
        Length = 299

 Score = 939 (335.6 bits), Expect = 2.3e-94, P = 2.3e-94
 Identities = 194/307 (63%), Positives = 236/307 (76%)

Query:     1 MERSTPVRKPHTSTADLLVWSETPPSDSPAQASSTRSSVRGQPSDGISKVVFGGQVTDEE 60
             MER+TPVR PHTSTADLL WSETPP    +  S+ RS    QPSDGISK++ GGQ+TDEE
Sbjct:     1 MERNTPVRNPHTSTADLLSWSETPPPPHHSTPSAARSH---QPSDGISKILGGGQITDEE 57

Query:    61 VESLNRRKPCSGYKMKEMTGSGIFAAGAENDESESGS-ANPTPNNKTGLRMYQQAIAGIS 119
              +SLN+ K CSGYK+KEMTGSGIF      D+ + GS ++ T + KTGLR YQQ + G+S
Sbjct:    58 AQSLNKLKNCSGYKLKEMTGSGIFT-----DKGKVGSESDATTDPKTGLRYYQQTLNGMS 112

Query:   120 HISFGEEDSISPKKPTTLPEVAKQRELSGTLESESEAKLKKQISDAKSKELSGHDIFAPP 179
              ISF  + ++SPKKPTTL EVAKQRELSG L +E++ K  KQIS AK +E+SGHDIFAPP
Sbjct:   113 QISFSADGNVSPKKPTTLTEVAKQRELSGNLLTEADLKSNKQISSAKIEEISGHDIFAPP 172

Query:   180 PEILPRPAVRALA-LKENFNLGDSAPQDVQTSVGVLTPAGDQSSISSTEEPVMKTSKKIY 238
              EI PR  V A    + N ++G+ AP++++TSV V  PAG QS+I  +EEPV+KTSKKI+
Sbjct:   173 SEIQPRSLVAAQQEARGNRDMGEPAPRNLRTSVKVSNPAGGQSNILFSEEPVVKTSKKIH 232

Query:   239 DKKFSELSGNDIFKGDVPPSSAEKPLSVAKLREMSGSNIFADGKVESRDYLGGVRKPPGG 298
             ++KF EL+GN IFKGD  P SA+K LS AKLREMSG+NIFADGK ESRDY GGVRKPPGG
Sbjct:   233 NQKFQELTGNGIFKGDESPGSADKQLSSAKLREMSGNNIFADGKSESRDYFGGVRKPPGG 292

Query:   299 ESSIALV 305
             ESSI+LV
Sbjct:   293 ESSISLV 299
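The percentages on the Identities lines in this report appear to be truncated rather than rounded: 236/307 is about 76.9% but is printed as 76%. The short Python sketch below checks that reading against several Identities/Positives pairs copied from this report. Truncation is an inference from these numbers, not something stated in the output, and the helper name percent_truncated is hypothetical.

```python
def percent_truncated(matches, length):
    """Integer percentage with the fractional part dropped."""
    return matches * 100 // length


# (matches, alignment length, percentage printed in the report)
reported = [
    (194, 307, 63), (236, 307, 76),
    (160, 262, 61), (192, 262, 73),
    (54, 78, 69), (67, 78, 85),
]
for matches, length, shown in reported:
    assert percent_truncated(matches, length) == shown
```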


>TAIR|locus:2011375
            symbol:AT1G35780 "AT1G35780" species:3702 "Arabidopsis
            thaliana" [GO:0003674 "molecular_function" evidence=ND] [GO:0005634
            "nucleus" evidence=ISM] [GO:0008150 "biological_process"
            evidence=ND] EMBL:CP002684 GenomeReviews:CT485782_GR EMBL:AC021198
            EMBL:AF428333 EMBL:AY063902 EMBL:AY122925 IPI:IPI00542942
            PIR:A86480 RefSeq:NP_564463.1 UniGene:At.25479 UniGene:At.74755
            PaxDb:Q9LP18 PRIDE:Q9LP18 EnsemblPlants:AT1G35780.1 GeneID:840483
            KEGG:ath:AT1G35780 TAIR:At1g35780 eggNOG:NOG327187
            HOGENOM:HOG000239183 InParanoid:Q9LP18 OMA:SGNNVFK PhylomeDB:Q9LP18
            ProtClustDB:CLSN2685884 ArrayExpress:Q9LP18 Genevestigator:Q9LP18
            InterPro:IPR025131 Pfam:PF13266 Uniprot:Q9LP18
        Length = 286

 Score = 738 (264.8 bits), Expect = 4.6e-73, P = 4.6e-73
 Identities = 160/262 (61%), Positives = 192/262 (73%)

Query:     1 MERSTPVRKPHTSTADLLVWSETPPSDSPAQASSTRSSVRG-QPSDGISKVVFGGQVTDE 59
             ME++TPVRKPH STADLL W E  P +SPA  SS RS+ R  QPSDGISKVVFGGQVTDE
Sbjct:     1 MEKNTPVRKPHMSTADLLTWPENQPFESPAAVSS-RSAARSHQPSDGISKVVFGGQVTDE 59

Query:    60 EVESLNRRKPCSGYKMKEMTGSGIFAAGAENDESESGSANPTPNNKTGLRMYQQAIAGI- 118
             EVESLN+RKPCS YKMKE+TGSGIF+   END+SE  SAN   N K+  R +QQ  A I 
Sbjct:    60 EVESLNKRKPCSNYKMKEITGSGIFSVYEENDDSELASANSATNGKS--RTFQQPPAAIM 117

Query:   119 SHISFGEEDSISPKKPTTLPEVAKQRELSGTLESESEAKLKKQISDAKSKELSGHDIFAP 178
             SHISFGEE+ ++PKKP T+PEVAKQRELSGTLE +S+AKL KQ SDAK KELSGH+IFAP
Sbjct:   118 SHISFGEEEIVTPKKPATVPEVAKQRELSGTLEYQSDAKLNKQFSDAKCKELSGHNIFAP 177

Query:   179 PPEILPRPAVRALALKENFNLG--DSAPQ-DVQTSVGV----LTP-AGDQSSISSTEEPV 230
             PPEI  RP VRALA K+NF+LG  D+ P  +++T+  +     T  +G+    S    P 
Sbjct:   178 PPEIKLRPTVRALAYKDNFDLGESDTKPDGELKTAKKIADRKFTDLSGNNVFKSDVSSPS 237

Query:   231 MKTSKKIYDK-KFSELSGNDIF 251
               T++++    K  E+SGNDIF
Sbjct:   238 SATAERLLSTAKLKEISGNDIF 259

 Score = 265 (98.3 bits), Expect = 6.1e-23, P = 6.1e-23
 Identities = 54/78 (69%), Positives = 67/78 (85%)

Query:   231 MKTSKKIYDKKFSELSGNDIFKGDVP-PSSA--EKPLSVAKLREMSGSNIFADGKVESRD 287
             +KT+KKI D+KF++LSGN++FK DV  PSSA  E+ LS AKL+E+SG++IFAD K +SRD
Sbjct:   209 LKTAKKIADRKFTDLSGNNVFKSDVSSPSSATAERLLSTAKLKEISGNDIFADAKAQSRD 268

Query:   288 YLGGVRKPPGGESSIALV 305
             Y GGVRKPPGGESSIALV
Sbjct:   269 YFGGVRKPPGGESSIALV 286


>TAIR|locus:2060425
            symbol:AT2G22270 "AT2G22270" species:3702 "Arabidopsis
            thaliana" [GO:0003674 "molecular_function" evidence=ND] [GO:0005634
            "nucleus" evidence=ISM] [GO:0008150 "biological_process"
            evidence=ND] EMBL:CP002685 GenomeReviews:CT485783_GR EMBL:AC007168
            HOGENOM:HOG000239183 ProtClustDB:CLSN2685884 InterPro:IPR025131
            Pfam:PF13266 IPI:IPI00539460 PIR:G84610 RefSeq:NP_565531.1
            UniGene:At.21395 PRIDE:Q9SID9 EnsemblPlants:AT2G22270.1
            GeneID:816760 KEGG:ath:AT2G22270 TAIR:At2g22270 eggNOG:NOG309446
            InParanoid:Q9SID9 OMA:RIHYHQD PhylomeDB:Q9SID9 ArrayExpress:Q9SID9
            Genevestigator:Q9SID9 Uniprot:Q9SID9
        Length = 328

 Score = 458 (166.3 bits), Expect = 2.2e-43, P = 2.2e-43
 Identities = 112/232 (48%), Positives = 149/232 (64%)

Query:    81 SGIFAAGAENDESESGSANPTPNNKTGLRMYQQAIAGISHISFGEEDSISPKKPTTLPEV 140
             S I  +G EN  +     +  PN++T +  +Q      S ISF  E++++PKKPTTL E 
Sbjct:   104 SQISFSGEENATTPMNGKDD-PNHQTRIHYHQDQR---SQISFSGEENVTPKKPTTLNEA 159

Query:   141 AKQRELSGTLESESEAKLKK-QISDAKSKELSGHDIFAPPPEILPRPAVRALA---LKEN 196
             AKQ+ELS T+E+++++K KK QIS+ K+K +SGHDIFA P E  PR          +K N
Sbjct:   160 AKQKELSRTVETQADSKCKKKQISNTKNKAMSGHDIFASP-ESQPRRLFGGATQSEVKGN 218

Query:   197 FNLGDSAPQDVQTSVGVLTPAGDQSSISSTEEPVMKTSKKIYDKK--FSELSGNDIFKGD 254
              N  +SAP+  + SV   T  G  S+   +EE V+K+SKKI+++K  F  L+ N IFK D
Sbjct:   219 KNTEESAPRSSRASVK--TSNGQSSNRLFSEEHVVKSSKKIHNQKSQFQGLTSNGIFKSD 276

Query:   255 -VPPSSAEKPLSVAKLREMSGSNIFADGKVESRDYLGGVRKPPGGESSIALV 305
              +PP  +EK  S AK REMSG NIFADGK E RDY GG R+PPGGESSI+LV
Sbjct:   277 KIPPGYSEKMQSSAKKREMSGHNIFADGKSEYRDYYGGARRPPGGESSISLV 328

 Score = 199 (75.1 bits), Expect = 3.7e-14, P = 3.7e-14
 Identities = 70/177 (39%), Positives = 97/177 (54%)

Query:    10 PHTSTADLLVWSETPPSDSPAQASSTRSSVRGQPSDGISKVVFGG-QVTDEEVESLN--- 65
             PH STADLL WSE    D    A+  RS+   QPSDG++ V+ GG Q+T+ E +SLN   
Sbjct:     8 PHHSTADLLSWSEIRRPDYSTAAN--RSN---QPSDGMNDVLGGGGQITNAETKSLNTNV 62

Query:    66 -RRKPCSGYKMKEMTGSGIFAAGAENDESESGSANPTPNNKTGLRMYQQAIAGISHISF- 123
               RK CSG+K+KEMTGS IF        S+ G  +P  N++T +  +Q  +   S ISF 
Sbjct:    63 SHRKNCSGHKLKEMTGSDIF--------SDDGKYDP--NHQTRIHYHQDQL---SQISFS 109

Query:   124 GEEDSISPKK----P---TTLPEVAKQR-ELSGTLESESEAKLKKQISDA-KSKELS 171
             GEE++ +P      P   T +     QR ++S + E     K    +++A K KELS
Sbjct:   110 GEENATTPMNGKDDPNHQTRIHYHQDQRSQISFSGEENVTPKKPTTLNEAAKQKELS 166


>UNIPROTKB|E1BQ40
            symbol:MLIP "Uncharacterized protein" species:9913 "Bos
            taurus" [GO:0016605 "PML body" evidence=IEA] [GO:0005635 "nuclear
            envelope" evidence=IEA] GO:GO:0005635 GO:GO:0016605 OMA:KKPKQYK
            GeneTree:ENSGT00390000015862 EMBL:DAAA02054850 EMBL:DAAA02054851
            EMBL:DAAA02054852 EMBL:DAAA02054853 EMBL:DAAA02054854
            EMBL:DAAA02054855 EMBL:DAAA02054856 EMBL:DAAA02054857
            EMBL:DAAA02054858 EMBL:DAAA02054859 EMBL:DAAA02054860
            IPI:IPI00717061 Ensembl:ENSBTAT00000019407 Uniprot:E1BQ40
        Length = 992

 Score = 95 (38.5 bits), Expect = 5.8e-05, Sum P(2) = 5.8e-05
 Identities = 43/157 (27%), Positives = 69/157 (43%)

Query:   122 SFGEEDSISPKKPTTLPEVAKQR--ELSGTLESESEAKLKKQISDAKSKELS---GHDIF 176
             S  ++ ++ P      P   +Q+  EL  T++   +  L    SD+ S  L    G D  
Sbjct:   770 SISKDSTLDPPLELCSPAQLRQQTEELCATIDKVLQDSLSMHSSDSPSSSLQTFLGSDTI 829

Query:   177 APPPEILPRPAVRALALKENFNLGDSAPQDVQTSVGVLTPAGDQSSISSTEEPVMKTSKK 236
               P   LPR A R        +   + P+   T  GV+ PA  +S I      ++K  ++
Sbjct:   830 KMPTT-LPRAAGRETKYANLSSPSSTVPESQLTKPGVIRPAPVKSKI------LLKKEEE 882

Query:   237 IYDKK-FSE-LSGN-DIF-KGDVPPSSAEKPLSVAKL 269
             +Y+   FS+ L  N D F + DVP  +  KP+S+  L
Sbjct:   883 VYEPNPFSKYLEDNSDFFSEQDVP--APPKPVSLHPL 917

 Score = 80 (33.2 bits), Expect = 5.8e-05, Sum P(2) = 5.8e-05
 Identities = 33/139 (23%), Positives = 55/139 (39%)

Query:     4 STPVRKPHTSTADLLVWSETPPSDSPAQASSTRSSVRGQPSDGISKVVFGGQVTDEEVES 63
             S P   P  +++   V      S SP+ + ST     GQ S+ + K V    ++  E   
Sbjct:   285 SAPFFAPKGTSSTSQVPQPAQLSGSPSSSPSTAQQNPGQTSEVLKKTVTSNVLSPRESPR 344

Query:    64 LNRRKPCSGYKMKEMTGSGIFAAGAENDESESGSANPTPNNKTGLRMYQQAIAGISHISF 123
              +   P SG  +K    S I      +  S S  A P+P + +   +  QA +  S    
Sbjct:   345 ASSPSPASGASLKSSAASYIPVRIVTHSLSPSPRAFPSPFHGSSSTVCSQASSSGSLSRS 404

Query:   124 GEEDSISPKKPTTLPEVAK 142
             G +  + P + + L  + K
Sbjct:   405 GVKPPV-PSRLSVLTAILK 422


>TAIR|locus:2060430
            symbol:ALKBH2 "homolog of E. coli alkB" species:3702
            "Arabidopsis thaliana" [GO:0005634 "nucleus" evidence=ISM]
            [GO:0008152 "metabolic process" evidence=ISS] [GO:0016491
            "oxidoreductase activity" evidence=ISS] [GO:0006281 "DNA repair"
            evidence=IDA] [GO:0035514 "DNA demethylase activity" evidence=IDA]
            InterPro:IPR005123 PROSITE:PS51471 EMBL:CP002685
            GenomeReviews:CT485783_GR GO:GO:0006281 GO:GO:0016706
            eggNOG:COG3145 Pfam:PF13532 HOGENOM:HOG000207105 EMBL:AC007168
            InterPro:IPR025131 Pfam:PF13266 IPI:IPI00524253 PIR:F84610
            RefSeq:NP_565530.1 UniGene:At.39463 ProteinModelPortal:Q9SIE0
            SMR:Q9SIE0 DNASU:816759 EnsemblPlants:AT2G22260.1 GeneID:816759
            KEGG:ath:AT2G22260 TAIR:At2g22260 InParanoid:Q9SIE0 OMA:SFGCERD
            PhylomeDB:Q9SIE0 ProtClustDB:CLSN2688336 ArrayExpress:Q9SIE0
            Genevestigator:Q9SIE0 GO:GO:0035514 Uniprot:Q9SIE0
        Length = 314

 Score = 121 (47.7 bits), Expect = 7.0e-05, P = 7.0e-05
 Identities = 28/60 (46%), Positives = 40/60 (66%)

Query:    33 SSTRSSVRGQPS-DGISKVVFGGQVTDEEVESLNRRKPCSGYKMKEMTGSGIFAAGAEND 91
             +ST ++   QPS DGIS     GQ+T+EE ESL  +K CSG+K+KE+T S  F+   ++D
Sbjct:     6 NSTAANRSNQPSSDGISD----GQITNEEAESLINKKNCSGHKLKEVTDSDTFSDNGKDD 61


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.305   0.125   0.343    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      305       305   0.00098  115 3  11 23  0.41    35
                                                     33  0.45    37
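The bit scores printed next to each raw score in this report are consistent with the standard Karlin-Altschul conversion, bits = (Lambda * S - ln K) / ln 2, when the gapped Lambda and K from the Q=9,R=2 row above are used. The Python sketch below reproduces them. The function and constant names are hypothetical, and the choice of the gapped row is inferred from the numbers matching rather than stated in the report.

```python
import math

# Gapped statistical parameters from the "Q=9,R=2" row of the table above.
GAPPED_LAMBDA = 0.244
GAPPED_K = 0.0300


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score to bits: (lambda*S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


# (raw score, bits printed in the report)
reported = [(939, 335.6), (738, 264.8), (458, 166.3), (121, 47.7), (95, 38.5)]
for raw, bits in reported:
    print(raw, round(bit_score(raw), 1), bits)
```

Using the ungapped BLOSUM62 row (Lambda 0.305, K 0.125) instead gives roughly 416 bits for the raw score 939, which does not match the printed 335.6.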


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  5
  No. of states in DFA:  596 (63 KB)
  Total size of DFA:  177 KB (2102 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  32.63u 0.17s 32.80t   Elapsed:  00:00:01
  Total cpu time:  32.63u 0.17s 32.80t   Elapsed:  00:00:01
  Start:  Fri May 10 21:15:33 2013   End:  Fri May 10 21:15:34 2013
