BLAST Results

Query Summary

Your job contains 1 sequence.

Parameters
Threshold: 0.001
Maximum number of alignments shown: 100
BLAST filter: on

Query Sequence

>011721
MRWKDEELFHVIHKVPAGDSPYVRAKRAQLVEKDPSRAISLFWAAINAGDRVDSALKDMA
VVMKQLDRSEEAIEAIKSFRCLCADDSQESLDNVLLELYKRSKRIEEEIELLKRKLKKTE
EVIACGGKSTKIARSQGRKTQITLVQELSRISGNLAWAYLQQNDYESAERYYMKALSLES
DKNKQCNLAICLIRLNRIAEAKSLLQAVRASSRNEKMDESYAKSFEHASLMLTELESQSM
LQPTDYGEDKRKKILSSCTYINGSEENVSRFMVPRKCRKFYYPKTPCERSNGATVTSTKT
EARAVLKKAYALPAGVITNSASPITQPRRPSWTFDNKDQRNQQGKDDATDSPHWKLPIKQ
MTASENMQEDAVMFTQPRSSWGFGNRAQRRERWREDTVSGSVCKLTFENAITSENMEAHV
INNLNGKLQASTNERSEMGRPDSGAALSSPTCEDWRRRPWSIIAKVKQKQQYSGIHYRY

High Scoring Gene Products

Symbol, full name Information P value
AT5G44330 protein from Arabidopsis thaliana 4.2e-61
MS5
MALE-STERILE 5
protein from Arabidopsis thaliana 2.9e-60
AT3G51280 protein from Arabidopsis thaliana 1.3e-59
ATSDI1
SULPHUR DEFICIENCY-INDUCED 1
protein from Arabidopsis thaliana 2.7e-59
AT1G04770 protein from Arabidopsis thaliana 9.5e-50

Back to top

Raw Blast Data

BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]

Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.

Reference:  Gish, W. (1996-2006) http://blast.wustl.edu

Query=  011721
        (479 letters)

Database:  go_20130330-seqdb.fasta
           368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done

                                                                     Smallest
                                                                       Sum
                                                              High  Probability
Sequences producing High-scoring Segment Pairs:              Score  P(N)      N

TAIR|locus:2158750 - symbol:AT5G44330 species:3702 "Arabi...   586  4.2e-61   2
TAIR|locus:2133099 - symbol:MS5 "MALE-STERILE 5" species:...   587  2.9e-60   2
TAIR|locus:2080858 - symbol:AT3G51280 "AT3G51280" species...   611  1.3e-59   1
TAIR|locus:2156574 - symbol:ATSDI1 "SULPHUR DEFICIENCY-IN...   608  2.7e-59   1
TAIR|locus:2010612 - symbol:AT1G04770 "AT1G04770" species...   518  9.5e-50   1


>TAIR|locus:2158750 [details] [associations]
            symbol:AT5G44330 species:3702 "Arabidopsis thaliana"
            [GO:0005634 "nucleus" evidence=ISM] [GO:0008150
            "biological_process" evidence=ND] [GO:0006863 "purine nucleobase
            transport" evidence=RCA] InterPro:IPR001440 InterPro:IPR011990
            InterPro:IPR013026 InterPro:IPR019734 Pfam:PF00515 PROSITE:PS50005
            PROSITE:PS50293 EMBL:CP002688 GenomeReviews:BA000015_GR
            EMBL:AB011475 Gene3D:1.25.40.10 EMBL:DQ056706 IPI:IPI00544319
            RefSeq:NP_199246.1 UniGene:At.55352 ProteinModelPortal:Q9FKV5
            SMR:Q9FKV5 EnsemblPlants:AT5G44330.1 GeneID:834458
            KEGG:ath:AT5G44330 TAIR:At5g44330 eggNOG:NOG271543
            HOGENOM:HOG000152531 InParanoid:Q9FKV5 OMA:WDATIGA PhylomeDB:Q9FKV5
            ProtClustDB:CLSN2685326 Genevestigator:Q9FKV5 Uniprot:Q9FKV5
        Length = 469

 Score = 586 (211.3 bits), Expect = 4.2e-61, Sum P(2) = 4.2e-61
 Identities = 130/234 (55%), Positives = 157/234 (67%)

Query:    14 KVPAGDSPYVRAKRAQLVEKDPSRAISLFWAAINAGDRVDSALKDMAVVMKQLDRSEEAI 73
             +V  GDSPYVRAK AQLV KDP+RAISLFWAAINAGDRVDSALKDM VV+KQL+R +E I
Sbjct:    49 RVRTGDSPYVRAKHAQLVSKDPNRAISLFWAAINAGDRVDSALKDMVVVLKQLNRFDEGI 108

Query:    74 EAIKSFRCLCADDSQESLDNVLLELYXXXXXXXXXXXXXXXXXXXXXXVIACGGKSTKIA 133
             EAIKSFR LC  +SQ+S+DN+LLELY                          GG+  KIA
Sbjct:   109 EAIKSFRYLCPFESQDSIDNLLLELYMKSGRITEVAELLEHKLRTLEQDKHYGGR-IKIA 167

Query:   134 -RSQGRKTQITLVQELSRISGNLAWAYLQQNDYESAERYYMKALSLESDKNKQCNLAICL 192
              RS   +   T+ QE +RI GNLAW +LQ ++Y  AE+YY  ALSLE D NK CNLAICL
Sbjct:   168 KRSHEEQNNKTIEQEKARILGNLAWVHLQLHNYGIAEQYYRNALSLEPDNNKLCNLAICL 227

Query:   193 IRLNRIAEAKSLLQAVRASSRNEKMDESYAKSFEHASLMLTELESQSML-QPTD 245
             IR+ R  EAKSLL+ V+ S  N+  +E + KSFE A+ ML E E  ++  +P D
Sbjct:   228 IRMERTHEAKSLLEDVKQSLGNQWKNEPFCKSFERATEMLAEREQATVADKPED 281

 Score = 57 (25.1 bits), Expect = 4.2e-61, Sum P(2) = 4.2e-61
 Identities = 21/63 (33%), Positives = 26/63 (41%)

Query:   295 VTSTKTEARAVLK-KAYALPAGVITNSASPITQPRRPSWTFDNKDQRNQQGKDDATDSPH 353
             +  T TE   + K  ++A    V  NS    TQPR   W     D+   Q K DAT    
Sbjct:   305 LAGTSTELGNIHKTNSHASSESVEQNSPGLTTQPRECKWV----DEEVDQSKWDATIGAS 360

Query:   354 WKL 356
              KL
Sbjct:   361 RKL 363


>TAIR|locus:2133099 [details] [associations]
            symbol:MS5 "MALE-STERILE 5" species:3702 "Arabidopsis
            thaliana" [GO:0005634 "nucleus" evidence=ISM;ISS] [GO:0009556
            "microsporogenesis" evidence=IMP] InterPro:IPR011990 GO:GO:0005634
            EMBL:CP002687 EMBL:AL080282 Gene3D:1.25.40.10 GO:GO:0009556
            EMBL:AL161553 HOGENOM:HOG000152531 ProtClustDB:CLSN2685326
            IPI:IPI00537393 PIR:T10632 RefSeq:NP_193822.1 UniGene:At.54444
            ProteinModelPortal:Q9SUC3 SMR:Q9SUC3 EnsemblPlants:AT4G20900.1
            GeneID:827838 KEGG:ath:AT4G20900 TAIR:At4g20900 InParanoid:Q9SUC3
            OMA:MKENIAP PhylomeDB:Q9SUC3 ArrayExpress:Q9SUC3
            Genevestigator:Q9SUC3 Uniprot:Q9SUC3
        Length = 450

 Score = 587 (211.7 bits), Expect = 2.9e-60, Sum P(2) = 2.9e-60
 Identities = 129/277 (46%), Positives = 175/277 (63%)

Query:     9 FHVIHKVPAGDSPYVRAKRAQLVEKDPSRAISLFWAAINAGDRVDSALKDMAVVMKQLDR 68
             FH++HKVP+GDSPYVRAK AQL++KDP+RAISLFW AINAGDRVDSALKDMAVVMKQL R
Sbjct:    51 FHIVHKVPSGDSPYVRAKHAQLIDKDPNRAISLFWTAINAGDRVDSALKDMAVVMKQLGR 110

Query:    69 SEEAIEAIKSFRCLCADDSQESLDNVLLELYXXXXXXXXXXXXXXXXXXXXXXVIACGGK 128
             S+E IEAIKSFR LC+ +SQ+S+DN+LLELY                       +  GG+
Sbjct:   111 SDEGIEAIKSFRYLCSFESQDSIDNLLLELYKKSGRIEEEAVLLEHKLQTLEQGMGFGGR 170

Query:   129 STKIARSQGRKTQITLVQELSRISGNLAWAYLQQNDYESAERYY---------------- 172
              ++  R QG+   +T+ QE +RI GNL W +LQ ++Y  AE++Y                
Sbjct:   171 VSRAKRVQGKHVIMTIEQEKARILGNLGWVHLQLHNYGIAEQHYRFGFVTKIPNIDYCLV 230

Query:   173 MKALSLESDKNKQCNLAICLIRLNRIAEAKSLLQAVRASSRNEKMDESYAKSFEHASLML 232
             M+AL LE DKNK CNLAICL+R++RI EAKSLL  VR S    +  +   + F  +    
Sbjct:   231 MRALGLERDKNKLCNLAICLMRMSRIPEAKSLLDDVRDSPAESECGD---EPFAKSYDRA 287

Query:   233 TELESQSMLQPTDYGEDKRKKILSSCTYINGSEENVS 269
              E+ ++  ++      D  +K  + C+++N  +EN++
Sbjct:   288 VEMLAE--IESKKPEADLSEKFYAGCSFVNRMKENIA 322

 Score = 48 (22.0 bits), Expect = 2.9e-60, Sum P(2) = 2.9e-60
 Identities = 11/17 (64%), Positives = 11/17 (64%)

Query:   313 PAGVITNSASPITQPRR 329
             PA V  NSA   TQPRR
Sbjct:   337 PASVRPNSAGLYTQPRR 353


>TAIR|locus:2080858 [details] [associations]
            symbol:AT3G51280 "AT3G51280" species:3702 "Arabidopsis
            thaliana" [GO:0005575 "cellular_component" evidence=ND] [GO:0042802
            "identical protein binding" evidence=IEA] [GO:0006260 "DNA
            replication" evidence=RCA] [GO:0006270 "DNA replication initiation"
            evidence=RCA] [GO:0006275 "regulation of DNA replication"
            evidence=RCA] [GO:0006306 "DNA methylation" evidence=RCA]
            [GO:0008283 "cell proliferation" evidence=RCA] [GO:0051567 "histone
            H3-K9 methylation" evidence=RCA] [GO:0051726 "regulation of cell
            cycle" evidence=RCA] InterPro:IPR011990 InterPro:IPR013026
            InterPro:IPR019734 PROSITE:PS50005 PROSITE:PS50293 EMBL:CP002686
            GenomeReviews:BA000014_GR Gene3D:1.25.40.10 EMBL:AL132980
            EMBL:BT006438 EMBL:AK227990 IPI:IPI00535129 PIR:T45759
            RefSeq:NP_190696.1 UniGene:At.10874 ProteinModelPortal:Q9SD20
            SMR:Q9SD20 IntAct:Q9SD20 STRING:Q9SD20 PRIDE:Q9SD20
            EnsemblPlants:AT3G51280.1 GeneID:824291 KEGG:ath:AT3G51280
            TAIR:At3g51280 eggNOG:NOG299995 HOGENOM:HOG000243274
            InParanoid:Q9SD20 OMA:SIAPDNN PhylomeDB:Q9SD20
            ProtClustDB:CLSN2684599 Genevestigator:Q9SD20 Uniprot:Q9SD20
        Length = 430

 Score = 611 (220.1 bits), Expect = 1.3e-59, P = 1.3e-59
 Identities = 128/247 (51%), Positives = 164/247 (66%)

Query:     7 ELFHVIHKVPAGDSPYVRAKRAQLVEKDPSRAISLFWAAINAGDRVDSALKDMAVVMKQL 66
             E FH IHKVP GDSPYVRAK  QLVEKDP RAI LFW AINAGDRVDSALKDMA+VMKQ 
Sbjct:    28 ESFHAIHKVPVGDSPYVRAKNVQLVEKDPERAIPLFWKAINAGDRVDSALKDMAIVMKQQ 87

Query:    67 DRSEEAIEAIKSFRCLCADDSQESLDNVLLELYXXXXXXXXXXXXXXXXXXXXXXVIACG 126
             +R+EEAIEAIKS R  C+D +QESLDN+LL+LY                       +A  
Sbjct:    88 NRAEEAIEAIKSLRVRCSDQAQESLDNILLDLYKRCGRLDDQIGLLKHKLFLIQKGLAFN 147

Query:   127 GKSTKIARSQGRKTQITLVQELSRISGNLAWAYLQQNDYESAERYYMKALSLESDKNKQC 186
             GK TK ARSQG+K Q+++ QE +R+ GNL WA +Q++++  AE  Y +ALS+  D NK C
Sbjct:   148 GKRTKTARSQGKKFQVSVEQEATRLLGNLGWALMQRDNFVEAEDAYRRALSIAPDNNKMC 207

Query:   187 NLAICLIRLNRIAEAKSLLQAVR-ASSRNEKMDESYAKSFEHASLMLTELESQSMLQPTD 245
             NL ICL++  RI EAK  L+ V+ A     +  +S+ K++E A  ML +L S+ M +  D
Sbjct:   208 NLGICLMKQGRIDEAKETLRRVKPAVVDGPRGVDSHLKAYERAQQMLNDLGSEMMRRGGD 267

Query:   246 YGEDKRK 252
                ++R+
Sbjct:   268 DKVEQRR 274


>TAIR|locus:2156574 [details] [associations]
            symbol:ATSDI1 "SULPHUR DEFICIENCY-INDUCED 1" species:3702
            "Arabidopsis thaliana" [GO:0005634 "nucleus" evidence=ISM]
            [GO:0006792 "regulation of sulfur utilization" evidence=IMP]
            [GO:0010438 "cellular response to sulfur starvation" evidence=IEP]
            InterPro:IPR001440 InterPro:IPR011990 InterPro:IPR013026
            InterPro:IPR019734 Pfam:PF00515 PROSITE:PS50005 PROSITE:PS50293
            EMBL:CP002688 Gene3D:1.25.40.10 GO:GO:0010438 EMBL:BT005297
            EMBL:AK118038 IPI:IPI00545589 RefSeq:NP_199696.2 UniGene:At.29820
            ProteinModelPortal:Q8GXU5 SMR:Q8GXU5 PRIDE:Q8GXU5
            EnsemblPlants:AT5G48850.1 GeneID:834943 KEGG:ath:AT5G48850
            eggNOG:NOG289549 OMA:AFNGKAT ProtClustDB:CLSN2690694
            Genevestigator:Q8GXU5 GO:GO:0006792 Uniprot:Q8GXU5
        Length = 306

 Score = 608 (219.1 bits), Expect = 2.7e-59, P = 2.7e-59
 Identities = 140/267 (52%), Positives = 170/267 (63%)

Query:     4 KDEELFHVIHKVPAGDSPYVRAKRAQLVEKDPSRAISLFWAAINAGDRVDSALKDMAVVM 63
             KD+ELFHVIHKVP GD+PYVRAK AQL+EK+P  AI  FW AIN GDRVDSALKDMAVVM
Sbjct:    22 KDDELFHVIHKVPCGDTPYVRAKHAQLIEKNPEMAIVWFWKAINTGDRVDSALKDMAVVM 81

Query:    64 KQLDRSEEAIEAIKSFRCLCADDSQESLDNVLLELYXXXXXXXXXXXXXXXXXXXXXXVI 123
             KQLDRSEEAIEAIKSFR  C+ +SQ+SLDNVL++LY                        
Sbjct:    82 KQLDRSEEAIEAIKSFRPRCSKNSQDSLDNVLIDLYKKCGRMEEQVELLKRKLRQIYQGE 141

Query:   124 ACGGKSTKIARSQGRKTQITLVQELSRISGNLAWAYLQQNDYESAERYYMKALSLESDKN 183
             A  GK TK ARS G+K Q+T+ QE+SR+ GNL WAY+QQ  Y SAE  Y KA  +E D N
Sbjct:   142 AFNGKPTKTARSHGKKFQVTVQQEISRLLGNLGWAYMQQAKYLSAEAVYRKAQMVEPDAN 201

Query:   184 KQCNLAICLIRLNRIAEAKSLLQAVRASSRNEKMDESYAKSFEHASLMLTELESQSMLQP 243
             K CNLA+CLI+  R  E + +L  V    R    D+   ++ + A  +L+ELES S+ + 
Sbjct:   202 KSCNLAMCLIKQGRFEEGRLVLDDV-LEYRVLGADD--CRTRQRAEELLSELES-SLPRM 257

Query:   244 TDYG-EDKRKKILSSCTYINGSEENVS 269
              D   ED    IL    ++ G EE  S
Sbjct:   258 RDAEMEDVLGNILDD-DFVLGLEEMTS 283


>TAIR|locus:2010612 [details] [associations]
            symbol:AT1G04770 "AT1G04770" species:3702 "Arabidopsis
            thaliana" [GO:0009658 "chloroplast organization" evidence=IMP]
            InterPro:IPR001440 InterPro:IPR011990 InterPro:IPR013026
            InterPro:IPR019734 Pfam:PF00515 PROSITE:PS50005 PROSITE:PS50293
            EMBL:CP002684 GO:GO:0009658 Gene3D:1.25.40.10
            ProtClustDB:CLSN2690694 EMBL:AY139988 EMBL:BT008709 IPI:IPI00540075
            RefSeq:NP_171969.2 UniGene:At.42432 UniGene:At.74161
            ProteinModelPortal:Q8L730 SMR:Q8L730 PRIDE:Q8L730
            EnsemblPlants:AT1G04770.1 GeneID:839418 KEGG:ath:AT1G04770
            OMA:SSAAAYN ArrayExpress:Q8L730 Genevestigator:Q8L730
            Uniprot:Q8L730
        Length = 303

 Score = 518 (187.4 bits), Expect = 9.5e-50, P = 9.5e-50
 Identities = 119/271 (43%), Positives = 154/271 (56%)

Query:     9 FHVIHKVPAGDSPYVRAKRAQLVEKDPSRAISLFWAAINAGDRVDSALKDMAVVMKQLDR 68
             ++V+HK+P GDSPYVRAK  QLVEKD   AI LFW AI A DRVDSALKDMA++MKQ +R
Sbjct:    20 YNVVHKLPHGDSPYVRAKHVQLVEKDAEAAIELFWIAIKARDRVDSALKDMALLMKQQNR 79

Query:    69 SEEAIEAIKSFRCLCADDSQESLDNVLLELYXXXXXXXXXXXXXXXXXXXXXXVIACGGK 128
             +EEAI+AI+SFR LC+  +QESLDNVL++LY                        A  GK
Sbjct:    80 AEEAIDAIQSFRDLCSRQAQESLDNVLIDLYKKCGRIEEQVELLKQKLWMIYQGEAFNGK 139

Query:   129 STKIARSQGRKTQITLVQELSRISGNLAWAYLQQNDYESAERYYMKALSLESDKNKQCNL 188
              TK ARS G+K Q+T+ +E SRI GNL WAY+Q  DY +AE  Y KA  +E D NK CNL
Sbjct:   140 PTKTARSHGKKFQVTVEKETSRILGNLGWAYMQLMDYTAAEAVYRKAQLIEPDANKACNL 199

Query:   189 AICLIRLNRIAEAKSLLQAVRASSRNEKMDESYAKSFEHASLMLTELESQSMLQPTDYGE 248
               CLI+  +  EA+S+L   R      K      +       +L+EL+ Q          
Sbjct:   200 CTCLIKQGKHDEARSIL--FRDVLMENKEGSGDPRLMARVQELLSELKPQEEEAAASVSV 257

Query:   249 DKRKKILSSCTYINGSEENVSRFMVPRKCRK 279
             +    I      + G +E V  +  P + R+
Sbjct:   258 ECEVGI-DEIAVVEGLDEFVKEWRRPYRTRR 287


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.315   0.127   0.370    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      479       457   0.00094  118 3  11 23  0.42    34
                                                     35  0.45    37


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  5
  No. of states in DFA:  620 (66 KB)
  Total size of DFA:  289 KB (2150 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  39.63u 0.17s 39.80t   Elapsed:  00:00:02
  Total cpu time:  39.64u 0.17s 39.81t   Elapsed:  00:00:02
  Start:  Sat May 11 09:37:45 2013   End:  Sat May 11 09:37:47 2013

Back to top