BLAST Results

Query Summary

Your job contains 1 sequence.

Parameters
Threshold: 0.001
Maximum number of alignments shown: 100
BLAST filter: on

Query Sequence

>021818
MLIEKLSVRTPTGEVKLKVMKEIAKEFQIDWDTTESEMELLKPAEERIGGPDTFFSASSL
PVKHVPVQSVEQNRPHTRSVVSNRERGTMQFEDTASAAEAAADSAKKAVAAAQAAAYLAS
RDSKQFTQAFGISSKPDASNNHGGFAAFSGSEISLENSTAPGNFHMSQSSYGSHYLSHEE
KRPTDVGSGNFHRRNSYNASSANSDIKFDVCDHDQDNKMEGPPGGKVLRRHSYNAPTAHS
DIQWDESDYDEEIEVEAPSGCTSLPPERTPPPIPPSLGKQGSFHRVHPKLPDYEDLAARF
EALKYRK

High Scoring Gene Products

Symbol, full name   Information                         P value
AT4G35730           protein from Arabidopsis thaliana   3.7e-32
AT1G34220           protein from Arabidopsis thaliana   1.7e-08
AT1G25420           protein from Arabidopsis thaliana   1.5e-06
AT2G19710           protein from Arabidopsis thaliana   1.3e-05
AT4G29440           protein from Arabidopsis thaliana   1.8e-05
AT4G32350           protein from Arabidopsis thaliana   0.00058

Raw Blast Data

BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]

Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.

Reference:  Gish, W. (1996-2006) http://blast.wustl.edu

Query=  021818
        (307 letters)

Database:  go_20130330-seqdb.fasta
           368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done

                                                                     Smallest
                                                                       Sum
                                                              High  Probability
Sequences producing High-scoring Segment Pairs:              Score  P(N)      N

TAIR|locus:2127988 - symbol:AT4G35730 "AT4G35730" species...   352  3.7e-32   1
TAIR|locus:2026150 - symbol:AT1G34220 "AT1G34220" species...   153  1.7e-08   2
TAIR|locus:2031250 - symbol:AT1G25420 species:3702 "Arabi...   134  1.5e-06   1
TAIR|locus:2052035 - symbol:AT2G19710 "AT2G19710" species...   100  1.3e-05   3
TAIR|locus:2118334 - symbol:AT4G29440 "AT4G29440" species...    95  1.8e-05   4
TAIR|locus:2127796 - symbol:AT4G32350 "AT4G32350" species...    83  0.00058   2
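The one-line hit summaries above can be pulled into records for sorting or filtering. The following is a minimal sketch, assuming the layout seen in this report (database tag, accession, `symbol:` field, then score, P(N), and N as the last three columns). The regular expression and the `parse_summary` helper are illustrative names, not part of any BLAST toolkit.

```python
import re

# Summary lines copied from the table above.
SUMMARY = '''\
TAIR|locus:2127988 - symbol:AT4G35730 "AT4G35730" species...   352  3.7e-32   1
TAIR|locus:2026150 - symbol:AT1G34220 "AT1G34220" species...   153  1.7e-08   2
TAIR|locus:2031250 - symbol:AT1G25420 species:3702 "Arabi...   134  1.5e-06   1
TAIR|locus:2052035 - symbol:AT2G19710 "AT2G19710" species...   100  1.3e-05   3
TAIR|locus:2118334 - symbol:AT4G29440 "AT4G29440" species...    95  1.8e-05   4
TAIR|locus:2127796 - symbol:AT4G32350 "AT4G32350" species...    83  0.00058   2
'''

# Assumed layout: DB|accession - symbol:NAME <truncated text>  score  P(N)  N
LINE_RE = re.compile(
    r'^(?P<db>\w+)\|(?P<acc>\S+) - symbol:(?P<symbol>\S+) .*?'
    r'\s+(?P<score>\d+)\s+(?P<p>[\d.eE+-]+)\s+(?P<n>\d+)\s*$'
)


def parse_summary(text):
    """Return one dict per summary line that matches the assumed layout."""
    hits = []
    for line in text.splitlines():
        m = LINE_RE.match(line)
        if m is None:
            continue  # skip blank lines or lines in another layout
        hits.append({
            "db": m.group("db"),
            "accession": m.group("acc"),
            "symbol": m.group("symbol"),
            "score": int(m.group("score")),
            "p": float(m.group("p")),
            "n": int(m.group("n")),
        })
    return hits


hits = parse_summary(SUMMARY)
for hit in hits:
    print(hit["symbol"], hit["score"], hit["p"], hit["n"])
```

Lines that do not fit the pattern are skipped silently, which keeps the sketch tolerant of blank lines but would also hide a layout change; a stricter version could raise instead.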


>TAIR|locus:2127988
            symbol:AT4G35730 "AT4G35730" species:3702 "Arabidopsis
            thaliana" [GO:0008150 "biological_process" evidence=ND] [GO:0005739
            "mitochondrion" evidence=IDA] GO:GO:0005739 EMBL:CP002687
            InterPro:IPR005061 Pfam:PF03398 IPI:IPI00538631 RefSeq:NP_195298.2
            UniGene:At.31388 ProteinModelPortal:F4JNS8 SMR:F4JNS8 PRIDE:F4JNS8
            EnsemblPlants:AT4G35730.1 GeneID:829726 KEGG:ath:AT4G35730
            OMA:MSINTHY Uniprot:F4JNS8
        Length = 466

 Score = 352 (129.0 bits), Expect = 3.7e-32, P = 3.7e-32
 Identities = 109/313 (34%), Positives = 145/313 (46%)

Query:     1 MLIEKLSVRTPTGEVKLKVMKEIAKEFQIDWDTTESEMELLKPAEERIGGPDTFFSASSL 60
             MLI+KLSVR P GE KLK+MKEIAKEFQ+DWDTTE+E ELLKP EE I GP  F SASSL
Sbjct:   167 MLIDKLSVRNPGGEYKLKIMKEIAKEFQVDWDTTETEQELLKPQEESIDGPRKFVSASSL 226

Query:    61 PVKHVPV-QSVEQNRPHTRSVVSNRERGTMQFEDTXXXXXXXXXXXXXXXXXXXXXXXXX 119
             PV    + + ++  +   RS  S+    T   +                           
Sbjct:   227 PVNRAAINEPIDPTKAVPRST-SSMSINTHYHDTESAAEAATELAKQAVAAAQVASLLAT 285

Query:   120 XRDSKQFTQAFGISSKPDASNNHGGFAAFSGSEISLENSTAPGNFHMSQSSYGSHYLSHE 179
              RDS    + F +SS  D S +         S+    +   PG+   S+ S  S Y +  
Sbjct:   286 RRDSSN--KEFSVSS--DHSTHQ------KDSQYMDHHHHHPGSRRQSRDSETSSYYA-- 333

Query:   180 EKRPTDVGSGNFHRRXXXXXXXXXXDIKFDVCDHDQDNKMEGPPGGKVLRRHSYNAPTAH 239
               +P     G   R           D + +  + + + K E           S   P A 
Sbjct:   334 --KPGAENRGMGRRHSYNNPGINESDYEEEYTNTEAEAK-ETMRRRHSYNPRSV-PPPAT 389

Query:   240 SDIQWDESDY-DEEIEVEAPS-GCTSLXXXXXXXXXXXSLGKQ---GSFHRVHPKLPDYE 294
             S+I++DESDY +EE E + PS G  S              G+     S H+VHPKLPDY+
Sbjct:   390 SEIKFDESDYYEEETEPDEPSQGRVSSLPPNRAPPQAPQSGESRQDSSGHQVHPKLPDYD 449

Query:   295 DLAARFEALKYRK 307
              LAARFEA+++ K
Sbjct:   450 ILAARFEAIRHSK 462


>TAIR|locus:2026150
            symbol:AT1G34220 "AT1G34220" species:3702 "Arabidopsis
            thaliana" [GO:0003674 "molecular_function" evidence=ND] [GO:0008150
            "biological_process" evidence=ND] EMBL:CP002684 InterPro:IPR005061
            Pfam:PF03398 UniGene:At.11393 UniGene:At.48266 IPI:IPI00549089
            RefSeq:NP_174684.2 ProteinModelPortal:F4HUX0 SMR:F4HUX0
            PRIDE:F4HUX0 EnsemblPlants:AT1G34220.1 GeneID:840321
            KEGG:ath:AT1G34220 OMA:FDHQNSS Uniprot:F4HUX0
        Length = 649

 Score = 153 (58.9 bits), Expect = 1.7e-08, Sum P(2) = 1.7e-08
 Identities = 29/61 (47%), Positives = 41/61 (67%)

Query:     2 LIEKLSVRTPTGEVKLKVMKEIAKEFQIDWDTTESEMELLKPAEERIGGPDTFFSASSLP 61
             L+E LSVR P+ E KLK++KEIA+E ++DWD   +E +L K  E+ + GP  F   S LP
Sbjct:   184 LVELLSVRAPSPETKLKLLKEIAEEHELDWDPASTETDLFKSHEDLLDGPKQFGGGSKLP 243

Query:    62 V 62
             +
Sbjct:   244 L 244

 Score = 41 (19.5 bits), Expect = 1.7e-08, Sum P(2) = 1.7e-08
 Identities = 13/53 (24%), Positives = 23/53 (43%)

Query:   205 DIKFDVCDHDQ-DNKMEGPPGGKVLRRHSYNAPTAHSDIQWDESDYDEEIEVE 256
             D ++D+ D  +  N +  P  G      S NAP A     ++ + +D   + E
Sbjct:   267 DSEYDILDFPEVPNVLLRPTPGAT----SVNAPDAAKSASYEHTSHDLPFDSE 315


>TAIR|locus:2031250
            symbol:AT1G25420 species:3702 "Arabidopsis thaliana"
            [GO:0005737 "cytoplasm" evidence=ISM] [GO:0008150
            "biological_process" evidence=ND] [GO:0009507 "chloroplast"
            evidence=ISM] [GO:0048573 "photoperiodism, flowering" evidence=RCA]
            EMBL:CP002684 EMBL:AC079281 InterPro:IPR005061 Pfam:PF03398
            EMBL:AY095990 EMBL:AY143866 IPI:IPI00518252 PIR:C86384
            RefSeq:NP_564235.1 UniGene:At.43165 ProteinModelPortal:Q9C6L2
            SMR:Q9C6L2 IntAct:Q9C6L2 PRIDE:Q9C6L2 EnsemblPlants:AT1G25420.1
            GeneID:839128 KEGG:ath:AT1G25420 TAIR:At1g25420 InParanoid:Q9C6L2
            OMA:PRCSEVP PhylomeDB:Q9C6L2 ProtClustDB:CLSN2687956
            Genevestigator:Q9C6L2 Uniprot:Q9C6L2
        Length = 323

 Score = 134 (52.2 bits), Expect = 1.5e-06, P = 1.5e-06
 Identities = 29/91 (31%), Positives = 50/91 (54%)

Query:     2 LIEKLSVRTPTGEVKLKVMKEIAKEFQIDWDTTESEMELLKPAEERIGGPDTFFSASSLP 61
             +IEKLS  +P+G  +LK++KEIA+E+ ++WD++ +E E +K  E+ +GG         + 
Sbjct:   154 IIEKLSPTSPSGAARLKMLKEIAQEYSLNWDSSATEAEFMKSHEDLLGGAKQIHRQDGIS 213

Query:    62 VKHVPVQSVEQNRPHTRSVVSNRERGTMQFE 92
                   Q   Q+   +R V S     T +F+
Sbjct:   214 ESRPSQQGYGQSSV-SREVESLPAEATQRFQ 243


>TAIR|locus:2052035
            symbol:AT2G19710 "AT2G19710" species:3702 "Arabidopsis
            thaliana" [GO:0008150 "biological_process" evidence=ND]
            EMBL:CP002685 InterPro:IPR005061 Pfam:PF03398 UniGene:At.48493
            UniGene:At.71155 IPI:IPI00547062 RefSeq:NP_179561.2
            ProteinModelPortal:F4ITF9 SMR:F4ITF9 PRIDE:F4ITF9
            EnsemblPlants:AT2G19710.1 GeneID:816490 KEGG:ath:AT2G19710
            OMA:ERASHVH Uniprot:F4ITF9
        Length = 937

 Score = 100 (40.3 bits), Expect = 1.3e-05, Sum P(3) = 1.3e-05
 Identities = 25/91 (27%), Positives = 49/91 (53%)

Query:     1 MLIEKLSVRTPTGEVKLKVMKEIAKEFQIDWDTTESEMELLKPAEERIGGPDTFFSASSL 60
             +L+EKLS + P G  K+K++  IA+E  + W+  +S +E      E + G ++F  ASS+
Sbjct:   150 LLVEKLSAKAPDGPTKVKILMAIAEEHNVVWEA-QSFVESDPKDTELLNGANSFQPASSM 208

Query:    61 PVKHVPVQSVEQNRPHTRSVVS-NRERGTMQ 90
              +    + S ++  P+  +  + N   G+ +
Sbjct:   209 NMDS-SINSNKEQPPNIHAPATVNAHHGSSE 238

 Score = 64 (27.6 bits), Expect = 1.3e-05, Sum P(3) = 1.3e-05
 Identities = 11/19 (57%), Positives = 15/19 (78%)

Query:   286 VHPKLPDYEDLAARFEALK 304
             VHP LP+Y+D+ AR  AL+
Sbjct:   914 VHPNLPEYDDIFARLGALR 932

 Score = 52 (23.4 bits), Expect = 1.3e-05, Sum P(3) = 1.3e-05
 Identities = 26/87 (29%), Positives = 39/87 (44%)

Query:   123 SKQFTQAFGISSKPDASNNHGGFAAFSGSEISLEN---------STAP-GNFHMSQ---- 168
             SK    A   S K D S +HG  ++ S S++  EN         ST+P  ++H       
Sbjct:   555 SKTSASAASWSFKGDHSKSHGKHSS-SSSQVFQENPSSRLFDDVSTSPPASYHEPDPHAK 613

Query:   169 -SSYGSHYLSHEEKRPTDVGSGNFHRR 194
               +YG +  S  ++ P D  SG+ H R
Sbjct:   614 FDNYGPNSESDGDQ-PIDKVSGDVHER 639

 Score = 38 (18.4 bits), Expect = 0.00031, Sum P(3) = 0.00031
 Identities = 27/137 (19%), Positives = 51/137 (37%)

Query:   122 DSKQFTQAFGISSKP--DASNNHGGFAAFSGSEISLENSTAPGNFHMSQSSYGSHY-LSH 178
             D+ +   +FG   +P  D ++ + G++     ++ L   ++  + H   S+Y     L  
Sbjct:   420 DNSRNNGSFGREKQPSQDETDINVGYS----EDVHLRKQSSRVSSHSHSSNYSDENDLGS 475

Query:   179 E-EKRPTDVGSGNFHRRXXXXXXXXXXDIKFDVCDHDQDNKMEGPPGGKVLRRHSYNAPT 237
             +  K P+ V    F             DI  D  DH  D+               Y  P 
Sbjct:   476 DFMKSPSIVEENIFATEYDHQSQSSFKDI--DSHDHGHDDDAAATDNYDDYSSFFYQ-PK 532

Query:   238 AHSDIQWDESDYDEEIE 254
              H++    ++ Y +EI+
Sbjct:   533 FHAE----DNHYQDEID 545


>TAIR|locus:2118334
            symbol:AT4G29440 "AT4G29440" species:3702 "Arabidopsis
            thaliana" [GO:0003674 "molecular_function" evidence=ND] [GO:0005634
            "nucleus" evidence=ISM] [GO:0008150 "biological_process"
            evidence=ND] EMBL:CP002687 InterPro:IPR005061 Pfam:PF03398
            IPI:IPI00532585 RefSeq:NP_194673.2 UniGene:At.27853 PRIDE:F4JNM2
            EnsemblPlants:AT4G29440.1 GeneID:829065 KEGG:ath:AT4G29440
            OMA:SHYRESP Uniprot:F4JNM2
        Length = 1090

 Score = 95 (38.5 bits), Expect = 1.8e-05, Sum P(4) = 1.8e-05
 Identities = 25/77 (32%), Positives = 42/77 (54%)

Query:     1 MLIEKLSVRTPTGEVKLKVMKEIAKEFQIDWDTTESEMELLKPAEERIGGPDTFFSASSL 60
             +L+EKLSV+ P G  K+K++ EIA +  + W+  ES +E   P E       +  S+ S 
Sbjct:   150 LLVEKLSVKAPDGPTKIKILTEIATQHNVTWEA-ESLVES-DPKETMSASGAS--SSVSQ 205

Query:    61 PVKHVPVQS--VEQNRP 75
             P   +  +S  ++ N+P
Sbjct:   206 PATGIKSESSRIQNNQP 222

 Score = 73 (30.8 bits), Expect = 1.8e-05, Sum P(4) = 1.8e-05
 Identities = 14/26 (53%), Positives = 20/26 (76%)

Query:   279 KQGSFHRVHPKLPDYEDLAARFEALK 304
             K+ + H VHPKLPDY+D+ A+  AL+
Sbjct:  1065 KEKASH-VHPKLPDYDDIFAKLGALR 1089

 Score = 44 (20.5 bits), Expect = 1.8e-05, Sum P(4) = 1.8e-05
 Identities = 11/28 (39%), Positives = 15/28 (53%)

Query:   233 YNAPTAHSDIQWDESDYDEEIEVEAPSG 260
             Y  P+   D Q D+S   EE + E P+G
Sbjct:   650 YFFPSDTED-QGDDSKTQEESDAETPTG 676

 Score = 41 (19.5 bits), Expect = 1.8e-05, Sum P(4) = 1.8e-05
 Identities = 7/20 (35%), Positives = 13/20 (65%)

Query:    68 QSVEQNRPHTRSVVSNRERG 87
             +++  N PH+R+  SN + G
Sbjct:   344 ENLRSNPPHSRTSSSNMQGG 363

 Score = 38 (18.4 bits), Expect = 6.6e-05, Sum P(4) = 6.6e-05
 Identities = 9/25 (36%), Positives = 13/25 (52%)

Query:   234 NAPTAHSDIQWDESDYDEEIEVEAP 258
             N+    S I  D S  D+E ++E P
Sbjct:   744 NSDKRPSSIPPDSSSSDDESDMELP 768


>TAIR|locus:2127796
            symbol:AT4G32350 "AT4G32350" species:3702 "Arabidopsis
            thaliana" [GO:0003674 "molecular_function" evidence=ND] [GO:0005634
            "nucleus" evidence=ISM] [GO:0008150 "biological_process"
            evidence=ND] EMBL:CP002687 InterPro:IPR005061 Pfam:PF03398
            EMBL:AY080684 EMBL:AY122956 IPI:IPI00527589 RefSeq:NP_194961.2
            UniGene:At.31672 ProteinModelPortal:Q8RXT2 SMR:Q8RXT2 PRIDE:Q8RXT2
            EnsemblPlants:AT4G32350.1 GeneID:829369 KEGG:ath:AT4G32350
            TAIR:At4g32350 HOGENOM:HOG000149127 InParanoid:Q8RXT2 OMA:DTIVMKV
            PhylomeDB:Q8RXT2 ProtClustDB:CLSN2692463 Genevestigator:Q8RXT2
            Uniprot:Q8RXT2
        Length = 732

 Score = 83 (34.3 bits), Expect = 0.00058, Sum P(2) = 0.00058
 Identities = 16/21 (76%), Positives = 18/21 (85%)

Query:   286 VHPKLPDYEDLAARFEALKYR 306
             VHPKLP+Y+DLAARF  LK R
Sbjct:   712 VHPKLPNYDDLAARFAELKGR 732

 Score = 78 (32.5 bits), Expect = 0.00058, Sum P(2) = 0.00058
 Identities = 14/41 (34%), Positives = 28/41 (68%)

Query:     2 LIEKLSVRTPTGEVKLKVMKEIAKEFQIDWDTTESEMELLK 42
             L+E +S +  + E K+K+M+++A EF I WD+ + E  +++
Sbjct:   142 LVENMSSKPFSMEKKVKLMEDVALEFSIRWDSKDFEKRIVR 182


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.312   0.130   0.379    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      307       260   0.00089  114 3  11 23  0.40    34
                                                     32  0.49    36
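
The bit scores printed in parentheses in the alignments appear to follow the standard Karlin-Altschul conversion, bits = (Lambda * S - ln K) / ln 2, when the gapped "As Used" values from the Q=9,R=2 row are plugged in (Lambda = 0.244, K = 0.0300). This is an observation checked against the numbers in this report, not a statement taken from the program's documentation. A minimal sketch, with `bit_score` as an illustrative helper name:

```python
import math

# Gapped "As Used" parameters from the Q=9,R=2 row of the table above.
# Using this row (rather than the ungapped BLOSUM62 row) is an assumption
# that happens to reproduce the bit scores printed in this report.
LAMBDA_GAPPED = 0.244
K_GAPPED = 0.0300


def bit_score(raw_score, lam=LAMBDA_GAPPED, k=K_GAPPED):
    """Convert a raw alignment score to bits: (lam * S - ln k) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


# Raw scores and the bit scores reported alongside them in this report.
reported = {352: 129.0, 153: 58.9, 134: 52.2, 83: 34.3}

for raw, bits in reported.items():
    print(raw, round(bit_score(raw), 1), bits)
```

With the ungapped BLOSUM62 values (Lambda = 0.312, K = 0.130) the same formula gives roughly 161 bits for the top hit, which does not match the reported 129.0; that is why the gapped row is used here.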


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  6
  No. of states in DFA:  607 (65 KB)
  Total size of DFA:  204 KB (2114 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  21.16u 0.14s 21.30t   Elapsed:  00:00:01
  Total cpu time:  21.16u 0.14s 21.30t   Elapsed:  00:00:01
  Start:  Fri May 10 13:46:56 2013   End:  Fri May 10 13:46:57 2013
