BLAST Results

Query Summary

Your job contains 1 sequence.

Parameters
Threshold: 0.001
Maximum number of alignments shown: 100
BLAST filter: on

Query Sequence

>028143
MRKDQDMDGLRSEEENVIGMFGSDEDVGTQIPTQAQSVVEGSGAVMVSEFKPVPDVDYLQ
ELLAIQQQGPRAIGFFGTRNMGFMHQELIEILSYALVITKNHIYTSGASGTNAAVIRGAL
RAERPDLLTVILPQSLKKQPPESQELLAKVKTVIEKPHNDHLPLIEASRLCNMDIISHVQ
QVICFAFHDSRLLMETCQEAKNLRKIVTLFYLD

High Scoring Gene Products

Symbol, full name Information P value
AT2G43945 protein from Arabidopsis thaliana 4.0e-90
AT3G59870 protein from Arabidopsis thaliana 1.2e-88

Back to top

Raw Blast Data

BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]

Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.

Reference:  Gish, W. (1996-2006) http://blast.wustl.edu

Query=  028143
        (213 letters)

Database:  go_20130330-seqdb.fasta
           368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done

                                                                     Smallest
                                                                       Sum
                                                              High  Probability
Sequences producing High-scoring Segment Pairs:              Score  P(N)      N

TAIR|locus:1005716651 - symbol:AT2G43945 species:3702 "Ar...   899  4.0e-90   1
TAIR|locus:2080452 - symbol:AT3G59870 "AT3G59870" species...   885  1.2e-88   1


>TAIR|locus:1005716651 [details] [associations]
            symbol:AT2G43945 species:3702 "Arabidopsis thaliana"
            [GO:0009507 "chloroplast" evidence=ISM;IDA] [GO:0009570
            "chloroplast stroma" evidence=IDA] [GO:0010207 "photosystem II
            assembly" evidence=RCA] GO:GO:0009570 EMBL:CP002685 IPI:IPI00520843
            RefSeq:NP_850404.1 UniGene:At.46732 ProteinModelPortal:F4IT21
            PRIDE:F4IT21 EnsemblPlants:AT2G43945.1 GeneID:818999
            KEGG:ath:AT2G43945 OMA:QQQGPRS Uniprot:F4IT21
        Length = 289

 Score = 899 (321.5 bits), Expect = 4.0e-90, P = 4.0e-90
 Identities = 173/213 (81%), Positives = 195/213 (91%)

Query:     1 MRKDQDMDGLRSEEENVIGMFGSDEDVGTQIPTQAQSVVEGSGAVMVSEFKPVPDVDYLQ 60
             MR  ++ + L  +++++  +  SDED G +IPTQAQ++VEGSG+V VSE KP  DVDY+Q
Sbjct:    77 MRNQENTEILTDKDDHIECVLESDEDSGLRIPTQAQAIVEGSGSVAVSELKPAADVDYIQ 136

Query:    61 ELLAIQQQGPRAIGFFGTRNMGFMHQELIEILSYALVITKNHIYTSGASGTNAAVIRGAL 120
             ELLAIQQQGPR+IGFFGTRNMGFMHQELIEILSYA+VITKNHIYTSGASGTNAAVIRGAL
Sbjct:   137 ELLAIQQQGPRSIGFFGTRNMGFMHQELIEILSYAMVITKNHIYTSGASGTNAAVIRGAL 196

Query:   121 RAERPDLLTVILPQSLKKQPPESQELLAKVKTVIEKPHNDHLPLIEASRLCNMDIISHVQ 180
             RAERP+LLTVILPQSLKKQPPESQELL+KV+ V+EKPHNDHLPL+EASRLCNMDIIS VQ
Sbjct:   197 RAERPELLTVILPQSLKKQPPESQELLSKVQNVVEKPHNDHLPLLEASRLCNMDIISQVQ 256

Query:   181 QVICFAFHDSRLLMETCQEAKNLRKIVTLFYLD 213
             QVICFAFHDS+LLMETCQEAKNLRKIVTLFYLD
Sbjct:   257 QVICFAFHDSKLLMETCQEAKNLRKIVTLFYLD 289


>TAIR|locus:2080452 [details] [associations]
            symbol:AT3G59870 "AT3G59870" species:3702 "Arabidopsis
            thaliana" [GO:0003674 "molecular_function" evidence=ND] [GO:0008150
            "biological_process" evidence=ND] [GO:0009507 "chloroplast"
            evidence=ISM;IDA] [GO:0009570 "chloroplast stroma" evidence=IDA]
            [GO:0010207 "photosystem II assembly" evidence=RCA] GO:GO:0009570
            EMBL:CP002686 EMBL:AL138647 IPI:IPI00531661 PIR:T47811
            RefSeq:NP_191546.1 UniGene:At.28158 ProteinModelPortal:Q9M1Y7
            STRING:Q9M1Y7 PRIDE:Q9M1Y7 EnsemblPlants:AT3G59870.1 GeneID:825157
            KEGG:ath:AT3G59870 TAIR:At3g59870 InParanoid:Q9M1Y7 OMA:QGPRTIG
            PhylomeDB:Q9M1Y7 ProtClustDB:CLSN2685014 Genevestigator:Q9M1Y7
            Uniprot:Q9M1Y7
        Length = 288

 Score = 885 (316.6 bits), Expect = 1.2e-88, P = 1.2e-88
 Identities = 173/215 (80%), Positives = 195/215 (90%)

Query:     3 KDQD-MDGLRSEEENVIGMFGSDEDV---GTQIPTQAQSVVEGSGAVMVSEFKPVPDVDY 58
             +DQD +D   ++E+NV   F SDED    G +IPTQAQ++VEG G++ VSE + VPDVDY
Sbjct:    74 RDQDDVDSWVNKEDNVTCRFDSDEDTTTTGLRIPTQAQAIVEGPGSLAVSELQRVPDVDY 133

Query:    59 LQELLAIQQQGPRAIGFFGTRNMGFMHQELIEILSYALVITKNHIYTSGASGTNAAVIRG 118
             +QELLAIQQQGPR IGFFGTRNMGFMHQELI+ILSYA+VITKNHIYTSGA+GTNAAVIRG
Sbjct:   134 IQELLAIQQQGPRTIGFFGTRNMGFMHQELIQILSYAMVITKNHIYTSGAAGTNAAVIRG 193

Query:   119 ALRAERPDLLTVILPQSLKKQPPESQELLAKVKTVIEKPHNDHLPLIEASRLCNMDIISH 178
             ALRAERP+LLTVILPQSLKKQPPESQELL+KV+ VIEKPHNDHLPL+EASRLCNMDIIS 
Sbjct:   194 ALRAERPELLTVILPQSLKKQPPESQELLSKVQNVIEKPHNDHLPLLEASRLCNMDIISQ 253

Query:   179 VQQVICFAFHDSRLLMETCQEAKNLRKIVTLFYLD 213
             VQQ+ICFAFHDS+LLMETCQEA+NLRKIVTLFYLD
Sbjct:   254 VQQIICFAFHDSKLLMETCQEARNLRKIVTLFYLD 288


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.320   0.136   0.383    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      213       213   0.00084  112 3  11 22  0.36    33
                                                     31  0.49    35


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  2
  No. of states in DFA:  571 (61 KB)
  Total size of DFA:  154 KB (2093 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  18.67u 0.16s 18.83t   Elapsed:  00:00:01
  Total cpu time:  18.67u 0.16s 18.83t   Elapsed:  00:00:01
  Start:  Fri May 10 04:37:43 2013   End:  Fri May 10 04:37:44 2013

Back to top