BLAST Results

Query Summary

Your job contains 1 sequence.

Parameters
E-value threshold: 0.001
Maximum number of alignments shown: 100
BLAST filter: on

Query Sequence

>002008
MHPFLMRSYLWWCILLGYLYVSTLSFSSGQYLDRAIQSGNWLHDLGSDLKDDFKSTTLNF
VEISILPSQLNDSVSCGDLEGVGSLNTTCLLNSNLYLNYDLYIYGTGNLEILPKISIVCP
VEGCKITFNMSGNINMGQYAAIVAGSVVVSAANLTMDLNSSINTTSLGGLPPSPTSGTPV
GYDGAGGGHGGRGASCHKNNKTSFWGGDVYAWSTLSEPWSYGSKGGGTSAEYQYGGNGGG
RIKLLVKDMLYLNGSVTAEGGDGGLKGGGGSGGSIYVLAVKLKGYGFISAAGGRGWGGGG
GGRVSLDCYSIQEDIKVTVHGGFSIGCPENAGAAGTNFNAYLRSLRVSNDNVTTETETPL
LDFPTRPIWSNVFVENNAKVLVPLLWTRVQVRGQISLYRGGSIIFGLSEYPVSEFELVAE
ELLMSDSVIKVFGAFRVAIKMLLMWNSKILIDGGGNTIVTTSVLEVRNLVVLTENSVISS
NANLGLYGQGLLQLTGQGDAIKGQRLSLSLFYNITVGTGSLLQAPLDDDASRNVVTESLC
KRQTCPIDLINPPDDCHVNYTLSFSLQICRVEDIVVSGLIKGSIVHIQRARTIIVDTYGM
IIASELGCSEGMGKGIYSHGAGSGAGHGGRGGSGFFNGRLINGGHKYGNADLPCELGSGA
EGPNESYAPAIGGGMIVMGSIQWPLFRLDIYGSVKADGESVGKKTINGNSSLIGGLGGGS
GGTILLFLQELTLEDNSSVSVVGGSGGPPGGGGGGGGRVHFHWSKIDSGVEYVPVATISG
SINSSGGAADNTGLFGEVGTVTGKKCPKGLYGTFCKECPIGTYKDMEGSDESLCTPCSLE
LLPRRANFIYVRGGVSQPFCPYECISEKYRMPKCYTPLEELMYTFGGPWPFVLLLSCILV
LLALLLSTLRIKLVGSSPSYREHSIERHSRHHFPYLLSLSEVRGTRAEETQSHVHRMYFM
GPNTFREPWHLPYSPPNAIIEIV

High Scoring Gene Products

Symbol, full name    Information                         P value
AT4G32920            protein from Arabidopsis thaliana   2.9e-183

Raw Blast Data

BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]

Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.

Reference:  Gish, W. (1996-2006) http://blast.wustl.edu

Query=  002008
        (983 letters)

Database:  go_20130330-seqdb.fasta
           368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done

                                                                     Smallest
                                                                       Sum
                                                              High  Probability
Sequences producing High-scoring Segment Pairs:              Score  P(N)      N

TAIR|locus:2123762 - symbol:AT4G32920 "AT4G32920" species...  1778  2.9e-183  1


>TAIR|locus:2123762
            symbol:AT4G32920 "AT4G32920" species:3702 "Arabidopsis
            thaliana" [GO:0003674 "molecular_function" evidence=ND] [GO:0008150
            "biological_process" evidence=ND] [GO:0009507 "chloroplast"
            evidence=ISM] [GO:0005773 "vacuole" evidence=IDA] [GO:0006486
            "protein glycosylation" evidence=RCA] GO:GO:0005773 EMBL:CP002687
            IPI:IPI00546159 RefSeq:NP_001190893.1 RefSeq:NP_001190894.1
            RefSeq:NP_567910.1 UniGene:At.26242 UniGene:At.67561
            ProteinModelPortal:F4JV81 PRIDE:F4JV81 EnsemblPlants:AT4G32920.1
            EnsemblPlants:AT4G32920.2 EnsemblPlants:AT4G32920.3 GeneID:829429
            KEGG:ath:AT4G32920 OMA:DENEWWI ArrayExpress:F4JV81 Uniprot:F4JV81
        Length = 1432

 Score = 1778 (630.9 bits), Expect = 2.9e-183, P = 2.9e-183
 Identities = 378/926 (40%), Positives = 501/926 (54%)

Query:    67 PSQLNDSVSC-GDLEGVGSXXXXXXXXXXXXXXXXXXXXGTGNLEILPKISIVCPVEGCK 125
             P   + SVSC  DL GVGS                    G GNL +LP + +VC   GC 
Sbjct:    51 PDDDDSSVSCVDDLGGVGSLDSTCKLVADLNLTRDLNITGKGNLHVLPGVRLVCQFPGCS 110

Query:   126 ITFNMSGNINMGQYXXXXXXXXXXXXXNLTMDLNSSINXXXXXXXXXXXXXXXXVXXXXX 185
             I+ N+SGN ++ +              N    L+S+++                      
Sbjct:   111 ISVNISGNFSLAENSSVIAGTFRLAAENAEFGLSSAVDTTGLAGEPPPDTSGTPEGVEGA 170

Query:   186 XXXXXXXXASCHKNNKTSF----WGGDVYAWSTLSEPWSYGSKGGGTSAEYQYGGNGGGR 241
                     A C  +  T      +GGDVY WS+L +P  YGS+GG TS E  YGG GGG 
Sbjct:   171 GGGYGGRGACCLSDTTTKIPEDVFGGDVYGWSSLEKPEIYGSRGGSTSNEVDYGGGGGGT 230

Query:   242 IKLLVKDMLYLNGSVTAEXXXXXXXXXXXXXXSIYVLAVKLKGYGFISAAXXXXXXXXXX 301
             + + +   + LNGSV A+              SI+V+A K+ G G +SA+          
Sbjct:   231 VAIEILGYISLNGSVLADGASGGVKGGGGSGGSIFVMAHKMAGNGRLSASGGDGYAGGGG 290

Query:   302 XXVSLDCYSIQEDIKVTVHGGFSIGCPENAGAAGTNFNAYLRSLRVSNDNVTTETETPLL 361
               VS+D YS   D K+  +GG S GCPENAGAAGT ++    SL + N N TT T+T LL
Sbjct:   291 GRVSVDIYSRHSDPKIFFNGGRSFGCPENAGAAGTLYDVISESLTIDNHNKTTYTDTLLL 350

Query:   362 DFPTRPIWSNVFVENNAKVLVPLLWTRVQVRGQISLYRGGSIIFGLSEYPVSEFELVAEE 421
             +FP   +++N+++ N AKV VPL W+RVQV+G ISL  GG + FGL  Y  SEFEL AEE
Sbjct:   351 EFPNHRLFTNLYIRNMAKVAVPLRWSRVQVQGLISLSNGGELNFGLPRYASSEFELFAEE 410

Query:   422 LLMSDSVIKVFGAFRVAIKMLLMWNSKILIDGGGNTIVTTSVLEVRNLVVLTENSVISSN 481
             LLMS+S IKV+GA R+ +K+ LM  S++ IDGGG TI+ TS+LE+ NL+VL E+SVI SN
Sbjct:   411 LLMSNSAIKVYGALRMTVKVFLMLKSRMFIDGGGVTILGTSMLEISNLLVLKESSVIQSN 470

Query:   482 ANXXXXXXXXXXXXXXXDAIKGQRLSLSLFYNITVGTGSLLQAPLDDDASRNVVTESLCK 541
              N               D I+ QRL LSLFY+I VG G++L+ PL + ++  +  +  C+
Sbjct:   471 GNLGVHGQGLLNLTGTGDTIEAQRLILSLFYSIQVGAGAVLRGPLQNASTGGLTPKLYCQ 530

Query:   542 RQTCPIDLINPPDDCHVNYTLSFSLQICRVEDIVVSGLIKGSIVHIQRARTIIVDTYGMI 601
             RQ CP++L++PP+DC+VN +L F+LQICRVEDI V GLIKGS++    ART++V + G I
Sbjct:   531 RQDCPVELLHPPEDCNVNSSLPFTLQICRVEDITVEGLIKGSVIQFHLARTVLVRSSGTI 590

Query:   602 IASELGCSEGMGKGIYSXXXXXXXXXXXXXXXXX-XXXXLINGGHKYGNADLPCELGSGA 660
              A  +GC  G+G G +                        I GG  YGNADLPCELGSG+
Sbjct:   591 SADGMGCKGGVGTGRFLRSGIGSGGGHGGKGGSGCYNHTCIEGGESYGNADLPCELGSGS 650

Query:   661 EGPNESYAPAIGGGMIVMGSIQWPLFRLDIYGSVKADGESVGKKTXXXXXXXXXXXXXXX 720
              G  ES     GGG+IV+GS++ PL  L + GS+  DGES  +KT               
Sbjct:   651 -GNEESTDSVAGGGIIVLGSLEHPLSSLSLEGSITTDGESP-RKTLKGLSNSSLGPGGGS 708

Query:   721 XXTILLFLQELTLEDNXXXXXXXXXXXXXXXXXXXXXRVHFHWSKIDSGVEYVPVATXXX 780
               T+LLFL+ L +  +                     R+HFHWS I +G  Y PVA    
Sbjct:   709 GGTVLLFLRTLEIGRSAILSSIGGNGSLKGGGGGSGGRIHFHWSDIPTGDVYHPVAIVKG 768

Query:   781 XXXXXXXAADNTGLFGEVGTVTGKKCPKGLYGTFCKECPIGTYKDMEGSDESLCTPCSLE 840
                            G  GT+TGK CP+GLYG FC+ECP GTYK++ GSD++LC  C   
Sbjct:   769 RVYVRGGMGIIEDNIGGNGTLTGKACPEGLYGLFCEECPSGTYKNVTGSDKALCHLCPAN 828

Query:   841 LLPRRANFIYVRGGVSQPFCPYECISEKYRMPKCYTPLEELMYTFGGPWPFVXXXXXXXX 900
              +P RA ++ VRGGV++  CPY+CIS++Y MP CYT LEEL+YTFGGPW F         
Sbjct:   829 DIPHRAVYVTVRGGVAETPCPYKCISDRYHMPHCYTTLEELIYTFGGPWLFGVLLVVVLL 888

Query:   901 XXXXXXXXXXXXXVGSSPSYXXXXXXXXXXX--XFPYLLSLSEVRGT-RAEETQSHVHRM 957
                          V     +              FP+L SL+EV  T R EE+Q H+HR+
Sbjct:   889 LLALVFSVARMKFVSGDELHGSAPTQHGSQIDHSFPFLESLNEVMETSRVEESQGHMHRI 948

Query:   958 YFMGPNTFREPWHLPYSPPNAIIEIV 983
             YF+GPNTF EPWHL ++PP  I EIV
Sbjct:   949 YFLGPNTFSEPWHLSHTPPEEIKEIV 974


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.319   0.137   0.422    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      983       778   0.00094  121 3  11 22  0.40    34
                                                     37  0.45    37
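
The raw score and the bit score reported for the hit can be related through the Lambda and K values listed in the matrix table above. The following is a minimal sketch, assuming the standard Karlin-Altschul conversion and assuming that the gapped parameters on the Q=9,R=2 line (Lambda=0.244, K=0.0300) are the ones behind the reported 630.9 bits; the function name bit_score is hypothetical and not part of any BLAST tool.

```python
import math


def bit_score(raw_score: float, lam: float, k: float) -> float:
    """Convert a raw alignment score to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


# Values copied from the report: Score = 1778, gapped Lambda = 0.244, K = 0.0300.
bits = bit_score(1778, 0.244, 0.0300)
print(f"{bits:.1f} bits")  # close to the reported 630.9; Lambda and K are rounded in the report
```

Because Lambda and K are printed to only three significant figures, the result agrees with the reported value only approximately.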


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  1
  No. of states in DFA:  625 (66 KB)
  Total size of DFA:  408 KB (2196 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  53.49u 0.18s 53.67t   Elapsed:  00:00:02
  Total cpu time:  53.49u 0.18s 53.67t   Elapsed:  00:00:02
  Start:  Fri May 10 04:09:33 2013   End:  Fri May 10 04:09:35 2013
