BLAST Results

Query Summary

Your job contains 1 sequence.

Parameters
Threshold: 0.001
Maximum number of alignments shown: 100
BLAST filter: on

Query Sequence

>001005
MATRVVVVEEFQLIYSADMMSQKFLCMVHCVMFDNWNSNAPSLLAMLVTYVLIRGNSFAC
PDNAGGAGTLYDAVPRTLTVSNYNMSTDTETLLLEFPNQPLWTNVYVQNCARATVPLLWS
RVQVQGQISLSCGGVLSFGLAHYATSEFELLAEELLMSDSVIKVYGALRMTVKIFLMWNS
EMLVDGGGDATVATSLLEASNLIVLKEFSIIHSNANLEVHGQGLLNLSGPGDRIEAQRLV
LALFYSIHVGPGSVLRSPLENATTDAVTPRLYCEIQDCPVELLHPPEDCNVNSSLSFTLQ
ICRVEDIVVDGLVEGSVVHFHRARTISVQSSGAISASGMGCTGGVGRGKVIGNGVGSGGG
HGGKGGLGCFNDSCVEGGISYGNANLPCELGSGSGNDTSGNSTAGGGIIVMGSFEHPLSS
LSVEGSVKADGQSFEDLSTKKNYVVRNGSIGGAGGGSGGTILLFLHTLDIGDSAVLSSVG
GYGSHMGGGGGGGGRIHFHWSDIPTGDVYQPIASVRGSIRIGGGLGGHELGGGENGTTTG
KACPKGLYGIFCEECPVGTYKNVTGSDKSLCHQCPPQEFPHRAVYISVRGGIAETPCPYR
CISERYHMPHCYTALEELIYTFGGPWLFCLLLVGLLILLALVLSVARMKFVGVDELPGPA
PTQHGSQIDHSFPFLESLNEVLETNRAEESHSHVHRMYFMGPNTFSQPWHLPHTPPEQIK
EIVYEGAFNSFVDEINAIATYHWWEGAIYSILAILAYPLAWSWQQWRRRMKLQRLREYVR
SEYDHACLRSCRSRALYEGLKVAATPDLMLAYLDFFLGGDEKRTDLPPCLHHRFPMSLIF
GGDGSYMAPFSLQNDNILTSLMSQLVPPTICYRLVAGLNAQLRLVRRGRLRATFRPVLRW
LETHANPTLQLHGLRVDLAWFQATACGYCQYGLLVYAVGGENEPTSIGSFDRGRLIERES
RVKSIDMENPSGRLREETLLTRAQRSSESYMKRKRSHGGIIDTNNVQMLEERRDIFYFLS
FIVHNTKPVGHQDLVGLVISVLLLGDFSLVLLTLLQLYSISLVDVFLVLFILPLGILLPF
PAGINALFSHGPRRSVGLARVYALWNVTSLINVGVAFLCGYVHYSSGSSPNKKVPNFQPW
NFSMDESEWWIFPAGLVLCKIFQSQLVNWHVANLEIQDRTLYSNDFELFWQS

High Scoring Gene Products

Symbol, full name Information P value
AT4G32920 protein from Arabidopsis thaliana 0.

Back to top

Raw Blast Data

BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]

Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.

Reference:  Gish, W. (1996-2006) http://blast.wustl.edu

Query=  001005
        (1192 letters)

Database:  go_20130330-seqdb.fasta
           368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done

                                                                     Smallest
                                                                       Sum
                                                              High  Probability
Sequences producing High-scoring Segment Pairs:              Score  P(N)      N

TAIR|locus:2123762 - symbol:AT4G32920 "AT4G32920" species...  3107  0.        3


>TAIR|locus:2123762 [details] [associations]
            symbol:AT4G32920 "AT4G32920" species:3702 "Arabidopsis
            thaliana" [GO:0003674 "molecular_function" evidence=ND] [GO:0008150
            "biological_process" evidence=ND] [GO:0009507 "chloroplast"
            evidence=ISM] [GO:0005773 "vacuole" evidence=IDA] [GO:0006486
            "protein glycosylation" evidence=RCA] GO:GO:0005773 EMBL:CP002687
            IPI:IPI00546159 RefSeq:NP_001190893.1 RefSeq:NP_001190894.1
            RefSeq:NP_567910.1 UniGene:At.26242 UniGene:At.67561
            ProteinModelPortal:F4JV81 PRIDE:F4JV81 EnsemblPlants:AT4G32920.1
            EnsemblPlants:AT4G32920.2 EnsemblPlants:AT4G32920.3 GeneID:829429
            KEGG:ath:AT4G32920 OMA:DENEWWI ArrayExpress:F4JV81 Uniprot:F4JV81
        Length = 1432

 Score = 3107 (1098.8 bits), Expect = 0., Sum P(3) = 0.
 Identities = 592/979 (60%), Positives = 707/979 (72%)

Query:    55 GNSFACPDNAGGAGTLYDAVPRTLTVSNYNMSTDTETLLLEFPNQPLWTNVYVQNCARAT 114
             G SF CP+NAG AGTLYD +  +LT+ N+N +T T+TLLLEFPN  L+TN+Y++N A+  
Sbjct:   311 GRSFGCPENAGAAGTLYDVISESLTIDNHNKTTYTDTLLLEFPNHRLFTNLYIRNMAKVA 370

Query:   115 VPLLWSRVQVQGQISLSCGGVLSFGLAHYATSEFELLAEELLMSDSVIKVYGALRMTVKI 174
             VPL WSRVQVQG ISLS GG L+FGL  YA+SEFEL AEELLMS+S IKVYGALRMTVK+
Sbjct:   371 VPLRWSRVQVQGLISLSNGGELNFGLPRYASSEFELFAEELLMSNSAIKVYGALRMTVKV 430

Query:   175 FLMWNSEMLVDGGGDATVATSLLEASNLIVLKEFSIIHSNANLEVHGQGLLNLSGPGDRI 234
             FLM  S M +DGGG   + TS+LE SNL+VLKE S+I SN NL VHGQGLLNL+G GD I
Sbjct:   431 FLMLKSRMFIDGGGVTILGTSMLEISNLLVLKESSVIQSNGNLGVHGQGLLNLTGTGDTI 490

Query:   235 EAQRLVLALFYSIHVGPGSVLRSPLENATTDAVTPRLYCEIQDCPVELLHPPEDCNVNSS 294
             EAQRL+L+LFYSI VG G+VLR PL+NA+T  +TP+LYC+ QDCPVELLHPPEDCNVNSS
Sbjct:   491 EAQRLILSLFYSIQVGAGAVLRGPLQNASTGGLTPKLYCQRQDCPVELLHPPEDCNVNSS 550

Query:   295 LSFTLQICRVEDIVVDGLVEGSVVHFHRARTISVQSSGAISASXXXXXXXXXXXXXXXXX 354
             L FTLQICRVEDI V+GL++GSV+ FH ART+ V+SSG ISA                  
Sbjct:   551 LPFTLQICRVEDITVEGLIKGSVIQFHLARTVLVRSSGTISADGMGCKGGVGTGRFLRSG 610

Query:   355 XXXXXXXXXXXXXXCFNDSCVEGGISYGNANLPCELXXXXXXXXXXXXXXXXXIIVMGSF 414
                           C+N +C+EGG SYGNA+LPCEL                 IIV+GS 
Sbjct:   611 IGSGGGHGGKGGSGCYNHTCIEGGESYGNADLPCELGSGSGNEESTDSVAGGGIIVLGSL 670

Query:   415 EHPLSSLSVEGSVKADGQSFEDLSTKKNYVVRNXXXXXXXXXXXXTILLFLHTLDIGDSA 474
             EHPLSSLS+EGS+  DG+S      +K     +            T+LLFL TL+IG SA
Sbjct:   671 EHPLSSLSLEGSITTDGES-----PRKTLKGLSNSSLGPGGGSGGTVLLFLRTLEIGRSA 725

Query:   475 VLSSVXXXXXXXXXXXXXXXRIHFHWSDIPTGDVYQPIASVRGSIRIXXXXXXXXXXXXX 534
             +LSS+               RIHFHWSDIPTGDVY P+A V+G + +             
Sbjct:   726 ILSSIGGNGSLKGGGGGSGGRIHFHWSDIPTGDVYHPVAIVKGRVYVRGGMGIIEDNIGG 785

Query:   535 XXXXXXKACPKGLYGIFCEECPVGTYKNVTGSDKSLCHQCPPQEFPHRAVYISVRGGIAE 594
                   KACP+GLYG+FCEECP GTYKNVTGSDK+LCH CP  + PHRAVY++VRGG+AE
Sbjct:   786 NGTLTGKACPEGLYGLFCEECPSGTYKNVTGSDKALCHLCPANDIPHRAVYVTVRGGVAE 845

Query:   595 TPCPYRCISERYHMPHCYTALEELIYTFGGPWXXXXXXXXXXXXXXXXXXXARMKFVGVD 654
             TPCPY+CIS+RYHMPHCYT LEELIYTFGGPW                   ARMKFV  D
Sbjct:   846 TPCPYKCISDRYHMPHCYTTLEELIYTFGGPWLFGVLLVVVLLLLALVFSVARMKFVSGD 905

Query:   655 ELPGPAPTQHGSQIDHSFPFLESLNEVLETNRAEESHSHVHRMYFMGPNTFSQPWHLPHT 714
             EL G APTQHGSQIDHSFPFLESLNEV+ET+R EES  H+HR+YF+GPNTFS+PWHL HT
Sbjct:   906 ELHGSAPTQHGSQIDHSFPFLESLNEVMETSRVEESQGHMHRIYFLGPNTFSEPWHLSHT 965

Query:   715 PPEQIKEIVYEGAFNSFVDEINAIATYHWWEGAIYSILAILAYPLAWSWQQWRRRMKLQR 774
             PPE+IKEIVYE AFN FVDE+N IA Y WWEGAIY +L++L YPLAWSWQQ RRR+K Q+
Sbjct:   966 PPEEIKEIVYEAAFNGFVDEVNVIAAYQWWEGAIYIMLSVLVYPLAWSWQQSRRRLKFQK 1025

Query:   775 LREYVRSEYDHACLRSCRSRALYEGLKVAATPDLMLAYLDFFLGGDEKRTDLPPCLHHRF 834
             LR++VRSEYDH+CLRSCRSRALYEGLKVAATPDLMLA+LDFFLGGDEKR+DLPP +H R 
Sbjct:  1026 LRDFVRSEYDHSCLRSCRSRALYEGLKVAATPDLMLAHLDFFLGGDEKRSDLPPQVHQRL 1085

Query:   835 PMSLIFGGDGSYMAPFSLQNDNILTSLMSQLVPPTICYRLVAGLNAQLRLVRRGRLRATF 894
             PM LIFGGDGSYMA +SLQ+D+ILTSL+SQLVPPT  YR VAGLNAQLRLV++G+LR+TF
Sbjct:  1086 PMPLIFGGDGSYMAYYSLQSDDILTSLLSQLVPPTTWYRFVAGLNAQLRLVQQGKLRSTF 1145

Query:   895 RPVLRWLETHANPTLQLHGLRVDLAWFQATACGYCQYGLLVYAVGGENEPTSIGSFDRGR 954
             R V+RW+ETH NP L+ HG+RVDLA FQA +   CQYG+LV+ +   +E  S  S D   
Sbjct:  1146 RSVMRWIETHGNPALKRHGVRVDLARFQALSSSSCQYGILVHTIA--DEVASTRSDDE-- 1201

Query:   955 LIERESRVKSIDMENPSGRLREETLLTRAQRSSESYMKRKRSHGGIIDTNNVQMLEERRD 1014
                 +       +EN SG  RE     +  RS  ++++ +   G IID  ++Q L+E +D
Sbjct:  1202 --TEQQHPWGTQIENHSGDFRENF---QPLRSEINHVRHQEC-GEIIDIGSLQFLKEEKD 1255

Query:  1015 IFYFLSFIVHNTKPVGHQD 1033
             +   +SF++HNTKPVGHQD
Sbjct:  1256 VLSLISFLIHNTKPVGHQD 1274

 Score = 454 (164.9 bits), Expect = 0., Sum P(3) = 0.
 Identities = 82/108 (75%), Positives = 90/108 (83%)

Query:  1085 NALFSHGPRRSVGLARVYALWNVTSLINVGVAFLCGYVHYSSGSSPNKKVPNFQPWNFSM 1144
             +ALFSHGPRRS    RVYALWNVTSL+NV VAF+CGYVHY  GSS  KK+P  QPWN SM
Sbjct:  1326 SALFSHGPRRSASRTRVYALWNVTSLVNVVVAFVCGYVHYH-GSSSGKKIPYLQPWNISM 1384

Query:  1145 DESEWWIFPAGLVLCKIFQSQLVNWHVANLEIQDRTLYSNDFELFWQS 1192
             DE+EWWIFP  L LCK+ QSQLVNWHVANLEIQD +LYS+D ELFWQS
Sbjct:  1385 DENEWWIFPVALFLCKVLQSQLVNWHVANLEIQDYSLYSDDSELFWQS 1432

 Score = 47 (21.6 bits), Expect = 3.2e-38, Sum P(2) = 3.2e-38
 Identities = 11/27 (40%), Positives = 15/27 (55%)

Query:    44 LAMLVTYVLIRGNSFACPDNAGGAGTL 70
             +A++   V +RG      DN GG GTL
Sbjct:   763 VAIVKGRVYVRGGMGIIEDNIGGNGTL 789

 Score = 45 (20.9 bits), Expect = 0., Sum P(3) = 0.
 Identities = 11/33 (33%), Positives = 21/33 (63%)

Query:    56 NSFACPDNAGGAGTLYDAVPRTLTVSNYNMSTD 88
             +S +C D+ GG G+L D+  +   V++ N++ D
Sbjct:    56 SSVSCVDDLGGVGSL-DSTCKL--VADLNLTRD 85

 Score = 40 (19.1 bits), Expect = 2.0e-36, Sum P(4) = 2.0e-36
 Identities = 9/16 (56%), Positives = 11/16 (68%)

Query:   418 LSSLSVEGSVKADGQS 433
             L  +S+ GSV ADG S
Sbjct:   236 LGYISLNGSVLADGAS 251

 Score = 37 (18.1 bits), Expect = 2.0e-36, Sum P(4) = 2.0e-36
 Identities = 8/19 (42%), Positives = 12/19 (63%)

Query:   179 NSEMLVDGGGDATVATSLL 197
             ++E+   GGG  TVA  +L
Sbjct:   218 SNEVDYGGGGGGTVAIEIL 236


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.322   0.137   0.429    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0     1192      1030   0.00081  123 3  11 22  0.36    34
                                                     38  0.45    37


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  1
  No. of states in DFA:  631 (67 KB)
  Total size of DFA:  538 KB (2244 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  74.25u 0.13s 74.38t   Elapsed:  00:00:16
  Total cpu time:  74.25u 0.13s 74.38t   Elapsed:  00:00:16
  Start:  Thu May  9 21:01:05 2013   End:  Thu May  9 21:01:21 2013

Back to top