BLAST Results

Query Summary

Your job contains 1 sequence.

Parameters
Threshold: 0.001
Maximum number of alignments shown: 100
BLAST filter: on

Query Sequence

>014161
MYGLIKFLVSCRKLIVITIMRICMQLFKSPISHFVMFSYFPFMQVMDKSNFKITTDEEID
VALSGQYLLHLPITVNESKLDKKLLKRYFEEHHHDHLPDFADKYVIFRRGIGVDQTTDYF
FMEKVDMLIGRFWSFLMRRTGLEKLLSRRSKRRHKPDPKKDDEINSETEQNDLSVERHRL
ENMELRFFRNLLGKVTIQEPTFDRIIVLYRQASTKSKAERGVYLKHFRNIPMADMEIVLP
EKKNPGLTPLDWVKFLVSAVVGLVAVITSAQLHEIDLWVGMAILSTVIGYCAKTYFTFQQ
NMAAYQNMITQSMYDKQLDSGKGTLLHLCDDVIQQEVKEVIISFFILMEQGKATRQDLDL
RCEELIKEEFGESCNFDVDDAVHKLEKLGIVARDTIGRYYCVGLKRSNEIIGTTTEEMVL
KAQQGISTT

High Scoring Gene Products

Symbol, full name Information P value
AT3G19340 protein from Arabidopsis thaliana 7.4e-153
AT5G13940 protein from Arabidopsis thaliana 6.4e-125
AT2G46915 protein from Arabidopsis thaliana 3.5e-18

Back to top

Raw Blast Data

BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]

Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.

Reference:  Gish, W. (1996-2006) http://blast.wustl.edu

Query=  014161
        (429 letters)

Database:  go_20130330-seqdb.fasta
           368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done

                                                                     Smallest
                                                                       Sum
                                                              High  Probability
Sequences producing High-scoring Segment Pairs:              Score  P(N)      N

TAIR|locus:2090639 - symbol:AT3G19340 species:3702 "Arabi...  1491  7.4e-153  1
TAIR|locus:2159108 - symbol:AT5G13940 species:3702 "Arabi...  1201  6.4e-125  2
TAIR|locus:505006322 - symbol:AT2G46915 "AT2G46915" speci...   228  3.5e-18   3


>TAIR|locus:2090639 [details] [associations]
            symbol:AT3G19340 species:3702 "Arabidopsis thaliana"
            [GO:0005886 "plasma membrane" evidence=IDA] [GO:0005783
            "endoplasmic reticulum" evidence=IDA] GO:GO:0005783 GO:GO:0005886
            EMBL:CP002686 InterPro:IPR022227 Pfam:PF12576 UniGene:At.43813
            EMBL:AY093193 IPI:IPI00528723 RefSeq:NP_188565.1 PRIDE:Q8RWC3
            EnsemblPlants:AT3G19340.1 GeneID:821468 KEGG:ath:AT3G19340
            TAIR:At3g19340 HOGENOM:HOG000005876 InParanoid:Q8RWC3 OMA:DRMIVVY
            PhylomeDB:Q8RWC3 ProtClustDB:CLSN2684575 ArrayExpress:Q8RWC3
            Genevestigator:Q8RWC3 Uniprot:Q8RWC3
        Length = 487

 Score = 1491 (529.9 bits), Expect = 7.4e-153, P = 7.4e-153
 Identities = 283/384 (73%), Positives = 336/384 (87%)

Query:    44 QVMDKSNFKITTDEEIDVALSGQYLLHLPITVNESKLDKKLLKRYFEEHHHDHLPDFADK 103
             QVM+KSNFKIT++EE++VA SGQYLL+LPI V+ESKLDKKLLKRYFEEH H+++PDF+DK
Sbjct:   103 QVMEKSNFKITSNEEMEVAHSGQYLLNLPIKVDESKLDKKLLKRYFEEHPHENIPDFSDK 162

Query:   104 YVIFRRGIGVDQTTDYFFMEKVDMLIGRFWSFLMRRTGLEXXXXXXXXXXXXXXXXXXXE 163
             YVIFRRGIG+D+TTDYFFMEK+D++I RFWSFLMR T LE                   E
Sbjct:   163 YVIFRRGIGLDKTTDYFFMEKLDVIISRFWSFLMRITRLEKLRAKRSSSLNKKDPKKDDE 222

Query:   164 INSETEQNDLSVERHRLENMELRFFRNLLGKVTIQEPTFDRIIVLYRQASTKSKAERGVY 223
              N +T+ ++L VER RLEN +L F ++ L K+TIQEPTFDR+IV+YR+AS+K+  ERG+Y
Sbjct:   223 PNPDTDNDELYVERIRLENSKLSF-KSFLSKLTIQEPTFDRMIVVYRRASSKTNLERGIY 281

Query:   224 LKHFRNIPMADMEIVLPEKKNPGLTPLDWVKFLVSAVVGLVAVITSAQLHEIDLWVGMAI 283
             +KHF+NIPMADMEIVLPEK+NPGLTP+DWVKFL+SAVVGLVAV+TS ++ + D WV +AI
Sbjct:   282 VKHFKNIPMADMEIVLPEKRNPGLTPMDWVKFLISAVVGLVAVLTSVEMPKSDPWVIIAI 341

Query:   284 LSTVIGYCAKTYFTFQQNMAAYQNMITQSMYDKQLDSGKGTLLHLCDDVIQQEVKEVIIS 343
             LSTV+GYCAKTYFTFQQNMA YQN+ITQSMYDKQLDSG+GTLLHLCDDVIQQEVKEV+I 
Sbjct:   342 LSTVLGYCAKTYFTFQQNMATYQNLITQSMYDKQLDSGRGTLLHLCDDVIQQEVKEVMIC 401

Query:   344 FFILMEQGKATRQDLDLRCEELIKEEFGESCNFDVDDAVHKLEKLGIVARDTIGRYYCVG 403
             F+ILMEQGKAT +DLDLRCEELIKEEFG  CNFDV+DAV KLEKLGIVARDTIGRYYC+G
Sbjct:   402 FYILMEQGKATLEDLDLRCEELIKEEFGARCNFDVEDAVQKLEKLGIVARDTIGRYYCMG 461

Query:   404 LKRSNEIIGTTTEEMVLKAQQGIS 427
             LKR+NEIIGTTTEE+VLKA+QG++
Sbjct:   462 LKRANEIIGTTTEELVLKAKQGVT 485


>TAIR|locus:2159108 [details] [associations]
            symbol:AT5G13940 species:3702 "Arabidopsis thaliana"
            [GO:0004177 "aminopeptidase activity" evidence=IEA] [GO:0005622
            "intracellular" evidence=IEA] [GO:0005886 "plasma membrane"
            evidence=ISM] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000819 Pfam:PF00883 EMBL:CP002688 GO:GO:0006508
            GO:GO:0004177 GO:GO:0005622 UniGene:At.297 InterPro:IPR022227
            Pfam:PF12576 IPI:IPI00534381 RefSeq:NP_196898.5 UniGene:At.66714
            PRIDE:F4K5D9 EnsemblPlants:AT5G13940.1 GeneID:831242
            KEGG:ath:AT5G13940 Uniprot:F4K5D9
        Length = 809

 Score = 1201 (427.8 bits), Expect = 6.4e-125, Sum P(2) = 6.4e-125
 Identities = 233/383 (60%), Positives = 297/383 (77%)

Query:    44 QVMDKSNFKITTDEEIDVALSGQYLLHLPITVNESKLDKKLLKRYFEEHHHDHLPDFADK 103
             QVM+KSNFK+ T+EEI VALS QY L+LPI VNE+KLD KLL RYF +   D LP FADK
Sbjct:   391 QVMEKSNFKVITNEEIQVALSAQYRLNLPIVVNEAKLDTKLLTRYFSKFPRDDLPHFADK 450

Query:   104 YVIFRRGIGVDQTTDYFFMEKVDMLIGRFWSFLMRRTGLEXXXXXXXXXXXXXXXXXXXE 163
             Y+IFRRG G+D    YFF+ K+D ++ R W FL+  T L+                   +
Sbjct:   451 YIIFRRGFGIDHMKAYFFLAKIDTILVRIWHFLLTITCLKRLVYGKKNDVGLSEQI---D 507

Query:   164 INSETEQNDLSVERHRLENMELRFFRNLLGKVTIQEPTFDRIIVLYRQASTKSKAERGVY 223
             I+ ETE++ L +ER R+E ++L    NL+ K+TIQEPTF+RIIV+YR+ S K ++ER +Y
Sbjct:   508 ISIETEKDSLYIERIRIEKLKLSL-SNLMKKITIQEPTFERIIVVYRRVSGKKESERNIY 566

Query:   224 LKHFRNIPMADMEIVLPEKKNPGLTPLDWVKFLVSAVVGLVAVITSAQLHEIDLWVGMAI 283
             +KHF+ IPMADMEIVLPEKKNPGLTPLDWVKFLVSA +GLV V++S  L + D+ V  AI
Sbjct:   567 VKHFKTIPMADMEIVLPEKKNPGLTPLDWVKFLVSAAIGLVTVVSSVSLKKADIRVIAAI 626

Query:   284 LSTVIGYCAKTYFTFQQNMAAYQNMITQSMYDKQLDSGKGTLLHLCDDVIQQEVKEVIIS 343
             LSTV+ YC KTYFTFQ+N+  YQ++IT+S+YDKQLDSG+GTLLHLCD+VIQQEVKEVIIS
Sbjct:   627 LSTVVAYCVKTYFTFQRNLVDYQSLITRSVYDKQLDSGRGTLLHLCDEVIQQEVKEVIIS 686

Query:   344 FFILMEQGKAT-RQDLDLRCEELIKEEFGESCNFDVDDAVHKLEKLGIVARDTIGRYYCV 402
             FF+L+++G  T +++LD++ E  IKEEF ESCNFDVDDA+ KLEKLG+V+RD+  +Y CV
Sbjct:   687 FFMLIKKGCPTSKEELDMKSEAFIKEEFNESCNFDVDDAITKLEKLGLVSRDSEDKYRCV 746

Query:   403 GLKRSNEIIGTTTEEMVLKAQQG 425
              +K +NEI+GTTTEEMVLKA+QG
Sbjct:   747 EMKEANEIMGTTTEEMVLKARQG 769

 Score = 47 (21.6 bits), Expect = 6.4e-125, Sum P(2) = 6.4e-125
 Identities = 13/36 (36%), Positives = 21/36 (58%)

Query:    36 MFSYFPFMQVMDKSNFKITTDEEIDVALSGQYLLHL 71
             ++S F  ++   + N +  +  EID AL  Q+LLHL
Sbjct:   355 LYSLFEPVRGAHRLNQQNLSTREID-ALEDQFLLHL 389


>TAIR|locus:505006322 [details] [associations]
            symbol:AT2G46915 "AT2G46915" species:3702 "Arabidopsis
            thaliana" [GO:0003674 "molecular_function" evidence=ND] [GO:0008150
            "biological_process" evidence=ND] [GO:0009507 "chloroplast"
            evidence=ISM] EMBL:CP002685 UniGene:At.27400 UniGene:At.63511
            InterPro:IPR022227 Pfam:PF12576 IPI:IPI00543827 RefSeq:NP_850462.2
            PRIDE:F4IK40 EnsemblPlants:AT2G46915.1 GeneID:819305
            KEGG:ath:AT2G46915 OMA:TLPIYVD Uniprot:F4IK40
        Length = 708

 Score = 228 (85.3 bits), Expect = 3.5e-18, Sum P(3) = 3.5e-18
 Identities = 59/220 (26%), Positives = 115/220 (52%)

Query:   191 LLGKVTIQEPTFDRIIVLY-RQASTK---SKAER--GVYLKHFRNIPMADMEIVLPEKKN 244
             LL   T+QEP F+ +I+LY + AS K   +K E    + L+ F  IP+ D+ ++ P KK 
Sbjct:   463 LLSPSTLQEPAFEELILLYTKDASEKDDKNKDETRSSLQLEIFERIPIPDLPVIFPHKKL 522

Query:   245 PGLTPLDWVKFLVSAVVGLVAVITSAQLHEID-----LWVGMAILSTVIGYCAKTYFTFQ 299
                  +D V+  +++++GL A   + +   I       ++ +  ++ ++ Y  +    ++
Sbjct:   523 Y-FRIIDTVRLDIASILGLTAYFVNYKFENISSSPSAFFLDVIAVTALVIYATRVVLGYK 581

Query:   300 QNMAAYQNMITQSMYDKQLDSGKGTLLHLCDDVIQQEVKEVIISFFILMEQGK---ATRQ 356
             Q    YQ ++ +++Y+K L SG G++  L D   QQ+ KE I+++ I+++ GK    + +
Sbjct:   582 QTWDRYQLLVNKTLYEKTLASGFGSVHFLLDASEQQQYKEAILTYAIILQAGKNQNMSYK 641

Query:   357 DLDLRCEELIKEEFGESCNFDVDDAVHKLEKLGIVARDTI 396
              +  RCE  + + F       V+ A+  L +LG+V    +
Sbjct:   642 GVGDRCERFMYDTFKIKVEMRVEKAISTLVRLGLVTETLV 681

 Score = 55 (24.4 bits), Expect = 3.5e-18, Sum P(3) = 3.5e-18
 Identities = 13/40 (32%), Positives = 25/40 (62%)

Query:    42 FMQVMDKSNFKITTDEEIDV--ALSGQYLLHLPITVNESK 79
             F+Q++D + F+  +  ++ +  AL+  YLL LP+ V+  K
Sbjct:   223 FIQLLDNAGFEELSARDLALTSALNTDYLLTLPVYVDWKK 262

 Score = 46 (21.3 bits), Expect = 3.5e-18, Sum P(3) = 3.5e-18
 Identities = 7/24 (29%), Positives = 14/24 (58%)

Query:   105 VIFRRGIGVDQTTDYFFMEKVDML 128
             ++FRRG   ++      +EK+D +
Sbjct:   269 IVFRRGFATEKEKGLLLVEKLDYI 292


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.325   0.139   0.407    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      429       410   0.00079  118 3  11 22  0.46    33
                                                     34  0.45    37


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  3
  No. of states in DFA:  614 (65 KB)
  Total size of DFA:  253 KB (2136 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  33.49u 0.17s 33.66t   Elapsed:  00:00:02
  Total cpu time:  33.49u 0.17s 33.66t   Elapsed:  00:00:02
  Start:  Fri May 10 03:08:07 2013   End:  Fri May 10 03:08:09 2013

Back to top