BLAST Results

Query Summary

Your job contains 1 sequence.

Parameters
Threshold: 0.001
Maximum number of alignments shown: 100
BLAST filter: on

Query Sequence

>028224
MPPIDGSNPYVAHTPAPPSNSFSFKDAVGSVKDVLGRWGKKVGEATKKAEDLAGNTWQHL
KTSPSFADAAMGRIAQGTKVLAEGGYEKIFRQTFETVPEEQLQNSYACYLSTSAGPVMGI
LYVSTAKLAFCSDNPLSYKSSGQTEWSYYKVVIPLHQLRAVNPSSSRNNPAEKYVQVISI
DNHEFWFMGFLNYNGAVEWLQGALEARNLESV

High Scoring Gene Products

Symbol, full name Information P value
GEM
AT2G22475
protein from Arabidopsis thaliana 2.1e-84
FIP1
AT1G28200
protein from Arabidopsis thaliana 6.1e-71
AT5G13200 protein from Arabidopsis thaliana 1.1e-41
AT4G01600 protein from Arabidopsis thaliana 3.6e-41
AT5G23370 protein from Arabidopsis thaliana 3.8e-30
AT5G08350 protein from Arabidopsis thaliana 5.6e-29
AT5G23360 protein from Arabidopsis thaliana 1.2e-28

Back to top

Raw Blast Data

BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]

Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.

Reference:  Gish, W. (1996-2006) http://blast.wustl.edu

Query=  028224
        (212 letters)

Database:  go_20130330-seqdb.fasta
           368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done

                                                                     Smallest
                                                                       Sum
                                                              High  Probability
Sequences producing High-scoring Segment Pairs:              Score  P(N)      N

TAIR|locus:505006267 - symbol:GEM "AT2G22475" species:370...   845  2.1e-84   1
TAIR|locus:2032185 - symbol:FIP1 "AT1G28200" species:3702...   718  6.1e-71   1
TAIR|locus:2183901 - symbol:AT5G13200 "AT5G13200" species...   442  1.1e-41   1
TAIR|locus:2133387 - symbol:AT4G01600 "AT4G01600" species...   437  3.6e-41   1
TAIR|locus:2166806 - symbol:AT5G23370 "AT5G23370" species...   333  3.8e-30   1
TAIR|locus:2150823 - symbol:AT5G08350 "AT5G08350" species...   322  5.6e-29   1
TAIR|locus:2166791 - symbol:AT5G23360 "AT5G23360" species...   319  1.2e-28   1


>TAIR|locus:505006267 [details] [associations]
            symbol:GEM "AT2G22475" species:3702 "Arabidopsis
            thaliana" [GO:0003674 "molecular_function" evidence=ND] [GO:0005634
            "nucleus" evidence=ISM] [GO:0010026 "trichome differentiation"
            evidence=IMP] [GO:0010482 "regulation of epidermal cell division"
            evidence=IMP] [GO:0048765 "root hair cell differentiation"
            evidence=IMP] [GO:0051567 "histone H3-K9 methylation" evidence=IDA]
            [GO:0005829 "cytosol" evidence=IDA] [GO:0042732 "D-xylose metabolic
            process" evidence=RCA] Pfam:PF02893 GO:GO:0005829 EMBL:CP002685
            GenomeReviews:CT485783_GR GO:GO:0051567 EMBL:AC006592 GO:GO:0048765
            InterPro:IPR004182 SMART:SM00568 GO:GO:0010482 GO:GO:0010026
            HOGENOM:HOG000239286 EMBL:EF490993 EMBL:AK118552 EMBL:AY087182
            IPI:IPI00525431 IPI:IPI00531015 RefSeq:NP_565538.1
            RefSeq:NP_973510.1 UniGene:At.19761 PaxDb:Q8S8F8 PRIDE:Q8S8F8
            EnsemblPlants:AT2G22475.1 GeneID:816780 KEGG:ath:AT2G22475
            TAIR:At2g22475 eggNOG:NOG242316 InParanoid:Q8S8F8 OMA:KPYACYL
            PhylomeDB:Q8S8F8 ProtClustDB:CLSN2688346 Genevestigator:Q8S8F8
            Uniprot:Q8S8F8
        Length = 299

 Score = 845 (302.5 bits), Expect = 2.1e-84, P = 2.1e-84
 Identities = 159/207 (76%), Positives = 181/207 (87%)

Query:     6 GSNPYVAHTPAPPSNSFSFKDAVGSVKDVLGRWGKKVGEATKKAEDLAGNTWQHLKTSPS 65
             GSNPY+A +PA  S++ S KD + +VK VLGRWGK+V EA KK E LAGNTWQHL+T+PS
Sbjct:    94 GSNPYIARSPAETSDA-SLKDTMETVKGVLGRWGKRVAEAAKKTESLAGNTWQHLRTAPS 152

Query:    66 FADAAMGRIAQGTKVLAEGGYEKIFRQTFETVPEEQLQNSYACYLSTSAGPVMGILYVST 125
             FADAAMGRIAQ TKV AEGGYEKIFRQTFET PEEQL NS+ACYLSTSAGPVMG+LY+S+
Sbjct:   153 FADAAMGRIAQSTKVFAEGGYEKIFRQTFETDPEEQLLNSFACYLSTSAGPVMGVLYISS 212

Query:   126 AKLAFCSDNPLSYKSSGQTEWSYYKVVIPLHQLRAVNPSSSRNNPAEKYVQVISIDNHEF 185
             AKLA+CSDNPLSYK+  QTEWSYYKVVIPLHQL+AVNPS+S  NPAEKY+QVIS+DNHEF
Sbjct:   213 AKLAYCSDNPLSYKNGDQTEWSYYKVVIPLHQLKAVNPSASIVNPAEKYIQVISVDNHEF 272

Query:   186 WFMGFLNYNGAVEWLQGALEARNLESV 212
             WFMGFLNY+GAV  LQ +L+A  L SV
Sbjct:   273 WFMGFLNYDGAVTSLQDSLQAGALRSV 299


>TAIR|locus:2032185 [details] [associations]
            symbol:FIP1 "AT1G28200" species:3702 "Arabidopsis
            thaliana" [GO:0003674 "molecular_function" evidence=ND] [GO:0008150
            "biological_process" evidence=ND] [GO:0009507 "chloroplast"
            evidence=ISM] [GO:0006623 "protein targeting to vacuole"
            evidence=RCA] [GO:0006944 "cellular membrane fusion" evidence=RCA]
            [GO:0016192 "vesicle-mediated transport" evidence=RCA] [GO:0016197
            "endosomal transport" evidence=RCA] Pfam:PF02893 EMBL:CP002684
            GenomeReviews:CT485782_GR InterPro:IPR004182 SMART:SM00568
            EMBL:AC021044 EMBL:AF174428 EMBL:AF370177 EMBL:AY056389
            EMBL:AY059137 EMBL:AY086994 IPI:IPI00542127 PIR:A86408
            RefSeq:NP_174141.1 UniGene:At.21796 STRING:Q9SE96 PaxDb:Q9SE96
            PRIDE:Q9SE96 EnsemblPlants:AT1G28200.1 GeneID:839714
            KEGG:ath:AT1G28200 TAIR:At1g28200 eggNOG:NOG276129
            HOGENOM:HOG000239286 InParanoid:Q9SE96 OMA:TTMPAES PhylomeDB:Q9SE96
            ProtClustDB:CLSN2914667 Genevestigator:Q9SE96 Uniprot:Q9SE96
        Length = 259

 Score = 718 (257.8 bits), Expect = 6.1e-71, P = 6.1e-71
 Identities = 133/200 (66%), Positives = 165/200 (82%)

Query:     7 SNPYVAHTPAPPSNSFSFKDAVGSVKDVLGRWGKKVGEATKKAEDLAGNTWQHLKTSPSF 66
             SNPYV+ +PAP       ++ + SVKD LG+WGK   +ATKKAEDLAGN WQHLKT PS 
Sbjct:    64 SNPYVSPSPAP-------RNTMDSVKDTLGKWGKMAADATKKAEDLAGNFWQHLKTGPSV 116

Query:    67 ADAAMGRIAQGTKVLAEGGYEKIFRQTFETVPEEQLQNSYACYLSTSAGPVMGILYVSTA 126
             ADAA+ RIAQGTK+LAEGGYEK+F+QTF+ +P+E+L  +YACYLSTSAGPV+G++Y+ST 
Sbjct:   117 ADAAVSRIAQGTKILAEGGYEKVFKQTFDCLPDEKLLKTYACYLSTSAGPVLGVMYLSTH 176

Query:   127 KLAFCSDNPLSYKSSGQTEWSYYKVVIPLHQLRAVNPSSSRNNPAEKYVQVISIDNHEFW 186
             KLAF SDNPLSYK   QT WSYYKVV+P +QL+AVNPS+SR N ++KY+QVISIDNHEFW
Sbjct:   177 KLAFSSDNPLSYKEGEQTLWSYYKVVLPANQLKAVNPSTSRVNTSDKYIQVISIDNHEFW 236

Query:   187 FMGFLNYNGAVEWLQGALEA 206
             FMGF+ Y  AV+ LQ A+++
Sbjct:   237 FMGFVTYESAVKSLQEAVQS 256


>TAIR|locus:2183901 [details] [associations]
            symbol:AT5G13200 "AT5G13200" species:3702 "Arabidopsis
            thaliana" [GO:0003674 "molecular_function" evidence=ND] [GO:0005737
            "cytoplasm" evidence=ISM] [GO:0008150 "biological_process"
            evidence=ND] [GO:0010286 "heat acclimation" evidence=RCA]
            Pfam:PF02893 EMBL:CP002688 GenomeReviews:BA000015_GR EMBL:AL391711
            EMBL:AL163491 InterPro:IPR004182 SMART:SM00568 HOGENOM:HOG000239286
            EMBL:BT010714 EMBL:BT012428 EMBL:AK229276 IPI:IPI00528201
            PIR:T48567 RefSeq:NP_196824.1 UniGene:At.25214 STRING:Q9LYV6
            PaxDb:Q9LYV6 PRIDE:Q9LYV6 DNASU:831159 EnsemblPlants:AT5G13200.1
            GeneID:831159 KEGG:ath:AT5G13200 TAIR:At5g13200 eggNOG:NOG289836
            InParanoid:Q9LYV6 OMA:PPEKYIQ PhylomeDB:Q9LYV6
            ProtClustDB:CLSN2686867 Genevestigator:Q9LYV6 Uniprot:Q9LYV6
        Length = 272

 Score = 442 (160.7 bits), Expect = 1.1e-41, P = 1.1e-41
 Identities = 83/194 (42%), Positives = 122/194 (62%)

Query:     9 PYVAHTPAP-PSNSFSFKDAVGSVKDVLGRWGKKVGEATKKAEDLAGNTWQHLKTSPSFA 67
             PYV ++P   P+ +   +  +G    +   W       ++KAE +A N W +LKT PS +
Sbjct:    74 PYVIYSPVEHPTTNNPLEPVIG----MFHTW-------SRKAETVARNLWHNLKTGPSMS 122

Query:    68 DAAMGRIAQGTKVLAEGGYEKIFRQTFETVPEEQLQNSYACYLSTSAGPVMGILYVSTAK 127
             + A G++    K + +GG+E +FRQ F T P E L+ ++ACYLST+ GPV G +Y+S A+
Sbjct:   123 ETAWGKVNLTAKAITKGGFESLFRQIFGTEPNETLKKTFACYLSTTTGPVAGTVYLSNAR 182

Query:   128 LAFCSDNPLSYKS-SGQTEWSYYKVVIPLHQLRAVNPSSSRNNPAEKYVQVISIDNHEFW 186
             +AFCSD PL + + SGQ  WSYY+VV+PL  +  VNP   +  P EKY+Q+ ++D H+FW
Sbjct:   183 VAFCSDRPLYFTAPSGQESWSYYRVVVPLANVATVNPVVVKETPPEKYIQLTTVDGHDFW 242

Query:   187 FMGFLNYNGAVEWL 200
             FMGF+NY  A   L
Sbjct:   243 FMGFVNYEKATHHL 256


>TAIR|locus:2133387 [details] [associations]
            symbol:AT4G01600 "AT4G01600" species:3702 "Arabidopsis
            thaliana" [GO:0003674 "molecular_function" evidence=ND] [GO:0005634
            "nucleus" evidence=ISM] Pfam:PF02893 EMBL:CP002687
            GenomeReviews:CT486007_GR InterPro:IPR004182 SMART:SM00568
            EMBL:AL161492 HOGENOM:HOG000239286 EMBL:BT015153 EMBL:BT015662
            IPI:IPI00544293 IPI:IPI00656790 PIR:G85020 RefSeq:NP_001031570.1
            RefSeq:NP_192070.1 UniGene:At.34402 EnsemblPlants:AT4G01600.1
            GeneID:828085 KEGG:ath:AT4G01600 TAIR:At4g01600 eggNOG:NOG248358
            OMA:YISNRRI PhylomeDB:Q9M122 ProtClustDB:CLSN2685514
            Genevestigator:Q9M122 Uniprot:Q9M122
        Length = 233

 Score = 437 (158.9 bits), Expect = 3.6e-41, P = 3.6e-41
 Identities = 89/199 (44%), Positives = 132/199 (66%)

Query:     8 NPYVAHTPAPPSNSFSFKDAVGSVKDVLGRWGKKVGEATKKAEDLAGNTWQHLKTSPSFA 67
             NPYV H  +P   S S K +   V +VL R GKKV +AT+KAE L G    HLK SPS +
Sbjct:    29 NPYV-HITSP--TSASDKRSKDKVLEVLNRCGKKVEDATRKAEALVGGLKDHLKFSPSIS 85

Query:    68 DAAMGRIAQGTKVLAEGGYEKIFRQTFETVPEEQLQNSYACYLSTSAGPVMGILYVSTAK 127
             DAAM R++QGTK++ EGG E++F++ F  +  E+L +S+ CY+ST++GPV G++Y+S  +
Sbjct:    86 DAAMARLSQGTKMIVEGGPERVFQREFGVLAVEKLLDSFVCYISTTSGPVTGVIYISNRR 145

Query:   128 LAFCSDNPLSYKSS--GQTEWSYYKVVIPLHQLRAVNPSSSRNNPAEKYVQVISIDNHEF 185
             +AFCSD  +   SS  G    +YYKVV+   ++ +++ S++   P+E+YV +++ D  EF
Sbjct:   146 IAFCSDYAIRLPSSAGGNGVAAYYKVVMEWEKISSISSSTNVLKPSERYVHMVTRDGFEF 205

Query:   186 WFMGFLNYNGAVEWLQGAL 204
             WFMGF++Y  A   L  AL
Sbjct:   206 WFMGFVSYIDAFNCLNKAL 224


>TAIR|locus:2166806 [details] [associations]
            symbol:AT5G23370 "AT5G23370" species:3702 "Arabidopsis
            thaliana" [GO:0003674 "molecular_function" evidence=ND] [GO:0008150
            "biological_process" evidence=ND] [GO:0009507 "chloroplast"
            evidence=ISM] Pfam:PF02893 EMBL:CP002688 GenomeReviews:BA000015_GR
            InterPro:IPR004182 SMART:SM00568 EMBL:AB007648 HOGENOM:HOG000239286
            ProtClustDB:CLSN2686137 IPI:IPI00523352 RefSeq:NP_197728.1
            UniGene:At.65542 EnsemblPlants:AT5G23370.1 GeneID:832401
            KEGG:ath:AT5G23370 TAIR:At5g23370 eggNOG:NOG264012
            InParanoid:Q9FMW4 OMA:QRCCKYM PhylomeDB:Q9FMW4
            Genevestigator:Q9FMW4 Uniprot:Q9FMW4
        Length = 219

 Score = 333 (122.3 bits), Expect = 3.8e-30, P = 3.8e-30
 Identities = 68/189 (35%), Positives = 108/189 (57%)

Query:    17 PPSNSFSFKDAVGSVKDVLGRWGKKVGEATKKAEDLAGNTWQHLKTSPSFADAAMGRIAQ 76
             P S+ FSF    G  K +L +         KK +          K  P   +    +++ 
Sbjct:    36 PTSSKFSF--LTGKGKSMLRK---------KKNDSFTNGVRDQDKLGPKLTETVKRKLSL 84

Query:    77 GTKVLAEGGYEKIFRQTFETVPEEQLQNSYACYLSTSAGPVMGILYVSTAKLAFCSDNPL 136
             G ++L  GG EKI+++ F+   EE+L  +Y CYLST+AGP+ G+L++S+ K+AFCS+  +
Sbjct:    85 GARILQMGGLEKIYKRLFKVSDEEKLFKAYQCYLSTTAGPIAGLLFISSKKIAFCSERSI 144

Query:   137 SYKS-SGQTEWSYYKVVIPLHQLRAVNPSSSRNNPAEKYVQVISIDNHEFWFMGFLNYNG 195
                S  G+    +YKV IPL ++  VN S +   P++KY++V+++D  +FWFMGFL+Y  
Sbjct:   145 KVASPQGELNRVHYKVSIPLCKINGVNQSQNTTKPSQKYLEVVTVDGFDFWFMGFLSYQK 204

Query:   196 AVEWLQGAL 204
             A   L+ AL
Sbjct:   205 AFNCLEQAL 213


>TAIR|locus:2150823 [details] [associations]
            symbol:AT5G08350 "AT5G08350" species:3702 "Arabidopsis
            thaliana" [GO:0003674 "molecular_function" evidence=ND] [GO:0005575
            "cellular_component" evidence=ND] [GO:0008150 "biological_process"
            evidence=ND] Pfam:PF02893 EMBL:CP002688 GenomeReviews:BA000015_GR
            InterPro:IPR004182 SMART:SM00568 EMBL:AL392174 HOGENOM:HOG000239286
            EMBL:AY085900 EMBL:AK118619 EMBL:BT004668 IPI:IPI00529042
            RefSeq:NP_196452.1 UniGene:At.32589 PaxDb:Q9FTA0 PRIDE:Q9FTA0
            EnsemblPlants:AT5G08350.1 GeneID:830733 KEGG:ath:AT5G08350
            TAIR:At5g08350 eggNOG:NOG235697 InParanoid:Q9FTA0 OMA:MAFCSER
            PhylomeDB:Q9FTA0 ProtClustDB:CLSN2686137 Genevestigator:Q9FTA0
            Uniprot:Q9FTA0
        Length = 222

 Score = 322 (118.4 bits), Expect = 5.6e-29, P = 5.6e-29
 Identities = 74/209 (35%), Positives = 111/209 (53%)

Query:     3 PIDGSNPYVAHTPAPPS-NSF-----SFKDAVGSVKDVLGRWGKKVGEATKKAEDLAGNT 56
             P   + P V + P P S N F     S K    +VK +L R         KK +      
Sbjct:    14 PAAKATP-VGYLPDPASFNKFRVPASSKKSEQSNVKSILKR---------KKTDGFTNGV 63

Query:    57 WQHLKTSPSFADAAMGRIAQGTKVLAEGGYEKIFRQTFETVPEEQLQNSYACYLSTSAGP 116
                 K  P   +    +++ G ++L  GG EKIF++ F     E+L   Y CYLST+AGP
Sbjct:    64 RDQSKIRPKLTETVKRKLSLGARILQVGGLEKIFKRLFRVSEGEKLFKMYQCYLSTTAGP 123

Query:   117 VMGILYVSTAKLAFCSDNPLSYKS-SGQTEWSYYKVVIPLHQLRAVNPSSSRNNPAEKYV 175
             + G+L++S+ K+AFCS+  +   S  G     +YKV IPL ++  VN S +   P++KY+
Sbjct:   124 IAGLLFISSKKMAFCSERSIKVDSPQGDIIRVHYKVSIPLCKIDRVNQSQNTKKPSQKYL 183

Query:   176 QVISIDNHEFWFMGFLNYNGAVEWLQGAL 204
             +V+++D  +FWFMGFL+Y  A   L+ AL
Sbjct:   184 EVVTVDGFDFWFMGFLSYQKAFNCLEKAL 212


>TAIR|locus:2166791 [details] [associations]
            symbol:AT5G23360 "AT5G23360" species:3702 "Arabidopsis
            thaliana" [GO:0008150 "biological_process" evidence=ND] [GO:0009507
            "chloroplast" evidence=ISM] Pfam:PF02893 EMBL:CP002688
            GenomeReviews:BA000015_GR InterPro:IPR004182 SMART:SM00568
            EMBL:AB007648 HOGENOM:HOG000239286 ProtClustDB:CLSN2686137
            EMBL:BT010826 EMBL:BT011297 IPI:IPI00519830 RefSeq:NP_197727.1
            UniGene:At.31014 EnsemblPlants:AT5G23360.1 GeneID:832400
            KEGG:ath:AT5G23360 TAIR:At5g23360 eggNOG:euNOG10112
            InParanoid:Q9FMW5 OMA:CYLSTTE PhylomeDB:Q9FMW5
            Genevestigator:Q9FMW5 Uniprot:Q9FMW5
        Length = 210

 Score = 319 (117.4 bits), Expect = 1.2e-28, P = 1.2e-28
 Identities = 57/159 (35%), Positives = 96/159 (60%)

Query:    47 KKAEDLAGNTWQHLKTSPSFADAAMGRIAQGTKVLAEGGYEKIFRQTFETVPEEQLQNSY 106
             KK +          K  P   +    +++ G K+L  GG EKI+++ F+   +E+L  +Y
Sbjct:    47 KKTDSFTNGARDQDKLGPKLTETVKRKLSLGAKILQMGGLEKIYKRLFKVCDKEKLFKAY 106

Query:   107 ACYLSTSAGPVMGILYVSTAKLAFCSDNPLSYKS-SGQTEWSYYKVVIPLHQLRAVNPSS 165
              CYLST+ G + G+L++S+ K+AFCS+  +   S  G     +YKV IPL ++  VN S 
Sbjct:   107 QCYLSTTEGSIAGLLFISSKKIAFCSERSIKVTSPQGDLTRVHYKVSIPLCKINGVNQSQ 166

Query:   166 SRNNPAEKYVQVISIDNHEFWFMGFLNYNGAVEWLQGAL 204
             +   P+++Y++V+++DN++FWFMGF++Y  A   L+ AL
Sbjct:   167 NTKKPSQRYLEVVTVDNYDFWFMGFVSYQKAFNCLEKAL 205


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.314   0.130   0.393    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      212       212   0.00083  112 3  11 23  0.42    33
                                                     31  0.49    35


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  7
  No. of states in DFA:  612 (65 KB)
  Total size of DFA:  191 KB (2109 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  19.03u 0.06s 19.09t   Elapsed:  00:00:01
  Total cpu time:  19.03u 0.06s 19.09t   Elapsed:  00:00:01
  Start:  Fri May 10 08:17:06 2013   End:  Fri May 10 08:17:07 2013

Back to top