BLAST Results

Query Summary

Your job contains 1 sequence.

Parameters
Threshold: 0.001
Maximum number of alignments shown: 100
BLAST filter: on

Query Sequence

>034665
MVSLTKWVPTPIAVIISAAVFALAHLTPGEFPQLFVLGIALGFSYAQTRNLLTPITIHAF
WNSGVILLLTFLQLQGYDLKELLQASS

High Scoring Gene Products

Symbol, full name Information P value
AT1G14270 protein from Arabidopsis thaliana 1.9e-35
AT5G60750 protein from Arabidopsis thaliana 1.8e-08
AT2G20725 protein from Arabidopsis thaliana 1.2e-05
BA_5183
CAAX amino terminal protease family protein
protein from Bacillus anthracis 2.2e-05
BA_5183
CAAX amino terminal protease family protein
protein from Bacillus anthracis str. Ames 2.2e-05
BAS1041
CAAX amino terminal protease family protein
protein from Bacillus anthracis 2.3e-05
BA_1120
CAAX amino terminal protease family protein
protein from Bacillus anthracis str. Ames 2.3e-05
CHY_1108
CAAX amino terminal protease family protein
protein from Carboxydothermus hydrogenoformans Z-2901 0.00089

Back to top

Raw Blast Data

BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]

Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.

Reference:  Gish, W. (1996-2006) http://blast.wustl.edu

Query=  034665
        (87 letters)

Database:  go_20130330-seqdb.fasta
           368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done

                                                                     Smallest
                                                                       Sum
                                                              High  Probability
Sequences producing High-scoring Segment Pairs:              Score  P(N)      N

TAIR|locus:2012487 - symbol:AT1G14270 species:3702 "Arabi...   383  1.9e-35   1
TAIR|locus:2175836 - symbol:AT5G60750 "AT5G60750" species...   135  1.8e-08   1
TAIR|locus:2827507 - symbol:AT2G20725 "AT2G20725" species...   108  1.2e-05   1
UNIPROTKB|Q81XQ0 - symbol:BA_5183 "CAAX amino terminal pr...   102  2.2e-05   1
TIGR_CMR|BA_5183 - symbol:BA_5183 "CAAX amino terminal pr...   102  2.2e-05   1
UNIPROTKB|Q81TY1 - symbol:BAS1041 "CAAX amino terminal pr...   105  2.3e-05   1
TIGR_CMR|BA_1120 - symbol:BA_1120 "CAAX amino terminal pr...   105  2.3e-05   1
TIGR_CMR|CHY_1108 - symbol:CHY_1108 "CAAX amino terminal ...    89  0.00089   1


>TAIR|locus:2012487 [details] [associations]
            symbol:AT1G14270 species:3702 "Arabidopsis thaliana"
            [GO:0005886 "plasma membrane" evidence=ISM] [GO:0006508
            "proteolysis" evidence=ISS] [GO:0016020 "membrane" evidence=IEA]
            [GO:0010207 "photosystem II assembly" evidence=RCA] [GO:0019288
            "isopentenyl diphosphate biosynthetic process,
            mevalonate-independent pathway" evidence=RCA] InterPro:IPR003675
            Pfam:PF02517 EMBL:CP002684 GO:GO:0016020 GO:GO:0006508
            GO:GO:0008233 KO:K07052 IPI:IPI00541596 RefSeq:NP_563943.2
            UniGene:At.41979 UniGene:At.41980 PRIDE:F4HUI0
            EnsemblPlants:AT1G14270.1 GeneID:837988 KEGG:ath:AT1G14270
            OMA:MVSLTKW Uniprot:F4HUI0
        Length = 353

 Score = 383 (139.9 bits), Expect = 1.9e-35, P = 1.9e-35
 Identities = 71/86 (82%), Positives = 79/86 (91%)

Query:     1 MVSLTKWVPTPIAVIISAAVFALAHLTPGEFPQLFVLGIALGFSYAQTRNLLTPITIHAF 60
             MVSLTKWVPTPIA+IIS+A FALAH TPGEFPQLF+LG  LG SYAQTRNL+TP+ IH F
Sbjct:   268 MVSLTKWVPTPIAIIISSAAFALAHFTPGEFPQLFILGSVLGLSYAQTRNLITPMVIHGF 327

Query:    61 WNSGVILLLTFLQLQGYDLKELLQAS 86
             WNSGVILLLTFLQ+QGYD+KELLQA+
Sbjct:   328 WNSGVILLLTFLQIQGYDIKELLQAN 353


>TAIR|locus:2175836 [details] [associations]
            symbol:AT5G60750 "AT5G60750" species:3702 "Arabidopsis
            thaliana" [GO:0004175 "endopeptidase activity" evidence=ISS]
            [GO:0005886 "plasma membrane" evidence=ISM] [GO:0006508
            "proteolysis" evidence=ISS] [GO:0016020 "membrane" evidence=IEA]
            [GO:0006569 "tryptophan catabolic process" evidence=RCA]
            [GO:0009684 "indoleacetic acid biosynthetic process" evidence=RCA]
            InterPro:IPR003675 Pfam:PF02517 EMBL:CP002688
            GenomeReviews:BA000015_GR GO:GO:0016020 GO:GO:0006508 GO:GO:0008233
            eggNOG:COG1266 KO:K07052 EMBL:AY065231 EMBL:AY096531 EMBL:AY086571
            IPI:IPI00516676 RefSeq:NP_568928.1 UniGene:At.7243 MEROPS:M79.A01
            EnsemblPlants:AT5G60750.1 GeneID:836196 KEGG:ath:AT5G60750
            TAIR:At5g60750 HOGENOM:HOG000241876 InParanoid:Q8VZ58 OMA:EQSIMAR
            PhylomeDB:Q8VZ58 ProtClustDB:CLSN2690057 ArrayExpress:Q8VZ58
            Genevestigator:Q8VZ58 Uniprot:Q8VZ58
        Length = 347

 Score = 135 (52.6 bits), Expect = 1.8e-08, P = 1.8e-08
 Identities = 26/67 (38%), Positives = 41/67 (61%)

Query:     3 SLTKWVPTPIAVIISAAVFALAHLTPGEFPQLFVLGIALGFSYAQTRNLLTPITIHAFWN 62
             SLT+++P   A+++S+  FALAH        L  LG+ LG  +A++RNLL  + +H+ WN
Sbjct:   279 SLTRYMPVWCAILVSSIAFALAHFNVQRMLPLVFLGVVLGLIFARSRNLLPSMLLHSLWN 338

Query:    63 SGVILLL 69
               V + L
Sbjct:   339 GFVFMEL 345


>TAIR|locus:2827507 [details] [associations]
            symbol:AT2G20725 "AT2G20725" species:3702 "Arabidopsis
            thaliana" [GO:0003674 "molecular_function" evidence=ND] [GO:0006508
            "proteolysis" evidence=ISS] [GO:0016020 "membrane" evidence=IEA]
            InterPro:IPR003675 Pfam:PF02517 EMBL:CP002685
            GenomeReviews:CT485783_GR GO:GO:0016020 GO:GO:0006508 GO:GO:0008233
            EMBL:AC006234 eggNOG:COG1266 EMBL:AF370273 EMBL:AY063007
            EMBL:AK117786 IPI:IPI00529798 RefSeq:NP_565483.1 UniGene:At.21608
            STRING:Q94K61 PRIDE:Q94K61 EnsemblPlants:AT2G20725.1 GeneID:816602
            KEGG:ath:AT2G20725 TAIR:At2g20725 HOGENOM:HOG000151187
            InParanoid:Q94K61 OMA:EETIYRG PhylomeDB:Q94K61
            ProtClustDB:CLSN2688283 ArrayExpress:Q94K61 Genevestigator:Q94K61
            Uniprot:Q94K61
        Length = 301

 Score = 108 (43.1 bits), Expect = 1.2e-05, P = 1.2e-05
 Identities = 20/55 (36%), Positives = 35/55 (63%)

Query:    13 AVIISAAVFALAHLTPGEFPQLFVLGIALGFSYAQTRNLLTPITIHAFWNSGVIL 67
             A++IS+ VFA  H +  +F QLF +G  LG  Y+ + NL + + +H+ +N+  +L
Sbjct:   245 ALVISSGVFAAGHFSGEDFVQLFGIGCGLGLCYSWSGNLASSVLVHSLYNALTLL 299


>UNIPROTKB|Q81XQ0 [details] [associations]
            symbol:BA_5183 "CAAX amino terminal protease family
            protein" species:1392 "Bacillus anthracis" [GO:0003674
            "molecular_function" evidence=ND] [GO:0008150 "biological_process"
            evidence=ND] InterPro:IPR003675 Pfam:PF02517 GO:GO:0016020
            EMBL:AE016879 EMBL:AE017334 GenomeReviews:AE016879_GR
            GenomeReviews:AE017334_GR GO:GO:0006508 GO:GO:0008233 KO:K07052
            RefSeq:NP_847366.1 RefSeq:YP_021838.1 DNASU:1084558
            EnsemblBacteria:EBBACT00000013264 EnsemblBacteria:EBBACT00000017392
            GeneID:1084558 GeneID:2815751 KEGG:ban:BA_5183 KEGG:bar:GBAA_5183
            PATRIC:18787994 HOGENOM:HOG000098589 OMA:IFAPIWE
            ProtClustDB:CLSK918263 BioCyc:BANT261594:GJ7F-5058-MONOMER
            Uniprot:Q81XQ0
        Length = 203

 Score = 102 (41.0 bits), Expect = 2.2e-05, P = 2.2e-05
 Identities = 28/71 (39%), Positives = 41/71 (57%)

Query:     4 LTKWVPTPIAVIISAAVFALAH-LTPGEFPQLFVLG--IALGFSYAQTRNLLTPITIHAF 60
             L++   T  +++ISA +F L H LT G    L++LG  I L ++Y +T NLL P  IH  
Sbjct:   134 LSQHFSTFSSIVISAFIFTLGHPLTLGSV--LYILGGGICLAYTYKKTNNLLVPWGIHVL 191

Query:    61 WNSGVILLLTF 71
              N+   LL+ F
Sbjct:   192 -NNSFYLLVNF 201


>TIGR_CMR|BA_5183 [details] [associations]
            symbol:BA_5183 "CAAX amino terminal protease family
            protein" species:198094 "Bacillus anthracis str. Ames" [GO:0003674
            "molecular_function" evidence=ND] [GO:0008150 "biological_process"
            evidence=ND] InterPro:IPR003675 Pfam:PF02517 GO:GO:0016020
            EMBL:AE016879 EMBL:AE017334 GenomeReviews:AE016879_GR
            GenomeReviews:AE017334_GR GO:GO:0006508 GO:GO:0008233 KO:K07052
            RefSeq:NP_847366.1 RefSeq:YP_021838.1 DNASU:1084558
            EnsemblBacteria:EBBACT00000013264 EnsemblBacteria:EBBACT00000017392
            GeneID:1084558 GeneID:2815751 KEGG:ban:BA_5183 KEGG:bar:GBAA_5183
            PATRIC:18787994 HOGENOM:HOG000098589 OMA:IFAPIWE
            ProtClustDB:CLSK918263 BioCyc:BANT261594:GJ7F-5058-MONOMER
            Uniprot:Q81XQ0
        Length = 203

 Score = 102 (41.0 bits), Expect = 2.2e-05, P = 2.2e-05
 Identities = 28/71 (39%), Positives = 41/71 (57%)

Query:     4 LTKWVPTPIAVIISAAVFALAH-LTPGEFPQLFVLG--IALGFSYAQTRNLLTPITIHAF 60
             L++   T  +++ISA +F L H LT G    L++LG  I L ++Y +T NLL P  IH  
Sbjct:   134 LSQHFSTFSSIVISAFIFTLGHPLTLGSV--LYILGGGICLAYTYKKTNNLLVPWGIHVL 191

Query:    61 WNSGVILLLTF 71
              N+   LL+ F
Sbjct:   192 -NNSFYLLVNF 201


>UNIPROTKB|Q81TY1 [details] [associations]
            symbol:BAS1041 "CAAX amino terminal protease family
            protein" species:1392 "Bacillus anthracis" [GO:0003674
            "molecular_function" evidence=ND] [GO:0008150 "biological_process"
            evidence=ND] InterPro:IPR003675 Pfam:PF02517 GO:GO:0016020
            EMBL:AE016879 EMBL:AE017334 EMBL:AE017225 GenomeReviews:AE016879_GR
            GenomeReviews:AE017225_GR GenomeReviews:AE017334_GR GO:GO:0006508
            GO:GO:0008233 KO:K07052 RefSeq:NP_843606.1 RefSeq:YP_017740.1
            RefSeq:YP_027313.1 ProteinModelPortal:Q81TY1 DNASU:1089093
            EnsemblBacteria:EBBACT00000013037 EnsemblBacteria:EBBACT00000015312
            EnsemblBacteria:EBBACT00000024183 GeneID:1089093 GeneID:2815337
            GeneID:2852405 KEGG:ban:BA_1120 KEGG:bar:GBAA_1120 KEGG:bat:BAS1041
            HOGENOM:HOG000085342 OMA:EQEKSFS ProtClustDB:CLSK916100
            BioCyc:BANT260799:GJAJ-1117-MONOMER
            BioCyc:BANT261594:GJ7F-1168-MONOMER Uniprot:Q81TY1
        Length = 284

 Score = 105 (42.0 bits), Expect = 2.3e-05, P = 2.3e-05
 Identities = 26/67 (38%), Positives = 38/67 (56%)

Query:     6 KWVPTPIAVIISAAVFALAHLTPGEFPQLFVLGIALGFSYAQTRNLLTPITIHAFWNSGV 65
             KW  T IA+I+ A +FAL H+   +F    V  + L   Y +T +LL PI IH   N+  
Sbjct:   158 KW-GTSIAIIVVAILFALLHV---DFLGAVVFSVVLSIVYIRTNSLLMPIAIHML-NNAF 212

Query:    66 ILLLTFL 72
             ++ L+FL
Sbjct:   213 VISLSFL 219


>TIGR_CMR|BA_1120 [details] [associations]
            symbol:BA_1120 "CAAX amino terminal protease family
            protein" species:198094 "Bacillus anthracis str. Ames" [GO:0003674
            "molecular_function" evidence=ND] [GO:0008150 "biological_process"
            evidence=ND] InterPro:IPR003675 Pfam:PF02517 GO:GO:0016020
            EMBL:AE016879 EMBL:AE017334 EMBL:AE017225 GenomeReviews:AE016879_GR
            GenomeReviews:AE017225_GR GenomeReviews:AE017334_GR GO:GO:0006508
            GO:GO:0008233 KO:K07052 RefSeq:NP_843606.1 RefSeq:YP_017740.1
            RefSeq:YP_027313.1 ProteinModelPortal:Q81TY1 DNASU:1089093
            EnsemblBacteria:EBBACT00000013037 EnsemblBacteria:EBBACT00000015312
            EnsemblBacteria:EBBACT00000024183 GeneID:1089093 GeneID:2815337
            GeneID:2852405 KEGG:ban:BA_1120 KEGG:bar:GBAA_1120 KEGG:bat:BAS1041
            HOGENOM:HOG000085342 OMA:EQEKSFS ProtClustDB:CLSK916100
            BioCyc:BANT260799:GJAJ-1117-MONOMER
            BioCyc:BANT261594:GJ7F-1168-MONOMER Uniprot:Q81TY1
        Length = 284

 Score = 105 (42.0 bits), Expect = 2.3e-05, P = 2.3e-05
 Identities = 26/67 (38%), Positives = 38/67 (56%)

Query:     6 KWVPTPIAVIISAAVFALAHLTPGEFPQLFVLGIALGFSYAQTRNLLTPITIHAFWNSGV 65
             KW  T IA+I+ A +FAL H+   +F    V  + L   Y +T +LL PI IH   N+  
Sbjct:   158 KW-GTSIAIIVVAILFALLHV---DFLGAVVFSVVLSIVYIRTNSLLMPIAIHML-NNAF 212

Query:    66 ILLLTFL 72
             ++ L+FL
Sbjct:   213 VISLSFL 219


>TIGR_CMR|CHY_1108 [details] [associations]
            symbol:CHY_1108 "CAAX amino terminal protease family
            protein" species:246194 "Carboxydothermus hydrogenoformans Z-2901"
            [GO:0003824 "catalytic activity" evidence=ISS] [GO:0008152
            "metabolic process" evidence=ISS] InterPro:IPR003675 Pfam:PF02517
            GO:GO:0016020 EMBL:CP000141 GenomeReviews:CP000141_GR GO:GO:0006508
            GO:GO:0008233 eggNOG:COG1266 KO:K07052 RefSeq:YP_359954.1
            STRING:Q3AD30 GeneID:3728115 KEGG:chy:CHY_1108 PATRIC:21275354
            OMA:TEMAFFA BioCyc:CHYD246194:GJCN-1107-MONOMER Uniprot:Q3AD30
        Length = 230

 Score = 89 (36.4 bits), Expect = 0.00089, P = 0.00089
 Identities = 19/63 (30%), Positives = 32/63 (50%)

Query:     4 LTKWVPTPIAVIISAAVFALAHLTPGEFPQLFVLGIALGFSYAQTRNLLTPITIHAFWNS 63
             L K++     + +S+ +F   H  P  F  L + G  L + Y +T  +L+P   HA WN 
Sbjct:   158 LKKYMGVVGGIAVSSLLFGAMHFDPYRFLPLSLGGAILAYLYEKTGTILSPFVAHATWN- 216

Query:    64 GVI 66
             G++
Sbjct:   217 GIM 219


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.327   0.139   0.420    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0       87        87   0.00091  102 3  11 22  0.38    29
                                                     29  0.48    30


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  8
  No. of states in DFA:  522 (56 KB)
  Total size of DFA:  101 KB (2071 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  8.75u 0.08s 8.83t   Elapsed:  00:00:22
  Total cpu time:  8.75u 0.08s 8.83t   Elapsed:  00:00:22
  Start:  Thu May  9 23:01:16 2013   End:  Thu May  9 23:01:38 2013

Back to top