BLAST Results

Query Summary

Your job contains 1 sequence.

Parameters
Threshold: 0.001
Maximum number of alignments shown: 100
BLAST filter: on

Query Sequence

>023545
MALSSISPLSIKSLNFCSSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQCAG
PYAKLKNWRVFVSTALAAAVVASCSSNISALADLNKYEAETRGEFGIGSAAQFGSADLRK
AVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEA
NLTNAVLVRTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLG
CGNSRRNAYGSPSSPLLSAPPQKLLDRDGFCDSGTGLCDAK

High Scoring Gene Products

Symbol, full name Information P value
AT1G12250 protein from Arabidopsis thaliana 4.2e-79
AT2G44920 protein from Arabidopsis thaliana 1.1e-10
HNE_2256
Pentapeptide repeat domain protein
protein from Hyphomonas neptunium ATCC 15444 5.9e-06
GSU2404
Pentapeptide repeat domain protein
protein from Geobacter sulfurreducens PCA 1.5e-05
GSU_2404
pentapeptide repeat domain protein
protein from Geobacter sulfurreducens PCA 1.5e-05

Back to top

Raw Blast Data

BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]

Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.

Reference:  Gish, W. (1996-2006) http://blast.wustl.edu

Query=  023545
        (281 letters)

Database:  go_20130330-seqdb.fasta
           368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done

                                                                     Smallest
                                                                       Sum
                                                              High  Probability
Sequences producing High-scoring Segment Pairs:              Score  P(N)      N

TAIR|locus:2202033 - symbol:AT1G12250 "AT1G12250" species...   795  4.2e-79   1
TAIR|locus:2055023 - symbol:AT2G44920 "AT2G44920" species...   157  1.1e-10   1
UNIPROTKB|Q0BZZ2 - symbol:HNE_2256 "Pentapeptide repeat d...   125  5.9e-06   1
UNIPROTKB|Q74B06 - symbol:GSU2404 "Pentapeptide repeat do...   122  1.5e-05   1
TIGR_CMR|GSU_2404 - symbol:GSU_2404 "pentapeptide repeat ...   122  1.5e-05   1


>TAIR|locus:2202033 [details] [associations]
            symbol:AT1G12250 "AT1G12250" species:3702 "Arabidopsis
            thaliana" [GO:0003674 "molecular_function" evidence=ND] [GO:0005576
            "extracellular region" evidence=ISM] [GO:0008150
            "biological_process" evidence=ND] [GO:0009507 "chloroplast"
            evidence=ISM;IDA] [GO:0009543 "chloroplast thylakoid lumen"
            evidence=ISS] [GO:0009535 "chloroplast thylakoid membrane"
            evidence=IDA] [GO:0009579 "thylakoid" evidence=IDA] [GO:0009534
            "chloroplast thylakoid" evidence=IDA] EMBL:CP002684
            GenomeReviews:CT485782_GR GO:GO:0009535 GO:GO:0009543 EMBL:AC022522
            eggNOG:COG1357 EMBL:AY142640 EMBL:AY035122 IPI:IPI00519055
            PIR:F86257 RefSeq:NP_563902.1 UniGene:At.19174
            ProteinModelPortal:Q8H1Q1 SMR:Q8H1Q1 IntAct:Q8H1Q1 STRING:Q8H1Q1
            PaxDb:Q8H1Q1 PRIDE:Q8H1Q1 ProMEX:Q8H1Q1 EnsemblPlants:AT1G12250.1
            GeneID:837778 KEGG:ath:AT1G12250 TAIR:At1g12250
            HOGENOM:HOG000239303 InParanoid:Q8H1Q1 OMA:GAYLEKA PhylomeDB:Q8H1Q1
            ProtClustDB:CLSN2687778 Genevestigator:Q8H1Q1 Uniprot:Q8H1Q1
        Length = 280

 Score = 795 (284.9 bits), Expect = 4.2e-79, P = 4.2e-79
 Identities = 171/267 (64%), Positives = 189/267 (70%)

Query:    18 SSSSKGPYQLHALSKPLWVACQISSKTESDGQFPDCSNNQ--CAGPYAKLKNWRXXXXXX 75
             SS S+ PY  H   + L    Q+SS+  S+ +  D SN +  C    A+   W+      
Sbjct:    21 SSVSRSPY--H-FQRYLLRRLQLSSR--SNLEIKDSSNTREGCCSS-AESNTWKRILSAA 74

Query:    76 XXXXXXXXXXXXXXXLADLNKYEAETRGEFGIGSAAQFGSADLRKAVHVKENFR-ANFTS 134
                            +A+LN++EA+TRGEFGIGSAAQ+GSADL K VH  ENFR ANFTS
Sbjct:    75 MAAAVIASSSGVPA-MAELNRFEADTRGEFGIGSAAQYGSADLSKTVHSNENFRRANFTS 133

Query:   135 ADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVRTVLTR 194
             ADMRESDFSGS FNGAYLEKAVAYKANF+GADLSDTLMDRMVLNEANLTNAVLVR+VLTR
Sbjct:   134 ADMRESDFSGSTFNGAYLEKAVAYKANFSGADLSDTLMDRMVLNEANLTNAVLVRSVLTR 193

Query:   195 SDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGCGNSRRNAYGXXXX 254
             SDLGGA IEGADFSDAVIDL QKQALCKYA GTNP+TGV TRKSLGCGNSRRNAYG    
Sbjct:   194 SDLGGAKIEGADFSDAVIDLLQKQALCKYATGTNPLTGVDTRKSLGCGNSRRNAYGSPSS 253

Query:   255 XXXXXXXXXXXDRDGFCDSGTGLCDAK 281
                         RDGFCD  TGLCD K
Sbjct:   254 PLLSAPPQRLLGRDGFCDEKTGLCDVK 280


>TAIR|locus:2055023 [details] [associations]
            symbol:AT2G44920 "AT2G44920" species:3702 "Arabidopsis
            thaliana" [GO:0003674 "molecular_function" evidence=ND] [GO:0009507
            "chloroplast" evidence=ISM;IDA] [GO:0009543 "chloroplast thylakoid
            lumen" evidence=IDA] [GO:0031977 "thylakoid lumen" evidence=IDA]
            [GO:0009579 "thylakoid" evidence=IDA] [GO:0009535 "chloroplast
            thylakoid membrane" evidence=IDA] [GO:0005515 "protein binding"
            evidence=IPI] [GO:0009534 "chloroplast thylakoid" evidence=IDA]
            [GO:0006098 "pentose-phosphate shunt" evidence=RCA] [GO:0006636
            "unsaturated fatty acid biosynthetic process" evidence=RCA]
            [GO:0009409 "response to cold" evidence=RCA] [GO:0015979
            "photosynthesis" evidence=RCA] [GO:0015995 "chlorophyll
            biosynthetic process" evidence=RCA] [GO:0016117 "carotenoid
            biosynthetic process" evidence=RCA] [GO:0019288 "isopentenyl
            diphosphate biosynthetic process, mevalonate-independent pathway"
            evidence=RCA] [GO:0042742 "defense response to bacterium"
            evidence=RCA] [GO:0043085 "positive regulation of catalytic
            activity" evidence=RCA] Pfam:PF00805 EMBL:CP002685
            GenomeReviews:CT485783_GR EMBL:AC002388 GO:GO:0009535 GO:GO:0009543
            eggNOG:COG1357 InterPro:IPR001646 EMBL:AY050941 EMBL:BT000902
            IPI:IPI00534350 IPI:IPI00535173 PIR:T00401 RefSeq:NP_566030.1
            RefSeq:NP_566031.1 UniGene:At.12323 PDB:3N90 PDBsum:3N90
            ProteinModelPortal:O22160 SMR:O22160 IntAct:O22160 STRING:O22160
            PaxDb:O22160 PRIDE:O22160 ProMEX:O22160 EnsemblPlants:AT2G44920.2
            GeneID:819101 KEGG:ath:AT2G44920 TAIR:At2g44920
            HOGENOM:HOG000232693 InParanoid:O22160 OMA:FLKYFLC PhylomeDB:O22160
            ProtClustDB:CLSN2688933 Genevestigator:O22160 GermOnline:AT2G44920
            Uniprot:O22160
        Length = 224

 Score = 157 (60.3 bits), Expect = 1.1e-10, P = 1.1e-10
 Identities = 45/119 (37%), Positives = 67/119 (56%)

Query:   129 RANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLM---DRMVLN--EANLT 183
             R +F ++ +R+++F G+K  GA       + A+ TGADLS+  +   D  + N  + NLT
Sbjct:   110 RQDFKTSILRQANFKGAKLLGASF-----FDADLTGADLSEADLRGADFSLANVTKVNLT 164

Query:   184 NAVLV-RTVLTRSDLGGAIIEGADFSDAVIDLAQKQALCKYANGTNPITGVSTRKSLGC 241
             NA L   TV   +   G+ I GADF+D  +   Q+  LCK A+G N  TG +TR +L C
Sbjct:   165 NANLEGATVTGNTSFKGSNITGADFTDVPLRDDQRVYLCKVADGVNATTGNATRDTLLC 223


>UNIPROTKB|Q0BZZ2 [details] [associations]
            symbol:HNE_2256 "Pentapeptide repeat domain protein"
            species:228405 "Hyphomonas neptunium ATCC 15444" [GO:0003674
            "molecular_function" evidence=ND] [GO:0008150 "biological_process"
            evidence=ND] Pfam:PF00805 EMBL:CP000158 GenomeReviews:CP000158_GR
            eggNOG:COG1357 InterPro:IPR001646 HOGENOM:HOG000148292
            RefSeq:YP_760951.1 ProteinModelPortal:Q0BZZ2 STRING:Q0BZZ2
            GeneID:4289822 KEGG:hne:HNE_2256 PATRIC:32217357 OMA:GRADFDK
            BioCyc:HNEP228405:GI69-2278-MONOMER Uniprot:Q0BZZ2
        Length = 245

 Score = 125 (49.1 bits), Expect = 5.9e-06, P = 5.9e-06
 Identities = 41/107 (38%), Positives = 56/107 (52%)

Query:   110 AAQFGSADLRKAVHVKENFR-ANFTSADMRE-----SDFSGSKFNGAYLEKAVAYKANFT 163
             +A    ADLR A      F  A F +A M++     +DFS ++  GA LEKA     NF 
Sbjct:    82 SANVTGADLRGADLTSARFADATFNNARMQDVLASGADFSRARLQGANLEKARLIGVNFE 141

Query:   164 GADLSDTLMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADFSDA 210
             GA L   L  R  L  A+L+ A    T+L R++L G I +GA+ S+A
Sbjct:   142 GASL---LFAR--LETADLSGANCTGTILDRANLRGTIFDGANLSEA 183


>UNIPROTKB|Q74B06 [details] [associations]
            symbol:GSU2404 "Pentapeptide repeat domain protein"
            species:243231 "Geobacter sulfurreducens PCA" [GO:0003674
            "molecular_function" evidence=ND] [GO:0008150 "biological_process"
            evidence=ND] Pfam:PF00805 EMBL:AE017180 GenomeReviews:AE017180_GR
            InterPro:IPR001646 HOGENOM:HOG000148292 RefSeq:NP_953450.1
            ProteinModelPortal:Q74B06 GeneID:2686536 KEGG:gsu:GSU2404
            PATRIC:22027655 OMA:QAWALEA ProtClustDB:CLSK743141
            BioCyc:GSUL243231:GH27-2385-MONOMER Uniprot:Q74B06
        Length = 254

 Score = 122 (48.0 bits), Expect = 1.5e-05, P = 1.5e-05
 Identities = 38/106 (35%), Positives = 57/106 (53%)

Query:   111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
             A    AD+RK V+V+   + NF+ A++  ++FSG+K   A L  AV    NF+ ADLS T
Sbjct:   122 ANLSGADMRK-VNVE---KGNFSQANLTNANFSGAKLKYANLGGAVLRGTNFSFADLSAT 177

Query:   171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF-SDAVIDLA 215
              +  + L  AN   A    T+L  + L GA +  + F S ++ D A
Sbjct:   178 DLGSLDLEGANFRGATFNGTLLRDAKLKGADLRQSRFHSVSIYDTA 223

 Score = 119 (46.9 bits), Expect = 3.5e-05, P = 3.5e-05
 Identities = 31/81 (38%), Positives = 46/81 (56%)

Query:   130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
             A F +ADMR +  SG     AY+  A    AN +GAD+    +++   ++ANLTNA    
Sbjct:    97 AIFDTADMRSAHCSG-----AYIHHAKFVGANLSGADMRKVNVEKGNFSQANLTNANFSG 151

Query:   190 TVLTRSDLGGAIIEGADFSDA 210
               L  ++LGGA++ G +FS A
Sbjct:   152 AKLKYANLGGAVLRGTNFSFA 172


>TIGR_CMR|GSU_2404 [details] [associations]
            symbol:GSU_2404 "pentapeptide repeat domain protein"
            species:243231 "Geobacter sulfurreducens PCA" [GO:0003674
            "molecular_function" evidence=ND] [GO:0008150 "biological_process"
            evidence=ND] Pfam:PF00805 EMBL:AE017180 GenomeReviews:AE017180_GR
            InterPro:IPR001646 HOGENOM:HOG000148292 RefSeq:NP_953450.1
            ProteinModelPortal:Q74B06 GeneID:2686536 KEGG:gsu:GSU2404
            PATRIC:22027655 OMA:QAWALEA ProtClustDB:CLSK743141
            BioCyc:GSUL243231:GH27-2385-MONOMER Uniprot:Q74B06
        Length = 254

 Score = 122 (48.0 bits), Expect = 1.5e-05, P = 1.5e-05
 Identities = 38/106 (35%), Positives = 57/106 (53%)

Query:   111 AQFGSADLRKAVHVKENFRANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDT 170
             A    AD+RK V+V+   + NF+ A++  ++FSG+K   A L  AV    NF+ ADLS T
Sbjct:   122 ANLSGADMRK-VNVE---KGNFSQANLTNANFSGAKLKYANLGGAVLRGTNFSFADLSAT 177

Query:   171 LMDRMVLNEANLTNAVLVRTVLTRSDLGGAIIEGADF-SDAVIDLA 215
              +  + L  AN   A    T+L  + L GA +  + F S ++ D A
Sbjct:   178 DLGSLDLEGANFRGATFNGTLLRDAKLKGADLRQSRFHSVSIYDTA 223

 Score = 119 (46.9 bits), Expect = 3.5e-05, P = 3.5e-05
 Identities = 31/81 (38%), Positives = 46/81 (56%)

Query:   130 ANFTSADMRESDFSGSKFNGAYLEKAVAYKANFTGADLSDTLMDRMVLNEANLTNAVLVR 189
             A F +ADMR +  SG     AY+  A    AN +GAD+    +++   ++ANLTNA    
Sbjct:    97 AIFDTADMRSAHCSG-----AYIHHAKFVGANLSGADMRKVNVEKGNFSQANLTNANFSG 151

Query:   190 TVLTRSDLGGAIIEGADFSDA 210
               L  ++LGGA++ G +FS A
Sbjct:   152 AKLKYANLGGAVLRGTNFSFA 172


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.316   0.130   0.386    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      281       233   0.00086  113 3  11 22  0.44    33
                                                     32  0.43    36


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  5
  No. of states in DFA:  604 (64 KB)
  Total size of DFA:  179 KB (2103 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  17.29u 0.10s 17.39t   Elapsed:  00:00:04
  Total cpu time:  17.29u 0.10s 17.39t   Elapsed:  00:00:04
  Start:  Fri May 10 07:24:12 2013   End:  Fri May 10 07:24:16 2013

Back to top