BLAST Results

Query Summary

Your job contains 1 sequence.

Parameters
Threshold: 0.001
Maximum number of alignments shown: 100
BLAST filter: on

Query Sequence

>043788
MKAHAPSQRPTPAFISKPVRRHQSAHDISPRHRFNLIHSKRSSHVTTLSLNKNVPPQSAE
FSRRHVFLSPLIAVGASILLQSATASADETQPSPPAQPTTSPVPQNPETVKAEEVVVSRI
YDATVIGEPLAVGKDKRKVWEKLMNARVVYLGEAEQVPVRDDRELELQIVKNLRKRCVES
ERTITLALEAFPSDLQDQLNQYTDKRIDGETLKSYASHWPPQRWQEYEPLLSYCRDNGVQ
LLACGTPLKVLRTVQAEGIHGLSKADRKLYAPPAGSGFISGFTSISHRSSVDMNSLTQSV
PFGPSSYLSAQARVVEDYAMSQIILKAIMDGGANGMLVVVTGASHVTYGSRGTGLPARIS
KKLQKKNQVVILLDLKGNIFEEREKFL

High Scoring Gene Products

Symbol, full name                          Information                                P value
AT3G56140                                  protein from Arabidopsis thaliana          8.7e-111
AT2G40400                                  protein from Arabidopsis thaliana          3.8e-110
GSU3139, Uncharacterized protein           protein from Geobacter sulfurreducens PCA  1.1e-10
GSU_3139, conserved hypothetical protein   protein from Geobacter sulfurreducens PCA  1.1e-10


Raw Blast Data

BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]

Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.

Reference:  Gish, W. (1996-2006) http://blast.wustl.edu

Query=  043788
        (387 letters)

Database:  go_20130330-seqdb.fasta
           368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done

                                                                     Smallest
                                                                       Sum
                                                              High  Probability
Sequences producing High-scoring Segment Pairs:              Score  P(N)      N

TAIR|locus:2078446 - symbol:AT3G56140 "AT3G56140" species...  1094  8.7e-111  1
TAIR|locus:2063136 - symbol:AT2G40400 species:3702 "Arabi...  1088  3.8e-110  1
UNIPROTKB|Q747X6 - symbol:GSU3139 "Uncharacterized protei...   171  1.1e-10   1
TIGR_CMR|GSU_3139 - symbol:GSU_3139 "conserved hypothetic...   171  1.1e-10   1


>TAIR|locus:2078446
            symbol:AT3G56140 "AT3G56140" species:3702 "Arabidopsis
            thaliana" [GO:0003674 "molecular_function" evidence=ND] [GO:0008150
            "biological_process" evidence=ND] [GO:0009507 "chloroplast"
            evidence=ISM;IDA] [GO:0009543 "chloroplast thylakoid lumen"
            evidence=ISS] GO:GO:0009507 EMBL:CP002686 InterPro:IPR021825
            Pfam:PF11891 EMBL:AY093111 EMBL:BT010340 EMBL:AK227110
            IPI:IPI00520041 RefSeq:NP_191173.2 UniGene:At.34962
            ProteinModelPortal:Q8RWG3 STRING:Q8RWG3 PaxDb:Q8RWG3 PRIDE:Q8RWG3
            EnsemblPlants:AT3G56140.1 GeneID:824780 KEGG:ath:AT3G56140
            TAIR:At3g56140 eggNOG:NOG242091 HOGENOM:HOG000082965
            InParanoid:Q8RWG3 OMA:RRKENFF PhylomeDB:Q8RWG3
            ProtClustDB:CLSN2688835 ArrayExpress:Q8RWG3 Genevestigator:Q8RWG3
            InterPro:IPR007314 Pfam:PF04187 Uniprot:Q8RWG3
        Length = 745

 Score = 1094 (390.2 bits), Expect = 8.7e-111, P = 8.7e-111
 Identities = 219/351 (62%), Positives = 259/351 (73%)

Query:    35 NLIHSKRSSHVTTLSLNKNVPPQSAEFSRRHVFLSP-LIAVGASILLQSATASADEXXXX 93
             NL   K +S ++ ++L+ +  P    FSRR   L+P L+   AS+ L+ + + A E    
Sbjct:    38 NLTSEKNNS-LSIVALSDSDLPSRTAFSRRAFLLAPPLLVSAASLFLKPSVSLASEESSS 96

Query:    94 XXXXXXXXXXX----------XNPETVKAEEVVVSRIYDATVIGEPLAVGKDKRKVWEKL 143
                                    P  V  EE + SRIYDAT IGEP+A+GKDK+KVWEKL
Sbjct:    97 ATVTSPAESAAPPPPPATTTPSPPPPVNKEETITSRIYDATAIGEPMAMGKDKKKVWEKL 156

Query:   144 MNARVVYLGEAEQVPVRDDRELELQIVKNLRKRCVESERTITLALEAFPSDLQDQLNQYT 203
             +NARVVYLGEAEQVP +DD+ELEL+IV+NLRKRCVESER I++ALEAFP DLQDQLNQY 
Sbjct:   157 LNARVVYLGEAEQVPTKDDKELELEIVRNLRKRCVESERQISVALEAFPLDLQDQLNQYM 216

Query:   204 DKRIDGETLKSYASHWPPQRWQEYEPLLSYCRDNGVQLLACGTPLKVLRTVQAEGIHGLS 263
             DKR+DGETLKSY +HWP QRWQEYEPLLSYCRDN V+L+ACGTPLKVLRTVQAEGI GLS
Sbjct:   217 DKRMDGETLKSYVTHWPAQRWQEYEPLLSYCRDNSVRLIACGTPLKVLRTVQAEGIRGLS 276

Query:   264 KADRKLYAPPAXXXXXXXXXXXXHRSSVDMNSLTQSVPFGPSSYLSAQARVVEDYAMSQI 323
             K++RKLY PPA             RS+ DM+  TQ VPFGPSSYLSAQARVVED+ MSQ+
Sbjct:   277 KSERKLYTPPAGSGFISGFSSFSRRSTFDMSLPTQIVPFGPSSYLSAQARVVEDHTMSQV 336

Query:   324 ILKAIMDGGANGMLVVVTGASHVTYGSRGTGLPARISKKLQKKNQVVILLD 374
             IL+A+ DGG  G+L+VVTGASHV YGSRGTGLPARIS+K  KKNQVV+LLD
Sbjct:   337 ILQAVADGGGTGLLLVVTGASHVEYGSRGTGLPARISRKFPKKNQVVVLLD 387


>TAIR|locus:2063136
            symbol:AT2G40400 species:3702 "Arabidopsis thaliana"
            [GO:0009507 "chloroplast" evidence=ISM] [GO:0009543 "chloroplast
            thylakoid lumen" evidence=ISS] [GO:0015995 "chlorophyll
            biosynthetic process" evidence=RCA] EMBL:CP002685 EMBL:AC007020
            InterPro:IPR021825 Pfam:PF11891 ProtClustDB:CLSN2688835
            InterPro:IPR007314 Pfam:PF04187 EMBL:AF410285 EMBL:AY102131
            IPI:IPI00518341 PIR:A84829 RefSeq:NP_565930.1 RefSeq:NP_850329.1
            UniGene:At.14284 ProteinModelPortal:Q9SIY5 STRING:Q9SIY5
            PRIDE:Q9SIY5 EnsemblPlants:AT2G40400.1 EnsemblPlants:AT2G40400.2
            GeneID:818633 KEGG:ath:AT2G40400 TAIR:At2g40400 InParanoid:Q9SIY5
            OMA:IEHRISD PhylomeDB:Q9SIY5 ArrayExpress:Q9SIY5
            Genevestigator:Q9SIY5 Uniprot:Q9SIY5
        Length = 735

 Score = 1088 (388.1 bits), Expect = 3.8e-110, P = 3.8e-110
 Identities = 211/334 (63%), Positives = 259/334 (77%)

Query:    44 HVTTLSLNK--NVPPQSAEFSRRHVFLSP-LIAVGASILLQSATASADEXXXXXXXXXXX 100
             +V TL L+   NV       +RR + ++P L+A  AS+ L  ++A++ E           
Sbjct:    46 NVVTLCLHSHSNVSSSQIAVTRRAILVAPPLLAAAASLFLSISSAASAETSAESVALPPV 105

Query:   101 XXXXXNPETVKAEEVVVSRIYDATVIGEPLAVGKDKRKVWEKLMNARVVYLGEAEQVPVR 160
                   P  V+ EE + SRIYDA+V+GEP+AVGKDK++VWEKL+NAR+VYLGEAEQVP R
Sbjct:   106 ATAPP-PPPVEKEEAITSRIYDASVLGEPMAVGKDKKRVWEKLLNARIVYLGEAEQVPTR 164

Query:   161 DDRELELQIVKNLRKRCVESERTITLALEAFPSDLQDQLNQYTDKRIDGETLKSYASHWP 220
             DD+ LEL+IV+NLRKRC+ES+R ++LALEAFP DLQ+QLNQY DKR+DGE LKSY SHWP
Sbjct:   165 DDKVLELEIVRNLRKRCIESDRQLSLALEAFPLDLQEQLNQYMDKRMDGEVLKSYVSHWP 224

Query:   221 PQRWQEYEPLLSYCRDNGVQLLACGTPLKVLRTVQAEGIHGLSKADRKLYAPPAXXXXXX 280
              QRWQEYEPLLSYCRDNGV+L+ACGTPLKVLRTVQAEGI GLS+++RKLY PPA      
Sbjct:   225 VQRWQEYEPLLSYCRDNGVKLIACGTPLKVLRTVQAEGIRGLSESERKLYTPPAGSGFIS 284

Query:   281 XXXXXXHRSSVDMNSLTQSVPFGPSSYLSAQARVVEDYAMSQIILKAIMDGGANGMLVVV 340
                     SS++MN LTQ VPFGPSSYLSAQARVVED+ MSQ+I++A+ DGG  GMLVVV
Sbjct:   285 GFTSFSRSSSLNMNPLTQIVPFGPSSYLSAQARVVEDHTMSQVIVQAVADGGGTGMLVVV 344

Query:   341 TGASHVTYGSRGTGLPARISKKLQKKNQVVILLD 374
             TGA+HV YGSRGTGLPARIS+K+ KK+Q+V+LLD
Sbjct:   345 TGANHVEYGSRGTGLPARISRKIPKKSQLVVLLD 378


>UNIPROTKB|Q747X6
            symbol:GSU3139 "Uncharacterized protein" species:243231
            "Geobacter sulfurreducens PCA" [GO:0003674 "molecular_function"
            evidence=ND] [GO:0005575 "cellular_component" evidence=ND]
            [GO:0008150 "biological_process" evidence=ND] EMBL:AE017180
            GenomeReviews:AE017180_GR InterPro:IPR007314 Pfam:PF04187
            InterPro:IPR016773 PIRSF:PIRSF020419 RefSeq:NP_954180.1
            GeneID:2688434 KEGG:gsu:GSU3139 PATRIC:22029137
            HOGENOM:HOG000012442 OMA:NTHTEFL ProtClustDB:CLSK829132
            BioCyc:GSUL243231:GH27-3157-MONOMER Uniprot:Q747X6
        Length = 282

 Score = 171 (65.3 bits), Expect = 1.1e-10, P = 1.1e-10
 Identities = 58/248 (23%), Positives = 117/248 (47%)

Query:   134 KDKRKV-WEKLMN----ARVVYLGEAEQVPVRDDRELELQIVKNLRKRCVESERTITLAL 188
             KD++++ +E+++      +V+Y+GE    P   D  L+L+IV+ L +  V     + +A+
Sbjct:    34 KDRKEISFEEMLRDLKAGKVIYVGETHDNPYHHD--LQLRIVRELHRAGVP----LAIAM 87

Query:   189 EAFPSDLQDQLNQYTDKRIDGETLKS-YASHWP-PQRWQEYEPLLSYCRDNGVQLLACGT 246
             E F  + Q++L+++   + D    +  Y  +W  P  W  Y  +L + RD  + L+    
Sbjct:    88 EMFTYESQEELDRWVAGKTDPALFQQIYLKNWNFP--WALYGDILLFARDRRIPLVGLNV 145

Query:   247 PLKVLRTVQAEGIHGLSKADRKLYAPPAXXXXXXXXXXXXHRSSVDMNSLTQSVPFGPSS 306
             P +V R V  +G   LS+ +R+   P               RS  D +  T +  F   +
Sbjct:   146 PREVTRKVARQGFESLSREERRKLPPSITCDVDDAYMAMIRRSYSDHD--TSAKTF--KN 201

Query:   307 YLSAQARVVEDYAMSQIILKAIMDGGANGMLVVVTGASHVTYGSRGTGLPARISKKLQKK 366
             +  AQ  ++ + +M+  +++ + +      +VV+TG+ H   G    G+P ++ ++    
Sbjct:   202 FCEAQ--MLWNKSMAYHLVEYLKNNPGR-TVVVITGSGHAVRG----GMPVQVDREKPGL 254

Query:   367 NQVVILLD 374
                V+L D
Sbjct:   255 ASRVVLPD 262


>TIGR_CMR|GSU_3139
            symbol:GSU_3139 "conserved hypothetical protein"
            species:243231 "Geobacter sulfurreducens PCA" [GO:0003674
            "molecular_function" evidence=ND] [GO:0005575 "cellular_component"
            evidence=ND] [GO:0008150 "biological_process" evidence=ND]
            EMBL:AE017180 GenomeReviews:AE017180_GR InterPro:IPR007314
            Pfam:PF04187 InterPro:IPR016773 PIRSF:PIRSF020419
            RefSeq:NP_954180.1 GeneID:2688434 KEGG:gsu:GSU3139 PATRIC:22029137
            HOGENOM:HOG000012442 OMA:NTHTEFL ProtClustDB:CLSK829132
            BioCyc:GSUL243231:GH27-3157-MONOMER Uniprot:Q747X6
        Length = 282

 Score = 171 (65.3 bits), Expect = 1.1e-10, P = 1.1e-10
 Identities = 58/248 (23%), Positives = 117/248 (47%)

Query:   134 KDKRKV-WEKLMN----ARVVYLGEAEQVPVRDDRELELQIVKNLRKRCVESERTITLAL 188
             KD++++ +E+++      +V+Y+GE    P   D  L+L+IV+ L +  V     + +A+
Sbjct:    34 KDRKEISFEEMLRDLKAGKVIYVGETHDNPYHHD--LQLRIVRELHRAGVP----LAIAM 87

Query:   189 EAFPSDLQDQLNQYTDKRIDGETLKS-YASHWP-PQRWQEYEPLLSYCRDNGVQLLACGT 246
             E F  + Q++L+++   + D    +  Y  +W  P  W  Y  +L + RD  + L+    
Sbjct:    88 EMFTYESQEELDRWVAGKTDPALFQQIYLKNWNFP--WALYGDILLFARDRRIPLVGLNV 145

Query:   247 PLKVLRTVQAEGIHGLSKADRKLYAPPAXXXXXXXXXXXXHRSSVDMNSLTQSVPFGPSS 306
             P +V R V  +G   LS+ +R+   P               RS  D +  T +  F   +
Sbjct:   146 PREVTRKVARQGFESLSREERRKLPPSITCDVDDAYMAMIRRSYSDHD--TSAKTF--KN 201

Query:   307 YLSAQARVVEDYAMSQIILKAIMDGGANGMLVVVTGASHVTYGSRGTGLPARISKKLQKK 366
             +  AQ  ++ + +M+  +++ + +      +VV+TG+ H   G    G+P ++ ++    
Sbjct:   202 FCEAQ--MLWNKSMAYHLVEYLKNNPGR-TVVVITGSGHAVRG----GMPVQVDREKPGL 254

Query:   367 NQVVILLD 374
                V+L D
Sbjct:   255 ASRVVLPD 262


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.316   0.131   0.375    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      387       359   0.00081  117 3  11 22  0.41    34
                                                     34  0.45    37
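
Note on the reported scores: the bit scores shown in parentheses in the alignment headers above appear to follow from the raw scores through the standard Karlin-Altschul conversion, using the gapped "As Used" values for Q=9,R=2 (Lambda = 0.244, K = 0.0300). The sketch below reproduces that arithmetic. It is an illustration only, not WU-BLAST source code: the function names are hypothetical, and the choice of the gapped row is an assumption that happens to match the printed values. The P = 1 - exp(-E) relation is the usual Poisson conversion and is assumed here as well; it is consistent with the report showing identical Expect and P values for these very small E.

```python
import math

# Gapped Karlin-Altschul parameters copied from the "As Used" row (Q=9,R=2).
# Using this row rather than the ungapped BLOSUM62 row is an assumption.
LAMBDA_GAPPED = 0.244
K_GAPPED = 0.0300


def bit_score(raw_score, lam=LAMBDA_GAPPED, k=K_GAPPED):
    """Convert a raw alignment score to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def p_from_expect(expect):
    """Poisson conversion P = 1 - exp(-E), computed stably for tiny E."""
    return -math.expm1(-expect)


if __name__ == "__main__":
    # Raw scores taken from the hit list in this report.
    for raw in (1094, 1088, 171):
        print(raw, round(bit_score(raw), 1))
    # For very small E, P and E are numerically indistinguishable.
    print(p_from_expect(1.1e-10))
```

Rounded to one decimal place, the three raw scores give 390.2, 388.1 and 65.3 bits, matching the values printed in the alignment headers.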


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  4
  No. of states in DFA:  610 (65 KB)
  Total size of DFA:  219 KB (2121 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  27.69u 0.10s 27.79t   Elapsed:  00:00:01
  Total cpu time:  27.69u 0.10s 27.79t   Elapsed:  00:00:01
  Start:  Fri May 10 18:10:16 2013   End:  Fri May 10 18:10:17 2013
