BLAST Results

Query Summary

Your job contains 1 sequence.

Parameters
Threshold: 0.001
Maximum number of alignments shown: 100
BLAST filter: on

Query Sequence

>034203
MLLELYLFLMWLITLFQSQDVARDFGAACTPEFFLFKKDGRRPFQLVYHGQFDDSRPSNN
LPVTGRDIRLAIECVLSGQPVSSNQKPSVGCSIKWHPQTVQ

High Scoring Gene Products

Symbol, full name                                    Information                                        P value
AT1G21350                                            protein from Arabidopsis thaliana                  2.2e-34
CBU_1278, Thiol-disulfide isomerase and thioredoxin  protein from Coxiella burnetii RSA 493             7.7e-16
CBU_1278, conserved hypothetical protein             protein from Coxiella burnetii RSA 493             7.7e-16
APH_1306, Putative uncharacterized protein           protein from Anaplasma phagocytophilum HZ          2.7e-13
APH_1306, conserved hypothetical protein             protein from Anaplasma phagocytophilum HZ          2.7e-13
ECH_0147, Putative uncharacterized protein           protein from Ehrlichia chaffeensis str. Arkansas   4.5e-11
ECH_0147, conserved hypothetical protein             protein from Ehrlichia chaffeensis str. Arkansas   4.5e-11
MCA2020, Putative uncharacterized protein            protein from Methylococcus capsulatus str. Bath    3.6e-09
NSE_0884, Putative uncharacterized protein           protein from Neorickettsia sennetsu str. Miyayama  5.9e-09
NSE_0884, conserved hypothetical protein             protein from Neorickettsia sennetsu str. Miyayama  5.9e-09


Raw Blast Data

BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]

Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.

Reference:  Gish, W. (1996-2006) http://blast.wustl.edu

Query=  034203
        (101 letters)

Database:  go_20130330-seqdb.fasta
           368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done

                                                                     Smallest
                                                                       Sum
                                                              High  Probability
Sequences producing High-scoring Segment Pairs:              Score  P(N)      N

TAIR|locus:2027052 - symbol:AT1G21350 species:3702 "Arabi...   373  2.2e-34   1
UNIPROTKB|Q83C53 - symbol:CBU_1278 "Thiol-disulfide isome...   198  7.7e-16   1
TIGR_CMR|CBU_1278 - symbol:CBU_1278 "conserved hypothetic...   198  7.7e-16   1
UNIPROTKB|Q2GII1 - symbol:APH_1306 "Putative uncharacteri...   174  2.7e-13   1
TIGR_CMR|APH_1306 - symbol:APH_1306 "conserved hypothetic...   174  2.7e-13   1
UNIPROTKB|Q2GHV8 - symbol:ECH_0147 "Putative uncharacteri...   153  4.5e-11   1
TIGR_CMR|ECH_0147 - symbol:ECH_0147 "conserved hypothetic...   153  4.5e-11   1
UNIPROTKB|Q606J6 - symbol:MCA2020 "Putative uncharacteriz...   135  3.6e-09   1
UNIPROTKB|Q2GCP6 - symbol:NSE_0884 "Putative uncharacteri...   133  5.9e-09   1
TIGR_CMR|NSE_0884 - symbol:NSE_0884 "conserved hypothetic...   133  5.9e-09   1
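
The one-line hit summaries above follow a regular layout: a database-qualified identifier, a symbol, a truncated description, then the score, P(N), and N columns. As a minimal sketch, assuming every summary line keeps this layout, the lines could be parsed in Python as follows. The function name `parse_hit_line` and the returned field names are hypothetical and are not part of the BLAST output.

```python
import re

# Matches the leading "DB|accession - symbol:SYMBOL" part of a summary line.
HEAD_RE = re.compile(r'^(?P<db>[^|\s]+)\|(?P<acc>\S+) - symbol:(?P<symbol>\S+)')


def parse_hit_line(line):
    """Split one hit-summary line into its fields (hypothetical helper)."""
    # The last three whitespace-separated columns are Score, P(N) and N.
    head, score, p_value, n = line.rsplit(None, 3)
    match = HEAD_RE.match(head)
    if match is None:
        raise ValueError("unrecognized hit line: %r" % line)
    return {
        "db": match.group("db"),
        "accession": match.group("acc"),
        "symbol": match.group("symbol"),
        "score": int(score),
        "p_value": float(p_value),
        "n": int(n),
    }


example = 'TAIR|locus:2027052 - symbol:AT1G21350 species:3702 "Arabi...   373  2.2e-34   1'
hit = parse_hit_line(example)
print(hit["symbol"], hit["score"], hit["p_value"])
```

Splitting from the right keeps the sketch independent of the truncated description text, which may itself contain spaces and quotes.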


>TAIR|locus:2027052
            symbol:AT1G21350 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0005634
            "nucleus" evidence=ISM] [GO:0016209 "antioxidant activity"
            evidence=IEA] [GO:0016491 "oxidoreductase activity" evidence=IEA]
            [GO:0055114 "oxidation-reduction process" evidence=IEA] [GO:0009507
            "chloroplast" evidence=IDA] [GO:0010027 "thylakoid membrane
            organization" evidence=RCA] InterPro:IPR000866 Pfam:PF00578
            EMBL:CP002684 GO:GO:0009507 GO:GO:0016491 Gene3D:3.40.30.10
            InterPro:IPR012336 SUPFAM:SSF52833 GO:GO:0016209 EMBL:AC015447
            PROSITE:PS51352 EMBL:AK317531 IPI:IPI00542371 PIR:F86346
            RefSeq:NP_973876.1 UniGene:At.41661 ProteinModelPortal:Q9LPL8
            SMR:Q9LPL8 STRING:Q9LPL8 PRIDE:Q9LPL8 EnsemblPlants:AT1G21350.3
            GeneID:838734 KEGG:ath:AT1G21350 TAIR:At1g21350
            HOGENOM:HOG000004976 InParanoid:Q9LPL8 OMA:AYTAACT PhylomeDB:Q9LPL8
            ProtClustDB:CLSN2680102 Genevestigator:Q9LPL8 Uniprot:Q9LPL8
        Length = 252

 Score = 373 (136.4 bits), Expect = 2.2e-34, P = 2.2e-34
 Identities = 66/83 (79%), Positives = 76/83 (91%)

Query:    17 QSQDVARDFGAACTPEFFLFKKDGRRPFQLVYHGQFDDSRPSNNLPVTGRDIRLAIECVL 76
             +SQ+VAR+FGA CTPEFFL+KKDGRRPF+LVYHGQFDDSRPS+N PVTGRD+ LAI+  L
Sbjct:   168 ESQEVAREFGAVCTPEFFLYKKDGRRPFELVYHGQFDDSRPSSNSPVTGRDLSLAIDLSL 227

Query:    77 SGQPVSSNQKPSVGCSIKWHPQT 99
             S QP+ SNQKPSVGCSIKWHP+T
Sbjct:   228 SCQPIPSNQKPSVGCSIKWHPET 250


>UNIPROTKB|Q83C53
            symbol:CBU_1278 "Thiol-disulfide isomerase and thioredoxin"
            species:227377 "Coxiella burnetii RSA 493" [GO:0003674
            "molecular_function" evidence=ND] [GO:0005575 "cellular_component"
            evidence=ND] [GO:0008150 "biological_process" evidence=ND]
            InterPro:IPR000866 Pfam:PF00578 GO:GO:0016491 Gene3D:3.40.30.10
            InterPro:IPR012336 SUPFAM:SSF52833 GO:GO:0016209 GO:GO:0016853
            EMBL:AE016828 GenomeReviews:AE016828_GR PROSITE:PS51352
            HOGENOM:HOG000004976 OMA:AYTAACT RefSeq:NP_820271.1
            ProteinModelPortal:Q83C53 SMR:Q83C53 PRIDE:Q83C53 GeneID:1209183
            KEGG:cbu:CBU_1278 PATRIC:17931291 ProtClustDB:CLSK914662
            BioCyc:CBUR227377:GJ7S-1265-MONOMER Uniprot:Q83C53
        Length = 185

 Score = 198 (74.8 bits), Expect = 7.7e-16, P = 7.7e-16
 Identities = 37/82 (45%), Positives = 56/82 (68%)

Query:    15 LF-QSQDVARDFGAACTPEFFLFKKDGRRPFQLVYHGQFDDSRPSNNLPVTGRDIRLAIE 73
             LF +SQ++A+ + A CTP+F++F K+       VY G+FD + P  + PVTG D+R A++
Sbjct:   106 LFDESQEIAKAYQAECTPDFYVFDKN----LACVYRGRFDSATPGRDTPVTGEDLRSALD 161

Query:    74 CVLSGQPVSSNQKPSVGCSIKW 95
              +L+G  V  NQ+PS GC+IKW
Sbjct:   162 NILAGNLVDPNQQPSQGCNIKW 183


>TIGR_CMR|CBU_1278
            symbol:CBU_1278 "conserved hypothetical protein"
            species:227377 "Coxiella burnetii RSA 493" [GO:0008150
            "biological_process" evidence=ND] [GO:0003674 "molecular_function"
            evidence=ND] [GO:0005575 "cellular_component" evidence=ND]
            InterPro:IPR000866 Pfam:PF00578 GO:GO:0016491 Gene3D:3.40.30.10
            InterPro:IPR012336 SUPFAM:SSF52833 GO:GO:0016209 GO:GO:0016853
            EMBL:AE016828 GenomeReviews:AE016828_GR PROSITE:PS51352
            HOGENOM:HOG000004976 OMA:AYTAACT RefSeq:NP_820271.1
            ProteinModelPortal:Q83C53 SMR:Q83C53 PRIDE:Q83C53 GeneID:1209183
            KEGG:cbu:CBU_1278 PATRIC:17931291 ProtClustDB:CLSK914662
            BioCyc:CBUR227377:GJ7S-1265-MONOMER Uniprot:Q83C53
        Length = 185

 Score = 198 (74.8 bits), Expect = 7.7e-16, P = 7.7e-16
 Identities = 37/82 (45%), Positives = 56/82 (68%)

Query:    15 LF-QSQDVARDFGAACTPEFFLFKKDGRRPFQLVYHGQFDDSRPSNNLPVTGRDIRLAIE 73
             LF +SQ++A+ + A CTP+F++F K+       VY G+FD + P  + PVTG D+R A++
Sbjct:   106 LFDESQEIAKAYQAECTPDFYVFDKN----LACVYRGRFDSATPGRDTPVTGEDLRSALD 161

Query:    74 CVLSGQPVSSNQKPSVGCSIKW 95
              +L+G  V  NQ+PS GC+IKW
Sbjct:   162 NILAGNLVDPNQQPSQGCNIKW 183


>UNIPROTKB|Q2GII1
            symbol:APH_1306 "Putative uncharacterized protein"
            species:212042 "Anaplasma phagocytophilum HZ" [GO:0003674
            "molecular_function" evidence=ND] [GO:0005575 "cellular_component"
            evidence=ND] [GO:0008150 "biological_process" evidence=ND]
            InterPro:IPR000866 Pfam:PF00578 GO:GO:0016491 Gene3D:3.40.30.10
            InterPro:IPR012336 SUPFAM:SSF52833 GO:GO:0016209 EMBL:CP000235
            GenomeReviews:CP000235_GR PROSITE:PS51352 eggNOG:COG0526
            HOGENOM:HOG000004976 OMA:AYGAVCT RefSeq:YP_505820.1
            ProteinModelPortal:Q2GII1 STRING:Q2GII1 GeneID:3930924
            KEGG:aph:APH_1306 PATRIC:20951408 ProtClustDB:CLSK747424
            BioCyc:APHA212042:GHPM-1308-MONOMER Uniprot:Q2GII1
        Length = 190

 Score = 174 (66.3 bits), Expect = 2.7e-13, P = 2.7e-13
 Identities = 34/79 (43%), Positives = 49/79 (62%)

Query:    17 QSQDVARDFGAACTPEFFLFKKDGRRPFQLVYHGQFDDSRPSNNLPVTGRDIRLAIECVL 76
             ++QDVAR +GA CTP+FF F +D     QL Y G+FDD +      +   ++  A++ + 
Sbjct:   109 ETQDVARSYGAVCTPDFFCFNRD----LQLCYRGRFDDQKAVEGA-LGASELYEAVKFIA 163

Query:    77 SGQPVSSNQKPSVGCSIKW 95
             +   V  NQKPS+GCSIKW
Sbjct:   164 TTGGVPENQKPSIGCSIKW 182


>TIGR_CMR|APH_1306
            symbol:APH_1306 "conserved hypothetical protein"
            species:212042 "Anaplasma phagocytophilum HZ" [GO:0003674
            "molecular_function" evidence=ND] [GO:0005575 "cellular_component"
            evidence=ND] [GO:0008150 "biological_process" evidence=ND]
            InterPro:IPR000866 Pfam:PF00578 GO:GO:0016491 Gene3D:3.40.30.10
            InterPro:IPR012336 SUPFAM:SSF52833 GO:GO:0016209 EMBL:CP000235
            GenomeReviews:CP000235_GR PROSITE:PS51352 eggNOG:COG0526
            HOGENOM:HOG000004976 OMA:AYGAVCT RefSeq:YP_505820.1
            ProteinModelPortal:Q2GII1 STRING:Q2GII1 GeneID:3930924
            KEGG:aph:APH_1306 PATRIC:20951408 ProtClustDB:CLSK747424
            BioCyc:APHA212042:GHPM-1308-MONOMER Uniprot:Q2GII1
        Length = 190

 Score = 174 (66.3 bits), Expect = 2.7e-13, P = 2.7e-13
 Identities = 34/79 (43%), Positives = 49/79 (62%)

Query:    17 QSQDVARDFGAACTPEFFLFKKDGRRPFQLVYHGQFDDSRPSNNLPVTGRDIRLAIECVL 76
             ++QDVAR +GA CTP+FF F +D     QL Y G+FDD +      +   ++  A++ + 
Sbjct:   109 ETQDVARSYGAVCTPDFFCFNRD----LQLCYRGRFDDQKAVEGA-LGASELYEAVKFIA 163

Query:    77 SGQPVSSNQKPSVGCSIKW 95
             +   V  NQKPS+GCSIKW
Sbjct:   164 TTGGVPENQKPSIGCSIKW 182


>UNIPROTKB|Q2GHV8
            symbol:ECH_0147 "Putative uncharacterized protein"
            species:205920 "Ehrlichia chaffeensis str. Arkansas" [GO:0003674
            "molecular_function" evidence=ND] [GO:0005575 "cellular_component"
            evidence=ND] [GO:0008150 "biological_process" evidence=ND]
            InterPro:IPR000866 Pfam:PF00578 EMBL:CP000236
            GenomeReviews:CP000236_GR GO:GO:0016491 Gene3D:3.40.30.10
            InterPro:IPR012336 SUPFAM:SSF52833 GO:GO:0016209 PROSITE:PS51352
            eggNOG:COG0526 HOGENOM:HOG000004976 OMA:AYGAVCT RefSeq:YP_506975.1
            ProteinModelPortal:Q2GHV8 STRING:Q2GHV8 GeneID:3927933
            KEGG:ech:ECH_0147 PATRIC:20575837 ProtClustDB:CLSK749068
            BioCyc:ECHA205920:GJNR-147-MONOMER Uniprot:Q2GHV8
        Length = 193

 Score = 153 (58.9 bits), Expect = 4.5e-11, P = 4.5e-11
 Identities = 34/83 (40%), Positives = 47/83 (56%)

Query:    17 QSQDVARDFGAACTPEFFLFKKDGRRPFQLVYHGQFDDSRPSN-NLPVTGRDIRLAIECV 75
             ++Q VAR++GA CTP+FF F        +L Y G+FD S  +  N     RD+  A++ +
Sbjct:   109 ENQTVARNYGAVCTPDFFGFNNK----LELCYRGRFDASGKNQMNSKQEDRDLFNAMKLI 164

Query:    76 LSGQPVSSNQKPSVGCSIKWHPQ 98
                     NQKPS+GCSIKW  Q
Sbjct:   165 SKTGESPENQKPSIGCSIKWKSQ 187


>TIGR_CMR|ECH_0147
            symbol:ECH_0147 "conserved hypothetical protein"
            species:205920 "Ehrlichia chaffeensis str. Arkansas" [GO:0003674
            "molecular_function" evidence=ND] [GO:0005575 "cellular_component"
            evidence=ND] [GO:0008150 "biological_process" evidence=ND]
            InterPro:IPR000866 Pfam:PF00578 EMBL:CP000236
            GenomeReviews:CP000236_GR GO:GO:0016491 Gene3D:3.40.30.10
            InterPro:IPR012336 SUPFAM:SSF52833 GO:GO:0016209 PROSITE:PS51352
            eggNOG:COG0526 HOGENOM:HOG000004976 OMA:AYGAVCT RefSeq:YP_506975.1
            ProteinModelPortal:Q2GHV8 STRING:Q2GHV8 GeneID:3927933
            KEGG:ech:ECH_0147 PATRIC:20575837 ProtClustDB:CLSK749068
            BioCyc:ECHA205920:GJNR-147-MONOMER Uniprot:Q2GHV8
        Length = 193

 Score = 153 (58.9 bits), Expect = 4.5e-11, P = 4.5e-11
 Identities = 34/83 (40%), Positives = 47/83 (56%)

Query:    17 QSQDVARDFGAACTPEFFLFKKDGRRPFQLVYHGQFDDSRPSN-NLPVTGRDIRLAIECV 75
             ++Q VAR++GA CTP+FF F        +L Y G+FD S  +  N     RD+  A++ +
Sbjct:   109 ENQTVARNYGAVCTPDFFGFNNK----LELCYRGRFDASGKNQMNSKQEDRDLFNAMKLI 164

Query:    76 LSGQPVSSNQKPSVGCSIKWHPQ 98
                     NQKPS+GCSIKW  Q
Sbjct:   165 SKTGESPENQKPSIGCSIKWKSQ 187


>UNIPROTKB|Q606J6
            symbol:MCA2020 "Putative uncharacterized protein"
            species:243233 "Methylococcus capsulatus str. Bath" [GO:0003674
            "molecular_function" evidence=ND] [GO:0005575 "cellular_component"
            evidence=ND] [GO:0008150 "biological_process" evidence=ND]
            InterPro:IPR000866 Pfam:PF00578 GO:GO:0016491 Gene3D:3.40.30.10
            InterPro:IPR012336 SUPFAM:SSF52833 GO:GO:0016209 PROSITE:PS51352
            EMBL:AE017282 GenomeReviews:AE017282_GR HOGENOM:HOG000004976
            OMA:AYGAVCT RefSeq:YP_114452.1 ProteinModelPortal:Q606J6
            GeneID:3104830 KEGG:mca:MCA2020 PATRIC:22607890
            ProtClustDB:CLSK2765679 Uniprot:Q606J6
        Length = 185

 Score = 135 (52.6 bits), Expect = 3.6e-09, P = 3.6e-09
 Identities = 31/81 (38%), Positives = 46/81 (56%)

Query:    15 LFQSQDVARDFGAACTPEFFLFKKDGRRPFQLVYHGQFDDSRPSNNLPVTGRDIRLAIEC 74
             L ++Q VAR +GA CTP+FF +  D     +L Y G+ D SR   + P   R++  A++ 
Sbjct:   106 LDETQAVARAYGAVCTPDFFGYNAD----LELQYRGRLDASRREASPPGCRRELYEAMKQ 161

Query:    75 VLSGQPVSSNQKPSVGCSIKW 95
             +       + Q PS+GCSIKW
Sbjct:   162 IAETGKGPAEQIPSMGCSIKW 182


>UNIPROTKB|Q2GCP6
            symbol:NSE_0884 "Putative uncharacterized protein"
            species:222891 "Neorickettsia sennetsu str. Miyayama" [GO:0003674
            "molecular_function" evidence=ND] [GO:0005575 "cellular_component"
            evidence=ND] [GO:0008150 "biological_process" evidence=ND]
            InterPro:IPR000866 Pfam:PF00578 GO:GO:0016491 Gene3D:3.40.30.10
            InterPro:IPR012336 SUPFAM:SSF52833 GO:GO:0016209 PROSITE:PS51352
            EMBL:CP000237 GenomeReviews:CP000237_GR eggNOG:COG0526
            HOGENOM:HOG000004976 RefSeq:YP_506750.1 ProteinModelPortal:Q2GCP6
            STRING:Q2GCP6 GeneID:3931649 KEGG:nse:NSE_0884 PATRIC:22681759
            OMA:AYGAVCT ProtClustDB:CLSK2528180
            BioCyc:NSEN222891:GHFU-887-MONOMER Uniprot:Q2GCP6
        Length = 176

 Score = 133 (51.9 bits), Expect = 5.9e-09, P = 5.9e-09
 Identities = 31/77 (40%), Positives = 43/77 (55%)

Query:    19 QDVARDFGAACTPEFFLFKKDGRRPFQLVYHGQFDDSRPSNNLPVTGRDIRLAIECVLSG 78
             Q +A  +GA CTP+FF F ++     +L Y GQF D     +      D+  A+  +  G
Sbjct:   109 QRIAASYGAVCTPDFFGFDRN----LELQYRGQFIDLASGTH------DLLDAMLEIAGG 158

Query:    79 QPVSSNQKPSVGCSIKW 95
             + VS  QKPS+GCSIKW
Sbjct:   159 RKVSRKQKPSIGCSIKW 175


>TIGR_CMR|NSE_0884
            symbol:NSE_0884 "conserved hypothetical protein"
            species:222891 "Neorickettsia sennetsu str. Miyayama" [GO:0003674
            "molecular_function" evidence=ND] [GO:0005575 "cellular_component"
            evidence=ND] [GO:0008150 "biological_process" evidence=ND]
            InterPro:IPR000866 Pfam:PF00578 GO:GO:0016491 Gene3D:3.40.30.10
            InterPro:IPR012336 SUPFAM:SSF52833 GO:GO:0016209 PROSITE:PS51352
            EMBL:CP000237 GenomeReviews:CP000237_GR eggNOG:COG0526
            HOGENOM:HOG000004976 RefSeq:YP_506750.1 ProteinModelPortal:Q2GCP6
            STRING:Q2GCP6 GeneID:3931649 KEGG:nse:NSE_0884 PATRIC:22681759
            OMA:AYGAVCT ProtClustDB:CLSK2528180
            BioCyc:NSEN222891:GHFU-887-MONOMER Uniprot:Q2GCP6
        Length = 176

 Score = 133 (51.9 bits), Expect = 5.9e-09, P = 5.9e-09
 Identities = 31/77 (40%), Positives = 43/77 (55%)

Query:    19 QDVARDFGAACTPEFFLFKKDGRRPFQLVYHGQFDDSRPSNNLPVTGRDIRLAIECVLSG 78
             Q +A  +GA CTP+FF F ++     +L Y GQF D     +      D+  A+  +  G
Sbjct:   109 QRIAASYGAVCTPDFFGFDRN----LELQYRGQFIDLASGTH------DLLDAMLEIAGG 158

Query:    79 QPVSSNQKPSVGCSIKW 95
             + VS  QKPS+GCSIKW
Sbjct:   159 RKVSRKQKPSIGCSIKW 175


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.323   0.138   0.435    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a
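
The bit scores printed next to each raw score in this report appear to follow the standard conversion bits = (Lambda * S - ln K) / ln 2, using the Lambda and K values from the gapped (Q=9,R=2) row above. That choice of row is an assumption inferred from the reported numbers, not something the output states. The sketch below, with the hypothetical function name `bit_score`, recomputes the bit scores for the raw scores listed in this report.

```python
import math

# Values copied from the "Q=9,R=2" row of the parameter table (assumed to be
# the ones used for the reported bit scores).
LAMBDA_GAPPED = 0.244
K_GAPPED = 0.0300


def bit_score(raw_score, lam=LAMBDA_GAPPED, k=K_GAPPED):
    """Convert a raw alignment score S to bits: (lam * S - ln k) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


# Raw scores taken from the "Score = ..." lines of this report.
for raw in (373, 198, 174, 153, 135, 133):
    print(raw, round(bit_score(raw), 1))
```

For the top hit, a raw score of 373 works out to 136.4 bits under these assumptions, matching the value shown on its "Score" line.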

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      101        90   0.00091  102 3  11 22  0.45    29
                                                     29  0.39    31


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  10
  No. of states in DFA:  569 (61 KB)
  Total size of DFA:  116 KB (2076 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  10.17u 0.14s 10.31t   Elapsed:  00:00:01
  Total cpu time:  10.17u 0.14s 10.31t   Elapsed:  00:00:01
  Start:  Fri May 10 17:32:55 2013   End:  Fri May 10 17:32:56 2013
