BLAST Results

Query Summary

Your job contains 1 sequence.

Parameters
Threshold: 0.001
Maximum number of alignments shown: 100
BLAST filter: on

Query Sequence

>027155
MASSAKEMMPKIDKSGRFCSPRAARELALLVVYAACLEGSDPIRLFEKRLNSRREPGYEF
DKSSLLEYNHMSFGGPPVTTETVEEADELLRSDEEESAIEAEVLSAPPKLVYSKLLLRFT
RKLLVAVVDKWDAHVHIIDKVVPPIWKDQPAGRILELSILHLAMSEITVVGTRHQIVINE
AVDLAKRFCDGAAPRIINGCLRTFVRNLEGTANIEASKASKEVPSEV

High Scoring Gene Products

Symbol     Full name                                              Information                                                P value
AT4G26370                                                         protein from Arabidopsis thaliana                          1.4e-87
nusB       N utilization substance protein B homolog              protein from Streptococcus pneumoniae TIGR4                1.8e-05
nusB       N utilization substance protein B homolog              protein from Vibrio cholerae O1 biovar El Tor str. N16961  3.9e-05
VC_2267    N utilization substance protein B                      protein from Vibrio cholerae O1 biovar El Tor              3.9e-05
CJE_0431   transcription antitermination factor NusB              protein from Campylobacter jejuni RM1221                   0.00011
CPS_1532   transcription termination/antitermination factor NusB  protein from Colwellia psychrerythraea 34H                 0.00017
GSU_1692   N utilization substance protein B                      protein from Geobacter sulfurreducens PCA                  0.00019


Raw Blast Data

BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]

Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.

Reference:  Gish, W. (1996-2006) http://blast.wustl.edu

Query=  027155
        (227 letters)

Database:  go_20130330-seqdb.fasta
           368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done

                                                                     Smallest
                                                                       Sum
                                                              High  Probability
Sequences producing High-scoring Segment Pairs:              Score  P(N)      N

TAIR|locus:2131428 - symbol:AT4G26370 species:3702 "Arabi...   875  1.4e-87   1
UNIPROTKB|P65582 - symbol:nusB "N utilization substance p...   104  1.8e-05   1
UNIPROTKB|Q9KPU5 - symbol:nusB "N utilization substance p...   109  3.9e-05   1
TIGR_CMR|VC_2267 - symbol:VC_2267 "N utilization substanc...   109  3.9e-05   1
TIGR_CMR|CJE_0431 - symbol:CJE_0431 "transcription antite...    97  0.00011   1
TIGR_CMR|CPS_1532 - symbol:CPS_1532 "transcription termin...    89  0.00017   2
TIGR_CMR|GSU_1692 - symbol:GSU_1692 "N utilization substa...    90  0.00019   2
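
The summary lines above have a regular layout: a free-text description followed by three whitespace-separated fields (Score, P(N), N). The following is a minimal sketch, not part of the BLAST output, of how they could be read into records in Python. The names `Hit` and `parse_hit_line` are hypothetical, and the sketch assumes the description is always truncated so that the last three tokens on the line are the numeric fields.

```python
import re
from typing import NamedTuple


class Hit(NamedTuple):
    db_id: str      # database identifier, e.g. "TAIR|locus:2131428"
    symbol: str     # value of the "symbol:" field in the description
    score: int      # raw high score
    p_value: float  # smallest sum probability P(N)
    n: int          # number of segment pairs used for P(N)


def parse_hit_line(line: str) -> Hit:
    """Parse one line of the 'Sequences producing High-scoring Segment Pairs' table."""
    # The last three whitespace-separated tokens are Score, P(N) and N.
    head, score, p_value, n = line.rsplit(None, 3)
    db_id = head.split(" - ", 1)[0]
    match = re.search(r"symbol:(\S+)", head)
    symbol = match.group(1) if match else ""
    return Hit(db_id, symbol, int(score), float(p_value), int(n))


# The seven summary lines, copied from the report above.
SUMMARY = """\
TAIR|locus:2131428 - symbol:AT4G26370 species:3702 "Arabi...   875  1.4e-87   1
UNIPROTKB|P65582 - symbol:nusB "N utilization substance p...   104  1.8e-05   1
UNIPROTKB|Q9KPU5 - symbol:nusB "N utilization substance p...   109  3.9e-05   1
TIGR_CMR|VC_2267 - symbol:VC_2267 "N utilization substanc...   109  3.9e-05   1
TIGR_CMR|CJE_0431 - symbol:CJE_0431 "transcription antite...    97  0.00011   1
TIGR_CMR|CPS_1532 - symbol:CPS_1532 "transcription termin...    89  0.00017   2
TIGR_CMR|GSU_1692 - symbol:GSU_1692 "N utilization substa...    90  0.00019   2
"""

hits = [parse_hit_line(line) for line in SUMMARY.splitlines() if line.strip()]
for hit in hits:
    print(hit.db_id, hit.symbol, hit.score, hit.p_value, hit.n)
```

Splitting from the right avoids having to interpret the truncated description, which may itself contain spaces and quotes.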


>TAIR|locus:2131428
            symbol:AT4G26370 species:3702 "Arabidopsis thaliana"
            [GO:0003723 "RNA binding" evidence=IEA;ISS] [GO:0005739
            "mitochondrion" evidence=ISM] [GO:0006355 "regulation of
            transcription, DNA-dependent" evidence=IEA;ISS] [GO:0009507
            "chloroplast" evidence=ISM;IDA] [GO:0009073 "aromatic amino acid
            family biosynthetic process" evidence=RCA] [GO:0010103 "stomatal
            complex morphogenesis" evidence=RCA] [GO:0016226 "iron-sulfur
            cluster assembly" evidence=RCA] [GO:0042793 "transcription from
            plastid promoter" evidence=RCA] [GO:0045893 "positive regulation of
            transcription, DNA-dependent" evidence=RCA] InterPro:IPR006027
            Pfam:PF01029 GO:GO:0009507 EMBL:CP002687 GO:GO:0006355
            GO:GO:0003723 Gene3D:1.10.940.10 SUPFAM:SSF48013 EMBL:AY054564
            EMBL:BT008707 IPI:IPI00535428 RefSeq:NP_567745.1 UniGene:At.2177
            ProteinModelPortal:Q93XY7 SMR:Q93XY7 STRING:Q93XY7 PRIDE:Q93XY7
            EnsemblPlants:AT4G26370.1 GeneID:828743 KEGG:ath:AT4G26370
            TAIR:At4g26370 HOGENOM:HOG000240288 InParanoid:Q93XY7 OMA:PPKLVYS
            PhylomeDB:Q93XY7 ProtClustDB:CLSN2689621 Genevestigator:Q93XY7
            Uniprot:Q93XY7
        Length = 301

 Score = 875 (313.1 bits), Expect = 1.4e-87, P = 1.4e-87
 Identities = 170/223 (76%), Positives = 193/223 (86%)

Query:     2 ASSAKEM-MPKIDKSGRFCSPRAARELALLVVYAACLEGSDPIRLFEKRLNSRREPGYEF 60
             A   K++ MPKIDKSGR  SPRAARELAL+++YAACLEGSDPIRLFEKR+N+RREPGYEF
Sbjct:    77 AEEVKDVPMPKIDKSGRLSSPRAARELALVILYAACLEGSDPIRLFEKRINARREPGYEF 136

Query:    61 DKSSLLEYNHMSFGGPPVTTETVEEADELLRSDEEESAIEAEVLSAPPKLVYSKLLLRFT 120
             DKSSLLEYNHMSFGGPPV TET EE DEL+R DE+ES IEAEVLSAPPKLVYSKL+LRF 
Sbjct:   137 DKSSLLEYNHMSFGGPPVKTETKEEEDELVRHDEKESKIEAEVLSAPPKLVYSKLVLRFA 196

Query:   121 RKLLVAVVDKWDAHVHIIDKVVPPIWKDQPAGRILELSILHLAMSEITVVGTRHQIVINE 180
             +KLL AVVDKWD+HV II+K+ PP WK  PAGRILE SILHLAMSE+ V+ TRH IVINE
Sbjct:   197 KKLLAAVVDKWDSHVVIIEKISPPDWKSAPAGRILEFSILHLAMSEVAVLETRHPIVINE 256

Query:   181 AVDLAKRFCDGAAPRIINGCLRTFVRNLEGTANIEASKASKEV 223
             AVDLAKRFCDG+APRIINGCLRTFV++   T+  +A +  +EV
Sbjct:   257 AVDLAKRFCDGSAPRIINGCLRTFVKDRATTSTPQALELKQEV 299


>UNIPROTKB|P65582
            symbol:nusB "N utilization substance protein B homolog"
            species:170187 "Streptococcus pneumoniae TIGR4" [GO:0005515
            "protein binding" evidence=IPI] HAMAP:MF_00073 InterPro:IPR006027
            InterPro:IPR011605 Pfam:PF01029 GO:GO:0006355 GO:GO:0003723
            EMBL:AE005672 GenomeReviews:AE005672_GR GO:GO:0006353
            eggNOG:COG0781 HOGENOM:HOG000281867 KO:K03625 ProtClustDB:PRK00202
            Gene3D:1.10.940.10 PANTHER:PTHR11078 SUPFAM:SSF48013
            TIGRFAMs:TIGR01951 PIR:B95050 RefSeq:NP_344955.1
            ProteinModelPortal:P65582 EnsemblBacteria:EBSTRT00000026414
            GeneID:930369 KEGG:spn:SP_0433 PATRIC:19705211 OMA:TGVKDHE
            Uniprot:P65582
        Length = 140

 Score = 104 (41.7 bits), Expect = 1.8e-05, P = 1.8e-05
 Identities = 22/52 (42%), Positives = 32/52 (61%)

Query:   154 ILELSILHLAMSEITVVGTRHQIVINEAVDLAKRFCDGAAPRIINGCLRTFV 205
             ++E ++L L + EIT   T   + +NEA++LAK F D  + R ING L  FV
Sbjct:    85 LVERNLLRLGVFEITSFDTPQLVAVNEAIELAKDFSDQKSARFINGLLSQFV 136


>UNIPROTKB|Q9KPU5
            symbol:nusB "N utilization substance protein B homolog"
            species:243277 "Vibrio cholerae O1 biovar El Tor str. N16961"
            [GO:0006353 "DNA-dependent transcription, termination"
            evidence=ISS] HAMAP:MF_00073 InterPro:IPR006027 InterPro:IPR011605
            Pfam:PF01029 GO:GO:0006355 EMBL:AE003852 GenomeReviews:AE003852_GR
            GO:GO:0003723 GO:GO:0006353 eggNOG:COG0781 KO:K03625
            ProtClustDB:PRK00202 Gene3D:1.10.940.10 PANTHER:PTHR11078
            SUPFAM:SSF48013 TIGRFAMs:TIGR01951 OMA:IPYKVVI PIR:B82098
            RefSeq:NP_231898.1 ProteinModelPortal:Q9KPU5 SMR:Q9KPU5
            DNASU:2613189 GeneID:2613189 KEGG:vch:VC2267 PATRIC:20083575
            Uniprot:Q9KPU5
        Length = 156

 Score = 109 (43.4 bits), Expect = 3.9e-05, P = 3.9e-05
 Identities = 39/126 (30%), Positives = 64/126 (50%)

Query:    78 VTTETVEEADE-LLRSDEEESAIEAEVLSAPPKLVYSKLLLRFTRKLLVAVVDKWDAHVH 136
             +T E V   +E  L S + +   E E  +A P L   +  + + R LL  VV     H  
Sbjct:    26 ITKENVATIEEQFLTSGKYD---EEEHRAAEPALAAPETDVSYFRDLLAGVVLN---HNE 79

Query:   137 IIDKVVPPIWKDQPAGRILELSILHLAMSEITV-VGTRHQIVINEAVDLAKRFCDGAAPR 195
             +  K+ P + +      ++EL++L LAM E+T      +++VINEA++LAK F    + +
Sbjct:    80 LDSKLRPFVSRPMQDLDMMELALLRLAMYEMTRREDVPYKVVINEAIELAKVFAAEDSHK 139

Query:   196 IINGCL 201
              +NG L
Sbjct:   140 FVNGVL 145


>TIGR_CMR|VC_2267
            symbol:VC_2267 "N utilization substance protein B"
            species:686 "Vibrio cholerae O1 biovar El Tor" [GO:0006353
            "DNA-dependent transcription, termination" evidence=ISS]
            HAMAP:MF_00073 InterPro:IPR006027 InterPro:IPR011605 Pfam:PF01029
            GO:GO:0006355 EMBL:AE003852 GenomeReviews:AE003852_GR GO:GO:0003723
            GO:GO:0006353 eggNOG:COG0781 KO:K03625 ProtClustDB:PRK00202
            Gene3D:1.10.940.10 PANTHER:PTHR11078 SUPFAM:SSF48013
            TIGRFAMs:TIGR01951 OMA:IPYKVVI PIR:B82098 RefSeq:NP_231898.1
            ProteinModelPortal:Q9KPU5 SMR:Q9KPU5 DNASU:2613189 GeneID:2613189
            KEGG:vch:VC2267 PATRIC:20083575 Uniprot:Q9KPU5
        Length = 156

 Score = 109 (43.4 bits), Expect = 3.9e-05, P = 3.9e-05
 Identities = 39/126 (30%), Positives = 64/126 (50%)

Query:    78 VTTETVEEADE-LLRSDEEESAIEAEVLSAPPKLVYSKLLLRFTRKLLVAVVDKWDAHVH 136
             +T E V   +E  L S + +   E E  +A P L   +  + + R LL  VV     H  
Sbjct:    26 ITKENVATIEEQFLTSGKYD---EEEHRAAEPALAAPETDVSYFRDLLAGVVLN---HNE 79

Query:   137 IIDKVVPPIWKDQPAGRILELSILHLAMSEITV-VGTRHQIVINEAVDLAKRFCDGAAPR 195
             +  K+ P + +      ++EL++L LAM E+T      +++VINEA++LAK F    + +
Sbjct:    80 LDSKLRPFVSRPMQDLDMMELALLRLAMYEMTRREDVPYKVVINEAIELAKVFAAEDSHK 139

Query:   196 IINGCL 201
              +NG L
Sbjct:   140 FVNGVL 145


>TIGR_CMR|CJE_0431
            symbol:CJE_0431 "transcription antitermination factor
            NusB" species:195099 "Campylobacter jejuni RM1221" [GO:0006353
            "DNA-dependent transcription, termination" evidence=ISS]
            HAMAP:MF_00073 InterPro:IPR006027 InterPro:IPR011605 Pfam:PF01029
            GO:GO:0006355 GO:GO:0003723 EMBL:CP000025 GenomeReviews:CP000025_GR
            GO:GO:0006353 eggNOG:COG0781 HOGENOM:HOG000281867 KO:K03625
            ProtClustDB:PRK00202 Gene3D:1.10.940.10 PANTHER:PTHR11078
            SUPFAM:SSF48013 TIGRFAMs:TIGR01951 RefSeq:YP_178450.1
            ProteinModelPortal:Q5HW85 STRING:Q5HW85 PRIDE:Q5HW85 GeneID:3231193
            KEGG:cjr:CJE0431 PATRIC:20042562 OMA:QMPGVDK
            BioCyc:CJEJ195099:GJC0-436-MONOMER Uniprot:Q5HW85
        Length = 132

 Score = 97 (39.2 bits), Expect = 0.00011, P = 0.00011
 Identities = 27/90 (30%), Positives = 45/90 (50%)

Query:   119 FTRKLLVAVVDKWDAHVHIIDKVVPPIWKDQPAGRI--LELSILHLAMSEITVVGTRHQI 176
             FT  L   ++D    +++ ID+ +     D     +  +E +IL L   E+    T   I
Sbjct:    44 FTLNLYNGILD----NLNNIDETLNSFLNDNQITALGHVERAILRLGAYELLFTDTPSAI 99

Query:   177 VINEAVDLAKRFCDGAAPRIINGCLRTFVR 206
             VINEA++LAK   +  +P+ ING L   ++
Sbjct:   100 VINEAIELAKELANDNSPKFINGVLDALIK 129


>TIGR_CMR|CPS_1532
            symbol:CPS_1532 "transcription termination/antitermination
            factor NusB" species:167879 "Colwellia psychrerythraea 34H"
            [GO:0006353 "DNA-dependent transcription, termination"
            evidence=ISS] HAMAP:MF_00073 InterPro:IPR006027 InterPro:IPR011605
            Pfam:PF01029 GO:GO:0006355 GO:GO:0003723 EMBL:CP000083
            GenomeReviews:CP000083_GR GO:GO:0006353 eggNOG:COG0781 KO:K03625
            Gene3D:1.10.940.10 PANTHER:PTHR11078 SUPFAM:SSF48013
            TIGRFAMs:TIGR01951 RefSeq:YP_268274.1 HSSP:Q9X286
            ProteinModelPortal:Q485J1 SMR:Q485J1 STRING:Q485J1 GeneID:3519785
            KEGG:cps:CPS_1532 PATRIC:21466271 HOGENOM:HOG000281868 OMA:IPYKVVI
            BioCyc:CPSY167879:GI48-1613-MONOMER Uniprot:Q485J1
        Length = 138

 Score = 89 (36.4 bits), Expect = 0.00017, Sum P(2) = 0.00017
 Identities = 24/72 (33%), Positives = 42/72 (58%)

Query:   138 IDKVVPPIWKDQPAGRI--LELSILHLAMSEIT-VVGTRHQIVINEAVDLAKRFCDGAAP 194
             ID+ + P + D+P   I  +E +IL +A+ E+       +++VINEA++LAK F    + 
Sbjct:    62 IDEAIIP-YVDRPLDDIDQVEKAILRVAVFELKDCTDVPYRVVINEAIELAKSFAADDSH 120

Query:   195 RIINGCLRTFVR 206
             + +NG L   V+
Sbjct:   121 KFVNGVLDKTVK 132

 Score = 45 (20.9 bits), Expect = 0.00017, Sum P(2) = 0.00017
 Identities = 22/71 (30%), Positives = 33/71 (46%)

Query:    20 SPRA-ARELALLVVYAACLEGSDPIRLFEKRL---NSRREPGYEFDKSSLLEYNHMSFGG 75
             SPR  ARELA+  VY+  +   +P+   E      NS+R     FD    +EY  +   G
Sbjct:     4 SPRRKARELAVQAVYSWQVS-KNPVNDIEVNFIADNSKRR----FD----IEYFQLLLRG 54

Query:    76 PPVTTETVEEA 86
                   +++EA
Sbjct:    55 VTTNIGSIDEA 65


>TIGR_CMR|GSU_1692
            symbol:GSU_1692 "N utilization substance protein B"
            species:243231 "Geobacter sulfurreducens PCA" [GO:0006353
            "DNA-dependent transcription, termination" evidence=ISS]
            HAMAP:MF_00073 InterPro:IPR006027 InterPro:IPR011605 Pfam:PF01029
            GO:GO:0006355 GO:GO:0003723 EMBL:AE017180 GenomeReviews:AE017180_GR
            GO:GO:0006353 eggNOG:COG0781 HOGENOM:HOG000281867 KO:K03625
            Gene3D:1.10.940.10 PANTHER:PTHR11078 SUPFAM:SSF48013
            TIGRFAMs:TIGR01951 OMA:QMPGVDK HSSP:Q9X286 RefSeq:NP_952743.1
            ProteinModelPortal:Q74CI1 GeneID:2685421 KEGG:gsu:GSU1692
            PATRIC:22026233 ProtClustDB:CLSK828489
            BioCyc:GSUL243231:GH27-1722-MONOMER Uniprot:Q74CI1
        Length = 138

 Score = 90 (36.7 bits), Expect = 0.00019, Sum P(2) = 0.00019
 Identities = 25/71 (35%), Positives = 38/71 (53%)

Query:   134 HVHIIDKVVPPIWKDQPAGRI--LELSILHLAMSEITVVGT-RHQIVINEAVDLAKRFCD 190
             H   ID  +    K+   GR+  ++LSIL +AM E+         + INEA+++AK+F  
Sbjct:    57 HRQEIDTAITGASKNWSIGRMARVDLSILRMAMYELLFRSDIPKNVTINEAIEVAKKFGT 116

Query:   191 GAAPRIINGCL 201
               +P  ING L
Sbjct:   117 EDSPAFINGIL 127

 Score = 43 (20.2 bits), Expect = 0.00019, Sum P(2) = 0.00019
 Identities = 13/36 (36%), Positives = 18/36 (50%)

Query:    22 RAARELALLVVYAACLEGSDPIRLFEKRLNSRREPG 57
             R  RELAL ++Y+      +   L E  L+   EPG
Sbjct:     5 RLGRELALQMLYSRDYAAGEAAPLLELVLDES-EPG 39


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.318   0.134   0.383    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      227       227   0.00079  113 3  11 22  0.40    33
                                                     32  0.41    36
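
The bit scores printed next to each raw score in the alignments can be checked against the Lambda and K values listed above. The sketch below is not part of the BLAST output. It assumes the standard Karlin-Altschul normalization, bits = (Lambda * S - ln K) / ln 2, and assumes that the values in the Q=9,R=2 row (Lambda = 0.244, K = 0.0300) are the ones that apply; that row is the one that reproduces the reported figures. It also assumes the usual Poisson relation P = 1 - exp(-E), which would explain why Expect and P print identically at these small values. The function names are hypothetical.

```python
import math

# Values copied from the "Q=9,R=2" row of the parameter table (assumed to apply).
LAMBDA_GAPPED = 0.244
K_GAPPED = 0.0300


def bit_score(raw_score: float, lam: float = LAMBDA_GAPPED, k: float = K_GAPPED) -> float:
    """Convert a raw alignment score to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def p_from_e(expect: float) -> float:
    """Probability of at least one chance hit given an expectation E (Poisson assumption)."""
    # expm1 keeps precision when E is tiny, where 1 - exp(-E) would round to 0.
    return -math.expm1(-expect)


# Raw scores taken from the alignments in this report.
for raw in (875, 104, 109, 97, 89, 90):
    print(f"raw score {raw:>3} -> {bit_score(raw):.1f} bits")

for expect in (1.4e-87, 0.00019):
    print(f"E = {expect:g} -> P = {p_from_e(expect):g}")
```

With these inputs the computed bit scores round to the values shown in the report (313.1, 41.7, 43.4, 39.2, 36.4 and 36.7 bits).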


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  7
  No. of states in DFA:  601 (64 KB)
  Total size of DFA:  167 KB (2098 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  20.32u 0.09s 20.41t   Elapsed:  00:00:03
  Total cpu time:  20.32u 0.09s 20.41t   Elapsed:  00:00:03
  Start:  Fri May 10 02:09:51 2013   End:  Fri May 10 02:09:54 2013
