BLAST Results

Query Summary

Your job contains 1 sequence.

Parameters
Threshold: 0.001
Maximum number of alignments shown: 100
BLAST filter: on

Query Sequence

>014177
MARALPRLNSTATAIMASATSATKETVLDGAHFQMKNRNARVLVLGGTGRVGGSTAVALS
KLCPDLQIVVGSRNREKGAAMVSTLGKNSEFAEVNIYNEGSLLMALRDVDLVVHAAGPFQ
QAPKCTVLEAAIETKTAYIDVCDDTIYSQRAKSFKDRAIAANIPAITTGGIYPGVSNVMA
AELVRVARNESKGEPERLRFSYYTAGTGGAGPTILATSFLLLGEEVVAYNKGEEITLEPY
SGMLSVDFGKGIGRKDVFLLNLPEVRSAREVLGVPTVSARFGTAPFFWNWGMVTMQRLFP
AEYLRDRSKVQQLVQLFDPVVRAFDGIAGERVSMRVDLECTDGRNTVGIFSHRRLSVSVG
TAIAAFVLAVLEGATQPGVWFPEEPEGIAIEAREVLLKRASQGTINFVMNKAPWMVETEP
KELGLGIYI

High Scoring Gene Products

Symbol, full name                            Information                                                 P value
AT1G50450                                    protein from Arabidopsis thaliana                           3.3e-150
PFL_3454, Saccharopine dehydrogenase         protein from Pseudomonas protegens Pf-5                     6.4e-07
GSU_2539, saccharopine dehydrogenase         protein from Geobacter sulfurreducens PCA                   1.3e-05
VC_1624, Putative uncharacterized protein    protein from Vibrio cholerae O1 biovar El Tor str. N16961   0.00066
VC_1624, conserved hypothetical protein      protein from Vibrio cholerae O1 biovar El Tor               0.00066

Raw Blast Data

BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]

Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.

Reference:  Gish, W. (1996-2006) http://blast.wustl.edu

Query=  014177
        (429 letters)

Database:  go_20130330-seqdb.fasta
           368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done

                                                                     Smallest
                                                                       Sum
                                                              High  Probability
Sequences producing High-scoring Segment Pairs:              Score  P(N)      N

TAIR|locus:2008041 - symbol:AT1G50450 species:3702 "Arabi...  1466  3.3e-150  1
UNIPROTKB|Q4KB25 - symbol:PFL_3454 "Saccharopine dehydrog...   142  6.4e-07   1
TIGR_CMR|GSU_2539 - symbol:GSU_2539 "saccharopine dehydro...   131  1.3e-05   1
UNIPROTKB|Q9KRL3 - symbol:VC_1624 "Putative uncharacteriz...   116  0.00066   1
TIGR_CMR|VC_1624 - symbol:VC_1624 "conserved hypothetical...   116  0.00066   1
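
The hit summary above has a regular layout: a truncated description followed by three whitespace-separated fields (Score, P(N), N). As a minimal sketch, assuming that layout holds for every line, the rows could be parsed as follows. The `Hit` type and `parse_summary_line` helper are hypothetical names introduced for illustration and are not part of any BLAST tool.

```python
from typing import NamedTuple


class Hit(NamedTuple):
    accession: str    # database-qualified identifier, e.g. "TAIR|locus:2008041"
    description: str  # truncated description text as printed in the summary
    score: int        # raw high score
    p_value: float    # smallest sum probability P(N)
    n: int            # number of segment pairs used for P(N)


def parse_summary_line(line: str) -> Hit:
    # Assumption: the last three whitespace-separated fields are Score, P(N), N.
    left, score, p_value, n = line.rsplit(None, 3)
    # Assumption: the accession is separated from the description by " - ".
    accession, _, description = left.partition(" - ")
    return Hit(accession, description.strip(), int(score), float(p_value), int(n))


summary_lines = [
    'TAIR|locus:2008041 - symbol:AT1G50450 species:3702 "Arabi...  1466  3.3e-150  1',
    'UNIPROTKB|Q4KB25 - symbol:PFL_3454 "Saccharopine dehydrog...   142  6.4e-07   1',
    'TIGR_CMR|GSU_2539 - symbol:GSU_2539 "saccharopine dehydro...   131  1.3e-05   1',
    'UNIPROTKB|Q9KRL3 - symbol:VC_1624 "Putative uncharacteriz...   116  0.00066   1',
    'TIGR_CMR|VC_1624 - symbol:VC_1624 "conserved hypothetical...   116  0.00066   1',
]

hits = [parse_summary_line(line) for line in summary_lines]
for hit in hits:
    print(hit.accession, hit.score, hit.p_value)
```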


>TAIR|locus:2008041
            symbol:AT1G50450 species:3702 "Arabidopsis thaliana"
            [GO:0000166 "nucleotide binding" evidence=IEA] [GO:0016491
            "oxidoreductase activity" evidence=IEA] [GO:0055114
            "oxidation-reduction process" evidence=IEA] [GO:0009507
            "chloroplast" evidence=IDA] [GO:0009534 "chloroplast thylakoid"
            evidence=IDA] InterPro:IPR005097 Pfam:PF03435 InterPro:IPR016040
            EMBL:CP002684 GenomeReviews:CT485782_GR GO:GO:0000166
            Gene3D:3.40.50.720 GO:GO:0016491 eggNOG:COG1748 GO:GO:0009534
            EMBL:AY039543 EMBL:AY097361 IPI:IPI00520569 RefSeq:NP_564570.1
            UniGene:At.38025 UniGene:At.48296 ProteinModelPortal:Q94BZ0
            SMR:Q94BZ0 IntAct:Q94BZ0 STRING:Q94BZ0 PaxDb:Q94BZ0 PRIDE:Q94BZ0
            EnsemblPlants:AT1G50450.1 GeneID:841467 KEGG:ath:AT1G50450
            TAIR:At1g50450 HOGENOM:HOG000012826 InParanoid:Q94BZ0 OMA:IFPGISN
            PhylomeDB:Q94BZ0 ProtClustDB:CLSN2688642 Genevestigator:Q94BZ0
            Uniprot:Q94BZ0
        Length = 428

 Score = 1466 (521.1 bits), Expect = 3.3e-150, P = 3.3e-150
 Identities = 284/409 (69%), Positives = 331/409 (80%)

Query:    24 KETVLDGA---HFQMKNRNXXXXXXXXXXXXXXXXXXALSKLCPDLQIVVGSRNREKGAA 80
             +ET  DG     F   +RN                  ALSKLCP+L+IVVG RNREKG A
Sbjct:    20 RETQYDGVPEVKFSDPSRNYRVLVLGGTGRVGGSTATALSKLCPELKIVVGGRNREKGEA 79

Query:    81 MVSTLGKNSEFAEVNIYNEGSLLMALRDVDLVVHAAGPFQQAPKCTVLEAAIETKTAYID 140
             MV+ LG+NSEF++V+I +   L  +LRDVDLVVHAAGPFQQAP+CTVLEAAI+TKTAY+D
Sbjct:    80 MVAKLGENSEFSQVDINDAKMLETSLRDVDLVVHAAGPFQQAPRCTVLEAAIKTKTAYLD 139

Query:   141 VCDDTIYSQRAKSFKDRAIAANIPAITTGGIYPGVSNVMAAELVRVARNESKGEPERLRF 200
             VCDDT Y+ RAKS +  AIAANIPA+TT GIYPGVSNVMAAE+V  AR+E KG+PE+LRF
Sbjct:   140 VCDDTSYAFRAKSLEAEAIAANIPALTTAGIYPGVSNVMAAEMVAAARSEDKGKPEKLRF 199

Query:   201 SXXXXXXXXXXPTILATSFLLLGEEVVAYNKGEEITLEPYSGMLSVDFGKGIGRKDVFLL 260
             S          PTILATSFLLLGEEV AY +GE++ L PYSGM++VDFGKGI ++DV+LL
Sbjct:   200 SYYTAGTGGAGPTILATSFLLLGEEVTAYKQGEKVKLRPYSGMITVDFGKGIRKRDVYLL 259

Query:   261 NLPEVRSAREVLGVPTVSARFGTAPFFWNWGMVTMQRLFPAEYLRDRSKVQQLVQLFDPV 320
             NLPEVRS  EVLGVPTV ARFGTAPFFWNWGM  M +L P+E LRDR+KVQQ+V+LFDPV
Sbjct:   260 NLPEVRSTHEVLGVPTVVARFGTAPFFWNWGMEIMTKLLPSEVLRDRTKVQQMVELFDPV 319

Query:   321 VRAFDGIAGERVSMRVDLECTDGRNTVGIFSHRRLSVSVGTAIAAFVLAVLEGATQPGVW 380
             VRA DG AGERVSMRVDLEC+DGR TVG+FSH++LSVSVG + AAFV A+LEG+TQPGVW
Sbjct:   320 VRAMDGFAGERVSMRVDLECSDGRTTVGLFSHKKLSVSVGVSTAAFVAAMLEGSTQPGVW 379

Query:   381 FPEEPEGIAIEAREVLLKRASQGTINFVMNKAPWMVETEPKELGLGIYI 429
             FPEEP+GIA+EAREVLLKRASQGT NF++NK PWMVETEPKE+ LGIY+
Sbjct:   380 FPEEPQGIAVEAREVLLKRASQGTFNFILNKPPWMVETEPKEVVLGIYV 428


>UNIPROTKB|Q4KB25
            symbol:PFL_3454 "Saccharopine dehydrogenase" species:220664
            "Pseudomonas protegens Pf-5" [GO:0003674 "molecular_function"
            evidence=ND] [GO:0005575 "cellular_component" evidence=ND]
            [GO:0008150 "biological_process" evidence=ND] InterPro:IPR005097
            Pfam:PF03435 InterPro:IPR016040 GO:GO:0000166 Gene3D:3.40.50.720
            EMBL:CP000076 GenomeReviews:CP000076_GR GO:GO:0004754
            eggNOG:COG3268 RefSeq:YP_260558.1 ProteinModelPortal:Q4KB25
            STRING:Q4KB25 GeneID:3475489 KEGG:pfl:PFL_3454 PATRIC:19876243
            HOGENOM:HOG000253294 OMA:DVSTAYY
            BioCyc:PFLU220664:GIX8-3469-MONOMER Uniprot:Q4KB25
        Length = 359

 Score = 142 (55.0 bits), Expect = 6.4e-07, P = 6.4e-07
 Identities = 49/215 (22%), Positives = 94/215 (43%)

Query:    66 LQIVVGSRNREKGAAMVSTLGKNSEFAEVNIYNEGSLLMALRDVDLVVHAAGPFQQAPKC 125
             L+ V+  RNR+K  A+   LG  +    ++  ++  LL  ++ + LV+H AGPF  A   
Sbjct:    35 LRPVLAGRNRDKVEALARELGLEARVFGLD--DDARLLAQVKGLGLVLHCAGPFS-ATAA 91

Query:   126 TVLEAAIETKTAYIDVCDDTIYSQRAKSFKDRAIAANIPAITTGGIYPGVS-NVMAAELV 184
              ++EA +     Y+D+  +    + A+S  +RA AA +       I PGV  +V+  + V
Sbjct:    92 PMIEACLRASAHYLDITGEIAVFEHAQSLNERARAAGVV------ICPGVGFDVVPTDCV 145

Query:   185 RVARNESKGEPERLRFSXXXXXXXXXXPTILATSFLLLGEEVVAYNKGEEITLEPYSGML 244
               A  ++   P+    +          P    TS   + +       G  +++     + 
Sbjct:   146 AAALKDAL--PDATHLALGFDSRSSFSPGTAKTSIEGMAQGGKVRRDGRIVSVPLAYRVR 203

Query:   245 SVDFGKGIGRKDVFLLNLP--EVRSAREVLGVPTV 277
              +DFG G    +   + +P  ++ +A    G+P +
Sbjct:   204 RIDFGAG----EKLSMTIPWGDISTAYHTTGIPNI 234


>TIGR_CMR|GSU_2539
            symbol:GSU_2539 "saccharopine dehydrogenase"
            species:243231 "Geobacter sulfurreducens PCA" [GO:0004754
            "saccharopine dehydrogenase (NAD+, L-lysine-forming) activity"
            evidence=ISS] [GO:0009085 "lysine biosynthetic process"
            evidence=ISS] InterPro:IPR005097 Pfam:PF03435 InterPro:IPR016040
            GO:GO:0000166 Gene3D:3.40.50.720 GO:GO:0016491 EMBL:AE017180
            GenomeReviews:AE017180_GR KO:K00290 HOGENOM:HOG000005214
            OMA:KFEYSWQ ProtClustDB:CLSK870070 RefSeq:NP_953585.1
            ProteinModelPortal:Q74A52 DNASU:2687788 GeneID:2687788
            KEGG:gsu:GSU2539 PATRIC:22027929
            BioCyc:GSUL243231:GH27-2515-MONOMER Uniprot:Q74A52
        Length = 398

 Score = 131 (51.2 bits), Expect = 1.3e-05, P = 1.3e-05
 Identities = 62/230 (26%), Positives = 105/230 (45%)

Query:    68 IVVGSRNREKGAAMVSTLGKNSEFAEVNIYNEGSLLMALRDVD--LVVHAAGPFQQAPKC 125
             I + SR + K  A+ + L  +   A+V+  N   L+  ++     LV++ A P+Q     
Sbjct:    30 ITLASRTKSKCDAIAAQLNNSIATAQVDADNVPELVALIKKEQPKLVINVALPYQDL--- 86

Query:   126 TVLEAAIETKTAYIDVCD----DTI---YSQRAKSFKDRAIAANIPAITTGGIYPGVSNV 178
             T+++A +ET   Y+D  +    DT    YS +  +++DR  +A + A+   G  PGV+NV
Sbjct:    87 TIMDACLETGVDYLDTANYEPLDTAKFEYSWQW-AYQDRFKSAGLMALLGSGFDPGVTNV 145

Query:   179 MAAELVRVARNESKGEPERLRFSXXXXXXXXXXPTILATSFL--LLGEEVVA----YNKG 232
               A    +A  +   E + +             P   AT+F   +   EV A    +  G
Sbjct:   146 YTA----LAAKKYLDEVQEIDI-IDANAGSHGQP--FATNFNPEINIREVTAPCRHWENG 198

Query:   233 EEITLEPYSGMLSVDFGKGIGRKDVFLLNLPEVRSAREVLGVPTV-SARF 281
             E +   P S     DF +GIG  +++ L   E+ S   V  +PT+  A+F
Sbjct:   199 EFVETPPLSTKQVFDFPEGIGPMNIYRLYHEEMESI--VKHIPTIRKAQF 246


>UNIPROTKB|Q9KRL3
            symbol:VC_1624 "Putative uncharacterized protein"
            species:243277 "Vibrio cholerae O1 biovar El Tor str. N16961"
            [GO:0003674 "molecular_function" evidence=ND] [GO:0005575
            "cellular_component" evidence=ND] [GO:0008150 "biological_process"
            evidence=ND] InterPro:IPR005097 Pfam:PF03435 InterPro:IPR016040
            GO:GO:0000166 Gene3D:3.40.50.720 EMBL:AE003852
            GenomeReviews:AE003852_GR GO:GO:0016491 OMA:KFEYSWQ
            ProtClustDB:CLSK870070 PIR:A82177 RefSeq:NP_231263.1
            ProteinModelPortal:Q9KRL3 DNASU:2613880 GeneID:2613880
            KEGG:vch:VC1624 PATRIC:20082312 KO:K13746
            BioCyc:MetaCyc:MONOMER-15801 Uniprot:Q9KRL3
        Length = 414

 Score = 116 (45.9 bits), Expect = 0.00066, P = 0.00066
 Identities = 53/226 (23%), Positives = 96/226 (42%)

Query:    77 KGAAMVSTLGKNSEFAEVNIYNEGSLLMALRDV--DLVVHAAGPFQQAPKCTVLEAAIET 134
             KG   +    K  E  +VN  +  SL+  + +V  DLV++A  P+       ++EA  + 
Sbjct:    47 KGKNNLKDSSKKLEARQVNADDIESLVKLINEVKPDLVINAGPPWVNV---AIMEACYQA 103

Query:   135 KTAY------IDVCDDTIYSQRAK----SFKDRAIAANIPAITTGGIYPGVSNVMAAELV 184
             K +Y      +D+C        A     +F+D+   A I AI + G  PGV +V AA   
Sbjct:   104 KVSYLDTSVSVDLCSKGQQVPEAYDAQWAFRDKFKQAGITAILSAGFDPGVVSVFAAYAA 163

Query:   185 RVARNESKGEPERLRFSXXXXXXXXXXPTILATSFLLLGEEVVAYNKGEEITLEPYSGML 244
             +   +E     + L  +               T+ L +  + + ++ GE   +  ++ ML
Sbjct:   164 KYLFDEID-TIDVLDINAGDHGKKFATNFDPETNLLEIQGDSIYWDAGEWKRVPCHTRML 222

Query:   245 SVDFGKGIGRKDVFLLNLPEVRSAREVLGVPTVSARFGTAPFFWNW 290
               DF K  G+  V+ ++  E+RS +E +    +    G    + N+
Sbjct:   223 EFDFPK-CGKFKVYSMSHDELRSLKEFIPAKRIEFWMGFGDRYLNY 267


>TIGR_CMR|VC_1624
            symbol:VC_1624 "conserved hypothetical protein" species:686
            "Vibrio cholerae O1 biovar El Tor" [GO:0003674 "molecular_function"
            evidence=ND] [GO:0005575 "cellular_component" evidence=ND]
            [GO:0008150 "biological_process" evidence=ND] InterPro:IPR005097
            Pfam:PF03435 InterPro:IPR016040 GO:GO:0000166 Gene3D:3.40.50.720
            EMBL:AE003852 GenomeReviews:AE003852_GR GO:GO:0016491 OMA:KFEYSWQ
            ProtClustDB:CLSK870070 PIR:A82177 RefSeq:NP_231263.1
            ProteinModelPortal:Q9KRL3 DNASU:2613880 GeneID:2613880
            KEGG:vch:VC1624 PATRIC:20082312 KO:K13746
            BioCyc:MetaCyc:MONOMER-15801 Uniprot:Q9KRL3
        Length = 414

 Score = 116 (45.9 bits), Expect = 0.00066, P = 0.00066
 Identities = 53/226 (23%), Positives = 96/226 (42%)

Query:    77 KGAAMVSTLGKNSEFAEVNIYNEGSLLMALRDV--DLVVHAAGPFQQAPKCTVLEAAIET 134
             KG   +    K  E  +VN  +  SL+  + +V  DLV++A  P+       ++EA  + 
Sbjct:    47 KGKNNLKDSSKKLEARQVNADDIESLVKLINEVKPDLVINAGPPWVNV---AIMEACYQA 103

Query:   135 KTAY------IDVCDDTIYSQRAK----SFKDRAIAANIPAITTGGIYPGVSNVMAAELV 184
             K +Y      +D+C        A     +F+D+   A I AI + G  PGV +V AA   
Sbjct:   104 KVSYLDTSVSVDLCSKGQQVPEAYDAQWAFRDKFKQAGITAILSAGFDPGVVSVFAAYAA 163

Query:   185 RVARNESKGEPERLRFSXXXXXXXXXXPTILATSFLLLGEEVVAYNKGEEITLEPYSGML 244
             +   +E     + L  +               T+ L +  + + ++ GE   +  ++ ML
Sbjct:   164 KYLFDEID-TIDVLDINAGDHGKKFATNFDPETNLLEIQGDSIYWDAGEWKRVPCHTRML 222

Query:   245 SVDFGKGIGRKDVFLLNLPEVRSAREVLGVPTVSARFGTAPFFWNW 290
               DF K  G+  V+ ++  E+RS +E +    +    G    + N+
Sbjct:   223 EFDFPK-CGKFKVYSMSHDELRSLKEFIPAKRIEFWMGFGDRYLNY 267


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.320   0.136   0.393    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      429       387   0.00093  117 3  11 22  0.38    34
                                                     34  0.45    37
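
The bit scores reported with each alignment (for example "Score = 1466 (521.1 bits)") appear consistent with the standard Karlin-Altschul conversion, bits = (Lambda * S - ln K) / ln 2, using the gapped parameters from the "Q=9,R=2" row above (Lambda = 0.244, K = 0.0300). This is an inference from the numbers in this report, not something the output states. The sketch below checks it against the four distinct raw scores; `bit_score` is a hypothetical helper name.

```python
import math

# Gapped parameters copied from the "Q=9,R=2" row of the Parameters table.
# Assumption: these are the values used to convert raw scores to bits.
LAMBDA = 0.244
K = 0.0300


def bit_score(raw_score: int, lam: float = LAMBDA, k: float = K) -> float:
    """Convert a raw alignment score S to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


# Raw scores and the bit scores printed in the alignment headers of this report.
reported = {1466: 521.1, 142: 55.0, 131: 51.2, 116: 45.9}

for raw, bits in reported.items():
    print(f"S = {raw:4d}  computed = {bit_score(raw):6.1f} bits  reported = {bits} bits")
```

With the ungapped BLOSUM62 values from the first row (Lambda = 0.320, K = 0.136) the same formula gives roughly 680 bits for S = 1466, which does not match the report, so the gapped row seems to be the one in use.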


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  5
  No. of states in DFA:  605 (64 KB)
  Total size of DFA:  212 KB (2118 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  25.60u 0.14s 25.74t   Elapsed:  00:00:02
  Total cpu time:  25.60u 0.14s 25.74t   Elapsed:  00:00:02
  Start:  Fri May 10 05:12:01 2013   End:  Fri May 10 05:12:03 2013
