BLAST Results

Query Summary

Your job contains 1 sequence.

Parameters
Threshold: 0.001
Maximum number of alignments shown: 100
BLAST filter: on

Query Sequence

>003012
MKSSTTSANCVLLICLLLFNSARGGDNSEQNKFRQREATDDQLGLPQIDEDALVNTQCPK
NLELRWQTEVSSSIYATPLIADINSDGKLDIVVPSFLHYLEVLEGSDGDKMPGWPAFHQS
SVHSSPLLYDIDKDGVREIALATYNGEVLFFRVSGYMMTDKLEIPRRKVRKDWYVGLHSD
PVDRSHPDVHDDLIVQESEAARMKSMLETKKSTPETNATVTTSTESNPAPATVSNPDVKK
VNESLVNVSNPSEERKVNESHTEMNIKLPTSVDNSSTTTVSGGTNSSENGTNTGRRLLED
NNSKGSQEGNDKEDVPVATAENDQALDENADSSFELFRDTDELADEYNYDYDDYVDDAMW
GDEEWTEEQHEKIEDYVNVDSHILSTPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLK
ELGGIDIGKYVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLD
ILVGTSFGLFYVLDHHGKIREKFPLEMAEIQGAVVAADINDDGKIELVTTDTHGNVAAWT
AEGKGIWEQHLKSLVTQGPSIGDVDGDGHSDVVVPTLSGNIYVLSGKDGSKVRPYPYRTH
GRVMNQVLLVDLTKRGEKSKGLTIVTTSFDGYLYLIDGPTSCADVVDIGETSYSMVLADN
VDGGDDLDLIVTTMNGNVFCFSTPAPHHPLKAWRSINQGRNNVAIRYNRAGIYVTHPSRA
FRDEEGRNFWVEIEIVDEYRFPSGSQAPYNVTTTLLVPGNYQGERRIKQSQIFARRGKYR
IKLPTVGVRTTGTVLVEMVDKNGLYFSDEFSLTFHMYYYKLLKWLLVLPMLGMFGVLVIL
RPQEAMPLPSFSRNTDL

High Scoring Gene Products

Symbol    Full name                        Information                                                 P value
DEX1      DEFECTIVE IN EXINE FORMATION 1   protein from Arabidopsis thaliana                           1.1e-313
VC1888    Hemolysin-related protein        protein from Vibrio cholerae O1 biovar El Tor str. N16961   0.00013
VC_1888   hypothetical protein             protein from Vibrio cholerae O1 biovar El Tor               0.00013

Raw Blast Data

BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]

Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.

Reference:  Gish, W. (1996-2006) http://blast.wustl.edu

Query=  003012
        (857 letters)

Database:  go_20130330-seqdb.fasta
           368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done

                                                                     Smallest
                                                                       Sum
                                                              High  Probability
Sequences producing High-scoring Segment Pairs:              Score  P(N)      N

TAIR|locus:2095274 - symbol:DEX1 "DEFECTIVE IN EXINE FORM...  2187  1.1e-313  2
UNIPROTKB|Q9KQW0 - symbol:VC1888 "Hemolysin-related prote...   108  0.00013   3
TIGR_CMR|VC_1888 - symbol:VC_1888 "hypothetical protein" ...   108  0.00013   3
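The one-line hit summaries above have a regular layout: a database identifier, a symbol, a truncated description, then the score, P(N) and N columns. A minimal Python sketch for reading them could look like the following. The `Hit` type and `parse_hit_line` function are hypothetical names introduced here for illustration; they are not part of any BLAST tool or library.

```python
import re
from typing import NamedTuple


class Hit(NamedTuple):
    """One row of the 'Sequences producing High-scoring Segment Pairs' table."""
    db_id: str      # e.g. "TAIR|locus:2095274"
    symbol: str     # e.g. "DEX1"
    score: int      # High Score column
    p_value: float  # Smallest Sum Probability P(N)
    n: int          # number of HSPs combined in the sum statistic


def parse_hit_line(line: str) -> Hit:
    # The last three whitespace-separated fields are score, P(N) and N;
    # everything before them is the (possibly truncated) description.
    head, score, p_value, n = line.rsplit(None, 3)
    db_id, _, rest = head.partition(" - ")
    match = re.match(r"symbol:(\S+)", rest)
    symbol = match.group(1) if match else ""
    return Hit(db_id, symbol, int(score), float(p_value), int(n))


lines = [
    'TAIR|locus:2095274 - symbol:DEX1 "DEFECTIVE IN EXINE FORM...  2187  1.1e-313  2',
    'UNIPROTKB|Q9KQW0 - symbol:VC1888 "Hemolysin-related prote...   108  0.00013   3',
    'TIGR_CMR|VC_1888 - symbol:VC_1888 "hypothetical protein" ...   108  0.00013   3',
]
hits = [parse_hit_line(line) for line in lines]
for hit in hits:
    print(hit.db_id, hit.symbol, hit.score, hit.p_value, hit.n)
```

Note that 1.1e-313 lies below the smallest normal double, so it is stored as a subnormal float with reduced precision; keeping the P value as a string may be preferable if exact round-tripping matters.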


>TAIR|locus:2095274
            symbol:DEX1 "DEFECTIVE IN EXINE FORMATION 1" species:3702
            "Arabidopsis thaliana" [GO:0005576 "extracellular region"
            evidence=ISM] [GO:0005509 "calcium ion binding" evidence=ISS]
            [GO:0010208 "pollen wall assembly" evidence=IMP] [GO:0016020
            "membrane" evidence=ISS] [GO:0005783 "endoplasmic reticulum"
            evidence=IDA] Pfam:PF01839 GO:GO:0005783 EMBL:CP002686
            GO:GO:0016020 GO:GO:0005509 InterPro:IPR013517 GO:GO:0010208
            IPI:IPI00534148 RefSeq:NP_566343.1 UniGene:At.16907
            UniGene:At.69661 ProteinModelPortal:F4IYM4 SMR:F4IYM4 PRIDE:F4IYM4
            EnsemblPlants:AT3G09090.1 GeneID:820063 KEGG:ath:AT3G09090
            OMA:ADEYSYD Uniprot:F4IYM4
        Length = 896

 Score = 2187 (774.9 bits), Expect = 1.1e-313, Sum P(2) = 1.1e-313
 Identities = 435/654 (66%), Positives = 484/654 (74%)

Query:   211 KSTPET-NATVTTSTESNPAPATVSNPDVKKVNESLVNVSNPSEERKVNESHTEMNIKLP 269
             K TPE  N+++     +  A AT +    + +N ++   +N  ++ K++    E  IKL 
Sbjct:   246 KPTPELHNSSMDAGANNLAANATTAGSR-ENLNRNVT--TNEVDQSKISGDKNETVIKLN 302

Query:   270 XXXXXXXXXXXXXXXXXXXXXXXXX-RRLLEDNNSKGSQEGN-DKED----VPVATAEND 323
                                       RRLLE++ SK S + + D +D    V +AT END
Sbjct:   303 TSTGNSSETLGTSGNSSTAETVTKSGRRLLEEDGSKESVDSHSDSKDNSEGVRMATVEND 362

Query:   324 QALDENADSSFELFRDTDELAXXXXXXXXXXXXXAMWGDEEWTEEQHEKIEDYVNVDSHI 383
               L+ +ADSSFEL R+ DELA              MWGDEEW E QHE  EDYVN+D+HI
Sbjct:   363 GGLEADADSSFELLRENDELADEYSYDYDDYVDEKMWGDEEWVEGQHENSEDYVNIDAHI 422

Query:   384 LSTPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDTK 443
             L TPVIADID DGV EMI+AVSYFFD EYYDNPEHLKELGGIDI  Y+A +IVVFNLDTK
Sbjct:   423 LCTPVIADIDKDGVQEMIVAVSYFFDPEYYDNPEHLKELGGIDIKNYIASSIVVFNLDTK 482

Query:   444 QVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKF 503
             QVKW  +LDLSTD A+FRAYIYSSPTVVDLDGDG LDILVGTSFGLFY +DH G IREKF
Sbjct:   483 QVKWIKELDLSTDKANFRAYIYSSPTVVDLDGDGYLDILVGTSFGLFYAMDHRGNIREKF 542

Query:   504 PLEMAEIQGAVVAADINDDGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPSIXX 563
             PLEMAEIQGAVVAADINDDGKIELVTTD+HGN+AAWT +G  IWE HLKSLV QGPSI  
Sbjct:   543 PLEMAEIQGAVVAADINDDGKIELVTTDSHGNIAAWTTQGVEIWEAHLKSLVPQGPSIGD 602

Query:   564 XXXXXXXXXXXPTLSGNIYVLSGKDGSKVRPYPYRTHGRVMNQVLLVDLTKRGEKSKGLT 623
                        PT SGNIYVLSGKDGS VRPYPYRTHGRVMNQ+LLVDL KRGEK KGLT
Sbjct:   603 VDGDGHTEVVVPTSSGNIYVLSGKDGSIVRPYPYRTHGRVMNQLLLVDLNKRGEKKKGLT 662

Query:   624 IVTTSFDGYLYLIDGPTSCADVVDIGETSYSMVLADNVDGGDDLDLIVTTMNGNVFCFST 683
             IVTTSFDGYLYLIDGPTSC DVVDIGETSYSMVLADNVDGGDDLDLIV+TMNGNVFCFST
Sbjct:   663 IVTTSFDGYLYLIDGPTSCTDVVDIGETSYSMVLADNVDGGDDLDLIVSTMNGNVFCFST 722

Query:   684 PAPHHPLKAWRSINQGRNNVAIRYNRAGIYVTHPSRAFRDEEGRNFWVEIEIVDEYRFPS 743
             P+PHHPLKAWRS +QGRNN A RY+R G++VTH +R FRDEEG+NFW EIEIVD+YR+PS
Sbjct:   723 PSPHHPLKAWRSSDQGRNNKANRYDREGVFVTHSTRGFRDEEGKNFWAEIEIVDKYRYPS 782

Query:   744 GSQAPYNVTTTLLVPGNYQGERRIKQSQIFARRGKYRIKLPXXXXXXXXXXXXEMVDKNG 803
             GSQAPYNVTTTLLVPGNYQGERRI QSQI+ R GKYRIKLP            EM DKNG
Sbjct:   783 GSQAPYNVTTTLLVPGNYQGERRITQSQIYDRPGKYRIKLPTVGVRTTGTVMVEMADKNG 842

Query:   804 LYFSDEFSLTFHXXXXXXXXXXXXXXXXXXFGVLVILRPQEAMPLPSFSRNTDL 857
             L+FSDEFSLTFH                  FG+LVILRPQEA+PLPSFSRNTDL
Sbjct:   843 LHFSDEFSLTFHMYYYKLLKWLLVLPMLGMFGLLVILRPQEAVPLPSFSRNTDL 896
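The "Identities" line that heads each HSP carries the match counts as fractions, so exact identity and positive fractions can be recomputed from it. The sketch below is a minimal illustration in Python; `parse_identities` is a hypothetical helper name, and the regular expression assumes the line layout shown in this report.

```python
import re

# Matches e.g. " Identities = 435/654 (66%), Positives = 484/654 (74%)"
HSP_STATS = re.compile(
    r"Identities = (\d+)/(\d+) \((\d+)%\), Positives = (\d+)/(\d+) \((\d+)%\)"
)


def parse_identities(line: str) -> dict:
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError(f"not an Identities line: {line!r}")
    ident, length, _ident_pct, pos, _pos_len, _pos_pct = map(int, match.groups())
    return {
        "identities": ident,
        "positives": pos,
        "length": length,
        # Recomputed from the counts rather than taken from the printed percent
        "identity_fraction": ident / length,
        "positive_fraction": pos / length,
    }


stats = parse_identities(" Identities = 435/654 (66%), Positives = 484/654 (74%)")
print(stats)
```

In this report the printed percentages appear to be truncated rather than rounded (for example, 173/267 is about 64.8% but is shown as 64%), which is one reason to recompute the fractions from the raw counts.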

 Score = 846 (302.9 bits), Expect = 1.1e-313, Sum P(2) = 1.1e-313
 Identities = 173/267 (64%), Positives = 205/267 (76%)

Query:     1 MKSSTTSANCVLLICLLLFNSARGGDNSEQNKFRQREATDDQLGLPQIDEDALVNTQCPK 60
             MKS       V L+CL L N + G     +NKFR+R+ATDD+LG P IDEDAL+NTQCPK
Sbjct:     1 MKSRARQCLLVCLLCLSLTNLSYG-----ENKFRERKATDDELGYPDIDEDALLNTQCPK 55

Query:    61 NLELRWQTEVSSSIYATPLIADINSDGKLDIVVPSFLHYLEVLEGSDGDKMPGWPAFHQS 120
              LELRWQTEV+SS+YATPLIADINSDGKLDIVVPSF+HYLEVLEG+DGDKMPGWPAFHQS
Sbjct:    56 KLELRWQTEVTSSVYATPLIADINSDGKLDIVVPSFVHYLEVLEGADGDKMPGWPAFHQS 115

Query:   121 SVHSSPLLYDIDKDGVREIALATYNGEVLFFRVSGYMMTDKLEIPRRKVRKDWYVGLHSD 180
             +VHSSPLL+DIDKDGVREIALATYN EVLFFRVSG++M+DKLE+PRRKV K+W+VGL+ D
Sbjct:   116 NVHSSPLLFDIDKDGVREIALATYNAEVLFFRVSGFLMSDKLEVPRRKVHKNWHVGLNPD 175

Query:   181 PVDRSHPDVHDDLIVQESEAARMKSMLETKKSTPETNATVTTSTESNPAPATVSNPDVKK 240
             PVDRSHPDVHDD++  E EA  MKS      ST +TNAT TT         TVS    K+
Sbjct:   176 PVDRSHPDVHDDVL--EEEAMAMKS------STTQTNATTTTPN------VTVSM--TKE 219

Query:   241 VNESLVNVSNPSEERKVNESHTEMNIK 267
             V+ +   VS   ++++   + TE  +K
Sbjct:   220 VHGANSYVSTQEDQKRPENNQTEAIVK 246

 Score = 44 (20.5 bits), Expect = 7.7e-229, Sum P(2) = 7.7e-229
 Identities = 16/71 (22%), Positives = 31/71 (43%)

Query:   179 SDPVDRSHPDVHDDLIVQESEAARMKSMLETKKSTPETNATVTTSTESNPAPATVSNPDV 238
             ++ + +  P++H+  +  ++ A  + +   T  S    N  VTT+       +   N  V
Sbjct:   241 TEAIVKPTPELHNSSM--DAGANNLAANATTAGSRENLNRNVTTNEVDQSKISGDKNETV 298

Query:   239 KKVNESLVNVS 249
              K+N S  N S
Sbjct:   299 IKLNTSTGNSS 309


>UNIPROTKB|Q9KQW0
            symbol:VC1888 "Hemolysin-related protein" species:243277
            "Vibrio cholerae O1 biovar El Tor str. N16961" [GO:0003674
            "molecular_function" evidence=ND] [GO:0005575 "cellular_component"
            evidence=ND] [GO:0008150 "biological_process" evidence=ND]
            Prosite:PS00018 GenomeReviews:AE003852_GR InterPro:IPR018247
            EMBL:AE004264 PIR:G82144 RefSeq:NP_231522.1
            ProteinModelPortal:Q9KQW0 DNASU:2613517 GeneID:2613517
            KEGG:vch:VC1888 PATRIC:20082818 ProtClustDB:CLSK793866
            Uniprot:Q9KQW0
        Length = 691

 Score = 108 (43.1 bits), Expect = 0.00013, Sum P(3) = 0.00013
 Identities = 29/70 (41%), Positives = 40/70 (57%)

Query:   467 SPTVVDLDGDGNLDILVGTSFGLFYV--LDHHGKIREKFPLEMAE---IQGAVVAADIND 521
             +P   DLDGDG ++I V TS    Y+  LDH G I+++  L+ A      G +  ADIN 
Sbjct:   120 APAAADLDGDGLIEI-VSTSALTPYINILDHQGNIKKQL-LKSASGWRSVGDIALADING 177

Query:   522 DGKIELVTTD 531
             DG IE++  D
Sbjct:   178 DGNIEILAAD 187

 Score = 58 (25.5 bits), Expect = 0.00013, Sum P(3) = 0.00013
 Identities = 19/81 (23%), Positives = 38/81 (46%)

Query:   381 SHILSTPVIADI--DN-DG-VSEMIIA--VSYFFDHEYYDNPEHLKELGGID---IGKYV 431
             + +++ P++  +  DN DG + E  +A  +   F+   Y N  +++ L G+D   +  Y 
Sbjct:    50 NQVMAAPIVVQLNDDNGDGKIDEKDVADIIVVTFEGNKYANGGYIRALSGVDGSELWSYS 109

Query:   432 AGAIVVFNLDTKQVKWTTDLD 452
              G ++    D +      DLD
Sbjct:   110 NGGVIA---DARYAPAAADLD 127

 Score = 55 (24.4 bits), Expect = 0.00013, Sum P(3) = 0.00013
 Identities = 37/141 (26%), Positives = 62/141 (43%)

Query:   585 SGKDGSKVRPYPYRTHGRVMNQVLLVD-LTKR---GEKSKGLTIVTTSFDGYLYLIDGPT 640
             SGK G  V  Y +   G  +++VL+ D L  R   G+  + + I+  S  G L+  + P 
Sbjct:   551 SGKIG--VSAYDFTGDG--IDEVLVQDRLRMRILDGQTGRVMGIIANS-SGTLW--EYPV 603

Query:   641 SCADVVDI-GETSYSMVLADNVDGGDDLDLIVTTMNGNVFCFSTPAPHHPLKAWRSINQG 699
                 V D+ G  + S+++  N     D D   + +N  VF + +  P  P   WR+  + 
Sbjct:   604 ----VADLEGNNNASLIMVAN-----DYDR-ESQVNHGVFVYESANPSKP---WRNATRI 650

Query:   700 RNNVAIRYNRAGIYVTHPSRA 720
              N  A  ++      T P+ A
Sbjct:   651 WNQYAFNFSDINANGTIPTNA 671

 Score = 54 (24.1 bits), Expect = 0.00033, Sum P(3) = 0.00033
 Identities = 21/70 (30%), Positives = 32/70 (45%)

Query:    62 LELRWQTEV----SSSIYATPLIA---DINSDGKLD------IVVPSFLH-------YLE 101
             L+  W T V    S+ + A P++    D N DGK+D      I+V +F         Y+ 
Sbjct:    36 LKWSWSTSVFHPESNQVMAAPIVVQLNDDNGDGKIDEKDVADIIVVTFEGNKYANGGYIR 95

Query:   102 VLEGSDGDKM 111
              L G DG ++
Sbjct:    96 ALSGVDGSEL 105


>TIGR_CMR|VC_1888
            symbol:VC_1888 "hypothetical protein" species:686 "Vibrio
            cholerae O1 biovar El Tor" [GO:0003674 "molecular_function"
            evidence=ND] [GO:0005575 "cellular_component" evidence=ND]
            [GO:0008150 "biological_process" evidence=ND] Prosite:PS00018
            GenomeReviews:AE003852_GR InterPro:IPR018247 EMBL:AE004264
            PIR:G82144 RefSeq:NP_231522.1 ProteinModelPortal:Q9KQW0
            DNASU:2613517 GeneID:2613517 KEGG:vch:VC1888 PATRIC:20082818
            ProtClustDB:CLSK793866 Uniprot:Q9KQW0
        Length = 691

 Score = 108 (43.1 bits), Expect = 0.00013, Sum P(3) = 0.00013
 Identities = 29/70 (41%), Positives = 40/70 (57%)

Query:   467 SPTVVDLDGDGNLDILVGTSFGLFYV--LDHHGKIREKFPLEMAE---IQGAVVAADIND 521
             +P   DLDGDG ++I V TS    Y+  LDH G I+++  L+ A      G +  ADIN 
Sbjct:   120 APAAADLDGDGLIEI-VSTSALTPYINILDHQGNIKKQL-LKSASGWRSVGDIALADING 177

Query:   522 DGKIELVTTD 531
             DG IE++  D
Sbjct:   178 DGNIEILAAD 187

 Score = 58 (25.5 bits), Expect = 0.00013, Sum P(3) = 0.00013
 Identities = 19/81 (23%), Positives = 38/81 (46%)

Query:   381 SHILSTPVIADI--DN-DG-VSEMIIA--VSYFFDHEYYDNPEHLKELGGID---IGKYV 431
             + +++ P++  +  DN DG + E  +A  +   F+   Y N  +++ L G+D   +  Y 
Sbjct:    50 NQVMAAPIVVQLNDDNGDGKIDEKDVADIIVVTFEGNKYANGGYIRALSGVDGSELWSYS 109

Query:   432 AGAIVVFNLDTKQVKWTTDLD 452
              G ++    D +      DLD
Sbjct:   110 NGGVIA---DARYAPAAADLD 127

 Score = 55 (24.4 bits), Expect = 0.00013, Sum P(3) = 0.00013
 Identities = 37/141 (26%), Positives = 62/141 (43%)

Query:   585 SGKDGSKVRPYPYRTHGRVMNQVLLVD-LTKR---GEKSKGLTIVTTSFDGYLYLIDGPT 640
             SGK G  V  Y +   G  +++VL+ D L  R   G+  + + I+  S  G L+  + P 
Sbjct:   551 SGKIG--VSAYDFTGDG--IDEVLVQDRLRMRILDGQTGRVMGIIANS-SGTLW--EYPV 603

Query:   641 SCADVVDI-GETSYSMVLADNVDGGDDLDLIVTTMNGNVFCFSTPAPHHPLKAWRSINQG 699
                 V D+ G  + S+++  N     D D   + +N  VF + +  P  P   WR+  + 
Sbjct:   604 ----VADLEGNNNASLIMVAN-----DYDR-ESQVNHGVFVYESANPSKP---WRNATRI 650

Query:   700 RNNVAIRYNRAGIYVTHPSRA 720
              N  A  ++      T P+ A
Sbjct:   651 WNQYAFNFSDINANGTIPTNA 671

 Score = 54 (24.1 bits), Expect = 0.00033, Sum P(3) = 0.00033
 Identities = 21/70 (30%), Positives = 32/70 (45%)

Query:    62 LELRWQTEV----SSSIYATPLIA---DINSDGKLD------IVVPSFLH-------YLE 101
             L+  W T V    S+ + A P++    D N DGK+D      I+V +F         Y+ 
Sbjct:    36 LKWSWSTSVFHPESNQVMAAPIVVQLNDDNGDGKIDEKDVADIIVVTFEGNKYANGGYIR 95

Query:   102 VLEGSDGDKM 111
              L G DG ++
Sbjct:    96 ALSGVDGSEL 105


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.315   0.134   0.392    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a
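The bit scores printed next to each raw score in the alignments can be checked against the Lambda and K values in this table using the standard Karlin-Altschul conversion, bits = (Lambda * S - ln K) / ln 2. The Python sketch below assumes that the gapped row (Q=9,R=2: Lambda = 0.244, K = 0.0300) is the one that applies; that assumption is not stated in the report, but with it the results agree with the reported values to the printed precision. The function name `bit_score` is introduced here for illustration only.

```python
import math

# Gapped parameters from the "As Used" columns of the Q=9,R=2 row (assumed to apply)
LAMBDA = 0.244
K = 0.0300


def bit_score(raw_score: int, lam: float = LAMBDA, k: float = K) -> float:
    """Convert a raw alignment score to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


# Raw scores taken from the HSP headers in this report
for raw in (2187, 846, 108):
    print(raw, round(bit_score(raw), 1))
```

For the three raw scores used here, the report lists 774.9, 302.9 and 43.1 bits respectively. Because Lambda and K are themselves printed to only three significant figures, small differences in the last digit would not be surprising for other scores.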

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      857       776   0.00093  121 3  11 23  0.44    34
                                                     37  0.45    37


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  3
  No. of states in DFA:  630 (67 KB)
  Total size of DFA:  402 KB (2194 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  71.64u 0.10s 71.74t   Elapsed:  00:00:04
  Total cpu time:  71.64u 0.10s 71.74t   Elapsed:  00:00:04
  Start:  Fri May 10 10:05:15 2013   End:  Fri May 10 10:05:19 2013
