Your job contains 1 sequence.
>003012
MKSSTTSANCVLLICLLLFNSARGGDNSEQNKFRQREATDDQLGLPQIDEDALVNTQCPK
NLELRWQTEVSSSIYATPLIADINSDGKLDIVVPSFLHYLEVLEGSDGDKMPGWPAFHQS
SVHSSPLLYDIDKDGVREIALATYNGEVLFFRVSGYMMTDKLEIPRRKVRKDWYVGLHSD
PVDRSHPDVHDDLIVQESEAARMKSMLETKKSTPETNATVTTSTESNPAPATVSNPDVKK
VNESLVNVSNPSEERKVNESHTEMNIKLPTSVDNSSTTTVSGGTNSSENGTNTGRRLLED
NNSKGSQEGNDKEDVPVATAENDQALDENADSSFELFRDTDELADEYNYDYDDYVDDAMW
GDEEWTEEQHEKIEDYVNVDSHILSTPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLK
ELGGIDIGKYVAGAIVVFNLDTKQVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLD
ILVGTSFGLFYVLDHHGKIREKFPLEMAEIQGAVVAADINDDGKIELVTTDTHGNVAAWT
AEGKGIWEQHLKSLVTQGPSIGDVDGDGHSDVVVPTLSGNIYVLSGKDGSKVRPYPYRTH
GRVMNQVLLVDLTKRGEKSKGLTIVTTSFDGYLYLIDGPTSCADVVDIGETSYSMVLADN
VDGGDDLDLIVTTMNGNVFCFSTPAPHHPLKAWRSINQGRNNVAIRYNRAGIYVTHPSRA
FRDEEGRNFWVEIEIVDEYRFPSGSQAPYNVTTTLLVPGNYQGERRIKQSQIFARRGKYR
IKLPTVGVRTTGTVLVEMVDKNGLYFSDEFSLTFHMYYYKLLKWLLVLPMLGMFGVLVIL
RPQEAMPLPSFSRNTDL
BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]
Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.
Reference: Gish, W. (1996-2006) http://blast.wustl.edu
Query= 003012
(857 letters)
Database: go_20130330-seqdb.fasta
368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done
                                                                      Smallest
                                                                        Sum
                                                               High  Probability
Sequences producing High-scoring Segment Pairs:               Score  P(N)      N
TAIR|locus:2095274 - symbol:DEX1 "DEFECTIVE IN EXINE FORM...   2187  1.1e-313  2
UNIPROTKB|Q9KQW0 - symbol:VC1888 "Hemolysin-related prote...    108  0.00013   3
TIGR_CMR|VC_1888 - symbol:VC_1888 "hypothetical protein" ...    108  0.00013   3
>TAIR|locus:2095274 [details] [associations]
symbol:DEX1 "DEFECTIVE IN EXINE FORMATION 1" species:3702
"Arabidopsis thaliana" [GO:0005576 "extracellular region"
evidence=ISM] [GO:0005509 "calcium ion binding" evidence=ISS]
[GO:0010208 "pollen wall assembly" evidence=IMP] [GO:0016020
"membrane" evidence=ISS] [GO:0005783 "endoplasmic reticulum"
evidence=IDA] Pfam:PF01839 GO:GO:0005783 EMBL:CP002686
GO:GO:0016020 GO:GO:0005509 InterPro:IPR013517 GO:GO:0010208
IPI:IPI00534148 RefSeq:NP_566343.1 UniGene:At.16907
UniGene:At.69661 ProteinModelPortal:F4IYM4 SMR:F4IYM4 PRIDE:F4IYM4
EnsemblPlants:AT3G09090.1 GeneID:820063 KEGG:ath:AT3G09090
OMA:ADEYSYD Uniprot:F4IYM4
Length = 896
Score = 2187 (774.9 bits), Expect = 1.1e-313, Sum P(2) = 1.1e-313
Identities = 435/654 (66%), Positives = 484/654 (74%)
Query: 211 KSTPET-NATVTTSTESNPAPATVSNPDVKKVNESLVNVSNPSEERKVNESHTEMNIKLP 269
K TPE N+++ + A AT + + +N ++ +N ++ K++ E IKL
Sbjct: 246 KPTPELHNSSMDAGANNLAANATTAGSR-ENLNRNVT--TNEVDQSKISGDKNETVIKLN 302
Query: 270 XXXXXXXXXXXXXXXXXXXXXXXXX-RRLLEDNNSKGSQEGN-DKED----VPVATAEND 323
RRLLE++ SK S + + D +D V +AT END
Sbjct: 303 TSTGNSSETLGTSGNSSTAETVTKSGRRLLEEDGSKESVDSHSDSKDNSEGVRMATVEND 362
Query: 324 QALDENADSSFELFRDTDELAXXXXXXXXXXXXXAMWGDEEWTEEQHEKIEDYVNVDSHI 383
L+ +ADSSFEL R+ DELA MWGDEEW E QHE EDYVN+D+HI
Sbjct: 363 GGLEADADSSFELLRENDELADEYSYDYDDYVDEKMWGDEEWVEGQHENSEDYVNIDAHI 422
Query: 384 LSTPVIADIDNDGVSEMIIAVSYFFDHEYYDNPEHLKELGGIDIGKYVAGAIVVFNLDTK 443
L TPVIADID DGV EMI+AVSYFFD EYYDNPEHLKELGGIDI Y+A +IVVFNLDTK
Sbjct: 423 LCTPVIADIDKDGVQEMIVAVSYFFDPEYYDNPEHLKELGGIDIKNYIASSIVVFNLDTK 482
Query: 444 QVKWTTDLDLSTDNASFRAYIYSSPTVVDLDGDGNLDILVGTSFGLFYVLDHHGKIREKF 503
QVKW +LDLSTD A+FRAYIYSSPTVVDLDGDG LDILVGTSFGLFY +DH G IREKF
Sbjct: 483 QVKWIKELDLSTDKANFRAYIYSSPTVVDLDGDGYLDILVGTSFGLFYAMDHRGNIREKF 542
Query: 504 PLEMAEIQGAVVAADINDDGKIELVTTDTHGNVAAWTAEGKGIWEQHLKSLVTQGPSIXX 563
PLEMAEIQGAVVAADINDDGKIELVTTD+HGN+AAWT +G IWE HLKSLV QGPSI
Sbjct: 543 PLEMAEIQGAVVAADINDDGKIELVTTDSHGNIAAWTTQGVEIWEAHLKSLVPQGPSIGD 602
Query: 564 XXXXXXXXXXXPTLSGNIYVLSGKDGSKVRPYPYRTHGRVMNQVLLVDLTKRGEKSKGLT 623
PT SGNIYVLSGKDGS VRPYPYRTHGRVMNQ+LLVDL KRGEK KGLT
Sbjct: 603 VDGDGHTEVVVPTSSGNIYVLSGKDGSIVRPYPYRTHGRVMNQLLLVDLNKRGEKKKGLT 662
Query: 624 IVTTSFDGYLYLIDGPTSCADVVDIGETSYSMVLADNVDGGDDLDLIVTTMNGNVFCFST 683
IVTTSFDGYLYLIDGPTSC DVVDIGETSYSMVLADNVDGGDDLDLIV+TMNGNVFCFST
Sbjct: 663 IVTTSFDGYLYLIDGPTSCTDVVDIGETSYSMVLADNVDGGDDLDLIVSTMNGNVFCFST 722
Query: 684 PAPHHPLKAWRSINQGRNNVAIRYNRAGIYVTHPSRAFRDEEGRNFWVEIEIVDEYRFPS 743
P+PHHPLKAWRS +QGRNN A RY+R G++VTH +R FRDEEG+NFW EIEIVD+YR+PS
Sbjct: 723 PSPHHPLKAWRSSDQGRNNKANRYDREGVFVTHSTRGFRDEEGKNFWAEIEIVDKYRYPS 782
Query: 744 GSQAPYNVTTTLLVPGNYQGERRIKQSQIFARRGKYRIKLPXXXXXXXXXXXXEMVDKNG 803
GSQAPYNVTTTLLVPGNYQGERRI QSQI+ R GKYRIKLP EM DKNG
Sbjct: 783 GSQAPYNVTTTLLVPGNYQGERRITQSQIYDRPGKYRIKLPTVGVRTTGTVMVEMADKNG 842
Query: 804 LYFSDEFSLTFHXXXXXXXXXXXXXXXXXXFGVLVILRPQEAMPLPSFSRNTDL 857
L+FSDEFSLTFH FG+LVILRPQEA+PLPSFSRNTDL
Sbjct: 843 LHFSDEFSLTFHMYYYKLLKWLLVLPMLGMFGLLVILRPQEAVPLPSFSRNTDL 896
Score = 846 (302.9 bits), Expect = 1.1e-313, Sum P(2) = 1.1e-313
Identities = 173/267 (64%), Positives = 205/267 (76%)
Query: 1 MKSSTTSANCVLLICLLLFNSARGGDNSEQNKFRQREATDDQLGLPQIDEDALVNTQCPK 60
MKS V L+CL L N + G +NKFR+R+ATDD+LG P IDEDAL+NTQCPK
Sbjct: 1 MKSRARQCLLVCLLCLSLTNLSYG-----ENKFRERKATDDELGYPDIDEDALLNTQCPK 55
Query: 61 NLELRWQTEVSSSIYATPLIADINSDGKLDIVVPSFLHYLEVLEGSDGDKMPGWPAFHQS 120
LELRWQTEV+SS+YATPLIADINSDGKLDIVVPSF+HYLEVLEG+DGDKMPGWPAFHQS
Sbjct: 56 KLELRWQTEVTSSVYATPLIADINSDGKLDIVVPSFVHYLEVLEGADGDKMPGWPAFHQS 115
Query: 121 SVHSSPLLYDIDKDGVREIALATYNGEVLFFRVSGYMMTDKLEIPRRKVRKDWYVGLHSD 180
+VHSSPLL+DIDKDGVREIALATYN EVLFFRVSG++M+DKLE+PRRKV K+W+VGL+ D
Sbjct: 116 NVHSSPLLFDIDKDGVREIALATYNAEVLFFRVSGFLMSDKLEVPRRKVHKNWHVGLNPD 175
Query: 181 PVDRSHPDVHDDLIVQESEAARMKSMLETKKSTPETNATVTTSTESNPAPATVSNPDVKK 240
PVDRSHPDVHDD++ E EA MKS ST +TNAT TT TVS K+
Sbjct: 176 PVDRSHPDVHDDVL--EEEAMAMKS------STTQTNATTTTPN------VTVSM--TKE 219
Query: 241 VNESLVNVSNPSEERKVNESHTEMNIK 267
V+ + VS ++++ + TE +K
Sbjct: 220 VHGANSYVSTQEDQKRPENNQTEAIVK 246
Score = 44 (20.5 bits), Expect = 7.7e-229, Sum P(2) = 7.7e-229
Identities = 16/71 (22%), Positives = 31/71 (43%)
Query: 179 SDPVDRSHPDVHDDLIVQESEAARMKSMLETKKSTPETNATVTTSTESNPAPATVSNPDV 238
++ + + P++H+ + ++ A + + T S N VTT+ + N V
Sbjct: 241 TEAIVKPTPELHNSSM--DAGANNLAANATTAGSRENLNRNVTTNEVDQSKISGDKNETV 298
Query: 239 KKVNESLVNVS 249
K+N S N S
Sbjct: 299 IKLNTSTGNSS 309
>UNIPROTKB|Q9KQW0 [details] [associations]
symbol:VC1888 "Hemolysin-related protein" species:243277
"Vibrio cholerae O1 biovar El Tor str. N16961" [GO:0003674
"molecular_function" evidence=ND] [GO:0005575 "cellular_component"
evidence=ND] [GO:0008150 "biological_process" evidence=ND]
Prosite:PS00018 GenomeReviews:AE003852_GR InterPro:IPR018247
EMBL:AE004264 PIR:G82144 RefSeq:NP_231522.1
ProteinModelPortal:Q9KQW0 DNASU:2613517 GeneID:2613517
KEGG:vch:VC1888 PATRIC:20082818 ProtClustDB:CLSK793866
Uniprot:Q9KQW0
Length = 691
Score = 108 (43.1 bits), Expect = 0.00013, Sum P(3) = 0.00013
Identities = 29/70 (41%), Positives = 40/70 (57%)
Query: 467 SPTVVDLDGDGNLDILVGTSFGLFYV--LDHHGKIREKFPLEMAE---IQGAVVAADIND 521
+P DLDGDG ++I V TS Y+ LDH G I+++ L+ A G + ADIN
Sbjct: 120 APAAADLDGDGLIEI-VSTSALTPYINILDHQGNIKKQL-LKSASGWRSVGDIALADING 177
Query: 522 DGKIELVTTD 531
DG IE++ D
Sbjct: 178 DGNIEILAAD 187
Score = 58 (25.5 bits), Expect = 0.00013, Sum P(3) = 0.00013
Identities = 19/81 (23%), Positives = 38/81 (46%)
Query: 381 SHILSTPVIADI--DN-DG-VSEMIIA--VSYFFDHEYYDNPEHLKELGGID---IGKYV 431
+ +++ P++ + DN DG + E +A + F+ Y N +++ L G+D + Y
Sbjct: 50 NQVMAAPIVVQLNDDNGDGKIDEKDVADIIVVTFEGNKYANGGYIRALSGVDGSELWSYS 109
Query: 432 AGAIVVFNLDTKQVKWTTDLD 452
G ++ D + DLD
Sbjct: 110 NGGVIA---DARYAPAAADLD 127
Score = 55 (24.4 bits), Expect = 0.00013, Sum P(3) = 0.00013
Identities = 37/141 (26%), Positives = 62/141 (43%)
Query: 585 SGKDGSKVRPYPYRTHGRVMNQVLLVD-LTKR---GEKSKGLTIVTTSFDGYLYLIDGPT 640
SGK G V Y + G +++VL+ D L R G+ + + I+ S G L+ + P
Sbjct: 551 SGKIG--VSAYDFTGDG--IDEVLVQDRLRMRILDGQTGRVMGIIANS-SGTLW--EYPV 603
Query: 641 SCADVVDI-GETSYSMVLADNVDGGDDLDLIVTTMNGNVFCFSTPAPHHPLKAWRSINQG 699
V D+ G + S+++ N D D + +N VF + + P P WR+ +
Sbjct: 604 ----VADLEGNNNASLIMVAN-----DYDR-ESQVNHGVFVYESANPSKP---WRNATRI 650
Query: 700 RNNVAIRYNRAGIYVTHPSRA 720
N A ++ T P+ A
Sbjct: 651 WNQYAFNFSDINANGTIPTNA 671
Score = 54 (24.1 bits), Expect = 0.00033, Sum P(3) = 0.00033
Identities = 21/70 (30%), Positives = 32/70 (45%)
Query: 62 LELRWQTEV----SSSIYATPLIA---DINSDGKLD------IVVPSFLH-------YLE 101
L+ W T V S+ + A P++ D N DGK+D I+V +F Y+
Sbjct: 36 LKWSWSTSVFHPESNQVMAAPIVVQLNDDNGDGKIDEKDVADIIVVTFEGNKYANGGYIR 95
Query: 102 VLEGSDGDKM 111
L G DG ++
Sbjct: 96 ALSGVDGSEL 105
>TIGR_CMR|VC_1888 [details] [associations]
symbol:VC_1888 "hypothetical protein" species:686 "Vibrio
cholerae O1 biovar El Tor" [GO:0003674 "molecular_function"
evidence=ND] [GO:0005575 "cellular_component" evidence=ND]
[GO:0008150 "biological_process" evidence=ND] Prosite:PS00018
GenomeReviews:AE003852_GR InterPro:IPR018247 EMBL:AE004264
PIR:G82144 RefSeq:NP_231522.1 ProteinModelPortal:Q9KQW0
DNASU:2613517 GeneID:2613517 KEGG:vch:VC1888 PATRIC:20082818
ProtClustDB:CLSK793866 Uniprot:Q9KQW0
Length = 691
Score = 108 (43.1 bits), Expect = 0.00013, Sum P(3) = 0.00013
Identities = 29/70 (41%), Positives = 40/70 (57%)
Query: 467 SPTVVDLDGDGNLDILVGTSFGLFYV--LDHHGKIREKFPLEMAE---IQGAVVAADIND 521
+P DLDGDG ++I V TS Y+ LDH G I+++ L+ A G + ADIN
Sbjct: 120 APAAADLDGDGLIEI-VSTSALTPYINILDHQGNIKKQL-LKSASGWRSVGDIALADING 177
Query: 522 DGKIELVTTD 531
DG IE++ D
Sbjct: 178 DGNIEILAAD 187
Score = 58 (25.5 bits), Expect = 0.00013, Sum P(3) = 0.00013
Identities = 19/81 (23%), Positives = 38/81 (46%)
Query: 381 SHILSTPVIADI--DN-DG-VSEMIIA--VSYFFDHEYYDNPEHLKELGGID---IGKYV 431
+ +++ P++ + DN DG + E +A + F+ Y N +++ L G+D + Y
Sbjct: 50 NQVMAAPIVVQLNDDNGDGKIDEKDVADIIVVTFEGNKYANGGYIRALSGVDGSELWSYS 109
Query: 432 AGAIVVFNLDTKQVKWTTDLD 452
G ++ D + DLD
Sbjct: 110 NGGVIA---DARYAPAAADLD 127
Score = 55 (24.4 bits), Expect = 0.00013, Sum P(3) = 0.00013
Identities = 37/141 (26%), Positives = 62/141 (43%)
Query: 585 SGKDGSKVRPYPYRTHGRVMNQVLLVD-LTKR---GEKSKGLTIVTTSFDGYLYLIDGPT 640
SGK G V Y + G +++VL+ D L R G+ + + I+ S G L+ + P
Sbjct: 551 SGKIG--VSAYDFTGDG--IDEVLVQDRLRMRILDGQTGRVMGIIANS-SGTLW--EYPV 603
Query: 641 SCADVVDI-GETSYSMVLADNVDGGDDLDLIVTTMNGNVFCFSTPAPHHPLKAWRSINQG 699
V D+ G + S+++ N D D + +N VF + + P P WR+ +
Sbjct: 604 ----VADLEGNNNASLIMVAN-----DYDR-ESQVNHGVFVYESANPSKP---WRNATRI 650
Query: 700 RNNVAIRYNRAGIYVTHPSRA 720
N A ++ T P+ A
Sbjct: 651 WNQYAFNFSDINANGTIPTNA 671
Score = 54 (24.1 bits), Expect = 0.00033, Sum P(3) = 0.00033
Identities = 21/70 (30%), Positives = 32/70 (45%)
Query: 62 LELRWQTEV----SSSIYATPLIA---DINSDGKLD------IVVPSFLH-------YLE 101
L+ W T V S+ + A P++ D N DGK+D I+V +F Y+
Sbjct: 36 LKWSWSTSVFHPESNQVMAAPIVVQLNDDNGDGKIDEKDVADIIVVTFEGNKYANGGYIR 95
Query: 102 VLEGSDGDKM 111
L G DG ++
Sbjct: 96 ALSGVDGSEL 105
Parameters:
V=100
filter=SEG
E=0.001
ctxfactor=1.00
Query                        -----  As Used  -----    -----  Computed  ----
Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
 +0      0   BLOSUM62        0.315   0.134   0.392    same    same    same
             Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a
Query
Frame  MatID  Length  Eff.Length     E      S   W   T   X   E2    S2
 +0      0      857         776  0.00093  121   3  11  23  0.44   34
                                                       37  0.45   37
Statistics:
Database: /share/blast/go-seqdb.fasta
Title: go_20130330-seqdb.fasta
Posted: 5:47:42 AM PDT Apr 1, 2013
Created: 5:47:42 AM PDT Apr 1, 2013
Format: XDF-1
# of letters in database: 169,044,731
# of sequences in database: 368,745
# of database sequences satisfying E: 3
No. of states in DFA: 630 (67 KB)
Total size of DFA: 402 KB (2194 KB)
Time to generate neighborhood: 0.00u 0.00s 0.00t Elapsed: 00:00:00
No. of threads or processors used: 24
Search cpu time: 71.64u 0.10s 71.74t Elapsed: 00:00:04
Total cpu time: 71.64u 0.10s 71.74t Elapsed: 00:00:04
Start: Fri May 10 10:05:15 2013 End: Fri May 10 10:05:19 2013