>032020
MGKSGLLDLEKHFAFYGAYHSNKINVLIHTLFVWPILFSTLMFLHFTPSVCDFSDKVSFL
PCGLFGHALVFNLGFLFTLIYASFYYCLDKKAGSLAALLCFACWVGASLLSNRLGFSLAW
KVAVAAQLICWTGQFLGHGIFEGTSSFG
BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]
Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.
Reference: Gish, W. (1996-2006) http://blast.wustl.edu
Query= 032020
(148 letters)
Database: go_20130330-seqdb.fasta
368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done
                                                                     Smallest
                                                                       Sum
                                                              High   Probability
Sequences producing High-scoring Segment Pairs:               Score  P(N)     N
TAIR|locus:2019135 - symbol:AT1G74440 "AT1G74440" species...    486  2.3e-46  1
TAIR|locus:2034975 - symbol:AT1G18720 "AT1G18720" species...    455  4.5e-43  1
SGD|S000002978 - symbol:YGL010W "Putative protein of unkn...    111  1.0e-13  2
CGD|CAL0004693 - symbol:orf19.1477 species:5476 "Candida ...    178  1.0e-13  1
UNIPROTKB|Q5ALU9 - symbol:CaO19.1477 "Putative uncharacte...    178  1.0e-13  1
POMBASE|SPAC16E8.02 - symbol:SPAC16E8.02 "DUF962 family p...    161  6.4e-12  1
ASPGD|ASPL0000047597 - symbol:AN1522 species:162425 "Emer...    120  1.4e-07  1
>TAIR|locus:2019135
symbol:AT1G74440 "AT1G74440" species:3702 "Arabidopsis
thaliana" [GO:0003674 "molecular_function" evidence=ND] [GO:0008150
"biological_process" evidence=ND] [GO:0009507 "chloroplast"
evidence=ISM] [GO:0009627 "systemic acquired resistance"
evidence=RCA] [GO:0031347 "regulation of defense response"
evidence=RCA] EMBL:CP002684 GenomeReviews:CT485782_GR EMBL:AC011765
eggNOG:COG4539 OMA:VQAFLMA InterPro:IPR009305 Pfam:PF06127
HOGENOM:HOG000263779 IPI:IPI00545961 PIR:C96773 RefSeq:NP_177584.1
UniGene:At.11890 UniGene:At.34880 EnsemblPlants:AT1G74440.1
GeneID:843785 KEGG:ath:AT1G74440 TAIR:At1g74440 InParanoid:Q9CA70
PhylomeDB:Q9CA70 ProtClustDB:CLSN2914568 Genevestigator:Q9CA70
Uniprot:Q9CA70
Length = 208
Score = 486 (176.1 bits), Expect = 2.3e-46, P = 2.3e-46
Identities = 89/138 (64%), Positives = 109/138 (78%)
Query: 5 GLLDLEKHFAFYGAYHSNKINVLIHTLFVWPILFSTLMFLHFTPSVCDFSDKVSFLPCGL 64
GLLDLEKHFAFYGAYHSN IN++IHTLFVWP +F+TL+FL+ TP + D S ++ FL
Sbjct: 6 GLLDLEKHFAFYGAYHSNPINIIIHTLFVWPNVFATLLFLYSTPPILDHS-QLGFLKSLT 64
Query: 65 FGHALVFNLGFLFTLIYASFYYCLDKKAGSLAALLCFACWVGASLLSNRLGFSLAWKVAV 124
F L ++GF T+ YA FY CLDKK+G LAALLCF+CW+G+S L+ RLG SL KV V
Sbjct: 65 FDGVLRLDIGFTLTVTYAVFYICLDKKSGVLAALLCFSCWIGSSFLAARLGHSLTLKVGV 124
Query: 125 AAQLICWTGQFLGHGIFE 142
A+QL+CWTGQFLGHG+FE
Sbjct: 125 ASQLLCWTGQFLGHGLFE 142
>TAIR|locus:2034975
symbol:AT1G18720 "AT1G18720" species:3702 "Arabidopsis
thaliana" [GO:0003674 "molecular_function" evidence=ND] [GO:0008150
"biological_process" evidence=ND] [GO:0009507 "chloroplast"
evidence=ISM] EMBL:CP002684 GenomeReviews:CT485782_GR EMBL:AC011809
eggNOG:COG4539 InterPro:IPR009305 Pfam:PF06127 HOGENOM:HOG000263779
OMA:GLWAAQT ProtClustDB:CLSN2914568 EMBL:AY054209 EMBL:AY066034
IPI:IPI00522015 PIR:A86321 RefSeq:NP_564061.1 UniGene:At.11916
IntAct:Q9M9U3 PaxDb:Q9M9U3 PRIDE:Q9M9U3 EnsemblPlants:AT1G18720.1
GeneID:838454 KEGG:ath:AT1G18720 TAIR:At1g18720 InParanoid:Q9M9U3
PhylomeDB:Q9M9U3 Genevestigator:Q9M9U3 Uniprot:Q9M9U3
Length = 206
Score = 455 (165.2 bits), Expect = 4.5e-43, P = 4.5e-43
Identities = 85/138 (61%), Positives = 105/138 (76%)
Query: 5 GLLDLEKHFAFYGAYHSNKINVLIHTLFVWPILFSTLMFLHFTPSVCDFSDKVSFLPCGL 64
GL DLEKHFAFYGAYHSN IN+LIH +FVWPI FS L+ LH + + D S ++ F
Sbjct: 6 GLFDLEKHFAFYGAYHSNPINILIHIIFVWPIFFSVLLLLHSSTPIFDPS-QLGFSQSLT 64
Query: 65 FGHALVFNLGFLFTLIYASFYYCLDKKAGSLAALLCFACWVGASLLSNRLGFSLAWKVAV 124
L FN+GF+F LIYA FY LDKK+G +AAL+CF+CWVG+S L+ RLG SLA KV +
Sbjct: 65 LDGVLRFNVGFIFALIYALFYIGLDKKSGFVAALMCFSCWVGSSFLAVRLGSSLALKVGL 124
Query: 125 AAQLICWTGQFLGHGIFE 142
A+QL+CWTGQF+GHG+FE
Sbjct: 125 ASQLLCWTGQFVGHGVFE 142
>SGD|S000002978
symbol:YGL010W "Putative protein of unknown function"
species:4932 "Saccharomyces cerevisiae" [GO:0005789 "endoplasmic
reticulum membrane" evidence=IEA] [GO:0008150 "biological_process"
evidence=ND] [GO:0003674 "molecular_function" evidence=ND]
[GO:0016020 "membrane" evidence=IEA] [GO:0016021 "integral to
membrane" evidence=ISM;IEA] [GO:0005783 "endoplasmic reticulum"
evidence=IEA;IDA] SGD:S000002978 GO:GO:0005783 GO:GO:0016021
EMBL:BK006941 GO:GO:0005789 EMBL:S58126 eggNOG:COG4539
OrthoDB:EOG4BCHXF InterPro:IPR009305 Pfam:PF06127 EMBL:S57893
EMBL:Z72532 PIR:S64012 RefSeq:NP_011505.1 ProteinModelPortal:P25338
DIP:DIP-4727N MINT:MINT-479772 STRING:P25338 PaxDb:P25338
EnsemblFungi:YGL010W GeneID:852874 KEGG:sce:YGL010W CYGD:YGL010w
HOGENOM:HOG000263779 OMA:GLWAAQT NextBio:972508
Genevestigator:P25338 GermOnline:YGL010W Uniprot:P25338
Length = 174
Score = 111 (44.1 bits), Expect = 1.0e-13, Sum P(2) = 1.0e-13
Identities = 25/45 (55%), Positives = 28/45 (62%)
Query: 1 MGKSGLLDLEKHFAFYGAYHSNKINVLIHTLFVWPILFSTLMFLH 45
MG+ GLLDL FY YH N NVLIH++FV ILFS LH
Sbjct: 1 MGE-GLLDLRSQLGFYKFYHHNPKNVLIHSIFVPTILFSGSCMLH 44
Score = 80 (33.2 bits), Expect = 1.0e-13, Sum P(2) = 1.0e-13
Identities = 21/71 (29%), Positives = 38/71 (53%)
Query: 72 NLGFLFTLIYASFYYCLDKKAGSLAALLCFACWVGASLLSNRLGFSLAWKVAVAAQLICW 131
+L + +++++ FY L G LA +L + +L+ +R+ L +K + I W
Sbjct: 53 SLTAVLSVLFSIFYCLLYLPTGLLAGVLLLL--LNLALIDHRV--DLTFKQELGLFTIGW 108
Query: 132 TGQFLGHGIFE 142
QF+GHG+FE
Sbjct: 109 IFQFVGHGVFE 119
>CGD|CAL0004693
symbol:orf19.1477 species:5476 "Candida albicans" [GO:0003674
"molecular_function" evidence=ND] [GO:0005783 "endoplasmic
reticulum" evidence=IEA] [GO:0008150 "biological_process"
evidence=ND] CGD:CAL0004693 EMBL:AACQ01000008 EMBL:AACQ01000007
eggNOG:COG4539 InterPro:IPR009305 Pfam:PF06127 HOGENOM:HOG000263779
RefSeq:XP_722387.1 RefSeq:XP_722526.1 GeneID:3635742 GeneID:3635999
KEGG:cal:CaO19.1477 KEGG:cal:CaO19.9052 Uniprot:Q5ALU9
Length = 192
Score = 178 (67.7 bits), Expect = 1.0e-13, P = 1.0e-13
Identities = 53/144 (36%), Positives = 71/144 (49%)
Query: 5 GLLDLEKHFAFYGAYHSNKINVLIHTLFVWPILFSTLMFLHFTPSVCDFSDKVSFLPCGL 64
GL DLE H FY +YH N NV IH + + IL ST+ FL TP +F GL
Sbjct: 3 GLFDLESHLVFYRSYHFNHTNVTIHLICIPIILLSTIAFL--TPVTINFG--------GL 52
Query: 65 FGHALVFNLGFLFTLIYASFYYCLDKKAGSLAA--LLCFACWVGA---SLLSNRLGFSLA 119
++ +NLG L Y +Y LD + G AA L FA ++ +L + S
Sbjct: 53 INNSN-YNLGSLLAWSYGIYYILLDWQIGLPAAGVLFSFAHYIKQYYLTLSETSVPTSNE 111
Query: 120 W-KVAVAAQLICWTGQFLGHGIFE 142
+ K+AVA + W QF GHG+ E
Sbjct: 112 FVKIAVALHVFSWFAQFYGHGVHE 135
>UNIPROTKB|Q5ALU9
symbol:CaO19.1477 "Putative uncharacterized protein"
species:237561 "Candida albicans SC5314" [GO:0003674
"molecular_function" evidence=ND] [GO:0008150 "biological_process"
evidence=ND] CGD:CAL0004693 EMBL:AACQ01000008 EMBL:AACQ01000007
eggNOG:COG4539 InterPro:IPR009305 Pfam:PF06127 HOGENOM:HOG000263779
RefSeq:XP_722387.1 RefSeq:XP_722526.1 GeneID:3635742 GeneID:3635999
KEGG:cal:CaO19.1477 KEGG:cal:CaO19.9052 Uniprot:Q5ALU9
Length = 192
Score = 178 (67.7 bits), Expect = 1.0e-13, P = 1.0e-13
Identities = 53/144 (36%), Positives = 71/144 (49%)
Query: 5 GLLDLEKHFAFYGAYHSNKINVLIHTLFVWPILFSTLMFLHFTPSVCDFSDKVSFLPCGL 64
GL DLE H FY +YH N NV IH + + IL ST+ FL TP +F GL
Sbjct: 3 GLFDLESHLVFYRSYHFNHTNVTIHLICIPIILLSTIAFL--TPVTINFG--------GL 52
Query: 65 FGHALVFNLGFLFTLIYASFYYCLDKKAGSLAA--LLCFACWVGA---SLLSNRLGFSLA 119
++ +NLG L Y +Y LD + G AA L FA ++ +L + S
Sbjct: 53 INNSN-YNLGSLLAWSYGIYYILLDWQIGLPAAGVLFSFAHYIKQYYLTLSETSVPTSNE 111
Query: 120 W-KVAVAAQLICWTGQFLGHGIFE 142
+ K+AVA + W QF GHG+ E
Sbjct: 112 FVKIAVALHVFSWFAQFYGHGVHE 135
>POMBASE|SPAC16E8.02
symbol:SPAC16E8.02 "DUF962 family protein" species:4896
"Schizosaccharomyces pombe" [GO:0003674 "molecular_function"
evidence=ND] [GO:0005783 "endoplasmic reticulum" evidence=IDA]
[GO:0005789 "endoplasmic reticulum membrane" evidence=IEA]
[GO:0008150 "biological_process" evidence=ND] [GO:0016021 "integral
to membrane" evidence=IEA] PomBase:SPAC16E8.02 GO:GO:0005783
GO:GO:0016021 EMBL:CU329670 GO:GO:0016020 GO:GO:0005789 PIR:T37782
RefSeq:NP_594214.1 EnsemblFungi:SPAC16E8.02.1 GeneID:2542322
KEGG:spo:SPAC16E8.02 eggNOG:COG4539 OMA:VQAFLMA OrthoDB:EOG4BCHXF
NextBio:20803383 InterPro:IPR009305 Pfam:PF06127 Uniprot:O13737
Length = 222
Score = 161 (61.7 bits), Expect = 6.4e-12, P = 6.4e-12
Identities = 44/135 (32%), Positives = 65/135 (48%)
Query: 9 LEKHFAFYGAYHSNKINVLIHTLFVWPILFSTLMFLHFTPSVCDFSDKVSFLPCGLFGHA 68
L + ++FY AYHSN +N+ IH + + +L + L+ LH +F+ L
Sbjct: 4 LSRSYSFYAAYHSNPVNIKIHQVCIPLLLLTALVLLH------------NFV-ITLINSK 50
Query: 69 LVFNLGFLFTLIYASFYYCLDKKAGSL-AALLCFACWVGASLLSNRLGFSLAWKVAVAAQ 127
L N+ L L Y FY LD G L + +L ++ S L SL + A
Sbjct: 51 LQINVAHLVGLAYQIFYVTLDPLDGLLYSPVLYLFSYILPSKLFTIFSRSLVNRSAAVVH 110
Query: 128 LICWTGQFLGHGIFE 142
+ICW QF+GHG+FE
Sbjct: 111 VICWILQFIGHGVFE 125
>ASPGD|ASPL0000047597
symbol:AN1522 species:162425 "Emericella nidulans"
[GO:0008150 "biological_process" evidence=ND] [GO:0003674
"molecular_function" evidence=ND] [GO:0005783 "endoplasmic
reticulum" evidence=IEA] EMBL:BN001307 eggNOG:COG4539
OrthoDB:EOG4BCHXF InterPro:IPR009305 Pfam:PF06127 EMBL:AACD01000024
HOGENOM:HOG000263779 RefSeq:XP_659126.1
EnsemblFungi:CADANIAT00008150 GeneID:2875243 KEGG:ani:AN1522.2
OMA:FVGHGAF Uniprot:Q5BD58
Length = 181
Score = 120 (47.3 bits), Expect = 1.4e-07, P = 1.4e-07
Identities = 44/140 (31%), Positives = 63/140 (45%)
Query: 7 LDLEKHFAFYGAYHSNKINVLIHTLFVWPILFSTLMFLHFTPSVCDFSDKVSFLPCGLFG 66
L+LEK F +NV IH V +LF+ + +P + + + F
Sbjct: 3 LNLEKQLLF--------VNVAIHITCVPILLFTGIAMASNSPPLIKLPEVLQF------- 47
Query: 67 HALVFNLGFLFTLIYASFYYCLDKKAGSLAALLCFACWVGASLLSNRLGFSLAWKVAV-- 124
L N+G + L YA FY L+ AG+L A L +GA+ L NRL + V
Sbjct: 48 EDLPPNIGTIAALFYAIFYVLLEPVAGTLIAPLL----LGAAALGNRLIATYGMTVNYWF 103
Query: 125 -AAQLICWTGQFLGHGIFEG 143
++ W QF+GHG FEG
Sbjct: 104 GGIHVVSWLLQFVGHGAFEG 123
Parameters:
V=100
filter=SEG
E=0.001
ctxfactor=1.00
  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.332   0.145   0.489    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length        E    S  W   T   X    E2  S2
     +0      0     148         148  0.00068  104  3  11  21  0.48  30
                                                         30  0.39  34
Statistics:
Database: /share/blast/go-seqdb.fasta
Title: go_20130330-seqdb.fasta
Posted: 5:47:42 AM PDT Apr 1, 2013
Created: 5:47:42 AM PDT Apr 1, 2013
Format: XDF-1
# of letters in database: 169,044,731
# of sequences in database: 368,745
# of database sequences satisfying E: 7
No. of states in DFA: 565 (60 KB)
Total size of DFA: 148 KB (2090 KB)
Time to generate neighborhood: 0.00u 0.00s 0.00t Elapsed: 00:00:00
No. of threads or processors used: 24
Search cpu time: 14.46u 0.13s 14.59t Elapsed: 00:00:01
Total cpu time: 14.46u 0.13s 14.59t Elapsed: 00:00:01
Start: Fri May 10 11:37:47 2013 End: Fri May 10 11:37:48 2013