BLAST Results

Query Summary

Your job contains 1 sequence.

Parameters
Threshold: 0.001
Maximum number of alignments shown: 100
BLAST filter: on

Query Sequence

>021479
MGSENLLSQSNKSVNLRLSNVSIYTAEVLEIEADDPKLHVLFVPGNPGVITFYKDFVQSL
YEHLGGNASISAIGSAAQTKKNYDHGRLFSLDEQVEHKMDFIRQELQNTEVPIVLVGHSI
GAYVALEMLKRSSEKVIYYIGLYPFLALIRPSVTQSIIGRVAASNIASTALSYIIASLGI
LPSKALRFLVSNSLGRSWSATAVEAACTHLSQYHVMRNVLFMTMTEFKQLKNTPDWAFMR
ENQSKIAFLFGVDDHWGPQELYEEISEQVPDVPLAIERHGHTHNFCCSEAGSAWVASHVA
GLIKNKIPSLSK

High Scoring Gene Products

Symbol, full name Information P value
AT3G11620 protein from Arabidopsis thaliana 3.9e-99
zgc:195062 gene_product from Danio rerio 5.2e-21
DDB_G0286581
UPF0554 family protein
gene from Dictyostelium discoideum 2.8e-18
1110057K04Rik
RIKEN cDNA 1110057K04 gene
protein from Mus musculus 2.9e-14
YPR147C
Putative protein of unknown function
gene from Saccharomyces cerevisiae 4.3e-10
CG9186 protein from Drosophila melanogaster 7.9e-10

Back to top

Raw Blast Data

BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]

Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.

Reference:  Gish, W. (1996-2006) http://blast.wustl.edu

Query=  021479
        (312 letters)

Database:  go_20130330-seqdb.fasta
           368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done

                                                                     Smallest
                                                                       Sum
                                                              High  Probability
Sequences producing High-scoring Segment Pairs:              Score  P(N)      N

TAIR|locus:2080802 - symbol:AT3G11620 "AT3G11620" species...   984  3.9e-99   1
ZFIN|ZDB-GENE-081022-178 - symbol:zgc:195062 "zgc:195062"...   247  5.2e-21   1
DICTYBASE|DDB_G0286581 - symbol:DDB_G0286581 "UPF0554 fam...   221  2.8e-18   1
MGI|MGI:1916082 - symbol:1110057K04Rik "RIKEN cDNA 111005...   199  2.9e-14   1
SGD|S000006351 - symbol:YPR147C "Putative protein of unkn...   165  4.3e-10   1
FB|FBgn0035206 - symbol:CG9186 species:7227 "Drosophila m...   163  7.9e-10   1


>TAIR|locus:2080802 [details] [associations]
            symbol:AT3G11620 "AT3G11620" species:3702 "Arabidopsis
            thaliana" [GO:0003674 "molecular_function" evidence=ND] [GO:0005739
            "mitochondrion" evidence=ISM] [GO:0008150 "biological_process"
            evidence=ND] [GO:0000394 "RNA splicing, via endonucleolytic
            cleavage and ligation" evidence=RCA] [GO:0009086 "methionine
            biosynthetic process" evidence=RCA] [GO:0030003 "cellular cation
            homeostasis" evidence=RCA] EMBL:CP002686 GenomeReviews:BA000014_GR
            eggNOG:NOG149669 InterPro:IPR019363 PANTHER:PTHR13390 Pfam:PF10230
            EMBL:AK229136 IPI:IPI00519523 RefSeq:NP_566394.1 RefSeq:NP_850560.1
            UniGene:At.39721 ProteinModelPortal:Q0WPD9 SMR:Q0WPD9 PaxDb:Q0WPD9
            PRIDE:Q0WPD9 EnsemblPlants:AT3G11620.1 EnsemblPlants:AT3G11620.2
            GeneID:820334 KEGG:ath:AT3G11620 TAIR:At3g11620
            HOGENOM:HOG000030029 InParanoid:Q0WPD9 OMA:RNHDRIS PhylomeDB:Q0WPD9
            ProtClustDB:CLSN2688269 Genevestigator:Q0WPD9 Uniprot:Q0WPD9
        Length = 312

 Score = 984 (351.4 bits), Expect = 3.9e-99, P = 3.9e-99
 Identities = 180/311 (57%), Positives = 236/311 (75%)

Query:     1 MGSEN-LLSQSNKSVNLRLSNVSIYTAEVLEIEADDPKLHVLFVPGNPGVITFYKDFVQS 59
             M ++N L++++ + V  RL  VS    E++EI+A++P  HVLF+PGNPGV++FYKDF++S
Sbjct:     1 METQNKLMNETKRHVKSRLCRVSGSMTEMMEIQAENPTFHVLFIPGNPGVVSFYKDFLES 60

Query:    60 LYEHLXXXXXXXXXXXXXQTKKNYDHGRLFSLDEQVEHKMDFIRQELQNTEVPIVLVGHS 119
             LYE L              T K+++ GRLFS  EQ++HK+DFIRQEL++ +VPI+LVGHS
Sbjct:    61 LYEFLGGNASVIAIGQISHTSKDWESGRLFSFQEQIDHKIDFIRQELESVKVPIILVGHS 120

Query:   120 IGAYVALEMLKRSSEKVIYYIGLYPFLALIRPSVTQSIIGRVAASNIASTALSYIIASLG 179
             IG+Y++LE+LK+ S+KV+Y IGLYPFL L + S  QS+IG++AAS++ S   S++IASL 
Sbjct:   121 IGSYISLELLKKFSDKVVYCIGLYPFLTLNQQSTKQSLIGKLAASSVLSATASFLIASLR 180

Query:   180 ILPSKALRFLVSNSLGRSWSATAVEAACTHLSQYHVMRNVLFMTMTEFKQLKNTPDWAFM 239
             +LP  A R LVS S+G SWS TAV+A CTHL QYH MRNVLFM  +EF++L   PDW FM
Sbjct:   181 LLPMSAARLLVSKSIGASWSDTAVQATCTHLRQYHTMRNVLFMAKSEFRELAAEPDWDFM 240

Query:   240 RENQSKIAFLFGVDDHWGPQELYEEISEQVPDVPLAIERHGHTHNFCCSEAGSAWVASHV 299
             RENQSK+AFLFG+DDHWGP +L+EEIS+Q P   L+IER GHTH FCC+ AGSAWVA HV
Sbjct:   241 RENQSKLAFLFGIDDHWGPLQLFEEISKQAPHTSLSIEREGHTHGFCCTVAGSAWVAQHV 300

Query:   300 AGLIKNKIPSL 310
             A LIKN+   L
Sbjct:   301 ATLIKNRFSQL 311


>ZFIN|ZDB-GENE-081022-178 [details] [associations]
            symbol:zgc:195062 "zgc:195062" species:7955 "Danio
            rerio" [GO:0008150 "biological_process" evidence=ND] [GO:0005575
            "cellular_component" evidence=ND] ZFIN:ZDB-GENE-081022-178
            GeneTree:ENSGT00390000009688 InterPro:IPR019363 PANTHER:PTHR13390
            Pfam:PF10230 OMA:FCLANAA EMBL:BX546455 EMBL:BX072551
            IPI:IPI00901977 Ensembl:ENSDART00000115296 Bgee:F1QII3
            Uniprot:F1QII3
        Length = 370

 Score = 247 (92.0 bits), Expect = 5.2e-21, P = 5.2e-21
 Identities = 79/291 (27%), Positives = 136/291 (46%)

Query:    34 DDPKLHVLFVPGNPGVITFYKDFVQSLYEHLXXXXXXXXXXXXXQTK--KNYD------- 84
             + PK  +L +PGNPGV+ FYK ++ +LY+                    +++D       
Sbjct:    87 NSPKTLILVIPGNPGVVGFYKTYMWTLYQKFLQRYPVWAVSHAGHCMPPESFDMIEDASV 146

Query:    85 --HGRLFSLDEQVEHKMDFIRQEL-QNTEVPIVLVGHSIGAYVALEMLKRSSE-KVIYYI 140
                  +F LD Q+EHK+ F+R+ + Q T   ++L+GHSIG Y+ LEM+KR  E KV+  +
Sbjct:   147 TEKEDVFGLDGQIEHKLAFLRKHVPQGTN--LLLIGHSIGCYIILEMMKRDPELKVVKAV 204

Query:   141 GLYPFLALIRPSVTQSIIGRVAASNIASTALSYIIASLGILPSKALRFLVSNSLGRSWSA 200
              L+P +  +  S    ++  V      +  L   + SL  LP +    +V  +L   ++ 
Sbjct:   205 MLFPTIERMACSPQGKVMTPVLCRLRYAFYLPIFLLSL--LPERLKVGIVRLALRNLYAL 262

Query:   201 -TAVEAACTHLSQYHVMRNVLFMTMTEFKQLKNTPDWAFMRENQSKIAFLFGVDDHWGPQ 259
               ++  A   L       N ++M   E + +    D A + ++ SKI F +G  DHW P 
Sbjct:   263 DNSIIPATVSLINVDCAANGMYMGSQEMRLVVER-DNATIHQHLSKIIFYYGATDHWCPV 321

Query:   260 ELYEEISEQVPDVPLAIERHGHTHNFCCSEAGSAWVASHVAGLIKNKIPSL 310
             +   +I +  P+  + +   G  H F   +AG   +A   A  I + + +L
Sbjct:   322 KYCHDIRKDFPEGDIRLCERGIRHAFVL-DAGEE-MAKMTAEWISDDLRTL 370


>DICTYBASE|DDB_G0286581 [details] [associations]
            symbol:DDB_G0286581 "UPF0554 family protein"
            species:44689 "Dictyostelium discoideum" [GO:0008150
            "biological_process" evidence=ND] [GO:0005575 "cellular_component"
            evidence=ND] [GO:0003674 "molecular_function" evidence=ND]
            [GO:0044351 "macropinocytosis" evidence=RCA] dictyBase:DDB_G0286581
            EMBL:AAFI02000089 eggNOG:NOG149669 InterPro:IPR019363
            PANTHER:PTHR13390 Pfam:PF10230 RefSeq:XP_637543.1
            ProteinModelPortal:Q54LL8 PRIDE:Q54LL8 EnsemblProtists:DDB0305142
            GeneID:8625682 KEGG:ddi:DDB_G0286581 OMA:VNDGWCP Uniprot:Q54LL8
        Length = 304

 Score = 221 (82.9 bits), Expect = 2.8e-18, P = 2.8e-18
 Identities = 65/260 (25%), Positives = 130/260 (50%)

Query:    27 EVLEIEADDPK-LHVLFVPGNPGVITFYKDFVQSLYEHLXXXXXXXXXXXXXQTKKNYDH 85
             E++  ++  P  + ++ + GNPG+ +FY++FV+ L                    K    
Sbjct:    21 EIIYTKSQTPSNIKIIVIAGNPGIESFYQEFVKVLNLSFNSKYDIYGVGHIGHCGKI--E 78

Query:    86 GRLFSLDEQVEHKMDFIRQELQNT-------EVPIVLVGHSIGAYVALEMLKRSSEKVIY 138
              + FS++EQ++HK  F+   L+N        ++  +L+GHS+G+Y++L+++ R SEK  +
Sbjct:    79 NKTFSVEEQIKHKELFLEYLLKNKYGDKDRKDIKFILIGHSVGSYISLKVVSRFSEKFEF 138

Query:   139 Y--IGLYP-FLAL---IRPSVTQSIIGRVAASNIASTALSYIIASLGILPSKALRFLVSN 192
                + L+P F  L   + P + + ++ R +  N  ST L YI +   I+ S  L++++ +
Sbjct:   139 LSVVNLFPTFKNLYDGLSPFI-KMVVMRESTRNGLSTFLHYIPS---IVVSNVLKWILPS 194

Query:   193 SLGRSWSATAVEAACTHLSQYHVMRNVLFMTMTEFKQLKNTPD--WAFMRENQSKIAFLF 250
                R     AV++       Y+   N+L+M  TE + +K   D   +      +++ F++
Sbjct:   195 DESR----IAVQSKIN----YYSALNILYMAYTETEDIKEIDDECHSVFNSRLNQLLFIY 246

Query:   251 GVDDHWGPQELYEEISEQVP 270
             G  D + P+  Y+E+ +  P
Sbjct:   247 GQTDSYTPKSFYDEMKQLYP 266


>MGI|MGI:1916082 [details] [associations]
            symbol:1110057K04Rik "RIKEN cDNA 1110057K04 gene"
            species:10090 "Mus musculus" [GO:0003674 "molecular_function"
            evidence=ND] [GO:0005575 "cellular_component" evidence=ND]
            [GO:0008150 "biological_process" evidence=ND] MGI:MGI:1916082
            EMBL:AK076460 EMBL:AK079099 EMBL:AK162850 EMBL:BC046986
            EMBL:BC057311 IPI:IPI00225796 IPI:IPI00652152 IPI:IPI00856523
            IPI:IPI00857068 RefSeq:NP_001161239.1 RefSeq:NP_765989.3
            UniGene:Mm.25608 ProteinModelPortal:Q8BVA5 PhosphoSite:Q8BVA5
            PaxDb:Q8BVA5 PRIDE:Q8BVA5 DNASU:68832 Ensembl:ENSMUST00000037383
            Ensembl:ENSMUST00000169104 GeneID:68832 KEGG:mmu:68832
            UCSC:uc007mzg.2 UCSC:uc007mzj.2 UCSC:uc007mzk.2 eggNOG:NOG149669
            GeneTree:ENSGT00390000009688 HOGENOM:HOG000045737
            HOVERGEN:HBG107578 InParanoid:Q8BVA5 OrthoDB:EOG469QV3
            NextBio:328011 Bgee:Q8BVA5 CleanEx:MM_1110057K04RIK
            Genevestigator:Q8BVA5 InterPro:IPR019363 PANTHER:PTHR13390
            Pfam:PF10230 Uniprot:Q8BVA5
        Length = 326

 Score = 199 (75.1 bits), Expect = 2.9e-14, P = 2.9e-14
 Identities = 70/296 (23%), Positives = 128/296 (43%)

Query:    32 EADDPKLHVLFVPGNPGVITFYKDFVQSLY------------EHLXXXXXXXXXXXXXQT 79
             +   PK  +  +PGNPG   FY  F ++LY             H                
Sbjct:    39 DVSKPKQLIFIIPGNPGYSAFYVPFAKALYTLMKSRFPVWIISHAGFSVTPKDKKVLAAP 98

Query:    80 KKNYDHGRL---FSLDEQVEHKMDFIRQELQNTEVPIVLVGHSIGAYVALEMLKRSSE-K 135
             ++  +  ++   + L+ Q+EHK+ F+R  +   +V ++L+GHS+G Y+ L ++KR  E  
Sbjct:    99 QEESNAQKIEDVYGLNGQIEHKIAFLRAHVPK-DVKLILIGHSVGTYMTLHVMKRVLELP 157

Query:   136 VIYYIGLYPFLALIRPSVTQSIIGRVAASNIASTA-LSYIIASLGILPSKAL--RFLVSN 192
             V +   L+P +      +++S  G+ A   +     L Y  + L   P   +   F++  
Sbjct:   158 VAHAFLLFPTIE----RMSESPNGKFATPFLCQFRYLLYATSYLLFKPCPEVIKSFIIQK 213

Query:   193 SLGRSWSATAVEAACTHLSQYHVMRNVLFMTMTEFKQLKNTPDWAFMRENQSKIAFLFGV 252
              +G+      +E   T + Q   + N  ++   E  Q+    D   ++E   K+ F +G 
Sbjct:   214 LMGQM--NIKLELPLTDILQPFCLANAAYLGSQEMVQIVKRDD-DIIKEFLPKLKFYYGK 270

Query:   253 DDHWGPQELYEEISEQVPDVPLAIERHGHTHNFCCSEAGSAWVASHVAGLIKNKIP 308
              D W P + YE++ +  P+  + +   G  H F      S  +A+ VA  I N+ P
Sbjct:   271 TDGWCPVKYYEDMKKDFPEGNIYLCEKGIPHAFVLDF--SQEMATIVAEWINNRPP 324


>SGD|S000006351 [details] [associations]
            symbol:YPR147C "Putative protein of unknown function"
            species:4932 "Saccharomyces cerevisiae" [GO:0016020 "membrane"
            evidence=IEA] [GO:0005737 "cytoplasm" evidence=IDA] [GO:0016021
            "integral to membrane" evidence=IEA] [GO:0003674
            "molecular_function" evidence=ND] [GO:0008150 "biological_process"
            evidence=ND] PROSITE:PS00120 SGD:S000006351 GO:GO:0016021
            GO:GO:0005737 EMBL:BK006949 EMBL:U40829 eggNOG:NOG149669
            InterPro:IPR019363 PANTHER:PTHR13390 Pfam:PF10230 PIR:S69034
            RefSeq:NP_015473.1 ProteinModelPortal:Q06522 IntAct:Q06522
            MINT:MINT-4084071 STRING:Q06522 PaxDb:Q06522 PeptideAtlas:Q06522
            EnsemblFungi:YPR147C GeneID:856270 KEGG:sce:YPR147C CYGD:YPR147c
            HOGENOM:HOG000066014 OMA:CISHAGF OrthoDB:EOG480N64 NextBio:981574
            Genevestigator:Q06522 GermOnline:YPR147C Uniprot:Q06522
        Length = 304

 Score = 165 (63.1 bits), Expect = 4.3e-10, P = 4.3e-10
 Identities = 64/258 (24%), Positives = 119/258 (46%)

Query:    34 DDPKLHVLFVPGNPGVITFYKDFVQSLYEHLXXXX-XXXXXXXXXQTKKNYDHGRLFSLD 92
             D P L  +++PGNPG++ +Y++ +  L  HL               T   + +  +FSL 
Sbjct:    28 DAPLL--VWIPGNPGLLYYYQEMLHHL--HLKHPDWEILGISHAGMTLNAHSNTPIFSLQ 83

Query:    93 EQVEHKMDFIRQ-ELQNTEVPIVLVGHSIGAYVALEMLKRSSEKVIYYIGLYPFLALIRP 151
             +QV+H+++ I     +N +  I+++GHS+GAY+  ++    S K++   G    + L+ P
Sbjct:    84 DQVDHQVEVINNFSCKNRK--IIIMGHSVGAYIVQKVCL--SNKLV---GSVQKVGLVTP 136

Query:   152 SVTQ---SIIG-RVAASNIASTALSYIIASLG------ILPSKALRFLVSNSLG-RSWSA 200
             +V     S +G ++ A+      L+++++         IL     RF++   +G  S   
Sbjct:   137 TVMDIHTSEMGIKMTAALRYIPPLAHVVSLFSYIFFYWILSEGFSRFIIDKFMGCGSTGY 196

Query:   201 TAVEAACTHLSQYHVMRNVLFMTMTEFKQLKNTPDWAFM-------RENQSKIAFLFGVD 253
              AV +    L+    +R  L +   E +++  T +W F         EN   I FLF  +
Sbjct:   197 QAVLSTRIFLTHRQFVRQSLGLAAQEMEEI--TTNWEFQDRFINYCEENGISIWFLFSSN 254

Query:   254 DHWGPQELYEEISEQVPD 271
             DHW   +    +S+   D
Sbjct:   255 DHWVSGKTRSHLSDYYKD 272


>FB|FBgn0035206 [details] [associations]
            symbol:CG9186 species:7227 "Drosophila melanogaster"
            [GO:0005811 "lipid particle" evidence=IDA] EMBL:AE014296
            GO:GO:0005811 eggNOG:NOG149669 GeneTree:ENSGT00390000009688
            InterPro:IPR019363 PANTHER:PTHR13390 Pfam:PF10230 EMBL:AY061530
            RefSeq:NP_612097.1 RefSeq:NP_728590.1 UniGene:Dm.9415
            EnsemblMetazoa:FBtr0072727 EnsemblMetazoa:FBtr0072728 GeneID:38150
            KEGG:dme:Dmel_CG9186 UCSC:CG9186-RA FlyBase:FBgn0035206
            InParanoid:Q9W0H3 OMA:FCLANAA OrthoDB:EOG4PNVZS GenomeRNAi:38150
            NextBio:807207 Uniprot:Q9W0H3
        Length = 307

 Score = 163 (62.4 bits), Expect = 7.9e-10, P = 7.9e-10
 Identities = 69/290 (23%), Positives = 123/290 (42%)

Query:    14 VNLRLSNVSIYT-AEVLEIEADDPKLHVLFVPGNPGVITFYKDFVQSLYEHLX------- 65
             VN+      I+T    +E E    K  V+ + GNPG+  FY +F  +L + L        
Sbjct:     6 VNINSIPTHIFTWGRWIE-ETITEKEIVICITGNPGLPGFYTEFAGTLQKELGDLPVWVI 64

Query:    66 --XXXXXXXXXXXXQTKKNYDHGRLFSLDEQVEHKMDFIRQELQNTEVPIVLVGHSIGAY 123
                           +  +   +  LF+LD Q+ HK+ FI + + + +V I L+GHSIGA+
Sbjct:    65 GHAGHDDPPEASIREVPQLSGNEELFNLDGQIRHKIAFIEKYVPS-DVKIHLIGHSIGAW 123

Query:   124 VALEMLK--RSSEKVIYYIGLYPFLALIRPSVTQSIIGRVAASNIASTALSYIIASL-GI 180
             + L++L+  R   ++     L+P +  +  S    +  +VA      +   YI  S    
Sbjct:   124 MILQLLENERIRSRIQKCYMLFPTVERMMESPNGWVFTKVAMP--LYSVFGYIFFSFFNF 181

Query:   181 LPSKALRFLVSN-----SLGRSWSATAVEAACTHLSQYHVMRNVLFMTMTEFKQLKNTPD 235
             LP      L+       S+ R +  TA++      S+  V   V+F+   E  +++    
Sbjct:   182 LPVWLRLMLIQIYFLIFSIPRQFLGTALK-----YSKPSVAEKVVFLADDEMARVRGIQR 236

Query:   236 WAFMRENQSKIAFLFGVDDHWGPQELYEEISEQVPDVPLAIERHGHTHNF 285
                + +N   + F +G  D W P   Y+++ +  P V   ++     H F
Sbjct:   237 -EIVEQNLDLLKFYYGTTDGWVPISYYDQLKKDYPKVDAQLDTKKIDHAF 285


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.319   0.133   0.390    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      312       299   0.00095  115 3  11 22  0.38    34
                                                     33  0.45    37


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  6
  No. of states in DFA:  614 (65 KB)
  Total size of DFA:  211 KB (2118 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:03
  No. of threads or processors used:  24
  Search cpu time:  22.54u 0.06s 22.60t   Elapsed:  00:00:14
  Total cpu time:  22.54u 0.06s 22.60t   Elapsed:  00:00:17
  Start:  Thu May  9 21:01:22 2013   End:  Thu May  9 21:01:39 2013

Back to top