BLAST Results

Query Summary

Your job contains 1 sequence.

Parameters
Threshold: 0.001
Maximum number of alignments shown: 100
BLAST filter: on

Query Sequence

>018751
MRKGAKRKGAAAAAAAAAKASSSSQENHKEEDEQQKTSKAHRAKRVKASKPETEPEYFED
QRNMEDLWKEVFPVGTEWDQLDSVYQFKWNFSNLEDAFEEGGVLYGKKVYLFGCTEPQLV
PYENKNKIVCIPVVVAVVSPFPPSDKIGIKSVQREVEEIVPMKEMKMDWVPYIPLEKRDR
QVERLKSQIFILSCTQRRSALRHLKVDRLKKFEYCLPYFYQPFKEDEFEQSTVVQIMFPV
EPPVVCEFDWEFDEVDEFTDKLVEEEALAEDQKDAFKDFVKEKVREAKKANREAKEARRK
AIEEMSEETKAAFESMRFYKFYPVKTPDTPDVSNVKAPFINRYYGKAHEVL

High Scoring Gene Products

Symbol, full name          Information                           P value
HIT4, AT5G10010            protein from Arabidopsis thaliana     2.4e-106
AT5G64910                  protein from Arabidopsis thaliana     5.8e-80
DDB_G0294611, unknown      gene from Dictyostelium discoideum    1.4e-07


Raw Blast Data

BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]

Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.

Reference:  Gish, W. (1996-2006) http://blast.wustl.edu

Query=  018751
        (351 letters)

Database:  go_20130330-seqdb.fasta
           368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done

                                                                     Smallest
                                                                       Sum
                                                              High  Probability
Sequences producing High-scoring Segment Pairs:              Score  P(N)      N

TAIR|locus:2184053 - symbol:HIT4 "AT5G10010" species:3702...  1029  2.4e-106  2
TAIR|locus:2177669 - symbol:AT5G64910 "AT5G64910" species...   781  5.8e-80   2
DICTYBASE|DDB_G0294611 - symbol:DDB_G0294611 "unknown" sp...   131  1.4e-07   2
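
The summary lines above have a fixed tail: the last three whitespace-separated fields are the high score, the smallest sum probability P(N), and N. A minimal sketch of pulling those fields out of the text is given here; the function name `parse_hit_line` is a hypothetical helper introduced for illustration, not part of any BLAST tool, and the approach assumes the description never ends in three extra numeric-looking tokens.

```python
def parse_hit_line(line):
    """Split one summary line into (identifier, description, score, p_value, n).

    Assumption: the last three whitespace-separated fields are always
    Score, P(N) and N, as in the table header of this report.
    """
    text, score, p_value, n = line.rsplit(None, 3)
    # The database identifier is everything before the first " - ".
    identifier, _, description = text.partition(" - ")
    return identifier, description.strip(), int(score), float(p_value), int(n)


# The three summary lines from this report, copied verbatim.
summary_lines = [
    'TAIR|locus:2184053 - symbol:HIT4 "AT5G10010" species:3702...  1029  2.4e-106  2',
    'TAIR|locus:2177669 - symbol:AT5G64910 "AT5G64910" species...   781  5.8e-80   2',
    'DICTYBASE|DDB_G0294611 - symbol:DDB_G0294611 "unknown" sp...   131  1.4e-07   2',
]

hits = [parse_hit_line(line) for line in summary_lines]

for identifier, description, score, p_value, n in hits:
    print(f"{identifier}\t{score}\t{p_value:g}\t{n}")
```

Splitting from the right avoids having to describe the free-text description with a pattern, since that field is truncated with "..." and may contain spaces and quotes.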


>TAIR|locus:2184053
            symbol:HIT4 "AT5G10010" species:3702 "Arabidopsis
            thaliana" [GO:0003674 "molecular_function" evidence=ND] [GO:0005737
            "cytoplasm" evidence=ISM] [GO:0005730 "nucleolus" evidence=IDA]
            [GO:0010286 "heat acclimation" evidence=IMP] [GO:0010369
            "chromocenter" evidence=IDA] EMBL:CP002688
            GenomeReviews:BA000015_GR GO:GO:0005730 EMBL:BT029989
            IPI:IPI00547538 RefSeq:NP_196563.2 UniGene:At.26172 STRING:A2RVJ8
            PaxDb:A2RVJ8 PRIDE:A2RVJ8 DNASU:830863 EnsemblPlants:AT5G10010.1
            GeneID:830863 KEGG:ath:AT5G10010 TAIR:At5g10010 eggNOG:NOG328996
            HOGENOM:HOG000236366 InParanoid:A2RVJ8 OMA:KYEYCLP PhylomeDB:A2RVJ8
            ProtClustDB:CLSN2686437 Genevestigator:A2RVJ8 Uniprot:A2RVJ8
        Length = 434

 Score = 1029 (367.3 bits), Expect = 2.4e-106, Sum P(2) = 2.4e-106
 Identities = 193/323 (59%), Positives = 231/323 (71%)

Query:    29 KEEDEQQKTSKAHRAKRVKASKPETEPEYFEDQRNMEDLWKEVFPVGTEWDQLDSVYQFK 88
             K+ + + +     +AK+ +A+K + EP YFE++R++EDLWK  FPVGTEWDQLD++Y+F 
Sbjct:   112 KDTEIKDEKKPVPKAKKPRAAKVKEEPVYFEEKRSLEDLWKVAFPVGTEWDQLDALYEFN 171

Query:    89 WNFSNLEDAFEEGGVLYGKKVYLFGCTEPQLVPYENKNKIXXXXXXXXXXXXXXXSDKIG 148
             W+F NLE+A EEGG LYGKKVY+FGCTEPQLVPY+  NKI               SDKIG
Sbjct:   172 WDFQNLEEALEEGGKLYGKKVYVFGCTEPQLVPYKGANKIVHVPAVVVIESPFPPSDKIG 231

Query:   149 IKSVQREVEEIVPMKEMKMDWVPYIPLEKRDRQVERLKSQIFILSCTQRRSALRHLKVDR 208
             I SVQREVEEI+PMK+MKMDW+PYIP+EKRDRQV+++ SQIF L CTQRRSALRH+K D+
Sbjct:   232 ITSVQREVEEIIPMKKMKMDWLPYIPIEKRDRQVDKMNSQIFTLGCTQRRSALRHMKEDQ 291

Query:   209 LKKFEYCLPYFYQPFKEDEFEQSTVVQIMFPVEPPXXXXXXXXXXXXXXXTDKLVEEEAL 268
             LKKFEYCLPYFYQPFKEDE EQST VQIMFP EPP                DKLVEEEAL
Sbjct:   292 LKKFEYCLPYFYQPFKEDELEQSTEVQIMFPSEPPVVCEFDWEFDELQEFVDKLVEEEAL 351

Query:   269 AEDQKDAFKDFVXXXXXXXXXXXXXXXXXXXXXXXXMSEETKAAFESMRFYKFYPVKTPD 328
               +Q D FK++V                        MSE+TK AF+ M+FYKFYP  +PD
Sbjct:   352 PAEQADEFKEYVKEQVRAAKKANREAKDARKKAIEEMSEDTKQAFQKMKFYKFYPQPSPD 411

Query:   329 TPDVSNVKAPFINRYYGKAHEVL 351
             TPDVS V++PFINRYYGKAHEVL
Sbjct:   412 TPDVSGVQSPFINRYYGKAHEVL 434

 Score = 43 (20.2 bits), Expect = 2.4e-106, Sum P(2) = 2.4e-106
 Identities = 11/29 (37%), Positives = 18/29 (62%)

Query:    26 ENHKEEDEQQKTSKAHRAKRVKASKPETE 54
             E  KEE E++  ++    KR +A+K +TE
Sbjct:    88 EEVKEEVEKKPVAR-RGGKRKRATKKDTE 115


>TAIR|locus:2177669
            symbol:AT5G64910 "AT5G64910" species:3702 "Arabidopsis
            thaliana" [GO:0003674 "molecular_function" evidence=ND] [GO:0005634
            "nucleus" evidence=ISM] [GO:0008150 "biological_process"
            evidence=ND] EMBL:CP002688 GenomeReviews:BA000015_GR EMBL:AB019236
            UniGene:At.75373 UniGene:At.75593 HOGENOM:HOG000236366
            ProtClustDB:CLSN2686437 IPI:IPI00543690 RefSeq:NP_201296.1
            PaxDb:Q9LV86 PRIDE:Q9LV86 EnsemblPlants:AT5G64910.1 GeneID:836615
            KEGG:ath:AT5G64910 TAIR:At5g64910 eggNOG:NOG68829 InParanoid:Q9LV86
            OMA:ANENQEE PhylomeDB:Q9LV86 Genevestigator:Q9LV86 Uniprot:Q9LV86
        Length = 487

 Score = 781 (280.0 bits), Expect = 5.8e-80, Sum P(2) = 5.8e-80
 Identities = 151/306 (49%), Positives = 194/306 (63%)

Query:    32 DEQQKTSKAHRAKRVKASKPE-TEPEYFEDQRNMEDLWKEVFPVGTEWDQLDSVYQFKWN 90
             + ++K S     KR K +K + +EPEYFE++RN+EDLWK  F VGTEWDQ D++ +F W+
Sbjct:   153 EAEKKVSTPRAKKRAKTTKAQASEPEYFEEKRNLEDLWKATFSVGTEWDQQDALNEFNWD 212

Query:    91 FSNLEDAFEEGGVLYGKKVYLFGCTEPQLVPYENKNKIXXXXXXXXXXXXXXXSDKIGIK 150
             F+NLE+A EEGG LYGK+VY+FGCTE   V Y+++NK                SD+IG+ 
Sbjct:   213 FTNLEEALEEGGELYGKQVYVFGCTESHSVTYKDENKDVLVPVVVCIDSPIPPSDEIGVA 272

Query:   151 SVQREVEEIVPMKEMKMDWVPYIPLEKRDRQVERLKSQIFILSCTQRRSALRHLKVDRLK 210
             SVQ EV EI+ MK MKM WVPYIPLE+RDRQV+     IFIL CTQRRSAL+HL  DR+K
Sbjct:   273 SVQGEVGEIIAMKTMKMAWVPYIPLEQRDRQVDNKNFPIFILGCTQRRSALKHLPDDRVK 332

Query:   211 KFEYCLPYFYQPFKEDEFEQSTVVQIMFPVEPPXXXXXXXXXXXXXXXTDKLVEEEALAE 270
             KF YCLPY   P+K D+ E+STVV+IMFP EPP               TD L+ EE L  
Sbjct:   333 KFNYCLPYINNPYKVDDSEKSTVVKIMFPSEPPVECEYDWVKSVIEEFTDSLINEEVLLP 392

Query:   271 DQKDAFKDFVXXXXXXXXXXXXXXXXXXXXXXXXMSEETKAAFESMRFYKFYPVKTPDTP 330
             +QK AF++FV                        +SEETK A++ MR YKFYP+ +PDTP
Sbjct:   393 EQKVAFEEFVKEKSDKAMAAYDTAQEALEKAKEGLSEETKKAYQEMRLYKFYPLPSPDTP 452

Query:   331 DVSNVK 336
               + ++
Sbjct:   453 HTAGIE 458

 Score = 41 (19.5 bits), Expect = 5.8e-80, Sum P(2) = 5.8e-80
 Identities = 11/40 (27%), Positives = 19/40 (47%)

Query:    25 QENHKEEDEQQKTSKAHRAKRVK--ASKPETEPEYFEDQR 62
             +E  KE+ E++K   A   K  +  A KP+      E+ +
Sbjct:    93 EEEAKEDKEEEKEEAAREDKEEEEEAVKPDESASQKEEAK 132

 Score = 38 (18.4 bits), Expect = 1.2e-79, Sum P(2) = 1.2e-79
 Identities = 9/38 (23%), Positives = 18/38 (47%)

Query:    25 QENHKEEDEQQKTSKAHRAKRVKASKPETEPEYFEDQR 62
             +E+ +EE+E  K  ++   K        +EP+    +R
Sbjct:   109 REDKEEEEEAVKPDESASQKEEAKGASSSEPQLRRGKR 146


>DICTYBASE|DDB_G0294611
            symbol:DDB_G0294611 "unknown" species:44689
            "Dictyostelium discoideum" [GO:0005575 "cellular_component"
            evidence=ND] [GO:0003674 "molecular_function" evidence=ND]
            [GO:0044351 "macropinocytosis" evidence=RCA] dictyBase:DDB_G0294611
            EMBL:AAFI02000141 InterPro:IPR017859 PRINTS:PR01503
            RefSeq:XP_001733029.1 PRIDE:B0G167 EnsemblProtists:DDB0233684
            GeneID:8627160 KEGG:ddi:DDB_G0294611 OMA:GFEFNTW Uniprot:B0G167
        Length = 611

 Score = 131 (51.2 bits), Expect = 1.4e-07, Sum P(2) = 1.4e-07
 Identities = 43/174 (24%), Positives = 74/174 (42%)

Query:    68 WKEVFPVGTEWDQLDSVYQFKWNFSNLEDAFEEGGVLYG---KKVYLFGCTEPQLVPYEN 124
             WKE+ P+G E++  +S+ +  W F  L+  F+ G + +    K +Y+F   +P +V  E 
Sbjct:     3 WKELTPIGFEFNTWESMKKEGWTFPELKREFKRGVLSHAQTDKPLYMFLGAQP-IV--EG 59

Query:   125 KNKIXXXXXXXXXXXXXXXSDKIGIKSVQREVEEIVPMKEMKMDWVPYIPLEKRDRQVER 184
                                S KI   S+Q   E+I    +  + W PYIP  +   Q   
Sbjct:    60 DYAFNMPYIVVFDCPSPPPS-KICKASIQGGSEDIYNFSDFHLSWSPYIP-SRYSNQASN 117

Query:   185 LKSQIFILSCTQRRSALRHLKVDRLKKFEYCLPYFYQPFKEDEFEQSTVVQIMF 238
              K +IF L+  +R    + +  ++    +Y LPY   P     F  + +  + F
Sbjct:   118 KKYKIFTLNLQERPG--KKISEEKQLNIQYLLPYILIPKIFKTFSVTPITNVQF 169

 Score = 59 (25.8 bits), Expect = 1.4e-07, Sum P(2) = 1.4e-07
 Identities = 15/87 (17%), Positives = 31/87 (35%)

Query:   261 KLVEEEALAEDQKDAFKDFVXXXXXXXXXXXXXXXXXXXXXXXXMSEETKAAFESMRFYK 320
             + VE+  L E+  +     +                         S +    +++ + YK
Sbjct:   257 EFVEDNGLVEEMTEVIDKAIRDEFVKARAVAQKKYDDLKAEIDSYSPKKAEDYDNCKIYK 316

Query:   321 FYPVKTPDTPDVSNVKAPFINRYYGKA 347
             FYP       ++    +P +NR++G A
Sbjct:   317 FYP--RHKKYNIQKYVSPAVNRFFGNA 341


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.316   0.134   0.395    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      351       274   0.00078  115 3  11 22  0.38    34
                                                     33  0.40    37
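
The bit scores printed next to each raw score in this report appear to follow the standard Karlin-Altschul conversion, bits = (Lambda * S - ln K) / ln 2, when the "As Used" values from the Q=9,R=2 row (Lambda = 0.244, K = 0.0300) are plugged in. The sketch below checks that reading against the scores listed above; the function name `bit_score` is a hypothetical helper, and the choice of the Q=9,R=2 row is an inference from the numbers in this report rather than something the output states.

```python
import math


def bit_score(raw_score, lam=0.244, k=0.0300):
    """Convert a raw alignment score to bits: (lambda * S - ln K) / ln 2.

    Defaults are the "As Used" Lambda and K from the Q=9,R=2 row of this
    report (an assumption about which row applies).
    """
    return (lam * raw_score - math.log(k)) / math.log(2)


# Raw scores and the bit scores printed beside them in this report.
reported = {1029: 367.3, 781: 280.0, 131: 51.2, 59: 25.8, 43: 20.2}

for raw, bits in reported.items():
    print(f"S = {raw:4d}  computed = {bit_score(raw):6.1f}  reported = {bits}")
```

Using the BLOSUM62 row instead (Lambda = 0.316, K = 0.134) gives roughly 472 bits for S = 1029, which does not match the 367.3 bits shown, so the Q=9,R=2 row is the one consistent with this output.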


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  3
  No. of states in DFA:  611 (65 KB)
  Total size of DFA:  217 KB (2119 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  23.46u 0.11s 23.57t   Elapsed:  00:00:01
  Total cpu time:  23.46u 0.11s 23.57t   Elapsed:  00:00:01
  Start:  Tue May 21 01:23:58 2013   End:  Tue May 21 01:23:59 2013