BLAST Results

Query Summary

Your job contains 1 sequence.

Parameters
Threshold: 0.001
Maximum number of alignments shown: 100
BLAST filter: on

Query Sequence

>005238
MDSSRRAVESYWRSRMIDGATSDEDKVTPVYKLEEICELLRSSHVSIGKEVSDFILKRLE
HKSPVVKQKALRLIKYSVGKSGTDFRREMQRHSAVVRQLFHYKGQLDPLKGDALNKAVRD
MAHEAISAIFSEENKPAPVEDLSLNKRIQGFGNTNFEMPSEDKKSFLSEVVGLGSASIKQ
GLSSFAQGNSMRKNENGSYRSPNLRRSLTIENDYSDRYEPVELRNETQGGYDISKNVAGG
SWNQDSRVLKEDRLNGDSSASYTGSKTREEKLLETIVTYGGVRLQPTRDAIQVFLVEAAK
LDALAMSRALEAKLQSPLWQVRMKAICVLESILRKKDDEKFSIILSYFCENNDVVVKCSE
SPQSSLREKANKVLSLLGEEQAGGLVSGSERSVKAETTTVVQMPDLIDTADPEDHSETNN
YATNPSDQNISNLSTSSTPLIDDLFTDSLGTGANNSEQKNADDPFADVLFHTSEGKEHVE
DLFSGMTVDSKPVASGNLLAADKSGSEPFDDIFGSHTEILPKQENQKNNFNDLMAGFSIN
EDQLKPEGSSAGVPSESIFSDSSSNPSQQLSSDALSSLLGSQSAGMNANPFPFGTMPYNI
PAGMTLNPSIASQPMNYSAMGNLFAQQQFLAAMSNLQHIGNLNVHNSGAANLVGGNGGSP
LPDIFQPNFPTQASMPALNNSKKEDTRAFDFISDHLASARDSKRVA

High Scoring Gene Products

Symbol, full name                               Information                         P value
AT3G16270                                       protein from Arabidopsis thaliana   7.5e-192
Enthd2, ENTH domain containing 2                gene from Rattus norvegicus         2.7e-08
Enthd2, ENTH domain containing 2                protein from Mus musculus           2.8e-08
ENTHD2, AP-4 complex accessory subunit tepsin   protein from Homo sapiens           3.4e-07

Raw Blast Data

BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]

Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.

Reference:  Gish, W. (1996-2006) http://blast.wustl.edu

Query=  005238
        (706 letters)

Database:  go_20130330-seqdb.fasta
           368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done

                                                                     Smallest
                                                                       Sum
                                                              High  Probability
Sequences producing High-scoring Segment Pairs:              Score  P(N)      N

TAIR|locus:2094887 - symbol:AT3G16270 species:3702 "Arabi...  1859  7.5e-192  1
RGD|1307410 - symbol:Enthd2 "ENTH domain containing 2" sp...   159  2.7e-08   2
MGI|MGI:1926027 - symbol:Enthd2 "ENTH domain containing 2...   159  2.8e-08   2
UNIPROTKB|Q96N21 - symbol:ENTHD2 "AP-4 complex accessory ...   147  3.4e-07   2
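Note on the P(N) column: each alignment in this report gives an Expect value next to the probability, and the two are printed as identical. That is what the usual Poisson relation P = 1 - exp(-E) predicts when E is very small. The sketch below assumes that relation holds for this report; the function name is illustrative and not part of any BLAST tool.

```python
import math


def p_from_expect(e_value: float) -> float:
    """Probability of at least one chance hit, given an expected count E (Poisson)."""
    # expm1 avoids the precision loss of 1 - exp(-E) when E is tiny.
    return -math.expm1(-e_value)


# Expect values copied from the four hits in this report.
for e in (7.5e-192, 2.7e-08, 2.8e-08, 3.4e-07):
    print(f"E = {e:.1e}  ->  P = {p_from_expect(e):.1e}")
```

For values this small the two quantities agree to the printed precision; they only diverge noticeably as E approaches 1.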


>TAIR|locus:2094887
            symbol:AT3G16270 species:3702 "Arabidopsis thaliana"
            [GO:0005634 "nucleus" evidence=ISM] [GO:0006886 "intracellular
            protein transport" evidence=IEA] [GO:0016020 "membrane"
            evidence=IDA] [GO:0000902 "cell morphogenesis" evidence=RCA]
            [GO:0006623 "protein targeting to vacuole" evidence=RCA]
            [GO:0016049 "cell growth" evidence=RCA] [GO:0016192
            "vesicle-mediated transport" evidence=RCA] [GO:0048193 "Golgi
            vesicle transport" evidence=RCA] InterPro:IPR002014 PROSITE:PS50179
            Pfam:PF01417 EMBL:CP002686 GO:GO:0006886 GO:GO:0016020
            EMBL:AB023046 EMBL:AC001645 GO:GO:0005622 Gene3D:1.25.40.90
            InterPro:IPR008942 SUPFAM:SSF48464 eggNOG:NOG74274
            InterPro:IPR001026 InterPro:IPR018205 SMART:SM00288 EMBL:AF360245
            EMBL:AY040043 IPI:IPI00536983 RefSeq:NP_566540.1 UniGene:At.28143
            UniGene:At.48135 PDB:1VDY PDB:2DCP PDBsum:1VDY PDBsum:2DCP
            ProteinModelPortal:Q9C5H4 SMR:Q9C5H4 IntAct:Q9C5H4 PaxDb:Q9C5H4
            PRIDE:Q9C5H4 EnsemblPlants:AT3G16270.1 GeneID:820873
            KEGG:ath:AT3G16270 TAIR:At3g16270 InParanoid:Q9C5H4 OMA:YWRSRMI
            PhylomeDB:Q9C5H4 ProtClustDB:CLSN2688441 EvolutionaryTrace:Q9C5H4
            Genevestigator:Q9C5H4 Uniprot:Q9C5H4
        Length = 690

 Score = 1859 (659.5 bits), Expect = 7.5e-192, P = 7.5e-192
 Identities = 404/711 (56%), Positives = 485/711 (68%)

Query:     1 MDSSRRAVESYWRSRMIDGATSDEDKVTPVYKLEEICELLRSSHVSIGKEVSDFILKRLE 60
             MD+SRRAVESYWRSRMID  TSDEDKV PVYKLEEIC+LLRSSHVSI KE S+FILKRL+
Sbjct:     1 MDTSRRAVESYWRSRMIDAVTSDEDKVAPVYKLEEICDLLRSSHVSIVKEFSEFILKRLD 60

Query:    61 HKSPVVKQKALRLIKYSVGKSGTDFRREMQRHSAVVRQLFHYKGQLDPLKGDALNKAVRD 120
             +KSP+VKQKALRLIKY+VGKSG++FRREMQR+S  VR LFHYKG  DPLKGDALNKAVR+
Sbjct:    61 NKSPIVKQKALRLIKYAVGKSGSEFRREMQRNSVAVRNLFHYKGHPDPLKGDALNKAVRE 120

Query:   121 MAHEAISAIFSEENKPAPVEDLSLNKRIQGFGNTNFEMPSEDKKSFLSEVVGLGSASIKQ 180
              AHE ISAIFSEEN   P    S+N+RI+GFGNTNF++PS D KSFLSEVVG+GSASIKQ
Sbjct:   121 TAHETISAIFSEENGTKPAAPESINRRIEGFGNTNFQVPSNDNKSFLSEVVGIGSASIKQ 180

Query:   181 GLSSFAQGNSMRKNENGSYRSPNLRRSLTIENDYSDRYEPVELRNETQGGYDISKNVAGG 240
             G+S+FAQG+  +KNENGS          ++  +  +      ++    G Y  SKN  GG
Sbjct:   181 GISNFAQGHLPKKNENGSSSYRGPNLHRSLTMENENFSRYDPVKLGKDGNYGTSKNTTGG 240

Query:   241 SWNQDSRVLKEDRLNGDSSASY-TGSKTREEKLLETIVTYGGVRLQPTRDAIQVFLVEAA 299
             SW   S    E      SSAS    SKTREEKLLETIVT GGVRLQPTRDA+ VF++EAA
Sbjct:   241 SWGHASGEASE------SSASVRVESKTREEKLLETIVTSGGVRLQPTRDALHVFILEAA 294

Query:   300 KLDALAMSRALEAKLQSPLWQVRMKAICVLESILRKKDDEKFSIILSYFCENNDVVVKCS 359
             K+DA+A+S AL+ KL SP+WQVRMKA+CVLE+ILRKK+DE FSI+ +YF EN D + +C+
Sbjct:   295 KMDAVALSIALDGKLHSPMWQVRMKALCVLEAILRKKEDENFSIVHTYFSENLDAIQRCA 354

Query:   360 ESPQSSLREKANKVLSLLGEEQAGGLVSGSERSVKAETTTVVQMPDLIDTADPEDHSETN 419
             ESPQSSLREKANKVLSLL   Q+ GL+S S+ +VK E    V +PDLIDT D +D    N
Sbjct:   355 ESPQSSLREKANKVLSLLNGGQSSGLMSSSDNTVKREAA--VDLPDLIDTGDSDD--TLN 410

Query:   420 NYATNPSDQNISNLSTSSTPLIDDLFTDSLGTGANNSEQKNADDPFADVLFHTSEGKEHV 479
             N   N  D   S ++T+   + DD F DS   G ++SE+K  DDPFADV FH +E KE  
Sbjct:   411 NL--NAIDTG-STVATAGPLMDDDWFGDSSDIGLSSSEKKTDDDPFADVSFHPNEEKESA 467

Query:   480 EDLFSGMTVDSKPVASGNLLAADKSGSEPFDDIFGSHTEILPKQENQKNNFNDLMAGFSI 539
             +DLFSGMTV  K  A G     D      FD +FGS  ++  + ++ KN  NDLM  FSI
Sbjct:   468 DDLFSGMTVGEKSAAVGGNHVPDL-----FD-MFGSTAKLEAEPKDAKN-INDLMGSFSI 520

Query:   540 NEDQLKPEGSSAG-VPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXMNANPFPFGTMPY 598
             +E+    +GSS+  +P                                N    P G MP+
Sbjct:   521 DENNSNQKGSSSSTLPQDLFAMPSTTSHQAPENPVGGILGSQNPGFIQNTM-LPGGVMPF 579

Query:   599 NIPAGMTLNPSIASQPMNYSAMGNLFAQQQ-FLAAMSNLQHIGNLNVHNSGAANLVGGNG 657
             N P GM +NP+ ASQP+NY+AM +L AQQQ +L  MSN Q  GNLN   SG    +G +G
Sbjct:   580 NFPQGMMMNPAFASQPLNYAAMASLLAQQQQYLGNMSNFQQFGNLNAQGSGNVLSMGTSG 639

Query:   658 G--SPLPDIFQPNFPTQASMPALNNSKKEDTRAFDFISDHLASARDSKRVA 706
             G  S LPDIFQPNF  QA    +N SKKEDTRAFDFISDHL SARD+KRV+
Sbjct:   640 GNQSALPDIFQPNFGNQAPTSTMNGSKKEDTRAFDFISDHLTSARDTKRVS 690


>RGD|1307410
            symbol:Enthd2 "ENTH domain containing 2" species:10116 "Rattus
            norvegicus" [GO:0005829 "cytosol" evidence=IEA] [GO:0016023
            "cytoplasmic membrane-bounded vesicle" evidence=IEA] Pfam:PF01417
            RGD:1307410 GO:GO:0005829 GO:GO:0016023 Gene3D:1.25.40.90
            InterPro:IPR008942 SUPFAM:SSF48464 PROSITE:PS50942 CTD:146705
            OMA:SHESPGS InterPro:IPR001026 GeneTree:ENSGT00390000015076
            EMBL:CH473948 EMBL:BC089956 RefSeq:XP_001081797.1
            RefSeq:XP_340947.3 UniGene:Rn.154667 Ensembl:ENSRNOT00000038993
            GeneID:360673 KEGG:rno:360673 NextBio:673712 Uniprot:G3V8Y7
        Length = 569

 Score = 159 (61.0 bits), Expect = 2.7e-08, Sum P(2) = 2.7e-08
 Identities = 40/120 (33%), Positives = 67/120 (55%)

Query:    21 TSDEDKVTPVYKLEEICELLRSSHVSIGKE--VSDFILKRLEHKSPVVKQKALRLIKYSV 78
             TSD+D   P Y  EEI ++   SH S+G    + +++L RL+  S  VK K L+++ Y  
Sbjct:    24 TSDDDIPCPGYLFEEIAKI---SHESLGSSQCLLEYLLNRLDSSSGHVKLKVLKILLYLC 80

Query:    79 GKSGTDFRREMQRHSAVVRQLFHYKGQLDPLKGDALNKAVRDMAHEAISAIFSEENKPAP 138
                 + F   ++R+SA++++   + G  DPL G++L + VR  A +  S +FS+   P P
Sbjct:    81 SHGSSSFMLILRRNSALIQEATAFAGPPDPLHGNSLYQKVRAAAQDLGSTLFSDA-LPQP 139

 Score = 49 (22.3 bits), Expect = 2.7e-08, Sum P(2) = 2.7e-08
 Identities = 17/64 (26%), Positives = 31/64 (48%)

Query:   268 REEKLLETIVTYGGVRLQPTRDAIQVFLVEAAKLDALAMSRALEAKLQSPLWQVRMKAIC 327
             +E  L+ T+    G R+  +R+  Q F+ E   L+  A+   L  +L       +M+A+C
Sbjct:   312 QELNLVRTVTQ--GPRVFLSREETQHFIKECGLLNCEAVLELLLQQLVGTSECEQMRALC 369

Query:   328 VLES 331
              + S
Sbjct:   370 AIAS 373


>MGI|MGI:1926027
            symbol:Enthd2 "ENTH domain containing 2" species:10090 "Mus
            musculus" [GO:0003674 "molecular_function" evidence=ND] [GO:0005575
            "cellular_component" evidence=ND] [GO:0005737 "cytoplasm"
            evidence=IEA] [GO:0008150 "biological_process" evidence=ND]
            [GO:0031410 "cytoplasmic vesicle" evidence=IEA] MGI:MGI:1926027
            Pfam:PF01417 GO:GO:0005829 GO:GO:0016023 Gene3D:1.25.40.90
            InterPro:IPR008942 SUPFAM:SSF48464 PROSITE:PS50942 CTD:146705
            eggNOG:NOG74274 HOGENOM:HOG000049246 HOVERGEN:HBG064281 OMA:SHESPGS
            OrthoDB:EOG4CRM0G InterPro:IPR001026 EMBL:AK019105 EMBL:AK154660
            EMBL:AL807824 IPI:IPI00224240 RefSeq:NP_898960.2 UniGene:Mm.23672
            ProteinModelPortal:Q3U3N6 SMR:Q3U3N6 STRING:Q3U3N6
            PhosphoSite:Q3U3N6 PaxDb:Q3U3N6 PRIDE:Q3U3N6
            Ensembl:ENSMUST00000093901 GeneID:78777 KEGG:mmu:78777
            GeneTree:ENSGT00390000015076 InParanoid:Q3U3N6 NextBio:349474
            Bgee:Q3U3N6 CleanEx:MM_2410002I01RIK Genevestigator:Q3U3N6
            Uniprot:Q3U3N6
        Length = 573

 Score = 159 (61.0 bits), Expect = 2.8e-08, Sum P(2) = 2.8e-08
 Identities = 40/120 (33%), Positives = 67/120 (55%)

Query:    21 TSDEDKVTPVYKLEEICELLRSSHVSIGKE--VSDFILKRLEHKSPVVKQKALRLIKYSV 78
             TSD+    P Y  EEI ++   SH S+G    + +++L RL+  S  VK K L+++ Y  
Sbjct:    24 TSDDSIPCPGYLFEEIAKI---SHESLGSSQCLLEYLLNRLDSSSGHVKLKVLKILLYLC 80

Query:    79 GKSGTDFRREMQRHSAVVRQLFHYKGQLDPLKGDALNKAVRDMAHEAISAIFSEENKPAP 138
             G   + F   ++R+SA++++   + G  DPL G++L + VR  A +  S +FS+   P P
Sbjct:    81 GHGSSSFLLILRRNSALIQEATAFSGPPDPLHGNSLYQKVRAAAQDLGSTLFSDA-VPQP 139

 Score = 49 (22.3 bits), Expect = 2.8e-08, Sum P(2) = 2.8e-08
 Identities = 17/64 (26%), Positives = 31/64 (48%)

Query:   268 REEKLLETIVTYGGVRLQPTRDAIQVFLVEAAKLDALAMSRALEAKLQSPLWQVRMKAIC 327
             +E  L+ T+    G R+  +R+  Q F+ E   L+  A+   L  +L       +M+A+C
Sbjct:   312 QELNLVRTVTQ--GPRVFLSREETQHFIKECGLLNCEAVLELLLRQLVGTSECEQMRALC 369

Query:   328 VLES 331
              + S
Sbjct:   370 AIAS 373


>UNIPROTKB|Q96N21
            symbol:ENTHD2 "AP-4 complex accessory subunit tepsin"
            species:9606 "Homo sapiens" [GO:0005829 "cytosol" evidence=IEA]
            [GO:0016023 "cytoplasmic membrane-bounded vesicle" evidence=IEA]
            Pfam:PF01417 GO:GO:0005829 GO:GO:0016023 Gene3D:1.25.40.90
            InterPro:IPR008942 SUPFAM:SSF48464 PROSITE:PS50942 EMBL:AK056090
            EMBL:AK127221 EMBL:AK128728 EMBL:BC064483 IPI:IPI00043327
            IPI:IPI00444617 RefSeq:NP_653280.1 UniGene:Hs.631761
            ProteinModelPortal:Q96N21 SMR:Q96N21 STRING:Q96N21
            PhosphoSite:Q96N21 DMDM:74732479 PaxDb:Q96N21 PRIDE:Q96N21
            Ensembl:ENST00000300714 Ensembl:ENST00000374769 GeneID:146705
            KEGG:hsa:146705 UCSC:uc002jzs.2 UCSC:uc002jzu.2 CTD:146705
            H-InvDB:HIX0014244 H-InvDB:HIX0014245 HGNC:HGNC:26458
            neXtProt:NX_Q96N21 PharmGKB:PA142672239 eggNOG:NOG74274
            HOGENOM:HOG000049246 HOVERGEN:HBG064281 InParanoid:Q96N21
            OMA:SHESPGS OrthoDB:EOG4CRM0G PhylomeDB:Q96N21 ChiTaRS:C17orf56
            GenomeRNAi:146705 NextBio:85420 Bgee:Q96N21 CleanEx:HS_C17orf56
            Genevestigator:Q96N21 InterPro:IPR001026 Uniprot:Q96N21
        Length = 525

 Score = 147 (56.8 bits), Expect = 3.4e-07, Sum P(2) = 3.4e-07
 Identities = 64/239 (26%), Positives = 110/239 (46%)

Query:    21 TSDEDKVTPVYKLEEICELLRSSHVSIGKE--VSDFILKRLEHKSPVVKQKALRLIKYSV 78
             TSD+D   P Y  EEI ++   SH S G    + +++L RL   S   K K L+++ Y  
Sbjct:    24 TSDDDVPCPGYLFEEIAKI---SHESPGSSQCLLEYLLSRLHSSSGHGKLKVLKILLYLC 80

Query:    79 GKSGTDFRREMQRHSAVVRQLFHYKGQLDPLKGDALNKAVRDMAHEAISAIFSEENKP-A 137
                 + F   ++R+SA +++   + G  DPL G++L + VR  A +  S +FS+   P A
Sbjct:    81 SHGSSFFLLILKRNSAFIQEAAAFAGPPDPLHGNSLYQKVRAAAQDLGSTLFSDTVLPLA 140

Query:   138 PVEDLSL------------NKRIQGFGNTNFEMPSEDKKSFLSEVVGLGSASIKQGLSS- 184
             P + L              +  +QGFG +  E      +    +  G G   +  G SS 
Sbjct:   141 PSQPLGTPPATGMGSQARPHSTLQGFGYSK-EHGRTAVRHQPGQAGG-GWDELDSGPSSQ 198

Query:   185 -FAQGNSM-RKNENGSYRSPNLRRSLTIE-NDYSDRYEPVELRNETQGGYDISKNVAGG 240
               +Q + + R +++GS+   +     + E  D ++R E V L ++ Q    + + V  G
Sbjct:   199 NSSQNSDLSRVSDSGSHSGSDSHSGASREPGDLAERVEVVAL-SDCQQELSLVRTVTRG 256

 Score = 50 (22.7 bits), Expect = 3.4e-07, Sum P(2) = 3.4e-07
 Identities = 32/146 (21%), Positives = 57/146 (39%)

Query:   268 REEKLLETIVTYGGVRLQPTRDAIQVFLVEAAKLDALAMSRALEAKLQSPLWQVRMKAIC 327
             +E  L+ T+    G R   +R+  Q F+     L+  A+ + L   L+      +++A+C
Sbjct:   245 QELSLVRTVTR--GPRAFLSREEAQHFIKACGLLNCEAVLQLLTCHLRGTSECTQLRALC 302

Query:   328 VLESILRKKDDEKFSIILSYFCENNDVVVKCSESPQSSLREKANKVLSLLGEEQAGGLVS 387
              + S+       +  I+L         + + S      +  KA K+L    E   G L  
Sbjct:   303 AIASLGSSDLLPQEHILL----RTRPWLQELSMGSPGPVTNKATKILRHF-EASCGQLSP 357

Query:   388 GSERSVKAETTTVVQMP-DLIDTADP 412
                 S +   T  +  P DL+  A P
Sbjct:   358 ARGTSAEPGPTAALPGPSDLLTDAVP 383


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.311   0.128   0.358    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      706       675    0.0010  120 3  11 23  0.48    34
                                                     36  0.45    37
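Note on bit scores: the bit values printed beside each raw score can be reproduced with the standard normalization S' = (Lambda * S - ln K) / ln 2, using the Lambda and K from the Q=9,R=2 row of the table above (0.244 and 0.0300). That these gapped parameters are the ones used is an inference from the numbers in this report, not a statement from the program. A minimal sketch, with illustrative names:

```python
import math

# Values from the "Q=9,R=2" row of the Parameters table in this report.
GAPPED_LAMBDA = 0.244
GAPPED_K = 0.0300


def bit_score(raw_score: float, lam: float = GAPPED_LAMBDA, k: float = GAPPED_K) -> float:
    """Convert a raw alignment score to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


# Raw scores copied from the Score lines of the alignments in this report.
for raw in (1859, 159, 147, 50, 49):
    print(f"Score = {raw:5d}  ->  {bit_score(raw):.1f} bits")
```

The results can be checked against the "Score = ... (... bits)" lines in the alignments. The BLOSUM62 row values (0.311, 0.128) do not reproduce the printed bit scores.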


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  4
  No. of states in DFA:  609 (65 KB)
  Total size of DFA:  325 KB (2166 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  72.04u 0.09s 72.13t   Elapsed:  00:00:04
  Total cpu time:  72.04u 0.09s 72.13t   Elapsed:  00:00:04
  Start:  Tue May 21 03:26:24 2013   End:  Tue May 21 03:26:28 2013
