BLAST Results

Query Summary

Your job contains 1 sequence.

Parameters
Threshold: 0.001
Maximum number of alignments shown: 100
BLAST filter: on

Query Sequence

>041861
MIRLVYNSFLADFHLCYGYWRKACCALLTRVVEVFEQSMQSATYSSDVWFHYCNLASEVP
PDGHRKSEYQEEK

High Scoring Gene Products

Symbol, full name Information P value
PRP39-2 protein from Arabidopsis thaliana 3.8e-09
prpf39
pre-mRNA processing factor 39
gene from Dictyostelium discoideum 2.1e-05
CG1646 protein from Drosophila melanogaster 0.00041

Back to top

Raw Blast Data

BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]

Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.

Reference:  Gish, W. (1996-2006) http://blast.wustl.edu

Query=  041861
        (73 letters)

Database:  go_20130330-seqdb.fasta
           368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done

                                                                     Smallest
                                                                       Sum
                                                              High  Probability
Sequences producing High-scoring Segment Pairs:              Score  P(N)      N

TAIR|locus:2170453 - symbol:PRP39-2 species:3702 "Arabido...   148  3.8e-09   1
DICTYBASE|DDB_G0283307 - symbol:prpf39 "pre-mRNA processi...   111  2.1e-05   1
ASPGD|ASPL0000046692 - symbol:AN1635 species:162425 "Emer...   102  0.00015   1
FB|FBgn0039600 - symbol:CG1646 species:7227 "Drosophila m...   101  0.00041   1


>TAIR|locus:2170453 [details] [associations]
            symbol:PRP39-2 species:3702 "Arabidopsis thaliana"
            [GO:0005622 "intracellular" evidence=IEA] [GO:0005634 "nucleus"
            evidence=ISM] [GO:0006396 "RNA processing" evidence=IEA]
            InterPro:IPR003107 SMART:SM00386 EMBL:CP002688 GO:GO:0005622
            GO:GO:0006396 KO:K13217 IPI:IPI00548633 RefSeq:NP_199452.2
            UniGene:At.29958 ProteinModelPortal:F4KHG8 SMR:F4KHG8 PRIDE:F4KHG8
            EnsemblPlants:AT5G46400.1 GeneID:834683 KEGG:ath:AT5G46400
            OMA:HIKLFPH ArrayExpress:F4KHG8 Uniprot:F4KHG8
        Length = 1036

 Score = 148 (57.2 bits), Expect = 3.8e-09, P = 3.8e-09
 Identities = 34/74 (45%), Positives = 42/74 (56%)

Query:     4 LVYNSFLADFHLCYGYWRKAC------CALLTRVVEVFEQSMQSATYSSDVWFHYCNLAS 57
             LVY++FL +F LC+GYWRK        C L    VEVFE+++Q+ATYS  VW  YC  A 
Sbjct:    69 LVYDAFLLEFPLCHGYWRKYAYHKIKLCTL-EDAVEVFERAVQAATYSVAVWLDYCAFAV 127

Query:    58 EVPPDGHRKSEYQE 71
                 D H  S   E
Sbjct:   128 AAYEDPHDVSRLFE 141


>DICTYBASE|DDB_G0283307 [details] [associations]
            symbol:prpf39 "pre-mRNA processing factor 39"
            species:44689 "Dictyostelium discoideum" [GO:0006397 "mRNA
            processing" evidence=IEA] [GO:0006396 "RNA processing"
            evidence=IEA] [GO:0005634 "nucleus" evidence=IEA] [GO:0005622
            "intracellular" evidence=IEA] [GO:0003674 "molecular_function"
            evidence=ND] InterPro:IPR003107 InterPro:IPR008847
            InterPro:IPR011990 Pfam:PF05843 SMART:SM00386
            dictyBase:DDB_G0283307 GO:GO:0005634 GenomeReviews:CM000153_GR
            GO:GO:0006397 Gene3D:1.25.40.10 EMBL:AAFI02000052 KO:K13217
            eggNOG:NOG298273 RefSeq:XP_639156.1 ProteinModelPortal:Q54R91
            EnsemblProtists:DDB0233547 GeneID:8624026 KEGG:ddi:DDB_G0283307
            InParanoid:Q54R91 OMA:ADHEYAH Uniprot:Q54R91
        Length = 699

 Score = 111 (44.1 bits), Expect = 2.1e-05, P = 2.1e-05
 Identities = 21/57 (36%), Positives = 35/57 (61%)

Query:     2 IRLVYNSFLADFHLCYGYWRKACCALL-----TRVVEVFEQSMQSATYSSDVWFHYC 53
             IR VY+ FL +F LC+ YW++           T+ +E+FE+++ S  +S D+W +YC
Sbjct:    57 IRKVYSEFLNEFPLCFLYWKRFADHEYAHNNTTQSIEIFEKAVSSIPHSVDIWLNYC 113


>ASPGD|ASPL0000046692 [details] [associations]
            symbol:AN1635 species:162425 "Emericella nidulans"
            [GO:0006396 "RNA processing" evidence=IEA] [GO:0005685 "U1 snRNP"
            evidence=IEA] [GO:0003674 "molecular_function" evidence=ND]
            InterPro:IPR003107 InterPro:IPR011990 SMART:SM00386 EMBL:BN001307
            GO:GO:0005622 GO:GO:0006396 eggNOG:COG0457 Gene3D:1.25.40.10
            EMBL:AACD01000026 KO:K13217 HOGENOM:HOG000189748 OMA:ARYFERY
            OrthoDB:EOG4DNJD8 RefSeq:XP_659239.1 ProteinModelPortal:Q5BCU5
            EnsemblFungi:CADANIAT00008273 GeneID:2874721 KEGG:ani:AN1635.2
            Uniprot:Q5BCU5
        Length = 588

 Score = 102 (41.0 bits), Expect = 0.00015, P = 0.00015
 Identities = 23/66 (34%), Positives = 35/66 (53%)

Query:     2 IRLVYNSFLADFHLCYGYWRKACCALL----TRVVE-VFEQSMQSATYSSDVWFHYCNLA 56
             +R VY+ FLA F L +GYW+K          T   + V+E+ + S + S D+W +YC   
Sbjct:    59 VRNVYDRFLAKFPLLFGYWKKYADLEFSITGTEAADMVYERGVASISSSVDLWTNYCTFK 118

Query:    57 SEVPPD 62
             +E   D
Sbjct:   119 AETSHD 124


>FB|FBgn0039600 [details] [associations]
            symbol:CG1646 species:7227 "Drosophila melanogaster"
            [GO:0005685 "U1 snRNP" evidence=ISS] [GO:0000398 "mRNA splicing,
            via spliceosome" evidence=ISS] [GO:0005634 "nucleus" evidence=IC]
            [GO:0000381 "regulation of alternative mRNA splicing, via
            spliceosome" evidence=IMP] InterPro:IPR003107 InterPro:IPR011990
            SMART:SM00386 InterPro:IPR001623 EMBL:AE014297 Gene3D:1.25.40.10
            SMART:SM00271 GO:GO:0000398 GO:GO:0000381 eggNOG:COG5107
            GO:GO:0005685 GeneTree:ENSGT00390000005033 KO:K13217 EMBL:AY051737
            RefSeq:NP_001097957.1 RefSeq:NP_001097958.1 RefSeq:NP_651634.1
            RefSeq:NP_733256.2 RefSeq:NP_788753.1 RefSeq:NP_788754.2
            UniGene:Dm.31288 ProteinModelPortal:Q7KRW8 SMR:Q7KRW8 IntAct:Q7KRW8
            MINT:MINT-820225 STRING:Q7KRW8 PaxDb:Q7KRW8 PRIDE:Q7KRW8
            EnsemblMetazoa:FBtr0085322 GeneID:43399 KEGG:dme:Dmel_CG1646
            UCSC:CG1646-RB FlyBase:FBgn0039600 InParanoid:Q7KRW8 OMA:IRWENES
            OrthoDB:EOG4ZKH2F PhylomeDB:Q7KRW8 GenomeRNAi:43399 NextBio:833726
            Bgee:Q7KRW8 Uniprot:Q7KRW8
        Length = 1066

 Score = 101 (40.6 bits), Expect = 0.00041, P = 0.00041
 Identities = 19/55 (34%), Positives = 30/55 (54%)

Query:     3 RLVYNSFLADFHLCYGYWRKACC-----ALLTRVVEVFEQSMQSATYSSDVWFHY 52
             R  Y++FL+ +  CYGYWRK         +     +VFE+ +++   S D+W HY
Sbjct:   399 REAYDTFLSHYPYCYGYWRKYADYEKRKGIKANCYKVFERGLEAIPLSVDLWIHY 453


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.325   0.134   0.447    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0       73        73   0.00091  102 3  11 22  0.45    28
                                                     29  0.45    29


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  4
  No. of states in DFA:  543 (58 KB)
  Total size of DFA:  118 KB (2078 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  8.55u 0.11s 8.66t   Elapsed:  00:00:00
  Total cpu time:  8.55u 0.11s 8.66t   Elapsed:  00:00:00
  Start:  Sat May 11 08:42:56 2013   End:  Sat May 11 08:42:56 2013

Back to top