BLAST Results

Query Summary

Your job contains 1 sequence.

Parameters
Threshold: 0.001
Maximum number of alignments shown: 100
BLAST filter: on

Query Sequence

>021799
METGTSSKEDAPVFLDRSSRVTRGKRMNKLLDDENEEDEAFWNQDALKEEENDDNYEEEQ
EIADEFDSDFDEDEPEPDEEVENEVDERVWTKKRLIFPGKPLTKKKKKKKILSKLDSPDK
DVKSNEQSILPENHDVPNDVEGERIIRKSTRTAVVVRQAERDAIRAALQATMKPIKRKKE
GEEKRMTQEEMLLEAAQTEIMNLRNLERVLAREEEVKKRAIVHKAVYTGPQLRYLSKDGY
SYLEFSKGVSFQSELSTTSVPYPERAVCAVTGLPAKNTMKRTTIATEEFILLWTMTVSNL
LQIGVGE

High Scoring Gene Products

Symbol, full name Information P value
SWC2
AT2G36740
protein from Arabidopsis thaliana 6.5e-67
vps72
vacuolar protein sorting 72 homolog (S. cerevisiae)
gene_product from Danio rerio 5.4e-06
YL-1 protein from Drosophila melanogaster 1.6e-05
LOC100859689
Uncharacterized protein
protein from Gallus gallus 0.00011
DDB_G0293512
Vacuolar protein sorting-associated protein 72
gene from Dictyostelium discoideum 0.00013
si:ch211-203d17.1 gene_product from Danio rerio 0.00032

Back to top

Raw Blast Data

BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]

Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.

Reference:  Gish, W. (1996-2006) http://blast.wustl.edu

Query=  021799
        (307 letters)

Database:  go_20130330-seqdb.fasta
           368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done

                                                                     Smallest
                                                                       Sum
                                                              High  Probability
Sequences producing High-scoring Segment Pairs:              Score  P(N)      N

TAIR|locus:2040580 - symbol:SWC2 "AT2G36740" species:3702...   680  6.5e-67   1
ZFIN|ZDB-GENE-060503-88 - symbol:vps72 "vacuolar protein ...   110  5.4e-06   3
FB|FBgn0032321 - symbol:YL-1 "YL-1" species:7227 "Drosoph...   116  1.6e-05   2
UNIPROTKB|H9KYW9 - symbol:LOC100859689 "Uncharacterized p...   108  0.00011   2
DICTYBASE|DDB_G0293512 - symbol:DDB_G0293512 "Vacuolar pr...   110  0.00013   3
ZFIN|ZDB-GENE-100922-13 - symbol:si:ch211-203d17.1 "si:ch...    96  0.00032   2


>TAIR|locus:2040580 [details] [associations]
            symbol:SWC2 "AT2G36740" species:3702 "Arabidopsis
            thaliana" [GO:0003677 "DNA binding" evidence=ISS] [GO:0005634
            "nucleus" evidence=ISM;IEA] [GO:0006355 "regulation of
            transcription, DNA-dependent" evidence=IEA] [GO:0006338 "chromatin
            remodeling" evidence=RCA] [GO:0008284 "positive regulation of cell
            proliferation" evidence=RCA] [GO:0009909 "regulation of flower
            development" evidence=RCA] [GO:0042742 "defense response to
            bacterium" evidence=RCA] InterPro:IPR008895 Pfam:PF05764
            GO:GO:0005634 EMBL:CP002685 GO:GO:0006355 InterPro:IPR013272
            Pfam:PF08265 SMART:SM00993 KO:K11664 IPI:IPI00531367
            RefSeq:NP_181212.2 UniGene:At.37514 EnsemblPlants:AT2G36740.1
            GeneID:818246 KEGG:ath:AT2G36740 OMA:VPYPEKA Uniprot:F4IP06
        Length = 365

 Score = 680 (244.4 bits), Expect = 6.5e-67, P = 6.5e-67
 Identities = 149/275 (54%), Positives = 173/275 (62%)

Query:     8 KEDAPVFLDRSSRVTRGKRMNKLLXXXXXXXXAFWNQDALKXXXXXXXXXXXXXIAXXXX 67
             +E+  VFLDR++R TRGKRM KLL         FWNQ+ALK             +A    
Sbjct:     5 EEEPMVFLDRTTRATRGKRMTKLLDDEVEEDEQFWNQEALKEEEHDDEYEAEREVADEFD 64

Query:    68 XXXXXXXXXXXXXXXXXXXXRVWTKKRLIFPGXXXXXXXXXXXXXXXXXXXXXXVKS--- 124
                                 R   KKRLI+PG                       +    
Sbjct:    65 SDFNDDEPEPDAVAVNEKELRDLPKKRLIYPGKTASKKKKKKTKVVSQLEYIPGDEKPGE 124

Query:   125 ---NEQSILPENHDVPNDVEGERIIRKSTRTAVVVRQAERDAIRAALQATMKPIKRKKEG 181
                N++    E ++   D+EGE++IRKSTRT+VVVRQAERDA+RAA+QAT KPI+RKK G
Sbjct:   125 ELGNKEQEEKEENEAQEDMEGEKVIRKSTRTSVVVRQAERDALRAAIQATTKPIQRKKVG 184

Query:   182 EEKRMTQEEMLLEAAQTEIMNLRNLERVLAREEEVKKRAIVHKAVYTGPQLRYLSKDGYS 241
             EEKRMTQEEMLLEAAQTEIMNLRNLERVLAREEEVKK+AIVHKAVY GPQ+RY SKDG +
Sbjct:   185 EEKRMTQEEMLLEAAQTEIMNLRNLERVLAREEEVKKKAIVHKAVYKGPQIRYHSKDGCN 244

Query:   242 YLEFSKGVSFQSELSTTSVPYPERAVCAVTGLPAK 276
             YLEF  G SF SELST SVPYPE+AVC +TGLPAK
Sbjct:   245 YLEFCNGASFNSELSTKSVPYPEKAVCVITGLPAK 279


>ZFIN|ZDB-GENE-060503-88 [details] [associations]
            symbol:vps72 "vacuolar protein sorting 72 homolog
            (S. cerevisiae)" species:7955 "Danio rerio" [GO:0006355 "regulation
            of transcription, DNA-dependent" evidence=IEA] [GO:0005634
            "nucleus" evidence=IEA] InterPro:IPR008895 Pfam:PF05764
            ZFIN:ZDB-GENE-060503-88 GO:GO:0005634 GO:GO:0006355 EMBL:CR548627
            InterPro:IPR013272 Pfam:PF08265 SMART:SM00993 KO:K11664 CTD:6944
            eggNOG:NOG253988 GeneTree:ENSGT00390000017503 HOGENOM:HOG000033695
            HOVERGEN:HBG083244 OMA:DVEGLDQ OrthoDB:EOG4QFWDG IPI:IPI00632869
            RefSeq:NP_001076413.1 UniGene:Dr.84090 Ensembl:ENSDART00000093383
            GeneID:100001285 KEGG:dre:100001285 NextBio:20784979 Uniprot:Q1L8X4
        Length = 369

 Score = 110 (43.8 bits), Expect = 5.4e-06, Sum P(3) = 5.4e-06
 Identities = 35/105 (33%), Positives = 52/105 (49%)

Query:   132 ENHDVPNDVEGERIIRKSTRTAVVVRQAERDAIRAALQATMKPIKRKKEGEEKRMTQEEM 191
             E   V ++++    IRKS R +      + +  R   +    P KRK    E+ +TQ+E+
Sbjct:   115 ERRTVRDELQDLGDIRKSVRKSTSEHTRKTNE-RLQERQQEAPRKRKGAQSERVLTQDEL 173

Query:   192 LLEAAQTEIMNLRNLERVLAREEEVKKRAIVHKAVYTGPQLRYLS 236
             L EA  T   NLR+LE    R E  KK+ +  K  + GP +RY S
Sbjct:   174 LDEAKLTAESNLRSLENY-ERLEADKKKQVHKKRRFEGPMIRYHS 217

 Score = 48 (22.0 bits), Expect = 5.4e-06, Sum P(3) = 5.4e-06
 Identities = 13/38 (34%), Positives = 20/38 (52%)

Query:   241 SYLEFSKGVSFQSELSTTS--VP-YPERAVCAVTGLPA 275
             +Y+ FS   +F S   + +   P +P + VC VT  PA
Sbjct:   263 TYITFSDDEAFSSAFPSAARCTPTHPVQEVCPVTHKPA 300

 Score = 42 (19.8 bits), Expect = 5.4e-06, Sum P(3) = 5.4e-06
 Identities = 9/15 (60%), Positives = 10/15 (66%)

Query:    17 RSSRVTRGKRMNKLL 31
             R  R T G RM+KLL
Sbjct:     7 REQRSTAGNRMSKLL 21


>FB|FBgn0032321 [details] [associations]
            symbol:YL-1 "YL-1" species:7227 "Drosophila melanogaster"
            [GO:0006355 "regulation of transcription, DNA-dependent"
            evidence=IEA] [GO:0004402 "histone acetyltransferase activity"
            evidence=IDA] [GO:0005634 "nucleus" evidence=IDA] [GO:0016573
            "histone acetylation" evidence=IDA] [GO:0043486 "histone exchange"
            evidence=IDA] [GO:0035267 "NuA4 histone acetyltransferase complex"
            evidence=IPI] [GO:0010629 "negative regulation of gene expression"
            evidence=IMP] InterPro:IPR008895 Pfam:PF05764 EMBL:AE014134
            GO:GO:0006355 GO:GO:0003677 GO:GO:0016573 GO:GO:0006351
            GO:GO:0035267 GO:GO:0010629 GO:GO:0043486 InterPro:IPR013272
            Pfam:PF08265 SMART:SM00993 KO:K11664 eggNOG:NOG253988
            GeneTree:ENSGT00390000017503 OMA:DVEGLDQ EMBL:AY122237
            RefSeq:NP_609475.1 UniGene:Dm.12184 IntAct:Q9VKM6 MINT:MINT-318738
            STRING:Q9VKM6 PaxDb:Q9VKM6 PRIDE:Q9VKM6 EnsemblMetazoa:FBtr0080146
            GeneID:34516 KEGG:dme:Dmel_CG4621 UCSC:CG4621-RA CTD:34516
            FlyBase:FBgn0032321 InParanoid:Q9VKM6 OrthoDB:EOG4T4BBF
            PhylomeDB:Q9VKM6 GenomeRNAi:34516 NextBio:788874 Bgee:Q9VKM6
            GermOnline:CG4621 Uniprot:Q9VKM6
        Length = 351

 Score = 116 (45.9 bits), Expect = 1.6e-05, Sum P(2) = 1.6e-05
 Identities = 34/91 (37%), Positives = 51/91 (56%)

Query:   147 RKSTRTAVVVR-QAERDAIRAALQATMKPIKRKKEGEEKRMTQEEMLLEAAQTEIMNLRN 205
             RKS RT+  ++ QA +  ++  L    K  K+K   E+   TQEE+L EA  TE  N ++
Sbjct:   130 RKSIRTSTAIKTQATKIRLKE-LDDARKRKKKKVRVEDYMPTQEELLEEAKITEEENTKS 188

Query:   206 LERVLAREEEVKKRAIVHKAVYTGPQLRYLS 236
             LE+    E E KK++   K  ++GP +RY S
Sbjct:   189 LEKFQKMELE-KKKSRPTKRTFSGPTIRYHS 218

 Score = 46 (21.3 bits), Expect = 1.6e-05, Sum P(2) = 1.6e-05
 Identities = 11/33 (33%), Positives = 17/33 (51%)

Query:   244 EFSKGVSFQSELSTTSVPYPERAVCAVTGLPAK 276
             +F+  V FQS     + P     +C +T LPA+
Sbjct:   254 DFNDKV-FQSLFRHKAPPKASNGICPITRLPAR 285


>UNIPROTKB|H9KYW9 [details] [associations]
            symbol:LOC100859689 "Uncharacterized protein" species:9031
            "Gallus gallus" [GO:0005634 "nucleus" evidence=IEA] [GO:0006355
            "regulation of transcription, DNA-dependent" evidence=IEA]
            [GO:0035019 "somatic stem cell maintenance" evidence=IEA]
            [GO:0043234 "protein complex" evidence=IEA] InterPro:IPR008895
            Pfam:PF05764 GO:GO:0005634 GO:GO:0006355 InterPro:IPR013272
            Pfam:PF08265 SMART:SM00993 GeneTree:ENSGT00390000017503 OMA:DVEGLDQ
            EMBL:AADN02010484 Ensembl:ENSGALT00000001151 Uniprot:H9KYW9
        Length = 366

 Score = 108 (43.1 bits), Expect = 0.00011, Sum P(2) = 0.00011
 Identities = 41/117 (35%), Positives = 57/117 (48%)

Query:   123 KSNEQSILP-ENHDVPNDVEGERIIRKSTRTAVVVRQAERDAIRAALQATMKPIKRKKEG 181
             K+ E    P E  D   D  G + +R+ST          +  +R  +Q      KRKK G
Sbjct:   110 KAREVKSAPLELQDEVADT-GRKHMRQST-----TEHTRQTFLR--IQERQVQSKRKKGG 161

Query:   182 E--EKRMTQEEMLLEAAQTEIMNLRNLERVLAREEEVKKRAIVHKAVYTGPQLRYLS 236
                ++ +TQEE+L EA  TE +NLR+LE    R E  KK+ +  K    GP +RY S
Sbjct:   162 PNYDRPLTQEELLEEAKITEEINLRSLENY-ERLEADKKKQVQKKRKCVGPVIRYWS 217

 Score = 48 (22.0 bits), Expect = 0.00011, Sum P(2) = 0.00011
 Identities = 11/36 (30%), Positives = 17/36 (47%)

Query:   241 SYLEFSKGVSFQSELSTTSVP-YPERAVCAVTGLPA 275
             +++ FS   +F+        P  P R +C VT  PA
Sbjct:   261 TFISFSDDETFERFFPKAKAPRLPVREICPVTHKPA 296


>DICTYBASE|DDB_G0293512 [details] [associations]
            symbol:DDB_G0293512 "Vacuolar protein
            sorting-associated protein 72" species:44689 "Dictyostelium
            discoideum" [GO:0006355 "regulation of transcription,
            DNA-dependent" evidence=IEA] [GO:0005634 "nucleus" evidence=IEA]
            InterPro:IPR008895 Pfam:PF05764 dictyBase:DDB_G0293512
            GO:GO:0005634 GO:GO:0006355 EMBL:AAFI02000217 InterPro:IPR013272
            Pfam:PF08265 SMART:SM00993 RefSeq:XP_629104.1
            EnsemblProtists:DDB0191982 GeneID:8629263 KEGG:ddi:DDB_G0293512
            eggNOG:NOG304632 InParanoid:Q54BP8 OMA:KETEIYN Uniprot:Q54BP8
        Length = 508

 Score = 110 (43.8 bits), Expect = 0.00013, Sum P(3) = 0.00013
 Identities = 24/64 (37%), Positives = 38/64 (59%)

Query:   186 MTQEEMLLEAAQTEIMNLRNLERVLAREEEVKKRAIVHKAVYTGPQLRYLSKDGYSYLEF 245
             +TQEE+L E  +TEI N  +L  +L +EE+ KK     KA+ TGP++ Y S    + + F
Sbjct:   268 LTQEELLEECKETEIYNTESLNHLLQQEEDKKKVFHPKKAILTGPRIIYRSTPEQTTITF 327

Query:   246 SKGV 249
             +  +
Sbjct:   328 TDSI 331

 Score = 45 (20.9 bits), Expect = 0.00013, Sum P(3) = 0.00013
 Identities = 11/30 (36%), Positives = 17/30 (56%)

Query:   247 KGVSFQSELSTTSVPYPERAVCAVTGLPAK 276
             KG + ++E S   +      +C +TGLPAK
Sbjct:   416 KGDTNRTENSENEIK-KNIELCVITGLPAK 444

 Score = 37 (18.1 bits), Expect = 0.00013, Sum P(3) = 0.00013
 Identities = 8/12 (66%), Positives = 9/12 (75%)

Query:    19 SRVTRGKRMNKL 30
             SR TRGKR  +L
Sbjct:     6 SRSTRGKRTTEL 17


>ZFIN|ZDB-GENE-100922-13 [details] [associations]
            symbol:si:ch211-203d17.1 "si:ch211-203d17.1"
            species:7955 "Danio rerio" [GO:0005634 "nucleus" evidence=IEA]
            [GO:0006355 "regulation of transcription, DNA-dependent"
            evidence=IEA] InterPro:IPR008895 Pfam:PF05764
            ZFIN:ZDB-GENE-100922-13 GO:GO:0005634 GO:GO:0006355
            InterPro:IPR013272 Pfam:PF08265 SMART:SM00993
            GeneTree:ENSGT00390000017503 EMBL:CT954330 IPI:IPI00804092
            Ensembl:ENSDART00000127323 Uniprot:F1QYW2
        Length = 335

 Score = 96 (38.9 bits), Expect = 0.00032, Sum P(2) = 0.00032
 Identities = 33/100 (33%), Positives = 49/100 (49%)

Query:   137 PNDVEGERIIRKSTRTAVVVRQAERDAIRAALQATMKPIKRKKEGEEKRMTQEEMLLEAA 196
             P +++ + I RKS R +    +  R       +  + P +RK    E+ +TQ E+L EA 
Sbjct:    83 PPELQDD-INRKSVRQSTT--EHTRLTYLRLQERQVAPRRRKGTRHERPLTQAELLAEAK 139

Query:   197 QTEIMNLRNLERVLAREEEVKKRAIVHKAVYTGPQLRYLS 236
              T  +NLR+LE    R E  KKR +  K    G  +RY S
Sbjct:   140 VTAEINLRSLENY-ERLEADKKRQVHMKRQCVGSVIRYHS 178

 Score = 56 (24.8 bits), Expect = 0.00032, Sum P(2) = 0.00032
 Identities = 15/41 (36%), Positives = 20/41 (48%)

Query:   236 SKDGYSYLEFSKGVSFQSELSTTSVP-YPERAVCAVTGLPA 275
             +K   +Y+ FS   SFQ     +  P  P + VC VT  PA
Sbjct:   236 TKCSRTYITFSDDESFQRVFPQSPAPRVPVQEVCPVTHKPA 276


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.315   0.129   0.355    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      307       240   0.00093  113 3  11 23  0.45    33
                                                     32  0.44    36


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  6
  No. of states in DFA:  592 (63 KB)
  Total size of DFA:  161 KB (2096 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  24.73u 0.13s 24.86t   Elapsed:  00:00:01
  Total cpu time:  24.73u 0.13s 24.86t   Elapsed:  00:00:01
  Start:  Fri May 10 13:01:32 2013   End:  Fri May 10 13:01:33 2013

Back to top