BLAST Results

Query Summary

Your job contains 1 sequence.

Parameters
Threshold: 0.001
Maximum number of alignments shown: 100
BLAST filter: on

Query Sequence

>004377
MPSVGMRRTTRVFGVVKGVDGARVLRSGRRLWPDSGDGKLRRTNYGDDWYHHPVINKKNG
GPGGPKCKPNGWAAHLDDLKVYANNDEKKEVKMCKKVKEELKGADLMYGIVYSRKRKRND
GEKSKILEKKKYGIQFSRRQRRKKSEKIVPFSVFGVGLESSSSGFLVSFLSSVLGCMRRA
TVELPRLASFLLSETISGVFSLRGIRFSWDPPIARTGMCRIFGTMQLIPMFSLDFSAVPS
CFMYIHHCMLVRFMRPPSVNSSASEDDSSEEEDVDYVCESKTVTPVVDNSVNKVALHPSV
RSSKLAARNVQYRSSLNSRAIQKRRSSLRRRRARNPSLIGSQKASGALVSDLTSCRKSSI
PSSSAVSKSKLRSSLQHSSVLSIKEVSSTVDSLMLDLDRSCCCVSILVMESDRCCRVEGA
NVILEMSHSKEWHLVVKKDGETRYSFKAQRIMRPSSFNRFTHAILWAGDDNWKLEFSNRQ
DWLDFKDLYKECSDRNAQVSVSKVIPIPGVCEVLGYEDSNTVPFCRPDSYISVNVDEVSR
ALAKRTANYDMDSEDEEWLKKFNNEFVTENELHEHVSEDTFELIVDAFEKAYFCSPDDYS
NEEAAVNLCLELGQKEVVLAVYNHWKQKRKQKRAALLRVFQGRQPKKPSLIPKPALRKRR
SFKRQASQPGRGKPPVVLLPEVVTQQDALEEQNAMRRVEEAKASAKRSLEEAVLKRQRAQ
LLMQNADLATYKATMALRIAEAAQVAESADAAADHFLD

High Scoring Gene Products

Symbol, full name                Information                          P value
AT5G04670                        protein from Arabidopsis thaliana    1.7e-139
AT4G32620                        protein from Arabidopsis thaliana    2.2e-38
AT1G16690                        protein from Arabidopsis thaliana    9.5e-05
DDB_G0283859, BRD group protein  gene from Dictyostelium discoideum   0.00052


Raw Blast Data

BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]

Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.

Reference:  Gish, W. (1996-2006) http://blast.wustl.edu

Query=  004377
        (758 letters)

Database:  go_20130330-seqdb.fasta
           368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done

                                                                     Smallest
                                                                       Sum
                                                              High  Probability
Sequences producing High-scoring Segment Pairs:              Score  P(N)      N

TAIR|locus:2180218 - symbol:AT5G04670 "AT5G04670" species...  1165  1.7e-139  3
TAIR|locus:2125682 - symbol:AT4G32620 species:3702 "Arabi...   412  2.2e-38   3
TAIR|locus:2017978 - symbol:AT1G16690 species:3702 "Arabi...   127  9.5e-05   1
DICTYBASE|DDB_G0283859 - symbol:DDB_G0283859 "BRD group p...   130  0.00052   2


>TAIR|locus:2180218
            symbol:AT5G04670 "AT5G04670" species:3702 "Arabidopsis
            thaliana" [GO:0003674 "molecular_function" evidence=ND] [GO:0005634
            "nucleus" evidence=ISM] [GO:0008150 "biological_process"
            evidence=ND] InterPro:IPR024943 EMBL:CP002688
            GenomeReviews:BA000015_GR InterPro:IPR019542 Pfam:PF10513
            EMBL:AL162972 GO:GO:0032777 PANTHER:PTHR14898 IPI:IPI00544200
            PIR:T48463 RefSeq:NP_196087.1 UniGene:At.33087
            EnsemblPlants:AT5G04670.1 GeneID:830344 KEGG:ath:AT5G04670
            TAIR:At5g04670 eggNOG:NOG329901 HOGENOM:HOG000083201
            InParanoid:Q9LZ30 OMA:MRRTTRV PhylomeDB:Q9LZ30
            ProtClustDB:CLSN2686429 Genevestigator:Q9LZ30 Uniprot:Q9LZ30
        Length = 766

 Score = 1165 (415.2 bits), Expect = 1.7e-139, Sum P(3) = 1.7e-139
 Identities = 275/645 (42%), Positives = 363/645 (56%)

Query:   105 DLMYGIVYSRKRKRN-DGEKSKILEKKKYGIQFSRRQRRKKSEKIVPXXXXXXXXXXXXX 163
             D M+GIVYSRKRKR  +   S   E+    ++F RR RRK S++ V              
Sbjct:   112 DKMFGIVYSRKRKRLCEPSSSDRSEEPLRSLKFYRR-RRKLSQR-VSSVLTLTVDWSCED 169

Query:   164 XXXXXXXXXXXXCMRRATVELPRLASFLLSETISGVFSLRGIRFSWDPPIARTGMCRIFG 223
                          +RR  + L  LASF LS+ I+ VF+  G+RF    P++  G+C+ FG
Sbjct:   170 CWFLTVFGLAMRYIRREELRLSSLASFFLSQPINQVFADHGVRFLVRSPLSSRGVCKFFG 229

Query:   224 TMQLIPMFSLDFSAVPSCFMYIHHCMLVRFMRPPXXXXXXXXXXXXXXXXXXXXXXXKTV 283
              M  +P+FS DF+ +P  FM +H  + VR + P                        +  
Sbjct:   230 AMSCLPLFSADFAVIPRWFMDMHFTLFVRVL-PRSFFFVEKSLYLLNNPIEESDSESELA 288

Query:   284 TPVVDNSVNKVA--LHPSVRSSKLAARNVQYRSSLNSRAIQKXXXXXXXXXXXNPSLIGS 341
              P      N V   LHPSVR+SKL   N QYR +L S + QK           N S    
Sbjct:   289 LPEPCTPRNGVVVGLHPSVRASKLTGGNAQYRGNLGSHSFQKRRSSLRRRRARNLSHNAH 348

Query:   342 QKASGALVSDLTSCRXXXXXXXXXXXXXXXXXXLQHSSVLSIKEVSSTVDSLMLDLDRSC 401
             +  +G  V D++  R                  L +SS +S       +     +LD  C
Sbjct:   349 KLNNGTPVFDISGSRKNRTAAVSSKKLRSSV--LSNSSPVSNGISIIPMTKTKEELDSIC 406

Query:   402 CCVSILVMESDRCCRVEGANVILEMSHSKEWHLVVKKDGETRYSFKAQRIMRPSSFNRFT 461
             C  +IL++ SDRC R EG +V+LE S SKEW LV+KKDG  RYS  AQR MRP S NR T
Sbjct:   407 CSANILMIHSDRCTREEGFSVMLEASSSKEWFLVIKKDGAIRYSHMAQRTMRPFSSNRIT 466

Query:   462 HAILWAGDDNWKLEFSNRQDWLDFKDLYKECSDRNAQVSVSKVIPIPGVCEVLGYED--S 519
             HA +W G DNWKLEF +RQDWL FKD+YKEC +RN      KVIPIPGV EV GY +   
Sbjct:   467 HATVWMGGDNWKLEFCDRQDWLGFKDIYKECYERNLLEQSVKVIPIPGVREVCGYAEYID 526

Query:   520 NTVPFCRPD-SYISVNVDEVSRALAKRTANYDMDSEDEEWLKKFNNEFVTE-NELHEHVS 577
             N   F RP  SYISVN DEVSRA+A+  A YDMDSEDEEWL++ N + + E ++ +  + 
Sbjct:   527 NFPSFSRPPVSYISVNEDEVSRAMARSIALYDMDSEDEEWLERQNQKMLNEEDDQYLQLQ 586

Query:   578 EDTFELIVDAFEKAYFCSP-DDYSNEEAA-VNLCLELGQKEVVLAVYNHWKQKRKQKRAA 635
              + FEL++D FEK +F SP DD  +E+AA +     LG++EVV AV+++W +KRKQ++A 
Sbjct:   587 REAFELMIDGFEKYHFHSPADDLLDEKAATIGSISYLGRQEVVEAVHDYWLKKRKQRKAP 646

Query:   636 LLRVFQGRQPKKPSLIPKPALRKRRSFKRQASQ-PGRGKPPVVLLPEVVTQQDALEEQNA 694
             LLR+FQG Q KK  L+ KP  RKRRSFKRQ SQ  G+ K     +  V   +   EE++ 
Sbjct:   647 LLRIFQGHQVKKTQLLSKPVFRKRRSFKRQGSQLHGKAKQTSPWMVAVKAAEP--EEEDD 704

Query:   695 MRRVEEAKASAKRSLEEAVLKRQRAQLLMQNADLATYKATMALRI 739
             + R+EEAK  A +++E A+ KR+RAQ+L +NADLA YKA  ALRI
Sbjct:   705 ILRMEEAKVLADKTMETAIAKRRRAQILAENADLAVYKAMRALRI 749

 Score = 181 (68.8 bits), Expect = 1.7e-139, Sum P(3) = 1.7e-139
 Identities = 34/44 (77%), Positives = 39/44 (88%)

Query:     1 MPSVGMRRTTRVFGVVKGVDGARVLRSGRRLWPDSGDGKLRRTN 44
             MPSVGMRRTTRVFGVVK  DGARVLRSGRR+WP+ G+ K+RR +
Sbjct:     1 MPSVGMRRTTRVFGVVKAADGARVLRSGRRIWPNVGEPKVRRAH 44

 Score = 55 (24.4 bits), Expect = 1.7e-139, Sum P(3) = 1.7e-139
 Identities = 11/32 (34%), Positives = 20/32 (62%)

Query:   111 VYSRKRKRNDGEKSKILEKKKYGIQFSRRQRR 142
             V  R++ RN+G   +    K +GI +SR+++R
Sbjct:    94 VTKRRKVRNEGVGDEKTVDKMFGIVYSRKRKR 125


>TAIR|locus:2125682
            symbol:AT4G32620 species:3702 "Arabidopsis thaliana"
            [GO:0005634 "nucleus" evidence=ISM] [GO:0008150
            "biological_process" evidence=ND] [GO:0009616 "virus induced gene
            silencing" evidence=RCA] [GO:0010050 "vegetative phase change"
            evidence=RCA] [GO:0010267 "production of ta-siRNAs involved in RNA
            interference" evidence=RCA] [GO:0035196 "production of miRNAs
            involved in gene silencing by miRNA" evidence=RCA]
            InterPro:IPR002999 EMBL:CP002687 SMART:SM00333 InterPro:IPR019542
            Pfam:PF10513 UniGene:At.46639 IPI:IPI00539967 RefSeq:NP_001154281.1
            UniGene:At.31652 PRIDE:F4JV27 EnsemblPlants:AT4G32620.2
            GeneID:829397 KEGG:ath:AT4G32620 OMA:HHVKYDD Uniprot:F4JV27
        Length = 1540

 Score = 412 (150.1 bits), Expect = 2.2e-38, Sum P(3) = 2.2e-38
 Identities = 94/260 (36%), Positives = 136/260 (52%)

Query:   388 STVD---SLMLDLDRSCCCVSILVMESDRCCRVEGANVILEMSHSKEWHLVVKKDGETRY 444
             ST D    +  DL+ S C  ++LV   DR  R  GA + LE   + EW L VK  G T+Y
Sbjct:  1022 STADVTKGIQKDLESSLCDANVLVTLGDRGWREYGAQIFLEPFDNNEWRLAVKISGTTKY 1081

Query:   445 SFKAQRIMRPSSFNRFTHAILWAGDDNWKLEFSNRQDWLDFKDLYKECSDRNAQVSVSKV 504
             S +A + ++P S NRFTHA++W G  +W LEF +R  W  FK++++EC +RN + ++ + 
Sbjct:  1082 SHRAHQFLQPGSVNRFTHAMMWKGGKDWTLEFPDRGQWFLFKEMHEECYNRNTRAALVRN 1141

Query:   505 IPIPGV--CEVLGYEDSNTVPFCRPDS-YISVNVDEVSRALAKRTANYDMDSEDEEWLKK 561
             IPIPG+   E   ++ + T  F R  S Y      +V  AL      YDMDS+DE+ L +
Sbjct:  1142 IPIPGIRMIERDNFDGTET-EFIRSSSKYFRQTETDVEMALDPSRVMYDMDSDDEQCLLR 1200

Query:   562 FNNEFVTENELHEHVSEDTFELIVDAFEKAYFCSPDDYSNEEAAVNLCLELGQKEVVLAV 621
                    EN     ++ED FE  +D FEKA F    D         L   +G  E +  +
Sbjct:  1201 IRECSSAENSGSCEITEDMFEKAMDMFEKASFVKQRDNFTLIEIQELTAGVGSLEAMETI 1260

Query:   622 YNHWKQKRKQKRAALLRVFQ 641
             Y  W+ KR++K   L+R  Q
Sbjct:  1261 YELWRTKRQRKGMPLIRHLQ 1280

 Score = 77 (32.2 bits), Expect = 2.2e-38, Sum P(3) = 2.2e-38
 Identities = 18/41 (43%), Positives = 26/41 (63%)

Query:   697 RVEEAKASAKRSLEEAVLKRQRAQLLMQNADLATYKATMAL 737
             ++ +A  +A+R+   A LKR+RA+ L   ADLA  KA  AL
Sbjct:  1475 KLRDAAGAARRACALAKLKRERAESLRYKADLAIQKAAAAL 1515

 Score = 43 (20.2 bits), Expect = 2.2e-38, Sum P(3) = 2.2e-38
 Identities = 10/25 (40%), Positives = 15/25 (60%)

Query:   114 RKRKRNDGEKSKILEKKK-YGIQFS 137
             R R+ ND  K K+ +++  Y I FS
Sbjct:   324 RPRRHNDDGKGKVRKRRHFYEILFS 348

 Score = 38 (18.4 bits), Expect = 7.2e-38, Sum P(3) = 7.2e-38
 Identities = 9/30 (30%), Positives = 14/30 (46%)

Query:   120 DGEKSKILEKKKYGIQFSRRQRRKKSEKIV 149
             D        K + GI   + ++ KKS K+V
Sbjct:    33 DSVNKSFKRKHRSGIDGDQLKQDKKSRKVV 62


>TAIR|locus:2017978
            symbol:AT1G16690 species:3702 "Arabidopsis thaliana"
            [GO:0005634 "nucleus" evidence=ISM] [GO:0003700 "sequence-specific
            DNA binding transcription factor activity" evidence=ISS]
            [GO:0006355 "regulation of transcription, DNA-dependent"
            evidence=TAS] InterPro:IPR024943 EMBL:CP002684
            GenomeReviews:CT485782_GR GO:GO:0003700 InterPro:IPR019542
            Pfam:PF10513 eggNOG:NOG304124 KO:K11322 GO:GO:0032777
            PANTHER:PTHR14898 EMBL:AC011808 HOGENOM:HOG000006404
            ProtClustDB:CLSN2680665 EMBL:AB493461 IPI:IPI00535090 PIR:B86302
            RefSeq:NP_173113.2 UniGene:At.41874 EnsemblPlants:AT1G16690.1
            GeneID:838238 KEGG:ath:AT1G16690 TAIR:At1g16690 InParanoid:Q9FX82
            OMA:HINCGNT PhylomeDB:Q9FX82 Genevestigator:Q9FX82 Uniprot:Q9FX82
        Length = 439

 Score = 127 (49.8 bits), Expect = 9.5e-05, P = 9.5e-05
 Identities = 67/253 (26%), Positives = 109/253 (43%)

Query:   505 IPIPGVCEVLGYEDSNTVPFCRPDSYISVNVDEVSRALAKRTANYDMDSEDEEWLKKFNN 564
             IP P    V  YE   +  F +P SY+       +RA       YD+D++D++WL ++N 
Sbjct:    67 IPTPQYLVVDTYERDYSRTFNQPASYLRARG---ARAELGEFVEYDLDNDDDDWLYEYNK 123

Query:   565 E-FVTENELHEHVSEDTFEL-IVD--AFEKAYFCSPD------DYSNEEAAVNLCLELGQ 614
             E  +   E+ E V    F+L ++D  A E+A   +P            +AA      L  
Sbjct:   124 ETMILSPEMLEIV---IFKLEVLDHKARERAGVITPTLGLPVPVLLQPDAAGEALKYLSI 180

Query:   615 KEVVL-AVYNHWKQKRKQKRAALLRVFQGRQP---KKPSLIPKPALRKRRSFKRQASQPG 670
             K  V  A+Y++WK KR+  +  +LR  Q   P     P  + +P  +  R   R+  Q  
Sbjct:   181 KYGVFHAIYSYWKNKREIWQKPILRRLQPPPPVNDTNPYNVFRPREKAHRLHARRMQQRE 240

Query:   671 RGKPPVVLLPEVVTQQDALEE-QNAMRRVEEAKA---SAKRSLEEAVLK-RQRAQLLMQN 725
                     L +V    D  +    A+ + EE K    +++ SL+   LK +   +LL  +
Sbjct:   241 NNAQSFEKLRQVRRNLDQAKTILEALIKREEKKRDFMASEVSLQRIQLKYKNETELLEDS 300

Query:   726 ADLATYKATMALR 738
               LA +  + A R
Sbjct:   301 LALAGFPLSTAYR 313


>DICTYBASE|DDB_G0283859
            symbol:DDB_G0283859 "BRD group protein" species:44689
            "Dictyostelium discoideum" [GO:0008270 "zinc ion binding"
            evidence=IEA] [GO:0005634 "nucleus" evidence=ISS] [GO:0046872
            "metal ion binding" evidence=IEA] InterPro:IPR001487
            InterPro:IPR001965 InterPro:IPR019787 Pfam:PF00439 PRINTS:PR00503
            PROSITE:PS50014 PROSITE:PS50016 SMART:SM00249 SMART:SM00297
            dictyBase:DDB_G0283859 GO:GO:0005634 GO:GO:0046872 GO:GO:0008270
            Gene3D:3.30.40.10 InterPro:IPR011011 InterPro:IPR013083
            SUPFAM:SSF57903 eggNOG:COG5141 EMBL:AAFI02000057 Gene3D:1.20.920.10
            SUPFAM:SSF47370 InterPro:IPR019542 Pfam:PF10513 RefSeq:XP_638848.1
            ProteinModelPortal:Q54QM3 EnsemblProtists:DDB0220705 GeneID:8624246
            KEGG:ddi:DDB_G0283859 InParanoid:Q54QM3 OMA:HIPLEIN Uniprot:Q54QM3
        Length = 1678

 Score = 130 (50.8 bits), Expect = 0.00052, Sum P(2) = 0.00052
 Identities = 31/77 (40%), Positives = 41/77 (53%)

Query:   517 EDSNTVPFCRPDSYISVNVDEVSRALAKRTANYDMDSEDEEWLKKFNNEFVTENELHEHV 576
             E+S   P+ +P  YI     E S  +      YDMDSEDE+WL++FN    T N    + 
Sbjct:   391 EESPMAPYNKPSGYIIYQ--EKSSEMLHDEVEYDMDSEDEKWLEEFNK--TTNN----NY 442

Query:   577 SEDTFELIVDAFEKAYF 593
             SED FE ++D  EK  F
Sbjct:   443 SEDIFEYVIDRLEKETF 459

 Score = 48 (22.0 bits), Expect = 0.00052, Sum P(2) = 0.00052
 Identities = 11/25 (44%), Positives = 17/25 (68%)

Query:   618 VLAVYNHWKQKRKQKRA-ALLRVFQ 641
             V AVY++W +KR  +   AL++ FQ
Sbjct:   902 VHAVYDYWVKKRISRSGLALIKRFQ 926


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.320   0.133   0.399    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      758       634   0.00092  120 3  11 22  0.37    34
                                                     36  0.45    37
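
Note on bit scores: the bit values printed next to each raw score in the alignment headers appear to follow the standard conversion bits = (Lambda * S - ln K) / ln 2, using the gapped "As Used" row (Q=9,R=2: Lambda 0.244, K 0.0300) from the table above. The sketch below is only an illustration of that arithmetic, not part of the BLAST output. The function name bit_score is hypothetical, and because Lambda and K are rounded as printed, the results can differ slightly in the last digit.

```python
import math

# Gapped BLOSUM62 parameters (Q=9, R=2) copied from the "As Used" columns
# of the Parameters table. These are the rounded values as printed.
LAMBDA = 0.244
K = 0.0300


def bit_score(raw_score: int) -> float:
    """Convert a raw alignment score S to bits: (Lambda * S - ln K) / ln 2."""
    return (LAMBDA * raw_score - math.log(K)) / math.log(2)


# Raw scores of the top HSP for each of the four reported hits.
for raw in (1165, 412, 127, 130):
    print(raw, round(bit_score(raw), 1))
```

Under these assumptions the computed values should agree, to within rounding, with the bit scores reported for the four top HSPs (415.2, 150.1, 49.8 and 50.8 bits).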


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  4
  No. of states in DFA:  627 (67 KB)
  Total size of DFA:  351 KB (2175 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  53.34u 0.15s 53.49t   Elapsed:  00:00:02
  Total cpu time:  53.34u 0.15s 53.49t   Elapsed:  00:00:02
  Start:  Tue May 21 09:49:06 2013   End:  Tue May 21 09:49:08 2013
