BLAST Results

Query Summary

Your job contains 1 sequence.

Parameters
Threshold: 0.001
Maximum number of alignments shown: 100
BLAST filter: on

Query Sequence

>004302
MALRNGCLRRISRFLPHIYSGSHFQQSRRAIINSLASSLITFPRECEQISRNGVNFSFST
IAQASPAESLSQSDTLSFIESTFNEFQGPHHLWFNIVEDNIHFFKRGGAFLVLAGRFVDN
CDSLIAGCGTVVTFEKVKSIQQSFPQLQVIGFLHGCSTISAVDQTRLVEMLMKEYITFPI
LLSNKNFPQMENGACYLLSKDFGNARVFHENSLDIGMLNKAVEELIMQQQENSSSPSGLK
CTWAKQAEVLKEPHACSSVRNLLLHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILD
CIGSCPGFEDGEFESSKLMRPAASFYHKDDDCLYIVDSENHAIRRADMGRRVLETVYPTS
GISKKNNSLWAWIMEKLGFERDNDTKSEKLDPQSLIFPWHLMKSEDDNLLIINRSFETLW
IMDLASGEIKEAVKGFSKVLEICGVLVMEKVFLLKQMPQDWLLHQIDSSCSLKELPYAGL
ISSSIAFQNHILLCDIVGQRIMRLNRESGVCSNFQFSNFAILGLPYWFAFPLERVYAVAG
GHQGSWTDHIQRCSLLPGRIDIKVNVDIPSDTELVESLQEGCIWRQARGTASVVLRAEDV
AGSLEKVGVAQLWYDELDTLALSTPESESNIEDETTTSDLRSEDDTVHIDCAVNTSPGTS
EVIISAALYLKLRRYPDQQDDGREKYAARISDILKLGRSGAMQRDSFIRFLLKSNQDLRD
VIFVKPLHVSIQFDTLDHPKADNSKDIILTDSNMEVDVSLNT

High Scoring Gene Products

Symbol, full name Information P value
emb1974
AT3G07060
protein from Arabidopsis thaliana 4.7e-183
AT1G56500 protein from Arabidopsis thaliana 9.2e-11
Nhlrc2
NHL repeat containing 2
protein from Mus musculus 1.5e-08

Back to top

Raw Blast Data

BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]

Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.

Reference:  Gish, W. (1996-2006) http://blast.wustl.edu

Query=  004302
        (762 letters)

Database:  go_20130330-seqdb.fasta
           368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done

                                                                     Smallest
                                                                       Sum
                                                              High  Probability
Sequences producing High-scoring Segment Pairs:              Score  P(N)      N

TAIR|locus:2077597 - symbol:emb1974 "AT3G07060" species:3...  1776  4.7e-183  1
TAIR|locus:2010728 - symbol:AT1G56500 species:3702 "Arabi...   202  9.2e-11   2
MGI|MGI:1914116 - symbol:Nhlrc2 "NHL repeat containing 2"...   166  1.5e-08   1


>TAIR|locus:2077597 [details] [associations]
            symbol:emb1974 "AT3G07060" species:3702 "Arabidopsis
            thaliana" [GO:0003674 "molecular_function" evidence=ND] [GO:0009507
            "chloroplast" evidence=ISM] [GO:0009793 "embryo development ending
            in seed dormancy" evidence=NAS] [GO:0006346 "methylation-dependent
            chromatin silencing" evidence=RCA] [GO:0031048 "chromatin silencing
            by small RNA" evidence=RCA] [GO:0051567 "histone H3-K9 methylation"
            evidence=RCA] EMBL:CP002686 Gene3D:2.120.10.30 InterPro:IPR011042
            IPI:IPI00549069 RefSeq:NP_187362.3 UniGene:At.53210
            ProteinModelPortal:F4JD91 SMR:F4JD91 PRIDE:F4JD91
            EnsemblPlants:AT3G07060.1 GeneID:819892 KEGG:ath:AT3G07060
            OMA:ILGLPYW Uniprot:F4JD91
        Length = 774

 Score = 1776 (630.2 bits), Expect = 4.7e-183, P = 4.7e-183
 Identities = 371/783 (47%), Positives = 512/783 (65%)

Query:     1 MALRNGCLRRISRFLPHIYSGSHFQQSRRAIINS---LASSL-----ITFPRECEQISRN 52
             M+ R+  L++IS     I S +   + RR+I      LA SL     I      EQ    
Sbjct:     1 MSNRSLHLKKISWLSSRILSDNVHGRFRRSITTPATCLAPSLDGDMNIGSKTLVEQRFSR 60

Query:    53 GVNFSFSTIAQASPAESLSQS----DTLSFIESTFNEFQGPHHLWFNIVEDNIHFFKRGG 108
             G   S   +++AS + S S S    D LSFI+++ ++ +GP H W N    N   FK  G
Sbjct:    61 GFA-SVRRVSRASSSSSSSPSSPHVDLLSFIKASLDKLEGPSHHWLNRDFGNKQLFKDKG 119

Query:   109 AFLVLAGRFVDNCDSLIAGCGTVVTFEKVKSIQQSFPQLQVIGFLHGCSTISAVDQTRLV 168
              ++VLAG  +D    L +G      FEK+K +QQ  P +  +G         A D+T L 
Sbjct:   120 TYVVLAGHLLDGTSDL-SGF-----FEKLKLLQQRSPGVCFMGIHFSDQARIADDRTALA 173

Query:   169 EMLMKEYITFPILLSNKNFPQMENGACYLLSKDFGNARVFHENSLDIGMLNKAVEELIMQ 228
             E+++KEY+TFP+LLS K FP+      Y++ KDF N  ++ E  LDI  + KA++ L+ Q
Sbjct:   174 ELILKEYLTFPVLLSEKEFPKTSGEVRYIVFKDFKNPLIYEEKDLDIASVVKALDSLLTQ 233

Query:   229 QQENSSSPSGLKCTWAKQAEVLKEPHACSSVRNLLLHFPGCISADESGNRLFLSDSNHHR 288
               E S S      TW+KQAE +KE H  S  ++LLL+FPGCISADE G+RLFLSD+NHHR
Sbjct:   234 DTEKSKSVRLFTNTWSKQAEAIKESHFPSFFQDLLLYFPGCISADEVGDRLFLSDTNHHR 293

Query:   289 IIVFDGNGKILDCIGSCPGFEDGEFESSKLMRPAASFYHKDDDCLYIVDSENHAIRRADM 348
             II+F+ +GKI+D IG  PGFEDG+FES+K++RP  + Y + +DCLYIVDSENHAIRRA++
Sbjct:   294 IIIFENSGKIVDSIGCFPGFEDGDFESAKMLRPTGTLYDEAEDCLYIVDSENHAIRRANI 353

Query:   349 GRRVLETVYPTSGISKKNNSLWAWIMEKLGFERDNDT------KSEKLDPQSLIFPWHLM 402
               RVLETVYP   + KK   LW+WIMEK+G  +D+DT      KSE+ D +SL+FPWH++
Sbjct:   354 NSRVLETVYPK--VIKKTGGLWSWIMEKMGLGKDDDTTVDADTKSEEFDARSLLFPWHIL 411

Query:   403 KSEDDNLLIINRSFETLWIMDLASGEIKEAVKGFSKVLEICGVLVMEKVFLLKQMPQDWL 462
             K +D++LL+IN+SF  LWI++ ASGEI+E V+GFSK++EICG  + EK+ +L+ MP +WL
Sbjct:   412 KRDDESLLVINKSFSKLWIINFASGEIEEVVEGFSKIIEICGQSITEKLSVLEHMPSNWL 471

Query:   463 LHQIDSSCSLKELPYAGLISSSIAFQNHILLCDIVGQRIMRLNRESGVCSNFQFSNFAIL 522
               Q  +  S KE P A L+SS     + I++ DI  QR+++LNR+SG CS+ QFSN  IL
Sbjct:   472 QQQTAAIASFKEQPSASLLSSFTKLGDDIVMTDIACQRVLKLNRDSGACSSIQFSNSGIL 531

Query:   523 GLPYWFAFPLERVYAVAGGHQGSWTDHIQRCSLLPGRIDIKVNVDIPSDTELVESLQEGC 582
             GLPYW   PLERV+ +A G Q +   H Q   LLPG+I I++N++IP  TELVE +QE C
Sbjct:   532 GLPYWLFIPLERVFNLANGVQEAHLSHTQELRLLPGKISIRLNIEIPPCTELVEPIQESC 591

Query:   583 IWRQARGTASVVLRAEDVAGSLEKVGVAQLWYDELDTLA--LSTPES-ESNIEDETTTSD 639
             IWRQ RG  S    A       EK+GV+Q WYDELD+LA  ++ PE+ E   E++   S+
Sbjct:   592 IWRQTRGAISEFSSAGSAVEPSEKIGVSQQWYDELDSLAKEIANPEAAEEEEEEDVNPSE 651

Query:   640 L-RSEDDTVHIDCAVNTSPGTSEVIISAALYLKLRRYPDQQDDGREKYAARISDILKLGR 698
             + R ED  +HIDC V TSPG+SE+I+ AALYL+L R  + +   +E+ A +I+ ILK  R
Sbjct:   652 VDREEDGRIHIDCPVKTSPGSSELIVYAALYLRLARNEETESATQEELARKIAKILKPVR 711

Query:   699 S-GAMQRDSFIRFLLKSNQDLRDVIFVKPLHVSIQFDTLDHPKADNSKDIILTDSNMEVD 757
             +   M+ D F+  L KS ++LRD++F+KP+HV I+ D+ DHPKADNS+D+ILTDS++EVD
Sbjct:   712 NITTMKEDLFVNLLSKSKRELRDIVFIKPMHVRIRLDSKDHPKADNSRDVILTDSSVEVD 771

Query:   758 VSL 760
             VSL
Sbjct:   772 VSL 774


>TAIR|locus:2010728 [details] [associations]
            symbol:AT1G56500 species:3702 "Arabidopsis thaliana"
            [GO:0003824 "catalytic activity" evidence=IEA] [GO:0008152
            "metabolic process" evidence=IEA] [GO:0016491 "oxidoreductase
            activity" evidence=IEA] [GO:0016787 "hydrolase activity"
            evidence=IEA;ISS] [GO:0009570 "chloroplast stroma" evidence=IDA]
            [GO:0009507 "chloroplast" evidence=IDA] [GO:0009534 "chloroplast
            thylakoid" evidence=IDA] [GO:0000023 "maltose metabolic process"
            evidence=RCA] [GO:0009773 "photosynthetic electron transport in
            photosystem I" evidence=RCA] [GO:0009902 "chloroplast relocation"
            evidence=RCA] [GO:0010027 "thylakoid membrane organization"
            evidence=RCA] [GO:0016117 "carotenoid biosynthetic process"
            evidence=RCA] [GO:0019252 "starch biosynthetic process"
            evidence=RCA] [GO:0019288 "isopentenyl diphosphate biosynthetic
            process, mevalonate-independent pathway" evidence=RCA] [GO:0034660
            "ncRNA metabolic process" evidence=RCA] InterPro:IPR001258
            InterPro:IPR005833 InterPro:IPR006402 Pfam:PF01436 PRINTS:PR00413
            InterPro:IPR000033 EMBL:CP002684 GO:GO:0009570 Gene3D:3.40.50.1000
            InterPro:IPR023214 SUPFAM:SSF56784 Gene3D:2.120.10.30
            InterPro:IPR011042 SMART:SM00135 Gene3D:3.40.30.10
            InterPro:IPR012336 SUPFAM:SSF52833 GO:GO:0016787 Pfam:PF13419
            TIGRFAMs:TIGR01509 PROSITE:PS51352 GO:GO:0009534 EMBL:AY065399
            IPI:IPI00547570 RefSeq:NP_564718.2 UniGene:At.28196
            ProteinModelPortal:Q8VZ10 SMR:Q8VZ10 STRING:Q8VZ10 PRIDE:Q8VZ10
            EnsemblPlants:AT1G56500.1 GeneID:842103 KEGG:ath:AT1G56500
            TAIR:At1g56500 HOGENOM:HOG000030168 InParanoid:Q8VZ10 OMA:VCLYQSV
            PhylomeDB:Q8VZ10 ProtClustDB:PLN02919 ArrayExpress:Q8VZ10
            Genevestigator:Q8VZ10 Uniprot:Q8VZ10
        Length = 1055

 Score = 202 (76.2 bits), Expect = 9.2e-11, Sum P(2) = 9.2e-11
 Identities = 42/95 (44%), Positives = 57/95 (60%)

Query:   264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSC--PGFEDGEFESSKLMRP 321
             L FPG ++ D   NRLF+SDSNH+RIIV D  G  +  IGS    GF+DG FE +   RP
Sbjct:   566 LKFPGKLAIDTLNNRLFISDSNHNRIIVTDLEGNFIVQIGSSGEEGFQDGSFEDAAFNRP 625

Query:   322 AASFYHKDDDCLYIVDSENHAIRRADMGRRVLETV 356
                 Y+   + LY+ D+ENHA+R  D     ++T+
Sbjct:   626 QGLAYNAKKNLLYVADTENHALREIDFVNERVQTL 660

 Score = 37 (18.1 bits), Expect = 9.2e-11, Sum P(2) = 9.2e-11
 Identities = 19/70 (27%), Positives = 30/70 (42%)

Query:   376 KLGFERDNDTKSEKLDPQSLIFPWHLMKSEDDNLLIINRSFETLWIMDLASGEIKEAVKG 435
             K GF +D   K  +L       P  L  +E+  L + + +   +  +DL  GE  E +  
Sbjct:   844 KAGF-KDGKVKGAQLSE-----PAGLAITENGRLFVADTNNSLIRYIDLNKGEDSEIL-- 895

Query:   436 FSKVLEICGV 445
                 LE+ GV
Sbjct:   896 ---TLELKGV 902


>MGI|MGI:1914116 [details] [associations]
            symbol:Nhlrc2 "NHL repeat containing 2" species:10090 "Mus
            musculus" [GO:0003674 "molecular_function" evidence=ND] [GO:0005575
            "cellular_component" evidence=ND] [GO:0008150 "biological_process"
            evidence=ND] InterPro:IPR001258 Pfam:PF01436 InterPro:IPR000033
            MGI:MGI:1914116 Gene3D:2.120.10.30 InterPro:IPR011042 SMART:SM00135
            Gene3D:3.40.30.10 InterPro:IPR012336 SUPFAM:SSF52833
            PROSITE:PS51352 PROSITE:PS51125 eggNOG:COG0526 EMBL:AK033384
            EMBL:BC041104 EMBL:BC095956 IPI:IPI00227730 RefSeq:NP_080087.1
            UniGene:Mm.392509 UniGene:Mm.46346 ProteinModelPortal:Q8BZW8
            SMR:Q8BZW8 PhosphoSite:Q8BZW8 PaxDb:Q8BZW8 PRIDE:Q8BZW8
            Ensembl:ENSMUST00000071423 GeneID:66866 KEGG:mmu:66866 CTD:374354
            GeneTree:ENSGT00390000015483 HOVERGEN:HBG059736 InParanoid:Q8BZW8
            OMA:SCLFVAD OrthoDB:EOG4W6NVH NextBio:322863 Bgee:Q8BZW8
            CleanEx:MM_NHLRC2 Genevestigator:Q8BZW8 Uniprot:Q8BZW8
        Length = 725

 Score = 166 (63.5 bits), Expect = 1.5e-08, P = 1.5e-08
 Identities = 36/94 (38%), Positives = 52/94 (55%)

Query:   264 LHFPGCISADESGNRLFLSDSNHHRIIVFDGNGKILDCIGSC-PGFEDGEFESSKLMRPA 322
             L FPG ++ D +  RL ++D+ HHRI+V   NG+I   IG   PG +DG F  S    P 
Sbjct:   223 LLFPGKVAVDHATGRLVVADTGHHRILVIQKNGRIQSSIGGPNPGRKDGMFSESSFNSPQ 282

Query:   323 ASFYHKDDDCLYIVDSENHAIRRADMGRRVLETV 356
                    D+ +Y+ D+ENH IR+ D+    + TV
Sbjct:   283 GVAIA--DNVIYVADTENHLIRKIDLEAEKVTTV 314


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.321   0.136   0.405    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      762       762   0.00091  121 3  11 22  0.37    34
                                                     37  0.45    37


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  3
  No. of states in DFA:  621 (66 KB)
  Total size of DFA:  400 KB (2194 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  63.83u 0.16s 63.99t   Elapsed:  00:00:03
  Total cpu time:  63.83u 0.16s 63.99t   Elapsed:  00:00:03
  Start:  Tue May 21 00:44:11 2013   End:  Tue May 21 00:44:14 2013

Back to top