BLAST Results

Query Summary

Your job contains 1 sequence.

Parameters
Threshold: 0.001
Maximum number of alignments shown: 100
BLAST filter: on

Query Sequence

>016335
MSWRRVLKSVQAVAAHSLLLTFTLLLVLKLDHVISYSWWIVFFPVWIFHAVVARGRFSLP
APSVPHNRHWAPCHAIVATPLLIAFELLLCIYLESIYEHGFEAVNLKIVFLPLLAFEITI
LIDNFRMCRALMPGDEESMNDEAIWEALPHFWVAISMVFFVAATVFTLLKLCGYVGALGW
WDLFINFGIAECFAFLVCTKWSNPVIHRSPQTRPATSSSAITYLDWNSGLVVSAEEEQNP
DGMCGLSDIGGHIMKVPVIGFQVLLCMHLEGTPAGARNIALPVLFSPLFLLQGVGVVFST
TRLVEKIVILLRSGAGTGIYFRISSRAHDCFGFLHRGSRLLGWWSIDEGSREDQARLVHE
NSSGLVYILHLSGLGDAQIAWVLILVLFAPL

High Scoring Gene Products

Symbol, full name    Information                          P value
AT1G18470            protein from Arabidopsis thaliana    1.5e-145
AT1G73950            protein from Arabidopsis thaliana    2.2e-144
AT1G68820            protein from Arabidopsis thaliana    9.1e-132


Raw Blast Data

BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]

Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.

Reference:  Gish, W. (1996-2006) http://blast.wustl.edu

Query=  016335
        (391 letters)

Database:  go_20130330-seqdb.fasta
           368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done

                                                                     Smallest
                                                                       Sum
                                                              High  Probability
Sequences producing High-scoring Segment Pairs:              Score  P(N)      N

TAIR|locus:2014089 - symbol:AT1G18470 species:3702 "Arabi...  1422  1.5e-145  1
TAIR|locus:2031471 - symbol:AT1G73950 species:3702 "Arabi...  1411  2.2e-144  1
TAIR|locus:2012453 - symbol:AT1G68820 species:3702 "Arabi...  1292  9.1e-132  1
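Rows of the summary table above can be turned into structured records for downstream filtering. The following is a minimal Python sketch, assuming every row follows the layout shown here (database ID, " - symbol:", "species:", a truncated description, then Score, P(N), and N). The names HIT_LINE and parse_hit_line are hypothetical and not part of any BLAST tool.

```python
import re

# Hypothetical pattern for one row of the "Sequences producing
# High-scoring Segment Pairs" table, as laid out in this report.
HIT_LINE = re.compile(
    r'^(?P<db_id>\S+) - symbol:(?P<symbol>\S+) species:(?P<taxon>\d+) .*?'
    r'\s+(?P<score>\d+)\s+(?P<p_value>[0-9.eE+-]+)\s+(?P<n>\d+)\s*$'
)


def parse_hit_line(line):
    """Return a dict for a summary row, or None if the line is not a row."""
    match = HIT_LINE.match(line)
    if match is None:
        return None
    return {
        "db_id": match.group("db_id"),
        "symbol": match.group("symbol"),
        "taxon": int(match.group("taxon")),
        "score": int(match.group("score")),
        "p_value": float(match.group("p_value")),
        "n": int(match.group("n")),
    }


row = 'TAIR|locus:2014089 - symbol:AT1G18470 species:3702 "Arabi...  1422  1.5e-145  1'
hit = parse_hit_line(row)
print(hit["symbol"], hit["score"], hit["p_value"])
```

Lines that do not match the assumed layout return None, so the function can be applied to every line of the report without pre-filtering.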


>TAIR|locus:2014089
            symbol:AT1G18470 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0005739
            "mitochondrion" evidence=ISM] [GO:0008270 "zinc ion binding"
            evidence=IEA;ISS] [GO:0005829 "cytosol" evidence=RCA]
            InterPro:IPR001841 PROSITE:PS50089 SMART:SM00184 GO:GO:0016021
            EMBL:CP002684 GenomeReviews:CT485782_GR GO:GO:0046872 GO:GO:0008270
            Gene3D:3.30.40.10 InterPro:IPR013083 InterPro:IPR019396
            Pfam:PF10269 EMBL:AY087249 EMBL:BT030383 IPI:IPI00523865
            RefSeq:NP_564052.1 UniGene:At.27535 UniGene:At.41793
            ProteinModelPortal:Q8LBF1 SMR:Q8LBF1 EnsemblPlants:AT1G18470.1
            GeneID:838427 KEGG:ath:AT1G18470 TAIR:At1g18470 eggNOG:NOG129450
            HOGENOM:HOG000005785 InParanoid:Q8LBF1 OMA:ICRIDIE PhylomeDB:Q8LBF1
            ProtClustDB:CLSN2687854 Genevestigator:Q8LBF1 Uniprot:Q8LBF1
        Length = 467

 Score = 1422 (505.6 bits), Expect = 1.5e-145, P = 1.5e-145
 Identities = 264/365 (72%), Positives = 304/365 (83%)

Query:     1 MSWRRVLKSVQAVAAHSXXXXXXXXXXXXXDHVISYSWWIVFFPVWIFHAVVARGRFSLP 60
             MS RRVLKS+QA+AAHS             DH +S SWW+VFFP+W FHAVVARGRFSLP
Sbjct:     1 MSCRRVLKSIQALAAHSLLFCFTLLLVLKLDHTVSSSWWMVFFPLWAFHAVVARGRFSLP 60

Query:    61 APSVPHNRHWAPCHAIVATPLLIAFELLLCIYLESIYEHGFEAVNLKIVFLPLLAFEITI 120
             AP  P NRHWAPCHA+VATPLL+AFELLLCIYLES Y     AV+LKI FLPLLAFE+TI
Sbjct:    61 APVAPRNRHWAPCHAVVATPLLVAFELLLCIYLESSYARWPPAVSLKIAFLPLLAFELTI 120

Query:   121 LIDNFRMCRALMPGDEESMNDEAIWEALPHFWVAISMVFFVAATVFTLLKLCGYVGALGW 180
             L+DN RMCRALMPGD++S+ D+AIWEALPHFWVAISMVF +AAT FTLLKL G V ALGW
Sbjct:   121 LVDNLRMCRALMPGDDDSITDDAIWEALPHFWVAISMVFTLAATFFTLLKLSGDVVALGW 180

Query:   181 WDLFINFGIAECFAFLVCTKWSNPVIHRSPQTRPATSSS-AITYLDWNSGLVVSAEEEQN 239
             WDLFINFGIAECFAFLVCTKWSNPVIHRS + R   SSS +I YLDWNSGLVV+ EE+++
Sbjct:   181 WDLFINFGIAECFAFLVCTKWSNPVIHRSSRARETGSSSTSIRYLDWNSGLVVAPEEDRH 240

Query:   240 PDGMCGLSDIGGHIMKVPVIGFQVLLCMHLEGTPAGARNIALPVLFSPLFLLQGVGVVFS 299
              D  CGL DIGGH++K+PVI FQV+LCM+LEGTP  A++I++PVLFSPLFLLQG+GV+F+
Sbjct:   241 QDRWCGLQDIGGHMLKIPVILFQVVLCMYLEGTPERAKDISIPVLFSPLFLLQGLGVLFA 300

Query:   300 TTRLVEKIVILLRSGAGTGIYFRISSRAHDCFGFLHRGSRLLGWWSIDEGSREDQARLVH 359
              ++L+EKIV+LLR  AG G+YFR SS AHDC GFLH GSRLLGWWSIDEGSRE+QARL  
Sbjct:   301 ASKLLEKIVLLLRGEAGPGLYFRFSSSAHDCLGFLHHGSRLLGWWSIDEGSREEQARLYF 360

Query:   360 ENSSG 364
             +  SG
Sbjct:   361 DQESG 365


>TAIR|locus:2031471
            symbol:AT1G73950 species:3702 "Arabidopsis thaliana"
            [GO:0005634 "nucleus" evidence=ISM] [GO:0008150
            "biological_process" evidence=ND] [GO:0008270 "zinc ion binding"
            evidence=IEA;ISS] [GO:0005773 "vacuole" evidence=IDA]
            InterPro:IPR001841 PROSITE:PS50089 SMART:SM00184 GO:GO:0016021
            EMBL:CP002684 GO:GO:0005773 GO:GO:0046872 GO:GO:0008270
            Gene3D:3.30.40.10 InterPro:IPR013083 InterPro:IPR019396
            Pfam:PF10269 IPI:IPI00529391 RefSeq:NP_177535.2 UniGene:At.28475
            ProteinModelPortal:F4HS21 SMR:F4HS21 PRIDE:F4HS21
            EnsemblPlants:AT1G73950.1 GeneID:843732 KEGG:ath:AT1G73950
            OMA:LAFEVII Uniprot:F4HS21
        Length = 466

 Score = 1411 (501.8 bits), Expect = 2.2e-144, P = 2.2e-144
 Identities = 263/363 (72%), Positives = 298/363 (82%)

Query:     3 WRRVLKSVQAVAAHSXXXXXXXXXXXXXDHVISYSWWIVFFPVWIFHAVVARGRFSLPAP 62
             WR VLKSVQA  AH              DH I+YSWW+V  P+W FHAVVARGRFSLPAP
Sbjct:     4 WR-VLKSVQASVAHCFLFSFTLALVLKLDHSITYSWWVVCLPLWAFHAVVARGRFSLPAP 62

Query:    63 SVPHNRHWAPCHAIVATPLLIAFELLLCIYLESIYEHGFEAVNLKIVFLPLLAFEITILI 122
               P NRHWAPCHAIV+TPLLIAFELLLC+YLE+ Y     AV+LKIVFLPLLAFE+ IL+
Sbjct:    63 IAPRNRHWAPCHAIVSTPLLIAFELLLCVYLETAYADSPPAVSLKIVFLPLLAFEVIILV 122

Query:   123 DNFRMCRALMPGDEESMNDEAIWEALPHFWVAISMVFFVAATVFTLLKLCGYVGALGWWD 182
             DN RMCRALMPGDEES+NDEA+WEALPHFWVAISMVFF+AATVFTLLKL G V ALGWWD
Sbjct:   123 DNARMCRALMPGDEESVNDEAVWEALPHFWVAISMVFFLAATVFTLLKLSGDVAALGWWD 182

Query:   183 LFINFGIAECFAFLVCTKWSNPVIHRSPQTRPATSSSA-ITYLDWNSGLVVSAEEEQNPD 241
             LFINFGIAECFAFLVCTKWSNPVIHRS + R   SSS  I YLDWNSGL V +E+++N D
Sbjct:   183 LFINFGIAECFAFLVCTKWSNPVIHRSSRDRETGSSSTNIRYLDWNSGLGVFSEDDRNQD 242

Query:   242 GMCGLSDIGGHIMKVPVIGFQVLLCMHLEGTPAGARNIALPVLFSPLFLLQGVGVVFSTT 301
               CGL DIGGHIMK+P+I FQV+LCMHLEGTP  A++I++PVLFSPLFLLQGVGV+F+ +
Sbjct:   243 -TCGLQDIGGHIMKIPLIVFQVVLCMHLEGTPEAAKSISVPVLFSPLFLLQGVGVLFAAS 301

Query:   302 RLVEKIVILLRSGAGTGIYFRISSRAHDCFGFLHRGSRLLGWWSIDEGSREDQARLVHEN 361
             +L+EK+V+LLR    TG+YFR  SRAHDC GFLH GSRLLGWWSIDEGSRE++ARL  + 
Sbjct:   302 KLIEKVVLLLRGEDDTGLYFRFLSRAHDCLGFLHHGSRLLGWWSIDEGSREEEARLYFDQ 361

Query:   362 SSG 364
              SG
Sbjct:   362 ESG 364


>TAIR|locus:2012453
            symbol:AT1G68820 species:3702 "Arabidopsis thaliana"
            [GO:0005634 "nucleus" evidence=ISM] [GO:0008270 "zinc ion binding"
            evidence=IEA] InterPro:IPR001841 PROSITE:PS50089 SMART:SM00184
            GO:GO:0016021 EMBL:CP002684 GenomeReviews:CT485782_GR GO:GO:0046872
            GO:GO:0008270 Gene3D:3.30.40.10 InterPro:IPR013083 eggNOG:NOG243347
            InterPro:IPR019396 Pfam:PF10269 HOGENOM:HOG000005785
            ProtClustDB:CLSN2687854 EMBL:AF370299 EMBL:AY063085 IPI:IPI00528108
            RefSeq:NP_564945.1 UniGene:At.17970 ProteinModelPortal:Q94K50
            SMR:Q94K50 PaxDb:Q94K50 PRIDE:Q94K50 EnsemblPlants:AT1G68820.1
            GeneID:843214 KEGG:ath:AT1G68820 TAIR:At1g68820 InParanoid:Q94K50
            OMA:ISMVFFI PhylomeDB:Q94K50 ArrayExpress:Q94K50
            Genevestigator:Q94K50 Uniprot:Q94K50
        Length = 468

 Score = 1292 (459.9 bits), Expect = 9.1e-132, P = 9.1e-132
 Identities = 240/367 (65%), Positives = 288/367 (78%)

Query:     1 MSWRRVLKSVQAVAAHSXXXXXXXXXXXXXDHVISYSWWIVFFPVWIFHAVVARGRFSLP 60
             MSWRRV KS QA +AH              DHV+S+SWW VF P+W+FHAV+ARGRFSLP
Sbjct:     8 MSWRRVWKSFQAASAHCLLFSFTLLLALKLDHVVSHSWWFVFAPLWLFHAVIARGRFSLP 67

Query:    61 APSVPHNRHWAPCHAIVATPLLIAFELLLCIYLESIYEHGFEAVNLKIVFLPLLAFEITI 120
             APS+PH+RHWAP H+++ATPLL+AFE+LLC++LE  Y      V+LKIVFLPLLAFE+ I
Sbjct:    68 APSMPHDRHWAPFHSVMATPLLVAFEILLCVHLEDKY-----VVDLKIVFLPLLAFEVAI 122

Query:   121 LIDNFRMCRALMPGDEESMNDEAIWEALPHFWVAISMVFFVAATVFTLLKLCGYVGALGW 180
             LIDN RMCR LMPGDEE+M+DEAIWE LPHFWV+ISMVFF+AAT FTLLKLCG V ALGW
Sbjct:   123 LIDNVRMCRTLMPGDEETMSDEAIWETLPHFWVSISMVFFIAATTFTLLKLCGDVAALGW 182

Query:   181 WDLFINFGIAECFAFLVCTKWSNPVIHR-SPQTRPATSSSAITYLDWNSGLVVSAEEE-Q 238
             WDLFINFGIAECFAFLVCTKWSN  IHR S    P++SS  + YLDWN GLVV+A++E Q
Sbjct:   183 WDLFINFGIAECFAFLVCTKWSNQSIHRYSHIPEPSSSSMVVRYLDWNRGLVVTADDEHQ 242

Query:   239 NPDGMCGLSDIGGHIMKVPVIGFQVLLCMHLEGTPAGARNIALPVLFSPLFLLQGVGVVF 298
               + +CGL DIGGH+MK+P + FQ++L M LEGTPA A+NI + VLF PLFLLQG GV+F
Sbjct:   243 QSNRICGLQDIGGHVMKIPFVTFQIILFMRLEGTPASAKNIPILVLFVPLFLLQGAGVLF 302

Query:   299 STTRLVEKIVILLRSGAGT-GIYFRISSRAHDCFGFLHRGSRLLGWWSIDEGSREDQARL 357
             +  RLVEK V+L+ SG+G+ G YF  +S A +  GF   G+RLLGWWSIDEGSRE+QARL
Sbjct:   303 AMYRLVEKSVLLINSGSGSYGRYFTATSSAREFLGFFQHGARLLGWWSIDEGSREEQARL 362

Query:   358 VHENSSG 364
                 ++G
Sbjct:   363 YSGEATG 369


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.328   0.141   0.468    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      391       378   0.00089  117 3  11 22  0.43    33
                                                     34  0.45    37
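The bit scores reported for each hit can be related to the raw scores through the Karlin-Altschul parameters in the table above, using the standard conversion bits = (Lambda * S - ln K) / ln 2. The sketch below is a Python illustration, not part of the BLAST output. It assumes the Q=9,R=2 row (Lambda = 0.244, K = 0.0300) is the one that applies to these alignments; that assumption is inferred only from the fact that it reproduces the bit scores printed in this report. The function name bit_score is hypothetical.

```python
import math

# Assumed: "As Used" parameters from the Q=9,R=2 row of this report.
GAPPED_LAMBDA = 0.244
GAPPED_K = 0.0300


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


# Raw scores of the three hits listed in this report.
for raw in (1422, 1411, 1292):
    print(raw, round(bit_score(raw), 1))
# 1422 505.6
# 1411 501.8
# 1292 459.9
```

Because Lambda and K are printed to only three significant figures, agreement to one decimal place is as much as this check can be expected to show.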


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  3
  No. of states in DFA:  619 (66 KB)
  Total size of DFA:  281 KB (2145 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  29.45u 0.10s 29.55t   Elapsed:  00:00:04
  Total cpu time:  29.45u 0.10s 29.55t   Elapsed:  00:00:04
  Start:  Sat May 11 01:14:11 2013   End:  Sat May 11 01:14:15 2013
