BLAST Results

Query Summary

Your job contains 1 sequence.

Parameters
Threshold: 0.001
Maximum number of alignments shown: 100
BLAST filter: on

Query Sequence

>014112
MDSPQSVVSPFKSSVAAEPEKHKSDFFARSGSLTKGTEANRKEAVMSNLGGNFIGVLEVY
IHQARDIHNICIYHKQDVYAKLCLTSDPENTVSTNIINGGGRNPVFNENLKLNVKTVESS
LKCEIFMMSRVKNYLEDQLLGFTLVPLSEVLVKNGKLEKEFSLSSTDLFHSPAGFVQLSL
AYAGASPDVMAIPAVPKPLAADETAQESEISESLDRIEFPDPKIVNENQMMVSEYFGISC
SNMDTETSESLVSSDARNQVSSEIRAPVVESFSTATVESVQHPKLDSPPSSVSTNGVSSP
SVAASSDSSDSPVVSKPQNQEQEPPSKEKKVDVGEGESDSSGGVLSDAINKPVVSVNIEP
EQKVVQQDIVDMYMKSMQQFTESLAKMKLPLDIDSGPPSSTSSGNSSTDQKLQASKNTGS
RVFYGSRAFF

High Scoring Gene Products

Symbol, full name    Information                          P value
AT5G55530            protein from Arabidopsis thaliana    4.2e-115
AT1G50570            protein from Arabidopsis thaliana    2.5e-74
AT5G12300            protein from Arabidopsis thaliana    3.3e-59
SYTC, AT5G04220      protein from Arabidopsis thaliana    0.00042


Raw BLAST Data

BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]

Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.

Reference:  Gish, W. (1996-2006) http://blast.wustl.edu

Query=  014112
        (430 letters)

Database:  go_20130330-seqdb.fasta
           368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done

                                                                     Smallest
                                                                       Sum
                                                              High  Probability
Sequences producing High-scoring Segment Pairs:              Score  P(N)      N

TAIR|locus:2173937 - symbol:AT5G55530 "AT5G55530" species...   913  4.2e-115  2
TAIR|locus:2008011 - symbol:AT1G50570 "AT1G50570" species...   750  2.5e-74   1
TAIR|locus:505006598 - symbol:AT5G12300 "AT5G12300" speci...   474  3.3e-59   2
TAIR|locus:2146688 - symbol:SYTC "AT5G04220" species:3702...   109  0.00042   2
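The summary lines above follow a regular layout (description, high score, P(N), N), so they can be pulled into records for filtering or sorting. The following is a minimal sketch, assuming the three numeric columns are always the last whitespace-separated fields on the line; the function name `parse_summary` and the record keys are illustrative and not part of WU-BLAST.

```python
import re

# One summary line: free-text description, then high score, P(N), and N.
# Assumption: the last three whitespace-separated fields are always numeric.
SUMMARY_RE = re.compile(
    r"^(?P<desc>\S.*?)\s+(?P<score>\d+)\s+"
    r"(?P<p>\d[\d.]*(?:e[-+]?\d+)?)\s+(?P<n>\d+)\s*$",
    re.IGNORECASE,
)
SYMBOL_RE = re.compile(r"symbol:(\S+)")


def parse_summary(lines):
    """Return one dict per line that looks like a hit summary; skip the rest."""
    hits = []
    for line in lines:
        match = SUMMARY_RE.match(line)
        if match is None:
            continue  # header or blank line
        symbol = SYMBOL_RE.search(match.group("desc"))
        hits.append({
            "symbol": symbol.group(1) if symbol else None,
            "score": int(match.group("score")),
            "p": float(match.group("p")),
            "n": int(match.group("n")),
        })
    return hits


# The header and four summary lines copied from this report.
summary_lines = [
    'Sequences producing High-scoring Segment Pairs:              Score  P(N)      N',
    'TAIR|locus:2173937 - symbol:AT5G55530 "AT5G55530" species...   913  4.2e-115  2',
    'TAIR|locus:2008011 - symbol:AT1G50570 "AT1G50570" species...   750  2.5e-74   1',
    'TAIR|locus:505006598 - symbol:AT5G12300 "AT5G12300" speci...   474  3.3e-59   2',
    'TAIR|locus:2146688 - symbol:SYTC "AT5G04220" species:3702...   109  0.00042   2',
]

hits = parse_summary(summary_lines)
for hit in hits:
    print(hit["symbol"], hit["score"], hit["p"], hit["n"])
```

The header line is skipped because it contains no numeric columns, so the parser returns exactly the four hits reported under "# of database sequences satisfying E".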


>TAIR|locus:2173937
            symbol:AT5G55530 "AT5G55530" species:3702 "Arabidopsis
            thaliana" [GO:0003674 "molecular_function" evidence=ND] [GO:0005634
            "nucleus" evidence=ISM] [GO:0008150 "biological_process"
            evidence=ND] InterPro:IPR000008 InterPro:IPR008973 Pfam:PF00168
            SMART:SM00239 EMBL:CP002688 SUPFAM:SSF49562 EMBL:BT020521
            EMBL:BT021091 IPI:IPI00542957 RefSeq:NP_200364.2 RefSeq:NP_974935.1
            RefSeq:NP_974936.1 UniGene:At.22993 ProteinModelPortal:Q5HZ03
            PaxDb:Q5HZ03 PRIDE:Q5HZ03 EnsemblPlants:AT5G55530.1
            EnsemblPlants:AT5G55530.2 EnsemblPlants:AT5G55530.3 GeneID:835647
            KEGG:ath:AT5G55530 eggNOG:NOG251891 OMA:EYFGISC
            ProtClustDB:CLSN2688646 Genevestigator:Q5HZ03 Uniprot:Q5HZ03
        Length = 405

 Score = 913 (326.5 bits), Expect = 4.2e-115, Sum P(2) = 4.2e-115
 Identities = 178/269 (66%), Positives = 220/269 (81%)

Query:     1 MDSPQSVVSPFKSSVAAEPEKHKSDFFARSGSLTKGTEANRKEAVMSNLGG-NFIGVLEV 59
             MDSPQSVVSPFK     E E   S+    SG+ + G  +N K++   + G  + +G LEV
Sbjct:     1 MDSPQSVVSPFK---IGESENENSNSVQSSGNQSNGINSNGKDS--KSCGRQDLVGALEV 55

Query:    60 YIHQARDIHNICIYHKQDVYAKLCLTSDPENTVSTNIINGGGRNPVFNENLKLNVKTVES 119
             Y+HQARDIHNICIYHKQDVYAKLCLTSDP+ +VST IINGGGRNPVF++N+KL+V+ +++
Sbjct:    56 YVHQARDIHNICIYHKQDVYAKLCLTSDPDKSVSTKIINGGGRNPVFDDNVKLDVRVLDT 115

Query:   120 SLKCEIFMMSRVKNYLEDQLLGFTLVPLSEVLVKNGKLEKEFSLSSTDLFHSPAGFVQLS 179
             SLKCEI+MMSRVKNYLEDQLLGFTLVP+SE+L KNGKLEKEFSLSSTDL+HSPAGFVQLS
Sbjct:   116 SLKCEIYMMSRVKNYLEDQLLGFTLVPMSELLFKNGKLEKEFSLSSTDLYHSPAGFVQLS 175

Query:   180 LAYAGASPDVMAIPAVPKPLAADETAQESEISES----LDRIEFPDPKIVNENQMMVSEY 235
             L+Y G+ PDVMAIP++P  ++ DET ++ E SES    LD+IEFPDP + NEN+ MVSEY
Sbjct:   176 LSYYGSYPDVMAIPSMPSSVSIDETTKDPEGSESVPGELDKIEFPDPNVANENEKMVSEY 235

Query:   236 FGISCSNMDTETSESLVSSDARNQVSSEI 264
             FGISCS +D+ETS+SLV+SDA N V++ +
Sbjct:   236 FGISCSTIDSETSDSLVTSDAENHVTNSV 264

 Score = 242 (90.2 bits), Expect = 4.2e-115, Sum P(2) = 4.2e-115
 Identities = 53/81 (65%), Positives = 57/81 (70%)

Query:   351 KPVVSVNIEPEQKVVQQDIVDMYMKSMQQFTESLAKMKLPLDIDXXXXXXXXXXXXXXDQ 410
             K V++V +EPE KVVQQDIVDMYMKSMQQFT+SLAKMKLPLDID               Q
Sbjct:   328 KSVLTVKVEPESKVVQQDIVDMYMKSMQQFTDSLAKMKLPLDIDSPTKSENSSSDS---Q 384

Query:   411 KLQASK-NTGSRVFYGSRAFF 430
             KL   K N GSRVFYGSR FF
Sbjct:   385 KLPTPKSNNGSRVFYGSRPFF 405


>TAIR|locus:2008011
            symbol:AT1G50570 "AT1G50570" species:3702 "Arabidopsis
            thaliana" [GO:0003674 "molecular_function" evidence=ND] [GO:0005634
            "nucleus" evidence=ISM] [GO:0008150 "biological_process"
            evidence=ND] InterPro:IPR000008 InterPro:IPR008973 Pfam:PF00168
            SMART:SM00239 EMBL:CP002684 SUPFAM:SSF49562 EMBL:AC012561
            UniGene:At.28017 UniGene:At.48298 EMBL:BT028972 EMBL:AK228967
            IPI:IPI00536660 RefSeq:NP_001031166.1 RefSeq:NP_564576.1
            ProteinModelPortal:Q9LPS7 SMR:Q9LPS7 PRIDE:Q9LPS7
            EnsemblPlants:AT1G50570.1 EnsemblPlants:AT1G50570.2 GeneID:841478
            KEGG:ath:AT1G50570 TAIR:At1g50570 InParanoid:Q9LPS7 OMA:DDFASSE
            PhylomeDB:Q9LPS7 ProtClustDB:CLSN2917220 Genevestigator:Q9LPS7
            Uniprot:Q9LPS7
        Length = 388

 Score = 750 (269.1 bits), Expect = 2.5e-74, P = 2.5e-74
 Identities = 154/253 (60%), Positives = 191/253 (75%)

Query:    30 SGSL-TKGT-EANRKEAVMSNLGGNFIGVLEVYIHQARDIHNICIYHKQDVYAKLCLTSD 87
             +GS+   G+ E   K  VMS+   +FIGVLEV++HQARDIHNICIYHKQDVYAKLCLT+D
Sbjct:    12 NGSIHLNGSGETKTKNIVMSSDSDSFIGVLEVFVHQARDIHNICIYHKQDVYAKLCLTND 71

Query:    88 PENTVSTNIINGGGRNPVFNENLKLNVKTVESSLKCEIFMMSRVKNYLEDQLLGFTLVPL 147
             PEN++ST IINGGG+NPVF++ L+ +VK ++ SLKCEIFMMSRVKNYLEDQLLGF+LVPL
Sbjct:    72 PENSLSTKIINGGGQNPVFDDTLQFDVKNLDCSLKCEIFMMSRVKNYLEDQLLGFSLVPL 131

Query:   148 SEVLVKNGKLEKEFSLSSTDLFHSPAGFVQLSLAYAGASPDVMAIPAVPKPLAADETAQE 207
             SEV+V+NGKLEKEFSLSSTDL+HSPAGFV+LSL+YAG SPDVM IPAVP       TA E
Sbjct:   132 SEVIVRNGKLEKEFSLSSTDLYHSPAGFVELSLSYAGDSPDVMHIPAVP-------TADE 184

Query:   208 SEISE-SLDRIEFPDPKIVNENQMMVSEYFGISCSNMDTETSESLVSSDARNQVSSEIRA 266
             +E++    D  EF DPKIV EN  MVS+YF  +CS+ D   S      +  + +S+ +  
Sbjct:   185 TELAPIEFDESEFLDPKIVCENNQMVSKYFSTTCSDSDDFASSETGFVEVNSILSAVVET 244

Query:   267 PVVESFSTATVES 279
              V E+    +V +
Sbjct:   245 AVDEAAPANSVST 257

 Score = 320 (117.7 bits), Expect = 9.1e-29, P = 9.1e-29
 Identities = 93/233 (39%), Positives = 113/233 (48%)

Query:   199 LAADETAQESEISE-SLDRIEFPDPKIVNENQMMVSEYFGISCSNMDTETSESLVSSDAR 257
             + A  TA E+E++    D  EF DPKIV EN  MVS+YF  +CS+ D   S    S    
Sbjct:   176 IPAVPTADETELAPIEFDESEFLDPKIVCENNQMVSKYFSTTCSDSDDFAS----SETGF 231

Query:   258 NQVSSEIRAPVVESFSTATVESVQHPKLDXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKP 317
              +V+S + A VVE   TA  E+   P  +                             + 
Sbjct:   232 VEVNSILSA-VVE---TAVDEAA--PA-NSVSTNGISSPSTAVSSGSSGTHDVSKQSSEG 284

Query:   318 QNQEQEPPSKEKKXXXXXXXXXXXXXXXXXAINKPVVSVNIEPEQKVVQQDIVDMYMKSM 377
              N + E   +E K                 A+ KPV++VNIEPEQKVVQQDIVDMY KS+
Sbjct:   285 NNSDSE---QEAKKPTDIIKSGDLDKTDEEAVVKPVLTVNIEPEQKVVQQDIVDMYTKSL 341

Query:   378 QQFTESLAKMKLPLDIDXXXXXXXXXXXXXXDQKLQASKNTGSRVFYGSRAFF 430
             QQFTESLAKMKLPLDID                  Q  K+  SRVFYGSRAFF
Sbjct:   342 QQFTESLAKMKLPLDIDSPTQSENSSSSQ------QTPKSASSRVFYGSRAFF 388


>TAIR|locus:505006598
            symbol:AT5G12300 "AT5G12300" species:3702 "Arabidopsis
            thaliana" [GO:0003674 "molecular_function" evidence=ND] [GO:0005575
            "cellular_component" evidence=ND] [GO:0008150 "biological_process"
            evidence=ND] InterPro:IPR000008 InterPro:IPR008973 Pfam:PF00168
            SMART:SM00239 EMBL:CP002688 GenomeReviews:BA000015_GR
            SUPFAM:SSF49562 EMBL:AL592312 EMBL:BT012657 EMBL:AK226641
            IPI:IPI00529322 RefSeq:NP_568263.1 UniGene:At.49416
            ProteinModelPortal:Q94CL2 SMR:Q94CL2 EnsemblPlants:AT5G12300.1
            GeneID:831105 KEGG:ath:AT5G12300 TAIR:At5g12300 eggNOG:NOG321261
            HOGENOM:HOG000238664 InParanoid:Q94CL2 OMA:QMVSEYF PhylomeDB:Q94CL2
            ProtClustDB:CLSN2917613 Genevestigator:Q94CL2 Uniprot:Q94CL2
        Length = 374

 Score = 474 (171.9 bits), Expect = 3.3e-59, Sum P(2) = 3.3e-59
 Identities = 93/193 (48%), Positives = 140/193 (72%)

Query:    53 FIGVLEVYIHQARDIHNICIYHKQDVYAKLCLTSDPENTVSTNIINGGGRNPVFNENLKL 112
             F GVL+VY+H AR+I+NICIY  QDVYAK  LT +P++T+ST II+  G+NP FN+ L +
Sbjct:    19 FSGVLQVYVHNARNINNICIYDNQDVYAKFSLTYNPDDTISTRIIHRAGKNPEFNQKLMI 78

Query:   113 NVKTVESS---LKCEIFMMSRVKNYLEDQLLGFTLVPLSEVLVKNGKLEKEFSLSSTDLF 169
             +V  +++    LKCEI+MMSR ++Y+EDQLLGF LVP+S+++ ++  + +++SLSSTDLF
Sbjct:    79 DVTQIDAHAAVLKCEIWMMSRARHYMEDQLLGFALVPISDIIGQDS-VTQDYSLSSTDLF 137

Query:   170 HSPAGFVQLSLAYAGASPDVMAIPAV-PKPLAADETAQESEISESLD--RIEFPDPKIVN 226
             HSPAG V+L+L+    S    + P +    ++++    + ++SE++D  RIEFPD  + N
Sbjct:   138 HSPAGTVKLTLSIVNPSSTSSSNPKINTTSISSEVVLLDPQVSETVDYTRIEFPDINVAN 197

Query:   227 ENQMMVSEYFGIS 239
             EN+ MV+EYF  S
Sbjct:   198 ENKQMVTEYFNES 210

 Score = 151 (58.2 bits), Expect = 3.3e-59, Sum P(2) = 3.3e-59
 Identities = 37/82 (45%), Positives = 47/82 (57%)

Query:   361 EQKVVQQDIVDMYMKSMQQFTESLAKMKLPLDI------DXXXXXXXXXXXXXXDQKLQA 414
             E+  +Q+ I +MYM+SMQQFTESLAKMKLP+D+      +              +Q   A
Sbjct:   293 EETTMQKQIAEMYMRSMQQFTESLAKMKLPMDLHNKPHEEDHSNNNNNTATQIQNQNNNA 352

Query:   415 SKN------TGSRVFYGSRAFF 430
             + N       GSRVFYGSRAFF
Sbjct:   353 NNNGMEKKKEGSRVFYGSRAFF 374


>TAIR|locus:2146688
            symbol:SYTC "AT5G04220" species:3702 "Arabidopsis
            thaliana" [GO:0003674 "molecular_function" evidence=ND] [GO:0005886
            "plasma membrane" evidence=ISM] [GO:0008150 "biological_process"
            evidence=ND] [GO:0016126 "sterol biosynthetic process"
            evidence=RCA] [GO:0019745 "pentacyclic triterpenoid biosynthetic
            process" evidence=RCA] InterPro:IPR000008 InterPro:IPR008973
            Pfam:PF00168 SMART:SM00239 GO:GO:0016021 EMBL:CP002688
            GenomeReviews:BA000015_GR GO:GO:0046872 InterPro:IPR018029
            SUPFAM:SSF49562 PROSITE:PS50004 GO:GO:0008289 HSSP:P21707
            eggNOG:COG5038 EMBL:AL391716 InterPro:IPR020477 PRINTS:PR00360
            HOGENOM:HOG000233385 ProtClustDB:CLSN2688294 EMBL:AB102952
            EMBL:FM213367 EMBL:FM213368 IPI:IPI00538497 RefSeq:NP_568135.1
            RefSeq:NP_974729.1 UniGene:At.10090 ProteinModelPortal:Q7XA06
            SMR:Q7XA06 PRIDE:Q7XA06 EnsemblPlants:AT5G04220.2 GeneID:830301
            KEGG:ath:AT5G04220 TAIR:At5g04220 InParanoid:Q7XA06 OMA:YETNEKE
            PhylomeDB:Q7XA06 Genevestigator:Q7XA06 Uniprot:Q7XA06
        Length = 540

 Score = 109 (43.4 bits), Expect = 0.00042, Sum P(2) = 0.00042
 Identities = 37/111 (33%), Positives = 56/111 (50%)

Query:    54 IGVLEVYIHQARDIHNICIYHKQDVYAKLCLTSDPENTVSTNIINGGGRNPVFNENLKLN 113
             +G+L V I +AR++    +    D Y KL LT +      T I      NP +NE+ KL 
Sbjct:   260 VGLLHVSILRARNLLKKDLLGTSDPYVKLSLTGEKLPAKKTTI-KKRNLNPEWNEHFKLI 318

Query:   114 VKTVESS-LKCEIFMMSRVKNYLEDQLLGFTLVPLSEVLVKNGKLEKEFSL 163
             VK   S  L+ E+F   +V  +  D+L G  ++PL ++   N    KEF+L
Sbjct:   319 VKDPNSQVLQLEVFDWDKVGGH--DRL-GMQMIPLQKI---NPGERKEFNL 363

 Score = 53 (23.7 bits), Expect = 0.00042, Sum P(2) = 0.00042
 Identities = 27/90 (30%), Positives = 36/90 (40%)

Query:   145 VPLSEVLVKNGKLEKEFSLSSTDLFHSPAGFVQLSLAYAGASPDVMAIPAVPKPLAADET 204
             VP  E  +K  K  +E   S  D F S AG   LS+A   A  DV        P A    
Sbjct:   391 VPFREESIKRRKESREEKSSEDDDFLSQAGL--LSVAVQSAK-DVEGKKKHSNPYAVVLF 447

Query:   205 AQESEISESLDRIEFPDPKIVNENQMMVSE 234
               E + ++ L +    DP+   E Q  + E
Sbjct:   448 RGEKKKTKMLKKTR--DPRWNEEFQFTLEE 475


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.312   0.128   0.349    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      430       370   0.00086  117 3  11 23  0.46    34
                                                     34  0.45    37


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  4
  No. of states in DFA:  587 (62 KB)
  Total size of DFA:  202 KB (2114 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  32.46u 0.13s 32.59t   Elapsed:  00:00:01
  Total cpu time:  32.46u 0.13s 32.59t   Elapsed:  00:00:01
  Start:  Fri May 10 00:06:00 2013   End:  Fri May 10 00:06:01 2013
