BLAST Results

Query Summary

Your job contains 1 sequence.

Parameters
Threshold: 0.001
Maximum number of alignments shown: 100
BLAST filter: on

Query Sequence

>019057
MALEQLSLARTPVDGETDGVNNGVEQRTTTASIDGGDDGCKAPRLPRWTRQEILVLIQGK
RVAENRVRRGRAAGMGFGSGQIEPKWASVSSYCKRHGVNRGPVQCRKRWSNLAGDFKKIK
EWESHVKDGTESFWVMRNDLRRERKLPGFFDREVYDILDGAATVASSASPGLGLALAPAE
ETTTDEAVFDSGRSAAADDGLFSDFEPEETTGTPVKDDAPAEAAPAAAKPISATMPIPEK
QYQPNLRGCHGQGTTTEKQPAPEIGSTSQDGRKRKRFTVDGDEEMSNMQYQLIDVLERNG
KMLTAQLEAQNNSFQLDREQRKDHADSLVAVLNKLADALGRIADKL

High Scoring Gene Products

Symbol, full name Information P value
AT2G33550 protein from Arabidopsis thaliana 1.3e-90
AT2G35640 protein from Arabidopsis thaliana 2.1e-12
AT4G31270 protein from Arabidopsis thaliana 2.9e-09
AT1G31310 protein from Arabidopsis thaliana 4.0e-08
AT5G51800 protein from Arabidopsis thaliana 2.2e-07

Back to top

Raw Blast Data

BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]

Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.

Reference:  Gish, W. (1996-2006) http://blast.wustl.edu

Query=  019057
        (346 letters)

Database:  go_20130330-seqdb.fasta
           368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done

                                                                     Smallest
                                                                       Sum
                                                              High  Probability
Sequences producing High-scoring Segment Pairs:              Score  P(N)      N

TAIR|locus:2051174 - symbol:AT2G33550 species:3702 "Arabi...   644  1.3e-90   2
TAIR|locus:2058718 - symbol:AT2G35640 species:3702 "Arabi...   186  2.1e-12   1
TAIR|locus:2128186 - symbol:AT4G31270 species:3702 "Arabi...   158  2.9e-09   1
TAIR|locus:2197490 - symbol:AT1G31310 species:3702 "Arabi...   148  4.0e-08   2
TAIR|locus:2165331 - symbol:AT5G51800 species:3702 "Arabi...   150  2.2e-07   1


>TAIR|locus:2051174 [details] [associations]
            symbol:AT2G33550 species:3702 "Arabidopsis thaliana"
            [GO:0005634 "nucleus" evidence=ISM] [GO:0003700 "sequence-specific
            DNA binding transcription factor activity" evidence=ISS]
            [GO:0006355 "regulation of transcription, DNA-dependent"
            evidence=TAS] InterPro:IPR009057 EMBL:CP002685 GO:GO:0003677
            GO:GO:0003700 Gene3D:1.10.10.60 InterPro:IPR017877 PROSITE:PS50090
            EMBL:AY065364 EMBL:AY096389 IPI:IPI00526924 RefSeq:NP_850213.1
            UniGene:At.28516 ProteinModelPortal:Q8VZ20 SMR:Q8VZ20 IntAct:Q8VZ20
            PRIDE:Q8VZ20 EnsemblPlants:AT2G33550.1 GeneID:817920
            KEGG:ath:AT2G33550 TAIR:At2g33550 HOGENOM:HOG000240766
            InParanoid:Q8VZ20 OMA:EETESFW PhylomeDB:Q8VZ20
            ProtClustDB:CLSN2680007 Genevestigator:Q8VZ20 Uniprot:Q8VZ20
        Length = 314

 Score = 644 (231.8 bits), Expect = 1.3e-90, Sum P(2) = 1.3e-90
 Identities = 124/160 (77%), Positives = 136/160 (85%)

Query:     1 MALEQLSLARTPVDGETDGVNNGVEQRTTTASIDGGDDGCKAPRLPRWTRQEILVLIQGK 60
             MALEQL L  + VDG   G N+      +  S DGGDDG K  RLPRWTRQEILVLIQGK
Sbjct:     1 MALEQLGLGVSAVDG---GENS------SAPSNDGGDDGVKTARLPRWTRQEILVLIQGK 51

Query:    61 RVAENRVRRGRAAGMGFGSGQIEPKWASVSSYCKRHGVNRGPVQCRKRWSNLAGDFKKIK 120
             RVAENRVRRGRAAGM  GSGQ+EPKWASVSSYCKRHGVNRGPVQCRKRWSNLAGD+KKIK
Sbjct:    52 RVAENRVRRGRAAGMALGSGQMEPKWASVSSYCKRHGVNRGPVQCRKRWSNLAGDYKKIK 111

Query:   121 EWESHVKDGTESFWVMRNDLRRERKLPGFFDREVYDILDG 160
             EWES +K+ TES+WVMRND+RRE+KLPGFFD+EVYDI+DG
Sbjct:   112 EWESQIKEETESYWVMRNDVRREKKLPGFFDKEVYDIVDG 151

 Score = 279 (103.3 bits), Expect = 1.3e-90, Sum P(2) = 1.3e-90
 Identities = 78/173 (45%), Positives = 97/173 (56%)

Query:   188 VFDSGRSAAADDGLFSDFE----PEETTGTPVKDDXXXXXXXXXXXXXXXTMPIPEKQYQ 243
             V   G + A+D+GL SD +    PE+   TPV                     + +K+ Q
Sbjct:   160 VLSLGLAPASDEGLLSDLDRRESPEKLNSTPV---------------AKSVTDVIDKEKQ 204

Query:   244 PNLRGCHG-QGTTTEKQPAP---EIGSTSQDGRKRKRFTVDGDEE------MSNMQYQLI 293
                  C   QG   EKQP     E GSTSQ+ RKRKR +    EE         MQ QLI
Sbjct:   205 ---EACVADQGRVKEKQPEAANVEGGSTSQEERKRKRTSFGEKEEEEEEGETKKMQNQLI 261

Query:   294 DVLERNGKMLTAQLEAQNNSFQLDREQRKDHADSLVAVLNKLADALGRIADKL 346
             ++LERNG++L AQLE QN + +LDREQRKDH DSLVAVLNKLADA+ +IADK+
Sbjct:   262 EILERNGQLLAAQLEVQNLNLKLDREQRKDHGDSLVAVLNKLADAVAKIADKM 314


>TAIR|locus:2058718 [details] [associations]
            symbol:AT2G35640 species:3702 "Arabidopsis thaliana"
            [GO:0005634 "nucleus" evidence=ISM] [GO:0003700 "sequence-specific
            DNA binding transcription factor activity" evidence=ISS]
            [GO:0006355 "regulation of transcription, DNA-dependent"
            evidence=TAS] EMBL:CP002685 GenomeReviews:CT485783_GR GO:GO:0003700
            EMBL:AC006068 InterPro:IPR017877 PROSITE:PS50090 IPI:IPI00526509
            PIR:B84771 RefSeq:NP_181107.1 UniGene:At.53046 UniGene:At.75395
            ProteinModelPortal:Q9ZQN7 SMR:Q9ZQN7 ProMEX:Q9ZQN7
            EnsemblPlants:AT2G35640.1 GeneID:818133 KEGG:ath:AT2G35640
            TAIR:At2g35640 eggNOG:NOG315255 HOGENOM:HOG000240297
            InParanoid:Q9ZQN7 OMA:VESSFNT PhylomeDB:Q9ZQN7
            ProtClustDB:CLSN2683797 Genevestigator:Q9ZQN7 Uniprot:Q9ZQN7
        Length = 340

 Score = 186 (70.5 bits), Expect = 2.1e-12, P = 2.1e-12
 Identities = 44/122 (36%), Positives = 62/122 (50%)

Query:    44 RLPRWTRQEILVLIQGKRVAENR-VRRGRAAGMGFGSGQIEPKWASVSSYCKRHGVNRGP 102
             R   WT  E LVLI+ K++ + R VRR      G      E +W  +  YC R G  R  
Sbjct:    18 RKGNWTVSETLVLIEAKKMDDQRRVRRSEKQPEGRNK-PAELRWKWIEEYCWRRGCYRNQ 76

Query:   103 VQCRKRWSNLAGDFKKIKEWE-SHVKDG-----TESFWVMRNDLRRERKLPGFFDREVYD 156
              QC  +W NL  D+KKI+E+E S V+       + S+W M    R+E+ LP     ++YD
Sbjct:    77 NQCNDKWDNLMRDYKKIREYERSRVESSFNTVTSSSYWKMDKTERKEKNLPSNMLPQIYD 136

Query:   157 IL 158
             +L
Sbjct:   137 VL 138


>TAIR|locus:2128186 [details] [associations]
            symbol:AT4G31270 species:3702 "Arabidopsis thaliana"
            [GO:0005634 "nucleus" evidence=ISM] [GO:0003700 "sequence-specific
            DNA binding transcription factor activity" evidence=ISS]
            [GO:0006355 "regulation of transcription, DNA-dependent"
            evidence=RCA;TAS] [GO:0009506 "plasmodesma" evidence=IDA]
            [GO:0043687 "post-translational protein modification" evidence=RCA]
            [GO:0045893 "positive regulation of transcription, DNA-dependent"
            evidence=RCA] GO:GO:0009506 EMBL:CP002687 GO:GO:0003700
            EMBL:BT005287 EMBL:AK118674 IPI:IPI00541003 RefSeq:NP_194855.2
            UniGene:At.31756 ProteinModelPortal:Q8GWR8 PRIDE:Q8GWR8
            EnsemblPlants:AT4G31270.1 GeneID:829254 KEGG:ath:AT4G31270
            TAIR:At4g31270 HOGENOM:HOG000148318 InParanoid:Q8GWR8 OMA:LPANCNT
            PhylomeDB:Q8GWR8 ProtClustDB:CLSN2918239 Genevestigator:Q8GWR8
            Uniprot:Q8GWR8
        Length = 294

 Score = 158 (60.7 bits), Expect = 2.9e-09, P = 2.9e-09
 Identities = 37/135 (27%), Positives = 64/135 (47%)

Query:    33 IDGGDDGCKAPR---LPRWTRQEILVLIQGKRVAENRVRRGRAAGMGFGSGQIEPKWASV 89
             ++ G  G +  R    P W  ++ LVL+    +A        A      S Q   KW  +
Sbjct:     1 MEEGTSGSRRTRSQVAPEWAVKDCLVLVN--EIAAVEADCSNA----LSSFQ---KWTMI 51

Query:    90 SSYCKRHGVNRGPVQCRKRWSNLAGDFKKIKEWESHVKDGTESFWVMRNDLRRERKLPGF 149
             +  C    V+R   QCR++W +L  D+ +IK+WES  +    S+W + +D R+   LPG 
Sbjct:    52 TENCNALDVSRNLNQCRRKWDSLMSDYNQIKKWESQYRGTGRSYWSLSSDKRKLLNLPGD 111

Query:   150 FDREVYDILDGAATV 164
              D E+++ ++    +
Sbjct:   112 IDIELFEAINAVVMI 126


>TAIR|locus:2197490 [details] [associations]
            symbol:AT1G31310 species:3702 "Arabidopsis thaliana"
            [GO:0005634 "nucleus" evidence=ISM] [GO:0003700 "sequence-specific
            DNA binding transcription factor activity" evidence=ISS]
            [GO:0006355 "regulation of transcription, DNA-dependent"
            evidence=TAS] EMBL:CP002684 GO:GO:0003700 InterPro:IPR017877
            PROSITE:PS50090 IPI:IPI00541830 RefSeq:NP_174416.2 UniGene:At.40372
            UniGene:At.70648 ProteinModelPortal:F4I9C1 SMR:F4I9C1 PRIDE:F4I9C1
            EnsemblPlants:AT1G31310.1 GeneID:840019 KEGG:ath:AT1G31310
            OMA:MDDERRM Uniprot:F4I9C1
        Length = 383

 Score = 148 (57.2 bits), Expect = 4.0e-08, Sum P(2) = 4.0e-08
 Identities = 44/139 (31%), Positives = 61/139 (43%)

Query:    44 RLPRWTRQEILVLIQGKRVAENRVRRGRAAGMGFGSGQ--------IEPKWASVSSYCKR 95
             R   WT  E +VLI+ KR+ + R R  R+ G+     Q         E +W  +  YC R
Sbjct:    15 RKGNWTLNETMVLIEAKRMDDER-RMRRSIGLPPPEQQQDIRSNKPAELRWKWIEDYCWR 73

Query:    96 HGVNRGPVQCRKRWSNLAGDFKKIKEWE-----SHVKDG-----------TESFWVMRND 139
              G  R   QC  +W NL  D+KK++E+E     S +  G           T S+W M   
Sbjct:    74 KGCMRSQNQCNDKWDNLMRDYKKVREYERRRVESSITAGESSSSSAPAGETASYWKMEKS 133

Query:   140 LRRERKLPGFFDREVYDIL 158
              R+ER LP     + Y  L
Sbjct:   134 ERKERSLPSNMLPQTYQAL 152

 Score = 41 (19.5 bits), Expect = 4.0e-08, Sum P(2) = 4.0e-08
 Identities = 11/50 (22%), Positives = 23/50 (46%)

Query:   297 ERNGKMLTAQLEAQNNSFQLDR---EQRKDHADSLVAVLNKLADALGRIA 343
             ER  +     +  Q    +++    E  ++  + LV  +NKLA ++  +A
Sbjct:   327 ERQDRRHKEVMNVQERRLKIEESNVEMNREGMNGLVEAINKLASSIFALA 376


>TAIR|locus:2165331 [details] [associations]
            symbol:AT5G51800 species:3702 "Arabidopsis thaliana"
            [GO:0005634 "nucleus" evidence=ISM] [GO:0016772 "transferase
            activity, transferring phosphorus-containing groups" evidence=IEA]
            [GO:0048445 "carpel morphogenesis" evidence=RCA] InterPro:IPR000719
            InterPro:IPR011009 Pfam:PF00069 GO:GO:0005524 EMBL:CP002688
            GenomeReviews:BA000015_GR EMBL:AB010074 SUPFAM:SSF56112
            GO:GO:0004672 IPI:IPI00547167 RefSeq:NP_199993.1 UniGene:At.29659
            ProteinModelPortal:Q9FLH9 SMR:Q9FLH9 PRIDE:Q9FLH9
            EnsemblPlants:AT5G51800.1 GeneID:835255 KEGG:ath:AT5G51800
            TAIR:At5g51800 eggNOG:NOG308341 HOGENOM:HOG000090978
            InParanoid:Q9FLH9 OMA:LWLARAW PhylomeDB:Q9FLH9
            ProtClustDB:CLSN2687432 Genevestigator:Q9FLH9 Uniprot:Q9FLH9
        Length = 972

 Score = 150 (57.9 bits), Expect = 2.2e-07, P = 2.2e-07
 Identities = 38/117 (32%), Positives = 56/117 (47%)

Query:    46 PRWTRQEILVLIQGKRVAENRVRRGRAAGMGFGSGQIEP-KWASVSSYCKRHGVNRGPVQ 104
             P W   E+L L +  R        G  +G   G G+    K   V+ Y  RHG+NR    
Sbjct:   149 PVWKPNEMLWLARAWRAQYQTQGTGSGSGSVEGRGKTRAEKDREVAEYLNRHGINRDSKI 208

Query:   105 CRKRWSNLAGDFKKIKEWES---HVKDGTESFWVMRNDLRRERKLPGFFDREVYDIL 158
                +W N+ G+F+K+ EWE      K G +S++ +    R++ +LP  FD EVY  L
Sbjct:   209 AGTKWDNMLGEFRKVYEWEKCGDQDKYG-KSYFRLSPYERKQHRLPASFDEEVYQEL 264


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.315   0.133   0.395    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      346       308   0.00078  116 3  11 22  0.44    34
                                                     33  0.45    37


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  5
  No. of states in DFA:  609 (65 KB)
  Total size of DFA:  221 KB (2121 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  25.38u 0.10s 25.48t   Elapsed:  00:00:01
  Total cpu time:  25.38u 0.10s 25.48t   Elapsed:  00:00:01
  Start:  Tue May 21 03:36:20 2013   End:  Tue May 21 03:36:21 2013

Back to top