BLAST Results

Query Summary

Your job contains 1 sequence.

Parameters
Threshold: 0.001
Maximum number of alignments shown: 100
BLAST filter: on

Query Sequence

>001741
MGDNKGDTTKKQKSQQPQQLQQQQQQPQQQQLSSSPKNPLEESTEKGQHHHQQPPAVVVT
GAPYISAPLYVPIGPTSSSPFEQQFEPVNPKRQRYNSSQWKLLPSPSQQQQQKQQAQMAI
LTTESSPSPTTLPITNPQSQAHTTTASSSDTASSPPHSPIPSLSAASGQETNRPELLGEQ
FHPQFRKGKYVSPVWKPNEMLWLARAWRIQYQGGGGSNGSGSSSRVDHHQPESTGQGEVA
AAAQSTRAKTRADKDREVAEFLQRHGVNRDAKTSGTKWDNMLGEFRKVYEWERGGEREQV
GKSYFRLSPYERKLHRLPASFDEEVFEELSQFMGSRMRSTSQSRAASSVFVSSDHDNRST
RALPPPPPFKEDELSLSARAKQLVMTSGGEAFFHGGRGSLLGFESSLDVGSKELRRIGRI
RMTWEESVSLWAEEGEHQRGRIKLQGSSFLNADELTFFDDAMVACNMEAFEEGTLRGFSV
DRFVPGQQVKVFGRRKSSSGSASASSAGFFERVQFPFTEPSIRFTPWEYQDPSDYYVGCL
RVPPTTLPSLFELSWHLQEPPPEEFRFPIRKDVYRDLPQGKEIFFTTSTELLDCRAITYD
ILSPIVRGNNPSLSFSTAASRDSFIGLWDDCINRVVSKFCSSEMVIVRKPSSSSISSPEP
LQDQWPNVTGFVRNLCLWRGEETDQLKEGQLDPSSSIVEKLLWTYMDLPYLFGYYAVGYI
VTFCALSRSQDRVVRTDLYSLDLSSPVERLKALIPCYRIAGLLPLLADRCFNSSSSIISN
GGGGYKQYPFSDFERIDLGSGNIIEMTPNTVTRFFSSKRKWAAVKQIYDFLDHRIPHAEA
IYESSEKDLALVFKPRGCKLKPTSCEQLVEALKYVTKALVALHDLSFMHKDLSWDKVMRR
SDRENEWFVSGFDEAASAPQIYPQLVAAVAGGVEARGRHAPEMGRPGLHGVKVDVWGVGQ
LVKTCGLSNVPKMLRELQNRCLDQNPELRPTAADCYHHLLQLQSSLSVAAASSSTAPY

High Scoring Gene Products

Symbol, full name Information P value
AT5G51800 protein from Arabidopsis thaliana 2.8e-235
AT2G33550 protein from Arabidopsis thaliana 9.4e-05
AT2G35640 protein from Arabidopsis thaliana 0.00037
AT4G31270 protein from Arabidopsis thaliana 0.00049

Back to top

Raw Blast Data

BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]

Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.

Reference:  Gish, W. (1996-2006) http://blast.wustl.edu

Query=  001741
        (1018 letters)

Database:  go_20130330-seqdb.fasta
           368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done

                                                                     Smallest
                                                                       Sum
                                                              High  Probability
Sequences producing High-scoring Segment Pairs:              Score  P(N)      N

TAIR|locus:2165331 - symbol:AT5G51800 species:3702 "Arabi...  1681  2.8e-235  4
TAIR|locus:2051174 - symbol:AT2G33550 species:3702 "Arabi...   126  9.4e-05   1
TAIR|locus:2058718 - symbol:AT2G35640 species:3702 "Arabi...   122  0.00037   2
TAIR|locus:2128186 - symbol:AT4G31270 species:3702 "Arabi...   119  0.00049   1


>TAIR|locus:2165331 [details] [associations]
            symbol:AT5G51800 species:3702 "Arabidopsis thaliana"
            [GO:0005634 "nucleus" evidence=ISM] [GO:0016772 "transferase
            activity, transferring phosphorus-containing groups" evidence=IEA]
            [GO:0048445 "carpel morphogenesis" evidence=RCA] InterPro:IPR000719
            InterPro:IPR011009 Pfam:PF00069 GO:GO:0005524 EMBL:CP002688
            GenomeReviews:BA000015_GR EMBL:AB010074 SUPFAM:SSF56112
            GO:GO:0004672 IPI:IPI00547167 RefSeq:NP_199993.1 UniGene:At.29659
            ProteinModelPortal:Q9FLH9 SMR:Q9FLH9 PRIDE:Q9FLH9
            EnsemblPlants:AT5G51800.1 GeneID:835255 KEGG:ath:AT5G51800
            TAIR:At5g51800 eggNOG:NOG308341 HOGENOM:HOG000090978
            InParanoid:Q9FLH9 OMA:LWLARAW PhylomeDB:Q9FLH9
            ProtClustDB:CLSN2687432 Genevestigator:Q9FLH9 Uniprot:Q9FLH9
        Length = 972

 Score = 1681 (596.8 bits), Expect = 2.8e-235, Sum P(4) = 2.8e-235
 Identities = 335/609 (55%), Positives = 428/609 (70%)

Query:   405 SSLDVGSKELRRIGRIRMTWEESVSLWAEEGEHQRGRIKLQGSSFLNADELTFFDDAMVA 464
             SS     ++LRRIG+IR+TWEESV+LWAE GE   GRI++ GSSFLNADELT+ DD+MVA
Sbjct:   373 SSSSSSLRDLRRIGKIRLTWEESVNLWAE-GEVDYGRIRVSGSSFLNADELTYLDDSMVA 431

Query:   465 CNMEAFEEGTLRGFSVDRFVPGQQVKVFGRRKXXXXXX---XXXXXXFFERVQFPFTEPS 521
             C ME+F++G L+GFS+D+F+ GQ +KVFGR++                F+R Q   +EP 
Sbjct:   432 CTMESFQDGPLKGFSLDKFISGQHLKVFGRQRSTSSSAPSPSVNVAGVFDRPQLQLSEPI 491

Query:   522 IR-FTPWEYQDPSDYYVGCLRVPPTTLPSLFELSWHLQXXXXXXFRFPIRKDVYRDLPQG 580
              +  +  E+QDPS++ +  LRVP   LPSLFEL+ +LQ       RFP+R DVY+DLPQG
Sbjct:   492 YKSISTLEFQDPSEHCLSKLRVPAGNLPSLFELARYLQEPPPENLRFPLRPDVYKDLPQG 551

Query:   581 KEIFFT-TSTELLDCRAITYDILSPIVRGNNPSLSFSTAASRDSFIGLWDDCINRVVSKF 639
             KE+FF+ +STELLDCRAITYDI+ PI+   N +  F   +S+DS I LWDDCINR+VSKF
Sbjct:   552 KELFFSISSTELLDCRAITYDIIGPIMSRLNSNNGF-VISSKDSLIPLWDDCINRMVSKF 610

Query:   640 CSSEMVIVRKPXXXXXXXPEPLQDQWPNVTGFVRNLCLWRGEETDQLKEGQLDPSSSIVE 699
             C  EM I+RKP        E +Q QWPNV G+V+   LWRGEE D+++EG  DPSS + E
Sbjct:   611 C--EMAILRKPDSSSCI--ENVQHQWPNVIGYVKGFGLWRGEEADKVREGAADPSSLLAE 666

Query:   700 KLLWTYMDLPYLFGYYAVGYIVTFCALS-RSQDRVVRTDLYSLDLSSPVERLKALIPCYR 758
             K+LW+Y DLPY+ GY+A+G+ VTFCALS  SQDRV+ TDLYS ++SSP +R+KAL+PCYR
Sbjct:   667 KILWSYNDLPYILGYHAIGFTVTFCALSLSSQDRVICTDLYSFNVSSPSDRIKALVPCYR 726

Query:   759 IAGLLPLLADRCFXXXXXXXXXXXXXYKQYPFSDFERIDLGSGNIIEMTPNTVTRFFSSK 818
             +A LLPLLADRC               +   ++DFERID G   + E+TP+TVTR++SSK
Sbjct:   727 LASLLPLLADRCTT-------------RPSCYNDFERIDRGD-YVTELTPHTVTRYYSSK 772

Query:   819 RKWAAVKQIYDFLDHRIPHAEAIYESSEKDLALVFKPRGCKLKPTSCEQLVEALKYVTKA 878
             RKW  VK IYDFLD R+PHAE +  +SEKDL+L FKPRG ++KP + +QL+++L  VTKA
Sbjct:   773 RKWLGVKGIYDFLDQRVPHAEHLDMASEKDLSLSFKPRGIRVKPRNIDQLIDSLMCVTKA 832

Query:   879 LVALHDLSFMHKDLSWDKVMRRS-----DRENEWFVSGFDEAASAPQIYPQLXXXXXXXX 933
             L+ALHDLSFMH+D+ WD VMR +       + +WFV GFD A  APQ+ P          
Sbjct:   833 LLALHDLSFMHRDMGWDNVMRSTATTTTTTDTDWFVCGFDAAVEAPQLNPHRPADKVVDN 892

Query:   934 XXXXXX----XPEMGRPGLHGVKVDVWGVGQLVKTCGLSNVPKMLRELQNRCLDQNPELR 989
                        PEM R GLH VKVDVWGVG ++KTCGLSNVPKMLR+LQ +CL+ N E R
Sbjct:   893 EEREDERGRYAPEMER-GLHAVKVDVWGVGYMIKTCGLSNVPKMLRDLQGKCLEPNQENR 951

Query:   990 PTAADCYHH 998
             PTAADC+HH
Sbjct:   952 PTAADCFHH 960

 Score = 427 (155.4 bits), Expect = 2.8e-235, Sum P(4) = 2.8e-235
 Identities = 84/141 (59%), Positives = 108/141 (76%)

Query:   229 HQPESTGQGEVAAAAQSTRAKTRADKDREVAEFLQRHGVNRDAKTSGTKWDNMLGEFRKV 288
             +Q + TG G   + +   R KTRA+KDREVAE+L RHG+NRD+K +GTKWDNMLGEFRKV
Sbjct:   167 YQTQGTGSG---SGSVEGRGKTRAEKDREVAEYLNRHGINRDSKIAGTKWDNMLGEFRKV 223

Query:   289 YEWERGGEREQVGKSYFRLSPYERKLHRLPASFDEEVFEELSQFMGSRMRSTSQSRAA-- 346
             YEWE+ G++++ GKSYFRLSPYERK HRLPASFDEEV++EL+ FMG R+R+ + +R    
Sbjct:   224 YEWEKCGDQDKYGKSYFRLSPYERKQHRLPASFDEEVYQELALFMGPRVRAPTINRGGGG 283

Query:   347 -SSVFVSSDHDNRSTRALPPP 366
              ++V V+S     S  ALPPP
Sbjct:   284 GATVTVASTPP--SVEALPPP 302

 Score = 166 (63.5 bits), Expect = 2.8e-235, Sum P(4) = 2.8e-235
 Identities = 29/34 (85%), Positives = 30/34 (88%)

Query:   179 EQFHPQFRKGKYVSPVWKPNEMLWLARAWRIQYQ 212
             E F  +FRKGKYVSPVWKPNEMLWLARAWR QYQ
Sbjct:   135 ESFQHKFRKGKYVSPVWKPNEMLWLARAWRAQYQ 168

 Score = 48 (22.0 bits), Expect = 2.8e-235, Sum P(4) = 2.8e-235
 Identities = 16/48 (33%), Positives = 23/48 (47%)

Query:    49 HHHQQPPAVVVTGAPYISAPLYVPIGPTSSSPFEQQFEPVNPKRQRYN 96
             HHH Q          ++  P+++P   T SSP      PV PKR R++
Sbjct:    39 HHHHQS---------FLPTPIFIP---TVSSPGA----PVIPKRPRFS 70

 Score = 41 (19.5 bits), Expect = 2.1e-49, Sum P(4) = 2.1e-49
 Identities = 16/63 (25%), Positives = 25/63 (39%)

Query:   398 GSLLGFESSLDVGSKELRRIGRIRMTWEESVSLWAE-EGEHQRGRIKLQGSSFLNADELT 456
             G L GF     +  + L+  GR R T   + S      G   R +++L    + +   L 
Sbjct:   440 GPLKGFSLDKFISGQHLKVFGRQRSTSSSAPSPSVNVAGVFDRPQLQLSEPIYKSISTLE 499

Query:   457 FFD 459
             F D
Sbjct:   500 FQD 502

 Score = 39 (18.8 bits), Expect = 2.0e-07, Sum P(3) = 2.0e-07
 Identities = 14/46 (30%), Positives = 21/46 (45%)

Query:   320 SFDEEVFEELSQFMGSRMRSTSQSRAASSVFVSSDHDNRSTRALPP 365
             S D+ +  +  +  G R RSTS S  + SV V+   D    +   P
Sbjct:   446 SLDKFISGQHLKVFG-RQRSTSSSAPSPSVNVAGVFDRPQLQLSEP 490


>TAIR|locus:2051174 [details] [associations]
            symbol:AT2G33550 species:3702 "Arabidopsis thaliana"
            [GO:0005634 "nucleus" evidence=ISM] [GO:0003700 "sequence-specific
            DNA binding transcription factor activity" evidence=ISS]
            [GO:0006355 "regulation of transcription, DNA-dependent"
            evidence=TAS] InterPro:IPR009057 EMBL:CP002685 GO:GO:0003677
            GO:GO:0003700 Gene3D:1.10.10.60 InterPro:IPR017877 PROSITE:PS50090
            EMBL:AY065364 EMBL:AY096389 IPI:IPI00526924 RefSeq:NP_850213.1
            UniGene:At.28516 ProteinModelPortal:Q8VZ20 SMR:Q8VZ20 IntAct:Q8VZ20
            PRIDE:Q8VZ20 EnsemblPlants:AT2G33550.1 GeneID:817920
            KEGG:ath:AT2G33550 TAIR:At2g33550 HOGENOM:HOG000240766
            InParanoid:Q8VZ20 OMA:EETESFW PhylomeDB:Q8VZ20
            ProtClustDB:CLSN2680007 Genevestigator:Q8VZ20 Uniprot:Q8VZ20
        Length = 314

 Score = 126 (49.4 bits), Expect = 9.4e-05, P = 9.4e-05
 Identities = 27/87 (31%), Positives = 47/87 (54%)

Query:   241 AAAQSTRAKTRADKDREVAEFLQRHGVNRDAKTSGTKWDNMLGEFRKVYEWERGGEREQV 300
             AA  +  +     K   V+ + +RHGVNR       +W N+ G+++K+ EWE   + E  
Sbjct:    63 AAGMALGSGQMEPKWASVSSYCKRHGVNRGPVQCRKRWSNLAGDYKKIKEWESQIKEET- 121

Query:   301 GKSYFRLSPYERKLHRLPASFDEEVFE 327
              +SY+ +    R+  +LP  FD+EV++
Sbjct:   122 -ESYWVMRNDVRREKKLPGFFDKEVYD 147


>TAIR|locus:2058718 [details] [associations]
            symbol:AT2G35640 species:3702 "Arabidopsis thaliana"
            [GO:0005634 "nucleus" evidence=ISM] [GO:0003700 "sequence-specific
            DNA binding transcription factor activity" evidence=ISS]
            [GO:0006355 "regulation of transcription, DNA-dependent"
            evidence=TAS] EMBL:CP002685 GenomeReviews:CT485783_GR GO:GO:0003700
            EMBL:AC006068 InterPro:IPR017877 PROSITE:PS50090 IPI:IPI00526509
            PIR:B84771 RefSeq:NP_181107.1 UniGene:At.53046 UniGene:At.75395
            ProteinModelPortal:Q9ZQN7 SMR:Q9ZQN7 ProMEX:Q9ZQN7
            EnsemblPlants:AT2G35640.1 GeneID:818133 KEGG:ath:AT2G35640
            TAIR:At2g35640 eggNOG:NOG315255 HOGENOM:HOG000240297
            InParanoid:Q9ZQN7 OMA:VESSFNT PhylomeDB:Q9ZQN7
            ProtClustDB:CLSN2683797 Genevestigator:Q9ZQN7 Uniprot:Q9ZQN7
        Length = 340

 Score = 122 (48.0 bits), Expect = 0.00037, Sum P(2) = 0.00037
 Identities = 25/94 (26%), Positives = 49/94 (52%)

Query:   258 VAEFLQRHGVNRDAKTSGTKWDNMLGEFRKVYEWERGGEREQ----VGKSYFRLSPYERK 313
             + E+  R G  R+      KWDN++ +++K+ E+ER             SY+++   ERK
Sbjct:    63 IEEYCWRRGCYRNQNQCNDKWDNLMRDYKKIREYERSRVESSFNTVTSSSYWKMDKTERK 122

Query:   314 LHRLPASFDEEVFEELSQFMGSRMRSTSQSRAAS 347
                LP++   ++++ LS+ +  +   +S S AA+
Sbjct:   123 EKNLPSNMLPQIYDVLSELVDRKTLPSSSSAAAA 156

 Score = 43 (20.2 bits), Expect = 0.00037, Sum P(2) = 0.00037
 Identities = 10/16 (62%), Positives = 11/16 (68%)

Query:   362 ALPPPPPFKEDELSLS 377
             +LPPPPP     LSLS
Sbjct:   194 SLPPPPP---QSLSLS 206


>TAIR|locus:2128186 [details] [associations]
            symbol:AT4G31270 species:3702 "Arabidopsis thaliana"
            [GO:0005634 "nucleus" evidence=ISM] [GO:0003700 "sequence-specific
            DNA binding transcription factor activity" evidence=ISS]
            [GO:0006355 "regulation of transcription, DNA-dependent"
            evidence=RCA;TAS] [GO:0009506 "plasmodesma" evidence=IDA]
            [GO:0043687 "post-translational protein modification" evidence=RCA]
            [GO:0045893 "positive regulation of transcription, DNA-dependent"
            evidence=RCA] GO:GO:0009506 EMBL:CP002687 GO:GO:0003700
            EMBL:BT005287 EMBL:AK118674 IPI:IPI00541003 RefSeq:NP_194855.2
            UniGene:At.31756 ProteinModelPortal:Q8GWR8 PRIDE:Q8GWR8
            EnsemblPlants:AT4G31270.1 GeneID:829254 KEGG:ath:AT4G31270
            TAIR:At4g31270 HOGENOM:HOG000148318 InParanoid:Q8GWR8 OMA:LPANCNT
            PhylomeDB:Q8GWR8 ProtClustDB:CLSN2918239 Genevestigator:Q8GWR8
            Uniprot:Q8GWR8
        Length = 294

 Score = 119 (46.9 bits), Expect = 0.00049, P = 0.00049
 Identities = 28/94 (29%), Positives = 49/94 (52%)

Query:   238 EVAAA-AQSTRAKTRADKDREVAEFLQRHGVNRDAKTSGTKWDNMLGEFRKVYEWERGGE 296
             E+AA  A  + A +   K   + E      V+R+      KWD+++ ++ ++ +WE   +
Sbjct:    30 EIAAVEADCSNALSSFQKWTMITENCNALDVSRNLNQCRRKWDSLMSDYNQIKKWE--SQ 87

Query:   297 REQVGKSYFRLSPYERKLHRLPASFDEEVFEELS 330
                 G+SY+ LS  +RKL  LP   D E+FE ++
Sbjct:    88 YRGTGRSYWSLSSDKRKLLNLPGDIDIELFEAIN 121


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.320   0.136   0.420    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0     1018       839   0.00081  122 3  11 22  0.39    34
                                                     37  0.45    37


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  4
  No. of states in DFA:  630 (67 KB)
  Total size of DFA:  461 KB (2215 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  69.97u 0.12s 70.09t   Elapsed:  00:00:03
  Total cpu time:  69.97u 0.12s 70.09t   Elapsed:  00:00:03
  Start:  Mon May 20 16:09:08 2013   End:  Mon May 20 16:09:11 2013

Back to top