BLAST Results

Query Summary

Your job contains 1 sequence.

Parameters
Threshold: 0.001
Maximum number of alignments shown: 100
BLAST filter: on

Query Sequence

>038571
MEPGSNDKHEDGSSLPNGYNEEQAIDGNTDDRSKTARNPRWTRQETIVLIQGKRVVEDRI
RGFRTSTSAFRSDHSEPKWDSVASYCKQYGVNRRPVQCRKRWGNLLVDFRKIKRWESQMK
EEKQSFWVMRNESRKQMKLPGYFDREVYDVLDGVLAMPAVPLTTMSVSEEDEDDEVFDSD
RSTAAGDGLFSDSEPSQRQEISHNPEKETTERQSPSKKVAAQLHVADTLKEKLAGTTTAN
GSTTQERWKRRRLSSCVSKETNMGDLLFKVLERNSSMLNTQLEAQNINCQLDREQKKEHS
DNLIAAMNKLTDALLRIGNKL

High Scoring Gene Products

Symbol, full name Information P value
AT2G33550 protein from Arabidopsis thaliana 1.8e-64
AT4G31270 protein from Arabidopsis thaliana 1.1e-15
AT2G35640 protein from Arabidopsis thaliana 9.6e-13
AT1G31310 protein from Arabidopsis thaliana 1.2e-08
AT5G51800 protein from Arabidopsis thaliana 1.0e-05

Back to top

Raw Blast Data

BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]

Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.

Reference:  Gish, W. (1996-2006) http://blast.wustl.edu

Query=  038571
        (321 letters)

Database:  go_20130330-seqdb.fasta
           368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done

                                                                     Smallest
                                                                       Sum
                                                              High  Probability
Sequences producing High-scoring Segment Pairs:              Score  P(N)      N

TAIR|locus:2051174 - symbol:AT2G33550 species:3702 "Arabi...   657  1.8e-64   1
TAIR|locus:2128186 - symbol:AT4G31270 species:3702 "Arabi...   206  1.1e-15   1
TAIR|locus:2058718 - symbol:AT2G35640 species:3702 "Arabi...   179  9.6e-13   2
TAIR|locus:2197490 - symbol:AT1G31310 species:3702 "Arabi...   146  1.2e-08   2
TAIR|locus:2165331 - symbol:AT5G51800 species:3702 "Arabi...   135  1.0e-05   1


>TAIR|locus:2051174 [details] [associations]
            symbol:AT2G33550 species:3702 "Arabidopsis thaliana"
            [GO:0005634 "nucleus" evidence=ISM] [GO:0003700 "sequence-specific
            DNA binding transcription factor activity" evidence=ISS]
            [GO:0006355 "regulation of transcription, DNA-dependent"
            evidence=TAS] InterPro:IPR009057 EMBL:CP002685 GO:GO:0003677
            GO:GO:0003700 Gene3D:1.10.10.60 InterPro:IPR017877 PROSITE:PS50090
            EMBL:AY065364 EMBL:AY096389 IPI:IPI00526924 RefSeq:NP_850213.1
            UniGene:At.28516 ProteinModelPortal:Q8VZ20 SMR:Q8VZ20 IntAct:Q8VZ20
            PRIDE:Q8VZ20 EnsemblPlants:AT2G33550.1 GeneID:817920
            KEGG:ath:AT2G33550 TAIR:At2g33550 HOGENOM:HOG000240766
            InParanoid:Q8VZ20 OMA:EETESFW PhylomeDB:Q8VZ20
            ProtClustDB:CLSN2680007 Genevestigator:Q8VZ20 Uniprot:Q8VZ20
        Length = 314

 Score = 657 (236.3 bits), Expect = 1.8e-64, P = 1.8e-64
 Identities = 145/321 (45%), Positives = 198/321 (61%)

Query:    13 SSLPNGYNEEQAIDGNTDDRSKTARNPRWTRQETIVLIQGKRVVEDRIRGFRTSTSAFRS 72
             S++  G N     +   DD  KTAR PRWTRQE +VLIQGKRV E+R+R  R +  A  S
Sbjct:    11 SAVDGGENSSAPSNDGGDDGVKTARLPRWTRQEILVLIQGKRVAENRVRRGRAAGMALGS 70

Query:    73 DHSEPKWDSVASYCKQYGVNRRPVQCRKRWGNLLVDFRKIKRWESQMKEEKQSFWVMRNE 132
                EPKW SV+SYCK++GVNR PVQCRKRW NL  D++KIK WESQ+KEE +S+WVMRN+
Sbjct:    71 GQMEPKWASVSSYCKRHGVNRGPVQCRKRWSNLAGDYKKIKEWESQIKEETESYWVMRND 130

Query:   133 SRKQMKLPGYFDREVYDVLDGVLAMPAVPLTTMXXXXXXXXXXXXXXXXXTAAGDGLFSD 192
              R++ KLPG+FD+EVYD++DG +  PAVP+ ++                  A+ +GL SD
Sbjct:   131 VRREKKLPGFFDKEVYDIVDGGVIPPAVPVLSLGLA--------------PASDEGLLSD 176

Query:   193 SEPSQRQE-ISHNP-EKETTERQSPSKKVAAQLHVADT--LKEKLA-GTTTANGSTTQER 247
              +  +  E ++  P  K  T+     K+ A    VAD   +KEK         GST+QE 
Sbjct:   177 LDRRESPEKLNSTPVAKSVTDVIDKEKQEAC---VADQGRVKEKQPEAANVEGGSTSQEE 233

Query:   248 WKRRRLSSCVSKETN-------MGDLLFKVLERNSSMLNTQLEAQNINCQLDREQKKEHS 300
              KR+R S    +E         M + L ++LERN  +L  QLE QN+N +LDREQ+K+H 
Sbjct:   234 RKRKRTSFGEKEEEEEEGETKKMQNQLIEILERNGQLLAAQLEVQNLNLKLDREQRKDHG 293

Query:   301 DNLIAAMNKLTDALLRIGNKL 321
             D+L+A +NKL DA+ +I +K+
Sbjct:   294 DSLVAVLNKLADAVAKIADKM 314


>TAIR|locus:2128186 [details] [associations]
            symbol:AT4G31270 species:3702 "Arabidopsis thaliana"
            [GO:0005634 "nucleus" evidence=ISM] [GO:0003700 "sequence-specific
            DNA binding transcription factor activity" evidence=ISS]
            [GO:0006355 "regulation of transcription, DNA-dependent"
            evidence=RCA;TAS] [GO:0009506 "plasmodesma" evidence=IDA]
            [GO:0043687 "post-translational protein modification" evidence=RCA]
            [GO:0045893 "positive regulation of transcription, DNA-dependent"
            evidence=RCA] GO:GO:0009506 EMBL:CP002687 GO:GO:0003700
            EMBL:BT005287 EMBL:AK118674 IPI:IPI00541003 RefSeq:NP_194855.2
            UniGene:At.31756 ProteinModelPortal:Q8GWR8 PRIDE:Q8GWR8
            EnsemblPlants:AT4G31270.1 GeneID:829254 KEGG:ath:AT4G31270
            TAIR:At4g31270 HOGENOM:HOG000148318 InParanoid:Q8GWR8 OMA:LPANCNT
            PhylomeDB:Q8GWR8 ProtClustDB:CLSN2918239 Genevestigator:Q8GWR8
            Uniprot:Q8GWR8
        Length = 294

 Score = 206 (77.6 bits), Expect = 1.1e-15, P = 1.1e-15
 Identities = 64/301 (21%), Positives = 126/301 (41%)

Query:    22 EQAIDGNTDDRSKTARNPRWTRQETIVLIQGKRVVEDRIRGFRTSTSAFRSDHSEPKWDS 81
             E+   G+   RS+ A  P W  ++ +VL+     VE        + S+F+      KW  
Sbjct:     2 EEGTSGSRRTRSQVA--PEWAVKDCLVLVNEIAAVE---ADCSNALSSFQ------KWTM 50

Query:    82 VASYCKQYGVNRRPVQCRKRWGNLLVDFRKIKRWESQMKEEKQSFWVMRNESRKQMKLPG 141
             +   C    V+R   QCR++W +L+ D+ +IK+WESQ +   +S+W + ++ RK + LPG
Sbjct:    51 ITENCNALDVSRNLNQCRRKWDSLMSDYNQIKKWESQYRGTGRSYWSLSSDKRKLLNLPG 110

Query:   142 YFDREVYDVLDGVLAMPAVPLTTMXXXXXXXXXXXXXXXXXTAAGDGLFSDSEPSQRQEI 201
               D E+++ ++ V+ +      T                     G           ++  
Sbjct:   111 DIDIELFEAINAVVMIQDEKAGTESDSDPEAQDVVDLSAELAFVGSKRSRQRTMVMKETK 170

Query:   202 SHNPEKETTERQSPSKKVAAQL-HVADTLKEKLAGTTTANGSTTQERWKRRRLSSCVS-K 259
                P     +  +  K +  +  H   T+ EK       + ST +E  +   +   V   
Sbjct:   171 KEEPRTSRVQVNTREKPITTKATHQNKTMGEK---KPVEDMSTDEEEDETMNIEEDVEVM 227

Query:   260 ETNMG---DLLFKVLERNSSMLNTQLEAQNINCQLDREQKKEHSDNLIAAMNKLTDALLR 316
             E  +    DL+  ++ RN +  N   +  +++ +L  +  ++  D LI  ++++   L R
Sbjct:   228 EAKLSYKIDLIHAIVGRNLAKDNETKDGVSMDDKL--KSVRQQGDELIGCLSEIVSTLNR 285

Query:   317 I 317
             +
Sbjct:   286 L 286


>TAIR|locus:2058718 [details] [associations]
            symbol:AT2G35640 species:3702 "Arabidopsis thaliana"
            [GO:0005634 "nucleus" evidence=ISM] [GO:0003700 "sequence-specific
            DNA binding transcription factor activity" evidence=ISS]
            [GO:0006355 "regulation of transcription, DNA-dependent"
            evidence=TAS] EMBL:CP002685 GenomeReviews:CT485783_GR GO:GO:0003700
            EMBL:AC006068 InterPro:IPR017877 PROSITE:PS50090 IPI:IPI00526509
            PIR:B84771 RefSeq:NP_181107.1 UniGene:At.53046 UniGene:At.75395
            ProteinModelPortal:Q9ZQN7 SMR:Q9ZQN7 ProMEX:Q9ZQN7
            EnsemblPlants:AT2G35640.1 GeneID:818133 KEGG:ath:AT2G35640
            TAIR:At2g35640 eggNOG:NOG315255 HOGENOM:HOG000240297
            InParanoid:Q9ZQN7 OMA:VESSFNT PhylomeDB:Q9ZQN7
            ProtClustDB:CLSN2683797 Genevestigator:Q9ZQN7 Uniprot:Q9ZQN7
        Length = 340

 Score = 179 (68.1 bits), Expect = 9.6e-13, Sum P(2) = 9.6e-13
 Identities = 40/135 (29%), Positives = 65/135 (48%)

Query:    34 KTARNPRWTRQETIVLIQGKRVVEDR-IRGFRTSTSAFRSDHSEPKWDSVASYCKQYGVN 92
             +  R   WT  ET+VLI+ K++ + R +R         R+  +E +W  +  YC + G  
Sbjct:    15 RECRKGNWTVSETLVLIEAKKMDDQRRVRRSEKQPEG-RNKPAELRWKWIEEYCWRRGCY 73

Query:    93 RRPVQCRKRWGNLLVDFRKIKRWESQMKEEK------QSFWVMRNESRKQMKLPGYFDRE 146
             R   QC  +W NL+ D++KI+ +E    E         S+W M    RK+  LP     +
Sbjct:    74 RNQNQCNDKWDNLMRDYKKIREYERSRVESSFNTVTSSSYWKMDKTERKEKNLPSNMLPQ 133

Query:   147 VYDVLDGVLAMPAVP 161
             +YDVL  ++    +P
Sbjct:   134 IYDVLSELVDRKTLP 148

 Score = 45 (20.9 bits), Expect = 9.6e-13, Sum P(2) = 9.6e-13
 Identities = 25/129 (19%), Positives = 55/129 (42%)

Query:   195 PSQRQEIS-HNPEKETTERQSPSKKVAAQLHVADTLKEKLA-GTTTANGST-TQERWKRR 251
             P Q   +S  +P +        ++ +   +  + T + +   G TTA G    +E     
Sbjct:   199 PPQSLSLSLPSPPQPPPSSSFHAEPIPPTVGTSSTKRRRTTPGETTAGGEREVEEDAVGV 258

Query:   252 RLSSCVSKETNMGDLLFKVLERNSSMLNTQLEAQNINCQLDR-EQKKEHSDNLIAAMNKL 310
              LS C S  T +     +  ER    +  +L+ + +  +  + E  ++  + L+ A+N+L
Sbjct:   259 ALSRCTSVITQVIRENEEGQERRHKEV-VRLQERRLKIEESKTEINRQGMNGLVDAINQL 317

Query:   311 TDALLRIGN 319
               ++L + +
Sbjct:   318 ASSILALAS 326


>TAIR|locus:2197490 [details] [associations]
            symbol:AT1G31310 species:3702 "Arabidopsis thaliana"
            [GO:0005634 "nucleus" evidence=ISM] [GO:0003700 "sequence-specific
            DNA binding transcription factor activity" evidence=ISS]
            [GO:0006355 "regulation of transcription, DNA-dependent"
            evidence=TAS] EMBL:CP002684 GO:GO:0003700 InterPro:IPR017877
            PROSITE:PS50090 IPI:IPI00541830 RefSeq:NP_174416.2 UniGene:At.40372
            UniGene:At.70648 ProteinModelPortal:F4I9C1 SMR:F4I9C1 PRIDE:F4I9C1
            EnsemblPlants:AT1G31310.1 GeneID:840019 KEGG:ath:AT1G31310
            OMA:MDDERRM Uniprot:F4I9C1
        Length = 383

 Score = 146 (56.5 bits), Expect = 1.2e-08, Sum P(2) = 1.2e-08
 Identities = 43/151 (28%), Positives = 68/151 (45%)

Query:    37 RNPRWTRQETIVLIQGKRVVEDR-IR---GFRTSTSA--FRSDH-SEPKWDSVASYCKQY 89
             R   WT  ET+VLI+ KR+ ++R +R   G          RS+  +E +W  +  YC + 
Sbjct:    15 RKGNWTLNETMVLIEAKRMDDERRMRRSIGLPPPEQQQDIRSNKPAELRWKWIEDYCWRK 74

Query:    90 GVNRRPVQCRKRWGNLLVDFRKIKRWESQMKE----------------EKQSFWVMRNES 133
             G  R   QC  +W NL+ D++K++ +E +  E                E  S+W M    
Sbjct:    75 GCMRSQNQCNDKWDNLMRDYKKVREYERRRVESSITAGESSSSSAPAGETASYWKMEKSE 134

Query:   134 RKQMKLPGYFDREVYDVLDGVLAMPAVPLTT 164
             RK+  LP     + Y  L  V+    +P +T
Sbjct:   135 RKERSLPSNMLPQTYQALFEVVESKTLPSST 165

 Score = 48 (22.0 bits), Expect = 1.2e-08, Sum P(2) = 1.2e-08
 Identities = 11/47 (23%), Positives = 24/47 (51%)

Query:   273 RNSSMLNTQLEAQNINCQLDREQKKEHSDNLIAAMNKLTDALLRIGN 319
             R+  ++N Q     I  + + E  +E  + L+ A+NKL  ++  + +
Sbjct:   332 RHKEVMNVQERRLKIE-ESNVEMNREGMNGLVEAINKLASSIFALAS 377


>TAIR|locus:2165331 [details] [associations]
            symbol:AT5G51800 species:3702 "Arabidopsis thaliana"
            [GO:0005634 "nucleus" evidence=ISM] [GO:0016772 "transferase
            activity, transferring phosphorus-containing groups" evidence=IEA]
            [GO:0048445 "carpel morphogenesis" evidence=RCA] InterPro:IPR000719
            InterPro:IPR011009 Pfam:PF00069 GO:GO:0005524 EMBL:CP002688
            GenomeReviews:BA000015_GR EMBL:AB010074 SUPFAM:SSF56112
            GO:GO:0004672 IPI:IPI00547167 RefSeq:NP_199993.1 UniGene:At.29659
            ProteinModelPortal:Q9FLH9 SMR:Q9FLH9 PRIDE:Q9FLH9
            EnsemblPlants:AT5G51800.1 GeneID:835255 KEGG:ath:AT5G51800
            TAIR:At5g51800 eggNOG:NOG308341 HOGENOM:HOG000090978
            InParanoid:Q9FLH9 OMA:LWLARAW PhylomeDB:Q9FLH9
            ProtClustDB:CLSN2687432 Genevestigator:Q9FLH9 Uniprot:Q9FLH9
        Length = 972

 Score = 135 (52.6 bits), Expect = 1.0e-05, P = 1.0e-05
 Identities = 35/117 (29%), Positives = 56/117 (47%)

Query:    38 NPRWTRQETIVLIQGKRV-VEDRIRGFRTSTSAFRSDHSEPKWDSVASYCKQYGVNRRPV 96
             +P W   E + L +  R   + +  G  + +   R      K   VA Y  ++G+NR   
Sbjct:   148 SPVWKPNEMLWLARAWRAQYQTQGTGSGSGSVEGRGKTRAEKDREVAEYLNRHGINRDSK 207

Query:    97 QCRKRWGNLLVDFRKIKRWESQMKEEK--QSFWVMRNESRKQMKLPGYFDREVYDVL 151
                 +W N+L +FRK+  WE    ++K  +S++ +    RKQ +LP  FD EVY  L
Sbjct:   208 IAGTKWDNMLGEFRKVYEWEKCGDQDKYGKSYFRLSPYERKQHRLPASFDEEVYQEL 264


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.313   0.127   0.369    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      321       304   0.00098  115 3  11 23  0.45    34
                                                     33  0.45    37


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  5
  No. of states in DFA:  604 (64 KB)
  Total size of DFA:  224 KB (2123 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  28.28u 0.15s 28.43t   Elapsed:  00:00:01
  Total cpu time:  28.29u 0.15s 28.44t   Elapsed:  00:00:01
  Start:  Sat May 11 13:02:00 2013   End:  Sat May 11 13:02:01 2013

Back to top