BLAST Results

Query Summary

Your job contains 1 sequence.

Parameters
Threshold: 0.001
Maximum number of alignments shown: 100
BLAST filter: on

Query Sequence

>021795
MHAKTDSEGTSINDATWPPRSPPRRPIYYVQSPSHPDVEKMSYGSSPMGSPAHHYYHCSP
IHHSRESSTSRFSASLKNPRGVSAWRHVQLDHKDGDGDGDDEEMDGRDEGSGRNVRLYVC
IGFFFVLLFTVFCLILWGASKPYKPKIIVKNIVFENFNVQAGSDESGVPTDMLSLNSTVK
ILYRNPATFFAVHVTSTPLELHYFQLKLASGQMKKFSQSRKSQRNVVTVVQGYQVPLYGG
VPVLASAKGHLDRAEVPLNLTFVMRSRAYILGRLVKSKFYRRIRCSVTLRGNKLGKPLNL
TNACFYQ

High Scoring Gene Products

Symbol, full name Information P value
AT2G41990 protein from Arabidopsis thaliana 3.2e-72
AT4G35170 protein from Arabidopsis thaliana 7.3e-59
AT1G45688 protein from Arabidopsis thaliana 4.1e-52
AT5G42860 protein from Arabidopsis thaliana 2.4e-50
AT3G08490 protein from Arabidopsis thaliana 1.2e-05

Back to top

Raw Blast Data

BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]

Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.

Reference:  Gish, W. (1996-2006) http://blast.wustl.edu

Query=  021795
        (307 letters)

Database:  go_20130330-seqdb.fasta
           368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done

                                                                     Smallest
                                                                       Sum
                                                              High  Probability
Sequences producing High-scoring Segment Pairs:              Score  P(N)      N

TAIR|locus:2064637 - symbol:AT2G41990 "AT2G41990" species...   730  3.2e-72   1
TAIR|locus:2132811 - symbol:AT4G35170 "AT4G35170" species...   604  7.3e-59   1
TAIR|locus:2825837 - symbol:AT1G45688 "AT1G45688" species...   333  4.1e-52   3
TAIR|locus:2160026 - symbol:AT5G42860 "AT5G42860" species...   325  2.4e-50   3
TAIR|locus:2103454 - symbol:AT3G08490 "AT3G08490" species...   123  1.2e-05   2


>TAIR|locus:2064637 [details] [associations]
            symbol:AT2G41990 "AT2G41990" species:3702 "Arabidopsis
            thaliana" [GO:0003674 "molecular_function" evidence=ND] [GO:0005575
            "cellular_component" evidence=ND] [GO:0008150 "biological_process"
            evidence=ND] InterPro:IPR004864 EMBL:CP002685
            GenomeReviews:CT485783_GR EMBL:U90439 Pfam:PF03168 EMBL:BT024714
            IPI:IPI00520778 PIR:F84848 RefSeq:NP_181730.1 UniGene:At.28028
            UniGene:At.68133 ProteinModelPortal:P93747 PRIDE:P93747
            EnsemblPlants:AT2G41990.1 GeneID:818799 KEGG:ath:AT2G41990
            TAIR:At2g41990 eggNOG:NOG322550 HOGENOM:HOG000240810
            InParanoid:P93747 OMA:YKSIRER PhylomeDB:P93747
            ProtClustDB:CLSN2683806 ArrayExpress:P93747 Genevestigator:P93747
            Uniprot:P93747
        Length = 297

 Score = 730 (262.0 bits), Expect = 3.2e-72, P = 3.2e-72
 Identities = 157/307 (51%), Positives = 193/307 (62%)

Query:     1 MHAKTDSEGTSINDATWXXXXXXXXXIYYVQSPSHPDVEKMSYGS--SPMGSPAH-HYYH 57
             MHAKTDSE TSI+ A           +YYVQSPS+ DVEKMS+GS  S MGSP H HYYH
Sbjct:     1 MHAKTDSEATSIDAAALSPPRSAIRPLYYVQSPSNHDVEKMSFGSGCSLMGSPTHPHYYH 60

Query:    58 CSPIHHXXXXXXXXXXXXLKNPRGVSAWRHVQLDHKXXXXXXXXXXXXXXXXXXXRNVRL 117
             CSPIHH              + R + +++ ++ + +                   RNVRL
Sbjct:    61 CSPIHHSRESSTSRF-----SDRALLSYKSIR-ERRRYINDGDDKTDGGDDDDPFRNVRL 114

Query:   118 YVCIGXXXXXXXXXXCLILWGASKPYKPKIIVKNIVFENFNVQAGSDESGVPTDMLSLNS 177
             YV +            LILWGASK Y PK+ VK ++  + N+QAG+D SGVPTDMLSLNS
Sbjct:   115 YVWLLLSVIFLFTVFSLILWGASKSYPPKVTVKGMLVRDLNLQAGNDLSGVPTDMLSLNS 174

Query:   178 TVKILYRNPATFFAVHVTSTPLELHYFQLKLASGQMKKFSQSRKSQRNVVTVVQGYQVPL 237
             TV+I YRNP+TFFAVHVT++PL LHY  L L+SG+M KF+  R  + NVVTVVQG+Q+PL
Sbjct:   175 TVRIYYRNPSTFFAVHVTASPLLLHYSNLLLSSGEMNKFTVGRNGETNVVTVVQGHQIPL 234

Query:   238 YGGVPVLASAKGHLDRAEVPLNLTFVMRSRAYILGRLVKSKFYRRIRCSVTLRGNKLGKP 297
             YGGV        HLD   +PLNLT V+ S+AYILGRLV SKFY RI CS TL  N L K 
Sbjct:   235 YGGVSF------HLDTLSLPLNLTIVLHSKAYILGRLVTSKFYTRIICSFTLDANHLPKS 288

Query:   298 LNLTNAC 304
             ++L  +C
Sbjct:   289 ISLLRSC 295


>TAIR|locus:2132811 [details] [associations]
            symbol:AT4G35170 "AT4G35170" species:3702 "Arabidopsis
            thaliana" [GO:0003674 "molecular_function" evidence=ND] [GO:0005634
            "nucleus" evidence=ISM] [GO:0008150 "biological_process"
            evidence=ND] InterPro:IPR004864 EMBL:CP002687
            GenomeReviews:CT486007_GR Pfam:PF03168 UniGene:At.31436
            UniGene:At.70694 HOGENOM:HOG000240810 EMBL:AY924829 EMBL:DQ132714
            IPI:IPI00542280 RefSeq:NP_195243.2 UniGene:At.69097
            ProteinModelPortal:Q5BPJ9 EnsemblPlants:AT4G35170.1 GeneID:829670
            KEGG:ath:AT4G35170 TAIR:At4g35170 eggNOG:NOG259526 OMA:NIKCSIT
            PhylomeDB:Q5BPJ9 ProtClustDB:CLSN2913970 Genevestigator:Q5BPJ9
            Uniprot:Q5BPJ9
        Length = 299

 Score = 604 (217.7 bits), Expect = 7.3e-59, P = 7.3e-59
 Identities = 134/283 (47%), Positives = 169/283 (59%)

Query:    27 IYYVQSPSHPDVEKMSYGS--SPMGSPAHHYYHCSPI-HHXXXXXXXXXXXX--LKNPRG 81
             +Y V SP + DV+K+S GS  SP GSP +     S   HH              L+N   
Sbjct:    17 VYVVHSPPNTDVDKISTGSGFSPFGSPLNDQGQVSNFQHHSVAESSSYPRSSGPLRNEYS 76

Query:    82 VSAWRHVQLDHKXXXXXXXXXXXXXXXXXXXRNVRLYVCIGXXXXXXXXXXCLILWGASK 141
              S   H  LD +                   R  R Y C+           CLILWG SK
Sbjct:    77 -SVQVH-DLDRRTHEDEDYDEMDGPDEKRR-RITRFYSCLLFTLVLAFTLFCLILWGVSK 133

Query:   142 PYKPKIIVKNIVFENFNVQAGSDESGVPTDMLSLNSTVKILYRNPATFFAVHVTSTPLEL 201
              + P   +K +V EN NVQ+G+D+SGV TDML+LNSTV+ILYRNPATFF VHVTS PL+L
Sbjct:   134 SFAPIATLKEMVLENLNVQSGNDQSGVLTDMLTLNSTVRILYRNPATFFTVHVTSAPLQL 193

Query:   202 HYFQLKLASGQMKKFSQSRKSQRNVVTVVQGYQVPLYGGVPVLASAKGHLDRAEVPLNLT 261
              Y QL LASGQM +FSQ RKS+R + T V G Q+PLYGGVP L   +   D+  +PLNLT
Sbjct:   194 SYSQLILASGQMGEFSQRRKSERIIETKVFGDQIPLYGGVPALFGQRAEPDQVVLPLNLT 253

Query:   262 FVMRSRAYILGRLVKSKFYRRIRCSVTLRGNKLGKPLNLTNAC 304
             F +R+RAY+LGRLVK+ F+  I+CS+T  G+KLGK L+L+ +C
Sbjct:   254 FTLRARAYVLGRLVKTTFHSNIKCSITFYGDKLGKTLDLSKSC 296


>TAIR|locus:2825837 [details] [associations]
            symbol:AT1G45688 "AT1G45688" species:3702 "Arabidopsis
            thaliana" [GO:0003674 "molecular_function" evidence=ND] [GO:0008150
            "biological_process" evidence=ND] [GO:0009507 "chloroplast"
            evidence=ISM] [GO:0005886 "plasma membrane" evidence=IDA]
            [GO:0048767 "root hair elongation" evidence=RCA] EMBL:CP002684
            GenomeReviews:CT485782_GR GO:GO:0005886 EMBL:AC083835
            HOGENOM:HOG000240810 EMBL:AY099826 EMBL:AY088597 EMBL:BT000323
            EMBL:AK221686 IPI:IPI00531270 PIR:A96511 RefSeq:NP_564495.1
            UniGene:At.22272 ProteinModelPortal:Q9C636 PRIDE:Q9C636
            EnsemblPlants:AT1G45688.1 GeneID:841102 KEGG:ath:AT1G45688
            TAIR:At1g45688 eggNOG:NOG330557 InParanoid:Q9C636 OMA:KECAVIE
            PhylomeDB:Q9C636 ProtClustDB:CLSN2686629 ArrayExpress:Q9C636
            Genevestigator:Q9C636 Uniprot:Q9C636
        Length = 342

 Score = 333 (122.3 bits), Expect = 4.1e-52, Sum P(3) = 4.1e-52
 Identities = 64/115 (55%), Positives = 86/115 (74%)

Query:   134 LILWGASKPYKPKIIVKNIVFENFNVQAGSDESGVPTDMLSLNSTVKILYRNPATFFAVH 193
             LIL+GA+KP KPKI VK+I FE   +QAG D  GV TDM+++N+T+++LYRN  TFF VH
Sbjct:   147 LILYGAAKPMKPKITVKSITFETLKIQAGQDAGGVGTDMITMNATLRMLYRNTGTFFGVH 206

Query:   194 VTSTPLELHYFQLKLASGQMKKFSQSRKSQRNVVTVVQGYQVPLYG-GVPVLASA 247
             VTSTP++L + Q+K+ SG +KKF Q RKS+R V+  V G ++PLYG G  +L  A
Sbjct:   207 VTSTPIDLSFSQIKIGSGSVKKFYQGRKSERTVLVHVIGEKIPLYGSGSTLLPPA 261

 Score = 136 (52.9 bits), Expect = 4.1e-52, Sum P(3) = 4.1e-52
 Identities = 24/51 (47%), Positives = 35/51 (68%)

Query:   254 AEVPLNLTFVMRSRAYILGRLVKSKFYRRIRCSVTLRGNKLGKPLNLTNAC 304
             A VP+ L+FV+RSRAY+LG+LV+ KFY++I C +      L K + +T  C
Sbjct:   287 APVPMTLSFVVRSRAYVLGKLVQPKFYKKIECDINFEHKNLNKHIVITKNC 337

 Score = 100 (40.3 bits), Expect = 4.1e-52, Sum P(3) = 4.1e-52
 Identities = 35/85 (41%), Positives = 38/85 (44%)

Query:     1 MHAKTDSEGTSINDATWXXXXXXXXXIYYVQSPSHP--DVEKM--SYGS----SPMGSPA 52
             MHAKTDSE TS+  A           +YYVQSPS    D EK   S+ S    SPMGSP 
Sbjct:     1 MHAKTDSEVTSL--AASSPARSPRRPVYYVQSPSRDSHDGEKTATSFHSTPVLSPMGSPP 58

Query:    53 HHYYHCSPIHHXXXXXXXXXXXXLK 77
             H   H S   H            LK
Sbjct:    59 HS--HSSMGRHSRESSSSRFSGSLK 81


>TAIR|locus:2160026 [details] [associations]
            symbol:AT5G42860 "AT5G42860" species:3702 "Arabidopsis
            thaliana" [GO:0003674 "molecular_function" evidence=ND] [GO:0005634
            "nucleus" evidence=ISM] [GO:0008150 "biological_process"
            evidence=ND] [GO:0005886 "plasma membrane" evidence=IDA]
            GO:GO:0005886 EMBL:CP002688 EMBL:AB008264 ProtClustDB:CLSN2686629
            EMBL:AK227557 IPI:IPI00525741 RefSeq:NP_199100.1 UniGene:At.6518
            UniGene:At.71339 ProteinModelPortal:Q9FMN3
            EnsemblPlants:AT5G42860.1 GeneID:834297 KEGG:ath:AT5G42860
            TAIR:At5g42860 InParanoid:Q9FMN3 OMA:SGSIKKF PhylomeDB:Q9FMN3
            Genevestigator:Q9FMN3 Uniprot:Q9FMN3
        Length = 320

 Score = 325 (119.5 bits), Expect = 2.4e-50, Sum P(3) = 2.4e-50
 Identities = 61/106 (57%), Positives = 81/106 (76%)

Query:   134 LILWGASKPYKPKIIVKNIVFENFNVQAGSDESGVPTDMLSLNSTVKILYRNPATFFAVH 193
             LIL+ A+KP KPKI VK+I FE   VQAG D  G+ TDM+++N+T+++LYRN  TFF VH
Sbjct:   126 LILYAAAKPQKPKISVKSITFEQLKVQAGQDAGGIGTDMITMNATLRMLYRNTGTFFGVH 185

Query:   194 VTSTPLELHYFQLKLASGQMKKFSQSRKSQRNVVTVVQGYQVPLYG 239
             VTS+P++L + Q+ + SG +KKF QSRKSQR VV  V G ++PLYG
Sbjct:   186 VTSSPIDLSFSQITIGSGSIKKFYQSRKSQRTVVVNVLGDKIPLYG 231

 Score = 141 (54.7 bits), Expect = 2.4e-50, Sum P(3) = 2.4e-50
 Identities = 26/51 (50%), Positives = 35/51 (68%)

Query:   254 AEVPLNLTFVMRSRAYILGRLVKSKFYRRIRCSVTLRGNKLGKPLNLTNAC 304
             A VP+ L F +RSRAY+LG+LV+ KFY+RI C +     KL K + +TN C
Sbjct:   265 APVPMRLNFTVRSRAYVLGKLVQPKFYKRIVCLINFEHKKLSKHIPITNNC 315

 Score = 86 (35.3 bits), Expect = 2.4e-50, Sum P(3) = 2.4e-50
 Identities = 28/64 (43%), Positives = 35/64 (54%)

Query:     1 MHAKTDSEGTSINDATWXXXXXXXXXIYYVQSPSHP--DVEKM--SYGS-----SPMGSP 51
             MHAKTDSE TS++ ++           Y+VQSPS    D EK   S+ S     SPMGSP
Sbjct:     1 MHAKTDSEVTSLSASS--PTRSPRRPAYFVQSPSRDSHDGEKTATSFHSTPVLTSPMGSP 58

Query:    52 AHHY 55
              H +
Sbjct:    59 PHSH 62


>TAIR|locus:2103454 [details] [associations]
            symbol:AT3G08490 "AT3G08490" species:3702 "Arabidopsis
            thaliana" [GO:0003674 "molecular_function" evidence=ND] [GO:0005634
            "nucleus" evidence=ISM] [GO:0008150 "biological_process"
            evidence=ND] EMBL:CP002686 GenomeReviews:BA000014_GR EMBL:AC074395
            EMBL:AY600564 EMBL:AY649300 IPI:IPI00533756 RefSeq:NP_187462.1
            UniGene:At.50830 ProteinModelPortal:Q9C6U2
            EnsemblPlants:AT3G08490.1 GeneID:819996 KEGG:ath:AT3G08490
            TAIR:At3g08490 eggNOG:NOG296339 HOGENOM:HOG000239860
            InParanoid:Q9C6U2 OMA:ERREDHY PhylomeDB:Q9C6U2
            ProtClustDB:CLSN2684947 ArrayExpress:Q9C6U2 Genevestigator:Q9C6U2
            Uniprot:Q9C6U2
        Length = 271

 Score = 123 (48.4 bits), Expect = 1.2e-05, Sum P(2) = 1.2e-05
 Identities = 36/157 (22%), Positives = 71/157 (45%)

Query:   134 LILWGASKPYKPKIIVKNIVFENFNVQAGSDESGVPTDMLSLNSTVKILYRNPATFFAVH 193
             L+ + A++P  P I  +   F  F ++ G D  GV T  L+ N + K++  N +  F +H
Sbjct:    98 LVFYIATQPPHPNISFRIGRFNQFMLEEGVDSHGVSTKFLTFNCSTKLIIDNKSNVFGLH 157

Query:   194 VTSTPLELHYFQLKLASGQMKK-FSQSRKSQRNVVTVVQGYQVPLYGGVPVLASAKGHLD 252
             +    ++  +  L  A  Q  K +  S +S    + +    +  +YG    +      L 
Sbjct:   158 IHPPSIKFFFGPLNFAKAQGPKLYGLSHESTTFQLYIATTNRA-MYGAGTEMNDML--LS 214

Query:   253 RAEVPLNLTFVMRSRAYILGRLVKSKFYRRIRCSVTL 289
             RA +PL L   + S   ++  ++  K++ ++ C + L
Sbjct:   215 RAGLPLILRTSIISDYRVVWNIINPKYHHKVECLLLL 251

 Score = 36 (17.7 bits), Expect = 1.2e-05, Sum P(2) = 1.2e-05
 Identities = 10/23 (43%), Positives = 13/23 (56%)

Query:    28 YYVQSPSHPDVEKMSYGSSPMGS 50
             Y+VQSPS    +  S   SP+ S
Sbjct:    11 YFVQSPSTVFHDPESEFQSPIRS 33


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.321   0.135   0.416    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      307       257   0.00086  114 3  11 22  0.43    33
                                                     32  0.48    36


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  5
  No. of states in DFA:  612 (65 KB)
  Total size of DFA:  195 KB (2110 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  19.48u 0.09s 19.57t   Elapsed:  00:00:01
  Total cpu time:  19.48u 0.09s 19.57t   Elapsed:  00:00:01
  Start:  Fri May 10 12:46:21 2013   End:  Fri May 10 12:46:22 2013

Back to top