BLAST Results

Query Summary

Your job contains 1 sequence.

Parameters
Threshold: 0.001
Maximum number of alignments shown: 100
BLAST filter: on

Query Sequence

>008526
MERDSSDDDDDRETLIHQNDTKHGNHRLPTSDNNEDEEHNRRHSTFHIDDFPNAPPIRRR
FTFDFKKLNNKRYLFALSLPLLIILLYFSVNLRSLFSGNYVNFRFDSLADRMRESELRAL
SLLKQQQSHLLSLWNQSFVNNSYGNNTNNPFFQEAKSVLLNQISLNRQIEQILLSPHKVS
NFTPNDAVWGLESCRKIDSIIPNKRTVEWKPKSDKFLFAICLSGQMSNHLICLEKHMFLA
ALLNRVLVIPSSKFDYQYSRVLDIEHINDCLGRKVVVSFENFMEMEKNHAHIDRFLCYFG
LPQPCFVDDEHIKKLKQLGISMGKTETVWKNEDTRKPSKRTVQDIEGKFKTDDDVIAVGD
LFYADVERDWVMQPGGPINHRCKTLIEPSRLIMVTAQRFVQTFLGSNFIALHFRRHGFLK
FCNAKKPSCFYPIPQAADCITRLAERAKAPVIYLSTDAAESETSLLQSLVVLNGKTIALV
KRPPRNSAEKWDSLLYRHHLEDDSQVEAMLDKTICAMSNVFIGASGSTFTEDIMRLRKDW
GSTSLCDEYLCQGEEPNFIAEDE

High Scoring Gene Products

Symbol, full name Information P value
AT5G50420 protein from Arabidopsis thaliana 1.5e-184
AT1G17270 protein from Arabidopsis thaliana 1.4e-172
pad-2
GDP-fucose protein O-fucosyltransferase 2
protein from Caenorhabditis elegans 0.00022
pad-2 gene from Caenorhabditis elegans 0.00022

Back to top

Raw Blast Data

BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]

Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.

Reference:  Gish, W. (1996-2006) http://blast.wustl.edu

Query=  008526
        (563 letters)

Database:  go_20130330-seqdb.fasta
           368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done

                                                                     Smallest
                                                                       Sum
                                                              High  Probability
Sequences producing High-scoring Segment Pairs:              Score  P(N)      N

TAIR|locus:2177492 - symbol:AT5G50420 "AT5G50420" species...  1790  1.5e-184  1
TAIR|locus:2020432 - symbol:AT1G17270 "AT1G17270" species...  1677  1.4e-172  1
UNIPROTKB|Q8WR51 - symbol:pad-2 "GDP-fucose protein O-fuc...    96  0.00022   2
WB|WBGene00010757 - symbol:pad-2 species:6239 "Caenorhabd...    96  0.00022   2


>TAIR|locus:2177492 [details] [associations]
            symbol:AT5G50420 "AT5G50420" species:3702 "Arabidopsis
            thaliana" [GO:0003674 "molecular_function" evidence=ND] [GO:0005634
            "nucleus" evidence=ISM] [GO:0008150 "biological_process"
            evidence=ND] [GO:0007020 "microtubule nucleation" evidence=RCA]
            EMBL:CP002688 GO:GO:0016757 EMBL:AB012248 UniGene:At.63362
            KO:K03691 InterPro:IPR019378 Pfam:PF10250 CAZy:GT68
            ProtClustDB:CLSN2682132 EMBL:BT030356 IPI:IPI00543572
            RefSeq:NP_199853.1 UniGene:At.29731 ProteinModelPortal:Q9FK30
            STRING:Q9FK30 PRIDE:Q9FK30 EnsemblPlants:AT5G50420.1 GeneID:835110
            KEGG:ath:AT5G50420 InParanoid:Q9FK30 OMA:YLCQGEL
            Genevestigator:Q9FK30 Uniprot:Q9FK30
        Length = 566

 Score = 1790 (635.2 bits), Expect = 1.5e-184, P = 1.5e-184
 Identities = 349/559 (62%), Positives = 424/559 (75%)

Query:    15 LIHQNDTKHGNHRLPTSDNNEDEEHNRRHSTFHIDDFPNAPPIRRRFTFDFKKLNNKRXX 74
             LI QNDT+   HR  +  +N       + S F IDD  +    R + +     LN +   
Sbjct:    15 LIPQNDTRI-RHREDSVSSNATTIGGNQRSAFQIDDILHRVQHRGKIS-----LNKRYVI 68

Query:    75 XXXXXXXXXXXXXXXVNLRSLFSGNYVNFRFDSLADRMRESELRAXXXXXXXXXXXXXXW 134
                             + R LF+ N+ +F+ D L++R++ESELRA              W
Sbjct:    69 VFVSLIISIGLLFLLTDPRELFAANFSSFKLDPLSNRVKESELRALYLLRQQQLALLSLW 128

Query:   135 NQSFVNNSYGNNTN----NPFFQEAKSVLLNQISLNRQIEQILLSPHKVSNF---TPNDA 187
             N + VN S   + N    +  F++ KS +  QISLN++I+++LLSPH+ SN+   T  D+
Sbjct:   129 NGTLVNPSLNQSENALGSSVLFEDVKSAVSKQISLNKEIQEVLLSPHRSSNYSGGTDVDS 188

Query:   188 V-WGLESCRKIDSIIPNKRTVEWKPKSDKFLFAICLSGQMSNHLICLEKHMFLAALLNRV 246
             V +    CRK+D  + +++TVEWKP+SDKFLFAICLSGQMSNHLICLEKHMF AALL+RV
Sbjct:   189 VNFSYNRCRKVDQKLSDRKTVEWKPRSDKFLFAICLSGQMSNHLICLEKHMFFAALLDRV 248

Query:   247 LVIPSSKFDYQYSRVLDIEHINDCLGRKVVVSFENFMEM-EKNHAHIDRFLCYFGLPQPC 305
             LVIPSSKFDYQY RV+DIE IN CLGR VVV+F+ F E  +KNH  IDRF+CYF  PQ C
Sbjct:   249 LVIPSSKFDYQYDRVIDIERINTCLGRNVVVAFDQFKEKAKKNHFRIDRFICYFSSPQLC 308

Query:   306 FVDDEHIKKLKQLGISM-GKTETVWKNEDTRKPSKRTVQDIEGKFKTDDDVIAVGDLFYA 364
             +VD+EHIKKLK LGIS+ GK E  W +ED +KPSKRTVQD++ KFK+DDDVIA+GD+FYA
Sbjct:   309 YVDEEHIKKLKGLGISIDGKLEAPW-SEDIKKPSKRTVQDVQMKFKSDDDVIAIGDVFYA 367

Query:   365 DVERDWVMQPGGPINHRCKTLIEPSRLIMVTAQRFVQTFLGSNFIALHFRRHGFLKFCNA 424
             D+E+DWVMQPGGPINH+CKTLIEPS+LI++TAQRF+QTFLG NFIALHFRRHGFLKFCNA
Sbjct:   368 DMEQDWVMQPGGPINHKCKTLIEPSKLILLTAQRFIQTFLGKNFIALHFRRHGFLKFCNA 427

Query:   425 KKPSCFYPIPQAADCITRLAERAKAPVIYLSTDAAESETSLLQSLVVLNGKTIALVKRPP 484
             K PSCFYPIPQAA+CI R+ ER+   VIYLSTDAAESETSLLQSLVV++GK + LVKRPP
Sbjct:   428 KSPSCFYPIPQAAECIARIVERSNGAVIYLSTDAAESETSLLQSLVVVDGKIVPLVKRPP 487

Query:   485 RNSAEKWDSLLYRHHLEDDSQVEAMLDKTICAMSNVFIGASGSTFTEDIMRLRKDWGSTS 544
             RNSAEKWD+LLYRH +EDDSQV+AMLDKTICAMS+VFIGASGSTFTEDI+RLRKDWG++S
Sbjct:   488 RNSAEKWDALLYRHGIEDDSQVDAMLDKTICAMSSVFIGASGSTFTEDILRLRKDWGTSS 547

Query:   545 LCDEYLCQGEEPNFIAEDE 563
              CDEYLC+GEEPNFIAEDE
Sbjct:   548 TCDEYLCRGEEPNFIAEDE 566


>TAIR|locus:2020432 [details] [associations]
            symbol:AT1G17270 "AT1G17270" species:3702 "Arabidopsis
            thaliana" [GO:0003674 "molecular_function" evidence=ND] [GO:0005634
            "nucleus" evidence=ISM] [GO:0008150 "biological_process"
            evidence=ND] EMBL:CP002684 GenomeReviews:CT485782_GR GO:GO:0016757
            KO:K03691 InterPro:IPR019378 Pfam:PF10250 CAZy:GT68 EMBL:BT002770
            IPI:IPI00527212 RefSeq:NP_173170.2 UniGene:At.41844
            ProteinModelPortal:Q84WU0 PRIDE:Q84WU0 EnsemblPlants:AT1G17270.1
            GeneID:838298 KEGG:ath:AT1G17270 TAIR:At1g17270 eggNOG:NOG314494
            HOGENOM:HOG000242390 InParanoid:Q84WU0 OMA:RMVERAN PhylomeDB:Q84WU0
            ProtClustDB:CLSN2682132 Genevestigator:Q84WU0 Uniprot:Q84WU0
        Length = 564

 Score = 1677 (595.4 bits), Expect = 1.4e-172, P = 1.4e-172
 Identities = 333/561 (59%), Positives = 417/561 (74%)

Query:    15 LIHQNDTKHGNHRL-PTSDN-NEDEEHNRR-HSTFHIDDFPNAPPIRRRFTFDFKKLNNK 71
             LI QNDT+  +  L P +   N      R   S   ID+  +    R R+      +N +
Sbjct:    15 LIPQNDTRDNDLNLRPDARTVNMANGGGRSPRSALQIDEILSRA--RNRWKIS---VNKR 69

Query:    72 RXXXXXXXXXXXXXXXXXVNLRSLFSGNYVNFRFDSLADRMRESELRAXXXXXXXXXXXX 131
                                + R+ FS    +F+ D ++ R++ESEL+A            
Sbjct:    70 YVVAAVSLTLFVGLLFLFTDTRTFFS----SFKLDPMSSRVKESELQALNLLRQQQLALV 125

Query:   132 XXWNQSFVNNSYGNNTNNPFFQEAKSVLLNQISLNRQIEQILLSPHKVSNFT----PNDA 187
                N++  N+S   +++       K+ LL QIS+N++IE++LLSPH+  N++     +D+
Sbjct:   126 SLLNRTNFNSSNAISSS-VVIDNVKAALLKQISVNKEIEEVLLSPHRTGNYSITASGSDS 184

Query:   188 VWG---LESCRKIDSIIPNKRTVEWKPKSDKFLFAICLSGQMSNHLICLEKHMFLAALLN 244
               G    + CRK+D  + +++T+EWKP+ DKFLFAICLSGQMSNHLICLEKHMF AALL+
Sbjct:   185 FTGSYNADICRKVDQKLLDRKTIEWKPRPDKFLFAICLSGQMSNHLICLEKHMFFAALLD 244

Query:   245 RVLVIPSSKFDYQYSRVLDIEHINDCLGRKVVVSFENFMEMEK-NHAHIDRFLCYFGLPQ 303
             RVLVIPSSKFDYQY +V+DIE IN CLGR VV+SF+ F E++K N+AHIDRF+CY   PQ
Sbjct:   245 RVLVIPSSKFDYQYDKVIDIERINTCLGRTVVISFDQFKEIDKKNNAHIDRFICYVSSPQ 304

Query:   304 PCFVDDEHIKKLKQLGISMG-KTETVWKNEDTRKPSKRTVQDIEGKFKTDDDVIAVGDLF 362
             PC+VD++HIKKLK LG+S+G K E  W +ED +KP+KRT Q++  KFK+DD VIA+GD+F
Sbjct:   305 PCYVDEDHIKKLKGLGVSIGGKLEAPW-SEDIKKPTKRTSQEVVEKFKSDDGVIAIGDVF 363

Query:   363 YADVERDWVMQPGGPINHRCKTLIEPSRLIMVTAQRFVQTFLGSNFIALHFRRHGFLKFC 422
             YAD+E+D VMQPGGPINH+CKTLIEPSRLI+VTAQRF+QTFLG NFI+LH RRHGFLKFC
Sbjct:   364 YADMEQDLVMQPGGPINHKCKTLIEPSRLILVTAQRFIQTFLGKNFISLHLRRHGFLKFC 423

Query:   423 NAKKPSCFYPIPQAADCITRLAERAKAPVIYLSTDAAESETSLLQSLVVLNGKTIALVKR 482
             NAK PSCFYPIPQAADCI+R+ ERA APVIYLSTDAAESET LLQSLVV++GK + LVKR
Sbjct:   424 NAKSPSCFYPIPQAADCISRMVERANAPVIYLSTDAAESETGLLQSLVVVDGKVVPLVKR 483

Query:   483 PPRNSAEKWDSLLYRHHLEDDSQVEAMLDKTICAMSNVFIGASGSTFTEDIMRLRKDWGS 542
             PP+NSAEKWDSLLYRH +EDDSQV AMLDKTICAMS+VFIGASGSTFTEDI+RLRKDWG+
Sbjct:   484 PPQNSAEKWDSLLYRHGIEDDSQVYAMLDKTICAMSSVFIGASGSTFTEDILRLRKDWGT 543

Query:   543 TSLCDEYLCQGEEPNFIAEDE 563
             +S+CDEYLC+GEEPNFIAE+E
Sbjct:   544 SSMCDEYLCRGEEPNFIAENE 564


>UNIPROTKB|Q8WR51 [details] [associations]
            symbol:pad-2 "GDP-fucose protein O-fucosyltransferase 2"
            species:6239 "Caenorhabditis elegans" [GO:0046922
            "peptide-O-fucosyltransferase activity" evidence=ISS] [GO:0036066
            "protein O-linked fucosylation" evidence=ISS] [GO:0003674
            "molecular_function" evidence=ND] [GO:0005575 "cellular_component"
            evidence=ND] UniPathway:UPA00378 GO:GO:0005783 GO:GO:0009792
            GO:GO:0005794 GO:GO:0018991 GO:GO:0010171 GO:GO:0007283
            GO:GO:0040025 GO:GO:0006004 GO:GO:0046922 InterPro:IPR019378
            Pfam:PF10250 EMBL:AF455271 EMBL:Z36282 RefSeq:NP_001255070.1
            UniGene:Cel.9168 ProteinModelPortal:Q8WR51 SMR:Q8WR51 CAZy:GT68
            PaxDb:Q8WR51 EnsemblMetazoa:K10G9.3a GeneID:259529
            KEGG:cel:CELE_K10G9.3 UCSC:K10G9.3 CTD:259529 WormBase:K10G9.3
            eggNOG:NOG77810 GeneTree:ENSGT00390000007989 HOGENOM:HOG000015874
            InParanoid:Q8WR51 NextBio:952052 Uniprot:Q8WR51
        Length = 424

 Score = 96 (38.9 bits), Expect = 0.00022, Sum P(2) = 0.00022
 Identities = 26/81 (32%), Positives = 44/81 (54%)

Query:   395 TAQRFVQTFLGSNFIALHFRRHGFLKFCNAKKPSCFYPIPQAADCITRLAERAKAPVIYL 454
             T ++  +T +G  ++ +H+RR  FL    A+ P+    IP  A  +  L ++     IYL
Sbjct:   271 TKEKPRRTAIGGPYLGIHWRRRDFLYARRAQLPT----IPGTAKILQDLCKKLDLQKIYL 326

Query:   455 STDAAESETSLLQSLVVLNGK 475
             +TDA + E   L++L  LNG+
Sbjct:   327 ATDAPDQEVDELKAL--LNGE 345

 Score = 71 (30.1 bits), Expect = 0.00022, Sum P(2) = 0.00022
 Identities = 21/56 (37%), Positives = 30/56 (53%)

Query:   502 DDSQVEAMLDKTICAMSNVFIGASGSTFTEDIMRLRKDWG-STSLCDEYLCQGEEP 556
             +D Q+ A++D+ +CA +  FIG+  STFT  I   R+  G   S     LC   EP
Sbjct:   358 NDGQI-AIIDQYLCAHAAYFIGSYESTFTFRIQEDREIIGFPISTTFNRLCPDTEP 412


>WB|WBGene00010757 [details] [associations]
            symbol:pad-2 species:6239 "Caenorhabditis elegans"
            [GO:0009792 "embryo development ending in birth or egg hatching"
            evidence=IMP] [GO:0010171 "body morphogenesis" evidence=IMP]
            [GO:0018991 "oviposition" evidence=IMP] [GO:0007283
            "spermatogenesis" evidence=IMP] [GO:0040025 "vulval development"
            evidence=IMP] GO:GO:0009792 GO:GO:0018991 GO:GO:0010171
            GO:GO:0007283 GO:GO:0040025 InterPro:IPR019378 Pfam:PF10250
            EMBL:Z36282 UniGene:Cel.9168 GeneID:259529 KEGG:cel:CELE_K10G9.3
            CTD:259529 GeneTree:ENSGT00390000007989 OMA:ICAHARF
            RefSeq:NP_001255069.1 ProteinModelPortal:G1K0W1 SMR:G1K0W1
            EnsemblMetazoa:K10G9.3b WormBase:K10G9.3b Uniprot:G1K0W1
        Length = 426

 Score = 96 (38.9 bits), Expect = 0.00022, Sum P(2) = 0.00022
 Identities = 26/81 (32%), Positives = 44/81 (54%)

Query:   395 TAQRFVQTFLGSNFIALHFRRHGFLKFCNAKKPSCFYPIPQAADCITRLAERAKAPVIYL 454
             T ++  +T +G  ++ +H+RR  FL    A+ P+    IP  A  +  L ++     IYL
Sbjct:   273 TKEKPRRTAIGGPYLGIHWRRRDFLYARRAQLPT----IPGTAKILQDLCKKLDLQKIYL 328

Query:   455 STDAAESETSLLQSLVVLNGK 475
             +TDA + E   L++L  LNG+
Sbjct:   329 ATDAPDQEVDELKAL--LNGE 347

 Score = 71 (30.1 bits), Expect = 0.00022, Sum P(2) = 0.00022
 Identities = 21/56 (37%), Positives = 30/56 (53%)

Query:   502 DDSQVEAMLDKTICAMSNVFIGASGSTFTEDIMRLRKDWG-STSLCDEYLCQGEEP 556
             +D Q+ A++D+ +CA +  FIG+  STFT  I   R+  G   S     LC   EP
Sbjct:   360 NDGQI-AIIDQYLCAHAAYFIGSYESTFTFRIQEDREIIGFPISTTFNRLCPDTEP 414


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.321   0.136   0.415    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      563       520   0.00089  119 3  11 22  0.36    34
                                                     35  0.45    37


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  4
  No. of states in DFA:  624 (66 KB)
  Total size of DFA:  336 KB (2169 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  45.27u 0.15s 45.42t   Elapsed:  00:00:02
  Total cpu time:  45.27u 0.15s 45.42t   Elapsed:  00:00:02
  Start:  Thu May  9 23:32:17 2013   End:  Thu May  9 23:32:19 2013

Back to top