BLAST Results

Query Summary

Your job contains 1 sequence.

Parameters
Threshold: 0.001
Maximum number of alignments shown: 100
BLAST filter: on

Query Sequence

>020587
MMSETALRDLNTLPSSDRKNESSSKGSFAKPFVGSANENVDVSLVSTHVNGNQTGNAGPG
IANSEVEYIDSENLIDVEDIDTSVKTLVAGLDSKDWVVVCEALNNVRRLSIFHKEAMLDI
LGDVIPLVVKSLKNPRSAVCKTAIMTAADIFSAYNDRMIDLLDPLLVQLLLKSSQDKRFV
CEAAEKALVAMTTWVSPILLLPKLQPYLKNRNPRIRAKASMCFSRSVPRLGVEGIKEYGI
DKLIQVAASQLSDQLPESREAARTLLLELQSVYEKSHDSAPATVSDSPEMDSWENFCQSK
LSPLSAQAVLRVTNIAREGLVIGS

High Scoring Gene Products

Symbol, full name                                       Information                            P value
AT5G14790                                               protein from Arabidopsis thaliana      2.1e-93
AT3G01450                                               protein from Arabidopsis thaliana      8.6e-88
AT4G15830                                               protein from Arabidopsis thaliana      3.7e-48
AT3G18530                                               protein from Arabidopsis thaliana      7.3e-43
Fam179a, family with sequence similarity 179, member A  protein from Mus musculus              1.1e-05
CG42399                                                 protein from Drosophila melanogaster   9.1e-05

Raw Blast Data

BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]

Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.

Reference:  Gish, W. (1996-2006) http://blast.wustl.edu

Query=  020587
        (324 letters)

Database:  go_20130330-seqdb.fasta
           368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done

                                                                     Smallest
                                                                       Sum
                                                              High  Probability
Sequences producing High-scoring Segment Pairs:              Score  P(N)      N

TAIR|locus:2185505 - symbol:AT5G14790 "AT5G14790" species...   930  2.1e-93   1
TAIR|locus:2096667 - symbol:AT3G01450 "AT3G01450" species...   877  8.6e-88   1
TAIR|locus:2130814 - symbol:AT4G15830 species:3702 "Arabi...   503  3.7e-48   1
TAIR|locus:2086919 - symbol:AT3G18530 species:3702 "Arabi...   453  7.3e-43   1
MGI|MGI:2443498 - symbol:Fam179a "family with sequence si...   135  1.1e-05   1
FB|FBgn0259818 - symbol:CG42399 species:7227 "Drosophila ...   130  9.1e-05   2
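
The summary table above ends every hit line with the same three columns (high score, smallest sum probability P(N), and N), so it can be read back programmatically. The following is a minimal Python sketch, assuming those three trailing columns are always whitespace-separated as they are in this report. The names `Hit` and `parse_hit_line` are illustrative and do not belong to any BLAST tool.

```python
import re
from typing import NamedTuple, Optional


class Hit(NamedTuple):
    identifier: str   # database identifier, e.g. "TAIR|locus:2185505"
    score: int        # high score of the best segment pair
    p_value: float    # smallest sum probability P(N)
    n: int            # number of segment pairs contributing to P(N)


# Identifier, " - ", free-text description, then score, P(N) and N at end of line.
HIT_LINE = re.compile(
    r"^(\S+)\s+-\s+.*?\s+(\d+)\s+(\d+(?:\.\d*)?(?:e[-+]?\d+)?)\s+(\d+)\s*$",
    re.IGNORECASE,
)


def parse_hit_line(line: str) -> Optional[Hit]:
    """Return a Hit for a summary line, or None if the line is not a hit line."""
    match = HIT_LINE.match(line)
    if match is None:
        return None
    return Hit(
        identifier=match.group(1),
        score=int(match.group(2)),
        p_value=float(match.group(3)),
        n=int(match.group(4)),
    )
```

Lines that do not follow the layout, such as the column header, come back as `None`, so the function can be mapped over the whole report and the results filtered.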


>TAIR|locus:2185505
            symbol:AT5G14790 "AT5G14790" species:3702 "Arabidopsis
            thaliana" [GO:0005575 "cellular_component" evidence=ND] [GO:0008150
            "biological_process" evidence=ND] InterPro:IPR016024 EMBL:CP002688
            GenomeReviews:BA000015_GR SUPFAM:SSF48371 Gene3D:1.25.10.10
            InterPro:IPR011989 InterPro:IPR024395 Pfam:PF12348 EMBL:AL391149
            eggNOG:NOG300754 EMBL:AY093238 EMBL:AY088875 EMBL:BT001241
            EMBL:AK221795 IPI:IPI00543585 PIR:T51424 RefSeq:NP_196983.1
            UniGene:At.31886 ProteinModelPortal:Q9LEQ9 SMR:Q9LEQ9 PaxDb:Q9LEQ9
            PRIDE:Q9LEQ9 EnsemblPlants:AT5G14790.1 GeneID:831331
            KEGG:ath:AT5G14790 TAIR:At5g14790 HOGENOM:HOG000242934
            InParanoid:Q9LEQ9 OMA:CFSRSVP PhylomeDB:Q9LEQ9
            ProtClustDB:CLSN2916720 ArrayExpress:Q9LEQ9 Genevestigator:Q9LEQ9
            Uniprot:Q9LEQ9
        Length = 346

 Score = 930 (332.4 bits), Expect = 2.1e-93, P = 2.1e-93
 Identities = 207/338 (61%), Positives = 245/338 (72%)

Query:     2 MSETALRDLNTLPSSDRKNESSSKGSFAKPFVGSAN-ENVD------VSLVSTHVNGNQT 54
             M+   L+++N+L  +++   S  K S  KP VG  N ++ D       SL S+     + 
Sbjct:     1 MASNTLKEMNSLHVTEKI--SDCKASLTKPCVGKMNGKSEDRPLPNSASLDSSDSKVVEA 58

Query:    55 GNAGPGIANSEVEYIDSENLIDVEDIDTSVKTLVAGLDSKDWVVVCEALNNVRRLSIFHK 114
                 P IA  EVEYI+SENL +V+D D  +K+++AGL+SKDW+ +C+ALNNVRRLSIFHK
Sbjct:    59 EKPEPEIAIVEVEYIESENLDNVDDADAVLKSVLAGLESKDWISLCDALNNVRRLSIFHK 118

Query:   115 EAMLDILGDVIPLVVKSLKNPRSAVCKTAIMTAADIFSAYNDRMIDXXXXXXXXXXXXXX 174
             E M+ +L  VIPLVVKSLKNPRSAVCKTA MT+ADIFSAYN+ + D              
Sbjct:   119 EEMMHMLEKVIPLVVKSLKNPRSAVCKTACMTSADIFSAYNNHITDLLEPLLTQLLLKSS 178

Query:   175 XXXRFVCEAAEKALVAMTTWVSPILLLPKLQPYLKNRNPRIRAKASMCFSRSVPRLGVEG 234
                RFVCEAAEKAL AMT +VSP LLLPKLQP LKNRNPRIRAKAS+CFSRSVPRLGVEG
Sbjct:   179 QDKRFVCEAAEKALTAMTKYVSPTLLLPKLQPCLKNRNPRIRAKASLCFSRSVPRLGVEG 238

Query:   235 IKEYGIDKLIQVAASQLSDQLPESREAARTLLLELQSVYEKSHDSA-PATVSDS----PE 289
             IKEYGIDKL+Q AASQLSDQLPESREAART+LLELQSVYEK+H    P T S      PE
Sbjct:   239 IKEYGIDKLVQAAASQLSDQLPESREAARTVLLELQSVYEKAHPLINPETSSPEEQQIPE 298

Query:   290 MD--SWENFCQSKLSPLSAQAVLRVTNI----AREGLV 321
             ++  +WE FC+SKLS LSAQAVLRVTN+    AREGLV
Sbjct:   299 VEPITWETFCKSKLSALSAQAVLRVTNVVTVTAREGLV 336
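
Each alignment reports both an Expect value and a P value. Under the usual Poisson model for chance hits the two are related by P = 1 - exp(-E), which is why they print identically for the very small values in this report. A short Python sketch of the conversion follows; the function name `p_from_expect` is illustrative, and the relation is the standard one for BLAST statistics, not something taken from the WU-BLAST source.

```python
import math


def p_from_expect(expect: float) -> float:
    """Probability of at least one chance hit, given the expected number of chance hits."""
    # -expm1(-E) equals 1 - exp(-E) but stays accurate when E is tiny.
    return -math.expm1(-expect)


for expect in (2.1e-93, 1.1e-05, 1.0):
    print(expect, p_from_expect(expect))
```

The difference between E and P only becomes visible as E approaches 1, far above the 0.001 threshold used for this search.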


>TAIR|locus:2096667
            symbol:AT3G01450 "AT3G01450" species:3702 "Arabidopsis
            thaliana" [GO:0005575 "cellular_component" evidence=ND] [GO:0008150
            "biological_process" evidence=ND] InterPro:IPR016024 EMBL:CP002686
            SUPFAM:SSF48371 Gene3D:1.25.10.10 InterPro:IPR011989
            InterPro:IPR021133 PROSITE:PS50077 InterPro:IPR024395 Pfam:PF12348
            EMBL:AC010870 UniGene:At.28336 ProtClustDB:CLSN2916720
            EMBL:BT025879 IPI:IPI00521122 RefSeq:NP_566138.1 UniGene:At.41277
            ProteinModelPortal:Q9SGH3 SMR:Q9SGH3 PRIDE:Q9SGH3 DNASU:821140
            EnsemblPlants:AT3G01450.1 GeneID:821140 KEGG:ath:AT3G01450
            TAIR:At3g01450 InParanoid:Q9SGH3 OMA:KCANKEE PhylomeDB:Q9SGH3
            Genevestigator:Q9SGH3 Uniprot:Q9SGH3
        Length = 326

 Score = 877 (313.8 bits), Expect = 8.6e-88, P = 8.6e-88
 Identities = 190/329 (57%), Positives = 231/329 (70%)

Query:     2 MSETALRDLNTLPSSDRKNESSSKGSFAKPFVGSANENVDVSLVS---THVNGNQTGNAG 58
             M+  AL+DL  LP S+R  +  +K    K   G+A +    + V     H  G++     
Sbjct:     1 MAAKALKDLKNLPVSERNIDYKTKLLVGK-MNGTAEDKPPQNSVPFDHNHPKGDEIEKPE 59

Query:    59 PGIANSEVEYIDSENLIDVEDIDTSVKTLVAGLDSKDWVVVCEALNNVRRLSIFHKEAML 118
                   E+EYI+S++L +V  +D  +K+LV  LDSKDWV+VC+ALN +RRLSIFHKE ML
Sbjct:    60 AERVIVEIEYIESKDLNNVTQVDAVLKSLVTELDSKDWVLVCDALNTIRRLSIFHKEEML 119

Query:   119 DILGDVIPLVVKSLKNPRSAVCKTAIMTAADIFSAYNDRMIDXXXXXXXXXXXXXXXXXR 178
              +L  VI  +VKSLKNPRSAV KTA MT+ADIFS+YND  ID                 R
Sbjct:   120 HMLEKVILFIVKSLKNPRSAVSKTACMTSADIFSSYNDHTIDQLDLLLTQLLLKSSQDKR 179

Query:   179 FVCEAAEKALVAMTTWVSPILLLPKLQPYLKNRNPRIRAKASMCFSRSVPRLGVEGIKEY 238
             FVCEAAEKALVAMT  VSP LLLPKLQP+LKNRNPRIRAKAS CFSR VPRLG+EGI+EY
Sbjct:   180 FVCEAAEKALVAMTAHVSPALLLPKLQPFLKNRNPRIRAKASTCFSRCVPRLGIEGIREY 239

Query:   239 GIDKLIQVAASQLSDQLPESREAARTLLLELQSVYEKSHDSAPATVSDSPEMDSWENFCQ 298
             GI+KL+Q A+SQLSDQLPESREAAR +LLELQ+VY+K+ +  P    + PE  +W+ FCQ
Sbjct:   240 GIEKLVQAASSQLSDQLPESREAARAVLLELQTVYKKTTNVEPK--EEHPEPVTWQIFCQ 297

Query:   299 SKLSPLSAQAVLRVTNIA---REGLVIGS 324
             S LSPLSAQAV+RVTN+A   REGLV GS
Sbjct:   298 SNLSPLSAQAVIRVTNVAGVAREGLVAGS 326


>TAIR|locus:2130814
            symbol:AT4G15830 species:3702 "Arabidopsis thaliana"
            [GO:0005737 "cytoplasm" evidence=ISM] [GO:0008150
            "biological_process" evidence=ND] [GO:0000226 "microtubule
            cytoskeleton organization" evidence=RCA] [GO:0000911 "cytokinesis
            by cell plate formation" evidence=RCA] [GO:0010583 "response to
            cyclopentenone" evidence=RCA] [GO:0051225 "spindle assembly"
            evidence=RCA] InterPro:IPR016024 EMBL:CP002687 SUPFAM:SSF48371
            Gene3D:1.25.10.10 InterPro:IPR011989 InterPro:IPR024395
            Pfam:PF12348 UniGene:At.48861 UniGene:At.68470 IPI:IPI00540230
            RefSeq:NP_567477.1 ProteinModelPortal:F4JKW9 SMR:F4JKW9
            EnsemblPlants:AT4G15830.1 GeneID:827264 KEGG:ath:AT4G15830
            OMA:NDENSAP Uniprot:F4JKW9
        Length = 296

 Score = 503 (182.1 bits), Expect = 3.7e-48, P = 3.7e-48
 Identities = 106/257 (41%), Positives = 167/257 (64%)

Query:    61 IANSEVEYIDSENLIDVEDIDTSVKTLVAGLDSKDWVVVCEALNNVRRLSIFHKEAMLDI 120
             +A S VEY+ SENL    D ++SV+ L+  L SKDW+ VC++LNN RR +I H   +L I
Sbjct:    40 VAESTVEYVASENLKPFSDPESSVQRLLEELASKDWIKVCDSLNNTRRFAIHHSSLLLPI 99

Query:   121 LGDVIPLVVKSLKNPRSAVCKTAIMTAADIFSAYNDRMID-----XXXXXXXXXXXXXXX 175
             L  +I ++VK++KNPRSA+CKT+IMT +DIF+AY +++++                    
Sbjct:   100 LEKLIVVMVKAMKNPRSALCKTSIMTCSDIFTAYGEKLLEGPHLKSMDDLLLQLLMKASQ 159

Query:   176 XXRFVCEAAEKALVAMTTWVSPILLLPKLQPYLKNRNPRIRAKASMCFSRSVPRLGVEGI 235
               +FVCE AEKAL  M   V+ + LL KLQ Y+++ NPR+RAKA++  S  V ++ V  +
Sbjct:   160 DKKFVCEEAEKALNTMVNSVARLPLLRKLQSYVRHSNPRVRAKAAVSTSNCVSKMEVNEM 219

Query:   236 KEYGIDKLIQVAASQLSDQLPESREAARTLLLELQSVYEKSHDSAPATVSDSPEMDSWEN 295
             +E+G+  L Q+AA QLSD+LPE+REAAR+++    S++EK   +       S + ++W+ 
Sbjct:   220 EEFGMILLAQMAADQLSDKLPEAREAARSMV---NSLFEKFTWNEEEDEEGSKQ-EAWKK 275

Query:   296 FCQSKLSPLSAQAVLRV 312
             FC+  ++ L+AQA++++
Sbjct:   276 FCEKNVTGLNAQAMIKI 292


>TAIR|locus:2086919
            symbol:AT3G18530 species:3702 "Arabidopsis thaliana"
            [GO:0005634 "nucleus" evidence=ISM] [GO:0008150
            "biological_process" evidence=ND] InterPro:IPR016024 EMBL:CP002686
            SUPFAM:SSF48371 Gene3D:1.25.10.10 InterPro:IPR011989
            InterPro:IPR024395 Pfam:PF12348 IPI:IPI00522288 RefSeq:NP_188483.4
            UniGene:At.38536 UniGene:At.66527 ProteinModelPortal:F4J8S4
            PRIDE:F4J8S4 EnsemblPlants:AT3G18530.1 GeneID:821384
            KEGG:ath:AT3G18530 ArrayExpress:F4J8S4 Uniprot:F4J8S4
        Length = 361

 Score = 453 (164.5 bits), Expect = 7.3e-43, P = 7.3e-43
 Identities = 116/270 (42%), Positives = 148/270 (54%)

Query:     2 MSETALRDLNTLPSSDRKNESSSKGSFAKPFVGSANENV---DVSLVSTHVNGNQTGNAG 58
             M+  AL+DL  LP S+R N    K   A    G A +      V L   H  G++     
Sbjct:     1 MAAKALKDLKNLPVSER-NIDCKKNPCAGKMNGKAEDRPPQNSVPLDHNHPTGDEIEKPE 59

Query:    59 PGIANSEVEYIDSENLIDVEDIDTSVKTLVAGLDSKDWVVVCEALNNVRRLSIFH-KEAM 117
                   E+EYI S++L +V ++D  +K  +  L     ++ C+   ++ R S FH  +  
Sbjct:    60 AERVIVELEYIKSKDLNNVAEVDAVLKVSIV-LSWYYTMLYCDFSFSLDRSSSFHFPQGR 118

Query:   118 LDILGDVIPLVVKSLKNPRSAVCKTAIMTAADIFSAYNDRMIDXXXXXXXXXXXXXXXXX 177
                   VI  VVKSLKNPRSAV KTA MT+ DIFS+YND + D                 
Sbjct:   119 NAAFAKVILFVVKSLKNPRSAVSKTACMTSEDIFSSYNDHIFDQLDRLLTQLLLKSSQDK 178

Query:   178 RFVCEAAEKALVAMTTWVSPILLLPKLQPYLKNRNPRIRAKASMCFSRSVPRLGVEGIKE 237
             RFVCEAAE+ALVAMTT VSP LLLPKL+P LKN++PRIRAKAS CFS  VPRLG+EG++E
Sbjct:   179 RFVCEAAERALVAMTTHVSPALLLPKLRPCLKNKSPRIRAKASACFSGCVPRLGIEGMRE 238

Query:   238 YGIDKLIQVAASQLSDQLPESREAARTLLL 267
             YGI+   Q             R+  R LLL
Sbjct:   239 YGIETNSQNLRRLHGQSSWNLRQCIRKLLL 268


>MGI|MGI:2443498
            symbol:Fam179a "family with sequence similarity 179, member
            A" species:10090 "Mus musculus" [GO:0003674 "molecular_function"
            evidence=ND] [GO:0005575 "cellular_component" evidence=ND]
            [GO:0008150 "biological_process" evidence=ND] InterPro:IPR016024
            MGI:MGI:2443498 SUPFAM:SSF48371 Gene3D:1.25.10.10
            InterPro:IPR011989 InterPro:IPR024395 Pfam:PF12348 EMBL:AK052197
            EMBL:AK158647 IPI:IPI00226618 IPI:IPI00652867 RefSeq:NP_796061.2
            UniGene:Mm.187372 ProteinModelPortal:Q3TYG6 SMR:Q3TYG6
            PhosphoSite:Q3TYG6 PRIDE:Q3TYG6 Ensembl:ENSMUST00000097284
            Ensembl:ENSMUST00000153445 GeneID:320159 KEGG:mmu:320159
            UCSC:uc008dmt.1 CTD:165186 eggNOG:NOG300754
            GeneTree:ENSGT00390000012217 HOGENOM:HOG000112453
            HOVERGEN:HBG107880 InParanoid:Q3TYG6 OrthoDB:EOG4RJG0V
            NextBio:396147 Bgee:Q3TYG6 Genevestigator:Q3TYG6 Uniprot:Q3TYG6
        Length = 1002

 Score = 135 (52.6 bits), Expect = 1.1e-05, P = 1.1e-05
 Identities = 42/144 (29%), Positives = 62/144 (43%)

Query:    91 LDSKDWVVVCEALNNVRRLSIFHKEAMLDILGDVIPLVVKSLKNPRSAVCKTAIMTAADI 150
             L+S DW +  + L N++RL+  H E +   L DV   V   + N RS V + AI T  D+
Sbjct:   500 LNSNDWQMKEKGLVNIQRLAACHSEVLGTRLHDVSLAVTAEVTNLRSKVSRLAISTLGDL 559

Query:   151 FSAYNDRMIDXXXXXXXXXXXXXXXXXRFVCEAAEKALVAMTTWVSPILLLPKLQPY-LK 209
             F      M                    F+  AA +AL AM   V+P   L  L    + 
Sbjct:   560 FRVLKKNMDQEAEEIVRCLLQKMGNTSEFIQRAANRALGAMVENVTPARALVALTSAGVY 619

Query:   210 NRNPRIRAKASMCFSRSVPRLGVE 233
             +RNP +R   +   S  + ++G E
Sbjct:   620 HRNPLVRKCTAKHLSAVLEQIGAE 643


>FB|FBgn0259818
            symbol:CG42399 species:7227 "Drosophila melanogaster"
            [GO:0003674 "molecular_function" evidence=ND] [GO:0005575
            "cellular_component" evidence=ND] [GO:0008150 "biological_process"
            evidence=ND] InterPro:IPR016024 SUPFAM:SSF48371 Gene3D:1.25.10.10
            InterPro:IPR011989 EMBL:AE014134 InterPro:IPR024395 Pfam:PF12348
            GeneTree:ENSGT00390000012217 UCSC:CG42399-RA FlyBase:FBgn0259818
            RefSeq:NP_608498.3 EnsemblMetazoa:FBtr0301929 GeneID:33176
            KEGG:dme:Dmel_CG42399 InParanoid:Q9VPK5 PhylomeDB:Q9VPK5
            GenomeRNAi:33176 NextBio:782279 ArrayExpress:Q9VPK5 Bgee:Q9VPK5
            Uniprot:Q9VPK5
        Length = 1655

 Score = 130 (50.8 bits), Expect = 9.1e-05, Sum P(2) = 9.1e-05
 Identities = 47/183 (25%), Positives = 80/183 (43%)

Query:    91 LDSKDWVVVCEALNNVRRLSIFHKEAMLDILGDVIPLVVKSLKNPRSAVCKTAIMTAADI 150
             LDS +W V    L ++ RL  +H E + + +      + +S++N RS V + +   AA++
Sbjct:  1442 LDSSNWEVNVSGLKSMVRLIRYHAETLDNQMHMTCIQLTRSVRNLRSQVARASCQAAAEL 1501

Query:   151 FSAYNDRMIDXXXXXXXXXXXXXXXXXRFVCEAAEKALVAMTTWVSPILLLPKLQPY-LK 209
             FS  +  +                   RF+   A +AL +M     P  +L  L     +
Sbjct:  1502 FSLKSTSLQQECDDLVCALLHRTADTNRFLRADANRALESMVDHAQPQKILNILATKGAQ 1561

Query:   210 NRNPRIRAKASMCFSRSVPRLGVEGIKEYGI---DKLIQVAASQLSDQLPESREAARTLL 266
             ++N  +R  ++    R V RLG + I   G    DK   V A+ L +   E+R  A++L 
Sbjct:  1562 HQNALVRTTSAKLLFRLVERLGSDRIYAMGRESRDKFFVVGANLLLEGSLETRSYAKSLF 1621

Query:   267 LEL 269
               L
Sbjct:  1622 RAL 1624

 Score = 45 (20.9 bits), Expect = 9.1e-05, Sum P(2) = 9.1e-05
 Identities = 21/83 (25%), Positives = 40/83 (48%)

Query:    12 TLPSSDR--KNESSSKGSFAKPFVGSANENVDVSLVSTHVNGNQTGNAGPGIANSEVEYI 69
             ++P S +  + +   K SF K  +G  +E    S+ +   N +Q   +G  I+   +E +
Sbjct:  1080 SIPKSQQMIRKKFQHKDSFTK--LG--DEPPGQSIENNKFNESQRRKSG--ISTDHMESL 1133

Query:    70 DSE--NLIDVEDIDTSVKTLVAG 90
             DS+  ++I   D +T V  +  G
Sbjct:  1134 DSKCADVIKSVDTETEVNGMTNG 1156


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.314   0.129   0.360    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      324       307    0.0010  115 3  11 23  0.43    34
                                                     33  0.45    37
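
The bit scores printed next to each raw score in this report are consistent with the usual conversion bits = (Lambda * S - ln K) / ln 2, applied with the gapped parameters listed above for Q=9,R=2 (Lambda = 0.244, K = 0.0300) and not with the ungapped BLOSUM62 values. The Python sketch below checks this against every Score line in the report. The function name `bit_score` is illustrative, and the agreement is an observation from these numbers, not a statement about how WU-BLAST is implemented.

```python
import math

# Gapped Karlin-Altschul parameters as printed in the Parameters table (Q=9,R=2).
GAPPED_LAMBDA = 0.244
GAPPED_K = 0.0300


def bit_score(raw_score: float, lam: float = GAPPED_LAMBDA, k: float = GAPPED_K) -> float:
    """Convert a raw alignment score to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


# (raw score, bits as reported) pairs copied from the Score lines of this report.
REPORTED = [(930, 332.4), (877, 313.8), (503, 182.1), (453, 164.5),
            (135, 52.6), (130, 50.8), (45, 20.9)]

for raw, reported_bits in REPORTED:
    print(raw, round(bit_score(raw), 1), reported_bits)
```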


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  6
  No. of states in DFA:  613 (65 KB)
  Total size of DFA:  199 KB (2112 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  24.04u 0.08s 24.12t   Elapsed:  00:00:01
  Total cpu time:  24.04u 0.08s 24.12t   Elapsed:  00:00:01
  Start:  Thu May  9 23:10:12 2013   End:  Thu May  9 23:10:13 2013
