BLAST Results

Query Summary

Your job contains 1 sequence.

Parameters
Threshold: 0.001
Maximum number of alignments shown: 100
BLAST filter: on

Query Sequence

>012437
MQEWGTDLAKMATVNSATVKYTEFLLATASGKVEGVKGPGKLATPFEKTKVAAYTLGAMS
PCMRLYAFLGKEFHALLNANEGNHPYTKWIDNYSSESFQASALQNEDLLDKLSVSLTGEE
LDIIEKLYHQAMKLEVEFFCAQPLAQPTVVPLIKGHNPAGDRLIIFSDFDLTCTIVDSSA
ILAEIAIVTAPKSDQNQPENQLGRMSSGELRNTWGLLSKQYTEEYEQCIESFMPSEKVEN
FNYETLHKALEQLSHFEKRANSRVIESGVLKGINLEDIKKAGERLSLQDGCTTFFQKVVK
NENLNANVHVLSYCWCGDLIRASFSSAGLNALNVHANEFSFKESISTGEIIEKVESPIDK
VQAFNNTLEKYGTDRKNLSVYIGDSVGDLLCLLEADIGIVIGSSSSLRRVGSQFGVTFIP
LYPGLVKKQKEYTEGSSSNWKEKSGILYTVSSWAEVHAFILGW

High Scoring Gene Products

Symbol      Full name                             Information                               P value
AT5G32470                                         protein from Arabidopsis thaliana         1.7e-178
MGG_04762   Uncharacterized protein               protein from Magnaporthe oryzae 70-15     6.2e-10
CJE0491     TenA/Thi-4 family protein             protein from Campylobacter jejuni RM1221  2.7e-07
CJE_0491    TenA/Thi-4 family protein             protein from Campylobacter jejuni RM1221  2.7e-07
YCR015C     Putative protein of unknown function  gene from Saccharomyces cerevisiae        1.5e-05
orf19.6732                                        gene_product from Candida albicans        2.0e-05

Raw Blast Data

BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]

Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.

Reference:  Gish, W. (1996-2006) http://blast.wustl.edu

Query=  012437
        (463 letters)

Database:  go_20130330-seqdb.fasta
           368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done

                                                                     Smallest
                                                                       Sum
                                                              High  Probability
Sequences producing High-scoring Segment Pairs:              Score  P(N)      N

TAIR|locus:2145816 - symbol:AT5G32470 "AT5G32470" species...  1733  1.7e-178  1
UNIPROTKB|G4MTS1 - symbol:MGG_04762 "Uncharacterized prot...   168  6.2e-10   1
POMBASE|SPBC17G9.12c - symbol:SPBC17G9.12c "hydrolase (pr...   148  8.8e-08   1
UNIPROTKB|Q5HW25 - symbol:CJE0491 "TenA/Thi-4 family prot...   140  2.7e-07   1
TIGR_CMR|CJE_0491 - symbol:CJE_0491 "TenA/Thi-4 family pr...   140  2.7e-07   1
SGD|S000000608 - symbol:YCR015C "Putative protein of unkn...   130  1.5e-05   1
CGD|CAL0004824 - symbol:orf19.6732 species:5476 "Candida ...   128  2.0e-05   1
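
The one-line hit descriptions above follow a regular layout: database and accession, a symbol, truncated description text, then the raw score, P(N) and N. A minimal Python sketch for pulling those fields out is shown here. The pattern and the names `SUMMARY_RE` and `parse_summary_line` are assumptions made for illustration, inferred from the seven lines in this report rather than from any WU-BLAST format specification.

```python
import re

# Summary lines copied verbatim from the report above.
SUMMARY_LINES = '''\
TAIR|locus:2145816 - symbol:AT5G32470 "AT5G32470" species...  1733  1.7e-178  1
UNIPROTKB|G4MTS1 - symbol:MGG_04762 "Uncharacterized prot...   168  6.2e-10   1
POMBASE|SPBC17G9.12c - symbol:SPBC17G9.12c "hydrolase (pr...   148  8.8e-08   1
UNIPROTKB|Q5HW25 - symbol:CJE0491 "TenA/Thi-4 family prot...   140  2.7e-07   1
TIGR_CMR|CJE_0491 - symbol:CJE_0491 "TenA/Thi-4 family pr...   140  2.7e-07   1
SGD|S000000608 - symbol:YCR015C "Putative protein of unkn...   130  1.5e-05   1
CGD|CAL0004824 - symbol:orf19.6732 species:5476 "Candida ...   128  2.0e-05   1
'''.splitlines()

# Assumed layout: "DB|accession - symbol:NAME <truncated text>  score  P(N)  N".
# The last three whitespace-separated tokens are taken as score, P(N) and N.
SUMMARY_RE = re.compile(
    r'^(?P<db>[^|\s]+)\|(?P<accession>\S+) - symbol:(?P<symbol>\S+)'
    r'.*?\s+(?P<score>\d+)\s+(?P<p>\S+)\s+(?P<n>\d+)\s*$'
)


def parse_summary_line(line):
    """Return a dict of fields for one summary line, or None if it does not match."""
    m = SUMMARY_RE.match(line)
    if m is None:
        return None
    return {
        "db": m.group("db"),
        "accession": m.group("accession"),
        "symbol": m.group("symbol"),
        "score": int(m.group("score")),
        "p": float(m.group("p")),
        "n": int(m.group("n")),
    }


hits = [h for h in map(parse_summary_line, SUMMARY_LINES) if h is not None]
for hit in hits:
    print(hit["db"], hit["symbol"], hit["score"], hit["p"])
```

Lines that do not fit the assumed layout return `None` and are skipped, so the sketch can be run over the whole report without first isolating the summary table.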


>TAIR|locus:2145816
            symbol:AT5G32470 "AT5G32470" species:3702 "Arabidopsis
            thaliana" [GO:0003674 "molecular_function" evidence=ND] [GO:0008150
            "biological_process" evidence=ND] [GO:0009507 "chloroplast"
            evidence=ISM] [GO:0005829 "cytosol" evidence=IDA] GO:GO:0005829
            EMBL:CP002688 Gene3D:3.40.50.1000 InterPro:IPR023214
            SUPFAM:SSF56784 Gene3D:1.20.910.10 InterPro:IPR016084
            SUPFAM:SSF48613 InterPro:IPR004305 Pfam:PF03070 IPI:IPI00544299
            RefSeq:NP_198287.3 UniGene:At.30675 ProteinModelPortal:F4KFT7
            PRIDE:F4KFT7 EnsemblPlants:AT5G32470.1 GeneID:833206
            KEGG:ath:AT5G32470 OMA:ATTKYTD PhylomeDB:F4KFT7 Uniprot:F4KFT7
        Length = 617

 Score = 1733 (615.1 bits), Expect = 1.7e-178, P = 1.7e-178
 Identities = 336/467 (71%), Positives = 398/467 (85%)

Query:     1 MQEWGTDLAKMATVNSATVKYTEFLLATASGKVEGVKGPGKLATPFEKTKVAAYTLGAMS 60
             +Q+W  D+ K  +VNSAT++YTEFLLATASGKVEG K PG L TPFEKTKVAAYTLGA++
Sbjct:   152 VQDWDLDINKEVSVNSATLRYTEFLLATASGKVEGCKAPGMLDTPFEKTKVAAYTLGAVT 211

Query:    61 PCMRLYAFLGKEFHALLNANEGNHPYTKWIDNYSSESFQASALQNEDLLDKLSVSLTGEE 120
             PCMRLYAFLGKEF +LL+ ++ NHPY KWIDNYSS++FQASA Q EDLL+KLSVS+TGEE
Sbjct:   212 PCMRLYAFLGKEFGSLLDLSDVNHPYKKWIDNYSSDAFQASAKQTEDLLEKLSVSMTGEE 271

Query:   121 LDIIEKLYHQAMKLEVEFFCAQPLAQPTVVPLIKGHNPAGDRLIIFSDFDLTCTIVDSSA 180
             LDIIEKLY QAMKLEVEFF AQPLAQPT+VPL+K H+   D L+IFSDFDLTCT+VDSSA
Sbjct:   272 LDIIEKLYQQAMKLEVEFFHAQPLAQPTIVPLLKNHSK--DDLVIFSDFDLTCTVVDSSA 329

Query:   181 ILAEIAIVTAPKSDQNQPENQLGRMSSGELRNTWGLLSKQYTEEYEQCIESFMPSEKVEN 240
             ILAEIAIVTAPK +Q++   Q+ RM S +L+NTW LLSKQYTE YE+CIES +  +K + 
Sbjct:   330 ILAEIAIVTAPKDEQSRSGQQIHRMLSSDLKNTWNLLSKQYTEHYEECIESILNKKKADK 389

Query:   241 FNYETLHKALEQLSHFEKRANSRVIESGVLKGINLEDIKKAGERLSLQDGCTTFFQKVVK 300
             F+YE L KALEQLS FEK AN+RVIESGVLKG+NLEDIK+AGERL LQDGC   FQK++K
Sbjct:   390 FDYEGLCKALEQLSDFEKEANNRVIESGVLKGLNLEDIKRAGERLILQDGCINVFQKILK 449

Query:   301 NENLNANVHVLSYCWCGDLIRASFSSAGLNALNVHANEFSFKESISTGEIIEKVESPIDK 360
              ENLNA +HVLSYCWCGDLIRA+FS+ G++A+ VHANEF+F+ESISTGEI  KVESPI+K
Sbjct:   450 TENLNAELHVLSYCWCGDLIRAAFSAGGVDAVEVHANEFTFEESISTGEIERKVESPINK 509

Query:   361 VQAFNNTLE--KYGTDRKN-LSVYIGDSVGDLLCLLEADIGIVIGSSSSLRRVGSQFGVT 417
              Q F + L+  K   ++K+ LSVYIGDSVGDLLCLLEADIGIV+ SSSSLRRVGS FGV+
Sbjct:   510 AQQFKSILQNRKNENNKKSFLSVYIGDSVGDLLCLLEADIGIVVSSSSSLRRVGSHFGVS 569

Query:   418 FIPLYPGLVKKQKEYTEGSSSN-WKEKSGILYTVSSWAEVHAFILGW 463
             F+PL+ G+V+KQK++TE SSS+ WK  SG LYTVSSWAE+H+F LGW
Sbjct:   570 FVPLFSGIVQKQKQHTEESSSSAWKGLSGTLYTVSSWAEIHSFALGW 616
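
The percentages on the Identities/Positives lines appear to be truncated rather than rounded: 336/467 is about 71.9%, yet the report prints 71%. A short Python sketch of that arithmetic follows. Integer truncation is an assumption inferred from the numbers in this report, and `percent_truncated` is a hypothetical helper name.

```python
def percent_truncated(matches, length):
    """Percentage of matching columns, truncated toward zero (assumed report behaviour)."""
    return (100 * matches) // length


# (matches, alignment length) pairs taken from Identities lines in this report.
for matches, length in [(336, 467), (68, 265), (80, 311), (30, 85)]:
    exact = 100 * matches / length
    print(f"{matches}/{length}: exact {exact:.2f}%, reported {percent_truncated(matches, length)}%")
```

Conventional rounding would give 72% for the first hit, so downstream filters that apply an identity cutoff may want to recompute the percentage from the raw counts.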


>UNIPROTKB|G4MTS1
            symbol:MGG_04762 "Uncharacterized protein" species:242507
            "Magnaporthe oryzae 70-15" [GO:0005575 "cellular_component"
            evidence=ND] Gene3D:3.40.50.1000 InterPro:IPR023214 SUPFAM:SSF56784
            EMBL:CM001232 RefSeq:XP_003713717.1 ProteinModelPortal:G4MTS1
            EnsemblFungi:MGG_04762T0 GeneID:2677866 KEGG:mgr:MGG_04762
            Uniprot:G4MTS1
        Length = 304

 Score = 168 (64.2 bits), Expect = 6.2e-10, P = 6.2e-10
 Identities = 68/265 (25%), Positives = 115/265 (43%)

Query:   168 DFDLTCTIVDSSAILAEIAIVTAPKSDQNQPENQLGRMSSGELRNTWGLLSKQYTEEYEQ 227
             DFD T    DS   L E  +    K  Q+            +L  TW  +   Y  +++ 
Sbjct:     6 DFDGTIIAKDSINCLGEFGVSHQQKHRQH------------DLSPTWKQIVSDYLADHKM 53

Query:   228 CIESFMPSEKVENFNYETLHKALEQLSHFEKRANSRVIESGVLKGINLEDIKKAGERLSL 287
              + ++ P+E  +   ++     L  L H + ++ +RV ++ + +G   +D+  AG R ++
Sbjct:    54 HVSAYSPAE-ADRLTHDDERAFLHSLQHVDVKSLARVADARIFEGCTADDLYGAG-REAV 111

Query:   288 QDGCTT----FFQKVVKNENLNANVHVLSYCWCGDLIRASFSSAGLNAL--NVHANEFSF 341
             + G       F + V       A + +LS  W    IR   S  G + +  +V +NE + 
Sbjct:   112 RTGKVAARGGFAEFVAVMRGAGATLSILSVNWSASFIRGVLSQCGDDVVIEDVVSNEITA 171

Query:   342 KESIST-GEIIEKVESP----IDKVQAFNNTLEKYGTDRKNLS----VYIGDSVGDLLCL 392
                I   GE      +P    + K++A         +D   +S    +Y GDS  DL CL
Sbjct:   172 DGKIGCLGEGGGAQGTPMMTSLHKLEALR-ARSAASSDENEISSKITIYFGDSTTDLECL 230

Query:   393 LEADIGIVIGS--SSSLRRVGSQFG 415
             L ADIGIV+ +  +SSL R  ++ G
Sbjct:   231 LAADIGIVMANDENSSLLRALARLG 255


>POMBASE|SPBC17G9.12c
            symbol:SPBC17G9.12c "hydrolase (predicted)"
            species:4896 "Schizosaccharomyces pombe" [GO:0003674
            "molecular_function" evidence=ND] [GO:0005634 "nucleus"
            evidence=IDA] [GO:0005829 "cytosol" evidence=IDA] [GO:0008150
            "biological_process" evidence=ND] PomBase:SPBC17G9.12c
            GO:GO:0005829 GO:GO:0005634 InterPro:IPR023214 SUPFAM:SSF56784
            EMBL:CU329671 eggNOG:NOG81108 OrthoDB:EOG4PK5JD PIR:T39735
            RefSeq:NP_595901.1 ProteinModelPortal:Q9UUE0
            EnsemblFungi:SPBC17G9.12c.1 GeneID:2539837 KEGG:spo:SPBC17G9.12c
            OMA:MHIHANE NextBio:20800985 Uniprot:Q9UUE0
        Length = 274

 Score = 148 (57.2 bits), Expect = 8.8e-08, P = 8.8e-08
 Identities = 80/311 (25%), Positives = 132/311 (42%)

Query:   163 LIIFSDFDLTCTIVDSSAILAEIAIVTAPKSDQNQPENQLGRMSSGELRNTWGLLSKQYT 222
             ++   DFD T T  D+  +LAE A+        N+PE              W ++S +Y 
Sbjct:     1 MLYIVDFDETITTYDTIHLLAE-AV--------NKPEE-------------WSVISDKYW 38

Query:   223 EEYEQCIESFMPSEKVENFNYETLHKALEQLSHFEKRANSRVIESGVLKGINLEDIKKAG 282
             +EY    E+   S  + +  Y  L   L    + E+ +  R+ +S    G++   +    
Sbjct:    39 QEYLAWREALPHSTTLTS--YLPL---LGGSRYLEEASIKRIEKSQYFSGLSEGALDNIV 93

Query:   283 ERLSLQDGCTTFFQKVVKNENLNANV-HVLSYCWCGDLIRASF-SSAGLNA--LNVHANE 338
             + ++L+ G   F   +V +  ++  + HVLS  W   +I  +      L A  L VHAN+
Sbjct:    94 QLITLRAGFVEFINALVPDLRVSKTIFHVLSVNWSARVIEQTLLHHTDLTADLLCVHAND 153

Query:   339 FSFKESIST--GEIIEKVESPI-----DKVQAFNNTLEKYGTDRKNLSVYIGDSVGDLLC 391
             F F  S +T  G I+ +  S +     DKV+ F   ++          VYIGDS  D  C
Sbjct:   154 FDFDTSTNTTNGRILARNASSLLMNSTDKVREFRRIVQTDAVSSPLNVVYIGDSPTDFGC 213

Query:   392 LLEADIGIVIGSSSSLRRVGSQF-GVTFIPLYPGLVKKQKEYTEGSSSNWKEKSGILYTV 450
             L  + I I++ S+     + S+F  V  + +    V+K      G          I+YT 
Sbjct:   214 LQISPISILMRSNQKYYDILSRFEDVQLVDISEFPVQKA---VPGKK--------IIYTC 262

Query:   451 SSWAEVH-AFI 460
             S W  +  AF+
Sbjct:   263 SDWCAIQKAFL 273


>UNIPROTKB|Q5HW25
            symbol:CJE0491 "TenA/Thi-4 family protein" species:195099
            "Campylobacter jejuni RM1221" [GO:0003674 "molecular_function"
            evidence=ND] [GO:0008150 "biological_process" evidence=ND]
            EMBL:CP000025 GenomeReviews:CP000025_GR Gene3D:1.20.910.10
            InterPro:IPR016084 SUPFAM:SSF48613 eggNOG:COG0819
            InterPro:IPR004305 Pfam:PF03070 HOGENOM:HOG000225158
            RefSeq:YP_178510.1 ProteinModelPortal:Q5HW25 STRING:Q5HW25
            GeneID:3231252 KEGG:cjr:CJE0491 PATRIC:20042686 KO:K03707
            OMA:EYAKVFA ProtClustDB:CLSK878792
            BioCyc:CJEJ195099:GJC0-501-MONOMER Uniprot:Q5HW25
        Length = 221

 Score = 140 (54.3 bits), Expect = 2.7e-07, P = 2.7e-07
 Identities = 30/85 (35%), Positives = 49/85 (57%)

Query:    56 LGAMSPCMRLYAFLGKEF-HALLNANEGNHPYTKWIDNYSSESFQASALQNEDLLDKLSV 114
             L A+S C   YA +G E  + L N N  +HPY +WI  Y SE+FQ  A + ED ++  + 
Sbjct:   127 LVALSACAIGYAKIGAEIINRLKNENLKDHPYKEWILTYGSENFQNEAKEFEDFVNSYTS 186

Query:   115 SLTGEELDIIEKLYHQAMKLEVEFF 139
             S+  ++   + +++H   +LEV F+
Sbjct:   187 SVGAQKFQKLSEIFHTVTRLEVAFW 211


>TIGR_CMR|CJE_0491
            symbol:CJE_0491 "TenA/Thi-4 family protein" species:195099
            "Campylobacter jejuni RM1221" [GO:0003674 "molecular_function"
            evidence=ND] [GO:0008150 "biological_process" evidence=ND]
            EMBL:CP000025 GenomeReviews:CP000025_GR Gene3D:1.20.910.10
            InterPro:IPR016084 SUPFAM:SSF48613 eggNOG:COG0819
            InterPro:IPR004305 Pfam:PF03070 HOGENOM:HOG000225158
            RefSeq:YP_178510.1 ProteinModelPortal:Q5HW25 STRING:Q5HW25
            GeneID:3231252 KEGG:cjr:CJE0491 PATRIC:20042686 KO:K03707
            OMA:EYAKVFA ProtClustDB:CLSK878792
            BioCyc:CJEJ195099:GJC0-501-MONOMER Uniprot:Q5HW25
        Length = 221

 Score = 140 (54.3 bits), Expect = 2.7e-07, P = 2.7e-07
 Identities = 30/85 (35%), Positives = 49/85 (57%)

Query:    56 LGAMSPCMRLYAFLGKEF-HALLNANEGNHPYTKWIDNYSSESFQASALQNEDLLDKLSV 114
             L A+S C   YA +G E  + L N N  +HPY +WI  Y SE+FQ  A + ED ++  + 
Sbjct:   127 LVALSACAIGYAKIGAEIINRLKNENLKDHPYKEWILTYGSENFQNEAKEFEDFVNSYTS 186

Query:   115 SLTGEELDIIEKLYHQAMKLEVEFF 139
             S+  ++   + +++H   +LEV F+
Sbjct:   187 SVGAQKFQKLSEIFHTVTRLEVAFW 211


>SGD|S000000608
            symbol:YCR015C "Putative protein of unknown function"
            species:4932 "Saccharomyces cerevisiae" [GO:0005575
            "cellular_component" evidence=ND] [GO:0008150 "biological_process"
            evidence=ND] [GO:0003674 "molecular_function" evidence=ND]
            SGD:S000000608 InterPro:IPR023214 SUPFAM:SSF56784 EMBL:X59720
            EMBL:BK006937 PIR:S19425 RefSeq:NP_009941.2
            ProteinModelPortal:P25616 DIP:DIP-6719N IntAct:P25616
            MINT:MINT-627776 EnsemblFungi:YCR015C GeneID:850373
            KEGG:sce:YCR015C CYGD:YCR015c eggNOG:NOG81108 HOGENOM:HOG000000897
            OMA:YIGDSET OrthoDB:EOG4PK5JD NextBio:965867 Genevestigator:P25616
            GermOnline:YCR015C Uniprot:P25616
        Length = 317

 Score = 130 (50.8 bits), Expect = 1.5e-05, P = 1.5e-05
 Identities = 70/262 (26%), Positives = 114/262 (43%)

Query:   162 RLIIFSDFDLTCTIVDSSAILAEIAIVTAPKSDQNQPENQLGRMSSGELRNTW--GLLSK 219
             + II SDFD T T VD+   +A++  +  P+    +PE        G    T+  G    
Sbjct:     2 KTIIISDFDETITRVDTICTIAKLPYLLNPRL---KPE-------WGHFTKTYMDGYHKY 51

Query:   220 QYTEEYE-QCIESFMPSE-KVENFNYETLHKALEQLSH---FEKRANSRVIESGVLKGIN 274
             +Y        + S +P+     NFN +     L+  +H    E  + + + +  + K I+
Sbjct:    52 KYNGTRSLPLLSSGVPTIISQSNFN-KLFADELKYQNHNRVVELNSVNEITKQQIFKSIS 110

Query:   275 LEDIKKAG-----ERLSLQDGCTTFFQKVVKNENLNANVHVLSYCWCGDLIRASFSSAGL 329
             L+ +K        E   L+DG  TF   VVKN    ++ +VLS  W  + I        L
Sbjct:   111 LDQMKTFARDQNHEDCLLRDGFKTFCSSVVKN--FESDFYVLSINWSKEFIHEVIGDRRL 168

Query:   330 NALNVHANEF---SFKESIS-TGEIIEKVESPIDKVQAFNNTLEKY--GTDRKNLSV--- 380
                ++  N+    S K S S  GE   ++ +  DKV+     L+K   G +++  S    
Sbjct:   169 KNSHIFCNDLKKVSDKCSQSYNGEFDCRLLTGSDKVKILGEILDKIDSGCNKEGNSCSYW 228

Query:   381 YIGDSVGDLLCLLEADI-GIVI 401
             YIGDS  DLL +L     G+++
Sbjct:   229 YIGDSETDLLSILHPSTNGVLL 250


>CGD|CAL0004824
            symbol:orf19.6732 species:5476 "Candida albicans" [GO:0005634
            "nucleus" evidence=IEA] [GO:0005829 "cytosol" evidence=IEA]
            InterPro:IPR006383 CGD:CAL0004824 Gene3D:3.40.50.1000
            InterPro:IPR023214 SUPFAM:SSF56784 TIGRFAMs:TIGR01488 GO:GO:0016311
            GO:GO:0016791 EMBL:AACQ01000029 EMBL:AACQ01000028 eggNOG:NOG81108
            RefSeq:XP_719714.1 RefSeq:XP_719831.1 ProteinModelPortal:Q5ADV9
            GeneID:3638522 GeneID:3638585 KEGG:cal:CaO19.14024
            KEGG:cal:CaO19.6732 Uniprot:Q5ADV9
        Length = 288

 Score = 128 (50.1 bits), Expect = 2.0e-05, P = 2.0e-05
 Identities = 51/202 (25%), Positives = 93/202 (46%)

Query:   219 KQYTEEYEQCIESF-MPSE---KVENFNYETLH-KALEQLSHFEKRANSRVIESGVLKGI 273
             + Y + Y     SF  P+     +E  +Y +   K  ++LS  E ++   + +S + +G+
Sbjct:    61 ESYNQNYTNLKNSFNFPNTTTTSIECSDYLSQQIKFQDELSTVENQSIELIEQSKIFEGL 120

Query:   274 NLEDIKKA----GERLSLQDGCTTFFQKVVKNENLNANVHVLSYCWCGDLIRASFSSAGL 329
               +D +        ++ L+ G   F Q V + +NLN  + ++S  W    I+   ++ GL
Sbjct:   121 TKKDFQDYVNINHNKIKLRPG---FSQFVKRCQNLNIPIIIVSANWTSIFIKQCLANHGL 177

Query:   330 NALNVHANEFSF-------KESISTGEIIEKVESPIDKVQAFNNTLEKYGTDRKNLSVYI 382
                ++  NE SF       K  ++T  + +K +  I   Q   + +++   ++  L +YI
Sbjct:   178 AVDDIITNELSFHSDDEEAKTKMTTTGLWDKSKYTIRTSQDKLDIVKQIQEEKDGLIMYI 237

Query:   383 GDSVGDLLCLLEADIGIVIGSS 404
             GDSV DLL LL  D    I  S
Sbjct:   238 GDSVTDLLPLLNVDFPCAIKGS 259


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.315   0.132   0.382    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      463       463   0.00096  118 3  11 22  0.43    34
                                                     35  0.45    37
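
The bit scores printed in parentheses on each Score line are consistent with the standard Karlin-Altschul normalization, bits = (Lambda * S - ln K) / ln 2, when the gapped Q=9,R=2 row above is used (Lambda 0.244, K 0.0300) rather than the ungapped BLOSUM62 row. The Python sketch below reproduces them. The function and constant names are illustrative, and the choice of the gapped row is inferred from the numbers agreeing, not stated by the report.

```python
import math

# "As Used" parameters from the Q=9,R=2 row of the Parameters table.
GAPPED_LAMBDA = 0.244
GAPPED_K = 0.0300


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


# Raw scores of the hits listed in this report.
for raw in (1733, 168, 148, 140, 130, 128):
    print(raw, round(bit_score(raw), 1))
```

Substituting the ungapped values (Lambda 0.315, K 0.132) gives roughly 790 bits for the top hit instead of the reported 615.1, which is why the gapped row is assumed here.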


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  7
  No. of states in DFA:  624 (66 KB)
  Total size of DFA:  282 KB (2148 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  43.54u 0.09s 43.63t   Elapsed:  00:00:02
  Total cpu time:  43.54u 0.09s 43.63t   Elapsed:  00:00:02
  Start:  Fri May 10 13:30:38 2013   End:  Fri May 10 13:30:40 2013
