BLAST Results

Query Summary

Your job contains 1 sequence.

Parameters
Threshold: 0.001
Maximum number of alignments shown: 100
BLAST filter: on
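
The threshold above matches the `E=0.001` setting in the raw parameters further down, so it is read here as an expectation (E) cutoff, while the hit tables report P values. The following is a minimal sketch, assuming the standard Poisson relation P = 1 - exp(-E) between the two quantities. The function name `p_from_e` is made up for illustration and is not part of any BLAST tool.

```python
import math

def p_from_e(e_value: float) -> float:
    """Probability of at least one chance hit, given an expected count E.

    Assumes chance hits follow a Poisson distribution, so P = 1 - exp(-E).
    expm1 keeps precision when E is very small.
    """
    return -math.expm1(-e_value)

# For small E the two values are nearly identical, which is why the
# report prints the same number for "Expect" and "P" on each hit.
for e in (0.001, 0.00084, 3.8e-110):
    print(f"E = {e:g}  ->  P = {p_from_e(e):.6g}")
```

At the cutoff itself P is only marginally below E, so filtering on either value selects essentially the same hits.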

Query Sequence

>018538
MSSVSFSSIFTNNGRSLGAPVSLLRTLASTHASCHRNIYKSNVVLSFSSSNRNFVCESWK
RHVFTHTDTAAIAAATTPSSYGYPEYHRLLPCPSQNCPPRVEHLVVSEGGPVLEYICREL
NLPPLFVADLIHFGAVYYALVCPKPPLTATPEQMRVFKEVTDPSVLSKRSSIKGKTVREA
QKTFRITHVDQIVEAGTYLRVHVHPKRFPRCYDIDWNSRIIAVTESHVVLDKPAGTSVGG
TTDNIEESCATFASRALGLTTPLRTTHQIDNCTEGCVVLARTQEYCSIFHRKIREKKVKK
LYLALTTAPLPVGIMTHYMRPINIAPRLVSEGKTIFGLALMIREMVWKFEACIL

High Scoring Gene Products

Symbol, full name                                                     Information                                 P value
AT4G21770                                                             protein from Arabidopsis thaliana           3.8e-110
DDB_G0277359, Pseudouridylate synthase PUS5                           gene from Dictyostelium discoideum          2.8e-05
APH_0549, ribosomal large subunit pseudouridine synthase C            protein from Anaplasma phagocytophilum HZ   0.00013
GSU_0671, ribosomal large subunit pseudouridine synthase C            protein from Geobacter sulfurreducens PCA   0.00032
orf19.2788                                                            gene_product from Candida albicans          0.00084
CaO19.2788, Putative uncharacterized protein                          protein from Candida albicans SC5314        0.00084
PFE1080w, ribosomal large subunit pseudouridylate synthase, putative  gene from Plasmodium falciparum             0.00088
PFE1080w, Ribosomal large subunit pseudouridylate synthase, putative  protein from Plasmodium falciparum 3D7      0.00088


Raw Blast Data

BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]

Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.

Reference:  Gish, W. (1996-2006) http://blast.wustl.edu

Query=  018538
        (354 letters)

Database:  go_20130330-seqdb.fasta
           368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done

                                                                     Smallest
                                                                       Sum
                                                              High  Probability
Sequences producing High-scoring Segment Pairs:              Score  P(N)      N

TAIR|locus:2119073 - symbol:AT4G21770 species:3702 "Arabi...  1088  3.8e-110  1
DICTYBASE|DDB_G0277359 - symbol:DDB_G0277359 "Pseudouridy...   127  2.8e-05   1
TIGR_CMR|APH_0549 - symbol:APH_0549 "ribosomal large subu...   119  0.00013   1
TIGR_CMR|GSU_0671 - symbol:GSU_0671 "ribosomal large subu...   116  0.00032   2
CGD|CAL0004292 - symbol:orf19.2788 species:5476 "Candida ...   114  0.00084   1
UNIPROTKB|Q59PV5 - symbol:CaO19.2788 "Putative uncharacte...   114  0.00084   1
GENEDB_PFALCIPARUM|PFE1080w - symbol:PFE1080w "ribosomal ...   115  0.00088   1
UNIPROTKB|Q8I3P7 - symbol:PFE1080w "Ribosomal large subun...   115  0.00088   1
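
The one-line hit summaries above follow a fixed layout: identifier, a description that may be truncated with `...`, then the high score, P(N), and N. The following is a minimal sketch of one way to pull those fields out with a regular expression. The `Hit` type, the `SUMMARY_LINE` pattern, and `parse_summary_line` are hypothetical names written for this report's layout, not part of any BLAST distribution.

```python
import re
from typing import NamedTuple, Optional

class Hit(NamedTuple):
    identifier: str   # e.g. "TAIR|locus:2119073"
    description: str  # possibly truncated with "..."
    score: int        # high score (raw)
    p_value: float    # smallest sum probability P(N)
    n: int            # number of segment pairs in the sum

# Identifier, " - ", free-text description, then three trailing numeric columns.
SUMMARY_LINE = re.compile(
    r"^(?P<identifier>\S+) - (?P<description>.*?)\s+"
    r"(?P<score>\d+)\s+(?P<p_value>\d\S*)\s+(?P<n>\d+)\s*$"
)

def parse_summary_line(line: str) -> Optional[Hit]:
    """Return a Hit for a summary row, or None for any other line."""
    m = SUMMARY_LINE.match(line)
    if m is None:
        return None
    return Hit(m["identifier"], m["description"],
               int(m["score"]), float(m["p_value"]), int(m["n"]))

# Two rows copied from the table above.
sample = [
    'TAIR|locus:2119073 - symbol:AT4G21770 species:3702 "Arabi...  1088  3.8e-110  1',
    'TIGR_CMR|GSU_0671 - symbol:GSU_0671 "ribosomal large subu...   116  0.00032   2',
]
hits = [h for h in map(parse_summary_line, sample) if h is not None]
for h in hits:
    print(h.identifier, h.score, h.p_value, h.n)
```

Lines that lack the ` - ` separator, such as the column header, return `None` and can be skipped.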


>TAIR|locus:2119073
            symbol:AT4G21770 species:3702 "Arabidopsis thaliana"
            [GO:0001522 "pseudouridine synthesis" evidence=IEA] [GO:0003723
            "RNA binding" evidence=IEA] [GO:0009451 "RNA modification"
            evidence=IEA] [GO:0009507 "chloroplast" evidence=ISM] [GO:0009982
            "pseudouridine synthase activity" evidence=IEA;ISS;TAS]
            InterPro:IPR006145 InterPro:IPR020103 Pfam:PF00849 PROSITE:PS01129
            PROSITE:PS50889 GO:GO:0009507 EMBL:CP002687
            GenomeReviews:CT486007_GR GO:GO:0003723 EMBL:AL161555 EMBL:AL035527
            GO:GO:0009982 GO:GO:0001522 SUPFAM:SSF55120 EMBL:AY142506
            EMBL:AK226645 IPI:IPI00544059 PIR:T05852 RefSeq:NP_193908.1
            UniGene:At.32599 ProteinModelPortal:Q9SVS0 PRIDE:Q9SVS0
            EnsemblPlants:AT4G21770.1 GeneID:828265 KEGG:ath:AT4G21770
            TAIR:At4g21770 eggNOG:NOG306455 HOGENOM:HOG000242495
            InParanoid:Q9SVS0 OMA:THQIDNC PhylomeDB:Q9SVS0
            ProtClustDB:CLSN2685595 Genevestigator:Q9SVS0 Uniprot:Q9SVS0
        Length = 472

 Score = 1088 (388.1 bits), Expect = 3.8e-110, P = 3.8e-110
 Identities = 213/327 (65%), Positives = 255/327 (77%)

Query:     7 SSIFTNNGRSLGAPVSLLRTLASTHASCHRNIYKSNVVLS-FSSSNRNFVCESWKRHVFX 65
             S   T   R+L APVSLLRTLAST  +    +++SN     F SS + F C S  +    
Sbjct:     3 SPALTGGYRNLTAPVSLLRTLASTRVTTP--LFRSNKHSPRFISSPKRFTCLSLLK---- 56

Query:    66 XXXXXXXXXXXXPSSYGYPEYHRLLPCPSQNCPPRVEHLVVSEGGPVL-EYICRELNLPP 124
                          S+ GY EY+RL+PCP+ N PPR+EH+VV E   ++ E+I ++L+LPP
Sbjct:    57 -TDSQNQTTLSSSSNSGYHEYNRLMPCPAYNLPPRIEHMVVLEDDVLVSEFISKQLDLPP 115

Query:   125 LFVADLIHFGAVYYALVCPKPPLTATPEQMRVFKEVTDPSVLSKRSSIKGKTVREAQKTF 184
             L+VADLI FGAV+YALVCPKPP TATPE++ +F+EVT PSVL KRSSIKGKTVREAQKTF
Sbjct:   116 LYVADLIRFGAVHYALVCPKPPPTATPEEIILFEEVTSPSVLKKRSSIKGKTVREAQKTF 175

Query:   185 RITHVDQIVEAGTYLRVHVHPKRFPRCYDIDWNSRIIAVTESHVVLDKPAGTSVGGTTDN 244
             R+TH +Q  EAGTYLRVHVHPKR PRCY+IDW SRI+AVT+S+V+LDKPAGT+VGGTTDN
Sbjct:   176 RVTHTNQYAEAGTYLRVHVHPKRSPRCYEIDWKSRIVAVTDSYVILDKPAGTTVGGTTDN 235

Query:   245 IEESCATFASRALGLTTPLRTTHQIDNCTEGCVVLARTQEYCSIFHRKIREKKVKKLYLA 304
             IEESCATFASRAL L  PL+TTHQIDNCTEGCVV ART+EYCS+FH KIR K+VKKLY A
Sbjct:   236 IEESCATFASRALDLPEPLKTTHQIDNCTEGCVVFARTKEYCSVFHTKIRNKEVKKLYRA 295

Query:   305 LTTAPLPVGIMTHYMRPINIAPRLVSE 331
             L  APLP+GI++HYMRP N+APRLV+E
Sbjct:   296 LAAAPLPIGIISHYMRPKNMAPRLVAE 322


>DICTYBASE|DDB_G0277359
            symbol:DDB_G0277359 "Pseudouridylate synthase PUS5"
            species:44689 "Dictyostelium discoideum" [GO:0009982 "pseudouridine
            synthase activity" evidence=IEA] [GO:0009451 "RNA modification"
            evidence=IEA] [GO:0003723 "RNA binding" evidence=IEA] [GO:0001522
            "pseudouridine synthesis" evidence=IEA] InterPro:IPR006145
            InterPro:IPR020103 Pfam:PF00849 dictyBase:DDB_G0277359
            GO:GO:0003723 EMBL:AAFI02000019 GO:GO:0009982 GO:GO:0001522
            SUPFAM:SSF55120 eggNOG:COG0564 KO:K01718 RefSeq:XP_642774.1
            ProteinModelPortal:Q86K31 STRING:Q86K31 EnsemblProtists:DDB0217957
            GeneID:8620965 KEGG:ddi:DDB_G0277359 InParanoid:Q86K31 OMA:YIIFENQ
            Uniprot:Q86K31
        Length = 386

 Score = 127 (49.8 bits), Expect = 2.8e-05, P = 2.8e-05
 Identities = 45/169 (26%), Positives = 74/169 (43%)

Query:   179 EAQKTFRI---THVDQIVEAGTYLRVHVHPKRFPRCYDIDW---NSRIIAVTESHVVLDK 232
             E  KT  I     +D+++  G+ L+   +P R      ID+   N  II   +  +V+DK
Sbjct:    71 EVTKTKTIPKRVQIDKVINKGSLLKYFFYP-RVNSTDHIDFKNFNKYIIFENQYFLVIDK 129

Query:   233 PAGTSVGGTTDNIEESCATFASRALGLTTP-------LRTTHQIDNCTEGCVVLARTQEY 285
             P G +VG   D++  +  +F    L            L   H +D+ T G  V+A+   +
Sbjct:   130 PHGINVGPIVDHLHNNFTSFVKDWLKKRNNNNDNNLILYNPHNLDSPTRGLFVIAKDFNF 189

Query:   286 CSIFHRKIREKKVKKLYLALT-------TAPLPVGIMTHYMRPINIAPR 327
              S F++ + +KKV K Y A            +  GI  H+M   N +P+
Sbjct:   190 LSKFNKLLSDKKVSKKYKAFIPIKENEENIEIKPGIYKHFMEITNHSPK 238


>TIGR_CMR|APH_0549
            symbol:APH_0549 "ribosomal large subunit pseudouridine
            synthase C" species:212042 "Anaplasma phagocytophilum HZ"
            [GO:0000154 "rRNA modification" evidence=ISS] [GO:0001522
            "pseudouridine synthesis" evidence=ISS] [GO:0006400 "tRNA
            modification" evidence=ISS] [GO:0009982 "pseudouridine synthase
            activity" evidence=ISS] InterPro:IPR002942 InterPro:IPR006145
            InterPro:IPR006224 InterPro:IPR020103 Pfam:PF00849 PROSITE:PS01129
            PROSITE:PS50889 SMART:SM00363 GO:GO:0003723 EMBL:CP000235
            GenomeReviews:CP000235_GR GO:GO:0009982 GO:GO:0001522
            SUPFAM:SSF55120 Gene3D:3.10.290.10 eggNOG:COG0564
            HOGENOM:HOG000275914 KO:K06179 RefSeq:YP_505141.1
            ProteinModelPortal:Q2GKG0 STRING:Q2GKG0 GeneID:3930562
            KEGG:aph:APH_0549 PATRIC:20949758 OMA:YMIATIN
            ProtClustDB:CLSK747316 BioCyc:APHA212042:GHPM-573-MONOMER
            Uniprot:Q2GKG0
        Length = 297

 Score = 119 (46.9 bits), Expect = 0.00013, P = 0.00013
 Identities = 44/180 (24%), Positives = 79/180 (43%)

Query:   166 LSKRSSIKGKTVREAQKTFRITHVDQIVEAGTYLRVHVHPKRFPRCYDIDWNSRIIAVTE 225
             L+   ++    V+    T  I +VD+I       +  VH     R      +  II   E
Sbjct:    30 LNSSKALASSRVQRGD-TVSIRNVDKIASRSKEAK-EVHDDALLRLV----SDNIIYKDE 83

Query:   226 SHVVLDKPAGTSVGGTTDNIEESCATFASRALGLTTPLRTTHQIDNCTEGCVVLARTQEY 285
             + + ++KPAG +V G T  +  S +   ++ +     L   H++D  T G +++AR Q  
Sbjct:    84 NIIAINKPAGVNVQGGT-KVSISLSDLLNKIIP-GEQLHIVHRLDRDTSGVLIIARGQSI 141

Query:   286 CSIFHRKIREKKVKKLYLALTTAPLPVGIMTHYMRPI------NIAPRLV-SEGKTIFGL 338
               I   ++R ++V+K Y+A+T   +P         PI       +   L+  E KT+F +
Sbjct:   142 ARILSEELRFRRVRKEYIAVTKG-VPAFSKGEITMPIYCKKQSGVGETLIPKEAKTVFSV 200


>TIGR_CMR|GSU_0671
            symbol:GSU_0671 "ribosomal large subunit pseudouridine
            synthase C" species:243231 "Geobacter sulfurreducens PCA"
            [GO:0000154 "rRNA modification" evidence=ISS] [GO:0009982
            "pseudouridine synthase activity" evidence=ISS] InterPro:IPR002942
            InterPro:IPR006145 InterPro:IPR020103 Pfam:PF00849 PROSITE:PS50889
            GO:GO:0003723 EMBL:AE017180 GenomeReviews:AE017180_GR GO:GO:0009982
            GO:GO:0001522 SUPFAM:SSF55120 HOGENOM:HOG000275919 KO:K06179
            RefSeq:NP_951728.1 ProteinModelPortal:Q74FD8 GeneID:2685543
            KEGG:gsu:GSU0671 PATRIC:22024099 OMA:MKIEIGA ProtClustDB:CLSK827983
            BioCyc:GSUL243231:GH27-690-MONOMER Uniprot:Q74FD8
        Length = 296

 Score = 116 (45.9 bits), Expect = 0.00032, Sum P(2) = 0.00032
 Identities = 27/87 (31%), Positives = 45/87 (51%)

Query:   228 VVLDKPAGTSVGGTTDNIEESCATFASRAL---GLTTPLRTTHQIDNCTEGCVVLARTQE 284
             V+++KPAG  V  T +  E +    A   +   G    LR  +++D  T G  +LA++  
Sbjct:    88 VIVNKPAGLPVHRTAEAGEANLVELAEHFMAGRGTAVKLRPVNRLDRGTSGATILAKSSS 147

Query:   285 YCSIFHRKIREKKVKKLYLALTTAPLP 311
                +  R ++E+ + KLYLA+T   LP
Sbjct:   148 SAGMLGRYVKEEGLDKLYLAVTDGSLP 174

 Score = 37 (18.1 bits), Expect = 0.00032, Sum P(2) = 0.00032
 Identities = 7/18 (38%), Positives = 12/18 (66%)

Query:   183 TFRITHVDQIVEAGTYLR 200
             T++IT +D    A ++LR
Sbjct:     3 TYQITDIDHCRSAESFLR 20


>CGD|CAL0004292
            symbol:orf19.2788 species:5476 "Candida albicans" [GO:0005575
            "cellular_component" evidence=ND] InterPro:IPR006145
            InterPro:IPR006224 InterPro:IPR020103 Pfam:PF00849 PROSITE:PS01129
            CGD:CAL0004292 GO:GO:0003723 GO:GO:0009982 GO:GO:0001522
            SUPFAM:SSF55120 eggNOG:COG0564 EMBL:AACQ01000185 RefSeq:XP_711730.1
            ProteinModelPortal:Q59PV5 STRING:Q59PV5 GeneID:3646675
            KEGG:cal:CaO19.2788 Uniprot:Q59PV5
        Length = 402

 Score = 114 (45.2 bits), Expect = 0.00084, P = 0.00084
 Identities = 36/114 (31%), Positives = 58/114 (50%)

Query:   214 IDWNSRII-AVTESH--VVLDKPAGTSVGGTTDNIEESCATFASRALGLTTPLRTTHQID 270
             I W++ II +V E    +V+DKP+G     T +    S +    + L + + + T H++D
Sbjct:   100 IPWHNDIIKSVYEDDEILVVDKPSGIPTHPTGNYYYNSLSEIIKQQLNMDS-IWTCHRLD 158

Query:   271 NCTEGCVVLARTQEYCSIFHRKIREKK--VKKLYLALTTAPLPVGIMTHYMRPI 322
               T G +VLA+T+     F + +R+ K   +K YLA  +   P GI   Y  PI
Sbjct:   159 KVTSGVLVLAKTKNGGVRFSKLMRDSKQHTEKTYLARVSGKFPEGIH-RYRCPI 211


>UNIPROTKB|Q59PV5
            symbol:CaO19.2788 "Putative uncharacterized protein"
            species:237561 "Candida albicans SC5314" [GO:0005575
            "cellular_component" evidence=ND] InterPro:IPR006145
            InterPro:IPR006224 InterPro:IPR020103 Pfam:PF00849 PROSITE:PS01129
            CGD:CAL0004292 GO:GO:0003723 GO:GO:0009982 GO:GO:0001522
            SUPFAM:SSF55120 eggNOG:COG0564 EMBL:AACQ01000185 RefSeq:XP_711730.1
            ProteinModelPortal:Q59PV5 STRING:Q59PV5 GeneID:3646675
            KEGG:cal:CaO19.2788 Uniprot:Q59PV5
        Length = 402

 Score = 114 (45.2 bits), Expect = 0.00084, P = 0.00084
 Identities = 36/114 (31%), Positives = 58/114 (50%)

Query:   214 IDWNSRII-AVTESH--VVLDKPAGTSVGGTTDNIEESCATFASRALGLTTPLRTTHQID 270
             I W++ II +V E    +V+DKP+G     T +    S +    + L + + + T H++D
Sbjct:   100 IPWHNDIIKSVYEDDEILVVDKPSGIPTHPTGNYYYNSLSEIIKQQLNMDS-IWTCHRLD 158

Query:   271 NCTEGCVVLARTQEYCSIFHRKIREKK--VKKLYLALTTAPLPVGIMTHYMRPI 322
               T G +VLA+T+     F + +R+ K   +K YLA  +   P GI   Y  PI
Sbjct:   159 KVTSGVLVLAKTKNGGVRFSKLMRDSKQHTEKTYLARVSGKFPEGIH-RYRCPI 211


>GENEDB_PFALCIPARUM|PFE1080w
            symbol:PFE1080w "ribosomal large subunit
            pseudouridylate synthase, putative" species:5833 "Plasmodium
            falciparum" [GO:0004730 "pseudouridylate synthase activity"
            evidence=ISS] [GO:0006364 "rRNA processing" evidence=ISS]
            InterPro:IPR006145 InterPro:IPR020103 Pfam:PF00849 GO:GO:0003723
            GO:GO:0006364 EMBL:AL844504 GO:GO:0009982 SUPFAM:SSF55120
            GO:GO:0004730 KO:K01718 RefSeq:XP_001351773.1
            ProteinModelPortal:Q8I3P7 EnsemblProtists:PFE1080w:mRNA
            GeneID:813031 KEGG:pfa:PFE1080w EuPathDB:PlasmoDB:PF3D7_0521500
            HOGENOM:HOG000283688 ProtClustDB:CLSZ2432216 Uniprot:Q8I3P7
        Length = 487

 Score = 115 (45.5 bits), Expect = 0.00088, P = 0.00088
 Identities = 32/106 (30%), Positives = 53/106 (50%)

Query:   219 RIIAVTESHVVLDKPAGTSVG-GTTDNIEESCATFASRALGLTTPLRTTHQIDNCTEGCV 277
             +II   +  +VL+KP    +  G  D+I  S  T   +   L    R   Q+D  T G +
Sbjct:    65 KIIYENDYFLVLNKPYDIKLEKGKLDDIYPSVETLLYQKRKLDV-FRICGQLDYATSGLL 123

Query:   278 VLARTQEYCSIFHRKIREKKVKKLYLALTTAPLPVGIMTHYMRPIN 323
             ++A+ +  C+I +  I  K + K+YLA+    LP+ I+ H   PI+
Sbjct:   124 IVAKDKLSCNILNYNIESKNISKIYLAILYGHLPLDIL-HINTPIS 168


>UNIPROTKB|Q8I3P7
            symbol:PFE1080w "Ribosomal large subunit pseudouridylate
            synthase, putative" species:36329 "Plasmodium falciparum 3D7"
            [GO:0004730 "pseudouridylate synthase activity" evidence=ISS]
            [GO:0006364 "rRNA processing" evidence=ISS] InterPro:IPR006145
            InterPro:IPR020103 Pfam:PF00849 GO:GO:0003723 GO:GO:0006364
            EMBL:AL844504 GO:GO:0009982 SUPFAM:SSF55120 GO:GO:0004730 KO:K01718
            RefSeq:XP_001351773.1 ProteinModelPortal:Q8I3P7
            EnsemblProtists:PFE1080w:mRNA GeneID:813031 KEGG:pfa:PFE1080w
            EuPathDB:PlasmoDB:PF3D7_0521500 HOGENOM:HOG000283688
            ProtClustDB:CLSZ2432216 Uniprot:Q8I3P7
        Length = 487

 Score = 115 (45.5 bits), Expect = 0.00088, P = 0.00088
 Identities = 32/106 (30%), Positives = 53/106 (50%)

Query:   219 RIIAVTESHVVLDKPAGTSVG-GTTDNIEESCATFASRALGLTTPLRTTHQIDNCTEGCV 277
             +II   +  +VL+KP    +  G  D+I  S  T   +   L    R   Q+D  T G +
Sbjct:    65 KIIYENDYFLVLNKPYDIKLEKGKLDDIYPSVETLLYQKRKLDV-FRICGQLDYATSGLL 123

Query:   278 VLARTQEYCSIFHRKIREKKVKKLYLALTTAPLPVGIMTHYMRPIN 323
             ++A+ +  C+I +  I  K + K+YLA+    LP+ I+ H   PI+
Sbjct:   124 IVAKDKLSCNILNYNIESKNISKIYLAILYGHLPLDIL-HINTPIS 168


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.322   0.136   0.418    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      354       341   0.00095  116 3  11 22  0.49    33
                                                     33  0.45    37
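
The Lambda and K values in the matrix table relate each raw score to the bit score printed beside it in the alignment headers. The following is a minimal sketch using the standard Karlin-Altschul normalization, bits = (Lambda * S - ln K) / ln 2. Taking the parameters from the gapped row (`Q=9,R=2`: Lambda = 0.244, K = 0.0300) is an inference from the fact that they reproduce the reported bit scores; the report does not state which row it used. The function name `bit_score` is hypothetical.

```python
import math

def bit_score(raw_score: int, lam: float, k: float) -> float:
    """Normalize a raw alignment score to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)

# Gapped parameters from the "Q=9,R=2" row (assumed to be the ones applied).
LAMBDA_GAPPED = 0.244
K_GAPPED = 0.0300

# Raw score -> bit score pairs as printed in the alignment headers.
reported = {1088: 388.1, 127: 49.8, 119: 46.9, 116: 45.9,
            114: 45.2, 115: 45.5, 37: 18.1}

for raw, bits in reported.items():
    computed = bit_score(raw, LAMBDA_GAPPED, K_GAPPED)
    print(f"S = {raw:4d}  computed = {computed:6.1f} bits  reported = {bits} bits")
```

Bit scores computed this way are independent of the scoring system, which makes them comparable across searches run with different matrices or gap penalties.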


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  8
  No. of states in DFA:  621 (66 KB)
  Total size of DFA:  248 KB (2133 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  30.13u 0.18s 30.31t   Elapsed:  00:00:02
  Total cpu time:  30.13u 0.18s 30.31t   Elapsed:  00:00:02
  Start:  Mon May 20 17:56:55 2013   End:  Mon May 20 17:56:57 2013
