BLAST Results

Query Summary

Your job contains 1 sequence.

Parameters
Threshold: 0.001
Maximum number of alignments shown: 100
BLAST filter: on

Query Sequence

>025000
MYWKVTNPTLSPSHLQDLPGFTRSVYKRDHALITPESHVLSPLPEWTNTLGAYLITPAMG
SHFVMYLANMQENARSALPPHDVERFIFVVQGSAMLTNASGVSSKLMVDSYTYLPPNFAH
SLRAEGSATLVVFERRYASLENHITEQIVGSTDKQPLLETPGEVFQLRKLLPQAVPFDFN
IHIMDFQPGDFLNVKEVHYNQHGLLLLEGQGIYRLGDSWYPVQAGDVLWMAPFVPQWYAA
LGKTRTRYLLYKDVNRNPL

High Scoring Gene Products

Symbol      Full name                        Information                               P value
UGLYAH      AT4G17050                        protein from Arabidopsis thaliana         3.0e-117
allE        S-ureidoglycine aminohydrolase   protein from Escherichia coli K-12        1.2e-26
SPO0876     Uncharacterized protein          protein from Ruegeria pomeroyi DSS-3      7.6e-25
SPO_0876    conserved hypothetical protein   protein from Ruegeria pomeroyi DSS-3      7.6e-25
PFL_3803    Uncharacterized protein          protein from Pseudomonas protegens Pf-5   6.8e-24


Raw Blast Data

BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]

Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.

Reference:  Gish, W. (1996-2006) http://blast.wustl.edu

Query=  025000
        (259 letters)

Database:  go_20130330-seqdb.fasta
           368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done

                                                                     Smallest
                                                                       Sum
                                                              High  Probability
Sequences producing High-scoring Segment Pairs:              Score  P(N)      N

TAIR|locus:2130459 - symbol:UGLYAH "AT4G17050" species:37...  1155  3.0e-117  1
UNIPROTKB|P75713 - symbol:allE "S-ureidoglycine aminohydr...   300  1.2e-26   1
UNIPROTKB|Q5LV26 - symbol:SPO0876 "Uncharacterized protei...   283  7.6e-25   1
TIGR_CMR|SPO_0876 - symbol:SPO_0876 "conserved hypothetic...   283  7.6e-25   1
UNIPROTKB|Q4KA30 - symbol:PFL_3803 "Uncharacterized prote...   274  6.8e-24   1
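The one-line hit summaries above follow a regular layout (database identifier, symbol, truncated description, score, P(N), N), so they can be read programmatically. The following is a minimal Python sketch, not part of the BLAST output; the `Hit` type, the `parse_summary` helper, and the regular expression are illustrative assumptions fitted to the five lines shown here, and other reports may need a different pattern.

```python
import re
from typing import NamedTuple


class Hit(NamedTuple):
    """One row of the hit summary (hypothetical container for this sketch)."""
    db_id: str
    symbol: str
    score: int
    p_value: float
    n: int


# Summary lines copied from the report; descriptions are truncated with "...".
SUMMARY_LINES = [
    'TAIR|locus:2130459 - symbol:UGLYAH "AT4G17050" species:37...  1155  3.0e-117  1',
    'UNIPROTKB|P75713 - symbol:allE "S-ureidoglycine aminohydr...   300  1.2e-26   1',
    'UNIPROTKB|Q5LV26 - symbol:SPO0876 "Uncharacterized protei...   283  7.6e-25   1',
    'TIGR_CMR|SPO_0876 - symbol:SPO_0876 "conserved hypothetic...   283  7.6e-25   1',
    'UNIPROTKB|Q4KA30 - symbol:PFL_3803 "Uncharacterized prote...   274  6.8e-24   1',
]

# id, symbol, free-text description, then the three trailing numeric columns.
LINE_RE = re.compile(r'^(\S+) - symbol:(\S+)\s+(.*?)\s+(\d+)\s+(\S+)\s+(\d+)\s*$')


def parse_summary(lines):
    """Return a Hit for every line that matches the assumed layout."""
    hits = []
    for line in lines:
        match = LINE_RE.match(line)
        if match is None:
            continue  # skip headers, blank lines, or unexpected layouts
        db_id, symbol, _desc, score, p_value, n = match.groups()
        hits.append(Hit(db_id, symbol, int(score), float(p_value), int(n)))
    return hits


hits = parse_summary(SUMMARY_LINES)
for hit in hits:
    print(f"{hit.symbol:10s} score={hit.score:5d} P={hit.p_value:.1e}")
```

Note that SPO0876 and SPO_0876 come from different source databases (UniProtKB and TIGR_CMR) but carry the same score and P value, so a downstream step may want to group hits that resolve to the same protein.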


>TAIR|locus:2130459
            symbol:UGLYAH "AT4G17050" species:3702 "Arabidopsis
            thaliana" [GO:0003700 "sequence-specific DNA binding transcription
            factor activity" evidence=ISS] [GO:0006355 "regulation of
            transcription, DNA-dependent" evidence=TAS] [GO:0000256 "allantoin
            catabolic process" evidence=IDA] [GO:0071522 "ureidoglycine
            aminohydrolase activity" evidence=IDA] [GO:0000023 "maltose
            metabolic process" evidence=RCA] [GO:0019252 "starch biosynthetic
            process" evidence=RCA] [GO:0043085 "positive regulation of
            catalytic activity" evidence=RCA] Gene3D:2.60.120.10
            InterPro:IPR014710 InterPro:IPR011051 SUPFAM:SSF51182 EMBL:CP002687
            GO:GO:0003700 GO:GO:0000256 InterPro:IPR013096 Pfam:PF07883
            KO:K14977 GO:GO:0071522 EMBL:GQ303359 EMBL:AK118019 IPI:IPI00541898
            RefSeq:NP_193438.2 UniGene:At.44735 PDB:4E2Q PDB:4E2S PDBsum:4E2Q
            PDBsum:4E2S HSSP:P75713 ProteinModelPortal:Q8GXV5 SMR:Q8GXV5
            STRING:Q8GXV5 PRIDE:Q8GXV5 EnsemblPlants:AT4G17050.1 GeneID:827413
            KEGG:ath:AT4G17050 TAIR:At4g17050 InParanoid:Q8GXV5 OMA:GSTDKQP
            PhylomeDB:Q8GXV5 ProtClustDB:CLSN2690303 Genevestigator:Q8GXV5
            Uniprot:Q8GXV5
        Length = 298

 Score = 1155 (411.6 bits), Expect = 3.0e-117, P = 3.0e-117
 Identities = 214/259 (82%), Positives = 229/259 (88%)

Query:     1 MYWKVTNPTLSPSHLQDLPGFTRSVYKRDHALITPESHVLSPLPEWTNTLGAYLITPAMG 60
             +YWK TNPTLSPSHLQDLPGFTRSVYKRDHALITPESHV SPLP+WTNTLGAYLITPA G
Sbjct:    40 IYWKATNPTLSPSHLQDLPGFTRSVYKRDHALITPESHVYSPLPDWTNTLGAYLITPATG 99

Query:    61 SHFVMYLANMQENARSALPPHDVERFIFVVQGSAMLTNASGVSSKLMVDSYTYLPPNFAH 120
             SHFVMYLA M+E + S LPP D+ER IFVV+G+  LTN S  S KL VDSY YLPPNF H
Sbjct:   100 SHFVMYLAKMKEMSSSGLPPQDIERLIFVVEGAVTLTNTSSSSKKLTVDSYAYLPPNFHH 159

Query:   121 SLRAEGSATLVVFERRYASLENHITEQIVGSTDKQPLLETPGEVFQLRKLLPQAVPFDFN 180
             SL    SATLVVFERRY  L +H TE IVGSTDKQPLLETPGEVF+LRKLLP +V +DFN
Sbjct:   160 SLDCVESATLVVFERRYEYLGSHTTELIVGSTDKQPLLETPGEVFELRKLLPMSVAYDFN 219

Query:   181 IHIMDFQPGDFLNVKEVHYNQHGLLLLEGQGIYRLGDSWYPVQAGDVLWMAPFVPQWYAA 240
             IH MDFQPG+FLNVKEVHYNQHGLLLLEGQGIYRLGD+WYPVQAGDV+WMAPFVPQWYAA
Sbjct:   220 IHTMDFQPGEFLNVKEVHYNQHGLLLLEGQGIYRLGDNWYPVQAGDVIWMAPFVPQWYAA 279

Query:   241 LGKTRTRYLLYKDVNRNPL 259
             LGKTR+RYLLYKDVNRNPL
Sbjct:   280 LGKTRSRYLLYKDVNRNPL 298
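The "Identities" and "Positives" lines report a count over the aligned length plus an integer percentage. The short Python sketch below recounts identities for the final block of the alignment above and reproduces the reported percentages. Judging only from the values on this page (for example 81/248 shown as 32%), the percentages appear to be truncated rather than rounded; that reading, and the helper names `count_identities` and `percent`, are assumptions of this sketch.

```python
def count_identities(query: str, sbjct: str) -> int:
    """Count columns where both aligned rows carry the same residue."""
    return sum(1 for q, s in zip(query, sbjct) if q == s and q != "-")


def percent(matches: int, aligned: int) -> int:
    """Integer percentage, truncated toward zero (assumed reporting rule)."""
    return (100 * matches) // aligned


# Final block of the first alignment (query 241-259, subject 280-298).
query_block = "LGKTRTRYLLYKDVNRNPL"
sbjct_block = "LGKTRSRYLLYKDVNRNPL"

block_identities = count_identities(query_block, sbjct_block)
print(block_identities, "of", len(query_block), "positions identical")

# Whole-alignment figures as reported for the first hit.
print(percent(214, 259), percent(229, 259))
```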


>UNIPROTKB|P75713
            symbol:allE "S-ureidoglycine aminohydrolase" species:83333
            "Escherichia coli K-12" [GO:0071522 "ureidoglycine aminohydrolase
            activity" evidence=IDA] [GO:0030145 "manganese ion binding"
            evidence=IDA] Gene3D:2.60.120.10 InterPro:IPR014710
            InterPro:IPR011051 SUPFAM:SSF51182 EMBL:U00096 EMBL:AP009048
            GenomeReviews:AP009048_GR GenomeReviews:U00096_GR GO:GO:0030145
            EMBL:U82664 EMBL:U89279 InterPro:IPR013096 Pfam:PF07883 PIR:B64783
            RefSeq:NP_415048.1 RefSeq:YP_488805.1 PDB:1RC6 PDBsum:1RC6
            ProteinModelPortal:P75713 SMR:P75713 DIP:DIP-12689N IntAct:P75713
            EnsemblBacteria:EBESCT00000000049 EnsemblBacteria:EBESCT00000017164
            GeneID:12933022 GeneID:945149 KEGG:ecj:Y75_p0501 KEGG:eco:b0515
            PATRIC:32116187 EchoBASE:EB3387 EcoGene:EG13622 eggNOG:COG3257
            HOGENOM:HOG000219622 KO:K14977 OMA:KDVNRHA ProtClustDB:PRK11171
            BioCyc:EcoCyc:G6284-MONOMER BioCyc:ECOL316407:JW0503-MONOMER
            BioCyc:MetaCyc:G6284-MONOMER EvolutionaryTrace:P75713
            Genevestigator:P75713 GO:GO:0071522 InterPro:IPR017627
            InterPro:IPR008579 Pfam:PF05899 TIGRFAMs:TIGR03214 Uniprot:P75713
        Length = 261

 Score = 300 (110.7 bits), Expect = 1.2e-26, P = 1.2e-26
 Identities = 74/245 (30%), Positives = 125/245 (51%)

Query:    23 RSVYKRDH-ALITPESHVLSPLPEWTNTLGAYLITPAMGSHFVMYLANMQENA--RSALP 79
             R++ K  + AL+TP+  V + +P + N     L TP +G+ FV YL  + +N   +    
Sbjct:    18 RAIVKHGNFALLTPDGLVKNIIPGFENCDATILSTPKLGASFVDYLVTLHQNGGNQQGFG 77

Query:    80 PHDVERFIFVVQGSAMLTNASGVSSKLMVDSYTYLPPN----FAHSLRAEGSATLVVFER 135
                +E F++V+ G+ +   A G +  L    Y Y PP     F ++ +AE S  + +++R
Sbjct:    78 GEGIETFLYVISGN-ITAKAEGKTFALSEGGYLYCPPGSLMTFVNA-QAEDSQ-IFLYKR 134

Query:   136 RYASLENHITEQIVGSTDKQPLLETPG--EVFQLRKLLPQAVPFDFNIHIMDFQPGDFLN 193
             RY  +E +    + G+  +   +   G  +V  L   LP+ + FD N+HI+ F PG    
Sbjct:   135 RYVPVEGYAPWLVSGNASELERIHYEGMDDVILL-DFLPKELGFDMNMHILSFAPGASHG 193

Query:   194 VKEVHYNQHGLLLLEGQGIYRLGDSWYPVQAGDVLWMAPFVPQWYAALGKTRT-RYLLYK 252
               E H  +HG  +L GQG+Y L ++W PV+ GD ++M  +  Q    +G+     Y+  K
Sbjct:   194 YIETHVQEHGAYILSGQGVYNLDNNWIPVKKGDYIFMGAYSLQAGYGVGRGEAFSYIYSK 253

Query:   253 DVNRN 257
             D NR+
Sbjct:   254 DCNRD 258


>UNIPROTKB|Q5LV26
            symbol:SPO0876 "Uncharacterized protein" species:246200
            "Ruegeria pomeroyi DSS-3" [GO:0003674 "molecular_function"
            evidence=ND] [GO:0005575 "cellular_component" evidence=ND]
            [GO:0008150 "biological_process" evidence=ND] Gene3D:2.60.120.10
            InterPro:IPR014710 InterPro:IPR011051 SUPFAM:SSF51182 EMBL:CP000031
            GenomeReviews:CP000031_GR InterPro:IPR013096 Pfam:PF07883
            HOGENOM:HOG000219622 KO:K14977 OMA:KDVNRHA ProtClustDB:PRK11171
            InterPro:IPR017627 TIGRFAMs:TIGR03214 RefSeq:YP_166129.1
            ProteinModelPortal:Q5LV26 SMR:Q5LV26 GeneID:3195641
            KEGG:sil:SPO0876 PATRIC:23375037 Uniprot:Q5LV26
        Length = 275

 Score = 283 (104.7 bits), Expect = 7.6e-25, P = 7.6e-25
 Identities = 81/248 (32%), Positives = 118/248 (47%)

Query:    23 RSVYKRDHALI---TPESHVLSPLPEWTNTLGAYLITPAMG--SHFVMYLANMQENARSA 77
             R+++ + +A+I   T    V S LP W  T    L  P  G    F  Y+  +Q    S 
Sbjct:    23 RAMFTQAYAVIPKGTMRDIVTSKLPFWQGTRAWILARPLSGFAETFSHYIMEVQPGGGSD 82

Query:    78 LPPHD--VERFIFVVQGSAMLTNASGVSSKLMVDSYTYLPPNFAHSLRAE-GSATLVVFE 134
              P  D   +  +FVV+G+  LT   G    L    Y Y+P      LR   G A    + 
Sbjct:    83 RPDTDPGAQAVLFVVEGAVTLT-LDGAEHVLEPGGYAYIPAGHPWRLRNHAGPAARFHWI 141

Query:   135 RR-YASLE--NHITEQIVGSTDKQP--LLETPGEVFQLRKLLPQAVPFDFNIHIMDFQPG 189
             R+ Y ++E  +  T  ++   D  P  +  T G     R + P+ +  D ++ I++F PG
Sbjct:   142 RKAYEAVEGIDPPTPFVIREQDVTPNEMPGTEGRWSTTRFVDPEDMRHDMHVTIVNFLPG 201

Query:   190 DFLNVKEVHYNQHGLLLLEGQGIYRLGDSWYPVQAGDVLWMAPFVPQWYAALGKTRTRYL 249
               +   E H  +HGL +LEG+ +YRL   W  V+AGD +W+  F PQ   A G    RYL
Sbjct:   202 GVIPFAETHVMEHGLYVLEGKAVYRLNQDWVEVEAGDYMWLRAFCPQACYAGGPGPFRYL 261

Query:   250 LYKDVNRN 257
             LYKDVNR+
Sbjct:   262 LYKDVNRH 269


>TIGR_CMR|SPO_0876
            symbol:SPO_0876 "conserved hypothetical protein"
            species:246200 "Ruegeria pomeroyi DSS-3" [GO:0003674
            "molecular_function" evidence=ND] [GO:0005575 "cellular_component"
            evidence=ND] [GO:0008150 "biological_process" evidence=ND]
            Gene3D:2.60.120.10 InterPro:IPR014710 InterPro:IPR011051
            SUPFAM:SSF51182 EMBL:CP000031 GenomeReviews:CP000031_GR
            InterPro:IPR013096 Pfam:PF07883 HOGENOM:HOG000219622 KO:K14977
            OMA:KDVNRHA ProtClustDB:PRK11171 InterPro:IPR017627
            TIGRFAMs:TIGR03214 RefSeq:YP_166129.1 ProteinModelPortal:Q5LV26
            SMR:Q5LV26 GeneID:3195641 KEGG:sil:SPO0876 PATRIC:23375037
            Uniprot:Q5LV26
        Length = 275

 Score = 283 (104.7 bits), Expect = 7.6e-25, P = 7.6e-25
 Identities = 81/248 (32%), Positives = 118/248 (47%)

Query:    23 RSVYKRDHALI---TPESHVLSPLPEWTNTLGAYLITPAMG--SHFVMYLANMQENARSA 77
             R+++ + +A+I   T    V S LP W  T    L  P  G    F  Y+  +Q    S 
Sbjct:    23 RAMFTQAYAVIPKGTMRDIVTSKLPFWQGTRAWILARPLSGFAETFSHYIMEVQPGGGSD 82

Query:    78 LPPHD--VERFIFVVQGSAMLTNASGVSSKLMVDSYTYLPPNFAHSLRAE-GSATLVVFE 134
              P  D   +  +FVV+G+  LT   G    L    Y Y+P      LR   G A    + 
Sbjct:    83 RPDTDPGAQAVLFVVEGAVTLT-LDGAEHVLEPGGYAYIPAGHPWRLRNHAGPAARFHWI 141

Query:   135 RR-YASLE--NHITEQIVGSTDKQP--LLETPGEVFQLRKLLPQAVPFDFNIHIMDFQPG 189
             R+ Y ++E  +  T  ++   D  P  +  T G     R + P+ +  D ++ I++F PG
Sbjct:   142 RKAYEAVEGIDPPTPFVIREQDVTPNEMPGTEGRWSTTRFVDPEDMRHDMHVTIVNFLPG 201

Query:   190 DFLNVKEVHYNQHGLLLLEGQGIYRLGDSWYPVQAGDVLWMAPFVPQWYAALGKTRTRYL 249
               +   E H  +HGL +LEG+ +YRL   W  V+AGD +W+  F PQ   A G    RYL
Sbjct:   202 GVIPFAETHVMEHGLYVLEGKAVYRLNQDWVEVEAGDYMWLRAFCPQACYAGGPGPFRYL 261

Query:   250 LYKDVNRN 257
             LYKDVNR+
Sbjct:   262 LYKDVNRH 269


>UNIPROTKB|Q4KA30
            symbol:PFL_3803 "Uncharacterized protein" species:220664
            "Pseudomonas protegens Pf-5" [GO:0003674 "molecular_function"
            evidence=ND] [GO:0005575 "cellular_component" evidence=ND]
            [GO:0008150 "biological_process" evidence=ND] Gene3D:2.60.120.10
            InterPro:IPR014710 InterPro:IPR011051 SUPFAM:SSF51182 EMBL:CP000076
            GenomeReviews:CP000076_GR InterPro:IPR013096 Pfam:PF07883
            eggNOG:COG3257 HOGENOM:HOG000219622 KO:K14977 OMA:KDVNRHA
            ProtClustDB:PRK11171 InterPro:IPR017627 TIGRFAMs:TIGR03214
            RefSeq:YP_260903.1 ProteinModelPortal:Q4KA30 SMR:Q4KA30
            STRING:Q4KA30 GeneID:3476900 KEGG:pfl:PFL_3803 PATRIC:19876991
            BioCyc:PFLU220664:GIX8-3835-MONOMER Uniprot:Q4KA30
        Length = 278

 Score = 274 (101.5 bits), Expect = 6.8e-24, P = 6.8e-24
 Identities = 73/228 (32%), Positives = 106/228 (46%)

Query:    39 VLSPLPEWTNTLGAYLITPAMG--SHFVMYLANMQENARSALPPHD--VERFIFVVQGSA 94
             V S LP W N     +  P  G    F  Y+  +  N  S  P  D   E  +FVV+G  
Sbjct:    42 VTSHLPHWDNMRMWVIARPLSGFAETFSQYIVEVGANGGSDKPEQDPNAEAVLFVVEGEV 101

Query:    95 MLTNASGVSSKLMVDSYTYLPPNFAHSLR-AEGS-ATLVVFERRYASLEN--HITEQIVG 150
              LT   G    L    Y ++PP     LR   G+ A      + Y  ++   +    +  
Sbjct:   102 NLT-LQGQVHVLKPGGYAFIPPAADWKLRNTSGTEARFHWIRKHYQKVDGVPYPDAFVTN 160

Query:   151 STDKQPLL--ETPGEVFQLRKLLPQAVPFDFNIHIMDFQPGDFLNVKEVHYNQHGLLLLE 208
               D +P +  +T G     R +    +  D +++I++F+PG  +   E H  +HGL +LE
Sbjct:   161 EQDIEPRVMPDTEGRWSTTRFVDMSDMRHDMHVNIVNFEPGGVIPFAETHVMEHGLYVLE 220

Query:   209 GQGIYRLGDSWYPVQAGDVLWMAPFVPQWYAALGKTRTRYLLYKDVNR 256
             G+ +YRL   W  V+AGD +W+  F PQ   + G  R RYLLYKDVNR
Sbjct:   221 GKAVYRLNQDWVEVEAGDFMWLRAFCPQACYSGGPGRFRYLLYKDVNR 268


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.321   0.136   0.423    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      259       259   0.00088  114 3  11 22  0.45    33
                                                     32  0.49    36
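The Lambda and K values in the parameter table connect the raw scores to the bit scores printed with each alignment. As a hedged illustration, the Python sketch below applies the usual Karlin-Altschul normalization, bits = (Lambda * S - ln K) / ln 2, using the Q=9,R=2 row (Lambda = 0.244, K = 0.0300); with those values it reproduces the bit scores shown for all five hits to one decimal place. It also shows the standard Poisson relation P = 1 - exp(-E), which explains why P and Expect are indistinguishable at the very small values on this page. The function names are hypothetical, and the table values are rounded, so agreement beyond the printed precision should not be expected.

```python
import math

# "As Used" parameters from the Q=9,R=2 row of the table (rounded as printed).
LAMBDA_USED = 0.244
K_USED = 0.0300


def bit_score(raw_score: int, lam: float = LAMBDA_USED, k: float = K_USED) -> float:
    """Normalize a raw alignment score to bits."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def p_from_e(expect: float) -> float:
    """P = 1 - exp(-E), computed with expm1 so tiny E values do not underflow to 0."""
    return -math.expm1(-expect)


for raw in (1155, 300, 283, 274):
    print(raw, round(bit_score(raw), 1))

print(p_from_e(3.0e-117))
```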


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  5
  No. of states in DFA:  596 (63 KB)
  Total size of DFA:  207 KB (2116 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  22.11u 0.09s 22.20t   Elapsed:  00:00:01
  Total cpu time:  22.11u 0.09s 22.20t   Elapsed:  00:00:01
  Start:  Sat May 11 11:17:39 2013   End:  Sat May 11 11:17:40 2013
