BLAST Results

Query Summary

Your job contains 1 sequence.

Parameters
Threshold: 0.001
Maximum number of alignments shown: 100
BLAST filter: on

Query Sequence

>019973
MIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLA
LLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHP
AWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKG
HLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDI
WNLKYPFLPLAWIGMNAMLVYVMAAEGIFAGFINGWYYGDPHNTLPYWIKKHAFLGVWRS
RKVSTILYVIFVEILFWGLVTGILHRFGIYWKL

High Scoring Gene Products

Symbol, full name Information P value
AT5G27730 protein from Arabidopsis thaliana 4.1e-120
AT5G47900 protein from Arabidopsis thaliana 1.0e-52
HGSNAT
Uncharacterized protein
protein from Sus scrofa 1.8e-15
HGSNAT
Uncharacterized protein
protein from Bos taurus 3.6e-13
Hgsnat
heparan-alpha-glucosaminide N-acetyltransferase
protein from Mus musculus 8.7e-12
HGSNAT
Heparan-alpha-glucosaminide N-acetyltransferase
protein from Homo sapiens 2.7e-11
DDB_G0270192
DUF1624 family protein
gene from Dictyostelium discoideum 1.2e-07
CPS_0413
Putative membrane protein
protein from Colwellia psychrerythraea 34H 3.7e-05
CPS_0413
putative membrane protein
protein from Colwellia psychrerythraea 34H 3.7e-05
nagX
Uncharacterized protein
protein from Shewanella oneidensis MR-1 0.00017
SO_3504
conserved hypothetical protein
protein from Shewanella oneidensis MR-1 0.00017
DDB_G0286315
transmembrane protein
gene from Dictyostelium discoideum 0.00038

The BLAST search returned 1 gene product which did not match your query constraints. Please see the full BLAST report below for the details.

Back to top

Raw Blast Data

BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]

Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.

Reference:  Gish, W. (1996-2006) http://blast.wustl.edu

Query=  019973
        (333 letters)

Database:  go_20130330-seqdb.fasta
           368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done

                                                                     Smallest
                                                                       Sum
                                                              High  Probability
Sequences producing High-scoring Segment Pairs:              Score  P(N)      N

TAIR|locus:2180305 - symbol:AT5G27730 "AT5G27730" species...  1182  4.1e-120  1
TAIR|locus:2160902 - symbol:AT5G47900 "AT5G47900" species...   546  1.0e-52   1
UNIPROTKB|F1SE48 - symbol:HGSNAT "Uncharacterized protein...   150  1.8e-15   2
UNIPROTKB|F1MF45 - symbol:HGSNAT "Uncharacterized protein...   147  3.6e-13   2
MGI|MGI:1196297 - symbol:Hgsnat "heparan-alpha-glucosamin...   131  8.7e-12   2
UNIPROTKB|Q68CP4 - symbol:HGSNAT "Heparan-alpha-glucosami...   125  2.7e-11   2
UNIPROTKB|F1NBK1 - symbol:HGSNAT "Uncharacterized protein...   110  8.7e-10   3
DICTYBASE|DDB_G0270192 - symbol:DDB_G0270192 "DUF1624 fam...   106  1.2e-07   3
UNIPROTKB|Q489U3 - symbol:CPS_0413 "Putative membrane pro...   108  3.7e-05   3
TIGR_CMR|CPS_0413 - symbol:CPS_0413 "putative membrane pr...   108  3.7e-05   3
UNIPROTKB|Q8EBK9 - symbol:nagX "Uncharacterized protein" ...   119  0.00017   1
TIGR_CMR|SO_3504 - symbol:SO_3504 "conserved hypothetical...   119  0.00017   1
DICTYBASE|DDB_G0286315 - symbol:DDB_G0286315 "transmembra...   102  0.00038   2


>TAIR|locus:2180305 [details] [associations]
            symbol:AT5G27730 "AT5G27730" species:3702 "Arabidopsis
            thaliana" [GO:0003674 "molecular_function" evidence=ND] [GO:0005575
            "cellular_component" evidence=ND] [GO:0008150 "biological_process"
            evidence=ND] [GO:0016556 "mRNA modification" evidence=RCA]
            EMBL:CP002688 GenomeReviews:BA000015_GR GO:GO:0016740
            eggNOG:COG4299 KO:K10532 InterPro:IPR012429 Pfam:PF07786
            OMA:IRIYGIL HOGENOM:HOG000243739 ProtClustDB:CLSN2689879
            EMBL:AY034969 EMBL:BT002370 IPI:IPI00535083 RefSeq:NP_568500.1
            UniGene:At.19161 STRING:Q94CC1 PRIDE:Q94CC1
            EnsemblPlants:AT5G27730.1 GeneID:832835 KEGG:ath:AT5G27730
            TAIR:At5g27730 InParanoid:Q94CC1 PhylomeDB:Q94CC1
            ArrayExpress:Q94CC1 Genevestigator:Q94CC1 Uniprot:Q94CC1
        Length = 472

 Score = 1182 (421.1 bits), Expect = 4.1e-120, P = 4.1e-120
 Identities = 208/333 (62%), Positives = 246/333 (73%)

Query:     1 MIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMXXXXXXXXXX 60
             M+R CG+LQRIALSYL+V+LVEIFTKD  +++ S GRFSIF+ Y WHW++          
Sbjct:   140 MMRFCGILQRIALSYLVVALVEIFTKDSHEENLSTGRFSIFKSYYWHWIVAASVLVIYLA 199

Query:    61 XXXXXXXPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHP 120
                    PDW+F + +KDS  YGK+ +V+CGVR KLNPPCNAVGY+DR+VLGINHMYHHP
Sbjct:   200 TLYGTYVPDWEFVVYDKDSVLYGKILSVSCGVRGKLNPPCNAVGYVDRQVLGINHMYHHP 259

Query:   121 AWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGXXXXXXXXXXXXXXXXXXXXXXXTKG 180
             AWRRSKACT DSP+EG +R+DAPSWC APFEPEG                        KG
Sbjct:   260 AWRRSKACTDDSPYEGAIRQDAPSWCRAPFEPEGILSSISAILSTIIGVHFGHIILHLKG 319

Query:   181 HLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDI 240
             H ARLK W++ G  LL  GLTLHFT+ +PLNKQLY+ SY+CVTSGAAALVFS++Y+LVDI
Sbjct:   320 HSARLKHWISTGLVLLALGLTLHFTHLMPLNKQLYSFSYICVTSGAAALVFSSLYSLVDI 379

Query:   241 WNLKYPFLPLAWIGMNAMLVYVMAAEGIFAGFINGWYYGDPHNTLPYWIKKHAFLGVWRS 300
                K+ FLPL WIGMNAMLVYVM AEGI A F NGWYY  PHNTL  WI++H F+ VW S
Sbjct:   380 LEWKHMFLPLKWIGMNAMLVYVMGAEGILAAFFNGWYYRHPHNTLINWIREHVFIRVWHS 439

Query:   301 RKVSTILYVIFVEILFWGLVTGILHRFGIYWKL 333
             R+V  ++YVIF EILFWGLVTG+ HRF IYWKL
Sbjct:   440 RRVGVLMYVIFAEILFWGLVTGVFHRFKIYWKL 472


>TAIR|locus:2160902 [details] [associations]
            symbol:AT5G47900 "AT5G47900" species:3702 "Arabidopsis
            thaliana" [GO:0003674 "molecular_function" evidence=ND] [GO:0005634
            "nucleus" evidence=ISM] [GO:0005739 "mitochondrion" evidence=ISM]
            [GO:0008150 "biological_process" evidence=ND] EMBL:CP002688
            GenomeReviews:BA000015_GR EMBL:AB016886 InterPro:IPR012429
            Pfam:PF07786 IPI:IPI00530923 RefSeq:NP_199601.2 UniGene:At.55424
            EnsemblPlants:AT5G47900.1 GeneID:834841 KEGG:ath:AT5G47900
            TAIR:At5g47900 HOGENOM:HOG000243739 OMA:WTSSYVV PhylomeDB:B3H4C1
            ProtClustDB:CLSN2689879 ArrayExpress:B3H4C1 Genevestigator:B3H4C1
            Uniprot:B3H4C1
        Length = 440

 Score = 546 (197.3 bits), Expect = 1.0e-52, P = 1.0e-52
 Identities = 113/291 (38%), Positives = 159/291 (54%)

Query:     2 IRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMXXXXXXXXXXX 61
             IRL G+LQRIA++YL+V+L EI+ K   +    +   S+ + Y +HW++           
Sbjct:   150 IRLMGILQRIAIAYLVVALCEIWLKGNHNVSSEL---SMIKKYRFHWVVAFVITTIYLSL 206

Query:    62 XXXXXXPDWQFTIINKDSADYGKVF---NVTCGVRAKLNPPCNAVGYIDRKVLGINHMYH 118
                   PDW++ I+ +D       F    V CGVR    P CNAVG +DR  LGI H+Y 
Sbjct:   207 LYGLYVPDWEYQILKEDQGSTLTTFLNLKVKCGVRGHTGPGCNAVGMLDRMFLGIQHLYR 266

Query:   119 HPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGXXXXXXXXXXXXXXXXXXXXXXXT 178
              P + R+K C+ + P  GPL  DAPSWC APF+PEG                        
Sbjct:   267 KPVYARTKQCSINYPNNGPLPPDAPSWCQAPFDPEGLLSSLMATVTCLVGLHYGHIIIHF 326

Query:   179 KGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALV 238
             K H  RL QW+   F LL+ GL L+    + LNK LYTLSY+CVTSGA+  + SAIY +V
Sbjct:   327 KDHKKRLNQWILRSFCLLMLGLALNLFG-MHLNKPLYTLSYMCVTSGASGFLLSAIYLMV 385

Query:   239 DIWNLKYPFLPLAWIGMNAMLVYVMAAEGIFAGFINGWYYGDPHNTLPYWI 289
             D++  K   L L W+G++A+ +YV+ A  +    I+G+Y+ +P N L + I
Sbjct:   386 DVYGYKRASLVLEWMGIHALPIYVLIACNLVFLIIHGFYWKNPINNLLHLI 436


>UNIPROTKB|F1SE48 [details] [associations]
            symbol:HGSNAT "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0051259 "protein oligomerization" evidence=IEA]
            [GO:0016746 "transferase activity, transferring acyl groups"
            evidence=IEA] [GO:0007041 "lysosomal transport" evidence=IEA]
            [GO:0005765 "lysosomal membrane" evidence=IEA] GO:GO:0051259
            GO:GO:0005765 GO:GO:0016746 GO:GO:0007041
            GeneTree:ENSGT00390000001491 EMBL:CU640485
            Ensembl:ENSSSCT00000007671 OMA:HEVFEEY Uniprot:F1SE48
        Length = 298

 Score = 150 (57.9 bits), Expect = 1.8e-15, Sum P(2) = 1.8e-15
 Identities = 45/143 (31%), Positives = 69/143 (48%)

Query:   178 TKGHLARLKQWVTMGFALLIFGLTLHFTNA--IPLNKQLYTLSYVCVTSGAAALVFSAIY 235
             TKG L R   W      L+   LT    N   IP+NK L++ SYV   S +A L+   +Y
Sbjct:   163 TKGILIRFAVWGCF-LGLISVALTKASENEGFIPVNKNLWSTSYVTTLSSSAFLILLVLY 221

Query:   236 ALVDIWNLKYPFLPLAWIGMNAMLVYVMAAEGIFAGFIN-GWYYGDPHNTLPYWIKKHAF 294
              +VD+  L +   P  + GMN++LVY M  E +FA +    W  GD  +   + ++    
Sbjct:   222 PIVDVKGL-WTGTPFFYPGMNSILVY-MGHE-VFANYFPFQWRLGDSQSHREHLVQNIVA 278

Query:   295 LGVWRSRKVSTILYVIFVEILFW 317
               +W       I YV++ + +FW
Sbjct:   279 TALW-----VLIAYVLYKKNVFW 296

 Score = 103 (41.3 bits), Expect = 1.8e-15, Sum P(2) = 1.8e-15
 Identities = 37/124 (29%), Positives = 54/124 (43%)

Query:     3 RLCGVLQRIALSYLLVSLVEI-FTKDVQDKDQSVGR-FSIFRLYC-W-HWLMXXXXXXXX 58
             R+ GVLQR+ ++Y +V+++E+ F K V +   S    FS+  +   W  WL         
Sbjct:     7 RIPGVLQRLGVTYFVVAVLELLFAKPVPESCASERSCFSLLDVTSSWPQWLFVLVLEGVW 66

Query:    59 XXXXXXXXXPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMY 117
                      P      +      D GK  N T G          A GYIDR +LG +H+Y
Sbjct:    67 LALTFFLPVPGCPTGYLGPGGIGDLGKYPNCTGG----------AAGYIDRLLLGDDHLY 116

Query:   118 HHPA 121
              HP+
Sbjct:   117 QHPS 120

 Score = 51 (23.0 bits), Expect = 0.00023, Sum P(2) = 0.00023
 Identities = 19/102 (18%), Positives = 46/102 (45%)

Query:   239 DIWNLKYPFLPLAWIGMNAMLVY-VMAAEGIFAGFINGWYYGDPHNTLPYW---IKKHAF 294
             ++W+  Y     +   +  +++Y ++  +G++ G    ++Y   ++ L Y    +  + F
Sbjct:   199 NLWSTSYVTTLSSSAFLILLVLYPIVDVKGLWTG--TPFFYPGMNSILVYMGHEVFANYF 256

Query:   295 LGVWR---SRKVSTILYVIFVEILFWGLVTGILHRFGIYWKL 333
                WR   S+     L    V    W L+  +L++  ++WK+
Sbjct:   257 PFQWRLGDSQSHREHLVQNIVATALWVLIAYVLYKKNVFWKI 298


>UNIPROTKB|F1MF45 [details] [associations]
            symbol:HGSNAT "Uncharacterized protein" species:9913 "Bos
            taurus" [GO:0051259 "protein oligomerization" evidence=IEA]
            [GO:0016746 "transferase activity, transferring acyl groups"
            evidence=IEA] [GO:0007041 "lysosomal transport" evidence=IEA]
            [GO:0005765 "lysosomal membrane" evidence=IEA] GO:GO:0051259
            GO:GO:0005765 GO:GO:0016746 GO:GO:0007041 InterPro:IPR012429
            Pfam:PF07786 GeneTree:ENSGT00390000001491 OMA:KHSSWNG
            EMBL:DAAA02060966 IPI:IPI01001394 Ensembl:ENSBTAT00000039742
            Uniprot:F1MF45
        Length = 592

 Score = 147 (56.8 bits), Expect = 3.6e-13, Sum P(2) = 3.6e-13
 Identities = 42/143 (29%), Positives = 69/143 (48%)

Query:   178 TKGHLARLKQWVTMGFALLIFGLTLHFTNA--IPLNKQLYTLSYVCVTSGAAALVFSAIY 235
             T+G L R   W  +   L+   LT    N   IP+NK L+++SYV   S  A L+  A+Y
Sbjct:   457 TRGILIRFAAWGCL-LGLVSVALTKASENEGFIPVNKNLWSISYVTTLSSLAFLILLALY 515

Query:   236 ALVDIWNLKYPFLPLAWIGMNAMLVYVMAAEGIFAGFIN-GWYYGDPHNTLPYWIKKHAF 294
              +VD+  L +   P  + GMN++LVYV     +FA +    W  GD  +   + ++    
Sbjct:   516 PVVDVKGL-WTGAPFFYPGMNSILVYV--GHEVFANYFPFQWKLGDQQSHKEHLVQNMVA 572

Query:   295 LGVWRSRKVSTILYVIFVEILFW 317
               +W       I + ++ + +FW
Sbjct:   573 TALW-----VLIAFALYKKKVFW 590

 Score = 96 (38.9 bits), Expect = 3.6e-13, Sum P(2) = 3.6e-13
 Identities = 35/124 (28%), Positives = 52/124 (41%)

Query:     3 RLCGVLQRIALSYLLVSLVEI-FTKDVQDKDQSVGR-FSIFRLYC-W-HWLMXXXXXXXX 58
             R+ GVLQR+  +Y +V+++E+ F K V +   S    FS+  +   W  WL         
Sbjct:   301 RIPGVLQRLGATYFVVAVLELLFAKPVPETCASERSCFSLLDITASWPQWLFVLILEGVW 360

Query:    59 XXXXXXXXXPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMY 117
                      P      +      D G+  N T G          A GY+DR +LG  H+Y
Sbjct:   361 LALTFFLPVPGCPTGYLGPGGIGDGGRYRNCTGG----------AAGYVDRLLLGDQHLY 410

Query:   118 HHPA 121
              HP+
Sbjct:   411 QHPS 414


>MGI|MGI:1196297 [details] [associations]
            symbol:Hgsnat "heparan-alpha-glucosaminide
            N-acetyltransferase" species:10090 "Mus musculus" [GO:0005764
            "lysosome" evidence=IEA] [GO:0005765 "lysosomal membrane"
            evidence=ISO] [GO:0007041 "lysosomal transport" evidence=ISO]
            [GO:0008152 "metabolic process" evidence=ISO] [GO:0015019
            "heparan-alpha-glucosaminide N-acetyltransferase activity"
            evidence=IEA] [GO:0016020 "membrane" evidence=IEA] [GO:0016021
            "integral to membrane" evidence=IEA] [GO:0016740 "transferase
            activity" evidence=IEA] [GO:0016746 "transferase activity,
            transferring acyl groups" evidence=ISO] [GO:0051259 "protein
            oligomerization" evidence=ISO] MGI:MGI:1196297 GO:GO:0051259
            GO:GO:0016021 GO:GO:0005765 GO:GO:0016746 GO:GO:0007041 CTD:138050
            eggNOG:COG4299 HOGENOM:HOG000006803 HOVERGEN:HBG081599 KO:K10532
            OrthoDB:EOG4548Z7 ChiTaRS:HGSNAT GO:GO:0015019 InterPro:IPR012429
            Pfam:PF07786 EMBL:AK035264 EMBL:AK149883 EMBL:AK152926
            EMBL:AK159649 EMBL:AK160068 EMBL:AC093366 EMBL:BC024084
            IPI:IPI00317488 IPI:IPI00975056 RefSeq:NP_084160.1 UniGene:Mm.28326
            ProteinModelPortal:Q3UDW8 STRING:Q3UDW8 PhosphoSite:Q3UDW8
            PaxDb:Q3UDW8 PRIDE:Q3UDW8 Ensembl:ENSMUST00000037609 GeneID:52120
            KEGG:mmu:52120 UCSC:uc009lhg.1 GeneTree:ENSGT00390000001491
            InParanoid:Q3UDW8 OMA:KHSSWNG NextBio:308520 Bgee:Q3UDW8
            CleanEx:MM_HGSNAT Genevestigator:Q3UDW8 Uniprot:Q3UDW8
        Length = 656

 Score = 131 (51.2 bits), Expect = 8.7e-12, Sum P(2) = 8.7e-12
 Identities = 43/142 (30%), Positives = 65/142 (45%)

Query:   178 TKGHLARLKQWVTMGFALLIFGLTLHFTNA--IPLNKQLYTLSYVCVTSGAAALVFSAIY 235
             TK  L R   W  +   L+   LT    N   IP+NK L+++SYV   S  A  +   +Y
Sbjct:   521 TKAILTRFAAWCCI-LGLISIVLTKVSANEGFIPINKNLWSISYVTTLSCFAFFILLILY 579

Query:   236 ALVDIWNLKYPFLPLAWIGMNAMLVYVMAAEGIFAGFINGWYYGDPHNTLPYWIKKHAFL 295
              +VD+  L +   P  + GMN++LVYV   E +   F   W   D  +   + I+     
Sbjct:   580 PVVDVKGL-WTGTPFFYPGMNSILVYV-GHEVLENYFPFQWKLADEQSHKEHLIQNIVAT 637

Query:   296 GVWRSRKVSTILYVIFVEILFW 317
              +W       I YV++ + LFW
Sbjct:   638 ALW-----VLIAYVLYKKKLFW 654

 Score = 102 (41.0 bits), Expect = 8.7e-12, Sum P(2) = 8.7e-12
 Identities = 37/125 (29%), Positives = 53/125 (42%)

Query:     2 IRLCGVLQRIALSYLLVSLVEIFT-KDVQDK-DQSVGRFSIFRLYC-W-HWLMXXXXXXX 57
             +R+ GVLQR+ ++Y +V+++E F  K V D        FS+  +   W  WL        
Sbjct:   364 VRIPGVLQRLGVTYFVVAVLEFFFWKPVPDSCTLESSCFSLRDITSSWPQWLTILTLESI 423

Query:    58 XXXXXXXXXXPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHM 116
                       P      +      D GK  + T G          A GYIDR +LG NH+
Sbjct:   424 WLALTFFLPVPGCPTGYLGPGGIGDLGKYPHCTGG----------AAGYIDRLLLGDNHL 473

Query:   117 YHHPA 121
             Y HP+
Sbjct:   474 YQHPS 478


>UNIPROTKB|Q68CP4 [details] [associations]
            symbol:HGSNAT "Heparan-alpha-glucosaminide
            N-acetyltransferase" species:9606 "Homo sapiens" [GO:0016021
            "integral to membrane" evidence=IEA] [GO:0015019
            "heparan-alpha-glucosaminide N-acetyltransferase activity"
            evidence=IEA] [GO:0051259 "protein oligomerization" evidence=IDA]
            [GO:0005765 "lysosomal membrane" evidence=IDA;TAS] [GO:0007041
            "lysosomal transport" evidence=IDA] [GO:0016746 "transferase
            activity, transferring acyl groups" evidence=IDA] [GO:0005975
            "carbohydrate metabolic process" evidence=TAS] [GO:0006027
            "glycosaminoglycan catabolic process" evidence=TAS] [GO:0030203
            "glycosaminoglycan metabolic process" evidence=TAS] [GO:0044281
            "small molecule metabolic process" evidence=TAS]
            Reactome:REACT_111217 GO:GO:0051259 GO:GO:0016021
            Reactome:REACT_116125 GO:GO:0005765 GO:GO:0044281 GO:GO:0005975
            GO:GO:0016746 GO:GO:0006027 GO:GO:0007041 EMBL:AC113191
            EMBL:AK304441 EMBL:CR749838 IPI:IPI00739149 IPI:IPI00908672
            RefSeq:NP_689632.2 UniGene:Hs.600384 ProteinModelPortal:Q68CP4
            IntAct:Q68CP4 STRING:Q68CP4 PhosphoSite:Q68CP4 DMDM:124007195
            PaxDb:Q68CP4 PRIDE:Q68CP4 Ensembl:ENST00000379644
            Ensembl:ENST00000458501 GeneID:138050 KEGG:hsa:138050
            UCSC:uc003xpx.4 CTD:138050 GeneCards:GC08P042995 H-InvDB:HIX0007487
            HGNC:HGNC:26527 HPA:HPA029578 MIM:252930 MIM:610453
            neXtProt:NX_Q68CP4 Orphanet:79271 PharmGKB:PA162390851
            eggNOG:COG4299 HOGENOM:HOG000006803 HOVERGEN:HBG081599
            InParanoid:Q68CP4 KO:K10532 OrthoDB:EOG4548Z7 BRENDA:2.3.1.78
            SABIO-RK:Q68CP4 ChiTaRS:HGSNAT GenomeRNAi:138050 NextBio:83735
            ArrayExpress:Q68CP4 Bgee:Q68CP4 CleanEx:HS_HGSNAT
            Genevestigator:Q68CP4 GO:GO:0015019 InterPro:IPR012429 Pfam:PF07786
            Uniprot:Q68CP4
        Length = 663

 Score = 125 (49.1 bits), Expect = 2.7e-11, Sum P(2) = 2.7e-11
 Identities = 40/143 (27%), Positives = 64/143 (44%)

Query:   178 TKGHLARLKQWVTMGFALLIFGLTLHFTNA--IPLNKQLYTLSYVCVTSGAAALVFSAIY 235
             TK  L R   W  +   L+   LT    N   IP+NK L++LSYV   S  A  +   +Y
Sbjct:   528 TKDILIRFTAWCCI-LGLISVALTKVSENEGFIPVNKNLWSLSYVTTLSSFAFFILLVLY 586

Query:   236 ALVDIWNLKYPFLPLAWIGMNAMLVYVMAAEGIFAGFIN-GWYYGDPHNTLPYWIKKHAF 294
              +VD+  L +   P  + GMN++LVYV     +F  +    W   D  +   +  +    
Sbjct:   587 PVVDVKGL-WTGTPFFYPGMNSILVYV--GHEVFENYFPFQWKLKDNQSHKEHLTQNIVA 643

Query:   295 LGVWRSRKVSTILYVIFVEILFW 317
               +W       I Y+++ + +FW
Sbjct:   644 TALW-----VLIAYILYRKKIFW 661

 Score = 104 (41.7 bits), Expect = 2.7e-11, Sum P(2) = 2.7e-11
 Identities = 36/125 (28%), Positives = 54/125 (43%)

Query:     2 IRLCGVLQRIALSYLLVSLVEI-FTKDVQDKDQSVGRFSIFR--LYCW-HWLMXXXXXXX 57
             +R+ GVLQR+ ++Y +V+++E+ F K V +   S       R     W  WL+       
Sbjct:   371 VRIPGVLQRLGVTYFVVAVLELLFAKPVPEHCASERSCLSLRDITSSWPQWLLILVLEGL 430

Query:    58 XXXXXXXXXXPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHM 116
                       P      +      D+GK  N T G          A GYIDR +LG +H+
Sbjct:   431 WLGLTFLLPVPGCPTGYLGPGGIGDFGKYPNCTGG----------AAGYIDRLLLGDDHL 480

Query:   117 YHHPA 121
             Y HP+
Sbjct:   481 YQHPS 485

 Score = 56 (24.8 bits), Expect = 0.00056, Sum P(2) = 0.00056
 Identities = 22/102 (21%), Positives = 48/102 (47%)

Query:   239 DIWNLKYPFLPLAWIGMNAMLVY-VMAAEGIFAGFINGWYYGDPHNTLPYW---IKKHAF 294
             ++W+L Y     ++     +++Y V+  +G++ G    ++Y   ++ L Y    + ++ F
Sbjct:   564 NLWSLSYVTTLSSFAFFILLVLYPVVDVKGLWTG--TPFFYPGMNSILVYVGHEVFENYF 621

Query:   295 LGVWR---SRKVSTILYVIFVEILFWGLVTGILHRFGIYWKL 333
                W+   ++     L    V    W L+  IL+R  I+WK+
Sbjct:   622 PFQWKLKDNQSHKEHLTQNIVATALWVLIAYILYRKKIFWKI 663


>UNIPROTKB|F1NBK1 [details] [associations]
            symbol:HGSNAT "Uncharacterized protein" species:9031
            "Gallus gallus" [GO:0005765 "lysosomal membrane" evidence=IEA]
            [GO:0007041 "lysosomal transport" evidence=IEA] [GO:0016746
            "transferase activity, transferring acyl groups" evidence=IEA]
            [GO:0051259 "protein oligomerization" evidence=IEA] GO:GO:0051259
            GO:GO:0005765 GO:GO:0016746 GO:GO:0007041 InterPro:IPR012429
            Pfam:PF07786 GeneTree:ENSGT00390000001491 OMA:KHSSWNG
            EMBL:AADN02016166 EMBL:AADN02016165 IPI:IPI00595110
            Ensembl:ENSGALT00000016483 Uniprot:F1NBK1
        Length = 584

 Score = 110 (43.8 bits), Expect = 8.7e-10, Sum P(3) = 8.7e-10
 Identities = 36/142 (25%), Positives = 64/142 (45%)

Query:   178 TKGHLARLKQWVTM-GFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYA 236
             ++G L    +WV++ G    I          IP+NK L++ SYV   S  A ++   +Y 
Sbjct:   449 SEGILPHSLRWVSVQGIIFAILTKCSKEEGFIPINKNLWSTSYVTTMSCFAFILLLLMYY 508

Query:   237 LVDIWNLKYPFLPLAWIGMNAMLVYVMAAEGIFAGFIN-GWYYGDPHNTLPYWIKKHAFL 295
             LVD+  L +   P  + GMN++LVY+     +F  +    W   D  +   +  +     
Sbjct:   509 LVDVKRL-WSGTPFFYPGMNSILVYI--GHEVFENYFPFKWKMQDSQSHAEHLTQNLTAT 565

Query:   296 GVWRSRKVSTILYVIFVEILFW 317
              +W       I Y+++ + +FW
Sbjct:   566 TLW-----VIISYLLYRKKIFW 582

 Score = 76 (31.8 bits), Expect = 8.7e-10, Sum P(3) = 8.7e-10
 Identities = 15/26 (57%), Positives = 17/26 (65%)

Query:    96 LNPPCNAVGYIDRKVLGINHMYHHPA 121
             LN    A GYIDR VLG  H+Y HP+
Sbjct:   380 LNCTGGAAGYIDRLVLGEKHIYQHPS 405

 Score = 66 (28.3 bits), Expect = 8.7e-10, Sum P(3) = 8.7e-10
 Identities = 14/30 (46%), Positives = 23/30 (76%)

Query:     2 IRLCGVLQRIALSYLLVSLVEI-FTKDVQD 30
             +R+ GVLQR+ L+YL+V+ +E+ FT+   D
Sbjct:   291 LRIPGVLQRLGLTYLVVAALELLFTRTGAD 320


>DICTYBASE|DDB_G0270192 [details] [associations]
            symbol:DDB_G0270192 "DUF1624 family protein"
            species:44689 "Dictyostelium discoideum" [GO:0008150
            "biological_process" evidence=ND] [GO:0003674 "molecular_function"
            evidence=ND] [GO:0044351 "macropinocytosis" evidence=RCA]
            dictyBase:DDB_G0270192 EMBL:AAFI02000005 eggNOG:COG4299
            InterPro:IPR012429 Pfam:PF07786 RefSeq:XP_646608.1 STRING:Q55C73
            EnsemblProtists:DDB0190869 GeneID:8617580 KEGG:ddi:DDB_G0270192
            InParanoid:Q55C73 OMA:IRIYGIL Uniprot:Q55C73
        Length = 426

 Score = 106 (42.4 bits), Expect = 1.2e-07, Sum P(3) = 1.2e-07
 Identities = 37/166 (22%), Positives = 81/166 (48%)

Query:   180 GHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 239
             G+   + +W+ +    ++  ++L  T  +P NK++++ S+   T GA+  +    + L+D
Sbjct:   266 GNTDIIVRWILLVILFMVPAISLGAT-VMPFNKKIWSFSFALFTVGASGSLILIAFILID 324

Query:   240 I--W-NLKYPFL---------PLAWIGMNAMLVYVMAAEGIFAGFINGWYYGDPHNTLPY 287
             +  W +LK   +         P+ WIG N + +Y +    +F   I  +Y     N+L  
Sbjct:   325 VIDWESLKCEKVRKIIDLIIKPMKWIGQNPITIYSLM---VFIEIILMYYINVGSNSLWV 381

Query:   288 WIKKHAFLGVWRSRKVSTILYVIFVEILFWGLVTGILHRFGIYWKL 333
              I +  +L   ++  +++ ++ I   ++F+ L+  I+ R  I+ KL
Sbjct:   382 QIYEKMYLSWLKNGYLASTVFSIGW-LIFFILIAYIMQRNKIFIKL 426

 Score = 73 (30.8 bits), Expect = 1.2e-07, Sum P(3) = 1.2e-07
 Identities = 16/27 (59%), Positives = 17/27 (62%)

Query:    90 CGVRAKLNPPCNAVGYIDRKVLGINHM 116
             CG RA L   CNA  YID KV G+N M
Sbjct:   195 CG-RANLTQNCNAGAYIDSKVFGLNIM 220

 Score = 47 (21.6 bits), Expect = 1.2e-07, Sum P(3) = 1.2e-07
 Identities = 10/18 (55%), Positives = 12/18 (66%)

Query:     3 RLCGVLQRIALSYLLVSL 20
             R+ GVLQRIA+ Y    L
Sbjct:   145 RIMGVLQRIAICYFFSCL 162


>UNIPROTKB|Q489U3 [details] [associations]
            symbol:CPS_0413 "Putative membrane protein" species:167879
            "Colwellia psychrerythraea 34H" [GO:0003674 "molecular_function"
            evidence=ND] [GO:0008150 "biological_process" evidence=ND]
            EMBL:CP000083 GenomeReviews:CP000083_GR eggNOG:COG4299
            InterPro:IPR012429 Pfam:PF07786 OMA:IRIYGIL RefSeq:YP_267171.1
            STRING:Q489U3 DNASU:3518441 GeneID:3518441 KEGG:cps:CPS_0413
            PATRIC:21464187 HOGENOM:HOG000295496
            BioCyc:CPSY167879:GI48-508-MONOMER Uniprot:Q489U3
        Length = 358

 Score = 108 (43.1 bits), Expect = 3.7e-05, Sum P(3) = 3.7e-05
 Identities = 42/145 (28%), Positives = 75/145 (51%)

Query:   189 VTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFL 248
             + +GF  L +GL L      P+NK L+T SYV  ++G A L+ +A   L+DI        
Sbjct:   228 LAVGFGAL-WGLVL------PINKSLWTPSYVIYSTGFACLLLAAFIWLIDIMKQVKLAE 280

Query:   249 PLAWIGMNAMLVYVMAAEGIFAGFINGWYYGDPHNTLPYWIKKHAFLGVWRSRKVSTILY 308
             PL   G N + VYV++   +   ++N    GD   ++  W+ K    GV+ + K+++ ++
Sbjct:   281 PLLVYGTNPLFVYVLSFL-VVTMYLN-INVGDV--SMYAWLYKQ-LSGVF-TPKLASFIF 334

Query:   309 VIFVEILFWGLVTGILHRFGIYWKL 333
               F  + F+  V+  L++  I+ K+
Sbjct:   335 A-FSHVAFFWYVSLKLYQRKIFIKI 358

 Score = 45 (20.9 bits), Expect = 3.7e-05, Sum P(3) = 3.7e-05
 Identities = 8/17 (47%), Positives = 10/17 (58%)

Query:   101 NAVGYIDRKVLGINHMY 117
             N +  +D  V G NHMY
Sbjct:   161 NIIRQLDLAVFGANHMY 177

 Score = 44 (20.5 bits), Expect = 3.7e-05, Sum P(3) = 3.7e-05
 Identities = 7/17 (41%), Positives = 13/17 (76%)

Query:     3 RLCGVLQRIALSYLLVS 19
             R+ G+LQRI ++Y + +
Sbjct:   106 RIMGILQRIGIAYTVAA 122


>TIGR_CMR|CPS_0413 [details] [associations]
            symbol:CPS_0413 "putative membrane protein" species:167879
            "Colwellia psychrerythraea 34H" [GO:0003674 "molecular_function"
            evidence=ND] [GO:0008150 "biological_process" evidence=ND]
            [GO:0016020 "membrane" evidence=ISS] EMBL:CP000083
            GenomeReviews:CP000083_GR eggNOG:COG4299 InterPro:IPR012429
            Pfam:PF07786 OMA:IRIYGIL RefSeq:YP_267171.1 STRING:Q489U3
            DNASU:3518441 GeneID:3518441 KEGG:cps:CPS_0413 PATRIC:21464187
            HOGENOM:HOG000295496 BioCyc:CPSY167879:GI48-508-MONOMER
            Uniprot:Q489U3
        Length = 358

 Score = 108 (43.1 bits), Expect = 3.7e-05, Sum P(3) = 3.7e-05
 Identities = 42/145 (28%), Positives = 75/145 (51%)

Query:   189 VTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDIWNLKYPFL 248
             + +GF  L +GL L      P+NK L+T SYV  ++G A L+ +A   L+DI        
Sbjct:   228 LAVGFGAL-WGLVL------PINKSLWTPSYVIYSTGFACLLLAAFIWLIDIMKQVKLAE 280

Query:   249 PLAWIGMNAMLVYVMAAEGIFAGFINGWYYGDPHNTLPYWIKKHAFLGVWRSRKVSTILY 308
             PL   G N + VYV++   +   ++N    GD   ++  W+ K    GV+ + K+++ ++
Sbjct:   281 PLLVYGTNPLFVYVLSFL-VVTMYLN-INVGDV--SMYAWLYKQ-LSGVF-TPKLASFIF 334

Query:   309 VIFVEILFWGLVTGILHRFGIYWKL 333
               F  + F+  V+  L++  I+ K+
Sbjct:   335 A-FSHVAFFWYVSLKLYQRKIFIKI 358

 Score = 45 (20.9 bits), Expect = 3.7e-05, Sum P(3) = 3.7e-05
 Identities = 8/17 (47%), Positives = 10/17 (58%)

Query:   101 NAVGYIDRKVLGINHMY 117
             N +  +D  V G NHMY
Sbjct:   161 NIIRQLDLAVFGANHMY 177

 Score = 44 (20.5 bits), Expect = 3.7e-05, Sum P(3) = 3.7e-05
 Identities = 7/17 (41%), Positives = 13/17 (76%)

Query:     3 RLCGVLQRIALSYLLVS 19
             R+ G+LQRI ++Y + +
Sbjct:   106 RIMGILQRIGIAYTVAA 122


>UNIPROTKB|Q8EBK9 [details] [associations]
            symbol:nagX "Uncharacterized protein" species:211586
            "Shewanella oneidensis MR-1" [GO:0003674 "molecular_function"
            evidence=ND] [GO:0005575 "cellular_component" evidence=ND]
            [GO:0008150 "biological_process" evidence=ND] EMBL:AE014299
            GenomeReviews:AE014299_GR HOGENOM:HOG000295496 RefSeq:NP_719051.1
            DNASU:1171178 GeneID:1171178 KEGG:son:SO_3504 PATRIC:23526700
            OMA:FVGHFIV ProtClustDB:CLSK907194 Uniprot:Q8EBK9
        Length = 395

 Score = 119 (46.9 bits), Expect = 0.00017, P = 0.00017
 Identities = 33/89 (37%), Positives = 51/89 (57%)

Query:   179 KGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALV 238
             KG  A++      G   L  G  L     IP+NK+L+T S+V VTSG + L+ +  YALV
Sbjct:   259 KGEWAKVGLLGAAGGVCLALGWLLDAV--IPVNKELWTSSFVLVTSGWSMLLLALFYALV 316

Query:   239 DIWNLKYPFLPLAW--IGMNAMLVYVMAA 265
             D+  LK+  L   +  IG NA+++Y+ ++
Sbjct:   317 DV--LKWQKLVFVFVVIGTNAIIIYLASS 343


>TIGR_CMR|SO_3504 [details] [associations]
            symbol:SO_3504 "conserved hypothetical protein"
            species:211586 "Shewanella oneidensis MR-1" [GO:0008150
            "biological_process" evidence=ND] [GO:0003674 "molecular_function"
            evidence=ND] [GO:0005575 "cellular_component" evidence=ND]
            EMBL:AE014299 GenomeReviews:AE014299_GR HOGENOM:HOG000295496
            RefSeq:NP_719051.1 DNASU:1171178 GeneID:1171178 KEGG:son:SO_3504
            PATRIC:23526700 OMA:FVGHFIV ProtClustDB:CLSK907194 Uniprot:Q8EBK9
        Length = 395

 Score = 119 (46.9 bits), Expect = 0.00017, P = 0.00017
 Identities = 33/89 (37%), Positives = 51/89 (57%)

Query:   179 KGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALV 238
             KG  A++      G   L  G  L     IP+NK+L+T S+V VTSG + L+ +  YALV
Sbjct:   259 KGEWAKVGLLGAAGGVCLALGWLLDAV--IPVNKELWTSSFVLVTSGWSMLLLALFYALV 316

Query:   239 DIWNLKYPFLPLAW--IGMNAMLVYVMAA 265
             D+  LK+  L   +  IG NA+++Y+ ++
Sbjct:   317 DV--LKWQKLVFVFVVIGTNAIIIYLASS 343


>DICTYBASE|DDB_G0286315 [details] [associations]
            symbol:DDB_G0286315 "transmembrane protein"
            species:44689 "Dictyostelium discoideum" [GO:0008150
            "biological_process" evidence=ND] [GO:0003674 "molecular_function"
            evidence=ND] [GO:0016021 "integral to membrane" evidence=IEA]
            dictyBase:DDB_G0286315 GO:GO:0016021 EMBL:AAFI02000085
            eggNOG:COG4299 KO:K10532 RefSeq:XP_637852.1
            EnsemblProtists:DDB0234045 GeneID:8625566 KEGG:ddi:DDB_G0286315
            InParanoid:Q54LX9 OMA:SITIMIF Uniprot:Q54LX9
        Length = 675

 Score = 102 (41.0 bits), Expect = 0.00038, Sum P(2) = 0.00038
 Identities = 31/120 (25%), Positives = 55/120 (45%)

Query:   150 FEPEGXXXXXXXXXXXXXXXXXXXXXXXTKGHLARLKQW-----VTMGFALLIFGLTLHF 204
             ++PEG                        K + +RL +W     V  G A  + GLT + 
Sbjct:   510 YDPEGTVGYLTSIFLCFIGVQAGRIILTYKSNRSRLIRWMVWSVVLCGIAAGLCGLTQN- 568

Query:   205 TNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDI---WNLKYPFLPLAWIGMNAMLVY 261
                +P+NK L++ S++ + +G    V + ++ L+DI   WN   PF+   ++GMN + +Y
Sbjct:   569 QGWLPVNKNLWSPSFILLMAGFGFFVLTVMFILIDIKKIWNGS-PFI---YVGMNPITIY 624

 Score = 60 (26.2 bits), Expect = 0.00038, Sum P(2) = 0.00038
 Identities = 11/22 (50%), Positives = 17/22 (77%)

Query:     3 RLCGVLQRIALSYLLVSLVEIF 24
             R+ GVLQR ++SYL+V  + +F
Sbjct:   305 RILGVLQRFSISYLVVGSIMLF 326


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.329   0.143   0.491    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      333       293   0.00091  115 3  11 22  0.42    33
                                                     33  0.44    37


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  13
  No. of states in DFA:  623 (66 KB)
  Total size of DFA:  251 KB (2132 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  20.01u 0.11s 20.12t   Elapsed:  00:00:01
  Total cpu time:  20.01u 0.11s 20.12t   Elapsed:  00:00:01
  Start:  Fri May 10 18:18:07 2013   End:  Fri May 10 18:18:08 2013

Back to top