BLAST Results

Query Summary

Your job contains 1 sequence.

Parameters
Threshold: 0.001
Maximum number of alignments shown: 100
BLAST filter: on

Query Sequence

>016713
MSEIKAETTHHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWP
EISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQ
GGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRL
YCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAV
GYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSIL
STIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVT
SGAAALVFSAIYALVDCNIFGPWT

High Scoring Gene Products

Symbol, full name Information P value
AT5G27730 protein from Arabidopsis thaliana 4.8e-133
AT5G47900 protein from Arabidopsis thaliana 2.6e-79
Hgsnat
heparan-alpha-glucosaminide N-acetyltransferase
protein from Mus musculus 3.1e-27
HGSNAT
Heparan-alpha-glucosaminide N-acetyltransferase
protein from Homo sapiens 4.2e-25
HGSNAT
Uncharacterized protein
protein from Bos taurus 1.4e-23
DDB_G0286315
transmembrane protein
gene from Dictyostelium discoideum 1.5e-21
DDB_G0270192
DUF1624 family protein
gene from Dictyostelium discoideum 4.9e-19
CPS_0413
Putative membrane protein
protein from Colwellia psychrerythraea 34H 1.2e-15
CPS_0413
putative membrane protein
protein from Colwellia psychrerythraea 34H 1.2e-15
nagX
Uncharacterized protein
protein from Shewanella oneidensis MR-1 1.4e-12
SO_3504
conserved hypothetical protein
protein from Shewanella oneidensis MR-1 1.4e-12
HGSNAT
Uncharacterized protein
protein from Sus scrofa 7.9e-09

The BLAST search returned 1 gene product which did not match your query constraints. Please see the full BLAST report below for the details.

Back to top

Raw Blast Data

BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]

Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.

Reference:  Gish, W. (1996-2006) http://blast.wustl.edu

Query=  016713
        (384 letters)

Database:  go_20130330-seqdb.fasta
           368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done

                                                                     Smallest
                                                                       Sum
                                                              High  Probability
Sequences producing High-scoring Segment Pairs:              Score  P(N)      N

TAIR|locus:2180305 - symbol:AT5G27730 "AT5G27730" species...  1304  4.8e-133  1
TAIR|locus:2160902 - symbol:AT5G47900 "AT5G47900" species...   797  2.6e-79   1
MGI|MGI:1196297 - symbol:Hgsnat "heparan-alpha-glucosamin...   253  3.1e-27   2
UNIPROTKB|Q68CP4 - symbol:HGSNAT "Heparan-alpha-glucosami...   247  4.2e-25   2
UNIPROTKB|F1NBK1 - symbol:HGSNAT "Uncharacterized protein...   223  5.8e-24   3
UNIPROTKB|F1MF45 - symbol:HGSNAT "Uncharacterized protein...   231  1.4e-23   2
DICTYBASE|DDB_G0286315 - symbol:DDB_G0286315 "transmembra...   237  1.5e-21   2
DICTYBASE|DDB_G0270192 - symbol:DDB_G0270192 "DUF1624 fam...   186  4.9e-19   3
UNIPROTKB|Q489U3 - symbol:CPS_0413 "Putative membrane pro...   172  1.2e-15   3
TIGR_CMR|CPS_0413 - symbol:CPS_0413 "putative membrane pr...   172  1.2e-15   3
UNIPROTKB|Q8EBK9 - symbol:nagX "Uncharacterized protein" ...   142  1.4e-12   2
TIGR_CMR|SO_3504 - symbol:SO_3504 "conserved hypothetical...   142  1.4e-12   2
UNIPROTKB|F1SE48 - symbol:HGSNAT "Uncharacterized protein...   103  7.9e-09   2


>TAIR|locus:2180305 [details] [associations]
            symbol:AT5G27730 "AT5G27730" species:3702 "Arabidopsis
            thaliana" [GO:0003674 "molecular_function" evidence=ND] [GO:0005575
            "cellular_component" evidence=ND] [GO:0008150 "biological_process"
            evidence=ND] [GO:0016556 "mRNA modification" evidence=RCA]
            EMBL:CP002688 GenomeReviews:BA000015_GR GO:GO:0016740
            eggNOG:COG4299 KO:K10532 InterPro:IPR012429 Pfam:PF07786
            OMA:IRIYGIL HOGENOM:HOG000243739 ProtClustDB:CLSN2689879
            EMBL:AY034969 EMBL:BT002370 IPI:IPI00535083 RefSeq:NP_568500.1
            UniGene:At.19161 STRING:Q94CC1 PRIDE:Q94CC1
            EnsemblPlants:AT5G27730.1 GeneID:832835 KEGG:ath:AT5G27730
            TAIR:At5g27730 InParanoid:Q94CC1 PhylomeDB:Q94CC1
            ArrayExpress:Q94CC1 Genevestigator:Q94CC1 Uniprot:Q94CC1
        Length = 472

 Score = 1304 (464.1 bits), Expect = 4.8e-133, P = 4.8e-133
 Identities = 241/378 (63%), Positives = 283/378 (74%)

Query:     1 MSEIKAETTHHHPLIISEPDVSDQQEKSHL--KTQRLASLDIFRGLAVALMILVDHAGGD 58
             M+EIK E +H   L+  + D S    +  L     RLASLDIFRGL VALMILVD AGGD
Sbjct:     1 MAEIKVERSHDQHLLEPKEDTSSSYTRRSLAGNRPRLASLDIFRGLTVALMILVDDAGGD 60

Query:    59 WPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGIL 118
             WP I+HAPWNGCNLADFVMPFFLFIVGV+IAL+LKRI ++ +A KKV FRT KLLFWG+L
Sbjct:    61 WPMIAHAPWNGCNLADFVMPFFLFIVGVSIALSLKRISNKFEACKKVGFRTCKLLFWGLL 120

Query:   119 LQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIF 178
             LQGGFSHAPDELTYGVDV M+R CG+LQRIALSYL+V+LVEIFTKD  +++ S GRFSIF
Sbjct:   121 LQGGFSHAPDELTYGVDVTMMRFCGILQRIALSYLVVALVEIFTKDSHEENLSTGRFSIF 180

Query:   179 RLYCWHWLMXXXXXXXXXXXXXXXXXPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCN 238
             + Y WHW++                 PDW+F + +KDS  YGK+ +V+CGVR KLNPPCN
Sbjct:   181 KSYYWHWIVAASVLVIYLATLYGTYVPDWEFVVYDKDSVLYGKILSVSCGVRGKLNPPCN 240

Query:   239 AVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGXXXXXXX 298
             AVGY+DR+VLGINHMYHHPAWRRSKACT DSP+EG +R+DAPSWC APFEPEG       
Sbjct:   241 AVGYVDRQVLGINHMYHHPAWRRSKACTDDSPYEGAIRQDAPSWCRAPFEPEGILSSISA 300

Query:   299 XXXXXXXXXXXXXXXXTKGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVC 358
                              KGH ARLK W++ G  LL  GLTLHFT+ +PLNKQLY+ SY+C
Sbjct:   301 ILSTIIGVHFGHIILHLKGHSARLKHWISTGLVLLALGLTLHFTHLMPLNKQLYSFSYIC 360

Query:   359 VTSGAAALVFSAIYALVD 376
             VTSGAAALVFS++Y+LVD
Sbjct:   361 VTSGAAALVFSSLYSLVD 378


>TAIR|locus:2160902 [details] [associations]
            symbol:AT5G47900 "AT5G47900" species:3702 "Arabidopsis
            thaliana" [GO:0003674 "molecular_function" evidence=ND] [GO:0005634
            "nucleus" evidence=ISM] [GO:0005739 "mitochondrion" evidence=ISM]
            [GO:0008150 "biological_process" evidence=ND] EMBL:CP002688
            GenomeReviews:BA000015_GR EMBL:AB016886 InterPro:IPR012429
            Pfam:PF07786 IPI:IPI00530923 RefSeq:NP_199601.2 UniGene:At.55424
            EnsemblPlants:AT5G47900.1 GeneID:834841 KEGG:ath:AT5G47900
            TAIR:At5g47900 HOGENOM:HOG000243739 OMA:WTSSYVV PhylomeDB:B3H4C1
            ProtClustDB:CLSN2689879 ArrayExpress:B3H4C1 Genevestigator:B3H4C1
            Uniprot:B3H4C1
        Length = 440

 Score = 797 (285.6 bits), Expect = 2.6e-79, P = 2.6e-79
 Identities = 161/352 (45%), Positives = 209/352 (59%)

Query:    33 QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL 92
             +RL SLD+FRGL VA MILVD  GG  P I+H+PW+G  LADFVMPFFLFIVGV++A A 
Sbjct:    44 ERLVSLDVFRGLTVAFMILVDDVGGILPSINHSPWDGVTLADFVMPFFLFIVGVSLAFAY 103

Query:    93 KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSY 152
             K +  R  A +K + R+LKLL  G+ LQGGF H  + LTYG+DV  IRL G+LQRIA++Y
Sbjct:   104 KNLSCRFVATRKALIRSLKLLLLGLFLQGGFIHGLNNLTYGIDVEKIRLMGILQRIAIAY 163

Query:   153 LLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMXXXXXXXXXXXXXXXXXPDWQFTII 212
             L+V+L EI+ K   +    +   S+ + Y +HW++                 PDW++ I+
Sbjct:   164 LVVALCEIWLKGNHNVSSEL---SMIKKYRFHWVVAFVITTIYLSLLYGLYVPDWEYQIL 220

Query:   213 NKDSADYGKVF---NVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDS 269
              +D       F    V CGVR    P CNAVG +DR  LGI H+Y  P + R+K C+ + 
Sbjct:   221 KEDQGSTLTTFLNLKVKCGVRGHTGPGCNAVGMLDRMFLGIQHLYRKPVYARTKQCSINY 280

Query:   270 PFEGPLRKDAPSWCHAPFEPEGXXXXXXXXXXXXXXXXXXXXXXXTKGHLARLKQWVTMG 329
             P  GPL  DAPSWC APF+PEG                        K H  RL QW+   
Sbjct:   281 PNNGPLPPDAPSWCQAPFDPEGLLSSLMATVTCLVGLHYGHIIIHFKDHKKRLNQWILRS 340

Query:   330 FALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVDCNIFG 381
             F LL+ GL L+    + LNK LYTLSY+CVTSGA+  + SAIY +VD  ++G
Sbjct:   341 FCLLMLGLALNLFG-MHLNKPLYTLSYMCVTSGASGFLLSAIYLMVD--VYG 389


>MGI|MGI:1196297 [details] [associations]
            symbol:Hgsnat "heparan-alpha-glucosaminide
            N-acetyltransferase" species:10090 "Mus musculus" [GO:0005764
            "lysosome" evidence=IEA] [GO:0005765 "lysosomal membrane"
            evidence=ISO] [GO:0007041 "lysosomal transport" evidence=ISO]
            [GO:0008152 "metabolic process" evidence=ISO] [GO:0015019
            "heparan-alpha-glucosaminide N-acetyltransferase activity"
            evidence=IEA] [GO:0016020 "membrane" evidence=IEA] [GO:0016021
            "integral to membrane" evidence=IEA] [GO:0016740 "transferase
            activity" evidence=IEA] [GO:0016746 "transferase activity,
            transferring acyl groups" evidence=ISO] [GO:0051259 "protein
            oligomerization" evidence=ISO] MGI:MGI:1196297 GO:GO:0051259
            GO:GO:0016021 GO:GO:0005765 GO:GO:0016746 GO:GO:0007041 CTD:138050
            eggNOG:COG4299 HOGENOM:HOG000006803 HOVERGEN:HBG081599 KO:K10532
            OrthoDB:EOG4548Z7 ChiTaRS:HGSNAT GO:GO:0015019 InterPro:IPR012429
            Pfam:PF07786 EMBL:AK035264 EMBL:AK149883 EMBL:AK152926
            EMBL:AK159649 EMBL:AK160068 EMBL:AC093366 EMBL:BC024084
            IPI:IPI00317488 IPI:IPI00975056 RefSeq:NP_084160.1 UniGene:Mm.28326
            ProteinModelPortal:Q3UDW8 STRING:Q3UDW8 PhosphoSite:Q3UDW8
            PaxDb:Q3UDW8 PRIDE:Q3UDW8 Ensembl:ENSMUST00000037609 GeneID:52120
            KEGG:mmu:52120 UCSC:uc009lhg.1 GeneTree:ENSGT00390000001491
            InParanoid:Q3UDW8 OMA:KHSSWNG NextBio:308520 Bgee:Q3UDW8
            CleanEx:MM_HGSNAT Genevestigator:Q3UDW8 Uniprot:Q3UDW8
        Length = 656

 Score = 253 (94.1 bits), Expect = 3.1e-27, Sum P(2) = 3.1e-27
 Identities = 76/252 (30%), Positives = 122/252 (48%)

Query:    17 SEPDVSDQQ-EKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADF 75
             ++P  +D Q E       RL  +D FRGLA+ LM+ V++ GG +    H+ WNG  +AD 
Sbjct:   242 ADPLSADYQPETRRSSANRLRCVDTFRGLALVLMVFVNYGGGKYWYFKHSSWNGLTVADL 301

Query:    76 VMPFFLFIVGVAIALALKRIPDRA-DAVK---KVIFRTLKLLFWGILLQGGFSHAPDELT 131
             V P+F+FI+G +I L++  I  R    +K   K+++R+  L+  G+++    ++    L+
Sbjct:   302 VFPWFVFIMGTSIFLSMTSILQRGCSKLKLLGKIVWRSFLLICIGVIIVNP-NYCLGPLS 360

Query:   132 YGVDVRMIRLCGVLQRIALSYLLVSLVEIFT-KDVQDK-DQSVGRFSIFRLYC-W-HWLM 187
             +      +R+ GVLQR+ ++Y +V+++E F  K V D        FS+  +   W  WL 
Sbjct:   361 WD----KVRIPGVLQRLGVTYFVVAVLEFFFWKPVPDSCTLESSCFSLRDITSSWPQWLT 416

Query:   188 XXXXXXXXXXXXXXXXXPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRK 246
                              P      +      D GK  + T G          A GYIDR 
Sbjct:   417 ILTLESIWLALTFFLPVPGCPTGYLGPGGIGDLGKYPHCTGG----------AAGYIDRL 466

Query:   247 VLGINHMYHHPA 258
             +LG NH+Y HP+
Sbjct:   467 LLGDNHLYQHPS 478

 Score = 86 (35.3 bits), Expect = 3.1e-27, Sum P(2) = 3.1e-27
 Identities = 25/72 (34%), Positives = 35/72 (48%)

Query:   315 TKGHLARLKQWVTMGFALLIFGLTLHFTNA--IPLNKQLYTLSYVCVTSGAAALVFSAIY 372
             TK  L R   W  +   L+   LT    N   IP+NK L+++SYV   S  A  +   +Y
Sbjct:   521 TKAILTRFAAWCCI-LGLISIVLTKVSANEGFIPINKNLWSISYVTTLSCFAFFILLILY 579

Query:   373 ALVDCNIFGPWT 384
              +VD  + G WT
Sbjct:   580 PVVD--VKGLWT 589


>UNIPROTKB|Q68CP4 [details] [associations]
            symbol:HGSNAT "Heparan-alpha-glucosaminide
            N-acetyltransferase" species:9606 "Homo sapiens" [GO:0016021
            "integral to membrane" evidence=IEA] [GO:0015019
            "heparan-alpha-glucosaminide N-acetyltransferase activity"
            evidence=IEA] [GO:0051259 "protein oligomerization" evidence=IDA]
            [GO:0005765 "lysosomal membrane" evidence=IDA;TAS] [GO:0007041
            "lysosomal transport" evidence=IDA] [GO:0016746 "transferase
            activity, transferring acyl groups" evidence=IDA] [GO:0005975
            "carbohydrate metabolic process" evidence=TAS] [GO:0006027
            "glycosaminoglycan catabolic process" evidence=TAS] [GO:0030203
            "glycosaminoglycan metabolic process" evidence=TAS] [GO:0044281
            "small molecule metabolic process" evidence=TAS]
            Reactome:REACT_111217 GO:GO:0051259 GO:GO:0016021
            Reactome:REACT_116125 GO:GO:0005765 GO:GO:0044281 GO:GO:0005975
            GO:GO:0016746 GO:GO:0006027 GO:GO:0007041 EMBL:AC113191
            EMBL:AK304441 EMBL:CR749838 IPI:IPI00739149 IPI:IPI00908672
            RefSeq:NP_689632.2 UniGene:Hs.600384 ProteinModelPortal:Q68CP4
            IntAct:Q68CP4 STRING:Q68CP4 PhosphoSite:Q68CP4 DMDM:124007195
            PaxDb:Q68CP4 PRIDE:Q68CP4 Ensembl:ENST00000379644
            Ensembl:ENST00000458501 GeneID:138050 KEGG:hsa:138050
            UCSC:uc003xpx.4 CTD:138050 GeneCards:GC08P042995 H-InvDB:HIX0007487
            HGNC:HGNC:26527 HPA:HPA029578 MIM:252930 MIM:610453
            neXtProt:NX_Q68CP4 Orphanet:79271 PharmGKB:PA162390851
            eggNOG:COG4299 HOGENOM:HOG000006803 HOVERGEN:HBG081599
            InParanoid:Q68CP4 KO:K10532 OrthoDB:EOG4548Z7 BRENDA:2.3.1.78
            SABIO-RK:Q68CP4 ChiTaRS:HGSNAT GenomeRNAi:138050 NextBio:83735
            ArrayExpress:Q68CP4 Bgee:Q68CP4 CleanEx:HS_HGSNAT
            Genevestigator:Q68CP4 GO:GO:0015019 InterPro:IPR012429 Pfam:PF07786
            Uniprot:Q68CP4
        Length = 663

 Score = 247 (92.0 bits), Expect = 4.2e-25, Sum P(2) = 4.2e-25
 Identities = 72/234 (30%), Positives = 116/234 (49%)

Query:    34 RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALK 93
             RL S+D FRG+A+ LM+ V++ GG +    HA WNG  +AD V P+F+FI+G +I L++ 
Sbjct:   267 RLRSVDTFRGIALILMVFVNYGGGKYWYFKHASWNGLTVADLVFPWFVFIMGSSIFLSMT 326

Query:    94 RIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIA 149
              I  R  +    + K+ +R+  L+  GI++    ++    L++      +R+ GVLQR+ 
Sbjct:   327 SILQRGCSKFRLLGKIAWRSFLLICIGIIIVNP-NYCLGPLSWD----KVRIPGVLQRLG 381

Query:   150 LSYLLVSLVEI-FTKDVQDKDQSVGRFSIFR--LYCW-HWLMXXXXXXXXXXXXXXXXXP 205
             ++Y +V+++E+ F K V +   S       R     W  WL+                 P
Sbjct:   382 VTYFVVAVLELLFAKPVPEHCASERSCLSLRDITSSWPQWLLILVLEGLWLGLTFLLPVP 441

Query:   206 DWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPA 258
                   +      D+GK  N T G          A GYIDR +LG +H+Y HP+
Sbjct:   442 GCPTGYLGPGGIGDFGKYPNCTGG----------AAGYIDRLLLGDDHLYQHPS 485

 Score = 90 (36.7 bits), Expect = 4.2e-25, Sum P(2) = 4.2e-25
 Identities = 26/72 (36%), Positives = 35/72 (48%)

Query:   315 TKGHLARLKQWVTMGFALLIFGLTLHFTNA--IPLNKQLYTLSYVCVTSGAAALVFSAIY 372
             TK  L R   W  +   L+   LT    N   IP+NK L++LSYV   S  A  +   +Y
Sbjct:   528 TKDILIRFTAWCCI-LGLISVALTKVSENEGFIPVNKNLWSLSYVTTLSSFAFFILLVLY 586

Query:   373 ALVDCNIFGPWT 384
              +VD  + G WT
Sbjct:   587 PVVD--VKGLWT 596


>UNIPROTKB|F1NBK1 [details] [associations]
            symbol:HGSNAT "Uncharacterized protein" species:9031
            "Gallus gallus" [GO:0005765 "lysosomal membrane" evidence=IEA]
            [GO:0007041 "lysosomal transport" evidence=IEA] [GO:0016746
            "transferase activity, transferring acyl groups" evidence=IEA]
            [GO:0051259 "protein oligomerization" evidence=IEA] GO:GO:0051259
            GO:GO:0005765 GO:GO:0016746 GO:GO:0007041 InterPro:IPR012429
            Pfam:PF07786 GeneTree:ENSGT00390000001491 OMA:KHSSWNG
            EMBL:AADN02016166 EMBL:AADN02016165 IPI:IPI00595110
            Ensembl:ENSGALT00000016483 Uniprot:F1NBK1
        Length = 584

 Score = 223 (83.6 bits), Expect = 5.8e-24, Sum P(3) = 5.8e-24
 Identities = 52/136 (38%), Positives = 82/136 (60%)

Query:    33 QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL 92
             QRL SLD FRGL++ +M+ V++ GG +    H  WNG  +AD V P+F+FI+G +I+L+L
Sbjct:   186 QRLRSLDTFRGLSLIIMVFVNYGGGKYWFFKHESWNGLTVADLVFPWFVFIMGTSISLSL 245

Query:    93 KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSY 152
                     + +KV+++ L   F  ILL G     P+     +    +R+ GVLQR+ L+Y
Sbjct:   246 SSTLRWGSSKQKVLWKILWRSFLLILL-GVIVVNPNYCLGALSWENLRIPGVLQRLGLTY 304

Query:   153 LLVSLVEI-FTKDVQD 167
             L+V+ +E+ FT+   D
Sbjct:   305 LVVAALELLFTRTGAD 320

 Score = 76 (31.8 bits), Expect = 5.8e-24, Sum P(3) = 5.8e-24
 Identities = 15/26 (57%), Positives = 17/26 (65%)

Query:   233 LNPPCNAVGYIDRKVLGINHMYHHPA 258
             LN    A GYIDR VLG  H+Y HP+
Sbjct:   380 LNCTGGAAGYIDRLVLGEKHIYQHPS 405

 Score = 71 (30.1 bits), Expect = 5.8e-24, Sum P(3) = 5.8e-24
 Identities = 20/63 (31%), Positives = 31/63 (49%)

Query:   315 TKGHLARLKQWVTM-GFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYA 373
             ++G L    +WV++ G    I          IP+NK L++ SYV   S  A ++   +Y 
Sbjct:   449 SEGILPHSLRWVSVQGIIFAILTKCSKEEGFIPINKNLWSTSYVTTMSCFAFILLLLMYY 508

Query:   374 LVD 376
             LVD
Sbjct:   509 LVD 511


>UNIPROTKB|F1MF45 [details] [associations]
            symbol:HGSNAT "Uncharacterized protein" species:9913 "Bos
            taurus" [GO:0051259 "protein oligomerization" evidence=IEA]
            [GO:0016746 "transferase activity, transferring acyl groups"
            evidence=IEA] [GO:0007041 "lysosomal transport" evidence=IEA]
            [GO:0005765 "lysosomal membrane" evidence=IEA] GO:GO:0051259
            GO:GO:0005765 GO:GO:0016746 GO:GO:0007041 InterPro:IPR012429
            Pfam:PF07786 GeneTree:ENSGT00390000001491 OMA:KHSSWNG
            EMBL:DAAA02060966 IPI:IPI01001394 Ensembl:ENSBTAT00000039742
            Uniprot:F1MF45
        Length = 592

 Score = 231 (86.4 bits), Expect = 1.4e-23, Sum P(2) = 1.4e-23
 Identities = 72/233 (30%), Positives = 106/233 (45%)

Query:    34 RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALK 93
             RL  +D FRG+A+ LM+ V++ GG +    H+ WNG  +AD V P+F+FI+G +I L++ 
Sbjct:   196 RLRCVDTFRGMALILMVFVNYGGGKYWYFKHSSWNGLTVADLVFPWFVFIMGTSIFLSMT 255

Query:    94 RIPDRADAVKKVIFRTLKLLFWGILLQ---GGFSHAPDELTYGVDVRMIRLCGVLQRIAL 150
              I  R     K  FR L  + W   L    G F   P      +     R+ GVLQR+  
Sbjct:   256 SILQRG--CSK--FRLLGKIAWRSFLLICIGIFVVNPKYCLGPLSWEKARIPGVLQRLGA 311

Query:   151 SYLLVSLVEI-FTKDVQDKDQSVGR-FSIFRLYC-W-HWLMXXXXXXXXXXXXXXXXXPD 206
             +Y +V+++E+ F K V +   S    FS+  +   W  WL                  P 
Sbjct:   312 TYFVVAVLELLFAKPVPETCASERSCFSLLDITASWPQWLFVLILEGVWLALTFFLPVPG 371

Query:   207 WQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPA 258
                  +      D G+  N T G          A GY+DR +LG  H+Y HP+
Sbjct:   372 CPTGYLGPGGIGDGGRYRNCTGG----------AAGYVDRLLLGDQHLYQHPS 414

 Score = 100 (40.3 bits), Expect = 1.4e-23, Sum P(2) = 1.4e-23
 Identities = 27/72 (37%), Positives = 38/72 (52%)

Query:   315 TKGHLARLKQWVTMGFALLIFGLTLHFTNA--IPLNKQLYTLSYVCVTSGAAALVFSAIY 372
             T+G L R   W  +   L+   LT    N   IP+NK L+++SYV   S  A L+  A+Y
Sbjct:   457 TRGILIRFAAWGCL-LGLVSVALTKASENEGFIPVNKNLWSISYVTTLSSLAFLILLALY 515

Query:   373 ALVDCNIFGPWT 384
              +VD  + G WT
Sbjct:   516 PVVD--VKGLWT 525


>DICTYBASE|DDB_G0286315 [details] [associations]
            symbol:DDB_G0286315 "transmembrane protein"
            species:44689 "Dictyostelium discoideum" [GO:0008150
            "biological_process" evidence=ND] [GO:0003674 "molecular_function"
            evidence=ND] [GO:0016021 "integral to membrane" evidence=IEA]
            dictyBase:DDB_G0286315 GO:GO:0016021 EMBL:AAFI02000085
            eggNOG:COG4299 KO:K10532 RefSeq:XP_637852.1
            EnsemblProtists:DDB0234045 GeneID:8625566 KEGG:ddi:DDB_G0286315
            InParanoid:Q54LX9 OMA:SITIMIF Uniprot:Q54LX9
        Length = 675

 Score = 237 (88.5 bits), Expect = 1.5e-21, Sum P(2) = 1.5e-21
 Identities = 53/136 (38%), Positives = 86/136 (63%)

Query:    26 EKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVG 85
             E+ + K  RL SLD+FRG ++ +MI V++ GG +   +H+ WNG  +AD V P+F+FI+G
Sbjct:   198 ERENRKKDRLRSLDVFRGFSITIMIFVNYGGGGYWFFNHSLWNGLTVADLVFPWFVFIMG 257

Query:    86 VAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVL 145
             +A+ L+   +  R    K++IF+  KLL   I+L   F+     +  GVD++  R+ GVL
Sbjct:   258 IAMPLSFHAMEKRGTP-KRIIFQ--KLLRRSIIL---FALGLF-INNGVDLQQWRILGVL 310

Query:   146 QRIALSYLLVSLVEIF 161
             QR ++SYL+V  + +F
Sbjct:   311 QRFSISYLVVGSIMLF 326

 Score = 74 (31.1 bits), Expect = 1.5e-21, Sum P(2) = 1.5e-21
 Identities = 22/95 (23%), Positives = 41/95 (43%)

Query:   287 FEPEGXXXXXXXXXXXXXXXXXXXXXXXTKGHLARLKQW-----VTMGFALLIFGLTLHF 341
             ++PEG                        K + +RL +W     V  G A  + GLT + 
Sbjct:   510 YDPEGTVGYLTSIFLCFIGVQAGRIILTYKSNRSRLIRWMVWSVVLCGIAAGLCGLTQN- 568

Query:   342 TNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
                +P+NK L++ S++ + +G    V + ++ L+D
Sbjct:   569 QGWLPVNKNLWSPSFILLMAGFGFFVLTVMFILID 603


>DICTYBASE|DDB_G0270192 [details] [associations]
            symbol:DDB_G0270192 "DUF1624 family protein"
            species:44689 "Dictyostelium discoideum" [GO:0008150
            "biological_process" evidence=ND] [GO:0003674 "molecular_function"
            evidence=ND] [GO:0044351 "macropinocytosis" evidence=RCA]
            dictyBase:DDB_G0270192 EMBL:AAFI02000005 eggNOG:COG4299
            InterPro:IPR012429 Pfam:PF07786 RefSeq:XP_646608.1 STRING:Q55C73
            EnsemblProtists:DDB0190869 GeneID:8617580 KEGG:ddi:DDB_G0270192
            InParanoid:Q55C73 OMA:IRIYGIL Uniprot:Q55C73
        Length = 426

 Score = 186 (70.5 bits), Expect = 4.9e-19, Sum P(3) = 4.9e-19
 Identities = 51/128 (39%), Positives = 68/128 (53%)

Query:    33 QRLASLDIFRGLAVALMILVDH-AGGD--WPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
             +R+ SLD  RGL +  MILVD+ AG D  WP ++   WNG + AD + P F+FI G +IA
Sbjct:    44 RRMGSLDAVRGLTIFGMILVDNQAGNDVIWP-LNETEWNGLSTADLIFPSFIFISGFSIA 102

Query:    90 LALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIA 149
             LALK   +       +I RTL L F    +Q   +   D   +       R+ GVLQRIA
Sbjct:   103 LALKNSKNTTSTWYGIIRRTLLLFF----IQCFLNLMGDHFNFTT----FRIMGVLQRIA 154

Query:   150 LSYLLVSL 157
             + Y    L
Sbjct:   155 ICYFFSCL 162

 Score = 73 (30.8 bits), Expect = 4.9e-19, Sum P(3) = 4.9e-19
 Identities = 16/27 (59%), Positives = 17/27 (62%)

Query:   227 CGVRAKLNPPCNAVGYIDRKVLGINHM 253
             CG RA L   CNA  YID KV G+N M
Sbjct:   195 CG-RANLTQNCNAGAYIDSKVFGLNIM 220

 Score = 67 (28.6 bits), Expect = 4.9e-19, Sum P(3) = 4.9e-19
 Identities = 13/60 (21%), Positives = 32/60 (53%)

Query:   317 GHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
             G+   + +W+ +    ++  ++L  T  +P NK++++ S+   T GA+  +    + L+D
Sbjct:   266 GNTDIIVRWILLVILFMVPAISLGAT-VMPFNKKIWSFSFALFTVGASGSLILIAFILID 324


>UNIPROTKB|Q489U3 [details] [associations]
            symbol:CPS_0413 "Putative membrane protein" species:167879
            "Colwellia psychrerythraea 34H" [GO:0003674 "molecular_function"
            evidence=ND] [GO:0008150 "biological_process" evidence=ND]
            EMBL:CP000083 GenomeReviews:CP000083_GR eggNOG:COG4299
            InterPro:IPR012429 Pfam:PF07786 OMA:IRIYGIL RefSeq:YP_267171.1
            STRING:Q489U3 DNASU:3518441 GeneID:3518441 KEGG:cps:CPS_0413
            PATRIC:21464187 HOGENOM:HOG000295496
            BioCyc:CPSY167879:GI48-508-MONOMER Uniprot:Q489U3
        Length = 358

 Score = 172 (65.6 bits), Expect = 1.2e-15, Sum P(3) = 1.2e-15
 Identities = 44/129 (34%), Positives = 69/129 (53%)

Query:    34 RLASLDIFRGLAVALMILVDHAGGDWPEIS----HAPWNGCNLADFVMPFFLFIVGVAIA 89
             R  +LD FRG+ +ALMILV+   G W  +     HA W+G    D V PFFLFI+G A+ 
Sbjct:     3 RYLALDAFRGITIALMILVN-TPGTWSHVYAPLLHAEWDGATPTDLVFPFFLFIIGSAMF 61

Query:    90 LALKRIPDRA--DAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQR 147
              + K+    A  +  +K+I R   + F G +L        + + + V+    R+ G+LQR
Sbjct:    62 FSFKKSNFSASPEQFRKIIKRGFIMFFIGFML--------NVIPFTVNAEDWRIMGILQR 113

Query:   148 IALSYLLVS 156
             I ++Y + +
Sbjct:   114 IGIAYTVAA 122

 Score = 74 (31.1 bits), Expect = 1.2e-15, Sum P(3) = 1.2e-15
 Identities = 20/51 (39%), Positives = 30/51 (58%)

Query:   326 VTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
             + +GF  L +GL L      P+NK L+T SYV  ++G A L+ +A   L+D
Sbjct:   228 LAVGFGAL-WGLVL------PINKSLWTPSYVIYSTGFACLLLAAFIWLID 271

 Score = 45 (20.9 bits), Expect = 1.2e-15, Sum P(3) = 1.2e-15
 Identities = 8/17 (47%), Positives = 10/17 (58%)

Query:   238 NAVGYIDRKVLGINHMY 254
             N +  +D  V G NHMY
Sbjct:   161 NIIRQLDLAVFGANHMY 177


>TIGR_CMR|CPS_0413 [details] [associations]
            symbol:CPS_0413 "putative membrane protein" species:167879
            "Colwellia psychrerythraea 34H" [GO:0003674 "molecular_function"
            evidence=ND] [GO:0008150 "biological_process" evidence=ND]
            [GO:0016020 "membrane" evidence=ISS] EMBL:CP000083
            GenomeReviews:CP000083_GR eggNOG:COG4299 InterPro:IPR012429
            Pfam:PF07786 OMA:IRIYGIL RefSeq:YP_267171.1 STRING:Q489U3
            DNASU:3518441 GeneID:3518441 KEGG:cps:CPS_0413 PATRIC:21464187
            HOGENOM:HOG000295496 BioCyc:CPSY167879:GI48-508-MONOMER
            Uniprot:Q489U3
        Length = 358

 Score = 172 (65.6 bits), Expect = 1.2e-15, Sum P(3) = 1.2e-15
 Identities = 44/129 (34%), Positives = 69/129 (53%)

Query:    34 RLASLDIFRGLAVALMILVDHAGGDWPEIS----HAPWNGCNLADFVMPFFLFIVGVAIA 89
             R  +LD FRG+ +ALMILV+   G W  +     HA W+G    D V PFFLFI+G A+ 
Sbjct:     3 RYLALDAFRGITIALMILVN-TPGTWSHVYAPLLHAEWDGATPTDLVFPFFLFIIGSAMF 61

Query:    90 LALKRIPDRA--DAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQR 147
              + K+    A  +  +K+I R   + F G +L        + + + V+    R+ G+LQR
Sbjct:    62 FSFKKSNFSASPEQFRKIIKRGFIMFFIGFML--------NVIPFTVNAEDWRIMGILQR 113

Query:   148 IALSYLLVS 156
             I ++Y + +
Sbjct:   114 IGIAYTVAA 122

 Score = 74 (31.1 bits), Expect = 1.2e-15, Sum P(3) = 1.2e-15
 Identities = 20/51 (39%), Positives = 30/51 (58%)

Query:   326 VTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALVD 376
             + +GF  L +GL L      P+NK L+T SYV  ++G A L+ +A   L+D
Sbjct:   228 LAVGFGAL-WGLVL------PINKSLWTPSYVIYSTGFACLLLAAFIWLID 271

 Score = 45 (20.9 bits), Expect = 1.2e-15, Sum P(3) = 1.2e-15
 Identities = 8/17 (47%), Positives = 10/17 (58%)

Query:   238 NAVGYIDRKVLGINHMY 254
             N +  +D  V G NHMY
Sbjct:   161 NIIRQLDLAVFGANHMY 177


>UNIPROTKB|Q8EBK9 [details] [associations]
            symbol:nagX "Uncharacterized protein" species:211586
            "Shewanella oneidensis MR-1" [GO:0003674 "molecular_function"
            evidence=ND] [GO:0005575 "cellular_component" evidence=ND]
            [GO:0008150 "biological_process" evidence=ND] EMBL:AE014299
            GenomeReviews:AE014299_GR HOGENOM:HOG000295496 RefSeq:NP_719051.1
            DNASU:1171178 GeneID:1171178 KEGG:son:SO_3504 PATRIC:23526700
            OMA:FVGHFIV ProtClustDB:CLSK907194 Uniprot:Q8EBK9
        Length = 395

 Score = 142 (55.0 bits), Expect = 1.4e-12, Sum P(2) = 1.4e-12
 Identities = 48/144 (33%), Positives = 69/144 (47%)

Query:    34 RLASLDIFRGLAV-----------ALMILVDHAGGDW--PEISHAPWNGCNLADFVMPFF 80
             RL SLD  RG  +           AL+I    AG  W   ++ H+ W+G  L D + P F
Sbjct:    29 RLMSLDALRGFDMFWILGGEALFGALLIFTGWAGWQWGDTQMHHSEWHGFRLYDLIFPLF 88

Query:    81 LFIVGVAIALALKRI---P--DRADAVKKVIFRTLKLLFWGILLQGGF-SHAPDELTYGV 134
             +F+ GVA+ L+ KR+   P  +R    +  + R   LL  GIL   G+ + AP      V
Sbjct:    89 IFLSGVALGLSPKRLDKLPLHERLPVYRHGVKRLFLLLLLGILYNHGWGTGAP------V 142

Query:   135 DVRMIRLCGVLQRIALSYLLVSLV 158
             D   IR   VL RIA ++   +L+
Sbjct:   143 DPDKIRYASVLGRIAFAWFFAALL 166

 Score = 94 (38.1 bits), Expect = 1.4e-12, Sum P(2) = 1.4e-12
 Identities = 25/61 (40%), Positives = 34/61 (55%)

Query:   316 KGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALV 375
             KG  A++      G   L  G  L     IP+NK+L+T S+V VTSG + L+ +  YALV
Sbjct:   259 KGEWAKVGLLGAAGGVCLALGWLLDAV--IPVNKELWTSSFVLVTSGWSMLLLALFYALV 316

Query:   376 D 376
             D
Sbjct:   317 D 317


>TIGR_CMR|SO_3504 [details] [associations]
            symbol:SO_3504 "conserved hypothetical protein"
            species:211586 "Shewanella oneidensis MR-1" [GO:0008150
            "biological_process" evidence=ND] [GO:0003674 "molecular_function"
            evidence=ND] [GO:0005575 "cellular_component" evidence=ND]
            EMBL:AE014299 GenomeReviews:AE014299_GR HOGENOM:HOG000295496
            RefSeq:NP_719051.1 DNASU:1171178 GeneID:1171178 KEGG:son:SO_3504
            PATRIC:23526700 OMA:FVGHFIV ProtClustDB:CLSK907194 Uniprot:Q8EBK9
        Length = 395

 Score = 142 (55.0 bits), Expect = 1.4e-12, Sum P(2) = 1.4e-12
 Identities = 48/144 (33%), Positives = 69/144 (47%)

Query:    34 RLASLDIFRGLAV-----------ALMILVDHAGGDW--PEISHAPWNGCNLADFVMPFF 80
             RL SLD  RG  +           AL+I    AG  W   ++ H+ W+G  L D + P F
Sbjct:    29 RLMSLDALRGFDMFWILGGEALFGALLIFTGWAGWQWGDTQMHHSEWHGFRLYDLIFPLF 88

Query:    81 LFIVGVAIALALKRI---P--DRADAVKKVIFRTLKLLFWGILLQGGF-SHAPDELTYGV 134
             +F+ GVA+ L+ KR+   P  +R    +  + R   LL  GIL   G+ + AP      V
Sbjct:    89 IFLSGVALGLSPKRLDKLPLHERLPVYRHGVKRLFLLLLLGILYNHGWGTGAP------V 142

Query:   135 DVRMIRLCGVLQRIALSYLLVSLV 158
             D   IR   VL RIA ++   +L+
Sbjct:   143 DPDKIRYASVLGRIAFAWFFAALL 166

 Score = 94 (38.1 bits), Expect = 1.4e-12, Sum P(2) = 1.4e-12
 Identities = 25/61 (40%), Positives = 34/61 (55%)

Query:   316 KGHLARLKQWVTMGFALLIFGLTLHFTNAIPLNKQLYTLSYVCVTSGAAALVFSAIYALV 375
             KG  A++      G   L  G  L     IP+NK+L+T S+V VTSG + L+ +  YALV
Sbjct:   259 KGEWAKVGLLGAAGGVCLALGWLLDAV--IPVNKELWTSSFVLVTSGWSMLLLALFYALV 316

Query:   376 D 376
             D
Sbjct:   317 D 317


>UNIPROTKB|F1SE48 [details] [associations]
            symbol:HGSNAT "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0051259 "protein oligomerization" evidence=IEA]
            [GO:0016746 "transferase activity, transferring acyl groups"
            evidence=IEA] [GO:0007041 "lysosomal transport" evidence=IEA]
            [GO:0005765 "lysosomal membrane" evidence=IEA] GO:GO:0051259
            GO:GO:0005765 GO:GO:0016746 GO:GO:0007041
            GeneTree:ENSGT00390000001491 EMBL:CU640485
            Ensembl:ENSSSCT00000007671 OMA:HEVFEEY Uniprot:F1SE48
        Length = 298

 Score = 103 (41.3 bits), Expect = 7.9e-09, Sum P(2) = 7.9e-09
 Identities = 37/124 (29%), Positives = 54/124 (43%)

Query:   140 RLCGVLQRIALSYLLVSLVEI-FTKDVQDKDQSVGR-FSIFRLYC-W-HWLMXXXXXXXX 195
             R+ GVLQR+ ++Y +V+++E+ F K V +   S    FS+  +   W  WL         
Sbjct:     7 RIPGVLQRLGVTYFVVAVLELLFAKPVPESCASERSCFSLLDVTSSWPQWLFVLVLEGVW 66

Query:   196 XXXXXXXXXPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMY 254
                      P      +      D GK  N T G          A GYIDR +LG +H+Y
Sbjct:    67 LALTFFLPVPGCPTGYLGPGGIGDLGKYPNCTGG----------AAGYIDRLLLGDDHLY 116

Query:   255 HHPA 258
              HP+
Sbjct:   117 QHPS 120

 Score = 97 (39.2 bits), Expect = 7.9e-09, Sum P(2) = 7.9e-09
 Identities = 27/72 (37%), Positives = 36/72 (50%)

Query:   315 TKGHLARLKQWVTMGFALLIFGLTLHFTNA--IPLNKQLYTLSYVCVTSGAAALVFSAIY 372
             TKG L R   W      L+   LT    N   IP+NK L++ SYV   S +A L+   +Y
Sbjct:   163 TKGILIRFAVWGCF-LGLISVALTKASENEGFIPVNKNLWSTSYVTTLSSSAFLILLVLY 221

Query:   373 ALVDCNIFGPWT 384
              +VD  + G WT
Sbjct:   222 PIVD--VKGLWT 231


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.327   0.141   0.461    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      384       344   0.00097  116 3  11 22  0.45    33
                                                     33  0.45    37


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  13
  No. of states in DFA:  621 (66 KB)
  Total size of DFA:  251 KB (2133 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  24.09u 0.13s 24.22t   Elapsed:  00:00:01
  Total cpu time:  24.09u 0.13s 24.22t   Elapsed:  00:00:01
  Start:  Mon May 20 22:40:30 2013   End:  Mon May 20 22:40:31 2013

Back to top