BLAST Results

Query Summary

Your job contains 1 sequence.

Parameters
Threshold: 0.001
Maximum number of alignments shown: 100
BLAST filter: on

Query Sequence

>010822
MADLRIVEEGLGRTQLVEQEQDDGKDSENGINKEKGLERSEVQDEQKGELQLQQLLQQKS
KRVATLDAFRGLTVVWVYTQLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVA
IALALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQR
IALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNW
SFSEHSDHGVKKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSP
NSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGF
GLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAAGIVFSALYVLMDVWELRTPFLFLKWI
GMNAMLVFVLGAQGILAGFVNGWYYKNPDNTLVNWIQNHLFIHVWNSERLGTLLYVIFAE
ITFWGVVAGILHRLGIYWKL

High Scoring Gene Products

Symbol, full name Information P value
AT5G27730 protein from Arabidopsis thaliana 6.7e-152
AT5G47900 protein from Arabidopsis thaliana 1.5e-97
Hgsnat
heparan-alpha-glucosaminide N-acetyltransferase
protein from Mus musculus 6.0e-32
HGSNAT
Uncharacterized protein
protein from Bos taurus 3.6e-31
HGSNAT
Heparan-alpha-glucosaminide N-acetyltransferase
protein from Homo sapiens 7.9e-30
DDB_G0286315
transmembrane protein
gene from Dictyostelium discoideum 1.0e-22
DDB_G0270192
DUF1624 family protein
gene from Dictyostelium discoideum 6.8e-22
nagX
Uncharacterized protein
protein from Shewanella oneidensis MR-1 1.7e-20
SO_3504
conserved hypothetical protein
protein from Shewanella oneidensis MR-1 1.7e-20
CPS_0413
Putative membrane protein
protein from Colwellia psychrerythraea 34H 2.3e-20
CPS_0413
putative membrane protein
protein from Colwellia psychrerythraea 34H 2.3e-20
HGSNAT
Uncharacterized protein
protein from Sus scrofa 5.1e-19

The BLAST search returned 1 gene product which did not match your query constraints. Please see the full BLAST report below for the details.

Back to top

Raw Blast Data

BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]

Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.

Reference:  Gish, W. (1996-2006) http://blast.wustl.edu

Query=  010822
        (500 letters)

Database:  go_20130330-seqdb.fasta
           368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done

                                                                     Smallest
                                                                       Sum
                                                              High  Probability
Sequences producing High-scoring Segment Pairs:              Score  P(N)      N

TAIR|locus:2180305 - symbol:AT5G27730 "AT5G27730" species...  1482  6.7e-152  1
TAIR|locus:2160902 - symbol:AT5G47900 "AT5G47900" species...   969  1.5e-97   1
MGI|MGI:1196297 - symbol:Hgsnat "heparan-alpha-glucosamin...   252  6.0e-32   2
UNIPROTKB|F1MF45 - symbol:HGSNAT "Uncharacterized protein...   231  3.6e-31   2
UNIPROTKB|Q68CP4 - symbol:HGSNAT "Heparan-alpha-glucosami...   232  7.9e-30   2
DICTYBASE|DDB_G0286315 - symbol:DDB_G0286315 "transmembra...   208  1.0e-22   3
DICTYBASE|DDB_G0270192 - symbol:DDB_G0270192 "DUF1624 fam...   175  6.8e-22   2
UNIPROTKB|F1NBK1 - symbol:HGSNAT "Uncharacterized protein...   262  1.4e-20   2
UNIPROTKB|Q8EBK9 - symbol:nagX "Uncharacterized protein" ...   159  1.7e-20   2
TIGR_CMR|SO_3504 - symbol:SO_3504 "conserved hypothetical...   159  1.7e-20   2
UNIPROTKB|Q489U3 - symbol:CPS_0413 "Putative membrane pro...   175  2.3e-20   3
TIGR_CMR|CPS_0413 - symbol:CPS_0413 "putative membrane pr...   175  2.3e-20   3
UNIPROTKB|F1SE48 - symbol:HGSNAT "Uncharacterized protein...   180  5.1e-19   2


>TAIR|locus:2180305 [details] [associations]
            symbol:AT5G27730 "AT5G27730" species:3702 "Arabidopsis
            thaliana" [GO:0003674 "molecular_function" evidence=ND] [GO:0005575
            "cellular_component" evidence=ND] [GO:0008150 "biological_process"
            evidence=ND] [GO:0016556 "mRNA modification" evidence=RCA]
            EMBL:CP002688 GenomeReviews:BA000015_GR GO:GO:0016740
            eggNOG:COG4299 KO:K10532 InterPro:IPR012429 Pfam:PF07786
            OMA:IRIYGIL HOGENOM:HOG000243739 ProtClustDB:CLSN2689879
            EMBL:AY034969 EMBL:BT002370 IPI:IPI00535083 RefSeq:NP_568500.1
            UniGene:At.19161 STRING:Q94CC1 PRIDE:Q94CC1
            EnsemblPlants:AT5G27730.1 GeneID:832835 KEGG:ath:AT5G27730
            TAIR:At5g27730 InParanoid:Q94CC1 PhylomeDB:Q94CC1
            ArrayExpress:Q94CC1 Genevestigator:Q94CC1 Uniprot:Q94CC1
        Length = 472

 Score = 1482 (526.7 bits), Expect = 6.7e-152, P = 6.7e-152
 Identities = 271/442 (61%), Positives = 333/442 (75%)

Query:    62 RVATLDAFRGLTVVWVYTQLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAI 121
             R+A+LD FRGLTV      LMILVDDAGG +  I H+PWNGC LADFVMPFFLFIVGV+I
Sbjct:    36 RLASLDIFRGLTVA-----LMILVDDAGGDWPMIAHAPWNGCNLADFVMPFFLFIVGVSI 90

Query:   122 ALALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRI 181
             AL+LK++     A KK+ FRT KLLFWG++LQGG+SHAPD L+YGVD+  +R+CGILQRI
Sbjct:    91 ALSLKRISNKFEACKKVGFRTCKLLFWGLLLQGGFSHAPDELTYGVDVTMMRFCGILQRI 150

Query:   182 ALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWS 241
             AL Y+VVAL+E  T       L     SIF +Y W WI      VIY+ T Y  YVP+W 
Sbjct:   151 ALSYLVVALVEIFTKDSHEENLSTGRFSIFKSYYWHWIVAASVLVIYLATLYGTYVPDWE 210

Query:   242 FSEHSDHGV---KKYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLS 298
             F  +    V   K   V CG+RG L P CNAVGYVDR++ GINH+Y  P W R +ACT  
Sbjct:   211 FVVYDKDSVLYGKILSVSCGVRGKLNPPCNAVGYVDRQVLGINHMYHHPAWRRSKACTDD 270

Query:   299 SPNSGPLREDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSM 358
             SP  G +R+DAPSWCRAPFEPEG+LS+ISAILS  IG+H+GH+++H KGHSARLKHW+S 
Sbjct:   271 SPYEGAIRQDAPSWCRAPFEPEGILSSISAILSTIIGVHFGHIILHLKGHSARLKHWIST 330

Query:   359 GFGLLIIAIILHFTNAIPINKQLYSFSYVCFTAGAAGIVFSALYVLMDVWELRTPFLFLK 418
             G  LL + + LHFT+ +P+NKQLYSFSY+C T+GAA +VFS+LY L+D+ E +  FL LK
Sbjct:   331 GLVLLALGLTLHFTHLMPLNKQLYSFSYICVTSGAAALVFSSLYSLVDILEWKHMFLPLK 390

Query:   419 WIGMNAMLVFVLGAQGILAGFVNGWYYKNPDNTLVNWIQNHLFIHVWNSERLGTLLYVIF 478
             WIGMNAMLV+V+GA+GILA F NGWYY++P NTL+NWI+ H+FI VW+S R+G L+YVIF
Sbjct:   391 WIGMNAMLVYVMGAEGILAAFFNGWYYRHPHNTLINWIREHVFIRVWHSRRVGVLMYVIF 450

Query:   479 AEITFWGVVAGILHRLGIYWKL 500
             AEI FWG+V G+ HR  IYWKL
Sbjct:   451 AEILFWGLVTGVFHRFKIYWKL 472


>TAIR|locus:2160902 [details] [associations]
            symbol:AT5G47900 "AT5G47900" species:3702 "Arabidopsis
            thaliana" [GO:0003674 "molecular_function" evidence=ND] [GO:0005634
            "nucleus" evidence=ISM] [GO:0005739 "mitochondrion" evidence=ISM]
            [GO:0008150 "biological_process" evidence=ND] EMBL:CP002688
            GenomeReviews:BA000015_GR EMBL:AB016886 InterPro:IPR012429
            Pfam:PF07786 IPI:IPI00530923 RefSeq:NP_199601.2 UniGene:At.55424
            EnsemblPlants:AT5G47900.1 GeneID:834841 KEGG:ath:AT5G47900
            TAIR:At5g47900 HOGENOM:HOG000243739 OMA:WTSSYVV PhylomeDB:B3H4C1
            ProtClustDB:CLSN2689879 ArrayExpress:B3H4C1 Genevestigator:B3H4C1
            Uniprot:B3H4C1
        Length = 440

 Score = 969 (346.2 bits), Expect = 1.5e-97, P = 1.5e-97
 Identities = 196/450 (43%), Positives = 290/450 (64%)

Query:    13 RTQLVEQEQDDGKDSENGINKEKGLERSEVQDEQKGEXXXXXXXXXKSKRVATLDAFRGL 72
             RT+L   E    KD+++  N  +  E+ ++  E   +           +R+ +LD FRGL
Sbjct:     2 RTKLTMYEAI--KDNDD--NDHQWREKKDI--ESALQISRSSSLPPDKERLVSLDVFRGL 55

Query:    73 TVVWVYTQLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKIN 132
             TV +     MILVDD GG    I+HSPW+G TLADFVMPFFLFIVGV++A A K +    
Sbjct:    56 TVAF-----MILVDDVGGILPSINHSPWDGVTLADFVMPFFLFIVGVSLAFAYKNLSCRF 110

Query:   133 GAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIE 192
              A +K + R+LKLL  G+ LQGG+ H  + L+YG+D++ IR  GILQRIA+ Y+VVAL E
Sbjct:   111 VATRKALIRSLKLLLLGLFLQGGFIHGLNNLTYGIDVEKIRLMGILQRIAIAYLVVALCE 170

Query:   193 TLTTKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSE-HSDHG-- 249
              +  K   NV     LS+   Y++ W+  F+   IY+   Y LYVP+W +     D G  
Sbjct:   171 -IWLKGNHNVSS--ELSMIKKYRFHWVVAFVITTIYLSLLYGLYVPDWEYQILKEDQGST 227

Query:   250 VKKYI---VKCGMRGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLR 306
             +  ++   VKCG+RGH GP CNAVG +DR   GI HLY  PV++R + C+++ PN+GPL 
Sbjct:   228 LTTFLNLKVKCGVRGHTGPGCNAVGMLDRMFLGIQHLYRKPVYARTKQCSINYPNNGPLP 287

Query:   307 EDAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIA 366
              DAPSWC+APF+PEGLLS++ A ++  +G+HYGH++IHFK H  RL  W+   F LL++ 
Sbjct:   288 PDAPSWCQAPFDPEGLLSSLMATVTCLVGLHYGHIIIHFKDHKKRLNQWILRSFCLLMLG 347

Query:   367 IILHFTNAIPINKQLYSFSYVCFTAGAAGIVFSALYVLMDVWELRTPFLFLKWIGMNAML 426
             + L+    + +NK LY+ SY+C T+GA+G + SA+Y+++DV+  +   L L+W+G++A+ 
Sbjct:   348 LALNLFG-MHLNKPLYTLSYMCVTSGASGFLLSAIYLMVDVYGYKRASLVLEWMGIHALP 406

Query:   427 VFVLGAQGILAGFVNGWYYKNPDNTLVNWI 456
             ++VL A  ++   ++G+Y+KNP N L++ I
Sbjct:   407 IYVLIACNLVFLIIHGFYWKNPINNLLHLI 436


>MGI|MGI:1196297 [details] [associations]
            symbol:Hgsnat "heparan-alpha-glucosaminide
            N-acetyltransferase" species:10090 "Mus musculus" [GO:0005764
            "lysosome" evidence=IEA] [GO:0005765 "lysosomal membrane"
            evidence=ISO] [GO:0007041 "lysosomal transport" evidence=ISO]
            [GO:0008152 "metabolic process" evidence=ISO] [GO:0015019
            "heparan-alpha-glucosaminide N-acetyltransferase activity"
            evidence=IEA] [GO:0016020 "membrane" evidence=IEA] [GO:0016021
            "integral to membrane" evidence=IEA] [GO:0016740 "transferase
            activity" evidence=IEA] [GO:0016746 "transferase activity,
            transferring acyl groups" evidence=ISO] [GO:0051259 "protein
            oligomerization" evidence=ISO] MGI:MGI:1196297 GO:GO:0051259
            GO:GO:0016021 GO:GO:0005765 GO:GO:0016746 GO:GO:0007041 CTD:138050
            eggNOG:COG4299 HOGENOM:HOG000006803 HOVERGEN:HBG081599 KO:K10532
            OrthoDB:EOG4548Z7 ChiTaRS:HGSNAT GO:GO:0015019 InterPro:IPR012429
            Pfam:PF07786 EMBL:AK035264 EMBL:AK149883 EMBL:AK152926
            EMBL:AK159649 EMBL:AK160068 EMBL:AC093366 EMBL:BC024084
            IPI:IPI00317488 IPI:IPI00975056 RefSeq:NP_084160.1 UniGene:Mm.28326
            ProteinModelPortal:Q3UDW8 STRING:Q3UDW8 PhosphoSite:Q3UDW8
            PaxDb:Q3UDW8 PRIDE:Q3UDW8 Ensembl:ENSMUST00000037609 GeneID:52120
            KEGG:mmu:52120 UCSC:uc009lhg.1 GeneTree:ENSGT00390000001491
            InParanoid:Q3UDW8 OMA:KHSSWNG NextBio:308520 Bgee:Q3UDW8
            CleanEx:MM_HGSNAT Genevestigator:Q3UDW8 Uniprot:Q3UDW8
        Length = 656

 Score = 252 (93.8 bits), Expect = 6.0e-32, Sum P(2) = 6.0e-32
 Identities = 84/243 (34%), Positives = 122/243 (50%)

Query:    60 SKRVATLDAFRGLTVVWVYTQLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGV 119
             + R+  +D FRGL +V     LM+ V+  GG Y    HS WNG T+AD V P+F+FI+G 
Sbjct:   258 ANRLRCVDTFRGLALV-----LMVFVNYGGGKYWYFKHSSWNGLTVADLVFPWFVFIMGT 312

Query:   120 AIALALKKVPKINGAVK-----KIIFRTLKLLFWGIILQG-GYSHAPDALSYGVDMKHIR 173
             +I L++  + +  G  K     KI++R+  L+  G+I+    Y   P  LS+      +R
Sbjct:   313 SIFLSMTSILQ-RGCSKLKLLGKIVWRSFLLICIGVIIVNPNYCLGP--LSWD----KVR 365

Query:   174 WCGILQRIALVYVVVALIETLTTKRRPN--VLEPRHLSI--FTAYQW-QWIGGFIAFVIY 228
               G+LQR+ + Y VVA++E    K  P+   LE    S+   T+  W QW+       I+
Sbjct:   366 IPGVLQRLGVTYFVVAVLEFFFWKPVPDSCTLESSCFSLRDITS-SWPQWLTILTLESIW 424

Query:   229 IITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLG--PACN--AVGYVDRELWGINHLY 284
             +  T+ L VP              Y+   G+ G LG  P C   A GY+DR L G NHLY
Sbjct:   425 LALTFFLPVPGCPTG---------YLGPGGI-GDLGKYPHCTGGAAGYIDRLLLGDNHLY 474

Query:   285 SDP 287
               P
Sbjct:   475 QHP 477

 Score = 173 (66.0 bits), Expect = 6.0e-32, Sum P(2) = 6.0e-32
 Identities = 54/177 (30%), Positives = 94/177 (53%)

Query:   317 FEPEGLLSTISAILSGTIGIHYGHVLIHFKGHS-ARLKHWVSMGFGLLIIAIILHFTNA- 374
             ++PEG+L TI++I+   +G+  G +L+++K  + A L  + +    L +I+I+L   +A 
Sbjct:   489 YDPEGVLGTINSIVMAFLGVQAGKILVYYKDQTKAILTRFAAWCCILGLISIVLTKVSAN 548

Query:   375 ---IPINKQLYSFSYVCFTAGAAGIVFSALYVLMDVWEL--RTPFLFLKWIGMNAMLVFV 429
                IPINK L+S SYV   +  A  +   LY ++DV  L   TPF +    GMN++LV+V
Sbjct:   549 EGFIPINKNLWSISYVTTLSCFAFFILLILYPVVDVKGLWTGTPFFYP---GMNSILVYV 605

Query:   430 LGAQGILAGFVNGWYYKNPDNTLVNWIQNHLFIHVWNSERLGTLLYVIFAEITFWGV 486
              G + +   F   W   +  +   + IQN +   +W       + YV++ +  FW +
Sbjct:   606 -GHEVLENYFPFQWKLADEQSHKEHLIQNIVATALWV-----LIAYVLYKKKLFWKI 656


>UNIPROTKB|F1MF45 [details] [associations]
            symbol:HGSNAT "Uncharacterized protein" species:9913 "Bos
            taurus" [GO:0051259 "protein oligomerization" evidence=IEA]
            [GO:0016746 "transferase activity, transferring acyl groups"
            evidence=IEA] [GO:0007041 "lysosomal transport" evidence=IEA]
            [GO:0005765 "lysosomal membrane" evidence=IEA] GO:GO:0051259
            GO:GO:0005765 GO:GO:0016746 GO:GO:0007041 InterPro:IPR012429
            Pfam:PF07786 GeneTree:ENSGT00390000001491 OMA:KHSSWNG
            EMBL:DAAA02060966 IPI:IPI01001394 Ensembl:ENSBTAT00000039742
            Uniprot:F1MF45
        Length = 592

 Score = 231 (86.4 bits), Expect = 3.6e-31, Sum P(2) = 3.6e-31
 Identities = 86/276 (31%), Positives = 126/276 (45%)

Query:    23 DGKDSENGINKEKGLERSEVQDEQKGEXXXXXXXXXKSKRVATLDAFRGLTVVWVYTQLM 82
             + ++++  IN E G   S   D Q               R+  +D FRG+ ++     LM
Sbjct:   162 NSRETDRLINSELG-SPSRASDPQP----EAWRRSAAPLRLRCVDTFRGMALI-----LM 211

Query:    83 ILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVK-----K 137
             + V+  GG Y    HS WNG T+AD V P+F+FI+G +I L++  + +  G  K     K
Sbjct:   212 VFVNYGGGKYWYFKHSSWNGLTVADLVFPWFVFIMGTSIFLSMTSILQ-RGCSKFRLLGK 270

Query:   138 IIFRTLKLLFWGI-ILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTT 196
             I +R+  L+  GI ++   Y   P  LS+    +  R  G+LQR+   Y VVA++E L  
Sbjct:   271 IAWRSFLLICIGIFVVNPKYCLGP--LSW----EKARIPGVLQRLGATYFVVAVLELLFA 324

Query:   197 KRRPNVL--EPRHLSIF--TAYQW-QWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVK 251
             K  P     E    S+   TA  W QW+   I   +++  T+ L VP          G+ 
Sbjct:   325 KPVPETCASERSCFSLLDITA-SWPQWLFVLILEGVWLALTFFLPVPGCPTGYLGPGGIG 383

Query:   252 KYIVKCGMRGHLGPACNAVGYVDRELWGINHLYSDP 287
                   G R +      A GYVDR L G  HLY  P
Sbjct:   384 D-----GGR-YRNCTGGAAGYVDRLLLGDQHLYQHP 413

 Score = 187 (70.9 bits), Expect = 3.6e-31, Sum P(2) = 3.6e-31
 Identities = 64/224 (28%), Positives = 105/224 (46%)

Query:   269 AVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLREDAPSWCRAPFEPEGLLSTISA 328
             A GYVDR L G  HLY  P             +S  L           ++PEG+L TI++
Sbjct:   395 AAGYVDRLLLGDQHLYQHP-------------SSAVLYHT-----EVAYDPEGILGTINS 436

Query:   329 ILSGTIGIHYGHVLIHFK----GHSARLKHWVSMGFGLLIIAIILHFTNA--IPINKQLY 382
             I+   +G+  G +L+++K    G   R   W  +  GL+ +A+     N   IP+NK L+
Sbjct:   437 IVMAFLGVQAGKILLYYKDQTRGILIRFAAWGCL-LGLVSVALTKASENEGFIPVNKNLW 495

Query:   383 SFSYVCFTAGAAGIVFSALYVLMDVWELRTPFLFLKWIGMNAMLVFVLGAQGILAGFVNG 442
             S SYV   +  A ++  ALY ++DV  L T   F  + GMN++LV+V G +     F   
Sbjct:   496 SISYVTTLSSLAFLILLALYPVVDVKGLWTGAPFF-YPGMNSILVYV-GHEVFANYFPFQ 553

Query:   443 WYYKNPDNTLVNWIQNHLFIHVWNSERLGTLLYVIFAEITFWGV 486
             W   +  +   + +QN +   +W       + + ++ +  FW +
Sbjct:   554 WKLGDQQSHKEHLVQNMVATALWV-----LIAFALYKKKVFWKI 592

 Score = 67 (28.6 bits), Expect = 1.2e-18, Sum P(2) = 1.2e-18
 Identities = 40/187 (21%), Positives = 83/187 (44%)

Query:   317 FEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIP 376
             ++PEG+L TI++I+   +G+  G +L+++K  +  +    +  +G L+  + +  T A  
Sbjct:   425 YDPEGILGTINSIVMAFLGVQAGKILLYYKDQTRGILIRFA-AWGCLLGLVSVALTKASE 483

Query:   377 INKQLYSFSYVCFTAGAAGIVFS-ALYVLMDVWELRTPFLFLKWIGMNAMLVFVLGAQGI 435
              N+     +   ++      + S A  +L+ ++    P + +K +   A   F  G   I
Sbjct:   484 -NEGFIPVNKNLWSISYVTTLSSLAFLILLALY----PVVDVKGLWTGAPF-FYPGMNSI 537

Query:   436 LAGFVNGWYYKN--PDNTLVNWIQNHLFIHVWNSERLGTLLYVIFAEITFWGVVAGILHR 493
             L  +V    + N  P    +   Q+H    V N   + T L         W ++A  L++
Sbjct:   538 LV-YVGHEVFANYFPFQWKLGDQQSHKEHLVQNM--VATAL---------WVLIAFALYK 585

Query:   494 LGIYWKL 500
               ++WK+
Sbjct:   586 KKVFWKI 592


>UNIPROTKB|Q68CP4 [details] [associations]
            symbol:HGSNAT "Heparan-alpha-glucosaminide
            N-acetyltransferase" species:9606 "Homo sapiens" [GO:0016021
            "integral to membrane" evidence=IEA] [GO:0015019
            "heparan-alpha-glucosaminide N-acetyltransferase activity"
            evidence=IEA] [GO:0051259 "protein oligomerization" evidence=IDA]
            [GO:0005765 "lysosomal membrane" evidence=IDA;TAS] [GO:0007041
            "lysosomal transport" evidence=IDA] [GO:0016746 "transferase
            activity, transferring acyl groups" evidence=IDA] [GO:0005975
            "carbohydrate metabolic process" evidence=TAS] [GO:0006027
            "glycosaminoglycan catabolic process" evidence=TAS] [GO:0030203
            "glycosaminoglycan metabolic process" evidence=TAS] [GO:0044281
            "small molecule metabolic process" evidence=TAS]
            Reactome:REACT_111217 GO:GO:0051259 GO:GO:0016021
            Reactome:REACT_116125 GO:GO:0005765 GO:GO:0044281 GO:GO:0005975
            GO:GO:0016746 GO:GO:0006027 GO:GO:0007041 EMBL:AC113191
            EMBL:AK304441 EMBL:CR749838 IPI:IPI00739149 IPI:IPI00908672
            RefSeq:NP_689632.2 UniGene:Hs.600384 ProteinModelPortal:Q68CP4
            IntAct:Q68CP4 STRING:Q68CP4 PhosphoSite:Q68CP4 DMDM:124007195
            PaxDb:Q68CP4 PRIDE:Q68CP4 Ensembl:ENST00000379644
            Ensembl:ENST00000458501 GeneID:138050 KEGG:hsa:138050
            UCSC:uc003xpx.4 CTD:138050 GeneCards:GC08P042995 H-InvDB:HIX0007487
            HGNC:HGNC:26527 HPA:HPA029578 MIM:252930 MIM:610453
            neXtProt:NX_Q68CP4 Orphanet:79271 PharmGKB:PA162390851
            eggNOG:COG4299 HOGENOM:HOG000006803 HOVERGEN:HBG081599
            InParanoid:Q68CP4 KO:K10532 OrthoDB:EOG4548Z7 BRENDA:2.3.1.78
            SABIO-RK:Q68CP4 ChiTaRS:HGSNAT GenomeRNAi:138050 NextBio:83735
            ArrayExpress:Q68CP4 Bgee:Q68CP4 CleanEx:HS_HGSNAT
            Genevestigator:Q68CP4 GO:GO:0015019 InterPro:IPR012429 Pfam:PF07786
            Uniprot:Q68CP4
        Length = 663

 Score = 232 (86.7 bits), Expect = 7.9e-30, Sum P(2) = 7.9e-30
 Identities = 78/239 (32%), Positives = 119/239 (49%)

Query:    62 RVATLDAFRGLTVVWVYTQLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAI 121
             R+ ++D FRG+ ++     LM+ V+  GG Y    H+ WNG T+AD V P+F+FI+G +I
Sbjct:   267 RLRSVDTFRGIALI-----LMVFVNYGGGKYWYFKHASWNGLTVADLVFPWFVFIMGSSI 321

Query:   122 ALALKKVPKINGAVK-----KIIFRTLKLLFWGIILQG-GYSHAPDALSYGVDMKHIRWC 175
              L++  + +  G  K     KI +R+  L+  GII+    Y   P  LS+      +R  
Sbjct:   322 FLSMTSILQ-RGCSKFRLLGKIAWRSFLLICIGIIIVNPNYCLGP--LSWD----KVRIP 374

Query:   176 GILQRIALVYVVVALIETLTTKRRPN--VLEPRHLSI--FTAYQW-QWIGGFIAFVIYII 230
             G+LQR+ + Y VVA++E L  K  P     E   LS+   T+  W QW+   +   +++ 
Sbjct:   375 GVLQRLGVTYFVVAVLELLFAKPVPEHCASERSCLSLRDITS-SWPQWLLILVLEGLWLG 433

Query:   231 TTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLGPACN--AVGYVDRELWGINHLYSDP 287
              T+ L VP          G+  +       G   P C   A GY+DR L G +HLY  P
Sbjct:   434 LTFLLPVPGCPTGYLGPGGIGDF-------GKY-PNCTGGAAGYIDRLLLGDDHLYQHP 484

 Score = 175 (66.7 bits), Expect = 7.9e-30, Sum P(2) = 7.9e-30
 Identities = 51/178 (28%), Positives = 90/178 (50%)

Query:   317 FEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSA----RLKHWVSMGFGLLIIAIILHFT 372
             ++PEG+L TI++I+   +G+  G +L+++K  +     R   W  +  GL+ +A+     
Sbjct:   496 YDPEGILGTINSIVMAFLGVQAGKILLYYKARTKDILIRFTAWCCI-LGLISVALTKVSE 554

Query:   373 NA--IPINKQLYSFSYVCFTAGAAGIVFSALYVLMDVWEL--RTPFLFLKWIGMNAMLVF 428
             N   IP+NK L+S SYV   +  A  +   LY ++DV  L   TPF +    GMN++LV+
Sbjct:   555 NEGFIPVNKNLWSLSYVTTLSSFAFFILLVLYPVVDVKGLWTGTPFFYP---GMNSILVY 611

Query:   429 VLGAQGILAGFVNGWYYKNPDNTLVNWIQNHLFIHVWNSERLGTLLYVIFAEITFWGV 486
             V G +     F   W  K+  +   +  QN +   +W       + Y+++ +  FW +
Sbjct:   612 V-GHEVFENYFPFQWKLKDNQSHKEHLTQNIVATALWV-----LIAYILYRKKIFWKI 663


>DICTYBASE|DDB_G0286315 [details] [associations]
            symbol:DDB_G0286315 "transmembrane protein"
            species:44689 "Dictyostelium discoideum" [GO:0008150
            "biological_process" evidence=ND] [GO:0003674 "molecular_function"
            evidence=ND] [GO:0016021 "integral to membrane" evidence=IEA]
            dictyBase:DDB_G0286315 GO:GO:0016021 EMBL:AAFI02000085
            eggNOG:COG4299 KO:K10532 RefSeq:XP_637852.1
            EnsemblProtists:DDB0234045 GeneID:8625566 KEGG:ddi:DDB_G0286315
            InParanoid:Q54LX9 OMA:SITIMIF Uniprot:Q54LX9
        Length = 675

 Score = 208 (78.3 bits), Expect = 1.0e-22, Sum P(3) = 1.0e-22
 Identities = 54/134 (40%), Positives = 79/134 (58%)

Query:    59 KSKRVATLDAFRGLTVVWVYTQLMILVDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVG 118
             K  R+ +LD FRG ++      +MI V+  GG Y   +HS WNG T+AD V P+F+FI+G
Sbjct:   203 KKDRLRSLDVFRGFSIT-----IMIFVNYGGGGYWFFNHSLWNGLTVADLVFPWFVFIMG 257

Query:   119 VAIALALKKVPKINGAVKKIIFRTLKLLFWGIILQG-GYSHAPDALSYGVDMKHIRWCGI 177
             +A+ L+   + K  G  K+IIF+  KLL   IIL   G       ++ GVD++  R  G+
Sbjct:   258 IAMPLSFHAMEK-RGTPKRIIFQ--KLLRRSIILFALGLF-----INNGVDLQQWRILGV 309

Query:   178 LQRIALVYVVVALI 191
             LQR ++ Y+VV  I
Sbjct:   310 LQRFSISYLVVGSI 323

 Score = 137 (53.3 bits), Expect = 1.0e-22, Sum P(3) = 1.0e-22
 Identities = 31/118 (26%), Positives = 66/118 (55%)

Query:   317 FEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAI-ILHFTNA- 374
             ++PEG +  +++I    IG+  G +++ +K + +RL  W+     L  IA  +   T   
Sbjct:   510 YDPEGTVGYLTSIFLCFIGVQAGRIILTYKSNRSRLIRWMVWSVVLCGIAAGLCGLTQNQ 569

Query:   375 --IPINKQLYSFSYVCFTAGAAGIVFSALYVLMDVWEL--RTPFLFLKWIGMNAMLVF 428
               +P+NK L+S S++   AG    V + +++L+D+ ++   +PF++   +GMN + ++
Sbjct:   570 GWLPVNKNLWSPSFILLMAGFGFFVLTVMFILIDIKKIWNGSPFIY---VGMNPITIY 624

 Score = 48 (22.0 bits), Expect = 5.3e-06, Sum P(2) = 5.3e-06
 Identities = 11/30 (36%), Positives = 17/30 (56%)

Query:   215 QWQWIGGFIAFVI-YIIT-TYSLYVPNWSF 242
             QW+ +G    F I Y++  +  L+VP W F
Sbjct:   303 QWRILGVLQRFSISYLVVGSIMLFVPIWKF 332

 Score = 37 (18.1 bits), Expect = 1.0e-22, Sum P(3) = 1.0e-22
 Identities = 8/33 (24%), Positives = 15/33 (45%)

Query:   206 RHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVP 238
             ++ S    Y  QW+   I F  + +  + + VP
Sbjct:   424 KYFSDIAPYWIQWVFALIIFSGWFLLMFLVPVP 456


>DICTYBASE|DDB_G0270192 [details] [associations]
            symbol:DDB_G0270192 "DUF1624 family protein"
            species:44689 "Dictyostelium discoideum" [GO:0008150
            "biological_process" evidence=ND] [GO:0003674 "molecular_function"
            evidence=ND] [GO:0044351 "macropinocytosis" evidence=RCA]
            dictyBase:DDB_G0270192 EMBL:AAFI02000005 eggNOG:COG4299
            InterPro:IPR012429 Pfam:PF07786 RefSeq:XP_646608.1 STRING:Q55C73
            EnsemblProtists:DDB0190869 GeneID:8617580 KEGG:ddi:DDB_G0270192
            InParanoid:Q55C73 OMA:IRIYGIL Uniprot:Q55C73
        Length = 426

 Score = 175 (66.7 bits), Expect = 6.8e-22, Sum P(2) = 6.8e-22
 Identities = 55/203 (27%), Positives = 105/203 (51%)

Query:   318 EPEGLLSTISAILSGTIGIHYGHVLIHFK-----GHSARLKHWVSMGFGLLIIAIILHFT 372
             +PEGL+ST+S+ ++  +G+ +G +   F      G++  +  W+ +    ++ AI L  T
Sbjct:   232 DPEGLISTMSSFITAWMGLEFGRIFTRFYKKHDFGNTDIIVRWILLVILFMVPAISLGAT 291

Query:   373 NAIPINKQLYSFSYVCFTAGAAGIVFSALYVLMDV--WE-LRTPFLF---------LKWI 420
               +P NK+++SFS+  FT GA+G +    ++L+DV  WE L+   +          +KWI
Sbjct:   292 -VMPFNKKIWSFSFALFTVGASGSLILIAFILIDVIDWESLKCEKVRKIIDLIIKPMKWI 350

Query:   421 GMNAMLVFVLGAQGILAGFVNGWYYKNPDNTLVNWIQNHLFIHV-WNSERLGTLLYVIFA 479
             G N + ++ L    +    +  +Y     N+L  W+Q +  +++ W     G L   +F+
Sbjct:   351 GQNPITIYSLM---VFIEIILMYYINVGSNSL--WVQIYEKMYLSWLKN--GYLASTVFS 403

Query:   480 E--ITFWGVVAGILHRLGIYWKL 500
                + F+ ++A I+ R  I+ KL
Sbjct:   404 IGWLIFFILIAYIMQRNKIFIKL 426

 Score = 154 (59.3 bits), Expect = 6.8e-22, Sum P(2) = 6.8e-22
 Identities = 47/127 (37%), Positives = 65/127 (51%)

Query:    61 KRVATLDAFRGLTVVWVYTQLMILVDDAGG--AYARIDHSPWNGCTLADFVMPFFLFIVG 118
             +R+ +LDA RGLT+       MILVD+  G      ++ + WNG + AD + P F+FI G
Sbjct:    44 RRMGSLDAVRGLTIFG-----MILVDNQAGNDVIWPLNETEWNGLSTADLIFPSFIFISG 98

Query:   119 VAIALALKKVPKINGAVKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGIL 178
              +IALALK           II RTL L F    +Q   +   D  ++       R  G+L
Sbjct:    99 FSIALALKNSKNTTSTWYGIIRRTLLLFF----IQCFLNLMGDHFNFTT----FRIMGVL 150

Query:   179 QRIALVY 185
             QRIA+ Y
Sbjct:   151 QRIAICY 157

 Score = 81 (33.6 bits), Expect = 2.7e-14, Sum P(2) = 2.7e-14
 Identities = 39/146 (26%), Positives = 65/146 (44%)

Query:   142 TLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETL------T 195
             T  L+F   I   G+S A  AL    +     W GI++R  L++ +   +  +      T
Sbjct:    85 TADLIFPSFIFISGFSIAL-ALKNSKNTTST-WYGIIRRTLLLFFIQCFLNLMGDHFNFT 142

Query:   196 TKRRPNVLEPRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIV 255
             T R   VL+     I   Y +  +  F+ F I++   + L V     S    + +   + 
Sbjct:   143 TFRIMGVLQ----RIAICYFFSCLS-FLCFPIFLQRLFLLSVTVTYISIM--YALN--VP 193

Query:   256 KCGMRGHLGPACNAVGYVDRELWGIN 281
             KCG R +L   CNA  Y+D +++G+N
Sbjct:   194 KCG-RANLTQNCNAGAYIDSKVFGLN 218

 Score = 42 (19.8 bits), Expect = 7.1e-08, Sum P(2) = 7.1e-08
 Identities = 10/34 (29%), Positives = 18/34 (52%)

Query:   440 VNGWYYKNPDNTLVNWIQNHLFIHVWNSERLGTL 473
             +NG YY +P+  L++ + +  FI  W     G +
Sbjct:   225 LNGPYYNDPEG-LISTMSS--FITAWMGLEFGRI 255


>UNIPROTKB|F1NBK1 [details] [associations]
            symbol:HGSNAT "Uncharacterized protein" species:9031
            "Gallus gallus" [GO:0005765 "lysosomal membrane" evidence=IEA]
            [GO:0007041 "lysosomal transport" evidence=IEA] [GO:0016746
            "transferase activity, transferring acyl groups" evidence=IEA]
            [GO:0051259 "protein oligomerization" evidence=IEA] GO:GO:0051259
            GO:GO:0005765 GO:GO:0016746 GO:GO:0007041 InterPro:IPR012429
            Pfam:PF07786 GeneTree:ENSGT00390000001491 OMA:KHSSWNG
            EMBL:AADN02016166 EMBL:AADN02016165 IPI:IPI00595110
            Ensembl:ENSGALT00000016483 Uniprot:F1NBK1
        Length = 584

 Score = 262 (97.3 bits), Expect = 1.4e-20, Sum P(2) = 1.4e-20
 Identities = 82/286 (28%), Positives = 135/286 (47%)

Query:    25 KDSENGINKEKGLERSEVQDEQKGEXXXXXXXXXKSKRVATLDAFRGLTVVWVYTQLMIL 84
             ++++  IN E G       D    +           +R+ +LD FRGL+++     +M+ 
Sbjct:   152 RETDRLINSELG--SPSTTDSPSSDPSPRLWRATSRQRLRSLDTFRGLSLI-----IMVF 204

Query:    85 VDDAGGAYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKKVPKINGAVKKIIFRTLK 144
             V+  GG Y    H  WNG T+AD V P+F+FI+G +I+L+L    +   + +K++++ L 
Sbjct:   205 VNYGGGKYWFFKHESWNGLTVADLVFPWFVFIMGTSISLSLSSTLRWGSSKQKVLWKILW 264

Query:   145 LLFWGIILQGGYSHAPDALSYGVDMKHIRWCGILQRIALVYVVVALIETLTTKRRPN--V 202
               F  +IL G     P+     +  +++R  G+LQR+ L Y+VVA +E L T+   +   
Sbjct:   265 RSFL-LILLGVIVVNPNYCLGALSWENLRIPGVLQRLGLTYLVVAALELLFTRTGADSGT 323

Query:   203 LE---PRHLSIFTAYQWQWIGGFIAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGM 259
             LE   P    I   +  QWI   +  VI++  T+ L VP          G+  +    G 
Sbjct:   324 LEMSCPALQDILPFWP-QWIFILMLEVIWLCLTFLLPVPGCPRGYLGPGGIGDF----G- 377

Query:   260 RGHLGPACNAVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPL 305
               +L     A GY+DR + G  H+Y  P  + L   T+     G L
Sbjct:   378 -NYLNCTGGAAGYIDRLVLGEKHIYQHPSCNVLYQTTVPYDPEGIL 422

 Score = 159 (61.0 bits), Expect = 6.8e-09, Sum P(2) = 6.8e-09
 Identities = 72/249 (28%), Positives = 112/249 (44%)

Query:   260 RGHLGPA----------CN--AVGYVDRELWGINHLYSDPVWSRLEACTLSSPNSGPLRE 307
             RG+LGP           C   A GY+DR + G  H+Y  P  + L   T+          
Sbjct:   365 RGYLGPGGIGDFGNYLNCTGGAAGYIDRLVLGEKHIYQHPSCNVLYQTTV---------- 414

Query:   308 DAPSWCRAPFEPEGLLSTISAILSGTIGIHYGHVL-IHFKGHSAR-LKH---WVSMGFGL 362
                     P++PEG+L TI+ IL   +G+       + + G S   L H   WVS+  G+
Sbjct:   415 --------PYDPEGILGTINTILMAFLGLQVPLFFSVCYMGKSEGILPHSLRWVSVQ-GI 465

Query:   363 LIIAIILHFTNA---IPINKQLYSFSYVCFTAGAAGIVFSALYVLMDVWEL--RTPFLFL 417
              I AI+   +     IPINK L+S SYV   +  A I+   +Y L+DV  L   TPF + 
Sbjct:   466 -IFAILTKCSKEEGFIPINKNLWSTSYVTTMSCFAFILLLLMYYLVDVKRLWSGTPFFYP 524

Query:   418 KWIGMNAMLVFVLGAQGILAGFVNGWYYKNPDNTLVNWIQNHLFIHVWNSERLGTLLYVI 477
                GMN++LV++ G +     F   W  ++  +   +  QN     +W       + Y++
Sbjct:   525 ---GMNSILVYI-GHEVFENYFPFKWKMQDSQSHAEHLTQNLTATTLWV-----IISYLL 575

Query:   478 FAEITFWGV 486
             + +  FW +
Sbjct:   576 YRKKIFWKI 584

 Score = 51 (23.0 bits), Expect = 1.4e-20, Sum P(2) = 1.4e-20
 Identities = 7/19 (36%), Positives = 14/19 (73%)

Query:   482 TFWGVVAGILHRLGIYWKL 500
             T W +++ +L+R  I+WK+
Sbjct:   566 TLWVIISYLLYRKKIFWKI 584


>UNIPROTKB|Q8EBK9 [details] [associations]
            symbol:nagX "Uncharacterized protein" species:211586
            "Shewanella oneidensis MR-1" [GO:0003674 "molecular_function"
            evidence=ND] [GO:0005575 "cellular_component" evidence=ND]
            [GO:0008150 "biological_process" evidence=ND] EMBL:AE014299
            GenomeReviews:AE014299_GR HOGENOM:HOG000295496 RefSeq:NP_719051.1
            DNASU:1171178 GeneID:1171178 KEGG:son:SO_3504 PATRIC:23526700
            OMA:FVGHFIV ProtClustDB:CLSK907194 Uniprot:Q8EBK9
        Length = 395

 Score = 159 (61.0 bits), Expect = 1.7e-20, Sum P(2) = 1.7e-20
 Identities = 54/186 (29%), Positives = 90/186 (48%)

Query:    62 RVATLDAFRGLTVVWV------YTQLMILVDDAGGAYA--RIDHSPWNGCTLADFVMPFF 113
             R+ +LDA RG  + W+      +  L+I    AG  +   ++ HS W+G  L D + P F
Sbjct:    29 RLMSLDALRGFDMFWILGGEALFGALLIFTGWAGWQWGDTQMHHSEWHGFRLYDLIFPLF 88

Query:   114 LFIVGVAIALALKKVPKINGAVKKIIFRT-LKLLFWGIILQGGYSHAPDALSYGVDMKHI 172
             +F+ GVA+ L+ K++ K+    +  ++R  +K LF  ++L   Y+H        VD   I
Sbjct:    89 IFLSGVALGLSPKRLDKLPLHERLPVYRHGVKRLFLLLLLGILYNHGWGT-GAPVDPDKI 147

Query:   173 RWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQ-WQWIGGFIAFVIYIIT 231
             R+  +L RIA  +   AL+   T+ R   ++    L  + A Q W    G  A V+    
Sbjct:   148 RYASVLGRIAFAWFFAALLVWHTSLRTQVLVAVGILVGYGAMQLWLPFPGGQAGVLSPTV 207

Query:   232 TYSLYV 237
             + + YV
Sbjct:   208 SINAYV 213

 Score = 157 (60.3 bits), Expect = 1.7e-20, Sum P(2) = 1.7e-20
 Identities = 39/120 (32%), Positives = 73/120 (60%)

Query:   314 RAPFEPEGLLSTISAILSGTIGIHYGHVLI--HFKGHSARLKHWVSMGFGLLIIAIILHF 371
             R P +PEG+LST+ A+++   G+  GH ++  H KG  A++    + G   L +  +L  
Sbjct:   226 RMP-DPEGVLSTLPAVVNALAGVFVGHFIVKSHPKGEWAKVGLLGAAGGVCLALGWLLDA 284

Query:   372 TNAIPINKQLYSFSYVCFTAGAAGIVFSALYVLMDV--WELRTPFLFLKWIGMNAMLVFV 429
                IP+NK+L++ S+V  T+G + ++ +  Y L+DV  W+ +  F+F+  IG NA+++++
Sbjct:   285 V--IPVNKELWTSSFVLVTSGWSMLLLALFYALVDVLKWQ-KLVFVFVV-IGTNAIIIYL 340

 Score = 49 (22.3 bits), Expect = 2.7e-09, Sum P(2) = 2.7e-09
 Identities = 14/44 (31%), Positives = 21/44 (47%)

Query:   256 KCGMRGHLGPACNAVGY-------VDRELWGINHLYSDPVWSRL 292
             K G+ G  G  C A+G+       V++ELW  + +     WS L
Sbjct:   264 KVGLLGAAGGVCLALGWLLDAVIPVNKELWTSSFVLVTSGWSML 307


>TIGR_CMR|SO_3504 [details] [associations]
            symbol:SO_3504 "conserved hypothetical protein"
            species:211586 "Shewanella oneidensis MR-1" [GO:0008150
            "biological_process" evidence=ND] [GO:0003674 "molecular_function"
            evidence=ND] [GO:0005575 "cellular_component" evidence=ND]
            EMBL:AE014299 GenomeReviews:AE014299_GR HOGENOM:HOG000295496
            RefSeq:NP_719051.1 DNASU:1171178 GeneID:1171178 KEGG:son:SO_3504
            PATRIC:23526700 OMA:FVGHFIV ProtClustDB:CLSK907194 Uniprot:Q8EBK9
        Length = 395

 Score = 159 (61.0 bits), Expect = 1.7e-20, Sum P(2) = 1.7e-20
 Identities = 54/186 (29%), Positives = 90/186 (48%)

Query:    62 RVATLDAFRGLTVVWV------YTQLMILVDDAGGAYA--RIDHSPWNGCTLADFVMPFF 113
             R+ +LDA RG  + W+      +  L+I    AG  +   ++ HS W+G  L D + P F
Sbjct:    29 RLMSLDALRGFDMFWILGGEALFGALLIFTGWAGWQWGDTQMHHSEWHGFRLYDLIFPLF 88

Query:   114 LFIVGVAIALALKKVPKINGAVKKIIFRT-LKLLFWGIILQGGYSHAPDALSYGVDMKHI 172
             +F+ GVA+ L+ K++ K+    +  ++R  +K LF  ++L   Y+H        VD   I
Sbjct:    89 IFLSGVALGLSPKRLDKLPLHERLPVYRHGVKRLFLLLLLGILYNHGWGT-GAPVDPDKI 147

Query:   173 RWCGILQRIALVYVVVALIETLTTKRRPNVLEPRHLSIFTAYQ-WQWIGGFIAFVIYIIT 231
             R+  +L RIA  +   AL+   T+ R   ++    L  + A Q W    G  A V+    
Sbjct:   148 RYASVLGRIAFAWFFAALLVWHTSLRTQVLVAVGILVGYGAMQLWLPFPGGQAGVLSPTV 207

Query:   232 TYSLYV 237
             + + YV
Sbjct:   208 SINAYV 213

 Score = 157 (60.3 bits), Expect = 1.7e-20, Sum P(2) = 1.7e-20
 Identities = 39/120 (32%), Positives = 73/120 (60%)

Query:   314 RAPFEPEGLLSTISAILSGTIGIHYGHVLI--HFKGHSARLKHWVSMGFGLLIIAIILHF 371
             R P +PEG+LST+ A+++   G+  GH ++  H KG  A++    + G   L +  +L  
Sbjct:   226 RMP-DPEGVLSTLPAVVNALAGVFVGHFIVKSHPKGEWAKVGLLGAAGGVCLALGWLLDA 284

Query:   372 TNAIPINKQLYSFSYVCFTAGAAGIVFSALYVLMDV--WELRTPFLFLKWIGMNAMLVFV 429
                IP+NK+L++ S+V  T+G + ++ +  Y L+DV  W+ +  F+F+  IG NA+++++
Sbjct:   285 V--IPVNKELWTSSFVLVTSGWSMLLLALFYALVDVLKWQ-KLVFVFVV-IGTNAIIIYL 340

 Score = 49 (22.3 bits), Expect = 2.7e-09, Sum P(2) = 2.7e-09
 Identities = 14/44 (31%), Positives = 21/44 (47%)

Query:   256 KCGMRGHLGPACNAVGY-------VDRELWGINHLYSDPVWSRL 292
             K G+ G  G  C A+G+       V++ELW  + +     WS L
Sbjct:   264 KVGLLGAAGGVCLALGWLLDAVIPVNKELWTSSFVLVTSGWSML 307


>UNIPROTKB|Q489U3 [details] [associations]
            symbol:CPS_0413 "Putative membrane protein" species:167879
            "Colwellia psychrerythraea 34H" [GO:0003674 "molecular_function"
            evidence=ND] [GO:0008150 "biological_process" evidence=ND]
            EMBL:CP000083 GenomeReviews:CP000083_GR eggNOG:COG4299
            InterPro:IPR012429 Pfam:PF07786 OMA:IRIYGIL RefSeq:YP_267171.1
            STRING:Q489U3 DNASU:3518441 GeneID:3518441 KEGG:cps:CPS_0413
            PATRIC:21464187 HOGENOM:HOG000295496
            BioCyc:CPSY167879:GI48-508-MONOMER Uniprot:Q489U3
        Length = 358

 Score = 175 (66.7 bits), Expect = 2.3e-20, Sum P(3) = 2.3e-20
 Identities = 53/143 (37%), Positives = 74/143 (51%)

Query:    62 RVATLDAFRGLTVVWVYTQLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVG 118
             R   LDAFRG+T+      LMILV+  G     YA + H+ W+G T  D V PFFLFI+G
Sbjct:     3 RYLALDAFRGITIA-----LMILVNTPGTWSHVYAPLLHAEWDGATPTDLVFPFFLFIIG 57

Query:   119 VAIALALKKVPKINGA---VKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWC 175
              A+  + KK    + +    +KII R   + F G +L        + + + V+ +  R  
Sbjct:    58 SAMFFSFKK-SNFSASPEQFRKIIKRGFIMFFIGFML--------NVIPFTVNAEDWRIM 108

Query:   176 GILQRIALVYVVVALIETLTTKR 198
             GILQRI + Y V A +  LT  R
Sbjct:   109 GILQRIGIAYTVAACL-VLTLNR 130

 Score = 132 (51.5 bits), Expect = 2.3e-20, Sum P(3) = 2.3e-20
 Identities = 51/187 (27%), Positives = 94/187 (50%)

Query:   317 FEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIP 376
             FEPEGLLSTI AI++  +G      L   +   + +     +G GL +    L +   +P
Sbjct:   184 FEPEGLLSTIPAIVNMLLGFELTRYLTSIEDKRSSVIKLTLIG-GLAVGFGAL-WGLVLP 241

Query:   377 INKQLYSFSYVCFTAGAAGIVFSALYVLMDVWE---LRTPFLFLKWIGMNAMLVFVLGAQ 433
             INK L++ SYV ++ G A ++ +A   L+D+ +   L  P L     G N + V+VL   
Sbjct:   242 INKSLWTPSYVIYSTGFACLLLAAFIWLIDIMKQVKLAEPLLVY---GTNPLFVYVLSFL 298

Query:   434 GILAGFVNGWYYKNPDNTLVNWIQNHLFIHVWNSERLGTLLYVIFAEITFWGVVAGILHR 493
              ++  ++N       D ++  W+   L   V+ + +L + ++  F+ + F+  V+  L++
Sbjct:   299 -VVTMYLN---INVGDVSMYAWLYKQLS-GVF-TPKLASFIFA-FSHVAFFWYVSLKLYQ 351

Query:   494 LGIYWKL 500
               I+ K+
Sbjct:   352 RKIFIKI 358

 Score = 40 (19.1 bits), Expect = 2.3e-20, Sum P(3) = 2.3e-20
 Identities = 6/18 (33%), Positives = 12/18 (66%)

Query:   268 NAVGYVDRELWGINHLYS 285
             N +  +D  ++G NH+Y+
Sbjct:   161 NIIRQLDLAVFGANHMYT 178

 Score = 40 (19.1 bits), Expect = 6.4e-06, Sum P(3) = 6.4e-06
 Identities = 10/39 (25%), Positives = 18/39 (46%)

Query:   117 VGVAIALALKKVPKINGAVKKIIFRTLKLLFWGIILQGG 155
             +G+A  +A   V  +N     I    + L +W ++L  G
Sbjct:   114 IGIAYTVAACLVLTLNRTGVFIASAVILLAYWALLLSMG 152


>TIGR_CMR|CPS_0413 [details] [associations]
            symbol:CPS_0413 "putative membrane protein" species:167879
            "Colwellia psychrerythraea 34H" [GO:0003674 "molecular_function"
            evidence=ND] [GO:0008150 "biological_process" evidence=ND]
            [GO:0016020 "membrane" evidence=ISS] EMBL:CP000083
            GenomeReviews:CP000083_GR eggNOG:COG4299 InterPro:IPR012429
            Pfam:PF07786 OMA:IRIYGIL RefSeq:YP_267171.1 STRING:Q489U3
            DNASU:3518441 GeneID:3518441 KEGG:cps:CPS_0413 PATRIC:21464187
            HOGENOM:HOG000295496 BioCyc:CPSY167879:GI48-508-MONOMER
            Uniprot:Q489U3
        Length = 358

 Score = 175 (66.7 bits), Expect = 2.3e-20, Sum P(3) = 2.3e-20
 Identities = 53/143 (37%), Positives = 74/143 (51%)

Query:    62 RVATLDAFRGLTVVWVYTQLMILVDDAGG---AYARIDHSPWNGCTLADFVMPFFLFIVG 118
             R   LDAFRG+T+      LMILV+  G     YA + H+ W+G T  D V PFFLFI+G
Sbjct:     3 RYLALDAFRGITIA-----LMILVNTPGTWSHVYAPLLHAEWDGATPTDLVFPFFLFIIG 57

Query:   119 VAIALALKKVPKINGA---VKKIIFRTLKLLFWGIILQGGYSHAPDALSYGVDMKHIRWC 175
              A+  + KK    + +    +KII R   + F G +L        + + + V+ +  R  
Sbjct:    58 SAMFFSFKK-SNFSASPEQFRKIIKRGFIMFFIGFML--------NVIPFTVNAEDWRIM 108

Query:   176 GILQRIALVYVVVALIETLTTKR 198
             GILQRI + Y V A +  LT  R
Sbjct:   109 GILQRIGIAYTVAACL-VLTLNR 130

 Score = 132 (51.5 bits), Expect = 2.3e-20, Sum P(3) = 2.3e-20
 Identities = 51/187 (27%), Positives = 94/187 (50%)

Query:   317 FEPEGLLSTISAILSGTIGIHYGHVLIHFKGHSARLKHWVSMGFGLLIIAIILHFTNAIP 376
             FEPEGLLSTI AI++  +G      L   +   + +     +G GL +    L +   +P
Sbjct:   184 FEPEGLLSTIPAIVNMLLGFELTRYLTSIEDKRSSVIKLTLIG-GLAVGFGAL-WGLVLP 241

Query:   377 INKQLYSFSYVCFTAGAAGIVFSALYVLMDVWE---LRTPFLFLKWIGMNAMLVFVLGAQ 433
             INK L++ SYV ++ G A ++ +A   L+D+ +   L  P L     G N + V+VL   
Sbjct:   242 INKSLWTPSYVIYSTGFACLLLAAFIWLIDIMKQVKLAEPLLVY---GTNPLFVYVLSFL 298

Query:   434 GILAGFVNGWYYKNPDNTLVNWIQNHLFIHVWNSERLGTLLYVIFAEITFWGVVAGILHR 493
              ++  ++N       D ++  W+   L   V+ + +L + ++  F+ + F+  V+  L++
Sbjct:   299 -VVTMYLN---INVGDVSMYAWLYKQLS-GVF-TPKLASFIFA-FSHVAFFWYVSLKLYQ 351

Query:   494 LGIYWKL 500
               I+ K+
Sbjct:   352 RKIFIKI 358

 Score = 40 (19.1 bits), Expect = 2.3e-20, Sum P(3) = 2.3e-20
 Identities = 6/18 (33%), Positives = 12/18 (66%)

Query:   268 NAVGYVDRELWGINHLYS 285
             N +  +D  ++G NH+Y+
Sbjct:   161 NIIRQLDLAVFGANHMYT 178

 Score = 40 (19.1 bits), Expect = 6.4e-06, Sum P(3) = 6.4e-06
 Identities = 10/39 (25%), Positives = 18/39 (46%)

Query:   117 VGVAIALALKKVPKINGAVKKIIFRTLKLLFWGIILQGG 155
             +G+A  +A   V  +N     I    + L +W ++L  G
Sbjct:   114 IGIAYTVAACLVLTLNRTGVFIASAVILLAYWALLLSMG 152


>UNIPROTKB|F1SE48 [details] [associations]
            symbol:HGSNAT "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0051259 "protein oligomerization" evidence=IEA]
            [GO:0016746 "transferase activity, transferring acyl groups"
            evidence=IEA] [GO:0007041 "lysosomal transport" evidence=IEA]
            [GO:0005765 "lysosomal membrane" evidence=IEA] GO:GO:0051259
            GO:GO:0005765 GO:GO:0016746 GO:GO:0007041
            GeneTree:ENSGT00390000001491 EMBL:CU640485
            Ensembl:ENSSSCT00000007671 OMA:HEVFEEY Uniprot:F1SE48
        Length = 298

 Score = 180 (68.4 bits), Expect = 5.1e-19, Sum P(2) = 5.1e-19
 Identities = 52/181 (28%), Positives = 92/181 (50%)

Query:   314 RAPFEPEGLLSTISAILSGTIGIHYGHVLIHFK----GHSARLKHWVSMGFGLLIIAIIL 369
             +  ++PEG+L TI++IL   +G+  G +L+++K    G   R   W     GL+ +A+  
Sbjct:   128 KVAYDPEGILGTINSILMAYLGVQAGKILLYYKDRTKGILIRFAVWGCF-LGLISVALTK 186

Query:   370 HFTNA--IPINKQLYSFSYVCFTAGAAGIVFSALYVLMDVWEL--RTPFLFLKWIGMNAM 425
                N   IP+NK L+S SYV   + +A ++   LY ++DV  L   TPF +    GMN++
Sbjct:   187 ASENEGFIPVNKNLWSTSYVTTLSSSAFLILLVLYPIVDVKGLWTGTPFFYP---GMNSI 243

Query:   426 LVFVLGAQGILAGFVNGWYYKNPDNTLVNWIQNHLFIHVWNSERLGTLLYVIFAEITFWG 485
             LV+ +G +     F   W   +  +   + +QN +   +W       + YV++ +  FW 
Sbjct:   244 LVY-MGHEVFANYFPFQWRLGDSQSHREHLVQNIVATALWV-----LIAYVLYKKNVFWK 297

Query:   486 V 486
             +
Sbjct:   298 I 298

 Score = 111 (44.1 bits), Expect = 5.1e-19, Sum P(2) = 5.1e-19
 Identities = 41/129 (31%), Positives = 58/129 (44%)

Query:   167 VDMKHIRWCGILQRIALVYVVVALIETLTTKRRPN--VLEPRHLSIF-TAYQW-QWIGGF 222
             V  +  R  G+LQR+ + Y VVA++E L  K  P     E    S+      W QW+   
Sbjct:     1 VSWEKARIPGVLQRLGVTYFVVAVLELLFAKPVPESCASERSCFSLLDVTSSWPQWLFVL 60

Query:   223 IAFVIYIITTYSLYVPNWSFSEHSDHGVKKYIVKCGMRGHLG--PACN--AVGYVDRELW 278
             +   +++  T+ L VP              Y+   G+ G LG  P C   A GY+DR L 
Sbjct:    61 VLEGVWLALTFFLPVPGCPTG---------YLGPGGI-GDLGKYPNCTGGAAGYIDRLLL 110

Query:   279 GINHLYSDP 287
             G +HLY  P
Sbjct:   111 GDDHLYQHP 119


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.326   0.142   0.461    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      500       491   0.00082  119 3  11 22  0.46    33
                                                     35  0.45    37


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  13
  No. of states in DFA:  626 (67 KB)
  Total size of DFA:  330 KB (2163 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  40.55u 0.15s 40.70t   Elapsed:  00:00:02
  Total cpu time:  40.55u 0.15s 40.70t   Elapsed:  00:00:02
  Start:  Sat May 11 15:09:18 2013   End:  Sat May 11 15:09:20 2013

Back to top