BLAST Results

Query Summary

Your job contains 1 sequence.

Parameters
Threshold: 0.001
Maximum number of alignments shown: 100
BLAST filter: on

Query Sequence

>psy8678
MVLLPAPPVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQF
QEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVV
HISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPV
VHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFP
TNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTD
KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTGTDKVKKGGSYLC
NEQYCYRHRCAARSQNTPDSSAGNLGFRCAADVS

High Scoring Gene Products

Symbol, full name Information P value
SUMF1
Uncharacterized protein
protein from Canis lupus familiaris 3.6e-82
Sumf1
sulfatase modifying factor 1
protein from Mus musculus 1.2e-81
SUMF1
Sulfatase-modifying factor 1
protein from Bos taurus 2.0e-81
Sumf1
sulfatase modifying factor 1
gene from Rattus norvegicus 5.2e-81
SUMF1
Uncharacterized protein
protein from Sus scrofa 2.2e-80
SUMF1
Sulfatase-modifying factor 1
protein from Homo sapiens 2.9e-80
sumf1
sulfatase modifying factor 1
gene_product from Danio rerio 8.7e-79
CG7049 protein from Drosophila melanogaster 8.8e-63
MT0739
Conserved protein
protein from Mycobacterium tuberculosis 2.9e-56
SUMF2
Uncharacterized protein
protein from Gallus gallus 1.9e-44
SUMF2
Sulfatase-modifying factor 2
protein from Homo sapiens 5.9e-42
Sumf2
sulfatase modifying factor 2
protein from Mus musculus 3.6e-41
LOC100518241
Uncharacterized protein
protein from Sus scrofa 2.2e-40
SUMF2
Uncharacterized protein
protein from Canis lupus familiaris 2.8e-40
SUMF2
Sulfatase-modifying factor 2
protein from Bos taurus 2.8e-40
Sumf2
sulfatase modifying factor 2
gene from Rattus norvegicus 1.9e-39
sumf2
sulfatase modifying factor 2
gene_product from Danio rerio 1.7e-36
SUMF2
Sulfatase-modifying factor 2
protein from Homo sapiens 7.4e-34
HNE_1666
Uncharacterized protein
protein from Hyphomonas neptunium ATCC 15444 8.5e-33
GSU0897
Protein 3-oxoalanine-generating enzyme family protein
protein from Geobacter sulfurreducens PCA 1.9e-20
GSU_0897
conserved hypothetical protein
protein from Geobacter sulfurreducens PCA 1.9e-20
PSPTO_2154
Uncharacterized protein
protein from Pseudomonas syringae pv. tomato str. DC3000 3.1e-15
PSPPH_1930
Uncharacterized protein
protein from Pseudomonas syringae pv. phaseolicola 1448A 2.1e-14
VC_A1077
Putative uncharacterized protein
protein from Vibrio cholerae O1 biovar El Tor str. N16961 5.8e-13
VC_A1077
conserved hypothetical protein
protein from Vibrio cholerae O1 biovar El Tor 5.8e-13
egtB
Iron(II)-dependent oxidoreductase EgtB
protein from Mycobacterium tuberculosis 9.1e-12
CPS_2986
Putative uncharacterized protein
protein from Colwellia psychrerythraea 34H 1.3e-11
CPS_2986
conserved hypothetical protein
protein from Colwellia psychrerythraea 34H 1.3e-11
CPS_2927
Putative uncharacterized protein
protein from Colwellia psychrerythraea 34H 1.5e-11
CPS_2927
conserved hypothetical protein
protein from Colwellia psychrerythraea 34H 1.5e-11
egtB
Iron(II)-dependent oxidoreductase EgtB
protein from Mycobacterium smegmatis str. MC2 155 1.2e-09
PFL_0723
Uncharacterized protein
protein from Pseudomonas protegens Pf-5 5.0e-09
pvdO
Chromophore maturation protein PvdO
protein from Pseudomonas protegens Pf-5 7.8e-09

The BLAST search returned 1 gene product which did not match your query constraints. Please see the full BLAST report below for the details.

Back to top

Raw Blast Data

BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]

Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.

Reference:  Gish, W. (1996-2006) http://blast.wustl.edu

Query=  psy8678
        (394 letters)

Database:  go_20130330-seqdb.fasta
           368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done

                                                                     Smallest
                                                                       Sum
                                                              High  Probability
Sequences producing High-scoring Segment Pairs:              Score  P(N)      N

UNIPROTKB|F1P978 - symbol:SUMF1 "Uncharacterized protein"...   824  3.6e-82   1
MGI|MGI:1889844 - symbol:Sumf1 "sulfatase modifying facto...   819  1.2e-81   1
UNIPROTKB|Q0P5L5 - symbol:SUMF1 "Sulfatase-modifying fact...   817  2.0e-81   1
RGD|1309939 - symbol:Sumf1 "sulfatase modifying factor 1"...   813  5.2e-81   1
UNIPROTKB|F1SFL4 - symbol:SUMF1 "Uncharacterized protein"...   807  2.2e-80   1
UNIPROTKB|Q8NBK3 - symbol:SUMF1 "Sulfatase-modifying fact...   806  2.9e-80   1
ZFIN|ZDB-GENE-060421-3113 - symbol:sumf1 "sulfatase modif...   792  8.7e-79   1
UNIPROTKB|E1C234 - symbol:LOC100859261 "Uncharacterized p...   768  3.1e-76   1
FB|FBgn0035102 - symbol:CG7049 species:7227 "Drosophila m...   641  8.8e-63   1
UNIPROTKB|P95060 - symbol:MT0739 "Conserved protein" spec...   455  2.9e-56   2
UNIPROTKB|F1NWF3 - symbol:SUMF2 "Uncharacterized protein"...   468  1.9e-44   1
UNIPROTKB|Q8NBJ7 - symbol:SUMF2 "Sulfatase-modifying fact...   440  5.9e-42   2
MGI|MGI:1915152 - symbol:Sumf2 "sulfatase modifying facto...   437  3.6e-41   1
UNIPROTKB|F1RIU4 - symbol:SUMF2 "Uncharacterized protein"...   429  2.2e-40   2
UNIPROTKB|F1PA71 - symbol:SUMF2 "Uncharacterized protein"...   427  2.8e-40   2
UNIPROTKB|Q58CP2 - symbol:SUMF2 "Sulfatase-modifying fact...   426  2.8e-40   2
RGD|1563253 - symbol:Sumf2 "sulfatase modifying factor 2"...   419  1.9e-39   2
ZFIN|ZDB-GENE-041010-55 - symbol:sumf2 "sulfatase modifyi...   393  1.7e-36   1
UNIPROTKB|F8WA42 - symbol:SUMF2 "Sulfatase-modifying fact...   368  7.4e-34   1
UNIPROTKB|Q0C1L9 - symbol:HNE_1666 "Putative uncharacteri...   358  8.5e-33   1
UNIPROTKB|Q74ER3 - symbol:GSU0897 "Protein 3-oxoalanine-g...   244  1.9e-20   1
TIGR_CMR|GSU_0897 - symbol:GSU_0897 "conserved hypothetic...   244  1.9e-20   1
UNIPROTKB|Q884D9 - symbol:PSPTO_2154 "Uncharacterized pro...   176  3.1e-15   2
UNIPROTKB|Q48KB7 - symbol:PSPPH_1930 "Uncharacterized pro...   176  2.1e-14   2
UNIPROTKB|Q9KKM6 - symbol:VC_A1077 "Putative uncharacteri...   200  5.8e-13   1
TIGR_CMR|VC_A1077 - symbol:VC_A1077 "conserved hypothetic...   200  5.8e-13   1
UNIPROTKB|O69671 - symbol:egtB "Iron(II)-dependent oxidor...   187  9.1e-12   2
UNIPROTKB|Q47ZT3 - symbol:CPS_2986 "Putative uncharacteri...   116  1.3e-11   2
TIGR_CMR|CPS_2986 - symbol:CPS_2986 "conserved hypothetic...   116  1.3e-11   2
UNIPROTKB|Q47ZZ2 - symbol:CPS_2927 "Putative uncharacteri...   174  1.5e-11   1
TIGR_CMR|CPS_2927 - symbol:CPS_2927 "conserved hypothetic...   174  1.5e-11   1
UNIPROTKB|A0R5N0 - symbol:egtB "Iron(II)-dependent oxidor...   170  1.2e-09   2
UNIPROTKB|Q4KIR8 - symbol:PFL_0723 "Uncharacterized prote...   159  5.0e-09   1
UNIPROTKB|Q4K997 - symbol:pvdO "Chromophore maturation pr...   157  7.8e-09   1


>UNIPROTKB|F1P978 [details] [associations]
            symbol:SUMF1 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0042803 "protein homodimerization activity"
            evidence=IEA] [GO:0005783 "endoplasmic reticulum" evidence=IEA]
            GO:GO:0005783 InterPro:IPR016187 SUPFAM:SSF56436
            Gene3D:3.90.1580.10 InterPro:IPR005532 Pfam:PF03781 CTD:285362
            GeneTree:ENSGT00390000008983 KO:K13444 OMA:FRCAADH
            EMBL:AAEX03012095 RefSeq:XP_541796.2 Ensembl:ENSCAFT00000009609
            GeneID:484681 KEGG:cfa:484681 Uniprot:F1P978
        Length = 374

 Score = 824 (295.1 bits), Expect = 3.6e-82, P = 3.6e-82
 Identities = 135/188 (71%), Positives = 157/188 (83%)

Query:   161 SWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPW 220
             +W HPEG DST  HR +HPV+HVSWNDAVAYCTW G RLPTEAEWEY CRGGL+NRLFPW
Sbjct:   187 NWRHPEGPDSTTLHRPDHPVLHVSWNDAVAYCTWAGKRLPTEAEWEYSCRGGLQNRLFPW 246

Query:   221 GNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWW 280
             GN L P+G+H AN+WQGEFP  NT  DG+  TAPV ++  N +GLYN+VGNVWEWT+DWW
Sbjct:   247 GNKLQPKGQHYANIWQGEFPVTNTGEDGFRGTAPVDAFPPNGYGLYNIVGNVWEWTSDWW 306

Query:   281 NVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
              VHH    ++NPKGP +G D+VKKGGSY+C++ YCYR+RCAARSQNTPDSSA NLGFRCA
Sbjct:   307 TVHHSVEKTHNPKGPPSGKDRVKKGGSYMCHKSYCYRYRCAARSQNTPDSSASNLGFRCA 366

Query:   341 ADKGPTTG 348
             AD+ PTTG
Sbjct:   367 ADRQPTTG 374

 Score = 511 (184.9 bits), Expect = 9.7e-72, Sum P(2) = 9.7e-72
 Identities = 96/164 (58%), Positives = 122/164 (74%)

Query:    15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
             MVL+P   F MGT+ P + +DGE P+R V++DAFY+D +EVSN  F++FV++TGY+TEAE
Sbjct:    91 MVLIPAGVFTMGTDDPQIKQDGEAPARRVSIDAFYMDAYEVSNADFEKFVNSTGYLTEAE 150

Query:    75 KFGDTFVFEPLLSEEERAKISQ----------VRH-DMKRFEGLDSTIEHRMHHPVVHIS 123
             KFGD+FVFE +LSE+ +  I Q          V+  + +  EG DST  HR  HPV+H+S
Sbjct:   151 KFGDSFVFEGMLSEQVKTDIQQAVAAAPWWLPVKGANWRHPEGPDSTTLHRPDHPVLHVS 210

Query:   124 WNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEG 167
             WNDAVAYCTW G RLPTEAEWEY CRGGL+NRLFPWG+ L P+G
Sbjct:   211 WNDAVAYCTWAGKRLPTEAEWEYSCRGGLQNRLFPWGNKLQPKG 254

 Score = 233 (87.1 bits), Expect = 9.7e-72, Sum P(2) = 9.7e-72
 Identities = 40/50 (80%), Positives = 46/50 (92%)

Query:   343 KGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 392
             KGP +G D+VKKGGSY+C++ YCYR+RCAARSQNTPDSSA NLGFRCAAD
Sbjct:   319 KGPPSGKDRVKKGGSYMCHKSYCYRYRCAARSQNTPDSSASNLGFRCAAD 368


>MGI|MGI:1889844 [details] [associations]
            symbol:Sumf1 "sulfatase modifying factor 1" species:10090
            "Mus musculus" [GO:0005515 "protein binding" evidence=IPI]
            [GO:0005783 "endoplasmic reticulum" evidence=IDA] [GO:0016491
            "oxidoreductase activity" evidence=IEA] [GO:0042803 "protein
            homodimerization activity" evidence=IPI] [GO:0046872 "metal ion
            binding" evidence=IEA] [GO:0055114 "oxidation-reduction process"
            evidence=IEA] MGI:MGI:1889844 GO:GO:0005783 GO:GO:0046872
            InterPro:IPR016187 SUPFAM:SSF56436 GO:GO:0016491 GO:GO:0005788
            eggNOG:COG1262 Gene3D:3.90.1580.10 InterPro:IPR005532 Pfam:PF03781
            CTD:285362 GeneTree:ENSGT00390000008983 HOGENOM:HOG000135466
            HOVERGEN:HBG054193 KO:K13444 OMA:FRCAADH OrthoDB:EOG461449
            UniPathway:UPA00910 EMBL:AK151874 EMBL:AK159192 EMBL:AK160917
            EMBL:AK161203 EMBL:BC026981 IPI:IPI00153187 RefSeq:NP_666049.2
            UniGene:Mm.439876 ProteinModelPortal:Q8R0F3 SMR:Q8R0F3
            STRING:Q8R0F3 PhosphoSite:Q8R0F3 PaxDb:Q8R0F3 PRIDE:Q8R0F3
            Ensembl:ENSMUST00000032191 GeneID:58911 KEGG:mmu:58911
            UCSC:uc009ddf.2 InParanoid:Q8R0F3 NextBio:314476 Bgee:Q8R0F3
            CleanEx:MM_SUMF1 Genevestigator:Q8R0F3
            GermOnline:ENSMUSG00000030101 Uniprot:Q8R0F3
        Length = 372

 Score = 819 (293.4 bits), Expect = 1.2e-81, P = 1.2e-81
 Identities = 134/186 (72%), Positives = 157/186 (84%)

Query:   161 SWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPW 220
             +W HPEG DS+I HR NHPV+HVSWNDAVAYCTW G RLPTEAEWEY CRGGL+NRLFPW
Sbjct:   185 NWRHPEGPDSSILHRSNHPVLHVSWNDAVAYCTWAGKRLPTEAEWEYSCRGGLQNRLFPW 244

Query:   221 GNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWW 280
             GN L P+G+H AN+WQG+FP +NT  DG+  TAPV ++  N +GLYN+VGNVWEWT+DWW
Sbjct:   245 GNKLQPKGQHYANIWQGKFPVSNTGEDGFQGTAPVDAFPPNGYGLYNIVGNVWEWTSDWW 304

Query:   281 NVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
              VHH    ++NPKGPT+G D+VKKGGSY+C++ YCYR+RCAARSQNTPDSSA NLGFRCA
Sbjct:   305 TVHHSVEETFNPKGPTSGKDRVKKGGSYMCHKSYCYRYRCAARSQNTPDSSASNLGFRCA 364

Query:   341 ADKGPT 346
             AD  PT
Sbjct:   365 ADHLPT 370

 Score = 506 (183.2 bits), Expect = 7.6e-72, Sum P(2) = 7.6e-72
 Identities = 95/164 (57%), Positives = 122/164 (74%)

Query:    15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
             MV +P   F MGT+ P + +DGE P+R VT+D FY+D +EVSN  F++FV++TGY+TEAE
Sbjct:    89 MVPIPAGVFTMGTDDPQIRQDGEAPARRVTVDGFYMDAYEVSNADFEKFVNSTGYLTEAE 148

Query:    75 KFGDTFVFEPLLSEEERAKISQ----------VRH-DMKRFEGLDSTIEHRMHHPVVHIS 123
             KFGD+FVFE +LSE+ +  I Q          V+  + +  EG DS+I HR +HPV+H+S
Sbjct:   149 KFGDSFVFEGMLSEQVKTHIHQAVAAAPWWLPVKGANWRHPEGPDSSILHRSNHPVLHVS 208

Query:   124 WNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEG 167
             WNDAVAYCTW G RLPTEAEWEY CRGGL+NRLFPWG+ L P+G
Sbjct:   209 WNDAVAYCTWAGKRLPTEAEWEYSCRGGLQNRLFPWGNKLQPKG 252

 Score = 239 (89.2 bits), Expect = 7.6e-72, Sum P(2) = 7.6e-72
 Identities = 41/50 (82%), Positives = 47/50 (94%)

Query:   343 KGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 392
             KGPT+G D+VKKGGSY+C++ YCYR+RCAARSQNTPDSSA NLGFRCAAD
Sbjct:   317 KGPTSGKDRVKKGGSYMCHKSYCYRYRCAARSQNTPDSSASNLGFRCAAD 366


>UNIPROTKB|Q0P5L5 [details] [associations]
            symbol:SUMF1 "Sulfatase-modifying factor 1" species:9913
            "Bos taurus" [GO:0005788 "endoplasmic reticulum lumen"
            evidence=IEA] [GO:0042803 "protein homodimerization activity"
            evidence=IEA] [GO:0046872 "metal ion binding" evidence=IEA]
            [GO:0016491 "oxidoreductase activity" evidence=IEA] GO:GO:0046872
            InterPro:IPR016187 SUPFAM:SSF56436 GO:GO:0016491 GO:GO:0005788
            eggNOG:COG1262 Gene3D:3.90.1580.10 InterPro:IPR005532 Pfam:PF03781
            EMBL:BC119885 IPI:IPI00840745 RefSeq:NP_001069544.1
            UniGene:Bt.30055 ProteinModelPortal:Q0P5L5 SMR:Q0P5L5 STRING:Q0P5L5
            Ensembl:ENSBTAT00000055237 GeneID:536435 KEGG:bta:536435 CTD:285362
            GeneTree:ENSGT00390000008983 HOGENOM:HOG000135466
            HOVERGEN:HBG054193 InParanoid:Q0P5L5 KO:K13444 OMA:FRCAADH
            OrthoDB:EOG461449 UniPathway:UPA00910 NextBio:20876949
            Uniprot:Q0P5L5
        Length = 374

 Score = 817 (292.7 bits), Expect = 2.0e-81, P = 2.0e-81
 Identities = 134/188 (71%), Positives = 155/188 (82%)

Query:   161 SWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPW 220
             +W HPEG DST+ HR +HPV+HVSWNDAVAYCTW G RLPTEAEWEY CRGGL+NRLFPW
Sbjct:   187 NWRHPEGPDSTVLHRPDHPVLHVSWNDAVAYCTWAGKRLPTEAEWEYSCRGGLQNRLFPW 246

Query:   221 GNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWW 280
             GN L P+G+H AN+WQGEFP  NT  DG+  TAPV ++  N +GLYN+VGN WEWT+DWW
Sbjct:   247 GNKLQPKGQHYANIWQGEFPVTNTGEDGFRGTAPVDAFPPNGYGLYNIVGNAWEWTSDWW 306

Query:   281 NVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
              VHH    + NPKGP +G D+VKKGGSY+C++ YCYR+RCAARSQNTPDSSA NLGFRCA
Sbjct:   307 TVHHSAEETINPKGPPSGKDRVKKGGSYMCHKSYCYRYRCAARSQNTPDSSASNLGFRCA 366

Query:   341 ADKGPTTG 348
             AD  PTTG
Sbjct:   367 ADHLPTTG 374

 Score = 518 (187.4 bits), Expect = 1.8e-72, Sum P(2) = 1.8e-72
 Identities = 98/174 (56%), Positives = 128/174 (73%)

Query:     5 PAPPVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFV 64
             P+PP +    MV +P   F MGT+ P + +DGE P+R V +DAFY+D +EVSN +F++FV
Sbjct:    85 PSPPTK----MVPIPAGVFTMGTDDPQIKQDGEAPARRVAIDAFYMDAYEVSNAEFEKFV 140

Query:    65 SATGYVTEAEKFGDTFVFEPLLSEEERAKISQ----------VRH-DMKRFEGLDSTIEH 113
             ++TGY+TEAEKFGD+FVFE +LSE+ ++ I Q          V+  + +  EG DST+ H
Sbjct:   141 NSTGYLTEAEKFGDSFVFEGMLSEQVKSDIQQAVAAAPWWLPVKGANWRHPEGPDSTVLH 200

Query:   114 RMHHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEG 167
             R  HPV+H+SWNDAVAYCTW G RLPTEAEWEY CRGGL+NRLFPWG+ L P+G
Sbjct:   201 RPDHPVLHVSWNDAVAYCTWAGKRLPTEAEWEYSCRGGLQNRLFPWGNKLQPKG 254

 Score = 233 (87.1 bits), Expect = 1.8e-72, Sum P(2) = 1.8e-72
 Identities = 40/50 (80%), Positives = 46/50 (92%)

Query:   343 KGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 392
             KGP +G D+VKKGGSY+C++ YCYR+RCAARSQNTPDSSA NLGFRCAAD
Sbjct:   319 KGPPSGKDRVKKGGSYMCHKSYCYRYRCAARSQNTPDSSASNLGFRCAAD 368


>RGD|1309939 [details] [associations]
            symbol:Sumf1 "sulfatase modifying factor 1" species:10116
            "Rattus norvegicus" [GO:0003674 "molecular_function" evidence=ND]
            [GO:0005575 "cellular_component" evidence=ND] [GO:0005783
            "endoplasmic reticulum" evidence=IEA;ISO] [GO:0008150
            "biological_process" evidence=ND] [GO:0042803 "protein
            homodimerization activity" evidence=IEA;ISO] RGD:1309939
            GO:GO:0005783 InterPro:IPR016187 SUPFAM:SSF56436 EMBL:CH473957
            Gene3D:3.90.1580.10 InterPro:IPR005532 Pfam:PF03781 CTD:285362
            GeneTree:ENSGT00390000008983 KO:K13444 OrthoDB:EOG461449
            IPI:IPI00358387 RefSeq:NP_001102109.1 UniGene:Rn.203214
            Ensembl:ENSRNOT00000009008 GeneID:362409 KEGG:rno:362409
            UCSC:RGD:1309939 NextBio:679821 Uniprot:D4A7I8
        Length = 372

 Score = 813 (291.2 bits), Expect = 5.2e-81, P = 5.2e-81
 Identities = 134/185 (72%), Positives = 152/185 (82%)

Query:   162 WLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWG 221
             W HPEG DSTI HR NHPV+HVSWNDAVAYC W G RLPTEAEWEY CRGGL+NRLFPWG
Sbjct:   186 WRHPEGPDSTILHRSNHPVLHVSWNDAVAYCAWAGKRLPTEAEWEYSCRGGLQNRLFPWG 245

Query:   222 NNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWN 281
             N L P+G+H AN+WQGEFP  NT  DG+  TAPV ++  N +GLYN+VGN WEWT+DWW 
Sbjct:   246 NKLQPKGQHYANIWQGEFPVTNTGEDGFQGTAPVDAFPPNGYGLYNIVGNAWEWTSDWWT 305

Query:   282 VHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
             VHH    + NPKGPT+G D+VKKGGSY+C++ YCYR+RCAARSQNTPDSSA NLGFRCAA
Sbjct:   306 VHHSAEETLNPKGPTSGKDRVKKGGSYMCHKSYCYRYRCAARSQNTPDSSASNLGFRCAA 365

Query:   342 DKGPT 346
             D  PT
Sbjct:   366 DHLPT 370

 Score = 517 (187.1 bits), Expect = 5.3e-73, Sum P(2) = 5.3e-73
 Identities = 98/164 (59%), Positives = 123/164 (75%)

Query:    15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
             MV +P   F MGT+ P + +DGE P+R VT+DAFY+D +EVSN  F++FV++TGY+TEAE
Sbjct:    89 MVPIPAGVFTMGTDDPQIKQDGEAPARRVTVDAFYMDAYEVSNADFEKFVNSTGYLTEAE 148

Query:    75 KFGDTFVFEPLLSEEERAKISQ----------VRH-DMKRFEGLDSTIEHRMHHPVVHIS 123
             KFGD+FVFE +LSE  +A+I Q          V+  D +  EG DSTI HR +HPV+H+S
Sbjct:   149 KFGDSFVFEGMLSEPVKAQIHQAVAAAPWWLPVKGADWRHPEGPDSTILHRSNHPVLHVS 208

Query:   124 WNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEG 167
             WNDAVAYC W G RLPTEAEWEY CRGGL+NRLFPWG+ L P+G
Sbjct:   209 WNDAVAYCAWAGKRLPTEAEWEYSCRGGLQNRLFPWGNKLQPKG 252

 Score = 239 (89.2 bits), Expect = 5.3e-73, Sum P(2) = 5.3e-73
 Identities = 41/50 (82%), Positives = 47/50 (94%)

Query:   343 KGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 392
             KGPT+G D+VKKGGSY+C++ YCYR+RCAARSQNTPDSSA NLGFRCAAD
Sbjct:   317 KGPTSGKDRVKKGGSYMCHKSYCYRYRCAARSQNTPDSSASNLGFRCAAD 366


>UNIPROTKB|F1SFL4 [details] [associations]
            symbol:SUMF1 "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0042803 "protein homodimerization activity"
            evidence=IEA] [GO:0005783 "endoplasmic reticulum" evidence=IEA]
            GO:GO:0005783 InterPro:IPR016187 SUPFAM:SSF56436
            Gene3D:3.90.1580.10 InterPro:IPR005532 Pfam:PF03781
            GeneTree:ENSGT00390000008983 OMA:FRCAADH EMBL:CU928100
            Ensembl:ENSSSCT00000012622 Uniprot:F1SFL4
        Length = 376

 Score = 807 (289.1 bits), Expect = 2.2e-80, P = 2.2e-80
 Identities = 133/190 (70%), Positives = 155/190 (81%)

Query:   161 SWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPW 220
             +W HPEG DST+ HR +HPV+HVSWNDAVAYCTW G RLPTEAEWEY CRGGL+NRLFPW
Sbjct:   187 NWRHPEGPDSTVVHRPDHPVLHVSWNDAVAYCTWAGKRLPTEAEWEYSCRGGLQNRLFPW 246

Query:   221 GNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWW 280
             GN L P+G+H AN+WQGEFP  NT  DG+  TAPV ++  N +GLYN+VGN WEWT+DWW
Sbjct:   247 GNKLQPKGQHYANIWQGEFPVTNTGEDGFRGTAPVDAFPPNGYGLYNIVGNAWEWTSDWW 306

Query:   281 NVHHHPAPSYNP--KGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
              +HH    + NP  KGP +G D+VKKGGSY+C++ YCYR+RCAARSQNTPDSSA NLGFR
Sbjct:   307 TIHHAAEETINPPQKGPPSGKDRVKKGGSYMCHKSYCYRYRCAARSQNTPDSSASNLGFR 366

Query:   339 CAADKGPTTG 348
             CAAD  PTTG
Sbjct:   367 CAADHQPTTG 376

 Score = 511 (184.9 bits), Expect = 9.7e-72, Sum P(2) = 9.7e-72
 Identities = 99/178 (55%), Positives = 128/178 (71%)

Query:     5 PAP-PVERY---KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQF 60
             P P P ER      MV +P   F MGT+ P + +DGE P+R V +DAFY+D +EVSN +F
Sbjct:    77 PGPVPRERQFPLTKMVPIPAGVFTMGTDDPQIKQDGEAPARRVAIDAFYMDAYEVSNAEF 136

Query:    61 QEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQ----------VRH-DMKRFEGLDS 109
             ++FV++TGY+TEAEKFGD+FVFE +LS++ ++ I Q          V+  + +  EG DS
Sbjct:   137 EKFVNSTGYLTEAEKFGDSFVFEGILSDQVKSDIQQAVAAAPWWLPVKGANWRHPEGPDS 196

Query:   110 TIEHRMHHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEG 167
             T+ HR  HPV+H+SWNDAVAYCTW G RLPTEAEWEY CRGGL+NRLFPWG+ L P+G
Sbjct:   197 TVVHRPDHPVLHVSWNDAVAYCTWAGKRLPTEAEWEYSCRGGLQNRLFPWGNKLQPKG 254

 Score = 233 (87.1 bits), Expect = 9.7e-72, Sum P(2) = 9.7e-72
 Identities = 40/50 (80%), Positives = 46/50 (92%)

Query:   343 KGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 392
             KGP +G D+VKKGGSY+C++ YCYR+RCAARSQNTPDSSA NLGFRCAAD
Sbjct:   321 KGPPSGKDRVKKGGSYMCHKSYCYRYRCAARSQNTPDSSASNLGFRCAAD 370


>UNIPROTKB|Q8NBK3 [details] [associations]
            symbol:SUMF1 "Sulfatase-modifying factor 1" species:9606
            "Homo sapiens" [GO:0016491 "oxidoreductase activity" evidence=IEA]
            [GO:0046872 "metal ion binding" evidence=IEA] [GO:0042803 "protein
            homodimerization activity" evidence=IEA] [GO:0005788 "endoplasmic
            reticulum lumen" evidence=TAS] [GO:0006644 "phospholipid metabolic
            process" evidence=TAS] [GO:0006665 "sphingolipid metabolic process"
            evidence=TAS] [GO:0006687 "glycosphingolipid metabolic process"
            evidence=TAS] [GO:0043687 "post-translational protein modification"
            evidence=TAS] [GO:0044267 "cellular protein metabolic process"
            evidence=TAS] [GO:0044281 "small molecule metabolic process"
            evidence=TAS] Reactome:REACT_17015 Reactome:REACT_111217
            GO:GO:0044281 EMBL:CH471055 GO:GO:0046872 GO:GO:0006644
            InterPro:IPR016187 SUPFAM:SSF56436 GO:GO:0016491 GO:GO:0005788
            GO:GO:0043687 PDB:2AIJ PDB:2AIK PDBsum:2AIJ PDBsum:2AIK MIM:272200
            GO:GO:0006687 eggNOG:COG1262 Gene3D:3.90.1580.10 InterPro:IPR005532
            Pfam:PF03781 EMBL:AC024168 EMBL:AC023483 EMBL:AC034191 CTD:285362
            HOGENOM:HOG000135466 HOVERGEN:HBG054193 KO:K13444
            UniPathway:UPA00910 EMBL:AY208752 EMBL:AY323910 EMBL:AB448737
            EMBL:AY358092 EMBL:AK057983 EMBL:AK302018 EMBL:AK075459
            EMBL:AC018822 EMBL:AC023480 EMBL:AC023484 EMBL:AC024167
            EMBL:BC017005 EMBL:BC110862 EMBL:BC121122 EMBL:BC121123
            IPI:IPI00301144 IPI:IPI00332413 IPI:IPI00785161 IPI:IPI00909612
            IPI:IPI01009479 RefSeq:NP_877437.2 UniGene:Hs.350475 PDB:1Y1E
            PDB:1Y1F PDB:1Y1G PDB:1Y1H PDB:1Y1I PDB:1Y1J PDB:1Z70 PDB:2AFT
            PDB:2AFY PDB:2AII PDB:2HI8 PDB:2HIB PDBsum:1Y1E PDBsum:1Y1F
            PDBsum:1Y1G PDBsum:1Y1H PDBsum:1Y1I PDBsum:1Y1J PDBsum:1Z70
            PDBsum:2AFT PDBsum:2AFY PDBsum:2AII PDBsum:2HI8 PDBsum:2HIB
            ProteinModelPortal:Q8NBK3 SMR:Q8NBK3 IntAct:Q8NBK3
            MINT:MINT-4534860 STRING:Q8NBK3 PhosphoSite:Q8NBK3 DMDM:62298562
            PRIDE:Q8NBK3 Ensembl:ENST00000272902 Ensembl:ENST00000383843
            Ensembl:ENST00000405420 GeneID:285362 KEGG:hsa:285362
            UCSC:uc003bpz.2 GeneCards:GC03M003742 HGNC:HGNC:20376 HPA:HPA038025
            MIM:607939 neXtProt:NX_Q8NBK3 Orphanet:585 PharmGKB:PA134977552
            OMA:MCHRSQE PhylomeDB:Q8NBK3 ChiTaRS:SUMF1 EvolutionaryTrace:Q8NBK3
            GenomeRNAi:285362 NextBio:95470 ArrayExpress:Q8NBK3 Bgee:Q8NBK3
            CleanEx:HS_SUMF1 Genevestigator:Q8NBK3 GermOnline:ENSG00000144455
            Uniprot:Q8NBK3
        Length = 374

 Score = 806 (288.8 bits), Expect = 2.9e-80, P = 2.9e-80
 Identities = 133/186 (71%), Positives = 152/186 (81%)

Query:   161 SWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPW 220
             +W HPEG DSTI HR +HPV+HVSWNDAVAYCTW G RLPTEAEWEY CRGGL NRLFPW
Sbjct:   187 NWRHPEGPDSTILHRPDHPVLHVSWNDAVAYCTWAGKRLPTEAEWEYSCRGGLHNRLFPW 246

Query:   221 GNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWW 280
             GN L P+G+H AN+WQGEFP  NT  DG+  TAPV ++  N +GLYN+VGN WEWT+DWW
Sbjct:   247 GNKLQPKGQHYANIWQGEFPVTNTGEDGFQGTAPVDAFPPNGYGLYNIVGNAWEWTSDWW 306

Query:   281 NVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
              VHH    + NPKGP +G D+VKKGGSY+C+  YCYR+RCAARSQNTPDSSA NLGFRCA
Sbjct:   307 TVHHSVEETLNPKGPPSGKDRVKKGGSYMCHRSYCYRYRCAARSQNTPDSSASNLGFRCA 366

Query:   341 ADKGPT 346
             AD+ PT
Sbjct:   367 ADRLPT 372

 Score = 526 (190.2 bits), Expect = 3.2e-73, Sum P(2) = 3.2e-73
 Identities = 103/178 (57%), Positives = 129/178 (72%)

Query:     5 PAP-PVER---YKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQF 60
             P P P ER   +  MV +P   F MGT+ P + +DGE P+R VT+DAFY+D +EVSNT+F
Sbjct:    77 PGPVPGERQLAHSKMVPIPAGVFTMGTDDPQIKQDGEAPARRVTIDAFYMDAYEVSNTEF 136

Query:    61 QEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQ----------VRH-DMKRFEGLDS 109
             ++FV++TGY+TEAEKFGD+FVFE +LSE+ +  I Q          V+  + +  EG DS
Sbjct:   137 EKFVNSTGYLTEAEKFGDSFVFEGMLSEQVKTNIQQAVAAAPWWLPVKGANWRHPEGPDS 196

Query:   110 TIEHRMHHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEG 167
             TI HR  HPV+H+SWNDAVAYCTW G RLPTEAEWEY CRGGL NRLFPWG+ L P+G
Sbjct:   197 TILHRPDHPVLHVSWNDAVAYCTWAGKRLPTEAEWEYSCRGGLHNRLFPWGNKLQPKG 254

 Score = 232 (86.7 bits), Expect = 3.2e-73, Sum P(2) = 3.2e-73
 Identities = 40/50 (80%), Positives = 45/50 (90%)

Query:   343 KGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 392
             KGP +G D+VKKGGSY+C+  YCYR+RCAARSQNTPDSSA NLGFRCAAD
Sbjct:   319 KGPPSGKDRVKKGGSYMCHRSYCYRYRCAARSQNTPDSSASNLGFRCAAD 368


>ZFIN|ZDB-GENE-060421-3113 [details] [associations]
            symbol:sumf1 "sulfatase modifying factor 1"
            species:7955 "Danio rerio" [GO:0008150 "biological_process"
            evidence=ND] [GO:0005575 "cellular_component" evidence=ND]
            ZFIN:ZDB-GENE-060421-3113 InterPro:IPR016187 SUPFAM:SSF56436
            Gene3D:3.90.1580.10 InterPro:IPR005532 Pfam:PF03781
            HOGENOM:HOG000135466 HOVERGEN:HBG054193 EMBL:BC095053
            IPI:IPI00607483 UniGene:Dr.78716 STRING:Q504E9 InParanoid:Q504E9
            ArrayExpress:Q504E9 Uniprot:Q504E9
        Length = 388

 Score = 792 (283.9 bits), Expect = 8.7e-79, P = 8.7e-79
 Identities = 132/194 (68%), Positives = 155/194 (79%)

Query:   158 PWGS------WLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRG 211
             PW S      W HPEG DSTI +RMNHP +HVSW+DA AYC W   RLPTEAEWE  CRG
Sbjct:   195 PWWSPVKGADWRHPEGPDSTIHNRMNHPALHVSWDDARAYCQWAKRRLPTEAEWELACRG 254

Query:   212 GLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGN 271
             GL++R++PWGN L PRG+H AN+WQG+FP +NTA DGY +T+PVMS+  N FGLY+MVGN
Sbjct:   255 GLQDRMYPWGNKLMPRGQHYANLWQGDFPNHNTAEDGYANTSPVMSFPANGFGLYDMVGN 314

Query:   272 VWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSS 331
              WEWTADWW VHH     +NPKGP +GTD+VKKGGSY+C++ YCYR+RCAARSQNTPDSS
Sbjct:   315 AWEWTADWWTVHHSAEDKFNPKGPESGTDRVKKGGSYMCHKSYCYRYRCAARSQNTPDSS 374

Query:   332 AGNLGFRCAADKGP 345
             A NLGFRCA+D  P
Sbjct:   375 ASNLGFRCASDVDP 388

 Score = 463 (168.0 bits), Expect = 2.0e-67, Sum P(2) = 2.0e-67
 Identities = 89/164 (54%), Positives = 116/164 (70%)

Query:    15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
             +VLL G  F MGT+ P + +DGE P R V L AFY+++HEV+N QFQ F + TGY+TEAE
Sbjct:   108 LVLLQGGWFLMGTDDPGIPQDGEGPQRKVKLGAFYIEEHEVTNQQFQHFTNQTGYITEAE 167

Query:    75 KFGDTFVFEPLLSEEERAKISQ----------VRH-DMKRFEGLDSTIEHRMHHPVVHIS 123
             +FGD+FVFE LLSEE ++ +S           V+  D +  EG DSTI +RM+HP +H+S
Sbjct:   168 RFGDSFVFEGLLSEEVKSTLSHAVAAAPWWSPVKGADWRHPEGPDSTIHNRMNHPALHVS 227

Query:   124 WNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEG 167
             W+DA AYC W   RLPTEAEWE  CRGGL++R++PWG+ L P G
Sbjct:   228 WDDARAYCQWAKRRLPTEAEWELACRGGLQDRMYPWGNKLMPRG 271

 Score = 240 (89.5 bits), Expect = 2.0e-67, Sum P(2) = 2.0e-67
 Identities = 41/51 (80%), Positives = 48/51 (94%)

Query:   343 KGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADV 393
             KGP +GTD+VKKGGSY+C++ YCYR+RCAARSQNTPDSSA NLGFRCA+DV
Sbjct:   336 KGPESGTDRVKKGGSYMCHKSYCYRYRCAARSQNTPDSSASNLGFRCASDV 386

 Score = 37 (18.1 bits), Expect = 1.5e-18, Sum P(2) = 1.5e-18
 Identities = 9/31 (29%), Positives = 14/31 (45%)

Query:   166 EGIDST-IEHRMNHPVVHVSWNDAVAYCTWR 195
             EG+ S  ++  ++H V    W   V    WR
Sbjct:   176 EGLLSEEVKSTLSHAVAAAPWWSPVKGADWR 206


>UNIPROTKB|E1C234 [details] [associations]
            symbol:LOC100859261 "Uncharacterized protein" species:9031
            "Gallus gallus" [GO:0005783 "endoplasmic reticulum" evidence=IEA]
            [GO:0042803 "protein homodimerization activity" evidence=IEA]
            GO:GO:0005783 InterPro:IPR016187 SUPFAM:SSF56436
            Gene3D:3.90.1580.10 InterPro:IPR005532 Pfam:PF03781
            GeneTree:ENSGT00390000008983 OMA:FRCAADH EMBL:AADN02014477
            EMBL:AADN02014478 EMBL:AADN02014479 EMBL:AADN02014480
            IPI:IPI00586162 Ensembl:ENSGALT00000013484 Uniprot:E1C234
        Length = 284

 Score = 768 (275.4 bits), Expect = 3.1e-76, P = 3.1e-76
 Identities = 128/185 (69%), Positives = 152/185 (82%)

Query:   161 SWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPW 220
             +W  PEG  S+   RM+HPV+HVSWNDAVA+CTW G RLPTEAEWEYGCRGGLE RLFPW
Sbjct:    97 NWRQPEGPGSSGFSRMDHPVLHVSWNDAVAFCTWAGKRLPTEAEWEYGCRGGLEKRLFPW 156

Query:   221 GNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWW 280
             GN L P+G+H AN+WQG FPTNNTA DG  ++  V ++  N +GLYN+VGN WEWT+DWW
Sbjct:   157 GNKLQPKGQHYANIWQGVFPTNNTAEDGLKTSFHVTAFPPNGYGLYNIVGNAWEWTSDWW 216

Query:   281 NVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
              VHH    ++NPKGP++GTD+VKKGGSY+C++ YCYR+RCAARSQNTPDSSA NLGFRCA
Sbjct:   217 AVHHSADEAHNPKGPSSGTDRVKKGGSYMCHKSYCYRYRCAARSQNTPDSSASNLGFRCA 276

Query:   341 ADKGP 345
             AD  P
Sbjct:   277 ADALP 281

 Score = 503 (182.1 bits), Expect = 9.7e-72, Sum P(2) = 9.7e-72
 Identities = 95/164 (57%), Positives = 126/164 (76%)

Query:    15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
             MV +PG  F MGT++P + +DGE+P+R V +++FY+DQ+EVSN +F+ FV++TGY+TEAE
Sbjct:     1 MVAIPGGVFTMGTDEPEIQQDGEWPARRVHVNSFYMDQYEVSNQEFERFVNSTGYLTEAE 60

Query:    75 KFGDTFVFEPLLSEEERAKISQ----------VRH-DMKRFEGLDSTIEHRMHHPVVHIS 123
             KFGD+FVFE +LSEE +A+I Q          V+  + ++ EG  S+   RM HPV+H+S
Sbjct:    61 KFGDSFVFEGMLSEEVKAEIHQAVAAAPWWLPVKGANWRQPEGPGSSGFSRMDHPVLHVS 120

Query:   124 WNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEG 167
             WNDAVA+CTW G RLPTEAEWEYGCRGGLE RLFPWG+ L P+G
Sbjct:   121 WNDAVAFCTWAGKRLPTEAEWEYGCRGGLEKRLFPWGNKLQPKG 164

 Score = 241 (89.9 bits), Expect = 9.7e-72, Sum P(2) = 9.7e-72
 Identities = 41/50 (82%), Positives = 48/50 (96%)

Query:   343 KGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 392
             KGP++GTD+VKKGGSY+C++ YCYR+RCAARSQNTPDSSA NLGFRCAAD
Sbjct:   229 KGPSSGTDRVKKGGSYMCHKSYCYRYRCAARSQNTPDSSASNLGFRCAAD 278


>FB|FBgn0035102 [details] [associations]
            symbol:CG7049 species:7227 "Drosophila melanogaster"
            [GO:0005575 "cellular_component" evidence=ND] [GO:0008150
            "biological_process" evidence=ND] [GO:0003674 "molecular_function"
            evidence=ND] EMBL:AE014296 InterPro:IPR016187 SUPFAM:SSF56436
            Gene3D:3.90.1580.10 InterPro:IPR005532 Pfam:PF03781
            GeneTree:ENSGT00390000008983 KO:K13444 OMA:FRCAADH
            FlyBase:FBgn0035102 RefSeq:NP_612003.1 UniGene:Dm.793
            ProteinModelPortal:Q9W0U6 SMR:Q9W0U6 IntAct:Q9W0U6
            MINT:MINT-1627764 STRING:Q9W0U6 PRIDE:Q9W0U6
            EnsemblMetazoa:FBtr0072537 GeneID:38022 KEGG:dme:Dmel_CG7049
            UCSC:CG7049-RA InParanoid:Q9W0U6 PhylomeDB:Q9W0U6 GenomeRNAi:38022
            NextBio:806601 ArrayExpress:Q9W0U6 Bgee:Q9W0U6 Uniprot:Q9W0U6
        Length = 336

 Score = 641 (230.7 bits), Expect = 8.8e-63, P = 8.8e-63
 Identities = 113/180 (62%), Positives = 135/180 (75%)

Query:   161 SWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPW 220
             +W HP G+DS I+H   HPVVHVSW DAV YC W G RLP+EAEWE  CRGG E +LFPW
Sbjct:   164 NWRHPNGVDSDIDHLGRHPVVHVSWRDAVEYCKWAGKRLPSEAEWEAACRGGKERKLFPW 223

Query:   221 GNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWW 280
             GN L PR EH  N+WQG+FP  N A DG+  T+PV ++++N + L+NMVGNVWEWTAD W
Sbjct:   224 GNKLMPRNEHWLNIWQGDFPDGNLAEDGFEYTSPVDAFRQNIYDLHNMVGNVWEWTADLW 283

Query:   281 NVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
             +V+     S NP       ++VKKGGSYLC++ YCYR+RCAARSQNT DSSAGNLGFRCA
Sbjct:   284 DVND---VSDNP-------NRVKKGGSYLCHKSYCYRYRCAARSQNTEDSSAGNLGFRCA 333

 Score = 411 (149.7 bits), Expect = 3.3e-57, Sum P(2) = 3.3e-57
 Identities = 85/176 (48%), Positives = 107/176 (60%)

Query:    14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
             DM LLPG T  MGT+KP    D E P R V L+ FY+D++EVSN  F +FV  T Y TEA
Sbjct:    66 DMSLLPGGTVYMGTDKPHFPADREAPERQVKLNDFYIDKYEVSNEAFAKFVLHTNYTTEA 125

Query:    74 EKFGDTFVFEPLLSEEERAKISQVRH------------DMKRFEGLDSTIEHRMHHPVVH 121
             E++GD+F+F+ LLS  E+  +   R             + +   G+DS I+H   HPVVH
Sbjct:   126 ERYGDSFLFKSLLSPLEQKNLEDFRVASAVWWYKVAGVNWRHPNGVDSDIDHLGRHPVVH 185

Query:   122 ISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMN 177
             +SW DAV YC W G RLP+EAEWE  CRGG E +LFPWG+ L P       EH +N
Sbjct:   186 VSWRDAVEYCKWAGKRLPSEAEWEAACRGGKERKLFPWGNKLMPRN-----EHWLN 236

 Score = 195 (73.7 bits), Expect = 3.3e-57, Sum P(2) = 3.3e-57
 Identities = 34/41 (82%), Positives = 39/41 (95%)

Query:   350 DKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 390
             ++VKKGGSYLC++ YCYR+RCAARSQNT DSSAGNLGFRCA
Sbjct:   293 NRVKKGGSYLCHKSYCYRYRCAARSQNTEDSSAGNLGFRCA 333

 Score = 43 (20.2 bits), Expect = 8.0e-14, Sum P(2) = 8.0e-14
 Identities = 17/62 (27%), Positives = 24/62 (38%)

Query:    13 KDMVLLPGDTFRMGTNKPIL-IKDGEFPSRNVTLDAF-YLDQHEVSNTQFQEFVSATGYV 70
             K+  L P     M  N+  L I  G+FP  N+  D F Y    +       +  +  G V
Sbjct:   216 KERKLFPWGNKLMPRNEHWLNIWQGDFPDGNLAEDGFEYTSPVDAFRQNIYDLHNMVGNV 275

Query:    71 TE 72
              E
Sbjct:   276 WE 277


>UNIPROTKB|P95060 [details] [associations]
            symbol:MT0739 "Conserved protein" species:1773
            "Mycobacterium tuberculosis" [GO:0005618 "cell wall" evidence=IDA]
            [GO:0018083 "peptidyl-L-3-oxoalanine biosynthetic process from
            peptidyl-cysteine or peptidyl-serine" evidence=IDA] GO:GO:0005618
            EMBL:AE000516 GenomeReviews:AE000516_GR GenomeReviews:AL123456_GR
            InterPro:IPR016187 SUPFAM:SSF56436 EMBL:BX842574
            Gene3D:3.90.1580.10 InterPro:IPR005532 Pfam:PF03781
            HOGENOM:HOG000135466 EMBL:AL123456 PIR:C70643 RefSeq:NP_215226.1
            RefSeq:NP_335156.1 RefSeq:YP_006514056.1 SMR:P95060
            EnsemblBacteria:EBMYCT00000002440 EnsemblBacteria:EBMYCT00000070651
            GeneID:13318601 GeneID:888346 GeneID:926029 KEGG:mtc:MT0739
            KEGG:mtu:Rv0712 KEGG:mtv:RVBD_0712 PATRIC:18123355
            TubercuList:Rv0712 OMA:KGKLMAN ProtClustDB:CLSK790692 GO:GO:0018083
            Uniprot:P95060
        Length = 299

 Score = 455 (165.2 bits), Expect = 2.9e-56, Sum P(2) = 2.9e-56
 Identities = 92/199 (46%), Positives = 114/199 (57%)

Query:   158 PWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRL 217
             P   W HP G DS I  R  HPVV V++ DAVAY  W G RLPTEAEWEY  RGG     
Sbjct:   104 PGACWRHPFGRDSDIADRAGHPVVQVAYPDAVAYARWAGRRLPTEAEWEYAARGGT-TAT 162

Query:   218 FPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTA 277
             + WG+   P G   AN WQG FP  N  A G++ T+PV  +  N FGL +M+GNVWEWT 
Sbjct:   163 YAWGDQEKPGGMLMANTWQGRFPYRNDGALGWVGTSPVGRFPANGFGLLDMIGNVWEWTT 222

Query:   278 DWWNVHHHPAPSYN----PKGPTTGTD----KVKKGGSYLCNEQYCYRHRCAARSQNTPD 329
               +  HH   P       P    T  D    +  KGGS+LC  +YC+R+R AARS  + D
Sbjct:   223 TEFYPHHRIDPPSTACCAPVKLATAADPTISQTLKGGSHLCAPEYCHRYRPAARSPQSQD 282

Query:   330 SSAGNLGFRCAADKGPTTG 348
             ++  ++GFRC AD  P +G
Sbjct:   283 TATTHIGFRCVAD--PVSG 299

 Score = 171 (65.3 bits), Expect = 2.2e-31, Sum P(3) = 2.2e-31
 Identities = 33/63 (52%), Positives = 38/63 (60%)

Query:   106 GLDSTIEHRMHHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHP 165
             G DS I  R  HPVV +++ DAVAY  W G RLPTEAEWEY  RGG     + WG    P
Sbjct:   113 GRDSDIADRAGHPVVQVAYPDAVAYARWAGRRLPTEAEWEYAARGGT-TATYAWGDQEKP 171

Query:   166 EGI 168
              G+
Sbjct:   172 GGM 174

 Score = 142 (55.0 bits), Expect = 2.9e-56, Sum P(2) = 2.9e-56
 Identities = 31/62 (50%), Positives = 42/62 (67%)

Query:    14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
             ++V LPG +FRMG+ +       E P   VT+ AF +++H V+N QF EFVSATGYVT A
Sbjct:     4 ELVDLPGGSFRMGSTR---FYPEEAPIHTVTVRAFAVERHPVTNAQFAEFVSATGYVTVA 60

Query:    74 EK 75
             E+
Sbjct:    61 EQ 62

 Score = 128 (50.1 bits), Expect = 2.2e-31, Sum P(3) = 2.2e-31
 Identities = 27/65 (41%), Positives = 39/65 (60%)

Query:   328 PDSSAGNLGFRCAADKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGF 387
             P S+A     + A    PT    +  KGGS+LC  +YC+R+R AARS  + D++  ++GF
Sbjct:   233 PPSTACCAPVKLATAADPTIS--QTLKGGSHLCAPEYCHRYRPAARSPQSQDTATTHIGF 290

Query:   388 RCAAD 392
             RC AD
Sbjct:   291 RCVAD 295


>UNIPROTKB|F1NWF3 [details] [associations]
            symbol:SUMF2 "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0005783 "endoplasmic reticulum" evidence=IEA]
            [GO:0042803 "protein homodimerization activity" evidence=IEA]
            GO:GO:0005783 InterPro:IPR016187 SUPFAM:SSF56436
            Gene3D:3.90.1580.10 InterPro:IPR005532 Pfam:PF03781
            GeneTree:ENSGT00390000008983 EMBL:AADN02025924 IPI:IPI00583389
            Ensembl:ENSGALT00000003887 OMA:EGSANHR Uniprot:F1NWF3
        Length = 316

 Score = 468 (169.8 bits), Expect = 1.9e-44, P = 1.9e-44
 Identities = 91/196 (46%), Positives = 120/196 (61%)

Query:   162 WLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWG 221
             W  P G  S I  R++HPV+HVSWNDA A+C W+G RLPTE EWE+  RGGLE RL+PWG
Sbjct:   128 WRQPAGPGSGIADRLDHPVLHVSWNDAQAFCRWKGKRLPTEEEWEFAARGGLEQRLYPWG 187

Query:   222 NNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYK-ENKFGLYNMVGNVWEWTADWW 280
             N   P   +R N+WQG+FP  +TA DGY   +PV ++  +N +GLY+++GN WEWTA  +
Sbjct:   188 NKFQP---NRTNLWQGDFPRGDTAEDGYHGVSPVAAFSPQNSYGLYDLLGNTWEWTASQY 244

Query:   281 NVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCA--ARSQNTPDSSAGNLGFR 338
                  P P   P+ P      V +G S++        HR +   R  NTPDS++ NL FR
Sbjct:   245 TP---PGP---PR-PRAEAMHVLRGASWIDTVDGSANHRASITTRMGNTPDSASDNLSFR 297

Query:   339 CAAD-KGPTTGTDKVK 353
             CAAD    TT + + K
Sbjct:   298 CAADIPNRTTKSSRTK 313

 Score = 361 (132.1 bits), Expect = 1.1e-40, Sum P(2) = 1.1e-40
 Identities = 77/162 (47%), Positives = 98/162 (60%)

Query:    15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
             MV LPG  F+MG++     +D E P R VT+  F +D+  V+N  F+EFV    Y TEAE
Sbjct:    32 MVRLPGGRFQMGSSST-QSRDEEGPIREVTVKPFAIDKFPVTNRDFREFVREKKYKTEAE 90

Query:    75 KFGDTFVFEPLLSEEERAKISQVRHDM-------KRF----EGLDSTIEHRMHHPVVHIS 123
              FG +FVFE  +SEE + KI+Q            K F     G  S I  R+ HPV+H+S
Sbjct:    91 AFGWSFVFEDFVSEELKKKITQKLESAPWWLPVEKAFWRQPAGPGSGIADRLDHPVLHVS 150

Query:   124 WNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHP 165
             WNDA A+C W+G RLPTE EWE+  RGGLE RL+PWG+   P
Sbjct:   151 WNDAQAFCRWKGKRLPTEEEWEFAARGGLEQRLYPWGNKFQP 192

 Score = 88 (36.0 bits), Expect = 1.1e-40, Sum P(2) = 1.1e-40
 Identities = 19/44 (43%), Positives = 26/44 (59%)

Query:   352 VKKGGSYLCNEQYCYRHRCA--ARSQNTPDSSAGNLGFRCAADV 393
             V +G S++        HR +   R  NTPDS++ NL FRCAAD+
Sbjct:   259 VLRGASWIDTVDGSANHRASITTRMGNTPDSASDNLSFRCAADI 302


>UNIPROTKB|Q8NBJ7 [details] [associations]
            symbol:SUMF2 "Sulfatase-modifying factor 2" species:9606
            "Homo sapiens" [GO:0046872 "metal ion binding" evidence=IEA]
            [GO:0005788 "endoplasmic reticulum lumen" evidence=TAS] [GO:0043687
            "post-translational protein modification" evidence=TAS] [GO:0044267
            "cellular protein metabolic process" evidence=TAS]
            Reactome:REACT_17015 GO:GO:0046872 InterPro:IPR016187
            SUPFAM:SSF56436 GO:GO:0005788 GO:GO:0043687 EMBL:CH471140
            eggNOG:COG1262 Gene3D:3.90.1580.10 InterPro:IPR005532 Pfam:PF03781
            HOVERGEN:HBG054193 CTD:25870 EMBL:AY323911 EMBL:AL050037
            EMBL:AY359103 EMBL:AK300488 EMBL:AK301627 EMBL:AK075477
            EMBL:BC000224 EMBL:BC006159 EMBL:BC015600 EMBL:BC084539
            EMBL:BC111092 IPI:IPI00334513 IPI:IPI00334514 IPI:IPI00334516
            IPI:IPI00783919 IPI:IPI00939930 PIR:T08715 RefSeq:NP_001035934.2
            RefSeq:NP_001035935.2 RefSeq:NP_001123541.1 RefSeq:NP_001139805.1
            RefSeq:NP_056226.2 UniGene:Hs.279696 PDB:1Y4J PDBsum:1Y4J
            ProteinModelPortal:Q8NBJ7 SMR:Q8NBJ7 IntAct:Q8NBJ7
            MINT:MINT-1196002 STRING:Q8NBJ7 PhosphoSite:Q8NBJ7 DMDM:296452916
            REPRODUCTION-2DPAGE:IPI00171412 PaxDb:Q8NBJ7 PRIDE:Q8NBJ7
            DNASU:25870 Ensembl:ENST00000275607 GeneID:25870 KEGG:hsa:25870
            UCSC:uc003trt.3 UCSC:uc011kcz.2 UCSC:uc011kda.2
            GeneCards:GC07P056132 HGNC:HGNC:20415 HPA:CAB025743 HPA:HPA024040
            MIM:607940 neXtProt:NX_Q8NBJ7 PharmGKB:PA134921869
            EvolutionaryTrace:Q8NBJ7 GenomeRNAi:25870 NextBio:47250
            PMAP-CutDB:Q8NBJ7 ArrayExpress:Q8NBJ7 Bgee:Q8NBJ7 CleanEx:HS_SUMF2
            Genevestigator:Q8NBJ7 GermOnline:ENSG00000129103 Uniprot:Q8NBJ7
        Length = 301

 Score = 440 (159.9 bits), Expect = 1.7e-41, P = 1.7e-41
 Identities = 86/190 (45%), Positives = 113/190 (59%)

Query:   162 WLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWG 221
             W  P G  S I  R+ HPV+HVSWNDA AYC WRG RLPTE EWE+  RGGL+ +++PWG
Sbjct:   126 WRQPAGPGSGIRERLEHPVLHVSWNDARAYCAWRGKRLPTEEEWEFAARGGLKGQVYPWG 185

Query:   222 NNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYK-ENKFGLYNMVGNVWEWTADWW 280
             N   P   +R N+WQG+FP  + A DG+   +PV ++  +N +GLY+++GNVWEWTA   
Sbjct:   186 NWFQP---NRTNLWQGKFPKGDKAEDGFHGVSPVNAFPAQNNYGLYDLLGNVWEWTA--- 239

Query:   281 NVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRC--AARSQNTPDSSAGNLGFR 338
                       +P        +V +G S++        HR     R  NTPDS++ NLGFR
Sbjct:   240 ----------SPYQAAEQDMRVLRGASWIDTADGSANHRARVTTRMGNTPDSASDNLGFR 289

Query:   339 CAADKGPTTG 348
             CAAD G   G
Sbjct:   290 CAADAGRPPG 299

 Score = 368 (134.6 bits), Expect = 5.9e-42, Sum P(2) = 5.9e-42
 Identities = 76/162 (46%), Positives = 97/162 (59%)

Query:    15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
             MV L G  F MGTN P   +DG+ P R  T+  F +D   V+N  F++FV    Y TEAE
Sbjct:    30 MVQLQGGRFLMGTNSPDS-RDGDGPVREATVKPFAIDIFPVTNKDFRDFVREKKYRTEAE 88

Query:    75 KFGDTFVFEPLLSEEERAKISQVRHDM-------KRF----EGLDSTIEHRMHHPVVHIS 123
              FG +FVFE  +S+E R K +Q    +       K F     G  S I  R+ HPV+H+S
Sbjct:    89 MFGWSFVFEDFVSDELRNKATQPMKSVLWWLPVEKAFWRQPAGPGSGIRERLEHPVLHVS 148

Query:   124 WNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHP 165
             WNDA AYC WRG RLPTE EWE+  RGGL+ +++PWG+W  P
Sbjct:   149 WNDARAYCAWRGKRLPTEEEWEFAARGGLKGQVYPWGNWFQP 190

 Score = 93 (37.8 bits), Expect = 5.9e-42, Sum P(2) = 5.9e-42
 Identities = 20/44 (45%), Positives = 26/44 (59%)

Query:   351 KVKKGGSYLCNEQYCYRHRC--AARSQNTPDSSAGNLGFRCAAD 392
             +V +G S++        HR     R  NTPDS++ NLGFRCAAD
Sbjct:   250 RVLRGASWIDTADGSANHRARVTTRMGNTPDSASDNLGFRCAAD 293


>MGI|MGI:1915152 [details] [associations]
            symbol:Sumf2 "sulfatase modifying factor 2" species:10090
            "Mus musculus" [GO:0005515 "protein binding" evidence=IPI]
            [GO:0005783 "endoplasmic reticulum" evidence=IDA] [GO:0008150
            "biological_process" evidence=ND] [GO:0042803 "protein
            homodimerization activity" evidence=IPI] [GO:0046872 "metal ion
            binding" evidence=IEA] MGI:MGI:1915152 GO:GO:0005783 GO:GO:0046872
            EMBL:CH466529 InterPro:IPR016187 SUPFAM:SSF56436 GO:GO:0005788
            eggNOG:COG1262 Gene3D:3.90.1580.10 InterPro:IPR005532 Pfam:PF03781
            GeneTree:ENSGT00390000008983 HOGENOM:HOG000135466
            HOVERGEN:HBG054193 CTD:25870 OMA:ADQDMRV EMBL:AK076022
            EMBL:AK138300 EMBL:AK157784 IPI:IPI00223483 RefSeq:NP_080721.1
            RefSeq:XP_003689431.1 UniGene:Mm.103546 ProteinModelPortal:Q8BPG6
            SMR:Q8BPG6 PhosphoSite:Q8BPG6 REPRODUCTION-2DPAGE:IPI00223483
            PaxDb:Q8BPG6 PRIDE:Q8BPG6 Ensembl:ENSMUST00000171300 GeneID:67902
            KEGG:mmu:67902 InParanoid:Q3TZL1 NextBio:325902 Bgee:Q8BPG6
            CleanEx:MM_SUMF2 Genevestigator:Q8BPG6
            GermOnline:ENSMUSG00000025538 Uniprot:Q8BPG6
        Length = 308

 Score = 437 (158.9 bits), Expect = 3.6e-41, P = 3.6e-41
 Identities = 85/190 (44%), Positives = 113/190 (59%)

Query:   158 PWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRL 217
             P   W  P G  S I  ++  PVVHVSWNDA AYC WRG RLPTE EWE+  RGGL+ ++
Sbjct:   129 PKAFWRQPAGPGSGIREKLELPVVHVSWNDAGAYCAWRGRRLPTEEEWEFAARGGLKGQV 188

Query:   218 FPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYK-ENKFGLYNMVGNVWEWT 276
             +PWGN   P   +R N+WQG+FP  + A DG+   +PV ++  +N +GLY+++GNVWEWT
Sbjct:   189 YPWGNRFQP---NRTNLWQGKFPKGDKAEDGFHGLSPVNAFPPQNNYGLYDLMGNVWEWT 245

Query:   277 ADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRC--AARSQNTPDSSAGN 334
             A           +Y P G      +V +G S++        HR     R  NTPDS++ N
Sbjct:   246 AS----------TYQPAGQDM---RVLRGASWIDTADGSANHRARVTTRMGNTPDSASDN 292

Query:   335 LGFRCAADKG 344
             LGFRCA+  G
Sbjct:   293 LGFRCASSAG 302

 Score = 343 (125.8 bits), Expect = 8.3e-39, Sum P(2) = 8.3e-39
 Identities = 76/161 (47%), Positives = 96/161 (59%)

Query:    15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
             MV LPG  F MGT+ P   +DGE P+R VT+  F +D   V+N  F+EFV    Y TEAE
Sbjct:    38 MVHLPGGRFLMGTDAPDG-RDGEGPAREVTVKPFAIDIFPVTNKDFREFVREKKYQTEAE 96

Query:    75 KFGDTFVFEPLLSEEERAK--ISQVRHDM----KRF----EGLDSTIEHRMHHPVVHISW 124
              FG +FVFE  +S E R +  +    H      K F     G  S I  ++  PVVH+SW
Sbjct:    97 AFGWSFVFEDFVSPELRKQENLMPAVHWWQPVPKAFWRQPAGPGSGIREKLELPVVHVSW 156

Query:   125 NDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHP 165
             NDA AYC WRG RLPTE EWE+  RGGL+ +++PWG+   P
Sbjct:   157 NDAGAYCAWRGRRLPTEEEWEFAARGGLKGQVYPWGNRFQP 197

 Score = 88 (36.0 bits), Expect = 8.3e-39, Sum P(2) = 8.3e-39
 Identities = 24/62 (38%), Positives = 33/62 (53%)

Query:   333 GNLGFRCAADKGPTTGTD-KVKKGGSYLCNEQYCYRHRC--AARSQNTPDSSAGNLGFRC 389
             GN+    A+   P  G D +V +G S++        HR     R  NTPDS++ NLGFRC
Sbjct:   239 GNVWEWTASTYQPA-GQDMRVLRGASWIDTADGSANHRARVTTRMGNTPDSASDNLGFRC 297

Query:   390 AA 391
             A+
Sbjct:   298 AS 299


>UNIPROTKB|F1RIU4 [details] [associations]
            symbol:SUMF2 "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0042803 "protein homodimerization activity"
            evidence=IEA] [GO:0005783 "endoplasmic reticulum" evidence=IEA]
            GO:GO:0005783 InterPro:IPR016187 SUPFAM:SSF56436
            Gene3D:3.90.1580.10 InterPro:IPR005532 Pfam:PF03781
            GeneTree:ENSGT00390000008983 OMA:ADQDMRV EMBL:FP102627
            Ensembl:ENSSSCT00000008492 Uniprot:F1RIU4
        Length = 302

 Score = 429 (156.1 bits), Expect = 2.6e-40, P = 2.6e-40
 Identities = 84/190 (44%), Positives = 111/190 (58%)

Query:   162 WLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWG 221
             W  P G  S I  R+  PVVHVSWNDA AYC WRG RLPTE EWE+  RGGL+ +++PWG
Sbjct:   127 WRQPAGPGSGIRERLEFPVVHVSWNDARAYCAWRGKRLPTEEEWEFAARGGLKGQVYPWG 186

Query:   222 NNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYK-ENKFGLYNMVGNVWEWTADWW 280
             N   P   +R N+WQG+FP  + A DG+   +PV ++  +N +GLY+++GNVWEWTA   
Sbjct:   187 NQFQP---NRTNLWQGKFPKGDKAEDGFHGVSPVNAFPPQNNYGLYDLMGNVWEWTA--- 240

Query:   281 NVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRC--AARSQNTPDSSAGNLGFR 338
                       +P        +V +G S++        HR     R  NTPDS++ NLGFR
Sbjct:   241 ----------SPYQAADQDMRVLRGASWIDTADGSANHRARVTTRMGNTPDSASDNLGFR 290

Query:   339 CAADKGPTTG 348
             CA+  G   G
Sbjct:   291 CASSAGRPPG 300

 Score = 362 (132.5 bits), Expect = 2.2e-40, Sum P(2) = 2.2e-40
 Identities = 77/163 (47%), Positives = 101/163 (61%)

Query:    15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
             MV LPG  F+MGTN P   +DGE P R VT+  F +D   V+N  F++FV    Y TEAE
Sbjct:    30 MVQLPGGRFQMGTNSPDG-RDGEGPVREVTVKPFAIDIFPVTNKDFRDFVREKKYRTEAE 88

Query:    75 KFGDTFVFEPLLSEEERAKIS-QVRHDM-------KRF----EGLDSTIEHRMHHPVVHI 122
              FG +FVFE L+ +E R+K + Q++  +       + F     G  S I  R+  PVVH+
Sbjct:    89 AFGWSFVFEDLVPDELRSKATHQMQQSLLWWLPVERAFWRQPAGPGSGIRERLEFPVVHV 148

Query:   123 SWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHP 165
             SWNDA AYC WRG RLPTE EWE+  RGGL+ +++PWG+   P
Sbjct:   149 SWNDARAYCAWRGKRLPTEEEWEFAARGGLKGQVYPWGNQFQP 191

 Score = 84 (34.6 bits), Expect = 2.2e-40, Sum P(2) = 2.2e-40
 Identities = 18/43 (41%), Positives = 25/43 (58%)

Query:   351 KVKKGGSYLCNEQYCYRHRC--AARSQNTPDSSAGNLGFRCAA 391
             +V +G S++        HR     R  NTPDS++ NLGFRCA+
Sbjct:   251 RVLRGASWIDTADGSANHRARVTTRMGNTPDSASDNLGFRCAS 293


>UNIPROTKB|F1PA71 [details] [associations]
            symbol:SUMF2 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0042803 "protein homodimerization activity"
            evidence=IEA] [GO:0005783 "endoplasmic reticulum" evidence=IEA]
            GO:GO:0005783 InterPro:IPR016187 SUPFAM:SSF56436
            Gene3D:3.90.1580.10 InterPro:IPR005532 Pfam:PF03781
            GeneTree:ENSGT00390000008983 OMA:ADQDMRV EMBL:AAEX03004181
            EMBL:AAEX03004180 Ensembl:ENSCAFT00000016133 Uniprot:F1PA71
        Length = 300

 Score = 427 (155.4 bits), Expect = 4.2e-40, P = 4.2e-40
 Identities = 83/191 (43%), Positives = 115/191 (60%)

Query:   162 WLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWG 221
             W  P G +S I+ R+  PV+HVSWNDA AYC W+G RLPTE EWE+  RGGL+ +++PWG
Sbjct:   125 WRQPAGPNSGIQERLELPVLHVSWNDARAYCAWKGKRLPTEEEWEFAARGGLKGQVYPWG 184

Query:   222 NNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYK-ENKFGLYNMVGNVWEWTADWW 280
             N   P   +R N+WQG+FP  + A DG+   +PV ++  +N +GLY+++GNVWEWTA  +
Sbjct:   185 NQFQP---NRTNLWQGKFPKGDKAEDGFHGVSPVNAFPPQNNYGLYDLMGNVWEWTASLY 241

Query:   281 NVHHHPAPSYNPKGPTTGTD-KVKKGGSYLCNEQYCYRH--RCAARSQNTPDSSAGNLGF 337
                           P+   D +V +G S++        H  R   R  NTPDS++ NLGF
Sbjct:   242 --------------PSADQDMRVLRGASWIDTADGSANHLARVTTRMGNTPDSASDNLGF 287

Query:   338 RCAADKGPTTG 348
             RCA+  G   G
Sbjct:   288 RCASSIGRLPG 298

 Score = 354 (129.7 bits), Expect = 2.8e-40, Sum P(2) = 2.8e-40
 Identities = 75/171 (43%), Positives = 102/171 (59%)

Query:     6 APPVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVS 65
             AP   +   MV LPG  F+MGTN P   ++GE P R VT+  F +D   V+N  F+EFV 
Sbjct:    20 APGNGQATSMVQLPGGRFQMGTNSPDG-RNGEGPVREVTVKPFAIDVFPVTNKDFREFVR 78

Query:    66 ATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDM-------KRF----EGLDSTIEHR 114
                Y TEAE FG +FVFE  ++ E R K++     +       + F     G +S I+ R
Sbjct:    79 EKKYRTEAEMFGWSFVFEEFVANELRNKVTHQMESVLWWLPVERAFWRQPAGPNSGIQER 138

Query:   115 MHHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHP 165
             +  PV+H+SWNDA AYC W+G RLPTE EWE+  RGGL+ +++PWG+   P
Sbjct:   139 LELPVLHVSWNDARAYCAWKGKRLPTEEEWEFAARGGLKGQVYPWGNQFQP 189

 Score = 91 (37.1 bits), Expect = 2.8e-40, Sum P(2) = 2.8e-40
 Identities = 23/64 (35%), Positives = 34/64 (53%)

Query:   333 GNLGFRCAADKGPTTGTD-KVKKGGSYLCNEQYCYRH--RCAARSQNTPDSSAGNLGFRC 389
             GN+ +   A   P+   D +V +G S++        H  R   R  NTPDS++ NLGFRC
Sbjct:   231 GNV-WEWTASLYPSADQDMRVLRGASWIDTADGSANHLARVTTRMGNTPDSASDNLGFRC 289

Query:   390 AADV 393
             A+ +
Sbjct:   290 ASSI 293


>UNIPROTKB|Q58CP2 [details] [associations]
            symbol:SUMF2 "Sulfatase-modifying factor 2" species:9913
            "Bos taurus" [GO:0005788 "endoplasmic reticulum lumen"
            evidence=IEA] [GO:0042803 "protein homodimerization activity"
            evidence=IEA] [GO:0046872 "metal ion binding" evidence=IEA]
            GO:GO:0046872 InterPro:IPR016187 SUPFAM:SSF56436 GO:GO:0005788
            eggNOG:COG1262 Gene3D:3.90.1580.10 InterPro:IPR005532 Pfam:PF03781
            GeneTree:ENSGT00390000008983 HOGENOM:HOG000135466
            HOVERGEN:HBG054193 EMBL:BT021905 EMBL:BC149259 IPI:IPI00714072
            RefSeq:NP_001014881.1 UniGene:Bt.2897 ProteinModelPortal:Q58CP2
            SMR:Q58CP2 PRIDE:Q58CP2 Ensembl:ENSBTAT00000010773 GeneID:509497
            KEGG:bta:509497 CTD:25870 InParanoid:Q58CP2 OMA:ADQDMRV
            OrthoDB:EOG4XD3RS NextBio:20868990 Uniprot:Q58CP2
        Length = 301

 Score = 426 (155.0 bits), Expect = 5.3e-40, P = 5.3e-40
 Identities = 83/190 (43%), Positives = 110/190 (57%)

Query:   162 WLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWG 221
             W  P G  S I  ++  PVVHVSWNDA AYC WRG RLPTE EWE+  RGGL+ +++PWG
Sbjct:   126 WRQPAGPGSGIREKLEFPVVHVSWNDARAYCAWRGKRLPTEEEWEFAARGGLKGQVYPWG 185

Query:   222 NNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYK-ENKFGLYNMVGNVWEWTADWW 280
             N   P   +R N+WQG+FP  + A DG+   +PV ++  +N +GLY++VGNVWEWTA  +
Sbjct:   186 NKFQP---NRTNLWQGKFPKGDKAEDGFHGVSPVNAFPPQNDYGLYDLVGNVWEWTASQY 242

Query:   281 NVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRC--AARSQNTPDSSAGNLGFR 338
                                 +V +G S++        HR     R  NTPDS++ NLGFR
Sbjct:   243 QAADQDM-------------RVLRGASWIDTADGSANHRARVTTRMGNTPDSASDNLGFR 289

Query:   339 CAADKGPTTG 348
             CA+  G   G
Sbjct:   290 CASGAGRPPG 299

 Score = 361 (132.1 bits), Expect = 2.8e-40, Sum P(2) = 2.8e-40
 Identities = 77/163 (47%), Positives = 100/163 (61%)

Query:    14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
             +MV LPG  F+MGT+ P   +DGE P R VT+  F +D   V+N  F+EFV    Y TEA
Sbjct:    29 NMVQLPGGRFQMGTDSPDG-RDGEGPVREVTVKPFAIDIFPVTNKDFREFVREKKYRTEA 87

Query:    74 EKFGDTFVFEPLLSEEERAKISQVRHDM-------KRF----EGLDSTIEHRMHHPVVHI 122
             E FG +FVFE L+S+E R K +Q    +       + F     G  S I  ++  PVVH+
Sbjct:    88 EVFGWSFVFEDLVSDELRNKATQRMQSLLWWLPVERAFWRQPAGPGSGIREKLEFPVVHV 147

Query:   123 SWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHP 165
             SWNDA AYC WRG RLPTE EWE+  RGGL+ +++PWG+   P
Sbjct:   148 SWNDARAYCAWRGKRLPTEEEWEFAARGGLKGQVYPWGNKFQP 190

 Score = 84 (34.6 bits), Expect = 2.8e-40, Sum P(2) = 2.8e-40
 Identities = 18/43 (41%), Positives = 25/43 (58%)

Query:   351 KVKKGGSYLCNEQYCYRHRC--AARSQNTPDSSAGNLGFRCAA 391
             +V +G S++        HR     R  NTPDS++ NLGFRCA+
Sbjct:   250 RVLRGASWIDTADGSANHRARVTTRMGNTPDSASDNLGFRCAS 292


>RGD|1563253 [details] [associations]
            symbol:Sumf2 "sulfatase modifying factor 2" species:10116
            "Rattus norvegicus" [GO:0005783 "endoplasmic reticulum"
            evidence=ISO] [GO:0042803 "protein homodimerization activity"
            evidence=ISO] RGD:1563253 InterPro:IPR016187 SUPFAM:SSF56436
            Gene3D:3.90.1580.10 InterPro:IPR005532 Pfam:PF03781
            HOVERGEN:HBG054193 EMBL:BC091176 IPI:IPI00363630 UniGene:Rn.98463
            ProteinModelPortal:Q5BK78 SMR:Q5BK78 Genevestigator:Q5BK78
            Uniprot:Q5BK78
        Length = 271

 Score = 419 (152.6 bits), Expect = 2.9e-39, P = 2.9e-39
 Identities = 82/186 (44%), Positives = 111/186 (59%)

Query:   162 WLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWG 221
             W  P G  S I  ++  PVVHVSWNDA AYC WRG RLPTE EWE+  RGGL+ +++PWG
Sbjct:    96 WRQPAGPGSGIREKLELPVVHVSWNDAGAYCAWRGKRLPTEEEWEFAARGGLKGQVYPWG 155

Query:   222 NNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYK-ENKFGLYNMVGNVWEWTADWW 280
             N       +R N+WQG+FP  + A DG+   +PV ++  +N +GLY+++GNVWEWTA   
Sbjct:   156 NQFQL---NRTNLWQGKFPKGDRAEDGFHGLSPVNAFPPQNNYGLYDLMGNVWEWTAS-- 210

Query:   281 NVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRC--AARSQNTPDSSAGNLGFR 338
                     +Y   G      +V +G S++        HR     R  NTPDS++ NLGFR
Sbjct:   211 --------TYQAAGQDM---RVLRGASWIDTADGSANHRARVTTRMGNTPDSASDNLGFR 259

Query:   339 CAADKG 344
             CA+ +G
Sbjct:   260 CASSEG 265

 Score = 349 (127.9 bits), Expect = 1.9e-39, Sum P(2) = 1.9e-39
 Identities = 74/157 (47%), Positives = 97/157 (61%)

Query:    15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
             MV LPG  F MGTN P   +DGE P+R VT+  F +D   ++N  F+EFV    Y TEAE
Sbjct:     1 MVHLPGGRFLMGTNAPDG-RDGEGPAREVTVKPFAIDVFPITNKDFREFVREKKYQTEAE 59

Query:    75 KFGDTFVFEPLLSEEERAKISQVRHDM------KRF----EGLDSTIEHRMHHPVVHISW 124
              FG +FVFE  +S E R + +++   +      K F     G  S I  ++  PVVH+SW
Sbjct:    60 AFGWSFVFEDFVSPELRKQANEMPAVLWWLPVQKAFWRQPAGPGSGIREKLELPVVHVSW 119

Query:   125 NDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGS 161
             NDA AYC WRG RLPTE EWE+  RGGL+ +++PWG+
Sbjct:   120 NDAGAYCAWRGKRLPTEEEWEFAARGGLKGQVYPWGN 156

 Score = 88 (36.0 bits), Expect = 1.9e-39, Sum P(2) = 1.9e-39
 Identities = 23/62 (37%), Positives = 32/62 (51%)

Query:   333 GNLGFRCAADKGPTTGTD-KVKKGGSYLCNEQYCYRHRC--AARSQNTPDSSAGNLGFRC 389
             GN+ +   A      G D +V +G S++        HR     R  NTPDS++ NLGFRC
Sbjct:   202 GNV-WEWTASTYQAAGQDMRVLRGASWIDTADGSANHRARVTTRMGNTPDSASDNLGFRC 260

Query:   390 AA 391
             A+
Sbjct:   261 AS 262


>ZFIN|ZDB-GENE-041010-55 [details] [associations]
            symbol:sumf2 "sulfatase modifying factor 2"
            species:7955 "Danio rerio" [GO:0008150 "biological_process"
            evidence=ND] [GO:0005575 "cellular_component" evidence=ND]
            ZFIN:ZDB-GENE-041010-55 InterPro:IPR016187 SUPFAM:SSF56436
            eggNOG:COG1262 Gene3D:3.90.1580.10 InterPro:IPR005532 Pfam:PF03781
            GeneTree:ENSGT00390000008983 HOGENOM:HOG000135466
            HOVERGEN:HBG054193 CTD:25870 OMA:ADQDMRV OrthoDB:EOG4XD3RS
            EMBL:CR388166 EMBL:BC083430 EMBL:BC153567 IPI:IPI00506806
            RefSeq:NP_001005980.1 UniGene:Dr.85610 SMR:Q5XJ75
            Ensembl:ENSDART00000101203 GeneID:449807 KEGG:dre:449807
            InParanoid:Q5XJ75 NextBio:20832878 Uniprot:Q5XJ75
        Length = 299

 Score = 393 (143.4 bits), Expect = 1.7e-36, P = 1.7e-36
 Identities = 83/182 (45%), Positives = 105/182 (57%)

Query:   162 WLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWG 221
             W  P G  S I+ R++ PVV VSWNDA AYC W+  RLPTE EWE   RGGLE R +PWG
Sbjct:   123 WRQPAGPGSGIKDRLDCPVVQVSWNDAQAYCQWKKKRLPTEEEWEMAARGGLEGRSYPWG 182

Query:   222 NNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYK-ENKFGLYNMVGNVWEWTADWW 280
             N       +R N+WQG FP  ++A DGY   APV +Y  +N +GLY+M+GNVWEWT+   
Sbjct:   183 NKYLL---NRTNLWQGPFPDKDSAEDGYHGVAPVTAYPPQNNYGLYDMLGNVWEWTSS-- 237

Query:   281 NVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRC--AARSQNTPDSSAGNLGFR 338
                     S+    P   +  V +G S++        HR     R  NT DS++ NLGFR
Sbjct:   238 --------SF----PGAQSMFVLRGASWIDTADGSANHRARVTTRMGNTADSASDNLGFR 285

Query:   339 CA 340
             CA
Sbjct:   286 CA 287

 Score = 331 (121.6 bits), Expect = 3.5e-36, Sum P(2) = 3.5e-36
 Identities = 72/159 (45%), Positives = 96/159 (60%)

Query:    14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
             +MV +PG    MGT+     +DGE P+R V L  F +D++ V+N+ F+EFV    Y TEA
Sbjct:    26 EMVFIPGGKMLMGTSAADG-RDGESPTRAVALQPFKIDKYPVTNSNFREFVRLQKYKTEA 84

Query:    74 EKFGDTFVFEPLLSEEERAKISQVRHDM-------KRF----EGLDSTIEHRMHHPVVHI 122
             E FG +FVF+  +SEE ++K++Q            K F     G  S I+ R+  PVV +
Sbjct:    85 ETFGWSFVFQDFVSEELKSKVTQKIESAPWWLPVEKVFWRQPAGPGSGIKDRLDCPVVQV 144

Query:   123 SWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGS 161
             SWNDA AYC W+  RLPTE EWE   RGGLE R +PWG+
Sbjct:   145 SWNDAQAYCQWKKKRLPTEEEWEMAARGGLEGRSYPWGN 183

 Score = 75 (31.5 bits), Expect = 3.5e-36, Sum P(2) = 3.5e-36
 Identities = 18/45 (40%), Positives = 24/45 (53%)

Query:   352 VKKGGSYLCNEQYCYRHRC--AARSQNTPDSSAGNLGFRCAADVS 394
             V +G S++        HR     R  NT DS++ NLGFRCA + S
Sbjct:   247 VLRGASWIDTADGSANHRARVTTRMGNTADSASDNLGFRCAMNSS 291


>UNIPROTKB|F8WA42 [details] [associations]
            symbol:SUMF2 "Sulfatase-modifying factor 2" species:9606
            "Homo sapiens" [GO:0005783 "endoplasmic reticulum" evidence=IEA]
            [GO:0042803 "protein homodimerization activity" evidence=IEA]
            GO:GO:0005783 InterPro:IPR016187 SUPFAM:SSF56436
            Gene3D:3.90.1580.10 InterPro:IPR005532 Pfam:PF03781 EMBL:AC092101
            IPI:IPI00334514 HGNC:HGNC:20415 ProteinModelPortal:F8WA42
            SMR:F8WA42 PRIDE:F8WA42 Ensembl:ENST00000342190 UCSC:uc003trv.3
            ArrayExpress:F8WA42 Bgee:F8WA42 Uniprot:F8WA42
        Length = 358

 Score = 368 (134.6 bits), Expect = 7.4e-34, P = 7.4e-34
 Identities = 76/162 (46%), Positives = 97/162 (59%)

Query:    15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
             MV L G  F MGTN P   +DG+ P R  T+  F +D   V+N  F++FV    Y TEAE
Sbjct:    49 MVQLQGGRFLMGTNSPDS-RDGDGPVREATVKPFAIDIFPVTNKDFRDFVREKKYRTEAE 107

Query:    75 KFGDTFVFEPLLSEEERAKISQVRHDM-------KRF----EGLDSTIEHRMHHPVVHIS 123
              FG +FVFE  +S+E R K +Q    +       K F     G  S I  R+ HPV+H+S
Sbjct:   108 MFGWSFVFEDFVSDELRNKATQPMKSVLWWLPVEKAFWRQPAGPGSGIRERLEHPVLHVS 167

Query:   124 WNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHP 165
             WNDA AYC WRG RLPTE EWE+  RGGL+ +++PWG+W  P
Sbjct:   168 WNDARAYCAWRGKRLPTEEEWEFAARGGLKGQVYPWGNWFQP 209

 Score = 310 (114.2 bits), Expect = 1.0e-27, P = 1.0e-27
 Identities = 53/104 (50%), Positives = 70/104 (67%)

Query:   162 WLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWG 221
             W  P G  S I  R+ HPV+HVSWNDA AYC WRG RLPTE EWE+  RGGL+ +++PWG
Sbjct:   145 WRQPAGPGSGIRERLEHPVLHVSWNDARAYCAWRGKRLPTEEEWEFAARGGLKGQVYPWG 204

Query:   222 NNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYK-ENKFG 264
             N   P   +R N+WQG+FP  + A DG+   +PV ++  +N +G
Sbjct:   205 NWFQP---NRTNLWQGKFPKGDKAEDGFHGVSPVNAFPAQNNYG 245


>UNIPROTKB|Q0C1L9 [details] [associations]
            symbol:HNE_1666 "Putative uncharacterized protein"
            species:228405 "Hyphomonas neptunium ATCC 15444" [GO:0003674
            "molecular_function" evidence=ND] [GO:0005575 "cellular_component"
            evidence=ND] [GO:0008150 "biological_process" evidence=ND]
            InterPro:IPR016187 SUPFAM:SSF56436 EMBL:CP000158
            GenomeReviews:CP000158_GR eggNOG:COG1262 Gene3D:3.90.1580.10
            InterPro:IPR005532 Pfam:PF03781 HOGENOM:HOG000135466
            RefSeq:YP_760374.1 ProteinModelPortal:Q0C1L9 STRING:Q0C1L9
            GeneID:4289326 KEGG:hne:HNE_1666 PATRIC:32216155 OMA:CVRYRAS
            ProtClustDB:CLSK2781022 BioCyc:HNEP228405:GI69-1698-MONOMER
            Uniprot:Q0C1L9
        Length = 256

 Score = 358 (131.1 bits), Expect = 8.5e-33, P = 8.5e-33
 Identities = 78/178 (43%), Positives = 99/178 (55%)

Query:   161 SWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPW 220
             SW  PEG  S+IE + N PV+HVS  DA AY  W G RLP+E EWE+  R GL +     
Sbjct:    87 SWKTPEGAGSSIEGKGNWPVMHVSLADAEAYAAWAGGRLPSEEEWEHAARLGLPDPDRET 146

Query:   221 GNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWW 280
                    G+ RAN WQG FP  N   DG+   APV  +  ++ GLY+M+GNVWEWT D  
Sbjct:   147 SGAFEDDGKPRANTWQGIFPVANAGEDGFAGAAPVGCFPADQLGLYDMIGNVWEWT-D-- 203

Query:   281 NVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
                        P  P  G + +K GGSYLC + +C R+R AAR     D S+ ++GFR
Sbjct:   204 ----------TPFAP--GNNTIK-GGSYLCADNFCQRYRPAARHPQEIDFSSNHIGFR 248

 Score = 215 (80.7 bits), Expect = 1.8e-25, Sum P(2) = 1.8e-25
 Identities = 56/140 (40%), Positives = 75/140 (53%)

Query:    16 VLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEK 75
             V +PG   + G      I   E P   + +D F +  HEV+N QF EFV+ATGYVT+AE+
Sbjct:     5 VEVPGGVLQKGRGA---IYPEERPEVTLHVDGFRIQAHEVTNDQFAEFVTATGYVTDAER 61

Query:    76 FGDTFVFEPLLSEE--ERAKISQVRH-DMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
              G      P       + A+   +R    K  EG  S+IE + + PV+H+S  DA AY  
Sbjct:    62 -G-VMEDRPGAGSAVFQGARWHLMREASWKTPEGAGSSIEGKGNWPVMHVSLADAEAYAA 119

Query:   133 WRGARLPTEAEWEYGCRGGL 152
             W G RLP+E EWE+  R GL
Sbjct:   120 WAGGRLPSEEEWEHAARLGL 139

 Score = 98 (39.6 bits), Expect = 1.8e-25, Sum P(2) = 1.8e-25
 Identities = 18/40 (45%), Positives = 26/40 (65%)

Query:   354 KGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADV 393
             KGGSYLC + +C R+R AAR     D S+ ++GFR   ++
Sbjct:   214 KGGSYLCADNFCQRYRPAARHPQEIDFSSNHIGFRIVKEL 253


>UNIPROTKB|Q74ER3 [details] [associations]
            symbol:GSU0897 "Protein 3-oxoalanine-generating enzyme
            family protein" species:243231 "Geobacter sulfurreducens PCA"
            [GO:0003674 "molecular_function" evidence=ND] [GO:0005575
            "cellular_component" evidence=ND] [GO:0008150 "biological_process"
            evidence=ND] InterPro:IPR016187 SUPFAM:SSF56436 EMBL:AE017180
            GenomeReviews:AE017180_GR Gene3D:3.90.1580.10 InterPro:IPR005532
            Pfam:PF03781 HOGENOM:HOG000135467 RefSeq:NP_951953.1
            ProteinModelPortal:Q74ER3 GeneID:2687293 KEGG:gsu:GSU0897
            PATRIC:22024563 OMA:AFYISVY ProtClustDB:CLSK743103
            BioCyc:GSUL243231:GH27-884-MONOMER Uniprot:Q74ER3
        Length = 291

 Score = 244 (91.0 bits), Expect = 1.9e-20, P = 1.9e-20
 Identities = 72/213 (33%), Positives = 104/213 (48%)

Query:   135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
             GA   T AE++  C      +      W  P    S    R N P ++VSW+DAVAY  W
Sbjct:    92 GAYEVTFAEYDRFCEATGREKPKDGRRWFGPL---SRNWGRGNKPAMNVSWDDAVAYVKW 148

Query:   195 RGA------RLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADG 248
                      RLP+EAEWEY  RGG +   + WG  +   G+++AN  +G      +  D 
Sbjct:   149 LSDQTGHRYRLPSEAEWEYAARGGKDTPYW-WGGTV---GQNKANC-KG----CGSRWDK 199

Query:   249 YLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTD---KVKKG 305
              + TAPV S+  N +G+++  GNVWEW  D W+  +  AP+     P  G +   +V++G
Sbjct:   200 KI-TAPVGSFAPNPYGMFDTAGNVWEWCVDTWHESYDGAPADG--SPWIGGEDSRRVQRG 256

Query:   306 GSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
             GS+    +Y    R +AR +   D     LGFR
Sbjct:   257 GSFGSKPRYI---RSSARGRGAQDGRYVYLGFR 286

 Score = 120 (47.3 bits), Expect = 2.7e-09, Sum P(3) = 2.7e-09
 Identities = 28/67 (41%), Positives = 38/67 (56%)

Query:   100 DMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGA------RLPTEAEWEYGCRGGLE 153
             D +R+ G  S    R + P +++SW+DAVAY  W         RLP+EAEWEY  RGG +
Sbjct:   115 DGRRWFGPLSRNWGRGNKPAMNVSWDDAVAYVKWLSDQTGHRYRLPSEAEWEYAARGGKD 174

Query:   154 NRLFPWG 160
                + WG
Sbjct:   175 TPYW-WG 180

 Score = 63 (27.2 bits), Expect = 2.7e-09, Sum P(3) = 2.7e-09
 Identities = 25/72 (34%), Positives = 35/72 (48%)

Query:    15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLD-AFYLDQHEVSNTQFQEFVSATGYVTEA 73
             MV++P   FRMG        D E P   V++  AF +  +EV+  ++  F  ATG   E 
Sbjct:    56 MVVIPPGRFRMGAIFGGGDPD-EKPVHEVSIPRAFAIGAYEVTFAEYDRFCEATG--REK 112

Query:    74 EKFGDTFVFEPL 85
              K G  + F PL
Sbjct:   113 PKDGRRW-FGPL 123

 Score = 55 (24.4 bits), Expect = 2.7e-09, Sum P(3) = 2.7e-09
 Identities = 17/56 (30%), Positives = 27/56 (48%)

Query:   341 ADKGPTTGTD---KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADV 393
             AD  P  G +   +V++GGS+    +Y    R +AR +   D     LGFR   ++
Sbjct:   239 ADGSPWIGGEDSRRVQRGGSFGSKPRYI---RSSARGRGAQDGRYVYLGFRVVREL 291


>TIGR_CMR|GSU_0897 [details] [associations]
            symbol:GSU_0897 "conserved hypothetical protein"
            species:243231 "Geobacter sulfurreducens PCA" [GO:0003674
            "molecular_function" evidence=ND] [GO:0005575 "cellular_component"
            evidence=ND] [GO:0008150 "biological_process" evidence=ND]
            InterPro:IPR016187 SUPFAM:SSF56436 EMBL:AE017180
            GenomeReviews:AE017180_GR Gene3D:3.90.1580.10 InterPro:IPR005532
            Pfam:PF03781 HOGENOM:HOG000135467 RefSeq:NP_951953.1
            ProteinModelPortal:Q74ER3 GeneID:2687293 KEGG:gsu:GSU0897
            PATRIC:22024563 OMA:AFYISVY ProtClustDB:CLSK743103
            BioCyc:GSUL243231:GH27-884-MONOMER Uniprot:Q74ER3
        Length = 291

 Score = 244 (91.0 bits), Expect = 1.9e-20, P = 1.9e-20
 Identities = 72/213 (33%), Positives = 104/213 (48%)

Query:   135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
             GA   T AE++  C      +      W  P    S    R N P ++VSW+DAVAY  W
Sbjct:    92 GAYEVTFAEYDRFCEATGREKPKDGRRWFGPL---SRNWGRGNKPAMNVSWDDAVAYVKW 148

Query:   195 RGA------RLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADG 248
                      RLP+EAEWEY  RGG +   + WG  +   G+++AN  +G      +  D 
Sbjct:   149 LSDQTGHRYRLPSEAEWEYAARGGKDTPYW-WGGTV---GQNKANC-KG----CGSRWDK 199

Query:   249 YLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTD---KVKKG 305
              + TAPV S+  N +G+++  GNVWEW  D W+  +  AP+     P  G +   +V++G
Sbjct:   200 KI-TAPVGSFAPNPYGMFDTAGNVWEWCVDTWHESYDGAPADG--SPWIGGEDSRRVQRG 256

Query:   306 GSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
             GS+    +Y    R +AR +   D     LGFR
Sbjct:   257 GSFGSKPRYI---RSSARGRGAQDGRYVYLGFR 286

 Score = 120 (47.3 bits), Expect = 2.7e-09, Sum P(3) = 2.7e-09
 Identities = 28/67 (41%), Positives = 38/67 (56%)

Query:   100 DMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGA------RLPTEAEWEYGCRGGLE 153
             D +R+ G  S    R + P +++SW+DAVAY  W         RLP+EAEWEY  RGG +
Sbjct:   115 DGRRWFGPLSRNWGRGNKPAMNVSWDDAVAYVKWLSDQTGHRYRLPSEAEWEYAARGGKD 174

Query:   154 NRLFPWG 160
                + WG
Sbjct:   175 TPYW-WG 180

 Score = 63 (27.2 bits), Expect = 2.7e-09, Sum P(3) = 2.7e-09
 Identities = 25/72 (34%), Positives = 35/72 (48%)

Query:    15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLD-AFYLDQHEVSNTQFQEFVSATGYVTEA 73
             MV++P   FRMG        D E P   V++  AF +  +EV+  ++  F  ATG   E 
Sbjct:    56 MVVIPPGRFRMGAIFGGGDPD-EKPVHEVSIPRAFAIGAYEVTFAEYDRFCEATG--REK 112

Query:    74 EKFGDTFVFEPL 85
              K G  + F PL
Sbjct:   113 PKDGRRW-FGPL 123

 Score = 55 (24.4 bits), Expect = 2.7e-09, Sum P(3) = 2.7e-09
 Identities = 17/56 (30%), Positives = 27/56 (48%)

Query:   341 ADKGPTTGTD---KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADV 393
             AD  P  G +   +V++GGS+    +Y    R +AR +   D     LGFR   ++
Sbjct:   239 ADGSPWIGGEDSRRVQRGGSFGSKPRYI---RSSARGRGAQDGRYVYLGFRVVREL 291


>UNIPROTKB|Q884D9 [details] [associations]
            symbol:PSPTO_2154 "Uncharacterized protein" species:223283
            "Pseudomonas syringae pv. tomato str. DC3000" [GO:0003674
            "molecular_function" evidence=ND] [GO:0005575 "cellular_component"
            evidence=ND] [GO:0008150 "biological_process" evidence=ND]
            InterPro:IPR016187 SUPFAM:SSF56436 EMBL:AE016853
            GenomeReviews:AE016853_GR eggNOG:COG1262 Gene3D:3.90.1580.10
            InterPro:IPR005532 Pfam:PF03781 HOGENOM:HOG000135467 OMA:HANTYGP
            ProtClustDB:CLSK867237 RefSeq:NP_791976.1 ProteinModelPortal:Q884D9
            GeneID:1183801 KEGG:pst:PSPTO_2154 PATRIC:19995586
            BioCyc:PSYR223283:GJIX-2195-MONOMER Uniprot:Q884D9
        Length = 282

 Score = 176 (67.0 bits), Expect = 3.1e-15, Sum P(2) = 3.1e-15
 Identities = 56/173 (32%), Positives = 78/173 (45%)

Query:   179 PVVHVSWNDAVAYCTW------RGARLPTEAEWEYGCRGGLENRL-FPWGNNLTPRGEHR 231
             P V + WN+A AY  W      +   + +EA+ EY  RGG +    FP            
Sbjct:   121 PAVCMDWNEAKAYVEWLSKKTGKSYHMVSEAQREYAARGGSKGSFPFPMDEGKPYSIAKH 180

Query:   232 ANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYN 291
             AN +  E        DG+  TAP  SY  N FG+Y+  GNV+EWTAD    +++ AP+ +
Sbjct:   181 ANTYGPE--------DGFSYTAPAGSYSPNAFGIYDAHGNVYEWTADCETSNYNGAPT-D 231

Query:   292 PKGPTTG--TDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
                   G  T K+ +G  +   E   +  R   R+   PD     LGFR A D
Sbjct:   232 GSAWLAGDCTWKMIRGNDW--TEAPIFS-RSGNRNSRQPDVRGDWLGFRVARD 281

 Score = 73 (30.8 bits), Expect = 3.1e-15, Sum P(2) = 3.1e-15
 Identities = 24/79 (30%), Positives = 41/79 (51%)

Query:     2 VLLPAP---PVERYKD------MVLLPGDTFRMGTNKPILIKD-GEFPSRNVT-LDAFYL 50
             V +PAP   P + +KD      MV+LP  TF MG  +  + +   E P  +VT +  F +
Sbjct:    16 VSVPAPVPAPGKTFKDCKDCPEMVVLPAGTFTMGAPEEEMGRQPDEGPLHDVTFVKPFAI 75

Query:    51 DQHEVSNTQFQEFVSATGY 69
              Q +V   ++  ++ ++GY
Sbjct:    76 SQFQVLAGEWDAYIKSSGY 94


>UNIPROTKB|Q48KB7 [details] [associations]
            symbol:PSPPH_1930 "Uncharacterized protein" species:264730
            "Pseudomonas syringae pv. phaseolicola 1448A" [GO:0003674
            "molecular_function" evidence=ND] [GO:0005575 "cellular_component"
            evidence=ND] [GO:0008150 "biological_process" evidence=ND]
            InterPro:IPR016187 SUPFAM:SSF56436 EMBL:CP000058
            GenomeReviews:CP000058_GR eggNOG:COG1262 Gene3D:3.90.1580.10
            InterPro:IPR005532 Pfam:PF03781 HOGENOM:HOG000135467
            RefSeq:YP_274158.1 ProteinModelPortal:Q48KB7 STRING:Q48KB7
            GeneID:3557524 KEGG:psp:PSPPH_1930 PATRIC:19973061 OMA:HANTYGP
            ProtClustDB:CLSK867237 Uniprot:Q48KB7
        Length = 288

 Score = 176 (67.0 bits), Expect = 2.1e-14, Sum P(2) = 2.1e-14
 Identities = 56/173 (32%), Positives = 78/173 (45%)

Query:   179 PVVHVSWNDAVAYCTW------RGARLPTEAEWEYGCRGGLENRL-FPWGNNLTPRGEHR 231
             P V + WN+A AY  W      +   + +EA+ EY  RGG +    FP            
Sbjct:   127 PAVCMDWNEAKAYVEWLSKKTGKSYHMVSEAQREYAARGGSKGSFPFPMDEGKPYSIAKH 186

Query:   232 ANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYN 291
             AN +  E        DG+  TAP  SY  N FG+Y+  GNV+EWTAD    +++ AP+ +
Sbjct:   187 ANTYGPE--------DGFSYTAPAGSYSPNDFGIYDAHGNVYEWTADCETSNYNGAPT-D 237

Query:   292 PKGPTTG--TDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
                   G  T K+ +G  +   E   +  R   R+   PD     LGFR A D
Sbjct:   238 GSAWLAGDCTWKMIRGNDW--TEAPIFS-RSGNRNSRQPDVRGDWLGFRVARD 287

 Score = 66 (28.3 bits), Expect = 2.1e-14, Sum P(2) = 2.1e-14
 Identities = 16/58 (27%), Positives = 32/58 (55%)

Query:    14 DMVLLPGDTFRMGTNKPILIKD-GEFPSRNVTL-DAFYLDQHEVSNTQFQEFVSATGY 69
             +MV+LP  TF MG  +  + +   E P  +VT    F + + +V + ++  ++ ++GY
Sbjct:    43 EMVVLPAGTFTMGAPEEEMGRQPDEGPLHDVTFAKPFAISRFQVLSGEWNAYIKSSGY 100


>UNIPROTKB|Q9KKM6 [details] [associations]
            symbol:VC_A1077 "Putative uncharacterized protein"
            species:243277 "Vibrio cholerae O1 biovar El Tor str. N16961"
            [GO:0003674 "molecular_function" evidence=ND] [GO:0005575
            "cellular_component" evidence=ND] [GO:0008150 "biological_process"
            evidence=ND] InterPro:IPR016187 SUPFAM:SSF56436 EMBL:AE003853
            GenomeReviews:AE003853_GR Gene3D:3.90.1580.10 InterPro:IPR005532
            Pfam:PF03781 PIR:A82381 RefSeq:NP_233458.1
            ProteinModelPortal:Q9KKM6 DNASU:2611976 GeneID:2611976
            KEGG:vch:VCA1077 PATRIC:20086670 OMA:PAVCISK ProtClustDB:CLSK869910
            InterPro:IPR013229 Pfam:PF08308 Uniprot:Q9KKM6
        Length = 605

 Score = 200 (75.5 bits), Expect = 5.8e-13, P = 5.8e-13
 Identities = 59/163 (36%), Positives = 78/163 (47%)

Query:   140 TEAEWEYGCRGGLENRLFPWGS--WLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGA 197
             T+AE +  C    E+ + P     W +P G   +     + PVV VS NDA AY  W   
Sbjct:   432 TDAELKNLCISVNESEIAPVSDSDWRNP-GFKQS----KDSPVVCVSQNDAKAYARWLSK 486

Query:   198 ------RLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANV-WQGEFPTNNTAADGYL 250
                   RLPTE EWE   R G +   + WGN     G  +AN  W G   +N        
Sbjct:   487 ETGFTYRLPTEEEWEIAARAGSKTDYW-WGNKF---GAGKANTGWAGTSWSNK------- 535

Query:   251 STAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPK 293
             ST+PV ++  N  G Y+MVGNVWEWT D   +    A S++P+
Sbjct:   536 STSPVKAFAPNALGFYDMVGNVWEWTGDSRGLAKGGAWSFSPE 578

 Score = 130 (50.8 bits), Expect = 3.4e-05, P = 3.4e-05
 Identities = 44/133 (33%), Positives = 61/133 (45%)

Query:    36 GEFPSRNVTLD-AFYLDQHEVSNTQFQEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKI 94
             GE  ++   LD AF L    V+ +QF+ FV+ T Y T+AE        + L      ++I
Sbjct:   396 GENAAKQYNLDHAFALSSTPVTVSQFENFVTQTKYKTDAE-------LKNLCISVNESEI 448

Query:    95 SQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGA------RLPTEAEWEYGC 148
             + V     R  G   + +     PVV +S NDA AY  W         RLPTE EWE   
Sbjct:   449 APVSDSDWRNPGFKQSKDS----PVVCVSQNDAKAYARWLSKETGFTYRLPTEEEWEIAA 504

Query:   149 RGGLENRLFPWGS 161
             R G +   + WG+
Sbjct:   505 RAGSKTDYW-WGN 516


>TIGR_CMR|VC_A1077 [details] [associations]
            symbol:VC_A1077 "conserved hypothetical protein"
            species:686 "Vibrio cholerae O1 biovar El Tor" [GO:0003674
            "molecular_function" evidence=ND] [GO:0005575 "cellular_component"
            evidence=ND] [GO:0008150 "biological_process" evidence=ND]
            InterPro:IPR016187 SUPFAM:SSF56436 EMBL:AE003853
            GenomeReviews:AE003853_GR Gene3D:3.90.1580.10 InterPro:IPR005532
            Pfam:PF03781 PIR:A82381 RefSeq:NP_233458.1
            ProteinModelPortal:Q9KKM6 DNASU:2611976 GeneID:2611976
            KEGG:vch:VCA1077 PATRIC:20086670 OMA:PAVCISK ProtClustDB:CLSK869910
            InterPro:IPR013229 Pfam:PF08308 Uniprot:Q9KKM6
        Length = 605

 Score = 200 (75.5 bits), Expect = 5.8e-13, P = 5.8e-13
 Identities = 59/163 (36%), Positives = 78/163 (47%)

Query:   140 TEAEWEYGCRGGLENRLFPWGS--WLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGA 197
             T+AE +  C    E+ + P     W +P G   +     + PVV VS NDA AY  W   
Sbjct:   432 TDAELKNLCISVNESEIAPVSDSDWRNP-GFKQS----KDSPVVCVSQNDAKAYARWLSK 486

Query:   198 ------RLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANV-WQGEFPTNNTAADGYL 250
                   RLPTE EWE   R G +   + WGN     G  +AN  W G   +N        
Sbjct:   487 ETGFTYRLPTEEEWEIAARAGSKTDYW-WGNKF---GAGKANTGWAGTSWSNK------- 535

Query:   251 STAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPK 293
             ST+PV ++  N  G Y+MVGNVWEWT D   +    A S++P+
Sbjct:   536 STSPVKAFAPNALGFYDMVGNVWEWTGDSRGLAKGGAWSFSPE 578

 Score = 130 (50.8 bits), Expect = 3.4e-05, P = 3.4e-05
 Identities = 44/133 (33%), Positives = 61/133 (45%)

Query:    36 GEFPSRNVTLD-AFYLDQHEVSNTQFQEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKI 94
             GE  ++   LD AF L    V+ +QF+ FV+ T Y T+AE        + L      ++I
Sbjct:   396 GENAAKQYNLDHAFALSSTPVTVSQFENFVTQTKYKTDAE-------LKNLCISVNESEI 448

Query:    95 SQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGA------RLPTEAEWEYGC 148
             + V     R  G   + +     PVV +S NDA AY  W         RLPTE EWE   
Sbjct:   449 APVSDSDWRNPGFKQSKDS----PVVCVSQNDAKAYARWLSKETGFTYRLPTEEEWEIAA 504

Query:   149 RGGLENRLFPWGS 161
             R G +   + WG+
Sbjct:   505 RAGSKTDYW-WGN 516


>UNIPROTKB|O69671 [details] [associations]
            symbol:egtB "Iron(II)-dependent oxidoreductase EgtB"
            species:1773 "Mycobacterium tuberculosis" [GO:0008198 "ferrous iron
            binding" evidence=ISS] [GO:0016491 "oxidoreductase activity"
            evidence=ISS] [GO:0052704 "ergothioneine biosynthesis from
            histidine via N-alpha,N-alpha,N-alpha-trimethyl-L-histidine"
            evidence=ISS] InterPro:IPR017806 UniPathway:UPA01014
            GenomeReviews:AL123456_GR GO:GO:0008198 EMBL:BX842583
            InterPro:IPR016187 SUPFAM:SSF56436 GO:GO:0016491
            HOGENOM:HOG000253478 OMA:EYSEVFF ProtClustDB:CLSK872236
            GO:GO:0052704 Gene3D:3.90.1580.10 InterPro:IPR024775
            InterPro:IPR005532 Pfam:PF12867 Pfam:PF03781 TIGRFAMs:TIGR03440
            PIR:H70793 RefSeq:NP_218220.1 RefSeq:YP_006517193.1
            ProteinModelPortal:O69671 SMR:O69671 PRIDE:O69671
            EnsemblBacteria:EBMYCT00000001372 GeneID:13317315 GeneID:885128
            KEGG:mtu:Rv3703c KEGG:mtv:RVBD_3703c PATRIC:18156840
            TubercuList:Rv3703c Uniprot:O69671
        Length = 425

 Score = 187 (70.9 bits), Expect = 9.1e-12, Sum P(2) = 9.1e-12
 Identities = 66/203 (32%), Positives = 86/203 (42%)

Query:   118 PVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEH-RM 176
             PV +  W D +    +  +R  +E  W++  R GL    F W S          +E    
Sbjct:   208 PVTNGEWQDFIDDGGYTQSRWWSERGWQHRQRAGLTAPQF-WRSGGRTRTRFGHVEDIPA 266

Query:   177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQ 236
             + PV HVS+ +A AY  W GARLPTE EWE  C          W       G  R   W 
Sbjct:   267 DEPVQHVSYFEAEAYAAWAGARLPTEVEWEKACA---------WD---PATGSRRRYPWG 314

Query:   237 GEFPTNNTAADG--YLSTAPVMSYKE--NKFGLYNMVGNVWEWTADWWNVHHHPAPSYNP 292
              E PT+  A  G   L  APV +Y    +  G   M+G+VWEWT         P P + P
Sbjct:   315 TEEPTDTYANLGGQTLRPAPVGAYPAGASACGAEQMLGDVWEWTTS----PLRPWPGFVP 370

Query:   293 ------KGPTTGTD-KVKKGGSY 308
                     P  G D +V +GGS+
Sbjct:   371 MVYERYSQPFFGGDYRVLRGGSW 393

 Score = 155 (59.6 bits), Expect = 3.3e-08, P = 3.3e-08
 Identities = 51/167 (30%), Positives = 77/167 (46%)

Query:     4 LPAPPVERYKDMVLLPGDTFRMGTN---KPILIKDGEFPSRNVTLDAFYLDQHEVSNTQF 60
             LPA         VL+ G  F +G +   +P  + D E P+  V + AF + +  V+N ++
Sbjct:   156 LPAGRPRMAGTSVLVAGGPFVLGVDAADEPCSL-DNERPAHVVDVPAFRIGRVPVTNGEW 214

Query:    61 QEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKIS--QVRHDMKRFEGLDSTIEH-RMHH 117
             Q+F+   GY T++  +      E      +RA ++  Q      R       +E      
Sbjct:   215 QDFIDDGGY-TQSRWWS-----ERGWQHRQRAGLTAPQFWRSGGRTRTRFGHVEDIPADE 268

Query:   118 PVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGL---ENRLFPWGS 161
             PV H+S+ +A AY  W GARLPTE EWE  C         R +PWG+
Sbjct:   269 PVQHVSYFEAEAYAAWAGARLPTEVEWEKACAWDPATGSRRRYPWGT 315

 Score = 39 (18.8 bits), Expect = 9.1e-12, Sum P(2) = 9.1e-12
 Identities = 7/15 (46%), Positives = 11/15 (73%)

Query:    77 GDTFVFEPLLSEEER 91
             GD+FVF  ++S E +
Sbjct:   118 GDSFVFAMVISHENQ 132


>UNIPROTKB|Q47ZT3 [details] [associations]
            symbol:CPS_2986 "Putative uncharacterized protein"
            species:167879 "Colwellia psychrerythraea 34H" [GO:0003674
            "molecular_function" evidence=ND] [GO:0005575 "cellular_component"
            evidence=ND] [GO:0008150 "biological_process" evidence=ND]
            InterPro:IPR016187 SUPFAM:SSF56436 EMBL:CP000083
            GenomeReviews:CP000083_GR eggNOG:COG1262 Gene3D:3.90.1580.10
            InterPro:IPR005532 Pfam:PF03781 RefSeq:YP_269686.1
            ProteinModelPortal:Q47ZT3 STRING:Q47ZT3 GeneID:3522208
            KEGG:cps:CPS_2986 PATRIC:21468989 HOGENOM:HOG000135467 OMA:PNVPVNN
            BioCyc:CPSY167879:GI48-3035-MONOMER Uniprot:Q47ZT3
        Length = 294

 Score = 116 (45.9 bits), Expect = 1.3e-11, Sum P(2) = 1.3e-11
 Identities = 49/179 (27%), Positives = 73/179 (40%)

Query:   179 PVVHVSWNDAVAY------CTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRA 232
             PV ++SW + + +       T +   LPTEA+W Y  +GG  N+      N    G +  
Sbjct:   132 PVNNISWFNMLLFIERLNSATGKEFSLPTEAQWAYAAKGG--NK----SQNYRYSGSNNI 185

Query:   233 N--VWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP-APS 289
             N   W  +   N +         PV   K N+ GLY+M GN+WE+  D  +   +    S
Sbjct:   186 NDVAWFADNAKNKSH--------PVGLKKPNELGLYDMTGNLWEFCLDDMSRQAYTFTES 237

Query:   290 YNP-KGPTTGTD----KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADK 343
             +NP  G          KV +GG Y   E     +    R   T +    ++GFR    K
Sbjct:   238 HNPFMGDKENLKQKAMKVIRGGGY---EFSATENLVFMRDGATNNVRMADIGFRLVMSK 293

 Score = 111 (44.1 bits), Expect = 1.3e-11, Sum P(2) = 1.3e-11
 Identities = 24/62 (38%), Positives = 41/62 (66%)

Query:     4 LPAPPV-ERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQE 62
             LP P + E  K+MVL+   +F MG++ P L ++ E P R V+LDAFY+ ++E++   F++
Sbjct:    59 LPKPMLDELMKNMVLVEAGSFAMGSDSP-LARNREKPVRQVSLDAFYIGKYELTQDLFEQ 117

Query:    63 FV 64
              +
Sbjct:   118 IM 119

 Score = 68 (29.0 bits), Expect = 1.3e-06, Sum P(2) = 1.3e-06
 Identities = 17/51 (33%), Positives = 29/51 (56%)

Query:   118 PVVHISWNDAVAY------CTWRGARLPTEAEWEYGCRGGLENRLFPW-GS 161
             PV +ISW + + +       T +   LPTEA+W Y  +GG +++ + + GS
Sbjct:   132 PVNNISWFNMLLFIERLNSATGKEFSLPTEAQWAYAAKGGNKSQNYRYSGS 182


>TIGR_CMR|CPS_2986 [details] [associations]
            symbol:CPS_2986 "conserved hypothetical protein"
            species:167879 "Colwellia psychrerythraea 34H" [GO:0003674
            "molecular_function" evidence=ND] [GO:0005575 "cellular_component"
            evidence=ND] [GO:0008150 "biological_process" evidence=ND]
            InterPro:IPR016187 SUPFAM:SSF56436 EMBL:CP000083
            GenomeReviews:CP000083_GR eggNOG:COG1262 Gene3D:3.90.1580.10
            InterPro:IPR005532 Pfam:PF03781 RefSeq:YP_269686.1
            ProteinModelPortal:Q47ZT3 STRING:Q47ZT3 GeneID:3522208
            KEGG:cps:CPS_2986 PATRIC:21468989 HOGENOM:HOG000135467 OMA:PNVPVNN
            BioCyc:CPSY167879:GI48-3035-MONOMER Uniprot:Q47ZT3
        Length = 294

 Score = 116 (45.9 bits), Expect = 1.3e-11, Sum P(2) = 1.3e-11
 Identities = 49/179 (27%), Positives = 73/179 (40%)

Query:   179 PVVHVSWNDAVAY------CTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRA 232
             PV ++SW + + +       T +   LPTEA+W Y  +GG  N+      N    G +  
Sbjct:   132 PVNNISWFNMLLFIERLNSATGKEFSLPTEAQWAYAAKGG--NK----SQNYRYSGSNNI 185

Query:   233 N--VWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP-APS 289
             N   W  +   N +         PV   K N+ GLY+M GN+WE+  D  +   +    S
Sbjct:   186 NDVAWFADNAKNKSH--------PVGLKKPNELGLYDMTGNLWEFCLDDMSRQAYTFTES 237

Query:   290 YNP-KGPTTGTD----KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADK 343
             +NP  G          KV +GG Y   E     +    R   T +    ++GFR    K
Sbjct:   238 HNPFMGDKENLKQKAMKVIRGGGY---EFSATENLVFMRDGATNNVRMADIGFRLVMSK 293

 Score = 111 (44.1 bits), Expect = 1.3e-11, Sum P(2) = 1.3e-11
 Identities = 24/62 (38%), Positives = 41/62 (66%)

Query:     4 LPAPPV-ERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQE 62
             LP P + E  K+MVL+   +F MG++ P L ++ E P R V+LDAFY+ ++E++   F++
Sbjct:    59 LPKPMLDELMKNMVLVEAGSFAMGSDSP-LARNREKPVRQVSLDAFYIGKYELTQDLFEQ 117

Query:    63 FV 64
              +
Sbjct:   118 IM 119

 Score = 68 (29.0 bits), Expect = 1.3e-06, Sum P(2) = 1.3e-06
 Identities = 17/51 (33%), Positives = 29/51 (56%)

Query:   118 PVVHISWNDAVAY------CTWRGARLPTEAEWEYGCRGGLENRLFPW-GS 161
             PV +ISW + + +       T +   LPTEA+W Y  +GG +++ + + GS
Sbjct:   132 PVNNISWFNMLLFIERLNSATGKEFSLPTEAQWAYAAKGGNKSQNYRYSGS 182


>UNIPROTKB|Q47ZZ2 [details] [associations]
            symbol:CPS_2927 "Putative uncharacterized protein"
            species:167879 "Colwellia psychrerythraea 34H" [GO:0003674
            "molecular_function" evidence=ND] [GO:0005575 "cellular_component"
            evidence=ND] [GO:0008150 "biological_process" evidence=ND]
            InterPro:IPR016187 SUPFAM:SSF56436 EMBL:CP000083
            GenomeReviews:CP000083_GR eggNOG:COG1262 Gene3D:3.90.1580.10
            InterPro:IPR005532 Pfam:PF03781 RefSeq:YP_269627.1
            ProteinModelPortal:Q47ZZ2 STRING:Q47ZZ2 GeneID:3520658
            KEGG:cps:CPS_2927 PATRIC:21468881
            BioCyc:CPSY167879:GI48-2976-MONOMER Uniprot:Q47ZZ2
        Length = 248

 Score = 174 (66.3 bits), Expect = 1.5e-11, P = 1.5e-11
 Identities = 43/108 (39%), Positives = 54/108 (50%)

Query:   177 NHPVVHVSWNDAVAYCTWRGA------RLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEH 230
             N PVV V +   + YC W  +      RL TEAEWEY  R G +  +F WGN ++P  +H
Sbjct:   126 NLPVVGVDYFTCINYCHWLSSKLNCKVRLLTEAEWEYCARAGTDT-IFSWGNEISPVAKH 184

Query:   231 RANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTAD 278
                 W   F  N       L+   V     N +GLY+M GNVWEW AD
Sbjct:   185 ---AW---FFDN-----AKLNIKMVKQLAPNNWGLYDMTGNVWEWCAD 221

 Score = 114 (45.2 bits), Expect = 0.00041, P = 0.00041
 Identities = 42/191 (21%), Positives = 79/191 (41%)

Query:    55 VSNTQFQEFVSAT---------GYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFE 105
             V+N+ FQ ++ A           ++     F D + F+   ++ E++   ++    K   
Sbjct:    61 VTNSMFQHYLKAQPESLNEMEFSFIANKAYFSD-YYFK---NKSEKSLKDKIEWQSKNLL 116

Query:   106 GLDSTIEHRMHHPVVHISWNDAVAYCTWRGA------RLPTEAEWEYGCRGGLENRLFPW 159
                +  E+  + PVV + +   + YC W  +      RL TEAEWEY  R G +  +F W
Sbjct:   117 NYINKAENS-NLPVVGVDYFTCINYCHWLSSKLNCKVRLLTEAEWEYCARAGTDT-IFSW 174

Query:   160 GSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFP 219
             G+ + P    +      ++  +++     +A   W G    T   WE+ C      + + 
Sbjct:   175 GNEISPVAKHAWF---FDNAKLNIKMVKQLAPNNW-GLYDMTGNVWEW-CADKYSQKFYD 229

Query:   220 WGNNLTPRGEH 230
             + N   P+  H
Sbjct:   230 YSNKKDPKSTH 240


>TIGR_CMR|CPS_2927 [details] [associations]
            symbol:CPS_2927 "conserved hypothetical protein"
            species:167879 "Colwellia psychrerythraea 34H" [GO:0003674
            "molecular_function" evidence=ND] [GO:0005575 "cellular_component"
            evidence=ND] [GO:0008150 "biological_process" evidence=ND]
            InterPro:IPR016187 SUPFAM:SSF56436 EMBL:CP000083
            GenomeReviews:CP000083_GR eggNOG:COG1262 Gene3D:3.90.1580.10
            InterPro:IPR005532 Pfam:PF03781 RefSeq:YP_269627.1
            ProteinModelPortal:Q47ZZ2 STRING:Q47ZZ2 GeneID:3520658
            KEGG:cps:CPS_2927 PATRIC:21468881
            BioCyc:CPSY167879:GI48-2976-MONOMER Uniprot:Q47ZZ2
        Length = 248

 Score = 174 (66.3 bits), Expect = 1.5e-11, P = 1.5e-11
 Identities = 43/108 (39%), Positives = 54/108 (50%)

Query:   177 NHPVVHVSWNDAVAYCTWRGA------RLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEH 230
             N PVV V +   + YC W  +      RL TEAEWEY  R G +  +F WGN ++P  +H
Sbjct:   126 NLPVVGVDYFTCINYCHWLSSKLNCKVRLLTEAEWEYCARAGTDT-IFSWGNEISPVAKH 184

Query:   231 RANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTAD 278
                 W   F  N       L+   V     N +GLY+M GNVWEW AD
Sbjct:   185 ---AW---FFDN-----AKLNIKMVKQLAPNNWGLYDMTGNVWEWCAD 221

 Score = 114 (45.2 bits), Expect = 0.00041, P = 0.00041
 Identities = 42/191 (21%), Positives = 79/191 (41%)

Query:    55 VSNTQFQEFVSAT---------GYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFE 105
             V+N+ FQ ++ A           ++     F D + F+   ++ E++   ++    K   
Sbjct:    61 VTNSMFQHYLKAQPESLNEMEFSFIANKAYFSD-YYFK---NKSEKSLKDKIEWQSKNLL 116

Query:   106 GLDSTIEHRMHHPVVHISWNDAVAYCTWRGA------RLPTEAEWEYGCRGGLENRLFPW 159
                +  E+  + PVV + +   + YC W  +      RL TEAEWEY  R G +  +F W
Sbjct:   117 NYINKAENS-NLPVVGVDYFTCINYCHWLSSKLNCKVRLLTEAEWEYCARAGTDT-IFSW 174

Query:   160 GSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFP 219
             G+ + P    +      ++  +++     +A   W G    T   WE+ C      + + 
Sbjct:   175 GNEISPVAKHAWF---FDNAKLNIKMVKQLAPNNW-GLYDMTGNVWEW-CADKYSQKFYD 229

Query:   220 WGNNLTPRGEH 230
             + N   P+  H
Sbjct:   230 YSNKKDPKSTH 240


>UNIPROTKB|A0R5N0 [details] [associations]
            symbol:egtB "Iron(II)-dependent oxidoreductase EgtB"
            species:246196 "Mycobacterium smegmatis str. MC2 155" [GO:0008198
            "ferrous iron binding" evidence=IDA] [GO:0016491 "oxidoreductase
            activity" evidence=IDA] [GO:0052704 "ergothioneine biosynthesis
            from histidine via N-alpha,N-alpha,N-alpha-trimethyl-L-histidine"
            evidence=IDA] InterPro:IPR017806 UniPathway:UPA01014 GO:GO:0008198
            EMBL:CP000480 EMBL:CP001663 GenomeReviews:CP000480_GR
            InterPro:IPR016187 SUPFAM:SSF56436 GO:GO:0016491
            RefSeq:YP_006570814.1 RefSeq:YP_890468.1 ProteinModelPortal:A0R5N0
            STRING:A0R5N0 EnsemblBacteria:EBMYCT00000040878 GeneID:13427417
            GeneID:4533015 KEGG:msg:MSMEI_6088 KEGG:msm:MSMEG_6249
            PATRIC:18084741 eggNOG:COG1262 HOGENOM:HOG000253478 OMA:EYSEVFF
            ProtClustDB:CLSK872236 BioCyc:MSME246196:GJ4Y-6248-MONOMER
            GO:GO:0052704 Gene3D:3.90.1580.10 InterPro:IPR024775
            InterPro:IPR005532 Pfam:PF12867 Pfam:PF03781 TIGRFAMs:TIGR03440
            Uniprot:A0R5N0
        Length = 428

 Score = 170 (64.9 bits), Expect = 1.2e-09, Sum P(2) = 1.2e-09
 Identities = 62/208 (29%), Positives = 88/208 (42%)

Query:   118 PVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRM- 176
             PV +  W + +    +   R  +   W +    GL    F    W +P+G  +   H   
Sbjct:   208 PVTNAEWREFIDDGGYDQPRWWSPRGWAHRQEAGLVAPQF----W-NPDGTRTRFGHIEE 262

Query:   177 ---NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCR----GGLENRLFPWGNNLTPRGE 229
                + PV HV++ +A AY  W GARLPTE EWE  C      G   R FPWG+       
Sbjct:   263 IPGDEPVQHVTFFEAEAYAAWAGARLPTEIEWEKACAWDPVAGARRR-FPWGSA------ 315

Query:   230 HRANVWQGEFPTNNTAADGYLSTAPVMSYKE--NKFGLYNMVGNVWEWTAD----WWNVH 283
                   Q      N   D     APV +Y    + +G   M+G+VWEWT+     W    
Sbjct:   316 ------QPSAALANLGGDAR-RPAPVGAYPAGASAYGAEQMLGDVWEWTSSPLRPWPGFT 368

Query:   284 HHPAPSYN-P--KGPTTGTDKVKKGGSY 308
                   Y+ P  +G T+G  +V +GGS+
Sbjct:   369 PMIYERYSTPFFEGTTSGDYRVLRGGSW 396

 Score = 154 (59.3 bits), Expect = 8.2e-08, Sum P(2) = 8.2e-08
 Identities = 48/157 (30%), Positives = 74/157 (47%)

Query:    16 VLLPGDTFRMGTN---KPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
             VL+PG  F +G +   +P  + D E P+  V + +F + +  V+N +++EF+   GY  +
Sbjct:   168 VLVPGGPFVLGVDALTEPHSL-DNERPAHVVDIPSFRIGRVPVTNAEWREFIDDGGY--D 224

Query:    73 AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMH----HPVVHISWNDAV 128
               ++     + P      R +   V       +G  +   H        PV H+++ +A 
Sbjct:   225 QPRW-----WSPR-GWAHRQEAGLVAPQFWNPDGTRTRFGHIEEIPGDEPVQHVTFFEAE 278

Query:   129 AYCTWRGARLPTEAEWEYGCR----GGLENRLFPWGS 161
             AY  W GARLPTE EWE  C      G   R FPWGS
Sbjct:   279 AYAAWAGARLPTEIEWEKACAWDPVAGARRR-FPWGS 314

 Score = 38 (18.4 bits), Expect = 1.2e-09, Sum P(2) = 1.2e-09
 Identities = 7/16 (43%), Positives = 12/16 (75%)

Query:   343 KGPTTGTDKVKKGGSY 358
             +G T+G  +V +GGS+
Sbjct:   381 EGTTSGDYRVLRGGSW 396


>UNIPROTKB|Q4KIR8 [details] [associations]
            symbol:PFL_0723 "Uncharacterized protein" species:220664
            "Pseudomonas protegens Pf-5" [GO:0003674 "molecular_function"
            evidence=ND] [GO:0005575 "cellular_component" evidence=ND]
            [GO:0008150 "biological_process" evidence=ND] InterPro:IPR016187
            SUPFAM:SSF56436 EMBL:CP000076 GenomeReviews:CP000076_GR
            eggNOG:COG1262 Gene3D:3.90.1580.10 InterPro:IPR005532 Pfam:PF03781
            RefSeq:YP_257865.1 STRING:Q4KIR8 GeneID:3481312 KEGG:pfl:PFL_0723
            PATRIC:19870617 HOGENOM:HOG000225870 OMA:QEAKDYC
            ProtClustDB:CLSK838760 BioCyc:PFLU220664:GIX8-727-MONOMER
            Uniprot:Q4KIR8
        Length = 306

 Score = 159 (61.0 bits), Expect = 5.0e-09, P = 5.0e-09
 Identities = 42/130 (32%), Positives = 60/130 (46%)

Query:   185 WNDAVAYCTWRG------ARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGE 238
             W +A  YC W          LPTEA+WEY  R   +  +F   N     G +        
Sbjct:   146 WQEAKDYCGWLADLSGYAMDLPTEAQWEYAARNRGQPVMFATDNGNLDYGRN-------- 197

Query:   239 FPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH-PAPSYNPKGPTT 297
             FP    A D    T  V  +  N  GLYN+ GN  EW  DW++ +++  +P  NP+GP T
Sbjct:   198 FP----APDDN-ETFAVDRFVPNPLGLYNLTGNASEWVNDWYDKNYYRESPVENPQGPET 252

Query:   298 GTDKVKKGGS 307
             G  + ++G +
Sbjct:   253 GIYRAQRGAN 262


>UNIPROTKB|Q4K997 [details] [associations]
            symbol:pvdO "Chromophore maturation protein PvdO"
            species:220664 "Pseudomonas protegens Pf-5" [GO:0002049 "pyoverdine
            biosynthetic process" evidence=ISS] InterPro:IPR016187
            SUPFAM:SSF56436 EMBL:CP000076 GenomeReviews:CP000076_GR
            eggNOG:COG1262 Gene3D:3.90.1580.10 InterPro:IPR005532 Pfam:PF03781
            GO:GO:0002049 HOGENOM:HOG000135467 OMA:HANTYGP
            ProtClustDB:CLSK867237 RefSeq:YP_261186.1 ProteinModelPortal:Q4K997
            STRING:Q4K997 GeneID:3476390 KEGG:pfl:PFL_4089 PATRIC:19877579
            BioCyc:PFLU220664:GIX8-4123-MONOMER Uniprot:Q4K997
        Length = 297

 Score = 157 (60.3 bits), Expect = 7.8e-09, P = 7.8e-09
 Identities = 42/123 (34%), Positives = 58/123 (47%)

Query:   173 EHRMNHPVVHVSWNDAVAYCTW------RGARLPTEAEWEYGCRGGLENRLFPWGNNLTP 226
             E     P V V + D  AY  W      +  R+ +EAE EY  R G     FP+      
Sbjct:   119 EQGPRQPAVCVDYADVQAYTQWLSKKTGKHYRMVSEAEREYAARAGSTGS-FPFP--FDE 175

Query:   227 RGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP 286
              G+++       +       DG+  TAPV SY  N FG+Y+M GNV+EW AD W+  +  
Sbjct:   176 EGQYQITKHANTYGPK----DGFSFTAPVGSYPPNAFGMYDMHGNVYEWVADCWHPDYVG 231

Query:   287 APS 289
             AP+
Sbjct:   232 APA 234


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.318   0.134   0.450    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      394       394   0.00095  117 3  11 22  0.41    34
                                                     34  0.45    37


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  34
  No. of states in DFA:  623 (66 KB)
  Total size of DFA:  318 KB (2156 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  30.94u 0.11s 31.05t   Elapsed:  00:00:02
  Total cpu time:  30.95u 0.11s 31.06t   Elapsed:  00:00:02
  Start:  Thu Aug 15 17:06:25 2013   End:  Thu Aug 15 17:06:27 2013

Back to top