BLAST Results

Query Summary

Your job contains 1 sequence.

Parameters
Threshold: 0.001
Maximum number of alignments shown: 100
BLAST filter: on

Query Sequence

>psy11694
MSQPPWVSLGEKGLGYLHTFMIKVALLESNIFQTRGYLNSPVTRFLNFMRDHDKVYSSVE
DLLRRHENFVTNVEKAEDYQREDSGTAVFEVNKFFDLSDSDLQQLTGLNLDSTLEDIQPS
LQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGV
VEAMHAIQGNNLTELSVQHHDKVYSSVEDLLRRHENFVTNVEKAEDYQSEDSGTAVFGVN
KFFDLSESDLQQLTGLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAF
DWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSNGGCNG
GRMDDALQYIIDNGGVVSDQAYPYKASESERGCLVGEEEGFKVKVKEYSRIPYGEEEEMK
KWVATRGPLSVGMNANGLFYYSGGVIDLNQRLYGTSIPYWIVKNSWGSDWGEKVEDKVGS
SGNRTRDLELTGVLPSKLSRLATEKLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAY
PYKASESERGCLVGEEEGFKVKVKEYSRIPYGEEEEMKKWVATRGPLSVGMNANGLFYYS
GGVIDLNQRLCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEKVML

High Scoring Gene Products

Symbol, full name Information P value
tag-196 gene from Caenorhabditis elegans 1.2e-45
LOC100525853
Uncharacterized protein
protein from Sus scrofa 4.2e-43
SAG12
senescence-associated gene 12
protein from Arabidopsis thaliana 6.4e-42
cprA
cysteine proteinase 1
gene from Dictyostelium discoideum 1.8e-41
Cp1
Cysteine proteinase-1
protein from Drosophila melanogaster 1.7e-40
AT3G19390 protein from Arabidopsis thaliana 4.4e-40
Ctss
cathepsin S
protein from Mus musculus 5.6e-40
AT2G27420 protein from Arabidopsis thaliana 3.2e-39
AT2G34080 protein from Arabidopsis thaliana 3.3e-39
CEP1
cysteine endopeptidase 1
protein from Arabidopsis thaliana 7.3e-39
CTSL2
Uncharacterized protein
protein from Gallus gallus 1.3e-38
AT3G54940 protein from Arabidopsis thaliana 2.2e-38
Ctsl1
cathepsin L1
gene from Rattus norvegicus 2.3e-38
CG12163 protein from Drosophila melanogaster 2.5e-38
CTSS
Cathepsin S
protein from Homo sapiens 2.6e-38
AT3G49340 protein from Arabidopsis thaliana 5.0e-38
CTSS
Cathepsin S
protein from Canis lupus familiaris 1.3e-37
CTSS
Cathepsin S
protein from Bos taurus 1.5e-37
XCP2
AT1G20850
protein from Arabidopsis thaliana 1.6e-37
CTSS
Cathepsin S
protein from Canis lupus familiaris 1.7e-37
CTSW
Uncharacterized protein
protein from Canis lupus familiaris 2.2e-37
AT1G29090 protein from Arabidopsis thaliana 2.5e-37
CTSS
Uncharacterized protein
protein from Gallus gallus 4.7e-37
RD21A
responsive to dehydration 21A
protein from Arabidopsis thaliana 1.7e-36
LOC100662496
Uncharacterized protein
protein from Loxodonta africana 2.7e-36
LOC100153090
Uncharacterized protein
protein from Sus scrofa 5.2e-36
Ctsl
cathepsin L
protein from Mus musculus 7.2e-36
DDB_G0291191
cysteine protease
gene from Dictyostelium discoideum 9.4e-36
cpl-1 gene from Caenorhabditis elegans 1.1e-35
Ctsw
cathepsin W
gene from Rattus norvegicus 1.3e-35
RD21B
esponsive to dehydration 21B
protein from Arabidopsis thaliana 1.3e-35
AT3G19400 protein from Arabidopsis thaliana 1.5e-35
CTSL1
Cathepsin L1
protein from Gallus gallus 2.0e-35
CTSL2
Cathepsin L2
protein from Homo sapiens 2.4e-35
Ctss
Cathepsin S
protein from Rattus norvegicus 2.6e-35
Ctsw
cathepsin W
protein from Mus musculus 2.7e-35
Ctsll3
cathepsin L-like 3
gene from Rattus norvegicus 4.8e-35
RGD1308751
similar to Cathepsin L precursor (Major excreted protein) (MEP)
gene from Rattus norvegicus 1.3e-34
CEP3
cysteine endopeptidase 3
protein from Arabidopsis thaliana 1.6e-34
Ssc.54235
Cathepsin L1
protein from Sus scrofa 1.9e-34
AT2G21430 protein from Arabidopsis thaliana 1.9e-34
CTSL1
Cathepsin L1
protein from Bos taurus 2.4e-34
LOC420160
Uncharacterized protein
protein from Gallus gallus 3.4e-34
CTSL1
Cathepsin L1
protein from Sus scrofa 4.1e-34
R09F10.1 gene from Caenorhabditis elegans 5.4e-34
CTSW
Uncharacterized protein
protein from Bos taurus 1.0e-33
ctsk
cathepsin K
gene_product from Danio rerio 1.0e-33
4930486L24Rik
RIKEN cDNA 4930486L24 gene
protein from Mus musculus 1.0e-33
AT3G43960 protein from Arabidopsis thaliana 1.4e-33
ctskl
cathepsin K, like
gene_product from Danio rerio 1.4e-33
CTSL2
Cathepsin L2
protein from Bos taurus 2.3e-33
CTSF
Cathepsin F
protein from Homo sapiens 2.9e-33
CTSL1
Cathepsin L1
protein from Canis lupus familiaris 3.1e-33
ctsla
cathepsin La
gene_product from Danio rerio 4.8e-33
CTSH
Uncharacterized protein
protein from Canis lupus familiaris 5.5e-33
CTSL
Cathepsin L1
protein from Ovis aries 7.7e-33
ctssb.2
cathepsin Sb, tandem duplicate 2
gene_product from Danio rerio 8.3e-33
XBCP3
xylem bark cysteine peptidase 3
protein from Arabidopsis thaliana 9.7e-33
AT1G06260 protein from Arabidopsis thaliana 9.7e-33
CTSK
Cathepsin K
protein from Canis lupus familiaris 1.2e-32
CTSK
Cathepsin K
protein from Canis lupus familiaris 1.2e-32
XCP1
xylem cysteine peptidase 1
protein from Arabidopsis thaliana 1.3e-32
CTSH
Uncharacterized protein
protein from Equus caballus 1.4e-32
ALP
aleurain-like protease
protein from Arabidopsis thaliana 2.2e-32
ctssb.1
cathepsin Sb, tandem duplicate 1
gene_product from Danio rerio 2.8e-32
CTSH
Uncharacterized protein
protein from Oryctolagus cuniculus 3.0e-32
Testin
testin gene
gene from Rattus norvegicus 3.1e-32
ctsl.1
cathepsin L.1
gene_product from Danio rerio 4.5e-32
CTSH
Uncharacterized protein
protein from Ailuropoda melanoleuca 5.1e-32
CTSL1
Cathepsin L1
protein from Canis lupus familiaris 6.1e-32
CTSH
Pro-cathepsin H
protein from Bos taurus 6.2e-32
CTSH
Uncharacterized protein
protein from Callithrix jacchus 6.7e-32
CTSH
Uncharacterized protein
protein from Callithrix jacchus 6.7e-32
Ctsq
cathepsin Q
gene from Rattus norvegicus 8.0e-32
CTSL1
Cathepsin L1
protein from Bos taurus 8.1e-32
AT3G45310 protein from Arabidopsis thaliana 8.4e-32
ctsll
cathepsin L, like
gene_product from Danio rerio 1.5e-31
Ctss
cathepsin S
gene from Rattus norvegicus 3.6e-31
CP2
cysteine protease 2
protein from Arabidopsis thaliana 3.6e-31
CTSF
Uncharacterized protein
protein from Sus scrofa 3.6e-31
Ctsf
cathepsin F
protein from Mus musculus 4.9e-31
CTSW
Cathepsin W
protein from Homo sapiens 5.4e-31
P83654
Ervatamin-C
protein from Tabernaemontana divaricata 5.9e-31
zgc:174153 gene_product from Danio rerio 6.2e-31
PF11_0165
falcipain 2 precursor
gene from Plasmodium falciparum 9.7e-31
PF11_0165
Falcipain-2A
protein from Plasmodium falciparum 3D7 9.7e-31
ctslb
cathepsin Lb
gene_product from Danio rerio 1.0e-30
ctssa
cathepsin Sa
gene_product from Danio rerio 1.2e-30
PF11_0161
falcipain-2 precursor, putative
gene from Plasmodium falciparum 1.2e-30
PF11_0161
Falcipain-2B
protein from Plasmodium falciparum 3D7 1.2e-30
Ctsk
cathepsin K
gene from Rattus norvegicus 1.3e-30
Ctsk
cathepsin K
protein from Mus musculus 1.3e-30
CTSW
Cathepsin W
protein from Homo sapiens 1.4e-30
CTSF
Uncharacterized protein
protein from Canis lupus familiaris 1.5e-30
AT4G23520 protein from Arabidopsis thaliana 1.6e-30
ctso
cathepsin O
gene_product from Danio rerio 2.0e-30

The BLAST search returned 4 gene products which did not match your query constraints. Please see the full BLAST report below for the details.

Back to top

Raw Blast Data

BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]

Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.

Reference:  Gish, W. (1996-2006) http://blast.wustl.edu

Query=  psy11694
        (655 letters)

Database:  go_20130330-seqdb.fasta
           368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done

                                                                     Smallest
                                                                       Sum
                                                              High  Probability
Sequences producing High-scoring Segment Pairs:              Score  P(N)      N

WB|WBGene00007055 - symbol:tag-196 species:6239 "Caenorha...   451  1.2e-45   2
UNIPROTKB|F1RU23 - symbol:CTSW "Uncharacterized protein" ...   358  4.2e-43   2
UNIPROTKB|Q4QRC2 - symbol:Ctsql2 "Protein Ctsql2" species...   343  8.9e-43   2
UNIPROTKB|E9PSK9 - symbol:Ctsql2 "Protein Ctsql2" species...   380  1.5e-42   2
TAIR|locus:2152445 - symbol:SAG12 "senescence-associated ...   431  6.4e-42   2
DICTYBASE|DDB_G0290957 - symbol:cprA "cysteine proteinase...   325  1.8e-41   2
FB|FBgn0013770 - symbol:Cp1 "Cysteine proteinase-1" speci...   384  1.7e-40   2
TAIR|locus:2090614 - symbol:AT3G19390 species:3702 "Arabi...   391  4.4e-40   2
MGI|MGI:107341 - symbol:Ctss "cathepsin S" species:10090 ...   397  5.6e-40   2
TAIR|locus:2038588 - symbol:AT2G27420 species:3702 "Arabi...   388  3.2e-39   2
TAIR|locus:2055440 - symbol:AT2G34080 species:3702 "Arabi...   399  3.3e-39   2
TAIR|locus:2157712 - symbol:CEP1 "cysteine endopeptidase ...   401  7.3e-39   2
UNIPROTKB|F1NYJ1 - symbol:CTSL2 "Uncharacterized protein"...   353  1.3e-38   2
TAIR|locus:2082687 - symbol:AT3G54940 species:3702 "Arabi...   281  2.2e-38   3
RGD|2448 - symbol:Ctsl1 "cathepsin L1" species:10116 "Rat...   308  2.3e-38   3
FB|FBgn0260462 - symbol:CG12163 species:7227 "Drosophila ...   344  2.5e-38   4
UNIPROTKB|P25774 - symbol:CTSS "Cathepsin S" species:9606...   383  2.6e-38   2
TAIR|locus:2082881 - symbol:AT3G49340 species:3702 "Arabi...   386  5.0e-38   2
UNIPROTKB|Q8HY81 - symbol:CTSS "Cathepsin S" species:9615...   379  1.3e-37   2
UNIPROTKB|P25326 - symbol:CTSS "Cathepsin S" species:9913...   377  1.5e-37   2
TAIR|locus:2030427 - symbol:XCP2 "xylem cysteine peptidas...   406  1.6e-37   1
UNIPROTKB|F1PAK0 - symbol:CTSS "Cathepsin S" species:9615...   378  1.7e-37   2
UNIPROTKB|E2RPX3 - symbol:CTSW "Uncharacterized protein" ...   334  2.2e-37   2
TAIR|locus:2029924 - symbol:AT1G29090 species:3702 "Arabi...   390  2.5e-37   2
UNIPROTKB|H9KYW5 - symbol:CTSS "Uncharacterized protein" ...   369  4.7e-37   2
TAIR|locus:2825832 - symbol:RD21A "responsive to dehydrat...   397  1.7e-36   1
UNIPROTKB|G3SSC1 - symbol:CTSH "Uncharacterized protein" ...   295  2.7e-36   3
UNIPROTKB|F1NEC8 - symbol:CTSL2 "Uncharacterized protein"...   361  4.5e-36   2
UNIPROTKB|F1SS93 - symbol:CTSS "Uncharacterized protein" ...   364  5.2e-36   2
MGI|MGI:88564 - symbol:Ctsl "cathepsin L" species:10090 "...   304  7.2e-36   2
DICTYBASE|DDB_G0291191 - symbol:DDB_G0291191 "cysteine pr...   293  9.4e-36   2
WB|WBGene00000776 - symbol:cpl-1 species:6239 "Caenorhabd...   354  1.1e-35   2
RGD|1309354 - symbol:Ctsw "cathepsin W" species:10116 "Ra...   320  1.3e-35   2
TAIR|locus:2167821 - symbol:RD21B "esponsive to dehydrati...   389  1.3e-35   1
TAIR|locus:2090629 - symbol:AT3G19400 species:3702 "Arabi...   358  1.5e-35   2
UNIPROTKB|P09648 - symbol:CTSL1 "Cathepsin L1" species:90...   358  2.0e-35   2
UNIPROTKB|O60911 - symbol:CTSL2 "Cathepsin L2" species:96...   278  2.4e-35   3
UNIPROTKB|D3ZZR3 - symbol:D3ZZR3 "Uncharacterized protein...   348  2.6e-35   2
MGI|MGI:1338045 - symbol:Ctsw "cathepsin W" species:10090...   320  2.7e-35   2
RGD|1560071 - symbol:Ctsll3 "cathepsin L-like 3" species:...   328  4.8e-35   2
RGD|1308751 - symbol:RGD1308751 "similar to Cathepsin L p...   325  1.3e-34   2
TAIR|locus:505006391 - symbol:CEP3 "cysteine endopeptidas...   341  1.6e-34   2
UNIPROTKB|F1S4J6 - symbol:Ssc.54235 "Cathepsin L1" specie...   299  1.9e-34   2
TAIR|locus:2050145 - symbol:AT2G21430 species:3702 "Arabi...   266  1.9e-34   3
UNIPROTKB|P25975 - symbol:CTSL1 "Cathepsin L1" species:99...   294  2.4e-34   2
UNIPROTKB|F1NZ37 - symbol:LOC420160 "Uncharacterized prot...   279  3.4e-34   3
UNIPROTKB|Q28944 - symbol:CTSL1 "Cathepsin L1" species:98...   289  4.1e-34   2
WB|WBGene00019986 - symbol:R09F10.1 species:6239 "Caenorh...   344  5.4e-34   2
UNIPROTKB|F1MHV4 - symbol:CTSW "Uncharacterized protein" ...   269  1.0e-33   3
ZFIN|ZDB-GENE-001205-4 - symbol:ctsk "cathepsin K" specie...   340  1.0e-33   2
MGI|MGI:1922258 - symbol:4930486L24Rik "RIKEN cDNA 493048...   265  1.0e-33   3
TAIR|locus:2097104 - symbol:AT3G43960 species:3702 "Arabi...   371  1.4e-33   1
ZFIN|ZDB-GENE-050208-336 - symbol:ctskl "cathepsin K, lik...   316  1.4e-33   2
UNIPROTKB|E9PTT3 - symbol:Ctsr "Protein Ctsr" species:101...   253  2.1e-33   3
UNIPROTKB|Q5E998 - symbol:CTSL2 "Cathepsin L2" species:99...   285  2.3e-33   2
UNIPROTKB|Q9UBX1 - symbol:CTSF "Cathepsin F" species:9606...   312  2.9e-33   3
UNIPROTKB|Q9GL24 - symbol:CTSL1 "Cathepsin L1" species:96...   289  3.1e-33   2
ZFIN|ZDB-GENE-030131-106 - symbol:ctsl1a "cathepsin L, 1 ...   285  4.8e-33   2
UNIPROTKB|F6X9C1 - symbol:CTSH "Uncharacterized protein" ...   282  5.5e-33   3
UNIPROTKB|Q10991 - symbol:CTSL "Cathepsin L1" species:994...   330  7.7e-33   2
ZFIN|ZDB-GENE-050626-55 - symbol:ctssb.2 "cathepsin S, b....   350  8.3e-33   2
TAIR|locus:2024362 - symbol:XBCP3 "xylem bark cysteine pe...   331  9.7e-33   2
TAIR|locus:2038515 - symbol:AT1G06260 species:3702 "Arabi...   347  9.7e-33   3
UNIPROTKB|G1K2A7 - symbol:CTSK "Cathepsin K" species:9615...   326  1.2e-32   2
UNIPROTKB|Q3ZKN1 - symbol:CTSK "Cathepsin K" species:9615...   326  1.2e-32   2
TAIR|locus:2122113 - symbol:XCP1 "xylem cysteine peptidas...   362  1.3e-32   1
UNIPROTKB|F7BJD8 - symbol:CTSH "Uncharacterized protein" ...   280  1.4e-32   3
TAIR|locus:2175088 - symbol:ALP "aleurain-like protease" ...   324  2.2e-32   2
ZFIN|ZDB-GENE-050522-559 - symbol:ctssb.1 "cathepsin S, b...   359  2.8e-32   1
UNIPROTKB|G1SQF0 - symbol:CTSH "Uncharacterized protein" ...   300  3.0e-32   2
RGD|708447 - symbol:Testin "testin gene" species:10116 "R...   277  3.1e-32   2
ZFIN|ZDB-GENE-040718-61 - symbol:ctsl.1 "cathepsin L.1" s...   333  4.5e-32   2
UNIPROTKB|G1M0X4 - symbol:CTSH "Uncharacterized protein" ...   291  5.1e-32   3
UNIPROTKB|F1PMM9 - symbol:CTSL1 "Cathepsin L1" species:96...   269  6.1e-32   2
UNIPROTKB|Q3T0I2 - symbol:CTSH "Pro-cathepsin H" species:...   284  6.2e-32   3
UNIPROTKB|F7B939 - symbol:CTSH "Uncharacterized protein" ...   287  6.7e-32   3
UNIPROTKB|F7BRD4 - symbol:CTSH "Uncharacterized protein" ...   287  6.7e-32   3
RGD|631421 - symbol:Ctsq "cathepsin Q" species:10116 "Rat...   290  8.0e-32   2
UNIPROTKB|A4IFS7 - symbol:CTSL1 "CTSL1 protein" species:9...   272  8.1e-32   2
TAIR|locus:2078312 - symbol:AT3G45310 species:3702 "Arabi...   315  8.4e-32   2
ZFIN|ZDB-GENE-041010-76 - symbol:ctsll "cathepsin L, like...   272  1.5e-31   2
RGD|621513 - symbol:Ctss "cathepsin S" species:10116 "Rat...   349  3.6e-31   1
TAIR|locus:2128253 - symbol:AT4G11320 species:3702 "Arabi...   349  3.6e-31   1
UNIPROTKB|F1RU48 - symbol:CTSF "Uncharacterized protein" ...   299  3.6e-31   3
MGI|MGI:1861434 - symbol:Ctsf "cathepsin F" species:10090...   297  4.9e-31   3
UNIPROTKB|E9PI30 - symbol:CTSW "Cathepsin W" species:9606...   279  5.4e-31   2
UNIPROTKB|P83654 - symbol:P83654 "Ervatamin-C" species:52...   347  5.9e-31   1
ZFIN|ZDB-GENE-080215-7 - symbol:zgc:174153 "zgc:174153" s...   266  6.2e-31   2
GENEDB_PFALCIPARUM|PF11_0165 - symbol:PF11_0165 "falcipai...   294  9.7e-31   2
UNIPROTKB|Q8I6U4 - symbol:PF11_0165 "Falcipain-2A" specie...   294  9.7e-31   2
ZFIN|ZDB-GENE-980526-285 - symbol:ctsl1b "cathepsin L, 1 ...   264  1.0e-30   2
ZFIN|ZDB-GENE-040426-1583 - symbol:ctssa "cathepsin S, a"...   302  1.2e-30   2
GENEDB_PFALCIPARUM|PF11_0161 - symbol:PF11_0161 "falcipai...   292  1.2e-30   2
UNIPROTKB|Q8I6U5 - symbol:PF11_0161 "Falcipain-2B" specie...   292  1.2e-30   2
RGD|61810 - symbol:Ctsk "cathepsin K" species:10116 "Ratt...   344  1.3e-30   1
MGI|MGI:107823 - symbol:Ctsk "cathepsin K" species:10090 ...   344  1.3e-30   1
UNIPROTKB|P56202 - symbol:CTSW "Cathepsin W" species:9606...   279  1.4e-30   2
UNIPROTKB|E2RR02 - symbol:CTSF "Uncharacterized protein" ...   299  1.5e-30   3
TAIR|locus:2117979 - symbol:AT4G23520 species:3702 "Arabi...   343  1.6e-30   1
ZFIN|ZDB-GENE-080724-8 - symbol:ctso "cathepsin O" specie...   310  2.0e-30   2

WARNING:  Descriptions of 185 database sequences were not reported due to the
          limiting value of parameter V = 100.


>WB|WBGene00007055 [details] [associations]
            symbol:tag-196 species:6239 "Caenorhabditis elegans"
            [GO:0004869 "cysteine-type endopeptidase inhibitor activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000010
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00031 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00043 SMART:SM00645 InterPro:IPR000169
            GO:GO:0004869 eggNOG:COG4870 HOGENOM:HOG000230774 KO:K01373
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00660000095458 EMBL:FO080488 PIR:T31871
            RefSeq:NP_505215.2 HSSP:Q9UBX1 ProteinModelPortal:O16454 SMR:O16454
            DIP:DIP-27400N IntAct:O16454 MINT:MINT-1044990 MEROPS:C01.A50
            PaxDb:O16454 EnsemblMetazoa:F41E6.6.1 EnsemblMetazoa:F41E6.6.2
            EnsemblMetazoa:F41E6.6.3 GeneID:179240 KEGG:cel:CELE_F41E6.6
            UCSC:F41E6.6.1 CTD:179240 WormBase:F41E6.6 InParanoid:O16454
            OMA:GGGLMTN NextBio:904514 Uniprot:O16454
        Length = 477

 Score = 451 (163.8 bits), Expect = 1.2e-42, P = 1.2e-42
 Identities = 108/292 (36%), Positives = 151/292 (51%)

Query:   197 VQHHDKVYSSVEDLLRRHENFVTNVEKAEDYQSEDSGTAVFGVNKFFDLSESDLQQLTGL 256
             V  H+K Y++  ++L+R   F  N +   + Q  + GTAV+G  KF D++  + +++   
Sbjct:   178 VDRHEKKYTNKREVLKRFRVFKKNAKVIRELQKNEQGTAVYGFTKFSDMTTMEFKKIM-- 235

Query:   257 NLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVKEQGK 316
              L    E  QP     +   Q + E      N     +DLPE+FDWR +G +++VK QG 
Sbjct:   236 -LPYQWE--QPV----YPMEQANFEKHDVTINE----EDLPESFDWREKGAVTQVKNQGN 284

Query:   317 CACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSNGGCNGGRMDDALQYIIDNGGV 376
             C  CWAFS  G VE    I  N L  LS Q+LVDCD  + GCNGG   +A + II  GG+
Sbjct:   285 CGSCWAFSTTGNVEGAWFIAKNKLVSLSEQELVDCDSMDQGCNGGLPSNAYKEIIRMGGL 344

Query:   377 VSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNAN 436
               + AYPY     E   L                +P+ +E EM+KW+ T+GP+S+G+NAN
Sbjct:   345 EPEDAYPYDG-RGETCHLVRKDIAVYINGSV--ELPH-DEVEMQKWLVTKGPISIGLNAN 400

Query:   437 GLFYYSGGVID----------LNQRL----YGTS--IPYWIVKNSWGSDWGE 472
              L +Y  GV+           LN  +    YG     PYWIVKNSWG +WGE
Sbjct:   401 TLQFYRHGVVHPFKIFCEPFMLNHGVLIVGYGKDGRKPYWIVKNSWGPNWGE 452

 Score = 315 (115.9 bits), Expect = 1.2e-45, Sum P(2) = 1.2e-45
 Identities = 67/159 (42%), Positives = 92/159 (57%)

Query:   493 VLPSKLSRLATEKLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCL 552
             +  +KL  L+ ++LVDCD  + GCNGG   +A + II  GG+  + AYPY     E   L
Sbjct:   303 IAKNKLVSLSEQELVDCDSMDQGCNGGLPSNAYKEIIRMGGLEPEDAYPYDG-RGETCHL 361

Query:   553 XXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLFYYSGGVIDLNQRLCN 612
                             +P+ +E EM+KW+ T+GP+S+G+NAN L +Y  GV+   +  C 
Sbjct:   362 VRKDIAVYINGSV--ELPH-DEVEMQKWLVTKGPISIGLNANTLQFYRHGVVHPFKIFCE 418

Query:   613 PKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGE 651
             P   NH ++IVGYG    KDG   PYWIVKNSWG +WGE
Sbjct:   419 PFMLNHGVLIVGYG----KDGRK-PYWIVKNSWGPNWGE 452

 Score = 228 (85.3 bits), Expect = 1.2e-45, Sum P(2) = 1.2e-45
 Identities = 57/173 (32%), Positives = 88/173 (50%)

Query:    26 LLESNIFQTRGYLNSPVTRFLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAEDYQREDSG 85
             L ++ I + R Y+      FL+F+  H+K Y++  ++L+R   F  N +   + Q+ + G
Sbjct:   157 LRKAKIIRPRDYVIW--NSFLDFVDRHEKKYTNKREVLKRFRVFKKNAKVIRELQKNEQG 214

Query:    86 TAVFEVNKFFDLSDSDLQQLTGLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHG 145
             TAV+   KF D++  + +++    L    E  QP     +   Q + E      N     
Sbjct:   215 TAVYGFTKFSDMTTMEFKKIM---LPYQWE--QPV----YPMEQANFEKHDVTINE---- 261

Query:   146 DDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
             +DLPE+FDWR +G +++VK QG C  CWAFS  G VE    I  N L  LS Q
Sbjct:   262 EDLPESFDWREKGAVTQVKNQGNCGSCWAFSTTGNVEGAWFIAKNKLVSLSEQ 314


>UNIPROTKB|F1RU23 [details] [associations]
            symbol:CTSW "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00639
            GeneTree:ENSGT00660000095458 KO:K08569 EMBL:CU928325
            RefSeq:XP_003122571.1 UniGene:Ssc.28940 Ensembl:ENSSSCT00000014177
            GeneID:100525853 KEGG:ssc:100525853 OMA:CWAMAAV Uniprot:F1RU23
        Length = 367

 Score = 358 (131.1 bits), Expect = 1.3e-40, Sum P(2) = 1.3e-40
 Identities = 87/248 (35%), Positives = 132/248 (53%)

Query:   200 HDKVYSSVEDLLRRHENFVTNVEKAEDYQSEDSGTAVFGVNKFFDLSESDLQQLTGLNLD 259
             +++ YS+  +  RR + F  N+ KA+  Q ED GTA FGV  F DL+E +  QL G +  
Sbjct:    49 YNRSYSNPAEHARRLDIFAQNLAKAQRLQEEDLGTAEFGVTPFSDLTEEEFGQLHGHHWG 108

Query:   260 STLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAE-GVISKVKEQGKCA 318
             +      PS+     S ++              G+ +P++ DWR + GVIS +K Q  C 
Sbjct:   109 AGKA---PSMGIKVGSEES--------------GETVPQSCDWRKKPGVISAIKHQKDCN 151

Query:   319 CCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSNGGCNGGRMDDALQYIIDNGGVVS 378
             CCWA +AV  VEA  AI+ +   +LSVQQ++DCD    GCNGG + DA   +++  G+ S
Sbjct:   152 CCWAMAAVDNVEAQWAIKYHQAVQLSVQQVLDCDRCGNGCNGGFVWDAFLTVLNTSGLAS 211

Query:   379 DQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGL 438
             +Q YPYK +     CL             +  + + E+  + +++AT GP++V +NA  L
Sbjct:   212 EQDYPYKGTVKTHRCLAKQHRKVAWIQD-FLMLQFCEQS-IARYLATEGPITVTINAGLL 269

Query:   439 FYYSGGVI 446
               Y  GVI
Sbjct:   270 QQYKRGVI 277

 Score = 289 (106.8 bits), Expect = 4.2e-43, Sum P(2) = 4.2e-43
 Identities = 56/159 (35%), Positives = 91/159 (57%)

Query:   500 RLATEKLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXX 559
             +L+ ++++DCD    GCNGG + DA   +++  G+ S+Q YPYK +     CL       
Sbjct:   175 QLSVQQVLDCDRCGNGCNGGFVWDAFLTVLNTSGLASEQDYPYKGTVKTHRCLAKQHRKV 234

Query:   560 XXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLFYYSGGVIDLNQRLCNPKAQNHA 619
                   +  + + E+  + +++AT GP++V +NA  L  Y  GVI      C+P   NH+
Sbjct:   235 AWIQD-FLMLQFCEQS-IARYLATEGPITVTINAGLLQQYKRGVIRATPATCDPHLVNHS 292

Query:   620 LIIVGYGEEEKKDGT------SIPYWIVKNSWGSDWGEK 652
             +++VG+G+ +  +G       SIPYWI+KNSWG DWGE+
Sbjct:   293 VLLVGFGKSKSVEGRRPRPGHSIPYWILKNSWGPDWGEE 331

 Score = 199 (75.1 bits), Expect = 4.2e-43, Sum P(2) = 4.2e-43
 Identities = 54/155 (34%), Positives = 80/155 (51%)

Query:    45 FLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAEDYQREDSGTAVFEVNKFFDLSDSDLQQ 104
             F  F   +++ YS+  +  RR + F  N+ KA+  Q ED GTA F V  F DL++ +  Q
Sbjct:    42 FTLFQIQYNRSYSNPAEHARRLDIFAQNLAKAQRLQEEDLGTAEFGVTPFSDLTEEEFGQ 101

Query:   105 LTGLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAE-GVISKV 163
             L G +  +      PS+     S ++              G+ +P++ DWR + GVIS +
Sbjct:   102 LHGHHWGAGKA---PSMGIKVGSEES--------------GETVPQSCDWRKKPGVISAI 144

Query:   164 KEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
             K Q  C CCWA +AV  VEA  AI+ +   +LSVQ
Sbjct:   145 KHQKDCNCCWAMAAVDNVEAQWAIKYHQAVQLSVQ 179

 Score = 102 (41.0 bits), Expect = 1.3e-40, Sum P(2) = 1.3e-40
 Identities = 16/20 (80%), Positives = 18/20 (90%)

Query:   454 GTSIPYWIVKNSWGSDWGEK 473
             G SIPYWI+KNSWG DWGE+
Sbjct:   312 GHSIPYWILKNSWGPDWGEE 331


>UNIPROTKB|Q4QRC2 [details] [associations]
            symbol:Ctsql2 "Protein Ctsql2" species:10116 "Rattus
            norvegicus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00639 HOVERGEN:HBG011513 EMBL:CH474032
            RGD:1303225 EMBL:BC097257 IPI:IPI00421946 RefSeq:NP_001002813.2
            UniGene:Rn.128678 SMR:Q4QRC2 MEROPS:C01.111
            Ensembl:ENSRNOT00000038758 GeneID:408201 KEGG:rno:408201 CTD:408201
            InParanoid:Q4QRC2 OMA:NDEGALM NextBio:696394 Genevestigator:Q4QRC2
            Uniprot:Q4QRC2
        Length = 343

 Score = 343 (125.8 bits), Expect = 1.6e-30, P = 1.6e-30
 Identities = 86/257 (33%), Positives = 136/257 (52%)

Query:   200 HDKVYSSVEDLLRRHENFVTNVEKAEDYQSEDS---GTAVFGVNKFFDLSESDLQQL-TG 255
             ++K+YS  E+LL+R   +  NV+K E +  E+S    T +  +N F DL++ + + + TG
Sbjct:    36 YEKLYSPEEELLKRVV-WEENVKKIELHNRENSLGKNTYIMEINNFADLTDEEFKDMITG 94

Query:   256 LNL--DSTLEDI-QPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVK 312
             + L  ++T++ + + +L +PF              NS    D LP++ DWR EG +++V+
Sbjct:    95 ITLPINNTMKSLWKRALGSPFP-------------NSWYWRDALPKSIDWRKEGYVTRVR 141

Query:   313 EQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSNG--GCNGGRMDDALQYI 370
             EQGKC  CWAF   G +E     +   LT LSVQ LVDC    G  GC GG   +A QY+
Sbjct:   142 EQGKCKSCWAFPVAGAIEGQMFKKTGKLTPLSVQNLVDCSKPQGNKGCRGGTTYNAFQYV 201

Query:   371 IDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLS 430
             + NGG+ S+  YPYK  E   G               +  +P  E+  M   +AT+GP++
Sbjct:   202 LQNGGLESEATYPYKGKE---GLCKYNPKNAYAKITRFVALPEDEDVLMDA-LATKGPVA 257

Query:   431 VGMNA--NGLFYYSGGV 445
              G++   + L +Y  G+
Sbjct:   258 AGIHVVYSSLRFYKKGI 274

 Score = 263 (97.6 bits), Expect = 8.9e-43, Sum P(2) = 8.9e-43
 Identities = 58/161 (36%), Positives = 88/161 (54%)

Query:   497 KLSRLATEKLVDCDMSNG--GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXX 554
             KL+ L+ + LVDC    G  GC GG   +A QY++ NGG+ S+  YPYK  E   G    
Sbjct:   168 KLTPLSVQNLVDCSKPQGNKGCRGGTTYNAFQYVLQNGGLESEATYPYKGKE---GLCKY 224

Query:   555 XXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNA--NGLFYYSGGVIDLNQRLCN 612
                        +  +P  E+  M   +AT+GP++ G++   + L +Y  G+   ++  CN
Sbjct:   225 NPKNAYAKITRFVALPEDEDVLMDA-LATKGPVAAGIHVVYSSLRFYKKGIY--HEPKCN 281

Query:   613 PKAQNHALIIVGYG-EEEKKDGTSIPYWIVKNSWGSDWGEK 652
              +  NHA+++VGYG E  + DG +  YW++KNSWG  WG K
Sbjct:   282 NRV-NHAVLVVGYGFEGNETDGNN--YWLIKNSWGKQWGLK 319

 Score = 223 (83.6 bits), Expect = 8.9e-43, Sum P(2) = 8.9e-43
 Identities = 56/155 (36%), Positives = 89/155 (57%)

Query:    52 HDKVYSSVEDLLRRHENFVTNVEKAEDYQREDS---GTAVFEVNKFFDLSDSDLQQL-TG 107
             ++K+YS  E+LL+R   +  NV+K E + RE+S    T + E+N F DL+D + + + TG
Sbjct:    36 YEKLYSPEEELLKRVV-WEENVKKIELHNRENSLGKNTYIMEINNFADLTDEEFKDMITG 94

Query:   108 LNL--DSTLEDI-QPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVK 164
             + L  ++T++ + + +L +PF              NS    D LP++ DWR EG +++V+
Sbjct:    95 ITLPINNTMKSLWKRALGSPFP-------------NSWYWRDALPKSIDWRKEGYVTRVR 141

Query:   165 EQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQH 199
             EQGKC  CWAF   G +E     +   LT LSVQ+
Sbjct:   142 EQGKCKSCWAFPVAGAIEGQMFKKTGKLTPLSVQN 176

 Score = 104 (41.7 bits), Expect = 8.6e-23, Sum P(2) = 8.6e-23
 Identities = 36/144 (25%), Positives = 54/144 (37%)

Query:   355 NGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASES--ERGCLXXXXXXXXXXXXXYSRIP 412
             N GC GG   +A QY++ NGG+ S+  YPYK  E   +                      
Sbjct:   186 NKGCRGGTTYNAFQYVLQNGGLESEATYPYKGKEGLCKYNPKNAYAKITRFVALPEDEDV 245

Query:   413 YGEEEEMKKWVATRGPL---SVGMNANGLFY-------YSGGVIDLNQRLYGTSIP---Y 459
               +    K  VA    +   S+     G+++        +  V+ +     G       Y
Sbjct:   246 LMDALATKGPVAAGIHVVYSSLRFYKKGIYHEPKCNNRVNHAVLVVGYGFEGNETDGNNY 305

Query:   460 WIVKNSWGSDWGEKVEDKVGSSGN 483
             W++KNSWG  WG K   K+    N
Sbjct:   306 WLIKNSWGKQWGLKGYMKIAKDRN 329


>UNIPROTKB|E9PSK9 [details] [associations]
            symbol:Ctsql2 "Protein Ctsql2" species:10116 "Rattus
            norvegicus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            IPI:IPI00562656 Ensembl:ENSRNOT00000045847 RGD:1303225
            ArrayExpress:E9PSK9 Uniprot:E9PSK9
        Length = 342

 Score = 380 (138.8 bits), Expect = 1.3e-34, P = 1.3e-34
 Identities = 101/311 (32%), Positives = 155/311 (49%)

Query:   200 HDKVYSSVEDLLRRHENFVTNVEKAEDYQSEDS---GTAVFGVNKFFDLSESDLQQL-TG 255
             ++K+YS  E+LL+R   +  NV+K E +  E+S    T +  +N F DL++ + + + TG
Sbjct:    36 YEKLYSPEEELLKRVV-WEENVKKIELHNRENSLGKNTYIMEINNFADLTDEEFKDMITG 94

Query:   256 LNL--DSTLEDI-QPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVK 312
             + L  ++T++ + + +L +PF              NS    D LP++ DWR EG +++V+
Sbjct:    95 ITLPINNTMKSLWKRALGSPFP-------------NSWYWRDALPKSIDWRKEGYVTRVR 141

Query:   313 EQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSNG--GCNGGRMDDALQYI 370
             EQGKC  CWAF   G +E     +   LT LSVQ LVDC    G  GC GG   +A QY+
Sbjct:   142 EQGKCKSCWAFPVAGAIEGQMFKKTGKLTPLSVQNLVDCSKPQGNKGCRGGTTYNAFQYV 201

Query:   371 IDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLS 430
             + NGG+ S+  YPYK  E   G               +  +P  E+  M   +AT+GP++
Sbjct:   202 LQNGGLESEATYPYKGKE---GLCKYNPKNAYAKITRFVALPEDEDVLMDA-LATKGPVA 257

Query:   431 VGMNA-NGLFYYSGGVID---LNQRL--------YG------TSIPYWIVKNSWGSDWGE 472
              G++     F++  G+      N R+        YG          YW++KNSWG  WG 
Sbjct:   258 AGIHVVYSYFHFVSGIYHEPKCNNRVNHAVLVVGYGFEGNETDGNNYWLIKNSWGKQWGL 317

Query:   473 KVEDKVGSSGN 483
             K   K+    N
Sbjct:   318 KGYMKIAKDRN 328

 Score = 261 (96.9 bits), Expect = 1.5e-42, Sum P(2) = 1.5e-42
 Identities = 57/160 (35%), Positives = 87/160 (54%)

Query:   497 KLSRLATEKLVDCDMSNG--GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXX 554
             KL+ L+ + LVDC    G  GC GG   +A QY++ NGG+ S+  YPYK  E   G    
Sbjct:   168 KLTPLSVQNLVDCSKPQGNKGCRGGTTYNAFQYVLQNGGLESEATYPYKGKE---GLCKY 224

Query:   555 XXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNA-NGLFYYSGGVIDLNQRLCNP 613
                        +  +P  E+  M   +AT+GP++ G++     F++  G+   ++  CN 
Sbjct:   225 NPKNAYAKITRFVALPEDEDVLMDA-LATKGPVAAGIHVVYSYFHFVSGIY--HEPKCNN 281

Query:   614 KAQNHALIIVGYG-EEEKKDGTSIPYWIVKNSWGSDWGEK 652
             +  NHA+++VGYG E  + DG +  YW++KNSWG  WG K
Sbjct:   282 RV-NHAVLVVGYGFEGNETDGNN--YWLIKNSWGKQWGLK 318

 Score = 223 (83.6 bits), Expect = 1.5e-42, Sum P(2) = 1.5e-42
 Identities = 56/155 (36%), Positives = 89/155 (57%)

Query:    52 HDKVYSSVEDLLRRHENFVTNVEKAEDYQREDS---GTAVFEVNKFFDLSDSDLQQL-TG 107
             ++K+YS  E+LL+R   +  NV+K E + RE+S    T + E+N F DL+D + + + TG
Sbjct:    36 YEKLYSPEEELLKRVV-WEENVKKIELHNRENSLGKNTYIMEINNFADLTDEEFKDMITG 94

Query:   108 LNL--DSTLEDI-QPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVK 164
             + L  ++T++ + + +L +PF              NS    D LP++ DWR EG +++V+
Sbjct:    95 ITLPINNTMKSLWKRALGSPFP-------------NSWYWRDALPKSIDWRKEGYVTRVR 141

Query:   165 EQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQH 199
             EQGKC  CWAF   G +E     +   LT LSVQ+
Sbjct:   142 EQGKCKSCWAFPVAGAIEGQMFKKTGKLTPLSVQN 176


>TAIR|locus:2152445 [details] [associations]
            symbol:SAG12 "senescence-associated gene 12" species:3702
            "Arabidopsis thaliana" [GO:0005576 "extracellular region"
            evidence=ISM] [GO:0006508 "proteolysis" evidence=IEA;ISS]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA;ISS]
            [GO:0009723 "response to ethylene stimulus" evidence=IEP]
            [GO:0007568 "aging" evidence=IEP;TAS] [GO:0010150 "leaf senescence"
            evidence=IEP;TAS] [GO:0010282 "senescence-associated vacuole"
            evidence=IDA] [GO:0009817 "defense response to fungus, incompatible
            interaction" evidence=IEP] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            EMBL:CP002688 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0010150 GO:GO:0009817 EMBL:AB016870
            HSSP:O65039 OMA:NDEQALM EMBL:AF370131 EMBL:AY040073 IPI:IPI00544181
            RefSeq:NP_568651.1 UniGene:At.75256 UniGene:At.7710
            ProteinModelPortal:Q9FJ47 SMR:Q9FJ47 IntAct:Q9FJ47 STRING:Q9FJ47
            MEROPS:C01.117 PRIDE:Q9FJ47 ProMEX:Q9FJ47 EnsemblPlants:AT5G45890.1
            GeneID:834629 KEGG:ath:AT5G45890 TAIR:At5g45890 InParanoid:Q9FJ47
            PhylomeDB:Q9FJ47 ProtClustDB:CLSN2917735 ArrayExpress:Q9FJ47
            Genevestigator:Q9FJ47 GO:GO:0010282 Uniprot:Q9FJ47
        Length = 346

 Score = 431 (156.8 bits), Expect = 2.1e-40, P = 2.1e-40
 Identities = 104/290 (35%), Positives = 151/290 (52%)

Query:   200 HDKVYSSVEDLLRRHENFVTNVEKAEDYQSEDSG-TAVFGVNKFFDLSESDLQQL-TGLN 257
             H +VY+ V++   R+  F  NVE+ E   S  +G T    VN+F DL+  + + + TG  
Sbjct:    45 HGRVYADVKEENNRYVVFKNNVERIEHLNSIPAGRTFKLAVNQFADLTNDEFRSMYTGFK 104

Query:   258 LDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVKEQGKC 317
               S L            S+Q+ T+M  F++ ++  G  LP + DWR +G ++ +K QG C
Sbjct:   105 GVSAL------------SSQSQTKMSPFRYQNVSSGA-LPVSVDWRKKGAVTPIKNQGSC 151

Query:   318 ACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSNGGCNGGRMDDALQYIIDNGGVV 377
              CCWAFSAV  +E    I+   L  LS QQLVDCD ++ GC GG MD A ++I   GG+ 
Sbjct:   152 GCCWAFSAVAAIEGATQIKKGKLISLSEQQLVDCDTNDFGCEGGLMDTAFEHIKATGGLT 211

Query:   378 SDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANG 437
             ++  YPYK  ++   C              Y  +P  +E+ + K VA + P+SVG+   G
Sbjct:   212 TESNYPYKGEDAT--CNSKKTNPKATSITGYEDVPVNDEQALMKAVAHQ-PVSVGIEGGG 268

Query:   438 L-F-YYSGGVID------LNQRL----YGTSI---PYWIVKNSWGSDWGE 472
               F +YS GV        L+  +    YG S     YWI+KNSWG+ WGE
Sbjct:   269 FDFQFYSSGVFTGECTTYLDHAVTAIGYGESTNGSKYWIIKNSWGTKWGE 318

 Score = 259 (96.2 bits), Expect = 6.4e-42, Sum P(2) = 6.4e-42
 Identities = 59/163 (36%), Positives = 87/163 (53%)

Query:   491 TGVLPSKLSRLATEKLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERG 550
             T +   KL  L+ ++LVDCD ++ GC GG MD A ++I   GG+ ++  YPYK  ++   
Sbjct:   167 TQIKKGKLISLSEQQLVDCDTNDFGCEGGLMDTAFEHIKATGGLTTESNYPYKGEDAT-- 224

Query:   551 CLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGL-F-YYSGGVIDLNQ 608
             C              Y  +P  +E+ + K VA + P+SVG+   G  F +YS GV     
Sbjct:   225 CNSKKTNPKATSITGYEDVPVNDEQALMKAVAHQ-PVSVGIEGGGFDFQFYSSGVFTGE- 282

Query:   609 RLCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGE 651
               C     +HA+  +GYGE    +G+   YWI+KNSWG+ WGE
Sbjct:   283 --CTTYL-DHAVTAIGYGEST--NGSK--YWIIKNSWGTKWGE 318

 Score = 219 (82.2 bits), Expect = 6.4e-42, Sum P(2) = 6.4e-42
 Identities = 53/157 (33%), Positives = 82/157 (52%)

Query:    44 RFLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAEDYQREDSG-TAVFEVNKFFDLSDSDL 102
             R + +M  H +VY+ V++   R+  F  NVE+ E      +G T    VN+F DL++ + 
Sbjct:    37 RHIEWMTKHGRVYADVKEENNRYVVFKNNVERIEHLNSIPAGRTFKLAVNQFADLTNDEF 96

Query:   103 QQL-TGLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVIS 161
             + + TG    S L            S+Q+ T+M  F++ ++  G  LP + DWR +G ++
Sbjct:    97 RSMYTGFKGVSAL------------SSQSQTKMSPFRYQNVSSGA-LPVSVDWRKKGAVT 143

Query:   162 KVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
              +K QG C CCWAFSAV  +E    I+   L  LS Q
Sbjct:   144 PIKNQGSCGCCWAFSAVAAIEGATQIKKGKLISLSEQ 180


>DICTYBASE|DDB_G0290957 [details] [associations]
            symbol:cprA "cysteine proteinase 1" species:44689
            "Dictyostelium discoideum" [GO:0006972 "hyperosmotic response"
            evidence=IEP] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0005764 "lysosome" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 dictyBase:DDB_G0290957
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GenomeReviews:CM000154_GR GO:GO:0005764
            GO:GO:0006972 EMBL:AAFI02000174 KO:K01376 EMBL:X02407 PIR:A22827
            RefSeq:XP_635417.1 ProteinModelPortal:P04988 MEROPS:C01.022
            GlycoSuiteDB:P04988 SWISS-2DPAGE:P04988 EnsemblProtists:DDB0201647
            GeneID:8627918 KEGG:ddi:DDB_G0290957 OMA:KISNFTM
            ProtClustDB:CLSZ2429603 Uniprot:P04988
        Length = 343

 Score = 325 (119.5 bits), Expect = 1.5e-28, P = 1.5e-28
 Identities = 95/282 (33%), Positives = 130/282 (46%)

Query:   201 DKVYSSVEDLLRRHENFVTNVEKAEDYQS---EDSGTAVFGVNKFFDLSESDLQQLTGLN 257
             +K YS  E+ L R E F +N+ K E+             FGVNKF DLS  + +     N
Sbjct:    37 NKKYSH-EEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNN 95

Query:   258 LDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVKEQGKC 317
              ++   D  P       ++  D E      NS+      P AFDWR  G ++ VK QG+C
Sbjct:    96 KEAIFTDDLPV------ADYLDDEF----INSI------PTAFDWRTRGAVTPVKNQGQC 139

Query:   318 ACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMS----NG------GCNGGRMDDAL 367
               CW+FS  G VE  H I  N L  LS Q LVDCD       G      GCNGG   +A 
Sbjct:   140 GSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEQACDEGCNGGLQPNAY 199

Query:   368 QYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRG 427
              YII NGG+ ++ +YPY A E+   C              ++ IP  E   M  ++ + G
Sbjct:   200 NYIIKNGGIQTESSYPYTA-ETGTQC-NFNSANIGAKISNFTMIPKNETV-MAGYIVSTG 256

Query:   428 PLSVGMNANGLFYYSGGVIDL----NQRLYGTSIPYWIVKNS 465
             PL++  +A    +Y GGV D+    N   +G  I  +  KN+
Sbjct:   257 PLAIAADAVEWQFYIGGVFDIPCNPNSLDHGILIVGYSAKNT 298

 Score = 276 (102.2 bits), Expect = 1.8e-41, Sum P(2) = 1.8e-41
 Identities = 61/167 (36%), Positives = 90/167 (53%)

Query:   496 SKLSRLATEKLVDCDMS----------NGGCNGGRMDDALQYIIDNGGVVSDQAYPYKAS 545
             +KL  L+ + LVDCD            + GCNGG   +A  YII NGG+ ++ +YPY A 
Sbjct:   160 NKLVSLSEQNLVDCDHECMEYEGEQACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTA- 218

Query:   546 ESERGCLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLFYYSGGVID 605
             E+   C              ++ IP  E   M  ++ + GPL++  +A    +Y GGV D
Sbjct:   219 ETGTQC-NFNSANIGAKISNFTMIPKNETV-MAGYIVSTGPLAIAADAVEWQFYIGGVFD 276

Query:   606 LNQRLCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
             +    CNP + +H ++IVGY  +      ++PYWIVKNSWG+DWGE+
Sbjct:   277 IP---CNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQ 320

 Score = 197 (74.4 bits), Expect = 1.8e-41, Sum P(2) = 1.8e-41
 Identities = 57/169 (33%), Positives = 77/169 (45%)

Query:    34 TRGYLNSPVTRFLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAEDYQR---EDSGTAVFE 90
             +RG      ++FL F    +K YS  E+ L R E F +N+ K E+             F 
Sbjct:    18 SRGIPLEEQSQFLEFQDKFNKKYSH-EEYLERFEIFKSNLGKIEELNLIAINHKADTKFG 76

Query:    91 VNKFFDLSDSDLQQLTGLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPE 150
             VNKF DLS  + +     N ++   D  P       ++  D E      NS+      P 
Sbjct:    77 VNKFADLSSDEFKNYYLNNKEAIFTDDLPV------ADYLDDEF----INSI------PT 120

Query:   151 AFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQH 199
             AFDWR  G ++ VK QG+C  CW+FS  G VE  H I  N L  LS Q+
Sbjct:   121 AFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQN 169

 Score = 144 (55.7 bits), Expect = 1.3e-23, Sum P(2) = 1.3e-23
 Identities = 41/136 (30%), Positives = 59/136 (43%)

Query:   357 GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXX----------- 405
             GCNGG   +A  YII NGG+ ++ +YPY A E+   C                       
Sbjct:   189 GCNGGLQPNAYNYIIKNGGIQTESSYPYTA-ETGTQCNFNSANIGAKISNFTMIPKNETV 247

Query:   406 -XXY--SRIPYGEEEEMKKW-VATRGPLSVGMNANGLFYYSGGVI----DLNQRLYGTSI 457
                Y  S  P     +  +W     G   +  N N L +   G++         ++  ++
Sbjct:   248 MAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPNSLDH---GILIVGYSAKNTIFRKNM 304

Query:   458 PYWIVKNSWGSDWGEK 473
             PYWIVKNSWG+DWGE+
Sbjct:   305 PYWIVKNSWGADWGEQ 320


>FB|FBgn0013770 [details] [associations]
            symbol:Cp1 "Cysteine proteinase-1" species:7227 "Drosophila
            melanogaster" [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=ISS;NAS] [GO:0030163 "protein catabolic process"
            evidence=NAS] [GO:0005764 "lysosome" evidence=NAS] [GO:0048102
            "autophagic cell death" evidence=IEP] [GO:0035071 "salivary gland
            cell autophagic cell death" evidence=IEP] [GO:0006508 "proteolysis"
            evidence=IEA] [GO:0045169 "fusome" evidence=IDA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 EMBL:AE013599 GO:GO:0007586 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GO:GO:0035071 GO:GO:0045169 GeneTree:ENSGT00660000095458 KO:K01365
            EMBL:U75652 EMBL:AF012089 EMBL:BT016071 EMBL:D31970
            RefSeq:NP_523735.2 RefSeq:NP_725347.1 RefSeq:NP_725348.1
            UniGene:Dm.7400 ProteinModelPortal:Q95029 SMR:Q95029 IntAct:Q95029
            MINT:MINT-814156 STRING:Q95029 MEROPS:C01.092 PaxDb:Q95029
            EnsemblMetazoa:FBtr0087593 GeneID:36546 KEGG:dme:Dmel_CG6692
            CTD:36546 FlyBase:FBgn0013770 InParanoid:Q95029 OMA:ICHGADP
            OrthoDB:EOG46M91C PhylomeDB:Q95029 GenomeRNAi:36546 NextBio:799136
            Bgee:Q95029 GermOnline:CG6692 Uniprot:Q95029
        Length = 371

 Score = 384 (140.2 bits), Expect = 4.9e-35, P = 4.9e-35
 Identities = 107/297 (36%), Positives = 149/297 (50%)

Query:   200 HDKVYSSVEDLLRRHENFVTNVEK-AEDYQSEDSGTAVF--GVNKFFDLSESDLQQL-TG 255
             H K Y    +   R + F  N  K A+  Q    G   F   VNK+ DL   + +QL  G
Sbjct:    66 HRKNYQDETEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADLLHHEFRQLMNG 125

Query:   256 LNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVKEQG 315
              N   TL      L+A       D   +   F S  H   LP++ DWR +G ++ VK+QG
Sbjct:   126 FNY--TLHK---QLRA------ADESFKGVTFISPAHVT-LPKSVDWRTKGAVTAVKDQG 173

Query:   316 KCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMS--NGGCNGGRMDDALQYIIDN 373
              C  CWAFS+ G +E  H  +   L  LS Q LVDC     N GCNGG MD+A +YI DN
Sbjct:   174 HCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDN 233

Query:   374 GGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGM 433
             GG+ ++++YPY+A +    C              ++ IP G+E++M + VAT GP+SV +
Sbjct:   234 GGIDTEKSYPYEAIDDS--C-HFNKGTVGATDRGFTDIPQGDEKKMAEAVATVGPVSVAI 290

Query:   434 NAN--GLFYYSGGVIDLNQ----RL--------YGTSIP---YWIVKNSWGSDWGEK 473
             +A+     +YS GV +  Q     L        +GT      YW+VKNSWG+ WG+K
Sbjct:   291 DASHESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGEDYWLVKNSWGTTWGDK 347

 Score = 293 (108.2 bits), Expect = 1.7e-40, Sum P(2) = 1.7e-40
 Identities = 64/166 (38%), Positives = 100/166 (60%)

Query:   491 TGVLPSKLSRLATEKLVDCDMS--NGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESE 548
             +GVL S    L+ + LVDC     N GCNGG MD+A +YI DNGG+ ++++YPY+A +  
Sbjct:   195 SGVLVS----LSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYPYEAIDDS 250

Query:   549 RGCLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNAN--GLFYYSGGVIDL 606
               C              ++ IP G+E++M + VAT GP+SV ++A+     +YS GV   
Sbjct:   251 --C-HFNKGTVGATDRGFTDIPQGDEKKMAEAVATVGPVSVAIDASHESFQFYSEGVY-- 305

Query:   607 NQRLCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
             N+  C+ +  +H +++VG+G +E  +     YW+VKNSWG+ WG+K
Sbjct:   306 NEPQCDAQNLDHGVLVVGFGTDESGED----YWLVKNSWGTTWGDK 347

 Score = 170 (64.9 bits), Expect = 1.7e-40, Sum P(2) = 1.7e-40
 Identities = 57/180 (31%), Positives = 83/180 (46%)

Query:    24 VALLESNIFQTRGYLNSPVTRFLNFMRDHDKVYSSVEDLLRRHENFVTNVEK-AEDYQRE 82
             +ALL   + Q   + +  +  +  F  +H K Y    +   R + F  N  K A+  QR 
Sbjct:    40 LALLA--VAQAVSFADVVMEEWHTFKLEHRKNYQDETEERFRLKIFNENKHKIAKHNQRF 97

Query:    83 DSGTAVFE--VNKFFDLSDSDLQQL-TGLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQF 139
               G   F+  VNK+ DL   + +QL  G N   TL      L+A       D   +   F
Sbjct:    98 AEGKVSFKLAVNKYADLLHHEFRQLMNGFNY--TLHK---QLRA------ADESFKGVTF 146

Query:   140 NSLRHGDDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQH 199
              S  H   LP++ DWR +G ++ VK+QG C  CWAFS+ G +E  H  +   L  LS Q+
Sbjct:   147 ISPAHVT-LPKSVDWRTKGAVTAVKDQGHCGSCWAFSSTGALEGQHFRKSGVLVSLSEQN 205


>TAIR|locus:2090614 [details] [associations]
            symbol:AT3G19390 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA;ISS] [GO:0000041 "transition metal ion
            transport" evidence=RCA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            EMBL:CP002686 GenomeReviews:BA000014_GR eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 EMBL:AB025624
            MEROPS:I29.003 OMA:KAMDQKC HSSP:O65039 HOGENOM:HOG000230773
            InterPro:IPR000118 Pfam:PF00396 SMART:SM00277 EMBL:AY062725
            EMBL:AY093350 IPI:IPI00520189 RefSeq:NP_566633.1 UniGene:At.27473
            ProteinModelPortal:Q9LT78 SMR:Q9LT78 IntAct:Q9LT78 STRING:Q9LT78
            PaxDb:Q9LT78 PRIDE:Q9LT78 EnsemblPlants:AT3G19390.1 GeneID:821473
            KEGG:ath:AT3G19390 TAIR:At3g19390 InParanoid:Q9LT78
            PhylomeDB:Q9LT78 ProtClustDB:CLSN2917188 Genevestigator:Q9LT78
            Uniprot:Q9LT78
        Length = 452

 Score = 391 (142.7 bits), Expect = 4.4e-40, Sum P(2) = 4.4e-40
 Identities = 82/195 (42%), Positives = 116/195 (59%)

Query:   293 GDDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCD 352
             GD LP+A DWRA+G ++ VK+QG C  CWAFSA+G VE ++ I+   L  LS Q+LVDCD
Sbjct:   126 GDSLPDAIDWRAKGAVNPVKDQGSCGSCWAFSAIGAVEGINQIKTGELISLSEQELVDCD 185

Query:   353 MS-NGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRI 411
              S N GC GG MD A ++II+NGG+ +++ YPY A++    C              Y  +
Sbjct:   186 TSYNDGCGGGLMDYAFKFIIENGGIDTEEDYPYIATDVNV-CNSDKKNTRVVTIDGYEDV 244

Query:   412 PYGEEEEMKKWVATRGPLSVGMNANGLFY--YSGGVI------DLNQRL----YGTS--I 457
             P  +E+ +KK +A + P+SV + A G  +  Y+ GV        L+  +    YG+    
Sbjct:   245 PQNDEKSLKKALANQ-PISVAIEAGGRAFQLYTSGVFTGTCGTSLDHGVVAVGYGSEGGQ 303

Query:   458 PYWIVKNSWGSDWGE 472
              YWIV+NSWGS+WGE
Sbjct:   304 DYWIVRNSWGSNWGE 318

 Score = 267 (99.0 bits), Expect = 3.9e-35, Sum P(3) = 3.9e-35
 Identities = 60/158 (37%), Positives = 89/158 (56%)

Query:   497 KLSRLATEKLVDCDMS-NGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXX 555
             +L  L+ ++LVDCD S N GC GG MD A ++II+NGG+ +++ YPY A++    C    
Sbjct:   172 ELISLSEQELVDCDTSYNDGCGGGLMDYAFKFIIENGGIDTEEDYPYIATDVNV-CNSDK 230

Query:   556 XXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLFY--YSGGVIDLNQRLCNP 613
                       Y  +P  +E+ +KK +A + P+SV + A G  +  Y+ GV       C  
Sbjct:   231 KNTRVVTIDGYEDVPQNDEKSLKKALANQ-PISVAIEAGGRAFQLYTSGVFT---GTCGT 286

Query:   614 KAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGE 651
                +H ++ VGYG E  +D     YWIV+NSWGS+WGE
Sbjct:   287 SL-DHGVVAVGYGSEGGQD-----YWIVRNSWGSNWGE 318

 Score = 169 (64.5 bits), Expect = 3.9e-35, Sum P(3) = 3.9e-35
 Identities = 29/54 (53%), Positives = 38/54 (70%)

Query:   145 GDDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
             GD LP+A DWRA+G ++ VK+QG C  CWAFSA+G VE ++ I+   L  LS Q
Sbjct:   126 GDSLPDAIDWRAKGAVNPVKDQGSCGSCWAFSAIGAVEGINQIKTGELISLSEQ 179

 Score = 61 (26.5 bits), Expect = 4.4e-40, Sum P(2) = 4.4e-40
 Identities = 14/52 (26%), Positives = 28/52 (53%)

Query:   202 KVYSSVEDLLRRHENFVTNVEKAEDYQSEDSGTAVFGVNKFFDLSESDLQQL 253
             K Y+ + +  RR E F  N++  E++ S  + T   G+ +F DL+  + + +
Sbjct:    52 KNYNGLGEKERRFEIFKDNLKFVEEHSSIPNRTYEVGLTRFADLTNDEFRAI 103

 Score = 51 (23.0 bits), Expect = 3.9e-35, Sum P(3) = 3.9e-35
 Identities = 16/87 (18%), Positives = 42/87 (48%)

Query:    21 MIKVALLESNIFQTRGYLNSPVTR--FLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAED 78
             ++ ++L   ++  T    N    R  +  ++ ++ K Y+ + +  RR E F  N++  E+
Sbjct:    17 VLLISLSLGSVTATETTRNEAEARRMYERWLVENRKNYNGLGEKERRFEIFKDNLKFVEE 76

Query:    79 YQREDSGTAVFEVNKFFDLSDSDLQQL 105
             +    + T    + +F DL++ + + +
Sbjct:    77 HSSIPNRTYEVGLTRFADLTNDEFRAI 103


>MGI|MGI:107341 [details] [associations]
            symbol:Ctss "cathepsin S" species:10090 "Mus musculus"
            [GO:0005764 "lysosome" evidence=ISO;IDA] [GO:0006508 "proteolysis"
            evidence=IDA] [GO:0008152 "metabolic process" evidence=ISO]
            [GO:0008233 "peptidase activity" evidence=IDA] [GO:0008234
            "cysteine-type peptidase activity" evidence=ISO] [GO:0009986 "cell
            surface" evidence=ISO] [GO:0016020 "membrane" evidence=IDA]
            [GO:0016787 "hydrolase activity" evidence=IEA] [GO:0045453 "bone
            resorption" evidence=ISO] [GO:0051930 "regulation of sensory
            perception of pain" evidence=ISO] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 MGI:MGI:107341 GO:GO:0016020 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GO:GO:0008233 GO:GO:0031905 Reactome:REACT_102124
            GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 BRENDA:3.4.22.27
            ChiTaRS:CTSS EMBL:AF051732 EMBL:AF051727 EMBL:AF051728
            EMBL:AF051729 EMBL:AF051726 EMBL:AF051730 EMBL:AF051731
            EMBL:AF038546 EMBL:AJ002386 EMBL:AC092203 EMBL:Y18466 EMBL:AJ223208
            IPI:IPI00309520 UniGene:Mm.3619 PDB:1M0H PDBsum:1M0H
            ProteinModelPortal:O70370 SMR:O70370 STRING:O70370
            PhosphoSite:O70370 PaxDb:O70370 PRIDE:O70370
            Ensembl:ENSMUST00000116304 BindingDB:O70370 ChEMBL:CHEMBL4098
            NextBio:282932 Bgee:O70370 CleanEx:MM_CTSS Genevestigator:O70370
            GermOnline:ENSMUSG00000038642 Uniprot:O70370
        Length = 340

 Score = 397 (144.8 bits), Expect = 1.7e-36, P = 1.7e-36
 Identities = 94/237 (39%), Positives = 134/237 (56%)

Query:   259 DSTLEDI---QPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVKEQG 315
             D T E+I     +L+ P  S +T T    F+  S R    LP+  DWR +G +++VK QG
Sbjct:    90 DMTNEEILCRMGALRIPRQSPKTVT----FRSYSNR---TLPDTVDWREKGCVTEVKYQG 142

Query:   316 KCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCD----MSNGGCNGGRMDDALQYII 371
              C  CWAFSAVG +E    ++   L  LS Q LVDC       N GC GG M +A QYII
Sbjct:   143 SCGACWAFSAVGALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQYII 202

Query:   372 DNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSV 431
             DNGG+ +D +YPYKA++ +  C              Y ++P+G+E+ +K+ VAT+GP+SV
Sbjct:   203 DNGGIEADASYPYKATDEK--C-HYNSKNRAATCSRYIQLPFGDEDALKEAVATKGPVSV 259

Query:   432 GMNAN--GLFYYSGGVID-------LNQRL----YGT--SIPYWIVKNSWGSDWGEK 473
             G++A+    F+Y  GV D       +N  +    YGT     YW+VKNSWG ++G++
Sbjct:   260 GIDASHSSFFFYKSGVYDDPSCTGNVNHGVLVVGYGTLDGKDYWLVKNSWGLNFGDQ 316

 Score = 313 (115.2 bits), Expect = 5.6e-40, Sum P(2) = 5.6e-40
 Identities = 66/162 (40%), Positives = 97/162 (59%)

Query:   497 KLSRLATEKLVDCD----MSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCL 552
             KL  L+ + LVDC       N GC GG M +A QYIIDNGG+ +D +YPYKA++ +  C 
Sbjct:   166 KLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQYIIDNGGIEADASYPYKATDEK--C- 222

Query:   553 XXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNAN--GLFYYSGGVIDLNQRL 610
                          Y ++P+G+E+ +K+ VAT+GP+SVG++A+    F+Y  GV D     
Sbjct:   223 HYNSKNRAATCSRYIQLPFGDEDALKEAVATKGPVSVGIDASHSSFFFYKSGVYD--DPS 280

Query:   611 CNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
             C     NH +++VGYG  + KD     YW+VKNSWG ++G++
Sbjct:   281 CTGNV-NHGVLVVGYGTLDGKD-----YWLVKNSWGLNFGDQ 316

 Score = 144 (55.7 bits), Expect = 5.6e-40, Sum P(2) = 5.6e-40
 Identities = 45/140 (32%), Positives = 70/140 (50%)

Query:    71 TNVEKAEDYQREDSGTAVFEVN-KFFDLSDSDLQQ-----LTGLNL--DSTLEDI---QP 119
             T+ ++ +D   E+    ++E N KF  + + +          G+N   D T E+I     
Sbjct:    42 THEKEYKDKNEEEVRRLIWEKNLKFIMIHNLEYSMGMHTYQVGMNDMGDMTNEEILCRMG 101

Query:   120 SLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVG 179
             +L+ P  S +T T    F+  S R    LP+  DWR +G +++VK QG C  CWAFSAVG
Sbjct:   102 ALRIPRQSPKTVT----FRSYSNR---TLPDTVDWREKGCVTEVKYQGSCGACWAFSAVG 154

Query:   180 VVEAMHAIQGNNLTELSVQH 199
              +E    ++   L  LS Q+
Sbjct:   155 ALEGQLKLKTGKLISLSAQN 174


>TAIR|locus:2038588 [details] [associations]
            symbol:AT2G27420 species:3702 "Arabidopsis thaliana"
            [GO:0006508 "proteolysis" evidence=IEA;ISS] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA;ISS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 EMBL:CP002685
            GenomeReviews:CT485783_GR eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 EMBL:AC006232
            MEROPS:I29.003 OMA:EEFRATH HOGENOM:HOG000230773 HSSP:P53634
            ProtClustDB:CLSN2688476 EMBL:AY064033 EMBL:AY096388 IPI:IPI00539752
            PIR:F84672 RefSeq:NP_565649.1 UniGene:At.27094
            ProteinModelPortal:Q9ZQH7 SMR:Q9ZQH7 PRIDE:Q9ZQH7
            EnsemblPlants:AT2G27420.1 GeneID:817287 KEGG:ath:AT2G27420
            TAIR:At2g27420 InParanoid:Q9ZQH7 PhylomeDB:Q9ZQH7
            ArrayExpress:Q9ZQH7 Genevestigator:Q9ZQH7 Uniprot:Q9ZQH7
        Length = 348

 Score = 388 (141.6 bits), Expect = 1.7e-35, P = 1.7e-35
 Identities = 99/292 (33%), Positives = 145/292 (49%)

Query:   201 DKVYSSVEDLLRRHENFVTNVEKAEDYQSEDSGTAVFGVNKFFDLSESDLQQL-TGLNLD 259
             ++VYS   +   R   F  N+E  +++   +  T    +N+F DL++ + +   TGL + 
Sbjct:    43 NRVYSDETEKRNRFNIFKKNLEFVQNFNMNNKITYKVDINEFSDLTDEEFRATHTGLVVP 102

Query:   260 STLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVKEQGKCAC 319
               +  I     +  SS +       F++ ++    D  E+ DWR EG ++ VK QG+C  
Sbjct:   103 EAITRI-----STLSSGKNTVP---FRYGNV---SDNGESMDWRQEGAVTPVKYQGRCGG 151

Query:   320 CWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMS-NGGCNGGRMDDALQYIIDNGGVVS 378
             CWAFSAV  VE +  I    L  LS QQL+DCD   N GC GG M  A +YII N G+ +
Sbjct:   152 CWAFSAVAAVEGITKITKGELVSLSEQQLLDCDRDYNQGCRGGIMSKAFEYIIKNQGITT 211

Query:   379 DQAYPYKASESERGCLXXXXXXXXXXXXX---YSRIPYGEEEEMKKWVATRGPLSVGMNA 435
             +  YPY+  ES++ C                 Y  +P   EE + + V+ + P+SVG+  
Sbjct:   212 EDNYPYQ--ESQQTCSSSTTLSSSFRAATISGYETVPMNNEEALLQAVSQQ-PVSVGIEG 268

Query:   436 NGLFY--YSGGVI------DLNQRL----YGTS---IPYWIVKNSWGSDWGE 472
              G  +  YSGGV       DL+  +    YG S     YW+VKNSWG  WGE
Sbjct:   269 TGAAFRHYSGGVFNGECGTDLHHAVTIVGYGMSEEGTKYWVVKNSWGETWGE 320

 Score = 265 (98.3 bits), Expect = 3.2e-39, Sum P(2) = 3.2e-39
 Identities = 62/168 (36%), Positives = 89/168 (52%)

Query:   490 LTGVLPSKLSRLATEKLVDCDMS-NGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESE 548
             +T +   +L  L+ ++L+DCD   N GC GG M  A +YII N G+ ++  YPY+  ES+
Sbjct:   164 ITKITKGELVSLSEQQLLDCDRDYNQGCRGGIMSKAFEYIIKNQGITTEDNYPYQ--ESQ 221

Query:   549 RGCLXXXXXXXXXXXXX---YSRIPYGEEEEMKKWVATRGPLSVGMNANGLFY--YSGGV 603
             + C                 Y  +P   EE + + V+ + P+SVG+   G  +  YSGGV
Sbjct:   222 QTCSSSTTLSSSFRAATISGYETVPMNNEEALLQAVSQQ-PVSVGIEGTGAAFRHYSGGV 280

Query:   604 IDLNQRLCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGE 651
              +     C     +HA+ IVGYG  E  +GT   YW+VKNSWG  WGE
Sbjct:   281 FNGE---CGTDL-HHAVTIVGYGMSE--EGTK--YWVVKNSWGETWGE 320

 Score = 187 (70.9 bits), Expect = 3.2e-39, Sum P(2) = 3.2e-39
 Identities = 53/183 (28%), Positives = 90/183 (49%)

Query:    20 FMIKVAL-LESNIFQTRGYL--NSPVTRFLNFMRDHDKVYSSVEDLLRRHENFVTNVEKA 76
             F++ + L   +++  +RG L   S + +   +M   ++VYS   +   R   F  N+E  
Sbjct:     7 FILTIFLSYRTSLATSRGSLFEASAIEKHEQWMARFNRVYSDETEKRNRFNIFKKNLEFV 66

Query:    77 EDYQREDSGTAVFEVNKFFDLSDSDLQQL-TGLNLDSTLEDIQPSLQAPFSSNQTDTEMR 135
             +++   +  T   ++N+F DL+D + +   TGL +   +  I     +  SS +      
Sbjct:    67 QNFNMNNKITYKVDINEFSDLTDEEFRATHTGLVVPEAITRI-----STLSSGKNTVP-- 119

Query:   136 AFQFNSLRHGDDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTEL 195
              F++ ++    D  E+ DWR EG ++ VK QG+C  CWAFSAV  VE +  I    L  L
Sbjct:   120 -FRYGNV---SDNGESMDWRQEGAVTPVKYQGRCGGCWAFSAVAAVEGITKITKGELVSL 175

Query:   196 SVQ 198
             S Q
Sbjct:   176 SEQ 178


>TAIR|locus:2055440 [details] [associations]
            symbol:AT2G34080 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 EMBL:CP002685 GenomeReviews:CT485783_GR
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 MEROPS:I29.003 EMBL:AC002341
            HOGENOM:HOG000230773 HSSP:P53634 IPI:IPI00530325 PIR:B84752
            RefSeq:NP_565780.1 UniGene:At.28613 UniGene:At.37859
            ProteinModelPortal:O22961 SMR:O22961 EnsemblPlants:AT2G34080.1
            GeneID:817969 KEGG:ath:AT2G34080 TAIR:At2g34080 InParanoid:O22961
            OMA:SENDYSY PhylomeDB:O22961 ProtClustDB:CLSN2688064
            ArrayExpress:O22961 Genevestigator:O22961 Uniprot:O22961
        Length = 345

 Score = 399 (145.5 bits), Expect = 1.0e-36, P = 1.0e-36
 Identities = 102/279 (36%), Positives = 146/279 (52%)

Query:   213 RHENFVTNVEKAEDYQSEDSGTAVFGVNKFFDLSESDLQQL-TGLNLDSTLEDIQPS-LQ 270
             R + F  N++  E++  + + +   GVN+F D +  +   + TGL     L ++ PS + 
Sbjct:    59 RRDVFKKNLKFIENFNKKGNKSYKLGVNEFADWTNEEFLAIHTGLK---GLTEVSPSKVV 115

Query:   271 APFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVE 330
             A   S+QT        +N     D + E+ DWRAEG ++ VK QG+C CCWAFSAV  VE
Sbjct:   116 AKTISSQT--------WNV---SDMVVESKDWRAEGAVTPVKYQGQCGCCWAFSAVAAVE 164

Query:   331 AMHAIQGNSLTELSVQQLVDCDMS-NGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASES 389
              +  I G +L  LS QQL+DCD   + GC+GG M DA  Y++ N G+ S+  Y Y+ S+ 
Sbjct:   165 GVAKIAGGNLVSLSEQQLLDCDREYDRGCDGGIMSDAFNYVVQNRGIASENDYSYQGSDG 224

Query:   390 ERGCLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNA--NGLFYYSGGVID 447
               GC              +  +P   E  + + V+ R P+SV M+A  +G  +YSGGV D
Sbjct:   225 --GC--RSNARPAARISGFQTVPSNNERALLEAVS-RQPVSVSMDATGDGFMHYSGGVYD 279

Query:   448 ------LNQRL----YGTS---IPYWIVKNSWGSDWGEK 473
                    N  +    YGTS     YW+ KNSWG  WGEK
Sbjct:   280 GPCGTSSNHAVTFVGYGTSQDGTKYWLAKNSWGETWGEK 318

 Score = 262 (97.3 bits), Expect = 3.3e-39, Sum P(2) = 3.3e-39
 Identities = 60/158 (37%), Positives = 85/158 (53%)

Query:   498 LSRLATEKLVDCDMS-NGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXX 556
             L  L+ ++L+DCD   + GC+GG M DA  Y++ N G+ S+  Y Y+ S+   GC     
Sbjct:   174 LVSLSEQQLLDCDREYDRGCDGGIMSDAFNYVVQNRGIASENDYSYQGSDG--GC--RSN 229

Query:   557 XXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNA--NGLFYYSGGVIDLNQRLCNPK 614
                      +  +P   E  + + V+ R P+SV M+A  +G  +YSGGV D     C   
Sbjct:   230 ARPAARISGFQTVPSNNERALLEAVS-RQPVSVSMDATGDGFMHYSGGVYD---GPCGTS 285

Query:   615 AQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
             + NHA+  VGYG  +  DGT   YW+ KNSWG  WGEK
Sbjct:   286 S-NHAVTFVGYGTSQ--DGTK--YWLAKNSWGETWGEK 318

 Score = 190 (71.9 bits), Expect = 3.3e-39, Sum P(2) = 3.3e-39
 Identities = 58/181 (32%), Positives = 92/181 (50%)

Query:    20 FMIKVALLESNIFQTRGYLNSPVTRFLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAEDY 79
             F I  A   + IF+ +  ++        F R++      +E  +RR + F  N++  E++
Sbjct:    18 FRISQATSRTVIFREQSMVDKHEQWMARFSREYR---DELEKNMRR-DVFKKNLKFIENF 73

Query:    80 QREDSGTAVFEVNKFFDLSDSDLQQL-TGLNLDSTLEDIQPS-LQAPFSSNQTDTEMRAF 137
              ++ + +    VN+F D ++ +   + TGL     L ++ PS + A   S+QT       
Sbjct:    74 NKKGNKSYKLGVNEFADWTNEEFLAIHTGLK---GLTEVSPSKVVAKTISSQT------- 123

Query:   138 QFNSLRHGDDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSV 197
              +N     D + E+ DWRAEG ++ VK QG+C CCWAFSAV  VE +  I G NL  LS 
Sbjct:   124 -WNV---SDMVVESKDWRAEGAVTPVKYQGQCGCCWAFSAVAAVEGVAKIAGGNLVSLSE 179

Query:   198 Q 198
             Q
Sbjct:   180 Q 180


>TAIR|locus:2157712 [details] [associations]
            symbol:CEP1 "cysteine endopeptidase 1" species:3702
            "Arabidopsis thaliana" [GO:0005576 "extracellular region"
            evidence=ISM] [GO:0006508 "proteolysis" evidence=IEA;ISS]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA;ISS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0005783 EMBL:CP002688
            GenomeReviews:BA000015_GR eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 PROSITE:PS00014
            EMBL:AB024031 MEROPS:I29.003 EMBL:HM367092 EMBL:AY091087
            IPI:IPI00516991 RefSeq:NP_568722.1 UniGene:At.7918 HSSP:O65039
            ProteinModelPortal:Q9FGR9 SMR:Q9FGR9 PaxDb:Q9FGR9 PRIDE:Q9FGR9
            EnsemblPlants:AT5G50260.1 GeneID:835091 KEGG:ath:AT5G50260
            TAIR:At5g50260 HOGENOM:HOG000230773 InParanoid:Q9FGR9 KO:K16292
            OMA:WHSKKYH PhylomeDB:Q9FGR9 ProtClustDB:CLSN2689970
            Genevestigator:Q9FGR9 Uniprot:Q9FGR9
        Length = 361

 Score = 401 (146.2 bits), Expect = 6.0e-37, P = 6.0e-37
 Identities = 106/303 (34%), Positives = 158/303 (52%)

Query:   190 NNLTELSVQ--HHDKVYSSVEDLLRRHENFVTNVEKAEDYQSEDSGTAVFGVNKFFDLSE 247
             N+L EL  +   H  V  S+E+  +R   F  NV+   +   +D    +  +NKF D++ 
Sbjct:    32 NSLWELYERWRSHHTVARSLEEKAKRFNVFKHNVKHIHETNKKDKSYKL-KLNKFGDMTS 90

Query:   248 SDLQQL-TGLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEG 306
              + ++   G N+              F   +  T  ++F + ++   + LP + DWR  G
Sbjct:    91 EEFRRTYAGSNIKH---------HRMFQGEKKAT--KSFMYANV---NTLPTSVDWRKNG 136

Query:   307 VISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMS-NGGCNGGRMDD 365
              ++ VK QG+C  CWAFS V  VE ++ I+   LT LS Q+LVDCD + N GCNGG MD 
Sbjct:   137 AVTPVKNQGQCGSCWAFSTVVAVEGINQIRTKKLTSLSEQELVDCDTNQNQGCNGGLMDL 196

Query:   366 ALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVAT 425
             A ++I + GG+ S+  YPYKAS+    C              +  +P   E+++ K VA 
Sbjct:   197 AFEFIKEKGGLTSELVYPYKASDET--CDTNKENAPVVSIDGHEDVPKNSEDDLMKAVAN 254

Query:   426 RGPLSVGMNANGL-F-YYSGGVI------DLNQRL----YGTSIP---YWIVKNSWGSDW 470
             + P+SV ++A G  F +YS GV       +LN  +    YGT+I    YWIVKNSWG +W
Sbjct:   255 Q-PVSVAIDAGGSDFQFYSEGVFTGRCGTELNHGVAVVGYGTTIDGTKYWIVKNSWGEEW 313

Query:   471 GEK 473
             GEK
Sbjct:   314 GEK 316

 Score = 280 (103.6 bits), Expect = 7.3e-39, Sum P(2) = 7.3e-39
 Identities = 67/159 (42%), Positives = 92/159 (57%)

Query:   497 KLSRLATEKLVDCDMS-NGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXX 555
             KL+ L+ ++LVDCD + N GCNGG MD A ++I + GG+ S+  YPYKAS+    C    
Sbjct:   169 KLTSLSEQELVDCDTNQNQGCNGGLMDLAFEFIKEKGGLTSELVYPYKASDET--CDTNK 226

Query:   556 XXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGL-F-YYSGGVIDLNQRLCNP 613
                       +  +P   E+++ K VA + P+SV ++A G  F +YS GV     R C  
Sbjct:   227 ENAPVVSIDGHEDVPKNSEDDLMKAVANQ-PVSVAIDAGGSDFQFYSEGVF--TGR-CGT 282

Query:   614 KAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
             +  NH + +VGYG     DGT   YWIVKNSWG +WGEK
Sbjct:   283 EL-NHGVAVVGYGTTI--DGTK--YWIVKNSWGEEWGEK 316

 Score = 168 (64.2 bits), Expect = 7.3e-39, Sum P(2) = 7.3e-39
 Identities = 44/150 (29%), Positives = 74/150 (49%)

Query:    50 RDHDKVYSSVEDLLRRHENFVTNVEKAEDYQREDSGTAVFEVNKFFDLSDSDLQQL-TGL 108
             R H  V  S+E+  +R   F  NV+   +  ++D    + ++NKF D++  + ++   G 
Sbjct:    42 RSHHTVARSLEEKAKRFNVFKHNVKHIHETNKKDKSYKL-KLNKFGDMTSEEFRRTYAGS 100

Query:   109 NLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVKEQGK 168
             N+              F   +  T  ++F + ++   + LP + DWR  G ++ VK QG+
Sbjct:   101 NIKH---------HRMFQGEKKAT--KSFMYANV---NTLPTSVDWRKNGAVTPVKNQGQ 146

Query:   169 CACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
             C  CWAFS V  VE ++ I+   LT LS Q
Sbjct:   147 CGSCWAFSTVVAVEGINQIRTKKLTSLSEQ 176


>UNIPROTKB|F1NYJ1 [details] [associations]
            symbol:CTSL2 "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GeneTree:ENSGT00560000076577 IPI:IPI00602255
            OMA:DITHHEF EMBL:AADN02067812 Ensembl:ENSGALT00000020588
            ArrayExpress:F1NYJ1 Uniprot:F1NYJ1
        Length = 339

 Score = 353 (129.3 bits), Expect = 7.7e-34, Sum P(2) = 7.7e-34
 Identities = 89/247 (36%), Positives = 121/247 (48%)

Query:   255 GLNL--DSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVK 312
             G+N   D T E+ +  L   +   +++ + R  QF      +  P + DWR +G ++ VK
Sbjct:    77 GMNQFGDMTAEEFR-QLMNGYKHKKSERKYRGSQFLEPSFLE-APRSVDWREKGYVTPVK 134

Query:   313 EQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSNG--GCNGGRMDDALQYI 370
             +QG+C  CWAFS  G +E  H  +   L  LS Q LVDC    G  GCNGG MD A QY+
Sbjct:   135 DQGQCGSCWAFSTTGALEGQHFRKTGKLVSLSEQNLVDCSRPEGNQGCNGGLMDQAFQYV 194

Query:   371 IDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLS 430
              DNGG+ S+++YPY A + E  C              +  IP G E  + K VA+ GP+S
Sbjct:   195 QDNGGIDSEESYPYTAKDDE-DC-RYKAEYNAANDTGFVDIPQGHERALMKAVASVGPVS 252

Query:   431 VGMNANGL---FYYSG--------------GVIDLNQRLYGTSIP---YWIVKNSWGSDW 470
             V ++A      FY SG              GV+ +     G  +    YWIVKNSWG  W
Sbjct:   253 VAIDAGHSSFQFYQSGIYYEPDCSSEDLDHGVLVVGYGFEGEDVDGKKYWIVKNSWGEKW 312

Query:   471 GEKVEDK 477
             G    DK
Sbjct:   313 G----DK 315

 Score = 296 (109.3 bits), Expect = 1.3e-38, Sum P(2) = 1.3e-38
 Identities = 66/161 (40%), Positives = 91/161 (56%)

Query:   497 KLSRLATEKLVDCDMSNG--GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXX 554
             KL  L+ + LVDC    G  GCNGG MD A QY+ DNGG+ S+++YPY A + E  C   
Sbjct:   161 KLVSLSEQNLVDCSRPEGNQGCNGGLMDQAFQYVQDNGGIDSEESYPYTAKDDE-DC-RY 218

Query:   555 XXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNA-NGLF-YYSGGVIDLNQRLCN 612
                        +  IP G E  + K VA+ GP+SV ++A +  F +Y  G+    +  C+
Sbjct:   219 KAEYNAANDTGFVDIPQGHERALMKAVASVGPVSVAIDAGHSSFQFYQSGIY--YEPDCS 276

Query:   613 PKAQNHALIIVGYG-EEEKKDGTSIPYWIVKNSWGSDWGEK 652
              +  +H +++VGYG E E  DG    YWIVKNSWG  WG+K
Sbjct:   277 SEDLDHGVLVVGYGFEGEDVDGKK--YWIVKNSWGEKWGDK 315

 Score = 149 (57.5 bits), Expect = 1.3e-38, Sum P(2) = 1.3e-38
 Identities = 43/134 (32%), Positives = 70/134 (52%)

Query:    76 AEDY-QREDSGT-AVFEVN-KFFDLSDSD--LQQLT---GLNL--DSTLEDIQPSLQAPF 125
             ++DY +RE+S    V+E N K  +L + D  L + +   G+N   D T E+ +  L   +
Sbjct:    38 SKDYHEREESWRRVVWEKNLKMIELHNLDHSLGKHSYKLGMNQFGDMTAEEFR-QLMNGY 96

Query:   126 SSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMH 185
                +++ + R  QF      +  P + DWR +G ++ VK+QG+C  CWAFS  G +E  H
Sbjct:    97 KHKKSERKYRGSQFLEPSFLE-APRSVDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQH 155

Query:   186 AIQGNNLTELSVQH 199
               +   L  LS Q+
Sbjct:   156 FRKTGKLVSLSEQN 169

 Score = 43 (20.2 bits), Expect = 7.7e-34, Sum P(2) = 7.7e-34
 Identities = 16/57 (28%), Positives = 29/57 (50%)

Query:   200 HDKVYSSVEDLLRRHENFVTNVEKAEDYQSEDS-GTAVF--GVNKFFDLSESDLQQL 253
             H K Y   E+  RR   +  N++  E +  + S G   +  G+N+F D++  + +QL
Sbjct:    37 HSKDYHEREESWRRVV-WEKNLKMIELHNLDHSLGKHSYKLGMNQFGDMTAEEFRQL 92


>TAIR|locus:2082687 [details] [associations]
            symbol:AT3G54940 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 EMBL:CP002686 HOGENOM:HOG000230774 KO:K01373
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 HSSP:P53634
            OMA:GGGLMTN EMBL:AY070063 IPI:IPI00528988 RefSeq:NP_567010.5
            UniGene:At.28412 ProteinModelPortal:Q8VYS0 SMR:Q8VYS0 PRIDE:Q8VYS0
            EnsemblPlants:AT3G54940.2 GeneID:824659 KEGG:ath:AT3G54940
            TAIR:At3g54940 PhylomeDB:Q8VYS0 ProtClustDB:CLSN2718801
            ArrayExpress:Q8VYS0 Genevestigator:Q8VYS0 Uniprot:Q8VYS0
        Length = 367

 Score = 281 (104.0 bits), Expect = 2.2e-38, Sum P(3) = 2.2e-38
 Identities = 71/221 (32%), Positives = 109/221 (49%)

Query:   234 TAVFGVNKFFDLSESDLQQLTGLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHG 293
             +AV GV +F DL+E + +++      + + D+  S      +     E+           
Sbjct:    91 SAVHGVTQFSDLTEEEFKRMY-----TGVADVGGSRGGTVGAEAPMVEV----------- 134

Query:   294 DDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDM 353
             D LPE FDWR +G +++VK QG C  CWAFS  G  E  H +    L  LS QQLVDCD 
Sbjct:   135 DGLPEDFDWREKGGVTEVKNQGACGSCWAFSTTGAAEGAHFVSTGKLLSLSEQQLVDCDQ 194

Query:   354 S---------NGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXX 404
             +         + GC GG M +A +Y+++ GG+  +++YPY     +RG            
Sbjct:   195 ACDPKDKKACDNGCGGGLMTNAYEYLMEAGGLEEERSYPYTG---KRGHCKFDPEKVAVR 251

Query:   405 XXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLFYYSGGV 445
                ++ IP  +E ++   +   GPL+VG+NA  +  Y GGV
Sbjct:   252 VLNFTTIPL-DENQIAANLVRHGPLAVGLNAVFMQTYIGGV 291

 Score = 238 (88.8 bits), Expect = 3.0e-35, Sum P(3) = 3.0e-35
 Identities = 59/174 (33%), Positives = 91/174 (52%)

Query:   493 VLPSKLSRLATEKLVDCDMS---------NGGCNGGRMDDALQYIIDNGGVVSDQAYPYK 543
             V   KL  L+ ++LVDCD +         + GC GG M +A +Y+++ GG+  +++YPY 
Sbjct:   176 VSTGKLLSLSEQQLVDCDQACDPKDKKACDNGCGGGLMTNAYEYLMEAGGLEEERSYPYT 235

Query:   544 ASESERGCLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLFYYSGGV 603
                 +RG               ++ IP  +E ++   +   GPL+VG+NA  +  Y GGV
Sbjct:   236 G---KRGHCKFDPEKVAVRVLNFTTIPL-DENQIAANLVRHGPLAVGLNAVFMQTYIGGV 291

Query:   604 IDLNQRLCNPKAQNHALIIVGYGEEEKKDGTSI------PYWIVKNSWGSDWGE 651
                   +C+ +  NH +++VGYG +    G SI      PYWI+KNSWG  WGE
Sbjct:   292 SC--PLICSKRNVNHGVLLVGYGSK----GFSILRLSNKPYWIIKNSWGKKWGE 339

 Score = 147 (56.8 bits), Expect = 3.0e-35, Sum P(3) = 3.0e-35
 Identities = 26/53 (49%), Positives = 31/53 (58%)

Query:   146 DDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
             D LPE FDWR +G +++VK QG C  CWAFS  G  E  H +    L  LS Q
Sbjct:   135 DGLPEDFDWREKGGVTEVKNQGACGSCWAFSTTGAAEGAHFVSTGKLLSLSEQ 187

 Score = 96 (38.9 bits), Expect = 2.2e-38, Sum P(3) = 2.2e-38
 Identities = 24/67 (35%), Positives = 41/67 (61%)

Query:    43 TRFLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAEDYQREDSGTAVFEVNKFFDLSDSDL 102
             ++F  FM D+ K YS+ E+ + R   F  NV KA ++Q  D  +AV  V +F DL++ + 
Sbjct:    49 SKFRLFMSDYGKNYSTREEYIHRLGIFAKNVLKAAEHQMMDP-SAVHGVTQFSDLTEEEF 107

Query:   103 QQL-TGL 108
             +++ TG+
Sbjct:   108 KRMYTGV 114

 Score = 89 (36.4 bits), Expect = 1.2e-37, Sum P(3) = 1.2e-37
 Identities = 23/63 (36%), Positives = 38/63 (60%)

Query:   195 LSVQHHDKVYSSVEDLLRRHENFVTNVEKAEDYQSEDSGTAVFGVNKFFDLSESDLQQL- 253
             L +  + K YS+ E+ + R   F  NV KA ++Q  D  +AV GV +F DL+E + +++ 
Sbjct:    53 LFMSDYGKNYSTREEYIHRLGIFAKNVLKAAEHQMMDP-SAVHGVTQFSDLTEEEFKRMY 111

Query:   254 TGL 256
             TG+
Sbjct:   112 TGV 114

 Score = 85 (35.0 bits), Expect = 2.2e-38, Sum P(3) = 2.2e-38
 Identities = 12/15 (80%), Positives = 13/15 (86%)

Query:   458 PYWIVKNSWGSDWGE 472
             PYWI+KNSWG  WGE
Sbjct:   325 PYWIIKNSWGKKWGE 339


>RGD|2448 [details] [associations]
            symbol:Ctsl1 "cathepsin L1" species:10116 "Rattus norvegicus"
          [GO:0002250 "adaptive immune response" evidence=ISO] [GO:0004177
          "aminopeptidase activity" evidence=IDA] [GO:0004197 "cysteine-type
          endopeptidase activity" evidence=ISO;IDA] [GO:0005576 "extracellular
          region" evidence=IEA] [GO:0005737 "cytoplasm" evidence=IDA]
          [GO:0005764 "lysosome" evidence=ISO;IDA] [GO:0005773 "vacuole"
          evidence=IDA] [GO:0005902 "microvillus" evidence=IDA] [GO:0006508
          "proteolysis" evidence=IEP;ISO] [GO:0007154 "cell communication"
          evidence=IDA] [GO:0007283 "spermatogenesis" evidence=IEP] [GO:0008234
          "cysteine-type peptidase activity" evidence=ISO] [GO:0008584 "male
          gonad development" evidence=IEP] [GO:0009267 "cellular response to
          starvation" evidence=IEP] [GO:0009749 "response to glucose stimulus"
          evidence=IEP] [GO:0009897 "external side of plasma membrane"
          evidence=IDA] [GO:0010259 "multicellular organismal aging"
          evidence=IEP] [GO:0014070 "response to organic cyclic compound"
          evidence=IEP] [GO:0021675 "nerve development" evidence=IEP]
          [GO:0030984 "kininogen binding" evidence=IPI] [GO:0032403 "protein
          complex binding" evidence=IPI] [GO:0034698 "response to gonadotropin
          stimulus" evidence=IEP] [GO:0042277 "peptide binding" evidence=IDA]
          [GO:0042393 "histone binding" evidence=ISO] [GO:0043005 "neuron
          projection" evidence=IDA] [GO:0043204 "perikaryon" evidence=IDA]
          [GO:0046697 "decidualization" evidence=IEP] [GO:0048102 "autophagic
          cell death" evidence=IEP] [GO:0051384 "response to glucocorticoid
          stimulus" evidence=IEP] [GO:0060008 "Sertoli cell differentiation"
          evidence=IEP] [GO:0097067 "cellular response to thyroid hormone
          stimulus" evidence=ISO] [GO:0030141 "secretory granule" evidence=IDA]
          [GO:0045177 "apical part of cell" evidence=IDA] [GO:0060441
          "epithelial tube branching involved in lung morphogenesis"
          evidence=ISO] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
          PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 EMBL:Y00697 RGD:2448
          GO:GO:0005576 GO:GO:0009897 GO:GO:0034698 GO:GO:0043204 GO:GO:0009749
          GO:GO:0051384 eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0006508
          InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
          PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
          PROSITE:PS00139 PROSITE:PS00639 GO:GO:0043005 GO:GO:0007283
          GO:GO:0004177 GO:GO:0005764 GO:GO:0042277 GO:GO:0009267 GO:GO:0021675
          GO:GO:0005902 GO:GO:0010259 GO:GO:0004197 GO:GO:0048102 GO:GO:0046697
          GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 KO:K01365
          OrthoDB:EOG48PMKF MEROPS:C01.032 OMA:FDQNLDT CTD:1514
          BRENDA:3.4.22.15 GO:GO:0060008 EMBL:AF025476 EMBL:BC063175
          EMBL:S85184 IPI:IPI00326070 PIR:S07098 RefSeq:NP_037288.1
          UniGene:Rn.1294 ProteinModelPortal:P07154 SMR:P07154 IntAct:P07154
          STRING:P07154 PhosphoSite:P07154 PRIDE:P07154
          Ensembl:ENSRNOT00000025462 GeneID:25697 KEGG:rno:25697 UCSC:RGD:2448
          InParanoid:P07154 SABIO-RK:P07154 BindingDB:P07154 ChEMBL:CHEMBL2305
          NextBio:607715 Genevestigator:P07154 GermOnline:ENSRNOG00000018566
          Uniprot:P07154
        Length = 334

 Score = 308 (113.5 bits), Expect = 1.3e-28, Sum P(2) = 1.3e-28
 Identities = 65/154 (42%), Positives = 90/154 (58%)

Query:   296 LPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDC--DM 353
             +P+  DWR +G ++ VK QG+C  CWAFSA G +E    ++   L  LS Q LVDC  D 
Sbjct:   114 IPKTVDWREKGCVTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHDQ 173

Query:   354 SNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPY 413
              N GCNGG MD A QYI +NGG+ S+++YPY+A +   G               +  IP 
Sbjct:   174 GNQGCNGGLMDFAFQYIKENGGLDSEESYPYEAKD---GSCKYRAEYAVANDTGFVDIPQ 230

Query:   414 GEEEEMKKWVATRGPLSVGMNAN--GLFYYSGGV 445
              +E+ + K VAT GP+SV M+A+   L +YS G+
Sbjct:   231 -QEKALMKAVATVGPISVAMDASHPSLQFYSSGI 263

 Score = 293 (108.2 bits), Expect = 2.3e-38, Sum P(3) = 2.3e-38
 Identities = 65/158 (41%), Positives = 91/158 (57%)

Query:   497 KLSRLATEKLVDC--DMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXX 554
             KL  L+ + LVDC  D  N GCNGG MD A QYI +NGG+ S+++YPY+A +   G    
Sbjct:   157 KLISLSEQNLVDCSHDQGNQGCNGGLMDFAFQYIKENGGLDSEESYPYEAKD---GSCKY 213

Query:   555 XXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNAN--GLFYYSGGVIDLNQRLCN 612
                        +  IP  +E+ + K VAT GP+SV M+A+   L +YS G+    +  C+
Sbjct:   214 RAEYAVANDTGFVDIPQ-QEKALMKAVATVGPISVAMDASHPSLQFYSSGIY--YEPNCS 270

Query:   613 PKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWG 650
              K  +H +++VGYG E   D     YW+VKNSWG +WG
Sbjct:   271 SKDLDHGVLVVGYGYEGT-DSNKDKYWLVKNSWGKEWG 307

 Score = 136 (52.9 bits), Expect = 5.5e-15, Sum P(3) = 5.5e-15
 Identities = 42/149 (28%), Positives = 62/149 (41%)

Query:   352 DMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESE---RGCLXXXXXXXXXXXXXY 408
             D  N GCNGG MD A QYI +NGG+ S+++YPY+A +     R                 
Sbjct:   172 DQGNQGCNGGLMDFAFQYIKENGGLDSEESYPYEAKDGSCKYRAEYAVANDTGFVDIPQQ 231

Query:   409 SRI---PYGEEEEMKKWVATRGPLSVGMNANGLFYYSG--------GVIDLNQRLYGTSI 457
              +           +   +    P S+   ++G++Y           GV+ +     GT  
Sbjct:   232 EKALMKAVATVGPISVAMDASHP-SLQFYSSGIYYEPNCSSKDLDHGVLVVGYGYEGTDS 290

Query:   458 ---PYWIVKNSWGSDWGEKVEDKVGSSGN 483
                 YW+VKNSWG +WG     K+    N
Sbjct:   291 NKDKYWLVKNSWGKEWGMDGYIKIAKDRN 319

 Score = 127 (49.8 bits), Expect = 2.3e-38, Sum P(3) = 2.3e-38
 Identities = 22/52 (42%), Positives = 32/52 (61%)

Query:   148 LPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQH 199
             +P+  DWR +G ++ VK QG+C  CWAFSA G +E    ++   L  LS Q+
Sbjct:   114 IPKTVDWREKGCVTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQN 165

 Score = 41 (19.5 bits), Expect = 2.3e-38, Sum P(3) = 2.3e-38
 Identities = 14/58 (24%), Positives = 29/58 (50%)

Query:    52 HDKVYSSVEDLLRR---HENF-VTNVEKAEDYQREDSGTAVFEVNKFFDLSDSDLQQL 105
             H ++Y + E+  RR    +N  +  +   E Y     G  + E+N F D+++ + +Q+
Sbjct:    36 HRRLYGTNEEEWRRAVWEKNMRMIQLHNGE-YSNGKHGFTM-EMNAFGDMTNEEFRQI 91


>FB|FBgn0260462 [details] [associations]
            symbol:CG12163 species:7227 "Drosophila melanogaster"
            [GO:0035071 "salivary gland cell autophagic cell death"
            evidence=IEP] [GO:0048102 "autophagic cell death" evidence=IEP]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0006508 "proteolysis" evidence=ISS] [GO:0004869 "cysteine-type
            endopeptidase inhibitor activity" evidence=IEA] [GO:0045169
            "fusome" evidence=IDA] [GO:0035220 "wing disc development"
            evidence=IGI] [GO:0022416 "chaeta development" evidence=IGI]
            InterPro:IPR000010 InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00043 SMART:SM00645
            InterPro:IPR000169 EMBL:AE014297 GO:GO:0004869 eggNOG:COG4870
            KO:K01373 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0022416 GO:GO:0035220 GO:GO:0035071
            GO:GO:0045169 GeneTree:ENSGT00660000095458 EMBL:AY121614
            EMBL:BT003231 RefSeq:NP_649521.1 RefSeq:NP_730901.1
            RefSeq:NP_730902.2 UniGene:Dm.7315 ProteinModelPortal:Q9VN93
            SMR:Q9VN93 DIP:DIP-17491N IntAct:Q9VN93 MINT:MINT-763966
            STRING:Q9VN93 MEROPS:C01.A27 PaxDb:Q9VN93
            EnsemblMetazoa:FBtr0078823 GeneID:40628 KEGG:dme:Dmel_CG12163
            UCSC:CG12163-RA FlyBase:FBgn0260462 InParanoid:Q9VN93 OMA:GPRWGEQ
            OrthoDB:EOG4CC2G9 PhylomeDB:Q9VN93 GenomeRNAi:40628 NextBio:819744
            Bgee:Q9VN93 GermOnline:CG12163 Uniprot:Q9VN93
        Length = 614

 Score = 344 (126.2 bits), Expect = 2.5e-38, Sum P(4) = 2.5e-38
 Identities = 62/154 (40%), Positives = 94/154 (61%)

Query:   292 HGDDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDC 351
             HG+ LP+ FDWR +  +++VK QG C  CWAFS  G +E ++A++   L E S Q+L+DC
Sbjct:   391 HGE-LPKEFDWRQKDAVTQVKNQGSCGSCWAFSVTGNIEGLYAVKTGELKEFSEQELLDC 449

Query:   352 DMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRI 411
             D ++  CNGG MD+A + I D GG+  +  YPYKA +++  C              +  +
Sbjct:   450 DTTDSACNGGLMDNAYKAIKDIGGLEYEAEYPYKAKKNQ--C-HFNRTLSHVQVAGFVDL 506

Query:   412 PYGEEEEMKKWVATRGPLSVGMNANGLFYYSGGV 445
             P G E  M++W+   GP+S+G+NAN + +Y GGV
Sbjct:   507 PKGNETAMQEWLLANGPISIGINANAMQFYRGGV 540

 Score = 323 (118.8 bits), Expect = 9.9e-30, Sum P(3) = 9.9e-30
 Identities = 63/170 (37%), Positives = 100/170 (58%)

Query:   485 TRDLE-LTGVLPSKLSRLATEKLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYK 543
             T ++E L  V   +L   + ++L+DCD ++  CNGG MD+A + I D GG+  +  YPYK
Sbjct:   424 TGNIEGLYAVKTGELKEFSEQELLDCDTTDSACNGGLMDNAYKAIKDIGGLEYEAEYPYK 483

Query:   544 ASESERGCLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLFYYSGGV 603
             A +++  C              +  +P G E  M++W+   GP+S+G+NAN + +Y GGV
Sbjct:   484 AKKNQ--C-HFNRTLSHVQVAGFVDLPKGNETAMQEWLLANGPISIGINANAMQFYRGGV 540

Query:   604 IDLNQRLCNPKAQNHALIIVGYGEEEKKD-GTSIPYWIVKNSWGSDWGEK 652
                 + LC+ K  +H +++VGYG  +  +   ++PYWIVKNSWG  WGE+
Sbjct:   541 SHPWKALCSKKNLDHGVLVVGYGVSDYPNFHKTLPYWIVKNSWGPRWGEQ 590

 Score = 148 (57.2 bits), Expect = 3.7e-14, Sum P(3) = 3.7e-14
 Identities = 25/55 (45%), Positives = 36/55 (65%)

Query:   144 HGDDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
             HG+ LP+ FDWR +  +++VK QG C  CWAFS  G +E ++A++   L E S Q
Sbjct:   391 HGE-LPKEFDWRQKDAVTQVKNQGSCGSCWAFSVTGNIEGLYAVKTGELKEFSEQ 444

 Score = 91 (37.1 bits), Expect = 2.5e-38, Sum P(4) = 2.5e-38
 Identities = 17/38 (44%), Positives = 23/38 (60%)

Query:   436 NGLFYYSGGVIDLNQRLYGTSIPYWIVKNSWGSDWGEK 473
             +G+     GV D     +  ++PYWIVKNSWG  WGE+
Sbjct:   555 HGVLVVGYGVSDYPN--FHKTLPYWIVKNSWGPRWGEQ 590

 Score = 76 (31.8 bits), Expect = 2.5e-38, Sum P(4) = 2.5e-38
 Identities = 15/53 (28%), Positives = 30/53 (56%)

Query:   204 YSSVEDLLRRHENFVTNVEKAEDYQSEDSGTAVFGVNKFFDLSESDLQQLTGL 256
             Y S  +   R   F  N++  E+  + + G+A +G+ +F D++ S+ ++ TGL
Sbjct:   319 YVSTAERQMRLRIFRQNLKTIEELNANEMGSAKYGITEFADMTSSEYKERTGL 371

 Score = 69 (29.3 bits), Expect = 6.4e-37, Sum P(3) = 6.4e-37
 Identities = 16/64 (25%), Positives = 31/64 (48%)

Query:    45 FLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAEDYQREDSGTAVFEVNKFFDLSDSDLQQ 104
             F  F     + Y S  +   R   F  N++  E+    + G+A + + +F D++ S+ ++
Sbjct:   308 FYKFQVRFGRRYVSTAERQMRLRIFRQNLKTIEELNANEMGSAKYGITEFADMTSSEYKE 367

Query:   105 LTGL 108
              TGL
Sbjct:   368 RTGL 371

 Score = 47 (21.6 bits), Expect = 1.2e-34, Sum P(3) = 1.2e-34
 Identities = 26/91 (28%), Positives = 40/91 (43%)

Query:   177 AVGVVEAMHAIQGNNLTELSVQHHDKVYSSVEDLLRRHENFVTNVEKAEDYQSEDSGTAV 236
             AV  V    A +   L E  V H  +   S  D+L RH+ +     KA+  +S D  TA 
Sbjct:   156 AVNSVAGDPAEKARLLNEKYV-HRSR--RSANDILGRHKPYDEEAAKAQLQKSLDKLTAG 212

Query:   237 FGVN-KFFDLSESDLQQLTGL--NLDSTLED 264
              G + K   +  +  Q  +G+   +D+ L D
Sbjct:   213 EGPHYKIVKVYSASRQVDSGILTRIDADLID 243

 Score = 40 (19.1 bits), Expect = 2.5e-38, Sum P(4) = 2.5e-38
 Identities = 14/44 (31%), Positives = 20/44 (45%)

Query:    44 RFLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAEDYQREDSGTA 87
             R LN    H +   S  D+L RH+ +     KA+  +  D  TA
Sbjct:   169 RLLNEKYVH-RSRRSANDILGRHKPYDEEAAKAQLQKSLDKLTA 211


>UNIPROTKB|P25774 [details] [associations]
            symbol:CTSS "Cathepsin S" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0016020 "membrane"
            evidence=IEA] [GO:0005576 "extracellular region" evidence=NAS]
            [GO:0005764 "lysosome" evidence=IDA;NAS] [GO:0097067 "cellular
            response to thyroid hormone stimulus" evidence=IEP] [GO:0002250
            "adaptive immune response" evidence=IEP] [GO:0019882 "antigen
            processing and presentation" evidence=TAS] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=TAS] [GO:0006955
            "immune response" evidence=TAS] [GO:0002474 "antigen processing and
            presentation of peptide antigen via MHC class I" evidence=TAS]
            [GO:0002480 "antigen processing and presentation of exogenous
            peptide antigen via MHC class I, TAP-independent" evidence=TAS]
            [GO:0019886 "antigen processing and presentation of exogenous
            peptide antigen via MHC class II" evidence=TAS] [GO:0036021
            "endolysosome lumen" evidence=TAS] [GO:0042590 "antigen processing
            and presentation of exogenous peptide antigen via MHC class I"
            evidence=TAS] [GO:0043202 "lysosomal lumen" evidence=TAS]
            [GO:0045087 "innate immune response" evidence=TAS] [GO:0043231
            "intracellular membrane-bounded organelle" evidence=IDA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 Reactome:REACT_118779
            Reactome:REACT_6900 GO:GO:0005576 GO:GO:0002480 GO:GO:0016020
            GO:GO:0019886 eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045087 EMBL:CH471121
            GO:GO:0004197 GO:GO:0002250 HOVERGEN:HBG011513 GO:GO:0097067
            GO:GO:0036021 EMBL:AL356292 CTD:1520 KO:K01368 OMA:KAMDQKC
            OrthoDB:EOG4JM7Q2 EMBL:S93414 EMBL:M86553 EMBL:M90696 EMBL:U07374
            EMBL:U07370 EMBL:U07371 EMBL:U07372 EMBL:U07373 EMBL:CR541676
            EMBL:AK301472 EMBL:AK314482 EMBL:BC002642 IPI:IPI00299150
            IPI:IPI00910216 PIR:A42482 RefSeq:NP_001186668.1 RefSeq:NP_004070.3
            UniGene:Hs.181301 PDB:1BXF PDB:1GLO PDB:1MS6 PDB:1NPZ PDB:1NQC
            PDB:2C0Y PDB:2F1G PDB:2FQ9 PDB:2FRA PDB:2FRQ PDB:2FT2 PDB:2FUD
            PDB:2FYE PDB:2G6D PDB:2G7Y PDB:2H7J PDB:2HH5 PDB:2HHN PDB:2HXZ
            PDB:2OP3 PDB:2R9M PDB:2R9N PDB:2R9O PDB:3IEJ PDB:3KWN PDB:3MPE
            PDB:3MPF PDB:3N3G PDB:3N4C PDB:3OVX PDBsum:1BXF PDBsum:1GLO
            PDBsum:1MS6 PDBsum:1NPZ PDBsum:1NQC PDBsum:2C0Y PDBsum:2F1G
            PDBsum:2FQ9 PDBsum:2FRA PDBsum:2FRQ PDBsum:2FT2 PDBsum:2FUD
            PDBsum:2FYE PDBsum:2G6D PDBsum:2G7Y PDBsum:2H7J PDBsum:2HH5
            PDBsum:2HHN PDBsum:2HXZ PDBsum:2OP3 PDBsum:2R9M PDBsum:2R9N
            PDBsum:2R9O PDBsum:3IEJ PDBsum:3KWN PDBsum:3MPE PDBsum:3MPF
            PDBsum:3N3G PDBsum:3N4C PDBsum:3OVX ProteinModelPortal:P25774
            SMR:P25774 IntAct:P25774 STRING:P25774 MEROPS:I29.004
            PhosphoSite:P25774 DMDM:88984046 PaxDb:P25774 PeptideAtlas:P25774
            PRIDE:P25774 DNASU:1520 Ensembl:ENST00000368985
            Ensembl:ENST00000448301 GeneID:1520 KEGG:hsa:1520 UCSC:uc001evn.3
            GeneCards:GC01M150702 HGNC:HGNC:2545 HPA:CAB000460 HPA:HPA002988
            MIM:116845 neXtProt:NX_P25774 PharmGKB:PA27041 InParanoid:P25774
            PhylomeDB:P25774 BRENDA:3.4.22.27 BindingDB:P25774
            ChEMBL:CHEMBL2954 ChiTaRS:CTSS EvolutionaryTrace:P25774
            GenomeRNAi:1520 NextBio:6291 PMAP-CutDB:P25774 ArrayExpress:P25774
            Bgee:P25774 CleanEx:HS_CTSS Genevestigator:P25774
            GermOnline:ENSG00000163131 Uniprot:P25774
        Length = 331

 Score = 383 (139.9 bits), Expect = 6.3e-35, P = 6.3e-35
 Identities = 83/196 (42%), Positives = 112/196 (57%)

Query:   296 LPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMS- 354
             LP++ DWR +G +++VK QG C  CWAFSAVG +EA   ++   L  LS Q LVDC    
Sbjct:   115 LPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEK 174

Query:   355 --NGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIP 412
               N GCNGG M  A QYIIDN G+ SD +YPYKA + +  C              Y+ +P
Sbjct:   175 YGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKAMDQK--C-QYDSKYRAATCSKYTELP 231

Query:   413 YGEEEEMKKWVATRGPLSVGMNAN--GLFYYSGGVI-------DLNQRL----YG--TSI 457
             YG E+ +K+ VA +GP+SVG++A     F Y  GV        ++N  +    YG     
Sbjct:   232 YGREDVLKEAVANKGPVSVGVDARHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYGDLNGK 291

Query:   458 PYWIVKNSWGSDWGEK 473
              YW+VKNSWG ++GE+
Sbjct:   292 EYWLVKNSWGHNFGEE 307

 Score = 297 (109.6 bits), Expect = 2.6e-38, Sum P(2) = 2.6e-38
 Identities = 66/161 (40%), Positives = 91/161 (56%)

Query:   497 KLSRLATEKLVDCDMS---NGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLX 553
             KL  L+ + LVDC      N GCNGG M  A QYIIDN G+ SD +YPYKA + +  C  
Sbjct:   158 KLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKAMDQK--C-Q 214

Query:   554 XXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNAN--GLFYYSGGVIDLNQRLC 611
                         Y+ +PYG E+ +K+ VA +GP+SVG++A     F Y  GV    +  C
Sbjct:   215 YDSKYRAATCSKYTELPYGREDVLKEAVANKGPVSVGVDARHPSFFLYRSGVY--YEPSC 272

Query:   612 NPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
                  NH +++VGYG+   K+     YW+VKNSWG ++GE+
Sbjct:   273 TQNV-NHGVLVVGYGDLNGKE-----YWLVKNSWGHNFGEE 307

 Score = 145 (56.1 bits), Expect = 2.6e-38, Sum P(2) = 2.6e-38
 Identities = 25/52 (48%), Positives = 35/52 (67%)

Query:   148 LPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQH 199
             LP++ DWR +G +++VK QG C  CWAFSAVG +EA   ++   L  LS Q+
Sbjct:   115 LPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQN 166


>TAIR|locus:2082881 [details] [associations]
            symbol:AT3G49340 species:3702 "Arabidopsis thaliana"
            [GO:0006508 "proteolysis" evidence=IEA;ISS] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA;ISS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 EMBL:CP002686
            GenomeReviews:BA000014_GR EMBL:AC012329 EMBL:AL132956
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 MEROPS:I29.003 HOGENOM:HOG000230773 HSSP:P07711
            KO:K01376 IPI:IPI00520642 PIR:T45839 RefSeq:NP_566920.1
            UniGene:At.53854 ProteinModelPortal:Q9SG15 SMR:Q9SG15
            EnsemblPlants:AT3G49340.1 GeneID:824096 KEGG:ath:AT3G49340
            TAIR:At3g49340 InParanoid:Q9SG15 OMA:PQNDEEA PhylomeDB:Q9SG15
            ProtClustDB:CLSN2688476 Genevestigator:Q9SG15 Uniprot:Q9SG15
        Length = 341

 Score = 386 (140.9 bits), Expect = 2.9e-35, P = 2.9e-35
 Identities = 96/289 (33%), Positives = 143/289 (49%)

Query:   201 DKVYSSVEDLLRRHENFVTNVEKAEDYQSEDSGTAVFGVNKFFDLSESDLQ-QLTGLNLD 259
             ++VYS   +   R E F  N++  E      + T    VN+F DL++ + + + TGL   
Sbjct:    43 NRVYSDDSEKTSRFEIFTNNLKFVESINMNTNKTYTLDVNEFSDLTDEEFKARYTGL--- 99

Query:   260 STLEDIQPSLQAPFSSNQTDT-EMRAFQFNSLRHGDDLPEAFDWRAEGVISKVKEQGKCA 318
                  + P      S+  TD+ E  +F++ ++    +  E+ DW  EG ++ VK Q +C 
Sbjct:   100 -----VVPEGMTRIST--TDSHETVSFRYENV---GETGESMDWIQEGAVTSVKHQQQCG 149

Query:   319 CCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSNGGCNGGRMDDALQYIIDNGGVVS 378
             CCWAFSAV  VE M  I    L  LS QQL+DC   N GC GG M  A  YI +N G+ +
Sbjct:   150 CCWAFSAVAAVEGMTKIANGELVSLSEQQLLDCSTENNGCGGGIMWKAFDYIKENQGITT 209

Query:   379 DQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANG- 437
             +  YPY+ ++    C              Y  +P  +EE + K V+ + P+SV +  +G 
Sbjct:   210 EDNYPYQGAQQT--C--ESNHLAAATISGYETVPQNDEEALLKAVSQQ-PVSVAIEGSGY 264

Query:   438 -LFYYSGGVID------LNQRL----YGTS---IPYWIVKNSWGSDWGE 472
                +YSGG+ +      L   +    YG S   I YW++KNSWG  WGE
Sbjct:   265 EFIHYSGGIFNGECGTQLTHAVTIVGYGVSEEGIKYWLLKNSWGESWGE 313

 Score = 250 (93.1 bits), Expect = 5.0e-38, Sum P(2) = 5.0e-38
 Identities = 55/164 (33%), Positives = 85/164 (51%)

Query:   490 LTGVLPSKLSRLATEKLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESER 549
             +T +   +L  L+ ++L+DC   N GC GG M  A  YI +N G+ ++  YPY+ ++   
Sbjct:   163 MTKIANGELVSLSEQQLLDCSTENNGCGGGIMWKAFDYIKENQGITTEDNYPYQGAQQT- 221

Query:   550 GCLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANG--LFYYSGGVIDLN 607
              C              Y  +P  +EE + K V+ + P+SV +  +G    +YSGG+ +  
Sbjct:   222 -C--ESNHLAAATISGYETVPQNDEEALLKAVSQQ-PVSVAIEGSGYEFIHYSGGIFNGE 277

Query:   608 QRLCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGE 651
                C  +   HA+ IVGYG  E+     I YW++KNSWG  WGE
Sbjct:   278 ---CGTQL-THAVTIVGYGVSEE----GIKYWLLKNSWGESWGE 313

 Score = 197 (74.4 bits), Expect = 5.0e-38, Sum P(2) = 5.0e-38
 Identities = 59/185 (31%), Positives = 88/185 (47%)

Query:    20 FMIKVALLESNI--FQTRGYL--NSPVTRFLNFMRDHDKVYSSVEDLLRRHENFVTNVEK 75
             F +   LL S      +RG L   S V +   +M   ++VYS   +   R E F  N++ 
Sbjct:     6 FFLLAILLSSRTSGVTSRGGLFEASAVEKHEQWMSRFNRVYSDDSEKTSRFEIFTNNLKF 65

Query:    76 AEDYQREDSGTAVFEVNKFFDLSDSDLQ-QLTGLNLDSTLEDIQPSLQAPFSSNQTDT-E 133
              E      + T   +VN+F DL+D + + + TGL        + P      S+  TD+ E
Sbjct:    66 VESINMNTNKTYTLDVNEFSDLTDEEFKARYTGL--------VVPEGMTRIST--TDSHE 115

Query:   134 MRAFQFNSLRHGDDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLT 193
               +F++ ++    +  E+ DW  EG ++ VK Q +C CCWAFSAV  VE M  I    L 
Sbjct:   116 TVSFRYENV---GETGESMDWIQEGAVTSVKHQQQCGCCWAFSAVAAVEGMTKIANGELV 172

Query:   194 ELSVQ 198
              LS Q
Sbjct:   173 SLSEQ 177


>UNIPROTKB|Q8HY81 [details] [associations]
            symbol:CTSS "Cathepsin S" species:9615 "Canis lupus
            familiaris" [GO:0005764 "lysosome" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0006508
            "proteolysis" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 HOVERGEN:HBG011513
            CTD:1520 KO:K01368 OrthoDB:EOG4JM7Q2 EMBL:AY156692
            RefSeq:NP_001002938.2 UniGene:Cfa.1661 ProteinModelPortal:Q8HY81
            SMR:Q8HY81 STRING:Q8HY81 MEROPS:C01.034 GeneID:403400
            KEGG:cfa:403400 InParanoid:Q8HY81 NextBio:20816922 Uniprot:Q8HY81
        Length = 331

 Score = 379 (138.5 bits), Expect = 1.8e-34, P = 1.8e-34
 Identities = 96/260 (36%), Positives = 139/260 (53%)

Query:   243 FDLSESDLQQLTGLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDW 302
             +DL  + L  +TG  + S    +  SL+ P S  Q +   R+   NS      LP++ DW
Sbjct:    73 YDLGMNHLGDMTGEEVIS----LMGSLRVP-SQWQRNVTYRS---NS---NQKLPDSVDW 121

Query:   303 RAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMS---NGGCN 359
             R +G +++VK QG C  CWAFSAVG +EA   ++   L  LS Q LVDC      N GCN
Sbjct:   122 REKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCN 181

Query:   360 GGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYGEEEEM 419
             GG M  A QYIIDN G+ S+ +YPYKA   +  C              Y+ +P+G E+ +
Sbjct:   182 GGFMTTAFQYIIDNNGIDSEASYPYKAMNGK--C-RYDSKKRAATCSKYTELPFGSEDAL 238

Query:   420 KKWVATRGPLSVGMNAN--GLFYYSGGVI-------DLNQRL----YGT--SIPYWIVKN 464
             K+ VA +GP+SV ++A+    F Y  GV        ++N  +    YG      YW+VKN
Sbjct:   239 KEAVANKGPVSVAIDASHYSFFLYRSGVYYEPSCTQNVNHGVLVVGYGNLNGKDYWLVKN 298

Query:   465 SWGSDWGEKVEDKVG-SSGN 483
             SWG ++G++   ++  +SGN
Sbjct:   299 SWGLNFGDQGYIRMARNSGN 318

 Score = 283 (104.7 bits), Expect = 1.3e-37, Sum P(2) = 1.3e-37
 Identities = 63/161 (39%), Positives = 89/161 (55%)

Query:   497 KLSRLATEKLVDCDMS---NGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLX 553
             KL  L+ + LVDC      N GCNGG M  A QYIIDN G+ S+ +YPYKA   +  C  
Sbjct:   158 KLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNNGIDSEASYPYKAMNGK--C-R 214

Query:   554 XXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNAN--GLFYYSGGVIDLNQRLC 611
                         Y+ +P+G E+ +K+ VA +GP+SV ++A+    F Y  GV    +  C
Sbjct:   215 YDSKKRAATCSKYTELPFGSEDALKEAVANKGPVSVAIDASHYSFFLYRSGVY--YEPSC 272

Query:   612 NPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
                  NH +++VGYG    KD     YW+VKNSWG ++G++
Sbjct:   273 TQNV-NHGVLVVGYGNLNGKD-----YWLVKNSWGLNFGDQ 307

 Score = 153 (58.9 bits), Expect = 1.3e-37, Sum P(2) = 1.3e-37
 Identities = 39/105 (37%), Positives = 57/105 (54%)

Query:    95 FDLSDSDLQQLTGLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDW 154
             +DL  + L  +TG  + S    +  SL+ P S  Q +   R+   NS      LP++ DW
Sbjct:    73 YDLGMNHLGDMTGEEVIS----LMGSLRVP-SQWQRNVTYRS---NS---NQKLPDSVDW 121

Query:   155 RAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQH 199
             R +G +++VK QG C  CWAFSAVG +EA   ++   L  LS Q+
Sbjct:   122 REKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQN 166


>UNIPROTKB|P25326 [details] [associations]
            symbol:CTSS "Cathepsin S" species:9913 "Bos taurus"
            [GO:0005515 "protein binding" evidence=IPI] [GO:0005764 "lysosome"
            evidence=IEA] [GO:0097067 "cellular response to thyroid hormone
            stimulus" evidence=IEA] [GO:0016020 "membrane" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0002250 "adaptive
            immune response" evidence=IEA] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0016020 eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GO:GO:0002250 GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513
            GO:GO:0097067 EMBL:BC102245 EMBL:M95211 EMBL:X62001 IPI:IPI00702008
            PIR:S15844 RefSeq:NP_001028787.1 UniGene:Bt.7938
            ProteinModelPortal:P25326 SMR:P25326 STRING:P25326 PRIDE:P25326
            Ensembl:ENSBTAT00000022774 GeneID:327711 KEGG:bta:327711 CTD:1520
            InParanoid:P25326 KO:K01368 OMA:KAMDQKC OrthoDB:EOG4JM7Q2
            NextBio:20810175 Uniprot:P25326
        Length = 331

 Score = 377 (137.8 bits), Expect = 2.9e-34, P = 2.9e-34
 Identities = 84/207 (40%), Positives = 119/207 (57%)

Query:   296 LPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMS- 354
             LP++ DWR +G +++VK QG C  CWAFSAVG +EA   ++   L  LS Q LVDC  + 
Sbjct:   115 LPDSMDWREKGCVTEVKYQGACGSCWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCSTAK 174

Query:   355 --NGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIP 412
               N GCNGG M +A QYIIDN G+ S+ +YPYKA + +  C              Y  +P
Sbjct:   175 YGNKGCNGGFMTEAFQYIIDNNGIDSEASYPYKAMDGK--C-QYDVKNRAATCSRYIELP 231

Query:   413 YGEEEEMKKWVATRGPLSVGMNAN--GLFYYSGGVI-------DLNQRL----YGT--SI 457
             +G EE +K+ VA +GP+SVG++A+    F Y  GV        ++N  +    YG     
Sbjct:   232 FGSEEALKEAVANKGPVSVGIDASHSSFFLYKTGVYYDPSCTQNVNHGVLVVGYGNLDGK 291

Query:   458 PYWIVKNSWGSDWGEKVEDKVG-SSGN 483
              YW+VKNSWG  +G++   ++  +SGN
Sbjct:   292 DYWLVKNSWGLHFGDQGYIRMARNSGN 318

 Score = 291 (107.5 bits), Expect = 1.5e-37, Sum P(2) = 1.5e-37
 Identities = 65/161 (40%), Positives = 91/161 (56%)

Query:   497 KLSRLATEKLVDCDMS---NGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLX 553
             KL  L+ + LVDC  +   N GCNGG M +A QYIIDN G+ S+ +YPYKA + +  C  
Sbjct:   158 KLVSLSAQNLVDCSTAKYGNKGCNGGFMTEAFQYIIDNNGIDSEASYPYKAMDGK--C-Q 214

Query:   554 XXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNAN--GLFYYSGGVIDLNQRLC 611
                         Y  +P+G EE +K+ VA +GP+SVG++A+    F Y  GV       C
Sbjct:   215 YDVKNRAATCSRYIELPFGSEEALKEAVANKGPVSVGIDASHSSFFLYKTGVY--YDPSC 272

Query:   612 NPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
                  NH +++VGYG  + KD     YW+VKNSWG  +G++
Sbjct:   273 TQNV-NHGVLVVGYGNLDGKD-----YWLVKNSWGLHFGDQ 307

 Score = 144 (55.7 bits), Expect = 1.5e-37, Sum P(2) = 1.5e-37
 Identities = 25/52 (48%), Positives = 35/52 (67%)

Query:   148 LPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQH 199
             LP++ DWR +G +++VK QG C  CWAFSAVG +EA   ++   L  LS Q+
Sbjct:   115 LPDSMDWREKGCVTEVKYQGACGSCWAFSAVGALEAQVKLKTGKLVSLSAQN 166


>TAIR|locus:2030427 [details] [associations]
            symbol:XCP2 "xylem cysteine peptidase 2" species:3702
            "Arabidopsis thaliana" [GO:0006508 "proteolysis" evidence=IEA;ISS]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA;ISS]
            [GO:0009507 "chloroplast" evidence=ISM] [GO:0008233 "peptidase
            activity" evidence=ISS] [GO:0005618 "cell wall" evidence=IDA]
            [GO:0010623 "developmental programmed cell death" evidence=IMP]
            [GO:0010075 "regulation of meristem growth" evidence=RCA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 EMBL:CP002684
            GenomeReviews:CT485782_GR GO:GO:0005886 GO:GO:0005618 GO:GO:0005773
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 EMBL:AC069251 EMBL:AC007369 GO:GO:0010623
            OMA:YKEIPEG HOGENOM:HOG000230773 KO:K16290 EMBL:AF191028
            EMBL:BT004822 IPI:IPI00526722 PIR:A86341 RefSeq:NP_564126.1
            UniGene:At.21316 ProteinModelPortal:Q9LM66 SMR:Q9LM66 IntAct:Q9LM66
            STRING:Q9LM66 MEROPS:C01.120 PaxDb:Q9LM66 PRIDE:Q9LM66
            ProMEX:Q9LM66 EnsemblPlants:AT1G20850.1 GeneID:838677
            KEGG:ath:AT1G20850 GeneFarm:5034 TAIR:At1g20850 InParanoid:Q9LM66
            PhylomeDB:Q9LM66 ProtClustDB:CLSN2917031 Genevestigator:Q9LM66
            GermOnline:AT1G20850 Uniprot:Q9LM66
        Length = 356

 Score = 406 (148.0 bits), Expect = 1.6e-37, P = 1.6e-37
 Identities = 104/299 (34%), Positives = 159/299 (53%)

Query:   192 LTELSVQHHDKVYSSVEDLLRRHENFVTNVEKAEDYQSEDSGTAVF-GVNKFFDLSESDL 250
             L E  + + +K Y +VE+   R E F  N++  ++  +   G + + G+N+F DLS  + 
Sbjct:    50 LFENWISNFEKAYETVEEKFLRFEVFKDNLKHIDE--TNKKGKSYWLGLNEFADLSHEEF 107

Query:   251 QQL-TGLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVIS 309
             +++  GL  D    D                E R++   + R  + +P++ DWR +G ++
Sbjct:   108 KKMYLGLKTDIVRRD----------------EERSYAEFAYRDVEAVPKSVDWRKKGAVA 151

Query:   310 KVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMS-NGGCNGGRMDDALQ 368
             +VK QG C  CWAFS V  VE ++ I   +LT LS Q+L+DCD + N GCNGG MD A +
Sbjct:   152 EVKNQGSCGSCWAFSTVAAVEGINKIVTGNLTTLSEQELIDCDTTYNNGCNGGLMDYAFE 211

Query:   369 YIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGP 428
             YI+ NGG+  ++ YPY   E    C              +  +P  +E+ + K +A + P
Sbjct:   212 YIVKNGGLRKEEDYPYSMEEGT--CEMQKDESETVTINGHQDVPTNDEKSLLKALAHQ-P 268

Query:   429 LSVGMNANGL-F-YYSGGV------IDLNQRL----YGTS--IPYWIVKNSWGSDWGEK 473
             LSV ++A+G  F +YSGGV      +DL+  +    YG+S    Y IVKNSWG  WGEK
Sbjct:   269 LSVAIDASGREFQFYSGGVFDGRCGVDLDHGVAAVGYGSSKGSDYIIVKNSWGPKWGEK 327

 Score = 345 (126.5 bits), Expect = 9.8e-31, P = 9.8e-31
 Identities = 91/274 (33%), Positives = 141/274 (51%)

Query:   206 SVEDLLRRHENFVTNVEKA-EDYQSEDSGTAVFGVN-KFFD-LSESDLQQLTGLN--LDS 260
             S + L+   EN+++N EKA E  + +     VF  N K  D  ++       GLN   D 
Sbjct:    43 SHDKLIELFENWISNFEKAYETVEEKFLRFEVFKDNLKHIDETNKKGKSYWLGLNEFADL 102

Query:   261 TLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVKEQGKCACC 320
             + E+ +       +      E R++   + R  + +P++ DWR +G +++VK QG C  C
Sbjct:   103 SHEEFKKMYLGLKTDIVRRDEERSYAEFAYRDVEAVPKSVDWRKKGAVAEVKNQGSCGSC 162

Query:   321 WAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMS-NGGCNGGRMDDALQYIIDNGGVVSD 379
             WAFS V  VE ++ I   +LT LS Q+L+DCD + N GCNGG MD A +YI+ NGG+  +
Sbjct:   163 WAFSTVAAVEGINKIVTGNLTTLSEQELIDCDTTYNNGCNGGLMDYAFEYIVKNGGLRKE 222

Query:   380 QAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGL- 438
             + YPY   E    C              +  +P  +E+ + K +A + PLSV ++A+G  
Sbjct:   223 EDYPYSMEEGT--CEMQKDESETVTINGHQDVPTNDEKSLLKALAHQ-PLSVAIDASGRE 279

Query:   439 F-YYSGGVIDLNQRLYGTSIPYWIVKNSWGSDWG 471
             F +YSGGV D   R  G  + + +    +GS  G
Sbjct:   280 FQFYSGGVFD--GRC-GVDLDHGVAAVGYGSSKG 310

 Score = 258 (95.9 bits), Expect = 1.7e-20, P = 1.7e-20
 Identities = 61/163 (37%), Positives = 88/163 (53%)

Query:   493 VLPSKLSRLATEKLVDCDMS-NGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGC 551
             ++   L+ L+ ++L+DCD + N GCNGG MD A +YI+ NGG+  ++ YPY   E    C
Sbjct:   177 IVTGNLTTLSEQELIDCDTTYNNGCNGGLMDYAFEYIVKNGGLRKEEDYPYSMEEGT--C 234

Query:   552 LXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGL-F-YYSGGVIDLNQR 609
                           +  +P  +E+ + K +A + PLSV ++A+G  F +YSGGV D    
Sbjct:   235 EMQKDESETVTINGHQDVPTNDEKSLLKALAHQ-PLSVAIDASGREFQFYSGGVFDGR-- 291

Query:   610 LCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
              C     +H +  VGYG  +  D     Y IVKNSWG  WGEK
Sbjct:   292 -CGVDL-DHGVAAVGYGSSKGSD-----YIIVKNSWGPKWGEK 327

 Score = 192 (72.6 bits), Expect = 3.5e-12, P = 3.5e-12
 Identities = 76/259 (29%), Positives = 119/259 (45%)

Query:    45 FLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAEDYQREDSGTAVFEVNKFFDLSDSDLQQ 104
             F N++ + +K Y +VE+   R E F  N++  ++  ++     +  +N+F DLS  + ++
Sbjct:    51 FENWISNFEKAYETVEEKFLRFEVFKDNLKHIDETNKKGKSYWL-GLNEFADLSHEEFKK 109

Query:   105 L-TGLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKV 163
             +  GL  D    D                E R++   + R  + +P++ DWR +G +++V
Sbjct:   110 MYLGLKTDIVRRD----------------EERSYAEFAYRDVEAVPKSVDWRKKGAVAEV 153

Query:   164 KEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQHH---DKVYSS-VEDLLRRH--ENF 217
             K QG C  CWAFS V  VE ++ I   NLT LS Q     D  Y++     L  +  E  
Sbjct:   154 KNQGSCGSCWAFSTVAAVEGINKIVTGNLTTLSEQELIDCDTTYNNGCNGGLMDYAFEYI 213

Query:   218 VTN--VEKAEDYQ-SEDSGTAVFGVNKFFDLSESDLQQLTGLNLDSTLEDIQPSLQA--- 271
             V N  + K EDY  S + GT    + K     ES+   + G + D    D +  L+A   
Sbjct:   214 VKNGGLRKEEDYPYSMEEGTCE--MQK----DESETVTING-HQDVPTNDEKSLLKALAH 266

Query:   272 -PFSSNQTDTEMRAFQFNS 289
              P S    D   R FQF S
Sbjct:   267 QPLSV-AIDASGREFQFYS 284


>UNIPROTKB|F1PAK0 [details] [associations]
            symbol:CTSS "Cathepsin S" species:9615 "Canis lupus
            familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            EMBL:AAEX03011051 Ensembl:ENSCAFT00000019176 OMA:YEPACTQ
            Uniprot:F1PAK0
        Length = 339

 Score = 378 (138.1 bits), Expect = 2.3e-34, P = 2.3e-34
 Identities = 96/260 (36%), Positives = 139/260 (53%)

Query:   243 FDLSESDLQQLTGLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDW 302
             +DL  + L  +TG  + S    +  SL+ P S  Q +   R+   NS      LP++ DW
Sbjct:    81 YDLGMNHLGDMTGEEVIS----LMGSLRVP-SQWQRNVTYRS---NS---NQKLPDSVDW 129

Query:   303 RAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMS---NGGCN 359
             R +G +++VK QG C  CWAFSAVG +EA   ++   L  LS Q LVDC      N GCN
Sbjct:   130 REKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCN 189

Query:   360 GGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYGEEEEM 419
             GG M  A QYIIDN G+ S+ +YPYKA   +  C              Y+ +P+G E+ +
Sbjct:   190 GGFMTTAFQYIIDNNGIDSEASYPYKAVNGK--C-RYDSKKRAATCSKYTELPFGSEDAL 246

Query:   420 KKWVATRGPLSVGMNAN--GLFYYSGGVI-------DLNQRL----YGT--SIPYWIVKN 464
             K+ VA +GP+SV ++A+    F Y  GV        ++N  +    YG      YW+VKN
Sbjct:   247 KEAVANKGPVSVAIDASHYSFFLYRSGVYYEPSCTQNVNHGVLVVGYGNLNGKDYWLVKN 306

Query:   465 SWGSDWGEKVEDKVG-SSGN 483
             SWG ++G++   ++  +SGN
Sbjct:   307 SWGLNFGDQGYIRMARNSGN 326

 Score = 282 (104.3 bits), Expect = 1.7e-37, Sum P(2) = 1.7e-37
 Identities = 63/161 (39%), Positives = 89/161 (55%)

Query:   497 KLSRLATEKLVDCDMS---NGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLX 553
             KL  L+ + LVDC      N GCNGG M  A QYIIDN G+ S+ +YPYKA   +  C  
Sbjct:   166 KLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNNGIDSEASYPYKAVNGK--C-R 222

Query:   554 XXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNAN--GLFYYSGGVIDLNQRLC 611
                         Y+ +P+G E+ +K+ VA +GP+SV ++A+    F Y  GV    +  C
Sbjct:   223 YDSKKRAATCSKYTELPFGSEDALKEAVANKGPVSVAIDASHYSFFLYRSGVY--YEPSC 280

Query:   612 NPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
                  NH +++VGYG    KD     YW+VKNSWG ++G++
Sbjct:   281 TQNV-NHGVLVVGYGNLNGKD-----YWLVKNSWGLNFGDQ 315

 Score = 153 (58.9 bits), Expect = 1.7e-37, Sum P(2) = 1.7e-37
 Identities = 39/105 (37%), Positives = 57/105 (54%)

Query:    95 FDLSDSDLQQLTGLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDW 154
             +DL  + L  +TG  + S    +  SL+ P S  Q +   R+   NS      LP++ DW
Sbjct:    81 YDLGMNHLGDMTGEEVIS----LMGSLRVP-SQWQRNVTYRS---NS---NQKLPDSVDW 129

Query:   155 RAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQH 199
             R +G +++VK QG C  CWAFSAVG +EA   ++   L  LS Q+
Sbjct:   130 REKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQN 174


>UNIPROTKB|E2RPX3 [details] [associations]
            symbol:CTSW "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00639
            GeneTree:ENSGT00660000095458 CTD:1521 KO:K08569 OMA:GRCGDGC
            EMBL:AAEX03011632 RefSeq:XP_540846.2 Ensembl:ENSCAFT00000020910
            GeneID:483725 KEGG:cfa:483725 Uniprot:E2RPX3
        Length = 374

 Score = 334 (122.6 bits), Expect = 2.2e-37, Sum P(2) = 2.2e-37
 Identities = 79/248 (31%), Positives = 127/248 (51%)

Query:   200 HDKVYSSVEDLLRRHENFVTNVEKAEDYQSEDSGTAVFGVNKFFDLSESDLQQLTGLNLD 259
             +++ YS+ E+  RR + F  N+ +A+  + ED GTA FGV  F DL+E +  Q  G    
Sbjct:    49 YNRSYSNPEEYARRLDIFAHNLAQAQQLEDEDLGTAEFGVTPFSDLTEEEFGQFYGHQ-- 106

Query:   260 STLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRA-EGVISKVKEQGKCA 318
                   + + +AP    + ++E           G+ +P   DWR   G+IS +K+QG C 
Sbjct:   107 ------RMAGEAPSVGRKVESE---------EWGEPVPPTCDWRKLPGIISPIKQQGNCR 151

Query:   319 CCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSNGGCNGGRMDDALQYIIDNGGVVS 378
             CCWA +A G +EA+  I+ +   E+SVQ+L+DC     GC GG   DA   +++N G+ S
Sbjct:   152 CCWAMAAAGNIEALWGIRYHQPVEVSVQELLDCGRCGDGCKGGFTWDAFITVLNNSGLAS 211

Query:   379 DQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGL 438
              + YP+  +     CL                +  G E+ +  ++AT+GP++V +N   L
Sbjct:   212 AKDYPFLGNTKPHRCLAKKYKKVAWIQDFI--MLQGNEQAIAWYLATKGPITVTINMKLL 269

Query:   439 FYYSGGVI 446
              +Y  GVI
Sbjct:   270 QHYQKGVI 277

 Score = 241 (89.9 bits), Expect = 9.4e-35, Sum P(2) = 9.4e-35
 Identities = 52/176 (29%), Positives = 93/176 (52%)

Query:   490 LTGVLPSKLSRLATEKLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESER 549
             L G+   +   ++ ++L+DC     GC GG   DA   +++N G+ S + YP+  +    
Sbjct:   165 LWGIRYHQPVEVSVQELLDCGRCGDGCKGGFTWDAFITVLNNSGLASAKDYPFLGNTKPH 224

Query:   550 GCLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLFYYSGGVIDLNQR 609
              CL                +  G E+ +  ++AT+GP++V +N   L +Y  GVI     
Sbjct:   225 RCLAKKYKKVAWIQDFI--MLQGNEQAIAWYLATKGPITVTINMKLLQHYQKGVIQATHT 282

Query:   610 LCNPKAQNHALIIVGYGE------EEKKDGTS-------IPYWIVKNSWGSDWGEK 652
              C+P+  +H++++VG+G+      ++ + G+S       IPYWI+KNSWG++WGE+
Sbjct:   283 TCDPQRVDHSVLLVGFGKSKSVAGKQAEGGSSRPRPHHPIPYWILKNSWGAEWGEE 338

 Score = 201 (75.8 bits), Expect = 9.4e-35, Sum P(2) = 9.4e-35
 Identities = 50/155 (32%), Positives = 80/155 (51%)

Query:    45 FLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAEDYQREDSGTAVFEVNKFFDLSDSDLQQ 104
             F  F   +++ YS+ E+  RR + F  N+ +A+  + ED GTA F V  F DL++ +  Q
Sbjct:    42 FALFQIQYNRSYSNPEEYARRLDIFAHNLAQAQQLEDEDLGTAEFGVTPFSDLTEEEFGQ 101

Query:   105 LTGLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRA-EGVISKV 163
               G          + + +AP    + ++E           G+ +P   DWR   G+IS +
Sbjct:   102 FYGHQ--------RMAGEAPSVGRKVESE---------EWGEPVPPTCDWRKLPGIISPI 144

Query:   164 KEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
             K+QG C CCWA +A G +EA+  I+ +   E+SVQ
Sbjct:   145 KQQGNCRCCWAMAAAGNIEALWGIRYHQPVEVSVQ 179

 Score = 128 (50.1 bits), Expect = 4.2e-27, Sum P(2) = 4.2e-27
 Identities = 27/68 (39%), Positives = 39/68 (57%)

Query:   145 GDDLPEAFDWRA-EGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQHHDKV 203
             G+ +P   DWR   G+IS +K+QG C CCWA +A G +EA+  I+          +H  V
Sbjct:   125 GEPVPPTCDWRKLPGIISPIKQQGNCRCCWAMAAAGNIEALWGIR----------YHQPV 174

Query:   204 YSSVEDLL 211
               SV++LL
Sbjct:   175 EVSVQELL 182

 Score = 97 (39.2 bits), Expect = 2.2e-37, Sum P(2) = 2.2e-37
 Identities = 20/53 (37%), Positives = 32/53 (60%)

Query:   457 IPYWIVKNSWGSDWGEKVEDKVGSSGNRTRDLELTGVLPSKLSRLATEKLVDC 509
             IPYWI+KNSWG++WGE+   ++   GN T  +    V  +++     ++LV C
Sbjct:   322 IPYWILKNSWGAEWGEEGYFRL-HRGNNTCGITKYPVT-ARVDLRVKKRLVSC 372


>TAIR|locus:2029924 [details] [associations]
            symbol:AT1G29090 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 EMBL:CP002684 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            HOGENOM:HOG000230773 HSSP:P53634 ProtClustDB:CLSN2688064
            EMBL:BT004146 IPI:IPI00545702 RefSeq:NP_564321.2 UniGene:At.40814
            ProteinModelPortal:Q84W75 SMR:Q84W75 MEROPS:C01.A15
            EnsemblPlants:AT1G29090.1 GeneID:839784 KEGG:ath:AT1G29090
            TAIR:At1g29090 InParanoid:Q84W75 OMA:SIRGHED PhylomeDB:Q84W75
            ArrayExpress:Q84W75 Genevestigator:Q84W75 Uniprot:Q84W75
        Length = 355

 Score = 390 (142.3 bits), Expect = 1.0e-35, P = 1.0e-35
 Identities = 105/291 (36%), Positives = 149/291 (51%)

Query:   202 KVYSSVEDLLRRHENFVTNVEKAEDYQSEDSGTAVFGVNKFFDLSESD-LQQLTGLNLDS 260
             +VYS   +   R + F  N++  E +  +   T   GVN+F D +  + +   TGL    
Sbjct:    56 RVYSDELEKQMRFDVFKKNLKFIEKFNKKGDRTYKLGVNEFADWTREEFIATHTGLK--- 112

Query:   261 TLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLP--EAFDWRAEGVISKVKEQGKCA 318
              +  I      P SS   D  + ++ +N      D+   E  DWR EG ++ VK QG+C 
Sbjct:   113 GVNGI------P-SSEFVDEMIPSWNWNV----SDVAGRETKDWRYEGAVTPVKYQGQCG 161

Query:   319 CCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMS-NGGCNGGRMDDALQYIIDNGGVV 377
             CCWAFS+V  VE +  I GN+L  LS QQL+DCD   + GCNGG M DA  YII N G+ 
Sbjct:   162 CCWAFSSVAAVEGLTKIVGNNLVSLSEQQLLDCDRERDNGCNGGIMSDAFSYIIKNRGIA 221

Query:   378 SDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANG 437
             S+ +YPY+A+E    C              +  +P   E  + + V+ + P+SV ++A+G
Sbjct:   222 SEASYPYQAAEGT--C--RYNGKPSAWIRGFQTVPSNNERALLEAVSKQ-PVSVSIDADG 276

Query:   438 --LFYYSGGVID-------LNQRL----YGTS---IPYWIVKNSWGSDWGE 472
                 +YSGGV D       +N  +    YGTS   I YW+ KNSWG  WGE
Sbjct:   277 PGFMHYSGGVYDEPYCGTNVNHAVTFVGYGTSPEGIKYWLAKNSWGETWGE 327

 Score = 267 (99.0 bits), Expect = 2.5e-37, Sum P(2) = 2.5e-37
 Identities = 61/165 (36%), Positives = 89/165 (53%)

Query:   490 LTGVLPSKLSRLATEKLVDCDMS-NGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESE 548
             LT ++ + L  L+ ++L+DCD   + GCNGG M DA  YII N G+ S+ +YPY+A+E  
Sbjct:   175 LTKIVGNNLVSLSEQQLLDCDRERDNGCNGGIMSDAFSYIIKNRGIASEASYPYQAAEGT 234

Query:   549 RGCLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANG--LFYYSGGVIDL 606
               C              +  +P   E  + + V+ + P+SV ++A+G    +YSGGV D 
Sbjct:   235 --C--RYNGKPSAWIRGFQTVPSNNERALLEAVSKQ-PVSVSIDADGPGFMHYSGGVYD- 288

Query:   607 NQRLCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGE 651
              +  C     NHA+  VGYG   +     I YW+ KNSWG  WGE
Sbjct:   289 -EPYCGTNV-NHAVTFVGYGTSPE----GIKYWLAKNSWGETWGE 327

 Score = 167 (63.8 bits), Expect = 2.5e-37, Sum P(2) = 2.5e-37
 Identities = 51/154 (33%), Positives = 75/154 (48%)

Query:    48 FMRDHDKVYSSVEDLLRRHENFVTNVEKAEDYQREDSGTAVFEVNKFFDLSDSD-LQQLT 106
             +M    +VYS   +   R + F  N++  E + ++   T    VN+F D +  + +   T
Sbjct:    50 WMTRFSRVYSDELEKQMRFDVFKKNLKFIEKFNKKGDRTYKLGVNEFADWTREEFIATHT 109

Query:   107 GLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLP--EAFDWRAEGVISKVK 164
             GL     +  I      P SS   D  + ++ +N      D+   E  DWR EG ++ VK
Sbjct:   110 GLK---GVNGI------P-SSEFVDEMIPSWNWNV----SDVAGRETKDWRYEGAVTPVK 155

Query:   165 EQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
              QG+C CCWAFS+V  VE +  I GNNL  LS Q
Sbjct:   156 YQGQCGCCWAFSSVAAVEGLTKIVGNNLVSLSEQ 189


>UNIPROTKB|H9KYW5 [details] [associations]
            symbol:CTSS "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0002250 "adaptive immune response" evidence=IEA]
            [GO:0005764 "lysosome" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] [GO:0016020 "membrane" evidence=IEA] [GO:0097067
            "cellular response to thyroid hormone stimulus" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00560000076577 OMA:YEPACTQ EMBL:AADN02010496
            Ensembl:ENSGALT00000001122 Uniprot:H9KYW5
        Length = 245

 Score = 369 (135.0 bits), Expect = 2.2e-33, P = 2.2e-33
 Identities = 91/232 (39%), Positives = 118/232 (50%)

Query:   261 TLEDIQP---SLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVKEQGKC 317
             T ED+      L+ P   NQT T  R       R G   P+A DWR +G +++VK QG C
Sbjct:     1 TSEDVAALLTGLRVPSGHNQTSTYRR-------RGG--APDAMDWREKGCVTEVKNQGAC 51

Query:   318 ACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDM--SNGGCNGGRMDDALQYIIDNGG 375
               CWAFSAVG +EA   ++   L  LS Q LVDC M   N GC GG M  A QYIIDN G
Sbjct:    52 GACWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCSMMYGNKGCGGGFMTRAFQYIIDNNG 111

Query:   376 VVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNA 435
             + S+++YPY A   + G               Y  +PY +E  +K  VA  GP+SV ++A
Sbjct:   112 IDSEESYPYMA---QNGTCQYNVSTRAATCSKYVELPYADEAALKDAVANVGPVSVAIDA 168

Query:   436 NG--LFYYSGGVID-------LNQRL----YGT--SIPYWIVKNSWGSDWGE 472
                  F Y  GV D       +N  +    YGT     +W+VKNSWG  +G+
Sbjct:   169 TQPTFFLYRSGVYDDPRCTQEVNHGVLVVGYGTLNEKDFWLVKNSWGERFGD 220

 Score = 275 (101.9 bits), Expect = 4.7e-37, Sum P(2) = 4.7e-37
 Identities = 63/159 (39%), Positives = 86/159 (54%)

Query:   497 KLSRLATEKLVDCDM--SNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXX 554
             KL  L+ + LVDC M   N GC GG M  A QYIIDN G+ S+++YPY A   + G    
Sbjct:    73 KLVSLSAQNLVDCSMMYGNKGCGGGFMTRAFQYIIDNNGIDSEESYPYMA---QNGTCQY 129

Query:   555 XXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANG--LFYYSGGVIDLNQRLCN 612
                        Y  +PY +E  +K  VA  GP+SV ++A     F Y  GV D + R C 
Sbjct:   130 NVSTRAATCSKYVELPYADEAALKDAVANVGPVSVAIDATQPTFFLYRSGVYD-DPR-CT 187

Query:   613 PKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGE 651
              +  NH +++VGYG   +KD     +W+VKNSWG  +G+
Sbjct:   188 QEV-NHGVLVVGYGTLNEKD-----FWLVKNSWGERFGD 220

 Score = 156 (60.0 bits), Expect = 4.7e-37, Sum P(2) = 4.7e-37
 Identities = 37/90 (41%), Positives = 48/90 (53%)

Query:   113 TLEDIQP---SLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVKEQGKC 169
             T ED+      L+ P   NQT T  R       R G   P+A DWR +G +++VK QG C
Sbjct:     1 TSEDVAALLTGLRVPSGHNQTSTYRR-------RGG--APDAMDWREKGCVTEVKNQGAC 51

Query:   170 ACCWAFSAVGVVEAMHAIQGNNLTELSVQH 199
               CWAFSAVG +EA   ++   L  LS Q+
Sbjct:    52 GACWAFSAVGALEAQVKLKTGKLVSLSAQN 81


>TAIR|locus:2825832 [details] [associations]
            symbol:RD21A "responsive to dehydration 21A" species:3702
            "Arabidopsis thaliana" [GO:0005576 "extracellular region"
            evidence=ISM] [GO:0006508 "proteolysis" evidence=IEA;IMP]
            [GO:0008234 "cysteine-type peptidase activity"
            evidence=ISS;IDA;IMP] [GO:0009507 "chloroplast" evidence=IDA]
            [GO:0005773 "vacuole" evidence=IDA] [GO:0008233 "peptidase
            activity" evidence=IDA] [GO:0048046 "apoplast" evidence=IDA]
            [GO:0005515 "protein binding" evidence=IPI] [GO:0005829 "cytosol"
            evidence=RCA] [GO:0009506 "plasmodesma" evidence=IDA] [GO:0050832
            "defense response to fungus" evidence=IMP] [GO:0006096 "glycolysis"
            evidence=RCA] [GO:0006833 "water transport" evidence=RCA]
            [GO:0006972 "hyperosmotic response" evidence=RCA] [GO:0007030
            "Golgi organization" evidence=RCA] [GO:0009266 "response to
            temperature stimulus" evidence=RCA] [GO:0009651 "response to salt
            stress" evidence=RCA] [GO:0015996 "chlorophyll catabolic process"
            evidence=RCA] [GO:0042744 "hydrogen peroxide catabolic process"
            evidence=RCA] [GO:0046686 "response to cadmium ion" evidence=RCA]
            [GO:0009414 "response to water deprivation" evidence=TAS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 EMBL:CP002684
            GenomeReviews:CT485782_GR GO:GO:0009506 GO:GO:0009507 GO:GO:0005773
            GO:GO:0050832 GO:GO:0048046 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 EMBL:AC083835
            HOGENOM:HOG000230773 KO:K01376 InterPro:IPR000118 Pfam:PF00396
            SMART:SM00277 UniGene:At.43549 EMBL:D13043 EMBL:AY072130
            EMBL:AY133781 IPI:IPI00530094 PIR:JN0719 RefSeq:NP_564497.1
            UniGene:At.47599 UniGene:At.71705 ProteinModelPortal:P43297
            SMR:P43297 IntAct:P43297 STRING:P43297 MEROPS:C01.064 PaxDb:P43297
            PRIDE:P43297 ProMEX:P43297 EnsemblPlants:AT1G47128.1 GeneID:841122
            KEGG:ath:AT1G47128 TAIR:At1g47128 InParanoid:P43297 OMA:EAWLVKH
            PhylomeDB:P43297 ProtClustDB:CLSN2688498 Genevestigator:P43297
            GermOnline:AT1G47128 Uniprot:P43297
        Length = 462

 Score = 397 (144.8 bits), Expect = 1.7e-36, P = 1.7e-36
 Identities = 101/287 (35%), Positives = 147/287 (51%)

Query:   206 SVEDLLRRHENFVTNVEKAEDYQS---EDSGTAVFGVN-KFFDL-SESDLQQLTGLNLDS 260
             S  +++  +E ++    KA+   S   +D    +F  N +F D  +E +L    GL   +
Sbjct:    42 SEAEVMSIYEAWLVKHGKAQSQNSLVEKDRRFEIFKDNLRFVDEHNEKNLSYRLGLTRFA 101

Query:   261 TLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVKEQGKCACC 320
              L + +   +   +  +   E R       R GD+LPE+ DWR +G +++VK+QG C  C
Sbjct:   102 DLTNDEYRSKYLGAKMEKKGERRTSLRYEARVGDELPESIDWRKKGAVAEVKDQGGCGSC 161

Query:   321 WAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMS-NGGCNGGRMDDALQYIIDNGGVVSD 379
             WAFS +G VE ++ I    L  LS Q+LVDCD S N GCNGG MD A ++II NGG+ +D
Sbjct:   162 WAFSTIGAVEGINQIVTGDLITLSEQELVDCDTSYNEGCNGGLMDYAFEFIIKNGGIDTD 221

Query:   380 QAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLF 439
             + YPYK  +    C              Y  +P   EE +KK VA + P+S+ + A G  
Sbjct:   222 KDYPYKGVDGT--CDQIRKNAKVVTIDSYEDVPTYSEESLKKAVAHQ-PISIAIEAGGRA 278

Query:   440 Y--YSGGVID------LNQRL----YGTSI--PYWIVKNSWGSDWGE 472
             +  Y  G+ D      L+  +    YGT     YWIV+NSWG  WGE
Sbjct:   279 FQLYDSGIFDGSCGTQLDHGVVAVGYGTENGKDYWIVRNSWGKSWGE 325

 Score = 280 (103.6 bits), Expect = 1.5e-34, Sum P(2) = 1.5e-34
 Identities = 63/162 (38%), Positives = 88/162 (54%)

Query:   493 VLPSKLSRLATEKLVDCDMS-NGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGC 551
             ++   L  L+ ++LVDCD S N GCNGG MD A ++II NGG+ +D+ YPYK  +    C
Sbjct:   176 IVTGDLITLSEQELVDCDTSYNEGCNGGLMDYAFEFIIKNGGIDTDKDYPYKGVDGT--C 233

Query:   552 LXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLFY--YSGGVIDLNQR 609
                           Y  +P   EE +KK VA + P+S+ + A G  +  Y  G+ D +  
Sbjct:   234 DQIRKNAKVVTIDSYEDVPTYSEESLKKAVAHQ-PISIAIEAGGRAFQLYDSGIFDGS-- 290

Query:   610 LCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGE 651
              C  +  +H ++ VGYG E  KD     YWIV+NSWG  WGE
Sbjct:   291 -CGTQL-DHGVVAVGYGTENGKD-----YWIVRNSWGKSWGE 325

 Score = 163 (62.4 bits), Expect = 1.5e-34, Sum P(2) = 1.5e-34
 Identities = 43/146 (29%), Positives = 74/146 (50%)

Query:    58 SVEDLLRRHENFVTNVEKAEDYQ---REDSGTAVFEVN-KFFDL-SDSDLQQLTGLNLDS 112
             S  +++  +E ++    KA+       +D    +F+ N +F D  ++ +L    GL   +
Sbjct:    42 SEAEVMSIYEAWLVKHGKAQSQNSLVEKDRRFEIFKDNLRFVDEHNEKNLSYRLGLTRFA 101

Query:   113 TLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVKEQGKCACC 172
              L + +   +   +  +   E R       R GD+LPE+ DWR +G +++VK+QG C  C
Sbjct:   102 DLTNDEYRSKYLGAKMEKKGERRTSLRYEARVGDELPESIDWRKKGAVAEVKDQGGCGSC 161

Query:   173 WAFSAVGVVEAMHAIQGNNLTELSVQ 198
             WAFS +G VE ++ I   +L  LS Q
Sbjct:   162 WAFSTIGAVEGINQIVTGDLITLSEQ 187


>UNIPROTKB|G3SSC1 [details] [associations]
            symbol:CTSH "Uncharacterized protein" species:9785
            "Loxodonta africana" [GO:0001656 "metanephros development"
            evidence=ISS] [GO:0001913 "T cell mediated cytotoxicity"
            evidence=ISS] [GO:0002764 "immune response-regulating signaling
            pathway" evidence=ISS] [GO:0004175 "endopeptidase activity"
            evidence=ISS] [GO:0004177 "aminopeptidase activity" evidence=ISS]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0004252 "serine-type endopeptidase activity" evidence=ISS]
            [GO:0005615 "extracellular space" evidence=ISS] [GO:0005764
            "lysosome" evidence=ISS] [GO:0005829 "cytosol" evidence=ISS]
            [GO:0006508 "proteolysis" evidence=ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=ISS] [GO:0008284 "positive regulation
            of cell proliferation" evidence=ISS] [GO:0010628 "positive
            regulation of gene expression" evidence=ISS] [GO:0010634 "positive
            regulation of epithelial cell migration" evidence=ISS] [GO:0010813
            "neuropeptide catabolic process" evidence=ISS] [GO:0010815
            "bradykinin catabolic process" evidence=ISS] [GO:0010952 "positive
            regulation of peptidase activity" evidence=ISS] [GO:0016505
            "apoptotic protease activator activity" evidence=ISS] [GO:0030108
            "HLA-A specific activating MHC class I receptor activity"
            evidence=ISS] [GO:0030335 "positive regulation of cell migration"
            evidence=ISS] [GO:0031638 "zymogen activation" evidence=ISS]
            [GO:0031648 "protein destabilization" evidence=ISS] [GO:0032526
            "response to retinoic acid" evidence=ISS] [GO:0033619 "membrane
            protein proteolysis" evidence=ISS] [GO:0043066 "negative regulation
            of apoptotic process" evidence=ISS] [GO:0043129 "surfactant
            homeostasis" evidence=ISS] [GO:0045766 "positive regulation of
            angiogenesis" evidence=ISS] [GO:0060448 "dichotomous subdivision of
            terminal units involved in lung branching" evidence=ISS]
            [GO:0070324 "thyroid hormone binding" evidence=ISS] [GO:0070371
            "ERK1 and ERK2 cascade" evidence=ISS] [GO:0097208 "alveolar
            lamellar body" evidence=ISS] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005829 GO:GO:0043066 GO:GO:0005615 GO:GO:0008284
            GO:GO:0070371 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766
            GO:GO:0004177 GO:GO:0004252 GO:GO:0005764 GO:GO:0097208
            GO:GO:0032526 GO:GO:0010628 GO:GO:0070324 GO:GO:0016505
            GO:GO:0001656 GO:GO:0010634 GO:GO:0004197 GO:GO:0031648
            GO:GO:0097067 GO:GO:0031638 GO:GO:0001913
            GeneTree:ENSGT00660000095458 OMA:STSCHKT GO:GO:0030108
            GO:GO:0010815 GO:GO:0060448 GO:GO:0002764 GO:GO:0033619
            GO:GO:0010813 GO:GO:0043129 RefSeq:XP_003413898.1
            Ensembl:ENSLAFT00000003415 GeneID:100662496 Uniprot:G3SSC1
        Length = 335

 Score = 295 (108.9 bits), Expect = 9.4e-31, Sum P(2) = 9.4e-31
 Identities = 75/204 (36%), Positives = 94/204 (46%)

Query:   288 NSLRHGDDLPEAFDWRAEG-VISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQ 346
             N LR     P   DWR +G  +S VK QG C  CW FS  G +E+  AI G  L  L+ Q
Sbjct:   108 NYLRGTGPYPPFVDWRKKGHFVSPVKNQGACGSCWTFSTTGALESAIAIAGGKLLSLAEQ 167

Query:   347 QLVDC--DMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXX 404
             QLVDC  D +N GC GG    A +YI+ N G++ +  YPYK  +    C           
Sbjct:   168 QLVDCAKDFNNHGCQGGLPSQAFEYILYNKGIMGEDTYPYKGQDDV--C-KFQPKKAIAF 224

Query:   405 XXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLFY-YSGGVID----------LNQRL- 452
                 + I   +EE M + VA   P+S        F  YS G+            +N  + 
Sbjct:   225 VKDVANITLNDEEAMVEAVALYNPVSFAFEVTDDFMKYSKGIYSSTSCHKTPDKVNHAVL 284

Query:   453 ---YGTS--IPYWIVKNSWGSDWG 471
                YG    IPYWIVKNSWG  WG
Sbjct:   285 AVGYGEEKGIPYWIVKNSWGPYWG 308

 Score = 246 (91.7 bits), Expect = 2.7e-36, Sum P(3) = 2.7e-36
 Identities = 59/157 (37%), Positives = 76/157 (48%)

Query:   497 KLSRLATEKLVDC--DMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXX 554
             KL  LA ++LVDC  D +N GC GG    A +YI+ N G++ +  YPYK  +    C   
Sbjct:   160 KLLSLAEQQLVDCAKDFNNHGCQGGLPSQAFEYILYNKGIMGEDTYPYKGQDDV--C-KF 216

Query:   555 XXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLFY-YSGGVIDLNQRLCNP 613
                         + I   +EE M + VA   P+S        F  YS G+         P
Sbjct:   217 QPKKAIAFVKDVANITLNDEEAMVEAVALYNPVSFAFEVTDDFMKYSKGIYSSTSCHKTP 276

Query:   614 KAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWG 650
                NHA++ VGYGEE+      IPYWIVKNSWG  WG
Sbjct:   277 DKVNHAVLAVGYGEEK-----GIPYWIVKNSWGPYWG 308

 Score = 123 (48.4 bits), Expect = 2.7e-36, Sum P(3) = 2.7e-36
 Identities = 26/60 (43%), Positives = 31/60 (51%)

Query:   140 NSLRHGDDLPEAFDWRAEG-VISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
             N LR     P   DWR +G  +S VK QG C  CW FS  G +E+  AI G  L  L+ Q
Sbjct:   108 NYLRGTGPYPPFVDWRKKGHFVSPVKNQGACGSCWTFSTTGALESAIAIAGGKLLSLAEQ 167

 Score = 75 (31.5 bits), Expect = 2.7e-36, Sum P(3) = 2.7e-36
 Identities = 18/60 (30%), Positives = 34/60 (56%)

Query:    45 FLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAEDYQREDSGTAVFEVNKFFDLSDSDLQQ 104
             F ++M  H K YSS E+  +R + FV+N  K   +   +  T    +N+F D++ ++++Q
Sbjct:    35 FQSWMAQHQKKYSS-EEYHQRQQTFVSNWRKINAHNARNH-TFKMALNQFSDMTFAEIKQ 92


>UNIPROTKB|F1NEC8 [details] [associations]
            symbol:CTSL2 "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            EMBL:AADN02067812 IPI:IPI00820956 Ensembl:ENSGALT00000037988
            ArrayExpress:F1NEC8 Uniprot:F1NEC8
        Length = 218

 Score = 361 (132.1 bits), Expect = 1.7e-32, P = 1.7e-32
 Identities = 79/195 (40%), Positives = 105/195 (53%)

Query:   297 PEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSNG 356
             P + DWR +G ++ VK+QG+C  CWAFS  G +E  H  +   L  LS Q LVDC    G
Sbjct:     2 PRSVDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQHFRKTGKLVSLSEQNLVDCSRPEG 61

Query:   357 --GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYG 414
               GCNGG MD A QY+ DNGG+ S+++YPY A + E  C              +  IP G
Sbjct:    62 NQGCNGGLMDQAFQYVQDNGGIDSEESYPYTAKDDE-DC-RYKAEYNAANDTGFVDIPQG 119

Query:   415 EEEEMKKWVATRGPLSVGMNA-NGLF-YYSGGVI--------DLNQRL----YG--TSIP 458
              E  + K VA+ GP+SV ++A +  F +Y  G+         DL+  +    YG      
Sbjct:   120 HERALMKAVASVGPVSVAIDAGHSSFQFYQSGIYYEPDCSSEDLDHGVLVVGYGFEDGKK 179

Query:   459 YWIVKNSWGSDWGEK 473
             YWIVKNSWG  WG+K
Sbjct:   180 YWIVKNSWGEKWGDK 194

 Score = 289 (106.8 bits), Expect = 4.5e-36, Sum P(2) = 4.5e-36
 Identities = 64/160 (40%), Positives = 90/160 (56%)

Query:   497 KLSRLATEKLVDCDMSNG--GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXX 554
             KL  L+ + LVDC    G  GCNGG MD A QY+ DNGG+ S+++YPY A + E  C   
Sbjct:    44 KLVSLSEQNLVDCSRPEGNQGCNGGLMDQAFQYVQDNGGIDSEESYPYTAKDDE-DC-RY 101

Query:   555 XXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNA-NGLF-YYSGGVIDLNQRLCN 612
                        +  IP G E  + K VA+ GP+SV ++A +  F +Y  G+    +  C+
Sbjct:   102 KAEYNAANDTGFVDIPQGHERALMKAVASVGPVSVAIDAGHSSFQFYQSGIY--YEPDCS 159

Query:   613 PKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
              +  +H +++VGYG E+ K      YWIVKNSWG  WG+K
Sbjct:   160 SEDLDHGVLVVGYGFEDGKK-----YWIVKNSWGEKWGDK 194

 Score = 132 (51.5 bits), Expect = 4.5e-36, Sum P(2) = 4.5e-36
 Identities = 22/51 (43%), Positives = 31/51 (60%)

Query:   149 PEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQH 199
             P + DWR +G ++ VK+QG+C  CWAFS  G +E  H  +   L  LS Q+
Sbjct:     2 PRSVDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQHFRKTGKLVSLSEQN 52


>UNIPROTKB|F1SS93 [details] [associations]
            symbol:CTSS "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0097067 "cellular response to thyroid hormone stimulus"
            evidence=IEA] [GO:0016020 "membrane" evidence=IEA] [GO:0006508
            "proteolysis" evidence=IEA] [GO:0005764 "lysosome" evidence=IEA]
            [GO:0002250 "adaptive immune response" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0016020 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 GO:GO:0002250
            GeneTree:ENSGT00560000076577 GO:GO:0097067 EMBL:CU463875
            Ensembl:ENSSSCT00000007284 OMA:CEIESAV Uniprot:F1SS93
        Length = 342

 Score = 364 (133.2 bits), Expect = 8.0e-33, P = 8.0e-33
 Identities = 78/195 (40%), Positives = 112/195 (57%)

Query:   296 LPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMS- 354
             LP++ DWR +G +++VK QG C  CWAFSAVG +EA   ++   L  LS Q LVDC    
Sbjct:   126 LPDSMDWREKGCVTEVKYQGSCGSCWAFSAVGALEAQVKMKTGRLVSLSAQNLVDCSTEK 185

Query:   355 --NGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIP 412
               N GCNGG M +A QYIIDN G+ S+ +YPYKA + +  C              Y+ +P
Sbjct:   186 YRNKGCNGGFMTEAFQYIIDNNGIDSEASYPYKAVDGK--C-KYDSKNRAATCSRYTELP 242

Query:   413 YGEEEEMKKWVATRGPLSVGMNA--NGLFYYSGGVI-------DLNQRL----YGT--SI 457
             + +E  +K+ VA +GP+SV ++A  +  F+Y  GV        ++N  +    YG     
Sbjct:   243 FADEYALKEAVANKGPVSVAIDAKHSSFFFYRSGVYYDPSCTQNVNHGVLVVGYGNLNGK 302

Query:   458 PYWIVKNSWGSDWGE 472
              YW+VKNSWG ++G+
Sbjct:   303 DYWLVKNSWGLNFGD 317

 Score = 277 (102.6 bits), Expect = 5.2e-36, Sum P(2) = 5.2e-36
 Identities = 61/160 (38%), Positives = 89/160 (55%)

Query:   497 KLSRLATEKLVDCDMS---NGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLX 553
             +L  L+ + LVDC      N GCNGG M +A QYIIDN G+ S+ +YPYKA + +  C  
Sbjct:   169 RLVSLSAQNLVDCSTEKYRNKGCNGGFMTEAFQYIIDNNGIDSEASYPYKAVDGK--C-K 225

Query:   554 XXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNA--NGLFYYSGGVIDLNQRLC 611
                         Y+ +P+ +E  +K+ VA +GP+SV ++A  +  F+Y  GV       C
Sbjct:   226 YDSKNRAATCSRYTELPFADEYALKEAVANKGPVSVAIDAKHSSFFFYRSGVY--YDPSC 283

Query:   612 NPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGE 651
                  NH +++VGYG    KD     YW+VKNSWG ++G+
Sbjct:   284 TQNV-NHGVLVVGYGNLNGKD-----YWLVKNSWGLNFGD 317

 Score = 144 (55.7 bits), Expect = 5.2e-36, Sum P(2) = 5.2e-36
 Identities = 25/52 (48%), Positives = 35/52 (67%)

Query:   148 LPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQH 199
             LP++ DWR +G +++VK QG C  CWAFSAVG +EA   ++   L  LS Q+
Sbjct:   126 LPDSMDWREKGCVTEVKYQGSCGSCWAFSAVGALEAQVKMKTGRLVSLSAQN 177


>MGI|MGI:88564 [details] [associations]
            symbol:Ctsl "cathepsin L" species:10090 "Mus musculus"
            [GO:0004177 "aminopeptidase activity" evidence=ISO] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=ISO] [GO:0005515
            "protein binding" evidence=IPI] [GO:0005730 "nucleolus"
            evidence=NAS] [GO:0005737 "cytoplasm" evidence=ISO] [GO:0005764
            "lysosome" evidence=ISO] [GO:0005773 "vacuole" evidence=ISO]
            [GO:0005902 "microvillus" evidence=ISO] [GO:0006508 "proteolysis"
            evidence=ISO;IDA] [GO:0007154 "cell communication" evidence=ISO]
            [GO:0008152 "metabolic process" evidence=TAS] [GO:0008233
            "peptidase activity" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=ISO;TAS] [GO:0009897 "external side of
            plasma membrane" evidence=ISO] [GO:0016787 "hydrolase activity"
            evidence=IEA] [GO:0030141 "secretory granule" evidence=ISO]
            [GO:0030984 "kininogen binding" evidence=ISO] [GO:0032403 "protein
            complex binding" evidence=ISO] [GO:0042277 "peptide binding"
            evidence=ISO] [GO:0042393 "histone binding" evidence=ISO;NAS]
            [GO:0043005 "neuron projection" evidence=ISO] [GO:0043204
            "perikaryon" evidence=ISO] [GO:0045177 "apical part of cell"
            evidence=ISO] [GO:0048863 "stem cell differentiation" evidence=NAS]
            [GO:0060441 "epithelial tube branching involved in lung
            morphogenesis" evidence=IMP] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            MGI:MGI:88564 GO:GO:0005730 GO:GO:0009897 GO:GO:0034698
            GO:GO:0043204 GO:GO:0009749 GO:GO:0030141 GO:GO:0048863
            GO:GO:0051384 eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045177 GO:GO:0043005
            GO:GO:0007283 GO:GO:0004177 GO:GO:0005764 GO:GO:0042277
            GO:GO:0009267 GO:GO:0021675 GO:GO:0042393 GO:GO:0005902
            GO:GO:0010259 GO:GO:0004197 GO:GO:0048102 GO:GO:0046697
            HOVERGEN:HBG011513 KO:K01365 OMA:EEFRATH OrthoDB:EOG48PMKF
            MEROPS:C01.032 BRENDA:3.4.22.15 ChiTaRS:CTSL1 EMBL:X06086
            EMBL:J02583 EMBL:M20495 EMBL:AF121837 EMBL:AF121838 EMBL:AF121839
            EMBL:BC068163 EMBL:X04392 IPI:IPI00128154 PIR:S01177
            RefSeq:NP_034114.1 UniGene:Mm.930 PDB:1MVV PDBsum:1MVV
            ProteinModelPortal:P06797 SMR:P06797 STRING:P06797
            PhosphoSite:P06797 PaxDb:P06797 PRIDE:P06797
            Ensembl:ENSMUST00000021933 GeneID:13039 KEGG:mmu:13039 CTD:13039
            InParanoid:P06797 BioCyc:MetaCyc:MONOMER-14812 BindingDB:P06797
            ChEMBL:CHEMBL5291 NextBio:282928 Bgee:P06797 CleanEx:MM_CTSL
            Genevestigator:P06797 GermOnline:ENSMUSG00000021477 GO:GO:0060008
            Uniprot:P06797
        Length = 334

 Score = 304 (112.1 bits), Expect = 2.9e-26, P = 2.9e-26
 Identities = 64/154 (41%), Positives = 91/154 (59%)

Query:   296 LPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSN 355
             +P++ DWR +G ++ VK QG+C  CWAFSA G +E    ++   L  LS Q LVDC  + 
Sbjct:   114 IPKSVDWREKGCVTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQ 173

Query:   356 G--GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPY 413
             G  GCNGG MD A QYI +NGG+ S+++YPY+A +   G               +  IP 
Sbjct:   174 GNQGCNGGLMDFAFQYIKENGGLDSEESYPYEAKD---GSCKYRAEFAVANDTGFVDIPQ 230

Query:   414 GEEEEMKKWVATRGPLSVGMNAN--GLFYYSGGV 445
              +E+ + K VAT GP+SV M+A+   L +YS G+
Sbjct:   231 -QEKALMKAVATVGPISVAMDASHPSLQFYSSGI 263

 Score = 291 (107.5 bits), Expect = 7.2e-36, Sum P(2) = 7.2e-36
 Identities = 65/158 (41%), Positives = 92/158 (58%)

Query:   497 KLSRLATEKLVDCDMSNG--GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXX 554
             KL  L+ + LVDC  + G  GCNGG MD A QYI +NGG+ S+++YPY+A +   G    
Sbjct:   157 KLISLSEQNLVDCSHAQGNQGCNGGLMDFAFQYIKENGGLDSEESYPYEAKD---GSCKY 213

Query:   555 XXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNAN--GLFYYSGGVIDLNQRLCN 612
                        +  IP  +E+ + K VAT GP+SV M+A+   L +YS G+    +  C+
Sbjct:   214 RAEFAVANDTGFVDIPQ-QEKALMKAVATVGPISVAMDASHPSLQFYSSGIY--YEPNCS 270

Query:   613 PKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWG 650
              K  +H +++VGYG E   D     YW+VKNSWGS+WG
Sbjct:   271 SKNLDHGVLLVGYGYEGT-DSNKNKYWLVKNSWGSEWG 307

 Score = 133 (51.9 bits), Expect = 1.7e-14, Sum P(2) = 1.7e-14
 Identities = 40/134 (29%), Positives = 59/134 (44%)

Query:   355 NGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESE---RGCLXXXXXXXXXXXXXYSRI 411
             N GCNGG MD A QYI +NGG+ S+++YPY+A +     R                  + 
Sbjct:   175 NQGCNGGLMDFAFQYIKENGGLDSEESYPYEAKDGSCKYRAEFAVANDTGFVDIPQQEKA 234

Query:   412 ---PYGEEEEMKKWVATRGPLSVGMNANGLFYYSG--------GVIDLNQRLYGTSI--- 457
                       +   +    P S+   ++G++Y           GV+ +     GT     
Sbjct:   235 LMKAVATVGPISVAMDASHP-SLQFYSSGIYYEPNCSSKNLDHGVLLVGYGYEGTDSNKN 293

Query:   458 PYWIVKNSWGSDWG 471
              YW+VKNSWGS+WG
Sbjct:   294 KYWLVKNSWGSEWG 307

 Score = 128 (50.1 bits), Expect = 7.2e-36, Sum P(2) = 7.2e-36
 Identities = 22/52 (42%), Positives = 33/52 (63%)

Query:   148 LPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQH 199
             +P++ DWR +G ++ VK QG+C  CWAFSA G +E    ++   L  LS Q+
Sbjct:   114 IPKSVDWREKGCVTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQN 165


>DICTYBASE|DDB_G0291191 [details] [associations]
            symbol:DDB_G0291191 "cysteine protease" species:44689
            "Dictyostelium discoideum" [GO:0005615 "extracellular space"
            evidence=IDA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0044351 "macropinocytosis" evidence=RCA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 dictyBase:DDB_G0291191
            GO:GO:0005615 eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 EMBL:AAFI02000175 MEROPS:C01.022
            ProtClustDB:CLSZ2429603 RefSeq:XP_635374.1
            ProteinModelPortal:Q54F16 PRIDE:Q54F16 EnsemblProtists:DDB0252831
            GeneID:8628022 KEGG:ddi:DDB_G0291191 OMA:NETQIAS Uniprot:Q54F16
        Length = 352

 Score = 293 (108.2 bits), Expect = 4.6e-25, P = 4.6e-25
 Identities = 86/270 (31%), Positives = 130/270 (48%)

Query:   200 HDKVYSSVEDLLRRHENFVTNVEKAE--DYQSEDSGTAV-FGVNKFFDLSESDLQQLTGL 256
             ++K+YS+ E+ L + E F +N+   +  + Q+   G+   FGVNKF DLS+ + ++    
Sbjct:    34 YNKIYSA-EEYLVKFETFKSNLLNIDALNKQATTIGSDTKFGVNKFADLSKEEFKKYYLS 92

Query:   257 NLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISK------ 310
             + ++ L D  P L  P   N +D  + A            P AFDWR  G  +K      
Sbjct:    93 SKEARLTDDLPML--P---NLSDDIISA-----------TPAAFDWRNTGGSTKFPQGTP 136

Query:   311 ---VKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCD----------MSNGG 357
                VK QG+C  CW+FS  G VE  H +   +L  LS Q LVDCD          + N G
Sbjct:   137 VTAVKNQGQCGSCWSFSTTGNVEGQHYLSTGTLVGLSEQNLVDCDHTCMTYENENVCNAG 196

Query:   358 CNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYGEEE 417
             C+GG   +A  YII NGG+ ++  YPY A + E  C              ++ +P  E +
Sbjct:   197 CDGGLQPNAYNYIIKNGGIQTEATYPYTAVDGE--C-KFNSAQVGAKISSFTMVPQNETQ 253

Query:   418 EMKKWVATRGPLSVGMNANGLFYYSGGVID 447
              +  ++   GPL++  +A    +Y GGV D
Sbjct:   254 -IASYLFNNGPLAIAADAEEWQFYMGGVFD 282

 Score = 265 (98.3 bits), Expect = 9.4e-36, Sum P(2) = 9.4e-36
 Identities = 56/161 (34%), Positives = 85/161 (52%)

Query:   501 LATEKLVDCD----------MSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERG 550
             L+ + LVDCD          + N GC+GG   +A  YII NGG+ ++  YPY A + E  
Sbjct:   172 LSEQNLVDCDHTCMTYENENVCNAGCDGGLQPNAYNYIIKNGGIQTEATYPYTAVDGE-- 229

Query:   551 CLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLFYYSGGVIDLNQRL 610
             C              ++ +P  E + +  ++   GPL++  +A    +Y GGV D     
Sbjct:   230 C-KFNSAQVGAKISSFTMVPQNETQ-IASYLFNNGPLAIAADAEEWQFYMGGVFDFP--- 284

Query:   611 CNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGE 651
             C  +  +H ++IVGYG ++   G + PYWI+KNSWG+DWGE
Sbjct:   285 CG-QTLDHGILIVGYGAQDTIVGKNTPYWIIKNSWGADWGE 324

 Score = 155 (59.6 bits), Expect = 1.3e-19, Sum P(2) = 1.3e-19
 Identities = 46/154 (29%), Positives = 68/154 (44%)

Query:   352 DMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRI 411
             ++ N GC+GG   +A  YII NGG+ ++  YPY A + E  C              ++ +
Sbjct:   191 NVCNAGCDGGLQPNAYNYIIKNGGIQTEATYPYTAVDGE--C-KFNSAQVGAKISSFTMV 247

Query:   412 PYGEEE---------------EMKKWVATRGPL---SVGMNAN-GLFYYSGGVIDLNQRL 452
             P  E +               + ++W    G +     G   + G+     G  D    +
Sbjct:   248 PQNETQIASYLFNNGPLAIAADAEEWQFYMGGVFDFPCGQTLDHGILIVGYGAQDT---I 304

Query:   453 YGTSIPYWIVKNSWGSDWGE----KVE---DKVG 479
              G + PYWI+KNSWG+DWGE    KVE   DK G
Sbjct:   305 VGKNTPYWIIKNSWGADWGEAGYLKVERNTDKCG 338

 Score = 154 (59.3 bits), Expect = 9.4e-36, Sum P(2) = 9.4e-36
 Identities = 52/169 (30%), Positives = 81/169 (47%)

Query:    43 TRFLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAEDYQREDS--GTAV-FEVNKFFDLSD 99
             ++F+ F   ++K+YS+ E+ L + E F +N+   +   ++ +  G+   F VNKF DLS 
Sbjct:    25 SQFIAFQNKYNKIYSA-EEYLVKFETFKSNLLNIDALNKQATTIGSDTKFGVNKFADLSK 83

Query:   100 SDLQQLTGLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGV 159
              + ++    + ++ L D  P L  P   N +D  + A            P AFDWR  G 
Sbjct:    84 EEFKKYYLSSKEARLTDDLPML--P---NLSDDIISA-----------TPAAFDWRNTGG 127

Query:   160 ISK---------VKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQH 199
              +K         VK QG+C  CW+FS  G VE  H +    L  LS Q+
Sbjct:   128 STKFPQGTPVTAVKNQGQCGSCWSFSTTGNVEGQHYLSTGTLVGLSEQN 176


>WB|WBGene00000776 [details] [associations]
            symbol:cpl-1 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0040010 "positive regulation
            of growth rate" evidence=IMP] [GO:0009792 "embryo development
            ending in birth or egg hatching" evidence=IMP] [GO:0040011
            "locomotion" evidence=IMP] [GO:0070265 "necrotic cell death"
            evidence=IMP] [GO:0031983 "vesicle lumen" evidence=IDA] [GO:0042718
            "yolk granule" evidence=IDA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0009792 GO:GO:0040010 eggNOG:COG4870 HOGENOM:HOG000230774
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0040011
            GO:GO:0031983 GO:GO:0070265 GeneTree:ENSGT00660000095458 KO:K01365
            GO:GO:0042718 MEROPS:I29.009 EMBL:Z92812 GeneID:180111
            KEGG:cel:CELE_T03E6.7 CTD:180111 PIR:T24387 RefSeq:NP_001256718.1
            HSSP:P80067 ProteinModelPortal:O45734 SMR:O45734 DIP:DIP-26616N
            IntAct:O45734 MINT:MINT-211563 STRING:O45734 PaxDb:O45734
            EnsemblMetazoa:T03E6.7.1 EnsemblMetazoa:T03E6.7.2 UCSC:T03E6.7.1
            WormBase:T03E6.7a InParanoid:O45734 OMA:HIENHNR NextBio:908128
            Uniprot:O45734
        Length = 337

 Score = 354 (129.7 bits), Expect = 1.0e-31, P = 1.0e-31
 Identities = 82/207 (39%), Positives = 111/207 (53%)

Query:   296 LPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSN 355
             +P+  DWR   +++ VK QG C  CWAFSA G +E  HA +   L  LS Q LVDC    
Sbjct:   120 VPDEVDWRDTHLVTDVKNQGMCGSCWAFSATGALEGQHARKLGQLVSLSEQNLVDCSTKY 179

Query:   356 G--GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPY 413
             G  GCNGG MD A +YI DN GV ++++YPYK  + +  C              Y   P 
Sbjct:   180 GNHGCNGGLMDQAFEYIRDNHGVDTEESYPYKGRDMK--C-HFNKKTVGADDKGYVDTPE 236

Query:   414 GEEEEMKKWVATRGPLSVGMNAN---------GLFY---YSGGVIDLNQRL--YGTSIP- 458
             G+EE++K  VAT+GP+S+ ++A          G++Y    S   +D    L  YGT    
Sbjct:   237 GDEEQLKIAVATQGPISIAIDAGHRSFQLYKKGVYYDEECSSEELDHGVLLVGYGTDPEH 296

Query:   459 --YWIVKNSWGSDWGEKVEDKVGSSGN 483
               YWIVKNSWG+ WGEK   ++  + N
Sbjct:   297 GDYWIVKNSWGAGWGEKGYIRIARNRN 323

 Score = 273 (101.2 bits), Expect = 1.1e-35, Sum P(2) = 1.1e-35
 Identities = 61/160 (38%), Positives = 90/160 (56%)

Query:   497 KLSRLATEKLVDCDMSNG--GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXX 554
             +L  L+ + LVDC    G  GCNGG MD A +YI DN GV ++++YPYK  + +  C   
Sbjct:   163 QLVSLSEQNLVDCSTKYGNHGCNGGLMDQAFEYIRDNHGVDTEESYPYKGRDMK--C-HF 219

Query:   555 XXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLFY--YSGGVIDLNQRLCN 612
                        Y   P G+EE++K  VAT+GP+S+ ++A    +  Y  GV    +  C+
Sbjct:   220 NKKTVGADDKGYVDTPEGDEEQLKIAVATQGPISIAIDAGHRSFQLYKKGVYYDEE--CS 277

Query:   613 PKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
              +  +H +++VGYG + +       YWIVKNSWG+ WGEK
Sbjct:   278 SEELDHGVLLVGYGTDPEHGD----YWIVKNSWGAGWGEK 313

 Score = 145 (56.1 bits), Expect = 1.1e-35, Sum P(2) = 1.1e-35
 Identities = 48/177 (27%), Positives = 76/177 (42%)

Query:    25 ALLESNIFQTRGYLNSPVTRFLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAEDYQREDS 84
             A++  N  +    + S + ++ ++  D DK YS  E+     E FV N+   E++ R+  
Sbjct:    12 AVVAVNSAKLSRQIESAIEKWDDYKEDFDKEYSESEEQTYM-EAFVKNMIHIENHNRDHR 70

Query:    85 -GTAVFEVNKFFDLSDSDLQQLTGLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQ-FNSL 142
              G   FE+     ++D    Q   LN            +  F  ++          FN  
Sbjct:    71 LGRKTFEMG-LNHIADLPFSQYRKLN----------GYRRLFGDSRIKNSSSFLAPFNV- 118

Query:   143 RHGDDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQH 199
                  +P+  DWR   +++ VK QG C  CWAFSA G +E  HA +   L  LS Q+
Sbjct:   119 ----QVPDEVDWRDTHLVTDVKNQGMCGSCWAFSATGALEGQHARKLGQLVSLSEQN 171


>RGD|1309354 [details] [associations]
            symbol:Ctsw "cathepsin W" species:10116 "Rattus norvegicus"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            RGD:1309354 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00639 GeneTree:ENSGT00660000095458 MEROPS:C01.037
            CTD:1521 eggNOG:NOG288820 HOVERGEN:HBG100117 KO:K08569 OMA:GRCGDGC
            OrthoDB:EOG4Q2DG3 EMBL:CH473953 EMBL:BC093401 IPI:IPI00371471
            RefSeq:NP_001019413.1 UniGene:Rn.34406 Ensembl:ENSRNOT00000037404
            GeneID:293676 KEGG:rno:293676 UCSC:RGD:1309354 InParanoid:Q561Q9
            NextBio:636716 Genevestigator:Q561Q9 Uniprot:Q561Q9
        Length = 371

 Score = 320 (117.7 bits), Expect = 1.3e-35, Sum P(2) = 1.3e-35
 Identities = 78/247 (31%), Positives = 124/247 (50%)

Query:   201 DKVYSSVEDLLRRHENFVTNVEKAEDYQSEDSGTAVFGVNKFFDLSESDLQQLTGLNLDS 260
             ++ YS+  +  RR   F  N+ +A+  Q ED GTA FG   F DL+E +  QL G     
Sbjct:    48 NRSYSNPAEYTRRLGIFAHNLAQAQRLQEEDLGTAEFGQTPFSDLTEEEFGQLYGHQ--- 104

Query:   261 TLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWR-AEGVISKVKEQGKCAC 319
                      +AP    +    M A +  S R G+ +P   DWR  + +IS +K QG C C
Sbjct:   105 ---------RAP----ERILNM-AKKVKSERWGESVPPTCDWRKVKNIISSIKNQGNCRC 150

Query:   320 CWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSD 379
             CWA +A   ++ +  I+     ++SVQ+L+DCD    GCNGG + DA   +++N G+ S+
Sbjct:   151 CWAIAAADNIQTLWRIKTQQFVDVSVQELLDCDRCGNGCNGGFVWDAYITVLNNSGLASE 210

Query:   380 QAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLF 439
             + YP++  +    CL             ++ +    E+ +  ++A  GP++V +N   L 
Sbjct:   211 EDYPFQGHQKPHRCLADKYRKVAWIQD-FTMLS-SNEQVIAGYLAIHGPITVTINMKLLQ 268

Query:   440 YYSGGVI 446
             YY  GVI
Sbjct:   269 YYQKGVI 275

 Score = 266 (98.7 bits), Expect = 1.5e-35, Sum P(2) = 1.5e-35
 Identities = 53/164 (32%), Positives = 90/164 (54%)

Query:   501 LATEKLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXX 560
             ++ ++L+DCD    GCNGG + DA   +++N G+ S++ YP++  +    CL        
Sbjct:   174 VSVQELLDCDRCGNGCNGGFVWDAYITVLNNSGLASEEDYPFQGHQKPHRCLADKYRKVA 233

Query:   561 XXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLFYYSGGVIDLNQRLCNPKAQNHAL 620
                  ++ +    E+ +  ++A  GP++V +N   L YY  GVI      C+P   NH++
Sbjct:   234 WIQD-FTMLS-SNEQVIAGYLAIHGPITVTINMKLLQYYQKGVIKATPSTCDPHLVNHSV 291

Query:   621 IIVGYGEEE------------KKDGTSIPYWIVKNSWGSDWGEK 652
             ++VG+G+E+            +K   S PYWI+KNSWG++WGEK
Sbjct:   292 LLVGFGKEKGGMQTGTLLSHSRKPRRSTPYWILKNSWGAEWGEK 335

 Score = 170 (64.9 bits), Expect = 1.5e-35, Sum P(2) = 1.5e-35
 Identities = 48/155 (30%), Positives = 73/155 (47%)

Query:    45 FLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAEDYQREDSGTAVFEVNKFFDLSDSDLQQ 104
             F  F    ++ YS+  +  RR   F  N+ +A+  Q ED GTA F    F DL++ +  Q
Sbjct:    40 FKLFQIQFNRSYSNPAEYTRRLGIFAHNLAQAQRLQEEDLGTAEFGQTPFSDLTEEEFGQ 99

Query:   105 LTGLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWR-AEGVISKV 163
             L G              +AP    +    M A +  S R G+ +P   DWR  + +IS +
Sbjct:   100 LYGHQ------------RAP----ERILNM-AKKVKSERWGESVPPTCDWRKVKNIISSI 142

Query:   164 KEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
             K QG C CCWA +A   ++ +  I+     ++SVQ
Sbjct:   143 KNQGNCRCCWAIAAADNIQTLWRIKTQQFVDVSVQ 177

 Score = 95 (38.5 bits), Expect = 1.3e-35, Sum P(2) = 1.3e-35
 Identities = 14/18 (77%), Positives = 17/18 (94%)

Query:   456 SIPYWIVKNSWGSDWGEK 473
             S PYWI+KNSWG++WGEK
Sbjct:   318 STPYWILKNSWGAEWGEK 335


>TAIR|locus:2167821 [details] [associations]
            symbol:RD21B "esponsive to dehydration 21B" species:3702
            "Arabidopsis thaliana" [GO:0005576 "extracellular region"
            evidence=ISM] [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA;ISS] [GO:0005773
            "vacuole" evidence=IDA] [GO:0009651 "response to salt stress"
            evidence=IEP] [GO:0005515 "protein binding" evidence=IPI]
            [GO:0005829 "cytosol" evidence=IDA] [GO:0052541 "plant-type cell
            wall cellulose metabolic process" evidence=RCA] [GO:0052546 "cell
            wall pectin metabolic process" evidence=RCA] [GO:0005783
            "endoplasmic reticulum" evidence=IDA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0005783 GO:GO:0005829 EMBL:CP002688
            GO:GO:0005773 GO:GO:0009651 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 EMBL:AB008267 HSSP:O65039
            HOGENOM:HOG000230773 KO:K01376 InterPro:IPR000118 Pfam:PF00396
            SMART:SM00277 ProtClustDB:CLSN2688498 EMBL:AY062608 EMBL:AY114661
            IPI:IPI00520971 RefSeq:NP_568620.1 UniGene:At.24130 SMR:Q9FMH8
            IntAct:Q9FMH8 STRING:Q9FMH8 MEROPS:C01.A12
            EnsemblPlants:AT5G43060.1 GeneID:834321 KEGG:ath:AT5G43060
            TAIR:At5g43060 InParanoid:Q9FMH8 OMA:ENSEASL Genevestigator:Q9FMH8
            Uniprot:Q9FMH8
        Length = 463

 Score = 389 (142.0 bits), Expect = 1.3e-35, P = 1.3e-35
 Identities = 85/197 (43%), Positives = 112/197 (56%)

Query:   291 RHGDDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVD 350
             R GD LP++ DWR EG ++ VK+QG C  CWAFS +G VE ++ I    L  LS Q+LVD
Sbjct:   133 RVGDALPDSVDWRKEGAVADVKDQGSCGSCWAFSTIGAVEGINKIVTGDLISLSEQELVD 192

Query:   351 CDMS-NGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYS 409
             CD S N GCNGG MD A ++II NGG+ ++  YPYKA++    C              Y 
Sbjct:   193 CDTSYNQGCNGGLMDYAFEFIIKNGGIDTEADYPYKAADGR--CDQNRKNAKVVTIDSYE 250

Query:   410 RIPYGEEEEMKKWVATRGPLSVGMNANGLFY--YSGGVID------LNQRL----YGTSI 457
              +P   E  +KK +A + P+SV + A G  +  YS GV D      L+  +    YGT  
Sbjct:   251 DVPENSEASLKKALAHQ-PISVAIEAGGRAFQLYSSGVFDGLCGTELDHGVVAVGYGTEN 309

Query:   458 --PYWIVKNSWGSDWGE 472
                YWIV+NSWG+ WGE
Sbjct:   310 GKDYWIVRNSWGNRWGE 326

 Score = 286 (105.7 bits), Expect = 4.5e-35, Sum P(2) = 4.5e-35
 Identities = 65/162 (40%), Positives = 90/162 (55%)

Query:   493 VLPSKLSRLATEKLVDCDMS-NGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGC 551
             ++   L  L+ ++LVDCD S N GCNGG MD A ++II NGG+ ++  YPYKA++    C
Sbjct:   177 IVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAFEFIIKNGGIDTEADYPYKAADGR--C 234

Query:   552 LXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLFY--YSGGVIDLNQR 609
                           Y  +P   E  +KK +A + P+SV + A G  +  YS GV D    
Sbjct:   235 DQNRKNAKVVTIDSYEDVPENSEASLKKALAHQ-PISVAIEAGGRAFQLYSSGVFD---G 290

Query:   610 LCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGE 651
             LC  +  +H ++ VGYG E  KD     YWIV+NSWG+ WGE
Sbjct:   291 LCGTEL-DHGVVAVGYGTENGKD-----YWIVRNSWGNRWGE 326

 Score = 161 (61.7 bits), Expect = 4.5e-35, Sum P(2) = 4.5e-35
 Identities = 41/121 (33%), Positives = 63/121 (52%)

Query:    82 EDSGTAVFEVN-KFFDLSDS-DLQQLTGLN--LDSTLEDIQPSLQAPFSSNQTDTEMRAF 137
             +D    +F+ N +F D  ++ +L    GL    D T E+ + S+     +  T   ++  
Sbjct:    71 KDQRFEIFKDNLRFIDEHNTKNLSYKLGLTRFADLTNEEYR-SMY--LGAKPTKRVLKTS 127

Query:   138 QFNSLRHGDDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSV 197
                  R GD LP++ DWR EG ++ VK+QG C  CWAFS +G VE ++ I   +L  LS 
Sbjct:   128 DRYQARVGDALPDSVDWRKEGAVADVKDQGSCGSCWAFSTIGAVEGINKIVTGDLISLSE 187

Query:   198 Q 198
             Q
Sbjct:   188 Q 188


>TAIR|locus:2090629 [details] [associations]
            symbol:AT3G19400 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] [GO:0005829 "cytosol"
            evidence=IDA] [GO:0019344 "cysteine biosynthetic process"
            evidence=RCA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 GO:GO:0005829
            EMBL:CP002686 GenomeReviews:BA000014_GR eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 EMBL:AB025624
            MEROPS:I29.003 HOGENOM:HOG000230773 EMBL:AK118509 IPI:IPI00543468
            RefSeq:NP_566634.2 UniGene:At.38409 ProteinModelPortal:Q9LT77
            SMR:Q9LT77 PaxDb:Q9LT77 PRIDE:Q9LT77 EnsemblPlants:AT3G19400.1
            GeneID:821474 KEGG:ath:AT3G19400 TAIR:At3g19400 InParanoid:Q9LT77
            OMA:IGEHERR ProtClustDB:CLSN2679975 Genevestigator:Q9LT77
            Uniprot:Q9LT77
        Length = 362

 Score = 358 (131.1 bits), Expect = 1.5e-35, Sum P(2) = 1.5e-35
 Identities = 78/198 (39%), Positives = 111/198 (56%)

Query:   291 RHGDDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVD 350
             + GD LP+  DWRA G +  VK+QG C  CWAFSAVG VE ++ I    L  LS Q+LVD
Sbjct:   125 KEGDVLPDEVDWRANGAVVSVKDQGNCGSCWAFSAVGAVEGINQITTGELISLSEQELVD 184

Query:   351 CDMS--NGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXY 408
             CD    N GC+GG M+ A ++I+ NGG+ +DQ YPY A++                   Y
Sbjct:   185 CDRGFVNAGCDGGIMNYAFEFIMKNGGIETDQDYPYNANDLGLCNADKNNNTRVVTIDGY 244

Query:   409 SRIPYGEEEEMKKWVATRGPLSVGMNANGLFY--YSGGV------IDLNQRL----YGTS 456
               +P  +E+ +KK VA + P+SV + A+   +  Y  GV      I L+  +    YG++
Sbjct:   245 EDVPRDDEKSLKKAVAHQ-PVSVAIEASSQAFQLYKSGVMTGTCGISLDHGVVVVGYGST 303

Query:   457 I--PYWIVKNSWGSDWGE 472
                 YWI++NSWG +WG+
Sbjct:   304 SGEDYWIIRNSWGLNWGD 321

 Score = 239 (89.2 bits), Expect = 2.0e-30, Sum P(2) = 2.0e-30
 Identities = 54/163 (33%), Positives = 87/163 (53%)

Query:   493 VLPSKLSRLATEKLVDCDMS--NGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERG 550
             +   +L  L+ ++LVDCD    N GC+GG M+ A ++I+ NGG+ +DQ YPY A++    
Sbjct:   169 ITTGELISLSEQELVDCDRGFVNAGCDGGIMNYAFEFIMKNGGIETDQDYPYNANDLGLC 228

Query:   551 CLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLFY--YSGGVIDLNQ 608
                            Y  +P  +E+ +KK VA + P+SV + A+   +  Y  GV+    
Sbjct:   229 NADKNNNTRVVTIDGYEDVPRDDEKSLKKAVAHQ-PVSVAIEASSQAFQLYKSGVMT--- 284

Query:   609 RLCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGE 651
               C     +H +++VGYG    +D     YWI++NSWG +WG+
Sbjct:   285 GTCGISL-DHGVVVVGYGSTSGED-----YWIIRNSWGLNWGD 321

 Score = 160 (61.4 bits), Expect = 2.0e-30, Sum P(2) = 2.0e-30
 Identities = 45/134 (33%), Positives = 67/134 (50%)

Query:    73 VEKAEDYQ---REDSGTAVFEVN-KFFDLSDS--DLQQLTGLN--LDSTLEDIQPSLQAP 124
             VE  ++Y     ++    +F+ N KF D  +S  D     GL    D T E+ + ++   
Sbjct:    49 VENRKNYNGLGEKERRFKIFKDNLKFVDEHNSVPDRTFEVGLTRFADLTNEEFR-AIYLR 107

Query:   125 FSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAM 184
                 +T   ++  ++   + GD LP+  DWRA G +  VK+QG C  CWAFSAVG VE +
Sbjct:   108 KKMERTKDSVKTERY-LYKEGDVLPDEVDWRANGAVVSVKDQGNCGSCWAFSAVGAVEGI 166

Query:   185 HAIQGNNLTELSVQ 198
             + I    L  LS Q
Sbjct:   167 NQITTGELISLSEQ 180

 Score = 54 (24.1 bits), Expect = 1.5e-35, Sum P(2) = 1.5e-35
 Identities = 12/52 (23%), Positives = 27/52 (51%)

Query:   202 KVYSSVEDLLRRHENFVTNVEKAEDYQSEDSGTAVFGVNKFFDLSESDLQQL 253
             K Y+ + +  RR + F  N++  +++ S    T   G+ +F DL+  + + +
Sbjct:    53 KNYNGLGEKERRFKIFKDNLKFVDEHNSVPDRTFEVGLTRFADLTNEEFRAI 104

 Score = 47 (21.6 bits), Expect = 7.9e-35, Sum P(2) = 7.9e-35
 Identities = 17/90 (18%), Positives = 43/90 (47%)

Query:    21 MIKVALLESNI---FQTRGYLNSPVTRFL--NFMRDHDKVYSSVEDLLRRHENFVTNVEK 75
             ++ V LL S++    +T    N    R +   ++ ++ K Y+ + +  RR + F  N++ 
Sbjct:    15 ILSVLLLSSSLGVATETEIERNETEVRLMYEQWLVENRKNYNGLGEKERRFKIFKDNLKF 74

Query:    76 AEDYQREDSGTAVFEVNKFFDLSDSDLQQL 105
              +++      T    + +F DL++ + + +
Sbjct:    75 VDEHNSVPDRTFEVGLTRFADLTNEEFRAI 104


>UNIPROTKB|P09648 [details] [associations]
            symbol:CTSL1 "Cathepsin L1" species:9031 "Gallus gallus"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0005764 "lysosome"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 eggNOG:COG4870
            HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 HOVERGEN:HBG011513
            OrthoDB:EOG48PMKF MEROPS:C01.032 IPI:IPI00602255 PIR:S00081
            UniGene:Gga.523 ProteinModelPortal:P09648 SMR:P09648 Uniprot:P09648
        Length = 218

 Score = 358 (131.1 bits), Expect = 3.7e-32, P = 3.7e-32
 Identities = 79/195 (40%), Positives = 104/195 (53%)

Query:   297 PEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSNG 356
             P + DWR +G ++ VK+QG+C  CWAFS  G +E  H      L  LS Q LVDC    G
Sbjct:     2 PRSVDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQHFRTKGKLVSLSEQNLVDCSRPEG 61

Query:   357 --GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYG 414
               GCNGG MD A QY+ DNGG+ S+++YPY A + E  C              +  IP G
Sbjct:    62 NQGCNGGLMDQAFQYVQDNGGIDSEESYPYTAKDDE-DC-RYKAEYNAANDTGFVDIPQG 119

Query:   415 EEEEMKKWVATRGPLSVGMNA-NGLF-YYSGGVI--------DLNQRL----YGTS--IP 458
              E  + K VA+ GP+SV ++A +  F +Y  G+         DL+  +    YG      
Sbjct:   120 HERALMKAVASVGPVSVAIDAGHSSFQFYQSGIYYEPDCSSEDLDHGVLVVGYGFEGGKK 179

Query:   459 YWIVKNSWGSDWGEK 473
             YWIVKNSWG  WG+K
Sbjct:   180 YWIVKNSWGEKWGDK 194

 Score = 285 (105.4 bits), Expect = 2.0e-35, Sum P(2) = 2.0e-35
 Identities = 64/160 (40%), Positives = 89/160 (55%)

Query:   497 KLSRLATEKLVDCDMSNG--GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXX 554
             KL  L+ + LVDC    G  GCNGG MD A QY+ DNGG+ S+++YPY A + E  C   
Sbjct:    44 KLVSLSEQNLVDCSRPEGNQGCNGGLMDQAFQYVQDNGGIDSEESYPYTAKDDE-DC-RY 101

Query:   555 XXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNA-NGLF-YYSGGVIDLNQRLCN 612
                        +  IP G E  + K VA+ GP+SV ++A +  F +Y  G+    +  C+
Sbjct:   102 KAEYNAANDTGFVDIPQGHERALMKAVASVGPVSVAIDAGHSSFQFYQSGIY--YEPDCS 159

Query:   613 PKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
              +  +H +++VGYG E  K      YWIVKNSWG  WG+K
Sbjct:   160 SEDLDHGVLVVGYGFEGGKK-----YWIVKNSWGEKWGDK 194

 Score = 130 (50.8 bits), Expect = 2.0e-35, Sum P(2) = 2.0e-35
 Identities = 22/51 (43%), Positives = 30/51 (58%)

Query:   149 PEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQH 199
             P + DWR +G ++ VK+QG+C  CWAFS  G +E  H      L  LS Q+
Sbjct:     2 PRSVDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQHFRTKGKLVSLSEQN 52


>UNIPROTKB|O60911 [details] [associations]
            symbol:CTSL2 "Cathepsin L2" species:9606 "Homo sapiens"
            [GO:0004177 "aminopeptidase activity" evidence=IEA] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=IEA] [GO:0005902
            "microvillus" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0007283 "spermatogenesis" evidence=IEA] [GO:0009267 "cellular
            response to starvation" evidence=IEA] [GO:0009749 "response to
            glucose stimulus" evidence=IEA] [GO:0009897 "external side of
            plasma membrane" evidence=IEA] [GO:0010259 "multicellular
            organismal aging" evidence=IEA] [GO:0021675 "nerve development"
            evidence=IEA] [GO:0030984 "kininogen binding" evidence=IEA]
            [GO:0032403 "protein complex binding" evidence=IEA] [GO:0034698
            "response to gonadotropin stimulus" evidence=IEA] [GO:0042277
            "peptide binding" evidence=IEA] [GO:0043005 "neuron projection"
            evidence=IEA] [GO:0043204 "perikaryon" evidence=IEA] [GO:0046697
            "decidualization" evidence=IEA] [GO:0048102 "autophagic cell death"
            evidence=IEA] [GO:0051384 "response to glucocorticoid stimulus"
            evidence=IEA] [GO:0060008 "Sertoli cell differentiation"
            evidence=IEA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=TAS] [GO:0019886 "antigen processing and presentation of
            exogenous peptide antigen via MHC class II" evidence=TAS]
            [GO:0043202 "lysosomal lumen" evidence=TAS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 Reactome:REACT_118779 Reactome:REACT_6900
            GO:GO:0009897 GO:GO:0019886 GO:GO:0034698 GO:GO:0043204
            GO:GO:0009749 GO:GO:0030141 GO:GO:0051384 eggNOG:COG4870
            HOGENOM:HOG000230774 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0045177 GO:GO:0043005 GO:GO:0007283
            GO:GO:0004177 GO:GO:0042277 GO:GO:0009267 GO:GO:0021675
            GO:GO:0043202 GO:GO:0005902 GO:GO:0010259 GO:GO:0004197
            GO:GO:0048102 GO:GO:0046697 HOVERGEN:HBG011513 CTD:1515
            OrthoDB:EOG48PMKF OMA:FDQNLDT GO:GO:0060008 EMBL:Y14734
            EMBL:AB001928 EMBL:AF070448 EMBL:AB019534 EMBL:AY358641
            EMBL:AL445670 EMBL:BC023504 EMBL:BC110512 IPI:IPI00000013
            RefSeq:NP_001188504.1 RefSeq:NP_001324.2 UniGene:Hs.610096 PDB:1FH0
            PDB:3H6S PDB:3KFQ PDBsum:1FH0 PDBsum:3H6S PDBsum:3KFQ
            ProteinModelPortal:O60911 SMR:O60911 IntAct:O60911 STRING:O60911
            MEROPS:I29.010 PhosphoSite:O60911 PaxDb:O60911 PeptideAtlas:O60911
            PRIDE:O60911 Ensembl:ENST00000259470 Ensembl:ENST00000538255
            GeneID:1515 KEGG:hsa:1515 UCSC:uc004awt.3 GeneCards:GC09M099794
            HGNC:HGNC:2538 HPA:CAB017112 MIM:603308 neXtProt:NX_O60911
            PharmGKB:PA27036 InParanoid:O60911 KO:K01375 PhylomeDB:O60911
            BRENDA:3.4.22.43 SABIO-RK:O60911 BindingDB:O60911 ChEMBL:CHEMBL3272
            ChiTaRS:CTSL2 EvolutionaryTrace:O60911 GenomeRNAi:1515 NextBio:6277
            Bgee:O60911 CleanEx:HS_CTSL2 Genevestigator:O60911
            GermOnline:ENSG00000136943 Uniprot:O60911
        Length = 334

 Score = 278 (102.9 bits), Expect = 3.1e-32, Sum P(3) = 3.1e-32
 Identities = 60/155 (38%), Positives = 86/155 (55%)

Query:   295 DLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMS 354
             DLP++ DWR +G ++ VK Q +C  CWAFSA G +E     +   L  LS Q LVDC   
Sbjct:   113 DLPKSVDWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRP 172

Query:   355 NG--GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIP 412
              G  GCNGG M  A QY+ +NGG+ S+++YPY A +    C              ++ + 
Sbjct:   173 QGNQGCNGGFMARAFQYVKENGGLDSEESYPYVAVDEI--C-KYRPENSVANDTGFTVVA 229

Query:   413 YGEEEEMKKWVATRGPLSVGMNA--NGLFYYSGGV 445
              G+E+ + K VAT GP+SV M+A  +   +Y  G+
Sbjct:   230 PGKEKALMKAVATVGPISVAMDAGHSSFQFYKSGI 264

 Score = 260 (96.6 bits), Expect = 2.4e-35, Sum P(3) = 2.4e-35
 Identities = 60/158 (37%), Positives = 88/158 (55%)

Query:   497 KLSRLATEKLVDCDMSNG--GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXX 554
             KL  L+ + LVDC    G  GCNGG M  A QY+ +NGG+ S+++YPY A +    C   
Sbjct:   157 KLVSLSEQNLVDCSRPQGNQGCNGGFMARAFQYVKENGGLDSEESYPYVAVDEI--C-KY 213

Query:   555 XXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNA-NGLF-YYSGGVIDLNQRLCN 612
                        ++ +  G+E+ + K VAT GP+SV M+A +  F +Y  G+    +  C+
Sbjct:   214 RPENSVANDTGFTVVAPGKEKALMKAVATVGPISVAMDAGHSSFQFYKSGIYF--EPDCS 271

Query:   613 PKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWG 650
              K  +H +++VGYG E      S  YW+VKNSWG +WG
Sbjct:   272 SKNLDHGVLVVGYGFEGANSNNS-KYWLVKNSWGPEWG 308

 Score = 127 (49.8 bits), Expect = 2.4e-35, Sum P(3) = 2.4e-35
 Identities = 23/53 (43%), Positives = 32/53 (60%)

Query:   147 DLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQH 199
             DLP++ DWR +G ++ VK Q +C  CWAFSA G +E     +   L  LS Q+
Sbjct:   113 DLPKSVDWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQN 165

 Score = 78 (32.5 bits), Expect = 3.1e-32, Sum P(3) = 3.1e-32
 Identities = 12/25 (48%), Positives = 15/25 (60%)

Query:   459 YWIVKNSWGSDWGEKVEDKVGSSGN 483
             YW+VKNSWG +WG     K+    N
Sbjct:   296 YWLVKNSWGPEWGSNGYVKIAKDKN 320

 Score = 47 (21.6 bits), Expect = 2.4e-35, Sum P(3) = 2.4e-35
 Identities = 17/68 (25%), Positives = 35/68 (51%)

Query:    43 TRFLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAEDYQREDS-GTAVFEV--NKFFDLSD 99
             T++  +   H ++Y + E+  RR   +  N++  E +  E S G   F +  N F D+++
Sbjct:    27 TKWYQWKATHRRLYGANEEGWRRAV-WEKNMKMIELHNGEYSQGKHGFTMAMNAFGDMTN 85

Query:   100 SDLQQLTG 107
              + +Q+ G
Sbjct:    86 EEFRQMMG 93

 Score = 42 (19.8 bits), Expect = 2.4e-10, Sum P(4) = 2.4e-10
 Identities = 13/44 (29%), Positives = 20/44 (45%)

Query:   283 RAFQFNSLRHGDDLPEAFDWRAEGVISKVKEQGKCACCWAFSAV 326
             RAFQ+     G D  E++ + A   I K + +   A    F+ V
Sbjct:   185 RAFQYVKENGGLDSEESYPYVAVDEICKYRPENSVANDTGFTVV 228


>UNIPROTKB|D3ZZR3 [details] [associations]
            symbol:D3ZZR3 "Uncharacterized protein" species:10116
            "Rattus norvegicus" [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0016020 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GO:GO:0002250 GeneTree:ENSGT00560000076577 GO:GO:0097067
            OrthoDB:EOG4JM7Q2 IPI:IPI00210228 PRIDE:D3ZZR3
            Ensembl:ENSRNOT00000028732 Uniprot:D3ZZR3
        Length = 331

 Score = 348 (127.6 bits), Expect = 4.6e-31, P = 4.6e-31
 Identities = 94/282 (33%), Positives = 139/282 (49%)

Query:   219 TNVEKAEDYQSEDSGTAVFGVN-KFFDLSESDLQQLTGLNLDST----LED-IQPSLQAP 272
             T+ ++ +D   ED    ++  N KF  L   +L+   G++  S     + D +  ++   
Sbjct:    31 THEKEYKDQNEEDVRRLIWEKNLKFIMLH--NLEHSMGMHSYSVGMNHMGDMVAETIIGE 88

Query:   273 FSSNQTDTEMRAFQFNSLRHGDDLPEAFDW--RAEGVISKVKEQGKCACCWAFSAVGVVE 330
               S +   + +A          +LP    W  R +G    +  QG C  CWAFSAVG +E
Sbjct:    89 MGSERLPRKRKALGLIPSSVNQNLPAGVKWKERTKGCWKNLVFQGSCGSCWAFSAVGALE 148

Query:   331 AMHAIQGNSLTELSVQQLVDCDMS----NGGCNGGRMDDALQYIIDNGGVVSDQAYPYKA 386
                 ++   L  LS Q LVDC       N GC GG M +A QYIIDNGG+ S+ +YPYKA
Sbjct:   149 GQLKLKTGKLVSLSAQNLVDCSTEEKYGNKGCGGGFMTEAFQYIIDNGGIDSEASYPYKA 208

Query:   387 SESERGCLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNAN--GLFYYSGG 444
              + +  C              Y  +P+G+EE +K+ VAT+GP+SVG++A+    F Y  G
Sbjct:   209 MDEK--C-HYDPKNRAATCSRYIELPFGDEEALKEAVATKGPVSVGIDASHSSFFLYQSG 265

Query:   445 VID-------LNQRL----YGT--SIPYWIVKNSWGSDWGEK 473
             V D       +N  +    YGT     YW+VKNSWG  +G++
Sbjct:   266 VYDDPSCTENVNHGVLVVGYGTLDGKDYWLVKNSWGLHFGDQ 307

 Score = 303 (111.7 bits), Expect = 2.6e-35, Sum P(2) = 2.6e-35
 Identities = 67/162 (41%), Positives = 93/162 (57%)

Query:   497 KLSRLATEKLVDCDMS----NGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCL 552
             KL  L+ + LVDC       N GC GG M +A QYIIDNGG+ S+ +YPYKA + +  C 
Sbjct:   157 KLVSLSAQNLVDCSTEEKYGNKGCGGGFMTEAFQYIIDNGGIDSEASYPYKAMDEK--C- 213

Query:   553 XXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNAN--GLFYYSGGVIDLNQRL 610
                          Y  +P+G+EE +K+ VAT+GP+SVG++A+    F Y  GV D     
Sbjct:   214 HYDPKNRAATCSRYIELPFGDEEALKEAVATKGPVSVGIDASHSSFFLYQSGVYD--DPS 271

Query:   611 CNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
             C     NH +++VGYG  + KD     YW+VKNSWG  +G++
Sbjct:   272 CTENV-NHGVLVVGYGTLDGKD-----YWLVKNSWGLHFGDQ 307

 Score = 110 (43.8 bits), Expect = 2.6e-35, Sum P(2) = 2.6e-35
 Identities = 36/137 (26%), Positives = 60/137 (43%)

Query:    71 TNVEKAEDYQREDSGTAVFEVN-KFFDLSDSDLQQLTGLNLDST----LED-IQPSLQAP 124
             T+ ++ +D   ED    ++E N KF  L +  L+   G++  S     + D +  ++   
Sbjct:    31 THEKEYKDQNEEDVRRLIWEKNLKFIMLHN--LEHSMGMHSYSVGMNHMGDMVAETIIGE 88

Query:   125 FSSNQTDTEMRAFQFNSLRHGDDLPEAFDW--RAEGVISKVKEQGKCACCWAFSAVGVVE 182
               S +   + +A          +LP    W  R +G    +  QG C  CWAFSAVG +E
Sbjct:    89 MGSERLPRKRKALGLIPSSVNQNLPAGVKWKERTKGCWKNLVFQGSCGSCWAFSAVGALE 148

Query:   183 AMHAIQGNNLTELSVQH 199
                 ++   L  LS Q+
Sbjct:   149 GQLKLKTGKLVSLSAQN 165


>MGI|MGI:1338045 [details] [associations]
            symbol:Ctsw "cathepsin W" species:10090 "Mus musculus"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008233 "peptidase
            activity" evidence=IEA] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0016787 "hydrolase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 MGI:MGI:1338045 HOGENOM:HOG000230774
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 MEROPS:C01.037
            CTD:1521 eggNOG:NOG288820 HOVERGEN:HBG100117 KO:K08569 OMA:GRCGDGC
            OrthoDB:EOG4Q2DG3 EMBL:AF014941 EMBL:AC122861 IPI:IPI00111727
            RefSeq:NP_034115.2 UniGene:Mm.113590 ProteinModelPortal:P56203
            SMR:P56203 PhosphoSite:P56203 PRIDE:P56203 DNASU:13041
            Ensembl:ENSMUST00000025844 GeneID:13041 KEGG:mmu:13041
            InParanoid:P56203 NextBio:282936 Bgee:P56203 CleanEx:MM_CTSW
            Genevestigator:P56203 GermOnline:ENSMUSG00000024910 Uniprot:P56203
        Length = 371

 Score = 320 (117.7 bits), Expect = 2.7e-35, Sum P(2) = 2.7e-35
 Identities = 77/247 (31%), Positives = 126/247 (51%)

Query:   201 DKVYSSVEDLLRRHENFVTNVEKAEDYQSEDSGTAVFGVNKFFDLSESDLQQLTGLNLDS 260
             ++ Y +  +  RR   F  N+ +A+  Q ED GTA FG   F DL+E +  QL G     
Sbjct:    48 NRSYWNPAEYTRRLSIFAHNLAQAQRLQQEDLGTAEFGETPFSDLTEEEFGQLYG----- 102

Query:   261 TLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWR-AEGVISKVKEQGKCAC 319
               ++  P  + P   N T    +  + N+   G+ +P   DWR A+ +IS VK QG C C
Sbjct:   103 --QERSPE-RTP---NMT----KKVESNTW--GESVPRTCDWRKAKNIISSVKNQGSCKC 150

Query:   320 CWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSD 379
             CWA +A   ++A+  I+     ++SVQ+L+DC+    GCNGG + DA   +++N G+ S+
Sbjct:   151 CWAMAAADNIQALWRIKHQQFVDVSVQELLDCERCGNGCNGGFVWDAYLTVLNNSGLASE 210

Query:   380 QAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLF 439
             + YP++       CL             ++ +    E+ +  ++A  GP++V +N   L 
Sbjct:   211 KDYPFQGDRKPHRCLAKKYKKVAWIQD-FTMLS-NNEQAIAHYLAVHGPITVTINMKLLQ 268

Query:   440 YYSGGVI 446
             +Y  GVI
Sbjct:   269 HYQKGVI 275

 Score = 254 (94.5 bits), Expect = 4.7e-34, Sum P(2) = 4.7e-34
 Identities = 51/164 (31%), Positives = 91/164 (55%)

Query:   501 LATEKLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXX 560
             ++ ++L+DC+    GCNGG + DA   +++N G+ S++ YP++       CL        
Sbjct:   174 VSVQELLDCERCGNGCNGGFVWDAYLTVLNNSGLASEKDYPFQGDRKPHRCLAKKYKKVA 233

Query:   561 XXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLFYYSGGVIDLNQRLCNPKAQNHAL 620
                  ++ +    E+ +  ++A  GP++V +N   L +Y  GVI      C+P+  +H++
Sbjct:   234 WIQD-FTMLS-NNEQAIAHYLAVHGPITVTINMKLLQHYQKGVIKATPSSCDPRQVDHSV 291

Query:   621 IIVGYGEEEK--KDGT----------SIPYWIVKNSWGSDWGEK 652
             ++VG+G+E++  + GT          S PYWI+KNSWG+ WGEK
Sbjct:   292 LLVGFGKEKEGMQTGTVLSHSRKRRHSSPYWILKNSWGAHWGEK 335

 Score = 176 (67.0 bits), Expect = 4.7e-34, Sum P(2) = 4.7e-34
 Identities = 49/155 (31%), Positives = 77/155 (49%)

Query:    45 FLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAEDYQREDSGTAVFEVNKFFDLSDSDLQQ 104
             F  F    ++ Y +  +  RR   F  N+ +A+  Q+ED GTA F    F DL++ +  Q
Sbjct:    40 FKLFQIRFNRSYWNPAEYTRRLSIFAHNLAQAQRLQQEDLGTAEFGETPFSDLTEEEFGQ 99

Query:   105 LTGLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWR-AEGVISKV 163
             L G       ++  P  + P   N T    +  + N+   G+ +P   DWR A+ +IS V
Sbjct:   100 LYG-------QERSPE-RTP---NMT----KKVESNTW--GESVPRTCDWRKAKNIISSV 142

Query:   164 KEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
             K QG C CCWA +A   ++A+  I+     ++SVQ
Sbjct:   143 KNQGSCKCCWAMAAADNIQALWRIKHQQFVDVSVQ 177

 Score = 92 (37.4 bits), Expect = 2.7e-35, Sum P(2) = 2.7e-35
 Identities = 16/30 (53%), Positives = 20/30 (66%)

Query:   444 GVIDLNQRLYGTSIPYWIVKNSWGSDWGEK 473
             G +  + R    S PYWI+KNSWG+ WGEK
Sbjct:   306 GTVLSHSRKRRHSSPYWILKNSWGAHWGEK 335


>RGD|1560071 [details] [associations]
            symbol:Ctsll3 "cathepsin L-like 3" species:10116 "Rattus
            norvegicus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 RGD:1560071 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            EMBL:CH474032 IPI:IPI00560469 RefSeq:XP_001065834.2
            RefSeq:XP_573976.3 UniGene:Rn.104851 MEROPS:C01.107
            Ensembl:ENSRNOT00000061398 GeneID:498691 KEGG:rno:498691
            UCSC:RGD:1560071 CTD:70202 OMA:NCGIASD OrthoDB:EOG4HDSTZ
            NextBio:700548 Uniprot:D3ZJV2
        Length = 330

 Score = 328 (120.5 bits), Expect = 2.0e-30, Sum P(2) = 2.0e-30
 Identities = 98/283 (34%), Positives = 132/283 (46%)

Query:   215 ENFVTNVEKAEDYQSEDSGTAVFGVN-KFFDLSESD-LQQLTGLNLD-STLEDIQPS--- 268
             E + T   K  +   E    AV+  N K  +L   D L+   G +L+ +   D+  +   
Sbjct:    30 EEWKTKHGKTYNTNEEGQKRAVWENNMKMINLHNEDYLKGKHGFSLEMNAFGDLTNTEFR 89

Query:   269 -LQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVG 327
              L   F   +T   M+ F    L    D+P+  DWR  G ++ VK QG C  CWAFSAVG
Sbjct:    90 ELMTGFQGQKTKM-MKVFPEPFL---GDVPKTVDWRKHGYVTPVKNQGPCGSCWAFSAVG 145

Query:   328 VVEAMHAIQGNSLTELSVQQLVDCDMSNG--GCNGGRMDDALQYIIDNGGVVSDQAYPYK 385
              +E     +   L  LS Q LVDC  S+G  GC+GG  D A QY+ DNGG+ +  +YPY+
Sbjct:   146 SLEGQVFRKTGKLVPLSEQNLVDCSWSHGNKGCDGGLPDFAFQYVKDNGGLDTSVSYPYE 205

Query:   386 ASESERGCLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNAN--GLFYYSG 443
             A     G               +  IP  E   MK  VAT GP+SVG++       +Y G
Sbjct:   206 ALN---GTCRYNPKYSAAKVVGFMSIPPSENALMKA-VATVGPISVGIDIKHKSFQFYKG 261

Query:   444 GVI--------DLNQRL----YGTSIP---YWIVKNSWGSDWG 471
             G+         +LN  +    YG       YW+VKNSWG DWG
Sbjct:   262 GMYYEPDCSSTNLNHAVLVVGYGEESDGRKYWLVKNSWGRDWG 304

 Score = 271 (100.5 bits), Expect = 4.8e-35, Sum P(2) = 4.8e-35
 Identities = 65/158 (41%), Positives = 86/158 (54%)

Query:   497 KLSRLATEKLVDCDMSNG--GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXX 554
             KL  L+ + LVDC  S+G  GC+GG  D A QY+ DNGG+ +  +YPY+A     G    
Sbjct:   157 KLVPLSEQNLVDCSWSHGNKGCDGGLPDFAFQYVKDNGGLDTSVSYPYEALN---GTCRY 213

Query:   555 XXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNAN--GLFYYSGGVIDLNQRLCN 612
                        +  IP  E   MK  VAT GP+SVG++       +Y GG+    +  C+
Sbjct:   214 NPKYSAAKVVGFMSIPPSENALMKA-VATVGPISVGIDIKHKSFQFYKGGMY--YEPDCS 270

Query:   613 PKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWG 650
                 NHA+++VGYGEE   DG    YW+VKNSWG DWG
Sbjct:   271 STNLNHAVLVVGYGEES--DGRK--YWLVKNSWGRDWG 304

 Score = 141 (54.7 bits), Expect = 4.8e-35, Sum P(2) = 4.8e-35
 Identities = 45/140 (32%), Positives = 65/140 (46%)

Query:    67 ENFVTNVEKAEDYQREDSGTAVFEVN-KFFDLSDSD-LQQLTGLNLD-STLEDIQPS--- 120
             E + T   K  +   E    AV+E N K  +L + D L+   G +L+ +   D+  +   
Sbjct:    30 EEWKTKHGKTYNTNEEGQKRAVWENNMKMINLHNEDYLKGKHGFSLEMNAFGDLTNTEFR 89

Query:   121 -LQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVG 179
              L   F   +T   M+ F    L    D+P+  DWR  G ++ VK QG C  CWAFSAVG
Sbjct:    90 ELMTGFQGQKTKM-MKVFPEPFL---GDVPKTVDWRKHGYVTPVKNQGPCGSCWAFSAVG 145

Query:   180 VVEAMHAIQGNNLTELSVQH 199
              +E     +   L  LS Q+
Sbjct:   146 SLEGQVFRKTGKLVPLSEQN 165

 Score = 48 (22.0 bits), Expect = 2.6e-25, Sum P(2) = 2.6e-25
 Identities = 33/145 (22%), Positives = 60/145 (41%)

Query:    52 HDKVYSSVEDLLRRH--ENFVTNVE-KAEDYQREDSGTAVFEVNKFFDLSDSDLQQL-TG 107
             H K Y++ E+  +R   EN +  +    EDY +   G ++ E+N F DL++++ ++L TG
Sbjct:    36 HGKTYNTNEEGQKRAVWENNMKMINLHNEDYLKGKHGFSL-EMNAFGDLTNTEFRELMTG 94

Query:   108 LNLDSTLEDIQPSLQAPFSSNQTDT-EMRAFQF-NSLRHGDDLPE--AFD---------W 154
                  T   +      PF  +   T + R   +   +++        AF          +
Sbjct:    95 FQGQKT--KMMKVFPEPFLGDVPKTVDWRKHGYVTPVKNQGPCGSCWAFSAVGSLEGQVF 152

Query:   155 RAEGVISKVKEQGKCACCWAFSAVG 179
             R  G +  + EQ    C W+    G
Sbjct:   153 RKTGKLVPLSEQNLVDCSWSHGNKG 177

 Score = 37 (18.1 bits), Expect = 2.0e-30, Sum P(2) = 2.0e-30
 Identities = 6/12 (50%), Positives = 9/12 (75%)

Query:   530 DNGGVVSDQAYP 541
             +N G+ SD +YP
Sbjct:   317 NNCGIASDASYP 328

 Score = 37 (18.1 bits), Expect = 4.8e-06, Sum P(2) = 4.8e-06
 Identities = 6/12 (50%), Positives = 9/12 (75%)

Query:   372 DNGGVVSDQAYP 383
             +N G+ SD +YP
Sbjct:   317 NNCGIASDASYP 328


>RGD|1308751 [details] [associations]
            symbol:RGD1308751 "similar to Cathepsin L precursor (Major
            excreted protein) (MEP)" species:10116 "Rattus norvegicus"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 RGD:1308751 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            EMBL:CH474032 IPI:IPI00365697 RefSeq:XP_001065885.2
            RefSeq:XP_225137.5 MEROPS:C01.069 Ensembl:ENSRNOT00000061391
            GeneID:290981 KEGG:rno:290981 UCSC:RGD:1308751 CTD:290981
            OMA:ESYAYEA OrthoDB:EOG42823G NextBio:631921 Uniprot:D3ZKC3
        Length = 330

 Score = 325 (119.5 bits), Expect = 1.5e-28, P = 1.5e-28
 Identities = 75/221 (33%), Positives = 114/221 (51%)

Query:   295 DLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMS 354
             D+P++ DWR  G ++ VK QG+C  CWAFSAVG +E     +   L  LS Q LVDC  S
Sbjct:   113 DIPKSLDWREHGYVTPVKNQGQCGSCWAFSAVGSLEGQIFKKTGKLVSLSEQNLVDCSWS 172

Query:   355 NG--GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIP 412
              G  GCNGG M+ A QY+ +N G+ + ++Y Y+A +   G               + ++P
Sbjct:   173 YGNLGCNGGLMEFAFQYVKENRGLDTGESYAYEAQD---GLCRYNPKYSAANVTGFVKVP 229

Query:   413 YGEEEEMKKWVATRGPLSVGMNAN--GLFYYSGGVI--------DLNQRL----YGTSIP 458
               E++ M   VA+ GP+SVG++++     +YSGG+         +++  +    YG    
Sbjct:   230 LSEDDLMSA-VASVGPVSVGIDSHHQSFRFYSGGMYYEPDCSSTEMDHAVLVVGYGEESD 288

Query:   459 ---YWIVKNSWGSDWGEKVEDKVGSSGNRTRDLELTGVLPS 496
                YW+VKNSWG DWG     K+    N    +    + P+
Sbjct:   289 GGKYWLVKNSWGEDWGMDGYIKMAKDQNNNCGIATYAIYPT 329

 Score = 268 (99.4 bits), Expect = 1.3e-34, Sum P(2) = 1.3e-34
 Identities = 60/158 (37%), Positives = 91/158 (57%)

Query:   497 KLSRLATEKLVDCDMSNG--GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXX 554
             KL  L+ + LVDC  S G  GCNGG M+ A QY+ +N G+ + ++Y Y+A +   G    
Sbjct:   157 KLVSLSEQNLVDCSWSYGNLGCNGGLMEFAFQYVKENRGLDTGESYAYEAQD---GLCRY 213

Query:   555 XXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNAN--GLFYYSGGVIDLNQRLCN 612
                        + ++P  E++ M   VA+ GP+SVG++++     +YSGG+    +  C+
Sbjct:   214 NPKYSAANVTGFVKVPLSEDDLMSA-VASVGPVSVGIDSHHQSFRFYSGGMY--YEPDCS 270

Query:   613 PKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWG 650
                 +HA+++VGYGEE   DG    YW+VKNSWG DWG
Sbjct:   271 STEMDHAVLVVGYGEES--DGGK--YWLVKNSWGEDWG 304

 Score = 140 (54.3 bits), Expect = 1.3e-34, Sum P(2) = 1.3e-34
 Identities = 45/140 (32%), Positives = 66/140 (47%)

Query:    67 ENFVTNVEKAEDYQREDSGTAVFEVN-KFFDLSDSD-LQQLTGLNLD-STLEDIQPS--- 120
             E + T   K  +   E    AV+E N K  +L + D L+   G +L+ +   D+  +   
Sbjct:    30 EEWKTKHGKTYNTNEEGQKRAVWENNMKMINLHNEDYLKGKHGFSLEMNAFGDLTNTEFR 89

Query:   121 -LQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVG 179
              L   F S     E   F+   L    D+P++ DWR  G ++ VK QG+C  CWAFSAVG
Sbjct:    90 ELMTGFQS-MGPKETTIFREPFL---GDIPKSLDWREHGYVTPVKNQGQCGSCWAFSAVG 145

Query:   180 VVEAMHAIQGNNLTELSVQH 199
              +E     +   L  LS Q+
Sbjct:   146 SLEGQIFKKTGKLVSLSEQN 165

 Score = 53 (23.7 bits), Expect = 1.7e-25, Sum P(2) = 1.7e-25
 Identities = 35/144 (24%), Positives = 63/144 (43%)

Query:    52 HDKVYSSVEDLLRRH--ENFVTNVE-KAEDYQREDSGTAVFEVNKFFDLSDSDLQQL-TG 107
             H K Y++ E+  +R   EN +  +    EDY +   G ++ E+N F DL++++ ++L TG
Sbjct:    36 HGKTYNTNEEGQKRAVWENNMKMINLHNEDYLKGKHGFSL-EMNAFGDLTNTEFRELMTG 94

Query:   108 LNL----DST------LEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLP--EAFDWR 155
                    ++T      L DI  SL        T  + +  Q  S      +   E   ++
Sbjct:    95 FQSMGPKETTIFREPFLGDIPKSLDWREHGYVTPVKNQG-QCGSCWAFSAVGSLEGQIFK 153

Query:   156 AEGVISKVKEQGKCACCWAFSAVG 179
               G +  + EQ    C W++  +G
Sbjct:   154 KTGKLVSLSEQNLVDCSWSYGNLG 177


>TAIR|locus:505006391 [details] [associations]
            symbol:CEP3 "cysteine endopeptidase 3" species:3702
            "Arabidopsis thaliana" [GO:0005576 "extracellular region"
            evidence=ISM] [GO:0006508 "proteolysis" evidence=IEA;ISS]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA;ISS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0005783 EMBL:CP002686
            GenomeReviews:BA000014_GR eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 PROSITE:PS00014
            EMBL:AL049659 HSSP:O65039 HOGENOM:HOG000230773 KO:K16292
            EMBL:AK119026 IPI:IPI00525150 PIR:T06707 RefSeq:NP_566901.1
            UniGene:At.3162 ProteinModelPortal:Q9STL5 SMR:Q9STL5 MEROPS:C01.A02
            PRIDE:Q9STL5 EnsemblPlants:AT3G48350.1 GeneID:823993
            KEGG:ath:AT3G48350 TAIR:At3g48350 InParanoid:Q9STL5 OMA:DITHHEF
            PhylomeDB:Q9STL5 ProtClustDB:CLSN2917387 Genevestigator:Q9STL5
            Uniprot:Q9STL5
        Length = 364

 Score = 341 (125.1 bits), Expect = 2.7e-30, P = 2.7e-30
 Identities = 79/209 (37%), Positives = 114/209 (54%)

Query:   296 LPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMS- 354
             +P + DWR +G +++VK Q  C  CWAFS V  VE ++ I+ N L  LS Q+LVDCD   
Sbjct:   126 VPSSVDWREKGAVTEVKNQQDCGSCWAFSTVAAVEGINKIRTNKLVSLSEQELVDCDTEE 185

Query:   355 NGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYG 414
             N GC GG M+ A ++I +NGG+ +++ YPY +S+ +  C              +  +P  
Sbjct:   186 NQGCAGGLMEPAFEFIKNNGGIKTEETYPYDSSDVQF-CRANSIGGETVTIDGHEHVPEN 244

Query:   415 EEEEMKKWVATRGPLSVGMNANGLFY--YSGGVI------DLNQRL----YGTS---IPY 459
             +EEE+ K VA + P+SV ++A    +  YS GV        LN  +    YG +     Y
Sbjct:   245 DEEELLKAVAHQ-PVSVAIDAGSSDFQLYSEGVFIGECGTQLNHGVVIVGYGETKNGTKY 303

Query:   460 WIVKNSWGSDWGEK--VEDKVGSSGNRTR 486
             WIV+NSWG +WGE   V  + G S N  R
Sbjct:   304 WIVRNSWGPEWGEGGYVRIERGISENEGR 332

 Score = 274 (101.5 bits), Expect = 1.6e-34, Sum P(2) = 1.6e-34
 Identities = 62/159 (38%), Positives = 94/159 (59%)

Query:   496 SKLSRLATEKLVDCDMS-NGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXX 554
             +KL  L+ ++LVDCD   N GC GG M+ A ++I +NGG+ +++ YPY +S+ +  C   
Sbjct:   168 NKLVSLSEQELVDCDTEENQGCAGGLMEPAFEFIKNNGGIKTEETYPYDSSDVQF-CRAN 226

Query:   555 XXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLFY--YSGGVIDLNQRLCN 612
                        +  +P  +EEE+ K VA + P+SV ++A    +  YS GV  + +  C 
Sbjct:   227 SIGGETVTIDGHEHVPENDEEELLKAVAHQ-PVSVAIDAGSSDFQLYSEGVF-IGE--CG 282

Query:   613 PKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGE 651
              +  NH ++IVGYGE   K+GT   YWIV+NSWG +WGE
Sbjct:   283 TQL-NHGVVIVGYGET--KNGTK--YWIVRNSWGPEWGE 316

 Score = 133 (51.9 bits), Expect = 1.6e-34, Sum P(2) = 1.6e-34
 Identities = 38/150 (25%), Positives = 68/150 (45%)

Query:    50 RDHDKVYSSVEDLLRRHENFVTNVEKAEDYQREDSGTAVFEVNKFFDLSDSDLQQ-LTGL 108
             R H  V  +  + ++R   F  NV       +++    + ++N+F D++  + +    G 
Sbjct:    42 RGHHSVSRASHEAIKRFNVFRHNVLHVHRTNKKNKPYKL-KINRFADITHHEFRSSYAGS 100

Query:   109 NLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVKEQGK 168
             N+          L+ P   +        F + ++     +P + DWR +G +++VK Q  
Sbjct:   101 NVKH-----HRMLRGPKRGSG------GFMYENVTR---VPSSVDWREKGAVTEVKNQQD 146

Query:   169 CACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
             C  CWAFS V  VE ++ I+ N L  LS Q
Sbjct:   147 CGSCWAFSTVAAVEGINKIRTNKLVSLSEQ 176


>UNIPROTKB|F1S4J6 [details] [associations]
            symbol:Ssc.54235 "Cathepsin L1" species:9823 "Sus scrofa"
            [GO:0097067 "cellular response to thyroid hormone stimulus"
            evidence=IEA] [GO:0042393 "histone binding" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0004197 "cysteine-type
            endopeptidase activity" evidence=IEA] [GO:0002250 "adaptive immune
            response" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0004197
            GO:GO:0002250 GeneTree:ENSGT00560000076577 GO:GO:0097067
            EMBL:CU571031 RefSeq:XP_003130681.1 Ensembl:ENSSSCT00000011983
            GeneID:100515919 KEGG:ssc:100515919 OMA:IAICATK Uniprot:F1S4J6
        Length = 332

 Score = 299 (110.3 bits), Expect = 1.0e-25, P = 1.0e-25
 Identities = 74/192 (38%), Positives = 102/192 (53%)

Query:   259 DSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDL-PEAFDWRAEGVISKVKEQGKC 317
             D T E+ + ++   F  NQ   + + F    L  G  L P + DWR +G ++ VK QG C
Sbjct:    82 DMTNEEFRKTMNG-FQ-NQKHKKGKVF----LDAGSALTPHSVDWREKGYVTAVKNQGHC 135

Query:   318 ACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSNG--GCNGGRMDDALQYIIDNGG 375
               CWAFSA G +E     + + L  LS Q LVDC    G  GCNGG MD+A QYI DNGG
Sbjct:   136 GSCWAFSATGALEGQMFRKTSKLISLSEQNLVDCSWPEGNEGCNGGLMDNAFQYIKDNGG 195

Query:   376 VVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNA 435
             + S+++YPY   +   G               Y  IP  +E+ + K VAT GP+SVG++A
Sbjct:   196 LDSEESYPYFGKD---GSCKYKPQSSAANDTGYVDIPK-QEKALMKAVATVGPISVGIDA 251

Query:   436 N--GLFYYSGGV 445
             +     +YS G+
Sbjct:   252 SHESFQFYSTGI 263

 Score = 281 (104.0 bits), Expect = 1.9e-34, Sum P(2) = 1.9e-34
 Identities = 64/159 (40%), Positives = 89/159 (55%)

Query:   496 SKLSRLATEKLVDCDMSNG--GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLX 553
             SKL  L+ + LVDC    G  GCNGG MD+A QYI DNGG+ S+++YPY   +   G   
Sbjct:   156 SKLISLSEQNLVDCSWPEGNEGCNGGLMDNAFQYIKDNGGLDSEESYPYFGKD---GSCK 212

Query:   554 XXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNAN--GLFYYSGGVIDLNQRLC 611
                         Y  IP  +E+ + K VAT GP+SVG++A+     +YS G+    Q  C
Sbjct:   213 YKPQSSAANDTGYVDIPK-QEKALMKAVATVGPISVGIDASHESFQFYSTGIYFEPQ--C 269

Query:   612 NPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWG 650
             + +  +H +++VGYG E         YW+VKNSWG+ WG
Sbjct:   270 SSEDLDHGVLVVGYGVEGAHSNNK--YWLVKNSWGNTWG 306

 Score = 125 (49.1 bits), Expect = 1.9e-34, Sum P(2) = 1.9e-34
 Identities = 32/90 (35%), Positives = 46/90 (51%)

Query:   111 DSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDL-PEAFDWRAEGVISKVKEQGKC 169
             D T E+ + ++   F  NQ   + + F    L  G  L P + DWR +G ++ VK QG C
Sbjct:    82 DMTNEEFRKTMNG-FQ-NQKHKKGKVF----LDAGSALTPHSVDWREKGYVTAVKNQGHC 135

Query:   170 ACCWAFSAVGVVEAMHAIQGNNLTELSVQH 199
               CWAFSA G +E     + + L  LS Q+
Sbjct:   136 GSCWAFSATGALEGQMFRKTSKLISLSEQN 165

 Score = 123 (48.4 bits), Expect = 4.4e-13, Sum P(2) = 4.4e-13
 Identities = 39/135 (28%), Positives = 63/135 (46%)

Query:   355 NGGCNGGRMDDALQYIIDNGGVVSDQAYPY------------KASESERGCLXX--XXXX 400
             N GCNGG MD+A QYI DNGG+ S+++YPY             ++ ++ G +        
Sbjct:   175 NEGCNGGLMDNAFQYIKDNGGLDSEESYPYFGKDGSCKYKPQSSAANDTGYVDIPKQEKA 234

Query:   401 XXXXXXXYSRIPYGEE--EEMKKWVATRGPLSVGMNANGLFYYSGGVIDLNQRLYG--TS 456
                       I  G +   E  ++ +T        ++  L +   GV+ +   + G  ++
Sbjct:   235 LMKAVATVGPISVGIDASHESFQFYSTGIYFEPQCSSEDLDH---GVLVVGYGVEGAHSN 291

Query:   457 IPYWIVKNSWGSDWG 471
               YW+VKNSWG+ WG
Sbjct:   292 NKYWLVKNSWGNTWG 306


>TAIR|locus:2050145 [details] [associations]
            symbol:AT2G21430 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 EMBL:CP002685 GenomeReviews:CT485783_GR
            EMBL:AC006841 EMBL:X74359 IPI:IPI00519637 PIR:B84601
            RefSeq:NP_565512.1 UniGene:At.14069 ProteinModelPortal:P43295
            SMR:P43295 MEROPS:C01.A04 PRIDE:P43295 EnsemblPlants:AT2G21430.1
            GeneID:816682 KEGG:ath:AT2G21430 TAIR:At2g21430 eggNOG:COG4870
            HOGENOM:HOG000230774 InParanoid:P43295 KO:K01373 OMA:GSIEEHY
            PhylomeDB:P43295 ProtClustDB:CLSN2688311 Genevestigator:P43295
            GermOnline:AT2G21430 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 Uniprot:P43295
        Length = 361

 Score = 266 (98.7 bits), Expect = 1.9e-34, Sum P(3) = 1.9e-34
 Identities = 56/160 (35%), Positives = 80/160 (50%)

Query:   295 DLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMS 354
             +LPE FDWR  G ++ VK QG C  CW+FS  G +E  H +    L  LS QQLVDCD  
Sbjct:   131 NLPEEFDWRDRGAVTPVKNQGSCGSCWSFSTTGALEGAHFLATGKLVSLSEQQLVDCDHE 190

Query:   355 ---------NGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXX 405
                      + GCNGG M+ A +Y +  GG++ ++ YPY  ++   G             
Sbjct:   191 CDPEEEGSCDSGCNGGLMNSAFEYTLKTGGLMREKDYPYTGTDG--GSCKLDRSKIVASV 248

Query:   406 XXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLFYYSGGV 445
               +S +   E++     +   GPL+V +NA  +  Y GGV
Sbjct:   249 SNFSVVSINEDQIAANLIKN-GPLAVAINAAYMQTYIGGV 287

 Score = 223 (83.6 bits), Expect = 2.9e-30, Sum P(3) = 2.9e-30
 Identities = 53/166 (31%), Positives = 81/166 (48%)

Query:   497 KLSRLATEKLVDCDMS---------NGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASES 547
             KL  L+ ++LVDCD           + GCNGG M+ A +Y +  GG++ ++ YPY  ++ 
Sbjct:   175 KLVSLSEQQLVDCDHECDPEEEGSCDSGCNGGLMNSAFEYTLKTGGLMREKDYPYTGTDG 234

Query:   548 ERGCLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLFYYSGGVIDLN 607
               G               +S +   E++     +   GPL+V +NA  +  Y GGV    
Sbjct:   235 --GSCKLDRSKIVASVSNFSVVSINEDQIAANLIKN-GPLAVAINAAYMQTYIGGVSC-- 289

Query:   608 QRLCNPKAQNHALIIVGYGEEEKKDGT--SIPYWIVKNSWGSDWGE 651
               +C+ +  NH +++VGYG            PYWI+KNSWG  WGE
Sbjct:   290 PYICSRRL-NHGVLLVGYGSAGFSQARLKEKPYWIIKNSWGESWGE 334

 Score = 141 (54.7 bits), Expect = 2.9e-30, Sum P(3) = 2.9e-30
 Identities = 24/52 (46%), Positives = 30/52 (57%)

Query:   147 DLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
             +LPE FDWR  G ++ VK QG C  CW+FS  G +E  H +    L  LS Q
Sbjct:   131 NLPEEFDWRDRGAVTPVKNQGSCGSCWSFSTTGALEGAHFLATGKLVSLSEQ 182

 Score = 86 (35.3 bits), Expect = 1.9e-34, Sum P(3) = 1.9e-34
 Identities = 12/15 (80%), Positives = 13/15 (86%)

Query:   458 PYWIVKNSWGSDWGE 472
             PYWI+KNSWG  WGE
Sbjct:   320 PYWIIKNSWGESWGE 334

 Score = 73 (30.8 bits), Expect = 1.9e-34, Sum P(3) = 1.9e-34
 Identities = 22/79 (27%), Positives = 39/79 (49%)

Query:    26 LLESNIFQTRGYLNSPVTRFLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAEDYQREDSG 85
             L+   + +T   + S    F  F +   KVY S+E+   R   F  N+ +A  +Q+ D  
Sbjct:    29 LIRQVVDETEPKVLSSEDHFTLFKKKFGKVYGSIEEHYYRFSVFKANLLRAMRHQKMDP- 87

Query:    86 TAVFEVNKFFDLSDSDLQQ 104
             +A   V +F DL+ S+ ++
Sbjct:    88 SARHGVTQFSDLTRSEFRR 106

 Score = 71 (30.1 bits), Expect = 3.1e-34, Sum P(3) = 3.1e-34
 Identities = 18/51 (35%), Positives = 29/51 (56%)

Query:   202 KVYSSVEDLLRRHENFVTNVEKAEDYQSEDSGTAVFGVNKFFDLSESDLQQ 252
             KVY S+E+   R   F  N+ +A  +Q  D  +A  GV +F DL+ S+ ++
Sbjct:    57 KVYGSIEEHYYRFSVFKANLLRAMRHQKMDP-SARHGVTQFSDLTRSEFRR 106


>UNIPROTKB|P25975 [details] [associations]
            symbol:CTSL1 "Cathepsin L1" species:9913 "Bos taurus"
            [GO:0005764 "lysosome" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 eggNOG:COG4870
            HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0005764 GeneTree:ENSGT00560000076577
            HOVERGEN:HBG011513 EMBL:X91755 EMBL:BC102312 EMBL:AB017648
            IPI:IPI00687440 PIR:S15845 RefSeq:NP_776457.1 UniGene:Bt.3987
            ProteinModelPortal:P25975 SMR:P25975 STRING:P25975
            Ensembl:ENSBTAT00000022710 Ensembl:ENSBTAT00000036427 GeneID:281108
            KEGG:bta:281108 CTD:1515 InParanoid:P25975 KO:K01365 OMA:EEFRATH
            OrthoDB:EOG48PMKF BindingDB:P25975 ChEMBL:CHEMBL2113
            NextBio:20805179 ArrayExpress:P25975 Uniprot:P25975
        Length = 334

 Score = 294 (108.6 bits), Expect = 2.5e-30, Sum P(2) = 2.5e-30
 Identities = 65/154 (42%), Positives = 88/154 (57%)

Query:   295 DLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMS 354
             D+P++ DW  +G ++ VK QG+C  CWAFSA G +E     +   L  LS Q LVDC  +
Sbjct:   113 DVPKSVDWTKKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRA 172

Query:   355 NG--GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIP 412
              G  GCNGG MD+A QYI DNGG+ S+++YPY A+++   C              +  IP
Sbjct:   173 QGNQGCNGGLMDNAFQYIKDNGGLDSEESYPYLATDTN-SC-NYKPECSAANDTGFVDIP 230

Query:   413 YGEEEEMKKWVATRGPLSVGMNANGL---FYYSG 443
               E+  MK  VAT GP+SV ++A      FY SG
Sbjct:   231 QREKALMKA-VATVGPISVAIDAGHTSFQFYKSG 263

 Score = 279 (103.3 bits), Expect = 2.4e-34, Sum P(2) = 2.4e-34
 Identities = 66/159 (41%), Positives = 91/159 (57%)

Query:   497 KLSRLATEKLVDCDMSNG--GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXX 554
             KL  L+ + LVDC  + G  GCNGG MD+A QYI DNGG+ S+++YPY A+++   C   
Sbjct:   157 KLVSLSEQNLVDCSRAQGNQGCNGGLMDNAFQYIKDNGGLDSEESYPYLATDTN-SC-NY 214

Query:   555 XXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGL---FYYSGGVIDLNQRLC 611
                        +  IP  E+  MK  VAT GP+SV ++A      FY SG   D +   C
Sbjct:   215 KPECSAANDTGFVDIPQREKALMKA-VATVGPISVAIDAGHTSFQFYKSGIYYDPD---C 270

Query:   612 NPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWG 650
             + K  +H +++VGYG E   D  +  +WIVKNSWG +WG
Sbjct:   271 SSKDLDHGVLVVGYGFEGT-DSNNNKFWIVKNSWGPEWG 308

 Score = 126 (49.4 bits), Expect = 2.4e-34, Sum P(2) = 2.4e-34
 Identities = 22/53 (41%), Positives = 32/53 (60%)

Query:   147 DLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQH 199
             D+P++ DW  +G ++ VK QG+C  CWAFSA G +E     +   L  LS Q+
Sbjct:   113 DVPKSVDWTKKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQN 165

 Score = 72 (30.4 bits), Expect = 2.5e-30, Sum P(2) = 2.5e-30
 Identities = 10/13 (76%), Positives = 12/13 (92%)

Query:   459 YWIVKNSWGSDWG 471
             +WIVKNSWG +WG
Sbjct:   296 FWIVKNSWGPEWG 308


>UNIPROTKB|F1NZ37 [details] [associations]
            symbol:LOC420160 "Uncharacterized protein" species:9031
            "Gallus gallus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GeneTree:ENSGT00560000076577 EMBL:AADN02062018
            IPI:IPI00587784 Ensembl:ENSGALT00000006765 OMA:CGVANQA
            Uniprot:F1NZ37
        Length = 340

 Score = 279 (103.3 bits), Expect = 4.3e-33, Sum P(2) = 4.3e-33
 Identities = 60/162 (37%), Positives = 91/162 (56%)

Query:   497 KLSRLATEKLVDCD--MSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXX 554
             KL+ L+ + L+DC   + N GC GG M  A QY+ DNGG+ S+  YPY+A+++   C   
Sbjct:   163 KLAVLSEQNLIDCSWKLGNNGCQGGYMTRAFQYVHDNGGMNSEHIYPYQATDTS-SCRYN 221

Query:   555 XXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLFY--YSGGVIDLNQRLCN 612
                        +  +  G E  +++ VAT GP+SV ++A+  F+  Y  G+   N   C+
Sbjct:   222 PADRAANCSTVWL-VAQGSEAALEQAVATVGPVSVAVDASSFFFHFYKSGIF--NSMFCS 278

Query:   613 PKAQNHALIIVGYG--EEEKKDGTSIPYWIVKNSWGSDWGEK 652
              K  NH ++ VGYG  +E +K+   + YWI+KNSW   WGEK
Sbjct:   279 QKV-NHGMLAVGYGISQEARKN---VSYWILKNSWSEVWGEK 316

 Score = 276 (102.2 bits), Expect = 3.4e-34, Sum P(3) = 3.4e-34
 Identities = 57/155 (36%), Positives = 82/155 (52%)

Query:   297 PEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCD--MS 354
             P   DWR  G ++ VK QG C  CWAFSA G +E +       L  LS Q L+DC   + 
Sbjct:   121 PAEVDWRMRGYVTPVKNQGHCGSCWAFSATGALEGLVFNWTGKLAVLSEQNLIDCSWKLG 180

Query:   355 NGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYG 414
             N GC GG M  A QY+ DNGG+ S+  YPY+A+++   C              +  +  G
Sbjct:   181 NNGCQGGYMTRAFQYVHDNGGMNSEHIYPYQATDTS-SCRYNPADRAANCSTVWL-VAQG 238

Query:   415 EEEEMKKWVATRGPLSVGMNANGLFY--YSGGVID 447
              E  +++ VAT GP+SV ++A+  F+  Y  G+ +
Sbjct:   239 SEAALEQAVATVGPVSVAVDASSFFFHFYKSGIFN 273

 Score = 114 (45.2 bits), Expect = 4.3e-33, Sum P(2) = 4.3e-33
 Identities = 22/51 (43%), Positives = 27/51 (52%)

Query:   149 PEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQH 199
             P   DWR  G ++ VK QG C  CWAFSA G +E +       L  LS Q+
Sbjct:   121 PAEVDWRMRGYVTPVKNQGHCGSCWAFSATGALEGLVFNWTGKLAVLSEQN 171

 Score = 76 (31.8 bits), Expect = 3.4e-34, Sum P(3) = 3.4e-34
 Identities = 11/18 (61%), Positives = 14/18 (77%)

Query:   456 SIPYWIVKNSWGSDWGEK 473
             ++ YWI+KNSW   WGEK
Sbjct:   299 NVSYWILKNSWSEVWGEK 316

 Score = 70 (29.7 bits), Expect = 3.4e-34, Sum P(3) = 3.4e-34
 Identities = 26/85 (30%), Positives = 44/85 (51%)

Query:   202 KVYSSVEDLLRRHENFVTNVEKAEDYQSEDS-GTAVF--GVNKFFDLSESDLQQLTGLNL 258
             K Y    +L+RR E +  N+ + E +  E+S G   F  G+N + DL + +  QL  LN 
Sbjct:    43 KEYPGEAELIRR-EVWENNLRRIEQHNWEESQGQHTFRLGMNHYGDLMDEEFNQL--LNG 99

Query:   259 DSTLEDIQPSLQAPFSSNQ-TDTEM 282
              + ++  +P+L    S+ Q T  E+
Sbjct:   100 FAPVQHEEPALTFQASAAQKTPAEV 124


>UNIPROTKB|Q28944 [details] [associations]
            symbol:CTSL1 "Cathepsin L1" species:9823 "Sus scrofa"
            [GO:0005764 "lysosome" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 eggNOG:COG4870
            HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0005764 GeneTree:ENSGT00560000076577
            HOVERGEN:HBG011513 KO:K01365 OrthoDB:EOG48PMKF MEROPS:C01.032
            CTD:1514 EMBL:D37917 EMBL:AJ315771 PIR:A58195 RefSeq:NP_999057.1
            UniGene:Ssc.54036 ProteinModelPortal:Q28944 SMR:Q28944
            STRING:Q28944 Ensembl:ENSSSCT00000012233 GeneID:396926
            KEGG:ssc:396926 OMA:DASETGK ArrayExpress:Q28944 Uniprot:Q28944
        Length = 334

 Score = 289 (106.8 bits), Expect = 3.4e-30, Sum P(2) = 3.4e-30
 Identities = 61/155 (39%), Positives = 87/155 (56%)

Query:   295 DLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMS 354
             ++P++ DWR +G ++ VK QG+C  CWAFSA G +E     +   L  LS Q LVDC   
Sbjct:   113 EVPKSVDWREKGYVTAVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRP 172

Query:   355 NG--GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIP 412
              G  GCNGG MD+A QY+ DNGG+ ++++YPY   E+   C              +  IP
Sbjct:   173 QGNQGCNGGLMDNAFQYVKDNGGLDTEESYPYLGRETN-SC-TYKPECSAANDTGFVDIP 230

Query:   413 YGEEEEMKKWVATRGPLSVGMNA--NGLFYYSGGV 445
               E+  MK  VAT GP+SV ++A  +   +Y  G+
Sbjct:   231 QREKALMKA-VATVGPISVAIDAGHSSFQFYKSGI 264

 Score = 274 (101.5 bits), Expect = 4.1e-34, Sum P(2) = 4.1e-34
 Identities = 65/159 (40%), Positives = 88/159 (55%)

Query:   497 KLSRLATEKLVDCDMSNG--GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXX 554
             KL  L+ + LVDC    G  GCNGG MD+A QY+ DNGG+ ++++YPY   E+   C   
Sbjct:   157 KLVSLSEQNLVDCSRPQGNQGCNGGLMDNAFQYVKDNGGLDTEESYPYLGRETN-SC-TY 214

Query:   555 XXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGL---FYYSGGVIDLNQRLC 611
                        +  IP  E+  MK  VAT GP+SV ++A      FY SG   D +   C
Sbjct:   215 KPECSAANDTGFVDIPQREKALMKA-VATVGPISVAIDAGHSSFQFYKSGIYYDPD---C 270

Query:   612 NPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWG 650
             + K  +H +++VGYG E   D  S  +WIVKNSWG +WG
Sbjct:   271 SSKDLDHGVLVVGYGFEGT-DSNSSKFWIVKNSWGPEWG 308

 Score = 129 (50.5 bits), Expect = 4.1e-34, Sum P(2) = 4.1e-34
 Identities = 37/121 (30%), Positives = 61/121 (50%)

Query:    87 AVFEVN-KFFDLSDSDLQQ-LTGLNL------DSTLEDIQPSLQAPFSSNQTDTEMRAFQ 138
             AV+E N K  +L + +  Q   G ++      D T E+ +  +   F  NQ   + + F 
Sbjct:    50 AVWEKNMKMIELHNQEYSQGKHGFSMAMNAFGDMTNEEFRQVMNG-FQ-NQKHKKGKVFH 107

Query:   139 FNSLRHGDDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
              + +    ++P++ DWR +G ++ VK QG+C  CWAFSA G +E     +   L  LS Q
Sbjct:   108 ESLVL---EVPKSVDWREKGYVTAVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQ 164

Query:   199 H 199
             +
Sbjct:   165 N 165

 Score = 76 (31.8 bits), Expect = 3.4e-30, Sum P(2) = 3.4e-30
 Identities = 15/31 (48%), Positives = 19/31 (61%)

Query:   444 GVIDLNQRLYGT---SIPYWIVKNSWGSDWG 471
             GV+ +     GT   S  +WIVKNSWG +WG
Sbjct:   278 GVLVVGYGFEGTDSNSSKFWIVKNSWGPEWG 308

 Score = 41 (19.5 bits), Expect = 6.7e-25, Sum P(2) = 6.7e-25
 Identities = 9/23 (39%), Positives = 14/23 (60%)

Query:    66 HENFVTNVEKAEDYQREDSGTAV 88
             HE+ V  V K+ D++ +   TAV
Sbjct:   107 HESLVLEVPKSVDWREKGYVTAV 129

 Score = 41 (19.5 bits), Expect = 6.7e-25, Sum P(2) = 6.7e-25
 Identities = 9/23 (39%), Positives = 14/23 (60%)

Query:   214 HENFVTNVEKAEDYQSEDSGTAV 236
             HE+ V  V K+ D++ +   TAV
Sbjct:   107 HESLVLEVPKSVDWREKGYVTAV 129


>WB|WBGene00019986 [details] [associations]
            symbol:R09F10.1 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0016021 "integral to
            membrane" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076599
            EMBL:FO081137 HSSP:P53634 PIR:D89588 RefSeq:NP_509408.1
            ProteinModelPortal:Q23030 SMR:Q23030 STRING:Q23030 MEROPS:C01.A44
            PaxDb:Q23030 EnsemblMetazoa:R09F10.1 GeneID:181087
            KEGG:cel:CELE_R09F10.1 UCSC:R09F10.1 CTD:181087 WormBase:R09F10.1
            InParanoid:Q23030 OMA:EYPYSAL NextBio:912346 Uniprot:Q23030
        Length = 383

 Score = 344 (126.2 bits), Expect = 1.3e-30, P = 1.3e-30
 Identities = 90/292 (30%), Positives = 147/292 (50%)

Query:   200 HDKVYSSVEDLLRRHENFVTNVEKAE-DYQSEDSGTAVFGVNKFFDLS-ESDLQQLTGLN 257
             H+++++   D + + +   T+VE+ E  YQ        F   +  +L  + D+ + T   
Sbjct:    78 HEQMFN---DFILKFDRKYTSVEEFEYRYQIFLRNVIEFEAEEERNLGLDLDVNEFT--- 131

Query:   258 LDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVKEQGKC 317
              D T E++Q  +Q      + D +   F+ + L  G   P + DWR +G ++ +K QG+C
Sbjct:   132 -DWTDEELQKMVQEN-KYTKYDFDTPKFEGSYLETGVIRPASIDWREQGKLTPIKNQGQC 189

Query:   318 ACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSNGGCNGGRMDDALQYIIDNGGVV 377
               CWAF+ V  VEA +AI+   L  LS Q++VDCD  N GC+GG    A++++ +NG + 
Sbjct:   190 GSCWAFATVASVEAQNAIKKGKLVSLSEQEMVDCDGRNNGCSGGYRPYAMKFVKENG-LE 248

Query:   378 SDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNA-N 436
             S++ YPY A + ++ C                R+    EE++  WV T+GP++ GMN   
Sbjct:   249 SEKEYPYSALKHDQ-CFLKENDTRVFIDDF--RMLSNNEEDIANWVGTKGPVTFGMNVVK 305

Query:   437 GLFYYSGGVI-----DLNQRLYG----TSI--------PYWIVKNSWGSDWG 471
              ++ Y  G+      D  ++  G    T I         YWIVKNSWG+ WG
Sbjct:   306 AMYSYRSGIFNPSVEDCTEKSMGAHALTIIGYGGEGESAYWIVKNSWGTSWG 357

 Score = 263 (97.6 bits), Expect = 5.4e-34, Sum P(2) = 5.4e-34
 Identities = 57/156 (36%), Positives = 87/156 (55%)

Query:   497 KLSRLATEKLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXX 556
             KL  L+ +++VDCD  N GC+GG    A++++ +NG + S++ YPY A + ++ C     
Sbjct:   211 KLVSLSEQEMVDCDGRNNGCSGGYRPYAMKFVKENG-LESEKEYPYSALKHDQ-CFLKEN 268

Query:   557 XXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNA-NGLFYYSGGVIDLNQRLCNPKA 615
                        R+    EE++  WV T+GP++ GMN    ++ Y  G+ + +   C  K+
Sbjct:   269 DTRVFIDDF--RMLSNNEEDIANWVGTKGPVTFGMNVVKAMYSYRSGIFNPSVEDCTEKS 326

Query:   616 QN-HALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWG 650
                HAL I+GYG E    G S  YWIVKNSWG+ WG
Sbjct:   327 MGAHALTIIGYGGE----GESA-YWIVKNSWGTSWG 357

 Score = 166 (63.5 bits), Expect = 5.4e-34, Sum P(2) = 5.4e-34
 Identities = 48/155 (30%), Positives = 75/155 (48%)

Query:    46 LNFMRDHDKVYSSVEDLLRRHENFVTNVEKAE-DYQREDSGTAVFEVNKFFDLS-DSDLQ 103
             LN   ++ K      D + + +   T+VE+ E  YQ        FE  +  +L  D D+ 
Sbjct:    69 LNHKMENLKHEQMFNDFILKFDRKYTSVEEFEYRYQIFLRNVIEFEAEEERNLGLDLDVN 128

Query:   104 QLTGLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKV 163
             + T    D T E++Q  +Q      + D +   F+ + L  G   P + DWR +G ++ +
Sbjct:   129 EFT----DWTDEELQKMVQEN-KYTKYDFDTPKFEGSYLETGVIRPASIDWREQGKLTPI 183

Query:   164 KEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
             K QG+C  CWAF+ V  VEA +AI+   L  LS Q
Sbjct:   184 KNQGQCGSCWAFATVASVEAQNAIKKGKLVSLSEQ 218

 Score = 89 (36.4 bits), Expect = 6.3e-26, Sum P(2) = 6.3e-26
 Identities = 21/65 (32%), Positives = 39/65 (60%)

Query:    45 FLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAEDYQREDSGTAVFEVNKFFDLSDSDLQQ 104
             F +F+   D+ Y+SVE+   R++ F+ NV + E  +  + G  + +VN+F D +D +LQ+
Sbjct:    82 FNDFILKFDRKYTSVEEFEYRYQIFLRNVIEFEAEEERNLGLDL-DVNEFTDWTDEELQK 140

Query:   105 LTGLN 109
             +   N
Sbjct:   141 MVQEN 145


>UNIPROTKB|F1MHV4 [details] [associations]
            symbol:CTSW "Uncharacterized protein" species:9913 "Bos
            taurus" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00639
            GeneTree:ENSGT00660000095458 OMA:GRCGDGC EMBL:DAAA02063574
            IPI:IPI00716321 Ensembl:ENSBTAT00000027681 Uniprot:F1MHV4
        Length = 375

 Score = 269 (99.8 bits), Expect = 1.7e-29, Sum P(2) = 1.7e-29
 Identities = 64/197 (32%), Positives = 97/197 (49%)

Query:   293 GDDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQ-QLVDC 351
             G+  P+  DWR  G IS V++Q  C CCWA +A G +EA+ AI+     E+SVQ +L+DC
Sbjct:   124 GESEPQTCDWRKVGTISPVRDQRNCNCCWAMAAAGNIEALWAIKFRHFVEVSVQPELLDC 183

Query:   352 DMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRI 411
             D    GC GG + DA   +++N G+ S++ YP+  S     CL                I
Sbjct:   184 DRCGNGCRGGFVWDAFLTVLNNSGLASEKDYPFNGSGKTHRCLAKKYKKVAWIQDFI--I 241

Query:   412 PYGEEEEMKKWVATRGPLSVGMNANGLFYYSGGVIDLNQRLYG-TSIPYWIVKNSWG-SD 469
                 E+ M + +AT GP++V +N   L  Y  GVI         T + + ++   +G + 
Sbjct:   242 LQACEQSMARHLATEGPITVTINMTLLQQYQKGVIKATPTTCDPTQVDHSVLLVGFGKTK 301

Query:   470 WGEKVEDKVGSSGNRTR 486
               E  + K  S G+  R
Sbjct:   302 LVEGRQGKAASFGSHAR 318

 Score = 227 (85.0 bits), Expect = 1.0e-33, Sum P(3) = 1.0e-33
 Identities = 51/163 (31%), Positives = 80/163 (49%)

Query:   505 KLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXX 564
             +L+DCD    GC GG + DA   +++N G+ S++ YP+  S     CL            
Sbjct:   179 ELLDCDRCGNGCRGGFVWDAFLTVLNNSGLASEKDYPFNGSGKTHRCLAKKYKKVAWIQD 238

Query:   565 XYSRIPYGEEEEMKKWVATRGPLSVGMNANGLFYYSGGVIDLNQRLCNPKAQNHALIIVG 624
                 I    E+ M + +AT GP++V +N   L  Y  GVI      C+P   +H++++VG
Sbjct:   239 FI--ILQACEQSMARHLATEGPITVTINMTLLQQYQKGVIKATPTTCDPTQVDHSVLLVG 296

Query:   625 YGEEEKKDGT---------------SIPYWIVKNSWGSDWGEK 652
             +G+ +  +G                S+ YWI+KNSWG  WGE+
Sbjct:   297 FGKTKLVEGRQGKAASFGSHARPRRSMAYWILKNSWGPQWGEE 339

 Score = 141 (54.7 bits), Expect = 1.0e-33, Sum P(3) = 1.0e-33
 Identities = 25/54 (46%), Positives = 35/54 (64%)

Query:   145 GDDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
             G+  P+  DWR  G IS V++Q  C CCWA +A G +EA+ AI+  +  E+SVQ
Sbjct:   124 GESEPQTCDWRKVGTISPVRDQRNCNCCWAMAAAGNIEALWAIKFRHFVEVSVQ 177

 Score = 109 (43.4 bits), Expect = 1.7e-29, Sum P(2) = 1.7e-29
 Identities = 23/56 (41%), Positives = 34/56 (60%)

Query:   200 HDKVYSSVEDLLRRHENFVTNVEKAEDYQSEDSGTAVFGVNKFFDLSESDLQQLTG 255
             +++ Y +  +  RR + F  N+ KA+  Q ED GTA FGV +F DL+E +  QL G
Sbjct:    49 YNRSYPNPAEYARRLDIFAQNLAKAQRLQEEDLGTAEFGVTQFSDLTEEEFVQLYG 104

 Score = 104 (41.7 bits), Expect = 1.0e-33, Sum P(3) = 1.0e-33
 Identities = 23/63 (36%), Positives = 35/63 (55%)

Query:    45 FLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAEDYQREDSGTAVFEVNKFFDLSDSDLQQ 104
             F  F   +++ Y +  +  RR + F  N+ KA+  Q ED GTA F V +F DL++ +  Q
Sbjct:    42 FRLFQMQYNRSYPNPAEYARRLDIFAQNLAKAQRLQEEDLGTAEFGVTQFSDLTEEEFVQ 101

Query:   105 LTG 107
             L G
Sbjct:   102 LYG 104

 Score = 81 (33.6 bits), Expect = 5.4e-17, Sum P(3) = 5.4e-17
 Identities = 12/18 (66%), Positives = 15/18 (83%)

Query:   456 SIPYWIVKNSWGSDWGEK 473
             S+ YWI+KNSWG  WGE+
Sbjct:   322 SMAYWILKNSWGPQWGEE 339


>ZFIN|ZDB-GENE-001205-4 [details] [associations]
            symbol:ctsk "cathepsin K" species:7955 "Danio rerio"
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0016787 "hydrolase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 InterPro:IPR015644
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            ZFIN:ZDB-GENE-001205-4 HOGENOM:HOG000230774 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 HOVERGEN:HBG011513
            CTD:1513 KO:K01371 OrthoDB:EOG4SJ5FC PANTHER:PTHR12411:SF55
            EMBL:BC092901 IPI:IPI00512751 RefSeq:NP_001017778.1
            UniGene:Dr.76224 ProteinModelPortal:Q568D6 SMR:Q568D6 GeneID:550475
            KEGG:dre:550475 InParanoid:Q568D6 NextBio:20879718
            ArrayExpress:Q568D6 Uniprot:Q568D6
        Length = 333

 Score = 340 (124.7 bits), Expect = 3.5e-30, P = 3.5e-30
 Identities = 85/235 (36%), Positives = 119/235 (50%)

Query:   259 DSTLEDIQPS---LQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVKEQG 315
             D TLE++      LQ P   +  +T    F  +  R G  LP++ D+R  G ++ VK QG
Sbjct:    84 DMTLEEVAEKVMGLQMPMYRDPANT----FVPDD-RVGK-LPKSIDYRKLGYVTSVKNQG 137

Query:   316 KCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSNGGCNGGRMDDALQYIIDNGG 375
              C  CWAFS+VG +E         L +LS Q LVDC   N GC GG M +A +Y+ +N G
Sbjct:   138 SCGSCWAFSSVGALEGQLMKTKGQLVDLSPQNLVDCVTENDGCGGGYMTNAFRYVSNNQG 197

Query:   376 VVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNA 435
             + S+++YPY  ++ +  C              Y  IP G E  +   VA  GP+SVG++A
Sbjct:   198 IDSEESYPYVGTDQQ--C-AYNTSGVAASCRGYKEIPQGNERALTAAVANVGPVSVGIDA 254

Query:   436 --NGLFYYSGGVI--------DLNQRL----YGTSI---PYWIVKNSWGSDWGEK 473
               +   YY  GV         D+N  +    YG +     YWIVKNSWG +WG+K
Sbjct:   255 MQSTFLYYKSGVYYDPNCNKEDVNHAVLAVGYGATPRGKKYWIVKNSWGEEWGKK 309

 Score = 279 (103.3 bits), Expect = 1.0e-33, Sum P(2) = 1.0e-33
 Identities = 62/159 (38%), Positives = 87/159 (54%)

Query:   497 KLSRLATEKLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXX 556
             +L  L+ + LVDC   N GC GG M +A +Y+ +N G+ S+++YPY  ++ +  C     
Sbjct:   161 QLVDLSPQNLVDCVTENDGCGGGYMTNAFRYVSNNQGIDSEESYPYVGTDQQ--C-AYNT 217

Query:   557 XXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNA--NGLFYYSGGVI-DLNQRLCNP 613
                      Y  IP G E  +   VA  GP+SVG++A  +   YY  GV  D N   CN 
Sbjct:   218 SGVAASCRGYKEIPQGNERALTAAVANVGPVSVGIDAMQSTFLYYKSGVYYDPN---CNK 274

Query:   614 KAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
             +  NHA++ VGYG   +  G    YWIVKNSWG +WG+K
Sbjct:   275 EDVNHAVLAVGYGATPR--GKK--YWIVKNSWGEEWGKK 309

 Score = 120 (47.3 bits), Expect = 1.0e-33, Sum P(2) = 1.0e-33
 Identities = 33/92 (35%), Positives = 47/92 (51%)

Query:   111 DSTLEDIQPS---LQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVKEQG 167
             D TLE++      LQ P   +  +T    F  +  R G  LP++ D+R  G ++ VK QG
Sbjct:    84 DMTLEEVAEKVMGLQMPMYRDPANT----FVPDD-RVGK-LPKSIDYRKLGYVTSVKNQG 137

Query:   168 KCACCWAFSAVGVVEAMHAIQGNNLTELSVQH 199
              C  CWAFS+VG +E         L +LS Q+
Sbjct:   138 SCGSCWAFSSVGALEGQLMKTKGQLVDLSPQN 169


>MGI|MGI:1922258 [details] [associations]
            symbol:4930486L24Rik "RIKEN cDNA 4930486L24 gene"
            species:10090 "Mus musculus" [GO:0005576 "extracellular region"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0030054 "cell
            junction" evidence=ISO] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 MGI:MGI:1922258
            GO:GO:0005576 GO:GO:0030054 eggNOG:COG4870 HOGENOM:HOG000230774
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 HSSP:P07711
            EMBL:AY146988 EMBL:AK145933 EMBL:BC061218 IPI:IPI00280732
            RefSeq:NP_835199.1 UniGene:Mm.19839 ProteinModelPortal:Q80UB0
            SMR:Q80UB0 MEROPS:C01.972 PRIDE:Q80UB0 Ensembl:ENSMUST00000091569
            GeneID:214639 KEGG:mmu:214639 UCSC:uc007qvs.1 InParanoid:Q80UB0
            OMA:RYHAENS OrthoDB:EOG4XWG0N NextBio:374408 Bgee:Q80UB0
            CleanEx:MM_4930486L24RIK Genevestigator:Q80UB0 Uniprot:Q80UB0
        Length = 333

 Score = 265 (98.3 bits), Expect = 1.0e-33, Sum P(3) = 1.0e-33
 Identities = 63/161 (39%), Positives = 90/161 (55%)

Query:   497 KLSRLATEKLVDCDMSN--GGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXX 554
             +L  L+ + L+DC  SN    C+GG M +A QY+ DNGG+ ++++YPY      R C   
Sbjct:   157 RLVPLSEQNLLDCMGSNVTHDCSGGFMQNAFQYVKDNGGLATEESYPYIGPG--RKC-RY 213

Query:   555 XXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNAN--GLFYYSGGVIDLNQRLCN 612
                        + +IP G EE + K VA  GP+SV ++A+     +Y  G+    Q  C 
Sbjct:   214 HAENSAANVRDFVQIP-GREEALMKAVAKVGPISVAVDASHDSFQFYDSGIYYEPQ--CK 270

Query:   613 PKAQNHALIIVGYG-EEEKKDGTSIPYWIVKNSWGSDWGEK 652
                 NHA+++VGYG E E+ DG S  YW+VKNSWG +WG K
Sbjct:   271 RVHLNHAVLVVGYGFEGEESDGNS--YWLVKNSWGEEWGMK 309

 Score = 247 (92.0 bits), Expect = 2.2e-22, Sum P(2) = 2.2e-22
 Identities = 61/168 (36%), Positives = 87/168 (51%)

Query:   282 MRAFQFNSLRHGDDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLT 341
             M  FQ +   +   +P+  DWR  G ++ VK QG CA  WAFSA G +E     +   L 
Sbjct:   103 MHVFQDHQFLY---VPKYVDWRMLGYVTPVKNQGYCASSWAFSATGSLEGQMFKKTGRLV 159

Query:   342 ELSVQQLVDCDMSN--GGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXX 399
              LS Q L+DC  SN    C+GG M +A QY+ DNGG+ ++++YPY      R C      
Sbjct:   160 PLSEQNLLDCMGSNVTHDCSGGFMQNAFQYVKDNGGLATEESYPYIGPG--RKC-RYHAE 216

Query:   400 XXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNAN--GLFYYSGGV 445
                     + +IP G EE + K VA  GP+SV ++A+     +Y  G+
Sbjct:   217 NSAANVRDFVQIP-GREEALMKAVAKVGPISVAVDASHDSFQFYDSGI 263

 Score = 108 (43.1 bits), Expect = 1.0e-33, Sum P(3) = 1.0e-33
 Identities = 25/66 (37%), Positives = 34/66 (51%)

Query:   134 MRAFQFNSLRHGDDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLT 193
             M  FQ +   +   +P+  DWR  G ++ VK QG CA  WAFSA G +E     +   L 
Sbjct:   103 MHVFQDHQFLY---VPKYVDWRMLGYVTPVKNQGYCASSWAFSATGSLEGQMFKKTGRLV 159

Query:   194 ELSVQH 199
              LS Q+
Sbjct:   160 PLSEQN 165

 Score = 99 (39.9 bits), Expect = 2.5e-09, Sum P(3) = 2.5e-09
 Identities = 39/142 (27%), Positives = 57/142 (40%)

Query:   358 CNGGRMDDALQYIIDNGGVVSDQAYPYKASESE-RGCLXXXXXXXXXXXXXYSRIPYGEE 416
             C+GG M +A QY+ DNGG+ ++++YPY     + R                  R     +
Sbjct:   178 CSGGFMQNAFQYVKDNGGLATEESYPYIGPGRKCRYHAENSAANVRDFVQIPGREEALMK 237

Query:   417 EEMKKW---VATRGPL-SVGMNANGLFYYSGGV-IDLNQRL----YG------TSIPYWI 461
                K     VA      S     +G++Y      + LN  +    YG          YW+
Sbjct:   238 AVAKVGPISVAVDASHDSFQFYDSGIYYEPQCKRVHLNHAVLVVGYGFEGEESDGNSYWL 297

Query:   462 VKNSWGSDWGEKVEDKVGSSGN 483
             VKNSWG +WG K   K+    N
Sbjct:   298 VKNSWGEEWGMKGYIKIAKDWN 319

 Score = 45 (20.9 bits), Expect = 1.0e-33, Sum P(3) = 1.0e-33
 Identities = 17/60 (28%), Positives = 29/60 (48%)

Query:    52 HDKVYSSVEDLLRR---HENFVTNVEKAEDYQREDSGTAVFEVNKFFDLSDSD-LQQLTG 107
             H K Y+  E+ LRR    +NF   +E       E        +N F DL++++ ++ +TG
Sbjct:    36 HGKAYNVNEERLRRAVWEKNFKM-IELHNWEYLEGKHDFTMTMNAFGDLTNTEFVKMMTG 94

 Score = 41 (19.5 bits), Expect = 6.4e-24, Sum P(2) = 6.4e-24
 Identities = 11/36 (30%), Positives = 18/36 (50%)

Query:   108 LNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLR 143
             L +DST   + PSL   ++  +T    +A+  N  R
Sbjct:    12 LEIDSTAPTLDPSLDVQWNEWRTK-HGKAYNVNEER 46

 Score = 41 (19.5 bits), Expect = 6.4e-24, Sum P(2) = 6.4e-24
 Identities = 11/36 (30%), Positives = 18/36 (50%)

Query:   256 LNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLR 291
             L +DST   + PSL   ++  +T    +A+  N  R
Sbjct:    12 LEIDSTAPTLDPSLDVQWNEWRTK-HGKAYNVNEER 46


>TAIR|locus:2097104 [details] [associations]
            symbol:AT3G43960 species:3702 "Arabidopsis thaliana"
            [GO:0005886 "plasma membrane" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] [GO:0031225 "anchored to
            membrane" evidence=TAS] [GO:0048767 "root hair elongation"
            evidence=IMP] [GO:0016132 "brassinosteroid biosynthetic process"
            evidence=RCA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 EMBL:CP002686
            GenomeReviews:BA000014_GR GO:GO:0031225 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0048767 MEROPS:I29.003 HOGENOM:HOG000230773
            EMBL:AL163975 EMBL:AK118634 IPI:IPI00526842 PIR:T48950
            RefSeq:NP_566867.1 UniGene:At.43352 ProteinModelPortal:Q9LXW3
            SMR:Q9LXW3 STRING:Q9LXW3 PaxDb:Q9LXW3 PRIDE:Q9LXW3
            EnsemblPlants:AT3G43960.1 GeneID:823513 KEGG:ath:AT3G43960
            TAIR:At3g43960 eggNOG:NOG286334 InParanoid:Q9LXW3 KO:K01376
            OMA:MAISFRT PhylomeDB:Q9LXW3 ProtClustDB:CLSN2917367
            Genevestigator:Q9LXW3 GermOnline:AT3G43960 Uniprot:Q9LXW3
        Length = 376

 Score = 371 (135.7 bits), Expect = 1.4e-33, P = 1.4e-33
 Identities = 104/317 (32%), Positives = 161/317 (50%)

Query:   177 AVGVVEAMHAI--QGNNLT--ELSVQHHDKVYSSVEDLLRRHENFVTNVEKAEDYQSEDS 232
             ++GVV A  +   +G  LT  E  +  + K Y+ + +  RR + F  N+++ E++ S+ +
Sbjct:    21 SLGVVTATESQRNEGEVLTMYEQWLVENGKNYNGLGEKERRFKIFKDNLKRIEEHNSDPN 80

Query:   233 GTAVFGVNKFFDLSESDLQQLTGLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRH 292
              +   G+NKF DL+  + Q      L   +E  + SL        +D   R +Q+   + 
Sbjct:    81 RSYERGLNKFSDLTADEFQ---ASYLGGKME--KKSL--------SDVAER-YQY---KE 123

Query:   293 GDDLPEAFDWRAEG-VISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDC 351
             GD LP+  DWR  G V+ +VK QG+C  CWAF+A G VE ++ I    L  LS Q+L+DC
Sbjct:   124 GDVLPDEVDWRERGAVVPRVKRQGECGSCWAFAATGAVEGINQITTGELVSLSEQELIDC 183

Query:   352 DMSNG--GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYS 409
             D  N   GC GG    A ++I +NGG+VSD+ Y Y   ++                  + 
Sbjct:   184 DRGNDNFGCAGGGAVWAFEFIKENGGIVSDEVYGYTGEDTAACKAIEMKTTRVVTINGHE 243

Query:   410 RIPYGEEEEMKKWVATRGPLSVGMNANGLFYYSGGVI---------DLNQRL--YGTSIP 458
              +P  +E  +KK VA + P+SV ++A  +  Y  GV          D N  +  YGTS  
Sbjct:   244 VVPVNDEMSLKKAVAYQ-PISVMISAANMSDYKSGVYKGACSNLWGDHNVLIVGYGTSSD 302

Query:   459 ---YWIVKNSWGSDWGE 472
                YW+++NSWG +WGE
Sbjct:   303 EGDYWLIRNSWGPEWGE 319

 Score = 224 (83.9 bits), Expect = 9.5e-16, P = 9.5e-16
 Identities = 65/216 (30%), Positives = 101/216 (46%)

Query:   440 YYSGGVI--DLNQRLYGTSIPYWIVKNSWGSDWGEKVEDKVGSSGNRTRDLELTGVLPSK 497
             Y  G V+  +++ R  G  +P    +   GS W       V      T     TG L S 
Sbjct:   121 YKEGDVLPDEVDWRERGAVVPRVKRQGECGSCWAFAATGAVEGINQIT-----TGELVS- 174

Query:   498 LSRLATEKLVDCDMSNG--GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXX 555
                L+ ++L+DCD  N   GC GG    A ++I +NGG+VSD+ Y Y   ++        
Sbjct:   175 ---LSEQELIDCDRGNDNFGCAGGGAVWAFEFIKENGGIVSDEVYGYTGEDTAACKAIEM 231

Query:   556 XXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLFYYSGGVIDLNQRLCNPKA 615
                       +  +P  +E  +KK VA + P+SV ++A  +  Y  GV    +  C+   
Sbjct:   232 KTTRVVTINGHEVVPVNDEMSLKKAVAYQ-PISVMISAANMSDYKSGVY---KGACSNLW 287

Query:   616 QNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGE 651
              +H ++IVGYG     +G    YW+++NSWG +WGE
Sbjct:   288 GDHNVLIVGYGTSSD-EGD---YWLIRNSWGPEWGE 319

 Score = 173 (66.0 bits), Expect = 5.7e-10, P = 5.7e-10
 Identities = 51/183 (27%), Positives = 91/183 (49%)

Query:    17 LHTFMIKVALLESNIFQTRGYLNSPVTRFLNFMRDHDKVYSSVEDLLRRHENFVTNVEKA 76
             L   +I ++L      +++      +T +  ++ ++ K Y+ + +  RR + F  N+++ 
Sbjct:    13 LSVLLISISLGVVTATESQRNEGEVLTMYEQWLVENGKNYNGLGEKERRFKIFKDNLKRI 72

Query:    77 EDYQREDSGTAVFEVNKFFDLSDSDLQQLTGLNLDSTLEDIQPSLQAPFSSNQTDTEMRA 136
             E++  + + +    +NKF DL+  + Q      L   +E  + SL        +D   R 
Sbjct:    73 EEHNSDPNRSYERGLNKFSDLTADEFQ---ASYLGGKME--KKSL--------SDVAER- 118

Query:   137 FQFNSLRHGDDLPEAFDWRAEG-VISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTEL 195
             +Q+   + GD LP+  DWR  G V+ +VK QG+C  CWAF+A G VE ++ I    L  L
Sbjct:   119 YQY---KEGDVLPDEVDWRERGAVVPRVKRQGECGSCWAFAATGAVEGINQITTGELVSL 175

Query:   196 SVQ 198
             S Q
Sbjct:   176 SEQ 178


>ZFIN|ZDB-GENE-050208-336 [details] [associations]
            symbol:ctskl "cathepsin K, like" species:7955
            "Danio rerio" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            ZFIN:ZDB-GENE-050208-336 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 EMBL:BX465190
            GeneTree:ENSGT00660000095458 IPI:IPI00491185 RefSeq:XP_695425.1
            UniGene:Dr.110795 Ensembl:ENSDART00000062749 GeneID:567046
            KEGG:dre:567046 CTD:567046 NextBio:20888499 Bgee:F1QCP8
            Uniprot:F1QCP8
        Length = 349

 Score = 316 (116.3 bits), Expect = 1.4e-27, P = 1.4e-27
 Identities = 90/294 (30%), Positives = 141/294 (47%)

Query:   200 HDKVYSSVEDLLRRHENFVTNVEKAEDYQSEDS-GTAVF--GVNKFFDLSESDLQQLTGL 256
             H+  Y    + + R   + TN++K     ++ S G ++F   +NK+ DL+  + ++L G 
Sbjct:    48 HEISYDEESEDVHRKTIWETNMQKIWKNNNDFSFGLSMFKMAMNKYGDLTSVEYKRLLGS 107

Query:   257 NLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVKEQGK 316
              +  T                T  +M   + N+ R G       D+RA+G +++VK+QG 
Sbjct:   108 KIKGT---------GNRKGKITSAQM--LRLNAKRLG---VTNIDYRAKGYVTEVKDQGY 153

Query:   317 CACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSNG--GCNGGRMDDALQYIIDNG 374
             C  CW+FS  G +E         L  LS QQLVDC  S G  GC+G  M +A  Y+I+N 
Sbjct:   154 CGSCWSFSTTGAIEGQMYKHTGRLVSLSEQQLVDCSRSYGTYGCSGAWMANAYDYVINNA 213

Query:   375 GVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMN 434
              + S   YPY + +++  C              Y  +P G E+ +   VAT GP+SV ++
Sbjct:   214 -LESSDTYPYTSVDTQP-CFYEKNLAMAGISD-YRFVPAGNEQALADAVATVGPVSVAID 270

Query:   435 ANG--LFYYSGGVI--------DLNQRL----YGTS--IPYWIVKNSWGSDWGE 472
             A+     +YS G+         +LN  +    YG+     YWI+KNSWG+ WGE
Sbjct:   271 ADNPSFLFYSSGIYKESNCNPNNLNHAVLVVGYGSEEGTDYWIIKNSWGTGWGE 324

 Score = 272 (100.8 bits), Expect = 1.4e-33, Sum P(2) = 1.4e-33
 Identities = 61/159 (38%), Positives = 89/159 (55%)

Query:   497 KLSRLATEKLVDCDMSNG--GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXX 554
             +L  L+ ++LVDC  S G  GC+G  M +A  Y+I+N  + S   YPY + +++  C   
Sbjct:   176 RLVSLSEQQLVDCSRSYGTYGCSGAWMANAYDYVINNA-LESSDTYPYTSVDTQP-CFYE 233

Query:   555 XXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANG--LFYYSGGVIDLNQRLCN 612
                        Y  +P G E+ +   VAT GP+SV ++A+     +YS G+    +  CN
Sbjct:   234 KNLAMAGISD-YRFVPAGNEQALADAVATVGPVSVAIDADNPSFLFYSSGIY--KESNCN 290

Query:   613 PKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGE 651
             P   NHA+++VGYG EE   GT   YWI+KNSWG+ WGE
Sbjct:   291 PNNLNHAVLVVGYGSEE---GTD--YWIIKNSWGTGWGE 324

 Score = 126 (49.4 bits), Expect = 1.4e-33, Sum P(2) = 1.4e-33
 Identities = 40/152 (26%), Positives = 69/152 (45%)

Query:    50 RDHDKVYSSVEDLLRRHENFVTNVEKAEDYQREDS-GTAVFEV--NKFFDLSDSDLQQLT 106
             + H+  Y    + + R   + TN++K      + S G ++F++  NK+ DL+  + ++L 
Sbjct:    46 KKHEISYDEESEDVHRKTIWETNMQKIWKNNNDFSFGLSMFKMAMNKYGDLTSVEYKRLL 105

Query:   107 GLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVKEQ 166
             G  +  T                T  +M   + N+ R G       D+RA+G +++VK+Q
Sbjct:   106 GSKIKGT---------GNRKGKITSAQM--LRLNAKRLG---VTNIDYRAKGYVTEVKDQ 151

Query:   167 GKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
             G C  CW+FS  G +E         L  LS Q
Sbjct:   152 GYCGSCWSFSTTGAIEGQMYKHTGRLVSLSEQ 183


>UNIPROTKB|E9PTT3 [details] [associations]
            symbol:Ctsr "Protein Ctsr" species:10116 "Rattus
            norvegicus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            IPI:IPI00627092 Ensembl:ENSRNOT00000024115 RGD:631422
            Uniprot:E9PTT3
        Length = 334

 Score = 253 (94.1 bits), Expect = 3.9e-23, Sum P(2) = 3.9e-23
 Identities = 57/159 (35%), Positives = 83/159 (52%)

Query:   293 GDDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCD 352
             G+ LP+  DWR +G +++V+ Q  C  CWAF+  G +E     +   LT LSVQ LVDC 
Sbjct:   112 GNVLPKFVDWRKKGYVTRVQNQKFCNSCWAFAVTGAIEGQMFNKTGQLTPLSVQNLVDCT 171

Query:   353 MSNG--GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSR 410
              S G  GC  G    A +Y+++NGG+ ++  YPYK  E   G               +  
Sbjct:   172 KSQGNEGCQWGDPHIAYEYVLNNGGLEAEATYPYKGKE---GVCRYNPKHSKAEITGFVS 228

Query:   411 IPYGEEEEMKKWVATRGPLSVGMNA--NGLFYYSGGVID 447
             +P  E+  M+  VAT GP+SV ++A  N   +Y  G+ D
Sbjct:   229 LPESEDILMEA-VATIGPISVAVDASFNSFGFYKKGLYD 266

 Score = 250 (93.1 bits), Expect = 2.1e-33, Sum P(3) = 2.1e-33
 Identities = 57/159 (35%), Positives = 86/159 (54%)

Query:   497 KLSRLATEKLVDCDMSNG--GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXX 554
             +L+ L+ + LVDC  S G  GC  G    A +Y+++NGG+ ++  YPYK  E   G    
Sbjct:   158 QLTPLSVQNLVDCTKSQGNEGCQWGDPHIAYEYVLNNGGLEAEATYPYKGKE---GVCRY 214

Query:   555 XXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNA--NGLFYYSGGVIDLNQRLCN 612
                        +  +P  E+  M+  VAT GP+SV ++A  N   +Y  G+ D  +  C+
Sbjct:   215 NPKHSKAEITGFVSLPESEDILMEA-VATIGPISVAVDASFNSFGFYKKGLYD--EPNCS 271

Query:   613 PKAQNHALIIVGYG-EEEKKDGTSIPYWIVKNSWGSDWG 650
                 NH++++VGYG E  + DG S  YW++KNSWG  WG
Sbjct:   272 NNTVNHSVLVVGYGFEGNETDGNS--YWLIKNSWGRKWG 308

 Score = 120 (47.3 bits), Expect = 2.1e-33, Sum P(3) = 2.1e-33
 Identities = 22/55 (40%), Positives = 33/55 (60%)

Query:   145 GDDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQH 199
             G+ LP+  DWR +G +++V+ Q  C  CWAF+  G +E     +   LT LSVQ+
Sbjct:   112 GNVLPKFVDWRKKGYVTRVQNQKFCNSCWAFAVTGAIEGQMFNKTGQLTPLSVQN 166

 Score = 80 (33.2 bits), Expect = 7.1e-09, Sum P(3) = 7.1e-09
 Identities = 34/145 (23%), Positives = 51/145 (35%)

Query:   355 NGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYG 414
             N GC  G    A +Y+++NGG+ ++  YPYK  E                          
Sbjct:   176 NEGCQWGDPHIAYEYVLNNGGLEAEATYPYKGKEGVCRYNPKHSKAEITGFVSLPESEDI 235

Query:   415 EEEEMKKW----VATRGPL-SVGMNANGLF---YYSGGVIDLNQRLYGTSIP-------- 458
               E +       VA      S G    GL+     S   ++ +  + G            
Sbjct:   236 LMEAVATIGPISVAVDASFNSFGFYKKGLYDEPNCSNNTVNHSVLVVGYGFEGNETDGNS 295

Query:   459 YWIVKNSWGSDWGEKVEDKVGSSGN 483
             YW++KNSWG  WG +   K+    N
Sbjct:   296 YWLIKNSWGRKWGLRGYMKIPKDQN 320

 Score = 46 (21.3 bits), Expect = 2.1e-33, Sum P(3) = 2.1e-33
 Identities = 16/58 (27%), Positives = 34/58 (58%)

Query:    51 DHDKVYSSVEDLLRRHENFVTNVEKAEDYQREDS-GTAVF--EVNKFFDLSDSDLQQL 105
             +++K Y+  E+  RR   +  N++  + + RE+S G   F  E+N+F DL+  + +++
Sbjct:    35 EYEKSYTMEEEGHRRAV-WEENMKMIKLHNRENSLGKNGFIMEMNEFGDLTAEEFRKM 91


>UNIPROTKB|Q5E998 [details] [associations]
            symbol:CTSL2 "Cathepsin L2" species:9913 "Bos taurus"
            [GO:0005764 "lysosome" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            HOVERGEN:HBG011513 UniGene:Bt.3987 MEROPS:C01.032 EMBL:BT021022
            IPI:IPI00711962 ProteinModelPortal:Q5E998 SMR:Q5E998 STRING:Q5E998
            InParanoid:Q5E998 Uniprot:Q5E998
        Length = 334

 Score = 285 (105.4 bits), Expect = 2.4e-29, Sum P(2) = 2.4e-29
 Identities = 64/154 (41%), Positives = 87/154 (56%)

Query:   295 DLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMS 354
             D+P++ DW  +G ++ VK QG+C  CWAFSA G +E     +   L  LS Q LVDC  +
Sbjct:   113 DVPKSVDWTKKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRA 172

Query:   355 NG--GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIP 412
              G  GCNGG MD+A QYI DNG + S+++YPY A+++   C              +  IP
Sbjct:   173 QGNQGCNGGLMDNAFQYIKDNGCLDSEESYPYLATDTN-SC-NYKPECSAANDTGFVDIP 230

Query:   413 YGEEEEMKKWVATRGPLSVGMNANGL---FYYSG 443
               E+  MK  VAT GP+SV ++A      FY SG
Sbjct:   231 QREKALMKA-VATVGPISVAIDAGHTSFQFYKSG 263

 Score = 270 (100.1 bits), Expect = 2.3e-33, Sum P(2) = 2.3e-33
 Identities = 65/159 (40%), Positives = 90/159 (56%)

Query:   497 KLSRLATEKLVDCDMSNG--GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXX 554
             KL  L+ + LVDC  + G  GCNGG MD+A QYI DNG + S+++YPY A+++   C   
Sbjct:   157 KLVSLSEQNLVDCSRAQGNQGCNGGLMDNAFQYIKDNGCLDSEESYPYLATDTN-SC-NY 214

Query:   555 XXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGL---FYYSGGVIDLNQRLC 611
                        +  IP  E+  MK  VAT GP+SV ++A      FY SG   D +   C
Sbjct:   215 KPECSAANDTGFVDIPQREKALMKA-VATVGPISVAIDAGHTSFQFYKSGIYYDPD---C 270

Query:   612 NPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWG 650
             + K  +H +++VGYG E   D  +  +WIVKNSWG +WG
Sbjct:   271 SSKDLDHGVLVVGYGFEGT-DSNNNKFWIVKNSWGPEWG 308

 Score = 126 (49.4 bits), Expect = 2.3e-33, Sum P(2) = 2.3e-33
 Identities = 22/53 (41%), Positives = 32/53 (60%)

Query:   147 DLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQH 199
             D+P++ DW  +G ++ VK QG+C  CWAFSA G +E     +   L  LS Q+
Sbjct:   113 DVPKSVDWTKKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQN 165

 Score = 72 (30.4 bits), Expect = 2.4e-29, Sum P(2) = 2.4e-29
 Identities = 10/13 (76%), Positives = 12/13 (92%)

Query:   459 YWIVKNSWGSDWG 471
             +WIVKNSWG +WG
Sbjct:   296 FWIVKNSWGPEWG 308


>UNIPROTKB|Q9UBX1 [details] [associations]
            symbol:CTSF "Cathepsin F" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0004197 "cysteine-type
            endopeptidase activity" evidence=TAS] [GO:0005764 "lysosome"
            evidence=TAS] [GO:0019886 "antigen processing and presentation of
            exogenous peptide antigen via MHC class II" evidence=TAS]
            [GO:0043202 "lysosomal lumen" evidence=TAS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 Reactome:REACT_6900 GO:GO:0019886 eggNOG:COG4870
            HOGENOM:HOG000230774 KO:K01373 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0043202
            GO:GO:0004197 HOVERGEN:HBG011513 EMBL:AJ007331 EMBL:AF088886
            EMBL:AF132894 EMBL:AF136279 EMBL:AF071748 EMBL:AF071749
            EMBL:AK313657 EMBL:BC011682 EMBL:BC036451 EMBL:AL137742
            IPI:IPI00002816 RefSeq:NP_003784.2 UniGene:Hs.11590 PDB:1D5U
            PDB:1M6D PDBsum:1D5U PDBsum:1M6D ProteinModelPortal:Q9UBX1
            SMR:Q9UBX1 STRING:Q9UBX1 MEROPS:C01.018 PhosphoSite:Q9UBX1
            DMDM:12643325 PaxDb:Q9UBX1 PeptideAtlas:Q9UBX1 PRIDE:Q9UBX1
            DNASU:8722 Ensembl:ENST00000310325 GeneID:8722 KEGG:hsa:8722
            UCSC:uc001oip.3 CTD:8722 GeneCards:GC11M066332 HGNC:HGNC:2531
            HPA:CAB002141 MIM:603539 neXtProt:NX_Q9UBX1 PharmGKB:PA27031
            InParanoid:Q9UBX1 OMA:LAPPEWD OrthoDB:EOG4CC41T PhylomeDB:Q9UBX1
            BindingDB:Q9UBX1 ChEMBL:CHEMBL2517 ChiTaRS:CTSF
            EvolutionaryTrace:Q9UBX1 GenomeRNAi:8722 NextBio:32715
            ArrayExpress:Q9UBX1 Bgee:Q9UBX1 CleanEx:HS_CTSF
            Genevestigator:Q9UBX1 GermOnline:ENSG00000174080 Uniprot:Q9UBX1
        Length = 484

 Score = 312 (114.9 bits), Expect = 1.7e-31, Sum P(2) = 1.7e-31
 Identities = 69/197 (35%), Positives = 101/197 (51%)

Query:   293 GDDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCD 352
             GD  P  +DWR++G ++KVK+QG C  CWAFS  G VE    +   +L  LS Q+L+DCD
Sbjct:   268 GDLAPPEWDWRSKGAVTKVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCD 327

Query:   353 MSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIP 412
               +  C GG   +A   I + GG+ ++  Y Y+                       S+  
Sbjct:   328 KMDKACMGGLPSNAYSAIKNLGGLETEDDYSYQGHMQSCNFSAEKAKVYINDSVELSQ-- 385

Query:   413 YGEEEEMKKWVATRGPLSVGMNANGLFYYSGGV------------IDLNQRL--YG--TS 456
                E+++  W+A RGP+SV +NA G+ +Y  G+            ID    L  YG  + 
Sbjct:   386 --NEQKLAAWLAKRGPISVAINAFGMQFYRHGISRPLRPLCSPWLIDHAVLLVGYGNRSD 443

Query:   457 IPYWIVKNSWGSDWGEK 473
             +P+W +KNSWG+DWGEK
Sbjct:   444 VPFWAIKNSWGTDWGEK 460

 Score = 242 (90.2 bits), Expect = 2.9e-33, Sum P(3) = 2.9e-33
 Identities = 50/155 (32%), Positives = 82/155 (52%)

Query:   498 LSRLATEKLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXX 557
             L  L+ ++L+DCD  +  C GG   +A   I + GG+ ++  Y Y+              
Sbjct:   315 LLSLSEQELLDCDKMDKACMGGLPSNAYSAIKNLGGLETEDDYSYQGHMQSCNFSAEKAK 374

Query:   558 XXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLFYYSGGVIDLNQRLCNPKAQN 617
                      S+     E+++  W+A RGP+SV +NA G+ +Y  G+    + LC+P   +
Sbjct:   375 VYINDSVELSQ----NEQKLAAWLAKRGPISVAINAFGMQFYRHGISRPLRPLCSPWLID 430

Query:   618 HALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
             HA+++VGYG       + +P+W +KNSWG+DWGEK
Sbjct:   431 HAVLLVGYGNR-----SDVPFWAIKNSWGTDWGEK 460

 Score = 137 (53.3 bits), Expect = 2.9e-33, Sum P(3) = 2.9e-33
 Identities = 25/54 (46%), Positives = 32/54 (59%)

Query:   145 GDDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
             GD  P  +DWR++G ++KVK+QG C  CWAFS  G VE    +    L  LS Q
Sbjct:   268 GDLAPPEWDWRSKGAVTKVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQ 321

 Score = 98 (39.6 bits), Expect = 2.9e-33, Sum P(3) = 2.9e-33
 Identities = 21/61 (34%), Positives = 35/61 (57%)

Query:    45 FLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAEDYQREDSGTAVFEVNKFFDLSDSDLQQ 104
             F NF+  +++ Y S E+   R   FV N+ +A+  Q  D GTA + V KF DL++ + + 
Sbjct:   187 FKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 246

Query:   105 L 105
             +
Sbjct:   247 I 247

 Score = 98 (39.6 bits), Expect = 4.6e-23, Sum P(2) = 4.6e-23
 Identities = 20/54 (37%), Positives = 33/54 (61%)

Query:   200 HDKVYSSVEDLLRRHENFVTNVEKAEDYQSEDSGTAVFGVNKFFDLSESDLQQL 253
             +++ Y S E+   R   FV N+ +A+  Q+ D GTA +GV KF DL+E + + +
Sbjct:   194 YNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRTI 247


>UNIPROTKB|Q9GL24 [details] [associations]
            symbol:CTSL1 "Cathepsin L1" species:9615 "Canis lupus
            familiaris" [GO:0005764 "lysosome" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0006508
            "proteolysis" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 CTD:1515 KO:K01365
            OrthoDB:EOG48PMKF EMBL:AJ279008 RefSeq:NP_001239115.1
            UniGene:Cfa.3571 ProteinModelPortal:Q9GL24 SMR:Q9GL24
            MEROPS:C01.032 Ensembl:ENSCAFT00000001770
            Ensembl:ENSCAFT00000023837 GeneID:100684364 KEGG:cfa:100684364
            InParanoid:Q9GL24 OMA:FDQNLDT NextBio:20817211 Uniprot:Q9GL24
        Length = 333

 Score = 289 (106.8 bits), Expect = 1.2e-24, P = 1.2e-24
 Identities = 60/155 (38%), Positives = 88/155 (56%)

Query:   295 DLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMS 354
             ++P++ DWR +G ++ VK QG+C  CWAFSA G +E     +   L  LS Q LVDC  +
Sbjct:   113 EIPKSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRA 172

Query:   355 NG--GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIP 412
              G  GCNGG MD+A +Y+ DNGG+ S+++YPY   ++E  C              +  +P
Sbjct:   173 QGNEGCNGGLMDNAFRYVKDNGGLDSEESYPYLGRDTET-C-NYKPECSAANDTGFVDLP 230

Query:   413 YGEEEEMKKWVATRGPLSVGMNAN--GLFYYSGGV 445
               E+  MK  VAT GP+SV ++A      +Y  G+
Sbjct:   231 QREKALMKA-VATLGPISVAIDAGHQSFQFYKSGI 264

 Score = 265 (98.3 bits), Expect = 3.1e-33, Sum P(2) = 3.1e-33
 Identities = 62/159 (38%), Positives = 88/159 (55%)

Query:   497 KLSRLATEKLVDCDMSNG--GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXX 554
             KL  L+ + LVDC  + G  GCNGG MD+A +Y+ DNGG+ S+++YPY   ++E  C   
Sbjct:   157 KLVSLSEQNLVDCSRAQGNEGCNGGLMDNAFRYVKDNGGLDSEESYPYLGRDTET-C-NY 214

Query:   555 XXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGL---FYYSGGVIDLNQRLC 611
                        +  +P  E+  MK  VAT GP+SV ++A      FY SG   D +   C
Sbjct:   215 KPECSAANDTGFVDLPQREKALMKA-VATLGPISVAIDAGHQSFQFYKSGIYFDPD---C 270

Query:   612 NPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWG 650
             + K  +H +++VGYG E         +WIVKNSWG +WG
Sbjct:   271 SSKDLDHGVLVVGYGFEGTDSNNK--FWIVKNSWGPEWG 307

 Score = 130 (50.8 bits), Expect = 3.1e-33, Sum P(2) = 3.1e-33
 Identities = 38/121 (31%), Positives = 59/121 (48%)

Query:    87 AVFEVN-KFFDLSDSDLQQ-----LTGLNL--DSTLEDIQPSLQAPFSSNQTDTEMRAFQ 138
             AV+E N K  +L + +  Q        +N   D T E+ +  +   F  NQ   + + FQ
Sbjct:    50 AVWEKNMKMIELHNREYSQGKHGFTMAMNAFGDMTNEEFRQVMNG-FQ-NQKHKKGKMFQ 107

Query:   139 FNSLRHGDDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
                     ++P++ DWR +G ++ VK QG+C  CWAFSA G +E     +   L  LS Q
Sbjct:   108 EPLFA---EIPKSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQ 164

Query:   199 H 199
             +
Sbjct:   165 N 165

 Score = 130 (50.8 bits), Expect = 2.3e-14, Sum P(2) = 2.3e-14
 Identities = 39/133 (29%), Positives = 57/133 (42%)

Query:   355 NGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESER-----GCLXXXXXXXXXXXXXYS 409
             N GCNGG MD+A +Y+ DNGG+ S+++YPY   ++E       C                
Sbjct:   175 NEGCNGGLMDNAFRYVKDNGGLDSEESYPYLGRDTETCNYKPECSAANDTGFVDLPQREK 234

Query:   410 RIPYGEEEEMKKWVAT-RGPLSVGMNANGLFY--------YSGGVIDLNQRLYGTSI--P 458
              +           VA   G  S     +G+++           GV+ +     GT     
Sbjct:   235 ALMKAVATLGPISVAIDAGHQSFQFYKSGIYFDPDCSSKDLDHGVLVVGYGFEGTDSNNK 294

Query:   459 YWIVKNSWGSDWG 471
             +WIVKNSWG +WG
Sbjct:   295 FWIVKNSWGPEWG 307


>ZFIN|ZDB-GENE-030131-106 [details] [associations]
            symbol:ctsl1a "cathepsin L, 1 a" species:7955
            "Danio rerio" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            ZFIN:ZDB-GENE-030131-106 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 HOVERGEN:HBG011513 HSSP:P43235
            KO:K01365 EMBL:BC066490 IPI:IPI00495935 RefSeq:NP_997749.1
            UniGene:Dr.104499 ProteinModelPortal:Q6NYR5 SMR:Q6NYR5
            MEROPS:C01.074 PRIDE:Q6NYR5 GeneID:321453 KEGG:dre:321453
            CTD:321453 InParanoid:Q6NYR5 NextBio:20807387 ArrayExpress:Q6NYR5
            Bgee:Q6NYR5 Uniprot:Q6NYR5
        Length = 337

 Score = 285 (105.4 bits), Expect = 3.4e-24, P = 3.4e-24
 Identities = 64/188 (34%), Positives = 97/188 (51%)

Query:   295 DLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMS 354
             ++P   DWR +G ++ VK+QG+C  CWAFS  G +E     +   L  LS Q LVDC   
Sbjct:   115 EVPNKLDWREKGYVTPVKDQGECGSCWAFSTTGALEGQMFRKTGKLVSLSEQNLVDCSRP 174

Query:   355 NG--GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIP 412
              G  GCNGG MD A QY+ D  G+ S+++YPY  ++ ++ C              +  IP
Sbjct:   175 EGNEGCNGGLMDQAFQYVKDQNGLDSEESYPYLGTD-DQPC-HFDPKNSAANDTGFVDIP 232

Query:   413 YGEEEEMKKWVATRGPLSVGMNANG---LFYYSGGVIDLNQRLYGTSIPYWIVKNSWGSD 469
              G+E  + K +A  GP+SV ++A      FY SG  I   +      + + ++   +G +
Sbjct:   233 SGKERALMKAIAAVGPVSVAIDAGHESFQFYQSG--IYYEKECSSEELDHGVLAVGYGFE 290

Query:   470 WGEKVEDK 477
              GE V+ K
Sbjct:   291 -GEDVDGK 297

 Score = 269 (99.8 bits), Expect = 4.8e-33, Sum P(2) = 4.8e-33
 Identities = 59/161 (36%), Positives = 88/161 (54%)

Query:   497 KLSRLATEKLVDCDMSNG--GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXX 554
             KL  L+ + LVDC    G  GCNGG MD A QY+ D  G+ S+++YPY  ++ ++ C   
Sbjct:   159 KLVSLSEQNLVDCSRPEGNEGCNGGLMDQAFQYVKDQNGLDSEESYPYLGTD-DQPC-HF 216

Query:   555 XXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNAN--GLFYYSGGVIDLNQRLCN 612
                        +  IP G+E  + K +A  GP+SV ++A      +Y  G+    ++ C+
Sbjct:   217 DPKNSAANDTGFVDIPSGKERALMKAIAAVGPVSVAIDAGHESFQFYQSGIY--YEKECS 274

Query:   613 PKAQNHALIIVGYG-EEEKKDGTSIPYWIVKNSWGSDWGEK 652
              +  +H ++ VGYG E E  DG    YWIVKNSW  +WG+K
Sbjct:   275 SEELDHGVLAVGYGFEGEDVDGKK--YWIVKNSWSENWGDK 313

 Score = 124 (48.7 bits), Expect = 4.8e-33, Sum P(2) = 4.8e-33
 Identities = 21/53 (39%), Positives = 31/53 (58%)

Query:   147 DLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQH 199
             ++P   DWR +G ++ VK+QG+C  CWAFS  G +E     +   L  LS Q+
Sbjct:   115 EVPNKLDWREKGYVTPVKDQGECGSCWAFSTTGALEGQMFRKTGKLVSLSEQN 167

 Score = 77 (32.2 bits), Expect = 3.4e-08, Sum P(2) = 3.4e-08
 Identities = 11/15 (73%), Positives = 13/15 (86%)

Query:   459 YWIVKNSWGSDWGEK 473
             YWIVKNSW  +WG+K
Sbjct:   299 YWIVKNSWSENWGDK 313


>UNIPROTKB|F6X9C1 [details] [associations]
            symbol:CTSH "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00660000095458
            OMA:STSCHKT Ensembl:ENSCAFT00000036196 EMBL:AAEX03002388
            Uniprot:F6X9C1
        Length = 305

 Score = 282 (104.3 bits), Expect = 5.7e-28, Sum P(2) = 5.7e-28
 Identities = 71/204 (34%), Positives = 94/204 (46%)

Query:   288 NSLRHGDDLPEAFDWRAEG-VISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQ 346
             N LR     P   DWR +G  +S VK QG C  CW FS  G +E+  AI+   L  L+ Q
Sbjct:    78 NYLRGTGPYPPFVDWRKKGKFVSPVKNQGSCGSCWTFSTTGALESAIAIKSGKLLSLAEQ 137

Query:   347 QLVDC--DMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXX 404
             QLVDC  + +N GC GG    A +YI  N G++ + +YPYK  + +  C           
Sbjct:   138 QLVDCAQNFNNHGCQGGAPLQAFEYIRYNKGIMGEDSYPYKGQDGD--C-KYQPSKAIAF 194

Query:   405 XXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLFY-YSGGVID----------LNQRL- 452
                 + I   +E+ M + VA   P+S        F  Y  G+            +N  + 
Sbjct:   195 VKDVANITINDEQAMVEAVALYNPVSFAFEVTSDFMMYRKGIYSSTSCHKTPDKVNHAVL 254

Query:   453 ---YG--TSIPYWIVKNSWGSDWG 471
                YG    IPYWIVKNSWG  WG
Sbjct:   255 AVGYGEQNGIPYWIVKNSWGPQWG 278

 Score = 231 (86.4 bits), Expect = 5.5e-33, Sum P(3) = 5.5e-33
 Identities = 55/157 (35%), Positives = 75/157 (47%)

Query:   497 KLSRLATEKLVDC--DMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXX 554
             KL  LA ++LVDC  + +N GC GG    A +YI  N G++ + +YPYK  + +  C   
Sbjct:   130 KLLSLAEQQLVDCAQNFNNHGCQGGAPLQAFEYIRYNKGIMGEDSYPYKGQDGD--C-KY 186

Query:   555 XXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLFY-YSGGVIDLNQRLCNP 613
                         + I   +E+ M + VA   P+S        F  Y  G+         P
Sbjct:   187 QPSKAIAFVKDVANITINDEQAMVEAVALYNPVSFAFEVTSDFMMYRKGIYSSTSCHKTP 246

Query:   614 KAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWG 650
                NHA++ VGYGE+       IPYWIVKNSWG  WG
Sbjct:   247 DKVNHAVLAVGYGEQN-----GIPYWIVKNSWGPQWG 278

 Score = 120 (47.3 bits), Expect = 5.5e-33, Sum P(3) = 5.5e-33
 Identities = 25/60 (41%), Positives = 31/60 (51%)

Query:   140 NSLRHGDDLPEAFDWRAEG-VISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
             N LR     P   DWR +G  +S VK QG C  CW FS  G +E+  AI+   L  L+ Q
Sbjct:    78 NYLRGTGPYPPFVDWRKKGKFVSPVKNQGSCGSCWTFSTTGALESAIAIKSGKLLSLAEQ 137

 Score = 62 (26.9 bits), Expect = 5.5e-33, Sum P(3) = 5.5e-33
 Identities = 18/61 (29%), Positives = 35/61 (57%)

Query:    45 FLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAEDYQREDSGTAVFEV--NKFFDLSDSDL 102
             F ++   H K YSS E+ L+R + FV N  K   +   ++G   F++  N+F D++ +++
Sbjct:     5 FKSWAVQHQKKYSS-EEYLQRLQTFVGNWRKINAH---NAGNHTFKMGLNQFSDMNFAEI 60

Query:   103 Q 103
             +
Sbjct:    61 K 61


>UNIPROTKB|Q10991 [details] [associations]
            symbol:CTSL "Cathepsin L1" species:9940 "Ovis aries"
            [GO:0005515 "protein binding" evidence=IPI] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 HOVERGEN:HBG011513
            MEROPS:C01.032 ProteinModelPortal:Q10991 SMR:Q10991 Uniprot:Q10991
        Length = 217

 Score = 330 (121.2 bits), Expect = 4.3e-29, P = 4.3e-29
 Identities = 79/207 (38%), Positives = 109/207 (52%)

Query:   296 LPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSN 355
             +P++ DW  +G ++ VK QG+C  CWAFSA G +E     +   L  LS Q LVD     
Sbjct:     1 VPKSVDWTKKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDSSRPQ 60

Query:   356 G--GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPY 413
             G  GCNGG MD+A QYI +NGG+ S+++YPY+A+++   C              +  IP 
Sbjct:    61 GNQGCNGGLMDNAFQYIKENGGLDSEESYPYEATDTS--C-NYKPEYSAAKDTGFVDIPQ 117

Query:   414 GEEEEMKKWVATRGPLSVGMNANGL---FYYSG--------------GVIDLNQRLYGTS 456
              E+  MK  VAT GP+SV ++A      FY SG              GV+ +     GT+
Sbjct:   118 REKALMKA-VATVGPISVAIDAGHSSFQFYKSGIYYDPDCSSKDLDHGVLVVGYGFEGTN 176

Query:   457 IPYWIVKNSWGSDWGEKVEDKVGSSGN 483
               +WIVKNSWG +WG K   K+    N
Sbjct:   177 NKFWIVKNSWGPEWGNKGYVKMAKDQN 203

 Score = 271 (100.5 bits), Expect = 7.7e-33, Sum P(2) = 7.7e-33
 Identities = 66/161 (40%), Positives = 92/161 (57%)

Query:   497 KLSRLATEKLVDCDMSNG--GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXX 554
             KL  L+ + LVD     G  GCNGG MD+A QYI +NGG+ S+++YPY+A+++   C   
Sbjct:    44 KLVSLSEQNLVDSSRPQGNQGCNGGLMDNAFQYIKENGGLDSEESYPYEATDTS--C-NY 100

Query:   555 XXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGL---FYYSGGVIDLNQRLC 611
                        +  IP  E+  MK  VAT GP+SV ++A      FY SG   D +   C
Sbjct:   101 KPEYSAAKDTGFVDIPQREKALMKA-VATVGPISVAIDAGHSSFQFYKSGIYYDPD---C 156

Query:   612 NPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
             + K  +H +++VGYG E    GT+  +WIVKNSWG +WG K
Sbjct:   157 SSKDLDHGVLVVGYGFE----GTNNKFWIVKNSWGPEWGNK 193

 Score = 120 (47.3 bits), Expect = 7.7e-33, Sum P(2) = 7.7e-33
 Identities = 21/52 (40%), Positives = 31/52 (59%)

Query:   148 LPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQH 199
             +P++ DW  +G ++ VK QG+C  CWAFSA G +E     +   L  LS Q+
Sbjct:     1 VPKSVDWTKKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQN 52


>ZFIN|ZDB-GENE-050626-55 [details] [associations]
            symbol:ctssb.2 "cathepsin S, b.2" species:7955
            "Danio rerio" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0016787 "hydrolase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 ZFIN:ZDB-GENE-050626-55
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 HOVERGEN:HBG011513
            KO:K01368 EMBL:BC093339 IPI:IPI00507098 RefSeq:NP_001017661.1
            UniGene:Dr.132688 ProteinModelPortal:Q566T8 SMR:Q566T8
            GeneID:337572 KEGG:dre:337572 CTD:337572 InParanoid:Q566T8
            NextBio:20812306 ArrayExpress:Q566T8 Uniprot:Q566T8
        Length = 330

 Score = 350 (128.3 bits), Expect = 2.8e-31, P = 2.8e-31
 Identities = 94/296 (31%), Positives = 144/296 (48%)

Query:   194 ELSVQHHDKVYSSVEDLLRRHENFVTNVEKAEDYQSEDSGTAVFGVNKFFDLSESDLQQL 253
             EL  + H K+YS  ++ + R E +  N+E    +  E S     G++ + DL+ + +  +
Sbjct:    28 ELWKKKHVKLYSCEDEEVGRRELWERNLELIAIHNLEAS----MGMHSY-DLAINHMADM 82

Query:   254 TGLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVKE 313
             T   +  TL       + P    +   E  +  F  +      P+  DWR +G ++ VK 
Sbjct:    83 TTEEILQTLA----VTRVPPGFKRPTAEYVSSSFAVV------PDTLDWRDKGYVTSVKN 132

Query:   314 QGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSNG--GCNGGRMDDALQYII 371
             QG C  CWAFS+VG +E         L +LS Q LVDC    G  GCNGG M  A QY+I
Sbjct:   133 QGACGSCWAFSSVGALEGQLMKTTGKLVDLSPQNLVDCSSKYGNLGCNGGYMSQAFQYVI 192

Query:   372 DNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSV 431
             DNGG+ S+ +YPY+ ++   G               Y  +  G+E+ +K+ +A  GP+SV
Sbjct:   193 DNGGIDSESSYPYQGTQ---GSCRYDPSQRAANCTSYKFVSQGDEQALKEALANIGPVSV 249

Query:   432 GMNANG--LFYYSGGVID---LNQRL--------YGT--SIPYWIVKNSWGSDWGE 472
              ++A      +Y  GV D     Q++        YGT     YW+VKNSWG+ +G+
Sbjct:   250 AIDATRPQFIFYRSGVYDDPSCTQKVNHGVLAVGYGTLSGQDYWLVKNSWGAGFGD 305

 Score = 264 (98.0 bits), Expect = 8.3e-33, Sum P(2) = 8.3e-33
 Identities = 59/159 (37%), Positives = 85/159 (53%)

Query:   497 KLSRLATEKLVDCDMSNG--GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXX 554
             KL  L+ + LVDC    G  GCNGG M  A QY+IDNGG+ S+ +YPY+ ++   G    
Sbjct:   158 KLVDLSPQNLVDCSSKYGNLGCNGGYMSQAFQYVIDNGGIDSESSYPYQGTQ---GSCRY 214

Query:   555 XXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANG--LFYYSGGVIDLNQRLCN 612
                        Y  +  G+E+ +K+ +A  GP+SV ++A      +Y  GV D     C 
Sbjct:   215 DPSQRAANCTSYKFVSQGDEQALKEALANIGPVSVAIDATRPQFIFYRSGVYD--DPSCT 272

Query:   613 PKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGE 651
              K  NH ++ VGYG    +D     YW+VKNSWG+ +G+
Sbjct:   273 QKV-NHGVLAVGYGTLSGQD-----YWLVKNSWGAGFGD 305

 Score = 127 (49.8 bits), Expect = 8.3e-33, Sum P(2) = 8.3e-33
 Identities = 39/153 (25%), Positives = 69/153 (45%)

Query:    50 RDHDKVYSSVEDLLRRHENFVTNVEKAEDYQREDS-GTAVFE--VNKFFDLSDSDLQQLT 106
             + H K+YS  ++ + R E +  N+E    +  E S G   ++  +N   D++  ++ Q  
Sbjct:    32 KKHVKLYSCEDEEVGRRELWERNLELIAIHNLEASMGMHSYDLAINHMADMTTEEILQTL 91

Query:   107 GLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVKEQ 166
              +        + P  + P +   + +    F          +P+  DWR +G ++ VK Q
Sbjct:    92 AVTR------VPPGFKRPTAEYVSSS----FAV--------VPDTLDWRDKGYVTSVKNQ 133

Query:   167 GKCACCWAFSAVGVVEAMHAIQGNNLTELSVQH 199
             G C  CWAFS+VG +E         L +LS Q+
Sbjct:   134 GACGSCWAFSSVGALEGQLMKTTGKLVDLSPQN 166


>TAIR|locus:2024362 [details] [associations]
            symbol:XBCP3 "xylem bark cysteine peptidase 3"
            species:3702 "Arabidopsis thaliana" [GO:0005576 "extracellular
            region" evidence=ISM] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA;ISS]
            [GO:0005773 "vacuole" evidence=IDA] [GO:0005783 "endoplasmic
            reticulum" evidence=IDA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005783 EMBL:CP002684 GO:GO:0005773 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 MEROPS:I29.003
            HOGENOM:HOG000230773 InterPro:IPR000118 Pfam:PF00396 SMART:SM00277
            UniGene:At.10233 OMA:CEIESAV EMBL:BT026490 EMBL:AK226753
            IPI:IPI00536687 RefSeq:NP_563855.1 ProteinModelPortal:Q0WVJ5
            SMR:Q0WVJ5 PRIDE:Q0WVJ5 EnsemblPlants:AT1G09850.1 GeneID:837517
            KEGG:ath:AT1G09850 TAIR:At1g09850 InParanoid:Q0WVJ5
            PhylomeDB:Q0WVJ5 ProtClustDB:CLSN2687747 Genevestigator:Q0WVJ5
            Uniprot:Q0WVJ5
        Length = 437

 Score = 331 (121.6 bits), Expect = 9.7e-33, Sum P(2) = 9.7e-33
 Identities = 70/191 (36%), Positives = 107/191 (56%)

Query:   296 LPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMS- 354
             +P++ DWR +G ++ VK+QG C  CW+FSA G +E ++ I    L  LS Q+L+DCD S 
Sbjct:   118 VPDSVDWRKKGAVTNVKDQGSCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSY 177

Query:   355 NGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYG 414
             N GCNGG MD A +++I N G+ +++ YPY+  E +  C              Y+ +   
Sbjct:   178 NAGCNGGLMDYAFEFVIKNHGIDTEKDYPYQ--ERDGTCKKDKLKQKVVTIDSYAGVKSN 235

Query:   415 EEEEMKKWVATRGPLSVGM-NANGLFY-YSGGVID------LNQRL----YGTS--IPYW 460
             +E+ + + VA + P+SVG+  +   F  YS G+        L+  +    YG+   + YW
Sbjct:   236 DEKALMEAVAAQ-PVSVGICGSERAFQLYSSGIFSGPCSTSLDHAVLIVGYGSQNGVDYW 294

Query:   461 IVKNSWGSDWG 471
             IVKNSWG  WG
Sbjct:   295 IVKNSWGKSWG 305

 Score = 243 (90.6 bits), Expect = 4.2e-19, Sum P(2) = 4.2e-19
 Identities = 66/196 (33%), Positives = 103/196 (52%)

Query:   469 DWGEK-----VEDKVGSSG-----NRTRDLE-LTGVLPSKLSRLATEKLVDCDMS-NGGC 516
             DW +K     V+D+ GS G     + T  +E +  ++   L  L+ ++L+DCD S N GC
Sbjct:   123 DWRKKGAVTNVKDQ-GSCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGC 181

Query:   517 NGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYGEEEE 576
             NGG MD A +++I N G+ +++ YPY+  E +  C              Y+ +   +E+ 
Sbjct:   182 NGGLMDYAFEFVIKNHGIDTEKDYPYQ--ERDGTCKKDKLKQKVVTIDSYAGVKSNDEKA 239

Query:   577 MKKWVATRGPLSVGM-NANGLFY-YSGGVIDLNQRLCNPKAQNHALIIVGYGEEEKKDGT 634
             + + VA + P+SVG+  +   F  YS G+       C+    +HA++IVGYG +   D  
Sbjct:   240 LMEAVAAQ-PVSVGICGSERAFQLYSSGIFS---GPCSTSL-DHAVLIVGYGSQNGVD-- 292

Query:   635 SIPYWIVKNSWGSDWG 650
                YWIVKNSWG  WG
Sbjct:   293 ---YWIVKNSWGKSWG 305

 Score = 138 (53.6 bits), Expect = 5.7e-07, Sum P(2) = 5.7e-07
 Identities = 25/62 (40%), Positives = 39/62 (62%)

Query:   148 LPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQHH---DKVY 204
             +P++ DWR +G ++ VK+QG C  CW+FSA G +E ++ I   +L  LS Q     DK Y
Sbjct:   118 VPDSVDWRKKGAVTNVKDQGSCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSY 177

Query:   205 SS 206
             ++
Sbjct:   178 NA 179

 Score = 56 (24.8 bits), Expect = 9.7e-33, Sum P(2) = 9.7e-33
 Identities = 16/64 (25%), Positives = 29/64 (45%)

Query:   198 QHHDKVYSSVEDLLRRHENFVTNVEKAEDYQSEDSGTAVFGVNKFFDLSESDLQ-QLTGL 256
             Q H K Y S E+  +R + F  N +    +    + T    +N F DL+  + +    GL
Sbjct:    37 QKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTHHEFKASRLGL 96

Query:   257 NLDS 260
             ++ +
Sbjct:    97 SVSA 100

 Score = 54 (24.1 bits), Expect = 1.6e-32, Sum P(2) = 1.6e-32
 Identities = 16/69 (23%), Positives = 32/69 (46%)

Query:    45 FLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAEDYQREDSGTAVFEVNKFFDLSDSDLQ- 103
             F ++ + H K Y S E+  +R + F  N +    +    + T    +N F DL+  + + 
Sbjct:    32 FDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTHHEFKA 91

Query:   104 QLTGLNLDS 112
                GL++ +
Sbjct:    92 SRLGLSVSA 100


>TAIR|locus:2038515 [details] [associations]
            symbol:AT1G06260 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] [GO:0048046 "apoplast"
            evidence=IDA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 EMBL:CP002684
            GenomeReviews:CT485782_GR GO:GO:0048046 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 EMBL:AC025290
            MEROPS:I29.003 HSSP:O65039 HOGENOM:HOG000230773 OMA:METAFEF
            IPI:IPI00525965 PIR:D86198 RefSeq:NP_563764.1 UniGene:At.24617
            ProteinModelPortal:Q9LNC1 SMR:Q9LNC1 PaxDb:Q9LNC1 PRIDE:Q9LNC1
            EnsemblPlants:AT1G06260.1 GeneID:837137 KEGG:ath:AT1G06260
            TAIR:At1g06260 InParanoid:Q9LNC1 PhylomeDB:Q9LNC1
            ProtClustDB:CLSN2916975 Genevestigator:Q9LNC1 Uniprot:Q9LNC1
        Length = 343

 Score = 347 (127.2 bits), Expect = 5.9e-31, P = 5.9e-31
 Identities = 92/294 (31%), Positives = 142/294 (48%)

Query:   203 VYSSVEDLLRRHENFVTNVEKAEDYQSEDSGTAVFGVNKFFDLSESDLQQLTGLNL---- 258
             VY   + L +R E ++    K   Y   D     FG+   +  +   +  +  L+L    
Sbjct:    32 VYDPHKTLKQRFEKWLKTHSKL--YGGRDEWMLRFGI---YQSNVQLIDYINSLHLPFKL 86

Query:   259 -DSTLEDIQPS-LQAPFSS-NQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVKEQG 315
              D+   D+  S  +A F   N +   +   Q        ++P+A DWR +G ++ ++ QG
Sbjct:    87 TDNRFADMTNSEFKAHFLGLNTSSLRLHKKQRPVCDPAGNVPDAVDWRTQGAVTPIRNQG 146

Query:   316 KCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMS--NGGCNGGRMDDALQYIIDN 373
             KC  CWAFSAV  +E ++ I+  +L  LS QQL+DCD+   N GC+GG M+ A ++I  N
Sbjct:   147 KCGGCWAFSAVAAIEGINKIKTGNLVSLSEQQLIDCDVGTYNKGCSGGLMETAFEFIKTN 206

Query:   374 GGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGM 433
             GG+ ++  YPY   E    C              Y ++   + E   +  A + P+SVG+
Sbjct:   207 GGLATETDYPYTGIEGT--CDQEKSKNKVVTIQGYQKV--AQNEASLQIAAAQQPVSVGI 262

Query:   434 NANGLFY--YSGGVI------DLNQRL----YGTS--IPYWIVKNSWGSDWGEK 473
             +A G  +  YS GV       +LN  +    YG      YWIVKNSWG+ WGE+
Sbjct:   263 DAGGFIFQLYSSGVFTNYCGTNLNHGVTVVGYGVEGDQKYWIVKNSWGTGWGEE 316

 Score = 234 (87.4 bits), Expect = 9.7e-33, Sum P(3) = 9.7e-33
 Identities = 55/159 (34%), Positives = 81/159 (50%)

Query:   498 LSRLATEKLVDCDMS--NGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXX 555
             L  L+ ++L+DCD+   N GC+GG M+ A ++I  NGG+ ++  YPY   E    C    
Sbjct:   171 LVSLSEQQLIDCDVGTYNKGCSGGLMETAFEFIKTNGGLATETDYPYTGIEGT--CDQEK 228

Query:   556 XXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLFY--YSGGVIDLNQRLCNP 613
                       Y ++   + E   +  A + P+SVG++A G  +  YS GV       C  
Sbjct:   229 SKNKVVTIQGYQKV--AQNEASLQIAAAQQPVSVGIDAGGFIFQLYSSGVFT---NYCGT 283

Query:   614 KAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
                NH + +VGYG E  +      YWIVKNSWG+ WGE+
Sbjct:   284 NL-NHGVTVVGYGVEGDQK-----YWIVKNSWGTGWGEE 316

 Score = 149 (57.5 bits), Expect = 9.7e-33, Sum P(3) = 9.7e-33
 Identities = 24/52 (46%), Positives = 36/52 (69%)

Query:   147 DLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
             ++P+A DWR +G ++ ++ QGKC  CWAFSAV  +E ++ I+  NL  LS Q
Sbjct:   126 NVPDAVDWRTQGAVTPIRNQGKCGGCWAFSAVAAIEGINKIKTGNLVSLSEQ 177

 Score = 67 (28.6 bits), Expect = 9.7e-33, Sum P(3) = 9.7e-33
 Identities = 21/91 (23%), Positives = 43/91 (47%)

Query:    23 KVALLESNIFQTRGYLNSPVTRFLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAEDYQRE 82
             K+  ++S+++     L     RF  +++ H K+Y   ++ + R   + +NV+   DY   
Sbjct:    24 KLCSVDSSVYDPHKTLKQ---RFEKWLKTHSKLYGGRDEWMLRFGIYQSNVQLI-DYINS 79

Query:    83 DSGTAVFEVNKFFDLSDSDLQ-QLTGLNLDS 112
                      N+F D+++S+ +    GLN  S
Sbjct:    80 LHLPFKLTDNRFADMTNSEFKAHFLGLNTSS 110


>UNIPROTKB|G1K2A7 [details] [associations]
            symbol:CTSK "Cathepsin K" species:9615 "Canis lupus
            familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 InterPro:IPR015644
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00560000076577 PANTHER:PTHR12411:SF55 OMA:LKVPPSH
            EMBL:AAEX03011051 Ensembl:ENSCAFT00000019202 Uniprot:G1K2A7
        Length = 333

 Score = 326 (119.8 bits), Expect = 1.2e-28, P = 1.2e-28
 Identities = 91/288 (31%), Positives = 142/288 (49%)

Query:   202 KVYSSVEDLLRRHENFVTNVEKAEDYQSEDSGTAVFGVNKFFDLSESDLQQLTGLNLDST 261
             K Y+S  D L R   +  N++    +  E S     GV+ + +L+ + L  +T    +  
Sbjct:    39 KQYNSKVDELSRRLIWEKNLKHISIHNLEAS----LGVHTY-ELAMNHLGDMTS---EEV 90

Query:   262 LEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVKEQGKCACCW 321
             ++ +   L+ P S ++++  +    + S       P++ D+R +G ++ VK QG+C  CW
Sbjct:    91 VQKMT-GLKVPPSHSRSNDTLYIPDWESRA-----PDSVDYRKKGYVTPVKNQGQCGSCW 144

Query:   322 AFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQA 381
             AFS+VG +E     +   L  LS Q LVDC   N GC GG M +A QY+  N G+ S+ A
Sbjct:   145 AFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTNAFQYVQKNRGIDSEDA 204

Query:   382 YPYKASESERGCLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNAN--GLF 439
             YPY   +    C+             Y  IP G E+ +K+ VA  GP+SV ++A+     
Sbjct:   205 YPYVGQDES--CMYNPTGKAAKCRG-YREIPEGNEKALKRAVARVGPISVAIDASLTSFQ 261

Query:   440 YYSGGVI--------DLNQRL----YGTSI--PYWIVKNSWGSDWGEK 473
             +YS GV         +LN  +    YG      +WI+KNSWG +WG K
Sbjct:   262 FYSKGVYYDENCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNK 309

 Score = 272 (100.8 bits), Expect = 1.2e-32, Sum P(2) = 1.2e-32
 Identities = 63/159 (39%), Positives = 86/159 (54%)

Query:   497 KLSRLATEKLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXX 556
             KL  L+ + LVDC   N GC GG M +A QY+  N G+ S+ AYPY   +    C+    
Sbjct:   162 KLLNLSPQNLVDCVSENDGCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQDES--CMYNPT 219

Query:   557 XXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNAN--GLFYYSGGVI-DLNQRLCNP 613
                      Y  IP G E+ +K+ VA  GP+SV ++A+     +YS GV  D N   CN 
Sbjct:   220 GKAAKCRG-YREIPEGNEKALKRAVARVGPISVAIDASLTSFQFYSKGVYYDEN---CNS 275

Query:   614 KAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
                NHA++ VGYG ++   G    +WI+KNSWG +WG K
Sbjct:   276 DNLNHAVLAVGYGIQK---GNK--HWIIKNSWGENWGNK 309

 Score = 117 (46.2 bits), Expect = 1.2e-32, Sum P(2) = 1.2e-32
 Identities = 21/51 (41%), Positives = 32/51 (62%)

Query:   149 PEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQH 199
             P++ D+R +G ++ VK QG+C  CWAFS+VG +E     +   L  LS Q+
Sbjct:   120 PDSVDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQN 170


>UNIPROTKB|Q3ZKN1 [details] [associations]
            symbol:CTSK "Cathepsin K" species:9615 "Canis lupus
            familiaris" [GO:0005764 "lysosome" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0006508
            "proteolysis" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            InterPro:IPR015644 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 eggNOG:COG4870 HOGENOM:HOG000230774
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            HOVERGEN:HBG011513 MEROPS:I29.007 CTD:1513 KO:K01371
            OrthoDB:EOG4SJ5FC PANTHER:PTHR12411:SF55 EMBL:AY738221
            RefSeq:NP_001029168.1 UniGene:Cfa.588 HSSP:P43235
            ProteinModelPortal:Q3ZKN1 SMR:Q3ZKN1 STRING:Q3ZKN1 GeneID:608843
            KEGG:cfa:608843 InParanoid:Q3ZKN1 NextBio:20894470 Uniprot:Q3ZKN1
        Length = 330

 Score = 326 (119.8 bits), Expect = 1.2e-28, P = 1.2e-28
 Identities = 91/288 (31%), Positives = 142/288 (49%)

Query:   202 KVYSSVEDLLRRHENFVTNVEKAEDYQSEDSGTAVFGVNKFFDLSESDLQQLTGLNLDST 261
             K Y+S  D L R   +  N++    +  E S     GV+ + +L+ + L  +T    +  
Sbjct:    36 KQYNSKVDELSRRLIWEKNLKHISIHNLEAS----LGVHTY-ELAMNHLGDMTS---EEV 87

Query:   262 LEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVKEQGKCACCW 321
             ++ +   L+ P S ++++  +    + S       P++ D+R +G ++ VK QG+C  CW
Sbjct:    88 VQKMT-GLKVPPSHSRSNDTLYIPDWESRA-----PDSVDYRKKGYVTPVKNQGQCGSCW 141

Query:   322 AFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQA 381
             AFS+VG +E     +   L  LS Q LVDC   N GC GG M +A QY+  N G+ S+ A
Sbjct:   142 AFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTNAFQYVQKNRGIDSEDA 201

Query:   382 YPYKASESERGCLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNAN--GLF 439
             YPY   +    C+             Y  IP G E+ +K+ VA  GP+SV ++A+     
Sbjct:   202 YPYVGQDES--CMYNPTGKAAKCRG-YREIPEGNEKALKRAVARVGPISVAIDASLTSFQ 258

Query:   440 YYSGGVI--------DLNQRL----YGTSI--PYWIVKNSWGSDWGEK 473
             +YS GV         +LN  +    YG      +WI+KNSWG +WG K
Sbjct:   259 FYSKGVYYDENCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNK 306

 Score = 272 (100.8 bits), Expect = 1.2e-32, Sum P(2) = 1.2e-32
 Identities = 63/159 (39%), Positives = 86/159 (54%)

Query:   497 KLSRLATEKLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXX 556
             KL  L+ + LVDC   N GC GG M +A QY+  N G+ S+ AYPY   +    C+    
Sbjct:   159 KLLNLSPQNLVDCVSENDGCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQDES--CMYNPT 216

Query:   557 XXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNAN--GLFYYSGGVI-DLNQRLCNP 613
                      Y  IP G E+ +K+ VA  GP+SV ++A+     +YS GV  D N   CN 
Sbjct:   217 GKAAKCRG-YREIPEGNEKALKRAVARVGPISVAIDASLTSFQFYSKGVYYDEN---CNS 272

Query:   614 KAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
                NHA++ VGYG ++   G    +WI+KNSWG +WG K
Sbjct:   273 DNLNHAVLAVGYGIQK---GNK--HWIIKNSWGENWGNK 306

 Score = 117 (46.2 bits), Expect = 1.2e-32, Sum P(2) = 1.2e-32
 Identities = 21/51 (41%), Positives = 32/51 (62%)

Query:   149 PEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQH 199
             P++ D+R +G ++ VK QG+C  CWAFS+VG +E     +   L  LS Q+
Sbjct:   117 PDSVDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQN 167


>TAIR|locus:2122113 [details] [associations]
            symbol:XCP1 "xylem cysteine peptidase 1" species:3702
            "Arabidopsis thaliana" [GO:0005576 "extracellular region"
            evidence=ISM] [GO:0006508 "proteolysis" evidence=IEA;ISS]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA;ISS]
            [GO:0000325 "plant-type vacuole" evidence=IDA] [GO:0005634
            "nucleus" evidence=IDA] [GO:0010623 "developmental programmed cell
            death" evidence=IMP] [GO:0010413 "glucuronoxylan metabolic process"
            evidence=RCA] [GO:0045492 "xylan biosynthetic process"
            evidence=RCA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 GO:GO:0005886
            GO:GO:0005634 EMBL:CP002687 GenomeReviews:CT486007_GR
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0000325
            EMBL:AL022604 EMBL:AL161587 GO:GO:0010623 MEROPS:I29.003
            HOGENOM:HOG000230773 EMBL:AF191027 EMBL:AK117394 EMBL:BT005179
            IPI:IPI00532220 PIR:T06122 RefSeq:NP_567983.1 UniGene:At.2280
            UniGene:At.67622 ProteinModelPortal:O65493 SMR:O65493 STRING:O65493
            PaxDb:O65493 PRIDE:O65493 EnsemblPlants:AT4G35350.1 GeneID:829688
            KEGG:ath:AT4G35350 GeneFarm:5033 TAIR:At4g35350 InParanoid:O65493
            KO:K16290 OMA:FEVFREN PhylomeDB:O65493 ProtClustDB:CLSN2689772
            Genevestigator:O65493 Uniprot:O65493
        Length = 355

 Score = 362 (132.5 bits), Expect = 1.3e-32, P = 1.3e-32
 Identities = 102/294 (34%), Positives = 149/294 (50%)

Query:   205 SSVEDLLRRHENFVTNVEKAEDYQSEDSGT---AVFGVNKF-FDLSESDLQQL-TGLN-- 257
             ++ + LL   E++++   KA  Y+S +       VF  N    D   +++     GLN  
Sbjct:    42 TNTDKLLELFESWMSEHSKA--YKSVEEKVHRFEVFRENLMHIDQRNNEINSYWLGLNEF 99

Query:   258 LDSTLEDIQP---SLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVKEQ 314
              D T E+ +     L  P  S +       F++  +    DLP++ DWR +G ++ VK+Q
Sbjct:   100 ADLTHEEFKGRYLGLAKPQFSRKRQPSAN-FRYRDIT---DLPKSVDWRKKGAVAPVKDQ 155

Query:   315 GKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMS-NGGCNGGRMDDALQYIIDN 373
             G+C  CWAFS V  VE ++ I   +L+ LS Q+L+DCD + N GCNGG MD A QYII  
Sbjct:   156 GQCGSCWAFSTVAAVEGINQITTGNLSSLSEQELIDCDTTFNSGCNGGLMDYAFQYIIST 215

Query:   374 GGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGM 433
             GG+  +  YPY   E    C              Y  +P  ++E + K +A + P+SV +
Sbjct:   216 GGLHKEDDYPYLMEEGI--CQEQKEDVERVTISGYEDVPENDDESLVKALAHQ-PVSVAI 272

Query:   434 NANGL-F-YYSGGVI------DLNQRL----YGTS--IPYWIVKNSWGSDWGEK 473
              A+G  F +Y GGV       DL+  +    YG+S    Y IVKNSWG  WGEK
Sbjct:   273 EASGRDFQFYKGGVFNGKCGTDLDHGVAAVGYGSSKGSDYVIVKNSWGPRWGEK 326

 Score = 247 (92.0 bits), Expect = 1.2e-31, Sum P(2) = 1.2e-31
 Identities = 62/163 (38%), Positives = 84/163 (51%)

Query:   493 VLPSKLSRLATEKLVDCDMS-NGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGC 551
             +    LS L+ ++L+DCD + N GCNGG MD A QYII  GG+  +  YPY   E    C
Sbjct:   176 ITTGNLSSLSEQELIDCDTTFNSGCNGGLMDYAFQYIISTGGLHKEDDYPYLMEEGI--C 233

Query:   552 LXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGL-F-YYSGGVIDLNQR 609
                           Y  +P  ++E + K +A + P+SV + A+G  F +Y GGV   N +
Sbjct:   234 QEQKEDVERVTISGYEDVPENDDESLVKALAHQ-PVSVAIEASGRDFQFYKGGVF--NGK 290

Query:   610 LCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
              C     +H +  VGYG  +  D     Y IVKNSWG  WGEK
Sbjct:   291 -CGTDL-DHGVAAVGYGSSKGSD-----YVIVKNSWGPRWGEK 326

 Score = 158 (60.7 bits), Expect = 1.2e-31, Sum P(2) = 1.2e-31
 Identities = 45/150 (30%), Positives = 75/150 (50%)

Query:    57 SSVEDLLRRHENFVTNVEKA-EDYQREDSGTAVFEVNKF-FDLSDSDLQQL-TGLN--LD 111
             ++ + LL   E++++   KA +  + +     VF  N    D  ++++     GLN   D
Sbjct:    42 TNTDKLLELFESWMSEHSKAYKSVEEKVHRFEVFRENLMHIDQRNNEINSYWLGLNEFAD 101

Query:   112 STLEDIQP---SLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVKEQGK 168
              T E+ +     L  P  S +       F++  +    DLP++ DWR +G ++ VK+QG+
Sbjct:   102 LTHEEFKGRYLGLAKPQFSRKRQPSAN-FRYRDIT---DLPKSVDWRKKGAVAPVKDQGQ 157

Query:   169 CACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
             C  CWAFS V  VE ++ I   NL+ LS Q
Sbjct:   158 CGSCWAFSTVAAVEGINQITTGNLSSLSEQ 187

 Score = 90 (36.7 bits), Expect = 1.6e-24, Sum P(2) = 1.6e-24
 Identities = 31/104 (29%), Positives = 46/104 (44%)

Query:   169 CACCWAFSAVGVVEAMHAIQGNNLTELS---VQHHDKVYSSVEDLLRRHENFVTNVEKAE 225
             CA    FS VG     H    + L EL    +  H K Y SVE+ + R E F  N+    
Sbjct:    25 CAFARDFSIVGYTPE-HLTNTDKLLELFESWMSEHSKAYKSVEEKVHRFEVFRENLMHI- 82

Query:   226 DYQSEDSGTAVFGVNKFFDLSESDLQ-QLTGLNLDSTLEDIQPS 268
             D ++ +  +   G+N+F DL+  + + +  GL         QPS
Sbjct:    83 DQRNNEINSYWLGLNEFADLTHEEFKGRYLGLAKPQFSRKRQPS 126

 Score = 75 (31.5 bits), Expect = 6.0e-23, Sum P(2) = 6.0e-23
 Identities = 22/77 (28%), Positives = 37/77 (48%)

Query:    45 FLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAEDYQREDSGTAVFEVNKFFDLSDSDLQ- 103
             F ++M +H K Y SVE+ + R E F  N+    D +  +  +    +N+F DL+  + + 
Sbjct:    51 FESWMSEHSKAYKSVEEKVHRFEVFRENLMHI-DQRNNEINSYWLGLNEFADLTHEEFKG 109

Query:   104 QLTGLNLDSTLEDIQPS 120
             +  GL         QPS
Sbjct:   110 RYLGLAKPQFSRKRQPS 126


>UNIPROTKB|F7BJD8 [details] [associations]
            symbol:CTSH "Uncharacterized protein" species:9796 "Equus
            caballus" [GO:0001656 "metanephros development" evidence=ISS]
            [GO:0001913 "T cell mediated cytotoxicity" evidence=ISS]
            [GO:0002764 "immune response-regulating signaling pathway"
            evidence=ISS] [GO:0004175 "endopeptidase activity" evidence=ISS]
            [GO:0004177 "aminopeptidase activity" evidence=ISS] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=ISS] [GO:0004252
            "serine-type endopeptidase activity" evidence=ISS] [GO:0005615
            "extracellular space" evidence=ISS] [GO:0005764 "lysosome"
            evidence=ISS] [GO:0005829 "cytosol" evidence=ISS] [GO:0006508
            "proteolysis" evidence=ISS] [GO:0008234 "cysteine-type peptidase
            activity" evidence=ISS] [GO:0008284 "positive regulation of cell
            proliferation" evidence=ISS] [GO:0010628 "positive regulation of
            gene expression" evidence=ISS] [GO:0010634 "positive regulation of
            epithelial cell migration" evidence=ISS] [GO:0010813 "neuropeptide
            catabolic process" evidence=ISS] [GO:0010815 "bradykinin catabolic
            process" evidence=ISS] [GO:0010952 "positive regulation of
            peptidase activity" evidence=ISS] [GO:0016505 "apoptotic protease
            activator activity" evidence=ISS] [GO:0030108 "HLA-A specific
            activating MHC class I receptor activity" evidence=ISS] [GO:0030335
            "positive regulation of cell migration" evidence=ISS] [GO:0031638
            "zymogen activation" evidence=ISS] [GO:0031648 "protein
            destabilization" evidence=ISS] [GO:0032526 "response to retinoic
            acid" evidence=ISS] [GO:0033619 "membrane protein proteolysis"
            evidence=ISS] [GO:0043066 "negative regulation of apoptotic
            process" evidence=ISS] [GO:0043129 "surfactant homeostasis"
            evidence=ISS] [GO:0045766 "positive regulation of angiogenesis"
            evidence=ISS] [GO:0060448 "dichotomous subdivision of terminal
            units involved in lung branching" evidence=ISS] [GO:0070324
            "thyroid hormone binding" evidence=ISS] [GO:0070371 "ERK1 and ERK2
            cascade" evidence=ISS] [GO:0097208 "alveolar lamellar body"
            evidence=ISS] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 GO:GO:0005829
            GO:GO:0043066 GO:GO:0005615 GO:GO:0008284 GO:GO:0070371
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766 GO:GO:0004177
            GO:GO:0004252 GO:GO:0005764 GO:GO:0097208 GO:GO:0032526
            GO:GO:0010628 GO:GO:0070324 GO:GO:0016505 GO:GO:0001656
            GO:GO:0010634 GO:GO:0004197 GO:GO:0031648 GO:GO:0097067
            GO:GO:0031638 GO:GO:0001913 GeneTree:ENSGT00660000095458
            OMA:STSCHKT GO:GO:0030108 GO:GO:0010815 GO:GO:0060448 GO:GO:0002764
            GO:GO:0033619 GO:GO:0010813 GO:GO:0043129
            Ensembl:ENSECAT00000013967 Uniprot:F7BJD8
        Length = 305

 Score = 280 (103.6 bits), Expect = 4.0e-27, Sum P(2) = 4.0e-27
 Identities = 71/204 (34%), Positives = 93/204 (45%)

Query:   288 NSLRHGDDLPEAFDWRAEG-VISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQ 346
             N LR     P + DWR +G  +S VK QG C  CW FS  G +E+  AI    L  L+ Q
Sbjct:    78 NYLRGAGPYPPSVDWRKKGNFVSPVKNQGGCGSCWTFSTTGALESAVAIASGKLLSLAEQ 137

Query:   347 QLVDC--DMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXX 404
             QLVDC  + +N GC GG    A +YI  N G++ +  YPYK  + +  C           
Sbjct:   138 QLVDCAQNFNNHGCQGGLPSQAFEYIRYNKGIMGEDTYPYKGQDGD--C-KFQPNKAIAF 194

Query:   405 XXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLFY-YSGGVID----------LNQRL- 452
                 + I   +E+ M + VA   P+S        F  Y  G+            +N  + 
Sbjct:   195 VKDVANITLNDEKAMVEAVALYNPVSFAFEVTEDFMMYRKGIYSSTSCHKTPDKVNHAVL 254

Query:   453 ---YG--TSIPYWIVKNSWGSDWG 471
                YG    IPYWIVKNSWG  WG
Sbjct:   255 AVGYGEENGIPYWIVKNSWGPHWG 278

 Score = 232 (86.7 bits), Expect = 1.4e-32, Sum P(3) = 1.4e-32
 Identities = 56/161 (34%), Positives = 75/161 (46%)

Query:   493 VLPSKLSRLATEKLVDC--DMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERG 550
             +   KL  LA ++LVDC  + +N GC GG    A +YI  N G++ +  YPYK  + +  
Sbjct:   126 IASGKLLSLAEQQLVDCAQNFNNHGCQGGLPSQAFEYIRYNKGIMGEDTYPYKGQDGD-- 183

Query:   551 CLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLFY-YSGGVIDLNQR 609
             C               + I   +E+ M + VA   P+S        F  Y  G+      
Sbjct:   184 C-KFQPNKAIAFVKDVANITLNDEKAMVEAVALYNPVSFAFEVTEDFMMYRKGIYSSTSC 242

Query:   610 LCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWG 650
                P   NHA++ VGYGEE       IPYWIVKNSWG  WG
Sbjct:   243 HKTPDKVNHAVLAVGYGEEN-----GIPYWIVKNSWGPHWG 278

 Score = 121 (47.7 bits), Expect = 1.4e-32, Sum P(3) = 1.4e-32
 Identities = 25/60 (41%), Positives = 31/60 (51%)

Query:   140 NSLRHGDDLPEAFDWRAEG-VISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
             N LR     P + DWR +G  +S VK QG C  CW FS  G +E+  AI    L  L+ Q
Sbjct:    78 NYLRGAGPYPPSVDWRKKGNFVSPVKNQGGCGSCWTFSTTGALESAVAIASGKLLSLAEQ 137

 Score = 56 (24.8 bits), Expect = 1.4e-32, Sum P(3) = 1.4e-32
 Identities = 18/61 (29%), Positives = 33/61 (54%)

Query:    45 FLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAEDYQREDSGTAVFEV--NKFFDLSDSDL 102
             F ++M  H K YSS E+   R + FV+N  K   +   ++G   F +  N+F  ++ ++L
Sbjct:     5 FKSWMVQHQKKYSS-EEYHHRLQTFVSNWRKINAH---NTGNHTFRMGLNQFSAMNFAEL 60

Query:   103 Q 103
             +
Sbjct:    61 K 61


>TAIR|locus:2175088 [details] [associations]
            symbol:ALP "aleurain-like protease" species:3702
            "Arabidopsis thaliana" [GO:0005576 "extracellular region"
            evidence=ISM] [GO:0006508 "proteolysis" evidence=IEA;ISS]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA;ISS]
            [GO:0009723 "response to ethylene stimulus" evidence=IEP]
            [GO:0005773 "vacuole" evidence=IDA] [GO:0005829 "cytosol"
            evidence=RCA] [GO:0006096 "glycolysis" evidence=RCA] [GO:0006816
            "calcium ion transport" evidence=RCA] [GO:0006833 "water transport"
            evidence=RCA] [GO:0006972 "hyperosmotic response" evidence=RCA]
            [GO:0007030 "Golgi organization" evidence=RCA] [GO:0009266
            "response to temperature stimulus" evidence=RCA] [GO:0009651
            "response to salt stress" evidence=RCA] [GO:0009750 "response to
            fructose stimulus" evidence=RCA] [GO:0042744 "hydrogen peroxide
            catabolic process" evidence=RCA] [GO:0046686 "response to cadmium
            ion" evidence=RCA] [GO:0007568 "aging" evidence=IEP]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 EMBL:CP002688 GO:GO:0005773
            GO:GO:0007568 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 EMBL:AB011483 KO:K01366
            ProtClustDB:CLSN2689015 UniGene:At.25414 IPI:IPI00846287
            RefSeq:NP_001078774.1 ProteinModelPortal:A8MQZ1 SMR:A8MQZ1
            STRING:A8MQZ1 PRIDE:A8MQZ1 EnsemblPlants:AT5G60360.3 GeneID:836158
            KEGG:ath:AT5G60360 OMA:CGSTPMD Genevestigator:A8MQZ1 Uniprot:A8MQZ1
        Length = 361

 Score = 324 (119.1 bits), Expect = 2.2e-32, Sum P(2) = 2.2e-32
 Identities = 75/198 (37%), Positives = 102/198 (51%)

Query:   296 LPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAM-HAIQGNSLTELSVQQLVDCD-- 352
             LPE  DWR +G++S VK+QG C  CW FS  G +EA  H   G  ++ LS QQLVDC   
Sbjct:   141 LPETKDWREDGIVSPVKDQGGCGSCWTFSTTGALEAAYHQAFGKGIS-LSEQQLVDCAGA 199

Query:   353 MSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIP 412
              +N GCNGG    A +YI  NGG+ +++AYPY   +    C               + I 
Sbjct:   200 FNNYGCNGGLPSQAFEYIKSNGGLDTEKAYPYTGKDET--CKFSAENVGVQVLNSVN-IT 256

Query:   413 YGEEEEMKKWVATRGPLSVGMNANGLF-YYSGGV----------IDLNQRL----YGTS- 456
              G E+E+K  V    P+S+       F  Y  GV          +D+N  +    YG   
Sbjct:   257 LGAEDELKHAVGLVRPVSIAFEVIHSFRLYKSGVYTDSHCGSTPMDVNHAVLAVGYGVED 316

Query:   457 -IPYWIVKNSWGSDWGEK 473
              +PYW++KNSWG+DWG+K
Sbjct:   317 GVPYWLIKNSWGADWGDK 334

 Score = 246 (91.7 bits), Expect = 3.8e-31, Sum P(3) = 3.8e-31
 Identities = 57/155 (36%), Positives = 80/155 (51%)

Query:   501 LATEKLVDCD--MSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXX 558
             L+ ++LVDC    +N GCNGG    A +YI  NGG+ +++AYPY   +    C       
Sbjct:   188 LSEQQLVDCAGAFNNYGCNGGLPSQAFEYIKSNGGLDTEKAYPYTGKDET--CKFSAENV 245

Query:   559 XXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLF-YYSGGVIDLNQRLCNPKAQN 617
                     + I  G E+E+K  V    P+S+       F  Y  GV   +     P   N
Sbjct:   246 GVQVLNSVN-ITLGAEDELKHAVGLVRPVSIAFEVIHSFRLYKSGVYTDSHCGSTPMDVN 304

Query:   618 HALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
             HA++ VGYG E   DG  +PYW++KNSWG+DWG+K
Sbjct:   305 HAVLAVGYGVE---DG--VPYWLIKNSWGADWGDK 334

 Score = 129 (50.5 bits), Expect = 3.8e-31, Sum P(3) = 3.8e-31
 Identities = 25/52 (48%), Positives = 32/52 (61%)

Query:   148 LPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAM-HAIQGNNLTELSVQ 198
             LPE  DWR +G++S VK+QG C  CW FS  G +EA  H   G  ++ LS Q
Sbjct:   141 LPETKDWREDGIVSPVKDQGGCGSCWTFSTTGALEAAYHQAFGKGIS-LSEQ 191

 Score = 60 (26.2 bits), Expect = 2.2e-32, Sum P(2) = 2.2e-32
 Identities = 19/78 (24%), Positives = 36/78 (46%)

Query:    27 LESNIFQTRGYLNSPVTRFLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAEDYQREDSGT 86
             +E ++ Q  G  +  V  F  F   + K Y +VE++  R   F  N++      ++    
Sbjct:    42 VEESVSQILGQ-SRHVLSFARFTHRYGKKYQNVEEMKLRFSIFKENLDLIRSTNKKGLSY 100

Query:    87 AVFEVNKFFDLSDSDLQQ 104
              +  VN+F DL+  + Q+
Sbjct:   101 KL-GVNQFADLTWQEFQR 117

 Score = 56 (24.8 bits), Expect = 1.3e-20, Sum P(2) = 1.3e-20
 Identities = 20/77 (25%), Positives = 35/77 (45%)

Query:   180 VVEAMHAIQGNNLTELS----VQHHDKVYSSVEDLLRRHENFVTNVEKAEDYQSEDSGTA 235
             V E++  I G +   LS       + K Y +VE++  R   F  N++       +     
Sbjct:    42 VEESVSQILGQSRHVLSFARFTHRYGKKYQNVEEMKLRFSIFKENLDLIRSTNKKGLSYK 101

Query:   236 VFGVNKFFDLSESDLQQ 252
             + GVN+F DL+  + Q+
Sbjct:   102 L-GVNQFADLTWQEFQR 117


>ZFIN|ZDB-GENE-050522-559 [details] [associations]
            symbol:ctssb.1 "cathepsin S, b.1" species:7955
            "Danio rerio" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            ZFIN:ZDB-GENE-050522-559 eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 HOVERGEN:HBG011513 MEROPS:C01.034
            EMBL:BC095694 IPI:IPI00607338 UniGene:Dr.75553
            ProteinModelPortal:Q502H6 SMR:Q502H6 InParanoid:Q502H6
            ArrayExpress:Q502H6 Uniprot:Q502H6
        Length = 330

 Score = 359 (131.4 bits), Expect = 2.8e-32, P = 2.8e-32
 Identities = 101/297 (34%), Positives = 154/297 (51%)

Query:   194 ELSVQHHDKVYSS-VEDLLRRHENFVTNVEKAEDYQSEDSGTAVFGVNKFFDLSESDLQQ 252
             EL  + + K+Y++ VE+  RR + +  N++    +  E S     G++ + DLS + +  
Sbjct:    28 ELWKKTYGKIYTTEVEEFGRR-QLWERNLQLITVHNLEAS----MGMHSY-DLSMNHM-- 79

Query:   253 LTGLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVK 312
               G   D T E+I  +L      +    ++     +S   GD +P++ DWR +G +S VK
Sbjct:    80 --G---DLTTEEILQTLALTHVPSGFKRQIANIVGSS---GDAVPDSLDWREKGYVSSVK 131

Query:   313 EQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSNG--GCNGGRMDDALQYI 370
              QG C  CWAFS+VG +E         L +LS Q LVDC    G  GCNGG M DA QY+
Sbjct:   132 MQGACGSCWAFSSVGALEGQLKKTTGKLVDLSPQNLVDCSSKYGNKGCNGGFMSDAFQYV 191

Query:   371 IDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLS 430
             IDNGG+ SD AYPY+  + +  C              Y  +  G+E  +K+ VA+ GP+S
Sbjct:   192 IDNGGIASDSAYPYRGVQQQ--CSYSSSQRAANCTKYYF-VRQGDENALKQAVASVGPIS 248

Query:   431 VGMNANG---LFYYSGGVID--LNQRL--------YGT--SIPYWIVKNSWGSDWGE 472
             V ++A     + Y+SG   D   ++R+        YGT     +W+VKNSWG+ +G+
Sbjct:   249 VAIDATRPQFVLYHSGVYNDPTCSKRVNHAVLVVGYGTLSGQDHWLVKNSWGTRFGD 305

 Score = 278 (102.9 bits), Expect = 1.9e-23, P = 1.9e-23
 Identities = 73/210 (34%), Positives = 110/210 (52%)

Query:   446 IDLNQRLYGTSIPYWIVKNSWGSDWGEKVEDKVGSSGNRTRDLELTGVLPSKLSRLATEK 505
             +D  ++ Y +S+    ++ + GS W       VG+   + +  + TG    KL  L+ + 
Sbjct:   119 LDWREKGYVSSVK---MQGACGSCWAFS---SVGALEGQLK--KTTG----KLVDLSPQN 166

Query:   506 LVDCDMSNG--GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXX 563
             LVDC    G  GCNGG M DA QY+IDNGG+ SD AYPY+  + +  C            
Sbjct:   167 LVDCSSKYGNKGCNGGFMSDAFQYVIDNGGIASDSAYPYRGVQQQ--CSYSSSQRAANCT 224

Query:   564 XXYSRIPYGEEEEMKKWVATRGPLSVGMNANG--LFYYSGGVIDLNQRLCNPKAQNHALI 621
               Y  +  G+E  +K+ VA+ GP+SV ++A       Y  GV   N   C+ K  NHA++
Sbjct:   225 KYYF-VRQGDENALKQAVASVGPISVAIDATRPQFVLYHSGVY--NDPTCS-KRVNHAVL 280

Query:   622 IVGYGEEEKKDGTSIPYWIVKNSWGSDWGE 651
             +VGYG    +D     +W+VKNSWG+ +G+
Sbjct:   281 VVGYGTLSGQD-----HWLVKNSWGTRFGD 305

 Score = 139 (54.0 bits), Expect = 2.6e-06, P = 2.6e-06
 Identities = 25/55 (45%), Positives = 34/55 (61%)

Query:   145 GDDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQH 199
             GD +P++ DWR +G +S VK QG C  CWAFS+VG +E         L +LS Q+
Sbjct:   112 GDAVPDSLDWREKGYVSSVKMQGACGSCWAFSSVGALEGQLKKTTGKLVDLSPQN 166


>UNIPROTKB|G1SQF0 [details] [associations]
            symbol:CTSH "Uncharacterized protein" species:9986
            "Oryctolagus cuniculus" [GO:0001656 "metanephros development"
            evidence=ISS] [GO:0001913 "T cell mediated cytotoxicity"
            evidence=ISS] [GO:0002764 "immune response-regulating signaling
            pathway" evidence=ISS] [GO:0004175 "endopeptidase activity"
            evidence=ISS] [GO:0004177 "aminopeptidase activity" evidence=ISS]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0004252 "serine-type endopeptidase activity" evidence=ISS]
            [GO:0005615 "extracellular space" evidence=ISS] [GO:0005764
            "lysosome" evidence=ISS] [GO:0005829 "cytosol" evidence=ISS]
            [GO:0006508 "proteolysis" evidence=ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=ISS] [GO:0008284 "positive regulation
            of cell proliferation" evidence=ISS] [GO:0010628 "positive
            regulation of gene expression" evidence=ISS] [GO:0010634 "positive
            regulation of epithelial cell migration" evidence=ISS] [GO:0010813
            "neuropeptide catabolic process" evidence=ISS] [GO:0010815
            "bradykinin catabolic process" evidence=ISS] [GO:0010952 "positive
            regulation of peptidase activity" evidence=ISS] [GO:0016505
            "apoptotic protease activator activity" evidence=ISS] [GO:0030108
            "HLA-A specific activating MHC class I receptor activity"
            evidence=ISS] [GO:0030335 "positive regulation of cell migration"
            evidence=ISS] [GO:0031638 "zymogen activation" evidence=ISS]
            [GO:0031648 "protein destabilization" evidence=ISS] [GO:0032526
            "response to retinoic acid" evidence=ISS] [GO:0033619 "membrane
            protein proteolysis" evidence=ISS] [GO:0043066 "negative regulation
            of apoptotic process" evidence=ISS] [GO:0043129 "surfactant
            homeostasis" evidence=ISS] [GO:0045766 "positive regulation of
            angiogenesis" evidence=ISS] [GO:0060448 "dichotomous subdivision of
            terminal units involved in lung branching" evidence=ISS]
            [GO:0070324 "thyroid hormone binding" evidence=ISS] [GO:0070371
            "ERK1 and ERK2 cascade" evidence=ISS] [GO:0097208 "alveolar
            lamellar body" evidence=ISS] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005829 GO:GO:0043066 GO:GO:0005615 GO:GO:0008284
            GO:GO:0070371 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766
            GO:GO:0004177 GO:GO:0004252 GO:GO:0005764 GO:GO:0097208
            GO:GO:0032526 GO:GO:0010628 GO:GO:0070324 GO:GO:0016505
            GO:GO:0001656 GO:GO:0010634 GO:GO:0004197 GO:GO:0031648
            GO:GO:0097067 GO:GO:0031638 GO:GO:0001913
            GeneTree:ENSGT00660000095458 OMA:STSCHKT GO:GO:0030108
            GO:GO:0010815 GO:GO:0060448 GO:GO:0002764 GO:GO:0033619
            GO:GO:0010813 GO:GO:0043129 RefSeq:XP_002721635.1 UniGene:Ocu.7137
            Ensembl:ENSOCUT00000006138 GeneID:100101597 Uniprot:G1SQF0
        Length = 333

 Score = 300 (110.7 bits), Expect = 3.0e-32, Sum P(2) = 3.0e-32
 Identities = 74/208 (35%), Positives = 99/208 (47%)

Query:   284 AFQFNSLRHGDDLPEAFDWRAEG-VISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTE 342
             A + N LR     P + DWR +G  +S VK QG C  CW FS  G +E+  AI G  +  
Sbjct:   102 ATKSNYLRGTGPYPSSVDWRKKGNFVSPVKNQGACGSCWTFSTTGALESAVAIAGGKMLS 161

Query:   343 LSVQQLVDC--DMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXX 400
             L+ QQLVDC  + +N GC GG    A +YI+ N G++ + +YPY+A E    C       
Sbjct:   162 LAEQQLVDCAQNFNNHGCEGGLPSQAFEYILYNKGIMGEDSYPYRAMEGR--C-KFQPQK 218

Query:   401 XXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLFY-YSGGVID----------LN 449
                     + I   +EE M + VA   P+S        F  Y  G+            +N
Sbjct:   219 AIAFVKDVANITLNDEEAMVEAVALYNPVSFAFEVTEDFMQYRKGIYSSTSCHKTPDKVN 278

Query:   450 QRL----YG--TSIPYWIVKNSWGSDWG 471
               +    YG    +PYWIVKNSWGS WG
Sbjct:   279 HAVLAVGYGEENGVPYWIVKNSWGSHWG 306

 Score = 245 (91.3 bits), Expect = 3.1e-26, Sum P(2) = 3.1e-26
 Identities = 68/204 (33%), Positives = 96/204 (47%)

Query:   453 YGTSIPYWIVKNSWGSDWGEKVEDKVGS--SGNRTRDLE-LTGVLPSKLSRLATEKLVDC 509
             Y +S+  W  K ++ S    K +   GS  + + T  LE    +   K+  LA ++LVDC
Sbjct:   114 YPSSVD-WRKKGNFVSP--VKNQGACGSCWTFSTTGALESAVAIAGGKMLSLAEQQLVDC 170

Query:   510 --DMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYS 567
               + +N GC GG    A +YI+ N G++ + +YPY+A E    C               +
Sbjct:   171 AQNFNNHGCEGGLPSQAFEYILYNKGIMGEDSYPYRAMEGR--C-KFQPQKAIAFVKDVA 227

Query:   568 RIPYGEEEEMKKWVATRGPLSVGMNANGLFY-YSGGVIDLNQRLCNPKAQNHALIIVGYG 626
              I   +EE M + VA   P+S        F  Y  G+         P   NHA++ VGYG
Sbjct:   228 NITLNDEEAMVEAVALYNPVSFAFEVTEDFMQYRKGIYSSTSCHKTPDKVNHAVLAVGYG 287

Query:   627 EEEKKDGTSIPYWIVKNSWGSDWG 650
             EE       +PYWIVKNSWGS WG
Sbjct:   288 EEN-----GVPYWIVKNSWGSHWG 306

 Score = 126 (49.4 bits), Expect = 1.6e-07, Sum P(2) = 1.6e-07
 Identities = 26/64 (40%), Positives = 34/64 (53%)

Query:   136 AFQFNSLRHGDDLPEAFDWRAEG-VISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTE 194
             A + N LR     P + DWR +G  +S VK QG C  CW FS  G +E+  AI G  +  
Sbjct:   102 ATKSNYLRGTGPYPSSVDWRKKGNFVSPVKNQGACGSCWTFSTTGALESAVAIAGGKMLS 161

Query:   195 LSVQ 198
             L+ Q
Sbjct:   162 LAEQ 165

 Score = 84 (34.6 bits), Expect = 3.0e-32, Sum P(2) = 3.0e-32
 Identities = 28/88 (31%), Positives = 42/88 (47%)

Query:   169 CACCWAFSAVGVVEAMHAIQGNNLTELSV-----QHHDKVYSSVEDLLRRHENFVTNVEK 223
             CA  W   A G      A   NNL +        QHH K YS+ E+  RR + FV N  K
Sbjct:     9 CAGAWLLGAPGA----DAFSANNLEKFHFKSWMSQHHKK-YSA-EEYPRRLQTFVRNWRK 62

Query:   224 AEDYQSEDSGTAVFGVNKFFDLSESDLQ 251
                + + +  T   G+N+F D+S ++++
Sbjct:    63 INAHNNGNH-TFQMGLNQFSDMSFAEIK 89

 Score = 68 (29.0 bits), Expect = 1.4e-30, Sum P(2) = 1.4e-30
 Identities = 19/61 (31%), Positives = 35/61 (57%)

Query:    45 FLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAEDYQREDSGTAVFEV--NKFFDLSDSDL 102
             F ++M  H K YS+ E+  RR + FV N  K   +   ++G   F++  N+F D+S +++
Sbjct:    33 FKSWMSQHHKKYSA-EEYPRRLQTFVRNWRKINAH---NNGNHTFQMGLNQFSDMSFAEI 88

Query:   103 Q 103
             +
Sbjct:    89 K 89


>RGD|708447 [details] [associations]
            symbol:Testin "testin gene" species:10116 "Rattus norvegicus"
            [GO:0005576 "extracellular region" evidence=IEA] [GO:0006508
            "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0030054 "cell junction" evidence=IDA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 RGD:708447 GO:GO:0005576 GO:GO:0030054 eggNOG:COG4870
            HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513
            MEROPS:C01.972 OMA:RYHAENS OrthoDB:EOG4XWG0N EMBL:U16858
            IPI:IPI00207173 PIR:I52525 PIR:PC1251 RefSeq:NP_775155.1
            UniGene:Rn.10029 ProteinModelPortal:P15242 SMR:P15242
            Ensembl:ENSRNOT00000024467 GeneID:286916 KEGG:rno:286916
            UCSC:RGD:708447 CTD:286916 InParanoid:P15242 NextBio:625036
            Genevestigator:P15242 GermOnline:ENSRNOG00000018028 Uniprot:P15242
        Length = 333

 Score = 277 (102.6 bits), Expect = 3.1e-32, Sum P(2) = 3.1e-32
 Identities = 65/161 (40%), Positives = 93/161 (57%)

Query:   497 KLSRLATEKLVDCDMSN--GGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXX 554
             +L  L+ + L+DC  SN   GC+GG M  A QY+ DNGG+ ++++YPY+    E  C   
Sbjct:   157 RLIPLSEQNLLDCMGSNVTHGCSGGFMQYAFQYVKDNGGLATEESYPYRGQGRE--C-RY 213

Query:   555 XXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNAN-GLF-YYSGGVIDLNQRLCN 612
                        + +IP G EE + K VA  GP+SV ++A+ G F +Y  G+    Q  C 
Sbjct:   214 HAENSAANVRDFVQIP-GSEEALMKAVAKVGPISVAVDASHGSFQFYGSGIYYEPQ--CK 270

Query:   613 PKAQNHALIIVGYG-EEEKKDGTSIPYWIVKNSWGSDWGEK 652
                 NHA+++VGYG E E+ DG S  +W+VKNSWG +WG K
Sbjct:   271 RVHLNHAVLVVGYGFEGEESDGNS--FWLVKNSWGEEWGMK 309

 Score = 263 (97.6 bits), Expect = 8.1e-22, P = 8.1e-22
 Identities = 61/154 (39%), Positives = 85/154 (55%)

Query:   296 LPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSN 355
             +P+  DWR  G ++ VK QG CA  WAFSA G +E     +   L  LS Q L+DC  SN
Sbjct:   114 VPKRVDWRQLGYVTPVKNQGHCASSWAFSATGSLEGQMFRKTERLIPLSEQNLLDCMGSN 173

Query:   356 --GGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPY 413
                GC+GG M  A QY+ DNGG+ ++++YPY+    E  C              + +IP 
Sbjct:   174 VTHGCSGGFMQYAFQYVKDNGGLATEESYPYRGQGRE--C-RYHAENSAANVRDFVQIP- 229

Query:   414 GEEEEMKKWVATRGPLSVGMNAN-GLF-YYSGGV 445
             G EE + K VA  GP+SV ++A+ G F +Y  G+
Sbjct:   230 GSEEALMKAVAKVGPISVAVDASHGSFQFYGSGI 263

 Score = 108 (43.1 bits), Expect = 3.1e-32, Sum P(2) = 3.1e-32
 Identities = 22/52 (42%), Positives = 29/52 (55%)

Query:   148 LPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQH 199
             +P+  DWR  G ++ VK QG CA  WAFSA G +E     +   L  LS Q+
Sbjct:   114 VPKRVDWRQLGYVTPVKNQGHCASSWAFSATGSLEGQMFRKTERLIPLSEQN 165

 Score = 108 (43.1 bits), Expect = 1.3e-09, Sum P(2) = 1.3e-09
 Identities = 37/150 (24%), Positives = 65/150 (43%)

Query:   340 LTELSVQQLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXX 399
             L+E ++   +  ++++G C+GG M  A QY+ DNGG+ ++++YPY+    E         
Sbjct:   161 LSEQNLLDCMGSNVTHG-CSGGFMQYAFQYVKDNGGLATEESYPYRGQGRECRYHAENSA 219

Query:   400 XXXXXXXXYSRIPYGEEEEMKKW--VATRGPLSVG---MNANGLFYYSGGV-IDLNQRL- 452
                              + + K   ++     S G      +G++Y      + LN  + 
Sbjct:   220 ANVRDFVQIPGSEEALMKAVAKVGPISVAVDASHGSFQFYGSGIYYEPQCKRVHLNHAVL 279

Query:   453 ---YG------TSIPYWIVKNSWGSDWGEK 473
                YG          +W+VKNSWG +WG K
Sbjct:   280 VVGYGFEGEESDGNSFWLVKNSWGEEWGMK 309


>ZFIN|ZDB-GENE-040718-61 [details] [associations]
            symbol:ctsl.1 "cathepsin L.1" species:7955 "Danio
            rerio" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0016787 "hydrolase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 ZFIN:ZDB-GENE-040718-61
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 HOVERGEN:HBG011513
            GeneTree:ENSGT00660000095458 MEROPS:C01.092 EMBL:FP015965
            EMBL:BC075887 IPI:IPI00513499 RefSeq:NP_001002368.1
            UniGene:Dr.85174 SMR:Q6DHT0 Ensembl:ENSDART00000017756
            GeneID:436641 KEGG:dre:436641 CTD:436641 InParanoid:Q6DHT0
            OMA:GGQMENA OrthoDB:EOG41ZFB9 NextBio:20831086 Uniprot:Q6DHT0
        Length = 334

 Score = 333 (122.3 bits), Expect = 2.0e-29, P = 2.0e-29
 Identities = 102/305 (33%), Positives = 147/305 (48%)

Query:   202 KVYSSVEDLLRRHENFVTNVEKAEDYQS-EDSGTAVF--GVNKFFDLSESDLQQLTGLNL 258
             K Y S E+   R   ++TN +    +    D G   +  G+  F D+S  + +QL     
Sbjct:    35 KSYRSAEEESHRQLTWLTNRKLVLVHNMMADQGLKSYRLGMTYFADMSNEEYRQLV---- 90

Query:   259 DSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVKEQGKCA 318
                    +  L    S N T     +  F  LR    +P+  DWR +G ++ +K+Q +C 
Sbjct:    91 ------FRGCLG---SMNNTKARGGSTFFR-LRKAAVVPDTVDWRDKGYVTDIKDQKQCG 140

Query:   319 CCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSNG--GCNGGRMDDALQYIIDNGGV 376
              CWAFSA G +E     +   L  LS QQLVDC  S G  GC+GG MD A QYI  N G+
Sbjct:   141 SCWAFSATGSLEGQTFRKTGKLVSLSEQQLVDCSGSYGNYGCDGGLMDQAFQYIEANKGL 200

Query:   377 VSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNAN 436
              ++ +YPY+A + E  C              Y  I  G+E  +++ VAT GP+SV ++A 
Sbjct:   201 DTEDSYPYEAQDGE--C-RFNPSTVGASCTGYVDIASGDESALQEAVATIGPISVAIDAG 257

Query:   437 GLFY--YSGGVI--------DLNQRL----YGTSI--PYWIVKNSWGSDWGEKVEDKVGS 480
                +  YS GV         +L+  +    YG+S    YWIVKNSWG DWG  V+  +  
Sbjct:   258 HSSFQLYSSGVYNEPDCSSSELDHGVLAVGYGSSNGDDYWIVKNSWGLDWG--VQGYILM 315

Query:   481 SGNRT 485
             S N++
Sbjct:   316 SRNKS 320

 Score = 264 (98.0 bits), Expect = 4.5e-32, Sum P(2) = 4.5e-32
 Identities = 63/158 (39%), Positives = 86/158 (54%)

Query:   497 KLSRLATEKLVDCDMSNG--GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXX 554
             KL  L+ ++LVDC  S G  GC+GG MD A QYI  N G+ ++ +YPY+A + E  C   
Sbjct:   161 KLVSLSEQQLVDCSGSYGNYGCDGGLMDQAFQYIEANKGLDTEDSYPYEAQDGE--C-RF 217

Query:   555 XXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLFY--YSGGVIDLNQRLCN 612
                        Y  I  G+E  +++ VAT GP+SV ++A    +  YS GV   N+  C+
Sbjct:   218 NPSTVGASCTGYVDIASGDESALQEAVATIGPISVAIDAGHSSFQLYSSGVY--NEPDCS 275

Query:   613 PKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWG 650
                 +H ++ VGYG     D     YWIVKNSWG DWG
Sbjct:   276 SSELDHGVLAVGYGSSNGDD-----YWIVKNSWGLDWG 308

 Score = 120 (47.3 bits), Expect = 4.5e-32, Sum P(2) = 4.5e-32
 Identities = 23/60 (38%), Positives = 33/60 (55%)

Query:   139 FNSLRHGDDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
             F  LR    +P+  DWR +G ++ +K+Q +C  CWAFSA G +E     +   L  LS Q
Sbjct:   109 FFRLRKAAVVPDTVDWRDKGYVTDIKDQKQCGSCWAFSATGSLEGQTFRKTGKLVSLSEQ 168


>UNIPROTKB|G1M0X4 [details] [associations]
            symbol:CTSH "Uncharacterized protein" species:9646
            "Ailuropoda melanoleuca" [GO:0001656 "metanephros development"
            evidence=ISS] [GO:0001913 "T cell mediated cytotoxicity"
            evidence=ISS] [GO:0002764 "immune response-regulating signaling
            pathway" evidence=ISS] [GO:0004175 "endopeptidase activity"
            evidence=ISS] [GO:0004177 "aminopeptidase activity" evidence=ISS]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0004252 "serine-type endopeptidase activity" evidence=ISS]
            [GO:0005615 "extracellular space" evidence=ISS] [GO:0005764
            "lysosome" evidence=ISS] [GO:0005829 "cytosol" evidence=ISS]
            [GO:0006508 "proteolysis" evidence=ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=ISS] [GO:0008284 "positive regulation
            of cell proliferation" evidence=ISS] [GO:0010628 "positive
            regulation of gene expression" evidence=ISS] [GO:0010634 "positive
            regulation of epithelial cell migration" evidence=ISS] [GO:0010813
            "neuropeptide catabolic process" evidence=ISS] [GO:0010815
            "bradykinin catabolic process" evidence=ISS] [GO:0010952 "positive
            regulation of peptidase activity" evidence=ISS] [GO:0016505
            "apoptotic protease activator activity" evidence=ISS] [GO:0030108
            "HLA-A specific activating MHC class I receptor activity"
            evidence=ISS] [GO:0030335 "positive regulation of cell migration"
            evidence=ISS] [GO:0031638 "zymogen activation" evidence=ISS]
            [GO:0031648 "protein destabilization" evidence=ISS] [GO:0032526
            "response to retinoic acid" evidence=ISS] [GO:0033619 "membrane
            protein proteolysis" evidence=ISS] [GO:0043066 "negative regulation
            of apoptotic process" evidence=ISS] [GO:0043129 "surfactant
            homeostasis" evidence=ISS] [GO:0045766 "positive regulation of
            angiogenesis" evidence=ISS] [GO:0060448 "dichotomous subdivision of
            terminal units involved in lung branching" evidence=ISS]
            [GO:0070324 "thyroid hormone binding" evidence=ISS] [GO:0070371
            "ERK1 and ERK2 cascade" evidence=ISS] [GO:0097208 "alveolar
            lamellar body" evidence=ISS] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005829 GO:GO:0043066 GO:GO:0005615 GO:GO:0008284
            GO:GO:0070371 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766
            GO:GO:0004177 GO:GO:0004252 GO:GO:0005764 GO:GO:0097208
            GO:GO:0032526 GO:GO:0010628 GO:GO:0070324 GO:GO:0016505
            GO:GO:0001656 GO:GO:0010634 GO:GO:0004197 GO:GO:0031648
            GO:GO:0097067 GO:GO:0031638 GO:GO:0001913
            GeneTree:ENSGT00660000095458 OMA:STSCHKT GO:GO:0030108
            GO:GO:0010815 GO:GO:0060448 GO:GO:0002764 GO:GO:0033619
            GO:GO:0010813 GO:GO:0043129 EMBL:ACTA01057330 EMBL:ACTA01065330
            Ensembl:ENSAMET00000013529 Uniprot:G1M0X4
        Length = 337

 Score = 291 (107.5 bits), Expect = 2.9e-29, Sum P(2) = 2.9e-29
 Identities = 73/204 (35%), Positives = 95/204 (46%)

Query:   288 NSLRHGDDLPEAFDWRAEG-VISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQ 346
             N LR     P   DWR +G  +S VK QG C  CW FS  G +E+  AI+   L  L+ Q
Sbjct:   110 NYLRGTGPYPPFVDWRKKGKFVSPVKNQGGCGSCWTFSTTGALESAIAIKTGKLLSLAEQ 169

Query:   347 QLVDC--DMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXX 404
             QLVDC  D +N GC GG    A +YI  N G++ + +YPYK  + +  C           
Sbjct:   170 QLVDCAQDFNNHGCQGGLPSQAFEYIRYNRGIMGEDSYPYKGQDGD--C-KFQPSKAIAF 226

Query:   405 XXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLFY-YSGGVID----------LNQRL- 452
                 + I   +E+ M + VA   P+S      G F  Y  GV            +N  + 
Sbjct:   227 VKDVANITINDEQAMVEAVALFNPVSFAFEVTGDFMMYRKGVYSSTSCHKTPDKVNHAVL 286

Query:   453 ---YG--TSIPYWIVKNSWGSDWG 471
                YG    +PYWIVKNSWG  WG
Sbjct:   287 AVGYGEQNGVPYWIVKNSWGPQWG 310

 Score = 244 (91.0 bits), Expect = 5.1e-32, Sum P(3) = 5.1e-32
 Identities = 57/157 (36%), Positives = 76/157 (48%)

Query:   497 KLSRLATEKLVDC--DMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXX 554
             KL  LA ++LVDC  D +N GC GG    A +YI  N G++ + +YPYK  + +  C   
Sbjct:   162 KLLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYIRYNRGIMGEDSYPYKGQDGD--C-KF 218

Query:   555 XXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLFY-YSGGVIDLNQRLCNP 613
                         + I   +E+ M + VA   P+S      G F  Y  GV         P
Sbjct:   219 QPSKAIAFVKDVANITINDEQAMVEAVALFNPVSFAFEVTGDFMMYRKGVYSSTSCHKTP 278

Query:   614 KAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWG 650
                NHA++ VGYGE+       +PYWIVKNSWG  WG
Sbjct:   279 DKVNHAVLAVGYGEQN-----GVPYWIVKNSWGPQWG 310

 Score = 116 (45.9 bits), Expect = 5.1e-32, Sum P(3) = 5.1e-32
 Identities = 25/60 (41%), Positives = 31/60 (51%)

Query:   140 NSLRHGDDLPEAFDWRAEG-VISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
             N LR     P   DWR +G  +S VK QG C  CW FS  G +E+  AI+   L  L+ Q
Sbjct:   110 NYLRGTGPYPPFVDWRKKGKFVSPVKNQGGCGSCWTFSTTGALESAIAIKTGKLLSLAEQ 169

 Score = 65 (27.9 bits), Expect = 5.1e-32, Sum P(3) = 5.1e-32
 Identities = 20/70 (28%), Positives = 37/70 (52%)

Query:    37 YLNSPVTRFLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAEDYQREDSGTAVFEV--NKF 94
             +L +    F ++M  H K YSS E+   R   FV N  K   +   ++G   F++  N+F
Sbjct:    29 FLFTEKVHFKSWMVQHQKKYSS-EEYQHRLRTFVGNWRKINAH---NAGNHTFKMGLNQF 84

Query:    95 FDLSDSDLQQ 104
              D+S +++++
Sbjct:    85 SDMSFAEIKR 94

 Score = 64 (27.6 bits), Expect = 1.7e-22, Sum P(2) = 1.7e-22
 Identities = 19/56 (33%), Positives = 31/56 (55%)

Query:   197 VQHHDKVYSSVEDLLRRHENFVTNVEKAEDYQSEDSGTAVFGVNKFFDLSESDLQQ 252
             VQH  K YSS E+   R   FV N  K   + + +  T   G+N+F D+S +++++
Sbjct:    42 VQHQKK-YSS-EEYQHRLRTFVGNWRKINAHNAGNH-TFKMGLNQFSDMSFAEIKR 94


>UNIPROTKB|F1PMM9 [details] [associations]
            symbol:CTSL1 "Cathepsin L1" species:9615 "Canis lupus
            familiaris" [GO:0097067 "cellular response to thyroid hormone
            stimulus" evidence=IEA] [GO:0042393 "histone binding" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0004197 "cysteine-type
            endopeptidase activity" evidence=IEA] [GO:0002250 "adaptive immune
            response" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00639 GO:GO:0004197 GO:GO:0002250
            GeneTree:ENSGT00560000076577 GO:GO:0097067 EMBL:AAEX03000499
            Ensembl:ENSCAFT00000002029 OMA:EFKQVLN Uniprot:F1PMM9
        Length = 341

 Score = 269 (99.8 bits), Expect = 1.8e-22, P = 1.8e-22
 Identities = 57/155 (36%), Positives = 84/155 (54%)

Query:   295 DLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMS 354
             ++P + DWR +G ++ VK+QG+C  CWAFSA G +E     +   L  LS Q LVDC  S
Sbjct:   121 EVPSSVDWREQGYVTPVKDQGQCLGCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSWS 180

Query:   355 NG--GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIP 412
              G  GCNGG M+ A QY+ DNGG+ S+++YPY A      C              +  + 
Sbjct:   181 QGNRGCNGGLMEYAFQYVKDNGGLDSEESYPYLARNEP--CKYRPEKSAANVTAFWPIL- 237

Query:   413 YGEEEEMKKWVATRGPLSVGMNAN--GLFYYSGGV 445
               EE+ +   VAT GP+S  ++++     +Y  G+
Sbjct:   238 -NEEDGLMTTVATVGPVSAAVDSSPQSFQFYKKGI 271

 Score = 256 (95.2 bits), Expect = 6.1e-32, Sum P(2) = 6.1e-32
 Identities = 58/158 (36%), Positives = 84/158 (53%)

Query:   497 KLSRLATEKLVDCDMSNG--GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXX 554
             KL  L+ + LVDC  S G  GCNGG M+ A QY+ DNGG+ S+++YPY A      C   
Sbjct:   165 KLVSLSEQNLVDCSWSQGNRGCNGGLMEYAFQYVKDNGGLDSEESYPYLARNEP--CKYR 222

Query:   555 XXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNAN--GLFYYSGGVIDLNQRLCN 612
                        +  +   EE+ +   VAT GP+S  ++++     +Y  G+       C+
Sbjct:   223 PEKSAANVTAFWPIL--NEEDGLMTTVATVGPVSAAVDSSPQSFQFYKKGIY--YDPKCS 278

Query:   613 PKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWG 650
              K  NH +++VGYG E  +      YWIVKNSWG++WG
Sbjct:   279 NKLLNHGVLVVGYGFEGAESDNK-KYWIVKNSWGTNWG 315

 Score = 127 (49.8 bits), Expect = 6.1e-32, Sum P(2) = 6.1e-32
 Identities = 22/53 (41%), Positives = 33/53 (62%)

Query:   147 DLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQH 199
             ++P + DWR +G ++ VK+QG+C  CWAFSA G +E     +   L  LS Q+
Sbjct:   121 EVPSSVDWREQGYVTPVKDQGQCLGCWAFSATGALEGQMFRKTGKLVSLSEQN 173

 Score = 126 (49.4 bits), Expect = 1.4e-13, Sum P(2) = 1.4e-13
 Identities = 39/134 (29%), Positives = 55/134 (41%)

Query:   355 NGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASES------ERGCLXXXXXXXXXXXXXY 408
             N GCNGG M+ A QY+ DNGG+ S+++YPY A         E+                 
Sbjct:   183 NRGCNGGLMEYAFQYVKDNGGLDSEESYPYLARNEPCKYRPEKSAANVTAFWPILNEEDG 242

Query:   409 SRIPYGEEEEMKKWVATRGPLSVGMNANGLFY--------YSGGVIDLNQRLYGTSIP-- 458
                       +   V +  P S      G++Y         + GV+ +     G      
Sbjct:   243 LMTTVATVGPVSAAVDS-SPQSFQFYKKGIYYDPKCSNKLLNHGVLVVGYGFEGAESDNK 301

Query:   459 -YWIVKNSWGSDWG 471
              YWIVKNSWG++WG
Sbjct:   302 KYWIVKNSWGTNWG 315

 Score = 37 (18.1 bits), Expect = 1.6e-22, Sum P(2) = 1.6e-22
 Identities = 34/141 (24%), Positives = 60/141 (42%)

Query:    52 HDKVYSSVEDLLRRHENFVTNVEKAEDYQREDS-GTAVFEV--NKFFDLSDSDLQQLTGL 108
             H K+Y   E+  RR   +  N+E  E + +E S G   F +  N F D+++ + +Q+  L
Sbjct:    44 HGKLYDKDEEGWRRTV-WERNMEMIEQHNQEYSQGEHSFTLAMNAFGDMTNEEFKQV--L 100

Query:   109 NLDSTLEDIQPS--LQAP-FSSNQTDTEMRAFQF-NSLR-HGDDLP-EAFD--------- 153
             N D  ++  +      AP F+   +  + R   +   ++  G  L   AF          
Sbjct:   101 N-DFKIQKHKKGKVFPAPLFAEVPSSVDWREQGYVTPVKDQGQCLGCWAFSATGALEGQM 159

Query:   154 WRAEGVISKVKEQGKCACCWA 174
             +R  G +  + EQ    C W+
Sbjct:   160 FRKTGKLVSLSEQNLVDCSWS 180


>UNIPROTKB|Q3T0I2 [details] [associations]
            symbol:CTSH "Pro-cathepsin H" species:9913 "Bos taurus"
            [GO:0031638 "zymogen activation" evidence=ISS] [GO:0031648 "protein
            destabilization" evidence=ISS] [GO:0004197 "cysteine-type
            endopeptidase activity" evidence=ISS] [GO:0010815 "bradykinin
            catabolic process" evidence=ISS] [GO:0043129 "surfactant
            homeostasis" evidence=ISS] [GO:0010813 "neuropeptide catabolic
            process" evidence=ISS] [GO:0005829 "cytosol" evidence=ISS]
            [GO:0045766 "positive regulation of angiogenesis" evidence=ISS]
            [GO:0008284 "positive regulation of cell proliferation"
            evidence=ISS] [GO:0008234 "cysteine-type peptidase activity"
            evidence=ISS] [GO:0030335 "positive regulation of cell migration"
            evidence=ISS] [GO:0001656 "metanephros development" evidence=ISS]
            [GO:0033619 "membrane protein proteolysis" evidence=ISS]
            [GO:0043066 "negative regulation of apoptotic process"
            evidence=ISS] [GO:0004252 "serine-type endopeptidase activity"
            evidence=ISS] [GO:0001913 "T cell mediated cytotoxicity"
            evidence=ISS] [GO:0016505 "apoptotic protease activator activity"
            evidence=ISS] [GO:0010952 "positive regulation of peptidase
            activity" evidence=ISS] [GO:0030108 "HLA-A specific activating MHC
            class I receptor activity" evidence=ISS] [GO:0002764 "immune
            response-regulating signaling pathway" evidence=ISS] [GO:0070371
            "ERK1 and ERK2 cascade" evidence=ISS] [GO:0010628 "positive
            regulation of gene expression" evidence=ISS] [GO:0070324 "thyroid
            hormone binding" evidence=ISS] [GO:0006508 "proteolysis"
            evidence=ISS] [GO:0060448 "dichotomous subdivision of terminal
            units involved in lung branching" evidence=ISS] [GO:0097208
            "alveolar lamellar body" evidence=ISS] [GO:0005764 "lysosome"
            evidence=ISS] [GO:0005615 "extracellular space" evidence=ISS]
            [GO:0004177 "aminopeptidase activity" evidence=ISS] [GO:0004175
            "endopeptidase activity" evidence=ISS] [GO:0032526 "response to
            retinoic acid" evidence=ISS] [GO:0010634 "positive regulation of
            epithelial cell migration" evidence=ISS] [GO:0005515 "protein
            binding" evidence=IPI] [GO:0097067 "cellular response to thyroid
            hormone stimulus" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0005829 GO:GO:0043066 GO:GO:0005615
            GO:GO:0008284 GO:GO:0070371 eggNOG:COG4870 HOGENOM:HOG000230774
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766 GO:GO:0004177
            GO:GO:0004252 GO:GO:0005764 GO:GO:0097208 GO:GO:0032526
            GO:GO:0010628 GO:GO:0070324 KO:K01366 GO:GO:0016505 GO:GO:0001656
            GO:GO:0010634 GO:GO:0004197 GO:GO:0031648 HOVERGEN:HBG011513
            GO:GO:0097067 GO:GO:0031638 GO:GO:0001913
            GeneTree:ENSGT00660000095458 EMBL:BC102386 IPI:IPI00693034
            RefSeq:NP_001029557.1 UniGene:Bt.52393 ProteinModelPortal:Q3T0I2
            SMR:Q3T0I2 STRING:Q3T0I2 MEROPS:C01.040 PRIDE:Q3T0I2
            Ensembl:ENSBTAT00000014593 GeneID:510524 KEGG:bta:510524 CTD:1512
            InParanoid:Q3T0I2 OMA:STSCHKT OrthoDB:EOG4W9J43 NextBio:20869490
            GO:GO:0030108 GO:GO:0010815 GO:GO:0060448 GO:GO:0002764
            GO:GO:0033619 GO:GO:0010813 GO:GO:0043129 Uniprot:Q3T0I2
        Length = 335

 Score = 284 (105.0 bits), Expect = 2.8e-30, Sum P(2) = 2.8e-30
 Identities = 72/210 (34%), Positives = 97/210 (46%)

Query:   284 AFQFNSLRHGDDLPEAFDWRAEG-VISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTE 342
             A + N LR     P + DWR +G  ++ VK QG C  CW FS  G +E+  AI    L  
Sbjct:   104 ATKSNYLRGTGPYPPSMDWRKKGNFVTPVKNQGSCGSCWTFSTTGALESAVAIATGKLPF 163

Query:   343 LSVQQLVDC--DMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXX 400
             L+ QQLVDC  + +N GC GG    A +YI  N G++ +  YPY+  + +  C       
Sbjct:   164 LAEQQLVDCAQNFNNHGCQGGLPSQAFEYIRYNKGIMGEDTYPYRGQDGD--C-KYQPSK 220

Query:   401 XXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLFY-YSGGVID----------LN 449
                     + I   +EE M + VA   P+S        F  Y  G+            +N
Sbjct:   221 AIAFVKDVANITLNDEEAMVEAVALHNPVSFAFEVTADFMMYRKGIYSSTSCHKTPDKVN 280

Query:   450 QRL----YGTS--IPYWIVKNSWGSDWGEK 473
               +    YG    IPYWIVKNSWG +WG K
Sbjct:   281 HAVLAVGYGEEKGIPYWIVKNSWGPNWGMK 310

 Score = 243 (90.6 bits), Expect = 6.2e-32, Sum P(3) = 6.2e-32
 Identities = 57/163 (34%), Positives = 78/163 (47%)

Query:   493 VLPSKLSRLATEKLVDC--DMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERG 550
             +   KL  LA ++LVDC  + +N GC GG    A +YI  N G++ +  YPY+  + +  
Sbjct:   156 IATGKLPFLAEQQLVDCAQNFNNHGCQGGLPSQAFEYIRYNKGIMGEDTYPYRGQDGD-- 213

Query:   551 CLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLFY-YSGGVIDLNQR 609
             C               + I   +EE M + VA   P+S        F  Y  G+      
Sbjct:   214 C-KYQPSKAIAFVKDVANITLNDEEAMVEAVALHNPVSFAFEVTADFMMYRKGIYSSTSC 272

Query:   610 LCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
                P   NHA++ VGYGEE+      IPYWIVKNSWG +WG K
Sbjct:   273 HKTPDKVNHAVLAVGYGEEK-----GIPYWIVKNSWGPNWGMK 310

 Score = 115 (45.5 bits), Expect = 6.2e-32, Sum P(3) = 6.2e-32
 Identities = 25/64 (39%), Positives = 33/64 (51%)

Query:   136 AFQFNSLRHGDDLPEAFDWRAEG-VISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTE 194
             A + N LR     P + DWR +G  ++ VK QG C  CW FS  G +E+  AI    L  
Sbjct:   104 ATKSNYLRGTGPYPPSMDWRKKGNFVTPVKNQGSCGSCWTFSTTGALESAVAIATGKLPF 163

Query:   195 LSVQ 198
             L+ Q
Sbjct:   164 LAEQ 167

 Score = 82 (33.9 bits), Expect = 2.8e-30, Sum P(2) = 2.8e-30
 Identities = 25/87 (28%), Positives = 44/87 (50%)

Query:   169 CACCWAFSA--VGVVE-AMHAIQGNNLTELSVQHHDKVYSSVEDLLRRHENFVTNVEKAE 225
             CA  W   A   G  E A ++++  +     VQH  K YSS E+   R + F +N+ +  
Sbjct:     9 CAGAWLLGAPACGAAELAANSLEKFHFQSWMVQHQKK-YSS-EEYYHRLQAFASNLREIN 66

Query:   226 DYQSEDSGTAVFGVNKFFDLSESDLQQ 252
              + + +  T   G+N+F D+S  +L++
Sbjct:    67 AHNARNH-TFKMGLNQFSDMSFDELKR 92

 Score = 64 (27.6 bits), Expect = 6.2e-32, Sum P(3) = 6.2e-32
 Identities = 17/60 (28%), Positives = 32/60 (53%)

Query:    45 FLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAEDYQREDSGTAVFEVNKFFDLSDSDLQQ 104
             F ++M  H K YSS E+   R + F +N+ +   +   +  T    +N+F D+S  +L++
Sbjct:    35 FQSWMVQHQKKYSS-EEYYHRLQAFASNLREINAHNARNH-TFKMGLNQFSDMSFDELKR 92


>UNIPROTKB|F7B939 [details] [associations]
            symbol:CTSH "Uncharacterized protein" species:9483
            "Callithrix jacchus" [GO:0001656 "metanephros development"
            evidence=ISS] [GO:0001913 "T cell mediated cytotoxicity"
            evidence=ISS] [GO:0002764 "immune response-regulating signaling
            pathway" evidence=ISS] [GO:0004175 "endopeptidase activity"
            evidence=ISS] [GO:0004177 "aminopeptidase activity" evidence=ISS]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0004252 "serine-type endopeptidase activity" evidence=ISS]
            [GO:0005615 "extracellular space" evidence=ISS] [GO:0005764
            "lysosome" evidence=ISS] [GO:0005829 "cytosol" evidence=ISS]
            [GO:0006508 "proteolysis" evidence=ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=ISS] [GO:0008284 "positive regulation
            of cell proliferation" evidence=ISS] [GO:0010628 "positive
            regulation of gene expression" evidence=ISS] [GO:0010634 "positive
            regulation of epithelial cell migration" evidence=ISS] [GO:0010813
            "neuropeptide catabolic process" evidence=ISS] [GO:0010815
            "bradykinin catabolic process" evidence=ISS] [GO:0010952 "positive
            regulation of peptidase activity" evidence=ISS] [GO:0016505
            "apoptotic protease activator activity" evidence=ISS] [GO:0030108
            "HLA-A specific activating MHC class I receptor activity"
            evidence=ISS] [GO:0030335 "positive regulation of cell migration"
            evidence=ISS] [GO:0031638 "zymogen activation" evidence=ISS]
            [GO:0031648 "protein destabilization" evidence=ISS] [GO:0032526
            "response to retinoic acid" evidence=ISS] [GO:0033619 "membrane
            protein proteolysis" evidence=ISS] [GO:0043066 "negative regulation
            of apoptotic process" evidence=ISS] [GO:0043129 "surfactant
            homeostasis" evidence=ISS] [GO:0045766 "positive regulation of
            angiogenesis" evidence=ISS] [GO:0060448 "dichotomous subdivision of
            terminal units involved in lung branching" evidence=ISS]
            [GO:0070324 "thyroid hormone binding" evidence=ISS] [GO:0070371
            "ERK1 and ERK2 cascade" evidence=ISS] [GO:0097208 "alveolar
            lamellar body" evidence=ISS] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005829 GO:GO:0043066 GO:GO:0005615 GO:GO:0008284
            GO:GO:0070371 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766
            GO:GO:0004177 GO:GO:0004252 GO:GO:0005764 GO:GO:0097208
            GO:GO:0032526 GO:GO:0010628 GO:GO:0070324 GO:GO:0016505
            GO:GO:0001656 GO:GO:0010634 GO:GO:0004197 GO:GO:0031648
            GO:GO:0031638 GO:GO:0001913 GeneTree:ENSGT00660000095458 CTD:1512
            GO:GO:0030108 GO:GO:0010815 GO:GO:0060448 GO:GO:0002764
            GO:GO:0033619 GO:GO:0010813 GO:GO:0043129 EMBL:ACFV01158341
            EMBL:ACFV01158342 EMBL:ACFV01158343 RefSeq:XP_002753411.1
            Ensembl:ENSCJAT00000004397 GeneID:100413104 Uniprot:F7B939
        Length = 336

 Score = 287 (106.1 bits), Expect = 1.1e-29, Sum P(2) = 1.1e-29
 Identities = 72/208 (34%), Positives = 97/208 (46%)

Query:   284 AFQFNSLRHGDDLPEAFDWRAEG-VISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTE 342
             A + N LR     P + DWR +G  +S VK QG C  CW FS  G +E+  AI    +  
Sbjct:   105 ATKSNYLRGTGPYPPSVDWRKKGHFVSPVKNQGACGSCWTFSTTGALESAIAIATGKMLS 164

Query:   343 LSVQQLVDC--DMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXX 400
             L+ QQLVDC  D +N GC GG    A +YI+ N G++ +  YPY+  +S+  C       
Sbjct:   165 LAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYNNGIMGEDTYPYQGKDSD--C-KFQPGK 221

Query:   401 XXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLFY-YSGGVID----------LN 449
                     + I   +E+ M + VA   P+S        F  Y  G+            +N
Sbjct:   222 AIGFVKDVANITIYDEDAMVEAVALYNPVSFAFEVTQDFMMYKRGIYSSTSCHKTPDKVN 281

Query:   450 QRL----YG--TSIPYWIVKNSWGSDWG 471
               +    YG    IPYWIVKNSWG  WG
Sbjct:   282 HAVLAVGYGEENGIPYWIVKNSWGPQWG 309

 Score = 241 (89.9 bits), Expect = 6.7e-32, Sum P(3) = 6.7e-32
 Identities = 56/161 (34%), Positives = 77/161 (47%)

Query:   493 VLPSKLSRLATEKLVDC--DMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERG 550
             +   K+  LA ++LVDC  D +N GC GG    A +YI+ N G++ +  YPY+  +S+  
Sbjct:   157 IATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYNNGIMGEDTYPYQGKDSD-- 214

Query:   551 CLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLFY-YSGGVIDLNQR 609
             C               + I   +E+ M + VA   P+S        F  Y  G+      
Sbjct:   215 C-KFQPGKAIGFVKDVANITIYDEDAMVEAVALYNPVSFAFEVTQDFMMYKRGIYSSTSC 273

Query:   610 LCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWG 650
                P   NHA++ VGYGEE       IPYWIVKNSWG  WG
Sbjct:   274 HKTPDKVNHAVLAVGYGEEN-----GIPYWIVKNSWGPQWG 309

 Score = 117 (46.2 bits), Expect = 6.7e-32, Sum P(3) = 6.7e-32
 Identities = 25/64 (39%), Positives = 33/64 (51%)

Query:   136 AFQFNSLRHGDDLPEAFDWRAEG-VISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTE 194
             A + N LR     P + DWR +G  +S VK QG C  CW FS  G +E+  AI    +  
Sbjct:   105 ATKSNYLRGTGPYPPSVDWRKKGHFVSPVKNQGACGSCWTFSTTGALESAIAIATGKMLS 164

Query:   195 LSVQ 198
             L+ Q
Sbjct:   165 LAEQ 168

 Score = 73 (30.8 bits), Expect = 1.1e-29, Sum P(2) = 1.1e-29
 Identities = 22/87 (25%), Positives = 44/87 (50%)

Query:   169 CA--CCWAFSAVGVVE-AMHAIQGNNLTELSVQHHDKVYSSVEDLLRRHENFVTNVEKAE 225
             CA  C     A G  E ++++++  +      +HH K YS  E+  +R + F +N  K  
Sbjct:     9 CAGVCLLGAPARGAAELSVNSLEKFHFKSWMAKHH-KTYSREEEYHQRLQTFASNWRKIN 67

Query:   226 DYQSEDSGTAVFGVNKFFDLSESDLQQ 252
              + + +  T    VN+F D+S +++++
Sbjct:    68 AHNNGNH-TFKMAVNQFSDMSFAEIKR 93

 Score = 71 (30.1 bits), Expect = 6.7e-32, Sum P(3) = 6.7e-32
 Identities = 17/60 (28%), Positives = 32/60 (53%)

Query:    45 FLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAEDYQREDSGTAVFEVNKFFDLSDSDLQQ 104
             F ++M  H K YS  E+  +R + F +N  K   +   +  T    VN+F D+S +++++
Sbjct:    35 FKSWMAKHHKTYSREEEYHQRLQTFASNWRKINAHNNGNH-TFKMAVNQFSDMSFAEIKR 93


>UNIPROTKB|F7BRD4 [details] [associations]
            symbol:CTSH "Uncharacterized protein" species:9483
            "Callithrix jacchus" [GO:0001656 "metanephros development"
            evidence=ISS] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0001656
            GeneTree:ENSGT00660000095458 EMBL:ACFV01158341 EMBL:ACFV01158342
            EMBL:ACFV01158343 Ensembl:ENSCJAT00000004396 Uniprot:F7BRD4
        Length = 336

 Score = 287 (106.1 bits), Expect = 1.8e-29, Sum P(2) = 1.8e-29
 Identities = 72/208 (34%), Positives = 97/208 (46%)

Query:   284 AFQFNSLRHGDDLPEAFDWRAEG-VISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTE 342
             A + N LR     P + DWR +G  +S VK QG C  CW FS  G +E+  AI    +  
Sbjct:   105 ATKSNYLRGTGPYPPSVDWRKKGHFVSPVKNQGACGSCWTFSTTGALESAIAIATGKMLS 164

Query:   343 LSVQQLVDC--DMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXX 400
             L+ QQLVDC  D +N GC GG    A +YI+ N G++ +  YPY+  +S+  C       
Sbjct:   165 LAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYNNGIMGEDTYPYQGKDSD--C-KFQPGK 221

Query:   401 XXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLFY-YSGGVID----------LN 449
                     + I   +E+ M + VA   P+S        F  Y  G+            +N
Sbjct:   222 AIGFVKDVANITIYDEDAMVEAVALYNPVSFAFEVTQDFMMYKRGIYSSTSCHKTPDKVN 281

Query:   450 QRL----YG--TSIPYWIVKNSWGSDWG 471
               +    YG    IPYWIVKNSWG  WG
Sbjct:   282 HAVLAVGYGEENGIPYWIVKNSWGPQWG 309

 Score = 241 (89.9 bits), Expect = 6.7e-32, Sum P(3) = 6.7e-32
 Identities = 56/161 (34%), Positives = 77/161 (47%)

Query:   493 VLPSKLSRLATEKLVDC--DMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERG 550
             +   K+  LA ++LVDC  D +N GC GG    A +YI+ N G++ +  YPY+  +S+  
Sbjct:   157 IATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYNNGIMGEDTYPYQGKDSD-- 214

Query:   551 CLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLFY-YSGGVIDLNQR 609
             C               + I   +E+ M + VA   P+S        F  Y  G+      
Sbjct:   215 C-KFQPGKAIGFVKDVANITIYDEDAMVEAVALYNPVSFAFEVTQDFMMYKRGIYSSTSC 273

Query:   610 LCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWG 650
                P   NHA++ VGYGEE       IPYWIVKNSWG  WG
Sbjct:   274 HKTPDKVNHAVLAVGYGEEN-----GIPYWIVKNSWGPQWG 309

 Score = 117 (46.2 bits), Expect = 6.7e-32, Sum P(3) = 6.7e-32
 Identities = 25/64 (39%), Positives = 33/64 (51%)

Query:   136 AFQFNSLRHGDDLPEAFDWRAEG-VISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTE 194
             A + N LR     P + DWR +G  +S VK QG C  CW FS  G +E+  AI    +  
Sbjct:   105 ATKSNYLRGTGPYPPSVDWRKKGHFVSPVKNQGACGSCWTFSTTGALESAIAIATGKMLS 164

Query:   195 LSVQ 198
             L+ Q
Sbjct:   165 LAEQ 168

 Score = 71 (30.1 bits), Expect = 6.7e-32, Sum P(3) = 6.7e-32
 Identities = 17/60 (28%), Positives = 32/60 (53%)

Query:    45 FLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAEDYQREDSGTAVFEVNKFFDLSDSDLQQ 104
             F ++M  H K YS  E+  +R + F +N  K   +   +  T    VN+F D+S +++++
Sbjct:    35 FKSWMAKHHKTYSREEEYHQRLQTFASNWRKINAHNNGNH-TFKMAVNQFSDMSFAEIKR 93

 Score = 67 (28.6 bits), Expect = 3.8e-22, Sum P(2) = 3.8e-22
 Identities = 18/69 (26%), Positives = 36/69 (52%)

Query:   184 MHAIQGNNLTELSVQHHDKVYSSVEDLLRRHENFVTNVEKAEDYQSEDSGTAVFGVNKFF 243
             M A++  +      +HH K YS  E+  +R + F +N  K   + + +  T    VN+F 
Sbjct:    27 MLALEKFHFKSWMAKHH-KTYSREEEYHQRLQTFASNWRKINAHNNGNH-TFKMAVNQFS 84

Query:   244 DLSESDLQQ 252
             D+S +++++
Sbjct:    85 DMSFAEIKR 93


>RGD|631421 [details] [associations]
            symbol:Ctsq "cathepsin Q" species:10116 "Rattus norvegicus"
            [GO:0005764 "lysosome" evidence=NAS] [GO:0006508 "proteolysis"
            evidence=IEA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 RGD:631421 eggNOG:COG4870
            HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0005764 GeneTree:ENSGT00560000076577
            HOVERGEN:HBG011513 UniGene:Rn.34875 EMBL:AF187323 IPI:IPI00214897
            PIR:JC7183 RefSeq:NP_640355.1 UniGene:Rn.35820
            ProteinModelPortal:Q9QZE3 SMR:Q9QZE3 STRING:Q9QZE3 MEROPS:C01.039
            PRIDE:Q9QZE3 Ensembl:ENSRNOT00000024208 GeneID:246147
            KEGG:rno:246147 UCSC:RGD:631421 CTD:104002 InParanoid:Q9QZE3
            OMA:ESEDVLM OrthoDB:EOG4HHP48 NextBio:623425 Genevestigator:Q9QZE3
            GermOnline:ENSRNOG00000017946 Uniprot:Q9QZE3
        Length = 343

 Score = 290 (107.1 bits), Expect = 9.7e-25, P = 9.7e-25
 Identities = 78/253 (30%), Positives = 124/253 (49%)

Query:   200 HDKVYSSVEDLLRRHENFVTNVEKAEDYQSEDSGTAVFGVNKFFDLSESDLQQLTGLNLD 259
             ++K+YS  E++L+R   +  NV+K E +  E+S     G N +  +  +D   +T    D
Sbjct:    36 YEKLYSPEEEVLKRVV-WEENVKKIELHNRENS----LGKNTY-TMEINDFADMT----D 85

Query:   260 STLEDIQPSLQAPFSSNQTDTEMRAF-QF--NSLRHGDDLPEAFDWRAEGVISKVKEQGK 316
                +D+    Q P  + +     RA   F  NS    D LP+  DWR EG +++V++QG 
Sbjct:    86 EEFKDMIIGFQLPVHNTEKRLWKRALGSFFPNSWNWRDALPKFVDWRNEGYVTRVRKQGG 145

Query:   317 CACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSNG--GCNGGRMDDALQYIIDNG 374
             C+ CWAF   G +E     +   L  LSVQ L+DC    G  GC  G   +A QY++ NG
Sbjct:   146 CSSCWAFPVTGAIEGQMFKKTGKLIPLSVQNLIDCSKPQGNRGCLWGNTYNAFQYVLHNG 205

Query:   375 GVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMN 434
             G+ ++  YPY+  E   G               +  +P  E+  M   VAT+GP++ G++
Sbjct:   206 GLEAEATYPYERKE---GVCRYNPKNSSAKITGFVVLPESEDVLMDA-VATKGPIATGVH 261

Query:   435 --ANGLFYYSGGV 445
               ++   +Y  GV
Sbjct:   262 VISSSFRFYQKGV 274

 Score = 230 (86.0 bits), Expect = 8.0e-32, Sum P(2) = 8.0e-32
 Identities = 53/159 (33%), Positives = 84/159 (52%)

Query:   497 KLSRLATEKLVDCDMSNG--GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXX 554
             KL  L+ + L+DC    G  GC  G   +A QY++ NGG+ ++  YPY+  E   G    
Sbjct:   168 KLIPLSVQNLIDCSKPQGNRGCLWGNTYNAFQYVLHNGGLEAEATYPYERKE---GVCRY 224

Query:   555 XXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMN--ANGLFYYSGGVIDLNQRLCN 612
                        +  +P  E+  M   VAT+GP++ G++  ++   +Y  GV   ++  C+
Sbjct:   225 NPKNSSAKITGFVVLPESEDVLMDA-VATKGPIATGVHVISSSFRFYQKGVY--HEPKCS 281

Query:   613 PKAQNHALIIVGYG-EEEKKDGTSIPYWIVKNSWGSDWG 650
                 NHA+++VGYG E  + DG +  YW++KNSWG  WG
Sbjct:   282 SYV-NHAVLVVGYGFEGNETDGNN--YWLIKNSWGKRWG 317

 Score = 181 (68.8 bits), Expect = 8.0e-32, Sum P(2) = 8.0e-32
 Identities = 50/154 (32%), Positives = 78/154 (50%)

Query:    52 HDKVYSSVEDLLRRHENFVTNVEKAEDYQREDS---GTAVFEVNKFFDLSDSDLQQLTGL 108
             ++K+YS  E++L+R   +  NV+K E + RE+S    T   E+N F D++D + +     
Sbjct:    36 YEKLYSPEEEVLKRVV-WEENVKKIELHNRENSLGKNTYTMEINDFADMTDEEFK----- 89

Query:   109 NLDSTLEDIQPSLQAPFSSNQTDTEMRAF-QF--NSLRHGDDLPEAFDWRAEGVISKVKE 165
                    D+    Q P  + +     RA   F  NS    D LP+  DWR EG +++V++
Sbjct:    90 -------DMIIGFQLPVHNTEKRLWKRALGSFFPNSWNWRDALPKFVDWRNEGYVTRVRK 142

Query:   166 QGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQH 199
             QG C+ CWAF   G +E     +   L  LSVQ+
Sbjct:   143 QGGCSSCWAFPVTGAIEGQMFKKTGKLIPLSVQN 176

 Score = 91 (37.1 bits), Expect = 3.6e-16, Sum P(2) = 3.6e-16
 Identities = 34/144 (23%), Positives = 53/144 (36%)

Query:   355 NGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASES--ERGCLXXXXXXXXXXXXXYSRIP 412
             N GC  G   +A QY++ NGG+ ++  YPY+  E                      S   
Sbjct:   186 NRGCLWGNTYNAFQYVLHNGGLEAEATYPYERKEGVCRYNPKNSSAKITGFVVLPESEDV 245

Query:   413 YGEEEEMKKWVATRGPL---SVGMNANGLF-------YYSGGVIDLNQRLYGTSIP---Y 459
               +    K  +AT   +   S      G++       Y +  V+ +     G       Y
Sbjct:   246 LMDAVATKGPIATGVHVISSSFRFYQKGVYHEPKCSSYVNHAVLVVGYGFEGNETDGNNY 305

Query:   460 WIVKNSWGSDWGEKVEDKVGSSGN 483
             W++KNSWG  WG +   K+    N
Sbjct:   306 WLIKNSWGKRWGLRGYMKIAKDRN 329


>UNIPROTKB|A4IFS7 [details] [associations]
            symbol:CTSL1 "CTSL1 protein" species:9913 "Bos taurus"
            [GO:0097067 "cellular response to thyroid hormone stimulus"
            evidence=IEA] [GO:0042393 "histone binding" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0004197 "cysteine-type
            endopeptidase activity" evidence=IEA] [GO:0002250 "adaptive immune
            response" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0004197 GO:GO:0002250
            GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 GO:GO:0097067
            OrthoDB:EOG48PMKF MEROPS:C01.032 CTD:1514 EMBL:DAAA02023987
            EMBL:BC134741 IPI:IPI00708619 RefSeq:NP_001077155.1
            UniGene:Bt.23199 SMR:A4IFS7 Ensembl:ENSBTAT00000000962
            GeneID:515200 KEGG:bta:515200 InParanoid:A4IFS7 OMA:NDEQALM
            NextBio:20871707 Uniprot:A4IFS7
        Length = 333

 Score = 272 (100.8 bits), Expect = 8.6e-23, P = 8.6e-23
 Identities = 62/186 (33%), Positives = 97/186 (52%)

Query:   296 LPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSN 355
             +P + DWR +G ++ VK QGKC  CWAFSA G +E     +   L  LS Q LVDC    
Sbjct:   114 IPPSVDWREKGYVTPVKNQGKCGSCWAFSATGALEGQMFQKTGKLVSLSEQNLVDCSQPE 173

Query:   356 G--GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPY 413
             G  GC+GG +D+A QY++D GG+ S+++YPY        CL             +  +P 
Sbjct:   174 GNRGCHGGFIDNAFQYVLDVGGLDSEESYPYTGLVGT--CLYNPNNSAANETG-FVDLPK 230

Query:   414 GEEEEMKKWVATRGPLSVGMNA-NGLF-YYSGGVIDLNQRLYGTSIPYWIVKNSWGSDWG 471
              +E+ + K VA  GP+SV ++A N  F +Y  G+          S+ + ++   +G +  
Sbjct:   231 -QEKALMKAVANLGPISVAVDAHNPSFQFYKSGIY-YEPNCSSESVDHAVLVVGYGFEGA 288

Query:   472 EKVEDK 477
             +  ++K
Sbjct:   289 DSDDNK 294

 Score = 253 (94.1 bits), Expect = 8.1e-32, Sum P(2) = 8.1e-32
 Identities = 59/159 (37%), Positives = 88/159 (55%)

Query:   497 KLSRLATEKLVDCDMSNG--GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXX 554
             KL  L+ + LVDC    G  GC+GG +D+A QY++D GG+ S+++YPY        CL  
Sbjct:   157 KLVSLSEQNLVDCSQPEGNRGCHGGFIDNAFQYVLDVGGLDSEESYPYTGLVGT--CLYN 214

Query:   555 XXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANG---LFYYSGGVIDLNQRLC 611
                        +  +P  +E+ + K VA  GP+SV ++A+     FY SG   + N   C
Sbjct:   215 PNNSAANETG-FVDLPK-QEKALMKAVANLGPISVAVDAHNPSFQFYKSGIYYEPN---C 269

Query:   612 NPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWG 650
             + ++ +HA+++VGYG E   D     YW+VKNSWG  WG
Sbjct:   270 SSESVDHAVLVVGYGFEGA-DSDDNKYWLVKNSWGEHWG 307

 Score = 129 (50.5 bits), Expect = 8.1e-32, Sum P(2) = 8.1e-32
 Identities = 23/52 (44%), Positives = 31/52 (59%)

Query:   148 LPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQH 199
             +P + DWR +G ++ VK QGKC  CWAFSA G +E     +   L  LS Q+
Sbjct:   114 IPPSVDWREKGYVTPVKNQGKCGSCWAFSATGALEGQMFQKTGKLVSLSEQN 165

 Score = 72 (30.4 bits), Expect = 2.9e-08, Sum P(2) = 2.9e-08
 Identities = 10/13 (76%), Positives = 11/13 (84%)

Query:   459 YWIVKNSWGSDWG 471
             YW+VKNSWG  WG
Sbjct:   295 YWLVKNSWGEHWG 307


>TAIR|locus:2078312 [details] [associations]
            symbol:AT3G45310 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0005773 EMBL:CP002686
            GenomeReviews:BA000014_GR eggNOG:COG4870 HOGENOM:HOG000230774
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 EMBL:AL132953
            EMBL:AY091771 IPI:IPI00540369 PIR:T47471 RefSeq:NP_566880.1
            UniGene:At.25239 ProteinModelPortal:Q8RWQ9 SMR:Q8RWQ9
            MEROPS:C01.162 PaxDb:Q8RWQ9 PRIDE:Q8RWQ9 EnsemblPlants:AT3G45310.1
            GeneID:823669 KEGG:ath:AT3G45310 GeneFarm:5032 TAIR:At3g45310
            InParanoid:Q8RWQ9 KO:K01366 OMA:AFEVVHE PhylomeDB:Q8RWQ9
            ProtClustDB:CLSN2689015 Genevestigator:Q8RWQ9 Uniprot:Q8RWQ9
        Length = 358

 Score = 315 (115.9 bits), Expect = 8.4e-32, Sum P(2) = 8.4e-32
 Identities = 73/197 (37%), Positives = 102/197 (51%)

Query:   296 LPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAM-HAIQGNSLTELSVQQLVDC--D 352
             +P+  DWR +G++S VKEQG C  CW FS  G +EA  H   G  ++ LS QQLVDC   
Sbjct:   141 VPDTKDWREDGIVSPVKEQGHCGSCWTFSTTGALEAAYHQAFGKGIS-LSEQQLVDCAGT 199

Query:   353 MSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIP 412
              +N GC+GG    A +YI  NGG+ +++AYPY   +   GC               + I 
Sbjct:   200 FNNFGCHGGLPSQAFEYIKYNGGLDTEEAYPYTGKDG--GCKFSAKNIGVQVRDSVN-IT 256

Query:   413 YGEEEEMKKWVATRGPLSVGMNANGLF-YYSGGV----------IDLNQRL----YGTS- 456
              G E+E+K  V    P+SV       F +Y  GV          +D+N  +    YG   
Sbjct:   257 LGAEDELKHAVGLVRPVSVAFEVVHEFRFYKKGVFTSNTCGNTPMDVNHAVLAVGYGVED 316

Query:   457 -IPYWIVKNSWGSDWGE 472
              +PYW++KNSWG +WG+
Sbjct:   317 DVPYWLIKNSWGGEWGD 333

 Score = 241 (89.9 bits), Expect = 1.2e-30, Sum P(3) = 1.2e-30
 Identities = 55/154 (35%), Positives = 79/154 (51%)

Query:   501 LATEKLVDC--DMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXX 558
             L+ ++LVDC    +N GC+GG    A +YI  NGG+ +++AYPY   +   GC       
Sbjct:   188 LSEQQLVDCAGTFNNFGCHGGLPSQAFEYIKYNGGLDTEEAYPYTGKDG--GCKFSAKNI 245

Query:   559 XXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLF-YYSGGVIDLNQRLCNPKAQN 617
                     + I  G E+E+K  V    P+SV       F +Y  GV   N     P   N
Sbjct:   246 GVQVRDSVN-ITLGAEDELKHAVGLVRPVSVAFEVVHEFRFYKKGVFTSNTCGNTPMDVN 304

Query:   618 HALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGE 651
             HA++ VGYG E+      +PYW++KNSWG +WG+
Sbjct:   305 HAVLAVGYGVED-----DVPYWLIKNSWGGEWGD 333

 Score = 127 (49.8 bits), Expect = 1.2e-30, Sum P(3) = 1.2e-30
 Identities = 24/52 (46%), Positives = 32/52 (61%)

Query:   148 LPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAM-HAIQGNNLTELSVQ 198
             +P+  DWR +G++S VKEQG C  CW FS  G +EA  H   G  ++ LS Q
Sbjct:   141 VPDTKDWREDGIVSPVKEQGHCGSCWTFSTTGALEAAYHQAFGKGIS-LSEQ 191

 Score = 64 (27.6 bits), Expect = 8.4e-32, Sum P(2) = 8.4e-32
 Identities = 20/78 (25%), Positives = 35/78 (44%)

Query:    27 LESNIFQTRGYLNSPVTRFLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAEDYQREDSGT 86
             LE  + Q  G  +  V  F  F   + K Y SVE++  R   F  N++      ++    
Sbjct:    42 LEDTVVQILGQ-SRHVLSFSRFTHRYGKKYQSVEEMKLRFSVFKENLDLIRSTNKKGLSY 100

Query:    87 AVFEVNKFFDLSDSDLQQ 104
              +  +N+F DL+  + Q+
Sbjct:   101 KL-SLNQFADLTWQEFQR 117

 Score = 47 (21.6 bits), Expect = 5.3e-19, Sum P(2) = 5.3e-19
 Identities = 13/53 (24%), Positives = 25/53 (47%)

Query:   200 HDKVYSSVEDLLRRHENFVTNVEKAEDYQSEDSGTAVFGVNKFFDLSESDLQQ 252
             + K Y SVE++  R   F  N++       +     +  +N+F DL+  + Q+
Sbjct:    66 YGKKYQSVEEMKLRFSVFKENLDLIRSTNKKGLSYKL-SLNQFADLTWQEFQR 117


>ZFIN|ZDB-GENE-041010-76 [details] [associations]
            symbol:ctsll "cathepsin L, like" species:7955 "Danio
            rerio" [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0016787 "hydrolase activity"
            evidence=IEA] [GO:0008233 "peptidase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 ZFIN:ZDB-GENE-041010-76
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513
            OrthoDB:EOG48PMKF MEROPS:C01.032 EMBL:BX119902 IPI:IPI00616622
            UniGene:Dr.79994 SMR:A2BEM8 Ensembl:ENSDART00000144226
            InParanoid:A2BEM8 OMA:PRYSAAN Uniprot:A2BEM8
        Length = 337

 Score = 272 (100.8 bits), Expect = 6.4e-28, Sum P(2) = 6.4e-28
 Identities = 56/152 (36%), Positives = 84/152 (55%)

Query:   297 PEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCD--MS 354
             P+  DWR +G ++ +K+Q +C  CWAFS+ G +E     +   L  LS Q L+DC     
Sbjct:   117 PQQIDWRQKGYVTPIKDQKRCGSCWAFSSTGALEGQVFRKTGKLVSLSEQNLMDCSRPQG 176

Query:   355 NGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYG 414
             N GC+GG MD A QY+ DN G+ S+++YPY A++ ++ C              +  IP G
Sbjct:   177 NNGCDGGLMDQAFQYVQDNNGLDSEESYPYLATD-DQPC-HYDPRYSAANVTGFVDIPSG 234

Query:   415 EEEEMKKWVATRGPLSVGMNANG---LFYYSG 443
             +E  + K VA  GP++V ++A      FY SG
Sbjct:   235 KEHALMKAVAAVGPVAVAIDAGHESFQFYQSG 266

 Score = 264 (98.0 bits), Expect = 1.5e-31, Sum P(2) = 1.5e-31
 Identities = 57/160 (35%), Positives = 88/160 (55%)

Query:   497 KLSRLATEKLVDCD--MSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXX 554
             KL  L+ + L+DC     N GC+GG MD A QY+ DN G+ S+++YPY A++ ++ C   
Sbjct:   159 KLVSLSEQNLMDCSRPQGNNGCDGGLMDQAFQYVQDNNGLDSEESYPYLATD-DQPC-HY 216

Query:   555 XXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNAN--GLFYYSGGVIDLNQRLCN 612
                        +  IP G+E  + K VA  GP++V ++A      +Y  G+    ++ C+
Sbjct:   217 DPRYSAANVTGFVDIPSGKEHALMKAVAAVGPVAVAIDAGHESFQFYQSGIY--YEKACS 274

Query:   613 PKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
              +  +H +++VGYG E   D     YWIVKNSW   WG+K
Sbjct:   275 TEELDHGVLVVGYGYEGV-DVAGRRYWIVKNSWTDRWGDK 313

 Score = 129 (50.5 bits), Expect = 3.1e-08, Sum P(2) = 3.1e-08
 Identities = 53/201 (26%), Positives = 92/201 (45%)

Query:   200 HDKVYSSVEDLLRRHENFVTNVEKAEDYQSEDS-GTAVF--GVNKFFDLSESDLQQ-LTG 255
             H+K Y   E+  RR   +  N++K E +  E S G   F  G+N+F D++  + +Q + G
Sbjct:    36 HEKSYHEKEEGWRRMV-WEKNLKKIELHNLEHSVGKHTFRLGMNQFGDMTNEEFRQAMNG 94

Query:   256 LNLDSTLED-----IQPSL-QAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVIS 309
              N D   +      I+PS   AP    Q D   + +    ++        + + + G + 
Sbjct:    95 YNRDPNRKSKGSLFIEPSFFTAP---QQIDWRQKGY-VTPIKDQKRCGSCWAFSSTGAL- 149

Query:   310 KVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSNGGCNGGRMDDALQY 369
                 +G+      F   G    + ++   +L + S  Q       N GC+GG MD A QY
Sbjct:   150 ----EGQ-----VFRKTG---KLVSLSEQNLMDCSRPQ------GNNGCDGGLMDQAFQY 191

Query:   370 IIDNGGVVSDQAYPYKASESE 390
             + DN G+ S+++YPY A++ +
Sbjct:   192 VQDNNGLDSEESYPYLATDDQ 212

 Score = 115 (45.5 bits), Expect = 1.5e-31, Sum P(2) = 1.5e-31
 Identities = 19/51 (37%), Positives = 30/51 (58%)

Query:   149 PEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQH 199
             P+  DWR +G ++ +K+Q +C  CWAFS+ G +E     +   L  LS Q+
Sbjct:   117 PQQIDWRQKGYVTPIKDQKRCGSCWAFSSTGALEGQVFRKTGKLVSLSEQN 167

 Score = 72 (30.4 bits), Expect = 6.4e-28, Sum P(2) = 6.4e-28
 Identities = 11/15 (73%), Positives = 12/15 (80%)

Query:   459 YWIVKNSWGSDWGEK 473
             YWIVKNSW   WG+K
Sbjct:   299 YWIVKNSWTDRWGDK 313


>RGD|621513 [details] [associations]
            symbol:Ctss "cathepsin S" species:10116 "Rattus norvegicus"
            [GO:0001656 "metanephros development" evidence=IEP] [GO:0002250
            "adaptive immune response" evidence=ISO] [GO:0005764 "lysosome"
            evidence=IEA;ISO] [GO:0006508 "proteolysis" evidence=IEA;ISO]
            [GO:0008233 "peptidase activity" evidence=ISO] [GO:0008234
            "cysteine-type peptidase activity" evidence=IDA] [GO:0009986 "cell
            surface" evidence=IDA] [GO:0016020 "membrane" evidence=ISO]
            [GO:0043231 "intracellular membrane-bounded organelle"
            evidence=ISO] [GO:0045453 "bone resorption" evidence=IMP]
            [GO:0051930 "regulation of sensory perception of pain"
            evidence=IMP] [GO:0097067 "cellular response to thyroid hormone
            stimulus" evidence=ISO] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            RGD:621513 GO:GO:0009986 GO:GO:0051930 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 GO:GO:0045453
            GO:GO:0001656 HOVERGEN:HBG011513 CTD:1520 KO:K01368 MEROPS:I29.004
            BRENDA:3.4.22.27 EMBL:L03201 IPI:IPI00210228 PIR:A45087
            RefSeq:NP_059016.1 UniGene:Rn.11347 ProteinModelPortal:Q02765
            PhosphoSite:Q02765 PRIDE:Q02765 GeneID:50654 KEGG:rno:50654
            UCSC:RGD:621513 ChEMBL:CHEMBL1075217 NextBio:610462
            Genevestigator:Q02765 Uniprot:Q02765
        Length = 330

 Score = 349 (127.9 bits), Expect = 3.6e-31, P = 3.6e-31
 Identities = 79/198 (39%), Positives = 109/198 (55%)

Query:   296 LPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMS- 354
             LP++ DWR +G ++ VK QG C  CWAFSA G +E    ++   L  LS Q LVDC    
Sbjct:   113 LPDSVDWREKGCVTNVKYQGSCGSCWAFSAEGALEGQLKLKTGKLVSLSAQNLVDCSTEE 172

Query:   355 ---NGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRI 411
                N GC GG M +A QYIID   + S+ +YPYKA + +  CL             Y  +
Sbjct:   173 KYGNKGCGGGFMTEAFQYIIDTS-IDSEASYPYKAMDEK--CLYDPKNRAATCSR-YIEL 228

Query:   412 PYGEEEEMKKWVATRGPLSVGMNA---NGLFYYSGGVID-------LNQRL----YGT-- 455
             P+G+EE +K+ VAT+GP+SVG++    +  F Y  GV D       +N  +    YGT  
Sbjct:   229 PFGDEEALKEAVATKGPVSVGIDDASHSSFFLYQSGVYDDPSCTENMNHGVLVVGYGTLD 288

Query:   456 SIPYWIVKNSWGSDWGEK 473
                YW+VKNSWG  +G++
Sbjct:   289 GKDYWLVKNSWGLHFGDQ 306

 Score = 276 (102.2 bits), Expect = 3.2e-23, P = 3.2e-23
 Identities = 77/215 (35%), Positives = 110/215 (51%)

Query:   446 IDLNQRLYGTSIPYWIVKNSWGSDWGEKVEDKVGSSGNRTRDLEL-TGVLPSKLSRLATE 504
             +D  ++   T++ Y   + S GS W    E      G     L+L TG    KL  L+ +
Sbjct:   117 VDWREKGCVTNVKY---QGSCGSCWAFSAE------GALEGQLKLKTG----KLVSLSAQ 163

Query:   505 KLVDCDMS----NGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXX 560
              LVDC       N GC GG M +A QYIID   + S+ +YPYKA + +  CL        
Sbjct:   164 NLVDCSTEEKYGNKGCGGGFMTEAFQYIIDTS-IDSEASYPYKAMDEK--CLYDPKNRAA 220

Query:   561 XXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNA---NGLFYYSGGVIDLNQRLCNPKAQN 617
                  Y  +P+G+EE +K+ VAT+GP+SVG++    +  F Y  GV D     C     N
Sbjct:   221 TCSR-YIELPFGDEEALKEAVATKGPVSVGIDDASHSSFFLYQSGVYD--DPSCTEN-MN 276

Query:   618 HALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
             H +++VGYG  + KD     YW+VKNSWG  +G++
Sbjct:   277 HGVLVVGYGTLDGKD-----YWLVKNSWGLHFGDQ 306

 Score = 139 (54.0 bits), Expect = 2.6e-06, P = 2.6e-06
 Identities = 43/140 (30%), Positives = 66/140 (47%)

Query:    71 TNVEKAEDYQREDSGTAVFEVN-KFFDLSDSDLQQ-----LTGLNL--DSTLEDI---QP 119
             T + +  D   ED    ++E N KF  L + +          G+N   D T E++     
Sbjct:    32 TRMRRNTDQNEEDVRRLIWEKNLKFIMLHNLEHSMGMHSYSVGMNHMGDMTPEEVIGYMG 91

Query:   120 SLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVG 179
             SL+ P   N++ T     + +S      LP++ DWR +G ++ VK QG C  CWAFSA G
Sbjct:    92 SLRIPRPWNRSGT----LKSSS---NQTLPDSVDWREKGCVTNVKYQGSCGSCWAFSAEG 144

Query:   180 VVEAMHAIQGNNLTELSVQH 199
              +E    ++   L  LS Q+
Sbjct:   145 ALEGQLKLKTGKLVSLSAQN 164


>TAIR|locus:2128253 [details] [associations]
            symbol:AT4G11320 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            EMBL:CP002687 GenomeReviews:CT486007_GR EMBL:AL096882 EMBL:AL161531
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 OMA:ICHGADP
            HOGENOM:HOG000230773 KO:K01376 ProtClustDB:CLSN2689395
            EMBL:AY035055 EMBL:AY051062 IPI:IPI00520480 PIR:T13023
            RefSeq:NP_567377.1 UniGene:At.25206 ProteinModelPortal:Q9SUS9
            SMR:Q9SUS9 STRING:Q9SUS9 MEROPS:C01.A21 PaxDb:Q9SUS9 PRIDE:Q9SUS9
            EnsemblPlants:AT4G11320.1 GeneID:826734 KEGG:ath:AT4G11320
            TAIR:At4g11320 InParanoid:Q9SUS9 PhylomeDB:Q9SUS9
            Genevestigator:Q9SUS9 GermOnline:AT4G11320 Uniprot:Q9SUS9
        Length = 371

 Score = 349 (127.9 bits), Expect = 3.6e-31, P = 3.6e-31
 Identities = 101/290 (34%), Positives = 143/290 (49%)

Query:   200 HDKVYSSVEDLLRRHENFVTNVEKAEDYQSEDSGTAVFGVNKFFDLSESDLQQLT-GLNL 258
             H KVY SV +  RR   F  N+    +  +E+    + G+N+F DLS  +  ++  G   
Sbjct:    63 HGKVYDSVAEKERRLTIFEDNLRFITNRNAENLSYRL-GLNRFADLSLHEYGEICHGA-- 119

Query:   259 DSTLEDIQPSLQAPF--SSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVKEQGK 316
                  D +P     F  SSN+  T            GD LP++ DWR EG +++VK+QG 
Sbjct:   120 -----DPRPPRNHVFMTSSNRYKTS----------DGDVLPKSVDWRNEGAVTEVKDQGL 164

Query:   317 CACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSNGGCNGGRMDDALQYIIDNGGV 376
             C  CWAFS VG VE ++ I    L  LS Q L++C+  N GC GG+++ A ++I++NGG+
Sbjct:   165 CRSCWAFSTVGAVEGLNKIVTGELVTLSEQDLINCNKENNGCGGGKVETAYEFIMNNGGL 224

Query:   377 VSDQAYPYKASESE-RGCLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNA 435
              +D  YPYKA      G L             Y  +P  +E  + K VA +   +V  ++
Sbjct:   225 GTDNDYPYKALNGVCEGRLKEDNKNVMIDG--YENLPANDEAALMKAVAHQPVTAVVDSS 282

Query:   436 NGLFY-YSGGVID------LNQRL----YGTSI--PYWIVKNSWGSDWGE 472
             +  F  Y  GV D      LN  +    YGT     YWIVKNS G  WGE
Sbjct:   283 SREFQLYESGVFDGTCGTNLNHGVVVVGYGTENGRDYWIVKNSRGDTWGE 332

 Score = 227 (85.0 bits), Expect = 6.1e-30, Sum P(2) = 6.1e-30
 Identities = 55/164 (33%), Positives = 84/164 (51%)

Query:   490 LTGVLPSKLSRLATEKLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESE- 548
             L  ++  +L  L+ + L++C+  N GC GG+++ A ++I++NGG+ +D  YPYKA     
Sbjct:   180 LNKIVTGELVTLSEQDLINCNKENNGCGGGKVETAYEFIMNNGGLGTDNDYPYKALNGVC 239

Query:   549 RGCLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLFY-YSGGVIDLN 607
              G L             Y  +P  +E  + K VA +   +V  +++  F  Y  GV D  
Sbjct:   240 EGRLKEDNKNVMIDG--YENLPANDEAALMKAVAHQPVTAVVDSSSREFQLYESGVFD-- 295

Query:   608 QRLCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGE 651
                C     NH +++VGYG E  +D     YWIVKNS G  WGE
Sbjct:   296 -GTCGTNL-NHGVVVVGYGTENGRD-----YWIVKNSRGDTWGE 332

 Score = 172 (65.6 bits), Expect = 6.1e-30, Sum P(2) = 6.1e-30
 Identities = 56/168 (33%), Positives = 81/168 (48%)

Query:    35 RGYLNSPVT-RFLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAEDYQREDSGTAVFEVNK 93
             +G  ++  T  F ++M  H KVY SV +  RR   F  N+    +   E+    +  +N+
Sbjct:    45 QGIFDAEATLMFESWMVKHGKVYDSVAEKERRLTIFEDNLRFITNRNAENLSYRL-GLNR 103

Query:    94 FFDLSDSDLQQLT-GLNLDSTLEDIQPSLQAPF--SSNQTDTEMRAFQFNSLRHGDDLPE 150
             F DLS  +  ++  G        D +P     F  SSN+  T            GD LP+
Sbjct:   104 FADLSLHEYGEICHGA-------DPRPPRNHVFMTSSNRYKTS----------DGDVLPK 146

Query:   151 AFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
             + DWR EG +++VK+QG C  CWAFS VG VE ++ I    L  LS Q
Sbjct:   147 SVDWRNEGAVTEVKDQGLCRSCWAFSTVGAVEGLNKIVTGELVTLSEQ 194


>UNIPROTKB|F1RU48 [details] [associations]
            symbol:CTSF "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00139 PROSITE:PS00639
            OMA:LAPPEWD GeneTree:ENSGT00660000095458 EMBL:CU928034
            EMBL:FP565364 Ensembl:ENSSSCT00000014140 Ensembl:ENSSSCT00000014154
            Uniprot:F1RU48
        Length = 460

 Score = 299 (110.3 bits), Expect = 9.2e-30, Sum P(2) = 9.2e-30
 Identities = 75/232 (32%), Positives = 114/232 (49%)

Query:   259 DSTLEDIQPSLQAPFSSNQTDTEMR-AFQFNSLRHGDDLPEAFDWRAEGVISKVKEQGKC 317
             D T E+ +     P    +   +MR A   +SL      P  +DWR +G ++KVK+QG C
Sbjct:   214 DLTEEEFRTIYLNPLLQEEPGRKMRLAKSVSSLP-----PPEWDWRKKGAVTKVKDQGMC 268

Query:   318 ACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSNGGCNGGRMDDALQYIIDNGGVV 377
               CWAFS  G VE    ++  +L  LS Q+L+DCD  + GC GG   +A   I   GG+ 
Sbjct:   269 GSCWAFSVTGNVEGQWFLKQGTLLSLSEQELLDCDKVDKGCMGGLPSNAYSAIKTLGGLE 328

Query:   378 SDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANG 437
             +++ Y Y+                       S+     E+++  W+A +GP+SV +NA G
Sbjct:   329 TEEDYSYRGHLQTCSFNAEKAKVYINDSVELSQ----NEQKLAAWLAEKGPISVAINAFG 384

Query:   438 LFYYSGGV------------IDLNQRL--YG--TSIPYWIVKNSWGSDWGEK 473
             + +Y  G+            ID    L  YG  ++ P+W +KNSWG+DWGE+
Sbjct:   385 MQFYRHGISHPLRPLCSPWLIDHAVLLVGYGNRSATPFWAIKNSWGTDWGEE 436

 Score = 236 (88.1 bits), Expect = 3.6e-31, Sum P(3) = 3.6e-31
 Identities = 49/155 (31%), Positives = 83/155 (53%)

Query:   498 LSRLATEKLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXX 557
             L  L+ ++L+DCD  + GC GG   +A   I   GG+ +++ Y Y+              
Sbjct:   291 LLSLSEQELLDCDKVDKGCMGGLPSNAYSAIKTLGGLETEEDYSYRGHLQTCSFNAEKAK 350

Query:   558 XXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLFYYSGGVIDLNQRLCNPKAQN 617
                      S+     E+++  W+A +GP+SV +NA G+ +Y  G+    + LC+P   +
Sbjct:   351 VYINDSVELSQ----NEQKLAAWLAEKGPISVAINAFGMQFYRHGISHPLRPLCSPWLID 406

Query:   618 HALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
             HA+++VGYG       ++ P+W +KNSWG+DWGE+
Sbjct:   407 HAVLLVGYGNR-----SATPFWAIKNSWGTDWGEE 436

 Score = 131 (51.2 bits), Expect = 3.6e-31, Sum P(3) = 3.6e-31
 Identities = 32/89 (35%), Positives = 44/89 (49%)

Query:   111 DSTLEDIQPSLQAPFSSNQTDTEMR-AFQFNSLRHGDDLPEAFDWRAEGVISKVKEQGKC 169
             D T E+ +     P    +   +MR A   +SL      P  +DWR +G ++KVK+QG C
Sbjct:   214 DLTEEEFRTIYLNPLLQEEPGRKMRLAKSVSSLP-----PPEWDWRKKGAVTKVKDQGMC 268

Query:   170 ACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
               CWAFS  G VE    ++   L  LS Q
Sbjct:   269 GSCWAFSVTGNVEGQWFLKQGTLLSLSEQ 297

 Score = 94 (38.1 bits), Expect = 9.2e-30, Sum P(2) = 9.2e-30
 Identities = 19/57 (33%), Positives = 34/57 (59%)

Query:   197 VQHHDKVYSSVEDLLRRHENFVTNVEKAEDYQSEDSGTAVFGVNKFFDLSESDLQQL 253
             V  +++ Y + E+   R   F  N+ +A+  Q+ D+GTA +GV KF DL+E + + +
Sbjct:   167 VTTYNRTYDTKEEARWRMSVFANNMVRAQKIQALDTGTARYGVTKFSDLTEEEFRTI 223

 Score = 89 (36.4 bits), Expect = 3.6e-31, Sum P(3) = 3.6e-31
 Identities = 18/61 (29%), Positives = 34/61 (55%)

Query:    45 FLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAEDYQREDSGTAVFEVNKFFDLSDSDLQQ 104
             F  F+  +++ Y + E+   R   F  N+ +A+  Q  D+GTA + V KF DL++ + + 
Sbjct:   163 FKEFVTTYNRTYDTKEEARWRMSVFANNMVRAQKIQALDTGTARYGVTKFSDLTEEEFRT 222

Query:   105 L 105
             +
Sbjct:   223 I 223


>MGI|MGI:1861434 [details] [associations]
            symbol:Ctsf "cathepsin F" species:10090 "Mus musculus"
            [GO:0005764 "lysosome" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] [GO:0008152 "metabolic process" evidence=ISS]
            [GO:0008233 "peptidase activity" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=ISS] [GO:0016787
            "hydrolase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 MGI:MGI:1861434 eggNOG:COG4870
            HOGENOM:HOG000230774 KO:K01373 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0005764 HOVERGEN:HBG011513 MEROPS:C01.018
            CTD:8722 OMA:LAPPEWD OrthoDB:EOG4CC41T EMBL:AF136280 EMBL:AF217224
            EMBL:AJ131851 EMBL:AK075862 EMBL:BC058758 IPI:IPI00126769
            RefSeq:NP_063914.1 UniGene:Mm.29561 ProteinModelPortal:Q9R013
            SMR:Q9R013 STRING:Q9R013 PhosphoSite:Q9R013 PaxDb:Q9R013
            PRIDE:Q9R013 Ensembl:ENSMUST00000119694 GeneID:56464 KEGG:mmu:56464
            UCSC:uc008gbc.1 GeneTree:ENSGT00660000095458 InParanoid:Q9R013
            NextBio:312722 Bgee:Q9R013 CleanEx:MM_CTSF Genevestigator:Q9R013
            GermOnline:ENSMUSG00000006458 Uniprot:Q9R013
        Length = 462

 Score = 297 (109.6 bits), Expect = 6.1e-29, Sum P(2) = 6.1e-29
 Identities = 70/197 (35%), Positives = 100/197 (50%)

Query:   294 DDL-PEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCD 352
             +DL P  +DWR +G +++VK QG C  CWAFS  G VE    +   +L  LS Q+L+DCD
Sbjct:   246 NDLAPPEWDWRKKGAVTEVKNQGMCGSCWAFSVTGNVEGQWFLNRGTLLSLSEQELLDCD 305

Query:   353 MSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIP 412
               +  C GG   +A   I + GG+ ++  Y Y+                       SR  
Sbjct:   306 KVDKACLGGLPSNAYAAIKNLGGLETEDDYGYQGHVQTCNFSAQMAKVYINDSVELSR-- 363

Query:   413 YGEEEEMKKWVATRGPLSVGMNANGLFYYSGGV------------IDLNQRL--YG--TS 456
                E ++  W+A +GP+SV +NA G+ +Y  G+            ID    L  YG  ++
Sbjct:   364 --NENKIAAWLAQKGPISVAINAFGMQFYRHGIAHPFRPLCSPWFIDHAVLLVGYGNRSN 421

Query:   457 IPYWIVKNSWGSDWGEK 473
             IPYW +KNSWGSDWGE+
Sbjct:   422 IPYWAIKNSWGSDWGEE 438

 Score = 241 (89.9 bits), Expect = 4.9e-31, Sum P(3) = 4.9e-31
 Identities = 52/155 (33%), Positives = 82/155 (52%)

Query:   498 LSRLATEKLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXX 557
             L  L+ ++L+DCD  +  C GG   +A   I + GG+ ++  Y Y+              
Sbjct:   293 LLSLSEQELLDCDKVDKACLGGLPSNAYAAIKNLGGLETEDDYGYQGHVQTCNFSAQMAK 352

Query:   558 XXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLFYYSGGVIDLNQRLCNPKAQN 617
                      SR     E ++  W+A +GP+SV +NA G+ +Y  G+    + LC+P   +
Sbjct:   353 VYINDSVELSR----NENKIAAWLAQKGPISVAINAFGMQFYRHGIAHPFRPLCSPWFID 408

Query:   618 HALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
             HA+++VGYG       ++IPYW +KNSWGSDWGE+
Sbjct:   409 HAVLLVGYGNR-----SNIPYWAIKNSWGSDWGEE 438

 Score = 124 (48.7 bits), Expect = 4.9e-31, Sum P(3) = 4.9e-31
 Identities = 24/54 (44%), Positives = 31/54 (57%)

Query:   146 DDL-PEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
             +DL P  +DWR +G +++VK QG C  CWAFS  G VE    +    L  LS Q
Sbjct:   246 NDLAPPEWDWRKKGAVTEVKNQGMCGSCWAFSVTGNVEGQWFLNRGTLLSLSEQ 299

 Score = 89 (36.4 bits), Expect = 4.9e-31, Sum P(3) = 4.9e-31
 Identities = 19/61 (31%), Positives = 33/61 (54%)

Query:    45 FLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAEDYQREDSGTAVFEVNKFFDLSDSDLQQ 104
             F +FM  +++ Y S E+   R   F  N+ +A+  Q  D GTA + + KF DL++ +   
Sbjct:   165 FKDFMTTYNRTYESREEAQWRLTVFARNMIRAQKIQALDRGTAQYGITKFSDLTEEEFHT 224

Query:   105 L 105
             +
Sbjct:   225 I 225

 Score = 88 (36.0 bits), Expect = 5.0e-22, Sum P(2) = 5.0e-22
 Identities = 18/54 (33%), Positives = 31/54 (57%)

Query:   200 HDKVYSSVEDLLRRHENFVTNVEKAEDYQSEDSGTAVFGVNKFFDLSESDLQQL 253
             +++ Y S E+   R   F  N+ +A+  Q+ D GTA +G+ KF DL+E +   +
Sbjct:   172 YNRTYESREEAQWRLTVFARNMIRAQKIQALDRGTAQYGITKFSDLTEEEFHTI 225


>UNIPROTKB|E9PI30 [details] [associations]
            symbol:CTSW "Cathepsin W" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00639
            EMBL:AP001201 HGNC:HGNC:2546 IPI:IPI00984532
            ProteinModelPortal:E9PI30 SMR:E9PI30 Ensembl:ENST00000528419
            ArrayExpress:E9PI30 Bgee:E9PI30 Uniprot:E9PI30
        Length = 364

 Score = 279 (103.3 bits), Expect = 5.4e-31, Sum P(2) = 5.4e-31
 Identities = 73/247 (29%), Positives = 117/247 (47%)

Query:   201 DKVYSSVEDLLRRHENFVTNVEKAEDYQSEDSGTAVFGVNKFFDLSESDLQQLTGLNLDS 260
             ++ Y S E+   R + F  N+ +A+  Q ED GTA FGV  F DL+E +  QL G     
Sbjct:    50 NRSYLSPEEHAHRLDIFAHNLAQAQRLQEEDLGTAEFGVTPFSDLTEEEFGQLYGYR--R 107

Query:   261 TLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWR-AEGVISKVKEQGKCAC 319
                 + PS+     S + +              + +P + DWR     IS +K+Q  C C
Sbjct:   108 AAGGV-PSMGREIRSEEPE--------------ESVPFSCDWRKVASAISPIKDQKNCNC 152

Query:   320 CWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSD 379
             CWA +A G +E +  I      ++SVQ+L+DC     GC+GG + DA   +++N G+ S+
Sbjct:   153 CWAMAAAGNIETLWRISFWDFVDVSVQELLDCGRCGDGCHGGFVWDAFITVLNNSGLASE 212

Query:   380 QAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLF 439
             + YP++       C              +  +    E  + +++AT GP++V +N   L 
Sbjct:   213 KDYPFQGKVRAHRC-HPKKYQKVAWIQDFIMLQ-NNEHRIAQYLATYGPITVTINMKPLQ 270

Query:   440 YYSGGVI 446
              Y  GVI
Sbjct:   271 LYRKGVI 277

 Score = 224 (83.9 bits), Expect = 6.8e-26, Sum P(2) = 6.8e-26
 Identities = 48/168 (28%), Positives = 85/168 (50%)

Query:   501 LATEKLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXX 560
             ++ ++L+DC     GC+GG + DA   +++N G+ S++ YP++       C         
Sbjct:   176 VSVQELLDCGRCGDGCHGGFVWDAFITVLNNSGLASEKDYPFQGKVRAHRC-HPKKYQKV 234

Query:   561 XXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLFYYSGGVIDLNQRLCNPKAQNHAL 620
                  +  +    E  + +++AT GP++V +N   L  Y  GVI      C+P+  +H++
Sbjct:   235 AWIQDFIMLQ-NNEHRIAQYLATYGPITVTINMKPLQLYRKGVIKATPTTCDPQLVDHSV 293

Query:   621 IIVGYGEEEKKDGT---------------SIPYWIVKNSWGSDWGEKV 653
             ++VG+G  + ++G                  PYWI+KNSWG+ WGEKV
Sbjct:   294 LLVGFGSVKSEEGIWAETVSSQSQPQPPHPTPYWILKNSWGAQWGEKV 341

 Score = 136 (52.9 bits), Expect = 6.8e-26, Sum P(2) = 6.8e-26
 Identities = 44/154 (28%), Positives = 68/154 (44%)

Query:    45 FLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAEDYQREDSGTAVFEVNKFFDLSDSDLQQ 104
             F  F    ++ Y S E+   R + F  N+ +A+  Q ED GTA F V  F DL++ +  Q
Sbjct:    42 FKLFQIQFNRSYLSPEEHAHRLDIFAHNLAQAQRLQEEDLGTAEFGVTPFSDLTEEEFGQ 101

Query:   105 LTGLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVK 164
             L G         + PS+     S + +         S+    D       +    IS +K
Sbjct:   102 LYGYR--RAAGGV-PSMGREIRSEEPE--------ESVPFSCDWR-----KVASAISPIK 145

Query:   165 EQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
             +Q  C CCWA +A G +E +  I   +  ++SVQ
Sbjct:   146 DQKNCNCCWAMAAAGNIETLWRISFWDFVDVSVQ 179

 Score = 94 (38.1 bits), Expect = 5.4e-31, Sum P(2) = 5.4e-31
 Identities = 14/17 (82%), Positives = 16/17 (94%)

Query:   458 PYWIVKNSWGSDWGEKV 474
             PYWI+KNSWG+ WGEKV
Sbjct:   325 PYWILKNSWGAQWGEKV 341


>UNIPROTKB|P83654 [details] [associations]
            symbol:P83654 "Ervatamin-C" species:52861 "Tabernaemontana
            divaricata" [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=IDA] [GO:0006508 "proteolysis" evidence=IDA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0005576 GO:GO:0006508
            InterPro:IPR025661 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0004197 PDB:1O0E PDB:2PNS
            PDBsum:1O0E PDBsum:2PNS MEROPS:C01.116 EvolutionaryTrace:P83654
            Uniprot:P83654
        Length = 208

 Score = 347 (127.2 bits), Expect = 5.9e-31, P = 5.9e-31
 Identities = 76/188 (40%), Positives = 103/188 (54%)

Query:   296 LPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSN 355
             LPE  DWR +G ++ VK QG C  CWAFS V  VE+++ I+  +L  LS Q+LVDCD  N
Sbjct:     1 LPEQIDWRKKGAVTPVKNQGSCGSCWAFSTVSTVESINQIRTGNLISLSEQELVDCDKKN 60

Query:   356 GGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYGE 415
              GC GG    A QYII+NGG+ +   YPYKA +    C              Y+ +P+  
Sbjct:    61 HGCLGGAFVFAYQYIINNGGIDTQANYPYKAVQGP--C---QAASKVVSIDGYNGVPFCN 115

Query:   416 EEEMKKWVATRGPLSVGMNANGLFY--YSGGVID------LNQ--RLYGTSIPYWIVKNS 465
             E  +K+ VA + P +V ++A+   +  YS G+        LN    + G    YWIV+NS
Sbjct:   116 EXALKQAVAVQ-PSTVAIDASSAQFQQYSSGIFSGPCGTKLNHGVTIVGYQANYWIVRNS 174

Query:   466 WGSDWGEK 473
             WG  WGEK
Sbjct:   175 WGRYWGEK 182

 Score = 179 (68.1 bits), Expect = 9.2e-13, P = 9.2e-13
 Identities = 60/168 (35%), Positives = 81/168 (48%)

Query:   462 VKN--SWGSDWGEKVEDKVGSSGNRTRDLELTGVLPSKLSRLATEKLVDCDMSNGGCNGG 519
             VKN  S GS W       V S  N+ R    TG L S    L+ ++LVDCD  N GC GG
Sbjct:    16 VKNQGSCGSCWAFSTVSTVESI-NQIR----TGNLIS----LSEQELVDCDKKNHGCLGG 66

Query:   520 RMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYGEEEEMKK 579
                 A QYII+NGG+ +   YPYKA +    C              Y+ +P+  E  +K+
Sbjct:    67 AFVFAYQYIINNGGIDTQANYPYKAVQGP--C---QAASKVVSIDGYNGVPFCNEXALKQ 121

Query:   580 WVATRGPLSVGMNANGLFY--YSGGVIDLNQRLCNPKAQNHALIIVGY 625
              VA + P +V ++A+   +  YS G+       C  K  NH + IVGY
Sbjct:   122 AVAVQ-PSTVAIDASSAQFQQYSSGIFS---GPCGTKL-NHGVTIVGY 164

 Score = 146 (56.5 bits), Expect = 2.8e-17, Sum P(2) = 2.8e-17
 Identities = 26/51 (50%), Positives = 33/51 (64%)

Query:   148 LPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
             LPE  DWR +G ++ VK QG C  CWAFS V  VE+++ I+  NL  LS Q
Sbjct:     1 LPEQIDWRKKGAVTPVKNQGSCGSCWAFSTVSTVESINQIRTGNLISLSEQ 51

 Score = 119 (46.9 bits), Expect = 2.8e-17, Sum P(2) = 2.8e-17
 Identities = 32/89 (35%), Positives = 46/89 (51%)

Query:   566 YSRIPYGEEEEMKKWVATRGPLSVGMNANGLFY--YSGGVIDLNQRLCNPKAQNHALIIV 623
             Y+ +P+  E  +K+ VA + P +V ++A+   +  YS G+       C  K  NH + IV
Sbjct:   108 YNGVPFCNEXALKQAVAVQ-PSTVAIDASSAQFQQYSSGIFS---GPCGTKL-NHGVTIV 162

Query:   624 GYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
             GY            YWIV+NSWG  WGEK
Sbjct:   163 GYQAN---------YWIVRNSWGRYWGEK 182


>ZFIN|ZDB-GENE-080215-7 [details] [associations]
            symbol:zgc:174153 "zgc:174153" species:7955 "Danio
            rerio" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0016787 "hydrolase activity"
            evidence=IEA] [GO:0008233 "peptidase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 ZFIN:ZDB-GENE-080215-7
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00560000076577 EMBL:BX000534 EMBL:BX322603
            IPI:IPI00483644 Ensembl:ENSDART00000113654 OMA:ITLCISA Bgee:F1R8Y0
            Uniprot:F1R8Y0
        Length = 336

 Score = 266 (98.7 bits), Expect = 6.2e-31, Sum P(2) = 6.2e-31
 Identities = 60/160 (37%), Positives = 86/160 (53%)

Query:   497 KLSRLATEKLVDCDMSNG--GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXX 554
             KL  ++ + LVDC    G  GCNGG MD A QY+ +N G+ S+Q+YPY A + +  C   
Sbjct:   158 KLISMSEQNLVDCSRPQGNQGCNGGLMDQAFQYVKENKGLDSEQSYPYLARD-DLPC-RY 215

Query:   555 XXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNAN--GLFYYSGGVIDLNQRLCN 612
                        +  IP G E  +   VA  GP+SV ++A+   L +Y  G+    +R C+
Sbjct:   216 DPRFNVAKITGFVDIPSGNEPALMNAVAAVGPVSVAIDASHQSLQFYQSGIY--YERACS 273

Query:   613 PKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
                 +HA+++VGYG +   D     YWIVKNSW   WG+K
Sbjct:   274 SSRLDHAVLVVGYGYQGA-DVAGNRYWIVKNSWSDKWGDK 312

 Score = 261 (96.9 bits), Expect = 4.9e-27, Sum P(2) = 4.9e-27
 Identities = 56/153 (36%), Positives = 81/153 (52%)

Query:   297 PEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSNG 356
             P+  DWR  G ++ VK+Q +C  CW+FS+ G +E     +   L  +S Q LVDC    G
Sbjct:   116 PQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQG 175

Query:   357 --GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYG 414
               GCNGG MD A QY+ +N G+ S+Q+YPY A + +  C              +  IP G
Sbjct:   176 NQGCNGGLMDQAFQYVKENKGLDSEQSYPYLARD-DLPC-RYDPRFNVAKITGFVDIPSG 233

Query:   415 EEEEMKKWVATRGPLSVGMNAN--GLFYYSGGV 445
              E  +   VA  GP+SV ++A+   L +Y  G+
Sbjct:   234 NEPALMNAVAAVGPVSVAIDASHQSLQFYQSGI 266

 Score = 125 (49.1 bits), Expect = 4.2e-08, Sum P(2) = 4.2e-08
 Identities = 52/198 (26%), Positives = 85/198 (42%)

Query:   198 QHHDKVYSSVEDLLRRHENFVTNVEKAEDYQSEDS-GTAVF--GVNKFFDLSESDLQQ-L 253
             QH    +  VE  + R   +  N+ K E +  E S G   F  G+N+F D++  + +Q +
Sbjct:    34 QHGKSYHEDVE--VGRRMIWEENLRKIEQHNFEYSYGNHTFKMGMNQFGDMTNEEFRQAM 91

Query:   254 TGLNLDSTLEDIQPSLQAP--FSS-NQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISK 310
              G   D       P    P  F++  Q D   R +    ++        + + + G +  
Sbjct:    92 NGYKHDPNQTSQGPLFMEPSFFAAPQQVDWRQRGY-VTPVKDQKQCGSCWSFSSTGAL-- 148

Query:   311 VKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSNGGCNGGRMDDALQYI 370
                +G+      F   G + +M      +L + S  Q       N GCNGG MD A QY+
Sbjct:   149 ---EGQL-----FRKTGKLISMSE---QNLVDCSRPQ------GNQGCNGGLMDQAFQYV 191

Query:   371 IDNGGVVSDQAYPYKASE 388
              +N G+ S+Q+YPY A +
Sbjct:   192 KENKGLDSEQSYPYLARD 209

 Score = 107 (42.7 bits), Expect = 6.2e-31, Sum P(2) = 6.2e-31
 Identities = 18/51 (35%), Positives = 29/51 (56%)

Query:   149 PEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQH 199
             P+  DWR  G ++ VK+Q +C  CW+FS+ G +E     +   L  +S Q+
Sbjct:   116 PQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQN 166

 Score = 75 (31.5 bits), Expect = 4.9e-27, Sum P(2) = 4.9e-27
 Identities = 11/15 (73%), Positives = 12/15 (80%)

Query:   459 YWIVKNSWGSDWGEK 473
             YWIVKNSW   WG+K
Sbjct:   298 YWIVKNSWSDKWGDK 312


>GENEDB_PFALCIPARUM|PF11_0165 [details] [associations]
            symbol:PF11_0165 "falcipain 2 precursor"
            species:5833 "Plasmodium falciparum" [GO:0020020 "food vacuole"
            evidence=TAS] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00139
            PROSITE:PS00639 EMBL:AE014186 HSSP:O65039 GO:GO:0020020
            RefSeq:XP_001347836.1 ProteinModelPortal:Q8I6U4 SMR:Q8I6U4
            IntAct:Q8I6U4 MINT:MINT-1559493 MEROPS:C01.046
            EnsemblProtists:PF11_0165:mRNA GeneID:810712 KEGG:pfa:PF11_0165
            EuPathDB:PlasmoDB:PF3D7_1115700 HOGENOM:HOG000065857 OMA:NESLHAN
            ProtClustDB:PTZ00021 BindingDB:Q8I6U4 ChEMBL:CHEMBL3470
            Uniprot:Q8I6U4
        Length = 484

 Score = 294 (108.6 bits), Expect = 1.6e-26, Sum P(2) = 1.6e-26
 Identities = 70/252 (27%), Positives = 128/252 (50%)

Query:   197 VQHHDKVYSSVEDLLRRHENFVTNVEKAEDYQSEDSGTAVFGVNKFFDLSESDLQQLTGL 256
             ++ ++K Y+S  ++  R + F+ N  K   + +  +      +N+F DL+  + +     
Sbjct:   169 IKTNNKQYNSPNEMKERFQVFLQNAHKVNMHNNNKNSLYKKELNRFADLTYHEFK----- 223

Query:   257 NLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVKEQGK 316
             N   +L   +P   + +  +Q + E    ++    + D    A+DWR    ++ VK+Q  
Sbjct:   224 NKYLSLRSSKPLKNSKYLLDQMNYEEVIKKYKGNENFDHA--AYDWRLHSGVTPVKDQKN 281

Query:   317 CACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSNGGCNGGRMDDALQYIIDNGGV 376
             C  CWAFS++G VE+ +AI+ N L  LS Q+LVDC   N GCNGG +++A + +I+ GG+
Sbjct:   282 CGSCWAFSSIGSVESQYAIRKNKLITLSEQELVDCSFKNYGCNGGLINNAFEDMIELGGI 341

Query:   377 VSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNAN 436
              +D  YPY  S++   C              Y  +P   + ++K+ +   GP+S+ +  +
Sbjct:   342 CTDDDYPY-VSDAPNLC-NIDRCTEKYGIKNYLSVP---DNKLKEALRFLGPISISVAVS 396

Query:   437 GLF-YYSGGVID 447
               F +Y  G+ D
Sbjct:   397 DDFAFYKEGIFD 408

 Score = 227 (85.0 bits), Expect = 9.7e-31, Sum P(2) = 9.7e-31
 Identities = 53/163 (32%), Positives = 88/163 (53%)

Query:   496 SKLSRLATEKLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXX 555
             +KL  L+ ++LVDC   N GCNGG +++A + +I+ GG+ +D  YPY  S++   C    
Sbjct:   303 NKLITLSEQELVDCSFKNYGCNGGLINNAFEDMIELGGICTDDDYPY-VSDAPNLC-NID 360

Query:   556 XXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLF-YYSGGVIDLNQRLCNPK 614
                       Y  +P   + ++K+ +   GP+S+ +  +  F +Y  G+ D     C  +
Sbjct:   361 RCTEKYGIKNYLSVP---DNKLKEALRFLGPISISVAVSDDFAFYKEGIFDGE---CGDQ 414

Query:   615 AQNHALIIVGYGEEE-----KKDGTSIPYWIVKNSWGSDWGEK 652
               NHA+++VG+G +E      K G    Y+I+KNSWG  WGE+
Sbjct:   415 L-NHAVMLVGFGMKEIVNPLTKKGEKHYYYIIKNSWGQQWGER 456

 Score = 188 (71.2 bits), Expect = 9.7e-31, Sum P(2) = 9.7e-31
 Identities = 46/171 (26%), Positives = 85/171 (49%)

Query:    30 NIFQTRGYLNSP--VTRFLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAEDYQREDSGTA 87
             N F  +  +N+   + +F  F++ ++K Y+S  ++  R + F+ N  K   +    +   
Sbjct:   148 NFFDNKFLMNNAEHINQFYMFIKTNNKQYNSPNEMKERFQVFLQNAHKVNMHNNNKNSLY 207

Query:    88 VFEVNKFFDLSDSDLQQLTGLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDD 147
               E+N+F DL+  + +     N   +L   +P   + +  +Q + E    ++    + D 
Sbjct:   208 KKELNRFADLTYHEFK-----NKYLSLRSSKPLKNSKYLLDQMNYEEVIKKYKGNENFDH 262

Query:   148 LPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
                A+DWR    ++ VK+Q  C  CWAFS++G VE+ +AI+ N L  LS Q
Sbjct:   263 A--AYDWRLHSGVTPVKDQKNCGSCWAFSSIGSVESQYAIRKNKLITLSEQ 311

 Score = 72 (30.4 bits), Expect = 1.6e-26, Sum P(2) = 1.6e-26
 Identities = 10/15 (66%), Positives = 13/15 (86%)

Query:   459 YWIVKNSWGSDWGEK 473
             Y+I+KNSWG  WGE+
Sbjct:   442 YYIIKNSWGQQWGER 456


>UNIPROTKB|Q8I6U4 [details] [associations]
            symbol:PF11_0165 "Falcipain-2A" species:36329 "Plasmodium
            falciparum 3D7" [GO:0020020 "food vacuole" evidence=TAS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00139 PROSITE:PS00639
            EMBL:AE014186 HSSP:O65039 GO:GO:0020020 RefSeq:XP_001347836.1
            ProteinModelPortal:Q8I6U4 SMR:Q8I6U4 IntAct:Q8I6U4
            MINT:MINT-1559493 MEROPS:C01.046 EnsemblProtists:PF11_0165:mRNA
            GeneID:810712 KEGG:pfa:PF11_0165 EuPathDB:PlasmoDB:PF3D7_1115700
            HOGENOM:HOG000065857 OMA:NESLHAN ProtClustDB:PTZ00021
            BindingDB:Q8I6U4 ChEMBL:CHEMBL3470 Uniprot:Q8I6U4
        Length = 484

 Score = 294 (108.6 bits), Expect = 1.6e-26, Sum P(2) = 1.6e-26
 Identities = 70/252 (27%), Positives = 128/252 (50%)

Query:   197 VQHHDKVYSSVEDLLRRHENFVTNVEKAEDYQSEDSGTAVFGVNKFFDLSESDLQQLTGL 256
             ++ ++K Y+S  ++  R + F+ N  K   + +  +      +N+F DL+  + +     
Sbjct:   169 IKTNNKQYNSPNEMKERFQVFLQNAHKVNMHNNNKNSLYKKELNRFADLTYHEFK----- 223

Query:   257 NLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVKEQGK 316
             N   +L   +P   + +  +Q + E    ++    + D    A+DWR    ++ VK+Q  
Sbjct:   224 NKYLSLRSSKPLKNSKYLLDQMNYEEVIKKYKGNENFDHA--AYDWRLHSGVTPVKDQKN 281

Query:   317 CACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSNGGCNGGRMDDALQYIIDNGGV 376
             C  CWAFS++G VE+ +AI+ N L  LS Q+LVDC   N GCNGG +++A + +I+ GG+
Sbjct:   282 CGSCWAFSSIGSVESQYAIRKNKLITLSEQELVDCSFKNYGCNGGLINNAFEDMIELGGI 341

Query:   377 VSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNAN 436
              +D  YPY  S++   C              Y  +P   + ++K+ +   GP+S+ +  +
Sbjct:   342 CTDDDYPY-VSDAPNLC-NIDRCTEKYGIKNYLSVP---DNKLKEALRFLGPISISVAVS 396

Query:   437 GLF-YYSGGVID 447
               F +Y  G+ D
Sbjct:   397 DDFAFYKEGIFD 408

 Score = 227 (85.0 bits), Expect = 9.7e-31, Sum P(2) = 9.7e-31
 Identities = 53/163 (32%), Positives = 88/163 (53%)

Query:   496 SKLSRLATEKLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXX 555
             +KL  L+ ++LVDC   N GCNGG +++A + +I+ GG+ +D  YPY  S++   C    
Sbjct:   303 NKLITLSEQELVDCSFKNYGCNGGLINNAFEDMIELGGICTDDDYPY-VSDAPNLC-NID 360

Query:   556 XXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLF-YYSGGVIDLNQRLCNPK 614
                       Y  +P   + ++K+ +   GP+S+ +  +  F +Y  G+ D     C  +
Sbjct:   361 RCTEKYGIKNYLSVP---DNKLKEALRFLGPISISVAVSDDFAFYKEGIFDGE---CGDQ 414

Query:   615 AQNHALIIVGYGEEE-----KKDGTSIPYWIVKNSWGSDWGEK 652
               NHA+++VG+G +E      K G    Y+I+KNSWG  WGE+
Sbjct:   415 L-NHAVMLVGFGMKEIVNPLTKKGEKHYYYIIKNSWGQQWGER 456

 Score = 188 (71.2 bits), Expect = 9.7e-31, Sum P(2) = 9.7e-31
 Identities = 46/171 (26%), Positives = 85/171 (49%)

Query:    30 NIFQTRGYLNSP--VTRFLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAEDYQREDSGTA 87
             N F  +  +N+   + +F  F++ ++K Y+S  ++  R + F+ N  K   +    +   
Sbjct:   148 NFFDNKFLMNNAEHINQFYMFIKTNNKQYNSPNEMKERFQVFLQNAHKVNMHNNNKNSLY 207

Query:    88 VFEVNKFFDLSDSDLQQLTGLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDD 147
               E+N+F DL+  + +     N   +L   +P   + +  +Q + E    ++    + D 
Sbjct:   208 KKELNRFADLTYHEFK-----NKYLSLRSSKPLKNSKYLLDQMNYEEVIKKYKGNENFDH 262

Query:   148 LPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
                A+DWR    ++ VK+Q  C  CWAFS++G VE+ +AI+ N L  LS Q
Sbjct:   263 A--AYDWRLHSGVTPVKDQKNCGSCWAFSSIGSVESQYAIRKNKLITLSEQ 311

 Score = 72 (30.4 bits), Expect = 1.6e-26, Sum P(2) = 1.6e-26
 Identities = 10/15 (66%), Positives = 13/15 (86%)

Query:   459 YWIVKNSWGSDWGEK 473
             Y+I+KNSWG  WGE+
Sbjct:   442 YYIIKNSWGQQWGER 456


>ZFIN|ZDB-GENE-980526-285 [details] [associations]
            symbol:ctsl1b "cathepsin L, 1 b" species:7955
            "Danio rerio" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0005576
            "extracellular region" evidence=IDA] [GO:0008233 "peptidase
            activity" evidence=IEA] [GO:0016787 "hydrolase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            ZFIN:ZDB-GENE-980526-285 GO:GO:0005576 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            EMBL:BX465840 IPI:IPI00498443 Ensembl:ENSDART00000145570
            Bgee:F1R7B3 Uniprot:F1R7B3
        Length = 352

 Score = 264 (98.0 bits), Expect = 1.0e-30, Sum P(2) = 1.0e-30
 Identities = 60/160 (37%), Positives = 86/160 (53%)

Query:   497 KLSRLATEKLVDCDMSNG--GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXX 554
             KL  ++ + LVDC    G  GCNGG MD A QY+ +N G+ S+Q+YPY A + +  C   
Sbjct:   174 KLISMSEQNLVDCSRPQGNQGCNGGLMDQAFQYVKENKGLDSEQSYPYLARD-DLPC-RY 231

Query:   555 XXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNAN--GLFYYSGGVIDLNQRLCN 612
                        +  IP G E  +   VA  GP+SV ++A+   L +Y  G+    +R C+
Sbjct:   232 DPRFNVAKITGFVDIPSGNELALMNAVAAVGPVSVAIDASHQSLQFYQSGIY--YERACS 289

Query:   613 PKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
                 +HA+++VGYG +   D     YWIVKNSW   WG+K
Sbjct:   290 SSRLDHAVLVVGYGYQGA-DVAGNRYWIVKNSWSDKWGDK 328

 Score = 259 (96.2 bits), Expect = 8.0e-27, Sum P(2) = 8.0e-27
 Identities = 56/153 (36%), Positives = 81/153 (52%)

Query:   297 PEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSNG 356
             P+  DWR  G ++ VK+Q +C  CW+FS+ G +E     +   L  +S Q LVDC    G
Sbjct:   132 PQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQG 191

Query:   357 --GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYG 414
               GCNGG MD A QY+ +N G+ S+Q+YPY A + +  C              +  IP G
Sbjct:   192 NQGCNGGLMDQAFQYVKENKGLDSEQSYPYLARD-DLPC-RYDPRFNVAKITGFVDIPSG 249

Query:   415 EEEEMKKWVATRGPLSVGMNAN--GLFYYSGGV 445
              E  +   VA  GP+SV ++A+   L +Y  G+
Sbjct:   250 NELALMNAVAAVGPVSVAIDASHQSLQFYQSGI 282

 Score = 125 (49.1 bits), Expect = 5.0e-08, Sum P(2) = 5.0e-08
 Identities = 52/198 (26%), Positives = 85/198 (42%)

Query:   198 QHHDKVYSSVEDLLRRHENFVTNVEKAEDYQSEDS-GTAVF--GVNKFFDLSESDLQQ-L 253
             QH    +  VE  + R   +  N+ K E +  E S G   F  G+N+F D++  + +Q +
Sbjct:    50 QHGKSYHEDVE--VGRRMIWEENLRKIEQHNFEYSYGNHTFKMGMNQFGDMTNEEFRQAM 107

Query:   254 TGLNLDSTLEDIQPSLQAP--FSS-NQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISK 310
              G   D       P    P  F++  Q D   R +    ++        + + + G +  
Sbjct:   108 NGYTHDPNQTSQGPLFMEPSFFAAPQQVDWRQRGY-VTPVKDQKQCGSCWSFSSTGAL-- 164

Query:   311 VKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSNGGCNGGRMDDALQYI 370
                +G+      F   G + +M      +L + S  Q       N GCNGG MD A QY+
Sbjct:   165 ---EGQL-----FRKTGKLISMSE---QNLVDCSRPQ------GNQGCNGGLMDQAFQYV 207

Query:   371 IDNGGVVSDQAYPYKASE 388
              +N G+ S+Q+YPY A +
Sbjct:   208 KENKGLDSEQSYPYLARD 225

 Score = 107 (42.7 bits), Expect = 1.0e-30, Sum P(2) = 1.0e-30
 Identities = 18/51 (35%), Positives = 29/51 (56%)

Query:   149 PEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQH 199
             P+  DWR  G ++ VK+Q +C  CW+FS+ G +E     +   L  +S Q+
Sbjct:   132 PQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQN 182

 Score = 75 (31.5 bits), Expect = 8.0e-27, Sum P(2) = 8.0e-27
 Identities = 11/15 (73%), Positives = 12/15 (80%)

Query:   459 YWIVKNSWGSDWGEK 473
             YWIVKNSW   WG+K
Sbjct:   314 YWIVKNSWSDKWGDK 328


>ZFIN|ZDB-GENE-040426-1583 [details] [associations]
            symbol:ctssa "cathepsin S, a" species:7955 "Danio
            rerio" [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0016787 "hydrolase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 ZFIN:ZDB-GENE-040426-1583
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 HOVERGEN:HBG011513
            GeneTree:ENSGT00660000095458 EMBL:CR548627 IPI:IPI00491948
            UniGene:Dr.81560 SMR:Q1L8W8 Ensembl:ENSDART00000053638 OMA:RNTREER
            OrthoDB:EOG480HX9 Uniprot:Q1L8W8
        Length = 328

 Score = 302 (111.4 bits), Expect = 4.8e-26, P = 4.8e-26
 Identities = 86/266 (32%), Positives = 126/266 (47%)

Query:   250 LQQLTGLNLDSTLEDIQPSLQAPFSS-NQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVI 308
             L QL+ +  D  + D+   L+  F   N T      F   SL+    LP+  +W   G++
Sbjct:    76 LNQLSDMTADE-VNDMNGLLEEDFPDVNAT------FSPPSLQ---TLPQRVNWTEHGMV 125

Query:   309 SKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMS--NGGCNGGRMDDA 366
             S V+ QG C  CWAFSAVG +EA    +  +L  LS Q L+DC +S  N GC GG +  A
Sbjct:   126 SPVQNQGPCGSCWAFSAVGSLEAQMKRRTAALVPLSAQNLLDCSVSLGNRGCKGGFLSRA 185

Query:   367 LQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATR 426
               Y+I N G+ S   YPY   E + G               +  +P   E  ++  VA  
Sbjct:   186 FLYVIQNRGIDSSTFYPY---EHKEGVCRYSVSGRAGYCTGFRIVPRHNEAALQSAVANI 242

Query:   427 GPLSVGMNANGLFY--YSGGVID--------LNQRL----YGTSI--PYWIVKNSWGSDW 470
             GP+SVG+NA  L +  Y  G+ +        +N  +    YG+     YW+VKNSWG+ W
Sbjct:   243 GPVSVGINAKLLSFHRYRSGIYNDPKCSSALINHAVLVVGYGSENGQDYWLVKNSWGTAW 302

Query:   471 GEKVEDKVGSSGNRTRDLELTGVLPS 496
             GE    ++  + N    +   G+ P+
Sbjct:   303 GENGYIRMARNKNMC-GISSFGIYPT 327

 Score = 251 (93.4 bits), Expect = 1.2e-30, Sum P(2) = 1.2e-30
 Identities = 57/155 (36%), Positives = 80/155 (51%)

Query:   501 LATEKLVDCDMS--NGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXX 558
             L+ + L+DC +S  N GC GG +  A  Y+I N G+ S   YPY   E + G        
Sbjct:   160 LSAQNLLDCSVSLGNRGCKGGFLSRAFLYVIQNRGIDSSTFYPY---EHKEGVCRYSVSG 216

Query:   559 XXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLFY--YSGGVIDLNQRLCNPKAQ 616
                    +  +P   E  ++  VA  GP+SVG+NA  L +  Y  G+   N   C+    
Sbjct:   217 RAGYCTGFRIVPRHNEAALQSAVANIGPVSVGINAKLLSFHRYRSGIY--NDPKCSSALI 274

Query:   617 NHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGE 651
             NHA+++VGYG E  +D     YW+VKNSWG+ WGE
Sbjct:   275 NHAVLVVGYGSENGQD-----YWLVKNSWGTAWGE 304

 Score = 120 (47.3 bits), Expect = 1.2e-30, Sum P(2) = 1.2e-30
 Identities = 35/99 (35%), Positives = 49/99 (49%)

Query:   102 LQQLTGLNLDSTLEDIQPSLQAPFSS-NQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVI 160
             L QL+ +  D  + D+   L+  F   N T      F   SL+    LP+  +W   G++
Sbjct:    76 LNQLSDMTADE-VNDMNGLLEEDFPDVNAT------FSPPSLQ---TLPQRVNWTEHGMV 125

Query:   161 SKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQH 199
             S V+ QG C  CWAFSAVG +EA    +   L  LS Q+
Sbjct:   126 SPVQNQGPCGSCWAFSAVGSLEAQMKRRTAALVPLSAQN 164

 Score = 52 (23.4 bits), Expect = 1.5e-23, Sum P(2) = 1.5e-23
 Identities = 22/92 (23%), Positives = 42/92 (45%)

Query:   200 HDKVYSSV-EDLLRRH--ENFVTNVEKAEDYQSEDSGTAVFGVNKFFDLSESDLQQLTGL 256
             H+K Y +  E+ LRR   +  + ++    +  +    +   G+N+  D++  ++  + GL
Sbjct:    34 HNKTYRNTREERLRRSVWKQNLQDILLHNEAAAVGLHSYTLGLNQLSDMTADEVNDMNGL 93

Query:   257 NLDSTLEDIQ-----PSLQA-PFSSNQTDTEM 282
              L+    D+      PSLQ  P   N T+  M
Sbjct:    94 -LEEDFPDVNATFSPPSLQTLPQRVNWTEHGM 124


>GENEDB_PFALCIPARUM|PF11_0161 [details] [associations]
            symbol:PF11_0161 "falcipain-2 precursor,
            putative" species:5833 "Plasmodium falciparum" [GO:0020020 "food
            vacuole" evidence=TAS] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00139
            PROSITE:PS00639 EMBL:AE014186 HSSP:O65039 OMA:NNVEHIN GO:GO:0020020
            MEROPS:C01.046 HOGENOM:HOG000065857 ProtClustDB:PTZ00021
            RefSeq:XP_001347832.1 ProteinModelPortal:Q8I6U5 SMR:Q8I6U5
            IntAct:Q8I6U5 MINT:MINT-1546851 EnsemblProtists:PF11_0161:mRNA
            GeneID:810708 KEGG:pfa:PF11_0161 EuPathDB:PlasmoDB:PF3D7_1115300
            Uniprot:Q8I6U5
        Length = 482

 Score = 292 (107.8 bits), Expect = 2.7e-26, Sum P(2) = 2.7e-26
 Identities = 71/265 (26%), Positives = 132/265 (49%)

Query:   184 MHAIQGNNLTELSVQHHDKVYSSVEDLLRRHENFVTNVEKAEDYQSEDSGTAVFGVNKFF 243
             M+ ++  N     ++ ++K Y+S  ++  R + F+ N  K + + +         +N+F 
Sbjct:   154 MNNVEHINQFYTFIKTNNKQYNSPNEMKERFQVFLQNAHKVKMHNNNKKSLYKKELNRFA 213

Query:   244 DLSESDLQQLTGLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWR 303
             DL+  + +         TL   +P   + +  +Q + +    ++    + D    A+DWR
Sbjct:   214 DLTYHEFKSKY-----LTLRSSKPLKNSKYLLDQINYDAVIKKYKGNENFDHA--AYDWR 266

Query:   304 AEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSNGGCNGGRM 363
                 ++ VK+Q  C  CWAFS++G VE+ +AI+ N L  LS Q+LVDC   N GCNGG +
Sbjct:   267 LHSGVTPVKDQKNCGSCWAFSSIGSVESQYAIRKNKLITLSEQELVDCSFKNYGCNGGLI 326

Query:   364 DDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYGEEEEMKKWV 423
             ++A + +I+ GG+ +D  YPY  S++   C              Y  +P   + ++K+ +
Sbjct:   327 NNAFEDMIELGGICTDDDYPY-VSDAPNLC-NIDRCTEKYGIKNYLSVP---DNKLKEAL 381

Query:   424 ATRGPLSVGMNANGLF-YYSGGVID 447
                GP+S+ +  +  F +Y  G+ D
Sbjct:   382 RFLGPISISIAVSDDFPFYKEGIFD 406

 Score = 227 (85.0 bits), Expect = 1.2e-30, Sum P(2) = 1.2e-30
 Identities = 53/163 (32%), Positives = 88/163 (53%)

Query:   496 SKLSRLATEKLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXX 555
             +KL  L+ ++LVDC   N GCNGG +++A + +I+ GG+ +D  YPY  S++   C    
Sbjct:   301 NKLITLSEQELVDCSFKNYGCNGGLINNAFEDMIELGGICTDDDYPY-VSDAPNLC-NID 358

Query:   556 XXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLF-YYSGGVIDLNQRLCNPK 614
                       Y  +P   + ++K+ +   GP+S+ +  +  F +Y  G+ D     C  +
Sbjct:   359 RCTEKYGIKNYLSVP---DNKLKEALRFLGPISISIAVSDDFPFYKEGIFDGE---CGDE 412

Query:   615 AQNHALIIVGYGEEE-----KKDGTSIPYWIVKNSWGSDWGEK 652
               NHA+++VG+G +E      K G    Y+I+KNSWG  WGE+
Sbjct:   413 L-NHAVMLVGFGMKEIVNPLTKKGEKHYYYIIKNSWGQQWGER 454

 Score = 187 (70.9 bits), Expect = 1.2e-30, Sum P(2) = 1.2e-30
 Identities = 45/171 (26%), Positives = 85/171 (49%)

Query:    30 NIFQTRGYLNSP--VTRFLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAEDYQREDSGTA 87
             N+F  +  +N+   + +F  F++ ++K Y+S  ++  R + F+ N  K + +        
Sbjct:   146 NVFDHKFLMNNVEHINQFYTFIKTNNKQYNSPNEMKERFQVFLQNAHKVKMHNNNKKSLY 205

Query:    88 VFEVNKFFDLSDSDLQQLTGLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDD 147
               E+N+F DL+  + +         TL   +P   + +  +Q + +    ++    + D 
Sbjct:   206 KKELNRFADLTYHEFKSKY-----LTLRSSKPLKNSKYLLDQINYDAVIKKYKGNENFDH 260

Query:   148 LPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
                A+DWR    ++ VK+Q  C  CWAFS++G VE+ +AI+ N L  LS Q
Sbjct:   261 A--AYDWRLHSGVTPVKDQKNCGSCWAFSSIGSVESQYAIRKNKLITLSEQ 309

 Score = 72 (30.4 bits), Expect = 2.7e-26, Sum P(2) = 2.7e-26
 Identities = 10/15 (66%), Positives = 13/15 (86%)

Query:   459 YWIVKNSWGSDWGEK 473
             Y+I+KNSWG  WGE+
Sbjct:   440 YYIIKNSWGQQWGER 454


>UNIPROTKB|Q8I6U5 [details] [associations]
            symbol:PF11_0161 "Falcipain-2B" species:36329 "Plasmodium
            falciparum 3D7" [GO:0020020 "food vacuole" evidence=TAS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00139 PROSITE:PS00639
            EMBL:AE014186 HSSP:O65039 OMA:NNVEHIN GO:GO:0020020 MEROPS:C01.046
            HOGENOM:HOG000065857 ProtClustDB:PTZ00021 RefSeq:XP_001347832.1
            ProteinModelPortal:Q8I6U5 SMR:Q8I6U5 IntAct:Q8I6U5
            MINT:MINT-1546851 EnsemblProtists:PF11_0161:mRNA GeneID:810708
            KEGG:pfa:PF11_0161 EuPathDB:PlasmoDB:PF3D7_1115300 Uniprot:Q8I6U5
        Length = 482

 Score = 292 (107.8 bits), Expect = 2.7e-26, Sum P(2) = 2.7e-26
 Identities = 71/265 (26%), Positives = 132/265 (49%)

Query:   184 MHAIQGNNLTELSVQHHDKVYSSVEDLLRRHENFVTNVEKAEDYQSEDSGTAVFGVNKFF 243
             M+ ++  N     ++ ++K Y+S  ++  R + F+ N  K + + +         +N+F 
Sbjct:   154 MNNVEHINQFYTFIKTNNKQYNSPNEMKERFQVFLQNAHKVKMHNNNKKSLYKKELNRFA 213

Query:   244 DLSESDLQQLTGLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWR 303
             DL+  + +         TL   +P   + +  +Q + +    ++    + D    A+DWR
Sbjct:   214 DLTYHEFKSKY-----LTLRSSKPLKNSKYLLDQINYDAVIKKYKGNENFDHA--AYDWR 266

Query:   304 AEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSNGGCNGGRM 363
                 ++ VK+Q  C  CWAFS++G VE+ +AI+ N L  LS Q+LVDC   N GCNGG +
Sbjct:   267 LHSGVTPVKDQKNCGSCWAFSSIGSVESQYAIRKNKLITLSEQELVDCSFKNYGCNGGLI 326

Query:   364 DDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYGEEEEMKKWV 423
             ++A + +I+ GG+ +D  YPY  S++   C              Y  +P   + ++K+ +
Sbjct:   327 NNAFEDMIELGGICTDDDYPY-VSDAPNLC-NIDRCTEKYGIKNYLSVP---DNKLKEAL 381

Query:   424 ATRGPLSVGMNANGLF-YYSGGVID 447
                GP+S+ +  +  F +Y  G+ D
Sbjct:   382 RFLGPISISIAVSDDFPFYKEGIFD 406

 Score = 227 (85.0 bits), Expect = 1.2e-30, Sum P(2) = 1.2e-30
 Identities = 53/163 (32%), Positives = 88/163 (53%)

Query:   496 SKLSRLATEKLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXX 555
             +KL  L+ ++LVDC   N GCNGG +++A + +I+ GG+ +D  YPY  S++   C    
Sbjct:   301 NKLITLSEQELVDCSFKNYGCNGGLINNAFEDMIELGGICTDDDYPY-VSDAPNLC-NID 358

Query:   556 XXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLF-YYSGGVIDLNQRLCNPK 614
                       Y  +P   + ++K+ +   GP+S+ +  +  F +Y  G+ D     C  +
Sbjct:   359 RCTEKYGIKNYLSVP---DNKLKEALRFLGPISISIAVSDDFPFYKEGIFDGE---CGDE 412

Query:   615 AQNHALIIVGYGEEE-----KKDGTSIPYWIVKNSWGSDWGEK 652
               NHA+++VG+G +E      K G    Y+I+KNSWG  WGE+
Sbjct:   413 L-NHAVMLVGFGMKEIVNPLTKKGEKHYYYIIKNSWGQQWGER 454

 Score = 187 (70.9 bits), Expect = 1.2e-30, Sum P(2) = 1.2e-30
 Identities = 45/171 (26%), Positives = 85/171 (49%)

Query:    30 NIFQTRGYLNSP--VTRFLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAEDYQREDSGTA 87
             N+F  +  +N+   + +F  F++ ++K Y+S  ++  R + F+ N  K + +        
Sbjct:   146 NVFDHKFLMNNVEHINQFYTFIKTNNKQYNSPNEMKERFQVFLQNAHKVKMHNNNKKSLY 205

Query:    88 VFEVNKFFDLSDSDLQQLTGLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDD 147
               E+N+F DL+  + +         TL   +P   + +  +Q + +    ++    + D 
Sbjct:   206 KKELNRFADLTYHEFKSKY-----LTLRSSKPLKNSKYLLDQINYDAVIKKYKGNENFDH 260

Query:   148 LPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
                A+DWR    ++ VK+Q  C  CWAFS++G VE+ +AI+ N L  LS Q
Sbjct:   261 A--AYDWRLHSGVTPVKDQKNCGSCWAFSSIGSVESQYAIRKNKLITLSEQ 309

 Score = 72 (30.4 bits), Expect = 2.7e-26, Sum P(2) = 2.7e-26
 Identities = 10/15 (66%), Positives = 13/15 (86%)

Query:   459 YWIVKNSWGSDWGEK 473
             Y+I+KNSWG  WGE+
Sbjct:   440 YYIIKNSWGQQWGER 454


>RGD|61810 [details] [associations]
            symbol:Ctsk "cathepsin K" species:10116 "Rattus norvegicus"
           [GO:0001957 "intramembranous ossification" evidence=IEP] [GO:0005615
           "extracellular space" evidence=IDA] [GO:0005737 "cytoplasm"
           evidence=IDA] [GO:0005764 "lysosome" evidence=IDA] [GO:0006508
           "proteolysis" evidence=TAS] [GO:0008234 "cysteine-type peptidase
           activity" evidence=TAS] [GO:0045453 "bone resorption" evidence=IMP]
           InterPro:IPR000668 InterPro:IPR013128 InterPro:IPR015644
           Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
           RGD:61810 GO:GO:0005615 eggNOG:COG4870 HOGENOM:HOG000230774
           GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
           InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
           PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
           GO:GO:0045453 GO:GO:0001957 GeneTree:ENSGT00560000076577
           HOVERGEN:HBG011513 MEROPS:I29.007 CTD:1513 KO:K01371
           OrthoDB:EOG4SJ5FC PANTHER:PTHR12411:SF55 EMBL:AF010306 EMBL:BC078793
           IPI:IPI00206378 RefSeq:NP_113748.1 UniGene:Rn.5598
           ProteinModelPortal:O35186 SMR:O35186 STRING:O35186
           PhosphoSite:O35186 PRIDE:O35186 Ensembl:ENSRNOT00000028730
           GeneID:29175 KEGG:rno:29175 UCSC:RGD:61810 InParanoid:O35186
           OMA:YKEIPEG BindingDB:O35186 ChEMBL:CHEMBL3034 NextBio:608248
           Genevestigator:O35186 GermOnline:ENSRNOG00000021155 Uniprot:O35186
        Length = 329

 Score = 344 (126.2 bits), Expect = 1.3e-30, P = 1.3e-30
 Identities = 93/296 (31%), Positives = 145/296 (48%)

Query:   194 ELSVQHHDKVYSSVEDLLRRHENFVTNVEKAEDYQSEDSGTAVFGVNKFFDLSESDLQQL 253
             EL  + H K Y+S  D + R   +  N++K   +  E S     G + + +L+ + L  +
Sbjct:    27 ELWKKTHGKQYNSKVDEISRRLIWEKNLKKISVHNLEAS----LGAHTY-ELAMNHLGDM 81

Query:   254 TGLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVKE 313
             T    +  ++ +   L+ P S + ++  +   ++        +P++ D+R +G ++ VK 
Sbjct:    82 TS---EEVVQKMT-GLRVPPSRSFSNDTLYTPEWEGR-----VPDSIDYRKKGYVTPVKN 132

Query:   314 QGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSNGGCNGGRMDDALQYIIDN 373
             QG+C  CWAFS+ G +E     +   L  LS Q LVDC   N GC GG M  A QY+  N
Sbjct:   133 QGQCGSCWAFSSAGALEGQLKKKTGKLLALSPQNLVDCVSENYGCGGGYMTTAFQYVQQN 192

Query:   374 GGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGM 433
             GG+ S+ AYPY   +    C+             Y  IP G E+ +K+ VA  GP+SV +
Sbjct:   193 GGIDSEDAYPYVGQDES--CMYNATAKAAKCRG-YREIPVGNEKALKRAVARVGPVSVSI 249

Query:   434 NAN--GLFYYSGGVI--------DLNQRL----YGTSI--PYWIVKNSWGSDWGEK 473
             +A+     +YS GV         ++N  +    YGT     YWI+KNSWG  WG K
Sbjct:   250 DASLTSFQFYSRGVYYDENCDRDNVNHAVLVVGYGTQKGNKYWIIKNSWGESWGNK 305

 Score = 291 (107.5 bits), Expect = 7.5e-25, P = 7.5e-25
 Identities = 88/246 (35%), Positives = 120/246 (48%)

Query:   416 EEEMKKWVATRGPLSVGMNANGLFY--YSGGVID-LNQRLYGTSIPYWIVKNSW--GSDW 470
             EE ++K    R P S   + + L+   + G V D ++ R  G   P   VKN    GS W
Sbjct:    84 EEVVQKMTGLRVPPSRSFSNDTLYTPEWEGRVPDSIDYRKKGYVTP---VKNQGQCGSCW 140

Query:   471 GEKVEDKVGSSGNRTRDLEL-TGVLPSKLSRLATEKLVDCDMSNGGCNGGRMDDALQYII 529
                      S+G     L+  TG    KL  L+ + LVDC   N GC GG M  A QY+ 
Sbjct:   141 A------FSSAGALEGQLKKKTG----KLLALSPQNLVDCVSENYGCGGGYMTTAFQYVQ 190

Query:   530 DNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSV 589
              NGG+ S+ AYPY   +    C+             Y  IP G E+ +K+ VA  GP+SV
Sbjct:   191 QNGGIDSEDAYPYVGQDES--CMYNATAKAAKCRG-YREIPVGNEKALKRAVARVGPVSV 247

Query:   590 GMNAN--GLFYYSGGVI-DLNQRLCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWG 646
              ++A+     +YS GV  D N   C+    NHA+++VGYG ++   G    YWI+KNSWG
Sbjct:   248 SIDASLTSFQFYSRGVYYDEN---CDRDNVNHAVLVVGYGTQK---GNK--YWIIKNSWG 299

Query:   647 SDWGEK 652
               WG K
Sbjct:   300 ESWGNK 305

 Score = 117 (46.2 bits), Expect = 0.00071, P = 0.00071
 Identities = 39/158 (24%), Positives = 76/158 (48%)

Query:    43 TRFLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAEDYQREDS-GTAVFEVNKFFDLSDSD 101
             T++  + + H K Y+S  D + R   +  N++K   +  E S G   +E      L+ + 
Sbjct:    24 TQWELWKKTHGKQYNSKVDEISRRLIWEKNLKKISVHNLEASLGAHTYE------LAMNH 77

Query:   102 LQQLTGLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVIS 161
             L  +T    +  ++ +   L+ P S + ++  +   ++        +P++ D+R +G ++
Sbjct:    78 LGDMTS---EEVVQKMT-GLRVPPSRSFSNDTLYTPEWEGR-----VPDSIDYRKKGYVT 128

Query:   162 KVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQH 199
              VK QG+C  CWAFS+ G +E     +   L  LS Q+
Sbjct:   129 PVKNQGQCGSCWAFSSAGALEGQLKKKTGKLLALSPQN 166


>MGI|MGI:107823 [details] [associations]
            symbol:Ctsk "cathepsin K" species:10090 "Mus musculus"
            [GO:0005615 "extracellular space" evidence=ISO] [GO:0005737
            "cytoplasm" evidence=ISO] [GO:0005764 "lysosome" evidence=ISO]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008233 "peptidase
            activity" evidence=IEA] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0016787 "hydrolase activity"
            evidence=IEA] [GO:0045453 "bone resorption" evidence=ISO]
            InterPro:IPR000668 InterPro:IPR013128 InterPro:IPR015644
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            MGI:MGI:107823 GO:GO:0005615 eggNOG:COG4870 HOGENOM:HOG000230774
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 GO:GO:0045453
            GO:GO:0001957 HOVERGEN:HBG011513 MEROPS:I29.007 CTD:1513 KO:K01371
            OrthoDB:EOG4SJ5FC PANTHER:PTHR12411:SF55 OMA:LKVPPSH EMBL:X94444
            EMBL:AJ006033 EMBL:BC046320 IPI:IPI00316575 PIR:S74227
            RefSeq:NP_031828.2 UniGene:Mm.272085 ProteinModelPortal:P55097
            SMR:P55097 MINT:MINT-3089515 STRING:P55097 PhosphoSite:P55097
            PRIDE:P55097 Ensembl:ENSMUST00000015664 GeneID:13038 KEGG:mmu:13038
            InParanoid:P55097 BioCyc:MetaCyc:MONOMER-14811 ChEMBL:CHEMBL1075277
            NextBio:282924 Bgee:P55097 CleanEx:MM_CTSK Genevestigator:P55097
            GermOnline:ENSMUSG00000028111 Uniprot:P55097
        Length = 329

 Score = 344 (126.2 bits), Expect = 1.3e-30, P = 1.3e-30
 Identities = 93/296 (31%), Positives = 147/296 (49%)

Query:   194 ELSVQHHDKVYSSVEDLLRRHENFVTNVEKAEDYQSEDSGTAVFGVNKFFDLSESDLQQL 253
             EL  + H K Y+S  D + R   +  N+++   +  E S     GV+ + +L+ + L  +
Sbjct:    27 ELWKKTHQKQYNSKVDEISRRLIWEKNLKQISAHNLEAS----LGVHTY-ELAMNHLGDM 81

Query:   254 TGLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVKE 313
             T    +  ++ +   L+ P S + ++  +   ++        +P++ D+R +G ++ VK 
Sbjct:    82 TS---EEVVQKMT-GLRIPPSRSYSNDTLYTPEWEGR-----VPDSIDYRKKGYVTPVKN 132

Query:   314 QGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSNGGCNGGRMDDALQYIIDN 373
             QG+C  CWAFS+ G +E     +   L  LS Q LVDC   N GC GG M  A QY+  N
Sbjct:   133 QGQCGSCWAFSSAGALEGQLKKKTGKLLALSPQNLVDCVTENYGCGGGYMTTAFQYVQQN 192

Query:   374 GGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGM 433
             GG+ S+ AYPY   +    C+             Y  IP G E+ +K+ VA  GP+SV +
Sbjct:   193 GGIDSEDAYPYVGQDES--CMYNATAKAAKCRG-YREIPVGNEKALKRAVARVGPISVSI 249

Query:   434 NAN-GLF-YYSGGVI--------DLNQRL----YGTS--IPYWIVKNSWGSDWGEK 473
             +A+   F +YS GV         ++N  +    YGT     +WI+KNSWG  WG K
Sbjct:   250 DASLASFQFYSRGVYYDENCDRDNVNHAVLVVGYGTQKGSKHWIIKNSWGESWGNK 305

 Score = 286 (105.7 bits), Expect = 2.6e-24, P = 2.6e-24
 Identities = 88/246 (35%), Positives = 122/246 (49%)

Query:   416 EEEMKKWVATRGPLSVGMNANGLFY--YSGGVID-LNQRLYGTSIPYWIVKNSW--GSDW 470
             EE ++K    R P S   + + L+   + G V D ++ R  G   P   VKN    GS W
Sbjct:    84 EEVVQKMTGLRIPPSRSYSNDTLYTPEWEGRVPDSIDYRKKGYVTP---VKNQGQCGSCW 140

Query:   471 GEKVEDKVGSSGNRTRDLEL-TGVLPSKLSRLATEKLVDCDMSNGGCNGGRMDDALQYII 529
                      S+G     L+  TG    KL  L+ + LVDC   N GC GG M  A QY+ 
Sbjct:   141 A------FSSAGALEGQLKKKTG----KLLALSPQNLVDCVTENYGCGGGYMTTAFQYVQ 190

Query:   530 DNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSV 589
              NGG+ S+ AYPY   +    C+             Y  IP G E+ +K+ VA  GP+SV
Sbjct:   191 QNGGIDSEDAYPYVGQDES--CMYNATAKAAKCRG-YREIPVGNEKALKRAVARVGPISV 247

Query:   590 GMNAN-GLF-YYSGGVI-DLNQRLCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWG 646
              ++A+   F +YS GV  D N   C+    NHA+++VGYG ++   G+   +WI+KNSWG
Sbjct:   248 SIDASLASFQFYSRGVYYDEN---CDRDNVNHAVLVVGYGTQK---GSK--HWIIKNSWG 299

Query:   647 SDWGEK 652
               WG K
Sbjct:   300 ESWGNK 305

 Score = 116 (45.9 bits), Expect = 0.00091, P = 0.00091
 Identities = 38/158 (24%), Positives = 76/158 (48%)

Query:    43 TRFLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAEDYQREDS-GTAVFEVNKFFDLSDSD 101
             T++  + + H K Y+S  D + R   +  N+++   +  E S G   +E      L+ + 
Sbjct:    24 TQWELWKKTHQKQYNSKVDEISRRLIWEKNLKQISAHNLEASLGVHTYE------LAMNH 77

Query:   102 LQQLTGLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVIS 161
             L  +T    +  ++ +   L+ P S + ++  +   ++        +P++ D+R +G ++
Sbjct:    78 LGDMTS---EEVVQKMT-GLRIPPSRSYSNDTLYTPEWEGR-----VPDSIDYRKKGYVT 128

Query:   162 KVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQH 199
              VK QG+C  CWAFS+ G +E     +   L  LS Q+
Sbjct:   129 PVKNQGQCGSCWAFSSAGALEGQLKKKTGKLLALSPQN 166


>UNIPROTKB|P56202 [details] [associations]
            symbol:CTSW "Cathepsin W" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0006955 "immune response"
            evidence=TAS] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 GO:GO:0006955 HOGENOM:HOG000230774
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 EMBL:AF013611
            EMBL:AF015954 EMBL:AF055903 EMBL:AP001201 EMBL:BC048255
            IPI:IPI00328978 RefSeq:NP_001326.2 UniGene:Hs.416848
            ProteinModelPortal:P56202 SMR:P56202 STRING:P56202 MEROPS:C01.037
            PhosphoSite:P56202 DMDM:259016196 PaxDb:P56202 PRIDE:P56202
            Ensembl:ENST00000307886 GeneID:1521 KEGG:hsa:1521 UCSC:uc001ogc.1
            CTD:1521 GeneCards:GC11P065647 HGNC:HGNC:2546 HPA:CAB016345
            MIM:602364 neXtProt:NX_P56202 PharmGKB:PA27042 eggNOG:NOG288820
            HOVERGEN:HBG100117 InParanoid:P56202 KO:K08569 OMA:GRCGDGC
            OrthoDB:EOG4Q2DG3 PhylomeDB:P56202 GenomeRNAi:1521 NextBio:6295
            ArrayExpress:P56202 Bgee:P56202 CleanEx:HS_CTSW
            Genevestigator:P56202 GermOnline:ENSG00000172543 Uniprot:P56202
        Length = 376

 Score = 279 (103.3 bits), Expect = 1.4e-30, Sum P(2) = 1.4e-30
 Identities = 73/247 (29%), Positives = 117/247 (47%)

Query:   201 DKVYSSVEDLLRRHENFVTNVEKAEDYQSEDSGTAVFGVNKFFDLSESDLQQLTGLNLDS 260
             ++ Y S E+   R + F  N+ +A+  Q ED GTA FGV  F DL+E +  QL G     
Sbjct:    50 NRSYLSPEEHAHRLDIFAHNLAQAQRLQEEDLGTAEFGVTPFSDLTEEEFGQLYGYR--R 107

Query:   261 TLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWR-AEGVISKVKEQGKCAC 319
                 + PS+     S + +              + +P + DWR     IS +K+Q  C C
Sbjct:   108 AAGGV-PSMGREIRSEEPE--------------ESVPFSCDWRKVASAISPIKDQKNCNC 152

Query:   320 CWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSD 379
             CWA +A G +E +  I      ++SVQ+L+DC     GC+GG + DA   +++N G+ S+
Sbjct:   153 CWAMAAAGNIETLWRISFWDFVDVSVQELLDCGRCGDGCHGGFVWDAFITVLNNSGLASE 212

Query:   380 QAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLF 439
             + YP++       C              +  +    E  + +++AT GP++V +N   L 
Sbjct:   213 KDYPFQGKVRAHRC-HPKKYQKVAWIQDFIMLQ-NNEHRIAQYLATYGPITVTINMKPLQ 270

Query:   440 YYSGGVI 446
              Y  GVI
Sbjct:   271 LYRKGVI 277

 Score = 220 (82.5 bits), Expect = 3.0e-25, Sum P(2) = 3.0e-25
 Identities = 47/167 (28%), Positives = 84/167 (50%)

Query:   501 LATEKLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXX 560
             ++ ++L+DC     GC+GG + DA   +++N G+ S++ YP++       C         
Sbjct:   176 VSVQELLDCGRCGDGCHGGFVWDAFITVLNNSGLASEKDYPFQGKVRAHRC-HPKKYQKV 234

Query:   561 XXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLFYYSGGVIDLNQRLCNPKAQNHAL 620
                  +  +    E  + +++AT GP++V +N   L  Y  GVI      C+P+  +H++
Sbjct:   235 AWIQDFIMLQ-NNEHRIAQYLATYGPITVTINMKPLQLYRKGVIKATPTTCDPQLVDHSV 293

Query:   621 IIVGYGEEEKKDGT---------------SIPYWIVKNSWGSDWGEK 652
             ++VG+G  + ++G                  PYWI+KNSWG+ WGEK
Sbjct:   294 LLVGFGSVKSEEGIWAETVSSQSQPQPPHPTPYWILKNSWGAQWGEK 340

 Score = 136 (52.9 bits), Expect = 3.0e-25, Sum P(2) = 3.0e-25
 Identities = 44/154 (28%), Positives = 68/154 (44%)

Query:    45 FLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAEDYQREDSGTAVFEVNKFFDLSDSDLQQ 104
             F  F    ++ Y S E+   R + F  N+ +A+  Q ED GTA F V  F DL++ +  Q
Sbjct:    42 FKLFQIQFNRSYLSPEEHAHRLDIFAHNLAQAQRLQEEDLGTAEFGVTPFSDLTEEEFGQ 101

Query:   105 LTGLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVK 164
             L G         + PS+     S + +         S+    D       +    IS +K
Sbjct:   102 LYGYR--RAAGGV-PSMGREIRSEEPE--------ESVPFSCDWR-----KVASAISPIK 145

Query:   165 EQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
             +Q  C CCWA +A G +E +  I   +  ++SVQ
Sbjct:   146 DQKNCNCCWAMAAAGNIETLWRISFWDFVDVSVQ 179

 Score = 90 (36.7 bits), Expect = 1.4e-30, Sum P(2) = 1.4e-30
 Identities = 13/16 (81%), Positives = 15/16 (93%)

Query:   458 PYWIVKNSWGSDWGEK 473
             PYWI+KNSWG+ WGEK
Sbjct:   325 PYWILKNSWGAQWGEK 340


>UNIPROTKB|E2RR02 [details] [associations]
            symbol:CTSF "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00139 PROSITE:PS00639
            OMA:LAPPEWD GeneTree:ENSGT00660000095458 EMBL:AAEX03011628
            Ensembl:ENSCAFT00000019742 Uniprot:E2RR02
        Length = 460

 Score = 299 (110.3 bits), Expect = 5.5e-24, P = 5.5e-24
 Identities = 86/289 (29%), Positives = 132/289 (45%)

Query:   202 KVYSSVEDLLRRHENFVTNVEKAEDYQSEDSGTAVFGVNKFFDLSESDLQQ-LTGLNLDS 260
             K+ S  ++ +  +       E+AE   S  S   V    K   L     Q  +T  + D 
Sbjct:   157 KMASVFKEFVTTYNRTYETKEEAEWRMSVFSNNMV-RAQKIQALDRGTAQYGITKFS-DL 214

Query:   261 TLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVKEQGKCACC 320
             T E+ +     P        +MR  +  S  H    P  +DWR++G ++KVK+QG C  C
Sbjct:   215 TEEEFRTIYLNPLLRENRGKKMRLAKSIS-DHAP--PPEWDWRSKGAVTKVKDQGMCGSC 271

Query:   321 WAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQ 380
             WAFS  G VE    ++  +L  LS Q+L+DCD  +  C GG   +A   I+  GG+ ++ 
Sbjct:   272 WAFSVTGNVEGQWFLKEGTLLSLSEQELLDCDKVDKACLGGLPSNAYSAIMTLGGLETED 331

Query:   381 AYPYKASESERGCLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLFY 440
              Y Y+                       S+     E+++  W+A +GP+SV +NA G+ +
Sbjct:   332 DYSYQGHLQACSFSAKKARVYINDSMELSQ----NEQKLAAWLAKKGPISVAINAFGMQF 387

Query:   441 YSGGV------------IDLNQRL--YG--TSIPYWIVKNSWGSDWGEK 473
             Y  G+            ID    L  YG  + IP+W +KNSWG+DWGE+
Sbjct:   388 YRHGISHPLRPLCSPWLIDHAVLLVGYGNRSGIPFWAIKNSWGTDWGEE 436

 Score = 235 (87.8 bits), Expect = 1.5e-30, Sum P(3) = 1.5e-30
 Identities = 49/155 (31%), Positives = 82/155 (52%)

Query:   498 LSRLATEKLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXX 557
             L  L+ ++L+DCD  +  C GG   +A   I+  GG+ ++  Y Y+              
Sbjct:   291 LLSLSEQELLDCDKVDKACLGGLPSNAYSAIMTLGGLETEDDYSYQGHLQACSFSAKKAR 350

Query:   558 XXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLFYYSGGVIDLNQRLCNPKAQN 617
                      S+     E+++  W+A +GP+SV +NA G+ +Y  G+    + LC+P   +
Sbjct:   351 VYINDSMELSQ----NEQKLAAWLAKKGPISVAINAFGMQFYRHGISHPLRPLCSPWLID 406

Query:   618 HALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
             HA+++VGYG       + IP+W +KNSWG+DWGE+
Sbjct:   407 HAVLLVGYGNR-----SGIPFWAIKNSWGTDWGEE 436

 Score = 131 (51.2 bits), Expect = 1.5e-30, Sum P(3) = 1.5e-30
 Identities = 23/50 (46%), Positives = 31/50 (62%)

Query:   149 PEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
             P  +DWR++G ++KVK+QG C  CWAFS  G VE    ++   L  LS Q
Sbjct:   248 PPEWDWRSKGAVTKVKDQGMCGSCWAFSVTGNVEGQWFLKEGTLLSLSEQ 297

 Score = 89 (36.4 bits), Expect = 1.9e-21, Sum P(2) = 1.9e-21
 Identities = 18/57 (31%), Positives = 33/57 (57%)

Query:   197 VQHHDKVYSSVEDLLRRHENFVTNVEKAEDYQSEDSGTAVFGVNKFFDLSESDLQQL 253
             V  +++ Y + E+   R   F  N+ +A+  Q+ D GTA +G+ KF DL+E + + +
Sbjct:   166 VTTYNRTYETKEEAEWRMSVFSNNMVRAQKIQALDRGTAQYGITKFSDLTEEEFRTI 222

 Score = 84 (34.6 bits), Expect = 1.5e-30, Sum P(3) = 1.5e-30
 Identities = 17/61 (27%), Positives = 33/61 (54%)

Query:    45 FLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAEDYQREDSGTAVFEVNKFFDLSDSDLQQ 104
             F  F+  +++ Y + E+   R   F  N+ +A+  Q  D GTA + + KF DL++ + + 
Sbjct:   162 FKEFVTTYNRTYETKEEAEWRMSVFSNNMVRAQKIQALDRGTAQYGITKFSDLTEEEFRT 221

Query:   105 L 105
             +
Sbjct:   222 I 222


>TAIR|locus:2117979 [details] [associations]
            symbol:AT4G23520 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            EMBL:CP002687 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00639
            KO:K01376 IPI:IPI00527171 RefSeq:NP_567686.2 UniGene:At.32421
            ProteinModelPortal:F4JNL3 SMR:F4JNL3 MEROPS:C01.A22 PRIDE:F4JNL3
            EnsemblPlants:AT4G23520.1 GeneID:828452 KEGG:ath:AT4G23520
            OMA:PANDEIS ArrayExpress:F4JNL3 Uniprot:F4JNL3
        Length = 356

 Score = 343 (125.8 bits), Expect = 1.6e-30, P = 1.6e-30
 Identities = 76/196 (38%), Positives = 111/196 (56%)

Query:   293 GDDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCD 352
             GD LPE+ DWR EG +S++K+QG C  CWAFS V  VE ++ I    L  LS Q+LVDC+
Sbjct:   130 GDQLPESVDWRQEGAVSEIKDQGTCNSCWAFSTVAAVEGLNKIVTGELISLSEQELVDCN 189

Query:   353 MSNGGCNG-GRMDDALQYIIDNGGVVSDQAYPYKASESERGC-LXXXXXXXXXXXXXYSR 410
             + N GC G G MD A Q++I+N G+ S++ YPY+ ++    C               Y  
Sbjct:   190 LVNNGCYGSGLMDTAFQFLINNNGLDSEKDYPYQGTQGS--CNRKQSTSNKVITIDSYED 247

Query:   411 IPYGEEEEMKKWVATRGPLSVGMNANG---LFY----YSGGV-IDLNQRL----YGTSI- 457
             +P  +E  ++K VA + P+SVG++      + Y    Y+G    +L+  L    YG+   
Sbjct:   248 VPANDEISLQKAVAHQ-PVSVGVDKKSQEFMLYRSCIYNGPCGTNLDHALVIVGYGSENG 306

Query:   458 -PYWIVKNSWGSDWGE 472
               YWIV+NSWG+ WG+
Sbjct:   307 QDYWIVRNSWGTTWGD 322

 Score = 237 (88.5 bits), Expect = 2.7e-30, Sum P(2) = 2.7e-30
 Identities = 56/164 (34%), Positives = 88/164 (53%)

Query:   490 LTGVLPSKLSRLATEKLVDCDMSNGGCNG-GRMDDALQYIIDNGGVVSDQAYPYKASESE 548
             L  ++  +L  L+ ++LVDC++ N GC G G MD A Q++I+N G+ S++ YPY+ ++  
Sbjct:   169 LNKIVTGELISLSEQELVDCNLVNNGCYGSGLMDTAFQFLINNNGLDSEKDYPYQGTQGS 228

Query:   549 RGC-LXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLFYYSGGVIDLN 607
               C               Y  +P  +E  ++K VA + P+SVG++     +        N
Sbjct:   229 --CNRKQSTSNKVITIDSYEDVPANDEISLQKAVAHQ-PVSVGVDKKSQEFMLYRSCIYN 285

Query:   608 QRLCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGE 651
                C     +HAL+IVGYG E  +D     YWIV+NSWG+ WG+
Sbjct:   286 GP-CGTNL-DHALVIVGYGSENGQD-----YWIVRNSWGTTWGD 322

 Score = 160 (61.4 bits), Expect = 2.7e-30, Sum P(2) = 2.7e-30
 Identities = 47/152 (30%), Positives = 68/152 (44%)

Query:    50 RDHDKVYSSVEDLLRRHENFVTNV--EKAEDYQREDSGTAVFEVNKFFDLS-DSDLQQLT 106
             R +++V    +  + +H    TN   EK   +Q         + +   +LS    L +  
Sbjct:    38 RSNEEVEFIFQMWMSKHGKTYTNALGEKERRFQNFKDNLRFIDQHNAKNLSYQLGLTRFA 97

Query:   107 GLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVKEQ 166
              L +     D+ P    P   N   T  R   +  L  GD LPE+ DWR EG +S++K+Q
Sbjct:    98 DLTVQE-YRDLFPGSPKPKQRN-LKTSRR---YVPLA-GDQLPESVDWRQEGAVSEIKDQ 151

Query:   167 GKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
             G C  CWAFS V  VE ++ I    L  LS Q
Sbjct:   152 GTCNSCWAFSTVAAVEGLNKIVTGELISLSEQ 183


>ZFIN|ZDB-GENE-080724-8 [details] [associations]
            symbol:ctso "cathepsin O" species:7955 "Danio rerio"
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            ZFIN:ZDB-GENE-080724-8 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00639
            GeneTree:ENSGT00560000076599 CTD:1519 KO:K01374 EMBL:CR931784
            IPI:IPI00513613 RefSeq:XP_695717.3 UniGene:Dr.88386
            Ensembl:ENSDART00000074786 GeneID:567333 KEGG:dre:567333
            NextBio:20888622 Uniprot:E7FA09
        Length = 334

 Score = 310 (114.2 bits), Expect = 2.0e-30, Sum P(2) = 2.0e-30
 Identities = 71/191 (37%), Positives = 99/191 (51%)

Query:   297 PEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSNG 356
             P  FDWR  GV+  V  QG C  CWAFS V  +E++ A  G  L +LSVQQ++DC   N 
Sbjct:   122 PPRFDWRDHGVVGPVHNQGSCGGCWAFSIVEAIESVSAKGGEKLQQLSVQQVIDCSYQNQ 181

Query:   357 GCNGGRMDDALQYIIDNG-GVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPY-G 414
             GCNGG   +AL ++  +   +VS+  YP+K ++    C              YS   + G
Sbjct:   182 GCNGGSPVEALYWLTQSKLKLVSEAEYPFKGADGV--CQFFPQAHAGVAVRNYSAYDFSG 239

Query:   415 EEEEMKKWVATRGPLSVGMNANGLFYYSGGVID-------LNQRL----YGTS--IPYWI 461
             +EE M   +   GPL V ++A     Y GG+I         N  +    Y T+  +PYWI
Sbjct:   240 QEEVMMSALVDFGPLVVIVDAISWQDYLGGIIQHHCSSHKANHAVLITGYDTTGEVPYWI 299

Query:   462 VKNSWGSDWGE 472
             V+NSWG+ WG+
Sbjct:   300 VRNSWGTSWGD 310

 Score = 233 (87.1 bits), Expect = 4.1e-28, Sum P(2) = 4.1e-28
 Identities = 55/158 (34%), Positives = 83/158 (52%)

Query:   497 KLSRLATEKLVDCDMSNGGCNGGRMDDALQYIIDNG-GVVSDQAYPYKASESERGCLXXX 555
             KL +L+ ++++DC   N GCNGG   +AL ++  +   +VS+  YP+K ++    C    
Sbjct:   164 KLQQLSVQQVIDCSYQNQGCNGGSPVEALYWLTQSKLKLVSEAEYPFKGADGV--CQFFP 221

Query:   556 XXXXXXXXXXYSRIPY-GEEEEMKKWVATRGPLSVGMNANGLFYYSGGVIDLNQRLCNPK 614
                       YS   + G+EE M   +   GPL V ++A     Y GG+I   Q  C+  
Sbjct:   222 QAHAGVAVRNYSAYDFSGQEEVMMSALVDFGPLVVIVDAISWQDYLGGII---QHHCSSH 278

Query:   615 AQNHALIIVGYGEEEKKDGTS-IPYWIVKNSWGSDWGE 651
               NHA++I GY      D T  +PYWIV+NSWG+ WG+
Sbjct:   279 KANHAVLITGY------DTTGEVPYWIVRNSWGTSWGD 310

 Score = 138 (53.6 bits), Expect = 4.1e-28, Sum P(2) = 4.1e-28
 Identities = 42/139 (30%), Positives = 65/139 (46%)

Query:    63 LRRHENFVTNVEKAEDYQREDSGTAVFEVNKFFD--LSDSDLQQLTGLNLDSTLEDIQPS 120
             L+  + F  +V   E YQR  +  +  +   F +  L  S+     G+N  S L   Q  
Sbjct:    36 LQHSDTFQQDVNN-ELYQRWINYQSSLQRQAFLNSALGKSNQSAQYGVNQFSYLS--QKQ 92

Query:   121 LQAPFSSNQTDTEMRAFQFNS-LRHGDDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVG 179
              +  + + + +   +  Q  S ++   + P  FDWR  GV+  V  QG C  CWAFS V 
Sbjct:    93 FKEQYLTARAEAAPKFDQSKSEIKVKANNPPRFDWRDHGVVGPVHNQGSCGGCWAFSIVE 152

Query:   180 VVEAMHAIQGNNLTELSVQ 198
              +E++ A  G  L +LSVQ
Sbjct:   153 AIESVSAKGGEKLQQLSVQ 171

 Score = 56 (24.8 bits), Expect = 2.0e-30, Sum P(2) = 2.0e-30
 Identities = 20/79 (25%), Positives = 43/79 (54%)

Query:   179 GVVEAMHAIQGNNLTELS-VQHHDKVYSSVE-DLLRRHENFVTNVEKAEDYQS---EDSG 233
             G++ ++  I+  +LTE   +QH D     V  +L +R  N+ +++++     S   + + 
Sbjct:    19 GII-SVEVIR-KSLTEGERLQHSDTFQQDVNNELYQRWINYQSSLQRQAFLNSALGKSNQ 76

Query:   234 TAVFGVNKFFDLSESDLQQ 252
             +A +GVN+F  LS+   ++
Sbjct:    77 SAQYGVNQFSYLSQKQFKE 95


>UNIPROTKB|P43234 [details] [associations]
            symbol:CTSO "Cathepsin O" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0005764 "lysosome"
            evidence=IEA] [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=TAS] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 Reactome:REACT_6900
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0006508
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 GO:GO:0004197
            CleanEx:HS_CTSO EMBL:X77383 EMBL:BC049206 IPI:IPI00017257
            PIR:A55090 RefSeq:NP_001325.1 UniGene:Hs.75262
            ProteinModelPortal:P43234 SMR:P43234 IntAct:P43234 STRING:P43234
            MEROPS:C01.035 PhosphoSite:P43234 DMDM:1168795 PRIDE:P43234
            DNASU:1519 Ensembl:ENST00000433477 GeneID:1519 KEGG:hsa:1519
            UCSC:uc003ipg.3 CTD:1519 GeneCards:GC04M156845 HGNC:HGNC:2542
            HPA:HPA002041 MIM:600550 neXtProt:NX_P43234 PharmGKB:PA27040
            HOVERGEN:HBG105050 InParanoid:P43234 KO:K01374 OMA:SNVCGIA
            OrthoDB:EOG4V6ZH1 PhylomeDB:P43234 BindingDB:P43234
            ChEMBL:CHEMBL3035 GenomeRNAi:1519 NextBio:6287 Bgee:P43234
            Genevestigator:P43234 GermOnline:ENSG00000151792 Uniprot:P43234
        Length = 321

 Score = 325 (119.5 bits), Expect = 2.1e-30, Sum P(2) = 2.1e-30
 Identities = 76/191 (39%), Positives = 103/191 (53%)

Query:   296 LPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSN 355
             LP  FDWR + V+++V+ Q  C  CWAFS VG VE+ +AI+G  L +LSVQQ++DC  +N
Sbjct:   108 LPLRFDWRDKQVVTQVRNQQMCGGCWAFSVVGAVESAYAIKGKPLEDLSVQQVIDCSYNN 167

Query:   356 GGCNGGRMDDALQYIID-NGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYG 414
              GCNGG   +AL ++      +V D  YP+KA      C              YS   + 
Sbjct:   168 YGCNGGSTLNALNWLNKMQVKLVKDSEYPFKAQNGL--CHYFSGSHSGFSIKGYSAYDFS 225

Query:   415 EEE-EMKKWVATRGPLSVGMNANGLFYYSGGVI-------DLNQRLYGT------SIPYW 460
             ++E EM K + T GPL V ++A     Y GG+I       + N  +  T      S PYW
Sbjct:   226 DQEDEMAKALLTFGPLVVIVDAVSWQDYLGGIIQHHCSSGEANHAVLITGFDKTGSTPYW 285

Query:   461 IVKNSWGSDWG 471
             IV+NSWGS WG
Sbjct:   286 IVRNSWGSSWG 296

 Score = 230 (86.0 bits), Expect = 4.1e-29, Sum P(2) = 4.1e-29
 Identities = 57/155 (36%), Positives = 80/155 (51%)

Query:   498 LSRLATEKLVDCDMSNGGCNGGRMDDALQYIID-NGGVVSDQAYPYKASESERGCLXXXX 556
             L  L+ ++++DC  +N GCNGG   +AL ++      +V D  YP+KA      C     
Sbjct:   152 LEDLSVQQVIDCSYNNYGCNGGSTLNALNWLNKMQVKLVKDSEYPFKAQNGL--CHYFSG 209

Query:   557 XXXXXXXXXYSRIPYGEEE-EMKKWVATRGPLSVGMNANGLFYYSGGVIDLNQRLCNPKA 615
                      YS   + ++E EM K + T GPL V ++A     Y GG+I   Q  C+   
Sbjct:   210 SHSGFSIKGYSAYDFSDQEDEMAKALLTFGPLVVIVDAVSWQDYLGGII---QHHCSSGE 266

Query:   616 QNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWG 650
              NHA++I G+     K G S PYWIV+NSWGS WG
Sbjct:   267 ANHAVLITGFD----KTG-STPYWIVRNSWGSSWG 296

 Score = 145 (56.1 bits), Expect = 4.1e-29, Sum P(2) = 4.1e-29
 Identities = 27/51 (52%), Positives = 36/51 (70%)

Query:   148 LPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
             LP  FDWR + V+++V+ Q  C  CWAFS VG VE+ +AI+G  L +LSVQ
Sbjct:   108 LPLRFDWRDKQVVTQVRNQQMCGGCWAFSVVGAVESAYAIKGKPLEDLSVQ 158

 Score = 40 (19.1 bits), Expect = 2.1e-30, Sum P(2) = 2.1e-30
 Identities = 8/16 (50%), Positives = 13/16 (81%)

Query:   227 YQSEDSGTAVFGVNKF 242
             + SE+S TA +G+N+F
Sbjct:    58 FPSENS-TAFYGINQF 72


>UNIPROTKB|F1PGK4 [details] [associations]
            symbol:CTSO "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00139
            PROSITE:PS00639 GeneTree:ENSGT00560000076599 OMA:SNVCGIA
            EMBL:AAEX03010073 Ensembl:ENSCAFT00000013638 Uniprot:F1PGK4
        Length = 316

 Score = 322 (118.4 bits), Expect = 2.2e-30, Sum P(2) = 2.2e-30
 Identities = 77/203 (37%), Positives = 106/203 (52%)

Query:   296 LPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSN 355
             LP  FDWR + V+++V+ Q  C  CWAFS VG VE+ +AI+G  L ++SVQQ++DC  +N
Sbjct:   103 LPLRFDWRDKRVVTQVRNQQTCGGCWAFSVVGAVESAYAIKGKPLADISVQQVIDCSYNN 162

Query:   356 GGCNGGRMDDALQYIIDNG-GVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYG 414
              GC+GG   +AL ++      +V D  YP+KA      C              YS   + 
Sbjct:   163 YGCSGGSTLNALNWLNKTQVKLVRDSEYPFKAQNGL--CHYFSDSYSGFSIRGYSAYDFS 220

Query:   415 EEE-EMKKWVATRGPLSVGMNANGLFYYSGGVI-------DLNQRLYGT------SIPYW 460
             ++E EM K + T GPL V ++A     Y GG+I       + N  +  T      S PYW
Sbjct:   221 DQEDEMAKVLLTFGPLVVVVDAVSWQDYLGGIIQHHCSSGEANHAVLITGFDKIGSTPYW 280

Query:   461 IVKNSWGSDWGEKVEDKVGSSGN 483
             IV+NSWGS WG      V   GN
Sbjct:   281 IVRNSWGSSWGVDGYAHVKMGGN 303

 Score = 223 (83.6 bits), Expect = 2.3e-29, Sum P(3) = 2.3e-29
 Identities = 55/155 (35%), Positives = 81/155 (52%)

Query:   498 LSRLATEKLVDCDMSNGGCNGGRMDDALQYIIDNG-GVVSDQAYPYKASESERGCLXXXX 556
             L+ ++ ++++DC  +N GC+GG   +AL ++      +V D  YP+KA      C     
Sbjct:   147 LADISVQQVIDCSYNNYGCSGGSTLNALNWLNKTQVKLVRDSEYPFKAQNGL--CHYFSD 204

Query:   557 XXXXXXXXXYSRIPYGEEE-EMKKWVATRGPLSVGMNANGLFYYSGGVIDLNQRLCNPKA 615
                      YS   + ++E EM K + T GPL V ++A     Y GG+I   Q  C+   
Sbjct:   205 SYSGFSIRGYSAYDFSDQEDEMAKVLLTFGPLVVVVDAVSWQDYLGGII---QHHCSSGE 261

Query:   616 QNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWG 650
              NHA++I G+     K G S PYWIV+NSWGS WG
Sbjct:   262 ANHAVLITGFD----KIG-STPYWIVRNSWGSSWG 291

 Score = 144 (55.7 bits), Expect = 2.3e-29, Sum P(3) = 2.3e-29
 Identities = 26/51 (50%), Positives = 36/51 (70%)

Query:   148 LPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
             LP  FDWR + V+++V+ Q  C  CWAFS VG VE+ +AI+G  L ++SVQ
Sbjct:   103 LPLRFDWRDKRVVTQVRNQQTCGGCWAFSVVGAVESAYAIKGKPLADISVQ 153

 Score = 43 (20.2 bits), Expect = 2.2e-30, Sum P(2) = 2.2e-30
 Identities = 14/56 (25%), Positives = 30/56 (53%)

Query:    50 RDHDKVYSSVEDLLRRHENFVTNVEKAEDYQREDSGTAVFEVNKFFDLSDSDLQQL 105
             R  +   ++  + L RH  ++ +V     + RE+S +AV+ +N+F  LS  + + +
Sbjct:    30 RSREPPAAAFRESLNRHR-YLNSV-----FPRENS-SAVYGINQFSYLSPEEFKAI 78

 Score = 43 (20.2 bits), Expect = 2.2e-17, Sum P(2) = 2.2e-17
 Identities = 7/23 (30%), Positives = 16/23 (69%)

Query:   231 DSGTAVFGVNKFFDLSESDLQQL 253
             ++ +AV+G+N+F  LS  + + +
Sbjct:    56 ENSSAVYGINQFSYLSPEEFKAI 78


>UNIPROTKB|F6R7P5 [details] [associations]
            symbol:CTSH "Uncharacterized protein" species:9544 "Macaca
            mulatta" [GO:0001656 "metanephros development" evidence=ISS]
            [GO:0001913 "T cell mediated cytotoxicity" evidence=ISS]
            [GO:0002764 "immune response-regulating signaling pathway"
            evidence=ISS] [GO:0004175 "endopeptidase activity" evidence=ISS]
            [GO:0004177 "aminopeptidase activity" evidence=ISS] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=ISS] [GO:0004252
            "serine-type endopeptidase activity" evidence=ISS] [GO:0005615
            "extracellular space" evidence=ISS] [GO:0005764 "lysosome"
            evidence=ISS] [GO:0005829 "cytosol" evidence=ISS] [GO:0006508
            "proteolysis" evidence=ISS] [GO:0008234 "cysteine-type peptidase
            activity" evidence=ISS] [GO:0008284 "positive regulation of cell
            proliferation" evidence=ISS] [GO:0010628 "positive regulation of
            gene expression" evidence=ISS] [GO:0010634 "positive regulation of
            epithelial cell migration" evidence=ISS] [GO:0010813 "neuropeptide
            catabolic process" evidence=ISS] [GO:0010815 "bradykinin catabolic
            process" evidence=ISS] [GO:0010952 "positive regulation of
            peptidase activity" evidence=ISS] [GO:0016505 "apoptotic protease
            activator activity" evidence=ISS] [GO:0030108 "HLA-A specific
            activating MHC class I receptor activity" evidence=ISS] [GO:0030335
            "positive regulation of cell migration" evidence=ISS] [GO:0031638
            "zymogen activation" evidence=ISS] [GO:0031648 "protein
            destabilization" evidence=ISS] [GO:0032526 "response to retinoic
            acid" evidence=ISS] [GO:0033619 "membrane protein proteolysis"
            evidence=ISS] [GO:0043066 "negative regulation of apoptotic
            process" evidence=ISS] [GO:0043129 "surfactant homeostasis"
            evidence=ISS] [GO:0045766 "positive regulation of angiogenesis"
            evidence=ISS] [GO:0060448 "dichotomous subdivision of terminal
            units involved in lung branching" evidence=ISS] [GO:0070324
            "thyroid hormone binding" evidence=ISS] [GO:0070371 "ERK1 and ERK2
            cascade" evidence=ISS] [GO:0097208 "alveolar lamellar body"
            evidence=ISS] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 GO:GO:0005829
            GO:GO:0043066 GO:GO:0005615 GO:GO:0008284 GO:GO:0070371
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766 GO:GO:0004177
            GO:GO:0004252 GO:GO:0005764 GO:GO:0097208 GO:GO:0032526
            GO:GO:0010628 GO:GO:0070324 KO:K01366 GO:GO:0016505 GO:GO:0001656
            GO:GO:0010634 GO:GO:0004197 GO:GO:0031648 GO:GO:0097067
            GO:GO:0031638 GO:GO:0001913 GeneTree:ENSGT00660000095458 CTD:1512
            OMA:STSCHKT GO:GO:0030108 GO:GO:0010815 GO:GO:0060448 GO:GO:0002764
            GO:GO:0033619 GO:GO:0010813 GO:GO:0043129 RefSeq:XP_001108862.1
            UniGene:Mmu.3000 Ensembl:ENSMMUT00000014095 GeneID:711437
            KEGG:mcc:711437 NextBio:19969972 Uniprot:F6R7P5
        Length = 335

 Score = 285 (105.4 bits), Expect = 2.7e-28, Sum P(2) = 2.7e-28
 Identities = 72/208 (34%), Positives = 96/208 (46%)

Query:   284 AFQFNSLRHGDDLPEAFDWRAEG-VISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTE 342
             A + N LR     P + DWR +G  +S VK QG C  CW FS  G +E+  AI    +  
Sbjct:   104 ATKSNYLRGTGPYPPSMDWRKKGNFVSPVKNQGACGSCWTFSTTGALESAIAIATGKMLS 163

Query:   343 LSVQQLVDC--DMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXX 400
             L+ QQLVDC  D +N GC GG    A +YI+ N G++ +  YPY+  + +  C       
Sbjct:   164 LAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYNKGIMGEDTYPYQGKDGD--C-KFRPGK 220

Query:   401 XXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLFY-YSGGVID----------LN 449
                     + I   +EE M + VA   P+S        F  Y  G+            +N
Sbjct:   221 AIGFVKDVANITIYDEEAMVEAVALYNPVSFAFEVTQDFMIYKTGIYSSTSCHKTPDKVN 280

Query:   450 QRL----YG--TSIPYWIVKNSWGSDWG 471
               +    YG    IPYWIVKNSWG  WG
Sbjct:   281 HAVLAVGYGEENGIPYWIVKNSWGPQWG 308

 Score = 238 (88.8 bits), Expect = 2.2e-30, Sum P(3) = 2.2e-30
 Identities = 56/161 (34%), Positives = 76/161 (47%)

Query:   493 VLPSKLSRLATEKLVDC--DMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERG 550
             +   K+  LA ++LVDC  D +N GC GG    A +YI+ N G++ +  YPY+  + +  
Sbjct:   156 IATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYNKGIMGEDTYPYQGKDGD-- 213

Query:   551 CLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLFY-YSGGVIDLNQR 609
             C               + I   +EE M + VA   P+S        F  Y  G+      
Sbjct:   214 C-KFRPGKAIGFVKDVANITIYDEEAMVEAVALYNPVSFAFEVTQDFMIYKTGIYSSTSC 272

Query:   610 LCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWG 650
                P   NHA++ VGYGEE       IPYWIVKNSWG  WG
Sbjct:   273 HKTPDKVNHAVLAVGYGEEN-----GIPYWIVKNSWGPQWG 308

 Score = 118 (46.6 bits), Expect = 2.2e-30, Sum P(3) = 2.2e-30
 Identities = 25/64 (39%), Positives = 33/64 (51%)

Query:   136 AFQFNSLRHGDDLPEAFDWRAEG-VISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTE 194
             A + N LR     P + DWR +G  +S VK QG C  CW FS  G +E+  AI    +  
Sbjct:   104 ATKSNYLRGTGPYPPSMDWRKKGNFVSPVKNQGACGSCWTFSTTGALESAIAIATGKMLS 163

Query:   195 LSVQ 198
             L+ Q
Sbjct:   164 LAEQ 167

 Score = 62 (26.9 bits), Expect = 2.2e-30, Sum P(3) = 2.2e-30
 Identities = 16/59 (27%), Positives = 31/59 (52%)

Query:    45 FLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAEDYQREDSGTAVFEVNKFFDLSDSDLQ 103
             F ++M  H K YS+ E+   R + F +N  K   +   +  T    +N+F D+S ++++
Sbjct:    35 FKSWMSKHHKTYST-EEYHHRMQTFASNWRKINAHNNGNH-TFKMALNQFSDMSFAEIK 91


>ZFIN|ZDB-GENE-071004-74 [details] [associations]
            symbol:zgc:174855 "zgc:174855" species:7955 "Danio
            rerio" [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0016787 "hydrolase activity"
            evidence=IEA] [GO:0008233 "peptidase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 ZFIN:ZDB-GENE-071004-74
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            HOVERGEN:HBG011513 MEROPS:C01.032 EMBL:BX000534 EMBL:BC152282
            IPI:IPI00773140 RefSeq:NP_001096592.1 UniGene:Dr.104905 SMR:A7MCR6
            STRING:A7MCR6 Ensembl:ENSDART00000109968 GeneID:569326
            KEGG:dre:569326 NextBio:20889622 Uniprot:A7MCR6
        Length = 335

 Score = 258 (95.9 bits), Expect = 1.0e-26, Sum P(2) = 1.0e-26
 Identities = 56/153 (36%), Positives = 81/153 (52%)

Query:   297 PEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSNG 356
             P+  DWR  G ++ VK+Q +C  CW+FS+ G +E     +   L  +S Q LVDC    G
Sbjct:   116 PQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQG 175

Query:   357 --GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYG 414
               GCNGG MD A QY+ +N G+ S+Q+YPY A + +  C              +  IP G
Sbjct:   176 NQGCNGGIMDQAFQYVKENKGLDSEQSYPYLARD-DLPC-RYDPRFNVAKITGFVDIPRG 233

Query:   415 EEEEMKKWVATRGPLSVGMNAN--GLFYYSGGV 445
              E  +   VA  GP+SV ++A+   L +Y  G+
Sbjct:   234 NELALMNAVAAVGPVSVAIDASHQSLQFYQSGI 266

 Score = 257 (95.5 bits), Expect = 5.9e-30, Sum P(2) = 5.9e-30
 Identities = 60/160 (37%), Positives = 86/160 (53%)

Query:   497 KLSRLATEKLVDCDMSNG--GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXX 554
             KL  ++ + LVDC    G  GCNGG MD A QY+ +N G+ S+Q+YPY A + +  C   
Sbjct:   158 KLISMSEQNLVDCSRPQGNQGCNGGIMDQAFQYVKENKGLDSEQSYPYLARD-DLPC-RY 215

Query:   555 XXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNAN--GLFYYSGGVIDLNQRLCN 612
                        +  IP G E  +   VA  GP+SV ++A+   L +Y  G+    +R C 
Sbjct:   216 DPRFNVAKITGFVDIPRGNELALMNAVAAVGPVSVAIDASHQSLQFYQSGIY--YERACT 273

Query:   613 PKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
              +  +HA+++VGYG +   D     YWIVKNSW   WG+K
Sbjct:   274 SRL-DHAVLVVGYGYQGA-DVAGNRYWIVKNSWSDKWGDK 311

 Score = 118 (46.6 bits), Expect = 2.5e-07, Sum P(2) = 2.5e-07
 Identities = 52/200 (26%), Positives = 86/200 (43%)

Query:   198 QHHDKVYSSVEDLLRRHENFVTNVEKAEDYQSEDS-GTAVF--GVNKFFDLSESDLQQ-L 253
             QH    +  VE  + R   +  N+ K E +  E S G   F  G+N+F D++  + +Q +
Sbjct:    34 QHGKSYHEDVE--VGRRMIWEENLRKIEQHNFEYSLGNHTFKMGMNQFGDMTNEEFRQAM 91

Query:   254 TGLNLDSTLED-----IQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVI 308
              G   D          ++PS  A  +  Q D   R +    ++        + + + G +
Sbjct:    92 NGYKQDPNRTSKGALFMEPSFFA--APQQVDWRQRGY-VTPVKDQKQCGSCWSFSSTGAL 148

Query:   309 SKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSNGGCNGGRMDDALQ 368
                  +G+      F   G + +M      +L + S  Q       N GCNGG MD A Q
Sbjct:   149 -----EGQL-----FRKTGKLISMSE---QNLVDCSRPQ------GNQGCNGGIMDQAFQ 189

Query:   369 YIIDNGGVVSDQAYPYKASE 388
             Y+ +N G+ S+Q+YPY A +
Sbjct:   190 YVKENKGLDSEQSYPYLARD 209

 Score = 107 (42.7 bits), Expect = 5.9e-30, Sum P(2) = 5.9e-30
 Identities = 18/51 (35%), Positives = 29/51 (56%)

Query:   149 PEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQH 199
             P+  DWR  G ++ VK+Q +C  CW+FS+ G +E     +   L  +S Q+
Sbjct:   116 PQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQN 166

 Score = 75 (31.5 bits), Expect = 1.0e-26, Sum P(2) = 1.0e-26
 Identities = 11/15 (73%), Positives = 12/15 (80%)

Query:   459 YWIVKNSWGSDWGEK 473
             YWIVKNSW   WG+K
Sbjct:   297 YWIVKNSWSDKWGDK 311


>TAIR|locus:2030027 [details] [associations]
            symbol:AT1G29110 species:3702 "Arabidopsis thaliana"
            [GO:0006508 "proteolysis" evidence=IEA;ISS] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA;ISS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 SMART:SM00645
            EMBL:CP002684 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00639
            IPI:IPI00544534 RefSeq:NP_564322.1 UniGene:At.51816
            ProteinModelPortal:F4HZW2 SMR:F4HZW2 EnsemblPlants:AT1G29110.1
            GeneID:839786 KEGG:ath:AT1G29110 OMA:SCRANAR Uniprot:F4HZW2
        Length = 334

 Score = 268 (99.4 bits), Expect = 6.8e-30, Sum P(2) = 6.8e-30
 Identities = 68/191 (35%), Positives = 95/191 (49%)

Query:   298 EAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMS-NG 356
             E+ DWR EG ++ VK QG C              +  I G +L  LS QQL+DCD+  NG
Sbjct:   132 ESKDWRDEGAVTPVKYQGACR-------------LTKISGKNLLTLSEQQLIDCDIEKNG 178

Query:   357 GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYGEE 416
             GCNGG  ++A +YII NGGV  +  YPY+  +    C              +  +P   E
Sbjct:   179 GCNGGEFEEAFKYIIKNGGVSLETEYPYQVKKES--CRANARRAPHTQIRGFQMVPSHNE 236

Query:   417 EEMKKWVATRGPLSVGMNA--NGLFYYSGGVI-------DLNQRL----YGT--SIPYWI 461
               + + V  R P+SV ++A  +   +Y GGV        D+N  +    YGT   + YW+
Sbjct:   237 RALLEAVR-RQPVSVLIDARADSFGHYKGGVYAGLDCGTDVNHAVTIVGYGTMSGLNYWV 295

Query:   462 VKNSWGSDWGE 472
             +KNSWG  WGE
Sbjct:   296 LKNSWGESWGE 306

 Score = 247 (92.0 bits), Expect = 1.3e-27, Sum P(2) = 1.3e-27
 Identities = 59/166 (35%), Positives = 86/166 (51%)

Query:   490 LTGVLPSKLSRLATEKLVDCDMS-NGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESE 548
             LT +    L  L+ ++L+DCD+  NGGCNGG  ++A +YII NGGV  +  YPY+  +  
Sbjct:   153 LTKISGKNLLTLSEQQLIDCDIEKNGGCNGGEFEEAFKYIIKNGGVSLETEYPYQVKKES 212

Query:   549 RGCLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNA--NGLFYYSGGVI-D 605
               C              +  +P   E  + + V  R P+SV ++A  +   +Y GGV   
Sbjct:   213 --CRANARRAPHTQIRGFQMVPSHNERALLEAVR-RQPVSVLIDARADSFGHYKGGVYAG 269

Query:   606 LNQRLCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGE 651
             L+   C     NHA+ IVGYG       + + YW++KNSWG  WGE
Sbjct:   270 LD---CGTDV-NHAVTIVGYGTM-----SGLNYWVLKNSWGESWGE 306

 Score = 95 (38.5 bits), Expect = 6.8e-30, Sum P(2) = 6.8e-30
 Identities = 35/117 (29%), Positives = 56/117 (47%)

Query:   202 KVYSSVEDLLRRHENFVTNVEKAEDYQSEDSGTAVFGVNKFFDL-SESDLQQLTGLNLDS 260
             +VY    +   R + F  N++  E++ +  + +   GVN+F D  +E  L   TGL ++ 
Sbjct:    47 RVYKDESEKEMRLKVFKKNLKFIENFNNMGNQSYTLGVNEFTDWKTEEFLATHTGLRVNV 106

Query:   261 TLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVKEQGKC 317
             T      SL   F  N+T    R +  + +   D   E+ DWR EG ++ VK QG C
Sbjct:   107 T------SLSELF--NKTKPS-RNWNMSDIDMED---ESKDWRDEGAVTPVKYQGAC 151

 Score = 90 (36.7 bits), Expect = 2.3e-29, Sum P(2) = 2.3e-29
 Identities = 41/153 (26%), Positives = 67/153 (43%)

Query:    22 IKVALLESNIFQTRGY--LN--SPVTRFLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAE 77
             + +  ++  I Q R +  LN  S V     +M    +VY    +   R + F  N++  E
Sbjct:    11 LTILSMDLRISQARPHVTLNEQSIVDYHQQWMTQFSRVYKDESEKEMRLKVFKKNLKFIE 70

Query:    78 DYQREDSGTAVFEVNKFFDLSDSD-LQQLTGLNLDSTLEDIQPSLQAPFSSNQTDTEMRA 136
             ++    + +    VN+F D    + L   TGL ++ T      SL   F  N+T    R 
Sbjct:    71 NFNNMGNQSYTLGVNEFTDWKTEEFLATHTGLRVNVT------SLSELF--NKTKPS-RN 121

Query:   137 FQFNSLRHGDDLPEAFDWRAEGVISKVKEQGKC 169
             +  + +   D   E+ DWR EG ++ VK QG C
Sbjct:   122 WNMSDIDMED---ESKDWRDEGAVTPVKYQGAC 151


>DICTYBASE|DDB_G0279799 [details] [associations]
            symbol:cprB "cysteine proteinase 2" species:44689
            "Dictyostelium discoideum" [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0016787 "hydrolase activity" evidence=IEA] [GO:0008233
            "peptidase activity" evidence=IEA] [GO:0005764 "lysosome"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            dictyBase:DDB_G0279799 GenomeReviews:CM000152_GR eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            MEROPS:I29.003 KO:K01365 EMBL:AAFI02000033 EMBL:M16039 EMBL:X03344
            PIR:A25439 RefSeq:XP_641494.1 ProteinModelPortal:P04989 SMR:P04989
            EnsemblProtists:DDB0214998 GeneID:8622234 KEGG:ddi:DDB_G0279799
            OMA:YVNITAG Uniprot:P04989
        Length = 376

 Score = 284 (105.0 bits), Expect = 7.3e-30, Sum P(2) = 7.3e-30
 Identities = 74/250 (29%), Positives = 120/250 (48%)

Query:   201 DKVYSSVEDLLRRHENFVTNVEKAEDYQSEDSGTAVFGVNKFFDLSESDLQQL-TGLNLD 259
             ++ YSS E    R+  F +N++  +++ S+     V G+N F D++  + ++   G  ++
Sbjct:    44 NRQYSSSE-FSNRYSIFKSNMDYVDNWNSKGDSQTVLGLNNFADITNEEYRKTYLGTRVN 102

Query:   260 STLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVKEQGKCAC 319
             +             S N  D          L+     P++ DWR +  ++ +K+QG+C  
Sbjct:   103 A------------HSYNGYDGR-EVLNVEDLQTN---PKSIDWRTKNAVTPIKDQGQCGS 146

Query:   320 CWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCD--MSNGGCNGGRMDDALQYIIDNGGVV 377
             CW+FS  G  E  HA++   L  LS Q LVDC     N GC+GG M++A  YII N G+ 
Sbjct:   147 CWSFSTTGSTEGAHALKTKKLVSLSEQNLVDCSGPEENFGCDGGLMNNAFDYIIKNKGID 206

Query:   378 SDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNA-- 435
             ++ +YPY A E+   CL             Y  I  G E  ++   A  GP+SV ++A  
Sbjct:   207 TESSYPYTA-ETGSTCLFNKSDIGATIKG-YVNITAGSEISLENG-AQHGPVSVAIDASH 263

Query:   436 NGLFYYSGGV 445
             N    Y+ G+
Sbjct:   264 NSFQLYTSGI 273

 Score = 191 (72.3 bits), Expect = 1.0e-15, Sum P(2) = 1.0e-15
 Identities = 68/225 (30%), Positives = 104/225 (46%)

Query:   416 EEEMKKWVATRGPLSVGMNANGLFYYSGG-VIDLNQRLYGTSIPYWIVKNSWGSDWGEKV 474
             EE  K ++ TR      +NA+    Y G  V+++           W  KN+       K 
Sbjct:    90 EEYRKTYLGTR------VNAHSYNGYDGREVLNVEDLQTNPKSIDWRTKNAVTPI---KD 140

Query:   475 EDKVGS--SGNRTRDLELTGVLPSK-LSRLATEKLVDCD--MSNGGCNGGRMDDALQYII 529
             + + GS  S + T   E    L +K L  L+ + LVDC     N GC+GG M++A  YII
Sbjct:   141 QGQCGSCWSFSTTGSTEGAHALKTKKLVSLSEQNLVDCSGPEENFGCDGGLMNNAFDYII 200

Query:   530 DNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSV 589
              N G+ ++ +YPY A E+   CL             Y  I  G E  ++   A  GP+SV
Sbjct:   201 KNKGIDTESSYPYTA-ETGSTCLFNKSDIGATIKG-YVNITAGSEISLENG-AQHGPVSV 257

Query:   590 GMNA--NGLFYYSGGVIDLNQRLCNPKAQNHALIIVGYGEEEKKD 632
              ++A  N    Y+ G+    +  C+P   +H +++VGYG + K D
Sbjct:   258 AIDASHNSFQLYTSGIY--YEPKCSPTELDHGVLVVGYGVQGKDD 300

 Score = 144 (55.7 bits), Expect = 2.4e-10, Sum P(2) = 2.4e-10
 Identities = 38/158 (24%), Positives = 72/158 (45%)

Query:    43 TRFLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAEDYQREDSGTAVFEVNKFFDLSDSDL 102
             T F  +    ++ YSS E    R+  F +N++  +++  +     V  +N F D+++ + 
Sbjct:    34 TAFTEWTLKFNRQYSSSE-FSNRYSIFKSNMDYVDNWNSKGDSQTVLGLNNFADITNEEY 92

Query:   103 QQL-TGLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVIS 161
             ++   G  +++             S N  D          L+     P++ DWR +  ++
Sbjct:    93 RKTYLGTRVNA------------HSYNGYDGR-EVLNVEDLQTN---PKSIDWRTKNAVT 136

Query:   162 KVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQH 199
              +K+QG+C  CW+FS  G  E  HA++   L  LS Q+
Sbjct:   137 PIKDQGQCGSCWSFSTTGSTEGAHALKTKKLVSLSEQN 174

 Score = 78 (32.5 bits), Expect = 7.3e-30, Sum P(2) = 7.3e-30
 Identities = 12/15 (80%), Positives = 13/15 (86%)

Query:   459 YWIVKNSWGSDWGEK 473
             YWIVKNSWG+ WG K
Sbjct:   338 YWIVKNSWGTSWGIK 352

 Score = 78 (32.5 bits), Expect = 7.3e-30, Sum P(2) = 7.3e-30
 Identities = 12/15 (80%), Positives = 13/15 (86%)

Query:   638 YWIVKNSWGSDWGEK 652
             YWIVKNSWG+ WG K
Sbjct:   338 YWIVKNSWGTSWGIK 352


>TAIR|locus:2130180 [details] [associations]
            symbol:AT4G16190 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] [GO:0005773 "vacuole"
            evidence=IDA] [GO:0042744 "hydrogen peroxide catabolic process"
            evidence=RCA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 GO:GO:0005773
            EMBL:CP002687 HOGENOM:HOG000230774 KO:K01373 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 EMBL:Z97340 EMBL:AL161543 UniGene:At.25555
            EMBL:AY039556 EMBL:AY129473 EMBL:AY136316 EMBL:BT000733
            EMBL:AK226366 IPI:IPI00543588 PIR:D71428 RefSeq:NP_567489.1
            HSSP:P25779 ProteinModelPortal:Q9SUL1 SMR:Q9SUL1 STRING:Q9SUL1
            MEROPS:C01.A06 PRIDE:Q9SUL1 EnsemblPlants:AT4G16190.1 GeneID:827311
            KEGG:ath:AT4G16190 TAIR:At4g16190 InParanoid:Q9SUL1 OMA:NACGINK
            PhylomeDB:Q9SUL1 ProtClustDB:CLSN2917559 Genevestigator:Q9SUL1
            Uniprot:Q9SUL1
        Length = 373

 Score = 269 (99.8 bits), Expect = 7.7e-30, Sum P(3) = 7.7e-30
 Identities = 56/160 (35%), Positives = 85/160 (53%)

Query:   295 DLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCD-- 352
             DLP  FDWR +G ++ VK QG C  CW+FSA+G +E  H +    L  LS QQLVDCD  
Sbjct:   139 DLPTEFDWREQGAVTPVKNQGMCGSCWSFSAIGALEGAHFLATKELVSLSEQQLVDCDHE 198

Query:   353 ----MSNG---GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXX 405
                  +N    GC+GG M++A +Y +  GG++ ++ YPY   +    C            
Sbjct:   199 CDPAQANSCDSGCSGGLMNNAFEYALKAGGLMKEEDYPYTGRD-HTAC-KFDKSKIVASV 256

Query:   406 XXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLFYYSGGV 445
               +S +   +E+++   +   GPL++ +NA  +  Y GGV
Sbjct:   257 SNFSVVS-SDEDQIAANLVQHGPLAIAINAMWMQTYIGGV 295

 Score = 211 (79.3 bits), Expect = 3.3e-27, Sum P(3) = 3.3e-27
 Identities = 55/176 (31%), Positives = 91/176 (51%)

Query:   488 LELTGVLPSK-LSRLATEKLVDCD------MSNG---GCNGGRMDDALQYIIDNGGVVSD 537
             LE    L +K L  L+ ++LVDCD       +N    GC+GG M++A +Y +  GG++ +
Sbjct:   173 LEGAHFLATKELVSLSEQQLVDCDHECDPAQANSCDSGCSGGLMNNAFEYALKAGGLMKE 232

Query:   538 QAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLF 597
             + YPY   +    C              +S +   +E+++   +   GPL++ +NA  + 
Sbjct:   233 EDYPYTGRD-HTAC-KFDKSKIVASVSNFSVVS-SDEDQIAANLVQHGPLAIAINAMWMQ 289

Query:   598 YYSGGVIDLNQRLCNPKAQNHALIIVGYGEEEKKDGT--SIPYWIVKNSWGSDWGE 651
              Y GGV      +C+ K+Q+H +++VG+G            PYWI+KNSWG+ WGE
Sbjct:   290 TYIGGVSC--PYVCS-KSQDHGVLLVGFGSSGYAPIRLKEKPYWIIKNSWGAMWGE 342

 Score = 149 (57.5 bits), Expect = 3.3e-27, Sum P(3) = 3.3e-27
 Identities = 25/52 (48%), Positives = 32/52 (61%)

Query:   147 DLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
             DLP  FDWR +G ++ VK QG C  CW+FSA+G +E  H +    L  LS Q
Sbjct:   139 DLPTEFDWREQGAVTPVKNQGMCGSCWSFSAIGALEGAHFLATKELVSLSEQ 190

 Score = 84 (34.6 bits), Expect = 7.7e-30, Sum P(3) = 7.7e-30
 Identities = 12/15 (80%), Positives = 14/15 (93%)

Query:   458 PYWIVKNSWGSDWGE 472
             PYWI+KNSWG+ WGE
Sbjct:   328 PYWIIKNSWGAMWGE 342

 Score = 53 (23.7 bits), Expect = 7.7e-30, Sum P(3) = 7.7e-30
 Identities = 16/60 (26%), Positives = 29/60 (48%)

Query:    45 FLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAEDYQREDSGTAVFEVNKFFDLSDSDLQQ 104
             F  F   ++K Y++  +   R   F  N+ +A   Q  D  +AV  V +F DL+  + ++
Sbjct:    55 FTLFKSKYEKTYATQVEHDHRFRVFKANLRRARRNQLLDP-SAVHGVTQFSDLTPKEFRR 113


>GENEDB_PFALCIPARUM|PF11_0162 [details] [associations]
            symbol:PF11_0162 "falcipain-3" species:5833
            "Plasmodium falciparum" [GO:0020020 "food vacuole" evidence=TAS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00139 HSSP:P43235 EMBL:AE014186 GO:GO:0020020
            HOGENOM:HOG000065857 ProtClustDB:PTZ00021 RefSeq:XP_001347833.1
            ProteinModelPortal:Q8IIL0 SMR:Q8IIL0 MEROPS:C01.063
            EnsemblProtists:PF11_0162:mRNA GeneID:810709 KEGG:pfa:PF11_0162
            EuPathDB:PlasmoDB:PF3D7_1115400 OMA:ENDEDYW ChEMBL:CHEMBL1250373
            Uniprot:Q8IIL0
        Length = 492

 Score = 314 (115.6 bits), Expect = 9.0e-30, Sum P(2) = 9.0e-30
 Identities = 91/338 (26%), Positives = 159/338 (47%)

Query:   112 STLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVKEQGKCAC 171
             S LE++  S         T  E      N +   D+   +F  +  G + KV    + + 
Sbjct:    94 SKLEELISSYDKEKKMRTTGAEENNMNMNGIDDKDNKSVSFVNKKNGNL-KVNNNNQVSY 152

Query:   172 CWAFSAVGVVEAMHAIQGNNLTELSVQHHDKVYSSVEDLLRRHENFVTNVEKAEDYQSED 231
                F    +++ +  +   NL  + ++ ++K Y + E++ +R   F  N  K E +  + 
Sbjct:   153 SNLFDTKFLMDNLETV---NLFYIFLKENNKKYETSEEMQKRFIIFSENYRKIELHNKKT 209

Query:   232 SGTAVFGVNKFFDLSESDLQ-QLTGLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSL 290
             +     G+NKF DLS  + + +   L      + + P +   + +N  D  ++ ++    
Sbjct:   210 NSLYKRGMNKFGDLSPEEFRSKYLNLKTHGPFKTLSPPVS--YEANYEDV-IKKYKPADA 266

Query:   291 RHGDDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVD 350
             +  D +  A+DWR  G ++ VK+Q  C  CWAFS+VG VE+ +AI+  +L   S Q+LVD
Sbjct:   267 KL-DRI--AYDWRLHGGVTPVKDQALCGSCWAFSSVGSVESQYAIRKKALFLFSEQELVD 323

Query:   351 CDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSR 410
             C + N GC GG + +A   +ID GG+ S   YPY ++  E  C              Y  
Sbjct:   324 CSVKNNGCYGGYITNAFDDMIDLGGLCSQDDYPYVSNLPET-C-NLKRCNERYTIKSYVS 381

Query:   411 IPYGEEEEMKKWVATRGPLSVGMNANGLF-YYSGGVID 447
             IP   +++ K+ +   GP+S+ + A+  F +Y GG  D
Sbjct:   382 IP---DDKFKEALRYLGPISISIAASDDFAFYRGGFYD 416

 Score = 221 (82.9 bits), Expect = 8.4e-30, Sum P(2) = 8.4e-30
 Identities = 56/154 (36%), Positives = 83/154 (53%)

Query:   504 EKLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXX 563
             ++LVDC + N GC GG + +A   +ID GG+ S   YPY ++  E  C            
Sbjct:   319 QELVDCSVKNNGCYGGYITNAFDDMIDLGGLCSQDDYPYVSNLPET-C-NLKRCNERYTI 376

Query:   564 XXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLF-YYSGGVIDLNQRLCNPKAQNHALII 622
               Y  IP   +++ K+ +   GP+S+ + A+  F +Y GG  D     C   A NHA+I+
Sbjct:   377 KSYVSIP---DDKFKEALRYLGPISISIAASDDFAFYRGGFYDGE---CGA-APNHAVIL 429

Query:   623 VGYGEEE--KKDGTSIP---YWIVKNSWGSDWGE 651
             VGYG ++   +D   +    Y+I+KNSWGSDWGE
Sbjct:   430 VGYGMKDIYNEDTGRMEKFYYYIIKNSWGSDWGE 463

 Score = 186 (70.5 bits), Expect = 8.4e-30, Sum P(2) = 8.4e-30
 Identities = 49/173 (28%), Positives = 90/173 (52%)

Query:    29 SNIFQTRGYLNS--PVTRFLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAEDYQREDSGT 86
             SN+F T+  +++   V  F  F+++++K Y + E++ +R   F  N  K E + ++ +  
Sbjct:   153 SNLFDTKFLMDNLETVNLFYIFLKENNKKYETSEEMQKRFIIFSENYRKIELHNKKTNSL 212

Query:    87 AVFEVNKFFDLSDSDLQ-QLTGLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHG 145
                 +NKF DLS  + + +   L      + + P +   + +N  D  ++ ++    +  
Sbjct:   213 YKRGMNKFGDLSPEEFRSKYLNLKTHGPFKTLSPPVS--YEANYEDV-IKKYKPADAKL- 268

Query:   146 DDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
             D +  A+DWR  G ++ VK+Q  C  CWAFS+VG VE+ +AI+   L   S Q
Sbjct:   269 DRI--AYDWRLHGGVTPVKDQALCGSCWAFSSVGSVESQYAIRKKALFLFSEQ 319

 Score = 80 (33.2 bits), Expect = 9.0e-30, Sum P(2) = 9.0e-30
 Identities = 12/14 (85%), Positives = 14/14 (100%)

Query:   459 YWIVKNSWGSDWGE 472
             Y+I+KNSWGSDWGE
Sbjct:   450 YYIIKNSWGSDWGE 463


>UNIPROTKB|Q8IIL0 [details] [associations]
            symbol:PF11_0162 "Falcipain-3" species:36329 "Plasmodium
            falciparum 3D7" [GO:0020020 "food vacuole" evidence=TAS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00139 HSSP:P43235 EMBL:AE014186 GO:GO:0020020
            HOGENOM:HOG000065857 ProtClustDB:PTZ00021 RefSeq:XP_001347833.1
            ProteinModelPortal:Q8IIL0 SMR:Q8IIL0 MEROPS:C01.063
            EnsemblProtists:PF11_0162:mRNA GeneID:810709 KEGG:pfa:PF11_0162
            EuPathDB:PlasmoDB:PF3D7_1115400 OMA:ENDEDYW ChEMBL:CHEMBL1250373
            Uniprot:Q8IIL0
        Length = 492

 Score = 314 (115.6 bits), Expect = 9.0e-30, Sum P(2) = 9.0e-30
 Identities = 91/338 (26%), Positives = 159/338 (47%)

Query:   112 STLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVKEQGKCAC 171
             S LE++  S         T  E      N +   D+   +F  +  G + KV    + + 
Sbjct:    94 SKLEELISSYDKEKKMRTTGAEENNMNMNGIDDKDNKSVSFVNKKNGNL-KVNNNNQVSY 152

Query:   172 CWAFSAVGVVEAMHAIQGNNLTELSVQHHDKVYSSVEDLLRRHENFVTNVEKAEDYQSED 231
                F    +++ +  +   NL  + ++ ++K Y + E++ +R   F  N  K E +  + 
Sbjct:   153 SNLFDTKFLMDNLETV---NLFYIFLKENNKKYETSEEMQKRFIIFSENYRKIELHNKKT 209

Query:   232 SGTAVFGVNKFFDLSESDLQ-QLTGLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSL 290
             +     G+NKF DLS  + + +   L      + + P +   + +N  D  ++ ++    
Sbjct:   210 NSLYKRGMNKFGDLSPEEFRSKYLNLKTHGPFKTLSPPVS--YEANYEDV-IKKYKPADA 266

Query:   291 RHGDDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVD 350
             +  D +  A+DWR  G ++ VK+Q  C  CWAFS+VG VE+ +AI+  +L   S Q+LVD
Sbjct:   267 KL-DRI--AYDWRLHGGVTPVKDQALCGSCWAFSSVGSVESQYAIRKKALFLFSEQELVD 323

Query:   351 CDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSR 410
             C + N GC GG + +A   +ID GG+ S   YPY ++  E  C              Y  
Sbjct:   324 CSVKNNGCYGGYITNAFDDMIDLGGLCSQDDYPYVSNLPET-C-NLKRCNERYTIKSYVS 381

Query:   411 IPYGEEEEMKKWVATRGPLSVGMNANGLF-YYSGGVID 447
             IP   +++ K+ +   GP+S+ + A+  F +Y GG  D
Sbjct:   382 IP---DDKFKEALRYLGPISISIAASDDFAFYRGGFYD 416

 Score = 221 (82.9 bits), Expect = 8.4e-30, Sum P(2) = 8.4e-30
 Identities = 56/154 (36%), Positives = 83/154 (53%)

Query:   504 EKLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXX 563
             ++LVDC + N GC GG + +A   +ID GG+ S   YPY ++  E  C            
Sbjct:   319 QELVDCSVKNNGCYGGYITNAFDDMIDLGGLCSQDDYPYVSNLPET-C-NLKRCNERYTI 376

Query:   564 XXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLF-YYSGGVIDLNQRLCNPKAQNHALII 622
               Y  IP   +++ K+ +   GP+S+ + A+  F +Y GG  D     C   A NHA+I+
Sbjct:   377 KSYVSIP---DDKFKEALRYLGPISISIAASDDFAFYRGGFYDGE---CGA-APNHAVIL 429

Query:   623 VGYGEEE--KKDGTSIP---YWIVKNSWGSDWGE 651
             VGYG ++   +D   +    Y+I+KNSWGSDWGE
Sbjct:   430 VGYGMKDIYNEDTGRMEKFYYYIIKNSWGSDWGE 463

 Score = 186 (70.5 bits), Expect = 8.4e-30, Sum P(2) = 8.4e-30
 Identities = 49/173 (28%), Positives = 90/173 (52%)

Query:    29 SNIFQTRGYLNS--PVTRFLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAEDYQREDSGT 86
             SN+F T+  +++   V  F  F+++++K Y + E++ +R   F  N  K E + ++ +  
Sbjct:   153 SNLFDTKFLMDNLETVNLFYIFLKENNKKYETSEEMQKRFIIFSENYRKIELHNKKTNSL 212

Query:    87 AVFEVNKFFDLSDSDLQ-QLTGLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHG 145
                 +NKF DLS  + + +   L      + + P +   + +N  D  ++ ++    +  
Sbjct:   213 YKRGMNKFGDLSPEEFRSKYLNLKTHGPFKTLSPPVS--YEANYEDV-IKKYKPADAKL- 268

Query:   146 DDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
             D +  A+DWR  G ++ VK+Q  C  CWAFS+VG VE+ +AI+   L   S Q
Sbjct:   269 DRI--AYDWRLHGGVTPVKDQALCGSCWAFSSVGSVESQYAIRKKALFLFSEQ 319

 Score = 80 (33.2 bits), Expect = 9.0e-30, Sum P(2) = 9.0e-30
 Identities = 12/14 (85%), Positives = 14/14 (100%)

Query:   459 YWIVKNSWGSDWGE 472
             Y+I+KNSWGSDWGE
Sbjct:   450 YYIIKNSWGSDWGE 463


>MGI|MGI:1349426 [details] [associations]
            symbol:Ctsj "cathepsin J" species:10090 "Mus musculus"
            [GO:0005764 "lysosome" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] [GO:0008152 "metabolic process" evidence=ISO]
            [GO:0008233 "peptidase activity" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=ISO] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0048471 "perinuclear region
            of cytoplasm" evidence=ISO] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            MGI:MGI:1349426 eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 EMBL:AF136272
            EMBL:AF158182 EMBL:AY034579 EMBL:AK005526 EMBL:AK131661
            EMBL:BC103769 IPI:IPI00126770 RefSeq:NP_036137.1 UniGene:Mm.31948
            ProteinModelPortal:Q9R014 SMR:Q9R014 MEROPS:C01.038 PRIDE:Q9R014
            Ensembl:ENSMUST00000071526 GeneID:26898 KEGG:mmu:26898
            UCSC:uc007qwa.1 CTD:26898 InParanoid:Q9R014 KO:K09599
            NextBio:304745 Bgee:Q9R014 CleanEx:MM_CTSJ Genevestigator:Q9R014
            GermOnline:ENSMUSG00000055298 Uniprot:Q9R014
        Length = 334

 Score = 261 (96.9 bits), Expect = 1.3e-21, P = 1.3e-21
 Identities = 63/194 (32%), Positives = 95/194 (48%)

Query:   259 DSTLEDIQPSLQ-APFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVKEQGKC 317
             D T E+ + S+   P  +  TD   +    N +  G  LP+  DWR EG ++ V+ QGKC
Sbjct:    82 DQTSEEFRKSIDNIPIPAAMTDPHAQ----NHVSIG--LPDYKDWREEGYVTPVRNQGKC 135

Query:   318 ACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCD--MSNGGCNGGRMDDALQYIIDNGG 375
               CWAF+A G +E     +  +LT LSVQ L+DC   + N GC  G    A +Y++ N G
Sbjct:   136 GSCWAFAAAGAIEGQMFWKTGNLTPLSVQNLLDCSKTVGNKGCQSGTAHQAFEYVLKNKG 195

Query:   376 VVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATR--GPLSVGM 433
             + ++  YPY   E + G               Y  +P     E+  WVA    GP+S  +
Sbjct:   196 LEAEATYPY---EGKDGPCRYRSENASANITDYVNLP---PNELYLWVAVASIGPVSAAI 249

Query:   434 NAN--GLFYYSGGV 445
             +A+     +Y+GG+
Sbjct:   250 DASHDSFRFYNGGI 263

 Score = 233 (87.1 bits), Expect = 8.7e-30, Sum P(2) = 8.7e-30
 Identities = 52/160 (32%), Positives = 83/160 (51%)

Query:   498 LSRLATEKLVDCD--MSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXX 555
             L+ L+ + L+DC   + N GC  G    A +Y++ N G+ ++  YPY   E + G     
Sbjct:   158 LTPLSVQNLLDCSKTVGNKGCQSGTAHQAFEYVLKNKGLEAEATYPY---EGKDGPCRYR 214

Query:   556 XXXXXXXXXXYSRIPYGEEEEMKKWVATR--GPLSVGMNAN--GLFYYSGGVIDLNQRLC 611
                       Y  +P     E+  WVA    GP+S  ++A+     +Y+GG+    +  C
Sbjct:   215 SENASANITDYVNLP---PNELYLWVAVASIGPVSAAIDASHDSFRFYNGGIY--YEPNC 269

Query:   612 NPKAQNHALIIVGYGEE-EKKDGTSIPYWIVKNSWGSDWG 650
             +    NHA+++VGYG E + KDG +  YW++KNSWG +WG
Sbjct:   270 SSYFVNHAVLVVGYGSEGDVKDGNN--YWLIKNSWGEEWG 307

 Score = 154 (59.3 bits), Expect = 8.7e-30, Sum P(2) = 8.7e-30
 Identities = 35/90 (38%), Positives = 49/90 (54%)

Query:   111 DSTLEDIQPSLQ-APFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVKEQGKC 169
             D T E+ + S+   P  +  TD   +    N +  G  LP+  DWR EG ++ V+ QGKC
Sbjct:    82 DQTSEEFRKSIDNIPIPAAMTDPHAQ----NHVSIG--LPDYKDWREEGYVTPVRNQGKC 135

Query:   170 ACCWAFSAVGVVEAMHAIQGNNLTELSVQH 199
               CWAF+A G +E     +  NLT LSVQ+
Sbjct:   136 GSCWAFAAAGAIEGQMFWKTGNLTPLSVQN 165

 Score = 85 (35.0 bits), Expect = 1.9e-12, Sum P(2) = 1.9e-12
 Identities = 31/138 (22%), Positives = 50/138 (36%)

Query:   353 MSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASE------SERGCLXXXXXXXXXXXX 406
             + N GC  G    A +Y++ N G+ ++  YPY+  +      SE                
Sbjct:   173 VGNKGCQSGTAHQAFEYVLKNKGLEAEATYPYEGKDGPCRYRSENASANITDYVNLPPNE 232

Query:   407 XYSRIPYGEEEEMKKWV-ATRGPLSVGMNANGLFYYSGGVID--LNQRL----YGTSIP- 458
              Y  +       +   + A+          NG  YY        +N  +    YG+    
Sbjct:   233 LYLWVAVASIGPVSAAIDASHDSFRF---YNGGIYYEPNCSSYFVNHAVLVVGYGSEGDV 289

Query:   459 -----YWIVKNSWGSDWG 471
                  YW++KNSWG +WG
Sbjct:   290 KDGNNYWLIKNSWGEEWG 307


>FB|FBgn0033874 [details] [associations]
            symbol:CG6347 species:7227 "Drosophila melanogaster"
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=NAS]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            EMBL:AE013599 eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00639
            GeneTree:ENSGT00560000076599 HSSP:P53634 EMBL:AY069609
            RefSeq:NP_610906.1 UniGene:Dm.608 SMR:Q7K0S6 MEROPS:C01.A29
            EnsemblMetazoa:FBtr0087637 GeneID:36531 KEGG:dme:Dmel_CG6347
            UCSC:CG6347-RA FlyBase:FBgn0033874 InParanoid:Q7K0S6 OMA:FEYIRDH
            OrthoDB:EOG4FQZ74 GenomeRNAi:36531 NextBio:799046 Uniprot:Q7K0S6
        Length = 352

 Score = 296 (109.3 bits), Expect = 2.2e-25, P = 2.2e-25
 Identities = 102/344 (29%), Positives = 160/344 (46%)

Query:   169 CACCWAFSAVGV--VEAMHAIQGNNLTEL-SVQHHD-------KVYSSVEDLLRRHENFV 218
             C+  W    +G+  + A+   Q  +  +L  VQ+ D       KVYS  E+ + R   F 
Sbjct:     4 CSTMWLQMTLGLALLGAVSLQQLQSFPKLCDVQNFDDFLRQTGKVYSD-EERVYRESIFA 62

Query:   219 TNVEKAE-DYQSEDSGTAVF--GVNKFFDLSESDLQQLTGLNLDSTLEDIQPSLQAPFSS 275
               +       ++ D+G + F  GVN   D++  ++  L G  +              F  
Sbjct:    63 AKMSLITLSNKNADNGVSGFRLGVNTLADMTRKEIATLLGSKISE------------FGE 110

Query:   276 NQTDTEMRAFQFNSLRH--GDDLPEAFDWRAEGVISKVKEQGK-CACCWAFSAVGVVEAM 332
               T+  +    F + R+    +LPE FDWR +G ++    QG  C  CW+F+  G +E  
Sbjct:   111 RYTNGHIN---FVTARNPASANLPEMFDWREKGGVTPPGFQGVGCGACWSFATTGALEG- 166

Query:   333 HAIQGNS-LTELSVQQLVDC--DMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASES 389
             H  +    L  LS Q LVDC  D  N GC+GG  +   +YI D+G  ++++ YPY  +++
Sbjct:   167 HLFRRTGVLASLSQQNLVDCADDYGNMGCDGGFQEYGFEYIRDHGVTLANK-YPY--TQT 223

Query:   390 ERGCLXXXXXX-----XXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLFY--YS 442
             E  C                   Y+ I  G+EE+MK+ +AT GPL+  MNA+ + +  YS
Sbjct:   224 EMQCRQNETAGRPPRESLVKIRDYATITPGDEEKMKEVIATLGPLACSMNADTISFEQYS 283

Query:   443 GGVID--------LNQRL----YGTSI--PYWIVKNSWGSDWGE 472
             GG+ +        LN  +    YGT     YWI+KNS+  +WGE
Sbjct:   284 GGIYEDEECNQGELNHSVTVVGYGTENGRDYWIIKNSYSQNWGE 327

 Score = 257 (95.5 bits), Expect = 9.6e-30, Sum P(2) = 9.6e-30
 Identities = 63/170 (37%), Positives = 95/170 (55%)

Query:   491 TGVLPSKLSRLATEKLVDC--DMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESE 548
             TGVL S    L+ + LVDC  D  N GC+GG  +   +YI D+G  ++++ YPY  +++E
Sbjct:   172 TGVLAS----LSQQNLVDCADDYGNMGCDGGFQEYGFEYIRDHGVTLANK-YPY--TQTE 224

Query:   549 RGCLXXXXXX-----XXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLFY--YSG 601
               C                   Y+ I  G+EE+MK+ +AT GPL+  MNA+ + +  YSG
Sbjct:   225 MQCRQNETAGRPPRESLVKIRDYATITPGDEEKMKEVIATLGPLACSMNADTISFEQYSG 284

Query:   602 GVIDLNQRLCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGE 651
             G+ +  +  CN    NH++ +VGYG E  +D     YWI+KNS+  +WGE
Sbjct:   285 GIYEDEE--CNQGELNHSVTVVGYGTENGRD-----YWIIKNSYSQNWGE 327

 Score = 114 (45.2 bits), Expect = 9.6e-30, Sum P(2) = 9.6e-30
 Identities = 44/165 (26%), Positives = 70/165 (42%)

Query:    42 VTRFLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAE-DYQREDSGTAVFE--VNKFFDLS 98
             V  F +F+R   KVYS  E+ + R   F   +       +  D+G + F   VN   D++
Sbjct:    35 VQNFDDFLRQTGKVYSD-EERVYRESIFAAKMSLITLSNKNADNGVSGFRLGVNTLADMT 93

Query:    99 DSDLQQLTGLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRH--GDDLPEAFDWRA 156
               ++  L G  +              F    T+  +    F + R+    +LPE FDWR 
Sbjct:    94 RKEIATLLGSKISE------------FGERYTNGHIN---FVTARNPASANLPEMFDWRE 138

Query:   157 EGVISKVKEQGK-CACCWAFSAVGVVEAMHAIQGNN-LTELSVQH 199
             +G ++    QG  C  CW+F+  G +E  H  +    L  LS Q+
Sbjct:   139 KGGVTPPGFQGVGCGACWSFATTGALEG-HLFRRTGVLASLSQQN 182


>UNIPROTKB|F1P0K2 [details] [associations]
            symbol:CTSO "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076599
            OMA:SNVCGIA EMBL:AADN02016534 IPI:IPI00651180
            Ensembl:ENSGALT00000015270 Uniprot:F1P0K2
        Length = 320

 Score = 316 (116.3 bits), Expect = 1.0e-29, Sum P(2) = 1.0e-29
 Identities = 77/203 (37%), Positives = 111/203 (54%)

Query:   296 LPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSN 355
             LP+ FDWR + VI++V+ Q  C  CWAFS VG +E+ +AI+G++L ELSVQQ++DC  SN
Sbjct:   107 LPKKFDWRDKKVIAEVRNQQTCGGCWAFSVVGGIESAYAIKGHNLEELSVQQVIDCSYSN 166

Query:   356 GGCNGGRMDDALQYIIDNG-GVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPY- 413
              GC+GG    AL ++      +V D  Y +KA      C              ++   + 
Sbjct:   167 YGCSGGSTITALSWLNQTKVKLVRDSEYTFKAQTGL--CHYFPHSDFGVSITGFAAYDFS 224

Query:   414 GEEEEMKKWVATRGPLSVGMNANGLFYYSGGVIDL-------NQRL----YGTS--IPYW 460
             G+EEEM + +   GPL+V ++A     Y GG+I         N  +    + T+  IPYW
Sbjct:   225 GQEEEMMRVLVDWGPLAVTVDAVSWQDYLGGIIQYHCSSGKANHAVLITGFDTTGIIPYW 284

Query:   461 IVKNSWGSDWGEK--VEDKVGSS 481
             IV+NSWG  WG    V  K+GS+
Sbjct:   285 IVQNSWGRTWGIDGYVRVKIGSN 307

 Score = 206 (77.6 bits), Expect = 1.1e-26, Sum P(2) = 1.1e-26
 Identities = 52/156 (33%), Positives = 78/156 (50%)

Query:   498 LSRLATEKLVDCDMSNGGCNGGRMDDALQYIIDNG-GVVSDQAYPYKASESERGCLXXXX 556
             L  L+ ++++DC  SN GC+GG    AL ++      +V D  Y +KA      C     
Sbjct:   151 LEELSVQQVIDCSYSNYGCSGGSTITALSWLNQTKVKLVRDSEYTFKAQTGL--CHYFPH 208

Query:   557 XXXXXXXXXYSRIPY-GEEEEMKKWVATRGPLSVGMNANGLFYYSGGVIDLNQRLCNPKA 615
                      ++   + G+EEEM + +   GPL+V ++A     Y GG+I  +   C+   
Sbjct:   209 SDFGVSITGFAAYDFSGQEEEMMRVLVDWGPLAVTVDAVSWQDYLGGIIQYH---CSSGK 265

Query:   616 QNHALIIVGYGEEEKKDGTSI-PYWIVKNSWGSDWG 650
              NHA++I G+      D T I PYWIV+NSWG  WG
Sbjct:   266 ANHAVLITGF------DTTGIIPYWIVQNSWGRTWG 295

 Score = 159 (61.0 bits), Expect = 1.1e-26, Sum P(2) = 1.1e-26
 Identities = 32/60 (53%), Positives = 43/60 (71%)

Query:   148 LPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQHH-DKVYSS 206
             LP+ FDWR + VI++V+ Q  C  CWAFS VG +E+ +AI+G+NL ELSVQ   D  YS+
Sbjct:   107 LPKKFDWRDKKVIAEVRNQQTCGGCWAFSVVGGIESAYAIKGHNLEELSVQQVIDCSYSN 166

 Score = 43 (20.2 bits), Expect = 1.0e-29, Sum P(2) = 1.0e-29
 Identities = 8/25 (32%), Positives = 15/25 (60%)

Query:   229 SEDSGTAVFGVNKFFDLSESDLQQL 253
             S D+G+A +G N+F  L   + + +
Sbjct:    60 SNDNGSAFYGKNQFSHLFPEEFKAI 84

 Score = 37 (18.1 bits), Expect = 4.2e-15, Sum P(3) = 4.2e-15
 Identities = 6/14 (42%), Positives = 10/14 (71%)

Query:   408 YSRIPYGEEEEMKK 421
             Y ++P GEE+ + K
Sbjct:    96 YIKVPKGEEKPLPK 109


>UNIPROTKB|E1BPI9 [details] [associations]
            symbol:CTSO "Uncharacterized protein" species:9913 "Bos
            taurus" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00560000076599 CTD:1519 KO:K01374 OMA:SNVCGIA
            EMBL:DAAA02044933 IPI:IPI01004081 RefSeq:XP_002694471.2
            RefSeq:XP_874012.4 Ensembl:ENSBTAT00000014691 GeneID:616804
            KEGG:bta:616804 Uniprot:E1BPI9
        Length = 313

 Score = 317 (116.6 bits), Expect = 1.3e-29, Sum P(2) = 1.3e-29
 Identities = 77/203 (37%), Positives = 106/203 (52%)

Query:   296 LPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSN 355
             LP  FDWR + V+++V+ Q  C  CWAFS VG VE++ AI+G  L  LSVQQ++DC  SN
Sbjct:   100 LPLRFDWRDKHVVTQVRNQKTCGGCWAFSVVGAVESVCAIKGQPLEVLSVQQVIDCSYSN 159

Query:   356 GGCNGGRMDDALQYIID-NGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPY- 413
              GCNGG    AL ++      +V D  YP++A      C              YS   + 
Sbjct:   160 YGCNGGSPLSALYWLNKLQVKLVRDSEYPFQAQNGL--CRYFSDSHSGSSIKGYSAYDFS 217

Query:   414 GEEEEMKKWVATRGPLSVGMNANGLFYYSGGVI-------DLNQRLYGT------SIPYW 460
             G+E++M + +   GPL V ++A     Y GG+I       + N  +  T      SIPYW
Sbjct:   218 GQEDKMAEALLALGPLIVVVDAMSWQDYLGGIIQHHCSSGEANHAVLVTGFDKTGSIPYW 277

Query:   461 IVKNSWGSDWGEKVEDKVGSSGN 483
             IV+NSWG+ WG     +V   GN
Sbjct:   278 IVRNSWGTSWGIDGYVRVKMGGN 300

 Score = 223 (83.6 bits), Expect = 1.2e-27, Sum P(2) = 1.2e-27
 Identities = 54/155 (34%), Positives = 80/155 (51%)

Query:   498 LSRLATEKLVDCDMSNGGCNGGRMDDALQYIID-NGGVVSDQAYPYKASESERGCLXXXX 556
             L  L+ ++++DC  SN GCNGG    AL ++      +V D  YP++A      C     
Sbjct:   144 LEVLSVQQVIDCSYSNYGCNGGSPLSALYWLNKLQVKLVRDSEYPFQAQNGL--CRYFSD 201

Query:   557 XXXXXXXXXYSRIPY-GEEEEMKKWVATRGPLSVGMNANGLFYYSGGVIDLNQRLCNPKA 615
                      YS   + G+E++M + +   GPL V ++A     Y GG+I   Q  C+   
Sbjct:   202 SHSGSSIKGYSAYDFSGQEDKMAEALLALGPLIVVVDAMSWQDYLGGII---QHHCSSGE 258

Query:   616 QNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWG 650
              NHA+++ G+     K G SIPYWIV+NSWG+ WG
Sbjct:   259 ANHAVLVTGFD----KTG-SIPYWIVRNSWGTSWG 288

 Score = 139 (54.0 bits), Expect = 1.2e-27, Sum P(2) = 1.2e-27
 Identities = 30/60 (50%), Positives = 39/60 (65%)

Query:   148 LPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQHH-DKVYSS 206
             LP  FDWR + V+++V+ Q  C  CWAFS VG VE++ AI+G  L  LSVQ   D  YS+
Sbjct:   100 LPLRFDWRDKHVVTQVRNQKTCGGCWAFSVVGAVESVCAIKGQPLEVLSVQQVIDCSYSN 159

 Score = 41 (19.5 bits), Expect = 1.3e-29, Sum P(2) = 1.3e-29
 Identities = 6/12 (50%), Positives = 11/12 (91%)

Query:   231 DSGTAVFGVNKF 242
             ++ TAV+G+N+F
Sbjct:    53 ENSTAVYGINQF 64


>UNIPROTKB|G1RBY1 [details] [associations]
            symbol:CTSH "Uncharacterized protein" species:61853
            "Nomascus leucogenys" [GO:0001656 "metanephros development"
            evidence=ISS] [GO:0001913 "T cell mediated cytotoxicity"
            evidence=ISS] [GO:0002764 "immune response-regulating signaling
            pathway" evidence=ISS] [GO:0004175 "endopeptidase activity"
            evidence=ISS] [GO:0004177 "aminopeptidase activity" evidence=ISS]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0004252 "serine-type endopeptidase activity" evidence=ISS]
            [GO:0005615 "extracellular space" evidence=ISS] [GO:0005764
            "lysosome" evidence=ISS] [GO:0005829 "cytosol" evidence=ISS]
            [GO:0006508 "proteolysis" evidence=ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=ISS] [GO:0008284 "positive regulation
            of cell proliferation" evidence=ISS] [GO:0010628 "positive
            regulation of gene expression" evidence=ISS] [GO:0010634 "positive
            regulation of epithelial cell migration" evidence=ISS] [GO:0010813
            "neuropeptide catabolic process" evidence=ISS] [GO:0010815
            "bradykinin catabolic process" evidence=ISS] [GO:0010952 "positive
            regulation of peptidase activity" evidence=ISS] [GO:0016505
            "apoptotic protease activator activity" evidence=ISS] [GO:0030108
            "HLA-A specific activating MHC class I receptor activity"
            evidence=ISS] [GO:0030335 "positive regulation of cell migration"
            evidence=ISS] [GO:0031638 "zymogen activation" evidence=ISS]
            [GO:0031648 "protein destabilization" evidence=ISS] [GO:0032526
            "response to retinoic acid" evidence=ISS] [GO:0033619 "membrane
            protein proteolysis" evidence=ISS] [GO:0043066 "negative regulation
            of apoptotic process" evidence=ISS] [GO:0043129 "surfactant
            homeostasis" evidence=ISS] [GO:0045766 "positive regulation of
            angiogenesis" evidence=ISS] [GO:0060448 "dichotomous subdivision of
            terminal units involved in lung branching" evidence=ISS]
            [GO:0070324 "thyroid hormone binding" evidence=ISS] [GO:0070371
            "ERK1 and ERK2 cascade" evidence=ISS] [GO:0097208 "alveolar
            lamellar body" evidence=ISS] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005829 GO:GO:0043066 GO:GO:0005615 GO:GO:0008284
            GO:GO:0070371 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766
            GO:GO:0004177 GO:GO:0004252 GO:GO:0005764 GO:GO:0097208
            GO:GO:0032526 GO:GO:0010628 GO:GO:0070324 GO:GO:0016505
            GO:GO:0001656 GO:GO:0010634 GO:GO:0004197 GO:GO:0031648
            GO:GO:0097067 GO:GO:0031638 GO:GO:0001913
            GeneTree:ENSGT00660000095458 CTD:1512 OMA:STSCHKT GO:GO:0030108
            GO:GO:0010815 GO:GO:0060448 GO:GO:0002764 GO:GO:0033619
            GO:GO:0010813 GO:GO:0043129 EMBL:ADFV01087552 RefSeq:XP_003275518.1
            Ensembl:ENSNLET00000011249 GeneID:100584322 Uniprot:G1RBY1
        Length = 335

 Score = 283 (104.7 bits), Expect = 9.2e-28, Sum P(2) = 9.2e-28
 Identities = 72/208 (34%), Positives = 95/208 (45%)

Query:   284 AFQFNSLRHGDDLPEAFDWRAEG-VISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTE 342
             A + N LR     P + DWR +G  +S VK QG C  CW FS  G +E+  AI    +  
Sbjct:   104 ATKSNYLRGTGPYPPSMDWRKKGNFVSPVKNQGACGSCWTFSTTGALESAIAIATGKMLS 163

Query:   343 LSVQQLVDC--DMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXX 400
             L+ QQLVDC  D +N GC GG    A +YI+ N G++ +  YPY+  +   G        
Sbjct:   164 LAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYNKGIMGEDTYPYQGKD---GYCKFRPGK 220

Query:   401 XXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLFY-YSGGVID----------LN 449
                     + I   +EE M + VA   P+S        F  Y  G+            +N
Sbjct:   221 AIGFVKDVANITIYDEEAMVEAVALYNPVSFAFEVTQDFMMYRRGIYSSTSCHKTPDKVN 280

Query:   450 QRL----YG--TSIPYWIVKNSWGSDWG 471
               +    YG    IPYWIVKNSWG  WG
Sbjct:   281 HAVLAVGYGEKNGIPYWIVKNSWGPQWG 308

 Score = 236 (88.1 bits), Expect = 1.5e-29, Sum P(3) = 1.5e-29
 Identities = 57/161 (35%), Positives = 77/161 (47%)

Query:   493 VLPSKLSRLATEKLVDC--DMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERG 550
             +   K+  LA ++LVDC  D +N GC GG    A +YI+ N G++ +  YPY+  +   G
Sbjct:   156 IATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYNKGIMGEDTYPYQGKD---G 212

Query:   551 CLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLFY-YSGGVIDLNQR 609
                             + I   +EE M + VA   P+S        F  Y  G+      
Sbjct:   213 YCKFRPGKAIGFVKDVANITIYDEEAMVEAVALYNPVSFAFEVTQDFMMYRRGIYSSTSC 272

Query:   610 LCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWG 650
                P   NHA++ VGYGE   K+G  IPYWIVKNSWG  WG
Sbjct:   273 HKTPDKVNHAVLAVGYGE---KNG--IPYWIVKNSWGPQWG 308

 Score = 118 (46.6 bits), Expect = 1.5e-29, Sum P(3) = 1.5e-29
 Identities = 25/64 (39%), Positives = 33/64 (51%)

Query:   136 AFQFNSLRHGDDLPEAFDWRAEG-VISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTE 194
             A + N LR     P + DWR +G  +S VK QG C  CW FS  G +E+  AI    +  
Sbjct:   104 ATKSNYLRGTGPYPPSMDWRKKGNFVSPVKNQGACGSCWTFSTTGALESAIAIATGKMLS 163

Query:   195 LSVQ 198
             L+ Q
Sbjct:   164 LAEQ 167

 Score = 59 (25.8 bits), Expect = 1.5e-29, Sum P(3) = 1.5e-29
 Identities = 16/59 (27%), Positives = 31/59 (52%)

Query:    45 FLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAEDYQREDSGTAVFEVNKFFDLSDSDLQ 103
             F ++M  H K YS+ E+   R + F +N  K   +   +  T    +N+F D+S ++++
Sbjct:    35 FKSWMSKHHKTYST-EEYHHRLQMFASNWRKINAHNNGNH-TFKMALNQFSDMSFAEIK 91


>UNIPROTKB|P09668 [details] [associations]
            symbol:CTSH "Pro-cathepsin H" species:9606 "Homo sapiens"
            [GO:0001520 "outer dense fiber" evidence=IEA] [GO:0001669
            "acrosomal vesicle" evidence=IEA] [GO:0007283 "spermatogenesis"
            evidence=IEA] [GO:0030984 "kininogen binding" evidence=IEA]
            [GO:0032403 "protein complex binding" evidence=IEA] [GO:0043621
            "protein self-association" evidence=IEA] [GO:0005515 "protein
            binding" evidence=IPI] [GO:0031648 "protein destabilization"
            evidence=IMP] [GO:0010634 "positive regulation of epithelial cell
            migration" evidence=ISS] [GO:0032526 "response to retinoic acid"
            evidence=ISS] [GO:0060448 "dichotomous subdivision of terminal
            units involved in lung branching" evidence=ISS] [GO:0030108 "HLA-A
            specific activating MHC class I receptor activity" evidence=IDA]
            [GO:0097067 "cellular response to thyroid hormone stimulus"
            evidence=IEP] [GO:0010813 "neuropeptide catabolic process"
            evidence=IDA] [GO:0010815 "bradykinin catabolic process"
            evidence=IDA] [GO:0030335 "positive regulation of cell migration"
            evidence=IDA] [GO:0070371 "ERK1 and ERK2 cascade" evidence=IDA]
            [GO:0010628 "positive regulation of gene expression" evidence=IDA]
            [GO:0006508 "proteolysis" evidence=IDA;TAS] [GO:0031638 "zymogen
            activation" evidence=IDA] [GO:0016505 "apoptotic protease activator
            activity" evidence=IDA] [GO:0010952 "positive regulation of
            peptidase activity" evidence=IDA] [GO:0001913 "T cell mediated
            cytotoxicity" evidence=ISS] [GO:0004252 "serine-type endopeptidase
            activity" evidence=ISS] [GO:0043066 "negative regulation of
            apoptotic process" evidence=ISS] [GO:0005829 "cytosol"
            evidence=IDA] [GO:0033619 "membrane protein proteolysis"
            evidence=IDA] [GO:0004175 "endopeptidase activity" evidence=IDA]
            [GO:0004177 "aminopeptidase activity" evidence=IDA] [GO:0005764
            "lysosome" evidence=IDA] [GO:0001656 "metanephros development"
            evidence=ISS] [GO:0002250 "adaptive immune response" evidence=IEP]
            [GO:0019882 "antigen processing and presentation" evidence=TAS]
            [GO:0002764 "immune response-regulating signaling pathway"
            evidence=IDA] [GO:0005615 "extracellular space" evidence=IDA]
            [GO:0070324 "thyroid hormone binding" evidence=IDA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IDA] [GO:0008284
            "positive regulation of cell proliferation" evidence=ISS]
            [GO:0045766 "positive regulation of angiogenesis" evidence=ISS]
            [GO:0008233 "peptidase activity" evidence=IDA] [GO:0097208
            "alveolar lamellar body" evidence=IDA] [GO:0043129 "surfactant
            homeostasis" evidence=IDA] [GO:0004197 "cysteine-type endopeptidase
            activity" evidence=IDA;TAS] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005829 Reactome:REACT_6900 GO:GO:0043066 GO:GO:0005615
            GO:GO:0008284 GO:GO:0070371 eggNOG:COG4870 HOGENOM:HOG000230774
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766 GO:GO:0004177
            GO:GO:0004252 GO:GO:0005764 GO:GO:0097208 GO:GO:0032526
            GO:GO:0010628 GO:GO:0070324 KO:K01366 GO:GO:0016505 GO:GO:0001656
            GO:GO:0010634 GO:GO:0004197 GO:GO:0031648 HOVERGEN:HBG011513
            GO:GO:0097067 GO:GO:0031638 GO:GO:0001913 MEROPS:C01.040 CTD:1512
            OMA:STSCHKT OrthoDB:EOG4W9J43 GO:GO:0030108 GO:GO:0010815
            GO:GO:0060448 GO:GO:0002764 GO:GO:0033619 GO:GO:0010813
            GO:GO:0043129 EMBL:X16832 EMBL:AF426247 EMBL:AK314698 EMBL:AC011944
            EMBL:BC002479 EMBL:X07549 IPI:IPI00297487 PIR:S12486
            RefSeq:NP_004381.2 UniGene:Hs.148641 PDB:1BZN PDBsum:1BZN
            ProteinModelPortal:P09668 SMR:P09668 IntAct:P09668 STRING:P09668
            PhosphoSite:P09668 DMDM:288558851 PaxDb:P09668 PRIDE:P09668
            DNASU:1512 Ensembl:ENST00000220166 GeneID:1512 KEGG:hsa:1512
            UCSC:uc021srk.1 GeneCards:GC15M079213 H-InvDB:HIX0012481
            HGNC:HGNC:2535 HPA:CAB000458 HPA:HPA003524 MIM:116820
            neXtProt:NX_P09668 PharmGKB:PA27033 InParanoid:P09668
            PhylomeDB:P09668 BRENDA:3.4.22.16 ChEMBL:CHEMBL2225 GenomeRNAi:1512
            NextBio:6261 ArrayExpress:P09668 Bgee:P09668 CleanEx:HS_CTSH
            Genevestigator:P09668 GermOnline:ENSG00000103811 GO:GO:0019882
            Uniprot:P09668
        Length = 335

 Score = 282 (104.3 bits), Expect = 9.2e-28, Sum P(2) = 9.2e-28
 Identities = 72/208 (34%), Positives = 95/208 (45%)

Query:   284 AFQFNSLRHGDDLPEAFDWRAEG-VISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTE 342
             A + N LR     P + DWR +G  +S VK QG C  CW FS  G +E+  AI    +  
Sbjct:   104 ATKSNYLRGTGPYPPSVDWRKKGNFVSPVKNQGACGSCWTFSTTGALESAIAIATGKMLS 163

Query:   343 LSVQQLVDC--DMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXX 400
             L+ QQLVDC  D +N GC GG    A +YI+ N G++ +  YPY+  +   G        
Sbjct:   164 LAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYNKGIMGEDTYPYQGKD---GYCKFQPGK 220

Query:   401 XXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLFY-YSGGVID----------LN 449
                     + I   +EE M + VA   P+S        F  Y  G+            +N
Sbjct:   221 AIGFVKDVANITIYDEEAMVEAVALYNPVSFAFEVTQDFMMYRTGIYSSTSCHKTPDKVN 280

Query:   450 QRL----YG--TSIPYWIVKNSWGSDWG 471
               +    YG    IPYWIVKNSWG  WG
Sbjct:   281 HAVLAVGYGEKNGIPYWIVKNSWGPQWG 308

 Score = 236 (88.1 bits), Expect = 1.5e-29, Sum P(3) = 1.5e-29
 Identities = 57/161 (35%), Positives = 77/161 (47%)

Query:   493 VLPSKLSRLATEKLVDC--DMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERG 550
             +   K+  LA ++LVDC  D +N GC GG    A +YI+ N G++ +  YPY+  +   G
Sbjct:   156 IATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYNKGIMGEDTYPYQGKD---G 212

Query:   551 CLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLFY-YSGGVIDLNQR 609
                             + I   +EE M + VA   P+S        F  Y  G+      
Sbjct:   213 YCKFQPGKAIGFVKDVANITIYDEEAMVEAVALYNPVSFAFEVTQDFMMYRTGIYSSTSC 272

Query:   610 LCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWG 650
                P   NHA++ VGYGE   K+G  IPYWIVKNSWG  WG
Sbjct:   273 HKTPDKVNHAVLAVGYGE---KNG--IPYWIVKNSWGPQWG 308

 Score = 117 (46.2 bits), Expect = 1.5e-29, Sum P(3) = 1.5e-29
 Identities = 25/64 (39%), Positives = 33/64 (51%)

Query:   136 AFQFNSLRHGDDLPEAFDWRAEG-VISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTE 194
             A + N LR     P + DWR +G  +S VK QG C  CW FS  G +E+  AI    +  
Sbjct:   104 ATKSNYLRGTGPYPPSVDWRKKGNFVSPVKNQGACGSCWTFSTTGALESAIAIATGKMLS 163

Query:   195 LSVQ 198
             L+ Q
Sbjct:   164 LAEQ 167

 Score = 60 (26.2 bits), Expect = 1.5e-29, Sum P(3) = 1.5e-29
 Identities = 16/59 (27%), Positives = 31/59 (52%)

Query:    45 FLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAEDYQREDSGTAVFEVNKFFDLSDSDLQ 103
             F ++M  H K YS+ E+   R + F +N  K   +   +  T    +N+F D+S ++++
Sbjct:    35 FKSWMSKHRKTYST-EEYHHRLQTFASNWRKINAHNNGNH-TFKMALNQFSDMSFAEIK 91


>MGI|MGI:107285 [details] [associations]
            symbol:Ctsh "cathepsin H" species:10090 "Mus musculus"
            [GO:0001520 "outer dense fiber" evidence=ISO] [GO:0001669
            "acrosomal vesicle" evidence=ISO] [GO:0001913 "T cell mediated
            cytotoxicity" evidence=IGI] [GO:0002764 "immune response-regulating
            signaling pathway" evidence=ISO] [GO:0004175 "endopeptidase
            activity" evidence=ISO;IMP] [GO:0004177 "aminopeptidase activity"
            evidence=ISO] [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=ISO;IDA] [GO:0004252 "serine-type endopeptidase activity"
            evidence=IMP] [GO:0005615 "extracellular space" evidence=ISO]
            [GO:0005764 "lysosome" evidence=ISO] [GO:0005829 "cytosol"
            evidence=ISO] [GO:0006508 "proteolysis" evidence=ISO;IMP]
            [GO:0008233 "peptidase activity" evidence=ISO] [GO:0008234
            "cysteine-type peptidase activity" evidence=ISO] [GO:0008284
            "positive regulation of cell proliferation" evidence=IMP]
            [GO:0010628 "positive regulation of gene expression" evidence=ISO]
            [GO:0010634 "positive regulation of epithelial cell migration"
            evidence=IMP] [GO:0010813 "neuropeptide catabolic process"
            evidence=ISO] [GO:0010815 "bradykinin catabolic process"
            evidence=ISO] [GO:0010952 "positive regulation of peptidase
            activity" evidence=IGI;ISO] [GO:0016505 "apoptotic protease
            activator activity" evidence=IGI;ISO] [GO:0016787 "hydrolase
            activity" evidence=IEA] [GO:0030108 "HLA-A specific activating MHC
            class I receptor activity" evidence=ISO] [GO:0030335 "positive
            regulation of cell migration" evidence=ISO] [GO:0030984 "kininogen
            binding" evidence=ISO] [GO:0031638 "zymogen activation"
            evidence=ISO;IMP] [GO:0031648 "protein destabilization"
            evidence=ISO;IMP] [GO:0032403 "protein complex binding"
            evidence=ISO] [GO:0032526 "response to retinoic acid" evidence=IDA]
            [GO:0033619 "membrane protein proteolysis" evidence=ISO;IMP]
            [GO:0035085 "cilium axoneme" evidence=ISO] [GO:0043066 "negative
            regulation of apoptotic process" evidence=IMP] [GO:0043129
            "surfactant homeostasis" evidence=ISO] [GO:0043621 "protein
            self-association" evidence=ISO] [GO:0045766 "positive regulation of
            angiogenesis" evidence=IMP] [GO:0060448 "dichotomous subdivision of
            terminal units involved in lung branching" evidence=IMP]
            [GO:0070324 "thyroid hormone binding" evidence=ISO] [GO:0070371
            "ERK1 and ERK2 cascade" evidence=ISO] [GO:0097208 "alveolar
            lamellar body" evidence=ISO] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            MGI:MGI:107285 GO:GO:0005829 GO:GO:0043066 GO:GO:0005615
            GO:GO:0008284 GO:GO:0070371 eggNOG:COG4870 HOGENOM:HOG000230774
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766 GO:GO:0004177
            GO:GO:0004252 GO:GO:0005764 GO:GO:0097208 GO:GO:0032526
            GO:GO:0010628 GO:GO:0070324 KO:K01366 EMBL:CH466560 GO:GO:0016505
            GO:GO:0001656 GO:GO:0010634 GO:GO:0004197 GO:GO:0031648
            HOVERGEN:HBG011513 GO:GO:0097067 GO:GO:0031638 GO:GO:0001913
            GeneTree:ENSGT00660000095458 CTD:1512 OMA:STSCHKT OrthoDB:EOG4W9J43
            GO:GO:0030108 GO:GO:0010815 GO:GO:0060448 GO:GO:0002764
            GO:GO:0033619 GO:GO:0010813 GO:GO:0043129 BRENDA:3.4.22.16
            EMBL:U06119 EMBL:AK149949 EMBL:AK150583 EMBL:AK157376 EMBL:AK160026
            EMBL:Y18464 IPI:IPI00118987 RefSeq:NP_031827.2 UniGene:Mm.2277
            ProteinModelPortal:P49935 SMR:P49935 STRING:P49935 MEROPS:I29.003
            PhosphoSite:P49935 PaxDb:P49935 PRIDE:P49935
            Ensembl:ENSMUST00000034915 GeneID:13036 KEGG:mmu:13036
            InParanoid:Q3UCD6 ChEMBL:CHEMBL1949491 NextBio:282920 Bgee:P49935
            CleanEx:MM_CTSH Genevestigator:P49935 GermOnline:ENSMUSG00000032359
            Uniprot:P49935
        Length = 333

 Score = 281 (104.0 bits), Expect = 3.2e-29, Sum P(2) = 3.2e-29
 Identities = 73/209 (34%), Positives = 96/209 (45%)

Query:   284 AFQFNSLRHGDDLPEAFDWRAEG-VISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTE 342
             A + N LR     P + DWR +G V+S VK QG C  CW FS  G +E+  AI    +  
Sbjct:   102 ATKSNYLRGTGPYPSSMDWRKKGNVVSPVKNQGACGSCWTFSTTGALESAVAIASGKMLS 161

Query:   343 LSVQQLVDCDMS--NGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXX 400
             L+ QQLVDC  +  N GC GG    A +YI+ N G++ + +YPY   +S   C       
Sbjct:   162 LAEQQLVDCAQAFNNHGCKGGLPSQAFEYILYNKGIMEEDSYPYIGKDSS--CRFNPQKA 219

Query:   401 XXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLFY-YSGGVID----------LN 449
                     + I   +E  M + VA   P+S        F  Y  GV            +N
Sbjct:   220 VAFVKNVVN-ITLNDEAAMVEAVALYNPVSFAFEVTEDFLMYKSGVYSSKSCHKTPDKVN 278

Query:   450 QRL----YG--TSIPYWIVKNSWGSDWGE 472
               +    YG    + YWIVKNSWGS WGE
Sbjct:   279 HAVLAVGYGEQNGLLYWIVKNSWGSQWGE 307

 Score = 223 (83.6 bits), Expect = 1.5e-29, Sum P(3) = 1.5e-29
 Identities = 55/162 (33%), Positives = 75/162 (46%)

Query:   493 VLPSKLSRLATEKLVDCDMS--NGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERG 550
             +   K+  LA ++LVDC  +  N GC GG    A +YI+ N G++ + +YPY   +S   
Sbjct:   154 IASGKMLSLAEQQLVDCAQAFNNHGCKGGLPSQAFEYILYNKGIMEEDSYPYIGKDSS-- 211

Query:   551 CLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLFY-YSGGVIDLNQR 609
             C               + I   +E  M + VA   P+S        F  Y  GV      
Sbjct:   212 CRFNPQKAVAFVKNVVN-ITLNDEAAMVEAVALYNPVSFAFEVTEDFLMYKSGVYSSKSC 270

Query:   610 LCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGE 651
                P   NHA++ VGYGE+       + YWIVKNSWGS WGE
Sbjct:   271 HKTPDKVNHAVLAVGYGEQN-----GLLYWIVKNSWGSQWGE 307

 Score = 126 (49.4 bits), Expect = 1.5e-29, Sum P(3) = 1.5e-29
 Identities = 26/64 (40%), Positives = 34/64 (53%)

Query:   136 AFQFNSLRHGDDLPEAFDWRAEG-VISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTE 194
             A + N LR     P + DWR +G V+S VK QG C  CW FS  G +E+  AI    +  
Sbjct:   102 ATKSNYLRGTGPYPSSMDWRKKGNVVSPVKNQGACGSCWTFSTTGALESAVAIASGKMLS 161

Query:   195 LSVQ 198
             L+ Q
Sbjct:   162 LAEQ 165

 Score = 75 (31.5 bits), Expect = 3.2e-29, Sum P(2) = 3.2e-29
 Identities = 24/84 (28%), Positives = 39/84 (46%)

Query:   169 CACCWAFSAVGVVE-AMHAIQGNNLTELSVQHHDKVYSSVEDLLRRHENFVTNVEKAEDY 227
             CA  W  S     E  ++AI+  +      QH  K YSSVE    R + F  N  K + +
Sbjct:     9 CAGAWLLSTGATAELTVNAIEKFHFKSWMKQHQ-KTYSSVE-YNHRLQMFANNWRKIQAH 66

Query:   228 QSEDSGTAVFGVNKFFDLSESDLQ 251
                +  T    +N+F D+S ++++
Sbjct:    67 NQRNH-TFKMALNQFSDMSFAEIK 89

 Score = 73 (30.8 bits), Expect = 1.5e-29, Sum P(3) = 1.5e-29
 Identities = 18/59 (30%), Positives = 33/59 (55%)

Query:    45 FLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAEDYQREDSGTAVFEVNKFFDLSDSDLQ 103
             F ++M+ H K YSSVE    R + F  N  K + + + +  T    +N+F D+S ++++
Sbjct:    33 FKSWMKQHQKTYSSVE-YNHRLQMFANNWRKIQAHNQRNH-TFKMALNQFSDMSFAEIK 89


>UNIPROTKB|P43235 [details] [associations]
            symbol:CTSK "Cathepsin K" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0001957
            "intramembranous ossification" evidence=IEA] [GO:0005615
            "extracellular space" evidence=IEA] [GO:0045453 "bone resorption"
            evidence=IEA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=TAS] [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=TAS] [GO:0036021 "endolysosome lumen" evidence=TAS]
            [GO:0045087 "innate immune response" evidence=TAS]
            InterPro:IPR000668 InterPro:IPR013128 InterPro:IPR015644
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            Reactome:REACT_118779 Reactome:REACT_6900 GO:GO:0005615
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045087 GO:GO:0045453
            EMBL:CH471121 EMBL:AL355860 GO:GO:0004197 GO:GO:0001957
            HOVERGEN:HBG011513 GO:GO:0036021 MEROPS:I29.007 CTD:1513 KO:K01371
            OrthoDB:EOG4SJ5FC PANTHER:PTHR12411:SF55 EMBL:U13665 EMBL:X82153
            EMBL:U20280 EMBL:S79895 EMBL:CR541675 EMBL:AL356292 EMBL:BC016058
            IPI:IPI00300599 PIR:JC2476 RefSeq:NP_000387.1 UniGene:Hs.632466
            PDB:1ATK PDB:1AU0 PDB:1AU2 PDB:1AU3 PDB:1AU4 PDB:1AYU PDB:1AYV
            PDB:1AYW PDB:1BGO PDB:1BY8 PDB:1MEM PDB:1NL6 PDB:1NLJ PDB:1Q6K
            PDB:1SNK PDB:1TU6 PDB:1U9V PDB:1U9W PDB:1U9X PDB:1VSN PDB:1YK7
            PDB:1YK8 PDB:1YT7 PDB:2ATO PDB:2AUX PDB:2AUZ PDB:2BDL PDB:2R6N
            PDB:3C9E PDB:3H7D PDB:3KW9 PDB:3KWB PDB:3KWZ PDB:3KX1 PDB:3O0U
            PDB:3O1G PDB:3OVZ PDB:4DMX PDB:4DMY PDB:7PCK PDBsum:1ATK
            PDBsum:1AU0 PDBsum:1AU2 PDBsum:1AU3 PDBsum:1AU4 PDBsum:1AYU
            PDBsum:1AYV PDBsum:1AYW PDBsum:1BGO PDBsum:1BY8 PDBsum:1MEM
            PDBsum:1NL6 PDBsum:1NLJ PDBsum:1Q6K PDBsum:1SNK PDBsum:1TU6
            PDBsum:1U9V PDBsum:1U9W PDBsum:1U9X PDBsum:1VSN PDBsum:1YK7
            PDBsum:1YK8 PDBsum:1YT7 PDBsum:2ATO PDBsum:2AUX PDBsum:2AUZ
            PDBsum:2BDL PDBsum:2R6N PDBsum:3C9E PDBsum:3H7D PDBsum:3KW9
            PDBsum:3KWB PDBsum:3KWZ PDBsum:3KX1 PDBsum:3O0U PDBsum:3O1G
            PDBsum:3OVZ PDBsum:4DMX PDBsum:4DMY PDBsum:7PCK
            ProteinModelPortal:P43235 SMR:P43235 DIP:DIP-39993N IntAct:P43235
            STRING:P43235 PhosphoSite:P43235 DMDM:1168793 PaxDb:P43235
            PRIDE:P43235 DNASU:1513 Ensembl:ENST00000271651 GeneID:1513
            KEGG:hsa:1513 UCSC:uc001evp.2 GeneCards:GC01M150768 HGNC:HGNC:2536
            MIM:265800 MIM:601105 neXtProt:NX_P43235 Orphanet:763
            PharmGKB:PA27034 InParanoid:P43235 OMA:LKVPPSH PhylomeDB:P43235
            BindingDB:P43235 ChEMBL:CHEMBL268 EvolutionaryTrace:P43235
            GenomeRNAi:1513 NextBio:6267 ArrayExpress:P43235 Bgee:P43235
            CleanEx:HS_CTSK CleanEx:HS_CTSO Genevestigator:P43235
            GermOnline:ENSG00000143387 Uniprot:P43235
        Length = 329

 Score = 334 (122.6 bits), Expect = 1.6e-29, P = 1.6e-29
 Identities = 93/296 (31%), Positives = 147/296 (49%)

Query:   194 ELSVQHHDKVYSSVEDLLRRHENFVTNVEKAEDYQSEDSGTAVFGVNKFFDLSESDLQQL 253
             EL  + H K Y++  D + R   +  N++    Y S  +  A  GV+ + +L+ + L  +
Sbjct:    27 ELWKKTHRKQYNNKVDEISRRLIWEKNLK----YISIHNLEASLGVHTY-ELAMNHLGDM 81

Query:   254 TGLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVKE 313
             T    +  ++ +   L+ P S ++++  +   ++         P++ D+R +G ++ VK 
Sbjct:    82 TS---EEVVQKMT-GLKVPLSHSRSNDTLYIPEWEGRA-----PDSVDYRKKGYVTPVKN 132

Query:   314 QGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSNGGCNGGRMDDALQYIIDN 373
             QG+C  CWAFS+VG +E     +   L  LS Q LVDC   N GC GG M +A QY+  N
Sbjct:   133 QGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTNAFQYVQKN 192

Query:   374 GGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGM 433
              G+ S+ AYPY   E    C+             Y  IP G E+ +K+ VA  GP+SV +
Sbjct:   193 RGIDSEDAYPYVGQEES--CMYNPTGKAAKCRG-YREIPEGNEKALKRAVARVGPVSVAI 249

Query:   434 NAN--GLFYYSGGVI--------DLNQRL----YGTSI--PYWIVKNSWGSDWGEK 473
             +A+     +YS GV         +LN  +    YG      +WI+KNSWG +WG K
Sbjct:   250 DASLTSFQFYSKGVYYDESCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNK 305

 Score = 282 (104.3 bits), Expect = 7.1e-24, P = 7.1e-24
 Identities = 84/244 (34%), Positives = 119/244 (48%)

Query:   416 EEEMKKWVATRGPLSVGMNANGLFY--YSGGVID-LNQRLYGTSIPYWIVKNSW--GSDW 470
             EE ++K    + PLS   + + L+   + G   D ++ R  G   P   VKN    GS W
Sbjct:    84 EEVVQKMTGLKVPLSHSRSNDTLYIPEWEGRAPDSVDYRKKGYVTP---VKNQGQCGSCW 140

Query:   471 GEKVEDKVGSSGNRTRDLELTGVLPSKLSRLATEKLVDCDMSNGGCNGGRMDDALQYIID 530
                    VG+   + +  + TG    KL  L+ + LVDC   N GC GG M +A QY+  
Sbjct:   141 AFS---SVGALEGQLK--KKTG----KLLNLSPQNLVDCVSENDGCGGGYMTNAFQYVQK 191

Query:   531 NGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVG 590
             N G+ S+ AYPY   E    C+             Y  IP G E+ +K+ VA  GP+SV 
Sbjct:   192 NRGIDSEDAYPYVGQEES--CMYNPTGKAAKCRG-YREIPEGNEKALKRAVARVGPVSVA 248

Query:   591 MNAN--GLFYYSGGVIDLNQRLCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSD 648
             ++A+     +YS GV       CN    NHA++ VGYG ++   G    +WI+KNSWG +
Sbjct:   249 IDASLTSFQFYSKGVY--YDESCNSDNLNHAVLAVGYGIQK---GNK--HWIIKNSWGEN 301

Query:   649 WGEK 652
             WG K
Sbjct:   302 WGNK 305

 Score = 117 (46.2 bits), Expect = 0.00071, P = 0.00071
 Identities = 21/51 (41%), Positives = 32/51 (62%)

Query:   149 PEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQH 199
             P++ D+R +G ++ VK QG+C  CWAFS+VG +E     +   L  LS Q+
Sbjct:   116 PDSVDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQN 166


>RGD|2447 [details] [associations]
            symbol:Ctsh "cathepsin H" species:10116 "Rattus norvegicus"
          [GO:0001520 "outer dense fiber" evidence=IDA] [GO:0001656
          "metanephros development" evidence=IEP] [GO:0001669 "acrosomal
          vesicle" evidence=IDA] [GO:0001913 "T cell mediated cytotoxicity"
          evidence=ISO;ISS] [GO:0002250 "adaptive immune response"
          evidence=ISO] [GO:0002764 "immune response-regulating signaling
          pathway" evidence=ISO;ISS] [GO:0004175 "endopeptidase activity"
          evidence=ISO] [GO:0004177 "aminopeptidase activity" evidence=ISO;IDA]
          [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISO;ISS]
          [GO:0004252 "serine-type endopeptidase activity" evidence=ISO;ISS]
          [GO:0005615 "extracellular space" evidence=ISO;ISS;IDA] [GO:0005764
          "lysosome" evidence=ISO;ISS;IDA] [GO:0005829 "cytosol"
          evidence=ISO;ISS] [GO:0006508 "proteolysis" evidence=IEP;ISO]
          [GO:0007283 "spermatogenesis" evidence=IEP] [GO:0008233 "peptidase
          activity" evidence=ISO] [GO:0008234 "cysteine-type peptidase
          activity" evidence=ISO] [GO:0008284 "positive regulation of cell
          proliferation" evidence=ISO;ISS] [GO:0010628 "positive regulation of
          gene expression" evidence=ISO;ISS] [GO:0010634 "positive regulation
          of epithelial cell migration" evidence=ISO;ISS] [GO:0010813
          "neuropeptide catabolic process" evidence=ISO;ISS] [GO:0010815
          "bradykinin catabolic process" evidence=ISO;ISS] [GO:0010952
          "positive regulation of peptidase activity" evidence=ISO;ISS]
          [GO:0016505 "apoptotic protease activator activity" evidence=ISO;ISS]
          [GO:0030108 "HLA-A specific activating MHC class I receptor activity"
          evidence=ISO;ISS] [GO:0030335 "positive regulation of cell migration"
          evidence=ISO;ISS] [GO:0030984 "kininogen binding" evidence=IPI]
          [GO:0031638 "zymogen activation" evidence=ISO;ISS] [GO:0031648
          "protein destabilization" evidence=ISO;ISS] [GO:0032403 "protein
          complex binding" evidence=IPI] [GO:0032526 "response to retinoic
          acid" evidence=ISO;ISS] [GO:0033619 "membrane protein proteolysis"
          evidence=ISO;ISS] [GO:0035085 "cilium axoneme" evidence=IDA]
          [GO:0043066 "negative regulation of apoptotic process"
          evidence=ISO;ISS] [GO:0043129 "surfactant homeostasis"
          evidence=ISO;ISS] [GO:0043621 "protein self-association"
          evidence=IDA] [GO:0045766 "positive regulation of angiogenesis"
          evidence=ISO;ISS] [GO:0060448 "dichotomous subdivision of terminal
          units involved in lung branching" evidence=ISO;ISS] [GO:0070324
          "thyroid hormone binding" evidence=ISO;ISS] [GO:0070371 "ERK1 and
          ERK2 cascade" evidence=ISO;ISS] [GO:0097067 "cellular response to
          thyroid hormone stimulus" evidence=ISO;IEP] [GO:0097208 "alveolar
          lamellar body" evidence=ISO;ISS;IDA] InterPro:IPR000668
          InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
          InterPro:IPR000169 RGD:2447 GO:GO:0005829 GO:GO:0043066 GO:GO:0005615
          GO:GO:0008284 GO:GO:0070371 GO:GO:0001669 eggNOG:COG4870
          HOGENOM:HOG000230774 InterPro:IPR025661 InterPro:IPR025660
          InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
          PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0007283
          GO:GO:0045766 GO:GO:0004177 GO:GO:0004252 GO:GO:0005764 GO:GO:0097208
          GO:GO:0043621 GO:GO:0032526 GO:GO:0010628 GO:GO:0070324 KO:K01366
          GO:GO:0016505 GO:GO:0001656 GO:GO:0010634 GO:GO:0004197 GO:GO:0031648
          HOVERGEN:HBG011513 GO:GO:0097067 GO:GO:0031638 GO:GO:0001913
          GeneTree:ENSGT00660000095458 CTD:1512 OMA:STSCHKT OrthoDB:EOG4W9J43
          GO:GO:0030108 GO:GO:0010815 GO:GO:0060448 GO:GO:0002764 GO:GO:0033619
          GO:GO:0010813 GO:GO:0043129 MEROPS:I29.003 EMBL:Y00708 EMBL:BC085352
          EMBL:M38135 IPI:IPI00212809 PIR:S00211 RefSeq:NP_037071.1
          UniGene:Rn.1997 ProteinModelPortal:P00786 SMR:P00786 STRING:P00786
          PRIDE:P00786 Ensembl:ENSRNOT00000019285 GeneID:25425 KEGG:rno:25425
          UCSC:RGD:2447 InParanoid:P00786 BindingDB:P00786 NextBio:606599
          Genevestigator:P00786 GermOnline:ENSRNOG00000014064 GO:GO:0035086
          GO:GO:0001520 Uniprot:P00786
        Length = 333

 Score = 275 (101.9 bits), Expect = 1.7e-29, Sum P(2) = 1.7e-29
 Identities = 71/208 (34%), Positives = 96/208 (46%)

Query:   284 AFQFNSLRHGDDLPEAFDWRAEG-VISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTE 342
             A + N LR     P + DWR +G V+S VK QG C  CW FS  G +E+  AI    +  
Sbjct:   102 ATKSNYLRGTGPYPSSMDWRKKGNVVSPVKNQGACGSCWTFSTTGALESAVAIASGKMMT 161

Query:   343 LSVQQLVDC--DMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXX 400
             L+ QQLVDC  + +N GC GG    A +YI+ N G++ + +YPY     +  C       
Sbjct:   162 LAEQQLVDCAQNFNNHGCQGGLPSQAFEYILYNKGIMGEDSYPYIGKNGQ--CKFNPEKA 219

Query:   401 XXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLFY-YSGGVIDLNQ------RL- 452
                     + I   +E  M + VA   P+S        F  Y  GV   N       ++ 
Sbjct:   220 VAFVKNVVN-ITLNDEAAMVEAVALYNPVSFAFEVTEDFMMYKSGVYSSNSCHKTPDKVN 278

Query:   453 -------YG--TSIPYWIVKNSWGSDWG 471
                    YG    + YWIVKNSWGS+WG
Sbjct:   279 HAVLAVGYGEQNGLLYWIVKNSWGSNWG 306

 Score = 225 (84.3 bits), Expect = 5.1e-29, Sum P(3) = 5.1e-29
 Identities = 54/161 (33%), Positives = 76/161 (47%)

Query:   493 VLPSKLSRLATEKLVDC--DMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERG 550
             +   K+  LA ++LVDC  + +N GC GG    A +YI+ N G++ + +YPY     +  
Sbjct:   154 IASGKMMTLAEQQLVDCAQNFNNHGCQGGLPSQAFEYILYNKGIMGEDSYPYIGKNGQ-- 211

Query:   551 CLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLFY-YSGGVIDLNQR 609
             C               + I   +E  M + VA   P+S        F  Y  GV   N  
Sbjct:   212 CKFNPEKAVAFVKNVVN-ITLNDEAAMVEAVALYNPVSFAFEVTEDFMMYKSGVYSSNSC 270

Query:   610 LCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWG 650
                P   NHA++ VGYGE+       + YWIVKNSWGS+WG
Sbjct:   271 HKTPDKVNHAVLAVGYGEQN-----GLLYWIVKNSWGSNWG 306

 Score = 125 (49.1 bits), Expect = 5.1e-29, Sum P(3) = 5.1e-29
 Identities = 26/64 (40%), Positives = 34/64 (53%)

Query:   136 AFQFNSLRHGDDLPEAFDWRAEG-VISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTE 194
             A + N LR     P + DWR +G V+S VK QG C  CW FS  G +E+  AI    +  
Sbjct:   102 ATKSNYLRGTGPYPSSMDWRKKGNVVSPVKNQGACGSCWTFSTTGALESAVAIASGKMMT 161

Query:   195 LSVQ 198
             L+ Q
Sbjct:   162 LAEQ 165

 Score = 84 (34.6 bits), Expect = 1.7e-29, Sum P(2) = 1.7e-29
 Identities = 26/84 (30%), Positives = 41/84 (48%)

Query:   169 CACCWAFSAVGVVE-AMHAIQGNNLTELSVQHHDKVYSSVEDLLRRHENFVTNVEKAEDY 227
             CA  W  SA    E  ++AI+  + T    QH  K YSS E    R + F  N  K + +
Sbjct:     9 CAGAWLLSAGATAELTVNAIEKFHFTSWMKQHQ-KTYSSRE-YSHRLQVFANNWRKIQAH 66

Query:   228 QSEDSGTAVFGVNKFFDLSESDLQ 251
                +  T   G+N+F D+S ++++
Sbjct:    67 NQRNH-TFKMGLNQFSDMSFAEIK 89

 Score = 66 (28.3 bits), Expect = 5.1e-29, Sum P(3) = 5.1e-29
 Identities = 17/59 (28%), Positives = 32/59 (54%)

Query:    45 FLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAEDYQREDSGTAVFEVNKFFDLSDSDLQ 103
             F ++M+ H K YSS E    R + F  N  K + + + +  T    +N+F D+S ++++
Sbjct:    33 FTSWMKQHQKTYSSRE-YSHRLQVFANNWRKIQAHNQRNH-TFKMGLNQFSDMSFAEIK 89


>RGD|1308181 [details] [associations]
            symbol:Ctsf "cathepsin F" species:10116 "Rattus norvegicus"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 RGD:1308181 eggNOG:COG4870 HOGENOM:HOG000230774
            KO:K01373 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00139 PROSITE:PS00639 HOVERGEN:HBG011513 MEROPS:C01.018
            CTD:8722 OMA:LAPPEWD OrthoDB:EOG4CC41T GeneTree:ENSGT00660000095458
            EMBL:CH473953 EMBL:BC099780 EMBL:EU253481 IPI:IPI00201100
            RefSeq:NP_001029282.1 UniGene:Rn.25087 SMR:Q499S6
            Ensembl:ENSRNOT00000026718 GeneID:361704 KEGG:rno:361704
            UCSC:RGD:1308181 InParanoid:Q499S6 NextBio:677325
            Genevestigator:Q499S6 Uniprot:Q499S6
        Length = 462

 Score = 298 (110.0 bits), Expect = 2.2e-29, Sum P(2) = 2.2e-29
 Identities = 69/197 (35%), Positives = 101/197 (51%)

Query:   294 DDL-PEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCD 352
             +DL P  +DWR +G +++VK+QG C  CWAFS  G VE    +   +L  LS Q+L+DCD
Sbjct:   246 NDLAPPEWDWRKKGAVTEVKDQGMCGSCWAFSVTGNVEGQWFLNRGTLLSLSEQELLDCD 305

Query:   353 MSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIP 412
               +  C GG   +A   I + GG+ ++  Y Y+                       SR  
Sbjct:   306 KMDKACMGGLPSNAYTAIKNLGGLETEDDYGYQGHVQACNFSTQMAKVYINDSVELSR-- 363

Query:   413 YGEEEEMKKWVATRGPLSVGMNANGLFYYSGGV------------IDLNQRL--YG--TS 456
               +E ++  W+A +GP+SV +NA G+ +Y  G+            ID    L  YG  ++
Sbjct:   364 --DENKIAAWLAQKGPISVAINAFGMQFYRHGIAHPFRPLCSPWFIDHAVLLVGYGNRSN 421

Query:   457 IPYWIVKNSWGSDWGEK 473
             IPYW +KNSWG DWGE+
Sbjct:   422 IPYWAIKNSWGRDWGEE 438

 Score = 244 (91.0 bits), Expect = 8.6e-23, Sum P(2) = 8.6e-23
 Identities = 64/214 (29%), Positives = 106/214 (49%)

Query:   442 SGGVIDLNQRLYGTSIPYWIVKNSWGSDWGEKVEDKVGS--SGNRTRDLELTGVLP-SKL 498
             SGG + L + +   + P W  +   G+    K +   GS  + + T ++E    L    L
Sbjct:   235 SGGKMSLAKSINDLAPPEWDWRKK-GAVTEVKDQGMCGSCWAFSVTGNVEGQWFLNRGTL 293

Query:   499 SRLATEKLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXX 558
               L+ ++L+DCD  +  C GG   +A   I + GG+ ++  Y Y+               
Sbjct:   294 LSLSEQELLDCDKMDKACMGGLPSNAYTAIKNLGGLETEDDYGYQGHVQACNFSTQMAKV 353

Query:   559 XXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLFYYSGGVIDLNQRLCNPKAQNH 618
                     SR    +E ++  W+A +GP+SV +NA G+ +Y  G+    + LC+P   +H
Sbjct:   354 YINDSVELSR----DENKIAAWLAQKGPISVAINAFGMQFYRHGIAHPFRPLCSPWFIDH 409

Query:   619 ALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
             A+++VGYG       ++IPYW +KNSWG DWGE+
Sbjct:   410 AVLLVGYGNR-----SNIPYWAIKNSWGRDWGEE 438

 Score = 126 (49.4 bits), Expect = 1.7e-09, Sum P(2) = 1.7e-09
 Identities = 24/54 (44%), Positives = 32/54 (59%)

Query:   146 DDL-PEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
             +DL P  +DWR +G +++VK+QG C  CWAFS  G VE    +    L  LS Q
Sbjct:   246 NDLAPPEWDWRKKGAVTEVKDQGMCGSCWAFSVTGNVEGQWFLNRGTLLSLSEQ 299

 Score = 92 (37.4 bits), Expect = 2.2e-29, Sum P(2) = 2.2e-29
 Identities = 20/63 (31%), Positives = 34/63 (53%)

Query:    43 TRFLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAEDYQREDSGTAVFEVNKFFDLSDSDL 102
             T F +FM  +++ Y S E+   R   F  N+ +A+  Q  D GTA + + KF DL++ + 
Sbjct:   163 TLFKDFMTTYNRTYESREEAQWRLTVFARNMIRAQKIQALDRGTAQYGITKFSDLTEEEF 222

Query:   103 QQL 105
               +
Sbjct:   223 HTI 225

 Score = 88 (36.0 bits), Expect = 5.9e-29, Sum P(2) = 5.9e-29
 Identities = 18/54 (33%), Positives = 31/54 (57%)

Query:   200 HDKVYSSVEDLLRRHENFVTNVEKAEDYQSEDSGTAVFGVNKFFDLSESDLQQL 253
             +++ Y S E+   R   F  N+ +A+  Q+ D GTA +G+ KF DL+E +   +
Sbjct:   172 YNRTYESREEAQWRLTVFARNMIRAQKIQALDRGTAQYGITKFSDLTEEEFHTI 225


>UNIPROTKB|G3R9A7 [details] [associations]
            symbol:CTSH "Uncharacterized protein" species:9595 "Gorilla
            gorilla gorilla" [GO:0001656 "metanephros development"
            evidence=ISS] [GO:0001913 "T cell mediated cytotoxicity"
            evidence=ISS] [GO:0002764 "immune response-regulating signaling
            pathway" evidence=ISS] [GO:0004175 "endopeptidase activity"
            evidence=ISS] [GO:0004177 "aminopeptidase activity" evidence=ISS]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0004252 "serine-type endopeptidase activity" evidence=ISS]
            [GO:0005615 "extracellular space" evidence=ISS] [GO:0005764
            "lysosome" evidence=ISS] [GO:0005829 "cytosol" evidence=ISS]
            [GO:0006508 "proteolysis" evidence=ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=ISS] [GO:0008284 "positive regulation
            of cell proliferation" evidence=ISS] [GO:0010628 "positive
            regulation of gene expression" evidence=ISS] [GO:0010634 "positive
            regulation of epithelial cell migration" evidence=ISS] [GO:0010813
            "neuropeptide catabolic process" evidence=ISS] [GO:0010815
            "bradykinin catabolic process" evidence=ISS] [GO:0010952 "positive
            regulation of peptidase activity" evidence=ISS] [GO:0016505
            "apoptotic protease activator activity" evidence=ISS] [GO:0030108
            "HLA-A specific activating MHC class I receptor activity"
            evidence=ISS] [GO:0030335 "positive regulation of cell migration"
            evidence=ISS] [GO:0031638 "zymogen activation" evidence=ISS]
            [GO:0031648 "protein destabilization" evidence=ISS] [GO:0032526
            "response to retinoic acid" evidence=ISS] [GO:0033619 "membrane
            protein proteolysis" evidence=ISS] [GO:0043066 "negative regulation
            of apoptotic process" evidence=ISS] [GO:0043129 "surfactant
            homeostasis" evidence=ISS] [GO:0045766 "positive regulation of
            angiogenesis" evidence=ISS] [GO:0060448 "dichotomous subdivision of
            terminal units involved in lung branching" evidence=ISS]
            [GO:0070324 "thyroid hormone binding" evidence=ISS] [GO:0070371
            "ERK1 and ERK2 cascade" evidence=ISS] [GO:0097208 "alveolar
            lamellar body" evidence=ISS] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005829 GO:GO:0043066 GO:GO:0005615 GO:GO:0008284
            GO:GO:0070371 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766
            GO:GO:0004177 GO:GO:0004252 GO:GO:0005764 GO:GO:0097208
            GO:GO:0032526 GO:GO:0010628 GO:GO:0070324 GO:GO:0016505
            GO:GO:0001656 GO:GO:0010634 GO:GO:0004197 GO:GO:0031648
            GO:GO:0031638 GO:GO:0001913 OMA:STSCHKT GO:GO:0030108 GO:GO:0010815
            GO:GO:0060448 GO:GO:0002764 GO:GO:0033619 GO:GO:0010813
            GO:GO:0043129 RefSeq:XP_004056662.1 Ensembl:ENSGGOT00000012331
            GeneID:101144312 Uniprot:G3R9A7
        Length = 335

 Score = 281 (104.0 bits), Expect = 1.2e-27, Sum P(2) = 1.2e-27
 Identities = 72/208 (34%), Positives = 95/208 (45%)

Query:   284 AFQFNSLRHGDDLPEAFDWRAEG-VISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTE 342
             A + N LR     P + DWR +G  +S VK QG C  CW FS  G +E+  AI    +  
Sbjct:   104 ATKSNYLRGTGPYPPSVDWRKKGNFVSPVKNQGACGSCWTFSTTGALESAIAIATGKMLS 163

Query:   343 LSVQQLVDC--DMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXX 400
             L+ QQLVDC  D +N GC GG    A +YI+ N G++ +  YPY+  +   G        
Sbjct:   164 LAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYNKGIMGEDTYPYQGKD---GYCKFQPGK 220

Query:   401 XXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLFY-YSGGVID----------LN 449
                     + I   +EE M + VA   P+S        F  Y  G+            +N
Sbjct:   221 AIGFVKDVANITIYDEEAMVEAVALYNPVSFAFEVTQDFMMYRTGIYSSTSCHKTPDKVN 280

Query:   450 QRL----YG--TSIPYWIVKNSWGSDWG 471
               +    YG    IPYWIVKNSWG  WG
Sbjct:   281 HAVLAVGYGEKNGIPYWIVKNSWGPKWG 308

 Score = 235 (87.8 bits), Expect = 2.3e-29, Sum P(3) = 2.3e-29
 Identities = 57/161 (35%), Positives = 77/161 (47%)

Query:   493 VLPSKLSRLATEKLVDC--DMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERG 550
             +   K+  LA ++LVDC  D +N GC GG    A +YI+ N G++ +  YPY+  +   G
Sbjct:   156 IATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYNKGIMGEDTYPYQGKD---G 212

Query:   551 CLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLFY-YSGGVIDLNQR 609
                             + I   +EE M + VA   P+S        F  Y  G+      
Sbjct:   213 YCKFQPGKAIGFVKDVANITIYDEEAMVEAVALYNPVSFAFEVTQDFMMYRTGIYSSTSC 272

Query:   610 LCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWG 650
                P   NHA++ VGYGE   K+G  IPYWIVKNSWG  WG
Sbjct:   273 HKTPDKVNHAVLAVGYGE---KNG--IPYWIVKNSWGPKWG 308

 Score = 117 (46.2 bits), Expect = 2.3e-29, Sum P(3) = 2.3e-29
 Identities = 25/64 (39%), Positives = 33/64 (51%)

Query:   136 AFQFNSLRHGDDLPEAFDWRAEG-VISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTE 194
             A + N LR     P + DWR +G  +S VK QG C  CW FS  G +E+  AI    +  
Sbjct:   104 ATKSNYLRGTGPYPPSVDWRKKGNFVSPVKNQGACGSCWTFSTTGALESAIAIATGKMLS 163

Query:   195 LSVQ 198
             L+ Q
Sbjct:   164 LAEQ 167

 Score = 60 (26.2 bits), Expect = 2.3e-29, Sum P(3) = 2.3e-29
 Identities = 16/59 (27%), Positives = 31/59 (52%)

Query:    45 FLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAEDYQREDSGTAVFEVNKFFDLSDSDLQ 103
             F ++M  H K YS+ E+   R + F +N  K   +   +  T    +N+F D+S ++++
Sbjct:    35 FRSWMSKHRKTYST-EEYHHRLQTFASNWRKINAHNNGNH-TFKMALNQFSDMSFAEIK 91


>RGD|1562210 [details] [associations]
            symbol:MGC114246 "similar to cathepsin R" species:10116 "Rattus
            norvegicus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            RGD:1562210 eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            HOVERGEN:HBG011513 EMBL:CH474032 MEROPS:C01.042 KO:K09601
            OMA:ASHESFK OrthoDB:EOG4ZCT6D EMBL:BC091563 IPI:IPI00555186
            RefSeq:NP_001017509.1 UniGene:Rn.198321 SMR:Q5BJA0
            Ensembl:ENSRNOT00000061470 GeneID:498688 KEGG:rno:498688
            UCSC:RGD:1562210 InParanoid:Q5BJA0 NextBio:700535
            Genevestigator:Q5BJA0 Uniprot:Q5BJA0
        Length = 334

 Score = 262 (97.3 bits), Expect = 2.5e-24, Sum P(2) = 2.5e-24
 Identities = 65/191 (34%), Positives = 93/191 (48%)

Query:   259 DSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVKEQGKCA 318
             D+T E+ +  +   F   QT  E ++    +   G   P+  DWR +G ++ V+ QG C 
Sbjct:    82 DTTGEEFR-KMMVEFPV-QTHREGKSIMKRAA--GSIFPKFVDWRKKGYVTPVRRQGNCN 137

Query:   319 CCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCD--MSNGGCNGGRMDDALQYIIDNGGV 376
              CWAFS  G +EA    Q   L  LSVQ LVDC     N GC GG   +A QY++ NGG+
Sbjct:   138 ACWAFSVTGAIEAQTIWQSGKLIPLSVQNLVDCSKPQGNNGCLGGDTYNAFQYVLHNGGL 197

Query:   377 VSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNAN 436
              S+  YPY   E + G               +  +P  E+  M   VAT GP+S G++A+
Sbjct:   198 QSEATYPY---EGKDGPCRYNPKNSSAEITGFVSLPESEDILMVA-VATIGPISAGIDAS 253

Query:   437 --GLFYYSGGV 445
                  +Y  G+
Sbjct:   254 HESFKFYKKGI 264

 Score = 231 (86.4 bits), Expect = 3.6e-29, Sum P(3) = 3.6e-29
 Identities = 53/158 (33%), Positives = 80/158 (50%)

Query:   497 KLSRLATEKLVDCD--MSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXX 554
             KL  L+ + LVDC     N GC GG   +A QY++ NGG+ S+  YPY   E + G    
Sbjct:   158 KLIPLSVQNLVDCSKPQGNNGCLGGDTYNAFQYVLHNGGLQSEATYPY---EGKDGPCRY 214

Query:   555 XXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNAN--GLFYYSGGVIDLNQRLCN 612
                        +  +P  E+  M   VAT GP+S G++A+     +Y  G+   ++  C+
Sbjct:   215 NPKNSSAEITGFVSLPESEDILMVA-VATIGPISAGIDASHESFKFYKKGIY--HEPNCS 271

Query:   613 PKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWG 650
               +  H +++VGYG +    G    YW++KNSWG  WG
Sbjct:   272 SNSVTHGVLVVGYGFKGNDTGGD-HYWLIKNSWGKQWG 308

 Score = 134 (52.2 bits), Expect = 3.6e-29, Sum P(3) = 3.6e-29
 Identities = 31/89 (34%), Positives = 45/89 (50%)

Query:   111 DSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVKEQGKCA 170
             D+T E+ +  +   F   QT  E ++    +   G   P+  DWR +G ++ V+ QG C 
Sbjct:    82 DTTGEEFR-KMMVEFPV-QTHREGKSIMKRAA--GSIFPKFVDWRKKGYVTPVRRQGNCN 137

Query:   171 CCWAFSAVGVVEAMHAIQGNNLTELSVQH 199
              CWAFS  G +EA    Q   L  LSVQ+
Sbjct:   138 ACWAFSVTGAIEAQTIWQSGKLIPLSVQN 166

 Score = 96 (38.9 bits), Expect = 2.6e-12, Sum P(3) = 2.6e-12
 Identities = 37/146 (25%), Positives = 56/146 (38%)

Query:   355 NGGCNGGRMDDALQYIIDNGGVVSDQAYPY-----------KASESE-RG--CLXXXXXX 400
             N GC GG   +A QY++ NGG+ S+  YPY           K S +E  G   L      
Sbjct:   176 NNGCLGGDTYNAFQYVLHNGGLQSEATYPYEGKDGPCRYNPKNSSAEITGFVSLPESEDI 235

Query:   401 XXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLFYYSGGVIDLNQRLYGTSIP-- 458
                       I  G +   + +   +  +    N +     + GV+ +     G      
Sbjct:   236 LMVAVATIGPISAGIDASHESFKFYKKGIYHEPNCSSNSV-THGVLVVGYGFKGNDTGGD 294

Query:   459 -YWIVKNSWGSDWGEKVEDKVGSSGN 483
              YW++KNSWG  WG +   K+    N
Sbjct:   295 HYWLIKNSWGKQWGIRGYMKITKDKN 320

 Score = 48 (22.0 bits), Expect = 3.6e-29, Sum P(3) = 3.6e-29
 Identities = 16/60 (26%), Positives = 31/60 (51%)

Query:    50 RDHDKVYSSVEDLLRR---HENF-VTNVEKAEDYQREDSGTAVFEVNKFFDLSDSDLQQL 105
             + +DK YS  E+ LRR    EN  +  +   E+   ++  T   E+N+F D +  + +++
Sbjct:    34 KKYDKSYSLEEEELRRAVWEENLKMIKLHNGENGLGKNGFT--MEINEFGDTTGEEFRKM 91

 Score = 41 (19.5 bits), Expect = 1.1e-17, Sum P(2) = 1.1e-17
 Identities = 15/58 (25%), Positives = 29/58 (50%)

Query:   200 HDKVYSSVEDLLRR---HENF-VTNVEKAEDYQSEDSGTAVFGVNKFFDLSESDLQQL 253
             +DK YS  E+ LRR    EN  +  +   E+   ++  T    +N+F D +  + +++
Sbjct:    36 YDKSYSLEEEELRRAVWEENLKMIKLHNGENGLGKNGFT--MEINEFGDTTGEEFRKM 91


>UNIPROTKB|Q24940 [details] [associations]
            symbol:Cat-1 "Cathepsin L-like proteinase" species:6192
            "Fasciola hepatica" [GO:0004175 "endopeptidase activity"
            evidence=IDA] [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=IDA] [GO:0005576 "extracellular region" evidence=IDA]
            [GO:0006508 "proteolysis" evidence=IDA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0005576 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0004197 EMBL:L33771 PIR:S43991 PDB:2O6X
            PDBsum:2O6X ProteinModelPortal:Q24940 SMR:Q24940 MEROPS:C01.033
            EvolutionaryTrace:Q24940 Uniprot:Q24940
        Length = 326

 Score = 299 (110.3 bits), Expect = 1.0e-25, P = 1.0e-25
 Identities = 76/238 (31%), Positives = 111/238 (46%)

Query:   255 GLN--LDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVK 312
             GLN   D T E+ +       S   +D       + +      +P+  DWR  G +++VK
Sbjct:    68 GLNQFTDMTFEEFKAKYLTEMS-RASDILSHGVPYEANNRA--VPDKIDWRESGYVTEVK 124

Query:   313 EQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCD--MSNGGCNGGRMDDALQYI 370
             +QG C  CWAFS  G +E  +     +    S QQLVDC     N GC+GG M++A QY+
Sbjct:   125 DQGNCGSCWAFSTTGTMEGQYMKNERTSISFSEQQLVDCSGPWGNNGCSGGLMENAYQYL 184

Query:   371 IDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLS 430
                 G+ ++ +YPY A E +  C              Y+ +  G E E+K  V  R P +
Sbjct:   185 -KQFGLETESSYPYTAVEGQ--CRYNKQLGVAKVTGYYT-VHSGSEVELKNLVGARRPAA 240

Query:   431 VGMNANGLFY-YSGGV--------IDLNQRL----YGTS--IPYWIVKNSWGSDWGEK 473
             V ++    F  Y  G+        + +N  +    YGT     YWIVKNSWG+ WGE+
Sbjct:   241 VAVDVESDFMMYRSGIYQSQTCSPLRVNHAVLAVGYGTQGGTDYWIVKNSWGTYWGER 298

 Score = 235 (87.8 bits), Expect = 6.3e-29, Sum P(2) = 6.3e-29
 Identities = 55/152 (36%), Positives = 80/152 (52%)

Query:   504 EKLVDCD--MSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXX 561
             ++LVDC     N GC+GG M++A QY+    G+ ++ +YPY A E +  C          
Sbjct:   158 QQLVDCSGPWGNNGCSGGLMENAYQYL-KQFGLETESSYPYTAVEGQ--CRYNKQLGVAK 214

Query:   562 XXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLFY-YSGGVIDLNQRLCNPKAQNHAL 620
                 Y+ +  G E E+K  V  R P +V ++    F  Y  G+     + C+P   NHA+
Sbjct:   215 VTGYYT-VHSGSEVELKNLVGARRPAAVAVDVESDFMMYRSGIYQ--SQTCSPLRVNHAV 271

Query:   621 IIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
             + VGYG +    GT   YWIVKNSWG+ WGE+
Sbjct:   272 LAVGYGTQ---GGTD--YWIVKNSWGTYWGER 298

 Score = 136 (52.9 bits), Expect = 6.3e-29, Sum P(2) = 6.3e-29
 Identities = 44/171 (25%), Positives = 72/171 (42%)

Query:    34 TRGYLNSPVTRFLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAEDYQ-REDSGTAVFEVN 92
             T G L S    +  + R ++K Y+  +D  RR+  +  NV+  +++  R D G   + + 
Sbjct:    10 TVGVLGSNDDLWHQWKRMYNKEYNGADDQHRRNI-WEKNVKHIQEHNLRHDLGLVTYTLG 68

Query:    93 KFFDLSDSDLQQLTGLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAF 152
                      L Q T    D T E+ +       S   +D       + +      +P+  
Sbjct:    69 ---------LNQFT----DMTFEEFKAKYLTEMS-RASDILSHGVPYEANNRA--VPDKI 112

Query:   153 DWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQHHDKV 203
             DWR  G +++VK+QG C  CWAFS  G +E  +    N  T +S      V
Sbjct:   113 DWRESGYVTEVKDQGNCGSCWAFSTTGTMEGQY--MKNERTSISFSEQQLV 161


>DICTYBASE|DDB_G0272815 [details] [associations]
            symbol:cprE "cysteine proteinase 5" species:44689
            "Dictyostelium discoideum" [GO:0005615 "extracellular space"
            evidence=IDA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0005764 "lysosome" evidence=IEA] [GO:0044351
            "macropinocytosis" evidence=RCA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 dictyBase:DDB_G0272815 GO:GO:0005615
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GenomeReviews:CM000151_GR GO:GO:0005764
            EMBL:AAFI02000008 MEROPS:I29.003 KO:K01376 EMBL:L36205
            RefSeq:XP_644977.1 ProteinModelPortal:P54640 SMR:P54640
            PRIDE:P54640 EnsemblProtists:DDB0185092 GeneID:8618654
            KEGG:ddi:DDB_G0272815 OMA:METAFEF ProtClustDB:CLSZ2430780
            Uniprot:P54640
        Length = 344

 Score = 275 (101.9 bits), Expect = 7.1e-29, Sum P(2) = 7.1e-29
 Identities = 61/186 (32%), Positives = 90/186 (48%)

Query:   301 DWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSNGGCNG 360
             DWR+EG ++ VK QG+C  CW+FS  G  E  H      L  LS Q L+DC   N GC+G
Sbjct:   117 DWRSEGAVTPVKNQGQCGGCWSFSTTGSTEGAHFQSKGELVSLSEQNLIDCSTENSGCDG 176

Query:   361 GRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYGEEEEMK 420
             G M  A +YII+N G+ ++ +YPYKA   E G               Y  +  G E  ++
Sbjct:   177 GLMTYAFEYIINNNGIDTESSYPYKA---ENGKCEYKSENSGATLSSYKTVTAGSESSLE 233

Query:   421 KWVATRGPLSVGMNANGLFY--YSGGVIDLNQRLYGTSIPYWIVKNSWGSDWGEKVEDKV 478
               V    P+SV ++A+   +  Y+ G+          ++ + ++   +GS  G       
Sbjct:   234 SAVNVN-PVSVAIDASHQSFQLYTSGIY-YEPECSSENLDHGVLAVGYGSGSGSSSGQSS 291

Query:   479 G-SSGN 483
             G SSGN
Sbjct:   292 GQSSGN 297

 Score = 216 (81.1 bits), Expect = 5.3e-28, Sum P(3) = 5.3e-28
 Identities = 53/170 (31%), Positives = 83/170 (48%)

Query:   497 KLSRLATEKLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXX 556
             +L  L+ + L+DC   N GC+GG M  A +YII+N G+ ++ +YPYKA   E G      
Sbjct:   155 ELVSLSEQNLIDCSTENSGCDGGLMTYAFEYIINNNGIDTESSYPYKA---ENGKCEYKS 211

Query:   557 XXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLFY--YSGGVIDLNQRLCNPK 614
                      Y  +  G E  ++  V    P+SV ++A+   +  Y+ G+    +  C+ +
Sbjct:   212 ENSGATLSSYKTVTAGSESSLESAVNVN-PVSVAIDASHQSFQLYTSGIY--YEPECSSE 268

Query:   615 AQNHALIIVGYGEEE-----KKDG---------TSIPYWIVKNSWGSDWG 650
               +H ++ VGYG        +  G         +S  YWIVKNSWG+ WG
Sbjct:   269 NLDHGVLAVGYGSGSGSSSGQSSGQSSGNLSASSSNEYWIVKNSWGTSWG 318

 Score = 120 (47.3 bits), Expect = 5.3e-28, Sum P(3) = 5.3e-28
 Identities = 21/47 (44%), Positives = 27/47 (57%)

Query:   153 DWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQH 199
             DWR+EG ++ VK QG+C  CW+FS  G  E  H      L  LS Q+
Sbjct:   117 DWRSEGAVTPVKNQGQCGGCWSFSTTGSTEGAHFQSKGELVSLSEQN 163

 Score = 82 (33.9 bits), Expect = 5.4e-13, Sum P(4) = 5.4e-13
 Identities = 16/30 (53%), Positives = 20/30 (66%)

Query:   455 TSIPYWIVKNSWGSDWGEKVEDKVGSSGNR 484
             +S  YWIVKNSWG+ WG  +E  +  S NR
Sbjct:   302 SSNEYWIVKNSWGTSWG--IEGYILMSRNR 329

 Score = 78 (32.5 bits), Expect = 7.1e-29, Sum P(2) = 7.1e-29
 Identities = 26/103 (25%), Positives = 49/103 (47%)

Query:    12 KGLGYLHTFMIKVALLESNIFQTRGYLNSPVTRFLNFMRDHDKVYSSVEDLLRRHENFVT 71
             K L +L   ++ VA  +   F    Y N+    F ++M  H K Y+S E+   R+  F  
Sbjct:     2 KVLSFLCVLLVSVATAKQQ-FSELQYRNA----FTDWMITHQKSYTS-EEFGARYNIFKA 55

Query:    72 NVEKAEDYQREDSGTAVFEVNKFFDLSDSDLQQL-TGLNLDST 113
             N++  + +  + S T V  +N F D+++ + +    G   D++
Sbjct:    56 NMDYVQQWNSKGSET-VLGLNNFADITNEEYRNTYLGTKFDAS 97

 Score = 43 (20.2 bits), Expect = 5.4e-13, Sum P(4) = 5.4e-13
 Identities = 10/32 (31%), Positives = 16/32 (50%)

Query:   226 DYQSEDSGTAVFGVNKFFDLSESDLQQLTGLN 257
             +Y+SE+SG  +         SES L+    +N
Sbjct:   208 EYKSENSGATLSSYKTVTAGSESSLESAVNVN 239


>TAIR|locus:2128243 [details] [associations]
            symbol:AT4G11310 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] [GO:0005618 "cell wall"
            evidence=IDA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 GO:GO:0005618 EMBL:CP002687
            GenomeReviews:CT486007_GR EMBL:AL096882 EMBL:AL161531
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            HOGENOM:HOG000230773 KO:K01376 EMBL:AY093066 EMBL:BT000099
            IPI:IPI00520496 PIR:T13022 RefSeq:NP_567376.1 UniGene:At.43189
            ProteinModelPortal:Q9SUT0 SMR:Q9SUT0 IntAct:Q9SUT0 STRING:Q9SUT0
            MEROPS:C01.A20 PaxDb:Q9SUT0 PRIDE:Q9SUT0 EnsemblPlants:AT4G11310.1
            GeneID:826733 KEGG:ath:AT4G11310 TAIR:At4g11310 InParanoid:Q9SUT0
            OMA:EVCHGAD PhylomeDB:Q9SUT0 ProtClustDB:CLSN2689395
            Genevestigator:Q9SUT0 GermOnline:AT4G11310 Uniprot:Q9SUT0
        Length = 364

 Score = 328 (120.5 bits), Expect = 7.1e-29, P = 7.1e-29
 Identities = 94/289 (32%), Positives = 141/289 (48%)

Query:   200 HDKVYSSVEDLLRRHENFVTNVEKAEDYQSEDSGTAVFGVNKFFDLSESDLQQLT-GLNL 258
             H KVY SV +  RR   F  N+    +  +E+    + G+  F DLS  + +++  G   
Sbjct:    56 HGKVYGSVAEKERRLTIFEDNLRFINNRNAENLSYRL-GLTGFADLSLHEYKEVCHGA-- 112

Query:   259 DSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVKEQGKCA 318
                  D +P     F ++    +  A         D LP++ DWR EG +++VK+QG C 
Sbjct:   113 -----DPRPPRNHVFMTSSDRYKTSA--------DDVLPKSVDWRNEGAVTEVKDQGHCR 159

Query:   319 CCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSNGGCNGGRMDDALQYIIDNGGVVS 378
              CWAFS VG VE ++ I    L  LS Q L++C+  N GC GG+++ A ++I+ NGG+ +
Sbjct:   160 SCWAFSTVGAVEGLNKIVTGELVTLSEQDLINCNKENNGCGGGKLETAYEFIMKNGGLGT 219

Query:   379 DQAYPYKASESE-RGCLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANG 437
             D  YPYKA      G L             Y  +P  +E  + K VA + P++  ++++ 
Sbjct:   220 DNDYPYKAVNGVCDGRLKENNKNVMIDG--YENLPANDESALMKAVAHQ-PVTAVIDSSS 276

Query:   438 LFY--YSGGVID------LNQRL----YGTSI--PYWIVKNSWGSDWGE 472
               +  Y  GV D      LN  +    YGT     YW+VKNS G  WGE
Sbjct:   277 REFQLYESGVFDGSCGTNLNHGVVVVGYGTENGRDYWLVKNSRGITWGE 325

 Score = 221 (82.9 bits), Expect = 1.7e-27, Sum P(2) = 1.7e-27
 Identities = 53/165 (32%), Positives = 86/165 (52%)

Query:   490 LTGVLPSKLSRLATEKLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESE- 548
             L  ++  +L  L+ + L++C+  N GC GG+++ A ++I+ NGG+ +D  YPYKA     
Sbjct:   173 LNKIVTGELVTLSEQDLINCNKENNGCGGGKLETAYEFIMKNGGLGTDNDYPYKAVNGVC 232

Query:   549 RGCLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLFY--YSGGVIDL 606
              G L             Y  +P  +E  + K VA + P++  ++++   +  Y  GV D 
Sbjct:   233 DGRLKENNKNVMIDG--YENLPANDESALMKAVAHQ-PVTAVIDSSSREFQLYESGVFDG 289

Query:   607 NQRLCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGE 651
             +   C     NH +++VGYG E  +D     YW+VKNS G  WGE
Sbjct:   290 S---CGTNL-NHGVVVVGYGTENGRD-----YWLVKNSRGITWGE 325

 Score = 155 (59.6 bits), Expect = 1.7e-27, Sum P(2) = 1.7e-27
 Identities = 49/155 (31%), Positives = 73/155 (47%)

Query:    45 FLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAEDYQREDSGTAVFEVNKFFDLSDSDLQQ 104
             F ++M  H KVY SV +  RR   F  N+    +   E+    +  +  F DLS  + ++
Sbjct:    49 FESWMVKHGKVYGSVAEKERRLTIFEDNLRFINNRNAENLSYRL-GLTGFADLSLHEYKE 107

Query:   105 LT-GLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKV 163
             +  G        D +P     F ++    +  A         D LP++ DWR EG +++V
Sbjct:   108 VCHGA-------DPRPPRNHVFMTSSDRYKTSA--------DDVLPKSVDWRNEGAVTEV 152

Query:   164 KEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
             K+QG C  CWAFS VG VE ++ I    L  LS Q
Sbjct:   153 KDQGHCRSCWAFSTVGAVEGLNKIVTGELVTLSEQ 187


>RGD|69241 [details] [associations]
            symbol:Ctsj "cathepsin J" species:10116 "Rattus norvegicus"
           [GO:0005764 "lysosome" evidence=IEA] [GO:0006508 "proteolysis"
           evidence=IEA] [GO:0008234 "cysteine-type peptidase activity"
           evidence=IEA] [GO:0048471 "perinuclear region of cytoplasm"
           evidence=IDA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
           PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 EMBL:L14776
           RGD:69241 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
           InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246
           SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
           GO:GO:0005764 HOVERGEN:HBG011513 MEROPS:C01.038 CTD:26898 KO:K09599
           EMBL:AF310623 EMBL:BC097263 IPI:IPI00205027 PIR:I58002
           RefSeq:NP_058817.1 UniGene:Rn.34875 ProteinModelPortal:Q63088
           SMR:Q63088 PRIDE:Q63088 GeneID:29174 KEGG:rno:29174 NextBio:608244
           Genevestigator:Q63088 Uniprot:Q63088
        Length = 334

 Score = 256 (95.2 bits), Expect = 4.6e-21, P = 4.6e-21
 Identities = 60/157 (38%), Positives = 81/157 (51%)

Query:   296 LPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEA-MHAIQGNSLTELSVQQLVDCDMS 354
             LP   DWR EG ++ V+ QGKC  CWAF+AVG +E  M +  GN LT LSVQ L+DC  S
Sbjct:   114 LPNFKDWRKEGYVTPVRNQGKCGSCWAFAAVGAIEGQMFSKTGN-LTPLSVQNLLDCSKS 172

Query:   355 NG--GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIP 412
              G  GC  G    A  Y++ N G+ ++  YPY   E + G               +  +P
Sbjct:   173 EGNNGCRWGTAHQAFNYVLKNKGLEAEATYPY---EGKDGPCRYHSENASANITGFVNLP 229

Query:   413 YGEEEEMKKWVATR--GPLSVGMNAN--GLFYYSGGV 445
                  E+  WVA    GP+S  ++A+     +YSGGV
Sbjct:   230 ---PNELYLWVAVASIGPVSAAIDASHDSFRFYSGGV 263

 Score = 231 (86.4 bits), Expect = 7.5e-29, Sum P(2) = 7.5e-29
 Identities = 53/160 (33%), Positives = 82/160 (51%)

Query:   498 LSRLATEKLVDCDMSNG--GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXX 555
             L+ L+ + L+DC  S G  GC  G    A  Y++ N G+ ++  YPY   E + G     
Sbjct:   158 LTPLSVQNLLDCSKSEGNNGCRWGTAHQAFNYVLKNKGLEAEATYPY---EGKDGPCRYH 214

Query:   556 XXXXXXXXXXYSRIPYGEEEEMKKWVATR--GPLSVGMNAN--GLFYYSGGVIDLNQRLC 611
                       +  +P     E+  WVA    GP+S  ++A+     +YSGGV   ++  C
Sbjct:   215 SENASANITGFVNLP---PNELYLWVAVASIGPVSAAIDASHDSFRFYSGGVY--HEPNC 269

Query:   612 NPKAQNHALIIVGYG-EEEKKDGTSIPYWIVKNSWGSDWG 650
             +    NHA+++VGYG E  + DG +  YW++KNSWG +WG
Sbjct:   270 SSYVVNHAVLVVGYGFEGNETDGNN--YWLIKNSWGEEWG 307

 Score = 148 (57.2 bits), Expect = 7.5e-29, Sum P(2) = 7.5e-29
 Identities = 27/52 (51%), Positives = 34/52 (65%)

Query:   148 LPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQH 199
             LP   DWR EG ++ V+ QGKC  CWAF+AVG +E     +  NLT LSVQ+
Sbjct:   114 LPNFKDWRKEGYVTPVRNQGKCGSCWAFAAVGAIEGQMFSKTGNLTPLSVQN 165

 Score = 80 (33.2 bits), Expect = 3.1e-11, Sum P(2) = 3.1e-11
 Identities = 28/134 (20%), Positives = 48/134 (35%)

Query:   355 NGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASE------SERGCLXXXXXXXXXXXXXY 408
             N GC  G    A  Y++ N G+ ++  YPY+  +      SE                 Y
Sbjct:   175 NNGCRWGTAHQAFNYVLKNKGLEAEATYPYEGKDGPCRYHSENASANITGFVNLPPNELY 234

Query:   409 SRIPYGEEEEMKKWVATRGPLSVGMNANGLFY------Y--SGGVIDLNQRLYGTSIP-- 458
               +       +   +      S    + G+++      Y  +  V+ +     G      
Sbjct:   235 LWVAVASIGPVSAAIDASHD-SFRFYSGGVYHEPNCSSYVVNHAVLVVGYGFEGNETDGN 293

Query:   459 -YWIVKNSWGSDWG 471
              YW++KNSWG +WG
Sbjct:   294 NYWLIKNSWGEEWG 307


>MGI|MGI:2139628 [details] [associations]
            symbol:Ctso "cathepsin O" species:10090 "Mus musculus"
            [GO:0005575 "cellular_component" evidence=ND] [GO:0005764
            "lysosome" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0008233 "peptidase activity" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 MGI:MGI:2139628 eggNOG:COG4870
            HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0005764 GeneTree:ENSGT00560000076599 MEROPS:C01.035 CTD:1519
            HOVERGEN:HBG105050 KO:K01374 OMA:SNVCGIA OrthoDB:EOG4V6ZH1
            EMBL:AK034490 EMBL:AK049470 EMBL:AK165930 EMBL:AK166103
            EMBL:BC044664 IPI:IPI00453524 RefSeq:NP_808330.1 UniGene:Mm.254642
            ProteinModelPortal:Q8BM88 SMR:Q8BM88 STRING:Q8BM88
            PhosphoSite:Q8BM88 PRIDE:Q8BM88 Ensembl:ENSMUST00000029649
            GeneID:229445 KEGG:mmu:229445 UCSC:uc008pon.1 InParanoid:Q8BM88
            NextBio:379433 Bgee:Q8BM88 CleanEx:MM_CTSO Genevestigator:Q8BM88
            GermOnline:ENSMUSG00000028015 Uniprot:Q8BM88
        Length = 312

 Score = 310 (114.2 bits), Expect = 7.6e-29, Sum P(2) = 7.6e-29
 Identities = 74/203 (36%), Positives = 106/203 (52%)

Query:   296 LPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSN 355
             LP  FDWR + V++ V+ Q  C  CWAFS V  +E+  AIQG SL  LSVQQ++DC  +N
Sbjct:    99 LPLRFDWRDKHVVNPVRNQEMCGGCWAFSVVSAIESARAIQGKSLDYLSVQQVIDCSFNN 158

Query:   356 GGCNGGRMDDALQYIIDNG-GVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPY- 413
              GC GG    AL+++ +    +V+D  YP+KA   +  C              +S   + 
Sbjct:   159 SGCLGGSPLCALRWLNETQLKLVADSQYPFKAVNGQ--CRHFPQSQAGVSVKDFSAYNFR 216

Query:   414 GEEEEMKKWVATRGPLSVGMNANGLFYYSGGVI-------DLNQRLYGTSI------PYW 460
             G+E+EM + + + GPL V ++A     Y GG+I       + N  +  T        PYW
Sbjct:   217 GQEDEMARALLSFGPLVVIVDAMSWQDYLGGIIQHHCSSGEANHAVLITGFDRTGNTPYW 276

Query:   461 IVKNSWGSDWGEKVEDKVGSSGN 483
             +V+NSWGS WG +    V   GN
Sbjct:   277 MVRNSWGSSWGVEGYAHVKMGGN 299

 Score = 215 (80.7 bits), Expect = 5.7e-26, Sum P(2) = 5.7e-26
 Identities = 51/155 (32%), Positives = 83/155 (53%)

Query:   498 LSRLATEKLVDCDMSNGGCNGGRMDDALQYIIDNG-GVVSDQAYPYKASESERGCLXXXX 556
             L  L+ ++++DC  +N GC GG    AL+++ +    +V+D  YP+KA   +  C     
Sbjct:   143 LDYLSVQQVIDCSFNNSGCLGGSPLCALRWLNETQLKLVADSQYPFKAVNGQ--CRHFPQ 200

Query:   557 XXXXXXXXXYSRIPY-GEEEEMKKWVATRGPLSVGMNANGLFYYSGGVIDLNQRLCNPKA 615
                      +S   + G+E+EM + + + GPL V ++A     Y GG+I   Q  C+   
Sbjct:   201 SQAGVSVKDFSAYNFRGQEDEMARALLSFGPLVVIVDAMSWQDYLGGII---QHHCSSGE 257

Query:   616 QNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWG 650
              NHA++I G+     + G + PYW+V+NSWGS WG
Sbjct:   258 ANHAVLITGFD----RTGNT-PYWMVRNSWGSSWG 287

 Score = 137 (53.3 bits), Expect = 5.7e-26, Sum P(2) = 5.7e-26
 Identities = 26/51 (50%), Positives = 33/51 (64%)

Query:   148 LPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
             LP  FDWR + V++ V+ Q  C  CWAFS V  +E+  AIQG +L  LSVQ
Sbjct:    99 LPLRFDWRDKHVVNPVRNQEMCGGCWAFSVVSAIESARAIQGKSLDYLSVQ 149

 Score = 41 (19.5 bits), Expect = 7.6e-29, Sum P(2) = 7.6e-29
 Identities = 14/54 (25%), Positives = 25/54 (46%)

Query:   200 HDKVYSSVEDLLRRHENFVTNVEKAEDYQSEDSGTAVFGVNKFFDLSESDLQQL 253
             H +  +++ + L RH            +  E+S TA +GVN+F  L   + + L
Sbjct:    29 HQREAAALRESLHRHRYL-------NSFPHENS-TAFYGVNQFSYLFPEEFKAL 74


>UNIPROTKB|F1P3U9 [details] [associations]
            symbol:CTSH "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0001913 "T cell mediated cytotoxicity" evidence=IEA]
            [GO:0002764 "immune response-regulating signaling pathway"
            evidence=IEA] [GO:0004177 "aminopeptidase activity" evidence=IEA]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=IEA]
            [GO:0004252 "serine-type endopeptidase activity" evidence=IEA]
            [GO:0005615 "extracellular space" evidence=IEA] [GO:0005764
            "lysosome" evidence=IEA] [GO:0005829 "cytosol" evidence=IEA]
            [GO:0008284 "positive regulation of cell proliferation"
            evidence=IEA] [GO:0010628 "positive regulation of gene expression"
            evidence=IEA] [GO:0010634 "positive regulation of epithelial cell
            migration" evidence=IEA] [GO:0010813 "neuropeptide catabolic
            process" evidence=IEA] [GO:0010815 "bradykinin catabolic process"
            evidence=IEA] [GO:0016505 "apoptotic protease activator activity"
            evidence=IEA] [GO:0030108 "HLA-A specific activating MHC class I
            receptor activity" evidence=IEA] [GO:0031638 "zymogen activation"
            evidence=IEA] [GO:0031648 "protein destabilization" evidence=IEA]
            [GO:0032526 "response to retinoic acid" evidence=IEA] [GO:0033619
            "membrane protein proteolysis" evidence=IEA] [GO:0043066 "negative
            regulation of apoptotic process" evidence=IEA] [GO:0043129
            "surfactant homeostasis" evidence=IEA] [GO:0045766 "positive
            regulation of angiogenesis" evidence=IEA] [GO:0060448 "dichotomous
            subdivision of terminal units involved in lung branching"
            evidence=IEA] [GO:0070324 "thyroid hormone binding" evidence=IEA]
            [GO:0070371 "ERK1 and ERK2 cascade" evidence=IEA] [GO:0097067
            "cellular response to thyroid hormone stimulus" evidence=IEA]
            [GO:0097208 "alveolar lamellar body" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0005829 GO:GO:0043066
            GO:GO:0005615 GO:GO:0008284 GO:GO:0070371 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766 GO:GO:0004177
            GO:GO:0004252 GO:GO:0005764 GO:GO:0032526 GO:GO:0010628
            GO:GO:0070324 GO:GO:0016505 GO:GO:0010634 GO:GO:0004197
            GO:GO:0042599 GO:GO:0031648 GO:GO:0097067 GO:GO:0031638
            GO:GO:0001913 GeneTree:ENSGT00660000095458 OMA:STSCHKT
            GO:GO:0030108 GO:GO:0010815 GO:GO:0002764 GO:GO:0033619
            GO:GO:0010813 EMBL:AADN02038832 EMBL:AADN02038831 IPI:IPI00594147
            Ensembl:ENSGALT00000013440 Uniprot:F1P3U9
        Length = 261

 Score = 267 (99.0 bits), Expect = 3.0e-22, P = 3.0e-22
 Identities = 71/204 (34%), Positives = 92/204 (45%)

Query:   288 NSLRHGDDLPEAFDWRAEG-VISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQ 346
             N LR     PEA DWR +G  ++ VK QG C  CW FS  G +E+  AI    L  L+ Q
Sbjct:    34 NFLRSDGPCPEAVDWRKKGNFVTPVKNQGPCGSCWTFSTTGCLESAIAIATGKLLSLAEQ 93

Query:   347 QLVDCDMS--NGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXX 404
              LVDC  +  N GC+GG    A +YI+ N G++ + AYPY+A   + G            
Sbjct:    94 LLVDCAQAFNNHGCSGGLPSQAFEYILYNKGLMGEDAYPYRA---QNGTCKFQPDKAIAF 150

Query:   405 XXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLF-YYSGGVID----------LNQRL- 452
                   I   +E  M + V    P+S        F +Y  GV            +N  + 
Sbjct:   151 VKDVINITQYDEAGMVEAVGKHNPVSFAFEVTSDFMHYRKGVYSNPRCEHTPDKVNHAVL 210

Query:   453 ---YGTSI--PYWIVKNSWGSDWG 471
                YG     PYWIVKNSWG  WG
Sbjct:   211 AVGYGEEDGRPYWIVKNSWGPLWG 234

 Score = 228 (85.3 bits), Expect = 1.1e-28, Sum P(2) = 1.1e-28
 Identities = 61/163 (37%), Positives = 78/163 (47%)

Query:   493 VLPSKLSRLATEKLVDCDMS--NGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERG 550
             +   KL  LA + LVDC  +  N GC+GG    A +YI+ N G++ + AYPY+A   + G
Sbjct:    82 IATGKLLSLAEQLLVDCAQAFNNHGCSGGLPSQAFEYILYNKGLMGEDAYPYRA---QNG 138

Query:   551 CLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLF-YYSGGVIDLNQR 609
                               I   +E  M + V    P+S        F +Y  GV   N R
Sbjct:   139 TCKFQPDKAIAFVKDVINITQYDEAGMVEAVGKHNPVSFAFEVTSDFMHYRKGVYS-NPR 197

Query:   610 LCN--PKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWG 650
              C   P   NHA++ VGYGEE   DG   PYWIVKNSWG  WG
Sbjct:   198 -CEHTPDKVNHAVLAVGYGEE---DGR--PYWIVKNSWGPLWG 234

 Score = 125 (49.1 bits), Expect = 1.1e-28, Sum P(2) = 1.1e-28
 Identities = 26/60 (43%), Positives = 32/60 (53%)

Query:   140 NSLRHGDDLPEAFDWRAEG-VISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
             N LR     PEA DWR +G  ++ VK QG C  CW FS  G +E+  AI    L  L+ Q
Sbjct:    34 NFLRSDGPCPEAVDWRKKGNFVTPVKNQGPCGSCWTFSTTGCLESAIAIATGKLLSLAEQ 93


>FB|FBgn0034229 [details] [associations]
            symbol:CG4847 species:7227 "Drosophila melanogaster"
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0032504
            "multicellular organism reproduction" evidence=IEP] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=ISS] [GO:0005615 "extracellular space"
            evidence=ISM;IDA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            EMBL:AE013599 GO:GO:0005615 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 GO:GO:0032504 GeneTree:ENSGT00560000076599
            KO:K01371 EMBL:BT099507 RefSeq:NP_725686.1 UniGene:Dm.4677
            SMR:A1ZAU4 IntAct:A1ZAU4 MEROPS:C01.A28 EnsemblMetazoa:FBtr0086935
            GeneID:36973 KEGG:dme:Dmel_CG4847 UCSC:CG4847-RB
            FlyBase:FBgn0034229 InParanoid:A1ZAU4 OMA:GGFQEYA OrthoDB:EOG4J9KFC
            ChiTaRS:CG4847 GenomeRNAi:36973 NextBio:801302 Uniprot:A1ZAU4
        Length = 420

 Score = 255 (94.8 bits), Expect = 4.4e-19, P = 4.4e-19
 Identities = 85/281 (30%), Positives = 126/281 (44%)

Query:   202 KVYSSVEDLLRRHENFVTNVEKAEDYQSEDS-GTAVF--GVNKFFDLSESD-LQQLTGLN 257
             K Y S  D       F +     E   +  + G   F   VN F DL+ S+ L QLTGL 
Sbjct:   121 KTYLSAADRALHEGAFASTKNLVEAGNAAFAQGVHTFKQAVNAFADLTHSEFLSQLTGLK 180

Query:   258 LDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVKEQGKC 317
                      P  +A  +++     + A           +P+AFDWR  G ++ VK QG C
Sbjct:   181 RS-------PEAKARAAASLKLVNLPA---------KPIPDAFDWREHGGVTPVKFQGTC 224

Query:   318 ACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDC----DMSNGGCNGGRMDDALQYIID- 372
               CWAF+  G +E     +  SL  LS Q LVDC    D    GC+GG  + A  +I + 
Sbjct:   225 GSCWAFATTGAIEGHTFRKTGSLPNLSEQNLVDCGPVEDFGLNGCDGGFQEAAFCFIDEV 284

Query:   373 NGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVG 432
               GV  + AYPY      +G               ++ IP  +EE++KK VAT GP++  
Sbjct:   285 QKGVSQEGAYPYI---DNKGTCKYDGSKSGATLQGFAAIPPKDEEQLKKVVATLGPVACS 341

Query:   433 MNA-NGLFYYSGGVIDLNQRLYGTSIPYWIVKNSWGSDWGE 472
             +N    L  Y+GG+ + ++   G    + I+   +GS+ G+
Sbjct:   342 VNGLETLKNYAGGIYNDDECNKGEP-NHSILVVGYGSEKGQ 381

 Score = 248 (92.4 bits), Expect = 1.2e-28, Sum P(2) = 1.2e-28
 Identities = 62/168 (36%), Positives = 87/168 (51%)

Query:   491 TGVLPSKLSRLATEKLVDC----DMSNGGCNGGRMDDALQYIID-NGGVVSDQAYPYKAS 545
             TG LP+    L+ + LVDC    D    GC+GG  + A  +I +   GV  + AYPY   
Sbjct:   244 TGSLPN----LSEQNLVDCGPVEDFGLNGCDGGFQEAAFCFIDEVQKGVSQEGAYPYI-- 297

Query:   546 ESERGCLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNA-NGLFYYSGGVI 604
                +G               ++ IP  +EE++KK VAT GP++  +N    L  Y+GG+ 
Sbjct:   298 -DNKGTCKYDGSKSGATLQGFAAIPPKDEEQLKKVVATLGPVACSVNGLETLKNYAGGIY 356

Query:   605 DLNQRLCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
               N   CN    NH++++VGYG E+ +D     YWIVKNSW   WGEK
Sbjct:   357 --NDDECNKGEPNHSILVVGYGSEKGQD-----YWIVKNSWDDTWGEK 397

 Score = 156 (60.0 bits), Expect = 6.1e-18, Sum P(2) = 6.1e-18
 Identities = 45/138 (32%), Positives = 64/138 (46%)

Query:   352 DMSNGGCNGGRMDDALQYIID-NGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSR 410
             D    GC+GG  + A  +I +   GV  + AYPY      +G               ++ 
Sbjct:   263 DFGLNGCDGGFQEAAFCFIDEVQKGVSQEGAYPYI---DNKGTCKYDGSKSGATLQGFAA 319

Query:   411 IPYGEEEEMKKWVATRGPLSVGMNA-NGLFYYSGGVI--------DLNQRL----YGTSI 457
             IP  +EE++KK VAT GP++  +N    L  Y+GG+         + N  +    YG+  
Sbjct:   320 IPPKDEEQLKKVVATLGPVACSVNGLETLKNYAGGIYNDDECNKGEPNHSILVVGYGSEK 379

Query:   458 --PYWIVKNSWGSDWGEK 473
                YWIVKNSW   WGEK
Sbjct:   380 GQDYWIVKNSWDDTWGEK 397

 Score = 140 (54.3 bits), Expect = 1.2e-28, Sum P(2) = 1.2e-28
 Identities = 50/166 (30%), Positives = 71/166 (42%)

Query:    38 LNSPVTRFLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAEDYQREDS-GTAVFE--VNKF 94
             L S V  F +F+    K Y S  D       F +     E      + G   F+  VN F
Sbjct:   105 LLSNVQDFGDFLSQSGKTYLSAADRALHEGAFASTKNLVEAGNAAFAQGVHTFKQAVNAF 164

Query:    95 FDLSDSD-LQQLTGLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFD 153
              DL+ S+ L QLTGL          P  +A  +++     + A           +P+AFD
Sbjct:   165 ADLTHSEFLSQLTGLKRS-------PEAKARAAASLKLVNLPA---------KPIPDAFD 208

Query:   154 WRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQH 199
             WR  G ++ VK QG C  CWAF+  G +E     +  +L  LS Q+
Sbjct:   209 WREHGGVTPVKFQGTCGSCWAFATTGAIEGHTFRKTGSLPNLSEQN 254


>DICTYBASE|DDB_G0274385 [details] [associations]
            symbol:DDB_G0274385 "Cysteine proteinase 1,
            mitochondrial" species:44689 "Dictyostelium discoideum" [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0006508
            "proteolysis" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 SMART:SM00645 InterPro:IPR000169
            dictyBase:DDB_G0274385 eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00139 EMBL:AAFI02000012 RefSeq:XP_644301.1
            ProteinModelPortal:Q86KD4 EnsemblProtists:DDB0167535 GeneID:8619729
            KEGG:ddi:DDB_G0274385 InParanoid:Q86KD4 OMA:SICVDAS Uniprot:Q86KD4
        Length = 358

 Score = 324 (119.1 bits), Expect = 1.9e-28, P = 1.9e-28
 Identities = 87/291 (29%), Positives = 138/291 (47%)

Query:   200 HDKVYSSVEDLLRRHENFVTNVEKAEDYQSEDSGTAVFGVNKFFDLSESDLQQLTGLNLD 259
             H K+Y    ++  R  NF  N++K  +  S  +G A F  N F DLSE   ++ +  +L+
Sbjct:    51 HSKIYKDSIEMENRFSNFKENMKKNIELNSMHAGKAKFESNGFSDLSE---EEFSNFHLN 107

Query:   260 STLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAF--DWRAEGVISKVKEQGKC 317
                +     L+       T        +  + +GD L E +  DWR +G+++ VK+QG+C
Sbjct:   108 KAFKGKPSHLRNSIKPQPTPHHSLINGYKEMENGD-LNELYSIDWRKKGLVTPVKDQGQC 166

Query:   318 ACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSNGGCNGGRMDDALQYIIDNGGVV 377
               C+ FSAV  +E      GN    LS QQ VDCD  +G C GG      +Y    GGV 
Sbjct:   167 GSCYIFSAVEQIETAWIKAGNKPILLSEQQAVDCDPYDGQCGGGDPYTVYEYFSQVGGVS 226

Query:   378 SDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANG 437
             ++  YPY A++    C+              ++   G+E  + K +   GP+S+ ++A+ 
Sbjct:   227 TNAQYPYTATDGT--CVNMSRAVPVVSYHYVTQ--GGDENTLIKTIVNDGPVSICVDAST 282

Query:   438 LFYYSGGVI--------DLNQRLYGTSI-------P--YWIVKNSWGSDWG 471
                YSGG+I        D   ++ G  +       P  Y+I++NSWG+DWG
Sbjct:   283 WQSYSGGIITTGCGKNIDHCVQVVGLEVDKTDPSNPVQYYIIRNSWGTDWG 333

 Score = 196 (74.1 bits), Expect = 6.7e-27, Sum P(2) = 6.7e-27
 Identities = 47/151 (31%), Positives = 79/151 (52%)

Query:   501 LATEKLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXX 560
             L+ ++ VDCD  +G C GG      +Y    GGV ++  YPY A++    C+        
Sbjct:   192 LSEQQAVDCDPYDGQCGGGDPYTVYEYFSQVGGVSTNAQYPYTATDGT--CVNMSRAVPV 249

Query:   561 XXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLFYYSGGVIDLNQRLCNPKAQNHAL 620
                   ++   G+E  + K +   GP+S+ ++A+    YSGG+I      C  K  +H +
Sbjct:   250 VSYHYVTQ--GGDENTLIKTIVNDGPVSICVDASTWQSYSGGIITTG---CG-KNIDHCV 303

Query:   621 IIVGYGEEEKKDGTS-IPYWIVKNSWGSDWG 650
              +VG  E +K D ++ + Y+I++NSWG+DWG
Sbjct:   304 QVVGL-EVDKTDPSNPVQYYIIRNSWGTDWG 333

 Score = 178 (67.7 bits), Expect = 6.7e-27, Sum P(2) = 6.7e-27
 Identities = 45/156 (28%), Positives = 75/156 (48%)

Query:    45 FLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAEDYQREDSGTAVFEVNKFFDLSDSDLQQ 104
             F ++ + H K+Y    ++  R  NF  N++K  +     +G A FE N F DLS+   ++
Sbjct:    44 FNHWAKKHSKIYKDSIEMENRFSNFKENMKKNIELNSMHAGKAKFESNGFSDLSE---EE 100

Query:   105 LTGLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAF--DWRAEGVISK 162
              +  +L+   +     L+       T        +  + +GD L E +  DWR +G+++ 
Sbjct:   101 FSNFHLNKAFKGKPSHLRNSIKPQPTPHHSLINGYKEMENGD-LNELYSIDWRKKGLVTP 159

Query:   163 VKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
             VK+QG+C  C+ FSAV  +E      GN    LS Q
Sbjct:   160 VKDQGQCGSCYIFSAVEQIETAWIKAGNKPILLSEQ 195


>UNIPROTKB|Q9GLE3 [details] [associations]
            symbol:CTSK "Cathepsin K" species:9823 "Sus scrofa"
            [GO:0005764 "lysosome" evidence=IEA] [GO:0005576 "extracellular
            region" evidence=IEA] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 InterPro:IPR015644
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005576 eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 MEROPS:I29.007
            CTD:1513 KO:K01371 OrthoDB:EOG4SJ5FC PANTHER:PTHR12411:SF55
            OMA:LKVPPSH EMBL:AF292030 RefSeq:NP_999467.1 UniGene:Ssc.1020
            ProteinModelPortal:Q9GLE3 SMR:Q9GLE3 STRING:Q9GLE3
            Ensembl:ENSSSCT00000007283 GeneID:397569 KEGG:ssc:397569
            ArrayExpress:Q9GLE3 Uniprot:Q9GLE3
        Length = 330

 Score = 324 (119.1 bits), Expect = 1.9e-28, P = 1.9e-28
 Identities = 73/193 (37%), Positives = 102/193 (52%)

Query:   297 PEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSNG 356
             P++ D+R +G ++ VK QG+C  CWAFS+VG +E     +   L  LS Q LVDC   N 
Sbjct:   117 PDSIDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSEND 176

Query:   357 GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYGEE 416
             GC GG M +A QY+  N G+ S+ AYPY   +    C+             Y  IP G E
Sbjct:   177 GCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQDEN--CMYNPTGKAAKCRG-YREIPEGNE 233

Query:   417 EEMKKWVATRGPLSVGMNAN--GLFYYSGGVI--------DLNQRL----YGTSI--PYW 460
             + +K+ VA  GP+SV ++A+     +YS GV         +LN  +    YG      +W
Sbjct:   234 KALKRAVARVGPVSVAIDASLTSFQFYSKGVYYDENCNSDNLNHAVLAVGYGIQKGKKHW 293

Query:   461 IVKNSWGSDWGEK 473
             I+KNSWG +WG K
Sbjct:   294 IIKNSWGENWGNK 306

 Score = 278 (102.9 bits), Expect = 1.9e-23, P = 1.9e-23
 Identities = 84/245 (34%), Positives = 120/245 (48%)

Query:   416 EEEMKKWVATRGPLSVGMNANGLFY--YSGGVID-LNQRLYGTSIPYWIVKNSW--GSDW 470
             EE ++K    + P S   + + L+   + G   D ++ R  G   P   VKN    GS W
Sbjct:    85 EEVVQKMTGLKVPPSHSRSNDTLYIPDWEGRTPDSIDYRKKGYVTP---VKNQGQCGSCW 141

Query:   471 GEKVEDKVGSSGNRTRDLELTGVLPSKLSRLATEKLVDCDMSNGGCNGGRMDDALQYIID 530
                    VG+   + +  + TG    KL  L+ + LVDC   N GC GG M +A QY+  
Sbjct:   142 AFS---SVGALEGQLK--KKTG----KLLNLSPQNLVDCVSENDGCGGGYMTNAFQYVQK 192

Query:   531 NGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVG 590
             N G+ S+ AYPY   +    C+             Y  IP G E+ +K+ VA  GP+SV 
Sbjct:   193 NRGIDSEDAYPYVGQDEN--CMYNPTGKAAKCRG-YREIPEGNEKALKRAVARVGPVSVA 249

Query:   591 MNAN--GLFYYSGGVI-DLNQRLCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGS 647
             ++A+     +YS GV  D N   CN    NHA++ VGYG ++ K      +WI+KNSWG 
Sbjct:   250 IDASLTSFQFYSKGVYYDEN---CNSDNLNHAVLAVGYGIQKGKK-----HWIIKNSWGE 301

Query:   648 DWGEK 652
             +WG K
Sbjct:   302 NWGNK 306


>UNIPROTKB|Q90686 [details] [associations]
            symbol:CTSK "Cathepsin K" species:9031 "Gallus gallus"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015644 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 HOVERGEN:HBG011513
            CTD:1513 KO:K01371 PANTHER:PTHR12411:SF55 EMBL:U37691
            IPI:IPI00575213 RefSeq:NP_990302.1 UniGene:Gga.51509
            ProteinModelPortal:Q90686 SMR:Q90686 MEROPS:C01.036 GeneID:395818
            KEGG:gga:395818 NextBio:20815886 Uniprot:Q90686
        Length = 334

 Score = 323 (118.8 bits), Expect = 2.5e-28, P = 2.5e-28
 Identities = 72/193 (37%), Positives = 103/193 (53%)

Query:   297 PEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSNG 356
             P A DWR +G ++ VK+QG+C  CWAFS+VG +E     +   L  LS Q LV C  +N 
Sbjct:   121 PAAVDWRRKGYVTPVKDQGQCGSCWAFSSVGALEGQLKRRTGKLLSLSPQNLVYCVSNNN 180

Query:   357 GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYGEE 416
             GC GG M +A +Y+  N G+ S+ AYPY   +    C+             Y  IP   E
Sbjct:   181 GCGGGYMTNAFEYVRLNRGIDSEDAYPYIGQDES--CMYSPTGKAAKCRG-YREIPEDNE 237

Query:   417 EEMKKWVATRGPLSVGMNAN--GLFYYSGGVI--------DLNQRL----YGTS--IPYW 460
             + +K+ VA  GP+SVG++A+     +YS GV         ++N  +    YG      +W
Sbjct:   238 KALKRAVARIGPVSVGIDASLPSFQFYSRGVYYDTGCNPENINHAVLAVGYGAQKGTKHW 297

Query:   461 IVKNSWGSDWGEK 473
             I+KNSWG++WG K
Sbjct:   298 IIKNSWGTEWGNK 310

 Score = 273 (101.2 bits), Expect = 6.7e-23, P = 6.7e-23
 Identities = 77/217 (35%), Positives = 109/217 (50%)

Query:   449 NQRLYGTSIPYWIVKNSWGSDWGEK-----VEDK--VGSSGNRTRDLELTGVLP---SKL 498
             N  LY   +P W  +     DW  K     V+D+   GS    +    L G L     KL
Sbjct:   108 NGTLY---VPDWSSRAPAAVDWRRKGYVTPVKDQGQCGSCWAFSSVGALEGQLKRRTGKL 164

Query:   499 SRLATEKLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXX 558
               L+ + LV C  +N GC GG M +A +Y+  N G+ S+ AYPY   +    C+      
Sbjct:   165 LSLSPQNLVYCVSNNNGCGGGYMTNAFEYVRLNRGIDSEDAYPYIGQDES--CMYSPTGK 222

Query:   559 XXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNAN--GLFYYSGGVI-DLNQRLCNPKA 615
                    Y  IP   E+ +K+ VA  GP+SVG++A+     +YS GV  D     CNP+ 
Sbjct:   223 AAKCRG-YREIPEDNEKALKRAVARIGPVSVGIDASLPSFQFYSRGVYYDTG---CNPEN 278

Query:   616 QNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
              NHA++ VGYG ++   GT   +WI+KNSWG++WG K
Sbjct:   279 INHAVLAVGYGAQK---GTK--HWIIKNSWGTEWGNK 310

 Score = 128 (50.1 bits), Expect = 4.6e-05, Sum P(2) = 4.6e-05
 Identities = 23/51 (45%), Positives = 32/51 (62%)

Query:   149 PEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQH 199
             P A DWR +G ++ VK+QG+C  CWAFS+VG +E     +   L  LS Q+
Sbjct:   121 PAAVDWRRKGYVTPVKDQGQCGSCWAFSSVGALEGQLKRRTGKLLSLSPQN 171

 Score = 42 (19.8 bits), Expect = 4.6e-05, Sum P(2) = 4.6e-05
 Identities = 10/43 (23%), Positives = 18/43 (41%)

Query:   284 AFQFNSLRHGDDLPEAFDWRAEGVISKVKEQGKCACCWAFSAV 326
             AF++  L  G D  +A+ +  +         GK A C  +  +
Sbjct:   190 AFEYVRLNRGIDSEDAYPYIGQDESCMYSPTGKAAKCRGYREI 232


>TAIR|locus:2029934 [details] [associations]
            symbol:AT1G29080 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 EMBL:CP002684 GenomeReviews:CT485782_GR
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 EMBL:AC021043 MEROPS:I29.003 HOGENOM:HOG000230773
            HSSP:P53634 ProtClustDB:CLSN2688064 EMBL:DQ056468 IPI:IPI00521747
            PIR:C86413 RefSeq:NP_564320.1 UniGene:At.51814
            ProteinModelPortal:Q9LP39 SMR:Q9LP39 EnsemblPlants:AT1G29080.1
            GeneID:839783 KEGG:ath:AT1G29080 TAIR:At1g29080 InParanoid:Q9LP39
            OMA:KTWGENG PhylomeDB:Q9LP39 Genevestigator:Q9LP39 Uniprot:Q9LP39
        Length = 346

 Score = 321 (118.1 bits), Expect = 4.1e-28, P = 4.1e-28
 Identities = 92/290 (31%), Positives = 139/290 (47%)

Query:   202 KVYSSVEDLLRRHENFVTNVEKAEDYQSEDSGTAVFGVNKFFDLSESD-LQQLTGLNLDS 260
             +VY    +   R +    N++  E + +  + +   GVN+F D ++ + L   TGL    
Sbjct:    48 RVYDDEFEKQLRLQVLTENLKFIESFNNMGNQSYKLGVNEFTDWTKEEFLATYTGLR--- 104

Query:   261 TLEDIQPSLQAPFSS-NQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVKEQGKCAC 319
                    ++ +PF   N+T     A+ +      D L    DWR EG ++ VK QG+C  
Sbjct:   105 -----GVNVTSPFEVVNETKP---AWNWTV---SDVLGTNKDWRNEGAVTPVKSQGECGG 153

Query:   320 CWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMS-NGGCNGGRMDDALQYIIDNGGVVS 378
             CWAFSA+  VE +  I   +L  LS QQL+DC    N GC GG   +A  YII + G+ S
Sbjct:   154 CWAFSAIAAVEGLTKIARGNLISLSEQQLLDCTREQNNGCKGGTFVNAFNYIIKHRGISS 213

Query:   379 DQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNAN-- 436
             +  YPY+  E    C              +  +P   E  + + V+ R P++V ++A+  
Sbjct:   214 ENEYPYQVKEGP--C--RSNARPAILIRGFENVPSNNERALLEAVS-RQPVAVAIDASEA 268

Query:   437 GLFYYSGGVID-------LNQRL----YGTS---IPYWIVKNSWGSDWGE 472
             G  +YSGGV +       +N  +    YGTS   + YW+ KNSWG  WGE
Sbjct:   269 GFVHYSGGVYNARNCGTSVNHAVTLVGYGTSPEGMKYWLAKNSWGKTWGE 318

 Score = 235 (87.8 bits), Expect = 1.9e-27, Sum P(2) = 1.9e-27
 Identities = 56/165 (33%), Positives = 82/165 (49%)

Query:   490 LTGVLPSKLSRLATEKLVDCDMS-NGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESE 548
             LT +    L  L+ ++L+DC    N GC GG   +A  YII + G+ S+  YPY+  E  
Sbjct:   166 LTKIARGNLISLSEQQLLDCTREQNNGCKGGTFVNAFNYIIKHRGISSENEYPYQVKEGP 225

Query:   549 RGCLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNAN--GLFYYSGGVIDL 606
               C              +  +P   E  + + V+ R P++V ++A+  G  +YSGGV   
Sbjct:   226 --C--RSNARPAILIRGFENVPSNNERALLEAVS-RQPVAVAIDASEAGFVHYSGGVY-- 278

Query:   607 NQRLCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGE 651
             N R C     NHA+ +VGYG   +     + YW+ KNSWG  WGE
Sbjct:   279 NARNCGTSV-NHAVTLVGYGTSPE----GMKYWLAKNSWGKTWGE 318

 Score = 133 (51.9 bits), Expect = 1.9e-27, Sum P(2) = 1.9e-27
 Identities = 45/153 (29%), Positives = 68/153 (44%)

Query:    48 FMRDHDKVYSSVEDLLRRHENFVTNVEKAEDYQREDSGTAVFEVNKFFDLSDSD-LQQLT 106
             +M    +VY    +   R +    N++  E +    + +    VN+F D +  + L   T
Sbjct:    42 WMIQFSRVYDDEFEKQLRLQVLTENLKFIESFNNMGNQSYKLGVNEFTDWTKEEFLATYT 101

Query:   107 GLNLDSTLEDIQPSLQAPFSS-NQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVKE 165
             GL           ++ +PF   N+T     A+ +      D L    DWR EG ++ VK 
Sbjct:   102 GLR--------GVNVTSPFEVVNETKP---AWNWTV---SDVLGTNKDWRNEGAVTPVKS 147

Query:   166 QGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
             QG+C  CWAFSA+  VE +  I   NL  LS Q
Sbjct:   148 QGECGGCWAFSAIAAVEGLTKIARGNLISLSEQ 180


>UNIPROTKB|Q86GF7 [details] [associations]
            symbol:Cys "Crustapain" species:6703 "Pandalus borealis"
            [GO:0005576 "extracellular region" evidence=IC] [GO:0007586
            "digestion" evidence=NAS] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IDA] [GO:0030163 "protein catabolic process"
            evidence=NAS] [GO:0030574 "collagen catabolic process"
            evidence=IDA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 GO:GO:0005576
            GO:GO:0007586 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0030163 GO:GO:0030574 EMBL:AB091669
            ProteinModelPortal:Q86GF7 SMR:Q86GF7 MEROPS:C01.030 Uniprot:Q86GF7
        Length = 323

 Score = 312 (114.9 bits), Expect = 4.0e-27, P = 4.0e-27
 Identities = 76/191 (39%), Positives = 98/191 (51%)

Query:   301 DWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSNG--GC 358
             DWR +G ++ VK+QG+C  CWAFSAV  +E  H ++   L  LS Q LVDC  S G  GC
Sbjct:   111 DWRNKGAVTPVKDQGQCGSCWAFSAVAALEGAHFLKTGDLVSLSEQNLVDCSSSYGNQGC 170

Query:   359 NGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYGEEEE 418
             NGG    A QYII N G+ ++ +YPYKA +    C              Y     G+E  
Sbjct:   171 NGGWPYQAYQYIIANRGIDTESSYPYKAIDDN--C-RYDAGNIGATVSSYVEPASGDESA 227

Query:   419 MKKWVATRGPLSVGMNANGLFY--YSGGVI-----D---LNQRL----YGTSI---PYWI 461
             ++  V   GP+SV ++A    +  Y GGV      D    N  +    YGT      YWI
Sbjct:   228 LQHAVQNEGPVSVCIDAGQSSFGSYGGGVYYEPNCDSWYANHAVTAVGYGTDANGGDYWI 287

Query:   462 VKNSWGSDWGE 472
             VKNSWG+ WGE
Sbjct:   288 VKNSWGAWWGE 298

 Score = 232 (86.7 bits), Expect = 4.5e-28, Sum P(2) = 4.5e-28
 Identities = 60/158 (37%), Positives = 78/158 (49%)

Query:   498 LSRLATEKLVDCDMSNG--GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXX 555
             L  L+ + LVDC  S G  GCNGG    A QYII N G+ ++ +YPYKA +    C    
Sbjct:   150 LVSLSEQNLVDCSSSYGNQGCNGGWPYQAYQYIIANRGIDTESSYPYKAIDDN--C-RYD 206

Query:   556 XXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLFY--YSGGVIDLNQRLCNP 613
                       Y     G+E  ++  V   GP+SV ++A    +  Y GGV    +  C+ 
Sbjct:   207 AGNIGATVSSYVEPASGDESALQHAVQNEGPVSVCIDAGQSSFGSYGGGVY--YEPNCDS 264

Query:   614 KAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGE 651
                NHA+  VGYG     D     YWIVKNSWG+ WGE
Sbjct:   265 WYANHAVTAVGYGT----DANGGDYWIVKNSWGAWWGE 298

 Score = 132 (51.5 bits), Expect = 4.5e-28, Sum P(2) = 4.5e-28
 Identities = 22/47 (46%), Positives = 32/47 (68%)

Query:   153 DWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQH 199
             DWR +G ++ VK+QG+C  CWAFSAV  +E  H ++  +L  LS Q+
Sbjct:   111 DWRNKGAVTPVKDQGQCGSCWAFSAVAALEGAHFLKTGDLVSLSEQN 157


>MGI|MGI:1860262 [details] [associations]
            symbol:Cts7 "cathepsin 7" species:10090 "Mus musculus"
            [GO:0005575 "cellular_component" evidence=ND] [GO:0005576
            "extracellular region" evidence=IEA] [GO:0005634 "nucleus"
            evidence=IEA] [GO:0005737 "cytoplasm" evidence=IEA] [GO:0005764
            "lysosome" evidence=IEA] [GO:0005768 "endosome" evidence=IEA]
            [GO:0005794 "Golgi apparatus" evidence=IEA] [GO:0006508
            "proteolysis" evidence=ISA] [GO:0007049 "cell cycle" evidence=IEA]
            [GO:0007067 "mitosis" evidence=IEA] [GO:0008152 "metabolic process"
            evidence=IEA] [GO:0008233 "peptidase activity" evidence=ISA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0016787 "hydrolase activity" evidence=IEA] [GO:0051301 "cell
            division" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            MGI:MGI:1860262 GO:GO:0005634 GO:GO:0005794 GO:GO:0048471
            GO:GO:0005615 GO:GO:0051301 GO:GO:0007067 GO:GO:0005768
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0005764 GO:GO:0008233 EMBL:CH466546
            EMBL:AY014779 EMBL:CT030645 EMBL:BC064740 EMBL:AF250837
            IPI:IPI00131132 RefSeq:NP_062412.1 UniGene:Mm.3692 HSSP:O60911
            ProteinModelPortal:Q91ZF2 SMR:Q91ZF2 STRING:Q91ZF2 MEROPS:C01.016
            PRIDE:Q91ZF2 Ensembl:ENSMUST00000021892 GeneID:56092 KEGG:mmu:56092
            UCSC:uc007qwi.1 CTD:56092 GeneTree:ENSGT00560000076577
            HOVERGEN:HBG011513 InParanoid:Q91ZF2 OMA:ERRVIWE OrthoDB:EOG44QT2S
            NextBio:311908 Bgee:Q91ZF2 Genevestigator:Q91ZF2 Uniprot:Q91ZF2
        Length = 331

 Score = 255 (94.8 bits), Expect = 3.4e-24, Sum P(2) = 3.4e-24
 Identities = 58/174 (33%), Positives = 89/174 (51%)

Query:   280 TEMRAFQFNSLRH----GDDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAI 335
             TE  ++   + +H       +P   DWR EG ++ V+ QG C  CWAFS    +E     
Sbjct:    92 TESSSYPLRNGKHIQKRNPKIPPTLDWRKEGYVTPVRRQGSCGACWAFSVTACIEGQLFK 151

Query:   336 QGNSLTELSVQQLVDCDMSNG--GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGC 393
             +   L  LSVQ L+DC +S G  GC+GGR  DA QY+ +NGG+ ++  YPY+A      C
Sbjct:   152 KTGKLIPLSVQNLMDCSVSYGTKGCDGGRPYDAFQYVKNNGGLEAEATYPYEAKAKH--C 209

Query:   394 LXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMN-ANGLFY-YSGGV 445
                           +  +P  EE  ++  V T GP++V ++ ++  F+ Y GG+
Sbjct:   210 RYRPERSVVKVNRFFV-VPRNEEALLQALV-THGPIAVAIDGSHASFHSYRGGI 261

 Score = 228 (85.3 bits), Expect = 4.7e-28, Sum P(3) = 4.7e-28
 Identities = 54/159 (33%), Positives = 86/159 (54%)

Query:   497 KLSRLATEKLVDCDMSNG--GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXX 554
             KL  L+ + L+DC +S G  GC+GGR  DA QY+ +NGG+ ++  YPY+A      C   
Sbjct:   155 KLIPLSVQNLMDCSVSYGTKGCDGGRPYDAFQYVKNNGGLEAEATYPYEAKAKH--CRYR 212

Query:   555 XXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMN-ANGLFY-YSGGVIDLNQRLCN 612
                        +  +P  EE  ++  V T GP++V ++ ++  F+ Y GG+   ++  C 
Sbjct:   213 PERSVVKVNRFFV-VPRNEEALLQALV-THGPIAVAIDGSHASFHSYRGGIY--HEPKCR 268

Query:   613 PKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGE 651
                 +H L++VGYG E   +  +  YW++KNS G  WGE
Sbjct:   269 KDTLDHGLLLVGYGYEGH-ESENRKYWLLKNSHGERWGE 306

 Score = 121 (47.7 bits), Expect = 4.7e-28, Sum P(3) = 4.7e-28
 Identities = 24/72 (33%), Positives = 35/72 (48%)

Query:   132 TEMRAFQFNSLRH----GDDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAI 187
             TE  ++   + +H       +P   DWR EG ++ V+ QG C  CWAFS    +E     
Sbjct:    92 TESSSYPLRNGKHIQKRNPKIPPTLDWRKEGYVTPVRRQGSCGACWAFSVTACIEGQLFK 151

Query:   188 QGNNLTELSVQH 199
             +   L  LSVQ+
Sbjct:   152 KTGKLIPLSVQN 163

 Score = 84 (34.6 bits), Expect = 3.2e-10, Sum P(3) = 3.2e-10
 Identities = 35/132 (26%), Positives = 51/132 (38%)

Query:   357 GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYGEE 416
             GC+GGR  DA QY+ +NGG+ ++  YPY+A                       R      
Sbjct:   175 GCDGGRPYDAFQYVKNNGGLEAEATYPYEAKAKHCRYRPERSVVKVNRFFVVPRNEEALL 234

Query:   417 EEMKKW----VATRGPLSVGMNANGLFYYSG----GVIDLNQRL--YG------TSIPYW 460
             + +       VA  G  +   +  G  Y+        +D    L  YG       +  YW
Sbjct:   235 QALVTHGPIAVAIDGSHASFHSYRGGIYHEPKCRKDTLDHGLLLVGYGYEGHESENRKYW 294

Query:   461 IVKNSWGSDWGE 472
             ++KNS G  WGE
Sbjct:   295 LLKNSHGERWGE 306

 Score = 54 (24.1 bits), Expect = 4.7e-28, Sum P(3) = 4.7e-28
 Identities = 16/60 (26%), Positives = 31/60 (51%)

Query:    50 RDHDKVYSSVEDLLRRHENFVTNVEKAEDYQREDS---GTAVFEVNKFFDLSDSDLQQLT 106
             R +D+ YS  E+  RR   +  NV+  + +  E+         E+N+F D++  +++ LT
Sbjct:    34 RSNDRTYSPEEEKQRRAV-WEGNVKWIKQHIMENGLWMNNFTIEMNEFGDMTGEEMKMLT 92


>UNIPROTKB|Q5E968 [details] [associations]
            symbol:CTSK "Cathepsin K" species:9913 "Bos taurus"
            [GO:0005764 "lysosome" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            InterPro:IPR015644 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 eggNOG:COG4870 HOGENOM:HOG000230774
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 EMBL:BT021052
            EMBL:BC109853 IPI:IPI00709374 RefSeq:NP_001029607.1
            UniGene:Bt.23218 ProteinModelPortal:Q5E968 SMR:Q5E968 STRING:Q5E968
            MEROPS:I29.007 PRIDE:Q5E968 Ensembl:ENSBTAT00000028016
            GeneID:513038 KEGG:bta:513038 CTD:1513 InParanoid:Q5E968 KO:K01371
            OrthoDB:EOG4SJ5FC NextBio:20870669 PANTHER:PTHR12411:SF55
            Uniprot:Q5E968
        Length = 329

 Score = 320 (117.7 bits), Expect = 5.3e-28, P = 5.3e-28
 Identities = 90/296 (30%), Positives = 144/296 (48%)

Query:   194 ELSVQHHDKVYSSVEDLLRRHENFVTNVEKAEDYQSEDSGTAVFGVNKFFDLSESDLQQL 253
             EL  + + K Y+S  D + R   +  N++    +  E S     GV+ + +L+ + L  +
Sbjct:    27 ELWKKTYRKQYNSKGDEISRRLIWEKNLKHISIHNLEAS----LGVHTY-ELAMNHLGDM 81

Query:   254 TGLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVKE 313
             T    +  ++ +   L+ P S ++++  +    +         P++ D+R +G ++ VK 
Sbjct:    82 TS---EEVVQKMT-GLKVPASRSRSNDTLYIPDWEGRA-----PDSVDYRKKGYVTPVKN 132

Query:   314 QGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSNGGCNGGRMDDALQYIIDN 373
             QG+C  CWAFS+VG +E     +   L  LS Q LVDC   N GC GG M +A QY+  N
Sbjct:   133 QGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTNAFQYVQKN 192

Query:   374 GGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGM 433
              G+ S+ AYPY   +    C+             Y  IP G E+ +K+ VA  GP+SV +
Sbjct:   193 RGIDSEDAYPYVGQDEN--CMYNPTGKAAKCRG-YREIPEGNEKALKRAVARVGPISVAI 249

Query:   434 NAN--GLFYYSGGVI--------DLNQRL----YGTSI--PYWIVKNSWGSDWGEK 473
             +A+     +Y  GV         +LN  +    YG      +WI+KNSWG +WG K
Sbjct:   250 DASLTSFQFYRKGVYYDENCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNK 305

 Score = 273 (101.2 bits), Expect = 6.7e-23, P = 6.7e-23
 Identities = 83/245 (33%), Positives = 118/245 (48%)

Query:   416 EEEMKKWVATRGPLSVGMNANGLFY--YSGGVID-LNQRLYGTSIPYWIVKNSW--GSDW 470
             EE ++K    + P S   + + L+   + G   D ++ R  G   P   VKN    GS W
Sbjct:    84 EEVVQKMTGLKVPASRSRSNDTLYIPDWEGRAPDSVDYRKKGYVTP---VKNQGQCGSCW 140

Query:   471 GEKVEDKVGSSGNRTRDLELTGVLPSKLSRLATEKLVDCDMSNGGCNGGRMDDALQYIID 530
                    VG+   + +  + TG    KL  L+ + LVDC   N GC GG M +A QY+  
Sbjct:   141 AFS---SVGALEGQLK--KKTG----KLLNLSPQNLVDCVSENDGCGGGYMTNAFQYVQK 191

Query:   531 NGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVG 590
             N G+ S+ AYPY   +    C+             Y  IP G E+ +K+ VA  GP+SV 
Sbjct:   192 NRGIDSEDAYPYVGQDEN--CMYNPTGKAAKCRG-YREIPEGNEKALKRAVARVGPISVA 248

Query:   591 MNANGL---FYYSGGVIDLNQRLCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGS 647
             ++A+     FY  G   D N   CN    NHA++ VGYG ++   G    +WI+KNSWG 
Sbjct:   249 IDASLTSFQFYRKGVYYDEN---CNSDNLNHAVLAVGYGIQK---GNK--HWIIKNSWGE 300

Query:   648 DWGEK 652
             +WG K
Sbjct:   301 NWGNK 305

 Score = 117 (46.2 bits), Expect = 0.00071, P = 0.00071
 Identities = 21/51 (41%), Positives = 32/51 (62%)

Query:   149 PEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQH 199
             P++ D+R +G ++ VK QG+C  CWAFS+VG +E     +   L  LS Q+
Sbjct:   116 PDSVDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQN 166


>UNIPROTKB|P83443 [details] [associations]
            symbol:P83443 "Macrodontain-1" species:203992 "Pseudananas
            sagenarius" [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=IDA] [GO:0005575 "cellular_component" evidence=ND]
            [GO:0006508 "proteolysis" evidence=IDA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0004197
            ProteinModelPortal:P83443 SMR:P83443 MEROPS:C01.028 Uniprot:P83443
        Length = 213

 Score = 314 (115.6 bits), Expect = 2.4e-27, P = 2.4e-27
 Identities = 69/189 (36%), Positives = 104/189 (55%)

Query:   296 LPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSN 355
             +P++ DWR  G +++VK QG C  CWAF+A+  VE ++ I+  +L  LS Q+++DC +S 
Sbjct:     2 VPQSIDWRDYGAVNEVKNQGPCGGCWAFAAIATVEGIYKIRKGNLVYLSEQEVLDCAVSY 61

Query:   356 GGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYGE 415
             G C GG ++ A  +II N GV +D+ YPY+A +   G               YS +   +
Sbjct:    62 G-CKGGWVNRAYDFIISNNGVTTDENYPYRAYQ---GTCNANYFPNSAYITGYSYVRRND 117

Query:   416 EEEMKKWVATRGPLSVGMNANG--LFYYSGGV------IDLNQRL----YGTSIPYWIVK 463
             E  M   V+ + P++  ++A+G    YY GGV        LN  +    YG    YWIV+
Sbjct:   118 ESHMMYAVSNQ-PIAALIDASGDNFQYYKGGVYSGPCGFSLNHAITIIGYGRD-SYWIVR 175

Query:   464 NSWGSDWGE 472
             NSWGS WG+
Sbjct:   176 NSWGSSWGQ 184

 Score = 210 (79.0 bits), Expect = 6.8e-28, Sum P(2) = 6.8e-28
 Identities = 52/153 (33%), Positives = 80/153 (52%)

Query:   501 LATEKLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXX 560
             L+ ++++DC +S G C GG ++ A  +II N GV +D+ YPY+A +   G          
Sbjct:    49 LSEQEVLDCAVSYG-CKGGWVNRAYDFIISNNGVTTDENYPYRAYQ---GTCNANYFPNS 104

Query:   561 XXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANG--LFYYSGGVIDLNQRLCNPKAQNH 618
                  YS +   +E  M   V+ + P++  ++A+G    YY GGV       C   + NH
Sbjct:   105 AYITGYSYVRRNDESHMMYAVSNQ-PIAALIDASGDNFQYYKGGVYS---GPCG-FSLNH 159

Query:   619 ALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGE 651
             A+ I+GYG +         YWIV+NSWGS WG+
Sbjct:   160 AITIIGYGRDS--------YWIVRNSWGSSWGQ 184

 Score = 136 (52.9 bits), Expect = 6.8e-28, Sum P(2) = 6.8e-28
 Identities = 23/51 (45%), Positives = 34/51 (66%)

Query:   148 LPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
             +P++ DWR  G +++VK QG C  CWAF+A+  VE ++ I+  NL  LS Q
Sbjct:     2 VPQSIDWRDYGAVNEVKNQGPCGGCWAFAAIATVEGIYKIRKGNLVYLSEQ 52


>MGI|MGI:1861723 [details] [associations]
            symbol:Ctsr "cathepsin R" species:10090 "Mus musculus"
            [GO:0005764 "lysosome" evidence=ISA] [GO:0006508 "proteolysis"
            evidence=IEA] [GO:0008233 "peptidase activity" evidence=IEA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=ISA]
            [GO:0016787 "hydrolase activity" evidence=IEA] [GO:0030163 "protein
            catabolic process" evidence=ISA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            MGI:MGI:1861723 eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 GO:GO:0030163
            GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 EMBL:AF245399
            EMBL:AY014778 EMBL:AK014432 EMBL:AK005429 IPI:IPI00120321
            RefSeq:NP_064680.1 UniGene:Mm.315715 ProteinModelPortal:Q9JIA9
            SMR:Q9JIA9 MEROPS:C01.042 PRIDE:Q9JIA9 Ensembl:ENSMUST00000021889
            GeneID:56835 KEGG:mmu:56835 CTD:56835 InParanoid:Q9JIA9 KO:K09601
            OMA:ASHESFK OrthoDB:EOG4ZCT6D NextBio:313379 Bgee:Q9JIA9
            CleanEx:MM_CTSR Genevestigator:Q9JIA9 GermOnline:ENSMUSG00000055679
            Uniprot:Q9JIA9
        Length = 334

 Score = 267 (99.0 bits), Expect = 3.0e-22, P = 3.0e-22
 Identities = 59/157 (37%), Positives = 82/157 (52%)

Query:   293 GDDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCD 352
             G  LP+  DWR +G ++ V+ QG C  CWAF+  G +EA    Q   LT LSVQ LVDC 
Sbjct:   112 GSILPKFVDWRKKGYVTPVRRQGDCDACWAFAVTGAIEAQAIWQTGKLTPLSVQNLVDCS 171

Query:   353 --MSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSR 410
                 N GC GG   +A QY++ NGG+ S+  YPY   E + G               +  
Sbjct:   172 KPQGNNGCLGGDTYNAFQYVLHNGGLESEATYPY---EGKDGPCRYNPKNSKAEITGFVS 228

Query:   411 IPYGEEEEMKKWVATRGPLSVGMNANGLFY--YSGGV 445
             +P  E+  M   VAT GP++ G++A+   +  Y GG+
Sbjct:   229 LPQSEDILMAA-VATIGPITAGIDASHESFKNYKGGI 264

 Score = 232 (86.7 bits), Expect = 7.5e-28, Sum P(2) = 7.5e-28
 Identities = 54/159 (33%), Positives = 83/159 (52%)

Query:   497 KLSRLATEKLVDCD--MSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXX 554
             KL+ L+ + LVDC     N GC GG   +A QY++ NGG+ S+  YPY   E + G    
Sbjct:   158 KLTPLSVQNLVDCSKPQGNNGCLGGDTYNAFQYVLHNGGLESEATYPY---EGKDGPCRY 214

Query:   555 XXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLFY--YSGGVIDLNQRLCN 612
                        +  +P  E+  M   VAT GP++ G++A+   +  Y GG+   ++  C+
Sbjct:   215 NPKNSKAEITGFVSLPQSEDILMAA-VATIGPITAGIDASHESFKNYKGGIY--HEPNCS 271

Query:   613 PKAQNHALIIVGYGEEE-KKDGTSIPYWIVKNSWGSDWG 650
                  H +++VGYG +  + DG    YW++KNSWG  WG
Sbjct:   272 SDTVTHGVLVVGYGFKGIETDGNH--YWLIKNSWGKRWG 308

 Score = 137 (53.3 bits), Expect = 7.5e-28, Sum P(2) = 7.5e-28
 Identities = 25/55 (45%), Positives = 33/55 (60%)

Query:   145 GDDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQH 199
             G  LP+  DWR +G ++ V+ QG C  CWAF+  G +EA    Q   LT LSVQ+
Sbjct:   112 GSILPKFVDWRKKGYVTPVRRQGDCDACWAFAVTGAIEAQAIWQTGKLTPLSVQN 166

 Score = 104 (41.7 bits), Expect = 1.8e-12, Sum P(2) = 1.8e-12
 Identities = 38/146 (26%), Positives = 58/146 (39%)

Query:   355 NGGCNGGRMDDALQYIIDNGGVVSDQAYPY-----------KASESE-RG--CLXXXXXX 400
             N GC GG   +A QY++ NGG+ S+  YPY           K S++E  G   L      
Sbjct:   176 NNGCLGGDTYNAFQYVLHNGGLESEATYPYEGKDGPCRYNPKNSKAEITGFVSLPQSEDI 235

Query:   401 XXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLFYYSGGVIDLNQRLYGTSIP-- 458
                       I  G +   + +   +G +    N +     + GV+ +     G      
Sbjct:   236 LMAAVATIGPITAGIDASHESFKNYKGGIYHEPNCSS-DTVTHGVLVVGYGFKGIETDGN 294

Query:   459 -YWIVKNSWGSDWGEKVEDKVGSSGN 483
              YW++KNSWG  WG +   K+    N
Sbjct:   295 HYWLIKNSWGKRWGIRGYMKLAKDKN 320


>UNIPROTKB|H0YD65 [details] [associations]
            symbol:CTSF "Cathepsin F" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 SMART:SM00645 InterPro:IPR000169
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00139 EMBL:AP002748
            HGNC:HGNC:2531 ChiTaRS:CTSF Ensembl:ENST00000524994 Uniprot:H0YD65
        Length = 283

 Score = 254 (94.5 bits), Expect = 1.2e-27, Sum P(2) = 1.2e-27
 Identities = 52/153 (33%), Positives = 79/153 (51%)

Query:   293 GDDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCD 352
             GD  P  +DWR++G ++KVK+QG C  CWAFS  G VE    +   +L  LS Q+L+DCD
Sbjct:   116 GDLAPPEWDWRSKGAVTKVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCD 175

Query:   353 MSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIP 412
               +  C GG   +A   I + GG+ ++  Y Y+                       S+  
Sbjct:   176 KMDKACMGGLPSNAYSAIKNLGGLETEDDYSYQGHMQSCNFSAEKAKVYINDSVELSQ-- 233

Query:   413 YGEEEEMKKWVATRGPLSVGMNANGLFYYSGGV 445
                E+++  W+A RGP+SV +NA G+ +Y  G+
Sbjct:   234 --NEQKLAAWLAKRGPISVAINAFGMQFYRHGI 264

 Score = 143 (55.4 bits), Expect = 3.2e-22, Sum P(3) = 3.2e-22
 Identities = 34/125 (27%), Positives = 61/125 (48%)

Query:   498 LSRLATEKLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXX 557
             L  L+ ++L+DCD  +  C GG   +A   I + GG+ ++  Y Y+              
Sbjct:   163 LLSLSEQELLDCDKMDKACMGGLPSNAYSAIKNLGGLETEDDYSYQGHMQSCNFSAEKAK 222

Query:   558 XXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLFYYSGGVIDLNQRLCNPKAQN 617
                      S+     E+++  W+A RGP+SV +NA G+ +Y  G+    + LC+P   +
Sbjct:   223 VYINDSVELSQ----NEQKLAAWLAKRGPISVAINAFGMQFYRHGISRPLRPLCSPWLID 278

Query:   618 HALII 622
             HA+++
Sbjct:   279 HAVLL 283

 Score = 137 (53.3 bits), Expect = 3.2e-22, Sum P(3) = 3.2e-22
 Identities = 25/54 (46%), Positives = 32/54 (59%)

Query:   145 GDDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
             GD  P  +DWR++G ++KVK+QG C  CWAFS  G VE    +    L  LS Q
Sbjct:   116 GDLAPPEWDWRSKGAVTKVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQ 169

 Score = 88 (36.0 bits), Expect = 1.2e-27, Sum P(2) = 1.2e-27
 Identities = 20/54 (37%), Positives = 32/54 (59%)

Query:   200 HDKVYSSVEDLLRRHENFVTNVEKAEDYQSEDSGTAVFGVNKFFDLSESDLQQL 253
             +++ Y S E   R    FV N+ +A+  Q+ D GTA +GV KF DL+E + + +
Sbjct:    43 YNRTYESKEARWRLSV-FVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRTI 95

 Score = 88 (36.0 bits), Expect = 3.2e-22, Sum P(3) = 3.2e-22
 Identities = 21/61 (34%), Positives = 34/61 (55%)

Query:    45 FLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAEDYQREDSGTAVFEVNKFFDLSDSDLQQ 104
             F NF+  +++ Y S E   R    FV N+ +A+  Q  D GTA + V KF DL++ + + 
Sbjct:    36 FKNFVITYNRTYESKEARWRLSV-FVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 94

Query:   105 L 105
             +
Sbjct:    95 I 95


>TAIR|locus:2120222 [details] [associations]
            symbol:RD19 "RESPONSIVE TO DEHYDRATION 19" species:3702
            "Arabidopsis thaliana" [GO:0005576 "extracellular region"
            evidence=ISM] [GO:0006508 "proteolysis" evidence=IEA;ISS]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA;ISS]
            [GO:0009269 "response to desiccation" evidence=IEP] [GO:0006970
            "response to osmotic stress" evidence=IGI] [GO:0005634 "nucleus"
            evidence=IDA] [GO:0005773 "vacuole" evidence=IDA] [GO:0042742
            "defense response to bacterium" evidence=IMP] [GO:0006096
            "glycolysis" evidence=RCA] [GO:0006833 "water transport"
            evidence=RCA] [GO:0006972 "hyperosmotic response" evidence=RCA]
            [GO:0007030 "Golgi organization" evidence=RCA] [GO:0009266
            "response to temperature stimulus" evidence=RCA] [GO:0009651
            "response to salt stress" evidence=IEP;RCA] [GO:0046686 "response
            to cadmium ion" evidence=RCA] [GO:0004197 "cysteine-type
            endopeptidase activity" evidence=ISS] [GO:0009414 "response to
            water deprivation" evidence=TAS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0005634 GO:GO:0005773 EMBL:CP002687
            GenomeReviews:CT486007_GR GO:GO:0009651 GO:GO:0042742
            eggNOG:COG4870 HOGENOM:HOG000230774 KO:K01373
            ProtClustDB:CLSN2688311 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 EMBL:AL035679 EMBL:AL161594 GO:GO:0004197
            MEROPS:C01.022 EMBL:D13042 EMBL:AY080598 EMBL:AY133844
            IPI:IPI00544363 PIR:JN0718 RefSeq:NP_568052.1 UniGene:At.2850
            UniGene:At.74924 ProteinModelPortal:P43296 SMR:P43296 STRING:P43296
            PaxDb:P43296 PRIDE:P43296 EnsemblPlants:AT4G39090.1 GeneID:830064
            KEGG:ath:AT4G39090 TAIR:At4g39090 InParanoid:P43296 OMA:EDFDWRD
            PhylomeDB:P43296 Genevestigator:P43296 GermOnline:AT4G39090
            Uniprot:P43296
        Length = 368

 Score = 271 (100.5 bits), Expect = 1.3e-27, Sum P(2) = 1.3e-27
 Identities = 58/161 (36%), Positives = 84/161 (52%)

Query:   294 DDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDM 353
             ++LPE FDWR  G ++ VK QG C  CW+FSA G +E  + +    L  LS QQLVDCD 
Sbjct:   133 ENLPEDFDWRDHGAVTPVKNQGSCGSCWSFSATGALEGANFLATGKLVSLSEQQLVDCDH 192

Query:   354 S---------NGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXX 404
                       + GCNGG M+ A +Y +  GG++ ++ YPY   + +  C           
Sbjct:   193 ECDPEEADSCDSGCNGGLMNSAFEYTLKTGGLMKEEDYPYTGKDGKT-C-KLDKSKIVAS 250

Query:   405 XXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLFYYSGGV 445
                +S I   +EE++   +   GPL+V +NA  +  Y GGV
Sbjct:   251 VSNFSVISI-DEEQIAANLVKNGPLAVAINAGYMQTYIGGV 290

 Score = 229 (85.7 bits), Expect = 1.4e-19, Sum P(2) = 1.4e-19
 Identities = 71/219 (32%), Positives = 102/219 (46%)

Query:   447 DLNQRLYGTSIPYWIVKN--SWGSDWGEKVEDKVGSSGNRTRDLELTGVLPS-KLSRLAT 503
             D + R +G   P   VKN  S GS W          S + T  LE    L + KL  L+ 
Sbjct:   138 DFDWRDHGAVTP---VKNQGSCGSCW----------SFSATGALEGANFLATGKLVSLSE 184

Query:   504 EKLVDCDMS---------NGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXX 554
             ++LVDCD           + GCNGG M+ A +Y +  GG++ ++ YPY   + +  C   
Sbjct:   185 QQLVDCDHECDPEEADSCDSGCNGGLMNSAFEYTLKTGGLMKEEDYPYTGKDGKT-C-KL 242

Query:   555 XXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLFYYSGGVIDLNQRLCNPK 614
                        +S I   +EE++   +   GPL+V +NA  +  Y GGV      +C  +
Sbjct:   243 DKSKIVASVSNFSVISI-DEEQIAANLVKNGPLAVAINAGYMQTYIGGVSC--PYICTRR 299

Query:   615 AQNHALIIVGYGEEEKKDGT--SIPYWIVKNSWGSDWGE 651
               NH +++VGYG            PYWI+KNSWG  WGE
Sbjct:   300 L-NHGVLLVGYGAAGYAPARFKEKPYWIIKNSWGETWGE 337

 Score = 139 (54.0 bits), Expect = 5.3e-09, Sum P(2) = 5.3e-09
 Identities = 24/53 (45%), Positives = 32/53 (60%)

Query:   146 DDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
             ++LPE FDWR  G ++ VK QG C  CW+FSA G +E  + +    L  LS Q
Sbjct:   133 ENLPEDFDWRDHGAVTPVKNQGSCGSCWSFSATGALEGANFLATGKLVSLSEQ 185

 Score = 70 (29.7 bits), Expect = 1.3e-27, Sum P(2) = 1.3e-27
 Identities = 20/60 (33%), Positives = 32/60 (53%)

Query:    45 FLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAEDYQREDSGTAVFEVNKFFDLSDSDLQQ 104
             F  F R   KVY+S E+   R   F  N+ +A  +Q+ D  +A   V +F DL+ S+ ++
Sbjct:    51 FSLFKRKFGKVYASNEEHDYRFSVFKANLRRARRHQKLDP-SATHGVTQFSDLTRSEFRK 109


>DICTYBASE|DDB_G0272298 [details] [associations]
            symbol:DDB_G0272298 species:44689 "Dictyostelium
            discoideum" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 dictyBase:DDB_G0272298 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246
            SMART:SM00848 EMBL:AAFI02000008 KO:K01365 RefSeq:XP_645281.1
            ProteinModelPortal:Q559Q3 MEROPS:C01.A53 EnsemblProtists:DDB0203746
            GeneID:8618447 KEGG:ddi:DDB_G0272298 InParanoid:Q559Q3 OMA:PANINWR
            Uniprot:Q559Q3
        Length = 305

 Score = 314 (115.6 bits), Expect = 2.4e-27, P = 2.4e-27
 Identities = 86/291 (29%), Positives = 136/291 (46%)

Query:   200 HDKVYSSVEDLLRRHENFVTNVEKAEDYQSEDSGTAVFGVNKFFDLSESDLQQLTGLNLD 259
             ++K Y + ++ L+R + F  N     ++++++       +N++ DL++ +         D
Sbjct:     4 YNKHYKNNKEYLKRFDIFQDNYNFILNHRNKNGENIEMDLNEYSDLTQKEFA-------D 56

Query:   260 STLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVKEQGKCAC 319
                E + P    P S    D +   F+ N       +P++FDWR  G + KVK QG CA 
Sbjct:    57 KFFEKLVPE---PRSGPINDIKATPFKHNV---NATIPKSFDWRDHGAVGKVKNQGSCAS 110

Query:   320 CWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSNG--GCNGGRMDDALQYIIDNGGVV 377
             CW+FSA+G +E  + I+   L +LS Q LVDC    G  GC  G M DA +YII +GGV 
Sbjct:   111 CWSFSALGALEGHYYIKYGELLDLSEQNLVDCATPFGPKGCKTGWMHDAFKYIISSGGVN 170

Query:   378 SDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNAN- 436
              +  YPY   +    C              +  IP  +E  + + +A  GP++V ++ + 
Sbjct:   171 LESQYPYTGKDEV--C-KFNQSEKEAKVSGFVMIPKFDESALMEAIALYGPVAVPIDTST 227

Query:   437 -------GLFYYSGGVIDLNQRL------YGTS---IPYWIVKNSWGSDWG 471
                    G  YYS      N         YGT    + Y+++KNSWG  WG
Sbjct:   228 KEFQHLSGGIYYSDSCDPWNTIHAVLAIGYGTDENGVDYFLMKNSWGKSWG 278

 Score = 196 (74.1 bits), Expect = 2.0e-27, Sum P(2) = 2.0e-27
 Identities = 50/158 (31%), Positives = 79/158 (50%)

Query:   497 KLSRLATEKLVDCDMSNG--GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXX 554
             +L  L+ + LVDC    G  GC  G M DA +YII +GGV  +  YPY   +    C   
Sbjct:   130 ELLDLSEQNLVDCATPFGPKGCKTGWMHDAFKYIISSGGVNLESQYPYTGKDEV--C-KF 186

Query:   555 XXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGL-FYY-SGGVIDLNQRLCN 612
                        +  IP  +E  + + +A  GP++V ++ +   F + SGG+   +   C+
Sbjct:   187 NQSEKEAKVSGFVMIPKFDESALMEAIALYGPVAVPIDTSTKEFQHLSGGIYYSDS--CD 244

Query:   613 PKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWG 650
             P    HA++ +GYG +E      + Y+++KNSWG  WG
Sbjct:   245 PWNTIHAVLAIGYGTDEN----GVDYFLMKNSWGKSWG 278

 Score = 176 (67.0 bits), Expect = 2.0e-27, Sum P(2) = 2.0e-27
 Identities = 43/151 (28%), Positives = 75/151 (49%)

Query:    49 MRDHDKVYSSVEDLLRRHENFVTNVEKAEDYQREDSGTAVFEVNKFFDLSDSDLQQLTGL 108
             M  ++K Y + ++ L+R + F  N     +++ ++      ++N++ DL+  +       
Sbjct:     1 MVKYNKHYKNNKEYLKRFDIFQDNYNFILNHRNKNGENIEMDLNEYSDLTQKEFA----- 55

Query:   109 NLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVKEQGK 168
               D   E + P    P S    D +   F+ N       +P++FDWR  G + KVK QG 
Sbjct:    56 --DKFFEKLVPE---PRSGPINDIKATPFKHNV---NATIPKSFDWRDHGAVGKVKNQGS 107

Query:   169 CACCWAFSAVGVVEAMHAIQGNNLTELSVQH 199
             CA CW+FSA+G +E  + I+   L +LS Q+
Sbjct:   108 CASCWSFSALGALEGHYYIKYGELLDLSEQN 138


>DICTYBASE|DDB_G0278401 [details] [associations]
            symbol:cprH "cysteine proteinase 8" species:44689
            "Dictyostelium discoideum" [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 dictyBase:DDB_G0278401 EMBL:AAFI02000023
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00639 ProtClustDB:CLSZ2430780 RefSeq:XP_642342.1
            ProteinModelPortal:Q54Y60 MEROPS:C01.A62 EnsemblProtists:DDB0205428
            GeneID:8621547 KEGG:ddi:DDB_G0278401 InParanoid:Q54Y60 OMA:FANMENE
            Uniprot:Q54Y60
        Length = 337

 Score = 241 (89.9 bits), Expect = 4.5e-26, Sum P(3) = 4.5e-26
 Identities = 51/151 (33%), Positives = 78/151 (51%)

Query:   301 DWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTEL---SVQQLVDCD--MSN 355
             DWR +G ++ VK Q  C+ CW+FSA G  E  H +  N   EL   S Q L+DC     N
Sbjct:   117 DWRKKGAVTHVKNQQSCSGCWSFSATGATEGAHKLANNGTNELVSLSEQNLIDCSTPFGN 176

Query:   356 GGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYGE 415
              GCNGG +  A +YII NGG+ ++++YP++ ++   G               Y  + +G 
Sbjct:   177 TGCNGGVITYAFEYIISNGGIDTEKSYPFEGTD---GTCRYKSENSGATISSYVNVTFGS 233

Query:   416 EEEMKKWVATRGPLSVGMNANG---LFYYSG 443
             E  ++  V    P++  ++A+    LFY SG
Sbjct:   234 ESSLESAVNVN-PVACSIDASHSSFLFYKSG 263

 Score = 212 (79.7 bits), Expect = 2.1e-27, Sum P(3) = 2.1e-27
 Identities = 47/161 (29%), Positives = 83/161 (51%)

Query:   496 SKLSRLATEKLVDCD--MSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLX 553
             ++L  L+ + L+DC     N GCNGG +  A +YII NGG+ ++++YP++ ++   G   
Sbjct:   157 NELVSLSEQNLIDCSTPFGNTGCNGGVITYAFEYIISNGGIDTEKSYPFEGTD---GTCR 213

Query:   554 XXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNAN--GLFYYSGGVIDLNQRLC 611
                         Y  + +G E  ++  V    P++  ++A+     +Y  G+    +  C
Sbjct:   214 YKSENSGATISSYVNVTFGSESSLESAVNVN-PVACSIDASHSSFLFYKSGIYF--EPAC 270

Query:   612 NPKAQNHALIIVGYGEE--EKKDGTSIP----YWIVKNSWG 646
             +    +H +++VGYG E  + +D +S P    YWI KNSWG
Sbjct:   271 SRTNLDHGVLVVGYGTENSQSQDSSSEPNHSNYWIAKNSWG 311

 Score = 146 (56.5 bits), Expect = 2.6e-08, Sum P(2) = 2.6e-08
 Identities = 50/192 (26%), Positives = 88/192 (45%)

Query:   200 HDKVYSSVEDLLRRHENFVTNVEKAEDYQSEDSGTAVFGVNKFFDLSESDLQQL-TGLNL 258
             + K YSS E  + R+  F TN +  E++ S+ S T V G+NK  D++  + + L  G   
Sbjct:    37 NQKSYSSSE-FITRYNIFKTNFDYIEEWNSKGSET-VLGLNKMADITNEEYRSLYLGKPF 94

Query:   259 DSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVKEQGKCA 318
             D++   +  + +    SN+  + +   +  ++ H               +   +    C 
Sbjct:    95 DAS--SLIGTKEEILFSNKFSSTVDWRKKGAVTH---------------VKNQQSCSGCW 137

Query:   319 CCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCD--MSNGGCNGGRMDDALQYIIDNGGV 376
                A  A      +     N L  LS Q L+DC     N GCNGG +  A +YII NGG+
Sbjct:   138 SFSATGATEGAHKLANNGTNELVSLSEQNLIDCSTPFGNTGCNGGVITYAFEYIISNGGI 197

Query:   377 VSDQAYPYKASE 388
              ++++YP++ ++
Sbjct:   198 DTEKSYPFEGTD 209

 Score = 117 (46.2 bits), Expect = 2.1e-27, Sum P(3) = 2.1e-27
 Identities = 19/43 (44%), Positives = 25/43 (58%)

Query:   153 DWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTEL 195
             DWR +G ++ VK Q  C+ CW+FSA G  E  H +  N   EL
Sbjct:   117 DWRKKGAVTHVKNQQSCSGCWSFSATGATEGAHKLANNGTNEL 159

 Score = 79 (32.9 bits), Expect = 2.1e-27, Sum P(3) = 2.1e-27
 Identities = 26/94 (27%), Positives = 49/94 (52%)

Query:    12 KGLGYLHTFMIKVALLESNIFQTRGYLNSPVTRFLNFMRDHDKVYSSVEDLLRRHENFVT 71
             K L  L   +I VA  +  + +++ Y ++    F ++M  + K YSS E  + R+  F T
Sbjct:     2 KVLSVLCALLITVATAKQELSESQ-YRDA----FTDWMISNQKSYSSSE-FITRYNIFKT 55

Query:    72 NVEKAEDYQREDSGTAVFEVNKFFDLSDSDLQQL 105
             N +  E++  + S T V  +NK  D+++ + + L
Sbjct:    56 NFDYIEEWNSKGSET-VLGLNKMADITNEEYRSL 88

 Score = 54 (24.1 bits), Expect = 4.5e-26, Sum P(3) = 4.5e-26
 Identities = 8/9 (88%), Positives = 8/9 (88%)

Query:   459 YWIVKNSWG 467
             YWI KNSWG
Sbjct:   303 YWIAKNSWG 311

 Score = 47 (21.6 bits), Expect = 2.0e-10, Sum P(4) = 2.0e-10
 Identities = 13/32 (40%), Positives = 18/32 (56%)

Query:   227 YQSEDSGTAVFG-VNKFFDLSESDLQQLTGLN 257
             Y+SE+SG  +   VN  F  SES L+    +N
Sbjct:   214 YKSENSGATISSYVNVTFG-SESSLESAVNVN 244


>DICTYBASE|DDB_G0279185 [details] [associations]
            symbol:cprF "cysteine proteinase 6" species:44689
            "Dictyostelium discoideum" [GO:0005615 "extracellular space"
            evidence=IDA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0005764 "lysosome" evidence=IEA] [GO:0044351
            "macropinocytosis" evidence=RCA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 dictyBase:DDB_G0279185 GO:GO:0005615
            GenomeReviews:CM000152_GR eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            EMBL:AAFI02000030 HSSP:P07711 ProtClustDB:CLSZ2846820 EMBL:U72745
            RefSeq:XP_641725.1 ProteinModelPortal:Q94503 SMR:Q94503
            MEROPS:C01.081 PRIDE:Q94503 EnsemblProtists:DDB0215002
            GeneID:8621921 KEGG:ddi:DDB_G0279185 Uniprot:Q94503
        Length = 434

 Score = 288 (106.4 bits), Expect = 3.5e-27, Sum P(2) = 3.5e-27
 Identities = 83/290 (28%), Positives = 141/290 (48%)

Query:   213 RHENFVTN--VEKAEDYQSED-SGTA-VFGVNKFF--DLSESDLQQLTGLNL--DSTLED 264
             ++ N  TN  +     Y SE+ +G   +F  N  +  + +    + + GLN+  D T E+
Sbjct:    25 QYRNAFTNWMIAHQRHYSSEEFNGRFNIFKANMDYINEWNTKGSETVLGLNVFADITNEE 84

Query:   265 IQPS-LQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVKEQGKCACCWAF 323
              + + L  PF ++  +       F  ++       + DWRA+G ++ +K QG+C  CW+F
Sbjct:    85 YRATYLGTPFDASSLEMTPSEKVFGGVQ-----ANSVDWRAKGAVTPIKNQGECGGCWSF 139

Query:   324 SAVGVVE-AMHAIQGNS-LTELSVQQLVDCDMS--NGGCNGGRMDDALQYIIDNGGVVSD 379
             SA G  E A +   G+S LT +S QQL+DC  S  N GC GG M  A +YII+NGG+ ++
Sbjct:   140 SATGATEGAQYIANGDSDLTSVSEQQLIDCSGSYGNNGCEGGLMTLAFEYIINNGGIDTE 199

Query:   380 QAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNAN--G 437
              +YP+ A+ +E+ C              Y  +  G E ++   V T+GP SV ++A+   
Sbjct:   200 SSYPFTAN-TEK-C-KYNPSNIGAELSSYVNVTSGSESDLAAKV-TQGPTSVAIDASQPS 255

Query:   438 LFYYSGGVIDLNQRLYGTSIPYWIVKNSWGSDWGEKVEDKVGSSGNRTRD 487
               +YS G+ +       T + + ++   +GS          GS    + +
Sbjct:   256 FQFYSSGIYN-EPACSSTQLDHGVLAVGFGSGSSGSQSQSAGSQSQSSNN 304

 Score = 179 (68.1 bits), Expect = 1.1e-23, Sum P(3) = 1.1e-23
 Identities = 43/135 (31%), Positives = 74/135 (54%)

Query:   496 SKLSRLATEKLVDCDMS--NGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLX 553
             S L+ ++ ++L+DC  S  N GC GG M  A +YII+NGG+ ++ +YP+ A+ +E+ C  
Sbjct:   156 SDLTSVSEQQLIDCSGSYGNNGCEGGLMTLAFEYIINNGGIDTESSYPFTAN-TEK-C-K 212

Query:   554 XXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNAN--GLFYYSGGVIDLNQRLC 611
                         Y  +  G E ++   V T+GP SV ++A+     +YS G+   N+  C
Sbjct:   213 YNPSNIGAELSSYVNVTSGSESDLAAKV-TQGPTSVAIDASQPSFQFYSSGIY--NEPAC 269

Query:   612 NPKAQNHALIIVGYG 626
             +    +H ++ VG+G
Sbjct:   270 SSTQLDHGVLAVGFG 284

 Score = 132 (51.5 bits), Expect = 1.1e-23, Sum P(3) = 1.1e-23
 Identities = 40/145 (27%), Positives = 72/145 (49%)

Query:    65 RHENFVTN--VEKAEDYQRED-SGTA-VFEVNKFF--DLSDSDLQQLTGLNL--DSTLED 116
             ++ N  TN  +     Y  E+ +G   +F+ N  +  + +    + + GLN+  D T E+
Sbjct:    25 QYRNAFTNWMIAHQRHYSSEEFNGRFNIFKANMDYINEWNTKGSETVLGLNVFADITNEE 84

Query:   117 IQPS-LQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVKEQGKCACCWAF 175
              + + L  PF ++  +       F  ++       + DWRA+G ++ +K QG+C  CW+F
Sbjct:    85 YRATYLGTPFDASSLEMTPSEKVFGGVQ-----ANSVDWRAKGAVTPIKNQGECGGCWSF 139

Query:   176 SAVGVVE-AMHAIQGNN-LTELSVQ 198
             SA G  E A +   G++ LT +S Q
Sbjct:   140 SATGATEGAQYIANGDSDLTSVSEQ 164

 Score = 89 (36.4 bits), Expect = 6.4e-10, Sum P(2) = 6.4e-10
 Identities = 21/47 (44%), Positives = 25/47 (53%)

Query:   427 GPLSVGMNANGLFYYSGGVIDLNQRL--YGTSIPYWIVKNSWGSDWG 471
             G +S   +A+G   +SG     N     Y T   YWIVKNSWG DWG
Sbjct:   355 GSVSGSGSASGSSSFSGSSNGGNSNSGDYPTDGNYWIVKNSWGLDWG 401

 Score = 79 (32.9 bits), Expect = 3.5e-27, Sum P(2) = 3.5e-27
 Identities = 12/13 (92%), Positives = 12/13 (92%)

Query:   638 YWIVKNSWGSDWG 650
             YWIVKNSWG DWG
Sbjct:   389 YWIVKNSWGLDWG 401

 Score = 77 (32.2 bits), Expect = 5.1e-18, Sum P(3) = 5.1e-18
 Identities = 32/127 (25%), Positives = 61/127 (48%)

Query:    12 KGLGYLHTFMIKVALLESNIFQTRGYLNSPVTRFLNFMRDHDKVYSSVEDLLRRHENFVT 71
             K L  L   ++ VA  +  + + + Y N+    F N+M  H + YSS E+   R   F  
Sbjct:     2 KVLSALCVLLVSVATAKQQLSELQ-YRNA----FTNWMIAHQRHYSS-EEFNGRFNIFKA 55

Query:    72 NVEKAEDYQREDSGTAVFEVNKFFDLSDSDLQQL-TGLNLDSTLEDIQPSLQAPFSSNQT 130
             N++   ++  + S T V  +N F D+++ + +    G   D++  ++ PS +  F   Q 
Sbjct:    56 NMDYINEWNTKGSET-VLGLNVFADITNEEYRATYLGTPFDASSLEMTPS-EKVFGGVQA 113

Query:   131 DT-EMRA 136
             ++ + RA
Sbjct:   114 NSVDWRA 120


>DICTYBASE|DDB_G0283867 [details] [associations]
            symbol:cprC "cysteine proteinase 3" species:44689
            "Dictyostelium discoideum" [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0016787 "hydrolase activity" evidence=IEA] [GO:0008233
            "peptidase activity" evidence=IEA] [GO:0005764 "lysosome"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            dictyBase:DDB_G0283867 GenomeReviews:CM000153_GR eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 EMBL:AAFI02000057
            KO:K01365 EMBL:X03930 RefSeq:XP_638859.1 ProteinModelPortal:Q23894
            SMR:Q23894 MEROPS:C01.114 EnsemblProtists:DDB0220784 GeneID:8624257
            KEGG:ddi:DDB_G0283867 OMA:NNVEHIN Uniprot:Q23894
        Length = 337

 Score = 312 (114.9 bits), Expect = 4.0e-27, P = 4.0e-27
 Identities = 89/293 (30%), Positives = 140/293 (47%)

Query:   197 VQHHDKVYSSVEDLLRRHENFVTNVEKAEDYQSEDSGTAVFGVNKFFDLSESDLQQLTGL 256
             ++ ++K Y+  E  + R+E F  N++   ++ S+ S T V G+N+  DLS  + +    L
Sbjct:    38 MRSNNKAYTHKE-FMPRYEEFKKNMDYVHNWNSKGSKT-VLGLNQHADLSNEEYR----L 91

Query:   257 NLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVKEQGK 316
             N   T   I+  L   +        +   QF         P   DWR +  ++ VK+QG+
Sbjct:    92 NYLGTRAHIK--LNG-YHKRNLGLRLNRPQFKQ-------PLNVDWREKDAVTPVKDQGQ 141

Query:   317 CACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMS--NGGCNGGRMDDALQYIIDNG 374
             C  C++FS  G VE + AI+   L  LS Q ++DC  S  N GCNGG M +A +YII N 
Sbjct:   142 CGSCYSFSTTGSVEGVTAIKTGKLVSLSEQNILDCSSSFGNEGCNGGLMTNAFEYIIKNN 201

Query:   375 GVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMN 434
             G+ S++ YPY+   ++  C              Y  I  G+E +++  +    P+SV ++
Sbjct:   202 GLNSEEQYPYEMKVNDE-C-KFQEGSVAAKITSYKEIEAGDENDLQNALLLN-PVSVAID 258

Query:   435 A--NGLFYYSGGVI--------DLNQRLYGTSI------PYWIVKNSWGSDWG 471
             A  N    Y+ GV         DL+  +    +       Y+IVKNSWG  WG
Sbjct:   259 ASHNSFQLYTAGVYYEPACSSEDLDHGVLAVGMGTDNGEDYYIVKNSWGPSWG 311

 Score = 233 (87.1 bits), Expect = 3.0e-17, P = 3.0e-17
 Identities = 64/196 (32%), Positives = 100/196 (51%)

Query:   469 DWGEK-----VEDK--VGS--SGNRTRDLE-LTGVLPSKLSRLATEKLVDCDMS--NGGC 516
             DW EK     V+D+   GS  S + T  +E +T +   KL  L+ + ++DC  S  N GC
Sbjct:   126 DWREKDAVTPVKDQGQCGSCYSFSTTGSVEGVTAIKTGKLVSLSEQNILDCSSSFGNEGC 185

Query:   517 NGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYGEEEE 576
             NGG M +A +YII N G+ S++ YPY+   ++  C              Y  I  G+E +
Sbjct:   186 NGGLMTNAFEYIIKNNGLNSEEQYPYEMKVNDE-C-KFQEGSVAAKITSYKEIEAGDEND 243

Query:   577 MKKWVATRGPLSVGMNA--NGLFYYSGGVIDLNQRLCNPKAQNHALIIVGYGEEEKKDGT 634
             ++  +    P+SV ++A  N    Y+ GV    +  C+ +  +H ++ VG G +  +D  
Sbjct:   244 LQNALLLN-PVSVAIDASHNSFQLYTAGVY--YEPACSSEDLDHGVLAVGMGTDNGED-- 298

Query:   635 SIPYWIVKNSWGSDWG 650
                Y+IVKNSWG  WG
Sbjct:   299 ---YYIVKNSWGPSWG 311

 Score = 153 (58.9 bits), Expect = 7.4e-08, P = 7.4e-08
 Identities = 45/155 (29%), Positives = 76/155 (49%)

Query:    45 FLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAEDYQREDSGTAVFEVNKFFDLSDSDLQQ 104
             F+++MR ++K Y+  E  + R+E F  N++   ++  + S T V  +N+  DLS+ + + 
Sbjct:    34 FIDWMRSNNKAYTHKE-FMPRYEEFKKNMDYVHNWNSKGSKT-VLGLNQHADLSNEEYR- 90

Query:   105 LTGLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVK 164
                LN   T   I+  L   +        +   QF         P   DWR +  ++ VK
Sbjct:    91 ---LNYLGTRAHIK--LNG-YHKRNLGLRLNRPQFKQ-------PLNVDWREKDAVTPVK 137

Query:   165 EQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQH 199
             +QG+C  C++FS  G VE + AI+   L  LS Q+
Sbjct:   138 DQGQCGSCYSFSTTGSVEGVTAIKTGKLVSLSEQN 172


>ZFIN|ZDB-GENE-030131-572 [details] [associations]
            symbol:wu:fb37b09 "wu:fb37b09" species:7955 "Danio
            rerio" [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008233 "peptidase
            activity" evidence=IEA] [GO:0016787 "hydrolase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            ZFIN:ZDB-GENE-030131-572 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            EMBL:BX465840 IPI:IPI00866294 RefSeq:XP_001923796.1
            UniGene:Dr.25683 PRIDE:E9QBE2 Ensembl:ENSDART00000133962
            GeneID:321853 KEGG:dre:321853 NextBio:20807556 Uniprot:E9QBE2
        Length = 335

 Score = 260 (96.6 bits), Expect = 6.3e-27, Sum P(2) = 6.3e-27
 Identities = 56/153 (36%), Positives = 82/153 (53%)

Query:   297 PEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSNG 356
             P+  DWR  G ++ VK+Q +C  CW+FS+ G +E     +   L  +S Q LVDC   +G
Sbjct:   116 PQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPHG 175

Query:   357 --GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYG 414
               GCNGG MD A QY+ +N G+ S+Q+YPY A + +  C              +  IP G
Sbjct:   176 NQGCNGGLMDQAFQYVKENKGLDSEQSYPYLARD-DLPC-RYDPRFNVAKITGFVDIPKG 233

Query:   415 EEEEMKKWVATRGPLSVGMNAN--GLFYYSGGV 445
              E  +   VA  GP+SV ++A+   L +Y  G+
Sbjct:   234 NELALMNAVAAVGPVSVAIDASHQSLQFYQSGI 266

 Score = 258 (95.9 bits), Expect = 2.8e-21, P = 2.8e-21
 Identities = 60/160 (37%), Positives = 87/160 (54%)

Query:   497 KLSRLATEKLVDCDMSNG--GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXX 554
             KL  ++ + LVDC   +G  GCNGG MD A QY+ +N G+ S+Q+YPY A + +  C   
Sbjct:   158 KLISMSEQNLVDCSRPHGNQGCNGGLMDQAFQYVKENKGLDSEQSYPYLARD-DLPC-RY 215

Query:   555 XXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNAN--GLFYYSGGVIDLNQRLCN 612
                        +  IP G E  +   VA  GP+SV ++A+   L +Y  G+    +R C 
Sbjct:   216 DPRFNVAKITGFVDIPKGNELALMNAVAAVGPVSVAIDASHQSLQFYQSGIY--YERACT 273

Query:   613 PKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
              +  +HA+++VGYG +   D     YWIVKNSW   WG+K
Sbjct:   274 SQL-DHAVLVVGYGYQGA-DVAGNRYWIVKNSWSDKWGDK 311

 Score = 120 (47.3 bits), Expect = 1.5e-07, Sum P(2) = 1.5e-07
 Identities = 51/198 (25%), Positives = 84/198 (42%)

Query:   198 QHHDKVYSSVEDLLRRHENFVTNVEKAEDYQSEDS-GTAVF--GVNKFFDLSESDLQQ-L 253
             QH    +  VE  + R   +  N+ K E +  E S G   F  G+N+F D++  + +Q +
Sbjct:    34 QHGKSYHEDVE--VGRRMIWEENLRKIEQHNFEYSLGNHTFKMGMNQFGDMTNEEFRQAM 91

Query:   254 TGLNLDSTLEDIQPSLQAP--FSS-NQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISK 310
              G   D       P    P  F++  Q D   R +    ++        + + + G +  
Sbjct:    92 NGYKHDPNRTSQGPLFMEPKFFAAPQQVDWRQRGY-VTPVKDQKQCGSCWSFSSTGAL-- 148

Query:   311 VKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSNGGCNGGRMDDALQYI 370
                +G+      F   G + +M      +L + S          N GCNGG MD A QY+
Sbjct:   149 ---EGQL-----FRKTGKLISMSE---QNLVDCSRPH------GNQGCNGGLMDQAFQYV 191

Query:   371 IDNGGVVSDQAYPYKASE 388
              +N G+ S+Q+YPY A +
Sbjct:   192 KENKGLDSEQSYPYLARD 209

 Score = 111 (44.1 bits), Expect = 1.5e-06, Sum P(2) = 1.5e-06
 Identities = 27/88 (30%), Positives = 43/88 (48%)

Query:   149 PEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQHHDKVYSSVE 208
             P+  DWR  G ++ VK+Q +C  CW+FS+ G +E     +   L  +S Q+       + 
Sbjct:   116 PQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQN-------LV 168

Query:   209 DLLRRHENFVTN---VEKAEDYQSEDSG 233
             D  R H N   N   +++A  Y  E+ G
Sbjct:   169 DCSRPHGNQGCNGGLMDQAFQYVKENKG 196

 Score = 75 (31.5 bits), Expect = 6.3e-27, Sum P(2) = 6.3e-27
 Identities = 11/15 (73%), Positives = 12/15 (80%)

Query:   459 YWIVKNSWGSDWGEK 473
             YWIVKNSW   WG+K
Sbjct:   297 YWIVKNSWSDKWGDK 311


>UNIPROTKB|O46427 [details] [associations]
            symbol:CTSH "Pro-cathepsin H" species:9823 "Sus scrofa"
            [GO:0002764 "immune response-regulating signaling pathway"
            evidence=ISS] [GO:0032526 "response to retinoic acid" evidence=ISS]
            [GO:0031648 "protein destabilization" evidence=ISS] [GO:0043129
            "surfactant homeostasis" evidence=ISS] [GO:0010815 "bradykinin
            catabolic process" evidence=ISS] [GO:0010813 "neuropeptide
            catabolic process" evidence=ISS] [GO:0005829 "cytosol"
            evidence=ISS] [GO:0045766 "positive regulation of angiogenesis"
            evidence=ISS] [GO:0008284 "positive regulation of cell
            proliferation" evidence=ISS] [GO:0008234 "cysteine-type peptidase
            activity" evidence=ISS] [GO:0030335 "positive regulation of cell
            migration" evidence=ISS] [GO:0001656 "metanephros development"
            evidence=ISS] [GO:0033619 "membrane protein proteolysis"
            evidence=ISS] [GO:0043066 "negative regulation of apoptotic
            process" evidence=ISS] [GO:0016505 "apoptotic protease activator
            activity" evidence=ISS] [GO:0004252 "serine-type endopeptidase
            activity" evidence=ISS] [GO:0010952 "positive regulation of
            peptidase activity" evidence=ISS] [GO:0001913 "T cell mediated
            cytotoxicity" evidence=ISS] [GO:0031638 "zymogen activation"
            evidence=ISS] [GO:0030108 "HLA-A specific activating MHC class I
            receptor activity" evidence=ISS] [GO:0070371 "ERK1 and ERK2
            cascade" evidence=ISS] [GO:0010628 "positive regulation of gene
            expression" evidence=ISS] [GO:0070324 "thyroid hormone binding"
            evidence=ISS] [GO:0006508 "proteolysis" evidence=ISS] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=ISS] [GO:0060448
            "dichotomous subdivision of terminal units involved in lung
            branching" evidence=ISS] [GO:0097208 "alveolar lamellar body"
            evidence=ISS] [GO:0005764 "lysosome" evidence=ISS] [GO:0005615
            "extracellular space" evidence=ISS] [GO:0004177 "aminopeptidase
            activity" evidence=ISS] [GO:0004175 "endopeptidase activity"
            evidence=ISS] [GO:0010634 "positive regulation of epithelial cell
            migration" evidence=ISS] [GO:0097067 "cellular response to thyroid
            hormone stimulus" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0005829 GO:GO:0043066 GO:GO:0005615
            GO:GO:0008284 GO:GO:0070371 eggNOG:COG4870 HOGENOM:HOG000230774
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766 GO:GO:0004177
            GO:GO:0004252 GO:GO:0005764 GO:GO:0097208 GO:GO:0032526
            GO:GO:0010628 GO:GO:0070324 KO:K01366 GO:GO:0016505 GO:GO:0001656
            GO:GO:0010634 GO:GO:0004197 GO:GO:0031648 HOVERGEN:HBG011513
            GO:GO:0097067 GO:GO:0031638 GO:GO:0001913
            GeneTree:ENSGT00660000095458 MEROPS:C01.040 CTD:1512 OMA:STSCHKT
            GO:GO:0030108 GO:GO:0010815 GO:GO:0060448 GO:GO:0002764
            GO:GO:0033619 GO:GO:0010813 GO:GO:0043129 EMBL:AF001169
            RefSeq:NP_999094.1 UniGene:Ssc.3593 PDB:1NB3 PDB:1NB5 PDB:8PCH
            PDBsum:1NB3 PDBsum:1NB5 PDBsum:8PCH ProteinModelPortal:O46427
            SMR:O46427 Ensembl:ENSSSCT00000001983 GeneID:396969 KEGG:ssc:396969
            EvolutionaryTrace:O46427 ArrayExpress:O46427 Uniprot:O46427
        Length = 335

 Score = 285 (105.4 bits), Expect = 3.4e-24, P = 3.4e-24
 Identities = 71/204 (34%), Positives = 92/204 (45%)

Query:   288 NSLRHGDDLPEAFDWRAEG-VISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQ 346
             N LR     P + DWR +G  +S VK QG C  CW FS  G +E+  AI    +  L+ Q
Sbjct:   108 NYLRGTGPYPPSMDWRKKGNFVSPVKNQGSCGSCWTFSTTGALESAVAIATGKMLSLAEQ 167

Query:   347 QLVDC--DMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXX 404
             QLVDC  + +N GC GG    A +YI  N G++ +  YPYK  +    C           
Sbjct:   168 QLVDCAQNFNNHGCQGGLPSQAFEYIRYNKGIMGEDTYPYKGQDDH--C-KFQPDKAIAF 224

Query:   405 XXXYSRIPYGEEEEMKKWVATRGPLSVGMNA-NGLFYYSGGVID----------LNQRL- 452
                 + I   +EE M + VA   P+S      N    Y  G+            +N  + 
Sbjct:   225 VKDVANITMNDEEAMVEAVALYNPVSFAFEVTNDFLMYRKGIYSSTSCHKTPDKVNHAVL 284

Query:   453 ---YG--TSIPYWIVKNSWGSDWG 471
                YG    IPYWIVKNSWG  WG
Sbjct:   285 AVGYGEENGIPYWIVKNSWGPQWG 308

 Score = 238 (88.8 bits), Expect = 6.6e-27, Sum P(2) = 6.6e-27
 Identities = 56/161 (34%), Positives = 74/161 (45%)

Query:   493 VLPSKLSRLATEKLVDC--DMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERG 550
             +   K+  LA ++LVDC  + +N GC GG    A +YI  N G++ +  YPYK  +    
Sbjct:   156 IATGKMLSLAEQQLVDCAQNFNNHGCQGGLPSQAFEYIRYNKGIMGEDTYPYKGQDDH-- 213

Query:   551 CLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNA-NGLFYYSGGVIDLNQR 609
             C               + I   +EE M + VA   P+S      N    Y  G+      
Sbjct:   214 C-KFQPDKAIAFVKDVANITMNDEEAMVEAVALYNPVSFAFEVTNDFLMYRKGIYSSTSC 272

Query:   610 LCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWG 650
                P   NHA++ VGYGEE       IPYWIVKNSWG  WG
Sbjct:   273 HKTPDKVNHAVLAVGYGEEN-----GIPYWIVKNSWGPQWG 308

 Score = 118 (46.6 bits), Expect = 6.6e-27, Sum P(2) = 6.6e-27
 Identities = 24/60 (40%), Positives = 31/60 (51%)

Query:   140 NSLRHGDDLPEAFDWRAEG-VISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
             N LR     P + DWR +G  +S VK QG C  CW FS  G +E+  AI    +  L+ Q
Sbjct:   108 NYLRGTGPYPPSMDWRKKGNFVSPVKNQGSCGSCWTFSTTGALESAVAIATGKMLSLAEQ 167


>ZFIN|ZDB-GENE-030131-3539 [details] [associations]
            symbol:ctsh "cathepsin H" species:7955 "Danio
            rerio" [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0016787 "hydrolase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 ZFIN:ZDB-GENE-030131-3539
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 KO:K01366 HOVERGEN:HBG011513
            CTD:1512 OrthoDB:EOG4W9J43 MEROPS:I29.003 HSSP:P43235 EMBL:BC067615
            IPI:IPI00506892 RefSeq:NP_997853.1 UniGene:Dr.14176
            ProteinModelPortal:Q6NWF2 SMR:Q6NWF2 PRIDE:Q6NWF2 GeneID:324818
            KEGG:dre:324818 InParanoid:Q6NWF2 NextBio:20808976 Bgee:Q6NWF2
            Uniprot:Q6NWF2
        Length = 330

 Score = 280 (103.6 bits), Expect = 2.2e-26, Sum P(2) = 2.2e-26
 Identities = 70/197 (35%), Positives = 96/197 (48%)

Query:   297 PEAFDWRAEG-VISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDC--DM 353
             P+A DWR +G  I+ VK QG C  CW FS  G +E++ AI    L +L+ QQL+DC  D 
Sbjct:   112 PDAIDWRTKGHYITDVKNQGPCGSCWTFSTTGCLESVTAIATGKLLQLAEQQLIDCAGDF 171

Query:   354 SNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPY 413
              N GCNGG    A +YI+ N G++++  YPY+A   +  C               +   Y
Sbjct:   172 DNHGCNGGLPSHAFEYIMYNKGLMTEDDYPYQAKGGQ--CRFKPQLAAAFVKEVVNITKY 229

Query:   414 GEEEEMKKWVATRGPLSVGMNANGLF-YYSGGVI---------DL-NQRL----YG--TS 456
              +E  M   VA   P+S        F +Y  G+          D+ N  +    Y     
Sbjct:   230 -DEMGMVDAVARLNPVSFAYEVTSDFMHYKDGIYTSTECHNTTDMVNHAVLAVGYAEENG 288

Query:   457 IPYWIVKNSWGSDWGEK 473
              PYWIVKNSWG++WG K
Sbjct:   289 TPYWIVKNSWGTNWGIK 305

 Score = 243 (90.6 bits), Expect = 2.3e-22, Sum P(2) = 2.3e-22
 Identities = 70/207 (33%), Positives = 101/207 (48%)

Query:   452 LYGTSIPYWIVKNSWGSDWGEKVEDKVGS--SGNRTRDLE-LTGVLPSKLSRLATEKLVD 508
             LY  +I  W  K  + +D   K +   GS  + + T  LE +T +   KL +LA ++L+D
Sbjct:   110 LYPDAID-WRTKGHYITD--VKNQGPCGSCWTFSTTGCLESVTAIATGKLLQLAEQQLID 166

Query:   509 C--DMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXY 566
             C  D  N GCNGG    A +YI+ N G++++  YPY+A   +  C               
Sbjct:   167 CAGDFDNHGCNGGLPSHAFEYIMYNKGLMTEDDYPYQAKGGQ--CRFKPQLAAAFVKEVV 224

Query:   567 SRIPYGEEEEMKKWVATRGPLSVGMNANGLF-YYSGGVIDLNQRLCNPKAQNHALIIVGY 625
             +   Y +E  M   VA   P+S        F +Y  G+    +        NHA++ VGY
Sbjct:   225 NITKY-DEMGMVDAVARLNPVSFAYEVTSDFMHYKDGIYTSTECHNTTDMVNHAVLAVGY 283

Query:   626 GEEEKKDGTSIPYWIVKNSWGSDWGEK 652
              EE   +GT  PYWIVKNSWG++WG K
Sbjct:   284 AEE---NGT--PYWIVKNSWGTNWGIK 305

 Score = 122 (48.0 bits), Expect = 0.00020, P = 0.00020
 Identities = 23/51 (45%), Positives = 31/51 (60%)

Query:   149 PEAFDWRAEG-VISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
             P+A DWR +G  I+ VK QG C  CW FS  G +E++ AI    L +L+ Q
Sbjct:   112 PDAIDWRTKGHYITDVKNQGPCGSCWTFSTTGCLESVTAIATGKLLQLAEQ 162

 Score = 49 (22.3 bits), Expect = 2.2e-26, Sum P(2) = 2.2e-26
 Identities = 12/56 (21%), Positives = 31/56 (55%)

Query:   197 VQHHDKVYSSVEDLLRRHENFVTNVEKAEDYQSEDSGTAVFGVNKFFDLSESDLQQ 252
             +  ++K Y  + +  +R + F+ N +K  D  +E +     G+N+F D++ ++ ++
Sbjct:    34 MSQYNKKYE-INEFYQRLQIFLEN-KKRIDQHNEGNHKFSMGLNQFSDMTFAEFKK 87


>DICTYBASE|DDB_G0278721 [details] [associations]
            symbol:cprD "cysteine proteinase 4" species:44689
            "Dictyostelium discoideum" [GO:0005615 "extracellular space"
            evidence=IDA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0005764 "lysosome" evidence=IEA] [GO:0044351
            "macropinocytosis" evidence=RCA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 dictyBase:DDB_G0278721 GO:GO:0005615
            GenomeReviews:CM000152_GR eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            EMBL:AAFI02000024 EMBL:L36204 RefSeq:XP_641963.1
            ProteinModelPortal:P54639 SMR:P54639 MEROPS:C01.A57 PRIDE:P54639
            EnsemblProtists:DDB0214999 GeneID:8621695 KEGG:ddi:DDB_G0278721
            OMA:NAFADIT ProtClustDB:CLSZ2846820 Uniprot:P54639
        Length = 442

 Score = 248 (92.4 bits), Expect = 2.5e-26, Sum P(3) = 2.5e-26
 Identities = 66/231 (28%), Positives = 101/231 (43%)

Query:   301 DWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNS---LTELSVQQLVDCDMS--N 355
             DWRA+G ++ +K QG+C  CW+FS  G  E  H I   +   L  LS Q L+DC  S  N
Sbjct:   116 DWRAQGAVTPIKNQGQCGGCWSFSTTGSTEGAHFIASGTKKDLVSLSEQNLIDCSKSYGN 175

Query:   356 GGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYGE 415
              GC GG M  A +YII+N G+ ++ +YPY A + +  C              Y  +  G 
Sbjct:   176 NGCEGGLMTLAFEYIINNKGIDTESSYPYTAEDGKE-C-KFKTSNIGAQIVSYQNVTSGS 233

Query:   416 EEEMKKWVATRGPLSVGMNA-NGLFY-YSGGVIDLNQRLYGTSIPYWIVKNSWGSDWGEK 473
             E  ++   +   P+SV ++A N  F  Y  G+         T + + ++   +GS  G  
Sbjct:   234 EASLQS-ASNNAPVSVAIDASNESFQLYESGIY-YEPACSPTQLDHGVLVVGYGS--GSS 289

Query:   474 VEDKVGSSGNRTRDLELTGVLPSKLSRLATEKLVDCDMSNGGCNGGRMDDA 524
                  GSS  ++     TG   S  S            ++   + G+   A
Sbjct:   290 SSS--GSSSGKSSSSSSTGGKTSSSSSSGKASSSSSGKASSSSSSGKTSSA 338

 Score = 174 (66.3 bits), Expect = 1.7e-17, Sum P(3) = 1.7e-17
 Identities = 43/143 (30%), Positives = 69/143 (48%)

Query:   498 LSRLATEKLVDCDMS--NGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXX 555
             L  L+ + L+DC  S  N GC GG M  A +YII+N G+ ++ +YPY A + +  C    
Sbjct:   158 LVSLSEQNLIDCSKSYGNNGCEGGLMTLAFEYIINNKGIDTESSYPYTAEDGKE-C-KFK 215

Query:   556 XXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNA-NGLFY-YSGGVIDLNQRLCNP 613
                       Y  +  G E  ++   +   P+SV ++A N  F  Y  G+    +  C+P
Sbjct:   216 TSNIGAQIVSYQNVTSGSEASLQS-ASNNAPVSVAIDASNESFQLYESGIY--YEPACSP 272

Query:   614 KAQNHALIIVGYGE-EEKKDGTS 635
                +H +++VGYG       G+S
Sbjct:   273 TQLDHGVLVVGYGSGSSSSSGSS 295

 Score = 120 (47.3 bits), Expect = 2.7e-11, Sum P(3) = 2.7e-11
 Identities = 46/145 (31%), Positives = 64/145 (44%)

Query:   153 DWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGN---NLTELSVQHH---DKVYSS 206
             DWRA+G ++ +K QG+C  CW+FS  G  E  H I      +L  LS Q+     K Y +
Sbjct:   116 DWRAQGAVTPIKNQGQCGGCWSFSTTGSTEGAHFIASGTKKDLVSLSEQNLIDCSKSYGN 175

Query:   207 --VED-LLRRHENFVTNVEKAEDYQSEDSGTAVFGVNKFFDLSESDLQQLTGLNLDSTLE 263
                E  L+     ++ N  K  D +S    TA  G    F  S    Q ++  N+ S  E
Sbjct:   176 NGCEGGLMTLAFEYIIN-NKGIDTESSYPYTAEDGKECKFKTSNIGAQIVSYQNVTSGSE 234

Query:   264 -DIQP-SLQAPFSSNQTDTEMRAFQ 286
               +Q  S  AP S    D    +FQ
Sbjct:   235 ASLQSASNNAPVSV-AIDASNESFQ 258

 Score = 80 (33.2 bits), Expect = 2.5e-26, Sum P(3) = 2.5e-26
 Identities = 15/27 (55%), Positives = 16/27 (59%)

Query:   624 GYGEEEKKDGTSIPYWIVKNSWGSDWG 650
             G G  E   G    YWIVKNSWG+ WG
Sbjct:   390 GSGAVEASSGN---YWIVKNSWGTSWG 413

 Score = 77 (32.2 bits), Expect = 2.5e-26, Sum P(3) = 2.5e-26
 Identities = 20/79 (25%), Positives = 36/79 (45%)

Query:   171 CCWAFSAVGVVEAMHAIQGNNLTELSVQHHDKVYSSVEDLLRRHENFVTNVEKAEDYQSE 230
             C    S     +    +Q  N     +Q H + YSS E+   R++ F +N++    + S+
Sbjct:     8 CLLLVSYASAKQQFSELQYRNAFTNWMQAHQRTYSS-EEFNARYQIFKSNMDYVHQWNSK 66

Query:   231 DSGTAVFGVNKFFDLSESD 249
               G  V G+N F D++  +
Sbjct:    67 -GGETVLGLNVFADITNQE 84

 Score = 76 (31.8 bits), Expect = 3.1e-26, Sum P(3) = 3.1e-26
 Identities = 22/88 (25%), Positives = 43/88 (48%)

Query:    14 LGYLHTFMIKVALLESNIFQTRGYLNSPVTRFLNFMRDHDKVYSSVEDLLRRHENFVTNV 73
             L +L   ++  A  +   F    Y N+    F N+M+ H + YSS E+   R++ F +N+
Sbjct:     4 LSFLCLLLVSYASAKQQ-FSELQYRNA----FTNWMQAHQRTYSS-EEFNARYQIFKSNM 57

Query:    74 EKAEDYQREDSGTAVFEVNKFFDLSDSD 101
             +    +  +  G  V  +N F D+++ +
Sbjct:    58 DYVHQWNSK-GGETVLGLNVFADITNQE 84

 Score = 76 (31.8 bits), Expect = 6.9e-11, Sum P(3) = 6.9e-11
 Identities = 11/13 (84%), Positives = 12/13 (92%)

Query:   459 YWIVKNSWGSDWG 471
             YWIVKNSWG+ WG
Sbjct:   401 YWIVKNSWGTSWG 413


>UNIPROTKB|F1RWA9 [details] [associations]
            symbol:CTSO "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00639 GeneTree:ENSGT00560000076599 EMBL:CU855637
            Ensembl:ENSSSCT00000009707 OMA:WAFSIVG Uniprot:F1RWA9
        Length = 194

 Score = 280 (103.6 bits), Expect = 1.2e-23, P = 1.2e-23
 Identities = 69/171 (40%), Positives = 90/171 (52%)

Query:   316 KCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSNGGCNGGRMDDALQYIIDNG- 374
             +C  CWAFS V  VE+ +AI+G  L  LSVQQ++DC  +N GCNGG   +AL ++     
Sbjct:     1 QCGGCWAFSVVSAVESAYAIKGQPLEVLSVQQVIDCSYNNYGCNGGSTLNALYWLNKTQV 60

Query:   375 GVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPY-GEEEEMKKWVATRGPLSVGM 433
              VVSD  YP+KA      C              YS   + G+E+EM K + T GPL V +
Sbjct:    61 KVVSDSEYPFKAQNGL--CHYFSCSHSGVSIKDYSAYDFSGQEDEMAKTLLTLGPLIVIV 118

Query:   434 NANGLFYYSGGVI-------DLNQRLYGT------SIPYWIVKNSWGSDWG 471
             +A     Y GG+I       + N  +  T      S PYWIV+NSWGS WG
Sbjct:   119 DAVSWQDYLGGIIQHHCSSGEANHAVLVTGFDKTGSTPYWIVRNSWGSAWG 169

 Score = 240 (89.5 bits), Expect = 5.2e-26, Sum P(2) = 5.2e-26
 Identities = 59/155 (38%), Positives = 82/155 (52%)

Query:   498 LSRLATEKLVDCDMSNGGCNGGRMDDALQYIIDNG-GVVSDQAYPYKASESERGCLXXXX 556
             L  L+ ++++DC  +N GCNGG   +AL ++      VVSD  YP+KA      C     
Sbjct:    25 LEVLSVQQVIDCSYNNYGCNGGSTLNALYWLNKTQVKVVSDSEYPFKAQNGL--CHYFSC 82

Query:   557 XXXXXXXXXYSRIPY-GEEEEMKKWVATRGPLSVGMNANGLFYYSGGVIDLNQRLCNPKA 615
                      YS   + G+E+EM K + T GPL V ++A     Y GG+I   Q  C+   
Sbjct:    83 SHSGVSIKDYSAYDFSGQEDEMAKTLLTLGPLIVIVDAVSWQDYLGGII---QHHCSSGE 139

Query:   616 QNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWG 650
              NHA+++ G+     K G S PYWIV+NSWGS WG
Sbjct:   140 ANHAVLVTGFD----KTG-STPYWIVRNSWGSAWG 169

 Score = 87 (35.7 bits), Expect = 5.2e-26, Sum P(2) = 5.2e-26
 Identities = 17/31 (54%), Positives = 21/31 (67%)

Query:   168 KCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
             +C  CWAFS V  VE+ +AI+G  L  LSVQ
Sbjct:     1 QCGGCWAFSVVSAVESAYAIKGQPLEVLSVQ 31


>WB|WBGene00011102 [details] [associations]
            symbol:R07E3.1 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0016021 "integral to
            membrane" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076599
            EMBL:Z49207 HSSP:P53634 PIR:T24030 RefSeq:NP_001041280.1
            ProteinModelPortal:Q21810 SMR:Q21810 STRING:Q21810 MEROPS:C01.A43
            PaxDb:Q21810 EnsemblMetazoa:R07E3.1a GeneID:181242
            KEGG:cel:CELE_R07E3.1 UCSC:R07E3.1a CTD:181242 WormBase:R07E3.1a
            HOGENOM:HOG000021028 InParanoid:Q21810 OMA:ACKNEVI NextBio:913066
            ArrayExpress:Q21810 Uniprot:Q21810
        Length = 402

 Score = 298 (110.0 bits), Expect = 1.3e-25, P = 1.3e-25
 Identities = 88/291 (30%), Positives = 137/291 (47%)

Query:   201 DKVYSSVEDLLRRHENFVTNVEKAEDYQSEDS-GTAVFGVNKFFDLSESDLQQLTGLNLD 259
             DK Y++ ++ L+R   +    E   ++  ++  G+A +G N   D ++ + ++   L   
Sbjct:    98 DKSYATSQESLKRLNAYYNTDENIANWNIQNEHGSAEYGHNDMSDWTDEEFEKT--LLPK 155

Query:   260 STLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVKEQGKCAC 319
             S  + +    +A F     ++ + A +  S       P+ FDWR + VI+ VK QG+C  
Sbjct:   156 SFYKRLHK--EAEFIEPIPES-LTAKKGES---SSPFPDFFDWRDKNVITPVKAQGQCGS 209

Query:   320 CWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSD 379
             CWAF++   VEA  AI       LS Q L+DCD+ +  C+GG  D A +YI  NG + + 
Sbjct:   210 CWAFASTATVEAAWAIAHGEKRNLSEQTLLDCDLVDNACDGGDEDKAFRYIHRNG-LANA 268

Query:   380 QAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNA-NGL 438
                PY A   + GC              Y    + +E+ +  W+   GP+++GM     +
Sbjct:   269 VDLPYVAHR-QNGCAVNDHWNTTRIKAAY--FLHHDEDSIINWLVNFGPVNIGMAVIQPM 325

Query:   439 FYYSGGV------------IDLNQRL---YGTSIP---YWIVKNSWGSDWG 471
               Y GGV            I L+  L   YGTS     YWIVKNSWG+ WG
Sbjct:   326 RAYKGGVFTPSEYACKNEVIGLHALLITGYGTSKTGEKYWIVKNSWGNTWG 376

 Score = 216 (81.1 bits), Expect = 1.0e-14, P = 1.0e-14
 Identities = 51/152 (33%), Positives = 75/152 (49%)

Query:   501 LATEKLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXX 560
             L+ + L+DCD+ +  C+GG  D A +YI  NG + +    PY A   + GC         
Sbjct:   233 LSEQTLLDCDLVDNACDGGDEDKAFRYIHRNG-LANAVDLPYVAHR-QNGCAVNDHWNTT 290

Query:   561 XXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNA-NGLFYYSGGVIDLNQRLC-NPKAQNH 618
                  Y    + +E+ +  W+   GP+++GM     +  Y GGV   ++  C N     H
Sbjct:   291 RIKAAY--FLHHDEDSIINWLVNFGPVNIGMAVIQPMRAYKGGVFTPSEYACKNEVIGLH 348

Query:   619 ALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWG 650
             AL+I GYG  +    T   YWIVKNSWG+ WG
Sbjct:   349 ALLITGYGTSK----TGEKYWIVKNSWGNTWG 376

 Score = 143 (55.4 bits), Expect = 1.4e-06, P = 1.4e-06
 Identities = 55/212 (25%), Positives = 95/212 (44%)

Query:    35 RGYLNSPVTRFLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAEDYQ-REDSGTAVFEVNK 93
             RG  N     ++ +    DK Y++ ++ L+R   +    E   ++  + + G+A +  N 
Sbjct:    81 RGIQNI-AKEYIAYTEKFDKSYATSQESLKRLNAYYNTDENIANWNIQNEHGSAEYGHND 139

Query:    94 FFDLSDSDLQQLTGLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFD 153
               D +D + ++   L   S  + +    +A F     ++ + A +  S       P+ FD
Sbjct:   140 MSDWTDEEFEKT--LLPKSFYKRLHK--EAEFIEPIPES-LTAKKGES---SSPFPDFFD 191

Query:   154 WRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAI---QGNNLTELSVQHHDKVYSSV--- 207
             WR + VI+ VK QG+C  CWAF++   VEA  AI   +  NL+E ++   D V ++    
Sbjct:   192 WRDKNVITPVKAQGQCGSCWAFASTATVEAAWAIAHGEKRNLSEQTLLDCDLVDNACDGG 251

Query:   208 -EDLLRR--HENFVTNVEKAEDYQSEDSGTAV 236
              ED   R  H N + N           +G AV
Sbjct:   252 DEDKAFRYIHRNGLANAVDLPYVAHRQNGCAV 283


>UNIPROTKB|G3V9F8 [details] [associations]
            symbol:Ctsm "RCG24133" species:10116 "Rattus norvegicus"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015645 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00639 GeneTree:ENSGT00560000076577 EMBL:CH474032
            PANTHER:PTHR12411:SF58 Ensembl:ENSRNOT00000045830 RGD:631420
            Uniprot:G3V9F8
        Length = 333

 Score = 247 (92.0 bits), Expect = 1.8e-24, Sum P(2) = 1.8e-24
 Identities = 58/158 (36%), Positives = 79/158 (50%)

Query:   291 RHGDDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVD 350
             R   ++P   +WR  G ++ V+ QG+C  CWAFS  G +E     +   L  LSVQ LVD
Sbjct:   109 RQAVNVPNFINWRKRGYVTPVRRQGRCNVCWAFSVAGAIEGQMFQKTGQLIPLSVQNLVD 168

Query:   351 CDMSNG--GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXY 408
             C    G  GC  G    ALQY+ +NGG+ S+  YPY+  E E  C              +
Sbjct:   169 CSRPQGNLGCYLGNTYLALQYVKENGGLESEATYPYE--EKEGSC-RYHPDNSTASITDF 225

Query:   409 SRIPYGEEEEMKKWVATRGPLSVGMNANG---LFYYSG 443
               +P  E+  M   VAT GP+SV ++A     LFY +G
Sbjct:   226 EFVPKNEDALMNA-VATLGPISVAIDARHESFLFYRNG 262

 Score = 227 (85.0 bits), Expect = 4.4e-25, Sum P(2) = 4.4e-25
 Identities = 56/161 (34%), Positives = 82/161 (50%)

Query:   497 KLSRLATEKLVDCDMSNG--GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXX 554
             +L  L+ + LVDC    G  GC  G    ALQY+ +NGG+ S+  YPY+  E E  C   
Sbjct:   157 QLIPLSVQNLVDCSRPQGNLGCYLGNTYLALQYVKENGGLESEATYPYE--EKEGSC-RY 213

Query:   555 XXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNAN--GLFYYSGGVIDLNQRLCN 612
                        +  +P  E+  M   VAT GP+SV ++A      +Y  G+   ++  C+
Sbjct:   214 HPDNSTASITDFEFVPKNEDALMNA-VATLGPISVAIDARHESFLFYRNGIY--HEPNCS 270

Query:   613 PKAQNHALIIVGYG-EEEKKDGTSIPYWIVKNSWGSDWGEK 652
                  HA+++VGYG   E+ DG    YWI+KNS G+ WG +
Sbjct:   271 SSVVTHAMLLVGYGFVGEESDGRK--YWILKNSMGNKWGNR 309

 Score = 118 (46.6 bits), Expect = 4.4e-25, Sum P(2) = 4.4e-25
 Identities = 21/57 (36%), Positives = 31/57 (54%)

Query:   143 RHGDDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQH 199
             R   ++P   +WR  G ++ V+ QG+C  CWAFS  G +E     +   L  LSVQ+
Sbjct:   109 RQAVNVPNFINWRKRGYVTPVRRQGRCNVCWAFSVAGAIEGQMFQKTGQLIPLSVQN 165

 Score = 65 (27.9 bits), Expect = 1.8e-24, Sum P(2) = 1.8e-24
 Identities = 12/26 (46%), Positives = 16/26 (61%)

Query:   459 YWIVKNSWGSDWGEKVEDKVGSS-GN 483
             YWI+KNS G+ WG +   K+    GN
Sbjct:   295 YWILKNSMGNKWGNRGYMKIAKDQGN 320


>UNIPROTKB|P07711 [details] [associations]
            symbol:CTSL1 "Cathepsin L1" species:9606 "Homo sapiens"
            [GO:0005576 "extracellular region" evidence=NAS] [GO:0005764
            "lysosome" evidence=IDA;NAS] [GO:0006508 "proteolysis"
            evidence=IDA] [GO:0002250 "adaptive immune response" evidence=IEP]
            [GO:0019882 "antigen processing and presentation" evidence=TAS]
            [GO:0097067 "cellular response to thyroid hormone stimulus"
            evidence=IEP] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IDA] [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=IDA] [GO:0019886 "antigen processing and presentation of
            exogenous peptide antigen via MHC class II" evidence=TAS]
            [GO:0036021 "endolysosome lumen" evidence=TAS] [GO:0043202
            "lysosomal lumen" evidence=TAS] [GO:0045087 "innate immune
            response" evidence=TAS] [GO:0005515 "protein binding" evidence=IPI]
            [GO:0042393 "histone binding" evidence=IDA] [GO:0005634 "nucleus"
            evidence=TAS] [GO:0071888 "macrophage apoptotic process"
            evidence=NAS] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            Reactome:REACT_118779 EMBL:X12451 GO:GO:0005634 Reactome:REACT_6900
            GO:GO:0005576 GO:GO:0019886 eggNOG:COG4870 HOGENOM:HOG000230774
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045087
            GO:GO:0042393 GO:GO:0004197 GO:GO:0002250 HOVERGEN:HBG011513
            GO:GO:0097067 GO:GO:0036021 KO:K01365 OrthoDB:EOG48PMKF EMBL:M20496
            EMBL:CR457053 EMBL:BX537395 EMBL:AL160279 EMBL:BC012612 EMBL:X05256
            IPI:IPI00012887 PIR:S01002 RefSeq:NP_001244900.1
            RefSeq:NP_001244901.1 RefSeq:NP_001903.1 RefSeq:NP_666023.1
            UniGene:Hs.731507 UniGene:Hs.731952 PDB:1CJL PDB:1CS8 PDB:1ICF
            PDB:1MHW PDB:2NQD PDB:2VHS PDB:2XU1 PDB:2XU3 PDB:2XU4 PDB:2XU5
            PDB:2YJ2 PDB:2YJ8 PDB:2YJ9 PDB:2YJB PDB:2YJC PDB:3BC3 PDB:3H89
            PDB:3H8B PDB:3H8C PDB:3HHA PDB:3HWN PDB:3IV2 PDB:3K24 PDB:3KSE
            PDB:3OF8 PDB:3OF9 PDBsum:1CJL PDBsum:1CS8 PDBsum:1ICF PDBsum:1MHW
            PDBsum:2NQD PDBsum:2VHS PDBsum:2XU1 PDBsum:2XU3 PDBsum:2XU4
            PDBsum:2XU5 PDBsum:2YJ2 PDBsum:2YJ8 PDBsum:2YJ9 PDBsum:2YJB
            PDBsum:2YJC PDBsum:3BC3 PDBsum:3H89 PDBsum:3H8B PDBsum:3H8C
            PDBsum:3HHA PDBsum:3HWN PDBsum:3IV2 PDBsum:3K24 PDBsum:3KSE
            PDBsum:3OF8 PDBsum:3OF9 ProteinModelPortal:P07711 SMR:P07711
            IntAct:P07711 STRING:P07711 MEROPS:I29.001 PhosphoSite:P07711
            DMDM:115741 PaxDb:P07711 PeptideAtlas:P07711 PRIDE:P07711
            DNASU:1514 Ensembl:ENST00000340342 Ensembl:ENST00000343150
            GeneID:1514 KEGG:hsa:1514 UCSC:uc004aph.3 CTD:1514
            GeneCards:GC09P090341 H-InvDB:HIX0058839 H-InvDB:HIX0170314
            HGNC:HGNC:2537 HPA:CAB000459 MIM:116880 neXtProt:NX_P07711
            PharmGKB:PA162382890 InParanoid:P07711 OMA:REPLFAQ PhylomeDB:P07711
            BRENDA:3.4.22.15 BindingDB:P07711 ChEMBL:CHEMBL3837 ChiTaRS:CTSL1
            DrugBank:DB00040 EvolutionaryTrace:P07711 GenomeRNAi:1514
            NextBio:6271 PMAP-CutDB:P07711 ArrayExpress:P07711 Bgee:P07711
            CleanEx:HS_CTSL1 Genevestigator:P07711 GermOnline:ENSG00000135047
            GO:GO:0071888 Uniprot:P07711
        Length = 333

 Score = 288 (106.4 bits), Expect = 1.6e-24, P = 1.6e-24
 Identities = 64/154 (41%), Positives = 86/154 (55%)

Query:   295 DLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCD-- 352
             + P + DWR +G ++ VK QG+C  CWAFSA G +E     +   L  LS Q LVDC   
Sbjct:   113 EAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGP 172

Query:   353 MSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIP 412
               N GCNGG MD A QY+ DNGG+ S+++YPY+A+E    C              +  IP
Sbjct:   173 QGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEES--C-KYNPKYSVANDTGFVDIP 229

Query:   413 YGEEEEMKKWVATRGPLSVGMNANG---LFYYSG 443
               +E+ + K VAT GP+SV ++A     LFY  G
Sbjct:   230 K-QEKALMKAVATVGPISVAIDAGHESFLFYKEG 262

 Score = 273 (101.2 bits), Expect = 6.7e-23, P = 6.7e-23
 Identities = 74/218 (33%), Positives = 110/218 (50%)

Query:   438 LFYYSGGVIDLNQRLYGTSIPYWIVKNSWGSDWGEKVEDKVGSSGNRTRDLELTGVLPSK 497
             LFY +   +D  ++ Y T +     +   GS W       +   G   R    TG    +
Sbjct:   110 LFYEAPRSVDWREKGYVTPVKN---QGQCGSCWAFSATGAL--EGQMFRK---TG----R 157

Query:   498 LSRLATEKLVDCD--MSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXX 555
             L  L+ + LVDC     N GCNGG MD A QY+ DNGG+ S+++YPY+A+E    C    
Sbjct:   158 LISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEES--C-KYN 214

Query:   556 XXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANG---LFYYSGGVIDLNQRLCN 612
                       +  IP  +E+ + K VAT GP+SV ++A     LFY  G   + +   C+
Sbjct:   215 PKYSVANDTGFVDIPK-QEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPD---CS 270

Query:   613 PKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWG 650
              +  +H +++VGYG E  +   +  YW+VKNSWG +WG
Sbjct:   271 SEDMDHGVLVVGYGFESTESDNN-KYWLVKNSWGEEWG 307

 Score = 125 (49.1 bits), Expect = 0.00016, Sum P(2) = 0.00016
 Identities = 37/121 (30%), Positives = 60/121 (49%)

Query:    87 AVFEVN-KFFDLSDSDLQQ-----LTGLNL--DSTLEDIQPSLQAPFSSNQTDTEMRAFQ 138
             AV+E N K  +L + + ++        +N   D T E+ +  +   F  N+   + + FQ
Sbjct:    50 AVWEKNMKMIELHNQEYREGKHSFTMAMNAFGDMTSEEFRQVMNG-FQ-NRKPRKGKVFQ 107

Query:   139 FNSLRHGDDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
                L +  + P + DWR +G ++ VK QG+C  CWAFSA G +E     +   L  LS Q
Sbjct:   108 -EPLFY--EAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQ 164

Query:   199 H 199
             +
Sbjct:   165 N 165

 Score = 40 (19.1 bits), Expect = 0.00016, Sum P(2) = 0.00016
 Identities = 14/39 (35%), Positives = 18/39 (46%)

Query:   214 HENFVTNVEKA---EDYQSEDSGTAVFGVNKFFDLSESD 249
             HE+F+   E      D  SED    V  V   F+ +ESD
Sbjct:   253 HESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESD 291


>FB|FBgn0032228 [details] [associations]
            symbol:CG5367 species:7227 "Drosophila melanogaster"
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0006508 "proteolysis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            EMBL:AE014134 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00639 GeneTree:ENSGT00560000076599 HSSP:P80067
            RefSeq:NP_609387.1 UniGene:Dm.26782 ProteinModelPortal:Q9VKY4
            SMR:Q9VKY4 MEROPS:C01.A30 EnsemblMetazoa:FBtr0080055 GeneID:34401
            KEGG:dme:Dmel_CG5367 UCSC:CG5367-RA FlyBase:FBgn0032228
            InParanoid:Q9VKY4 OMA:QIVDCSV OrthoDB:EOG4THT8X PhylomeDB:Q9VKY4
            GenomeRNAi:34401 NextBio:788324 ArrayExpress:Q9VKY4 Bgee:Q9VKY4
            Uniprot:Q9VKY4
        Length = 338

 Score = 264 (98.0 bits), Expect = 6.3e-22, P = 6.3e-22
 Identities = 84/293 (28%), Positives = 139/293 (47%)

Query:   199 HHDKVYSSVEDLLRRHENFVTNVEKAEDY-QSEDSGTAVFGV--NKFFDLS-ESDLQQLT 254
             ++++ Y    D +R ++ F  N +  E++ Q+   G   F +  N F D+S +  L+   
Sbjct:    42 NNNRKYLRTYDEMRSYKAFEENFKVIEEHNQNYKEGQTSFRLKPNIFADMSTDGYLKGFL 101

Query:   255 GLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVKEQ 314
              L L S +ED         + N  +        N       +PE+ DWR++G I+    Q
Sbjct:   102 RL-LKSNIEDS--------ADNMAEIVGSPLMAN-------VPESLDWRSKGFITPPYNQ 145

Query:   315 GKCACCWAFS-AVGVVEAMHAIQGNSLTELSVQQLVDCDMSNG--GCNGGRMDDALQYII 371
               C  C+AFS A  ++  +    G  L+ LS QQ+VDC +S+G  GC GG + + L Y+ 
Sbjct:   146 LSCGSCYAFSIAESIMGQVFKRTGKILS-LSKQQIVDCSVSHGNQGCVGGSLRNTLSYLQ 204

Query:   372 DNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSV 431
               GG++ DQ YPY A    +G               ++ +P  +E+ ++  V   GP+++
Sbjct:   205 STGGIMRDQDYPYVA---RKGKCQFVPDLSVVNVTSWAILPVRDEQAIQAAVTHIGPVAI 261

Query:   432 GMNANGLFY--YSGGVID--------LNQRLY--GTSIPYWIVKNSWGSDWGE 472
              +NA+   +  YS G+ D        +N  +   G    YWI+KN WG +WGE
Sbjct:   262 SINASPKTFQLYSDGIYDDPLCSSASVNHAMVVIGFGKDYWILKNWWGQNWGE 314

 Score = 237 (88.5 bits), Expect = 2.3e-24, Sum P(2) = 2.3e-24
 Identities = 52/159 (32%), Positives = 88/159 (55%)

Query:   497 KLSRLATEKLVDCDMSNG--GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXX 554
             K+  L+ +++VDC +S+G  GC GG + + L Y+   GG++ DQ YPY A    +G    
Sbjct:   170 KILSLSKQQIVDCSVSHGNQGCVGGSLRNTLSYLQSTGGIMRDQDYPYVA---RKGKCQF 226

Query:   555 XXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLFY--YSGGVIDLNQRLCN 612
                        ++ +P  +E+ ++  V   GP+++ +NA+   +  YS G+ D    LC+
Sbjct:   227 VPDLSVVNVTSWAILPVRDEQAIQAAVTHIGPVAISINASPKTFQLYSDGIYD--DPLCS 284

Query:   613 PKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGE 651
               + NHA++++G+G    KD     YWI+KN WG +WGE
Sbjct:   285 SASVNHAMVVIGFG----KD-----YWILKNWWGQNWGE 314

 Score = 97 (39.2 bits), Expect = 2.3e-24, Sum P(2) = 2.3e-24
 Identities = 39/155 (25%), Positives = 67/155 (43%)

Query:    26 LLESNIFQTRGYLNSPVTRFLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAEDY-QREDS 84
             ++ SN+ +      +  + F  F  ++++ Y    D +R ++ F  N +  E++ Q    
Sbjct:    17 IVTSNLSEGNSSSANCKSEFEKFKNNNNRKYLRTYDEMRSYKAFEENFKVIEEHNQNYKE 76

Query:    85 GTAVFEV--NKFFDLS-DSDLQQLTGLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNS 141
             G   F +  N F D+S D  L+    L L S +ED         + N  +        N 
Sbjct:    77 GQTSFRLKPNIFADMSTDGYLKGFLRL-LKSNIEDS--------ADNMAEIVGSPLMAN- 126

Query:   142 LRHGDDLPEAFDWRAEGVISKVKEQGKCACCWAFS 176
                   +PE+ DWR++G I+    Q  C  C+AFS
Sbjct:   127 ------VPESLDWRSKGFITPPYNQLSCGSCYAFS 155


>RGD|1309226 [details] [associations]
            symbol:Cts7 "cathepsin 7" species:10116 "Rattus norvegicus"
            [GO:0005615 "extracellular space" evidence=IEA] [GO:0005634
            "nucleus" evidence=IEA] [GO:0005764 "lysosome" evidence=IEA]
            [GO:0005768 "endosome" evidence=IEA] [GO:0005794 "Golgi apparatus"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0007067
            "mitosis" evidence=IEA] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0048471 "perinuclear region of
            cytoplasm" evidence=IEA] [GO:0051301 "cell division" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 RGD:1309226 GO:GO:0005634
            GO:GO:0005794 GO:GO:0048471 GO:GO:0005615 GO:GO:0051301
            GO:GO:0007067 GO:GO:0005768 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0005764 MEROPS:C01.016 CTD:56092
            GeneTree:ENSGT00560000076577 OrthoDB:EOG44QT2S EMBL:CH474032
            IPI:IPI00870531 RefSeq:NP_001099569.1 UniGene:Rn.218615
            Ensembl:ENSRNOT00000043686 GeneID:290970 KEGG:rno:290970
            UCSC:RGD:1309226 OMA:VESFNAN Uniprot:D3ZZ07
        Length = 331

 Score = 231 (86.4 bits), Expect = 6.1e-22, Sum P(3) = 6.1e-22
 Identities = 51/154 (33%), Positives = 80/154 (51%)

Query:   296 LPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMS- 354
             +P+  DWR  G ++ V+ QG C  CWAFS    +E+    +   L  LSVQ L+DC ++ 
Sbjct:   112 IPKTLDWRDTGCVAPVRSQGGCGACWAFSVAASIESQLFKKTGKLIPLSVQNLIDCTVTY 171

Query:   355 -NGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPY 413
              N  C+GG+   A QY+ +NGG+ ++  YPY+A    R C              +  +P 
Sbjct:   172 GNNDCSGGKPYTAFQYVKNNGGLEAEATYPYEAKL--RHCRYRPERSVVKIARFFV-VPR 228

Query:   414 GEEEEMKKWVATRGPLSVGMNANGLFY--YSGGV 445
              EE  M+  V T GP++V ++ +   +  Y GG+
Sbjct:   229 NEEALMQALV-TYGPIAVAIDGSHASFKRYRGGI 261

 Score = 215 (80.7 bits), Expect = 3.3e-24, Sum P(3) = 3.3e-24
 Identities = 51/160 (31%), Positives = 84/160 (52%)

Query:   497 KLSRLATEKLVDCDMS--NGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXX 554
             KL  L+ + L+DC ++  N  C+GG+   A QY+ +NGG+ ++  YPY+A    R C   
Sbjct:   155 KLIPLSVQNLIDCTVTYGNNDCSGGKPYTAFQYVKNNGGLEAEATYPYEAKL--RHCRYR 212

Query:   555 XXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLFY--YSGGVIDLNQRLCN 612
                        +  +P  EE  M+  V T GP++V ++ +   +  Y GG+   ++  C 
Sbjct:   213 PERSVVKIARFFV-VPRNEEALMQALV-TYGPIAVAIDGSHASFKRYRGGIY--HEPKCR 268

Query:   613 PKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
                 +H L++VGYG E   +  +  YW++KNS G  WGE+
Sbjct:   269 RDTLDHGLLLVGYGYEGH-ESENRKYWLLKNSHGEQWGER 307

 Score = 113 (44.8 bits), Expect = 3.3e-24, Sum P(3) = 3.3e-24
 Identities = 20/52 (38%), Positives = 29/52 (55%)

Query:   148 LPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQH 199
             +P+  DWR  G ++ V+ QG C  CWAFS    +E+    +   L  LSVQ+
Sbjct:   112 IPKTLDWRDTGCVAPVRSQGGCGACWAFSVAASIESQLFKKTGKLIPLSVQN 163

 Score = 64 (27.6 bits), Expect = 6.1e-22, Sum P(3) = 6.1e-22
 Identities = 9/15 (60%), Positives = 12/15 (80%)

Query:   459 YWIVKNSWGSDWGEK 473
             YW++KNS G  WGE+
Sbjct:   293 YWLLKNSHGEQWGER 307

 Score = 46 (21.3 bits), Expect = 3.3e-24, Sum P(3) = 3.3e-24
 Identities = 14/60 (23%), Positives = 31/60 (51%)

Query:    50 RDHDKVYSSVEDLLRRHENFVTNVEKAEDYQREDS---GTAVFEVNKFFDLSDSDLQQLT 106
             R++ K YS  E+  RR   +  NV+  + +  ++         E+N+F D++  +++ +T
Sbjct:    34 RNNAKTYSPEEEKQRRAV-WEENVKMIKWHTMQNGLWMNNFTIEMNEFGDMTGEEMRMMT 92


>GENEDB_PFALCIPARUM|PF14_0553 [details] [associations]
            symbol:PF14_0553 "cysteine proteinase
            falcipain-1" species:5833 "Plasmodium falciparum" [GO:0042540
            "hemoglobin catabolic process" evidence=TAS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 EMBL:AE014187 KO:K01376 HSSP:P07688 GO:GO:0042540
            RefSeq:XP_001348727.1 ProteinModelPortal:Q8I6V0 PRIDE:Q8I6V0
            EnsemblProtists:PF14_0553:mRNA GeneID:812135 KEGG:pfa:PF14_0553
            EuPathDB:PlasmoDB:PF3D7_1458000 HOGENOM:HOG000065906
            ProtClustDB:CLSZ2457715 ChEMBL:CHEMBL1250371 Uniprot:Q8I6V0
        Length = 569

 Score = 274 (101.5 bits), Expect = 1.0e-23, Sum P(2) = 1.0e-23
 Identities = 75/257 (29%), Positives = 130/257 (50%)

Query:   197 VQHHDKVYSSVEDLLRRHENFVTNVEKAEDYQSEDSGTAVFG--VNKFFDLSESDLQQLT 254
             ++ H+KVY ++++ +R+ E F  N    +++   +   A++   VN+F D SE +L++  
Sbjct:   229 MKEHNKVYKNIDEQMRKFEIFKINYISIKNHNKLNKN-AMYKKKVNQFSDYSEEELKEYF 287

Query:   255 GLNLDSTLEDIQPSLQAPFSSNQTDTEMRA-FQFNSLRHGDDL----PEAFDWRAEGVIS 309
                L      I+     PF ++  D  + + F  N  R+  D+    PE  D+R +G++ 
Sbjct:   288 KTLLHVPNHMIE-KYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILDYREKGIVH 346

Query:   310 KVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSNGGCNGGRMDDALQY 369
             + K+QG C  CWAF++VG +E++ A +  ++   S Q++VDC   N GC+GG    +  Y
Sbjct:   347 EPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKDNFGCDGGHPFYSFLY 406

Query:   370 IIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPL 429
             ++ N   + D+ Y YKA + +  CL              S I   +E ++   +   GPL
Sbjct:   407 VLQNELCLGDE-YKYKAKD-DMFCLNYRCKRKVSL----SSIGAVKENQLILALNEVGPL 460

Query:   430 SVGMNANGLFY-YSGGV 445
             SV +  N  F  YS GV
Sbjct:   461 SVNVGVNNDFVAYSEGV 477

 Score = 208 (78.3 bits), Expect = 4.1e-20, Sum P(2) = 4.1e-20
 Identities = 47/163 (28%), Positives = 91/163 (55%)

Query:    43 TRFLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAEDYQREDSGTAVFE--VNKFFDLSDS 100
             ++F  FM++H+KVY ++++ +R+ E F  N    +++ + +   A+++  VN+F D S+ 
Sbjct:   223 SKFFKFMKEHNKVYKNIDEQMRKFEIFKINYISIKNHNKLNKN-AMYKKKVNQFSDYSEE 281

Query:   101 DLQQLTGLNLDSTLEDIQPSLQAPFSSNQTDTEMRA-FQFNSLRHGDDL----PEAFDWR 155
             +L++     L      I+     PF ++  D  + + F  N  R+  D+    PE  D+R
Sbjct:   282 ELKEYFKTLLHVPNHMIE-KYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILDYR 340

Query:   156 AEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
              +G++ + K+QG C  CWAF++VG +E++ A +  N+   S Q
Sbjct:   341 EKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQ 383

 Score = 110 (43.8 bits), Expect = 4.1e-20, Sum P(2) = 4.1e-20
 Identities = 45/164 (27%), Positives = 67/164 (40%)

Query:   504 EKLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXX 563
             +++VDC   N GC+GG    +  Y++ N   + D+ Y YKA + +  CL           
Sbjct:   383 QEVVDCSKDNFGCDGGHPFYSFLYVLQNELCLGDE-YKYKAKD-DMFCLNYRCKRKVSLS 440

Query:   564 XXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLFYYSGGVI------DLNQRLCN----- 612
                  +   +       V     ++VG+N N    YS GV       +LN  +       
Sbjct:   441 SI-GAVKENQLILALNEVGPLS-VNVGVN-NDFVAYSEGVYNGTCSEELNHSVLLVGYGQ 497

Query:   613 -PKAQ-NHALIIVGYGEEEKK---DGTSIPYWIVKNSWGSDWGE 651
               K + N+   I  Y  +E     D   I YWI+KNSW   WGE
Sbjct:   498 VEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGE 541

 Score = 73 (30.8 bits), Expect = 1.0e-23, Sum P(2) = 1.0e-23
 Identities = 11/16 (68%), Positives = 12/16 (75%)

Query:   457 IPYWIVKNSWGSDWGE 472
             I YWI+KNSW   WGE
Sbjct:   526 IYYWIIKNSWSKKWGE 541


>UNIPROTKB|Q8I6V0 [details] [associations]
            symbol:PF14_0553 "Cysteine proteinase falcipain-1"
            species:36329 "Plasmodium falciparum 3D7" [GO:0042540 "hemoglobin
            catabolic process" evidence=TAS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 EMBL:AE014187 KO:K01376 HSSP:P07688 GO:GO:0042540
            RefSeq:XP_001348727.1 ProteinModelPortal:Q8I6V0 PRIDE:Q8I6V0
            EnsemblProtists:PF14_0553:mRNA GeneID:812135 KEGG:pfa:PF14_0553
            EuPathDB:PlasmoDB:PF3D7_1458000 HOGENOM:HOG000065906
            ProtClustDB:CLSZ2457715 ChEMBL:CHEMBL1250371 Uniprot:Q8I6V0
        Length = 569

 Score = 274 (101.5 bits), Expect = 1.0e-23, Sum P(2) = 1.0e-23
 Identities = 75/257 (29%), Positives = 130/257 (50%)

Query:   197 VQHHDKVYSSVEDLLRRHENFVTNVEKAEDYQSEDSGTAVFG--VNKFFDLSESDLQQLT 254
             ++ H+KVY ++++ +R+ E F  N    +++   +   A++   VN+F D SE +L++  
Sbjct:   229 MKEHNKVYKNIDEQMRKFEIFKINYISIKNHNKLNKN-AMYKKKVNQFSDYSEEELKEYF 287

Query:   255 GLNLDSTLEDIQPSLQAPFSSNQTDTEMRA-FQFNSLRHGDDL----PEAFDWRAEGVIS 309
                L      I+     PF ++  D  + + F  N  R+  D+    PE  D+R +G++ 
Sbjct:   288 KTLLHVPNHMIE-KYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILDYREKGIVH 346

Query:   310 KVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSNGGCNGGRMDDALQY 369
             + K+QG C  CWAF++VG +E++ A +  ++   S Q++VDC   N GC+GG    +  Y
Sbjct:   347 EPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKDNFGCDGGHPFYSFLY 406

Query:   370 IIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPL 429
             ++ N   + D+ Y YKA + +  CL              S I   +E ++   +   GPL
Sbjct:   407 VLQNELCLGDE-YKYKAKD-DMFCLNYRCKRKVSL----SSIGAVKENQLILALNEVGPL 460

Query:   430 SVGMNANGLFY-YSGGV 445
             SV +  N  F  YS GV
Sbjct:   461 SVNVGVNNDFVAYSEGV 477

 Score = 208 (78.3 bits), Expect = 4.1e-20, Sum P(2) = 4.1e-20
 Identities = 47/163 (28%), Positives = 91/163 (55%)

Query:    43 TRFLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAEDYQREDSGTAVFE--VNKFFDLSDS 100
             ++F  FM++H+KVY ++++ +R+ E F  N    +++ + +   A+++  VN+F D S+ 
Sbjct:   223 SKFFKFMKEHNKVYKNIDEQMRKFEIFKINYISIKNHNKLNKN-AMYKKKVNQFSDYSEE 281

Query:   101 DLQQLTGLNLDSTLEDIQPSLQAPFSSNQTDTEMRA-FQFNSLRHGDDL----PEAFDWR 155
             +L++     L      I+     PF ++  D  + + F  N  R+  D+    PE  D+R
Sbjct:   282 ELKEYFKTLLHVPNHMIE-KYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILDYR 340

Query:   156 AEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
              +G++ + K+QG C  CWAF++VG +E++ A +  N+   S Q
Sbjct:   341 EKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQ 383

 Score = 110 (43.8 bits), Expect = 4.1e-20, Sum P(2) = 4.1e-20
 Identities = 45/164 (27%), Positives = 67/164 (40%)

Query:   504 EKLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXX 563
             +++VDC   N GC+GG    +  Y++ N   + D+ Y YKA + +  CL           
Sbjct:   383 QEVVDCSKDNFGCDGGHPFYSFLYVLQNELCLGDE-YKYKAKD-DMFCLNYRCKRKVSLS 440

Query:   564 XXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLFYYSGGVI------DLNQRLCN----- 612
                  +   +       V     ++VG+N N    YS GV       +LN  +       
Sbjct:   441 SI-GAVKENQLILALNEVGPLS-VNVGVN-NDFVAYSEGVYNGTCSEELNHSVLLVGYGQ 497

Query:   613 -PKAQ-NHALIIVGYGEEEKK---DGTSIPYWIVKNSWGSDWGE 651
               K + N+   I  Y  +E     D   I YWI+KNSW   WGE
Sbjct:   498 VEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGE 541

 Score = 73 (30.8 bits), Expect = 1.0e-23, Sum P(2) = 1.0e-23
 Identities = 11/16 (68%), Positives = 12/16 (75%)

Query:   457 IPYWIVKNSWGSDWGE 472
             I YWI+KNSW   WGE
Sbjct:   526 IYYWIIKNSWSKKWGE 541


>FB|FBgn0037396 [details] [associations]
            symbol:CG11459 species:7227 "Drosophila melanogaster"
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=NAS]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            EMBL:AE014297 eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00639
            GeneTree:ENSGT00560000076599 KO:K01365 HSSP:P07711 EMBL:AY060710
            RefSeq:NP_649608.1 UniGene:Dm.3894 SMR:Q9VNK6 MEROPS:C01.A31
            EnsemblMetazoa:FBtr0078623 GeneID:40741 KEGG:dme:Dmel_CG11459
            UCSC:CG11459-RA FlyBase:FBgn0037396 InParanoid:Q9VNK6 OMA:NYDEREL
            OrthoDB:EOG4MGQPX ChiTaRS:CG11459 GenomeRNAi:40741 NextBio:820359
            Uniprot:Q9VNK6
        Length = 336

 Score = 278 (102.9 bits), Expect = 1.9e-23, P = 1.9e-23
 Identities = 96/306 (31%), Positives = 144/306 (47%)

Query:   200 HDKVYSSVEDLLRR-HENFVTNVEKAEDYQSEDSGTAVFGVNKFFDLSESDLQQLTGLNL 258
             ++K Y + +   R  +E  V  VE       +       G+NKF   S++D Q++     
Sbjct:    37 YNKQYRNRDKYHRALYEQRVLAVESHNQLYLQGKVAFKMGLNKF---SDTD-QRI----- 87

Query:   259 DSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVKEQG-KC 317
                L + + S+ AP    +T T       N  R+ D + E  DWR  G IS V +QG +C
Sbjct:    88 ---LFNYRSSIPAPL---ETSTNALTETVNYKRY-DQITEGIDWRQYGYISPVGDQGTEC 140

Query:   318 ACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDC-DMSNGGCNGGRMDDALQYIIDNGGV 376
               CWAFS  GV+EA  A +  +L  LS + LVDC    N GC+GG +  A  Y  D+G +
Sbjct:   141 LSCWAFSTSGVLEAHMAKKYGNLVPLSPKHLVDCVPYPNNGCSGGWVSVAFNYTRDHG-I 199

Query:   377 VSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMN-A 435
              + ++YPY+    E  CL             Y  +   +E E+ + V   GP++V ++  
Sbjct:   200 ATKESYPYEPVSGE--CLWKSDRSAGTLSG-YVTLGNYDERELAEVVYNIGPVAVSIDHL 256

Query:   436 NGLF-YYSGGVI----------DLNQRL----YGTSIP---YWIVKNSWGSDWGEKVEDK 477
             +  F  YSGGV+          DL   +    +GT      YWI+KNS+G+DWGE    K
Sbjct:   257 HEEFDQYSGGVLSIPACRSKRQDLTHSVLLVGFGTHRKWGDYWIIKNSYGTDWGESGYLK 316

Query:   478 VGSSGN 483
             +  + N
Sbjct:   317 LARNAN 322

 Score = 203 (76.5 bits), Expect = 1.4e-13, P = 1.4e-13
 Identities = 62/201 (30%), Positives = 99/201 (49%)

Query:   467 GSDWGEK-VEDKVGSSGNRTRD---LELTGVLPSKLSR-------LATEKLVDC-DMSNG 514
             G DW +      VG  G           +GVL + +++       L+ + LVDC    N 
Sbjct:   121 GIDWRQYGYISPVGDQGTECLSCWAFSTSGVLEAHMAKKYGNLVPLSPKHLVDCVPYPNN 180

Query:   515 GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYGEE 574
             GC+GG +  A  Y  D+G + + ++YPY+    E  CL             Y  +   +E
Sbjct:   181 GCSGGWVSVAFNYTRDHG-IATKESYPYEPVSGE--CLWKSDRSAGTLSG-YVTLGNYDE 236

Query:   575 EEMKKWVATRGPLSVGMN-ANGLF-YYSGGVIDLNQRLCNPKAQN--HALIIVGYGEEEK 630
              E+ + V   GP++V ++  +  F  YSGGV+ +    C  K Q+  H++++VG+G   +
Sbjct:   237 RELAEVVYNIGPVAVSIDHLHEEFDQYSGGVLSIPA--CRSKRQDLTHSVLLVGFGTH-R 293

Query:   631 KDGTSIPYWIVKNSWGSDWGE 651
             K G    YWI+KNS+G+DWGE
Sbjct:   294 KWGD---YWIIKNSYGTDWGE 311

 Score = 138 (53.6 bits), Expect = 3.4e-06, P = 3.4e-06
 Identities = 50/159 (31%), Positives = 73/159 (45%)

Query:    43 TRFLNFMRDHDKVYSSVEDLLRR-HENFVTNVEKAEDYQREDSGTAVFEVNKFFDLSDSD 101
             T +  +   ++K Y + +   R  +E  V  VE       +        +NKF   SD+D
Sbjct:    28 TEWDQYKAKYNKQYRNRDKYHRALYEQRVLAVESHNQLYLQGKVAFKMGLNKF---SDTD 84

Query:   102 LQQLTGLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVIS 161
              Q++        L + + S+ AP    +T T       N  R+ D + E  DWR  G IS
Sbjct:    85 -QRI--------LFNYRSSIPAPL---ETSTNALTETVNYKRY-DQITEGIDWRQYGYIS 131

Query:   162 KVKEQG-KCACCWAFSAVGVVEAMHAIQGNNLTELSVQH 199
              V +QG +C  CWAFS  GV+EA  A +  NL  LS +H
Sbjct:   132 PVGDQGTECLSCWAFSTSGVLEAHMAKKYGNLVPLSPKH 170


>UNIPROTKB|Q0VCU3 [details] [associations]
            symbol:CTSF "Uncharacterized protein" species:9913 "Bos
            taurus" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 eggNOG:COG4870
            HOGENOM:HOG000230774 KO:K01373 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00139 PROSITE:PS00639
            HOVERGEN:HBG011513 MEROPS:C01.018 CTD:8722 OMA:LAPPEWD
            OrthoDB:EOG4CC41T GeneTree:ENSGT00660000095458 EMBL:DAAA02063594
            EMBL:BC120003 IPI:IPI00717812 RefSeq:NP_001068884.1 UniGene:Bt.7264
            SMR:Q0VCU3 Ensembl:ENSBTAT00000014587 GeneID:509715 KEGG:bta:509715
            InParanoid:Q0VCU3 NextBio:20869091 Uniprot:Q0VCU3
        Length = 460

 Score = 294 (108.6 bits), Expect = 2.2e-23, P = 2.2e-23
 Identities = 85/281 (30%), Positives = 129/281 (45%)

Query:   215 ENFVTNVEKAEDYQSEDSGT-AVFGVNKFFDLSESDLQQLT---GLNL--DSTLEDIQPS 268
             ++FVT   +  D Q E S   +VF  N         L + T   G+    D T E+ +  
Sbjct:   164 KDFVTTYNRTYDSQEEASWRMSVFANNMVRAQKIQALDRGTARYGVTKFSDLTEEEFRTI 223

Query:   269 LQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGV 328
                P   +     MR  Q  +    D  P  +DWR +G ++ VK+QG C  CWAFS  G 
Sbjct:   224 YLNPLLKDAPGRNMRPAQPVT----DVPPPQWDWRNKGAVTNVKDQGMCGSCWAFSVTGN 279

Query:   329 VEAMHAIQGNSLTELSVQQLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASE 388
             VE    ++  +L  LS Q+L+DCD ++  C GG   +A   I   GG+ ++  Y Y+   
Sbjct:   280 VEGQWFLKRGTLLSLSEQELLDCDKTDKACLGGLPSNAYSAIRTLGGLETEDDYSYRGRL 339

Query:   389 SERGCLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLFYYSGGV--- 445
                                 S+     E+++  W+A  GP+S+ +NA G+ +Y  G+   
Sbjct:   340 QTCSFSAEKAKVYINDSVELSK----NEQKLAAWLAKNGPVSIAINAFGMQFYRHGISHP 395

Query:   446 ---------IDLNQRL--YG--TSIPYWIVKNSWGSDWGEK 473
                      ID    L  YG  ++IP+W +KNSWG+DWGE+
Sbjct:   396 LRPLCSPWLIDHAVLLVGYGNRSAIPFWAIKNSWGTDWGEE 436

 Score = 234 (87.4 bits), Expect = 4.6e-22, Sum P(2) = 4.6e-22
 Identities = 58/198 (29%), Positives = 99/198 (50%)

Query:   458 PYWIVKNSWGSDWGEKVEDKVGS--SGNRTRDLELTGVLP-SKLSRLATEKLVDCDMSNG 514
             P W  +N  G+    K +   GS  + + T ++E    L    L  L+ ++L+DCD ++ 
Sbjct:   249 PQWDWRNK-GAVTNVKDQGMCGSCWAFSVTGNVEGQWFLKRGTLLSLSEQELLDCDKTDK 307

Query:   515 GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYGEE 574
              C GG   +A   I   GG+ ++  Y Y+                       S+     E
Sbjct:   308 ACLGGLPSNAYSAIRTLGGLETEDDYSYRGRLQTCSFSAEKAKVYINDSVELSK----NE 363

Query:   575 EEMKKWVATRGPLSVGMNANGLFYYSGGVIDLNQRLCNPKAQNHALIIVGYGEEEKKDGT 634
             +++  W+A  GP+S+ +NA G+ +Y  G+    + LC+P   +HA+++VGYG       +
Sbjct:   364 QKLAAWLAKNGPVSIAINAFGMQFYRHGISHPLRPLCSPWLIDHAVLLVGYGNR-----S 418

Query:   635 SIPYWIVKNSWGSDWGEK 652
             +IP+W +KNSWG+DWGE+
Sbjct:   419 AIPFWAIKNSWGTDWGEE 436

 Score = 128 (50.1 bits), Expect = 7.9e-05, P = 7.9e-05
 Identities = 43/138 (31%), Positives = 60/138 (43%)

Query:    67 ENFVTNVEKAEDYQREDSGT-AVFEVNKFFDLSDSDLQQLT---GLNL--DSTLEDIQPS 120
             ++FVT   +  D Q E S   +VF  N         L + T   G+    D T E+ +  
Sbjct:   164 KDFVTTYNRTYDSQEEASWRMSVFANNMVRAQKIQALDRGTARYGVTKFSDLTEEEFRTI 223

Query:   121 LQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGV 180
                P   +     MR  Q  +    D  P  +DWR +G ++ VK+QG C  CWAFS  G 
Sbjct:   224 YLNPLLKDAPGRNMRPAQPVT----DVPPPQWDWRNKGAVTNVKDQGMCGSCWAFSVTGN 279

Query:   181 VEAMHAIQGNNLTELSVQ 198
             VE    ++   L  LS Q
Sbjct:   280 VEGQWFLKRGTLLSLSEQ 297

 Score = 96 (38.9 bits), Expect = 4.6e-22, Sum P(2) = 4.6e-22
 Identities = 24/74 (32%), Positives = 41/74 (55%)

Query:   197 VQHHDKVYSSVEDLLRRHENFVTNVEKAEDYQSEDSGTAVFGVNKFFDLSESDLQQ--LT 254
             V  +++ Y S E+   R   F  N+ +A+  Q+ D GTA +GV KF DL+E + +   L 
Sbjct:   167 VTTYNRTYDSQEEASWRMSVFANNMVRAQKIQALDRGTARYGVTKFSDLTEEEFRTIYLN 226

Query:   255 GLNLDSTLEDIQPS 268
              L  D+   +++P+
Sbjct:   227 PLLKDAPGRNMRPA 240

 Score = 92 (37.4 bits), Expect = 1.2e-21, Sum P(2) = 1.2e-21
 Identities = 23/78 (29%), Positives = 42/78 (53%)

Query:    45 FLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAEDYQREDSGTAVFEVNKFFDLSDSDLQQ 104
             F +F+  +++ Y S E+   R   F  N+ +A+  Q  D GTA + V KF DL++ + + 
Sbjct:   163 FKDFVTTYNRTYDSQEEASWRMSVFANNMVRAQKIQALDRGTARYGVTKFSDLTEEEFRT 222

Query:   105 --LTGLNLDSTLEDIQPS 120
               L  L  D+   +++P+
Sbjct:   223 IYLNPLLKDAPGRNMRPA 240


>ZFIN|ZDB-GENE-030131-9831 [details] [associations]
            symbol:ctsf "cathepsin F" species:7955 "Danio
            rerio" [GO:0004869 "cysteine-type endopeptidase inhibitor activity"
            evidence=IEA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000010 InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00031 Pfam:PF00112 PRINTS:PR00705 SMART:SM00043
            SMART:SM00645 InterPro:IPR000169 ZFIN:ZDB-GENE-030131-9831
            GO:GO:0004869 eggNOG:COG4870 HOGENOM:HOG000230774 KO:K01373
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00139
            PROSITE:PS00639 HOVERGEN:HBG011513 CTD:8722 OrthoDB:EOG4CC41T
            MEROPS:I25.006 EMBL:BC124243 IPI:IPI00503226 RefSeq:NP_001071036.1
            UniGene:Dr.81265 ProteinModelPortal:Q08CH0 SMR:Q08CH0 GeneID:565588
            KEGG:dre:565588 InParanoid:Q08CH0 NextBio:20885952
            ArrayExpress:Q08CH0 Uniprot:Q08CH0
        Length = 473

 Score = 270 (100.1 bits), Expect = 1.5e-20, P = 1.5e-20
 Identities = 82/297 (27%), Positives = 129/297 (43%)

Query:   200 HDKVYSSVEDLLRRHENFVTNVEKAEDYQSE-DSGTAVFGVNKFFDLSESDLQQLT---G 255
             H K      +LL   +NF+    +    Q E +    +F  N     +   L+Q +   G
Sbjct:   161 HSKPMKESVELLTMFKNFMITYNRTYSSQEEAEKRLRIFQQNMKTAQTLQSLEQGSAEYG 220

Query:   256 LNL--DSTLEDIQPSLQAPFSSNQT-DTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVK 312
             +    D T ++ +     P  S  +   EM+     ++      P+ +DWR  G +S VK
Sbjct:   221 ITKFSDLTEDEFRMMYLNPMLSQWSLKKEMKP----AIPASAPAPDTWDWRDHGAVSPVK 276

Query:   313 EQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSNGGCNGGRMDDALQYIID 372
              QG C  CWAFS  G +E     +   L  LS Q+LVDCD  +  C GG   +A + I +
Sbjct:   277 NQGMCGSCWAFSVTGNIEGQWFKKTGQLLSLSEQELVDCDKLDQACGGGLPSNAYEAIEN 336

Query:   373 NGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVG 432
              GG+ ++  Y Y   +  + C                 +P  +E+E+  ++A  GP+S  
Sbjct:   337 LGGLETETDYSYTGHK--QSCDFSTGKVAAYINSSVE-LPK-DEKEIAAFLAENGPVSAA 392

Query:   433 MNANGLFYYSGGV------------IDLNQRLYG----TSIPYWIVKNSWGSDWGEK 473
             +NA  + +Y  GV            ID    L G      +P+W +KNSWG D+GE+
Sbjct:   393 LNAFAMQFYRKGVSHPLKIFCNPWMIDHAVLLVGFGQRNGVPFWAIKNSWGEDYGEQ 449

 Score = 219 (82.2 bits), Expect = 2.8e-23, Sum P(2) = 2.8e-23
 Identities = 48/156 (30%), Positives = 85/156 (54%)

Query:   497 KLSRLATEKLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXX 556
             +L  L+ ++LVDCD  +  C GG   +A + I + GG+ ++  Y Y   +  + C     
Sbjct:   303 QLLSLSEQELVDCDKLDQACGGGLPSNAYEAIENLGGLETETDYSYTGHK--QSCDFSTG 360

Query:   557 XXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLFYYSGGVIDLNQRLCNPKAQ 616
                         +P  +E+E+  ++A  GP+S  +NA  + +Y  GV    +  CNP   
Sbjct:   361 KVAAYINSSVE-LPK-DEKEIAAFLAENGPVSAALNAFAMQFYRKGVSHPLKIFCNPWMI 418

Query:   617 NHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
             +HA+++VG+G+   ++G  +P+W +KNSWG D+GE+
Sbjct:   419 DHAVLLVGFGQ---RNG--VPFWAIKNSWGEDYGEQ 449

 Score = 125 (49.1 bits), Expect = 2.8e-23, Sum P(2) = 2.8e-23
 Identities = 41/154 (26%), Positives = 64/154 (41%)

Query:    52 HDKVYSSVEDLLRRHENFVTNVEKAEDYQRE-DSGTAVFEVNKFFDLSDSDLQQLT---G 107
             H K      +LL   +NF+    +    Q E +    +F+ N     +   L+Q +   G
Sbjct:   161 HSKPMKESVELLTMFKNFMITYNRTYSSQEEAEKRLRIFQQNMKTAQTLQSLEQGSAEYG 220

Query:   108 LNL--DSTLEDIQPSLQAPFSSNQT-DTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVK 164
             +    D T ++ +     P  S  +   EM+     ++      P+ +DWR  G +S VK
Sbjct:   221 ITKFSDLTEDEFRMMYLNPMLSQWSLKKEMKP----AIPASAPAPDTWDWRDHGAVSPVK 276

Query:   165 EQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
              QG C  CWAFS  G +E     +   L  LS Q
Sbjct:   277 NQGMCGSCWAFSVTGNIEGQWFKKTGQLLSLSEQ 310

 Score = 105 (42.0 bits), Expect = 3.4e-21, Sum P(2) = 3.4e-21
 Identities = 27/94 (28%), Positives = 53/94 (56%)

Query:    42 VTRFLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAEDYQREDSGTAVFEVNKFFDLSDSD 101
             +T F NFM  +++ YSS E+  +R   F  N++ A+  Q  + G+A + + KF DL++ +
Sbjct:   172 LTMFKNFMITYNRTYSSQEEAEKRLRIFQQNMKTAQTLQSLEQGSAEYGITKFSDLTEDE 231

Query:   102 LQQL---TGLNLDSTLEDIQPSLQAPFSSNQTDT 132
              + +     L+  S  ++++P++  P S+   DT
Sbjct:   232 FRMMYLNPMLSQWSLKKEMKPAI--PASAPAPDT 263


>WB|WBGene00013076 [details] [associations]
            symbol:Y51A2D.8 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 GeneTree:ENSGT00560000076599
            EMBL:AL021497 HSSP:P53634 HOGENOM:HOG000019851 PIR:T27079
            RefSeq:NP_507627.1 ProteinModelPortal:Q9XXQ7 SMR:Q9XXQ7
            MEROPS:C01.A49 EnsemblMetazoa:Y51A2D.8 GeneID:180208
            KEGG:cel:CELE_Y51A2D.8 UCSC:Y51A2D.8 CTD:180208 WormBase:Y51A2D.8
            eggNOG:NOG307864 InParanoid:Q9XXQ7 OMA:VAVYFKV NextBio:908434
            Uniprot:Q9XXQ7
        Length = 386

 Score = 230 (86.0 bits), Expect = 1.9e-21, Sum P(2) = 1.9e-21
 Identities = 63/206 (30%), Positives = 99/206 (48%)

Query:   200 HDKVYSSVEDLLRRHENFV---TNVEKAEDYQSEDSG-TAVFGVNKFFDLSESDLQQLTG 255
             +++ Y    +  +R  NFV    NV+K  + +S+ +G    FG+NKF DLS ++      
Sbjct:    50 YNRKYKDESENQQRFNNFVKSYNNVDKL-NAKSKAAGYDTQFGINKFSDLSTAEFHGRLS 108

Query:   256 LNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRH---GDDLPEAFDWRAE---G--V 307
               + S    + P L   F   + D   RA   N  RH       P+ FD R E   G  +
Sbjct:   109 NVVPSNNTGL-PMLN--FDKKKPD--FRAADMNKTRHKRRSTRYPDYFDLRNEKINGRYI 163

Query:   308 ISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSNG-GCNGGRMDDA 366
             +  +K+QG+CACCW F+   +VE ++A        LS Q++ DC      GC GG +   
Sbjct:   164 VGPIKDQGQCACCWGFAVTALVETVYAAHSGKFKSLSDQEVCDCGTEGTPGCKGGSLTLG 223

Query:   367 LQYIIDNGGVVSDQAYPYKASESERG 392
             +QY+    G+  D+ YPY  + + +G
Sbjct:   224 VQYV-KKYGLSGDEDYPYDQNRANQG 248

 Score = 178 (67.7 bits), Expect = 4.1e-23, Sum P(2) = 4.1e-23
 Identities = 51/162 (31%), Positives = 74/162 (45%)

Query:   497 KLSRLATEKLVDCDMSNG-GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERG--C-L 552
             K   L+ +++ DC      GC GG +   +QY+    G+  D+ YPY  + + +G  C L
Sbjct:   195 KFKSLSDQEVCDCGTEGTPGCKGGSLTLGVQYV-KKYGLSGDEDYPYDQNRANQGRRCRL 253

Query:   553 XXXXXXXXXXXXXYSRI-PYGEEEEMKKWVAT-RGPLSVGMNANGLFY-YSGGVIDLNQR 609
                          ++ I P   EE++ + +   + P++V       F  Y  GVI  +  
Sbjct:   254 RETDRIVPARAFNFAVINPRRAEEQIIQVLTEWKVPVAVYFKVGDQFKEYKEGVIIEDD- 312

Query:   610 LCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGE 651
              C    Q HA  IVGY   E   G S  YWI+KNSWG DW E
Sbjct:   313 -CRRATQWHAGAIVGYDTVEDSRGRSHDYWIIKNSWGGDWAE 353

 Score = 164 (62.8 bits), Expect = 4.1e-23, Sum P(2) = 4.1e-23
 Identities = 51/192 (26%), Positives = 82/192 (42%)

Query:    21 MIKVALLESNIFQT---RGYLNSPVTRFLNFMRDHDKVYSSVEDLLRRHENFV---TNVE 74
             ++ + + E   F+    R +       F +F + +++ Y    +  +R  NFV    NV+
Sbjct:    16 VVSINITEPEFFEINIDRDHPEKLYKAFEDFKKKYNRKYKDESENQQRFNNFVKSYNNVD 75

Query:    75 KAEDYQREDSGTAVFEVNKFFDLSDSDLQQLTGLNLDSTLEDIQPSLQAPFSSNQTDTEM 134
             K     +       F +NKF DLS ++        + S    + P L   F   + D   
Sbjct:    76 KLNAKSKAAGYDTQFGINKFSDLSTAEFHGRLSNVVPSNNTGL-PMLN--FDKKKPD--F 130

Query:   135 RAFQFNSLRH---GDDLPEAFDWRAE---G--VISKVKEQGKCACCWAFSAVGVVEAMHA 186
             RA   N  RH       P+ FD R E   G  ++  +K+QG+CACCW F+   +VE ++A
Sbjct:   131 RAADMNKTRHKRRSTRYPDYFDLRNEKINGRYIVGPIKDQGQCACCWGFAVTALVETVYA 190

Query:   187 IQGNNLTELSVQ 198
                     LS Q
Sbjct:   191 AHSGKFKSLSDQ 202

 Score = 89 (36.4 bits), Expect = 1.9e-21, Sum P(2) = 1.9e-21
 Identities = 20/57 (35%), Positives = 29/57 (50%)

Query:   440 YYSGGVI--DLNQRLYGTSIPYWIVKNSWGSDWGEKVEDKV--GSSGNRTRDLELTG 492
             +++G ++  D  +   G S  YWI+KNSWG DW E    +V  G       D  +TG
Sbjct:   319 WHAGAIVGYDTVEDSRGRSHDYWIIKNSWGGDWAESGYVRVVRGRDWCSIEDQPMTG 375


>DICTYBASE|DDB_G0272742 [details] [associations]
            symbol:DDB_G0272742 species:44689 "Dictyostelium
            discoideum" [GO:0005615 "extracellular space" evidence=IDA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 SMART:SM00645
            dictyBase:DDB_G0272742 GO:GO:0005615 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00639 EMBL:AAFI02000008
            eggNOG:NOG331187 RefSeq:XP_644986.1 ProteinModelPortal:Q7KWP5
            PRIDE:Q7KWP5 EnsemblProtists:DDB0168242 GeneID:8618663
            KEGG:ddi:DDB_G0272742 InParanoid:Q7KWP5 OMA:ATESAHF Uniprot:Q7KWP5
        Length = 345

 Score = 242 (90.2 bits), Expect = 9.1e-22, Sum P(2) = 9.1e-22
 Identities = 72/264 (27%), Positives = 127/264 (48%)

Query:   187 IQGNNLTELSVQHHDKVYSSVEDLLRRHENFVTNVEKAEDYQSEDSGTAVFGVNKFFDLS 246
             IQ  N     +  + + Y+S E    R+  F +N++    + S+ S T V  +N+F D+S
Sbjct:    23 IQYRNEFTAWMTSNQRTYASSE-FTNRYNTFKSNLDFINQWNSKGSKT-VLALNEFADIS 80

Query:   247 ESDLQQLTGLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEG 306
               + ++   L  D+ +  +   L     +++ D E+++   +S   G       DWR +G
Sbjct:    81 NEEYRK-NYLRNDNNINKLSSLL----INDKEDKEIKSS--SSSGSGSS---GIDWRKKG 130

Query:   307 VISKVKEQ-GKCACCWAFSAVGVVEAMHAIQG--NSLTELSVQQLVDCDMSNGGCNGGRM 363
              +  VK Q G C   W  +AVG  E+ H +    +    LS+Q L+DC   N  C  G +
Sbjct:   131 AVPSVKSQIGGCGS-WPITAVGATESAHFLANPKDPFISLSMQNLIDCSNLNKQCYQGTV 189

Query:   364 DDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYGEEEEMKKWV 423
             ++A QYII+NGG+ S+++Y  K S  E G               Y ++  G E  ++  V
Sbjct:   190 NEAFQYIIENGGIDSEESY--KFSGGEPGKCKYNSSNSVAKITSYEKVKSGSESSLESAV 247

Query:   424 ATRGPLSVGMNAN-GLF-YYSGGV 445
             + + P++  ++A+   F +YS G+
Sbjct:   248 SLK-PVAAYIDASLSSFQFYSSGI 270

 Score = 210 (79.0 bits), Expect = 1.3e-22, Sum P(2) = 1.3e-22
 Identities = 49/157 (31%), Positives = 83/157 (52%)

Query:   501 LATEKLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXX 560
             L+ + L+DC   N  C  G +++A QYII+NGG+ S+++Y  K S  E G          
Sbjct:   169 LSMQNLIDCSNLNKQCYQGTVNEAFQYIIENGGIDSEESY--KFSGGEPGKCKYNSSNSV 226

Query:   561 XXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNAN-GLF-YYSGGVIDLNQRLCNPKAQNH 618
                  Y ++  G E  ++  V+ + P++  ++A+   F +YS G+    +  CN    NH
Sbjct:   227 AKITSYEKVKSGSESSLESAVSLK-PVAAYIDASLSSFQFYSSGIY--YEPSCNSTDLNH 283

Query:   619 ALIIVGYGEEEKKDGTSIP----YWIVKNSWGSDWGE 651
             +++IVG+ +       S+     YWIV+NS+G +WGE
Sbjct:   284 SILIVGFSDFSTTPTDSLKHSSNYWIVQNSFGKNWGE 320

 Score = 119 (46.9 bits), Expect = 1.3e-22, Sum P(2) = 1.3e-22
 Identities = 40/158 (25%), Positives = 75/158 (47%)

Query:    45 FLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAEDYQREDSGTAVFEVNKFFDLSDSDLQQ 104
             F  +M  + + Y+S E    R+  F +N++    +  + S T V  +N+F D+S+ + ++
Sbjct:    29 FTAWMTSNQRTYASSE-FTNRYNTFKSNLDFINQWNSKGSKT-VLALNEFADISNEEYRK 86

Query:   105 LTGLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVK 164
                L  D+ +  +   L     +++ D E+++   +S   G       DWR +G +  VK
Sbjct:    87 -NYLRNDNNINKLSSLL----INDKEDKEIKSS--SSSGSGSS---GIDWRKKGAVPSVK 136

Query:   165 EQ-GKCACCWAFSAVGVVEAMHAIQG--NNLTELSVQH 199
              Q G C   W  +AVG  E+ H +    +    LS+Q+
Sbjct:   137 SQIGGCGS-WPITAVGATESAHFLANPKDPFISLSMQN 173

 Score = 67 (28.6 bits), Expect = 9.1e-22, Sum P(2) = 9.1e-22
 Identities = 10/14 (71%), Positives = 13/14 (92%)

Query:   459 YWIVKNSWGSDWGE 472
             YWIV+NS+G +WGE
Sbjct:   307 YWIVQNSFGKNWGE 320


>MGI|MGI:1927229 [details] [associations]
            symbol:Ctsm "cathepsin M" species:10090 "Mus musculus"
            [GO:0005575 "cellular_component" evidence=ND] [GO:0005764
            "lysosome" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0008152 "metabolic process" evidence=ISS] [GO:0008233
            "peptidase activity" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=ISS] [GO:0016787 "hydrolase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            InterPro:IPR015645 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            MGI:MGI:1927229 eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 EMBL:AF202528
            EMBL:AY014777 EMBL:AY057446 EMBL:AK005550 EMBL:AK005428
            IPI:IPI00131133 RefSeq:NP_071721.2 UniGene:Mm.279933
            ProteinModelPortal:Q9JL96 SMR:Q9JL96 STRING:Q9JL96 MEROPS:C01.023
            PRIDE:Q9JL96 DNASU:64139 Ensembl:ENSMUST00000099451 GeneID:64139
            KEGG:mmu:64139 UCSC:uc007qwj.1 CTD:64139 InParanoid:Q9JL96
            KO:K09600 OrthoDB:EOG4TTGKR NextBio:319931 Bgee:Q9JL96
            CleanEx:MM_CTSM Genevestigator:Q9JL96 GermOnline:ENSMUSG00000074484
            GermOnline:ENSMUSG00000074871 PANTHER:PTHR12411:SF58 Uniprot:Q9JL96
        Length = 333

 Score = 234 (87.4 bits), Expect = 1.6e-17, P = 1.6e-17
 Identities = 52/155 (33%), Positives = 78/155 (50%)

Query:   295 DLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMS 354
             +LP+  +W+  G ++ V+ QG+C  CWAFS  G +E     +   L  LSVQ LVDC   
Sbjct:   113 NLPKFINWKKRGYVTPVQTQGRCNSCWAFSVTGAIEGQMFRKTGQLIPLSVQNLVDCSRP 172

Query:   355 NG--GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIP 412
              G  GC  G    AL Y+++NGG+ S+  YPY+  E +  C              +  +P
Sbjct:   173 QGNWGCYLGNTYLALHYVMENGGLESEATYPYE--EKDGSC-RYSPENSTANITGFEFVP 229

Query:   413 YGEEEEMKKWVATRGPLSVGMNAN--GLFYYSGGV 445
               E+  M   VA+ GP+SV ++A      +Y  G+
Sbjct:   230 KNEDALMNA-VASIGPISVAIDARHASFLFYKRGI 263

 Score = 211 (79.3 bits), Expect = 1.6e-22, Sum P(2) = 1.6e-22
 Identities = 55/162 (33%), Positives = 81/162 (50%)

Query:   497 KLSRLATEKLVDCDMSNG--GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXX 554
             +L  L+ + LVDC    G  GC  G    AL Y+++NGG+ S+  YPY+  E +  C   
Sbjct:   157 QLIPLSVQNLVDCSRPQGNWGCYLGNTYLALHYVMENGGLESEATYPYE--EKDGSC-RY 213

Query:   555 XXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANG---LFYYSGGVIDLNQRLC 611
                        +  +P  E+  M   VA+ GP+SV ++A     LFY  G   + N   C
Sbjct:   214 SPENSTANITGFEFVPKNEDALMNA-VASIGPISVAIDARHASFLFYKRGIYYEPN---C 269

Query:   612 NPKAQNHALIIVGYG-EEEKKDGTSIPYWIVKNSWGSDWGEK 652
             +     H++++VGYG    + DG    YW+VKNS G+ WG K
Sbjct:   270 SSCVVTHSMLLVGYGFTGRESDGRK--YWLVKNSMGTQWGNK 309

 Score = 115 (45.5 bits), Expect = 1.6e-22, Sum P(2) = 1.6e-22
 Identities = 20/53 (37%), Positives = 31/53 (58%)

Query:   147 DLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQH 199
             +LP+  +W+  G ++ V+ QG+C  CWAFS  G +E     +   L  LSVQ+
Sbjct:   113 NLPKFINWKKRGYVTPVQTQGRCNSCWAFSVTGAIEGQMFRKTGQLIPLSVQN 165

 Score = 77 (32.2 bits), Expect = 3.3e-07, Sum P(2) = 3.3e-07
 Identities = 36/147 (24%), Positives = 52/147 (35%)

Query:   355 NGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASES------ERGCLXXXXXXXXXXXXXY 408
             N GC  G    AL Y+++NGG+ S+  YPY+  +       E                  
Sbjct:   175 NWGCYLGNTYLALHYVMENGGLESEATYPYEEKDGSCRYSPENSTANITGFEFVPKNEDA 234

Query:   409 SRIPYGEEEEMKKWVATRGPLSVGMNANGLFYY---SGGVIDLNQRLYGTSIP------- 458
                       +   +  R   S      G++Y    S  V+  +  L G           
Sbjct:   235 LMNAVASIGPISVAIDARHA-SFLFYKRGIYYEPNCSSCVVTHSMLLVGYGFTGRESDGR 293

Query:   459 -YWIVKNSWGSDWGEKVEDKVG-SSGN 483
              YW+VKNS G+ WG K   K+    GN
Sbjct:   294 KYWLVKNSMGTQWGNKGYMKISRDKGN 320


>DICTYBASE|DDB_G0281077 [details] [associations]
            symbol:DDB_G0281077 species:44689 "Dictyostelium
            discoideum" [GO:0030246 "carbohydrate binding" evidence=IEA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR008965 InterPro:IPR013128 InterPro:IPR019028
            Pfam:PF00112 Pfam:PF09478 PRINTS:PR00705 SMART:SM00645
            SMART:SM01063 InterPro:IPR000169 dictyBase:DDB_G0281077
            GO:GO:0030246 EMBL:AAFI02000040 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00139
            PROSITE:PS00639 SUPFAM:SSF49384 ProtClustDB:CLSZ2430562
            RefSeq:XP_640803.1 ProteinModelPortal:Q54UH3
            EnsemblProtists:DDB0203998 GeneID:8622857 KEGG:ddi:DDB_G0281077
            InParanoid:Q54UH3 OMA:LINDFNF Uniprot:Q54UH3
        Length = 662

 Score = 270 (100.1 bits), Expect = 2.1e-22, Sum P(2) = 2.1e-22
 Identities = 59/153 (38%), Positives = 81/153 (52%)

Query:   297 PEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDC--DMS 354
             P + DWR  G++SKVK QG C  C+AFS VG +EA +  + N +  LS Q LVDC  +  
Sbjct:   472 PISIDWRTWGMVSKVKNQGSCGSCYAFSTVGALEAHYYRKNNRMLNLSEQNLVDCTRNYG 531

Query:   355 NGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYG 414
             NG C+GG M +  +YI +NGG+     YPY   E   G               Y  I   
Sbjct:   532 NGECSGGWMHNCFRYIKENGGINLQSTYPY---EGRVGLCRYNSGDAQSRISNYVMIKQH 588

Query:   415 EEEEMKKWVATRGPLSVGMNANG--LFYYSGGV 445
             +EE++   VA+ GP+SV  +A+     YYS G+
Sbjct:   589 DEEDLANAVASVGPVSVAYDASTREFMYYSSGI 621

 Score = 201 (75.8 bits), Expect = 9.9e-15, Sum P(2) = 9.9e-15
 Identities = 63/203 (31%), Positives = 93/203 (45%)

Query:   451 RLYGTSIPYWIVKNSWGSDWGEKVEDKVGSSGNRTRDLELTGVLPS-------KLSRLAT 503
             RL   S P  I   +WG     KV+++ GS G+        G L +       ++  L+ 
Sbjct:   465 RLLKWSRPISIDWRTWGMV--SKVKNQ-GSCGS-CYAFSTVGALEAHYYRKNNRMLNLSE 520

Query:   504 EKLVDC--DMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXX 561
             + LVDC  +  NG C+GG M +  +YI +NGG+     YPY   E   G           
Sbjct:   521 QNLVDCTRNYGNGECSGGWMHNCFRYIKENGGINLQSTYPY---EGRVGLCRYNSGDAQS 577

Query:   562 XXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANG--LFYYSGGVIDLNQRLCNPKAQNHA 619
                 Y  I   +EE++   VA+ GP+SV  +A+     YYS G+   N   C+     HA
Sbjct:   578 RISNYVMIKQHDEEDLANAVASVGPVSVAYDASTREFMYYSSGIY--NSDSCDKYRTTHA 635

Query:   620 LIIVGYGEEEKKDGTSIPYWIVK 642
             +++VGYG E   D     +WI+K
Sbjct:   636 VVVVGYGIENGVD-----FWIIK 653

 Score = 131 (51.2 bits), Expect = 4.1e-07, Sum P(2) = 4.1e-07
 Identities = 24/51 (47%), Positives = 33/51 (64%)

Query:   149 PEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQH 199
             P + DWR  G++SKVK QG C  C+AFS VG +EA +  + N +  LS Q+
Sbjct:   472 PISIDWRTWGMVSKVKNQGSCGSCYAFSTVGALEAHYYRKNNRMLNLSEQN 522

 Score = 68 (29.0 bits), Expect = 2.1e-22, Sum P(2) = 2.1e-22
 Identities = 24/90 (26%), Positives = 43/90 (47%)

Query:    45 FLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAEDYQRED-SGTAVFEVNKFFDLS-DSDL 102
             F+ +    ++ Y + + LL+ +E F  +    E Y+RE+ + T    + +F D++ D  L
Sbjct:   162 FIQWSNQFNRTYRADQFLLK-YEAFKDSSRFIEQYKRENQNSTMELGLTQFSDMTHDEFL 220

Query:   103 QQLTGLNLDSTLEDIQPSLQAPFSSNQTDT 132
                T    +  L +  PS  +P S N T T
Sbjct:   221 NIYTSKLYEFNLNETTPS-NSPCSVNITQT 249


>UNIPROTKB|J9P7C5 [details] [associations]
            symbol:J9P7C5 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00560000076577 EMBL:AAEX03010953
            Ensembl:ENSCAFT00000012925 Uniprot:J9P7C5
        Length = 321

 Score = 259 (96.2 bits), Expect = 2.2e-21, P = 2.2e-21
 Identities = 67/195 (34%), Positives = 97/195 (49%)

Query:   295 DLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMS 354
             ++P++ DWR +G ++ VK QG+C  CWAFSA G  E     +  +L  LS Q L      
Sbjct:   108 EIPKSVDWREKGYVTPVKNQGQCGSCWAFSATGAFEGQMFWKTGNLVPLSEQNLAQ---G 164

Query:   355 NGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYG 414
             N GCNGG MD+A QY+ DN  + S+++YPY   +++  C              +  +P  
Sbjct:   165 NEGCNGGLMDNAFQYVKDNRCLDSEESYPYLGRDTDT-C-NYKPECSAAHDSGFVDLPQR 222

Query:   415 EEEEMKKWVATRGPLSVGMNANGL---FYYSG--------------GVIDLNQRLYGT-S 456
             E+  MK  +AT G ++V ++A      FY S               GV+ +     GT S
Sbjct:   223 EKALMKA-MATLGSITVAIDAGHQYFQFYKSSIYFDPDCSSKDLDHGVLVVGYGFEGTDS 281

Query:   457 IPYWIVKNSWGSDWG 471
                WIVKNSW  +WG
Sbjct:   282 NNKWIVKNSWSPEWG 296

 Score = 193 (73.0 bits), Expect = 2.3e-22, Sum P(2) = 2.3e-22
 Identities = 51/142 (35%), Positives = 75/142 (52%)

Query:   513 NGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYG 572
             N GCNGG MD+A QY+ DN  + S+++YPY   +++  C              +  +P  
Sbjct:   165 NEGCNGGLMDNAFQYVKDNRCLDSEESYPYLGRDTDT-C-NYKPECSAAHDSGFVDLPQR 222

Query:   573 EEEEMKKWVATRGPLSVGMNANGL---FYYSGGVIDLNQRLCNPKAQNHALIIVGYGEEE 629
             E+  MK  +AT G ++V ++A      FY S    D +   C+ K  +H +++VGYG E 
Sbjct:   223 EKALMKA-MATLGSITVAIDAGHQYFQFYKSSIYFDPD---CSSKDLDHGVLVVGYGFE- 277

Query:   630 KKDGT-SIPYWIVKNSWGSDWG 650
                GT S   WIVKNSW  +WG
Sbjct:   278 ---GTDSNNKWIVKNSWSPEWG 296

 Score = 134 (52.2 bits), Expect = 2.3e-22, Sum P(2) = 2.3e-22
 Identities = 39/121 (32%), Positives = 59/121 (48%)

Query:    87 AVFEVN-KFFDLSDSDLQQ-----LTGLNL--DSTLEDIQPSLQAPFSSNQTDTEMRAFQ 138
             AV+E N K  +L + +  Q        +N   D T E+ +  +   F  NQ   + + FQ
Sbjct:    45 AVWEKNMKMIELHNREYSQGKHGFTMAMNAFGDMTNEEFRQVING-FQ-NQKHKKGKVFQ 102

Query:   139 FNSLRHGDDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
                     ++P++ DWR +G ++ VK QG+C  CWAFSA G  E     +  NL  LS Q
Sbjct:   103 EPLFA---EIPKSVDWREKGYVTPVKNQGQCGSCWAFSATGAFEGQMFWKTGNLVPLSEQ 159

Query:   199 H 199
             +
Sbjct:   160 N 160


>RGD|1564827 [details] [associations]
            symbol:RGD1564827 "similar to cathepsin M" species:10116 "Rattus
            norvegicus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00639 IPI:IPI00192321
            Ensembl:ENSRNOT00000023990 ArrayExpress:D3ZY04 Uniprot:D3ZY04
        Length = 338

 Score = 231 (86.4 bits), Expect = 5.3e-22, Sum P(2) = 5.3e-22
 Identities = 54/159 (33%), Positives = 83/159 (52%)

Query:   497 KLSRLATEKLVDCDMSNG--GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXX 554
             KL+ L+ + LVDC    G  GC GG   +A QY++ NGG+ S+  YPY+  E     L  
Sbjct:   164 KLTPLSVQNLVDCSKPQGNKGCRGGTTYNAFQYVLQNGGLESEATYPYEGKEG----LCR 219

Query:   555 XXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNA--NGLFYYSGGVIDLNQRLCN 612
                            P   E+ +   VAT+ P++ G++   + L +Y  G+   ++  CN
Sbjct:   220 YNPNSSAKITXICAPPQKNEDVLMDAVATK-PVAAGIHVVHSSLRFYKKGIY--HEPKCN 276

Query:   613 PKAQNHALIIVGYG-EEEKKDGTSIPYWIVKNSWGSDWG 650
                 NHA+++VGYG E  + DG +  YW+++NSWG  WG
Sbjct:   277 NYV-NHAVLVVGYGFEGNETDGNN--YWLIQNSWGERWG 312

 Score = 225 (84.3 bits), Expect = 3.4e-16, P = 3.4e-16
 Identities = 60/191 (31%), Positives = 85/191 (44%)

Query:   314 QGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSNG--GCNGGRMDDALQYII 371
             QG+C  CWAF  VG +E     +   LT LSVQ LVDC    G  GC GG   +A QY++
Sbjct:   139 QGRCNSCWAFPVVGAIEGQMFKKTGKLTPLSVQNLVDCSKPQGNKGCRGGTTYNAFQYVL 198

Query:   372 DNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSV 431
              NGG+ S+  YPY+  E     L                 P   E+ +   VAT+ P++ 
Sbjct:   199 QNGGLESEATYPYEGKEG----LCRYNPNSSAKITXICAPPQKNEDVLMDAVATK-PVAA 253

Query:   432 GMNA--NGLFYYSGGVID-------LNQRL----YG------TSIPYWIVKNSWGSDWGE 472
             G++   + L +Y  G+         +N  +    YG          YW+++NSWG  WG 
Sbjct:   254 GIHVVHSSLRFYKKGIYHEPKCNNYVNHAVLVVGYGFEGNETDGNNYWLIQNSWGERWGL 313

Query:   473 KVEDKVGSSGN 483
                 K+    N
Sbjct:   314 NGYMKIAKDRN 324

 Score = 84 (34.6 bits), Expect = 5.3e-22, Sum P(2) = 5.3e-22
 Identities = 16/34 (47%), Positives = 20/34 (58%)

Query:   166 QGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQH 199
             QG+C  CWAF  VG +E     +   LT LSVQ+
Sbjct:   139 QGRCNSCWAFPVVGAIEGQMFKKTGKLTPLSVQN 172


>DICTYBASE|DDB_G0279187 [details] [associations]
            symbol:cprG "cysteine proteinase 7" species:44689
            "Dictyostelium discoideum" [GO:0005615 "extracellular space"
            evidence=IDA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0005764 "lysosome" evidence=IEA] [GO:0044351
            "macropinocytosis" evidence=RCA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 dictyBase:DDB_G0279187 GO:GO:0005615
            GenomeReviews:CM000152_GR eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            EMBL:AAFI02000030 ProtClustDB:CLSZ2846820 MEROPS:C01.081
            EMBL:U72746 RefSeq:XP_641720.2 ProteinModelPortal:Q94504 SMR:Q94504
            PRIDE:Q94504 EnsemblProtists:DDB0215005 GeneID:8621915
            KEGG:ddi:DDB_G0279187 OMA:INTETEK Uniprot:Q94504
        Length = 460

 Score = 250 (93.1 bits), Expect = 6.1e-22, Sum P(2) = 6.1e-22
 Identities = 77/274 (28%), Positives = 129/274 (47%)

Query:   214 HENFVTN--VEKAEDYQSED-SGTA-VFGVNKFF--DLSESDLQQLTGLNL--DSTLEDI 265
             + N  TN  +     Y SE+ +G   +F  N  +  + +    + + GLN+  D + E+ 
Sbjct:    26 YRNAFTNWMIAHQRHYSSEEFNGRYNIFKANMDYVNEWNTKGSETVLGLNVFADISNEEY 85

Query:   266 QPS-LQAPFSSNQTD-TEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVKEQGKCACCWAF 323
             + + L  PF ++  + TE     F       D     DWR +G ++ +K QG+C  CW+F
Sbjct:    86 RATYLGTPFDASSLEMTESDKI-F-------DASAQVDWRTQGAVTPIKNQGQCGGCWSF 137

Query:   324 SAVGVVE-AMHAIQGN-SLTELSVQQLVDCDMS--NGGCNGGRMDDALQYIIDNGGVVSD 379
             S  G  E A +   G  +L  LS Q L+DC  S  N GC GG M  A +YII+N G+ ++
Sbjct:   138 STTGATEGAQYLANGKKNLVSLSEQNLIDCSGSYGNNGCEGGLMTLAFEYIINNKGIDTE 197

Query:   380 QAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLF 439
              +YPY A + ++ C              Y  +  G E ++   V T+GP SV ++A+   
Sbjct:   198 SSYPYTAEDGKK-C-KFNPKNVAAQLSSYVNVTSGSESDLAAKV-TQGPTSVAIDASNQS 254

Query:   440 Y--YSGGVIDLNQRLYGTSIPYWIVKNSWGSDWG 471
             +  Y  G+ +       T + + ++   +G+  G
Sbjct:   255 FQLYVSGIYN-EPACSSTQLDHGVLAVGFGTGSG 287

 Score = 171 (65.3 bits), Expect = 6.1e-21, Sum P(3) = 6.1e-21
 Identities = 42/141 (29%), Positives = 70/141 (49%)

Query:   498 LSRLATEKLVDCDMS--NGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXX 555
             L  L+ + L+DC  S  N GC GG M  A +YII+N G+ ++ +YPY A + ++ C    
Sbjct:   156 LVSLSEQNLIDCSGSYGNNGCEGGLMTLAFEYIINNKGIDTESSYPYTAEDGKK-C-KFN 213

Query:   556 XXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLFY--YSGGVIDLNQRLCNP 613
                       Y  +  G E ++   V T+GP SV ++A+   +  Y  G+   N+  C+ 
Sbjct:   214 PKNVAAQLSSYVNVTSGSESDLAAKV-TQGPTSVAIDASNQSFQLYVSGIY--NEPACSS 270

Query:   614 KAQNHALIIVGYGEEEKKDGT 634
                +H ++ VG+G      G+
Sbjct:   271 TQLDHGVLAVGFGTGSGSSGS 291

 Score = 118 (46.6 bits), Expect = 6.1e-21, Sum P(3) = 6.1e-21
 Identities = 41/146 (28%), Positives = 68/146 (46%)

Query:    66 HENFVTN--VEKAEDYQRED-SGTA-VFEVNKFF--DLSDSDLQQLTGLNL--DSTLEDI 117
             + N  TN  +     Y  E+ +G   +F+ N  +  + +    + + GLN+  D + E+ 
Sbjct:    26 YRNAFTNWMIAHQRHYSSEEFNGRYNIFKANMDYVNEWNTKGSETVLGLNVFADISNEEY 85

Query:   118 QPS-LQAPFSSNQTD-TEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVKEQGKCACCWAF 175
             + + L  PF ++  + TE     F       D     DWR +G ++ +K QG+C  CW+F
Sbjct:    86 RATYLGTPFDASSLEMTESDKI-F-------DASAQVDWRTQGAVTPIKNQGQCGGCWSF 137

Query:   176 SAVGVVE-AMHAIQGN-NLTELSVQH 199
             S  G  E A +   G  NL  LS Q+
Sbjct:   138 STTGATEGAQYLANGKKNLVSLSEQN 163

 Score = 87 (35.7 bits), Expect = 4.2e-08, Sum P(2) = 4.2e-08
 Identities = 18/42 (42%), Positives = 24/42 (57%)

Query:   430 SVGMNANGLFYYSGGVIDLNQRLYGTSIPYWIVKNSWGSDWG 471
             S   +A+G    S    + N  +Y T+  YWIVKNSWG+ WG
Sbjct:   389 SASGSASGSASGSSSGSNSNGGVYPTAGDYWIVKNSWGTSWG 430

 Score = 78 (32.5 bits), Expect = 7.8e-17, Sum P(3) = 7.8e-17
 Identities = 30/106 (28%), Positives = 51/106 (48%)

Query:    12 KGLGYLHTFMIKVALLESNIFQTRGYLNSPVTRFLNFMRDHDKVYSSVEDLLRRHENFVT 71
             K L  L   ++ VA  +  + +   Y N+    F N+M  H + YSS E+   R+  F  
Sbjct:     2 KVLSALCVLLVSVATAKQQLSEVE-YRNA----FTNWMIAHQRHYSS-EEFNGRYNIFKA 55

Query:    72 NVEKAEDYQREDSGTAVFEVNKFFDLSDSDLQQL-TGLNLD-STLE 115
             N++   ++  + S T V  +N F D+S+ + +    G   D S+LE
Sbjct:    56 NMDYVNEWNTKGSET-VLGLNVFADISNEEYRATYLGTPFDASSLE 100

 Score = 77 (32.2 bits), Expect = 6.1e-22, Sum P(2) = 6.1e-22
 Identities = 12/17 (70%), Positives = 14/17 (82%)

Query:   634 TSIPYWIVKNSWGSDWG 650
             T+  YWIVKNSWG+ WG
Sbjct:   414 TAGDYWIVKNSWGTSWG 430


>DICTYBASE|DDB_G0281079 [details] [associations]
            symbol:DDB_G0281079 species:44689 "Dictyostelium
            discoideum" [GO:0030246 "carbohydrate binding" evidence=IEA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR008965 InterPro:IPR013128 InterPro:IPR019028
            Pfam:PF00112 Pfam:PF09478 PRINTS:PR00705 SMART:SM00645
            SMART:SM01063 InterPro:IPR000169 dictyBase:DDB_G0281079
            GO:GO:0030246 EMBL:AAFI02000040 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00139
            PROSITE:PS00639 SUPFAM:SSF49384 RefSeq:XP_640804.1
            ProteinModelPortal:Q54UH2 EnsemblProtists:DDB0204000 GeneID:8622858
            KEGG:ddi:DDB_G0281079 InParanoid:Q54UH2 OMA:ALESHYY
            ProtClustDB:CLSZ2430562 Uniprot:Q54UH2
        Length = 664

 Score = 272 (100.8 bits), Expect = 6.7e-22, Sum P(2) = 6.7e-22
 Identities = 69/189 (36%), Positives = 97/189 (51%)

Query:   297 PEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSN- 355
             P + DWR  G++SKVK QG C  C+AFS VG +E+ +  + N + +LS Q LVDC  SN 
Sbjct:   471 PISIDWRTWGMVSKVKNQGSCGSCYAFSTVGALESHYYRKNNRMLDLSEQNLVDCTASNK 530

Query:   356 ---GGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIP 412
                GGC+GG M +   YI +NGG+  +  YPY   E + G               +  I 
Sbjct:   531 YRNGGCSGGWMHNCYSYIQENGGINQESTYPY---EGKFGQCRYNSGDAQSRISKFVMIK 587

Query:   413 YGEEEEMKKWVATRGPLSVGMNANG--LFYYSGGVI-DLNQRLYGTSIPYWIV--KNSWG 467
               +EE++   VA+ GP+SV  +A+     YYS G+    N   Y T+    +V   N  G
Sbjct:   588 QHDEEDLADTVASVGPVSVAYDASTREFMYYSRGIYYSDNCNKYRTTHAVVVVGYDNENG 647

Query:   468 SD-WGEKVE 475
              D W  KV+
Sbjct:   648 VDYWIIKVK 656

 Score = 203 (76.5 bits), Expect = 3.2e-14, Sum P(2) = 3.2e-14
 Identities = 66/205 (32%), Positives = 93/205 (45%)

Query:   451 RLYGTSIPYWIVKNSWGSDWGEKVEDKVGSSGNRTRDLELTGVLPSKLSR-------LAT 503
             RL   S P  I   +WG     KV+++ GS G+        G L S   R       L+ 
Sbjct:   464 RLLKWSRPISIDWRTWGMV--SKVKNQ-GSCGS-CYAFSTVGALESHYYRKNNRMLDLSE 519

Query:   504 EKLVDCDMSN----GGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXX 559
             + LVDC  SN    GGC+GG M +   YI +NGG+  +  YPY   E + G         
Sbjct:   520 QNLVDCTASNKYRNGGCSGGWMHNCYSYIQENGGINQESTYPY---EGKFGQCRYNSGDA 576

Query:   560 XXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANG--LFYYSGGVIDLNQRLCNPKAQN 617
                   +  I   +EE++   VA+ GP+SV  +A+     YYS G+   +   CN     
Sbjct:   577 QSRISKFVMIKQHDEEDLADTVASVGPVSVAYDASTREFMYYSRGIYYSDN--CNKYRTT 634

Query:   618 HALIIVGYGEEEKKDGTSIPYWIVK 642
             HA+++VGY  E   D     YWI+K
Sbjct:   635 HAVVVVGYDNENGVD-----YWIIK 654

 Score = 130 (50.8 bits), Expect = 2.7e-06, Sum P(2) = 2.7e-06
 Identities = 23/51 (45%), Positives = 34/51 (66%)

Query:   149 PEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQH 199
             P + DWR  G++SKVK QG C  C+AFS VG +E+ +  + N + +LS Q+
Sbjct:   471 PISIDWRTWGMVSKVKNQGSCGSCYAFSTVGALESHYYRKNNRMLDLSEQN 521

 Score = 61 (26.5 bits), Expect = 6.7e-22, Sum P(2) = 6.7e-22
 Identities = 23/90 (25%), Positives = 42/90 (46%)

Query:    45 FLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAEDYQRED-SGTAVFEVNKFFDLS-DSDL 102
             F+ +    ++ Y + + LL+ +E F  +    E Y+RE+ + T    + +F D++ D  L
Sbjct:   161 FIQWSNQFNRTYRADQFLLK-YEAFKDSSRFIEQYKRENQNSTMELGLTQFSDMTHDEFL 219

Query:   103 QQLTGLNLDSTLEDIQPSLQAPFSSNQTDT 132
                T    +  L +  PS  +  S N T T
Sbjct:   220 NVYTSKLYEFNLNETTPS-NSSCSVNITQT 248

 Score = 58 (25.5 bits), Expect = 1.4e-21, Sum P(2) = 1.4e-21
 Identities = 40/188 (21%), Positives = 72/188 (38%)

Query:    96 DLSDSDLQQLTGLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWR 155
             D+ D    +  G N+++   D   SL +   S   D        N     ++  E    R
Sbjct:    71 DIQDDKANRCKGFNINNNNVDGSDSLDSEIGSGG-DIS------NDSNGDNENNEENAKR 123

Query:   156 AEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQ-GNNLTELSVQHHDKVYSSVEDLLRRH 214
                 +      G+  C   F   G       ++  N+  + S Q + + Y + + LL+ +
Sbjct:   124 LHNHVHHHHHDGRDICGCIFGRYGKDCRKRELEYQNSFIQWSNQFN-RTYRADQFLLK-Y 181

Query:   215 ENFVTNVEKAEDYQSED-SGTAVFGVNKFFDLSESD-LQQLTGLNLDSTLEDIQPSLQAP 272
             E F  +    E Y+ E+ + T   G+ +F D++  + L   T    +  L +  PS  + 
Sbjct:   182 EAFKDSSRFIEQYKRENQNSTMELGLTQFSDMTHDEFLNVYTSKLYEFNLNETTPS-NSS 240

Query:   273 FSSNQTDT 280
              S N T T
Sbjct:   241 CSVNITQT 248


>RGD|1588248 [details] [associations]
            symbol:Cts8 "cathepsin 8" species:10116 "Rattus norvegicus"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            RGD:1588248 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00639 GeneTree:ENSGT00560000076577 IPI:IPI00765053
            RefSeq:NP_001121688.1 UniGene:Rn.220599 Ensembl:ENSRNOT00000061486
            GeneID:680718 KEGG:rno:680718 UCSC:RGD:1588248 CTD:56094
            OMA:DSEWQEW OrthoDB:EOG4JT07C NextBio:719350 Uniprot:D3ZP54
        Length = 333

 Score = 241 (89.9 bits), Expect = 1.5e-21, Sum P(2) = 1.5e-21
 Identities = 57/154 (37%), Positives = 78/154 (50%)

Query:   296 LPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSN 355
             LP+  DWR  G ++ VK QG C  CWAFS  G +E     +   L  LS Q LVDC    
Sbjct:   114 LPKFVDWRRRGYVTSVKNQGTCNSCWAFSVAGAIEGQMFRKTGRLVSLSPQNLVDCSRPE 173

Query:   356 G--GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPY 413
             G  GC+ G    AL+Y+  NGG+ ++  YPY+  E   G               +S +  
Sbjct:   174 GNHGCHMGSTLYALKYVWSNGGLEAESTYPYEGKE---GPCRYLPRRSAARVTGFSTVAR 230

Query:   414 GEEEEMKKWVATRGPLSVGMNANGL-F-YYSGGV 445
              EE  M   VAT GP+SVG++A+ + F +Y  G+
Sbjct:   231 SEEALMHA-VATIGPISVGIDASHVSFRFYRRGI 263

 Score = 223 (83.6 bits), Expect = 5.1e-16, P = 5.1e-16
 Identities = 70/218 (32%), Positives = 105/218 (48%)

Query:   438 LFYYSGGVIDLNQRLYGTSIPYWIVKNSWGSDWGEKVEDKVGSSGNRTRDLELTGVLPSK 497
             +F Y    +D  +R Y TS+     + +  S W   V   +   G   R    TG L S 
Sbjct:   110 IFRYLPKFVDWRRRGYVTSVKN---QGTCNSCWAFSVAGAI--EGQMFRK---TGRLVS- 160

Query:   498 LSRLATEKLVDCDMSNG--GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXX 555
                L+ + LVDC    G  GC+ G    AL+Y+  NGG+ ++  YPY+  E   G     
Sbjct:   161 ---LSPQNLVDCSRPEGNHGCHMGSTLYALKYVWSNGGLEAESTYPYEGKE---GPCRYL 214

Query:   556 XXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGL-F-YYSGGVIDLNQRLCNP 613
                       +S +   EE  M   VAT GP+SVG++A+ + F +Y  G+    +  C+ 
Sbjct:   215 PRRSAARVTGFSTVARSEEALMHA-VATIGPISVGIDASHVSFRFYRRGIY--YEPRCSS 271

Query:   614 KAQNHALIIVGYGEEEKK-DGTSIPYWIVKNSWGSDWG 650
                NH++++VGYG E ++ DG    YW++KNS G  WG
Sbjct:   272 NRINHSVLVVGYGYEGRESDGRK--YWLIKNSHGVGWG 307

 Score = 120 (47.3 bits), Expect = 8.2e-06, Sum P(2) = 8.2e-06
 Identities = 22/52 (42%), Positives = 28/52 (53%)

Query:   148 LPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQH 199
             LP+  DWR  G ++ VK QG C  CWAFS  G +E     +   L  LS Q+
Sbjct:   114 LPKFVDWRRRGYVTSVKNQGTCNSCWAFSVAGAIEGQMFRKTGRLVSLSPQN 165

 Score = 58 (25.5 bits), Expect = 1.5e-21, Sum P(2) = 1.5e-21
 Identities = 12/37 (32%), Positives = 16/37 (43%)

Query:   459 YWIVKNSWGSDWGEKVEDKVGSSGNRTRDLELTGVLP 495
             YW++KNS G  WG     K+    N    +   G  P
Sbjct:   295 YWLIKNSHGVGWGMNGYMKLARGWNNHCGIATYGFYP 331


>WB|WBGene00012747 [details] [associations]
            symbol:Y40H7A.10 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00139
            GeneTree:ENSGT00560000076599 HOGENOM:HOG000230773 EMBL:AL033510
            HSSP:P80067 MEROPS:C01.A48 PIR:T26792 RefSeq:NP_502836.1
            ProteinModelPortal:Q9XWA4 SMR:Q9XWA4 STRING:Q9XWA4
            EnsemblMetazoa:Y40H7A.10 GeneID:189809 KEGG:cel:CELE_Y40H7A.10
            UCSC:Y40H7A.10 CTD:189809 WormBase:Y40H7A.10 eggNOG:NOG286423
            InParanoid:Q9XWA4 OMA:NGPMIVC NextBio:943702 Uniprot:Q9XWA4
        Length = 343

 Score = 258 (95.9 bits), Expect = 2.8e-21, P = 2.8e-21
 Identities = 81/284 (28%), Positives = 128/284 (45%)

Query:   204 YSSVEDLLRRHENFVTNVEKAEDYQSEDSGTAVFGVNKFFDLSESDLQQLTGLNLDSTLE 263
             Y +  ++++R   F  N++  E Y  ED+G   + +N F DL+E               E
Sbjct:    62 YPNEYEIVKRFTIFSRNLDLVERYNKEDAGKVTYELNDFSDLTE---------------E 106

Query:   264 DIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWR-AEGV--ISKVKEQGKCACC 320
             + +  L  P    + D   ++ +  +L    +LP + DWR   G   ++ +K QG C  C
Sbjct:   107 EWKKYLMTP----KPDHSEKSLKPKTLIDKKNLPNSVDWRNVNGTNHVTGIKYQGPCGSC 162

Query:   321 WAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQ 380
             WAF+    +E+  +I G  L  LS QQL+DC + +  C GG   +AL+Y   + G+ +  
Sbjct:   163 WAFATAAAIESAVSISGGGLQSLSSQQLLDCTVVSDKCGGGEPVEALKYA-QSHGITTAH 221

Query:   381 AYPYKASESERGCLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMN--AN-G 437
              YPY    ++  C               S +    E+EM + VA  GP+ V  N   N  
Sbjct:   222 NYPYYFWTTK--CRETVPTVARIS----SWMKAESEDEMAQIVALNGPMIVCANFATNKN 275

Query:   438 LFYYSG-------GVIDLNQRLY-GTSIPYWIVKNSWGSDWGEK 473
              FY+SG       G    +  +  G    YWI+KN++   WGEK
Sbjct:   276 RFYHSGIAEDPDCGTEPTHALIVIGYGPDYWILKNTYSKVWGEK 319

 Score = 140 (54.3 bits), Expect = 1.6e-14, Sum P(2) = 1.6e-14
 Identities = 38/155 (24%), Positives = 70/155 (45%)

Query:    45 FLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAEDYQREDSGTAVFEVNKFFDLSDSDLQQ 104
             F NF+  + + Y +  ++++R   F  N++  E Y +ED+G   +E+N F DL++ + ++
Sbjct:    51 FQNFLVKYLREYPNEYEIVKRFTIFSRNLDLVERYNKEDAGKVTYELNDFSDLTEEEWKK 110

Query:   105 -LTGLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKV 163
              L     D + + ++P       +     + R    N   H               ++ +
Sbjct:   111 YLMTPKPDHSEKSLKPKTLIDKKNLPNSVDWR--NVNGTNH---------------VTGI 153

Query:   164 KEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
             K QG C  CWAF+    +E+  +I G  L  LS Q
Sbjct:   154 KYQGPCGSCWAFATAAAIESAVSISGGGLQSLSSQ 188

 Score = 121 (47.7 bits), Expect = 1.6e-14, Sum P(2) = 1.6e-14
 Identities = 40/134 (29%), Positives = 64/134 (47%)

Query:   498 LSRLATEKLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXX 557
             L  L++++L+DC + +  C GG   +AL+Y   + G+ +   YPY    ++  C      
Sbjct:   182 LQSLSSQQLLDCTVVSDKCGGGEPVEALKYA-QSHGITTAHNYPYYFWTTK--CRETVPT 238

Query:   558 XXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMN--AN-GLFYYSGGVIDLNQRLCNPK 614
                      S +    E+EM + VA  GP+ V  N   N   FY+SG   D +   C  +
Sbjct:   239 VARIS----SWMKAESEDEMAQIVALNGPMIVCANFATNKNRFYHSGIAEDPD---CGTE 291

Query:   615 AQNHALIIVGYGEE 628
                HALI++GYG +
Sbjct:   292 P-THALIVIGYGPD 304

 Score = 114 (45.2 bits), Expect = 8.6e-14, Sum P(2) = 8.6e-14
 Identities = 32/82 (39%), Positives = 44/82 (53%)

Query:   574 EEEMKKWVATRGPLSVGMN--AN-GLFYYSGGVIDLNQRLCNPKAQNHALIIVGYGEEEK 630
             E+EM + VA  GP+ V  N   N   FY+SG   D +   C  +   HALI++GYG +  
Sbjct:   251 EDEMAQIVALNGPMIVCANFATNKNRFYHSGIAEDPD---CGTEP-THALIVIGYGPD-- 304

Query:   631 KDGTSIPYWIVKNSWGSDWGEK 652
                    YWI+KN++   WGEK
Sbjct:   305 -------YWILKNTYSKVWGEK 319


>UNIPROTKB|Q5T8F0 [details] [associations]
            symbol:CTSL1 "Cathepsin L1 light chain" species:9606 "Homo
            sapiens" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 SMART:SM00645 InterPro:IPR000169
            HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00139
            EMBL:AL160279 UniGene:Hs.731507 UniGene:Hs.731952 HGNC:HGNC:2537
            ChiTaRS:CTSL1 IPI:IPI00640540 SMR:Q5T8F0 Ensembl:ENST00000342020
            ChEMBL:CHEMBL1293261 Uniprot:Q5T8F0
        Length = 225

 Score = 257 (95.5 bits), Expect = 3.6e-21, P = 3.6e-21
 Identities = 48/101 (47%), Positives = 63/101 (62%)

Query:   295 DLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCD-- 352
             + P + DWR +G ++ VK QG+C  CWAFSA G +E     +   L  LS Q LVDC   
Sbjct:   113 EAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGP 172

Query:   353 MSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGC 393
               N GCNGG MD A QY+ DNGG+ S+++YPY+A+ S   C
Sbjct:   173 QGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATVSGAPC 213

 Score = 147 (56.8 bits), Expect = 6.3e-08, P = 6.3e-08
 Identities = 41/116 (35%), Positives = 58/116 (50%)

Query:   438 LFYYSGGVIDLNQRLYGTSIPYWIVKNSWGSDWGEKVEDKVGSSGNRTRDLELTGVLPSK 497
             LFY +   +D  ++ Y T +     +   GS W       +   G   R    TG    +
Sbjct:   110 LFYEAPRSVDWREKGYVTPVKN---QGQCGSCWAFSATGAL--EGQMFRK---TG----R 157

Query:   498 LSRLATEKLVDCD--MSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGC 551
             L  L+ + LVDC     N GCNGG MD A QY+ DNGG+ S+++YPY+A+ S   C
Sbjct:   158 LISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATVSGAPC 213

 Score = 125 (49.1 bits), Expect = 3.2e-05, P = 3.2e-05
 Identities = 37/121 (30%), Positives = 60/121 (49%)

Query:    87 AVFEVN-KFFDLSDSDLQQ-----LTGLNL--DSTLEDIQPSLQAPFSSNQTDTEMRAFQ 138
             AV+E N K  +L + + ++        +N   D T E+ +  +   F  N+   + + FQ
Sbjct:    50 AVWEKNMKMIELHNQEYREGKHSFTMAMNAFGDMTSEEFRQVMNG-FQ-NRKPRKGKVFQ 107

Query:   139 FNSLRHGDDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
                L +  + P + DWR +G ++ VK QG+C  CWAFSA G +E     +   L  LS Q
Sbjct:   108 -EPLFY--EAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQ 164

Query:   199 H 199
             +
Sbjct:   165 N 165


>WB|WBGene00008231 [details] [associations]
            symbol:tag-329 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 GeneTree:ENSGT00560000076599
            eggNOG:NOG288820 EMBL:Z70750 HSSP:P53634 HOGENOM:HOG000019851
            PIR:T20110 RefSeq:NP_505458.1 ProteinModelPortal:Q18740 SMR:Q18740
            MEROPS:C01.A36 EnsemblMetazoa:C50F4.3 GeneID:183677
            KEGG:cel:CELE_C50F4.3 UCSC:C50F4.3 CTD:183677 WormBase:C50F4.3
            InParanoid:Q18740 OMA:WIFRNSW NextBio:921986 Uniprot:Q18740
        Length = 374

 Score = 212 (79.7 bits), Expect = 1.5e-18, Sum P(2) = 1.5e-18
 Identities = 73/275 (26%), Positives = 122/275 (44%)

Query:   194 ELSVQHH--DKVYSSVEDLLRRHE-NFVTNVEKAEDYQS-EDSGTAVFGVNKFFDLSESD 249
             E+++  +  +K+Y   ED + +++ N+   +EK   +Q    +   V  +NK    +  D
Sbjct:    32 EINIDRNNPEKLYKEFEDFIVKYKRNYKDEIEKKFRFQQFVATHNRVGKMNKAAKKAGHD 91

Query:   250 LQQLTGLNLDSTLEDIQ-PSLQAPFSSNQTDTEMRAFQFNSLR---HGDDLPEAFDWRAE 305
              +   G+N  S L   +   + + F   + +T +  F   +LR     + LP+ FD R +
Sbjct:    92 TKY--GINKFSDLSKKEIHGMYSKFGPPKNNTNVPKFNLKNLRVKRQMEGLPKTFDLRNK 149

Query:   306 GV-----ISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSNG-GCN 359
              V     I  +K Q  CACCW F+A  V EA   +       LS Q++ DC   +G GCN
Sbjct:   150 KVGGHYIIGPIKTQDSCACCWGFAATAVAEAALTVHLKKAMNLSEQEVCDCAPKHGPGCN 209

Query:   360 GGRMDDALQYIIDNGGVVSDQAYPYKASESER-G-CLXXXXXXXXX--XXXXYSRIPYGE 415
             GG   D L+YI + G +   + YP+  + S + G C                Y+  P+  
Sbjct:   210 GGDPVDGLEYIKEMG-LTGGKEYPFNVNRSTQLGRCESEKYDRELNPLELDYYAIDPFNA 268

Query:   416 EEEMKKWVATRG-PLSVGMNANG-LFYYSGGVIDL 448
             E +M   +     P+SV       L  Y  G+++L
Sbjct:   269 EYQMTHHLYLLNLPISVAFRTGASLSSYLSGILEL 303

 Score = 177 (67.4 bits), Expect = 7.1e-21, Sum P(2) = 7.1e-21
 Identities = 52/182 (28%), Positives = 83/182 (45%)

Query:   479 GSSGNRTRDLELTGVLPSKLSRLATEKLVDCDMSNG-GCNGGRMDDALQYIIDNGGVVSD 537
             G +     +  LT  L  K   L+ +++ DC   +G GCNGG   D L+YI + G +   
Sbjct:   171 GFAATAVAEAALTVHL-KKAMNLSEQEVCDCAPKHGPGCNGGDPVDGLEYIKEMG-LTGG 228

Query:   538 QAYPYKASESER-G-CLXXXXXXXXX--XXXXYSRIPYGEEEEMKKWVATRG-PLSVGMN 592
             + YP+  + S + G C                Y+  P+  E +M   +     P+SV   
Sbjct:   229 KEYPFNVNRSTQLGRCESEKYDRELNPLELDYYAIDPFNAEYQMTHHLYLLNLPISVAFR 288

Query:   593 ANG-LFYYSGGVIDLNQRLCNPKAQNH--ALIIVGYGEEEKKDGTSIPYWIVKNSWGSDW 649
                 L  Y  G+++L    C+ +   H  +  IVGYG  +   G ++ YWI +NSW +DW
Sbjct:   289 TGASLSSYLSGILELAD--CDDEKGGHWHSGAIVGYGTTKNSAGRTVDYWIFRNSWWTDW 346

Query:   650 GE 651
             G+
Sbjct:   347 GD 348

 Score = 143 (55.4 bits), Expect = 7.1e-21, Sum P(2) = 7.1e-21
 Identities = 47/167 (28%), Positives = 81/167 (48%)

Query:    46 LNFMRDH-DKVYSSVEDLLRRHE-NFVTNVEKAEDYQR-EDSGTAVFEVNKFFDLSDSDL 102
             +N  R++ +K+Y   ED + +++ N+   +EK   +Q+   +   V ++NK    +  D 
Sbjct:    33 INIDRNNPEKLYKEFEDFIVKYKRNYKDEIEKKFRFQQFVATHNRVGKMNKAAKKAGHDT 92

Query:   103 QQLTGLNLDSTLEDIQ-PSLQAPFSSNQTDTEMRAFQFNSLR---HGDDLPEAFDWRAEG 158
             +   G+N  S L   +   + + F   + +T +  F   +LR     + LP+ FD R + 
Sbjct:    93 KY--GINKFSDLSKKEIHGMYSKFGPPKNNTNVPKFNLKNLRVKRQMEGLPKTFDLRNKK 150

Query:   159 V-----ISKVKEQGKCACCWAFSAVGVVEA---MHAIQGNNLTELSV 197
             V     I  +K Q  CACCW F+A  V EA   +H  +  NL+E  V
Sbjct:   151 VGGHYIIGPIKTQDSCACCWGFAATAVAEAALTVHLKKAMNLSEQEV 197

 Score = 81 (33.6 bits), Expect = 1.5e-18, Sum P(2) = 1.5e-18
 Identities = 13/38 (34%), Positives = 23/38 (60%)

Query:   437 GLFYYSGGVIDLN--QRLYGTSIPYWIVKNSWGSDWGE 472
             G  ++SG ++     +   G ++ YWI +NSW +DWG+
Sbjct:   311 GGHWHSGAIVGYGTTKNSAGRTVDYWIFRNSWWTDWGD 348

 Score = 40 (19.1 bits), Expect = 1.0e-10, Sum P(3) = 1.0e-10
 Identities = 11/23 (47%), Positives = 14/23 (60%)

Query:   337 GNSLTE-LS-VQQLVDCDMSNGG 357
             G SL+  LS + +L DCD   GG
Sbjct:   290 GASLSSYLSGILELADCDDEKGG 312


>WB|WBGene00013764 [details] [associations]
            symbol:Y113G7B.15 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 GeneTree:ENSGT00560000076599
            EMBL:AL110477 HOGENOM:HOG000019851 RefSeq:NP_507904.2
            ProteinModelPortal:Q9U2X1 SMR:Q9U2X1 DIP:DIP-25339N IntAct:Q9U2X1
            MINT:MINT-1058673 STRING:Q9U2X1 MEROPS:C01.A47
            EnsemblMetazoa:Y113G7B.15 GeneID:190976 KEGG:cel:CELE_Y113G7B.15
            UCSC:Y113G7B.15 CTD:190976 WormBase:Y113G7B.15 eggNOG:NOG302449
            OMA:AEEDIME Uniprot:Q9U2X1
        Length = 362

 Score = 208 (78.3 bits), Expect = 2.0e-19, Sum P(2) = 2.0e-19
 Identities = 75/287 (26%), Positives = 119/287 (41%)

Query:   190 NNLTELSVQHHDKVYSSVEDLLRRHENFVTNVEKAEDYQSE---DSGTAVFGVNKFFDLS 246
             NN T     HH K Y +  +  RR  +F  N +K ++  ++   +     FG NKF D +
Sbjct:    31 NNFT----MHHKKHYRTPAEKDRRLAHFAKNHQKIQELNAKARREGRNVTFGWNKFADKN 86

Query:   247 ESDLQQLTGLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWR--- 303
                 Q+L+  N  S +     +    +               S R   D+P+ FD R   
Sbjct:    87 R---QELSARN--SKIHPKNHTDLPIYKPRHPRGSRNHHNKRSKRQSGDIPDYFDLRDIY 141

Query:   304 AEG--VISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSNG--GCN 359
              +G  V+  VK+Q +C CCWAF+   + EA + +   S T LS Q++ DC  S    GC 
Sbjct:   142 VDGSPVVGPVKDQEQCGCCWAFATTAITEAANTLYSKSFTSLSDQEICDCADSGDTPGCV 201

Query:   360 GGRMDDALQYIIDNGGVVSDQAYPYKA--SESERGCLXXXXXXXXX-XXXXYSRIP--YG 414
             GG   + L+ ++   G  SD  YPY+   + +   C+                R    Y 
Sbjct:   202 GGDPRNGLK-MVHLRGQSSDGDYPYEEYRANTTGNCVGDEKSTVIQPETLNVYRFDQDYA 260

Query:   415 EEEEMKKWVATRGPLSVGMNANGLF-YYSGGVIDLNQRLYGTSIPYW 460
             EE+ M+       P +V       F +Y+ GV+  ++  Y  +   W
Sbjct:   261 EEDIMENLYLNHIPTAVYFRVGENFEWYTSGVLQ-SEDCYQMTPAEW 306

 Score = 162 (62.1 bits), Expect = 2.3e-20, Sum P(2) = 2.3e-20
 Identities = 51/171 (29%), Positives = 77/171 (45%)

Query:   489 ELTGVLPSK-LSRLATEKLVDCDMSNG--GCNGGRMDDALQYIIDNGGVVSDQAYPYKA- 544
             E    L SK  + L+ +++ DC  S    GC GG   + L+ ++   G  SD  YPY+  
Sbjct:   170 EAANTLYSKSFTSLSDQEICDCADSGDTPGCVGGDPRNGLK-MVHLRGQSSDGDYPYEEY 228

Query:   545 -SESERGCLXXXXXXXXX-XXXXYSRIP--YGEEEEMKKWVATRGPLSVGMNANGLF-YY 599
              + +   C+                R    Y EE+ M+       P +V       F +Y
Sbjct:   229 RANTTGNCVGDEKSTVIQPETLNVYRFDQDYAEEDIMENLYLNHIPTAVYFRVGENFEWY 288

Query:   600 SGGVIDLNQRLCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWG 650
             + GV+          A+ H++ IVGYG  +  DG  +PYW+V+NSW SDWG
Sbjct:   289 TSGVLQSEDCYQMTPAEWHSVAIVGYGTSD--DG--VPYWLVRNSWNSDWG 335

 Score = 154 (59.3 bits), Expect = 2.3e-20, Sum P(2) = 2.3e-20
 Identities = 45/165 (27%), Positives = 72/165 (43%)

Query:    42 VTRFLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAEDYQ---REDSGTAVFEVNKFFDLS 98
             ++ F NF   H K Y +  +  RR  +F  N +K ++     R +     F  NKF   +
Sbjct:    27 LSHFNNFTMHHKKHYRTPAEKDRRLAHFAKNHQKIQELNAKARREGRNVTFGWNKF---A 83

Query:    99 DSDLQQLTGLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWR--- 155
             D + Q+L+  N  S +     +    +               S R   D+P+ FD R   
Sbjct:    84 DKNRQELSARN--SKIHPKNHTDLPIYKPRHPRGSRNHHNKRSKRQSGDIPDYFDLRDIY 141

Query:   156 AEG--VISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
              +G  V+  VK+Q +C CCWAF+   + EA + +   + T LS Q
Sbjct:   142 VDGSPVVGPVKDQEQCGCCWAFATTAITEAANTLYSKSFTSLSDQ 186

 Score = 93 (37.8 bits), Expect = 2.0e-19, Sum P(2) = 2.0e-19
 Identities = 15/22 (68%), Positives = 18/22 (81%)

Query:   453 YGTS---IPYWIVKNSWGSDWG 471
             YGTS   +PYW+V+NSW SDWG
Sbjct:   314 YGTSDDGVPYWLVRNSWNSDWG 335


>DICTYBASE|DDB_G0282991 [details] [associations]
            symbol:DDB_G0282991 species:44689 "Dictyostelium
            discoideum" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 SMART:SM00645
            dictyBase:DDB_G0282991 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            EMBL:AAFI02000049 eggNOG:NOG331187 RefSeq:XP_639299.1
            ProteinModelPortal:Q54RQ2 EnsemblProtists:DDB0185304 GeneID:8623870
            KEGG:ddi:DDB_G0282991 InParanoid:Q54RQ2 OMA:PENGNEY Uniprot:Q54RQ2
        Length = 339

 Score = 226 (84.6 bits), Expect = 1.3e-18, Sum P(2) = 1.3e-18
 Identities = 74/255 (29%), Positives = 121/255 (47%)

Query:   200 HDKVYSSVEDLLRRHENFVTNVEKAEDYQSEDSGTAVFGVNKFFDLSESDLQQLTGLNLD 259
             ++K+YS+ E  +R   NF  N E  + +  +   T +  +N F DLS ++       N  
Sbjct:    34 YNKIYSNKEFYMR-FNNFKKNKEYVDQWNEKQLET-ILELNFFADLSRNEYIN----NYL 87

Query:   260 STLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVKEQGKCAC 319
             ++  DI        +  Q +T+   ++ N   + ++  ++ DWR    ++ VK QG C+ 
Sbjct:    88 ASFIDIS-------NIEQKNTK---YEGNLKNNFNNSIKSIDWRNFDAVTPVKNQGLCSG 137

Query:   320 C-WAFSAVGVVEAMHAIQGNSLTELSVQQLVDC--DMSNGGCNGGRMDDALQYIIDNGGV 376
               ++FSA+GV+E+ H I+   L  LS Q ++DC  DM N GC GG    A  YII   G+
Sbjct:   138 AGYSFSAIGVIESSHFIKNKELITLSEQNIIDCTTDMGNNGCMGGLALIAFDYIIKQKGI 197

Query:   377 VSDQAYPYKASESE----RGCLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVG 432
              S+  YPY+    E    RG               Y  I    E E+ + +  + P+SV 
Sbjct:   198 DSEFNYPYEGYLIEPYEGRGRCRYNSFYSKASISSYIEIERFNENELTQSLI-KSPVSVM 256

Query:   433 MNANGLFY--YSGGV 445
             ++A+ L +  Y  GV
Sbjct:   257 IDASQLSFMLYKSGV 271

 Score = 176 (67.0 bits), Expect = 2.7e-20, Sum P(2) = 2.7e-20
 Identities = 51/164 (31%), Positives = 78/164 (47%)

Query:   497 KLSRLATEKLVDC--DMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESE----RG 550
             +L  L+ + ++DC  DM N GC GG    A  YII   G+ S+  YPY+    E    RG
Sbjct:   158 ELITLSEQNIIDCTTDMGNNGCMGGLALIAFDYIIKQKGIDSEFNYPYEGYLIEPYEGRG 217

Query:   551 CLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLFY--YSGGVIDLNQ 608
                            Y  I    E E+ + +  + P+SV ++A+ L +  Y  GV     
Sbjct:   218 RCRYNSFYSKASISSYIEIERFNENELTQSLI-KSPVSVMIDASQLSFMLYKSGVY--KD 274

Query:   609 RLCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
               C+    NH ++ +G+G   + +G    Y+I+KNS+GS WG K
Sbjct:   275 PSCSSTILNHGILNIGFGVTPE-NGNE--YYILKNSFGSKWGMK 315

 Score = 136 (52.9 bits), Expect = 2.7e-20, Sum P(2) = 2.7e-20
 Identities = 43/156 (27%), Positives = 79/156 (50%)

Query:    45 FLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAEDYQREDSGTAVFEVNKFFDLSDSDLQQ 104
             F+ +   ++K+YS+ E  +R   NF  N E  + +  +   T + E+N F DLS ++   
Sbjct:    27 FIEWTNKYNKIYSNKEFYMR-FNNFKKNKEYVDQWNEKQLET-ILELNFFADLSRNEYIN 84

Query:   105 LTGLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVK 164
                 N  ++  DI        +  Q +T+   ++ N   + ++  ++ DWR    ++ VK
Sbjct:    85 ----NYLASFIDIS-------NIEQKNTK---YEGNLKNNFNNSIKSIDWRNFDAVTPVK 130

Query:   165 EQGKCACC-WAFSAVGVVEAMHAIQGNNLTELSVQH 199
              QG C+   ++FSA+GV+E+ H I+   L  LS Q+
Sbjct:   131 NQGLCSGAGYSFSAIGVIESSHFIKNKELITLSEQN 166

 Score = 59 (25.8 bits), Expect = 1.3e-18, Sum P(2) = 1.3e-18
 Identities = 10/15 (66%), Positives = 13/15 (86%)

Query:   459 YWIVKNSWGSDWGEK 473
             Y+I+KNS+GS WG K
Sbjct:   301 YYILKNSFGSKWGMK 315


>DICTYBASE|DDB_G0281605 [details] [associations]
            symbol:cfaD "peptidase C1A family protein"
            species:44689 "Dictyostelium discoideum" [GO:0005615 "extracellular
            space" evidence=IDA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IDA] [GO:0006508 "proteolysis" evidence=IDA] [GO:0031410
            "cytoplasmic vesicle" evidence=IDA] [GO:0031288 "sorocarp
            morphogenesis" evidence=IMP] [GO:0008285 "negative regulation of
            cell proliferation" evidence=IGI;IDA] [GO:0005576 "extracellular
            region" evidence=IEA;IDA] [GO:0005515 "protein binding"
            evidence=IPI] [GO:0044351 "macropinocytosis" evidence=RCA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 dictyBase:DDB_G0281605
            GO:GO:0008285 GO:GO:0005615 GenomeReviews:CM000152_GR
            eggNOG:COG4870 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0031410 EMBL:AAFI02000042
            GO:GO:0031288 RefSeq:XP_640530.1 HSSP:P07711
            ProteinModelPortal:Q54TR1 STRING:Q54TR1 PRIDE:Q54TR1
            EnsemblProtists:DDB0229857 GeneID:8623140 KEGG:ddi:DDB_G0281605
            InParanoid:Q54TR1 OMA:PSAHEHE ProtClustDB:CLSZ2430523
            Uniprot:Q54TR1
        Length = 531

 Score = 247 (92.0 bits), Expect = 9.6e-20, Sum P(2) = 9.6e-20
 Identities = 73/275 (26%), Positives = 120/275 (43%)

Query:   190 NNLTELSVQHH--DKVYSSVEDLLRRHENFVTNVEKA------EDYQSEDSGTAVF---- 237
             NN  E   + +  D ++SS+ D L   E   +N+ K       ++Y S+D     F    
Sbjct:   191 NNAKEAPAKENQFDGLFSSIGDNLLAKEEQASNLFKEYKAQYNKEYSSQDEHDERFINFK 250

Query:   238 GVNKFFDLSES-DLQQLTGLNLDSTLEDIQ-PSLQAPFSSNQTDTEMRAFQFN-SLRHGD 294
                K      + +     G+N  + L + +  +L  P  +  + T   +   + SLR   
Sbjct:   251 AARKIIATHNAKESSYKLGMNHYADLSNKEFNTLVKPKVARPSVTGADSVHDDESLR--- 307

Query:   295 DLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMS 354
              +P   DWR +  ++ VK+QG C  CW F + G +E  + +    L  LS QQLVDC + 
Sbjct:   308 SIPSTVDWRNQNCVTPVKDQGICGSCWTFGSTGSLEGTNCVTNGELVSLSEQQLVDCAIL 367

Query:   355 NG--GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIP 412
              G  GC GG    A QY+++ G + ++  YPY        C              Y  + 
Sbjct:   368 TGSQGCGGGFASSAFQYVMEIGSLATESNYPYLMQNGL--CRDRTVTPSGVSITGYVNVT 425

Query:   413 YGEEEEMKKWVATRGPLSVGMNAN--GLFYYSGGV 445
              G E  ++  +AT GP+++ ++A+     YY  GV
Sbjct:   426 SGSESALQNAIATTGPVAIAIDASVDDFRYYMSGV 460

 Score = 202 (76.2 bits), Expect = 3.0e-14, Sum P(2) = 3.0e-14
 Identities = 51/173 (29%), Positives = 82/173 (47%)

Query:   485 TRDLELTG-VLPSKLSRLATEKLVDCDMSNG--GCNGGRMDDALQYIIDNGGVVSDQAYP 541
             T  LE T  V   +L  L+ ++LVDC +  G  GC GG    A QY+++ G + ++  YP
Sbjct:   339 TGSLEGTNCVTNGELVSLSEQQLVDCAILTGSQGCGGGFASSAFQYVMEIGSLATESNYP 398

Query:   542 YKASESERGCLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNAN--GLFYY 599
             Y        C              Y  +  G E  ++  +AT GP+++ ++A+     YY
Sbjct:   399 YLMQNGL--CRDRTVTPSGVSITGYVNVTSGSESALQNAIATTGPVAIAIDASVDDFRYY 456

Query:   600 SGGVIDLNQRLCNPKAQN--HALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWG 650
               GV   N   C     +  H ++ +GYG  + +D     Y++VKNSW ++WG
Sbjct:   457 MSGVY--NNPACKNGLDDLDHEVLAIGYGTYQGQD-----YFLVKNSWSTNWG 502

 Score = 113 (44.8 bits), Expect = 2.7e-05, Sum P(3) = 2.7e-05
 Identities = 18/51 (35%), Positives = 27/51 (52%)

Query:   148 LPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
             +P   DWR +  ++ VK+QG C  CW F + G +E  + +    L  LS Q
Sbjct:   309 IPSTVDWRNQNCVTPVKDQGICGSCWTFGSTGSLEGTNCVTNGELVSLSEQ 359

 Score = 63 (27.2 bits), Expect = 9.6e-20, Sum P(2) = 9.6e-20
 Identities = 14/34 (41%), Positives = 21/34 (61%)

Query:   444 GVIDLNQRL----YGT--SIPYWIVKNSWGSDWG 471
             G+ DL+  +    YGT     Y++VKNSW ++WG
Sbjct:   469 GLDDLDHEVLAIGYGTYQGQDYFLVKNSWSTNWG 502

 Score = 59 (25.8 bits), Expect = 3.0e-14, Sum P(2) = 3.0e-14
 Identities = 20/79 (25%), Positives = 37/79 (46%)

Query:   175 FSAVGVVEAMHAIQGNNLTELSVQHHDKVYSSVEDLLRRHENFVTNVEKAEDYQSEDSGT 234
             FS++G        Q +NL +     ++K YSS ++   R  NF    +    + +++S  
Sbjct:   207 FSSIGDNLLAKEEQASNLFKEYKAQYNKEYSSQDEHDERFINFKAARKIIATHNAKESSY 266

Query:   235 AVFGVNKFFDLSESDLQQL 253
              + G+N + DLS  +   L
Sbjct:   267 KL-GMNHYADLSNKEFNTL 284

 Score = 44 (20.5 bits), Expect = 2.7e-05, Sum P(3) = 2.7e-05
 Identities = 14/61 (22%), Positives = 28/61 (45%)

Query:    45 FLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAEDYQREDSGTAVFEVNKFFDLSDSDLQQ 104
             F  +   ++K YSS ++   R  NF    +    +  ++S   +  +N + DLS+ +   
Sbjct:   225 FKEYKAQYNKEYSSQDEHDERFINFKAARKIIATHNAKESSYKL-GMNHYADLSNKEFNT 283

Query:   105 L 105
             L
Sbjct:   284 L 284


>UNIPROTKB|E2QXH3 [details] [associations]
            symbol:TINAGL1 "Uncharacterized protein" species:9615
            "Canis lupus familiaris" [GO:0043236 "laminin binding"
            evidence=IEA] [GO:0031012 "extracellular matrix" evidence=IEA]
            [GO:0005737 "cytoplasm" evidence=IEA] [GO:0030247 "polysaccharide
            binding" evidence=IEA] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0006955 "immune response" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0005044 "scavenger
            receptor activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR001212 InterPro:IPR013128 Pfam:PF00112 PROSITE:PS00524
            PROSITE:PS50958 SMART:SM00201 SMART:SM00645 GO:GO:0005737
            GO:GO:0006955 GO:GO:0030247 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00639 GO:GO:0031012
            GO:GO:0005044 GeneTree:ENSGT00560000076599 CTD:64129 OMA:DNCNRCT
            EMBL:AAEX03001668 RefSeq:XP_535330.3 Ensembl:ENSCAFT00000035659
            GeneID:478155 KEGG:cfa:478155 NextBio:20853523 Uniprot:E2QXH3
        Length = 467

 Score = 166 (63.5 bits), Expect = 7.7e-18, Sum P(2) = 7.7e-18
 Identities = 43/108 (39%), Positives = 59/108 (54%)

Query:   290 LRHGDDLPEAFDWRAE---GVISKVKEQGKCACCWAFSAVGVVE---AMHAIQGNSLTEL 343
             LR G+ LP AF+  AE    +I +  +QG CA  WAFS   V     ++H++ G+    L
Sbjct:   197 LRPGEVLPTAFE-AAEKWPNLIHEPLDQGNCAGSWAFSTAAVASDRVSIHSL-GHMTPVL 254

Query:   344 SVQQLVDCDMSNG-GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESE 390
             S Q L+ CD  N  GC GGR+D A  + +   GVVSD  YP+   E +
Sbjct:   255 SPQNLLSCDTHNQQGCRGGRLDGAW-WFLRRRGVVSDHCYPFVGREQD 301

 Score = 130 (50.8 bits), Expect = 7.7e-18, Sum P(2) = 7.7e-18
 Identities = 29/85 (34%), Positives = 42/85 (49%)

Query:   574 EEEMKKWVATRGPLSVGMNAN-GLFYYSGGVIDLNQ-RLCNPKAQN----HALIIVGYGE 627
             E+E+ K +   GP+   M  +   F Y GG+       L  P+       H++ I G+GE
Sbjct:   350 EKEIMKELMENGPVQALMEVHEDFFLYQGGIYSHTPVSLGRPERYRRHGTHSVKITGWGE 409

Query:   628 EEKKDGTSIPYWIVKNSWGSDWGEK 652
             E   DG ++ YW   NSWG  WGE+
Sbjct:   410 ETLPDGRTLKYWTAANSWGPAWGER 434

 Score = 96 (38.9 bits), Expect = 6.4e-14, Sum P(3) = 6.4e-14
 Identities = 20/49 (40%), Positives = 28/49 (57%)

Query:   501 LATEKLVDCDMSNG-GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESE 548
             L+ + L+ CD  N  GC GGR+D A  + +   GVVSD  YP+   E +
Sbjct:   254 LSPQNLLSCDTHNQQGCRGGRLDGAW-WFLRRRGVVSDHCYPFVGREQD 301

 Score = 76 (31.8 bits), Expect = 6.4e-14, Sum P(3) = 6.4e-14
 Identities = 24/64 (37%), Positives = 35/64 (54%)

Query:   142 LRHGDDLPEAFDWRAE---GVISKVKEQGKCACCWAFSAVGVVE---AMHAIQGNNLTEL 195
             LR G+ LP AF+  AE    +I +  +QG CA  WAFS   V     ++H++ G+    L
Sbjct:   197 LRPGEVLPTAFE-AAEKWPNLIHEPLDQGNCAGSWAFSTAAVASDRVSIHSL-GHMTPVL 254

Query:   196 SVQH 199
             S Q+
Sbjct:   255 SPQN 258

 Score = 75 (31.5 bits), Expect = 3.9e-12, Sum P(2) = 3.9e-12
 Identities = 13/36 (36%), Positives = 19/36 (52%)

Query:   454 GTSIPYWIVKNSWGSDWGEKVEDKVGSSGNRTRDLE 489
             G ++ YW   NSWG  WGE+   ++    N   D+E
Sbjct:   415 GRTLKYWTAANSWGPAWGERGHFRIVRGANEC-DIE 449


>UNIPROTKB|E9PKT6 [details] [associations]
            symbol:CTSH "Cathepsin H" species:9606 "Homo sapiens"
            [GO:0001520 "outer dense fiber" evidence=IEA] [GO:0001656
            "metanephros development" evidence=IEA] [GO:0001669 "acrosomal
            vesicle" evidence=IEA] [GO:0001913 "T cell mediated cytotoxicity"
            evidence=IEA] [GO:0004177 "aminopeptidase activity" evidence=IEA]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=IEA]
            [GO:0004252 "serine-type endopeptidase activity" evidence=IEA]
            [GO:0007283 "spermatogenesis" evidence=IEA] [GO:0008284 "positive
            regulation of cell proliferation" evidence=IEA] [GO:0010634
            "positive regulation of epithelial cell migration" evidence=IEA]
            [GO:0016505 "apoptotic protease activator activity" evidence=IEA]
            [GO:0030984 "kininogen binding" evidence=IEA] [GO:0031638 "zymogen
            activation" evidence=IEA] [GO:0031648 "protein destabilization"
            evidence=IEA] [GO:0032403 "protein complex binding" evidence=IEA]
            [GO:0032526 "response to retinoic acid" evidence=IEA] [GO:0033619
            "membrane protein proteolysis" evidence=IEA] [GO:0043066 "negative
            regulation of apoptotic process" evidence=IEA] [GO:0043621 "protein
            self-association" evidence=IEA] [GO:0045766 "positive regulation of
            angiogenesis" evidence=IEA] [GO:0060448 "dichotomous subdivision of
            terminal units involved in lung branching" evidence=IEA]
            [GO:0097067 "cellular response to thyroid hormone stimulus"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            InterPro:IPR000169 GO:GO:0043066 GO:GO:0008284 PANTHER:PTHR12411
            PROSITE:PS00139 GO:GO:0045766 GO:GO:0004252 GO:GO:0032526
            GO:GO:0016505 GO:GO:0010634 GO:GO:0004197 GO:GO:0031648
            GO:GO:0031638 GO:GO:0001913 GO:GO:0060448 GO:GO:0033619
            EMBL:AC011944 HGNC:HGNC:2535 IPI:IPI00375426
            ProteinModelPortal:E9PKT6 SMR:E9PKT6 PRIDE:E9PKT6
            Ensembl:ENST00000528741 ArrayExpress:E9PKT6 Bgee:E9PKT6
            Uniprot:E9PKT6
        Length = 134

 Score = 222 (83.2 bits), Expect = 2.2e-17, P = 2.2e-17
 Identities = 44/105 (41%), Positives = 59/105 (56%)

Query:   284 AFQFNSLRHGDDLPEAFDWRAEG-VISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTE 342
             A + N LR     P + DWR +G  +S VK QG C  CW FS  G +E+  AI    +  
Sbjct:    28 ATKSNYLRGTGPYPPSVDWRKKGNFVSPVKNQGACGSCWTFSTTGALESAIAIATGKMLS 87

Query:   343 LSVQQLVDC--DMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYK 385
             L+ QQLVDC  D +N GC GG    A +YI+ N G++ +  YPY+
Sbjct:    88 LAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYNKGIMGEDTYPYQ 132

 Score = 117 (46.2 bits), Expect = 4.2e-06, P = 4.2e-06
 Identities = 25/64 (39%), Positives = 33/64 (51%)

Query:   136 AFQFNSLRHGDDLPEAFDWRAEG-VISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTE 194
             A + N LR     P + DWR +G  +S VK QG C  CW FS  G +E+  AI    +  
Sbjct:    28 ATKSNYLRGTGPYPPSVDWRKKGNFVSPVKNQGACGSCWTFSTTGALESAIAIATGKMLS 87

Query:   195 LSVQ 198
             L+ Q
Sbjct:    88 LAEQ 91

 Score = 115 (45.5 bits), Expect = 6.8e-06, P = 6.8e-06
 Identities = 21/53 (39%), Positives = 32/53 (60%)

Query:   493 VLPSKLSRLATEKLVDC--DMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYK 543
             +   K+  LA ++LVDC  D +N GC GG    A +YI+ N G++ +  YPY+
Sbjct:    80 IATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYNKGIMGEDTYPYQ 132


>MGI|MGI:109553 [details] [associations]
            symbol:Ctsc "cathepsin C" species:10090 "Mus musculus"
            [GO:0001913 "T cell mediated cytotoxicity" evidence=IGI]
            [GO:0004252 "serine-type endopeptidase activity" evidence=IMP]
            [GO:0005764 "lysosome" evidence=ISO] [GO:0005783 "endoplasmic
            reticulum" evidence=ISO] [GO:0005794 "Golgi apparatus"
            evidence=ISO] [GO:0006508 "proteolysis" evidence=ISO;IMP]
            [GO:0008233 "peptidase activity" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=ISO] [GO:0010033
            "response to organic substance" evidence=ISO] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0031404 "chloride ion
            binding" evidence=ISO] [GO:0042802 "identical protein binding"
            evidence=ISO] [GO:0043621 "protein self-association" evidence=ISO]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 MGI:MGI:109553 GO:GO:0005783
            GO:GO:0005794 GO:GO:0007568 GO:GO:0010033 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0004252 GO:GO:0005764 GO:GO:0031404 CTD:1075
            HOGENOM:HOG000068022 HOVERGEN:HBG005248 KO:K01275 OrthoDB:EOG4H19VZ
            InterPro:IPR014882 Pfam:PF08773 MEROPS:C01.070 OMA:YDDFLHY
            GO:GO:0001913 EMBL:U89269 EMBL:U74683 EMBL:BC067063 IPI:IPI00130015
            RefSeq:NP_034112.3 UniGene:Mm.322945 ProteinModelPortal:P97821
            SMR:P97821 STRING:P97821 PhosphoSite:P97821 PaxDb:P97821
            PRIDE:P97821 Ensembl:ENSMUST00000032779 GeneID:13032 KEGG:mmu:13032
            InParanoid:P97821 BindingDB:P97821 ChEMBL:CHEMBL3454 ChiTaRS:CTSC
            NextBio:282904 Bgee:P97821 CleanEx:MM_CTSC Genevestigator:P97821
            Uniprot:P97821
        Length = 462

 Score = 182 (69.1 bits), Expect = 1.2e-16, Sum P(2) = 1.2e-16
 Identities = 52/161 (32%), Positives = 76/161 (47%)

Query:   501 LATEKLVDCDMSNGGCNGG-RMDDALQYIIDNGGVVSDQAYPYKASES----ERGCLXXX 555
             L+ +++V C     GC+GG     A +Y  D G VV +  +PY A +S       CL   
Sbjct:   282 LSPQEVVSCSPYAQGCDGGFPYLIAGKYAQDFG-VVEESCFPYTAKDSPCKPRENCLRYY 340

Query:   556 XXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLF-YYSGGVIDLNQRLCNP- 613
                       Y       E  MK  +   GP++V    +  F +Y  G+   +  L +P 
Sbjct:   341 SSDYYYVGGFYGGC---NEALMKLELVKHGPMAVAFEVHDDFLHYHSGIYH-HTGLSDPF 396

Query:   614 ---KAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGE 651
                +  NHA+++VGYG +     T I YWI+KNSWGS+WGE
Sbjct:   397 NPFELTNHAVLLVGYGRDPV---TGIEYWIIKNSWGSNWGE 434

 Score = 172 (65.6 bits), Expect = 1.4e-14, Sum P(2) = 1.4e-14
 Identities = 52/161 (32%), Positives = 79/161 (49%)

Query:   295 DLPEAFDWR-AEGV--ISKVKEQGKCACCWAFSAVGVVEA-MHAIQGNSLTE-LSVQQLV 349
             +LPE++DWR  +GV  +S V+ Q  C  C++F+++G++EA +  +  NS T  LS Q++V
Sbjct:   229 NLPESWDWRNVQGVNYVSPVRNQESCGSCYSFASMGMLEARIRILTNNSQTPILSPQEVV 288

Query:   350 DCDMSNGGCNGG-RMDDALQYIIDNGGVVSDQAYPYKASES----ERGCLXXXXXXXXXX 404
              C     GC+GG     A +Y  D G VV +  +PY A +S       CL          
Sbjct:   289 SCSPYAQGCDGGFPYLIAGKYAQDFG-VVEESCFPYTAKDSPCKPRENCLRYYSSDYYYV 347

Query:   405 XXXYSRIPYGEEEEMKKWVATRGPLSVGMNANG--LFYYSG 443
                Y       E  MK  +   GP++V    +   L Y+SG
Sbjct:   348 GGFYGGC---NEALMKLELVKHGPMAVAFEVHDDFLHYHSG 385

 Score = 101 (40.6 bits), Expect = 1.2e-16, Sum P(2) = 1.2e-16
 Identities = 20/48 (41%), Positives = 33/48 (68%)

Query:   147 DLPEAFDWR-AEGV--ISKVKEQGKCACCWAFSAVGVVEAMHAIQGNN 191
             +LPE++DWR  +GV  +S V+ Q  C  C++F+++G++EA   I  NN
Sbjct:   229 NLPESWDWRNVQGVNYVSPVRNQESCGSCYSFASMGMLEARIRILTNN 276

 Score = 92 (37.4 bits), Expect = 1.4e-14, Sum P(2) = 1.4e-14
 Identities = 14/18 (77%), Positives = 16/18 (88%)

Query:   455 TSIPYWIVKNSWGSDWGE 472
             T I YWI+KNSWGS+WGE
Sbjct:   417 TGIEYWIIKNSWGSNWGE 434


>ZFIN|ZDB-GENE-030619-9 [details] [associations]
            symbol:ctsc "cathepsin C" species:7955 "Danio rerio"
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008233 "peptidase
            activity" evidence=IEA] [GO:0016787 "hydrolase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            ZFIN:ZDB-GENE-030619-9 eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 CTD:1075
            HOGENOM:HOG000068022 HOVERGEN:HBG005248 KO:K01275 OrthoDB:EOG4H19VZ
            InterPro:IPR014882 Pfam:PF08773 MEROPS:C01.070 HSSP:P43235
            EMBL:BC064286 IPI:IPI00486570 RefSeq:NP_999887.1 UniGene:Dr.32463
            ProteinModelPortal:Q6P2V1 SMR:Q6P2V1 PRIDE:Q6P2V1 GeneID:368704
            KEGG:dre:368704 InParanoid:Q6P2V1 NextBio:20813127
            ArrayExpress:Q6P2V1 Bgee:Q6P2V1 Uniprot:Q6P2V1
        Length = 455

 Score = 180 (68.4 bits), Expect = 1.5e-16, Sum P(2) = 1.5e-16
 Identities = 55/159 (34%), Positives = 74/159 (46%)

Query:   504 EKLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGC-LXXXXXXXXXX 562
             +++V C   + GC+GG      +YI D G +V +  +PY  S+S   C L          
Sbjct:   279 QQVVSCSQYSQGCDGGFPYLIGKYIQDFG-IVEEDCFPYTGSDSP--CNLPAKCTKYYAS 335

Query:   563 XXXYSRIPYG--EEEEMKKWVATRGPLSVG-------MNANGLFYYSGGVIDLNQRLCNP 613
                Y    YG   E  M   +   GP+ V        MN     Y+  G+ D N    NP
Sbjct:   336 DYHYVGGFYGGCSESAMMLELVKNGPMGVALEVYPDFMNYKEGIYHHTGLRDAN----NP 391

Query:   614 -KAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGE 651
              +  NHA+++VGYG+  K   T   YWIVKNSWGS WGE
Sbjct:   392 FELTNHAVLLVGYGQCHK---TGEKYWIVKNSWGSGWGE 427

 Score = 167 (63.8 bits), Expect = 3.2e-13, Sum P(2) = 3.2e-13
 Identities = 50/159 (31%), Positives = 75/159 (47%)

Query:   296 LPEAFDWR-AEGV--ISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTE--LSVQQLVD 350
             LP+ +DWR   GV  +S V+ Q +C  C++F+ +G++EA   IQ N+  +   S QQ+V 
Sbjct:   224 LPQHWDWRNVNGVNFVSPVRNQAQCGSCYSFATMGMLEARVRIQTNNTQQPVFSPQQVVS 283

Query:   351 CDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGC-LXXXXXXXXXXXXXYS 409
             C   + GC+GG      +YI D G +V +  +PY  S+S   C L             Y 
Sbjct:   284 CSQYSQGCDGGFPYLIGKYIQDFG-IVEEDCFPYTGSDSP--CNLPAKCTKYYASDYHYV 340

Query:   410 RIPYG--EEEEMKKWVATRGPLSVGMNANGLFY-YSGGV 445
                YG   E  M   +   GP+ V +     F  Y  G+
Sbjct:   341 GGFYGGCSESAMMLELVKNGPMGVALEVYPDFMNYKEGI 379

 Score = 102 (41.0 bits), Expect = 1.5e-16, Sum P(2) = 1.5e-16
 Identities = 24/61 (39%), Positives = 37/61 (60%)

Query:   148 LPEAFDWR-AEGV--ISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQHHDKVY 204
             LP+ +DWR   GV  +S V+ Q +C  C++F+ +G++EA   IQ NN T+  V    +V 
Sbjct:   224 LPQHWDWRNVNGVNFVSPVRNQAQCGSCYSFATMGMLEARVRIQTNN-TQQPVFSPQQVV 282

Query:   205 S 205
             S
Sbjct:   283 S 283

 Score = 84 (34.6 bits), Expect = 3.2e-13, Sum P(2) = 3.2e-13
 Identities = 14/18 (77%), Positives = 14/18 (77%)

Query:   455 TSIPYWIVKNSWGSDWGE 472
             T   YWIVKNSWGS WGE
Sbjct:   410 TGEKYWIVKNSWGSGWGE 427


>UNIPROTKB|Q9GZM7 [details] [associations]
            symbol:TINAGL1 "Tubulointerstitial nephritis antigen-like"
            species:9606 "Homo sapiens" [GO:0005044 "scavenger receptor
            activity" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0006955 "immune response" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0030247
            "polysaccharide binding" evidence=IEA] [GO:0005737 "cytoplasm"
            evidence=IEA] [GO:0043236 "laminin binding" evidence=IEA]
            [GO:0016197 "endosomal transport" evidence=TAS] [GO:0005201
            "extracellular matrix structural constituent" evidence=NAS]
            [GO:0005576 "extracellular region" evidence=NAS] [GO:0031012
            "extracellular matrix" evidence=IDA] [GO:0005615 "extracellular
            space" evidence=ISS] InterPro:IPR000668 InterPro:IPR001212
            InterPro:IPR013128 Pfam:PF00112 PROSITE:PS00524 PROSITE:PS50958
            SMART:SM00201 SMART:SM00645 GO:GO:0005737 GO:GO:0005615
            GO:GO:0006955 GO:GO:0030247 EMBL:CH471059 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00639
            GO:GO:0016197 EMBL:AC114488 GO:GO:0005044 GO:GO:0005201
            eggNOG:NOG310046 HOGENOM:HOG000241342 HOVERGEN:HBG053961
            EMBL:AF236155 EMBL:AF236151 EMBL:AF236152 EMBL:AF236153
            EMBL:AF236154 EMBL:AF236150 EMBL:AF205436 EMBL:AB050716
            EMBL:AB050719 EMBL:AK074124 EMBL:AY358421 EMBL:AF289569
            EMBL:AK027839 EMBL:AK292770 EMBL:AK298382 EMBL:AK075398
            EMBL:BC009048 EMBL:BC064633 IPI:IPI00005563 IPI:IPI00439435
            IPI:IPI00910801 RefSeq:NP_001191343.1 RefSeq:NP_001191344.1
            RefSeq:NP_071447.1 UniGene:Hs.199368 ProteinModelPortal:Q9GZM7
            SMR:Q9GZM7 IntAct:Q9GZM7 MINT:MINT-253718 STRING:Q9GZM7
            MEROPS:C01.975 PhosphoSite:Q9GZM7 DMDM:61213628 PaxDb:Q9GZM7
            PRIDE:Q9GZM7 Ensembl:ENST00000271064 Ensembl:ENST00000457433
            GeneID:64129 KEGG:hsa:64129 UCSC:uc001bta.3 CTD:64129
            GeneCards:GC01P032042 HGNC:HGNC:19168 HPA:HPA048695
            neXtProt:NX_Q9GZM7 PharmGKB:PA38810 InParanoid:Q9GZM7 OMA:DNCNRCT
            OrthoDB:EOG4BG8W0 PhylomeDB:Q9GZM7 ChiTaRS:TINAGL1 GenomeRNAi:64129
            NextBio:66016 ArrayExpress:Q9GZM7 Bgee:Q9GZM7 CleanEx:HS_TINAGL1
            Genevestigator:Q9GZM7 GermOnline:ENSG00000142910 Uniprot:Q9GZM7
        Length = 467

 Score = 157 (60.3 bits), Expect = 1.6e-16, Sum P(2) = 1.6e-16
 Identities = 39/110 (35%), Positives = 57/110 (51%)

Query:   287 FNSLRHGDDLPEAFDWRAE--GVISKVKEQGKCACCWAFSAVGVVE---AMHAIQGNSLT 341
             +  L  G+ LP AF+   +   +I +  +QG CA  WAFS   V     ++H++ G+   
Sbjct:   194 YTVLNPGEVLPTAFEASEKWPNLIHEPLDQGNCAGSWAFSTAAVASDRVSIHSL-GHMTP 252

Query:   342 ELSVQQLVDCDM-SNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESE 390
              LS Q L+ CD     GC GGR+D A  + +   GVVSD  YP+   E +
Sbjct:   253 VLSPQNLLSCDTHQQQGCRGGRLDGAW-WFLRRRGVVSDHCYPFSGRERD 301

 Score = 127 (49.8 bits), Expect = 1.6e-16, Sum P(2) = 1.6e-16
 Identities = 28/85 (32%), Positives = 42/85 (49%)

Query:   574 EEEMKKWVATRGPLSVGMNAN-GLFYYSGGVIDLNQ-RLCNPKAQN----HALIIVGYGE 627
             ++E+ K +   GP+   M  +   F Y GG+       L  P+       H++ I G+GE
Sbjct:   350 DKEIMKELMENGPVQALMEVHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITGWGE 409

Query:   628 EEKKDGTSIPYWIVKNSWGSDWGEK 652
             E   DG ++ YW   NSWG  WGE+
Sbjct:   410 ETLPDGRTLKYWTAANSWGPAWGER 434

 Score = 92 (37.4 bits), Expect = 1.1e-12, Sum P(3) = 1.1e-12
 Identities = 19/49 (38%), Positives = 27/49 (55%)

Query:   501 LATEKLVDCDM-SNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESE 548
             L+ + L+ CD     GC GGR+D A  + +   GVVSD  YP+   E +
Sbjct:   254 LSPQNLLSCDTHQQQGCRGGRLDGAW-WFLRRRGVVSDHCYPFSGRERD 301

 Score = 72 (30.4 bits), Expect = 8.1e-11, Sum P(2) = 8.1e-11
 Identities = 13/36 (36%), Positives = 19/36 (52%)

Query:   454 GTSIPYWIVKNSWGSDWGEKVEDKVGSSGNRTRDLE 489
             G ++ YW   NSWG  WGE+   ++    N   D+E
Sbjct:   415 GRTLKYWTAANSWGPAWGERGHFRIVRGVNEC-DIE 449

 Score = 71 (30.1 bits), Expect = 1.1e-12, Sum P(3) = 1.1e-12
 Identities = 21/66 (31%), Positives = 34/66 (51%)

Query:   139 FNSLRHGDDLPEAFDWRAE--GVISKVKEQGKCACCWAFSAVGVVE---AMHAIQGNNLT 193
             +  L  G+ LP AF+   +   +I +  +QG CA  WAFS   V     ++H++ G+   
Sbjct:   194 YTVLNPGEVLPTAFEASEKWPNLIHEPLDQGNCAGSWAFSTAAVASDRVSIHSL-GHMTP 252

Query:   194 ELSVQH 199
              LS Q+
Sbjct:   253 VLSPQN 258


>ZFIN|ZDB-GENE-060503-240 [details] [associations]
            symbol:tinagl1 "tubulointerstitial nephritis
            antigen-like 1" species:7955 "Danio rerio" [GO:0005044 "scavenger
            receptor activity" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0030247 "polysaccharide
            binding" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0030414 "peptidase inhibitor activity" evidence=IEA]
            [GO:0006955 "immune response" evidence=IEA] [GO:0002040 "sprouting
            angiogenesis" evidence=IMP] InterPro:IPR000668 InterPro:IPR001212
            InterPro:IPR008037 InterPro:IPR013128 Pfam:PF00112 Pfam:PF05375
            PROSITE:PS00524 PROSITE:PS50958 SMART:SM00201 SMART:SM00645
            ZFIN:ZDB-GENE-060503-240 GO:GO:0006955 GO:GO:0030247 GO:GO:0030414
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00639 GO:GO:0002040
            GO:GO:0005044 GeneTree:ENSGT00560000076599 GO:GO:0010466
            SUPFAM:SSF57283 HOVERGEN:HBG053961 MEROPS:C01.975 OMA:DNCNRCT
            EMBL:BX950864 IPI:IPI00609339 UniGene:Dr.103937
            Ensembl:ENSDART00000087096 Ensembl:ENSDART00000126228
            InParanoid:Q1LUC6 Uniprot:Q1LUC6
        Length = 471

 Score = 155 (59.6 bits), Expect = 2.2e-16, Sum P(2) = 2.2e-16
 Identities = 46/146 (31%), Positives = 67/146 (45%)

Query:   248 SDLQQLTGLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDW--RAE 305
             ++  Q  G+ LD  L   +   + P    +T   M   Q N +   D LP  F+   +  
Sbjct:   157 ANYSQFWGMTLDEGLR-FRLGTKRP---TRTIMNMNEMQMN-MNGNDHLPSYFNAVDKWP 211

Query:   306 GVISKVKEQGKCACCWAFSAVGVVEAMHAIQ--GNSLTELSVQQLVDCDMSN-GGCNGGR 362
             G I +  +QG C   WAFS   V     +IQ  G+   +LS Q L+ CD  +  GC GGR
Sbjct:   212 GKIHEPLDQGNCNASWAFSTAAVASDRISIQSMGHMTPQLSPQNLISCDTRHQDGCAGGR 271

Query:   363 MDDALQYIIDNGGVVSDQAYPYKASE 388
             +D A  + +   GVV+   YP+   E
Sbjct:   272 IDGAW-WFMRRRGVVTQDCYPFSPPE 296

 Score = 128 (50.1 bits), Expect = 2.2e-16, Sum P(2) = 2.2e-16
 Identities = 31/90 (34%), Positives = 44/90 (48%)

Query:   568 RIPYGEEEEMKKWVATRGPLSVGMNAN-GLFYYSGGVI---DLNQRLCNP--KAQNHALI 621
             R+   E E MK+ +   GP+   M  +   F Y  G+    D+N    +   K   H++ 
Sbjct:   341 RLSTNENEIMKE-IMDNGPVQAIMEVHEDFFVYKSGIFRHTDVNYHKPSQYRKHATHSVR 399

Query:   622 IVGYGEEEKKDGTSIPYWIVKNSWGSDWGE 651
             I G+GEE    G +  YWI  NSWG +WGE
Sbjct:   400 ITGWGEERDYSGRTRKYWIGANSWGKNWGE 429

 Score = 82 (33.9 bits), Expect = 9.1e-13, Sum P(3) = 9.1e-13
 Identities = 32/115 (27%), Positives = 50/115 (43%)

Query:    90 EVNKF-FDLSDSDLQQLTGLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDL 148
             E+N+  +    ++  Q  G+ LD  L   +   + P    +T   M   Q N +   D L
Sbjct:   146 EINRRDYGWRAANYSQFWGMTLDEGLR-FRLGTKRP---TRTIMNMNEMQMN-MNGNDHL 200

Query:   149 PEAFDW--RAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQ--GNNLTELSVQH 199
             P  F+   +  G I +  +QG C   WAFS   V     +IQ  G+   +LS Q+
Sbjct:   201 PSYFNAVDKWPGKIHEPLDQGNCNASWAFSTAAVASDRISIQSMGHMTPQLSPQN 255

 Score = 81 (33.6 bits), Expect = 9.1e-13, Sum P(3) = 9.1e-13
 Identities = 17/48 (35%), Positives = 27/48 (56%)

Query:   500 RLATEKLVDCDMSN-GGCNGGRMDDALQYIIDNGGVVSDQAYPYKASE 546
             +L+ + L+ CD  +  GC GGR+D A  + +   GVV+   YP+   E
Sbjct:   250 QLSPQNLISCDTRHQDGCAGGRIDGAW-WFMRRRGVVTQDCYPFSPPE 296

 Score = 72 (30.4 bits), Expect = 1.4e-10, Sum P(2) = 1.4e-10
 Identities = 16/41 (39%), Positives = 22/41 (53%)

Query:   450 QRLY-GTSIPYWIVKNSWGSDWGEKVEDKVGSSGNRTRDLE 489
             +R Y G +  YWI  NSWG +WGE    ++    N   D+E
Sbjct:   406 ERDYSGRTRKYWIGANSWGKNWGEDGYFRIARGVNEC-DIE 445


>UNIPROTKB|F1NWG2 [details] [associations]
            symbol:CTSC "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0001913 "T cell mediated cytotoxicity" evidence=IEA]
            [GO:0004252 "serine-type endopeptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0004252
            GeneTree:ENSGT00560000076599 InterPro:IPR014882 Pfam:PF08773
            OMA:YDDFLHY GO:GO:0001913 EMBL:AADN02004805 IPI:IPI00577371
            Ensembl:ENSGALT00000027869 Uniprot:F1NWG2
        Length = 463

 Score = 178 (67.7 bits), Expect = 1.1e-13, Sum P(2) = 1.1e-13
 Identities = 51/167 (30%), Positives = 79/167 (47%)

Query:   290 LRHGDDLPEAFDWR-AEGV--ISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTE--LS 344
             L+    LPE++DWR   GV  +S V+ Q  C  C+AF+++G++EA   I  N+  +   S
Sbjct:   225 LKKVSGLPESWDWRNVNGVNYVSPVRNQASCGSCYAFASMGMLEARIRILTNNTQKPVFS 284

Query:   345 VQQLVDCDMSNGGCNGG-RMDDALQYIIDNGGVVSDQAYPYKASES----ERGCLXXXXX 399
              QQ+V C   + GC+GG     A +Y+ D G VV +  +PY A ++    +R C      
Sbjct:   285 PQQVVSCSQYSQGCDGGFPYLIAGKYVQDFG-VVEEDCFPYTAKDTPCLFKRSCYHYYTS 343

Query:   400 XXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNA-NGLFYYSGGV 445
                     Y       E  MK  +   GP++V     N   +Y  G+
Sbjct:   344 EYHYVGGFYGAC---NEALMKLELVLSGPMAVAFEVYNDFMFYKEGI 387

 Score = 176 (67.0 bits), Expect = 2.3e-16, Sum P(2) = 2.3e-16
 Identities = 49/158 (31%), Positives = 75/158 (47%)

Query:   504 EKLVDCDMSNGGCNGG-RMDDALQYIIDNGGVVSDQAYPYKASES----ERGCLXXXXXX 558
             +++V C   + GC+GG     A +Y+ D G VV +  +PY A ++    +R C       
Sbjct:   286 QQVVSCSQYSQGCDGGFPYLIAGKYVQDFG-VVEEDCFPYTAKDTPCLFKRSCYHYYTSE 344

Query:   559 XXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNA-NGLFYYSGGVID---LNQRLCNP- 613
                    Y       E  MK  +   GP++V     N   +Y  G+     L     NP 
Sbjct:   345 YHYVGGFYGAC---NEALMKLELVLSGPMAVAFEVYNDFMFYKEGIYHHTGLKDEF-NPF 400

Query:   614 KAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGE 651
             +  NHA+++VGYG++ +  G    +WIVKNSWG+ WGE
Sbjct:   401 ELTNHAVLLVGYGKDPES-GEK--FWIVKNSWGTSWGE 435

 Score = 105 (42.0 bits), Expect = 2.3e-16, Sum P(2) = 2.3e-16
 Identities = 26/67 (38%), Positives = 39/67 (58%)

Query:   142 LRHGDDLPEAFDWR-AEGV--ISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
             L+    LPE++DWR   GV  +S V+ Q  C  C+AF+++G++EA   I  NN T+  V 
Sbjct:   225 LKKVSGLPESWDWRNVNGVNYVSPVRNQASCGSCYAFASMGMLEARIRILTNN-TQKPVF 283

Query:   199 HHDKVYS 205
                +V S
Sbjct:   284 SPQQVVS 290

 Score = 77 (32.2 bits), Expect = 1.1e-13, Sum P(2) = 1.1e-13
 Identities = 11/14 (78%), Positives = 13/14 (92%)

Query:   459 YWIVKNSWGSDWGE 472
             +WIVKNSWG+ WGE
Sbjct:   422 FWIVKNSWGTSWGE 435


>UNIPROTKB|Q5QP40 [details] [associations]
            symbol:CTSK "Cathepsin K" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015644 Pfam:PF00112
            InterPro:IPR000169 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00139 EMBL:AL355860 HOVERGEN:HBG011513
            PANTHER:PTHR12411:SF55 EMBL:AL356292 UniGene:Hs.632466
            HGNC:HGNC:2536 IPI:IPI00514633 SMR:Q5QP40 STRING:Q5QP40
            Ensembl:ENST00000443913 Uniprot:Q5QP40
        Length = 258

 Score = 209 (78.6 bits), Expect = 5.4e-16, P = 5.4e-16
 Identities = 56/186 (30%), Positives = 96/186 (51%)

Query:   194 ELSVQHHDKVYSSVEDLLRRHENFVTNVEKAEDYQSEDSGTAVFGVNKFFDLSESDLQQL 253
             EL  + H K Y++  D + R   +  N++    Y S  +  A  GV+ + +L+ + L  +
Sbjct:    86 ELWKKTHRKQYNNKVDEISRRLIWEKNLK----YISIHNLEASLGVHTY-ELAMNHLGDM 140

Query:   254 TGLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVKE 313
             T    +  ++ +   L+ P S ++++  +   ++         P++ D+R +G ++ VK 
Sbjct:   141 TS---EEVVQKMT-GLKVPLSHSRSNDTLYIPEWEGRA-----PDSVDYRKKGYVTPVKN 191

Query:   314 QGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSNGGCNGGRMDDALQYIIDN 373
             QG+C  CWAFS+VG +E     +   L  LS Q LVDC   N GC GG M +A QY+  N
Sbjct:   192 QGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTNAFQYVQKN 251

Query:   374 GGVVSD 379
              G+ S+
Sbjct:   252 RGIDSE 257

 Score = 117 (46.2 bits), Expect = 0.00041, P = 0.00041
 Identities = 21/51 (41%), Positives = 32/51 (62%)

Query:   149 PEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQH 199
             P++ D+R +G ++ VK QG+C  CWAFS+VG +E     +   L  LS Q+
Sbjct:   175 PDSVDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQN 225


>RGD|2445 [details] [associations]
            symbol:Ctsc "cathepsin C" species:10116 "Rattus norvegicus"
          [GO:0001913 "T cell mediated cytotoxicity" evidence=IEA;ISO]
          [GO:0004197 "cysteine-type endopeptidase activity" evidence=NAS]
          [GO:0004252 "serine-type endopeptidase activity" evidence=IEA;ISO]
          [GO:0005764 "lysosome" evidence=IDA;TAS] [GO:0005783 "endoplasmic
          reticulum" evidence=IDA] [GO:0005794 "Golgi apparatus" evidence=IDA]
          [GO:0006508 "proteolysis" evidence=IEP;ISO;TAS] [GO:0007568 "aging"
          evidence=IEP] [GO:0008234 "cysteine-type peptidase activity"
          evidence=ISO] [GO:0010033 "response to organic substance"
          evidence=IDA] [GO:0031404 "chloride ion binding" evidence=IDA]
          [GO:0042802 "identical protein binding" evidence=IDA] [GO:0043621
          "protein self-association" evidence=IDA] InterPro:IPR000668
          InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
          InterPro:IPR000169 RGD:2445 GO:GO:0005783 GO:GO:0005794 GO:GO:0007568
          GO:GO:0010033 eggNOG:COG4870 GO:GO:0006508 InterPro:IPR025661
          InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139
          PROSITE:PS00639 GO:GO:0004252 GO:GO:0005764 GO:GO:0043621
          GO:GO:0042802 GO:GO:0031404 GO:GO:0004197
          GeneTree:ENSGT00560000076599 CTD:1075 HOGENOM:HOG000068022
          HOVERGEN:HBG005248 KO:K01275 OrthoDB:EOG4H19VZ InterPro:IPR014882
          Pfam:PF08773 MEROPS:C01.070 OMA:YDDFLHY GO:GO:0001913 EMBL:D90404
          IPI:IPI00193765 PIR:A41158 RefSeq:NP_058793.1 UniGene:Rn.203177
          PDB:1JQP PDBsum:1JQP ProteinModelPortal:P80067 SMR:P80067
          STRING:P80067 PhosphoSite:P80067 PRIDE:P80067
          Ensembl:ENSRNOT00000022342 GeneID:25423 KEGG:rno:25423
          InParanoid:P80067 SABIO-RK:P80067 EvolutionaryTrace:P80067
          NextBio:606591 ArrayExpress:P80067 Genevestigator:P80067
          GermOnline:ENSRNOG00000016496 Uniprot:P80067
        Length = 462

 Score = 180 (68.4 bits), Expect = 5.5e-16, Sum P(2) = 5.5e-16
 Identities = 51/161 (31%), Positives = 78/161 (48%)

Query:   501 LATEKLVDCDMSNGGCNGG-RMDDALQYIIDNGGVVSDQAYPYKASES----ERGCLXXX 555
             L+ +++V C     GC+GG     A +Y  D G VV +  +PY A+++    +  CL   
Sbjct:   282 LSPQEVVSCSPYAQGCDGGFPYLIAGKYAQDFG-VVEENCFPYTATDAPCKPKENCLRYY 340

Query:   556 XXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLF-YYSGGVIDLNQRLCNP- 613
                       Y       E  MK  +   GP++V    +  F +Y  G+   +  L +P 
Sbjct:   341 SSEYYYVGGFYGGC---NEALMKLELVKHGPMAVAFEVHDDFLHYHSGIYH-HTGLSDPF 396

Query:   614 ---KAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGE 651
                +  NHA+++VGYG++     T + YWIVKNSWGS WGE
Sbjct:   397 NPFELTNHAVLLVGYGKDPV---TGLDYWIVKNSWGSQWGE 434

 Score = 167 (63.8 bits), Expect = 8.2e-14, Sum P(2) = 8.2e-14
 Identities = 50/160 (31%), Positives = 79/160 (49%)

Query:   296 LPEAFDWR-AEGV--ISKVKEQGKCACCWAFSAVGVVEA-MHAIQGNSLTE-LSVQQLVD 350
             LPE++DWR   G+  +S V+ Q  C  C++F+++G++EA +  +  NS T  LS Q++V 
Sbjct:   230 LPESWDWRNVRGINFVSPVRNQESCGSCYSFASLGMLEARIRILTNNSQTPILSPQEVVS 289

Query:   351 CDMSNGGCNGG-RMDDALQYIIDNGGVVSDQAYPYKASES----ERGCLXXXXXXXXXXX 405
             C     GC+GG     A +Y  D G VV +  +PY A+++    +  CL           
Sbjct:   290 CSPYAQGCDGGFPYLIAGKYAQDFG-VVEENCFPYTATDAPCKPKENCLRYYSSEYYYVG 348

Query:   406 XXYSRIPYGEEEEMKKWVATRGPLSVGMNANG--LFYYSG 443
               Y       E  MK  +   GP++V    +   L Y+SG
Sbjct:   349 GFYGGC---NEALMKLELVKHGPMAVAFEVHDDFLHYHSG 385

 Score = 97 (39.2 bits), Expect = 5.5e-16, Sum P(2) = 5.5e-16
 Identities = 19/47 (40%), Positives = 31/47 (65%)

Query:   148 LPEAFDWR-AEGV--ISKVKEQGKCACCWAFSAVGVVEAMHAIQGNN 191
             LPE++DWR   G+  +S V+ Q  C  C++F+++G++EA   I  NN
Sbjct:   230 LPESWDWRNVRGINFVSPVRNQESCGSCYSFASLGMLEARIRILTNN 276

 Score = 90 (36.7 bits), Expect = 8.2e-14, Sum P(2) = 8.2e-14
 Identities = 14/18 (77%), Positives = 15/18 (83%)

Query:   455 TSIPYWIVKNSWGSDWGE 472
             T + YWIVKNSWGS WGE
Sbjct:   417 TGLDYWIVKNSWGSQWGE 434


>MGI|MGI:2137617 [details] [associations]
            symbol:Tinagl1 "tubulointerstitial nephritis antigen-like 1"
            species:10090 "Mus musculus" [GO:0005044 "scavenger receptor
            activity" evidence=IEA] [GO:0005576 "extracellular region"
            evidence=IEA] [GO:0005737 "cytoplasm" evidence=IDA] [GO:0006508
            "proteolysis" evidence=IEA] [GO:0006955 "immune response"
            evidence=IEA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0030247 "polysaccharide binding" evidence=IEA]
            [GO:0043236 "laminin binding" evidence=IPI] InterPro:IPR000668
            InterPro:IPR001212 InterPro:IPR013128 Pfam:PF00112 PROSITE:PS00524
            PROSITE:PS50958 SMART:SM00201 SMART:SM00645 MGI:MGI:2137617
            GO:GO:0005737 GO:GO:0005576 GO:GO:0006955 GO:GO:0030247
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00639 GO:GO:0031012 CleanEx:MM_ARG1 GO:GO:0005044
            GeneTree:ENSGT00560000076599 eggNOG:NOG310046 HOGENOM:HOG000241342
            HOVERGEN:HBG053961 MEROPS:C01.975 CTD:64129 OMA:DNCNRCT
            OrthoDB:EOG4BG8W0 EMBL:AB047402 EMBL:AB050626 EMBL:BC005738
            EMBL:BC018539 IPI:IPI00115458 RefSeq:NP_001161805.1
            RefSeq:NP_075965.2 UniGene:Mm.15801 ProteinModelPortal:Q99JR5
            SMR:Q99JR5 STRING:Q99JR5 PhosphoSite:Q99JR5 PaxDb:Q99JR5
            PRIDE:Q99JR5 Ensembl:ENSMUST00000030560 Ensembl:ENSMUST00000105998
            Ensembl:ENSMUST00000105999 GeneID:94242 KEGG:mmu:94242
            InParanoid:Q99JR5 NextBio:352247 Bgee:Q99JR5 Genevestigator:Q99JR5
            GermOnline:ENSMUSG00000028776 Uniprot:Q99JR5
        Length = 466

 Score = 158 (60.7 bits), Expect = 6.5e-16, Sum P(2) = 6.5e-16
 Identities = 44/140 (31%), Positives = 64/140 (45%)

Query:   287 FNSLRHGDDLPEAFDWRAE--GVISKVKEQGKCACCWAFSAVGVVE---AMHAIQGNSLT 341
             +  L  G+ LP AF+   +   +I +  +QG CA  WAFS   V     ++H++ G+   
Sbjct:   193 YTVLGQGEVLPTAFEASEKWPNLIHEPLDQGNCAGSWAFSTAAVASDRVSIHSL-GHMTP 251

Query:   342 ELSVQQLVDCDMSNG-GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERG-----CLX 395
              LS Q L+ CD  +  GC GGR+D A  + +   GVVSD  YP+   E         C+ 
Sbjct:   252 ILSPQNLLSCDTHHQQGCRGGRLDGAW-WFLRRRGVVSDNCYPFSGREQNEASPTPRCMM 310

Query:   396 XXXXXXXXXXXXYSRIPYGE 415
                          SR P G+
Sbjct:   311 HSRAMGRGKRQATSRCPNGQ 330

 Score = 120 (47.3 bits), Expect = 6.5e-16, Sum P(2) = 6.5e-16
 Identities = 28/86 (32%), Positives = 44/86 (51%)

Query:   573 EEEEMKKWVATRGPLSVGMNAN-GLFYYSGGVID---LNQRLCNPKAQN--HALIIVGYG 626
             +E+E+ K +   GP+   M  +   F Y  G+     ++Q       ++  H++ I G+G
Sbjct:   348 DEKEIMKELMENGPVQALMEVHEDFFLYQRGIYSHTPVSQGRPEQYRRHGTHSVKITGWG 407

Query:   627 EEEKKDGTSIPYWIVKNSWGSDWGEK 652
             EE   DG +I YW   NSWG  WGE+
Sbjct:   408 EETLPDGRTIKYWTAANSWGPWWGER 433

 Score = 94 (38.1 bits), Expect = 5.3e-12, Sum P(3) = 5.3e-12
 Identities = 24/79 (30%), Positives = 34/79 (43%)

Query:   501 LATEKLVDCDMSNG-GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERG-----CLXX 554
             L+ + L+ CD  +  GC GGR+D A  + +   GVVSD  YP+   E         C+  
Sbjct:   253 LSPQNLLSCDTHHQQGCRGGRLDGAW-WFLRRRGVVSDNCYPFSGREQNEASPTPRCMMH 311

Query:   555 XXXXXXXXXXXYSRIPYGE 573
                         SR P G+
Sbjct:   312 SRAMGRGKRQATSRCPNGQ 330

 Score = 74 (31.1 bits), Expect = 3.8e-11, Sum P(2) = 3.8e-11
 Identities = 17/41 (41%), Positives = 22/41 (53%)

Query:   454 GTSIPYWIVKNSWGSDWGEKVEDKVGSSGNRTRDLELTGVL 494
             G +I YW   NSWG  WGE+   ++    N   D+E T VL
Sbjct:   414 GRTIKYWTAANSWGPWWGERGHFRIVRGTNEC-DIE-TFVL 452

 Score = 70 (29.7 bits), Expect = 5.3e-12, Sum P(3) = 5.3e-12
 Identities = 21/66 (31%), Positives = 34/66 (51%)

Query:   139 FNSLRHGDDLPEAFDWRAE--GVISKVKEQGKCACCWAFSAVGVVE---AMHAIQGNNLT 193
             +  L  G+ LP AF+   +   +I +  +QG CA  WAFS   V     ++H++ G+   
Sbjct:   193 YTVLGQGEVLPTAFEASEKWPNLIHEPLDQGNCAGSWAFSTAAVASDRVSIHSL-GHMTP 251

Query:   194 ELSVQH 199
              LS Q+
Sbjct:   252 ILSPQN 257


>RGD|70956 [details] [associations]
            symbol:Tinagl1 "tubulointerstitial nephritis antigen-like 1"
           species:10116 "Rattus norvegicus" [GO:0005044 "scavenger receptor
           activity" evidence=IEA] [GO:0005576 "extracellular region"
           evidence=IEA] [GO:0005737 "cytoplasm" evidence=IEA;ISO] [GO:0006508
           "proteolysis" evidence=IEA] [GO:0006955 "immune response"
           evidence=IEA] [GO:0008234 "cysteine-type peptidase activity"
           evidence=IEA] [GO:0030247 "polysaccharide binding" evidence=IEA]
           [GO:0031012 "extracellular matrix" evidence=IEA;ISO] [GO:0043236
           "laminin binding" evidence=IEA;ISO] InterPro:IPR000668
           InterPro:IPR001212 InterPro:IPR013128 Pfam:PF00112 PROSITE:PS00524
           PROSITE:PS50958 SMART:SM00201 SMART:SM00645 RGD:70956 GO:GO:0005737
           GO:GO:0005576 GO:GO:0006955 GO:GO:0030247 GO:GO:0008234
           GO:GO:0006508 InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00639
           GO:GO:0031012 GO:GO:0005044 eggNOG:NOG310046 HOGENOM:HOG000241342
           HOVERGEN:HBG053961 MEROPS:C01.975 CTD:64129 OrthoDB:EOG4BG8W0
           EMBL:AB050717 IPI:IPI00190428 RefSeq:NP_446034.1 UniGene:Rn.1256
           ProteinModelPortal:Q9EQT5 PRIDE:Q9EQT5 GeneID:94174 KEGG:rno:94174
           UCSC:RGD:70956 InParanoid:Q9EQT5 NextBio:617830 ArrayExpress:Q9EQT5
           Genevestigator:Q9EQT5 GermOnline:ENSRNOG00000013179 Uniprot:Q9EQT5
        Length = 467

 Score = 156 (60.0 bits), Expect = 1.1e-15, Sum P(2) = 1.1e-15
 Identities = 46/140 (32%), Positives = 70/140 (50%)

Query:   255 GLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAE--GVISKVK 312
             G+ LD  +     +++ P SS     E+    +  L  G+ LP AF+   +   +I +  
Sbjct:   166 GMTLDEGIRYRLGTIR-PSSSVMNMNEI----YTVLGQGEVLPTAFEASEKWPNLIHEPL 220

Query:   313 EQGKCACCWAFSAVGVVE---AMHAIQGNSLTELSVQQLVDCDMSNG-GCNGGRMDDALQ 368
             +QG CA  WAFS   V     ++H++ G+    LS Q L+ CD  +  GC GGR+D A  
Sbjct:   221 DQGNCAGSWAFSTAAVASDRVSIHSL-GHMTPILSPQNLLSCDTHHQKGCRGGRLDGAW- 278

Query:   369 YIIDNGGVVSDQAYPYKASE 388
             + +   GVVSD  YP+   E
Sbjct:   279 WFLRRRGVVSDNCYPFSGRE 298

 Score = 120 (47.3 bits), Expect = 1.1e-15, Sum P(2) = 1.1e-15
 Identities = 28/86 (32%), Positives = 44/86 (51%)

Query:   573 EEEEMKKWVATRGPLSVGMNAN-GLFYYSGGVID---LNQRLCNPKAQN--HALIIVGYG 626
             +E+E+ K +   GP+   M  +   F Y  G+     ++Q       ++  H++ I G+G
Sbjct:   349 DEKEIMKELMENGPVQALMEVHEDFFLYQRGIYSHTPVSQGRPEQYRRHGTHSVKITGWG 408

Query:   627 EEEKKDGTSIPYWIVKNSWGSDWGEK 652
             EE   DG +I YW   NSWG  WGE+
Sbjct:   409 EETLPDGRTIKYWTAANSWGPWWGER 434

 Score = 91 (37.1 bits), Expect = 8.5e-12, Sum P(3) = 8.5e-12
 Identities = 19/47 (40%), Positives = 27/47 (57%)

Query:   501 LATEKLVDCDMSNG-GCNGGRMDDALQYIIDNGGVVSDQAYPYKASE 546
             L+ + L+ CD  +  GC GGR+D A  + +   GVVSD  YP+   E
Sbjct:   253 LSPQNLLSCDTHHQKGCRGGRLDGAW-WFLRRRGVVSDNCYPFSGRE 298

 Score = 72 (30.4 bits), Expect = 1.0e-10, Sum P(2) = 1.0e-10
 Identities = 17/41 (41%), Positives = 22/41 (53%)

Query:   454 GTSIPYWIVKNSWGSDWGEKVEDKVGSSGNRTRDLELTGVL 494
             G +I YW   NSWG  WGE+   ++    N   D+E T VL
Sbjct:   415 GRTIKYWTAANSWGPWWGERGHFRIVRGINEC-DIE-TFVL 453

 Score = 71 (30.1 bits), Expect = 8.5e-12, Sum P(3) = 8.5e-12
 Identities = 28/98 (28%), Positives = 47/98 (47%)

Query:   107 GLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAE--GVISKVK 164
             G+ LD  +     +++ P SS     E+    +  L  G+ LP AF+   +   +I +  
Sbjct:   166 GMTLDEGIRYRLGTIR-PSSSVMNMNEI----YTVLGQGEVLPTAFEASEKWPNLIHEPL 220

Query:   165 EQGKCACCWAFSAVGVVE---AMHAIQGNNLTELSVQH 199
             +QG CA  WAFS   V     ++H++ G+    LS Q+
Sbjct:   221 DQGNCAGSWAFSTAAVASDRVSIHSL-GHMTPILSPQN 257


>UNIPROTKB|Q9EQT5 [details] [associations]
            symbol:Tinagl1 "Tubulointerstitial nephritis antigen-like"
            species:10116 "Rattus norvegicus" [GO:0005044 "scavenger receptor
            activity" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0006955 "immune response" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0030247
            "polysaccharide binding" evidence=IEA] InterPro:IPR000668
            InterPro:IPR001212 InterPro:IPR013128 Pfam:PF00112 PROSITE:PS00524
            PROSITE:PS50958 SMART:SM00201 SMART:SM00645 RGD:70956 GO:GO:0005737
            GO:GO:0005576 GO:GO:0006955 GO:GO:0030247 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00639
            GO:GO:0031012 GO:GO:0005044 eggNOG:NOG310046 HOGENOM:HOG000241342
            HOVERGEN:HBG053961 MEROPS:C01.975 CTD:64129 OrthoDB:EOG4BG8W0
            EMBL:AB050717 IPI:IPI00190428 RefSeq:NP_446034.1 UniGene:Rn.1256
            ProteinModelPortal:Q9EQT5 PRIDE:Q9EQT5 GeneID:94174 KEGG:rno:94174
            UCSC:RGD:70956 InParanoid:Q9EQT5 NextBio:617830 ArrayExpress:Q9EQT5
            Genevestigator:Q9EQT5 GermOnline:ENSRNOG00000013179 Uniprot:Q9EQT5
        Length = 467

 Score = 156 (60.0 bits), Expect = 1.1e-15, Sum P(2) = 1.1e-15
 Identities = 46/140 (32%), Positives = 70/140 (50%)

Query:   255 GLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAE--GVISKVK 312
             G+ LD  +     +++ P SS     E+    +  L  G+ LP AF+   +   +I +  
Sbjct:   166 GMTLDEGIRYRLGTIR-PSSSVMNMNEI----YTVLGQGEVLPTAFEASEKWPNLIHEPL 220

Query:   313 EQGKCACCWAFSAVGVVE---AMHAIQGNSLTELSVQQLVDCDMSNG-GCNGGRMDDALQ 368
             +QG CA  WAFS   V     ++H++ G+    LS Q L+ CD  +  GC GGR+D A  
Sbjct:   221 DQGNCAGSWAFSTAAVASDRVSIHSL-GHMTPILSPQNLLSCDTHHQKGCRGGRLDGAW- 278

Query:   369 YIIDNGGVVSDQAYPYKASE 388
             + +   GVVSD  YP+   E
Sbjct:   279 WFLRRRGVVSDNCYPFSGRE 298

 Score = 120 (47.3 bits), Expect = 1.1e-15, Sum P(2) = 1.1e-15
 Identities = 28/86 (32%), Positives = 44/86 (51%)

Query:   573 EEEEMKKWVATRGPLSVGMNAN-GLFYYSGGVID---LNQRLCNPKAQN--HALIIVGYG 626
             +E+E+ K +   GP+   M  +   F Y  G+     ++Q       ++  H++ I G+G
Sbjct:   349 DEKEIMKELMENGPVQALMEVHEDFFLYQRGIYSHTPVSQGRPEQYRRHGTHSVKITGWG 408

Query:   627 EEEKKDGTSIPYWIVKNSWGSDWGEK 652
             EE   DG +I YW   NSWG  WGE+
Sbjct:   409 EETLPDGRTIKYWTAANSWGPWWGER 434

 Score = 91 (37.1 bits), Expect = 8.5e-12, Sum P(3) = 8.5e-12
 Identities = 19/47 (40%), Positives = 27/47 (57%)

Query:   501 LATEKLVDCDMSNG-GCNGGRMDDALQYIIDNGGVVSDQAYPYKASE 546
             L+ + L+ CD  +  GC GGR+D A  + +   GVVSD  YP+   E
Sbjct:   253 LSPQNLLSCDTHHQKGCRGGRLDGAW-WFLRRRGVVSDNCYPFSGRE 298

 Score = 72 (30.4 bits), Expect = 1.0e-10, Sum P(2) = 1.0e-10
 Identities = 17/41 (41%), Positives = 22/41 (53%)

Query:   454 GTSIPYWIVKNSWGSDWGEKVEDKVGSSGNRTRDLELTGVL 494
             G +I YW   NSWG  WGE+   ++    N   D+E T VL
Sbjct:   415 GRTIKYWTAANSWGPWWGERGHFRIVRGINEC-DIE-TFVL 453

 Score = 71 (30.1 bits), Expect = 8.5e-12, Sum P(3) = 8.5e-12
 Identities = 28/98 (28%), Positives = 47/98 (47%)

Query:   107 GLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAE--GVISKVK 164
             G+ LD  +     +++ P SS     E+    +  L  G+ LP AF+   +   +I +  
Sbjct:   166 GMTLDEGIRYRLGTIR-PSSSVMNMNEI----YTVLGQGEVLPTAFEASEKWPNLIHEPL 220

Query:   165 EQGKCACCWAFSAVGVVE---AMHAIQGNNLTELSVQH 199
             +QG CA  WAFS   V     ++H++ G+    LS Q+
Sbjct:   221 DQGNCAGSWAFSTAAVASDRVSIHSL-GHMTPILSPQN 257


>UNIPROTKB|F1NHB8 [details] [associations]
            symbol:F1NHB8 "Uncharacterized protein" species:9031
            "Gallus gallus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GeneTree:ENSGT00660000095458 EMBL:AADN02044011
            IPI:IPI00586027 Ensembl:ENSGALT00000021873 OMA:SELDHAV
            Uniprot:F1NHB8
        Length = 329

 Score = 214 (80.4 bits), Expect = 1.6e-15, Sum P(2) = 1.6e-15
 Identities = 48/157 (30%), Positives = 77/157 (49%)

Query:   296 LPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCD--M 353
             LPE+ DWR  G ++ VK+Q  C  CW+F+  G +E    ++   LT LS Q L+DC    
Sbjct:   110 LPESLDWRLYGAVTPVKDQAVCGSCWSFATTGAMEGALFLKTGVLTPLSQQVLIDCSWGF 169

Query:   354 SNGGCNGGRMDDALQYIIDNGGVVSDQAY-PYKASESERGCLXXXXXXXXXXXXXYSRIP 412
              N  C+GG    A ++I  +GG+ S ++Y PY     + G               Y  + 
Sbjct:   170 GNYACDGGEEWRAYEWIKKHGGIASTESYGPYLG---QNGYCHYNQSELVAPLAGYVTVE 226

Query:   413 YGEEEEMKKWVATRGPLSVGMNAN--GLFYYSGGVID 447
              G  E +K  +   GP++V ++A+     +Y+ GV +
Sbjct:   227 SGNAEALKAALFKHGPVAVNIDASHKSFTFYANGVYE 263

 Score = 178 (67.7 bits), Expect = 5.8e-11, Sum P(2) = 5.8e-11
 Identities = 63/213 (29%), Positives = 99/213 (46%)

Query:   448 LNQRLYGTSIPYWIVKNS--WGSDWGEKVEDKVGSSGNRTRDLEL-TGVLPSKLSRLATE 504
             L+ RLYG   P   VK+    GS W         ++G     L L TGVL    + L+ +
Sbjct:   114 LDWRLYGAVTP---VKDQAVCGSCWS------FATTGAMEGALFLKTGVL----TPLSQQ 160

Query:   505 KLVDCD--MSNGGCNGGRMDDALQYIIDNGGVVSDQAY-PYKASESERGCLXXXXXXXXX 561
              L+DC     N  C+GG    A ++I  +GG+ S ++Y PY     + G           
Sbjct:   161 VLIDCSWGFGNYACDGGEEWRAYEWIKKHGGIASTESYGPYLG---QNGYCHYNQSELVA 217

Query:   562 XXXXYSRIPYGEEEEMKKWVATRGPLSVGMNAN--GLFYYSGGVIDLNQRLC-NPKAQ-N 617
                 Y  +  G  E +K  +   GP++V ++A+     +Y+ GV +  +  C N  ++ +
Sbjct:   218 PLAGYVTVESGNAEALKAALFKHGPVAVNIDASHKSFTFYANGVYE--EPHCGNETSELD 275

Query:   618 HALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWG 650
             HA++ VGYG    K      YW++KNSW + WG
Sbjct:   276 HAVLAVGYGVLHGKS-----YWLIKNSWSTYWG 303

 Score = 118 (46.6 bits), Expect = 0.00044, Sum P(2) = 0.00044
 Identities = 21/51 (41%), Positives = 30/51 (58%)

Query:   148 LPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
             LPE+ DWR  G ++ VK+Q  C  CW+F+  G +E    ++   LT LS Q
Sbjct:   110 LPESLDWRLYGAVTPVKDQAVCGSCWSFATTGAMEGALFLKTGVLTPLSQQ 160

 Score = 43 (20.2 bits), Expect = 1.6e-15, Sum P(2) = 1.6e-15
 Identities = 18/79 (22%), Positives = 31/79 (39%)

Query:    45 FLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAEDYQREDSGTAVFEVNKFFDLSDSDLQQ 104
             F ++     K YSS E+   R   F+ N+       R     ++  +N   D +  ++  
Sbjct:    26 FHHYKERFGKRYSSEEEHEHRKRTFIHNMRFVHSKNRAALSYSL-ALNHLADRTPQEMAA 84

Query:   105 LTGLNLDSTLEDIQP-SLQ 122
             L G       +  QP S+Q
Sbjct:    85 LRGRRRSGDPKSGQPFSMQ 103

 Score = 38 (18.4 bits), Expect = 5.4e-15, Sum P(2) = 5.4e-15
 Identities = 16/70 (22%), Positives = 29/70 (41%)

Query:   202 KVYSSVEDLLRRHENFVTNVEKAEDYQSEDSGTAVFGVNKFFDLSESDLQQLTGLNLDST 261
             K YSS E+   R   F+ N+      ++  + +    +N   D +  ++  L G      
Sbjct:    35 KRYSSEEEHEHRKRTFIHNMRFVHS-KNRAALSYSLALNHLADRTPQEMAALRGRRRSGD 93

Query:   262 LEDIQP-SLQ 270
              +  QP S+Q
Sbjct:    94 PKSGQPFSMQ 103


>UNIPROTKB|E1B9H1 [details] [associations]
            symbol:TINAGL1 "Uncharacterized protein" species:9913 "Bos
            taurus" [GO:0043236 "laminin binding" evidence=IEA] [GO:0031012
            "extracellular matrix" evidence=IEA] [GO:0005737 "cytoplasm"
            evidence=IEA] [GO:0030247 "polysaccharide binding" evidence=IEA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006955 "immune response" evidence=IEA] [GO:0006508
            "proteolysis" evidence=IEA] [GO:0005044 "scavenger receptor
            activity" evidence=IEA] InterPro:IPR000668 InterPro:IPR001212
            InterPro:IPR013128 Pfam:PF00112 PROSITE:PS00524 PROSITE:PS50958
            SMART:SM00201 SMART:SM00645 GO:GO:0005737 GO:GO:0006955
            GO:GO:0030247 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00639 GO:GO:0031012 GO:GO:0005044
            GeneTree:ENSGT00560000076599 OMA:DNCNRCT EMBL:DAAA02006255
            IPI:IPI00732137 Ensembl:ENSBTAT00000038022 Uniprot:E1B9H1
        Length = 469

 Score = 148 (57.2 bits), Expect = 2.1e-15, Sum P(2) = 2.1e-15
 Identities = 37/98 (37%), Positives = 52/98 (53%)

Query:   293 GDDLPEAFDWRAE--GVISKVKEQGKCACCWAFSAVGVVE---AMHAIQGNSLTELSVQQ 347
             G+ LP  F+   +   +I    +QG CA  WAFS   V     ++H++ G+    LS Q 
Sbjct:   202 GEVLPRTFEASEKWPNLIHDPLDQGNCAGSWAFSTAAVASDRVSIHSL-GHMSPVLSPQN 260

Query:   348 LVDCDMSNG-GCNGGRMDDALQYIIDNGGVVSDQAYPY 384
             L+ CD  N  GC GGR+D A  + +   GVVSD  YP+
Sbjct:   261 LLSCDTHNQQGCRGGRLDGAW-WFLRRRGVVSDHCYPF 297

 Score = 126 (49.4 bits), Expect = 2.1e-15, Sum P(2) = 2.1e-15
 Identities = 29/85 (34%), Positives = 41/85 (48%)

Query:   574 EEEMKKWVATRGPLSVGMNAN-GLFYYSGGVIDLNQ-RLCNPKAQN----HALIIVGYGE 627
             E+E+ K +   GP+   M  +   F Y  G+       L  P+       H++ I G+GE
Sbjct:   352 EKEIMKELMENGPVQALMEVHEDFFLYQSGIYSHTPVSLGRPERYRRHGTHSVKITGWGE 411

Query:   628 EEKKDGTSIPYWIVKNSWGSDWGEK 652
             E   DG +I YW   NSWG  WGE+
Sbjct:   412 ETLPDGRTIKYWTAANSWGPAWGER 436

 Score = 92 (37.4 bits), Expect = 1.2e-11, Sum P(3) = 1.2e-11
 Identities = 19/43 (44%), Positives = 26/43 (60%)

Query:   501 LATEKLVDCDMSNG-GCNGGRMDDALQYIIDNGGVVSDQAYPY 542
             L+ + L+ CD  N  GC GGR+D A  + +   GVVSD  YP+
Sbjct:   256 LSPQNLLSCDTHNQQGCRGGRLDGAW-WFLRRRGVVSDHCYPF 297

 Score = 77 (32.2 bits), Expect = 2.5e-10, Sum P(2) = 2.5e-10
 Identities = 14/36 (38%), Positives = 19/36 (52%)

Query:   454 GTSIPYWIVKNSWGSDWGEKVEDKVGSSGNRTRDLE 489
             G +I YW   NSWG  WGE+   ++    N   D+E
Sbjct:   417 GRTIKYWTAANSWGPAWGERGHFRIVRGANEC-DIE 451

 Score = 62 (26.9 bits), Expect = 1.2e-11, Sum P(3) = 1.2e-11
 Identities = 19/60 (31%), Positives = 30/60 (50%)

Query:   145 GDDLPEAFDWRAE--GVISKVKEQGKCACCWAFSAVGVVE---AMHAIQGNNLTELSVQH 199
             G+ LP  F+   +   +I    +QG CA  WAFS   V     ++H++ G+    LS Q+
Sbjct:   202 GEVLPRTFEASEKWPNLIHDPLDQGNCAGSWAFSTAAVASDRVSIHSL-GHMSPVLSPQN 260


>UNIPROTKB|F1SVA2 [details] [associations]
            symbol:TINAGL1 "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0005615 "extracellular space" evidence=IDA] [GO:0043236
            "laminin binding" evidence=IEA] [GO:0031012 "extracellular matrix"
            evidence=IEA] [GO:0005737 "cytoplasm" evidence=IEA] [GO:0030247
            "polysaccharide binding" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0006955 "immune response"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0005044
            "scavenger receptor activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR001212 InterPro:IPR013128 Pfam:PF00112 PROSITE:PS00524
            PROSITE:PS50958 SMART:SM00201 SMART:SM00645 GO:GO:0005737
            GO:GO:0005615 GO:GO:0006955 GO:GO:0030247 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00639
            GO:GO:0031012 GO:GO:0005044 GeneTree:ENSGT00560000076599
            OMA:DNCNRCT EMBL:CU856262 Ensembl:ENSSSCT00000003995 Uniprot:F1SVA2
        Length = 467

 Score = 159 (61.0 bits), Expect = 2.1e-15, Sum P(2) = 2.1e-15
 Identities = 41/107 (38%), Positives = 57/107 (53%)

Query:   293 GDDLPEAFDWRAE--GVISKVKEQGKCACCWAFSAVGVVE---AMHAIQGNSLTELSVQQ 347
             G+ LP AF+   +   +I    +QG CA  WAFS   V     ++H++ G+    LS Q 
Sbjct:   200 GEVLPRAFEASEKWPNLIHDPLDQGNCAGSWAFSTAAVASDRVSIHSL-GHMTPVLSPQN 258

Query:   348 LVDCDMSNG-GCNGGRMDDALQYIIDNGGVVSDQAYPYKASE-SERG 392
             L+ CD  N  GC GGR+D A  + +   GVVSD  YP+   E +E G
Sbjct:   259 LLSCDTHNQQGCQGGRLDGAW-WFLRRRGVVSDHCYPFSGHERNEAG 304

 Score = 114 (45.2 bits), Expect = 2.1e-15, Sum P(2) = 2.1e-15
 Identities = 25/86 (29%), Positives = 42/86 (48%)

Query:   574 EEEMKKWVATRGPLSVGMNAN-GLFYYSGGVIDLNQRLCNPKAQN------HALIIVGYG 626
             E+++ K +   GP+   M  +   F Y  G+   +  + + + +       H++ I G+G
Sbjct:   350 EKDIMKELMENGPVQALMEVHEDFFLYQSGIYS-HTPVSHGRPERYRRHGTHSVKITGWG 408

Query:   627 EEEKKDGTSIPYWIVKNSWGSDWGEK 652
             EE   DG  + YW   NSWG  WGE+
Sbjct:   409 EETLPDGRMLKYWTAANSWGPGWGER 434

 Score = 98 (39.6 bits), Expect = 1.9e-11, Sum P(3) = 1.9e-11
 Identities = 22/52 (42%), Positives = 30/52 (57%)

Query:   501 LATEKLVDCDMSNG-GCNGGRMDDALQYIIDNGGVVSDQAYPYKASE-SERG 550
             L+ + L+ CD  N  GC GGR+D A  + +   GVVSD  YP+   E +E G
Sbjct:   254 LSPQNLLSCDTHNQQGCQGGRLDGAW-WFLRRRGVVSDHCYPFSGHERNEAG 304

 Score = 74 (31.1 bits), Expect = 3.0e-11, Sum P(2) = 3.0e-11
 Identities = 13/36 (36%), Positives = 18/36 (50%)

Query:   454 GTSIPYWIVKNSWGSDWGEKVEDKVGSSGNRTRDLE 489
             G  + YW   NSWG  WGE+   ++    N   D+E
Sbjct:   415 GRMLKYWTAANSWGPGWGERGHFRIVRGANEC-DIE 449

 Score = 67 (28.6 bits), Expect = 1.9e-11, Sum P(3) = 1.9e-11
 Identities = 20/60 (33%), Positives = 31/60 (51%)

Query:   145 GDDLPEAFDWRAE--GVISKVKEQGKCACCWAFSAVGVVE---AMHAIQGNNLTELSVQH 199
             G+ LP AF+   +   +I    +QG CA  WAFS   V     ++H++ G+    LS Q+
Sbjct:   200 GEVLPRAFEASEKWPNLIHDPLDQGNCAGSWAFSTAAVASDRVSIHSL-GHMTPVLSPQN 258


>FB|FBgn0250848 [details] [associations]
            symbol:26-29-p "26-29kD-proteinase" species:7227 "Drosophila
            melanogaster" [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=ISS] [GO:0006508 "proteolysis" evidence=IEA] [GO:0005811
            "lipid particle" evidence=IDA] [GO:0005875 "microtubule associated
            complex" evidence=IDA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005875 EMBL:AE014296 GO:GO:0005811 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00660000095458 MEROPS:I29.003 HSSP:O65039
            EMBL:AY122222 EMBL:AB011376 RefSeq:NP_620470.1 UniGene:Dm.3049
            SMR:Q9V3U6 MINT:MINT-890485 STRING:Q9V3U6
            EnsemblMetazoa:FBtr0075766 GeneID:39547 KEGG:dme:Dmel_CG8947
            UCSC:CG8947-RA CTD:39547 FlyBase:FBgn0250848 InParanoid:Q9V3U6
            OMA:IHSKNRA OrthoDB:EOG4BVQ8T GenomeRNAi:39547 NextBio:814210
            Uniprot:Q9V3U6
        Length = 549

 Score = 224 (83.9 bits), Expect = 3.2e-15, P = 3.2e-15
 Identities = 52/193 (26%), Positives = 95/193 (49%)

Query:   259 DSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVKEQGKCA 318
             D T E+++   +  + S+      + F ++  ++ D++P+ +DWR  G ++ VK+Q  C 
Sbjct:   295 DKTEEELKA--RRGYKSSGIYNTGKPFPYDVPKYKDEIPDQYDWRLYGAVTPVKDQSVCG 352

Query:   319 CCWAFSAVGVVEAMHAIQ-GNSLTELSVQQLVDCDMS--NGGCNGGRMDDALQYIIDNGG 375
              CW+F  +G +E    ++ G +L  LS Q L+DC  +  N GC+GG      Q+++ +GG
Sbjct:   353 SCWSFGTIGHLEGAFFLKNGGNLVRLSQQALIDCSWAYGNNGCDGGEDFRVYQWMLQSGG 412

Query:   376 VVSDQAY-PYKASESERGCLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMN 434
             V +++ Y PY   +   G               +  +   +    K  +   GPLSV ++
Sbjct:   413 VPTEEEYGPYLGQD---GYCHVNNVTLVAPIKGFVNVTSNDPNAFKLALLKHGPLSVAID 469

Query:   435 ANG--LFYYSGGV 445
             A+     +YS GV
Sbjct:   470 ASPKTFSFYSHGV 482

 Score = 181 (68.8 bits), Expect = 1.7e-10, P = 1.7e-10
 Identities = 74/251 (29%), Positives = 109/251 (43%)

Query:   416 EEEMKKWVATRGPLSVGM-NANGLF-Y----YSGGVID-LNQRLYGTSIPYWIVKNS--W 466
             EEE+K   A RG  S G+ N    F Y    Y   + D  + RLYG   P   VK+    
Sbjct:   298 EEELK---ARRGYKSSGIYNTGKPFPYDVPKYKDEIPDQYDWRLYGAVTP---VKDQSVC 351

Query:   467 GSDWGEKVEDKVGSSGNRTRDLELTGVLPSKLSRLATEKLVDCDMS--NGGCNGGRMDDA 524
             GS W        G+ G+      L       L RL+ + L+DC  +  N GC+GG     
Sbjct:   352 GSCWS------FGTIGHLEGAFFLKN--GGNLVRLSQQALIDCSWAYGNNGCDGGEDFRV 403

Query:   525 LQYIIDNGGVVSDQAY-PYKASESERGCLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVAT 583
              Q+++ +GGV +++ Y PY   +   G               +  +   +    K  +  
Sbjct:   404 YQWMLQSGGVPTEEEYGPYLGQD---GYCHVNNVTLVAPIKGFVNVTSNDPNAFKLALLK 460

Query:   584 RGPLSVGMNANG--LFYYSGGVIDLNQRLCNPKAQ--NHALIIVGYGEEEKKDGTSIPYW 639
              GPLSV ++A+     +YS GV    +  C       +HA++ VGYG    +D     YW
Sbjct:   461 HGPLSVAIDASPKTFSFYSHGVY--YEPTCKNDVDGLDHAVLAVGYGSINGED-----YW 513

Query:   640 IVKNSWGSDWG 650
             +VKNSW + WG
Sbjct:   514 LVKNSWSTYWG 524

 Score = 136 (52.9 bits), Expect = 1.4e-05, P = 1.4e-05
 Identities = 26/89 (29%), Positives = 49/89 (55%)

Query:   111 DSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGVISKVKEQGKCA 170
             D T E+++   +  + S+      + F ++  ++ D++P+ +DWR  G ++ VK+Q  C 
Sbjct:   295 DKTEEELKA--RRGYKSSGIYNTGKPFPYDVPKYKDEIPDQYDWRLYGAVTPVKDQSVCG 352

Query:   171 CCWAFSAVGVVEAMHAIQ-GNNLTELSVQ 198
              CW+F  +G +E    ++ G NL  LS Q
Sbjct:   353 SCWSFGTIGHLEGAFFLKNGGNLVRLSQQ 381


>UNIPROTKB|F1N455 [details] [associations]
            symbol:CTSC "Dipeptidyl peptidase 1 exclusion domain chain"
            species:9913 "Bos taurus" [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0004252 "serine-type endopeptidase activity" evidence=IEA]
            [GO:0001913 "T cell mediated cytotoxicity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0004252
            GeneTree:ENSGT00560000076599 IPI:IPI00697314 UniGene:Bt.49573
            InterPro:IPR014882 Pfam:PF08773 OMA:YDDFLHY GO:GO:0001913
            EMBL:DAAA02062487 EMBL:DAAA02062488 Ensembl:ENSBTAT00000014735
            Uniprot:F1N455
        Length = 463

 Score = 168 (64.2 bits), Expect = 9.5e-15, Sum P(2) = 9.5e-15
 Identities = 48/160 (30%), Positives = 74/160 (46%)

Query:   501 LATEKLVDCDMSNGGCNGG-RMDDALQYIIDNGGVVSDQAYPYKASES----ERGCLXXX 555
             L+ +++V C     GC GG     A +Y  D G +V +  +PY  ++S    + GC    
Sbjct:   283 LSPQEVVSCSQYAQGCEGGFPYLIAGKYAQDFG-LVEEDCFPYTGTDSPCRLKEGCFRYY 341

Query:   556 XXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNA-NGLFYYSGGVIDLN--QRLCN 612
                       Y       E  MK  +  +GP++V     +   +Y  GV      +   N
Sbjct:   342 SSEYHYVGGFYGGC---NEALMKLELVHQGPMAVAFEVYDDFLHYRKGVYHHTGLRDPFN 398

Query:   613 P-KAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGE 651
             P +  NHA+++VGYG +     + + YWIVKNSWG+ WGE
Sbjct:   399 PFELTNHAVLLVGYGTDA---ASGLDYWIVKNSWGTSWGE 435

 Score = 158 (60.7 bits), Expect = 2.7e-12, Sum P(2) = 2.7e-12
 Identities = 46/161 (28%), Positives = 77/161 (47%)

Query:   296 LPEAFDWR-AEGV--ISKVKEQGKCACCWAFSAVGVVEA-MHAIQGNSLTE-LSVQQLVD 350
             LP ++DWR   G+  ++ V+ QG C  C++F+++G++EA +  +  N+ T  LS Q++V 
Sbjct:   231 LPTSWDWRNVHGINFVTPVRNQGSCGSCYSFASMGMMEARIRILTNNTQTPILSPQEVVS 290

Query:   351 CDMSNGGCNGG-RMDDALQYIIDNGGVVSDQAYPYKASES----ERGCLXXXXXXXXXXX 405
             C     GC GG     A +Y  D G +V +  +PY  ++S    + GC            
Sbjct:   291 CSQYAQGCEGGFPYLIAGKYAQDFG-LVEEDCFPYTGTDSPCRLKEGCFRYYSSEYHYVG 349

Query:   406 XXYSRIPYGEEEEMKKWVATRGPLSVGMNA-NGLFYYSGGV 445
               Y       E  MK  +  +GP++V     +   +Y  GV
Sbjct:   350 GFYGGC---NEALMKLELVHQGPMAVAFEVYDDFLHYRKGV 387

 Score = 98 (39.6 bits), Expect = 9.5e-15, Sum P(2) = 9.5e-15
 Identities = 21/61 (34%), Positives = 37/61 (60%)

Query:   148 LPEAFDWR-AEGV--ISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQHHDKVY 204
             LP ++DWR   G+  ++ V+ QG C  C++F+++G++EA   I  NN T+  +    +V 
Sbjct:   231 LPTSWDWRNVHGINFVTPVRNQGSCGSCYSFASMGMMEARIRILTNN-TQTPILSPQEVV 289

Query:   205 S 205
             S
Sbjct:   290 S 290

 Score = 85 (35.0 bits), Expect = 2.7e-12, Sum P(2) = 2.7e-12
 Identities = 15/24 (62%), Positives = 17/24 (70%)

Query:   453 YGTS----IPYWIVKNSWGSDWGE 472
             YGT     + YWIVKNSWG+ WGE
Sbjct:   412 YGTDAASGLDYWIVKNSWGTSWGE 435


>UNIPROTKB|Q3ZCJ8 [details] [associations]
            symbol:CTSC "Dipeptidyl peptidase 1" species:9913 "Bos
            taurus" [GO:0031638 "zymogen activation" evidence=IDA] [GO:0005764
            "lysosome" evidence=IEA] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0005764 EMBL:BC102115 IPI:IPI00697314 RefSeq:NP_001028789.1
            UniGene:Bt.49573 ProteinModelPortal:Q3ZCJ8 SMR:Q3ZCJ8 STRING:Q3ZCJ8
            PRIDE:Q3ZCJ8 GeneID:352958 KEGG:bta:352958 CTD:1075
            HOGENOM:HOG000068022 HOVERGEN:HBG005248 InParanoid:Q3ZCJ8 KO:K01275
            OrthoDB:EOG4H19VZ BindingDB:Q3ZCJ8 ChEMBL:CHEMBL1075050
            NextBio:20812686 GO:GO:0031638 InterPro:IPR014882 Pfam:PF08773
            Uniprot:Q3ZCJ8
        Length = 463

 Score = 168 (64.2 bits), Expect = 9.5e-15, Sum P(2) = 9.5e-15
 Identities = 48/160 (30%), Positives = 74/160 (46%)

Query:   501 LATEKLVDCDMSNGGCNGG-RMDDALQYIIDNGGVVSDQAYPYKASES----ERGCLXXX 555
             L+ +++V C     GC GG     A +Y  D G +V +  +PY  ++S    + GC    
Sbjct:   283 LSPQEVVSCSQYAQGCEGGFPYLIAGKYAQDFG-LVEEDCFPYTGTDSPCRLKEGCFRYY 341

Query:   556 XXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNA-NGLFYYSGGVIDLN--QRLCN 612
                       Y       E  MK  +  +GP++V     +   +Y  GV      +   N
Sbjct:   342 SSEYHYVGGFYGGC---NEALMKLELVHQGPMAVAFEVYDDFLHYRKGVYHHTGLRDPFN 398

Query:   613 P-KAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGE 651
             P +  NHA+++VGYG +     + + YWIVKNSWG+ WGE
Sbjct:   399 PFELTNHAVLLVGYGTDA---ASGLDYWIVKNSWGTSWGE 435

 Score = 158 (60.7 bits), Expect = 2.7e-12, Sum P(2) = 2.7e-12
 Identities = 46/161 (28%), Positives = 77/161 (47%)

Query:   296 LPEAFDWR-AEGV--ISKVKEQGKCACCWAFSAVGVVEA-MHAIQGNSLTE-LSVQQLVD 350
             LP ++DWR   G+  ++ V+ QG C  C++F+++G++EA +  +  N+ T  LS Q++V 
Sbjct:   231 LPTSWDWRNVHGINFVTPVRNQGSCGSCYSFASMGMMEARIRILTNNTQTPILSPQEVVS 290

Query:   351 CDMSNGGCNGG-RMDDALQYIIDNGGVVSDQAYPYKASES----ERGCLXXXXXXXXXXX 405
             C     GC GG     A +Y  D G +V +  +PY  ++S    + GC            
Sbjct:   291 CSQYAQGCEGGFPYLIAGKYAQDFG-LVEEDCFPYTGTDSPCRLKEGCFRYYSSEYHYVG 349

Query:   406 XXYSRIPYGEEEEMKKWVATRGPLSVGMNA-NGLFYYSGGV 445
               Y       E  MK  +  +GP++V     +   +Y  GV
Sbjct:   350 GFYGGC---NEALMKLELVHQGPMAVAFEVYDDFLHYRKGV 387

 Score = 98 (39.6 bits), Expect = 9.5e-15, Sum P(2) = 9.5e-15
 Identities = 21/61 (34%), Positives = 37/61 (60%)

Query:   148 LPEAFDWR-AEGV--ISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQHHDKVY 204
             LP ++DWR   G+  ++ V+ QG C  C++F+++G++EA   I  NN T+  +    +V 
Sbjct:   231 LPTSWDWRNVHGINFVTPVRNQGSCGSCYSFASMGMMEARIRILTNN-TQTPILSPQEVV 289

Query:   205 S 205
             S
Sbjct:   290 S 290

 Score = 85 (35.0 bits), Expect = 2.7e-12, Sum P(2) = 2.7e-12
 Identities = 15/24 (62%), Positives = 17/24 (70%)

Query:   453 YGTS----IPYWIVKNSWGSDWGE 472
             YGT     + YWIVKNSWG+ WGE
Sbjct:   412 YGTDAASGLDYWIVKNSWGTSWGE 435


>WB|WBGene00009158 [details] [associations]
            symbol:F26E4.3 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR001212 InterPro:IPR013128 Pfam:PF00112 PROSITE:PS00524
            PROSITE:PS50958 SMART:SM00201 SMART:SM00645 GO:GO:0005576
            GO:GO:0006955 GO:GO:0030247 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005044
            GeneTree:ENSGT00560000076599 HSSP:P07711 EMBL:Z81070
            eggNOG:NOG310046 HOGENOM:HOG000241342 OMA:DNCNRCT PIR:T21421
            RefSeq:NP_492593.2 ProteinModelPortal:P90850 SMR:P90850
            PaxDb:P90850 EnsemblMetazoa:F26E4.3.1 EnsemblMetazoa:F26E4.3.2
            GeneID:172827 KEGG:cel:CELE_F26E4.3 UCSC:F26E4.3.1 CTD:172827
            WormBase:F26E4.3 InParanoid:P90850 NextBio:877161 Uniprot:P90850
        Length = 452

 Score = 148 (57.2 bits), Expect = 1.2e-14, Sum P(2) = 1.2e-14
 Identities = 43/107 (40%), Positives = 56/107 (52%)

Query:   295 DLPEAFDWRAE-G-VISKVKEQGKCACCWAFSAVGVVEAMHAI--QGNSLTELSVQQLVD 350
             +LPE FD R + G +I  V +QG C   W+ S   +     AI  +G   + LS QQL+ 
Sbjct:   183 ELPEHFDARDKWGPLIHPVADQGDCGSSWSVSTTAISSDRLAIISEGRINSTLSSQQLLS 242

Query:   351 CDMSNG-GCNGGRMDDALQYIIDNGGVVSDQAYPYKASES-ERG-CL 394
             C+     GC GG +D A  YI    GVV D  YPY + +S E G CL
Sbjct:   243 CNQHRQKGCEGGYLDRAWWYI-RKLGVVGDHCYPYVSGQSREPGHCL 288

 Score = 118 (46.6 bits), Expect = 1.2e-14, Sum P(2) = 1.2e-14
 Identities = 30/91 (32%), Positives = 48/91 (52%)

Query:   570 PY---GEEEEMKKWVATRGPLSVGMNAN-GLFYYSGGVI---DLN-QRLCNPKAQN-HAL 620
             PY     EE+++  + T GP+      +   F Y+GGV    DL  Q+  +  A+  H++
Sbjct:   317 PYKVSSREEDIQTELMTNGPVQATFVVHEDFFMYAGGVYQHSDLAAQKGASSVAEGYHSV 376

Query:   621 IIVGYGEEEKKDGTSIPYWIVKNSWGSDWGE 651
              ++G+G +    G  I YW+  NSWG+ WGE
Sbjct:   377 RVLGWGVDHST-GKPIKYWLCANSWGTQWGE 406

 Score = 92 (37.4 bits), Expect = 8.0e-11, Sum P(3) = 8.0e-11
 Identities = 24/57 (42%), Positives = 33/57 (57%)

Query:   499 SRLATEKLVDCDMSNG-GCNGGRMDDALQYIIDNGGVVSDQAYPYKASES-ERG-CL 552
             S L++++L+ C+     GC GG +D A  YI    GVV D  YPY + +S E G CL
Sbjct:   233 STLSSQQLLSCNQHRQKGCEGGYLDRAWWYI-RKLGVVGDHCYPYVSGQSREPGHCL 288

 Score = 75 (31.5 bits), Expect = 3.5e-10, Sum P(2) = 3.5e-10
 Identities = 11/19 (57%), Positives = 13/19 (68%)

Query:   454 GTSIPYWIVKNSWGSDWGE 472
             G  I YW+  NSWG+ WGE
Sbjct:   388 GKPIKYWLCANSWGTQWGE 406

 Score = 62 (26.9 bits), Expect = 8.0e-11, Sum P(3) = 8.0e-11
 Identities = 20/56 (35%), Positives = 28/56 (50%)

Query:   147 DLPEAFDWRAE-G-VISKVKEQGKCACCWAFSAVGVVEAMHAI--QGNNLTELSVQ 198
             +LPE FD R + G +I  V +QG C   W+ S   +     AI  +G   + LS Q
Sbjct:   183 ELPEHFDARDKWGPLIHPVADQGDCGSSWSVSTTAISSDRLAIISEGRINSTLSSQ 238


>UNIPROTKB|O97578 [details] [associations]
            symbol:CTSC "Dipeptidyl peptidase 1" species:9615 "Canis
            lupus familiaris" [GO:0005764 "lysosome" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0006508
            "proteolysis" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 CTD:1075
            HOGENOM:HOG000068022 HOVERGEN:HBG005248 KO:K01275 OrthoDB:EOG4H19VZ
            InterPro:IPR014882 Pfam:PF08773 EMBL:AF060171 RefSeq:NP_001182763.1
            UniGene:Cfa.28653 ProteinModelPortal:O97578 SMR:O97578
            MEROPS:C01.070 PRIDE:O97578 GeneID:403458 KEGG:cfa:403458
            InParanoid:O97578 NextBio:20816976 Uniprot:O97578
        Length = 435

 Score = 176 (67.0 bits), Expect = 1.3e-14, Sum P(2) = 1.3e-14
 Identities = 52/159 (32%), Positives = 74/159 (46%)

Query:   501 LATEKLVDCDMSNGGCNGG-RMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXX 559
             L+ +++V C     GC GG     A +Y  D G +V +  +PY  S+S   C        
Sbjct:   256 LSPQEIVSCSQYAQGCEGGFPYLIAGKYAQDFG-LVEEACFPYAGSDSP--CKPNDCFRY 312

Query:   560 XXXXXXYSRIPYG--EEEEMKKWVATRGPLSVGMNA-NGLFYYSGGVI---DLNQRLCNP 613
                   Y    YG   E  MK  +   GP++V     +  F+Y  G+     L     NP
Sbjct:   313 YSSEYYYVGGFYGACNEALMKLELVRHGPMAVAFEVYDDFFHYQKGIYYHTGLRDPF-NP 371

Query:   614 -KAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGE 651
              +  NHA+++VGYG +     + + YWIVKNSWGS WGE
Sbjct:   372 FELTNHAVLLVGYGTDS---ASGMDYWIVKNSWGSRWGE 407

 Score = 155 (59.6 bits), Expect = 4.7e-12, Sum P(2) = 4.7e-12
 Identities = 48/159 (30%), Positives = 73/159 (45%)

Query:   296 LPEAFDWR-AEGV--ISKVKEQGKCACCWAFSAVGVVEA-MHAIQGNSLTE-LSVQQLVD 350
             LP ++DWR   G   +S V+ Q  C  C+AF++  ++EA +  +  N+ T  LS Q++V 
Sbjct:   204 LPTSWDWRNVRGTNFVSPVRNQASCGSCYAFASTAMLEARIRILTNNTQTPILSPQEIVS 263

Query:   351 CDMSNGGCNGG-RMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYS 409
             C     GC GG     A +Y  D G +V +  +PY  S+S   C              Y 
Sbjct:   264 CSQYAQGCEGGFPYLIAGKYAQDFG-LVEEACFPYAGSDSP--CKPNDCFRYYSSEYYYV 320

Query:   410 RIPYG--EEEEMKKWVATRGPLSVGMNA-NGLFYYSGGV 445
                YG   E  MK  +   GP++V     +  F+Y  G+
Sbjct:   321 GGFYGACNEALMKLELVRHGPMAVAFEVYDDFFHYQKGI 359

 Score = 87 (35.7 bits), Expect = 1.3e-14, Sum P(2) = 1.3e-14
 Identities = 20/61 (32%), Positives = 33/61 (54%)

Query:   148 LPEAFDWR-AEGV--ISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQHHDKVY 204
             LP ++DWR   G   +S V+ Q  C  C+AF++  ++EA   I  NN T+  +    ++ 
Sbjct:   204 LPTSWDWRNVRGTNFVSPVRNQASCGSCYAFASTAMLEARIRILTNN-TQTPILSPQEIV 262

Query:   205 S 205
             S
Sbjct:   263 S 263

 Score = 85 (35.0 bits), Expect = 4.7e-12, Sum P(2) = 4.7e-12
 Identities = 16/24 (66%), Positives = 17/24 (70%)

Query:   453 YGTS----IPYWIVKNSWGSDWGE 472
             YGT     + YWIVKNSWGS WGE
Sbjct:   384 YGTDSASGMDYWIVKNSWGSRWGE 407


>WB|WBGene00019314 [details] [associations]
            symbol:K02E7.10 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0016021 "integral to
            membrane" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 SMART:SM00645 GO:GO:0008234 GO:GO:0006508
            PANTHER:PTHR12411 GeneTree:ENSGT00560000076599 HSSP:P07711
            EMBL:FO080411 PIR:T32392 RefSeq:NP_493904.1 UniGene:Cel.14828
            ProteinModelPortal:O17255 SMR:O17255 EnsemblMetazoa:K02E7.10
            GeneID:186889 KEGG:cel:CELE_K02E7.10 UCSC:K02E7.10 CTD:186889
            WormBase:K02E7.10 eggNOG:NOG331187 HOGENOM:HOG000114005
            InParanoid:O17255 OMA:GNANEAR NextBio:933344 Uniprot:O17255
        Length = 299

 Score = 191 (72.3 bits), Expect = 1.9e-12, P = 1.9e-12
 Identities = 63/223 (28%), Positives = 103/223 (46%)

Query:   272 PFSSNQTDTEMRAFQFNS-LRHGDDLPEAF-DWRAEGVISKVKEQGKCACCWAFSAVGVV 329
             P+    ++T  R  Q+ + L H   + + F DWR +G++  VK+QGKC   +AF+A+  +
Sbjct:    57 PYQPKTSETP-RPPQYQTKLSH--HMTQDFLDWREKGIVGPVKDQGKCNASYAFAAIAAI 113

Query:   330 EAMHAIQGNS-LTELSVQQLVDCDMSNGGCNGGRMDDAL--QYIIDNGGVVSDQAYPYKA 386
             E+M+A   N  L   S QQ++DC      C    +++ L  +++ +NG V ++  YPY  
Sbjct:   114 ESMYAKANNGKLLSFSEQQIIDCANFTNPCQEN-LENVLSNRFLKENG-VGTEADYPYVG 171

Query:   387 SESERGCLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNAN-GLFYYSGGV 445
              E+   C              Y  + Y  EE  +  + T G     M +    F+Y  G+
Sbjct:   172 KENVGKC--EYDSSKMKLRPTYIDV-YPNEEWARAHITTFGTGYFRMRSPPSFFHYKTGI 228

Query:   446 ID--------LNQRL------YGT--SIPYWIVKNSWGSDWGE 472
              +         N+        YG   +  YWIVK S+G+ WGE
Sbjct:   229 YNPTKEECGNANEARSLAIVGYGKDGAEKYWIVKGSFGTSWGE 271

 Score = 145 (56.1 bits), Expect = 1.5e-14, Sum P(2) = 1.5e-14
 Identities = 44/158 (27%), Positives = 71/158 (44%)

Query:   497 KLSRLATEKLVDCDMSNGGCNGGRMDDAL--QYIIDNGGVVSDQAYPYKASESERGCLXX 554
             KL   + ++++DC      C    +++ L  +++ +NG V ++  YPY   E+   C   
Sbjct:   124 KLLSFSEQQIIDCANFTNPCQEN-LENVLSNRFLKENG-VGTEADYPYVGKENVGKC--E 179

Query:   555 XXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNAN-GLFYYSGGVIDLNQRLCNP 613
                        Y  + Y  EE  +  + T G     M +    F+Y  G+ +  +  C  
Sbjct:   180 YDSSKMKLRPTYIDV-YPNEEWARAHITTFGTGYFRMRSPPSFFHYKTGIYNPTKEECGN 238

Query:   614 KAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGE 651
               +  +L IVGYG    KDG    YWIVK S+G+ WGE
Sbjct:   239 ANEARSLAIVGYG----KDGAE-KYWIVKGSFGTSWGE 271

 Score = 113 (44.8 bits), Expect = 1.5e-14, Sum P(2) = 1.5e-14
 Identities = 25/70 (35%), Positives = 43/70 (61%)

Query:   124 PFSSNQTDTEMRAFQFNS-LRHGDDLPEAF-DWRAEGVISKVKEQGKCACCWAFSAVGVV 181
             P+    ++T  R  Q+ + L H   + + F DWR +G++  VK+QGKC   +AF+A+  +
Sbjct:    57 PYQPKTSETP-RPPQYQTKLSH--HMTQDFLDWREKGIVGPVKDQGKCNASYAFAAIAAI 113

Query:   182 EAMHAIQGNN 191
             E+M+A + NN
Sbjct:   114 ESMYA-KANN 122


>WB|WBGene00008861 [details] [associations]
            symbol:F15D4.4 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR013201
            PANTHER:PTHR12411 SMART:SM00848 PROSITE:PS00640
            GeneTree:ENSGT00560000076599 EMBL:Z80344 HSSP:P53634
            eggNOG:NOG310593 PIR:T20981 ProteinModelPortal:Q93512 SMR:Q93512
            MEROPS:C01.A45 EnsemblMetazoa:F15D4.4 KEGG:cel:CELE_F15D4.4
            UCSC:F15D4.4 CTD:184530 WormBase:F15D4.4 InParanoid:Q93512
            OMA:ITMEQNI NextBio:925068 Uniprot:Q93512
        Length = 608

 Score = 145 (56.1 bits), Expect = 1.6e-14, Sum P(2) = 1.6e-14
 Identities = 33/90 (36%), Positives = 46/90 (51%)

Query:   289 SLRHGDDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQL 348
             S R   D     DWR    +  + +Q  C  CWAFS + ++E+  AIQG + + LSVQQL
Sbjct:   216 SSRKKRDTEPTVDWRP--FLKPILDQSTCGGCWAFSMISMIESFFAIQGYNTSSLSVQQL 273

Query:   349 VDCD--------MSNGGCNGGRMDDALQYI 370
             + CD        ++N GC GG    A  Y+
Sbjct:   274 LTCDTKVDSTYGLANVGCKGGYFQIAGSYL 303

 Score = 134 (52.2 bits), Expect = 1.9e-13, Sum P(3) = 1.9e-13
 Identities = 47/168 (27%), Positives = 78/168 (46%)

Query:    34 TRGYLNSPVTRFLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAEDYQREDSGTAVFEVNK 93
             T  YL SP+ +F N   ++D  + S+ D++  +      +++   Y +      V E N 
Sbjct:   115 TTPYL-SPLEKF-NEAMNNDGAFKSLMDVINFNSTAKEGLKRFNVYSKVKK--EVDEHNI 170

Query:    94 FFDLSDSDLQQLTG---LNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPE 150
              ++L  S  +  T    + LD  +  +  +L A      T T + A   +S +  D  P 
Sbjct:   171 MYELGMSSYKMSTNQFSVALDGEVAPLTLNLDA---LTPTATVIPA-TISSRKKRDTEPT 226

Query:   151 AFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
               DWR    +  + +Q  C  CWAFS + ++E+  AIQG N + LSVQ
Sbjct:   227 V-DWRP--FLKPILDQSTCGGCWAFSMISMIESFFAIQGYNTSSLSVQ 271

 Score = 124 (48.7 bits), Expect = 1.6e-14, Sum P(2) = 1.6e-14
 Identities = 28/69 (40%), Positives = 42/69 (60%)

Query:   584 RGPLSVGMNAN-GLFYYSGGVIDLNQRLCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVK 642
             +GP++VGM A   ++ YS GV D +   C     NHA++IVG+ ++         YWI++
Sbjct:   365 KGPIAVGMAAGPDIYKYSEGVYDGD---CGTII-NHAVVIVGFTDD---------YWIIR 411

Query:   643 NSWGSDWGE 651
             NSWG+ WGE
Sbjct:   412 NSWGASWGE 420

 Score = 108 (43.1 bits), Expect = 7.3e-13, Sum P(2) = 7.3e-13
 Identities = 23/56 (41%), Positives = 34/56 (60%)

Query:   426 RGPLSVGMNAN-GLFYYSGGVID------LNQR--LYGTSIPYWIVKNSWGSDWGE 472
             +GP++VGM A   ++ YS GV D      +N    + G +  YWI++NSWG+ WGE
Sbjct:   365 KGPIAVGMAAGPDIYKYSEGVYDGDCGTIINHAVVIVGFTDDYWIIRNSWGASWGE 420

 Score = 44 (20.5 bits), Expect = 1.9e-13, Sum P(3) = 1.9e-13
 Identities = 12/38 (31%), Positives = 19/38 (50%)

Query:   499 SRLATEKLVDCD--------MSNGGCNGGRMDDALQYI 528
             S L+ ++L+ CD        ++N GC GG    A  Y+
Sbjct:   266 SSLSVQQLLTCDTKVDSTYGLANVGCKGGYFQIAGSYL 303


>ZFIN|ZDB-GENE-050417-107 [details] [associations]
            symbol:zgc:110239 "zgc:110239" species:7955 "Danio
            rerio" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0016787 "hydrolase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 ZFIN:ZDB-GENE-050417-107
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 MEROPS:I29.003 OrthoDB:EOG412M56 EMBL:BC092817
            IPI:IPI00503987 RefSeq:NP_001017633.1 UniGene:Dr.39081
            ProteinModelPortal:Q568K7 GeneID:550326 KEGG:dre:550326
            HOGENOM:HOG000007373 HOVERGEN:HBG105018 InParanoid:Q568K7
            NextBio:20879584 ArrayExpress:Q568K7 Uniprot:Q568K7
        Length = 546

 Score = 217 (81.4 bits), Expect = 1.9e-14, P = 1.9e-14
 Identities = 68/281 (24%), Positives = 120/281 (42%)

Query:   172 CWAFSAVGVVEAMHAIQGNNLTELS-VQHHDKVYSSVEDLL-RRHENFVTNVEKAEDYQS 229
             C  F   GV   + A    +  E S V H  +++   ++   R+++N + + E+  ++  
Sbjct:   210 CGGFPGPGVEHHLLANPIQDFVETSPVSHAHRMFGHYKEKFNRQYDNEMEHEEREHNF-- 267

Query:   230 EDSGTAVFGVNKFFDLSESDLQQLTGLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQ-FN 288
                   V  +     ++ + L     L+++   +  Q  L       +T    R  Q F 
Sbjct:   268 ------VHNIRYVHSMNRAGLS--FSLSVNHLADRSQKELSMMRGCQRTHKVHRKAQPFP 319

Query:   289 SLRHGDDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQL 348
             S       P + DWR  G ++ VK+Q  C  CW+F+  G +E    ++   LT LS Q L
Sbjct:   320 SEIRSIATPNSVDWRLYGAVTPVKDQAVCGSCWSFATTGTLEGALFLKTGQLTSLSQQML 379

Query:   349 VDCD--MSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXX 406
             VDC     N GC+GG    A ++I+ +GG+ + ++Y   A     G              
Sbjct:   380 VDCTWGFGNNGCDGGEEWRAFEWIMKHGGISTAESYG--AYMGMNGLCHYDKSSMVAQLT 437

Query:   407 XYSRIPYGEEEEMKKWVATRGPLSVGMNA--NGLFYYSGGV 445
              Y+ +  G+   +K  +   GP++V ++A      +YS GV
Sbjct:   438 GYTNVTSGDILALKAAIFKFGPVAVSIDAAHRSFAFYSNGV 478

 Score = 171 (65.3 bits), Expect = 7.5e-10, Sum P(2) = 7.5e-10
 Identities = 61/207 (29%), Positives = 91/207 (43%)

Query:   451 RLYGTSIPYWIVKNS--WGSDWGEKVEDKVGSSGNRTRDLEL-TGVLPSKLSRLATEKLV 507
             RLYG   P   VK+    GS W         ++G     L L TG L S    L+ + LV
Sbjct:   334 RLYGAVTP---VKDQAVCGSCWS------FATTGTLEGALFLKTGQLTS----LSQQMLV 380

Query:   508 DCD--MSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXX 565
             DC     N GC+GG    A ++I+ +GG+ + ++Y   A     G               
Sbjct:   381 DCTWGFGNNGCDGGEEWRAFEWIMKHGGISTAESYG--AYMGMNGLCHYDKSSMVAQLTG 438

Query:   566 YSRIPYGEEEEMKKWVATRGPLSVGMNA--NGLFYYSGGVIDLNQRLCNPKAQNHALIIV 623
             Y+ +  G+   +K  +   GP++V ++A      +YS GV    +        +HA++ V
Sbjct:   439 YTNVTSGDILALKAAIFKFGPVAVSIDAAHRSFAFYSNGVYYEPECKNGINDLDHAVLAV 498

Query:   624 GYGEEEKKDGTSIPYWIVKNSWGSDWG 650
             GYG    +      YW+VKNSW S WG
Sbjct:   499 GYGIMNNES-----YWLVKNSWSSYWG 520

 Score = 50 (22.7 bits), Expect = 7.5e-10, Sum P(2) = 7.5e-10
 Identities = 17/72 (23%), Positives = 29/72 (40%)

Query:    40 SPVTR----FLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAEDYQREDSGTAVFEVNKFF 95
             SPV+     F ++    ++ Y +  +   R  NFV N+       R     ++  VN   
Sbjct:   234 SPVSHAHRMFGHYKEKFNRQYDNEMEHEEREHNFVHNIRYVHSMNRAGLSFSL-SVNHLA 292

Query:    96 DLSDSDLQQLTG 107
             D S  +L  + G
Sbjct:   293 DRSQKELSMMRG 304


>WB|WBGene00000786 [details] [associations]
            symbol:cpr-6 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0005764 "lysosome"
            evidence=IDA] InterPro:IPR000668 InterPro:IPR013128
            InterPro:IPR015643 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GeneTree:ENSGT00560000076599 HOGENOM:HOG000241341
            PANTHER:PTHR12411:SF16 EMBL:L39894 EMBL:L39939 EMBL:FO080666
            PIR:T37274 RefSeq:NP_741818.1 UniGene:Cel.18138
            ProteinModelPortal:P43510 SMR:P43510 DIP:DIP-25139N
            MINT:MINT-1074025 STRING:P43510 MEROPS:C01.A51 PaxDb:P43510
            PRIDE:P43510 EnsemblMetazoa:C25B8.3a GeneID:180931
            KEGG:cel:CELE_C25B8.3 UCSC:C25B8.3a CTD:180931 WormBase:C25B8.3a
            InParanoid:P43510 OMA:KAKWGLM NextBio:911608 ArrayExpress:P43510
            Uniprot:P43510
        Length = 379

 Score = 130 (50.8 bits), Expect = 1.2e-13, Sum P(2) = 1.2e-13
 Identities = 35/115 (30%), Positives = 52/115 (45%)

Query:   542 YKASESERGCLXXXXXXXXXXXXXYSRIPYG---EEEEMKKWVATRGPLSVGMNANGLFY 598
             Y   + E+ C+             +    YG   + E ++K + T GPL +       F 
Sbjct:   228 YPTPKCEKKCVSDYTDKTYSEDKFFGASAYGVKDDVEAIQKELMTHGPLEIAFEVYEDFL 287

Query:   599 -YSGGV-IDLNQRLCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGE 651
              Y GGV +    +L       HA+ ++G+G +   DG  IPYW V NSW +DWGE
Sbjct:   288 NYDGGVYVHTGGKL----GGGHAVKLIGWGID---DG--IPYWTVANSWNTDWGE 333

 Score = 125 (49.1 bits), Expect = 1.2e-13, Sum P(2) = 1.2e-13
 Identities = 36/98 (36%), Positives = 47/98 (47%)

Query:   295 DLPEAFD----WRAEGVISKVKEQGKCACCWAFSAVGVVEAMH-----AIQGNSLTELSV 345
             D+PE+FD    W     I  +++Q  C  CWAF   G VEAM      A  G     LS 
Sbjct:   104 DIPESFDSRDNWPKCDSIKVIRDQSSCGSCWAF---GAVEAMSDRICIASHGELQVTLSA 160

Query:   346 QQLVDCDMSNG-GCNGGRMDDALQYIIDNGGVVSDQAY 382
               L+ C  S G GCNGG    A +Y + +G +V+   Y
Sbjct:   161 DDLLSCCKSCGFGCNGGDPLAAWRYWVKDG-IVTGSNY 197

 Score = 94 (38.1 bits), Expect = 7.2e-10, Sum P(2) = 7.2e-10
 Identities = 23/70 (32%), Positives = 34/70 (48%)

Query:   417 EEMKKWVATRGPLSVGMNA--------NGLFYYSGGVIDLNQ--RLYG----TSIPYWIV 462
             E ++K + T GPL +             G++ ++GG +      +L G      IPYW V
Sbjct:   264 EAIQKELMTHGPLEIAFEVYEDFLNYDGGVYVHTGGKLGGGHAVKLIGWGIDDGIPYWTV 323

Query:   463 KNSWGSDWGE 472
              NSW +DWGE
Sbjct:   324 ANSWNTDWGE 333

 Score = 82 (33.9 bits), Expect = 6.3e-11, Sum P(3) = 6.3e-11
 Identities = 18/42 (42%), Positives = 23/42 (54%)

Query:   147 DLPEAFD----WRAEGVISKVKEQGKCACCWAFSAVGVVEAM 184
             D+PE+FD    W     I  +++Q  C  CWAF   G VEAM
Sbjct:   104 DIPESFDSRDNWPKCDSIKVIRDQSSCGSCWAF---GAVEAM 142

 Score = 56 (24.8 bits), Expect = 6.3e-11, Sum P(3) = 6.3e-11
 Identities = 15/41 (36%), Positives = 23/41 (56%)

Query:   501 LATEKLVDCDMSNG-GCNGGRMDDALQYIIDNGGVVSDQAY 540
             L+ + L+ C  S G GCNGG    A +Y + +G +V+   Y
Sbjct:   158 LSADDLLSCCKSCGFGCNGGDPLAAWRYWVKDG-IVTGSNY 197


>UNIPROTKB|J9NSE7 [details] [associations]
            symbol:CTSC "Dipeptidyl peptidase 1" species:9615 "Canis
            lupus familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00560000076599 InterPro:IPR014882 Pfam:PF08773
            EMBL:AAEX03017125 Ensembl:ENSCAFT00000014269 OMA:INGQICH
            Uniprot:J9NSE7
        Length = 458

 Score = 170 (64.9 bits), Expect = 1.5e-13, Sum P(2) = 1.5e-13
 Identities = 51/159 (32%), Positives = 74/159 (46%)

Query:   501 LATEKLVDCDMSNGGCNGG-RMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXX 559
             L+ +++V C     GC GG     A +Y  D G +V +  + Y  S+S   C        
Sbjct:   279 LSPQEIVSCSQYAQGCEGGFPYLIAGKYAQDFG-LVDEACFSYAGSDSP--CKPNDCFHY 335

Query:   560 XXXXXXYSRIPYG--EEEEMKKWVATRGPLSVGMNA-NGLFYYSGGVI---DLNQRLCNP 613
                   Y    YG   E  MK  +   GP++V     +  F+Y  G+     L   + NP
Sbjct:   336 YSSEYHYVGGFYGACNEALMKLELVRHGPMAVAFEVYDDFFHYQKGIYYHTGLRDPI-NP 394

Query:   614 -KAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGE 651
              +  NHA+++VGYG +     + + YWIVKNSWGS WGE
Sbjct:   395 FELTNHAVLLVGYGTDS---ASGMDYWIVKNSWGSRWGE 430

 Score = 144 (55.7 bits), Expect = 9.3e-11, Sum P(2) = 9.3e-11
 Identities = 47/159 (29%), Positives = 72/159 (45%)

Query:   296 LPEAFDWR-AEGV--ISKVKEQGKCACCWAFSAVGVVEA-MHAIQGNSLTE-LSVQQLVD 350
             LP ++DWR   G   +S V+ Q  C  C+AF++  ++EA +  +  N+ T  LS Q++V 
Sbjct:   227 LPTSWDWRNVRGTNFVSPVRNQASCGSCYAFASTVMLEARIRILTNNTQTPILSPQEIVS 286

Query:   351 CDMSNGGCNGG-RMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYS 409
             C     GC GG     A +Y  D G +V +  + Y  S+S   C              Y 
Sbjct:   287 CSQYAQGCEGGFPYLIAGKYAQDFG-LVDEACFSYAGSDSP--CKPNDCFHYYSSEYHYV 343

Query:   410 RIPYG--EEEEMKKWVATRGPLSVGMNA-NGLFYYSGGV 445
                YG   E  MK  +   GP++V     +  F+Y  G+
Sbjct:   344 GGFYGACNEALMKLELVRHGPMAVAFEVYDDFFHYQKGI 382

 Score = 85 (35.0 bits), Expect = 9.3e-11, Sum P(2) = 9.3e-11
 Identities = 16/24 (66%), Positives = 17/24 (70%)

Query:   453 YGTS----IPYWIVKNSWGSDWGE 472
             YGT     + YWIVKNSWGS WGE
Sbjct:   407 YGTDSASGMDYWIVKNSWGSRWGE 430

 Score = 84 (34.6 bits), Expect = 1.5e-13, Sum P(2) = 1.5e-13
 Identities = 20/61 (32%), Positives = 33/61 (54%)

Query:   148 LPEAFDWR-AEGV--ISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQHHDKVY 204
             LP ++DWR   G   +S V+ Q  C  C+AF++  ++EA   I  NN T+  +    ++ 
Sbjct:   227 LPTSWDWRNVRGTNFVSPVRNQASCGSCYAFASTVMLEARIRILTNN-TQTPILSPQEIV 285

Query:   205 S 205
             S
Sbjct:   286 S 286


>UNIPROTKB|F1NT07 [details] [associations]
            symbol:LOC100857883 "Uncharacterized protein" species:9031
            "Gallus gallus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GeneTree:ENSGT00660000095458 EMBL:AADN02044012
            EMBL:AADN02044013 EMBL:AADN02044014 IPI:IPI00577314
            Ensembl:ENSGALT00000000192 OMA:IYKHGPV Uniprot:F1NT07
        Length = 317

 Score = 201 (75.8 bits), Expect = 1.7e-13, P = 1.7e-13
 Identities = 46/154 (29%), Positives = 73/154 (47%)

Query:   296 LPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDCDMSN 355
             LPE+ DWR  G ++ VK+Q  C  CW+F+  G +E    ++   LT LS Q L+DC    
Sbjct:    96 LPESLDWRMYGAVTPVKDQAVCGSCWSFATTGAMEGALFLKTGVLTPLSQQVLIDCSWGK 155

Query:   356 GG--CNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPY 413
             G   C+GG    A  +I  +GG+ S ++ P      + G               Y  +  
Sbjct:   156 GNYACDGGEEWRAKGWIKKHGGIASTESPPSFPLVLQNGLCHYNQSEMLAKITGYVNVTS 215

Query:   414 GEEEEMKKWVATRGPLSVGMNANG-LF-YYSGGV 445
             G    +K  +   GP++V ++A+   F +YS G+
Sbjct:   216 GNITAVKTAIYKHGPVAVSIDASHKTFSFYSNGI 249

 Score = 174 (66.3 bits), Expect = 2.6e-10, P = 2.6e-10
 Identities = 68/245 (27%), Positives = 110/245 (44%)

Query:   420 KKWVATRGPLSVGMNANGL-F---YYSGGVI--DLNQRLYGTSIPYWIVKNS--WGSDWG 471
             ++  A RG    G   +GL F   +Y+G ++   L+ R+YG   P   VK+    GS W 
Sbjct:    66 QEMAALRGRRRSGDPNHGLPFPAEHYTGIILPESLDWRMYGAVTP---VKDQAVCGSCWS 122

Query:   472 EKVEDKVGSSGNRTRDLEL-TGVLPSKLSRLATEKLVDCDMSNGG--CNGGRMDDALQYI 528
                     ++G     L L TGVL    + L+ + L+DC    G   C+GG    A  +I
Sbjct:   123 ------FATTGAMEGALFLKTGVL----TPLSQQVLIDCSWGKGNYACDGGEEWRAKGWI 172

Query:   529 IDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLS 588
               +GG+ S ++ P      + G               Y  +  G    +K  +   GP++
Sbjct:   173 KKHGGIASTESPPSFPLVLQNGLCHYNQSEMLAKITGYVNVTSGNITAVKTAIYKHGPVA 232

Query:   589 VGMNANG--LFYYSGGVIDLNQRLCNPKAQ-NHALIIVGYGEEEKKDGTSIPYWIVKNSW 645
             V ++A+     +YS G+     +  N   Q +HA++ VGYG  +   G +  YW++KNSW
Sbjct:   233 VSIDASHKTFSFYSNGIY-YEPKCANKPGQLDHAVLAVGYGVLQ---GET--YWLIKNSW 286

Query:   646 GSDWG 650
              + WG
Sbjct:   287 STYWG 291

 Score = 118 (46.6 bits), Expect = 0.00051, P = 0.00051
 Identities = 21/51 (41%), Positives = 30/51 (58%)

Query:   148 LPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
             LPE+ DWR  G ++ VK+Q  C  CW+F+  G +E    ++   LT LS Q
Sbjct:    96 LPESLDWRMYGAVTPVKDQAVCGSCWSFATTGAMEGALFLKTGVLTPLSQQ 146


>DICTYBASE|DDB_G0286055 [details] [associations]
            symbol:DDB_G0286055 "peptidase C1A family protein"
            species:44689 "Dictyostelium discoideum" [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            SMART:SM00645 dictyBase:DDB_G0286055 Pfam:PF00188 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 PANTHER:PTHR12411 EMBL:AAFI02000085
            Gene3D:3.40.33.10 InterPro:IPR001283 InterPro:IPR014044
            PRINTS:PR00837 SMART:SM00198 SUPFAM:SSF55797
            ProtClustDB:CLSZ2429919 RefSeq:XP_637918.1
            ProteinModelPortal:Q54MB6 EnsemblProtists:DDB0186794 GeneID:8625429
            KEGG:ddi:DDB_G0286055 InParanoid:Q54MB6 OMA:GENGFAR Uniprot:Q54MB6
        Length = 435

 Score = 177 (67.4 bits), Expect = 2.9e-10, P = 2.9e-10
 Identities = 51/170 (30%), Positives = 79/170 (46%)

Query:   299 AFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDC------- 351
             +FDWR  GV+   K+   CA  WAF+A G+ E+  A++     + S QQL+DC       
Sbjct:   211 SFDWRDNGVVGFPKDSSNCASGWAFTAAGIFESRSAMRTRHRYDYSAQQLIDCINVCIII 270

Query:   352 --DMSNGG---CN--GGRMDDALQYIIDNGGVVSDQAYPYKASESERGC-LXXXXXXXXX 403
               + S G    C+   G ++ AL Y     G+ +   YPY  + S  GC           
Sbjct:   271 FSNFSIGNYTKCSRFSGELNKALMYA-QAYGLQATSTYPYVGASSI-GCSYNQSSIAVEG 328

Query:   404 XXXXYSRIPYGEEEEMKKWVATRGPLSVGMNA-NGLFYYSGGVIDLNQRL 452
                 YS++  G +  ++K    +GP+ VG+   N   YY+GG+ + N  L
Sbjct:   329 GDVEYSQV--GRDSIVEK-CRKQGPVGVGIYVTNEFLYYAGGIFECNNTL 375

 Score = 152 (58.6 bits), Expect = 2.8e-13, Sum P(2) = 2.8e-13
 Identities = 42/135 (31%), Positives = 66/135 (48%)

Query:   519 GRMDDALQYIIDNGGVVSDQAYPYKASESERGC-LXXXXXXXXXXXXXYSRIPYGEEEEM 577
             G ++ AL Y     G+ +   YPY  + S  GC               YS++  G +  +
Sbjct:   287 GELNKALMYA-QAYGLQATSTYPYVGASSI-GCSYNQSSIAVEGGDVEYSQV--GRDSIV 342

Query:   578 KKWVATRGPLSVGMNA-NGLFYYSGGVIDLNQRLCNPKAQNHALIIVGYGEEEKKDGTSI 636
             +K    +GP+ VG+   N   YY+GG+ + N  L +    NH +++VGY E   KD    
Sbjct:   343 EK-CRKQGPVGVGIYVTNEFLYYAGGIFECNNTLIDNANINHNVLLVGYNE---KDN--- 395

Query:   637 PYWIVKNSWGSDWGE 651
              Y+I+KN++G  WGE
Sbjct:   396 -YYIIKNNFGRTWGE 409

 Score = 105 (42.0 bits), Expect = 4.3e-08, Sum P(2) = 4.3e-08
 Identities = 36/140 (25%), Positives = 60/140 (42%)

Query:   361 GRMDDALQYIIDNGGVVSDQAYPYKASESERGC-LXXXXXXXXXXXXXYSRIPYGEEEEM 419
             G ++ AL Y     G+ +   YPY  + S  GC               YS++  G +  +
Sbjct:   287 GELNKALMYA-QAYGLQATSTYPYVGASSI-GCSYNQSSIAVEGGDVEYSQV--GRDSIV 342

Query:   420 KKWVATRGPLSVGMNA-NGLFYYSGGVIDLNQRL--------------YGTSIPYWIVKN 464
             +K    +GP+ VG+   N   YY+GG+ + N  L              Y     Y+I+KN
Sbjct:   343 EK-CRKQGPVGVGIYVTNEFLYYAGGIFECNNTLIDNANINHNVLLVGYNEKDNYYIIKN 401

Query:   465 SWGSDWGEKVEDKVGSSGNR 484
             ++G  WGE    ++ +  N+
Sbjct:   402 NFGRTWGENGFARITADVNK 421

 Score = 100 (40.3 bits), Expect = 2.8e-13, Sum P(2) = 2.8e-13
 Identities = 18/48 (37%), Positives = 28/48 (58%)

Query:   151 AFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
             +FDWR  GV+   K+   CA  WAF+A G+ E+  A++  +  + S Q
Sbjct:   211 SFDWRDNGVVGFPKDSSNCASGWAFTAAGIFESRSAMRTRHRYDYSAQ 258


>UNIPROTKB|P53634 [details] [associations]
            symbol:CTSC "Dipeptidyl peptidase 1" species:9606 "Homo
            sapiens" [GO:0001913 "T cell mediated cytotoxicity" evidence=IEA]
            [GO:0004252 "serine-type endopeptidase activity" evidence=IEA]
            [GO:0005783 "endoplasmic reticulum" evidence=IEA] [GO:0005794
            "Golgi apparatus" evidence=IEA] [GO:0007568 "aging" evidence=IEA]
            [GO:0010033 "response to organic substance" evidence=IEA]
            [GO:0031404 "chloride ion binding" evidence=IEA] [GO:0042802
            "identical protein binding" evidence=IEA] [GO:0043621 "protein
            self-association" evidence=IEA] [GO:0005764 "lysosome"
            evidence=TAS] [GO:0006508 "proteolysis" evidence=IDA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IDA] [GO:0006955
            "immune response" evidence=TAS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0005783 GO:GO:0005794 Reactome:REACT_6900
            GO:GO:0006955 GO:GO:0007568 GO:GO:0010033 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0004252 GO:GO:0005764 GO:GO:0031404 CTD:1075
            HOVERGEN:HBG005248 KO:K01275 OrthoDB:EOG4H19VZ InterPro:IPR014882
            Pfam:PF08773 MEROPS:C01.070 EMBL:X87212 EMBL:U79415 EMBL:AF234263
            EMBL:AF234264 EMBL:AF254757 EMBL:AF525032 EMBL:AF525033
            EMBL:AK292117 EMBL:AK311923 EMBL:AK223038 EMBL:BX537913
            EMBL:AC011088 EMBL:CH471185 EMBL:BC054028 EMBL:BC100891
            EMBL:BC100892 EMBL:BC100893 EMBL:BC100894 EMBL:BC109386
            EMBL:BC110071 EMBL:BC113850 EMBL:BC113897 IPI:IPI00022810
            IPI:IPI00171323 IPI:IPI00872258 PIR:S23941 PIR:S66504
            RefSeq:NP_001107645.1 RefSeq:NP_001805.3 RefSeq:NP_680475.1
            UniGene:Hs.128065 PDB:1K3B PDB:2DJF PDB:2DJG PDB:3PDF PDBsum:1K3B
            PDBsum:2DJF PDBsum:2DJG PDBsum:3PDF ProteinModelPortal:P53634
            SMR:P53634 IntAct:P53634 MINT:MINT-4655964 STRING:P53634
            PhosphoSite:P53634 DMDM:1705632 PaxDb:P53634 PRIDE:P53634
            DNASU:1075 Ensembl:ENST00000227266 Ensembl:ENST00000524463
            Ensembl:ENST00000529974 GeneID:1075 KEGG:hsa:1075 UCSC:uc001pck.4
            UCSC:uc001pcm.4 GeneCards:GC11M088026 HGNC:HGNC:2528 HPA:CAB025364
            MIM:170650 MIM:245000 MIM:245010 MIM:602365 neXtProt:NX_P53634
            Orphanet:2342 Orphanet:678 PharmGKB:PA27028 HOGENOM:HOG000127503
            InParanoid:P53634 OMA:YDDFLHY PhylomeDB:P53634
            BioCyc:MetaCyc:HS03265-MONOMER SABIO-RK:P53634 BindingDB:P53634
            ChEMBL:CHEMBL2252 EvolutionaryTrace:P53634 GenomeRNAi:1075
            NextBio:4488 PMAP-CutDB:P53634 ArrayExpress:P53634 Bgee:P53634
            Genevestigator:P53634 GermOnline:ENSG00000109861 GO:GO:0001913
            Uniprot:P53634
        Length = 463

 Score = 159 (61.0 bits), Expect = 3.2e-13, Sum P(2) = 3.2e-13
 Identities = 48/159 (30%), Positives = 74/159 (46%)

Query:   501 LATEKLVDCDMSNGGCNGG-RMDDALQYIIDNGGVVSDQAYPYKASESERGC-LXXXXXX 558
             L+ +++V C     GC GG     A +Y  D G +V +  +PY  ++S   C +      
Sbjct:   283 LSPQEVVSCSQYAQGCEGGFPYLIAGKYAQDFG-LVEEACFPYTGTDSP--CKMKEDCFR 339

Query:   559 XXXXXXXYSRIPYG--EEEEMKKWVATRGPLSVGMNA-NGLFYYSGGVIDLN--QRLCNP 613
                    Y    YG   E  MK  +   GP++V     +   +Y  G+      +   NP
Sbjct:   340 YYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTGLRDPFNP 399

Query:   614 -KAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGE 651
              +  NHA+++VGYG +     + + YWIVKNSWG+ WGE
Sbjct:   400 FELTNHAVLLVGYGTDS---ASGMDYWIVKNSWGTGWGE 435

 Score = 150 (57.9 bits), Expect = 3.4e-11, Sum P(2) = 3.4e-11
 Identities = 47/160 (29%), Positives = 76/160 (47%)

Query:   296 LPEAFDWR-AEGV--ISKVKEQGKCACCWAFSAVGVVEA-MHAIQGNSLTE-LSVQQLVD 350
             LP ++DWR   G+  +S V+ Q  C  C++F+++G++EA +  +  NS T  LS Q++V 
Sbjct:   231 LPTSWDWRNVHGINFVSPVRNQASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEVVS 290

Query:   351 CDMSNGGCNGG-RMDDALQYIIDNGGVVSDQAYPYKASESERGC-LXXXXXXXXXXXXXY 408
             C     GC GG     A +Y  D G +V +  +PY  ++S   C +             Y
Sbjct:   291 CSQYAQGCEGGFPYLIAGKYAQDFG-LVEEACFPYTGTDSP--CKMKEDCFRYYSSEYHY 347

Query:   409 SRIPYG--EEEEMKKWVATRGPLSVGMNA-NGLFYYSGGV 445
                 YG   E  MK  +   GP++V     +   +Y  G+
Sbjct:   348 VGGFYGGCNEALMKLELVHHGPMAVAFEVYDDFLHYKKGI 387

 Score = 93 (37.8 bits), Expect = 3.2e-13, Sum P(2) = 3.2e-13
 Identities = 18/47 (38%), Positives = 30/47 (63%)

Query:   148 LPEAFDWR-AEGV--ISKVKEQGKCACCWAFSAVGVVEAMHAIQGNN 191
             LP ++DWR   G+  +S V+ Q  C  C++F+++G++EA   I  NN
Sbjct:   231 LPTSWDWRNVHGINFVSPVRNQASCGSCYSFASMGMLEARIRILTNN 277

 Score = 83 (34.3 bits), Expect = 3.4e-11, Sum P(2) = 3.4e-11
 Identities = 15/24 (62%), Positives = 17/24 (70%)

Query:   453 YGTS----IPYWIVKNSWGSDWGE 472
             YGT     + YWIVKNSWG+ WGE
Sbjct:   412 YGTDSASGMDYWIVKNSWGTGWGE 435


>WB|WBGene00044760 [details] [associations]
            symbol:Y71H2AM.25 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0004181
            "metallocarboxypeptidase activity" evidence=IEA] [GO:0008270 "zinc
            ion binding" evidence=IEA] [GO:0016021 "integral to membrane"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            SMART:SM00645 GO:GO:0008234 GO:GO:0006508 PANTHER:PTHR12411
            GeneTree:ENSGT00560000076599 EMBL:FO081822 eggNOG:NOG331187
            HOGENOM:HOG000114005 RefSeq:NP_001040887.1
            ProteinModelPortal:Q2AAB9 SMR:Q2AAB9 EnsemblMetazoa:Y71H2AM.25
            GeneID:4363054 KEGG:cel:CELE_Y71H2AM.25 UCSC:Y71H2AM.25 CTD:4363054
            WormBase:Y71H2AM.25 InParanoid:Q2AAB9 NextBio:959635 Uniprot:Q2AAB9
        Length = 299

 Score = 197 (74.4 bits), Expect = 3.3e-13, P = 3.3e-13
 Identities = 63/210 (30%), Positives = 96/210 (45%)

Query:   285 FQFNSLRHGDDLPEAF-DWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGN-SLTE 342
             FQ+ + ++     E F DWR +G++  VK+QGKC    AF+    +E+M+A   N SL  
Sbjct:    70 FQWKTPKYTIQTTEEFLDWRDKGIVGPVKDQGKCNASHAFAISSSIESMYAKATNGSLLS 129

Query:   343 LSVQQLVDCDMSNG--GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXX 400
              S QQL+DCD  +G  GC      +A+ Y I +G + ++  YPY   E+ + C       
Sbjct:   130 FSEQQLIDCD-DHGFKGCEEQPAINAVSYFIFHG-IETEADYPYAGKENGK-CTFDSTKS 186

Query:   401 XXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNAN-GLFYYSGGVID------------ 447
                     +      E + K+ V   GP    M A   L+ Y  G+ +            
Sbjct:   187 KIQLKD--AEFVVSNETQGKELVTNYGPAFFTMRAPPSLYDYKIGIYNPSIEECTSTHEI 244

Query:   448 LNQRLYGTSIP----YWIVKNSWGSDWGEK 473
              +  + G  I     YWIVK S+G+ WGE+
Sbjct:   245 RSMVIVGYGIEGVQKYWIVKGSFGTSWGEQ 274

 Score = 148 (57.2 bits), Expect = 1.9e-07, P = 1.9e-07
 Identities = 45/158 (28%), Positives = 72/158 (45%)

Query:   498 LSRLATEKLVDCDMSNG--GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXX 555
             L   + ++L+DCD  +G  GC      +A+ Y I +G + ++  YPY   E+ + C    
Sbjct:   127 LLSFSEQQLIDCD-DHGFKGCEEQPAINAVSYFIFHG-IETEADYPYAGKENGK-CTFDS 183

Query:   556 XXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNAN-GLFYYSGGVIDLNQRLCNPK 614
                        +      E + K+ V   GP    M A   L+ Y  G+ + +   C   
Sbjct:   184 TKSKIQLKD--AEFVVSNETQGKELVTNYGPAFFTMRAPPSLYDYKIGIYNPSIEECTST 241

Query:   615 AQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
              +  +++IVGYG E    G    YWIVK S+G+ WGE+
Sbjct:   242 HEIRSMVIVGYGIE----GVQ-KYWIVKGSFGTSWGEQ 274


>WB|WBGene00022189 [details] [associations]
            symbol:Y71H2AR.2 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0004181
            "metallocarboxypeptidase activity" evidence=IEA] [GO:0008270 "zinc
            ion binding" evidence=IEA] [GO:0016021 "integral to membrane"
            evidence=IEA] [GO:0008340 "determination of adult lifespan"
            evidence=IMP] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            SMART:SM00645 GO:GO:0008340 GO:GO:0008234 GO:GO:0006508
            PANTHER:PTHR12411 GeneTree:ENSGT00560000076599 HSSP:P07711
            eggNOG:NOG331187 HOGENOM:HOG000114005 EMBL:FO081570
            RefSeq:NP_497627.1 UniGene:Cel.28419 ProteinModelPortal:Q9BL26
            SMR:Q9BL26 EnsemblMetazoa:Y71H2AR.2 GeneID:190615
            KEGG:cel:CELE_Y71H2AR.2 UCSC:Y71H2AR.2 CTD:190615
            WormBase:Y71H2AR.2 InParanoid:Q9BL26 OMA:CAMATTI NextBio:946382
            Uniprot:Q9BL26
        Length = 345

 Score = 200 (75.5 bits), Expect = 3.7e-13, P = 3.7e-13
 Identities = 63/209 (30%), Positives = 94/209 (44%)

Query:   285 FQFNSLRHGDDLPEAF-DWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGN-SLTE 342
             FQ+ +  H D   E F DWR +G++  VK+QGKC    AF+    +E+M+A   N +L  
Sbjct:    70 FQWETPIHMDRTTEEFLDWREKGIVGPVKDQGKCNASHAFAITSSIESMYAKATNGTLLS 129

Query:   343 LSVQQLVDC-DMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXX 401
              S QQL+DC D    GC      +A+ Y+  +G + ++  YPY    +E+ C        
Sbjct:   130 FSEQQLIDCNDQGYKGCEEQFAMNAIGYLATHG-IETEADYPYVDKTNEK-CTFDSTKSK 187

Query:   402 XXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNAN-GLFYYSGGVID------------L 448
                      +  G E   K +V   GP    M A   L+ Y  G+ +             
Sbjct:   188 IHLKKGV--VAEGNEVLGKVYVTNYGPAFFTMRAPPSLYDYKIGIYNPSIEECTSTHEIR 245

Query:   449 NQRLYGTSIP----YWIVKNSWGSDWGEK 473
             +  + G  I     YWIVK S+G+ WGE+
Sbjct:   246 SMVIVGYGIEGEQKYWIVKGSFGTSWGEQ 274

 Score = 149 (57.5 bits), Expect = 2.2e-07, P = 2.2e-07
 Identities = 53/198 (26%), Positives = 86/198 (43%)

Query:   469 DWGEK-VEDKVGSSG--NRTRDLELTGVLPSKLSR--------LATEKLVDC-DMSNGGC 516
             DW EK +   V   G  N +    +T  + S  ++         + ++L+DC D    GC
Sbjct:    87 DWREKGIVGPVKDQGKCNASHAFAITSSIESMYAKATNGTLLSFSEQQLIDCNDQGYKGC 146

Query:   517 NGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYGEEEE 576
                   +A+ Y+  +G + ++  YPY    +E+ C                 +  G E  
Sbjct:   147 EEQFAMNAIGYLATHG-IETEADYPYVDKTNEK-CTFDSTKSKIHLKKGV--VAEGNEVL 202

Query:   577 MKKWVATRGPLSVGMNAN-GLFYYSGGVIDLNQRLCNPKAQNHALIIVGYG-EEEKKDGT 634
              K +V   GP    M A   L+ Y  G+ + +   C    +  +++IVGYG E E+K   
Sbjct:   203 GKVYVTNYGPAFFTMRAPPSLYDYKIGIYNPSIEECTSTHEIRSMVIVGYGIEGEQK--- 259

Query:   635 SIPYWIVKNSWGSDWGEK 652
                YWIVK S+G+ WGE+
Sbjct:   260 ---YWIVKGSFGTSWGEQ 274


>ZFIN|ZDB-GENE-070323-1 [details] [associations]
            symbol:ctsbb "capthepsin B, b" species:7955 "Danio
            rerio" [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=IEA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0050790
            "regulation of catalytic activity" evidence=IEA] [GO:0008233
            "peptidase activity" evidence=IEA] [GO:0016787 "hydrolase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR012599
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 Pfam:PF08127
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            ZFIN:ZDB-GENE-070323-1 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0050790 GO:GO:0004197
            GeneTree:ENSGT00560000076599 PANTHER:PTHR12411:SF16 OMA:CCGFLCG
            EMBL:CU207296 EMBL:CABZ01037785 IPI:IPI00877452
            Ensembl:ENSDART00000097263 Bgee:F1QZT5 Uniprot:F1QZT5
        Length = 326

 Score = 127 (49.8 bits), Expect = 3.9e-13, Sum P(2) = 3.9e-13
 Identities = 34/116 (29%), Positives = 60/116 (51%)

Query:   288 NSLRHGDD--LPEAFDWRAEG----VISKVKEQGKCACCWAFSAVGVVE---AMHAIQGN 338
             ++++H  +  LP++FD R +      ++++++QG C  CWAF AV  +     +H+ +G 
Sbjct:    65 HTVKHSTNVKLPDSFDLRDQWPNCKTLNQIRDQGSCGSCWAFGAVESISDRICIHS-KGK 123

Query:   339 SLTELSVQQLVDC-DMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGC 393
                E+S + L+ C D    GC+GG   +A  Y     G+V+   Y      S+ GC
Sbjct:   124 QSPEISAEDLLSCCDQCGFGCSGGFPAEAWDYW-RRSGLVTGGLY-----NSDVGC 173

 Score = 121 (47.7 bits), Expect = 3.9e-13, Sum P(2) = 3.9e-13
 Identities = 30/84 (35%), Positives = 44/84 (52%)

Query:   569 IPYGEEEEMKKWVATRGPLSVGMNANGLF-YYSGGVIDLNQRLCNPKAQNHALIIVGYGE 627
             +P  +++ M + + T GP+         F  Y  GV    Q L       HA+ I+G+GE
Sbjct:   226 VPSDQQQIMTE-LYTNGPVEAAFTVYEDFPLYKSGVY---QHLTGSALGGHAVKILGWGE 281

Query:   628 EEKKDGTSIPYWIVKNSWGSDWGE 651
             E   +GT  P+W+V NSW SDWG+
Sbjct:   282 E---NGT--PFWLVANSWNSDWGD 300

 Score = 86 (35.3 bits), Expect = 7.3e-10, Sum P(3) = 7.3e-10
 Identities = 19/68 (27%), Positives = 39/68 (57%)

Query:   140 NSLRHGDD--LPEAFDWRAEG----VISKVKEQGKCACCWAFSAVGVVE---AMHAIQGN 190
             ++++H  +  LP++FD R +      ++++++QG C  CWAF AV  +     +H+ +G 
Sbjct:    65 HTVKHSTNVKLPDSFDLRDQWPNCKTLNQIRDQGSCGSCWAFGAVESISDRICIHS-KGK 123

Query:   191 NLTELSVQ 198
                E+S +
Sbjct:   124 QSPEISAE 131

 Score = 76 (31.8 bits), Expect = 1.7e-08, Sum P(2) = 1.7e-08
 Identities = 10/15 (66%), Positives = 13/15 (86%)

Query:   458 PYWIVKNSWGSDWGE 472
             P+W+V NSW SDWG+
Sbjct:   286 PFWLVANSWNSDWGD 300

 Score = 48 (22.0 bits), Expect = 7.3e-10, Sum P(3) = 7.3e-10
 Identities = 16/52 (30%), Positives = 24/52 (46%)

Query:   501 LATEKLVDC-DMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGC 551
             ++ E L+ C D    GC+GG   +A  Y     G+V+   Y      S+ GC
Sbjct:   128 ISAEDLLSCCDQCGFGCSGGFPAEAWDYW-RRSGLVTGGLY-----NSDVGC 173


>FB|FBgn0034709 [details] [associations]
            symbol:Swim "Secreted Wg-interacting molecule" species:7227
            "Drosophila melanogaster" [GO:0004197 "cysteine-type endopeptidase
            activity" evidence=ISS] [GO:0006955 "immune response" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0005044 "scavenger
            receptor activity" evidence=IEA] [GO:0030247 "polysaccharide
            binding" evidence=IEA] [GO:0042600 "chorion" evidence=IDA]
            [GO:0035593 "positive regulation of Wnt receptor signaling pathway
            by establishment of Wnt protein localization to extracellular
            region" evidence=IMP] [GO:0030177 "positive regulation of Wnt
            receptor signaling pathway" evidence=IDA] [GO:0005615
            "extracellular space" evidence=IDA] [GO:0017147 "Wnt-protein
            binding" evidence=IDA] InterPro:IPR000668 InterPro:IPR001212
            InterPro:IPR013128 Pfam:PF00112 PROSITE:PS50958 SMART:SM00201
            SMART:SM00645 EMBL:AE013599 GO:GO:0005615 GO:GO:0006955
            GO:GO:0030247 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00639 GO:GO:0017147 GO:GO:0005044
            GeneTree:ENSGT00560000076599 GO:GO:0042600 eggNOG:NOG310046
            OMA:DNCNRCT HSSP:P80067 EMBL:AY113377 RefSeq:NP_611652.2
            RefSeq:NP_726176.1 UniGene:Dm.732 SMR:Q7JWQ7 IntAct:Q7JWQ7
            EnsemblMetazoa:FBtr0071784 EnsemblMetazoa:FBtr0071785 GeneID:37537
            KEGG:dme:Dmel_CG3074 UCSC:CG3074-RA FlyBase:FBgn0034709
            HOGENOM:HOG000264150 InParanoid:Q7JWQ7 OrthoDB:EOG48CZ9P
            GenomeRNAi:37537 NextBio:804155 GO:GO:0035593 Uniprot:Q7JWQ7
        Length = 431

 Score = 173 (66.0 bits), Expect = 6.0e-13, Sum P(2) = 6.0e-13
 Identities = 50/167 (29%), Positives = 71/167 (42%)

Query:   500 RLATEKLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPY-------KASESER--- 549
             +L+ + ++ C     GC GG +D A +Y+    GVV +  YPY       K   + R   
Sbjct:   237 QLSAQNILSCTRRQQGCEGGHLDAAWRYL-HKKGVVDENCYPYTQHRDTCKIRHNSRSLR 295

Query:   550 --GCLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLFY-YSGGVIDL 606
               GC                      E ++   +   GP+   M  N  F+ YSGGV   
Sbjct:   296 ANGCQKPVNVDRDSLYTVGPAYSLNREADIMAEIFHSGPVQATMRVNRDFFAYSGGVY-- 353

Query:   607 NQRLCNPKAQN--HALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGE 651
              +   N KA    H++ +VG+GEE   +     YWI  NSWGS WGE
Sbjct:   354 RETAANRKAPTGFHSVKLVGWGEEHNGE----KYWIAANSWGSWWGE 396

 Score = 159 (61.0 bits), Expect = 5.8e-11, Sum P(2) = 5.8e-11
 Identities = 56/199 (28%), Positives = 76/199 (38%)

Query:   294 DDLPEAFDW--RAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQ--GNSLTELSVQQLV 349
             D LP +F+   +    IS+V +QG C   W  S   V     AIQ  G    +LS Q ++
Sbjct:   185 DGLPSSFNALDKWSSYISEVPDQGWCGASWVLSTTSVASDRFAIQSKGKENVQLSAQNIL 244

Query:   350 DCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPY-------KASESER-----GCLXXX 397
              C     GC GG +D A +Y+    GVV +  YPY       K   + R     GC    
Sbjct:   245 SCTRRQQGCEGGHLDAAWRYL-HKKGVVDENCYPYTQHRDTCKIRHNSRSLRANGCQKPV 303

Query:   398 XXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLFY-YSGGVIDLNQRLYGTS 456
                               E ++   +   GP+   M  N  F+ YSGGV           
Sbjct:   304 NVDRDSLYTVGPAYSLNREADIMAEIFHSGPVQATMRVNRDFFAYSGGVYRETAANRKAP 363

Query:   457 IPYWIVKN-SWGSDW-GEK 473
               +  VK   WG +  GEK
Sbjct:   364 TGFHSVKLVGWGEEHNGEK 382

 Score = 74 (31.1 bits), Expect = 6.0e-13, Sum P(2) = 6.0e-13
 Identities = 20/58 (34%), Positives = 27/58 (46%)

Query:   146 DDLPEAFDW--RAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQ--GNNLTELSVQH 199
             D LP +F+   +    IS+V +QG C   W  S   V     AIQ  G    +LS Q+
Sbjct:   185 DGLPSSFNALDKWSSYISEVPDQGWCGASWVLSTTSVASDRFAIQSKGKENVQLSAQN 242

 Score = 70 (29.7 bits), Expect = 5.8e-11, Sum P(2) = 5.8e-11
 Identities = 11/14 (78%), Positives = 11/14 (78%)

Query:   459 YWIVKNSWGSDWGE 472
             YWI  NSWGS WGE
Sbjct:   383 YWIAANSWGSWWGE 396


>UNIPROTKB|F1STR1 [details] [associations]
            symbol:CTSC "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0004252
            "serine-type endopeptidase activity" evidence=IEA] [GO:0001913 "T
            cell mediated cytotoxicity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0004252
            GeneTree:ENSGT00560000076599 KO:K01275 InterPro:IPR014882
            Pfam:PF08773 OMA:YDDFLHY GO:GO:0001913 EMBL:CU855751
            RefSeq:XP_003129789.1 UniGene:Ssc.6155 Ensembl:ENSSSCT00000016280
            GeneID:100522387 KEGG:ssc:100522387 Uniprot:F1STR1
        Length = 463

 Score = 160 (61.4 bits), Expect = 6.4e-13, Sum P(2) = 6.4e-13
 Identities = 47/160 (29%), Positives = 73/160 (45%)

Query:   501 LATEKLVDCDMSNGGCNGG-RMDDALQYIIDNGGVVSDQAYPYKASES----ERGCLXXX 555
             L+ +++V C     GC GG     A +Y  D G +V +  +PY  ++S    + GC    
Sbjct:   283 LSPQEVVSCSQYAQGCAGGFPYLIAGKYAQDFG-LVEEACFPYTGTDSPCTVKEGCFRYY 341

Query:   556 XXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNA-NGLFYYSGGVIDLN--QRLCN 612
                       Y       E  MK  +   GP++V     +   +Y  G+      +   N
Sbjct:   342 SSEYHYVGGFYGGC---NEALMKLELVHHGPMAVAFEVYDDFLHYRKGIYHHTGLRDPFN 398

Query:   613 P-KAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGE 651
             P +  NHA+++VGYG +     + + YWIVKNSWG+ WGE
Sbjct:   399 PFELTNHAVLLVGYGTDL---ASGMDYWIVKNSWGTSWGE 435

 Score = 144 (55.7 bits), Expect = 9.7e-11, Sum P(2) = 9.7e-11
 Identities = 44/161 (27%), Positives = 74/161 (45%)

Query:   296 LPEAFDWR-AEGV--ISKVKEQGKCACCWAFSAVGVVEA-MHAIQGNSLTE-LSVQQLVD 350
             LP ++DWR   G   ++ V+ Q  C  C++F+++G++EA +  +  N+ T  LS Q++V 
Sbjct:   231 LPASWDWRNVRGTNFVTPVRNQASCGSCYSFASMGMMEARIRILTNNTQTPILSPQEVVS 290

Query:   351 CDMSNGGCNGG-RMDDALQYIIDNGGVVSDQAYPYKASES----ERGCLXXXXXXXXXXX 405
             C     GC GG     A +Y  D G +V +  +PY  ++S    + GC            
Sbjct:   291 CSQYAQGCAGGFPYLIAGKYAQDFG-LVEEACFPYTGTDSPCTVKEGCFRYYSSEYHYVG 349

Query:   406 XXYSRIPYGEEEEMKKWVATRGPLSVGMNA-NGLFYYSGGV 445
               Y       E  MK  +   GP++V     +   +Y  G+
Sbjct:   350 GFYGGC---NEALMKLELVHHGPMAVAFEVYDDFLHYRKGI 387

 Score = 89 (36.4 bits), Expect = 6.4e-13, Sum P(2) = 6.4e-13
 Identities = 20/61 (32%), Positives = 35/61 (57%)

Query:   148 LPEAFDWR-AEGV--ISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQHHDKVY 204
             LP ++DWR   G   ++ V+ Q  C  C++F+++G++EA   I  NN T+  +    +V 
Sbjct:   231 LPASWDWRNVRGTNFVTPVRNQASCGSCYSFASMGMMEARIRILTNN-TQTPILSPQEVV 289

Query:   205 S 205
             S
Sbjct:   290 S 290

 Score = 85 (35.0 bits), Expect = 9.7e-11, Sum P(2) = 9.7e-11
 Identities = 15/24 (62%), Positives = 17/24 (70%)

Query:   453 YGTSIP----YWIVKNSWGSDWGE 472
             YGT +     YWIVKNSWG+ WGE
Sbjct:   412 YGTDLASGMDYWIVKNSWGTSWGE 435


>WB|WBGene00013072 [details] [associations]
            symbol:Y51A2D.1 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 GeneTree:ENSGT00560000076599
            EMBL:AL021497 RefSeq:NP_001256811.1 ProteinModelPortal:O62484
            SMR:O62484 MEROPS:C01.A37 EnsemblMetazoa:Y51A2D.1 GeneID:180204
            KEGG:cel:CELE_Y51A2D.1 UCSC:Y51A2D.1 CTD:180204 WormBase:Y51A2D.1a
            HOGENOM:HOG000019851 NextBio:908416 Uniprot:O62484
        Length = 314

 Score = 132 (51.5 bits), Expect = 7.7e-13, Sum P(2) = 7.7e-13
 Identities = 38/151 (25%), Positives = 69/151 (45%)

Query:    45 FLNFMRDHDKVYSSVEDLLRRHENFV---TNVEKAEDYQREDSGTAVFEVNKFFDLSDSD 101
             F+ F +   + Y S  +   R +NFV    NV +     ++    + F VN+F DL+ S+
Sbjct:    44 FVEFKKKFSRTYKSEAENQLRLQNFVKSRNNVVRLNKNAQKAGRNSNFAVNQFSDLTTSE 103

Query:   102 LQQLTGLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGV-- 159
             L Q         L    P+L      ++   ++   +  + R   +    FD R++ V  
Sbjct:   104 LHQ--------RLSRFPPNLTENSVFHKNFKKLLG-KTRTKRQNSEFARNFDLRSQKVNG 154

Query:   160 ---ISKVKEQGKCACCWAFSAVGVVEAMHAI 187
                +  +K QG+CACCW F+   ++E ++A+
Sbjct:   155 RYIVGPIKNQGQCACCWGFAVTAMLETIYAV 185

 Score = 120 (47.3 bits), Expect = 1.8e-11, Sum P(2) = 1.8e-11
 Identities = 36/142 (25%), Positives = 64/142 (45%)

Query:   202 KVYSSVEDLLRRHENFV---TNVEKAEDYQSEDSGTAVFGVNKFFDLSESDLQQLTGLNL 258
             + Y S  +   R +NFV    NV +      +    + F VN+F DL+ S+L Q      
Sbjct:    53 RTYKSEAENQLRLQNFVKSRNNVVRLNKNAQKAGRNSNFAVNQFSDLTTSELHQ------ 106

Query:   259 DSTLEDIQPSLQAPFSSNQTDTEMRAFQFNSLRHGDDLPEAFDWRAEGV-----ISKVKE 313
                L    P+L      ++   ++   +  + R   +    FD R++ V     +  +K 
Sbjct:   107 --RLSRFPPNLTENSVFHKNFKKLLG-KTRTKRQNSEFARNFDLRSQKVNGRYIVGPIKN 163

Query:   314 QGKCACCWAFSAVGVVEAMHAI 335
             QG+CACCW F+   ++E ++A+
Sbjct:   164 QGQCACCWGFAVTAMLETIYAV 185

 Score = 112 (44.5 bits), Expect = 7.7e-13, Sum P(2) = 7.7e-13
 Identities = 33/85 (38%), Positives = 43/85 (50%)

Query:   570 PYGEEEEMKKWVAT-RGPLSVGMNANGLFY-YSGGVIDLNQRLCNPKAQN-HALIIVGYG 626
             P   E E+ + + T + P++V   A   F  Y  GV+      C+      HA  IVGYG
Sbjct:   201 PENAESEIIEILNTWKTPVAVYFAAGTAFLQYKSGVLVTED--CDLAGTVWHAGAIVGYG 258

Query:   627 EEEKKDGTSIPYWIVKNSWG-SDWG 650
             EE    G S  +WI+KNSWG S WG
Sbjct:   259 EENDLRGRSQRFWIMKNSWGVSGWG 283

 Score = 78 (32.5 bits), Expect = 2.5e-09, Sum P(2) = 2.5e-09
 Identities = 15/38 (39%), Positives = 23/38 (60%)

Query:   437 GLFYYSGGVIDLNQR--LYGTSIPYWIVKNSWG-SDWG 471
             G  +++G ++   +   L G S  +WI+KNSWG S WG
Sbjct:   246 GTVWHAGAIVGYGEENDLRGRSQRFWIMKNSWGVSGWG 283


>UNIPROTKB|F1N9D7 [details] [associations]
            symbol:CTSB "Cathepsin B" species:9031 "Gallus gallus"
            [GO:0050790 "regulation of catalytic activity" evidence=IEA]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0005730 "nucleolus"
            evidence=IEA] [GO:0005739 "mitochondrion" evidence=IEA] [GO:0005764
            "lysosome" evidence=IEA] [GO:0097067 "cellular response to thyroid
            hormone stimulus" evidence=IEA] InterPro:IPR000668
            InterPro:IPR012599 InterPro:IPR013128 InterPro:IPR015643
            Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0005739 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0050790 GO:GO:0005764
            GO:GO:0004197 GeneTree:ENSGT00560000076599 OMA:GYPSGAW
            GO:GO:0097067 PANTHER:PTHR12411:SF16 IPI:IPI00573387
            EMBL:AADN02018292 Ensembl:ENSGALT00000026896
            Ensembl:ENSGALT00000036723 Uniprot:F1N9D7
        Length = 340

 Score = 136 (52.9 bits), Expect = 7.8e-13, Sum P(2) = 7.8e-13
 Identities = 37/107 (34%), Positives = 53/107 (49%)

Query:   295 DLPEAFDWRAEG----VISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLT--ELSVQQL 348
             DLP+ FD R +      IS++++QG C  CWAF AV  +     +  N+    E+S + L
Sbjct:    79 DLPDTFDSRKQWPNCPTISEIRDQGSCGSCWAFGAVEAISDRICVHTNAKVSVEVSAEDL 138

Query:   349 VDC-DMSNG-GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGC 393
             + C     G GCNGG    A +Y  + G +VS   Y     +S  GC
Sbjct:   139 LSCCGFECGMGCNGGYPSGAWRYWTERG-LVSGGLY-----DSHVGC 179

 Score = 109 (43.4 bits), Expect = 7.8e-13, Sum P(2) = 7.8e-13
 Identities = 27/84 (32%), Positives = 41/84 (48%)

Query:   569 IPYGEEEEMKKWVATRGPLSVGMNA-NGLFYYSGGVIDLNQRLCNPKAQNHALIIVGYGE 627
             +P  E+E M + +   GP+            Y  GV    Q +   +   HA+ I+G+G 
Sbjct:   233 VPRSEKEIMAE-IYKNGPVEGAFIVYEDFLMYKSGVY---QHVSGEQVGGHAIRILGWGV 288

Query:   628 EEKKDGTSIPYWIVKNSWGSDWGE 651
             E   +GT  PYW+  NSW +DWG+
Sbjct:   289 E---NGT--PYWLAANSWNTDWGD 307

 Score = 97 (39.2 bits), Expect = 1.5e-08, Sum P(2) = 1.5e-08
 Identities = 20/59 (33%), Positives = 29/59 (49%)

Query:   147 DLPEAFDWRAEG----VISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQHHD 201
             DLP+ FD R +      IS++++QG C  CWAF AV  +     +  N    + V   D
Sbjct:    79 DLPDTFDSRKQWPNCPTISEIRDQGSCGSCWAFGAVEAISDRICVHTNAKVSVEVSAED 137

 Score = 73 (30.8 bits), Expect = 4.1e-09, Sum P(2) = 4.1e-09
 Identities = 9/15 (60%), Positives = 12/15 (80%)

Query:   458 PYWIVKNSWGSDWGE 472
             PYW+  NSW +DWG+
Sbjct:   293 PYWLAANSWNTDWGD 307

 Score = 52 (23.4 bits), Expect = 0.00059, Sum P(2) = 0.00059
 Identities = 20/59 (33%), Positives = 29/59 (49%)

Query:   496 SKLS-RLATEKLVDC-DMSNG-GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGC 551
             +K+S  ++ E L+ C     G GCNGG    A +Y  + G +VS   Y     +S  GC
Sbjct:   127 AKVSVEVSAEDLLSCCGFECGMGCNGGYPSGAWRYWTERG-LVSGGLY-----DSHVGC 179


>UNIPROTKB|P43233 [details] [associations]
            symbol:CTSB "Cathepsin B" species:9031 "Gallus gallus"
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=IEA]
            [GO:0050790 "regulation of catalytic activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0005764 "lysosome"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR012599
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 Pfam:PF08127
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 eggNOG:COG4870
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0050790 GO:GO:0005764 GO:GO:0004197 MEROPS:C01.060 CTD:1508
            HOGENOM:HOG000241341 HOVERGEN:HBG003480 KO:K01363 OrthoDB:EOG4K6G4C
            PANTHER:PTHR12411:SF16 EMBL:U18083 IPI:IPI00573387 PIR:S58770
            RefSeq:NP_990702.1 UniGene:Gga.3854 ProteinModelPortal:P43233
            SMR:P43233 STRING:P43233 PRIDE:P43233 GeneID:396329 KEGG:gga:396329
            InParanoid:P43233 NextBio:20816377 Uniprot:P43233
        Length = 340

 Score = 137 (53.3 bits), Expect = 9.7e-13, Sum P(2) = 9.7e-13
 Identities = 37/107 (34%), Positives = 53/107 (49%)

Query:   295 DLPEAFDWRAEG----VISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLT--ELSVQQL 348
             DLP+ FD R +      IS++++QG C  CWAF AV  +     +  N+    E+S + L
Sbjct:    79 DLPDTFDTRKQWPNCPTISEIRDQGSCGSCWAFGAVEAISDRICVHTNAKVSVEVSAEDL 138

Query:   349 VDC-DMSNG-GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGC 393
             + C     G GCNGG    A +Y  + G +VS   Y     +S  GC
Sbjct:   139 LSCCGFECGMGCNGGYPSGAWRYWTERG-LVSGGLY-----DSHVGC 179

 Score = 107 (42.7 bits), Expect = 9.7e-13, Sum P(2) = 9.7e-13
 Identities = 27/83 (32%), Positives = 40/83 (48%)

Query:   569 IPYGEEEEMKKWVATRGPLSVGMNA-NGLFYYSGGVIDLNQRLCNPKAQNHALIIVGYGE 627
             +P  E+E M + +   GP+            Y  GV    Q +   +   HA+ I+G+G 
Sbjct:   233 VPRSEKEIMAE-IYKNGPVEGAFIVYEDFLMYKSGVY---QHVSGEQVGGHAIRILGWGV 288

Query:   628 EEKKDGTSIPYWIVKNSWGSDWG 650
             E   +GT  PYW+  NSW +DWG
Sbjct:   289 E---NGT--PYWLAANSWNTDWG 306

 Score = 98 (39.6 bits), Expect = 1.9e-08, Sum P(2) = 1.9e-08
 Identities = 20/59 (33%), Positives = 29/59 (49%)

Query:   147 DLPEAFDWRAEG----VISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQHHD 201
             DLP+ FD R +      IS++++QG C  CWAF AV  +     +  N    + V   D
Sbjct:    79 DLPDTFDTRKQWPNCPTISEIRDQGSCGSCWAFGAVEAISDRICVHTNAKVSVEVSAED 137

 Score = 71 (30.1 bits), Expect = 5.1e-09, Sum P(2) = 5.1e-09
 Identities = 9/14 (64%), Positives = 11/14 (78%)

Query:   458 PYWIVKNSWGSDWG 471
             PYW+  NSW +DWG
Sbjct:   293 PYWLAANSWNTDWG 306

 Score = 52 (23.4 bits), Expect = 0.00097, Sum P(2) = 0.00097
 Identities = 20/59 (33%), Positives = 29/59 (49%)

Query:   496 SKLS-RLATEKLVDC-DMSNG-GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGC 551
             +K+S  ++ E L+ C     G GCNGG    A +Y  + G +VS   Y     +S  GC
Sbjct:   127 AKVSVEVSAEDLLSCCGFECGMGCNGGYPSGAWRYWTERG-LVSGGLY-----DSHVGC 179


>UNIPROTKB|E2R6Q7 [details] [associations]
            symbol:CTSB "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0097067 "cellular response to thyroid hormone
            stimulus" evidence=IEA] [GO:0005764 "lysosome" evidence=IEA]
            [GO:0005739 "mitochondrion" evidence=IEA] [GO:0005730 "nucleolus"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0050790
            "regulation of catalytic activity" evidence=IEA] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR012599 InterPro:IPR013128
            InterPro:IPR015643 Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0005739 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0050790
            GO:GO:0005764 GO:GO:0004197 CTD:1508 GeneTree:ENSGT00560000076599
            KO:K01363 OMA:GYPSGAW GO:GO:0097067 PANTHER:PTHR12411:SF16
            EMBL:AAEX03014318 RefSeq:XP_543203.3 Ensembl:ENSCAFT00000012692
            GeneID:486077 KEGG:cfa:486077 NextBio:20859923 Uniprot:E2R6Q7
        Length = 339

 Score = 129 (50.5 bits), Expect = 1.1e-12, Sum P(2) = 1.1e-12
 Identities = 35/106 (33%), Positives = 52/106 (49%)

Query:   296 LPEAFDWRAEG----VISKVKEQGKCACCWAFSAVGVVEAMHAIQ--GNSLTELSVQQLV 349
             LPE+FD R +      I ++++QG C  CWAF AV  +     I+  G+   E+S + ++
Sbjct:    80 LPESFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRTNGHVNVEVSAEDML 139

Query:   350 DC--DMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGC 393
              C  D    GCNGG   +A  +    G +VS   Y     +S  GC
Sbjct:   140 TCCGDQCGDGCNGGFPAEAWNFWTKQG-LVSGGLY-----DSHVGC 179

 Score = 115 (45.5 bits), Expect = 1.1e-12, Sum P(2) = 1.1e-12
 Identities = 28/79 (35%), Positives = 39/79 (49%)

Query:   574 EEEMKKWVATRGPLSVGMNANGLFY-YSGGVIDLNQRLCNPKAQNHALIIVGYGEEEKKD 632
             E+E+   +   GP+         F  Y  GV    Q +       HA+ I+G+G E   D
Sbjct:   236 EKEIMAEIYKNGPVEAAFTVYSDFLLYKSGVY---QHVTGEMMGGHAVRILGWGVE---D 289

Query:   633 GTSIPYWIVKNSWGSDWGE 651
             GT  PYW+V NSW +DWG+
Sbjct:   290 GT--PYWLVGNSWNTDWGD 306

 Score = 94 (38.1 bits), Expect = 7.0e-10, Sum P(3) = 7.0e-10
 Identities = 20/58 (34%), Positives = 29/58 (50%)

Query:   148 LPEAFDWRAEG----VISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQHHD 201
             LPE+FD R +      I ++++QG C  CWAF AV  +     I+ N    + V   D
Sbjct:    80 LPESFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRTNGHVNVEVSAED 137

 Score = 76 (31.8 bits), Expect = 1.2e-08, Sum P(2) = 1.2e-08
 Identities = 10/15 (66%), Positives = 13/15 (86%)

Query:   458 PYWIVKNSWGSDWGE 472
             PYW+V NSW +DWG+
Sbjct:   292 PYWLVGNSWNTDWGD 306

 Score = 48 (22.0 bits), Expect = 7.0e-10, Sum P(3) = 7.0e-10
 Identities = 16/53 (30%), Positives = 24/53 (45%)

Query:   501 LATEKLVDC--DMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGC 551
             ++ E ++ C  D    GCNGG   +A  +    G +VS   Y     +S  GC
Sbjct:   133 VSAEDMLTCCGDQCGDGCNGGFPAEAWNFWTKQG-LVSGGLY-----DSHVGC 179


>UNIPROTKB|I3L9E7 [details] [associations]
            symbol:LOC100153159 "Uncharacterized protein" species:9823
            "Sus scrofa" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411
            PROSITE:PS00640 GeneTree:ENSGT00560000076599 OMA:WGQLTSS
            Ensembl:ENSSSCT00000031207 Uniprot:I3L9E7
        Length = 358

 Score = 134 (52.2 bits), Expect = 2.6e-12, Sum P(2) = 2.6e-12
 Identities = 41/112 (36%), Positives = 51/112 (45%)

Query:   289 SLRHGDDLPEAF--DWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQ--GNSLTELS 344
             SL    DLPE F   ++  G      +Q  CA  WAFS   V     AIQ  G     LS
Sbjct:    92 SLPETTDLPEFFVASYKWPGWTHGPLDQKNCAASWAFSTASVAADRIAIQSEGRYTANLS 151

Query:   345 VQQLVDCDMSNG-GCNGGRMDDALQYIIDNGGVVSDQAYP-YK-ASESERGC 393
              Q L+ C   N  GCN G +D A  Y+   G +VS   YP +K  + +  GC
Sbjct:   152 PQNLISCCAKNRHGCNSGSIDRAWWYLRKRG-LVSHACYPLFKDQNATNNGC 202

 Score = 107 (42.7 bits), Expect = 2.6e-12, Sum P(2) = 2.6e-12
 Identities = 27/90 (30%), Positives = 41/90 (45%)

Query:   568 RIPYGEEEEMKKWVATRGPLSVGMNAN-GLFYYSGG----VIDLNQRLCN-PKAQNHALI 621
             R+   E E M++ +   GP+   M  +   F+Y  G    V   N+      K + HA+ 
Sbjct:   238 RVSSNETEIMRE-IMQNGPVQAIMQVHEDFFHYKTGIYRHVTSTNEESDKYRKLRTHAVK 296

Query:   622 IVGYGEEEKKDGTSIPYWIVKNSWGSDWGE 651
             + G+G  +   G    +WI  NSWG  WGE
Sbjct:   297 LTGWGTLKGAQGRKEKFWIAANSWGKSWGE 326

 Score = 73 (30.8 bits), Expect = 7.2e-09, Sum P(3) = 7.2e-09
 Identities = 19/56 (33%), Positives = 29/56 (51%)

Query:   499 SRLATEKLVDCDMSNG-GCNGGRMDDALQYIIDNGGVVSDQAYP-YK-ASESERGC 551
             + L+ + L+ C   N  GCN G +D A  Y+   G +VS   YP +K  + +  GC
Sbjct:   148 ANLSPQNLISCCAKNRHGCNSGSIDRAWWYLRKRG-LVSHACYPLFKDQNATNNGC 202

 Score = 69 (29.3 bits), Expect = 7.2e-09, Sum P(3) = 7.2e-09
 Identities = 23/63 (36%), Positives = 27/63 (42%)

Query:   141 SLRHGDDLPEAF--DWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQ--GNNLTELS 196
             SL    DLPE F   ++  G      +Q  CA  WAFS   V     AIQ  G     LS
Sbjct:    92 SLPETTDLPEFFVASYKWPGWTHGPLDQKNCAASWAFSTASVAADRIAIQSEGRYTANLS 151

Query:   197 VQH 199
              Q+
Sbjct:   152 PQN 154

 Score = 67 (28.6 bits), Expect = 3.5e-08, Sum P(2) = 3.5e-08
 Identities = 10/19 (52%), Positives = 11/19 (57%)

Query:   454 GTSIPYWIVKNSWGSDWGE 472
             G    +WI  NSWG  WGE
Sbjct:   308 GRKEKFWIAANSWGKSWGE 326


>FB|FBgn0030521 [details] [associations]
            symbol:CtsB1 "Cathepsin B1" species:7227 "Drosophila
            melanogaster" [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=ISS] [GO:0035071 "salivary gland cell autophagic cell
            death" evidence=IEP] [GO:0048102 "autophagic cell death"
            evidence=IEP] [GO:0006508 "proteolysis" evidence=IEA] [GO:0050790
            "regulation of catalytic activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR012599 InterPro:IPR013128 InterPro:IPR015643
            Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 EMBL:AE014298 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0050790 GO:GO:0035071
            GO:GO:0004197 MEROPS:C01.060 eggNOG:NOG315657
            GeneTree:ENSGT00560000076599 KO:K01363 PANTHER:PTHR12411:SF16
            HSSP:P07688 EMBL:AY060640 RefSeq:NP_572920.1 UniGene:Dm.3926
            SMR:Q9VY87 IntAct:Q9VY87 MINT:MINT-932864 STRING:Q9VY87
            EnsemblMetazoa:FBtr0073838 GeneID:32341 KEGG:dme:Dmel_CG10992
            UCSC:CG10992-RA FlyBase:FBgn0030521 InParanoid:Q9VY87 OMA:TEGHIRR
            OrthoDB:EOG48W9HM ChiTaRS:CG10992 GenomeRNAi:32341 NextBio:778020
            Uniprot:Q9VY87
        Length = 340

 Score = 131 (51.2 bits), Expect = 3.6e-12, Sum P(2) = 3.6e-12
 Identities = 39/110 (35%), Positives = 51/110 (46%)

Query:   294 DDLPEAFDWRAEG----VISKVKEQGKCACCWAFSAVGVVEAMHAIQ-GNSLT-ELSVQQ 347
             D+LPE FD R +      I ++++QG C  CWAF AV  +     I  G  +    S   
Sbjct:    85 DELPEEFDSRKQWPNCPTIGEIRDQGSCGSCWAFGAVEAMSDRVCIHSGGKVNFHFSADD 144

Query:   348 LVDCDMSNG-GCNGGRMDDALQY-----IIDNGGVVSDQAY-PYKASESE 390
             LV C  + G GCNGG    A  Y     I+  G   S+Q   PY+ S  E
Sbjct:   145 LVSCCHTCGFGCNGGFPGAAWSYWTRKGIVSGGPYGSNQGCRPYEISPCE 194

 Score = 108 (43.1 bits), Expect = 3.6e-12, Sum P(2) = 3.6e-12
 Identities = 27/79 (34%), Positives = 40/79 (50%)

Query:   576 EMKKWVATRGPLSVGMNA-NGLFYYSGGVIDLNQRLCNPKAQNHALIIVGYGE--EEKKD 632
             E+++ + T GP+         L  Y  GV    Q     +   HA+ I+G+G   EEK  
Sbjct:   244 EIQEEIMTNGPVEGAFTVYEDLILYKDGVY---QHEHGKELGGHAIRILGWGVWGEEK-- 298

Query:   633 GTSIPYWIVKNSWGSDWGE 651
                IPYW++ NSW +DWG+
Sbjct:   299 ---IPYWLIGNSWNTDWGD 314

 Score = 93 (37.8 bits), Expect = 1.7e-09, Sum P(3) = 1.7e-09
 Identities = 20/43 (46%), Positives = 26/43 (60%)

Query:   146 DDLPEAFDWRAEG----VISKVKEQGKCACCWAFSAVGVVEAM 184
             D+LPE FD R +      I ++++QG C  CWAF   G VEAM
Sbjct:    85 DELPEEFDSRKQWPNCPTIGEIRDQGSCGSCWAF---GAVEAM 124

 Score = 79 (32.9 bits), Expect = 3.6e-09, Sum P(2) = 3.6e-09
 Identities = 10/16 (62%), Positives = 14/16 (87%)

Query:   457 IPYWIVKNSWGSDWGE 472
             IPYW++ NSW +DWG+
Sbjct:   299 IPYWLIGNSWNTDWGD 314

 Score = 53 (23.7 bits), Expect = 1.7e-09, Sum P(3) = 1.7e-09
 Identities = 19/54 (35%), Positives = 25/54 (46%)

Query:   502 ATEKLVDCDMSNG-GCNGGRMDDALQY-----IIDNGGVVSDQAY-PYKASESE 548
             + + LV C  + G GCNGG    A  Y     I+  G   S+Q   PY+ S  E
Sbjct:   141 SADDLVSCCHTCGFGCNGGFPGAAWSYWTRKGIVSGGPYGSNQGCRPYEISPCE 194


>UNIPROTKB|E1BTI7 [details] [associations]
            symbol:TINAG "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0005044 "scavenger receptor activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0006955 "immune
            response" evidence=IEA] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0030247 "polysaccharide binding"
            evidence=IEA] [GO:0005604 "basement membrane" evidence=IEA]
            [GO:0007155 "cell adhesion" evidence=IEA] InterPro:IPR000668
            InterPro:IPR001212 InterPro:IPR013128 Pfam:PF00112 PROSITE:PS50958
            SMART:SM00201 SMART:SM00645 GO:GO:0006955 GO:GO:0030247
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411
            PROSITE:PS00640 GO:GO:0007155 GO:GO:0005604 GO:GO:0005044
            GeneTree:ENSGT00560000076599 CTD:27283 OMA:WGQLTSS
            EMBL:AADN02002720 EMBL:AADN02002721 IPI:IPI00581566
            RefSeq:XP_419905.3 UniGene:Gga.11215 Ensembl:ENSGALT00000026295
            GeneID:421888 KEGG:gga:421888 Uniprot:E1BTI7
        Length = 467

 Score = 132 (51.5 bits), Expect = 5.5e-12, Sum P(2) = 5.5e-12
 Identities = 31/74 (41%), Positives = 40/74 (54%)

Query:   313 EQGKCACCWAFSAVGVVEAMHAIQGNS-LTE-LSVQQLVDCDMSNG-GCNGGRMDDALQY 369
             +Q  C   WAFS   V      I  +  +T+ LSVQ L+ CD  N  GCNGG +D A +Y
Sbjct:   241 DQRNCGASWAFSTASVAADRITIHSDGQITDNLSVQNLISCDTGNQRGCNGGSIDGAWRY 300

Query:   370 IIDNGGVVSDQAYP 383
             +  +G VVS   YP
Sbjct:   301 LTTHG-VVSYACYP 313

 Score = 110 (43.8 bits), Expect = 5.5e-12, Sum P(2) = 5.5e-12
 Identities = 22/80 (27%), Positives = 40/80 (50%)

Query:   573 EEEEMKKWVATRGPLSVGMNA-NGLFYYSGGVIDLNQRLCNPKAQNHALIIVGYGEEEKK 631
             +E ++ + +  +GP+   M      F Y  G+   + +    K + H++ ++G+G    K
Sbjct:   365 KETDIMEEIMAKGPVQAIMKVYEDFFLYKEGIYRHSYK-AGSKWKTHSVKLLGWGSLPGK 423

Query:   632 DGTSIPYWIVKNSWGSDWGE 651
             +G    +WI  NSWG  WGE
Sbjct:   424 NGQKQKFWIAANSWGKYWGE 443

 Score = 91 (37.1 bits), Expect = 1.8e-08, Sum P(3) = 1.8e-08
 Identities = 19/42 (45%), Positives = 26/42 (61%)

Query:   501 LATEKLVDCDMSNG-GCNGGRMDDALQYIIDNGGVVSDQAYP 541
             L+ + L+ CD  N  GCNGG +D A +Y+  +G VVS   YP
Sbjct:   273 LSVQNLISCDTGNQRGCNGGSIDGAWRYLTTHG-VVSYACYP 313

 Score = 64 (27.6 bits), Expect = 3.0e-07, Sum P(2) = 3.0e-07
 Identities = 10/19 (52%), Positives = 11/19 (57%)

Query:   454 GTSIPYWIVKNSWGSDWGE 472
             G    +WI  NSWG  WGE
Sbjct:   425 GQKQKFWIAANSWGKYWGE 443

 Score = 49 (22.3 bits), Expect = 1.8e-08, Sum P(3) = 1.8e-08
 Identities = 13/37 (35%), Positives = 18/37 (48%)

Query:   165 EQGKCACCWAFSAVGVVEAMHAIQGNN-LTE-LSVQH 199
             +Q  C   WAFS   V      I  +  +T+ LSVQ+
Sbjct:   241 DQRNCGASWAFSTASVAADRITIHSDGQITDNLSVQN 277


>UNIPROTKB|Q9UJW2 [details] [associations]
            symbol:TINAG "Tubulointerstitial nephritis antigen"
            species:9606 "Homo sapiens" [GO:0005044 "scavenger receptor
            activity" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0006955 "immune response" evidence=IEA] [GO:0030247
            "polysaccharide binding" evidence=IEA] [GO:0007155 "cell adhesion"
            evidence=IDA] [GO:0005604 "basement membrane" evidence=IDA]
            [GO:0000166 "nucleotide binding" evidence=TAS] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=TAS]
            InterPro:IPR000668 InterPro:IPR001212 InterPro:IPR013128
            Pfam:PF00112 Pfam:PF01033 PROSITE:PS00524 PROSITE:PS50958
            SMART:SM00201 SMART:SM00645 GO:GO:0006955 EMBL:CH471081
            GO:GO:0000166 GO:GO:0030247 GO:GO:0006508 InterPro:IPR025661
            PANTHER:PTHR12411 PROSITE:PS00640 GO:GO:0007155 GO:GO:0005604
            GO:GO:0004197 GO:GO:0005044 EMBL:AL359380 MEROPS:C01.973 CTD:27283
            eggNOG:NOG310046 HOGENOM:HOG000241342 HOVERGEN:HBG053961
            OMA:WGQLTSS EMBL:AB022277 EMBL:AF195116 EMBL:AF195117 EMBL:AK312918
            EMBL:AL589946 IPI:IPI00099386 IPI:IPI00478705 PIR:JC7189
            RefSeq:NP_055279.3 UniGene:Hs.127011 ProteinModelPortal:Q9UJW2
            SMR:Q9UJW2 IntAct:Q9UJW2 STRING:Q9UJW2 PhosphoSite:Q9UJW2
            DMDM:212276468 PRIDE:Q9UJW2 DNASU:27283 Ensembl:ENST00000259782
            GeneID:27283 KEGG:hsa:27283 UCSC:uc003pcj.2 GeneCards:GC06P054220
            H-InvDB:HIX0025004 HGNC:HGNC:14599 HPA:HPA035427 MIM:606749
            neXtProt:NX_Q9UJW2 PharmGKB:PA37905 InParanoid:Q9UJW2
            PhylomeDB:Q9UJW2 GenomeRNAi:27283 NextBio:50212 ArrayExpress:Q9UJW2
            Bgee:Q9UJW2 CleanEx:HS_TINAG Genevestigator:Q9UJW2
            GermOnline:ENSG00000137251 Uniprot:Q9UJW2
        Length = 476

 Score = 132 (51.5 bits), Expect = 5.8e-12, Sum P(2) = 5.8e-12
 Identities = 41/112 (36%), Positives = 51/112 (45%)

Query:   289 SLRHGDDLPEAF--DWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQ--GNSLTELS 344
             SL    DLPE F   ++  G      +Q  CA  WAFS   V     AIQ  G     LS
Sbjct:   210 SLPATTDLPEFFVASYKWPGWTHGPLDQKNCAASWAFSTASVAADRIAIQSKGRYTANLS 269

Query:   345 VQQLVDCDMSNG-GCNGGRMDDALQYIIDNGGVVSDQAYP-YK-ASESERGC 393
              Q L+ C   N  GCN G +D A  Y+   G +VS   YP +K  + +  GC
Sbjct:   270 PQNLISCCAKNRHGCNSGSIDRAWWYLRKRG-LVSHACYPLFKDQNATNNGC 320

 Score = 110 (43.8 bits), Expect = 5.8e-12, Sum P(2) = 5.8e-12
 Identities = 29/90 (32%), Positives = 39/90 (43%)

Query:   568 RIPYGEEEEMKKWVATRGPLSVGMNAN-GLFYYSGG----VIDLNQRLCN-PKAQNHALI 621
             R+   E E MK+ +   GP+   M      F+Y  G    V   N+      K Q HA+ 
Sbjct:   356 RVSSNETEIMKE-IMQNGPVQAIMQVREDFFHYKTGIYRHVTSTNKESEKYRKLQTHAVK 414

Query:   622 IVGYGEEEKKDGTSIPYWIVKNSWGSDWGE 651
             + G+G      G    +WI  NSWG  WGE
Sbjct:   415 LTGWGTLRGAQGQKEKFWIAANSWGKSWGE 444

 Score = 73 (30.8 bits), Expect = 1.9e-08, Sum P(3) = 1.9e-08
 Identities = 19/56 (33%), Positives = 29/56 (51%)

Query:   499 SRLATEKLVDCDMSNG-GCNGGRMDDALQYIIDNGGVVSDQAYP-YK-ASESERGC 551
             + L+ + L+ C   N  GCN G +D A  Y+   G +VS   YP +K  + +  GC
Sbjct:   266 ANLSPQNLISCCAKNRHGCNSGSIDRAWWYLRKRG-LVSHACYPLFKDQNATNNGC 320

 Score = 67 (28.6 bits), Expect = 1.9e-08, Sum P(3) = 1.9e-08
 Identities = 23/63 (36%), Positives = 27/63 (42%)

Query:   141 SLRHGDDLPEAF--DWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQ--GNNLTELS 196
             SL    DLPE F   ++  G      +Q  CA  WAFS   V     AIQ  G     LS
Sbjct:   210 SLPATTDLPEFFVASYKWPGWTHGPLDQKNCAASWAFSTASVAADRIAIQSKGRYTANLS 269

Query:   197 VQH 199
              Q+
Sbjct:   270 PQN 272

 Score = 67 (28.6 bits), Expect = 1.6e-07, Sum P(2) = 1.6e-07
 Identities = 10/19 (52%), Positives = 11/19 (57%)

Query:   454 GTSIPYWIVKNSWGSDWGE 472
             G    +WI  NSWG  WGE
Sbjct:   426 GQKEKFWIAANSWGKSWGE 444


>RGD|1359482 [details] [associations]
            symbol:Tinag "tubulointerstitial nephritis antigen"
            species:10116 "Rattus norvegicus" [GO:0005044 "scavenger receptor
            activity" evidence=IEA] [GO:0005604 "basement membrane"
            evidence=ISO] [GO:0006508 "proteolysis" evidence=IEA] [GO:0006955
            "immune response" evidence=IEA] [GO:0007155 "cell adhesion"
            evidence=ISO] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0030247 "polysaccharide binding" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR001212 InterPro:IPR013128
            Pfam:PF00112 Pfam:PF01033 PROSITE:PS00524 PROSITE:PS50958
            SMART:SM00201 SMART:SM00645 RGD:1359482 GO:GO:0006955 GO:GO:0030247
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411
            PROSITE:PS00640 GO:GO:0007155 EMBL:CH473954 GO:GO:0005604
            GO:GO:0005044 MEROPS:C01.973 CTD:27283 eggNOG:NOG310046
            HOGENOM:HOG000241342 HOVERGEN:HBG053961 OMA:WGQLTSS
            OrthoDB:EOG47PX5P EMBL:BC081887 IPI:IPI00370427
            RefSeq:NP_001005549.1 UniGene:Rn.43851 STRING:Q66HF6
            Ensembl:ENSRNOT00000041567 GeneID:300846 KEGG:rno:300846
            UCSC:RGD:1359482 InParanoid:Q66HF6 NextBio:647630
            Genevestigator:Q66HF6 Uniprot:Q66HF6
        Length = 475

 Score = 127 (49.8 bits), Expect = 7.8e-12, Sum P(2) = 7.8e-12
 Identities = 38/106 (35%), Positives = 48/106 (45%)

Query:   295 DLPEAF--DWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQ--GNSLTELSVQQLVD 350
             DLPE F   ++  G      +Q  CA  WAFS   V     AIQ  G     LS Q L+ 
Sbjct:   215 DLPEVFIASYKWPGWTHGPLDQKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQNLIS 274

Query:   351 CDMSNG-GCNGGRMDDALQYIIDNGGVVSDQAYP-YKA-SESERGC 393
             C   N  GCN G +D A  + +   G+VS   YP +K  S +   C
Sbjct:   275 CCAKNRHGCNSGSIDRAW-WFLRKRGLVSHACYPLFKEQSTNNNSC 319

 Score = 114 (45.2 bits), Expect = 7.8e-12, Sum P(2) = 7.8e-12
 Identities = 29/90 (32%), Positives = 41/90 (45%)

Query:   568 RIPYGEEEEMKKWVATRGPLSVGMNAN-GLFYYSGG----VIDLNQRLCN-PKAQNHALI 621
             RI   E E M++ +   GP+   M  +   FYY  G    V+  N+      K + HA+ 
Sbjct:   355 RISSNETEIMRE-IIQNGPVQAIMQVHEDFFYYKTGIYRHVVSTNEEPEKYRKLRTHAVK 413

Query:   622 IVGYGEEEKKDGTSIPYWIVKNSWGSDWGE 651
             + G+G      G    +WI  NSWG  WGE
Sbjct:   414 LTGWGTLRGAQGKKEKFWIAANSWGKSWGE 443

 Score = 68 (29.0 bits), Expect = 2.2e-08, Sum P(3) = 2.2e-08
 Identities = 21/57 (36%), Positives = 25/57 (43%)

Query:   147 DLPEAF--DWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQ--GNNLTELSVQH 199
             DLPE F   ++  G      +Q  CA  WAFS   V     AIQ  G     LS Q+
Sbjct:   215 DLPEVFIASYKWPGWTHGPLDQKNCAASWAFSTASVAADRIAIQSKGRYTANLSPQN 271

 Score = 67 (28.6 bits), Expect = 2.2e-08, Sum P(3) = 2.2e-08
 Identities = 18/56 (32%), Positives = 28/56 (50%)

Query:   499 SRLATEKLVDCDMSNG-GCNGGRMDDALQYIIDNGGVVSDQAYP-YKA-SESERGC 551
             + L+ + L+ C   N  GCN G +D A  + +   G+VS   YP +K  S +   C
Sbjct:   265 ANLSPQNLISCCAKNRHGCNSGSIDRAW-WFLRKRGLVSHACYPLFKEQSTNNNSC 319

 Score = 67 (28.6 bits), Expect = 1.7e-07, Sum P(3) = 1.7e-07
 Identities = 10/19 (52%), Positives = 11/19 (57%)

Query:   454 GTSIPYWIVKNSWGSDWGE 472
             G    +WI  NSWG  WGE
Sbjct:   425 GKKEKFWIAANSWGKSWGE 443

 Score = 45 (20.9 bits), Expect = 1.7e-07, Sum P(3) = 1.7e-07
 Identities = 12/37 (32%), Positives = 18/37 (48%)

Query:   410 RIPYGEEEEMKKWVATRGPLSVGMNAN-GLFYYSGGV 445
             RI   E E M++ +   GP+   M  +   FYY  G+
Sbjct:   355 RISSNETEIMRE-IIQNGPVQAIMQVHEDFFYYKTGI 390


>DICTYBASE|DDB_G0283401 [details] [associations]
            symbol:ctsZ "cathepsin Z precursor" species:44689
            "Dictyostelium discoideum" [GO:0005615 "extracellular space"
            evidence=IDA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0044351
            "macropinocytosis" evidence=RCA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            dictyBase:DDB_G0283401 GO:GO:0005615 GenomeReviews:CM000153_GR
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411
            PROSITE:PS00640 EMBL:AAFI02000055 KO:K08568 OMA:QCGTCTE
            eggNOG:NOG275763 RefSeq:XP_639036.1 ProteinModelPortal:Q54R55
            IntAct:Q54R55 MEROPS:C01.A60 PRIDE:Q54R55
            EnsemblProtists:DDB0233836 GeneID:8624061 KEGG:ddi:DDB_G0283401
            InParanoid:Q54R55 Uniprot:Q54R55
        Length = 296

 Score = 171 (65.3 bits), Expect = 8.1e-12, Sum P(2) = 8.1e-12
 Identities = 49/162 (30%), Positives = 78/162 (48%)

Query:   501 LATEKLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKAS----ESERGCLXXXX 556
             +A + L+DC+   G C+GG   DA  +I +NG +V +   PY+A     E    C     
Sbjct:   114 VAPQHLIDCN-GGGTCDGGDPGDAFAFINENG-IVDETCKPYQAKNLPDECSPACKTCNP 171

Query:   557 XXXXXXXXXYSRIP---YGEEEEMKKWVA---TRGPLSVGMNANG-LFYYSGGVIDLNQR 609
                      ++ I    YG     K  +A    RGP++  ++A   L  Y+ G+    + 
Sbjct:   172 DGTCQAIPVHTNITVTEYGSVRGAKDMMAEIYARGPIACSIDATSKLEAYTSGIF--KEF 229

Query:   610 LCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGE 651
               +P   NH + ++G+G ++     S PYWIV+NSWGS +GE
Sbjct:   230 KLDP-LPNHIISVIGWGVQD-----STPYWIVRNSWGSYYGE 265

 Score = 134 (52.2 bits), Expect = 7.1e-06, P = 7.1e-06
 Identities = 46/171 (26%), Positives = 78/171 (45%)

Query:   295 DLPEAFDWR-AEGV--ISKVKEQG---KCACCWAFSAVGVVEAMHAIQGNS-LTELSV-- 345
             ++P+++DWR   GV  ++  + Q     C  CWAF++   +     IQ  +   +++V  
Sbjct:    57 EVPQSWDWRNVSGVNYLTMNRNQHIPQYCGGCWAFASTSSISDRIKIQRKAAFPDVNVAP 116

Query:   346 QQLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKAS----ESERGCLXXXXXXX 401
             Q L+DC+   G C+GG   DA  +I +NG +V +   PY+A     E    C        
Sbjct:   117 QHLIDCN-GGGTCDGGDPGDAFAFINENG-IVDETCKPYQAKNLPDECSPACKTCNPDGT 174

Query:   402 XXXXXXYSRIP---YGEEEEMKKWVA---TRGPLSVGMNANG-LFYYSGGV 445
                   ++ I    YG     K  +A    RGP++  ++A   L  Y+ G+
Sbjct:   175 CQAIPVHTNITVTEYGSVRGAKDMMAEIYARGPIACSIDATSKLEAYTSGI 225

 Score = 56 (24.8 bits), Expect = 8.1e-12, Sum P(2) = 8.1e-12
 Identities = 14/48 (29%), Positives = 25/48 (52%)

Query:   147 DLPEAFDWR-AEGV--ISKVKEQG---KCACCWAFSAVGVVEAMHAIQ 188
             ++P+++DWR   GV  ++  + Q     C  CWAF++   +     IQ
Sbjct:    57 EVPQSWDWRNVSGVNYLTMNRNQHIPQYCGGCWAFASTSSISDRIKIQ 104


>WB|WBGene00010204 [details] [associations]
            symbol:F57F5.1 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0009792 "embryo development
            ending in birth or egg hatching" evidence=IMP] [GO:0040007 "growth"
            evidence=IMP] [GO:0002119 "nematode larval development"
            evidence=IMP] [GO:0040011 "locomotion" evidence=IMP] [GO:0006898
            "receptor-mediated endocytosis" evidence=IMP] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0009792 GO:GO:0006898
            GO:GO:0040007 GO:GO:0002119 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0040011
            EMBL:Z75953 GeneTree:ENSGT00560000076599 HOGENOM:HOG000241341
            PANTHER:PTHR12411:SF16 RefSeq:NP_506011.2 ProteinModelPortal:Q20950
            SMR:Q20950 DIP:DIP-24447N IntAct:Q20950 MINT:MINT-211137
            STRING:Q20950 MEROPS:C01.A42 EnsemblMetazoa:F57F5.1 GeneID:179645
            KEGG:cel:CELE_F57F5.1 UCSC:F57F5.1 CTD:179645 WormBase:F57F5.1
            OMA:ADDINAC Uniprot:Q20950
        Length = 351

 Score = 119 (46.9 bits), Expect = 8.5e-12, Sum P(2) = 8.5e-12
 Identities = 28/77 (36%), Positives = 40/77 (51%)

Query:   576 EMKKWVATRGPLSVGMNANGLF-YYSGGVIDLNQRLCNPKAQNHALIIVGYGEEEKKDGT 634
             E++K + T GP+ V       F +YSGGV              HA+ ++G+G +   +GT
Sbjct:   257 EIQKEIMTHGPVEVAFTVYEDFEHYSGGVY---VHTAGASLGGHAVKMLGWGVD---NGT 310

Query:   635 SIPYWIVKNSWGSDWGE 651
               PYW+  NSW  DWGE
Sbjct:   311 --PYWLCANSWNEDWGE 325

 Score = 118 (46.6 bits), Expect = 8.5e-12, Sum P(2) = 8.5e-12
 Identities = 39/125 (31%), Positives = 54/125 (43%)

Query:   281 EMRAFQFNSLRHGDD-LPEAFD----WRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAI 335
             E R F+       D  +P++FD    W     ISK+++Q  C  CWA SA   +     I
Sbjct:    81 EYRVFEMTHPEVEDAAVPDSFDSRTAWPNCPSISKIRDQSSCGSCWAVSAAETISDRICI 140

Query:   336 QGNSLTELSVQQ---LVDCDMSNG-GCNGGRMDDALQYIIDNG----GVVSDQA----YP 383
               N+ T LS+        C M  G GCNGG   +A ++ +  G    G   D+     YP
Sbjct:   141 ASNAKTILSISADDINACCGMVCGNGCNGGYPIEAWRHYVKKGYVTGGSYQDKTGCKPYP 200

Query:   384 YKASE 388
             Y   E
Sbjct:   201 YPPCE 205

 Score = 90 (36.7 bits), Expect = 7.4e-10, Sum P(3) = 7.4e-10
 Identities = 24/74 (32%), Positives = 33/74 (44%)

Query:   133 EMRAFQFNSLRHGDD-LPEAFD----WRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAI 187
             E R F+       D  +P++FD    W     ISK+++Q  C  CWA SA   +     I
Sbjct:    81 EYRVFEMTHPEVEDAAVPDSFDSRTAWPNCPSISKIRDQSSCGSCWAVSAAETISDRICI 140

Query:   188 QGNNLTELSVQHHD 201
               N  T LS+   D
Sbjct:   141 ASNAKTILSISADD 154

 Score = 90 (36.7 bits), Expect = 8.6e-09, Sum P(2) = 8.6e-09
 Identities = 24/69 (34%), Positives = 32/69 (46%)

Query:   418 EMKKWVATRGPLSVGMNANGLF-YYSGGVI--DLNQRLYGTSI-----------PYWIVK 463
             E++K + T GP+ V       F +YSGGV        L G ++           PYW+  
Sbjct:   257 EIQKEIMTHGPVEVAFTVYEDFEHYSGGVYVHTAGASLGGHAVKMLGWGVDNGTPYWLCA 316

Query:   464 NSWGSDWGE 472
             NSW  DWGE
Sbjct:   317 NSWNEDWGE 325

 Score = 48 (22.0 bits), Expect = 7.4e-10, Sum P(3) = 7.4e-10
 Identities = 16/47 (34%), Positives = 21/47 (44%)

Query:   509 CDMSNG-GCNGGRMDDALQYIIDNG----GVVSDQA----YPYKASE 546
             C M  G GCNGG   +A ++ +  G    G   D+     YPY   E
Sbjct:   159 CGMVCGNGCNGGYPIEAWRHYVKKGYVTGGSYQDKTGCKPYPYPPCE 205


>UNIPROTKB|P07858 [details] [associations]
            symbol:CTSB "Cathepsin B" species:9606 "Homo sapiens"
            [GO:0050790 "regulation of catalytic activity" evidence=IEA]
            [GO:0005739 "mitochondrion" evidence=IEA] [GO:0042470 "melanosome"
            evidence=IEA] [GO:0005515 "protein binding" evidence=IPI]
            [GO:0042981 "regulation of apoptotic process" evidence=TAS]
            [GO:0006508 "proteolysis" evidence=IDA] [GO:0005764 "lysosome"
            evidence=IDA] [GO:0097067 "cellular response to thyroid hormone
            stimulus" evidence=IEP] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IDA] [GO:0048471 "perinuclear region of
            cytoplasm" evidence=IDA] [GO:0005622 "intracellular" evidence=TAS]
            [GO:0036021 "endolysosome lumen" evidence=TAS] [GO:0045087 "innate
            immune response" evidence=TAS] [GO:0008233 "peptidase activity"
            evidence=IDA] [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=IDA] [GO:0005615 "extracellular space" evidence=ISS]
            [GO:0005730 "nucleolus" evidence=IDA] [GO:0043231 "intracellular
            membrane-bounded organelle" evidence=IDA] InterPro:IPR000668
            InterPro:IPR012599 InterPro:IPR013128 InterPro:IPR015643
            Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 Reactome:REACT_118779 GO:GO:0005739
            GO:GO:0042470 GO:GO:0048471 Reactome:REACT_6900 GO:GO:0005615
            GO:GO:0009612 GO:GO:0009611 GO:GO:0005730 GO:GO:0042981
            GO:GO:0009897 GO:GO:0045471 GO:GO:0016324 GO:GO:0009749
            GO:GO:0006914 GO:GO:0043434 eggNOG:COG4870 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045087
            GO:GO:0050790 GO:GO:0042383 GO:GO:0014070 GO:GO:0042277
            GO:GO:0060548 GO:GO:0005901 GO:GO:0014075 GO:GO:0004197
            GO:GO:0070670 EMBL:CH471157 GO:GO:0007519 MEROPS:C01.060 CTD:1508
            HOGENOM:HOG000241341 HOVERGEN:HBG003480 KO:K01363 OMA:GYPSGAW
            OrthoDB:EOG4K6G4C BRENDA:3.4.22.1 GO:GO:0097067
            PANTHER:PTHR12411:SF16 EMBL:M14221 EMBL:L16510 EMBL:AK092070
            EMBL:AK075393 EMBL:BC010240 EMBL:BC095408 EMBL:M13230
            IPI:IPI00295741 PIR:A26498 RefSeq:NP_001899.1 RefSeq:NP_680090.1
            RefSeq:NP_680091.1 RefSeq:NP_680092.1 RefSeq:NP_680093.1
            UniGene:Hs.520898 PDB:1CSB PDB:1GMY PDB:1HUC PDB:1PBH PDB:2IPP
            PDB:2PBH PDB:3AI8 PDB:3CBJ PDB:3CBK PDB:3K9M PDB:3PBH PDBsum:1CSB
            PDBsum:1GMY PDBsum:1HUC PDBsum:1PBH PDBsum:2IPP PDBsum:2PBH
            PDBsum:3AI8 PDBsum:3CBJ PDBsum:3CBK PDBsum:3K9M PDBsum:3PBH
            ProteinModelPortal:P07858 SMR:P07858 DIP:DIP-42785N IntAct:P07858
            MINT:MINT-1397666 STRING:P07858 PhosphoSite:P07858 DMDM:68067549
            SWISS-2DPAGE:P07858 UCD-2DPAGE:P07858 PaxDb:P07858
            PeptideAtlas:P07858 PRIDE:P07858 DNASU:1508 Ensembl:ENST00000345125
            Ensembl:ENST00000353047 Ensembl:ENST00000434271
            Ensembl:ENST00000453527 Ensembl:ENST00000530640
            Ensembl:ENST00000531089 Ensembl:ENST00000533455
            Ensembl:ENST00000534510 GeneID:1508 KEGG:hsa:1508 UCSC:uc003wum.3
            GeneCards:GC08M011700 H-InvDB:HIX0007320 HGNC:HGNC:2527
            HPA:CAB000457 HPA:HPA018156 MIM:116810 neXtProt:NX_P07858
            PharmGKB:PA27027 InParanoid:P07858 PhylomeDB:P07858
            BindingDB:P07858 ChEMBL:CHEMBL4072 ChiTaRS:CTSB
            EvolutionaryTrace:P07858 GenomeRNAi:1508 NextBio:6235
            PMAP-CutDB:P07858 ArrayExpress:P07858 Bgee:P07858 CleanEx:HS_CTSB
            Genevestigator:P07858 GermOnline:ENSG00000164733 GO:GO:0036021
            Uniprot:P07858
        Length = 339

 Score = 128 (50.1 bits), Expect = 1.0e-11, Sum P(2) = 1.0e-11
 Identities = 36/106 (33%), Positives = 49/106 (46%)

Query:   296 LPEAFD----WRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNS--LTELSVQQLV 349
             LP +FD    W     I ++++QG C  CWAF AV  +     I  N+    E+S + L+
Sbjct:    80 LPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLL 139

Query:   350 DC--DMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGC 393
              C   M   GCNGG   +A  +    G +VS   Y     ES  GC
Sbjct:   140 TCCGSMCGDGCNGGYPAEAWNFWTRKG-LVSGGLY-----ESHVGC 179

 Score = 107 (42.7 bits), Expect = 1.0e-11, Sum P(2) = 1.0e-11
 Identities = 26/79 (32%), Positives = 40/79 (50%)

Query:   574 EEEMKKWVATRGPLSVGMNANGLFY-YSGGVIDLNQRLCNPKAQNHALIIVGYGEEEKKD 632
             E+++   +   GP+    +    F  Y  GV    Q +       HA+ I+G+G E   +
Sbjct:   236 EKDIMAEIYKNGPVEGAFSVYSDFLLYKSGVY---QHVTGEMMGGHAIRILGWGVE---N 289

Query:   633 GTSIPYWIVKNSWGSDWGE 651
             GT  PYW+V NSW +DWG+
Sbjct:   290 GT--PYWLVANSWNTDWGD 306

 Score = 86 (35.3 bits), Expect = 1.4e-08, Sum P(3) = 1.4e-08
 Identities = 19/58 (32%), Positives = 26/58 (44%)

Query:   148 LPEAFD----WRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQHHD 201
             LP +FD    W     I ++++QG C  CWAF AV  +     I  N    + V   D
Sbjct:    80 LPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAED 137

 Score = 77 (32.2 bits), Expect = 1.2e-08, Sum P(2) = 1.2e-08
 Identities = 10/15 (66%), Positives = 13/15 (86%)

Query:   458 PYWIVKNSWGSDWGE 472
             PYW+V NSW +DWG+
Sbjct:   292 PYWLVANSWNTDWGD 306

 Score = 52 (23.4 bits), Expect = 1.4e-08, Sum P(3) = 1.4e-08
 Identities = 18/53 (33%), Positives = 24/53 (45%)

Query:   501 LATEKLVDC--DMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGC 551
             ++ E L+ C   M   GCNGG   +A  +    G +VS   Y     ES  GC
Sbjct:   133 VSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKG-LVSGGLY-----ESHVGC 179


>UNIPROTKB|F1M8U6 [details] [associations]
            symbol:F1M8U6 "Uncharacterized protein" species:10116
            "Rattus norvegicus" [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00639
            IPI:IPI00782277 Ensembl:ENSRNOT00000055587 OMA:EREIAAW
            Uniprot:F1M8U6
        Length = 163

 Score = 169 (64.5 bits), Expect = 1.1e-11, P = 1.1e-11
 Identities = 44/149 (29%), Positives = 72/149 (48%)

Query:   504 EKLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXX 563
             ++L+DCD  +  C GG   +A   I + GG+ ++  Y Y+        L           
Sbjct:     2 KELLDCDKMDKACLGGLPSNAYTAIKNLGGLETEDGYGYEGHFQACNFLAQMTKVYISDS 61

Query:   564 XXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLFYYSGGVIDLNQRLCNPKAQNHALIIV 623
                S+     E  +   +A +G +SV +     F+  G V  L   LC+P   +H++++V
Sbjct:    62 VELSQ----NESSIAALLAQKGLISVAIMQ---FHRYGTVHPLRP-LCSPGFTDHSVLLV 113

Query:   624 GYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
             GYG   +   ++IPYW +KN  GSDWGE+
Sbjct:   114 GYGNRPR---SNIPYWAIKNIQGSDWGEE 139

 Score = 111 (44.1 bits), Expect = 0.00022, P = 0.00022
 Identities = 39/142 (27%), Positives = 62/142 (43%)

Query:   346 QQLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXX 405
             ++L+DCD  +  C GG   +A   I + GG+ ++  Y Y+        L           
Sbjct:     2 KELLDCDKMDKACLGGLPSNAYTAIKNLGGLETEDGYGYEGHFQACNFLAQMTKVYISDS 61

Query:   406 XXYSRIPYGEEEEMKKWVATRGPLSVGM---NANGLFY-----YSGGVIDLNQRL--YG- 454
                S+     E  +   +A +G +SV +   +  G  +      S G  D +  L  YG 
Sbjct:    62 VELSQ----NESSIAALLAQKGLISVAIMQFHRYGTVHPLRPLCSPGFTDHSVLLVGYGN 117

Query:   455 ---TSIPYWIVKNSWGSDWGEK 473
                ++IPYW +KN  GSDWGE+
Sbjct:   118 RPRSNIPYWAIKNIQGSDWGEE 139


>UNIPROTKB|E2RNP9 [details] [associations]
            symbol:TINAG "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0007155 "cell adhesion" evidence=IEA]
            [GO:0005604 "basement membrane" evidence=IEA] [GO:0030247
            "polysaccharide binding" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0006955 "immune response"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0005044
            "scavenger receptor activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR001212 InterPro:IPR013128 Pfam:PF00112 Pfam:PF01033
            PROSITE:PS00524 PROSITE:PS50958 SMART:SM00201 SMART:SM00645
            GO:GO:0006955 GO:GO:0030247 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 PANTHER:PTHR12411 PROSITE:PS00640 GO:GO:0007155
            GO:GO:0005604 GO:GO:0005044 GeneTree:ENSGT00560000076599 CTD:27283
            OMA:WGQLTSS EMBL:AAEX03008403 RefSeq:XP_538969.2
            ProteinModelPortal:E2RNP9 Ensembl:ENSCAFT00000003638 GeneID:481848
            KEGG:cfa:481848 NextBio:20856579 Uniprot:E2RNP9
        Length = 476

 Score = 129 (50.5 bits), Expect = 1.2e-11, Sum P(2) = 1.2e-11
 Identities = 40/112 (35%), Positives = 50/112 (44%)

Query:   289 SLRHGDDLPEAF--DWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNS--LTELS 344
             SL    DLPE F   ++  G      +Q  CA  WAFS   V     AIQ N      LS
Sbjct:   210 SLPATTDLPEFFIASYKWPGWTHGPLDQKNCAASWAFSTASVAADRIAIQSNGRYTANLS 269

Query:   345 VQQLVDCDMSNG-GCNGGRMDDALQYIIDNGGVVSDQAYP-YKASESER-GC 393
              Q L+ C   N  GCN G +D A  + +   G+VS   YP +K   +   GC
Sbjct:   270 PQNLISCCAKNRHGCNSGSIDRAW-WFLRKRGLVSHACYPLFKDQNATNYGC 320

 Score = 110 (43.8 bits), Expect = 1.2e-11, Sum P(2) = 1.2e-11
 Identities = 28/90 (31%), Positives = 41/90 (45%)

Query:   568 RIPYGEEEEMKKWVATRGPLSVGMNAN-GLFYYSGGVID----LNQRLCN-PKAQNHALI 621
             R+   E E MK+ +   GP+   M  +   F+Y  G+       N+      K Q HA+ 
Sbjct:   356 RVSSNETEIMKE-IMQNGPVQAIMQVHEDFFHYKTGIYRHITRTNEESRKYQKLQTHAVK 414

Query:   622 IVGYGEEEKKDGTSIPYWIVKNSWGSDWGE 651
             + G+G  +   G    +WI  NSWG  WGE
Sbjct:   415 LTGWGTLKGAQGQKEKFWIAANSWGISWGE 444

 Score = 69 (29.3 bits), Expect = 3.8e-08, Sum P(3) = 3.8e-08
 Identities = 20/52 (38%), Positives = 23/52 (44%)

Query:   141 SLRHGDDLPEAF--DWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGN 190
             SL    DLPE F   ++  G      +Q  CA  WAFS   V     AIQ N
Sbjct:   210 SLPATTDLPEFFIASYKWPGWTHGPLDQKNCAASWAFSTASVAADRIAIQSN 261

 Score = 68 (29.0 bits), Expect = 3.8e-08, Sum P(3) = 3.8e-08
 Identities = 18/56 (32%), Positives = 28/56 (50%)

Query:   499 SRLATEKLVDCDMSNG-GCNGGRMDDALQYIIDNGGVVSDQAYP-YKASESER-GC 551
             + L+ + L+ C   N  GCN G +D A  + +   G+VS   YP +K   +   GC
Sbjct:   266 ANLSPQNLISCCAKNRHGCNSGSIDRAW-WFLRKRGLVSHACYPLFKDQNATNYGC 320

 Score = 65 (27.9 bits), Expect = 5.4e-07, Sum P(2) = 5.4e-07
 Identities = 10/19 (52%), Positives = 11/19 (57%)

Query:   454 GTSIPYWIVKNSWGSDWGE 472
             G    +WI  NSWG  WGE
Sbjct:   426 GQKEKFWIAANSWGISWGE 444


>UNIPROTKB|A1E295 [details] [associations]
            symbol:CTSB "Cathepsin B" species:9823 "Sus scrofa"
            [GO:0005615 "extracellular space" evidence=IDA] [GO:0042470
            "melanosome" evidence=IEA] [GO:0005764 "lysosome" evidence=IEA]
            [GO:0097067 "cellular response to thyroid hormone stimulus"
            evidence=IEA] [GO:0005739 "mitochondrion" evidence=IEA] [GO:0005730
            "nucleolus" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0050790 "regulation of catalytic activity" evidence=IEA]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR012599 InterPro:IPR013128
            InterPro:IPR015643 Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0005739 GO:GO:0042470
            GO:GO:0005615 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0050790 GO:GO:0005764 GO:GO:0004197 MEROPS:C01.060 CTD:1508
            GeneTree:ENSGT00560000076599 HOVERGEN:HBG003480 KO:K01363
            OMA:GYPSGAW GO:GO:0097067 PANTHER:PTHR12411:SF16 EMBL:EF095956
            RefSeq:NP_001090927.1 UniGene:Ssc.53773 ProteinModelPortal:A1E295
            SMR:A1E295 PRIDE:A1E295 Ensembl:ENSSSCT00000026923 GeneID:100037961
            KEGG:ssc:100037961 Uniprot:A1E295
        Length = 335

 Score = 124 (48.7 bits), Expect = 2.7e-11, Sum P(2) = 2.7e-11
 Identities = 34/106 (32%), Positives = 50/106 (47%)

Query:   296 LPEAFDWRAEG----VISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLT--ELSVQQLV 349
             LP++FD R +      I ++++QG C  CWAF AV  +     I+ N     E+S + ++
Sbjct:    80 LPKSFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRSNGRVNVEVSAEDML 139

Query:   350 DC--DMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGC 393
              C  D    GCNGG    A  +    G +VS   Y     +S  GC
Sbjct:   140 TCCGDECGDGCNGGFPSGAWNFWTKKG-LVSGGLY-----DSHVGC 179

 Score = 107 (42.7 bits), Expect = 2.7e-11, Sum P(2) = 2.7e-11
 Identities = 27/79 (34%), Positives = 39/79 (49%)

Query:   574 EEEMKKWVATRGPLSVGMNANGLFY-YSGGVIDLNQRLCNPKAQNHALIIVGYGEEEKKD 632
             E+E+   +   GP+         F  Y  GV    Q +       HA+ I+G+G E   +
Sbjct:   236 EKEIMAEIYKNGPVEGAFTVYSDFLQYKSGVY---QHVTGDLMGGHAIRILGWGVE---N 289

Query:   633 GTSIPYWIVKNSWGSDWGE 651
             GT  PYW+V NSW +DWG+
Sbjct:   290 GT--PYWLVGNSWNTDWGD 306

 Score = 93 (37.8 bits), Expect = 1.3e-08, Sum P(3) = 1.3e-08
 Identities = 19/58 (32%), Positives = 29/58 (50%)

Query:   148 LPEAFDWRAEG----VISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQHHD 201
             LP++FD R +      I ++++QG C  CWAF AV  +     I+ N    + V   D
Sbjct:    80 LPKSFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRSNGRVNVEVSAED 137

 Score = 76 (31.8 bits), Expect = 4.2e-08, Sum P(2) = 4.2e-08
 Identities = 10/15 (66%), Positives = 13/15 (86%)

Query:   458 PYWIVKNSWGSDWGE 472
             PYW+V NSW +DWG+
Sbjct:   292 PYWLVGNSWNTDWGD 306

 Score = 45 (20.9 bits), Expect = 1.3e-08, Sum P(3) = 1.3e-08
 Identities = 16/53 (30%), Positives = 23/53 (43%)

Query:   501 LATEKLVDC--DMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGC 551
             ++ E ++ C  D    GCNGG    A  +    G +VS   Y     +S  GC
Sbjct:   133 VSAEDMLTCCGDECGDGCNGGFPSGAWNFWTKKG-LVSGGLY-----DSHVGC 179


>ZFIN|ZDB-GENE-040426-2650 [details] [associations]
            symbol:ctsba "cathepsin B, a" species:7955 "Danio
            rerio" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0050790
            "regulation of catalytic activity" evidence=IEA] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0031101 "fin regeneration"
            evidence=IEP] [GO:0008233 "peptidase activity" evidence=IEA]
            [GO:0016787 "hydrolase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR012599 InterPro:IPR013128 InterPro:IPR015643
            Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 ZFIN:ZDB-GENE-040426-2650 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0050790
            GO:GO:0004197 GO:GO:0031101 MEROPS:C01.060 HOVERGEN:HBG003480
            PANTHER:PTHR12411:SF16 HSSP:P07688 EMBL:BC044517 IPI:IPI00485996
            UniGene:Dr.3374 ProteinModelPortal:Q803E4 SMR:Q803E4 STRING:Q803E4
            PRIDE:Q803E4 InParanoid:Q803E4 ArrayExpress:Q803E4 Bgee:Q803E4
            Uniprot:Q803E4
        Length = 330

 Score = 123 (48.4 bits), Expect = 3.3e-11, Sum P(2) = 3.3e-11
 Identities = 31/89 (34%), Positives = 44/89 (49%)

Query:   293 GDDLPEAFDWRAEG----VISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLT--ELSVQ 346
             G  LP+ FD R +      + ++++QG C  CWAF A   +     IQ N+    E+S Q
Sbjct:    76 GLKLPKNFDAREQWPNCPTLKEIRDQGSCGSCWAFGAAEAISDRVCIQSNAKVSVEISSQ 135

Query:   347 QLVDCDMSNG-GCNGGRMDDALQYIIDNG 374
              L+ C  S G GCNGG    A  +   +G
Sbjct:   136 DLLTCCDSCGMGCNGGYPSAAWDFWTTDG 164

 Score = 107 (42.7 bits), Expect = 3.3e-11, Sum P(2) = 3.3e-11
 Identities = 26/87 (29%), Positives = 39/87 (44%)

Query:   566 YSRIPYGEEEEMKKWVATRGPLSVGMNA-NGLFYYSGGVIDLNQRLCNPKAQNHALIIVG 624
             YS +P  +   M + +   GP+            Y  GV    Q +       HA+ I+G
Sbjct:   229 YS-VPSNQNGIMAE-LFKNGPVEAAFTVYEDFLLYKSGVY---QHMSGSALGGHAIKILG 283

Query:   625 YGEEEKKDGTSIPYWIVKNSWGSDWGE 651
             +GEE       +PYW+  NSW +DWG+
Sbjct:   284 WGEEN-----GVPYWLAANSWNTDWGD 305

 Score = 87 (35.7 bits), Expect = 2.3e-07, Sum P(2) = 2.3e-07
 Identities = 18/61 (29%), Positives = 28/61 (45%)

Query:   145 GDDLPEAFDWRAEG----VISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQHH 200
             G  LP+ FD R +      + ++++QG C  CWAF A   +     IQ N    + +   
Sbjct:    76 GLKLPKNFDAREQWPNCPTLKEIRDQGSCGSCWAFGAAEAISDRVCIQSNAKVSVEISSQ 135

Query:   201 D 201
             D
Sbjct:   136 D 136

 Score = 76 (31.8 bits), Expect = 5.2e-08, Sum P(2) = 5.2e-08
 Identities = 9/16 (56%), Positives = 13/16 (81%)

Query:   457 IPYWIVKNSWGSDWGE 472
             +PYW+  NSW +DWG+
Sbjct:   290 VPYWLAANSWNTDWGD 305


>WB|WBGene00000789 [details] [associations]
            symbol:cpz-2 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411
            PROSITE:PS00640 GeneTree:ENSGT00560000076599 KO:K08568 EMBL:Z81103
            HSSP:P80067 PIR:T23720 RefSeq:NP_506318.1 ProteinModelPortal:P92005
            SMR:P92005 STRING:P92005 MEROPS:C01.A41 PaxDb:P92005
            EnsemblMetazoa:M04G12.2 GeneID:179818 KEGG:cel:CELE_M04G12.2
            UCSC:M04G12.2 CTD:179818 WormBase:M04G12.2 eggNOG:NOG275763
            InParanoid:P92005 OMA:VEYWIAR NextBio:906990 Uniprot:P92005
        Length = 467

 Score = 137 (53.3 bits), Expect = 3.4e-11, Sum P(2) = 3.4e-11
 Identities = 41/166 (24%), Positives = 73/166 (43%)

Query:   498 LSRLATEKLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESE-----R--G 550
             +++L+ ++++DC+   G C GG + + L++    G +V +    Y+A+  E     R   
Sbjct:   274 MTQLSPQEIIDCN-GKGNCQGGEIGNVLEHAKIQG-LVEEGCNVYRATNGECNPYHRCGS 331

Query:   551 CLXXXXXXXXXXXXXYSRIPYGE---EEEMKKWVATRGPLSVGMNANGLFYYS--GGVID 605
             C              Y +  YG+    +++   +   GP++  + A   F Y    GV  
Sbjct:   332 CWPNECFSLTNYTRYYVK-DYGQVQGRDKIMSEIKKGGPIACAIGATKKFEYEYVKGVYS 390

Query:   606 LNQRLCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGE 651
                 L      NH + + G+G +E      + YWI +NSWG  WGE
Sbjct:   391 EKSDL----ESNHIISLTGWGVDEN----GVEYWIARNSWGEAWGE 428

 Score = 134 (52.2 bits), Expect = 5.2e-05, Sum P(2) = 5.2e-05
 Identities = 53/214 (24%), Positives = 88/214 (41%)

Query:   277 QTDTEMRAFQFNSLRHGDDLPEAFDWR-AEGV--ISKVKEQG---KCACCWAFSAVGVVE 330
             ++ T  R ++ +S +  +DLP  +DWR   GV   S  + Q     C  CW F   G + 
Sbjct:   203 ESKTAPREWESSSFK-SNDLPTGWDWRNVSGVNYCSPTRNQHIPVYCGSCWVFGTTGALN 261

Query:   331 AMHAI--QGN-SLTELSVQQLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKAS 387
                 +  +G   +T+LS Q+++DC+   G C GG + + L++    G +V +    Y+A+
Sbjct:   262 DRFNVARKGRWPMTQLSPQEIIDCN-GKGNCQGGEIGNVLEHAKIQG-LVEEGCNVYRAT 319

Query:   388 ESE-----R--GCLXXXXXXXXXXXXXYSRIPYGE---EEEMKKWVATRGPLSVGMNANG 437
               E     R   C              Y +  YG+    +++   +   GP++  + A  
Sbjct:   320 NGECNPYHRCGSCWPNECFSLTNYTRYYVK-DYGQVQGRDKIMSEIKKGGPIACAIGATK 378

Query:   438 LFYYS--GGVIDLNQRLYGTSIPYWIVKNSWGSD 469
              F Y    GV      L    I   I    WG D
Sbjct:   379 KFEYEYVKGVYSEKSDLESNHI---ISLTGWGVD 409

 Score = 97 (39.2 bits), Expect = 3.4e-11, Sum P(2) = 3.4e-11
 Identities = 40/184 (21%), Positives = 83/184 (45%)

Query:    32 FQTRGYLNSPVTRFLNFMRDHDKVYSSVEDLLRRHENFVTNVEKAEDYQREDSGTAVFEV 91
             F+    +N PV R+ N  +++ K+   +       E+ V  ++  ++ + + S     ++
Sbjct:    98 FELNKKVNKPVVRYPNIAKNNQKIREEIVYPADFDEHVVEILDSRKERKIDLSPMIKAKL 157

Query:    92 NK-FFDLSDSDLQQLTGLNLDST--LEDIQPSLQAPFSSN-----QTDTEMRAFQFNSLR 143
              K +++ +D  L  ++  + +S+   E+ +P L+           ++ T  R ++ +S +
Sbjct:   158 EKGYYEPNDEALVDMSSESEESSEEWEEARPYLKCGCLKKSGKVFESKTAPREWESSSFK 217

Query:   144 HGDDLPEAFDWR-AEGV--ISKVKEQG---KCACCWAFSAVGVVEAMHAI--QGN-NLTE 194
               +DLP  +DWR   GV   S  + Q     C  CW F   G +     +  +G   +T+
Sbjct:   218 -SNDLPTGWDWRNVSGVNYCSPTRNQHIPVYCGSCWVFGTTGALNDRFNVARKGRWPMTQ 276

Query:   195 LSVQ 198
             LS Q
Sbjct:   277 LSPQ 280

 Score = 74 (31.1 bits), Expect = 0.00018, Sum P(2) = 0.00018
 Identities = 14/40 (35%), Positives = 21/40 (52%)

Query:   433 MNANGLFYYSGGVIDLNQRLYGTSIPYWIVKNSWGSDWGE 472
             + +N +   +G  +D N       + YWI +NSWG  WGE
Sbjct:   395 LESNHIISLTGWGVDEN------GVEYWIARNSWGEAWGE 428

 Score = 40 (19.1 bits), Expect = 2.5e-05, Sum P(2) = 2.5e-05
 Identities = 15/43 (34%), Positives = 20/43 (46%)

Query:    73 VEKAEDYQREDSGTAVFEVNKFFDLSDSDLQQLTGLNLDSTLE 115
             V+  ED   ED  T +  V     ++D DL    G +LDS  E
Sbjct:    64 VDDMEDSSEED--TPLARV-----VNDDDLLYKKGFHLDSPFE 99


>UNIPROTKB|F1PSK8 [details] [associations]
            symbol:F1PSK8 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00639 GeneTree:ENSGT00560000076599 InterPro:IPR014882
            Pfam:PF08773 EMBL:AAEX03012741 Ensembl:ENSCAFT00000007054
            Uniprot:F1PSK8
        Length = 405

 Score = 178 (67.7 bits), Expect = 1.9e-10, P = 1.9e-10
 Identities = 62/208 (29%), Positives = 93/208 (44%)

Query:   460 WIVKNSWGSDWGEKVEDKVGSSGN-----RTRDLELT-GVLPSKLSR--LATEKLVDCDM 511
             W  +N  G+++   V ++  S G+      T  LE    +L +      L+ +++V C  
Sbjct:   177 WDWRNVRGTNFVSPVRNQAASCGSCYAFASTAMLEARIRILTNNTQTPILSPQEIVSCSQ 236

Query:   512 SNGGCNGG-RMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIP 570
                GC GG     A +Y  D G +V +  +PY  S+S   C              Y    
Sbjct:   237 YAQGCEGGFPYLIAGKYAQDFG-LVEEACFPYAGSDSP--CKPNDCFRYYSSEYYYVGGF 293

Query:   571 YG--EEEEMKKWVATRGPLSVGMNA-NGLFYYSGGVI---DLNQRLCNP-KAQNHALIIV 623
             YG   E  MK  +   GP++V     +  F+Y  G+     L     NP +  NHA+++V
Sbjct:   294 YGACNEALMKLELVRHGPMAVAFEVYDDFFHYQKGIYYHTGLRDPF-NPFELTNHAVLLV 352

Query:   624 GYGEEEKKDGTSIPYWIVKNSWGSDWGE 651
             GYG +     + + YWIVKNSWGS WGE
Sbjct:   353 GYGTDS---ASGMDYWIVKNSWGSRWGE 377

 Score = 146 (56.5 bits), Expect = 3.6e-11, Sum P(2) = 3.6e-11
 Identities = 48/160 (30%), Positives = 73/160 (45%)

Query:   296 LPEAFDWR-AEGV--ISKVKEQG-KCACCWAFSAVGVVEA-MHAIQGNSLTE-LSVQQLV 349
             LP ++DWR   G   +S V+ Q   C  C+AF++  ++EA +  +  N+ T  LS Q++V
Sbjct:   173 LPTSWDWRNVRGTNFVSPVRNQAASCGSCYAFASTAMLEARIRILTNNTQTPILSPQEIV 232

Query:   350 DCDMSNGGCNGG-RMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXY 408
              C     GC GG     A +Y  D G +V +  +PY  S+S   C              Y
Sbjct:   233 SCSQYAQGCEGGFPYLIAGKYAQDFG-LVEEACFPYAGSDSP--CKPNDCFRYYSSEYYY 289

Query:   409 SRIPYG--EEEEMKKWVATRGPLSVGMNA-NGLFYYSGGV 445
                 YG   E  MK  +   GP++V     +  F+Y  G+
Sbjct:   290 VGGFYGACNEALMKLELVRHGPMAVAFEVYDDFFHYQKGI 329

 Score = 85 (35.0 bits), Expect = 3.6e-11, Sum P(2) = 3.6e-11
 Identities = 16/24 (66%), Positives = 17/24 (70%)

Query:   453 YGTS----IPYWIVKNSWGSDWGE 472
             YGT     + YWIVKNSWGS WGE
Sbjct:   354 YGTDSASGMDYWIVKNSWGSRWGE 377

 Score = 78 (32.5 bits), Expect = 0.00092, Sum P(2) = 0.00092
 Identities = 20/62 (32%), Positives = 33/62 (53%)

Query:   148 LPEAFDWR-AEGV--ISKVKEQG-KCACCWAFSAVGVVEAMHAIQGNNLTELSVQHHDKV 203
             LP ++DWR   G   +S V+ Q   C  C+AF++  ++EA   I  NN T+  +    ++
Sbjct:   173 LPTSWDWRNVRGTNFVSPVRNQAASCGSCYAFASTAMLEARIRILTNN-TQTPILSPQEI 231

Query:   204 YS 205
              S
Sbjct:   232 VS 233


>UNIPROTKB|J9P219 [details] [associations]
            symbol:J9P219 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00639 GeneTree:ENSGT00560000076599 InterPro:IPR014882
            Pfam:PF08773 OMA:YDDFLHY EMBL:AAEX03012741
            Ensembl:ENSCAFT00000050015 Uniprot:J9P219
        Length = 406

 Score = 178 (67.7 bits), Expect = 1.9e-10, P = 1.9e-10
 Identities = 62/208 (29%), Positives = 93/208 (44%)

Query:   460 WIVKNSWGSDWGEKVEDKVGSSGN-----RTRDLELT-GVLPSKLSR--LATEKLVDCDM 511
             W  +N  G+++   V ++  S G+      T  LE    +L +      L+ +++V C  
Sbjct:   178 WDWRNVRGTNFVSPVRNQAASCGSCYAFASTAMLEARIRILTNNTQTPILSPQEIVSCSQ 237

Query:   512 SNGGCNGG-RMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIP 570
                GC GG     A +Y  D G +V +  +PY  S+S   C              Y    
Sbjct:   238 YAQGCEGGFPYLIAGKYAQDFG-LVEEACFPYAGSDSP--CKPNDCFRYYSSEYYYVGGF 294

Query:   571 YG--EEEEMKKWVATRGPLSVGMNA-NGLFYYSGGVI---DLNQRLCNP-KAQNHALIIV 623
             YG   E  MK  +   GP++V     +  F+Y  G+     L     NP +  NHA+++V
Sbjct:   295 YGACNEALMKLELVRHGPMAVAFEVYDDFFHYQKGIYYHTGLRDPF-NPFELTNHAVLLV 353

Query:   624 GYGEEEKKDGTSIPYWIVKNSWGSDWGE 651
             GYG +     + + YWIVKNSWGS WGE
Sbjct:   354 GYGTDS---ASGMDYWIVKNSWGSRWGE 378

 Score = 146 (56.5 bits), Expect = 3.7e-11, Sum P(2) = 3.7e-11
 Identities = 48/160 (30%), Positives = 73/160 (45%)

Query:   296 LPEAFDWR-AEGV--ISKVKEQG-KCACCWAFSAVGVVEA-MHAIQGNSLTE-LSVQQLV 349
             LP ++DWR   G   +S V+ Q   C  C+AF++  ++EA +  +  N+ T  LS Q++V
Sbjct:   174 LPTSWDWRNVRGTNFVSPVRNQAASCGSCYAFASTAMLEARIRILTNNTQTPILSPQEIV 233

Query:   350 DCDMSNGGCNGG-RMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXY 408
              C     GC GG     A +Y  D G +V +  +PY  S+S   C              Y
Sbjct:   234 SCSQYAQGCEGGFPYLIAGKYAQDFG-LVEEACFPYAGSDSP--CKPNDCFRYYSSEYYY 290

Query:   409 SRIPYG--EEEEMKKWVATRGPLSVGMNA-NGLFYYSGGV 445
                 YG   E  MK  +   GP++V     +  F+Y  G+
Sbjct:   291 VGGFYGACNEALMKLELVRHGPMAVAFEVYDDFFHYQKGI 330

 Score = 85 (35.0 bits), Expect = 3.7e-11, Sum P(2) = 3.7e-11
 Identities = 16/24 (66%), Positives = 17/24 (70%)

Query:   453 YGTS----IPYWIVKNSWGSDWGE 472
             YGT     + YWIVKNSWGS WGE
Sbjct:   355 YGTDSASGMDYWIVKNSWGSRWGE 378

 Score = 78 (32.5 bits), Expect = 0.00093, Sum P(2) = 0.00093
 Identities = 20/62 (32%), Positives = 33/62 (53%)

Query:   148 LPEAFDWR-AEGV--ISKVKEQG-KCACCWAFSAVGVVEAMHAIQGNNLTELSVQHHDKV 203
             LP ++DWR   G   +S V+ Q   C  C+AF++  ++EA   I  NN T+  +    ++
Sbjct:   174 LPTSWDWRNVRGTNFVSPVRNQAASCGSCYAFASTAMLEARIRILTNN-TQTPILSPQEI 232

Query:   204 YS 205
              S
Sbjct:   233 VS 234


>UNIPROTKB|F1PIF2 [details] [associations]
            symbol:CTSZ "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0060441 "epithelial tube branching involved
            in lung morphogenesis" evidence=IEA] [GO:0005783 "endoplasmic
            reticulum" evidence=IEA] [GO:0005615 "extracellular space"
            evidence=IEA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 GO:GO:0005783 GO:GO:0005615 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411 PROSITE:PS00640
            GO:GO:0060441 GeneTree:ENSGT00560000076599 OMA:QCGTCTE
            EMBL:AAEX03014054 Ensembl:ENSCAFT00000019357 Uniprot:F1PIF2
        Length = 261

 Score = 153 (58.9 bits), Expect = 6.8e-11, Sum P(2) = 6.8e-11
 Identities = 50/177 (28%), Positives = 79/177 (44%)

Query:   487 DLELTGVLPSKLSRLATEKLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASE 546
             +++  G  PS L  L+ + ++DC  + G C GG       Y  ++G +  +    Y+A +
Sbjct:    64 NIKRKGAWPSTL--LSVQHVLDC-ANAGSCEGGNDLPVWSYAHEHG-IPDETCNNYQAKD 119

Query:   547 SE----RGCLXXXXXXXXXXXXXYS--RI-PYGE---EEEMKKWVATRGPLSVGMNANG- 595
              E      C              Y+  R+  YG     E+M   +   GP+S G+ A   
Sbjct:   120 QECNKFNQCGTCTEFKECHAIQNYTLWRVGDYGSLSGREKMMAEIYANGPISCGIMATEK 179

Query:   596 LFYYSGGVIDLNQRLCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
             +  Y+GG+   +         NH + +VG+G     DGT   YWIV+NSWG  WGE+
Sbjct:   180 MVNYTGGI---HAEYQEQAYINHVISVVGWGVS---DGTE--YWIVRNSWGEPWGER 228

 Score = 99 (39.9 bits), Expect = 1.9e-07, Sum P(2) = 1.9e-07
 Identities = 31/105 (29%), Positives = 47/105 (44%)

Query:   295 DLPEAFDWR-AEGV--ISKVKEQG---KCACCWAFSAVGVVEAMHAIQGNSL---TELSV 345
             DLP+++DWR   GV   S  + Q     C  CWA  +   +     I+       T LSV
Sbjct:    19 DLPKSWDWRNVNGVNYASATRNQHIPQYCGSCWAHGSTSAMADRINIKRKGAWPSTLLSV 78

Query:   346 QQLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESE 390
             Q ++DC  + G C GG       Y  ++G +  +    Y+A + E
Sbjct:    79 QHVLDC-ANAGSCEGGNDLPVWSYAHEHG-IPDETCNNYQAKDQE 121

 Score = 93 (37.8 bits), Expect = 1.9e-07, Sum P(2) = 1.9e-07
 Identities = 24/71 (33%), Positives = 36/71 (50%)

Query:   417 EEMKKWVATRGPLSVGMNANG-LFYYSGGV-IDLNQRLY----------GTS--IPYWIV 462
             E+M   +   GP+S G+ A   +  Y+GG+  +  ++ Y          G S    YWIV
Sbjct:   158 EKMMAEIYANGPISCGIMATEKMVNYTGGIHAEYQEQAYINHVISVVGWGVSDGTEYWIV 217

Query:   463 KNSWGSDWGEK 473
             +NSWG  WGE+
Sbjct:   218 RNSWGEPWGER 228

 Score = 64 (27.6 bits), Expect = 6.8e-11, Sum P(2) = 6.8e-11
 Identities = 21/62 (33%), Positives = 28/62 (45%)

Query:   147 DLPEAFDWR-AEGV--ISKVKEQG---KCACCWAFSAVGVVEAMHAIQGNNL---TELSV 197
             DLP+++DWR   GV   S  + Q     C  CWA  +   +     I+       T LSV
Sbjct:    19 DLPKSWDWRNVNGVNYASATRNQHIPQYCGSCWAHGSTSAMADRINIKRKGAWPSTLLSV 78

Query:   198 QH 199
             QH
Sbjct:    79 QH 80


>DICTYBASE|DDB_G0288563 [details] [associations]
            symbol:DDB_G0288563 species:44689 "Dictyostelium
            discoideum" [GO:0005615 "extracellular space" evidence=IDA]
            [GO:0050790 "regulation of catalytic activity" evidence=IEA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0004197 "cysteine-type
            endopeptidase activity" evidence=IEA] [GO:0016787 "hydrolase
            activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0044351 "macropinocytosis" evidence=RCA]
            InterPro:IPR000668 InterPro:IPR012599 InterPro:IPR013128
            InterPro:IPR015643 Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 dictyBase:DDB_G0288563
            GO:GO:0005615 eggNOG:COG4870 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0050790 GO:GO:0004197
            EMBL:AAFI02000117 PANTHER:PTHR12411:SF16 RefSeq:XP_636643.1
            MEROPS:C01.A58 PRIDE:Q54IS1 EnsemblProtists:DDB0187993
            GeneID:8626689 KEGG:ddi:DDB_G0288563 InParanoid:Q54IS1 OMA:AWEYMEL
            Uniprot:Q54IS1
        Length = 314

 Score = 142 (55.0 bits), Expect = 8.7e-11, Sum P(2) = 8.7e-11
 Identities = 44/159 (27%), Positives = 73/159 (45%)

Query:   501 LATEKLVDCDM-SNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASES-----ERGCLXX 554
             L+ + LV CD+  N GC+GG    A +Y+ +  G+ +D   PY A        +R C   
Sbjct:   140 LSPQTLVACDVYGNDGCSGGIPQLAWEYM-ELKGLPTDSCVPYTAGNGTVYSCQRSC-SD 197

Query:   555 XXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLFY-YSGGVIDLNQRLCNP 613
                        ++       + +++ +   GP+   M     F  YS GV  +     + 
Sbjct:   198 SEDYSLYRAKPFTLKTCSSVQCIQENILAYGPIVGTMEVYEDFMSYSSGVYVMTPG--SS 255

Query:   614 KAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
                 HA+ IVG+G ++    + + YWIV NSWG+DWG++
Sbjct:   256 LLGGHAIKIVGWGFDQT---SQLNYWIVANSWGADWGQQ 291

 Score = 138 (53.6 bits), Expect = 3.2e-10, Sum P(2) = 3.2e-10
 Identities = 40/117 (34%), Positives = 58/117 (49%)

Query:   280 TEMRAFQFNSLRHGDDL----PEAFDWRAE--GVISKVKEQGKCACCWAFSAVGVVEAMH 333
             T+  A  F    +G++L    P +FD R +    I  +  Q +C  CWAFS+  V+    
Sbjct:    68 TKKTAAPFKLTENGEELKGSIPTSFDSRVQWPDCIHPILNQEQCGSCWAFSSSEVLSDRL 127

Query:   334 AIQGNSLTE---LSVQQLVDCDM-SNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKA 386
              I  N+ T    LS Q LV CD+  N GC+GG    A +Y+ +  G+ +D   PY A
Sbjct:   128 CIASNNKTNPGALSPQTLVACDVYGNDGCSGGIPQLAWEYM-ELKGLPTDSCVPYTA 183

 Score = 81 (33.6 bits), Expect = 8.7e-11, Sum P(2) = 8.7e-11
 Identities = 22/68 (32%), Positives = 32/68 (47%)

Query:   132 TEMRAFQFNSLRHGDDL----PEAFDWRAE--GVISKVKEQGKCACCWAFSAVGVVEAMH 185
             T+  A  F    +G++L    P +FD R +    I  +  Q +C  CWAFS+  V+    
Sbjct:    68 TKKTAAPFKLTENGEELKGSIPTSFDSRVQWPDCIHPILNQEQCGSCWAFSSSEVLSDRL 127

Query:   186 AIQGNNLT 193
              I  NN T
Sbjct:   128 CIASNNKT 135

 Score = 80 (33.2 bits), Expect = 3.2e-10, Sum P(2) = 3.2e-10
 Identities = 11/19 (57%), Positives = 16/19 (84%)

Query:   455 TSIPYWIVKNSWGSDWGEK 473
             + + YWIV NSWG+DWG++
Sbjct:   273 SQLNYWIVANSWGADWGQQ 291


>UNIPROTKB|P07688 [details] [associations]
            symbol:CTSB "Cathepsin B" species:9913 "Bos taurus"
            [GO:0042470 "melanosome" evidence=IEA] [GO:0005764 "lysosome"
            evidence=IEA] [GO:0005615 "extracellular space" evidence=IEA]
            [GO:0097067 "cellular response to thyroid hormone stimulus"
            evidence=IEA] [GO:0005739 "mitochondrion" evidence=IEA] [GO:0005730
            "nucleolus" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0050790 "regulation of catalytic activity" evidence=IEA]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR012599 InterPro:IPR013128
            InterPro:IPR015643 Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0005739 GO:GO:0042470
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0050790 GO:GO:0005764 GO:GO:0004197 EMBL:L06075 EMBL:M64620
            EMBL:U16336 EMBL:U16337 EMBL:U16338 EMBL:U16339 EMBL:U16341
            EMBL:U16342 EMBL:U16343 EMBL:BC102997 IPI:IPI00692061 PIR:S38328
            RefSeq:NP_776456.1 UniGene:Bt.393 PDB:1ITO PDB:1QDQ PDB:1SP4
            PDB:2DC6 PDB:2DC7 PDB:2DC8 PDB:2DC9 PDB:2DCA PDB:2DCB PDB:2DCC
            PDB:2DCD PDBsum:1ITO PDBsum:1QDQ PDBsum:1SP4 PDBsum:2DC6
            PDBsum:2DC7 PDBsum:2DC8 PDBsum:2DC9 PDBsum:2DCA PDBsum:2DCB
            PDBsum:2DCC PDBsum:2DCD ProteinModelPortal:P07688 SMR:P07688
            STRING:P07688 MEROPS:C01.060 PRIDE:P07688
            Ensembl:ENSBTAT00000036795 GeneID:281105 KEGG:bta:281105 CTD:1508
            eggNOG:NOG315657 GeneTree:ENSGT00560000076599 HOGENOM:HOG000241341
            HOVERGEN:HBG003480 InParanoid:P07688 KO:K01363 OMA:GYPSGAW
            OrthoDB:EOG4K6G4C BRENDA:3.4.22.1 BindingDB:P07688
            ChEMBL:CHEMBL2323 EvolutionaryTrace:P07688 NextBio:20805177
            ArrayExpress:P07688 GO:GO:0097067 PANTHER:PTHR12411:SF16
            Uniprot:P07688
        Length = 335

 Score = 119 (46.9 bits), Expect = 9.8e-11, Sum P(2) = 9.8e-11
 Identities = 31/95 (32%), Positives = 44/95 (46%)

Query:   296 LPEAFDWRAEG----VISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLT--ELSVQQLV 349
             LPE+FD R +      I ++++QG C  CWAF AV  +     I  N     E+S + ++
Sbjct:    80 LPESFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHSNGRVNVEVSAEDML 139

Query:   350 DC--DMSNGGCNGGRMDDALQYIIDNGGVVSDQAY 382
              C       GCNGG    A  +    G +VS   Y
Sbjct:   140 TCCGGECGDGCNGGFPSGAWNFWTKKG-LVSGGLY 173

 Score = 107 (42.7 bits), Expect = 9.8e-11, Sum P(2) = 9.8e-11
 Identities = 27/79 (34%), Positives = 40/79 (50%)

Query:   574 EEEMKKWVATRGPLSVGMNANGLFY-YSGGVIDLNQRLCNPKAQNHALIIVGYGEEEKKD 632
             E+E+   +   GP+    +    F  Y  GV    Q +       HA+ I+G+G E   +
Sbjct:   236 EKEIMAEIYKNGPVEGAFSVYSDFLLYKSGVY---QHVSGEIMGGHAIRILGWGVE---N 289

Query:   633 GTSIPYWIVKNSWGSDWGE 651
             GT  PYW+V NSW +DWG+
Sbjct:   290 GT--PYWLVGNSWNTDWGD 306

 Score = 96 (38.9 bits), Expect = 2.9e-08, Sum P(2) = 2.9e-08
 Identities = 20/58 (34%), Positives = 28/58 (48%)

Query:   148 LPEAFDWRAEG----VISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQHHD 201
             LPE+FD R +      I ++++QG C  CWAF AV  +     I  N    + V   D
Sbjct:    80 LPESFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHSNGRVNVEVSAED 137

 Score = 76 (31.8 bits), Expect = 1.5e-07, Sum P(2) = 1.5e-07
 Identities = 10/15 (66%), Positives = 13/15 (86%)

Query:   458 PYWIVKNSWGSDWGE 472
             PYW+V NSW +DWG+
Sbjct:   292 PYWLVGNSWNTDWGD 306


>WB|WBGene00000784 [details] [associations]
            symbol:cpr-4 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 eggNOG:NOG315657
            GeneTree:ENSGT00560000076599 HOGENOM:HOG000241341 KO:K01363
            PANTHER:PTHR12411:SF16 EMBL:L39895 EMBL:L39926 EMBL:FO081381
            PIR:T37280 RefSeq:NP_504682.1 UniGene:Cel.5404
            ProteinModelPortal:P43508 SMR:P43508 DIP:DIP-25376N
            MINT:MINT-1069892 STRING:P43508 MEROPS:C01.A34 PaxDb:P43508
            EnsemblMetazoa:F44C4.3 GeneID:179053 KEGG:cel:CELE_F44C4.3
            UCSC:F44C4.3 CTD:179053 WormBase:F44C4.3 InParanoid:P43508
            OMA:CCGFLCG NextBio:903704 Uniprot:P43508
        Length = 335

 Score = 126 (49.4 bits), Expect = 1.7e-10, Sum P(2) = 1.7e-10
 Identities = 36/125 (28%), Positives = 58/125 (46%)

Query:   259 DSTLEDIQPSL-QAPFSSNQT-DTEMRAFQFNSLRHGDDLPEAFDWRAEGV----ISKVK 312
             D T+E ++  L +  F +  T D E+     N     D +P  FD R +      I+ ++
Sbjct:    46 DITIEQVKKRLMRTEFVAPHTPDVEVVKHDINE----DTIPATFDARTQWPNCMSINNIR 101

Query:   313 EQGKCACCWAFSAVGVVEAMHAIQGNSL--TELSVQQLVDCDMSNG-GCNGGRMDDALQY 369
             +Q  C  CWAF+A         I  N    T LS + ++ C  + G GC GG   +A +Y
Sbjct:   102 DQSDCGSCWAFAAAEAASDRFCIASNGAVNTLLSAEDVLSCCSNCGYGCEGGYPINAWKY 161

Query:   370 IIDNG 374
             ++ +G
Sbjct:   162 LVKSG 166

 Score = 97 (39.2 bits), Expect = 1.7e-10, Sum P(2) = 1.7e-10
 Identities = 24/72 (33%), Positives = 34/72 (47%)

Query:   581 VATRGPLSVGMNANGLFY-YSGGVIDLNQRLCNPKAQNHALIIVGYGEEEKKDGTSIPYW 639
             +   GP+         FY Y  GV          +   HA+ I+G+G +   +GT  PYW
Sbjct:   246 IIAHGPVEAAFTVYEDFYQYKTGVY---VHTTGQELGGHAIRILGWGTD---NGT--PYW 297

Query:   640 IVKNSWGSDWGE 651
             +V NSW  +WGE
Sbjct:   298 LVANSWNVNWGE 309

 Score = 83 (34.3 bits), Expect = 2.8e-07, Sum P(3) = 2.8e-07
 Identities = 25/97 (25%), Positives = 39/97 (40%)

Query:   111 DSTLEDIQPSL-QAPFSSNQT-DTEMRAFQFNSLRHGDDLPEAFDWRAEGV----ISKVK 164
             D T+E ++  L +  F +  T D E+     N     D +P  FD R +      I+ ++
Sbjct:    46 DITIEQVKKRLMRTEFVAPHTPDVEVVKHDINE----DTIPATFDARTQWPNCMSINNIR 101

Query:   165 EQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQHHD 201
             +Q  C  CWAF+A         I  N      +   D
Sbjct:   102 DQSDCGSCWAFAAAEAASDRFCIASNGAVNTLLSAED 138

 Score = 73 (30.8 bits), Expect = 5.2e-08, Sum P(2) = 5.2e-08
 Identities = 12/22 (54%), Positives = 15/22 (68%)

Query:   453 YGTS--IPYWIVKNSWGSDWGE 472
             +GT    PYW+V NSW  +WGE
Sbjct:   288 WGTDNGTPYWLVANSWNVNWGE 309

 Score = 53 (23.7 bits), Expect = 2.8e-07, Sum P(3) = 2.8e-07
 Identities = 11/33 (33%), Positives = 20/33 (60%)

Query:   501 LATEKLVDCDMSNG-GCNGGRMDDALQYIIDNG 532
             L+ E ++ C  + G GC GG   +A +Y++ +G
Sbjct:   134 LSAEDVLSCCSNCGYGCEGGYPINAWKYLVKSG 166


>DICTYBASE|DDB_G0292462 [details] [associations]
            symbol:DDB_G0292462 species:44689 "Dictyostelium
            discoideum" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 dictyBase:DDB_G0292462 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411 PROSITE:PS00640
            EMBL:AAFI02000190 RefSeq:XP_629634.1 MEROPS:C01.A56
            EnsemblProtists:DDB0184413 GeneID:8628698 KEGG:ddi:DDB_G0292462
            InParanoid:Q54D62 OMA:NTQVESH Uniprot:Q54D62
        Length = 323

 Score = 150 (57.9 bits), Expect = 2.1e-10, Sum P(2) = 2.1e-10
 Identities = 48/166 (28%), Positives = 76/166 (45%)

Query:   501 LATEKLVDCDMS---------NGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESER-- 549
             L+ + L+DCD S         N GC GG +  AL  +I N G+VSD+   Y+AS+     
Sbjct:    97 LSPQYLMDCDGSCVSDGVSGCNNGCKGGFVGLALTRLI-NEGIVSDECLSYQASKDSSCP 155

Query:   550 -GCLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNANGLFY--YSGGVIDL 606
               C               S   +   ++ +  + T GP+     A  + Y  +     D+
Sbjct:   156 TTCDDGSPISNTTIYKATSCRAFPTVQDAQYEIMTNGPVI----ATFMLYSDFKPHKWDV 211

Query:   607 NQRLCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
               +  N + ++HA+ +VG+G     DG  + YWI  NSWG+ WG+K
Sbjct:   212 YIKSSNTQVESHAVRVVGWGTTS--DG--VDYWIAANSWGTGWGDK 253

 Score = 140 (54.3 bits), Expect = 7.1e-10, Sum P(2) = 7.1e-10
 Identities = 41/113 (36%), Positives = 58/113 (51%)

Query:   294 DDLPEAFDWRAE-G-VISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTE--LSVQQLV 349
             D +P +FD R   G  +S V+EQ  C  CWA    G++     I+ +   +  LS Q L+
Sbjct:    44 DTIPASFDVRTNWGDCMSPVREQQSCGSCWAQVTSGILADRMCIESDKNIKMLLSPQYLM 103

Query:   350 DCDMS---------NGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGC 393
             DCD S         N GC GG +  AL  +I N G+VSD+   Y+AS+ +  C
Sbjct:   104 DCDGSCVSDGVSGCNNGCKGGFVGLALTRLI-NEGIVSDECLSYQASK-DSSC 154

 Score = 75 (31.5 bits), Expect = 7.1e-10, Sum P(2) = 7.1e-10
 Identities = 12/24 (50%), Positives = 17/24 (70%)

Query:   453 YGTS---IPYWIVKNSWGSDWGEK 473
             +GT+   + YWI  NSWG+ WG+K
Sbjct:   230 WGTTSDGVDYWIAANSWGTGWGDK 253

 Score = 69 (29.3 bits), Expect = 2.1e-10, Sum P(2) = 2.1e-10
 Identities = 16/47 (34%), Positives = 24/47 (51%)

Query:   146 DDLPEAFDWRAE-G-VISKVKEQGKCACCWAFSAVGVVEAMHAIQGN 190
             D +P +FD R   G  +S V+EQ  C  CWA    G++     I+ +
Sbjct:    44 DTIPASFDVRTNWGDCMSPVREQQSCGSCWAQVTSGILADRMCIESD 90


>MGI|MGI:88561 [details] [associations]
            symbol:Ctsb "cathepsin B" species:10090 "Mus musculus"
            [GO:0004175 "endopeptidase activity" evidence=ISO] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=ISO] [GO:0005576
            "extracellular region" evidence=ISO] [GO:0005615 "extracellular
            space" evidence=ISO] [GO:0005737 "cytoplasm" evidence=ISO]
            [GO:0005739 "mitochondrion" evidence=ISO;IDA] [GO:0005764
            "lysosome" evidence=ISO;IDA] [GO:0005901 "caveola" evidence=ISO]
            [GO:0006508 "proteolysis" evidence=ISO] [GO:0008233 "peptidase
            activity" evidence=ISO] [GO:0008234 "cysteine-type peptidase
            activity" evidence=ISO] [GO:0009897 "external side of plasma
            membrane" evidence=ISO] [GO:0009986 "cell surface" evidence=ISO]
            [GO:0016324 "apical plasma membrane" evidence=ISO] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0030984 "kininogen binding"
            evidence=ISO] [GO:0032403 "protein complex binding" evidence=ISO]
            [GO:0042277 "peptide binding" evidence=ISO] [GO:0042383
            "sarcolemma" evidence=ISO] [GO:0043621 "protein self-association"
            evidence=ISO] [GO:0048471 "perinuclear region of cytoplasm"
            evidence=ISO] [GO:0050790 "regulation of catalytic activity"
            evidence=IEA] [GO:0060548 "negative regulation of cell death"
            evidence=ISO] InterPro:IPR000668 InterPro:IPR012599
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 Pfam:PF08127
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 MGI:MGI:88561
            GO:GO:0005739 GO:GO:0042470 GO:GO:0048471 GO:GO:0005615
            GO:GO:0009612 GO:GO:0009611 GO:GO:0009897 GO:GO:0045471
            GO:GO:0016324 GO:GO:0009749 GO:GO:0006914 GO:GO:0043434
            eggNOG:COG4870 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0050790 GO:GO:0005764 GO:GO:0042383 GO:GO:0014070
            GO:GO:0042277 GO:GO:0060548 GO:GO:0005901 GO:GO:0014075
            GO:GO:0004197 GO:GO:0070670 GO:GO:0007519 MEROPS:C01.060 CTD:1508
            GeneTree:ENSGT00560000076599 HOGENOM:HOG000241341
            HOVERGEN:HBG003480 KO:K01363 OMA:GYPSGAW OrthoDB:EOG4K6G4C
            BRENDA:3.4.22.1 GO:GO:0097067 PANTHER:PTHR12411:SF16 ChiTaRS:CTSB
            EMBL:M65270 EMBL:M65263 EMBL:M65264 EMBL:M65265 EMBL:M65266
            EMBL:M65267 EMBL:M65268 EMBL:M65269 EMBL:M14222 EMBL:X54966
            EMBL:S69034 EMBL:AK083393 EMBL:AK147192 EMBL:AK149884 EMBL:AK151790
            EMBL:AK167361 EMBL:BC006656 IPI:IPI00113517 PIR:A38458
            RefSeq:NP_031824.1 UniGene:Mm.236553 UniGene:Mm.489070
            ProteinModelPortal:P10605 SMR:P10605 IntAct:P10605 STRING:P10605
            PhosphoSite:P10605 SWISS-2DPAGE:P10605 PaxDb:P10605 PRIDE:P10605
            Ensembl:ENSMUST00000006235 GeneID:13030 KEGG:mmu:13030
            UCSC:uc007uhh.1 InParanoid:P10605 BioCyc:MetaCyc:MONOMER-14810
            BindingDB:P10605 ChEMBL:CHEMBL5187 NextBio:282900 Bgee:P10605
            CleanEx:MM_CTSB Genevestigator:P10605 GermOnline:ENSMUSG00000021939
            Uniprot:P10605
        Length = 339

 Score = 132 (51.5 bits), Expect = 2.6e-10, Sum P(2) = 2.6e-10
 Identities = 38/108 (35%), Positives = 49/108 (45%)

Query:   295 DLPEAFD----WRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLT--ELSVQQL 348
             DLPE FD    W     I ++++QG C  CWAF AV  +     I  N     E+S + L
Sbjct:    79 DLPETFDAREQWSNCPTIGQIRDQGSCGSCWAFGAVEAISDRTCIHTNGRVNVEVSAEDL 138

Query:   349 VDC-DMSNG-GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCL 394
             + C  +  G GCNGG    A  +    G +VS   Y      S  GCL
Sbjct:   139 LTCCGIQCGDGCNGGYPSGAWSFWTKKG-LVSGGVY-----NSHVGCL 180

 Score = 101 (40.6 bits), Expect = 8.4e-08, Sum P(3) = 8.4e-08
 Identities = 21/59 (35%), Positives = 27/59 (45%)

Query:   147 DLPEAFD----WRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQHHD 201
             DLPE FD    W     I ++++QG C  CWAF AV  +     I  N    + V   D
Sbjct:    79 DLPETFDAREQWSNCPTIGQIRDQGSCGSCWAFGAVEAISDRTCIHTNGRVNVEVSAED 137

 Score = 89 (36.4 bits), Expect = 2.6e-10, Sum P(2) = 2.6e-10
 Identities = 15/34 (44%), Positives = 21/34 (61%)

Query:   618 HALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGE 651
             HA+ I+G+G E       +PYW+  NSW  DWG+
Sbjct:   278 HAIRILGWGVEN-----GVPYWLAANSWNLDWGD 306

 Score = 73 (30.8 bits), Expect = 1.2e-08, Sum P(2) = 1.2e-08
 Identities = 9/16 (56%), Positives = 12/16 (75%)

Query:   457 IPYWIVKNSWGSDWGE 472
             +PYW+  NSW  DWG+
Sbjct:   291 VPYWLAANSWNLDWGD 306

 Score = 48 (22.0 bits), Expect = 8.4e-08, Sum P(3) = 8.4e-08
 Identities = 18/52 (34%), Positives = 22/52 (42%)

Query:   502 ATEKLVDCDMSNG-GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCL 552
             A + L  C +  G GCNGG    A  +    G +VS   Y      S  GCL
Sbjct:   135 AEDLLTCCGIQCGDGCNGGYPSGAWSFWTKKG-LVSGGVY-----NSHVGCL 180


>UNIPROTKB|Q3SZI1 [details] [associations]
            symbol:TINAG "Tubulointerstitial nephritis antigen"
            species:9913 "Bos taurus" [GO:0005604 "basement membrane"
            evidence=IEA] [GO:0007155 "cell adhesion" evidence=IEA] [GO:0030247
            "polysaccharide binding" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0006955 "immune response"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0005044
            "scavenger receptor activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR001212 InterPro:IPR013128 Pfam:PF00112 Pfam:PF01033
            PROSITE:PS00524 PROSITE:PS50958 SMART:SM00201 SMART:SM00645
            GO:GO:0006955 GO:GO:0030247 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 PANTHER:PTHR12411 PROSITE:PS00640 GO:GO:0007155
            GO:GO:0005604 GO:GO:0005044 GeneTree:ENSGT00560000076599
            EMBL:BC102843 IPI:IPI00689615 RefSeq:NP_001030279.1
            UniGene:Bt.29080 ProteinModelPortal:Q3SZI1 MEROPS:C01.973
            PRIDE:Q3SZI1 Ensembl:ENSBTAT00000016790 GeneID:512517
            KEGG:bta:512517 CTD:27283 eggNOG:NOG310046 HOGENOM:HOG000241342
            HOVERGEN:HBG053961 InParanoid:Q3SZI1 OMA:WGQLTSS OrthoDB:EOG47PX5P
            NextBio:20870427 Uniprot:Q3SZI1
        Length = 476

 Score = 129 (50.5 bits), Expect = 2.7e-10, Sum P(2) = 2.7e-10
 Identities = 40/112 (35%), Positives = 50/112 (44%)

Query:   289 SLRHGDDLPEAF--DWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQ--GNSLTELS 344
             SL    DLPE F   ++  G      +Q  CA  WAFS   V     AIQ  G     LS
Sbjct:   210 SLTKTTDLPEFFIASYKWPGWTHGPLDQKNCAASWAFSTASVAADRIAIQSQGRYTANLS 269

Query:   345 VQQLVDC-DMSNGGCNGGRMDDALQYIIDNGGVVSDQAYP-YK-ASESERGC 393
              Q L+ C      GCN G +D A  Y+   G +VS   YP +K  + +  GC
Sbjct:   270 PQNLISCCAKKRHGCNSGSVDRAWWYLRKRG-LVSHACYPLFKDQNATNNGC 320

 Score = 97 (39.2 bits), Expect = 2.7e-10, Sum P(2) = 2.7e-10
 Identities = 26/90 (28%), Positives = 40/90 (44%)

Query:   568 RIPYGEEEEMKKWVATRGPLSVGMNANGLFY-YSGGVI----DLNQRLCN-PKAQNHALI 621
             R+   E E M++ +   GP+   M  +  F+ Y  G+       N+      K + HA+ 
Sbjct:   356 RVSSNETEIMRE-IMQNGPVQAIMQVHEDFFNYKTGIYRHITSTNEDSEKYRKFRTHAVK 414

Query:   622 IVGYGEEEKKDGTSIPYWIVKNSWGSDWGE 651
             + G+G      G    +WI  NSWG  WGE
Sbjct:   415 LTGWGTLRGAQGQKEKFWIAANSWGKSWGE 444

 Score = 69 (29.3 bits), Expect = 9.8e-07, Sum P(3) = 9.8e-07
 Identities = 23/63 (36%), Positives = 27/63 (42%)

Query:   141 SLRHGDDLPEAF--DWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQ--GNNLTELS 196
             SL    DLPE F   ++  G      +Q  CA  WAFS   V     AIQ  G     LS
Sbjct:   210 SLTKTTDLPEFFIASYKWPGWTHGPLDQKNCAASWAFSTASVAADRIAIQSQGRYTANLS 269

Query:   197 VQH 199
              Q+
Sbjct:   270 PQN 272

 Score = 68 (29.0 bits), Expect = 9.8e-07, Sum P(3) = 9.8e-07
 Identities = 18/56 (32%), Positives = 28/56 (50%)

Query:   499 SRLATEKLVDC-DMSNGGCNGGRMDDALQYIIDNGGVVSDQAYP-YK-ASESERGC 551
             + L+ + L+ C      GCN G +D A  Y+   G +VS   YP +K  + +  GC
Sbjct:   266 ANLSPQNLISCCAKKRHGCNSGSVDRAWWYLRKRG-LVSHACYPLFKDQNATNNGC 320

 Score = 67 (28.6 bits), Expect = 3.4e-07, Sum P(2) = 3.4e-07
 Identities = 10/19 (52%), Positives = 11/19 (57%)

Query:   454 GTSIPYWIVKNSWGSDWGE 472
             G    +WI  NSWG  WGE
Sbjct:   426 GQKEKFWIAANSWGKSWGE 444


>RGD|621509 [details] [associations]
            symbol:Ctsb "cathepsin B" species:10116 "Rattus norvegicus"
            [GO:0004175 "endopeptidase activity" evidence=IMP;IDA] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=IEA;ISO;IDA]
            [GO:0005515 "protein binding" evidence=IPI] [GO:0005576
            "extracellular region" evidence=IDA] [GO:0005615 "extracellular
            space" evidence=IDA] [GO:0005730 "nucleolus" evidence=IEA;ISO]
            [GO:0005737 "cytoplasm" evidence=IDA] [GO:0005739 "mitochondrion"
            evidence=IEA;ISO;IDA] [GO:0005764 "lysosome" evidence=IEA;ISO;IDA]
            [GO:0006508 "proteolysis" evidence=IEA;IEP;ISO;IMP;IDA;TAS]
            [GO:0006914 "autophagy" evidence=IEP] [GO:0006950 "response to
            stress" evidence=IEP] [GO:0007283 "spermatogenesis" evidence=IEP]
            [GO:0007519 "skeletal muscle tissue development" evidence=IEP]
            [GO:0008233 "peptidase activity" evidence=ISO] [GO:0008234
            "cysteine-type peptidase activity" evidence=ISO] [GO:0009611
            "response to wounding" evidence=IEP] [GO:0009612 "response to
            mechanical stimulus" evidence=IEP] [GO:0009749 "response to glucose
            stimulus" evidence=IEP] [GO:0009897 "external side of plasma
            membrane" evidence=IDA] [GO:0009986 "cell surface" evidence=IDA]
            [GO:0014070 "response to organic cyclic compound" evidence=IEP]
            [GO:0014075 "response to amine stimulus" evidence=IEP] [GO:0016324
            "apical plasma membrane" evidence=IDA] [GO:0030984 "kininogen
            binding" evidence=IPI] [GO:0032403 "protein complex binding"
            evidence=IPI] [GO:0034097 "response to cytokine stimulus"
            evidence=IEP] [GO:0042277 "peptide binding" evidence=IDA]
            [GO:0042383 "sarcolemma" evidence=IDA] [GO:0042470 "melanosome"
            evidence=IEA] [GO:0043231 "intracellular membrane-bounded
            organelle" evidence=ISO] [GO:0043434 "response to peptide hormone
            stimulus" evidence=IEP] [GO:0043621 "protein self-association"
            evidence=IDA] [GO:0045471 "response to ethanol" evidence=IEP]
            [GO:0048471 "perinuclear region of cytoplasm" evidence=ISO;IDA]
            [GO:0050790 "regulation of catalytic activity" evidence=IEA]
            [GO:0060548 "negative regulation of cell death" evidence=IMP]
            [GO:0070670 "response to interleukin-4" evidence=IEP] [GO:0097067
            "cellular response to thyroid hormone stimulus" evidence=IEA;ISO]
            [GO:0005901 "caveola" evidence=IDA] InterPro:IPR000668
            InterPro:IPR012599 InterPro:IPR013128 InterPro:IPR015643
            Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 RGD:621509 GO:GO:0005739 GO:GO:0042470
            GO:GO:0005615 GO:GO:0009612 GO:GO:0009611 GO:GO:0009897
            GO:GO:0045471 GO:GO:0016324 GO:GO:0009749 GO:GO:0006914
            GO:GO:0043434 eggNOG:COG4870 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0050790 GO:GO:0007283
            GO:GO:0005764 GO:GO:0042383 GO:GO:0043621 GO:GO:0014070
            GO:GO:0042277 GO:GO:0060548 GO:GO:0014075 GO:GO:0004197
            GO:GO:0070670 GO:GO:0007519 MEROPS:C01.060 HOGENOM:HOG000241341
            HOVERGEN:HBG003480 OrthoDB:EOG4K6G4C BRENDA:3.4.22.1 GO:GO:0097067
            PANTHER:PTHR12411:SF16 EMBL:X82396 EMBL:M11305 IPI:IPI00212811
            PIR:S51041 UniGene:Rn.100909 PDB:1CPJ PDB:1CTE PDB:1MIR PDB:1THE
            PDBsum:1CPJ PDBsum:1CTE PDBsum:1MIR PDBsum:1THE
            ProteinModelPortal:P00787 SMR:P00787 STRING:P00787 PRIDE:P00787
            UCSC:RGD:621509 InParanoid:P00787 SABIO-RK:P00787 BindingDB:P00787
            ChEMBL:CHEMBL2602 EvolutionaryTrace:P00787 ArrayExpress:P00787
            Genevestigator:P00787 GermOnline:ENSRNOG00000010331 Uniprot:P00787
        Length = 339

 Score = 125 (49.1 bits), Expect = 4.8e-10, Sum P(2) = 4.8e-10
 Identities = 37/108 (34%), Positives = 51/108 (47%)

Query:   295 DLPEAFD----WRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLT--ELSVQQL 348
             +LPE+FD    W     I+++++QG C  CWAF AV  +     I  N     E+S + L
Sbjct:    79 NLPESFDAREQWSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHTNGRVNVEVSAEDL 138

Query:   349 VDC-DMSNG-GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCL 394
             + C  +  G GCNGG    A  +    G +VS   Y      S  GCL
Sbjct:   139 LTCCGIQCGDGCNGGYPSGAWNFWTRKG-LVSGGVY-----NSHIGCL 180

 Score = 95 (38.5 bits), Expect = 1.1e-06, Sum P(2) = 1.1e-06
 Identities = 20/59 (33%), Positives = 29/59 (49%)

Query:   147 DLPEAFD----WRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQHHD 201
             +LPE+FD    W     I+++++QG C  CWAF AV  +     I  N    + V   D
Sbjct:    79 NLPESFDAREQWSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHTNGRVNVEVSAED 137

 Score = 94 (38.1 bits), Expect = 4.8e-10, Sum P(2) = 4.8e-10
 Identities = 24/79 (30%), Positives = 35/79 (44%)

Query:   574 EEEMKKWVATRGPLSVGMNANGLFY-YSGGVIDLNQRLCNPKAQNHALIIVGYGEEEKKD 632
             E+E+   +   GP+         F  Y  GV    +         HA+ I+G+G E    
Sbjct:   236 EKEIMAEIYKNGPVEGAFTVFSDFLTYKSGVY---KHEAGDVMGGHAIRILGWGIEN--- 289

Query:   633 GTSIPYWIVKNSWGSDWGE 651
                +PYW+V NSW  DWG+
Sbjct:   290 --GVPYWLVANSWNVDWGD 306

 Score = 77 (32.2 bits), Expect = 2.7e-08, Sum P(2) = 2.7e-08
 Identities = 10/16 (62%), Positives = 13/16 (81%)

Query:   457 IPYWIVKNSWGSDWGE 472
             +PYW+V NSW  DWG+
Sbjct:   291 VPYWLVANSWNVDWGD 306


>UNIPROTKB|Q6IN22 [details] [associations]
            symbol:Ctsb "Cathepsin B" species:10116 "Rattus norvegicus"
            [GO:0050790 "regulation of catalytic activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR012599 InterPro:IPR013128
            InterPro:IPR015643 Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 RGD:621509 GO:GO:0005739
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0050790 GO:GO:0005764 GO:GO:0004197 CTD:1508
            GeneTree:ENSGT00560000076599 HOVERGEN:HBG003480 KO:K01363
            OMA:GYPSGAW GO:GO:0097067 PANTHER:PTHR12411:SF16 UniGene:Rn.100909
            EMBL:CH474023 HSSP:P00785 EMBL:BC072490 IPI:IPI00562653
            RefSeq:NP_072119.2 SMR:Q6IN22 IntAct:Q6IN22 STRING:Q6IN22
            Ensembl:ENSRNOT00000014177 GeneID:64529 KEGG:rno:64529
            InParanoid:Q6IN22 NextBio:613362 Genevestigator:Q6IN22
            Uniprot:Q6IN22
        Length = 339

 Score = 125 (49.1 bits), Expect = 4.8e-10, Sum P(2) = 4.8e-10
 Identities = 37/108 (34%), Positives = 51/108 (47%)

Query:   295 DLPEAFD----WRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLT--ELSVQQL 348
             +LPE+FD    W     I+++++QG C  CWAF AV  +     I  N     E+S + L
Sbjct:    79 NLPESFDAREQWSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHTNGRVNVEVSAEDL 138

Query:   349 VDC-DMSNG-GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCL 394
             + C  +  G GCNGG    A  +    G +VS   Y      S  GCL
Sbjct:   139 LTCCGIQCGDGCNGGYPSGAWNFWTRKG-LVSGGVY-----NSHIGCL 180

 Score = 95 (38.5 bits), Expect = 1.1e-06, Sum P(2) = 1.1e-06
 Identities = 20/59 (33%), Positives = 29/59 (49%)

Query:   147 DLPEAFD----WRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQHHD 201
             +LPE+FD    W     I+++++QG C  CWAF AV  +     I  N    + V   D
Sbjct:    79 NLPESFDAREQWSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHTNGRVNVEVSAED 137

 Score = 94 (38.1 bits), Expect = 4.8e-10, Sum P(2) = 4.8e-10
 Identities = 24/79 (30%), Positives = 35/79 (44%)

Query:   574 EEEMKKWVATRGPLSVGMNANGLFY-YSGGVIDLNQRLCNPKAQNHALIIVGYGEEEKKD 632
             E+E+   +   GP+         F  Y  GV    +         HA+ I+G+G E    
Sbjct:   236 EKEIMAEIYKNGPVEGAFTVFSDFLTYKSGVY---KHEAGDVMGGHAIRILGWGIEN--- 289

Query:   633 GTSIPYWIVKNSWGSDWGE 651
                +PYW+V NSW  DWG+
Sbjct:   290 --GVPYWLVANSWNVDWGD 306

 Score = 77 (32.2 bits), Expect = 2.7e-08, Sum P(2) = 2.7e-08
 Identities = 10/16 (62%), Positives = 13/16 (81%)

Query:   457 IPYWIVKNSWGSDWGE 472
             +PYW+V NSW  DWG+
Sbjct:   291 VPYWLVANSWNVDWGD 306


>UNIPROTKB|E2QV47 [details] [associations]
            symbol:CTSH "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0097208 "alveolar lamellar body"
            evidence=IEA] [GO:0097067 "cellular response to thyroid hormone
            stimulus" evidence=IEA] [GO:0070371 "ERK1 and ERK2 cascade"
            evidence=IEA] [GO:0070324 "thyroid hormone binding" evidence=IEA]
            [GO:0060448 "dichotomous subdivision of terminal units involved in
            lung branching" evidence=IEA] [GO:0045766 "positive regulation of
            angiogenesis" evidence=IEA] [GO:0043129 "surfactant homeostasis"
            evidence=IEA] [GO:0043066 "negative regulation of apoptotic
            process" evidence=IEA] [GO:0033619 "membrane protein proteolysis"
            evidence=IEA] [GO:0032526 "response to retinoic acid" evidence=IEA]
            [GO:0031648 "protein destabilization" evidence=IEA] [GO:0031638
            "zymogen activation" evidence=IEA] [GO:0030108 "HLA-A specific
            activating MHC class I receptor activity" evidence=IEA] [GO:0016505
            "apoptotic protease activator activity" evidence=IEA] [GO:0010815
            "bradykinin catabolic process" evidence=IEA] [GO:0010813
            "neuropeptide catabolic process" evidence=IEA] [GO:0010634
            "positive regulation of epithelial cell migration" evidence=IEA]
            [GO:0010628 "positive regulation of gene expression" evidence=IEA]
            [GO:0008284 "positive regulation of cell proliferation"
            evidence=IEA] [GO:0005829 "cytosol" evidence=IEA] [GO:0005764
            "lysosome" evidence=IEA] [GO:0005615 "extracellular space"
            evidence=IEA] [GO:0004252 "serine-type endopeptidase activity"
            evidence=IEA] [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=IEA] [GO:0004177 "aminopeptidase activity" evidence=IEA]
            [GO:0002764 "immune response-regulating signaling pathway"
            evidence=IEA] [GO:0001913 "T cell mediated cytotoxicity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            SMART:SM00645 GO:GO:0005829 GO:GO:0043066 GO:GO:0005615
            GO:GO:0008284 GO:GO:0070371 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00639 GO:GO:0045766
            GO:GO:0004177 GO:GO:0004252 GO:GO:0005764 GO:GO:0032526
            GO:GO:0010628 GO:GO:0070324 GO:GO:0016505 GO:GO:0010634
            GO:GO:0004197 GO:GO:0042599 GO:GO:0031648 GO:GO:0097067
            GO:GO:0031638 GO:GO:0001913 GO:GO:0030108 GO:GO:0010815
            GO:GO:0060448 GO:GO:0002764 GO:GO:0033619 GO:GO:0010813
            GO:GO:0043129 Ensembl:ENSCAFT00000036196 Uniprot:E2QV47
        Length = 136

 Score = 153 (58.9 bits), Expect = 5.7e-10, P = 5.7e-10
 Identities = 37/117 (31%), Positives = 51/117 (43%)

Query:   535 VSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNAN 594
             + + +YPYK  + +  C               + I   +E+ M + VA   P+S      
Sbjct:     1 MGEDSYPYKGQDGD--C-KYQPSKAIAFVKDVANITINDEQAMVEAVALYNPVSFAFEVT 57

Query:   595 GLFY-YSGGVIDLNQRLCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWG 650
               F  Y  G+         P   NHA++ VGYGE+       IPYWIVKNSWG  WG
Sbjct:    58 SDFMMYRKGIYSSTSCHKTPDKVNHAVLAVGYGEQN-----GIPYWIVKNSWGPQWG 109


>RGD|708479 [details] [associations]
            symbol:Ctsz "cathepsin Z" species:10116 "Rattus norvegicus"
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=TAS]
            [GO:0005615 "extracellular space" evidence=IEA;ISO] [GO:0005783
            "endoplasmic reticulum" evidence=IEA;ISO] [GO:0006508 "proteolysis"
            evidence=IEA] [GO:0060441 "epithelial tube branching involved in
            lung morphogenesis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            RGD:708479 GO:GO:0005576 GO:GO:0006508 InterPro:IPR025661
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0004197 MEROPS:C01.013 CTD:1522 HOVERGEN:HBG004456 KO:K08568
            EMBL:AB023781 EMBL:BC091110 IPI:IPI00207663 RefSeq:NP_899159.1
            UniGene:Rn.1475 ProteinModelPortal:Q9R1T3 SMR:Q9R1T3 PRIDE:Q9R1T3
            GeneID:252929 KEGG:rno:252929 BindingDB:Q9R1T3 NextBio:624097
            Genevestigator:Q9R1T3 Uniprot:Q9R1T3
        Length = 306

 Score = 158 (60.7 bits), Expect = 7.9e-10, Sum P(2) = 7.9e-10
 Identities = 51/177 (28%), Positives = 79/177 (44%)

Query:   487 DLELTGVLPSKLSRLATEKLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASE 546
             +++  G  PS L  L+ + ++DC  + G C GG      +Y   +G +  +    Y+A +
Sbjct:   108 NIKRKGAWPSTL--LSVQNVIDCGNA-GSCEGGNDLPVWEYAHKHG-IPDETCNNYQAKD 163

Query:   547 SE----RGCLXXXXXXXXXXXXXYS--RI-PYGE---EEEMKKWVATRGPLSVGMNANG- 595
              E      C              Y+  R+  YG     E+M   +   GP+S G+ A   
Sbjct:   164 QECDKFNQCGTCTEFKECHTIQNYTLWRVGDYGSLSGREKMMAEIYANGPISCGIMATER 223

Query:   596 LFYYSGGVIDLNQRLCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
             +  Y+GG+    Q   N    NH + + G+G     DG  I YWIV+NSWG  WGE+
Sbjct:   224 MSNYTGGIYTEYQ---NQAIINHIISVAGWGVSN--DG--IEYWIVRNSWGEPWGER 273

 Score = 96 (38.9 bits), Expect = 3.6e-07, Sum P(2) = 3.6e-07
 Identities = 26/72 (36%), Positives = 37/72 (51%)

Query:   417 EEMKKWVATRGPLSVGMNANG-LFYYSGGVID--LNQRL---------YGTS---IPYWI 461
             E+M   +   GP+S G+ A   +  Y+GG+     NQ +         +G S   I YWI
Sbjct:   202 EKMMAEIYANGPISCGIMATERMSNYTGGIYTEYQNQAIINHIISVAGWGVSNDGIEYWI 261

Query:   462 VKNSWGSDWGEK 473
             V+NSWG  WGE+
Sbjct:   262 VRNSWGEPWGER 273

 Score = 96 (38.9 bits), Expect = 3.6e-07, Sum P(2) = 3.6e-07
 Identities = 31/105 (29%), Positives = 46/105 (43%)

Query:   295 DLPEAFDWR-AEGV--ISKVKEQG---KCACCWAFSAVGVVEAMHAIQGNSL---TELSV 345
             DLP+ +DWR   GV   S  + Q     C  CWA  +   +     I+       T LSV
Sbjct:    63 DLPKNWDWRNVNGVNYASVTRNQHIPQYCGSCWAHGSTSALADRINIKRKGAWPSTLLSV 122

Query:   346 QQLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESE 390
             Q ++DC  + G C GG      +Y   +G +  +    Y+A + E
Sbjct:   123 QNVIDCGNA-GSCEGGNDLPVWEYAHKHG-IPDETCNNYQAKDQE 165

 Score = 53 (23.7 bits), Expect = 7.9e-10, Sum P(2) = 7.9e-10
 Identities = 14/41 (34%), Positives = 19/41 (46%)

Query:   147 DLPEAFDWR-AEGV--ISKVKEQG---KCACCWAFSAVGVV 181
             DLP+ +DWR   GV   S  + Q     C  CWA  +   +
Sbjct:    63 DLPKNWDWRNVNGVNYASVTRNQHIPQYCGSCWAHGSTSAL 103


>DICTYBASE|DDB_G0276111 [details] [associations]
            symbol:DDB_G0276111 species:44689 "Dictyostelium
            discoideum" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 SMART:SM00645
            InterPro:IPR000169 dictyBase:DDB_G0276111 Pfam:PF00188
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 PANTHER:PTHR12411
            PROSITE:PS00139 EMBL:AAFI02000014 Gene3D:3.40.33.10
            InterPro:IPR001283 InterPro:IPR014044 PRINTS:PR00837 SMART:SM00198
            SUPFAM:SSF55797 ProtClustDB:CLSZ2429919 RefSeq:XP_643261.1
            ProteinModelPortal:Q75JH0 EnsemblProtists:DDB0169514 GeneID:8620304
            KEGG:ddi:DDB_G0276111 InParanoid:Q75JH0 OMA:GFVTSIK Uniprot:Q75JH0
        Length = 415

 Score = 175 (66.7 bits), Expect = 9.9e-10, Sum P(2) = 9.9e-10
 Identities = 52/196 (26%), Positives = 89/196 (45%)

Query:   301 DWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGN---SLTELSVQQLVDCDMSNGG 357
             DW++ G ++ +K QG+C  C++F+    +E+ + I+ N   +  +LS Q  V C   N G
Sbjct:   214 DWKSLGFVTSIKNQGQCGGCYSFATCAALESAYLIKNNLPNTDIDLSEQNFVSC--VNYG 271

Query:   358 CNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYGEEE 417
             C GG     L  +  + G++ + +YPYKA      C              YS I  G +E
Sbjct:   272 CGGGNGQSCLDKL-KSTGIMYETSYPYKAVTGS--CPNVIQSPQPFKWTGYSNIQ-GNKE 327

Query:   418 EMKKWVATRGPLSVGMNAN-GLFYYSGGVIDLNQRL----------YGTSIPYWIVKNSW 466
                  + + GP+   +  + G   Y  G+   +Q            Y ++   +++KNSW
Sbjct:   328 AFLNALKS-GPIYASLYVDSGFQLYKSGIYSCSQSSTPNHAITIVGYSSADNSYLIKNSW 386

Query:   467 GSDWGEK--VEDKVGS 480
             G+ +GE   +  K GS
Sbjct:   387 GTIYGESGYIRLKEGS 402

 Score = 121 (47.7 bits), Expect = 5.1e-09, Sum P(3) = 5.1e-09
 Identities = 38/140 (27%), Positives = 59/140 (42%)

Query:   513 NGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGCLXXXXXXXXXXXXXYSRIPYG 572
             N GC GG     L  +  + G++ + +YPYKA      C              YS I  G
Sbjct:   269 NYGCGGGNGQSCLDKL-KSTGIMYETSYPYKAVTGS--CPNVIQSPQPFKWTGYSNIQ-G 324

Query:   573 EEEEMKKWVATRGPLSVGMNAN-GLFYYSGGVIDLNQRLCNPKAQNHALIIVGYGEEEKK 631
              +E     + + GP+   +  + G   Y  G+   +Q        NHA+ IVGY   +  
Sbjct:   325 NKEAFLNALKS-GPIYASLYVDSGFQLYKSGIYSCSQS----STPNHAITIVGYSSADNS 379

Query:   632 DGTSIPYWIVKNSWGSDWGE 651
                    +++KNSWG+ +GE
Sbjct:   380 -------YLIKNSWGTIYGE 392

 Score = 91 (37.1 bits), Expect = 5.1e-09, Sum P(3) = 5.1e-09
 Identities = 15/54 (27%), Positives = 32/54 (59%)

Query:   153 DWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQHHDKVYSS 206
             DW++ G ++ +K QG+C  C++F+    +E+ + I+ NNL    +   ++ + S
Sbjct:   214 DWKSLGFVTSIKNQGQCGGCYSFATCAALESAYLIK-NNLPNTDIDLSEQNFVS 266

 Score = 40 (19.1 bits), Expect = 9.9e-10, Sum P(2) = 9.9e-10
 Identities = 12/40 (30%), Positives = 19/40 (47%)

Query:    61 DLLRRHENFVTNVEKAEDYQREDSGTAVFEV-NKFFDLSD 99
             D+  + + FV+          E +G A+F+V  K FD  D
Sbjct:    58 DIANKVQTFVSQCNLGVPQNIEGTGFALFKVMGKEFDPVD 97


>MGI|MGI:1891190 [details] [associations]
            symbol:Ctsz "cathepsin Z" species:10090 "Mus musculus"
            [GO:0005615 "extracellular space" evidence=ISO] [GO:0005764
            "lysosome" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0008233 "peptidase activity" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            MGI:MGI:1891190 GO:GO:0005783 GO:GO:0005615 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GO:GO:0060441 GeneTree:ENSGT00560000076599 MEROPS:C01.013 CTD:1522
            HOVERGEN:HBG004456 KO:K08568 OrthoDB:EOG42Z4QN OMA:QCGTCTE
            ChiTaRS:CTSZ EMBL:AJ242663 EMBL:AF136277 EMBL:AF136278
            EMBL:BC008619 IPI:IPI00986833 RefSeq:NP_071720.1 UniGene:Mm.156919
            ProteinModelPortal:Q9WUU7 SMR:Q9WUU7 IntAct:Q9WUU7 STRING:Q9WUU7
            PaxDb:Q9WUU7 PRIDE:Q9WUU7 Ensembl:ENSMUST00000016400 GeneID:64138
            KEGG:mmu:64138 InParanoid:Q9WUU7 NextBio:319927 Bgee:Q9WUU7
            CleanEx:MM_CTSZ Genevestigator:Q9WUU7 GermOnline:ENSMUSG00000016256
            Uniprot:Q9WUU7
        Length = 306

 Score = 156 (60.0 bits), Expect = 1.4e-09, Sum P(2) = 1.4e-09
 Identities = 50/177 (28%), Positives = 80/177 (45%)

Query:   487 DLELTGVLPSKLSRLATEKLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASE 546
             +++  G  PS L  L+ + ++DC  + G C GG      +Y   +G +  +    Y+A +
Sbjct:   108 NIKRKGAWPSIL--LSVQNVIDCGNA-GSCEGGNDLPVWEYAHKHG-IPDETCNNYQAKD 163

Query:   547 SE----RGCLXXXXXXXXXXXXXYS--RI-PYGE---EEEMKKWVATRGPLSVGMNANGL 596
              +      C              Y+  R+  YG     E+M   +   GP+S G+ A  +
Sbjct:   164 QDCDKFNQCGTCTEFKECHTIQNYTLWRVGDYGSLSGREKMMAEIYANGPISCGIMATEM 223

Query:   597 FY-YSGGVIDLNQRLCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
                Y+GG+   +Q   +    NH + + G+G     DG  I YWIV+NSWG  WGEK
Sbjct:   224 MSNYTGGIYAEHQ---DQAVINHIISVAGWGVSN--DG--IEYWIVRNSWGEPWGEK 273

 Score = 98 (39.6 bits), Expect = 1.4e-06, Sum P(2) = 1.4e-06
 Identities = 27/72 (37%), Positives = 38/72 (52%)

Query:   417 EEMKKWVATRGPLSVGMNANGLFY-YSGGVI----D---LNQRL----YGTS---IPYWI 461
             E+M   +   GP+S G+ A  +   Y+GG+     D   +N  +    +G S   I YWI
Sbjct:   202 EKMMAEIYANGPISCGIMATEMMSNYTGGIYAEHQDQAVINHIISVAGWGVSNDGIEYWI 261

Query:   462 VKNSWGSDWGEK 473
             V+NSWG  WGEK
Sbjct:   262 VRNSWGEPWGEK 273

 Score = 88 (36.0 bits), Expect = 1.4e-06, Sum P(2) = 1.4e-06
 Identities = 29/105 (27%), Positives = 45/105 (42%)

Query:   295 DLPEAFDWR-AEGV--ISKVKEQG---KCACCWAFSAVGVVEAMHAIQGNSLTE---LSV 345
             DLP+ +DWR   GV   S  + Q     C  CWA  +   +     I+         LSV
Sbjct:    63 DLPKNWDWRNVNGVNYASVTRNQHIPQYCGSCWAHGSTSAMADRINIKRKGAWPSILLSV 122

Query:   346 QQLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESE 390
             Q ++DC  + G C GG      +Y   +G +  +    Y+A + +
Sbjct:   123 QNVIDCGNA-GSCEGGNDLPVWEYAHKHG-IPDETCNNYQAKDQD 165

 Score = 53 (23.7 bits), Expect = 1.4e-09, Sum P(2) = 1.4e-09
 Identities = 14/41 (34%), Positives = 19/41 (46%)

Query:   147 DLPEAFDWR-AEGV--ISKVKEQG---KCACCWAFSAVGVV 181
             DLP+ +DWR   GV   S  + Q     C  CWA  +   +
Sbjct:    63 DLPKNWDWRNVNGVNYASVTRNQHIPQYCGSCWAHGSTSAM 103


>UNIPROTKB|Q9UBR2 [details] [associations]
            symbol:CTSZ "Cathepsin Z" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0060441 "epithelial tube
            branching involved in lung morphogenesis" evidence=IEA] [GO:0005764
            "lysosome" evidence=IEA] [GO:0005783 "endoplasmic reticulum"
            evidence=IDA] [GO:0005615 "extracellular space" evidence=IDA]
            Reactome:REACT_11123 Reactome:REACT_17015 InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0005783 EMBL:CH471077 GO:GO:0005615 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            EMBL:AL109840 GO:GO:0060441 MEROPS:C01.013 CTD:1522
            HOGENOM:HOG000264454 HOVERGEN:HBG004456 KO:K08568 OrthoDB:EOG42Z4QN
            BRENDA:3.4.18.1 EMBL:AF073890 EMBL:AF032906 EMBL:AF136273
            EMBL:AF136276 EMBL:AF136274 EMBL:AF136275 EMBL:AK314931
            EMBL:BC042168 EMBL:AF009923 IPI:IPI00002745 RefSeq:NP_001327.2
            UniGene:Hs.252549 PDB:1DEU PDB:1EF7 PDBsum:1DEU PDBsum:1EF7
            ProteinModelPortal:Q9UBR2 SMR:Q9UBR2 STRING:Q9UBR2 DMDM:12643324
            PaxDb:Q9UBR2 PeptideAtlas:Q9UBR2 PRIDE:Q9UBR2 DNASU:1522
            Ensembl:ENST00000217131 GeneID:1522 KEGG:hsa:1522 UCSC:uc002yai.2
            GeneCards:GC20M057570 HGNC:HGNC:2547 HPA:CAB025114 MIM:603169
            neXtProt:NX_Q9UBR2 PharmGKB:PA27043 InParanoid:Q9UBR2 OMA:QCGTCTE
            PhylomeDB:Q9UBR2 BindingDB:Q9UBR2 ChEMBL:CHEMBL4160 ChiTaRS:CTSZ
            EvolutionaryTrace:Q9UBR2 GenomeRNAi:1522 NextBio:6299 Bgee:Q9UBR2
            CleanEx:HS_CTSZ Genevestigator:Q9UBR2 GermOnline:ENSG00000101160
            Uniprot:Q9UBR2
        Length = 303

 Score = 151 (58.2 bits), Expect = 1.5e-09, Sum P(2) = 1.5e-09
 Identities = 51/177 (28%), Positives = 78/177 (44%)

Query:   487 DLELTGVLPSKLSRLATEKLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASE 546
             +++  G  PS L  L+ + ++DC  + G C GG       Y   +G +  +    Y+A +
Sbjct:   106 NIKRKGAWPSTL--LSVQNVIDCGNA-GSCEGGNDLSVWDYAHQHG-IPDETCNNYQAKD 161

Query:   547 SE----RGCLXXXXXXXXXXXXXYS--RI-PYGE---EEEMKKWVATRGPLSVGMNANG- 595
              E      C              Y+  R+  YG     E+M   +   GP+S G+ A   
Sbjct:   162 QECDKFNQCGTCNEFKECHAIRNYTLWRVGDYGSLSGREKMMAEIYANGPISCGIMATER 221

Query:   596 LFYYSGGVIDLNQRLCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
             L  Y+GG+    Q   +    NH + + G+G     DGT   YWIV+NSWG  WGE+
Sbjct:   222 LANYTGGIYAEYQ---DTTYINHVVSVAGWGIS---DGTE--YWIVRNSWGEPWGER 270

 Score = 101 (40.6 bits), Expect = 4.0e-07, Sum P(2) = 4.0e-07
 Identities = 31/105 (29%), Positives = 48/105 (45%)

Query:   295 DLPEAFDWR-AEGV--ISKVKEQG---KCACCWAFSAVGVVEAMHAIQGNSL---TELSV 345
             DLP+++DWR  +GV   S  + Q     C  CWA ++   +     I+       T LSV
Sbjct:    61 DLPKSWDWRNVDGVNYASITRNQHIPQYCGSCWAHASTSAMADRINIKRKGAWPSTLLSV 120

Query:   346 QQLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESE 390
             Q ++DC  + G C GG       Y   +G +  +    Y+A + E
Sbjct:   121 QNVIDCGNA-GSCEGGNDLSVWDYAHQHG-IPDETCNNYQAKDQE 163

 Score = 90 (36.7 bits), Expect = 4.0e-07, Sum P(2) = 4.0e-07
 Identities = 26/71 (36%), Positives = 37/71 (52%)

Query:   417 EEMKKWVATRGPLSVGMNANG-LFYYSGGVI----D---LNQRL----YGTS--IPYWIV 462
             E+M   +   GP+S G+ A   L  Y+GG+     D   +N  +    +G S    YWIV
Sbjct:   200 EKMMAEIYANGPISCGIMATERLANYTGGIYAEYQDTTYINHVVSVAGWGISDGTEYWIV 259

Query:   463 KNSWGSDWGEK 473
             +NSWG  WGE+
Sbjct:   260 RNSWGEPWGER 270

 Score = 58 (25.5 bits), Expect = 1.5e-09, Sum P(2) = 1.5e-09
 Identities = 14/41 (34%), Positives = 22/41 (53%)

Query:   147 DLPEAFDWR-AEGV--ISKVKEQG---KCACCWAFSAVGVV 181
             DLP+++DWR  +GV   S  + Q     C  CWA ++   +
Sbjct:    61 DLPKSWDWRNVDGVNYASITRNQHIPQYCGSCWAHASTSAM 101


>WB|WBGene00000785 [details] [associations]
            symbol:cpr-5 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 eggNOG:NOG315657
            GeneTree:ENSGT00560000076599 HOGENOM:HOG000241341 KO:K01363
            PANTHER:PTHR12411:SF16 EMBL:L39896 EMBL:L39927 EMBL:FO081739
            PIR:T37277 RefSeq:NP_503383.1 UniGene:Cel.19730
            ProteinModelPortal:P43509 SMR:P43509 DIP:DIP-25329N IntAct:P43509
            MINT:MINT-1051285 STRING:P43509 MEROPS:C01.A35 PaxDb:P43509
            EnsemblMetazoa:W07B8.5 GeneID:178612 KEGG:cel:CELE_W07B8.5
            UCSC:W07B8.5.1 CTD:178612 WormBase:W07B8.5 InParanoid:P43509
            OMA:DAIPDHF NextBio:901840 Uniprot:P43509
        Length = 344

 Score = 113 (44.8 bits), Expect = 1.7e-09, Sum P(2) = 1.7e-09
 Identities = 28/79 (35%), Positives = 40/79 (50%)

Query:   575 EEMKKWVATRGPLSVGMNANGLFY-YSGGVIDLNQRLCNPKAQNHALIIVGYGEEEKKDG 633
             E+++  + T GP+ V       FY Y+ GV              HA+ I+G+G +   +G
Sbjct:   245 EQIQTEILTNGPIEVAFTVYEDFYQYTTGVY---VHTAGASLGGHAVKILGWGVD---NG 298

Query:   634 TSIPYWIVKNSWGSDWGEK 652
             T  PYW+V NSW   WGEK
Sbjct:   299 T--PYWLVANSWNVAWGEK 315

 Score = 102 (41.0 bits), Expect = 1.7e-09, Sum P(2) = 1.7e-09
 Identities = 33/110 (30%), Positives = 53/110 (48%)

Query:   294 DDLPEAFDWRAEGV----ISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSL--TELSVQQ 347
             D +P+ FD R +      I+ +++Q  C  CWAF+A   +     I  N    T LS + 
Sbjct:    80 DAIPDHFDARDQWPNCMSINNIRDQSDCGSCWAFAAAEAISDRTCIASNGAVNTLLSSED 139

Query:   348 LVDCD---MSNG-GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESERGC 393
             L+ C     S G GC GG    A ++ + +G +V+  +Y     E++ GC
Sbjct:   140 LLSCCTGMFSCGNGCEGGYPIQAWKWWVKHG-LVTGGSY-----ETQFGC 183

 Score = 81 (33.6 bits), Expect = 4.0e-06, Sum P(2) = 4.0e-06
 Identities = 23/71 (32%), Positives = 32/71 (45%)

Query:   417 EEMKKWVATRGPLSVGMNANGLFY-YSGGVI--DLNQRLYGTSI-----------PYWIV 462
             E+++  + T GP+ V       FY Y+ GV        L G ++           PYW+V
Sbjct:   245 EQIQTEILTNGPIEVAFTVYEDFYQYTTGVYVHTAGASLGGHAVKILGWGVDNGTPYWLV 304

Query:   463 KNSWGSDWGEK 473
              NSW   WGEK
Sbjct:   305 ANSWNVAWGEK 315

 Score = 74 (31.1 bits), Expect = 1.3e-06, Sum P(2) = 1.3e-06
 Identities = 16/60 (26%), Positives = 26/60 (43%)

Query:   146 DDLPEAFDWRAEGV----ISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQHHD 201
             D +P+ FD R +      I+ +++Q  C  CWAF+A   +     I  N      +   D
Sbjct:    80 DAIPDHFDARDQWPNCMSINNIRDQSDCGSCWAFAAAEAISDRTCIASNGAVNTLLSSED 139


>UNIPROTKB|F1N8G6 [details] [associations]
            symbol:F1N8G6 "Uncharacterized protein" species:9031
            "Gallus gallus" [GO:0005044 "scavenger receptor activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0006955
            "immune response" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0030247 "polysaccharide
            binding" evidence=IEA] InterPro:IPR000668 InterPro:IPR001212
            InterPro:IPR013128 Pfam:PF00112 PROSITE:PS00524 PROSITE:PS50958
            GO:GO:0006955 GO:GO:0030247 GO:GO:0008234 GO:GO:0006508
            PANTHER:PTHR12411 GO:GO:0005044 GeneTree:ENSGT00560000076599
            EMBL:AADN02078667 EMBL:AADN02078668 IPI:IPI00600996
            Ensembl:ENSGALT00000028420 Uniprot:F1N8G6
        Length = 329

 Score = 167 (63.8 bits), Expect = 1.8e-09, P = 1.8e-09
 Identities = 39/101 (38%), Positives = 58/101 (57%)

Query:   296 LPEAFDWRAE--GVISKVKEQGKCACCWAFSAVGVVE---AMHAIQGNSLTELSVQQLVD 350
             LP  FD   +  G+I +  +QG CA  WAFS   V     ++H++ G+    LS Q L+ 
Sbjct:   187 LPRHFDAATKWPGMIHEPLDQGNCAGSWAFSTAAVASDRISIHSM-GHMTPSLSPQNLLS 245

Query:   351 CDMSNG-GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESE 390
             CD  N  GC+GGR+D A  Y+    GVV+D+ YP+ + +S+
Sbjct:   246 CDTRNQRGCSGGRLDGAWWYL-RRRGVVTDECYPFTSQDSQ 285

 Score = 103 (41.3 bits), Expect = 3.6e-05, Sum P(2) = 3.6e-05
 Identities = 20/49 (40%), Positives = 32/49 (65%)

Query:   501 LATEKLVDCDMSNG-GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESE 548
             L+ + L+ CD  N  GC+GGR+D A  Y+    GVV+D+ YP+ + +S+
Sbjct:   238 LSPQNLLSCDTRNQRGCSGGRLDGAWWYL-RRRGVVTDECYPFTSQDSQ 285

 Score = 70 (29.7 bits), Expect = 3.6e-05, Sum P(2) = 3.6e-05
 Identities = 20/57 (35%), Positives = 30/57 (52%)

Query:   148 LPEAFDWRAE--GVISKVKEQGKCACCWAFSAVGVVE---AMHAIQGNNLTELSVQH 199
             LP  FD   +  G+I +  +QG CA  WAFS   V     ++H++ G+    LS Q+
Sbjct:   187 LPRHFDAATKWPGMIHEPLDQGNCAGSWAFSTAAVASDRISIHSM-GHMTPSLSPQN 242


>UNIPROTKB|F1NMW1 [details] [associations]
            symbol:F1NMW1 "Uncharacterized protein" species:9031
            "Gallus gallus" [GO:0005044 "scavenger receptor activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0006955
            "immune response" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0030247 "polysaccharide
            binding" evidence=IEA] [GO:0005737 "cytoplasm" evidence=IEA]
            [GO:0031012 "extracellular matrix" evidence=IEA] [GO:0043236
            "laminin binding" evidence=IEA] InterPro:IPR000668
            InterPro:IPR001212 InterPro:IPR013128 Pfam:PF00112 PROSITE:PS00524
            PROSITE:PS50958 GO:GO:0005737 GO:GO:0006955 GO:GO:0030247
            GO:GO:0008234 GO:GO:0006508 PANTHER:PTHR12411 GO:GO:0031012
            GO:GO:0005044 GeneTree:ENSGT00560000076599 EMBL:AADN02078667
            EMBL:AADN02078668 IPI:IPI00819331 Ensembl:ENSGALT00000040767
            OMA:QVHANDI Uniprot:F1NMW1
        Length = 340

 Score = 167 (63.8 bits), Expect = 2.0e-09, P = 2.0e-09
 Identities = 39/101 (38%), Positives = 58/101 (57%)

Query:   296 LPEAFDWRAE--GVISKVKEQGKCACCWAFSAVGVVE---AMHAIQGNSLTELSVQQLVD 350
             LP  FD   +  G+I +  +QG CA  WAFS   V     ++H++ G+    LS Q L+ 
Sbjct:   199 LPRHFDAATKWPGMIHEPLDQGNCAGSWAFSTAAVASDRISIHSM-GHMTPSLSPQNLLS 257

Query:   351 CDMSNG-GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESE 390
             CD  N  GC+GGR+D A  Y+    GVV+D+ YP+ + +S+
Sbjct:   258 CDTRNQRGCSGGRLDGAWWYL-RRRGVVTDECYPFTSQDSQ 297

 Score = 103 (41.3 bits), Expect = 4.0e-05, Sum P(2) = 4.0e-05
 Identities = 20/49 (40%), Positives = 32/49 (65%)

Query:   501 LATEKLVDCDMSNG-GCNGGRMDDALQYIIDNGGVVSDQAYPYKASESE 548
             L+ + L+ CD  N  GC+GGR+D A  Y+    GVV+D+ YP+ + +S+
Sbjct:   250 LSPQNLLSCDTRNQRGCSGGRLDGAWWYL-RRRGVVTDECYPFTSQDSQ 297

 Score = 70 (29.7 bits), Expect = 4.0e-05, Sum P(2) = 4.0e-05
 Identities = 20/57 (35%), Positives = 30/57 (52%)

Query:   148 LPEAFDWRAE--GVISKVKEQGKCACCWAFSAVGVVE---AMHAIQGNNLTELSVQH 199
             LP  FD   +  G+I +  +QG CA  WAFS   V     ++H++ G+    LS Q+
Sbjct:   199 LPRHFDAATKWPGMIHEPLDQGNCAGSWAFSTAAVASDRISIHSM-GHMTPSLSPQN 254


>GENEDB_PFALCIPARUM|PFB0355c [details] [associations]
            symbol:PFB0355c "cysteine protease, putative"
            species:5833 "Plasmodium falciparum" [GO:0005575
            "cellular_component" evidence=ND] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 SMART:SM00645 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411 PROSITE:PS00640
            EMBL:AE001362 GenomeReviews:AE001362_GR HOGENOM:HOG000284262
            ProtClustDB:CLSZ2446607 RefSeq:XP_001349589.2
            ProteinModelPortal:O96166 EnsemblProtists:PFB0355c:mRNA
            GeneID:812671 KEGG:pfa:PFB0355c EuPathDB:PlasmoDB:PF3D7_0207900
            Uniprot:O96166
        Length = 1105

 Score = 124 (48.7 bits), Expect = 2.6e-09, Sum P(2) = 2.6e-09
 Identities = 22/44 (50%), Positives = 26/44 (59%)

Query:   608 QRLCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGE 651
             Q LC  K  +HA+ IVGYG      G    YWIV+NSWG  WG+
Sbjct:   758 QNLCGDKKPDHAVNIVGYGNYINNKGEKKSYWIVRNSWGKYWGD 801

 Score = 102 (41.0 bits), Expect = 2.6e-09, Sum P(2) = 2.6e-09
 Identities = 48/209 (22%), Positives = 95/209 (45%)

Query:   190 NNLTELSVQHHDKVYSSVEDLLRRHENFVTNVEKAEDYQSEDSGTAV----FGVNKFFDL 245
             NNLT+L ++ H +  + V  L  + +N V  ++ A D+    +G  +    + +NKF   
Sbjct:   477 NNLTKL-LEEHKEENNYV--LYHKMKNEVLCLKNANDWMKNKTGLVLPQLKYSLNKFNKN 533

Query:   246 SESDLQQLTGLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQF-NSLRHGDDLPEAFDWRA 304
              E+ +++    N+    E+ +  +    +    DT   ++ + +SL    +         
Sbjct:   534 KENYIKE----NI---FEEDENGI-VDLTKFPVDTSYSSYNYADSLYCNREYCNRLKDH- 584

Query:   305 EGVISK--VKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDC--DMSNGGCNG 360
                ISK  V++Q  CA  WAF+++  +E +  ++G      SV  + +C  + +N  C  
Sbjct:   585 NNCISKINVEDQKNCALSWAFASIYHLETIKCMKGYEPLNASVLYVTNCLKNKNNDVCTE 644

Query:   361 GRMDDA-LQYIIDNGGVVSDQAYPYKASE 388
             G      L+ I + G + ++  YPY  S+
Sbjct:   645 GSNPLVFLETIEEKGFLPTESNYPYDQSK 673

 Score = 75 (31.5 bits), Expect = 0.00032, Sum P(2) = 0.00032
 Identities = 15/32 (46%), Positives = 17/32 (53%)

Query:   454 GTSIPYWIVKNSWGSDWGEKVEDKVGSSGNRT 485
             G    YWIV+NSWG  WG+    KV   G  T
Sbjct:   783 GEKKSYWIVRNSWGKYWGDDGYFKVDMYGPPT 814

 Score = 58 (25.5 bits), Expect = 8.3e-05, Sum P(2) = 8.3e-05
 Identities = 14/40 (35%), Positives = 23/40 (57%)

Query:   160 ISK--VKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSV 197
             ISK  V++Q  CA  WAF+++  +E +  ++G      SV
Sbjct:   588 ISKINVEDQKNCALSWAFASIYHLETIKCMKGYEPLNASV 627


>UNIPROTKB|O96166 [details] [associations]
            symbol:SERA-2 "Serine repeat antigen 2 (SERA-2)"
            species:36329 "Plasmodium falciparum 3D7" [GO:0005575
            "cellular_component" evidence=ND] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 SMART:SM00645 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411 PROSITE:PS00640
            EMBL:AE001362 GenomeReviews:AE001362_GR HOGENOM:HOG000284262
            ProtClustDB:CLSZ2446607 RefSeq:XP_001349589.2
            ProteinModelPortal:O96166 EnsemblProtists:PFB0355c:mRNA
            GeneID:812671 KEGG:pfa:PFB0355c EuPathDB:PlasmoDB:PF3D7_0207900
            Uniprot:O96166
        Length = 1105

 Score = 124 (48.7 bits), Expect = 2.6e-09, Sum P(2) = 2.6e-09
 Identities = 22/44 (50%), Positives = 26/44 (59%)

Query:   608 QRLCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGE 651
             Q LC  K  +HA+ IVGYG      G    YWIV+NSWG  WG+
Sbjct:   758 QNLCGDKKPDHAVNIVGYGNYINNKGEKKSYWIVRNSWGKYWGD 801

 Score = 102 (41.0 bits), Expect = 2.6e-09, Sum P(2) = 2.6e-09
 Identities = 48/209 (22%), Positives = 95/209 (45%)

Query:   190 NNLTELSVQHHDKVYSSVEDLLRRHENFVTNVEKAEDYQSEDSGTAV----FGVNKFFDL 245
             NNLT+L ++ H +  + V  L  + +N V  ++ A D+    +G  +    + +NKF   
Sbjct:   477 NNLTKL-LEEHKEENNYV--LYHKMKNEVLCLKNANDWMKNKTGLVLPQLKYSLNKFNKN 533

Query:   246 SESDLQQLTGLNLDSTLEDIQPSLQAPFSSNQTDTEMRAFQF-NSLRHGDDLPEAFDWRA 304
              E+ +++    N+    E+ +  +    +    DT   ++ + +SL    +         
Sbjct:   534 KENYIKE----NI---FEEDENGI-VDLTKFPVDTSYSSYNYADSLYCNREYCNRLKDH- 584

Query:   305 EGVISK--VKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQLVDC--DMSNGGCNG 360
                ISK  V++Q  CA  WAF+++  +E +  ++G      SV  + +C  + +N  C  
Sbjct:   585 NNCISKINVEDQKNCALSWAFASIYHLETIKCMKGYEPLNASVLYVTNCLKNKNNDVCTE 644

Query:   361 GRMDDA-LQYIIDNGGVVSDQAYPYKASE 388
             G      L+ I + G + ++  YPY  S+
Sbjct:   645 GSNPLVFLETIEEKGFLPTESNYPYDQSK 673

 Score = 75 (31.5 bits), Expect = 0.00032, Sum P(2) = 0.00032
 Identities = 15/32 (46%), Positives = 17/32 (53%)

Query:   454 GTSIPYWIVKNSWGSDWGEKVEDKVGSSGNRT 485
             G    YWIV+NSWG  WG+    KV   G  T
Sbjct:   783 GEKKSYWIVRNSWGKYWGDDGYFKVDMYGPPT 814

 Score = 58 (25.5 bits), Expect = 8.3e-05, Sum P(2) = 8.3e-05
 Identities = 14/40 (35%), Positives = 23/40 (57%)

Query:   160 ISK--VKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSV 197
             ISK  V++Q  CA  WAF+++  +E +  ++G      SV
Sbjct:   588 ISKINVEDQKNCALSWAFASIYHLETIKCMKGYEPLNASV 627


>WB|WBGene00000783 [details] [associations]
            symbol:cpr-3 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0009792 "embryo development
            ending in birth or egg hatching" evidence=IMP] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0009792 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00560000076599 HOGENOM:HOG000241341 KO:K01363
            PANTHER:PTHR12411:SF16 EMBL:L39890 EMBL:L39925 EMBL:Z81119
            EMBL:Z82057 PIR:T37282 RefSeq:NP_506790.1 UniGene:Cel.23503
            ProteinModelPortal:P43507 SMR:P43507 MEROPS:C01.A33
            EnsemblMetazoa:T10H4.12 GeneID:180033 KEGG:cel:CELE_T10H4.12
            UCSC:T10H4.12 CTD:180033 WormBase:T10H4.12 eggNOG:NOG240190
            InParanoid:P43507 OMA:PVEASYK NextBio:907824 Uniprot:P43507
        Length = 370

 Score = 122 (48.0 bits), Expect = 2.9e-09, Sum P(2) = 2.9e-09
 Identities = 32/113 (28%), Positives = 49/113 (43%)

Query:   294 DDLPEAFD----WRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTE--LSVQQ 347
             + LP+ FD    W     I  ++ Q  C  CWAF A  V+     IQ N   +  +SV+ 
Sbjct:    90 EPLPDTFDAREKWPDCNTIKLIRNQATCGSCWAFGAAEVISDRVCIQSNGTQQPVISVED 149

Query:   348 LVDC-DMSNG-GCNGGRMDDALQYIIDNGGVVSDQ-----AYPYKASESERGC 393
             ++ C   + G GC GG   +AL++   +G V           PY  +   + C
Sbjct:   150 ILSCCGTTCGYGCKGGYSIEALRFWASSGAVTGGDYGGHGCMPYSFAPCTKNC 202

 Score = 91 (37.1 bits), Expect = 2.9e-09, Sum P(2) = 2.9e-09
 Identities = 23/69 (33%), Positives = 31/69 (44%)

Query:   585 GPLSVGMNANGLFY-YSGGVIDLNQRLCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKN 643
             GP+         FY Y  GV              HA+ I+G+G E   D     YW++ N
Sbjct:   253 GPVEASYKVYEDFYHYKSGVYHYTS---GKLVGGHAVKIIGWGVENGVD-----YWLIAN 304

Query:   644 SWGSDWGEK 652
             SWG+ +GEK
Sbjct:   305 SWGTSFGEK 313

 Score = 82 (33.9 bits), Expect = 9.8e-06, Sum P(3) = 9.8e-06
 Identities = 19/59 (32%), Positives = 27/59 (45%)

Query:   146 DDLPEAFD----WRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTE--LSVQ 198
             + LP+ FD    W     I  ++ Q  C  CWAF A  V+     IQ N   +  +SV+
Sbjct:    90 EPLPDTFDAREKWPDCNTIKLIRNQATCGSCWAFGAAEVISDRVCIQSNGTQQPVISVE 148

 Score = 79 (32.9 bits), Expect = 5.0e-08, Sum P(2) = 5.0e-08
 Identities = 18/74 (24%), Positives = 40/74 (54%)

Query:   436 NGLFYYSGGVIDLNQ--RLYG----TSIPYWIVKNSWGSDWGEKVEDKVGSSGNRTRDLE 489
             +G+++Y+ G +      ++ G      + YW++ NSWG+ +GEK   K+      T + +
Sbjct:   270 SGVYHYTSGKLVGGHAVKIIGWGVENGVDYWLIANSWGTSFGEKGFFKIRRG---TNECQ 326

Query:   490 LTGVLPSKLSRLAT 503
             + G + + +++L T
Sbjct:   327 IEGNVVAGIAKLGT 340

 Score = 47 (21.6 bits), Expect = 9.8e-06, Sum P(3) = 9.8e-06
 Identities = 14/58 (24%), Positives = 25/58 (43%)

Query:   501 LATEKLVDC-DMSNG-GCNGGRMDDALQYIIDNGGVVSDQ-----AYPYKASESERGC 551
             ++ E ++ C   + G GC GG   +AL++   +G V           PY  +   + C
Sbjct:   145 ISVEDILSCCGTTCGYGCKGGYSIEALRFWASSGAVTGGDYGGHGCMPYSFAPCTKNC 202


>FB|FBgn0033873 [details] [associations]
            symbol:CG6337 species:7227 "Drosophila melanogaster"
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0006508 "proteolysis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 SMART:SM00645 EMBL:AE013599
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 HSSP:P80067 EMBL:AY084123
            RefSeq:NP_610905.1 UniGene:Dm.5230 SMR:Q7JYA0 IntAct:Q7JYA0
            EnsemblMetazoa:FBtr0087646 GeneID:36530 KEGG:dme:Dmel_CG6337
            UCSC:CG6337-RA FlyBase:FBgn0033873 eggNOG:NOG310593
            InParanoid:Q7JYA0 OMA:NRTTYRE OrthoDB:EOG4MCVFZ GenomeRNAi:36530
            NextBio:799041 Uniprot:Q7JYA0
        Length = 340

 Score = 140 (54.3 bits), Expect = 6.0e-09, Sum P(2) = 6.0e-09
 Identities = 45/165 (27%), Positives = 73/165 (44%)

Query:   494 LPSKLSRLATEKLVDCDMSNGGCNGGRMDDALQYI--IDNGGVVSDQAYPYKASESERG- 550
             LPS LS    ++L+DC     GC+      AL Y+  + +  +  +  YP   S    G 
Sbjct:   159 LPSSLS---AQQLLDCAGMGTGCSTQTPLAALNYLTQLTDAYLYPEVDYPNNNSLKTPGM 215

Query:   551 CLXXXXXXXXXXXXXYSRIPYGEEEEMKKWVATRGPLSVGMNAN--GLFYYSGGV-IDLN 607
             C              YS +   ++  + ++V+   P+ V  N    G   YS GV +   
Sbjct:   216 CQPPSSVSVGVKLAGYSTVADNDDAAVMRYVSNGFPVIVEYNPATFGFMQYSSGVYVQET 275

Query:   608 QRLCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
             + L NPK+    L++VGY  +     +++ YW   NS+G  WGE+
Sbjct:   276 RALTNPKSSQF-LVVVGYDHDVD---SNLDYWRCLNSFGDTWGEE 316

 Score = 131 (51.2 bits), Expect = 4.1e-07, Sum P(2) = 4.1e-07
 Identities = 42/148 (28%), Positives = 66/148 (44%)

Query:   306 GVISKVKEQG-KCACCWAFSAVGVVEAMHAIQ-GNSL-TELSVQQLVDCDMSNGGCNGGR 362
             G+   V++QG  C+  WA++    VE M+A+Q  N L + LS QQL+DC     GC+   
Sbjct:   123 GLTVAVEDQGVNCSSSWAYATAKAVEIMNAVQTANPLPSSLSAQQLLDCAGMGTGCSTQT 182

Query:   363 MDDALQYI--IDNGGVVSDQAYPYKASESERG-CLXXXXXXXXXXXXXYSRIPYGEEEEM 419
                AL Y+  + +  +  +  YP   S    G C              YS +   ++  +
Sbjct:   183 PLAALNYLTQLTDAYLYPEVDYPNNNSLKTPGMCQPPSSVSVGVKLAGYSTVADNDDAAV 242

Query:   420 KKWVATRGPLSVGMNAN--GLFYYSGGV 445
              ++V+   P+ V  N    G   YS GV
Sbjct:   243 MRYVSNGFPVIVEYNPATFGFMQYSSGV 270

 Score = 67 (28.6 bits), Expect = 6.0e-09, Sum P(2) = 6.0e-09
 Identities = 17/44 (38%), Positives = 26/44 (59%)

Query:   158 GVISKVKEQG-KCACCWAFSAVGVVEAMHAIQ-GNNL-TELSVQ 198
             G+   V++QG  C+  WA++    VE M+A+Q  N L + LS Q
Sbjct:   123 GLTVAVEDQGVNCSSSWAYATAKAVEIMNAVQTANPLPSSLSAQ 166

 Score = 59 (25.8 bits), Expect = 4.1e-07, Sum P(2) = 4.1e-07
 Identities = 13/45 (28%), Positives = 23/45 (51%)

Query:   455 TSIPYWIVKNSWGSDWGEKVEDKVGSSGNRTRDLELTGVLPSKLS 499
             +++ YW   NS+G  WGE+   ++    N+   +    V PS L+
Sbjct:   298 SNLDYWRCLNSFGDTWGEEGYIRIVRRSNQP--IAKNAVFPSALA 340


>UNIPROTKB|P05689 [details] [associations]
            symbol:CTSZ "Cathepsin Z" species:9913 "Bos taurus"
            [GO:0005764 "lysosome" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 EMBL:BC122603
            EMBL:X01809 IPI:IPI00708474 PIR:A29172 RefSeq:NP_001071303.1
            UniGene:Bt.4902 ProteinModelPortal:P05689 SMR:P05689 MEROPS:C01.013
            PRIDE:P05689 GeneID:404187 KEGG:bta:404187 CTD:1522
            HOGENOM:HOG000264454 HOVERGEN:HBG004456 InParanoid:P05689 KO:K08568
            OrthoDB:EOG42Z4QN BRENDA:3.4.18.1 NextBio:20817615 Uniprot:P05689
        Length = 304

 Score = 141 (54.7 bits), Expect = 6.9e-09, Sum P(2) = 6.9e-09
 Identities = 50/177 (28%), Positives = 79/177 (44%)

Query:   487 DLELTGVLPSKLSRLATEKLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASE 546
             +++  G  PS L  L+ + ++DC  + G C GG      +Y     G+  +    Y+A +
Sbjct:   107 NIKRKGAWPSTL--LSVQHVIDCGDA-GSCEGGNDLPVWEYA-HRHGIPDETCNNYQAKD 162

Query:   547 SE----RGCLXXXXXXXXXXXXXYS--RI-PYGE---EEEMKKWVATRGPLSVGMNANG- 595
              E      C              Y+  ++  YG     E+M   + T GP+S G+ A   
Sbjct:   163 QECDKFNQCGTCTEFKECHVIKNYTLWKVGDYGSLSGREKMMAEIYTNGPISCGIMATEK 222

Query:   596 LFYYSGGVIDLNQRLCNPKAQ-NHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGE 651
             +  Y+GG+        N +A  NH + + G+G     DG  + YWIV+NSWG  WGE
Sbjct:   223 MSNYTGGIYSEY----NDQAFINHIVSVAGWGVS---DG--MEYWIVRNSWGEPWGE 270

 Score = 98 (39.6 bits), Expect = 1.7e-07, Sum P(2) = 1.7e-07
 Identities = 31/105 (29%), Positives = 46/105 (43%)

Query:   295 DLPEAFDWR-AEGV--ISKVKEQG---KCACCWAFSAVGVVEAMHAIQGNSL---TELSV 345
             DLP+++DWR   GV   S  + Q     C  CWA  +   +     I+       T LSV
Sbjct:    62 DLPKSWDWRNVNGVNYASVTRNQHIPQYCGSCWAHGSTSAMADRINIKRKGAWPSTLLSV 121

Query:   346 QQLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESE 390
             Q ++DC  + G C GG      +Y     G+  +    Y+A + E
Sbjct:   122 QHVIDCGDA-GSCEGGNDLPVWEYA-HRHGIPDETCNNYQAKDQE 164

 Score = 97 (39.2 bits), Expect = 1.7e-07, Sum P(2) = 1.7e-07
 Identities = 25/70 (35%), Positives = 37/70 (52%)

Query:   417 EEMKKWVATRGPLSVGMNANG-LFYYSGGVI-DLNQRLY----------GTS--IPYWIV 462
             E+M   + T GP+S G+ A   +  Y+GG+  + N + +          G S  + YWIV
Sbjct:   201 EKMMAEIYTNGPISCGIMATEKMSNYTGGIYSEYNDQAFINHIVSVAGWGVSDGMEYWIV 260

Query:   463 KNSWGSDWGE 472
             +NSWG  WGE
Sbjct:   261 RNSWGEPWGE 270

 Score = 63 (27.2 bits), Expect = 6.9e-09, Sum P(2) = 6.9e-09
 Identities = 21/62 (33%), Positives = 28/62 (45%)

Query:   147 DLPEAFDWR-AEGV--ISKVKEQG---KCACCWAFSAVGVVEAMHAIQGNNL---TELSV 197
             DLP+++DWR   GV   S  + Q     C  CWA  +   +     I+       T LSV
Sbjct:    62 DLPKSWDWRNVNGVNYASVTRNQHIPQYCGSCWAHGSTSAMADRINIKRKGAWPSTLLSV 121

Query:   198 QH 199
             QH
Sbjct:   122 QH 123


>UNIPROTKB|E1C4M3 [details] [associations]
            symbol:CTSZ "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0005615
            "extracellular space" evidence=IEA] [GO:0005783 "endoplasmic
            reticulum" evidence=IEA] [GO:0060441 "epithelial tube branching
            involved in lung morphogenesis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0005783 GO:GO:0005615 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 PANTHER:PTHR12411 PROSITE:PS00640
            GeneTree:ENSGT00560000076599 CTD:1522 KO:K08568 OMA:QCGTCTE
            EMBL:AADN02019004 IPI:IPI00596430 RefSeq:XP_417483.3
            Ensembl:ENSGALT00000012067 GeneID:419311 KEGG:gga:419311
            Uniprot:E1C4M3
        Length = 305

 Score = 149 (57.5 bits), Expect = 7.1e-09, Sum P(2) = 7.1e-09
 Identities = 50/178 (28%), Positives = 81/178 (45%)

Query:   487 DLELTGVLPSKLSRLATEKLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASE 546
             +++  G  PS  + L+ + ++DC  + G C GG       Y  D+G +  +    Y+A  
Sbjct:   107 NIKRKGAWPS--AYLSVQNVIDC-ANAGSCEGGDHTGVWMYAHDHG-IPDETCNNYQAKN 162

Query:   547 ------SERG-CLXXXXXXXXXXXXXYSRIPYGE---EEEMKKWVATRGPLSVGMNANG- 595
                   ++ G C+             +    YG     E+M   +   GP+S G+ A   
Sbjct:   163 QKCKKFNQCGTCVTFGECHVIKNYTLWKVADYGAVSGREKMMAEIYANGPISCGIMATEK 222

Query:   596 LFYYSGGVI-DLNQRLCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
             L  Y+GG+  + N    +P   NH + + G+G E   +GT   YWIV+NSWG  WGE+
Sbjct:   223 LDAYTGGLYTEYNP---SPTV-NHIVSVAGWGVE---NGTE--YWIVRNSWGEPWGER 271

 Score = 94 (38.1 bits), Expect = 1.3e-05, Sum P(2) = 1.3e-05
 Identities = 29/105 (27%), Positives = 45/105 (42%)

Query:   295 DLPEAFDWR-AEGV--ISKVKEQG---KCACCWAFSAVGVVEAMHAIQGNSL---TELSV 345
             +LP+++DWR   GV   S  + Q     C  CWA  +   +     I+         LSV
Sbjct:    62 ELPQSWDWRNVNGVNYASTTRNQHIPQYCGSCWAHGSTSALADRINIKRKGAWPSAYLSV 121

Query:   346 QQLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASESE 390
             Q ++DC  + G C GG       Y  D+G +  +    Y+A   +
Sbjct:   122 QNVIDC-ANAGSCEGGDHTGVWMYAHDHG-IPDETCNNYQAKNQK 164

 Score = 83 (34.3 bits), Expect = 1.3e-05, Sum P(2) = 1.3e-05
 Identities = 24/71 (33%), Positives = 35/71 (49%)

Query:   417 EEMKKWVATRGPLSVGMNANG-LFYYSGGVID-------LNQRL----YGTS--IPYWIV 462
             E+M   +   GP+S G+ A   L  Y+GG+         +N  +    +G      YWIV
Sbjct:   201 EKMMAEIYANGPISCGIMATEKLDAYTGGLYTEYNPSPTVNHIVSVAGWGVENGTEYWIV 260

Query:   463 KNSWGSDWGEK 473
             +NSWG  WGE+
Sbjct:   261 RNSWGEPWGER 271

 Score = 54 (24.1 bits), Expect = 7.1e-09, Sum P(2) = 7.1e-09
 Identities = 13/41 (31%), Positives = 20/41 (48%)

Query:   147 DLPEAFDWR-AEGV--ISKVKEQG---KCACCWAFSAVGVV 181
             +LP+++DWR   GV   S  + Q     C  CWA  +   +
Sbjct:    62 ELPQSWDWRNVNGVNYASTTRNQHIPQYCGSCWAHGSTSAL 102


>ZFIN|ZDB-GENE-041010-139 [details] [associations]
            symbol:ctsz "cathepsin Z" species:7955 "Danio
            rerio" [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0001525 "angiogenesis"
            evidence=IMP] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            SMART:SM00645 ZFIN:ZDB-GENE-041010-139 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 PANTHER:PTHR12411 PROSITE:PS00640 GO:GO:0001525
            CTD:1522 HOGENOM:HOG000264454 HOVERGEN:HBG004456 KO:K08568
            OrthoDB:EOG42Z4QN UniGene:Dr.935 eggNOG:NOG275763 EMBL:BC083369
            IPI:IPI00483065 RefSeq:NP_001006043.1 ProteinModelPortal:Q5XJD4
            SMR:Q5XJD4 STRING:Q5XJD4 GeneID:450022 KEGG:dre:450022
            InParanoid:Q5XJD4 NextBio:20833005 ArrayExpress:Q5XJD4
            Uniprot:Q5XJD4
        Length = 301

 Score = 147 (56.8 bits), Expect = 8.7e-09, Sum P(2) = 8.7e-09
 Identities = 43/163 (26%), Positives = 76/163 (46%)

Query:   501 LATEKLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASE------SERG-CLX 553
             L+ + ++DC  + G C+GG      +Y   N G+  +    Y+A +      ++ G C  
Sbjct:   110 LSVQNVIDCGDA-GSCSGGDHSGVWEYA-HNKGIPDETCNNYQAKDQDCKPFNQCGTCTT 167

Query:   554 XXXXXXXXXXXXYSRIPYGEE---EEMKKWVATRGPLSVGMNANG-LFYYSGGVIDLNQR 609
                         +    YG     ++MK  + + GP+S G+ A   L  Y+GG+   ++ 
Sbjct:   168 FGVCNIVKNFTLWKVGDYGSASGLDKMKAEIYSGGPISCGIMATDKLDAYTGGLY--SEY 225

Query:   610 LCNPKAQNHALIIVGYGEEEKKDGTSIPYWIVKNSWGSDWGEK 652
             +  P   NH + + G+G +E      + +W+V+NSWG  WGEK
Sbjct:   226 VQEPYI-NHIVSVAGWGVDEN----GVEFWVVRNSWGEPWGEK 263

 Score = 115 (45.5 bits), Expect = 0.00099, P = 0.00099
 Identities = 48/195 (24%), Positives = 83/195 (42%)

Query:   295 DLPEAFDWR-AEGV--ISKVKEQG---KCACCWAFSAVGVVEAMHAIQGNSL---TELSV 345
             +LP+ +DWR  +GV  +S  + Q     C  CWA  +   +     I+  +      LSV
Sbjct:    53 ELPKEWDWRNIKGVNYVSTTRNQHIPQYCGSCWAHGSTSALADRINIKRKAAWPSAYLSV 112

Query:   346 QQLVDCDMSNGGCNGGRMDDALQYIIDNGGVVSDQAYPYKASE------SERG-CLXXXX 398
             Q ++DC  + G C+GG      +Y   N G+  +    Y+A +      ++ G C     
Sbjct:   113 QNVIDCGDA-GSCSGGDHSGVWEYA-HNKGIPDETCNNYQAKDQDCKPFNQCGTCTTFGV 170

Query:   399 XXXXXXXXXYSRIPYGEE---EEMKKWVATRGPLSVGMNANG-LFYYSGGVIDLNQRLYG 454
                      +    YG     ++MK  + + GP+S G+ A   L  Y+GG+   ++ +  
Sbjct:   171 CNIVKNFTLWKVGDYGSASGLDKMKAEIYSGGPISCGIMATDKLDAYTGGLY--SEYVQE 228

Query:   455 TSIPYWIVKNSWGSD 469
               I + +    WG D
Sbjct:   229 PYINHIVSVAGWGVD 243

 Score = 55 (24.4 bits), Expect = 8.7e-09, Sum P(2) = 8.7e-09
 Identities = 13/41 (31%), Positives = 21/41 (51%)

Query:   147 DLPEAFDWR-AEGV--ISKVKEQG---KCACCWAFSAVGVV 181
             +LP+ +DWR  +GV  +S  + Q     C  CWA  +   +
Sbjct:    53 ELPKEWDWRNIKGVNYVSTTRNQHIPQYCGSCWAHGSTSAL 93


>UNIPROTKB|H0YE42 [details] [associations]
            symbol:CTSF "Cathepsin F" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 InterPro:IPR000169 GO:GO:0008234
            GO:GO:0006508 PANTHER:PTHR12411 PROSITE:PS00139 EMBL:AP002748
            HGNC:HGNC:2531 ChiTaRS:CTSF Ensembl:ENST00000525733 Uniprot:H0YE42
        Length = 82

 Score = 141 (54.7 bits), Expect = 1.1e-08, P = 1.1e-08
 Identities = 26/56 (46%), Positives = 34/56 (60%)

Query:   293 GDDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNSLTELSVQQL 348
             GD  P  +DWR++G ++KVK+QG C  CWAFS  G VE    +   +L  LS Q L
Sbjct:    25 GDLAPPEWDWRSKGAVTKVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQAL 80

 Score = 137 (53.3 bits), Expect = 3.0e-08, P = 3.0e-08
 Identities = 25/54 (46%), Positives = 32/54 (59%)

Query:   145 GDDLPEAFDWRAEGVISKVKEQGKCACCWAFSAVGVVEAMHAIQGNNLTELSVQ 198
             GD  P  +DWR++G ++KVK+QG C  CWAFS  G VE    +    L  LS Q
Sbjct:    25 GDLAPPEWDWRSKGAVTKVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQ 78

WARNING:  HSPs involving 35 database sequences were not reported due to the
          limiting value of parameter B = 250.


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.315   0.133   0.402    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      655       629   0.00091  120 3  11 22  0.44    34
                                                     36  0.45    37


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  285
  No. of states in DFA:  616 (65 KB)
  Total size of DFA:  345 KB (2167 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:08
  No. of threads or processors used:  24
  Search cpu time:  53.52u 0.08s 53.60t   Elapsed:  00:00:29
  Total cpu time:  53.64u 0.09s 53.73t   Elapsed:  00:00:37
  Start:  Thu Aug 15 12:28:50 2013   End:  Thu Aug 15 12:29:27 2013
WARNINGS ISSUED:  2

Back to top