BLAST Results

Query Summary

Your job contains 1 sequence.

Parameters
Threshold: 0.001
Maximum number of alignments shown: 100
BLAST filter: on

Query Sequence

>018781
MAFFSHSKLLLLSLSLSLFACSSLAHDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYK
CIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQ
PSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSL
SEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVT
ISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGY
GKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLKK

High Scoring Gene Products

Symbol, full name Information P value
XCP1
xylem cysteine peptidase 1
protein from Arabidopsis thaliana 1.1e-149
XCP2
AT1G20850
protein from Arabidopsis thaliana 1.9e-140
RD21B
responsive to dehydration 21B
protein from Arabidopsis thaliana 6.4e-99
RD21A
responsive to dehydration 21A
protein from Arabidopsis thaliana 1.1e-96
CEP1
cysteine endopeptidase 1
protein from Arabidopsis thaliana 6.8e-95
AT3G19390 protein from Arabidopsis thaliana 3.5e-91
XBCP3
xylem bark cysteine peptidase 3
protein from Arabidopsis thaliana 3.2e-88
SAG12
senescence-associated gene 12
protein from Arabidopsis thaliana 2.6e-86
AT3G19400 protein from Arabidopsis thaliana 4.9e-85
AT4G23520 protein from Arabidopsis thaliana 9.1e-84
CEP3
cysteine endopeptidase 3
protein from Arabidopsis thaliana 2.4e-83
CP2
cysteine protease 2
protein from Arabidopsis thaliana 1.0e-82
CP1
cysteine protease 1
protein from Arabidopsis thaliana 2.2e-82
AT1G06260 protein from Arabidopsis thaliana 1.3e-77
AT2G27420 protein from Arabidopsis thaliana 1.1e-76
AT3G49340 protein from Arabidopsis thaliana 1.9e-76
AT1G29090 protein from Arabidopsis thaliana 4.6e-73
AT2G34080 protein from Arabidopsis thaliana 4.6e-73
AT3G43960 protein from Arabidopsis thaliana 2.1e-70
AT1G29080 protein from Arabidopsis thaliana 6.3e-69
cprC
cysteine proteinase 3
gene from Dictyostelium discoideum 5.1e-67
cprE
cysteine proteinase 5
gene from Dictyostelium discoideum 6.8e-67
Cp1
Cysteine proteinase-1
protein from Drosophila melanogaster 4.7e-64
CTSL2
Uncharacterized protein
protein from Gallus gallus 6.0e-64
ctsl.1
cathepsin L.1
gene_product from Danio rerio 3.3e-63
cprF
cysteine proteinase 6
gene from Dictyostelium discoideum 9.9e-62
ctssb.2
cathepsin S, b.2
gene_product from Danio rerio 1.0e-61
CTSH
Pro-cathepsin H
protein from Bos taurus 1.3e-61
ctsk
cathepsin K
gene_product from Danio rerio 3.4e-61
ctsl1a
cathepsin L, 1 a
gene_product from Danio rerio 3.4e-61
cprD
cysteine proteinase 4
gene from Dictyostelium discoideum 5.4e-61
cfaD
peptidase C1A family protein
gene from Dictyostelium discoideum 5.5e-61
AT1G29110 protein from Arabidopsis thaliana 5.5e-61
cprB
cysteine proteinase 2
gene from Dictyostelium discoideum 6.9e-61
CTSL1
Cathepsin L1
protein from Sus scrofa 1.2e-60
zgc:174855 gene_product from Danio rerio 1.2e-60
P83654
Ervatamin-C
protein from Tabernaemontana divaricata 2.4e-60
CTSL1
Cathepsin L1
protein from Bos taurus 3.1e-60
wu:fb37b09 gene_product from Danio rerio 3.1e-60
cprG
cysteine proteinase 7
gene from Dictyostelium discoideum 4.8e-60
CTSL1
Cathepsin L1
protein from Canis lupus familiaris 6.4e-60
Ctsl
cathepsin L
protein from Mus musculus 6.4e-60
Ctsh
cathepsin H
gene from Rattus norvegicus 8.1e-60
Ctsh
cathepsin H
protein from Mus musculus 1.0e-59
CTSK
Cathepsin K
protein from Homo sapiens 2.2e-59
CTSL2
Cathepsin L2
protein from Bos taurus 2.7e-59
Ssc.54235
Uncharacterized protein
protein from Sus scrofa 2.7e-59
CTSH
Uncharacterized protein
protein from Gorilla gorilla gorilla 2.7e-59
Ctsl1
cathepsin L1
gene from Rattus norvegicus 2.7e-59
CTSK
Cathepsin K
protein from Canis lupus familiaris 3.5e-59
CTSK
Cathepsin K
protein from Canis lupus familiaris 3.5e-59
CTSH
Pro-cathepsin H
protein from Sus scrofa 3.5e-59
Cat-1
Cathepsin L-like proteinase
protein from Fasciola hepatica 3.5e-59
Cys
Crustapain
protein from Pandalus borealis 3.5e-59
CTSL1
Cathepsin L1
protein from Homo sapiens 4.5e-59
CTSK
Cathepsin K
protein from Bos taurus 7.3e-59
CTSK
Cathepsin K
protein from Sus scrofa 9.3e-59
CTSH
Uncharacterized protein
protein from Macaca mulatta 9.3e-59
CTSH
Uncharacterized protein
protein from Nomascus leucogenys 9.3e-59
CTSH
Pro-cathepsin H
protein from Homo sapiens 1.5e-58
CTSL1
CTSL1 protein
protein from Bos taurus 3.2e-58
CTSH
Uncharacterized protein
protein from Ailuropoda melanoleuca 3.2e-58
CTSH
Uncharacterized protein
protein from Oryctolagus cuniculus 3.2e-58
Ctsk
cathepsin K
gene from Rattus norvegicus 3.2e-58
zgc:174153 gene_product from Danio rerio 4.0e-58
CTSS
Uncharacterized protein
protein from Sus scrofa 1.4e-57
CG12163 protein from Drosophila melanogaster 1.7e-57
RGD1308751
similar to Cathepsin L precursor (Major excreted protein) (MEP)
gene from Rattus norvegicus 1.7e-57
ctsh
cathepsin H
gene_product from Danio rerio 1.7e-57
CTSS
Cathepsin S
protein from Bos taurus 2.2e-57
CTSS
Cathepsin S
protein from Canis lupus familiaris 2.2e-57
CTSH
Uncharacterized protein
protein from Canis lupus familiaris 2.2e-57
CTSS
Cathepsin S
protein from Canis lupus familiaris 2.2e-57
CTSH
Uncharacterized protein
protein from Equus caballus 2.2e-57
ctsl1b
cathepsin L, 1 b
gene_product from Danio rerio 2.2e-57
DDB_G0272298 gene from Dictyostelium discoideum 4.6e-57
Ctsk
cathepsin K
protein from Mus musculus 4.6e-57
RD19
RESPONSIVE TO DEHYDRATION 19
protein from Arabidopsis thaliana 4.6e-57
ctsll
cathepsin L, like
gene_product from Danio rerio 5.9e-57
CTSH
Uncharacterized protein
protein from Callithrix jacchus 7.5e-57
CTSH
Uncharacterized protein
protein from Callithrix jacchus 7.5e-57
Ctsll3
cathepsin L-like 3
gene from Rattus norvegicus 9.6e-57
cpl-1 gene from Caenorhabditis elegans 9.6e-57
CTSL2
Cathepsin L2
protein from Homo sapiens 1.6e-56
LOC100662496
Uncharacterized protein
protein from Loxodonta africana 2.5e-56
DDB_G0291191
cysteine protease
gene from Dictyostelium discoideum 3.3e-56
Ctss
cathepsin S
protein from Mus musculus 4.2e-56
AT2G21430 protein from Arabidopsis thaliana 4.2e-56
cprA
cysteine proteinase 1
gene from Dictyostelium discoideum 1.1e-55
AT4G16190 protein from Arabidopsis thaliana 1.1e-55
tag-196 gene from Caenorhabditis elegans 1.1e-55
CTSS
Cathepsin S
protein from Homo sapiens 2.3e-55
cprH
cysteine proteinase 8
gene from Dictyostelium discoideum 3.7e-55
ALP
aleurain-like protease
protein from Arabidopsis thaliana 4.8e-55
CTSL2
Uncharacterized protein
protein from Gallus gallus 6.1e-55
CTSL1
Cathepsin L1
protein from Gallus gallus 6.1e-55
PF11_0162
falcipain-3
gene from Plasmodium falciparum 2.1e-54
CTSL1
Cathepsin L1
protein from Canis lupus familiaris 2.1e-54
PF11_0162
Falcipain-3
protein from Plasmodium falciparum 3D7 2.1e-54
LOC420160
Uncharacterized protein
protein from Gallus gallus 2.6e-54

Raw Blast Data

BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]

Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.

Reference:  Gish, W. (1996-2006) http://blast.wustl.edu

Query=  018781
        (350 letters)

Database:  go_20130330-seqdb.fasta
           368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done

                                                                     Smallest
                                                                       Sum
                                                              High  Probability
Sequences producing High-scoring Segment Pairs:              Score  P(N)      N

TAIR|locus:2122113 - symbol:XCP1 "xylem cysteine peptidas...  1461  1.1e-149  1
TAIR|locus:2030427 - symbol:XCP2 "xylem cysteine peptidas...  1374  1.9e-140  1
TAIR|locus:2167821 - symbol:RD21B "responsive to dehydrat...   982  6.4e-99   1
TAIR|locus:2825832 - symbol:RD21A "responsive to dehydrat...   961  1.1e-96   1
TAIR|locus:2157712 - symbol:CEP1 "cysteine endopeptidase ...   944  6.8e-95   1
TAIR|locus:2090614 - symbol:AT3G19390 species:3702 "Arabi...   909  3.5e-91   1
TAIR|locus:2024362 - symbol:XBCP3 "xylem bark cysteine pe...   881  3.2e-88   1
TAIR|locus:2152445 - symbol:SAG12 "senescence-associated ...   863  2.6e-86   1
TAIR|locus:2090629 - symbol:AT3G19400 species:3702 "Arabi...   851  4.9e-85   1
TAIR|locus:2117979 - symbol:AT4G23520 species:3702 "Arabi...   839  9.1e-84   1
TAIR|locus:505006391 - symbol:CEP3 "cysteine endopeptidas...   835  2.4e-83   1
TAIR|locus:2128253 - symbol:AT4G11320 species:3702 "Arabi...   829  1.0e-82   1
TAIR|locus:2128243 - symbol:AT4G11310 species:3702 "Arabi...   826  2.2e-82   1
TAIR|locus:2038515 - symbol:AT1G06260 species:3702 "Arabi...   781  1.3e-77   1
TAIR|locus:2038588 - symbol:AT2G27420 species:3702 "Arabi...   772  1.1e-76   1
TAIR|locus:2082881 - symbol:AT3G49340 species:3702 "Arabi...   770  1.9e-76   1
TAIR|locus:2029924 - symbol:AT1G29090 species:3702 "Arabi...   738  4.6e-73   1
TAIR|locus:2055440 - symbol:AT2G34080 species:3702 "Arabi...   738  4.6e-73   1
TAIR|locus:2097104 - symbol:AT3G43960 species:3702 "Arabi...   713  2.1e-70   1
TAIR|locus:2029934 - symbol:AT1G29080 species:3702 "Arabi...   699  6.3e-69   1
DICTYBASE|DDB_G0283867 - symbol:cprC "cysteine proteinase...   681  5.1e-67   1
DICTYBASE|DDB_G0272815 - symbol:cprE "cysteine proteinase...   585  6.8e-67   2
FB|FBgn0013770 - symbol:Cp1 "Cysteine proteinase-1" speci...   653  4.7e-64   1
UNIPROTKB|F1NYJ1 - symbol:CTSL2 "Uncharacterized protein"...   652  6.0e-64   1
ZFIN|ZDB-GENE-040718-61 - symbol:ctsl.1 "cathepsin L.1" s...   645  3.3e-63   1
DICTYBASE|DDB_G0279185 - symbol:cprF "cysteine proteinase...   545  9.9e-62   2
ZFIN|ZDB-GENE-050626-55 - symbol:ctssb.2 "cathepsin S, b....   631  1.0e-61   1
UNIPROTKB|Q3T0I2 - symbol:CTSH "Pro-cathepsin H" species:...   630  1.3e-61   1
ZFIN|ZDB-GENE-001205-4 - symbol:ctsk "cathepsin K" specie...   626  3.4e-61   1
ZFIN|ZDB-GENE-030131-106 - symbol:ctsl1a "cathepsin L, 1 ...   626  3.4e-61   1
DICTYBASE|DDB_G0278721 - symbol:cprD "cysteine proteinase...   540  5.4e-61   2
DICTYBASE|DDB_G0281605 - symbol:cfaD "peptidase C1A famil...   624  5.5e-61   1
TAIR|locus:2030027 - symbol:AT1G29110 species:3702 "Arabi...   624  5.5e-61   1
DICTYBASE|DDB_G0279799 - symbol:cprB "cysteine proteinase...   534  6.9e-61   2
UNIPROTKB|Q28944 - symbol:CTSL1 "Cathepsin L1" species:98...   621  1.2e-60   1
ZFIN|ZDB-GENE-071004-74 - symbol:zgc:174855 "zgc:174855" ...   621  1.2e-60   1
UNIPROTKB|P83654 - symbol:P83654 "Ervatamin-C" species:52...   618  2.4e-60   1
UNIPROTKB|P25975 - symbol:CTSL1 "Cathepsin L1" species:99...   617  3.1e-60   1
ZFIN|ZDB-GENE-030131-572 - symbol:wu:fb37b09 "wu:fb37b09"...   617  3.1e-60   1
DICTYBASE|DDB_G0279187 - symbol:cprG "cysteine proteinase...   531  4.8e-60   2
UNIPROTKB|Q9GL24 - symbol:CTSL1 "Cathepsin L1" species:96...   614  6.4e-60   1
MGI|MGI:88564 - symbol:Ctsl "cathepsin L" species:10090 "...   614  6.4e-60   1
RGD|2447 - symbol:Ctsh "cathepsin H" species:10116 "Rattu...   613  8.1e-60   1
MGI|MGI:107285 - symbol:Ctsh "cathepsin H" species:10090 ...   612  1.0e-59   1
UNIPROTKB|P43235 - symbol:CTSK "Cathepsin K" species:9606...   609  2.2e-59   1
UNIPROTKB|Q5E998 - symbol:CTSL2 "Cathepsin L2" species:99...   608  2.7e-59   1
UNIPROTKB|F1S4J6 - symbol:Ssc.54235 "Cathepsin L1" specie...   608  2.7e-59   1
UNIPROTKB|G3R9A7 - symbol:CTSH "Uncharacterized protein" ...   608  2.7e-59   1
RGD|2448 - symbol:Ctsl1 "cathepsin L1" species:10116 "Rat...   608  2.7e-59   1
UNIPROTKB|G1K2A7 - symbol:CTSK "Cathepsin K" species:9615...   607  3.5e-59   1
UNIPROTKB|Q3ZKN1 - symbol:CTSK "Cathepsin K" species:9615...   607  3.5e-59   1
UNIPROTKB|O46427 - symbol:CTSH "Pro-cathepsin H" species:...   607  3.5e-59   1
UNIPROTKB|Q24940 - symbol:Cat-1 "Cathepsin L-like protein...   607  3.5e-59   1
UNIPROTKB|Q86GF7 - symbol:Cys "Crustapain" species:6703 "...   607  3.5e-59   1
UNIPROTKB|P07711 - symbol:CTSL1 "Cathepsin L1" species:96...   606  4.5e-59   1
UNIPROTKB|Q5E968 - symbol:CTSK "Cathepsin K" species:9913...   604  7.3e-59   1
UNIPROTKB|Q9GLE3 - symbol:CTSK "Cathepsin K" species:9823...   603  9.3e-59   1
UNIPROTKB|F6R7P5 - symbol:CTSH "Uncharacterized protein" ...   603  9.3e-59   1
UNIPROTKB|G1RBY1 - symbol:CTSH "Uncharacterized protein" ...   603  9.3e-59   1
UNIPROTKB|P09668 - symbol:CTSH "Pro-cathepsin H" species:...   601  1.5e-58   1
UNIPROTKB|A4IFS7 - symbol:CTSL1 "CTSL1 protein" species:9...   598  3.2e-58   1
UNIPROTKB|G1M0X4 - symbol:CTSH "Uncharacterized protein" ...   598  3.2e-58   1
UNIPROTKB|G1SQF0 - symbol:CTSH "Uncharacterized protein" ...   598  3.2e-58   1
RGD|61810 - symbol:Ctsk "cathepsin K" species:10116 "Ratt...   598  3.2e-58   1
ZFIN|ZDB-GENE-080215-7 - symbol:zgc:174153 "zgc:174153" s...   597  4.0e-58   1
UNIPROTKB|F1SS93 - symbol:CTSS "Uncharacterized protein" ...   592  1.4e-57   1
FB|FBgn0260462 - symbol:CG12163 species:7227 "Drosophila ...   591  1.7e-57   1
RGD|1308751 - symbol:RGD1308751 "similar to Cathepsin L p...   591  1.7e-57   1
ZFIN|ZDB-GENE-030131-3539 - symbol:ctsh "cathepsin H" spe...   591  1.7e-57   1
UNIPROTKB|P25326 - symbol:CTSS "Cathepsin S" species:9913...   590  2.2e-57   1
UNIPROTKB|F1PAK0 - symbol:CTSS "Cathepsin S" species:9615...   590  2.2e-57   1
UNIPROTKB|F6X9C1 - symbol:CTSH "Uncharacterized protein" ...   590  2.2e-57   1
UNIPROTKB|Q8HY81 - symbol:CTSS "Cathepsin S" species:9615...   590  2.2e-57   1
UNIPROTKB|F7BJD8 - symbol:CTSH "Uncharacterized protein" ...   590  2.2e-57   1
ZFIN|ZDB-GENE-980526-285 - symbol:ctsl1b "cathepsin L, 1 ...   590  2.2e-57   1
DICTYBASE|DDB_G0272298 - symbol:DDB_G0272298 species:4468...   587  4.6e-57   1
MGI|MGI:107823 - symbol:Ctsk "cathepsin K" species:10090 ...   587  4.6e-57   1
TAIR|locus:2120222 - symbol:RD19 "RESPONSIVE TO DEHYDRATI...   587  4.6e-57   1
ZFIN|ZDB-GENE-041010-76 - symbol:ctsll "cathepsin L, like...   586  5.9e-57   1
UNIPROTKB|F7B939 - symbol:CTSH "Uncharacterized protein" ...   585  7.5e-57   1
UNIPROTKB|F7BRD4 - symbol:CTSH "Uncharacterized protein" ...   585  7.5e-57   1
RGD|1560071 - symbol:Ctsll3 "cathepsin L-like 3" species:...   584  9.6e-57   1
WB|WBGene00000776 - symbol:cpl-1 species:6239 "Caenorhabd...   584  9.6e-57   1
UNIPROTKB|O60911 - symbol:CTSL2 "Cathepsin L2" species:96...   582  1.6e-56   1
UNIPROTKB|G3SSC1 - symbol:CTSH "Uncharacterized protein" ...   580  2.5e-56   1
DICTYBASE|DDB_G0291191 - symbol:DDB_G0291191 "cysteine pr...   579  3.3e-56   1
MGI|MGI:107341 - symbol:Ctss "cathepsin S" species:10090 ...   578  4.2e-56   1
TAIR|locus:2050145 - symbol:AT2G21430 species:3702 "Arabi...   578  4.2e-56   1
DICTYBASE|DDB_G0290957 - symbol:cprA "cysteine proteinase...   574  1.1e-55   1
TAIR|locus:2130180 - symbol:AT4G16190 species:3702 "Arabi...   574  1.1e-55   1
WB|WBGene00007055 - symbol:tag-196 species:6239 "Caenorha...   574  1.1e-55   1
UNIPROTKB|P25774 - symbol:CTSS "Cathepsin S" species:9606...   571  2.3e-55   1
DICTYBASE|DDB_G0278401 - symbol:cprH "cysteine proteinase...   569  3.7e-55   1
TAIR|locus:2175088 - symbol:ALP "aleurain-like protease" ...   568  4.8e-55   1
UNIPROTKB|F1NEC8 - symbol:CTSL2 "Uncharacterized protein"...   567  6.1e-55   1
UNIPROTKB|P09648 - symbol:CTSL1 "Cathepsin L1" species:90...   567  6.1e-55   1
GENEDB_PFALCIPARUM|PF11_0162 - symbol:PF11_0162 "falcipai...   562  2.1e-54   1
UNIPROTKB|F1PMM9 - symbol:CTSL1 "Cathepsin L1" species:96...   562  2.1e-54   1
UNIPROTKB|Q8IIL0 - symbol:PF11_0162 "Falcipain-3" species...   562  2.1e-54   1
UNIPROTKB|F1NZ37 - symbol:LOC420160 "Uncharacterized prot...   561  2.6e-54   1

WARNING:  Descriptions of 186 database sequences were not reported due to the
          limiting value of parameter V = 100.
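
The rows of the table above share one layout: identifier, " - ", a truncated description, then the Score, P(N) and N columns. The following is a minimal sketch, not part of the BLAST output, of how such rows could be read into records and filtered against the 0.001 threshold listed under Parameters. The names HIT_ROW, parse_hit_row and below_threshold are hypothetical, and the regular expression assumes the column layout seen in this report only.

```python
import re

# Assumed row layout (taken from this report only):
#   <identifier> - <truncated description>  <score>  <P(N)>  <N>
HIT_ROW = re.compile(
    r"^(?P<id>\S+) - (?P<desc>.*?)\s+"
    r"(?P<score>\d+)\s+"
    r"(?P<p>\d+(?:\.\d+)?(?:e[-+]?\d+)?)\s+"
    r"(?P<n>\d+)\s*$"
)


def parse_hit_row(line):
    """Return a dict for one summary row, or None if the line is not a row."""
    match = HIT_ROW.match(line)
    if match is None:
        return None
    return {
        "id": match.group("id"),
        "description": match.group("desc"),
        "score": int(match.group("score")),
        "p_value": float(match.group("p")),
        "n": int(match.group("n")),
    }


def below_threshold(rows, threshold=0.001):
    """Keep only parsed rows whose P(N) is at or below the threshold."""
    return [row for row in rows if row["p_value"] <= threshold]


if __name__ == "__main__":
    sample = [
        'TAIR|locus:2122113 - symbol:XCP1 "xylem cysteine peptidas...  1461  1.1e-149  1',
        'DICTYBASE|DDB_G0272815 - symbol:cprE "cysteine proteinase...   585  6.8e-67   2',
    ]
    parsed = [parse_hit_row(line) for line in sample]
    for row in below_threshold(parsed):
        print(row["id"], row["score"], row["p_value"], row["n"])
```

Non-row lines such as the column header or the WARNING text do not match the pattern and come back as None, so they can be dropped before filtering.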


>TAIR|locus:2122113
            symbol:XCP1 "xylem cysteine peptidase 1" species:3702
            "Arabidopsis thaliana" [GO:0005576 "extracellular region"
            evidence=ISM] [GO:0006508 "proteolysis" evidence=IEA;ISS]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA;ISS]
            [GO:0000325 "plant-type vacuole" evidence=IDA] [GO:0005634
            "nucleus" evidence=IDA] [GO:0010623 "developmental programmed cell
            death" evidence=IMP] [GO:0010413 "glucuronoxylan metabolic process"
            evidence=RCA] [GO:0045492 "xylan biosynthetic process"
            evidence=RCA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 GO:GO:0005886
            GO:GO:0005634 EMBL:CP002687 GenomeReviews:CT486007_GR
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0000325
            EMBL:AL022604 EMBL:AL161587 GO:GO:0010623 MEROPS:I29.003
            HOGENOM:HOG000230773 EMBL:AF191027 EMBL:AK117394 EMBL:BT005179
            IPI:IPI00532220 PIR:T06122 RefSeq:NP_567983.1 UniGene:At.2280
            UniGene:At.67622 ProteinModelPortal:O65493 SMR:O65493 STRING:O65493
            PaxDb:O65493 PRIDE:O65493 EnsemblPlants:AT4G35350.1 GeneID:829688
            KEGG:ath:AT4G35350 GeneFarm:5033 TAIR:At4g35350 InParanoid:O65493
            KO:K16290 OMA:FEVFREN PhylomeDB:O65493 ProtClustDB:CLSN2689772
            Genevestigator:O65493 Uniprot:O65493
        Length = 355

 Score = 1461 (519.4 bits), Expect = 1.1e-149, P = 1.1e-149
 Identities = 258/324 (79%), Positives = 296/324 (91%)

Query:    27 DFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEV 86
             DFSIVGY+PEHLT+ DKL+ELFESWMS+H K YK +EEK+HRFE+F+ENL HIDQRN E+
Sbjct:    30 DFSIVGYTPEHLTNTDKLLELFESWMSEHSKAYKSVEEKVHRFEVFRENLMHIDQRNNEI 89

Query:    87 TSYWLGLNEFADMSHEEFKNKYLGL-KPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAV 145
              SYWLGLNEFAD++HEEFK +YLGL KPQF  +RQPSA F YRD+  LPKSVDWRKKGAV
Sbjct:    90 NSYWLGLNEFADLTHEEFKGRYLGLAKPQFSRKRQPSANFRYRDITDLPKSVDWRKKGAV 149

Query:   146 TPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAF 205
              PVK+QG CGSCWAFSTVAAVEGINQI +GNL+SLSEQELIDCDT+FN+GCNGGLMDYAF
Sbjct:   150 APVKDQGQCGSCWAFSTVAAVEGINQITTGNLSSLSEQELIDCDTTFNSGCNGGLMDYAF 209

Query:   206 KYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVS 265
             +YI+++GGLHKE+DYPYLMEEG C+++KE++E VTISGY+DVPEND++SL+KALAHQPVS
Sbjct:   210 QYIISTGGLHKEDDYPYLMEEGICQEQKEDVERVTISGYEDVPENDDESLVKALAHQPVS 269

Query:   266 VAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYI 325
             VAIEASG DFQFY GGVF G CG +LDHGVAAVGYG SKGSDY+IVKNSWGP+WGE+G+I
Sbjct:   270 VAIEASGRDFQFYKGGVFNGKCGTDLDHGVAAVGYGSSKGSDYVIVKNSWGPRWGEKGFI 329

Query:   326 RMKRNTGKPEGLCGINKMASIPLK 349
             RMKRNTGKPEGLCGINKMAS P K
Sbjct:   330 RMKRNTGKPEGLCGINKMASYPTK 353
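
Each alignment in this report carries an "Identities = a/b (x%), Positives = c/d (y%)" line. The sketch below, which is not part of the BLAST output, shows one way to read those counts and recompute the percentages. Judging only from the values in this report (for example 253/326, about 77.6%, is printed as 77%), the printed percentages appear to be truncated rather than rounded, so the sketch uses integer division. The names STATS, parse_match_stats and truncated_percent are hypothetical.

```python
import re

# Matches both fields of a line such as:
#   " Identities = 258/324 (79%), Positives = 296/324 (91%)"
STATS = re.compile(r"(Identities|Positives) = (\d+)/(\d+) \((\d+)%\)")


def parse_match_stats(line):
    """Map each label to (matched, aligned_length, reported_percent)."""
    stats = {}
    for label, matched, length, percent in STATS.findall(line):
        stats[label] = (int(matched), int(length), int(percent))
    return stats


def truncated_percent(matched, length):
    """Whole-number percentage, truncated (assumption based on this report)."""
    return matched * 100 // length


if __name__ == "__main__":
    line = " Identities = 258/324 (79%), Positives = 296/324 (91%)"
    for label, (matched, length, reported) in parse_match_stats(line).items():
        print(label, matched, length, reported, truncated_percent(matched, length))
```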


>TAIR|locus:2030427
            symbol:XCP2 "xylem cysteine peptidase 2" species:3702
            "Arabidopsis thaliana" [GO:0006508 "proteolysis" evidence=IEA;ISS]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA;ISS]
            [GO:0009507 "chloroplast" evidence=ISM] [GO:0008233 "peptidase
            activity" evidence=ISS] [GO:0005618 "cell wall" evidence=IDA]
            [GO:0010623 "developmental programmed cell death" evidence=IMP]
            [GO:0010075 "regulation of meristem growth" evidence=RCA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 EMBL:CP002684
            GenomeReviews:CT485782_GR GO:GO:0005886 GO:GO:0005618 GO:GO:0005773
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 EMBL:AC069251 EMBL:AC007369 GO:GO:0010623
            OMA:YKEIPEG HOGENOM:HOG000230773 KO:K16290 EMBL:AF191028
            EMBL:BT004822 IPI:IPI00526722 PIR:A86341 RefSeq:NP_564126.1
            UniGene:At.21316 ProteinModelPortal:Q9LM66 SMR:Q9LM66 IntAct:Q9LM66
            STRING:Q9LM66 MEROPS:C01.120 PaxDb:Q9LM66 PRIDE:Q9LM66
            ProMEX:Q9LM66 EnsemblPlants:AT1G20850.1 GeneID:838677
            KEGG:ath:AT1G20850 GeneFarm:5034 TAIR:At1g20850 InParanoid:Q9LM66
            PhylomeDB:Q9LM66 ProtClustDB:CLSN2917031 Genevestigator:Q9LM66
            GermOnline:AT1G20850 Uniprot:Q9LM66
        Length = 356

 Score = 1374 (488.7 bits), Expect = 1.9e-140, P = 1.9e-140
 Identities = 253/326 (77%), Positives = 285/326 (87%)

Query:    26 HDFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKE 85
             HD+SIVGYSPE L S DKLIELFE+W+S   K Y+ +EEK  RFE+FK+NLKHID+ NK+
Sbjct:    29 HDYSIVGYSPEDLESHDKLIELFENWISNFEKAYETVEEKFLRFEVFKDNLKHIDETNKK 88

Query:    86 VTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPS--AEFSYRDVKALPKSVDWRKKG 143
               SYWLGLNEFAD+SHEEFK  YLGLK     R +    AEF+YRDV+A+PKSVDWRKKG
Sbjct:    89 GKSYWLGLNEFADLSHEEFKKMYLGLKTDIVRRDEERSYAEFAYRDVEAVPKSVDWRKKG 148

Query:   144 AVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDY 203
             AV  VKNQGSCGSCWAFSTVAAVEGIN+IV+GNLT+LSEQELIDCDT++NNGCNGGLMDY
Sbjct:   149 AVAEVKNQGSCGSCWAFSTVAAVEGINKIVTGNLTTLSEQELIDCDTTYNNGCNGGLMDY 208

Query:   204 AFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQP 263
             AF+YIV +GGL KEEDYPY MEEGTCE +K+E E VTI+G+QDVP NDE+SLLKALAHQP
Sbjct:   209 AFEYIVKNGGLRKEEDYPYSMEEGTCEMQKDESETVTINGHQDVPTNDEKSLLKALAHQP 268

Query:   264 VSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERG 323
             +SVAI+ASG +FQFYSGGVF G CG +LDHGVAAVGYG SKGSDYIIVKNSWGPKWGE+G
Sbjct:   269 LSVAIDASGREFQFYSGGVFDGRCGVDLDHGVAAVGYGSSKGSDYIIVKNSWGPKWGEKG 328

Query:   324 YIRMKRNTGKPEGLCGINKMASIPLK 349
             YIR+KRNTGKPEGLCGINKMAS P K
Sbjct:   329 YIRLKRNTGKPEGLCGINKMASFPTK 354


>TAIR|locus:2167821
            symbol:RD21B "responsive to dehydration 21B" species:3702
            "Arabidopsis thaliana" [GO:0005576 "extracellular region"
            evidence=ISM] [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA;ISS] [GO:0005773
            "vacuole" evidence=IDA] [GO:0009651 "response to salt stress"
            evidence=IEP] [GO:0005515 "protein binding" evidence=IPI]
            [GO:0005829 "cytosol" evidence=IDA] [GO:0052541 "plant-type cell
            wall cellulose metabolic process" evidence=RCA] [GO:0052546 "cell
            wall pectin metabolic process" evidence=RCA] [GO:0005783
            "endoplasmic reticulum" evidence=IDA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0005783 GO:GO:0005829 EMBL:CP002688
            GO:GO:0005773 GO:GO:0009651 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 EMBL:AB008267 HSSP:O65039
            HOGENOM:HOG000230773 KO:K01376 InterPro:IPR000118 Pfam:PF00396
            SMART:SM00277 ProtClustDB:CLSN2688498 EMBL:AY062608 EMBL:AY114661
            IPI:IPI00520971 RefSeq:NP_568620.1 UniGene:At.24130 SMR:Q9FMH8
            IntAct:Q9FMH8 STRING:Q9FMH8 MEROPS:C01.A12
            EnsemblPlants:AT5G43060.1 GeneID:834321 KEGG:ath:AT5G43060
            TAIR:At5g43060 InParanoid:Q9FMH8 OMA:ENSEASL Genevestigator:Q9FMH8
            Uniprot:Q9FMH8
        Length = 463

 Score = 982 (350.7 bits), Expect = 6.4e-99, P = 6.4e-99
 Identities = 193/337 (57%), Positives = 239/337 (70%)

Query:    27 DFSIVGYSPEH-LTS----MDKLIE-LFESWMSKHGKTYKCIE-----EKLHRFEIFKEN 75
             D SI+ Y   H +T+     D  +E ++E+WM +HGK  K  +     EK  RFEIFK+N
Sbjct:    23 DMSIISYDENHHITTETSRSDSEVERIYEAWMVEHGKK-KMNQNGLGAEKDQRFEIFKDN 81

Query:    76 LKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRR--QPSAEFSYRDVKAL 133
             L+ ID+ N +  SY LGL  FAD+++EE+++ YLG KP   T+R  + S  +  R   AL
Sbjct:    82 LRFIDEHNTKNLSYKLGLTRFADLTNEEYRSMYLGAKP---TKRVLKTSDRYQARVGDAL 138

Query:   134 PKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFN 193
             P SVDWRK+GAV  VK+QGSCGSCWAFST+ AVEGIN+IV+G+L SLSEQEL+DCDTS+N
Sbjct:   139 PDSVDWRKEGAVADVKDQGSCGSCWAFSTIGAVEGINKIVTGDLISLSEQELVDCDTSYN 198

Query:   194 NGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQ 253
              GCNGGLMDYAF++I+ +GG+  E DYPY   +G C+  ++  +VVTI  Y+DVPEN E 
Sbjct:   199 QGCNGGLMDYAFEFIIKNGGIDTEADYPYKAADGRCDQNRKNAKVVTIDSYEDVPENSEA 258

Query:   254 SLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKN 313
             SL KALAHQP+SVAIEA G  FQ YS GVF G CG ELDHGV AVGYG   G DY IV+N
Sbjct:   259 SLKKALAHQPISVAIEAGGRAFQLYSSGVFDGLCGTELDHGVVAVGYGTENGKDYWIVRN 318

Query:   314 SWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLKK 350
             SWG +WGE GYI+M RN   P G CGI   AS P+KK
Sbjct:   319 SWGNRWGESGYIKMARNIEAPTGKCGIAMEASYPIKK 355


>TAIR|locus:2825832
            symbol:RD21A "responsive to dehydration 21A" species:3702
            "Arabidopsis thaliana" [GO:0005576 "extracellular region"
            evidence=ISM] [GO:0006508 "proteolysis" evidence=IEA;IMP]
            [GO:0008234 "cysteine-type peptidase activity"
            evidence=ISS;IDA;IMP] [GO:0009507 "chloroplast" evidence=IDA]
            [GO:0005773 "vacuole" evidence=IDA] [GO:0008233 "peptidase
            activity" evidence=IDA] [GO:0048046 "apoplast" evidence=IDA]
            [GO:0005515 "protein binding" evidence=IPI] [GO:0005829 "cytosol"
            evidence=RCA] [GO:0009506 "plasmodesma" evidence=IDA] [GO:0050832
            "defense response to fungus" evidence=IMP] [GO:0006096 "glycolysis"
            evidence=RCA] [GO:0006833 "water transport" evidence=RCA]
            [GO:0006972 "hyperosmotic response" evidence=RCA] [GO:0007030
            "Golgi organization" evidence=RCA] [GO:0009266 "response to
            temperature stimulus" evidence=RCA] [GO:0009651 "response to salt
            stress" evidence=RCA] [GO:0015996 "chlorophyll catabolic process"
            evidence=RCA] [GO:0042744 "hydrogen peroxide catabolic process"
            evidence=RCA] [GO:0046686 "response to cadmium ion" evidence=RCA]
            [GO:0009414 "response to water deprivation" evidence=TAS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 EMBL:CP002684
            GenomeReviews:CT485782_GR GO:GO:0009506 GO:GO:0009507 GO:GO:0005773
            GO:GO:0050832 GO:GO:0048046 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 EMBL:AC083835
            HOGENOM:HOG000230773 KO:K01376 InterPro:IPR000118 Pfam:PF00396
            SMART:SM00277 UniGene:At.43549 EMBL:D13043 EMBL:AY072130
            EMBL:AY133781 IPI:IPI00530094 PIR:JN0719 RefSeq:NP_564497.1
            UniGene:At.47599 UniGene:At.71705 ProteinModelPortal:P43297
            SMR:P43297 IntAct:P43297 STRING:P43297 MEROPS:C01.064 PaxDb:P43297
            PRIDE:P43297 ProMEX:P43297 EnsemblPlants:AT1G47128.1 GeneID:841122
            KEGG:ath:AT1G47128 TAIR:At1g47128 InParanoid:P43297 OMA:EAWLVKH
            PhylomeDB:P43297 ProtClustDB:CLSN2688498 Genevestigator:P43297
            GermOnline:AT1G47128 Uniprot:P43297
        Length = 462

 Score = 961 (343.3 bits), Expect = 1.1e-96, P = 1.1e-96
 Identities = 175/331 (52%), Positives = 233/331 (70%)

Query:    27 DFSIVGYSPEHLTSMD------KLIELFESWMSKHGK--TYKCIEEKLHRFEIFKENLKH 78
             D SI+ Y  +H  S        +++ ++E+W+ KHGK  +   + EK  RFEIFK+NL+ 
Sbjct:    23 DMSIISYDEKHGVSTTGGRSEAEVMSIYEAWLVKHGKAQSQNSLVEKDRRFEIFKDNLRF 82

Query:    79 IDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVD 138
             +D+ N++  SY LGL  FAD++++E+++KYLG K +    R+ S  +  R    LP+S+D
Sbjct:    83 VDEHNEKNLSYRLGLTRFADLTNDEYRSKYLGAKMEKKGERRTSLRYEARVGDELPESID 142

Query:   139 WRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNG 198
             WRKKGAV  VK+QG CGSCWAFST+ AVEGINQIV+G+L +LSEQEL+DCDTS+N GCNG
Sbjct:   143 WRKKGAVAEVKDQGGCGSCWAFSTIGAVEGINQIVTGDLITLSEQELVDCDTSYNEGCNG 202

Query:   199 GLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKA 258
             GLMDYAF++I+ +GG+  ++DYPY   +GTC+  ++  +VVTI  Y+DVP   E+SL KA
Sbjct:   203 GLMDYAFEFIIKNGGIDTDKDYPYKGVDGTCDQIRKNAKVVTIDSYEDVPTYSEESLKKA 262

Query:   259 LAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPK 318
             +AHQP+S+AIEA G  FQ Y  G+F G CG +LDHGV AVGYG   G DY IV+NSWG  
Sbjct:   263 VAHQPISIAIEAGGRAFQLYDSGIFDGSCGTQLDHGVVAVGYGTENGKDYWIVRNSWGKS 322

Query:   319 WGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
             WGE GY+RM RN     G CGI    S P+K
Sbjct:   323 WGESGYLRMARNIASSSGKCGIAIEPSYPIK 353


>TAIR|locus:2157712
            symbol:CEP1 "cysteine endopeptidase 1" species:3702
            "Arabidopsis thaliana" [GO:0005576 "extracellular region"
            evidence=ISM] [GO:0006508 "proteolysis" evidence=IEA;ISS]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA;ISS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0005783 EMBL:CP002688
            GenomeReviews:BA000015_GR eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 PROSITE:PS00014
            EMBL:AB024031 MEROPS:I29.003 EMBL:HM367092 EMBL:AY091087
            IPI:IPI00516991 RefSeq:NP_568722.1 UniGene:At.7918 HSSP:O65039
            ProteinModelPortal:Q9FGR9 SMR:Q9FGR9 PaxDb:Q9FGR9 PRIDE:Q9FGR9
            EnsemblPlants:AT5G50260.1 GeneID:835091 KEGG:ath:AT5G50260
            TAIR:At5g50260 HOGENOM:HOG000230773 InParanoid:Q9FGR9 KO:K16292
            OMA:WHSKKYH PhylomeDB:Q9FGR9 ProtClustDB:CLSN2689970
            Genevestigator:Q9FGR9 Uniprot:Q9FGR9
        Length = 361

 Score = 944 (337.4 bits), Expect = 6.8e-95, P = 6.8e-95
 Identities = 185/315 (58%), Positives = 223/315 (70%)

Query:    40 SMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADM 99
             S + L EL+E W S H    + +EEK  RF +FK N+KHI + NK+  SY L LN+F DM
Sbjct:    30 SENSLWELYERWRSHH-TVARSLEEKAKRFNVFKHNVKHIHETNKKDKSYKLKLNKFGDM 88

Query:   100 SHEEFKNKYLG--LKPQ--FPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCG 155
             + EEF+  Y G  +K    F   ++ +  F Y +V  LP SVDWRK GAVTPVKNQG CG
Sbjct:    89 TSEEFRRTYAGSNIKHHRMFQGEKKATKSFMYANVNTLPTSVDWRKNGAVTPVKNQGQCG 148

Query:   156 SCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLH 215
             SCWAFSTV AVEGINQI +  LTSLSEQEL+DCDT+ N GCNGGLMD AF++I   GGL 
Sbjct:   149 SCWAFSTVVAVEGINQIRTKKLTSLSEQELVDCDTNQNQGCNGGLMDLAFEFIKEKGGLT 208

Query:   216 KEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDF 275
              E  YPY   + TC+  KE   VV+I G++DVP+N E  L+KA+A+QPVSVAI+A G+DF
Sbjct:   209 SELVYPYKASDETCDTNKENAPVVSIDGHEDVPKNSEDDLMKAVANQPVSVAIDAGGSDF 268

Query:   276 QFYSGGVFTGPCGAELDHGVAAVGYGKS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGKP 334
             QFYS GVFTG CG EL+HGVA VGYG +  G+ Y IVKNSWG +WGE+GYIRM+R     
Sbjct:   269 QFYSEGVFTGRCGTELNHGVAVVGYGTTIDGTKYWIVKNSWGEEWGEKGYIRMQRGIRHK 328

Query:   335 EGLCGINKMASIPLK 349
             EGLCGI   AS PLK
Sbjct:   329 EGLCGIAMEASYPLK 343


>TAIR|locus:2090614
            symbol:AT3G19390 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA;ISS] [GO:0000041 "transition metal ion
            transport" evidence=RCA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            EMBL:CP002686 GenomeReviews:BA000014_GR eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 EMBL:AB025624
            MEROPS:I29.003 OMA:KAMDQKC HSSP:O65039 HOGENOM:HOG000230773
            InterPro:IPR000118 Pfam:PF00396 SMART:SM00277 EMBL:AY062725
            EMBL:AY093350 IPI:IPI00520189 RefSeq:NP_566633.1 UniGene:At.27473
            ProteinModelPortal:Q9LT78 SMR:Q9LT78 IntAct:Q9LT78 STRING:Q9LT78
            PaxDb:Q9LT78 PRIDE:Q9LT78 EnsemblPlants:AT3G19390.1 GeneID:821473
            KEGG:ath:AT3G19390 TAIR:At3g19390 InParanoid:Q9LT78
            PhylomeDB:Q9LT78 ProtClustDB:CLSN2917188 Genevestigator:Q9LT78
            Uniprot:Q9LT78
        Length = 452

 Score = 909 (325.0 bits), Expect = 3.5e-91, P = 3.5e-91
 Identities = 169/306 (55%), Positives = 218/306 (71%)

Query:    47 LFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFK 105
             ++E W+ ++ K Y  + EK  RFEIFK+NLK +++ +     +Y +GL  FAD++++EF+
Sbjct:    42 MYERWLVENRKNYNGLGEKERRFEIFKDNLKFVEEHSSIPNRTYEVGLTRFADLTNDEFR 101

Query:   106 NKYLGLKPQFPTRRQPSAE-FSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVA 164
               YL  K +  TR     E + Y+   +LP ++DWR KGAV PVK+QGSCGSCWAFS + 
Sbjct:   102 AIYLRSKME-RTRVPVKGEKYLYKVGDSLPDAIDWRAKGAVNPVKDQGSCGSCWAFSAIG 160

Query:   165 AVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLM 224
             AVEGINQI +G L SLSEQEL+DCDTS+N+GC GGLMDYAFK+I+ +GG+  EEDYPY+ 
Sbjct:   161 AVEGINQIKTGELISLSEQELVDCDTSYNDGCGGGLMDYAFKFIIENGGIDTEEDYPYIA 220

Query:   225 EE-GTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVF 283
              +   C   K+   VVTI GY+DVP+NDE+SL KALA+QP+SVAIEA G  FQ Y+ GVF
Sbjct:   221 TDVNVCNSDKKNTRVVTIDGYEDVPQNDEKSLKKALANQPISVAIEAGGRAFQLYTSGVF 280

Query:   284 TGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKM 343
             TG CG  LDHGV AVGYG   G DY IV+NSWG  WGE GY +++RN  +  G CG+  M
Sbjct:   281 TGTCGTSLDHGVVAVGYGSEGGQDYWIVRNSWGSNWGESGYFKLERNIKESSGKCGVAMM 340

Query:   344 ASIPLK 349
             AS P K
Sbjct:   341 ASYPTK 346


>TAIR|locus:2024362
            symbol:XBCP3 "xylem bark cysteine peptidase 3"
            species:3702 "Arabidopsis thaliana" [GO:0005576 "extracellular
            region" evidence=ISM] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA;ISS]
            [GO:0005773 "vacuole" evidence=IDA] [GO:0005783 "endoplasmic
            reticulum" evidence=IDA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005783 EMBL:CP002684 GO:GO:0005773 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 MEROPS:I29.003
            HOGENOM:HOG000230773 InterPro:IPR000118 Pfam:PF00396 SMART:SM00277
            UniGene:At.10233 OMA:CEIESAV EMBL:BT026490 EMBL:AK226753
            IPI:IPI00536687 RefSeq:NP_563855.1 ProteinModelPortal:Q0WVJ5
            SMR:Q0WVJ5 PRIDE:Q0WVJ5 EnsemblPlants:AT1G09850.1 GeneID:837517
            KEGG:ath:AT1G09850 TAIR:At1g09850 InParanoid:Q0WVJ5
            PhylomeDB:Q0WVJ5 ProtClustDB:CLSN2687747 Genevestigator:Q0WVJ5
            Uniprot:Q0WVJ5
        Length = 437

 Score = 881 (315.2 bits), Expect = 3.2e-88, P = 3.2e-88
 Identities = 167/312 (53%), Positives = 211/312 (67%)

Query:    39 TSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTS-YWLGLNEFA 97
             +S D + ELF+ W  KHGKTY   EE+  R +IFK+N   + Q N    + Y L LN FA
Sbjct:    23 SSSDDISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFA 82

Query:    98 DMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSC 157
             D++H EFK   LGL    P+    S   S      +P SVDWRKKGAVT VK+QGSCG+C
Sbjct:    83 DLTHHEFKASRLGLSVSAPSVIMASKGQSLGGSVKVPDSVDWRKKGAVTNVKDQGSCGAC 142

Query:   158 WAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKE 217
             W+FS   A+EGINQIV+G+L SLSEQELIDCD S+N GCNGGLMDYAF++++ + G+  E
Sbjct:   143 WSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNHGIDTE 202

Query:   218 EDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQF 277
             +DYPY   +GTC+  K + +VVTI  Y  V  NDE++L++A+A QPVSV I  S   FQ 
Sbjct:   203 KDYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSERAFQL 262

Query:   278 YSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGL 337
             YS G+F+GPC   LDH V  VGYG   G DY IVKNSWG  WG  G++ M+RNT   +G+
Sbjct:   263 YSSGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQRNTENSDGV 322

Query:   338 CGINKMASIPLK 349
             CGIN +AS P+K
Sbjct:   323 CGINMLASYPIK 334


>TAIR|locus:2152445
            symbol:SAG12 "senescence-associated gene 12" species:3702
            "Arabidopsis thaliana" [GO:0005576 "extracellular region"
            evidence=ISM] [GO:0006508 "proteolysis" evidence=IEA;ISS]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA;ISS]
            [GO:0009723 "response to ethylene stimulus" evidence=IEP]
            [GO:0007568 "aging" evidence=IEP;TAS] [GO:0010150 "leaf senescence"
            evidence=IEP;TAS] [GO:0010282 "senescence-associated vacuole"
            evidence=IDA] [GO:0009817 "defense response to fungus, incompatible
            interaction" evidence=IEP] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            EMBL:CP002688 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0010150 GO:GO:0009817 EMBL:AB016870
            HSSP:O65039 OMA:NDEQALM EMBL:AF370131 EMBL:AY040073 IPI:IPI00544181
            RefSeq:NP_568651.1 UniGene:At.75256 UniGene:At.7710
            ProteinModelPortal:Q9FJ47 SMR:Q9FJ47 IntAct:Q9FJ47 STRING:Q9FJ47
            MEROPS:C01.117 PRIDE:Q9FJ47 ProMEX:Q9FJ47 EnsemblPlants:AT5G45890.1
            GeneID:834629 KEGG:ath:AT5G45890 TAIR:At5g45890 InParanoid:Q9FJ47
            PhylomeDB:Q9FJ47 ProtClustDB:CLSN2917735 ArrayExpress:Q9FJ47
            Genevestigator:Q9FJ47 GO:GO:0010282 Uniprot:Q9FJ47
        Length = 346

 Score = 863 (308.9 bits), Expect = 2.6e-86, P = 2.6e-86
 Identities = 163/305 (53%), Positives = 215/305 (70%)

Query:    51 WMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT--SYWLGLNEFADMSHEEFKNKY 108
             WM+KHG+ Y  ++E+ +R+ +FK N++ I+  N      ++ L +N+FAD++++EF++ Y
Sbjct:    41 WMTKHGRVYADVKEENNRYVVFKNNVERIEHLNSIPAGRTFKLAVNQFADLTNDEFRSMY 100

Query:   109 LGLKPQFPTRRQPSAE---FSYRDVK--ALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTV 163
              G K       Q   +   F Y++V   ALP SVDWRKKGAVTP+KNQGSCG CWAFS V
Sbjct:   101 TGFKGVSALSSQSQTKMSPFRYQNVSSGALPVSVDWRKKGAVTPIKNQGSCGCCWAFSAV 160

Query:   164 AAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYL 223
             AA+EG  QI  G L SLSEQ+L+DCDT+ + GC GGLMD AF++I A+GGL  E +YPY 
Sbjct:   161 AAIEGATQIKKGKLISLSEQQLVDCDTN-DFGCEGGLMDTAFEHIKATGGLTTESNYPYK 219

Query:   224 MEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVF 283
              E+ TC  KK   +  +I+GY+DVP NDEQ+L+KA+AHQPVSV IE  G DFQFYS GVF
Sbjct:   220 GEDATCNSKKTNPKATSITGYEDVPVNDEQALMKAVAHQPVSVGIEGGGFDFQFYSSGVF 279

Query:   284 TGPCGAELDHGVAAVGYGKS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINK 342
             TG C   LDH V A+GYG+S  GS Y I+KNSWG KWGE GY+R++++    +GLCG+  
Sbjct:   280 TGECTTYLDHAVTAIGYGESTNGSKYWIIKNSWGTKWGESGYMRIQKDVKDKQGLCGLAM 339

Query:   343 MASIP 347
              AS P
Sbjct:   340 KASYP 344


>TAIR|locus:2090629
            symbol:AT3G19400 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] [GO:0005829 "cytosol"
            evidence=IDA] [GO:0019344 "cysteine biosynthetic process"
            evidence=RCA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 GO:GO:0005829
            EMBL:CP002686 GenomeReviews:BA000014_GR eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 EMBL:AB025624
            MEROPS:I29.003 HOGENOM:HOG000230773 EMBL:AK118509 IPI:IPI00543468
            RefSeq:NP_566634.2 UniGene:At.38409 ProteinModelPortal:Q9LT77
            SMR:Q9LT77 PaxDb:Q9LT77 PRIDE:Q9LT77 EnsemblPlants:AT3G19400.1
            GeneID:821474 KEGG:ath:AT3G19400 TAIR:At3g19400 InParanoid:Q9LT77
            OMA:IGEHERR ProtClustDB:CLSN2679975 Genevestigator:Q9LT77
            Uniprot:Q9LT77
        Length = 362

 Score = 851 (304.6 bits), Expect = 4.9e-85, P = 4.9e-85
 Identities = 159/307 (51%), Positives = 208/307 (67%)

Query:    47 LFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFK 105
             ++E W+ ++ K Y  + EK  RF+IFK+NLK +D+ N     ++ +GL  FAD+++EEF+
Sbjct:    43 MYEQWLVENRKNYNGLGEKERRFKIFKDNLKFVDEHNSVPDRTFEVGLTRFADLTNEEFR 102

Query:   106 NKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAA 165
               YL  K +       +  + Y++   LP  VDWR  GAV  VK+QG+CGSCWAFS V A
Sbjct:   103 AIYLRKKMERTKDSVKTERYLYKEGDVLPDEVDWRANGAVVSVKDQGNCGSCWAFSAVGA 162

Query:   166 VEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLM 224
             VEGINQI +G L SLSEQEL+DCD  F N GC+GG+M+YAF++I+ +GG+  ++DYPY  
Sbjct:   163 VEGINQITTGELISLSEQELVDCDRGFVNAGCDGGIMNYAFEFIMKNGGIETDQDYPYNA 222

Query:   225 EE-GTCE-DKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGV 282
              + G C  DK     VVTI GY+DVP +DE+SL KA+AHQPVSVAIEAS   FQ Y  GV
Sbjct:   223 NDLGLCNADKNNNTRVVTIDGYEDVPRDDEKSLKKAVAHQPVSVAIEASSQAFQLYKSGV 282

Query:   283 FTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINK 342
              TG CG  LDHGV  VGYG + G DY I++NSWG  WG+ GY++++RN   P G CGI  
Sbjct:   283 MTGTCGISLDHGVVVVGYGSTSGEDYWIIRNSWGLNWGDSGYVKLQRNIDDPFGKCGIAM 342

Query:   343 MASIPLK 349
             M S P K
Sbjct:   343 MPSYPTK 349


>TAIR|locus:2117979
            symbol:AT4G23520 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            EMBL:CP002687 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00639
            KO:K01376 IPI:IPI00527171 RefSeq:NP_567686.2 UniGene:At.32421
            ProteinModelPortal:F4JNL3 SMR:F4JNL3 MEROPS:C01.A22 PRIDE:F4JNL3
            EnsemblPlants:AT4G23520.1 GeneID:828452 KEGG:ath:AT4G23520
            OMA:PANDEIS ArrayExpress:F4JNL3 Uniprot:F4JNL3
        Length = 356

 Score = 839 (300.4 bits), Expect = 9.1e-84, P = 9.1e-84
 Identities = 160/318 (50%), Positives = 218/318 (68%)

Query:    37 HLTSMDKLIELFESWMSKHGKTY-KCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNE 95
             H  S +++  +F+ WMSKHGKTY   + EK  RF+ FK+NL+ IDQ N +  SY LGL  
Sbjct:    36 HNRSNEEVEFIFQMWMSKHGKTYTNALGEKERRFQNFKDNLRFIDQHNAKNLSYQLGLTR 95

Query:    96 FADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKA--LPKSVDWRKKGAVTPVKNQGS 153
             FAD++ +E+++ + G  P+ P +R       Y  +    LP+SVDWR++GAV+ +K+QG+
Sbjct:    96 FADLTVQEYRDLFPG-SPK-PKQRNLKTSRRYVPLAGDQLPESVDWRQEGAVSEIKDQGT 153

Query:   154 CGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNG-GLMDYAFKYIVASG 212
             C SCWAFSTVAAVEG+N+IV+G L SLSEQEL+DC+   NNGC G GLMD AF++++ + 
Sbjct:   154 CNSCWAFSTVAAVEGLNKIVTGELISLSEQELVDCNL-VNNGCYGSGLMDTAFQFLINNN 212

Query:   213 GLHKEEDYPYLMEEGTCEDKKEEM-EVVTISGYQDVPENDEQSLLKALAHQPVSVAIEAS 271
             GL  E+DYPY   +G+C  K+    +V+TI  Y+DVP NDE SL KA+AHQPVSV ++  
Sbjct:   213 GLDSEKDYPYQGTQGSCNRKQSTSNKVITIDSYEDVPANDEISLQKAVAHQPVSVGVDKK 272

Query:   272 GTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNT 331
               +F  Y   ++ GPCG  LDH +  VGYG   G DY IV+NSWG  WG+ GYI++ RN 
Sbjct:   273 SQEFMLYRSCIYNGPCGTNLDHALVIVGYGSENGQDYWIVRNSWGTTWGDAGYIKIARNF 332

Query:   332 GKPEGLCGINKMASIPLK 349
               P+GLCGI  +AS P+K
Sbjct:   333 EDPKGLCGIAMLASYPIK 350


>TAIR|locus:505006391
            symbol:CEP3 "cysteine endopeptidase 3" species:3702
            "Arabidopsis thaliana" [GO:0005576 "extracellular region"
            evidence=ISM] [GO:0006508 "proteolysis" evidence=IEA;ISS]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA;ISS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0005783 EMBL:CP002686
            GenomeReviews:BA000014_GR eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 PROSITE:PS00014
            EMBL:AL049659 HSSP:O65039 HOGENOM:HOG000230773 KO:K16292
            EMBL:AK119026 IPI:IPI00525150 PIR:T06707 RefSeq:NP_566901.1
            UniGene:At.3162 ProteinModelPortal:Q9STL5 SMR:Q9STL5 MEROPS:C01.A02
            PRIDE:Q9STL5 EnsemblPlants:AT3G48350.1 GeneID:823993
            KEGG:ath:AT3G48350 TAIR:At3g48350 InParanoid:Q9STL5 OMA:DITHHEF
            PhylomeDB:Q9STL5 ProtClustDB:CLSN2917387 Genevestigator:Q9STL5
            Uniprot:Q9STL5
        Length = 364

 Score = 835 (299.0 bits), Expect = 2.4e-83, P = 2.4e-83
 Identities = 163/323 (50%), Positives = 216/323 (66%)

Query:    33 YSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLG 92
             +  + L + + + +L+E W   H  + +   E + RF +F+ N+ H+ + NK+   Y L 
Sbjct:    23 FDEKELETEENVWKLYERWRGHHSVS-RASHEAIKRFNVFRHNVLHVHRTNKKNKPYKLK 81

Query:    93 LNEFADMSHEEFKNKYLG--LKPQFPTR--RQPSAEFSYRDVKALPKSVDWRKKGAVTPV 148
             +N FAD++H EF++ Y G  +K     R  ++ S  F Y +V  +P SVDWR+KGAVT V
Sbjct:    82 INRFADITHHEFRSSYAGSNVKHHRMLRGPKRGSGGFMYENVTRVPSSVDWREKGAVTEV 141

Query:   149 KNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYI 208
             KNQ  CGSCWAFSTVAAVEGIN+I +  L SLSEQEL+DCDT  N GC GGLM+ AF++I
Sbjct:   142 KNQQDCGSCWAFSTVAAVEGINKIRTNKLVSLSEQELVDCDTEENQGCAGGLMEPAFEFI 201

Query:   209 VASGGLHKEEDYPYLMEEGT-CEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVA 267
               +GG+  EE YPY   +   C       E VTI G++ VPENDE+ LLKA+AHQPVSVA
Sbjct:   202 KNNGGIKTEETYPYDSSDVQFCRANSIGGETVTIDGHEHVPENDEEELLKAVAHQPVSVA 261

Query:   268 IEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSK-GSDYIIVKNSWGPKWGERGYIR 326
             I+A  +DFQ YS GVF G CG +L+HGV  VGYG++K G+ Y IV+NSWGP+WGE GY+R
Sbjct:   262 IDAGSSDFQLYSEGVFIGECGTQLNHGVVIVGYGETKNGTKYWIVRNSWGPEWGEGGYVR 321

Query:   327 MKRNTGKPEGLCGINKMASIPLK 349
             ++R   + EG CGI   AS P K
Sbjct:   322 IERGISENEGRCGIAMEASYPTK 344


>TAIR|locus:2128253
            symbol:AT4G11320 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            EMBL:CP002687 GenomeReviews:CT486007_GR EMBL:AL096882 EMBL:AL161531
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 OMA:ICHGADP
            HOGENOM:HOG000230773 KO:K01376 ProtClustDB:CLSN2689395
            EMBL:AY035055 EMBL:AY051062 IPI:IPI00520480 PIR:T13023
            RefSeq:NP_567377.1 UniGene:At.25206 ProteinModelPortal:Q9SUS9
            SMR:Q9SUS9 STRING:Q9SUS9 MEROPS:C01.A21 PaxDb:Q9SUS9 PRIDE:Q9SUS9
            EnsemblPlants:AT4G11320.1 GeneID:826734 KEGG:ath:AT4G11320
            TAIR:At4g11320 InParanoid:Q9SUS9 PhylomeDB:Q9SUS9
            Genevestigator:Q9SUS9 GermOnline:AT4G11320 Uniprot:Q9SUS9
        Length = 371

 Score = 829 (296.9 bits), Expect = 1.0e-82, P = 1.0e-82
 Identities = 160/307 (52%), Positives = 203/307 (66%)

Query:    47 LFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKN 106
             +FESWM KHGK Y  + EK  R  IF++NL+ I  RN E  SY LGLN FAD+S  E+  
Sbjct:    55 MFESWMVKHGKVYDSVAEKERRLTIFEDNLRFITNRNAENLSYRLGLNRFADLSLHEYGE 114

Query:   107 KYLGLKPQFPTRR---QPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTV 163
                G  P+ P        S  +   D   LPKSVDWR +GAVT VK+QG C SCWAFSTV
Sbjct:   115 ICHGADPRPPRNHVFMTSSNRYKTSDGDVLPKSVDWRNEGAVTEVKDQGLCRSCWAFSTV 174

Query:   164 AAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYL 223
              AVEG+N+IV+G L +LSEQ+LI+C+   NNGC GG ++ A+++I+ +GGL  + DYPY 
Sbjct:   175 GAVEGLNKIVTGELVTLSEQDLINCNKE-NNGCGGGKVETAYEFIMNNGGLGTDNDYPYK 233

Query:   224 MEEGTCEDK-KEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGV 282
                G CE + KE+ + V I GY+++P NDE +L+KA+AHQPV+  +++S  +FQ Y  GV
Sbjct:   234 ALNGVCEGRLKEDNKNVMIDGYENLPANDEAALMKAVAHQPVTAVVDSSSREFQLYESGV 293

Query:   283 FTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINK 342
             F G CG  L+HGV  VGYG   G DY IVKNS G  WGE GY++M RN   P GLCGI  
Sbjct:   294 FDGTCGTNLNHGVVVVGYGTENGRDYWIVKNSRGDTWGEAGYMKMARNIANPRGLCGIAM 353

Query:   343 MASIPLK 349
              AS PLK
Sbjct:   354 RASYPLK 360


>TAIR|locus:2128243
            symbol:AT4G11310 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] [GO:0005618 "cell wall"
            evidence=IDA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 GO:GO:0005618 EMBL:CP002687
            GenomeReviews:CT486007_GR EMBL:AL096882 EMBL:AL161531
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            HOGENOM:HOG000230773 KO:K01376 EMBL:AY093066 EMBL:BT000099
            IPI:IPI00520496 PIR:T13022 RefSeq:NP_567376.1 UniGene:At.43189
            ProteinModelPortal:Q9SUT0 SMR:Q9SUT0 IntAct:Q9SUT0 STRING:Q9SUT0
            MEROPS:C01.A20 PaxDb:Q9SUT0 PRIDE:Q9SUT0 EnsemblPlants:AT4G11310.1
            GeneID:826733 KEGG:ath:AT4G11310 TAIR:At4g11310 InParanoid:Q9SUT0
            OMA:EVCHGAD PhylomeDB:Q9SUT0 ProtClustDB:CLSN2689395
            Genevestigator:Q9SUT0 GermOnline:AT4G11310 Uniprot:Q9SUT0
        Length = 364

 Score = 826 (295.8 bits), Expect = 2.2e-82, P = 2.2e-82
 Identities = 166/330 (50%), Positives = 212/330 (64%)

Query:    27 DFSIVGYSPEH-LTSM-DKLIEL-FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRN 83
             D S+V Y   + L S+ D    L FESWM KHGK Y  + EK  R  IF++NL+ I+ RN
Sbjct:    25 DMSVVSYDDNNRLHSVFDAEASLIFESWMVKHGKVYGSVAEKERRLTIFEDNLRFINNRN 84

Query:    84 KEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRR---QPSAEFSYRDVKALPKSVDWR 140
              E  SY LGL  FAD+S  E+K    G  P+ P        S  +       LPKSVDWR
Sbjct:    85 AENLSYRLGLTGFADLSLHEYKEVCHGADPRPPRNHVFMTSSDRYKTSADDVLPKSVDWR 144

Query:   141 KKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGL 200
              +GAVT VK+QG C SCWAFSTV AVEG+N+IV+G L +LSEQ+LI+C+   NNGC GG 
Sbjct:   145 NEGAVTEVKDQGHCRSCWAFSTVGAVEGLNKIVTGELVTLSEQDLINCNKE-NNGCGGGK 203

Query:   201 MDYAFKYIVASGGLHKEEDYPYLMEEGTCEDK-KEEMEVVTISGYQDVPENDEQSLLKAL 259
             ++ A+++I+ +GGL  + DYPY    G C+ + KE  + V I GY+++P NDE +L+KA+
Sbjct:   204 LETAYEFIMKNGGLGTDNDYPYKAVNGVCDGRLKENNKNVMIDGYENLPANDESALMKAV 263

Query:   260 AHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKW 319
             AHQPV+  I++S  +FQ Y  GVF G CG  L+HGV  VGYG   G DY +VKNS G  W
Sbjct:   264 AHQPVTAVIDSSSREFQLYESGVFDGSCGTNLNHGVVVVGYGTENGRDYWLVKNSRGITW 323

Query:   320 GERGYIRMKRNTGKPEGLCGINKMASIPLK 349
             GE GY++M RN   P GLCGI   AS PLK
Sbjct:   324 GEAGYMKMARNIANPRGLCGIAMRASYPLK 353


>TAIR|locus:2038515
            symbol:AT1G06260 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] [GO:0048046 "apoplast"
            evidence=IDA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 EMBL:CP002684
            GenomeReviews:CT485782_GR GO:GO:0048046 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 EMBL:AC025290
            MEROPS:I29.003 HSSP:O65039 HOGENOM:HOG000230773 OMA:METAFEF
            IPI:IPI00525965 PIR:D86198 RefSeq:NP_563764.1 UniGene:At.24617
            ProteinModelPortal:Q9LNC1 SMR:Q9LNC1 PaxDb:Q9LNC1 PRIDE:Q9LNC1
            EnsemblPlants:AT1G06260.1 GeneID:837137 KEGG:ath:AT1G06260
            TAIR:At1g06260 InParanoid:Q9LNC1 PhylomeDB:Q9LNC1
            ProtClustDB:CLSN2916975 Genevestigator:Q9LNC1 Uniprot:Q9LNC1
        Length = 343

 Score = 781 (280.0 bits), Expect = 1.3e-77, P = 1.3e-77
 Identities = 163/319 (51%), Positives = 203/319 (63%)

Query:    33 YSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLG 92
             Y P H T    L + FE W+  H K Y   +E + RF I++ N++ ID  N     + L 
Sbjct:    33 YDP-HKT----LKQRFEKWLKTHSKLYGGRDEWMLRFGIYQSNVQLIDYINSLHLPFKLT 87

Query:    93 LNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKA-LPKSVDWRKKGAVTPVKNQ 151
              N FADM++ EFK  +LGL     + R    +    D    +P +VDWR +GAVTP++NQ
Sbjct:    88 DNRFADMTNSEFKAHFLGLNTS--SLRLHKKQRPVCDPAGNVPDAVDWRTQGAVTPIRNQ 145

Query:   152 GSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDT-SFNNGCNGGLMDYAFKYIVA 210
             G CG CWAFS VAA+EGIN+I +GNL SLSEQ+LIDCD  ++N GC+GGLM+ AF++I  
Sbjct:   146 GKCGGCWAFSAVAAIEGINKIKTGNLVSLSEQQLIDCDVGTYNKGCSGGLMETAFEFIKT 205

Query:   211 SGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEA 270
             +GGL  E DYPY   EGTC+ +K + +VVTI GYQ V +N E SL  A A QPVSV I+A
Sbjct:   206 NGGLATETDYPYTGIEGTCDQEKSKNKVVTIQGYQKVAQN-EASLQIAAAQQPVSVGIDA 264

Query:   271 SGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRN 330
              G  FQ YS GVFT  CG  L+HGV  VGYG      Y IVKNSWG  WGE GYIRM+R 
Sbjct:   265 GGFIFQLYSSGVFTNYCGTNLNHGVTVVGYGVEGDQKYWIVKNSWGTGWGEEGYIRMERG 324

Query:   331 TGKPEGLCGINKMASIPLK 349
               +  G CGI  MAS PL+
Sbjct:   325 VSEDTGKCGIAMMASYPLQ 343


>TAIR|locus:2038588
            symbol:AT2G27420 species:3702 "Arabidopsis thaliana"
            [GO:0006508 "proteolysis" evidence=IEA;ISS] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA;ISS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 EMBL:CP002685
            GenomeReviews:CT485783_GR eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 EMBL:AC006232
            MEROPS:I29.003 OMA:EEFRATH HOGENOM:HOG000230773 HSSP:P53634
            ProtClustDB:CLSN2688476 EMBL:AY064033 EMBL:AY096388 IPI:IPI00539752
            PIR:F84672 RefSeq:NP_565649.1 UniGene:At.27094
            ProteinModelPortal:Q9ZQH7 SMR:Q9ZQH7 PRIDE:Q9ZQH7
            EnsemblPlants:AT2G27420.1 GeneID:817287 KEGG:ath:AT2G27420
            TAIR:At2g27420 InParanoid:Q9ZQH7 PhylomeDB:Q9ZQH7
            ArrayExpress:Q9ZQH7 Genevestigator:Q9ZQH7 Uniprot:Q9ZQH7
        Length = 348

 Score = 772 (276.8 bits), Expect = 1.1e-76, P = 1.1e-76
 Identities = 150/317 (47%), Positives = 206/317 (64%)

Query:    45 IELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQ--RNKEVTSYWLGLNEFADMSHE 102
             IE  E WM++  + Y    EK +RF IFK+NL+ +     N ++T Y + +NEF+D++ E
Sbjct:    32 IEKHEQWMARFNRVYSDETEKRNRFNIFKKNLEFVQNFNMNNKIT-YKVDINEFSDLTDE 90

Query:   103 EFKNKYLGLK-PQFPTR------RQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCG 155
             EF+  + GL  P+  TR       + +  F Y +V    +S+DWR++GAVTPVK QG CG
Sbjct:    91 EFRATHTGLVVPEAITRISTLSSGKNTVPFRYGNVSDNGESMDWRQEGAVTPVKYQGRCG 150

Query:   156 SCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLH 215
              CWAFS VAAVEGI +I  G L SLSEQ+L+DCD  +N GC GG+M  AF+YI+ + G+ 
Sbjct:   151 GCWAFSAVAAVEGITKITKGELVSLSEQQLLDCDRDYNQGCRGGIMSKAFEYIIKNQGIT 210

Query:   216 KEEDYPYLMEEGTCEDK---KEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASG 272
              E++YPY   + TC             TISGY+ VP N+E++LL+A++ QPVSV IE +G
Sbjct:   211 TEDNYPYQESQQTCSSSTTLSSSFRAATISGYETVPMNNEEALLQAVSQQPVSVGIEGTG 270

Query:   273 TDFQFYSGGVFTGPCGAELDHGVAAVGYGKSK-GSDYIIVKNSWGPKWGERGYIRMKRNT 331
               F+ YSGGVF G CG +L H V  VGYG S+ G+ Y +VKNSWG  WGE GY+R+KR+ 
Sbjct:   271 AAFRHYSGGVFNGECGTDLHHAVTIVGYGMSEEGTKYWVVKNSWGETWGENGYMRIKRDV 330

Query:   332 GKPEGLCGINKMASIPL 348
               P+G+CG+  +A  PL
Sbjct:   331 DAPQGMCGLAILAFYPL 347


>TAIR|locus:2082881
            symbol:AT3G49340 species:3702 "Arabidopsis thaliana"
            [GO:0006508 "proteolysis" evidence=IEA;ISS] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA;ISS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 EMBL:CP002686
            GenomeReviews:BA000014_GR EMBL:AC012329 EMBL:AL132956
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 MEROPS:I29.003 HOGENOM:HOG000230773 HSSP:P07711
            KO:K01376 IPI:IPI00520642 PIR:T45839 RefSeq:NP_566920.1
            UniGene:At.53854 ProteinModelPortal:Q9SG15 SMR:Q9SG15
            EnsemblPlants:AT3G49340.1 GeneID:824096 KEGG:ath:AT3G49340
            TAIR:At3g49340 InParanoid:Q9SG15 OMA:PQNDEEA PhylomeDB:Q9SG15
            ProtClustDB:CLSN2688476 Genevestigator:Q9SG15 Uniprot:Q9SG15
        Length = 341

 Score = 770 (276.1 bits), Expect = 1.9e-76, P = 1.9e-76
 Identities = 151/312 (48%), Positives = 202/312 (64%)

Query:    45 IELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEE 103
             +E  E WMS+  + Y    EK  RFEIF  NLK ++  N     +Y L +NEF+D++ EE
Sbjct:    32 VEKHEQWMSRFNRVYSDDSEKTSRFEIFTNNLKFVESINMNTNKTYTLDVNEFSDLTDEE 91

Query:   104 FKNKYLGLK-PQFPTR-----RQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSC 157
             FK +Y GL  P+  TR        +  F Y +V    +S+DW ++GAVT VK+Q  CG C
Sbjct:    92 FKARYTGLVVPEGMTRISTTDSHETVSFRYENVGETGESMDWIQEGAVTSVKHQQQCGCC 151

Query:   158 WAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKE 217
             WAFS VAAVEG+ +I +G L SLSEQ+L+DC T  NNGC GG+M  AF YI  + G+  E
Sbjct:   152 WAFSAVAAVEGMTKIANGELVSLSEQQLLDCSTE-NNGCGGGIMWKAFDYIKENQGITTE 210

Query:   218 EDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQF 277
             ++YPY   + TCE     +   TISGY+ VP+NDE++LLKA++ QPVSVAIE SG +F  
Sbjct:   211 DNYPYQGAQQTCESN--HLAAATISGYETVPQNDEEALLKAVSQQPVSVAIEGSGYEFIH 268

Query:   278 YSGGVFTGPCGAELDHGVAAVGYGKSK-GSDYIIVKNSWGPKWGERGYIRMKRNTGKPEG 336
             YSGG+F G CG +L H V  VGYG S+ G  Y ++KNSWG  WGE GY+R+ R+   P+G
Sbjct:   269 YSGGIFNGECGTQLTHAVTIVGYGVSEEGIKYWLLKNSWGESWGENGYMRIMRDVDSPQG 328

Query:   337 LCGINKMASIPL 348
             +CG+  +A  P+
Sbjct:   329 MCGLASLAYYPV 340


>TAIR|locus:2029924
            symbol:AT1G29090 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 EMBL:CP002684 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            HOGENOM:HOG000230773 HSSP:P53634 ProtClustDB:CLSN2688064
            EMBL:BT004146 IPI:IPI00545702 RefSeq:NP_564321.2 UniGene:At.40814
            ProteinModelPortal:Q84W75 SMR:Q84W75 MEROPS:C01.A15
            EnsemblPlants:AT1G29090.1 GeneID:839784 KEGG:ath:AT1G29090
            TAIR:At1g29090 InParanoid:Q84W75 OMA:SIRGHED PhylomeDB:Q84W75
            ArrayExpress:Q84W75 Genevestigator:Q84W75 Uniprot:Q84W75
        Length = 355

 Score = 738 (264.8 bits), Expect = 4.6e-73, P = 4.6e-73
 Identities = 152/329 (46%), Positives = 210/329 (63%)

Query:    31 VGYSPEHLTSMDKLI-ELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT-S 88
             V  +   +T  + ++ E  + WM++  + Y    EK  RF++FK+NLK I++ NK+   +
Sbjct:    29 VSQATSRVTFHEPIVAEHHQQWMTRFSRVYSDELEKQMRFDVFKKNLKFIEKFNKKGDRT 88

Query:    89 YWLGLNEFADMSHEEFKNKYLGLK-----P--QFPTRRQPSAEFSYRDVKALPKSVDWRK 141
             Y LG+NEFAD + EEF   + GLK     P  +F     PS  ++  DV A  ++ DWR 
Sbjct:    89 YKLGVNEFADWTREEFIATHTGLKGVNGIPSSEFVDEMIPSWNWNVSDV-AGRETKDWRY 147

Query:   142 KGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLM 201
             +GAVTPVK QG CG CWAFS+VAAVEG+ +IV  NL SLSEQ+L+DCD   +NGCNGG+M
Sbjct:   148 EGAVTPVKYQGQCGCCWAFSSVAAVEGLTKIVGNNLVSLSEQQLLDCDRERDNGCNGGIM 207

Query:   202 DYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH 261
               AF YI+ + G+  E  YPY   EGTC    +      I G+Q VP N+E++LL+A++ 
Sbjct:   208 SDAFSYIIKNRGIASEASYPYQAAEGTCRYNGKPS--AWIRGFQTVPSNNERALLEAVSK 265

Query:   262 QPVSVAIEASGTDFQFYSGGVFTGP-CGAELDHGVAAVGYGKS-KGSDYIIVKNSWGPKW 319
             QPVSV+I+A G  F  YSGGV+  P CG  ++H V  VGYG S +G  Y + KNSWG  W
Sbjct:   266 QPVSVSIDADGPGFMHYSGGVYDEPYCGTNVNHAVTFVGYGTSPEGIKYWLAKNSWGETW 325

Query:   320 GERGYIRMKRNTGKPEGLCGINKMASIPL 348
             GE GYIR++R+   P+G+CG+ + A  P+
Sbjct:   326 GENGYIRIRRDVAWPQGMCGVAQYAFYPV 354


>TAIR|locus:2055440
            symbol:AT2G34080 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 EMBL:CP002685 GenomeReviews:CT485783_GR
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 MEROPS:I29.003 EMBL:AC002341
            HOGENOM:HOG000230773 HSSP:P53634 IPI:IPI00530325 PIR:B84752
            RefSeq:NP_565780.1 UniGene:At.28613 UniGene:At.37859
            ProteinModelPortal:O22961 SMR:O22961 EnsemblPlants:AT2G34080.1
            GeneID:817969 KEGG:ath:AT2G34080 TAIR:At2g34080 InParanoid:O22961
            OMA:SENDYSY PhylomeDB:O22961 ProtClustDB:CLSN2688064
            ArrayExpress:O22961 Genevestigator:O22961 Uniprot:O22961
        Length = 345

 Score = 738 (264.8 bits), Expect = 4.6e-73, P = 4.6e-73
 Identities = 143/313 (45%), Positives = 205/313 (65%)

Query:    44 LIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHE 102
             +++  E WM++  + Y+   EK  R ++FK+NLK I+  NK+   SY LG+NEFAD ++E
Sbjct:    35 MVDKHEQWMARFSREYRDELEKNMRRDVFKKNLKFIENFNKKGNKSYKLGVNEFADWTNE 94

Query:   103 EFKNKYLGLK------PQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGS 156
             EF   + GLK      P     +  S++ ++     + +S DWR +GAVTPVK QG CG 
Sbjct:    95 EFLAIHTGLKGLTEVSPSKVVAKTISSQ-TWNVSDMVVESKDWRAEGAVTPVKYQGQCGC 153

Query:   157 CWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHK 216
             CWAFS VAAVEG+ +I  GNL SLSEQ+L+DCD  ++ GC+GG+M  AF Y+V + G+  
Sbjct:   154 CWAFSAVAAVEGVAKIAGGNLVSLSEQQLLDCDREYDRGCDGGIMSDAFNYVVQNRGIAS 213

Query:   217 EEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQ 276
             E DY Y   +G C  +        ISG+Q VP N+E++LL+A++ QPVSV+++A+G  F 
Sbjct:   214 ENDYSYQGSDGGC--RSNARPAARISGFQTVPSNNERALLEAVSRQPVSVSMDATGDGFM 271

Query:   277 FYSGGVFTGPCGAELDHGVAAVGYGKSK-GSDYIIVKNSWGPKWGERGYIRMKRNTGKPE 335
              YSGGV+ GPCG   +H V  VGYG S+ G+ Y + KNSWG  WGE+GYIR++R+   P+
Sbjct:   272 HYSGGVYDGPCGTSSNHAVTFVGYGTSQDGTKYWLAKNSWGETWGEKGYIRIRRDVAWPQ 331

Query:   336 GLCGINKMASIPL 348
             G+CG+ + A  P+
Sbjct:   332 GMCGVAQYAFYPV 344


>TAIR|locus:2097104
            symbol:AT3G43960 species:3702 "Arabidopsis thaliana"
            [GO:0005886 "plasma membrane" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] [GO:0031225 "anchored to
            membrane" evidence=TAS] [GO:0048767 "root hair elongation"
            evidence=IMP] [GO:0016132 "brassinosteroid biosynthetic process"
            evidence=RCA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 EMBL:CP002686
            GenomeReviews:BA000014_GR GO:GO:0031225 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0048767 MEROPS:I29.003 HOGENOM:HOG000230773
            EMBL:AL163975 EMBL:AK118634 IPI:IPI00526842 PIR:T48950
            RefSeq:NP_566867.1 UniGene:At.43352 ProteinModelPortal:Q9LXW3
            SMR:Q9LXW3 STRING:Q9LXW3 PaxDb:Q9LXW3 PRIDE:Q9LXW3
            EnsemblPlants:AT3G43960.1 GeneID:823513 KEGG:ath:AT3G43960
            TAIR:At3g43960 eggNOG:NOG286334 InParanoid:Q9LXW3 KO:K01376
            OMA:MAISFRT PhylomeDB:Q9LXW3 ProtClustDB:CLSN2917367
            Genevestigator:Q9LXW3 GermOnline:AT3G43960 Uniprot:Q9LXW3
        Length = 376

 Score = 713 (256.0 bits), Expect = 2.1e-70, P = 2.1e-70
 Identities = 146/316 (46%), Positives = 205/316 (64%)

Query:    43 KLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSH 101
             +++ ++E W+ ++GK Y  + EK  RF+IFK+NLK I++ N +   SY  GLN+F+D++ 
Sbjct:    36 EVLTMYEQWLVENGKNYNGLGEKERRFKIFKDNLKRIEEHNSDPNRSYERGLNKFSDLTA 95

Query:   102 EEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTP-VKNQGSCGSCWAF 160
             +EF+  YLG K +  +    +  + Y++   LP  VDWR++GAV P VK QG CGSCWAF
Sbjct:    96 DEFQASYLGGKMEKKSLSDVAERYQYKEGDVLPDEVDWRERGAVVPRVKRQGECGSCWAF 155

Query:   161 STVAAVEGINQIVSGNLTSLSEQELIDCDTSFNN-GCNGGLMDYAFKYIVASGGLHKEED 219
             +   AVEGINQI +G L SLSEQELIDCD   +N GC GG   +AF++I  +GG+  +E 
Sbjct:   156 AATGAVEGINQITTGELVSLSEQELIDCDRGNDNFGCAGGGAVWAFEFIKENGGIVSDEV 215

Query:   220 YPYLMEEGTCEDKKEEME---VVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQ 276
             Y Y  E+ T   K  EM+   VVTI+G++ VP NDE SL KA+A+QP+SV I A+  +  
Sbjct:   216 YGYTGED-TAACKAIEMKTTRVVTINGHEVVPVNDEMSLKKAVAYQPISVMISAA--NMS 272

Query:   277 FYSGGVFTGPCGAEL--DHGVAAVGYGKSKGS-DYIIVKNSWGPKWGERGYIRMKRNTGK 333
              Y  GV+ G C + L  DH V  VGYG S    DY +++NSWGP+WGE GY+R++RN  +
Sbjct:   273 DYKSGVYKGAC-SNLWGDHNVLIVGYGTSSDEGDYWLIRNSWGPEWGEGGYLRLQRNFHE 331

Query:   334 PEGLCGINKMASIPLK 349
             P G C +      P+K
Sbjct:   332 PTGKCAVAVAPVYPIK 347


>TAIR|locus:2029934
            symbol:AT1G29080 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 EMBL:CP002684 GenomeReviews:CT485782_GR
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 EMBL:AC021043 MEROPS:I29.003 HOGENOM:HOG000230773
            HSSP:P53634 ProtClustDB:CLSN2688064 EMBL:DQ056468 IPI:IPI00521747
            PIR:C86413 RefSeq:NP_564320.1 UniGene:At.51814
            ProteinModelPortal:Q9LP39 SMR:Q9LP39 EnsemblPlants:AT1G29080.1
            GeneID:839783 KEGG:ath:AT1G29080 TAIR:At1g29080 InParanoid:Q9LP39
            OMA:KTWGENG PhylomeDB:Q9LP39 Genevestigator:Q9LP39 Uniprot:Q9LP39
        Length = 346

 Score = 699 (251.1 bits), Expect = 6.3e-69, P = 6.3e-69
 Identities = 139/315 (44%), Positives = 194/315 (61%)

Query:    44 LIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQ-RNKEVTSYWLGLNEFADMSHE 102
             +++  + WM +  + Y    EK  R ++  ENLK I+   N    SY LG+NEF D + E
Sbjct:    35 IVDYHQQWMIQFSRVYDDEFEKQLRLQVLTENLKFIESFNNMGNQSYKLGVNEFTDWTKE 94

Query:   103 EFKNKYLGLK------P-QFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCG 155
             EF   Y GL+      P +     +P+  ++  DV    K  DWR +GAVTPVK+QG CG
Sbjct:    95 EFLATYTGLRGVNVTSPFEVVNETKPAWNWTVSDVLGTNK--DWRNEGAVTPVKSQGECG 152

Query:   156 SCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLH 215
              CWAFS +AAVEG+ +I  GNL SLSEQ+L+DC    NNGC GG    AF YI+   G+ 
Sbjct:   153 GCWAFSAIAAVEGLTKIARGNLISLSEQQLLDCTREQNNGCKGGTFVNAFNYIIKHRGIS 212

Query:   216 KEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDF 275
              E +YPY ++EG C  +      + I G+++VP N+E++LL+A++ QPV+VAI+AS   F
Sbjct:   213 SENEYPYQVKEGPC--RSNARPAILIRGFENVPSNNERALLEAVSRQPVAVAIDASEAGF 270

Query:   276 QFYSGGVFTGP-CGAELDHGVAAVGYGKS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGK 333
               YSGGV+    CG  ++H V  VGYG S +G  Y + KNSWG  WGE GYIR++R+   
Sbjct:   271 VHYSGGVYNARNCGTSVNHAVTLVGYGTSPEGMKYWLAKNSWGKTWGENGYIRIRRDVEW 330

Query:   334 PEGLCGINKMASIPL 348
             P+G+CG+ + AS P+
Sbjct:   331 PQGMCGVAQYASYPV 345


>DICTYBASE|DDB_G0283867
            symbol:cprC "cysteine proteinase 3" species:44689
            "Dictyostelium discoideum" [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0016787 "hydrolase activity" evidence=IEA] [GO:0008233
            "peptidase activity" evidence=IEA] [GO:0005764 "lysosome"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            dictyBase:DDB_G0283867 GenomeReviews:CM000153_GR eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 EMBL:AAFI02000057
            KO:K01365 EMBL:X03930 RefSeq:XP_638859.1 ProteinModelPortal:Q23894
            SMR:Q23894 MEROPS:C01.114 EnsemblProtists:DDB0220784 GeneID:8624257
            KEGG:ddi:DDB_G0283867 OMA:NNVEHIN Uniprot:Q23894
        Length = 337

 Score = 681 (244.8 bits), Expect = 5.1e-67, P = 5.1e-67
 Identities = 148/328 (45%), Positives = 198/328 (60%)

Query:    29 SIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTS 88
             SI   S  ++ S  +  + F  WM  + K Y   +E + R+E FK+N+ ++   N + + 
Sbjct:    15 SISFISAGNVFSHKQYQDSFIDWMRSNNKAYTH-KEFMPRYEEFKKNMDYVHNWNSKGSK 73

Query:    89 YWLGLNEFADMSHEEFKNKYLGLKPQFPT----RRQPSAEFSYRDVKALPKSVDWRKKGA 144
               LGLN+ AD+S+EE++  YLG +         +R      +    K  P +VDWR+K A
Sbjct:    74 TVLGLNQHADLSNEEYRLNYLGTRAHIKLNGYHKRNLGLRLNRPQFKQ-PLNVDWREKDA 132

Query:   145 VTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNN-GCNGGLMDY 203
             VTPVK+QG CGSC++FST  +VEG+  I +G L SLSEQ ++DC +SF N GCNGGLM  
Sbjct:   133 VTPVKDQGQCGSCYSFSTTGSVEGVTAIKTGKLVSLSEQNILDCSSSFGNEGCNGGLMTN 192

Query:   204 AFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVT-ISGYQDVPENDEQSLLKALAHQ 262
             AF+YI+ + GL+ EE YPY M+    E K +E  V   I+ Y+++   DE  L  AL   
Sbjct:   193 AFEYIIKNNGLNSEEQYPYEMKVND-ECKFQEGSVAAKITSYKEIEAGDENDLQNALLLN 251

Query:   263 PVSVAIEASGTDFQFYSGGVFTGP-CGAE-LDHGVAAVGYGKSKGSDYIIVKNSWGPKWG 320
             PVSVAI+AS   FQ Y+ GV+  P C +E LDHGV AVG G   G DY IVKNSWGP WG
Sbjct:   252 PVSVAIDASHNSFQLYTAGVYYEPACSSEDLDHGVLAVGMGTDNGEDYYIVKNSWGPSWG 311

Query:   321 ERGYIRMKRNTGKPEGLCGINKMASIPL 348
               GYI M RN    +  CGI+ MAS P+
Sbjct:   312 LNGYIHMARNK---DNNCGISTMASYPI 336


>DICTYBASE|DDB_G0272815
            symbol:cprE "cysteine proteinase 5" species:44689
            "Dictyostelium discoideum" [GO:0005615 "extracellular space"
            evidence=IDA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0005764 "lysosome" evidence=IEA] [GO:0044351
            "macropinocytosis" evidence=RCA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 dictyBase:DDB_G0272815 GO:GO:0005615
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GenomeReviews:CM000151_GR GO:GO:0005764
            EMBL:AAFI02000008 MEROPS:I29.003 KO:K01376 EMBL:L36205
            RefSeq:XP_644977.1 ProteinModelPortal:P54640 SMR:P54640
            PRIDE:P54640 EnsemblProtists:DDB0185092 GeneID:8618654
            KEGG:ddi:DDB_G0272815 OMA:METAFEF ProtClustDB:CLSZ2430780
            Uniprot:P54640
        Length = 344

 Score = 585 (211.0 bits), Expect = 6.8e-67, Sum P(2) = 6.8e-67
 Identities = 124/261 (47%), Positives = 156/261 (59%)

Query:    48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNK 107
             F  WM  H K+Y   EE   R+ IFK N+ ++ Q N + +   LGLN FAD+++EE++N 
Sbjct:    30 FTDWMITHQKSYTS-EEFGARYNIFKANMDYVQQWNSKGSETVLGLNNFADITNEEYRNT 88

Query:   108 YLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVE 167
             YLG K    +      E  +    A  K  DWR +GAVTPVKNQG CG CW+FST  + E
Sbjct:    89 YLGTKFDASSLIGTQEEKVFTTSSAASK--DWRSEGAVTPVKNQGQCGGCWSFSTTGSTE 146

Query:   168 GINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEG 227
             G +    G L SLSEQ LIDC T  N+GC+GGLM YAF+YI+ + G+  E  YPY  E G
Sbjct:   147 GAHFQSKGELVSLSEQNLIDCSTE-NSGCDGGLMTYAFEYIINNNGIDTESSYPYKAENG 205

Query:   228 TCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGP- 286
              CE K E     T+S Y+ V    E SL  A+   PVSVAI+AS   FQ Y+ G++  P 
Sbjct:   206 KCEYKSENSGA-TLSSYKTVTAGSESSLESAVNVNPVSVAIDASHQSFQLYTSGIYYEPE 264

Query:   287 CGAE-LDHGVAAVGYGKSKGS 306
             C +E LDHGV AVGYG   GS
Sbjct:   265 CSSENLDHGVLAVGYGSGSGS 285

 Score = 113 (44.8 bits), Expect = 6.8e-67, Sum P(2) = 6.8e-67
 Identities = 24/55 (43%), Positives = 30/55 (54%)

Query:   294 GVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPL 348
             G ++     S  ++Y IVKNSWG  WG  GYI M RN    +  CGI   AS P+
Sbjct:   292 GQSSGNLSASSSNEYWIVKNSWGTSWGIEGYILMSRNR---DNNCGIASSASFPV 343


>FB|FBgn0013770
            symbol:Cp1 "Cysteine proteinase-1" species:7227 "Drosophila
            melanogaster" [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=ISS;NAS] [GO:0030163 "protein catabolic process"
            evidence=NAS] [GO:0005764 "lysosome" evidence=NAS] [GO:0048102
            "autophagic cell death" evidence=IEP] [GO:0035071 "salivary gland
            cell autophagic cell death" evidence=IEP] [GO:0006508 "proteolysis"
            evidence=IEA] [GO:0045169 "fusome" evidence=IDA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 EMBL:AE013599 GO:GO:0007586 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GO:GO:0035071 GO:GO:0045169 GeneTree:ENSGT00660000095458 KO:K01365
            EMBL:U75652 EMBL:AF012089 EMBL:BT016071 EMBL:D31970
            RefSeq:NP_523735.2 RefSeq:NP_725347.1 RefSeq:NP_725348.1
            UniGene:Dm.7400 ProteinModelPortal:Q95029 SMR:Q95029 IntAct:Q95029
            MINT:MINT-814156 STRING:Q95029 MEROPS:C01.092 PaxDb:Q95029
            EnsemblMetazoa:FBtr0087593 GeneID:36546 KEGG:dme:Dmel_CG6692
            CTD:36546 FlyBase:FBgn0013770 InParanoid:Q95029 OMA:ICHGADP
            OrthoDB:EOG46M91C PhylomeDB:Q95029 GenomeRNAi:36546 NextBio:799136
            Bgee:Q95029 GermOnline:CG6692 Uniprot:Q95029
        Length = 371

 Score = 653 (234.9 bits), Expect = 4.7e-64, P = 4.7e-64
 Identities = 145/324 (44%), Positives = 199/324 (61%)

Query:    42 DKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT----SYWLGLNEFA 97
             D ++E + ++  +H K Y+   E+  R +IF EN   I + N+       S+ L +N++A
Sbjct:    53 DVVMEEWHTFKLEHRKNYQDETEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYA 112

Query:    98 DMSHEEFKNKYLGLKPQFPTRRQ-PSAEFSYRDVK-------ALPKSVDWRKKGAVTPVK 149
             D+ H EF+    G    +   +Q  +A+ S++ V         LPKSVDWR KGAVT VK
Sbjct:   113 DLLHHEFRQLMNGFN--YTLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVK 170

Query:   150 NQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYI 208
             +QG CGSCWAFS+  A+EG +   SG L SLSEQ L+DC T + NNGCNGGLMD AF+YI
Sbjct:   171 DQGHCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYI 230

Query:   209 VASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQ-PVSVA 267
               +GG+  E+ YPY   + +C   K  +   T  G+ D+P+ DE+ + +A+A   PVSVA
Sbjct:   231 KDNGGIDTEKSYPYEAIDDSCHFNKGTVGA-TDRGFTDIPQGDEKKMAEAVATVGPVSVA 289

Query:   268 IEASGTDFQFYSGGVFTGP-CGAE-LDHGVAAVGYGKSK-GSDYIIVKNSWGPKWGERGY 324
             I+AS   FQFYS GV+  P C A+ LDHGV  VG+G  + G DY +VKNSWG  WG++G+
Sbjct:   290 IDASHESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGEDYWLVKNSWGTTWGDKGF 349

Query:   325 IRMKRNTGKPEGLCGINKMASIPL 348
             I+M RN    E  CGI   +S PL
Sbjct:   350 IKMLRNK---ENQCGIASASSYPL 370


>UNIPROTKB|F1NYJ1
            symbol:CTSL2 "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GeneTree:ENSGT00560000076577 IPI:IPI00602255
            OMA:DITHHEF EMBL:AADN02067812 Ensembl:ENSGALT00000020588
            ArrayExpress:F1NYJ1 Uniprot:F1NYJ1
        Length = 339

 Score = 652 (234.6 bits), Expect = 6.0e-64, P = 6.0e-64
 Identities = 141/313 (45%), Positives = 189/313 (60%)

Query:    48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT----SYWLGLNEFADMSHEE 103
             ++ W S H K Y   EE   R  ++++NLK I+  N + +    SY LG+N+F DM+ EE
Sbjct:    30 WQLWKSWHSKDYHEREESWRRV-VWEKNLKMIELHNLDHSLGKHSYKLGMNQFGDMTAEE 88

Query:   104 FKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTV 163
             F+    G K +   R+   ++F        P+SVDWR+KG VTPVK+QG CGSCWAFST 
Sbjct:    89 FRQLMNGYKHKKSERKYRGSQFLEPSFLEAPRSVDWREKGYVTPVKDQGQCGSCWAFSTT 148

Query:   164 AAVEGINQIVSGNLTSLSEQELIDCDT-SFNNGCNGGLMDYAFKYIVASGGLHKEEDYPY 222
              A+EG +   +G L SLSEQ L+DC     N GCNGGLMD AF+Y+  +GG+  EE YPY
Sbjct:   149 GALEGQHFRKTGKLVSLSEQNLVDCSRPEGNQGCNGGLMDQAFQYVQDNGGIDSEESYPY 208

Query:   223 LMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQ-PVSVAIEASGTDFQFYSGG 281
               ++      K E      +G+ D+P+  E++L+KA+A   PVSVAI+A  + FQFY  G
Sbjct:   209 TAKDDEDCRYKAEYNAANDTGFVDIPQGHERALMKAVASVGPVSVAIDAGHSSFQFYQSG 268

Query:   282 VFTGP-CGAE-LDHGVAAVGYGKS----KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPE 335
             ++  P C +E LDHGV  VGYG       G  Y IVKNSWG KWG++GYI M ++    +
Sbjct:   269 IYYEPDCSSEDLDHGVLVVGYGFEGEDVDGKKYWIVKNSWGEKWGDKGYIYMAKDR---K 325

Query:   336 GLCGINKMASIPL 348
               CGI   AS PL
Sbjct:   326 NHCGIATAASYPL 338


>ZFIN|ZDB-GENE-040718-61
            symbol:ctsl.1 "cathepsin L.1" species:7955 "Danio
            rerio" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0016787 "hydrolase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 ZFIN:ZDB-GENE-040718-61
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 HOVERGEN:HBG011513
            GeneTree:ENSGT00660000095458 MEROPS:C01.092 EMBL:FP015965
            EMBL:BC075887 IPI:IPI00513499 RefSeq:NP_001002368.1
            UniGene:Dr.85174 SMR:Q6DHT0 Ensembl:ENSDART00000017756
            GeneID:436641 KEGG:dre:436641 CTD:436641 InParanoid:Q6DHT0
            OMA:GGQMENA OrthoDB:EOG41ZFB9 NextBio:20831086 Uniprot:Q6DHT0
        Length = 334

 Score = 645 (232.1 bits), Expect = 3.3e-63, P = 3.3e-63
 Identities = 143/312 (45%), Positives = 186/312 (59%)

Query:    48 FESWMSKHGKTYKCIEEKLHRFEIFKENLK----HIDQRNKEVTSYWLGLNEFADMSHEE 103
             F +W  K GK+Y+  EE+ HR   +  N K    H    ++ + SY LG+  FADMS+EE
Sbjct:    26 FHAWKLKFGKSYRSAEEESHRQLTWLTNRKLVLVHNMMADQGLKSYRLGMTYFADMSNEE 85

Query:   104 FKNK-YLG-LKPQFPTR-RQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAF 160
             ++   + G L     T+ R  S  F  R    +P +VDWR KG VT +K+Q  CGSCWAF
Sbjct:    86 YRQLVFRGCLGSMNNTKARGGSTFFRLRKAAVVPDTVDWRDKGYVTDIKDQKQCGSCWAF 145

Query:   161 STVAAVEGINQIVSGNLTSLSEQELIDCDTSFNN-GCNGGLMDYAFKYIVASGGLHKEED 219
             S   ++EG     +G L SLSEQ+L+DC  S+ N GC+GGLMD AF+YI A+ GL  E+ 
Sbjct:   146 SATGSLEGQTFRKTGKLVSLSEQQLVDCSGSYGNYGCDGGLMDQAFQYIEANKGLDTEDS 205

Query:   220 YPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASGTDFQFY 278
             YPY  ++G C      +   + +GY D+   DE +L +A+A   P+SVAI+A  + FQ Y
Sbjct:   206 YPYEAQDGECRFNPSTVGA-SCTGYVDIASGDESALQEAVATIGPISVAIDAGHSSFQLY 264

Query:   279 SGGVFTGP-CGA-ELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEG 336
             S GV+  P C + ELDHGV AVGYG S G DY IVKNSWG  WG +GYI M RN      
Sbjct:   265 SSGVYNEPDCSSSELDHGVLAVGYGSSNGDDYWIVKNSWGLDWGVQGYILMSRNKSNQ-- 322

Query:   337 LCGINKMASIPL 348
              CGI   AS PL
Sbjct:   323 -CGIATAASYPL 333


>DICTYBASE|DDB_G0279185
            symbol:cprF "cysteine proteinase 6" species:44689
            "Dictyostelium discoideum" [GO:0005615 "extracellular space"
            evidence=IDA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0005764 "lysosome" evidence=IEA] [GO:0044351
            "macropinocytosis" evidence=RCA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 dictyBase:DDB_G0279185 GO:GO:0005615
            GenomeReviews:CM000152_GR eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            EMBL:AAFI02000030 HSSP:P07711 ProtClustDB:CLSZ2846820 EMBL:U72745
            RefSeq:XP_641725.1 ProteinModelPortal:Q94503 SMR:Q94503
            MEROPS:C01.081 PRIDE:Q94503 EnsemblProtists:DDB0215002
            GeneID:8621921 KEGG:ddi:DDB_G0279185 Uniprot:Q94503
        Length = 434

 Score = 545 (196.9 bits), Expect = 9.9e-62, Sum P(2) = 9.9e-62
 Identities = 117/265 (44%), Positives = 157/265 (59%)

Query:    48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNK 107
             F +WM  H + Y   EE   RF IFK N+ +I++ N + +   LGLN FAD+++EE++  
Sbjct:    30 FTNWMIAHQRHYSS-EEFNGRFNIFKANMDYINEWNTKGSETVLGLNVFADITNEEYRAT 88

Query:   108 YLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVE 167
             YLG      +     +E  +  V+A   SVDWR KGAVTP+KNQG CG CW+FS   A E
Sbjct:    89 YLGTPFDASSLEMTPSEKVFGGVQA--NSVDWRAKGAVTPIKNQGECGGCWSFSATGATE 146

Query:   168 GINQIVSGN--LTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLM 224
             G   I +G+  LTS+SEQ+LIDC  S+ NNGC GGLM  AF+YI+ +GG+  E  YP+  
Sbjct:   147 GAQYIANGDSDLTSVSEQQLIDCSGSYGNNGCEGGLMTLAFEYIINNGGIDTESSYPFTA 206

Query:   225 EEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFT 284
                 C+     +    +S Y +V    E  L   +   P SVAI+AS   FQFYS G++ 
Sbjct:   207 NTEKCKYNPSNIGA-ELSSYVNVTSGSESDLAAKVTQGPTSVAIDASQPSFQFYSSGIYN 265

Query:   285 GP-CGA-ELDHGVAAVGYGK-SKGS 306
              P C + +LDHGV AVG+G  S GS
Sbjct:   266 EPACSSTQLDHGVLAVGFGSGSSGS 290

 Score = 104 (41.7 bits), Expect = 9.9e-62, Sum P(2) = 9.9e-62
 Identities = 22/41 (53%), Positives = 26/41 (63%)

Query:   307 DYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
             +Y IVKNSWG  WG  GYI M ++    +  CGI  MASIP
Sbjct:   388 NYWIVKNSWGLDWGINGYILMSKDK---DNQCGIATMASIP 425


>ZFIN|ZDB-GENE-050626-55
            symbol:ctssb.2 "cathepsin S, b.2" species:7955
            "Danio rerio" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0016787 "hydrolase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 ZFIN:ZDB-GENE-050626-55
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 HOVERGEN:HBG011513
            KO:K01368 EMBL:BC093339 IPI:IPI00507098 RefSeq:NP_001017661.1
            UniGene:Dr.132688 ProteinModelPortal:Q566T8 SMR:Q566T8
            GeneID:337572 KEGG:dre:337572 CTD:337572 InParanoid:Q566T8
            NextBio:20812306 ArrayExpress:Q566T8 Uniprot:Q566T8
        Length = 330

 Score = 631 (227.2 bits), Expect = 1.0e-61, P = 1.0e-61
 Identities = 132/308 (42%), Positives = 184/308 (59%)

Query:    48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT----SYWLGLNEFADMSHEE 103
             +E W  KH K Y C +E++ R E+++ NL+ I   N E +    SY L +N  ADM+ EE
Sbjct:    27 WELWKKKHVKLYSCEDEEVGRRELWERNLELIAIHNLEASMGMHSYDLAINHMADMTTEE 86

Query:   104 FKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTV 163
                  L +    P  ++P+AE+       +P ++DWR KG VT VKNQG+CGSCWAFS+V
Sbjct:    87 ILQT-LAVTRVPPGFKRPTAEYVSSSFAVVPDTLDWRDKGYVTSVKNQGACGSCWAFSSV 145

Query:   164 AAVEGINQIVSGNLTSLSEQELIDCDTSFNN-GCNGGLMDYAFKYIVASGGLHKEEDYPY 222
              A+EG     +G L  LS Q L+DC + + N GCNGG M  AF+Y++ +GG+  E  YPY
Sbjct:   146 GALEGQLMKTTGKLVDLSPQNLVDCSSKYGNLGCNGGYMSQAFQYVIDNGGIDSESSYPY 205

Query:   223 LMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASGTDFQFYSGG 281
                +G+C     +      + Y+ V + DEQ+L +ALA+  PVSVAI+A+   F FY  G
Sbjct:   206 QGTQGSCRYDPSQ-RAANCTSYKFVSQGDEQALKEALANIGPVSVAIDATRPQFIFYRSG 264

Query:   282 VFTGP-CGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGI 340
             V+  P C  +++HGV AVGYG   G DY +VKNSWG  +G+ GYIR+ RN      +CGI
Sbjct:   265 VYDDPSCTQKVNHGVLAVGYGTLSGQDYWLVKNSWGAGFGDGGYIRIARNKNN---MCGI 321

Query:   341 NKMASIPL 348
                A  P+
Sbjct:   322 ASEACYPI 329


>UNIPROTKB|Q3T0I2
            symbol:CTSH "Pro-cathepsin H" species:9913 "Bos taurus"
            [GO:0031638 "zymogen activation" evidence=ISS] [GO:0031648 "protein
            destabilization" evidence=ISS] [GO:0004197 "cysteine-type
            endopeptidase activity" evidence=ISS] [GO:0010815 "bradykinin
            catabolic process" evidence=ISS] [GO:0043129 "surfactant
            homeostasis" evidence=ISS] [GO:0010813 "neuropeptide catabolic
            process" evidence=ISS] [GO:0005829 "cytosol" evidence=ISS]
            [GO:0045766 "positive regulation of angiogenesis" evidence=ISS]
            [GO:0008284 "positive regulation of cell proliferation"
            evidence=ISS] [GO:0008234 "cysteine-type peptidase activity"
            evidence=ISS] [GO:0030335 "positive regulation of cell migration"
            evidence=ISS] [GO:0001656 "metanephros development" evidence=ISS]
            [GO:0033619 "membrane protein proteolysis" evidence=ISS]
            [GO:0043066 "negative regulation of apoptotic process"
            evidence=ISS] [GO:0004252 "serine-type endopeptidase activity"
            evidence=ISS] [GO:0001913 "T cell mediated cytotoxicity"
            evidence=ISS] [GO:0016505 "apoptotic protease activator activity"
            evidence=ISS] [GO:0010952 "positive regulation of peptidase
            activity" evidence=ISS] [GO:0030108 "HLA-A specific activating MHC
            class I receptor activity" evidence=ISS] [GO:0002764 "immune
            response-regulating signaling pathway" evidence=ISS] [GO:0070371
            "ERK1 and ERK2 cascade" evidence=ISS] [GO:0010628 "positive
            regulation of gene expression" evidence=ISS] [GO:0070324 "thyroid
            hormone binding" evidence=ISS] [GO:0006508 "proteolysis"
            evidence=ISS] [GO:0060448 "dichotomous subdivision of terminal
            units involved in lung branching" evidence=ISS] [GO:0097208
            "alveolar lamellar body" evidence=ISS] [GO:0005764 "lysosome"
            evidence=ISS] [GO:0005615 "extracellular space" evidence=ISS]
            [GO:0004177 "aminopeptidase activity" evidence=ISS] [GO:0004175
            "endopeptidase activity" evidence=ISS] [GO:0032526 "response to
            retinoic acid" evidence=ISS] [GO:0010634 "positive regulation of
            epithelial cell migration" evidence=ISS] [GO:0005515 "protein
            binding" evidence=IPI] [GO:0097067 "cellular response to thyroid
            hormone stimulus" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0005829 GO:GO:0043066 GO:GO:0005615
            GO:GO:0008284 GO:GO:0070371 eggNOG:COG4870 HOGENOM:HOG000230774
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766 GO:GO:0004177
            GO:GO:0004252 GO:GO:0005764 GO:GO:0097208 GO:GO:0032526
            GO:GO:0010628 GO:GO:0070324 KO:K01366 GO:GO:0016505 GO:GO:0001656
            GO:GO:0010634 GO:GO:0004197 GO:GO:0031648 HOVERGEN:HBG011513
            GO:GO:0097067 GO:GO:0031638 GO:GO:0001913
            GeneTree:ENSGT00660000095458 EMBL:BC102386 IPI:IPI00693034
            RefSeq:NP_001029557.1 UniGene:Bt.52393 ProteinModelPortal:Q3T0I2
            SMR:Q3T0I2 STRING:Q3T0I2 MEROPS:C01.040 PRIDE:Q3T0I2
            Ensembl:ENSBTAT00000014593 GeneID:510524 KEGG:bta:510524 CTD:1512
            InParanoid:Q3T0I2 OMA:STSCHKT OrthoDB:EOG4W9J43 NextBio:20869490
            GO:GO:0030108 GO:GO:0010815 GO:GO:0060448 GO:GO:0002764
            GO:GO:0033619 GO:GO:0010813 GO:GO:0043129 Uniprot:Q3T0I2
        Length = 335

 Score = 630 (226.8 bits), Expect = 1.3e-61, P = 1.3e-61
 Identities = 133/309 (43%), Positives = 188/309 (60%)

Query:    48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNK 107
             F+SWM +H K Y   EE  HR + F  NL+ I+  N    ++ +GLN+F+DMS +E K K
Sbjct:    35 FQSWMVQHQKKYSS-EEYYHRLQAFASNLREINAHNARNHTFKMGLNQFSDMSFDELKRK 93

Query:   108 YLGLKPQFPTRRQPSAEFSY-RDVKALPKSVDWRKKGA-VTPVKNQGSCGSCWAFSTVAA 165
             YL  +PQ       + + +Y R     P S+DWRKKG  VTPVKNQGSCGSCW FST  A
Sbjct:    94 YLWSEPQ----NCSATKSNYLRGTGPYPPSMDWRKKGNFVTPVKNQGSCGSCWTFSTTGA 149

Query:   166 VEGINQIVSGNLTSLSEQELIDCDTSFNN-GCNGGLMDYAFKYIVASGGLHKEEDYPYLM 224
             +E    I +G L  L+EQ+L+DC  +FNN GC GGL   AF+YI  + G+  E+ YPY  
Sbjct:   150 LESAVAIATGKLPFLAEQQLVDCAQNFNNHGCQGGLPSQAFEYIRYNKGIMGEDTYPYRG 209

Query:   225 EEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALA-HQPVSVAIEASGTDFQFYSGGVF 283
             ++G C+ +  +  +  +    ++  NDE+++++A+A H PVS A E +  DF  Y  G++
Sbjct:   210 QDGDCKYQPSKA-IAFVKDVANITLNDEEAMVEAVALHNPVSFAFEVTA-DFMMYRKGIY 267

Query:   284 TGP-CGA---ELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCG 339
             +   C     +++H V AVGYG+ KG  Y IVKNSWGP WG +GY  ++R  GK   +CG
Sbjct:   268 SSTSCHKTPDKVNHAVLAVGYGEEKGIPYWIVKNSWGPNWGMKGYFLIER--GK--NMCG 323

Query:   340 INKMASIPL 348
             +   AS P+
Sbjct:   324 LAACASFPI 332


>ZFIN|ZDB-GENE-001205-4
            symbol:ctsk "cathepsin K" species:7955 "Danio rerio"
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0016787 "hydrolase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 InterPro:IPR015644
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            ZFIN:ZDB-GENE-001205-4 HOGENOM:HOG000230774 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 HOVERGEN:HBG011513
            CTD:1513 KO:K01371 OrthoDB:EOG4SJ5FC PANTHER:PTHR12411:SF55
            EMBL:BC092901 IPI:IPI00512751 RefSeq:NP_001017778.1
            UniGene:Dr.76224 ProteinModelPortal:Q568D6 SMR:Q568D6 GeneID:550475
            KEGG:dre:550475 InParanoid:Q568D6 NextBio:20879718
            ArrayExpress:Q568D6 Uniprot:Q568D6
        Length = 333

 Score = 626 (225.4 bits), Expect = 3.4e-61, P = 3.4e-61
 Identities = 138/320 (43%), Positives = 191/320 (59%)

Query:    40 SMDKLI--ELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKE----VTSYWLGL 93
             S+D L   E +ESW   H + Y  + E+  R  I+++N+  I+  NKE    + +Y LG+
Sbjct:    20 SLDNLSLDEAWESWKITHKREYNGLNEESIRRTIWEKNMLFIEAHNKEYELGIHTYDLGM 79

Query:    94 NEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRD-VKALPKSVDWRKKGAVTPVKNQG 152
             N F DM+ EE   K +GL  Q P  R P+  F   D V  LPKS+D+RK G VT VKNQG
Sbjct:    80 NHFGDMTLEEVAEKVMGL--QMPMYRDPANTFVPDDRVGKLPKSIDYRKLGYVTSVKNQG 137

Query:   153 SCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASG 212
             SCGSCWAFS+V A+EG      G L  LS Q L+DC T  N+GC GG M  AF+Y+  + 
Sbjct:   138 SCGSCWAFSSVGALEGQLMKTKGQLVDLSPQNLVDCVTE-NDGCGGGYMTNAFRYVSNNQ 196

Query:   213 GLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQ-PVSVAIEAS 271
             G+  EE YPY+  +  C      +   +  GY+++P+ +E++L  A+A+  PVSV I+A 
Sbjct:   197 GIDSEESYPYVGTDQQCAYNTSGV-AASCRGYKEIPQGNERALTAAVANVGPVSVGIDAM 255

Query:   272 GTDFQFYSGGVFTGP-CGAE-LDHGVAAVGYGKS-KGSDYIIVKNSWGPKWGERGYIRMK 328
              + F +Y  GV+  P C  E ++H V AVGYG + +G  Y IVKNSWG +WG++GY+ M 
Sbjct:   256 QSTFLYYKSGVYYDPNCNKEDVNHAVLAVGYGATPRGKKYWIVKNSWGEEWGKKGYVLMA 315

Query:   329 RNTGKPEGLCGINKMASIPL 348
             RN       CGI  +AS P+
Sbjct:   316 RNRNNA---CGIANLASFPV 332


>ZFIN|ZDB-GENE-030131-106
            symbol:ctsl1a "cathepsin L, 1 a" species:7955
            "Danio rerio" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            ZFIN:ZDB-GENE-030131-106 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 HOVERGEN:HBG011513 HSSP:P43235
            KO:K01365 EMBL:BC066490 IPI:IPI00495935 RefSeq:NP_997749.1
            UniGene:Dr.104499 ProteinModelPortal:Q6NYR5 SMR:Q6NYR5
            MEROPS:C01.074 PRIDE:Q6NYR5 GeneID:321453 KEGG:dre:321453
            CTD:321453 InParanoid:Q6NYR5 NextBio:20807387 ArrayExpress:Q6NYR5
            Bgee:Q6NYR5 Uniprot:Q6NYR5
        Length = 337

 Score = 626 (225.4 bits), Expect = 3.4e-61, P = 3.4e-61
 Identities = 142/320 (44%), Positives = 184/320 (57%)

Query:    43 KLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKE----VTSYWLGLNEFAD 98
             +L + ++ W   H K Y   EE   R  I+++NLK I+  N E    + +Y LG+N F D
Sbjct:    24 QLNDHWDQWKKWHSKKYHATEEGWRRV-IWEKNLKKIEMHNLEHSMGIHTYRLGMNHFGD 82

Query:    99 MSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCW 158
             M+HEEF+    G K +   RR   + F   +   +P  +DWR+KG VTPVK+QG CGSCW
Sbjct:    83 MTHEEFRQVMNGFKHK-KDRRFRGSLFMEPNFIEVPNKLDWREKGYVTPVKDQGECGSCW 141

Query:   159 AFSTVAAVEGINQIVSGNLTSLSEQELIDCDT-SFNNGCNGGLMDYAFKYIVASGGLHKE 217
             AFST  A+EG     +G L SLSEQ L+DC     N GCNGGLMD AF+Y+    GL  E
Sbjct:   142 AFSTTGALEGQMFRKTGKLVSLSEQNLVDCSRPEGNEGCNGGLMDQAFQYVKDQNGLDSE 201

Query:   218 EDYPYL-MEEGTCE-DKKEEMEVVTISGYQDVPENDEQSLLKALAHQ-PVSVAIEASGTD 274
             E YPYL  ++  C  D K        +G+ D+P   E++L+KA+A   PVSVAI+A    
Sbjct:   202 ESYPYLGTDDQPCHFDPKNS--AANDTGFVDIPSGKERALMKAIAAVGPVSVAIDAGHES 259

Query:   275 FQFYSGGVF-TGPCGAE-LDHGVAAVGYGKS----KGSDYIIVKNSWGPKWGERGYIRMK 328
             FQFY  G++    C +E LDHGV AVGYG       G  Y IVKNSW   WG++GYI M 
Sbjct:   260 FQFYQSGIYYEKECSSEELDHGVLAVGYGFEGEDVDGKKYWIVKNSWSENWGDKGYIYMA 319

Query:   329 RNTGKPEGLCGINKMASIPL 348
             ++       CGI   AS PL
Sbjct:   320 KDR---HNHCGIATAASYPL 336


>DICTYBASE|DDB_G0278721
            symbol:cprD "cysteine proteinase 4" species:44689
            "Dictyostelium discoideum" [GO:0005615 "extracellular space"
            evidence=IDA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0005764 "lysosome" evidence=IEA] [GO:0044351
            "macropinocytosis" evidence=RCA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 dictyBase:DDB_G0278721 GO:GO:0005615
            GenomeReviews:CM000152_GR eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            EMBL:AAFI02000024 EMBL:L36204 RefSeq:XP_641963.1
            ProteinModelPortal:P54639 SMR:P54639 MEROPS:C01.A57 PRIDE:P54639
            EnsemblProtists:DDB0214999 GeneID:8621695 KEGG:ddi:DDB_G0278721
            OMA:NAFADIT ProtClustDB:CLSZ2846820 Uniprot:P54639
        Length = 442

 Score = 540 (195.1 bits), Expect = 5.4e-61, Sum P(2) = 5.4e-61
 Identities = 120/269 (44%), Positives = 158/269 (58%)

Query:    48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNK 107
             F +WM  H +TY   EE   R++IFK N+ ++ Q N +     LGLN FAD++++E++  
Sbjct:    30 FTNWMQAHQRTYSS-EEFNARYQIFKSNMDYVHQWNSKGGETVLGLNVFADITNQEYRTT 88

Query:   108 YLGLKPQFPTRRQPSAEFSYRDVK--ALPK-SVDWRKKGAVTPVKNQGSCGSCWAFSTVA 164
             YLG      T    SA     + K  + P  +VDWR +GAVTP+KNQG CG CW+FST  
Sbjct:    89 YLG------TPFDGSALIGTEEEKIFSTPAPTVDWRAQGAVTPIKNQGQCGGCWSFSTTG 142

Query:   165 AVEGINQIVSG---NLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDY 220
             + EG + I SG   +L SLSEQ LIDC  S+ NNGC GGLM  AF+YI+ + G+  E  Y
Sbjct:   143 STEGAHFIASGTKKDLVSLSEQNLIDCSKSYGNNGCEGGLMTLAFEYIINNKGIDTESSY 202

Query:   221 PYLMEEGT-CEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYS 279
             PY  E+G  C+ K   +    +S YQ+V    E SL  A  + PVSVAI+AS   FQ Y 
Sbjct:   203 PYTAEDGKECKFKTSNIGAQIVS-YQNVTSGSEASLQSASNNAPVSVAIDASNESFQLYE 261

Query:   280 GGVFTGP-CG-AELDHGVAAVGYGKSKGS 306
              G++  P C   +LDHGV  VGYG    S
Sbjct:   262 SGIYYEPACSPTQLDHGVLVVGYGSGSSS 290

 Score = 102 (41.0 bits), Expect = 5.4e-61, Sum P(2) = 5.4e-61
 Identities = 22/49 (44%), Positives = 27/49 (55%)

Query:   299 GYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
             G  ++   +Y IVKNSWG  WG  GYI M ++       CGI  MAS P
Sbjct:   392 GAVEASSGNYWIVKNSWGTSWGMDGYIFMSKDRNNN---CGIATMASFP 437


>DICTYBASE|DDB_G0281605
            symbol:cfaD "peptidase C1A family protein"
            species:44689 "Dictyostelium discoideum" [GO:0005615 "extracellular
            space" evidence=IDA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IDA] [GO:0006508 "proteolysis" evidence=IDA] [GO:0031410
            "cytoplasmic vesicle" evidence=IDA] [GO:0031288 "sorocarp
            morphogenesis" evidence=IMP] [GO:0008285 "negative regulation of
            cell proliferation" evidence=IGI;IDA] [GO:0005576 "extracellular
            region" evidence=IEA;IDA] [GO:0005515 "protein binding"
            evidence=IPI] [GO:0044351 "macropinocytosis" evidence=RCA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 dictyBase:DDB_G0281605
            GO:GO:0008285 GO:GO:0005615 GenomeReviews:CM000152_GR
            eggNOG:COG4870 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0031410 EMBL:AAFI02000042
            GO:GO:0031288 RefSeq:XP_640530.1 HSSP:P07711
            ProteinModelPortal:Q54TR1 STRING:Q54TR1 PRIDE:Q54TR1
            EnsemblProtists:DDB0229857 GeneID:8623140 KEGG:ddi:DDB_G0281605
            InParanoid:Q54TR1 OMA:PSAHEHE ProtClustDB:CLSZ2430523
            Uniprot:Q54TR1
        Length = 531

 Score = 624 (224.7 bits), Expect = 5.5e-61, P = 5.5e-61
 Identities = 130/329 (39%), Positives = 192/329 (58%)

Query:    28 FSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT 87
             FS +G     L   ++   LF+ + +++ K Y   +E   RF  FK   K I   N + +
Sbjct:   207 FSSIG--DNLLAKEEQASNLFKEYKAQYNKEYSSQDEHDERFINFKAARKIIATHNAKES 264

Query:    88 SYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRD--VKALPKSVDWRKKGAV 145
             SY LG+N +AD+S++EF      +KP+        A+  + D  ++++P +VDWR +  V
Sbjct:   265 SYKLGMNHYADLSNKEFNTL---VKPKVARPSVTGADSVHDDESLRSIPSTVDWRNQNCV 321

Query:   146 TPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDT-SFNNGCNGGLMDYA 204
             TPVK+QG CGSCW F +  ++EG N + +G L SLSEQ+L+DC   + + GC GG    A
Sbjct:   322 TPVKDQGICGSCWTFGSTGSLEGTNCVTNGELVSLSEQQLVDCAILTGSQGCGGGFASSA 381

Query:   205 FKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQ-P 263
             F+Y++  G L  E +YPYLM+ G C D+      V+I+GY +V    E +L  A+A   P
Sbjct:   382 FQYVMEIGSLATESNYPYLMQNGLCRDRTVTPSGVSITGYVNVTSGSESALQNAIATTGP 441

Query:   264 VSVAIEASGTDFQFYSGGVFTGP-C--GAE-LDHGVAAVGYGKSKGSDYIIVKNSWGPKW 319
             V++AI+AS  DF++Y  GV+  P C  G + LDH V A+GYG  +G DY +VKNSW   W
Sbjct:   442 VAIAIDASVDDFRYYMSGVYNNPACKNGLDDLDHEVLAIGYGTYQGQDYFLVKNSWSTNW 501

Query:   320 GERGYIRMKRNTGKPEGLCGINKMASIPL 348
             G  GY+ M RN      LCG++  A+ P+
Sbjct:   502 GMDGYVYMARNDNN---LCGVSSQATYPI 527


>TAIR|locus:2030027
            symbol:AT1G29110 species:3702 "Arabidopsis thaliana"
            [GO:0006508 "proteolysis" evidence=IEA;ISS] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA;ISS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 SMART:SM00645
            EMBL:CP002684 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00639
            IPI:IPI00544534 RefSeq:NP_564322.1 UniGene:At.51816
            ProteinModelPortal:F4HZW2 SMR:F4HZW2 EnsemblPlants:AT1G29110.1
            GeneID:839786 KEGG:ath:AT1G29110 OMA:SCRANAR Uniprot:F4HZW2
        Length = 334

 Score = 624 (224.7 bits), Expect = 5.5e-61, P = 5.5e-61
 Identities = 127/321 (39%), Positives = 191/321 (59%)

Query:    37 HLTSMDK-LIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQ-RNKEVTSYWLGLN 94
             H+T  ++ +++  + WM++  + YK   EK  R ++FK+NLK I+   N    SY LG+N
Sbjct:    26 HVTLNEQSIVDYHQQWMTQFSRVYKDESEKEMRLKVFKKNLKFIENFNNMGNQSYTLGVN 85

Query:    95 EFADMSHEEFKNKYLGLKPQFPT------RRQPSAEFSYRDVKALPKSVDWRKKGAVTPV 148
             EF D   EEF   + GL+    +      + +PS  ++  D+    +S DWR +GAVTPV
Sbjct:    86 EFTDWKTEEFLATHTGLRVNVTSLSELFNKTKPSRNWNMSDIDMEDESKDWRDEGAVTPV 145

Query:   149 KNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYI 208
             K QG+C              + +I   NL +LSEQ+LIDCD   N GCNGG  + AFKYI
Sbjct:   146 KYQGACR-------------LTKISGKNLLTLSEQQLIDCDIEKNGGCNGGEFEEAFKYI 192

Query:   209 VASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAI 268
             + +GG+  E +YPY +++ +C           I G+Q VP ++E++LL+A+  QPVSV I
Sbjct:   193 IKNGGVSLETEYPYQVKKESCRANARRAPHTQIRGFQMVPSHNERALLEAVRRQPVSVLI 252

Query:   269 EASGTDFQFYSGGVFTG-PCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRM 327
             +A    F  Y GGV+ G  CG +++H V  VGYG   G +Y ++KNSWG  WGE GY+R+
Sbjct:   253 DARADSFGHYKGGVYAGLDCGTDVNHAVTIVGYGTMSGLNYWVLKNSWGESWGENGYMRI 312

Query:   328 KRNTGKPEGLCGINKMASIPL 348
             +R+   P+G+CGI ++A+ P+
Sbjct:   313 RRDVEWPQGMCGIAQVAAYPV 333


>DICTYBASE|DDB_G0279799
            symbol:cprB "cysteine proteinase 2" species:44689
            "Dictyostelium discoideum" [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0016787 "hydrolase activity" evidence=IEA] [GO:0008233
            "peptidase activity" evidence=IEA] [GO:0005764 "lysosome"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            dictyBase:DDB_G0279799 GenomeReviews:CM000152_GR eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            MEROPS:I29.003 KO:K01365 EMBL:AAFI02000033 EMBL:M16039 EMBL:X03344
            PIR:A25439 RefSeq:XP_641494.1 ProteinModelPortal:P04989 SMR:P04989
            EnsemblProtists:DDB0214998 GeneID:8622234 KEGG:ddi:DDB_G0279799
            OMA:YVNITAG Uniprot:P04989
        Length = 376

 Score = 534 (193.0 bits), Expect = 6.9e-61, Sum P(2) = 6.9e-61
 Identities = 114/267 (42%), Positives = 154/267 (57%)

Query:    48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYW-LGLNEFADMSHEEFKN 106
             F  W  K  + Y   E   +R+ IFK N+ ++D  N +  S   LGLN FAD+++EE++ 
Sbjct:    36 FTEWTLKFNRQYSSSEFS-NRYSIFKSNMDYVDNWNSKGDSQTVLGLNNFADITNEEYRK 94

Query:   107 KYLGLKPQFPTRRQPSAE--FSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVA 164
              YLG +    +          +  D++  PKS+DWR K AVTP+K+QG CGSCW+FST  
Sbjct:    95 TYLGTRVNAHSYNGYDGREVLNVEDLQTNPKSIDWRTKNAVTPIKDQGQCGSCWSFSTTG 154

Query:   165 AVEGINQIVSGNLTSLSEQELIDCDTSFNN-GCNGGLMDYAFKYIVASGGLHKEEDYPYL 223
             + EG + + +  L SLSEQ L+DC     N GC+GGLM+ AF YI+ + G+  E  YPY 
Sbjct:   155 STEGAHALKTKKLVSLSEQNLVDCSGPEENFGCDGGLMNNAFDYIIKNKGIDTESSYPYT 214

Query:   224 MEEG-TCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGV 282
              E G TC   K ++   TI GY ++    E SL     H PVSVAI+AS   FQ Y+ G+
Sbjct:   215 AETGSTCLFNKSDIGA-TIKGYVNITAGSEISLENGAQHGPVSVAIDASHNSFQLYTSGI 273

Query:   283 FTGP-CG-AELDHGVAAVGYGKSKGSD 307
             +  P C   ELDHGV  VGYG  +G D
Sbjct:   274 YYEPKCSPTELDHGVLVVGYGV-QGKD 299

 Score = 107 (42.7 bits), Expect = 6.9e-61, Sum P(2) = 6.9e-61
 Identities = 21/47 (44%), Positives = 30/47 (63%)

Query:   302 KSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPL 348
             + K ++Y IVKNSWG  WG +GYI M ++    +  CGI  ++S PL
Sbjct:   332 RPKANNYWIVKNSWGTSWGIKGYILMSKDR---KNNCGIASVSSYPL 375


>UNIPROTKB|Q28944
            symbol:CTSL1 "Cathepsin L1" species:9823 "Sus scrofa"
            [GO:0005764 "lysosome" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 eggNOG:COG4870
            HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0005764 GeneTree:ENSGT00560000076577
            HOVERGEN:HBG011513 KO:K01365 OrthoDB:EOG48PMKF MEROPS:C01.032
            CTD:1514 EMBL:D37917 EMBL:AJ315771 PIR:A58195 RefSeq:NP_999057.1
            UniGene:Ssc.54036 ProteinModelPortal:Q28944 SMR:Q28944
            STRING:Q28944 Ensembl:ENSSSCT00000012233 GeneID:396926
            KEGG:ssc:396926 OMA:DASETGK ArrayExpress:Q28944 Uniprot:Q28944
        Length = 334

 Score = 621 (223.7 bits), Expect = 1.2e-60, P = 1.2e-60
 Identities = 135/309 (43%), Positives = 184/309 (59%)

Query:    51 WMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTS----YWLGLNEFADMSHEEFKN 106
             W + HG+ Y   EE   R  ++++N+K I+  N+E +     + + +N F DM++EEF+ 
Sbjct:    32 WKATHGRLYGMNEEGWRR-AVWEKNMKMIELHNQEYSQGKHGFSMAMNAFGDMTNEEFRQ 90

Query:   107 KYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAV 166
                G + Q   + +    F    V  +PKSVDWR+KG VT VKNQG CGSCWAFS   A+
Sbjct:    91 VMNGFQNQ---KHKKGKVFHESLVLEVPKSVDWREKGYVTAVKNQGQCGSCWAFSATGAL 147

Query:   167 EGINQIVSGNLTSLSEQELIDCDT-SFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLME 225
             EG     +G L SLSEQ L+DC     N GCNGGLMD AF+Y+  +GGL  EE YPYL  
Sbjct:   148 EGQMFRKTGKLVSLSEQNLVDCSRPQGNQGCNGGLMDNAFQYVKDNGGLDTEESYPYLGR 207

Query:   226 EGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQ-PVSVAIEASGTDFQFYSGGVFT 284
             E      K E      +G+ D+P+  E++L+KA+A   P+SVAI+A  + FQFY  G++ 
Sbjct:   208 ETNSCTYKPECSAANDTGFVDIPQR-EKALMKAVATVGPISVAIDAGHSSFQFYKSGIYY 266

Query:   285 GP-CGA-ELDHGVAAVGYG----KSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLC 338
              P C + +LDHGV  VGYG     S  S + IVKNSWGP+WG  GY++M ++       C
Sbjct:   267 DPDCSSKDLDHGVLVVGYGFEGTDSNSSKFWIVKNSWGPEWGWNGYVKMAKDQNNH---C 323

Query:   339 GINKMASIP 347
             GI+  AS P
Sbjct:   324 GISTAASYP 332


>ZFIN|ZDB-GENE-071004-74
            symbol:zgc:174855 "zgc:174855" species:7955 "Danio
            rerio" [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0016787 "hydrolase activity"
            evidence=IEA] [GO:0008233 "peptidase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 ZFIN:ZDB-GENE-071004-74
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            HOVERGEN:HBG011513 MEROPS:C01.032 EMBL:BX000534 EMBL:BC152282
            IPI:IPI00773140 RefSeq:NP_001096592.1 UniGene:Dr.104905 SMR:A7MCR6
            STRING:A7MCR6 Ensembl:ENSDART00000109968 GeneID:569326
            KEGG:dre:569326 NextBio:20889622 Uniprot:A7MCR6
        Length = 335

 Score = 621 (223.7 bits), Expect = 1.2e-60, P = 1.2e-60
 Identities = 137/321 (42%), Positives = 191/321 (59%)

Query:    40 SMD-KLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT----SYWLGLN 94
             S+D +L + + SW S+HGK+Y   + ++ R  I++ENL+ I+Q N E +    ++ +G+N
Sbjct:    19 SIDIQLDDHWNSWKSQHGKSYH-EDVEVGRRMIWEENLRKIEQHNFEYSLGNHTFKMGMN 77

Query:    95 EFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSC 154
             +F DM++EEF+    G K Q P R    A F      A P+ VDWR++G VTPVK+Q  C
Sbjct:    78 QFGDMTNEEFRQAMNGYK-QDPNRTSKGALFMEPSFFAAPQQVDWRQRGYVTPVKDQKQC 136

Query:   155 GSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDT-SFNNGCNGGLMDYAFKYIVASGG 213
             GSCW+FS+  A+EG     +G L S+SEQ L+DC     N GCNGG+MD AF+Y+  + G
Sbjct:   137 GSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQGNQGCNGGIMDQAFQYVKENKG 196

Query:   214 LHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQ-PVSVAIEASG 272
             L  E+ YPYL  +           V  I+G+ D+P  +E +L+ A+A   PVSVAI+AS 
Sbjct:   197 LDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPRGNELALMNAVAAVGPVSVAIDASH 256

Query:   273 TDFQFYSGGVF-TGPCGAELDHGVAAVGYGKS----KGSDYIIVKNSWGPKWGERGYIRM 327
                QFY  G++    C + LDH V  VGYG       G+ Y IVKNSW  KWG++GYI M
Sbjct:   257 QSLQFYQSGIYYERACTSRLDHAVLVVGYGYQGADVAGNRYWIVKNSWSDKWGDKGYIYM 316

Query:   328 KRNTGKPEGLCGINKMASIPL 348
              ++       CGI  MAS PL
Sbjct:   317 AKDKNNH---CGIATMASYPL 334


>UNIPROTKB|P83654
            symbol:P83654 "Ervatamin-C" species:52861 "Tabernaemontana
            divaricata" [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=IDA] [GO:0006508 "proteolysis" evidence=IDA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0005576 GO:GO:0006508
            InterPro:IPR025661 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0004197 PDB:1O0E PDB:2PNS
            PDBsum:1O0E PDBsum:2PNS MEROPS:C01.116 EvolutionaryTrace:P83654
            Uniprot:P83654
        Length = 208

 Score = 618 (222.6 bits), Expect = 2.4e-60, P = 2.4e-60
 Identities = 121/217 (55%), Positives = 155/217 (71%)

Query:   133 LPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF 192
             LP+ +DWRKKGAVTPVKNQGSCGSCWAFSTV+ VE INQI +GNL SLSEQEL+DCD   
Sbjct:     1 LPEQIDWRKKGAVTPVKNQGSCGSCWAFSTVSTVESINQIRTGNLISLSEQELVDCDKK- 59

Query:   193 NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDE 252
             N+GC GG   +A++YI+ +GG+  + +YPY   +G C+   +   VV+I GY  VP  +E
Sbjct:    60 NHGCLGGAFVFAYQYIINNGGIDTQANYPYKAVQGPCQAASK---VVSIDGYNGVPFCNE 116

Query:   253 QSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVK 312
              +L +A+A QP +VAI+AS   FQ YS G+F+GPCG +L+HGV  VGY     ++Y IV+
Sbjct:   117 XALKQAVAVQPSTVAIDASSAQFQQYSSGIFSGPCGTKLNHGVTIVGYQ----ANYWIVR 172

Query:   313 NSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
             NSWG  WGE+GYIRM R  G   GLCGI ++   P K
Sbjct:   173 NSWGRYWGEKGYIRMLRVGGC--GLCGIARLPYYPTK 207


>UNIPROTKB|P25975
            symbol:CTSL1 "Cathepsin L1" species:9913 "Bos taurus"
            [GO:0005764 "lysosome" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 eggNOG:COG4870
            HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0005764 GeneTree:ENSGT00560000076577
            HOVERGEN:HBG011513 EMBL:X91755 EMBL:BC102312 EMBL:AB017648
            IPI:IPI00687440 PIR:S15845 RefSeq:NP_776457.1 UniGene:Bt.3987
            ProteinModelPortal:P25975 SMR:P25975 STRING:P25975
            Ensembl:ENSBTAT00000022710 Ensembl:ENSBTAT00000036427 GeneID:281108
            KEGG:bta:281108 CTD:1515 InParanoid:P25975 KO:K01365 OMA:EEFRATH
            OrthoDB:EOG48PMKF BindingDB:P25975 ChEMBL:CHEMBL2113
            NextBio:20805179 ArrayExpress:P25975 Uniprot:P25975
        Length = 334

 Score = 617 (222.3 bits), Expect = 3.1e-60, P = 3.1e-60
 Identities = 137/312 (43%), Positives = 185/312 (59%)

Query:    48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTS----YWLGLNEFADMSHEE 103
             +  W + H + Y   EE+  R  ++++N K ID  N+E +     + + +N F DM++EE
Sbjct:    29 WHQWKATHRRLYGMNEEEWRR-AVWEKNKKIIDLHNQEYSEGKHGFRMAMNAFGDMTNEE 87

Query:   104 FKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTV 163
             F+    G + Q   + +   E    DV   PKSVDW KKG VTPVKNQG CGSCWAFS  
Sbjct:    88 FRQVMNGFQNQKHKKGKLFHEPLLVDV---PKSVDWTKKGYVTPVKNQGQCGSCWAFSAT 144

Query:   164 AAVEGINQIVSGNLTSLSEQELIDCDTSFNN-GCNGGLMDYAFKYIVASGGLHKEEDYPY 222
              A+EG     +G L SLSEQ L+DC  +  N GCNGGLMD AF+YI  +GGL  EE YPY
Sbjct:   145 GALEGQMFRKTGKLVSLSEQNLVDCSRAQGNQGCNGGLMDNAFQYIKDNGGLDSEESYPY 204

Query:   223 LMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQ-PVSVAIEASGTDFQFYSGG 281
             L  +    + K E      +G+ D+P+  E++L+KA+A   P+SVAI+A  T FQFY  G
Sbjct:   205 LATDTNSCNYKPECSAANDTGFVDIPQR-EKALMKAVATVGPISVAIDAGHTSFQFYKSG 263

Query:   282 VFTGP-CGA-ELDHGVAAVGYG----KSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPE 335
             ++  P C + +LDHGV  VGYG     S  + + IVKNSWGP+WG  GY++M ++     
Sbjct:   264 IYYDPDCSSKDLDHGVLVVGYGFEGTDSNNNKFWIVKNSWGPEWGWNGYVKMAKDQNNH- 322

Query:   336 GLCGINKMASIP 347
               CGI   AS P
Sbjct:   323 --CGIATAASYP 332


>ZFIN|ZDB-GENE-030131-572
            symbol:wu:fb37b09 "wu:fb37b09" species:7955 "Danio
            rerio" [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008233 "peptidase
            activity" evidence=IEA] [GO:0016787 "hydrolase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            ZFIN:ZDB-GENE-030131-572 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            EMBL:BX465840 IPI:IPI00866294 RefSeq:XP_001923796.1
            UniGene:Dr.25683 PRIDE:E9QBE2 Ensembl:ENSDART00000133962
            GeneID:321853 KEGG:dre:321853 NextBio:20807556 Uniprot:E9QBE2
        Length = 335

 Score = 617 (222.3 bits), Expect = 3.1e-60, P = 3.1e-60
 Identities = 136/321 (42%), Positives = 191/321 (59%)

Query:    40 SMD-KLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT----SYWLGLN 94
             S+D +L + + SW S+HGK+Y   + ++ R  I++ENL+ I+Q N E +    ++ +G+N
Sbjct:    19 SIDIQLDDHWNSWKSQHGKSYH-EDVEVGRRMIWEENLRKIEQHNFEYSLGNHTFKMGMN 77

Query:    95 EFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSC 154
             +F DM++EEF+    G K   P R      F      A P+ VDWR++G VTPVK+Q  C
Sbjct:    78 QFGDMTNEEFRQAMNGYKHD-PNRTSQGPLFMEPKFFAAPQQVDWRQRGYVTPVKDQKQC 136

Query:   155 GSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNN-GCNGGLMDYAFKYIVASGG 213
             GSCW+FS+  A+EG     +G L S+SEQ L+DC     N GCNGGLMD AF+Y+  + G
Sbjct:   137 GSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPHGNQGCNGGLMDQAFQYVKENKG 196

Query:   214 LHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQ-PVSVAIEASG 272
             L  E+ YPYL  +           V  I+G+ D+P+ +E +L+ A+A   PVSVAI+AS 
Sbjct:   197 LDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPKGNELALMNAVAAVGPVSVAIDASH 256

Query:   273 TDFQFYSGGVF-TGPCGAELDHGVAAVGYGKS----KGSDYIIVKNSWGPKWGERGYIRM 327
                QFY  G++    C ++LDH V  VGYG       G+ Y IVKNSW  KWG++GYI M
Sbjct:   257 QSLQFYQSGIYYERACTSQLDHAVLVVGYGYQGADVAGNRYWIVKNSWSDKWGDKGYIYM 316

Query:   328 KRNTGKPEGLCGINKMASIPL 348
              ++       CGI  MAS PL
Sbjct:   317 AKDKNNH---CGIATMASYPL 334


>DICTYBASE|DDB_G0279187
            symbol:cprG "cysteine proteinase 7" species:44689
            "Dictyostelium discoideum" [GO:0005615 "extracellular space"
            evidence=IDA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0005764 "lysosome" evidence=IEA] [GO:0044351
            "macropinocytosis" evidence=RCA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 dictyBase:DDB_G0279187 GO:GO:0005615
            GenomeReviews:CM000152_GR eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            EMBL:AAFI02000030 ProtClustDB:CLSZ2846820 MEROPS:C01.081
            EMBL:U72746 RefSeq:XP_641720.2 ProteinModelPortal:Q94504 SMR:Q94504
            PRIDE:Q94504 EnsemblProtists:DDB0215005 GeneID:8621915
            KEGG:ddi:DDB_G0279187 OMA:INTETEK Uniprot:Q94504
        Length = 460

 Score = 531 (192.0 bits), Expect = 4.8e-60, Sum P(2) = 4.8e-60
 Identities = 113/265 (42%), Positives = 154/265 (58%)

Query:    48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNK 107
             F +WM  H + Y   EE   R+ IFK N+ ++++ N + +   LGLN FAD+S+EE++  
Sbjct:    30 FTNWMIAHQRHYSS-EEFNGRYNIFKANMDYVNEWNTKGSETVLGLNVFADISNEEYRAT 88

Query:   108 YLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVE 167
             YLG      +     ++  + D  A    VDWR +GAVTP+KNQG CG CW+FST  A E
Sbjct:    89 YLGTPFDASSLEMTESDKIF-DASA---QVDWRTQGAVTPIKNQGQCGGCWSFSTTGATE 144

Query:   168 GINQIVSG--NLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLM 224
             G   + +G  NL SLSEQ LIDC  S+ NNGC GGLM  AF+YI+ + G+  E  YPY  
Sbjct:   145 GAQYLANGKKNLVSLSEQNLIDCSGSYGNNGCEGGLMTLAFEYIINNKGIDTESSYPYTA 204

Query:   225 EEGT-CEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVF 283
             E+G  C+   + +    +S Y +V    E  L   +   P SVAI+AS   FQ Y  G++
Sbjct:   205 EDGKKCKFNPKNV-AAQLSSYVNVTSGSESDLAAKVTQGPTSVAIDASNQSFQLYVSGIY 263

Query:   284 TGP-CGA-ELDHGVAAVGYGKSKGS 306
               P C + +LDHGV AVG+G   GS
Sbjct:   264 NEPACSSTQLDHGVLAVGFGTGSGS 288

 Score = 102 (41.0 bits), Expect = 4.8e-60, Sum P(2) = 4.8e-60
 Identities = 22/41 (53%), Positives = 23/41 (56%)

Query:   307 DYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
             DY IVKNSWG  WG  GYI M +        CGI  MAS P
Sbjct:   417 DYWIVKNSWGTSWGMDGYILMTKGNNNQ---CGIATMASRP 454


>UNIPROTKB|Q9GL24
            symbol:CTSL1 "Cathepsin L1" species:9615 "Canis lupus
            familiaris" [GO:0005764 "lysosome" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0006508
            "proteolysis" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 CTD:1515 KO:K01365
            OrthoDB:EOG48PMKF EMBL:AJ279008 RefSeq:NP_001239115.1
            UniGene:Cfa.3571 ProteinModelPortal:Q9GL24 SMR:Q9GL24
            MEROPS:C01.032 Ensembl:ENSCAFT00000001770
            Ensembl:ENSCAFT00000023837 GeneID:100684364 KEGG:cfa:100684364
            InParanoid:Q9GL24 OMA:FDQNLDT NextBio:20817211 Uniprot:Q9GL24
        Length = 333

 Score = 614 (221.2 bits), Expect = 6.4e-60, P = 6.4e-60
 Identities = 135/310 (43%), Positives = 189/310 (60%)

Query:    51 WMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTS----YWLGLNEFADMSHEEFKN 106
             W + H + Y   EE   R  ++++N+K I+  N+E +     + + +N F DM++EEF+ 
Sbjct:    32 WKATHRRLYGMNEEGWRR-AVWEKNMKMIELHNREYSQGKHGFTMAMNAFGDMTNEEFRQ 90

Query:   107 KYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAV 166
                G + Q   + +   E  + ++   PKSVDWR+KG VTPVKNQG CGSCWAFS   A+
Sbjct:    91 VMNGFQNQKHKKGKMFQEPLFAEI---PKSVDWREKGYVTPVKNQGQCGSCWAFSATGAL 147

Query:   167 EGINQIVSGNLTSLSEQELIDCDTSFNN-GCNGGLMDYAFKYIVASGGLHKEEDYPYL-M 224
             EG     +G L SLSEQ L+DC  +  N GCNGGLMD AF+Y+  +GGL  EE YPYL  
Sbjct:   148 EGQMFRKTGKLVSLSEQNLVDCSRAQGNEGCNGGLMDNAFRYVKDNGGLDSEESYPYLGR 207

Query:   225 EEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQ-PVSVAIEASGTDFQFYSGGVF 283
             +  TC + K E      +G+ D+P+  E++L+KA+A   P+SVAI+A    FQFY  G++
Sbjct:   208 DTETC-NYKPECSAANDTGFVDLPQR-EKALMKAVATLGPISVAIDAGHQSFQFYKSGIY 265

Query:   284 TGP-CGA-ELDHGVAAVGYGKSKGSD----YIIVKNSWGPKWGERGYIRMKRNTGKPEGL 337
               P C + +LDHGV  VGYG  +G+D    + IVKNSWGP+WG  GY++M ++       
Sbjct:   266 FDPDCSSKDLDHGVLVVGYG-FEGTDSNNKFWIVKNSWGPEWGWNGYVKMAKDQNNH--- 321

Query:   338 CGINKMASIP 347
             CGI   AS P
Sbjct:   322 CGIATAASYP 331


>MGI|MGI:88564
            symbol:Ctsl "cathepsin L" species:10090 "Mus musculus"
            [GO:0004177 "aminopeptidase activity" evidence=ISO] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=ISO] [GO:0005515
            "protein binding" evidence=IPI] [GO:0005730 "nucleolus"
            evidence=NAS] [GO:0005737 "cytoplasm" evidence=ISO] [GO:0005764
            "lysosome" evidence=ISO] [GO:0005773 "vacuole" evidence=ISO]
            [GO:0005902 "microvillus" evidence=ISO] [GO:0006508 "proteolysis"
            evidence=ISO;IDA] [GO:0007154 "cell communication" evidence=ISO]
            [GO:0008152 "metabolic process" evidence=TAS] [GO:0008233
            "peptidase activity" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=ISO;TAS] [GO:0009897 "external side of
            plasma membrane" evidence=ISO] [GO:0016787 "hydrolase activity"
            evidence=IEA] [GO:0030141 "secretory granule" evidence=ISO]
            [GO:0030984 "kininogen binding" evidence=ISO] [GO:0032403 "protein
            complex binding" evidence=ISO] [GO:0042277 "peptide binding"
            evidence=ISO] [GO:0042393 "histone binding" evidence=ISO;NAS]
            [GO:0043005 "neuron projection" evidence=ISO] [GO:0043204
            "perikaryon" evidence=ISO] [GO:0045177 "apical part of cell"
            evidence=ISO] [GO:0048863 "stem cell differentiation" evidence=NAS]
            [GO:0060441 "epithelial tube branching involved in lung
            morphogenesis" evidence=IMP] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            MGI:MGI:88564 GO:GO:0005730 GO:GO:0009897 GO:GO:0034698
            GO:GO:0043204 GO:GO:0009749 GO:GO:0030141 GO:GO:0048863
            GO:GO:0051384 eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045177 GO:GO:0043005
            GO:GO:0007283 GO:GO:0004177 GO:GO:0005764 GO:GO:0042277
            GO:GO:0009267 GO:GO:0021675 GO:GO:0042393 GO:GO:0005902
            GO:GO:0010259 GO:GO:0004197 GO:GO:0048102 GO:GO:0046697
            HOVERGEN:HBG011513 KO:K01365 OMA:EEFRATH OrthoDB:EOG48PMKF
            MEROPS:C01.032 BRENDA:3.4.22.15 ChiTaRS:CTSL1 EMBL:X06086
            EMBL:J02583 EMBL:M20495 EMBL:AF121837 EMBL:AF121838 EMBL:AF121839
            EMBL:BC068163 EMBL:X04392 IPI:IPI00128154 PIR:S01177
            RefSeq:NP_034114.1 UniGene:Mm.930 PDB:1MVV PDBsum:1MVV
            ProteinModelPortal:P06797 SMR:P06797 STRING:P06797
            PhosphoSite:P06797 PaxDb:P06797 PRIDE:P06797
            Ensembl:ENSMUST00000021933 GeneID:13039 KEGG:mmu:13039 CTD:13039
            InParanoid:P06797 BioCyc:MetaCyc:MONOMER-14812 BindingDB:P06797
            ChEMBL:CHEMBL5291 NextBio:282928 Bgee:P06797 CleanEx:MM_CTSL
            Genevestigator:P06797 GermOnline:ENSMUSG00000021477 GO:GO:0060008
            Uniprot:P06797
        Length = 334

 Score = 614 (221.2 bits), Expect = 6.4e-60, P = 6.4e-60
 Identities = 133/314 (42%), Positives = 193/314 (61%)

Query:    48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTS----YWLGLNEFADMSHEE 103
             +  W S H + Y   EE+  R  I+++N++ I   N E ++    + + +N F DM++EE
Sbjct:    29 WHQWKSTHRRLYGTNEEEWRR-AIWEKNMRMIQLHNGEYSNGQHGFSMEMNAFGDMTNEE 87

Query:   104 FKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTV 163
             F+    G + Q   + +    F    +  +PKSVDWR+KG VTPVKNQG CGSCWAFS  
Sbjct:    88 FRQVVNGYRHQ---KHKKGRLFQEPLMLKIPKSVDWREKGCVTPVKNQGQCGSCWAFSAS 144

Query:   164 AAVEGINQIVSGNLTSLSEQELIDCD-TSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPY 222
               +EG   + +G L SLSEQ L+DC     N GCNGGLMD+AF+YI  +GGL  EE YPY
Sbjct:   145 GCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGCNGGLMDFAFQYIKENGGLDSEESYPY 204

Query:   223 LMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQ-PVSVAIEASGTDFQFYSGG 281
               ++G+C+  + E  V   +G+ D+P+  E++L+KA+A   P+SVA++AS    QFYS G
Sbjct:   205 EAKDGSCK-YRAEFAVANDTGFVDIPQQ-EKALMKAVATVGPISVAMDASHPSLQFYSSG 262

Query:   282 VFTGP-CGAE-LDHGVAAVGYGKSKGSD-----YIIVKNSWGPKWGERGYIRMKRNTGKP 334
             ++  P C ++ LDHGV  VGYG  +G+D     Y +VKNSWG +WG  GYI++ ++    
Sbjct:   263 IYYEPNCSSKNLDHGVLLVGYGY-EGTDSNKNKYWLVKNSWGSEWGMEGYIKIAKDR--- 318

Query:   335 EGLCGINKMASIPL 348
             +  CG+   AS P+
Sbjct:   319 DNHCGLATAASYPV 332


>RGD|2447 [details] [associations]
            symbol:Ctsh "cathepsin H" species:10116 "Rattus norvegicus"
          [GO:0001520 "outer dense fiber" evidence=IDA] [GO:0001656
          "metanephros development" evidence=IEP] [GO:0001669 "acrosomal
          vesicle" evidence=IDA] [GO:0001913 "T cell mediated cytotoxicity"
          evidence=ISO;ISS] [GO:0002250 "adaptive immune response"
          evidence=ISO] [GO:0002764 "immune response-regulating signaling
          pathway" evidence=ISO;ISS] [GO:0004175 "endopeptidase activity"
          evidence=ISO] [GO:0004177 "aminopeptidase activity" evidence=ISO;IDA]
          [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISO;ISS]
          [GO:0004252 "serine-type endopeptidase activity" evidence=ISO;ISS]
          [GO:0005615 "extracellular space" evidence=ISO;ISS;IDA] [GO:0005764
          "lysosome" evidence=ISO;ISS;IDA] [GO:0005829 "cytosol"
          evidence=ISO;ISS] [GO:0006508 "proteolysis" evidence=IEP;ISO]
          [GO:0007283 "spermatogenesis" evidence=IEP] [GO:0008233 "peptidase
          activity" evidence=ISO] [GO:0008234 "cysteine-type peptidase
          activity" evidence=ISO] [GO:0008284 "positive regulation of cell
          proliferation" evidence=ISO;ISS] [GO:0010628 "positive regulation of
          gene expression" evidence=ISO;ISS] [GO:0010634 "positive regulation
          of epithelial cell migration" evidence=ISO;ISS] [GO:0010813
          "neuropeptide catabolic process" evidence=ISO;ISS] [GO:0010815
          "bradykinin catabolic process" evidence=ISO;ISS] [GO:0010952
          "positive regulation of peptidase activity" evidence=ISO;ISS]
          [GO:0016505 "apoptotic protease activator activity" evidence=ISO;ISS]
          [GO:0030108 "HLA-A specific activating MHC class I receptor activity"
          evidence=ISO;ISS] [GO:0030335 "positive regulation of cell migration"
          evidence=ISO;ISS] [GO:0030984 "kininogen binding" evidence=IPI]
          [GO:0031638 "zymogen activation" evidence=ISO;ISS] [GO:0031648
          "protein destabilization" evidence=ISO;ISS] [GO:0032403 "protein
          complex binding" evidence=IPI] [GO:0032526 "response to retinoic
          acid" evidence=ISO;ISS] [GO:0033619 "membrane protein proteolysis"
          evidence=ISO;ISS] [GO:0035085 "cilium axoneme" evidence=IDA]
          [GO:0043066 "negative regulation of apoptotic process"
          evidence=ISO;ISS] [GO:0043129 "surfactant homeostasis"
          evidence=ISO;ISS] [GO:0043621 "protein self-association"
          evidence=IDA] [GO:0045766 "positive regulation of angiogenesis"
          evidence=ISO;ISS] [GO:0060448 "dichotomous subdivision of terminal
          units involved in lung branching" evidence=ISO;ISS] [GO:0070324
          "thyroid hormone binding" evidence=ISO;ISS] [GO:0070371 "ERK1 and
          ERK2 cascade" evidence=ISO;ISS] [GO:0097067 "cellular response to
          thyroid hormone stimulus" evidence=ISO;IEP] [GO:0097208 "alveolar
          lamellar body" evidence=ISO;ISS;IDA] InterPro:IPR000668
          InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
          InterPro:IPR000169 RGD:2447 GO:GO:0005829 GO:GO:0043066 GO:GO:0005615
          GO:GO:0008284 GO:GO:0070371 GO:GO:0001669 eggNOG:COG4870
          HOGENOM:HOG000230774 InterPro:IPR025661 InterPro:IPR025660
          InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
          PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0007283
          GO:GO:0045766 GO:GO:0004177 GO:GO:0004252 GO:GO:0005764 GO:GO:0097208
          GO:GO:0043621 GO:GO:0032526 GO:GO:0010628 GO:GO:0070324 KO:K01366
          GO:GO:0016505 GO:GO:0001656 GO:GO:0010634 GO:GO:0004197 GO:GO:0031648
          HOVERGEN:HBG011513 GO:GO:0097067 GO:GO:0031638 GO:GO:0001913
          GeneTree:ENSGT00660000095458 CTD:1512 OMA:STSCHKT OrthoDB:EOG4W9J43
          GO:GO:0030108 GO:GO:0010815 GO:GO:0060448 GO:GO:0002764 GO:GO:0033619
          GO:GO:0010813 GO:GO:0043129 MEROPS:I29.003 EMBL:Y00708 EMBL:BC085352
          EMBL:M38135 IPI:IPI00212809 PIR:S00211 RefSeq:NP_037071.1
          UniGene:Rn.1997 ProteinModelPortal:P00786 SMR:P00786 STRING:P00786
          PRIDE:P00786 Ensembl:ENSRNOT00000019285 GeneID:25425 KEGG:rno:25425
          UCSC:RGD:2447 InParanoid:P00786 BindingDB:P00786 NextBio:606599
          Genevestigator:P00786 GermOnline:ENSRNOG00000014064 GO:GO:0035086
          GO:GO:0001520 Uniprot:P00786
        Length = 333

 Score = 613 (220.8 bits), Expect = 8.1e-60, P = 8.1e-60
 Identities = 130/309 (42%), Positives = 185/309 (59%)

Query:    48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNK 107
             F SWM +H KTY    E  HR ++F  N + I   N+   ++ +GLN+F+DMS  E K+K
Sbjct:    33 FTSWMKQHQKTYSS-REYSHRLQVFANNWRKIQAHNQRNHTFKMGLNQFSDMSFAEIKHK 91

Query:   108 YLGLKPQFPTRRQPSAEFSY-RDVKALPKSVDWRKKG-AVTPVKNQGSCGSCWAFSTVAA 165
             YL  +PQ       + + +Y R     P S+DWRKKG  V+PVKNQG+CGSCW FST  A
Sbjct:    92 YLWSEPQ----NCSATKSNYLRGTGPYPSSMDWRKKGNVVSPVKNQGACGSCWTFSTTGA 147

Query:   166 VEGINQIVSGNLTSLSEQELIDCDTSFNN-GCNGGLMDYAFKYIVASGGLHKEEDYPYLM 224
             +E    I SG + +L+EQ+L+DC  +FNN GC GGL   AF+YI+ + G+  E+ YPY+ 
Sbjct:   148 LESAVAIASGKMMTLAEQQLVDCAQNFNNHGCQGGLPSQAFEYILYNKGIMGEDSYPYIG 207

Query:   225 EEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALA-HQPVSVAIEASGTDFQFYSGGVF 283
             + G C+   E+  V  +    ++  NDE ++++A+A + PVS A E +  DF  Y  GV+
Sbjct:   208 KNGQCKFNPEKA-VAFVKNVVNITLNDEAAMVEAVALYNPVSFAFEVT-EDFMMYKSGVY 265

Query:   284 TG-PCGA---ELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCG 339
             +   C     +++H V AVGYG+  G  Y IVKNSWG  WG  GY  ++R  GK   +CG
Sbjct:   266 SSNSCHKTPDKVNHAVLAVGYGEQNGLLYWIVKNSWGSNWGNNGYFLIER--GK--NMCG 321

Query:   340 INKMASIPL 348
             +   AS P+
Sbjct:   322 LAACASYPI 330


>MGI|MGI:107285 [details] [associations]
            symbol:Ctsh "cathepsin H" species:10090 "Mus musculus"
            [GO:0001520 "outer dense fiber" evidence=ISO] [GO:0001669
            "acrosomal vesicle" evidence=ISO] [GO:0001913 "T cell mediated
            cytotoxicity" evidence=IGI] [GO:0002764 "immune response-regulating
            signaling pathway" evidence=ISO] [GO:0004175 "endopeptidase
            activity" evidence=ISO;IMP] [GO:0004177 "aminopeptidase activity"
            evidence=ISO] [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=ISO;IDA] [GO:0004252 "serine-type endopeptidase activity"
            evidence=IMP] [GO:0005615 "extracellular space" evidence=ISO]
            [GO:0005764 "lysosome" evidence=ISO] [GO:0005829 "cytosol"
            evidence=ISO] [GO:0006508 "proteolysis" evidence=ISO;IMP]
            [GO:0008233 "peptidase activity" evidence=ISO] [GO:0008234
            "cysteine-type peptidase activity" evidence=ISO] [GO:0008284
            "positive regulation of cell proliferation" evidence=IMP]
            [GO:0010628 "positive regulation of gene expression" evidence=ISO]
            [GO:0010634 "positive regulation of epithelial cell migration"
            evidence=IMP] [GO:0010813 "neuropeptide catabolic process"
            evidence=ISO] [GO:0010815 "bradykinin catabolic process"
            evidence=ISO] [GO:0010952 "positive regulation of peptidase
            activity" evidence=IGI;ISO] [GO:0016505 "apoptotic protease
            activator activity" evidence=IGI;ISO] [GO:0016787 "hydrolase
            activity" evidence=IEA] [GO:0030108 "HLA-A specific activating MHC
            class I receptor activity" evidence=ISO] [GO:0030335 "positive
            regulation of cell migration" evidence=ISO] [GO:0030984 "kininogen
            binding" evidence=ISO] [GO:0031638 "zymogen activation"
            evidence=ISO;IMP] [GO:0031648 "protein destabilization"
            evidence=ISO;IMP] [GO:0032403 "protein complex binding"
            evidence=ISO] [GO:0032526 "response to retinoic acid" evidence=IDA]
            [GO:0033619 "membrane protein proteolysis" evidence=ISO;IMP]
            [GO:0035085 "cilium axoneme" evidence=ISO] [GO:0043066 "negative
            regulation of apoptotic process" evidence=IMP] [GO:0043129
            "surfactant homeostasis" evidence=ISO] [GO:0043621 "protein
            self-association" evidence=ISO] [GO:0045766 "positive regulation of
            angiogenesis" evidence=IMP] [GO:0060448 "dichotomous subdivision of
            terminal units involved in lung branching" evidence=IMP]
            [GO:0070324 "thyroid hormone binding" evidence=ISO] [GO:0070371
            "ERK1 and ERK2 cascade" evidence=ISO] [GO:0097208 "alveolar
            lamellar body" evidence=ISO] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            MGI:MGI:107285 GO:GO:0005829 GO:GO:0043066 GO:GO:0005615
            GO:GO:0008284 GO:GO:0070371 eggNOG:COG4870 HOGENOM:HOG000230774
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766 GO:GO:0004177
            GO:GO:0004252 GO:GO:0005764 GO:GO:0097208 GO:GO:0032526
            GO:GO:0010628 GO:GO:0070324 KO:K01366 EMBL:CH466560 GO:GO:0016505
            GO:GO:0001656 GO:GO:0010634 GO:GO:0004197 GO:GO:0031648
            HOVERGEN:HBG011513 GO:GO:0097067 GO:GO:0031638 GO:GO:0001913
            GeneTree:ENSGT00660000095458 CTD:1512 OMA:STSCHKT OrthoDB:EOG4W9J43
            GO:GO:0030108 GO:GO:0010815 GO:GO:0060448 GO:GO:0002764
            GO:GO:0033619 GO:GO:0010813 GO:GO:0043129 BRENDA:3.4.22.16
            EMBL:U06119 EMBL:AK149949 EMBL:AK150583 EMBL:AK157376 EMBL:AK160026
            EMBL:Y18464 IPI:IPI00118987 RefSeq:NP_031827.2 UniGene:Mm.2277
            ProteinModelPortal:P49935 SMR:P49935 STRING:P49935 MEROPS:I29.003
            PhosphoSite:P49935 PaxDb:P49935 PRIDE:P49935
            Ensembl:ENSMUST00000034915 GeneID:13036 KEGG:mmu:13036
            InParanoid:Q3UCD6 ChEMBL:CHEMBL1949491 NextBio:282920 Bgee:P49935
            CleanEx:MM_CTSH Genevestigator:P49935 GermOnline:ENSMUSG00000032359
            Uniprot:P49935
        Length = 333

 Score = 612 (220.5 bits), Expect = 1.0e-59, P = 1.0e-59
 Identities = 128/309 (41%), Positives = 189/309 (61%)

Query:    48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNK 107
             F+SWM +H KTY  +E   HR ++F  N + I   N+   ++ + LN+F+DMS  E K+K
Sbjct:    33 FKSWMKQHQKTYSSVEYN-HRLQMFANNWRKIQAHNQRNHTFKMALNQFSDMSFAEIKHK 91

Query:   108 YLGLKPQFPTRRQPSAEFSY-RDVKALPKSVDWRKKG-AVTPVKNQGSCGSCWAFSTVAA 165
             +L  +PQ       + + +Y R     P S+DWRKKG  V+PVKNQG+CGSCW FST  A
Sbjct:    92 FLWSEPQ----NCSATKSNYLRGTGPYPSSMDWRKKGNVVSPVKNQGACGSCWTFSTTGA 147

Query:   166 VEGINQIVSGNLTSLSEQELIDCDTSFNN-GCNGGLMDYAFKYIVASGGLHKEEDYPYLM 224
             +E    I SG + SL+EQ+L+DC  +FNN GC GGL   AF+YI+ + G+ +E+ YPY+ 
Sbjct:   148 LESAVAIASGKMLSLAEQQLVDCAQAFNNHGCKGGLPSQAFEYILYNKGIMEEDSYPYIG 207

Query:   225 EEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALA-HQPVSVAIEASGTDFQFYSGGVF 283
             ++ +C    ++  V  +    ++  NDE ++++A+A + PVS A E +  DF  Y  GV+
Sbjct:   208 KDSSCRFNPQKA-VAFVKNVVNITLNDEAAMVEAVALYNPVSFAFEVT-EDFLMYKSGVY 265

Query:   284 TGP-CGA---ELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCG 339
             +   C     +++H V AVGYG+  G  Y IVKNSWG +WGE GY  ++R  GK   +CG
Sbjct:   266 SSKSCHKTPDKVNHAVLAVGYGEQNGLLYWIVKNSWGSQWGENGYFLIER--GK--NMCG 321

Query:   340 INKMASIPL 348
             +   AS P+
Sbjct:   322 LAACASYPI 330


>UNIPROTKB|P43235 [details] [associations]
            symbol:CTSK "Cathepsin K" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0001957
            "intramembranous ossification" evidence=IEA] [GO:0005615
            "extracellular space" evidence=IEA] [GO:0045453 "bone resorption"
            evidence=IEA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=TAS] [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=TAS] [GO:0036021 "endolysosome lumen" evidence=TAS]
            [GO:0045087 "innate immune response" evidence=TAS]
            InterPro:IPR000668 InterPro:IPR013128 InterPro:IPR015644
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            Reactome:REACT_118779 Reactome:REACT_6900 GO:GO:0005615
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045087 GO:GO:0045453
            EMBL:CH471121 EMBL:AL355860 GO:GO:0004197 GO:GO:0001957
            HOVERGEN:HBG011513 GO:GO:0036021 MEROPS:I29.007 CTD:1513 KO:K01371
            OrthoDB:EOG4SJ5FC PANTHER:PTHR12411:SF55 EMBL:U13665 EMBL:X82153
            EMBL:U20280 EMBL:S79895 EMBL:CR541675 EMBL:AL356292 EMBL:BC016058
            IPI:IPI00300599 PIR:JC2476 RefSeq:NP_000387.1 UniGene:Hs.632466
            PDB:1ATK PDB:1AU0 PDB:1AU2 PDB:1AU3 PDB:1AU4 PDB:1AYU PDB:1AYV
            PDB:1AYW PDB:1BGO PDB:1BY8 PDB:1MEM PDB:1NL6 PDB:1NLJ PDB:1Q6K
            PDB:1SNK PDB:1TU6 PDB:1U9V PDB:1U9W PDB:1U9X PDB:1VSN PDB:1YK7
            PDB:1YK8 PDB:1YT7 PDB:2ATO PDB:2AUX PDB:2AUZ PDB:2BDL PDB:2R6N
            PDB:3C9E PDB:3H7D PDB:3KW9 PDB:3KWB PDB:3KWZ PDB:3KX1 PDB:3O0U
            PDB:3O1G PDB:3OVZ PDB:4DMX PDB:4DMY PDB:7PCK PDBsum:1ATK
            PDBsum:1AU0 PDBsum:1AU2 PDBsum:1AU3 PDBsum:1AU4 PDBsum:1AYU
            PDBsum:1AYV PDBsum:1AYW PDBsum:1BGO PDBsum:1BY8 PDBsum:1MEM
            PDBsum:1NL6 PDBsum:1NLJ PDBsum:1Q6K PDBsum:1SNK PDBsum:1TU6
            PDBsum:1U9V PDBsum:1U9W PDBsum:1U9X PDBsum:1VSN PDBsum:1YK7
            PDBsum:1YK8 PDBsum:1YT7 PDBsum:2ATO PDBsum:2AUX PDBsum:2AUZ
            PDBsum:2BDL PDBsum:2R6N PDBsum:3C9E PDBsum:3H7D PDBsum:3KW9
            PDBsum:3KWB PDBsum:3KWZ PDBsum:3KX1 PDBsum:3O0U PDBsum:3O1G
            PDBsum:3OVZ PDBsum:4DMX PDBsum:4DMY PDBsum:7PCK
            ProteinModelPortal:P43235 SMR:P43235 DIP:DIP-39993N IntAct:P43235
            STRING:P43235 PhosphoSite:P43235 DMDM:1168793 PaxDb:P43235
            PRIDE:P43235 DNASU:1513 Ensembl:ENST00000271651 GeneID:1513
            KEGG:hsa:1513 UCSC:uc001evp.2 GeneCards:GC01M150768 HGNC:HGNC:2536
            MIM:265800 MIM:601105 neXtProt:NX_P43235 Orphanet:763
            PharmGKB:PA27034 InParanoid:P43235 OMA:LKVPPSH PhylomeDB:P43235
            BindingDB:P43235 ChEMBL:CHEMBL268 EvolutionaryTrace:P43235
            GenomeRNAi:1513 NextBio:6267 ArrayExpress:P43235 Bgee:P43235
            CleanEx:HS_CTSK CleanEx:HS_CTSO Genevestigator:P43235
            GermOnline:ENSG00000143387 Uniprot:P43235
        Length = 329

 Score = 609 (219.4 bits), Expect = 2.2e-59, P = 2.2e-59
 Identities = 136/308 (44%), Positives = 186/308 (60%)

Query:    48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKE----VTSYWLGLNEFADMSHEE 103
             +E W   H K Y    +++ R  I+++NLK+I   N E    V +Y L +N   DM+ EE
Sbjct:    26 WELWKKTHRKQYNNKVDEISRRLIWEKNLKYISIHNLEASLGVHTYELAMNHLGDMTSEE 85

Query:   104 FKNKYLGLK-PQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFST 162
                K  GLK P   +R   +      + +A P SVD+RKKG VTPVKNQG CGSCWAFS+
Sbjct:    86 VVQKMTGLKVPLSHSRSNDTLYIPEWEGRA-PDSVDYRKKGYVTPVKNQGQCGSCWAFSS 144

Query:   163 VAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPY 222
             V A+EG  +  +G L +LS Q L+DC  S N+GC GG M  AF+Y+  + G+  E+ YPY
Sbjct:   145 VGALEGQLKKKTGKLLNLSPQNLVDC-VSENDGCGGGYMTNAFQYVQKNRGIDSEDAYPY 203

Query:   223 LMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQ-PVSVAIEASGTDFQFYSGG 281
             + +E +C       +     GY+++PE +E++L +A+A   PVSVAI+AS T FQFYS G
Sbjct:   204 VGQEESCMYNPTG-KAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDASLTSFQFYSKG 262

Query:   282 VFTGP-CGAE-LDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCG 339
             V+    C ++ L+H V AVGYG  KG+ + I+KNSWG  WG +GYI M RN       CG
Sbjct:   263 VYYDESCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNKGYILMARNKNNA---CG 319

Query:   340 INKMASIP 347
             I  +AS P
Sbjct:   320 IANLASFP 327


>UNIPROTKB|Q5E998 [details] [associations]
            symbol:CTSL2 "Cathepsin L2" species:9913 "Bos taurus"
            [GO:0005764 "lysosome" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            HOVERGEN:HBG011513 UniGene:Bt.3987 MEROPS:C01.032 EMBL:BT021022
            IPI:IPI00711962 ProteinModelPortal:Q5E998 SMR:Q5E998 STRING:Q5E998
            InParanoid:Q5E998 Uniprot:Q5E998
        Length = 334

 Score = 608 (219.1 bits), Expect = 2.7e-59, P = 2.7e-59
 Identities = 136/312 (43%), Positives = 184/312 (58%)

Query:    48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTS----YWLGLNEFADMSHEE 103
             +  W + H + Y   EE+  R  ++++N K ID  N+E +     + + +N F DM++EE
Sbjct:    29 WHQWKATHRRLYGMNEEEWRR-AVWEKNKKIIDLHNQEYSEGKHGFRMAMNAFGDMTNEE 87

Query:   104 FKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTV 163
             F+    G + Q   + +   E    DV   PKSVDW KKG VTPVKNQG CGSCWAFS  
Sbjct:    88 FRQVMNGFQNQKHKKGKLFHEPLLVDV---PKSVDWTKKGYVTPVKNQGQCGSCWAFSAT 144

Query:   164 AAVEGINQIVSGNLTSLSEQELIDCDTSFNN-GCNGGLMDYAFKYIVASGGLHKEEDYPY 222
              A+EG     +G L SLSEQ L+DC  +  N GCNGGLMD AF+YI  +G L  EE YPY
Sbjct:   145 GALEGQMFRKTGKLVSLSEQNLVDCSRAQGNQGCNGGLMDNAFQYIKDNGCLDSEESYPY 204

Query:   223 LMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQ-PVSVAIEASGTDFQFYSGG 281
             L  +    + K E      +G+ D+P+  E++L+KA+A   P+SVAI+A  T FQFY  G
Sbjct:   205 LATDTNSCNYKPECSAANDTGFVDIPQR-EKALMKAVATVGPISVAIDAGHTSFQFYKSG 263

Query:   282 VFTGP-CGA-ELDHGVAAVGYG----KSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPE 335
             ++  P C + +LDHGV  VGYG     S  + + IVKNSWGP+WG  GY++M ++     
Sbjct:   264 IYYDPDCSSKDLDHGVLVVGYGFEGTDSNNNKFWIVKNSWGPEWGWNGYVKMAKDQNNH- 322

Query:   336 GLCGINKMASIP 347
               CGI   AS P
Sbjct:   323 --CGIATAASYP 332


>UNIPROTKB|F1S4J6 [details] [associations]
            symbol:Ssc.54235 "Cathepsin L1" species:9823 "Sus scrofa"
            [GO:0097067 "cellular response to thyroid hormone stimulus"
            evidence=IEA] [GO:0042393 "histone binding" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0004197 "cysteine-type
            endopeptidase activity" evidence=IEA] [GO:0002250 "adaptive immune
            response" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0004197
            GO:GO:0002250 GeneTree:ENSGT00560000076577 GO:GO:0097067
            EMBL:CU571031 RefSeq:XP_003130681.1 Ensembl:ENSSSCT00000011983
            GeneID:100515919 KEGG:ssc:100515919 OMA:IAICATK Uniprot:F1S4J6
        Length = 332

 Score = 608 (219.1 bits), Expect = 2.7e-59, P = 2.7e-59
 Identities = 141/330 (42%), Positives = 191/330 (57%)

Query:    30 IVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT-- 87
             I   +P H  S+D   + ++ W + H K Y   EE   R  I+++N+K I++ N E    
Sbjct:    14 IASAAPRHDHSLDA--DWYK-WKATHRKLYGLNEEGRRR-AIWEKNMKMIERHNWEHRQG 69

Query:    88 --SYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAV 145
               S+ + +N F DM++EEF+    G + Q   + +    F        P SVDWR+KG V
Sbjct:    70 KHSFTMAMNAFGDMTNEEFRKTMNGFQNQ---KHKKGKVFLDAGSALTPHSVDWREKGYV 126

Query:   146 TPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCD-TSFNNGCNGGLMDYA 204
             T VKNQG CGSCWAFS   A+EG     +  L SLSEQ L+DC     N GCNGGLMD A
Sbjct:   127 TAVKNQGHCGSCWAFSATGALEGQMFRKTSKLISLSEQNLVDCSWPEGNEGCNGGLMDNA 186

Query:   205 FKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQ-P 263
             F+YI  +GGL  EE YPY  ++G+C+ K +       +GY D+P+  E++L+KA+A   P
Sbjct:   187 FQYIKDNGGLDSEESYPYFGKDGSCKYKPQS-SAANDTGYVDIPKQ-EKALMKAVATVGP 244

Query:   264 VSVAIEASGTDFQFYSGGVFTGP-CGAE-LDHGVAAVGYGKS---KGSDYIIVKNSWGPK 318
             +SV I+AS   FQFYS G++  P C +E LDHGV  VGYG       + Y +VKNSWG  
Sbjct:   245 ISVGIDASHESFQFYSTGIYFEPQCSSEDLDHGVLVVGYGVEGAHSNNKYWLVKNSWGNT 304

Query:   319 WGERGYIRMKRNTGKPEGLCGINKMASIPL 348
             WG  GYI+M ++       CGI  MAS P+
Sbjct:   305 WGMDGYIKMTKDQNNH---CGIATMASYPV 331


>UNIPROTKB|G3R9A7 [details] [associations]
            symbol:CTSH "Uncharacterized protein" species:9595 "Gorilla
            gorilla gorilla" [GO:0001656 "metanephros development"
            evidence=ISS] [GO:0001913 "T cell mediated cytotoxicity"
            evidence=ISS] [GO:0002764 "immune response-regulating signaling
            pathway" evidence=ISS] [GO:0004175 "endopeptidase activity"
            evidence=ISS] [GO:0004177 "aminopeptidase activity" evidence=ISS]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0004252 "serine-type endopeptidase activity" evidence=ISS]
            [GO:0005615 "extracellular space" evidence=ISS] [GO:0005764
            "lysosome" evidence=ISS] [GO:0005829 "cytosol" evidence=ISS]
            [GO:0006508 "proteolysis" evidence=ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=ISS] [GO:0008284 "positive regulation
            of cell proliferation" evidence=ISS] [GO:0010628 "positive
            regulation of gene expression" evidence=ISS] [GO:0010634 "positive
            regulation of epithelial cell migration" evidence=ISS] [GO:0010813
            "neuropeptide catabolic process" evidence=ISS] [GO:0010815
            "bradykinin catabolic process" evidence=ISS] [GO:0010952 "positive
            regulation of peptidase activity" evidence=ISS] [GO:0016505
            "apoptotic protease activator activity" evidence=ISS] [GO:0030108
            "HLA-A specific activating MHC class I receptor activity"
            evidence=ISS] [GO:0030335 "positive regulation of cell migration"
            evidence=ISS] [GO:0031638 "zymogen activation" evidence=ISS]
            [GO:0031648 "protein destabilization" evidence=ISS] [GO:0032526
            "response to retinoic acid" evidence=ISS] [GO:0033619 "membrane
            protein proteolysis" evidence=ISS] [GO:0043066 "negative regulation
            of apoptotic process" evidence=ISS] [GO:0043129 "surfactant
            homeostasis" evidence=ISS] [GO:0045766 "positive regulation of
            angiogenesis" evidence=ISS] [GO:0060448 "dichotomous subdivision of
            terminal units involved in lung branching" evidence=ISS]
            [GO:0070324 "thyroid hormone binding" evidence=ISS] [GO:0070371
            "ERK1 and ERK2 cascade" evidence=ISS] [GO:0097208 "alveolar
            lamellar body" evidence=ISS] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005829 GO:GO:0043066 GO:GO:0005615 GO:GO:0008284
            GO:GO:0070371 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766
            GO:GO:0004177 GO:GO:0004252 GO:GO:0005764 GO:GO:0097208
            GO:GO:0032526 GO:GO:0010628 GO:GO:0070324 GO:GO:0016505
            GO:GO:0001656 GO:GO:0010634 GO:GO:0004197 GO:GO:0031648
            GO:GO:0031638 GO:GO:0001913 OMA:STSCHKT GO:GO:0030108 GO:GO:0010815
            GO:GO:0060448 GO:GO:0002764 GO:GO:0033619 GO:GO:0010813
            GO:GO:0043129 RefSeq:XP_004056662.1 Ensembl:ENSGGOT00000012331
            GeneID:101144312 Uniprot:G3R9A7
        Length = 335

 Score = 608 (219.1 bits), Expect = 2.7e-59, P = 2.7e-59
 Identities = 134/327 (40%), Positives = 194/327 (59%)

Query:    30 IVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSY 89
             + G +   + S++K    F SWMSKH KTY   EE  HR + F  N + I+  N    ++
Sbjct:    19 VCGAAELSVNSLEKFY--FRSWMSKHRKTYST-EEYHHRLQTFASNWRKINAHNNGNHTF 75

Query:    90 WLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSY-RDVKALPKSVDWRKKGA-VTP 147
              + LN+F+DMS  E K+KYL  +PQ       + + +Y R     P SVDWRKKG  V+P
Sbjct:    76 KMALNQFSDMSFAEIKHKYLWSEPQ----NCSATKSNYLRGTGPYPPSVDWRKKGNFVSP 131

Query:   148 VKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNN-GCNGGLMDYAFK 206
             VKNQG+CGSCW FST  A+E    I +G + SL+EQ+L+DC   FNN GC GGL   AF+
Sbjct:   132 VKNQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFE 191

Query:   207 YIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALA-HQPVS 265
             YI+ + G+  E+ YPY  ++G C+ +  +  +  +    ++   DE+++++A+A + PVS
Sbjct:   192 YILYNKGIMGEDTYPYQGKDGYCKFQPGKA-IGFVKDVANITIYDEEAMVEAVALYNPVS 250

Query:   266 VAIEASGTDFQFYSGGVFTGP-CGA---ELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGE 321
              A E +  DF  Y  G+++   C     +++H V AVGYG+  G  Y IVKNSWGPKWG 
Sbjct:   251 FAFEVT-QDFMMYRTGIYSSTSCHKTPDKVNHAVLAVGYGEKNGIPYWIVKNSWGPKWGM 309

Query:   322 RGYIRMKRNTGKPEGLCGINKMASIPL 348
              GY  ++R  GK   +CG+   AS P+
Sbjct:   310 NGYFLIER--GK--NMCGLAACASYPI 332


>RGD|2448 [details] [associations]
            symbol:Ctsl1 "cathepsin L1" species:10116 "Rattus norvegicus"
          [GO:0002250 "adaptive immune response" evidence=ISO] [GO:0004177
          "aminopeptidase activity" evidence=IDA] [GO:0004197 "cysteine-type
          endopeptidase activity" evidence=ISO;IDA] [GO:0005576 "extracellular
          region" evidence=IEA] [GO:0005737 "cytoplasm" evidence=IDA]
          [GO:0005764 "lysosome" evidence=ISO;IDA] [GO:0005773 "vacuole"
          evidence=IDA] [GO:0005902 "microvillus" evidence=IDA] [GO:0006508
          "proteolysis" evidence=IEP;ISO] [GO:0007154 "cell communication"
          evidence=IDA] [GO:0007283 "spermatogenesis" evidence=IEP] [GO:0008234
          "cysteine-type peptidase activity" evidence=ISO] [GO:0008584 "male
          gonad development" evidence=IEP] [GO:0009267 "cellular response to
          starvation" evidence=IEP] [GO:0009749 "response to glucose stimulus"
          evidence=IEP] [GO:0009897 "external side of plasma membrane"
          evidence=IDA] [GO:0010259 "multicellular organismal aging"
          evidence=IEP] [GO:0014070 "response to organic cyclic compound"
          evidence=IEP] [GO:0021675 "nerve development" evidence=IEP]
          [GO:0030984 "kininogen binding" evidence=IPI] [GO:0032403 "protein
          complex binding" evidence=IPI] [GO:0034698 "response to gonadotropin
          stimulus" evidence=IEP] [GO:0042277 "peptide binding" evidence=IDA]
          [GO:0042393 "histone binding" evidence=ISO] [GO:0043005 "neuron
          projection" evidence=IDA] [GO:0043204 "perikaryon" evidence=IDA]
          [GO:0046697 "decidualization" evidence=IEP] [GO:0048102 "autophagic
          cell death" evidence=IEP] [GO:0051384 "response to glucocorticoid
          stimulus" evidence=IEP] [GO:0060008 "Sertoli cell differentiation"
          evidence=IEP] [GO:0097067 "cellular response to thyroid hormone
          stimulus" evidence=ISO] [GO:0030141 "secretory granule" evidence=IDA]
          [GO:0045177 "apical part of cell" evidence=IDA] [GO:0060441
          "epithelial tube branching involved in lung morphogenesis"
          evidence=ISO] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
          PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 EMBL:Y00697 RGD:2448
          GO:GO:0005576 GO:GO:0009897 GO:GO:0034698 GO:GO:0043204 GO:GO:0009749
          GO:GO:0051384 eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0006508
          InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
          PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
          PROSITE:PS00139 PROSITE:PS00639 GO:GO:0043005 GO:GO:0007283
          GO:GO:0004177 GO:GO:0005764 GO:GO:0042277 GO:GO:0009267 GO:GO:0021675
          GO:GO:0005902 GO:GO:0010259 GO:GO:0004197 GO:GO:0048102 GO:GO:0046697
          GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 KO:K01365
          OrthoDB:EOG48PMKF MEROPS:C01.032 OMA:FDQNLDT CTD:1514
          BRENDA:3.4.22.15 GO:GO:0060008 EMBL:AF025476 EMBL:BC063175
          EMBL:S85184 IPI:IPI00326070 PIR:S07098 RefSeq:NP_037288.1
          UniGene:Rn.1294 ProteinModelPortal:P07154 SMR:P07154 IntAct:P07154
          STRING:P07154 PhosphoSite:P07154 PRIDE:P07154
          Ensembl:ENSRNOT00000025462 GeneID:25697 KEGG:rno:25697 UCSC:RGD:2448
          InParanoid:P07154 SABIO-RK:P07154 BindingDB:P07154 ChEMBL:CHEMBL2305
          NextBio:607715 Genevestigator:P07154 GermOnline:ENSRNOG00000018566
          Uniprot:P07154
        Length = 334

 Score = 608 (219.1 bits), Expect = 2.7e-59, P = 2.7e-59
 Identities = 131/314 (41%), Positives = 192/314 (61%)

Query:    48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTS----YWLGLNEFADMSHEE 103
             +  W S H + Y   EE+  R  ++++N++ I   N E ++    + + +N F DM++EE
Sbjct:    29 WHQWKSTHRRLYGTNEEEWRR-AVWEKNMRMIQLHNGEYSNGKHGFTMEMNAFGDMTNEE 87

Query:   104 FKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTV 163
             F+    G + Q   + +    F    +  +PK+VDWR+KG VTPVKNQG CGSCWAFS  
Sbjct:    88 FRQIVNGYRHQ---KHKKGRLFQEPLMLQIPKTVDWREKGCVTPVKNQGQCGSCWAFSAS 144

Query:   164 AAVEGINQIVSGNLTSLSEQELIDCD-TSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPY 222
               +EG   + +G L SLSEQ L+DC     N GCNGGLMD+AF+YI  +GGL  EE YPY
Sbjct:   145 GCLEGQMFLKTGKLISLSEQNLVDCSHDQGNQGCNGGLMDFAFQYIKENGGLDSEESYPY 204

Query:   223 LMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQ-PVSVAIEASGTDFQFYSGG 281
               ++G+C+  + E  V   +G+ D+P+  E++L+KA+A   P+SVA++AS    QFYS G
Sbjct:   205 EAKDGSCK-YRAEYAVANDTGFVDIPQQ-EKALMKAVATVGPISVAMDASHPSLQFYSSG 262

Query:   282 VFTGP-CGA-ELDHGVAAVGYGKSKGSD-----YIIVKNSWGPKWGERGYIRMKRNTGKP 334
             ++  P C + +LDHGV  VGYG  +G+D     Y +VKNSWG +WG  GYI++ ++    
Sbjct:   263 IYYEPNCSSKDLDHGVLVVGYGY-EGTDSNKDKYWLVKNSWGKEWGMDGYIKIAKDRNNH 321

Query:   335 EGLCGINKMASIPL 348
                CG+   AS P+
Sbjct:   322 ---CGLATAASYPI 332


>UNIPROTKB|G1K2A7
            symbol:CTSK "Cathepsin K" species:9615 "Canis lupus
            familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 InterPro:IPR015644
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00560000076577 PANTHER:PTHR12411:SF55 OMA:LKVPPSH
            EMBL:AAEX03011051 Ensembl:ENSCAFT00000019202 Uniprot:G1K2A7
        Length = 333

 Score = 607 (218.7 bits), Expect = 3.5e-59, P = 3.5e-59
 Identities = 134/308 (43%), Positives = 186/308 (60%)

Query:    48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKE----VTSYWLGLNEFADMSHEE 103
             ++ W   + K Y    ++L R  I+++NLKHI   N E    V +Y L +N   DM+ EE
Sbjct:    30 WDLWKKTYRKQYNSKVDELSRRLIWEKNLKHISIHNLEASLGVHTYELAMNHLGDMTSEE 89

Query:   104 FKNKYLGLK-PQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFST 162
                K  GLK P   +R   +      + +A P SVD+RKKG VTPVKNQG CGSCWAFS+
Sbjct:    90 VVQKMTGLKVPPSHSRSNDTLYIPDWESRA-PDSVDYRKKGYVTPVKNQGQCGSCWAFSS 148

Query:   163 VAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPY 222
             V A+EG  +  +G L +LS Q L+DC  S N+GC GG M  AF+Y+  + G+  E+ YPY
Sbjct:   149 VGALEGQLKKKTGKLLNLSPQNLVDC-VSENDGCGGGYMTNAFQYVQKNRGIDSEDAYPY 207

Query:   223 LMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQ-PVSVAIEASGTDFQFYSGG 281
             + ++ +C       +     GY+++PE +E++L +A+A   P+SVAI+AS T FQFYS G
Sbjct:   208 VGQDESCMYNPTG-KAAKCRGYREIPEGNEKALKRAVARVGPISVAIDASLTSFQFYSKG 266

Query:   282 VFTGP-CGAE-LDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCG 339
             V+    C ++ L+H V AVGYG  KG+ + I+KNSWG  WG +GYI M RN       CG
Sbjct:   267 VYYDENCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNKGYILMARNKNNA---CG 323

Query:   340 INKMASIP 347
             I  +AS P
Sbjct:   324 IANLASFP 331


>UNIPROTKB|Q3ZKN1
            symbol:CTSK "Cathepsin K" species:9615 "Canis lupus
            familiaris" [GO:0005764 "lysosome" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0006508
            "proteolysis" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            InterPro:IPR015644 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 eggNOG:COG4870 HOGENOM:HOG000230774
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            HOVERGEN:HBG011513 MEROPS:I29.007 CTD:1513 KO:K01371
            OrthoDB:EOG4SJ5FC PANTHER:PTHR12411:SF55 EMBL:AY738221
            RefSeq:NP_001029168.1 UniGene:Cfa.588 HSSP:P43235
            ProteinModelPortal:Q3ZKN1 SMR:Q3ZKN1 STRING:Q3ZKN1 GeneID:608843
            KEGG:cfa:608843 InParanoid:Q3ZKN1 NextBio:20894470 Uniprot:Q3ZKN1
        Length = 330

 Score = 607 (218.7 bits), Expect = 3.5e-59, P = 3.5e-59
 Identities = 134/308 (43%), Positives = 186/308 (60%)

Query:    48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKE----VTSYWLGLNEFADMSHEE 103
             ++ W   + K Y    ++L R  I+++NLKHI   N E    V +Y L +N   DM+ EE
Sbjct:    27 WDLWKKTYRKQYNSKVDELSRRLIWEKNLKHISIHNLEASLGVHTYELAMNHLGDMTSEE 86

Query:   104 FKNKYLGLK-PQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFST 162
                K  GLK P   +R   +      + +A P SVD+RKKG VTPVKNQG CGSCWAFS+
Sbjct:    87 VVQKMTGLKVPPSHSRSNDTLYIPDWESRA-PDSVDYRKKGYVTPVKNQGQCGSCWAFSS 145

Query:   163 VAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPY 222
             V A+EG  +  +G L +LS Q L+DC  S N+GC GG M  AF+Y+  + G+  E+ YPY
Sbjct:   146 VGALEGQLKKKTGKLLNLSPQNLVDC-VSENDGCGGGYMTNAFQYVQKNRGIDSEDAYPY 204

Query:   223 LMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQ-PVSVAIEASGTDFQFYSGG 281
             + ++ +C       +     GY+++PE +E++L +A+A   P+SVAI+AS T FQFYS G
Sbjct:   205 VGQDESCMYNPTG-KAAKCRGYREIPEGNEKALKRAVARVGPISVAIDASLTSFQFYSKG 263

Query:   282 VFTGP-CGAE-LDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCG 339
             V+    C ++ L+H V AVGYG  KG+ + I+KNSWG  WG +GYI M RN       CG
Sbjct:   264 VYYDENCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNKGYILMARNKNNA---CG 320

Query:   340 INKMASIP 347
             I  +AS P
Sbjct:   321 IANLASFP 328


>UNIPROTKB|O46427
            symbol:CTSH "Pro-cathepsin H" species:9823 "Sus scrofa"
            [GO:0002764 "immune response-regulating signaling pathway"
            evidence=ISS] [GO:0032526 "response to retinoic acid" evidence=ISS]
            [GO:0031648 "protein destabilization" evidence=ISS] [GO:0043129
            "surfactant homeostasis" evidence=ISS] [GO:0010815 "bradykinin
            catabolic process" evidence=ISS] [GO:0010813 "neuropeptide
            catabolic process" evidence=ISS] [GO:0005829 "cytosol"
            evidence=ISS] [GO:0045766 "positive regulation of angiogenesis"
            evidence=ISS] [GO:0008284 "positive regulation of cell
            proliferation" evidence=ISS] [GO:0008234 "cysteine-type peptidase
            activity" evidence=ISS] [GO:0030335 "positive regulation of cell
            migration" evidence=ISS] [GO:0001656 "metanephros development"
            evidence=ISS] [GO:0033619 "membrane protein proteolysis"
            evidence=ISS] [GO:0043066 "negative regulation of apoptotic
            process" evidence=ISS] [GO:0016505 "apoptotic protease activator
            activity" evidence=ISS] [GO:0004252 "serine-type endopeptidase
            activity" evidence=ISS] [GO:0010952 "positive regulation of
            peptidase activity" evidence=ISS] [GO:0001913 "T cell mediated
            cytotoxicity" evidence=ISS] [GO:0031638 "zymogen activation"
            evidence=ISS] [GO:0030108 "HLA-A specific activating MHC class I
            receptor activity" evidence=ISS] [GO:0070371 "ERK1 and ERK2
            cascade" evidence=ISS] [GO:0010628 "positive regulation of gene
            expression" evidence=ISS] [GO:0070324 "thyroid hormone binding"
            evidence=ISS] [GO:0006508 "proteolysis" evidence=ISS] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=ISS] [GO:0060448
            "dichotomous subdivision of terminal units involved in lung
            branching" evidence=ISS] [GO:0097208 "alveolar lamellar body"
            evidence=ISS] [GO:0005764 "lysosome" evidence=ISS] [GO:0005615
            "extracellular space" evidence=ISS] [GO:0004177 "aminopeptidase
            activity" evidence=ISS] [GO:0004175 "endopeptidase activity"
            evidence=ISS] [GO:0010634 "positive regulation of epithelial cell
            migration" evidence=ISS] [GO:0097067 "cellular response to thyroid
            hormone stimulus" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0005829 GO:GO:0043066 GO:GO:0005615
            GO:GO:0008284 GO:GO:0070371 eggNOG:COG4870 HOGENOM:HOG000230774
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766 GO:GO:0004177
            GO:GO:0004252 GO:GO:0005764 GO:GO:0097208 GO:GO:0032526
            GO:GO:0010628 GO:GO:0070324 KO:K01366 GO:GO:0016505 GO:GO:0001656
            GO:GO:0010634 GO:GO:0004197 GO:GO:0031648 HOVERGEN:HBG011513
            GO:GO:0097067 GO:GO:0031638 GO:GO:0001913
            GeneTree:ENSGT00660000095458 MEROPS:C01.040 CTD:1512 OMA:STSCHKT
            GO:GO:0030108 GO:GO:0010815 GO:GO:0060448 GO:GO:0002764
            GO:GO:0033619 GO:GO:0010813 GO:GO:0043129 EMBL:AF001169
            RefSeq:NP_999094.1 UniGene:Ssc.3593 PDB:1NB3 PDB:1NB5 PDB:8PCH
            PDBsum:1NB3 PDBsum:1NB5 PDBsum:8PCH ProteinModelPortal:O46427
            SMR:O46427 Ensembl:ENSSSCT00000001983 GeneID:396969 KEGG:ssc:396969
            EvolutionaryTrace:O46427 ArrayExpress:O46427 Uniprot:O46427
        Length = 335

 Score = 607 (218.7 bits), Expect = 3.5e-59, P = 3.5e-59
 Identities = 133/325 (40%), Positives = 198/325 (60%)

Query:    32 GYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWL 91
             G S   ++S +KL   F+SWM +H K Y  +EE  HR ++F  N + I+  N    ++ L
Sbjct:    21 GASNLAVSSFEKLH--FKSWMVQHQKKYS-LEEYHHRLQVFVSNWRKINAHNAGNHTFKL 77

Query:    92 GLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSY-RDVKALPKSVDWRKKGA-VTPVK 149
             GLN+F+DMS +E ++KYL  +PQ       + + +Y R     P S+DWRKKG  V+PVK
Sbjct:    78 GLNQFSDMSFDEIRHKYLWSEPQ----NCSATKGNYLRGTGPYPPSMDWRKKGNFVSPVK 133

Query:   150 NQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNN-GCNGGLMDYAFKYI 208
             NQGSCGSCW FST  A+E    I +G + SL+EQ+L+DC  +FNN GC GGL   AF+YI
Sbjct:   134 NQGSCGSCWTFSTTGALESAVAIATGKMLSLAEQQLVDCAQNFNNHGCQGGLPSQAFEYI 193

Query:   209 VASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALA-HQPVSVA 267
               + G+  E+ YPY  ++  C+ + ++  +  +    ++  NDE+++++A+A + PVS A
Sbjct:   194 RYNKGIMGEDTYPYKGQDDHCKFQPDKA-IAFVKDVANITMNDEEAMVEAVALYNPVSFA 252

Query:   268 IEASGTDFQFYSGGVFTGP-CGA---ELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERG 323
              E +  DF  Y  G+++   C     +++H V AVGYG+  G  Y IVKNSWGP+WG  G
Sbjct:   253 FEVTN-DFLMYRKGIYSSTSCHKTPDKVNHAVLAVGYGEENGIPYWIVKNSWGPQWGMNG 311

Query:   324 YIRMKRNTGKPEGLCGINKMASIPL 348
             Y  ++R  GK   +CG+   AS P+
Sbjct:   312 YFLIER--GK--NMCGLAACASYPI 332


>UNIPROTKB|Q24940
            symbol:Cat-1 "Cathepsin L-like proteinase" species:6192
            "Fasciola hepatica" [GO:0004175 "endopeptidase activity"
            evidence=IDA] [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=IDA] [GO:0005576 "extracellular region" evidence=IDA]
            [GO:0006508 "proteolysis" evidence=IDA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0005576 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0004197 EMBL:L33771 PIR:S43991 PDB:2O6X
            PDBsum:2O6X ProteinModelPortal:Q24940 SMR:Q24940 MEROPS:C01.033
            EvolutionaryTrace:Q24940 Uniprot:Q24940
        Length = 326

 Score = 607 (218.7 bits), Expect = 3.5e-59, P = 3.5e-59
 Identities = 129/313 (41%), Positives = 183/313 (58%)

Query:    46 ELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKE----VTSYWLGLNEFADMSH 101
             +L+  W   + K Y   +++ HR  I+++N+KHI + N      + +Y LGLN+F DM+ 
Sbjct:    19 DLWHQWKRMYNKEYNGADDQ-HRRNIWEKNVKHIQEHNLRHDLGLVTYTLGLNQFTDMTF 77

Query:   102 EEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFS 161
             EEFK KYL    +          +   + +A+P  +DWR+ G VT VK+QG+CGSCWAFS
Sbjct:    78 EEFKAKYLTEMSRASDILSHGVPYEANN-RAVPDKIDWRESGYVTEVKDQGNCGSCWAFS 136

Query:   162 TVAAVEGINQIVSGNLTSLS--EQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEE 218
             T   +EG  Q +    TS+S  EQ+L+DC   + NNGC+GGLM+ A++Y+    GL  E 
Sbjct:   137 TTGTMEG--QYMKNERTSISFSEQQLVDCSGPWGNNGCSGGLMENAYQYL-KQFGLETES 193

Query:   219 DYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKAL-AHQPVSVAIEASGTDFQF 277
              YPY   EG C   K+ + V  ++GY  V    E  L   + A +P +VA++   +DF  
Sbjct:   194 SYPYTAVEGQCRYNKQ-LGVAKVTGYYTVHSGSEVELKNLVGARRPAAVAVDVE-SDFMM 251

Query:   278 YSGGVFTGP-CGA-ELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPE 335
             Y  G++    C    ++H V AVGYG   G+DY IVKNSWG  WGERGYIRM RN G   
Sbjct:   252 YRSGIYQSQTCSPLRVNHAVLAVGYGTQGGTDYWIVKNSWGTYWGERGYIRMARNRGN-- 309

Query:   336 GLCGINKMASIPL 348
              +CGI  +AS+P+
Sbjct:   310 -MCGIASLASLPM 321


>UNIPROTKB|Q86GF7
            symbol:Cys "Crustapain" species:6703 "Pandalus borealis"
            [GO:0005576 "extracellular region" evidence=IC] [GO:0007586
            "digestion" evidence=NAS] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IDA] [GO:0030163 "protein catabolic process"
            evidence=NAS] [GO:0030574 "collagen catabolic process"
            evidence=IDA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 GO:GO:0005576
            GO:GO:0007586 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0030163 GO:GO:0030574 EMBL:AB091669
            ProteinModelPortal:Q86GF7 SMR:Q86GF7 MEROPS:C01.030 Uniprot:Q86GF7
        Length = 323

 Score = 607 (218.7 bits), Expect = 3.5e-59, P = 3.5e-59
 Identities = 132/295 (44%), Positives = 177/295 (60%)

Query:    48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNK-----EVTSYWLGLNEFADMSHE 102
             +E++ +K GK Y   EE+ HR  +F + LK I + N+     EVT YWL +N F+D++HE
Sbjct:    20 WENFKTKFGKKYANSEEESHRMSVFMDKLKFIQEHNERYDKGEVT-YWLKINNFSDLTHE 78

Query:   103 EFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKS--VDWRKKGAVTPVKNQGSCGSCWAF 160
             E     L  K     RR P +    +     P +  VDWR KGAVTPVK+QG CGSCWAF
Sbjct:    79 EV----LATKTGMTRRRHPLSVLP-KSAPTTPMAADVDWRNKGAVTPVKDQGQCGSCWAF 133

Query:   161 STVAAVEGINQIVSGNLTSLSEQELIDCDTSFNN-GCNGGLMDYAFKYIVASGGLHKEED 219
             S VAA+EG + + +G+L SLSEQ L+DC +S+ N GCNGG    A++YI+A+ G+  E  
Sbjct:   134 SAVAALEGAHFLKTGDLVSLSEQNLVDCSSSYGNQGCNGGWPYQAYQYIIANRGIDTESS 193

Query:   220 YPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQ-PVSVAIEASGTDFQFY 278
             YPY   +  C      +   T+S Y +    DE +L  A+ ++ PVSV I+A  + F  Y
Sbjct:   194 YPYKAIDDNCRYDAGNIGA-TVSSYVEPASGDESALQHAVQNEGPVSVCIDAGQSSFGSY 252

Query:   279 SGGVFTGP-CGA-ELDHGVAAVGYGK-SKGSDYIIVKNSWGPKWGERGYIRMKRN 330
              GGV+  P C +   +H V AVGYG  + G DY IVKNSWG  WGE GYI+M RN
Sbjct:   253 GGGVYYEPNCDSWYANHAVTAVGYGTDANGGDYWIVKNSWGAWWGESGYIKMARN 307


>UNIPROTKB|P07711
            symbol:CTSL1 "Cathepsin L1" species:9606 "Homo sapiens"
            [GO:0005576 "extracellular region" evidence=NAS] [GO:0005764
            "lysosome" evidence=IDA;NAS] [GO:0006508 "proteolysis"
            evidence=IDA] [GO:0002250 "adaptive immune response" evidence=IEP]
            [GO:0019882 "antigen processing and presentation" evidence=TAS]
            [GO:0097067 "cellular response to thyroid hormone stimulus"
            evidence=IEP] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IDA] [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=IDA] [GO:0019886 "antigen processing and presentation of
            exogenous peptide antigen via MHC class II" evidence=TAS]
            [GO:0036021 "endolysosome lumen" evidence=TAS] [GO:0043202
            "lysosomal lumen" evidence=TAS] [GO:0045087 "innate immune
            response" evidence=TAS] [GO:0005515 "protein binding" evidence=IPI]
            [GO:0042393 "histone binding" evidence=IDA] [GO:0005634 "nucleus"
            evidence=TAS] [GO:0071888 "macrophage apoptotic process"
            evidence=NAS] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            Reactome:REACT_118779 EMBL:X12451 GO:GO:0005634 Reactome:REACT_6900
            GO:GO:0005576 GO:GO:0019886 eggNOG:COG4870 HOGENOM:HOG000230774
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045087
            GO:GO:0042393 GO:GO:0004197 GO:GO:0002250 HOVERGEN:HBG011513
            GO:GO:0097067 GO:GO:0036021 KO:K01365 OrthoDB:EOG48PMKF EMBL:M20496
            EMBL:CR457053 EMBL:BX537395 EMBL:AL160279 EMBL:BC012612 EMBL:X05256
            IPI:IPI00012887 PIR:S01002 RefSeq:NP_001244900.1
            RefSeq:NP_001244901.1 RefSeq:NP_001903.1 RefSeq:NP_666023.1
            UniGene:Hs.731507 UniGene:Hs.731952 PDB:1CJL PDB:1CS8 PDB:1ICF
            PDB:1MHW PDB:2NQD PDB:2VHS PDB:2XU1 PDB:2XU3 PDB:2XU4 PDB:2XU5
            PDB:2YJ2 PDB:2YJ8 PDB:2YJ9 PDB:2YJB PDB:2YJC PDB:3BC3 PDB:3H89
            PDB:3H8B PDB:3H8C PDB:3HHA PDB:3HWN PDB:3IV2 PDB:3K24 PDB:3KSE
            PDB:3OF8 PDB:3OF9 PDBsum:1CJL PDBsum:1CS8 PDBsum:1ICF PDBsum:1MHW
            PDBsum:2NQD PDBsum:2VHS PDBsum:2XU1 PDBsum:2XU3 PDBsum:2XU4
            PDBsum:2XU5 PDBsum:2YJ2 PDBsum:2YJ8 PDBsum:2YJ9 PDBsum:2YJB
            PDBsum:2YJC PDBsum:3BC3 PDBsum:3H89 PDBsum:3H8B PDBsum:3H8C
            PDBsum:3HHA PDBsum:3HWN PDBsum:3IV2 PDBsum:3K24 PDBsum:3KSE
            PDBsum:3OF8 PDBsum:3OF9 ProteinModelPortal:P07711 SMR:P07711
            IntAct:P07711 STRING:P07711 MEROPS:I29.001 PhosphoSite:P07711
            DMDM:115741 PaxDb:P07711 PeptideAtlas:P07711 PRIDE:P07711
            DNASU:1514 Ensembl:ENST00000340342 Ensembl:ENST00000343150
            GeneID:1514 KEGG:hsa:1514 UCSC:uc004aph.3 CTD:1514
            GeneCards:GC09P090341 H-InvDB:HIX0058839 H-InvDB:HIX0170314
            HGNC:HGNC:2537 HPA:CAB000459 MIM:116880 neXtProt:NX_P07711
            PharmGKB:PA162382890 InParanoid:P07711 OMA:REPLFAQ PhylomeDB:P07711
            BRENDA:3.4.22.15 BindingDB:P07711 ChEMBL:CHEMBL3837 ChiTaRS:CTSL1
            DrugBank:DB00040 EvolutionaryTrace:P07711 GenomeRNAi:1514
            NextBio:6271 PMAP-CutDB:P07711 ArrayExpress:P07711 Bgee:P07711
            CleanEx:HS_CTSL1 Genevestigator:P07711 GermOnline:ENSG00000135047
            GO:GO:0071888 Uniprot:P07711
        Length = 333

 Score = 606 (218.4 bits), Expect = 4.5e-59, P = 4.5e-59
 Identities = 137/329 (41%), Positives = 192/329 (58%)

Query:    31 VGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT--- 87
             +G +   LT    L   +  W + H + Y   EE   R  ++++N+K I+  N+E     
Sbjct:    12 LGIASATLTFDHSLEAQWTKWKAMHNRLYGMNEEGWRR-AVWEKNMKMIELHNQEYREGK 70

Query:    88 -SYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVT 146
              S+ + +N F DM+ EEF+    G + + P + +   E  + +    P+SVDWR+KG VT
Sbjct:    71 HSFTMAMNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYEA---PRSVDWREKGYVT 127

Query:   147 PVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCD-TSFNNGCNGGLMDYAF 205
             PVKNQG CGSCWAFS   A+EG     +G L SLSEQ L+DC     N GCNGGLMDYAF
Sbjct:   128 PVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAF 187

Query:   206 KYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQ-PV 264
             +Y+  +GGL  EE YPY   E +C+    +  V   +G+ D+P+  E++L+KA+A   P+
Sbjct:   188 QYVQDNGGLDSEESYPYEATEESCK-YNPKYSVANDTGFVDIPKQ-EKALMKAVATVGPI 245

Query:   265 SVAIEASGTDFQFYSGGVFTGP-CGAE-LDHGVAAVGYG-KSKGSD---YIIVKNSWGPK 318
             SVAI+A    F FY  G++  P C +E +DHGV  VGYG +S  SD   Y +VKNSWG +
Sbjct:   246 SVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEE 305

Query:   319 WGERGYIRMKRNTGKPEGLCGINKMASIP 347
             WG  GY++M ++       CGI   AS P
Sbjct:   306 WGMGGYVKMAKDR---RNHCGIASAASYP 331


>UNIPROTKB|Q5E968
            symbol:CTSK "Cathepsin K" species:9913 "Bos taurus"
            [GO:0005764 "lysosome" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            InterPro:IPR015644 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 eggNOG:COG4870 HOGENOM:HOG000230774
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 EMBL:BT021052
            EMBL:BC109853 IPI:IPI00709374 RefSeq:NP_001029607.1
            UniGene:Bt.23218 ProteinModelPortal:Q5E968 SMR:Q5E968 STRING:Q5E968
            MEROPS:I29.007 PRIDE:Q5E968 Ensembl:ENSBTAT00000028016
            GeneID:513038 KEGG:bta:513038 CTD:1513 InParanoid:Q5E968 KO:K01371
            OrthoDB:EOG4SJ5FC NextBio:20870669 PANTHER:PTHR12411:SF55
            Uniprot:Q5E968
        Length = 329

 Score = 604 (217.7 bits), Expect = 7.3e-59, P = 7.3e-59
 Identities = 135/309 (43%), Positives = 184/309 (59%)

Query:    48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKE----VTSYWLGLNEFADMSHEE 103
             +E W   + K Y    +++ R  I+++NLKHI   N E    V +Y L +N   DM+ EE
Sbjct:    26 WELWKKTYRKQYNSKGDEISRRLIWEKNLKHISIHNLEASLGVHTYELAMNHLGDMTSEE 85

Query:   104 FKNKYLGLKPQFPTRRQPSAEFSY-RDVKA-LPKSVDWRKKGAVTPVKNQGSCGSCWAFS 161
                K  GLK   P  R  S +  Y  D +   P SVD+RKKG VTPVKNQG CGSCWAFS
Sbjct:    86 VVQKMTGLK--VPASRSRSNDTLYIPDWEGRAPDSVDYRKKGYVTPVKNQGQCGSCWAFS 143

Query:   162 TVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYP 221
             +V A+EG  +  +G L +LS Q L+DC  S N+GC GG M  AF+Y+  + G+  E+ YP
Sbjct:   144 SVGALEGQLKKKTGKLLNLSPQNLVDC-VSENDGCGGGYMTNAFQYVQKNRGIDSEDAYP 202

Query:   222 YLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQ-PVSVAIEASGTDFQFYSG 280
             Y+ ++  C       +     GY+++PE +E++L +A+A   P+SVAI+AS T FQFY  
Sbjct:   203 YVGQDENCMYNPTG-KAAKCRGYREIPEGNEKALKRAVARVGPISVAIDASLTSFQFYRK 261

Query:   281 GVFTGP-CGAE-LDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLC 338
             GV+    C ++ L+H V AVGYG  KG+ + I+KNSWG  WG +GYI M RN       C
Sbjct:   262 GVYYDENCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNKGYILMARNKNNA---C 318

Query:   339 GINKMASIP 347
             GI  +AS P
Sbjct:   319 GIANLASFP 327


>UNIPROTKB|Q9GLE3
            symbol:CTSK "Cathepsin K" species:9823 "Sus scrofa"
            [GO:0005764 "lysosome" evidence=IEA] [GO:0005576 "extracellular
            region" evidence=IEA] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 InterPro:IPR015644
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005576 eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 MEROPS:I29.007
            CTD:1513 KO:K01371 OrthoDB:EOG4SJ5FC PANTHER:PTHR12411:SF55
            OMA:LKVPPSH EMBL:AF292030 RefSeq:NP_999467.1 UniGene:Ssc.1020
            ProteinModelPortal:Q9GLE3 SMR:Q9GLE3 STRING:Q9GLE3
            Ensembl:ENSSSCT00000007283 GeneID:397569 KEGG:ssc:397569
            ArrayExpress:Q9GLE3 Uniprot:Q9GLE3
        Length = 330

 Score = 603 (217.3 bits), Expect = 9.3e-59, P = 9.3e-59
 Identities = 133/308 (43%), Positives = 183/308 (59%)

Query:    48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKE----VTSYWLGLNEFADMSHEE 103
             +E W   + K Y    +++ R  I+++NLKHI   N E    V +Y L +N   DM+ EE
Sbjct:    27 WELWKKTYRKQYNSKVDEISRRLIWEKNLKHISIHNLEASLGVHTYELAMNHLGDMTSEE 86

Query:   104 FKNKYLGLKPQFPTRRQPSAEFSYRDVKA-LPKSVDWRKKGAVTPVKNQGSCGSCWAFST 162
                K  GLK   P+  + +      D +   P S+D+RKKG VTPVKNQG CGSCWAFS+
Sbjct:    87 VVQKMTGLKVP-PSHSRSNDTLYIPDWEGRTPDSIDYRKKGYVTPVKNQGQCGSCWAFSS 145

Query:   163 VAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPY 222
             V A+EG  +  +G L +LS Q L+DC  S N+GC GG M  AF+Y+  + G+  E+ YPY
Sbjct:   146 VGALEGQLKKKTGKLLNLSPQNLVDC-VSENDGCGGGYMTNAFQYVQKNRGIDSEDAYPY 204

Query:   223 LMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQ-PVSVAIEASGTDFQFYSGG 281
             + ++  C       +     GY+++PE +E++L +A+A   PVSVAI+AS T FQFYS G
Sbjct:   205 VGQDENCMYNPTG-KAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDASLTSFQFYSKG 263

Query:   282 VFTGP-CGAE-LDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCG 339
             V+    C ++ L+H V AVGYG  KG  + I+KNSWG  WG +GYI M RN       CG
Sbjct:   264 VYYDENCNSDNLNHAVLAVGYGIQKGKKHWIIKNSWGENWGNKGYILMARNKNNA---CG 320

Query:   340 INKMASIP 347
             I  +AS P
Sbjct:   321 IANLASFP 328


>UNIPROTKB|F6R7P5
            symbol:CTSH "Uncharacterized protein" species:9544 "Macaca
            mulatta" [GO:0001656 "metanephros development" evidence=ISS]
            [GO:0001913 "T cell mediated cytotoxicity" evidence=ISS]
            [GO:0002764 "immune response-regulating signaling pathway"
            evidence=ISS] [GO:0004175 "endopeptidase activity" evidence=ISS]
            [GO:0004177 "aminopeptidase activity" evidence=ISS] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=ISS] [GO:0004252
            "serine-type endopeptidase activity" evidence=ISS] [GO:0005615
            "extracellular space" evidence=ISS] [GO:0005764 "lysosome"
            evidence=ISS] [GO:0005829 "cytosol" evidence=ISS] [GO:0006508
            "proteolysis" evidence=ISS] [GO:0008234 "cysteine-type peptidase
            activity" evidence=ISS] [GO:0008284 "positive regulation of cell
            proliferation" evidence=ISS] [GO:0010628 "positive regulation of
            gene expression" evidence=ISS] [GO:0010634 "positive regulation of
            epithelial cell migration" evidence=ISS] [GO:0010813 "neuropeptide
            catabolic process" evidence=ISS] [GO:0010815 "bradykinin catabolic
            process" evidence=ISS] [GO:0010952 "positive regulation of
            peptidase activity" evidence=ISS] [GO:0016505 "apoptotic protease
            activator activity" evidence=ISS] [GO:0030108 "HLA-A specific
            activating MHC class I receptor activity" evidence=ISS] [GO:0030335
            "positive regulation of cell migration" evidence=ISS] [GO:0031638
            "zymogen activation" evidence=ISS] [GO:0031648 "protein
            destabilization" evidence=ISS] [GO:0032526 "response to retinoic
            acid" evidence=ISS] [GO:0033619 "membrane protein proteolysis"
            evidence=ISS] [GO:0043066 "negative regulation of apoptotic
            process" evidence=ISS] [GO:0043129 "surfactant homeostasis"
            evidence=ISS] [GO:0045766 "positive regulation of angiogenesis"
            evidence=ISS] [GO:0060448 "dichotomous subdivision of terminal
            units involved in lung branching" evidence=ISS] [GO:0070324
            "thyroid hormone binding" evidence=ISS] [GO:0070371 "ERK1 and ERK2
            cascade" evidence=ISS] [GO:0097208 "alveolar lamellar body"
            evidence=ISS] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 GO:GO:0005829
            GO:GO:0043066 GO:GO:0005615 GO:GO:0008284 GO:GO:0070371
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766 GO:GO:0004177
            GO:GO:0004252 GO:GO:0005764 GO:GO:0097208 GO:GO:0032526
            GO:GO:0010628 GO:GO:0070324 KO:K01366 GO:GO:0016505 GO:GO:0001656
            GO:GO:0010634 GO:GO:0004197 GO:GO:0031648 GO:GO:0097067
            GO:GO:0031638 GO:GO:0001913 GeneTree:ENSGT00660000095458 CTD:1512
            OMA:STSCHKT GO:GO:0030108 GO:GO:0010815 GO:GO:0060448 GO:GO:0002764
            GO:GO:0033619 GO:GO:0010813 GO:GO:0043129 RefSeq:XP_001108862.1
            UniGene:Mmu.3000 Ensembl:ENSMMUT00000014095 GeneID:711437
            KEGG:mcc:711437 NextBio:19969972 Uniprot:F6R7P5
        Length = 335

 Score = 603 (217.3 bits), Expect = 9.3e-59, P = 9.3e-59
 Identities = 132/327 (40%), Positives = 195/327 (59%)

Query:    30 IVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSY 89
             + G +   + S++K    F+SWMSKH KTY   EE  HR + F  N + I+  N    ++
Sbjct:    19 VCGAAELSVNSLEKFH--FKSWMSKHHKTYST-EEYHHRMQTFASNWRKINAHNNGNHTF 75

Query:    90 WLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSY-RDVKALPKSVDWRKKGA-VTP 147
              + LN+F+DMS  E K+KYL  +PQ       + + +Y R     P S+DWRKKG  V+P
Sbjct:    76 KMALNQFSDMSFAEIKHKYLWSEPQ----NCSATKSNYLRGTGPYPPSMDWRKKGNFVSP 131

Query:   148 VKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNN-GCNGGLMDYAFK 206
             VKNQG+CGSCW FST  A+E    I +G + SL+EQ+L+DC   FNN GC GGL   AF+
Sbjct:   132 VKNQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFE 191

Query:   207 YIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALA-HQPVS 265
             YI+ + G+  E+ YPY  ++G C+ +  +  +  +    ++   DE+++++A+A + PVS
Sbjct:   192 YILYNKGIMGEDTYPYQGKDGDCKFRPGKA-IGFVKDVANITIYDEEAMVEAVALYNPVS 250

Query:   266 VAIEASGTDFQFYSGGVFTGP-CGA---ELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGE 321
              A E +  DF  Y  G+++   C     +++H V AVGYG+  G  Y IVKNSWGP+WG 
Sbjct:   251 FAFEVT-QDFMIYKTGIYSSTSCHKTPDKVNHAVLAVGYGEENGIPYWIVKNSWGPQWGM 309

Query:   322 RGYIRMKRNTGKPEGLCGINKMASIPL 348
              GY  ++R  GK   +CG+   AS P+
Sbjct:   310 NGYFLIER--GK--NMCGLAACASYPI 332


>UNIPROTKB|G1RBY1
            symbol:CTSH "Uncharacterized protein" species:61853
            "Nomascus leucogenys" [GO:0001656 "metanephros development"
            evidence=ISS] [GO:0001913 "T cell mediated cytotoxicity"
            evidence=ISS] [GO:0002764 "immune response-regulating signaling
            pathway" evidence=ISS] [GO:0004175 "endopeptidase activity"
            evidence=ISS] [GO:0004177 "aminopeptidase activity" evidence=ISS]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0004252 "serine-type endopeptidase activity" evidence=ISS]
            [GO:0005615 "extracellular space" evidence=ISS] [GO:0005764
            "lysosome" evidence=ISS] [GO:0005829 "cytosol" evidence=ISS]
            [GO:0006508 "proteolysis" evidence=ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=ISS] [GO:0008284 "positive regulation
            of cell proliferation" evidence=ISS] [GO:0010628 "positive
            regulation of gene expression" evidence=ISS] [GO:0010634 "positive
            regulation of epithelial cell migration" evidence=ISS] [GO:0010813
            "neuropeptide catabolic process" evidence=ISS] [GO:0010815
            "bradykinin catabolic process" evidence=ISS] [GO:0010952 "positive
            regulation of peptidase activity" evidence=ISS] [GO:0016505
            "apoptotic protease activator activity" evidence=ISS] [GO:0030108
            "HLA-A specific activating MHC class I receptor activity"
            evidence=ISS] [GO:0030335 "positive regulation of cell migration"
            evidence=ISS] [GO:0031638 "zymogen activation" evidence=ISS]
            [GO:0031648 "protein destabilization" evidence=ISS] [GO:0032526
            "response to retinoic acid" evidence=ISS] [GO:0033619 "membrane
            protein proteolysis" evidence=ISS] [GO:0043066 "negative regulation
            of apoptotic process" evidence=ISS] [GO:0043129 "surfactant
            homeostasis" evidence=ISS] [GO:0045766 "positive regulation of
            angiogenesis" evidence=ISS] [GO:0060448 "dichotomous subdivision of
            terminal units involved in lung branching" evidence=ISS]
            [GO:0070324 "thyroid hormone binding" evidence=ISS] [GO:0070371
            "ERK1 and ERK2 cascade" evidence=ISS] [GO:0097208 "alveolar
            lamellar body" evidence=ISS] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005829 GO:GO:0043066 GO:GO:0005615 GO:GO:0008284
            GO:GO:0070371 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766
            GO:GO:0004177 GO:GO:0004252 GO:GO:0005764 GO:GO:0097208
            GO:GO:0032526 GO:GO:0010628 GO:GO:0070324 GO:GO:0016505
            GO:GO:0001656 GO:GO:0010634 GO:GO:0004197 GO:GO:0031648
            GO:GO:0097067 GO:GO:0031638 GO:GO:0001913
            GeneTree:ENSGT00660000095458 CTD:1512 OMA:STSCHKT GO:GO:0030108
            GO:GO:0010815 GO:GO:0060448 GO:GO:0002764 GO:GO:0033619
            GO:GO:0010813 GO:GO:0043129 EMBL:ADFV01087552 RefSeq:XP_003275518.1
            Ensembl:ENSNLET00000011249 GeneID:100584322 Uniprot:G1RBY1
        Length = 335

 Score = 603 (217.3 bits), Expect = 9.3e-59, P = 9.3e-59
 Identities = 132/327 (40%), Positives = 196/327 (59%)

Query:    30 IVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSY 89
             + G +   + S++K    F+SWMSKH KTY   EE  HR ++F  N + I+  N    ++
Sbjct:    19 VCGAAELSVNSLEKFH--FKSWMSKHHKTYST-EEYHHRLQMFASNWRKINAHNNGNHTF 75

Query:    90 WLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSY-RDVKALPKSVDWRKKGA-VTP 147
              + LN+F+DMS  E K+KYL  +PQ       + + +Y R     P S+DWRKKG  V+P
Sbjct:    76 KMALNQFSDMSFAEIKHKYLWSEPQ----NCSATKSNYLRGTGPYPPSMDWRKKGNFVSP 131

Query:   148 VKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNN-GCNGGLMDYAFK 206
             VKNQG+CGSCW FST  A+E    I +G + SL+EQ+L+DC   FNN GC GGL   AF+
Sbjct:   132 VKNQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFE 191

Query:   207 YIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALA-HQPVS 265
             YI+ + G+  E+ YPY  ++G C+ +  +  +  +    ++   DE+++++A+A + PVS
Sbjct:   192 YILYNKGIMGEDTYPYQGKDGYCKFRPGKA-IGFVKDVANITIYDEEAMVEAVALYNPVS 250

Query:   266 VAIEASGTDFQFYSGGVFTGP-CGA---ELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGE 321
              A E +  DF  Y  G+++   C     +++H V AVGYG+  G  Y IVKNSWGP+WG 
Sbjct:   251 FAFEVT-QDFMMYRRGIYSSTSCHKTPDKVNHAVLAVGYGEKNGIPYWIVKNSWGPQWGM 309

Query:   322 RGYIRMKRNTGKPEGLCGINKMASIPL 348
              GY  ++R  GK   +CG+   AS P+
Sbjct:   310 NGYFLIER--GK--NMCGLAACASYPI 332


>UNIPROTKB|P09668
            symbol:CTSH "Pro-cathepsin H" species:9606 "Homo sapiens"
            [GO:0001520 "outer dense fiber" evidence=IEA] [GO:0001669
            "acrosomal vesicle" evidence=IEA] [GO:0007283 "spermatogenesis"
            evidence=IEA] [GO:0030984 "kininogen binding" evidence=IEA]
            [GO:0032403 "protein complex binding" evidence=IEA] [GO:0043621
            "protein self-association" evidence=IEA] [GO:0005515 "protein
            binding" evidence=IPI] [GO:0031648 "protein destabilization"
            evidence=IMP] [GO:0010634 "positive regulation of epithelial cell
            migration" evidence=ISS] [GO:0032526 "response to retinoic acid"
            evidence=ISS] [GO:0060448 "dichotomous subdivision of terminal
            units involved in lung branching" evidence=ISS] [GO:0030108 "HLA-A
            specific activating MHC class I receptor activity" evidence=IDA]
            [GO:0097067 "cellular response to thyroid hormone stimulus"
            evidence=IEP] [GO:0010813 "neuropeptide catabolic process"
            evidence=IDA] [GO:0010815 "bradykinin catabolic process"
            evidence=IDA] [GO:0030335 "positive regulation of cell migration"
            evidence=IDA] [GO:0070371 "ERK1 and ERK2 cascade" evidence=IDA]
            [GO:0010628 "positive regulation of gene expression" evidence=IDA]
            [GO:0006508 "proteolysis" evidence=IDA;TAS] [GO:0031638 "zymogen
            activation" evidence=IDA] [GO:0016505 "apoptotic protease activator
            activity" evidence=IDA] [GO:0010952 "positive regulation of
            peptidase activity" evidence=IDA] [GO:0001913 "T cell mediated
            cytotoxicity" evidence=ISS] [GO:0004252 "serine-type endopeptidase
            activity" evidence=ISS] [GO:0043066 "negative regulation of
            apoptotic process" evidence=ISS] [GO:0005829 "cytosol"
            evidence=IDA] [GO:0033619 "membrane protein proteolysis"
            evidence=IDA] [GO:0004175 "endopeptidase activity" evidence=IDA]
            [GO:0004177 "aminopeptidase activity" evidence=IDA] [GO:0005764
            "lysosome" evidence=IDA] [GO:0001656 "metanephros development"
            evidence=ISS] [GO:0002250 "adaptive immune response" evidence=IEP]
            [GO:0019882 "antigen processing and presentation" evidence=TAS]
            [GO:0002764 "immune response-regulating signaling pathway"
            evidence=IDA] [GO:0005615 "extracellular space" evidence=IDA]
            [GO:0070324 "thyroid hormone binding" evidence=IDA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IDA] [GO:0008284
            "positive regulation of cell proliferation" evidence=ISS]
            [GO:0045766 "positive regulation of angiogenesis" evidence=ISS]
            [GO:0008233 "peptidase activity" evidence=IDA] [GO:0097208
            "alveolar lamellar body" evidence=IDA] [GO:0043129 "surfactant
            homeostasis" evidence=IDA] [GO:0004197 "cysteine-type endopeptidase
            activity" evidence=IDA;TAS] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005829 Reactome:REACT_6900 GO:GO:0043066 GO:GO:0005615
            GO:GO:0008284 GO:GO:0070371 eggNOG:COG4870 HOGENOM:HOG000230774
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766 GO:GO:0004177
            GO:GO:0004252 GO:GO:0005764 GO:GO:0097208 GO:GO:0032526
            GO:GO:0010628 GO:GO:0070324 KO:K01366 GO:GO:0016505 GO:GO:0001656
            GO:GO:0010634 GO:GO:0004197 GO:GO:0031648 HOVERGEN:HBG011513
            GO:GO:0097067 GO:GO:0031638 GO:GO:0001913 MEROPS:C01.040 CTD:1512
            OMA:STSCHKT OrthoDB:EOG4W9J43 GO:GO:0030108 GO:GO:0010815
            GO:GO:0060448 GO:GO:0002764 GO:GO:0033619 GO:GO:0010813
            GO:GO:0043129 EMBL:X16832 EMBL:AF426247 EMBL:AK314698 EMBL:AC011944
            EMBL:BC002479 EMBL:X07549 IPI:IPI00297487 PIR:S12486
            RefSeq:NP_004381.2 UniGene:Hs.148641 PDB:1BZN PDBsum:1BZN
            ProteinModelPortal:P09668 SMR:P09668 IntAct:P09668 STRING:P09668
            PhosphoSite:P09668 DMDM:288558851 PaxDb:P09668 PRIDE:P09668
            DNASU:1512 Ensembl:ENST00000220166 GeneID:1512 KEGG:hsa:1512
            UCSC:uc021srk.1 GeneCards:GC15M079213 H-InvDB:HIX0012481
            HGNC:HGNC:2535 HPA:CAB000458 HPA:HPA003524 MIM:116820
            neXtProt:NX_P09668 PharmGKB:PA27033 InParanoid:P09668
            PhylomeDB:P09668 BRENDA:3.4.22.16 ChEMBL:CHEMBL2225 GenomeRNAi:1512
            NextBio:6261 ArrayExpress:P09668 Bgee:P09668 CleanEx:HS_CTSH
            Genevestigator:P09668 GermOnline:ENSG00000103811 GO:GO:0019882
            Uniprot:P09668
        Length = 335

 Score = 601 (216.6 bits), Expect = 1.5e-58, P = 1.5e-58
 Identities = 130/309 (42%), Positives = 187/309 (60%)

Query:    48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNK 107
             F+SWMSKH KTY   EE  HR + F  N + I+  N    ++ + LN+F+DMS  E K+K
Sbjct:    35 FKSWMSKHRKTYST-EEYHHRLQTFASNWRKINAHNNGNHTFKMALNQFSDMSFAEIKHK 93

Query:   108 YLGLKPQFPTRRQPSAEFSY-RDVKALPKSVDWRKKGA-VTPVKNQGSCGSCWAFSTVAA 165
             YL  +PQ       + + +Y R     P SVDWRKKG  V+PVKNQG+CGSCW FST  A
Sbjct:    94 YLWSEPQ----NCSATKSNYLRGTGPYPPSVDWRKKGNFVSPVKNQGACGSCWTFSTTGA 149

Query:   166 VEGINQIVSGNLTSLSEQELIDCDTSFNN-GCNGGLMDYAFKYIVASGGLHKEEDYPYLM 224
             +E    I +G + SL+EQ+L+DC   FNN GC GGL   AF+YI+ + G+  E+ YPY  
Sbjct:   150 LESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYNKGIMGEDTYPYQG 209

Query:   225 EEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALA-HQPVSVAIEASGTDFQFYSGGVF 283
             ++G C+ +  +  +  +    ++   DE+++++A+A + PVS A E +  DF  Y  G++
Sbjct:   210 KDGYCKFQPGKA-IGFVKDVANITIYDEEAMVEAVALYNPVSFAFEVT-QDFMMYRTGIY 267

Query:   284 TGP-CGA---ELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCG 339
             +   C     +++H V AVGYG+  G  Y IVKNSWGP+WG  GY  ++R  GK   +CG
Sbjct:   268 SSTSCHKTPDKVNHAVLAVGYGEKNGIPYWIVKNSWGPQWGMNGYFLIER--GK--NMCG 323

Query:   340 INKMASIPL 348
             +   AS P+
Sbjct:   324 LAACASYPI 332


>UNIPROTKB|A4IFS7
            symbol:CTSL1 "CTSL1 protein" species:9913 "Bos taurus"
            [GO:0097067 "cellular response to thyroid hormone stimulus"
            evidence=IEA] [GO:0042393 "histone binding" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0004197 "cysteine-type
            endopeptidase activity" evidence=IEA] [GO:0002250 "adaptive immune
            response" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0004197 GO:GO:0002250
            GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 GO:GO:0097067
            OrthoDB:EOG48PMKF MEROPS:C01.032 CTD:1514 EMBL:DAAA02023987
            EMBL:BC134741 IPI:IPI00708619 RefSeq:NP_001077155.1
            UniGene:Bt.23199 SMR:A4IFS7 Ensembl:ENSBTAT00000000962
            GeneID:515200 KEGG:bta:515200 InParanoid:A4IFS7 OMA:NDEQALM
            NextBio:20871707 Uniprot:A4IFS7
        Length = 333

 Score = 598 (215.6 bits), Expect = 3.2e-58, P = 3.2e-58
 Identities = 136/330 (41%), Positives = 191/330 (57%)

Query:    30 IVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT-- 87
             I   +P+   S+D   +L   W + H K Y   EE   +  ++K+N+K I+  N+E +  
Sbjct:    14 IASAAPKFDHSLDTQWKL---WKAAHRKPYDLNEEGWRK-AVWKKNMKMIELHNQEYSQG 69

Query:    88 --SYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAV 145
               S+ + +N F DM++EEF++   G + Q   + +   EF      ++P SVDWR+KG V
Sbjct:    70 KHSFSMAMNAFGDMTNEEFRHTMNGFQRQ---KNKKGKEFHETIFASIPPSVDWREKGYV 126

Query:   146 TPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDT-SFNNGCNGGLMDYA 204
             TPVKNQG CGSCWAFS   A+EG     +G L SLSEQ L+DC     N GC+GG +D A
Sbjct:   127 TPVKNQGKCGSCWAFSATGALEGQMFQKTGKLVSLSEQNLVDCSQPEGNRGCHGGFIDNA 186

Query:   205 FKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQ-P 263
             F+Y++  GGL  EE YPY    GTC            +G+ D+P+  E++L+KA+A+  P
Sbjct:   187 FQYVLDVGGLDSEESYPYTGLVGTCLYNPNN-SAANETGFVDLPKQ-EKALMKAVANLGP 244

Query:   264 VSVAIEASGTDFQFYSGGVFTGP-CGAE-LDHGVAAVGYG-KSKGSD---YIIVKNSWGP 317
             +SVA++A    FQFY  G++  P C +E +DH V  VGYG +   SD   Y +VKNSWG 
Sbjct:   245 ISVAVDAHNPSFQFYKSGIYYEPNCSSESVDHAVLVVGYGFEGADSDDNKYWLVKNSWGE 304

Query:   318 KWGERGYIRMKRNTGKPEGLCGINKMASIP 347
              WG  GYI+M ++       CGI  MAS P
Sbjct:   305 HWGMNGYIKMAKDRNNH---CGIATMASYP 331


>UNIPROTKB|G1M0X4
            symbol:CTSH "Uncharacterized protein" species:9646
            "Ailuropoda melanoleuca" [GO:0001656 "metanephros development"
            evidence=ISS] [GO:0001913 "T cell mediated cytotoxicity"
            evidence=ISS] [GO:0002764 "immune response-regulating signaling
            pathway" evidence=ISS] [GO:0004175 "endopeptidase activity"
            evidence=ISS] [GO:0004177 "aminopeptidase activity" evidence=ISS]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0004252 "serine-type endopeptidase activity" evidence=ISS]
            [GO:0005615 "extracellular space" evidence=ISS] [GO:0005764
            "lysosome" evidence=ISS] [GO:0005829 "cytosol" evidence=ISS]
            [GO:0006508 "proteolysis" evidence=ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=ISS] [GO:0008284 "positive regulation
            of cell proliferation" evidence=ISS] [GO:0010628 "positive
            regulation of gene expression" evidence=ISS] [GO:0010634 "positive
            regulation of epithelial cell migration" evidence=ISS] [GO:0010813
            "neuropeptide catabolic process" evidence=ISS] [GO:0010815
            "bradykinin catabolic process" evidence=ISS] [GO:0010952 "positive
            regulation of peptidase activity" evidence=ISS] [GO:0016505
            "apoptotic protease activator activity" evidence=ISS] [GO:0030108
            "HLA-A specific activating MHC class I receptor activity"
            evidence=ISS] [GO:0030335 "positive regulation of cell migration"
            evidence=ISS] [GO:0031638 "zymogen activation" evidence=ISS]
            [GO:0031648 "protein destabilization" evidence=ISS] [GO:0032526
            "response to retinoic acid" evidence=ISS] [GO:0033619 "membrane
            protein proteolysis" evidence=ISS] [GO:0043066 "negative regulation
            of apoptotic process" evidence=ISS] [GO:0043129 "surfactant
            homeostasis" evidence=ISS] [GO:0045766 "positive regulation of
            angiogenesis" evidence=ISS] [GO:0060448 "dichotomous subdivision of
            terminal units involved in lung branching" evidence=ISS]
            [GO:0070324 "thyroid hormone binding" evidence=ISS] [GO:0070371
            "ERK1 and ERK2 cascade" evidence=ISS] [GO:0097208 "alveolar
            lamellar body" evidence=ISS] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005829 GO:GO:0043066 GO:GO:0005615 GO:GO:0008284
            GO:GO:0070371 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766
            GO:GO:0004177 GO:GO:0004252 GO:GO:0005764 GO:GO:0097208
            GO:GO:0032526 GO:GO:0010628 GO:GO:0070324 GO:GO:0016505
            GO:GO:0001656 GO:GO:0010634 GO:GO:0004197 GO:GO:0031648
            GO:GO:0097067 GO:GO:0031638 GO:GO:0001913
            GeneTree:ENSGT00660000095458 OMA:STSCHKT GO:GO:0030108
            GO:GO:0010815 GO:GO:0060448 GO:GO:0002764 GO:GO:0033619
            GO:GO:0010813 GO:GO:0043129 EMBL:ACTA01057330 EMBL:ACTA01065330
            Ensembl:ENSAMET00000013529 Uniprot:G1M0X4
        Length = 337

 Score = 598 (215.6 bits), Expect = 3.2e-58, P = 3.2e-58
 Identities = 132/309 (42%), Positives = 182/309 (58%)

Query:    48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNK 107
             F+SWM +H K Y   EE  HR   F  N + I+  N    ++ +GLN+F+DMS  E K K
Sbjct:    37 FKSWMVQHQKKYSS-EEYQHRLRTFVGNWRKINAHNAGNHTFKMGLNQFSDMSFAEIKRK 95

Query:   108 YLGLKPQFPTRRQPSAEFSY-RDVKALPKSVDWRKKGA-VTPVKNQGSCGSCWAFSTVAA 165
             YL  +PQ       + + +Y R     P  VDWRKKG  V+PVKNQG CGSCW FST  A
Sbjct:    96 YLWSEPQ----NCSATKGNYLRGTGPYPPFVDWRKKGKFVSPVKNQGGCGSCWTFSTTGA 151

Query:   166 VEGINQIVSGNLTSLSEQELIDCDTSFNN-GCNGGLMDYAFKYIVASGGLHKEEDYPYLM 224
             +E    I +G L SL+EQ+L+DC   FNN GC GGL   AF+YI  + G+  E+ YPY  
Sbjct:   152 LESAIAIKTGKLLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYIRYNRGIMGEDSYPYKG 211

Query:   225 EEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALA-HQPVSVAIEASGTDFQFYSGGVF 283
             ++G C+ +  +  +  +    ++  NDEQ++++A+A   PVS A E +G DF  Y  GV+
Sbjct:   212 QDGDCKFQPSKA-IAFVKDVANITINDEQAMVEAVALFNPVSFAFEVTG-DFMMYRKGVY 269

Query:   284 TGP-CGA---ELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCG 339
             +   C     +++H V AVGYG+  G  Y IVKNSWGP+WG  GY  ++R  GK   +CG
Sbjct:   270 SSTSCHKTPDKVNHAVLAVGYGEQNGVPYWIVKNSWGPQWGMHGYFLIER--GK--NMCG 325

Query:   340 INKMASIPL 348
             +   AS P+
Sbjct:   326 LAACASYPI 334


>UNIPROTKB|G1SQF0
            symbol:CTSH "Uncharacterized protein" species:9986
            "Oryctolagus cuniculus" [GO:0001656 "metanephros development"
            evidence=ISS] [GO:0001913 "T cell mediated cytotoxicity"
            evidence=ISS] [GO:0002764 "immune response-regulating signaling
            pathway" evidence=ISS] [GO:0004175 "endopeptidase activity"
            evidence=ISS] [GO:0004177 "aminopeptidase activity" evidence=ISS]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0004252 "serine-type endopeptidase activity" evidence=ISS]
            [GO:0005615 "extracellular space" evidence=ISS] [GO:0005764
            "lysosome" evidence=ISS] [GO:0005829 "cytosol" evidence=ISS]
            [GO:0006508 "proteolysis" evidence=ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=ISS] [GO:0008284 "positive regulation
            of cell proliferation" evidence=ISS] [GO:0010628 "positive
            regulation of gene expression" evidence=ISS] [GO:0010634 "positive
            regulation of epithelial cell migration" evidence=ISS] [GO:0010813
            "neuropeptide catabolic process" evidence=ISS] [GO:0010815
            "bradykinin catabolic process" evidence=ISS] [GO:0010952 "positive
            regulation of peptidase activity" evidence=ISS] [GO:0016505
            "apoptotic protease activator activity" evidence=ISS] [GO:0030108
            "HLA-A specific activating MHC class I receptor activity"
            evidence=ISS] [GO:0030335 "positive regulation of cell migration"
            evidence=ISS] [GO:0031638 "zymogen activation" evidence=ISS]
            [GO:0031648 "protein destabilization" evidence=ISS] [GO:0032526
            "response to retinoic acid" evidence=ISS] [GO:0033619 "membrane
            protein proteolysis" evidence=ISS] [GO:0043066 "negative regulation
            of apoptotic process" evidence=ISS] [GO:0043129 "surfactant
            homeostasis" evidence=ISS] [GO:0045766 "positive regulation of
            angiogenesis" evidence=ISS] [GO:0060448 "dichotomous subdivision of
            terminal units involved in lung branching" evidence=ISS]
            [GO:0070324 "thyroid hormone binding" evidence=ISS] [GO:0070371
            "ERK1 and ERK2 cascade" evidence=ISS] [GO:0097208 "alveolar
            lamellar body" evidence=ISS] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005829 GO:GO:0043066 GO:GO:0005615 GO:GO:0008284
            GO:GO:0070371 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766
            GO:GO:0004177 GO:GO:0004252 GO:GO:0005764 GO:GO:0097208
            GO:GO:0032526 GO:GO:0010628 GO:GO:0070324 GO:GO:0016505
            GO:GO:0001656 GO:GO:0010634 GO:GO:0004197 GO:GO:0031648
            GO:GO:0097067 GO:GO:0031638 GO:GO:0001913
            GeneTree:ENSGT00660000095458 OMA:STSCHKT GO:GO:0030108
            GO:GO:0010815 GO:GO:0060448 GO:GO:0002764 GO:GO:0033619
            GO:GO:0010813 GO:GO:0043129 RefSeq:XP_002721635.1 UniGene:Ocu.7137
            Ensembl:ENSOCUT00000006138 GeneID:100101597 Uniprot:G1SQF0
        Length = 333

 Score = 598 (215.6 bits), Expect = 3.2e-58, P = 3.2e-58
 Identities = 129/309 (41%), Positives = 185/309 (59%)

Query:    48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNK 107
             F+SWMS+H K Y   EE   R + F  N + I+  N    ++ +GLN+F+DMS  E K+K
Sbjct:    33 FKSWMSQHHKKYSA-EEYPRRLQTFVRNWRKINAHNNGNHTFQMGLNQFSDMSFAEIKHK 91

Query:   108 YLGLKPQFPTRRQPSAEFSY-RDVKALPKSVDWRKKGA-VTPVKNQGSCGSCWAFSTVAA 165
             YL  +PQ       + + +Y R     P SVDWRKKG  V+PVKNQG+CGSCW FST  A
Sbjct:    92 YLWTEPQ----NCSATKSNYLRGTGPYPSSVDWRKKGNFVSPVKNQGACGSCWTFSTTGA 147

Query:   166 VEGINQIVSGNLTSLSEQELIDCDTSFNN-GCNGGLMDYAFKYIVASGGLHKEEDYPYLM 224
             +E    I  G + SL+EQ+L+DC  +FNN GC GGL   AF+YI+ + G+  E+ YPY  
Sbjct:   148 LESAVAIAGGKMLSLAEQQLVDCAQNFNNHGCEGGLPSQAFEYILYNKGIMGEDSYPYRA 207

Query:   225 EEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALA-HQPVSVAIEASGTDFQFYSGGVF 283
              EG C+ + ++  +  +    ++  NDE+++++A+A + PVS A E +  DF  Y  G++
Sbjct:   208 MEGRCKFQPQKA-IAFVKDVANITLNDEEAMVEAVALYNPVSFAFEVT-EDFMQYRKGIY 265

Query:   284 TGP-CGA---ELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCG 339
             +   C     +++H V AVGYG+  G  Y IVKNSWG  WG  GY  ++R  GK   +CG
Sbjct:   266 SSTSCHKTPDKVNHAVLAVGYGEENGVPYWIVKNSWGSHWGMNGYFYIER--GK--NMCG 321

Query:   340 INKMASIPL 348
             +   AS P+
Sbjct:   322 LAACASYPI 330


>RGD|61810
            symbol:Ctsk "cathepsin K" species:10116 "Rattus norvegicus"
            [GO:0001957 "intramembranous ossification" evidence=IEP] [GO:0005615
            "extracellular space" evidence=IDA] [GO:0005737 "cytoplasm"
            evidence=IDA] [GO:0005764 "lysosome" evidence=IDA] [GO:0006508
            "proteolysis" evidence=TAS] [GO:0008234 "cysteine-type peptidase
            activity" evidence=TAS] [GO:0045453 "bone resorption" evidence=IMP]
            InterPro:IPR000668 InterPro:IPR013128 InterPro:IPR015644
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            RGD:61810 GO:GO:0005615 eggNOG:COG4870 HOGENOM:HOG000230774
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GO:GO:0045453 GO:GO:0001957 GeneTree:ENSGT00560000076577
            HOVERGEN:HBG011513 MEROPS:I29.007 CTD:1513 KO:K01371
            OrthoDB:EOG4SJ5FC PANTHER:PTHR12411:SF55 EMBL:AF010306 EMBL:BC078793
            IPI:IPI00206378 RefSeq:NP_113748.1 UniGene:Rn.5598
            ProteinModelPortal:O35186 SMR:O35186 STRING:O35186
            PhosphoSite:O35186 PRIDE:O35186 Ensembl:ENSRNOT00000028730
            GeneID:29175 KEGG:rno:29175 UCSC:RGD:61810 InParanoid:O35186
            OMA:YKEIPEG BindingDB:O35186 ChEMBL:CHEMBL3034 NextBio:608248
            Genevestigator:O35186 GermOnline:ENSRNOG00000021155 Uniprot:O35186
        Length = 329

 Score = 598 (215.6 bits), Expect = 3.2e-58, P = 3.2e-58
 Identities = 131/319 (41%), Positives = 185/319 (57%)

Query:    38 LTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT----SYWLGL 93
             L+  + L   +E W   HGK Y    +++ R  I+++NLK I   N E +    +Y L +
Sbjct:    16 LSPEETLDTQWELWKKTHGKQYNSKVDEISRRLIWEKNLKKISVHNLEASLGAHTYELAM 75

Query:    94 NEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDV--KALPKSVDWRKKGAVTPVKNQ 151
             N   DM+ EE   K  GL+   P  R  S +  Y       +P S+D+RKKG VTPVKNQ
Sbjct:    76 NHLGDMTSEEVVQKMTGLR--VPPSRSFSNDTLYTPEWEGRVPDSIDYRKKGYVTPVKNQ 133

Query:   152 GSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVAS 211
             G CGSCWAFS+  A+EG  +  +G L +LS Q L+DC  S N GC GG M  AF+Y+  +
Sbjct:   134 GQCGSCWAFSSAGALEGQLKKKTGKLLALSPQNLVDC-VSENYGCGGGYMTTAFQYVQQN 192

Query:   212 GGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQ-PVSVAIEA 270
             GG+  E+ YPY+ ++ +C       +     GY+++P  +E++L +A+A   PVSV+I+A
Sbjct:   193 GGIDSEDAYPYVGQDESCM-YNATAKAAKCRGYREIPVGNEKALKRAVARVGPVSVSIDA 251

Query:   271 SGTDFQFYSGGVFTGP-CGAE-LDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMK 328
             S T FQFYS GV+    C  + ++H V  VGYG  KG+ Y I+KNSWG  WG +GY+ + 
Sbjct:   252 SLTSFQFYSRGVYYDENCDRDNVNHAVLVVGYGTQKGNKYWIIKNSWGESWGNKGYVLLA 311

Query:   329 RNTGKPEGLCGINKMASIP 347
             RN       CGI  +AS P
Sbjct:   312 RNKNNA---CGITNLASFP 327


>ZFIN|ZDB-GENE-080215-7
            symbol:zgc:174153 "zgc:174153" species:7955 "Danio
            rerio" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0016787 "hydrolase activity"
            evidence=IEA] [GO:0008233 "peptidase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 ZFIN:ZDB-GENE-080215-7
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00560000076577 EMBL:BX000534 EMBL:BX322603
            IPI:IPI00483644 Ensembl:ENSDART00000113654 OMA:ITLCISA Bgee:F1R8Y0
            Uniprot:F1R8Y0
        Length = 336

 Score = 597 (215.2 bits), Expect = 4.0e-58, P = 4.0e-58
 Identities = 133/322 (41%), Positives = 188/322 (58%)

Query:    40 SMD-KLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT----SYWLGLN 94
             S+D +L + + SW S+HGK+Y   + ++ R  I++ENL+ I+Q N E +    ++ +G+N
Sbjct:    19 SIDIQLDDHWNSWKSQHGKSYH-EDVEVGRRMIWEENLRKIEQHNFEYSYGNHTFKMGMN 77

Query:    95 EFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSC 154
             +F DM++EEF+    G K   P +      F      A P+ VDWR++G VTPVK+Q  C
Sbjct:    78 QFGDMTNEEFRQAMNGYKHD-PNQTSQGPLFMEPSFFAAPQQVDWRQRGYVTPVKDQKQC 136

Query:   155 GSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDT-SFNNGCNGGLMDYAFKYIVASGG 213
             GSCW+FS+  A+EG     +G L S+SEQ L+DC     N GCNGGLMD AF+Y+  + G
Sbjct:   137 GSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQGNQGCNGGLMDQAFQYVKENKG 196

Query:   214 LHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQ-PVSVAIEASG 272
             L  E+ YPYL  +           V  I+G+ D+P  +E +L+ A+A   PVSVAI+AS 
Sbjct:   197 LDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPSGNEPALMNAVAAVGPVSVAIDASH 256

Query:   273 TDFQFYSGGVF-TGPCGAE-LDHGVAAVGYGKS----KGSDYIIVKNSWGPKWGERGYIR 326
                QFY  G++    C +  LDH V  VGYG       G+ Y IVKNSW  KWG++GYI 
Sbjct:   257 QSLQFYQSGIYYERACSSSRLDHAVLVVGYGYQGADVAGNRYWIVKNSWSDKWGDKGYIY 316

Query:   327 MKRNTGKPEGLCGINKMASIPL 348
             M ++       CG+   AS PL
Sbjct:   317 MAKDKNNH---CGVATKASYPL 335


>UNIPROTKB|F1SS93
            symbol:CTSS "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0097067 "cellular response to thyroid hormone stimulus"
            evidence=IEA] [GO:0016020 "membrane" evidence=IEA] [GO:0006508
            "proteolysis" evidence=IEA] [GO:0005764 "lysosome" evidence=IEA]
            [GO:0002250 "adaptive immune response" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0016020 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 GO:GO:0002250
            GeneTree:ENSGT00560000076577 GO:GO:0097067 EMBL:CU463875
            Ensembl:ENSSSCT00000007284 OMA:CEIESAV Uniprot:F1SS93
        Length = 342

 Score = 592 (213.5 bits), Expect = 1.4e-57, P = 1.4e-57
 Identities = 132/309 (42%), Positives = 181/309 (58%)

Query:    48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKE----VTSYWLGLNEFADMSHEE 103
             ++ W   +GK YK   E++ R  I+++NLK +   N E    + SY LG+N   DM+ EE
Sbjct:    39 WDLWKKTYGKQYKEKNEEVARRLIWEKNLKTVMLHNLEHSMGMHSYDLGMNHLGDMTSEE 98

Query:   104 FKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTV 163
               +    ++   P++   +  +     + LP S+DWR+KG VT VK QGSCGSCWAFS V
Sbjct:    99 VISLMSCVR--VPSQWPRNVTYKSNPNQKLPDSMDWREKGCVTEVKYQGSCGSCWAFSAV 156

Query:   164 AAVEGINQIVSGNLTSLSEQELIDCDTSF--NNGCNGGLMDYAFKYIVASGGLHKEEDYP 221
              A+E   ++ +G L SLS Q L+DC T    N GCNGG M  AF+YI+ + G+  E  YP
Sbjct:   157 GALEAQVKMKTGRLVSLSAQNLVDCSTEKYRNKGCNGGFMTEAFQYIIDNNGIDSEASYP 216

Query:   222 YLMEEGTCE-DKKEEMEVVTISGYQDVPENDEQSLLKALAHQ-PVSVAIEASGTDFQFYS 279
             Y   +G C+ D K      T S Y ++P  DE +L +A+A++ PVSVAI+A  + F FY 
Sbjct:   217 YKAVDGKCKYDSKNR--AATCSRYTELPFADEYALKEAVANKGPVSVAIDAKHSSFFFYR 274

Query:   280 GGVFTGP-CGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLC 338
              GV+  P C   ++HGV  VGYG   G DY +VKNSWG  +G+ GYIRM RN+   E  C
Sbjct:   275 SGVYYDPSCTQNVNHGVLVVGYGNLNGKDYWLVKNSWGLNFGDGGYIRMARNS---ENHC 331

Query:   339 GINKMASIP 347
             GI    S P
Sbjct:   332 GIANYPSYP 340


>FB|FBgn0260462
            symbol:CG12163 species:7227 "Drosophila melanogaster"
            [GO:0035071 "salivary gland cell autophagic cell death"
            evidence=IEP] [GO:0048102 "autophagic cell death" evidence=IEP]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0006508 "proteolysis" evidence=ISS] [GO:0004869 "cysteine-type
            endopeptidase inhibitor activity" evidence=IEA] [GO:0045169
            "fusome" evidence=IDA] [GO:0035220 "wing disc development"
            evidence=IGI] [GO:0022416 "chaeta development" evidence=IGI]
            InterPro:IPR000010 InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00043 SMART:SM00645
            InterPro:IPR000169 EMBL:AE014297 GO:GO:0004869 eggNOG:COG4870
            KO:K01373 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0022416 GO:GO:0035220 GO:GO:0035071
            GO:GO:0045169 GeneTree:ENSGT00660000095458 EMBL:AY121614
            EMBL:BT003231 RefSeq:NP_649521.1 RefSeq:NP_730901.1
            RefSeq:NP_730902.2 UniGene:Dm.7315 ProteinModelPortal:Q9VN93
            SMR:Q9VN93 DIP:DIP-17491N IntAct:Q9VN93 MINT:MINT-763966
            STRING:Q9VN93 MEROPS:C01.A27 PaxDb:Q9VN93
            EnsemblMetazoa:FBtr0078823 GeneID:40628 KEGG:dme:Dmel_CG12163
            UCSC:CG12163-RA FlyBase:FBgn0260462 InParanoid:Q9VN93 OMA:GPRWGEQ
            OrthoDB:EOG4CC2G9 PhylomeDB:Q9VN93 GenomeRNAi:40628 NextBio:819744
            Bgee:Q9VN93 GermOnline:CG12163 Uniprot:Q9VN93
        Length = 614

 Score = 591 (213.1 bits), Expect = 1.7e-57, P = 1.7e-57
 Identities = 134/323 (41%), Positives = 189/323 (58%)

Query:    36 EHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRN-KEVTSYWLGLN 94
             +H    DK+  LF  +  + G+ Y    E+  R  IF++NLK I++ N  E+ S   G+ 
Sbjct:   296 KHSHRFDKVDHLFYKFQVRFGRRYVSTAERQMRLRIFRQNLKTIEELNANEMGSAKYGIT 355

Query:    95 EFADMSHEEFKNKYLGLKPQFPTRRQP-SAEFSYRDVKALPKSVDWRKKGAVTPVKNQGS 153
             EFADM+  E+K +  GL  +   +    SA         LPK  DWR+K AVT VKNQGS
Sbjct:   356 EFADMTSSEYKER-TGLWQRDEAKATGGSAAVVPAYHGELPKEFDWRQKDAVTQVKNQGS 414

Query:   154 CGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGG 213
             CGSCWAFS    +EG+  + +G L   SEQEL+DCDT+ ++ CNGGLMD A+K I   GG
Sbjct:   415 CGSCWAFSVTGNIEGLYAVKTGELKEFSEQELLDCDTT-DSACNGGLMDNAYKAIKDIGG 473

Query:   214 LHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLK-ALAHQPVSVAIEASG 272
             L  E +YPY  ++  C   +  +  V ++G+ D+P+ +E ++ +  LA+ P+S+ I A+ 
Sbjct:   474 LEYEAEYPYKAKKNQCHFNRT-LSHVQVAGFVDLPKGNETAMQEWLLANGPISIGINANA 532

Query:   273 TDFQFYSGGV---FTGPCGAE-LDHGVAAVGYGKS------KGSDYIIVKNSWGPKWGER 322
                QFY GGV   +   C  + LDHGV  VGYG S      K   Y IVKNSWGP+WGE+
Sbjct:   533 --MQFYRGGVSHPWKALCSKKNLDHGVLVVGYGVSDYPNFHKTLPYWIVKNSWGPRWGEQ 590

Query:   323 GYIRMKRNTGKPEGLCGINKMAS 345
             GY R+ R     +  CG+++MA+
Sbjct:   591 GYYRVYRG----DNTCGVSEMAT 609


>RGD|1308751
            symbol:RGD1308751 "similar to Cathepsin L precursor (Major
            excreted protein) (MEP)" species:10116 "Rattus norvegicus"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 RGD:1308751 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            EMBL:CH474032 IPI:IPI00365697 RefSeq:XP_001065885.2
            RefSeq:XP_225137.5 MEROPS:C01.069 Ensembl:ENSRNOT00000061391
            GeneID:290981 KEGG:rno:290981 UCSC:RGD:1308751 CTD:290981
            OMA:ESYAYEA OrthoDB:EOG42823G NextBio:631921 Uniprot:D3ZKC3
        Length = 330

 Score = 591 (213.1 bits), Expect = 1.7e-57, P = 1.7e-57
 Identities = 132/327 (40%), Positives = 186/327 (56%)

Query:    30 IVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTS- 88
             ++  +P H  S D +   +E W +KHGKTY   EE   R  +++ N+K I+  N++    
Sbjct:    14 MISAAPTHDPSFDTV---WEEWKTKHGKTYNTNEEGQKR-AVWENNMKMINLHNEDYLKG 69

Query:    89 ---YWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAV 145
                + L +N F D+++ EF+    G +   P       E    D+   PKS+DWR+ G V
Sbjct:    70 KHGFSLEMNAFGDLTNTEFRELMTGFQSMGPKETTIFREPFLGDI---PKSLDWREHGYV 126

Query:   146 TPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNN-GCNGGLMDYA 204
             TPVKNQG CGSCWAFS V ++EG     +G L SLSEQ L+DC  S+ N GCNGGLM++A
Sbjct:   127 TPVKNQGQCGSCWAFSAVGSLEGQIFKKTGKLVSLSEQNLVDCSWSYGNLGCNGGLMEFA 186

Query:   205 FKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQ-P 263
             F+Y+  + GL   E Y Y  ++G C     +     ++G+  VP + E  L+ A+A   P
Sbjct:   187 FQYVKENRGLDTGESYAYEAQDGLCR-YNPKYSAANVTGFVKVPLS-EDDLMSAVASVGP 244

Query:   264 VSVAIEASGTDFQFYSGGVFTGP-CGA-ELDHGVAAVGYGK-SKGSDYIIVKNSWGPKWG 320
             VSV I++    F+FYSGG++  P C + E+DH V  VGYG+ S G  Y +VKNSWG  WG
Sbjct:   245 VSVGIDSHHQSFRFYSGGMYYEPDCSSTEMDHAVLVVGYGEESDGGKYWLVKNSWGEDWG 304

Query:   321 ERGYIRMKRNTGKPEGLCGINKMASIP 347
               GYI+M ++       CGI   A  P
Sbjct:   305 MDGYIKMAKDQNNN---CGIATYAIYP 328


>ZFIN|ZDB-GENE-030131-3539
            symbol:ctsh "cathepsin H" species:7955 "Danio
            rerio" [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0016787 "hydrolase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 ZFIN:ZDB-GENE-030131-3539
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 KO:K01366 HOVERGEN:HBG011513
            CTD:1512 OrthoDB:EOG4W9J43 MEROPS:I29.003 HSSP:P43235 EMBL:BC067615
            IPI:IPI00506892 RefSeq:NP_997853.1 UniGene:Dr.14176
            ProteinModelPortal:Q6NWF2 SMR:Q6NWF2 PRIDE:Q6NWF2 GeneID:324818
            KEGG:dre:324818 InParanoid:Q6NWF2 NextBio:20808976 Bgee:Q6NWF2
            Uniprot:Q6NWF2
        Length = 330

 Score = 591 (213.1 bits), Expect = 1.7e-57, P = 1.7e-57
 Identities = 128/308 (41%), Positives = 181/308 (58%)

Query:    48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNK 107
             F+SWMS++ K Y+ I E   R +IF EN K IDQ N+    + +GLN+F+DM+  EFK  
Sbjct:    30 FKSWMSQYNKKYE-INEFYQRLQIFLENKKRIDQHNEGNHKFSMGLNQFSDMTFAEFKKT 88

Query:   108 YLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGA-VTPVKNQGSCGSCWAFSTVAAV 166
             YL  +PQ  +  + +   S   +   P ++DWR KG  +T VKNQG CGSCW FST   +
Sbjct:    89 YLLTEPQNCSATRGN-HVSSNGL--YPDAIDWRTKGHYITDVKNQGPCGSCWTFSTTGCL 145

Query:   167 EGINQIVSGNLTSLSEQELIDCDTSFNN-GCNGGLMDYAFKYIVASGGLHKEEDYPYLME 225
             E +  I +G L  L+EQ+LIDC   F+N GCNGGL  +AF+YI+ + GL  E+DYPY  +
Sbjct:   146 ESVTAIATGKLLQLAEQQLIDCAGDFDNHGCNGGLPSHAFEYIMYNKGLMTEDDYPYQAK 205

Query:   226 EGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASGTDFQFYSGGVFT 284
              G C  K + +    +    ++ + DE  ++ A+A   PVS A E + +DF  Y  G++T
Sbjct:   206 GGQCRFKPQ-LAAAFVKEVVNITKYDEMGMVDAVARLNPVSFAYEVT-SDFMHYKDGIYT 263

Query:   285 GP-CGAELD---HGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGI 340
                C    D   H V AVGY +  G+ Y IVKNSWG  WG +GY  ++R  GK   +CG+
Sbjct:   264 STECHNTTDMVNHAVLAVGYAEENGTPYWIVKNSWGTNWGIKGYFYIER--GK--NMCGL 319

Query:   341 NKMASIPL 348
                +S P+
Sbjct:   320 AACSSYPI 327


>UNIPROTKB|P25326
            symbol:CTSS "Cathepsin S" species:9913 "Bos taurus"
            [GO:0005515 "protein binding" evidence=IPI] [GO:0005764 "lysosome"
            evidence=IEA] [GO:0097067 "cellular response to thyroid hormone
            stimulus" evidence=IEA] [GO:0016020 "membrane" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0002250 "adaptive
            immune response" evidence=IEA] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0016020 eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GO:GO:0002250 GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513
            GO:GO:0097067 EMBL:BC102245 EMBL:M95211 EMBL:X62001 IPI:IPI00702008
            PIR:S15844 RefSeq:NP_001028787.1 UniGene:Bt.7938
            ProteinModelPortal:P25326 SMR:P25326 STRING:P25326 PRIDE:P25326
            Ensembl:ENSBTAT00000022774 GeneID:327711 KEGG:bta:327711 CTD:1520
            InParanoid:P25326 KO:K01368 OMA:KAMDQKC OrthoDB:EOG4JM7Q2
            NextBio:20810175 Uniprot:P25326
        Length = 331

 Score = 590 (212.7 bits), Expect = 2.2e-57, P = 2.2e-57
 Identities = 130/309 (42%), Positives = 182/309 (58%)

Query:    48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKE----VTSYWLGLNEFADMSHEE 103
             ++ W   +GK YK   E++ R  I+++NLK +   N E    + SY LG+N   DM+ EE
Sbjct:    28 WDLWKKTYGKQYKEKNEEVARRLIWEKNLKTVTLHNLEHSMGMHSYELGMNHLGDMTSEE 87

Query:   104 FKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTV 163
               +    L+   P++   +  +     + LP S+DWR+KG VT VK QG+CGSCWAFS V
Sbjct:    88 VISLMSSLR--VPSQWPRNVTYKSDPNQKLPDSMDWREKGCVTEVKYQGACGSCWAFSAV 145

Query:   164 AAVEGINQIVSGNLTSLSEQELIDCDTSF--NNGCNGGLMDYAFKYIVASGGLHKEEDYP 221
              A+E   ++ +G L SLS Q L+DC T+   N GCNGG M  AF+YI+ + G+  E  YP
Sbjct:   146 GALEAQVKLKTGKLVSLSAQNLVDCSTAKYGNKGCNGGFMTEAFQYIIDNNGIDSEASYP 205

Query:   222 YLMEEGTCE-DKKEEMEVVTISGYQDVPENDEQSLLKALAHQ-PVSVAIEASGTDFQFYS 279
             Y   +G C+ D K      T S Y ++P   E++L +A+A++ PVSV I+AS + F  Y 
Sbjct:   206 YKAMDGKCQYDVKNR--AATCSRYIELPFGSEEALKEAVANKGPVSVGIDASHSSFFLYK 263

Query:   280 GGVFTGP-CGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLC 338
              GV+  P C   ++HGV  VGYG   G DY +VKNSWG  +G++GYIRM RN+G     C
Sbjct:   264 TGVYYDPSCTQNVNHGVLVVGYGNLDGKDYWLVKNSWGLHFGDQGYIRMARNSGNH---C 320

Query:   339 GINKMASIP 347
             GI    S P
Sbjct:   321 GIANYPSYP 329


>UNIPROTKB|F1PAK0
            symbol:CTSS "Cathepsin S" species:9615 "Canis lupus
            familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            EMBL:AAEX03011051 Ensembl:ENSCAFT00000019176 OMA:YEPACTQ
            Uniprot:F1PAK0
        Length = 339

 Score = 590 (212.7 bits), Expect = 2.2e-57, P = 2.2e-57
 Identities = 132/306 (43%), Positives = 177/306 (57%)

Query:    51 WMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKE----VTSYWLGLNEFADMSHEEFKN 106
             W   + K YK   E++ R  I+++NLK +   N E    + SY LG+N   DM+ EE  +
Sbjct:    39 WKKTYSKQYKEENEEVARRLIWEKNLKFVMLHNLEHSMGMHSYDLGMNHLGDMTGEEVIS 98

Query:   107 KYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAV 166
                 L+   P++ Q +  +     + LP SVDWR+KG VT VK QGSCG+CWAFS V A+
Sbjct:    99 LMGSLR--VPSQWQRNVTYRSNSNQKLPDSVDWREKGCVTEVKYQGSCGACWAFSAVGAL 156

Query:   167 EGINQIVSGNLTSLSEQELIDCDTSF--NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLM 224
             E   ++ +G L SLS Q L+DC T    N GCNGG M  AF+YI+ + G+  E  YPY  
Sbjct:   157 EAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNNGIDSEASYPYKA 216

Query:   225 EEGTCE-DKKEEMEVVTISGYQDVPENDEQSLLKALAHQ-PVSVAIEASGTDFQFYSGGV 282
               G C  D K+     T S Y ++P   E +L +A+A++ PVSVAI+AS   F  Y  GV
Sbjct:   217 VNGKCRYDSKKR--AATCSKYTELPFGSEDALKEAVANKGPVSVAIDASHYSFFLYRSGV 274

Query:   283 FTGP-CGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGIN 341
             +  P C   ++HGV  VGYG   G DY +VKNSWG  +G++GYIRM RN+G     CGI 
Sbjct:   275 YYEPSCTQNVNHGVLVVGYGNLNGKDYWLVKNSWGLNFGDQGYIRMARNSGNH---CGIA 331

Query:   342 KMASIP 347
                S P
Sbjct:   332 SYPSYP 337


>UNIPROTKB|F6X9C1
            symbol:CTSH "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00660000095458
            OMA:STSCHKT Ensembl:ENSCAFT00000036196 EMBL:AAEX03002388
            Uniprot:F6X9C1
        Length = 305

 Score = 590 (212.7 bits), Expect = 2.2e-57, P = 2.2e-57
 Identities = 130/309 (42%), Positives = 185/309 (59%)

Query:    48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNK 107
             F+SW  +H K Y   EE L R + F  N + I+  N    ++ +GLN+F+DM+  E K+K
Sbjct:     5 FKSWAVQHQKKYSS-EEYLQRLQTFVGNWRKINAHNAGNHTFKMGLNQFSDMNFAEIKHK 63

Query:   108 YLGLKPQFPTRRQPSAEFSY-RDVKALPKSVDWRKKGA-VTPVKNQGSCGSCWAFSTVAA 165
             YL  +PQ       + + +Y R     P  VDWRKKG  V+PVKNQGSCGSCW FST  A
Sbjct:    64 YLWSEPQ----NCSATKGNYLRGTGPYPPFVDWRKKGKFVSPVKNQGSCGSCWTFSTTGA 119

Query:   166 VEGINQIVSGNLTSLSEQELIDCDTSFNN-GCNGGLMDYAFKYIVASGGLHKEEDYPYLM 224
             +E    I SG L SL+EQ+L+DC  +FNN GC GG    AF+YI  + G+  E+ YPY  
Sbjct:   120 LESAIAIKSGKLLSLAEQQLVDCAQNFNNHGCQGGAPLQAFEYIRYNKGIMGEDSYPYKG 179

Query:   225 EEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALA-HQPVSVAIEASGTDFQFYSGGVF 283
             ++G C+ +  +  +  +    ++  NDEQ++++A+A + PVS A E + +DF  Y  G++
Sbjct:   180 QDGDCKYQPSKA-IAFVKDVANITINDEQAMVEAVALYNPVSFAFEVT-SDFMMYRKGIY 237

Query:   284 TGP-CGA---ELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCG 339
             +   C     +++H V AVGYG+  G  Y IVKNSWGP+WG  GY  M+R  GK   +CG
Sbjct:   238 SSTSCHKTPDKVNHAVLAVGYGEQNGIPYWIVKNSWGPQWGMNGYFLMER--GK--NMCG 293

Query:   340 INKMASIPL 348
             +   AS P+
Sbjct:   294 LAACASYPI 302


>UNIPROTKB|Q8HY81
            symbol:CTSS "Cathepsin S" species:9615 "Canis lupus
            familiaris" [GO:0005764 "lysosome" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0006508
            "proteolysis" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 HOVERGEN:HBG011513
            CTD:1520 KO:K01368 OrthoDB:EOG4JM7Q2 EMBL:AY156692
            RefSeq:NP_001002938.2 UniGene:Cfa.1661 ProteinModelPortal:Q8HY81
            SMR:Q8HY81 STRING:Q8HY81 MEROPS:C01.034 GeneID:403400
            KEGG:cfa:403400 InParanoid:Q8HY81 NextBio:20816922 Uniprot:Q8HY81
        Length = 331

 Score = 590 (212.7 bits), Expect = 2.2e-57, P = 2.2e-57
 Identities = 132/306 (43%), Positives = 177/306 (57%)

Query:    51 WMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKE----VTSYWLGLNEFADMSHEEFKN 106
             W   + K YK   E++ R  I+++NLK +   N E    + SY LG+N   DM+ EE  +
Sbjct:    31 WKKTYSKQYKEENEEVARRLIWEKNLKFVMLHNLEHSMGMHSYDLGMNHLGDMTGEEVIS 90

Query:   107 KYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAV 166
                 L+   P++ Q +  +     + LP SVDWR+KG VT VK QGSCG+CWAFS V A+
Sbjct:    91 LMGSLR--VPSQWQRNVTYRSNSNQKLPDSVDWREKGCVTEVKYQGSCGACWAFSAVGAL 148

Query:   167 EGINQIVSGNLTSLSEQELIDCDTSF--NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLM 224
             E   ++ +G L SLS Q L+DC T    N GCNGG M  AF+YI+ + G+  E  YPY  
Sbjct:   149 EAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNNGIDSEASYPYKA 208

Query:   225 EEGTCE-DKKEEMEVVTISGYQDVPENDEQSLLKALAHQ-PVSVAIEASGTDFQFYSGGV 282
               G C  D K+     T S Y ++P   E +L +A+A++ PVSVAI+AS   F  Y  GV
Sbjct:   209 MNGKCRYDSKKR--AATCSKYTELPFGSEDALKEAVANKGPVSVAIDASHYSFFLYRSGV 266

Query:   283 FTGP-CGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGIN 341
             +  P C   ++HGV  VGYG   G DY +VKNSWG  +G++GYIRM RN+G     CGI 
Sbjct:   267 YYEPSCTQNVNHGVLVVGYGNLNGKDYWLVKNSWGLNFGDQGYIRMARNSGNH---CGIA 323

Query:   342 KMASIP 347
                S P
Sbjct:   324 SYPSYP 329


>UNIPROTKB|F7BJD8
            symbol:CTSH "Uncharacterized protein" species:9796 "Equus
            caballus" [GO:0001656 "metanephros development" evidence=ISS]
            [GO:0001913 "T cell mediated cytotoxicity" evidence=ISS]
            [GO:0002764 "immune response-regulating signaling pathway"
            evidence=ISS] [GO:0004175 "endopeptidase activity" evidence=ISS]
            [GO:0004177 "aminopeptidase activity" evidence=ISS] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=ISS] [GO:0004252
            "serine-type endopeptidase activity" evidence=ISS] [GO:0005615
            "extracellular space" evidence=ISS] [GO:0005764 "lysosome"
            evidence=ISS] [GO:0005829 "cytosol" evidence=ISS] [GO:0006508
            "proteolysis" evidence=ISS] [GO:0008234 "cysteine-type peptidase
            activity" evidence=ISS] [GO:0008284 "positive regulation of cell
            proliferation" evidence=ISS] [GO:0010628 "positive regulation of
            gene expression" evidence=ISS] [GO:0010634 "positive regulation of
            epithelial cell migration" evidence=ISS] [GO:0010813 "neuropeptide
            catabolic process" evidence=ISS] [GO:0010815 "bradykinin catabolic
            process" evidence=ISS] [GO:0010952 "positive regulation of
            peptidase activity" evidence=ISS] [GO:0016505 "apoptotic protease
            activator activity" evidence=ISS] [GO:0030108 "HLA-A specific
            activating MHC class I receptor activity" evidence=ISS] [GO:0030335
            "positive regulation of cell migration" evidence=ISS] [GO:0031638
            "zymogen activation" evidence=ISS] [GO:0031648 "protein
            destabilization" evidence=ISS] [GO:0032526 "response to retinoic
            acid" evidence=ISS] [GO:0033619 "membrane protein proteolysis"
            evidence=ISS] [GO:0043066 "negative regulation of apoptotic
            process" evidence=ISS] [GO:0043129 "surfactant homeostasis"
            evidence=ISS] [GO:0045766 "positive regulation of angiogenesis"
            evidence=ISS] [GO:0060448 "dichotomous subdivision of terminal
            units involved in lung branching" evidence=ISS] [GO:0070324
            "thyroid hormone binding" evidence=ISS] [GO:0070371 "ERK1 and ERK2
            cascade" evidence=ISS] [GO:0097208 "alveolar lamellar body"
            evidence=ISS] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 GO:GO:0005829
            GO:GO:0043066 GO:GO:0005615 GO:GO:0008284 GO:GO:0070371
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766 GO:GO:0004177
            GO:GO:0004252 GO:GO:0005764 GO:GO:0097208 GO:GO:0032526
            GO:GO:0010628 GO:GO:0070324 GO:GO:0016505 GO:GO:0001656
            GO:GO:0010634 GO:GO:0004197 GO:GO:0031648 GO:GO:0097067
            GO:GO:0031638 GO:GO:0001913 GeneTree:ENSGT00660000095458
            OMA:STSCHKT GO:GO:0030108 GO:GO:0010815 GO:GO:0060448 GO:GO:0002764
            GO:GO:0033619 GO:GO:0010813 GO:GO:0043129
            Ensembl:ENSECAT00000013967 Uniprot:F7BJD8
        Length = 305

 Score = 590 (212.7 bits), Expect = 2.2e-57, P = 2.2e-57
 Identities = 129/309 (41%), Positives = 184/309 (59%)

Query:    48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNK 107
             F+SWM +H K Y   EE  HR + F  N + I+  N    ++ +GLN+F+ M+  E K+K
Sbjct:     5 FKSWMVQHQKKYSS-EEYHHRLQTFVSNWRKINAHNTGNHTFRMGLNQFSAMNFAELKHK 63

Query:   108 YLGLKPQFPTRRQPSAEFSY-RDVKALPKSVDWRKKGA-VTPVKNQGSCGSCWAFSTVAA 165
             YL  +PQ       + + +Y R     P SVDWRKKG  V+PVKNQG CGSCW FST  A
Sbjct:    64 YLWSEPQ----NCSATKGNYLRGAGPYPPSVDWRKKGNFVSPVKNQGGCGSCWTFSTTGA 119

Query:   166 VEGINQIVSGNLTSLSEQELIDCDTSFNN-GCNGGLMDYAFKYIVASGGLHKEEDYPYLM 224
             +E    I SG L SL+EQ+L+DC  +FNN GC GGL   AF+YI  + G+  E+ YPY  
Sbjct:   120 LESAVAIASGKLLSLAEQQLVDCAQNFNNHGCQGGLPSQAFEYIRYNKGIMGEDTYPYKG 179

Query:   225 EEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALA-HQPVSVAIEASGTDFQFYSGGVF 283
             ++G C+ +  +  +  +    ++  NDE+++++A+A + PVS A E +  DF  Y  G++
Sbjct:   180 QDGDCKFQPNKA-IAFVKDVANITLNDEKAMVEAVALYNPVSFAFEVT-EDFMMYRKGIY 237

Query:   284 TGP-CGA---ELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCG 339
             +   C     +++H V AVGYG+  G  Y IVKNSWGP WG  GY  ++R  GK   +CG
Sbjct:   238 SSTSCHKTPDKVNHAVLAVGYGEENGIPYWIVKNSWGPHWGMNGYFLIER--GK--NMCG 293

Query:   340 INKMASIPL 348
             +   AS P+
Sbjct:   294 LAACASYPI 302


>ZFIN|ZDB-GENE-980526-285
            symbol:ctsl1b "cathepsin L, 1 b" species:7955
            "Danio rerio" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0005576
            "extracellular region" evidence=IDA] [GO:0008233 "peptidase
            activity" evidence=IEA] [GO:0016787 "hydrolase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            ZFIN:ZDB-GENE-980526-285 GO:GO:0005576 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            EMBL:BX465840 IPI:IPI00498443 Ensembl:ENSDART00000145570
            Bgee:F1R7B3 Uniprot:F1R7B3
        Length = 352

 Score = 590 (212.7 bits), Expect = 2.2e-57, P = 2.2e-57
 Identities = 132/322 (40%), Positives = 187/322 (58%)

Query:    40 SMD-KLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT----SYWLGLN 94
             S+D +L + + SW S+HGK+Y   + ++ R  I++ENL+ I+Q N E +    ++ +G+N
Sbjct:    35 SIDIQLDDHWNSWKSQHGKSYH-EDVEVGRRMIWEENLRKIEQHNFEYSYGNHTFKMGMN 93

Query:    95 EFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSC 154
             +F DM++EEF+    G     P +      F      A P+ VDWR++G VTPVK+Q  C
Sbjct:    94 QFGDMTNEEFRQAMNGYTHD-PNQTSQGPLFMEPSFFAAPQQVDWRQRGYVTPVKDQKQC 152

Query:   155 GSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDT-SFNNGCNGGLMDYAFKYIVASGG 213
             GSCW+FS+  A+EG     +G L S+SEQ L+DC     N GCNGGLMD AF+Y+  + G
Sbjct:   153 GSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQGNQGCNGGLMDQAFQYVKENKG 212

Query:   214 LHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQ-PVSVAIEASG 272
             L  E+ YPYL  +           V  I+G+ D+P  +E +L+ A+A   PVSVAI+AS 
Sbjct:   213 LDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPSGNELALMNAVAAVGPVSVAIDASH 272

Query:   273 TDFQFYSGGVF-TGPCGAE-LDHGVAAVGYGKS----KGSDYIIVKNSWGPKWGERGYIR 326
                QFY  G++    C +  LDH V  VGYG       G+ Y IVKNSW  KWG++GYI 
Sbjct:   273 QSLQFYQSGIYYERACSSSRLDHAVLVVGYGYQGADVAGNRYWIVKNSWSDKWGDKGYIY 332

Query:   327 MKRNTGKPEGLCGINKMASIPL 348
             M ++       CG+   AS PL
Sbjct:   333 MAKDKNNH---CGVATKASYPL 351


>DICTYBASE|DDB_G0272298
            symbol:DDB_G0272298 species:44689 "Dictyostelium
            discoideum" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 dictyBase:DDB_G0272298 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246
            SMART:SM00848 EMBL:AAFI02000008 KO:K01365 RefSeq:XP_645281.1
            ProteinModelPortal:Q559Q3 MEROPS:C01.A53 EnsemblProtists:DDB0203746
            GeneID:8618447 KEGG:ddi:DDB_G0272298 InParanoid:Q559Q3 OMA:PANINWR
            Uniprot:Q559Q3
        Length = 305

 Score = 587 (211.7 bits), Expect = 4.6e-57, P = 4.6e-57
 Identities = 127/307 (41%), Positives = 182/307 (59%)

Query:    52 MSKHGKTYKCIEEKLHRFEIFKENLKHI-DQRNKEVTSYWLGLNEFADMSHEEFKNKYLG 110
             M K+ K YK  +E L RF+IF++N   I + RNK   +  + LNE++D++ +EF +K+  
Sbjct:     1 MVKYNKHYKNNKEYLKRFDIFQDNYNFILNHRNKNGENIEMDLNEYSDLTQKEFADKFFE 60

Query:   111 -LKPQ---FPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAV 166
              L P+    P     +  F +     +PKS DWR  GAV  VKNQGSC SCW+FS + A+
Sbjct:    61 KLVPEPRSGPINDIKATPFKHNVNATIPKSFDWRDHGAVGKVKNQGSCASCWSFSALGAL 120

Query:   167 EGINQIVSGNLTSLSEQELIDCDTSFN-NGCNGGLMDYAFKYIVASGGLHKEEDYPYLME 225
             EG   I  G L  LSEQ L+DC T F   GC  G M  AFKYI++SGG++ E  YPY  +
Sbjct:   121 EGHYYIKYGELLDLSEQNLVDCATPFGPKGCKTGWMHDAFKYIISSGGVNLESQYPYTGK 180

Query:   226 EGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALA-HQPVSVAIEASGTDFQFYSGGVF- 283
             +  C+  + E E   +SG+  +P+ DE +L++A+A + PV+V I+ S  +FQ  SGG++ 
Sbjct:   181 DEVCKFNQSEKEA-KVSGFVMIPKFDESALMEAIALYGPVAVPIDTSTKEFQHLSGGIYY 239

Query:   284 TGPCGA-ELDHGVAAVGYGKSK-GSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGIN 341
             +  C      H V A+GYG  + G DY ++KNSWG  WG  G+ ++KR     +G CGI 
Sbjct:   240 SDSCDPWNTIHAVLAIGYGTDENGVDYFLMKNSWGKSWGTNGFFKVKRGV---KGKCGIV 296

Query:   342 KMASIPL 348
               AS P+
Sbjct:   297 TAASYPI 303


>MGI|MGI:107823
            symbol:Ctsk "cathepsin K" species:10090 "Mus musculus"
            [GO:0005615 "extracellular space" evidence=ISO] [GO:0005737
            "cytoplasm" evidence=ISO] [GO:0005764 "lysosome" evidence=ISO]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008233 "peptidase
            activity" evidence=IEA] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0016787 "hydrolase activity"
            evidence=IEA] [GO:0045453 "bone resorption" evidence=ISO]
            InterPro:IPR000668 InterPro:IPR013128 InterPro:IPR015644
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            MGI:MGI:107823 GO:GO:0005615 eggNOG:COG4870 HOGENOM:HOG000230774
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 GO:GO:0045453
            GO:GO:0001957 HOVERGEN:HBG011513 MEROPS:I29.007 CTD:1513 KO:K01371
            OrthoDB:EOG4SJ5FC PANTHER:PTHR12411:SF55 OMA:LKVPPSH EMBL:X94444
            EMBL:AJ006033 EMBL:BC046320 IPI:IPI00316575 PIR:S74227
            RefSeq:NP_031828.2 UniGene:Mm.272085 ProteinModelPortal:P55097
            SMR:P55097 MINT:MINT-3089515 STRING:P55097 PhosphoSite:P55097
            PRIDE:P55097 Ensembl:ENSMUST00000015664 GeneID:13038 KEGG:mmu:13038
            InParanoid:P55097 BioCyc:MetaCyc:MONOMER-14811 ChEMBL:CHEMBL1075277
            NextBio:282924 Bgee:P55097 CleanEx:MM_CTSK Genevestigator:P55097
            GermOnline:ENSMUSG00000028111 Uniprot:P55097
        Length = 329

 Score = 587 (211.7 bits), Expect = 4.6e-57, P = 4.6e-57
 Identities = 129/318 (40%), Positives = 184/318 (57%)

Query:    40 SMDKLIEL-FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKE----VTSYWLGLN 94
             S +++++  +E W   H K Y    +++ R  I+++NLK I   N E    V +Y L +N
Sbjct:    17 SPEEMLDTQWELWKKTHQKQYNSKVDEISRRLIWEKNLKQISAHNLEASLGVHTYELAMN 76

Query:    95 EFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDV--KALPKSVDWRKKGAVTPVKNQG 152
                DM+ EE   K  GL+   P  R  S +  Y       +P S+D+RKKG VTPVKNQG
Sbjct:    77 HLGDMTSEEVVQKMTGLR--IPPSRSYSNDTLYTPEWEGRVPDSIDYRKKGYVTPVKNQG 134

Query:   153 SCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASG 212
              CGSCWAFS+  A+EG  +  +G L +LS Q L+DC T  N GC GG M  AF+Y+  +G
Sbjct:   135 QCGSCWAFSSAGALEGQLKKKTGKLLALSPQNLVDCVTE-NYGCGGGYMTTAFQYVQQNG 193

Query:   213 GLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQ-PVSVAIEAS 271
             G+  E+ YPY+ ++ +C       +     GY+++P  +E++L +A+A   P+SV+I+AS
Sbjct:   194 GIDSEDAYPYVGQDESCM-YNATAKAAKCRGYREIPVGNEKALKRAVARVGPISVSIDAS 252

Query:   272 GTDFQFYSGGVFTGP-CGAE-LDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKR 329
                FQFYS GV+    C  + ++H V  VGYG  KGS + I+KNSWG  WG +GY  + R
Sbjct:   253 LASFQFYSRGVYYDENCDRDNVNHAVLVVGYGTQKGSKHWIIKNSWGESWGNKGYALLAR 312

Query:   330 NTGKPEGLCGINKMASIP 347
             N       CGI  MAS P
Sbjct:   313 NKNNA---CGITNMASFP 327


>TAIR|locus:2120222
            symbol:RD19 "RESPONSIVE TO DEHYDRATION 19" species:3702
            "Arabidopsis thaliana" [GO:0005576 "extracellular region"
            evidence=ISM] [GO:0006508 "proteolysis" evidence=IEA;ISS]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA;ISS]
            [GO:0009269 "response to desiccation" evidence=IEP] [GO:0006970
            "response to osmotic stress" evidence=IGI] [GO:0005634 "nucleus"
            evidence=IDA] [GO:0005773 "vacuole" evidence=IDA] [GO:0042742
            "defense response to bacterium" evidence=IMP] [GO:0006096
            "glycolysis" evidence=RCA] [GO:0006833 "water transport"
            evidence=RCA] [GO:0006972 "hyperosmotic response" evidence=RCA]
            [GO:0007030 "Golgi organization" evidence=RCA] [GO:0009266
            "response to temperature stimulus" evidence=RCA] [GO:0009651
            "response to salt stress" evidence=IEP;RCA] [GO:0046686 "response
            to cadmium ion" evidence=RCA] [GO:0004197 "cysteine-type
            endopeptidase activity" evidence=ISS] [GO:0009414 "response to
            water deprivation" evidence=TAS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0005634 GO:GO:0005773 EMBL:CP002687
            GenomeReviews:CT486007_GR GO:GO:0009651 GO:GO:0042742
            eggNOG:COG4870 HOGENOM:HOG000230774 KO:K01373
            ProtClustDB:CLSN2688311 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 EMBL:AL035679 EMBL:AL161594 GO:GO:0004197
            MEROPS:C01.022 EMBL:D13042 EMBL:AY080598 EMBL:AY133844
            IPI:IPI00544363 PIR:JN0718 RefSeq:NP_568052.1 UniGene:At.2850
            UniGene:At.74924 ProteinModelPortal:P43296 SMR:P43296 STRING:P43296
            PaxDb:P43296 PRIDE:P43296 EnsemblPlants:AT4G39090.1 GeneID:830064
            KEGG:ath:AT4G39090 TAIR:At4g39090 InParanoid:P43296 OMA:EDFDWRD
            PhylomeDB:P43296 Genevestigator:P43296 GermOnline:AT4G39090
            Uniprot:P43296
        Length = 368

 Score = 587 (211.7 bits), Expect = 4.6e-57, P = 4.6e-57
 Identities = 135/332 (40%), Positives = 194/332 (58%)

Query:    32 GYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWL 91
             G  P+ LTS D    LF+    K GK Y   EE  +RF +FK NL+   +  K   S   
Sbjct:    39 GAEPQVLTSEDHF-SLFKR---KFGKVYASNEEHDYRFSVFKANLRRARRHQKLDPSATH 94

Query:    92 GLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQ 151
             G+ +F+D++  EF+ K+LG++  F   +  + +      + LP+  DWR  GAVTPVKNQ
Sbjct:    95 GVTQFSDLTRSEFRKKHLGVRSGFKLPKDAN-KAPILPTENLPEDFDWRDHGAVTPVKNQ 153

Query:   152 GSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCD--------TSFNNGCNGGLMDY 203
             GSCGSCW+FS   A+EG N + +G L SLSEQ+L+DCD         S ++GCNGGLM+ 
Sbjct:   154 GSCGSCWSFSATGALEGANFLATGKLVSLSEQQLVDCDHECDPEEADSCDSGCNGGLMNS 213

Query:   204 AFKYIVASGGLHKEEDYPYLMEEG-TCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQ 262
             AF+Y + +GGL KEEDYPY  ++G TC+  K ++ V ++S +  +  ++EQ     + + 
Sbjct:   214 AFEYTLKTGGLMKEEDYPYTGKDGKTCKLDKSKI-VASVSNFSVISIDEEQIAANLVKNG 272

Query:   263 PVSVAIEASGTDFQFYSGGVFTGP--CGAELDHGVAAVGYGKS-------KGSDYIIVKN 313
             P++VAI A     Q Y GGV + P  C   L+HGV  VGYG +       K   Y I+KN
Sbjct:   273 PLAVAINAGY--MQTYIGGV-SCPYICTRRLNHGVLLVGYGAAGYAPARFKEKPYWIIKN 329

Query:   314 SWGPKWGERGYIRMKRNTGKPEGLCGINKMAS 345
             SWG  WGE G+ ++ +  G+   +CG++ M S
Sbjct:   330 SWGETWGENGFYKICK--GR--NICGVDSMVS 357


>ZFIN|ZDB-GENE-041010-76
            symbol:ctsll "cathepsin L, like" species:7955 "Danio
            rerio" [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0016787 "hydrolase activity"
            evidence=IEA] [GO:0008233 "peptidase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 ZFIN:ZDB-GENE-041010-76
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513
            OrthoDB:EOG48PMKF MEROPS:C01.032 EMBL:BX119902 IPI:IPI00616622
            UniGene:Dr.79994 SMR:A2BEM8 Ensembl:ENSDART00000144226
            InParanoid:A2BEM8 OMA:PRYSAAN Uniprot:A2BEM8
        Length = 337

 Score = 586 (211.3 bits), Expect = 5.9e-57, P = 5.9e-57
 Identities = 132/319 (41%), Positives = 182/319 (57%)

Query:    43 KLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT----SYWLGLNEFAD 98
             KL + +  W   H K+Y   EE   R  ++++NLK I+  N E +    ++ LG+N+F D
Sbjct:    24 KLDDHWHLWKRWHEKSYHEKEEGWRRM-VWEKNLKKIELHNLEHSVGKHTFRLGMNQFGD 82

Query:    99 MSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCW 158
             M++EEF+    G     P R+   + F        P+ +DWR+KG VTP+K+Q  CGSCW
Sbjct:    83 MTNEEFRQAMNGYNRD-PNRKSKGSLFIEPSFFTAPQQIDWRQKGYVTPIKDQKRCGSCW 141

Query:   159 AFSTVAAVEGINQIVSGNLTSLSEQELIDCDT-SFNNGCNGGLMDYAFKYIVASGGLHKE 217
             AFS+  A+EG     +G L SLSEQ L+DC     NNGC+GGLMD AF+Y+  + GL  E
Sbjct:   142 AFSSTGALEGQVFRKTGKLVSLSEQNLMDCSRPQGNNGCDGGLMDQAFQYVQDNNGLDSE 201

Query:   218 EDYPYLM-EEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQ-PVSVAIEASGTDF 275
             E YPYL  ++  C           ++G+ D+P   E +L+KA+A   PV+VAI+A    F
Sbjct:   202 ESYPYLATDDQPCH-YDPRYSAANVTGFVDIPSGKEHALMKAVAAVGPVAVAIDAGHESF 260

Query:   276 QFYSGGVF-TGPCGAE-LDHGVAAVGYGKS----KGSDYIIVKNSWGPKWGERGYIRMKR 329
             QFY  G++    C  E LDHGV  VGYG       G  Y IVKNSW  +WG++GYI M +
Sbjct:   261 QFYQSGIYYEKACSTEELDHGVLVVGYGYEGVDVAGRRYWIVKNSWTDRWGDKGYIYMAK 320

Query:   330 NTGKPEGLCGINKMASIPL 348
             +    +  CGI   AS PL
Sbjct:   321 DL---KNHCGIATSASYPL 336


>UNIPROTKB|F7B939
            symbol:CTSH "Uncharacterized protein" species:9483
            "Callithrix jacchus" [GO:0001656 "metanephros development"
            evidence=ISS] [GO:0001913 "T cell mediated cytotoxicity"
            evidence=ISS] [GO:0002764 "immune response-regulating signaling
            pathway" evidence=ISS] [GO:0004175 "endopeptidase activity"
            evidence=ISS] [GO:0004177 "aminopeptidase activity" evidence=ISS]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0004252 "serine-type endopeptidase activity" evidence=ISS]
            [GO:0005615 "extracellular space" evidence=ISS] [GO:0005764
            "lysosome" evidence=ISS] [GO:0005829 "cytosol" evidence=ISS]
            [GO:0006508 "proteolysis" evidence=ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=ISS] [GO:0008284 "positive regulation
            of cell proliferation" evidence=ISS] [GO:0010628 "positive
            regulation of gene expression" evidence=ISS] [GO:0010634 "positive
            regulation of epithelial cell migration" evidence=ISS] [GO:0010813
            "neuropeptide catabolic process" evidence=ISS] [GO:0010815
            "bradykinin catabolic process" evidence=ISS] [GO:0010952 "positive
            regulation of peptidase activity" evidence=ISS] [GO:0016505
            "apoptotic protease activator activity" evidence=ISS] [GO:0030108
            "HLA-A specific activating MHC class I receptor activity"
            evidence=ISS] [GO:0030335 "positive regulation of cell migration"
            evidence=ISS] [GO:0031638 "zymogen activation" evidence=ISS]
            [GO:0031648 "protein destabilization" evidence=ISS] [GO:0032526
            "response to retinoic acid" evidence=ISS] [GO:0033619 "membrane
            protein proteolysis" evidence=ISS] [GO:0043066 "negative regulation
            of apoptotic process" evidence=ISS] [GO:0043129 "surfactant
            homeostasis" evidence=ISS] [GO:0045766 "positive regulation of
            angiogenesis" evidence=ISS] [GO:0060448 "dichotomous subdivision of
            terminal units involved in lung branching" evidence=ISS]
            [GO:0070324 "thyroid hormone binding" evidence=ISS] [GO:0070371
            "ERK1 and ERK2 cascade" evidence=ISS] [GO:0097208 "alveolar
            lamellar body" evidence=ISS] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005829 GO:GO:0043066 GO:GO:0005615 GO:GO:0008284
            GO:GO:0070371 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766
            GO:GO:0004177 GO:GO:0004252 GO:GO:0005764 GO:GO:0097208
            GO:GO:0032526 GO:GO:0010628 GO:GO:0070324 GO:GO:0016505
            GO:GO:0001656 GO:GO:0010634 GO:GO:0004197 GO:GO:0031648
            GO:GO:0031638 GO:GO:0001913 GeneTree:ENSGT00660000095458 CTD:1512
            GO:GO:0030108 GO:GO:0010815 GO:GO:0060448 GO:GO:0002764
            GO:GO:0033619 GO:GO:0010813 GO:GO:0043129 EMBL:ACFV01158341
            EMBL:ACFV01158342 EMBL:ACFV01158343 RefSeq:XP_002753411.1
            Ensembl:ENSCJAT00000004397 GeneID:100413104 Uniprot:F7B939
        Length = 336

 Score = 585 (211.0 bits), Expect = 7.5e-57, P = 7.5e-57
 Identities = 126/309 (40%), Positives = 183/309 (59%)

Query:    48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNK 107
             F+SWM+KH KTY   EE   R + F  N + I+  N    ++ + +N+F+DMS  E K K
Sbjct:    35 FKSWMAKHHKTYSREEEYHQRLQTFASNWRKINAHNNGNHTFKMAVNQFSDMSFAEIKRK 94

Query:   108 YLGLKPQFPTRRQPSAEFSY-RDVKALPKSVDWRKKGA-VTPVKNQGSCGSCWAFSTVAA 165
             YL  +PQ       + + +Y R     P SVDWRKKG  V+PVKNQG+CGSCW FST  A
Sbjct:    95 YLWSEPQ----NCSATKSNYLRGTGPYPPSVDWRKKGHFVSPVKNQGACGSCWTFSTTGA 150

Query:   166 VEGINQIVSGNLTSLSEQELIDCDTSFNN-GCNGGLMDYAFKYIVASGGLHKEEDYPYLM 224
             +E    I +G + SL+EQ+L+DC   FNN GC GGL   AF+YI+ + G+  E+ YPY  
Sbjct:   151 LESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYNNGIMGEDTYPYQG 210

Query:   225 EEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALA-HQPVSVAIEASGTDFQFYSGGVF 283
             ++  C+ +  +  +  +    ++   DE ++++A+A + PVS A E +  DF  Y  G++
Sbjct:   211 KDSDCKFQPGKA-IGFVKDVANITIYDEDAMVEAVALYNPVSFAFEVT-QDFMMYKRGIY 268

Query:   284 TGP-CGA---ELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCG 339
             +   C     +++H V AVGYG+  G  Y IVKNSWGP+WG  GY  ++R  GK   +CG
Sbjct:   269 SSTSCHKTPDKVNHAVLAVGYGEENGIPYWIVKNSWGPQWGMNGYFLIER--GK--NMCG 324

Query:   340 INKMASIPL 348
             +   AS P+
Sbjct:   325 LAACASYPV 333


>UNIPROTKB|F7BRD4
            symbol:CTSH "Uncharacterized protein" species:9483
            "Callithrix jacchus" [GO:0001656 "metanephros development"
            evidence=ISS] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0001656
            GeneTree:ENSGT00660000095458 EMBL:ACFV01158341 EMBL:ACFV01158342
            EMBL:ACFV01158343 Ensembl:ENSCJAT00000004396 Uniprot:F7BRD4
        Length = 336

 Score = 585 (211.0 bits), Expect = 7.5e-57, P = 7.5e-57
 Identities = 126/309 (40%), Positives = 183/309 (59%)

Query:    48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNK 107
             F+SWM+KH KTY   EE   R + F  N + I+  N    ++ + +N+F+DMS  E K K
Sbjct:    35 FKSWMAKHHKTYSREEEYHQRLQTFASNWRKINAHNNGNHTFKMAVNQFSDMSFAEIKRK 94

Query:   108 YLGLKPQFPTRRQPSAEFSY-RDVKALPKSVDWRKKGA-VTPVKNQGSCGSCWAFSTVAA 165
             YL  +PQ       + + +Y R     P SVDWRKKG  V+PVKNQG+CGSCW FST  A
Sbjct:    95 YLWSEPQ----NCSATKSNYLRGTGPYPPSVDWRKKGHFVSPVKNQGACGSCWTFSTTGA 150

Query:   166 VEGINQIVSGNLTSLSEQELIDCDTSFNN-GCNGGLMDYAFKYIVASGGLHKEEDYPYLM 224
             +E    I +G + SL+EQ+L+DC   FNN GC GGL   AF+YI+ + G+  E+ YPY  
Sbjct:   151 LESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYNNGIMGEDTYPYQG 210

Query:   225 EEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALA-HQPVSVAIEASGTDFQFYSGGVF 283
             ++  C+ +  +  +  +    ++   DE ++++A+A + PVS A E +  DF  Y  G++
Sbjct:   211 KDSDCKFQPGKA-IGFVKDVANITIYDEDAMVEAVALYNPVSFAFEVT-QDFMMYKRGIY 268

Query:   284 TGP-CGA---ELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCG 339
             +   C     +++H V AVGYG+  G  Y IVKNSWGP+WG  GY  ++R  GK   +CG
Sbjct:   269 SSTSCHKTPDKVNHAVLAVGYGEENGIPYWIVKNSWGPQWGMNGYFLIER--GK--NMCG 324

Query:   340 INKMASIPL 348
             +   AS P+
Sbjct:   325 LAACASYPV 333


>RGD|1560071
            symbol:Ctsll3 "cathepsin L-like 3" species:10116 "Rattus
            norvegicus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 RGD:1560071 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            EMBL:CH474032 IPI:IPI00560469 RefSeq:XP_001065834.2
            RefSeq:XP_573976.3 UniGene:Rn.104851 MEROPS:C01.107
            Ensembl:ENSRNOT00000061398 GeneID:498691 KEGG:rno:498691
            UCSC:RGD:1560071 CTD:70202 OMA:NCGIASD OrthoDB:EOG4HDSTZ
            NextBio:700548 Uniprot:D3ZJV2
        Length = 330

 Score = 584 (210.6 bits), Expect = 9.6e-57, P = 9.6e-57
 Identities = 133/328 (40%), Positives = 184/328 (56%)

Query:    30 IVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTS- 88
             ++  +P H  S D +   +E W +KHGKTY   EE   R  +++ N+K I+  N++    
Sbjct:    14 MISAAPTHDPSFDTV---WEEWKTKHGKTYNTNEEGQKR-AVWENNMKMINLHNEDYLKG 69

Query:    89 ---YWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAV 145
                + L +N F D+++ EF+    G + Q     +   E    DV   PK+VDWRK G V
Sbjct:    70 KHGFSLEMNAFGDLTNTEFRELMTGFQGQKTKMMKVFPEPFLGDV---PKTVDWRKHGYV 126

Query:   146 TPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNN-GCNGGLMDYA 204
             TPVKNQG CGSCWAFS V ++EG     +G L  LSEQ L+DC  S  N GC+GGL D+A
Sbjct:   127 TPVKNQGPCGSCWAFSAVGSLEGQVFRKTGKLVPLSEQNLVDCSWSHGNKGCDGGLPDFA 186

Query:   205 FKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQ-P 263
             F+Y+  +GGL     YPY    GTC     +     + G+  +P + E +L+KA+A   P
Sbjct:   187 FQYVKDNGGLDTSVSYPYEALNGTCR-YNPKYSAAKVVGFMSIPPS-ENALMKAVATVGP 244

Query:   264 VSVAIEASGTDFQFYSGGVFTGP-CGA-ELDHGVAAVGYGK-SKGSDYIIVKNSWGPKWG 320
             +SV I+     FQFY GG++  P C +  L+H V  VGYG+ S G  Y +VKNSWG  WG
Sbjct:   245 ISVGIDIKHKSFQFYKGGMYYEPDCSSTNLNHAVLVVGYGEESDGRKYWLVKNSWGRDWG 304

Query:   321 ERGYIRMKRNTGKPEGLCGINKMASIPL 348
               GYI+M ++       CGI   AS P+
Sbjct:   305 MDGYIKMAKDWNNN---CGIASDASYPI 329


>WB|WBGene00000776
            symbol:cpl-1 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0040010 "positive regulation
            of growth rate" evidence=IMP] [GO:0009792 "embryo development
            ending in birth or egg hatching" evidence=IMP] [GO:0040011
            "locomotion" evidence=IMP] [GO:0070265 "necrotic cell death"
            evidence=IMP] [GO:0031983 "vesicle lumen" evidence=IDA] [GO:0042718
            "yolk granule" evidence=IDA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0009792 GO:GO:0040010 eggNOG:COG4870 HOGENOM:HOG000230774
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0040011
            GO:GO:0031983 GO:GO:0070265 GeneTree:ENSGT00660000095458 KO:K01365
            GO:GO:0042718 MEROPS:I29.009 EMBL:Z92812 GeneID:180111
            KEGG:cel:CELE_T03E6.7 CTD:180111 PIR:T24387 RefSeq:NP_001256718.1
            HSSP:P80067 ProteinModelPortal:O45734 SMR:O45734 DIP:DIP-26616N
            IntAct:O45734 MINT:MINT-211563 STRING:O45734 PaxDb:O45734
            EnsemblMetazoa:T03E6.7.1 EnsemblMetazoa:T03E6.7.2 UCSC:T03E6.7.1
            WormBase:T03E6.7a InParanoid:O45734 OMA:HIENHNR NextBio:908128
            Uniprot:O45734
        Length = 337

 Score = 584 (210.6 bits), Expect = 9.6e-57, P = 9.6e-57
 Identities = 136/331 (41%), Positives = 186/331 (56%)

Query:    29 SIVGYSPEHLT-SMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT 87
             ++V  +   L+  ++  IE ++ +     K Y   EE+ +  E F +N+ HI+  N++  
Sbjct:    12 AVVAVNSAKLSRQIESAIEKWDDYKEDFDKEYSESEEQTY-MEAFVKNMIHIENHNRDHR 70

Query:    88 ----SYWLGLNEFADMSHEEFKNKYLGLKPQF-PTRRQPSAEFSYRDVKALPKSVDWRKK 142
                 ++ +GLN  AD+   +++ K  G +  F  +R + S+ F       +P  VDWR  
Sbjct:    71 LGRKTFEMGLNHIADLPFSQYR-KLNGYRRLFGDSRIKNSSSFLAPFNVQVPDEVDWRDT 129

Query:   143 GAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLM 201
               VT VKNQG CGSCWAFS   A+EG +    G L SLSEQ L+DC T + N+GCNGGLM
Sbjct:   130 HLVTDVKNQGMCGSCWAFSATGALEGQHARKLGQLVSLSEQNLVDCSTKYGNHGCNGGLM 189

Query:   202 DYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH 261
             D AF+YI  + G+  EE YPY   +  C   K+ +      GY D PE DE+ L  A+A 
Sbjct:   190 DQAFEYIRDNHGVDTEESYPYKGRDMKCHFNKKTVGADD-KGYVDTPEGDEEQLKIAVAT 248

Query:   262 Q-PVSVAIEASGTDFQFYSGGVFTGP-CGAE-LDHGVAAVGYGKS-KGSDYIIVKNSWGP 317
             Q P+S+AI+A    FQ Y  GV+    C +E LDHGV  VGYG   +  DY IVKNSWG 
Sbjct:   249 QGPISIAIDAGHRSFQLYKKGVYYDEECSSEELDHGVLLVGYGTDPEHGDYWIVKNSWGA 308

Query:   318 KWGERGYIRMKRNTGKPEGLCGINKMASIPL 348
              WGE+GYIR+ RN       CG+   AS PL
Sbjct:   309 GWGEKGYIRIARNRNNH---CGVATKASYPL 336


>UNIPROTKB|O60911
            symbol:CTSL2 "Cathepsin L2" species:9606 "Homo sapiens"
            [GO:0004177 "aminopeptidase activity" evidence=IEA] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=IEA] [GO:0005902
            "microvillus" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0007283 "spermatogenesis" evidence=IEA] [GO:0009267 "cellular
            response to starvation" evidence=IEA] [GO:0009749 "response to
            glucose stimulus" evidence=IEA] [GO:0009897 "external side of
            plasma membrane" evidence=IEA] [GO:0010259 "multicellular
            organismal aging" evidence=IEA] [GO:0021675 "nerve development"
            evidence=IEA] [GO:0030984 "kininogen binding" evidence=IEA]
            [GO:0032403 "protein complex binding" evidence=IEA] [GO:0034698
            "response to gonadotropin stimulus" evidence=IEA] [GO:0042277
            "peptide binding" evidence=IEA] [GO:0043005 "neuron projection"
            evidence=IEA] [GO:0043204 "perikaryon" evidence=IEA] [GO:0046697
            "decidualization" evidence=IEA] [GO:0048102 "autophagic cell death"
            evidence=IEA] [GO:0051384 "response to glucocorticoid stimulus"
            evidence=IEA] [GO:0060008 "Sertoli cell differentiation"
            evidence=IEA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=TAS] [GO:0019886 "antigen processing and presentation of
            exogenous peptide antigen via MHC class II" evidence=TAS]
            [GO:0043202 "lysosomal lumen" evidence=TAS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 Reactome:REACT_118779 Reactome:REACT_6900
            GO:GO:0009897 GO:GO:0019886 GO:GO:0034698 GO:GO:0043204
            GO:GO:0009749 GO:GO:0030141 GO:GO:0051384 eggNOG:COG4870
            HOGENOM:HOG000230774 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0045177 GO:GO:0043005 GO:GO:0007283
            GO:GO:0004177 GO:GO:0042277 GO:GO:0009267 GO:GO:0021675
            GO:GO:0043202 GO:GO:0005902 GO:GO:0010259 GO:GO:0004197
            GO:GO:0048102 GO:GO:0046697 HOVERGEN:HBG011513 CTD:1515
            OrthoDB:EOG48PMKF OMA:FDQNLDT GO:GO:0060008 EMBL:Y14734
            EMBL:AB001928 EMBL:AF070448 EMBL:AB019534 EMBL:AY358641
            EMBL:AL445670 EMBL:BC023504 EMBL:BC110512 IPI:IPI00000013
            RefSeq:NP_001188504.1 RefSeq:NP_001324.2 UniGene:Hs.610096 PDB:1FH0
            PDB:3H6S PDB:3KFQ PDBsum:1FH0 PDBsum:3H6S PDBsum:3KFQ
            ProteinModelPortal:O60911 SMR:O60911 IntAct:O60911 STRING:O60911
            MEROPS:I29.010 PhosphoSite:O60911 PaxDb:O60911 PeptideAtlas:O60911
            PRIDE:O60911 Ensembl:ENST00000259470 Ensembl:ENST00000538255
            GeneID:1515 KEGG:hsa:1515 UCSC:uc004awt.3 GeneCards:GC09M099794
            HGNC:HGNC:2538 HPA:CAB017112 MIM:603308 neXtProt:NX_O60911
            PharmGKB:PA27036 InParanoid:O60911 KO:K01375 PhylomeDB:O60911
            BRENDA:3.4.22.43 SABIO-RK:O60911 BindingDB:O60911 ChEMBL:CHEMBL3272
            ChiTaRS:CTSL2 EvolutionaryTrace:O60911 GenomeRNAi:1515 NextBio:6277
            Bgee:O60911 CleanEx:HS_CTSL2 Genevestigator:O60911
            GermOnline:ENSG00000136943 Uniprot:O60911
        Length = 334

 Score = 582 (209.9 bits), Expect = 1.6e-56, P = 1.6e-56
 Identities = 129/309 (41%), Positives = 179/309 (57%)

Query:    51 WMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTS----YWLGLNEFADMSHEEFKN 106
             W + H + Y   EE   R  ++++N+K I+  N E +     + + +N F DM++EEF+ 
Sbjct:    32 WKATHRRLYGANEEGWRR-AVWEKNMKMIELHNGEYSQGKHGFTMAMNAFGDMTNEEFRQ 90

Query:   107 KYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAV 166
                  + Q   + +   E  + D   LPKSVDWRKKG VTPVKNQ  CGSCWAFS   A+
Sbjct:    91 MMGCFRNQKFRKGKVFREPLFLD---LPKSVDWRKKGYVTPVKNQKQCGSCWAFSATGAL 147

Query:   167 EGINQIVSGNLTSLSEQELIDCDT-SFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLME 225
             EG     +G L SLSEQ L+DC     N GCNGG M  AF+Y+  +GGL  EE YPY+  
Sbjct:   148 EGQMFRKTGKLVSLSEQNLVDCSRPQGNQGCNGGFMARAFQYVKENGGLDSEESYPYVAV 207

Query:   226 EGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQ-PVSVAIEASGTDFQFYSGGVFT 284
             +  C+ + E   V   +G+  V    E++L+KA+A   P+SVA++A  + FQFY  G++ 
Sbjct:   208 DEICKYRPEN-SVANDTGFTVVAPGKEKALMKAVATVGPISVAMDAGHSSFQFYKSGIYF 266

Query:   285 GP-CGAE-LDHGVAAVGYG----KSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLC 338
              P C ++ LDHGV  VGYG     S  S Y +VKNSWGP+WG  GY+++ ++       C
Sbjct:   267 EPDCSSKNLDHGVLVVGYGFEGANSNNSKYWLVKNSWGPEWGSNGYVKIAKDKNNH---C 323

Query:   339 GINKMASIP 347
             GI   AS P
Sbjct:   324 GIATAASYP 332


>UNIPROTKB|G3SSC1
            symbol:CTSH "Uncharacterized protein" species:9785
            "Loxodonta africana" [GO:0001656 "metanephros development"
            evidence=ISS] [GO:0001913 "T cell mediated cytotoxicity"
            evidence=ISS] [GO:0002764 "immune response-regulating signaling
            pathway" evidence=ISS] [GO:0004175 "endopeptidase activity"
            evidence=ISS] [GO:0004177 "aminopeptidase activity" evidence=ISS]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0004252 "serine-type endopeptidase activity" evidence=ISS]
            [GO:0005615 "extracellular space" evidence=ISS] [GO:0005764
            "lysosome" evidence=ISS] [GO:0005829 "cytosol" evidence=ISS]
            [GO:0006508 "proteolysis" evidence=ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=ISS] [GO:0008284 "positive regulation
            of cell proliferation" evidence=ISS] [GO:0010628 "positive
            regulation of gene expression" evidence=ISS] [GO:0010634 "positive
            regulation of epithelial cell migration" evidence=ISS] [GO:0010813
            "neuropeptide catabolic process" evidence=ISS] [GO:0010815
            "bradykinin catabolic process" evidence=ISS] [GO:0010952 "positive
            regulation of peptidase activity" evidence=ISS] [GO:0016505
            "apoptotic protease activator activity" evidence=ISS] [GO:0030108
            "HLA-A specific activating MHC class I receptor activity"
            evidence=ISS] [GO:0030335 "positive regulation of cell migration"
            evidence=ISS] [GO:0031638 "zymogen activation" evidence=ISS]
            [GO:0031648 "protein destabilization" evidence=ISS] [GO:0032526
            "response to retinoic acid" evidence=ISS] [GO:0033619 "membrane
            protein proteolysis" evidence=ISS] [GO:0043066 "negative regulation
            of apoptotic process" evidence=ISS] [GO:0043129 "surfactant
            homeostasis" evidence=ISS] [GO:0045766 "positive regulation of
            angiogenesis" evidence=ISS] [GO:0060448 "dichotomous subdivision of
            terminal units involved in lung branching" evidence=ISS]
            [GO:0070324 "thyroid hormone binding" evidence=ISS] [GO:0070371
            "ERK1 and ERK2 cascade" evidence=ISS] [GO:0097208 "alveolar
            lamellar body" evidence=ISS] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005829 GO:GO:0043066 GO:GO:0005615 GO:GO:0008284
            GO:GO:0070371 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766
            GO:GO:0004177 GO:GO:0004252 GO:GO:0005764 GO:GO:0097208
            GO:GO:0032526 GO:GO:0010628 GO:GO:0070324 GO:GO:0016505
            GO:GO:0001656 GO:GO:0010634 GO:GO:0004197 GO:GO:0031648
            GO:GO:0097067 GO:GO:0031638 GO:GO:0001913
            GeneTree:ENSGT00660000095458 OMA:STSCHKT GO:GO:0030108
            GO:GO:0010815 GO:GO:0060448 GO:GO:0002764 GO:GO:0033619
            GO:GO:0010813 GO:GO:0043129 RefSeq:XP_003413898.1
            Ensembl:ENSLAFT00000003415 GeneID:100662496 Uniprot:G3SSC1
        Length = 335

 Score = 580 (209.2 bits), Expect = 2.5e-56, P = 2.5e-56
 Identities = 127/309 (41%), Positives = 184/309 (59%)

Query:    48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNK 107
             F+SWM++H K Y   EE   R + F  N + I+  N    ++ + LN+F+DM+  E K K
Sbjct:    35 FQSWMAQHQKKYSS-EEYHQRQQTFVSNWRKINAHNARNHTFKMALNQFSDMTFAEIKQK 93

Query:   108 YLGLKPQFPTRRQPSAEFSY-RDVKALPKSVDWRKKGA-VTPVKNQGSCGSCWAFSTVAA 165
             YL  +PQ       + + +Y R     P  VDWRKKG  V+PVKNQG+CGSCW FST  A
Sbjct:    94 YLWSEPQ----NCSATKGNYLRGTGPYPPFVDWRKKGHFVSPVKNQGACGSCWTFSTTGA 149

Query:   166 VEGINQIVSGNLTSLSEQELIDCDTSFNN-GCNGGLMDYAFKYIVASGGLHKEEDYPYLM 224
             +E    I  G L SL+EQ+L+DC   FNN GC GGL   AF+YI+ + G+  E+ YPY  
Sbjct:   150 LESAIAIAGGKLLSLAEQQLVDCAKDFNNHGCQGGLPSQAFEYILYNKGIMGEDTYPYKG 209

Query:   225 EEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALA-HQPVSVAIEASGTDFQFYSGGVF 283
             ++  C+ + ++  +  +    ++  NDE+++++A+A + PVS A E +  DF  YS G++
Sbjct:   210 QDDVCKFQPKKA-IAFVKDVANITLNDEEAMVEAVALYNPVSFAFEVTD-DFMKYSKGIY 267

Query:   284 TGP-CGA---ELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCG 339
             +   C     +++H V AVGYG+ KG  Y IVKNSWGP WG  GY  ++R  GK   +CG
Sbjct:   268 SSTSCHKTPDKVNHAVLAVGYGEEKGIPYWIVKNSWGPYWGMDGYFLIER--GK--NMCG 323

Query:   340 INKMASIPL 348
             +   AS P+
Sbjct:   324 LAACASYPI 332


>DICTYBASE|DDB_G0291191
            symbol:DDB_G0291191 "cysteine protease" species:44689
            "Dictyostelium discoideum" [GO:0005615 "extracellular space"
            evidence=IDA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0044351 "macropinocytosis" evidence=RCA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 dictyBase:DDB_G0291191
            GO:GO:0005615 eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 EMBL:AAFI02000175 MEROPS:C01.022
            ProtClustDB:CLSZ2429603 RefSeq:XP_635374.1
            ProteinModelPortal:Q54F16 PRIDE:Q54F16 EnsemblProtists:DDB0252831
            GeneID:8628022 KEGG:ddi:DDB_G0291191 OMA:NETQIAS Uniprot:Q54F16
        Length = 352

 Score = 579 (208.9 bits), Expect = 3.3e-56, P = 3.3e-56
 Identities = 132/326 (40%), Positives = 179/326 (54%)

Query:    48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYW----LGLNEFADMSHEE 103
             F ++ +K+ K Y   EE L +FE FK NL +ID  NK+ T+       G+N+FAD+S EE
Sbjct:    27 FIAFQNKYNKIYSA-EEYLVKFETFKSNLLNIDALNKQATTIGSDTKFGVNKFADLSKEE 85

Query:   104 FKNKYLGLKPQFPTRRQPSAEFSYRDV-KALPKSVDWRKKGA---------VTPVKNQGS 153
             FK  YL  K    T   P       D+  A P + DWR  G          VT VKNQG 
Sbjct:    86 FKKYYLSSKEARLTDDLPMLPNLSDDIISATPAAFDWRNTGGSTKFPQGTPVTAVKNQGQ 145

Query:   154 CGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCD---TSFNN------GCNGGLMDYA 204
             CGSCW+FST   VEG + + +G L  LSEQ L+DCD    ++ N      GC+GGL   A
Sbjct:   146 CGSCWSFSTTGNVEGQHYLSTGTLVGLSEQNLVDCDHTCMTYENENVCNAGCDGGLQPNA 205

Query:   205 FKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPV 264
             + YI+ +GG+  E  YPY   +G C+    ++    IS +  VP+N+ Q       + P+
Sbjct:   206 YNYIIKNGGIQTEATYPYTAVDGECKFNSAQVGA-KISSFTMVPQNETQIASYLFNNGPL 264

Query:   265 SVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKS-----KGSDYIIVKNSWGPKW 319
             ++A +A   ++QFY GGVF  PCG  LDHG+  VGYG       K + Y I+KNSWG  W
Sbjct:   265 AIAADAE--EWQFYMGGVFDFPCGQTLDHGILIVGYGAQDTIVGKNTPYWIIKNSWGADW 322

Query:   320 GERGYIRMKRNTGKPEGLCGINKMAS 345
             GE GY++++RNT K    CG+    S
Sbjct:   323 GEAGYLKVERNTDK----CGVANFVS 344


>MGI|MGI:107341
            symbol:Ctss "cathepsin S" species:10090 "Mus musculus"
            [GO:0005764 "lysosome" evidence=ISO;IDA] [GO:0006508 "proteolysis"
            evidence=IDA] [GO:0008152 "metabolic process" evidence=ISO]
            [GO:0008233 "peptidase activity" evidence=IDA] [GO:0008234
            "cysteine-type peptidase activity" evidence=ISO] [GO:0009986 "cell
            surface" evidence=ISO] [GO:0016020 "membrane" evidence=IDA]
            [GO:0016787 "hydrolase activity" evidence=IEA] [GO:0045453 "bone
            resorption" evidence=ISO] [GO:0051930 "regulation of sensory
            perception of pain" evidence=ISO] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 MGI:MGI:107341 GO:GO:0016020 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GO:GO:0008233 GO:GO:0031905 Reactome:REACT_102124
            GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 BRENDA:3.4.22.27
            ChiTaRS:CTSS EMBL:AF051732 EMBL:AF051727 EMBL:AF051728
            EMBL:AF051729 EMBL:AF051726 EMBL:AF051730 EMBL:AF051731
            EMBL:AF038546 EMBL:AJ002386 EMBL:AC092203 EMBL:Y18466 EMBL:AJ223208
            IPI:IPI00309520 UniGene:Mm.3619 PDB:1M0H PDBsum:1M0H
            ProteinModelPortal:O70370 SMR:O70370 STRING:O70370
            PhosphoSite:O70370 PaxDb:O70370 PRIDE:O70370
            Ensembl:ENSMUST00000116304 BindingDB:O70370 ChEMBL:CHEMBL4098
            NextBio:282932 Bgee:O70370 CleanEx:MM_CTSS Genevestigator:O70370
            GermOnline:ENSMUSG00000038642 Uniprot:O70370
        Length = 340

 Score = 578 (208.5 bits), Expect = 4.2e-56, P = 4.2e-56
 Identities = 129/310 (41%), Positives = 178/310 (57%)

Query:    48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT----SYWLGLNEFADMSHEE 103
             ++ W   H K YK   E+  R  I+++NLK I   N E +    +Y +G+N+  DM++EE
Sbjct:    36 WDLWKKTHEKEYKDKNEEEVRRLIWEKNLKFIMIHNLEYSMGMHTYQVGMNDMGDMTNEE 95

Query:   104 FKNKYLGLKPQFPTRRQPSAEF-SYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFST 162
                +   L+   P +   +  F SY + + LP +VDWR+KG VT VK QGSCG+CWAFS 
Sbjct:    96 ILCRMGALR--IPRQSPKTVTFRSYSN-RTLPDTVDWREKGCVTEVKYQGSCGACWAFSA 152

Query:   163 VAAVEGINQIVSGNLTSLSEQELIDCDTSF---NNGCNGGLMDYAFKYIVASGGLHKEED 219
             V A+EG  ++ +G L SLS Q L+DC       N GC GG M  AF+YI+ +GG+  +  
Sbjct:   153 VGALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQYIIDNGGIEADAS 212

Query:   220 YPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQ-PVSVAIEASGTDFQFY 278
             YPY   +  C    +     T S Y  +P  DE +L +A+A + PVSV I+AS + F FY
Sbjct:   213 YPYKATDEKCHYNSKN-RAATCSRYIQLPFGDEDALKEAVATKGPVSVGIDASHSSFFFY 271

Query:   279 SGGVFTGP-CGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGL 337
               GV+  P C   ++HGV  VGYG   G DY +VKNSWG  +G++GYIRM RN    +  
Sbjct:   272 KSGVYDDPSCTGNVNHGVLVVGYGTLDGKDYWLVKNSWGLNFGDQGYIRMARNN---KNH 328

Query:   338 CGINKMASIP 347
             CGI    S P
Sbjct:   329 CGIASYCSYP 338


>TAIR|locus:2050145
            symbol:AT2G21430 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 EMBL:CP002685 GenomeReviews:CT485783_GR
            EMBL:AC006841 EMBL:X74359 IPI:IPI00519637 PIR:B84601
            RefSeq:NP_565512.1 UniGene:At.14069 ProteinModelPortal:P43295
            SMR:P43295 MEROPS:C01.A04 PRIDE:P43295 EnsemblPlants:AT2G21430.1
            GeneID:816682 KEGG:ath:AT2G21430 TAIR:At2g21430 eggNOG:COG4870
            HOGENOM:HOG000230774 InParanoid:P43295 KO:K01373 OMA:GSIEEHY
            PhylomeDB:P43295 ProtClustDB:CLSN2688311 Genevestigator:P43295
            GermOnline:AT2G21430 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 Uniprot:P43295
        Length = 361

 Score = 578 (208.5 bits), Expect = 4.2e-56, P = 4.2e-56
 Identities = 131/329 (39%), Positives = 195/329 (59%)

Query:    35 PEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLN 94
             P+ L+S D    LF+    K GK Y  IEE  +RF +FK NL    +  K   S   G+ 
Sbjct:    39 PKVLSSEDHFT-LFKK---KFGKVYGSIEEHYYRFSVFKANLLRAMRHQKMDPSARHGVT 94

Query:    95 EFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSC 154
             +F+D++  EF+ K+LG+K  F   +  + +      + LP+  DWR +GAVTPVKNQGSC
Sbjct:    95 QFSDLTRSEFRRKHLGVKGGFKLPKDAN-QAPILPTQNLPEEFDWRDRGAVTPVKNQGSC 153

Query:   155 GSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDT--------SFNNGCNGGLMDYAFK 206
             GSCW+FST  A+EG + + +G L SLSEQ+L+DCD         S ++GCNGGLM+ AF+
Sbjct:   154 GSCWSFSTTGALEGAHFLATGKLVSLSEQQLVDCDHECDPEEEGSCDSGCNGGLMNSAFE 213

Query:   207 YIVASGGLHKEEDYPYL-MEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVS 265
             Y + +GGL +E+DYPY   + G+C+  + ++ V ++S +  V  N++Q     + + P++
Sbjct:   214 YTLKTGGLMREKDYPYTGTDGGSCKLDRSKI-VASVSNFSVVSINEDQIAANLIKNGPLA 272

Query:   266 VAIEASGTDFQFYSGGVFTGP--CGAELDHGVAAVGYGKS-------KGSDYIIVKNSWG 316
             VAI A+    Q Y GGV + P  C   L+HGV  VGYG +       K   Y I+KNSWG
Sbjct:   273 VAINAAY--MQTYIGGV-SCPYICSRRLNHGVLLVGYGSAGFSQARLKEKPYWIIKNSWG 329

Query:   317 PKWGERGYIRMKRNTGKPEGLCGINKMAS 345
               WGE G+ ++ +  G+   +CG++ + S
Sbjct:   330 ESWGENGFYKICK--GR--NICGVDSLVS 354


>DICTYBASE|DDB_G0290957
            symbol:cprA "cysteine proteinase 1" species:44689
            "Dictyostelium discoideum" [GO:0006972 "hyperosmotic response"
            evidence=IEP] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0005764 "lysosome" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 dictyBase:DDB_G0290957
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GenomeReviews:CM000154_GR GO:GO:0005764
            GO:GO:0006972 EMBL:AAFI02000174 KO:K01376 EMBL:X02407 PIR:A22827
            RefSeq:XP_635417.1 ProteinModelPortal:P04988 MEROPS:C01.022
            GlycoSuiteDB:P04988 SWISS-2DPAGE:P04988 EnsemblProtists:DDB0201647
            GeneID:8627918 KEGG:ddi:DDB_G0290957 OMA:KISNFTM
            ProtClustDB:CLSZ2429603 Uniprot:P04988
        Length = 343

 Score = 574 (207.1 bits), Expect = 1.1e-55, P = 1.1e-55
 Identities = 132/319 (41%), Positives = 177/319 (55%)

Query:    48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYW----LGLNEFADMSHEE 103
             F  +  K  K Y   EE L RFEIFK NL  I++ N    ++      G+N+FAD+S +E
Sbjct:    29 FLEFQDKFNKKYSH-EEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVNKFADLSSDE 87

Query:   104 FKNKYLGLKPQFPTRRQPSAEFSYRD-VKALPKSVDWRKKGAVTPVKNQGSCGSCWAFST 162
             FKN YL  K    T   P A++   + + ++P + DWR +GAVTPVKNQG CGSCW+FST
Sbjct:    88 FKNYYLNNKEAIFTDDLPVADYLDDEFINSIPTAFDWRTRGAVTPVKNQGQCGSCWSFST 147

Query:   163 VAAVEGINQIVSGNLTSLSEQELIDCD---------TSFNNGCNGGLMDYAFKYIVASGG 213
                VEG + I    L SLSEQ L+DCD          + + GCNGGL   A+ YI+ +GG
Sbjct:   148 TGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEQACDEGCNGGLQPNAYNYIIKNGG 207

Query:   214 LHKEEDYPYLMEEGT-CEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASG 272
             +  E  YPY  E GT C      +    IS +  +P+N+       ++  P+++A +A  
Sbjct:   208 IQTESSYPYTAETGTQCNFNSANIGA-KISNFTMIPKNETVMAGYIVSTGPLAIAADA-- 264

Query:   273 TDFQFYSGGVFTGPCGAE-LDHGVAAVGYGKS-----KGSDYIIVKNSWGPKWGERGYIR 326
              ++QFY GGVF  PC    LDHG+  VGY        K   Y IVKNSWG  WGE+GYI 
Sbjct:   265 VEWQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIY 324

Query:   327 MKRNTGKPEGLCGINKMAS 345
             ++R  GK    CG++   S
Sbjct:   325 LRR--GK--NTCGVSNFVS 339


>TAIR|locus:2130180 [details] [associations]
            symbol:AT4G16190 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] [GO:0005773 "vacuole"
            evidence=IDA] [GO:0042744 "hydrogen peroxide catabolic process"
            evidence=RCA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 GO:GO:0005773
            EMBL:CP002687 HOGENOM:HOG000230774 KO:K01373 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 EMBL:Z97340 EMBL:AL161543 UniGene:At.25555
            EMBL:AY039556 EMBL:AY129473 EMBL:AY136316 EMBL:BT000733
            EMBL:AK226366 IPI:IPI00543588 PIR:D71428 RefSeq:NP_567489.1
            HSSP:P25779 ProteinModelPortal:Q9SUL1 SMR:Q9SUL1 STRING:Q9SUL1
            MEROPS:C01.A06 PRIDE:Q9SUL1 EnsemblPlants:AT4G16190.1 GeneID:827311
            KEGG:ath:AT4G16190 TAIR:At4g16190 InParanoid:Q9SUL1 OMA:NACGINK
            PhylomeDB:Q9SUL1 ProtClustDB:CLSN2917559 Genevestigator:Q9SUL1
            Uniprot:Q9SUL1
        Length = 373

 Score = 574 (207.1 bits), Expect = 1.1e-55, P = 1.1e-55
 Identities = 132/320 (41%), Positives = 188/320 (58%)

Query:    48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT-SYWLGLNEFADMSHEEFKN 106
             F  + SK+ KTY    E  HRF +FK NL+   +RN+ +  S   G+ +F+D++ +EF+ 
Sbjct:    55 FTLFKSKYEKTYATQVEHDHRFRVFKANLRRA-RRNQLLDPSAVHGVTQFSDLTPKEFRR 113

Query:   107 KYLGLKPQ---FPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTV 163
             K+LGLK +    PT  Q +      D   LP   DWR++GAVTPVKNQG CGSCW+FS +
Sbjct:   114 KFLGLKRRGFRLPTDTQTAPILPTSD---LPTEFDWREQGAVTPVKNQGMCGSCWSFSAI 170

Query:   164 AAVEGINQIVSGNLTSLSEQELIDCD--------TSFNNGCNGGLMDYAFKYIVASGGLH 215
              A+EG + + +  L SLSEQ+L+DCD         S ++GC+GGLM+ AF+Y + +GGL 
Sbjct:   171 GALEGAHFLATKELVSLSEQQLVDCDHECDPAQANSCDSGCSGGLMNNAFEYALKAGGLM 230

Query:   216 KEEDYPYLMEEGT-CEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTD 274
             KEEDYPY   + T C+  K ++ V ++S +  V  +++Q     + H P+++AI A    
Sbjct:   231 KEEDYPYTGRDHTACKFDKSKI-VASVSNFSVVSSDEDQIAANLVQHGPLAIAINAMW-- 287

Query:   275 FQFYSGGVFTGP--CGAELDHGVAAVGYGKS-------KGSDYIIVKNSWGPKWGERGYI 325
              Q Y GGV + P  C    DHGV  VG+G S       K   Y I+KNSWG  WGE GY 
Sbjct:   288 MQTYIGGV-SCPYVCSKSQDHGVLLVGFGSSGYAPIRLKEKPYWIIKNSWGAMWGEHGYY 346

Query:   326 RMKRNTGKPEGLCGINKMAS 345
             ++ R    P  +CG++ M S
Sbjct:   347 KICRG---PHNMCGMDTMVS 363


>WB|WBGene00007055 [details] [associations]
            symbol:tag-196 species:6239 "Caenorhabditis elegans"
            [GO:0004869 "cysteine-type endopeptidase inhibitor activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000010
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00031 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00043 SMART:SM00645 InterPro:IPR000169
            GO:GO:0004869 eggNOG:COG4870 HOGENOM:HOG000230774 KO:K01373
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00660000095458 EMBL:FO080488 PIR:T31871
            RefSeq:NP_505215.2 HSSP:Q9UBX1 ProteinModelPortal:O16454 SMR:O16454
            DIP:DIP-27400N IntAct:O16454 MINT:MINT-1044990 MEROPS:C01.A50
            PaxDb:O16454 EnsemblMetazoa:F41E6.6.1 EnsemblMetazoa:F41E6.6.2
            EnsemblMetazoa:F41E6.6.3 GeneID:179240 KEGG:cel:CELE_F41E6.6
            UCSC:F41E6.6.1 CTD:179240 WormBase:F41E6.6 InParanoid:O16454
            OMA:GGGLMTN NextBio:904514 Uniprot:O16454
        Length = 477

 Score = 574 (207.1 bits), Expect = 1.1e-55, P = 1.1e-55
 Identities = 137/312 (43%), Positives = 182/312 (58%)

Query:    48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHID--QRNKEVTSYWLGLNEFADMSHEEFK 105
             F  ++ +H K Y    E L RF +FK+N K I   Q+N++ T+ + G  +F+DM+  EFK
Sbjct:   174 FLDFVDRHEKKYTNKREVLKRFRVFKKNAKVIRELQKNEQGTAVY-GFTKFSDMTTMEFK 232

Query:   106 NKYLGLKPQFPTRRQPSAEFSYRDV----KALPKSVDWRKKGAVTPVKNQGSCGSCWAFS 161
                L  + + P      A F   DV    + LP+S DWR+KGAVT VKNQG+CGSCWAFS
Sbjct:   233 KIMLPYQWEQPVYPMEQANFEKHDVTINEEDLPESFDWREKGAVTQVKNQGNCGSCWAFS 292

Query:   162 TVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYP 221
             T   VEG   I    L SLSEQEL+DCD S + GCNGGL   A+K I+  GGL  E+ YP
Sbjct:   293 TTGNVEGAWFIAKNKLVSLSEQELVDCD-SMDQGCNGGLPSNAYKEIIRMGGLEPEDAYP 351

Query:   222 YLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQ-PVSVAIEASGTDFQFYSG 280
             Y     TC   ++++ V  I+G  ++P +DE  + K L  + P+S+ + A+    QFY  
Sbjct:   352 YDGRGETCHLVRKDIAVY-INGSVELP-HDEVEMQKWLVTKGPISIGLNAN--TLQFYRH 407

Query:   281 GV---FTGPCGA-ELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEG 336
             GV   F   C    L+HGV  VGYGK     Y IVKNSWGP WGE GY ++ R  GK   
Sbjct:   408 GVVHPFKIFCEPFMLNHGVLIVGYGKDGRKPYWIVKNSWGPNWGEAGYFKLYR--GK--N 463

Query:   337 LCGINKMASIPL 348
             +CG+ +MA+  L
Sbjct:   464 VCGVQEMATSAL 475


>UNIPROTKB|P25774 [details] [associations]
            symbol:CTSS "Cathepsin S" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0016020 "membrane"
            evidence=IEA] [GO:0005576 "extracellular region" evidence=NAS]
            [GO:0005764 "lysosome" evidence=IDA;NAS] [GO:0097067 "cellular
            response to thyroid hormone stimulus" evidence=IEP] [GO:0002250
            "adaptive immune response" evidence=IEP] [GO:0019882 "antigen
            processing and presentation" evidence=TAS] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=TAS] [GO:0006955
            "immune response" evidence=TAS] [GO:0002474 "antigen processing and
            presentation of peptide antigen via MHC class I" evidence=TAS]
            [GO:0002480 "antigen processing and presentation of exogenous
            peptide antigen via MHC class I, TAP-independent" evidence=TAS]
            [GO:0019886 "antigen processing and presentation of exogenous
            peptide antigen via MHC class II" evidence=TAS] [GO:0036021
            "endolysosome lumen" evidence=TAS] [GO:0042590 "antigen processing
            and presentation of exogenous peptide antigen via MHC class I"
            evidence=TAS] [GO:0043202 "lysosomal lumen" evidence=TAS]
            [GO:0045087 "innate immune response" evidence=TAS] [GO:0043231
            "intracellular membrane-bounded organelle" evidence=IDA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 Reactome:REACT_118779
            Reactome:REACT_6900 GO:GO:0005576 GO:GO:0002480 GO:GO:0016020
            GO:GO:0019886 eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045087 EMBL:CH471121
            GO:GO:0004197 GO:GO:0002250 HOVERGEN:HBG011513 GO:GO:0097067
            GO:GO:0036021 EMBL:AL356292 CTD:1520 KO:K01368 OMA:KAMDQKC
            OrthoDB:EOG4JM7Q2 EMBL:S93414 EMBL:M86553 EMBL:M90696 EMBL:U07374
            EMBL:U07370 EMBL:U07371 EMBL:U07372 EMBL:U07373 EMBL:CR541676
            EMBL:AK301472 EMBL:AK314482 EMBL:BC002642 IPI:IPI00299150
            IPI:IPI00910216 PIR:A42482 RefSeq:NP_001186668.1 RefSeq:NP_004070.3
            UniGene:Hs.181301 PDB:1BXF PDB:1GLO PDB:1MS6 PDB:1NPZ PDB:1NQC
            PDB:2C0Y PDB:2F1G PDB:2FQ9 PDB:2FRA PDB:2FRQ PDB:2FT2 PDB:2FUD
            PDB:2FYE PDB:2G6D PDB:2G7Y PDB:2H7J PDB:2HH5 PDB:2HHN PDB:2HXZ
            PDB:2OP3 PDB:2R9M PDB:2R9N PDB:2R9O PDB:3IEJ PDB:3KWN PDB:3MPE
            PDB:3MPF PDB:3N3G PDB:3N4C PDB:3OVX PDBsum:1BXF PDBsum:1GLO
            PDBsum:1MS6 PDBsum:1NPZ PDBsum:1NQC PDBsum:2C0Y PDBsum:2F1G
            PDBsum:2FQ9 PDBsum:2FRA PDBsum:2FRQ PDBsum:2FT2 PDBsum:2FUD
            PDBsum:2FYE PDBsum:2G6D PDBsum:2G7Y PDBsum:2H7J PDBsum:2HH5
            PDBsum:2HHN PDBsum:2HXZ PDBsum:2OP3 PDBsum:2R9M PDBsum:2R9N
            PDBsum:2R9O PDBsum:3IEJ PDBsum:3KWN PDBsum:3MPE PDBsum:3MPF
            PDBsum:3N3G PDBsum:3N4C PDBsum:3OVX ProteinModelPortal:P25774
            SMR:P25774 IntAct:P25774 STRING:P25774 MEROPS:I29.004
            PhosphoSite:P25774 DMDM:88984046 PaxDb:P25774 PeptideAtlas:P25774
            PRIDE:P25774 DNASU:1520 Ensembl:ENST00000368985
            Ensembl:ENST00000448301 GeneID:1520 KEGG:hsa:1520 UCSC:uc001evn.3
            GeneCards:GC01M150702 HGNC:HGNC:2545 HPA:CAB000460 HPA:HPA002988
            MIM:116845 neXtProt:NX_P25774 PharmGKB:PA27041 InParanoid:P25774
            PhylomeDB:P25774 BRENDA:3.4.22.27 BindingDB:P25774
            ChEMBL:CHEMBL2954 ChiTaRS:CTSS EvolutionaryTrace:P25774
            GenomeRNAi:1520 NextBio:6291 PMAP-CutDB:P25774 ArrayExpress:P25774
            Bgee:P25774 CleanEx:HS_CTSS Genevestigator:P25774
            GermOnline:ENSG00000163131 Uniprot:P25774
        Length = 331

 Score = 571 (206.1 bits), Expect = 2.3e-55, P = 2.3e-55
 Identities = 128/306 (41%), Positives = 172/306 (56%)

Query:    51 WMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKE----VTSYWLGLNEFADMSHEEFKN 106
             W   +GK YK   E+  R  I+++NLK +   N E    + SY LG+N   DM+ EE  +
Sbjct:    31 WKKTYGKQYKEKNEEAVRRLIWEKNLKFVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMS 90

Query:   107 KYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAV 166
                 L+   P++ Q +  +     + LP SVDWR+KG VT VK QGSCG+CWAFS V A+
Sbjct:    91 LMSSLR--VPSQWQRNITYKSNPNRILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGAL 148

Query:   167 EGINQIVSGNLTSLSEQELIDCDTSF--NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLM 224
             E   ++ +G L SLS Q L+DC T    N GCNGG M  AF+YI+ + G+  +  YPY  
Sbjct:   149 EAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKA 208

Query:   225 EEGTCE-DKKEEMEVVTISGYQDVPENDEQSLLKALAHQ-PVSVAIEASGTDFQFYSGGV 282
              +  C+ D K      T S Y ++P   E  L +A+A++ PVSV ++A    F  Y  GV
Sbjct:   209 MDQKCQYDSK--YRAATCSKYTELPYGREDVLKEAVANKGPVSVGVDARHPSFFLYRSGV 266

Query:   283 FTGP-CGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGIN 341
             +  P C   ++HGV  VGYG   G +Y +VKNSWG  +GE GYIRM RN G     CGI 
Sbjct:   267 YYEPSCTQNVNHGVLVVGYGDLNGKEYWLVKNSWGHNFGEEGYIRMARNKGNH---CGIA 323

Query:   342 KMASIP 347
                S P
Sbjct:   324 SFPSYP 329


>DICTYBASE|DDB_G0278401 [details] [associations]
            symbol:cprH "cysteine proteinase 8" species:44689
            "Dictyostelium discoideum" [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 dictyBase:DDB_G0278401 EMBL:AAFI02000023
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00639 ProtClustDB:CLSZ2430780 RefSeq:XP_642342.1
            ProteinModelPortal:Q54Y60 MEROPS:C01.A62 EnsemblProtists:DDB0205428
            GeneID:8621547 KEGG:ddi:DDB_G0278401 InParanoid:Q54Y60 OMA:FANMENE
            Uniprot:Q54Y60
        Length = 337

 Score = 569 (205.4 bits), Expect = 3.7e-55, P = 3.7e-55
 Identities = 130/320 (40%), Positives = 180/320 (56%)

Query:    46 ELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFK 105
             + F  WM  + K+Y    E + R+ IFK N  +I++ N + +   LGLN+ AD+++EE++
Sbjct:    28 DAFTDWMISNQKSYSS-SEFITRYNIFKTNFDYIEEWNSKGSETVLGLNKMADITNEEYR 86

Query:   106 NKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAA 165
             + YLG KP F        +           +VDWRKKGAVT VKNQ SC  CW+FS   A
Sbjct:    87 SLYLG-KP-FDASSLIGTKEEILFSNKFSSTVDWRKKGAVTHVKNQQSCSGCWSFSATGA 144

Query:   166 VEGINQIVSGN---LTSLSEQELIDCDTSFNN-GCNGGLMDYAFKYIVASGGLHKEEDYP 221
              EG +++ +     L SLSEQ LIDC T F N GCNGG++ YAF+YI+++GG+  E+ YP
Sbjct:   145 TEGAHKLANNGTNELVSLSEQNLIDCSTPFGNTGCNGGVITYAFEYIISNGGIDTEKSYP 204

Query:   222 YLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGG 281
             +   +GTC  K E     TIS Y +V    E SL  A+   PV+ +I+AS + F FY  G
Sbjct:   205 FEGTDGTCRYKSENSGA-TISSYVNVTFGSESSLESAVNVNPVACSIDASHSSFLFYKSG 263

Query:   282 VFTGP-CG-AELDHGVAAVGYGKSKG-----------SDYIIVKNSWGPKWGERGYIRMK 328
             ++  P C    LDHGV  VGYG               S+Y I KNSWG      GYI M 
Sbjct:   264 IYFEPACSRTNLDHGVLVVGYGTENSQSQDSSSEPNHSNYWIAKNSWGIN----GYILMS 319

Query:   329 RNTGKPEGLCGINKMASIPL 348
             ++    + +CGI+ +AS P+
Sbjct:   320 KDR---DNMCGISTLASFPI 336


>TAIR|locus:2175088 [details] [associations]
            symbol:ALP "aleurain-like protease" species:3702
            "Arabidopsis thaliana" [GO:0005576 "extracellular region"
            evidence=ISM] [GO:0006508 "proteolysis" evidence=IEA;ISS]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA;ISS]
            [GO:0009723 "response to ethylene stimulus" evidence=IEP]
            [GO:0005773 "vacuole" evidence=IDA] [GO:0005829 "cytosol"
            evidence=RCA] [GO:0006096 "glycolysis" evidence=RCA] [GO:0006816
            "calcium ion transport" evidence=RCA] [GO:0006833 "water transport"
            evidence=RCA] [GO:0006972 "hyperosmotic response" evidence=RCA]
            [GO:0007030 "Golgi organization" evidence=RCA] [GO:0009266
            "response to temperature stimulus" evidence=RCA] [GO:0009651
            "response to salt stress" evidence=RCA] [GO:0009750 "response to
            fructose stimulus" evidence=RCA] [GO:0042744 "hydrogen peroxide
            catabolic process" evidence=RCA] [GO:0046686 "response to cadmium
            ion" evidence=RCA] [GO:0007568 "aging" evidence=IEP]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 EMBL:CP002688 GO:GO:0005773
            GO:GO:0007568 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 EMBL:AB011483 KO:K01366
            ProtClustDB:CLSN2689015 UniGene:At.25414 IPI:IPI00846287
            RefSeq:NP_001078774.1 ProteinModelPortal:A8MQZ1 SMR:A8MQZ1
            STRING:A8MQZ1 PRIDE:A8MQZ1 EnsemblPlants:AT5G60360.3 GeneID:836158
            KEGG:ath:AT5G60360 OMA:CGSTPMD Genevestigator:A8MQZ1 Uniprot:A8MQZ1
        Length = 361

 Score = 568 (205.0 bits), Expect = 4.8e-55, P = 4.8e-55
 Identities = 124/298 (41%), Positives = 177/298 (59%)

Query:    48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNK 107
             F  +  ++GK Y+ +EE   RF IFKENL  I   NK+  SY LG+N+FAD++ +EF+  
Sbjct:    59 FARFTHRYGKKYQNVEEMKLRFSIFKENLDLIRSTNKKGLSYKLGVNQFADLTWQEFQRT 118

Query:   108 YLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVE 167
              LG         + S + +     ALP++ DWR+ G V+PVK+QG CGSCW FST  A+E
Sbjct:   119 KLGAAQNCSATLKGSHKVTEA---ALPETKDWREDGIVSPVKDQGGCGSCWTFSTTGALE 175

Query:   168 GINQIVSGNLTSLSEQELIDCDTSFNN-GCNGGLMDYAFKYIVASGGLHKEEDYPYLMEE 226
                    G   SLSEQ+L+DC  +FNN GCNGGL   AF+YI ++GGL  E+ YPY  ++
Sbjct:   176 AAYHQAFGKGISLSEQQLVDCAGAFNNYGCNGGLPSQAFEYIKSNGGLDTEKAYPYTGKD 235

Query:   227 GTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASGTDFQFYSGGVFTG 285
              TC+   E + V  ++   ++    E  L  A+   +PVS+A E   + F+ Y  GV+T 
Sbjct:   236 ETCKFSAENVGVQVLNSV-NITLGAEDELKHAVGLVRPVSIAFEVIHS-FRLYKSGVYTD 293

Query:   286 P-CGA---ELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCG 339
               CG+   +++H V AVGYG   G  Y ++KNSWG  WG++GY +M+   GK   +CG
Sbjct:   294 SHCGSTPMDVNHAVLAVGYGVEDGVPYWLIKNSWGADWGDKGYFKMEM--GK--NMCG 347


>UNIPROTKB|F1NEC8 [details] [associations]
            symbol:CTSL2 "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            EMBL:AADN02067812 IPI:IPI00820956 Ensembl:ENSGALT00000037988
            ArrayExpress:F1NEC8 Uniprot:F1NEC8
        Length = 218

 Score = 567 (204.7 bits), Expect = 6.1e-55, P = 6.1e-55
 Identities = 113/219 (51%), Positives = 144/219 (65%)

Query:   134 PKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDT-SF 192
             P+SVDWR+KG VTPVK+QG CGSCWAFST  A+EG +   +G L SLSEQ L+DC     
Sbjct:     2 PRSVDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQHFRKTGKLVSLSEQNLVDCSRPEG 61

Query:   193 NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDE 252
             N GCNGGLMD AF+Y+  +GG+  EE YPY  ++      K E      +G+ D+P+  E
Sbjct:    62 NQGCNGGLMDQAFQYVQDNGGIDSEESYPYTAKDDEDCRYKAEYNAANDTGFVDIPQGHE 121

Query:   253 QSLLKALAHQ-PVSVAIEASGTDFQFYSGGVFTGP-CGAE-LDHGVAAVGYGKSKGSDYI 309
             ++L+KA+A   PVSVAI+A  + FQFY  G++  P C +E LDHGV  VGYG   G  Y 
Sbjct:   122 RALMKAVASVGPVSVAIDAGHSSFQFYQSGIYYEPDCSSEDLDHGVLVVGYGFEDGKKYW 181

Query:   310 IVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPL 348
             IVKNSWG KWG++GYI M ++    +  CGI   AS PL
Sbjct:   182 IVKNSWGEKWGDKGYIYMAKDR---KNHCGIATAASYPL 217


>UNIPROTKB|P09648 [details] [associations]
            symbol:CTSL1 "Cathepsin L1" species:9031 "Gallus gallus"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0005764 "lysosome"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 eggNOG:COG4870
            HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 HOVERGEN:HBG011513
            OrthoDB:EOG48PMKF MEROPS:C01.032 IPI:IPI00602255 PIR:S00081
            UniGene:Gga.523 ProteinModelPortal:P09648 SMR:P09648 Uniprot:P09648
        Length = 218

 Score = 567 (204.7 bits), Expect = 6.1e-55, P = 6.1e-55
 Identities = 113/219 (51%), Positives = 143/219 (65%)

Query:   134 PKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDT-SF 192
             P+SVDWR+KG VTPVK+QG CGSCWAFST  A+EG +    G L SLSEQ L+DC     
Sbjct:     2 PRSVDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQHFRTKGKLVSLSEQNLVDCSRPEG 61

Query:   193 NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDE 252
             N GCNGGLMD AF+Y+  +GG+  EE YPY  ++      K E      +G+ D+P+  E
Sbjct:    62 NQGCNGGLMDQAFQYVQDNGGIDSEESYPYTAKDDEDCRYKAEYNAANDTGFVDIPQGHE 121

Query:   253 QSLLKALAHQ-PVSVAIEASGTDFQFYSGGVFTGP-CGAE-LDHGVAAVGYGKSKGSDYI 309
             ++L+KA+A   PVSVAI+A  + FQFY  G++  P C +E LDHGV  VGYG   G  Y 
Sbjct:   122 RALMKAVASVGPVSVAIDAGHSSFQFYQSGIYYEPDCSSEDLDHGVLVVGYGFEGGKKYW 181

Query:   310 IVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPL 348
             IVKNSWG KWG++GYI M ++    +  CGI   AS PL
Sbjct:   182 IVKNSWGEKWGDKGYIYMAKDR---KNHCGIATAASYPL 217


>GENEDB_PFALCIPARUM|PF11_0162 [details] [associations]
            symbol:PF11_0162 "falcipain-3" species:5833
            "Plasmodium falciparum" [GO:0020020 "food vacuole" evidence=TAS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00139 HSSP:P43235 EMBL:AE014186 GO:GO:0020020
            HOGENOM:HOG000065857 ProtClustDB:PTZ00021 RefSeq:XP_001347833.1
            ProteinModelPortal:Q8IIL0 SMR:Q8IIL0 MEROPS:C01.063
            EnsemblProtists:PF11_0162:mRNA GeneID:810709 KEGG:pfa:PF11_0162
            EuPathDB:PlasmoDB:PF3D7_1115400 OMA:ENDEDYW ChEMBL:CHEMBL1250373
            Uniprot:Q8IIL0
        Length = 492

 Score = 562 (202.9 bits), Expect = 2.1e-54, P = 2.1e-54
 Identities = 138/334 (41%), Positives = 180/334 (53%)

Query:    41 MDKL--IELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTS-YWLGLNEFA 97
             MD L  + LF  ++ ++ K Y+  EE   RF IF EN + I+  NK+  S Y  G+N+F 
Sbjct:   162 MDNLETVNLFYIFLKENNKKYETSEEMQKRFIIFSENYRKIELHNKKTNSLYKRGMNKFG 221

Query:    98 DMSHEEFKNKYLGLKPQ--FPTRRQP-SAEFSYRDV--KALPKSV-------DWRKKGAV 145
             D+S EEF++KYL LK    F T   P S E +Y DV  K  P          DWR  G V
Sbjct:   222 DLSPEEFRSKYLNLKTHGPFKTLSPPVSYEANYEDVIKKYKPADAKLDRIAYDWRLHGGV 281

Query:   146 TPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAF 205
             TPVK+Q  CGSCWAFS+V +VE    I    L   SEQEL+DC    NNGC GG +  AF
Sbjct:   282 TPVKDQALCGSCWAFSSVGSVESQYAIRKKALFLFSEQELVDCSVK-NNGCYGGYITNAF 340

Query:   206 KYIVASGGLHKEEDYPYLME-EGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPV 264
               ++  GGL  ++DYPY+     TC  K+   E  TI  Y  +P++  +  L+ L   P+
Sbjct:   341 DDMIDLGGLCSQDDYPYVSNLPETCNLKRCN-ERYTIKSYVSIPDDKFKEALRYLG--PI 397

Query:   265 SVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSK--GSD--------YIIVKNS 314
             S++I AS  DF FY GG + G CGA  +H V  VGYG       D        Y I+KNS
Sbjct:   398 SISIAASD-DFAFYRGGFYDGECGAAPNHAVILVGYGMKDIYNEDTGRMEKFYYYIIKNS 456

Query:   315 WGPKWGERGYIRMKRNTGKPEGLCGINKMASIPL 348
             WG  WGE GYI ++ +    +  C I   A +PL
Sbjct:   457 WGSDWGEGGYINLETDENGYKKTCSIGTEAYVPL 490


>UNIPROTKB|F1PMM9 [details] [associations]
            symbol:CTSL1 "Cathepsin L1" species:9615 "Canis lupus
            familiaris" [GO:0097067 "cellular response to thyroid hormone
            stimulus" evidence=IEA] [GO:0042393 "histone binding" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0004197 "cysteine-type
            endopeptidase activity" evidence=IEA] [GO:0002250 "adaptive immune
            response" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00639 GO:GO:0004197 GO:GO:0002250
            GeneTree:ENSGT00560000076577 GO:GO:0097067 EMBL:AAEX03000499
            Ensembl:ENSCAFT00000002029 OMA:EFKQVLN Uniprot:F1PMM9
        Length = 341

 Score = 562 (202.9 bits), Expect = 2.1e-54, P = 2.1e-54
 Identities = 130/331 (39%), Positives = 184/331 (55%)

Query:    30 IVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT-- 87
             I   +P+   S+D     +  W   HGK Y   EE   R  +++ N++ I+Q N+E +  
Sbjct:    22 IASAAPQQDHSLDAH---WSQWKEAHGKLYDKDEEGWRR-TVWERNMEMIEQHNQEYSQG 77

Query:    88 --SYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAV 145
               S+ L +N F DM++EEFK      K Q   + +    F       +P SVDWR++G V
Sbjct:    78 EHSFTLAMNAFGDMTNEEFKQVLNDFKIQ---KHKKGKVFPAPLFAEVPSSVDWREQGYV 134

Query:   146 TPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNN-GCNGGLMDYA 204
             TPVK+QG C  CWAFS   A+EG     +G L SLSEQ L+DC  S  N GCNGGLM+YA
Sbjct:   135 TPVKDQGQCLGCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSWSQGNRGCNGGLMEYA 194

Query:   205 FKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQ-P 263
             F+Y+  +GGL  EE YPYL     C+ + E+     ++ +  +  N+E  L+  +A   P
Sbjct:   195 FQYVKDNGGLDSEESYPYLARNEPCKYRPEK-SAANVTAFWPIL-NEEDGLMTTVATVGP 252

Query:   264 VSVAIEASGTDFQFYSGGVFTGP-CGAEL-DHGVAAVGYG----KSKGSDYIIVKNSWGP 317
             VS A+++S   FQFY  G++  P C  +L +HGV  VGYG    +S    Y IVKNSWG 
Sbjct:   253 VSAAVDSSPQSFQFYKKGIYYDPKCSNKLLNHGVLVVGYGFEGAESDNKKYWIVKNSWGT 312

Query:   318 KWGERGYIRMKRNTGKPEGLCGINKMASIPL 348
              WG +GY+ + ++    +  CGI   AS P+
Sbjct:   313 NWGMQGYMLLAKDR---DNHCGIATRASYPV 340


>UNIPROTKB|Q8IIL0 [details] [associations]
            symbol:PF11_0162 "Falcipain-3" species:36329 "Plasmodium
            falciparum 3D7" [GO:0020020 "food vacuole" evidence=TAS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00139 HSSP:P43235 EMBL:AE014186 GO:GO:0020020
            HOGENOM:HOG000065857 ProtClustDB:PTZ00021 RefSeq:XP_001347833.1
            ProteinModelPortal:Q8IIL0 SMR:Q8IIL0 MEROPS:C01.063
            EnsemblProtists:PF11_0162:mRNA GeneID:810709 KEGG:pfa:PF11_0162
            EuPathDB:PlasmoDB:PF3D7_1115400 OMA:ENDEDYW ChEMBL:CHEMBL1250373
            Uniprot:Q8IIL0
        Length = 492

 Score = 562 (202.9 bits), Expect = 2.1e-54, P = 2.1e-54
 Identities = 138/334 (41%), Positives = 180/334 (53%)

Query:    41 MDKL--IELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTS-YWLGLNEFA 97
             MD L  + LF  ++ ++ K Y+  EE   RF IF EN + I+  NK+  S Y  G+N+F 
Sbjct:   162 MDNLETVNLFYIFLKENNKKYETSEEMQKRFIIFSENYRKIELHNKKTNSLYKRGMNKFG 221

Query:    98 DMSHEEFKNKYLGLKPQ--FPTRRQP-SAEFSYRDV--KALPKSV-------DWRKKGAV 145
             D+S EEF++KYL LK    F T   P S E +Y DV  K  P          DWR  G V
Sbjct:   222 DLSPEEFRSKYLNLKTHGPFKTLSPPVSYEANYEDVIKKYKPADAKLDRIAYDWRLHGGV 281

Query:   146 TPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAF 205
             TPVK+Q  CGSCWAFS+V +VE    I    L   SEQEL+DC    NNGC GG +  AF
Sbjct:   282 TPVKDQALCGSCWAFSSVGSVESQYAIRKKALFLFSEQELVDCSVK-NNGCYGGYITNAF 340

Query:   206 KYIVASGGLHKEEDYPYLME-EGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPV 264
               ++  GGL  ++DYPY+     TC  K+   E  TI  Y  +P++  +  L+ L   P+
Sbjct:   341 DDMIDLGGLCSQDDYPYVSNLPETCNLKRCN-ERYTIKSYVSIPDDKFKEALRYLG--PI 397

Query:   265 SVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSK--GSD--------YIIVKNS 314
             S++I AS  DF FY GG + G CGA  +H V  VGYG       D        Y I+KNS
Sbjct:   398 SISIAASD-DFAFYRGGFYDGECGAAPNHAVILVGYGMKDIYNEDTGRMEKFYYYIIKNS 456

Query:   315 WGPKWGERGYIRMKRNTGKPEGLCGINKMASIPL 348
             WG  WGE GYI ++ +    +  C I   A +PL
Sbjct:   457 WGSDWGEGGYINLETDENGYKKTCSIGTEAYVPL 490


>UNIPROTKB|F1NZ37 [details] [associations]
            symbol:LOC420160 "Uncharacterized protein" species:9031
            "Gallus gallus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GeneTree:ENSGT00560000076577 EMBL:AADN02062018
            IPI:IPI00587784 Ensembl:ENSGALT00000006765 OMA:CGVANQA
            Uniprot:F1NZ37
        Length = 340

 Score = 561 (202.5 bits), Expect = 2.6e-54, P = 2.6e-54
 Identities = 128/323 (39%), Positives = 177/323 (54%)

Query:    39 TSMDKLIE-LFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT----SYWLGL 93
             T++D ++E  +E W S + K Y   E +L R E+++ NL+ I+Q N E +    ++ LG+
Sbjct:    24 TALDPVLEEAWERWKSLYAKEYPG-EAELIRREVWENNLRRIEQHNWEESQGQHTFRLGM 82

Query:    94 NEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGS 153
             N + D+  EEF     G  P      +P+  F     +  P  VDWR +G VTPVKNQG 
Sbjct:    83 NHYGDLMDEEFNQLLNGFAPV--QHEEPALTFQASAAQKTPAEVDWRMRGYVTPVKNQGH 140

Query:   154 CGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASG 212
             CGSCWAFS   A+EG+    +G L  LSEQ LIDC     NNGC GG M  AF+Y+  +G
Sbjct:   141 CGSCWAFSATGALEGLVFNWTGKLAVLSEQNLIDCSWKLGNNGCQGGYMTRAFQYVHDNG 200

Query:   213 GLHKEEDYPY-LMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQ-PVSVAIEA 270
             G++ E  YPY   +  +C     +      S    V +  E +L +A+A   PVSVA++A
Sbjct:   201 GMNSEHIYPYQATDTSSCRYNPAD-RAANCSTVWLVAQGSEAALEQAVATVGPVSVAVDA 259

Query:   271 SGTDFQFYSGGVFTGP-CGAELDHGVAAVGYGKS----KGSDYIIVKNSWGPKWGERGYI 325
             S   F FY  G+F    C  +++HG+ AVGYG S    K   Y I+KNSW   WGE+GYI
Sbjct:   260 SSFFFHFYKSGIFNSMFCSQKVNHGMLAVGYGISQEARKNVSYWILKNSWSEVWGEKGYI 319

Query:   326 RMKRNTGKPEGLCGINKMASIPL 348
             R+ +        CG+   AS PL
Sbjct:   320 RLLKGVNNH---CGVANQASFPL 339


>TAIR|locus:2078312 [details] [associations]
            symbol:AT3G45310 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0005773 EMBL:CP002686
            GenomeReviews:BA000014_GR eggNOG:COG4870 HOGENOM:HOG000230774
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 EMBL:AL132953
            EMBL:AY091771 IPI:IPI00540369 PIR:T47471 RefSeq:NP_566880.1
            UniGene:At.25239 ProteinModelPortal:Q8RWQ9 SMR:Q8RWQ9
            MEROPS:C01.162 PaxDb:Q8RWQ9 PRIDE:Q8RWQ9 EnsemblPlants:AT3G45310.1
            GeneID:823669 KEGG:ath:AT3G45310 GeneFarm:5032 TAIR:At3g45310
            InParanoid:Q8RWQ9 KO:K01366 OMA:AFEVVHE PhylomeDB:Q8RWQ9
            ProtClustDB:CLSN2689015 Genevestigator:Q8RWQ9 Uniprot:Q8RWQ9
        Length = 358

 Score = 558 (201.5 bits), Expect = 5.5e-54, P = 5.5e-54
 Identities = 125/307 (40%), Positives = 175/307 (57%)

Query:    48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNK 107
             F  +  ++GK Y+ +EE   RF +FKENL  I   NK+  SY L LN+FAD++ +EF+  
Sbjct:    59 FSRFTHRYGKKYQSVEEMKLRFSVFKENLDLIRSTNKKGLSYKLSLNQFADLTWQEFQRY 118

Query:   108 YLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVE 167
              LG         + S + +   V   P + DWR+ G V+PVK QG CGSCW FST  A+E
Sbjct:   119 KLGAAQNCSATLKGSHKITEATV---PDTKDWREDGIVSPVKEQGHCGSCWTFSTTGALE 175

Query:   168 GINQIVSGNLTSLSEQELIDCDTSFNN-GCNGGLMDYAFKYIVASGGLHKEEDYPYLMEE 226
                    G   SLSEQ+L+DC  +FNN GC+GGL   AF+YI  +GGL  EE YPY  ++
Sbjct:   176 AAYHQAFGKGISLSEQQLVDCAGTFNNFGCHGGLPSQAFEYIKYNGGLDTEEAYPYTGKD 235

Query:   227 GTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASGTDFQFYSGGVFTG 285
             G C+   + + V  +    ++    E  L  A+   +PVSVA E    +F+FY  GVFT 
Sbjct:   236 GGCKFSAKNIGV-QVRDSVNITLGAEDELKHAVGLVRPVSVAFEVVH-EFRFYKKGVFTS 293

Query:   286 -PCG---AELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGIN 341
               CG    +++H V AVGYG      Y ++KNSWG +WG+ GY +M+   GK   +CG+ 
Sbjct:   294 NTCGNTPMDVNHAVLAVGYGVEDDVPYWLIKNSWGGEWGDNGYFKMEM--GK--NMCGVA 349

Query:   342 KMASIPL 348
               +S P+
Sbjct:   350 TCSSYPV 356


>TAIR|locus:2082687 [details] [associations]
            symbol:AT3G54940 species:3702 "Arabidopsis thaliana"
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 EMBL:CP002686 HOGENOM:HOG000230774 KO:K01373
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 HSSP:P53634
            OMA:GGGLMTN EMBL:AY070063 IPI:IPI00528988 RefSeq:NP_567010.5
            UniGene:At.28412 ProteinModelPortal:Q8VYS0 SMR:Q8VYS0 PRIDE:Q8VYS0
            EnsemblPlants:AT3G54940.2 GeneID:824659 KEGG:ath:AT3G54940
            TAIR:At3g54940 PhylomeDB:Q8VYS0 ProtClustDB:CLSN2718801
            ArrayExpress:Q8VYS0 Genevestigator:Q8VYS0 Uniprot:Q8VYS0
        Length = 367

 Score = 558 (201.5 bits), Expect = 5.5e-54, P = 5.5e-54
 Identities = 128/319 (40%), Positives = 181/319 (56%)

Query:    48 FESWMSKHGKTYKCIEEKLHRFEIFKEN-LKHIDQRNKEVTSYWLGLNEFADMSHEEFKN 106
             F  +MS +GK Y   EE +HR  IF +N LK  + +  + ++   G+ +F+D++ EEFK 
Sbjct:    51 FRLFMSDYGKNYSTREEYIHRLGIFAKNVLKAAEHQMMDPSAVH-GVTQFSDLTEEEFKR 109

Query:   107 KYLGLKPQFPTRRQP-SAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAA 165
              Y G+     +R     AE    +V  LP+  DWR+KG VT VKNQG+CGSCWAFST  A
Sbjct:   110 MYTGVADVGGSRGGTVGAEAPMVEVDGLPEDFDWREKGGVTEVKNQGACGSCWAFSTTGA 169

Query:   166 VEGINQIVSGNLTSLSEQELIDCDTSFN--------NGCNGGLMDYAFKYIVASGGLHKE 217
              EG + + +G L SLSEQ+L+DCD + +        NGC GGLM  A++Y++ +GGL +E
Sbjct:   170 AEGAHFVSTGKLLSLSEQQLVDCDQACDPKDKKACDNGCGGGLMTNAYEYLMEAGGLEEE 229

Query:   218 EDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQF 277
               YPY  + G C+   E++  V +  +  +P ++ Q     + H P++V + A     Q 
Sbjct:   230 RSYPYTGKRGHCKFDPEKV-AVRVLNFTTIPLDENQIAANLVRHGPLAVGLNA--VFMQT 286

Query:   278 YSGGVFTGP--CGAE-LDHGVAAVGYGKSKG--------SDYIIVKNSWGPKWGERGYIR 326
             Y GGV + P  C    ++HGV  VGYG SKG          Y I+KNSWG KWGE GY +
Sbjct:   287 YIGGV-SCPLICSKRNVNHGVLLVGYG-SKGFSILRLSNKPYWIIKNSWGKKWGENGYYK 344

Query:   327 MKRNTGKPEGLCGINKMAS 345
             + R       +CGIN M S
Sbjct:   345 LCRG----HDICGINSMVS 359


>RGD|69241
            symbol:Ctsj "cathepsin J" species:10116 "Rattus norvegicus"
            [GO:0005764 "lysosome" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0048471 "perinuclear region of cytoplasm"
            evidence=IDA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 EMBL:L14776
            RGD:69241 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246
            SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0005764 HOVERGEN:HBG011513 MEROPS:C01.038 CTD:26898 KO:K09599
            EMBL:AF310623 EMBL:BC097263 IPI:IPI00205027 PIR:I58002
            RefSeq:NP_058817.1 UniGene:Rn.34875 ProteinModelPortal:Q63088
            SMR:Q63088 PRIDE:Q63088 GeneID:29174 KEGG:rno:29174 NextBio:608244
            Genevestigator:Q63088 Uniprot:Q63088
        Length = 334

 Score = 553 (199.7 bits), Expect = 1.9e-53, P = 1.9e-53
 Identities = 129/333 (38%), Positives = 187/333 (56%)

Query:    28 FSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKE-- 85
             F +   +P    ++D   E ++ W +K+ K+Y  +EE+L R  +++ENLK I   NKE  
Sbjct:    12 FGVASGAPARDPNLDA--E-WQDWKTKYAKSYSPVEEELKR-AVWEENLKMIQLHNKENG 67

Query:    86 --VTSYWLGLNEFADMSHEEFKNKYLG-LKPQFPTRRQPSAEFSYRDVK-ALPKSVDWRK 141
                  + + +N FAD + EEF+      L P   T   PSA+   + V   LP   DWRK
Sbjct:    68 LGKNGFTMEMNAFADTTGEEFRKSLSDILIPAAVTN--PSAQ---KQVSIGLPNFKDWRK 122

Query:   142 KGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGL 200
             +G VTPV+NQG CGSCWAF+ V A+EG     +GNLT LS Q L+DC  S  NNGC  G 
Sbjct:   123 EGYVTPVRNQGKCGSCWAFAAVGAIEGQMFSKTGNLTPLSVQNLLDCSKSEGNNGCRWGT 182

Query:   201 MDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALA 260
                AF Y++ + GL  E  YPY  ++G C    E      I+G+ ++P N+    +   +
Sbjct:   183 AHQAFNYVLKNKGLEAEATYPYEGKDGPCRYHSENASA-NITGFVNLPPNELYLWVAVAS 241

Query:   261 HQPVSVAIEASGTDFQFYSGGVFTGP-CGAEL-DHGVAAVGYG----KSKGSDYIIVKNS 314
               PVS AI+AS   F+FYSGGV+  P C + + +H V  VGYG    ++ G++Y ++KNS
Sbjct:   242 IGPVSAAIDASHDSFRFYSGGVYHEPNCSSYVVNHAVLVVGYGFEGNETDGNNYWLIKNS 301

Query:   315 WGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
             WG +WG  G++++ ++       CGI   AS P
Sbjct:   302 WGEEWGINGFMKIAKDRNNH---CGIASQASFP 331


>ZFIN|ZDB-GENE-040426-1583
            symbol:ctssa "cathepsin S, a" species:7955 "Danio
            rerio" [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0016787 "hydrolase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 ZFIN:ZDB-GENE-040426-1583
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 HOVERGEN:HBG011513
            GeneTree:ENSGT00660000095458 EMBL:CR548627 IPI:IPI00491948
            UniGene:Dr.81560 SMR:Q1L8W8 Ensembl:ENSDART00000053638 OMA:RNTREER
            OrthoDB:EOG480HX9 Uniprot:Q1L8W8
        Length = 328

 Score = 553 (199.7 bits), Expect = 1.9e-53, P = 1.9e-53
 Identities = 121/313 (38%), Positives = 173/313 (55%)

Query:    43 KLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT----SYWLGLNEFAD 98
             +L   + +W S+H KTY+   E+  R  ++K+NL+ I   N+       SY LGLN+ +D
Sbjct:    22 RLTNQWTTWKSQHNKTYRNTREERLRRSVWKQNLQDILLHNEAAAVGLHSYTLGLNQLSD 81

Query:    99 MSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCW 158
             M+ +E  +    L+  FP     +A FS   ++ LP+ V+W + G V+PV+NQG CGSCW
Sbjct:    82 MTADEVNDMNGLLEEDFP---DVNATFSPPSLQTLPQRVNWTEHGMVSPVQNQGPCGSCW 138

Query:   159 AFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNN-GCNGGLMDYAFKYIVASGGLHKE 217
             AFS V ++E   +  +  L  LS Q L+DC  S  N GC GG +  AF Y++ + G+   
Sbjct:   139 AFSAVGSLEAQMKRRTAALVPLSAQNLLDCSVSLGNRGCKGGFLSRAFLYVIQNRGIDSS 198

Query:   218 EDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASGTDFQ 276
               YPY  +EG C            +G++ VP ++E +L  A+A+  PVSV I A    F 
Sbjct:   199 TFYPYEHKEGVCRYSVSG-RAGYCTGFRIVPRHNEAALQSAVANIGPVSVGINAKLLSFH 257

Query:   277 FYSGGVFTGP-CGAEL-DHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKP 334
              Y  G++  P C + L +H V  VGYG   G DY +VKNSWG  WGE GYIRM RN    
Sbjct:   258 RYRSGIYNDPKCSSALINHAVLVVGYGSENGQDYWLVKNSWGTAWGENGYIRMARN---- 313

Query:   335 EGLCGINKMASIP 347
             + +CGI+     P
Sbjct:   314 KNMCGISSFGIYP 326


>FB|FBgn0250848
            symbol:26-29-p "26-29kD-proteinase" species:7227 "Drosophila
            melanogaster" [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=ISS] [GO:0006508 "proteolysis" evidence=IEA] [GO:0005811
            "lipid particle" evidence=IDA] [GO:0005875 "microtubule associated
            complex" evidence=IDA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0005875 EMBL:AE014296 GO:GO:0005811 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00660000095458 MEROPS:I29.003 HSSP:O65039
            EMBL:AY122222 EMBL:AB011376 RefSeq:NP_620470.1 UniGene:Dm.3049
            SMR:Q9V3U6 MINT:MINT-890485 STRING:Q9V3U6
            EnsemblMetazoa:FBtr0075766 GeneID:39547 KEGG:dme:Dmel_CG8947
            UCSC:CG8947-RA CTD:39547 FlyBase:FBgn0250848 InParanoid:Q9V3U6
            OMA:IHSKNRA OrthoDB:EOG4BVQ8T GenomeRNAi:39547 NextBio:814210
            Uniprot:Q9V3U6
        Length = 549

 Score = 550 (198.7 bits), Expect = 3.8e-53, P = 3.8e-53
 Identities = 128/312 (41%), Positives = 176/312 (56%)

Query:    36 EHLTSMDKLIE-LFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLN 94
             E ++  D+ ++  F  +  KHG  Y    E  HR  IF++NL++I  +N+   +Y L +N
Sbjct:   232 EFISGTDEHVDKAFHHFKRKHGVAYHSDTEHEHRKNIFRQNLRYIHSKNRAKLTYTLAVN 291

Query:    95 EFADMSHEEFKNKYLGLKPQ--FPTRRQ-PSAEFSYRDVKALPKSVDWRKKGAVTPVKNQ 151
               AD + EE K +  G K    + T +  P     Y+D   +P   DWR  GAVTPVK+Q
Sbjct:   292 HLADKTEEELKARR-GYKSSGIYNTGKPFPYDVPKYKD--EIPDQYDWRLYGAVTPVKDQ 348

Query:   152 GSCGSCWAFSTVAAVEGINQIVSG-NLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIV 209
               CGSCW+F T+  +EG   + +G NL  LS+Q LIDC  ++ NNGC+GG     +++++
Sbjct:   349 SVCGSCWSFGTIGHLEGAFFLKNGGNLVRLSQQALIDCSWAYGNNGCDGGEDFRVYQWML 408

Query:   210 ASGGLHKEEDY-PYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSL-LKALAHQPVSVA 267
              SGG+  EE+Y PYL ++G C      + V  I G+ +V  ND  +  L  L H P+SVA
Sbjct:   409 QSGGVPTEEEYGPYLGQDGYCHVNNVTL-VAPIKGFVNVTSNDPNAFKLALLKHGPLSVA 467

Query:   268 IEASGTDFQFYSGGVFTGP-CGAE---LDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERG 323
             I+AS   F FYS GV+  P C  +   LDH V AVGYG   G DY +VKNSW   WG  G
Sbjct:   468 IDASPKTFSFYSHGVYYEPTCKNDVDGLDHAVLAVGYGSINGEDYWLVKNSWSTYWGNDG 527

Query:   324 YIRM---KRNTG 332
             YI M   K N G
Sbjct:   528 YILMSAKKNNCG 539


>UNIPROTKB|P83443
            symbol:P83443 "Macrodontain-1" species:203992 "Pseudananas
            sagenarius" [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=IDA] [GO:0005575 "cellular_component" evidence=ND]
            [GO:0006508 "proteolysis" evidence=IDA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0004197
            ProteinModelPortal:P83443 SMR:P83443 MEROPS:C01.028 Uniprot:P83443
        Length = 213

 Score = 550 (198.7 bits), Expect = 3.8e-53, P = 3.8e-53
 Identities = 97/209 (46%), Positives = 141/209 (67%)

Query:   132 ALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTS 191
             A+P+S+DWR  GAV  VKNQG CG CWAF+ +A VEGI +I  GNL  LSEQE++DC  S
Sbjct:     1 AVPQSIDWRDYGAVNEVKNQGPCGGCWAFAAIATVEGIYKIRKGNLVYLSEQEVLDCAVS 60

Query:   192 FNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPEND 251
             +  GC GG ++ A+ +I+++ G+  +E+YPY   +GTC           I+GY  V  ND
Sbjct:    61 Y--GCKGGWVNRAYDFIISNNGVTTDENYPYRAYQGTCNANYFPNSAY-ITGYSYVRRND 117

Query:   252 EQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIV 311
             E  ++ A+++QP++  I+ASG +FQ+Y GGV++GPCG  L+H +  +GYG+     Y IV
Sbjct:   118 ESHMMYAVSNQPIAALIDASGDNFQYYKGGVYSGPCGFSLNHAITIIGYGRDS---YWIV 174

Query:   312 KNSWGPKWGERGYIRMKRNTGKPEGLCGI 340
             +NSWG  WG+ GY+R++R+     G+CGI
Sbjct:   175 RNSWGSSWGQGGYVRIRRDVSHSGGVCGI 203


>UNIPROTKB|F1NHB8
            symbol:F1NHB8 "Uncharacterized protein" species:9031
            "Gallus gallus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GeneTree:ENSGT00660000095458 EMBL:AADN02044011
            IPI:IPI00586027 Ensembl:ENSGALT00000021873 OMA:SELDHAV
            Uniprot:F1NHB8
        Length = 329

 Score = 545 (196.9 bits), Expect = 1.3e-52, P = 1.3e-52
 Identities = 124/309 (40%), Positives = 167/309 (54%)

Query:    47 LFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKN 106
             LF  +  + GK Y   EE  HR   F  N++ +  +N+   SY L LN  AD + +E   
Sbjct:    25 LFHHYKERFGKRYSSEEEHEHRKRTFIHNMRFVHSKNRAALSYSLALNHLADRTPQEMAA 84

Query:   107 KYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAV 166
                  +   P   QP +   Y  +  LP+S+DWR  GAVTPVK+Q  CGSCW+F+T  A+
Sbjct:    85 LRGRRRSGDPKSGQPFSMQLYASL-VLPESLDWRLYGAVTPVKDQAVCGSCWSFATTGAM 143

Query:   167 EGINQIVSGNLTSLSEQELIDCDTSFNN-GCNGGLMDYAFKYIVASGGLHKEEDY-PYLM 224
             EG   + +G LT LS+Q LIDC   F N  C+GG    A+++I   GG+   E Y PYL 
Sbjct:   144 EGALFLKTGVLTPLSQQVLIDCSWGFGNYACDGGEEWRAYEWIKKHGGIASTESYGPYLG 203

Query:   225 EEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKAL-AHQPVSVAIEASGTDFQFYSGGVF 283
             + G C   + E+ V  ++GY  V   + ++L  AL  H PV+V I+AS   F FY+ GV+
Sbjct:   204 QNGYCHYNQSEL-VAPLAGYVTVESGNAEALKAALFKHGPVAVNIDASHKSFTFYANGVY 262

Query:   284 TGP-CG---AELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCG 339
               P CG   +ELDH V AVGYG   G  Y ++KNSW   WG  GYI M          CG
Sbjct:   263 EEPHCGNETSELDHAVLAVGYGVLHGKSYWLIKNSWSTYWGNDGYILMAMKDNN----CG 318

Query:   340 INKMASIPL 348
             +   AS P+
Sbjct:   319 VATAASFPI 327


>UNIPROTKB|Q90686
            symbol:CTSK "Cathepsin K" species:9031 "Gallus gallus"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015644 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 HOVERGEN:HBG011513
            CTD:1513 KO:K01371 PANTHER:PTHR12411:SF55 EMBL:U37691
            IPI:IPI00575213 RefSeq:NP_990302.1 UniGene:Gga.51509
            ProteinModelPortal:Q90686 SMR:Q90686 MEROPS:C01.036 GeneID:395818
            KEGG:gga:395818 NextBio:20815886 Uniprot:Q90686
        Length = 334

 Score = 542 (195.9 bits), Expect = 2.7e-52, P = 2.7e-52
 Identities = 117/266 (43%), Positives = 166/266 (62%)

Query:    88 SYWLGLNEFADMSHEEFKNKYLGLK-PQFPTRRQPSAEFSYRDVKA-LPKSVDWRKKGAV 145
             S+ L +N   DM+ EE      GL+ P+  +R +P+      D  +  P +VDWR+KG V
Sbjct:    75 SFQLAMNYLGDMTSEEVVRTMTGLRVPR--SRPRPNGTLYVPDWSSRAPAAVDWRRKGYV 132

Query:   146 TPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAF 205
             TPVK+QG CGSCWAFS+V A+EG  +  +G L SLS Q L+ C  S NNGC GG M  AF
Sbjct:   133 TPVKDQGQCGSCWAFSSVGALEGQLKRRTGKLLSLSPQNLVYC-VSNNNGCGGGYMTNAF 191

Query:   206 KYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPV 264
             +Y+  + G+  E+ YPY+ ++ +C       +     GY+++PE++E++L +A+A   PV
Sbjct:   192 EYVRLNRGIDSEDAYPYIGQDESCMYSPTG-KAAKCRGYREIPEDNEKALKRAVARIGPV 250

Query:   265 SVAIEASGTDFQFYSGGVF--TGPCGAE-LDHGVAAVGYGKSKGSDYIIVKNSWGPKWGE 321
             SV I+AS   FQFYS GV+  TG C  E ++H V AVGYG  KG+ + I+KNSWG +WG 
Sbjct:   251 SVGIDASLPSFQFYSRGVYYDTG-CNPENINHAVLAVGYGAQKGTKHWIIKNSWGTEWGN 309

Query:   322 RGYIRMKRNTGKPEGLCGINKMASIP 347
             +GY+ + RN  +    CGI  +AS P
Sbjct:   310 KGYVLLARNMKQT---CGIANLASFP 332


>UNIPROTKB|Q10991
            symbol:CTSL "Cathepsin L1" species:9940 "Ovis aries"
            [GO:0005515 "protein binding" evidence=IPI] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 HOVERGEN:HBG011513
            MEROPS:C01.032 ProteinModelPortal:Q10991 SMR:Q10991 Uniprot:Q10991
        Length = 217

 Score = 540 (195.1 bits), Expect = 4.4e-52, P = 4.4e-52
 Identities = 110/220 (50%), Positives = 143/220 (65%)

Query:   133 LPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDT-S 191
             +PKSVDW KKG VTPVKNQG CGSCWAFS   A+EG     +G L SLSEQ L+D     
Sbjct:     1 VPKSVDWTKKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDSSRPQ 60

Query:   192 FNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPEND 251
              N GCNGGLMD AF+YI  +GGL  EE YPY   + +C + K E      +G+ D+P+  
Sbjct:    61 GNQGCNGGLMDNAFQYIKENGGLDSEESYPYEATDTSC-NYKPEYSAAKDTGFVDIPQR- 118

Query:   252 EQSLLKALAHQ-PVSVAIEASGTDFQFYSGGVFTGP-CGA-ELDHGVAAVGYG-KSKGSD 307
             E++L+KA+A   P+SVAI+A  + FQFY  G++  P C + +LDHGV  VGYG +   + 
Sbjct:   119 EKALMKAVATVGPISVAIDAGHSSFQFYKSGIYYDPDCSSKDLDHGVLVVGYGFEGTNNK 178

Query:   308 YIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
             + IVKNSWGP+WG +GY++M ++       CGI   AS P
Sbjct:   179 FWIVKNSWGPEWGNKGYVKMAKDQNNH---CGIATAASYP 215


>ZFIN|ZDB-GENE-050522-559
            symbol:ctssb.1 "cathepsin S, b.1" species:7955
            "Danio rerio" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            ZFIN:ZDB-GENE-050522-559 eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 HOVERGEN:HBG011513 MEROPS:C01.034
            EMBL:BC095694 IPI:IPI00607338 UniGene:Dr.75553
            ProteinModelPortal:Q502H6 SMR:Q502H6 InParanoid:Q502H6
            ArrayExpress:Q502H6 Uniprot:Q502H6
        Length = 330

 Score = 536 (193.7 bits), Expect = 1.2e-51, P = 1.2e-51
 Identities = 123/319 (38%), Positives = 177/319 (55%)

Query:    39 TSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT----SYWLGLN 94
             T++D+  EL   W   +GK Y    E+  R ++++ NL+ I   N E +    SY L +N
Sbjct:    21 TNLDQHWEL---WKKTYGKIYTTEVEEFGRRQLWERNLQLITVHNLEASMGMHSYDLSMN 77

Query:    95 EFADMSHEEFKNKYLGLKPQFPT--RRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQG 152
                D++ EE     L L    P+  +RQ  A        A+P S+DWR+KG V+ VK QG
Sbjct:    78 HMGDLTTEEILQT-LALT-HVPSGFKRQ-IANIVGSSGDAVPDSLDWREKGYVSSVKMQG 134

Query:   153 SCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNN-GCNGGLMDYAFKYIVAS 211
             +CGSCWAFS+V A+EG  +  +G L  LS Q L+DC + + N GCNGG M  AF+Y++ +
Sbjct:   135 ACGSCWAFSSVGALEGQLKKTTGKLVDLSPQNLVDCSSKYGNKGCNGGFMSDAFQYVIDN 194

Query:   212 GGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQ-PVSVAIEA 270
             GG+  +  YPY   +  C     +      + Y  V + DE +L +A+A   P+SVAI+A
Sbjct:   195 GGIASDSAYPYRGVQQQCSYSSSQ-RAANCTKYYFVRQGDENALKQAVASVGPISVAIDA 253

Query:   271 SGTDFQFYSGGVFTGP-CGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKR 329
             +   F  Y  GV+  P C   ++H V  VGYG   G D+ +VKNSWG ++G+ GYIRM R
Sbjct:   254 TRPQFVLYHSGVYNDPTCSKRVNHAVLVVGYGTLSGQDHWLVKNSWGTRFGDGGYIRMAR 313

Query:   330 NTGKPEGLCGINKMASIPL 348
             N      +CGI   A  P+
Sbjct:   314 NKNN---MCGIASYACYPV 329


>GENEDB_PFALCIPARUM|PF11_0165
            symbol:PF11_0165 "falcipain 2 precursor"
            species:5833 "Plasmodium falciparum" [GO:0020020 "food vacuole"
            evidence=TAS] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00139
            PROSITE:PS00639 EMBL:AE014186 HSSP:O65039 GO:GO:0020020
            RefSeq:XP_001347836.1 ProteinModelPortal:Q8I6U4 SMR:Q8I6U4
            IntAct:Q8I6U4 MINT:MINT-1559493 MEROPS:C01.046
            EnsemblProtists:PF11_0165:mRNA GeneID:810712 KEGG:pfa:PF11_0165
            EuPathDB:PlasmoDB:PF3D7_1115700 HOGENOM:HOG000065857 OMA:NESLHAN
            ProtClustDB:PTZ00021 BindingDB:Q8I6U4 ChEMBL:CHEMBL3470
            Uniprot:Q8I6U4
        Length = 484

 Score = 535 (193.4 bits), Expect = 1.5e-51, P = 1.5e-51
 Identities = 125/344 (36%), Positives = 183/344 (53%)

Query:    27 DFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEV 86
             D+ I  +  + L +  + I  F  ++  + K Y    E   RF++F +N   ++  N   
Sbjct:   144 DYFINFFDNKFLMNNAEHINQFYMFIKTNNKQYNSPNEMKERFQVFLQNAHKVNMHNNNK 203

Query:    87 TS-YWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSA--EFSYRDVKALPK-------- 135
              S Y   LN FAD+++ EFKNKYL L+   P +       + +Y +V    K        
Sbjct:   204 NSLYKKELNRFADLTYHEFKNKYLSLRSSKPLKNSKYLLDQMNYEEVIKKYKGNENFDHA 263

Query:   136 SVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNN- 194
             + DWR    VTPVK+Q +CGSCWAFS++ +VE    I    L +LSEQEL+DC  SF N 
Sbjct:   264 AYDWRLHSGVTPVKDQKNCGSCWAFSSIGSVESQYAIRKNKLITLSEQELVDC--SFKNY 321

Query:   195 GCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQS 254
             GCNGGL++ AF+ ++  GG+  ++DYPY+ +     +     E   I  Y  VP+N  + 
Sbjct:   322 GCNGGLINNAFEDMIELGGICTDDDYPYVSDAPNLCNIDRCTEKYGIKNYLSVPDNKLKE 381

Query:   255 LLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKS--------KGS 306
              L+ L   P+S+++  S  DF FY  G+F G CG +L+H V  VG+G          KG 
Sbjct:   382 ALRFLG--PISISVAVSD-DFAFYKEGIFDGECGDQLNHAVMLVGFGMKEIVNPLTKKGE 438

Query:   307 D--YIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPL 348
                Y I+KNSWG +WGERG+I ++ +       CG+   A IPL
Sbjct:   439 KHYYYIIKNSWGQQWGERGFINIETDESGLMRKCGLGTDAFIPL 482


>UNIPROTKB|Q8I6U4
            symbol:PF11_0165 "Falcipain-2A" species:36329 "Plasmodium
            falciparum 3D7" [GO:0020020 "food vacuole" evidence=TAS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00139 PROSITE:PS00639
            EMBL:AE014186 HSSP:O65039 GO:GO:0020020 RefSeq:XP_001347836.1
            ProteinModelPortal:Q8I6U4 SMR:Q8I6U4 IntAct:Q8I6U4
            MINT:MINT-1559493 MEROPS:C01.046 EnsemblProtists:PF11_0165:mRNA
            GeneID:810712 KEGG:pfa:PF11_0165 EuPathDB:PlasmoDB:PF3D7_1115700
            HOGENOM:HOG000065857 OMA:NESLHAN ProtClustDB:PTZ00021
            BindingDB:Q8I6U4 ChEMBL:CHEMBL3470 Uniprot:Q8I6U4
        Length = 484

 Score = 535 (193.4 bits), Expect = 1.5e-51, P = 1.5e-51
 Identities = 125/344 (36%), Positives = 183/344 (53%)

Query:    27 DFSIVGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEV 86
             D+ I  +  + L +  + I  F  ++  + K Y    E   RF++F +N   ++  N   
Sbjct:   144 DYFINFFDNKFLMNNAEHINQFYMFIKTNNKQYNSPNEMKERFQVFLQNAHKVNMHNNNK 203

Query:    87 TS-YWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSA--EFSYRDVKALPK-------- 135
              S Y   LN FAD+++ EFKNKYL L+   P +       + +Y +V    K        
Sbjct:   204 NSLYKKELNRFADLTYHEFKNKYLSLRSSKPLKNSKYLLDQMNYEEVIKKYKGNENFDHA 263

Query:   136 SVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNN- 194
             + DWR    VTPVK+Q +CGSCWAFS++ +VE    I    L +LSEQEL+DC  SF N 
Sbjct:   264 AYDWRLHSGVTPVKDQKNCGSCWAFSSIGSVESQYAIRKNKLITLSEQELVDC--SFKNY 321

Query:   195 GCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQS 254
             GCNGGL++ AF+ ++  GG+  ++DYPY+ +     +     E   I  Y  VP+N  + 
Sbjct:   322 GCNGGLINNAFEDMIELGGICTDDDYPYVSDAPNLCNIDRCTEKYGIKNYLSVPDNKLKE 381

Query:   255 LLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKS--------KGS 306
              L+ L   P+S+++  S  DF FY  G+F G CG +L+H V  VG+G          KG 
Sbjct:   382 ALRFLG--PISISVAVSD-DFAFYKEGIFDGECGDQLNHAVMLVGFGMKEIVNPLTKKGE 438

Query:   307 D--YIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPL 348
                Y I+KNSWG +WGERG+I ++ +       CG+   A IPL
Sbjct:   439 KHYYYIIKNSWGQQWGERGFINIETDESGLMRKCGLGTDAFIPL 482


>UNIPROTKB|J9P7C5
            symbol:J9P7C5 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00560000076577 EMBL:AAEX03010953
            Ensembl:ENSCAFT00000012925 Uniprot:J9P7C5
        Length = 321

 Score = 534 (193.0 bits), Expect = 1.9e-51, P = 1.9e-51
 Identities = 129/316 (40%), Positives = 181/316 (57%)

Query:    43 KLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTS----YWLGLNEFAD 98
             KL + ++ W + H + Y   EE   R  ++++N+K I+  N+E +     + + +N F D
Sbjct:    20 KLDQRYQ-WKAMHRRLYGMNEEGWRR-AVWEKNMKMIELHNREYSQGKHGFTMAMNAFGD 77

Query:    99 MSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCW 158
             M++EEF+    G + Q   + +   E  + ++   PKSVDWR+KG VTPVKNQG CGSCW
Sbjct:    78 MTNEEFRQVINGFQNQKHKKGKVFQEPLFAEI---PKSVDWREKGYVTPVKNQGQCGSCW 134

Query:   159 AFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEE 218
             AFS   A EG     +GNL  LSEQ L       N GCNGGLMD AF+Y+  +  L  EE
Sbjct:   135 AFSATGAFEGQMFWKTGNLVPLSEQNL----AQGNEGCNGGLMDNAFQYVKDNRCLDSEE 190

Query:   219 DYPYL-MEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQ-PVSVAIEASGTDFQ 276
              YPYL  +  TC + K E      SG+ D+P+  E++L+KA+A    ++VAI+A    FQ
Sbjct:   191 SYPYLGRDTDTC-NYKPECSAAHDSGFVDLPQR-EKALMKAMATLGSITVAIDAGHQYFQ 248

Query:   277 FYSGGVFTGP-CGA-ELDHGVAAVGYGKSKGSDYI---IVKNSWGPKWGERGYIRMKRNT 331
             FY   ++  P C + +LDHGV  VGYG  +G+D     IVKNSW P+WG   Y++M +  
Sbjct:   249 FYKSSIYFDPDCSSKDLDHGVLVVGYG-FEGTDSNNKWIVKNSWSPEWGWNSYVKMAKGQ 307

Query:   332 GKPEGLCGINKMASIP 347
                   CGI   AS P
Sbjct:   308 NNH---CGITA-ASYP 319


>RGD|621513
            symbol:Ctss "cathepsin S" species:10116 "Rattus norvegicus"
            [GO:0001656 "metanephros development" evidence=IEP] [GO:0002250
            "adaptive immune response" evidence=ISO] [GO:0005764 "lysosome"
            evidence=IEA;ISO] [GO:0006508 "proteolysis" evidence=IEA;ISO]
            [GO:0008233 "peptidase activity" evidence=ISO] [GO:0008234
            "cysteine-type peptidase activity" evidence=IDA] [GO:0009986 "cell
            surface" evidence=IDA] [GO:0016020 "membrane" evidence=ISO]
            [GO:0043231 "intracellular membrane-bounded organelle"
            evidence=ISO] [GO:0045453 "bone resorption" evidence=IMP]
            [GO:0051930 "regulation of sensory perception of pain"
            evidence=IMP] [GO:0097067 "cellular response to thyroid hormone
            stimulus" evidence=ISO] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            RGD:621513 GO:GO:0009986 GO:GO:0051930 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 GO:GO:0045453
            GO:GO:0001656 HOVERGEN:HBG011513 CTD:1520 KO:K01368 MEROPS:I29.004
            BRENDA:3.4.22.27 EMBL:L03201 IPI:IPI00210228 PIR:A45087
            RefSeq:NP_059016.1 UniGene:Rn.11347 ProteinModelPortal:Q02765
            PhosphoSite:Q02765 PRIDE:Q02765 GeneID:50654 KEGG:rno:50654
            UCSC:RGD:621513 ChEMBL:CHEMBL1075217 NextBio:610462
            Genevestigator:Q02765 Uniprot:Q02765
        Length = 330

 Score = 533 (192.7 bits), Expect = 2.4e-51, P = 2.4e-51
 Identities = 130/297 (43%), Positives = 171/297 (57%)

Query:    63 EEKLHRFEIFKENLKHIDQRNKE----VTSYWLGLNEFADMSHEEFKNKYLG-LKPQFPT 117
             EE + R  I+++NLK I   N E    + SY +G+N   DM+ EE    Y+G L+   P 
Sbjct:    42 EEDVRRL-IWEKNLKFIMLHNLEHSMGMHSYSVGMNHMGDMTPEEVIG-YMGSLRIPRPW 99

Query:   118 RRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNL 177
              R  + + S    + LP SVDWR+KG VT VK QGSCGSCWAFS   A+EG  ++ +G L
Sbjct:   100 NRSGTLKSSSN--QTLPDSVDWREKGCVTNVKYQGSCGSCWAFSAEGALEGQLKLKTGKL 157

Query:   178 TSLSEQELIDCDTSF---NNGCNGGLMDYAFKYIVASGGLHKEEDYPY-LMEEGTCEDKK 233
              SLS Q L+DC T     N GC GG M  AF+YI+ +  +  E  YPY  M+E    D K
Sbjct:   158 VSLSAQNLVDCSTEEKYGNKGCGGGFMTEAFQYIIDTS-IDSEASYPYKAMDEKCLYDPK 216

Query:   234 EEMEVVTISGYQDVPENDEQSLLKALAHQ-PVSVAIE-ASGTDFQFYSGGVFTGP-CGAE 290
                   T S Y ++P  DE++L +A+A + PVSV I+ AS + F  Y  GV+  P C   
Sbjct:   217 NR--AATCSRYIELPFGDEEALKEAVATKGPVSVGIDDASHSSFFLYQSGVYDDPSCTEN 274

Query:   291 LDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
             ++HGV  VGYG   G DY +VKNSWG  +G++GYIRM RN    +  CGI    S P
Sbjct:   275 MNHGVLVVGYGTLDGKDYWLVKNSWGLHFGDQGYIRMARNN---KNHCGIASYCSYP 328


>UNIPROTKB|F1P3U9
            symbol:CTSH "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0001913 "T cell mediated cytotoxicity" evidence=IEA]
            [GO:0002764 "immune response-regulating signaling pathway"
            evidence=IEA] [GO:0004177 "aminopeptidase activity" evidence=IEA]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=IEA]
            [GO:0004252 "serine-type endopeptidase activity" evidence=IEA]
            [GO:0005615 "extracellular space" evidence=IEA] [GO:0005764
            "lysosome" evidence=IEA] [GO:0005829 "cytosol" evidence=IEA]
            [GO:0008284 "positive regulation of cell proliferation"
            evidence=IEA] [GO:0010628 "positive regulation of gene expression"
            evidence=IEA] [GO:0010634 "positive regulation of epithelial cell
            migration" evidence=IEA] [GO:0010813 "neuropeptide catabolic
            process" evidence=IEA] [GO:0010815 "bradykinin catabolic process"
            evidence=IEA] [GO:0016505 "apoptotic protease activator activity"
            evidence=IEA] [GO:0030108 "HLA-A specific activating MHC class I
            receptor activity" evidence=IEA] [GO:0031638 "zymogen activation"
            evidence=IEA] [GO:0031648 "protein destabilization" evidence=IEA]
            [GO:0032526 "response to retinoic acid" evidence=IEA] [GO:0033619
            "membrane protein proteolysis" evidence=IEA] [GO:0043066 "negative
            regulation of apoptotic process" evidence=IEA] [GO:0043129
            "surfactant homeostasis" evidence=IEA] [GO:0045766 "positive
            regulation of angiogenesis" evidence=IEA] [GO:0060448 "dichotomous
            subdivision of terminal units involved in lung branching"
            evidence=IEA] [GO:0070324 "thyroid hormone binding" evidence=IEA]
            [GO:0070371 "ERK1 and ERK2 cascade" evidence=IEA] [GO:0097067
            "cellular response to thyroid hormone stimulus" evidence=IEA]
            [GO:0097208 "alveolar lamellar body" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0005829 GO:GO:0043066
            GO:GO:0005615 GO:GO:0008284 GO:GO:0070371 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045766 GO:GO:0004177
            GO:GO:0004252 GO:GO:0005764 GO:GO:0032526 GO:GO:0010628
            GO:GO:0070324 GO:GO:0016505 GO:GO:0010634 GO:GO:0004197
            GO:GO:0042599 GO:GO:0031648 GO:GO:0097067 GO:GO:0031638
            GO:GO:0001913 GeneTree:ENSGT00660000095458 OMA:STSCHKT
            GO:GO:0030108 GO:GO:0010815 GO:GO:0002764 GO:GO:0033619
            GO:GO:0010813 EMBL:AADN02038832 EMBL:AADN02038831 IPI:IPI00594147
            Ensembl:ENSGALT00000013440 Uniprot:F1P3U9
        Length = 261

 Score = 527 (190.6 bits), Expect = 1.1e-50, P = 1.1e-50
 Identities = 115/267 (43%), Positives = 162/267 (60%)

Query:    89 YWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGA-VTP 147
             + + LN+F+DM+  EFK  YL  +PQ  +  +    F   D    P++VDWRKKG  VTP
Sbjct:     1 FLVALNQFSDMTFAEFKKLYLWSEPQNCSATR--GNFLRSDGPC-PEAVDWRKKGNFVTP 57

Query:   148 VKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNN-GCNGGLMDYAFK 206
             VKNQG CGSCW FST   +E    I +G L SL+EQ L+DC  +FNN GC+GGL   AF+
Sbjct:    58 VKNQGPCGSCWTFSTTGCLESAIAIATGKLLSLAEQLLVDCAQAFNNHGCSGGLPSQAFE 117

Query:   207 YIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALA-HQPVS 265
             YI+ + GL  E+ YPY  + GTC+ + ++  +  +    ++ + DE  +++A+  H PVS
Sbjct:   118 YILYNKGLMGEDAYPYRAQNGTCKFQPDKA-IAFVKDVINITQYDEAGMVEAVGKHNPVS 176

Query:   266 VAIEASGTDFQFYSGGVFTGP-CGA---ELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGE 321
              A E + +DF  Y  GV++ P C     +++H V AVGYG+  G  Y IVKNSWGP WG 
Sbjct:   177 FAFEVT-SDFMHYRKGVYSNPRCEHTPDKVNHAVLAVGYGEEDGRPYWIVKNSWGPLWGM 235

Query:   322 RGYIRMKRNTGKPEGLCGINKMASIPL 348
              GY  ++R  GK   +CG+   AS P+
Sbjct:   236 DGYFLIER--GK--NMCGLAACASYPV 258


>MGI|MGI:1922258
            symbol:4930486L24Rik "RIKEN cDNA 4930486L24 gene"
            species:10090 "Mus musculus" [GO:0005576 "extracellular region"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0030054 "cell
            junction" evidence=ISO] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 MGI:MGI:1922258
            GO:GO:0005576 GO:GO:0030054 eggNOG:COG4870 HOGENOM:HOG000230774
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 HSSP:P07711
            EMBL:AY146988 EMBL:AK145933 EMBL:BC061218 IPI:IPI00280732
            RefSeq:NP_835199.1 UniGene:Mm.19839 ProteinModelPortal:Q80UB0
            SMR:Q80UB0 MEROPS:C01.972 PRIDE:Q80UB0 Ensembl:ENSMUST00000091569
            GeneID:214639 KEGG:mmu:214639 UCSC:uc007qvs.1 InParanoid:Q80UB0
            OMA:RYHAENS OrthoDB:EOG4XWG0N NextBio:374408 Bgee:Q80UB0
            CleanEx:MM_4930486L24RIK Genevestigator:Q80UB0 Uniprot:Q80UB0
        Length = 333

 Score = 525 (189.9 bits), Expect = 1.7e-50, P = 1.7e-50
 Identities = 119/322 (36%), Positives = 178/322 (55%)

Query:    40 SMDKLIEL-FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTS----YWLGLN 94
             ++D  +++ +  W +KHGK Y   EE+L R  ++++N K I+  N E       + + +N
Sbjct:    20 TLDPSLDVQWNEWRTKHGKAYNVNEERLRR-AVWEKNFKMIELHNWEYLEGKHDFTMTMN 78

Query:    95 EFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSC 154
              F D+++ EF     G + Q   R     +  +  V   PK VDWR  G VTPVKNQG C
Sbjct:    79 AFGDLTNTEFVKMMTGFRRQKIKRMHVFQDHQFLYV---PKYVDWRMLGYVTPVKNQGYC 135

Query:   155 GSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDC-DTSFNNGCNGGLMDYAFKYIVASGG 213
              S WAFS   ++EG     +G L  LSEQ L+DC  ++  + C+GG M  AF+Y+  +GG
Sbjct:   136 ASSWAFSATGSLEGQMFKKTGRLVPLSEQNLLDCMGSNVTHDCSGGFMQNAFQYVKDNGG 195

Query:   214 LHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQ-PVSVAIEASG 272
             L  EE YPY+     C    E      +  +  +P  +E +L+KA+A   P+SVA++AS 
Sbjct:   196 LATEESYPYIGPGRKCRYHAEN-SAANVRDFVQIPGREE-ALMKAVAKVGPISVAVDASH 253

Query:   273 TDFQFYSGGVFTGP-CG-AELDHGVAAVGYG----KSKGSDYIIVKNSWGPKWGERGYIR 326
               FQFY  G++  P C    L+H V  VGYG    +S G+ Y +VKNSWG +WG +GYI+
Sbjct:   254 DSFQFYDSGIYYEPQCKRVHLNHAVLVVGYGFEGEESDGNSYWLVKNSWGEEWGMKGYIK 313

Query:   327 MKRNTGKPEGLCGINKMASIPL 348
             + ++       CGI  +A+ P+
Sbjct:   314 IAKDWNNH---CGIATLATYPI 332


>MGI|MGI:1349426
            symbol:Ctsj "cathepsin J" species:10090 "Mus musculus"
            [GO:0005764 "lysosome" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] [GO:0008152 "metabolic process" evidence=ISO]
            [GO:0008233 "peptidase activity" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=ISO] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0048471 "perinuclear region
            of cytoplasm" evidence=ISO] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            MGI:MGI:1349426 eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 EMBL:AF136272
            EMBL:AF158182 EMBL:AY034579 EMBL:AK005526 EMBL:AK131661
            EMBL:BC103769 IPI:IPI00126770 RefSeq:NP_036137.1 UniGene:Mm.31948
            ProteinModelPortal:Q9R014 SMR:Q9R014 MEROPS:C01.038 PRIDE:Q9R014
            Ensembl:ENSMUST00000071526 GeneID:26898 KEGG:mmu:26898
            UCSC:uc007qwa.1 CTD:26898 InParanoid:Q9R014 KO:K09599
            NextBio:304745 Bgee:Q9R014 CleanEx:MM_CTSJ Genevestigator:Q9R014
            GermOnline:ENSMUSG00000055298 Uniprot:Q9R014
        Length = 334

 Score = 524 (189.5 bits), Expect = 2.2e-50, P = 2.2e-50
 Identities = 117/316 (37%), Positives = 179/316 (56%)

Query:    43 KLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT----SYWLGLNEFAD 98
             KL   ++ W +K+ K+Y   EE L R  +++EN++ I   NKE +    ++ + +N+F D
Sbjct:    24 KLDAEWKDWKTKYAKSYSPKEEALRR-AVWEENMRMIKLHNKENSLGKNNFTMKMNKFGD 82

Query:    99 MSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCW 158
              + EEF+ K +   P       P A+ ++  +  LP   DWR++G VTPV+NQG CGSCW
Sbjct:    83 QTSEEFR-KSIDNIPIPAAMTDPHAQ-NHVSI-GLPDYKDWREEGYVTPVRNQGKCGSCW 139

Query:   159 AFSTVAAVEGINQIVSGNLTSLSEQELIDCD-TSFNNGCNGGLMDYAFKYIVASGGLHKE 217
             AF+   A+EG     +GNLT LS Q L+DC  T  N GC  G    AF+Y++ + GL  E
Sbjct:   140 AFAAAGAIEGQMFWKTGNLTPLSVQNLLDCSKTVGNKGCQSGTAHQAFEYVLKNKGLEAE 199

Query:   218 EDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQF 277
               YPY  ++G C  + E      I+ Y ++P N+    +   +  PVS AI+AS   F+F
Sbjct:   200 ATYPYEGKDGPCRYRSENASA-NITDYVNLPPNELYLWVAVASIGPVSAAIDASHDSFRF 258

Query:   278 YSGGVFTGP-CGAE-LDHGVAAVGYGKS----KGSDYIIVKNSWGPKWGERGYIRMKRNT 331
             Y+GG++  P C +  ++H V  VGYG       G++Y ++KNSWG +WG  GY+++ ++ 
Sbjct:   259 YNGGIYYEPNCSSYFVNHAVLVVGYGSEGDVKDGNNYWLIKNSWGEEWGMNGYMQIAKDH 318

Query:   332 GKPEGLCGINKMASIP 347
                   CGI  +AS P
Sbjct:   319 NNH---CGIASLASYP 331


>RGD|708447
            symbol:Testin "testin gene" species:10116 "Rattus norvegicus"
            [GO:0005576 "extracellular region" evidence=IEA] [GO:0006508
            "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0030054 "cell junction" evidence=IDA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 RGD:708447 GO:GO:0005576 GO:GO:0030054 eggNOG:COG4870
            HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513
            MEROPS:C01.972 OMA:RYHAENS OrthoDB:EOG4XWG0N EMBL:U16858
            IPI:IPI00207173 PIR:I52525 PIR:PC1251 RefSeq:NP_775155.1
            UniGene:Rn.10029 ProteinModelPortal:P15242 SMR:P15242
            Ensembl:ENSRNOT00000024467 GeneID:286916 KEGG:rno:286916
            UCSC:RGD:708447 CTD:286916 InParanoid:P15242 NextBio:625036
            Genevestigator:P15242 GermOnline:ENSRNOG00000018028 Uniprot:P15242
        Length = 333

 Score = 524 (189.5 bits), Expect = 2.2e-50, P = 2.2e-50
 Identities = 116/313 (37%), Positives = 175/313 (55%)

Query:    48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTS----YWLGLNEFADMSHEE 103
             +  W +KHGKTY   EE+L R  ++++N K I+  N E       + + +N F D+++ E
Sbjct:    29 WNEWRTKHGKTYNMNEERLKR-AVWEKNFKMIELHNWEYLEGRHDFTMAMNAFGDLTNIE 87

Query:   104 FKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTV 163
             F     G + Q   + + +  F       +PK VDWR+ G VTPVKNQG C S WAFS  
Sbjct:    88 FVKMMTGFQRQ---KIKKTHIFQDHQFLYVPKRVDWRQLGYVTPVKNQGHCASSWAFSAT 144

Query:   164 AAVEGINQIVSGNLTSLSEQELIDC-DTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPY 222
              ++EG     +  L  LSEQ L+DC  ++  +GC+GG M YAF+Y+  +GGL  EE YPY
Sbjct:   145 GSLEGQMFRKTERLIPLSEQNLLDCMGSNVTHGCSGGFMQYAFQYVKDNGGLATEESYPY 204

Query:   223 LMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQ-PVSVAIEASGTDFQFYSGG 281
               +   C    E      +  +  +P   E++L+KA+A   P+SVA++AS   FQFY  G
Sbjct:   205 RGQGRECRYHAEN-SAANVRDFVQIP-GSEEALMKAVAKVGPISVAVDASHGSFQFYGSG 262

Query:   282 VFTGP-CG-AELDHGVAAVGYG----KSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPE 335
             ++  P C    L+H V  VGYG    +S G+ + +VKNSWG +WG +GY+++ ++     
Sbjct:   263 IYYEPQCKRVHLNHAVLVVGYGFEGEESDGNSFWLVKNSWGEEWGMKGYMKLAKDWSNH- 321

Query:   336 GLCGINKMASIPL 348
               CGI   ++ P+
Sbjct:   322 --CGIATYSTYPI 332


>ZFIN|ZDB-GENE-050208-336
            symbol:ctskl "cathepsin K, like" species:7955
            "Danio rerio" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            ZFIN:ZDB-GENE-050208-336 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 EMBL:BX465190
            GeneTree:ENSGT00660000095458 IPI:IPI00491185 RefSeq:XP_695425.1
            UniGene:Dr.110795 Ensembl:ENSDART00000062749 GeneID:567046
            KEGG:dre:567046 CTD:567046 NextBio:20888499 Bgee:F1QCP8
            Uniprot:F1QCP8
        Length = 349

 Score = 523 (189.2 bits), Expect = 2.8e-50, P = 2.8e-50
 Identities = 125/336 (37%), Positives = 191/336 (56%)

Query:    28 FSIVGYSPEHLTSM--DKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKE 85
             F+++ ++P  + S   ++    +  W  KH  +Y    E +HR  I++ N++ I + N +
Sbjct:    19 FALLVWAPVQVASESEEEAPTEWNLWKKKHEISYDEESEDVHRKTIWETNMQKIWKNNND 78

Query:    86 ----VTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQ---PSAEFSYRDVKALP-KSV 137
                 ++ + + +N++ D++  E+K + LG K +    R+    SA+    + K L   ++
Sbjct:    79 FSFGLSMFKMAMNKYGDLTSVEYK-RLLGSKIKGTGNRKGKITSAQMLRLNAKRLGVTNI 137

Query:   138 DWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNN-GC 196
             D+R KG VT VK+QG CGSCW+FST  A+EG     +G L SLSEQ+L+DC  S+   GC
Sbjct:   138 DYRAKGYVTEVKDQGYCGSCWSFSTTGAIEGQMYKHTGRLVSLSEQQLVDCSRSYGTYGC 197

Query:   197 NGGLMDYAFKYIVASGGLHKEEDYPYL-MEEGTCEDKKEEMEVVTISGYQDVPENDEQSL 255
             +G  M  A+ Y++ +  L   + YPY  ++   C  +K  + +  IS Y+ VP  +EQ+L
Sbjct:   198 SGAWMANAYDYVI-NNALESSDTYPYTSVDTQPCFYEKN-LAMAGISDYRFVPAGNEQAL 255

Query:   256 LKALAHQ-PVSVAIEASGTDFQFYSGGVFT-GPCGAE-LDHGVAAVGYGKSKGSDYIIVK 312
               A+A   PVSVAI+A    F FYS G++    C    L+H V  VGYG  +G+DY I+K
Sbjct:   256 ADAVATVGPVSVAIDADNPSFLFYSSGIYKESNCNPNNLNHAVLVVGYGSEEGTDYWIIK 315

Query:   313 NSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPL 348
             NSWG  WGE GY+RM RN GK    CGI   A  P+
Sbjct:   316 NSWGTGWGEGGYMRMIRN-GK--NTCGIASYALYPI 348


>GENEDB_PFALCIPARUM|PF11_0161
            symbol:PF11_0161 "falcipain-2 precursor,
            putative" species:5833 "Plasmodium falciparum" [GO:0020020 "food
            vacuole" evidence=TAS] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00139
            PROSITE:PS00639 EMBL:AE014186 HSSP:O65039 OMA:NNVEHIN GO:GO:0020020
            MEROPS:C01.046 HOGENOM:HOG000065857 ProtClustDB:PTZ00021
            RefSeq:XP_001347832.1 ProteinModelPortal:Q8I6U5 SMR:Q8I6U5
            IntAct:Q8I6U5 MINT:MINT-1546851 EnsemblProtists:PF11_0161:mRNA
            GeneID:810708 KEGG:pfa:PF11_0161 EuPathDB:PlasmoDB:PF3D7_1115300
            Uniprot:Q8I6U5
        Length = 482

 Score = 521 (188.5 bits), Expect = 4.6e-50, P = 4.6e-50
 Identities = 123/326 (37%), Positives = 174/326 (53%)

Query:    45 IELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTS-YWLGLNEFADMSHEE 103
             I  F +++  + K Y    E   RF++F +N   +   N    S Y   LN FAD+++ E
Sbjct:   160 INQFYTFIKTNNKQYNSPNEMKERFQVFLQNAHKVKMHNNNKKSLYKKELNRFADLTYHE 219

Query:   104 FKNKYLGLKPQFPTRRQPSA--EFSYRDVKALPK--------SVDWRKKGAVTPVKNQGS 153
             FK+KYL L+   P +       + +Y  V    K        + DWR    VTPVK+Q +
Sbjct:   220 FKSKYLTLRSSKPLKNSKYLLDQINYDAVIKKYKGNENFDHAAYDWRLHSGVTPVKDQKN 279

Query:   154 CGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNN-GCNGGLMDYAFKYIVASG 212
             CGSCWAFS++ +VE    I    L +LSEQEL+DC  SF N GCNGGL++ AF+ ++  G
Sbjct:   280 CGSCWAFSSIGSVESQYAIRKNKLITLSEQELVDC--SFKNYGCNGGLINNAFEDMIELG 337

Query:   213 GLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASG 272
             G+  ++DYPY+ +     +     E   I  Y  VP+N  +  L+ L   P+S++I  S 
Sbjct:   338 GICTDDDYPYVSDAPNLCNIDRCTEKYGIKNYLSVPDNKLKEALRFLG--PISISIAVSD 395

Query:   273 TDFQFYSGGVFTGPCGAELDHGVAAVGYGKS--------KGSD--YIIVKNSWGPKWGER 322
              DF FY  G+F G CG EL+H V  VG+G          KG    Y I+KNSWG +WGER
Sbjct:   396 -DFPFYKEGIFDGECGDELNHAVMLVGFGMKEIVNPLTKKGEKHYYYIIKNSWGQQWGER 454

Query:   323 GYIRMKRNTGKPEGLCGINKMASIPL 348
             G+I ++ +       CG+   A IPL
Sbjct:   455 GFINIETDESGLMRKCGLGTDAFIPL 480


>UNIPROTKB|Q8I6U5
            symbol:PF11_0161 "Falcipain-2B" species:36329 "Plasmodium
            falciparum 3D7" [GO:0020020 "food vacuole" evidence=TAS]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00139 PROSITE:PS00639
            EMBL:AE014186 HSSP:O65039 OMA:NNVEHIN GO:GO:0020020 MEROPS:C01.046
            HOGENOM:HOG000065857 ProtClustDB:PTZ00021 RefSeq:XP_001347832.1
            ProteinModelPortal:Q8I6U5 SMR:Q8I6U5 IntAct:Q8I6U5
            MINT:MINT-1546851 EnsemblProtists:PF11_0161:mRNA GeneID:810708
            KEGG:pfa:PF11_0161 EuPathDB:PlasmoDB:PF3D7_1115300 Uniprot:Q8I6U5
        Length = 482

 Score = 521 (188.5 bits), Expect = 4.6e-50, P = 4.6e-50
 Identities = 123/326 (37%), Positives = 174/326 (53%)

Query:    45 IELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTS-YWLGLNEFADMSHEE 103
             I  F +++  + K Y    E   RF++F +N   +   N    S Y   LN FAD+++ E
Sbjct:   160 INQFYTFIKTNNKQYNSPNEMKERFQVFLQNAHKVKMHNNNKKSLYKKELNRFADLTYHE 219

Query:   104 FKNKYLGLKPQFPTRRQPSA--EFSYRDVKALPK--------SVDWRKKGAVTPVKNQGS 153
             FK+KYL L+   P +       + +Y  V    K        + DWR    VTPVK+Q +
Sbjct:   220 FKSKYLTLRSSKPLKNSKYLLDQINYDAVIKKYKGNENFDHAAYDWRLHSGVTPVKDQKN 279

Query:   154 CGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNN-GCNGGLMDYAFKYIVASG 212
             CGSCWAFS++ +VE    I    L +LSEQEL+DC  SF N GCNGGL++ AF+ ++  G
Sbjct:   280 CGSCWAFSSIGSVESQYAIRKNKLITLSEQELVDC--SFKNYGCNGGLINNAFEDMIELG 337

Query:   213 GLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASG 272
             G+  ++DYPY+ +     +     E   I  Y  VP+N  +  L+ L   P+S++I  S 
Sbjct:   338 GICTDDDYPYVSDAPNLCNIDRCTEKYGIKNYLSVPDNKLKEALRFLG--PISISIAVSD 395

Query:   273 TDFQFYSGGVFTGPCGAELDHGVAAVGYGKS--------KGSD--YIIVKNSWGPKWGER 322
              DF FY  G+F G CG EL+H V  VG+G          KG    Y I+KNSWG +WGER
Sbjct:   396 -DFPFYKEGIFDGECGDELNHAVMLVGFGMKEIVNPLTKKGEKHYYYIIKNSWGQQWGER 454

Query:   323 GYIRMKRNTGKPEGLCGINKMASIPL 348
             G+I ++ +       CG+   A IPL
Sbjct:   455 GFINIETDESGLMRKCGLGTDAFIPL 480


>UNIPROTKB|D3ZZR3
            symbol:D3ZZR3 "Uncharacterized protein" species:10116
            "Rattus norvegicus" [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0016020 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GO:GO:0002250 GeneTree:ENSGT00560000076577 GO:GO:0097067
            OrthoDB:EOG4JM7Q2 IPI:IPI00210228 PRIDE:D3ZZR3
            Ensembl:ENSRNOT00000028732 Uniprot:D3ZZR3
        Length = 331

 Score = 521 (188.5 bits), Expect = 4.6e-50, P = 4.6e-50
 Identities = 126/312 (40%), Positives = 170/312 (54%)

Query:    48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKE----VTSYWLGLNEFADMSHEE 103
             ++ W   H K YK   E+  R  I+++NLK I   N E    + SY +G+N   DM  E 
Sbjct:    25 WDLWKKTHEKEYKDQNEEDVRRLIWEKNLKFIMLHNLEHSMGMHSYSVGMNHMGDMVAET 84

Query:   104 FKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDW--RKKGAVTPVKNQGSCGSCWAFS 161
                + +G + + P +R+          + LP  V W  R KG    +  QGSCGSCWAFS
Sbjct:    85 IIGE-MGSE-RLPRKRKALGLIPSSVNQNLPAGVKWKERTKGCWKNLVFQGSCGSCWAFS 142

Query:   162 TVAAVEGINQIVSGNLTSLSEQELIDCDTSF---NNGCNGGLMDYAFKYIVASGGLHKEE 218
              V A+EG  ++ +G L SLS Q L+DC T     N GC GG M  AF+YI+ +GG+  E 
Sbjct:   143 AVGALEGQLKLKTGKLVSLSAQNLVDCSTEEKYGNKGCGGGFMTEAFQYIIDNGGIDSEA 202

Query:   219 DYPYLMEEGTCE-DKKEEMEVVTISGYQDVPENDEQSLLKALAHQ-PVSVAIEASGTDFQ 276
              YPY   +  C  D K      T S Y ++P  DE++L +A+A + PVSV I+AS + F 
Sbjct:   203 SYPYKAMDEKCHYDPKNR--AATCSRYIELPFGDEEALKEAVATKGPVSVGIDASHSSFF 260

Query:   277 FYSGGVFTGP-CGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPE 335
              Y  GV+  P C   ++HGV  VGYG   G DY +VKNSWG  +G++GYIRM RN    +
Sbjct:   261 LYQSGVYDDPSCTENVNHGVLVVGYGTLDGKDYWLVKNSWGLHFGDQGYIRMARNN---K 317

Query:   336 GLCGINKMASIP 347
               CGI    S P
Sbjct:   318 NHCGIASYCSYP 329


>UNIPROTKB|H9KYW5
            symbol:CTSS "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0002250 "adaptive immune response" evidence=IEA]
            [GO:0005764 "lysosome" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] [GO:0016020 "membrane" evidence=IEA] [GO:0097067
            "cellular response to thyroid hormone stimulus" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00560000076577 OMA:YEPACTQ EMBL:AADN02010496
            Ensembl:ENSGALT00000001122 Uniprot:H9KYW5
        Length = 245

 Score = 517 (187.1 bits), Expect = 1.2e-49, P = 1.2e-49
 Identities = 104/225 (46%), Positives = 138/225 (61%)

Query:   126 SYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQEL 185
             +YR     P ++DWR+KG VT VKNQG+CG+CWAFS V A+E   ++ +G L SLS Q L
Sbjct:    23 TYRRRGGAPDAMDWREKGCVTEVKNQGACGACWAFSAVGALEAQVKLKTGKLVSLSAQNL 82

Query:   186 IDCDTSFNN-GCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGY 244
             +DC   + N GC GG M  AF+YI+ + G+  EE YPY+ + GTC+         T S Y
Sbjct:    83 VDCSMMYGNKGCGGGFMTRAFQYIIDNNGIDSEESYPYMAQNGTCQ-YNVSTRAATCSKY 141

Query:   245 QDVPENDEQSLLKALAHQ-PVSVAIEASGTDFQFYSGGVFTGP-CGAELDHGVAAVGYGK 302
              ++P  DE +L  A+A+  PVSVAI+A+   F  Y  GV+  P C  E++HGV  VGYG 
Sbjct:   142 VELPYADEAALKDAVANVGPVSVAIDATQPTFFLYRSGVYDDPRCTQEVNHGVLVVGYGT 201

Query:   303 SKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
                 D+ +VKNSWG ++G+ GYIRM RN       CGI   AS P
Sbjct:   202 LNEKDFWLVKNSWGERFGDGGYIRMSRNHANH---CGIASYASYP 243


>MGI|MGI:1861723
            symbol:Ctsr "cathepsin R" species:10090 "Mus musculus"
            [GO:0005764 "lysosome" evidence=ISA] [GO:0006508 "proteolysis"
            evidence=IEA] [GO:0008233 "peptidase activity" evidence=IEA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=ISA]
            [GO:0016787 "hydrolase activity" evidence=IEA] [GO:0030163 "protein
            catabolic process" evidence=ISA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            MGI:MGI:1861723 eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 GO:GO:0030163
            GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 EMBL:AF245399
            EMBL:AY014778 EMBL:AK014432 EMBL:AK005429 IPI:IPI00120321
            RefSeq:NP_064680.1 UniGene:Mm.315715 ProteinModelPortal:Q9JIA9
            SMR:Q9JIA9 MEROPS:C01.042 PRIDE:Q9JIA9 Ensembl:ENSMUST00000021889
            GeneID:56835 KEGG:mmu:56835 CTD:56835 InParanoid:Q9JIA9 KO:K09601
            OMA:ASHESFK OrthoDB:EOG4ZCT6D NextBio:313379 Bgee:Q9JIA9
            CleanEx:MM_CTSR Genevestigator:Q9JIA9 GermOnline:ENSMUSG00000055679
            Uniprot:Q9JIA9
        Length = 334

 Score = 517 (187.1 bits), Expect = 1.2e-49, P = 1.2e-49
 Identities = 115/312 (36%), Positives = 174/312 (55%)

Query:    48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT----SYWLGLNEFADMSHEE 103
             ++ W  K+ K+Y   EEKL R  +++E LK I   N+E +     + + +NEF D + EE
Sbjct:    29 WQDWKIKYNKSYSLKEEKLKRV-VWEEKLKMIKLHNRENSLGKNGFTMKMNEFGDQTDEE 87

Query:   104 FKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTV 163
             F+   + +     T R+  +         LPK VDWRKKG VTPV+ QG C +CWAF+  
Sbjct:    88 FRKMMIEISVW--THREGKSIMKREAGSILPKFVDWRKKGYVTPVRRQGDCDACWAFAVT 145

Query:   164 AAVEGINQIVSGNLTSLSEQELIDCDT-SFNNGCNGGLMDYAFKYIVASGGLHKEEDYPY 222
              A+E      +G LT LS Q L+DC     NNGC GG    AF+Y++ +GGL  E  YPY
Sbjct:   146 GAIEAQAIWQTGKLTPLSVQNLVDCSKPQGNNGCLGGDTYNAFQYVLHNGGLESEATYPY 205

Query:   223 LMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASGTDFQFYSGG 281
               ++G C    +  +   I+G+  +P++ E  L+ A+A   P++  I+AS   F+ Y GG
Sbjct:   206 EGKDGPCRYNPKNSKA-EITGFVSLPQS-EDILMAAVATIGPITAGIDASHESFKNYKGG 263

Query:   282 VFTGP-CGAE-LDHGVAAVGYG----KSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPE 335
             ++  P C ++ + HGV  VGYG    ++ G+ Y ++KNSWG +WG RGY+++ ++     
Sbjct:   264 IYHEPNCSSDTVTHGVLVVGYGFKGIETDGNHYWLIKNSWGKRWGIRGYMKLAKDKNNH- 322

Query:   336 GLCGINKMASIP 347
               CGI   A  P
Sbjct:   323 --CGIASYAHYP 332


>MGI|MGI:1860262
            symbol:Cts7 "cathepsin 7" species:10090 "Mus musculus"
            [GO:0005575 "cellular_component" evidence=ND] [GO:0005576
            "extracellular region" evidence=IEA] [GO:0005634 "nucleus"
            evidence=IEA] [GO:0005737 "cytoplasm" evidence=IEA] [GO:0005764
            "lysosome" evidence=IEA] [GO:0005768 "endosome" evidence=IEA]
            [GO:0005794 "Golgi apparatus" evidence=IEA] [GO:0006508
            "proteolysis" evidence=ISA] [GO:0007049 "cell cycle" evidence=IEA]
            [GO:0007067 "mitosis" evidence=IEA] [GO:0008152 "metabolic process"
            evidence=IEA] [GO:0008233 "peptidase activity" evidence=ISA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0016787 "hydrolase activity" evidence=IEA] [GO:0051301 "cell
            division" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            MGI:MGI:1860262 GO:GO:0005634 GO:GO:0005794 GO:GO:0048471
            GO:GO:0005615 GO:GO:0051301 GO:GO:0007067 GO:GO:0005768
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0005764 GO:GO:0008233 EMBL:CH466546
            EMBL:AY014779 EMBL:CT030645 EMBL:BC064740 EMBL:AF250837
            IPI:IPI00131132 RefSeq:NP_062412.1 UniGene:Mm.3692 HSSP:O60911
            ProteinModelPortal:Q91ZF2 SMR:Q91ZF2 STRING:Q91ZF2 MEROPS:C01.016
            PRIDE:Q91ZF2 Ensembl:ENSMUST00000021892 GeneID:56092 KEGG:mmu:56092
            UCSC:uc007qwi.1 CTD:56092 GeneTree:ENSGT00560000076577
            HOVERGEN:HBG011513 InParanoid:Q91ZF2 OMA:ERRVIWE OrthoDB:EOG44QT2S
            NextBio:311908 Bgee:Q91ZF2 Genevestigator:Q91ZF2 Uniprot:Q91ZF2
        Length = 331

 Score = 513 (185.6 bits), Expect = 3.2e-49, P = 3.2e-49
 Identities = 122/312 (39%), Positives = 175/312 (56%)

Query:    48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKE----VTSYWLGLNEFADMSHEE 103
             +E W   + +TY   EEK  R  +++ N+K I Q   E    + ++ + +NEF DM+ EE
Sbjct:    29 WEEWKRSNDRTYSPEEEKQRR-AVWEGNVKWIKQHIMENGLWMNNFTIEMNEFGDMTGEE 87

Query:   104 FKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTV 163
              K   L     +P R         R+ K +P ++DWRK+G VTPV+ QGSCG+CWAFS  
Sbjct:    88 MK--MLTESSSYPLRN--GKHIQKRNPK-IPPTLDWRKEGYVTPVRRQGSCGACWAFSVT 142

Query:   164 AAVEGINQIVSGNLTSLSEQELIDCDTSFNN-GCNGGLMDYAFKYIVASGGLHKEEDYPY 222
             A +EG     +G L  LS Q L+DC  S+   GC+GG    AF+Y+  +GGL  E  YPY
Sbjct:   143 ACIEGQLFKKTGKLIPLSVQNLMDCSVSYGTKGCDGGRPYDAFQYVKNNGGLEAEATYPY 202

Query:   223 LMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALA-HQPVSVAIEASGTDFQFYSGG 281
               +   C  + E   VV ++ +  VP N+E +LL+AL  H P++VAI+ S   F  Y GG
Sbjct:   203 EAKAKHCRYRPER-SVVKVNRFFVVPRNEE-ALLQALVTHGPIAVAIDGSHASFHSYRGG 260

Query:   282 VFTGP-CGAE-LDHGVAAVGYG----KSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPE 335
             ++  P C  + LDHG+  VGYG    +S+   Y ++KNS G +WGE GY+++ R  G+  
Sbjct:   261 IYHEPKCRKDTLDHGLLLVGYGYEGHESENRKYWLLKNSHGERWGENGYMKLPR--GQ-N 317

Query:   336 GLCGINKMASIP 347
               CGI   A  P
Sbjct:   318 NYCGIASYAMYP 329


>RGD|1562210
            symbol:MGC114246 "similar to cathepsin R" species:10116 "Rattus
            norvegicus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            RGD:1562210 eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            HOVERGEN:HBG011513 EMBL:CH474032 MEROPS:C01.042 KO:K09601
            OMA:ASHESFK OrthoDB:EOG4ZCT6D EMBL:BC091563 IPI:IPI00555186
            RefSeq:NP_001017509.1 UniGene:Rn.198321 SMR:Q5BJA0
            Ensembl:ENSRNOT00000061470 GeneID:498688 KEGG:rno:498688
            UCSC:RGD:1562210 InParanoid:Q5BJA0 NextBio:700535
            Genevestigator:Q5BJA0 Uniprot:Q5BJA0
        Length = 334

 Score = 511 (184.9 bits), Expect = 5.2e-49, P = 5.2e-49
 Identities = 115/312 (36%), Positives = 167/312 (53%)

Query:    48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKE----VTSYWLGLNEFADMSHEE 103
             ++ W  K+ K+Y   EE+L R  +++ENLK I   N E       + + +NEF D + EE
Sbjct:    29 WQEWKKKYDKSYSLEEEELRR-AVWEENLKMIKLHNGENGLGKNGFTMEINEFGDTTGEE 87

Query:   104 FKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTV 163
             F+   +    Q  T R+  +          PK VDWRKKG VTPV+ QG+C +CWAFS  
Sbjct:    88 FRKMMVEFPVQ--THREGKSIMKRAAGSIFPKFVDWRKKGYVTPVRRQGNCNACWAFSVT 145

Query:   164 AAVEGINQIVSGNLTSLSEQELIDCDT-SFNNGCNGGLMDYAFKYIVASGGLHKEEDYPY 222
              A+E      SG L  LS Q L+DC     NNGC GG    AF+Y++ +GGL  E  YPY
Sbjct:   146 GAIEAQTIWQSGKLIPLSVQNLVDCSKPQGNNGCLGGDTYNAFQYVLHNGGLQSEATYPY 205

Query:   223 LMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGV 282
               ++G C    +      I+G+  +PE+++  ++      P+S  I+AS   F+FY  G+
Sbjct:   206 EGKDGPCRYNPKNSSA-EITGFVSLPESEDILMVAVATIGPISAGIDASHESFKFYKKGI 264

Query:   283 FTGP-CGAE-LDHGVAAVGYGKSKGSD-----YIIVKNSWGPKWGERGYIRMKRNTGKPE 335
             +  P C +  + HGV  VGYG  KG+D     Y ++KNSWG +WG RGY+++ ++     
Sbjct:   265 YHEPNCSSNSVTHGVLVVGYG-FKGNDTGGDHYWLIKNSWGKQWGIRGYMKITKDKNNH- 322

Query:   336 GLCGINKMASIP 347
               C I   A  P
Sbjct:   323 --CAIASYAHYP 332


>UNIPROTKB|Q4QRC2
            symbol:Ctsql2 "Protein Ctsql2" species:10116 "Rattus
            norvegicus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00639 HOVERGEN:HBG011513 EMBL:CH474032
            RGD:1303225 EMBL:BC097257 IPI:IPI00421946 RefSeq:NP_001002813.2
            UniGene:Rn.128678 SMR:Q4QRC2 MEROPS:C01.111
            Ensembl:ENSRNOT00000038758 GeneID:408201 KEGG:rno:408201 CTD:408201
            InParanoid:Q4QRC2 OMA:NDEGALM NextBio:696394 Genevestigator:Q4QRC2
            Uniprot:Q4QRC2
        Length = 343

 Score = 508 (183.9 bits), Expect = 1.1e-48, P = 1.1e-48
 Identities = 120/322 (37%), Positives = 179/322 (55%)

Query:    48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT----SYWLGLNEFADMSHEE 103
             ++ W  K+ K Y   EE L R  +++EN+K I+  N+E +    +Y + +N FAD++ EE
Sbjct:    29 WQEWKMKYEKLYSPEEELLKRV-VWEENVKKIELHNRENSLGKNTYIMEINNFADLTDEE 87

Query:   104 FKNKYLGLK-PQFPT-----RRQPSAEFS----YRDVKALPKSVDWRKKGAVTPVKNQGS 153
             FK+   G+  P   T     +R   + F     +RD  ALPKS+DWRK+G VT V+ QG 
Sbjct:    88 FKDMITGITLPINNTMKSLWKRALGSPFPNSWYWRD--ALPKSIDWRKEGYVTRVREQGK 145

Query:   154 CGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDT-SFNNGCNGGLMDYAFKYIVASG 212
             C SCWAF    A+EG     +G LT LS Q L+DC     N GC GG    AF+Y++ +G
Sbjct:   146 CKSCWAFPVAGAIEGQMFKKTGKLTPLSVQNLVDCSKPQGNKGCRGGTTYNAFQYVLQNG 205

Query:   213 GLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQ-PVSVAIEAS 271
             GL  E  YPY  +EG C+   +      I+ +  +PE DE  L+ ALA + PV+  I   
Sbjct:   206 GLESEATYPYKGKEGLCKYNPKNA-YAKITRFVALPE-DEDVLMDALATKGPVAAGIHVV 263

Query:   272 GTDFQFYSGGVFTGP-CGAELDHGVAAVGYG----KSKGSDYIIVKNSWGPKWGERGYIR 326
              +  +FY  G++  P C   ++H V  VGYG    ++ G++Y ++KNSWG +WG +GY++
Sbjct:   264 YSSLRFYKKGIYHEPKCNNRVNHAVLVVGYGFEGNETDGNNYWLIKNSWGKQWGLKGYMK 323

Query:   327 MKRNTGKPEGLCGINKMASIPL 348
             + ++       CGI   A  P+
Sbjct:   324 IAKDRNNH---CGIATFAQYPI 342


>FB|FBgn0034229
            symbol:CG4847 species:7227 "Drosophila melanogaster"
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0032504
            "multicellular organism reproduction" evidence=IEP] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=ISS] [GO:0005615 "extracellular space"
            evidence=ISM;IDA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            EMBL:AE013599 GO:GO:0005615 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 GO:GO:0032504 GeneTree:ENSGT00560000076599
            KO:K01371 EMBL:BT099507 RefSeq:NP_725686.1 UniGene:Dm.4677
            SMR:A1ZAU4 IntAct:A1ZAU4 MEROPS:C01.A28 EnsemblMetazoa:FBtr0086935
            GeneID:36973 KEGG:dme:Dmel_CG4847 UCSC:CG4847-RB
            FlyBase:FBgn0034229 InParanoid:A1ZAU4 OMA:GGFQEYA OrthoDB:EOG4J9KFC
            ChiTaRS:CG4847 GenomeRNAi:36973 NextBio:801302 Uniprot:A1ZAU4
        Length = 420

 Score = 506 (183.2 bits), Expect = 1.8e-48, P = 1.8e-48
 Identities = 118/318 (37%), Positives = 173/318 (54%)

Query:    45 IELFESWMSKHGKTY-KCIEEKLHRFEIFKENLKHIDQRN----KEVTSYWLGLNEFADM 99
             ++ F  ++S+ GKTY    +  LH    F      ++  N    + V ++   +N FAD+
Sbjct:   109 VQDFGDFLSQSGKTYLSAADRALHE-GAFASTKNLVEAGNAAFAQGVHTFKQAVNAFADL 167

Query:   100 SHEEFKNKYLGLK--PQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSC 157
             +H EF ++  GLK  P+   R   S +      K +P + DWR+ G VTPVK QG+CGSC
Sbjct:   168 THSEFLSQLTGLKRSPEAKARAAASLKLVNLPAKPIPDAFDWREHGGVTPVKFQGTCGSC 227

Query:   158 WAFSTVAAVEGINQIVSGNLTSLSEQELIDCDT--SFN-NGCNGGLMDYAFKYI-VASGG 213
             WAF+T  A+EG     +G+L +LSEQ L+DC     F  NGC+GG  + AF +I     G
Sbjct:   228 WAFATTGAIEGHTFRKTGSLPNLSEQNLVDCGPVEDFGLNGCDGGFQEAAFCFIDEVQKG 287

Query:   214 LHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQ-PVSVAIEASG 272
             + +E  YPY+  +GTC+    +    T+ G+  +P  DE+ L K +A   PV+ ++    
Sbjct:   288 VSQEGAYPYIDNKGTCKYDGSKSGA-TLQGFAAIPPKDEEQLKKVVATLGPVACSVNGLE 346

Query:   273 TDFQFYSGGVFTGP-CG-AELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRN 330
             T  + Y+GG++    C   E +H +  VGYG  KG DY IVKNSW   WGE+GY R+ R 
Sbjct:   347 T-LKNYAGGIYNDDECNKGEPNHSILVVGYGSEKGQDYWIVKNSWDDTWGEKGYFRLPR- 404

Query:   331 TGKPEGLCGINKMASIPL 348
              GK    C I +  S P+
Sbjct:   405 -GK--NYCFIAEECSYPV 419


>RGD|631421 [details] [associations]
            symbol:Ctsq "cathepsin Q" species:10116 "Rattus norvegicus"
            [GO:0005764 "lysosome" evidence=NAS] [GO:0006508 "proteolysis"
            evidence=IEA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 RGD:631421 eggNOG:COG4870
            HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0005764 GeneTree:ENSGT00560000076577
            HOVERGEN:HBG011513 UniGene:Rn.34875 EMBL:AF187323 IPI:IPI00214897
            PIR:JC7183 RefSeq:NP_640355.1 UniGene:Rn.35820
            ProteinModelPortal:Q9QZE3 SMR:Q9QZE3 STRING:Q9QZE3 MEROPS:C01.039
            PRIDE:Q9QZE3 Ensembl:ENSRNOT00000024208 GeneID:246147
            KEGG:rno:246147 UCSC:RGD:631421 CTD:104002 InParanoid:Q9QZE3
            OMA:ESEDVLM OrthoDB:EOG4HHP48 NextBio:623425 Genevestigator:Q9QZE3
            GermOnline:ENSRNOG00000017946 Uniprot:Q9QZE3
        Length = 343

 Score = 504 (182.5 bits), Expect = 2.9e-48, P = 2.9e-48
 Identities = 118/332 (35%), Positives = 188/332 (56%)

Query:    39 TSMDKLIEL-FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT----SYWLGL 93
             +++D  +++ ++ W  K+ K Y   EE L R  +++EN+K I+  N+E +    +Y + +
Sbjct:    19 SALDLSLDVQWQEWKIKYEKLYSPEEEVLKRV-VWEENVKKIELHNRENSLGKNTYTMEI 77

Query:    94 NEFADMSHEEFKNKYLGLK-PQFPTRRQ----------PSAEFSYRDVKALPKSVDWRKK 142
             N+FADM+ EEFK+  +G + P   T ++          P++ +++RD  ALPK VDWR +
Sbjct:    78 NDFADMTDEEFKDMIIGFQLPVHNTEKRLWKRALGSFFPNS-WNWRD--ALPKFVDWRNE 134

Query:   143 GAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDT-SFNNGCNGGLM 201
             G VT V+ QG C SCWAF    A+EG     +G L  LS Q LIDC     N GC  G  
Sbjct:   135 GYVTRVRKQGGCSSCWAFPVTGAIEGQMFKKTGKLIPLSVQNLIDCSKPQGNRGCLWGNT 194

Query:   202 DYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH 261
               AF+Y++ +GGL  E  YPY  +EG C    +      I+G+  +PE+ E  L+ A+A 
Sbjct:   195 YNAFQYVLHNGGLEAEATYPYERKEGVCRYNPKNSSA-KITGFVVLPES-EDVLMDAVAT 252

Query:   262 Q-PVSVAIEASGTDFQFYSGGVFTGP-CGAELDHGVAAVGYG----KSKGSDYIIVKNSW 315
             + P++  +    + F+FY  GV+  P C + ++H V  VGYG    ++ G++Y ++KNSW
Sbjct:   253 KGPIATGVHVISSSFRFYQKGVYHEPKCSSYVNHAVLVVGYGFEGNETDGNNYWLIKNSW 312

Query:   316 GPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
             G +WG RGY+++ ++       C I  +A  P
Sbjct:   313 GKRWGLRGYMKIAKDRNNH---CAIASLAQYP 341


>FB|FBgn0032228 [details] [associations]
            symbol:CG5367 species:7227 "Drosophila melanogaster"
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0006508 "proteolysis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            EMBL:AE014134 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00639 GeneTree:ENSGT00560000076599 HSSP:P80067
            RefSeq:NP_609387.1 UniGene:Dm.26782 ProteinModelPortal:Q9VKY4
            SMR:Q9VKY4 MEROPS:C01.A30 EnsemblMetazoa:FBtr0080055 GeneID:34401
            KEGG:dme:Dmel_CG5367 UCSC:CG5367-RA FlyBase:FBgn0032228
            InParanoid:Q9VKY4 OMA:QIVDCSV OrthoDB:EOG4THT8X PhylomeDB:Q9VKY4
            GenomeRNAi:34401 NextBio:788324 ArrayExpress:Q9VKY4 Bgee:Q9VKY4
            Uniprot:Q9VKY4
        Length = 338

 Score = 503 (182.1 bits), Expect = 3.7e-48, P = 3.7e-48
 Identities = 111/308 (36%), Positives = 177/308 (57%)

Query:    48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRN---KE-VTSYWLGLNEFADMSHEE 103
             FE + + + + Y    +++  ++ F+EN K I++ N   KE  TS+ L  N FADMS + 
Sbjct:    36 FEKFKNNNNRKYLRTYDEMRSYKAFEENFKVIEEHNQNYKEGQTSFRLKPNIFADMSTDG 95

Query:   104 FKNKYLGL-KPQFPTRRQPSAEFSYRDVKA-LPKSVDWRKKGAVTPVKNQGSCGSCWAFS 161
             +   +L L K          AE     + A +P+S+DWR KG +TP  NQ SCGSC+AFS
Sbjct:    96 YLKGFLRLLKSNIEDSADNMAEIVGSPLMANVPESLDWRSKGFITPPYNQLSCGSCYAFS 155

Query:   162 TVAAVEGINQIVSGNLTSLSEQELIDCDTSFNN-GCNGGLMDYAFKYIVASGGLHKEEDY 220
                ++ G     +G + SLS+Q+++DC  S  N GC GG +     Y+ ++GG+ +++DY
Sbjct:   156 IAESIMGQVFKRTGKILSLSKQQIVDCSVSHGNQGCVGGSLRNTLSYLQSTGGIMRDQDY 215

Query:   221 PYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASGTDFQFYS 279
             PY+  +G C+   + + VV ++ +  +P  DEQ++  A+ H  PV+++I AS   FQ YS
Sbjct:   216 PYVARKGKCQFVPD-LSVVNVTSWAILPVRDEQAIQAAVTHIGPVAISINASPKTFQLYS 274

Query:   280 GGVFTGP-CG-AELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGL 337
              G++  P C  A ++H +  +G+GK    DY I+KN WG  WGE GYIR+++       +
Sbjct:   275 DGIYDDPLCSSASVNHAMVVIGFGK----DYWILKNWWGQNWGENGYIRIRKGVN----M 326

Query:   338 CGINKMAS 345
             CGI   A+
Sbjct:   327 CGIANYAA 334


>UNIPROTKB|F1RU48 [details] [associations]
            symbol:CTSF "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00139 PROSITE:PS00639
            OMA:LAPPEWD GeneTree:ENSGT00660000095458 EMBL:CU928034
            EMBL:FP565364 Ensembl:ENSSSCT00000014140 Ensembl:ENSSSCT00000014154
            Uniprot:F1RU48
        Length = 460

 Score = 503 (182.1 bits), Expect = 3.7e-48, P = 3.7e-48
 Identities = 125/311 (40%), Positives = 169/311 (54%)

Query:    43 KLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQ-RNKEVTSYWLGLNEFADMSH 101
             K+  +F+ +++ + +TY   EE   R  +F  N+    + +  +  +   G+ +F+D++ 
Sbjct:   158 KMASIFKEFVTTYNRTYDTKEEARWRMSVFANNMVRAQKIQALDTGTARYGVTKFSDLTE 217

Query:   102 EEFKNKYLG-LKPQFPTRRQPSAEFSYRDVKALPKSV-DWRKKGAVTPVKNQGSCGSCWA 159
             EEF+  YL  L  + P R+   A    + V +LP    DWRKKGAVT VK+QG CGSCWA
Sbjct:   218 EEFRTIYLNPLLQEEPGRKMRLA----KSVSSLPPPEWDWRKKGAVTKVKDQGMCGSCWA 273

Query:   160 FSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEED 219
             FS    VEG   +  G L SLSEQEL+DCD   + GC GGL   A+  I   GGL  EED
Sbjct:   274 FSVTGNVEGQWFLKQGTLLSLSEQELLDCD-KVDKGCMGGLPSNAYSAIKTLGGLETEED 332

Query:   220 YPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQ-PVSVAIEASGTDFQFY 278
             Y Y     TC    E+ +V  I+   ++ +N EQ L   LA + P+SVAI A G   QFY
Sbjct:   333 YSYRGHLQTCSFNAEKAKVY-INDSVELSQN-EQKLAAWLAEKGPISVAINAFG--MQFY 388

Query:   279 SGGVF--TGP-CGAEL-DHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKP 334
               G+     P C   L DH V  VGYG    + +  +KNSWG  WGE GY  + R +G  
Sbjct:   389 RHGISHPLRPLCSPWLIDHAVLLVGYGNRSATPFWAIKNSWGTDWGEEGYYYLYRGSGA- 447

Query:   335 EGLCGINKMAS 345
                CG+N MAS
Sbjct:   448 ---CGVNIMAS 455


>RGD|1309226 [details] [associations]
            symbol:Cts7 "cathepsin 7" species:10116 "Rattus norvegicus"
            [GO:0005615 "extracellular space" evidence=IEA] [GO:0005634
            "nucleus" evidence=IEA] [GO:0005764 "lysosome" evidence=IEA]
            [GO:0005768 "endosome" evidence=IEA] [GO:0005794 "Golgi apparatus"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0007067
            "mitosis" evidence=IEA] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0048471 "perinuclear region of
            cytoplasm" evidence=IEA] [GO:0051301 "cell division" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 RGD:1309226 GO:GO:0005634
            GO:GO:0005794 GO:GO:0048471 GO:GO:0005615 GO:GO:0051301
            GO:GO:0007067 GO:GO:0005768 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0005764 MEROPS:C01.016 CTD:56092
            GeneTree:ENSGT00560000076577 OrthoDB:EOG44QT2S EMBL:CH474032
            IPI:IPI00870531 RefSeq:NP_001099569.1 UniGene:Rn.218615
            Ensembl:ENSRNOT00000043686 GeneID:290970 KEGG:rno:290970
            UCSC:RGD:1309226 OMA:VESFNAN Uniprot:D3ZZ07
        Length = 331

 Score = 503 (182.1 bits), Expect = 3.7e-48, P = 3.7e-48
 Identities = 120/313 (38%), Positives = 176/313 (56%)

Query:    48 FESWMSKHGKTYKCIEEKLHRFEIFKENLK----HIDQRNKEVTSYWLGLNEFADMSHEE 103
             +E W   + KTY   EEK  R  +++EN+K    H  Q    + ++ + +NEF DM+ EE
Sbjct:    29 WEEWKRNNAKTYSPEEEKQRR-AVWEENVKMIKWHTMQNGLWMNNFTIEMNEFGDMTGEE 87

Query:   104 FKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTV 163
              +   +       T R        R+VK +PK++DWR  G V PV++QG CG+CWAFS  
Sbjct:    88 MR---MMTDSSALTLRN-GKHIQKRNVK-IPKTLDWRDTGCVAPVRSQGGCGACWAFSVA 142

Query:   164 AAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDYPY 222
             A++E      +G L  LS Q LIDC  ++ NN C+GG    AF+Y+  +GGL  E  YPY
Sbjct:   143 ASIESQLFKKTGKLIPLSVQNLIDCTVTYGNNDCSGGKPYTAFQYVKNNGGLEAEATYPY 202

Query:   223 LMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALA-HQPVSVAIEASGTDFQFYSGG 281
               +   C  + E   VV I+ +  VP N+E +L++AL  + P++VAI+ S   F+ Y GG
Sbjct:   203 EAKLRHCRYRPER-SVVKIARFFVVPRNEE-ALMQALVTYGPIAVAIDGSHASFKRYRGG 260

Query:   282 VFTGP-CGAE-LDHGVAAVGYG----KSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPE 335
             ++  P C  + LDHG+  VGYG    +S+   Y ++KNS G +WGERGY+++ R+     
Sbjct:   261 IYHEPKCRRDTLDHGLLLVGYGYEGHESENRKYWLLKNSHGEQWGERGYMKLPRDQNN-- 318

Query:   336 GLCGINKMASIPL 348
               CGI   A  PL
Sbjct:   319 -YCGIASYAMYPL 330


>MGI|MGI:1861434 [details] [associations]
            symbol:Ctsf "cathepsin F" species:10090 "Mus musculus"
            [GO:0005764 "lysosome" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] [GO:0008152 "metabolic process" evidence=ISS]
            [GO:0008233 "peptidase activity" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=ISS] [GO:0016787
            "hydrolase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 MGI:MGI:1861434 eggNOG:COG4870
            HOGENOM:HOG000230774 KO:K01373 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0005764 HOVERGEN:HBG011513 MEROPS:C01.018
            CTD:8722 OMA:LAPPEWD OrthoDB:EOG4CC41T EMBL:AF136280 EMBL:AF217224
            EMBL:AJ131851 EMBL:AK075862 EMBL:BC058758 IPI:IPI00126769
            RefSeq:NP_063914.1 UniGene:Mm.29561 ProteinModelPortal:Q9R013
            SMR:Q9R013 STRING:Q9R013 PhosphoSite:Q9R013 PaxDb:Q9R013
            PRIDE:Q9R013 Ensembl:ENSMUST00000119694 GeneID:56464 KEGG:mmu:56464
            UCSC:uc008gbc.1 GeneTree:ENSGT00660000095458 InParanoid:Q9R013
            NextBio:312722 Bgee:Q9R013 CleanEx:MM_CTSF Genevestigator:Q9R013
            GermOnline:ENSMUSG00000006458 Uniprot:Q9R013
        Length = 462

 Score = 499 (180.7 bits), Expect = 9.8e-48, P = 9.8e-48
 Identities = 125/311 (40%), Positives = 164/311 (52%)

Query:    43 KLIELFESWMSKHGKTYKCIEEKLHRFEIFKENL---KHIDQRNKEVTSYWLGLNEFADM 99
             K+  LF+ +M+ + +TY+  EE   R  +F  N+   + I   ++    Y  G+ +F+D+
Sbjct:   160 KMAPLFKDFMTTYNRTYESREEAQWRLTVFARNMIRAQKIQALDRGTAQY--GITKFSDL 217

Query:   100 SHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWA 159
             + EEF   YL    Q  + R+ S   S  D+   P   DWRKKGAVT VKNQG CGSCWA
Sbjct:   218 TEEEFHTIYLNPLLQKESGRKMSPAKSINDLA--PPEWDWRKKGAVTEVKNQGMCGSCWA 275

Query:   160 FSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEED 219
             FS    VEG   +  G L SLSEQEL+DCD   +  C GGL   A+  I   GGL  E+D
Sbjct:   276 FSVTGNVEGQWFLNRGTLLSLSEQELLDCD-KVDKACLGGLPSNAYAAIKNLGGLETEDD 334

Query:   220 YPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQ-PVSVAIEASGTDFQFY 278
             Y Y     TC +   +M  V I+   ++  N E  +   LA + P+SVAI A G   QFY
Sbjct:   335 YGYQGHVQTC-NFSAQMAKVYINDSVELSRN-ENKIAAWLAQKGPISVAINAFG--MQFY 390

Query:   279 SGGV---FTGPCGAE-LDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKP 334
               G+   F   C    +DH V  VGYG      Y  +KNSWG  WGE GY  + R +G  
Sbjct:   391 RHGIAHPFRPLCSPWFIDHAVLLVGYGNRSNIPYWAIKNSWGSDWGEEGYYYLYRGSGA- 449

Query:   335 EGLCGINKMAS 345
                CG+N MAS
Sbjct:   450 ---CGVNTMAS 457


>UNIPROTKB|E9PSK9 [details] [associations]
            symbol:Ctsql2 "Protein Ctsql2" species:10116 "Rattus
            norvegicus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            IPI:IPI00562656 Ensembl:ENSRNOT00000045847 RGD:1303225
            ArrayExpress:E9PSK9 Uniprot:E9PSK9
        Length = 342

 Score = 499 (180.7 bits), Expect = 9.8e-48, P = 9.8e-48
 Identities = 121/322 (37%), Positives = 179/322 (55%)

Query:    48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT----SYWLGLNEFADMSHEE 103
             ++ W  K+ K Y   EE L R  +++EN+K I+  N+E +    +Y + +N FAD++ EE
Sbjct:    29 WQEWKMKYEKLYSPEEELLKRV-VWEENVKKIELHNRENSLGKNTYIMEINNFADLTDEE 87

Query:   104 FKNKYLGLK-PQFPT-----RRQPSAEFS----YRDVKALPKSVDWRKKGAVTPVKNQGS 153
             FK+   G+  P   T     +R   + F     +RD  ALPKS+DWRK+G VT V+ QG 
Sbjct:    88 FKDMITGITLPINNTMKSLWKRALGSPFPNSWYWRD--ALPKSIDWRKEGYVTRVREQGK 145

Query:   154 CGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDT-SFNNGCNGGLMDYAFKYIVASG 212
             C SCWAF    A+EG     +G LT LS Q L+DC     N GC GG    AF+Y++ +G
Sbjct:   146 CKSCWAFPVAGAIEGQMFKKTGKLTPLSVQNLVDCSKPQGNKGCRGGTTYNAFQYVLQNG 205

Query:   213 GLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQ-PVSVAIEAS 271
             GL  E  YPY  +EG C+   +      I+ +  +PE DE  L+ ALA + PV+  I   
Sbjct:   206 GLESEATYPYKGKEGLCKYNPKNA-YAKITRFVALPE-DEDVLMDALATKGPVAAGIHVV 263

Query:   272 GTDFQFYSGGVFTGP-CGAELDHGVAAVGYG----KSKGSDYIIVKNSWGPKWGERGYIR 326
              + F F SG ++  P C   ++H V  VGYG    ++ G++Y ++KNSWG +WG +GY++
Sbjct:   264 YSYFHFVSG-IYHEPKCNNRVNHAVLVVGYGFEGNETDGNNYWLIKNSWGKQWGLKGYMK 322

Query:   327 MKRNTGKPEGLCGINKMASIPL 348
             + ++       CGI   A  P+
Sbjct:   323 IAKDRNNH---CGIATFAQYPI 341


>UNIPROTKB|E9PTT3 [details] [associations]
            symbol:Ctsr "Protein Ctsr" species:10116 "Rattus
            norvegicus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00639 GeneTree:ENSGT00560000076577
            IPI:IPI00627092 Ensembl:ENSRNOT00000024115 RGD:631422
            Uniprot:E9PTT3
        Length = 334

 Score = 499 (180.7 bits), Expect = 9.8e-48, P = 9.8e-48
 Identities = 117/309 (37%), Positives = 175/309 (56%)

Query:    53 SKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT----SYWLGLNEFADMSHEEFKNKY 108
             +++ K+Y  +EE+ HR  +++EN+K I   N+E +     + + +NEF D++ EEF+   
Sbjct:    34 TEYEKSYT-MEEEGHRRAVWEENMKMIKLHNRENSLGKNGFIMEMNEFGDLTAEEFRKMM 92

Query:   109 LGLKPQFPTRRQPSAEF-SYRDV-KALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAV 166
             + +    P R     +    RDV   LPK VDWRKKG VT V+NQ  C SCWAF+   A+
Sbjct:    93 VNI----PIRSHRKGKIIRKRDVGNVLPKFVDWRKKGYVTRVQNQKFCNSCWAFAVTGAI 148

Query:   167 EGINQIVSGNLTSLSEQELIDCDTSFNN-GCNGGLMDYAFKYIVASGGLHKEEDYPYLME 225
             EG     +G LT LS Q L+DC  S  N GC  G    A++Y++ +GGL  E  YPY  +
Sbjct:   149 EGQMFNKTGQLTPLSVQNLVDCTKSQGNEGCQWGDPHIAYEYVLNNGGLEAEATYPYKGK 208

Query:   226 EGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASGTDFQFYSGGVFT 284
             EG C    +  +   I+G+  +PE+ E  L++A+A   P+SVA++AS   F FY  G++ 
Sbjct:   209 EGVCRYNPKHSKA-EITGFVSLPES-EDILMEAVATIGPISVAVDASFNSFGFYKKGLYD 266

Query:   285 GP-CGAE-LDHGVAAVGYG----KSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLC 338
              P C    ++H V  VGYG    ++ G+ Y ++KNSWG KWG RGY+++ ++       C
Sbjct:   267 EPNCSNNTVNHSVLVVGYGFEGNETDGNSYWLIKNSWGRKWGLRGYMKIPKDQNN---FC 323

Query:   339 GINKMASIP 347
              I   A  P
Sbjct:   324 AIASYAHYP 332


>RGD|1588248 [details] [associations]
            symbol:Cts8 "cathepsin 8" species:10116 "Rattus norvegicus"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            RGD:1588248 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00639 GeneTree:ENSGT00560000076577 IPI:IPI00765053
            RefSeq:NP_001121688.1 UniGene:Rn.220599 Ensembl:ENSRNOT00000061486
            GeneID:680718 KEGG:rno:680718 UCSC:RGD:1588248 CTD:56094
            OMA:DSEWQEW OrthoDB:EOG4JT07C NextBio:719350 Uniprot:D3ZP54
        Length = 333

 Score = 498 (180.4 bits), Expect = 1.2e-47, P = 1.2e-47
 Identities = 114/305 (37%), Positives = 172/305 (56%)

Query:    48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRN----KEVTSYWLGLNEFADMSHEE 103
             ++ W +K+ K Y  +EE+  +  +++EN+K + Q N    +E  ++ + LN FADM+ EE
Sbjct:    29 WQEWKTKYEKNYS-LEEEGQKRAVWEENMKVVKQHNIEYDQEKKNFTMELNAFADMTGEE 87

Query:   104 FKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTV 163
             F+     +  Q   +++   +  +R    LPK VDWR++G VT VKNQG+C SCWAFS  
Sbjct:    88 FRKMMTNIPVQNLRKKKSIHQPIFR---YLPKFVDWRRRGYVTSVKNQGTCNSCWAFSVA 144

Query:   164 AAVEGINQIVSGNLTSLSEQELIDCDT-SFNNGCNGGLMDYAFKYIVASGGLHKEEDYPY 222
              A+EG     +G L SLS Q L+DC     N+GC+ G   YA KY+ ++GGL  E  YPY
Sbjct:   145 GAIEGQMFRKTGRLVSLSPQNLVDCSRPEGNHGCHMGSTLYALKYVWSNGGLEAESTYPY 204

Query:   223 LMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASGTDFQFYSGG 281
               +EG C           ++G+  V  ++E +L+ A+A   P+SV I+AS   F+FY  G
Sbjct:   205 EGKEGPCRYLPRR-SAARVTGFSTVARSEE-ALMHAVATIGPISVGIDASHVSFRFYRRG 262

Query:   282 VFTGP-CGAE-LDHGVAAVGYG----KSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPE 335
             ++  P C +  ++H V  VGYG    +S G  Y ++KNS G  WG  GY+++ R      
Sbjct:   263 IYYEPRCSSNRINHSVLVVGYGYEGRESDGRKYWLIKNSHGVGWGMNGYMKLARGWNNH- 321

Query:   336 GLCGI 340
               CGI
Sbjct:   322 --CGI 324


>DICTYBASE|DDB_G0282991 [details] [associations]
            symbol:DDB_G0282991 species:44689 "Dictyostelium
            discoideum" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 SMART:SM00645
            dictyBase:DDB_G0282991 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            EMBL:AAFI02000049 eggNOG:NOG331187 RefSeq:XP_639299.1
            ProteinModelPortal:Q54RQ2 EnsemblProtists:DDB0185304 GeneID:8623870
            KEGG:ddi:DDB_G0282991 InParanoid:Q54RQ2 OMA:PENGNEY Uniprot:Q54RQ2
        Length = 339

 Score = 490 (177.5 bits), Expect = 8.8e-47, P = 8.8e-47
 Identities = 118/313 (37%), Positives = 170/313 (54%)

Query:    47 LFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKN 106
             LF  W +K+ K Y   +E   RF  FK+N +++DQ N++     L LN FAD+S  E+ N
Sbjct:    26 LFIEWTNKYNKIYSN-KEFYMRFNNFKKNKEYVDQWNEKQLETILELNFFADLSRNEYIN 84

Query:   107 KYLGLKPQFPTRRQPSAEFS---YRDVKALPKSVDWRKKGAVTPVKNQGSC-GSCWAFST 162
              YL          Q + ++      +     KS+DWR   AVTPVKNQG C G+ ++FS 
Sbjct:    85 NYLASFIDISNIEQKNTKYEGNLKNNFNNSIKSIDWRNFDAVTPVKNQGLCSGAGYSFSA 144

Query:   163 VAAVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDYP 221
             +  +E  + I +  L +LSEQ +IDC T   NNGC GGL   AF YI+   G+  E +YP
Sbjct:   145 IGVIESSHFIKNKELITLSEQNIIDCTTDMGNNGCMGGLALIAFDYIIKQKGIDSEFNYP 204

Query:   222 Y---LME----EGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTD 274
             Y   L+E     G C       +  +IS Y ++   +E  L ++L   PVSV I+AS   
Sbjct:   205 YEGYLIEPYEGRGRCRYNSFYSKA-SISSYIEIERFNENELTQSLIKSPVSVMIDASQLS 263

Query:   275 FQFYSGGVFTGP-CGAE-LDHGVAAVGYGKS--KGSDYIIVKNSWGPKWGERGYIRMKRN 330
             F  Y  GV+  P C +  L+HG+  +G+G +   G++Y I+KNS+G KWG +GYI + RN
Sbjct:   264 FMLYKSGVYKDPSCSSTILNHGILNIGFGVTPENGNEYYILKNSFGSKWGMKGYIYLSRN 323

Query:   331 TGKPEGLCGINKM 343
                    CGI+ +
Sbjct:   324 FNNH---CGISSV 333


>UNIPROTKB|Q9UBX1 [details] [associations]
            symbol:CTSF "Cathepsin F" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0004197 "cysteine-type
            endopeptidase activity" evidence=TAS] [GO:0005764 "lysosome"
            evidence=TAS] [GO:0019886 "antigen processing and presentation of
            exogenous peptide antigen via MHC class II" evidence=TAS]
            [GO:0043202 "lysosomal lumen" evidence=TAS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 Reactome:REACT_6900 GO:GO:0019886 eggNOG:COG4870
            HOGENOM:HOG000230774 KO:K01373 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0043202
            GO:GO:0004197 HOVERGEN:HBG011513 EMBL:AJ007331 EMBL:AF088886
            EMBL:AF132894 EMBL:AF136279 EMBL:AF071748 EMBL:AF071749
            EMBL:AK313657 EMBL:BC011682 EMBL:BC036451 EMBL:AL137742
            IPI:IPI00002816 RefSeq:NP_003784.2 UniGene:Hs.11590 PDB:1D5U
            PDB:1M6D PDBsum:1D5U PDBsum:1M6D ProteinModelPortal:Q9UBX1
            SMR:Q9UBX1 STRING:Q9UBX1 MEROPS:C01.018 PhosphoSite:Q9UBX1
            DMDM:12643325 PaxDb:Q9UBX1 PeptideAtlas:Q9UBX1 PRIDE:Q9UBX1
            DNASU:8722 Ensembl:ENST00000310325 GeneID:8722 KEGG:hsa:8722
            UCSC:uc001oip.3 CTD:8722 GeneCards:GC11M066332 HGNC:HGNC:2531
            HPA:CAB002141 MIM:603539 neXtProt:NX_Q9UBX1 PharmGKB:PA27031
            InParanoid:Q9UBX1 OMA:LAPPEWD OrthoDB:EOG4CC41T PhylomeDB:Q9UBX1
            BindingDB:Q9UBX1 ChEMBL:CHEMBL2517 ChiTaRS:CTSF
            EvolutionaryTrace:Q9UBX1 GenomeRNAi:8722 NextBio:32715
            ArrayExpress:Q9UBX1 Bgee:Q9UBX1 CleanEx:HS_CTSF
            Genevestigator:Q9UBX1 GermOnline:ENSG00000174080 Uniprot:Q9UBX1
        Length = 484

 Score = 486 (176.1 bits), Expect = 2.3e-46, P = 2.3e-46
 Identities = 126/329 (38%), Positives = 174/329 (52%)

Query:    29 SIVGYSPEHLTSMD---KLIELFESWMSKHGKTYKCIEEKLHRFEIFKENL---KHIDQR 82
             S++    E   S D   K+  +F++++  + +TY+  EE   R  +F  N+   + I   
Sbjct:   165 SVISLLNEDPLSQDLPVKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQAL 224

Query:    83 NKEVTSYWLGLNEFADMSHEEFKNKYLG-LKPQFPTRRQPSAEFSYRDVKALPKSVDWRK 141
             ++    Y  G+ +F+D++ EEF+  YL  L  + P  +   A+ S  D+   P   DWR 
Sbjct:   225 DRGTAQY--GVTKFSDLTEEEFRTIYLNTLLRKEPGNKMKQAK-SVGDLA--PPEWDWRS 279

Query:   142 KGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLM 201
             KGAVT VK+QG CGSCWAFS    VEG   +  G L SLSEQEL+DCD   +  C GGL 
Sbjct:   280 KGAVTKVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCD-KMDKACMGGLP 338

Query:   202 DYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH 261
               A+  I   GGL  E+DY Y     +C    E+ +V  I+   ++ +N EQ L   LA 
Sbjct:   339 SNAYSAIKNLGGLETEDDYSYQGHMQSCNFSAEKAKVY-INDSVELSQN-EQKLAAWLAK 396

Query:   262 Q-PVSVAIEASGTDFQFYSGGVFTG--P-CGAEL-DHGVAAVGYGKSKGSDYIIVKNSWG 316
             + P+SVAI A G   QFY  G+     P C   L DH V  VGYG      +  +KNSWG
Sbjct:   397 RGPISVAINAFG--MQFYRHGISRPLRPLCSPWLIDHAVLLVGYGNRSDVPFWAIKNSWG 454

Query:   317 PKWGERGYIRMKRNTGKPEGLCGINKMAS 345
               WGE+GY  + R +G     CG+N MAS
Sbjct:   455 TDWGEKGYYYLHRGSGA----CGVNTMAS 479


>MGI|MGI:1927229 [details] [associations]
            symbol:Ctsm "cathepsin M" species:10090 "Mus musculus"
            [GO:0005575 "cellular_component" evidence=ND] [GO:0005764
            "lysosome" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0008152 "metabolic process" evidence=ISS] [GO:0008233
            "peptidase activity" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=ISS] [GO:0016787 "hydrolase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            InterPro:IPR015645 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            MGI:MGI:1927229 eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GeneTree:ENSGT00560000076577 HOVERGEN:HBG011513 EMBL:AF202528
            EMBL:AY014777 EMBL:AY057446 EMBL:AK005550 EMBL:AK005428
            IPI:IPI00131133 RefSeq:NP_071721.2 UniGene:Mm.279933
            ProteinModelPortal:Q9JL96 SMR:Q9JL96 STRING:Q9JL96 MEROPS:C01.023
            PRIDE:Q9JL96 DNASU:64139 Ensembl:ENSMUST00000099451 GeneID:64139
            KEGG:mmu:64139 UCSC:uc007qwj.1 CTD:64139 InParanoid:Q9JL96
            KO:K09600 OrthoDB:EOG4TTGKR NextBio:319931 Bgee:Q9JL96
            CleanEx:MM_CTSM Genevestigator:Q9JL96 GermOnline:ENSMUSG00000074484
            GermOnline:ENSMUSG00000074871 PANTHER:PTHR12411:SF58 Uniprot:Q9JL96
        Length = 333

 Score = 486 (176.1 bits), Expect = 2.3e-46, P = 2.3e-46
 Identities = 115/319 (36%), Positives = 176/319 (55%)

Query:    42 DKLIEL-FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKE----VTSYWLGLNEF 96
             D ++++ ++ W  K+GK Y  +EE+  +  ++++N+K I   N E       + + +N F
Sbjct:    22 DPILDVEWQKWKIKYGKAYS-LEEEGQKRAVWEDNMKKIKLHNGENGLGKHGFTMEMNAF 80

Query:    97 ADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGS 156
              DM+ EEF+   + + P  PT ++  +      V  LPK ++W+K+G VTPV+ QG C S
Sbjct:    81 GDMTLEEFRKVMIEI-P-VPTVKKGKSVQKRLSVN-LPKFINWKKRGYVTPVQTQGRCNS 137

Query:   157 CWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNN-GCNGGLMDYAFKYIVASGGLH 215
             CWAFS   A+EG     +G L  LS Q L+DC     N GC  G    A  Y++ +GGL 
Sbjct:   138 CWAFSVTGAIEGQMFRKTGQLIPLSVQNLVDCSRPQGNWGCYLGNTYLALHYVMENGGLE 197

Query:   216 KEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASGTD 274
              E  YPY  ++G+C    E      I+G++ VP+N E +L+ A+A   P+SVAI+A    
Sbjct:   198 SEATYPYEEKDGSCRYSPEN-STANITGFEFVPKN-EDALMNAVASIGPISVAIDARHAS 255

Query:   275 FQFYSGGVFTGP-CGA-ELDHGVAAVGYG----KSKGSDYIIVKNSWGPKWGERGYIRMK 328
             F FY  G++  P C +  + H +  VGYG    +S G  Y +VKNS G +WG +GY+++ 
Sbjct:   256 FLFYKRGIYYEPNCSSCVVTHSMLLVGYGFTGRESDGRKYWLVKNSMGTQWGNKGYMKIS 315

Query:   329 RNTGKPEGLCGINKMASIP 347
             R+ G     CGI   A  P
Sbjct:   316 RDKGNH---CGIATYALYP 331


>UNIPROTKB|E2RR02 [details] [associations]
            symbol:CTSF "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00139 PROSITE:PS00639
            OMA:LAPPEWD GeneTree:ENSGT00660000095458 EMBL:AAEX03011628
            Ensembl:ENSCAFT00000019742 Uniprot:E2RR02
        Length = 460

 Score = 485 (175.8 bits), Expect = 3.0e-46, P = 3.0e-46
 Identities = 122/312 (39%), Positives = 167/312 (53%)

Query:    43 KLIELFESWMSKHGKTYKCIEEKLHRFEIFKENL---KHIDQRNKEVTSYWLGLNEFADM 99
             K+  +F+ +++ + +TY+  EE   R  +F  N+   + I   ++    Y  G+ +F+D+
Sbjct:   157 KMASVFKEFVTTYNRTYETKEEAEWRMSVFSNNMVRAQKIQALDRGTAQY--GITKFSDL 214

Query:   100 SHEEFKNKYLG-LKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCW 158
             + EEF+  YL  L  +   ++   A+ S  D  A P   DWR KGAVT VK+QG CGSCW
Sbjct:   215 TEEEFRTIYLNPLLRENRGKKMRLAK-SISD-HAPPPEWDWRSKGAVTKVKDQGMCGSCW 272

Query:   159 AFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEE 218
             AFS    VEG   +  G L SLSEQEL+DCD   +  C GGL   A+  I+  GGL  E+
Sbjct:   273 AFSVTGNVEGQWFLKEGTLLSLSEQELLDCD-KVDKACLGGLPSNAYSAIMTLGGLETED 331

Query:   219 DYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQ-PVSVAIEASGTDFQF 277
             DY Y      C    ++  V  I+   ++ +N EQ L   LA + P+SVAI A G   QF
Sbjct:   332 DYSYQGHLQACSFSAKKARVY-INDSMELSQN-EQKLAAWLAKKGPISVAINAFG--MQF 387

Query:   278 YSGGVF--TGP-CGAEL-DHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGK 333
             Y  G+     P C   L DH V  VGYG   G  +  +KNSWG  WGE GY  + R +G 
Sbjct:   388 YRHGISHPLRPLCSPWLIDHAVLLVGYGNRSGIPFWAIKNSWGTDWGEEGYYYLHRGSGA 447

Query:   334 PEGLCGINKMAS 345
                 CG+N MAS
Sbjct:   448 ----CGVNTMAS 455


>RGD|1308181 [details] [associations]
            symbol:Ctsf "cathepsin F" species:10116 "Rattus norvegicus"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 RGD:1308181 eggNOG:COG4870 HOGENOM:HOG000230774
            KO:K01373 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00139 PROSITE:PS00639 HOVERGEN:HBG011513 MEROPS:C01.018
            CTD:8722 OMA:LAPPEWD OrthoDB:EOG4CC41T GeneTree:ENSGT00660000095458
            EMBL:CH473953 EMBL:BC099780 EMBL:EU253481 IPI:IPI00201100
            RefSeq:NP_001029282.1 UniGene:Rn.25087 SMR:Q499S6
            Ensembl:ENSRNOT00000026718 GeneID:361704 KEGG:rno:361704
            UCSC:RGD:1308181 InParanoid:Q499S6 NextBio:677325
            Genevestigator:Q499S6 Uniprot:Q499S6
        Length = 462

 Score = 484 (175.4 bits), Expect = 3.8e-46, P = 3.8e-46
 Identities = 122/311 (39%), Positives = 162/311 (52%)

Query:    43 KLIELFESWMSKHGKTYKCIEEKLHRFEIFKENL---KHIDQRNKEVTSYWLGLNEFADM 99
             K+  LF+ +M+ + +TY+  EE   R  +F  N+   + I   ++    Y  G+ +F+D+
Sbjct:   160 KMATLFKDFMTTYNRTYESREEAQWRLTVFARNMIRAQKIQALDRGTAQY--GITKFSDL 217

Query:   100 SHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWA 159
             + EEF   YL    Q  +  + S   S  D+   P   DWRKKGAVT VK+QG CGSCWA
Sbjct:   218 TEEEFHTIYLNPLLQKESGGKMSLAKSINDLA--PPEWDWRKKGAVTEVKDQGMCGSCWA 275

Query:   160 FSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEED 219
             FS    VEG   +  G L SLSEQEL+DCD   +  C GGL   A+  I   GGL  E+D
Sbjct:   276 FSVTGNVEGQWFLNRGTLLSLSEQELLDCD-KMDKACMGGLPSNAYTAIKNLGGLETEDD 334

Query:   220 YPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQ-PVSVAIEASGTDFQFY 278
             Y Y      C +   +M  V I+   ++   DE  +   LA + P+SVAI A G   QFY
Sbjct:   335 YGYQGHVQAC-NFSTQMAKVYINDSVEL-SRDENKIAAWLAQKGPISVAINAFG--MQFY 390

Query:   279 SGGV---FTGPCGAE-LDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKP 334
               G+   F   C    +DH V  VGYG      Y  +KNSWG  WGE GY  + R +G  
Sbjct:   391 RHGIAHPFRPLCSPWFIDHAVLLVGYGNRSNIPYWAIKNSWGRDWGEEGYYYLYRGSGA- 449

Query:   335 EGLCGINKMAS 345
                CG+N MAS
Sbjct:   450 ---CGVNTMAS 457


>UNIPROTKB|Q0VCU3
            symbol:CTSF "Uncharacterized protein" species:9913 "Bos
            taurus" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 eggNOG:COG4870
            HOGENOM:HOG000230774 KO:K01373 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00139 PROSITE:PS00639
            HOVERGEN:HBG011513 MEROPS:C01.018 CTD:8722 OMA:LAPPEWD
            OrthoDB:EOG4CC41T GeneTree:ENSGT00660000095458 EMBL:DAAA02063594
            EMBL:BC120003 IPI:IPI00717812 RefSeq:NP_001068884.1 UniGene:Bt.7264
            SMR:Q0VCU3 Ensembl:ENSBTAT00000014587 GeneID:509715 KEGG:bta:509715
            InParanoid:Q0VCU3 NextBio:20869091 Uniprot:Q0VCU3
        Length = 460

 Score = 483 (175.1 bits), Expect = 4.8e-46, P = 4.8e-46
 Identities = 124/312 (39%), Positives = 164/312 (52%)

Query:    43 KLIELFESWMSKHGKTYKCIEEKLHRFEIFKENL---KHIDQRNKEVTSYWLGLNEFADM 99
             K+  +F+ +++ + +TY   EE   R  +F  N+   + I   ++    Y  G+ +F+D+
Sbjct:   158 KMASIFKDFVTTYNRTYDSQEEASWRMSVFANNMVRAQKIQALDRGTARY--GVTKFSDL 215

Query:   100 SHEEFKNKYLG-LKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCW 158
             + EEF+  YL  L    P R    A+    DV   P   DWR KGAVT VK+QG CGSCW
Sbjct:   216 TEEEFRTIYLNPLLKDAPGRNMRPAQ-PVTDVP--PPQWDWRNKGAVTNVKDQGMCGSCW 272

Query:   159 AFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEE 218
             AFS    VEG   +  G L SLSEQEL+DCD + +  C GGL   A+  I   GGL  E+
Sbjct:   273 AFSVTGNVEGQWFLKRGTLLSLSEQELLDCDKT-DKACLGGLPSNAYSAIRTLGGLETED 331

Query:   219 DYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQ-PVSVAIEASGTDFQF 277
             DY Y     TC    E+ +V  I+   ++ +N EQ L   LA   PVS+AI A G   QF
Sbjct:   332 DYSYRGRLQTCSFSAEKAKVY-INDSVELSKN-EQKLAAWLAKNGPVSIAINAFG--MQF 387

Query:   278 YSGGVF--TGP-CGAEL-DHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGK 333
             Y  G+     P C   L DH V  VGYG      +  +KNSWG  WGE GY  + R +G 
Sbjct:   388 YRHGISHPLRPLCSPWLIDHAVLLVGYGNRSAIPFWAIKNSWGTDWGEEGYYYLHRGSGA 447

Query:   334 PEGLCGINKMAS 345
                 CG+N MAS
Sbjct:   448 ----CGVNIMAS 455


>UNIPROTKB|F1NT07
            symbol:LOC100857883 "Uncharacterized protein" species:9031
            "Gallus gallus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 GeneTree:ENSGT00660000095458 EMBL:AADN02044012
            EMBL:AADN02044013 EMBL:AADN02044014 IPI:IPI00577314
            Ensembl:ENSGALT00000000192 OMA:IYKHGPV Uniprot:F1NT07
        Length = 317

 Score = 481 (174.4 bits), Expect = 7.9e-46, P = 7.9e-46
 Identities = 112/310 (36%), Positives = 161/310 (51%)

Query:    48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNK 107
             F  +  + G+ Y    E  HR  IF  +++ +  +N+   SY L LN  AD + +E    
Sbjct:    12 FHHYRRRLGRPYGSAREMEHRQRIFAHHMRFVHSKNRAALSYSLALNHLADRTPQEMAAL 71

Query:   108 YLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVE 167
                 +   P    P     Y  +  LP+S+DWR  GAVTPVK+Q  CGSCW+F+T  A+E
Sbjct:    72 RGRRRSGDPNHGLPFPAEHYTGI-ILPESLDWRMYGAVTPVKDQAVCGSCWSFATTGAME 130

Query:   168 GINQIVSGNLTSLSEQELIDCDTSFNN-GCNGGLMDYAFKYIVASGGLHKEED---YPYL 223
             G   + +G LT LS+Q LIDC     N  C+GG    A  +I   GG+   E    +P +
Sbjct:   131 GALFLKTGVLTPLSQQVLIDCSWGKGNYACDGGEEWRAKGWIKKHGGIASTESPPSFPLV 190

Query:   224 MEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKAL-AHQPVSVAIEASGTDFQFYSGGV 282
             ++ G C   + EM +  I+GY +V   +  ++  A+  H PV+V+I+AS   F FYS G+
Sbjct:   191 LQNGLCHYNQSEM-LAKITGYVNVTSGNITAVKTAIYKHGPVAVSIDASHKTFSFYSNGI 249

Query:   283 FTGP-CG---AELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLC 338
             +  P C     +LDH V AVGYG  +G  Y ++KNSW   WG  GYI M          C
Sbjct:   250 YYEPKCANKPGQLDHAVLAVGYGVLQGETYWLIKNSWSTYWGNDGYILMAMKDNN----C 305

Query:   339 GINKMASIPL 348
             G+   A+ P+
Sbjct:   306 GVATEATYPI 315


>ZFIN|ZDB-GENE-050417-107
            symbol:zgc:110239 "zgc:110239" species:7955 "Danio
            rerio" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0016787 "hydrolase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 ZFIN:ZDB-GENE-050417-107
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 MEROPS:I29.003 OrthoDB:EOG412M56 EMBL:BC092817
            IPI:IPI00503987 RefSeq:NP_001017633.1 UniGene:Dr.39081
            ProteinModelPortal:Q568K7 GeneID:550326 KEGG:dre:550326
            HOGENOM:HOG000007373 HOVERGEN:HBG105018 InParanoid:Q568K7
            NextBio:20879584 ArrayExpress:Q568K7 Uniprot:Q568K7
        Length = 546

 Score = 479 (173.7 bits), Expect = 1.3e-45, P = 1.3e-45
 Identities = 117/303 (38%), Positives = 158/303 (52%)

Query:    47 LFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKN 106
             +F  +  K  + Y    E   R   F  N++++   N+   S+ L +N  AD S +E  +
Sbjct:   242 MFGHYKEKFNRQYDNEMEHEEREHNFVHNIRYVHSMNRAGLSFSLSVNHLADRSQKEL-S 300

Query:   107 KYLGLKPQFPTRR--QPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVA 164
                G +      R  QP      R + A P SVDWR  GAVTPVK+Q  CGSCW+F+T  
Sbjct:   301 MMRGCQRTHKVHRKAQPFPS-EIRSI-ATPNSVDWRLYGAVTPVKDQAVCGSCWSFATTG 358

Query:   165 AVEGINQIVSGNLTSLSEQELIDCDTSF-NNGCNGGLMDYAFKYIVASGGLHKEEDY-PY 222
              +EG   + +G LTSLS+Q L+DC   F NNGC+GG    AF++I+  GG+   E Y  Y
Sbjct:   359 TLEGALFLKTGQLTSLSQQMLVDCTWGFGNNGCDGGEEWRAFEWIMKHGGISTAESYGAY 418

Query:   223 LMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKAL-AHQPVSVAIEASGTDFQFYSGG 281
             +   G C   K  M V  ++GY +V   D  +L  A+    PV+V+I+A+   F FYS G
Sbjct:   419 MGMNGLCHYDKSSM-VAQLTGYTNVTSGDILALKAAIFKFGPVAVSIDAAHRSFAFYSNG 477

Query:   282 VFTGP-C--GA-ELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGL 337
             V+  P C  G  +LDH V AVGYG      Y +VKNSW   WG  GYI M          
Sbjct:   478 VYYEPECKNGINDLDHAVLAVGYGIMNNESYWLVKNSWSSYWGNDGYILMSMKDNN---- 533

Query:   338 CGI 340
             CG+
Sbjct:   534 CGV 536


>WB|WBGene00019986
            symbol:R09F10.1 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0016021 "integral to
            membrane" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076599
            EMBL:FO081137 HSSP:P53634 PIR:D89588 RefSeq:NP_509408.1
            ProteinModelPortal:Q23030 SMR:Q23030 STRING:Q23030 MEROPS:C01.A44
            PaxDb:Q23030 EnsemblMetazoa:R09F10.1 GeneID:181087
            KEGG:cel:CELE_R09F10.1 UCSC:R09F10.1 CTD:181087 WormBase:R09F10.1
            InParanoid:Q23030 OMA:EYPYSAL NextBio:912346 Uniprot:Q23030
        Length = 383

 Score = 478 (173.3 bits), Expect = 1.6e-45, P = 1.6e-45
 Identities = 113/315 (35%), Positives = 161/315 (51%)

Query:    46 ELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFK 105
             ++F  ++ K  + Y  +EE  +R++IF  N+   +   +      L +NEF D + EE +
Sbjct:    80 QMFNDFILKFDRKYTSVEEFEYRYQIFLRNVIEFEAEEERNLGLDLDVNEFTDWTDEELQ 139

Query:   106 -----NKYLGLKPQFPTRRQPSAEFSYRDVKAL-PKSVDWRKKGAVTPVKNQGSCGSCWA 159
                  NKY   K  F T   P  E SY +   + P S+DWR++G +TP+KNQG CGSCWA
Sbjct:   140 KMVQENKYT--KYDFDT---PKFEGSYLETGVIRPASIDWREQGKLTPIKNQGQCGSCWA 194

Query:   160 FSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEED 219
             F+TVA+VE  N I  G L SLSEQE++DCD   NNGC+GG   YA K+ V   GL  E++
Sbjct:   195 FATVASVEAQNAIKKGKLVSLSEQEMVDCDGR-NNGCSGGYRPYAMKF-VKENGLESEKE 252

Query:   220 YPY-LMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFY 278
             YPY  ++   C  K+ +  V  I  ++ +  N+E          PV+  +      +  Y
Sbjct:   253 YPYSALKHDQCFLKENDTRVF-IDDFRMLSNNEEDIANWVGTKGPVTFGMNVVKAMYS-Y 310

Query:   279 SGGVFTGP---CGAEL--DHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGK 333
               G+F      C  +    H +  +GYG    S Y IVKNSWG  WG  GY R+ R    
Sbjct:   311 RSGIFNPSVEDCTEKSMGAHALTIIGYGGEGESAYWIVKNSWGTSWGASGYFRLARGVNS 370

Query:   334 PEGLCGINKMASIPL 348
                 CG+      P+
Sbjct:   371 ----CGLANTVVAPI 381


>UNIPROTKB|G3V9F8
            symbol:Ctsm "RCG24133" species:10116 "Rattus norvegicus"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015645 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00639 GeneTree:ENSGT00560000076577 EMBL:CH474032
            PANTHER:PTHR12411:SF58 Ensembl:ENSRNOT00000045830 RGD:631420
            Uniprot:G3V9F8
        Length = 333

 Score = 474 (171.9 bits), Expect = 4.4e-45, P = 4.4e-45
 Identities = 113/312 (36%), Positives = 170/312 (54%)

Query:    48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKE----VTSYWLGLNEFADMSHEE 103
             ++ W  K+ KTY  +EE+  +  +++EN+K I   N E       + + +N F DM+ EE
Sbjct:    29 WQKWKIKYEKTYS-LEEEGQKRAVWEENMKKIKLHNGENGLGKHGFTMEMNAFGDMTIEE 87

Query:   104 FKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTV 163
             F+   L ++   PT ++ ++    R    +P  ++WRK+G VTPV+ QG C  CWAFS  
Sbjct:    88 FRK--LMIEIPIPTVKKENS-VQKRQAVNVPNFINWRKRGYVTPVRRQGRCNVCWAFSVA 144

Query:   164 AAVEGINQIVSGNLTSLSEQELIDCDTSFNN-GCNGGLMDYAFKYIVASGGLHKEEDYPY 222
              A+EG     +G L  LS Q L+DC     N GC  G    A +Y+  +GGL  E  YPY
Sbjct:   145 GAIEGQMFQKTGQLIPLSVQNLVDCSRPQGNLGCYLGNTYLALQYVKENGGLESEATYPY 204

Query:   223 LMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQ-PVSVAIEASGTDFQFYSGG 281
               +EG+C    +     +I+ ++ VP+N E +L+ A+A   P+SVAI+A    F FY  G
Sbjct:   205 EEKEGSCRYHPDN-STASITDFEFVPKN-EDALMNAVATLGPISVAIDARHESFLFYRNG 262

Query:   282 VFTGP-CGAEL-DHGVAAVGYG----KSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPE 335
             ++  P C + +  H +  VGYG    +S G  Y I+KNS G KWG RGY+++ ++ G   
Sbjct:   263 IYHEPNCSSSVVTHAMLLVGYGFVGEESDGRKYWILKNSMGNKWGNRGYMKIAKDQGNH- 321

Query:   336 GLCGINKMASIP 347
               CGI   A  P
Sbjct:   322 --CGIATYALYP 331


>ZFIN|ZDB-GENE-030131-9831
            symbol:ctsf "cathepsin F" species:7955 "Danio
            rerio" [GO:0004869 "cysteine-type endopeptidase inhibitor activity"
            evidence=IEA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000010 InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00031 Pfam:PF00112 PRINTS:PR00705 SMART:SM00043
            SMART:SM00645 InterPro:IPR000169 ZFIN:ZDB-GENE-030131-9831
            GO:GO:0004869 eggNOG:COG4870 HOGENOM:HOG000230774 KO:K01373
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00139
            PROSITE:PS00639 HOVERGEN:HBG011513 CTD:8722 OrthoDB:EOG4CC41T
            MEROPS:I25.006 EMBL:BC124243 IPI:IPI00503226 RefSeq:NP_001071036.1
            UniGene:Dr.81265 ProteinModelPortal:Q08CH0 SMR:Q08CH0 GeneID:565588
            KEGG:dre:565588 InParanoid:Q08CH0 NextBio:20885952
            ArrayExpress:Q08CH0 Uniprot:Q08CH0
        Length = 473

 Score = 472 (171.2 bits), Expect = 7.1e-45, P = 7.1e-45
 Identities = 115/311 (36%), Positives = 171/311 (54%)

Query:    43 KLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQ-RNKEVTSYWLGLNEFADMSH 101
             +L+ +F+++M  + +TY   EE   R  IF++N+K     ++ E  S   G+ +F+D++ 
Sbjct:   170 ELLTMFKNFMITYNRTYSSQEEAEKRLRIFQQNMKTAQTLQSLEQGSAEYGITKFSDLTE 229

Query:   102 EEFKNKYLG-LKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAF 160
             +EF+  YL  +  Q+  +++           A P + DWR  GAV+PVKNQG CGSCWAF
Sbjct:   230 DEFRMMYLNPMLSQWSLKKEMKPAIP-ASAPA-PDTWDWRDHGAVSPVKNQGMCGSCWAF 287

Query:   161 STVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDY 220
             S    +EG     +G L SLSEQEL+DCD   +  C GGL   A++ I   GGL  E DY
Sbjct:   288 SVTGNIEGQWFKKTGQLLSLSEQELVDCD-KLDQACGGGLPSNAYEAIENLGGLETETDY 346

Query:   221 PYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQ-PVSVAIEASGTDFQFYS 279
              Y   + +C+    ++    I+   ++P+ DE+ +   LA   PVS A+ A     QFY 
Sbjct:   347 SYTGHKQSCDFSTGKVAAY-INSSVELPK-DEKEIAAFLAENGPVSAALNAFA--MQFYR 402

Query:   280 GGVFTGP----CGA-ELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKP 334
              GV + P    C    +DH V  VG+G+  G  +  +KNSWG  +GE+GY  + R +G  
Sbjct:   403 KGV-SHPLKIFCNPWMIDHAVLLVGFGQRNGVPFWAIKNSWGEDYGEQGYYYLYRGSG-- 459

Query:   335 EGLCGINKMAS 345
               LCGI+KM S
Sbjct:   460 --LCGIHKMCS 468


>DICTYBASE|DDB_G0272742
            symbol:DDB_G0272742 species:44689 "Dictyostelium
            discoideum" [GO:0005615 "extracellular space" evidence=IDA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 SMART:SM00645
            dictyBase:DDB_G0272742 GO:GO:0005615 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00639 EMBL:AAFI02000008
            eggNOG:NOG331187 RefSeq:XP_644986.1 ProteinModelPortal:Q7KWP5
            PRIDE:Q7KWP5 EnsemblProtists:DDB0168242 GeneID:8618663
            KEGG:ddi:DDB_G0272742 InParanoid:Q7KWP5 OMA:ATESAHF Uniprot:Q7KWP5
        Length = 345

 Score = 459 (166.6 bits), Expect = 1.7e-43, P = 1.7e-43
 Identities = 121/335 (36%), Positives = 179/335 (53%)

Query:    40 SMDKLIEL-----FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLN 94
             S  KL E+     F +WM+ + +TY    E  +R+  FK NL  I+Q N + +   L LN
Sbjct:    16 SFSKLTEIQYRNEFTAWMTSNQRTYAS-SEFTNRYNTFKSNLDFINQWNSKGSKTVLALN 74

Query:    95 EFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKS---------VDWRKKGAV 145
             EFAD+S+EE++  YL  +      +  S   + ++ K +  S         +DWRKKGAV
Sbjct:    75 EFADISNEEYRKNYL--RNDNNINKLSSLLINDKEDKEIKSSSSSGSGSSGIDWRKKGAV 132

Query:   146 TPVKNQ-GSCGSCWAFSTVAAVEGINQIVSGN--LTSLSEQELIDCDTSFNNGCNGGLMD 202
               VK+Q G CGS W  + V A E  + + +      SLS Q LIDC ++ N  C  G ++
Sbjct:   133 PSVKSQIGGCGS-WPITAVGATESAHFLANPKDPFISLSMQNLIDC-SNLNKQCYQGTVN 190

Query:   203 YAFKYIVASGGLHKEEDYPYLM-EEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH 261
              AF+YI+ +GG+  EE Y +   E G C+       V  I+ Y+ V    E SL  A++ 
Sbjct:   191 EAFQYIIENGGIDSEESYKFSGGEPGKCKYNSSN-SVAKITSYEKVKSGSESSLESAVSL 249

Query:   262 QPVSVAIEASGTDFQFYSGGVFTGP-CGA-ELDHGVAAVGYGK---------SKGSDYII 310
             +PV+  I+AS + FQFYS G++  P C + +L+H +  VG+              S+Y I
Sbjct:   250 KPVAAYIDASLSSFQFYSSGIYYEPSCNSTDLNHSILIVGFSDFSTTPTDSLKHSSNYWI 309

Query:   311 VKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMAS 345
             V+NS+G  WGE GYI M ++    +  CGI+KMAS
Sbjct:   310 VQNSFGKNWGENGYIFMSKDR---DDNCGISKMAS 341


>DICTYBASE|DDB_G0274385
            symbol:DDB_G0274385 "Cysteine proteinase 1,
            mitochondrial" species:44689 "Dictyostelium discoideum" [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0006508
            "proteolysis" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 SMART:SM00645 InterPro:IPR000169
            dictyBase:DDB_G0274385 eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00139 EMBL:AAFI02000012 RefSeq:XP_644301.1
            ProteinModelPortal:Q86KD4 EnsemblProtists:DDB0167535 GeneID:8619729
            KEGG:ddi:DDB_G0274385 InParanoid:Q86KD4 OMA:SICVDAS Uniprot:Q86KD4
        Length = 358

 Score = 458 (166.3 bits), Expect = 2.2e-43, P = 2.2e-43
 Identities = 121/319 (37%), Positives = 169/319 (52%)

Query:    46 ELFESWMSKHGKTYKCIEEKLHRFEIFKENLK-HIDQRNKEVTSYWLGLNEFADMSHEEF 104
             + F  W  KH K YK   E  +RF  FKEN+K +I+  +          N F+D+S EEF
Sbjct:    42 DTFNHWAKKHSKIYKDSIEMENRFSNFKENMKKNIELNSMHAGKAKFESNGFSDLSEEEF 101

Query:   105 KNKYLGL----KPQF---PTRRQPSAEFS----YRDVKA--LPK--SVDWRKKGAVTPVK 149
              N +L      KP       + QP+   S    Y++++   L +  S+DWRKKG VTPVK
Sbjct:   102 SNFHLNKAFKGKPSHLRNSIKPQPTPHHSLINGYKEMENGDLNELYSIDWRKKGLVTPVK 161

Query:   150 NQGSCGSCWAFSTVAAVEGINQIVSGNLTSL-SEQELIDCDTSFNNGCNGGLMDYAFKYI 208
             +QG CGSC+ FS V  +E    I +GN   L SEQ+ +DCD  ++  C GG     ++Y 
Sbjct:   162 DQGQCGSCYIFSAVEQIETA-WIKAGNKPILLSEQQAVDCDP-YDGQCGGGDPYTVYEYF 219

Query:   209 VASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPEN-DEQSLLKALAHQ-PVSV 266
                GG+     YPY   +GTC +    + VV+   Y  V +  DE +L+K + +  PVS+
Sbjct:   220 SQVGGVSTNAQYPYTATDGTCVNMSRAVPVVS---YHYVTQGGDENTLIKTIVNDGPVSI 276

Query:   267 AIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGY--GKSKGSD---YIIVKNSWGPKWGE 321
              ++AS   +Q YSGG+ T  CG  +DH V  VG    K+  S+   Y I++NSWG  WG 
Sbjct:   277 CVDAS--TWQSYSGGIITTGCGKNIDHCVQVVGLEVDKTDPSNPVQYYIIRNSWGTDWGI 334

Query:   322 RGYIRMKRNTGKPEGLCGI 340
              GYI +   TG    LCGI
Sbjct:   335 DGYIYVA--TGSD--LCGI 349


>DICTYBASE|DDB_G0281077
            symbol:DDB_G0281077 species:44689 "Dictyostelium
            discoideum" [GO:0030246 "carbohydrate binding" evidence=IEA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR008965 InterPro:IPR013128 InterPro:IPR019028
            Pfam:PF00112 Pfam:PF09478 PRINTS:PR00705 SMART:SM00645
            SMART:SM01063 InterPro:IPR000169 dictyBase:DDB_G0281077
            GO:GO:0030246 EMBL:AAFI02000040 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00139
            PROSITE:PS00639 SUPFAM:SSF49384 ProtClustDB:CLSZ2430562
            RefSeq:XP_640803.1 ProteinModelPortal:Q54UH3
            EnsemblProtists:DDB0203998 GeneID:8622857 KEGG:ddi:DDB_G0281077
            InParanoid:Q54UH3 OMA:LINDFNF Uniprot:Q54UH3
        Length = 662

 Score = 365 (133.5 bits), Expect = 3.8e-42, Sum P(2) = 3.8e-42
 Identities = 77/183 (42%), Positives = 107/183 (58%)

Query:   134 PKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFN 193
             P S+DWR  G V+ VKNQGSCGSC+AFSTV A+E      +  + +LSEQ L+DC  ++ 
Sbjct:   472 PISIDWRTWGMVSKVKNQGSCGSCYAFSTVGALEAHYYRKNNRMLNLSEQNLVDCTRNYG 531

Query:   194 NG-CNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDE 252
             NG C+GG M   F+YI  +GG++ +  YPY    G C     + +   IS Y  + ++DE
Sbjct:   532 NGECSGGWMHNCFRYIKENGGINLQSTYPYEGRVGLCRYNSGDAQS-RISNYVMIKQHDE 590

Query:   253 QSLLKALAHQ-PVSVAIEASGTDFQFYSGGVFTGP-CGA-ELDHGVAAVGYGKSKGSDYI 309
             + L  A+A   PVSVA +AS  +F +YS G++    C      H V  VGYG   G D+ 
Sbjct:   591 EDLANAVASVGPVSVAYDASTREFMYYSSGIYNSDSCDKYRTTHAVVVVGYGIENGVDFW 650

Query:   310 IVK 312
             I+K
Sbjct:   651 IIK 653

 Score = 111 (44.1 bits), Expect = 3.8e-42, Sum P(2) = 3.8e-42
 Identities = 22/63 (34%), Positives = 41/63 (65%)

Query:    48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQ--RNKEVTSYWLGLNEFADMSHEEFK 105
             F  W ++  +TY+  ++ L ++E FK++ + I+Q  R  + ++  LGL +F+DM+H+EF 
Sbjct:   162 FIQWSNQFNRTYRA-DQFLLKYEAFKDSSRFIEQYKRENQNSTMELGLTQFSDMTHDEFL 220

Query:   106 NKY 108
             N Y
Sbjct:   221 NIY 223


>FB|FBgn0033874
            symbol:CG6347 species:7227 "Drosophila melanogaster"
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=NAS]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            EMBL:AE013599 eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00639
            GeneTree:ENSGT00560000076599 HSSP:P53634 EMBL:AY069609
            RefSeq:NP_610906.1 UniGene:Dm.608 SMR:Q7K0S6 MEROPS:C01.A29
            EnsemblMetazoa:FBtr0087637 GeneID:36531 KEGG:dme:Dmel_CG6347
            UCSC:CG6347-RA FlyBase:FBgn0033874 InParanoid:Q7K0S6 OMA:FEYIRDH
            OrthoDB:EOG4FQZ74 GenomeRNAi:36531 NextBio:799046 Uniprot:Q7K0S6
        Length = 352

 Score = 444 (161.4 bits), Expect = 6.6e-42, P = 6.6e-42
 Identities = 109/322 (33%), Positives = 164/322 (50%)

Query:    45 IELFESWMSKHGKTYKCIEEKLHR-FEIFKENLKHIDQRNKE--VTSYWLGLNEFADMSH 101
             ++ F+ ++ + GK Y   E          K +L  +  +N +  V+ + LG+N  ADM+ 
Sbjct:    35 VQNFDDFLRQTGKVYSDEERVYRESIFAAKMSLITLSNKNADNGVSGFRLGVNTLADMTR 94

Query:   102 EEFKNKYLGLKPQFPTRRQPSAEFSY---RDVKA--LPKSVDWRKKGAVTPVKNQG-SCG 155
             +E     LG K      R  +   ++   R+  +  LP+  DWR+KG VTP   QG  CG
Sbjct:    95 KEIAT-LLGSKISEFGERYTNGHINFVTARNPASANLPEMFDWREKGGVTPPGFQGVGCG 153

Query:   156 SCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNN-GCNGGLMDYAFKYIVASGGL 214
             +CW+F+T  A+EG     +G L SLS+Q L+DC   + N GC+GG  +Y F+YI    G+
Sbjct:   154 ACWSFATTGALEGHLFRRTGVLASLSQQNLVDCADDYGNMGCDGGFQEYGFEYI-RDHGV 212

Query:   215 HKEEDYPYLMEEGTCEDKKE-----EMEVVTISGYQDVPENDEQSLLKALAHQ-PVSVAI 268
                  YPY   E  C   +         +V I  Y  +   DE+ + + +A   P++ ++
Sbjct:   213 TLANKYPYTQTEMQCRQNETAGRPPRESLVKIRDYATITPGDEEKMKEVIATLGPLACSM 272

Query:   269 EASGTDFQFYSGGVFTGP-CG-AELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIR 326
              A    F+ YSGG++    C   EL+H V  VGYG   G DY I+KNS+   WGE G++R
Sbjct:   273 NADTISFEQYSGGIYEDEECNQGELNHSVTVVGYGTENGRDYWIIKNSYSQNWGEGGFMR 332

Query:   327 MKRNTGKPEGLCGINKMASIPL 348
             + RN G   G CGI    S P+
Sbjct:   333 ILRNAG---GFCGIASECSYPI 351


>WB|WBGene00012747
            symbol:Y40H7A.10 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00139
            GeneTree:ENSGT00560000076599 HOGENOM:HOG000230773 EMBL:AL033510
            HSSP:P80067 MEROPS:C01.A48 PIR:T26792 RefSeq:NP_502836.1
            ProteinModelPortal:Q9XWA4 SMR:Q9XWA4 STRING:Q9XWA4
            EnsemblMetazoa:Y40H7A.10 GeneID:189809 KEGG:cel:CELE_Y40H7A.10
            UCSC:Y40H7A.10 CTD:189809 WormBase:Y40H7A.10 eggNOG:NOG286423
            InParanoid:Q9XWA4 OMA:NGPMIVC NextBio:943702 Uniprot:Q9XWA4
        Length = 343

 Score = 443 (161.0 bits), Expect = 8.4e-42, P = 8.4e-42
 Identities = 115/312 (36%), Positives = 164/312 (52%)

Query:    37 HLTSMD-KLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKE-VTSYWLGLN 94
             H+ + D K    F++++ K+ + Y    E + RF IF  NL  +++ NKE        LN
Sbjct:    39 HIPTPDVKYTNAFQNFLVKYLREYPNEYEIVKRFTIFSRNLDLVERYNKEDAGKVTYELN 98

Query:    95 EFADMSHEEFKNKYLGLKPQFPTRRQPSAE-FSYRDVKALPKSVDWRK-KGA--VTPVKN 150
             +F+D++ EE+K KYL + P+ P   + S +  +  D K LP SVDWR   G   VT +K 
Sbjct:    99 DFSDLTEEEWK-KYL-MTPK-PDHSEKSLKPKTLIDKKNLPNSVDWRNVNGTNHVTGIKY 155

Query:   151 QGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVA 210
             QG CGSCWAF+T AA+E    I  G L SLS Q+L+DC T  ++ C GG    A KY   
Sbjct:   156 QGPCGSCWAFATAAAIESAVSISGGGLQSLSSQQLLDC-TVVSDKCGGGEPVEALKY-AQ 213

Query:   211 SGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEA 270
             S G+    +YPY      C +      V  IS +      DE + + AL + P+ V    
Sbjct:   214 SHGITTAHNYPYYFWTTKCRETVPT--VARISSWMKAESEDEMAQIVAL-NGPMIVCANF 270

Query:   271 SGTDFQFYSGGVFTGP-CGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKR 329
             +    +FY  G+   P CG E  H +  +GYG     DY I+KN++   WGE+GY+R+KR
Sbjct:   271 ATNKNRFYHSGIAEDPDCGTEPTHALIVIGYGP----DYWILKNTYSKVWGEKGYMRVKR 326

Query:   330 NTGKPEGLCGIN 341
             +       CGIN
Sbjct:   327 DVN----WCGIN 334


>DICTYBASE|DDB_G0281079
            symbol:DDB_G0281079 species:44689 "Dictyostelium
            discoideum" [GO:0030246 "carbohydrate binding" evidence=IEA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR008965 InterPro:IPR013128 InterPro:IPR019028
            Pfam:PF00112 Pfam:PF09478 PRINTS:PR00705 SMART:SM00645
            SMART:SM01063 InterPro:IPR000169 dictyBase:DDB_G0281079
            GO:GO:0030246 EMBL:AAFI02000040 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00139
            PROSITE:PS00639 SUPFAM:SSF49384 RefSeq:XP_640804.1
            ProteinModelPortal:Q54UH2 EnsemblProtists:DDB0204000 GeneID:8622858
            KEGG:ddi:DDB_G0281079 InParanoid:Q54UH2 OMA:ALESHYY
            ProtClustDB:CLSZ2430562 Uniprot:Q54UH2
        Length = 664

 Score = 346 (126.9 bits), Expect = 4.0e-40, Sum P(2) = 4.0e-40
 Identities = 76/185 (41%), Positives = 105/185 (56%)

Query:   134 PKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSF- 192
             P S+DWR  G V+ VKNQGSCGSC+AFSTV A+E      +  +  LSEQ L+DC  S  
Sbjct:   471 PISIDWRTWGMVSKVKNQGSCGSCYAFSTVGALESHYYRKNNRMLDLSEQNLVDCTASNK 530

Query:   193 --NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPEN 250
               N GC+GG M   + YI  +GG+++E  YPY  + G C     + +   IS +  + ++
Sbjct:   531 YRNGGCSGGWMHNCYSYIQENGGINQESTYPYEGKFGQCRYNSGDAQS-RISKFVMIKQH 589

Query:   251 DEQSLLKALAHQ-PVSVAIEASGTDFQFYSGGVF-TGPCGA-ELDHGVAAVGYGKSKGSD 307
             DE+ L   +A   PVSVA +AS  +F +YS G++ +  C      H V  VGY    G D
Sbjct:   590 DEEDLADTVASVGPVSVAYDASTREFMYYSRGIYYSDNCNKYRTTHAVVVVGYDNENGVD 649

Query:   308 YIIVK 312
             Y I+K
Sbjct:   650 YWIIK 654

 Score = 112 (44.5 bits), Expect = 4.0e-40, Sum P(2) = 4.0e-40
 Identities = 22/63 (34%), Positives = 41/63 (65%)

Query:    48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQ--RNKEVTSYWLGLNEFADMSHEEFK 105
             F  W ++  +TY+  ++ L ++E FK++ + I+Q  R  + ++  LGL +F+DM+H+EF 
Sbjct:   161 FIQWSNQFNRTYRA-DQFLLKYEAFKDSSRFIEQYKRENQNSTMELGLTQFSDMTHDEFL 219

Query:   106 NKY 108
             N Y
Sbjct:   220 NVY 222


>FB|FBgn0037396
            symbol:CG11459 species:7227 "Drosophila melanogaster"
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=NAS]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            EMBL:AE014297 eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00639
            GeneTree:ENSGT00560000076599 KO:K01365 HSSP:P07711 EMBL:AY060710
            RefSeq:NP_649608.1 UniGene:Dm.3894 SMR:Q9VNK6 MEROPS:C01.A31
            EnsemblMetazoa:FBtr0078623 GeneID:40741 KEGG:dme:Dmel_CG11459
            UCSC:CG11459-RA FlyBase:FBgn0037396 InParanoid:Q9VNK6 OMA:NYDEREL
            OrthoDB:EOG4MGQPX ChiTaRS:CG11459 GenomeRNAi:40741 NextBio:820359
            Uniprot:Q9VNK6
        Length = 336

 Score = 422 (153.6 bits), Expect = 1.4e-39, P = 1.4e-39
 Identities = 101/312 (32%), Positives = 162/312 (51%)

Query:    48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEV----TSYWLGLNEFADMSHEE 103
             ++ + +K+ K Y+   +K HR  ++++ +  ++  N+       ++ +GLN+F+D     
Sbjct:    30 WDQYKAKYNKQYRN-RDKYHR-ALYEQRVLAVESHNQLYLQGKVAFKMGLNKFSDTDQRI 87

Query:   104 FKNKYLGLKPQFPTRRQPSAE-FSYRDVKALPKSVDWRKKGAVTPVKNQGS-CGSCWAFS 161
               N    +     T      E  +Y+    + + +DWR+ G ++PV +QG+ C SCWAFS
Sbjct:    88 LFNYRSSIPAPLETSTNALTETVNYKRYDQITEGIDWRQYGYISPVGDQGTECLSCWAFS 147

Query:   162 TVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYP 221
             T   +E       GNL  LS + L+DC    NNGC+GG +  AF Y     G+  +E YP
Sbjct:   148 TSGVLEAHMAKKYGNLVPLSPKHLVDCVPYPNNGCSGGWVSVAFNY-TRDHGIATKESYP 206

Query:   222 YLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASGTDFQFYSG 280
             Y    G C  K +     T+SGY  +   DE+ L + + +  PV+V+I+    +F  YSG
Sbjct:   207 YEPVSGECLWKSDR-SAGTLSGYVTLGNYDERELAEVVYNIGPVAVSIDHLHEEFDQYSG 265

Query:   281 GVFTGP-CGA---ELDHGVAAVGYGKS-KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPE 335
             GV + P C +   +L H V  VG+G   K  DY I+KNS+G  WGE GY+++ RN     
Sbjct:   266 GVLSIPACRSKRQDLTHSVLLVGFGTHRKWGDYWIIKNSYGTDWGESGYLKLARNANN-- 323

Query:   336 GLCGINKMASIP 347
              +CG+  +   P
Sbjct:   324 -MCGVASLPQYP 334


>UNIPROTKB|Q5T8F0
            symbol:CTSL1 "Cathepsin L1 light chain" species:9606 "Homo
            sapiens" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 SMART:SM00645 InterPro:IPR000169
            HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00139
            EMBL:AL160279 UniGene:Hs.731507 UniGene:Hs.731952 HGNC:HGNC:2537
            ChiTaRS:CTSL1 IPI:IPI00640540 SMR:Q5T8F0 Ensembl:ENST00000342020
            ChEMBL:CHEMBL1293261 Uniprot:Q5T8F0
        Length = 225

 Score = 391 (142.7 bits), Expect = 2.7e-36, P = 2.7e-36
 Identities = 84/197 (42%), Positives = 115/197 (58%)

Query:    31 VGYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVT--- 87
             +G +   LT    L   +  W + H + Y   EE   R  ++++N+K I+  N+E     
Sbjct:    12 LGIASATLTFDHSLEAQWTKWKAMHNRLYGMNEEGWRR-AVWEKNMKMIELHNQEYREGK 70

Query:    88 -SYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVT 146
              S+ + +N F DM+ EEF+    G + + P + +   E  + +    P+SVDWR+KG VT
Sbjct:    71 HSFTMAMNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYEA---PRSVDWREKGYVT 127

Query:   147 PVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCD-TSFNNGCNGGLMDYAF 205
             PVKNQG CGSCWAFS   A+EG     +G L SLSEQ L+DC     N GCNGGLMDYAF
Sbjct:   128 PVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAF 187

Query:   206 KYIVASGGLHKEEDYPY 222
             +Y+  +GGL  EE YPY
Sbjct:   188 QYVQDNGGLDSEESYPY 204


>UNIPROTKB|F1PGK4 [details] [associations]
            symbol:CTSO "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00139
            PROSITE:PS00639 GeneTree:ENSGT00560000076599 OMA:SNVCGIA
            EMBL:AAEX03010073 Ensembl:ENSCAFT00000013638 Uniprot:F1PGK4
        Length = 316

 Score = 388 (141.6 bits), Expect = 5.7e-36, P = 5.7e-36
 Identities = 98/278 (35%), Positives = 141/278 (50%)

Query:    72 FKENLKHIDQRN----KEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSY 127
             F+E+L      N    +E +S   G+N+F+ +S EEFK  YL  KP    R       S 
Sbjct:    39 FRESLNRHRYLNSVFPRENSSAVYGINQFSYLSPEEFKAIYLRSKPSRSPRYPAEVRTSI 98

Query:   128 RDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELID 187
             R+V +LP   DWR K  VT V+NQ +CG CWAFS V AVE    I    L  +S Q++ID
Sbjct:    99 RNV-SLPLRFDWRDKRVVTQVRNQQTCGGCWAFSVVGAVESAYAIKGKPLADISVQQVID 157

Query:   188 CDTSFNN-GCNGGLMDYAFKYIVASG-GLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQ 245
             C  S+NN GC+GG    A  ++  +   L ++ +YP+  + G C    +     +I GY 
Sbjct:   158 C--SYNNYGCSGGSTLNALNWLNKTQVKLVRDSEYPFKAQNGLCHYFSDSYSGFSIRGYS 215

Query:   246 --DVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGA-ELDHGVAAVGYGK 302
               D  + +++     L   P+ V ++A    +Q Y GG+    C + E +H V   G+ K
Sbjct:   216 AYDFSDQEDEMAKVLLTFGPLVVVVDA--VSWQDYLGGIIQHHCSSGEANHAVLITGFDK 273

Query:   303 SKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGI 340
                + Y IV+NSWG  WG  GY  +K   G    +CGI
Sbjct:   274 IGSTPYWIVRNSWGSSWGVDGYAHVKMG-GN---ICGI 307


>UNIPROTKB|P43234 [details] [associations]
            symbol:CTSO "Cathepsin O" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0005764 "lysosome"
            evidence=IEA] [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=TAS] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 Reactome:REACT_6900
            eggNOG:COG4870 HOGENOM:HOG000230774 GO:GO:0006508
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 GO:GO:0004197
            CleanEx:HS_CTSO EMBL:X77383 EMBL:BC049206 IPI:IPI00017257
            PIR:A55090 RefSeq:NP_001325.1 UniGene:Hs.75262
            ProteinModelPortal:P43234 SMR:P43234 IntAct:P43234 STRING:P43234
            MEROPS:C01.035 PhosphoSite:P43234 DMDM:1168795 PRIDE:P43234
            DNASU:1519 Ensembl:ENST00000433477 GeneID:1519 KEGG:hsa:1519
            UCSC:uc003ipg.3 CTD:1519 GeneCards:GC04M156845 HGNC:HGNC:2542
            HPA:HPA002041 MIM:600550 neXtProt:NX_P43234 PharmGKB:PA27040
            HOVERGEN:HBG105050 InParanoid:P43234 KO:K01374 OMA:SNVCGIA
            OrthoDB:EOG4V6ZH1 PhylomeDB:P43234 BindingDB:P43234
            ChEMBL:CHEMBL3035 GenomeRNAi:1519 NextBio:6287 Bgee:P43234
            Genevestigator:P43234 GermOnline:ENSG00000151792 Uniprot:P43234
        Length = 321

 Score = 388 (141.6 bits), Expect = 5.7e-36, P = 5.7e-36
 Identities = 104/286 (36%), Positives = 146/286 (51%)

Query:    72 FKENLKHIDQRNK----EVTSYWLGLNEFADMSHEEFKNKYLGLKP-QFPTRRQPSAEFS 126
             F+E+L      N     E ++ + G+N+F+ +  EEFK  YL  KP +FP R       S
Sbjct:    44 FRESLNRHRYLNSLFPSENSTAFYGINQFSYLFPEEFKAIYLRSKPSKFP-RYSAEVHMS 102

Query:   127 YRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELI 186
               +V +LP   DWR K  VT V+NQ  CG CWAFS V AVE    I    L  LS Q++I
Sbjct:   103 IPNV-SLPLRFDWRDKQVVTQVRNQQMCGGCWAFSVVGAVESAYAIKGKPLEDLSVQQVI 161

Query:   187 DCDTSFNN-GCNGGLMDYAFKYI-VASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGY 244
             DC  S+NN GCNGG    A  ++      L K+ +YP+  + G C          +I GY
Sbjct:   162 DC--SYNNYGCNGGSTLNALNWLNKMQVKLVKDSEYPFKAQNGLCHYFSGSHSGFSIKGY 219

Query:   245 QDVPEND-EQSLLKAL-AHQPVSVAIEASGTDFQFYSGGVFTGPCGA-ELDHGVAAVGYG 301
                  +D E  + KAL    P+ V ++A    +Q Y GG+    C + E +H V   G+ 
Sbjct:   220 SAYDFSDQEDEMAKALLTFGPLVVIVDA--VSWQDYLGGIIQHHCSSGEANHAVLITGFD 277

Query:   302 KSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGI-NKMASI 346
             K+  + Y IV+NSWG  WG  GY  +K  +     +CGI + ++SI
Sbjct:   278 KTGSTPYWIVRNSWGSSWGVDGYAHVKMGSN----VCGIADSVSSI 319


>UNIPROTKB|F1MHV4 [details] [associations]
            symbol:CTSW "Uncharacterized protein" species:9913 "Bos
            taurus" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00639
            GeneTree:ENSGT00660000095458 OMA:GRCGDGC EMBL:DAAA02063574
            IPI:IPI00716321 Ensembl:ENSBTAT00000027681 Uniprot:F1MHV4
        Length = 375

 Score = 297 (109.6 bits), Expect = 1.2e-35, Sum P(2) = 1.2e-35
 Identities = 85/274 (31%), Positives = 142/274 (51%)

Query:    43 KLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKE-VTSYWLGLNEFADMSH 101
             +L E+F  +  ++ ++Y    E   R +IF +NL    +  +E + +   G+ +F+D++ 
Sbjct:    37 ELKEVFRLFQMQYNRSYPNPAEYARRLDIFAQNLAKAQRLQEEDLGTAEFGVTQFSDLTE 96

Query:   102 EEFKNKY---LGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCW 158
             EEF   Y   +  +    +R+  S E+   +    P++ DWRK G ++PV++Q +C  CW
Sbjct:    97 EEFVQLYGSQVAGEALGVSRKVGSEEWGESE----PQTCDWRKVGTISPVRDQRNCNCCW 152

Query:   159 AFSTVAAVEGINQIVSGNLTSLSEQ-ELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKE 217
             A +    +E +  I   +   +S Q EL+DCD    NGC GG +  AF  ++ + GL  E
Sbjct:   153 AMAAAGNIEALWAIKFRHFVEVSVQPELLDCDRC-GNGCRGGFVWDAFLTVLNNSGLASE 211

Query:   218 EDYPYLMEEGT--CEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQ-PVSVAIEASGTD 274
             +DYP+     T  C  KK + +V  I  +  + +  EQS+ + LA + P++V I    T 
Sbjct:   212 KDYPFNGSGKTHRCLAKKYK-KVAWIQDFI-ILQACEQSMARHLATEGPITVTINM--TL 267

Query:   275 FQFYSGGVFTG-P--CG-AELDHGVAAVGYGKSK 304
              Q Y  GV    P  C   ++DH V  VG+GK+K
Sbjct:   268 LQQYQKGVIKATPTTCDPTQVDHSVLLVGFGKTK 301

 Score = 104 (41.7 bits), Expect = 1.2e-35, Sum P(2) = 1.2e-35
 Identities = 19/35 (54%), Positives = 23/35 (65%)

Query:   308 YIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINK 342
             Y I+KNSWGP+WGE GY R+ R +      CGI K
Sbjct:   325 YWILKNSWGPQWGEEGYFRLHRGSNT----CGITK 355


>UNIPROTKB|E1BPI9 [details] [associations]
            symbol:CTSO "Uncharacterized protein" species:9913 "Bos
            taurus" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00560000076599 CTD:1519 KO:K01374 OMA:SNVCGIA
            EMBL:DAAA02044933 IPI:IPI01004081 RefSeq:XP_002694471.2
            RefSeq:XP_874012.4 Ensembl:ENSBTAT00000014691 GeneID:616804
            KEGG:bta:616804 Uniprot:E1BPI9
        Length = 313

 Score = 382 (139.5 bits), Expect = 2.4e-35, P = 2.4e-35
 Identities = 101/283 (35%), Positives = 143/283 (50%)

Query:    67 HRFEIFKENLKHIDQRNK----EVTSYWLGLNEFADMSHEEFKNKYLGLKP-QFPTRRQP 121
             H    F+E+L      N     E ++   G+N+F+ +  EEFK  YL   P +FP  R P
Sbjct:    31 HPAAAFRESLNRQRYLNSLFPYENSTAVYGINQFSYLFPEEFKAIYLRSSPSRFP--RFP 88

Query:   122 SAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLS 181
             + E++     +LP   DWR K  VT V+NQ +CG CWAFS V AVE +  I    L  LS
Sbjct:    89 AEEYTSISNLSLPLRFDWRDKHVVTQVRNQKTCGGCWAFSVVGAVESVCAIKGQPLEVLS 148

Query:   182 EQELIDCDTSFNNGCNGGLMDYAFKYI-VASGGLHKEEDYPYLMEEGTCEDKKEEMEVVT 240
              Q++IDC  S N GCNGG    A  ++      L ++ +YP+  + G C    +     +
Sbjct:   149 VQQVIDCSYS-NYGCNGGSPLSALYWLNKLQVKLVRDSEYPFQAQNGLCRYFSDSHSGSS 207

Query:   241 ISGYQDVP-ENDEQSLLKAL-AHQPVSVAIEASGTDFQFYSGGVFTGPCGA-ELDHGVAA 297
             I GY        E  + +AL A  P+ V ++A    +Q Y GG+    C + E +H V  
Sbjct:   208 IKGYSAYDFSGQEDKMAEALLALGPLIVVVDAMS--WQDYLGGIIQHHCSSGEANHAVLV 265

Query:   298 VGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGI 340
              G+ K+    Y IV+NSWG  WG  GY+R+K   G    +CGI
Sbjct:   266 TGFDKTGSIPYWIVRNSWGTSWGIDGYVRVKMG-GN---VCGI 304


>UNIPROTKB|P56202 [details] [associations]
            symbol:CTSW "Cathepsin W" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0006955 "immune response"
            evidence=TAS] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 GO:GO:0006955 HOGENOM:HOG000230774
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 EMBL:AF013611
            EMBL:AF015954 EMBL:AF055903 EMBL:AP001201 EMBL:BC048255
            IPI:IPI00328978 RefSeq:NP_001326.2 UniGene:Hs.416848
            ProteinModelPortal:P56202 SMR:P56202 STRING:P56202 MEROPS:C01.037
            PhosphoSite:P56202 DMDM:259016196 PaxDb:P56202 PRIDE:P56202
            Ensembl:ENST00000307886 GeneID:1521 KEGG:hsa:1521 UCSC:uc001ogc.1
            CTD:1521 GeneCards:GC11P065647 HGNC:HGNC:2546 HPA:CAB016345
            MIM:602364 neXtProt:NX_P56202 PharmGKB:PA27042 eggNOG:NOG288820
            HOVERGEN:HBG100117 InParanoid:P56202 KO:K08569 OMA:GRCGDGC
            OrthoDB:EOG4Q2DG3 PhylomeDB:P56202 GenomeRNAi:1521 NextBio:6295
            ArrayExpress:P56202 Bgee:P56202 CleanEx:HS_CTSW
            Genevestigator:P56202 GermOnline:ENSG00000172543 Uniprot:P56202
        Length = 376

 Score = 298 (110.0 bits), Expect = 3.9e-35, Sum P(2) = 3.9e-35
 Identities = 84/277 (30%), Positives = 138/277 (49%)

Query:    43 KLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKE-VTSYWLGLNEFADMSH 101
             +L E F+ +  +  ++Y   EE  HR +IF  NL    +  +E + +   G+  F+D++ 
Sbjct:    37 ELKEAFKLFQIQFNRSYLSPEEHAHRLDIFAHNLAQAQRLQEEDLGTAEFGVTPFSDLTE 96

Query:   102 EEFKNKYLGLKPQFPTRRQPSAEF-SYRDVKALPKSVDWRK-KGAVTPVKNQGSCGSCWA 159
             EEF   Y G +           E  S    +++P S DWRK   A++P+K+Q +C  CWA
Sbjct:    97 EEFGQLY-GYRRAAGGVPSMGREIRSEEPEESVPFSCDWRKVASAISPIKDQKNCNCCWA 155

Query:   160 FSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEED 219
              +    +E + +I   +   +S QEL+DC     +GC+GG +  AF  ++ + GL  E+D
Sbjct:   156 MAAAGNIETLWRISFWDFVDVSVQELLDCGRC-GDGCHGGFVWDAFITVLNNSGLASEKD 214

Query:   220 YPYL--MEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALA-HQPVSVAIEASGTDFQ 276
             YP+   +    C  KK + +V  I  +  + +N+E  + + LA + P++V I       Q
Sbjct:   215 YPFQGKVRAHRCHPKKYQ-KVAWIQDFIML-QNNEHRIAQYLATYGPITVTINMK--PLQ 270

Query:   277 FYSGGVFTG-P--CGAEL-DHGVAAVGYGKSKGSDYI 309
              Y  GV    P  C  +L DH V  VG+G  K  + I
Sbjct:   271 LYRKGVIKATPTTCDPQLVDHSVLLVGFGSVKSEEGI 307

 Score = 98 (39.6 bits), Expect = 3.9e-35, Sum P(2) = 3.9e-35
 Identities = 18/35 (51%), Positives = 23/35 (65%)

Query:   308 YIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINK 342
             Y I+KNSWG +WGE+GY R+ R +      CGI K
Sbjct:   326 YWILKNSWGAQWGEKGYFRLHRGSNT----CGITK 356


>MGI|MGI:1338045 [details] [associations]
            symbol:Ctsw "cathepsin W" species:10090 "Mus musculus"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008233 "peptidase
            activity" evidence=IEA] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0016787 "hydrolase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 MGI:MGI:1338045 HOGENOM:HOG000230774
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 MEROPS:C01.037
            CTD:1521 eggNOG:NOG288820 HOVERGEN:HBG100117 KO:K08569 OMA:GRCGDGC
            OrthoDB:EOG4Q2DG3 EMBL:AF014941 EMBL:AC122861 IPI:IPI00111727
            RefSeq:NP_034115.2 UniGene:Mm.113590 ProteinModelPortal:P56203
            SMR:P56203 PhosphoSite:P56203 PRIDE:P56203 DNASU:13041
            Ensembl:ENSMUST00000025844 GeneID:13041 KEGG:mmu:13041
            InParanoid:P56203 NextBio:282936 Bgee:P56203 CleanEx:MM_CTSW
            Genevestigator:P56203 GermOnline:ENSMUSG00000024910 Uniprot:P56203
        Length = 371

 Score = 298 (110.0 bits), Expect = 6.3e-35, Sum P(2) = 6.3e-35
 Identities = 84/271 (30%), Positives = 134/271 (49%)

Query:    43 KLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKE-VTSYWLGLNEFADMSH 101
             +L E+F+ +  +  ++Y    E   R  IF  NL    +  +E + +   G   F+D++ 
Sbjct:    35 ELKEVFKLFQIRFNRSYWNPAEYTRRLSIFAHNLAQAQRLQQEDLGTAEFGETPFSDLTE 94

Query:   102 EEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRK-KGAVTPVKNQGSCGSCWAF 160
             EEF   Y   +    T        S    +++P++ DWRK K  ++ VKNQGSC  CWA 
Sbjct:    95 EEFGQLYGQERSPERTPNMTKKVESNTWGESVPRTCDWRKAKNIISSVKNQGSCKCCWAM 154

Query:   161 STVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDY 220
             +    ++ + +I       +S QEL+DC+    NGCNGG +  A+  ++ + GL  E+DY
Sbjct:   155 AAADNIQALWRIKHQQFVDVSVQELLDCERC-GNGCNGGFVWDAYLTVLNNSGLASEKDY 213

Query:   221 PYLMEEGT--CEDKKEEMEVVTISGYQDVPENDEQSLLKALA-HQPVSVAIEASGTDFQF 277
             P+  +     C  KK + +V  I  +  +  N+EQ++   LA H P++V I       Q 
Sbjct:   214 PFQGDRKPHRCLAKKYK-KVAWIQDFTML-SNNEQAIAHYLAVHGPITVTINMKL--LQH 269

Query:   278 YSGGVFTG-P--CGA-ELDHGVAAVGYGKSK 304
             Y  GV    P  C   ++DH V  VG+GK K
Sbjct:   270 YQKGVIKATPSSCDPRQVDHSVLLVGFGKEK 300

 Score = 96 (38.9 bits), Expect = 6.3e-35, Sum P(2) = 6.3e-35
 Identities = 18/37 (48%), Positives = 22/37 (59%)

Query:   306 SDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINK 342
             S Y I+KNSWG  WGE+GY R+ R        CG+ K
Sbjct:   319 SPYWILKNSWGAHWGEKGYFRLYRGNNT----CGVTK 351


>RGD|1309354 [details] [associations]
            symbol:Ctsw "cathepsin W" species:10116 "Rattus norvegicus"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            RGD:1309354 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00640
            PROSITE:PS00639 GeneTree:ENSGT00660000095458 MEROPS:C01.037
            CTD:1521 eggNOG:NOG288820 HOVERGEN:HBG100117 KO:K08569 OMA:GRCGDGC
            OrthoDB:EOG4Q2DG3 EMBL:CH473953 EMBL:BC093401 IPI:IPI00371471
            RefSeq:NP_001019413.1 UniGene:Rn.34406 Ensembl:ENSRNOT00000037404
            GeneID:293676 KEGG:rno:293676 UCSC:RGD:1309354 InParanoid:Q561Q9
            NextBio:636716 Genevestigator:Q561Q9 Uniprot:Q561Q9
        Length = 371

 Score = 296 (109.3 bits), Expect = 1.0e-34, Sum P(2) = 1.0e-34
 Identities = 88/274 (32%), Positives = 137/274 (50%)

Query:    43 KLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKE-VTSYWLGLNEFADMSH 101
             +L E+F+ +  +  ++Y    E   R  IF  NL    +  +E + +   G   F+D++ 
Sbjct:    35 ELKEVFKLFQIQFNRSYSNPAEYTRRLGIFAHNLAQAQRLQEEDLGTAEFGQTPFSDLTE 94

Query:   102 EEFKNKYLGLKPQFPTRRQPSAEF--SYRDVKALPKSVDWRK-KGAVTPVKNQGSCGSCW 158
             EEF   Y G + + P R    A+   S R  +++P + DWRK K  ++ +KNQG+C  CW
Sbjct:    95 EEFGQLY-GHQ-RAPERILNMAKKVKSERWGESVPPTCDWRKVKNIISSIKNQGNCRCCW 152

Query:   159 AFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEE 218
             A +    ++ + +I +     +S QEL+DCD    NGCNGG +  A+  ++ + GL  EE
Sbjct:   153 AIAAADNIQTLWRIKTQQFVDVSVQELLDCDRC-GNGCNGGFVWDAYITVLNNSGLASEE 211

Query:   219 DYPYL--MEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALA-HQPVSVAIEASGTDF 275
             DYP+    +   C   K   +V  I  +  +  N EQ +   LA H P++V I       
Sbjct:   212 DYPFQGHQKPHRCLADKYR-KVAWIQDFTMLSSN-EQVIAGYLAIHGPITVTINMKL--L 267

Query:   276 QFYSGGVFTG-P--CGAEL-DHGVAAVGYGKSKG 305
             Q+Y  GV    P  C   L +H V  VG+GK KG
Sbjct:   268 QYYQKGVIKATPSTCDPHLVNHSVLLVGFGKEKG 301

 Score = 96 (38.9 bits), Expect = 1.0e-34, Sum P(2) = 1.0e-34
 Identities = 18/39 (46%), Positives = 24/39 (61%)

Query:   304 KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINK 342
             + + Y I+KNSWG +WGE+GY R+ R        CGI K
Sbjct:   317 RSTPYWILKNSWGAEWGEKGYFRLYRGNNT----CGIAK 351


>UNIPROTKB|F1RU23 [details] [associations]
            symbol:CTSW "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00639
            GeneTree:ENSGT00660000095458 KO:K08569 EMBL:CU928325
            RefSeq:XP_003122571.1 UniGene:Ssc.28940 Ensembl:ENSSSCT00000014177
            GeneID:100525853 KEGG:ssc:100525853 OMA:CWAMAAV Uniprot:F1RU23
        Length = 367

 Score = 371 (135.7 bits), Expect = 3.6e-34, P = 3.6e-34
 Identities = 107/323 (33%), Positives = 162/323 (50%)

Query:    44 LIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKE-VTSYWLGLNEFADMSHE 102
             L E+F  +  ++ ++Y    E   R +IF +NL    +  +E + +   G+  F+D++ E
Sbjct:    38 LKEVFTLFQIQYNRSYSNPAEHARRLDIFAQNLAKAQRLQEEDLGTAEFGVTPFSDLTEE 97

Query:   103 EFKNKYLGLKPQFPTRRQPSAEF---SYRDVKALPKSVDWRKK-GAVTPVKNQGSCGSCW 158
             EF   + G    +   + PS      S    + +P+S DWRKK G ++ +K+Q  C  CW
Sbjct:    98 EFGQLH-G--HHWGAGKAPSMGIKVGSEESGETVPQSCDWRKKPGVISAIKHQKDCNCCW 154

Query:   159 AFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEE 218
             A + V  VE    I       LS Q+++DCD    NGCNGG +  AF  ++ + GL  E+
Sbjct:   155 AMAAVDNVEAQWAIKYHQAVQLSVQQVLDCDRC-GNGCNGGFVWDAFLTVLNTSGLASEQ 213

Query:   219 DYPYLMEEGTCEDK----KEEMEVVTISGYQDVPENDEQSLLKALAHQ-PVSVAIEASGT 273
             DYPY   +GT +      K+  +V  I  +  + +  EQS+ + LA + P++V I A G 
Sbjct:   214 DYPY---KGTVKTHRCLAKQHRKVAWIQDFLML-QFCEQSIARYLATEGPITVTINA-GL 268

Query:   274 DFQFYSGGVFTGP--CGAEL-DHGVAAVGYGKSKGSD-----------YIIVKNSWGPKW 319
               Q+  G +   P  C   L +H V  VG+GKSK  +           Y I+KNSWGP W
Sbjct:   269 LQQYKRGVIRATPATCDPHLVNHSVLLVGFGKSKSVEGRRPRPGHSIPYWILKNSWGPDW 328

Query:   320 GERGYIRMKRNTGKPEGLCGINK 342
             GE GY R+ R +      CGI K
Sbjct:   329 GEEGYFRLHRGSNT----CGITK 347


>MGI|MGI:2139628 [details] [associations]
            symbol:Ctso "cathepsin O" species:10090 "Mus musculus"
            [GO:0005575 "cellular_component" evidence=ND] [GO:0005764
            "lysosome" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0008233 "peptidase activity" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 MGI:MGI:2139628 eggNOG:COG4870
            HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0005764 GeneTree:ENSGT00560000076599 MEROPS:C01.035 CTD:1519
            HOVERGEN:HBG105050 KO:K01374 OMA:SNVCGIA OrthoDB:EOG4V6ZH1
            EMBL:AK034490 EMBL:AK049470 EMBL:AK165930 EMBL:AK166103
            EMBL:BC044664 IPI:IPI00453524 RefSeq:NP_808330.1 UniGene:Mm.254642
            ProteinModelPortal:Q8BM88 SMR:Q8BM88 STRING:Q8BM88
            PhosphoSite:Q8BM88 PRIDE:Q8BM88 Ensembl:ENSMUST00000029649
            GeneID:229445 KEGG:mmu:229445 UCSC:uc008pon.1 InParanoid:Q8BM88
            NextBio:379433 Bgee:Q8BM88 CleanEx:MM_CTSO Genevestigator:Q8BM88
            GermOnline:ENSMUSG00000028015 Uniprot:Q8BM88
        Length = 312

 Score = 371 (135.7 bits), Expect = 3.6e-34, P = 3.6e-34
 Identities = 96/303 (31%), Positives = 151/303 (49%)

Query:    50 SWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYL 109
             +W   H +    + E LHR   +  +  H     +  T+++ G+N+F+ +  EEFK  YL
Sbjct:    24 TWSWSHQREAAALRESLHRHR-YLNSFPH-----ENSTAFY-GVNQFSYLFPEEFKALYL 76

Query:   110 GLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGI 169
             G K  +  R     +    +V +LP   DWR K  V PV+NQ  CG CWAFS V+A+E  
Sbjct:    77 GSKYAWAPRYPAEGQRPIPNV-SLPLRFDWRDKHVVNPVRNQEMCGGCWAFSVVSAIESA 135

Query:   170 NQIVSGNLTSLSEQELIDCDTSFNN-GCNGGLMDYAFKYIVASG-GLHKEEDYPYLMEEG 227
               I   +L  LS Q++IDC  SFNN GC GG    A +++  +   L  +  YP+    G
Sbjct:   136 RAIQGKSLDYLSVQQVIDC--SFNNSGCLGGSPLCALRWLNETQLKLVADSQYPFKAVNG 193

Query:   228 TCEDKKEEMEVVTISGYQDVP-ENDEQSLLKAL-AHQPVSVAIEASGTDFQFYSGGVFTG 285
              C    +    V++  +        E  + +AL +  P+ V ++A    +Q Y GG+   
Sbjct:   194 QCRHFPQSQAGVSVKDFSAYNFRGQEDEMARALLSFGPLVVIVDAMS--WQDYLGGIIQH 251

Query:   286 PCGA-ELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGI-NKM 343
              C + E +H V   G+ ++  + Y +V+NSWG  WG  GY  +K   G    +CGI + +
Sbjct:   252 HCSSGEANHAVLITGFDRTGNTPYWMVRNSWGSSWGVEGYAHVKMG-GN---VCGIADSV 307

Query:   344 ASI 346
             A++
Sbjct:   308 AAV 310


>WB|WBGene00011102 [details] [associations]
            symbol:R07E3.1 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0016021 "integral to
            membrane" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076599
            EMBL:Z49207 HSSP:P53634 PIR:T24030 RefSeq:NP_001041280.1
            ProteinModelPortal:Q21810 SMR:Q21810 STRING:Q21810 MEROPS:C01.A43
            PaxDb:Q21810 EnsemblMetazoa:R07E3.1a GeneID:181242
            KEGG:cel:CELE_R07E3.1 UCSC:R07E3.1a CTD:181242 WormBase:R07E3.1a
            HOGENOM:HOG000021028 InParanoid:Q21810 OMA:ACKNEVI NextBio:913066
            ArrayExpress:Q21810 Uniprot:Q21810
        Length = 402

 Score = 367 (134.2 bits), Expect = 9.5e-34, P = 9.5e-34
 Identities = 112/314 (35%), Positives = 152/314 (48%)

Query:    50 SWMSKHGKTYKCIEEKLHRFEIF---KENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKN 106
             ++  K  K+Y   +E L R   +    EN+ + + +N+  ++ + G N+ +D + EEF+ 
Sbjct:    92 AYTEKFDKSYATSQESLKRLNAYYNTDENIANWNIQNEHGSAEY-GHNDMSDWTDEEFE- 149

Query:   107 KYLGLKPQFPTRRQPSAEF------SYRDVKA-----LPKSVDWRKKGAVTPVKNQGSCG 155
             K L L   F  R    AEF      S    K       P   DWR K  +TPVK QG CG
Sbjct:   150 KTL-LPKSFYKRLHKEAEFIEPIPESLTAKKGESSSPFPDFFDWRDKNVITPVKAQGQCG 208

Query:   156 SCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLH 215
             SCWAF++ A VE    I  G   +LSEQ L+DCD   +N C+GG  D AF+YI  +G L 
Sbjct:   209 SCWAFASTATVEAAWAIAHGEKRNLSEQTLLDCDL-VDNACDGGDEDKAFRYIHRNG-LA 266

Query:   216 KEEDYPYLME-EGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH-QPVSVAIEASGT 273
                D PY+   +  C    +      I     +  +DE S++  L +  PV++ + A   
Sbjct:   267 NAVDLPYVAHRQNGCA-VNDHWNTTRIKAAYFL-HHDEDSIINWLVNFGPVNIGM-AVIQ 323

Query:   274 DFQFYSGGVFTGP---CGAELD--HGVAAVGYGKSK-GSDYIIVKNSWGPKWG-ERGYIR 326
               + Y GGVFT     C  E+   H +   GYG SK G  Y IVKNSWG  WG E GYI 
Sbjct:   324 PMRAYKGGVFTPSEYACKNEVIGLHALLITGYGTSKTGEKYWIVKNSWGNTWGVEHGYIY 383

Query:   327 MKRNTGKPEGLCGI 340
               R        CGI
Sbjct:   384 FARGINA----CGI 393


>UNIPROTKB|H0YD65 [details] [associations]
            symbol:CTSF "Cathepsin F" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 SMART:SM00645 InterPro:IPR000169
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00139 EMBL:AP002748
            HGNC:HGNC:2531 ChiTaRS:CTSF Ensembl:ENST00000524994 Uniprot:H0YD65
        Length = 283

 Score = 361 (132.1 bits), Expect = 4.1e-33, P = 4.1e-33
 Identities = 97/262 (37%), Positives = 140/262 (53%)

Query:    29 SIVGYSPEHLTSMD---KLIELFESWMSKHGKTYKCIEEKLHRFEIFKENL---KHIDQR 82
             S++    E   S D   K+  +F++++  + +TY+  E +  R  +F  N+   + I   
Sbjct:    14 SVISLLNEDPLSQDLPVKMASIFKNFVITYNRTYESKEARW-RLSVFVNNMVRAQKIQAL 72

Query:    83 NKEVTSYWLGLNEFADMSHEEFKNKYLG-LKPQFPTRRQPSAEFSYRDVKALPKSVDWRK 141
             ++    Y  G+ +F+D++ EEF+  YL  L  + P  +   A+ S  D+   P   DWR 
Sbjct:    73 DRGTAQY--GVTKFSDLTEEEFRTIYLNTLLRKEPGNKMKQAK-SVGDLA--PPEWDWRS 127

Query:   142 KGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLM 201
             KGAVT VK+QG CGSCWAFS    VEG   +  G L SLSEQEL+DCD   +  C GGL 
Sbjct:   128 KGAVTKVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCD-KMDKACMGGLP 186

Query:   202 DYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAH 261
               A+  I   GGL  E+DY Y     +C    E+ +V  I+   ++ +N EQ L   LA 
Sbjct:   187 SNAYSAIKNLGGLETEDDYSYQGHMQSCNFSAEKAKVY-INDSVELSQN-EQKLAAWLAK 244

Query:   262 Q-PVSVAIEASGTDFQFYSGGV 282
             + P+SVAI A G   QFY  G+
Sbjct:   245 RGPISVAINAFG--MQFYRHGI 264


>UNIPROTKB|E2RPX3 [details] [associations]
            symbol:CTSW "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00639
            GeneTree:ENSGT00660000095458 CTD:1521 KO:K08569 OMA:GRCGDGC
            EMBL:AAEX03011632 RefSeq:XP_540846.2 Ensembl:ENSCAFT00000020910
            GeneID:483725 KEGG:cfa:483725 Uniprot:E2RPX3
        Length = 374

 Score = 280 (103.6 bits), Expect = 6.2e-33, Sum P(2) = 6.2e-33
 Identities = 83/275 (30%), Positives = 137/275 (49%)

Query:    43 KLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQ-RNKEVTSYWLGLNEFADMSH 101
             +L ++F  +  ++ ++Y   EE   R +IF  NL    Q  ++++ +   G+  F+D++ 
Sbjct:    37 ELKQVFALFQIQYNRSYSNPEEYARRLDIFAHNLAQAQQLEDEDLGTAEFGVTPFSDLTE 96

Query:   102 EEFKNKY--LGLKPQFPT--RRQPSAEFSYRDVKALPKSVDWRK-KGAVTPVKNQGSCGS 156
             EEF   Y    +  + P+  R+  S E+     + +P + DWRK  G ++P+K QG+C  
Sbjct:    97 EEFGQFYGHQRMAGEAPSVGRKVESEEWG----EPVPPTCDWRKLPGIISPIKQQGNCRC 152

Query:   157 CWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHK 216
             CWA +    +E +  I       +S QEL+DC     +GC GG    AF  ++ + GL  
Sbjct:   153 CWAMAAAGNIEALWGIRYHQPVEVSVQELLDCGRC-GDGCKGGFTWDAFITVLNNSGLAS 211

Query:   217 EEDYPYL--MEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQ-PVSVAIEASGT 273
              +DYP+L   +   C  KK + +V  I  +  +  N EQ++   LA + P++V I     
Sbjct:   212 AKDYPFLGNTKPHRCLAKKYK-KVAWIQDFIMLQGN-EQAIAWYLATKGPITVTINMKL- 268

Query:   274 DFQFYSGGVFTGP---CGAE-LDHGVAAVGYGKSK 304
               Q Y  GV       C  + +DH V  VG+GKSK
Sbjct:   269 -LQHYQKGVIQATHTTCDPQRVDHSVLLVGFGKSK 302

 Score = 95 (38.5 bits), Expect = 6.2e-33, Sum P(2) = 6.2e-33
 Identities = 18/35 (51%), Positives = 21/35 (60%)

Query:   308 YIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINK 342
             Y I+KNSWG +WGE GY R+ R        CGI K
Sbjct:   324 YWILKNSWGAEWGEEGYFRLHRGNNT----CGITK 354


>GENEDB_PFALCIPARUM|PF14_0553 [details] [associations]
            symbol:PF14_0553 "cysteine proteinase falcipain-1"
            species:5833 "Plasmodium falciparum" [GO:0042540
            "hemoglobin catabolic process" evidence=TAS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 EMBL:AE014187 KO:K01376 HSSP:P07688 GO:GO:0042540
            RefSeq:XP_001348727.1 ProteinModelPortal:Q8I6V0 PRIDE:Q8I6V0
            EnsemblProtists:PF14_0553:mRNA GeneID:812135 KEGG:pfa:PF14_0553
            EuPathDB:PlasmoDB:PF3D7_1458000 HOGENOM:HOG000065906
            ProtClustDB:CLSZ2457715 ChEMBL:CHEMBL1250371 Uniprot:Q8I6V0
        Length = 569

 Score = 281 (104.0 bits), Expect = 6.5e-33, Sum P(2) = 6.5e-33
 Identities = 75/243 (30%), Positives = 129/243 (53%)

Query:    65 KLHRFEIFKENLKHI-DQRNKEVTSYWLGLNEFADMSHEEFKNKYLG-LKPQFPTRR-QP 121
             KL++  ++K+ +    D   +E+  Y+  L    +   E++   +   LK          
Sbjct:   261 KLNKNAMYKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYT 320

Query:   122 SAEFSYRDV-KALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSL 180
             + + + +D+   +P+ +D+R+KG V   K+QG CGSCWAF++V  +E +    + N+ S 
Sbjct:   321 NGKRNEKDIFSKVPEILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSF 380

Query:   181 SEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGT-CEDKKEEMEVV 239
             SEQE++DC    N GC+GG   Y+F Y++ +  L   ++Y Y  ++   C + + + +V 
Sbjct:   381 SEQEVVDCSKD-NFGCDGGHPFYSFLYVLQNE-LCLGDEYKYKAKDDMFCLNYRCKRKV- 437

Query:   240 TISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVG 299
             ++S    V EN  Q +L      P+SV +  +  DF  YS GV+ G C  EL+H V  VG
Sbjct:   438 SLSSIGAVKEN--QLILALNEVGPLSVNVGVNN-DFVAYSEGVYNGTCSEELNHSVLLVG 494

Query:   300 YGK 302
             YG+
Sbjct:   495 YGQ 497

 Score = 116 (45.9 bits), Expect = 1.2e-10, Sum P(2) = 1.2e-10
 Identities = 28/78 (35%), Positives = 40/78 (51%)

Query:    33 YSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNK--EVTSYW 90
             Y  E   +  K    F  +M +H K YK I+E++ +FEIFK N   I   NK  +   Y 
Sbjct:   210 YKKEDPINNIKYASKFFKFMKEHNKVYKNIDEQMRKFEIFKINYISIKNHNKLNKNAMYK 269

Query:    91 LGLNEFADMSHEEFKNKY 108
               +N+F+D S EE K  +
Sbjct:   270 KKVNQFSDYSEEELKEYF 287

 Score = 108 (43.1 bits), Expect = 6.5e-33, Sum P(2) = 6.5e-33
 Identities = 18/41 (43%), Positives = 24/41 (58%)

Query:   308 YIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPL 348
             Y I+KNSW  KWGE G++R+ RN       CGI +    P+
Sbjct:   528 YWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVFYPI 568


>UNIPROTKB|Q8I6V0 [details] [associations]
            symbol:PF14_0553 "Cysteine proteinase falcipain-1"
            species:36329 "Plasmodium falciparum 3D7" [GO:0042540 "hemoglobin
            catabolic process" evidence=TAS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 PROSITE:PS00640 PROSITE:PS00139
            PROSITE:PS00639 EMBL:AE014187 KO:K01376 HSSP:P07688 GO:GO:0042540
            RefSeq:XP_001348727.1 ProteinModelPortal:Q8I6V0 PRIDE:Q8I6V0
            EnsemblProtists:PF14_0553:mRNA GeneID:812135 KEGG:pfa:PF14_0553
            EuPathDB:PlasmoDB:PF3D7_1458000 HOGENOM:HOG000065906
            ProtClustDB:CLSZ2457715 ChEMBL:CHEMBL1250371 Uniprot:Q8I6V0
        Length = 569

 Score = 281 (104.0 bits), Expect = 6.5e-33, Sum P(2) = 6.5e-33
 Identities = 75/243 (30%), Positives = 129/243 (53%)

Query:    65 KLHRFEIFKENLKHI-DQRNKEVTSYWLGLNEFADMSHEEFKNKYLG-LKPQFPTRR-QP 121
             KL++  ++K+ +    D   +E+  Y+  L    +   E++   +   LK          
Sbjct:   261 KLNKNAMYKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYT 320

Query:   122 SAEFSYRDV-KALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSL 180
             + + + +D+   +P+ +D+R+KG V   K+QG CGSCWAF++V  +E +    + N+ S 
Sbjct:   321 NGKRNEKDIFSKVPEILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSF 380

Query:   181 SEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGT-CEDKKEEMEVV 239
             SEQE++DC    N GC+GG   Y+F Y++ +  L   ++Y Y  ++   C + + + +V 
Sbjct:   381 SEQEVVDCSKD-NFGCDGGHPFYSFLYVLQNE-LCLGDEYKYKAKDDMFCLNYRCKRKV- 437

Query:   240 TISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVG 299
             ++S    V EN  Q +L      P+SV +  +  DF  YS GV+ G C  EL+H V  VG
Sbjct:   438 SLSSIGAVKEN--QLILALNEVGPLSVNVGVNN-DFVAYSEGVYNGTCSEELNHSVLLVG 494

Query:   300 YGK 302
             YG+
Sbjct:   495 YGQ 497

 Score = 116 (45.9 bits), Expect = 1.2e-10, Sum P(2) = 1.2e-10
 Identities = 28/78 (35%), Positives = 40/78 (51%)

Query:    33 YSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNK--EVTSYW 90
             Y  E   +  K    F  +M +H K YK I+E++ +FEIFK N   I   NK  +   Y 
Sbjct:   210 YKKEDPINNIKYASKFFKFMKEHNKVYKNIDEQMRKFEIFKINYISIKNHNKLNKNAMYK 269

Query:    91 LGLNEFADMSHEEFKNKY 108
               +N+F+D S EE K  +
Sbjct:   270 KKVNQFSDYSEEELKEYF 287

 Score = 108 (43.1 bits), Expect = 6.5e-33, Sum P(2) = 6.5e-33
 Identities = 18/41 (43%), Positives = 24/41 (58%)

Query:   308 YIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPL 348
             Y I+KNSW  KWGE G++R+ RN       CGI +    P+
Sbjct:   528 YWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVFYPI 568


>WB|WBGene00013076
            symbol:Y51A2D.8 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 GeneTree:ENSGT00560000076599
            EMBL:AL021497 HSSP:P53634 HOGENOM:HOG000019851 PIR:T27079
            RefSeq:NP_507627.1 ProteinModelPortal:Q9XXQ7 SMR:Q9XXQ7
            MEROPS:C01.A49 EnsemblMetazoa:Y51A2D.8 GeneID:180208
            KEGG:cel:CELE_Y51A2D.8 UCSC:Y51A2D.8 CTD:180208 WormBase:Y51A2D.8
            eggNOG:NOG307864 InParanoid:Q9XXQ7 OMA:VAVYFKV NextBio:908434
            Uniprot:Q9XXQ7
        Length = 386

 Score = 278 (102.9 bits), Expect = 7.9e-33, Sum P(2) = 7.9e-33
 Identities = 69/199 (34%), Positives = 93/199 (46%)

Query:   145 VTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYA 204
             V P+K+QG C  CW F+  A VE +    SG   SLS+QE+ DC T    GC GG +   
Sbjct:   164 VGPIKDQGQCACCWGFAVTALVETVYAAHSGKFKSLSDQEVCDCGTEGTPGCKGGSLTLG 223

Query:   205 FKYIVASGGLHKEEDYPY---LMEEGTCEDKKEEMEVVTISGYQDV---PENDEQSLLKA 258
              +Y V   GL  +EDYPY      +G     +E   +V    +      P   E+ +++ 
Sbjct:   224 VQY-VKKYGLSGDEDYPYDQNRANQGRRCRLRETDRIVPARAFNFAVINPRRAEEQIIQV 282

Query:   259 LAHQPVSVAIEAS-GTDFQFYSGGVFT-GPCGAELD-HGVAAVGY-----GKSKGSDYII 310
             L    V VA+    G  F+ Y  GV     C      H  A VGY      + +  DY I
Sbjct:   283 LTEWKVPVAVYFKVGDQFKEYKEGVIIEDDCRRATQWHAGAIVGYDTVEDSRGRSHDYWI 342

Query:   311 VKNSWGPKWGERGYIRMKR 329
             +KNSWG  W E GY+R+ R
Sbjct:   343 IKNSWGGDWAESGYVRVVR 361

 Score = 96 (38.9 bits), Expect = 7.9e-33, Sum P(2) = 7.9e-33
 Identities = 22/76 (28%), Positives = 39/76 (51%)

Query:    42 DKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRN--KEVTSY--WLGLNEFA 97
             +KL + FE +  K+ + YK   E   RF  F ++  ++D+ N   +   Y    G+N+F+
Sbjct:    37 EKLYKAFEDFKKKYNRKYKDESENQQRFNNFVKSYNNVDKLNAKSKAAGYDTQFGINKFS 96

Query:    98 DMSHEEFKNKYLGLKP 113
             D+S  EF  +   + P
Sbjct:    97 DLSTAEFHGRLSNVVP 112


>ZFIN|ZDB-GENE-080724-8
            symbol:ctso "cathepsin O" species:7955 "Danio rerio"
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            ZFIN:ZDB-GENE-080724-8 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00639
            GeneTree:ENSGT00560000076599 CTD:1519 KO:K01374 EMBL:CR931784
            IPI:IPI00513613 RefSeq:XP_695717.3 UniGene:Dr.88386
            Ensembl:ENSDART00000074786 GeneID:567333 KEGG:dre:567333
            NextBio:20888622 Uniprot:E7FA09
        Length = 334

 Score = 357 (130.7 bits), Expect = 1.1e-32, P = 1.1e-32
 Identities = 90/262 (34%), Positives = 134/262 (51%)

Query:    92 GLNEFADMSHEEFKNKYLGLKPQF-PTRRQPSAEFSYRDVKAL-PKSVDWRKKGAVTPVK 149
             G+N+F+ +S ++FK +YL  + +  P   Q  +E     VKA  P   DWR  G V PV 
Sbjct:    81 GVNQFSYLSQKQFKEQYLTARAEAAPKFDQSKSEIK---VKANNPPRFDWRDHGVVGPVH 137

Query:   150 NQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIV 209
             NQGSCG CWAFS V A+E ++      L  LS Q++IDC    N GCNGG    A  ++ 
Sbjct:   138 NQGSCGGCWAFSIVEAIESVSAKGGEKLQQLSVQQVIDCSYQ-NQGCNGGSPVEALYWLT 196

Query:   210 ASG-GLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVP-ENDEQSLLKALAH-QPVSV 266
              S   L  E +YP+   +G C+   +    V +  Y        E+ ++ AL    P+ V
Sbjct:   197 QSKLKLVSEAEYPFKGADGVCQFFPQAHAGVAVRNYSAYDFSGQEEVMMSALVDFGPLVV 256

Query:   267 AIEASGTDFQFYSGGVFTGPCGA-ELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYI 325
              ++A    +Q Y GG+    C + + +H V   GY  +    Y IV+NSWG  WG+ GY 
Sbjct:   257 IVDA--ISWQDYLGGIIQHHCSSHKANHAVLITGYDTTGEVPYWIVRNSWGTSWGDDGYA 314

Query:   326 RMKRNTGKPEGLCGI-NKMASI 346
              +K   G    +CG+ + +A++
Sbjct:   315 YIK--IGND--VCGVADSVAAV 332


>RGD|1564827
            symbol:RGD1564827 "similar to cathepsin M" species:10116 "Rattus
            norvegicus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00639 IPI:IPI00192321
            Ensembl:ENSRNOT00000023990 ArrayExpress:D3ZY04 Uniprot:D3ZY04
        Length = 338

 Score = 356 (130.4 bits), Expect = 1.4e-32, P = 1.4e-32
 Identities = 75/211 (35%), Positives = 112/211 (53%)

Query:   145 VTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDT-SFNNGCNGGLMDY 203
             V     QG C SCWAF  V A+EG     +G LT LS Q L+DC     N GC GG    
Sbjct:   133 VHTASTQGRCNSCWAFPVVGAIEGQMFKKTGKLTPLSVQNLVDCSKPQGNKGCRGGTTYN 192

Query:   204 AFKYIVASGGLHKEEDYPYLMEEGTCE-DKKEEMEVVTISGYQDVPENDEQSLLKALAHQ 262
             AF+Y++ +GGL  E  YPY  +EG C  +     ++  I      P+ +E  L+ A+A +
Sbjct:   193 AFQYVLQNGGLESEATYPYEGKEGLCRYNPNSSAKITXICA---PPQKNEDVLMDAVATK 249

Query:   263 PVSVAIEASGTDFQFYSGGVFTGP-CGAELDHGVAAVGYG----KSKGSDYIIVKNSWGP 317
             PV+  I    +  +FY  G++  P C   ++H V  VGYG    ++ G++Y +++NSWG 
Sbjct:   250 PVAAGIHVVHSSLRFYKKGIYHEPKCNNYVNHAVLVVGYGFEGNETDGNNYWLIQNSWGE 309

Query:   318 KWGERGYIRMKRNTGKPEGLCGINKMASIPL 348
             +WG  GY+++ ++       CGI   A  P+
Sbjct:   310 RWGLNGYMKIAKDRNNH---CGIATFAQYPI 337


>UNIPROTKB|E9PI30
            symbol:CTSW "Cathepsin W" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848 PROSITE:PS00639
            EMBL:AP001201 HGNC:HGNC:2546 IPI:IPI00984532
            ProteinModelPortal:E9PI30 SMR:E9PI30 Ensembl:ENST00000528419
            ArrayExpress:E9PI30 Bgee:E9PI30 Uniprot:E9PI30
        Length = 364

 Score = 298 (110.0 bits), Expect = 8.8e-32, Sum P(2) = 8.8e-32
 Identities = 84/277 (30%), Positives = 138/277 (49%)

Query:    43 KLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKE-VTSYWLGLNEFADMSH 101
             +L E F+ +  +  ++Y   EE  HR +IF  NL    +  +E + +   G+  F+D++ 
Sbjct:    37 ELKEAFKLFQIQFNRSYLSPEEHAHRLDIFAHNLAQAQRLQEEDLGTAEFGVTPFSDLTE 96

Query:   102 EEFKNKYLGLKPQFPTRRQPSAEF-SYRDVKALPKSVDWRK-KGAVTPVKNQGSCGSCWA 159
             EEF   Y G +           E  S    +++P S DWRK   A++P+K+Q +C  CWA
Sbjct:    97 EEFGQLY-GYRRAAGGVPSMGREIRSEEPEESVPFSCDWRKVASAISPIKDQKNCNCCWA 155

Query:   160 FSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEED 219
              +    +E + +I   +   +S QEL+DC     +GC+GG +  AF  ++ + GL  E+D
Sbjct:   156 MAAAGNIETLWRISFWDFVDVSVQELLDCGRC-GDGCHGGFVWDAFITVLNNSGLASEKD 214

Query:   220 YPYL--MEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALA-HQPVSVAIEASGTDFQ 276
             YP+   +    C  KK + +V  I  +  + +N+E  + + LA + P++V I       Q
Sbjct:   215 YPFQGKVRAHRCHPKKYQ-KVAWIQDFIML-QNNEHRIAQYLATYGPITVTINMK--PLQ 270

Query:   277 FYSGGVFTG-P--CGAEL-DHGVAAVGYGKSKGSDYI 309
              Y  GV    P  C  +L DH V  VG+G  K  + I
Sbjct:   271 LYRKGVIKATPTTCDPQLVDHSVLLVGFGSVKSEEGI 307

 Score = 66 (28.3 bits), Expect = 8.8e-32, Sum P(2) = 8.8e-32
 Identities = 13/27 (48%), Positives = 17/27 (62%)

Query:   308 YIIVKNSWGPKWGER-GYIRMKRNTGK 333
             Y I+KNSWG +WGE+   I   R  G+
Sbjct:   326 YWILKNSWGAQWGEKVSVIYWGRGQGR 352


>WB|WBGene00008861
            symbol:F15D4.4 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR013201
            PANTHER:PTHR12411 SMART:SM00848 PROSITE:PS00640
            GeneTree:ENSGT00560000076599 EMBL:Z80344 HSSP:P53634
            eggNOG:NOG310593 PIR:T20981 ProteinModelPortal:Q93512 SMR:Q93512
            MEROPS:C01.A45 EnsemblMetazoa:F15D4.4 KEGG:cel:CELE_F15D4.4
            UCSC:F15D4.4 CTD:184530 WormBase:F15D4.4 InParanoid:Q93512
            OMA:ITMEQNI NextBio:925068 Uniprot:Q93512
        Length = 608

 Score = 353 (129.3 bits), Expect = 1.1e-31, P = 1.1e-31
 Identities = 94/292 (32%), Positives = 139/292 (47%)

Query:    63 EEKLHRFEIFKENLKHIDQRN----KEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTR 118
             +E L RF ++ +  K +D+ N      ++SY +  N+F+     E     L L    PT 
Sbjct:   149 KEGLKRFNVYSKVKKEVDEHNIMYELGMSSYKMSTNQFSVALDGEVAPLTLNLDALTPTA 208

Query:   119 RQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLT 178
                 A  S R  +    +VDWR    + P+ +Q +CG CWAFS ++ +E    I   N +
Sbjct:   209 TVIPATISSRKKRDTEPTVDWRP--FLKPILDQSTCGGCWAFSMISMIESFFAIQGYNTS 266

Query:   179 SLSEQELIDCDTSF-------NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCED 231
             SLS Q+L+ CDT         N GC GG    A  Y+  S         P+ +E+ +C+ 
Sbjct:   267 SLSVQQLLTCDTKVDSTYGLANVGCKGGYFQIAGSYLEVSAA-RDASLIPFDLEDTSCDS 325

Query:   232 KKEEMEVVTISGYQD--VPEND--------EQSLLKALAHQPVSVAIEASGTDFQFYSGG 281
                   V TI  + D  +  N         EQ++   +   P++V + A+G D   YS G
Sbjct:   326 SFFPPVVPTILLFDDGYISGNFTAAQLITMEQNIEDKVRKGPIAVGM-AAGPDIYKYSEG 384

Query:   282 VFTGPCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGK 333
             V+ G CG  ++H V  VG+      DY I++NSWG  WGE GY R+KR  GK
Sbjct:   385 VYDGDCGTIINHAVVIVGFT----DDYWIIRNSWGASWGEAGYFRVKRTPGK 432


>WB|WBGene00013764
            symbol:Y113G7B.15 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00640 PROSITE:PS00139 GeneTree:ENSGT00560000076599
            EMBL:AL110477 HOGENOM:HOG000019851 RefSeq:NP_507904.2
            ProteinModelPortal:Q9U2X1 SMR:Q9U2X1 DIP:DIP-25339N IntAct:Q9U2X1
            MINT:MINT-1058673 STRING:Q9U2X1 MEROPS:C01.A47
            EnsemblMetazoa:Y113G7B.15 GeneID:190976 KEGG:cel:CELE_Y113G7B.15
            UCSC:Y113G7B.15 CTD:190976 WormBase:Y113G7B.15 eggNOG:NOG302449
            OMA:AEEDIME Uniprot:Q9U2X1
        Length = 362

 Score = 287 (106.1 bits), Expect = 7.7e-31, Sum P(2) = 7.7e-31
 Identities = 89/242 (36%), Positives = 119/242 (49%)

Query:   112 KPQFPT-RRQPSAEFSYRDVKALPKSVDWRK---KGA--VTPVKNQGSCGSCWAFSTVAA 165
             KP+ P   R    + S R    +P   D R     G+  V PVK+Q  CG CWAF+T A 
Sbjct:   109 KPRHPRGSRNHHNKRSKRQSGDIPDYFDLRDIYVDGSPVVGPVKDQEQCGCCWAFATTAI 168

Query:   166 VEGINQIVSGNLTSLSEQELIDC-DTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLM 224
              E  N + S + TSLS+QE+ DC D+    GC GG      K +V   G   + DYPY  
Sbjct:   169 TEAANTLYSKSFTSLSDQEICDCADSGDTPGCVGGDPRNGLK-MVHLRGQSSDGDYPY-- 225

Query:   225 EE------GTC--EDKKEEMEVVTISGY---QDVPENDEQSLLKALAHQPVSVAIEASGT 273
             EE      G C  ++K   ++  T++ Y   QD  E D    L  L H P +V     G 
Sbjct:   226 EEYRANTTGNCVGDEKSTVIQPETLNVYRFDQDYAEEDIMENLY-LNHIPTAVYFRV-GE 283

Query:   274 DFQFYSGGVFTGP-C----GAELDHGVAAVGYGKSK-GSDYIIVKNSWGPKWGERGYIRM 327
             +F++Y+ GV     C     AE  H VA VGYG S  G  Y +V+NSW   WG  GY+++
Sbjct:   284 NFEWYTSGVLQSEDCYQMTPAEW-HSVAIVGYGTSDDGVPYWLVRNSWNSDWGLHGYVKI 342

Query:   328 KR 329
             +R
Sbjct:   343 RR 344

 Score = 68 (29.0 bits), Expect = 7.7e-31, Sum P(2) = 7.7e-31
 Identities = 19/76 (25%), Positives = 36/76 (47%)

Query:    43 KLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRN----KEVTSYWLGLNEFAD 98
             +++  F ++   H K Y+   EK  R   F +N + I + N    +E  +   G N+FAD
Sbjct:    25 EVLSHFNNFTMHHKKHYRTPAEKDRRLAHFAKNHQKIQELNAKARREGRNVTFGWNKFAD 84

Query:    99 MSHEEFKNKYLGLKPQ 114
              + +E   +   + P+
Sbjct:    85 KNRQELSARNSKIHPK 100


>UNIPROTKB|F1P0K2
            symbol:CTSO "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GeneTree:ENSGT00560000076599
            OMA:SNVCGIA EMBL:AADN02016534 IPI:IPI00651180
            Ensembl:ENSGALT00000015270 Uniprot:F1P0K2
        Length = 320

 Score = 336 (123.3 bits), Expect = 1.8e-30, P = 1.8e-30
 Identities = 88/284 (30%), Positives = 140/284 (49%)

Query:    63 EEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKP-QFPTR-RQ 120
             EE+        + ++ ++  + +  S + G N+F+ +  EEFK  YL   P + P   + 
Sbjct:    40 EEEAAALRESAKRIRLLNSPSNDNGSAFYGKNQFSHLFPEEFKAIYLRSIPYKLPRYIKV 99

Query:   121 PSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSL 180
             P  E      K LPK  DWR K  +  V+NQ +CG CWAFS V  +E    I   NL  L
Sbjct:   100 PKGE-----EKPLPKKFDWRDKKVIAEVRNQQTCGGCWAFSVVGGIESAYAIKGHNLEEL 154

Query:   181 SEQELIDCDTSFNNGCNGGLMDYAFKYIVASG-GLHKEEDYPYLMEEGTCEDKKEEMEVV 239
             S Q++IDC  S N GC+GG    A  ++  +   L ++ +Y +  + G C         V
Sbjct:   155 SVQQVIDCSYS-NYGCSGGSTITALSWLNQTKVKLVRDSEYTFKAQTGLCHYFPHSDFGV 213

Query:   240 TISGYQDVP-ENDEQSLLKALAHQ-PVSVAIEASGTDFQFYSGGVFTGPCGA-ELDHGVA 296
             +I+G+        E+ +++ L    P++V ++A    +Q Y GG+    C + + +H V 
Sbjct:   214 SITGFAAYDFSGQEEEMMRVLVDWGPLAVTVDA--VSWQDYLGGIIQYHCSSGKANHAVL 271

Query:   297 AVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGI 340
               G+  +    Y IV+NSWG  WG  GY+R+K  +     +CGI
Sbjct:   272 ITGFDTTGIIPYWIVQNSWGRTWGIDGYVRVKIGSN----VCGI 311


>UNIPROTKB|Q5QP40
            symbol:CTSK "Cathepsin K" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015644 Pfam:PF00112
            InterPro:IPR000169 HOGENOM:HOG000230774 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            PROSITE:PS00139 EMBL:AL355860 HOVERGEN:HBG011513
            PANTHER:PTHR12411:SF55 EMBL:AL356292 UniGene:Hs.632466
            HGNC:HGNC:2536 IPI:IPI00514633 SMR:Q5QP40 STRING:Q5QP40
            Ensembl:ENST00000443913 Uniprot:Q5QP40
        Length = 258

 Score = 333 (122.3 bits), Expect = 3.8e-30, P = 3.8e-30
 Identities = 76/176 (43%), Positives = 104/176 (59%)

Query:    48 FESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKE----VTSYWLGLNEFADMSHEE 103
             +E W   H K Y    +++ R  I+++NLK+I   N E    V +Y L +N   DM+ EE
Sbjct:    85 WELWKKTHRKQYNNKVDEISRRLIWEKNLKYISIHNLEASLGVHTYELAMNHLGDMTSEE 144

Query:   104 FKNKYLGLK-PQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFST 162
                K  GLK P   +R   +      + +A P SVD+RKKG VTPVKNQG CGSCWAFS+
Sbjct:   145 VVQKMTGLKVPLSHSRSNDTLYIPEWEGRA-PDSVDYRKKGYVTPVKNQGQCGSCWAFSS 203

Query:   163 VAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEE 218
             V A+EG  +  +G L +LS Q L+DC  S N+GC GG M  AF+Y+  + G+  E+
Sbjct:   204 VGALEGQLKKKTGKLLNLSPQNLVDC-VSENDGCGGGYMTNAFQYVQKNRGIDSED 258


>UNIPROTKB|O97578
            symbol:CTSC "Dipeptidyl peptidase 1" species:9615 "Canis
            lupus familiaris" [GO:0005764 "lysosome" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0006508
            "proteolysis" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 CTD:1075
            HOGENOM:HOG000068022 HOVERGEN:HBG005248 KO:K01275 OrthoDB:EOG4H19VZ
            InterPro:IPR014882 Pfam:PF08773 EMBL:AF060171 RefSeq:NP_001182763.1
            UniGene:Cfa.28653 ProteinModelPortal:O97578 SMR:O97578
            MEROPS:C01.070 PRIDE:O97578 GeneID:403458 KEGG:cfa:403458
            InParanoid:O97578 NextBio:20816976 Uniprot:O97578
        Length = 435

 Score = 332 (121.9 bits), Expect = 4.9e-30, P = 4.9e-30
 Identities = 99/304 (32%), Positives = 151/304 (49%)

Query:    71 IFKENLKHIDQRNKEVTSYWLGLN--EFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYR 128
             ++K N + +   N  +   W      E+  ++  +   +  G K   P     +AE  + 
Sbjct:   142 LYKYNYEFVKAINT-IQKSWTATRYIEYETLTLRDMMTRVGGRKIPRPKPTPLTAEI-HE 199

Query:   129 DVKALPKSVDWRK-KGA--VTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTS--LSEQ 183
             ++  LP S DWR  +G   V+PV+NQ SCGSC+AF++ A +E   +I++ N  +  LS Q
Sbjct:   200 EISRLPTSWDWRNVRGTNFVSPVRNQASCGSCYAFASTAMLEARIRILTNNTQTPILSPQ 259

Query:   184 ELIDCDTSFNNGCNGGLMDY--AFKYIVASGGLHKEEDYPYLMEEGTCED----KKEEME 237
             E++ C + +  GC GG   Y  A KY     GL +E  +PY   +  C+     +    E
Sbjct:   260 EIVSC-SQYAQGCEGGF-PYLIAGKY-AQDFGLVEEACFPYAGSDSPCKPNDCFRYYSSE 316

Query:   238 VVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVF--TG---PCGA-EL 291
                + G+     N+    L+ + H P++VA E    DF  Y  G++  TG   P    EL
Sbjct:   317 YYYVGGFYGAC-NEALMKLELVRHGPMAVAFEVYD-DFFHYQKGIYYHTGLRDPFNPFEL 374

Query:   292 -DHGVAAVGYG--KSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMA--SI 346
              +H V  VGYG   + G DY IVKNSWG +WGE GY R++R T +    C I  +A  + 
Sbjct:   375 TNHAVLLVGYGTDSASGMDYWIVKNSWGSRWGEDGYFRIRRGTDE----CAIESIAVAAT 430

Query:   347 PLKK 350
             P+ K
Sbjct:   431 PIPK 434


>UNIPROTKB|E9PKT6
            symbol:CTSH "Cathepsin H" species:9606 "Homo sapiens"
            [GO:0001520 "outer dense fiber" evidence=IEA] [GO:0001656
            "metanephros development" evidence=IEA] [GO:0001669 "acrosomal
            vesicle" evidence=IEA] [GO:0001913 "T cell mediated cytotoxicity"
            evidence=IEA] [GO:0004177 "aminopeptidase activity" evidence=IEA]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=IEA]
            [GO:0004252 "serine-type endopeptidase activity" evidence=IEA]
            [GO:0007283 "spermatogenesis" evidence=IEA] [GO:0008284 "positive
            regulation of cell proliferation" evidence=IEA] [GO:0010634
            "positive regulation of epithelial cell migration" evidence=IEA]
            [GO:0016505 "apoptotic protease activator activity" evidence=IEA]
            [GO:0030984 "kininogen binding" evidence=IEA] [GO:0031638 "zymogen
            activation" evidence=IEA] [GO:0031648 "protein destabilization"
            evidence=IEA] [GO:0032403 "protein complex binding" evidence=IEA]
            [GO:0032526 "response to retinoic acid" evidence=IEA] [GO:0033619
            "membrane protein proteolysis" evidence=IEA] [GO:0043066 "negative
            regulation of apoptotic process" evidence=IEA] [GO:0043621 "protein
            self-association" evidence=IEA] [GO:0045766 "positive regulation of
            angiogenesis" evidence=IEA] [GO:0060448 "dichotomous subdivision of
            terminal units involved in lung branching" evidence=IEA]
            [GO:0097067 "cellular response to thyroid hormone stimulus"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            InterPro:IPR000169 GO:GO:0043066 GO:GO:0008284 PANTHER:PTHR12411
            PROSITE:PS00139 GO:GO:0045766 GO:GO:0004252 GO:GO:0032526
            GO:GO:0016505 GO:GO:0010634 GO:GO:0004197 GO:GO:0031648
            GO:GO:0031638 GO:GO:0001913 GO:GO:0060448 GO:GO:0033619
            EMBL:AC011944 HGNC:HGNC:2535 IPI:IPI00375426
            ProteinModelPortal:E9PKT6 SMR:E9PKT6 PRIDE:E9PKT6
            Ensembl:ENST00000528741 ArrayExpress:E9PKT6 Bgee:E9PKT6
            Uniprot:E9PKT6
        Length = 134

 Score = 329 (120.9 bits), Expect = 1.0e-29, P = 1.0e-29
 Identities = 67/135 (49%), Positives = 88/135 (65%)

Query:    91 LGLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSY-RDVKALPKSVDWRKKGA-VTPV 148
             + LN+F+DMS  E K+KYL  +PQ       + + +Y R     P SVDWRKKG  V+PV
Sbjct:     1 MALNQFSDMSFAEIKHKYLWSEPQ----NCSATKSNYLRGTGPYPPSVDWRKKGNFVSPV 56

Query:   149 KNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNN-GCNGGLMDYAFKY 207
             KNQG+CGSCW FST  A+E    I +G + SL+EQ+L+DC   FNN GC GGL   AF+Y
Sbjct:    57 KNQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEY 116

Query:   208 IVASGGLHKEEDYPY 222
             I+ + G+  E+ YPY
Sbjct:   117 ILYNKGIMGEDTYPY 131


>WB|WBGene00008231
            symbol:tag-329 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 GeneTree:ENSGT00560000076599
            eggNOG:NOG288820 EMBL:Z70750 HSSP:P53634 HOGENOM:HOG000019851
            PIR:T20110 RefSeq:NP_505458.1 ProteinModelPortal:Q18740 SMR:Q18740
            MEROPS:C01.A36 EnsemblMetazoa:C50F4.3 GeneID:183677
            KEGG:cel:CELE_C50F4.3 UCSC:C50F4.3 CTD:183677 WormBase:C50F4.3
            InParanoid:Q18740 OMA:WIFRNSW NextBio:921986 Uniprot:Q18740
        Length = 374

 Score = 328 (120.5 bits), Expect = 1.3e-29, P = 1.3e-29
 Identities = 99/319 (31%), Positives = 142/319 (44%)

Query:    42 DKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEV------TSYWLGLNE 95
             +KL + FE ++ K+ + YK   EK  RF+ F      + + NK        T Y  G+N+
Sbjct:    41 EKLYKEFEDFIVKYKRNYKDEIEKKFRFQQFVATHNRVGKMNKAAKKAGHDTKY--GINK 98

Query:    96 FADMSHEEFKNKYLGLKPQFPTRRQPSAEFSY----RDVKALPKSVDWRKK--GA---VT 146
             F+D+S +E    Y    P       P          R ++ LPK+ D R K  G    + 
Sbjct:    99 FSDLSKKEIHGMYSKFGPPKNNTNVPKFNLKNLRVKRQMEGLPKTFDLRNKKVGGHYIIG 158

Query:   147 PVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGCNGGLMDYAFK 206
             P+K Q SC  CW F+  A  E    +      +LSEQE+ DC      GCNGG      +
Sbjct:   159 PIKTQDSCACCWGFAATAVAEAALTVHLKKAMNLSEQEVCDCAPKHGPGCNGGDPVDGLE 218

Query:   207 YIVASGGLHKEEDYPYLMEEGT----CEDKKEEMEV--VTISGYQDVPENDEQSLLKAL- 259
             YI   G L   ++YP+ +   T    CE +K + E+  + +  Y   P N E  +   L 
Sbjct:   219 YIKEMG-LTGGKEYPFNVNRSTQLGRCESEKYDRELNPLELDYYAIDPFNAEYQMTHHLY 277

Query:   260 -AHQPVSVAIEASGTDFQFYSGGVFTGPCGAELD---HGVAAVGYGKSKGS-----DYII 310
               + P+SVA     +   + SG +    C  E     H  A VGYG +K S     DY I
Sbjct:   278 LLNLPISVAFRTGASLSSYLSGILELADCDDEKGGHWHSGAIVGYGTTKNSAGRTVDYWI 337

Query:   311 VKNSWGPKWGERGYIRMKR 329
              +NSW   WG+ GY R+ R
Sbjct:   338 FRNSWWTDWGDDGYARIVR 356


>RGD|2445
            symbol:Ctsc "cathepsin C" species:10116 "Rattus norvegicus"
          [GO:0001913 "T cell mediated cytotoxicity" evidence=IEA;ISO]
          [GO:0004197 "cysteine-type endopeptidase activity" evidence=NAS]
          [GO:0004252 "serine-type endopeptidase activity" evidence=IEA;ISO]
          [GO:0005764 "lysosome" evidence=IDA;TAS] [GO:0005783 "endoplasmic
          reticulum" evidence=IDA] [GO:0005794 "Golgi apparatus" evidence=IDA]
          [GO:0006508 "proteolysis" evidence=IEP;ISO;TAS] [GO:0007568 "aging"
          evidence=IEP] [GO:0008234 "cysteine-type peptidase activity"
          evidence=ISO] [GO:0010033 "response to organic substance"
          evidence=IDA] [GO:0031404 "chloride ion binding" evidence=IDA]
          [GO:0042802 "identical protein binding" evidence=IDA] [GO:0043621
          "protein self-association" evidence=IDA] InterPro:IPR000668
          InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
          InterPro:IPR000169 RGD:2445 GO:GO:0005783 GO:GO:0005794 GO:GO:0007568
          GO:GO:0010033 eggNOG:COG4870 GO:GO:0006508 InterPro:IPR025661
          InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139
          PROSITE:PS00639 GO:GO:0004252 GO:GO:0005764 GO:GO:0043621
          GO:GO:0042802 GO:GO:0031404 GO:GO:0004197
          GeneTree:ENSGT00560000076599 CTD:1075 HOGENOM:HOG000068022
          HOVERGEN:HBG005248 KO:K01275 OrthoDB:EOG4H19VZ InterPro:IPR014882
          Pfam:PF08773 MEROPS:C01.070 OMA:YDDFLHY GO:GO:0001913 EMBL:D90404
          IPI:IPI00193765 PIR:A41158 RefSeq:NP_058793.1 UniGene:Rn.203177
          PDB:1JQP PDBsum:1JQP ProteinModelPortal:P80067 SMR:P80067
          STRING:P80067 PhosphoSite:P80067 PRIDE:P80067
          Ensembl:ENSRNOT00000022342 GeneID:25423 KEGG:rno:25423
          InParanoid:P80067 SABIO-RK:P80067 EvolutionaryTrace:P80067
          NextBio:606591 ArrayExpress:P80067 Genevestigator:P80067
          GermOnline:ENSRNOG00000016496 Uniprot:P80067
        Length = 462

 Score = 326 (119.8 bits), Expect = 2.1e-29, P = 2.1e-29
 Identities = 91/253 (35%), Positives = 136/253 (53%)

Query:   121 PSAEFSYRDVKALPKSVDWRK-KGA--VTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNL 177
             P  +   + + +LP+S DWR  +G   V+PV+NQ SCGSC++F+++  +E   +I++ N 
Sbjct:   218 PITDEIQQQILSLPESWDWRNVRGINFVSPVRNQESCGSCYSFASLGMLEARIRILTNNS 277

Query:   178 TS--LSEQELIDCDTSFNNGCNGGLMDY--AFKYIVASGGLHKEEDYPYLMEEGTCEDKK 233
              +  LS QE++ C + +  GC+GG   Y  A KY     G+ +E  +PY   +  C+ K+
Sbjct:   278 QTPILSPQEVVSC-SPYAQGCDGGF-PYLIAGKY-AQDFGVVEENCFPYTATDAPCKPKE 334

Query:   234 E-----EMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVF--TG- 285
                     E   + G+     N+    L+ + H P++VA E    DF  Y  G++  TG 
Sbjct:   335 NCLRYYSSEYYYVGGFYGGC-NEALMKLELVKHGPMAVAFEVHD-DFLHYHSGIYHHTGL 392

Query:   286 --PCGA-EL-DHGVAAVGYGKSK--GSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCG 339
               P    EL +H V  VGYGK    G DY IVKNSWG +WGE GY R++R T +    C 
Sbjct:   393 SDPFNPFELTNHAVLLVGYGKDPVTGLDYWIVKNSWGSQWGESGYFRIRRGTDE----CA 448

Query:   340 INK--MASIPLKK 350
             I    MA+IP+ K
Sbjct:   449 IESIAMAAIPIPK 461


>UNIPROTKB|J9P219
            symbol:J9P219 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00639 GeneTree:ENSGT00560000076599 InterPro:IPR014882
            Pfam:PF08773 OMA:YDDFLHY EMBL:AAEX03012741
            Ensembl:ENSCAFT00000050015 Uniprot:J9P219
        Length = 406

 Score = 319 (117.4 bits), Expect = 1.2e-28, P = 1.2e-28
 Identities = 96/280 (34%), Positives = 144/280 (51%)

Query:    95 EFADMSHEEFKNKYLGLK-PQFPTRRQPSAEFSYRDVKALPKSVDWRK-KGA--VTPVKN 150
             E+  ++  +   +  G K P+ P     +AE  + ++  LP S DWR  +G   V+PV+N
Sbjct:   136 EYETLTLRDMMTRGGGRKIPRKPKPTPLTAEI-HEEISRLPTSWDWRNVRGTNFVSPVRN 194

Query:   151 QG-SCGSCWAFSTVAAVEGINQIVSGNLTS--LSEQELIDCDTSFNNGCNGGLMDY--AF 205
             Q  SCGSC+AF++ A +E   +I++ N  +  LS QE++ C + +  GC GG   Y  A 
Sbjct:   195 QAASCGSCYAFASTAMLEARIRILTNNTQTPILSPQEIVSC-SQYAQGCEGGF-PYLIAG 252

Query:   206 KYIVASGGLHKEEDYPYLMEEGTCED----KKEEMEVVTISGYQDVPENDEQSLLKALAH 261
             KY     GL +E  +PY   +  C+     +    E   + G+     N+    L+ + H
Sbjct:   253 KY-AQDFGLVEEACFPYAGSDSPCKPNDCFRYYSSEYYYVGGFYGAC-NEALMKLELVRH 310

Query:   262 QPVSVAIEASGTDFQFYSGGVF--TG---PCGA-EL-DHGVAAVGYG--KSKGSDYIIVK 312
              P++VA E    DF  Y  G++  TG   P    EL +H V  VGYG   + G DY IVK
Sbjct:   311 GPMAVAFEVYD-DFFHYQKGIYYHTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVK 369

Query:   313 NSWGPKWGERGYIRMKRNTGKPEGLCGINKMA--SIPLKK 350
             NSWG +WGE GY R++R T +    C I  +A  + P+ K
Sbjct:   370 NSWGSRWGEDGYFRIRRGTDE----CAIESIAVAATPIPK 405


>UNIPROTKB|F1PSK8
            symbol:F1PSK8 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00639 GeneTree:ENSGT00560000076599 InterPro:IPR014882
            Pfam:PF08773 EMBL:AAEX03012741 Ensembl:ENSCAFT00000007054
            Uniprot:F1PSK8
        Length = 405

 Score = 316 (116.3 bits), Expect = 2.4e-28, P = 2.4e-28
 Identities = 95/279 (34%), Positives = 142/279 (50%)

Query:    95 EFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRK-KGA--VTPVKNQ 151
             E+  ++  +   +  G K   P     +AE  + ++  LP S DWR  +G   V+PV+NQ
Sbjct:   136 EYETLTLRDMMTRGGGRKIPRPKPTPLTAEI-HEEISRLPTSWDWRNVRGTNFVSPVRNQ 194

Query:   152 G-SCGSCWAFSTVAAVEGINQIVSGNLTS--LSEQELIDCDTSFNNGCNGGLMDY--AFK 206
               SCGSC+AF++ A +E   +I++ N  +  LS QE++ C + +  GC GG   Y  A K
Sbjct:   195 AASCGSCYAFASTAMLEARIRILTNNTQTPILSPQEIVSC-SQYAQGCEGGF-PYLIAGK 252

Query:   207 YIVASGGLHKEEDYPYLMEEGTCED----KKEEMEVVTISGYQDVPENDEQSLLKALAHQ 262
             Y     GL +E  +PY   +  C+     +    E   + G+     N+    L+ + H 
Sbjct:   253 Y-AQDFGLVEEACFPYAGSDSPCKPNDCFRYYSSEYYYVGGFYGAC-NEALMKLELVRHG 310

Query:   263 PVSVAIEASGTDFQFYSGGVF--TG---PCGA-EL-DHGVAAVGYG--KSKGSDYIIVKN 313
             P++VA E    DF  Y  G++  TG   P    EL +H V  VGYG   + G DY IVKN
Sbjct:   311 PMAVAFEVYD-DFFHYQKGIYYHTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKN 369

Query:   314 SWGPKWGERGYIRMKRNTGKPEGLCGINKMA--SIPLKK 350
             SWG +WGE GY R++R T +    C I  +A  + P+ K
Sbjct:   370 SWGSRWGEDGYFRIRRGTDE----CAIESIAVAATPIPK 404


>WB|WBGene00019314
            symbol:K02E7.10 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0016021 "integral to
            membrane" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 SMART:SM00645 GO:GO:0008234 GO:GO:0006508
            PANTHER:PTHR12411 GeneTree:ENSGT00560000076599 HSSP:P07711
            EMBL:FO080411 PIR:T32392 RefSeq:NP_493904.1 UniGene:Cel.14828
            ProteinModelPortal:O17255 SMR:O17255 EnsemblMetazoa:K02E7.10
            GeneID:186889 KEGG:cel:CELE_K02E7.10 UCSC:K02E7.10 CTD:186889
            WormBase:K02E7.10 eggNOG:NOG331187 HOGENOM:HOG000114005
            InParanoid:O17255 OMA:GNANEAR NextBio:933344 Uniprot:O17255
        Length = 299

 Score = 315 (115.9 bits), Expect = 3.1e-28, P = 3.1e-28
 Identities = 78/220 (35%), Positives = 113/220 (51%)

Query:   137 VDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGIN-QIVSGNLTSLSEQELIDCDTSFNNG 195
             +DWR+KG V PVK+QG C + +AF+ +AA+E +  +  +G L S SEQ++IDC  +F N 
Sbjct:    84 LDWREKGIVGPVKDQGKCNASYAFAAIAAIESMYAKANNGKLLSFSEQQIIDC-ANFTNP 142

Query:   196 CNGGLMDYAFKYIVASGGLHKEEDYPYLMEE--GTCEDKKEEMEVVTISGYQDVPENDEQ 253
             C   L +      +   G+  E DYPY+ +E  G CE    +M++     Y DV  N+E 
Sbjct:   143 CQENLENVLSNRFLKENGVGTEADYPYVGKENVGKCEYDSSKMKLRPT--YIDVYPNEEW 200

Query:   254 SLLKALAHQPVSVAIEASGTDFQFYSGGVFTGP---CG-AELDHGVAAVGYGKSKGSDYI 309
             +             +  S   F  Y  G++      CG A     +A VGYGK     Y 
Sbjct:   201 ARAHITTFGTGYFRMR-SPPSFFHYKTGIYNPTKEECGNANEARSLAIVGYGKDGAEKYW 259

Query:   310 IVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIPLK 349
             IVK S+G  WGE GY+++ RN       CG+ +  SIP+K
Sbjct:   260 IVKGSFGTSWGEHGYMKLARNVNA----CGMAESISIPIK 295


>UNIPROTKB|F1N455
            symbol:CTSC "Dipeptidyl peptidase 1 exclusion domain chain"
            species:9913 "Bos taurus" [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0004252 "serine-type endopeptidase activity" evidence=IEA]
            [GO:0001913 "T cell mediated cytotoxicity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0004252
            GeneTree:ENSGT00560000076599 IPI:IPI00697314 UniGene:Bt.49573
            InterPro:IPR014882 Pfam:PF08773 OMA:YDDFLHY GO:GO:0001913
            EMBL:DAAA02062487 EMBL:DAAA02062488 Ensembl:ENSBTAT00000014735
            Uniprot:F1N455
        Length = 463

 Score = 312 (114.9 bits), Expect = 7.7e-28, P = 7.7e-28
 Identities = 96/252 (38%), Positives = 134/252 (53%)

Query:   122 SAEFSYRDVKALPKSVDWRK-KGA--VTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLT 178
             +AE   + +  LP S DWR   G   VTPV+NQGSCGSC++F+++  +E   +I++ N  
Sbjct:   221 TAEIQ-KKILHLPTSWDWRNVHGINFVTPVRNQGSCGSCYSFASMGMMEARIRILTNNTQ 279

Query:   179 S--LSEQELIDCDTSFNNGCNGGLMDY--AFKYIVASGGLHKEEDYPYLMEEGTCEDKKE 234
             +  LS QE++ C + +  GC GG   Y  A KY     GL +E+ +PY   +  C  K+ 
Sbjct:   280 TPILSPQEVVSC-SQYAQGCEGGF-PYLIAGKY-AQDFGLVEEDCFPYTGTDSPCRLKEG 336

Query:   235 EMEVVTISGYQDVPE---NDEQSLLKA-LAHQ-PVSVAIEASGTDFQFYSGGVF--TG-- 285
                  + S Y  V        ++L+K  L HQ P++VA E    DF  Y  GV+  TG  
Sbjct:   337 CFRYYS-SEYHYVGGFYGGCNEALMKLELVHQGPMAVAFEVYD-DFLHYRKGVYHHTGLR 394

Query:   286 -PCGA-EL-DHGVAAVGYGK--SKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGI 340
              P    EL +H V  VGYG   + G DY IVKNSWG  WGE GY R++R T +    C I
Sbjct:   395 DPFNPFELTNHAVLLVGYGTDAASGLDYWIVKNSWGTSWGENGYFRIRRGTDE----CAI 450

Query:   341 NK--MASIPLKK 350
                 +A+ P+ K
Sbjct:   451 ESIALAATPIPK 462


>UNIPROTKB|Q3ZCJ8
            symbol:CTSC "Dipeptidyl peptidase 1" species:9913 "Bos
            taurus" [GO:0031638 "zymogen activation" evidence=IDA] [GO:0005764
            "lysosome" evidence=IEA] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0005764 EMBL:BC102115 IPI:IPI00697314 RefSeq:NP_001028789.1
            UniGene:Bt.49573 ProteinModelPortal:Q3ZCJ8 SMR:Q3ZCJ8 STRING:Q3ZCJ8
            PRIDE:Q3ZCJ8 GeneID:352958 KEGG:bta:352958 CTD:1075
            HOGENOM:HOG000068022 HOVERGEN:HBG005248 InParanoid:Q3ZCJ8 KO:K01275
            OrthoDB:EOG4H19VZ BindingDB:Q3ZCJ8 ChEMBL:CHEMBL1075050
            NextBio:20812686 GO:GO:0031638 InterPro:IPR014882 Pfam:PF08773
            Uniprot:Q3ZCJ8
        Length = 463

 Score = 312 (114.9 bits), Expect = 7.7e-28, P = 7.7e-28
 Identities = 96/252 (38%), Positives = 134/252 (53%)

Query:   122 SAEFSYRDVKALPKSVDWRK-KGA--VTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLT 178
             +AE   + +  LP S DWR   G   VTPV+NQGSCGSC++F+++  +E   +I++ N  
Sbjct:   221 TAEIQ-KKILHLPTSWDWRNVHGINFVTPVRNQGSCGSCYSFASMGMMEARIRILTNNTQ 279

Query:   179 S--LSEQELIDCDTSFNNGCNGGLMDY--AFKYIVASGGLHKEEDYPYLMEEGTCEDKKE 234
             +  LS QE++ C + +  GC GG   Y  A KY     GL +E+ +PY   +  C  K+ 
Sbjct:   280 TPILSPQEVVSC-SQYAQGCEGGF-PYLIAGKY-AQDFGLVEEDCFPYTGTDSPCRLKEG 336

Query:   235 EMEVVTISGYQDVPE---NDEQSLLKA-LAHQ-PVSVAIEASGTDFQFYSGGVF--TG-- 285
                  + S Y  V        ++L+K  L HQ P++VA E    DF  Y  GV+  TG  
Sbjct:   337 CFRYYS-SEYHYVGGFYGGCNEALMKLELVHQGPMAVAFEVYD-DFLHYRKGVYHHTGLR 394

Query:   286 -PCGA-EL-DHGVAAVGYGK--SKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGI 340
              P    EL +H V  VGYG   + G DY IVKNSWG  WGE GY R++R T +    C I
Sbjct:   395 DPFNPFELTNHAVLLVGYGTDAASGLDYWIVKNSWGTSWGENGYFRIRRGTDE----CAI 450

Query:   341 NK--MASIPLKK 350
                 +A+ P+ K
Sbjct:   451 ESIALAATPIPK 462


>MGI|MGI:109553
            symbol:Ctsc "cathepsin C" species:10090 "Mus musculus"
            [GO:0001913 "T cell mediated cytotoxicity" evidence=IGI]
            [GO:0004252 "serine-type endopeptidase activity" evidence=IMP]
            [GO:0005764 "lysosome" evidence=ISO] [GO:0005783 "endoplasmic
            reticulum" evidence=ISO] [GO:0005794 "Golgi apparatus"
            evidence=ISO] [GO:0006508 "proteolysis" evidence=ISO;IMP]
            [GO:0008233 "peptidase activity" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=ISO] [GO:0010033
            "response to organic substance" evidence=ISO] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0031404 "chloride ion
            binding" evidence=ISO] [GO:0042802 "identical protein binding"
            evidence=ISO] [GO:0043621 "protein self-association" evidence=ISO]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 MGI:MGI:109553 GO:GO:0005783
            GO:GO:0005794 GO:GO:0007568 GO:GO:0010033 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0004252 GO:GO:0005764 GO:GO:0031404 CTD:1075
            HOGENOM:HOG000068022 HOVERGEN:HBG005248 KO:K01275 OrthoDB:EOG4H19VZ
            InterPro:IPR014882 Pfam:PF08773 MEROPS:C01.070 OMA:YDDFLHY
            GO:GO:0001913 EMBL:U89269 EMBL:U74683 EMBL:BC067063 IPI:IPI00130015
            RefSeq:NP_034112.3 UniGene:Mm.322945 ProteinModelPortal:P97821
            SMR:P97821 STRING:P97821 PhosphoSite:P97821 PaxDb:P97821
            PRIDE:P97821 Ensembl:ENSMUST00000032779 GeneID:13032 KEGG:mmu:13032
            InParanoid:P97821 BindingDB:P97821 ChEMBL:CHEMBL3454 ChiTaRS:CTSC
            NextBio:282904 Bgee:P97821 CleanEx:MM_CTSC Genevestigator:P97821
            Uniprot:P97821
        Length = 462

 Score = 309 (113.8 bits), Expect = 1.7e-27, P = 1.7e-27
 Identities = 87/253 (34%), Positives = 136/253 (53%)

Query:   121 PSAEFSYRDVKALPKSVDWRK-KGA--VTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNL 177
             P  +   + +  LP+S DWR  +G   V+PV+NQ SCGSC++F+++  +E   +I++ N 
Sbjct:   218 PMTDEIQQQILNLPESWDWRNVQGVNYVSPVRNQESCGSCYSFASMGMLEARIRILTNNS 277

Query:   178 TS--LSEQELIDCDTSFNNGCNGGLMDY--AFKYIVASGGLHKEEDYPYLMEEGTCEDKK 233
              +  LS QE++ C + +  GC+GG   Y  A KY     G+ +E  +PY  ++  C+ ++
Sbjct:   278 QTPILSPQEVVSC-SPYAQGCDGGF-PYLIAGKY-AQDFGVVEESCFPYTAKDSPCKPRE 334

Query:   234 EEMEVVTISGYQDVPE-----NDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVF--TG- 285
               +   + S Y  V       N+    L+ + H P++VA E    DF  Y  G++  TG 
Sbjct:   335 NCLRYYS-SDYYYVGGFYGGCNEALMKLELVKHGPMAVAFEVHD-DFLHYHSGIYHHTGL 392

Query:   286 --PCGA-EL-DHGVAAVGYGKSK--GSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCG 339
               P    EL +H V  VGYG+    G +Y I+KNSWG  WGE GY R++R T +    C 
Sbjct:   393 SDPFNPFELTNHAVLLVGYGRDPVTGIEYWIIKNSWGSNWGESGYFRIRRGTDE----CA 448

Query:   340 INKMA--SIPLKK 350
             I  +A  +IP+ K
Sbjct:   449 IESIAVAAIPIPK 461


>DICTYBASE|DDB_G0288221
            symbol:DDB_G0288221 species:44689 "Dictyostelium
            discoideum" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 SMART:SM00645
            dictyBase:DDB_G0288221 Pfam:PF00188 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 PANTHER:PTHR12411 EMBL:AAFI02000109 Gene3D:3.40.33.10
            InterPro:IPR001283 InterPro:IPR014044 SMART:SM00198 SUPFAM:SSF55797
            MEROPS:C01.A52 ProtClustDB:CLSZ2429919 RefSeq:XP_636852.1
            ProteinModelPortal:Q54J84 EnsemblProtists:DDB0187839 GeneID:8626520
            KEGG:ddi:DDB_G0288221 InParanoid:Q54J84 Uniprot:Q54J84
        Length = 395

 Score = 305 (112.4 bits), Expect = 3.5e-27, P = 3.5e-27
 Identities = 85/232 (36%), Positives = 119/232 (51%)

Query:   108 YLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVE 167
             + G+ P  PT   PSA  + +       SVDW      TPV++QG C SCW F ++AA+E
Sbjct:   163 FKGVLPYKPTSINPSASTTPKMPNFSSGSVDW--SDYQTPVRDQGECKSCWVFGSLAALE 220

Query:   168 G---INQIVSGNLT-SLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPY- 222
                 I   VS   T  LS Q  ++C TS   GC  G     F Y  +SG +  E+DYPY 
Sbjct:   221 SRYLIKNGVSEKSTLHLSAQNAMNCITS---GCESGWPANVFDYFESSG-IAFEKDYPYD 276

Query:   223 LMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGV 282
              +    C     + E    SGY  V EN + SL++ L + P+++A+  S T FQ Y+GG+
Sbjct:   277 AIGSDNCTSSSNKFEY---SGYDSV-ENTKDSLIQELKNGPITIALY-SDTAFQSYAGGI 331

Query:   283 FTGPCG-AELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGK 333
             +       +++H V  VGY K   +D   +KNS G KWGE GY R+  +  K
Sbjct:   332 YDSVEEYKDVNHIVLLVGYDKP--TDSWKIKNSLGTKWGELGYARITASNDK 381


>UNIPROTKB|J9NSE7
            symbol:CTSC "Dipeptidyl peptidase 1" species:9615 "Canis
            lupus familiaris" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00560000076599 InterPro:IPR014882 Pfam:PF08773
            EMBL:AAEX03017125 Ensembl:ENSCAFT00000014269 OMA:INGQICH
            Uniprot:J9NSE7
        Length = 458

 Score = 306 (112.8 bits), Expect = 3.6e-27, P = 3.6e-27
 Identities = 96/304 (31%), Positives = 146/304 (48%)

Query:    71 IFKENLKHIDQRNKEVTSYWLGLN--EFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYR 128
             ++K N + +   N  +   W      E+  ++  +   +  G K   P     +AE  + 
Sbjct:   165 LYKYNYEFVKAINT-IQKSWTATRYIEYETLTLRDMMRRAGGRKIPRPKPTPLTAEI-HE 222

Query:   129 DVKALPKSVDWRK-KGA--VTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTS--LSEQ 183
             ++  LP S DWR  +G   V+PV+NQ SCGSC+AF++   +E   +I++ N  +  LS Q
Sbjct:   223 EISRLPTSWDWRNVRGTNFVSPVRNQASCGSCYAFASTVMLEARIRILTNNTQTPILSPQ 282

Query:   184 ELIDCDTSFNNGCNGGLMDY--AFKYIVASGGLHKEEDYPYLMEEGTCEDKK----EEME 237
             E++ C + +  GC GG   Y  A KY     GL  E  + Y   +  C+          E
Sbjct:   283 EIVSC-SQYAQGCEGGF-PYLIAGKY-AQDFGLVDEACFSYAGSDSPCKPNDCFHYYSSE 339

Query:   238 VVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVF--TG---PCGA-EL 291
                + G+     N+    L+ + H P++VA E    DF  Y  G++  TG   P    EL
Sbjct:   340 YHYVGGFYGAC-NEALMKLELVRHGPMAVAFEVYD-DFFHYQKGIYYHTGLRDPINPFEL 397

Query:   292 -DHGVAAVGYG--KSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMA--SI 346
              +H V  VGYG   + G DY IVKNSWG +WGE GY ++ R T +    C I  +A  + 
Sbjct:   398 TNHAVLLVGYGTDSASGMDYWIVKNSWGSRWGEDGYFQICRGTDE----CAIESIAVAAT 453

Query:   347 PLKK 350
             P+ K
Sbjct:   454 PIPK 457


>DICTYBASE|DDB_G0276111
            symbol:DDB_G0276111 species:44689 "Dictyostelium
            discoideum" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 SMART:SM00645
            InterPro:IPR000169 dictyBase:DDB_G0276111 Pfam:PF00188
            eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508 PANTHER:PTHR12411
            PROSITE:PS00139 EMBL:AAFI02000014 Gene3D:3.40.33.10
            InterPro:IPR001283 InterPro:IPR014044 PRINTS:PR00837 SMART:SM00198
            SUPFAM:SSF55797 ProtClustDB:CLSZ2429919 RefSeq:XP_643261.1
            ProteinModelPortal:Q75JH0 EnsemblProtists:DDB0169514 GeneID:8620304
            KEGG:ddi:DDB_G0276111 InParanoid:Q75JH0 OMA:GFVTSIK Uniprot:Q75JH0
        Length = 415

 Score = 304 (112.1 bits), Expect = 4.5e-27, P = 4.5e-27
 Identities = 72/214 (33%), Positives = 110/214 (51%)

Query:   137 VDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTS----LSEQELIDCDTSF 192
             VDW+  G VT +KNQG CG C++F+T AA+E    ++  NL +    LSEQ  + C    
Sbjct:   213 VDWKSLGFVTSIKNQGQCGGCYSFATCAALESA-YLIKNNLPNTDIDLSEQNFVSC---V 268

Query:   193 NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDE 252
             N GC GG        + ++G ++ E  YPY    G+C +  +  +    +GY ++  N E
Sbjct:   269 NYGCGGGNGQSCLDKLKSTGIMY-ETSYPYKAVTGSCPNVIQSPQPFKWTGYSNIQGNKE 327

Query:   253 QSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELDHGVAAVGYGKSKGSDYIIVK 312
              + L AL   P+  ++    + FQ Y  G+++    +  +H +  VGY  +  S Y+I K
Sbjct:   328 -AFLNALKSGPIYASLYVD-SGFQLYKSGIYSCSQSSTPNHAITIVGYSSADNS-YLI-K 383

Query:   313 NSWGPKWGERGYIRMKRNTGKPEGLCGINKMASI 346
             NSWG  +GE GYIR+K      EG C +     I
Sbjct:   384 NSWGTIYGESGYIRLK------EGSCNLYSFTGI 411


>UNIPROTKB|F1NWG2
            symbol:CTSC "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0001913 "T cell mediated cytotoxicity" evidence=IEA]
            [GO:0004252 "serine-type endopeptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0004252
            GeneTree:ENSGT00560000076599 InterPro:IPR014882 Pfam:PF08773
            OMA:YDDFLHY GO:GO:0001913 EMBL:AADN02004805 IPI:IPI00577371
            Ensembl:ENSGALT00000027869 Uniprot:F1NWG2
        Length = 463

 Score = 305 (112.4 bits), Expect = 5.2e-27, P = 5.2e-27
 Identities = 86/246 (34%), Positives = 129/246 (52%)

Query:   128 RDVKALPKSVDWRK-KGA--VTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTS--LSE 182
             + V  LP+S DWR   G   V+PV+NQ SCGSC+AF+++  +E   +I++ N      S 
Sbjct:   226 KKVSGLPESWDWRNVNGVNYVSPVRNQASCGSCYAFASMGMLEARIRILTNNTQKPVFSP 285

Query:   183 QELIDCDTSFNNGCNGGLMDY--AFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVT 240
             Q+++ C + ++ GC+GG   Y  A KY V   G+ +E+ +PY  ++  C  K+      T
Sbjct:   286 QQVVSC-SQYSQGCDGGF-PYLIAGKY-VQDFGVVEEDCFPYTAKDTPCLFKRSCYHYYT 342

Query:   241 -----ISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVF--TGPCGA---- 289
                  + G+     N+    L+ +   P++VA E    DF FY  G++  TG        
Sbjct:   343 SEYHYVGGFYGAC-NEALMKLELVLSGPMAVAFEVYN-DFMFYKEGIYHHTGLKDEFNPF 400

Query:   290 EL-DHGVAAVGYGKS--KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMA-- 344
             EL +H V  VGYGK    G  + IVKNSWG  WGE GY R++R T +    C I  +A  
Sbjct:   401 ELTNHAVLLVGYGKDPESGEKFWIVKNSWGTSWGEDGYFRIRRGTDE----CAIESIAVA 456

Query:   345 SIPLKK 350
             + P+ K
Sbjct:   457 ATPIPK 462


>UNIPROTKB|P53634
            symbol:CTSC "Dipeptidyl peptidase 1" species:9606 "Homo
            sapiens" [GO:0001913 "T cell mediated cytotoxicity" evidence=IEA]
            [GO:0004252 "serine-type endopeptidase activity" evidence=IEA]
            [GO:0005783 "endoplasmic reticulum" evidence=IEA] [GO:0005794
            "Golgi apparatus" evidence=IEA] [GO:0007568 "aging" evidence=IEA]
            [GO:0010033 "response to organic substance" evidence=IEA]
            [GO:0031404 "chloride ion binding" evidence=IEA] [GO:0042802
            "identical protein binding" evidence=IEA] [GO:0043621 "protein
            self-association" evidence=IEA] [GO:0005764 "lysosome"
            evidence=TAS] [GO:0006508 "proteolysis" evidence=IDA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IDA] [GO:0006955
            "immune response" evidence=TAS] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0005783 GO:GO:0005794 Reactome:REACT_6900
            GO:GO:0006955 GO:GO:0007568 GO:GO:0010033 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0004252 GO:GO:0005764 GO:GO:0031404 CTD:1075
            HOVERGEN:HBG005248 KO:K01275 OrthoDB:EOG4H19VZ InterPro:IPR014882
            Pfam:PF08773 MEROPS:C01.070 EMBL:X87212 EMBL:U79415 EMBL:AF234263
            EMBL:AF234264 EMBL:AF254757 EMBL:AF525032 EMBL:AF525033
            EMBL:AK292117 EMBL:AK311923 EMBL:AK223038 EMBL:BX537913
            EMBL:AC011088 EMBL:CH471185 EMBL:BC054028 EMBL:BC100891
            EMBL:BC100892 EMBL:BC100893 EMBL:BC100894 EMBL:BC109386
            EMBL:BC110071 EMBL:BC113850 EMBL:BC113897 IPI:IPI00022810
            IPI:IPI00171323 IPI:IPI00872258 PIR:S23941 PIR:S66504
            RefSeq:NP_001107645.1 RefSeq:NP_001805.3 RefSeq:NP_680475.1
            UniGene:Hs.128065 PDB:1K3B PDB:2DJF PDB:2DJG PDB:3PDF PDBsum:1K3B
            PDBsum:2DJF PDBsum:2DJG PDBsum:3PDF ProteinModelPortal:P53634
            SMR:P53634 IntAct:P53634 MINT:MINT-4655964 STRING:P53634
            PhosphoSite:P53634 DMDM:1705632 PaxDb:P53634 PRIDE:P53634
            DNASU:1075 Ensembl:ENST00000227266 Ensembl:ENST00000524463
            Ensembl:ENST00000529974 GeneID:1075 KEGG:hsa:1075 UCSC:uc001pck.4
            UCSC:uc001pcm.4 GeneCards:GC11M088026 HGNC:HGNC:2528 HPA:CAB025364
            MIM:170650 MIM:245000 MIM:245010 MIM:602365 neXtProt:NX_P53634
            Orphanet:2342 Orphanet:678 PharmGKB:PA27028 HOGENOM:HOG000127503
            InParanoid:P53634 OMA:YDDFLHY PhylomeDB:P53634
            BioCyc:MetaCyc:HS03265-MONOMER SABIO-RK:P53634 BindingDB:P53634
            ChEMBL:CHEMBL2252 EvolutionaryTrace:P53634 GenomeRNAi:1075
            NextBio:4488 PMAP-CutDB:P53634 ArrayExpress:P53634 Bgee:P53634
            Genevestigator:P53634 GermOnline:ENSG00000109861 GO:GO:0001913
            Uniprot:P53634
        Length = 463

 Score = 302 (111.4 bits), Expect = 1.2e-26, P = 1.2e-26
 Identities = 90/252 (35%), Positives = 132/252 (52%)

Query:   122 SAEFSYRDVKALPKSVDWRK-KGA--VTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLT 178
             +AE   + +  LP S DWR   G   V+PV+NQ SCGSC++F+++  +E   +I++ N  
Sbjct:   221 TAEIQQK-ILHLPTSWDWRNVHGINFVSPVRNQASCGSCYSFASMGMLEARIRILTNNSQ 279

Query:   179 S--LSEQELIDCDTSFNNGCNGGLMDY--AFKYIVASGGLHKEEDYPYLMEEGTCEDKKE 234
             +  LS QE++ C + +  GC GG   Y  A KY     GL +E  +PY   +  C+ K++
Sbjct:   280 TPILSPQEVVSC-SQYAQGCEGGF-PYLIAGKY-AQDFGLVEEACFPYTGTDSPCKMKED 336

Query:   235 -----EMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVF--TG-- 285
                    E   + G+     N+    L+ + H P++VA E    DF  Y  G++  TG  
Sbjct:   337 CFRYYSSEYHYVGGFYGGC-NEALMKLELVHHGPMAVAFEVYD-DFLHYKKGIYHHTGLR 394

Query:   286 -PCGA-EL-DHGVAAVGYG--KSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGI 340
              P    EL +H V  VGYG   + G DY IVKNSWG  WGE GY R++R T +    C I
Sbjct:   395 DPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTGWGENGYFRIRRGTDE----CAI 450

Query:   341 NKMA--SIPLKK 350
               +A  + P+ K
Sbjct:   451 ESIAVAATPIPK 462


>UNIPROTKB|F1STR1
            symbol:CTSC "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0004252
            "serine-type endopeptidase activity" evidence=IEA] [GO:0001913 "T
            cell mediated cytotoxicity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0004252
            GeneTree:ENSGT00560000076599 KO:K01275 InterPro:IPR014882
            Pfam:PF08773 OMA:YDDFLHY GO:GO:0001913 EMBL:CU855751
            RefSeq:XP_003129789.1 UniGene:Ssc.6155 Ensembl:ENSSSCT00000016280
            GeneID:100522387 KEGG:ssc:100522387 Uniprot:F1STR1
        Length = 463

 Score = 301 (111.0 bits), Expect = 1.5e-26, P = 1.5e-26
 Identities = 91/252 (36%), Positives = 131/252 (51%)

Query:   122 SAEFSYRDVKALPKSVDWRK-KGA--VTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLT 178
             +AE   + +  LP S DWR  +G   VTPV+NQ SCGSC++F+++  +E   +I++ N  
Sbjct:   221 TAEIQEKSLH-LPASWDWRNVRGTNFVTPVRNQASCGSCYSFASMGMMEARIRILTNNTQ 279

Query:   179 S--LSEQELIDCDTSFNNGCNGGLMDY--AFKYIVASGGLHKEEDYPYLMEEGTCEDKKE 234
             +  LS QE++ C + +  GC GG   Y  A KY     GL +E  +PY   +  C  K+ 
Sbjct:   280 TPILSPQEVVSC-SQYAQGCAGGF-PYLIAGKY-AQDFGLVEEACFPYTGTDSPCTVKEG 336

Query:   235 -----EMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVF--TG-- 285
                    E   + G+     N+    L+ + H P++VA E    DF  Y  G++  TG  
Sbjct:   337 CFRYYSSEYHYVGGFYGGC-NEALMKLELVHHGPMAVAFEVYD-DFLHYRKGIYHHTGLR 394

Query:   286 -PCGA-EL-DHGVAAVGYGK--SKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGI 340
              P    EL +H V  VGYG   + G DY IVKNSWG  WGE GY R++R T +    C I
Sbjct:   395 DPFNPFELTNHAVLLVGYGTDLASGMDYWIVKNSWGTSWGEDGYFRIRRGTDE----CAI 450

Query:   341 NKMA--SIPLKK 350
               +A  + P+ K
Sbjct:   451 ESIAVAATPIPK 462


>ZFIN|ZDB-GENE-030619-9
            symbol:ctsc "cathepsin C" species:7955 "Danio rerio"
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008233 "peptidase
            activity" evidence=IEA] [GO:0016787 "hydrolase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            ZFIN:ZDB-GENE-030619-9 eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 CTD:1075
            HOGENOM:HOG000068022 HOVERGEN:HBG005248 KO:K01275 OrthoDB:EOG4H19VZ
            InterPro:IPR014882 Pfam:PF08773 MEROPS:C01.070 HSSP:P43235
            EMBL:BC064286 IPI:IPI00486570 RefSeq:NP_999887.1 UniGene:Dr.32463
            ProteinModelPortal:Q6P2V1 SMR:Q6P2V1 PRIDE:Q6P2V1 GeneID:368704
            KEGG:dre:368704 InParanoid:Q6P2V1 NextBio:20813127
            ArrayExpress:Q6P2V1 Bgee:Q6P2V1 Uniprot:Q6P2V1
        Length = 455

 Score = 287 (106.1 bits), Expect = 5.6e-25, P = 5.6e-25
 Identities = 89/302 (29%), Positives = 147/302 (48%)

Query:    71 IFKENLKHIDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQP-SAEFSYRD 129
             +F + +  + Q++   T+Y    +E   + HE  +    G   + P R +P +     + 
Sbjct:   166 MFVDEINSV-QKSWTATAY--SFHETLSI-HEMLRRSG-GPASRIPRRVRPVTVAADSKA 220

Query:   130 VKALPKSVDWRK-KGA--VTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTS--LSEQE 184
                LP+  DWR   G   V+PV+NQ  CGSC++F+T+  +E   +I + N      S Q+
Sbjct:   221 ASGLPQHWDWRNVNGVNFVSPVRNQAQCGSCYSFATMGMLEARVRIQTNNTQQPVFSPQQ 280

Query:   185 LIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCE-----DKKEEMEVV 239
             ++ C + ++ GC+GG      KYI    G+ +E+ +PY   +  C       K    +  
Sbjct:   281 VVSC-SQYSQGCDGGFPYLIGKYI-QDFGIVEEDCFPYTGSDSPCNLPAKCTKYYASDYH 338

Query:   240 TISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVF--TGPCGA----EL-D 292
              + G+     ++   +L+ + + P+ VA+E    DF  Y  G++  TG   A    EL +
Sbjct:   339 YVGGFYGGC-SESAMMLELVKNGPMGVALEVY-PDFMNYKEGIYHHTGLRDANNPFELTN 396

Query:   293 HGVAAVGYGK--SKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMA--SIPL 348
             H V  VGYG+    G  Y IVKNSWG  WGE G+ R++R T +    C I  +A  + P+
Sbjct:   397 HAVLLVGYGQCHKTGEKYWIVKNSWGSGWGENGFFRIRRGTDE----CAIESIAVAATPI 452

Query:   349 KK 350
              K
Sbjct:   453 PK 454


>FB|FBgn0033873
            symbol:CG6337 species:7227 "Drosophila melanogaster"
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=ISS]
            [GO:0006508 "proteolysis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 SMART:SM00645 EMBL:AE013599
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 HSSP:P80067 EMBL:AY084123
            RefSeq:NP_610905.1 UniGene:Dm.5230 SMR:Q7JYA0 IntAct:Q7JYA0
            EnsemblMetazoa:FBtr0087646 GeneID:36530 KEGG:dme:Dmel_CG6337
            UCSC:CG6337-RA FlyBase:FBgn0033873 eggNOG:NOG310593
            InParanoid:Q7JYA0 OMA:NRTTYRE OrthoDB:EOG4MCVFZ GenomeRNAi:36530
            NextBio:799041 Uniprot:Q7JYA0
        Length = 340

 Score = 257 (95.5 bits), Expect = 4.3e-22, P = 4.3e-22
 Identities = 87/307 (28%), Positives = 140/307 (45%)

Query:    48 FESWMSKHGKTYKCIEEK--LHRFEIFKEN--LKHIDQRNKEVTSYWLGLNEFADMSHEE 103
             F+++     KTY     +   + + I+  N   +H  Q ++  T+Y   +N+F+D+   +
Sbjct:    28 FQTYEDNFNKTYASTSARNFANYYFIYNRNQVAQHNAQADRNRTTYREAVNQFSDIRLIQ 87

Query:   104 FKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKK-GAVTPVKNQG-SCGSCWAFS 161
             F      L P+       SA       +A   S D     G    V++QG +C S WA++
Sbjct:    88 FA----ALLPK-AVNTVTSAASDPPASQAASASFDIITDFGLTVAVEDQGVNCSSSWAYA 142

Query:   162 TVAAVEGINQIVSGNL--TSLSEQELIDCDTSFNNGCNGGLMDYAFKYI--VASGGLHKE 217
             T  AVE +N + + N   +SLS Q+L+DC      GC+      A  Y+  +    L+ E
Sbjct:   143 TAKAVEIMNAVQTANPLPSSLSAQQLLDC-AGMGTGCSTQTPLAALNYLTQLTDAYLYPE 201

Query:   218 EDYPY---LMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQ-PVSVAIEASGT 273
              DYP    L   G C+        V ++GY  V +ND+ ++++ +++  PV V    +  
Sbjct:   202 VDYPNNNSLKTPGMCQPPSSVSVGVKLAGYSTVADNDDAAVMRYVSNGFPVIVEYNPATF 261

Query:   274 DFQFYSGGVFTGPCGA----ELDHGVAAVGYGKSKGS--DYIIVKNSWGPKWGERGYIRM 327
              F  YS GV+     A    +    +  VGY     S  DY    NS+G  WGE GYIR+
Sbjct:   262 GFMQYSSGVYVQETRALTNPKSSQFLVVVGYDHDVDSNLDYWRCLNSFGDTWGEEGYIRI 321

Query:   328 KRNTGKP 334
              R + +P
Sbjct:   322 VRRSNQP 328


>UNIPROTKB|F1RWA9
            symbol:CTSO "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00639 GeneTree:ENSGT00560000076599 EMBL:CU855637
            Ensembl:ENSSSCT00000009707 OMA:WAFSIVG Uniprot:F1RWA9
        Length = 194

 Score = 255 (94.8 bits), Expect = 7.0e-22, P = 7.0e-22
 Identities = 66/192 (34%), Positives = 95/192 (49%)

Query:   154 CGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFNN-GCNGGLMDYAFKYIVASG 212
             CG CWAFS V+AVE    I    L  LS Q++IDC  S+NN GCNGG    A  ++  + 
Sbjct:     2 CGGCWAFSVVSAVESAYAIKGQPLEVLSVQQVIDC--SYNNYGCNGGSTLNALYWLNKTQ 59

Query:   213 -GLHKEEDYPYLMEEGTCEDKKEEMEVVTISGYQ--DVPENDEQSLLKALAHQPVSVAIE 269
               +  + +YP+  + G C         V+I  Y   D    +++     L   P+ V ++
Sbjct:    60 VKVVSDSEYPFKAQNGLCHYFSCSHSGVSIKDYSAYDFSGQEDEMAKTLLTLGPLIVIVD 119

Query:   270 ASGTDFQFYSGGVFTGPCGA-ELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMK 328
             A    +Q Y GG+    C + E +H V   G+ K+  + Y IV+NSWG  WG  GY  +K
Sbjct:   120 A--VSWQDYLGGIIQHHCSSGEANHAVLVTGFDKTGSTPYWIVRNSWGSAWGIDGYALVK 177

Query:   329 RNTGKPEGLCGI 340
                G    +CGI
Sbjct:   178 MG-GN---ICGI 185


>DICTYBASE|DDB_G0286015
            symbol:gmsA species:44689 "Dictyostelium discoideum"
            [GO:0005615 "extracellular space" evidence=IDA] [GO:0019953 "sexual
            reproduction" evidence=IEP] [GO:0009897 "external side of plasma
            membrane" evidence=ISS] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA;ISS] [GO:0000747 "conjugation with cellular
            fusion" evidence=IMP] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0016787 "hydrolase activity" evidence=IEA] [GO:0008233
            "peptidase activity" evidence=IEA] [GO:0005576 "extracellular
            region" evidence=IEA] InterPro:IPR000668 InterPro:IPR013128
            Pfam:PF00112 PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            dictyBase:DDB_G0286015 Pfam:PF00188 GO:GO:0005615
            GenomeReviews:CM000153_GR GO:GO:0009897 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 EMBL:AAFI02000085 GO:GO:0000747
            Gene3D:3.40.33.10 InterPro:IPR001283 InterPro:IPR014044
            SMART:SM00198 SUPFAM:SSF55797 HSSP:P07688 RefSeq:XP_637893.1
            ProteinModelPortal:Q54ME1 MEROPS:C01.A52 EnsemblProtists:DDB0191145
            GeneID:8625403 KEGG:ddi:DDB_G0286015 InParanoid:Q54ME1 OMA:PGIAYEK
            ProtClustDB:CLSZ2429919 Uniprot:Q54ME1
        Length = 448

 Score = 256 (95.2 bits), Expect = 1.6e-21, P = 1.6e-21
 Identities = 69/217 (31%), Positives = 102/217 (47%)

Query:   116 PTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSG 175
             PT  +P+             +VDW      TP+++QG CGSCWAF++ AA+E    I  G
Sbjct:   223 PTTPKPTTPAPTTPAPTSTLTVDWTSYQ--TPIRDQGQCGSCWAFASSAALESRYLIKYG 280

Query:   176 ----NLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCED 231
                 +   LS Q  ++C  S   GCNGG     F +   + G+  E+D PY    GT   
Sbjct:   281 TAQKSTLQLSNQNAVNCIAS---GCNGGWSGNYFNFF-KTPGIAYEKDDPYKAVTGTSCI 336

Query:   232 KKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCG-AE 290
                 +     + Y    E  + +LL  L   PV++A+      FQ Y  G++        
Sbjct:   337 TTSSVARFKYTNY-GYTEKTKAALLAELKKGPVTIAVYVDSA-FQNYKSGIYNSATKYTG 394

Query:   291 LDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRM 327
             ++H V  VGY ++  +D   +KNSWG  WGE GY+R+
Sbjct:   395 INHLVLLVGYDQA--TDAYKIKNSWGSWWGESGYMRI 429


>DICTYBASE|DDB_G0288563
            symbol:DDB_G0288563 species:44689 "Dictyostelium
            discoideum" [GO:0005615 "extracellular space" evidence=IDA]
            [GO:0050790 "regulation of catalytic activity" evidence=IEA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0004197 "cysteine-type
            endopeptidase activity" evidence=IEA] [GO:0016787 "hydrolase
            activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0044351 "macropinocytosis" evidence=RCA]
            InterPro:IPR000668 InterPro:IPR012599 InterPro:IPR013128
            InterPro:IPR015643 Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 dictyBase:DDB_G0288563
            GO:GO:0005615 eggNOG:COG4870 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0050790 GO:GO:0004197
            EMBL:AAFI02000117 PANTHER:PTHR12411:SF16 RefSeq:XP_636643.1
            MEROPS:C01.A58 PRIDE:Q54IS1 EnsemblProtists:DDB0187993
            GeneID:8626689 KEGG:ddi:DDB_G0288563 InParanoid:Q54IS1 OMA:AWEYMEL
            Uniprot:Q54IS1
        Length = 314

 Score = 250 (93.1 bits), Expect = 2.4e-21, P = 2.4e-21
 Identities = 73/223 (32%), Positives = 106/223 (47%)

Query:   132 ALPKSVDWRKK--GAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTS---LSEQELI 186
             ++P S D R +    + P+ NQ  CGSCWAFS+   +     I S N T+   LS Q L+
Sbjct:    87 SIPTSFDSRVQWPDCIHPILNQEQCGSCWAFSSSEVLSDRLCIASNNKTNPGALSPQTLV 146

Query:   187 DCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGT-------CEDKKE----- 234
              CD   N+GC+GG+   A++Y+   G L  +   PY    GT       C D ++     
Sbjct:   147 ACDVYGNDGCSGGIPQLAWEYMELKG-LPTDSCVPYTAGNGTVYSCQRSCSDSEDYSLYR 205

Query:   235 --EMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAEL- 291
                  + T S  Q + EN        LA+ P+   +E    DF  YS GV+    G+ L 
Sbjct:   206 AKPFTLKTCSSVQCIQEN-------ILAYGPIVGTMEVY-EDFMSYSSGVYVMTPGSSLL 257

Query:   292 -DHGVAAVGYGKSKGS--DYIIVKNSWGPKWGERGYIRMKRNT 331
               H +  VG+G  + S  +Y IV NSWG  WG++G+  +   T
Sbjct:   258 GGHAIKIVGWGFDQTSQLNYWIVANSWGADWGQQGFFFISMET 300


>WB|WBGene00044760
            symbol:Y71H2AM.25 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0004181
            "metallocarboxypeptidase activity" evidence=IEA] [GO:0008270 "zinc
            ion binding" evidence=IEA] [GO:0016021 "integral to membrane"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            SMART:SM00645 GO:GO:0008234 GO:GO:0006508 PANTHER:PTHR12411
            GeneTree:ENSGT00560000076599 EMBL:FO081822 eggNOG:NOG331187
            HOGENOM:HOG000114005 RefSeq:NP_001040887.1
            ProteinModelPortal:Q2AAB9 SMR:Q2AAB9 EnsemblMetazoa:Y71H2AM.25
            GeneID:4363054 KEGG:cel:CELE_Y71H2AM.25 UCSC:Y71H2AM.25 CTD:4363054
            WormBase:Y71H2AM.25 InParanoid:Q2AAB9 NextBio:959635 Uniprot:Q2AAB9
        Length = 299

 Score = 250 (93.1 bits), Expect = 2.4e-21, P = 2.4e-21
 Identities = 65/224 (29%), Positives = 106/224 (47%)

Query:   130 VKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGIN-QIVSGNLTSLSEQELIDC 188
             ++   + +DWR KG V PVK+QG C +  AF+  +++E +  +  +G+L S SEQ+LIDC
Sbjct:    79 IQTTEEFLDWRDKGIVGPVKDQGKCNASHAFAISSSIESMYAKATNGSLLSFSEQQLIDC 138

Query:   189 DTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYL-MEEGTCEDKKEEMEVVTISGYQDV 247
             D     GC       A  Y +  G +  E DYPY   E G C     + ++  +   + V
Sbjct:   139 DDHGFKGCEEQPAINAVSYFIFHG-IETEADYPYAGKENGKCTFDSTKSKI-QLKDAEFV 196

Query:   248 PENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGP---CGAELD-HGVAAVGYGKS 303
               N+ Q       + P    + A  + +  Y  G++      C +  +   +  VGYG  
Sbjct:   197 VSNETQGKELVTNYGPAFFTMRAPPSLYD-YKIGIYNPSIEECTSTHEIRSMVIVGYGIE 255

Query:   304 KGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMASIP 347
                 Y IVK S+G  WGE+GY+++ R+       C +    ++P
Sbjct:   256 GVQKYWIVKGSFGTSWGEQGYMKLARDVNA----CAMADFITVP 295


>UNIPROTKB|E2QV47
            symbol:CTSH "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0097208 "alveolar lamellar body"
            evidence=IEA] [GO:0097067 "cellular response to thyroid hormone
            stimulus" evidence=IEA] [GO:0070371 "ERK1 and ERK2 cascade"
            evidence=IEA] [GO:0070324 "thyroid hormone binding" evidence=IEA]
            [GO:0060448 "dichotomous subdivision of terminal units involved in
            lung branching" evidence=IEA] [GO:0045766 "positive regulation of
            angiogenesis" evidence=IEA] [GO:0043129 "surfactant homeostasis"
            evidence=IEA] [GO:0043066 "negative regulation of apoptotic
            process" evidence=IEA] [GO:0033619 "membrane protein proteolysis"
            evidence=IEA] [GO:0032526 "response to retinoic acid" evidence=IEA]
            [GO:0031648 "protein destabilization" evidence=IEA] [GO:0031638
            "zymogen activation" evidence=IEA] [GO:0030108 "HLA-A specific
            activating MHC class I receptor activity" evidence=IEA] [GO:0016505
            "apoptotic protease activator activity" evidence=IEA] [GO:0010815
            "bradykinin catabolic process" evidence=IEA] [GO:0010813
            "neuropeptide catabolic process" evidence=IEA] [GO:0010634
            "positive regulation of epithelial cell migration" evidence=IEA]
            [GO:0010628 "positive regulation of gene expression" evidence=IEA]
            [GO:0008284 "positive regulation of cell proliferation"
            evidence=IEA] [GO:0005829 "cytosol" evidence=IEA] [GO:0005764
            "lysosome" evidence=IEA] [GO:0005615 "extracellular space"
            evidence=IEA] [GO:0004252 "serine-type endopeptidase activity"
            evidence=IEA] [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=IEA] [GO:0004177 "aminopeptidase activity" evidence=IEA]
            [GO:0002764 "immune response-regulating signaling pathway"
            evidence=IEA] [GO:0001913 "T cell mediated cytotoxicity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            SMART:SM00645 GO:GO:0005829 GO:GO:0043066 GO:GO:0005615
            GO:GO:0008284 GO:GO:0070371 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00639 GO:GO:0045766
            GO:GO:0004177 GO:GO:0004252 GO:GO:0005764 GO:GO:0032526
            GO:GO:0010628 GO:GO:0070324 GO:GO:0016505 GO:GO:0010634
            GO:GO:0004197 GO:GO:0042599 GO:GO:0031648 GO:GO:0097067
            GO:GO:0031638 GO:GO:0001913 GO:GO:0030108 GO:GO:0010815
            GO:GO:0060448 GO:GO:0002764 GO:GO:0033619 GO:GO:0010813
            GO:GO:0043129 Ensembl:ENSCAFT00000036196 Uniprot:E2QV47
        Length = 136

 Score = 242 (90.2 bits), Expect = 1.7e-20, P = 1.7e-20
 Identities = 52/137 (37%), Positives = 82/137 (59%)

Query:   217 EEDYPYLMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALA-HQPVSVAIEASGTDF 275
             E+ YPY  ++G C+ +  +  +  +    ++  NDEQ++++A+A + PVS A E + +DF
Sbjct:     3 EDSYPYKGQDGDCKYQPSKA-IAFVKDVANITINDEQAMVEAVALYNPVSFAFEVT-SDF 60

Query:   276 QFYSGGVFTGP-CGA---ELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNT 331
               Y  G+++   C     +++H V AVGYG+  G  Y IVKNSWGP+WG  GY  M+R  
Sbjct:    61 MMYRKGIYSSTSCHKTPDKVNHAVLAVGYGEQNGIPYWIVKNSWGPQWGMNGYFLMER-- 118

Query:   332 GKPEGLCGINKMASIPL 348
             GK   +CG+   AS P+
Sbjct:   119 GK--NMCGLAACASYPI 133


>WB|WBGene00022189
            symbol:Y71H2AR.2 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0004181
            "metallocarboxypeptidase activity" evidence=IEA] [GO:0008270 "zinc
            ion binding" evidence=IEA] [GO:0016021 "integral to membrane"
            evidence=IEA] [GO:0008340 "determination of adult lifespan"
            evidence=IMP] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            SMART:SM00645 GO:GO:0008340 GO:GO:0008234 GO:GO:0006508
            PANTHER:PTHR12411 GeneTree:ENSGT00560000076599 HSSP:P07711
            eggNOG:NOG331187 HOGENOM:HOG000114005 EMBL:FO081570
            RefSeq:NP_497627.1 UniGene:Cel.28419 ProteinModelPortal:Q9BL26
            SMR:Q9BL26 EnsemblMetazoa:Y71H2AR.2 GeneID:190615
            KEGG:cel:CELE_Y71H2AR.2 UCSC:Y71H2AR.2 CTD:190615
            WormBase:Y71H2AR.2 InParanoid:Q9BL26 OMA:CAMATTI NextBio:946382
            Uniprot:Q9BL26
        Length = 345

 Score = 239 (89.2 bits), Expect = 4.4e-20, P = 4.4e-20
 Identities = 67/222 (30%), Positives = 108/222 (48%)

Query:   116 PTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGIN-QIVS 174
             PTR Q      + D +   + +DWR+KG V PVK+QG C +  AF+  +++E +  +  +
Sbjct:    67 PTRFQWETPI-HMD-RTTEEFLDWREKGIVGPVKDQGKCNASHAFAITSSIESMYAKATN 124

Query:   175 GNLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGT-CEDKK 233
             G L S SEQ+LIDC+     GC       A  Y+ A+ G+  E DYPY+ +    C    
Sbjct:   125 GTLLSFSEQQLIDCNDQGYKGCEEQFAMNAIGYL-ATHGIETEADYPYVDKTNEKCTFDS 183

Query:   234 EEMEVVTISGYQDVPENDEQ-SLLKALAHQPVSVAIEASGTDFQFYSGGVFTGP---CGA 289
              + ++    G   V E +E    +    + P    + A  + +  Y  G++      C +
Sbjct:   184 TKSKIHLKKGV--VAEGNEVLGKVYVTNYGPAFFTMRAPPSLYD-YKIGIYNPSIEECTS 240

Query:   290 ELD-HGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRN 330
               +   +  VGYG      Y IVK S+G  WGE+GY+++ R+
Sbjct:   241 THEIRSMVIVGYGIEGEQKYWIVKGSFGTSWGEQGYMKLARD 282


>WB|WBGene00010204
            symbol:F57F5.1 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0009792 "embryo development
            ending in birth or egg hatching" evidence=IMP] [GO:0040007 "growth"
            evidence=IMP] [GO:0002119 "nematode larval development"
            evidence=IMP] [GO:0040011 "locomotion" evidence=IMP] [GO:0006898
            "receptor-mediated endocytosis" evidence=IMP] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0009792 GO:GO:0006898
            GO:GO:0040007 GO:GO:0002119 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0040011
            EMBL:Z75953 GeneTree:ENSGT00560000076599 HOGENOM:HOG000241341
            PANTHER:PTHR12411:SF16 RefSeq:NP_506011.2 ProteinModelPortal:Q20950
            SMR:Q20950 DIP:DIP-24447N IntAct:Q20950 MINT:MINT-211137
            STRING:Q20950 MEROPS:C01.A42 EnsemblMetazoa:F57F5.1 GeneID:179645
            KEGG:cel:CELE_F57F5.1 UCSC:F57F5.1 CTD:179645 WormBase:F57F5.1
            OMA:ADDINAC Uniprot:Q20950
        Length = 351

 Score = 158 (60.7 bits), Expect = 2.0e-19, Sum P(2) = 2.0e-19
 Identities = 34/83 (40%), Positives = 43/83 (51%)

Query:   259 LAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELD-HGVAAVGYGKSKGSDYIIVKNSWGP 317
             + H PV VA      DF+ YSGGV+    GA L  H V  +G+G   G+ Y +  NSW  
Sbjct:   263 MTHGPVEVAFTVY-EDFEHYSGGVYVHTAGASLGGHAVKMLGWGVDNGTPYWLCANSWNE 321

Query:   318 KWGERGYIRMKRNTGKPEGLCGI 340
              WGE GY R+ R   +    CGI
Sbjct:   322 DWGENGYFRIIRGVNE----CGI 340

 Score = 138 (53.6 bits), Expect = 2.0e-19, Sum P(2) = 2.0e-19
 Identities = 45/162 (27%), Positives = 76/162 (46%)

Query:    79 IDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKP-QFPTRRQPSAEFSYRDVK--ALPK 135
             +D  NK  TS+   L  +     +  K + +G K  + P   +   E ++ +V+  A+P 
Sbjct:    41 VDYVNKVQTSFKAELGSYFSSYPDTIKKQLMGAKMVEIPEEYRVF-EMTHPEVEDAAVPD 99

Query:   136 SVD----WRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLT--SLSEQEL-IDC 188
             S D    W    +++ +++Q SCGSCWA S    +     I S   T  S+S  ++   C
Sbjct:   100 SFDSRTAWPNCPSISKIRDQSSCGSCWAVSAAETISDRICIASNAKTILSISADDINACC 159

Query:   189 DTSFNNGCNGGLMDYAFKYIV----ASGGLHKEED----YPY 222
                  NGCNGG    A+++ V     +GG ++++     YPY
Sbjct:   160 GMVCGNGCNGGYPIEAWRHYVKKGYVTGGSYQDKTGCKPYPY 201


>TAIR|locus:2133402
            symbol:AT4G01610 species:3702 "Arabidopsis thaliana"
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=IEA]
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] [GO:0050790 "regulation of
            catalytic activity" evidence=IEA] [GO:0005773 "vacuole"
            evidence=IDA] [GO:0005829 "cytosol" evidence=IDA] [GO:0000902 "cell
            morphogenesis" evidence=RCA] [GO:0006635 "fatty acid
            beta-oxidation" evidence=RCA] [GO:0010162 "seed dormancy process"
            evidence=RCA] [GO:0016049 "cell growth" evidence=RCA] [GO:0048193
            "Golgi vesicle transport" evidence=RCA] InterPro:IPR000668
            InterPro:IPR012599 InterPro:IPR013128 InterPro:IPR015643
            Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0005829 GO:GO:0005773 EMBL:CP002687
            GO:GO:0006508 InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0050790 GO:GO:0004197 eggNOG:NOG315657
            HOGENOM:HOG000241341 KO:K01363 PANTHER:PTHR12411:SF16 OMA:DAIPDHF
            HSSP:P07858 ProtClustDB:CLSN2687619 EMBL:AF370193 EMBL:AY065167
            EMBL:AY114015 EMBL:AY086034 EMBL:AF083797 EMBL:BT001190
            EMBL:AK175280 EMBL:AK175481 EMBL:AK175539 EMBL:AK176165
            EMBL:AK176244 EMBL:AK176281 EMBL:AK176330 EMBL:AK176416
            EMBL:AK176433 EMBL:AK176487 EMBL:AK221398 EMBL:AK230235
            IPI:IPI00530811 RefSeq:NP_567215.1 UniGene:At.24471
            ProteinModelPortal:Q94K85 SMR:Q94K85 STRING:Q94K85 MEROPS:C01.144
            PaxDb:Q94K85 PRIDE:Q94K85 EnsemblPlants:AT4G01610.1 GeneID:826792
            KEGG:ath:AT4G01610 TAIR:At4g01610 InParanoid:Q94K85
            PhylomeDB:Q94K85 Genevestigator:Q94K85 Uniprot:Q94K85
        Length = 359

 Score = 152 (58.6 bits), Expect = 2.2e-19, Sum P(2) = 2.2e-19
 Identities = 57/199 (28%), Positives = 96/199 (48%)

Query:    38 LTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYW-LGLNE- 95
             L S+  L+ L  ++  K  +     ++KL   +I ++ +  + + N+   + W   +N+ 
Sbjct:    10 LASVFLLLGLLLAFDLKGIEAESLTKQKLDS-KILQDEI--VKKVNENPNAGWKAAINDR 66

Query:    96 FADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKA-LPKSVD----WRKKGAVTPVKN 150
             F++ +  EFK + LG+KP  P +          D    LPK+ D    W +  ++  + +
Sbjct:    67 FSNATVAEFK-RLLGVKPT-PKKHFLGVPIVSHDPSLKLPKAFDARTAWPQCTSIGNILD 124

Query:   151 QGSCGSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCDTSFN--NGCNGGLMDYAFKYI 208
             QG CGSCWAF  V ++     I  G   SLS  +L+ C   F   +GC+GG    A++Y 
Sbjct:   125 QGHCGSCWAFGAVESLSDRFCIQFGMNISLSVNDLLAC-CGFRCGDGCDGGYPIAAWQYF 183

Query:   209 VASGGLHKEEDYPYLMEEG 227
               SG + +E D PY    G
Sbjct:   184 SYSGVVTEECD-PYFDNTG 201

 Score = 145 (56.1 bits), Expect = 2.2e-19, Sum P(2) = 2.2e-19
 Identities = 42/141 (29%), Positives = 64/141 (45%)

Query:   217 EEDYPYLMEEGTC-EDKK--EEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGT 273
             E  YP       C  D K   E +  ++S Y  V  N +  + +   + PV V+      
Sbjct:   208 EPAYPTPKCSRKCVSDNKLWSESKHYSVSTYT-VKSNPQDIMAEVYKNGPVEVSFTVY-E 265

Query:   274 DFQFYSGGVFTGPCGAELD-HGVAAVGYG-KSKGSDYIIVKNSWGPKWGERGYIRMKRNT 331
             DF  Y  GV+    G+ +  H V  +G+G  S+G DY ++ N W   WG+ GY  ++R T
Sbjct:   266 DFAHYKSGVYKHITGSNIGGHAVKLIGWGTSSEGEDYWLMANQWNRGWGDDGYFMIRRGT 325

Query:   332 GKPEGLCGINK--MASIPLKK 350
              +    CGI    +A +P  K
Sbjct:   326 NE----CGIEDEPVAGLPSSK 342


>UNIPROTKB|Q9UBR2
            symbol:CTSZ "Cathepsin Z" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0060441 "epithelial tube
            branching involved in lung morphogenesis" evidence=IEA] [GO:0005764
            "lysosome" evidence=IEA] [GO:0005783 "endoplasmic reticulum"
            evidence=IDA] [GO:0005615 "extracellular space" evidence=IDA]
            Reactome:REACT_11123 Reactome:REACT_17015 InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0005783 EMBL:CH471077 GO:GO:0005615 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            EMBL:AL109840 GO:GO:0060441 MEROPS:C01.013 CTD:1522
            HOGENOM:HOG000264454 HOVERGEN:HBG004456 KO:K08568 OrthoDB:EOG42Z4QN
            BRENDA:3.4.18.1 EMBL:AF073890 EMBL:AF032906 EMBL:AF136273
            EMBL:AF136276 EMBL:AF136274 EMBL:AF136275 EMBL:AK314931
            EMBL:BC042168 EMBL:AF009923 IPI:IPI00002745 RefSeq:NP_001327.2
            UniGene:Hs.252549 PDB:1DEU PDB:1EF7 PDBsum:1DEU PDBsum:1EF7
            ProteinModelPortal:Q9UBR2 SMR:Q9UBR2 STRING:Q9UBR2 DMDM:12643324
            PaxDb:Q9UBR2 PeptideAtlas:Q9UBR2 PRIDE:Q9UBR2 DNASU:1522
            Ensembl:ENST00000217131 GeneID:1522 KEGG:hsa:1522 UCSC:uc002yai.2
            GeneCards:GC20M057570 HGNC:HGNC:2547 HPA:CAB025114 MIM:603169
            neXtProt:NX_Q9UBR2 PharmGKB:PA27043 InParanoid:Q9UBR2 OMA:QCGTCTE
            PhylomeDB:Q9UBR2 BindingDB:Q9UBR2 ChEMBL:CHEMBL4160 ChiTaRS:CTSZ
            EvolutionaryTrace:Q9UBR2 GenomeRNAi:1522 NextBio:6299 Bgee:Q9UBR2
            CleanEx:HS_CTSZ Genevestigator:Q9UBR2 GermOnline:ENSG00000101160
            Uniprot:Q9UBR2
        Length = 303

 Score = 231 (86.4 bits), Expect = 2.5e-19, P = 2.5e-19
 Identities = 72/224 (32%), Positives = 110/224 (49%)

Query:   133 LPKSVDWRKKGAV---TPVKNQGS---CGSCWAF-STVAAVEGINQIVSGNLTS--LSEQ 183
             LPKS DWR    V   +  +NQ     CGSCWA  ST A  + IN    G   S  LS Q
Sbjct:    62 LPKSWDWRNVDGVNYASITRNQHIPQYCGSCWAHASTSAMADRINIKRKGAWPSTLLSVQ 121

Query:   184 ELIDCDTSFNNGCNGG----LMDYAFKYIVASGGLH----KEEDYPYLMEEGTCEDKKE- 234
              +IDC  +    C GG    + DYA ++ +     +    K+++     + GTC + KE 
Sbjct:   122 NVIDCGNA--GSCEGGNDLSVWDYAHQHGIPDETCNNYQAKDQECDKFNQCGTCNEFKEC 179

Query:   235 ----EMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAE 290
                    +  +  Y  +    E+ + +  A+ P+S  I A+      Y+GG++       
Sbjct:   180 HAIRNYTLWRVGDYGSL-SGREKMMAEIYANGPISCGIMATER-LANYTGGIYAEYQDTT 237

Query:   291 -LDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGK 333
              ++H V+  G+G S G++Y IV+NSWG  WGERG++R+  +T K
Sbjct:   238 YINHVVSVAGWGISDGTEYWIVRNSWGEPWGERGWLRIVTSTYK 281


>WB|WBGene00000784
            symbol:cpr-4 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 eggNOG:NOG315657
            GeneTree:ENSGT00560000076599 HOGENOM:HOG000241341 KO:K01363
            PANTHER:PTHR12411:SF16 EMBL:L39895 EMBL:L39926 EMBL:FO081381
            PIR:T37280 RefSeq:NP_504682.1 UniGene:Cel.5404
            ProteinModelPortal:P43508 SMR:P43508 DIP:DIP-25376N
            MINT:MINT-1069892 STRING:P43508 MEROPS:C01.A34 PaxDb:P43508
            EnsemblMetazoa:F44C4.3 GeneID:179053 KEGG:cel:CELE_F44C4.3
            UCSC:F44C4.3 CTD:179053 WormBase:F44C4.3 InParanoid:P43508
            OMA:CCGFLCG NextBio:903704 Uniprot:P43508
        Length = 335

 Score = 150 (57.9 bits), Expect = 4.5e-19, Sum P(2) = 4.5e-19
 Identities = 33/83 (39%), Positives = 42/83 (50%)

Query:   259 LAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELD-HGVAAVGYGKSKGSDYIIVKNSWGP 317
             +AH PV  A      DF  Y  GV+    G EL  H +  +G+G   G+ Y +V NSW  
Sbjct:   247 IAHGPVEAAFTVY-EDFYQYKTGVYVHTTGQELGGHAIRILGWGTDNGTPYWLVANSWNV 305

Query:   318 KWGERGYIRMKRNTGKPEGLCGI 340
              WGE GY R+ R T +    CGI
Sbjct:   306 NWGENGYFRIIRGTNE----CGI 324

 Score = 143 (55.4 bits), Expect = 4.5e-19, Sum P(2) = 4.5e-19
 Identities = 35/123 (28%), Positives = 59/123 (47%)

Query:    98 DMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVK--ALPKSVD----WRKKGAVTPVKNQ 151
             D++ E+ K + +  + +F     P  E    D+    +P + D    W    ++  +++Q
Sbjct:    46 DITIEQVKKRLM--RTEFVAPHTPDVEVVKHDINEDTIPATFDARTQWPNCMSINNIRDQ 103

Query:   152 GSCGSCWAFSTVAAVEGINQIVSGNL--TSLSEQELIDCDTSFNNGCNGGLMDYAFKYIV 209
               CGSCWAF+   A      I S     T LS ++++ C ++   GC GG    A+KY+V
Sbjct:   104 SDCGSCWAFAAAEAASDRFCIASNGAVNTLLSAEDVLSCCSNCGYGCEGGYPINAWKYLV 163

Query:   210 ASG 212
              SG
Sbjct:   164 KSG 166


>RGD|708479
            symbol:Ctsz "cathepsin Z" species:10116 "Rattus norvegicus"
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=TAS]
            [GO:0005615 "extracellular space" evidence=IEA;ISO] [GO:0005783
            "endoplasmic reticulum" evidence=IEA;ISO] [GO:0006508 "proteolysis"
            evidence=IEA] [GO:0060441 "epithelial tube branching involved in
            lung morphogenesis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            RGD:708479 GO:GO:0005576 GO:GO:0006508 InterPro:IPR025661
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0004197 MEROPS:C01.013 CTD:1522 HOVERGEN:HBG004456 KO:K08568
            EMBL:AB023781 EMBL:BC091110 IPI:IPI00207663 RefSeq:NP_899159.1
            UniGene:Rn.1475 ProteinModelPortal:Q9R1T3 SMR:Q9R1T3 PRIDE:Q9R1T3
            GeneID:252929 KEGG:rno:252929 BindingDB:Q9R1T3 NextBio:624097
            Genevestigator:Q9R1T3 Uniprot:Q9R1T3
        Length = 306

 Score = 228 (85.3 bits), Expect = 5.1e-19, P = 5.1e-19
 Identities = 76/239 (31%), Positives = 115/239 (48%)

Query:   118 RRQPSAEFSYRDVKALPKSVDWRKKGAV---TPVKNQGS---CGSCWAF-STVAAVEGIN 170
             RR       Y     LPK+ DWR    V   +  +NQ     CGSCWA  ST A  + IN
Sbjct:    49 RRTYPRPHEYLSPADLPKNWDWRNVNGVNYASVTRNQHIPQYCGSCWAHGSTSALADRIN 108

Query:   171 QIVSGNLTS--LSEQELIDCDTSFNNGCNGG----LMDYAFKYIVASGGLH----KEEDY 220
                 G   S  LS Q +IDC  +    C GG    + +YA K+ +     +    K+++ 
Sbjct:   109 IKRKGAWPSTLLSVQNVIDCGNA--GSCEGGNDLPVWEYAHKHGIPDETCNNYQAKDQEC 166

Query:   221 PYLMEEGTCEDKKE--EMEVVTISGYQDVPE--NDEQSLLKALAHQPVSVAIEASGTDFQ 276
                 + GTC + KE   ++  T+    D       E+ + +  A+ P+S  I A+     
Sbjct:   167 DKFNQCGTCTEFKECHTIQNYTLWRVGDYGSLSGREKMMAEIYANGPISCGIMATER-MS 225

Query:   277 FYSGGVFTGPCG-AELDHGVAAVGYGKSK-GSDYIIVKNSWGPKWGERGYIRMKRNTGK 333
              Y+GG++T     A ++H ++  G+G S  G +Y IV+NSWG  WGERG++R+  +T K
Sbjct:   226 NYTGGIYTEYQNQAIINHIISVAGWGVSNDGIEYWIVRNSWGEPWGERGWMRIVTSTYK 284


>UNIPROTKB|A5GFX7
            symbol:CTSZ "Cathepsin Z" species:9823 "Sus scrofa"
            [GO:0060441 "epithelial tube branching involved in lung
            morphogenesis" evidence=IEA] [GO:0005783 "endoplasmic reticulum"
            evidence=IEA] [GO:0005615 "extracellular space" evidence=IEA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0005783 GO:GO:0005615 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411 PROSITE:PS00640
            GO:GO:0060441 GeneTree:ENSGT00560000076599 MEROPS:C01.013 CTD:1522
            HOGENOM:HOG000264454 HOVERGEN:HBG004456 KO:K08568 OrthoDB:EOG42Z4QN
            OMA:QCGTCTE EMBL:CR956646 RefSeq:NP_001116576.1 UniGene:Ssc.16769
            ProteinModelPortal:A5GFX7 SMR:A5GFX7 STRING:A5GFX7
            Ensembl:ENSSSCT00000008249 GeneID:100141405 KEGG:ssc:100141405
            ArrayExpress:A5GFX7 Uniprot:A5GFX7
        Length = 304

 Score = 226 (84.6 bits), Expect = 8.3e-19, P = 8.3e-19
 Identities = 75/245 (30%), Positives = 114/245 (46%)

Query:   112 KPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAV---TPVKNQGS---CGSCWAF-STVA 164
             + Q   R  P     Y     LP+S DWR    V   +  +NQ     CGSCWA  ST A
Sbjct:    43 RTQLGHRTYPRPH-EYLSPSDLPRSWDWRNVNGVNYASVTRNQHIPQYCGSCWAHGSTSA 101

Query:   165 AVEGINQIVSGNLTS--LSEQELIDCDTSFNNGCNGG----LMDYAFKYIVASGGLHKEE 218
               + IN    G   S  LS Q +IDC  +    C GG    +  YA ++ +     +  +
Sbjct:   102 MADRINIKRKGAWPSTLLSVQHVIDCGNA--GSCEGGDDLPVWAYAHRHGIPDETCNNYQ 159

Query:   219 DYPYLMEE----GTCEDKKE-----EMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIE 269
                 + ++    GTC + KE        +  +  Y  V    E+ + +  A+ P+S  I 
Sbjct:   160 AKDQVCDKFNQCGTCTEFKECHVIQNYTLWKVGDYGSV-SGREKMMAEIYANGPISCGIM 218

Query:   270 ASGTDFQFYSGGVFTG-PCGAELDHGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMK 328
             A+      Y+GG++      A ++H V+  G+G S G++Y IV+NSWG  WGERG++R+ 
Sbjct:   219 AT-EKMSNYTGGIYAEYKDQAYINHIVSVAGWGVSGGTEYWIVRNSWGEPWGERGWMRIV 277

Query:   329 RNTGK 333
              +T K
Sbjct:   278 TSTYK 282


>ZFIN|ZDB-GENE-040426-2650
            symbol:ctsba "cathepsin B, a" species:7955 "Danio
            rerio" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0050790
            "regulation of catalytic activity" evidence=IEA] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=IEA] [GO:0005575
            "cellular_component" evidence=ND] [GO:0031101 "fin regeneration"
            evidence=IEP] [GO:0008233 "peptidase activity" evidence=IEA]
            [GO:0016787 "hydrolase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR012599 InterPro:IPR013128 InterPro:IPR015643
            Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 ZFIN:ZDB-GENE-040426-2650 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0050790
            GO:GO:0004197 GO:GO:0031101 MEROPS:C01.060 HOVERGEN:HBG003480
            PANTHER:PTHR12411:SF16 HSSP:P07688 EMBL:BC044517 IPI:IPI00485996
            UniGene:Dr.3374 ProteinModelPortal:Q803E4 SMR:Q803E4 STRING:Q803E4
            PRIDE:Q803E4 InParanoid:Q803E4 ArrayExpress:Q803E4 Bgee:Q803E4
            Uniprot:Q803E4
        Length = 330

 Score = 150 (57.9 bits), Expect = 8.6e-19, Sum P(2) = 8.6e-19
 Identities = 54/163 (33%), Positives = 77/163 (47%)

Query:    83 NKEVTSYWLGLNEFADMSHEEFKNKYLG--LK-PQFPTRRQPSAEFSYRDVKALPKSVDW 139
             NK  T++  G N F D+ +   K +  G  LK P+ P   Q      Y +   LPK+ D 
Sbjct:    34 NKANTTWTAGHN-FRDVDYSYVK-RLCGTFLKGPKLPVMVQ------YTEGLKLPKNFDA 85

Query:   140 RKKGAVTP----VKNQGSCGSCWAFSTVAAVEGINQIVSGNLTS--LSEQELIDCDTSFN 193
             R++    P    +++QGSCGSCWAF    A+     I S    S  +S Q+L+ C  S  
Sbjct:    86 REQWPNCPTLKEIRDQGSCGSCWAFGAAEAISDRVCIQSNAKVSVEISSQDLLTCCDSCG 145

Query:   194 NGCNGGLMDYAFKYI----VASGGLHKEED--YPYLMEEGTCE 230
              GCNGG    A+ +     + +GGL+       PY +E   CE
Sbjct:   146 MGCNGGYPSAAWDFWTTDGLVTGGLYNSHIGCRPYTIEP--CE 186

 Score = 140 (54.3 bits), Expect = 8.6e-19, Sum P(2) = 8.6e-19
 Identities = 33/105 (31%), Positives = 48/105 (45%)

Query:   247 VPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELD-HGVAAVGYGKSKG 305
             VP N    + +   + PV  A      DF  Y  GV+    G+ L  H +  +G+G+  G
Sbjct:   231 VPSNQNGIMAELFKNGPVEAAFTVY-EDFLLYKSGVYQHMSGSALGGHAIKILGWGEENG 289

Query:   306 SDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINK--MASIPL 348
               Y +  NSW   WG+ GY ++ R     E  CGI    +A IP+
Sbjct:   290 VPYWLAANSWNTDWGDNGYFKILRG----EDHCGIESEIVAGIPM 330


>TAIR|locus:505006093
            symbol:AT1G02305 species:3702 "Arabidopsis thaliana"
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=IEA]
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] [GO:0050790 "regulation of
            catalytic activity" evidence=IEA] [GO:0005773 "vacuole"
            evidence=IDA] [GO:0005829 "cytosol" evidence=RCA]
            InterPro:IPR000668 InterPro:IPR012599 InterPro:IPR013128
            InterPro:IPR015643 Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 EMBL:CP002684 GO:GO:0005773
            GO:GO:0006508 InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00139
            PROSITE:PS00639 GO:GO:0050790 GO:GO:0004197 HOGENOM:HOG000241341
            KO:K01363 PANTHER:PTHR12411:SF16 OMA:CCGFLCG UniGene:At.23486
            UniGene:At.42610 UniGene:At.43952 EMBL:AY039887 EMBL:AF428337
            EMBL:BT002227 IPI:IPI00524601 RefSeq:NP_563648.1 HSSP:P07858
            ProteinModelPortal:Q93VC9 SMR:Q93VC9 IntAct:Q93VC9 STRING:Q93VC9
            MEROPS:C01.049 PRIDE:Q93VC9 ProMEX:Q93VC9 EnsemblPlants:AT1G02305.1
            GeneID:839538 KEGG:ath:AT1G02305 TAIR:At1g02305 InParanoid:Q93VC9
            PhylomeDB:Q93VC9 ProtClustDB:CLSN2687619 Genevestigator:Q93VC9
            Uniprot:Q93VC9
        Length = 362

 Score = 152 (58.6 bits), Expect = 9.7e-19, Sum P(2) = 9.7e-19
 Identities = 53/174 (30%), Positives = 80/174 (45%)

Query:    63 EEKLHRFEIFKENLKHIDQRNKEVTSYW-LGLNE-FADMSHEEFKNKYLGLKPQFPTRRQ 120
             ++KL  + +  E +K +   N+   + W    N+ FA+ +  EFK + LG+KP  P    
Sbjct:    38 KQKLTSWILQNEIVKEV---NENPNAGWKASFNDRFANATVAEFK-RLLGVKPT-PKTEF 92

Query:   121 PSAEFSYRDVKA-LPKSVD----WRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSG 175
                     D+   LPK  D    W +  ++  + +QG CGSCWAF  V ++     I   
Sbjct:    93 LGVPIVSHDISLKLPKEFDARTAWSQCTSIGRILDQGHCGSCWAFGAVESLSDRFCIKYN 152

Query:   176 NLTSLSEQELIDCDTSF--NNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEG 227
                SLS  +L+ C   F    GCNGG    A++Y    G + +E D PY    G
Sbjct:   153 MNVSLSVNDLLAC-CGFLCGQGCNGGYPIAAWRYFKHHGVVTEECD-PYFDNTG 204

 Score = 139 (54.0 bits), Expect = 9.7e-19, Sum P(2) = 9.7e-19
 Identities = 34/111 (30%), Positives = 54/111 (48%)

Query:   241 ISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELD-HGVAAVG 299
             +S Y+ V  + +  + +   + PV VA      DF  Y  GV+    G  +  H V  +G
Sbjct:   238 VSAYK-VRSHPDDIMAEVYKNGPVEVAFTVY-EDFAHYKSGVYKHITGTNIGGHAVKLIG 295

Query:   300 YGKSK-GSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINK--MASIP 347
             +G S  G DY ++ N W   WG+ GY +++R T +    CGI    +A +P
Sbjct:   296 WGTSDDGEDYWLLANQWNRSWGDDGYFKIRRGTNE----CGIEHGVVAGLP 342


>UNIPROTKB|F1RKR7
            symbol:CTSH "Cathepsin H light chain" species:9823 "Sus
            scrofa" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] InterPro:IPR013128 GO:GO:0008234 InterPro:IPR013201
            PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            GeneTree:ENSGT00660000095458 EMBL:CU326382
            Ensembl:ENSSSCT00000001985 ArrayExpress:F1RKR7 Uniprot:F1RKR7
        Length = 197

 Score = 220 (82.5 bits), Expect = 3.6e-18, P = 3.6e-18
 Identities = 57/144 (39%), Positives = 84/144 (58%)

Query:    32 GYSPEHLTSMDKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWL 91
             G S   ++S +KL   F+SWM +H K Y  +EE  HR ++F  N + I+  N    ++ L
Sbjct:    21 GASNLAVSSFEKLH--FKSWMVQHQKKYS-LEEYHHRLQVFVSNWRKINAHNAGNHTFKL 77

Query:    92 GLNEFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSY-RDVKALPKSVDWRKKGA-VTPVK 149
             GLN+F+DMS +E ++KYL  +PQ       + + +Y R     P S+DWRKKG  V+PVK
Sbjct:    78 GLNQFSDMSFDEIRHKYLWSEPQ----NCSATKGNYLRGTGPYPPSMDWRKKGNFVSPVK 133

Query:   150 NQGSCGSCWAF---STVAAVEGIN 170
             NQ S  S W     ST+ A +G++
Sbjct:   134 NQNS--SWWTAPRTSTITAAKGVS 155


>UNIPROTKB|A1E295
            symbol:CTSB "Cathepsin B" species:9823 "Sus scrofa"
            [GO:0005615 "extracellular space" evidence=IDA] [GO:0042470
            "melanosome" evidence=IEA] [GO:0005764 "lysosome" evidence=IEA]
            [GO:0097067 "cellular response to thyroid hormone stimulus"
            evidence=IEA] [GO:0005739 "mitochondrion" evidence=IEA] [GO:0005730
            "nucleolus" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0050790 "regulation of catalytic activity" evidence=IEA]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR012599 InterPro:IPR013128
            InterPro:IPR015643 Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0005739 GO:GO:0042470
            GO:GO:0005615 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0050790 GO:GO:0005764 GO:GO:0004197 MEROPS:C01.060 CTD:1508
            GeneTree:ENSGT00560000076599 HOVERGEN:HBG003480 KO:K01363
            OMA:GYPSGAW GO:GO:0097067 PANTHER:PTHR12411:SF16 EMBL:EF095956
            RefSeq:NP_001090927.1 UniGene:Ssc.53773 ProteinModelPortal:A1E295
            SMR:A1E295 PRIDE:A1E295 Ensembl:ENSSSCT00000026923 GeneID:100037961
            KEGG:ssc:100037961 Uniprot:A1E295
        Length = 335

 Score = 157 (60.3 bits), Expect = 3.7e-18, Sum P(2) = 3.7e-18
 Identities = 54/165 (32%), Positives = 82/165 (49%)

Query:    64 EKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEF-ADMSH-EEFKNKYLGLKPQFPTRRQP 121
             E LH F+   + L  ++  NK+ T++  G N +  D+S+ ++    +LG  P+ P R   
Sbjct:    19 ESLH-FQPLSDEL--VNFINKQNTTWTAGHNFYNVDLSYVKKLCGTFLG-GPKLPQR--- 71

Query:   122 SAEFSYRDVKALPKSVD----WRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSG-- 175
              A F+  D+  LPKS D    W     +  +++QGSCGSCWAF  V A+     I S   
Sbjct:    72 -AAFA-ADM-ILPKSFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRSNGR 128

Query:   176 -NLTSLSEQELIDCDTSFNNGCNGGLMDYAFKYI----VASGGLH 215
              N+   +E  L  C     +GCNGG    A+ +     + SGGL+
Sbjct:   129 VNVEVSAEDMLTCCGDECGDGCNGGFPSGAWNFWTKKGLVSGGLY 173

 Score = 126 (49.4 bits), Expect = 3.7e-18, Sum P(2) = 3.7e-18
 Identities = 36/128 (28%), Positives = 59/128 (46%)

Query:   223 LMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGV 282
             + E G     KE+      S Y  +  N+++ + +   + PV  A     +DF  Y  GV
Sbjct:   210 ICEPGYTPSYKEDKHF-GCSSYS-ISRNEKEIMAEIYKNGPVEGAFTVY-SDFLQYKSGV 266

Query:   283 FTGPCGAELD-HGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGIN 341
             +    G  +  H +  +G+G   G+ Y +V NSW   WG+ G+ ++ R  G+    CGI 
Sbjct:   267 YQHVTGDLMGGHAIRILGWGVENGTPYWLVGNSWNTDWGDNGFFKILR--GQDH--CGIE 322

Query:   342 K--MASIP 347
                +A IP
Sbjct:   323 SEIVAGIP 330


>WB|WBGene00000785
            symbol:cpr-5 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 eggNOG:NOG315657
            GeneTree:ENSGT00560000076599 HOGENOM:HOG000241341 KO:K01363
            PANTHER:PTHR12411:SF16 EMBL:L39896 EMBL:L39927 EMBL:FO081739
            PIR:T37277 RefSeq:NP_503383.1 UniGene:Cel.19730
            ProteinModelPortal:P43509 SMR:P43509 DIP:DIP-25329N IntAct:P43509
            MINT:MINT-1051285 STRING:P43509 MEROPS:C01.A35 PaxDb:P43509
            EnsemblMetazoa:W07B8.5 GeneID:178612 KEGG:cel:CELE_W07B8.5
            UCSC:W07B8.5.1 CTD:178612 WormBase:W07B8.5 InParanoid:P43509
            OMA:DAIPDHF NextBio:901840 Uniprot:P43509
        Length = 344

 Score = 151 (58.2 bits), Expect = 8.9e-18, Sum P(2) = 8.9e-18
 Identities = 37/99 (37%), Positives = 51/99 (51%)

Query:   252 EQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELD-HGVAAVGYGKSKGSDYII 310
             EQ   + L + P+ VA      DF  Y+ GV+    GA L  H V  +G+G   G+ Y +
Sbjct:   245 EQIQTEILTNGPIEVAFTVY-EDFYQYTTGVYVHTAGASLGGHAVKILGWGVDNGTPYWL 303

Query:   311 VKNSWGPKWGERGYIRMKRNTGKPEGLCGI--NKMASIP 347
             V NSW   WGE+GY R+ R   +    CGI  + +A IP
Sbjct:   304 VANSWNVAWGEKGYFRIIRGLNE----CGIEHSAVAGIP 338

 Score = 130 (50.8 bits), Expect = 8.9e-18, Sum P(2) = 8.9e-18
 Identities = 36/120 (30%), Positives = 54/120 (45%)

Query:   102 EEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVD----WRKKGAVTPVKNQGSCGSC 157
             E+   K + +K   P + +        D  A+P   D    W    ++  +++Q  CGSC
Sbjct:    53 EKITKKLMDVKYLVPHKDEDIVATEVSD--AIPDHFDARDQWPNCMSINNIRDQSDCGSC 110

Query:   158 WAFSTVAAVEGINQIVSGNL--TSLSEQELIDCDT---SFNNGCNGGLMDYAFKYIVASG 212
             WAF+   A+     I S     T LS ++L+ C T   S  NGC GG    A+K+ V  G
Sbjct:   111 WAFAAAEAISDRTCIASNGAVNTLLSSEDLLSCCTGMFSCGNGCEGGYPIQAWKWWVKHG 170


>WB|WBGene00021072
            symbol:W07B8.4 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 eggNOG:NOG315657
            GeneTree:ENSGT00560000076599 HOGENOM:HOG000241341 KO:K01363
            PANTHER:PTHR12411:SF16 EMBL:FO081739 PIR:T31728 RefSeq:NP_503382.1
            HSSP:P53634 ProteinModelPortal:O16288 SMR:O16288 STRING:O16288
            MEROPS:C01.A39 PaxDb:O16288 EnsemblMetazoa:W07B8.4 GeneID:178611
            KEGG:cel:CELE_W07B8.4 UCSC:W07B8.4 CTD:178611 WormBase:W07B8.4
            InParanoid:O16288 OMA:ESQYGCK NextBio:901836 Uniprot:O16288
        Length = 335

 Score = 156 (60.0 bits), Expect = 1.6e-17, Sum P(2) = 1.6e-17
 Identities = 39/104 (37%), Positives = 51/104 (49%)

Query:   242 SGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELD-HGVAAVGY 300
             S Y  +  + +Q   + LAH PV V       DF  Y  G++T   G EL  H V  +G+
Sbjct:   227 SAYA-IGRSAKQIQTEILAHGPVEVGFIVY-EDFYLYKTGIYTHVAGGELGGHAVKMLGW 284

Query:   301 GKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMA 344
             G   G+ Y +  NSW   WGE+GY R+ R  G  E  CGI   A
Sbjct:   285 GVDNGTPYWLAANSWNTVWGEKGYFRILR--GVDE--CGIESAA 324

 Score = 121 (47.7 bits), Expect = 1.6e-17, Sum P(2) = 1.6e-17
 Identities = 31/90 (34%), Positives = 50/90 (55%)

Query:   132 ALPKSVD----WRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVS-GNL-TSLSEQEL 185
             ++P S D    W +  +V  +++Q  CGSCWA +   A+     I S G++ T LS +++
Sbjct:    72 SIPDSYDVRDHWPQCISVNNIRDQSHCGSCWAVAAAEAISDRTCIASNGDVNTLLSAEDI 131

Query:   186 IDCDTS-FN--NGCNGGLMDYAFKYIVASG 212
             + C T  FN  +GC GG    A++Y V +G
Sbjct:   132 LTCCTGKFNCGDGCEGGYPIQAWRYWVKNG 161


>UNIPROTKB|F1N9D7
            symbol:CTSB "Cathepsin B" species:9031 "Gallus gallus"
            [GO:0050790 "regulation of catalytic activity" evidence=IEA]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0005730 "nucleolus"
            evidence=IEA] [GO:0005739 "mitochondrion" evidence=IEA] [GO:0005764
            "lysosome" evidence=IEA] [GO:0097067 "cellular response to thyroid
            hormone stimulus" evidence=IEA] InterPro:IPR000668
            InterPro:IPR012599 InterPro:IPR013128 InterPro:IPR015643
            Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 GO:GO:0005739 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0050790 GO:GO:0005764
            GO:GO:0004197 GeneTree:ENSGT00560000076599 OMA:GYPSGAW
            GO:GO:0097067 PANTHER:PTHR12411:SF16 IPI:IPI00573387
            EMBL:AADN02018292 Ensembl:ENSGALT00000026896
            Ensembl:ENSGALT00000036723 Uniprot:F1N9D7
        Length = 340

 Score = 143 (55.4 bits), Expect = 4.9e-17, Sum P(2) = 4.9e-17
 Identities = 44/146 (30%), Positives = 73/146 (50%)

Query:    79 IDQRNKEVTSYWLGLN-EFADMSH-EEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKS 136
             ++  NK  T++  G N    DMS+ ++    +LG  P+ P R   +A+    D     K 
Sbjct:    31 VNHINKLNTTWKAGHNFHNTDMSYVKKLCGTFLG-GPKLPERVDFAADMDLPDTFDSRKQ 89

Query:   137 VDWRKKGAVTPVKNQGSCGSCWAFSTVAAV-EGINQIVSGNLT-SLSEQELIDC-DTSFN 193
               W     ++ +++QGSCGSCWAF  V A+ + I    +  ++  +S ++L+ C      
Sbjct:    90 --WPNCPTISEIRDQGSCGSCWAFGAVEAISDRICVHTNAKVSVEVSAEDLLSCCGFECG 147

Query:   194 NGCNGGLMDYAFKYI----VASGGLH 215
              GCNGG    A++Y     + SGGL+
Sbjct:   148 MGCNGGYPSGAWRYWTERGLVSGGLY 173

 Score = 132 (51.5 bits), Expect = 4.9e-17, Sum P(2) = 4.9e-17
 Identities = 35/126 (27%), Positives = 58/126 (46%)

Query:   225 EEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFT 284
             E G     KE+     I+ Y  VP ++++ + +   + PV  A      DF  Y  GV+ 
Sbjct:   213 EPGYSPSYKEDKHY-GITSY-GVPRSEKEIMAEIYKNGPVEGAFIVY-EDFLMYKSGVYQ 269

Query:   285 GPCGAELD-HGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINK- 342
                G ++  H +  +G+G   G+ Y +  NSW   WG+ G+ ++ R     E  CGI   
Sbjct:   270 HVSGEQVGGHAIRILGWGVENGTPYWLAANSWNTDWGDNGFFKILRG----EDHCGIESE 325

Query:   343 -MASIP 347
              +A +P
Sbjct:   326 IVAGVP 331


>UNIPROTKB|E2R6Q7
            symbol:CTSB "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0097067 "cellular response to thyroid hormone
            stimulus" evidence=IEA] [GO:0005764 "lysosome" evidence=IEA]
            [GO:0005739 "mitochondrion" evidence=IEA] [GO:0005730 "nucleolus"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0050790
            "regulation of catalytic activity" evidence=IEA] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR012599 InterPro:IPR013128
            InterPro:IPR015643 Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0005739 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0050790
            GO:GO:0005764 GO:GO:0004197 CTD:1508 GeneTree:ENSGT00560000076599
            KO:K01363 OMA:GYPSGAW GO:GO:0097067 PANTHER:PTHR12411:SF16
            EMBL:AAEX03014318 RefSeq:XP_543203.3 Ensembl:ENSCAFT00000012692
            GeneID:486077 KEGG:cfa:486077 NextBio:20859923 Uniprot:E2R6Q7
        Length = 339

 Score = 138 (53.6 bits), Expect = 5.7e-17, Sum P(2) = 5.7e-17
 Identities = 38/128 (29%), Positives = 60/128 (46%)

Query:   223 LMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGV 282
             + E G     KE+      S Y  V +N+++ + +   + PV  A     +DF  Y  GV
Sbjct:   210 ICEPGYSPSYKEDKHY-GCSSYS-VSDNEKEIMAEIYKNGPVEAAFTVY-SDFLLYKSGV 266

Query:   283 FTGPCGAELD-HGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGIN 341
             +    G  +  H V  +G+G   G+ Y +V NSW   WG+ G+ ++ R  G+    CGI 
Sbjct:   267 YQHVTGEMMGGHAVRILGWGVEDGTPYWLVGNSWNTDWGDNGFFKILR--GRDH--CGIE 322

Query:   342 K--MASIP 347
                +A IP
Sbjct:   323 SEIVAGIP 330

 Score = 137 (53.3 bits), Expect = 5.7e-17, Sum P(2) = 5.7e-17
 Identities = 46/150 (30%), Positives = 74/150 (49%)

Query:    79 IDQRNKEVTSYWLGLN-EFADMSH-EEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKS 136
             +D  NK  T++  G N    D S+       +LG  P+ P R Q    F+   +  LP+S
Sbjct:    31 VDYVNKRNTTWKAGHNFHNVDPSYLRRLCGTFLG-GPKLPQRVQ----FAKNLI--LPES 83

Query:   137 VD----WRKKGAVTPVKNQGSCGSCWAFSTVAAV-EGINQIVSGNLT-SLSEQELIDC-D 189
              D    W     +  +++QGSCGSCWAF  V A+ + I    +G++   +S ++++ C  
Sbjct:    84 FDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIRTNGHVNVEVSAEDMLTCCG 143

Query:   190 TSFNNGCNGGLMDYAFKYI----VASGGLH 215
                 +GCNGG    A+ +     + SGGL+
Sbjct:   144 DQCGDGCNGGFPAEAWNFWTKQGLVSGGLY 173


>ZFIN|ZDB-GENE-070323-1
            symbol:ctsbb "cathepsin B, b" species:7955 "Danio
            rerio" [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=IEA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0050790
            "regulation of catalytic activity" evidence=IEA] [GO:0008233
            "peptidase activity" evidence=IEA] [GO:0016787 "hydrolase activity"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR012599
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 Pfam:PF08127
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169
            ZFIN:ZDB-GENE-070323-1 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0050790 GO:GO:0004197
            GeneTree:ENSGT00560000076599 PANTHER:PTHR12411:SF16 OMA:CCGFLCG
            EMBL:CU207296 EMBL:CABZ01037785 IPI:IPI00877452
            Ensembl:ENSDART00000097263 Bgee:F1QZT5 Uniprot:F1QZT5
        Length = 326

 Score = 146 (56.5 bits), Expect = 5.8e-17, Sum P(2) = 5.8e-17
 Identities = 34/105 (32%), Positives = 53/105 (50%)

Query:   246 DVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELD-HGVAAVGYGKSK 304
             +VP + +Q + +   + PV  A      DF  Y  GV+    G+ L  H V  +G+G+  
Sbjct:   225 NVPSDQQQIMTELYTNGPVEAAFTVY-EDFPLYKSGVYQHLTGSALGGHAVKILGWGEEN 283

Query:   305 GSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINK--MASIP 347
             G+ + +V NSW   WG+ GY ++ R  G  E  CGI    +A +P
Sbjct:   284 GTPFWLVANSWNSDWGDNGYFKILR--GHDE--CGIESEMVAGLP 324

 Score = 127 (49.8 bits), Expect = 5.8e-17, Sum P(2) = 5.8e-17
 Identities = 42/133 (31%), Positives = 61/133 (45%)

Query:   129 DVKALPKSVD----WRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTS--LSE 182
             +VK LP S D    W     +  +++QGSCGSCWAF  V ++     I S    S  +S 
Sbjct:    72 NVK-LPDSFDLRDQWPNCKTLNQIRDQGSCGSCWAFGAVESISDRICIHSKGKQSPEISA 130

Query:   183 QELIDCDTSFNNGCNGGLMDYAFKYI----VASGGLHKEED--YPYLMEEGTCEDKKEEM 236
             ++L+ C      GC+GG    A+ Y     + +GGL+  +    PY +    CE      
Sbjct:   131 EDLLSCCDQCGFGCSGGFPAEAWDYWRRSGLVTGGLYNSDVGCRPYSI--APCEHHVNGT 188

Query:   237 EVVTISGYQDVPE 249
                  SG QD P+
Sbjct:   189 RP-PCSGEQDTPK 200


>WB|WBGene00000786
            symbol:cpr-6 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0005764 "lysosome"
            evidence=IDA] InterPro:IPR000668 InterPro:IPR013128
            InterPro:IPR015643 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 eggNOG:COG4870 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GeneTree:ENSGT00560000076599 HOGENOM:HOG000241341
            PANTHER:PTHR12411:SF16 EMBL:L39894 EMBL:L39939 EMBL:FO080666
            PIR:T37274 RefSeq:NP_741818.1 UniGene:Cel.18138
            ProteinModelPortal:P43510 SMR:P43510 DIP:DIP-25139N
            MINT:MINT-1074025 STRING:P43510 MEROPS:C01.A51 PaxDb:P43510
            PRIDE:P43510 EnsemblMetazoa:C25B8.3a GeneID:180931
            KEGG:cel:CELE_C25B8.3 UCSC:C25B8.3a CTD:180931 WormBase:C25B8.3a
            InParanoid:P43510 OMA:KAKWGLM NextBio:911608 ArrayExpress:P43510
            Uniprot:P43510
        Length = 379

 Score = 138 (53.6 bits), Expect = 9.1e-17, Sum P(2) = 9.1e-17
 Identities = 34/86 (39%), Positives = 47/86 (54%)

Query:   133 LPKSVD----WRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVS-GNL-TSLSEQELI 186
             +P+S D    W K  ++  +++Q SCGSCWAF  V A+     I S G L  +LS  +L+
Sbjct:   105 IPESFDSRDNWPKCDSIKVIRDQSSCGSCWAFGAVEAMSDRICIASHGELQVTLSADDLL 164

Query:   187 DCDTSFNNGCNGGLMDYAFKYIVASG 212
              C  S   GCNGG    A++Y V  G
Sbjct:   165 SCCKSCGFGCNGGDPLAAWRYWVKDG 190

 Score = 137 (53.3 bits), Expect = 9.1e-17, Sum P(2) = 9.1e-17
 Identities = 38/104 (36%), Positives = 52/104 (50%)

Query:   249 ENDEQSLLKAL-AHQPVSVAIEASGTDFQFYSGGVFTGPCGAELD--HGVAAVGYGKSKG 305
             ++D +++ K L  H P+ +A E    DF  Y GGV+    G +L   H V  +G+G   G
Sbjct:   260 KDDVEAIQKELMTHGPLEIAFEVY-EDFLNYDGGVYVHT-GGKLGGGHAVKLIGWGIDDG 317

Query:   306 SDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINK--MASIP 347
               Y  V NSW   WGE G+ R+ R  G  E  CGI    +  IP
Sbjct:   318 IPYWTVANSWNTDWGEDGFFRILR--GVDE--CGIESGVVGGIP 357


>UNIPROTKB|P07858
            symbol:CTSB "Cathepsin B" species:9606 "Homo sapiens"
            [GO:0050790 "regulation of catalytic activity" evidence=IEA]
            [GO:0005739 "mitochondrion" evidence=IEA] [GO:0042470 "melanosome"
            evidence=IEA] [GO:0005515 "protein binding" evidence=IPI]
            [GO:0042981 "regulation of apoptotic process" evidence=TAS]
            [GO:0006508 "proteolysis" evidence=IDA] [GO:0005764 "lysosome"
            evidence=IDA] [GO:0097067 "cellular response to thyroid hormone
            stimulus" evidence=IEP] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IDA] [GO:0048471 "perinuclear region of
            cytoplasm" evidence=IDA] [GO:0005622 "intracellular" evidence=TAS]
            [GO:0036021 "endolysosome lumen" evidence=TAS] [GO:0045087 "innate
            immune response" evidence=TAS] [GO:0008233 "peptidase activity"
            evidence=IDA] [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=IDA] [GO:0005615 "extracellular space" evidence=ISS]
            [GO:0005730 "nucleolus" evidence=IDA] [GO:0043231 "intracellular
            membrane-bounded organelle" evidence=IDA] InterPro:IPR000668
            InterPro:IPR012599 InterPro:IPR013128 InterPro:IPR015643
            Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 Reactome:REACT_118779 GO:GO:0005739
            GO:GO:0042470 GO:GO:0048471 Reactome:REACT_6900 GO:GO:0005615
            GO:GO:0009612 GO:GO:0009611 GO:GO:0005730 GO:GO:0042981
            GO:GO:0009897 GO:GO:0045471 GO:GO:0016324 GO:GO:0009749
            GO:GO:0006914 GO:GO:0043434 eggNOG:COG4870 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0045087
            GO:GO:0050790 GO:GO:0042383 GO:GO:0014070 GO:GO:0042277
            GO:GO:0060548 GO:GO:0005901 GO:GO:0014075 GO:GO:0004197
            GO:GO:0070670 EMBL:CH471157 GO:GO:0007519 MEROPS:C01.060 CTD:1508
            HOGENOM:HOG000241341 HOVERGEN:HBG003480 KO:K01363 OMA:GYPSGAW
            OrthoDB:EOG4K6G4C BRENDA:3.4.22.1 GO:GO:0097067
            PANTHER:PTHR12411:SF16 EMBL:M14221 EMBL:L16510 EMBL:AK092070
            EMBL:AK075393 EMBL:BC010240 EMBL:BC095408 EMBL:M13230
            IPI:IPI00295741 PIR:A26498 RefSeq:NP_001899.1 RefSeq:NP_680090.1
            RefSeq:NP_680091.1 RefSeq:NP_680092.1 RefSeq:NP_680093.1
            UniGene:Hs.520898 PDB:1CSB PDB:1GMY PDB:1HUC PDB:1PBH PDB:2IPP
            PDB:2PBH PDB:3AI8 PDB:3CBJ PDB:3CBK PDB:3K9M PDB:3PBH PDBsum:1CSB
            PDBsum:1GMY PDBsum:1HUC PDBsum:1PBH PDBsum:2IPP PDBsum:2PBH
            PDBsum:3AI8 PDBsum:3CBJ PDBsum:3CBK PDBsum:3K9M PDBsum:3PBH
            ProteinModelPortal:P07858 SMR:P07858 DIP:DIP-42785N IntAct:P07858
            MINT:MINT-1397666 STRING:P07858 PhosphoSite:P07858 DMDM:68067549
            SWISS-2DPAGE:P07858 UCD-2DPAGE:P07858 PaxDb:P07858
            PeptideAtlas:P07858 PRIDE:P07858 DNASU:1508 Ensembl:ENST00000345125
            Ensembl:ENST00000353047 Ensembl:ENST00000434271
            Ensembl:ENST00000453527 Ensembl:ENST00000530640
            Ensembl:ENST00000531089 Ensembl:ENST00000533455
            Ensembl:ENST00000534510 GeneID:1508 KEGG:hsa:1508 UCSC:uc003wum.3
            GeneCards:GC08M011700 H-InvDB:HIX0007320 HGNC:HGNC:2527
            HPA:CAB000457 HPA:HPA018156 MIM:116810 neXtProt:NX_P07858
            PharmGKB:PA27027 InParanoid:P07858 PhylomeDB:P07858
            BindingDB:P07858 ChEMBL:CHEMBL4072 ChiTaRS:CTSB
            EvolutionaryTrace:P07858 GenomeRNAi:1508 NextBio:6235
            PMAP-CutDB:P07858 ArrayExpress:P07858 Bgee:P07858 CleanEx:HS_CTSB
            Genevestigator:P07858 GermOnline:ENSG00000164733 GO:GO:0036021
            Uniprot:P07858
        Length = 339

 Score = 143 (55.4 bits), Expect = 2.0e-16, Sum P(2) = 2.0e-16
 Identities = 48/147 (32%), Positives = 78/147 (53%)

Query:    83 NKEVTSYWLGLNEF-ADMSH-EEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVD-- 138
             NK  T++  G N +  DMS+ +     +LG  P+ P R      F+  D+K LP S D  
Sbjct:    35 NKRNTTWQAGHNFYNVDMSYLKRLCGTFLG-GPKPPQR----VMFT-EDLK-LPASFDAR 87

Query:   139 --WRKKGAVTPVKNQGSCGSCWAFSTVAAV-EGINQIVSGNLT-SLSEQELIDCDTSF-N 193
               W +   +  +++QGSCGSCWAF  V A+ + I    + +++  +S ++L+ C  S   
Sbjct:    88 EQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGSMCG 147

Query:   194 NGCNGGLMDYAFKYI----VASGGLHK 216
             +GCNGG    A+ +     + SGGL++
Sbjct:   148 DGCNGGYPAEAWNFWTRKGLVSGGLYE 174

 Score = 126 (49.4 bits), Expect = 2.0e-16, Sum P(2) = 2.0e-16
 Identities = 33/110 (30%), Positives = 52/110 (47%)

Query:   243 GYQDVP-ENDEQSLLKAL-AHQPVSVAIEASGTDFQFYSGGVFTGPCGAELD-HGVAAVG 299
             GY      N E+ ++  +  + PV  A     +DF  Y  GV+    G  +  H +  +G
Sbjct:   226 GYNSYSVSNSEKDIMAEIYKNGPVEGAFSVY-SDFLLYKSGVYQHVTGEMMGGHAIRILG 284

Query:   300 YGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINK--MASIP 347
             +G   G+ Y +V NSW   WG+ G+ ++ R  G+    CGI    +A IP
Sbjct:   285 WGVENGTPYWLVANSWNTDWGDNGFFKILR--GQDH--CGIESEVVAGIP 330


>UNIPROTKB|P07688
            symbol:CTSB "Cathepsin B" species:9913 "Bos taurus"
            [GO:0042470 "melanosome" evidence=IEA] [GO:0005764 "lysosome"
            evidence=IEA] [GO:0005615 "extracellular space" evidence=IEA]
            [GO:0097067 "cellular response to thyroid hormone stimulus"
            evidence=IEA] [GO:0005739 "mitochondrion" evidence=IEA] [GO:0005730
            "nucleolus" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0050790 "regulation of catalytic activity" evidence=IEA]
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR012599 InterPro:IPR013128
            InterPro:IPR015643 Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0005739 GO:GO:0042470
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0050790 GO:GO:0005764 GO:GO:0004197 EMBL:L06075 EMBL:M64620
            EMBL:U16336 EMBL:U16337 EMBL:U16338 EMBL:U16339 EMBL:U16341
            EMBL:U16342 EMBL:U16343 EMBL:BC102997 IPI:IPI00692061 PIR:S38328
            RefSeq:NP_776456.1 UniGene:Bt.393 PDB:1ITO PDB:1QDQ PDB:1SP4
            PDB:2DC6 PDB:2DC7 PDB:2DC8 PDB:2DC9 PDB:2DCA PDB:2DCB PDB:2DCC
            PDB:2DCD PDBsum:1ITO PDBsum:1QDQ PDBsum:1SP4 PDBsum:2DC6
            PDBsum:2DC7 PDBsum:2DC8 PDBsum:2DC9 PDBsum:2DCA PDBsum:2DCB
            PDBsum:2DCC PDBsum:2DCD ProteinModelPortal:P07688 SMR:P07688
            STRING:P07688 MEROPS:C01.060 PRIDE:P07688
            Ensembl:ENSBTAT00000036795 GeneID:281105 KEGG:bta:281105 CTD:1508
            eggNOG:NOG315657 GeneTree:ENSGT00560000076599 HOGENOM:HOG000241341
            HOVERGEN:HBG003480 InParanoid:P07688 KO:K01363 OMA:GYPSGAW
            OrthoDB:EOG4K6G4C BRENDA:3.4.22.1 BindingDB:P07688
            ChEMBL:CHEMBL2323 EvolutionaryTrace:P07688 NextBio:20805177
            ArrayExpress:P07688 GO:GO:0097067 PANTHER:PTHR12411:SF16
            Uniprot:P07688
        Length = 335

 Score = 141 (54.7 bits), Expect = 2.0e-16, Sum P(2) = 2.0e-16
 Identities = 48/146 (32%), Positives = 71/146 (48%)

Query:    83 NKEVTSYWLGLNEF-ADMSH-EEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVD-- 138
             NK+ T++  G N +  D+S+ ++     LG  P+ P R   +A     DV  LP+S D  
Sbjct:    35 NKQNTTWKAGHNFYNVDLSYVKKLCGAILG-GPKLPQRDAFAA-----DV-VLPESFDAR 87

Query:   139 --WRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSG---NLTSLSEQELIDCDTSFN 193
               W     +  +++QGSCGSCWAF  V A+     I S    N+   +E  L  C     
Sbjct:    88 EQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHSNGRVNVEVSAEDMLTCCGGECG 147

Query:   194 NGCNGGLMDYAFKYI----VASGGLH 215
             +GCNGG    A+ +     + SGGL+
Sbjct:   148 DGCNGGFPSGAWNFWTKKGLVSGGLY 173

 Score = 128 (50.1 bits), Expect = 2.0e-16, Sum P(2) = 2.0e-16
 Identities = 36/126 (28%), Positives = 58/126 (46%)

Query:   225 EEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFT 284
             E G     KE+      S Y  V  N+++ + +   + PV  A     +DF  Y  GV+ 
Sbjct:   212 EPGYSPSYKEDKHF-GCSSYS-VANNEKEIMAEIYKNGPVEGAFSVY-SDFLLYKSGVYQ 268

Query:   285 GPCGAELD-HGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINK- 342
                G  +  H +  +G+G   G+ Y +V NSW   WG+ G+ ++ R  G+    CGI   
Sbjct:   269 HVSGEIMGGHAIRILGWGVENGTPYWLVGNSWNTDWGDNGFFKILR--GQDH--CGIESE 324

Query:   343 -MASIP 347
              +A +P
Sbjct:   325 IVAGMP 330


>DICTYBASE|DDB_G0286055
            symbol:DDB_G0286055 "peptidase C1A family protein"
            species:44689 "Dictyostelium discoideum" [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            SMART:SM00645 dictyBase:DDB_G0286055 Pfam:PF00188 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 PANTHER:PTHR12411 EMBL:AAFI02000085
            Gene3D:3.40.33.10 InterPro:IPR001283 InterPro:IPR014044
            PRINTS:PR00837 SMART:SM00198 SUPFAM:SSF55797
            ProtClustDB:CLSZ2429919 RefSeq:XP_637918.1
            ProteinModelPortal:Q54MB6 EnsemblProtists:DDB0186794 GeneID:8625429
            KEGG:ddi:DDB_G0286055 InParanoid:Q54MB6 OMA:GENGFAR Uniprot:Q54MB6
        Length = 435

 Score = 221 (82.9 bits), Expect = 3.0e-16, P = 3.0e-16
 Identities = 77/273 (28%), Positives = 122/273 (44%)

Query:    95 EFADMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSC 154
             +   MS+EE+ NK + L  +   RR    +  Y        S DWR  G V   K+  +C
Sbjct:   173 DLTTMSYEEWPNKIVNLNQRL-VRRDD--DHIYTASVPTDGSFDWRDNGVVGFPKDSSNC 229

Query:   155 GSCWAFSTVAAVEGINQIVSGNLTSLSEQELIDCD-------TSFNNG----CN--GGLM 201
              S WAF+     E  + + + +    S Q+LIDC        ++F+ G    C+   G +
Sbjct:   230 ASGWAFTAAGIFESRSAMRTRHRYDYSAQQLIDCINVCIIIFSNFSIGNYTKCSRFSGEL 289

Query:   202 DYAFKYIVASGGLHKEEDYPYLMEEGT-CEDKKEEMEVVTISGYQDVPENDEQSLLKALA 260
             + A  Y  A G L     YPY+      C   +  + V    G  +  +    S+++   
Sbjct:   290 NKALMYAQAYG-LQATSTYPYVGASSIGCSYNQSSIAVE--GGDVEYSQVGRDSIVEKCR 346

Query:   261 HQ-PVSVAIEASGTDFQFYSGGVF----TGPCGAELDHGVAAVGYGKSKGSDYIIVKNSW 315
              Q PV V I  +  +F +Y+GG+F    T    A ++H V  VGY +    +Y I+KN++
Sbjct:   347 KQGPVGVGIYVTN-EFLYYAGGIFECNNTLIDNANINHNVLLVGYNEK--DNYYIIKNNF 403

Query:   316 GPKWGERGYIRMKRNTGKPEGLCGINKMASIPL 348
             G  WGE G+ R+  +  K + L   N   SI +
Sbjct:   404 GRTWGENGFARITADVNK-DCLIAKNPAYSIQI 435


>UNIPROTKB|Q6IN22
            symbol:Ctsb "Cathepsin B" species:10116 "Rattus norvegicus"
            [GO:0050790 "regulation of catalytic activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR012599 InterPro:IPR013128
            InterPro:IPR015643 Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 RGD:621509 GO:GO:0005739
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0050790 GO:GO:0005764 GO:GO:0004197 CTD:1508
            GeneTree:ENSGT00560000076599 HOVERGEN:HBG003480 KO:K01363
            OMA:GYPSGAW GO:GO:0097067 PANTHER:PTHR12411:SF16 UniGene:Rn.100909
            EMBL:CH474023 HSSP:P00785 EMBL:BC072490 IPI:IPI00562653
            RefSeq:NP_072119.2 SMR:Q6IN22 IntAct:Q6IN22 STRING:Q6IN22
            Ensembl:ENSRNOT00000014177 GeneID:64529 KEGG:rno:64529
            InParanoid:Q6IN22 NextBio:613362 Genevestigator:Q6IN22
            Uniprot:Q6IN22
        Length = 339

 Score = 143 (55.4 bits), Expect = 3.3e-16, Sum P(2) = 3.3e-16
 Identities = 45/143 (31%), Positives = 72/143 (50%)

Query:    79 IDQRNKEVTSYWLGLNEF-ADMSH-EEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKS 136
             I+  NK+ T++  G N +  D+S+ ++     LG  P+ P R      FS  D+  LP+S
Sbjct:    31 INYINKQNTTWQAGRNFYNVDISYLKKLCGTVLG-GPKLPER----VGFS-EDIN-LPES 83

Query:   137 VD----WRKKGAVTPVKNQGSCGSCWAFSTVAAV-EGINQIVSGNLT-SLSEQELIDC-D 189
              D    W     +  +++QGSCGSCWAF  V A+ + I    +G +   +S ++L+ C  
Sbjct:    84 FDAREQWSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHTNGRVNVEVSAEDLLTCCG 143

Query:   190 TSFNNGCNGGLMDYAFKYIVASG 212
                 +GCNGG    A+ +    G
Sbjct:   144 IQCGDGCNGGYPSGAWNFWTRKG 166

 Score = 124 (48.7 bits), Expect = 3.3e-16, Sum P(2) = 3.3e-16
 Identities = 36/128 (28%), Positives = 58/128 (45%)

Query:   223 LMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGV 282
             + E G     KE+      S Y  V +++++ + +   + PV  A     +DF  Y  GV
Sbjct:   210 MCEAGYSTSYKEDKHYGYTS-YS-VSDSEKEIMAEIYKNGPVEGAFTVF-SDFLTYKSGV 266

Query:   283 FTGPCGAELD-HGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGIN 341
             +    G  +  H +  +G+G   G  Y +V NSW   WG+ G+ ++ R     E  CGI 
Sbjct:   267 YKHEAGDVMGGHAIRILGWGIENGVPYWLVANSWNVDWGDNGFFKILRG----ENHCGIE 322

Query:   342 K--MASIP 347
                +A IP
Sbjct:   323 SEIVAGIP 330


>RGD|621509
            symbol:Ctsb "cathepsin B" species:10116 "Rattus norvegicus"
            [GO:0004175 "endopeptidase activity" evidence=IMP;IDA] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=IEA;ISO;IDA]
            [GO:0005515 "protein binding" evidence=IPI] [GO:0005576
            "extracellular region" evidence=IDA] [GO:0005615 "extracellular
            space" evidence=IDA] [GO:0005730 "nucleolus" evidence=IEA;ISO]
            [GO:0005737 "cytoplasm" evidence=IDA] [GO:0005739 "mitochondrion"
            evidence=IEA;ISO;IDA] [GO:0005764 "lysosome" evidence=IEA;ISO;IDA]
            [GO:0006508 "proteolysis" evidence=IEA;IEP;ISO;IMP;IDA;TAS]
            [GO:0006914 "autophagy" evidence=IEP] [GO:0006950 "response to
            stress" evidence=IEP] [GO:0007283 "spermatogenesis" evidence=IEP]
            [GO:0007519 "skeletal muscle tissue development" evidence=IEP]
            [GO:0008233 "peptidase activity" evidence=ISO] [GO:0008234
            "cysteine-type peptidase activity" evidence=ISO] [GO:0009611
            "response to wounding" evidence=IEP] [GO:0009612 "response to
            mechanical stimulus" evidence=IEP] [GO:0009749 "response to glucose
            stimulus" evidence=IEP] [GO:0009897 "external side of plasma
            membrane" evidence=IDA] [GO:0009986 "cell surface" evidence=IDA]
            [GO:0014070 "response to organic cyclic compound" evidence=IEP]
            [GO:0014075 "response to amine stimulus" evidence=IEP] [GO:0016324
            "apical plasma membrane" evidence=IDA] [GO:0030984 "kininogen
            binding" evidence=IPI] [GO:0032403 "protein complex binding"
            evidence=IPI] [GO:0034097 "response to cytokine stimulus"
            evidence=IEP] [GO:0042277 "peptide binding" evidence=IDA]
            [GO:0042383 "sarcolemma" evidence=IDA] [GO:0042470 "melanosome"
            evidence=IEA] [GO:0043231 "intracellular membrane-bounded
            organelle" evidence=ISO] [GO:0043434 "response to peptide hormone
            stimulus" evidence=IEP] [GO:0043621 "protein self-association"
            evidence=IDA] [GO:0045471 "response to ethanol" evidence=IEP]
            [GO:0048471 "perinuclear region of cytoplasm" evidence=ISO;IDA]
            [GO:0050790 "regulation of catalytic activity" evidence=IEA]
            [GO:0060548 "negative regulation of cell death" evidence=IMP]
            [GO:0070670 "response to interleukin-4" evidence=IEP] [GO:0097067
            "cellular response to thyroid hormone stimulus" evidence=IEA;ISO]
            [GO:0005901 "caveola" evidence=IDA] InterPro:IPR000668
            InterPro:IPR012599 InterPro:IPR013128 InterPro:IPR015643
            Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 RGD:621509 GO:GO:0005739 GO:GO:0042470
            GO:GO:0005615 GO:GO:0009612 GO:GO:0009611 GO:GO:0009897
            GO:GO:0045471 GO:GO:0016324 GO:GO:0009749 GO:GO:0006914
            GO:GO:0043434 eggNOG:COG4870 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0050790 GO:GO:0007283
            GO:GO:0005764 GO:GO:0042383 GO:GO:0043621 GO:GO:0014070
            GO:GO:0042277 GO:GO:0060548 GO:GO:0014075 GO:GO:0004197
            GO:GO:0070670 GO:GO:0007519 MEROPS:C01.060 HOGENOM:HOG000241341
            HOVERGEN:HBG003480 OrthoDB:EOG4K6G4C BRENDA:3.4.22.1 GO:GO:0097067
            PANTHER:PTHR12411:SF16 EMBL:X82396 EMBL:M11305 IPI:IPI00212811
            PIR:S51041 UniGene:Rn.100909 PDB:1CPJ PDB:1CTE PDB:1MIR PDB:1THE
            PDBsum:1CPJ PDBsum:1CTE PDBsum:1MIR PDBsum:1THE
            ProteinModelPortal:P00787 SMR:P00787 STRING:P00787 PRIDE:P00787
            UCSC:RGD:621509 InParanoid:P00787 SABIO-RK:P00787 BindingDB:P00787
            ChEMBL:CHEMBL2602 EvolutionaryTrace:P00787 ArrayExpress:P00787
            Genevestigator:P00787 GermOnline:ENSRNOG00000010331 Uniprot:P00787
        Length = 339

 Score = 142 (55.0 bits), Expect = 4.3e-16, Sum P(2) = 4.3e-16
 Identities = 45/143 (31%), Positives = 71/143 (49%)

Query:    79 IDQRNKEVTSYWLGLNEF-ADMSH-EEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKS 136
             I+  NK+ T++  G N +  D+S+ ++     LG  P  P R      FS  D+  LP+S
Sbjct:    31 INYINKQNTTWQAGRNFYNVDISYLKKLCGTVLG-GPNLPER----VGFS-EDIN-LPES 83

Query:   137 VD----WRKKGAVTPVKNQGSCGSCWAFSTVAAV-EGINQIVSGNLT-SLSEQELIDC-D 189
              D    W     +  +++QGSCGSCWAF  V A+ + I    +G +   +S ++L+ C  
Sbjct:    84 FDAREQWSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHTNGRVNVEVSAEDLLTCCG 143

Query:   190 TSFNNGCNGGLMDYAFKYIVASG 212
                 +GCNGG    A+ +    G
Sbjct:   144 IQCGDGCNGGYPSGAWNFWTRKG 166

 Score = 124 (48.7 bits), Expect = 4.3e-16, Sum P(2) = 4.3e-16
 Identities = 36/128 (28%), Positives = 58/128 (45%)

Query:   223 LMEEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGV 282
             + E G     KE+      S Y  V +++++ + +   + PV  A     +DF  Y  GV
Sbjct:   210 MCEAGYSTSYKEDKHYGYTS-YS-VSDSEKEIMAEIYKNGPVEGAFTVF-SDFLTYKSGV 266

Query:   283 FTGPCGAELD-HGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGIN 341
             +    G  +  H +  +G+G   G  Y +V NSW   WG+ G+ ++ R     E  CGI 
Sbjct:   267 YKHEAGDVMGGHAIRILGWGIENGVPYWLVANSWNVDWGDNGFFKILRG----ENHCGIE 322

Query:   342 K--MASIP 347
                +A IP
Sbjct:   323 SEIVAGIP 330


>UNIPROTKB|F1PIF2 [details] [associations]
            symbol:CTSZ "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0060441 "epithelial tube branching involved
            in lung morphogenesis" evidence=IEA] [GO:0005783 "endoplasmic
            reticulum" evidence=IEA] [GO:0005615 "extracellular space"
            evidence=IEA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 GO:GO:0005783 GO:GO:0005615 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411 PROSITE:PS00640
            GO:GO:0060441 GeneTree:ENSGT00560000076599 OMA:QCGTCTE
            EMBL:AAEX03014054 Ensembl:ENSCAFT00000019357 Uniprot:F1PIF2
        Length = 261

 Score = 202 (76.2 bits), Expect = 4.5e-16, P = 4.5e-16
 Identities = 62/196 (31%), Positives = 99/196 (50%)

Query:   154 CGSCWAF-STVAAVEGINQIVSGNLTS--LSEQELIDCDTSFNNGCNGG----LMDYAFK 206
             CGSCWA  ST A  + IN    G   S  LS Q ++DC  +    C GG    +  YA +
Sbjct:    47 CGSCWAHGSTSAMADRINIKRKGAWPSTLLSVQHVLDCANA--GSCEGGNDLPVWSYAHE 104

Query:   207 YIVASGGLH----KEEDYPYLMEEGTCEDKKE--EMEVVTISGYQDVPE--NDEQSLLKA 258
             + +     +    K+++     + GTC + KE   ++  T+    D       E+ + + 
Sbjct:   105 HGIPDETCNNYQAKDQECNKFNQCGTCTEFKECHAIQNYTLWRVGDYGSLSGREKMMAEI 164

Query:   259 LAHQPVSVAIEASGTDFQFYSGGVFTG-PCGAELDHGVAAVGYGKSKGSDYIIVKNSWGP 317
              A+ P+S  I A+      Y+GG+       A ++H ++ VG+G S G++Y IV+NSWG 
Sbjct:   165 YANGPISCGIMATEKMVN-YTGGIHAEYQEQAYINHVISVVGWGVSDGTEYWIVRNSWGE 223

Query:   318 KWGERGYIRMKRNTGK 333
              WGERG++R+  +T K
Sbjct:   224 PWGERGWMRIVTSTYK 239


>DICTYBASE|DDB_G0280187 [details] [associations]
            symbol:DDB_G0280187 "cathepsin Z-like protein"
            species:44689 "Dictyostelium discoideum" [GO:0005615 "extracellular
            space" evidence=IDA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0044351
            "macropinocytosis" evidence=RCA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            dictyBase:DDB_G0280187 GO:GO:0005615 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411 PROSITE:PS00640
            EMBL:AAFI02000035 KO:K08568 RefSeq:XP_641294.1
            ProteinModelPortal:Q54VR1 MEROPS:C01.A61 PRIDE:Q54VR1
            EnsemblProtists:DDB0233838 GeneID:8622427 KEGG:ddi:DDB_G0280187
            InParanoid:Q54VR1 OMA:VWKVGDY Uniprot:Q54VR1
        Length = 291

 Score = 144 (55.7 bits), Expect = 7.4e-16, Sum P(2) = 7.4e-16
 Identities = 28/83 (33%), Positives = 51/83 (61%)

Query:   250 NDEQSLLKAL-AHQPVSVAIEASGTDFQFYSGGVFTGPCGA--ELDHGVAAVGYGKSKGS 306
             N   ++++ + A  P++  +E +   F+ Y+ GVFT   G+  E++H ++ +G+G   G 
Sbjct:   190 NGSVAMMQEIFARGPIACGMEVTDA-FESYTSGVFTSSVGSTGEINHEISIIGWGTENGV 248

Query:   307 DYIIVKNSWGPKWGERGYIRMKR 329
             DY I +NSWG  +GE G+ R++R
Sbjct:   249 DYWIGRNSWGTYFGELGFFRIQR 271

 Score = 116 (45.9 bits), Expect = 7.4e-16, Sum P(2) = 7.4e-16
 Identities = 37/112 (33%), Positives = 52/112 (46%)

Query:   127 YRDVKALPKSVDWRK-KGA--VTPVKNQGS---CGSCWAFSTVAAV-EGINQIVSGNLTS 179
             Y D   LP   DWR   G+  +T  +NQ     CGSCWA  T +A+ + I     G    
Sbjct:    43 YIDEDTLPTQYDWRNISGSSYITITRNQHLPQYCGSCWAHGTTSALGDRIKIGRKGTFPE 102

Query:   180 --LSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTC 229
               L+ Q L++C    +N C+GG    A+ Y+ A G +  E   PY   +  C
Sbjct:   103 VVLAPQVLLNC-AGPDNTCDGGDPTEAYAYMAAKG-ITDETCAPYEAIDNEC 152


>UNIPROTKB|P43233 [details] [associations]
            symbol:CTSB "Cathepsin B" species:9031 "Gallus gallus"
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=IEA]
            [GO:0050790 "regulation of catalytic activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0005764 "lysosome"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR012599
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 Pfam:PF08127
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 eggNOG:COG4870
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0050790 GO:GO:0005764 GO:GO:0004197 MEROPS:C01.060 CTD:1508
            HOGENOM:HOG000241341 HOVERGEN:HBG003480 KO:K01363 OrthoDB:EOG4K6G4C
            PANTHER:PTHR12411:SF16 EMBL:U18083 IPI:IPI00573387 PIR:S58770
            RefSeq:NP_990702.1 UniGene:Gga.3854 ProteinModelPortal:P43233
            SMR:P43233 STRING:P43233 PRIDE:P43233 GeneID:396329 KEGG:gga:396329
            InParanoid:P43233 NextBio:20816377 Uniprot:P43233
        Length = 340

 Score = 134 (52.2 bits), Expect = 2.5e-15, Sum P(2) = 2.5e-15
 Identities = 47/150 (31%), Positives = 78/150 (52%)

Query:    79 IDQRNKEVTSYWLGLN-EFADMSH-EEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKS 136
             ++  NK  T+   G N    DMS+ ++    +LG  P+ P R     +F+  D+  LP +
Sbjct:    31 VNHINKLNTTGRAGHNFHNTDMSYVKKLCGTFLG-GPKAPER----VDFA-EDMD-LPDT 83

Query:   137 VDWRKKG----AVTPVKNQGSCGSCWAFSTVAAV-EGINQIVSGNLT-SLSEQELIDC-D 189
              D RK+      ++ +++QGSCGSCWAF  V A+ + I    +  ++  +S ++L+ C  
Sbjct:    84 FDTRKQWPNCPTISEIRDQGSCGSCWAFGAVEAISDRICVHTNAKVSVEVSAEDLLSCCG 143

Query:   190 TSFNNGCNGGLMDYAFKYI----VASGGLH 215
                  GCNGG    A++Y     + SGGL+
Sbjct:   144 FECGMGCNGGYPSGAWRYWTERGLVSGGLY 173

 Score = 126 (49.4 bits), Expect = 2.5e-15, Sum P(2) = 2.5e-15
 Identities = 35/126 (27%), Positives = 57/126 (45%)

Query:   225 EEGTCEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFT 284
             E G     KE+     I+ Y  VP ++++ + +   + PV  A      DF  Y  GV+ 
Sbjct:   213 EPGYSPSYKEDKHY-GITSY-GVPRSEKEIMAEIYKNGPVEGAFIVY-EDFLMYKSGVYQ 269

Query:   285 GPCGAELD-HGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINK- 342
                G ++  H +  +G+G   G+ Y +  NSW   WG  G+ ++ R     E  CGI   
Sbjct:   270 HVSGEQVGGHAIRILGWGVENGTPYWLAANSWNTDWGITGFFKILRG----EDHCGIESE 325

Query:   343 -MASIP 347
              +A +P
Sbjct:   326 IVAGVP 331


>DICTYBASE|DDB_G0292462 [details] [associations]
            symbol:DDB_G0292462 species:44689 "Dictyostelium
            discoideum" [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 dictyBase:DDB_G0292462 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411 PROSITE:PS00640
            EMBL:AAFI02000190 RefSeq:XP_629634.1 MEROPS:C01.A56
            EnsemblProtists:DDB0184413 GeneID:8628698 KEGG:ddi:DDB_G0292462
            InParanoid:Q54D62 OMA:NTQVESH Uniprot:Q54D62
        Length = 323

 Score = 205 (77.2 bits), Expect = 9.0e-15, P = 9.0e-15
 Identities = 68/227 (29%), Positives = 106/227 (46%)

Query:   145 VTPVKNQGSCGSCWAFSTVAAVEGINQIVSG-NLTSL-SEQELIDCDTSF--------NN 194
             ++PV+ Q SCGSCWA  T   +     I S  N+  L S Q L+DCD S         NN
Sbjct:    60 MSPVREQQSCGSCWAQVTSGILADRMCIESDKNIKMLLSPQYLMDCDGSCVSDGVSGCNN 119

Query:   195 GCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEG-----TCEDKK--EEMEVVTISGYQDV 247
             GC GG +  A   ++ + G+  +E   Y   +      TC+D        +   +  +  
Sbjct:   120 GCKGGFVGLALTRLI-NEGIVSDECLSYQASKDSSCPTTCDDGSPISNTTIYKATSCRAF 178

Query:   248 PE-NDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELD-HGVAAVGYGK-SK 304
             P   D Q   + + + PV +A     +DF+ +   V+      +++ H V  VG+G  S 
Sbjct:   179 PTVQDAQ--YEIMTNGPV-IATFMLYSDFKPHKWDVYIKSSNTQVESHAVRVVGWGTTSD 235

Query:   305 GSDYIIVKNSWGPKWGERGYIRMKRNTGKP---EGLCGINK-MASIP 347
             G DY I  NSWG  WG++GY +++R + +    EG   +    AS+P
Sbjct:   236 GVDYWIAANSWGTGWGDKGYFKIRRGSDEAAFEEGFITVTADTASVP 282


>MGI|MGI:1891190 [details] [associations]
            symbol:Ctsz "cathepsin Z" species:10090 "Mus musculus"
            [GO:0005615 "extracellular space" evidence=ISO] [GO:0005764
            "lysosome" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0008233 "peptidase activity" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            MGI:MGI:1891190 GO:GO:0005783 GO:GO:0005615 eggNOG:COG4870
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764
            GO:GO:0060441 GeneTree:ENSGT00560000076599 MEROPS:C01.013 CTD:1522
            HOVERGEN:HBG004456 KO:K08568 OrthoDB:EOG42Z4QN OMA:QCGTCTE
            ChiTaRS:CTSZ EMBL:AJ242663 EMBL:AF136277 EMBL:AF136278
            EMBL:BC008619 IPI:IPI00986833 RefSeq:NP_071720.1 UniGene:Mm.156919
            ProteinModelPortal:Q9WUU7 SMR:Q9WUU7 IntAct:Q9WUU7 STRING:Q9WUU7
            PaxDb:Q9WUU7 PRIDE:Q9WUU7 Ensembl:ENSMUST00000016400 GeneID:64138
            KEGG:mmu:64138 InParanoid:Q9WUU7 NextBio:319927 Bgee:Q9WUU7
            CleanEx:MM_CTSZ Genevestigator:Q9WUU7 GermOnline:ENSMUSG00000016256
            Uniprot:Q9WUU7
        Length = 306

 Score = 203 (76.5 bits), Expect = 1.0e-14, P = 1.0e-14
 Identities = 63/197 (31%), Positives = 99/197 (50%)

Query:   154 CGSCWAF-STVAAVEGINQIVSGNLTS--LSEQELIDCDTSFNNGCNGG----LMDYAFK 206
             CGSCWA  ST A  + IN    G   S  LS Q +IDC  +    C GG    + +YA K
Sbjct:    91 CGSCWAHGSTSAMADRINIKRKGAWPSILLSVQNVIDCGNA--GSCEGGNDLPVWEYAHK 148

Query:   207 YIVASGGLH----KEEDYPYLMEEGTCEDKKE--EMEVVTISGYQDVPE--NDEQSLLKA 258
             + +     +    K++D     + GTC + KE   ++  T+    D       E+ + + 
Sbjct:   149 HGIPDETCNNYQAKDQDCDKFNQCGTCTEFKECHTIQNYTLWRVGDYGSLSGREKMMAEI 208

Query:   259 LAHQPVSVAIEASGTDFQFYSGGVFTGPCG-AELDHGVAAVGYGKSK-GSDYIIVKNSWG 316
              A+ P+S  I A+      Y+GG++      A ++H ++  G+G S  G +Y IV+NSWG
Sbjct:   209 YANGPISCGIMATEM-MSNYTGGIYAEHQDQAVINHIISVAGWGVSNDGIEYWIVRNSWG 267

Query:   317 PKWGERGYIRMKRNTGK 333
               WGE+G++R+  +T K
Sbjct:   268 EPWGEKGWMRIVTSTYK 284

 Score = 118 (46.6 bits), Expect = 0.00017, P = 0.00017
 Identities = 44/134 (32%), Positives = 59/134 (44%)

Query:   118 RRQPSAEFSYRDVKALPKSVDWRKKGAV---TPVKNQGS---CGSCWAF-STVAAVEGIN 170
             RR       Y     LPK+ DWR    V   +  +NQ     CGSCWA  ST A  + IN
Sbjct:    49 RRTYPRPHEYLSPADLPKNWDWRNVNGVNYASVTRNQHIPQYCGSCWAHGSTSAMADRIN 108

Query:   171 QIVSGNLTS--LSEQELIDCDTSFNNGCNGG----LMDYAFKYIVASGGLH----KEEDY 220
                 G   S  LS Q +IDC  +    C GG    + +YA K+ +     +    K++D 
Sbjct:   109 IKRKGAWPSILLSVQNVIDCGNA--GSCEGGNDLPVWEYAHKHGIPDETCNNYQAKDQDC 166

Query:   221 PYLMEEGTCEDKKE 234
                 + GTC + KE
Sbjct:   167 DKFNQCGTCTEFKE 180


>WB|WBGene00000788 [details] [associations]
            symbol:cpz-1 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0016021 "integral to
            membrane" evidence=IEA] [GO:0009792 "embryo development ending in
            birth or egg hatching" evidence=IMP] [GO:0040007 "growth"
            evidence=IMP] [GO:0002119 "nematode larval development"
            evidence=IMP] [GO:0010171 "body morphogenesis" evidence=IMP]
            [GO:0018996 "molting cycle, collagen and cuticulin-based cuticle"
            evidence=IMP] [GO:0031012 "extracellular matrix" evidence=IDA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 GO:GO:0009792 GO:GO:0040007 GO:GO:0002119
            GO:GO:0018996 GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661
            PANTHER:PTHR12411 PROSITE:PS00640 GO:GO:0010171 GO:GO:0031012
            GeneTree:ENSGT00560000076599 KO:K08568 OMA:QCGTCTE EMBL:FO081275
            EMBL:BK001409 PIR:T29872 RefSeq:NP_491023.2 HSSP:Q9UBR2
            ProteinModelPortal:G5EGP8 SMR:G5EGP8 IntAct:G5EGP8 MEROPS:C01.A38
            EnsemblMetazoa:F32B5.8 GeneID:171829 KEGG:cel:CELE_F32B5.8
            CTD:171829 WormBase:F32B5.8 NextBio:872879 Uniprot:G5EGP8
        Length = 306

 Score = 202 (76.2 bits), Expect = 1.4e-14, P = 1.4e-14
 Identities = 70/235 (29%), Positives = 107/235 (45%)

Query:   129 DVKALPKSVDWRKKGAVTPV---KNQGS---CGSCWAF-STVAAVEGINQIVSGNL---T 178
             D + LPK+ DWR    +      +NQ     CGSCWAF +T A  + IN I   N     
Sbjct:    61 DSEDLPKTWDWRDANGINYASADRNQHIPQYCGSCWAFGATSALADRIN-IKRKNAWPQA 119

Query:   179 SLSEQELIDCD---TSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKE- 234
              LS QE+IDC    T    G  GG+  YA ++     G+  E    Y   +G C+     
Sbjct:   120 YLSVQEVIDCSGAGTCVMGGEPGGVYKYAHEH-----GIPHETCNNYQARDGKCDPYNRC 174

Query:   235 ----EMEVVTISGYQ--DVPE----NDEQSLLKALAHQ-PVSVAIEASGTDFQFYSGGVF 283
                   E  +I  Y    V E    +  + +   + H+ P++  I A+   F+ Y+GG++
Sbjct:   175 GSCWPGECFSIKNYTLYKVSEYGTVHGYEKMKAEIYHKGPIACGIAATKA-FETYAGGIY 233

Query:   284 TGPCGAELDHGVAAVGYG--KSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEG 336
                   ++DH ++  G+G     G +Y I +NSWG  WGE G+ ++  +  K  G
Sbjct:   234 KEVTDEDIDHIISVHGWGVDHESGVEYWIGRNSWGEPWGEHGWFKIVTSQYKNAG 288


>UNIPROTKB|H0YDT2 [details] [associations]
            symbol:CTSW "Cathepsin W" species:9606 "Homo sapiens"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR013201 PANTHER:PTHR12411 Pfam:PF08246 SMART:SM00848
            EMBL:AP001201 HGNC:HGNC:2546 Ensembl:ENST00000526034 Bgee:H0YDT2
            Uniprot:H0YDT2
        Length = 211

 Score = 160 (61.4 bits), Expect = 2.2e-14, Sum P(2) = 2.2e-14
 Identities = 41/144 (28%), Positives = 71/144 (49%)

Query:    43 KLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKE-VTSYWLGLNEFADMSH 101
             +L E F+ +  +  ++Y   EE  HR +IF  NL    +  +E + +   G+  F+D++ 
Sbjct:    36 ELKEAFKLFQIQFNRSYLSPEEHAHRLDIFAHNLAQAQRLQEEDLGTAEFGVTPFSDLTE 95

Query:   102 EEFKNKYLGLKPQFPTRRQPSAEF-SYRDVKALPKSVDWRK-KGAVTPVKNQGSCGSCWA 159
             EEF   Y G +           E  S    +++P S DWRK   A++P+K+Q +C  CWA
Sbjct:    96 EEFGQLY-GYRRAAGGVPSMGREIRSEEPEESVPFSCDWRKVASAISPIKDQKNCNCCWA 154

Query:   160 FSTVAAVEGINQIVSGNLTSLSEQ 183
              +    +E + +I   +   +S Q
Sbjct:   155 MAAAGNIETLWRISFWDFVDVSVQ 178

 Score = 47 (21.6 bits), Expect = 2.2e-14, Sum P(2) = 2.2e-14
 Identities = 11/31 (35%), Positives = 15/31 (48%)

Query:   205 FKYIVASGGLHKEEDYPYL--MEEGTCEDKK 233
             F  +   GGL  E+DYP+   +    C  KK
Sbjct:   172 FVDVSVQGGLASEKDYPFQGKVRAHRCHPKK 202


>MGI|MGI:88561 [details] [associations]
            symbol:Ctsb "cathepsin B" species:10090 "Mus musculus"
            [GO:0004175 "endopeptidase activity" evidence=ISO] [GO:0004197
            "cysteine-type endopeptidase activity" evidence=ISO] [GO:0005576
            "extracellular region" evidence=ISO] [GO:0005615 "extracellular
            space" evidence=ISO] [GO:0005737 "cytoplasm" evidence=ISO]
            [GO:0005739 "mitochondrion" evidence=ISO;IDA] [GO:0005764
            "lysosome" evidence=ISO;IDA] [GO:0005901 "caveola" evidence=ISO]
            [GO:0006508 "proteolysis" evidence=ISO] [GO:0008233 "peptidase
            activity" evidence=ISO] [GO:0008234 "cysteine-type peptidase
            activity" evidence=ISO] [GO:0009897 "external side of plasma
            membrane" evidence=ISO] [GO:0009986 "cell surface" evidence=ISO]
            [GO:0016324 "apical plasma membrane" evidence=ISO] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0030984 "kininogen binding"
            evidence=ISO] [GO:0032403 "protein complex binding" evidence=ISO]
            [GO:0042277 "peptide binding" evidence=ISO] [GO:0042383
            "sarcolemma" evidence=ISO] [GO:0043621 "protein self-association"
            evidence=ISO] [GO:0048471 "perinuclear region of cytoplasm"
            evidence=ISO] [GO:0050790 "regulation of catalytic activity"
            evidence=IEA] [GO:0060548 "negative regulation of cell death"
            evidence=ISO] InterPro:IPR000668 InterPro:IPR012599
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 Pfam:PF08127
            PRINTS:PR00705 SMART:SM00645 InterPro:IPR000169 MGI:MGI:88561
            GO:GO:0005739 GO:GO:0042470 GO:GO:0048471 GO:GO:0005615
            GO:GO:0009612 GO:GO:0009611 GO:GO:0009897 GO:GO:0045471
            GO:GO:0016324 GO:GO:0009749 GO:GO:0006914 GO:GO:0043434
            eggNOG:COG4870 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GO:GO:0050790 GO:GO:0005764 GO:GO:0042383 GO:GO:0014070
            GO:GO:0042277 GO:GO:0060548 GO:GO:0005901 GO:GO:0014075
            GO:GO:0004197 GO:GO:0070670 GO:GO:0007519 MEROPS:C01.060 CTD:1508
            GeneTree:ENSGT00560000076599 HOGENOM:HOG000241341
            HOVERGEN:HBG003480 KO:K01363 OMA:GYPSGAW OrthoDB:EOG4K6G4C
            BRENDA:3.4.22.1 GO:GO:0097067 PANTHER:PTHR12411:SF16 ChiTaRS:CTSB
            EMBL:M65270 EMBL:M65263 EMBL:M65264 EMBL:M65265 EMBL:M65266
            EMBL:M65267 EMBL:M65268 EMBL:M65269 EMBL:M14222 EMBL:X54966
            EMBL:S69034 EMBL:AK083393 EMBL:AK147192 EMBL:AK149884 EMBL:AK151790
            EMBL:AK167361 EMBL:BC006656 IPI:IPI00113517 PIR:A38458
            RefSeq:NP_031824.1 UniGene:Mm.236553 UniGene:Mm.489070
            ProteinModelPortal:P10605 SMR:P10605 IntAct:P10605 STRING:P10605
            PhosphoSite:P10605 SWISS-2DPAGE:P10605 PaxDb:P10605 PRIDE:P10605
            Ensembl:ENSMUST00000006235 GeneID:13030 KEGG:mmu:13030
            UCSC:uc007uhh.1 InParanoid:P10605 BioCyc:MetaCyc:MONOMER-14810
            BindingDB:P10605 ChEMBL:CHEMBL5187 NextBio:282900 Bgee:P10605
            CleanEx:MM_CTSB Genevestigator:P10605 GermOnline:ENSMUSG00000021939
            Uniprot:P10605
        Length = 339

 Score = 138 (53.6 bits), Expect = 2.3e-14, Sum P(2) = 2.3e-14
 Identities = 44/143 (30%), Positives = 73/143 (51%)

Query:    79 IDQRNKEVTSYWLGLNEF-ADMSH-EEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKS 136
             I+  NK+ T++  G N +  D+S+ ++     LG  P+ P R      F   D+  LP++
Sbjct:    31 INYINKQNTTWQAGRNFYNVDISYLKKLCGTVLG-GPKLPGR----VAFG-EDID-LPET 83

Query:   137 VDWRKKGAVTP----VKNQGSCGSCWAFSTVAAVEGINQI-VSGNLT-SLSEQELIDC-D 189
              D R++ +  P    +++QGSCGSCWAF  V A+     I  +G +   +S ++L+ C  
Sbjct:    84 FDAREQWSNCPTIGQIRDQGSCGSCWAFGAVEAISDRTCIHTNGRVNVEVSAEDLLTCCG 143

Query:   190 TSFNNGCNGGLMDYAFKYIVASG 212
                 +GCNGG    A+ +    G
Sbjct:   144 IQCGDGCNGGYPSGAWSFWTKKG 166

 Score = 112 (44.5 bits), Expect = 2.3e-14, Sum P(2) = 2.3e-14
 Identities = 31/110 (28%), Positives = 49/110 (44%)

Query:   243 GYQD--VPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELD-HGVAAVG 299
             GY    V  + ++ + +   + PV  A     +DF  Y  GV+    G  +  H +  +G
Sbjct:   226 GYTSYSVSNSVKEIMAEIYKNGPVEGAFTVF-SDFLTYKSGVYKHEAGDMMGGHAIRILG 284

Query:   300 YGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINK--MASIP 347
             +G   G  Y +  NSW   WG+ G+ ++ R     E  CGI    +A IP
Sbjct:   285 WGVENGVPYWLAANSWNLDWGDNGFFKILRG----ENHCGIESEIVAGIP 330


>FB|FBgn0030521 [details] [associations]
            symbol:CtsB1 "Cathepsin B1" species:7227 "Drosophila
            melanogaster" [GO:0004197 "cysteine-type endopeptidase activity"
            evidence=ISS] [GO:0035071 "salivary gland cell autophagic cell
            death" evidence=IEP] [GO:0048102 "autophagic cell death"
            evidence=IEP] [GO:0006508 "proteolysis" evidence=IEA] [GO:0050790
            "regulation of catalytic activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR012599 InterPro:IPR013128 InterPro:IPR015643
            Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705 SMART:SM00645
            InterPro:IPR000169 EMBL:AE014298 GO:GO:0006508 InterPro:IPR025661
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0050790 GO:GO:0035071
            GO:GO:0004197 MEROPS:C01.060 eggNOG:NOG315657
            GeneTree:ENSGT00560000076599 KO:K01363 PANTHER:PTHR12411:SF16
            HSSP:P07688 EMBL:AY060640 RefSeq:NP_572920.1 UniGene:Dm.3926
            SMR:Q9VY87 IntAct:Q9VY87 MINT:MINT-932864 STRING:Q9VY87
            EnsemblMetazoa:FBtr0073838 GeneID:32341 KEGG:dme:Dmel_CG10992
            UCSC:CG10992-RA FlyBase:FBgn0030521 InParanoid:Q9VY87 OMA:TEGHIRR
            OrthoDB:EOG48W9HM ChiTaRS:CG10992 GenomeRNAi:32341 NextBio:778020
            Uniprot:Q9VY87
        Length = 340

 Score = 149 (57.5 bits), Expect = 2.5e-14, Sum P(2) = 2.5e-14
 Identities = 51/167 (30%), Positives = 75/167 (44%)

Query:    82 RNKEVTSYW-LGLNEFADMSHEEFKNKYLGLKPQ-----FPTRRQPSAEFSYRDVKALPK 135
             R+K  T  W +G N  A ++    + + +G+ P       P +R+   +     V  LP+
Sbjct:    33 RSKAKT--WTVGRNFDASVTEGHIR-RLMGVHPDAHKFALPDKREVLGDLYVNSVDELPE 89

Query:   136 SVDWRKKGAVTP----VKNQGSCGSCWAFSTVAAVEGINQIVSGNLTSL--SEQELIDCD 189
               D RK+    P    +++QGSCGSCWAF  V A+     I SG   +   S  +L+ C 
Sbjct:    90 EFDSRKQWPNCPTIGEIRDQGSCGSCWAFGAVEAMSDRVCIHSGGKVNFHFSADDLVSCC 149

Query:   190 TSFNNGCNGGLMDYAFKYI----VASGGLHKEED--YPYLMEEGTCE 230
              +   GCNGG    A+ Y     + SGG +       PY  E   CE
Sbjct:   150 HTCGFGCNGGFPGAAWSYWTRKGIVSGGPYGSNQGCRPY--EISPCE 194

 Score = 99 (39.9 bits), Expect = 2.5e-14, Sum P(2) = 2.5e-14
 Identities = 26/80 (32%), Positives = 37/80 (46%)

Query:   274 DFQFYSGGVFTGPCGAELD-HGVAAVGYGK--SKGSDYIIVKNSWGPKWGERGYIRMKRN 330
             D   Y  GV+    G EL  H +  +G+G    +   Y ++ NSW   WG+ G+ R+ R 
Sbjct:   264 DLILYKDGVYQHEHGKELGGHAIRILGWGVWGEEKIPYWLIGNSWNTDWGDHGFFRILR- 322

Query:   331 TGKPEGLCGINKMASIPLKK 350
              G+    CGI    S  L K
Sbjct:   323 -GQDH--CGIESSISAGLPK 339


>WB|WBGene00000783 [details] [associations]
            symbol:cpr-3 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0009792 "embryo development
            ending in birth or egg hatching" evidence=IMP] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 PRINTS:PR00705
            SMART:SM00645 InterPro:IPR000169 GO:GO:0009792 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639
            GeneTree:ENSGT00560000076599 HOGENOM:HOG000241341 KO:K01363
            PANTHER:PTHR12411:SF16 EMBL:L39890 EMBL:L39925 EMBL:Z81119
            EMBL:Z82057 PIR:T37282 RefSeq:NP_506790.1 UniGene:Cel.23503
            ProteinModelPortal:P43507 SMR:P43507 MEROPS:C01.A33
            EnsemblMetazoa:T10H4.12 GeneID:180033 KEGG:cel:CELE_T10H4.12
            UCSC:T10H4.12 CTD:180033 WormBase:T10H4.12 eggNOG:NOG240190
            InParanoid:P43507 OMA:PVEASYK NextBio:907824 Uniprot:P43507
        Length = 370

 Score = 127 (49.8 bits), Expect = 2.9e-14, Sum P(2) = 2.9e-14
 Identities = 27/77 (35%), Positives = 43/77 (55%)

Query:   274 DFQFYSGGVFTGPCGAELD-HGVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTG 332
             DF  Y  GV+    G  +  H V  +G+G   G DY ++ NSWG  +GE+G+ +++R T 
Sbjct:   264 DFYHYKSGVYHYTSGKLVGGHAVKIIGWGVENGVDYWLIANSWGTSFGEKGFFKIRRGTN 323

Query:   333 KP--EG--LCGINKMAS 345
             +   EG  + GI K+ +
Sbjct:   324 ECQIEGNVVAGIAKLGT 340

 Score = 125 (49.1 bits), Expect = 2.9e-14, Sum P(2) = 2.9e-14
 Identities = 40/149 (26%), Positives = 70/149 (46%)

Query:    79 IDQRNKEVTSYWLGLNEFADMSHEEFKNKYLGLKPQFPTRRQP--SAEFSYR-DV--KAL 133
             +D  N   TS W+   E  ++S  E K K + +K   P  +    ++E   R ++  + L
Sbjct:    36 VDHVNTVQTS-WVA--EHNEISEFEMKFKVMDVKFAEPLEKDSDVASELFVRGEIVPEPL 92

Query:   134 PKSVDWRKK----GAVTPVKNQGSCGSCWAFSTVAAVEGINQIVSGNLTS---LSEQELI 186
             P + D R+K      +  ++NQ +CGSCWAF     +      +  N T    +S ++++
Sbjct:    93 PDTFDAREKWPDCNTIKLIRNQATCGSCWAFGAAEVISD-RVCIQSNGTQQPVISVEDIL 151

Query:   187 DC-DTSFNNGCNGGLMDYAFKYIVASGGL 214
              C  T+   GC GG    A ++  +SG +
Sbjct:   152 SCCGTTCGYGCKGGYSIEALRFWASSGAV 180


>DICTYBASE|DDB_G0283921 [details] [associations]
            symbol:ctsB "cathepsin B precursor" species:44689
            "Dictyostelium discoideum" [GO:0005615 "extracellular space"
            evidence=IDA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0008233 "peptidase activity"
            evidence=IEA] [GO:0005764 "lysosome" evidence=IEA] [GO:0044351
            "macropinocytosis" evidence=RCA] InterPro:IPR000668
            InterPro:IPR013128 InterPro:IPR015643 Pfam:PF00112 SMART:SM00645
            InterPro:IPR000169 dictyBase:DDB_G0283921 GO:GO:0005615
            GenomeReviews:CM000153_GR GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 EMBL:AAFI02000058
            eggNOG:NOG315657 PANTHER:PTHR12411:SF16 OMA:CSLSCQS
            RefSeq:XP_638805.1 HSSP:P07688 MEROPS:C01.A59
            EnsemblProtists:DDB0233997 GeneID:8624329 KEGG:ddi:DDB_G0283921
            Uniprot:Q54QD9
        Length = 311

 Score = 126 (49.4 bits), Expect = 3.1e-14, Sum P(2) = 3.1e-14
 Identities = 30/93 (32%), Positives = 50/93 (53%)

Query:   138 DWRKKGAVTPVKNQGSCGSCWAF-STVAAVEGINQIVSGNLTSLSEQELIDCDTSFNNGC 196
             +W     ++ ++NQ  CGSCWAF +T +A + +  I +     LS  +++ CD + +NGC
Sbjct:    88 NWPNCTTISQIQNQARCGSCWAFGATESATDRLC-IHNNENVQLSFMDMVTCDET-DNGC 145

Query:   197 NGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTC 229
              GG    A+ ++   G +  EE  PY +   TC
Sbjct:   146 EGGDAFSAWNWLRKQGAV-SEECLPYTIP--TC 175

 Score = 123 (48.4 bits), Expect = 3.1e-14, Sum P(2) = 3.1e-14
 Identities = 32/102 (31%), Positives = 46/102 (45%)

Query:   249 ENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELD-HGVAAVGYGKSKGSD 307
             ++DE  + + + + PV         DF  Y  GV+    G +L  H V  VG+G   G D
Sbjct:   217 DSDEAIMQEIVTNGPVEACFTVF-EDFLAYKSGVYVHTTGKDLGGHCVKLVGFGTLNGVD 275

Query:   308 YIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINK--MASIP 347
             Y    N W   WG+ G   +KR      G CGI+   +A +P
Sbjct:   276 YYAANNQWTTSWGDNGTFLIKR------GDCGISDDVVAGLP 311


>TAIR|locus:2204873 [details] [associations]
            symbol:AT1G02300 species:3702 "Arabidopsis thaliana"
            [GO:0004197 "cysteine-type endopeptidase activity" evidence=IEA]
            [GO:0005576 "extracellular region" evidence=ISM] [GO:0006508
            "proteolysis" evidence=IEA;ISS] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA;ISS] [GO:0050790 "regulation of
            catalytic activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR012599 InterPro:IPR013128 InterPro:IPR015643
            Pfam:PF00112 Pfam:PF08127 PRINTS:PR00705 SMART:SM00645
            EMBL:CP002684 GO:GO:0006508 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00639 GO:GO:0050790 GO:GO:0004197 KO:K01363
            PANTHER:PTHR12411:SF16 OMA:ADDINAC IPI:IPI00534431
            RefSeq:NP_563647.1 UniGene:At.43952 ProteinModelPortal:F4HVZ1
            SMR:F4HVZ1 MEROPS:C01.A10 EnsemblPlants:AT1G02300.1 GeneID:839576
            KEGG:ath:AT1G02300 ArrayExpress:F4HVZ1 Uniprot:F4HVZ1
        Length = 379

 Score = 148 (57.2 bits), Expect = 4.2e-14, Sum P(2) = 4.2e-14
 Identities = 42/142 (29%), Positives = 67/142 (47%)

Query:   217 EEDYPYLMEEGTCEDKKE---EMEVVTISGYQDVPENDEQSLL-KALAHQPVSVAIEASG 272
             E  YP    E  C  + +   E +   +  Y+  P  D Q ++ +   + PV VA     
Sbjct:   228 EPTYPTPKCERKCVSRNQLWGESKHYGVGAYRINP--DPQDIMAEVYKNGPVEVAFTVY- 284

Query:   273 TDFQFYSGGVFTGPCGAELD-HGVAAVGYGKSK-GSDYIIVKNSWGPKWGERGYIRMKRN 330
              DF  Y  GV+    G ++  H V  +G+G S  G DY ++ N W   WG+ GY +++R 
Sbjct:   285 EDFAHYKSGVYKYITGTKIGGHAVKLIGWGTSDDGEDYWLLANQWNRSWGDDGYFKIRRG 344

Query:   331 TGKPEGLCGINK--MASIPLKK 350
             T +    CGI +  +A +P +K
Sbjct:   345 TNE----CGIEQSVVAGLPSEK 362

 Score = 100 (40.3 bits), Expect = 4.2e-14, Sum P(2) = 4.2e-14
 Identities = 29/78 (37%), Positives = 37/78 (47%)

Query:   152 GSCGSCWAFSTVAAVEGINQIVSGNLT-SLSEQELIDC-DTSFNNGCNGGLMDYAFKYIV 209
             G CGSCWAF  V ++      +  NL  SLS  ++I C       GCNGG    A+ Y  
Sbjct:   146 GHCGSCWAFGAVESLSD-RFCIKYNLNVSLSANDVIACCGLLCGFGCNGGFPMGAWLYFK 204

Query:   210 ASGGLHKEEDYPYLMEEG 227
               G + +E D PY    G
Sbjct:   205 YHGVVTQECD-PYFDNTG 221


>UNIPROTKB|P05689 [details] [associations]
            symbol:CTSZ "Cathepsin Z" species:9913 "Bos taurus"
            [GO:0005764 "lysosome" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0006508 "proteolysis"
            evidence=IEA] InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112
            PRINTS:PR00705 SMART:SM00645 eggNOG:COG4870 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411 PROSITE:PS00640
            PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005764 EMBL:BC122603
            EMBL:X01809 IPI:IPI00708474 PIR:A29172 RefSeq:NP_001071303.1
            UniGene:Bt.4902 ProteinModelPortal:P05689 SMR:P05689 MEROPS:C01.013
            PRIDE:P05689 GeneID:404187 KEGG:bta:404187 CTD:1522
            HOGENOM:HOG000264454 HOVERGEN:HBG004456 InParanoid:P05689 KO:K08568
            OrthoDB:EOG42Z4QN BRENDA:3.4.18.1 NextBio:20817615 Uniprot:P05689
        Length = 304

 Score = 198 (74.8 bits), Expect = 4.6e-14, P = 4.6e-14
 Identities = 60/197 (30%), Positives = 97/197 (49%)

Query:   154 CGSCWAF-STVAAVEGINQIVSGNLTS--LSEQELIDCDTSFNNGCNGG----LMDYAFK 206
             CGSCWA  ST A  + IN    G   S  LS Q +IDC  +    C GG    + +YA +
Sbjct:    90 CGSCWAHGSTSAMADRINIKRKGAWPSTLLSVQHVIDCGDA--GSCEGGNDLPVWEYAHR 147

Query:   207 YIVASGGLH----KEEDYPYLMEEGTCEDKKE-----EMEVVTISGYQDVPENDEQSLLK 257
             + +     +    K+++     + GTC + KE        +  +  Y  +    E+ + +
Sbjct:   148 HGIPDETCNNYQAKDQECDKFNQCGTCTEFKECHVIKNYTLWKVGDYGSL-SGREKMMAE 206

Query:   258 ALAHQPVSVAIEASGTDFQFYSGGVFTGPCG-AELDHGVAAVGYGKSKGSDYIIVKNSWG 316
                + P+S  I A+      Y+GG+++     A ++H V+  G+G S G +Y IV+NSWG
Sbjct:   207 IYTNGPISCGIMAT-EKMSNYTGGIYSEYNDQAFINHIVSVAGWGVSDGMEYWIVRNSWG 265

Query:   317 PKWGERGYIRMKRNTGK 333
               WGE G++R+  +T K
Sbjct:   266 EPWGEHGWMRIVTSTYK 282

 Score = 114 (45.2 bits), Expect = 0.00047, P = 0.00047
 Identities = 43/134 (32%), Positives = 59/134 (44%)

Query:   118 RRQPSAEFSYRDVKALPKSVDWRKKGAV---TPVKNQGS---CGSCWAF-STVAAVEGIN 170
             RR       Y     LPKS DWR    V   +  +NQ     CGSCWA  ST A  + IN
Sbjct:    48 RRTYPRPHEYLSPSDLPKSWDWRNVNGVNYASVTRNQHIPQYCGSCWAHGSTSAMADRIN 107

Query:   171 QIVSGNLTS--LSEQELIDCDTSFNNGCNGG----LMDYAFKYIVASGGLH----KEEDY 220
                 G   S  LS Q +IDC  +    C GG    + +YA ++ +     +    K+++ 
Sbjct:   108 IKRKGAWPSTLLSVQHVIDCGDA--GSCEGGNDLPVWEYAHRHGIPDETCNNYQAKDQEC 165

Query:   221 PYLMEEGTCEDKKE 234
                 + GTC + KE
Sbjct:   166 DKFNQCGTCTEFKE 179


>WB|WBGene00009158 [details] [associations]
            symbol:F26E4.3 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR001212 InterPro:IPR013128 Pfam:PF00112 PROSITE:PS00524
            PROSITE:PS50958 SMART:SM00201 SMART:SM00645 GO:GO:0005576
            GO:GO:0006955 GO:GO:0030247 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00640 PROSITE:PS00139 PROSITE:PS00639 GO:GO:0005044
            GeneTree:ENSGT00560000076599 HSSP:P07711 EMBL:Z81070
            eggNOG:NOG310046 HOGENOM:HOG000241342 OMA:DNCNRCT PIR:T21421
            RefSeq:NP_492593.2 ProteinModelPortal:P90850 SMR:P90850
            PaxDb:P90850 EnsemblMetazoa:F26E4.3.1 EnsemblMetazoa:F26E4.3.2
            GeneID:172827 KEGG:cel:CELE_F26E4.3 UCSC:F26E4.3.1 CTD:172827
            WormBase:F26E4.3 InParanoid:P90850 NextBio:877161 Uniprot:P90850
        Length = 452

 Score = 151 (58.2 bits), Expect = 7.4e-14, Sum P(2) = 7.4e-14
 Identities = 43/123 (34%), Positives = 59/123 (47%)

Query:   115 FPTRRQPSAEFSYRDVKALPKSVDWRKKGA--VTPVKNQGSCGSCWAFSTVAAVEGINQI 172
             FP R   +        + LP+  D R K    + PV +QG CGS W+ ST A       I
Sbjct:   166 FPERSVQNMNEILIKPRELPEHFDARDKWGPLIHPVADQGDCGSSWSVSTTAISSDRLAI 225

Query:   173 VS-GNLTS-LSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLM----EE 226
             +S G + S LS Q+L+ C+     GC GG +D A+ YI    G+  +  YPY+     E 
Sbjct:   226 ISEGRINSTLSSQQLLSCNQHRQKGCEGGYLDRAWWYI-RKLGVVGDHCYPYVSGQSREP 284

Query:   227 GTC 229
             G C
Sbjct:   285 GHC 287

 Score = 97 (39.2 bits), Expect = 7.4e-14, Sum P(2) = 7.4e-14
 Identities = 33/122 (27%), Positives = 50/122 (40%)

Query:   222 YLMEEGT-CEDKKEEMEVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSG 280
             Y   +G  C    ++     ++    V   +E    + + + PV         DF  Y+G
Sbjct:   294 YTNRQGLRCPSGSQDSTAFKMTPPYKVSSREEDIQTELMTNGPVQATFVVH-EDFFMYAG 352

Query:   281 GVFT--------GPCG-AELDHGVAAVGYG--KSKGSD--YIIVKNSWGPKWGERGYIRM 327
             GV+         G    AE  H V  +G+G   S G    Y +  NSWG +WGE GY ++
Sbjct:   353 GVYQHSDLAAQKGASSVAEGYHSVRVLGWGVDHSTGKPIKYWLCANSWGTQWGEDGYFKV 412

Query:   328 KR 329
              R
Sbjct:   413 LR 414


>UNIPROTKB|F1MW68 [details] [associations]
            symbol:CTSZ "Cathepsin Z" species:9913 "Bos taurus"
            [GO:0060441 "epithelial tube branching involved in lung
            morphogenesis" evidence=IEA] [GO:0005783 "endoplasmic reticulum"
            evidence=IEA] [GO:0005615 "extracellular space" evidence=IEA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0005783 GO:GO:0005615 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 PANTHER:PTHR12411 PROSITE:PS00640 GO:GO:0060441
            GeneTree:ENSGT00560000076599 IPI:IPI00708474 UniGene:Bt.4902
            OMA:QCGTCTE EMBL:DAAA02036315 PRIDE:F1MW68
            Ensembl:ENSBTAT00000025007 Uniprot:F1MW68
        Length = 304

 Score = 196 (74.1 bits), Expect = 8.6e-14, P = 8.6e-14
 Identities = 59/197 (29%), Positives = 97/197 (49%)

Query:   154 CGSCWAF-STVAAVEGINQIVSGNLTS--LSEQELIDCDTSFNNGCNGG----LMDYAFK 206
             CGSCWA  ST A  + IN    G   S  LS Q ++DC  +    C GG    + +YA +
Sbjct:    90 CGSCWAHGSTSAMADRINIKRKGAWPSTLLSVQHVLDCGDA--GSCEGGNDLPVWEYAHR 147

Query:   207 YIVASGGLH----KEEDYPYLMEEGTCEDKKE-----EMEVVTISGYQDVPENDEQSLLK 257
             + +     +    K+++     + GTC + KE        +  +  Y  +    E+ + +
Sbjct:   148 HGIPDETCNNYQAKDQECDKFNQCGTCTEFKECHVIKNYTLWKVGDYGSL-SGREKMMAE 206

Query:   258 ALAHQPVSVAIEASGTDFQFYSGGVFTGPCG-AELDHGVAAVGYGKSKGSDYIIVKNSWG 316
                + P+S  I A+      Y+GG+++     A ++H V+  G+G S G +Y IV+NSWG
Sbjct:   207 IYTNGPISCGIMAT-EKMSNYTGGIYSEYNDQAFINHIVSVAGWGVSDGMEYWIVRNSWG 265

Query:   317 PKWGERGYIRMKRNTGK 333
               WGE G++R+  +T K
Sbjct:   266 EPWGEHGWMRIVTSTYK 282

 Score = 112 (44.5 bits), Expect = 0.00079, P = 0.00079
 Identities = 42/134 (31%), Positives = 59/134 (44%)

Query:   118 RRQPSAEFSYRDVKALPKSVDWRKKGAV---TPVKNQGS---CGSCWAF-STVAAVEGIN 170
             RR       Y     LPKS DWR    V   +  +NQ     CGSCWA  ST A  + IN
Sbjct:    48 RRTYPRPHEYLSPSDLPKSWDWRNVNGVNYASVTRNQHIPQYCGSCWAHGSTSAMADRIN 107

Query:   171 QIVSGNLTS--LSEQELIDCDTSFNNGCNGG----LMDYAFKYIVASGGLH----KEEDY 220
                 G   S  LS Q ++DC  +    C GG    + +YA ++ +     +    K+++ 
Sbjct:   108 IKRKGAWPSTLLSVQHVLDCGDA--GSCEGGNDLPVWEYAHRHGIPDETCNNYQAKDQEC 165

Query:   221 PYLMEEGTCEDKKE 234
                 + GTC + KE
Sbjct:   166 DKFNQCGTCTEFKE 179


>UNIPROTKB|E1BTI7 [details] [associations]
            symbol:TINAG "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0005044 "scavenger receptor activity" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0006955 "immune
            response" evidence=IEA] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0030247 "polysaccharide binding"
            evidence=IEA] [GO:0005604 "basement membrane" evidence=IEA]
            [GO:0007155 "cell adhesion" evidence=IEA] InterPro:IPR000668
            InterPro:IPR001212 InterPro:IPR013128 Pfam:PF00112 PROSITE:PS50958
            SMART:SM00201 SMART:SM00645 GO:GO:0006955 GO:GO:0030247
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411
            PROSITE:PS00640 GO:GO:0007155 GO:GO:0005604 GO:GO:0005044
            GeneTree:ENSGT00560000076599 CTD:27283 OMA:WGQLTSS
            EMBL:AADN02002720 EMBL:AADN02002721 IPI:IPI00581566
            RefSeq:XP_419905.3 UniGene:Gga.11215 Ensembl:ENSGALT00000026295
            GeneID:421888 KEGG:gga:421888 Uniprot:E1BTI7
        Length = 467

 Score = 139 (54.0 bits), Expect = 1.5e-13, Sum P(2) = 1.5e-13
 Identities = 29/65 (44%), Positives = 40/65 (61%)

Query:   150 NQGSCGSCWAFSTVA-AVEGINQIVSGNLT-SLSEQELIDCDTSFNNGCNGGLMDYAFKY 207
             +Q +CG+ WAFST + A + I     G +T +LS Q LI CDT    GCNGG +D A++Y
Sbjct:   241 DQRNCGASWAFSTASVAADRITIHSDGQITDNLSVQNLISCDTGNQRGCNGGSIDGAWRY 300

Query:   208 IVASG 212
             +   G
Sbjct:   301 LTTHG 305

 Score = 108 (43.1 bits), Expect = 1.5e-13, Sum P(2) = 1.5e-13
 Identities = 37/126 (29%), Positives = 57/126 (45%)

Query:   227 GTCEDKKEEMEVVTISG-YQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVF-- 283
             G C +  E+   +   G +  V   +   + + +A  PV  AI     DF  Y  G++  
Sbjct:   341 GPCPNALEDSNRLYRCGSHYRVSSKETDIMEEIMAKGPVQ-AIMKVYEDFFLYKEGIYRH 399

Query:   284 TGPCGAELD-HGVAAVGYGKSKGSD-----YIIVKNSWGPKWGERGYIRMKRNTGKPEGL 337
             +   G++   H V  +G+G   G +     + I  NSWG  WGE GY R+ R  G+ E  
Sbjct:   400 SYKAGSKWKTHSVKLLGWGSLPGKNGQKQKFWIAANSWGKYWGENGYFRILR--GQNE-- 455

Query:   338 CGINKM 343
             C I K+
Sbjct:   456 CDIEKL 461


>UNIPROTKB|E1C4M3 [details] [associations]
            symbol:CTSZ "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0005615
            "extracellular space" evidence=IEA] [GO:0005783 "endoplasmic
            reticulum" evidence=IEA] [GO:0060441 "epithelial tube branching
            involved in lung morphogenesis" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0005783 GO:GO:0005615 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025661 PANTHER:PTHR12411 PROSITE:PS00640
            GeneTree:ENSGT00560000076599 CTD:1522 KO:K08568 OMA:QCGTCTE
            EMBL:AADN02019004 IPI:IPI00596430 RefSeq:XP_417483.3
            Ensembl:ENSGALT00000012067 GeneID:419311 KEGG:gga:419311
            Uniprot:E1C4M3
        Length = 305

 Score = 187 (70.9 bits), Expect = 1.2e-12, P = 1.2e-12
 Identities = 61/194 (31%), Positives = 93/194 (47%)

Query:   154 CGSCWAF-STVAAVEGINQIVSGNLTS--LSEQELIDCDTSFNNGCNGGLMDYAFKYIVA 210
             CGSCWA  ST A  + IN    G   S  LS Q +IDC  +    C GG  D+   ++ A
Sbjct:    90 CGSCWAHGSTSALADRINIKRKGAWPSAYLSVQNVIDCANA--GSCEGG--DHTGVWMYA 145

Query:   211 SG-GLHKEEDYPYLMEEGTCEDKKE--------EMEVVT------ISGYQDVPENDEQSL 255
                G+  E    Y  +   C+   +        E  V+       ++ Y  V    E+ +
Sbjct:   146 HDHGIPDETCNNYQAKNQKCKKFNQCGTCVTFGECHVIKNYTLWKVADYGAV-SGREKMM 204

Query:   256 LKALAHQPVSVAIEASGTDFQFYSGGVFT--GPCGAELDHGVAAVGYGKSKGSDYIIVKN 313
              +  A+ P+S  I A+      Y+GG++T   P    ++H V+  G+G   G++Y IV+N
Sbjct:   205 AEIYANGPISCGIMAT-EKLDAYTGGLYTEYNP-SPTVNHIVSVAGWGVENGTEYWIVRN 262

Query:   314 SWGPKWGERGYIRM 327
             SWG  WGERG++R+
Sbjct:   263 SWGEPWGERGWLRI 276

 Score = 125 (49.1 bits), Expect = 2.6e-05, P = 2.6e-05
 Identities = 65/223 (29%), Positives = 95/223 (42%)

Query:   127 YRDVKALPKSVDWRKKGAV---TPVKNQGS---CGSCWAF-STVAAVEGINQIVSGNLTS 179
             Y D+  LP+S DWR    V   +  +NQ     CGSCWA  ST A  + IN    G   S
Sbjct:    57 YLDMAELPQSWDWRNVNGVNYASTTRNQHIPQYCGSCWAHGSTSALADRINIKRKGAWPS 116

Query:   180 --LSEQELIDCDTSFNNGCNGGLMDYAFKYIVASG-GLHKEEDYPYLMEEGTCEDKKEEM 236
               LS Q +IDC  +    C GG  D+   ++ A   G+  E    Y  +   C+   +  
Sbjct:   117 AYLSVQNVIDCANA--GSCEGG--DHTGVWMYAHDHGIPDETCNNYQAKNQKCKKFNQCG 172

Query:   237 EVVTISGYQDVPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAE-LDH-- 293
               VT  G   V +N   +L K   +  VS   E    +  + +G +  G    E LD   
Sbjct:   173 TCVTF-GECHVIKN--YTLWKVADYGAVS-GREKMMAEI-YANGPISCGIMATEKLDAYT 227

Query:   294 GVAAVGYGKSKGSDYIIVKNSWGPKWGERGYIRMKRNTGKPEG 336
             G     Y  S   ++I+    WG + G   +I ++ + G+P G
Sbjct:   228 GGLYTEYNPSPTVNHIVSVAGWGVENGTEYWI-VRNSWGEPWG 269


>UNIPROTKB|E2QXH3 [details] [associations]
            symbol:TINAGL1 "Uncharacterized protein" species:9615
            "Canis lupus familiaris" [GO:0043236 "laminin binding"
            evidence=IEA] [GO:0031012 "extracellular matrix" evidence=IEA]
            [GO:0005737 "cytoplasm" evidence=IEA] [GO:0030247 "polysaccharide
            binding" evidence=IEA] [GO:0008234 "cysteine-type peptidase
            activity" evidence=IEA] [GO:0006955 "immune response" evidence=IEA]
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0005044 "scavenger
            receptor activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR001212 InterPro:IPR013128 Pfam:PF00112 PROSITE:PS00524
            PROSITE:PS50958 SMART:SM00201 SMART:SM00645 GO:GO:0005737
            GO:GO:0006955 GO:GO:0030247 GO:GO:0008234 GO:GO:0006508
            InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00639 GO:GO:0031012
            GO:GO:0005044 GeneTree:ENSGT00560000076599 CTD:64129 OMA:DNCNRCT
            EMBL:AAEX03001668 RefSeq:XP_535330.3 Ensembl:ENSCAFT00000035659
            GeneID:478155 KEGG:cfa:478155 NextBio:20853523 Uniprot:E2QXH3
        Length = 467

 Score = 132 (51.5 bits), Expect = 2.6e-12, Sum P(2) = 2.6e-12
 Identities = 40/136 (29%), Positives = 62/136 (45%)

Query:    96 FADMSHEEFKNKYLG-LKPQFPTRRQPSAEFSYRDVKALPKSVDWRKK--GAVTPVKNQG 152
             F  M+ +E     LG ++P              R  + LP + +  +K    +    +QG
Sbjct:   165 FWGMTLDEGIRYRLGTIRPSSSVTNMNEIHTVLRPGEVLPTAFEAAEKWPNLIHEPLDQG 224

Query:   153 SCGSCWAFSTVAAVEGINQIVS-GNLTS-LSEQELIDCDTSFNNGCNGGLMDYAFKYIVA 210
             +C   WAFST A       I S G++T  LS Q L+ CDT    GC GG +D A+ + + 
Sbjct:   225 NCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSCDTHNQQGCRGGRLDGAW-WFLR 283

Query:   211 SGGLHKEEDYPYLMEE 226
               G+  +  YP++  E
Sbjct:   284 RRGVVSDHCYPFVGRE 299

 Score = 104 (41.7 bits), Expect = 2.6e-12, Sum P(2) = 2.6e-12
 Identities = 31/98 (31%), Positives = 45/98 (45%)

Query:   251 DEQSLLKALAHQ-PVSVAIEASGTDFQFYSGGVFT------GPCGAELDHGVAAV---GY 300
             +E+ ++K L    PV   +E    DF  Y GG+++      G       HG  +V   G+
Sbjct:   349 NEKEIMKELMENGPVQALMEVH-EDFFLYQGGIYSHTPVSLGRPERYRRHGTHSVKITGW 407

Query:   301 GKSKGSD-----YIIVKNSWGPKWGERGYIRMKRNTGK 333
             G+    D     Y    NSWGP WGERG+ R+ R   +
Sbjct:   408 GEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGANE 445


>UNIPROTKB|F1M8U6 [details] [associations]
            symbol:F1M8U6 "Uncharacterized protein" species:10116
            "Rattus norvegicus" [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0008234 "cysteine-type peptidase activity" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR013128 Pfam:PF00112 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00639
            IPI:IPI00782277 Ensembl:ENSRNOT00000055587 OMA:EREIAAW
            Uniprot:F1M8U6
        Length = 163

 Score = 168 (64.2 bits), Expect = 3.4e-12, P = 3.4e-12
 Identities = 59/171 (34%), Positives = 80/171 (46%)

Query:   182 EQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKEEMEVVTI 241
             ++EL+DCD   +  C GGL   A+  I   GGL  E+ Y Y      C +   +M  V I
Sbjct:     1 KKELLDCD-KMDKACLGGLPSNAYTAIKNLGGLETEDGYGYEGHFQAC-NFLAQMTKVYI 58

Query:   242 SGYQDVPENDEQSLLKALAHQP-VSVAIEASGTDFQFYSGGVF--TGP-CGAEL-DHGVA 296
             S   ++ +N E S+   LA +  +SVAI       QF+  G      P C     DH V 
Sbjct:    59 SDSVELSQN-ESSIAALLAQKGLISVAI------MQFHRYGTVHPLRPLCSPGFTDHSVL 111

Query:   297 AVGYGKSKGSD--YIIVKNSWGPKWGERGYIRMKRNTGKPEGLCGINKMAS 345
              VGYG    S+  Y  +KN  G  WGE G+  + R +G      G+N MAS
Sbjct:   112 LVGYGNRPRSNIPYWAIKNIQGSDWGEEGHYYLYRGSGDR----GVNTMAS 158


>UNIPROTKB|Q9GZM7 [details] [associations]
            symbol:TINAGL1 "Tubulointerstitial nephritis antigen-like"
            species:9606 "Homo sapiens" [GO:0005044 "scavenger receptor
            activity" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0006955 "immune response" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0030247
            "polysaccharide binding" evidence=IEA] [GO:0005737 "cytoplasm"
            evidence=IEA] [GO:0043236 "laminin binding" evidence=IEA]
            [GO:0016197 "endosomal transport" evidence=TAS] [GO:0005201
            "extracellular matrix structural constituent" evidence=NAS]
            [GO:0005576 "extracellular region" evidence=NAS] [GO:0031012
            "extracellular matrix" evidence=IDA] [GO:0005615 "extracellular
            space" evidence=ISS] InterPro:IPR000668 InterPro:IPR001212
            InterPro:IPR013128 Pfam:PF00112 PROSITE:PS00524 PROSITE:PS50958
            SMART:SM00201 SMART:SM00645 GO:GO:0005737 GO:GO:0005615
            GO:GO:0006955 GO:GO:0030247 EMBL:CH471059 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00639
            GO:GO:0016197 EMBL:AC114488 GO:GO:0005044 GO:GO:0005201
            eggNOG:NOG310046 HOGENOM:HOG000241342 HOVERGEN:HBG053961
            EMBL:AF236155 EMBL:AF236151 EMBL:AF236152 EMBL:AF236153
            EMBL:AF236154 EMBL:AF236150 EMBL:AF205436 EMBL:AB050716
            EMBL:AB050719 EMBL:AK074124 EMBL:AY358421 EMBL:AF289569
            EMBL:AK027839 EMBL:AK292770 EMBL:AK298382 EMBL:AK075398
            EMBL:BC009048 EMBL:BC064633 IPI:IPI00005563 IPI:IPI00439435
            IPI:IPI00910801 RefSeq:NP_001191343.1 RefSeq:NP_001191344.1
            RefSeq:NP_071447.1 UniGene:Hs.199368 ProteinModelPortal:Q9GZM7
            SMR:Q9GZM7 IntAct:Q9GZM7 MINT:MINT-253718 STRING:Q9GZM7
            MEROPS:C01.975 PhosphoSite:Q9GZM7 DMDM:61213628 PaxDb:Q9GZM7
            PRIDE:Q9GZM7 Ensembl:ENST00000271064 Ensembl:ENST00000457433
            GeneID:64129 KEGG:hsa:64129 UCSC:uc001bta.3 CTD:64129
            GeneCards:GC01P032042 HGNC:HGNC:19168 HPA:HPA048695
            neXtProt:NX_Q9GZM7 PharmGKB:PA38810 InParanoid:Q9GZM7 OMA:DNCNRCT
            OrthoDB:EOG4BG8W0 PhylomeDB:Q9GZM7 ChiTaRS:TINAGL1 GenomeRNAi:64129
            NextBio:66016 ArrayExpress:Q9GZM7 Bgee:Q9GZM7 CleanEx:HS_TINAGL1
            Genevestigator:Q9GZM7 GermOnline:ENSG00000142910 Uniprot:Q9GZM7
        Length = 467

 Score = 127 (49.8 bits), Expect = 3.7e-12, Sum P(2) = 3.7e-12
 Identities = 29/75 (38%), Positives = 41/75 (54%)

Query:   150 NQGSCGSCWAFSTVAAVEGINQIVS-GNLTS-LSEQELIDCDTSFNNGCNGGLMDYAFKY 207
             +QG+C   WAFST A       I S G++T  LS Q L+ CDT    GC GG +D A+ +
Sbjct:   222 DQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLLSCDTHQQQGCRGGRLDGAW-W 280

Query:   208 IVASGGLHKEEDYPY 222
              +   G+  +  YP+
Sbjct:   281 FLRRRGVVSDHCYPF 295

 Score = 108 (43.1 bits), Expect = 3.7e-12, Sum P(2) = 3.7e-12
 Identities = 30/98 (30%), Positives = 46/98 (46%)

Query:   250 NDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFT------GPCGAELDHGVAAV---GY 300
             ND++ + + + + PV   +E    DF  Y GG+++      G       HG  +V   G+
Sbjct:   349 NDKEIMKELMENGPVQALMEVH-EDFFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITGW 407

Query:   301 GKSKGSD-----YIIVKNSWGPKWGERGYIRMKRNTGK 333
             G+    D     Y    NSWGP WGERG+ R+ R   +
Sbjct:   408 GEETLPDGRTLKYWTAANSWGPAWGERGHFRIVRGVNE 445


>MGI|MGI:2137617 [details] [associations]
            symbol:Tinagl1 "tubulointerstitial nephritis antigen-like 1"
            species:10090 "Mus musculus" [GO:0005044 "scavenger receptor
            activity" evidence=IEA] [GO:0005576 "extracellular region"
            evidence=IEA] [GO:0005737 "cytoplasm" evidence=IDA] [GO:0006508
            "proteolysis" evidence=IEA] [GO:0006955 "immune response"
            evidence=IEA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0030247 "polysaccharide binding" evidence=IEA]
            [GO:0043236 "laminin binding" evidence=IPI] InterPro:IPR000668
            InterPro:IPR001212 InterPro:IPR013128 Pfam:PF00112 PROSITE:PS00524
            PROSITE:PS50958 SMART:SM00201 SMART:SM00645 MGI:MGI:2137617
            GO:GO:0005737 GO:GO:0005576 GO:GO:0006955 GO:GO:0030247
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025660 PANTHER:PTHR12411
            PROSITE:PS00639 GO:GO:0031012 CleanEx:MM_ARG1 GO:GO:0005044
            GeneTree:ENSGT00560000076599 eggNOG:NOG310046 HOGENOM:HOG000241342
            HOVERGEN:HBG053961 MEROPS:C01.975 CTD:64129 OMA:DNCNRCT
            OrthoDB:EOG4BG8W0 EMBL:AB047402 EMBL:AB050626 EMBL:BC005738
            EMBL:BC018539 IPI:IPI00115458 RefSeq:NP_001161805.1
            RefSeq:NP_075965.2 UniGene:Mm.15801 ProteinModelPortal:Q99JR5
            SMR:Q99JR5 STRING:Q99JR5 PhosphoSite:Q99JR5 PaxDb:Q99JR5
            PRIDE:Q99JR5 Ensembl:ENSMUST00000030560 Ensembl:ENSMUST00000105998
            Ensembl:ENSMUST00000105999 GeneID:94242 KEGG:mmu:94242
            InParanoid:Q99JR5 NextBio:352247 Bgee:Q99JR5 Genevestigator:Q99JR5
            GermOnline:ENSMUSG00000028776 Uniprot:Q99JR5
        Length = 466

 Score = 129 (50.5 bits), Expect = 4.4e-12, Sum P(2) = 4.4e-12
 Identities = 29/75 (38%), Positives = 41/75 (54%)

Query:   150 NQGSCGSCWAFSTVAAVEGINQIVS-GNLTS-LSEQELIDCDTSFNNGCNGGLMDYAFKY 207
             +QG+C   WAFST A       I S G++T  LS Q L+ CDT    GC GG +D A+ +
Sbjct:   221 DQGNCAGSWAFSTAAVASDRVSIHSLGHMTPILSPQNLLSCDTHHQQGCRGGRLDGAW-W 279

Query:   208 IVASGGLHKEEDYPY 222
              +   G+  +  YP+
Sbjct:   280 FLRRRGVVSDNCYPF 294

 Score = 105 (42.0 bits), Expect = 4.4e-12, Sum P(2) = 4.4e-12
 Identities = 32/99 (32%), Positives = 46/99 (46%)

Query:   250 NDEQSLLKALAHQ-PVSVAIEASGTDFQFYSGGVFT------GPCGAELDHGVAAV---G 299
             +DE+ ++K L    PV   +E    DF  Y  G+++      G       HG  +V   G
Sbjct:   347 SDEKEIMKELMENGPVQALMEVH-EDFFLYQRGIYSHTPVSQGRPEQYRRHGTHSVKITG 405

Query:   300 YGKSKGSD-----YIIVKNSWGPKWGERGYIRMKRNTGK 333
             +G+    D     Y    NSWGP WGERG+ R+ R T +
Sbjct:   406 WGEETLPDGRTIKYWTAANSWGPWWGERGHFRIVRGTNE 444


>WB|WBGene00013072 [details] [associations]
            symbol:Y51A2D.1 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR013201 PANTHER:PTHR12411
            Pfam:PF08246 SMART:SM00848 GeneTree:ENSGT00560000076599
            EMBL:AL021497 RefSeq:NP_001256811.1 ProteinModelPortal:O62484
            SMR:O62484 MEROPS:C01.A37 EnsemblMetazoa:Y51A2D.1 GeneID:180204
            KEGG:cel:CELE_Y51A2D.1 UCSC:Y51A2D.1 CTD:180204 WormBase:Y51A2D.1a
            HOGENOM:HOG000019851 NextBio:908416 Uniprot:O62484
        Length = 314

 Score = 128 (50.1 bits), Expect = 5.7e-12, Sum P(2) = 5.7e-12
 Identities = 39/107 (36%), Positives = 55/107 (51%)

Query:   248 PENDEQSLLKALAHQPVSVAIE-ASGTDF-QFYSGGVFTGPC--GAELDHGVAAVGYG-- 301
             PEN E  +++ L      VA+  A+GT F Q+ SG + T  C     + H  A VGYG  
Sbjct:   201 PENAESEIIEILNTWKTPVAVYFAAGTAFLQYKSGVLVTEDCDLAGTVWHAGAIVGYGEE 260

Query:   302 ---KSKGSDYIIVKNSWGPK-WGERGYIRMKRNTGKPEGLCGINKMA 344
                + +   + I+KNSWG   WG  GY+++ R  GK    CGI + A
Sbjct:   261 NDLRGRSQRFWIMKNSWGVSGWGTGGYVKLIR--GK--NWCGIERGA 303

 Score = 99 (39.9 bits), Expect = 5.7e-12, Sum P(2) = 5.7e-12
 Identities = 30/75 (40%), Positives = 41/75 (54%)

Query:   101 HEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKSVDWRKKGAVTPVKNQGSCGSCWAF 160
             H+ FK K LG K +  T+RQ S EF+ R+     + V+ R    V P+KNQG C  CW F
Sbjct:   122 HKNFK-KLLG-KTR--TKRQNS-EFA-RNFDLRSQKVNGRY--IVGPIKNQGQCACCWGF 173

Query:   161 STVAAVEGINQIVSG 175
             +  A +E I  +  G
Sbjct:   174 AVTAMLETIYAVNVG 188

 Score = 58 (25.5 bits), Expect = 9.7e-08, Sum P(2) = 9.7e-08
 Identities = 21/99 (21%), Positives = 42/99 (42%)

Query:    42 DKLIELFESWMSKHGKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYW----LGLNEFA 97
             +K+ + F  +  K  +TYK   E   R + F ++  ++ + NK            +N+F+
Sbjct:    38 EKVYQEFVEFKKKFSRTYKSEAENQLRLQNFVKSRNNVVRLNKNAQKAGRNSNFAVNQFS 97

Query:    98 DMSHEEFKNKYLGLKPQFPTRRQPSAEFSYRDVKALPKS 136
             D++  E   +      +FP     ++ F     K L K+
Sbjct:    98 DLTTSELHQRL----SRFPPNLTENSVFHKNFKKLLGKT 132


>RGD|1359482 [details] [associations]
            symbol:Tinag "tubulointerstitial nephritis antigen"
            species:10116 "Rattus norvegicus" [GO:0005044 "scavenger receptor
            activity" evidence=IEA] [GO:0005604 "basement membrane"
            evidence=ISO] [GO:0006508 "proteolysis" evidence=IEA] [GO:0006955
            "immune response" evidence=IEA] [GO:0007155 "cell adhesion"
            evidence=ISO] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0030247 "polysaccharide binding" evidence=IEA]
            InterPro:IPR000668 InterPro:IPR001212 InterPro:IPR013128
            Pfam:PF00112 Pfam:PF01033 PROSITE:PS00524 PROSITE:PS50958
            SMART:SM00201 SMART:SM00645 RGD:1359482 GO:GO:0006955 GO:GO:0030247
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411
            PROSITE:PS00640 GO:GO:0007155 EMBL:CH473954 GO:GO:0005604
            GO:GO:0005044 MEROPS:C01.973 CTD:27283 eggNOG:NOG310046
            HOGENOM:HOG000241342 HOVERGEN:HBG053961 OMA:WGQLTSS
            OrthoDB:EOG47PX5P EMBL:BC081887 IPI:IPI00370427
            RefSeq:NP_001005549.1 UniGene:Rn.43851 STRING:Q66HF6
            Ensembl:ENSRNOT00000041567 GeneID:300846 KEGG:rno:300846
            UCSC:RGD:1359482 InParanoid:Q66HF6 NextBio:647630
            Genevestigator:Q66HF6 Uniprot:Q66HF6
        Length = 475

 Score = 136 (52.9 bits), Expect = 6.2e-12, Sum P(2) = 6.2e-12
 Identities = 52/180 (28%), Positives = 83/180 (46%)

Query:    56 GKTYKCIEEKLHRFEIFKENLKHIDQRNKEVTSYWLGLNEFADMSHEE-FKNKYLGLKPQ 114
             G+ +KC +   H   +  E + HI++ +   T+     ++F  M+ EE FK + LG  P 
Sbjct:   144 GQQWKCSQ---HVCLVLPELIDHINKGDYGWTAQ--NYSQFWGMTLEEGFKFR-LGTLPP 197

Query:   115 FP---TRRQPSAEFSYRDV-KALPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVA-AVEGI 169
              P   +  + +A +   D+ +    S  W   G      +Q +C + WAFST + A + I
Sbjct:   198 SPMLLSMNEMTASYPRADLPEVFIASYKW--PGWTHGPLDQKNCAASWAFSTASVAADRI 255

Query:   170 NQIVSGNLTS-LSEQELIDCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPYLMEEGT 228
                  G  T+ LS Q LI C     +GCN G +D A+ + +   GL     YP   E+ T
Sbjct:   256 AIQSKGRYTANLSPQNLISCCAKNRHGCNSGSIDRAW-WFLRKRGLVSHACYPLFKEQST 314

 Score = 96 (38.9 bits), Expect = 6.2e-12, Sum P(2) = 6.2e-12
 Identities = 24/103 (23%), Positives = 44/103 (42%)

Query:   247 VPENDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTGPCGAELD---------HGVAA 297
             +  N+ + + + + + PV   ++    DF +Y  G++        +         H V  
Sbjct:   356 ISSNETEIMREIIQNGPVQAIMQVH-EDFFYYKTGIYRHVVSTNEEPEKYRKLRTHAVKL 414

Query:   298 VGYGKSKGSD-----YIIVKNSWGPKWGERGYIRMKRNTGKPE 335
              G+G  +G+      + I  NSWG  WGE GY R+ R   + +
Sbjct:   415 TGWGTLRGAQGKKEKFWIAANSWGKSWGENGYFRILRGVNESD 457


>RGD|70956 [details] [associations]
            symbol:Tinagl1 "tubulointerstitial nephritis antigen-like 1"
            species:10116 "Rattus norvegicus" [GO:0005044 "scavenger receptor
            activity" evidence=IEA] [GO:0005576 "extracellular region"
            evidence=IEA] [GO:0005737 "cytoplasm" evidence=IEA;ISO] [GO:0006508
            "proteolysis" evidence=IEA] [GO:0006955 "immune response"
            evidence=IEA] [GO:0008234 "cysteine-type peptidase activity"
            evidence=IEA] [GO:0030247 "polysaccharide binding" evidence=IEA]
            [GO:0031012 "extracellular matrix" evidence=IEA;ISO] [GO:0043236
            "laminin binding" evidence=IEA;ISO] InterPro:IPR000668
            InterPro:IPR001212 InterPro:IPR013128 Pfam:PF00112 PROSITE:PS00524
            PROSITE:PS50958 SMART:SM00201 SMART:SM00645 RGD:70956 GO:GO:0005737
            GO:GO:0005576 GO:GO:0006955 GO:GO:0030247 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00639
            GO:GO:0031012 GO:GO:0005044 eggNOG:NOG310046 HOGENOM:HOG000241342
            HOVERGEN:HBG053961 MEROPS:C01.975 CTD:64129 OrthoDB:EOG4BG8W0
            EMBL:AB050717 IPI:IPI00190428 RefSeq:NP_446034.1 UniGene:Rn.1256
            ProteinModelPortal:Q9EQT5 PRIDE:Q9EQT5 GeneID:94174 KEGG:rno:94174
            UCSC:RGD:70956 InParanoid:Q9EQT5 NextBio:617830 ArrayExpress:Q9EQT5
            Genevestigator:Q9EQT5 GermOnline:ENSRNOG00000013179 Uniprot:Q9EQT5
        Length = 467

 Score = 129 (50.5 bits), Expect = 1.9e-11, Sum P(2) = 1.9e-11
 Identities = 29/75 (38%), Positives = 41/75 (54%)

Query:   150 NQGSCGSCWAFSTVAAVEGINQIVS-GNLTS-LSEQELIDCDTSFNNGCNGGLMDYAFKY 207
             +QG+C   WAFST A       I S G++T  LS Q L+ CDT    GC GG +D A+ +
Sbjct:   221 DQGNCAGSWAFSTAAVASDRVSIHSLGHMTPILSPQNLLSCDTHHQKGCRGGRLDGAW-W 279

Query:   208 IVASGGLHKEEDYPY 222
              +   G+  +  YP+
Sbjct:   280 FLRRRGVVSDNCYPF 294

 Score = 99 (39.9 bits), Expect = 1.9e-11, Sum P(2) = 1.9e-11
 Identities = 31/95 (32%), Positives = 44/95 (46%)

Query:   250 NDEQSLLKALAHQ-PVSVAIEASGTDFQFYSGGVFT------GPCGAELDHGVAAV---G 299
             +DE+ ++K L    PV   +E    DF  Y  G+++      G       HG  +V   G
Sbjct:   348 SDEKEIMKELMENGPVQALMEVH-EDFFLYQRGIYSHTPVSQGRPEQYRRHGTHSVKITG 406

Query:   300 YGKSKGSD-----YIIVKNSWGPKWGERGYIRMKR 329
             +G+    D     Y    NSWGP WGERG+ R+ R
Sbjct:   407 WGEETLPDGRTIKYWTAANSWGPWWGERGHFRIVR 441


>UNIPROTKB|Q9EQT5 [details] [associations]
            symbol:Tinagl1 "Tubulointerstitial nephritis antigen-like"
            species:10116 "Rattus norvegicus" [GO:0005044 "scavenger receptor
            activity" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0006955 "immune response" evidence=IEA] [GO:0008234
            "cysteine-type peptidase activity" evidence=IEA] [GO:0030247
            "polysaccharide binding" evidence=IEA] InterPro:IPR000668
            InterPro:IPR001212 InterPro:IPR013128 Pfam:PF00112 PROSITE:PS00524
            PROSITE:PS50958 SMART:SM00201 SMART:SM00645 RGD:70956 GO:GO:0005737
            GO:GO:0005576 GO:GO:0006955 GO:GO:0030247 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00639
            GO:GO:0031012 GO:GO:0005044 eggNOG:NOG310046 HOGENOM:HOG000241342
            HOVERGEN:HBG053961 MEROPS:C01.975 CTD:64129 OrthoDB:EOG4BG8W0
            EMBL:AB050717 IPI:IPI00190428 RefSeq:NP_446034.1 UniGene:Rn.1256
            ProteinModelPortal:Q9EQT5 PRIDE:Q9EQT5 GeneID:94174 KEGG:rno:94174
            UCSC:RGD:70956 InParanoid:Q9EQT5 NextBio:617830 ArrayExpress:Q9EQT5
            Genevestigator:Q9EQT5 GermOnline:ENSRNOG00000013179 Uniprot:Q9EQT5
        Length = 467

 Score = 129 (50.5 bits), Expect = 1.9e-11, Sum P(2) = 1.9e-11
 Identities = 29/75 (38%), Positives = 41/75 (54%)

Query:   150 NQGSCGSCWAFSTVAAVEGINQIVS-GNLTS-LSEQELIDCDTSFNNGCNGGLMDYAFKY 207
             +QG+C   WAFST A       I S G++T  LS Q L+ CDT    GC GG +D A+ +
Sbjct:   221 DQGNCAGSWAFSTAAVASDRVSIHSLGHMTPILSPQNLLSCDTHHQKGCRGGRLDGAW-W 279

Query:   208 IVASGGLHKEEDYPY 222
              +   G+  +  YP+
Sbjct:   280 FLRRRGVVSDNCYPF 294

 Score = 99 (39.9 bits), Expect = 1.9e-11, Sum P(2) = 1.9e-11
 Identities = 31/95 (32%), Positives = 44/95 (46%)

Query:   250 NDEQSLLKALAHQ-PVSVAIEASGTDFQFYSGGVFT------GPCGAELDHGVAAV---G 299
             +DE+ ++K L    PV   +E    DF  Y  G+++      G       HG  +V   G
Sbjct:   348 SDEKEIMKELMENGPVQALMEVH-EDFFLYQRGIYSHTPVSQGRPEQYRRHGTHSVKITG 406

Query:   300 YGKSKGSD-----YIIVKNSWGPKWGERGYIRMKR 329
             +G+    D     Y    NSWGP WGERG+ R+ R
Sbjct:   407 WGEETLPDGRTIKYWTAANSWGPWWGERGHFRIVR 441


>WB|WBGene00000789 [details] [associations]
            symbol:cpz-2 species:6239 "Caenorhabditis elegans"
            [GO:0006508 "proteolysis" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR013128 Pfam:PF00112 PRINTS:PR00705 SMART:SM00645
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 PANTHER:PTHR12411
            PROSITE:PS00640 GeneTree:ENSGT00560000076599 KO:K08568 EMBL:Z81103
            HSSP:P80067 PIR:T23720 RefSeq:NP_506318.1 ProteinModelPortal:P92005
            SMR:P92005 STRING:P92005 MEROPS:C01.A41 PaxDb:P92005
            EnsemblMetazoa:M04G12.2 GeneID:179818 KEGG:cel:CELE_M04G12.2
            UCSC:M04G12.2 CTD:179818 WormBase:M04G12.2 eggNOG:NOG275763
            InParanoid:P92005 OMA:VEYWIAR NextBio:906990 Uniprot:P92005
        Length = 467

 Score = 182 (69.1 bits), Expect = 2.0e-11, P = 2.0e-11
 Identities = 60/202 (29%), Positives = 92/202 (45%)

Query:   146 TPVKNQGS---CGSCWAFSTVAAV-EGINQIVSGN--LTSLSEQELIDCDTSFNNGCNGG 199
             +P +NQ     CGSCW F T  A+ +  N    G   +T LS QE+IDC+   N  C GG
Sbjct:   237 SPTRNQHIPVYCGSCWVFGTTGALNDRFNVARKGRWPMTQLSPQEIIDCNGKGN--CQGG 294

Query:   200 LMDYAFKYIVASGGLHKEEDYPYLMEEGTCEDKKE-----EMEVVTISGYQDVPENDE-- 252
              +    ++    G L +E    Y    G C            E  +++ Y      D   
Sbjct:   295 EIGNVLEHAKIQG-LVEEGCNVYRATNGECNPYHRCGSCWPNECFSLTNYTRYYVKDYGQ 353

Query:   253 -QSLLKALAH----QPVSVAIEASGTDFQF-YSGGVFTGPCGAELDHGVAAVGYGKSK-G 305
              Q   K ++      P++ AI A+   F++ Y  GV++     E +H ++  G+G  + G
Sbjct:   354 VQGRDKIMSEIKKGGPIACAIGAT-KKFEYEYVKGVYSEKSDLESNHIISLTGWGVDENG 412

Query:   306 SDYIIVKNSWGPKWGERGYIRM 327
              +Y I +NSWG  WGE G+ R+
Sbjct:   413 VEYWIARNSWGEAWGELGWFRV 434


>UNIPROTKB|F1SVA2
            symbol:TINAGL1 "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0005615 "extracellular space" evidence=IDA] [GO:0043236
            "laminin binding" evidence=IEA] [GO:0031012 "extracellular matrix"
            evidence=IEA] [GO:0005737 "cytoplasm" evidence=IEA] [GO:0030247
            "polysaccharide binding" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0006955 "immune response"
            evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA] [GO:0005044
            "scavenger receptor activity" evidence=IEA] InterPro:IPR000668
            InterPro:IPR001212 InterPro:IPR013128 Pfam:PF00112 PROSITE:PS00524
            PROSITE:PS50958 SMART:SM00201 SMART:SM00645 GO:GO:0005737
            GO:GO:0005615 GO:GO:0006955 GO:GO:0030247 GO:GO:0008234
            GO:GO:0006508 InterPro:IPR025660 PANTHER:PTHR12411 PROSITE:PS00639
            GO:GO:0031012 GO:GO:0005044 GeneTree:ENSGT00560000076599
            OMA:DNCNRCT EMBL:CU856262 Ensembl:ENSSSCT00000003995 Uniprot:F1SVA2
        Length = 467

 Score = 129 (50.5 bits), Expect = 2.4e-11, Sum P(2) = 2.4e-11
 Identities = 32/96 (33%), Positives = 50/96 (52%)

Query:   131 KALPKSVDWRKK--GAVTPVKNQGSCGSCWAFSTVAAVEGINQIVS-GNLTS-LSEQELI 186
             + LP++ +  +K    +    +QG+C   WAFST A       I S G++T  LS Q L+
Sbjct:   201 EVLPRAFEASEKWPNLIHDPLDQGNCAGSWAFSTAAVASDRVSIHSLGHMTPVLSPQNLL 260

Query:   187 DCDTSFNNGCNGGLMDYAFKYIVASGGLHKEEDYPY 222
              CDT    GC GG +D A+ + +   G+  +  YP+
Sbjct:   261 SCDTHNQQGCQGGRLDGAW-WFLRRRGVVSDHCYPF 295

 Score = 98 (39.6 bits), Expect = 2.4e-11, Sum P(2) = 2.4e-11
 Identities = 30/99 (30%), Positives = 45/99 (45%)

Query:   250 NDEQSLLKALAHQ-PVSVAIEASGTDFQFYSGGVFT------GPCGAELDHGVAAV---G 299
             ++E+ ++K L    PV   +E    DF  Y  G+++      G       HG  +V   G
Sbjct:   348 SNEKDIMKELMENGPVQALMEVH-EDFFLYQSGIYSHTPVSHGRPERYRRHGTHSVKITG 406

Query:   300 YGKSKGSD-----YIIVKNSWGPKWGERGYIRMKRNTGK 333
             +G+    D     Y    NSWGP WGERG+ R+ R   +
Sbjct:   407 WGEETLPDGRMLKYWTAANSWGPGWGERGHFRIVRGANE 445


>ZFIN|ZDB-GENE-060503-240
            symbol:tinagl1 "tubulointerstitial nephritis
            antigen-like 1" species:7955 "Danio rerio" [GO:0005044 "scavenger
            receptor activity" evidence=IEA] [GO:0008234 "cysteine-type
            peptidase activity" evidence=IEA] [GO:0030247 "polysaccharide
            binding" evidence=IEA] [GO:0006508 "proteolysis" evidence=IEA]
            [GO:0030414 "peptidase inhibitor activity" evidence=IEA]
            [GO:0006955 "immune response" evidence=IEA] [GO:0002040 "sprouting
            angiogenesis" evidence=IMP] InterPro:IPR000668 InterPro:IPR001212
            InterPro:IPR008037 InterPro:IPR013128 Pfam:PF00112 Pfam:PF05375
            PROSITE:PS00524 PROSITE:PS50958 SMART:SM00201 SMART:SM00645
            ZFIN:ZDB-GENE-060503-240 GO:GO:0006955 GO:GO:0030247 GO:GO:0030414
            GO:GO:0008234 GO:GO:0006508 InterPro:IPR025661 InterPro:IPR025660
            PANTHER:PTHR12411 PROSITE:PS00640 PROSITE:PS00639 GO:GO:0002040
            GO:GO:0005044 GeneTree:ENSGT00560000076599 GO:GO:0010466
            SUPFAM:SSF57283 HOVERGEN:HBG053961 MEROPS:C01.975 OMA:DNCNRCT
            EMBL:BX950864 IPI:IPI00609339 UniGene:Dr.103937
            Ensembl:ENSDART00000087096 Ensembl:ENSDART00000126228
            InParanoid:Q1LUC6 Uniprot:Q1LUC6
        Length = 471

 Score = 137 (53.3 bits), Expect = 2.5e-11, Sum P(2) = 2.5e-11
 Identities = 47/144 (32%), Positives = 68/144 (47%)

Query:    90 WLGLN--EFADMSHEEFKNKYLGLKPQFPTR---RQPSAEFSYRDVKALPK---SVD-WR 140
             W   N  +F  M+ +E     LG K   PTR        + +      LP    +VD W 
Sbjct:   154 WRAANYSQFWGMTLDEGLRFRLGTKR--PTRTIMNMNEMQMNMNGNDHLPSYFNAVDKW- 210

Query:   141 KKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVS-GNLT-SLSEQELIDCDTSFNNGCNG 198
               G +    +QG+C + WAFST A       I S G++T  LS Q LI CDT   +GC G
Sbjct:   211 -PGKIHEPLDQGNCNASWAFSTAAVASDRISIQSMGHMTPQLSPQNLISCDTRHQDGCAG 269

Query:   199 GLMDYAFKYIVASGGLHKEEDYPY 222
             G +D A+ + +   G+  ++ YP+
Sbjct:   270 GRIDGAW-WFMRRRGVVTQDCYPF 292

 Score = 89 (36.4 bits), Expect = 2.5e-11, Sum P(2) = 2.5e-11
 Identities = 27/98 (27%), Positives = 40/98 (40%)

Query:   250 NDEQSLLKALAHQPVSVAIEASGTDFQFYSGGVFTG-------PCG--AELDHGVAAVGY 300
             N+ + + + + + PV   +E    DF  Y  G+F         P        H V   G+
Sbjct:   345 NENEIMKEIMDNGPVQAIMEVH-EDFFVYKSGIFRHTDVNYHKPSQYRKHATHSVRITGW 403

Query:   301 GKSKGSD-----YIIVKNSWGPKWGERGYIRMKRNTGK 333
             G+ +        Y I  NSWG  WGE GY R+ R   +
Sbjct:   404 GEERDYSGRTRKYWIGANSWGKNWGEDGYFRIARGVNE 441

WARNING:  HSPs involving 36 database sequences were not reported due to the
          limiting value of parameter B = 250.


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.315   0.134   0.409    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      350       326   0.00087  116 3  11 22  0.44    34
                                                     33  0.45    37
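
Note on the scores: the bit values shown in parentheses in the alignment headers appear to follow from the raw scores via the gapped parameters in the Q=9,R=2 row above (Lambda = 0.244, K = 0.0300), using the standard Karlin-Altschul normalization bits = (Lambda*S - ln K) / ln 2. The sketch below assumes that formula and that row. It also assumes the usual Poisson relation P = 1 - exp(-E) between the Expect and P values. The function names are illustrative and not part of the BLAST output.

```python
import math

# Gapped Karlin-Altschul parameters copied from the "Q=9,R=2" row above.
# Assumption: this row (not the ungapped BLOSUM62 row) produced the reported bits.
LAMBDA = 0.244
K = 0.0300


def bit_score(raw_score, lam=LAMBDA, k=K):
    """Convert a raw alignment score S to bits: (lambda*S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def p_from_expect(e_value):
    """Probability of at least one such hit given expectation E: 1 - exp(-E)."""
    # expm1 keeps precision when E is tiny, as it is for the hits in this report.
    return -math.expm1(-e_value)


if __name__ == "__main__":
    # Raw scores taken from the alignment headers in this section.
    for s in (182, 137, 129, 99, 98, 89):
        print(s, round(bit_score(s), 1))
    print(p_from_expect(2.0e-11))
```

For the raw scores 182 and 129 this gives 69.1 and 50.5 bits, matching the headers above. For Expect values as small as those reported here, P and Expect agree to the printed precision.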


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  286
  No. of states in DFA:  617 (66 KB)
  Total size of DFA:  254 KB (2136 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  29.85u 0.09s 29.94t   Elapsed:  00:00:02
  Total cpu time:  29.90u 0.09s 29.99t   Elapsed:  00:00:02
  Start:  Fri May 10 03:36:22 2013   End:  Fri May 10 03:36:24 2013
WARNINGS ISSUED:  2
